Query lcl|NC_020844.1_cdsid_YP_007673697.1 [gene=SLPG_00015] [protein=hypothetical protein] [protein_id=YP_007673697.1] [location=10772..12139] Match_columns 455 No_of_seqs 136 out of 206 Neff 8.4 Searched_HMMs 1612 Date Thu Nov 7 16:03:13 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_15 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_15_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:97265 Length: 513 100.0 8E-138 5E-141 772.2 52.1 444 1-455 1-475 (513) 2 protein:vir:95149 Length: 501 100.0 1E-133 9E-137 748.8 50.7 441 1-455 1-486 (501) 3 protein:vir:80453 Length: 535 100.0 3E-133 2E-136 747.2 50.4 443 1-455 32-511 (535) 4 protein:vir:95014 Length: 491 100.0 3E-126 2E-129 708.7 50.7 440 1-455 1-477 (491) 5 protein:vir:78393 Length: 489 100.0 2E-125 1E-128 704.7 50.2 440 1-455 1-476 (489) 6 protein:vir:94956 Length: 452 100.0 1E-122 7E-126 689.2 49.9 420 1-455 1-444 (452) 7 protein:vir:96783 Length: 488 100.0 1E-122 7E-126 689.1 48.2 434 1-453 14-488 (488) 8 protein:vir:94805 Length: 492 100.0 3.1E-49 2E-52 286.5 34.6 410 1-455 37-468 (492) 9 protein:vir:106571 Length: 499 100.0 3.6E-49 2.2E-52 286.2 33.8 420 1-455 13-457 (499) 10 protein:vir:97336 Length: 492 100.0 5.2E-49 3.2E-52 285.3 33.5 411 1-455 37-468 (492) 11 protein:vir:102950 Length: 471 100.0 6.7E-48 4.1E-51 279.2 35.8 421 1-455 1-457 (471) 12 protein:vir:1236 Length: 483 # 100.0 3.8E-48 2.3E-51 280.6 34.2 411 1-455 29-459 (483) 13 protein:vir:93747 Length: 472 100.0 5.5E-48 3.4E-51 279.7 34.6 411 1-455 18-448 (472) 14 protein:vir:106639 Length: 481 100.0 2.7E-47 1.7E-50 275.9 38.3 412 1-455 23-460 (481) 15 protein:vir:102330 Length: 451 100.0 6.4E-48 4E-51 279.3 34.7 423 1-455 1-441 (451) 16 protein:vir:96179 Length: 468 100.0 7.5E-47 4.7E-50 273.5 36.4 411 1-455 1-455 (468) 17 protein:vir:5961 Length: 503 # 100.0 7.1E-47 4.4E-50 273.6 36.2 416 1-455 1-470 (503) 18 protein:vir:105292 Length: 478 100.0 6.2E-47 3.9E-50 273.9 35.9 410 1-455 1-456 (478) 19 protein:vir:107112 Length: 478 100.0 4.9E-47 3.1E-50 274.5 34.8 413 1-455 1-456 (478) 20 protein:vir:3964 Length: 453 # 100.0 1.2E-46 7.5E-50 272.3 35.6 407 1-455 11-432 (453) 21 protein:vir:94498 Length: 474 100.0 1.4E-46 8.7E-50 272.0 35.9 407 1-455 7-450 (474) 22 protein:vir:97447 Length: 474 100.0 1.4E-46 8.7E-50 272.0 35.9 407 1-455 7-450 (474) 23 protein:vir:95113 Length: 474 100.0 3.1E-46 1.9E-49 270.1 37.6 410 1-455 7-450 (474) 24 protein:vir:9871 Length: 429 # 100.0 2E-46 1.2E-49 271.1 35.7 406 1-455 1-414 (429) 25 protein:vir:96240 Length: 511 100.0 5.2E-46 3.2E-49 268.9 37.6 415 1-455 31-482 (511) 26 protein:vir:3609 Length: 452 # 100.0 4.6E-46 2.9E-49 269.1 37.2 407 1-455 11-431 (452) 27 protein:vir:103951 Length: 511 100.0 6.4E-46 4E-49 268.3 38.0 415 1-455 31-482 (511) 28 protein:vir:96266 Length: 474 100.0 1.3E-46 8.4E-50 272.1 34.0 410 1-455 7-450 (474) 29 protein:vir:95899 Length: 474 100.0 1.3E-46 8.4E-50 272.1 34.0 410 1-455 7-450 (474) 30 protein:vir:9306 Length: 511 # 100.0 4.9E-46 3.1E-49 269.0 37.0 415 1-455 31-482 (511) 31 protein:vir:99781 Length: 511 100.0 4.6E-46 2.9E-49 269.1 35.7 415 1-455 31-482 (511) 32 protein:vir:96366 Length: 511 100.0 1.3E-45 7.9E-49 266.7 36.9 415 1-455 31-482 (511) 33 protein:vir:78805 Length: 511 100.0 1.3E-45 7.9E-49 266.7 36.9 415 1-455 31-482 (511) 34 protein:vir:97171 Length: 512 100.0 3.8E-46 2.4E-49 269.6 33.9 415 1-455 31-483 (512) 35 protein:vir:105461 Length: 470 100.0 6.9E-46 4.3E-49 268.2 35.0 419 1-455 1-454 (470) 36 protein:vir:733 Length: 453 # 100.0 2.9E-45 1.8E-48 264.7 36.7 407 1-455 11-437 (453) 37 protein:vir:99522 Length: 470 100.0 1.4E-45 8.6E-49 266.5 34.3 410 1-455 19-450 (470) 38 protein:vir:105889 Length: 474 100.0 7.3E-46 4.5E-49 268.1 32.5 407 1-455 10-455 (474) 39 protein:vir:94101 Length: 474 100.0 7.3E-46 4.5E-49 268.1 32.5 407 1-455 10-455 (474) 40 protein:vir:96839 Length: 474 100.0 3.4E-45 2.1E-48 264.4 36.0 411 1-455 1-454 (474) 41 protein:vir:79043 Length: 479 100.0 2.7E-45 1.6E-48 265.0 34.5 413 1-455 7-464 (479) 42 protein:vir:2427 Length: 485 # 100.0 5.1E-45 3.2E-48 263.4 36.0 418 1-455 13-451 (485) 43 protein:vir:95806 Length: 440 100.0 4.7E-45 2.9E-48 263.6 35.5 406 8-455 1-424 (440) 44 protein:vir:2732 Length: 501 # 100.0 2E-44 1.3E-47 260.1 37.0 409 1-455 30-479 (501) 45 protein:vir:104082 Length: 485 100.0 6.6E-45 4.1E-48 262.8 33.6 419 1-455 8-465 (485) 46 protein:vir:99072 Length: 479 100.0 3.7E-44 2.3E-47 258.7 37.5 415 1-455 4-440 (479) 47 protein:vir:96494 Length: 501 100.0 2.3E-44 1.4E-47 259.8 35.9 409 1-455 30-476 (501) 48 protein:vir:4223 Length: 486 # 100.0 3.8E-44 2.4E-47 258.6 36.7 418 1-455 8-451 (486) 49 protein:vir:2341 Length: 488 # 100.0 3.8E-44 2.4E-47 258.6 36.3 423 1-455 1-458 (488) 50 protein:vir:7768 Length: 484 # 100.0 7.3E-44 4.5E-47 257.1 37.0 421 1-455 1-454 (484) 51 protein:vir:4898 Length: 502 # 100.0 7.7E-44 4.8E-47 256.9 36.8 409 1-455 31-473 (502) 52 protein:vir:80680 Length: 441 100.0 2.3E-43 1.4E-46 254.3 37.5 409 1-455 1-426 (441) 53 protein:vir:94546 Length: 506 100.0 7.5E-44 4.6E-47 257.0 32.9 415 1-455 22-481 (506) 54 protein:vir:78227 Length: 480 100.0 2.4E-43 1.5E-46 254.3 35.2 418 1-455 1-446 (480) 55 protein:vir:78537 Length: 480 100.0 3.1E-43 1.9E-46 253.7 35.2 418 1-455 1-442 (480) 56 protein:vir:9922 Length: 489 # 100.0 7.9E-43 4.9E-46 251.4 37.3 420 1-455 1-471 (489) 57 protein:vir:2500 Length: 501 # 100.0 3.2E-42 2E-45 248.1 35.7 404 1-455 18-468 (501) 58 protein:vir:98444 Length: 434 100.0 5E-42 3.1E-45 247.0 34.0 381 40-455 1-419 (434) 59 protein:vir:9751 Length: 422 # 100.0 1.7E-41 1.1E-44 244.1 35.9 404 1-452 1-422 (422) 60 protein:vir:78083 Length: 537 100.0 1.1E-41 6.9E-45 245.1 34.5 414 1-455 1-479 (537) 61 protein:vir:105819 Length: 456 100.0 2.5E-41 1.5E-44 243.2 34.3 408 1-455 1-445 (456) 62 protein:vir:102602 Length: 456 100.0 2.5E-41 1.5E-44 243.2 34.3 408 1-455 1-445 (456) 63 protein:vir:9568 Length: 410 # 100.0 5.2E-40 3.2E-43 235.9 37.5 391 13-455 1-410 (410) 64 protein:vir:99916 Length: 504 100.0 3.9E-40 2.4E-43 236.6 36.3 414 1-455 18-469 (504) 65 protein:vir:7987 Length: 456 # 100.0 2.5E-40 1.6E-43 237.7 33.7 408 1-455 1-445 (456) 66 protein:vir:94742 Length: 409 100.0 1.4E-39 8.5E-43 233.6 35.0 390 1-435 1-409 (409) 67 protein:vir:8184 Length: 474 # 100.0 3.1E-39 1.9E-42 231.7 34.8 414 1-455 12-462 (474) 68 protein:vir:1634 Length: 409 # 100.0 1.2E-38 7.3E-42 228.5 33.7 391 1-435 1-409 (409) 69 protein:vir:38 Length: 496 # N 100.0 1.2E-29 7.5E-33 179.1 41.8 416 1-455 16-485 (496) 70 protein:vir:80959 Length: 499 100.0 5.7E-29 3.5E-32 175.4 42.1 416 1-455 16-488 (499) 71 protein:vir:101494 Length: 527 100.0 1.1E-28 6.9E-32 173.8 33.6 410 1-455 21-495 (527) 72 protein:vir:102239 Length: 527 100.0 1.3E-28 8.1E-32 173.5 33.5 410 1-455 21-495 (527) 73 protein:vir:79703 Length: 505 99.9 1.5E-26 9.2E-30 162.2 39.6 417 1-455 20-498 (505) 74 protein:vir:1587 Length: 508 # 99.9 1.8E-24 1.1E-27 150.8 39.8 416 1-455 22-495 (508) 75 protein:vir:3028 Length: 500 # 99.9 4.7E-24 2.9E-27 148.5 39.8 419 1-455 18-490 (500) 76 protein:vir:9815 Length: 500 # 99.9 4.7E-24 2.9E-27 148.5 39.8 419 1-455 18-490 (500) 77 protein:vir:7430 Length: 563 # 99.9 1.7E-25 1.1E-28 156.4 31.1 422 1-455 9-533 (563) 78 protein:vir:4782 Length: 522 # 99.9 6.8E-22 4.2E-25 136.6 41.2 420 1-455 18-503 (522) 79 protein:vir:78907 Length: 518 99.9 7.1E-20 4.4E-23 125.6 38.6 417 1-455 1-506 (518) 80 protein:vir:98883 Length: 517 99.8 7.3E-19 4.6E-22 120.0 36.0 417 1-455 18-501 (517) 81 protein:vir:104437 Length: 714 99.3 1.5E-10 9.1E-14 74.5 34.1 428 1-455 1-606 (714) 82 protein:vir:80040 Length: 461 99.2 6.9E-11 4.3E-14 76.3 27.0 413 1-455 1-457 (461) 83 protein:vir:93630 Length: 776 99.2 2.2E-10 1.4E-13 73.5 28.8 426 1-455 23-615 (776) 84 protein:vir:105619 Length: 772 99.2 5.8E-10 3.6E-13 71.2 34.1 427 1-455 14-611 (772) 85 protein:vir:10117 Length: 714 99.2 8.1E-10 5E-13 70.4 36.8 426 1-455 8-606 (714) 86 protein:vir:3296 Length: 714 # 99.2 8.1E-10 5E-13 70.4 36.8 426 1-455 8-606 (714) 87 protein:vir:817 Length: 714 # 99.2 8.1E-10 5E-13 70.4 36.8 426 1-455 8-606 (714) 88 protein:vir:2764 Length: 714 # 99.2 8.1E-10 5E-13 70.4 36.8 426 1-455 8-606 (714) 89 protein:vir:9950 Length: 714 # 99.2 8.1E-10 5E-13 70.4 36.8 426 1-455 8-606 (714) 90 protein:vir:94599 Length: 641 99.1 1.5E-09 9.1E-13 69.0 28.4 434 1-455 5-599 (641) 91 protein:vir:80165 Length: 651 99.0 4.5E-09 2.8E-12 66.3 36.3 429 1-455 3-590 (651) 92 protein:vir:3420 Length: 533 # 99.0 6.7E-09 4.2E-12 65.4 27.9 412 1-455 1-507 (533) 93 protein:vir:105429 Length: 708 98.9 1.5E-08 9.1E-12 63.5 31.9 432 1-455 1-595 (708) 94 protein:vir:79063 Length: 491 98.9 2E-08 1.2E-11 62.8 32.9 389 1-455 7-411 (491) 95 protein:vir:389 Length: 530 # 98.9 2E-08 1.3E-11 62.8 29.6 407 1-455 1-504 (530) 96 protein:vir:2198 Length: 536 # 98.9 2.1E-08 1.3E-11 62.6 32.7 420 1-455 1-503 (536) 97 protein:vir:108295 Length: 711 98.9 2.3E-08 1.4E-11 62.5 35.6 434 1-455 1-614 (711) 98 protein:vir:10321 Length: 495 98.9 2.4E-08 1.5E-11 62.4 30.1 412 1-455 1-473 (495) 99 protein:vir:107880 Length: 491 98.9 2.5E-08 1.6E-11 62.2 30.8 381 1-455 1-408 (491) 100 protein:vir:77597 Length: 725 98.8 3E-08 1.9E-11 61.8 33.4 432 1-455 1-583 (725) 101 protein:vir:79538 Length: 502 98.8 3.7E-08 2.3E-11 61.3 33.9 410 1-455 1-473 (502) 102 protein:vir:99232 Length: 526 98.8 5.1E-08 3.1E-11 60.6 33.7 389 1-455 1-428 (526) 103 protein:vir:105520 Length: 706 98.8 5.2E-08 3.2E-11 60.5 32.0 431 1-455 1-594 (706) 104 protein:vir:108215 Length: 469 98.8 5.6E-08 3.4E-11 60.4 33.6 400 1-455 1-444 (469) 105 protein:vir:103860 Length: 528 98.8 6.1E-08 3.8E-11 60.1 32.5 387 1-455 1-425 (528) 106 protein:vir:8883 Length: 543 # 98.7 6.9E-08 4.3E-11 59.8 30.6 421 1-455 1-503 (543) 107 protein:vir:79233 Length: 526 98.7 9.4E-08 5.9E-11 59.1 34.5 393 1-455 1-430 (526) 108 protein:vir:94709 Length: 522 98.7 1E-07 6.5E-11 58.9 33.3 418 1-455 1-498 (522) 109 protein:vir:100920 Length: 725 98.7 1.2E-07 7.5E-11 58.5 31.1 432 1-455 1-583 (725) 110 protein:vir:1538 Length: 535 # 98.7 1.4E-07 8.5E-11 58.2 33.3 415 1-455 1-501 (535) 111 protein:vir:99672 Length: 532 98.6 1.7E-07 1E-10 57.8 30.3 421 1-455 1-504 (532) 112 protein:vir:100039 Length: 522 98.6 2.3E-07 1.4E-10 57.0 29.7 401 1-455 1-488 (522) 113 protein:vir:104338 Length: 422 98.6 2.6E-07 1.6E-10 56.7 23.7 390 1-455 1-419 (422) 114 protein:vir:10447 Length: 536 98.6 2.6E-07 1.6E-10 56.7 35.3 420 1-455 1-503 (536) 115 protein:vir:3361 Length: 535 # 98.6 2.8E-07 1.7E-10 56.5 32.9 417 1-455 1-501 (535) 116 protein:vir:8846 Length: 705 # 98.5 3.1E-07 1.9E-10 56.3 34.2 435 1-455 1-598 (705) 117 protein:vir:95315 Length: 559 98.5 3.1E-07 1.9E-10 56.3 31.6 417 1-455 1-516 (559) 118 protein:vir:7321 Length: 556 # 98.5 3.2E-07 2E-10 56.2 31.7 416 1-455 1-535 (556) 119 protein:vir:79647 Length: 435 98.5 1.9E-07 1.2E-10 57.5 21.6 401 1-455 5-435 (435) 120 protein:vir:172 Length: 708 # 98.5 3.7E-07 2.3E-10 55.9 34.4 432 1-455 1-595 (708) 121 protein:vir:99853 Length: 488 98.5 3.8E-07 2.3E-10 55.8 32.9 381 8-455 1-403 (488) 122 protein:vir:107404 Length: 555 98.5 3.9E-07 2.4E-10 55.7 35.1 417 1-455 1-516 (555) 123 protein:vir:98506 Length: 555 98.5 3.9E-07 2.4E-10 55.7 35.1 417 1-455 1-516 (555) 124 protein:vir:107822 Length: 555 98.5 3.9E-07 2.4E-10 55.7 35.1 417 1-455 1-516 (555) 125 protein:vir:107742 Length: 537 98.5 4.2E-07 2.6E-10 55.6 24.0 406 1-455 48-507 (537) 126 protein:vir:96738 Length: 505 98.5 4.8E-07 3E-10 55.2 35.6 416 1-455 8-483 (505) 127 protein:vir:95449 Length: 584 98.5 4.9E-07 3.1E-10 55.2 27.6 435 1-454 1-584 (584) 128 protein:vir:103765 Length: 549 98.4 6.5E-07 4E-10 54.5 34.8 417 1-455 1-536 (549) 129 protein:vir:1986 Length: 512 # 98.4 1E-06 6.2E-10 53.5 31.7 383 1-455 1-426 (512) 130 protein:vir:78161 Length: 355 98.3 1.8E-06 1.1E-09 52.0 26.3 277 156-455 1-315 (355) 131 protein:vir:3520 Length: 720 # 98.2 2.2E-06 1.3E-09 51.7 30.9 435 1-455 1-594 (720) 132 protein:vir:107662 Length: 427 98.2 2.8E-06 1.7E-09 51.0 25.4 390 1-455 1-421 (427) 133 protein:vir:1785 Length: 555 # 98.2 3.1E-06 1.9E-09 50.8 30.6 409 1-455 1-512 (555) 134 protein:vir:95542 Length: 548 98.2 3.2E-06 2E-09 50.7 30.2 417 1-455 1-480 (548) 135 protein:vir:9263 Length: 725 # 98.1 3.8E-06 2.4E-09 50.3 33.3 434 1-455 1-583 (725) 136 protein:vir:102668 Length: 547 98.1 4E-06 2.5E-09 50.2 35.2 413 1-455 1-527 (547) 137 protein:vir:96068 Length: 765 98.1 4.2E-06 2.6E-09 50.1 25.8 397 1-455 71-508 (765) 138 protein:vir:5249 Length: 437 # 98.0 6.3E-06 3.9E-09 49.1 26.7 391 1-455 1-421 (437) 139 protein:vir:6382 Length: 553 # 98.0 6.7E-06 4.1E-09 49.0 32.1 412 1-455 1-521 (553) 140 protein:vir:105641 Length: 516 98.0 6.7E-06 4.2E-09 48.9 30.4 415 1-455 1-511 (516) 141 protein:vir:98816 Length: 446 98.0 7.1E-06 4.4E-09 48.8 27.2 403 1-440 3-446 (446) 142 protein:vir:96988 Length: 516 98.0 7.2E-06 4.5E-09 48.8 29.8 413 1-455 1-495 (516) 143 protein:vir:95821 Length: 763 97.9 1.1E-05 6.7E-09 47.8 31.3 437 1-455 9-620 (763) 144 protein:vir:80211 Length: 514 97.9 1.1E-05 7E-09 47.7 29.9 402 1-455 1-494 (514) 145 protein:vir:345 Length: 663 # 97.7 2.6E-05 1.6E-08 45.7 24.9 422 1-455 1-528 (663) 146 protein:vir:94572 Length: 535 97.7 3E-05 1.8E-08 45.4 33.8 418 1-455 1-507 (535) 147 protein:vir:99563 Length: 862 97.6 3.9E-05 2.4E-08 44.8 22.4 415 1-455 66-544 (862) 148 protein:vir:7017 Length: 515 # 97.6 3.9E-05 2.4E-08 44.7 30.9 417 1-455 1-498 (515) 149 protein:vir:95254 Length: 488 97.4 7.3E-05 4.5E-08 43.3 30.8 410 1-455 1-462 (488) 150 protein:vir:78942 Length: 510 97.4 8.5E-05 5.2E-08 42.9 32.4 407 1-455 1-502 (510) 151 protein:vir:103330 Length: 517 97.2 0.00012 7.5E-08 42.1 32.5 410 1-455 1-497 (517) 152 protein:vir:78696 Length: 542 97.2 0.00014 9E-08 41.6 33.5 403 1-455 1-503 (542) 153 protein:vir:94049 Length: 532 97.0 0.0002 1.2E-07 40.8 28.6 408 1-455 1-499 (532) 154 protein:vir:77981 Length: 448 96.6 0.0005 3.1E-07 38.7 27.5 386 1-455 1-427 (448) 155 protein:vir:3843 Length: 397 # 96.1 0.0011 6.5E-07 36.9 25.4 365 1-455 1-385 (397) 156 protein:vir:6322 Length: 510 # 96.0 0.0011 7E-07 36.7 33.9 404 1-455 1-498 (510) 157 protein:vir:107605 Length: 432 95.8 0.0015 9.4E-07 36.0 26.0 375 17-455 1-430 (432) 158 protein:vir:105002 Length: 432 95.8 0.0015 9.4E-07 36.0 26.0 375 17-455 1-430 (432) 159 protein:vir:102855 Length: 432 95.8 0.0015 9.4E-07 36.0 26.0 375 17-455 1-430 (432) 160 protein:vir:3139 Length: 599 # 95.6 0.0018 1.1E-06 35.6 21.0 426 1-455 1-587 (599) 161 protein:vir:4952 Length: 386 # 95.5 0.002 1.2E-06 35.4 30.5 367 1-455 1-383 (386) 162 protein:vir:79511 Length: 448 95.3 0.0024 1.5E-06 35.0 28.9 394 1-455 1-438 (448) 163 protein:vir:7407 Length: 392 # 95.0 0.003 1.9E-06 34.4 30.0 365 1-455 3-389 (392) 164 protein:vir:79772 Length: 648 94.9 0.0032 2E-06 34.3 30.2 392 1-455 43-484 (648) 165 protein:vir:4194 Length: 540 # 94.8 0.0034 2.1E-06 34.1 33.9 388 1-455 6-453 (540) 166 protein:vir:100882 Length: 383 94.7 0.0038 2.4E-06 33.9 26.6 364 1-455 1-383 (383) 167 protein:vir:7853 Length: 518 # 93.5 0.0075 4.6E-06 32.2 28.5 379 1-455 1-431 (518) 168 protein:vir:1266 Length: 416 # 93.2 0.0083 5.1E-06 32.0 23.5 363 71-455 1-415 (416) 169 protein:vir:3153 Length: 467 # 92.8 0.0099 6.2E-06 31.6 30.0 365 52-455 1-460 (467) 170 protein:vir:81072 Length: 432 91.7 0.014 9E-06 30.7 29.1 381 1-455 1-425 (432) 171 protein:vir:4156 Length: 542 # 91.6 0.015 9.3E-06 30.6 33.3 391 1-455 6-451 (542) 172 protein:vir:101648 Length: 518 91.2 0.017 1.1E-05 30.3 31.2 377 1-455 3-431 (518) 173 protein:vir:3989 Length: 392 # 90.8 0.019 1.2E-05 30.0 30.1 367 1-455 3-389 (392) 174 protein:vir:1023 Length: 392 # 90.8 0.019 1.2E-05 30.0 30.1 367 1-455 3-389 (392) 175 protein:vir:63755 Length: 547 90.6 0.02 1.3E-05 29.9 29.3 396 1-455 11-502 (547) 176 protein:vir:4854 Length: 386 # 90.6 0.02 1.3E-05 29.9 29.9 367 1-455 1-382 (386) 177 protein:vir:81152 Length: 411 89.5 0.026 1.6E-05 29.3 26.7 366 1-454 1-411 (411) 178 protein:vir:4509 Length: 424 # 87.6 0.038 2.4E-05 28.4 25.5 375 16-455 1-419 (424) 179 protein:vir:100187 Length: 385 86.7 0.043 2.7E-05 28.1 27.9 358 1-455 1-382 (385) 180 protein:vir:960 Length: 413 # 85.6 0.052 3.2E-05 27.6 31.1 376 1-455 1-412 (413) 181 protein:vir:102080 Length: 429 85.5 0.052 3.2E-05 27.6 29.1 384 1-455 1-427 (429) 182 protein:vir:105782 Length: 449 85.5 0.052 3.2E-05 27.6 21.3 392 1-453 1-449 (449) 183 protein:vir:80644 Length: 551 85.2 0.055 3.4E-05 27.5 28.2 391 1-455 18-500 (551) 184 protein:vir:4828 Length: 382 # 83.8 0.066 4.1E-05 27.1 30.4 364 1-455 1-378 (382) 185 protein:vir:105064 Length: 421 83.4 0.069 4.3E-05 27.0 28.1 374 1-455 1-416 (421) 186 protein:vir:10362 Length: 432 83.2 0.07 4.4E-05 26.9 29.6 381 1-455 1-423 (432) 187 protein:vir:8418 Length: 409 # 82.8 0.074 4.6E-05 26.8 28.8 369 1-455 1-406 (409) 188 protein:vir:102118 Length: 409 82.4 0.077 4.8E-05 26.7 27.6 379 1-455 1-409 (409) 189 protein:vir:189 Length: 424 # 81.2 0.088 5.5E-05 26.4 27.6 381 1-453 1-424 (424) 190 protein:vir:5737 Length: 419 # 80.7 0.092 5.7E-05 26.3 26.2 373 1-455 1-410 (419) 191 protein:vir:3868 Length: 417 # 80.4 0.095 5.9E-05 26.2 28.9 369 1-455 1-403 (417) 192 protein:vir:93610 Length: 454 77.7 0.12 7.6E-05 25.6 30.4 368 23-455 1-419 (454) 193 protein:vir:1884 Length: 424 # 76.3 0.14 8.5E-05 25.3 28.5 380 1-455 1-422 (424) 194 protein:vir:103219 Length: 201 74.3 0.16 9.9E-05 24.9 14.0 185 218-455 1-201 (201) 195 protein:vir:1431 Length: 419 # 74.2 0.16 0.0001 24.9 27.6 371 1-455 7-413 (419) 196 protein:vir:9359 Length: 348 # 73.8 0.17 0.0001 24.9 27.7 321 72-455 1-347 (348) 197 protein:vir:97060 Length: 432 70.8 0.2 0.00013 24.4 30.2 374 1-455 1-423 (432) 198 protein:vir:6240 Length: 457 # 70.0 0.21 0.00013 24.2 25.2 387 1-455 1-436 (457) 199 protein:vir:95378 Length: 406 68.5 0.24 0.00015 24.0 29.1 363 1-455 1-399 (406) 200 protein:vir:1380 Length: 422 # 67.8 0.25 0.00015 23.9 29.1 383 1-455 1-419 (422) 201 protein:vir:81095 Length: 416 67.4 0.25 0.00016 23.9 28.4 369 1-455 1-414 (416) 202 protein:vir:4598 Length: 416 # 67.4 0.25 0.00016 23.9 28.4 369 1-455 1-414 (416) 203 protein:vir:96579 Length: 576 66.2 0.27 0.00017 23.7 21.5 390 1-455 27-511 (576) 204 protein:vir:78641 Length: 278 63.6 0.31 0.00019 23.3 21.6 262 72-400 1-278 (278) 205 protein:vir:9408 Length: 441 # 60.6 0.37 0.00023 23.0 25.8 370 1-455 23-439 (441) 206 protein:vir:79984 Length: 441 60.6 0.37 0.00023 23.0 25.8 370 1-455 23-439 (441) 207 protein:vir:105154 Length: 525 56.8 0.45 0.00028 22.5 12.8 411 1-455 1-484 (525) 208 protein:vir:101647 Length: 460 55.0 0.49 0.0003 22.3 29.9 386 1-455 1-458 (460) 209 protein:vir:483 Length: 413 # 52.1 0.56 0.00035 22.0 28.1 380 1-455 1-410 (413) 210 protein:vir:98396 Length: 441 51.0 0.59 0.00037 21.8 28.5 373 1-455 23-439 (441) 211 protein:vir:80333 Length: 419 49.7 0.63 0.00039 21.7 28.8 364 1-455 1-403 (419) 212 protein:vir:1082 Length: 359 # 46.5 0.73 0.00045 21.3 27.1 347 1-432 1-359 (359) 213 protein:vir:4337 Length: 434 # 45.3 0.78 0.00048 21.2 28.4 381 1-455 1-434 (434) 214 protein:vir:8317 Length: 409 # 44.5 0.8 0.0005 21.1 24.3 362 1-453 13-409 (409) 215 protein:vir:4454 Length: 414 # 43.2 0.85 0.00053 21.0 29.6 371 1-455 1-410 (414) 216 protein:vir:96980 Length: 409 42.2 0.9 0.00056 20.8 26.8 379 1-455 1-408 (409) 217 protein:vir:81218 Length: 423 42.0 0.9 0.00056 20.8 28.5 378 1-455 1-420 (423) 218 protein:vir:100150 Length: 437 39.2 1 0.00064 20.5 32.3 373 1-455 1-423 (437) 219 protein:vir:2683 Length: 412 # 38.3 1.1 0.00067 20.4 26.4 382 1-455 1-411 (412) 220 protein:vir:4995 Length: 384 # 37.4 1.1 0.0007 20.3 29.9 366 1-455 1-380 (384) 221 protein:vir:9702 Length: 406 # 32.5 1.4 0.00088 19.8 27.2 371 1-455 1-403 (406) 222 protein:vir:94426 Length: 409 31.8 1.5 0.00091 19.7 25.6 377 1-455 1-408 (409) 223 protein:vir:95965 Length: 385 28.6 1.7 0.0011 19.3 23.7 353 1-455 1-384 (385) 224 protein:vir:80134 Length: 403 23.6 2.3 0.0014 18.6 25.6 363 1-455 1-403 (403) 225 protein:vir:99452 Length: 651 22.7 2.4 0.0015 18.5 24.9 422 1-455 1-524 (651) 226 protein:vir:102727 Length: 945 21.0 2.7 0.0017 18.2 31.6 380 1-455 71-528 (945) No 1 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=100.00 E-value=8.1e-138 Score=772.17 Aligned_cols=444 Identities=25% Similarity=0.440 Sum_probs=411.8 Q ss_pred CCCC---CCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhh Q lcl|NC_020844. 1 MPEQ---DPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPF 77 (455) Q Consensus 1 m~~~---~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf 77 (455) |++. +|+++||+|.+++|+|++|||||+|+++||++|++||||+++|++++|++||+||+|+|+|++||++|+|++| T Consensus 1 m~~~~~~~v~~~h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vf 80 (513) T protein:vir:97 1 MADKDPKSPATTSGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQTLDTLSGKPF 80 (513) T ss_pred CCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhhhhh Confidence 9987 4899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCceec-cCCHHHHH-HHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCc-----chhhhhHHhccCCceEEE Q lcl|NC_020844. 78 GREVEVT-EGTGPSED-LLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQ-----FRNREEERQAGVRPYWSQ 150 (455) Q Consensus 78 ~k~p~i~-~~~~~l~~-~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~-----~~t~ad~~~~~~rPy~~~ 150 (455) +|||+++ ..|+.+.. |++|||++|++|++|++++++++|.+|+||+|||||..+. ++|+||+++.++|||++. T Consensus 81 ~k~p~~~~~~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~~~~~rPy~~~ 160 (513) T protein:vir:97 81 SEPIKLNEDVPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLADDRREGLRPYWVM 160 (513) T ss_pred hcCcccCcCchHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHHHHHhhccCceEEE Confidence 9999996 36788875 8899999999999999999999999999999999997643 689999999999999999 Q ss_pred echhhhccceeeccCCeeEEEEEEEEeee-----------eEEEEEecCeEEEEEEeCCc-----EEEEeccCCCCCCce Q lcl|NC_020844. 151 VNHSAILEIQSERQGAGERITYMRMWEDA-----------ETILVLRPDDWERWRKNDAG-----IWEVDEFGRITIGVV 214 (455) Q Consensus 151 ~~~~~ii~w~~~~~~~~~~l~~v~~~E~~-----------e~~~v~~~~~~~~~~~~~~g-----~~~~~~~g~~~l~~v 214 (455) |+|++||||++++++|+.+|++++++|++ ++||||+++.|++||...++ +|.+++++.|+|++| T Consensus 161 ~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~~~~~~~~~~e~~~~~~g~~~l~~I 240 (513) T protein:vir:97 161 IKPECLLFARSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEPVKKSNAQKEEWALADEWATGLNYV 240 (513) T ss_pred ecHhhhcCcceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEEEeCceEEEEEeecCCCccccceEEecCCCCcCCce Confidence 99999999999999999999999999863 57999999999999976443 789999999999999 Q ss_pred eEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccC Q lcl|NC_020844. 215 PMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPP 294 (455) Q Consensus 215 P~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~ 294 (455) |||||. ..++++.++.|||++||++|++|||++|||++++|++++|++|++|++.++. +++.+|++++|.+|. T Consensus 241 P~v~~~--~~~~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G~~~~~~-----~~i~iG~~~~~~lpe 313 (513) T protein:vir:97 241 PLVTFY--ADRQGFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACSGASGEDS-----DPVVVGPNKVLYNPD 313 (513) T ss_pred eEEEEe--cCCCCCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeeecCCcCCC-----CceEeeccccccCCC Confidence 999884 5567899999999999999999999999999999999999999999987653 368999999999994 Q ss_pred CCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 295 NAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALY 374 (455) Q Consensus 295 ~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~ 374 (455) .+++++|+||+|+++++++++|+++++||+++|++++..+++++||+++++++++.+|+|+.++.+|++|++++|+ T Consensus 314 ----~~~~~~yie~~g~~i~~~~~~l~~le~qm~~~Ga~ll~~~~~~~Ta~a~~~~~~~~~S~L~~~a~~le~al~~~l~ 389 (513) T protein:vir:97 314 ----PAGRFYYVEHTGQAIAAGRTDLKDLEEQMAGYGAEFLKRKTGGQTATARALDSAEATSDLSAMTGLFEDALAQALD 389 (513) T ss_pred ----CCCcceeeccCchhHHHHHHHHHHHHHHHHHHHHHhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2468999999999999999999999999999999999988889999999999999999999999999999999999 Q ss_pred HHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcc---- Q lcl|NC_020844. 375 ITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELP---- 450 (455) Q Consensus 375 ~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~---- 450 (455) |||+|+|.++.+++|+||+||....+++++++++++++++|.||++|++++|+|+|||++++|++++.++++++.+ T Consensus 390 ~~a~wlg~~~~~~~v~in~dF~~~~~~~~~~~al~~a~~~G~is~~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~~~~~~ 469 (513) T protein:vir:97 390 ITADWLRLGPNGGTVELVKDYDLEEMDAPGLQALQVAREKRDISRKTYLNGLRLRGVLPEDFDEDEDWEELMEEISEAMG 469 (513) T ss_pred HHHHHhCCCCCccEEEeccccCcccCCHHHHHHHHHHHhCCCCCHHHHHHHHHhccCCCccCCHHHHHHHHHHhhhhccC Confidence 9999999877678999999999999999999999999999999999999999999999999997777777766543 Q ss_pred -cCCCC Q lcl|NC_020844. 451 -GGIEE 455 (455) Q Consensus 451 -~~l~~ 455 (455) ++++. T Consensus 470 ~~~~d~ 475 (513) T protein:vir:97 470 RAGLDL 475 (513) T ss_pred CCCccc Confidence 23332 No 2 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=100.00 E-value=1.5e-133 Score=748.83 Aligned_cols=441 Identities=18% Similarity=0.249 Sum_probs=405.0 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCC-----CCCHHHHHHHhhccccCChHHHHHHHHhhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFP-----HEQNKRYDLRKSVAKMTNVFRDVLEGLASK 75 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~-----~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~ 75 (455) || ||+++||+|.+++|+|++|||||+|+++||++|++||||++ +|++++|++|++||+|+|+|++||++|+|+ T Consensus 1 m~--~V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~G~ 78 (501) T protein:vir:95 1 MP--NVSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLVGQ 78 (501) T ss_pred CC--CCCCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHhhh Confidence 99 69999999999999999999999999999999999999864 556789999999999999999999999999 Q ss_pred hhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCC--cchhhhhHHhccCCceEEEech Q lcl|NC_020844. 76 PFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQ--QFRNREEERQAGVRPYWSQVNH 153 (455) Q Consensus 76 lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~--~~~t~ad~~~~~~rPy~~~~~~ 153 (455) +|+|+|+++ .|+.++.|++|||++|++|++|++++++++|.+|+||+|||||..+ ..+|+||++++++|||++.|+| T Consensus 79 vf~k~p~~~-~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~~~~rPy~~~~~~ 157 (501) T protein:vir:95 79 VFMRDPVVK-VPALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEAGRIRPTLYVYSP 157 (501) T ss_pred hhcCCccee-CcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHhccCCcEEEEecH Confidence 999999995 7999999999999999999999999999999999999999999763 4689999999999999999999 Q ss_pred hhhccceeeccCCeeEEEEEEEEee------------eeEEEEEecC-----eEEEEEEeCC----------------cE Q lcl|NC_020844. 154 SAILEIQSERQGAGERITYMRMWED------------AETILVLRPD-----DWERWRKNDA----------------GI 200 (455) Q Consensus 154 ~~ii~w~~~~~~~~~~l~~v~~~E~------------~e~~~v~~~~-----~~~~~~~~~~----------------g~ 200 (455) ++||||++++++|+.+|++++++|. +++|||++++ .|++||.... .+ T Consensus 158 ~~IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~~~~~~~~~~~~~~ 237 (501) T protein:vir:95 158 TEIINWRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKADGSKIPKGNYQQYVV 237 (501) T ss_pred hhhcCcceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCcccCcceecCCcccccce Confidence 9999999999999999999999985 4579999875 5789987543 25 Q ss_pred EEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCcc Q lcl|NC_020844. 201 WEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPK 280 (455) Q Consensus 201 ~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~ 280 (455) |.+++.|.|+|++||||++ +..++++.++.|||++||++|++|||++|||++++|++++|++|++|++.++.++..++ T Consensus 238 ~~~~~~g~~~l~~IPfv~~--~~~~~~~~~~~pPLl~lA~lni~hy~~ssd~~~~l~~~~~P~l~i~G~~~~~~~~~~~~ 315 (501) T protein:vir:95 238 YKPTDAQGKRLTEIPFMFI--GSENNDSNPDNPNFYDLASLNMAHYRNSADYEESCYIVGQPTPVLIGLTEEWVTNVLKG 315 (501) T ss_pred eeeeccCCCcCCeeeEEEE--ecCCCCCCCCccchHHHHHHHHHHHhhhhHHHHHHHHcccceeeeeCCcccccccCCCC Confidence 7788899999999999976 66788999999999999999999999999999999999999999999999998888889 Q ss_pred ceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhccCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 281 PLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLTAQSGNLTRISAAQAASKGNSAVQQ 360 (455) Q Consensus 281 ~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~ 360 (455) ++.+|++++|.+|.+ ++++|+|++|+++. +++|+++++||+++|+++++.+++++||+++++++++.+|.|+. T Consensus 316 ~i~~G~~~~~~lP~~-----~~~~~ie~~~~~i~--~~~l~~l~~~m~~~Ga~ll~~~~~~~Ta~~~~~~~~~~~S~L~~ 388 (501) T protein:vir:95 316 SVNFGSRGGIPLPVG-----ADAKLLQASENTML--KEAMDTKERQMVALGAKLVEQKEVQRTATEAELEAASEGSTLSS 388 (501) T ss_pred ceeecccccccCCCC-----CceeEEecChhhHH--HHHHHHHHHHHHHHHHhhccCCccchhHHHHHHHHHHHhHHHHH Confidence 999999999999954 58999999999985 78999999999999999999888899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHH Q lcl|NC_020844. 361 WAAGLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEE 440 (455) Q Consensus 361 ~a~~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~ 440 (455) +|.++++|++++|+|||+|+|+++.+++|+||+||....+++++++++++++++|.||++|++++|+++||++++.++++ T Consensus 389 ~a~~le~al~~~l~~~a~w~g~~~~~~~v~i~~df~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~~v~~~~~~~e~ 468 (501) T protein:vir:95 389 ATKNVSAAFEWALKWAARWVGQADSGVKFELNTDFDIARMTPDERRSLVEEWQKGAITFEEMRTGLRKAGVATEDDSKAK 468 (501) T ss_pred HHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecccccccCCHHHHHHHHHHHhCCCCcHHHHHHHHHhCCCCChhHHHHH Confidence 99999999999999999999998878999999999999999999999999999999999999999999999988766554 Q ss_pred HHHHHHhhccc-----CCCC Q lcl|NC_020844. 441 ESERLISELPG-----GIEE 455 (455) Q Consensus 441 e~~ri~~e~~~-----~l~~ 455 (455) ++|+++..+ .+.. T Consensus 469 --e~i~~~~~~~~~~~~~~~ 486 (501) T protein:vir:95 469 --EKIAKDTAEAMALATPAN 486 (501) T ss_pred --HHHHhhhcCcccccccCC Confidence 455443221 1111 No 3 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=100.00 E-value=2.9e-133 Score=747.23 Aligned_cols=443 Identities=20% Similarity=0.251 Sum_probs=406.8 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCC-----CCCHHHHHHHhhccccCChHHHHHHHHhhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFP-----HEQNKRYDLRKSVAKMTNVFRDVLEGLASK 75 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~-----~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~ 75 (455) || ||+++||+|.+++++|++|||||+|+++||++|++||||++ +|++++|++||+||+|+|+|++||++|+|+ T Consensus 32 m~--dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~ 109 (535) T protein:vir:80 32 LP--NVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLDGMMGQ 109 (535) T ss_pred CC--CCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHHHHhch Confidence 99 69999999999999999999999999999999999999987 456788999999999999999999999999 Q ss_pred hhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhh Q lcl|NC_020844. 76 PFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSA 155 (455) Q Consensus 76 lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ 155 (455) +|+|+|+++ .|+.++.|++|||++|++|++|++++++++|.+|+||+|||||..+..+|++|+++.++|||++.|+|++ T Consensus 110 vfrk~p~~~-~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~ade~~~~~rPy~~~y~ae~ 188 (535) T protein:vir:80 110 VFSRDPIRQ-LPPALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVLEQKLGLYRPTITLVHPTS 188 (535) T ss_pred hhcCCccee-ccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHHHHHhcCCCcEEEEechhh Confidence 999999995 7999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hccceeeccCCeeEEEEEEEEeee------------eEEEEEecC---eE--EEEEEeCC-------cEEEEeccCCCCC Q lcl|NC_020844. 156 ILEIQSERQGAGERITYMRMWEDA------------ETILVLRPD---DW--ERWRKNDA-------GIWEVDEFGRITI 211 (455) Q Consensus 156 ii~w~~~~~~~~~~l~~v~~~E~~------------e~~~v~~~~---~~--~~~~~~~~-------g~~~~~~~g~~~l 211 (455) ||||++++++|+.+|++++++|.+ ++||||+++ .| ++|+...+ +++...+.+.|+| T Consensus 189 IinW~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~~g~~~l 268 (535) T protein:vir:80 189 IINWRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLNAEGNYQVERWRRETQEEMYYSYSKHVPTDGNGNPF 268 (535) T ss_pred ccCccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEecCCceEEEEEEEeecCCccccccceeecccCCCccc Confidence 999999999999999999999864 489999884 45 45554322 2456778899999 Q ss_pred CceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccc-cCccceeeccceee Q lcl|NC_020844. 212 GVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSE-GSPKPLDTGPDTVL 290 (455) Q Consensus 212 ~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~-~~~~~~~~g~~~~~ 290 (455) ++||||+| +..++++.++.|||++||++|++|||++|||++++|++++|++|++|++..+.+. ..++++.+|++++| T Consensus 269 ~~IPfv~~--~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~il~~~~~P~l~i~G~~~~~~~~~~~~~~i~iG~~~~~ 346 (535) T protein:vir:80 269 KEIPFQFI--GPLDNNADIDHPPLLDLCEVNIGHYRNSADYEEMAFVAGQPTAFFTGLTKDWVEDVFKDFKVHLGSRAII 346 (535) T ss_pred CeeEEEEe--ecCCCCCCCCccchHHHHHHHHHHhhchhHHHHHHHHhcCceeeeecCchhhhhcCCCCcceEecCcccc Confidence 99999976 5677899999999999999999999999999999999999999999999887443 45678999999999 Q ss_pred eccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 291 YAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALE 370 (455) Q Consensus 291 ~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~ 370 (455) .+|++ ++++|+|++++++. +++|+++++||+++|++++...++++|++++++++++.+|+|+.+|.++++|++ T Consensus 347 ~lP~~-----~~~~~~e~~~~~~a--~~~l~~~e~qM~~lGa~ll~~~~~~~Ta~~a~~~~~~~~S~L~~~a~~le~al~ 419 (535) T protein:vir:80 347 PLPQG-----ATAGILQITPNSVP--FEAMTHKESQMIAMGANLLVKSGGNRTFGEAQQEEASEQSILSACTKNVSMAFR 419 (535) T ss_pred cCCCC-----CCcceeeeccchhH--HHHHHHHHHHHHHHHHHhhccCcccccHHHHHHHHHHHhHHHHHHHHHHHHHHH Confidence 99965 47999999999987 578999999999999999999899999999999999999999999999999999 Q ss_pred HHHHHHHHHcCCC--CCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhh Q lcl|NC_020844. 371 NALYITNLWMDAP--DTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISE 448 (455) Q Consensus 371 ~~l~~~a~~~g~~--~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e 448 (455) ++|+|||+|+|.. ++++.|++|+||....+++++++++++++++|.||++|++++|+|+|||+++.++++|+.||+.| T Consensus 420 ~aL~~~A~w~G~~~~~~~~~i~~n~dF~~~~ld~~~~~all~~~~~G~Is~et~~~~L~r~gvl~~~~~~eee~~ri~~E 499 (535) T protein:vir:80 420 KALRWANQFQTGIVNDETVEYNLNTDFPAARLTPNERAELILEWQQGAITFKEMRAGLRRAGVASEDDAKAETEGKATVE 499 (535) T ss_pred HHHHHHHHHcCCccCCCceEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcccchHHHHHHHHhh Confidence 9999999999963 45688999999999999999999999999999999999999999999999999999999999998 Q ss_pred ccc-----CCCC Q lcl|NC_020844. 449 LPG-----GIEE 455 (455) Q Consensus 449 ~~~-----~l~~ 455 (455) ... |+.. T Consensus 500 ~~~~~~~~g~~~ 511 (535) T protein:vir:80 500 FIAKTAAAGKVG 511 (535) T ss_pred hhhccccCCCCC Confidence 432 2111 No 4 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=100.00 E-value=3e-126 Score=708.75 Aligned_cols=440 Identities=17% Similarity=0.194 Sum_probs=398.7 Q ss_pred CCCC-----CCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCC-CCHHHHHHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQ-----DPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPH-EQNKRYDLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~-----~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~-e~~~~Y~~rl~ra~~~n~~~~~v~~~~g 74 (455) |-+. ||+++||+|.+++|+|++|||||+|++ .+..++.|||++++ +++++|++|++||+|+|+|++||++|+| T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~-~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G 79 (491) T protein:vir:95 1 MLTANGQGSGVKTKHREWLHYAPKWQKVRHALAGDL-VGYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVG 79 (491) T ss_pred CcccCCccCCCCccCHHHHHHHHHHHHHHHHhcCcc-hhhcccCCCcCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhc Confidence 6543 699999999999999999999999964 45557789999886 5577899999999999999999999999 Q ss_pred hhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechh Q lcl|NC_020844. 75 KPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHS 154 (455) Q Consensus 75 ~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~ 154 (455) ++|+|+|+++ .|+.++.|++|||++|++|++|++++++++|.+|+||+|||||..+ +.|+||++++++|||++.|+|+ T Consensus 80 ~vfrk~p~~~-~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~-~~T~Ade~~~~~rPy~~~~~~~ 157 (491) T protein:vir:95 80 SVMRKEPEIN-IPKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETA-AATAAEQNAGLLNPTIAFYTTE 157 (491) T ss_pred hhhcCCceee-ccHHHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCc-ccCHHHHHHhcCCcEEEEechh Confidence 9999999995 7999999999999999999999999999999999999999999875 5799999999999999999999 Q ss_pred hhccceeeccCCeeEEEEEEEEee--------------eeEEEEEecC-----eEEEEEEeCCc------EEEEeccCCC Q lcl|NC_020844. 155 AILEIQSERQGAGERITYMRMWED--------------AETILVLRPD-----DWERWRKNDAG------IWEVDEFGRI 209 (455) Q Consensus 155 ~ii~w~~~~~~~~~~l~~v~~~E~--------------~e~~~v~~~~-----~~~~~~~~~~g------~~~~~~~g~~ 209 (455) +||||++++++|+.+|+++|++|+ +++||||+++ .+++||+..+| ++...+.|.+ T Consensus 158 ~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~g~~~~~~~~~~~~~g~~ 237 (491) T protein:vir:95 158 NIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQRLFRFDAEGGAQEEVVEIYPDLGES 237 (491) T ss_pred hhcCceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeecCCCceEEEEEEEcCCCcceeeeeeeeecCCCc Confidence 999999999999999999999985 3579999875 56889876544 2344578999 Q ss_pred CCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCC---CccccccCccceeecc Q lcl|NC_020844. 210 TIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVE---PDTDSEGSPKPLDTGP 286 (455) Q Consensus 210 ~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~---~~~~~~~~~~~~~~g~ 286 (455) +|++||||++ +..++++.++.|||+|||++|++|||++|||++++|++++|++|++|.+ .++.++++++++.+|+ T Consensus 238 ~l~~IPfv~~--~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~G~d~~~~~~~~~~~~~~i~~g~ 315 (491) T protein:vir:95 238 LRGVIPFTFI--GATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGDNLTPQSFKEANPNGIKFGS 315 (491) T ss_pred ccCeeEEEEE--ecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCcccCcchhhccCcceeEecC Confidence 9999999987 4567899999999999999999999999999999999999999999955 5667788889999999 Q ss_pred ceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 287 DTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLK 366 (455) Q Consensus 287 ~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~ 366 (455) ++++.+|++ ++++|+|++++++ .++.|+++++||+.+|++++..+ +++|++++++++++.+|+|+.++.+++ T Consensus 316 ~~~~~lP~~-----~~~~~ie~~~~~~--~~~~l~~~e~qm~~~Ga~l~~~~-~~~Ta~~~~~~~~~~~S~L~~~a~~~e 387 (491) T protein:vir:95 316 RCGHNLGYG-----GSAQLIQAGENNL--ARQNMLDKEQQAIQIGAQLITPS-QQITAESARIQRGADTSVMATIARNVS 387 (491) T ss_pred cCCcCCCCC-----CccceeecCcchH--HHHHHHHHHHHHHHHHHHhccCC-cchhHHHHHHHHHHhhHHHHHHHHHHH Confidence 999999964 4899999999887 48999999999999999998764 589999999999999999999999999 Q ss_pred HHHHHHHHHHHHHcCCC-CCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHH Q lcl|NC_020844. 367 DALENALYITNLWMDAP-DTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERL 445 (455) Q Consensus 367 ~a~~~~l~~~a~~~g~~-~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri 445 (455) +|++++|+|||+|+|++ +.++.|++|+||....+++++++++++++++|.||++|++++|+++||++ .++|+++++| T Consensus 388 ~al~~~l~~~a~w~G~~~~~~v~i~~n~dF~~~~~~~~~~~all~~~~~G~is~~t~~~~L~~~~vl~--~~~e~~~~~i 465 (491) T protein:vir:95 388 QAYTDALRWVAMMLGKPEDSEVEFQLNMDFFLQPMTAQDRAAWMADINAGLLPATAYYAALRKAGVTD--WTDEDILNAI 465 (491) T ss_pred HHHHHHHHHHHHHcCCCCCCceEEEeecccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCC--ccHHHHHHHH Confidence 99999999999999985 55789999999999999999999999999999999999999999999984 4789999999 Q ss_pred Hhhccc--CCCC Q lcl|NC_020844. 446 ISELPG--GIEE 455 (455) Q Consensus 446 ~~e~~~--~l~~ 455 (455) ++|.+. .... T Consensus 466 e~~~~~~~~~~~ 477 (491) T protein:vir:95 466 EDAPLPSGAVTQ 477 (491) T ss_pred HhcCCCCCcccc Confidence 999653 3322 No 5 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=100.00 E-value=1.6e-125 Score=704.71 Aligned_cols=440 Identities=18% Similarity=0.184 Sum_probs=396.4 Q ss_pred CCCC-----CCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCC-CHHHHHHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQ-----DPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHE-QNKRYDLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~-----~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e-~~~~Y~~rl~ra~~~n~~~~~v~~~~g 74 (455) |-+. ||+++||+|.+++|+|++|||||+|++..+. +..|||+++++ ++++|++|++||+|+|+|++||++|+| T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~-r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G 79 (489) T protein:vir:78 1 MLTENGQGSGVKTKHREWLHYAPKWQKVRHALAGELVSYL-RNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVG 79 (489) T ss_pred CccCCCccCCCCccCHHHHHHHHHHHHHHHHhcCcccccc-cCCCCCCCCCCCChHHHHHHHhccccCChHHHHHHHHhc Confidence 6543 6999999999999999999999999765443 45799998875 467799999999999999999999999 Q ss_pred hhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechh Q lcl|NC_020844. 75 KPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHS 154 (455) Q Consensus 75 ~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~ 154 (455) ++|+|||+++ .|+.++.|++|||++|++|++|++++++++|.+|+||+|||||..+ ++|+||++++++|||++.|+|+ T Consensus 80 ~vfrk~p~~~-~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~-~~T~ade~~~~~rPy~~~~~~~ 157 (489) T protein:vir:78 80 SVMRKEPEIN-IPKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETG-AATAAEQNAGLLNPTIAFYTTE 157 (489) T ss_pred hhhcCCccee-ccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCC-CcCHHHHHHhcCCcEEEEechh Confidence 9999999994 7999999999999999999999999999999999999999999865 6899999999999999999999 Q ss_pred hhccceeeccCCeeEEEEEEEEee--------------eeEEEEEecC-----eEEEEEEeCCcE------EEEeccCCC Q lcl|NC_020844. 155 AILEIQSERQGAGERITYMRMWED--------------AETILVLRPD-----DWERWRKNDAGI------WEVDEFGRI 209 (455) Q Consensus 155 ~ii~w~~~~~~~~~~l~~v~~~E~--------------~e~~~v~~~~-----~~~~~~~~~~g~------~~~~~~g~~ 209 (455) +||||++++++|+.+|+++|++|+ ++|||||+++ ++++||...+|. ....+.|.+ T Consensus 158 ~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~~~~~~~~~~g~~ 237 (489) T protein:vir:78 158 NIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQEDVVEIYPDLGES 237 (489) T ss_pred hhcCceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecCCcccceeeEEeccCCCC Confidence 999999999999999999999985 3579999987 457888765542 123478999 Q ss_pred CCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCC---CccccccCccceeecc Q lcl|NC_020844. 210 TIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVE---PDTDSEGSPKPLDTGP 286 (455) Q Consensus 210 ~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~---~~~~~~~~~~~~~~g~ 286 (455) +|++||||++ +..++++.++.|||+|||++|++|||++|||++++|++++|++|++|.+ .++.++++++++++|+ T Consensus 238 ~l~~IPfv~~--~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~G~d~~~~~~~~~~~~~~i~~g~ 315 (489) T protein:vir:78 238 LRGVIPFTFI--GATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGENLTPQAFKEANPNGIKFGS 315 (489) T ss_pred ccCeeeEEEE--ecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCccCCcccccccCccceeeCC Confidence 9999999987 4567899999999999999999999999999999999999999999976 4556788889999999 Q ss_pred ceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 287 DTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLK 366 (455) Q Consensus 287 ~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~ 366 (455) ++++.+|.+ ++++|+|++++++. +++|++++++|+++|+++++. ++++|++++++++++.+|+|+.+|.+++ T Consensus 316 ~~~~~lp~~-----~~~~~ie~~~~~~~--r~~l~~le~qm~~lGa~l~~~-~~~~Ta~~~~~~~~~~~S~L~~~a~~~e 387 (489) T protein:vir:78 316 RRGHNLGYG-----GSAQLIQAGENNLA--RQNMLDKEQQAIQIGAQLITP-TQQITAQSARIQRGADTSVMATIARNVS 387 (489) T ss_pred cccccCCCC-----CCcceeccCcchHH--HHHHHHHHHHHHHHhhhhccC-CcchhHHHHHHHHHHhhHHHHHHHHHHH Confidence 999999965 47999999998874 899999999999999999985 4589999999999999999999999999 Q ss_pred HHHHHHHHHHHHHcCCC-CCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHH Q lcl|NC_020844. 367 DALENALYITNLWMDAP-DTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERL 445 (455) Q Consensus 367 ~a~~~~l~~~a~~~g~~-~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri 445 (455) +|++++|+|||+|+|++ +.++.|++|+||....+++++++++++++++|.||++|++++|+++||+++ +++++.++| T Consensus 388 ~al~~~l~~~a~w~G~~~~~~~~i~~n~dF~~~~~d~~~~~al~~~~~~G~is~~t~~~~L~~~gv~d~--~~e~~~~ei 465 (489) T protein:vir:78 388 QAYTDALRWVAVMLGKPEDTEVEFRLNMDFFLEPMTAQDRAAWMADINAGLLPATAYYAALRKAGVTDW--TDADIKDAV 465 (489) T ss_pred HHHHHHHHHHHHHcCCCCCCceEEEeecccCcccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCc--cHHHHHHHH Confidence 99999999999999986 456899999999999999999999999999999999999999999999864 678888999 Q ss_pred Hhhccc-CCCC Q lcl|NC_020844. 446 ISELPG-GIEE 455 (455) Q Consensus 446 ~~e~~~-~l~~ 455 (455) ++|++. +.+. T Consensus 466 ~~~~~~~~~~~ 476 (489) T protein:vir:78 466 ADQPLPVATEV 476 (489) T ss_pred hhcCCCcccCC Confidence 999653 3333 No 6 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=100.00 E-value=1.1e-122 Score=689.17 Aligned_cols=420 Identities=14% Similarity=0.143 Sum_probs=374.9 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGRE 80 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~ 80 (455) || |+++||+|.+++++|++|||||+|+++||++|++||||+++|++++|++|++||+|+|++++||++|+|++|+|| T Consensus 1 m~---V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~vf~k~ 77 (452) T protein:vir:94 1 MP---IETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSGMVLDQP 77 (452) T ss_pred CC---CCCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhchhhcCC Confidence 99 999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhccce Q lcl|NC_020844. 81 VEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILEIQ 160 (455) Q Consensus 81 p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w~ 160 (455) |+++ .|+.++.+ ++|++|++|++|++++++++|.+|+||+|||||.. +.|||++.|+|++||||+ T Consensus 78 p~~~-~p~~l~~~--~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~------------g~rPy~~~~~~~~Ii~W~ 142 (452) T protein:vir:94 78 PVIT-HPDAMSKY--FEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLT------------GGDPYISVYTTENILNWE 142 (452) T ss_pred ceec-ccHHHHHH--HhcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccC------------CCceEEEEechhhhcCcc Confidence 9995 68999888 67999999999999999999999999999999964 568999999999999999 Q ss_pred eeccCCeeEEEEEEEEee--------------eeEEEEEe--cCeEEE--EEEeCCcEE-----EEeccCCCCCCceeEE Q lcl|NC_020844. 161 SERQGAGERITYMRMWED--------------AETILVLR--PDDWER--WRKNDAGIW-----EVDEFGRITIGVVPMV 217 (455) Q Consensus 161 ~~~~~~~~~l~~v~~~E~--------------~e~~~v~~--~~~~~~--~~~~~~g~~-----~~~~~g~~~l~~vP~V 217 (455) ++.+|+ +++++++|. +++||||+ ++.|++ |+..+++.| ..++.+.++|++|||| T Consensus 143 ~~~~g~---l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~IP~v 219 (452) T protein:vir:94 143 EDEDGR---LLMVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAKTSTIQNVGVTMDYIPFF 219 (452) T ss_pred ccccCC---eeEEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeeeccceeecCCCcccceeEEE Confidence 998876 556666653 34789976 777765 444555555 4557788999999999 Q ss_pred EEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCC Q lcl|NC_020844. 218 PFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAE 297 (455) Q Consensus 218 ~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~ 297 (455) ++ +..++++.++.|||+|||++|++|||++|||++++|++++|++|++|++..+ ++.+|++++|.+|+ T Consensus 220 ~~--~~~~~~~~~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~~~~~-------~i~iG~~~~~~lpe--- 287 (452) T protein:vir:94 220 CI--TPSGLSMTPAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWITGAESQS-------TMHIGSTKAWVIPE--- 287 (452) T ss_pred EE--cCCCCCCCCCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEeecCcCCC-------ceEecccccccCCC--- Confidence 77 5667788999999999999999999999999999999999999999987544 68999999999994 Q ss_pred ccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhccCCC-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 298 GQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLTAQS-GNLTRISAAQAASKGNSAVQQWAAGLKDALENALYIT 376 (455) Q Consensus 298 ~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~~~-~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~ 376 (455) .+++++|+|++|+++++++++|++++++|+.+|++++..++ +++|++++++++++.+|.|+.++.++++|++++|+|| T Consensus 288 -~~~~~~yie~~g~~i~~~~~~l~~le~~m~~~Ga~ll~~~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al~~~l~~~ 366 (452) T protein:vir:94 288 -VAAKVGFLEFTGQGLQSLEKALSEKQAQLASLSARLIDNSTRGSEATETVKLRYMSETASLKSVTRAVEALLNKAYSCI 366 (452) T ss_pred -CCCcceEEccCchhHHHHHHHHHHHHHHHHHHHHHhhccCCCcchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH Confidence 24589999999999999999999999999999999998876 5678888899999999999999999999999999999 Q ss_pred HHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 377 NLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 377 a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) |+|+|.+. ++.|+||+||....+++++++++++++++|.||++|++++|+++||++++.+.+..++.++.+.+.--++ T Consensus 367 a~w~g~~~-~~~v~~n~dF~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~gvl~~~~e~~~i~~E~~~~~~~~~~~ 444 (452) T protein:vir:94 367 MDMESMGG-TLNIKLNSAFLDSKLTAAELKAWVEAYLSGGISKEIYIHALKVGKVLPPPGESMGVIPDPPAPEPSPSNT 444 (452) T ss_pred HHHcCCCC-ceEEEeccccccccCCHHHHHHHHHHHhcCCCcHHHHHHHHHhCCCCCCccCHHHHHHHhhccCcccCCC Confidence 99999854 7899999999999999999999999999999999999999999999987655554444444444433444 No 7 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=100.00 E-value=1.2e-122 Score=689.05 Aligned_cols=434 Identities=13% Similarity=0.134 Sum_probs=380.7 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCC-----CHHH-----------HHHH-hhccccCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHE-----QNKR-----------YDLR-KSVAKMTN 63 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e-----~~~~-----------Y~~r-l~ra~~~n 63 (455) || |+++||+|.+++|+|++++|| |+++||++|++||||++.+ ++.. |.+| ++||+|+| T Consensus 14 m~---V~~~hp~y~a~~~~W~~~~d~--g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~~~~rA~~~n 88 (488) T protein:vir:96 14 ML---TPIYHPDYLVNAPQWLRNLDC--VMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDLTWRLANYVN 88 (488) T ss_pred ec---ccccCHHHHHHhhhhhHhhhh--hhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchhhhHhhhhhccccCc Confidence 99 999999999999999999985 6778999999999998753 2333 4333 34899999 Q ss_pred hHHHHHHHHhhhhhcCCceeccC-CHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhc Q lcl|NC_020844. 64 VFRDVLEGLASKPFGREVEVTEG-TGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQA 142 (455) Q Consensus 64 ~~~~~v~~~~g~lf~k~p~i~~~-~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~ 142 (455) ++++||++|+|++|+|+|+++.. ++.++.|++|||++|++|++|++++++++|.+|+||+|||||+ .+.|+||++++ T Consensus 89 ~~~~tl~~l~G~vfrk~p~~~~~~~~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~--~~~T~ade~~~ 166 (488) T protein:vir:96 89 IVNPTMNAITGAVMRREPEFDTMDNPVLIGLRDNIDGKGNGIDQECKQALNALQWGSRCGWLVRSHP--ESATMADWNKG 166 (488) T ss_pred hhHHHHHHhcchhhccCceeccCCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCC--CcCCHHHHHHh Confidence 99999999999999999999743 4679999999999999999999999999999999999999995 35799999999 Q ss_pred cCCceEEEechhhhccceeeccCCeeEEEEEEEEeeee-----------E--EEEEecCeEEEEEEeCC---cEEEEecc Q lcl|NC_020844. 143 GVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAE-----------T--ILVLRPDDWERWRKNDA---GIWEVDEF 206 (455) Q Consensus 143 ~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e-----------~--~~v~~~~~~~~~~~~~~---g~~~~~~~ 206 (455) ++|||++.|+|++||||++++++|+.+|++++++|+++ + ++.|+++.|++|+...+ ++|++.+. T Consensus 167 ~~rPy~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D~~~~~~~~~~~~~~l~~g~~~v~~~~~~~~~~e~~~~~~ 246 (488) T protein:vir:96 167 KKLPTAAFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERDGGTYVSKQRLINHRLVDGLCEFQEVTDDEYSDEWTPVLI 246 (488) T ss_pred cCCcEEEEechhhhcCcceeccCCceeeEEEEEEEEEEeccCCCcccceEEEEEEEECcEEEEEEEecCCcccceEeecC Confidence 99999999999999999999999999999999998642 3 44478899999986433 46788899 Q ss_pred CCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEec--CCCccccccCccceee Q lcl|NC_020844. 207 GRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANG--VEPDTDSEGSPKPLDT 284 (455) Q Consensus 207 g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G--~~~~~~~~~~~~~~~~ 284 (455) |.++|++|||||| +..++++.++.|||+|||++|++|||++|||++++|++++|++++.+ .+.+++++.++.++.+ T Consensus 247 g~~~l~~IP~v~~--~~~~~~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~~~~~p~lv~~~~~~~~~~~~~~~~~g~~~ 324 (488) T protein:vir:96 247 NSKQSDTIPFFLA--SSQSNEWCIDSTPLTSLAEISLSIYVMNAYSNKAMILANEAKWMVDMGDMNKTMASEMNPLGFTL 324 (488) T ss_pred CCcccCeeEEEEE--ecCCCCCCCCCCchHHHHHHHHHHHhhhhHHHHHHHhcCCceeeeccCCCCcccccccccceeee Confidence 9999999999987 56678999999999999999999999999999999999999999753 3444455554445555 Q ss_pred ccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 285 GPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAG 364 (455) Q Consensus 285 g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~ 364 (455) |+..++..| .|+++|+|++++++ .+++|+++++||+++|+++++.+ +++||+++++++++.+|+|+.+|.+ T Consensus 325 ~~~~~~~~~------~g~~~~~e~~~~~l--~~~~l~~l~~qm~~~Ga~l~~~~-~~~Ta~~~~~~~~~~~S~L~~~a~~ 395 (488) T protein:vir:96 325 AGRMPYYVK------NGDVKVIQAQFSPE--TENKVEKLFEQAVKVGASLFTQQ-SNETATGAAIRSGSSTASMATLGNN 395 (488) T ss_pred ccccccccc------CCceeecCCchhHH--HHHHHHHHHHHHHHHhHhhccCC-CcchHHHHHHHHHHhhHHHHHHHHH Confidence 544444333 46899999999887 48889999999999999999864 5799999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHcCCC-----CCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHH Q lcl|NC_020844. 365 LKDALENALYITNLWMDAP-----DTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPE 439 (455) Q Consensus 365 ~~~a~~~~l~~~a~~~g~~-----~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e 439 (455) +++|++++|+|||+|+|++ +.+++|+||+||....+++++++++++++++|.||++|++++|+|+|||+++.++| T Consensus 396 le~al~~~l~~~A~w~g~~~~~~~~~~~~~~in~dF~~~~ld~~~~~al~~~~~~G~Is~~t~~~~L~~~gvl~~d~~~e 475 (488) T protein:vir:96 396 VEDTVRNMLRFIMRYFEGTNLYVNPDELVFKLNRDYFDVEVNPQMLQVAYAAMMEGNLPQVSWFELLKRARVVRGDMSKE 475 (488) T ss_pred HHHHHHHHHHHHHHHcCCCCCCcCccceEEEeccCCCCccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCcCCccCCHH Confidence 9999999999999999964 34678999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhcccCC Q lcl|NC_020844. 440 EESERLISELPGGI 453 (455) Q Consensus 440 ~e~~ri~~e~~~~l 453 (455) +|++||++++.+ | T Consensus 476 ~~~~~ie~~g~~-~ 488 (488) T protein:vir:96 476 EFDEHIAELGFG-M 488 (488) T ss_pred HHHHHHhhcCCC-C Confidence 999999986544 5 No 8 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=100.00 E-value=3.1e-49 Score=286.49 Aligned_cols=410 Identities=11% Similarity=0.063 Sum_probs=292.0 Q ss_pred CCCCCCCcc-------CHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHh Q lcl|NC_020844. 1 MPEQDPTKR-------SADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLA 73 (455) Q Consensus 1 m~~~~v~~~-------~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~ 73 (455) +.+.+.... -..+....++++++.+.|.|.+.+-.+...+..... .+..+...-+..|+++.+|++.+ T Consensus 37 ~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~-----~~~~~~~~ri~~n~~k~Ivd~~~ 111 (492) T protein:vir:94 37 RTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGA-----VDPLKPDDRMITNFHANLVDQKV 111 (492) T ss_pred ccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc-----ccccccccccccchHHHHHHHHH Confidence 222222111 123445568889999999997654322111111111 11111111134799999999999 Q ss_pred hhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEech Q lcl|NC_020844. 74 SKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNH 153 (455) Q Consensus 74 g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~ 153 (455) |++|++||+++..++.....++++. +++++..+..+++.++++|++|++|+.++.+ +|-++.++| T Consensus 112 ~yl~G~p~~~~~~d~~~~~~l~~~~--~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg-------------~~~~~~~~p 176 (492) T protein:vir:94 112 SYIVGKPIAFKHTDDEVVKRIDEVL--GNRFDDKLHSVLTGASNKGIEWLHPYLDEEG-------------EFKLFRVPA 176 (492) T ss_pred hhhcccCceeccCchHHHHHHHHHH--hccHHHHHHHHHHHHhhCCeEEEEEEecCCC-------------ceEEEEEcc Confidence 9999999999866666666666664 3689999999999999999999999876543 367889999 Q ss_pred hhhcc-ceeeccCCeeEEEEEEEE--eeeeEEEEEecCeEEEEEEeCCc---------EEEEeccCCCCCCceeEEEEEe Q lcl|NC_020844. 154 SAILE-IQSERQGAGERITYMRMW--EDAETILVLRPDDWERWRKNDAG---------IWEVDEFGRITIGVVPMVPFIT 221 (455) Q Consensus 154 ~~ii~-w~~~~~~~~~~l~~v~~~--E~~e~~~v~~~~~~~~~~~~~~g---------~~~~~~~g~~~l~~vP~V~f~~ 221 (455) .+++. |+....+ ..+..+++. +..+.+.+|+++.+..|+...++ .+......+|+||.||||+|++ T Consensus 177 ~~~~~v~d~~~~~--~~~a~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n 254 (492) T protein:vir:94 177 EQGIPIWTDKEHE--ELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN 254 (492) T ss_pred cceEEEEcCCCCC--ceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeeccccccccccccccccCCCccceEEecC Confidence 99886 4432322 234445543 34456788999888877655443 2233456689999999999987 Q ss_pred cccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCcccc Q lcl|NC_020844. 222 GRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHG 301 (455) Q Consensus 222 ~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~ 301 (455) +....+ .|.++..+..+..+..|++.+.+.+.++|+++++|++..+..++..+ ++...++.++. ++ T Consensus 255 n~~~~s------d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~---~~~~~~~~~~~-----~~ 320 (492) T protein:vir:94 255 NDLEIS------DIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKRL---LRYYGAIKVSD-----NG 320 (492) T ss_pred CCCCCC------chHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhHHH---HhhccceecCC-----CC Confidence 654333 34455555555666789999999999999999999987766554432 23345666654 45 Q ss_pred ceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 302 SWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNL 378 (455) Q Consensus 302 ~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~ 378 (455) +++|++.+. +.+.++..++.+++.|+.++..+. ...+++.||+|+++....+...+..+...|+.+|++++++++. T Consensus 321 ~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~ 399 (492) T protein:vir:94 321 GVDTIQVEV-PVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFE 399 (492) T ss_pred cceeEeccC-CHHHHHHHHHHHHHHHHHHhCCcCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 789988664 447788999999999999988763 2345789999999999999999999999999999999999999 Q ss_pred HcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 379 WMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 379 ~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) ++|.......+++.+++......++.++++.++ .|+||+||+++.| +...|+++|++||++|.+...+. T Consensus 400 ~~~~~~~~~~i~v~f~~~~p~~~~e~~~~~~kl--~giiS~et~~~~l------~~v~d~~~E~eri~~E~~~~~~~ 468 (492) T protein:vir:94 400 HFDIKGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENH------PFVEDLQAELERIEQEQMEYNKQ 468 (492) T ss_pred HhcCCcccceeeEEecCCCCCCHHHHHHHHHHH--hccCchHHHHHhC------CCCCCHHHHHHHHHHHHHHHHhh Confidence 999876555566665554444456778888776 5999999998866 55668999999999885533333 No 9 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=100.00 E-value=3.6e-49 Score=286.16 Aligned_cols=420 Identities=11% Similarity=0.068 Sum_probs=291.8 Q ss_pred CCCCC---CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhh Q lcl|NC_020844. 1 MPEQD---PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPF 77 (455) Q Consensus 1 m~~~~---v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf 77 (455) +.+.+ +..---.+....++++++.+.|.|.+.+.. .+......-..|+ -.|+++.+|++.+||+| T Consensus 13 ~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~--------~~~~~~~~~~~ki----~~n~~~~Iv~~~~~~l~ 80 (499) T protein:vir:10 13 VNEPNIEAINYAIRELQNRKKRLDKLSDYYNGKQEIEK--------HEFDNATVEAANV----MVNHAKYITDMNVGFMT 80 (499) T ss_pred hhcCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhc--------CCcCcCCCCccee----ecchHHHHHHHHhhhhc Confidence 22112 222223445566889999999999765421 1211111112233 26999999999999999 Q ss_pred cCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhh----hHHhccCCceEEEech Q lcl|NC_020844. 78 GREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNRE----EERQAGVRPYWSQVNH 153 (455) Q Consensus 78 ~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~a----d~~~~~~rPy~~~~~~ 153 (455) ++||+++..++....-+.++ .+.++++.++..+++.++++|++|++|+.++.+.+.... .......++.+..++| T Consensus 81 g~p~~~~~~~~~~~~~l~~~-~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~v~p 159 (499) T protein:vir:10 81 GNPVKYVAEKGKNIDDILEV-FNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNTELKIEVIDP 159 (499) T ss_pred ccCceeecCChhHHHHHHHH-HhhcCHhHHHHHHHHHHHhcCceEEEEEecccccccccccccccccccccceEEEEEcc Confidence 99999975555443333344 245789999999999999999999999887766543221 1112233466888999 Q ss_pred hhhccceeeccCCeeEEEEEEEEee--------eeEEEEEecCeEEEEEEeCCc----EEEEeccCCCCCCceeEEEEEe Q lcl|NC_020844. 154 SAILEIQSERQGAGERITYMRMWED--------AETILVLRPDDWERWRKNDAG----IWEVDEFGRITIGVVPMVPFIT 221 (455) Q Consensus 154 ~~ii~w~~~~~~~~~~l~~v~~~E~--------~e~~~v~~~~~~~~~~~~~~g----~~~~~~~g~~~l~~vP~V~f~~ 221 (455) ++++....+.. +...+..+++.+. +..+.+|+++.+..|+..+.+ .+......+|+||.||||+|.+ T Consensus 160 ~~~~~v~~d~~-~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n 238 (499) T protein:vir:10 160 RATVVVCDDTV-EHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDPIVYDGENLFGAVPIIEFRN 238 (499) T ss_pred cceEEEecCCC-CcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCcceecccccCCCCccceEEecC Confidence 99886544433 3445555554332 346788999999888765443 3456778899999999999987 Q ss_pred cccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCcccc Q lcl|NC_020844. 222 GRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHG 301 (455) Q Consensus 222 ~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~ 301 (455) +....+. +.++..+..+.....|++.+.+.+.++|+++++|...+++.++.. ....+.++.++. ++++ T Consensus 239 ~~~~~~d------~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~---~~~~~~~~~~~~---~~~~ 306 (499) T protein:vir:10 239 NEERQGD------FEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDKDDIQ---RLKRGAIEAPPR---EEGA 306 (499) T ss_pred CCCCCCc------hHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccchhh---hhhhcceeccCC---CCCC Confidence 6543333 344444444555677899999999999999999988765544322 233344555442 3467 Q ss_pred ceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhcc---CCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 302 SWNFVEPATESLKFLAEQIESVSKEIRELGRQPLT---AQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNL 378 (455) Q Consensus 302 ~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~---~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~ 378 (455) +++|++.+.+ .+.++..++.+.++|+.++..+.. ..++|.||+|+++....+...+..+...|+.+|++++++++. T Consensus 307 d~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~ 385 (499) T protein:vir:10 307 DIEWLTKSFD-ETQVNLLSQSIENDIHKISYVPNMNDEKFMGNVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQT 385 (499) T ss_pred cceEEeccCC-HHHHHHHHHHHHHHHHHHhCcccCCchhhcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 8999997764 577899999999999999887632 235789999999999999999999999999999999999999 Q ss_pred HcCCCCCce---EEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 379 WMDAPDTNP---EADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 379 ~~g~~~~~~---~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) +++..+... .+++...........+.++++.++ .|+||+||+++.| +...|+++|++||++|.++.+.. T Consensus 386 ~~~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~l------~~v~d~~~E~~ri~~E~~~~~~~ 457 (499) T protein:vir:10 386 IVNIKGANDDASGCKISLVANIPSNLSDVVNNVKNA--DGIIPRKYTYSWL------PDVDNPQDVIDEMNQQDAETIKK 457 (499) T ss_pred HHhccCCccccccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhC------CCCCCHHHHHHHHHHHHHHHHHH Confidence 987643221 223333233333356778888876 6999999999765 67778999999999885432211 No 10 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=100.00 E-value=5.2e-49 Score=285.29 Aligned_cols=411 Identities=11% Similarity=0.078 Sum_probs=291.5 Q ss_pred CCCCCCCc-------cCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHh Q lcl|NC_020844. 1 MPEQDPTK-------RSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLA 73 (455) Q Consensus 1 m~~~~v~~-------~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~ 73 (455) +.+.+... --..+....+++.++.+.|.|.+.+-.....+.. ....+..+-..-+..|+++.+|++++ T Consensus 37 ~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~-----~~~~~~~~~~~ri~~n~~k~Ivd~~~ 111 (492) T protein:vir:97 37 RTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDA-----TGAVDPLKPDDRMITNFHANLVDQKV 111 (492) T ss_pred cCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccc-----cccccccccccccccchHHHHHHHHh Confidence 22222111 1123445568888888999887543211111110 00111111111124799999999999 Q ss_pred hhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEech Q lcl|NC_020844. 74 SKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNH 153 (455) Q Consensus 74 g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~ 153 (455) ||+|++||+++..++.....++++. +++++..+..+++.++++|++|++|+.++.+ +|-++.++| T Consensus 112 ~yl~g~p~~~~~~d~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~a~~~v~~d~dg-------------~~~~~~~~p 176 (492) T protein:vir:97 112 SYIVGKPIAFKHTDDEVVKRIDEVL--GNRFDDKLHSVLTGASNKGIEWLHPYLDEEG-------------EFKLFRVPA 176 (492) T ss_pred hhhcccCceeccCchHHHHHHHHHH--hccHHHHHHHHHHHHhhcCeEEEEEEecCCC-------------ceEEEEEcc Confidence 9999999999876676777777664 3689999999999999999999999876544 366888999 Q ss_pred hhhccceeeccCCeeEEEEEEEEe--eeeEEEEEecCeEEEEEEeCCcE---------EEEeccCCCCCCceeEEEEEec Q lcl|NC_020844. 154 SAILEIQSERQGAGERITYMRMWE--DAETILVLRPDDWERWRKNDAGI---------WEVDEFGRITIGVVPMVPFITG 222 (455) Q Consensus 154 ~~ii~w~~~~~~~~~~l~~v~~~E--~~e~~~v~~~~~~~~~~~~~~g~---------~~~~~~g~~~l~~vP~V~f~~~ 222 (455) ++++....+... .+.+..+++.. ..+.+.+|+++.+..|+..+++. +......+|+||.||||+|+++ T Consensus 177 ~~~~~i~d~~~~-~~~~~~vr~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn 255 (492) T protein:vir:97 177 EQGIPIWTDKEH-EELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNN 255 (492) T ss_pred cceEEEEcCCCC-CceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCC Confidence 999874333222 23444555533 44567889999888887654431 2234557899999999999876 Q ss_pred ccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccc Q lcl|NC_020844. 223 RRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGS 302 (455) Q Consensus 223 ~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~ 302 (455) ....+. +.++..+..+..+..|++.+.+.+.++|+++++|++.++..++..+ ++...++.++. +++ T Consensus 256 ~~g~sd------~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~---~~~~~~~~~~~-----~~~ 321 (492) T protein:vir:97 256 DLEISD------IFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKRL---LRYYGAIKVSD-----NGG 321 (492) T ss_pred CCCCCc------hHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhHHHH---HhhccceecCC-----CCc Confidence 544344 3444444445556778999999999999999999987765554332 33445666664 347 Q ss_pred eeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 303 WNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLW 379 (455) Q Consensus 303 ~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~ 379 (455) ++|++.+. +.+.++..++++++.|+.++..+. ...++|.||+|+++....+......+...|+.+|++++++++.+ T Consensus 322 ~~~l~~~~-~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~ 400 (492) T protein:vir:97 322 VDTIQVEV-PVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEH 400 (492) T ss_pred ceeEeccC-CHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 89988664 557889999999999999987663 33457899999999999999999999999999999999999999 Q ss_pred cCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 380 MDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 380 ~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) +|.......+++.++........+.++++.++ +|+||+||+++.| +...|+++|++||++|.++..+. T Consensus 401 ~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl--~G~iS~et~l~~l------~~v~d~~~Eleri~~E~~~~~~~ 468 (492) T protein:vir:97 401 FDIKGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENH------PFVEDLQAELERIEQEQTEYNKQ 468 (492) T ss_pred hcCCcccceeeEEecCCCCCCHHHHHHHHHHH--hccCchHHHHHhC------CCCCCHHHHHHHHHHHHHHHHHh Confidence 99876555666665444444456778888876 6999999998866 56668999999999986533222 No 11 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=100.00 E-value=6.7e-48 Score=279.22 Aligned_cols=421 Identities=12% Similarity=0.061 Sum_probs=297.9 Q ss_pred CCCCCCCccC----HHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHh---hccccCChHHHHHHHHh Q lcl|NC_020844. 1 MPEQDPTKRS----ADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRK---SVAKMTNVFRDVLEGLA 73 (455) Q Consensus 1 m~~~~v~~~~----p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl---~ra~~~n~~~~~v~~~~ 73 (455) |....+...- ..+....+++..+.+.|.|.+.+..+...+.++............. ..-+..|+++.+|++.+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 5544332211 1223456788899999999876543322222221111111111111 11134799999999999 Q ss_pred hhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEech Q lcl|NC_020844. 74 SKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNH 153 (455) Q Consensus 74 g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~ 153 (455) ||+|++||+++..++.....++.+. .++++...+.+++.++++|++|++|+.+... .+|-+..++| T Consensus 81 ~yl~G~p~~~~~~~~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~------------g~~~~~~~~p 146 (471) T protein:vir:10 81 AYALTYPPTFDVDDKKVNDMIVDVL--GDDYERISKQLCVNAGNAGIAWLHVWKDASD------------NSFRYACVDS 146 (471) T ss_pred hhhcccCceeccCChHHHHHHHHHH--hcCHHHHHHHHHHHHhhCCeEEEEEEeeCCC------------CeeEEEEEcc Confidence 9999999999877777777777764 3689999999999999999999999875321 1467888999 Q ss_pred hhhccceeeccCCeeEEEEEEEEe--------eeeEEEEEecCeEEEEEEeCCc-------------------EEEEecc Q lcl|NC_020844. 154 SAILEIQSERQGAGERITYMRMWE--------DAETILVLRPDDWERWRKNDAG-------------------IWEVDEF 206 (455) Q Consensus 154 ~~ii~w~~~~~~~~~~l~~v~~~E--------~~e~~~v~~~~~~~~~~~~~~g-------------------~~~~~~~ 206 (455) ++++....+... .+.+..+++.+ .+..+.+|+.+.+..|+..+++ .+...+. T Consensus 147 ~~~~~i~d~~~~-~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (471) T protein:vir:10 147 KEVIPIYSKSLD-KKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNS 225 (471) T ss_pred cceEEEEcCCCC-CceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCccccccccccccccccccccccccccc Confidence 998864433322 23444444432 2345778898888888765433 2333455 Q ss_pred CCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeecc Q lcl|NC_020844. 207 GRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGP 286 (455) Q Consensus 207 g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~ 286 (455) .+|+||.||||+|.++....+.+....+|+|.++ ...|++.+.+.+.++|+++++|++.+...++..+.. . T Consensus 226 ~~~~~g~iPvv~~~n~~~~~sd~e~v~~liDa~d------~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~---~ 296 (471) T protein:vir:10 226 FKHDFGLVPFIPFKNNEIETNDLKPIKDLVDVYD------KVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFLEDLK---R 296 (471) T ss_pred ccCCCCceeEEEeccCCCCCCchHHHHHHHHHHH------HHHHHHHHHHHHhhCceeeeecCCccccchhHHHhh---c Confidence 6899999999999877665555555555555444 456888888899999999999988776555443322 2 Q ss_pred ceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhcc--CCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 287 DTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLT--AQSGNLTRISAAQAASKGNSAVQQWAAG 364 (455) Q Consensus 287 ~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~--~~~~~~sa~a~~~~~~~~~s~L~~~a~~ 364 (455) ..++.++.+..+.+++++|++.+.+ .+.++..++++++.|+.++..|.. ...+|.||+|++.....+......+... T Consensus 297 ~~~i~~~~~~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~tp~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~ 375 (471) T protein:vir:10 297 YKMIKMDNDGMGDQSGVTTIAIDIP-TEARNLILERTKKQIFISGQGVNPETDKLGNSSGVALKFLYSLLELKAGNMETQ 375 (471) T ss_pred CCeEEecCCCCccCccceEEeecCC-hHHHHHHHHHHHHHHHHHhCCcCCCcccccCccHHHHHHHHHHHHHHHHHHHHH Confidence 4467777776677889999997764 588999999999999999877633 2347899999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHH Q lcl|NC_020844. 365 LKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESER 444 (455) Q Consensus 365 ~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~r 444 (455) |..+|++++++++.++|..+ ...+++...........+.++++.++ .|+||.||+++.| |...|+++|++| T Consensus 376 ~~~~l~~~~~li~~~~~~~d-~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~~------p~v~D~~~E~er 446 (471) T protein:vir:10 376 FRSGYATLVKMILKHLGLSD-KLKIKQTWTRNSINNDTEMAQVVSTL--ATITSRENVAKSN------PIVEDWQDELRL 446 (471) T ss_pred HHHHHHHHHHHHHHHhccCC-CceeEEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhC------CCCCCHHHHHHH Confidence 99999999999999998754 23444443333333456777877775 6999999998765 677789999999 Q ss_pred HHhhcccCCCC Q lcl|NC_020844. 445 LISELPGGIEE 455 (455) Q Consensus 445 i~~e~~~~l~~ 455 (455) |++|.++..+. T Consensus 447 i~~E~~~~~~~ 457 (471) T protein:vir:10 447 QKAEQEGRSEK 457 (471) T ss_pred HHHHHHHHHhc Confidence 99986544332 No 12 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=100.00 E-value=3.8e-48 Score=280.57 Aligned_cols=411 Identities=11% Similarity=0.062 Sum_probs=289.1 Q ss_pred CCCCC------CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQD------PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~------v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g 74 (455) ++.+. +..--..+....+++.++.+.|.|.+.+-.+...+.+... ....+...-+..|+++.+|++.+| T Consensus 29 ~~~~~e~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~-----~~~~~~~~ki~~n~~k~Ivd~~~~ 103 (483) T protein:vir:12 29 TNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGA-----VDPLKPDDRMITNFHANLVDQKVS 103 (483) T ss_pred cCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccc-----ccccccccccccchHHHHHHHHhh Confidence 33221 1111112344557888888889887554222222211111 111111111347999999999999 Q ss_pred hhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechh Q lcl|NC_020844. 75 KPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHS 154 (455) Q Consensus 75 ~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~ 154 (455) ++|++||+++..++.....++++. +++++..+..+++.++++|++|++|+.++.+ +|-++.++|. T Consensus 104 ~l~G~p~~~~~~d~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~y~~v~~d~d~-------------~~~i~~~~p~ 168 (483) T protein:vir:12 104 YIVGKPIAFKHTDDEVVKRIDEVL--GNRFDDKLHSVLTGASNKGIEWLHPYLDEEG-------------EFKLFRVPAE 168 (483) T ss_pred hhcccCceeccCChHHHHHHHHHH--hccHHHHHHHHHHHHhhCCeEEEEEEEcCCC-------------ceEEEEEccc Confidence 999999999866666666666664 3679999999999999999999999876543 3678999999 Q ss_pred hhccceeeccCCeeEEEEEEEEe--eeeEEEEEecCeEEEEEEeCCcE---------EEEeccCCCCCCceeEEEEEecc Q lcl|NC_020844. 155 AILEIQSERQGAGERITYMRMWE--DAETILVLRPDDWERWRKNDAGI---------WEVDEFGRITIGVVPMVPFITGR 223 (455) Q Consensus 155 ~ii~w~~~~~~~~~~l~~v~~~E--~~e~~~v~~~~~~~~~~~~~~g~---------~~~~~~g~~~l~~vP~V~f~~~~ 223 (455) +++....+... .+.+..+++.+ ..+.+.+|+++.+..|...++.. +......+|+||.||||+|+++. T Consensus 169 ~~~~v~d~~~~-~~~~~~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~ 247 (483) T protein:vir:12 169 QGIPIWTDKEH-EELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND 247 (483) T ss_pred ceEEEEcCCCC-CceEEEEEEEEeecceEEEEEecCeEEEEEEeCCeeeecccccccccccccccCCCCccceEEecCCC Confidence 99863333222 22444455543 45678899999888776554331 12234567999999999998765 Q ss_pred cCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccce Q lcl|NC_020844. 224 RKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSW 303 (455) Q Consensus 224 ~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~ 303 (455) ...+.. .++..+..+.....|++.+.+.+.++|+++++|+..++..++..+ +....++.++. ++++ T Consensus 248 ~g~sd~------e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~---~~~~~~~~~~~-----~~~~ 313 (483) T protein:vir:12 248 LEISDI------FMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRL---LRYYGAIKVSD-----NGGV 313 (483) T ss_pred CCCCch------hhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhHHHh---hhhccccccCC-----CCcc Confidence 544444 344444445555678888999999999999999987765554332 23344666654 4579 Q ss_pred eEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020844. 304 NFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWM 380 (455) Q Consensus 304 ~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~ 380 (455) +|++.+. +.+.++..++.+++.|+.++..+. ...++|.||+|+++....+...+..+...|+.+|++++++++.++ T Consensus 314 ~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~ 392 (483) T protein:vir:12 314 DTIQVEV-PVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF 392 (483) T ss_pred eEEeecC-CHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 9998764 457889999999999999987763 333578999999999999999999999999999999999999999 Q ss_pred CCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 381 DAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 381 g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) |.......+++.+.........+.++++.++ +|+||+||+++.| +...|+++|++||++|.+...+. T Consensus 393 ~~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl--~GiiS~et~~~~~------~~v~d~~~E~~ri~~E~~~~~~~ 459 (483) T protein:vir:12 393 DIKGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENH------PFVEDLQAELERIEQEQMEYNKQ 459 (483) T ss_pred cCCCccceeeEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhC------CCCCCHHHHHHHHHHHHHHHHhh Confidence 9876555566654444444456778888776 6999999998866 55668999999999986533322 No 13 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=100.00 E-value=5.5e-48 Score=279.67 Aligned_cols=411 Identities=11% Similarity=0.060 Sum_probs=287.5 Q ss_pred CCCCC------CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQD------PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~------v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g 74 (455) |+.+. +..--..+....+++.++.+.|.|.+.+-.....+...... ..++ ...-+..|+++.+|++++| T Consensus 18 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~---~~~~--~~~ri~~n~~~~ivd~~~~ 92 (472) T protein:vir:93 18 TNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAV---DPLK--PDDRMITNFHANLVDQKVS 92 (472) T ss_pred ecCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhccccc---cccc--cccccccchHHHHHHHHhh Confidence 33211 11111233455678888888888875542211111111111 1111 1111346999999999999 Q ss_pred hhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechh Q lcl|NC_020844. 75 KPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHS 154 (455) Q Consensus 75 ~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~ 154 (455) ++|++||+++..++....+++++. +++++..+..+++.++++|++|++|+.+..+ +|-+..++|+ T Consensus 93 ~l~g~~~~~~~~d~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~d~-------------~~~i~~~~p~ 157 (472) T protein:vir:93 93 YIVGKPIAFKHTDDEVVKRIDEVL--GNRFDDKLHSVLTGASNKGIEWLHPYLDEEG-------------EFKLFRVPAE 157 (472) T ss_pred hhcccCeeeccCChHHHHHHHHHH--hccHHHHHHHHHHHHhhcCeEEEEEEECCCC-------------ceEEEEEccc Confidence 999999999876676677777664 4689999999999999999999999876544 3668889999 Q ss_pred hhccceeeccCCeeEEEEEEEEe--eeeEEEEEecCeEEEEEEeCCcE---------EEEeccCCCCCCceeEEEEEecc Q lcl|NC_020844. 155 AILEIQSERQGAGERITYMRMWE--DAETILVLRPDDWERWRKNDAGI---------WEVDEFGRITIGVVPMVPFITGR 223 (455) Q Consensus 155 ~ii~w~~~~~~~~~~l~~v~~~E--~~e~~~v~~~~~~~~~~~~~~g~---------~~~~~~g~~~l~~vP~V~f~~~~ 223 (455) +++....+... .+.+..+++.. ..+.+.+|+++.+..|+...+.. +......+|+|+.||||+|+++. T Consensus 158 ~~~~i~d~~~~-~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~ 236 (472) T protein:vir:93 158 QGIPIWTDKEH-EELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND 236 (472) T ss_pred ceEEEEcCCCC-CceEEEEEEEEeecceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCC Confidence 99874332222 22344444433 44567889998887776554332 12234568999999999998765 Q ss_pred cCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccce Q lcl|NC_020844. 224 RKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSW 303 (455) Q Consensus 224 ~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~ 303 (455) ...+.. .++..+..+..+..|++...+.+.++|+++++|++..+..++..+ ++...++.++. ++++ T Consensus 237 ~g~s~~------e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~---~~~~~~~~~~~-----~~~~ 302 (472) T protein:vir:93 237 LEISDI------FMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRL---LRYYGAIKVSD-----NGGV 302 (472) T ss_pred CCCCch------hhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccchhhHHH---HhhccccccCC-----CCcc Confidence 443433 444444445555778999999999999999999987665544332 23345666654 4579 Q ss_pred eEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020844. 304 NFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWM 380 (455) Q Consensus 304 ~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~ 380 (455) +|++.+. +.+.++..++.+++.|+.++..+. ...++|.||+|+++....+...+..+...|+.+|++++++++.++ T Consensus 303 ~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~ 381 (472) T protein:vir:93 303 DTIQVEV-PVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF 381 (472) T ss_pred eeEeecC-CHHHHHHHHHHHHHHHHHHhCCCCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 9998664 457889999999999999987763 334578999999999999999999999999999999999999999 Q ss_pred CCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 381 DAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 381 g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) |.......+++.++........+.++++.++ +|+||+||+++.| +...|+++|++||++|.....+. T Consensus 382 ~~~~~~~~i~v~f~~~~p~~~~~~~~~~~k~--~giis~et~l~~l------~~~~d~~~E~~ri~~E~~~~~~~ 448 (472) T protein:vir:93 382 DIKGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENH------PFVEDLQAELERIEQEQMEYNKQ 448 (472) T ss_pred CCCcccceeeEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhC------CCCCCHHHHHHHHHHHHHHHHHh Confidence 9876555566654444443446777887775 6999999998766 55668999999999985432222 No 14 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=100.00 E-value=2.7e-47 Score=275.89 Aligned_cols=412 Identities=10% Similarity=0.048 Sum_probs=284.6 Q ss_pred CCCCC-------CCccCHHH-HHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHH Q lcl|NC_020844. 1 MPEQD-------PTKRSADA-EAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGL 72 (455) Q Consensus 1 m~~~~-------v~~~~p~y-~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~ 72 (455) |++.+ +..--..+ ....++|+++.+.|.|.+.. ++.+. .....+..+..+-+..|+++.+|+++ T Consensus 23 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~------i~~~~--~~~~~~~~~~~~ki~~n~~~~ivd~~ 94 (481) T protein:vir:10 23 VSDLAELLKEENLRNFISRHQTEQVPRLEMLESYYLNRNTD------ILAGE--RRLQKYGDKADHRAVHNYAKYVSRFI 94 (481) T ss_pred eecchhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc------cccCc--cccccccccccceeecchHHHHHHHH Confidence 44332 11111222 23457888888888886331 11111 11112222222234579999999999 Q ss_pred hhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEec Q lcl|NC_020844. 73 ASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVN 152 (455) Q Consensus 73 ~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~ 152 (455) +|++|++||+++..++...+.+.++... ++++.++..+++.++.+|++|++|+.+..+ +|.++.++ T Consensus 95 ~~~l~g~~~~~~~~d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg-------------~~~i~~~~ 160 (481) T protein:vir:10 95 VGYLTGNPITITHQDNQTNDKIIELNDL-NDADEVNSDLALNLSIYGRAYEIVYRDFED-------------RDTFKVLD 160 (481) T ss_pred HhhhccCCceEecCChhHHHHHHHHHHh-cChhHHHHHHHHHHHhcCeEEEEEEeCCCC-------------eEEEEEEc Confidence 9999999999986666655555555433 689999999999999999999999876543 36789999 Q ss_pred hhhhccceeeccCCeeEEEEEEEEe-------eeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccC Q lcl|NC_020844. 153 HSAILEIQSERQGAGERITYMRMWE-------DAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRK 225 (455) Q Consensus 153 ~~~ii~w~~~~~~~~~~l~~v~~~E-------~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~ 225 (455) |.+++.+..+... .+.+..+++.. .+..+.+|+++.+..|+.. ++.|..++..+|+||.||||+|.++... T Consensus 161 p~~~~~v~d~~~~-~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~-~~~~~~~~~~~~~~g~vPvv~~~n~~~g 238 (481) T protein:vir:10 161 PKSTFVVYDQTLD-KKVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIK-GGTYHRVEEVEHYYNDVPIIEYLNDQFK 238 (481) T ss_pred ccceEEEEcCCCC-CceEEEEEEEEEeeCCCceEEEEEEEecCeEEEEEec-CCceeecccccccCCceeEEEeecCCCC Confidence 9999875433332 33444444432 1346678999988877654 4668888999999999999999865442 Q ss_pred CCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCC----CCcccc Q lcl|NC_020844. 226 GKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPN----AEGQHG 301 (455) Q Consensus 226 ~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~----~~~~~~ 301 (455) .+.+.++..+..+..+..|++.+.+++.++|+++++|....+.+.+.. +. .+.++.++.. ..+.++ T Consensus 239 ------~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~--~~--~~~~~~~~~~~~~~~~~~~~ 308 (481) T protein:vir:10 239 ------QGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLDSEDAKA--FR--DANMIHLEPGTNANGSEGKA 308 (481) T ss_pred ------CCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCCccchhh--hh--hccceeccccccccCCCCCc Confidence 333455555555666778999999999999999999976555433321 11 1122222211 223467 Q ss_pred ceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 302 SWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNL 378 (455) Q Consensus 302 ~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~ 378 (455) +++|++.+. +.+.++..++.+++.|+.++..+. ...++|.||+|++.....+.+.+..+...|+.+|++++++++. T Consensus 309 ~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 387 (481) T protein:vir:10 309 EVKYVYKQY-DVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLN 387 (481) T ss_pred ceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 899998776 457788999999999999988763 2235789999999999999999999999999999999999999 Q ss_pred HcCCCCCc----eEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCC Q lcl|NC_020844. 379 WMDAPDTN----PEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIE 454 (455) Q Consensus 379 ~~g~~~~~----~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~ 454 (455) +++..... ..+++.+........++.++++.++ .|+||.||+++.| +...|+++|++||++|.++... T Consensus 388 ~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl--~g~is~et~~~~l------~~i~d~~~E~~ri~~E~~~~~~ 459 (481) T protein:vir:10 388 NVNLTGLKQHNYAELTITFTPNLPKSMMESINAFNAL--SGGVSESTRLSLL------DFIDNPKEELEKMQEEEAQREK 459 (481) T ss_pred HHhccCCCccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC------CCCCCHHHHHHHHHHHHHHHHh Confidence 98764321 2334433222223346677887776 5999999998865 5556799999999998654333 Q ss_pred C Q lcl|NC_020844. 455 E 455 (455) Q Consensus 455 ~ 455 (455) . T Consensus 460 ~ 460 (481) T protein:vir:10 460 Q 460 (481) T ss_pred h Confidence 3 No 15 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=100.00 E-value=6.4e-48 Score=279.32 Aligned_cols=423 Identities=11% Similarity=0.020 Sum_probs=298.6 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGRE 80 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~ 80 (455) |....+..---.+....++.+.+.+.|.|.+.+..+...++.+... ..++.- .| +..|+++.+|++.+||+|++| T Consensus 1 l~~~~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~---~~~~~~-~k-i~~n~~~~Ivd~~~~yl~G~p 75 (451) T protein:vir:10 1 MELEKIRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDEN---PLRNAD-NR-ISHNFHEILVDEKASYMFTYP 75 (451) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccc---cccccc-cc-cccchHHHHHHhhhhheeccc Confidence 5544443322334456778888999999976553332222222211 111110 01 236999999999999999999 Q ss_pred ceecc-CCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhccc Q lcl|NC_020844. 81 VEVTE-GTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILEI 159 (455) Q Consensus 81 p~i~~-~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w 159 (455) |+++. +++....+++.+. .++++.....+++.++++|++|++|+.++...+.... ..++-+..++|++++.. T Consensus 76 ~~~~~~~~~~~~~~~~~~~--~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~-----~~~~~~~~i~p~~~~~v 148 (451) T protein:vir:10 76 VLFDIDNNKELNEKVTDVL--GNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVT-----NQTFKYGVVNTEEIIPI 148 (451) T ss_pred ceeecCCcHHHHHHHHHHh--ccCHHHHHHHHHHHHhhcCeEEEEEeecCCccccccc-----ccceeEEEEcccceEEE Confidence 99863 3445566666543 4789999999999999999999999877654332211 12355788899999863 Q ss_pred eeeccCCeeEEEEEEEEe------------eeeEEEEEecCeEEEEEEeCC---cEEEEeccCCCCCCceeEEEEEeccc Q lcl|NC_020844. 160 QSERQGAGERITYMRMWE------------DAETILVLRPDDWERWRKNDA---GIWEVDEFGRITIGVVPMVPFITGRR 224 (455) Q Consensus 160 ~~~~~~~~~~l~~v~~~E------------~~e~~~v~~~~~~~~~~~~~~---g~~~~~~~g~~~l~~vP~V~f~~~~~ 224 (455) ..+.+. .+.+..+++.. .+..+.+|++..+..|+..+. +.+..++..+|+||+||||+|.++.. T Consensus 149 ydd~~~-~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~ 227 (451) T protein:vir:10 149 YRNGIE-RELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVPFVEFSNNIK 227 (451) T ss_pred EcCCCC-CceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeeeEEEeccCCC Confidence 333332 23444454421 134567889888777765332 34556777899999999999987665 Q ss_pred CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCcccccee Q lcl|NC_020844. 225 KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWN 304 (455) Q Consensus 225 ~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~ 304 (455) ..+.+ .++..+..+.....|++.+.+.+.++|+++++|+..++..++..+ +....++.++.++.+.+++++ T Consensus 228 ~~~d~------e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~---~~~~~~i~~~~~~~~~~~~~~ 298 (451) T protein:vir:10 228 KQSDL------SKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFLKE---LKRYKTIKTETDSEGDSGGLK 298 (451) T ss_pred CCCch------hhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhHHH---HhhCCeEEecCcCCccCCcce Confidence 44444 444444445556779999999999999999999987766555433 334567777777777888999 Q ss_pred EEeeCchhHHHHHHHHHHHHHHHHHhhhhhccC--CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_020844. 305 FVEPATESLKFLAEQIESVSKEIRELGRQPLTA--QSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDA 382 (455) Q Consensus 305 ~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~--~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~ 382 (455) |++.+. +.+..++.++++++.|+.++..|... ..+|.||+|.++....+...+..+...|+.+|++++++++.++|. T Consensus 299 ~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~ 377 (451) T protein:vir:10 299 TMQIEI-PTEARKIILEILKKQIYESGQGLQQDTENFGNASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLGV 377 (451) T ss_pred EEeecC-CHHHHHHHHHHHHHHHHHHhCcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Confidence 999776 45888999999999999999876332 347899999999999999999999999999999999999999997 Q ss_pred CCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 383 PDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 383 ~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) .+ ...+++.++........+.++++.++ .|+||+||++..| |...|+++|++++++|.+...++ T Consensus 378 ~d-~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~~------p~v~d~~~e~~~~~ee~~~~~~~ 441 (451) T protein:vir:10 378 TD-YKKIQQTYTRNMMSNDLEDADIATKS--VGIIPTKIILRHH------PWVDDVEEAEKLYLEEKKIQASK 441 (451) T ss_pred CC-ccceeEEecCCCCCCHHHHHHHHHHH--hccCchHHHHHhC------CCCCCHHHHHHHHHHHHHHHHHH Confidence 53 22333433333333356778888876 5999999998765 77778999999998775543333 No 16 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=100.00 E-value=7.5e-47 Score=273.46 Aligned_cols=411 Identities=10% Similarity=0.037 Sum_probs=284.3 Q ss_pred CCCC-------------------------CCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHH Q lcl|NC_020844. 1 MPEQ-------------------------DPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLR 55 (455) Q Consensus 1 m~~~-------------------------~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~r 55 (455) |.+- .+..-...+....++.++..+.|.|.+.+-.....+.. +.....++.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~---~~~~~~~~~~ 77 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNV---KGEIDPFKPD 77 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccc---cccccccccc Confidence 3321 01111123344567777788888887554221111111 1111122211 Q ss_pred hhccccCChHHHHHHHHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchh Q lcl|NC_020844. 56 KSVAKMTNVFRDVLEGLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRN 135 (455) Q Consensus 56 l~ra~~~n~~~~~v~~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t 135 (455) . -+..|+++.+|++.+|++|++||+++..++.....+.++. +++++..+..+++.++.+|++|++|+.+..+ T Consensus 78 ~--ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~---- 149 (468) T protein:vir:96 78 W--RMYTNYHQNLVDQKVAYAVANPVTYGTEDEKSLKTIQEVL--NHKWDDKLVDILTAASNKGVEWIQPYVDEQG---- 149 (468) T ss_pred c--ccccchHHHHHHHHHhhhccCCceeccCChHHHHHHHHHH--hcCHHHHHHHHHHHHhhcCeEEEEEEEcCCC---- Confidence 1 1247999999999999999999999876666666666664 3688999999999999999999999876544 Q ss_pred hhhHHhccCCceEEEechhhhcc-ceeeccCCeeEEEEEEEE--eeeeEEEEEecCeEEEEEEeCCc------------- Q lcl|NC_020844. 136 REEERQAGVRPYWSQVNHSAILE-IQSERQGAGERITYMRMW--EDAETILVLRPDDWERWRKNDAG------------- 199 (455) Q Consensus 136 ~ad~~~~~~rPy~~~~~~~~ii~-w~~~~~~~~~~l~~v~~~--E~~e~~~v~~~~~~~~~~~~~~g------------- 199 (455) +|.+..++|.+++. |.....+ ..+..+++. +..+.+.+|+++.+..|+..+++ T Consensus 150 ---------~~~i~~~~p~~~~~v~~~~~~~--~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 218 (468) T protein:vir:96 150 ---------EFKTFRVPAEQAIPIWTNKERD--ELKAFIRLYELDGGERVEYWTANDVTFYELKDGQLIPDYYQGEEHVQ 218 (468) T ss_pred ---------ceEEEEEcccceEEEEcCCCCC--ceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCceeecccccccccc Confidence 36688999999886 4322222 233344443 34567888999988888754432 Q ss_pred EEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCc Q lcl|NC_020844. 200 IWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSP 279 (455) Q Consensus 200 ~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~ 279 (455) .+......+|++|+||||+|.++... .+.|.++..+..+.....|++.+.+.+.++|+++++|+..++..++.. T Consensus 219 ~~~~~~~~~~~~~~iPvv~~~n~~~g------~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~ 292 (468) T protein:vir:96 219 AHYYVGNKSMSWNRVPFIPFKNNPQE------VSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGEDLEEFMY 292 (468) T ss_pred cceeeccccccCCcccEEEecCCCCC------CCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccccchhhh Confidence 23445667899999999999876543 333455555555666677888889999999999999998766544432 Q ss_pred cceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHH Q lcl|NC_020844. 280 KPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNS 356 (455) Q Consensus 280 ~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s 356 (455) + +....++.++.+ ++++++|++.+.. .+.++..++.++++|+.++..+. .+.++|.||+|.+.....+.+ T Consensus 293 ~---~~~~~~i~~~~d---~~~~~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~ 365 (468) T protein:vir:96 293 N---LKYYKAINVDGD---GSGGVDTIQIDVP-VQSAKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDL 365 (468) T ss_pred h---hhcCceEEecCC---CCCcceEEeecCC-hHHHHHHHHHHHHHHHHHhCcccccccccccchHHHHHHHHHHHHHH Confidence 2 222446666543 3567999997764 47889999999999999987663 233568999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCC Q lcl|NC_020844. 357 AVQQWAAGLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNF 436 (455) Q Consensus 357 ~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~ 436 (455) .+..+...|+.+|++++++++.++|.+.....+++..+........+.++. +.++|+||+||+++.| +... T Consensus 366 k~~~k~~~~~~~l~~~~~li~~~~g~~~d~~~i~i~f~~~~p~d~~e~a~~---~~~~g~iS~et~i~~l------~~v~ 436 (468) T protein:vir:96 366 KANKLKNKTLTALQELLQYIIDFYKLSIKVQDVEITFNFNVMVNELEQSQI---GVNSQYLSKETVVTNH------PWVD 436 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCCcCHHHHHHH---HHhcCCCchHHHHHhC------CCCC Confidence 999999999999999999999999986544445555444333333444544 4567999999998755 6666 Q ss_pred CHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 437 DPEEESERLISELPGGIEE 455 (455) Q Consensus 437 d~e~e~~ri~~e~~~~l~~ 455 (455) |+++|++||++|.++..+. T Consensus 437 D~~~E~~ri~~E~~~~~~~ 455 (468) T protein:vir:96 437 DPVAEMERIDQEELALPSI 455 (468) T ss_pred CHHHHHHHHHHHHHHHHHH Confidence 8999999999986532222 No 17 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=100.00 E-value=7.1e-47 Score=273.59 Aligned_cols=416 Identities=10% Similarity=0.051 Sum_probs=289.7 Q ss_pred CCC------------CCC--------CccCHH----H--HHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHH Q lcl|NC_020844. 1 MPE------------QDP--------TKRSAD----A--EAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDL 54 (455) Q Consensus 1 m~~------------~~v--------~~~~p~----y--~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~ 54 (455) |++ ..+ ...-.+ + ....++++++.+.|.|.+.+..+...+....++......+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~ 80 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKT 80 (503) T ss_pred CcccccCChhhHHhHHHhhhhhhhhccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccchhcccccccccccccc Confidence 111 000 000000 1 01236788888888887765443322222222222222221 Q ss_pred HhhccccCChHHHHHHHHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcch Q lcl|NC_020844. 55 RKSVAKMTNVFRDVLEGLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFR 134 (455) Q Consensus 55 rl~ra~~~n~~~~~v~~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~ 134 (455) - .| +..|+++.+|++.++|+|++||+++.+.+....+++.+. .++++.....+++.++.+|++|++|+.++.+ T Consensus 81 ~-~r-i~~n~~~~ivd~~~~yl~g~~~~~~~~d~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg--- 153 (503) T protein:vir:59 81 N-NR-TSHAWHKLFVDQKTQYLVGEPVTFTSDNKTLLEYVNELA--DDDFDDILNETVKNMSNKGIEYWHPFVDEEG--- 153 (503) T ss_pred c-ce-eecchHHHHHHHHHhhhhcCCeeeccCcHHHHHHHHHHH--hcCHHHHHHHHHHHHhhCCeEEEEEeecCCC--- Confidence 1 12 237999999999999999999999877777777777664 4789999999999999999999999876643 Q ss_pred hhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEe-------eeeEEEEEecCeEEEEEEeCCcE------- Q lcl|NC_020844. 135 NREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWE-------DAETILVLRPDDWERWRKNDAGI------- 200 (455) Q Consensus 135 t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E-------~~e~~~v~~~~~~~~~~~~~~g~------- 200 (455) +|.++.++|.+++....+... .+.+..+++.. .+..+.+|+++.+..|+..+++. T Consensus 154 ----------~~~i~~~~p~~~~~i~d~~~~-~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~ 222 (503) T protein:vir:59 154 ----------EFDYVIFPAEEMIVVYKDNTR-RDILFALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYG 222 (503) T ss_pred ----------ceEEEEEccceeEEEEeCCCC-CceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCccccccccc Confidence 367999999999874444333 33344444432 23467789999988887654432 Q ss_pred ------EEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccc Q lcl|NC_020844. 201 ------WEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTD 274 (455) Q Consensus 201 ------~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~ 274 (455) +...+..+|+++.||||+|.++....+.. .++..+..+..+..|++.+.+.+.++|+++++|++.+++ T Consensus 223 ~~~~~~~~~~~~~~~~~~~vPiv~~~nn~~~~sd~------~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~ 296 (503) T protein:vir:59 223 ENNPRPHMTKGGQAIGWGRVPIIPFKNNEEMVSDL------KFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENP 296 (503) T ss_pred ccccccceeecceeccCCccceEEecCCCCCCcch------hhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCcccc Confidence 12334568999999999998766544433 344444445556778899999999999999999988776 Q ss_pred cccCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhcc---CCCcchhHHHHHHHH Q lcl|NC_020844. 275 SEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLT---AQSGNLTRISAAQAA 351 (455) Q Consensus 275 ~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~---~~~~~~sa~a~~~~~ 351 (455) .++..+ +....++.+|. +++++|++.+. +.+.++..++.+++.|+.++..+.. ..+++.||+|++... T Consensus 297 ~~~~~~---~~~~~~~~~~~-----~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~ 367 (503) T protein:vir:59 297 KEFTAN---LRYHSVIKVSG-----DGGVDTLRAEI-PVDSAAKELERIQDELYKSAQAVDNSPETIGGGATGPALENLY 367 (503) T ss_pred chhhhh---hhcccceeccC-----CCcceeEeccC-CHHHHHHHHHHHHHHHHHHhcccCCCcccccccccHHHHHHHH Confidence 555432 33455666664 34789998776 4477899999999999999877642 336789999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCC-----CceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHH Q lcl|NC_020844. 352 SKGNSAVQQWAAGLKDALENALYITNLWMDAPD-----TNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEF 426 (455) Q Consensus 352 ~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~-----~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l 426 (455) ..+...+..+...|+.+|++++++++.+++... ....+++.+.........+.++++.+++++|+||+||+++.| T Consensus 368 ~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~et~l~~l 447 (503) T protein:vir:59 368 ALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMSKETAVARN 447 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCchHHHHHhC Confidence 999999999999999999999999998876532 111244443333333356889999999999999999998765 Q ss_pred HhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 427 QRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 427 ~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) +...|+++|++||++|.+...+. T Consensus 448 ------~~v~d~~~E~~ri~~E~~~~~~~ 470 (503) T protein:vir:59 448 ------PFVQDPEEELARIEEEMNQYAEM 470 (503) T ss_pred ------CCCCCHHHHHHHHHHHHHHHHhh Confidence 66668999999999876432221 No 18 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=100.00 E-value=6.2e-47 Score=273.90 Aligned_cols=410 Identities=11% Similarity=0.048 Sum_probs=284.2 Q ss_pred CCCCCCCc---------------------------cCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHH Q lcl|NC_020844. 1 MPEQDPTK---------------------------RSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYD 53 (455) Q Consensus 1 m~~~~v~~---------------------------~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~ 53 (455) |.+ |++ -...+....+++++++..|.|.+.+.... ++.. .....+. T Consensus 1 ~~~--~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~----~~~~-~~~~~~~ 73 (478) T protein:vir:10 1 MIS--INWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAP----PKRD-VNGDYDE 73 (478) T ss_pred Ccc--ccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccc----cccc-ccccccc Confidence 552 322 12233334466666777777754432110 0110 1111122 Q ss_pred HHhhccccCChHHHHHHHHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcc Q lcl|NC_020844. 54 LRKSVAKMTNVFRDVLEGLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQF 133 (455) Q Consensus 54 ~rl~ra~~~n~~~~~v~~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~ 133 (455) .+-..-+..|+++.+|++.+|++|++||+++..++.....+.++. +++++.....+++.++++|++|++|+.+..+ T Consensus 74 ~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~d~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~~~~d~~g-- 149 (478) T protein:vir:10 74 TKPDWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTL--NHKWDDKLVDILTAASNKGIEWVQPYVDEEG-- 149 (478) T ss_pred ccccceeccchHHHHHHHHHhhhccCCeeeecCChHHHHHHHHHH--hcCHHHHHHHHHHHHHhcCeEEEEEEecCCC-- Confidence 221122457999999999999999999999876666666666553 4689999999999999999999999876544 Q ss_pred hhhhhHHhccCCceEEEechhhhcc-ceeeccCCeeEEEEEEEEe--eeeEEEEEecCeEEEEEEeCCc----------- Q lcl|NC_020844. 134 RNREEERQAGVRPYWSQVNHSAILE-IQSERQGAGERITYMRMWE--DAETILVLRPDDWERWRKNDAG----------- 199 (455) Q Consensus 134 ~t~ad~~~~~~rPy~~~~~~~~ii~-w~~~~~~~~~~l~~v~~~E--~~e~~~v~~~~~~~~~~~~~~g----------- 199 (455) +|.++.++|.+++. |.....+ ..+.++++.+ ..+.+.+|+++.+..|+..++. T Consensus 150 -----------~~~~~~~~p~~~~~i~d~~~~~--~~~~~v~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~ 216 (478) T protein:vir:10 150 -----------EFKTFRVPAEQAVPIWTNKERD--ELQAFIRVYELDGAERVEYWTKDDVTYYELKEGQLIPDFYRSDDH 216 (478) T ss_pred -----------eeEEEEEcccceEEEEcCCCCC--ceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCeeeccccccccc Confidence 36688899999886 4432222 2333444433 3456888999888777654322 Q ss_pred --EEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcccccc Q lcl|NC_020844. 200 --IWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEG 277 (455) Q Consensus 200 --~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~ 277 (455) .+......+|++|.||||+|.+++...+ .+.++..+..+.....|++.+.+.+.++|+++++|++.++..++ T Consensus 217 ~~~~~~~~~~~~~~~~vPvv~~~n~~~g~s------d~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~~~ 290 (478) T protein:vir:10 217 IQPHYYQGNKLMSWGRVPFIPFKNNPQEVS------DLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDF 290 (478) T ss_pred cccceecccccccCCccceEEeccCCCCCC------cHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchh Confidence 2334556689999999999987654333 34455555555556778899999999999999999987765544 Q ss_pred CccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHH Q lcl|NC_020844. 278 SPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKG 354 (455) Q Consensus 278 ~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~ 354 (455) ..+. ....++.++.+ ++++++|++.+. +.+.++..++.+++.|+.++..+. ...++|.||+|++.....+ T Consensus 291 ~~~~---~~~~~~~~~~~---~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l 363 (478) T protein:vir:10 291 MHNL---KYYKAISVAGE---SGSGVDTIKVEV-PIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNL 363 (478) T ss_pred hhhh---hhcceEEecCC---CCCcceEEeecC-ChHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHH Confidence 3322 22345555432 356899998765 568889999999999999987663 2234789999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCC Q lcl|NC_020844. 355 NSAVQQWAAGLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSD 434 (455) Q Consensus 355 ~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~ 434 (455) ......+...|+.+|++++++++.+.|.+.....+++.++........+.++++.++ +|+||+||+++.| +. T Consensus 364 ~~k~~~~~~~~~~~l~~~~~li~~~~g~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l------~~ 435 (478) T protein:vir:10 364 DLKANKLKNKTLTALQELLQYIIDFYRLDVKVQDIEITFNFNVMVNELENSQIAMNS--TGLLSKETILSNH------AW 435 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccceEEecCCCCCCHHHHHHHHHHH--hCCCChHHHHHhC------CC Confidence 999999999999999999999999998765444455544333333346677777765 8999999998866 55 Q ss_pred CCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 435 NFDPEEESERLISELPGGIEE 455 (455) Q Consensus 435 ~~d~e~e~~ri~~e~~~~l~~ 455 (455) ..|+++|++||++|.++..+. T Consensus 436 v~D~~~E~~ri~~E~~~~~~~ 456 (478) T protein:vir:10 436 VEDPVAEMERIEQENIELNQQ 456 (478) T ss_pred CCCHHHHHHHHHHHHHHHHhh Confidence 668999999999886542222 No 19 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=100.00 E-value=4.9e-47 Score=274.46 Aligned_cols=413 Identities=11% Similarity=0.048 Sum_probs=286.3 Q ss_pred CCCCC-------------------------CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHH Q lcl|NC_020844. 1 MPEQD-------------------------PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLR 55 (455) Q Consensus 1 m~~~~-------------------------v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~r 55 (455) |.+-+ +..-...+....++++++.+.|.|.+.+..+...+-.+ ...++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~-----~~~~~~~ 75 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVN-----GDYDETK 75 (478) T ss_pred CccccccCCchhhhHHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhcc-----ccccccc Confidence 33221 11112234455677778888888865542211111000 0001111 Q ss_pred hhccccCChHHHHHHHHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchh Q lcl|NC_020844. 56 KSVAKMTNVFRDVLEGLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRN 135 (455) Q Consensus 56 l~ra~~~n~~~~~v~~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t 135 (455) -..-+-.|+++.+|++.+||+|++||+++..++.....+.++. +++++.....+++.++++|++|++|+.++.+ T Consensus 76 ~~~ki~~n~~k~ivd~~~~yl~g~p~~~~~~~~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~---- 149 (478) T protein:vir:10 76 PDWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTL--NHKWDDKLVDILTAASNKGIEWVQPYVDEEG---- 149 (478) T ss_pred ccceeccchHHHHHHHHhhhhcccCceeecCChHHHHHHHHHH--hccHHHHHHHHHHHHhhCCeEEEEEEecCCC---- Confidence 1111247999999999999999999999876666666565553 3689999999999999999999999877644 Q ss_pred hhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEE--eeeeEEEEEecCeEEEEEEeCCc-------------E Q lcl|NC_020844. 136 REEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMW--EDAETILVLRPDDWERWRKNDAG-------------I 200 (455) Q Consensus 136 ~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~--E~~e~~~v~~~~~~~~~~~~~~g-------------~ 200 (455) +|-++.++|.+++....+...+. .+..+++. +..+.+.+|+++.+..|+..++. . T Consensus 150 ---------~~~~~~~~p~~~~~v~d~~~~~~-~~~~ir~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 219 (478) T protein:vir:10 150 ---------EFKTFRVPAEQAVPIWTNKERDE-LQAFIRVYELDGAERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQP 219 (478) T ss_pred ---------ceEEEEEcccceEEEEcCCCCCc-eEEEEEEEeeeCceEEEEEeCCcEEEEEecCCeeecccccccccccc Confidence 36688899999886433332222 23334433 34567888999988888765432 2 Q ss_pred EEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCcc Q lcl|NC_020844. 201 WEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPK 280 (455) Q Consensus 201 ~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~ 280 (455) +......+|++|+||||+|.++....+ .|.++..+..+.....|++.+.+.+.++|+++++|.+.++..++..+ T Consensus 220 ~~~~~~~~~~~g~vPvv~~~n~~~g~s------d~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 293 (478) T protein:vir:10 220 HYYQGNKLMSWGRVPFIPFKNNPQEVS------DLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHN 293 (478) T ss_pred ceecccccccCCcceEEEeccCCCCCC------cHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCcccccchhhh Confidence 334556789999999999987654333 34555555556666788999999999999999999987765554332 Q ss_pred ceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHH Q lcl|NC_020844. 281 PLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSA 357 (455) Q Consensus 281 ~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~ 357 (455) . ....++.++.+ .+++++|++.+. +.+.++..++.+++.|+.++..+. .+.++|.||+|+++....+... T Consensus 294 ~---~~~~~~~~~~~---~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k 366 (478) T protein:vir:10 294 L---KYYKAISVAGE---SGSGVDTIKVEV-PIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLK 366 (478) T ss_pred h---hhCceeEecCC---CCCcceEEeecC-CHHHHHHHHHHHHHHHHHHhCCcCcCccccccchHHHHHHHHHHHHHHH Confidence 2 22345555422 356799998776 457889999999999999988764 2335789999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCC Q lcl|NC_020844. 358 VQQWAAGLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFD 437 (455) Q Consensus 358 L~~~a~~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d 437 (455) +..+...|+.+|++++++++.+.|.+.....+++..+........+.++++.++ +|+||+||+++.+ +...| T Consensus 367 ~~~~~~~~~~~l~~~~~li~~~~~~~~d~~~i~i~f~~~~p~~~~e~~~~~~~~--~g~iS~et~i~~~------~~v~d 438 (478) T protein:vir:10 367 ANKLKNKTLTALQELLQYIIDFYRLDVRVQDIEITFNFNVMVNELENSQIAMNS--TGLLSKETILGNH------SWVQD 438 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCcccccceEEeCCCCCCCHHHHHHHHHHH--hCCCChHHHHHhC------CCCCC Confidence 999999999999999999999999865434455544343333345667776654 7999999998755 56678 Q ss_pred HHHHHHHHHhhcccCCCC Q lcl|NC_020844. 438 PEEESERLISELPGGIEE 455 (455) Q Consensus 438 ~e~e~~ri~~e~~~~l~~ 455 (455) +++|++||++|.+...++ T Consensus 439 ~~~E~~ri~~E~~~~~~~ 456 (478) T protein:vir:10 439 PVAEMERIEQENIELNQQ 456 (478) T ss_pred HHHHHHHHHHHHHHHHHh Confidence 999999999996643322 No 20 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=100.00 E-value=1.2e-46 Score=272.32 Aligned_cols=407 Identities=11% Similarity=0.054 Sum_probs=288.9 Q ss_pred CCCCCCCcc------CHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQDPTKR------SADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~v~~~------~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g 74 (455) ||.+..-+. --.+....++++.+.+.|.|.+.+.. .|. .....-..|+ ..|+++.+|++++| T Consensus 11 ~p~d~~~~~~~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~-----~~~---~~~~~~~~ki----~~n~~~~ivd~~~~ 78 (453) T protein:vir:39 11 FPKDEPITNEVVTKFMEKHRLEVARYEYLKNMYRGIMAIDA-----EPT---KDLWKPDNRL----TVNFTKYIVDTFTG 78 (453) T ss_pred cCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhhccCchhc-----CCC---ccccCcccee----ecchHHHHHHHHhh Confidence 886542111 11234455788999999999765432 111 1111111222 36999999999999 Q ss_pred hhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechh Q lcl|NC_020844. 75 KPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHS 154 (455) Q Consensus 75 ~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~ 154 (455) ++|++||+++..++.....+.++... |+++..+..+++.++.+|++|++|+.+..+ +|-+..++|. T Consensus 79 ~l~g~~~~~~~~d~~~~~~l~~i~~~-N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g-------------~~~i~~~~p~ 144 (453) T protein:vir:39 79 YFNGIPVKKSHSDKETLSKLQEFDNL-NDMEDEESELAKMACIYGRAFELLYQNEET-------------QTNVIYNTPE 144 (453) T ss_pred hhcccCceeccCChHHHHHHHHHHHh-cChhHHHHHHHHHHhhcCeEEEEEEecCCC-------------ceEEEEEccc Confidence 99999999986666666666666544 789999999999999999999999876543 3678899999 Q ss_pred hhccceeeccCCeeEEEEEEEEee---eeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccC Q lcl|NC_020844. 155 AILEIQSERQGAGERITYMRMWED---AETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVF 231 (455) Q Consensus 155 ~ii~w~~~~~~~~~~l~~v~~~E~---~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~ 231 (455) +++.+..+..+ .+.+..+++... ...+.+|+++.+..|+. .++.|.+++..+|++|.||||+|++++.. T Consensus 145 ~~~~v~d~~~~-~~~~~~ir~~~~~~~~~~~~~yt~~~i~~~~~-~~~~~~~~~~~~~~~g~vPvv~~~n~~~g------ 216 (453) T protein:vir:39 145 NMFMVYDDTIK-QEPLFAVRYGYDDDYKLYGEVYTKETTYALNG-TMGFYNMTEQAPNPFDDLPVVEFYFNEER------ 216 (453) T ss_pred ceEEEecCCCC-CeEEEEEEEEEeCCeEEEEEEEeCCeEEEEEe-cCCceeeecccccCCCceeEEEecCCCCC------ Confidence 99876544433 445555666432 34577899998877754 45668888899999999999999865543 Q ss_pred cchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCC-CCccccceeEEeeCc Q lcl|NC_020844. 232 HPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPN-AEGQHGSWNFVEPAT 310 (455) Q Consensus 232 ~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~-~~~~~~~~~~le~~~ 310 (455) .+.|.++..+..+..+..|++..++.+.++|+++++|...++..... +-.+.++.++.+ ..+++++++|++.+. T Consensus 217 ~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~lt~~~ 291 (453) T protein:vir:39 217 MSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEEEDLKN-----IRSNRVINYYGESSEAKNVDVKFLEKPD 291 (453) T ss_pred CcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCCCchhhhh-----hhhcceeeecCCCCCCCCCceeEEeecC Confidence 33344555555566678899999999999999999997654322111 111234444322 234677899999765 Q ss_pred hhHHHHHHHHHHHHHHHHHhhhhhccC--CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCce- Q lcl|NC_020844. 311 ESLKFLAEQIESVSKEIRELGRQPLTA--QSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDTNP- 387 (455) Q Consensus 311 ~~l~~~~~~l~~~~~~m~~~g~~~l~~--~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~~- 387 (455) +.+.++..++.+.+.|+.++..+... ..+|.||+|++.....+......+...|+.++++++++++.+++..+... T Consensus 292 -~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~ 370 (453) T protein:vir:39 292 -SDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVSNKEA 370 (453) T ss_pred -CHHHHHHHHHHHHHHHHHHhCCcccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccc Confidence 45788999999999999998766332 34689999999999999999999999999999999999999876543221 Q ss_pred --EEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 388 --EADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 388 --~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) .+++.+........++.++++.++ +|+||+||+++.| +...|+++|++||++|..+.... T Consensus 371 ~~~i~v~f~~~~p~~~~~~a~~~~kl--~g~is~et~l~~l------~~v~D~~~E~~ri~~E~~~~~~~ 432 (453) T protein:vir:39 371 WKDIEYTFTRNEPKDIKEQAETANIL--MGITSQETALSVI------SVIPDVQAEMEKIKKEEASTAIF 432 (453) T ss_pred cccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC------CCCCCHHHHHHHHHHHHHHHHHH Confidence 223332222222246677777765 7999999999766 55668999999999996654322 No 21 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=100.00 E-value=1.4e-46 Score=271.98 Aligned_cols=407 Identities=12% Similarity=0.062 Sum_probs=282.0 Q ss_pred CCCCCCCccCH---------------H--------HHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhh Q lcl|NC_020844. 1 MPEQDPTKRSA---------------D--------AEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKS 57 (455) Q Consensus 1 m~~~~v~~~~p---------------~--------y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ 57 (455) || ++.+|- + +....++.+++.+.|.|.+.+..+...+.+..+.+ . . .... T Consensus 7 ~~---~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~--~-~-~~~~ 79 (474) T protein:vir:94 7 MP---WDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNID--Y-D-KPDW 79 (474) T ss_pred cc---CCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccc--c-c-cCcc Confidence 55 332222 1 22234566667777777655432211111111110 0 0 1111 Q ss_pred ccccCChHHHHHHHHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhh Q lcl|NC_020844. 58 VAKMTNVFRDVLEGLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNRE 137 (455) Q Consensus 58 ra~~~n~~~~~v~~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~a 137 (455) | +..|+++.+|+.+++++|++||+++..++....+++.+. .++++..+..+++.++++|++|++|+.+..+ T Consensus 80 k-i~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~--~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~------ 150 (474) T protein:vir:94 80 R-ITTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDVL--DTRWDNKLIDILTATSNKGIDWLQVYINENG------ 150 (474) T ss_pred e-eecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHH--hccHHHHHHHHHHHHhhcCceEEEEEecCCC------ Confidence 2 237999999999999999999999877777777777664 3679999999999999999999999876544 Q ss_pred hHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEE--eeeeEEEEEecCeEEEEEEeCCcE---------EEEecc Q lcl|NC_020844. 138 EERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMW--EDAETILVLRPDDWERWRKNDAGI---------WEVDEF 206 (455) Q Consensus 138 d~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~--E~~e~~~v~~~~~~~~~~~~~~g~---------~~~~~~ 206 (455) +|.+..++|.+++....+... ...+..+++. +..+.+.+|+++.+..|+..+++. +..... T Consensus 151 -------~~~i~~~~p~~~~~v~d~~~~-~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~ 222 (474) T protein:vir:94 151 -------EMKLFRVPAEQAIPIWVDKER-EELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHF 222 (474) T ss_pred -------eeEEEEEcccceEEEEcCCCC-CceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccc Confidence 367889999999874433322 2344444443 345568889999988887655432 112234 Q ss_pred CCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeecc Q lcl|NC_020844. 207 GRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGP 286 (455) Q Consensus 207 g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~ 286 (455) .+|++|+||||+|+++....+. |.++..+..+.....|++.+.+.+.++|+++++|++.++..++..+. .. T Consensus 223 ~~~~~g~vPvv~~~nn~~g~sd------~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~---~~ 293 (474) T protein:vir:94 223 SNGNWGRVPFIAFKNNPEEVSD------IWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGL---KY 293 (474) T ss_pred cccCCCccceEEecCCcCCCCc------HHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhh---hc Confidence 6799999999999876554343 34444444455556788888889999999999999877655544332 23 Q ss_pred ceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 287 DTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAA 363 (455) Q Consensus 287 ~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~ 363 (455) ..++.++. +++++|++.+. +.+.++..++.+.+.|+.++..+. .+.++|.||+|.++....+...+..+.. T Consensus 294 ~~~i~~~~-----~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~ 367 (474) T protein:vir:94 294 YKAINVDG-----DGGVETIQVEV-PVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKN 367 (474) T ss_pred cceeeccC-----CCceeEEeecC-CHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHH Confidence 45566553 45799998765 557889999999999999988763 2235789999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHH Q lcl|NC_020844. 364 GLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESE 443 (455) Q Consensus 364 ~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ 443 (455) .|+.+|++++++++.++|.......+++..+........+.++++ .++|+||+||+++.+ +...|+++|++ T Consensus 368 ~~~~~l~~~~~li~~~~~~~~d~~~i~v~f~~~~p~~~~e~a~~~---~~~g~iS~et~l~~l------~~v~D~~~E~e 438 (474) T protein:vir:94 368 KATVAIQELISFIIDFNNLKTDVKDIEISFNFNRMMNDAEQSQII---AQSQYLSRETLVKSS------PLVDDYKAELE 438 (474) T ss_pred HHHHHHHHHHHHHHHHhCCCcccceeeEEeccCcccCHHHHHHHH---HHcCCCCHHHHHHhC------CCCCCHHHHHH Confidence 999999999999999999866444555554333332234445544 457999999998765 56678999999 Q ss_pred HHHhhcccCCCC Q lcl|NC_020844. 444 RLISELPGGIEE 455 (455) Q Consensus 444 ri~~e~~~~l~~ 455 (455) ||++|.++..+. T Consensus 439 ri~~E~~~~~~~ 450 (474) T protein:vir:94 439 RIEQEQMEYNKQ 450 (474) T ss_pred HHHHHHHHHHhh Confidence 999986543222 No 22 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=100.00 E-value=1.4e-46 Score=271.98 Aligned_cols=407 Identities=12% Similarity=0.062 Sum_probs=282.0 Q ss_pred CCCCCCCccCH---------------H--------HHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhh Q lcl|NC_020844. 1 MPEQDPTKRSA---------------D--------AEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKS 57 (455) Q Consensus 1 m~~~~v~~~~p---------------~--------y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ 57 (455) || ++.+|- + +....++.+++.+.|.|.+.+..+...+.+..+.+ . . .... T Consensus 7 ~~---~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~--~-~-~~~~ 79 (474) T protein:vir:97 7 MP---WDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNID--Y-D-KPDW 79 (474) T ss_pred cc---CCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccc--c-c-cCcc Confidence 55 332222 1 22234566667777777655432211111111110 0 0 1111 Q ss_pred ccccCChHHHHHHHHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhh Q lcl|NC_020844. 58 VAKMTNVFRDVLEGLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNRE 137 (455) Q Consensus 58 ra~~~n~~~~~v~~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~a 137 (455) | +..|+++.+|+.+++++|++||+++..++....+++.+. .++++..+..+++.++++|++|++|+.+..+ T Consensus 80 k-i~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~--~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~------ 150 (474) T protein:vir:97 80 R-ITTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDVL--DTRWDNKLIDILTATSNKGIDWLQVYINENG------ 150 (474) T ss_pred e-eecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHH--hccHHHHHHHHHHHHhhcCceEEEEEecCCC------ Confidence 2 237999999999999999999999877777777777664 3679999999999999999999999876544 Q ss_pred hHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEE--eeeeEEEEEecCeEEEEEEeCCcE---------EEEecc Q lcl|NC_020844. 138 EERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMW--EDAETILVLRPDDWERWRKNDAGI---------WEVDEF 206 (455) Q Consensus 138 d~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~--E~~e~~~v~~~~~~~~~~~~~~g~---------~~~~~~ 206 (455) +|.+..++|.+++....+... ...+..+++. +..+.+.+|+++.+..|+..+++. +..... T Consensus 151 -------~~~i~~~~p~~~~~v~d~~~~-~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~ 222 (474) T protein:vir:97 151 -------EMKLFRVPAEQAIPIWVDKER-EELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHF 222 (474) T ss_pred -------eeEEEEEcccceEEEEcCCCC-CceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccc Confidence 367889999999874433322 2344444443 345568889999988887655432 112234 Q ss_pred CCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeecc Q lcl|NC_020844. 207 GRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGP 286 (455) Q Consensus 207 g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~ 286 (455) .+|++|+||||+|+++....+. |.++..+..+.....|++.+.+.+.++|+++++|++.++..++..+. .. T Consensus 223 ~~~~~g~vPvv~~~nn~~g~sd------~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~---~~ 293 (474) T protein:vir:97 223 SNGNWGRVPFIAFKNNPEEVSD------IWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGL---KY 293 (474) T ss_pred cccCCCccceEEecCCcCCCCc------HHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhh---hc Confidence 6799999999999876554343 34444444455556788888889999999999999877655544332 23 Q ss_pred ceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 287 DTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAA 363 (455) Q Consensus 287 ~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~ 363 (455) ..++.++. +++++|++.+. +.+.++..++.+.+.|+.++..+. .+.++|.||+|.++....+...+..+.. T Consensus 294 ~~~i~~~~-----~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~ 367 (474) T protein:vir:97 294 YKAINVDG-----DGGVETIQVEV-PVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKN 367 (474) T ss_pred cceeeccC-----CCceeEEeecC-CHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHH Confidence 45566553 45799998765 557889999999999999988763 2235789999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHH Q lcl|NC_020844. 364 GLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESE 443 (455) Q Consensus 364 ~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ 443 (455) .|+.+|++++++++.++|.......+++..+........+.++++ .++|+||+||+++.+ +...|+++|++ T Consensus 368 ~~~~~l~~~~~li~~~~~~~~d~~~i~v~f~~~~p~~~~e~a~~~---~~~g~iS~et~l~~l------~~v~D~~~E~e 438 (474) T protein:vir:97 368 KATVAIQELISFIIDFNNLKTDVKDIEISFNFNRMMNDAEQSQII---AQSQYLSRETLVKSS------PLVDDYKAELE 438 (474) T ss_pred HHHHHHHHHHHHHHHHhCCCcccceeeEEeccCcccCHHHHHHHH---HHcCCCCHHHHHHhC------CCCCCHHHHHH Confidence 999999999999999999866444555554333332234445544 457999999998765 56678999999 Q ss_pred HHHhhcccCCCC Q lcl|NC_020844. 444 RLISELPGGIEE 455 (455) Q Consensus 444 ri~~e~~~~l~~ 455 (455) ||++|.++..+. T Consensus 439 ri~~E~~~~~~~ 450 (474) T protein:vir:97 439 RIEQEQMEYNKQ 450 (474) T ss_pred HHHHHHHHHHhh Confidence 999986543222 No 23 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=100.00 E-value=3.1e-46 Score=270.05 Aligned_cols=410 Identities=12% Similarity=0.074 Sum_probs=283.8 Q ss_pred CCCCCCCcc----------C----------HHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccc Q lcl|NC_020844. 1 MPEQDPTKR----------S----------ADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAK 60 (455) Q Consensus 1 m~~~~v~~~----------~----------p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~ 60 (455) ||..-+.+. + ..+....++++++.+.|.|.+.+-.+ ..| ..............| + T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r-~~~---~~~~~~~~~~~~~~k-i 81 (474) T protein:vir:95 7 MPWDKPYGEEVVEQLKPQFETQEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQ-MKK---VDVYGNIDYDKPDWR-I 81 (474) T ss_pred cCCCCchhhHHHHhhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhcc-ccc---cccccccccccccce-e Confidence 664321110 0 12344456677777777776544211 011 000000011111112 2 Q ss_pred cCChHHHHHHHHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHH Q lcl|NC_020844. 61 MTNVFRDVLEGLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEER 140 (455) Q Consensus 61 ~~n~~~~~v~~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~ 140 (455) ..|+++.+|++.++++|++||+++..++....+++.+. .++++..+..+++.++++|++|++|+.+..+ T Consensus 82 ~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~--~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~~--------- 150 (474) T protein:vir:95 82 TTNFHQNLVDQKVSYVASKPVTYSCEDESVLKIIHDVL--DTRWDNKLIDILTATSNKGIDWLQVYINENG--------- 150 (474) T ss_pred ccchHHHHHHHHHhhhccCCceeccCchHHHHHHHHHH--hccHHHHHHHHHHHHhhcCcEEEEEEecCCC--------- Confidence 47999999999999999999999877777777776663 3679999999999999999999999876544 Q ss_pred hccCCceEEEechhhhccceeeccCCeeEEEEEEEE--eeeeEEEEEecCeEEEEEEeCCcE---------EEEeccCCC Q lcl|NC_020844. 141 QAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMW--EDAETILVLRPDDWERWRKNDAGI---------WEVDEFGRI 209 (455) Q Consensus 141 ~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~--E~~e~~~v~~~~~~~~~~~~~~g~---------~~~~~~g~~ 209 (455) +|-++.++|.+++....+...+. .+..+++. +..+.+.+|+++.+..|+..+++. +......+| T Consensus 151 ----~~~i~~~~p~~~~~v~d~~~~~~-~~~~i~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (474) T protein:vir:95 151 ----EMKLFRVPAEQAIPIWVDKEREE-LKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNG 225 (474) T ss_pred ----ceEEEEEcccceEEEEcCCCCCc-eEEEEEEEEEcCeeEEEEEeCCeEEEEEEcCCccccccccCccccccccccc Confidence 36688899999987443333222 33344443 344578889999888887655431 112334679 Q ss_pred CCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeecccee Q lcl|NC_020844. 210 TIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTV 289 (455) Q Consensus 210 ~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~ 289 (455) ++|.||||+|.++....+. |.++..+..+.....|++.+.+.+.++|+++++|++.++..++..+. ....+ T Consensus 226 ~~g~iPvv~~~nn~~g~sd------~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~---~~~~~ 296 (474) T protein:vir:95 226 NWGRVPFIAFKNNPEEVSD------IWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMRGL---KYYKA 296 (474) T ss_pred CCCccceEeecCCCCCCCc------HHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhh---hccce Confidence 9999999999876544343 44444444455557788888899999999999999877655443322 23456 Q ss_pred eeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 290 LYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAAGLK 366 (455) Q Consensus 290 ~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~ 366 (455) +.++. +++++|++.+. +.++++..++.+.++|+.++..+. ...+++.||+|.++....+.+....+...|+ T Consensus 297 i~~~~-----~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~ 370 (474) T protein:vir:95 297 INVDG-----DGGVETIQVEV-PVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKAT 370 (474) T ss_pred eeccC-----CCceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 66653 45799998764 568899999999999999988763 2234789999999999999999999999999 Q ss_pred HHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHH Q lcl|NC_020844. 367 DALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLI 446 (455) Q Consensus 367 ~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~ 446 (455) .+|++++++++.++|.......+++.+++.......+.++++. ++|+||++|++..| +...|+++|++||+ T Consensus 371 ~~l~~~~~li~~~~g~~~d~~~i~v~f~~~~p~d~~e~a~~~~---~~g~iS~et~i~~l------~~v~d~~~E~~ri~ 441 (474) T protein:vir:95 371 VAIQELIGFIIDFNNLKMDVKDIEISFNFNRMMNDAEQSQIIA---QSQYLSRETLVKSS------PLVDDYKAELERIE 441 (474) T ss_pred HHHHHHHHHHHHHhCCCcccceeeEEeccCCCcCHHHHHHHHH---hcCCCchHHHHHhC------CCCCCHHHHHHHHH Confidence 9999999999999997655555666655544433455555544 56999999998755 56678999999999 Q ss_pred hhcccCCCC Q lcl|NC_020844. 447 SELPGGIEE 455 (455) Q Consensus 447 ~e~~~~l~~ 455 (455) +|..+..+. T Consensus 442 ~E~~~~~~~ 450 (474) T protein:vir:95 442 QEQMEYNKQ 450 (474) T ss_pred HHHHHHHhc Confidence 986432222 No 24 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=100.00 E-value=2e-46 Score=271.11 Aligned_cols=406 Identities=11% Similarity=0.038 Sum_probs=297.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGRE 80 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~ 80 (455) |..+.+..---.+....++++++++.|.|.+.+. .+.+... ..-.+| +-.|+++.+|+..++++|++| T Consensus 1 l~~~~l~~~i~~~~~~~~r~~~l~~yy~g~~~il-------~~~~~~~-~~~~~k----i~~n~~~~ivd~~~~~l~g~~ 68 (429) T protein:vir:98 1 MTKDLLSELIQKHRSFNLSYSAYKQLYEGDHAIL-------QQKQKEQ-YKPDNR----LVVNFAKYIVDTFNGYFIGVP 68 (429) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-------ccccccc-CCCcce----eecchHHHHHHHHhhhhcccC Confidence 7766554444445667799999999999976542 2222111 111122 247999999999999999999 Q ss_pred ceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhccce Q lcl|NC_020844. 81 VEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILEIQ 160 (455) Q Consensus 81 p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w~ 160 (455) |+++..++..+..++++. +.|+++.++..+++.++++|++|++|+.++.+ +|-++.++|.+++... T Consensus 69 ~~~~~~~~~~~~~l~~~~-~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g-------------~~~~~~~~p~~~~~v~ 134 (429) T protein:vir:98 69 VQTSHENKQVSNYLELLD-GYNDQDDNNAELSKICSIYGHGYELVFNDENA-------------EAGITYLTPLEAFIVY 134 (429) T ss_pred ceeecCChHHHHHHHHHH-hhcCHhHHHHHHHHHHhhcCeEEEEEEecCCC-------------cEEEEEEcccceEEEE Confidence 999877777777777764 34789999999999999999999999866543 3678899999998643 Q ss_pred eeccCCeeEEEEEEEEee---eeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHH Q lcl|NC_020844. 161 SERQGAGERITYMRMWED---AETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKD 237 (455) Q Consensus 161 ~~~~~~~~~l~~v~~~E~---~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~d 237 (455) .+.. ..+.+..+++... ...+.+|+.+.++.|... ++.+.+.+..+|++|.||||+|.++.. +.+.+.+ T Consensus 135 dd~~-~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~vPvv~~~n~~~------g~sd~e~ 206 (429) T protein:vir:98 135 DDSI-RQKPLFAVRYFYNKGGVLEGSYSDASNITYFKDG-EKGIEIGESEPHPFDGVPMIEYVENEE------RQSLLAS 206 (429) T ss_pred eCCC-CCceEEEEEEEEecCceEEEEEEeCceEEEEEec-CCceEecccccccCCccceEEecCCCC------CCCcHHH Confidence 3333 3345555665432 344566787777777544 445777888899999999999976543 4455666 Q ss_pred HHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHH Q lcl|NC_020844. 238 ALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLA 317 (455) Q Consensus 238 la~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~ 317 (455) +..+..+..+..|++.+.+.+.++|+++++|....+.... .+...+++.+|.+ .+++++++|++.+. +.+.++ T Consensus 207 v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~~~~~~~~-----~~~~~~~~~~~~~-~~~~~~~~~l~~~~-~~~~~~ 279 (429) T protein:vir:98 207 VVTLINAFNKAISEKANDVEYFADAYLKILGAELDDETLK-----SLRDTRIINLKDT-DAQQLTVEFLQKPD-ADATQE 279 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCCCcchhh-----hHhhCceeeccCC-CCCCcceeEEeecC-CHHHHH Confidence 6777778888899999999999999999999876542211 1223467777643 35677899999776 568889 Q ss_pred HHHHHHHHHHHHhhhhhcc--CCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCce---EEEEe Q lcl|NC_020844. 318 EQIESVSKEIRELGRQPLT--AQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDTNP---EADVY 392 (455) Q Consensus 318 ~~l~~~~~~m~~~g~~~l~--~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~~---~v~v~ 392 (455) ..++.+.+.|+.++..+.. .+.+|.||+|++.....+......+...|+.+|++++++++.+++...... .+++. T Consensus 280 ~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~v~ 359 (429) T protein:vir:98 280 HLLDRLENLIFRTAMVANISDESFGTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIGPKDWIGIKYK 359 (429) T ss_pred HHHHHHHHHHHHHhCccccCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEE Confidence 9999999999999876632 234689999999999999999999999999999999999999987653221 23333 Q ss_pred cccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 393 TDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 393 ~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) +.........+.++++.++ +|+||+||+++.| +...|+++|++||++|.+..++. T Consensus 360 f~~~~p~~~~~~a~~~~kl--~g~is~et~~~~l------~~v~d~~~E~~ri~~E~~~~~~~ 414 (429) T protein:vir:98 360 FTRNLPANLLEESQIAGNL--AGIVSEETQVGVL------SIVENPQKEIERKNSDKSTLISR 414 (429) T ss_pred eCCCCCcCHHHHHHHHHHH--hccCchHHHHHhC------CCCCCHHHHHHHHHHHHHHHHHH Confidence 2222222346778888875 7999999998766 55668999999999996643333 No 25 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=100.00 E-value=5.2e-46 Score=268.86 Aligned_cols=415 Identities=11% Similarity=0.027 Sum_probs=284.3 Q ss_pred CCCCCCCcc-CH----HH-----HHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHH Q lcl|NC_020844. 1 MPEQDPTKR-SA----DA-----EAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLE 70 (455) Q Consensus 1 m~~~~v~~~-~p----~y-----~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~ 70 (455) |+....... .+ .+ ....++++++.+.|.|.+.+ ++.+......++... | +..|+++.+|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i--------~~~~~~~~~~~~~~~-k-i~~n~~k~Iv~ 100 (511) T protein:vir:96 31 YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKN--------LVELTRRKEEYMADN-R-VAHDYASYISD 100 (511) T ss_pred cchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCcc--------ccccCcCcccccCcc-e-eecchHHHHHH Confidence 554332211 11 11 23457899999999997553 111111112222111 2 34799999999 Q ss_pred HHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEE Q lcl|NC_020844. 71 GLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQ 150 (455) Q Consensus 71 ~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~ 150 (455) ..+|++|++||+++..++.....+.++... ++++.+...+++.++++|++|++|+.++.+. |-++. T Consensus 101 ~~~~yl~g~p~~~~~~~~~~~~~l~~~~~~-n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~-------------~~i~~ 166 (511) T protein:vir:96 101 FINGYFLGNPIQYQDDDKDVLEAIEAFNDL-NDVESHNRSLGLDLSIYGKAYELMIRNQDDE-------------TRLYK 166 (511) T ss_pred HHHhhhccCCceeecCchHHHHHHHHHHhh-cCHHHHHHHHHHHHHhcCeeEEEEEeCCCCc-------------eEEEE Confidence 999999999999987777777777777654 6899999999999999999999998765443 66888 Q ss_pred echhhhccceeeccCCeeEEEEEEEEee----------eeEEEEEecCeEEEEEEeCCcE----EEEeccCCCCCCceeE Q lcl|NC_020844. 151 VNHSAILEIQSERQGAGERITYMRMWED----------AETILVLRPDDWERWRKNDAGI----WEVDEFGRITIGVVPM 216 (455) Q Consensus 151 ~~~~~ii~w~~~~~~~~~~l~~v~~~E~----------~e~~~v~~~~~~~~~~~~~~g~----~~~~~~g~~~l~~vP~ 216 (455) ++|++++....+... ...+..+++... +..+.+|+++.+..|...+++. +......+|+++.||| T Consensus 167 ~~p~~~~~vydd~~~-~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPv 245 (511) T protein:vir:96 167 SDAMSTFVIYDNTIE-RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPI 245 (511) T ss_pred EccceeEEEEcCCCC-CceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCCceee Confidence 999999874433332 345555665431 2346689999888776654432 2334567899999999 Q ss_pred EEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCc---cceeeccceeeecc Q lcl|NC_020844. 217 VPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSP---KPLDTGPDTVLYAP 293 (455) Q Consensus 217 V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~---~~~~~g~~~~~~lp 293 (455) |+|.++....+... ++..+..+.....|++.+.+++.++|+++++|....+..++.. +.+..........+ T Consensus 246 v~~~nn~~g~gd~e------~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (511) T protein:vir:96 246 TEFSNNERRKGDYE------KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADS 319 (511) T ss_pred EEecCCCCCCCchh------hhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchhhcccccccceeccccccccc Confidence 99987655444444 4444444555678899999999999999999977555433321 11111111111111 Q ss_pred -CCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 294 -PNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDAL 369 (455) Q Consensus 294 -~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~ 369 (455) ....+++++++|++.+. +.+.++..++++.+.|+.++..|. ...++|.||+|+++....+......+...|+.+| T Consensus 320 ~~~~~~~~~~~~~l~~~~-~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l 398 (511) T protein:vir:96 320 EGRETEGSVDGGYIYKQY-DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGL 398 (511) T ss_pred ccccCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11234577899998655 457789999999999999988763 2335789999999999999999999999999999 Q ss_pred HHHHHHHHHHcCCCC---Cc---eEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHH Q lcl|NC_020844. 370 ENALYITNLWMDAPD---TN---PEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESE 443 (455) Q Consensus 370 ~~~l~~~a~~~g~~~---~~---~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ 443 (455) ++++++++.+++... .. ..+++..+.......++.++++.++ .|+||+||+++.| +...|+++|++ T Consensus 399 ~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~G~iS~et~l~~l------~~v~D~~~E~~ 470 (511) T protein:vir:96 399 RRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLF------SFFQDPELEVK 470 (511) T ss_pred HHHHHHHHHHHHhhcCcccccccccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhC------CCCCCHHHHHH Confidence 999999998865421 11 1233332222233346678887775 6999999998765 56668999999 Q ss_pred HHHhhcccCCCC Q lcl|NC_020844. 444 RLISELPGGIEE 455 (455) Q Consensus 444 ri~~e~~~~l~~ 455 (455) ||++|....++. T Consensus 471 ri~~E~~~~~~~ 482 (511) T protein:vir:96 471 KIEEDEKESIKK 482 (511) T ss_pred HHHHHHHHHHHH Confidence 999996543332 No 26 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=100.00 E-value=4.6e-46 Score=269.13 Aligned_cols=407 Identities=11% Similarity=0.017 Sum_probs=292.5 Q ss_pred CC------CCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MP------EQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~------~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g 74 (455) +| ...+..--..+....++++++.+.|.|.+.+..+ .+... ..-..| +..|+++.+|++.+| T Consensus 11 ~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~-------~~~~~-~~~~~k----i~~n~~~~ivd~~~~ 78 (452) T protein:vir:36 11 FSKDEPITVEVVTKFMEKHKLEVARYEYLKNMYLGIMAIDDE-------PAKDS-WKPDNR----LAVNFTKYIVDTFTG 78 (452) T ss_pred cCCccCCCHHHHHHHHHHHHHHHHHHHHHHHHhccccccccC-------ccccc-cCccce----eecchHHHHHHHHhh Confidence 33 2222222223445567888888999997654321 11111 000112 347999999999999 Q ss_pred hhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechh Q lcl|NC_020844. 75 KPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHS 154 (455) Q Consensus 75 ~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~ 154 (455) ++|++||+++..++..+..+.++.. .|+++..+..+++.++.+|++|++|+.++.+ +|-+..++|. T Consensus 79 ~l~g~~~~~~~~d~~~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g-------------~~~i~~~~p~ 144 (452) T protein:vir:36 79 YFNGIPVKKSHSDKEILTKLQEFDN-LNDMEDEESELAKMACIYGRAFEFLYQDEDT-------------QTNVVYNSPE 144 (452) T ss_pred hhcccCceeecCChhHHHHHHHHHh-hcChhHHHHHHHHHHHhcCeEEEEEEecCCC-------------eeEEEEEccc Confidence 9999999997666666666666653 3789999999999999999999999865533 3678899999 Q ss_pred hhccceeeccCCeeEEEEEEEEe---eeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccC Q lcl|NC_020844. 155 AILEIQSERQGAGERITYMRMWE---DAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVF 231 (455) Q Consensus 155 ~ii~w~~~~~~~~~~l~~v~~~E---~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~ 231 (455) +++.+..+.. ..+.+..+++.. ....+.+|+++.+..|+.. ++.|.+.+..+|++|.||||+|.+++.. T Consensus 145 ~~~~v~d~~~-~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~-~~~~~~~~~~~~~~g~iPvv~~~n~~~g------ 216 (452) T protein:vir:36 145 NMFMVYDDTV-KQEPLFAVRYGVDEDKKLQGEVYTLLETIKISGE-NDEISFGEGTYNPYPDLPVVEFYFNEER------ 216 (452) T ss_pred ceEEEEcCCC-CCceEEEEEEEEecCceEEEEEEecCeEEEEEEc-CCceEEecceeccCCcccEEEecCCCCC------ Confidence 9987544333 234555555543 3446778998887777654 4557778888999999999999766543 Q ss_pred cchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEeeCch Q lcl|NC_020844. 232 HPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATE 311 (455) Q Consensus 232 ~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~ 311 (455) .+.|.++..+..+..+..|++.+.+.+.++|+++++|.........+ +-...++.++.++.+.+++++|++.+. T Consensus 217 ~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~l~~~~- 290 (452) T protein:vir:36 217 MSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVEEEDLKN-----IRSNRVINYYADGEGKNVDVKFLEKPD- 290 (452) T ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcCchhhhh-----hhhcceEEecCCCCccCCcceeEeecC- Confidence 33455555555566678899999999999999999998765432221 223457888888777888999999765 Q ss_pred hHHHHHHHHHHHHHHHHHhhhhhccC--CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCc--- Q lcl|NC_020844. 312 SLKFLAEQIESVSKEIRELGRQPLTA--QSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDTN--- 386 (455) Q Consensus 312 ~l~~~~~~l~~~~~~m~~~g~~~l~~--~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~--- 386 (455) +.+.++..++.+.+.|+.++..+... +.+|.||+|++.....+......+...|+.++++++++++.+++..... T Consensus 291 ~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~ 370 (452) T protein:vir:36 291 SDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVSNKDSW 370 (452) T ss_pred CHHHHHHHHHHHHHHHHHHhCccccCcccccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccc Confidence 45788999999999999998776432 3468999999999999999999999999999999999999987653221 Q ss_pred eEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 387 PEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 387 ~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) ..+++.+........++.++++.++ +|+||.||+++.| +...|+++|++||++|..+.... T Consensus 371 ~~i~i~f~~~~p~d~~~~a~~~~k~--~g~iS~et~~~~~------~~~~d~~~E~~ri~~E~~~~~~~ 431 (452) T protein:vir:36 371 KDIEYTFTRNEPKDIKEQAETANIL--MGITSQETALSVI------SVIPDVQAEMEKIKKEEASTAIF 431 (452) T ss_pred ccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC------CCCCCHHHHHHHHHHHHHHHHHH Confidence 1234443333333346677777765 7999999999766 55668999999999986543221 No 27 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=100.00 E-value=6.4e-46 Score=268.34 Aligned_cols=415 Identities=10% Similarity=0.028 Sum_probs=284.1 Q ss_pred CCCCCCCcc-CH----HH-----HHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHH Q lcl|NC_020844. 1 MPEQDPTKR-SA----DA-----EAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLE 70 (455) Q Consensus 1 m~~~~v~~~-~p----~y-----~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~ 70 (455) |+....+.. .+ .+ ...+++++++.+.|.|.+.+ ++.+......++... | +..|+++.+|+ T Consensus 31 ~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i--------~~~~~~~~~~~~~~~-k-i~~n~~k~Iv~ 100 (511) T protein:vir:10 31 YDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKN--------LVELTRRKEEYMADN-R-VAHDYASYISD 100 (511) T ss_pred CchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCcc--------ccccCcccccccCcc-e-eecchHHHHHH Confidence 553322221 11 11 12358899999999997553 111111112222211 2 33799999999 Q ss_pred HHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEE Q lcl|NC_020844. 71 GLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQ 150 (455) Q Consensus 71 ~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~ 150 (455) .++||+|++||+++..++.....+.++... ++++.+...+++.++++|++|++|+.++.+. |-++. T Consensus 101 ~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~i~G~ay~~vy~dedg~-------------~~i~~ 166 (511) T protein:vir:10 101 FINGYFLGNPIQYQDDDKDVLEAIEAFNDL-NDVESHNRSLGLDLSIYGKAYEIMIRNQDDE-------------TRLYK 166 (511) T ss_pred HHhhhhcccCceeecCchHHHHHHHHHHhh-cCHHHHHHHHHHHHHhcCeeEEEEEeCCCCc-------------eEEEE Confidence 999999999999987777777777777544 6899999999999999999999998765443 66888 Q ss_pred echhhhccceeeccCCeeEEEEEEEEee----------eeEEEEEecCeEEEEEEeCCcE----EEEeccCCCCCCceeE Q lcl|NC_020844. 151 VNHSAILEIQSERQGAGERITYMRMWED----------AETILVLRPDDWERWRKNDAGI----WEVDEFGRITIGVVPM 216 (455) Q Consensus 151 ~~~~~ii~w~~~~~~~~~~l~~v~~~E~----------~e~~~v~~~~~~~~~~~~~~g~----~~~~~~g~~~l~~vP~ 216 (455) ++|.+++....+... .+.+..+++... +..+.+|+++.+..|...+++. +......+|++|.||| T Consensus 167 ~~p~~~~~vydd~~~-~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPv 245 (511) T protein:vir:10 167 SDAMSTFVIYDNTIE-RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPI 245 (511) T ss_pred EccceeEEEEcCCCC-CceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcceeE Confidence 999999875444433 345556665431 3346788998887776655442 2334567899999999 Q ss_pred EEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCc---cceeeccceeeecc Q lcl|NC_020844. 217 VPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSP---KPLDTGPDTVLYAP 293 (455) Q Consensus 217 V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~---~~~~~g~~~~~~lp 293 (455) |+|.++....+......+|+| +.....|++.+.+++.++|+++++|....+..+... ..+..........+ T Consensus 246 v~f~nn~~g~gd~e~v~~liD------a~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (511) T protein:vir:10 246 TEFSNNERRKGDYEKVITLID------LYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADS 319 (511) T ss_pred EEecCCCCCCCchhhhHHHHH------HHHHHHHHHHHHHHHhhCceeeeeccccCCchhhccchhccceeccccccccc Confidence 999876654444444444444 555577888899999999999999976555443321 11111111111111 Q ss_pred -CCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 294 -PNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDAL 369 (455) Q Consensus 294 -~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~ 369 (455) ....+++++++|++.+. +.+.++..++++.+.|+.++..|. ...++|.||+|+++....+......+...|+.+| T Consensus 320 ~~~~~~~~~d~~~l~~~~-~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l 398 (511) T protein:vir:10 320 EGRETEGSVDGGYIYKQY-DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGL 398 (511) T ss_pred ccccCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11234577899998654 557789999999999999987764 3345789999999999999999999999999999 Q ss_pred HHHHHHHHHHcCCCC---C--c-eEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHH Q lcl|NC_020844. 370 ENALYITNLWMDAPD---T--N-PEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESE 443 (455) Q Consensus 370 ~~~l~~~a~~~g~~~---~--~-~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ 443 (455) ++++++++.+++... . + ..+++.+.-......++.++++.++ .|+||+||+++.| +...|+++|++ T Consensus 399 ~~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl--~G~iS~et~~~~l------~~v~d~~~E~~ 470 (511) T protein:vir:10 399 RRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLF------SFFQDPELEVK 470 (511) T ss_pred HHHHHHHHHHHHhhCCcccccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhC------CCCCCHHHHHH Confidence 999999998875421 1 1 1233332222223346778888886 5999999998765 56668999999 Q ss_pred HHHhhcccCCCC Q lcl|NC_020844. 444 RLISELPGGIEE 455 (455) Q Consensus 444 ri~~e~~~~l~~ 455 (455) ||++|.+..++. T Consensus 471 ri~~E~~~~~~~ 482 (511) T protein:vir:10 471 KIEEDEKESIKK 482 (511) T ss_pred HHHHHHHHHHHH Confidence 999996543332 No 28 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=100.00 E-value=1.3e-46 Score=272.06 Aligned_cols=410 Identities=12% Similarity=0.084 Sum_probs=280.3 Q ss_pred CCCCCC--------------------CccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccc Q lcl|NC_020844. 1 MPEQDP--------------------TKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAK 60 (455) Q Consensus 1 m~~~~v--------------------~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~ 60 (455) ||..-+ ..---.+....++.+.+.+.|.|.+.+... +.........+..+-..-+ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~-----~~~~~~~~~~~~~~~~~ki 81 (474) T protein:vir:96 7 MPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQ-----AYKQDLHGNIDYTKPDWRI 81 (474) T ss_pred CCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccc-----cchhhhccccccccccccc Confidence 332110 000112334456777777778887654321 1111100011111111123 Q ss_pred cCChHHHHHHHHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHH Q lcl|NC_020844. 61 MTNVFRDVLEGLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEER 140 (455) Q Consensus 61 ~~n~~~~~v~~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~ 140 (455) ..|+++.+|++.+|++|++||+++..++.....+.++. +++++.....+++.++.+|++|++|+.++.+. T Consensus 82 ~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~~--~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~-------- 151 (474) T protein:vir:96 82 TTNFHQNLVDQKVSYVAGKPVTYAHDDDKVLDVIHQVL--DTRWDNKLIDILTAASNKGIDWLQVYINEDGE-------- 151 (474) T ss_pred ccchHHHHHHhhhhhhcccCceeccCChHHHHHHHHHH--hccHHHHHHHHHHHHhhCCeEEEEeeeCCCCc-------- Confidence 47999999999999999999999866666666666653 46899999999999999999999998765443 Q ss_pred hccCCceEEEechhhhccceeeccCCeeEEEEEEEEe--eeeEEEEEecCeEEEEEEeCCcE---------EEEeccCCC Q lcl|NC_020844. 141 QAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWE--DAETILVLRPDDWERWRKNDAGI---------WEVDEFGRI 209 (455) Q Consensus 141 ~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E--~~e~~~v~~~~~~~~~~~~~~g~---------~~~~~~g~~ 209 (455) |-++.++|++++....+... ...+.++++.. ....+.+|+++.+..|...+++. +......+| T Consensus 152 -----~~i~~~~p~~~~~v~d~~~~-~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (474) T protein:vir:96 152 -----LKLFRVPAEQAIPIWTDKER-EQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTG 225 (474) T ss_pred -----eEEEEEcccceEEEEcCCCC-CceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCccccc Confidence 56888999999864333322 23444455433 34568889999888887655442 233455689 Q ss_pred CCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeecccee Q lcl|NC_020844. 210 TIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTV 289 (455) Q Consensus 210 ~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~ 289 (455) ++|.||||+|.++....+.+....+|+|. .....|++.+.+.+.++|+++++|++.++..++..+ .....+ T Consensus 226 ~~~~vPvv~~~nn~~~~~d~e~v~~liDa------~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~---~~~~~~ 296 (474) T protein:vir:96 226 SWERVPFIAFKNNPEEVSDIWMYKSFVDA------IDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEG---LKYYKA 296 (474) T ss_pred CCCccceEEecCCCCCCCchHHHHHHHHH------HHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhh---hhccce Confidence 99999999998766544544444555554 445678888888999999999999887654444332 233445 Q ss_pred eeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 290 LYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAAGLK 366 (455) Q Consensus 290 ~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~ 366 (455) +.++. +++++|++.+. +.+.++..++++.+.|+.++..+. .+.+++.||+|.++....+......+...|+ T Consensus 297 i~~~~-----~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~ 370 (474) T protein:vir:96 297 INVSS-----DGGVETIQVEV-PVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKAN 370 (474) T ss_pred eeccC-----CCceeEEeccC-CHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66653 45799998765 447889999999999999988763 2335689999999999999999999999999 Q ss_pred HHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHH Q lcl|NC_020844. 367 DALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLI 446 (455) Q Consensus 367 ~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~ 446 (455) .+|++++++++.+.|.......+++.++........+.++. +.++|+||+||+++.| +...|+++|++||+ T Consensus 371 ~~l~~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a~~---~~~~giiS~et~~~~l------p~v~D~~~E~eri~ 441 (474) T protein:vir:96 371 VALQELMQFILDFNKIKLDAKEIEITFNFNVMVNDLEQSQI---GAQSQYLSKETLVRHH------PWVDDPKAELERLD 441 (474) T ss_pred HHHHHHHHHHHHHhCCCcccceeeEEecCCCccCHHHHHHH---HHHcCCCChHHHHHhC------CCCCCHHHHHHHHH Confidence 99999999999999976544444444322222223344443 4457999999998765 66778999999999 Q ss_pred hhcccCCCC Q lcl|NC_020844. 447 SELPGGIEE 455 (455) Q Consensus 447 ~e~~~~l~~ 455 (455) +|..+..+. T Consensus 442 ~E~~~~~~~ 450 (474) T protein:vir:96 442 EEQLELNKQ 450 (474) T ss_pred HHHHHHHhh Confidence 885433222 No 29 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=100.00 E-value=1.3e-46 Score=272.06 Aligned_cols=410 Identities=12% Similarity=0.084 Sum_probs=280.3 Q ss_pred CCCCCC--------------------CccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccc Q lcl|NC_020844. 1 MPEQDP--------------------TKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAK 60 (455) Q Consensus 1 m~~~~v--------------------~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~ 60 (455) ||..-+ ..---.+....++.+.+.+.|.|.+.+... +.........+..+-..-+ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~-----~~~~~~~~~~~~~~~~~ki 81 (474) T protein:vir:95 7 MPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQ-----AYKQDLHGNIDYTKPDWRI 81 (474) T ss_pred CCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccc-----cchhhhccccccccccccc Confidence 332110 000112334456777777778887654321 1111100011111111123 Q ss_pred cCChHHHHHHHHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHH Q lcl|NC_020844. 61 MTNVFRDVLEGLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEER 140 (455) Q Consensus 61 ~~n~~~~~v~~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~ 140 (455) ..|+++.+|++.+|++|++||+++..++.....+.++. +++++.....+++.++.+|++|++|+.++.+. T Consensus 82 ~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~~--~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~-------- 151 (474) T protein:vir:95 82 TTNFHQNLVDQKVSYVAGKPVTYAHDDDKVLDVIHQVL--DTRWDNKLIDILTAASNKGIDWLQVYINEDGE-------- 151 (474) T ss_pred ccchHHHHHHhhhhhhcccCceeccCChHHHHHHHHHH--hccHHHHHHHHHHHHhhCCeEEEEeeeCCCCc-------- Confidence 47999999999999999999999866666666666653 46899999999999999999999998765443 Q ss_pred hccCCceEEEechhhhccceeeccCCeeEEEEEEEEe--eeeEEEEEecCeEEEEEEeCCcE---------EEEeccCCC Q lcl|NC_020844. 141 QAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWE--DAETILVLRPDDWERWRKNDAGI---------WEVDEFGRI 209 (455) Q Consensus 141 ~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E--~~e~~~v~~~~~~~~~~~~~~g~---------~~~~~~g~~ 209 (455) |-++.++|++++....+... ...+.++++.. ....+.+|+++.+..|...+++. +......+| T Consensus 152 -----~~i~~~~p~~~~~v~d~~~~-~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (474) T protein:vir:95 152 -----LKLFRVPAEQAIPIWTDKER-EQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTG 225 (474) T ss_pred -----eEEEEEcccceEEEEcCCCC-CceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCccccc Confidence 56888999999864333322 23444455433 34568889999888887655442 233455689 Q ss_pred CCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeecccee Q lcl|NC_020844. 210 TIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTV 289 (455) Q Consensus 210 ~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~ 289 (455) ++|.||||+|.++....+.+....+|+|. .....|++.+.+.+.++|+++++|++.++..++..+ .....+ T Consensus 226 ~~~~vPvv~~~nn~~~~~d~e~v~~liDa------~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~---~~~~~~ 296 (474) T protein:vir:95 226 SWERVPFIAFKNNPEEVSDIWMYKSFVDA------IDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEG---LKYYKA 296 (474) T ss_pred CCCccceEEecCCCCCCCchHHHHHHHHH------HHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhh---hhccce Confidence 99999999998766544544444555554 445678888888999999999999887654444332 233445 Q ss_pred eeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 290 LYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAAGLK 366 (455) Q Consensus 290 ~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~ 366 (455) +.++. +++++|++.+. +.+.++..++++.+.|+.++..+. .+.+++.||+|.++....+......+...|+ T Consensus 297 i~~~~-----~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~ 370 (474) T protein:vir:95 297 INVSS-----DGGVETIQVEV-PVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKAN 370 (474) T ss_pred eeccC-----CCceeEEeccC-CHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66653 45799998765 447889999999999999988763 2335689999999999999999999999999 Q ss_pred HHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHH Q lcl|NC_020844. 367 DALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLI 446 (455) Q Consensus 367 ~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~ 446 (455) .+|++++++++.+.|.......+++.++........+.++. +.++|+||+||+++.| +...|+++|++||+ T Consensus 371 ~~l~~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a~~---~~~~giiS~et~~~~l------p~v~D~~~E~eri~ 441 (474) T protein:vir:95 371 VALQELMQFILDFNKIKLDAKEIEITFNFNVMVNDLEQSQI---GAQSQYLSKETLVRHH------PWVDDPKAELERLD 441 (474) T ss_pred HHHHHHHHHHHHHhCCCcccceeeEEecCCCccCHHHHHHH---HHHcCCCChHHHHHhC------CCCCCHHHHHHHHH Confidence 99999999999999976544444444322222223344443 4457999999998765 66778999999999 Q ss_pred hhcccCCCC Q lcl|NC_020844. 447 SELPGGIEE 455 (455) Q Consensus 447 ~e~~~~l~~ 455 (455) +|..+..+. T Consensus 442 ~E~~~~~~~ 450 (474) T protein:vir:95 442 EEQLELNKQ 450 (474) T ss_pred HHHHHHHhh Confidence 885433222 No 30 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=100.00 E-value=4.9e-46 Score=268.97 Aligned_cols=415 Identities=10% Similarity=0.039 Sum_probs=285.5 Q ss_pred CCCCCCCcc-CH----HH-----HHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHH Q lcl|NC_020844. 1 MPEQDPTKR-SA----DA-----EAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLE 70 (455) Q Consensus 1 m~~~~v~~~-~p----~y-----~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~ 70 (455) |+....... .+ .+ ....|+++++.+.|.|.+.+- .+.... ...++.. +-+..|+++.+|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il-------~~~~~~-~~~~~~~--~ki~~n~~k~Iv~ 100 (511) T protein:vir:93 31 YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNL-------VELTRR-KEEYMAD--NRVAHDYASYISD 100 (511) T ss_pred ccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccc-------cccCcC-cccccCc--ceeecchHHHHHH Confidence 554332211 11 11 234678999999999976531 111111 1111111 1134799999999 Q ss_pred HHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEE Q lcl|NC_020844. 71 GLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQ 150 (455) Q Consensus 71 ~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~ 150 (455) .++|++|++||+++..++.....+.++... ++++.+...+++.++.+|++|++|+.++.+ +|-++. T Consensus 101 ~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G~ay~~vy~de~~-------------~~~i~~ 166 (511) T protein:vir:93 101 FINGYFLGNPIQYQDDDKDVLEVIEAFNDL-NDVESHNRSLGLDLSIYGKAYELMIRNQDD-------------ETRLYK 166 (511) T ss_pred HHhhhhcccCeeeccCChHHHHHHHHHHhh-cCHhHHHHHHHHHHHhcCeeEEEEEeCCCC-------------ceEEEE Confidence 999999999999987777777777777544 689999999999999999999999876544 367889 Q ss_pred echhhhccceeeccCCeeEEEEEEEEe----------eeeEEEEEecCeEEEEEEeCCcE----EEEeccCCCCCCceeE Q lcl|NC_020844. 151 VNHSAILEIQSERQGAGERITYMRMWE----------DAETILVLRPDDWERWRKNDAGI----WEVDEFGRITIGVVPM 216 (455) Q Consensus 151 ~~~~~ii~w~~~~~~~~~~l~~v~~~E----------~~e~~~v~~~~~~~~~~~~~~g~----~~~~~~g~~~l~~vP~ 216 (455) ++|++++....+... .+.+..+++.. .+..+.+|+++.+..|+..+++. +......+|++|.||| T Consensus 167 ~~p~~~~~vydd~~~-~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv 245 (511) T protein:vir:93 167 SDAMSTFVIYDNTIE-RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPI 245 (511) T ss_pred EccceeEEEEcCCCC-CceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccCCCccce Confidence 999999864433332 33555565542 13456789999888887655432 2233456899999999 Q ss_pred EEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCc---cceeeccceeee-c Q lcl|NC_020844. 217 VPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSP---KPLDTGPDTVLY-A 292 (455) Q Consensus 217 V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~---~~~~~g~~~~~~-l 292 (455) |+|.++....+.. .++..+..+.....|++.+.+++.++|+++++|....+..+... ..+......... . T Consensus 246 v~~~nn~~g~gd~------e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (511) T protein:vir:93 246 TEFSNNERRKGDY------EKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADS 319 (511) T ss_pred EEecCCCCCCCch------hhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchhhcccccccceeccccccccc Confidence 9998765544444 44444444555678999999999999999999977665443321 111111111111 1 Q ss_pred cCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 293 PPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDAL 369 (455) Q Consensus 293 p~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~ 369 (455) ......++++++|++.+. +.+.++..++++.+.|+.++..|. ...++|.||+|+++....+......+...|+.+| T Consensus 320 ~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l 398 (511) T protein:vir:93 320 EGRETEGSVDGGYIYKQY-DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGL 398 (511) T ss_pred ccccCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111234678899998664 457789999999999999988773 2345789999999999999999999999999999 Q ss_pred HHHHHHHHHHcCCCCC-----c-eEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHH Q lcl|NC_020844. 370 ENALYITNLWMDAPDT-----N-PEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESE 443 (455) Q Consensus 370 ~~~l~~~a~~~g~~~~-----~-~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ 443 (455) ++++++++.+++.... + ..+++..+.......++.++++.++ .|+||.||+++.| +...|+++|++ T Consensus 399 ~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~l------~~v~d~~~E~~ 470 (511) T protein:vir:93 399 RRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLF------SFFQDPELEVK 470 (511) T ss_pred HHHHHHHHHHHHhccCcccccccccceEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhC------CCCCCHHHHHH Confidence 9999999887654221 1 1233333223333356778888876 6999999998765 66678999999 Q ss_pred HHHhhcccCCCC Q lcl|NC_020844. 444 RLISELPGGIEE 455 (455) Q Consensus 444 ri~~e~~~~l~~ 455 (455) ||++|....++. T Consensus 471 ri~~E~~~~~~~ 482 (511) T protein:vir:93 471 KIEEDEKESIKK 482 (511) T ss_pred HHHHHHHHHHHH Confidence 999986543332 No 31 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=100.00 E-value=4.6e-46 Score=269.12 Aligned_cols=415 Identities=10% Similarity=0.031 Sum_probs=286.2 Q ss_pred CCCCCCCcc-CH----HH-----HHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHH Q lcl|NC_020844. 1 MPEQDPTKR-SA----DA-----EAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLE 70 (455) Q Consensus 1 m~~~~v~~~-~p----~y-----~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~ 70 (455) |+....... .+ .+ ....|+++++.+.|.|.+.+ ++.+......++.. +-+..|+++.+|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i--------~~~~~~~~~~~~~~--~ki~~n~~k~Iv~ 100 (511) T protein:vir:99 31 YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKN--------LVELTRRKEEYMAD--NRVAHDYASYISD 100 (511) T ss_pred cchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCcc--------ccccCcccccccCc--ceeecchHHHHHH Confidence 554332211 11 12 12357899999999997553 11111111122111 1134799999999 Q ss_pred HHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEE Q lcl|NC_020844. 71 GLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQ 150 (455) Q Consensus 71 ~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~ 150 (455) +++|++|++||+++..++.....+.++... ++++.+...+++.++.+|++|++|+.++.+ +|.++. T Consensus 101 ~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~i~G~a~~~vy~ded~-------------~~~i~~ 166 (511) T protein:vir:99 101 FINGYFLGNPIQYQDDDKDVLEAIEAFNDL-NDVESHNRSLGLDLSIYGKAYELMIRNQDD-------------ETRLYK 166 (511) T ss_pred HHHhhhcccCceeecCchHHHHHHHHHHhh-cCHhHHHHHHHHHHHhcCeeEEEEEeCCCC-------------ceEEEE Confidence 999999999999987777777777777544 689999999999999999999999876544 367899 Q ss_pred echhhhccceeeccCCeeEEEEEEEEe----------eeeEEEEEecCeEEEEEEeCCcEE----EEeccCCCCCCceeE Q lcl|NC_020844. 151 VNHSAILEIQSERQGAGERITYMRMWE----------DAETILVLRPDDWERWRKNDAGIW----EVDEFGRITIGVVPM 216 (455) Q Consensus 151 ~~~~~ii~w~~~~~~~~~~l~~v~~~E----------~~e~~~v~~~~~~~~~~~~~~g~~----~~~~~g~~~l~~vP~ 216 (455) ++|++++....+... .+.+..+++.. .+..+.+|+++.++.|+..+.+.+ ......+|++|.||| T Consensus 167 ~~p~~~~~vyd~~~~-~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv 245 (511) T protein:vir:99 167 SDAMSTFVIYDNTIE-RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPI 245 (511) T ss_pred EccceeEEEEcCCCC-CceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCccce Confidence 999999875444333 33455555432 123567899998888877655432 234567899999999 Q ss_pred EEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCc---cceeeccceeeecc Q lcl|NC_020844. 217 VPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSP---KPLDTGPDTVLYAP 293 (455) Q Consensus 217 V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~---~~~~~g~~~~~~lp 293 (455) |+|.++....+ .|.++..+..+.....|++.+.+++.++|+++++|....+..+... ..+..........+ T Consensus 246 v~~~nn~~g~s------d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (511) T protein:vir:99 246 TEFSNNERRKG------DYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADS 319 (511) T ss_pred EEecCCCCCCC------chhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCchhhcccccccceeccccccccc Confidence 99987654333 3444555555666678999999999999999999976555433321 11111111111111 Q ss_pred -CCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 294 -PNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDAL 369 (455) Q Consensus 294 -~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~ 369 (455) .....++++++|++.+. +.+.++..++++++.|+.++..|. ...++|.||+|+++....+......+...|+.+| T Consensus 320 ~~~~~~~~~d~~~l~~~~-~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~Sg~Alk~~~~~l~~ka~~k~~~~~~~l 398 (511) T protein:vir:99 320 EGRETEGSVDGGYIYKQY-DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGL 398 (511) T ss_pred ccccCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11234567899999764 457889999999999999988763 3345789999999999999999999999999999 Q ss_pred HHHHHHHHHHcCCCC---C--c-eEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHH Q lcl|NC_020844. 370 ENALYITNLWMDAPD---T--N-PEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESE 443 (455) Q Consensus 370 ~~~l~~~a~~~g~~~---~--~-~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ 443 (455) ++++++++.+++... . + ..+++.+........++.++++.++ .|+||+||+++.| +...|+++|++ T Consensus 399 ~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl--~GiiS~et~l~~l------~~v~D~~~E~~ 470 (511) T protein:vir:99 399 RRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLF------SFFQDPELEVK 470 (511) T ss_pred HHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCCHHHHHHhC------CCCCCHHHHHH Confidence 999999999875421 1 1 1233333333333346778888776 5999999998866 56668999999 Q ss_pred HHHhhcccCCCC Q lcl|NC_020844. 444 RLISELPGGIEE 455 (455) Q Consensus 444 ri~~e~~~~l~~ 455 (455) ||++|..+.+.. T Consensus 471 ri~~E~~~~~~~ 482 (511) T protein:vir:99 471 KIEEDEKESIKK 482 (511) T ss_pred HHHHHHHHHHHH Confidence 999996644433 No 32 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=100.00 E-value=1.3e-45 Score=266.70 Aligned_cols=415 Identities=11% Similarity=0.038 Sum_probs=282.8 Q ss_pred CCCCCCCcc-CH----HH-----HHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHH Q lcl|NC_020844. 1 MPEQDPTKR-SA----DA-----EAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLE 70 (455) Q Consensus 1 m~~~~v~~~-~p----~y-----~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~ 70 (455) |+....... .+ .+ ....++++++.+.|.|.+.+ ++.+......++... | +..|+++.+|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i--------l~~~~~~~~~~~~~~-k-i~~n~~k~Iv~ 100 (511) T protein:vir:96 31 YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKN--------LVELTRRKEEYMADN-R-VAHDYASYISD 100 (511) T ss_pred ccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCcc--------ccccCcccccccCcc-e-eecchHHHHHH Confidence 443222111 11 11 12457888999999997553 111111111222111 2 34799999999 Q ss_pred HHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEE Q lcl|NC_020844. 71 GLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQ 150 (455) Q Consensus 71 ~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~ 150 (455) ..+|++|++||+++..++.....+.++... ++++.+...+++.++++|++|++|+.++.+ +|-++. T Consensus 101 ~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg-------------~~~i~~ 166 (511) T protein:vir:96 101 FINGYFLGNPIQYQDDDKDVLEAIEAFNDL-NDVESHNRSLGLDLSIYGKAYELMIRNQDD-------------ETRLYK 166 (511) T ss_pred HHhhhhcccCceeecCchHHHHHHHHHHhh-cChhHHHHHHHHHHHhcCeeEEEEEeCCCC-------------ceEEEE Confidence 999999999999987777777777777544 689999999999999999999999876544 367889 Q ss_pred echhhhccceeeccCCeeEEEEEEEEe----------eeeEEEEEecCeEEEEEEeCCcEE----EEeccCCCCCCceeE Q lcl|NC_020844. 151 VNHSAILEIQSERQGAGERITYMRMWE----------DAETILVLRPDDWERWRKNDAGIW----EVDEFGRITIGVVPM 216 (455) Q Consensus 151 ~~~~~ii~w~~~~~~~~~~l~~v~~~E----------~~e~~~v~~~~~~~~~~~~~~g~~----~~~~~g~~~l~~vP~ 216 (455) ++|++++....+... ...+..+++.. .+..+.+|+++.+..|+..+++.. ......+|++|.||| T Consensus 167 ~~p~~~~~v~dd~~~-~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv 245 (511) T protein:vir:96 167 SDAMSTFIIYDNTVE-RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPI 245 (511) T ss_pred EcccceEEEEcCCCC-CceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccce Confidence 999999875433332 33455555432 123567899998888876654422 234567899999999 Q ss_pred EEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCc---cceeeccceeeec- Q lcl|NC_020844. 217 VPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSP---KPLDTGPDTVLYA- 292 (455) Q Consensus 217 V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~---~~~~~g~~~~~~l- 292 (455) |+|.++....+... ++..+..+.....|++.+.+++.++|+++++|....+.++... ..+.......+.. T Consensus 246 v~~~n~~~g~gd~e------~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (511) T protein:vir:96 246 TEFSNNERRKGDYE------KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDA 319 (511) T ss_pred EEecCCCCCCCchh------hhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceecc Confidence 99987654444444 4444444555577899999999999999999976555433321 1112222222221 Q ss_pred cCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 293 PPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDAL 369 (455) Q Consensus 293 p~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~ 369 (455) .......+++++|++.+. +.+.++..++++.++|+.++..|. ...++|.||+|+++....+......+...|+.+| T Consensus 320 ~~~~~~~~~~~~~l~~~~-~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l 398 (511) T protein:vir:96 320 EGRETEGSVDGGYIYKQY-DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGL 398 (511) T ss_pred ccccCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 122234577899998664 457789999999999999987663 2335789999999999999999999999999999 Q ss_pred HHHHHHHHHHcCCCCC-----c-eEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHH Q lcl|NC_020844. 370 ENALYITNLWMDAPDT-----N-PEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESE 443 (455) Q Consensus 370 ~~~l~~~a~~~g~~~~-----~-~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ 443 (455) ++++++++.+++.... + ..+++.+.........+.++++.++ .|+||+||+++.| +...|+++|++ T Consensus 399 ~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl--~G~iS~et~l~~l------~~v~d~~~El~ 470 (511) T protein:vir:96 399 RRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLF------SFFQDPELEVK 470 (511) T ss_pred HHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC------CCCCCHHHHHH Confidence 9999999998764211 1 1233332222223346778888877 4999999998765 66668999999 Q ss_pred HHHhhcccCCCC Q lcl|NC_020844. 444 RLISELPGGIEE 455 (455) Q Consensus 444 ri~~e~~~~l~~ 455 (455) ||++|....+.. T Consensus 471 ri~~E~~~~~~~ 482 (511) T protein:vir:96 471 KIEEDEKESIKK 482 (511) T ss_pred HHHHHHHHHHHH Confidence 999996543332 No 33 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=100.00 E-value=1.3e-45 Score=266.70 Aligned_cols=415 Identities=11% Similarity=0.038 Sum_probs=282.8 Q ss_pred CCCCCCCcc-CH----HH-----HHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHH Q lcl|NC_020844. 1 MPEQDPTKR-SA----DA-----EAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLE 70 (455) Q Consensus 1 m~~~~v~~~-~p----~y-----~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~ 70 (455) |+....... .+ .+ ....++++++.+.|.|.+.+ ++.+......++... | +..|+++.+|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i--------l~~~~~~~~~~~~~~-k-i~~n~~k~Iv~ 100 (511) T protein:vir:78 31 YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKN--------LVELTRRKEEYMADN-R-VAHDYASYISD 100 (511) T ss_pred ccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCcc--------ccccCcccccccCcc-e-eecchHHHHHH Confidence 443222111 11 11 12457888999999997553 111111111222111 2 34799999999 Q ss_pred HHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEE Q lcl|NC_020844. 71 GLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQ 150 (455) Q Consensus 71 ~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~ 150 (455) ..+|++|++||+++..++.....+.++... ++++.+...+++.++++|++|++|+.++.+ +|-++. T Consensus 101 ~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg-------------~~~i~~ 166 (511) T protein:vir:78 101 FINGYFLGNPIQYQDDDKDVLEAIEAFNDL-NDVESHNRSLGLDLSIYGKAYELMIRNQDD-------------ETRLYK 166 (511) T ss_pred HHhhhhcccCceeecCchHHHHHHHHHHhh-cChhHHHHHHHHHHHhcCeeEEEEEeCCCC-------------ceEEEE Confidence 999999999999987777777777777544 689999999999999999999999876544 367889 Q ss_pred echhhhccceeeccCCeeEEEEEEEEe----------eeeEEEEEecCeEEEEEEeCCcEE----EEeccCCCCCCceeE Q lcl|NC_020844. 151 VNHSAILEIQSERQGAGERITYMRMWE----------DAETILVLRPDDWERWRKNDAGIW----EVDEFGRITIGVVPM 216 (455) Q Consensus 151 ~~~~~ii~w~~~~~~~~~~l~~v~~~E----------~~e~~~v~~~~~~~~~~~~~~g~~----~~~~~g~~~l~~vP~ 216 (455) ++|++++....+... ...+..+++.. .+..+.+|+++.+..|+..+++.. ......+|++|.||| T Consensus 167 ~~p~~~~~v~dd~~~-~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv 245 (511) T protein:vir:78 167 SDAMSTFIIYDNTVE-RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPI 245 (511) T ss_pred EcccceEEEEcCCCC-CceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccce Confidence 999999875433332 33455555432 123567899998888876654422 234567899999999 Q ss_pred EEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCc---cceeeccceeeec- Q lcl|NC_020844. 217 VPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSP---KPLDTGPDTVLYA- 292 (455) Q Consensus 217 V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~---~~~~~g~~~~~~l- 292 (455) |+|.++....+... ++..+..+.....|++.+.+++.++|+++++|....+.++... ..+.......+.. T Consensus 246 v~~~n~~~g~gd~e------~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (511) T protein:vir:78 246 TEFSNNERRKGDYE------KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDA 319 (511) T ss_pred EEecCCCCCCCchh------hhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceecc Confidence 99987654444444 4444444555577899999999999999999976555433321 1112222222221 Q ss_pred cCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 293 PPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDAL 369 (455) Q Consensus 293 p~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~ 369 (455) .......+++++|++.+. +.+.++..++++.++|+.++..|. ...++|.||+|+++....+......+...|+.+| T Consensus 320 ~~~~~~~~~~~~~l~~~~-~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l 398 (511) T protein:vir:78 320 EGRETEGSVDGGYIYKQY-DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGL 398 (511) T ss_pred ccccCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 122234577899998664 457789999999999999987663 2335789999999999999999999999999999 Q ss_pred HHHHHHHHHHcCCCCC-----c-eEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHH Q lcl|NC_020844. 370 ENALYITNLWMDAPDT-----N-PEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESE 443 (455) Q Consensus 370 ~~~l~~~a~~~g~~~~-----~-~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ 443 (455) ++++++++.+++.... + ..+++.+.........+.++++.++ .|+||+||+++.| +...|+++|++ T Consensus 399 ~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl--~G~iS~et~l~~l------~~v~d~~~El~ 470 (511) T protein:vir:78 399 RRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLF------SFFQDPELEVK 470 (511) T ss_pred HHHHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC------CCCCCHHHHHH Confidence 9999999998764211 1 1233332222223346778888877 4999999998765 66668999999 Q ss_pred HHHhhcccCCCC Q lcl|NC_020844. 444 RLISELPGGIEE 455 (455) Q Consensus 444 ri~~e~~~~l~~ 455 (455) ||++|....+.. T Consensus 471 ri~~E~~~~~~~ 482 (511) T protein:vir:78 471 KIEEDEKESIKK 482 (511) T ss_pred HHHHHHHHHHHH Confidence 999996543332 No 34 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=100.00 E-value=3.8e-46 Score=269.60 Aligned_cols=415 Identities=11% Similarity=0.040 Sum_probs=282.3 Q ss_pred CCCCC---------CCcc-CHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHH Q lcl|NC_020844. 1 MPEQD---------PTKR-SADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLE 70 (455) Q Consensus 1 m~~~~---------v~~~-~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~ 70 (455) |+..+ +..- .-......|+++++.+.|.|.+.+ ++.+......++... | +..|+++.+|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i--------~~~~~~~~~~~~~~~-k-i~~n~~k~Ivd 100 (512) T protein:vir:97 31 YDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKN--------LVELTRRKEEYMADN-R-VAHDYASYISD 100 (512) T ss_pred cCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCcc--------ccccCcccccccCcc-e-eecchHHHHHH Confidence 33211 1110 011123468899999999997553 111111111222111 2 34799999999 Q ss_pred HHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEE Q lcl|NC_020844. 71 GLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQ 150 (455) Q Consensus 71 ~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~ 150 (455) +++||+|++||+++..++.....+.++... ++++.....+++.++++|++|++|+.++.+. |-++. T Consensus 101 ~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~-------------~~i~~ 166 (512) T protein:vir:97 101 FINGYFLGNPIQCQDDDKDVLEAIEAFNDL-NDVESHNRSLGLDLSIYGKAYELMIRNQDDE-------------TRLYK 166 (512) T ss_pred HHhhhhcccCceeccCChHHHHHHHHHHhh-cCHHHHHHHHHHHHHhcCeEEEEEEeCCCCc-------------eEEEE Confidence 999999999999987777777777777544 6899999999999999999999998765443 66889 Q ss_pred echhhhccceeeccCCeeEEEEEEEEe----------eeeEEEEEecCeEEEEEEeCCcE----EEEeccCCCCCCceeE Q lcl|NC_020844. 151 VNHSAILEIQSERQGAGERITYMRMWE----------DAETILVLRPDDWERWRKNDAGI----WEVDEFGRITIGVVPM 216 (455) Q Consensus 151 ~~~~~ii~w~~~~~~~~~~l~~v~~~E----------~~e~~~v~~~~~~~~~~~~~~g~----~~~~~~g~~~l~~vP~ 216 (455) ++|++++....+... .+.+..+++.. .+..+.+|+++.+..|+..+++. +...+..+|++|.||| T Consensus 167 ~~p~~~~~iyd~~~~-~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv 245 (512) T protein:vir:97 167 SDAMSTFVIYDNTIE-RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPI 245 (512) T ss_pred EcccceEEEEcCCCC-CceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcccce Confidence 999999874433332 34555565542 13456789999887777654432 2334567899999999 Q ss_pred EEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCcc----ceeecccee-ee Q lcl|NC_020844. 217 VPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPK----PLDTGPDTV-LY 291 (455) Q Consensus 217 V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~----~~~~g~~~~-~~ 291 (455) |+|.+++...+... ++..+..+..+..|++.+.+++.++|+++++|....+..++... .+...+... .. T Consensus 246 v~~~nn~~~~gd~e------~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (512) T protein:vir:97 246 TEFSNNERRKGDYE------KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYENR 319 (512) T ss_pred EeecCCCCCCCchh------hhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchhhhhhhhcccccccccchhhc Confidence 99987655444444 44444445556779999999999999999999876654443211 111111000 00 Q ss_pred ccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhcc---CCCcchhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 292 APPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLT---AQSGNLTRISAAQAASKGNSAVQQWAAGLKDA 368 (455) Q Consensus 292 lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~---~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a 368 (455) .+....+++++++|++.+. +.+.++..++++.+.|+.++..+.. ..++|.||+|+++....+......+...|+.+ T Consensus 320 ~~~~~~~~~~d~~~l~~~~-~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~ka~~k~~~f~~~ 398 (512) T protein:vir:97 320 DTGIETEGSVDGGYIYKQY-DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKG 398 (512) T ss_pred ccccCCCCCcceEEEeecC-CHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0111234567899998764 5577899999999999999887642 33578999999999999999999999999999 Q ss_pred HHHHHHHHHHHcCCCC---Cce---EEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHH Q lcl|NC_020844. 369 LENALYITNLWMDAPD---TNP---EADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEES 442 (455) Q Consensus 369 ~~~~l~~~a~~~g~~~---~~~---~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~ 442 (455) |++++++++.+++... ... .+++.++.......++.++++.++ .|+||+||+++.| +...|+++|+ T Consensus 399 l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl--~giiS~et~~~~l------~~v~d~~~E~ 470 (512) T protein:vir:97 399 LRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLF------SFFQDPELEV 470 (512) T ss_pred HHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCchHHHHHhC------CCCCCHHHHH Confidence 9999999998865321 111 223322222223346778888776 5999999998765 5666899999 Q ss_pred HHHHhhcccCCCC Q lcl|NC_020844. 443 ERLISELPGGIEE 455 (455) Q Consensus 443 ~ri~~e~~~~l~~ 455 (455) +||++|....+.. T Consensus 471 eri~~E~~~~~~~ 483 (512) T protein:vir:97 471 KKIEEDEKESIKK 483 (512) T ss_pred HHHHHHHHHHHHH Confidence 9999996543322 No 35 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=100.00 E-value=6.9e-46 Score=268.19 Aligned_cols=419 Identities=9% Similarity=0.012 Sum_probs=288.5 Q ss_pred CCCCC----CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhh-hcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhh Q lcl|NC_020844. 1 MPEQD----PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFK-RYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASK 75 (455) Q Consensus 1 m~~~~----v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~-~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~ 75 (455) |--.. |.+.-..+....++++.+++.|.|.+.+..+.. .|-...++...... +...-+..|+++.+|++.+|| T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~--~~~~ki~~n~~k~Iv~~~~~y 78 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLR--SADNRIPSNFYQLLVDQEAGY 78 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccc--cCCcccccchHHHHHHhhhhh Confidence 44222 122223345667889999999999887643321 11111111111111 111123489999999999999 Q ss_pred hhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhh Q lcl|NC_020844. 76 PFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSA 155 (455) Q Consensus 76 lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ 155 (455) +|++||+++..++.....+.++-. ++++..+..+++.++++|++|++++.++.+. +-+..++|.+ T Consensus 79 l~G~p~~~~~~d~~~~~~l~~~~~--~~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~-------------~~~~~~~p~~ 143 (470) T protein:vir:10 79 VASVFPDIDVGKDADNKKIIDVLG--DDRALTLNGLLVDSSNAGRAWLHYWIDEDGN-------------FRYGIIQPDQ 143 (470) T ss_pred eeccceeeecCchHHHHHHHHHHh--hhHHHHHHHHHHHHhhcCeeEEEEEecCCCc-------------eEEEEEcccc Confidence 999999997544444333333322 4677777899999999999999998776543 4578899999 Q ss_pred hccceeeccCCeeEEEEEEEEe--------eeeEEEEEecCeEEEEEEeCCcEEE-------------------EeccCC Q lcl|NC_020844. 156 ILEIQSERQGAGERITYMRMWE--------DAETILVLRPDDWERWRKNDAGIWE-------------------VDEFGR 208 (455) Q Consensus 156 ii~w~~~~~~~~~~l~~v~~~E--------~~e~~~v~~~~~~~~~~~~~~g~~~-------------------~~~~g~ 208 (455) ++....+.+.+ ..+..+++.. .+..+.+|+.+.+..|+....+... ..+..+ T Consensus 144 ~~~v~d~~~~~-~~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 222 (470) T protein:vir:10 144 ITPIYATTLDN-KLLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLK 222 (470) T ss_pred eEEEEcCCCCC-ceEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccc Confidence 88754443332 2344455433 2345778998888888755443211 123347 Q ss_pred CCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccce Q lcl|NC_020844. 209 ITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDT 288 (455) Q Consensus 209 ~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~ 288 (455) |+||+||||+|+++....+......+|+| +.....|++.+.+.+.++|+++++|...++..++..+.. ... T Consensus 223 ~~~g~vPvv~~~nn~~g~sd~e~v~~liD------a~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~~~~---~~~ 293 (470) T protein:vir:10 223 HNFGRVPFIEFSKNKYRLPELNKYKGLID------AYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMNDLR---KYK 293 (470) T ss_pred cCCCeeeEEEeecCCCCCCchhHHHHHHH------HHHHHHHHHHHHHHHhcCcceeeecCCccccchhhhhhh---hcC Confidence 89999999999887654444444444444 444567888899999999999999998776655544332 344 Q ss_pred eeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhccC--CCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 289 VLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLTA--QSGNLTRISAAQAASKGNSAVQQWAAGLK 366 (455) Q Consensus 289 ~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~--~~~~~sa~a~~~~~~~~~s~L~~~a~~~~ 366 (455) ++.++.++.+.+++++|++.+.+ .+..+..+++++++|+.++..+... ..+|.||+|.++....+......+...|+ T Consensus 294 ~i~~~~~~~~~~~~~~~lt~~~~-~~~~~~~~~~L~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~ 372 (470) T protein:vir:10 294 SIKINNTGNGDNSGVDKLQIDIP-VEARDDALKITRKNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFE 372 (470) T ss_pred eEeccCCCCCcCceeEEEeecCC-hHHHHHHHHHHHHHHHHHhCCCCCCccccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 67777777777889999997765 5888999999999999998876432 34689999999999999999999999999 Q ss_pred HHHHHHHHHHHHHcCCCCCc-eEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHH Q lcl|NC_020844. 367 DALENALYITNLWMDAPDTN-PEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERL 445 (455) Q Consensus 367 ~a~~~~l~~~a~~~g~~~~~-~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri 445 (455) .+|++++++++.++|..+.+ ..+++..+........+.++.+.++ +|+||.||+++.| |...|+++|++|| T Consensus 373 ~~l~~~~~~i~~~l~~~~~d~~~i~i~f~~~~p~d~~e~~~~~~~~--~g~iS~et~l~~~------p~v~D~~~E~eri 444 (470) T protein:vir:10 373 HAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKAN------PIVDDWQQELKDL 444 (470) T ss_pred HHHHHHHHHHHHHhcccCcccceeeEEeccCCCCCHHHHHHHHHHH--hccCcHHHHHHhC------CCCCCHHHHHHHH Confidence 99999999999999874322 2344443333333345566666654 7999999998765 6677899999999 Q ss_pred HhhcccCCCC Q lcl|NC_020844. 446 ISELPGGIEE 455 (455) Q Consensus 446 ~~e~~~~l~~ 455 (455) ++|.++.... T Consensus 445 ~~E~~e~~~~ 454 (470) T protein:vir:10 445 AKDKEENDPY 454 (470) T ss_pred HHHHHHHHHh Confidence 9886543322 No 36 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=100.00 E-value=2.9e-45 Score=264.73 Aligned_cols=407 Identities=8% Similarity=0.005 Sum_probs=285.2 Q ss_pred CCC------CCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPE------QDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~------~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g 74 (455) ||. +.+..---.+....++.+++.+.|.|.+.+..... +.+... ..| +..|+++.+|++.+| T Consensus 11 ~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~----~~~~~~----~~k----i~~n~~~~ivd~~~~ 78 (453) T protein:vir:73 11 YSRDEEITDKVVNDFMKKHQEEVERYEYLGNMYKGIMEISSQKA----KDSWKP----DNR----LTNNFAKYIVDTFVG 78 (453) T ss_pred ccccccCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCC----CCccCc----cce----eecchHHHHHHHhhh Confidence 442 22222222445566888889999999877643211 111110 112 236999999999999 Q ss_pred hhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechh Q lcl|NC_020844. 75 KPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHS 154 (455) Q Consensus 75 ~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~ 154 (455) ++|++||+++.+++....++.++. +.|+++.....+++.++.+|++|++|+.++.+ +|-++.++|. T Consensus 79 ~l~g~~~~~~~~d~~~~~~l~~~~-~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~-------------~~~i~~~~p~ 144 (453) T protein:vir:73 79 YFNGIPIKKTHDDKSVLEAMQLFD-NLNDMEDEESELAKIACVYGRAYELMYQNEST-------------ESEVIYCSPL 144 (453) T ss_pred hhcccCceeecCChHHHHHHHHHH-HhcChhHHHHHHHHHHHhcCeEEEEEEeCCCC-------------ceEEEEEccc Confidence 999999999876666666676664 34789999999999999999999999876544 3668889999 Q ss_pred hhccceeeccCCeeEEEEEEEEe---eeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccC Q lcl|NC_020844. 155 AILEIQSERQGAGERITYMRMWE---DAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVF 231 (455) Q Consensus 155 ~ii~w~~~~~~~~~~l~~v~~~E---~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~ 231 (455) +++....+.. +.+.+..+++.. ....+.+|+++.+..|+. .++.|...+..+|++|.||||+|++++...+.. T Consensus 145 ~~~~v~dd~~-~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~-~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~s~~-- 220 (453) T protein:vir:73 145 NVFMVYDDSI-KQKPLFAVYYGFDEEGNLSGTVYTLLETISITG-KAGEVKFGESTYNVYSDLPIVEYNFNEERQSIF-- 220 (453) T ss_pred ceEEEEeCCC-CceeEEEEEEEEecCceEEEEEEeCCeEEEEEe-cCCceEEccceeccCCceeEEEecCCCCCCcch-- Confidence 9986443433 344455555432 234577899887766654 455678888889999999999998765543443 Q ss_pred cchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeecc------CCCCccccceeE Q lcl|NC_020844. 232 HPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAP------PNAEGQHGSWNF 305 (455) Q Consensus 232 ~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp------~~~~~~~~~~~~ 305 (455) .++..+..+..+..|+..+.+.+.++|+++++|.+.++...... ....++.+. .+..+.+++++| T Consensus 221 ----~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~d~~~ 291 (453) T protein:vir:73 221 ----EPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGAEVDEEDAKNI-----KDNRLINFFDKNSNGQGTNAAKVDVKF 291 (453) T ss_pred ----hhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCchhhhcc-----cccccccccccccccccccccCceeEE Confidence 44444444555677888889999999999999987654222111 011111110 112345678999 Q ss_pred EeeCchhHHHHHHHHHHHHHHHHHhhhhhccC--CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCC Q lcl|NC_020844. 306 VEPATESLKFLAEQIESVSKEIRELGRQPLTA--QSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAP 383 (455) Q Consensus 306 le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~--~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~ 383 (455) ++.+.+ .+.++..++.+.+.|+.++..+... ..+|.||+|.+.....+......+...|+.+|++++++++.+++.. T Consensus 292 l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~ 370 (453) T protein:vir:73 292 LDKPDS-DVQTENLLNRLERSIFQFTMAANISDENFGNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNA 370 (453) T ss_pred eeecCC-HHHHHHHHHHHHHHHHHHhCCcccCcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 987654 4778999999999999998776433 2368999999999999999999999999999999999999987653 Q ss_pred CCc---eEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 384 DTN---PEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 384 ~~~---~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) ... ..+++.++........+.++++.++ .|+||.||+++.+ +...|+++|++||++|.++.+.. T Consensus 371 ~~~~~~~~i~v~f~~~~p~~~~~~a~~~~k~--~giis~et~~~~~------~~~~d~~~E~~ri~~E~~~~~~~ 437 (453) T protein:vir:73 371 SNKDAWKDIEYTFTRNEPKDIKEQAETANIL--KGITSEETALSVI------SVIPDVQAEMEKIKKKKLLQLSL 437 (453) T ss_pred CCccccccceEEeCCCCCCCHHHHHHHHHHH--hccCcHHHHHHhC------CCCCCHHHHHHHHHHHHHHHHHH Confidence 321 1233332222223346778888776 4999999998766 56668999999999997765544 No 37 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=100.00 E-value=1.4e-45 Score=266.51 Aligned_cols=410 Identities=13% Similarity=0.089 Sum_probs=282.3 Q ss_pred CCCCC---C---CccCHHH-HHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHh Q lcl|NC_020844. 1 MPEQD---P---TKRSADA-EAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLA 73 (455) Q Consensus 1 m~~~~---v---~~~~p~y-~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~ 73 (455) ||... + ..-...+ ....++++++++.|.|.+.+... ++.+ .... .| +..|+++.+|++++ T Consensus 19 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~-----~~~~--~~~~--~k----i~~n~~~~Ivd~~~ 85 (470) T protein:vir:99 19 FPKGEKLTSNELLGFIAYNETVLKPRYRENMKLYLGKHKILTA-----PEKE--TGAD--NR----IVVNSAKYVVDVYN 85 (470) T ss_pred eCCCCCcCHHHHHHHHHHHHHhhHHHHHHHHHHhccccccccC-----cccc--cCCc--ce----eecchHHHHHHHHh Confidence 66332 1 1100111 23458899999999997654321 1111 1111 12 34799999999999 Q ss_pred hhhhcCCceeccCC-HHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEec Q lcl|NC_020844. 74 SKPFGREVEVTEGT-GPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVN 152 (455) Q Consensus 74 g~lf~k~p~i~~~~-~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~ 152 (455) ||+|++|++++..+ ......+.++. +.++++.+++.+++.++.+|++|++|+.++.+ +|.++.++ T Consensus 86 ~~l~g~p~~~~~~~d~~~~~~l~~~~-~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg-------------~~~i~~~~ 151 (470) T protein:vir:99 86 GYFCGIEPKLALLNDSSKIDEIARWN-RQENFFDTINEISKQCDIFGRSIASIYQGEDA-------------RPHLMYSS 151 (470) T ss_pred hhhccCCeeEeeCCchhHHHHHHHHH-HhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCC-------------eEEEEEEc Confidence 99999999986432 22222222332 34789999999999999999999999866543 37789999 Q ss_pred hhhhccceeeccCCeeEEEEEEEEee------eeEEEEEecCeEEEEEEeCCc-EEEEeccCCCCCCceeEEEEEecccC Q lcl|NC_020844. 153 HSAILEIQSERQGAGERITYMRMWED------AETILVLRPDDWERWRKNDAG-IWEVDEFGRITIGVVPMVPFITGRRK 225 (455) Q Consensus 153 ~~~ii~w~~~~~~~~~~l~~v~~~E~------~e~~~v~~~~~~~~~~~~~~g-~~~~~~~g~~~l~~vP~V~f~~~~~~ 225 (455) |.+++....+.. .+..+..+++... ...+.+|+++.+..|...+.+ .+...+..+|++|.||||+|.++... T Consensus 152 p~~~~~i~d~~~-~~~~~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g 230 (470) T protein:vir:99 152 PNHAFIIYDDTV-QRQPLAFVHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEEDTNAAGYAINPYGLVPAVEFFENEER 230 (470) T ss_pred cceeEEEEcCCC-CcceEEEEEEEEEecCCeeEEEEEEEecCeEEEEEecccccccccccccccCCCccceEeecCCCCC Confidence 999886433322 3445555555431 123556777776666554433 34456778899999999999765543 Q ss_pred CCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccceeE Q lcl|NC_020844. 226 GKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNF 305 (455) Q Consensus 226 ~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~ 305 (455) .+. +.++..+..+..+..|++.+++.+.++|+++++|.....++.++. ...+....++.+|....+.+++++| T Consensus 231 ~sd------~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 303 (470) T protein:vir:99 231 QGI------FDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDDEGNP-KFDFKNNRVLYVSQLDPDTNPQIGF 303 (470) T ss_pred Ccc------hHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccccccch-hhhhhhcceeeecCCCCCCCCcceE Confidence 333 444444444555678999999999999999999988766554432 2345556778887766677889999 Q ss_pred EeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_020844. 306 VEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDA 382 (455) Q Consensus 306 le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~ 382 (455) ++.+. ..+.++..++.+.+.|+.++..+. .+.++|.||+|++.....+......+...|+.+|++++++++.+++. T Consensus 304 l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 382 (470) T protein:vir:99 304 IAKPD-ADQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFN 382 (470) T ss_pred EeecC-ChHHHHHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 98765 557788999999999999988763 22357899999999999999999999999999999999999988765 Q ss_pred CCC----ceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 383 PDT----NPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 383 ~~~----~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) ... ...+++.+.-.......+.++++.++ .|+||+||+++.| + ..|+++|++||++|....++. T Consensus 383 ~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl--~giis~et~l~~l------~-~vd~~~E~eri~~E~~~~~~~ 450 (470) T protein:vir:99 383 NKQDQELWSELDFKFTRNLPEDMASAIDNAKNA--EGIVSKKTQLGMI------P-DIEPDAEMKQIAKEKADAIKQ 450 (470) T ss_pred cCCcccccccceEEeCCCCCcCHHHHHHHHHHH--hccCCHHHHHHhC------C-CCCHHHHHHHHHHHHHHHHHH Confidence 321 11233332222222346677888776 4999999998866 3 448999999999986432221 No 38 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=100.00 E-value=7.3e-46 Score=268.05 Aligned_cols=407 Identities=7% Similarity=-0.017 Sum_probs=277.2 Q ss_pred CCCCCCCccCHHHHH--------HHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHH------HHHHhh---ccccCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEA--------MSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKR------YDLRKS---VAKMTN 63 (455) Q Consensus 1 m~~~~v~~~~p~y~~--------~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~------Y~~rl~---ra~~~n 63 (455) +.+.++ .|+... ..+++......|.|..... ..+.+.+...... ++.... .-+..| T Consensus 10 ~~~~~~---~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n 82 (474) T protein:vir:10 10 IEAQGI---LPKHIEALIESHKDDRERMVNLYNRYKTHIDYV----PIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNS 82 (474) T ss_pred ccccCC---CHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchh----hhhcchhhhhhhhhhhcccccccccCcccccccc Confidence 332222 222211 1233333333444421110 0111111111111 111111 114589 Q ss_pred hHHHHHHHHhhhhhcCCceecc-----CCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhh Q lcl|NC_020844. 64 VFRDVLEGLASKPFGREVEVTE-----GTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREE 138 (455) Q Consensus 64 ~~~~~v~~~~g~lf~k~p~i~~-----~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad 138 (455) +++.+|++.+|++|++||+++. ..+.++.++.++... ++++.+...+++.++.+|+||++|+.++.+ T Consensus 83 ~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~------- 154 (474) T protein:vir:10 83 FDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIR-NSVDDEDSEIGKMAAICGYGARLAYIDTNG------- 154 (474) T ss_pred hHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhh-cCHhHHHHHHHHHHhhcCeEEEEEEeCCCC------- Confidence 9999999999999999999963 345677777777544 589999999999999999999999876544 Q ss_pred HHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEee--------eeEEEEEecCeEEEEEEeCCcEEEEeccCCCC Q lcl|NC_020844. 139 ERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWED--------AETILVLRPDDWERWRKNDAGIWEVDEFGRIT 210 (455) Q Consensus 139 ~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~--------~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~ 210 (455) +|.++.++|++++.+..+.. ..+..+++... +..+.+|++..+..|+..+.+.|..++..+|+ T Consensus 155 ------~~~~~~i~p~~~~~v~d~~~---~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (474) T protein:vir:10 155 ------DIRIKNIDPYNVIFVGDNIL---EPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHL 225 (474) T ss_pred ------eeEEEEEcccceEEEEcCCC---ceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCC Confidence 36799999999987653322 24445555432 23567889988888888777788888999999 Q ss_pred CCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceee Q lcl|NC_020844. 211 IGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVL 290 (455) Q Consensus 211 l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~ 290 (455) +|.||||+|+++....+ .|.++..+..+.....|++.+.+++.++|+++++|....+... .+. ....++ T Consensus 226 ~g~vPvv~~~n~~~g~s------d~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~~~~--~~~---~~~~~i 294 (474) T protein:vir:10 226 FDYNPLFGVPNNKEMIG------DAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSEEMI--QET---QKSGAF 294 (474) T ss_pred CCccceEEecCCCCCCC------chHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCchhh--hhh---hhccee Confidence 99999999987655333 3445555555666678999999999999999999987654221 111 122344 Q ss_pred eccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhcc---CCCcchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 291 YAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLT---AQSGNLTRISAAQAASKGNSAVQQWAAGLKD 367 (455) Q Consensus 291 ~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~---~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~ 367 (455) .++ +++++++|++.+. +.+.++..++.+++.|+.++..+.. +.++|.||+|+++....+...+..+...|+. T Consensus 295 ~~~----~~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~ 369 (474) T protein:vir:10 295 ELF----DKDMDVKYLTKDV-NDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTA 369 (474) T ss_pred Eec----CCCCceeEEeccC-CHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 442 2345899999775 4578899999999999999887642 2357899999999999999999999999999 Q ss_pred HHHHHHHHHHHHcCCCCCc------eEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHH Q lcl|NC_020844. 368 ALENALYITNLWMDAPDTN------PEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEE 441 (455) Q Consensus 368 a~~~~l~~~a~~~g~~~~~------~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e 441 (455) +|++++++++.+++....+ ..+++...-.......+.++++.++ .|+||+||+++.| +...|+++| T Consensus 370 ~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l------~~v~d~~~E 441 (474) T protein:vir:10 370 MLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL--KGQVSERTRLGQS------QLVDDVDYE 441 (474) T ss_pred HHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhC------CCCCCHHHH Confidence 9999999999987653211 1233322222223346677777775 5999999999876 566689999 Q ss_pred HHHHHhhcccCCCC Q lcl|NC_020844. 442 SERLISELPGGIEE 455 (455) Q Consensus 442 ~~ri~~e~~~~l~~ 455 (455) ++||++|.+...+. T Consensus 442 ~eri~~E~~e~~~~ 455 (474) T protein:vir:10 442 LDEMEKESLEFNDK 455 (474) T ss_pred HHHHHHHHHHHHhh Confidence 99999886533322 No 39 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=100.00 E-value=7.3e-46 Score=268.05 Aligned_cols=407 Identities=7% Similarity=-0.017 Sum_probs=277.2 Q ss_pred CCCCCCCccCHHHHH--------HHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHH------HHHHhh---ccccCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEA--------MSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKR------YDLRKS---VAKMTN 63 (455) Q Consensus 1 m~~~~v~~~~p~y~~--------~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~------Y~~rl~---ra~~~n 63 (455) +.+.++ .|+... ..+++......|.|..... ..+.+.+...... ++.... .-+..| T Consensus 10 ~~~~~~---~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n 82 (474) T protein:vir:94 10 IEAQGI---LPKHIEALIESHKDDRERMVNLYNRYKTHIDYV----PIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNS 82 (474) T ss_pred ccccCC---CHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchh----hhhcchhhhhhhhhhhcccccccccCcccccccc Confidence 332222 222211 1233333333444421110 0111111111111 111111 114589 Q ss_pred hHHHHHHHHhhhhhcCCceecc-----CCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhh Q lcl|NC_020844. 64 VFRDVLEGLASKPFGREVEVTE-----GTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREE 138 (455) Q Consensus 64 ~~~~~v~~~~g~lf~k~p~i~~-----~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad 138 (455) +++.+|++.+|++|++||+++. ..+.++.++.++... ++++.+...+++.++.+|+||++|+.++.+ T Consensus 83 ~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~------- 154 (474) T protein:vir:94 83 FDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIR-NSVDDEDSEIGKMAAICGYGARLAYIDTNG------- 154 (474) T ss_pred hHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhh-cCHhHHHHHHHHHHhhcCeEEEEEEeCCCC------- Confidence 9999999999999999999963 345677777777544 589999999999999999999999876544 Q ss_pred HHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEee--------eeEEEEEecCeEEEEEEeCCcEEEEeccCCCC Q lcl|NC_020844. 139 ERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWED--------AETILVLRPDDWERWRKNDAGIWEVDEFGRIT 210 (455) Q Consensus 139 ~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~--------~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~ 210 (455) +|.++.++|++++.+..+.. ..+..+++... +..+.+|++..+..|+..+.+.|..++..+|+ T Consensus 155 ------~~~~~~i~p~~~~~v~d~~~---~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (474) T protein:vir:94 155 ------DIRIKNIDPYNVIFVGDNIL---EPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHL 225 (474) T ss_pred ------eeEEEEEcccceEEEEcCCC---ceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCC Confidence 36799999999987653322 24445555432 23567889988888888777788888999999 Q ss_pred CCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceee Q lcl|NC_020844. 211 IGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVL 290 (455) Q Consensus 211 l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~ 290 (455) +|.||||+|+++....+ .|.++..+..+.....|++.+.+++.++|+++++|....+... .+. ....++ T Consensus 226 ~g~vPvv~~~n~~~g~s------d~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~~~~--~~~---~~~~~i 294 (474) T protein:vir:94 226 FDYNPLFGVPNNKEMIG------DAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSEEMI--QET---QKSGAF 294 (474) T ss_pred CCccceEEecCCCCCCC------chHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCchhh--hhh---hhccee Confidence 99999999987655333 3445555555666678999999999999999999987654221 111 122344 Q ss_pred eccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhcc---CCCcchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 291 YAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLT---AQSGNLTRISAAQAASKGNSAVQQWAAGLKD 367 (455) Q Consensus 291 ~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~---~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~ 367 (455) .++ +++++++|++.+. +.+.++..++.+++.|+.++..+.. +.++|.||+|+++....+...+..+...|+. T Consensus 295 ~~~----~~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~ 369 (474) T protein:vir:94 295 ELF----DKDMDVKYLTKDV-NDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTA 369 (474) T ss_pred Eec----CCCCceeEEeccC-CHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 442 2345899999775 4578899999999999999887642 2357899999999999999999999999999 Q ss_pred HHHHHHHHHHHHcCCCCCc------eEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHH Q lcl|NC_020844. 368 ALENALYITNLWMDAPDTN------PEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEE 441 (455) Q Consensus 368 a~~~~l~~~a~~~g~~~~~------~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e 441 (455) +|++++++++.+++....+ ..+++...-.......+.++++.++ .|+||+||+++.| +...|+++| T Consensus 370 ~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l------~~v~d~~~E 441 (474) T protein:vir:94 370 MLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINL--KGQVSERTRLGQS------QLVDDVDYE 441 (474) T ss_pred HHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhC------CCCCCHHHH Confidence 9999999999987653211 1233322222223346677777775 5999999999876 566689999 Q ss_pred HHHHHhhcccCCCC Q lcl|NC_020844. 442 SERLISELPGGIEE 455 (455) Q Consensus 442 ~~ri~~e~~~~l~~ 455 (455) ++||++|.+...+. T Consensus 442 ~eri~~E~~e~~~~ 455 (474) T protein:vir:94 442 LDEMEKESLEFNDK 455 (474) T ss_pred HHHHHHHHHHHHhh Confidence 99999886533322 No 40 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=100.00 E-value=3.4e-45 Score=264.35 Aligned_cols=411 Identities=12% Similarity=0.061 Sum_probs=279.8 Q ss_pred CCCCC--CCc-cCH----------------------HHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHH Q lcl|NC_020844. 1 MPEQD--PTK-RSA----------------------DAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLR 55 (455) Q Consensus 1 m~~~~--v~~-~~p----------------------~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~r 55 (455) |.+=+ .+. .|- .+....++.+++.+.|.|.+.+......+. +.. .....+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~-~~~----~~~~~~ 75 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLD-NKG----EIDPLK 75 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhc-ccc----cccccc Confidence 33210 000 000 122344667777777777655433221111 100 111111 Q ss_pred hhccccCChHHHHHHHHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchh Q lcl|NC_020844. 56 KSVAKMTNVFRDVLEGLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRN 135 (455) Q Consensus 56 l~ra~~~n~~~~~v~~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t 135 (455) -..-+..|+++.+|++.+|++|++||+++..++.....+..+. .++++.....+++.++.+|++|++|+.++.+ T Consensus 76 ~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~~y~d~~~---- 149 (474) T protein:vir:96 76 PDWRMFTNYHQNLVDQKVAYAVANPVTFSSDDDKSLKTIQEVL--NHKWDDKLVDILTAASNKGIEWLQPYIDENG---- 149 (474) T ss_pred cchhcccchHHHHHHhhhhhhcccCceeecCchHHHHHHHHHH--hcCHHHHHHHHHHHHHhcCeeEEEEEecCCC---- Confidence 1111347999999999999999999999866666555555553 2578888899999999999999999876543 Q ss_pred hhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEe--eeeEEEEEecCeEEEEEEeCCcEE------------ Q lcl|NC_020844. 136 REEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWE--DAETILVLRPDDWERWRKNDAGIW------------ 201 (455) Q Consensus 136 ~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E--~~e~~~v~~~~~~~~~~~~~~g~~------------ 201 (455) +|.++.++|++++.+..+...+ ..+..+++.+ ..+.+.+|+++.+..|+..+++.. T Consensus 150 ---------~~~i~~~~p~~~~~v~d~~~~~-~~~~~vr~~~~~~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~ 219 (474) T protein:vir:96 150 ---------EFKTFRVPAEQAIPIWTNKERD-TLKAFIRYYRLDGAERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQS 219 (474) T ss_pred ---------ceEEEEEcccceEEEEcCCCCC-ceEEEEEEEeecCceEEEEEeCCeEEEEEecCCceeeccccccccccc Confidence 3678999999998755443332 2344444433 355688899998888875543321 Q ss_pred -EEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCcc Q lcl|NC_020844. 202 -EVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPK 280 (455) Q Consensus 202 -~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~ 280 (455) ......+|++|+||||+|.++....+......+|+| +.....|++.+.+.+.++|+++++|+++++..++.. T Consensus 220 ~~~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liD------a~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~- 292 (474) T protein:vir:96 220 HYYVGNKRVSWGRVPFIPFKNNPQEMSDLFMYKTIID------AMDKRLSDTQNTFDESTELIYILKGYEGQDLDEFMR- 292 (474) T ss_pred cccccccccCCCceeEEEeccCCCCCCcHHHHHHHHH------HHHHHHHHHHHHHHHhccceeeeecCCcccccchhh- Confidence 123455799999999999876554444434444444 444567888888899999999999998776554432 Q ss_pred ceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhcc---CCCcchhHHHHHHHHHHHHHH Q lcl|NC_020844. 281 PLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLT---AQSGNLTRISAAQAASKGNSA 357 (455) Q Consensus 281 ~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~---~~~~~~sa~a~~~~~~~~~s~ 357 (455) .+....++.+|. .+++++|++.+. +.++.+..+++++++|+.++..+.. +.++|.||+|.++....+... T Consensus 293 --~~~~~~~i~~~~----~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k 365 (474) T protein:vir:96 293 --NLKYYKAINVDG----DGSGVDTIQIEV-PVQSSKEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLK 365 (474) T ss_pred --hhhcCceEEecC----CCCceeEEeecC-ChHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHH Confidence 234456777763 356799999765 4578899999999999999876632 334689999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCC Q lcl|NC_020844. 358 VQQWAAGLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFD 437 (455) Q Consensus 358 L~~~a~~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d 437 (455) ...+...|+.+|++++++++.+.|.+.....+++..++.......+.++ .+.++|+||+||+++.| +...| T Consensus 366 ~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~i~i~f~~~~p~~~~e~~~---~~~~ag~iS~et~~~~~------~~v~d 436 (474) T protein:vir:96 366 ANKLKNKTLTALQELLQYIIDFYKLNIKVQDVEITFNFNVMVNELEQSQ---IGVQSQYLSKETVVTNH------PWVDD 436 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCCCcCHHHHHH---HHHhcCCCchHHHHHhC------CCCCC Confidence 9999999999999999999999997654444555433333222233333 34568999999998765 56668 Q ss_pred HHHHHHHHHhhcccCCCC Q lcl|NC_020844. 438 PEEESERLISELPGGIEE 455 (455) Q Consensus 438 ~e~e~~ri~~e~~~~l~~ 455 (455) +++|++||++|.....+. T Consensus 437 ~~~E~~ri~~E~~e~~~~ 454 (474) T protein:vir:96 437 PVAELERIEQDNIDFNKQ 454 (474) T ss_pred HHHHHHHHHHHHHHHHhc Confidence 999999999885532222 No 41 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=100.00 E-value=2.7e-45 Score=264.97 Aligned_cols=413 Identities=11% Similarity=0.079 Sum_probs=284.3 Q ss_pred CCCCCCCccCH------------HH--HHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHH Q lcl|NC_020844. 1 MPEQDPTKRSA------------DA--EAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFR 66 (455) Q Consensus 1 m~~~~v~~~~p------------~y--~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~ 66 (455) ||.+-+....+ ++ ....++++++.+.|.|.+.+..+...+.- ..+.....++. ..-+-.|+++ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~-~~~~~~~~~~~--~~ki~~~~~~ 83 (479) T protein:vir:79 7 SETDLIKVQLKKESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLL-DGAKVDDFTKV--NNKAINNYHK 83 (479) T ss_pred cccceEeeccccCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCccccccccccc-ccccccccccC--cceeecchHH Confidence 55433221111 11 01346788888888887665322111110 00000111111 1113479999 Q ss_pred HHHHHHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCc Q lcl|NC_020844. 67 DVLEGLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRP 146 (455) Q Consensus 67 ~~v~~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rP 146 (455) .+|++++|++|++||+++..++..+.+++.+. .++++..+..+++.++.+|++|++|+.++.+ +| T Consensus 84 ~Ivd~~~~~l~g~p~~~~~~~~~~~~~~~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~-------------~~ 148 (479) T protein:vir:79 84 LLVDQKVGYSVGNPIVFNADDDNLTKLLNDLL--GEEFDDTITELYLNASNKGVEWLHPYINRKG-------------EF 148 (479) T ss_pred HHHHHHHhhhhcCCceeccCCHHHHHHHHHHH--hcCHHHHHHHHHHHHHhcCeEEEEEEeCCCC-------------ce Confidence 99999999999999999877777777877664 3689999999999999999999999876644 36 Q ss_pred eEEEechhhhccceeeccCCeeEEEEEEEEe-------eeeEEEEEecCeEEEEEEeCCcE------------------E Q lcl|NC_020844. 147 YWSQVNHSAILEIQSERQGAGERITYMRMWE-------DAETILVLRPDDWERWRKNDAGI------------------W 201 (455) Q Consensus 147 y~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E-------~~e~~~v~~~~~~~~~~~~~~g~------------------~ 201 (455) .+..++|.+++....+.. ..+.+..+++.. .+..+.+|+++.+..|+..+++. + T Consensus 149 ~i~~~~p~~~~~v~d~~~-~~~~~~~ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (479) T protein:vir:79 149 KYVIIPAEEAIPIWDSKR-QRELVAFIRFYYIEDIDGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGH 227 (479) T ss_pred EEEEEccceeEEEEeCCC-CCceEEEEEEEEEeecCCceEEEEEEEeCCcEEEEEecCCccccccccccccccccccccc Confidence 789999999987433222 223444444432 23356788888888777654432 1 Q ss_pred EEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccc Q lcl|NC_020844. 202 EVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKP 281 (455) Q Consensus 202 ~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~ 281 (455) ...+..+|+||+||||+|+++....+. |.++..+..+.....|++.+++.+.++|+++++|++.+...++..+ T Consensus 228 ~~~~~~~~~~~~vPvv~~~nn~~g~sd------~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~- 300 (479) T protein:vir:79 228 FRINNKEQGWGKVPFIPFKNNEKCVSD------LTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEFIDN- 300 (479) T ss_pred ccccccccCCCcccEEEecCCCCCCcc------hhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccccchhh- Confidence 233556899999999999876553333 4445555555666778999999999999999999877665554332 Q ss_pred eeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhccC--CCcchhHHHHHHHHHHHHHHHH Q lcl|NC_020844. 282 LDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLTA--QSGNLTRISAAQAASKGNSAVQ 359 (455) Q Consensus 282 ~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~--~~~~~sa~a~~~~~~~~~s~L~ 359 (455) +....++.++. +++++|++.+. +.+.++..++.+++.|+.++..+... ..++.||+|.+.....+.+... T Consensus 301 --~~~~~~i~~~~-----~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Ai~~~~~~l~~k~~ 372 (479) T protein:vir:79 301 --IRYYKSIKVDG-----GGGVDKLEINI-PVEAKKELLDRLEKNIIIFGQGVNPESQNTGDKSGVALKFLYSLLDLKCS 372 (479) T ss_pred --hhhccceecCC-----CCcceEEeccC-CHHHHHHHHHHHHHHHHHHhCccccccccccchhHHHHHHHHHHHHHHHH Confidence 22334666653 45799999776 46889999999999999998766432 3478999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCCCCC----ceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCC Q lcl|NC_020844. 360 QWAAGLKDALENALYITNLWMDAPDT----NPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDN 435 (455) Q Consensus 360 ~~a~~~~~a~~~~l~~~a~~~g~~~~----~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~ 435 (455) .+...|+.+|++++++++.+++.... ...+++...........+.++++.++ .|+||.||+++.| +.. T Consensus 373 ~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~i~f~~~~p~~~~~~a~~~~kl--~g~iS~et~l~~l------~~v 444 (479) T protein:vir:79 373 KTEKKFKKAIRELLWFVCEYLKISGNKSYDYKTVQITFNHSMIINEAEKIDMAAKS--TGIVSDETIVSNH------PWV 444 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhC------CCC Confidence 99999999999999999998776421 12233332222222346677877775 6999999998765 556 Q ss_pred CCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 436 FDPEEESERLISELPGGIEE 455 (455) Q Consensus 436 ~d~e~e~~ri~~e~~~~l~~ 455 (455) .|+++|++||++|.++..+. T Consensus 445 ~d~~~E~~ri~~E~~~~~~~ 464 (479) T protein:vir:79 445 EDVNDELERLKKQEDTQKEY 464 (479) T ss_pred CCHHHHHHHHHHHHHHHHHH Confidence 67999999999986532221 No 42 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=100.00 E-value=5.1e-45 Score=263.40 Aligned_cols=418 Identities=14% Similarity=0.091 Sum_probs=287.4 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGRE 80 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~ 80 (455) |+....+.--..|....++++++.+.|.|.+.++ |+++.-. ..+++.. ...||++.+|++++++++..+ T Consensus 13 ~~~~~~~~L~~~~~~~~~r~~~~~~YY~G~~~i~-----~~~~~~~---~~~~~~~---~~~n~~~~ivd~~~~~l~~~g 81 (485) T protein:vir:24 13 DPAIARDEMVSAFEDQNQNLRSNTSYYEAERRPE-----AIGVTVP---VQMQSLL---AHVGYPRLYVDSIAERQAVEG 81 (485) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhccCchh-----hcCcccc---hhhhhhh---hccchHHHHHHHHhhhhccCc Confidence 3322122223455666789999999999976653 4443322 3343322 236999999999999999988 Q ss_pred ceeccCCH---HHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhc Q lcl|NC_020844. 81 VEVTEGTG---PSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAIL 157 (455) Q Consensus 81 p~i~~~~~---~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii 157 (455) +++.+... .+..++. .|+++.+++.+++.+++||++|++|+.++.+.... -...+|-++.++|++++ T Consensus 82 ~~~~~~~~~~~~l~~i~~-----~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~-----~~~~~~~i~~~~p~~~~ 151 (485) T protein:vir:24 82 FRLGDADEADEELWQWWQ-----ANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLG-----WDPNVPLIRVEPPTRMY 151 (485) T ss_pred eecCCCchhHHHHHHHHH-----hcChhHHHHHHHHHHhhcCceEEEEecCCcccccc-----cCCCcceEEEeccceeE Confidence 88754322 2444442 37899999999999999999999998765433210 11235789999999998 Q ss_pred cceeeccCCeeEEEEEE-EEe----eeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCc Q lcl|NC_020844. 158 EIQSERQGAGERITYMR-MWE----DAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFH 232 (455) Q Consensus 158 ~w~~~~~~~~~~l~~v~-~~E----~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~ 232 (455) ....+.++. ....++ +.. .+..+.+|+++.+..|.. .+|.|......+|++|.||||+|++++...+. .+. T Consensus 152 ~i~D~~~~~--~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~-~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~-~G~ 227 (485) T protein:vir:24 152 AEIDPRIGR--PAKAIRVAYDAEGNEIQAATLYTPNETFGWFR-AEGEWVEWFSDPHGLGAVPVVPLPNRTRLSDL-YGT 227 (485) T ss_pred EEeeCCcCc--eeEEEEEEEeecCCeEEEEEEEcCCcEEEEEe-cCCceEeecccccCCCcccEEEeccCcccCCc-CCc Confidence 654444432 222222 222 234577888887665544 45678888889999999999999877655443 466 Q ss_pred chHH-HHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcccc---ccCccceeeccceeeeccCCCCccccceeEEee Q lcl|NC_020844. 233 PPMK-DALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDS---EGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEP 308 (455) Q Consensus 233 ~pL~-dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~---~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~ 308 (455) +.|. ++..+..+..+..|++..++++.++|+++++|.+..... +.....+..+.+.+|.+|. ++++|.|+ T Consensus 228 s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~------~~~~~~q~ 301 (485) T protein:vir:24 228 SEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARILAFED------AEGKIQQF 301 (485) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCccccccccccccchhhhcccceeccCC------CCceEEee Confidence 6665 466666677788999999999999999999998765321 1122334556667777753 35788899 Q ss_pred CchhHHHHHHHHHHHHHHHHHhhhhhccC---CCc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCC Q lcl|NC_020844. 309 ATESLKFLAEQIESVSKEIRELGRQPLTA---QSG-NLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPD 384 (455) Q Consensus 309 ~~~~l~~~~~~l~~~~~~m~~~g~~~l~~---~~~-~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~ 384 (455) +.++++.+.+.|+.+..++...+..+... ++. +.||+|+++....+......+...|+.+|++++++++.+.+... T Consensus 302 ~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~ 381 (485) T protein:vir:24 302 SAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKGGD 381 (485) T ss_pred cccchHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 99998888888887777777665554332 222 37999999999999999999999999999999999999876432 Q ss_pred Cc---eEEEEecccCCCCCCHHHHHHHHHHHHcC--CCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 385 TN---PEADVYTDFRADTGEEQTPGYLLTMRQNG--DISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 385 ~~---~~v~v~~df~~~~~~~~~~~al~~~~~~G--~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) .. ..+++.+.-......++.++++.+++++| ++|+||+++.| +.+.+..+|++++++|..+.... T Consensus 382 ~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l------~~~~d~~~e~~~~~ee~~~~~~~ 451 (485) T protein:vir:24 382 VPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDM------GYSIAEREEMRRWDEEEAAMGLG 451 (485) T ss_pred CccccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhC------CCCHhHHHHHHHHHHHHhhhhhh Confidence 11 23333321111223467899999999865 89999998654 45556667788877664432111 No 43 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=100.00 E-value=4.7e-45 Score=263.59 Aligned_cols=406 Identities=11% Similarity=0.011 Sum_probs=279.1 Q ss_pred ccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCCceeccC- Q lcl|NC_020844. 8 KRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGREVEVTEG- 86 (455) Q Consensus 8 ~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~p~i~~~- 86 (455) ...--.....|+|+++.+.|.|.+.. .|-+ ..... .++. ..-+..|+++.+|++.+|++|++||+++.. T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g~~~~-----~~~~-~~~~~--~~~~--~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~ 70 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQGDNFS-----ILSG-HRRLD--DEKA--DYRVRHKWGGYISSFATGYVIGNPVSIGVME 70 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhccCCcc-----cccc-ccccc--ccCC--cceeecchHHHHHHhhhhheeccCceEeeCC Confidence 22333345688999999999996442 1111 11110 1111 111357999999999999999999999632 Q ss_pred --CHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhccceeecc Q lcl|NC_020844. 87 --TGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILEIQSERQ 164 (455) Q Consensus 87 --~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~ 164 (455) ++.....+.++. +.++++.....+++.++.+|++|++|+.+..+ +|-++.++|.+++....+. T Consensus 71 ~~~~~~~~~l~~~~-~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~-------------~~~i~~~~p~~~~~~~d~~- 135 (440) T protein:vir:95 71 GGSADQLSTIKDIE-WQNDINALNSDLAFDASVYGRAYEYHFRDKDK-------------VDRVVLISPLEMFVIRDLT- 135 (440) T ss_pred CccHHHHHHHHHHH-HhcCHhHHHHHHHHHHhhcCeEEEEEEecCCC-------------ceEEEEEcccceEEEEcCC- Confidence 222333444443 45689999999999999999999999865443 3678899999998744333 Q ss_pred CCeeEEEEEEEEee--eeEEEEEecCeEEEEEEeC--CcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHH Q lcl|NC_020844. 165 GAGERITYMRMWED--AETILVLRPDDWERWRKND--AGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALD 240 (455) Q Consensus 165 ~~~~~l~~v~~~E~--~e~~~v~~~~~~~~~~~~~--~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~ 240 (455) ...+.+..+++.+. ...+.+|+++.+..|+..+ .+.+...+..+|+||.||||+|+++... .+.+.++.. T Consensus 136 ~~~~~~~~i~~~~~~~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g------~sd~e~v~~ 209 (440) T protein:vir:95 136 VEQNIIAAVHLPIYADKVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVEWWNNRFR------MGDYESEIS 209 (440) T ss_pred CCCceEEEEEEEEecCceEEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEEEeeCCCCC------CCchhhhHH Confidence 33345556665543 4457789999988887654 3467778888999999999999865543 344455555 Q ss_pred HHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeecc----CCCCccccceeEEeeCchhHHHH Q lcl|NC_020844. 241 LQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAP----PNAEGQHGSWNFVEPATESLKFL 316 (455) Q Consensus 241 l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp----~~~~~~~~~~~~le~~~~~l~~~ 316 (455) +..+.....|++.+.+.+.++|+++++|............ ..+-....+.++ ....+++++++|++.+. +.+.+ T Consensus 210 lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~-~~~~~ 287 (440) T protein:vir:95 210 LIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDA-AKMKDANMLFLKTGISTTGQQTTADASYIYKQY-DVNGT 287 (440) T ss_pred HHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCccch-hhhhhccceecccccccccCCCCcceeEEeecC-CHHHH Confidence 5556667889999999999999999999754322111110 011111222221 22335677899999775 56888 Q ss_pred HHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCC----CceEE Q lcl|NC_020844. 317 AEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPD----TNPEA 389 (455) Q Consensus 317 ~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~----~~~~v 389 (455) +..++.+++.|+.++..|. ...++|.||+|.+.....+......+...|+.+|.+++++++.+++... ....+ T Consensus 288 ~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v 367 (440) T protein:vir:95 288 EAYKNRLANDIHRFSRIPNLDDDRFNSTSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAINGPVIEANKL 367 (440) T ss_pred HHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccc Confidence 9999999999999987663 2335789999999999999999999999999999999999999876432 11223 Q ss_pred EEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 390 DVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 390 ~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) ++.+.........+.++++.++ +|+||.||+++.| + ..|+++|++||++|.++--.+ T Consensus 368 ~i~f~~~~p~~~~~~ad~~~kl--~g~iS~et~~~~l------~-~~d~~~E~~ri~~E~~~~~~~ 424 (440) T protein:vir:95 368 TFTFHPNIPQDVWTEIKAYIEA--GGEISQETLMENA------S-FTDYKTEHSRILKQGGSSDLE 424 (440) T ss_pred eEEeCCCCCCCHHHHHHHHHHH--hccCcHHHHHHhC------C-CCCcHHHHHHHHHHHHHhhhh Confidence 3332222223356788888876 6899999999876 3 346678899999886643222 No 44 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=100.00 E-value=2e-44 Score=260.12 Aligned_cols=409 Identities=10% Similarity=0.030 Sum_probs=276.7 Q ss_pred CCCC------CCCccCHH---H-HHHHHHHHHHHHHhcCH-HHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHH Q lcl|NC_020844. 1 MPEQ------DPTKRSAD---A-EAMSDYLQTVDDVLEGV-TGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVL 69 (455) Q Consensus 1 m~~~------~v~~~~p~---y-~~~~~~w~~i~d~~~G~-~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v 69 (455) ++.. ++..-... + ....++|+++.+.|.|. +.+.. +...+. ....+. | ...|+++.+| T Consensus 30 ~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~-------~~~~~~-~~~~~~--k-i~~n~~k~Iv 98 (501) T protein:vir:27 30 ADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLQ-------FGRRKD-REMADK--R-AVHNYGRMIS 98 (501) T ss_pred cccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc-------cCccCc-cccccc--e-eccchHHHHH Confidence 2211 11100110 1 23457888898999885 23211 111111 111111 1 3479999999 Q ss_pred HHHhhhhhcCCceeccCC----HHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCC Q lcl|NC_020844. 70 EGLASKPFGREVEVTEGT----GPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVR 145 (455) Q Consensus 70 ~~~~g~lf~k~p~i~~~~----~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~r 145 (455) ++++||+|++||+++..+ +.+..++.++.. .++++.++..+++.++++|++|++|+.++.+ + T Consensus 99 d~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~-------------~ 164 (501) T protein:vir:27 99 KFKTGYLAGNPIRVEYDDNDNNSQNDDTIKRIGR-INDIDSHNRTLIRDLSQTGRAYEVIYRNEYD-------------E 164 (501) T ss_pred HHHhhhhcccCeeEecCCccchHHHHHHHHHHHH-hcChhHHHHHHHHHHhhCCeEEEEEEeCCCC-------------c Confidence 999999999999997432 445666666654 4689999999999999999999999876543 3 Q ss_pred ceEEEechhhhccceeeccCCeeEEEEEEEEe------eeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEE Q lcl|NC_020844. 146 PYWSQVNHSAILEIQSERQGAGERITYMRMWE------DAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPF 219 (455) Q Consensus 146 Py~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E------~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f 219 (455) |.++.++|.+++....+... ...+..+++.. .+..+.+|+++.+..|.. ++.+...+..+|+||.||||+| T Consensus 165 ~~i~~~~p~~~~~v~d~~~~-~~~~~~ir~~~~~~~~~~~~~~~vyt~~~v~~~~~--~~~~~~~~~~~~~~g~vPvv~~ 241 (501) T protein:vir:27 165 TRIKRLNPLETFVIYDNSLE-DNSIAAVRYYNRGTLQNAKDVVEIYTNEHIYTLDA--SDDFNEISVTTHAFGTVPITEF 241 (501) T ss_pred eEEEEEccceeEEEecCCCC-CceEEEEEEEEeeecCCcEEEEEEEeCCeEEEEEe--CCceeeccccccCCCcccEEEe Confidence 67899999999864433333 33455555543 234677889887766653 3445667788999999999999 Q ss_pred EecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCC---- Q lcl|NC_020844. 220 ITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPN---- 295 (455) Q Consensus 220 ~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~---- 295 (455) .++....+. +.++..+..+..+..|++.+.+++.++|+++++|......++...+ +....++.++.+ T Consensus 242 ~nn~~g~sd------~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~---~~~~~~~~~~~~~~~~ 312 (501) T protein:vir:27 242 LNNVDGIGD------YETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASD---MKRTRLMQLKPPKSAD 312 (501) T ss_pred cCCCCCCCc------hhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccchhh---hhhcCceeeccccccc Confidence 876543333 4444455555566778999999999999999999876554332211 122233333221 Q ss_pred CCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 296 AEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENA 372 (455) Q Consensus 296 ~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~ 372 (455) ....+++++|+..+. +.+.++..++++.+.|+.++..+. ...++|.||+|+++....+......+...|+.+|+++ T Consensus 313 ~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~ 391 (501) T protein:vir:27 313 GKEGTVKAEYLTKSY-DVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRR 391 (501) T ss_pred CCCCCcceeeeeccC-CHHHHHHHHHHHHHHHHHHhCCcccCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 123456789998765 446778889999999999988763 2345789999999999999999999999999999999 Q ss_pred HHHHHHHcCCCCCce-----EEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHh Q lcl|NC_020844. 373 LYITNLWMDAPDTNP-----EADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLIS 447 (455) Q Consensus 373 l~~~a~~~g~~~~~~-----~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~ 447 (455) +++++.+++...... .+++.+.-......++.++++.++ +|+||+||+++.| +...|+++|++||++ T Consensus 392 ~~li~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl--~g~iS~et~l~~l------~~v~D~~~E~eri~~ 463 (501) T protein:vir:27 392 YRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGL--GGQVSQETALSLS------GLVESPNEELDKINK 463 (501) T ss_pred HHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhC------CCCCCHHHHHHHHHH Confidence 999999876543211 233332222222246678888775 6999999998765 666689999999988 Q ss_pred hccc--------CCCC Q lcl|NC_020844. 448 ELPG--------GIEE 455 (455) Q Consensus 448 e~~~--------~l~~ 455 (455) |.+. .+++ T Consensus 464 E~~e~~~~~~~~~~~~ 479 (501) T protein:vir:27 464 EVSEIDFKGYSNDFNE 479 (501) T ss_pred HHHhhhHhhhcCcccc Confidence 8542 1222 No 45 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=100.00 E-value=6.6e-45 Score=262.80 Aligned_cols=419 Identities=14% Similarity=0.096 Sum_probs=288.3 Q ss_pred CCCCC-----CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhh Q lcl|NC_020844. 1 MPEQD-----PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASK 75 (455) Q Consensus 1 m~~~~-----v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~ 75 (455) |-+.+ +..--..+....++++++.+.|.|.+.++ |+++. -...+++.. + ..||++.+|++++++ T Consensus 8 ~~~~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G~~~i~-----~~~~~---~~~~~~~~~--~-~~n~~~~ivd~~~~~ 76 (485) T protein:vir:10 8 QEEIEDPAIARDEMVSAFEDSTQNLKTNTSYYEAERRPE-----AIGVT---VPIQMQSLL--A-HVGYPRLYVDSIAER 76 (485) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcch-----hcCCC---CChhhhhhh--h-hcCcHHHHHHHHHhh Confidence 32221 33444667777899999999999976543 44433 334444322 2 369999999999999 Q ss_pred hhcCCceeccCCH---HHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEec Q lcl|NC_020844. 76 PFGREVEVTEGTG---PSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVN 152 (455) Q Consensus 76 lf~k~p~i~~~~~---~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~ 152 (455) ++....++.++++ .+..++ +.|+++.++..+++.+++||+||++|+..+.+.... ....+|.+..++ T Consensus 77 l~~~g~~~~~~~~~~~~~~~i~-----~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~-----~~~~~~~i~~~~ 146 (485) T protein:vir:10 77 QAVEGFRFGDADEADEELWQWW-----QANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLG-----WDPNTPIIRVEP 146 (485) T ss_pred hcccceecCCCchhHHHHHHHH-----HhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccc-----cCCCeeEEEEEc Confidence 9877777644332 244444 347899999999999999999999998765433211 123468899999 Q ss_pred hhhhccceeeccCCeeEEEEEEEEe----eeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCc Q lcl|NC_020844. 153 HSAILEIQSERQGAGERITYMRMWE----DAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKK 228 (455) Q Consensus 153 ~~~ii~w~~~~~~~~~~l~~v~~~E----~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~ 228 (455) |++++....+..+ .....+.++.. ..+.+.+|+++.+..|+..+ +.|...+..+|++|+||||+|+++....+. T Consensus 147 p~~~~~~~D~~~~-~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~-~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~ 224 (485) T protein:vir:10 147 PTRMYAEIDPRIG-RVSKAIRVAYDAEGNEIQAATLYTPNDIFGWYRVE-NEWQEWFNNPHGLGVVPVVPIPNRTRLSDL 224 (485) T ss_pred cceeEEEEcCCCC-ceeEEEEEEEeeCCCeEEEEEEEeCCeEEEEEEcC-CceEEeccccCCCCcccEEEeccccccCCC Confidence 9998864433333 32222222221 24457789998877776544 568888888999999999999987665443 Q ss_pred ccCcchHH-HHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcccc--cc-CccceeeccceeeeccCCCCcccccee Q lcl|NC_020844. 229 WVFHPPMK-DALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDS--EG-SPKPLDTGPDTVLYAPPNAEGQHGSWN 304 (455) Q Consensus 229 ~~~~~pL~-dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~--~~-~~~~~~~g~~~~~~lp~~~~~~~~~~~ 304 (455) .+.+.|. ++..+..+..+..|+...++++.++|+++++|.+..... +. ....+..+.+.+|.+|. ++++ T Consensus 225 -~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~------~d~k 297 (485) T protein:vir:10 225 -YGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARILAFED------AEGK 297 (485) T ss_pred -CCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCcccccccccccchhhhhcccceeccCC------CCce Confidence 4666554 456666677788999999999999999999998765421 11 12234445566777652 3688 Q ss_pred EEeeCchhHHHHHHHHHHHHHHHHHhhhhhccC---CCc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020844. 305 FVEPATESLKFLAEQIESVSKEIRELGRQPLTA---QSG-NLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWM 380 (455) Q Consensus 305 ~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~---~~~-~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~ 380 (455) |.|++.++++.+.+.++.+..++..++..+... ++. +.||+|++.....+......+...|..+|+++++++..+. T Consensus 298 ~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~ 377 (485) T protein:vir:10 298 IQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRMM 377 (485) T ss_pred EEeecccchHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 999999999888888888888887766554322 223 3799999999999999999999999999999999998887 Q ss_pred CCCCC---ceEEEEecccCCCCCCHHHHHHHHHHHHcC--CCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhccc---- Q lcl|NC_020844. 381 DAPDT---NPEADVYTDFRADTGEEQTPGYLLTMRQNG--DISREGLWMEFQRLGELSDNFDPEEESERLISELPG---- 451 (455) Q Consensus 381 g~~~~---~~~v~v~~df~~~~~~~~~~~al~~~~~~G--~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~---- 451 (455) +.... ...+++.+.-......++.++++.+++++| ++|+||+++.| | .+.+..++++++++|..+ T Consensus 378 ~~~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~~l---g---~~~~~~~~~~~~~ee~~~~~~~ 451 (485) T protein:vir:10 378 KGGDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRERARKDM---G---YSIAEREEMRRWDEEEAAMGLG 451 (485) T ss_pred CCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhC---C---CCHhHHHHHHHHHHHHHHHHHH Confidence 64321 123444421112223477899999999976 89999998754 3 444555677777655322 Q ss_pred ----------CCCC Q lcl|NC_020844. 452 ----------GIEE 455 (455) Q Consensus 452 ----------~l~~ 455 (455) +.++ T Consensus 452 ~~~~~~~~~~~~~~ 465 (485) T protein:vir:10 452 LIGTMVDPNPTVPG 465 (485) T ss_pred HHHHhhccCCCCCC Confidence 1111 No 46 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=100.00 E-value=3.7e-44 Score=258.72 Aligned_cols=415 Identities=10% Similarity=0.060 Sum_probs=287.8 Q ss_pred CCCCCCCcc----------CHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCC-CCCCCCHHHHHHHhhccccCChHHHHH Q lcl|NC_020844. 1 MPEQDPTKR----------SADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVK-QFPHEQNKRYDLRKSVAKMTNVFRDVL 69 (455) Q Consensus 1 m~~~~v~~~----------~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLp-k~~~e~~~~Y~~rl~ra~~~n~~~~~v 69 (455) -|+.+++.. .+.+....++++++++.|.|.+.++ +++ +.+.+......++ + ..||++.+| T Consensus 4 ~p~~~l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~-----~~~~~~~~~~~~~~~~~---~-~~n~~~~iV 74 (479) T protein:vir:99 4 LPDEDLSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQEVP-----DLATRHKNKEREVLQQL---S-RKPWMGLMV 74 (479) T ss_pred CCcccCChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCccc-----ccccccCChhHHHHHHH---h-hcCcHHHHH Confidence 455544432 2567778899999999999975543 222 1222222222222 2 359999999 Q ss_pred HHHhhhhhcCCceecc--CCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCce Q lcl|NC_020844. 70 EGLASKPFGREVEVTE--GTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPY 147 (455) Q Consensus 70 ~~~~g~lf~k~p~i~~--~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy 147 (455) ++++++++-...+..+ .++.+..++. -|+++..+..+++.++++|++|++|+...... .....|. T Consensus 75 d~~~~~l~~~gf~~~d~~~~~~~~~i~~-----~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~~--------d~~g~~~ 141 (479) T protein:vir:99 75 NSFAQQLIVDGYRKTGTNENAKGWDTWR-----LNQMDKQQFWLNRAVLTFGYAFIKVTSGISPL--------DGTTVAR 141 (479) T ss_pred HHHHhhcccccccCCCchhhHHHHHHHH-----hcChhHHHHHHHHHHhhcCceEEEEecCCCCc--------CCCCceE Confidence 9999999877666532 1223444443 36899999999999999999999997321110 0122477 Q ss_pred EEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCC Q lcl|NC_020844. 148 WSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGK 227 (455) Q Consensus 148 ~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~ 227 (455) ++.++|++++....+.......+..+. ......+.+|+...+++|+.. +|.|...+..+|+||+||||||++++.... T Consensus 142 i~~~~p~~~~~iydd~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~ 219 (479) T protein:vir:99 142 IKCIDPRDAFAIWEDPYWDEWPKYLLE-RQPNGQYWWWTEEDYSIFEFK-QGKFIYRETVSHDYGHIPFVRYVNVMDLRG 219 (479) T ss_pred EEEechhheEEEecCCcccceeeEEEe-ecCceeEEEEecceEEEEEec-CCceeeccccccCCCCcceEEeecCCCcCc Confidence 999999999864333333333333333 333445667888888877654 566888889999999999999998765422 Q ss_pred cccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEe Q lcl|NC_020844. 228 KWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVE 307 (455) Q Consensus 228 ~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le 307 (455) .+.+.|.++..+..+..+..|+...++++.++|++|++|....++..+..+...+...+++.++ +.+++|.| T Consensus 220 --~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~i~~~~------~~~~~~~q 291 (479) T protein:vir:99 220 --VCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGLMLPEGANADQEKMRFAQESMLISQ------NEKASFGA 291 (479) T ss_pred --CCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCCCcccccccchhccccccccceeec------CCCceEEE Confidence 3666677888888888889999999999999999999998877666666555565556676653 23578889 Q ss_pred eCchhHHHHHHHHHHHHHHHHHhhhhhcc--CCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC Q lcl|NC_020844. 308 PATESLKFLAEQIESVSKEIRELGRQPLT--AQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDT 385 (455) Q Consensus 308 ~~~~~l~~~~~~l~~~~~~m~~~g~~~l~--~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~ 385 (455) ++..+++.+.+.++.+..+|...+..+.. +..+|.||+|++.....+......+.+.|..+|++++++++.+.|.... T Consensus 292 ~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~~~~ 371 (479) T protein:vir:99 292 IPAAPLDGLLNAYKESLLEFLALAQLPPHIAGQIVNVAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGRTEE 371 (479) T ss_pred ecccchHHHHHHHHHHHHHHhccCCCCHHHcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcc Confidence 99888888888888888888776665543 2346899999999999999999999999999999999999999987542 Q ss_pred --ceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcc---c--CCCC Q lcl|NC_020844. 386 --NPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELP---G--GIEE 455 (455) Q Consensus 386 --~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~---~--~l~~ 455 (455) ...+++.+........++.++++++++++|+||+||+++.| +.++.+ +++|++++.+ + .+.+ T Consensus 372 ~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~is~et~l~~l-------~gv~~~-~~e~~~~~~~~~~~~~~~~~ 440 (479) T protein:vir:99 372 ATDLDFTITWQDVTIQSLAQFADAWAKMVESLKIPAEGVWDMI-------PNLDQS-TVNGWKEIYDREGDFGKYMR 440 (479) T ss_pred ccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhc-------CCCCHH-HHHHHHHHHHHHHHHHHHHH Confidence 23444443212223347889999999999999999999876 333322 2233322211 0 0111 No 47 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=100.00 E-value=2.3e-44 Score=259.79 Aligned_cols=409 Identities=10% Similarity=0.036 Sum_probs=276.7 Q ss_pred CCCC------C---CCccCHHHH-HHHHHHHHHHHHhcCH-HHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHH Q lcl|NC_020844. 1 MPEQ------D---PTKRSADAE-AMSDYLQTVDDVLEGV-TGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVL 69 (455) Q Consensus 1 m~~~------~---v~~~~p~y~-~~~~~w~~i~d~~~G~-~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v 69 (455) |++. + +.+--..|. ...++|+.+.+.|.|. +.+. - +.... ...+. ..-+..|+++.+| T Consensus 30 ~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~------~-~~~~~--~~~~~--~~ri~~n~~k~Iv 98 (501) T protein:vir:96 30 ADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVL------K-SGRRK--DNEMA--DKRAVHNYGRMIS 98 (501) T ss_pred ccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCccc------C-ccccC--ccccc--cceeecchHHHHH Confidence 2211 1 111111222 3357888888999885 2221 1 11111 11111 1124589999999 Q ss_pred HHHhhhhhcCCceeccC----CHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCC Q lcl|NC_020844. 70 EGLASKPFGREVEVTEG----TGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVR 145 (455) Q Consensus 70 ~~~~g~lf~k~p~i~~~----~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~r 145 (455) ++++||+|++||+++.. .+.+..++.++. +.++++..+..+++.++++|++|++|+.++.+ + T Consensus 99 d~~~~yl~g~p~~~~~~~~~~~~~~~~~l~~~~-~~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg-------------~ 164 (501) T protein:vir:96 99 KFKTGYLAGNPIRVEYDDNDDNSQNDDAIKRIG-RINDLDSLNRTLIRDLSQTGRAYEVIYRSEYD-------------E 164 (501) T ss_pred HHHhhhhcccCeeEeeCCccchhHHHHHHHHHH-HhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCC-------------c Confidence 99999999999999632 344566666554 34789999999999999999999999865543 3 Q ss_pred ceEEEechhhhccceeeccCCeeEEEEEEEEe------eeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEE Q lcl|NC_020844. 146 PYWSQVNHSAILEIQSERQGAGERITYMRMWE------DAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPF 219 (455) Q Consensus 146 Py~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E------~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f 219 (455) |.++.++|++++....+... ...+..+++.+ ....+.+|+++.+..|.. ++.+...+..+|+||.||||+| T Consensus 165 ~~i~~~~p~~~~~v~d~~~~-~~~~~~v~~~~~~~~~~~~~~~~vyt~~~i~~~~~--~~~~~~~~~~~~~~g~vPvv~~ 241 (501) T protein:vir:96 165 TRIKRLSPLETFVIYDNSLE-DNSIAAVRYYNRGTLQSAKDVVEIYTDEHIYTLDA--SDDFNEISVTTHAFGTVPITEY 241 (501) T ss_pred eEEEEEccceeEEEEcCCCC-CceEEEEEEEEeecCCCcEEEEEEEcCCcEEEEee--CCCceeccccccCCCccceEEe Confidence 67899999999864433333 34455555543 234577889887776643 3445567778999999999999 Q ss_pred EecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCC---- Q lcl|NC_020844. 220 ITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPN---- 295 (455) Q Consensus 220 ~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~---- 295 (455) .+++... +.+.++..+..+..+..|++.+.+++.++|+++++|......++.. ..+....++.++.. T Consensus 242 ~nn~~g~------sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~---~~~~~~~~~~~~~~~~~~ 312 (501) T protein:vir:96 242 LNNIDGI------GDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQA---SDMKRTRLMQLKPPKSAD 312 (501) T ss_pred cCCccCC------CchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccch---hhhhhcCeeeeccccccc Confidence 8665433 3344445555555567789999999999999999998766543321 12223334444322 Q ss_pred CCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 296 AEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENA 372 (455) Q Consensus 296 ~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~ 372 (455) ....+++++|+..+.+ .+.++..++++.+.|+.++..+. ...++|.||+|+++....+......+...|+.+|+++ T Consensus 313 ~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~ 391 (501) T protein:vir:96 313 GKEGTVKAEYLTKSYD-VSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTKGLKRR 391 (501) T ss_pred ccccCcceeeEeccCC-HHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1234567889886653 36678889999999999987763 2345789999999999999999999999999999999 Q ss_pred HHHHHHHcCCCCCc-----eEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHh Q lcl|NC_020844. 373 LYITNLWMDAPDTN-----PEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLIS 447 (455) Q Consensus 373 l~~~a~~~g~~~~~-----~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~ 447 (455) +++++.+++..... ..+++.+........++.++++.++ .|+||+||+++.| +...|+++|++||++ T Consensus 392 ~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl--~g~iS~et~~~~l------~~v~D~~~E~~ri~~ 463 (501) T protein:vir:96 392 YRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGL--GGQVSQETALSLS------GLVESPNEELDKINK 463 (501) T ss_pred HHHHHHHHHhcccccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCchHHHHHhC------CCCCCHHHHHHHHHH Confidence 99999987653211 1233332222223346778888876 5999999998866 566689999999998 Q ss_pred hcccC-----CCC Q lcl|NC_020844. 448 ELPGG-----IEE 455 (455) Q Consensus 448 e~~~~-----l~~ 455 (455) |.+.. .++ T Consensus 464 E~~~~~~~~~~~~ 476 (501) T protein:vir:96 464 EMSEIDFKGYSND 476 (501) T ss_pred HHHHhhccccccc Confidence 85421 111 No 48 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=100.00 E-value=3.8e-44 Score=258.62 Aligned_cols=418 Identities=14% Similarity=0.077 Sum_probs=289.1 Q ss_pred CCCCC-----CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhh Q lcl|NC_020844. 1 MPEQD-----PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASK 75 (455) Q Consensus 1 m~~~~-----v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~ 75 (455) |.+.+ +..--..|....++++++.+.|.|.+.++ |++.. -...+++. + ...||++.+|++++++ T Consensus 8 ~~e~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~G~~~i~-----~~~~~---~~~~~~~~--~-~v~n~~~~iVd~~~~~ 76 (486) T protein:vir:42 8 MEEIEDPAVVREEMISAFEDASKDLASNTSYYDAERRPE-----AIGVT---VPREMQQL--L-AHVGYPRLYVDSVAER 76 (486) T ss_pred CCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcch-----hcccc---cchhHhhh--h-hccchHHHHHHHHHhh Confidence 43333 22333456667789999999999987654 33321 12333332 2 2369999999999999 Q ss_pred hhcCCceeccCCHH---HHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEec Q lcl|NC_020844. 76 PFGREVEVTEGTGP---SEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVN 152 (455) Q Consensus 76 lf~k~p~i~~~~~~---l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~ 152 (455) ++-...++.+.... +..++ +.|+++....++++.+++||++|++|+.++.+.... ....+|.++.++ T Consensus 77 l~~~g~~~~~~~~~~~~~~~i~-----~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~-----~~~~~~~i~~~~ 146 (486) T protein:vir:42 77 QAVEGFRLGDADEADEELWQWW-----QANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLG-----WDQNVPIIRVEP 146 (486) T ss_pred hcccceecCCCchhHHHHHHHH-----HhcChhHHHHHHHHHHhhcCceEEEEecCCcccccc-----cCCCeeEEEEec Confidence 98777766543322 34443 347899999999999999999999998765433211 123458899999 Q ss_pred hhhhccceeeccCCeeEEEEEEEEe-----eeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCC Q lcl|NC_020844. 153 HSAILEIQSERQGAGERITYMRMWE-----DAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGK 227 (455) Q Consensus 153 ~~~ii~w~~~~~~~~~~l~~v~~~E-----~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~ 227 (455) |++++.+..+..+ +....+++.. .+..+.+|+++.+..|+. .+|.|...+..+|++|+||||+|+++....+ T Consensus 147 p~~~~~i~d~~~~--~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~-~~~~~~~~~~~~h~~g~vPvv~~~n~~~~~~ 223 (486) T protein:vir:42 147 PTRMHAEIDPRIN--RVSKAIRVAYDKEGNEIQAATLYTPMETIGWFR-ADGEWAEWFNVPHGLGVVPVVPLPNRTRLSD 223 (486) T ss_pred ccceEEEEeCCCC--CeEEEEEEEEecCCCeEEEEEEEcCCcEEEEEe-cCCcEEeecceecCCCCceEEEeccccccCC Confidence 9999976554443 3444444432 234567889887666654 4567888888999999999999988765443 Q ss_pred cccCcchHH-HHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccC--c-cceeeccceeeeccCCCCccccce Q lcl|NC_020844. 228 KWVFHPPMK-DALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGS--P-KPLDTGPDTVLYAPPNAEGQHGSW 303 (455) Q Consensus 228 ~~~~~~pL~-dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~--~-~~~~~g~~~~~~lp~~~~~~~~~~ 303 (455) + .+.+.|. ++..+..+..+..|+...++++.++|+++++|.+.+...... . .......+.+|.+|. +++ T Consensus 224 ~-~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~ 296 (486) T protein:vir:42 224 L-YGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDSETGQTLFDAYLARILAFED------AEG 296 (486) T ss_pred C-CCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcCCccccccccccccchhhhhhchhcccCC------CCc Confidence 3 3555454 345555577778899999999999999999998765432111 1 122334456676653 368 Q ss_pred eEEeeCchhHHHHHHHHHHHHHHHHHhhhhhccC---CCc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 304 NFVEPATESLKFLAEQIESVSKEIRELGRQPLTA---QSG-NLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLW 379 (455) Q Consensus 304 ~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~---~~~-~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~ 379 (455) +|.|++.++++.+.+.++.+..++...+..+... ++. +.||+|+++....+......+...|+.+|++++++++.+ T Consensus 297 ~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~ 376 (486) T protein:vir:42 297 KIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRI 376 (486) T ss_pred eEEeecccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 8999999998888888888877777766555432 222 379999999999999999999999999999999999998 Q ss_pred cCCCCC---ceEEEEecccCCCCCCHHHHHHHHHHHHc--CCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCC Q lcl|NC_020844. 380 MDAPDT---NPEADVYTDFRADTGEEQTPGYLLTMRQN--GDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIE 454 (455) Q Consensus 380 ~g~~~~---~~~v~v~~df~~~~~~~~~~~al~~~~~~--G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~ 454 (455) .+.... ...+++.+........++.++++.+++++ |++|+||+++.| +.+.++.+|++|+++|..+... T Consensus 377 ~~~~~~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~l------g~~~d~~~e~~~~~~e~~~~~~ 450 (486) T protein:vir:42 377 MKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDM------GYSVKEREEMRRWDEEEAAMGL 450 (486) T ss_pred hcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcC------CCChhHHHHHHHHHHHHHHHHH Confidence 875321 12344443222223357789999999986 789999998754 5666778889998776532111 Q ss_pred C Q lcl|NC_020844. 455 E 455 (455) Q Consensus 455 ~ 455 (455) . T Consensus 451 ~ 451 (486) T protein:vir:42 451 G 451 (486) T ss_pred H Confidence 1 No 49 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=100.00 E-value=3.8e-44 Score=258.63 Aligned_cols=423 Identities=13% Similarity=0.097 Sum_probs=287.3 Q ss_pred CCCCCCCc-------cCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHh Q lcl|NC_020844. 1 MPEQDPTK-------RSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLA 73 (455) Q Consensus 1 m~~~~v~~-------~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~ 73 (455) |+...-.. --..+....++++++.+.|.|.+.++ |+|. .-...+++++ ...||++.+|++++ T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~~i~-----~~~~---~~~~~~~~~~---~~~n~~~~ivd~~a 69 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFENKQNELKSSKAYYDAERRPD-----AIGL---AVPLDMRKYL---AHVGYPRTYVDAIA 69 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchh-----hcCc---ccchhhhhhh---hhcchHHHHHHHHH Confidence 77664322 23566667799999999999975543 3332 2234444432 23799999999999 Q ss_pred hhhhcCCceec---------cCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccC Q lcl|NC_020844. 74 SKPFGREVEVT---------EGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGV 144 (455) Q Consensus 74 g~lf~k~p~i~---------~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~ 144 (455) .+++-....+. ...+.....+.++. +-|+++.+.+.+++.+++||++|++|+.......- .-... T Consensus 70 ~~l~~~Gf~~~~~~~~~~~~~~d~~~~~~l~~i~-~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~-----~~~~~ 143 (488) T protein:vir:23 70 ERQELEGFRIPSANGEEPESGGENDPASELWDWW-QANNLDIEATLGHTDALIYGTAYITISMPDPEVDF-----DVDPE 143 (488) T ss_pred HhhhccceeccCCcccccccccchhHHHHHHHHH-HhcChhHHHHHHHHHHhhcCceEEEEecCCccccc-----CCCCC Confidence 77743332221 11222233333343 34789999999999999999999999765422110 01123 Q ss_pred CceEEEechhhhccceeeccCCeeEEEEEEEEe-----eeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEE Q lcl|NC_020844. 145 RPYWSQVNHSAILEIQSERQGAGERITYMRMWE-----DAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPF 219 (455) Q Consensus 145 rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E-----~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f 219 (455) .|.++.++|++++.+..+..+. .+..+++.. .+..+.+|+++.+..|.. .+|.|.+.+..+|++|+|||||| T Consensus 144 ~~~i~~~~p~~~~~~~d~~~~~--~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~-~~~~~~~~~~~~h~~g~vPvv~f 220 (488) T protein:vir:23 144 VPLIRVEPPTALYAEVDPRTRK--VLYAIRAIYGADGNEIVSATLYLPDTTMTWLR-AEGEWEAPTSTPHGLEMVPVIPI 220 (488) T ss_pred cceEEEeccceeEEEEecCCCc--eEEEEEEEEecCCCcEEEEEEEecCcEEEEEe-cCCceEeccccccCCCCcceEEe Confidence 4789999999998865554442 333333322 234566888887766644 45678888889999999999999 Q ss_pred EecccCCCcccCcchHH-HHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcccccc---CccceeeccceeeeccCC Q lcl|NC_020844. 220 ITGRRKGKKWVFHPPMK-DALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEG---SPKPLDTGPDTVLYAPPN 295 (455) Q Consensus 220 ~~~~~~~~~~~~~~pL~-dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~---~~~~~~~g~~~~~~lp~~ 295 (455) +++....++ .+.+.|. ++..|..++.+..|+...++++.++|+++++|.+....... ..+.+..+.+.+|.++.+ T Consensus 221 ~n~~~~~~~-~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g 299 (488) T protein:vir:23 221 SNRTRLSDL-YGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAETGQRMFDAYMARILAFEGG 299 (488) T ss_pred ccccccCCc-CCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCcccccccccccchhhhhhhhhhccCCCC Confidence 877654433 4556564 45566678888999999999999999999999876543221 223455666778877643 Q ss_pred CCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhccC---CC-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 296 AEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLTA---QS-GNLTRISAAQAASKGNSAVQQWAAGLKDALEN 371 (455) Q Consensus 296 ~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~---~~-~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~ 371 (455) .+++|.|+++++++.+.+.|+.+..++...+..+... ++ ++.||.|++.....+......+...|+.+|++ T Consensus 300 -----~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~ 374 (488) T protein:vir:23 300 -----EGAHAEQFSAAELRNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVKKVERKNKIFGGAWEQ 374 (488) T ss_pred -----CCceeEecCCCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3688999999999999988888888888776655432 22 23799999999999999999999999999999 Q ss_pred HHHHHHHHcCCCCC---ceEEEEecccCCCCCCHHHHHHHHHHHHcC--CCCHHHHHHHHHhcCCCCCCCCHHHHHHHHH Q lcl|NC_020844. 372 ALYITNLWMDAPDT---NPEADVYTDFRADTGEEQTPGYLLTMRQNG--DISREGLWMEFQRLGELSDNFDPEEESERLI 446 (455) Q Consensus 372 ~l~~~a~~~g~~~~---~~~v~v~~df~~~~~~~~~~~al~~~~~~G--~is~et~~~~l~~~~vl~~~~d~e~e~~ri~ 446 (455) ++++++.+.|.... ...+++.+.-......++.++++.+++++| ++|+||+++.| +...++.+++++++ T Consensus 375 ~~~l~~~~~~~~~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l------~~~~d~~~~~~~~~ 448 (488) T protein:vir:23 375 AMRLAYKMVKGGDIPTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDM------GYTIVEREQMRQWL 448 (488) T ss_pred HHHHHHHHhcCCCcchhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhC------CCCchHHHHHHHHH Confidence 99999999875321 123333321112223467899999999975 79999999876 44555667777665 Q ss_pred hhccc-CCCC Q lcl|NC_020844. 447 SELPG-GIEE 455 (455) Q Consensus 447 ~e~~~-~l~~ 455 (455) +|..+ +... T Consensus 449 ~~~~~~~~~~ 458 (488) T protein:vir:23 449 EQDQKQGLGL 458 (488) T ss_pred HHHHHHHHHH Confidence 44221 0110 No 50 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=100.00 E-value=7.3e-44 Score=257.07 Aligned_cols=421 Identities=12% Similarity=0.069 Sum_probs=286.4 Q ss_pred CCCC----CCCcc-------CHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHH Q lcl|NC_020844. 1 MPEQ----DPTKR-------SADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVL 69 (455) Q Consensus 1 m~~~----~v~~~-------~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v 69 (455) |++. +-.+. -..+....++++++++.|.|.+.++ ++++. -...+++++ ...||++.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~~~i~-----~~~~~---~~~~~~~~~---~~~n~~~~iv 69 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTERTQDLGDNTAYYESERRPD-----AVGVT---VPQQMQKLL---AHVGYPRLYI 69 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccch-----hcccc---cchhHHhhh---hhcCcHHHHH Confidence 4433 22221 1123455688899999999987654 33322 223333333 3469999999 Q ss_pred HHHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEE Q lcl|NC_020844. 70 EGLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWS 149 (455) Q Consensus 70 ~~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~ 149 (455) ++++++++....++.++.+.-+.+.+ +. +-|+++.+.+.+++.+++||++|++|+.++.+... .....+|.++ T Consensus 70 d~~~~~l~~~g~~~~~~~~~~~~l~~-i~-~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~-----~~~~~~~~i~ 142 (484) T protein:vir:77 70 DAIAARQELEGFRLGGADKADEQLWD-WW-QANDLDIESTLGHTDSLVHGRSYITISKPDPNIDP-----GVDPEVPIIR 142 (484) T ss_pred HHHHhhhccCceecCCcchhHHHHHH-HH-HhcCHhHHHHHHHHHHhhcCceEEEEecCCCCccc-----ccccccceEE Confidence 99999999888777543332222322 22 34789999999999999999999999876554321 1223468899 Q ss_pred EechhhhccceeeccCCeeEEEEEEEEe-----eeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEeccc Q lcl|NC_020844. 150 QVNHSAILEIQSERQGAGERITYMRMWE-----DAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRR 224 (455) Q Consensus 150 ~~~~~~ii~w~~~~~~~~~~l~~v~~~E-----~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~ 224 (455) .++|++++....+..+ ..+..+++.. .+..+.+|+++.+..|.. .+|.|..+++.+|++|+||||+|+++.. T Consensus 143 ~~~p~~~~~~~D~~~~--~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~-~~~~~~~~~~~~~~~g~vPvv~f~N~~~ 219 (484) T protein:vir:77 143 VEPPTNLYAQIDPRTR--QVMRAIRAIEDEEGNEVIGATLYLPNNTVIWNR-EDGQWVQVANVAHNLEMVPVIPIPNRTR 219 (484) T ss_pred EeccceeEEEecCCCC--ceEEEEEEEEeecCCcEEEEEEEecCeEEEEEe-cCCceEeeccccCCCCCcceEEeccccc Confidence 9999999865444333 3333333322 234566788887655544 4567988999999999999999987665 Q ss_pred CCCcccCcchHH-HHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccC---ccceeeccceeeeccCCCCccc Q lcl|NC_020844. 225 KGKKWVFHPPMK-DALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGS---PKPLDTGPDTVLYAPPNAEGQH 300 (455) Q Consensus 225 ~~~~~~~~~pL~-dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~---~~~~~~g~~~~~~lp~~~~~~~ 300 (455) . +...+.+.|. ++..|..+..+..|+...++++.++|+++++|.+.+...... ...+..+.+.+|.+|. T Consensus 220 ~-~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------ 292 (484) T protein:vir:77 220 L-SDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEELGVDPETGQTLFDAYLARILAFED------ 292 (484) T ss_pred c-CccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhcccccccchhhhhhhhhhcccCC------ Confidence 4 3334555554 455666677788899999999999999999998765422111 1223445566777653 Q ss_pred cceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhccC---CCcc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 301 GSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLTA---QSGN-LTRISAAQAASKGNSAVQQWAAGLKDALENALYIT 376 (455) Q Consensus 301 ~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~---~~~~-~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~ 376 (455) .+++|.|++.++++.+.+.++.+..++...+..+... ++.| .||+|+++....+......+...|+.+|+++++++ T Consensus 293 ~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~ 372 (484) T protein:vir:77 293 HESKAQQFSAAELRNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVA 372 (484) T ss_pred CCceeEeecCCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3588899999999888888888888887766554332 2223 79999999999999999999999999999999999 Q ss_pred HHHcCCCCCc---eEEEEecccCCCCCCHHHHHHHHHHHHcC--CCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhccc Q lcl|NC_020844. 377 NLWMDAPDTN---PEADVYTDFRADTGEEQTPGYLLTMRQNG--DISREGLWMEFQRLGELSDNFDPEEESERLISELPG 451 (455) Q Consensus 377 a~~~g~~~~~---~~v~v~~df~~~~~~~~~~~al~~~~~~G--~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~ 451 (455) +.+.|..... ..+++.+........++.++++.+++++| ++|++|+++.| +...++.+|++++++|..+ T Consensus 373 ~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l------~~~~~~~~e~~~~~~ee~~ 446 (484) T protein:vir:77 373 YKVMNGGDIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARIDM------GYSITEREEMRKWDEEEQA 446 (484) T ss_pred HHHhCCCCcccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhcC------CCChhHHHHHHHHHHHHHH Confidence 9987653211 22333322222233578899999999975 89999998766 4455667778887766432 Q ss_pred CC----CC Q lcl|NC_020844. 452 GI----EE 455 (455) Q Consensus 452 ~l----~~ 455 (455) .. +. T Consensus 447 ~~~~~~~~ 454 (484) T protein:vir:77 447 QGLGLMGT 454 (484) T ss_pred HHHHHHhh Confidence 11 10 No 51 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=100.00 E-value=7.7e-44 Score=256.93 Aligned_cols=409 Identities=10% Similarity=0.041 Sum_probs=275.7 Q ss_pred CCC------CC---CCccCHHH-HHHHHHHHHHHHHhcCH-HHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHH Q lcl|NC_020844. 1 MPE------QD---PTKRSADA-EAMSDYLQTVDDVLEGV-TGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVL 69 (455) Q Consensus 1 m~~------~~---v~~~~p~y-~~~~~~w~~i~d~~~G~-~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v 69 (455) |.+ ++ +..---.+ ....++++.+.+.|.|. +.+ +.+..... . . +..+-+..|+++.+| T Consensus 31 ~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i-------~~~~~~~~-~-~--~~~~ki~~n~~k~Iv 99 (502) T protein:vir:48 31 ADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDV-------LKSGRRKD-N-E--MADKRAVHNYGRMIS 99 (502) T ss_pred ccchhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc-------cccccccc-c-c--cccceeecchHHHHH Confidence 211 11 11111111 23357888888888884 222 11111111 1 1 111124479999999 Q ss_pred HHHhhhhhcCCceeccC----CHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCC Q lcl|NC_020844. 70 EGLASKPFGREVEVTEG----TGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVR 145 (455) Q Consensus 70 ~~~~g~lf~k~p~i~~~----~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~r 145 (455) +.++||+|++|++++.. .+.+..++.++... |+++.++..+++.++.+|++|++|+.++.+. T Consensus 100 d~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~~-N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~------------- 165 (502) T protein:vir:48 100 KFKTGYLAGNPIRVEYDDNEDNSQNDDAIKRIGRI-NDIDTHNRNLIRDLSQTGRAYEVIYRSEYDE------------- 165 (502) T ss_pred HHHhhhhcccCeeEecCCccchhHHHHHHHHHHhh-cCHhHHHHHHHHHHhhcCeEEEEEEeCCCCc------------- Confidence 99999999999999642 24466667766544 6899999999999999999999998765443 Q ss_pred ceEEEechhhhccceeeccCCeeEEEEEEEEe------eeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEE Q lcl|NC_020844. 146 PYWSQVNHSAILEIQSERQGAGERITYMRMWE------DAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPF 219 (455) Q Consensus 146 Py~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E------~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f 219 (455) |-++.++|.+++....+... ...+..+++.. ....+.+|+++.+..| ..++.+...+..+|++|.||||+| T Consensus 166 ~~i~~~~p~~~~~vydd~~~-~~~~~~ir~~~~~~~~~~~~~~~iyt~~~i~~~--~~~~~~~~~~~~~~~~g~vPvv~~ 242 (502) T protein:vir:48 166 TRIKRLSPLETFVIYDNSLE-DNSIAAVRYYNRGTLQNAKDVVEIYTNQHIYTL--DASDSFNEISVTPHAFGTVPITEF 242 (502) T ss_pred eEEEEEcccceEEEEcCCCC-CceEEEEEEEEEeecCCcEEEEEEEeCCeEEEE--EeCCceeeccceecCCCccceEEe Confidence 56888999998764333332 23444455432 2345678888866544 455567778888999999999999 Q ss_pred EecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccC----C Q lcl|NC_020844. 220 ITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPP----N 295 (455) Q Consensus 220 ~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~----~ 295 (455) .+++...+......+|+ .+..+..|++.+.+.+.+.|+++++|......++... .+....++.++. + T Consensus 243 ~nn~~g~sd~e~v~~li------Da~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~---~~~~~~~~~~~~~~~~~ 313 (502) T protein:vir:48 243 LNNADGIGDYETELYLI------DLYDSAESDTANHMSDMADAILAIYGDLALPQGMQAS---DMKRTRLMQLKPPKSAD 313 (502) T ss_pred cCCCCCCCchhhhHHHH------HHHHHHHHHHHHHHHHhcCceeeeecCcccccccchh---hhhhcceeecccccccc Confidence 87665444444444444 4555677899999999999999999987554333211 122223333321 2 Q ss_pred CCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 296 AEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENA 372 (455) Q Consensus 296 ~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~ 372 (455) ..+++++++|++.+. +.+.++..++++.++|+.++..|. ...++|.||+|+++....+......+...|+.+|+++ T Consensus 314 ~~~~~~d~~~l~~~~-~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~ 392 (502) T protein:vir:48 314 GKEGTVKAEYLTKSY-DVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNASGEALKYKLFGLDQDRVDTQSQFTQGLKRR 392 (502) T ss_pred ccccCcceeEeeecC-CHHHHHHHHHHHHHHHHHHhCCCCcCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 234567899998775 447788899999999999987763 2335789999999999999999999999999999999 Q ss_pred HHHHHHHcCCCCCc-----eEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHh Q lcl|NC_020844. 373 LYITNLWMDAPDTN-----PEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLIS 447 (455) Q Consensus 373 l~~~a~~~g~~~~~-----~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~ 447 (455) +++++.+++..... ..+++.+.-.......+.++++.++ +|+||+||+++.| +...|+++|++||++ T Consensus 393 ~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~l~~l------~~v~D~~~E~~ri~~ 464 (502) T protein:vir:48 393 YRLAARIGSLVNEFKDFDESRLKITFTPNLPKSLYEQVSILNDL--GGQVSQETALSLS------GLVENPTEELDKINE 464 (502) T ss_pred HHHHHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhC------CCCCCHHHHHHHHHH Confidence 99999987653211 1233332112222246678888776 6999999998866 556689999999988 Q ss_pred hccc-CCCC Q lcl|NC_020844. 448 ELPG-GIEE 455 (455) Q Consensus 448 e~~~-~l~~ 455 (455) |.+. ..+. T Consensus 465 E~~~~~~~~ 473 (502) T protein:vir:48 465 ESSKIDFKG 473 (502) T ss_pred HHHhhhhhc Confidence 7542 1111 No 52 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=100.00 E-value=2.3e-43 Score=254.34 Aligned_cols=409 Identities=10% Similarity=0.042 Sum_probs=288.5 Q ss_pred CCCCC---CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhh Q lcl|NC_020844. 1 MPEQD---PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPF 77 (455) Q Consensus 1 m~~~~---v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf 77 (455) |.+.. +..--..+....++++.+++.|.|.+.++ ++|+. ....|++.+ ...||++.+|++++++++ T Consensus 1 ~~~~~~~~i~~l~~~~~~~~~r~~~l~~Yy~G~~~i~-----~~~~~---~~~~~~~~k---~~~n~~~~ivd~~~~~l~ 69 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQRLSSWHCCIEGYYEGSNRVR-----DLGVA---IPPELQRVQ---TVVSWPGIAVDALEERLD 69 (441) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcch-----hcCcc---cchhhhhhh---hhcchHHHHHHHHHhhhc Confidence 66543 33334456666788889999999976653 33332 233444332 347999999999999997 Q ss_pred cCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhc Q lcl|NC_020844. 78 GREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAIL 157 (455) Q Consensus 78 ~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii 157 (455) ....+. .+++.++.+++. |+++.++..++++++.+|++|++|+.+..+. |.++.++|++++ T Consensus 70 ~~g~~~-~d~~~l~~i~~~-----n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~-------------~~i~~~~p~~~~ 130 (441) T protein:vir:80 70 WLGWTN-GDGYGLDGVYAA-----NRLATASCDVHLDALIFGLSFVAIIPHGDGT-------------VSVRPQSPKNCT 130 (441) T ss_pred cccccC-CChHHHHHHHHh-----cCHHHHHHHHHHHHhhcCeeEEEEEeCCCCc-------------eEEEEEccceEE Confidence 655543 345677777653 6899999999999999999999998665443 668999999988 Q ss_pred cceeeccCCeeEEEEEEEEee---eeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcch Q lcl|NC_020844. 158 EIQSERQGAGERITYMRMWED---AETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPP 234 (455) Q Consensus 158 ~w~~~~~~~~~~l~~v~~~E~---~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~p 234 (455) ....+.. +.....++++.+. ...+.+|+++.+..|...+++.|..++..+|++|+||||+|.+++...+ ..+.+. T Consensus 131 ~i~d~~~-~~~~~~~~~~~~~~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~-~~G~s~ 208 (441) T protein:vir:80 131 GKFSADG-SRLDAGLVVQQTCDPEVVEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPLVPIVNRRRTSR-IDGRSE 208 (441) T ss_pred EEEeCCC-CceeEEEEEEEEecCceEEEEEEecCeEEEEEEcCCcceeeccccccCCCceeEEEeeccccCCc-cCCccc Confidence 6333333 3444444444332 2345678888877777788888888999999999999999987765433 345666 Q ss_pred HH-HHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEeeCchhH Q lcl|NC_020844. 235 MK-DALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESL 313 (455) Q Consensus 235 L~-dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l 313 (455) |. ++..+..+.....|+...++++.++|+++++|.+.+...+. ......+.+|.+|.++.+. .+++.+++.+++ T Consensus 209 l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~G~~~~~~~~~---~~~~~~~~i~~~~~~~~~~--~~~~~~~~~~~~ 283 (441) T protein:vir:80 209 ITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVTGVSADEFSQP---GWVLSMASVWAVDKDDDGD--TPNVGSFPVNSP 283 (441) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHhhcCceeeeecCCccccccc---hhhhcccccccCCCCCCCC--cceeEecCccch Confidence 64 35566667778899999999999999999999876654332 2344557788888766433 477788888889 Q ss_pred HHHHHHHHHHHHHHHHhhhhhccC---CCcc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCc--- Q lcl|NC_020844. 314 KFLAEQIESVSKEIRELGRQPLTA---QSGN-LTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDTN--- 386 (455) Q Consensus 314 ~~~~~~l~~~~~~m~~~g~~~l~~---~~~~-~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~--- 386 (455) +.+.+.|+.+..+++..+..+... .+.+ .||+|+++....+......+...|+.+|.+++++++.++|..... T Consensus 284 ~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~ 363 (441) T protein:vir:80 284 TPYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDEADF 363 (441) T ss_pred HHHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc Confidence 999988888888888776655322 2323 699999999999999999999999999999999999998864321 Q ss_pred -eEEEEecccCCCCCCHHHHHHHHHHHHcCCC--CHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 387 -PEADVYTDFRADTGEEQTPGYLLTMRQNGDI--SREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 387 -~~v~v~~df~~~~~~~~~~~al~~~~~~G~i--s~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) ..+++.+.-.......+.++++.+++++|++ |++|+++.+ +.. ++|++|+++|..+..+. T Consensus 364 ~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l------~~~---~~e~~~~~~e~~e~~~~ 426 (441) T protein:vir:80 364 FGDVGLRWRDASTPTRAATADAVTKLVGAGILPADSRTVLEML------GLD---DVQVEAVMRHRAESSDP 426 (441) T ss_pred ceeeeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHHHhC------CCC---HHHHHHHHHHHHHHHHH Confidence 2333332222223346889999999999975 677776544 222 23444554443322111 No 53 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=100.00 E-value=7.5e-44 Score=257.02 Aligned_cols=415 Identities=10% Similarity=0.034 Sum_probs=280.2 Q ss_pred CCCCCCCccC-HHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcC Q lcl|NC_020844. 1 MPEQDPTKRS-ADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGR 79 (455) Q Consensus 1 m~~~~v~~~~-p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k 79 (455) |..+.+..-. -......++|+.+.+.|.|.+..=-....+.+ ... .-..| +-.|+++.+|++.+|++|++ T Consensus 22 l~~~~i~~li~~~~~~~~~r~~~l~~YY~g~~~~i~~~~~~~~---~~~--~~~~k----i~~n~~~~Iv~~~~~~l~G~ 92 (506) T protein:vir:94 22 LTPNKIMKFITHHFNYQRPRLEMLDDYYQGYNLKILDKQSRRH---EDG--KADHR----ATHSFAKYIADFQTSYSVGN 92 (506) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccc---ccc--CCcce----eecchHHHHHHHhhhhhccc Confidence 3322221110 01234578899999999997542000001111 000 00122 23799999999999999999 Q ss_pred CceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhccc Q lcl|NC_020844. 80 EVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILEI 159 (455) Q Consensus 80 ~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w 159 (455) ||+++..++..+..+.++.. -|+++..+..++++++.+|++|++|+.++.+ +|-++.++|.+++.. T Consensus 93 p~~~~~~d~~~~~~l~~~~~-~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~-------------~~~i~~~~p~~~~~v 158 (506) T protein:vir:94 93 PINVKLPDDGSNSGFDTFNK-ANDVDAENYDLFLDMSRYGRAYEYVYRGEDN-------------EEHLAKLDPLDTFVI 158 (506) T ss_pred CceeecCcchHHHHHHHHHh-ccCHhHHHHHHHHHHHhcCeEEEEEEecCCC-------------eeEEEEEcccceEEE Confidence 99997666666666666553 3789999999999999999999999876543 367889999999864 Q ss_pred eeeccCCeeEEEEEEEEee-----------eeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCc Q lcl|NC_020844. 160 QSERQGAGERITYMRMWED-----------AETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKK 228 (455) Q Consensus 160 ~~~~~~~~~~l~~v~~~E~-----------~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~ 228 (455) ..+...+ ..+..+++... ...+.+|++..+..|.... ..|.+.+...|+||.||||+|+++....+. T Consensus 159 ~dd~~~~-~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~-~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd 236 (506) T protein:vir:94 159 YSTDVDP-KPIMAVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTP-IMGKMQVDTTKPITTFPVVEFKNSNFRLGD 236 (506) T ss_pred ecCCCCC-ceEEEEEEEeeeeccCCceeEEEEEEEEEeCceEEEecccc-CccceeccccccCCccceEEecCCCCCCCc Confidence 4333332 34444554321 1234567877777664433 346677778899999999999877665555 Q ss_pred ccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccc---------------------eeeccc Q lcl|NC_020844. 229 WVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKP---------------------LDTGPD 287 (455) Q Consensus 229 ~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~---------------------~~~g~~ 287 (455) +.+..||+|.++ ...|++.+.+.+.+.|+++++|....+........ ..+... T Consensus 237 ~e~~~~liDa~d------~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (506) T protein:vir:94 237 FENVLPLIDLYD------AAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDA 310 (506) T ss_pred hhhhHHHHHHHH------HHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHHHhhhhhc Confidence 555566655554 46788888899999999999997644321110000 001112 Q ss_pred eeeeccCCC----CccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 288 TVLYAPPNA----EGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQQ 360 (455) Q Consensus 288 ~~~~lp~~~----~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~~ 360 (455) .++.++.+. .+.+++++||..+. ..+..+..++.+.+.|+.++..|. ...+++.||+|.++....+...... T Consensus 311 ~~~~~~~~~~~~~~~~~~d~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Aik~~~~~l~~k~~~ 389 (506) T protein:vir:94 311 NMLLLKSGMTVNGTQTSVDAKYINKTY-DVVGSEAYKKRVAGDIHKFSHTPDLTDENFASNSSGVAMQYKVLGTVELAST 389 (506) T ss_pred CeeeecccccccCccccccceeeeecC-CHHHHHHHHHHHHHHHHHHhCccccccccccccchHHHHHHHHHHHHHHHHH Confidence 344444322 23566899998776 457889999999999999988773 2335789999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHcCCCCCc-----eEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCC Q lcl|NC_020844. 361 WAAGLKDALENALYITNLWMDAPDTN-----PEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDN 435 (455) Q Consensus 361 ~a~~~~~a~~~~l~~~a~~~g~~~~~-----~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~ 435 (455) +...|+.+|++++++++.+++..... ..+++.++........+.++++.++ +|+||+||+++.| |.. T Consensus 390 k~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l------p~v 461 (506) T protein:vir:94 390 KRRMFERGLYARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKALVQA--GATLPQKYLYQQL------PGV 461 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC------CCC Confidence 99999999999999999887643211 1234443333333346778888776 6999999998865 666 Q ss_pred CCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 436 FDPEEESERLISELPGGIEE 455 (455) Q Consensus 436 ~d~e~e~~ri~~e~~~~l~~ 455 (455) .|+++|++||++|....... T Consensus 462 ~d~~~E~~ri~~E~~~~~~~ 481 (506) T protein:vir:94 462 TNPQDIVDMMKEQSANGDYS 481 (506) T ss_pred CCHHHHHHHHHHHHHHHhhc Confidence 78999999999986532221 No 54 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=100.00 E-value=2.4e-43 Score=254.29 Aligned_cols=418 Identities=15% Similarity=0.107 Sum_probs=286.0 Q ss_pred CCCCC--CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQD--PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~--v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |++.. +..--..+....++++++++.|.|.+.+| |++ ..-...++++. ...||++.+|++++++++. T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~~~l~~Yy~G~~~i~-----~~~---~~~~~~~~~~~---~~~n~~~~ivd~~~~~l~~ 69 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLK-----TIG---IGAPPELAYLD---VQPGWVATYLRTLSDRLDI 69 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-----ccc---cccchhHhhhh---hhcchHHHHHHHHHhhhcc Confidence 88654 34344456677899999999999976553 333 22233444432 3479999999999999988 Q ss_pred CCceeccCCH---HHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhh Q lcl|NC_020844. 79 REVEVTEGTG---PSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSA 155 (455) Q Consensus 79 k~p~i~~~~~---~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ 155 (455) ...++.++.+ .++.++ +.|+++.++..+++.++++|+||++|+....... ....+|.+..++|++ T Consensus 70 ~g~~~~~d~~~~~~l~~i~-----~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~-------d~~g~~~i~~~~p~~ 137 (480) T protein:vir:78 70 EGFRISEDSEGLEELWNWW-----QANDLDEESVLGHDDSLTFGRSYITVSHPDVESG-------DPAGIPLIRVESPLY 137 (480) T ss_pred CceecCCCchhHHHHHHHH-----HhcCHHHHHHHHHHHHhhcCceEEEEecCccccC-------CCCCeeEEEEEcccc Confidence 8877754333 344444 3478999999999999999999999985432211 123458899999999 Q ss_pred hccceeeccCCeeEEEEEEEEe------eeeEEEEEecCeEEEEEEeCCc--EEEE-eccCCCCCCceeEEEEEecccCC Q lcl|NC_020844. 156 ILEIQSERQGAGERITYMRMWE------DAETILVLRPDDWERWRKNDAG--IWEV-DEFGRITIGVVPMVPFITGRRKG 226 (455) Q Consensus 156 ii~w~~~~~~~~~~l~~v~~~E------~~e~~~v~~~~~~~~~~~~~~g--~~~~-~~~g~~~l~~vP~V~f~~~~~~~ 226 (455) ++....+... ++....+++.. .+..+.+|+++.+..|+..++. .|.. .+..+|+||+||||+|+++.... T Consensus 138 ~~~~~D~~~~-~~~~~~i~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~ 216 (480) T protein:vir:78 138 MYAELDPRNT-RRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) T ss_pred eEEEEcCCCc-cceEEEEEEEEeecCCCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEEeecccccC Confidence 9874433332 22233333321 2456788999988878765443 3333 35568999999999998776543 Q ss_pred CcccCcchHH-HHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccC-ccceeeccceeeeccCCCCcccccee Q lcl|NC_020844. 227 KKWVFHPPMK-DALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGS-PKPLDTGPDTVLYAPPNAEGQHGSWN 304 (455) Q Consensus 227 ~~~~~~~pL~-dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~-~~~~~~g~~~~~~lp~~~~~~~~~~~ 304 (455) ++ .+.+.|. ++..|..+..+..|+...++++.++|+++++|.+.+...+.. ...+....+.++.++ +.+++ T Consensus 217 ~~-~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~ 289 (480) T protein:vir:78 217 NR-YGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA------SEAAK 289 (480) T ss_pred Cc-cCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhhcCCccccccccccchhhhhhhhhccCC------CCCce Confidence 33 4566565 366777778889999999999999999999998766433222 222333344555443 34688 Q ss_pred EEeeCchhHHHHHHHHHHHHHHHHHhhhhhccC---CCc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020844. 305 FVEPATESLKFLAEQIESVSKEIRELGRQPLTA---QSG-NLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWM 380 (455) Q Consensus 305 ~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~---~~~-~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~ 380 (455) |.|++.++++.+.+.++.+..+++..+..+... .+. +.||+|.++....+......+...|..+|.+++++++.+. T Consensus 290 ~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~ 369 (480) T protein:vir:78 290 ISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIM 369 (480) T ss_pred EEecCccCHHHHHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 999999999999999988888888776654332 232 3799999999999999999999999999999999999999 Q ss_pred CCCCC--ceEEEEecccCCCCCCHHHHHHHHHHHHcC--CCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccC---- Q lcl|NC_020844. 381 DAPDT--NPEADVYTDFRADTGEEQTPGYLLTMRQNG--DISREGLWMEFQRLGELSDNFDPEEESERLISELPGG---- 452 (455) Q Consensus 381 g~~~~--~~~v~v~~df~~~~~~~~~~~al~~~~~~G--~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~---- 452 (455) |.... ...+++.+.-......++.++++.+++++| ++|++|+++.| +.+.++.+++++++++.... T Consensus 370 g~~~~~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l------g~~~d~~~~~~~~~~e~~~~~~~~ 443 (480) T protein:vir:78 370 GREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDL------GYTATQREQMRDWDKQETEDMIDT 443 (480) T ss_pred CCCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcC------CCCHhHHHHHHHHHHHHHHHHHHH Confidence 85321 123444421112223478899999999875 79999998865 44445555555443332211 Q ss_pred CCC Q lcl|NC_020844. 453 IEE 455 (455) Q Consensus 453 l~~ 455 (455) +.. T Consensus 444 ~~~ 446 (480) T protein:vir:78 444 LYS 446 (480) T ss_pred hhc Confidence 111 No 55 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=100.00 E-value=3.1e-43 Score=253.66 Aligned_cols=418 Identities=15% Similarity=0.115 Sum_probs=287.8 Q ss_pred CCCCC--CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQD--PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~--v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |++.. +..--..+....++++++.+.|.|.+.++ |++.. -...+++.. ...||++.+|++++++++. T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~-----~~~~~---~~~~~~~~~---~~~n~~~~ivd~~~~~l~~ 69 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLK-----TIGIG---APPELAYLD---VQPGWVATYLRTLSDRLDI 69 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccch-----hcccc---cchhhhhhh---hhcchHHHHHHHHHhhhcc Confidence 88654 44444566777899999999999976653 33322 223333322 2479999999999999998 Q ss_pred CCceeccCC---HHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhh Q lcl|NC_020844. 79 REVEVTEGT---GPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSA 155 (455) Q Consensus 79 k~p~i~~~~---~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ 155 (455) ...++.++. +.+..++ +.|+++..+..+++.++++|+||++|+....... ....+|.+..++|++ T Consensus 70 ~g~~~~~d~~~~~~l~~i~-----~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~-------d~~~~~~i~~~~p~~ 137 (480) T protein:vir:78 70 EGFRISEDSEGLEELWNWW-----QANDLDEESVLGHDDSLTFGRAYITVSHPDVESG-------DPAGIPLIRVESPLY 137 (480) T ss_pred CceecCCCchhHHHHHHHH-----HhcCHHHHHHHHHHHHhhcCceEEEeecCccccC-------CCCCeeEEEEEcccc Confidence 887775433 3344444 3478999999999999999999999985432111 123458899999999 Q ss_pred hccceeeccCCeeEEEEEEEEe------eeeEEEEEecCeEEEEEEeCCc--EEEE-eccCCCCCCceeEEEEEecccCC Q lcl|NC_020844. 156 ILEIQSERQGAGERITYMRMWE------DAETILVLRPDDWERWRKNDAG--IWEV-DEFGRITIGVVPMVPFITGRRKG 226 (455) Q Consensus 156 ii~w~~~~~~~~~~l~~v~~~E------~~e~~~v~~~~~~~~~~~~~~g--~~~~-~~~g~~~l~~vP~V~f~~~~~~~ 226 (455) ++....+.+. ++....+++.. .+..+.+|+++.+..|+..++. .|.. .+..+|++|+||||+|+++.... T Consensus 138 ~~~i~D~~~~-~~~~~~i~~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~ 216 (480) T protein:vir:78 138 MYAELDPRNT-RRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) T ss_pred eEEEEcCCCc-cceEEEEEEEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEeecccccC Confidence 9863333322 22333344322 1345788999988777766543 2332 35668999999999998776543 Q ss_pred CcccCcchHH-HHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcccc-ccCccceeeccceeeeccCCCCcccccee Q lcl|NC_020844. 227 KKWVFHPPMK-DALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDS-EGSPKPLDTGPDTVLYAPPNAEGQHGSWN 304 (455) Q Consensus 227 ~~~~~~~pL~-dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~-~~~~~~~~~g~~~~~~lp~~~~~~~~~~~ 304 (455) ++ .+.+.|. ++..|..+..+..|+...++++.++|+++++|.+.+... +...+.+....+.++.++ +++++ T Consensus 217 ~~-~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~ 289 (480) T protein:vir:78 217 NR-YGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA------SEAAK 289 (480) T ss_pred Cc-cCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhCCCccccccccccchhhhhhhhhccCC------CCCce Confidence 33 4566665 466777788889999999999999999999998765432 222233333445555543 34688 Q ss_pred EEeeCchhHHHHHHHHHHHHHHHHHhhhhhccC---CCcc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020844. 305 FVEPATESLKFLAEQIESVSKEIRELGRQPLTA---QSGN-LTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWM 380 (455) Q Consensus 305 ~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~---~~~~-~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~ 380 (455) |.|++.++++.+.+.++.+..+++..+..+... ++.| .||+|.++....+......+...|+.+|++++++++.+. T Consensus 290 ~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~ 369 (480) T protein:vir:78 290 ISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIM 369 (480) T ss_pred EEecCccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 999999999999999999999888776655333 2333 799999999999999999999999999999999999998 Q ss_pred CCCCC--ceEEEEecccCCCCCCHHHHHHHHHHHHcC--CCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 381 DAPDT--NPEADVYTDFRADTGEEQTPGYLLTMRQNG--DISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 381 g~~~~--~~~v~v~~df~~~~~~~~~~~al~~~~~~G--~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) |.... ...+++.+.-......++.++++.+++++| ++|++|+++.| +.+.++.+|+++++++......+ T Consensus 370 ~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l------g~~~d~~~e~~~~~~~~~~~~~~ 442 (480) T protein:vir:78 370 GREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDL------GYTATQREQMRDWDKQETEDMID 442 (480) T ss_pred CCCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcC------CCCHhHHHHHHHHHHHHHHHHHH Confidence 85432 223444432222233478899999999875 79999988755 45555666665554443221111 No 56 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=100.00 E-value=7.9e-43 Score=251.39 Aligned_cols=420 Identities=11% Similarity=0.055 Sum_probs=282.6 Q ss_pred CCCCCCCc-----c-CHH--------H-HHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChH Q lcl|NC_020844. 1 MPEQDPTK-----R-SAD--------A-EAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVF 65 (455) Q Consensus 1 m~~~~v~~-----~-~p~--------y-~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~ 65 (455) |+.+|... + .|+ + ....++++++++.|.|.+.+... |........ .+| +..|++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~r~~~~~~yy~g~~~i~~~-----~~~~~~~~~--~~k----i~~n~~ 69 (489) T protein:vir:99 1 MLQEDFEAIDYESKLWIDQLKNYISRFKAEQLERLKELKRYYLGDNNIKYR-----PAKTDKYAA--DNR----IASDFA 69 (489) T ss_pred CCccceeeeCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccc-----cccccccCC--cce----eecchH Confidence 55544211 1 111 1 23458899999999997654321 111111101 112 347999 Q ss_pred HHHHHHHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCC Q lcl|NC_020844. 66 RDVLEGLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVR 145 (455) Q Consensus 66 ~~~v~~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~r 145 (455) +.+|++++|++|++||+++..++..+.+++++... ++++.++..+++.++.+|++|++|+...... ...+ T Consensus 70 ~~iv~~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d---------~~~~ 139 (489) T protein:vir:99 70 KYITVFEQGYMLGVPVEYKNENKDLQAAIDLMSVR-NNEDYHNVKIKTDLSIYGRAYELLTVEKIDD---------KKTE 139 (489) T ss_pred HHHHHHHhhhhccCCceeecCChhHHHHHHHHHhh-cChhHHHHHHHHHHhhCCeEEEEEeeccCcC---------CCcc Confidence 99999999999999999987777778888777654 6899999999999999999999997432211 2346 Q ss_pred ceEEEechhhhccceeeccCCeeEEEEEEEEe-------eeeEEEEEecCeEEEEEEeC--CcEEEEeccCCCCCCceeE Q lcl|NC_020844. 146 PYWSQVNHSAILEIQSERQGAGERITYMRMWE-------DAETILVLRPDDWERWRKND--AGIWEVDEFGRITIGVVPM 216 (455) Q Consensus 146 Py~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E-------~~e~~~v~~~~~~~~~~~~~--~g~~~~~~~g~~~l~~vP~ 216 (455) |.+..++|.+++....+... .+.+..+++.+ ....+.+|+++.+..|+... .+.+.+.+..+|++|.||| T Consensus 140 ~~i~~~~p~~~~~v~dd~~~-~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~vPv 218 (489) T protein:vir:99 140 VKLYQLPAEQTFVIYDDTYQ-RNSLMAVHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGVPV 218 (489) T ss_pred eEEEEEcccceEEEEcCCCC-CceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEecCCCcccceecccccccCCceeE Confidence 88999999998864433322 33444444332 23457789999888787643 2345677888999999999 Q ss_pred EEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCc-cceeec---------- Q lcl|NC_020844. 217 VPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSP-KPLDTG---------- 285 (455) Q Consensus 217 V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~-~~~~~g---------- 285 (455) |+|+++....+. |.++..+..++.+..|++.+.+.+.++|+++++|......+.... ...... T Consensus 219 v~~~n~~~~~s~------~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (489) T protein:vir:99 219 NEYANNEERTGA------YESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTGADENDYLDDGRLNPNGRLAISIG 292 (489) T ss_pred EEeecCCCCCCc------hhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCcccccchhhhhhcccccccccccccc Confidence 999876543333 344444444556677899999999999999999976544322110 011111 Q ss_pred --cceeeeccCCC--CccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHH Q lcl|NC_020844. 286 --PDTVLYAPPNA--EGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAV 358 (455) Q Consensus 286 --~~~~~~lp~~~--~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L 358 (455) ...++.+..+. .+.+.+++|+..+. +.+.++..++++++.|+.++..+. ...++|.||+|++.....+.+.. T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~ 371 (489) T protein:vir:99 293 FKKAQVLILDDNPNPNGVKPQAYFLKKEY-DTAGSEAYKNRLVADILRFTFTPDTQDMKFSGVQSGESMKYKLMASDNYR 371 (489) T ss_pred cccceeeeeccccCccccccceeeeeecC-ChHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHH Confidence 12233332221 23455788888655 457788999999999999987763 23357899999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCCCc-------eEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCC Q lcl|NC_020844. 359 QQWAAGLKDALENALYITNLWMDAPDTN-------PEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGE 431 (455) Q Consensus 359 ~~~a~~~~~a~~~~l~~~a~~~g~~~~~-------~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~v 431 (455) ..+...|+.+|++++++++.+++..... ..+++.++.......++.++++.++ .|+||+||+++.|. ++ T Consensus 372 ~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl--~giis~et~~~~l~--~v 447 (489) T protein:vir:99 372 EKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNL--YGIVSDQTIFEILN--TV 447 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCCHHHHHHhcC--CC Confidence 9999999999999999999998653211 1233332222223346678888876 59999999988661 11 Q ss_pred CCCCCCHHHHHHHHHhhccc--CCCC Q lcl|NC_020844. 432 LSDNFDPEEESERLISELPG--GIEE 455 (455) Q Consensus 432 l~~~~d~e~e~~ri~~e~~~--~l~~ 455 (455) .+.++++|++||++|.+. ++.+ T Consensus 448 --~~~d~~~E~~ri~~E~~~~~~~~~ 471 (489) T protein:vir:99 448 --TGVDAEAELKRLKEEADKKQSLPE 471 (489) T ss_pred --CchhHHHHHHHHHHHHHHHhcccc Confidence 123789999999888543 3333 No 57 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=100.00 E-value=3.2e-42 Score=248.06 Aligned_cols=404 Identities=13% Similarity=0.086 Sum_probs=275.9 Q ss_pred CCCCCC---------CccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHH Q lcl|NC_020844. 1 MPEQDP---------TKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEG 71 (455) Q Consensus 1 m~~~~v---------~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~ 71 (455) -|+.+. ..-.+.|...+++++++.+.|.|.+.+ .+||+.- ...|+....++ ..||++.+|++ T Consensus 18 ~p~~~~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~-----~~~~~~~---~~~~~~~~~~~-v~n~~~~ivd~ 88 (501) T protein:vir:25 18 FPEDSMSREQLGALVADMWRLHISERQWLDRIYEYTKGLRGR-----PEVPEGA---SDEVKELAKLS-VKNVLSLVRDS 88 (501) T ss_pred CCcccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-----hhccccC---ChhhhhhHhhh-hcChHHHHHHH Confidence 333322 223456777889999999999986543 3455332 34455433332 36999999999 Q ss_pred HhhhhhcCCceecc--CCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEE Q lcl|NC_020844. 72 LASKPFGREVEVTE--GTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWS 149 (455) Q Consensus 72 ~~g~lf~k~p~i~~--~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~ 149 (455) ++++++-...+..+ ..+.+..++ +-|+|+....++++.+++||++|++|+..+.+ |.++ T Consensus 89 ~a~~l~~~gf~~~d~~~~~~l~~i~-----~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~--------------~~i~ 149 (501) T protein:vir:25 89 FAQNLSVVGYRNALAKENDPAWEMW-----QRNRMDARQAEVHRPALTYGASYVTVTPTDEG--------------PVFR 149 (501) T ss_pred HHhhhcccceecCCccchHHHHHHH-----HhcChhHHHHHHHHHHhhcCceEEEEecCCCC--------------CeEE Confidence 99999877766642 223344443 34789999999999999999999999754422 4688 Q ss_pred Eechhhhcc-ceeeccCCeeEEEEEEEEe------eeeEEEEEecCeEEEEEEe-------CCcEE-------------E Q lcl|NC_020844. 150 QVNHSAILE-IQSERQGAGERITYMRMWE------DAETILVLRPDDWERWRKN-------DAGIW-------------E 202 (455) Q Consensus 150 ~~~~~~ii~-w~~~~~~~~~~l~~v~~~E------~~e~~~v~~~~~~~~~~~~-------~~g~~-------------~ 202 (455) .++|.+++. |.....+ .+.+..+++.. ...++.+|++..+..|... ..+.| . T Consensus 150 ~~sp~~~~~iy~D~~~~-~~~~~ai~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (501) T protein:vir:25 150 TRSPRQILAVYADPSVD-AWPQYALETWVAQKDAKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVI 228 (501) T ss_pred EeccccEEEEEecCCCC-cceeEEEEEEeeccccCcceeEEEecCeeEEEEecCceeeeecccccccccccccccccccc Confidence 899999975 5433333 33333333321 1234556666544443321 11111 1 Q ss_pred EeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccce Q lcl|NC_020844. 203 VDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPL 282 (455) Q Consensus 203 ~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~ 282 (455) .+..++|+++.||||+|.+++..++. +.+.+.++..+..+..+..|+...++++.++|++|+.|++.+..+. . T Consensus 229 ~~~~~~~~~~~vPiv~f~N~~~~~~~--g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~-----~ 301 (501) T protein:vir:25 229 EHGATFEGKPVCPVVRFVNGRDADDM--IVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVISGWTGSKAEV-----L 301 (501) T ss_pred ccccccCCccceeeEeccCccccCcc--ccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHhCCCCCccch-----h Confidence 22356899999999999887665443 4555666777777888899999999999999999999998765443 2 Q ss_pred eeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCCcchhHHHHHHHHHHHHHHHH Q lcl|NC_020844. 283 DTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQSGNLTRISAAQAASKGNSAVQ 359 (455) Q Consensus 283 ~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~~~~sa~a~~~~~~~~~s~L~ 359 (455) .+..+++|.+| +++++|.|++...++.+.+.++.+..+|...+..+. .+.++|.||+|+++....+..... T Consensus 302 ~~~~~~i~~~~------~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~Al~~~~~~l~~ka~ 375 (501) T protein:vir:25 302 KASALRVWTFE------DPEVKAQAFPPASVEPYNLILEEMLQHVAMVAQISPAQVTGKMINVSAEALAAAEANQQRKLA 375 (501) T ss_pred hhcccceeccC------CCCceEEEecccChHHHHHHHHHHHHHHHhhcCCChhhhccccCChHHHHHHHHHHHHHHHHH Confidence 34446677765 235788899999999999999999999988776664 344568999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCCCCC--ceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCC Q lcl|NC_020844. 360 QWAAGLKDALENALYITNLWMDAPDT--NPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFD 437 (455) Q Consensus 360 ~~a~~~~~a~~~~l~~~a~~~g~~~~--~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d 437 (455) .+...|+.+|++++++++.+.|.... ...+++.+........++.++++.+++++| ||.+|++..+ +.++ T Consensus 376 ~k~~~f~~~l~~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~g-is~et~~~~~-------~g~~ 447 (501) T protein:vir:25 376 AKRESFGESWEQLLRLAAEMDDDPDTAADSGAEVLWRDTEARSFGAVVDGITKLASAG-IPIEHLLSMV-------PGMT 447 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcC-CCHHHHHHHc-------CCCC Confidence 99999999999999999999885432 234455433332334478999999999888 6999998876 3444 Q ss_pred HHHHHHHHHhhcc----cCCCC Q lcl|NC_020844. 438 PEEESERLISELP----GGIEE 455 (455) Q Consensus 438 ~e~e~~ri~~e~~----~~l~~ 455 (455) +++ ++|++++.. .++-+ T Consensus 448 ~~~-ie~~~~~~~e~~~~~~~~ 468 (501) T protein:vir:25 448 QQT-IQAIKDSLRGGEVKSLVD 468 (501) T ss_pred HHH-HHHHHHHHHHHhHHHHHH Confidence 432 233332211 11111 No 58 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=100.00 E-value=5e-42 Score=247.02 Aligned_cols=381 Identities=13% Similarity=0.117 Sum_probs=264.0 Q ss_pred cCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCCceecc--CCHHHHHHHHhhccCCCCHHHHHHHHHHHHHh Q lcl|NC_020844. 40 YVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGREVEVTE--GTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVA 117 (455) Q Consensus 40 yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~p~i~~--~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~ 117 (455) |||+.-. ++|+.+.+++ ..||++.+|++++++++....+..+ .+..+.+++ +.|+++.++..+++.+++ T Consensus 1 ~l~~~~~---~~~~~~~~~~-v~n~~~~ivd~~~~~l~~~gf~~~d~~~~~~~~~i~-----~~N~~d~~~~~~~~~a~i 71 (434) T protein:vir:98 1 MLPKNAE---QAFLDFQRKA-RTNFCGLIANASVHRLLALGVTGPDGEPDTRASRWW-----QANRLDSRQKLVWRMAMA 71 (434) T ss_pred CCCCCcc---HHHHHhhhhh-hccchHHHHHHHHhhhccCceecCCCchHHHHHHHH-----HhcChhHHHHHHHHHHhh Confidence 8886544 7777766554 4699999999999999877766532 222344444 347899999999999999 Q ss_pred hCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEee----eeEEEEEecCeEEEE Q lcl|NC_020844. 118 DSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWED----AETILVLRPDDWERW 193 (455) Q Consensus 118 ~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~----~e~~~v~~~~~~~~~ 193 (455) +|++|++|+.++.+... ....+|.++.++|++++....+..+ + .+..+++.+. .....+|.++....| T Consensus 72 ~G~ay~~v~~~~~~~~~------~~~~~~~I~~~~p~~~~~i~D~~~~-~-~~~ai~~~~~~~~~~~~~~~~~~~~~~~~ 143 (434) T protein:vir:98 72 QSAGYMLVGAHPTRTED------NGRPSPLITMEHPSECIVEYDPETG-E-PLVGLKVWHNDIDGFGYARVFFDDTSFPY 143 (434) T ss_pred cCceEEEEecCCCcccc------cCCceeEEEEeccceeEEEEeCCCC-c-eEEEEEEEEeccCCceEEEEEEeCcEEEE Confidence 99999999876544311 1234688999999998863333333 3 4444444321 123345555544333 Q ss_pred EEe--CC-------cEEEE----eccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcC Q lcl|NC_020844. 194 RKN--DA-------GIWEV----DEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTA 260 (455) Q Consensus 194 ~~~--~~-------g~~~~----~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~ 260 (455) ... .. ..|+. ....+|++|+||||||++++.... .+.+.+.++.++..+..+..|+...++++.+ T Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~--~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a 221 (434) T protein:vir:98 144 RTRERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGE--DPEPEFAGVLDIQDRVNLGILNRMAASRFSG 221 (434) T ss_pred EEeeccccccccccccceecccccccccCCCCccceEEeccCCCcCc--CCcchhhhHHHHHHHHHHHHHHHHHHHHHhc Confidence 321 11 11221 234579999999999988765333 2455567777777788889999999999999 Q ss_pred CceEEEecCCCccccccCccce------eeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_020844. 261 FPMLSANGVEPDTDSEGSPKPL------DTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQP 334 (455) Q Consensus 261 ~p~~~~~G~~~~~~~~~~~~~~------~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~ 334 (455) +|+++++|.+.....+...+.+ ..+.+.+|.+| +.+++|.|++.+.++.+.+.|+.+..++...+..+ T Consensus 222 ~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~------~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p 295 (434) T protein:vir:98 222 FRQKWIKGHKFAKRTDPATGMTVVDQPFVPSPSAVWASE------GENTQFGQLDATDLSGFLKEHASDVRDMLTISQTP 295 (434) T ss_pred chhhhhcCCCcccccccccccchhhhhhhccccccccCC------CCCceEEEecCcchHHHHHHHHHHHHHHhcccCCC Confidence 9999999988776544433332 23344556554 34688999999999999999999988888776655 Q ss_pred cc---CCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHH Q lcl|NC_020844. 335 LT---AQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTM 411 (455) Q Consensus 335 l~---~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~ 411 (455) .. +..+|.||+|+++....+......+...|+.+|++++++++.+.|.+.....+++.+.-......++.++++.++ T Consensus 296 ~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~~~~~~~~~v~w~~~~~~s~~~~ada~~kl 375 (434) T protein:vir:98 296 TYLYATDLVNISADTIGALDILHVAKVREHIASFSEGLESVLALAAAQAGVPEDYTEAEVRWANPAHVTMAVKADAATKL 375 (434) T ss_pred HHHhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCChhheeeeEEecCCCCCCHHHHHHHHHHH Confidence 33 334589999999999999999999999999999999999999999765544555553222233347899999999 Q ss_pred HHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccC-C---------CC Q lcl|NC_020844. 412 RQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGG-I---------EE 455 (455) Q Consensus 412 ~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~-l---------~~ 455 (455) +++| +|.+++++.| |+ + ++|++|+++|..+. + .+ T Consensus 376 ~~~g-~~~e~~~~~l---g~-----~-~~e~~r~~~e~~~~~~~~~~~~~~~~~ 419 (434) T protein:vir:98 376 KSIG-YPLDVIAEEL---DE-----S-PARVRRIVAGAASQALLAASLLPAPGA 419 (434) T ss_pred HhcC-CcHHHHHHhC---CC-----C-HHHHHHHHHHHHHHHHHHHhhhccCCC Confidence 9888 5999987755 22 2 34566666553321 0 01 No 59 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=100.00 E-value=1.7e-41 Score=244.06 Aligned_cols=404 Identities=13% Similarity=0.026 Sum_probs=294.6 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGRE 80 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~ 80 (455) |....++.=-..+....+++.++.+.|.|.+.++ |||. .-.+.++.+.+ . ..||++.+|++++++++-.. T Consensus 1 m~~~~i~~L~~~~~~~~~r~~~~~~yy~g~~~~~-----~~~~---~~p~~~~~~~~-~-v~nw~~~~Vd~~a~rl~~~G 70 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFKTGVDKRYRYYAMDDRDD-----TRSI---VMPNNVREMYR-S-VLEWTAKGVDSLADRIIFRE 70 (422) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhcCCChh-----hcCc---cccHHHHHHHH-h-hcchhHHHHHHHHhccccce Confidence 9988899999999999999999999999986653 4543 33455655433 3 46999999999999987766 Q ss_pred ceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhccce Q lcl|NC_020844. 81 VEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILEIQ 160 (455) Q Consensus 81 p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w~ 160 (455) .+. .+..+.+++. -|+|+..+..+++.||+||+||++|...+.. .+|.++.++|++++.+. T Consensus 71 f~~--~d~~l~~~w~-----~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~------------~~p~i~~~sp~~~~~i~ 131 (422) T protein:vir:97 71 FTN--DDFNAWEIFK-----ANNPDIFFDTAIQSALIASCCFVYIMPGAED------------GLPKMQVIEASKATGIL 131 (422) T ss_pred eeC--CchhHHHHHH-----hcChHHHHHHHHHHHHHhcceeEEEeeCCCC------------CeeEEEEechhhEEEEE Confidence 554 3445555553 3789999999999999999999999643321 14789999999998765 Q ss_pred eeccCCeeEEEEEEEEeee----eEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchH- Q lcl|NC_020844. 161 SERQGAGERITYMRMWEDA----ETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPM- 235 (455) Q Consensus 161 ~~~~~~~~~l~~v~~~E~~----e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL- 235 (455) .+. .++....+.++..+. ..+.+++...+..+ ..+|.+. ..+|++|+||+|||++++...++ .+.+.+ T Consensus 132 D~~-~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~---~~~~~~g~vPvv~~~n~~~~~~~-~G~s~I~ 204 (422) T protein:vir:97 132 DPT-TFLLTEGYAILESDSNGNPTLEAYFTDKDIWYY--PKKGKPY---NIKNPTGHPLLVPIIHRPDAVRP-FGRSRIT 204 (422) T ss_pred eCC-CCcceeeEEEEEecCCCcEEEEEEEcCceEEEE--cCCCccc---cccCCCCCcceEEecccCCCccc-cCccccc Confidence 443 444444444443221 12333444433333 3334332 34799999999999977655443 456665 Q ss_pred HHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEeeCchhHHH Q lcl|NC_020844. 236 KDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKF 315 (455) Q Consensus 236 ~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~ 315 (455) .++..+..+..+..++...+.+|.++|++|++|++.+.. ..+......+++|.+|.+.+++ .+++-|+++.+++. T Consensus 205 e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~---~~~~~~~~~~~i~~~~~de~~~--~~~v~q~~~~~l~~ 279 (422) T protein:vir:97 205 KAGMYHQKAAKRTLERAEVTAEFYSFPQKYVLGMDPDAK---PMEKWRATVSTLLEISKDEDGD--KPTVGQFTTASMAP 279 (422) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcccCcccc---cCchhhhhhhhhhccCCCCCCC--cceeeecCCCChhH Confidence 456677778899999999999999999999999865331 2234455556899999876544 47778999999999 Q ss_pred HHHHHHHHHHHHHHhhhhhccC---CCcc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC------ Q lcl|NC_020844. 316 LAEQIESVSKEIRELGRQPLTA---QSGN-LTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDT------ 385 (455) Q Consensus 316 ~~~~l~~~~~~m~~~g~~~l~~---~~~~-~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~------ 385 (455) +.+.|+.+..+++..+..+... .+.| .||+|++.....+......+...|+.+|++++|+++...|..+. T Consensus 280 ~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~ 359 (422) T protein:vir:97 280 FMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPYLRNQFM 359 (422) T ss_pred HHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccchhhc Confidence 9999999999998887776433 3344 79999999999999999999999999999999999888774321 Q ss_pred ceEEEEecccCCCCC-CHHHHHHHHHHHHc--CCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccC Q lcl|NC_020844. 386 NPEADVYTDFRADTG-EEQTPGYLLTMRQN--GDISREGLWMEFQRLGELSDNFDPEEESERLISELPGG 452 (455) Q Consensus 386 ~~~v~v~~df~~~~~-~~~~~~al~~~~~~--G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~ 452 (455) +..+...+.+..... .++.++++.|++++ |+++.+++++.| |+ .+++.|..|++++...| T Consensus 360 ~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~l---g~----~~~~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 360 DTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLT---GV----KGADKPIPAITEVTTDG 422 (422) T ss_pred cceEEEccCCCCChHHHHHHHHHHHHHHhhccccccHHHHHHHc---CC----CchhHHHHHHHhhhccC Confidence 112322222222211 36778999999998 889999998877 33 35678888998887666 No 60 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=100.00 E-value=1.1e-41 Score=245.10 Aligned_cols=414 Identities=10% Similarity=0.055 Sum_probs=275.2 Q ss_pred CCCCCCCccCHH-------------HHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHH--HHhhccccCChH Q lcl|NC_020844. 1 MPEQDPTKRSAD-------------AEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYD--LRKSVAKMTNVF 65 (455) Q Consensus 1 m~~~~v~~~~p~-------------y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~--~rl~ra~~~n~~ 65 (455) |+..=.+..-.. .....++.+.+++.|.|.+.+..+...+.-.........++ +| +..|++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnk----i~~nf~ 76 (537) T protein:vir:78 1 MTSPLLNKPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVK----ISHGFF 76 (537) T ss_pred CCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccc----cccchH Confidence 432211111111 12334667778888888876643322211111111111111 12 347999 Q ss_pred HHHHHHHhhhhhcCCceeccC---CHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhc Q lcl|NC_020844. 66 RDVLEGLASKPFGREVEVTEG---TGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQA 142 (455) Q Consensus 66 ~~~v~~~~g~lf~k~p~i~~~---~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~ 142 (455) +.+|++.+||+|++||+++.. ...+...+..+. +++++.....+++.++++|++|.+++.++.+. T Consensus 77 k~Ivd~~~~yl~G~Pv~~~~~d~~~~e~~~~l~~~~--~~~~~~~~~el~~~~s~~G~ay~~~y~de~~~---------- 144 (537) T protein:vir:78 77 TELVDQLAQYLLSNGVEVKVKDEDNTQLDEILQEYF--DEDFQATIDTLVTNASKKGFEGIFARTTSEGK---------- 144 (537) T ss_pred HHHHHHHhhhhcccCceeecCcchhHHHHHHHHHHh--hccHHHHHHHHHHHHhhcCeeEEEeeecCCCc---------- Confidence 999999999999999999743 234455555442 46788889999999999999999999877654 Q ss_pred cCCceEEEechhhhccceeeccCCeeEEEEEEEE----------eeeeEEEEEecCeEEEEEEeCCcEEE---------- Q lcl|NC_020844. 143 GVRPYWSQVNHSAILEIQSERQGAGERITYMRMW----------EDAETILVLRPDDWERWRKNDAGIWE---------- 202 (455) Q Consensus 143 ~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~----------E~~e~~~v~~~~~~~~~~~~~~g~~~---------- 202 (455) +.++.++|++++....+ .+....+..+... ..+..+.+|+++.++.|+...++.+. T Consensus 145 ---~~~~~i~p~~~~pv~d~-~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~ 220 (537) T protein:vir:78 145 ---LKFQTVDGLTLIPVFDD-YGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNP 220 (537) T ss_pred ---eEEEEEccceeEEEEcC-CCCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccc Confidence 45788999998763323 2333333322211 12346778999998888766544211 Q ss_pred ---------------------EeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCC Q lcl|NC_020844. 203 ---------------------VDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAF 261 (455) Q Consensus 203 ---------------------~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~ 261 (455) .....+|+||.||||+|+++....+.+....+|+|..+. ..|++.+.+.+.+. T Consensus 221 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~------~~S~~an~~~~~~~ 294 (537) T protein:vir:78 221 NPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDV------MNCFLSNNLQDFSE 294 (537) T ss_pred cccceeeeccccccccccccccccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHH------HHHhhhhHHHHhcC Confidence 122346899999999999888776777677777776665 55888888899999 Q ss_pred ceEEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc--cCCC Q lcl|NC_020844. 262 PMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL--TAQS 339 (455) Q Consensus 262 p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l--~~~~ 339 (455) |+++++|...++.+++..+.. ...++.++ +.+++++|+..+. +.++.+..++++++.|+.++..+. .... T Consensus 295 ~ilvi~g~~~~~~~~~~~~l~---~~~~i~v~----~d~~~v~~l~~~~-~~~~~e~~ld~L~~~I~~~s~~~~~~~~~~ 366 (537) T protein:vir:78 295 AIYVVKGFSGDSTDKLRQNIK---AKKMIGVN----GDNAGMEIQTVSI-PYEARKAKMDIDVENIYRSGMGFNSTAVGD 366 (537) T ss_pred ceeeeecCCCccchhHHHHHh---hcCceeec----CCCCceeEEEecC-CHHHHHHHHHHHHHHHHHhcCCCCCccccc Confidence 999999998776555443322 22344454 2356899998776 447889999999999999976553 3345 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC----ceEEEEecccCCCCCCHHHHHHHHHHHHcC Q lcl|NC_020844. 340 GNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDT----NPEADVYTDFRADTGEEQTPGYLLTMRQNG 415 (455) Q Consensus 340 ~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~----~~~v~v~~df~~~~~~~~~~~al~~~~~~G 415 (455) +|.||+|.++....+......+.+.|..+|++++++++.+++.... ...+++...........+.++.+.++.+.| T Consensus 367 gn~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~l~~~g 446 (537) T protein:vir:78 367 GNVTNVVIKSRYTLLAMKARKMETSLRKVLRWCADMVVSDIALRGLGEYDSNDICFEIEPHVLANELDIATTRKTEAETE 446 (537) T ss_pred cCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccceeeEEeccCCCCCHHHHHHHHHHHHhcC Confidence 7899999999999999999999999999999999999988765321 112333322222223356788888888999 Q ss_pred CCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 416 DISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 416 ~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) +||+||+++.| |...|+|.| +++++|.+...++ T Consensus 447 iiS~eT~l~~~------p~vdd~e~e-k~~~ee~~~~~~~ 479 (537) T protein:vir:78 447 ALKIGNIMTVA------PRIGDDETL-KLIAEELDLDYNE 479 (537) T ss_pred cchHHHHHHhC------CCCCCHHHH-HHHHHHHHhhhhh Confidence 99999998766 555566443 4444443332222 No 61 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=100.00 E-value=2.5e-41 Score=243.18 Aligned_cols=408 Identities=13% Similarity=0.083 Sum_probs=279.7 Q ss_pred CCCCCCC----ccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhh Q lcl|NC_020844. 1 MPEQDPT----KRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKP 76 (455) Q Consensus 1 m~~~~v~----~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~l 76 (455) |...++. +--..+....++++++++.|.|.+.+ .|+|+.-. +.++.+..+. ..||++.+|++.+|++ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i-----~~~~~~~~---~~~~~~~~k~-~~n~~~~ivd~~~~~l 71 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPL-----PELTRNTS---AAWRSFQREA-RTNWGLMVRDSVADRI 71 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-----hhcCcccC---hhhhhhhhhh-hcchHHHHHHHHHhhh Confidence 8866542 23345566779999999999997554 35554433 3455443333 3799999999999999 Q ss_pred hcCCceeccCC--H---HHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEe Q lcl|NC_020844. 77 FGREVEVTEGT--G---PSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQV 151 (455) Q Consensus 77 f~k~p~i~~~~--~---~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~ 151 (455) +.+|+++...+ + .+..+++ -|+++.+...+++.+++||++|++|+.++.+. |.++.+ T Consensus 72 ~~~~~~~~~~~d~~~~~~~~~i~~-----~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~-------------~~i~~~ 133 (456) T protein:vir:10 72 IPNGITVGGSADSDLALRARRIWR-----DNRMDSVCKQWVKYGLDFGESYLTCWRRDDGT-------------ATITAD 133 (456) T ss_pred ccCCeecCCCCCcchHHHHHHHHH-----hcChhhHHHHHHHHHhhcCeeEEEEeeCCCCc-------------eEEEEE Confidence 99999885321 1 2344443 47899999999999999999999998665543 678899 Q ss_pred chhhhccceeeccCCeeEEEEEEEEeeee----EEEEEecCe-EEEEE-------------EeCCcEEEEeccCCCCCCc Q lcl|NC_020844. 152 NHSAILEIQSERQGAGERITYMRMWEDAE----TILVLRPDD-WERWR-------------KNDAGIWEVDEFGRITIGV 213 (455) Q Consensus 152 ~~~~ii~w~~~~~~~~~~l~~v~~~E~~e----~~~v~~~~~-~~~~~-------------~~~~g~~~~~~~g~~~l~~ 213 (455) +|++++....+.. ++..+..+++.+..+ ...+|.+.. .+.|+ ....+.|+.....+|.+++ T Consensus 134 ~p~~~~~i~d~~~-~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (456) T protein:vir:10 134 SPETMVVSVDPLQ-PWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSP 212 (456) T ss_pred ccceeEEEEcCCC-CcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCc Confidence 9999886544333 344444555544321 122222222 12211 1234566777777899999 Q ss_pred eeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcc---ccccCc----cceeecc Q lcl|NC_020844. 214 VPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDT---DSEGSP----KPLDTGP 286 (455) Q Consensus 214 vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~---~~~~~~----~~~~~g~ 286 (455) +|||+|. |+.. .+.+.++.++..+..+..|+....+++.++|+++++|..... +..+++ +.+..+. T Consensus 213 ~pvv~~~-N~~g------~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~ 285 (456) T protein:vir:10 213 PPVVVYQ-NPDG------MGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAP 285 (456) T ss_pred eeEEEec-CCCC------CchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhc Confidence 9999873 3332 233445555555777788898899999999999999976443 222211 1122344 Q ss_pred ceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhcc---CCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 287 DTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLT---AQSGNLTRISAAQAASKGNSAVQQWAA 363 (455) Q Consensus 287 ~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~---~~~~~~sa~a~~~~~~~~~s~L~~~a~ 363 (455) +.+|.+|. ++++.|++.++++.+.+.++.+..+|..++..+.. +.++|.||+|+++....+......+.. T Consensus 286 ~~~~~~~~-------~~~~~q~~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~ 358 (456) T protein:vir:10 286 GALWELPP-------GVDIWESQANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLS 358 (456) T ss_pred cccccCCC-------CcceEEecccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHH Confidence 55666653 24556777888899999999999999888776643 345688999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHH Q lcl|NC_020844. 364 GLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESE 443 (455) Q Consensus 364 ~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ 443 (455) .|+.+|++++|+++.+.|..+ ...+++.+.-......++.++++.+++++|++|++|+++.| |+.+.++ .++|++ T Consensus 359 ~f~~~l~~~~rl~~~~~g~~~-~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~l---g~~~~~i-~~~e~e 433 (456) T protein:vir:10 359 IAKIGLEAILVKALQIEGESV-EDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNIL---NYNADQI-KQDDLD 433 (456) T ss_pred HHHHHHHHHHHHHHHhcCCCc-ccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhC---CCCHHHH-HHHHHH Confidence 999999999999999888543 23445543222223347889999999999999999987644 4433222 456888 Q ss_pred HHHhhcccCCCC Q lcl|NC_020844. 444 RLISELPGGIEE 455 (455) Q Consensus 444 ri~~e~~~~l~~ 455 (455) |+++|..+...+ T Consensus 434 r~~~e~~~~~~~ 445 (456) T protein:vir:10 434 RAREQITLFAGN 445 (456) T ss_pred HHHHHHHHHhhh Confidence 998886543333 No 62 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=100.00 E-value=2.5e-41 Score=243.18 Aligned_cols=408 Identities=13% Similarity=0.083 Sum_probs=279.7 Q ss_pred CCCCCCC----ccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhh Q lcl|NC_020844. 1 MPEQDPT----KRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKP 76 (455) Q Consensus 1 m~~~~v~----~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~l 76 (455) |...++. +--..+....++++++++.|.|.+.+ .|+|+.-. +.++.+..+. ..||++.+|++.+|++ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i-----~~~~~~~~---~~~~~~~~k~-~~n~~~~ivd~~~~~l 71 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPL-----PELTRNTS---AAWRSFQREA-RTNWGLMVRDSVADRI 71 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-----hhcCcccC---hhhhhhhhhh-hcchHHHHHHHHHhhh Confidence 8866542 23345566779999999999997554 35554433 3455443333 3799999999999999 Q ss_pred hcCCceeccCC--H---HHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEe Q lcl|NC_020844. 77 FGREVEVTEGT--G---PSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQV 151 (455) Q Consensus 77 f~k~p~i~~~~--~---~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~ 151 (455) +.+|+++...+ + .+..+++ -|+++.+...+++.+++||++|++|+.++.+. |.++.+ T Consensus 72 ~~~~~~~~~~~d~~~~~~~~~i~~-----~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~-------------~~i~~~ 133 (456) T protein:vir:10 72 IPNGITVGGSADSDLALRARRIWR-----DNRMDSVCKQWVKYGLDFGESYLTCWRRDDGT-------------ATITAD 133 (456) T ss_pred ccCCeecCCCCCcchHHHHHHHHH-----hcChhhHHHHHHHHHhhcCeeEEEEeeCCCCc-------------eEEEEE Confidence 99999885321 1 2344443 47899999999999999999999998665543 678899 Q ss_pred chhhhccceeeccCCeeEEEEEEEEeeee----EEEEEecCe-EEEEE-------------EeCCcEEEEeccCCCCCCc Q lcl|NC_020844. 152 NHSAILEIQSERQGAGERITYMRMWEDAE----TILVLRPDD-WERWR-------------KNDAGIWEVDEFGRITIGV 213 (455) Q Consensus 152 ~~~~ii~w~~~~~~~~~~l~~v~~~E~~e----~~~v~~~~~-~~~~~-------------~~~~g~~~~~~~g~~~l~~ 213 (455) +|++++....+.. ++..+..+++.+..+ ...+|.+.. .+.|+ ....+.|+.....+|.+++ T Consensus 134 ~p~~~~~i~d~~~-~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (456) T protein:vir:10 134 SPETMVVSVDPLQ-PWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSP 212 (456) T ss_pred ccceeEEEEcCCC-CcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCc Confidence 9999886544333 344444555544321 122222222 12211 1234566777777899999 Q ss_pred eeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcc---ccccCc----cceeecc Q lcl|NC_020844. 214 VPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDT---DSEGSP----KPLDTGP 286 (455) Q Consensus 214 vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~---~~~~~~----~~~~~g~ 286 (455) +|||+|. |+.. .+.+.++.++..+..+..|+....+++.++|+++++|..... +..+++ +.+..+. T Consensus 213 ~pvv~~~-N~~g------~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~ 285 (456) T protein:vir:10 213 PPVVVYQ-NPDG------MGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAP 285 (456) T ss_pred eeEEEec-CCCC------CchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhc Confidence 9999873 3332 233445555555777788898899999999999999976443 222211 1122344 Q ss_pred ceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhcc---CCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 287 DTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLT---AQSGNLTRISAAQAASKGNSAVQQWAA 363 (455) Q Consensus 287 ~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~---~~~~~~sa~a~~~~~~~~~s~L~~~a~ 363 (455) +.+|.+|. ++++.|++.++++.+.+.++.+..+|..++..+.. +.++|.||+|+++....+......+.. T Consensus 286 ~~~~~~~~-------~~~~~q~~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~ 358 (456) T protein:vir:10 286 GALWELPP-------GVDIWESQANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLS 358 (456) T ss_pred cccccCCC-------CcceEEecccChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHH Confidence 55666653 24556777888899999999999999888776643 345688999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHH Q lcl|NC_020844. 364 GLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESE 443 (455) Q Consensus 364 ~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ 443 (455) .|+.+|++++|+++.+.|..+ ...+++.+.-......++.++++.+++++|++|++|+++.| |+.+.++ .++|++ T Consensus 359 ~f~~~l~~~~rl~~~~~g~~~-~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~l---g~~~~~i-~~~e~e 433 (456) T protein:vir:10 359 IAKIGLEAILVKALQIEGESV-EDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNIL---NYNADQI-KQDDLD 433 (456) T ss_pred HHHHHHHHHHHHHHHhcCCCc-ccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhC---CCCHHHH-HHHHHH Confidence 999999999999999888543 23445543222223347889999999999999999987644 4433222 456888 Q ss_pred HHHhhcccCCCC Q lcl|NC_020844. 444 RLISELPGGIEE 455 (455) Q Consensus 444 ri~~e~~~~l~~ 455 (455) |+++|..+...+ T Consensus 434 r~~~e~~~~~~~ 445 (456) T protein:vir:10 434 RAREQITLFAGN 445 (456) T ss_pred HHHHHHHHHhhh Confidence 998886543333 No 63 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=100.00 E-value=5.2e-40 Score=235.94 Aligned_cols=391 Identities=13% Similarity=0.053 Sum_probs=276.0 Q ss_pred HHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCCceeccCCHHHHH Q lcl|NC_020844. 13 AEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGREVEVTEGTGPSED 92 (455) Q Consensus 13 y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~p~i~~~~~~l~~ 92 (455) .....++.+++.+.|.|.+.++ ||+ .+-.++|+.+. ++ ..||++.+|+++++++.-...+. ++..+.+ T Consensus 1 l~~~~~r~~~~~~yY~g~~~~~-----~~~---~~~p~~~~~~~-~~-v~nw~~~~Vds~a~rl~~~Gf~~--~d~~l~~ 68 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAMQHYEA-----PTG---ITIPAHIRAKY-QA-VLGWAAKGVDSLADRLIFRAFAN--DDFNVTE 68 (410) T ss_pred CCcchhhHHHHHHHhcCCCCcc-----ccc---hhccHHHHhHH-Hh-hcchhHHHHHHhHhhhccccccC--CCchHHH Confidence 3344677888889999976543 343 23335666554 33 46999999999999997666543 3455666 Q ss_pred HHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEE Q lcl|NC_020844. 93 LLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITY 172 (455) Q Consensus 93 ~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~ 172 (455) ++. -|+|+.++..+++.||+||+||++|...+.+ +|.++.++|.+++.+..+. .+ +.... T Consensus 69 i~~-----~N~ld~~~~~~~~~al~~G~sf~~v~~~~d~-------------~~~i~~~sP~~~~~i~Dp~-~~-~~~~a 128 (410) T protein:vir:95 69 IFD-----RNNPDIFFDSAILSALIGSCSFVYISKGEDD-------------EVRLQVIESSNATGVIDPI-TG-LLVEG 128 (410) T ss_pred HHh-----hcChHHHHHHHHHHHHHhCceeEEEecCCCC-------------ceEEEEEcccceEEEEeCC-CC-ceEEE Confidence 663 4899999999999999999999999643322 3789999999998755443 32 34433 Q ss_pred EEEEe-----eeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchH-HHHHHHHHHHH Q lcl|NC_020844. 173 MRMWE-----DAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPM-KDALDLQVALF 246 (455) Q Consensus 173 v~~~E-----~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL-~dla~l~~~hy 246 (455) +++.+ ......+|+++.+..++.+ ++.| ..+|++|+||+|||++++.... ..+.+.+ .++..+..+.. T Consensus 129 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~----~~~~~~g~vPvV~f~n~~~l~~-~~G~s~I~~~v~~l~da~~ 202 (410) T protein:vir:95 129 YAVLARDDYNRPTLEAYFEPNATHFIPKD-GEPY----SVTNETGIPLLVPVIHRPDAVR-PFGRSRITRAGMYYQKYAK 202 (410) T ss_pred EEEEEecCCCeEEEEEEEeCCcEEEEeeC-Cccc----cccCCCCCcceEEecccccCCc-cCCccccchhHHHHHHHHH Confidence 33322 1234567888776655443 3444 3479999999999997765433 3466655 35666777889 Q ss_pred HhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHH Q lcl|NC_020844. 247 RKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKE 326 (455) Q Consensus 247 ~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~ 326 (455) +..++...+.+|.++|++|+.|++.+. ...+......+++|.+|.+.+++ .++|.|+++.+++.+.+.|+.+..+ T Consensus 203 r~~~~~~~~~e~~a~pqr~i~G~d~d~---~~~~~~~~~~~~i~~~~~~~~~~--~~~v~q~~~~~l~~~~~~l~~l~~~ 277 (410) T protein:vir:95 203 RTLERADITAEFYSWPQKYILGLDPDA---EPMEKWKATVSSLLTISSSDKGV--KPSVGQFTTASMSPFTEQLRTAAAG 277 (410) T ss_pred HHHHHHHHHHHHhcchhheeeccCCCC---CcCchhhhhhhhheeccCCCCCC--cceEEecCCCChHHHHHHHHHHHHH Confidence 999999999999999999999986532 12234555567899999876543 5778899999999999999999999 Q ss_pred HHHhhhhhccC---CCcc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCC----CceEEEEecc--cC Q lcl|NC_020844. 327 IRELGRQPLTA---QSGN-LTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPD----TNPEADVYTD--FR 396 (455) Q Consensus 327 m~~~g~~~l~~---~~~~-~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~----~~~~v~v~~d--f~ 396 (455) +...+..+... .+.| .||+|++.....+......+...|+.+|++++|++....+..+ ....+++.+. ++ T Consensus 278 ~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~d 357 (410) T protein:vir:95 278 FAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVRTAVKWEPLFE 357 (410) T ss_pred HhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccceeeEEeeecCC Confidence 98887776543 3344 7999999999999999999999999999999999887765322 2233444432 12 Q ss_pred CC-CCCHHHHHHHHHHHHc--CCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 397 AD-TGEEQTPGYLLTMRQN--GDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 397 ~~-~~~~~~~~al~~~~~~--G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) +. ...++.+++++|++++ |+++++++++.| |+ +++++..+..+|..+ -.| T Consensus 358 ~~~~s~a~~aDa~~Kl~~a~~g~~~~~~~~~~l---g~-----~~~~~~~~~~~e~~~-~g~ 410 (410) T protein:vir:95 358 ADANTMTMIGDGVVKLNQALPGYINAETIRDLT---GI-----AGDMSAKPVVSEGGS-NGE 410 (410) T ss_pred cchhhHHHHHHHHHHHHHhccCCccHHHHHHhc---CC-----ChHHHHHHHHHHHHh-CCC Confidence 21 2247899999999998 899999998876 33 333444333333332 222 No 64 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=100.00 E-value=3.9e-40 Score=236.64 Aligned_cols=414 Identities=10% Similarity=0.038 Sum_probs=282.0 Q ss_pred CCCCC---CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhh Q lcl|NC_020844. 1 MPEQD---PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPF 77 (455) Q Consensus 1 m~~~~---v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf 77 (455) |++.. +..=-..+....++++++.+.|.|.+.++ |+|. .-...+++.+ ...||++.+|++++++++ T Consensus 18 l~~~e~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~-----~~~~---~~p~~~~~~~---~v~n~~~~iVd~~a~rl~ 86 (504) T protein:vir:99 18 LNDDVVDKVNGLYQQLVDRTPRNLLRASFYDGKYAIR-----QIGN---LIPPEYLRTA---TVLGWSAKAVDTLARRCN 86 (504) T ss_pred CCHHHHHHHHHHHHHHHHHhHHHHHHHHHHhccccch-----hccc---cccHHHHHHh---hccCcHHHHHHHHHhhhc Confidence 44332 12223335666788889999999976653 4443 2334555333 247999999999999988 Q ss_pred cCCceeccCC---HHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechh Q lcl|NC_020844. 78 GREVEVTEGT---GPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHS 154 (455) Q Consensus 78 ~k~p~i~~~~---~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~ 154 (455) -...++.+.. ..+..++ +-|+|+.+...+++.+++||++|++|+..+.+. .+|.++.++|+ T Consensus 87 ~~Gf~~~d~~~~~~~l~~i~-----~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~-----------~~~~I~~~sP~ 150 (504) T protein:vir:99 87 LESFVWPDGDYGSIGGPDVW-----DENFFATKANNAMVSSLIHGPAFLINTEGGAGE-----------PDSLIHVKSAM 150 (504) T ss_pred cceeeCCCCChhhHHHHHHH-----HhcChhhHHHHHHHHHHhhCceeEEEecCCCCC-----------ceeEEEEeccc Confidence 7777664322 2344554 347899999999999999999999997554433 24778999999 Q ss_pred hhccceeeccCCeeEEEEEEE-Ee---eeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCccc Q lcl|NC_020844. 155 AILEIQSERQGAGERITYMRM-WE---DAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWV 230 (455) Q Consensus 155 ~ii~w~~~~~~~~~~l~~v~~-~E---~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~ 230 (455) +++....+.. ++....+.++ ++ +...+.+|+++.+..|+.++++.|.. +..+|++| ||||||++++... ... T Consensus 151 ~~~~iyD~~~-~~~~~a~~~~~~d~~g~~~~~~~y~~~~~~~~~~~~~~~~~~-~~~~~~~g-vPvV~~~n~~~~~-~~~ 226 (504) T protein:vir:99 151 QATGEWNSRR-NAMDSLLSITSRDAEGHPTGIALYEDGVTVTADMDDDGDWHA-DVRTHKLG-VPVEVLPYKPRED-RPL 226 (504) T ss_pred eeEEEEeCCC-CceeEEEEEEEecCCCeEEEEEEEcCCcEEEEEEcCCceeee-ccccCCCC-cceEEecccccCc-ccc Confidence 9875333333 3322222222 21 12346789999877777777787764 56689998 9999998765533 333 Q ss_pred CcchHH-HHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccc--cccCc-cceeeccceeeeccCCCCc---cccce Q lcl|NC_020844. 231 FHPPMK-DALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTD--SEGSP-KPLDTGPDTVLYAPPNAEG---QHGSW 303 (455) Q Consensus 231 ~~~pL~-dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~--~~~~~-~~~~~g~~~~~~lp~~~~~---~~~~~ 303 (455) +.+.+. ++..+..+..+..++...++++.++|++|+.|+.+++. .++++ +......+++|.+|.+..+ ++.++ T Consensus 227 G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 306 (504) T protein:vir:99 227 GSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILLGADAKNFRNKDGSMKPAWQIALARVFALPDDEDEPDAARARA 306 (504) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhccCCccccccccccccchhhhhhhhhhcCCCccccccccCccc Confidence 555553 55666668888999999999999999999999987653 22222 2344555678889876542 34568 Q ss_pred eEEeeCchhHHHHHHHHHHHHHHHHHhhhhhcc-----CCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 304 NFVEPATESLKFLAEQIESVSKEIRELGRQPLT-----AQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNL 378 (455) Q Consensus 304 ~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~-----~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~ 378 (455) +|.|+++.+++.+.+.|+.+..+|...+..+.. +..++.||+|++.....+......+...|+.+|++++|++.. T Consensus 307 ~~~q~~~~~l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~ 386 (504) T protein:vir:99 307 DVKQFPASSPQPHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALA 386 (504) T ss_pred eeeecCCCChHHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 899999999999999999999999888877643 234678999999999999999999999999999999999887 Q ss_pred HcCCC----CCceEEEEec-ccCCCCCCHHHHHHHHHHHHcCCC---CHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcc Q lcl|NC_020844. 379 WMDAP----DTNPEADVYT-DFRADTGEEQTPGYLLTMRQNGDI---SREGLWMEFQRLGELSDNFDPEEESERLISELP 450 (455) Q Consensus 379 ~~g~~----~~~~~v~v~~-df~~~~~~~~~~~al~~~~~~G~i---s~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~ 450 (455) +.+.. +....+++.+ +-.+. ..++.++++.+++++|.. +.+++++.| |+ +++ |++|++++.. T Consensus 387 ~~~~~~~~~~~~~~~~v~w~d~~~~-s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~l---g~-----~~~-ei~r~~~e~~ 456 (504) T protein:vir:99 387 IKNGLDRIPPEWKTIDSKFRSPLYL-SKAAQADAGAKMLGAGPEWLKETEVGLELL---GL-----TPQ-QAKRALAERR 456 (504) T ss_pred HhcCCCccccccccceeEecCCCcc-CHHHHHHHHHHHHhhccccccchHHHHhhc---CC-----CHH-HHHHHHHHHH Confidence 76532 2223344432 33333 237889999999999852 356776654 44 222 2334333211 Q ss_pred c----C----CCC Q lcl|NC_020844. 451 G----G----IEE 455 (455) Q Consensus 451 ~----~----l~~ 455 (455) . + |.+ T Consensus 457 ~~~~~~~~~~l~~ 469 (504) T protein:vir:99 457 RASSVSIIEALNR 469 (504) T ss_pred HHhhHHHHHHHhc Confidence 0 0 000 No 65 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=100.00 E-value=2.5e-40 Score=237.66 Aligned_cols=408 Identities=13% Similarity=0.097 Sum_probs=276.5 Q ss_pred CCCCCCCcc----CHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhh Q lcl|NC_020844. 1 MPEQDPTKR----SADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKP 76 (455) Q Consensus 1 m~~~~v~~~----~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~l 76 (455) |...++... --.+....++++++++.|.|.+.++ |+|+. ...+|+.+-.++ ..||++.+|+++++++ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~-----~~~~~---~~~~~~~~~~~~-~~n~~~~ivd~~~~~l 71 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLP-----ELTRN---TSAAWRSFQREA-RTNWGLMVRDSVADRI 71 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccCChh-----hcCcc---cChhhchhhhhh-hcchHHHHHHHHHhhh Confidence 776655322 2245566788999999999975543 44432 234455443332 3799999999999999 Q ss_pred hcCCceeccCC-----HHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEe Q lcl|NC_020844. 77 FGREVEVTEGT-----GPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQV 151 (455) Q Consensus 77 f~k~p~i~~~~-----~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~ 151 (455) +++|+++...+ +.+..+++ .|+++.+++.+++.++++|++|++|+.++.+. |.+..+ T Consensus 72 ~~~g~~~~~~~d~~~~~~~~~~~~-----~n~~d~~~~~~~~~a~~~G~a~~~~~~~edg~-------------~~i~~~ 133 (456) T protein:vir:79 72 IPNGITVGGSADSDLALRARRIWR-----DNRMDSVCKQWVKYGLDFGESYLTCWRRDDGT-------------ATITAD 133 (456) T ss_pred ccCCeecCCCCCccHHHHHHHHHH-----hcChhHHHHHHHHHHhhcCeeEEEEeeCCCCc-------------eEEEEe Confidence 99999875322 23444443 36899999999999999999999998665543 568899 Q ss_pred chhhhccceeeccCCeeEEEEEEEEeee----eEEEEEecCeE-EEEEE-------------eCCcEEEEeccCCCCCCc Q lcl|NC_020844. 152 NHSAILEIQSERQGAGERITYMRMWEDA----ETILVLRPDDW-ERWRK-------------NDAGIWEVDEFGRITIGV 213 (455) Q Consensus 152 ~~~~ii~w~~~~~~~~~~l~~v~~~E~~----e~~~v~~~~~~-~~~~~-------------~~~g~~~~~~~g~~~l~~ 213 (455) +|++++....+.. +......+++.... ....+|.++.. +.++. ...+.|......+|.+++ T Consensus 134 ~p~~~~~i~d~~~-~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (456) T protein:vir:79 134 SPETMVVSVDPLQ-PWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSP 212 (456) T ss_pred ccceeEEEEcCCC-CCceEEEEEEEEecCCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCc Confidence 9999886443333 33344444444322 12234443332 22211 123445666677899999 Q ss_pred eeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcc---ccccCc----cceeecc Q lcl|NC_020844. 214 VPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDT---DSEGSP----KPLDTGP 286 (455) Q Consensus 214 vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~---~~~~~~----~~~~~g~ 286 (455) ||||+|. |+...+. +.++.++..+..+..|+....+++.++|+++++|..... +..++. +....+. T Consensus 213 ~pvv~~~-N~~~~gd------~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~~~~~~ 285 (456) T protein:vir:79 213 PPVVVYQ-NPDGMGE------VEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASIFEAAP 285 (456) T ss_pred eeEEEec-CCCCCch------hhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccccccccchhhhhhhhc Confidence 9999874 4433333 334444444555677888889999999999999976442 222211 1233455 Q ss_pred ceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhcc---CCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 287 DTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLT---AQSGNLTRISAAQAASKGNSAVQQWAA 363 (455) Q Consensus 287 ~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~---~~~~~~sa~a~~~~~~~~~s~L~~~a~ 363 (455) +.+|.+|.+ +++.|++..+++.+.+.|+.+..+|+..+..+.. +.++|.||+|++.....+......+.. T Consensus 286 ~~~~~~~~~-------~~~~q~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~ 358 (456) T protein:vir:79 286 GALWELPPG-------VDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLS 358 (456) T ss_pred cccccCCCC-------cceeeecccChHHHHHHHHHHHHHHHhhcCCChhHhcccccCcHHHHHHHHHHHHHHHHHHHHH Confidence 566766543 4455777888888999999999999887766543 344689999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHH Q lcl|NC_020844. 364 GLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESE 443 (455) Q Consensus 364 ~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ 443 (455) .|+.+|+++++++..+.|..+ ...+++.+........++.+++++++.++|++|++|+++.| |+.+.++ .++|++ T Consensus 359 ~f~~~l~~~~~l~~~~~g~~~-~~~i~v~w~~~~~~s~~~~ada~~kl~~~G~~~~~~~~~~l---g~~~~~i-~~~e~~ 433 (456) T protein:vir:79 359 IAKIGLEAILVKALQIEGESV-EDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNIL---NYNADQI-KQDDLD 433 (456) T ss_pred HHHHHHHHHHHHHHHhcCCCc-cccceEEeCCCCCcCHHHHHHHHHHHHhcCCChHHHHHhcC---CCCHHHH-HHHHHH Confidence 999999999999999988654 23455543222222347889999999999999999886544 5533322 457788 Q ss_pred HHHhhcccCCCC Q lcl|NC_020844. 444 RLISELPGGIEE 455 (455) Q Consensus 444 ri~~e~~~~l~~ 455 (455) |+++|..+.+.+ T Consensus 434 r~~~e~~~~~~~ 445 (456) T protein:vir:79 434 RAREQITLFAGN 445 (456) T ss_pred HHHHHHHHHhhh Confidence 888886654444 No 66 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=100.00 E-value=1.4e-39 Score=233.65 Aligned_cols=390 Identities=13% Similarity=0.051 Sum_probs=282.4 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGRE 80 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~ 80 (455) |..+.+..=--.+....++++++.+.|.|.+.++ ||+. +-.++++.+.+ . ..||++.+|++++++++-.. T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~-----~~~~---~~p~~~~~~~~-~-v~nw~~~iVds~a~rl~~~G 70 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHKRRAEMRYDQYAMKYVDR-----FKGI---TIPQALSQQYR-S-ILGWCAKGVDSLADRLVFRE 70 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHhcccCchh-----hcCh---hhhHHHHHHHh-h-hcchhHHHHHHhHhhcccCc Confidence 8888888888888999999999999999987654 4432 22234443332 2 46999999999999987655 Q ss_pred ceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhccce Q lcl|NC_020844. 81 VEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILEIQ 160 (455) Q Consensus 81 p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w~ 160 (455) .+ .++..+.+++. -|+|+.++..+++.+++||+||++|...+.+ +|.++.++|.+++.+. T Consensus 71 f~--~~d~~l~~i~~-----~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg-------------~~~i~~~sp~~~~~i~ 130 (409) T protein:vir:94 71 FE--NDDFTVNEIFE-----ENNPDIFFDSAVLSSLIASCSFTYISKGEND-------------AVRLQVIEAVNATGII 130 (409) T ss_pred cc--CCchHHHHHHH-----hcChhHHHHHHHHHHHHhcceeEEEecCCCC-------------ceEEEEeccceEEEEE Confidence 43 34566777764 4789999999999999999999999744322 4789999999988755 Q ss_pred eeccCCeeEEEEEEEEee-----eeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchH Q lcl|NC_020844. 161 SERQGAGERITYMRMWED-----AETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPM 235 (455) Q Consensus 161 ~~~~~~~~~l~~v~~~E~-----~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL 235 (455) .+..+ +....+++.+. .....+|+++.+..+.. .++.|.. .+|++|+||+|||++++...+ ..+.+.+ T Consensus 131 D~~~~--~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~-~~~~~~~---~~n~~g~vPvV~f~n~~~~~~-~~G~s~I 203 (409) T protein:vir:94 131 DPITG--LLTEGYAVLERDENNNVVLEAHFLPDRTDYYYR-DSRNNIS---IANPTGHPLLVPIIHRPDAVR-PFGRSRI 203 (409) T ss_pred ecCCC--ceeeeEEEEEecCCCceEEEEEEecCcEEEEEe-cCceeEe---eeCCCCCcceEEecccccccc-ccCcccc Confidence 44333 34444444321 23445677776654433 3444543 369999999999997766544 3466666 Q ss_pred -HHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEeeCchhHH Q lcl|NC_020844. 236 -KDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLK 314 (455) Q Consensus 236 -~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~ 314 (455) .++..+..+..+..++...+.+|.++|++|++|++.+. ...+....+.+++|.+|.+.+++ ++++.|+++.+++ T Consensus 204 ~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~---~~~~~~~~~~~~i~~~~~d~dg~--~~~v~q~~~~~l~ 278 (409) T protein:vir:94 204 TRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDA---EPMETWKATVSSMLQFTKDEDGD--KPTLGQFTQPSMS 278 (409) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCC---cccchhhhhHHHhhcCCCCCCCC--CceEEecCCCChh Confidence 35667777889999999999999999999999986432 12234566667899999886544 5777899999999 Q ss_pred HHHHHHHHHHHHHHHhhhhhccC---CCcc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCC----Cc Q lcl|NC_020844. 315 FLAEQIESVSKEIRELGRQPLTA---QSGN-LTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPD----TN 386 (455) Q Consensus 315 ~~~~~l~~~~~~m~~~g~~~l~~---~~~~-~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~----~~ 386 (455) .+.+.++.+..+++..+..+.+. .+.| .||+|++.....+......+...|+.+|++++|++....+..+ +. T Consensus 279 ~~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~~~ 358 (409) T protein:vir:94 279 PFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPYLREQF 358 (409) T ss_pred HHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccccc Confidence 99999999999999887776543 2334 8999999988888888889999999999999999888776432 11 Q ss_pred eEEEEec--ccCCCCC-CHHHHHHHHHHHHcC--CCCHHHHHHHHHhcCCCCCC Q lcl|NC_020844. 387 PEADVYT--DFRADTG-EEQTPGYLLTMRQNG--DISREGLWMEFQRLGELSDN 435 (455) Q Consensus 387 ~~v~v~~--df~~~~~-~~~~~~al~~~~~~G--~is~et~~~~l~~~~vl~~~ 435 (455) ..+++.+ .+.+... -++.++++.|++++| +.+.+++++.| |+-.++ T Consensus 359 ~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~l---G~~~~d 409 (409) T protein:vir:94 359 RKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLT---GIEGGE 409 (409) T ss_pred ccceEEeccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHc---CCCCCC Confidence 2233332 2222211 257789999999998 66778887765 664444 No 67 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=100.00 E-value=3.1e-39 Score=231.67 Aligned_cols=414 Identities=11% Similarity=0.028 Sum_probs=282.6 Q ss_pred CCCCC---CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhh Q lcl|NC_020844. 1 MPEQD---PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPF 77 (455) Q Consensus 1 m~~~~---v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf 77 (455) |++.. +..---.+....+++.++.+.|.|.+.++ |||. .-...|+.+. ...||++.+|++++.++. T Consensus 12 l~~~~~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~-----~~~~---~~p~~~r~~~---~v~nw~~~~Vd~~a~rl~ 80 (474) T protein:vir:81 12 LSNDENALINGLLAQIENLRWKNLLRTSYYENKRTIQ-----YVGT---LIPPQYFNLG---LVLGWTGKAVDALARRCN 80 (474) T ss_pred CChhHHHHHHHHHHHHHHHhhHHHHHHHHhccCCChh-----hccc---cccHHHHHHH---hhcChHHHHHHHHHhhhc Confidence 55443 22223455566688888999999976653 4443 3345666443 247999999999999987 Q ss_pred cCCceecc---CCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechh Q lcl|NC_020844. 78 GREVEVTE---GTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHS 154 (455) Q Consensus 78 ~k~p~i~~---~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~ 154 (455) -...++.+ ++..+..++ +-|+|+.+...+++.+|+||++|++|..++.+.+ +|.++.++|. T Consensus 81 ~~Gf~~~d~~~~~~~l~~iw-----~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~-----------~~~i~~~sp~ 144 (474) T protein:vir:81 81 LEGFVWPDGDLDSLGGTEVV-----DDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEP-----------EALIHVKDAS 144 (474) T ss_pred ccceECCCCCccchHHHHHH-----HhcChhHHHHHHHHHHHhhCceeEEEecCCCCCc-----------eeEEEEeccc Confidence 76666533 223355555 3578999999999999999999999976544332 4789999999 Q ss_pred hhccceeeccCCeeEEEEEEEEe----eeeEEEEEecCeEEEEEEe-CCcEEEEeccCCCCCCceeEEEEEecccCCCcc Q lcl|NC_020844. 155 AILEIQSERQGAGERITYMRMWE----DAETILVLRPDDWERWRKN-DAGIWEVDEFGRITIGVVPMVPFITGRRKGKKW 229 (455) Q Consensus 155 ~ii~w~~~~~~~~~~l~~v~~~E----~~e~~~v~~~~~~~~~~~~-~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~ 229 (455) +++....+ ..++....+.++.. ......+|.++.+..++.. +++.|. .+..+|++| ||+|||++++...++ T Consensus 145 ~~~~~~D~-~~~~~~~al~~~~~~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~-~~~~~~~~g-vPvV~~~n~~~~~~~- 220 (474) T protein:vir:81 145 EATGEWNR-RRRGLNNLLSIIDKDKEGKVLSLALYLDNETVTAQRDKATLKWQ-VDRDEHVYG-VPAQVLPYKPAPKRP- 220 (474) T ss_pred eEEEEEeC-CCCcceeeeEEEEEcCCCcEEEEEEEeCCcEEEEEEcCccceee-eccCCCCCC-cceEEecccccccCc- Confidence 98863333 33343333333322 1234567888877665544 445564 466789998 799999987664443 Q ss_pred cCcchH-HHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcccc--ccCc-cceeeccceeeeccCCCCccc---cc Q lcl|NC_020844. 230 VFHPPM-KDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDS--EGSP-KPLDTGPDTVLYAPPNAEGQH---GS 302 (455) Q Consensus 230 ~~~~pL-~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~--~~~~-~~~~~g~~~~~~lp~~~~~~~---~~ 302 (455) .+.+.+ .++..+..+..+..++...+.+|.++|++|+.|+++.... ++++ +......+++|.+|.+.++.. .. T Consensus 221 ~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~ 300 (474) T protein:vir:81 221 FGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQLAR 300 (474) T ss_pred CCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCChhhcccccccccchhhhhHHHHhcCCCccccccccccc Confidence 456655 3666666788999999999999999999999999876533 2222 223344457888887764322 24 Q ss_pred eeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhccC----C-CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 303 WNFVEPATESLKFLAEQIESVSKEIRELGRQPLTA----Q-SGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITN 377 (455) Q Consensus 303 ~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~----~-~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a 377 (455) ++|.|++.++++.+.+.|+.+..++...+..+.+. + .++.||.+++.....+......+...|+.+|++++|++. T Consensus 301 ~~~~q~~~a~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~ 380 (474) T protein:vir:81 301 ADVKQFPAASPDAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRAL 380 (474) T ss_pred ccccccCCCChhHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 68899999999999999999999998888777442 2 344899999999999999999999999999999999999 Q ss_pred HHcCCCC------CceEEEEe-cccCCCCCCHHHHHHHHHHHHcC--CCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhh Q lcl|NC_020844. 378 LWMDAPD------TNPEADVY-TDFRADTGEEQTPGYLLTMRQNG--DISREGLWMEFQRLGELSDNFDPEEESERLISE 448 (455) Q Consensus 378 ~~~g~~~------~~~~v~v~-~df~~~~~~~~~~~al~~~~~~G--~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e 448 (455) ...|... ....+++. +|....+ .++.++++.|++++| +++++++++.+ |+ +++ +++|++.+ T Consensus 381 ~i~~~~~~~~~~~~~~~~~v~W~d~~~~s-~a~~aDa~~Kl~~a~~~~~~~~~~~~~l---g~-----t~~-~i~~~~~~ 450 (474) T protein:vir:81 381 AMKNKVAIDEIPDEWKSIDAKWRDPRYLS-KSAQADAGMKQLAAVPWLAETEVGLELI---GL-----TPQ-QARRAMAD 450 (474) T ss_pred HHhCCCCccccchhhccceeEecCCCccC-HHHHHHHHHHHHhcccCCCcHHHHHhhc---CC-----CHH-HHHHHHHH Confidence 9887421 12344444 3444433 488999999999986 45566665543 33 322 33333322 Q ss_pred cc-----cCCCC Q lcl|NC_020844. 449 LP-----GGIEE 455 (455) Q Consensus 449 ~~-----~~l~~ 455 (455) .. +-++. T Consensus 451 ~~~~~~~~~~~~ 462 (474) T protein:vir:81 451 KRRVQGRGTLQA 462 (474) T ss_pred HHHHhHHHHHHH Confidence 11 11111 No 68 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=100.00 E-value=1.2e-38 Score=228.53 Aligned_cols=391 Identities=13% Similarity=0.060 Sum_probs=279.6 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGRE 80 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~ 80 (455) |..+.+..=-..+....+++.++.+.|.|.+.++ ||+. +-.+.++.+.+ . ..||++.+|+++++++.-.. T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~-----~~~~---~~p~~~~~~~~-~-v~nw~~~iVds~a~rl~~~G 70 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHKRRAEMRYEQYAMKHVDR-----FKGI---TIPQALSQQYR-S-ILGWCAKGVDSLADRLVFRE 70 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHHhccCchh-----hcch---hhhHHHHHHHh-h-hcChhHHHHHHhHhhccccc Confidence 9888888888899999999999999999986654 4432 23345544432 2 46999999999999986655 Q ss_pred ceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhccce Q lcl|NC_020844. 81 VEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILEIQ 160 (455) Q Consensus 81 p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w~ 160 (455) .+ .++..+.++++ -|+|+..+..+++.|++||+||++|...+.+ +|.++.++|.+++... T Consensus 71 f~--~~d~~l~~i~~-----~N~ld~~~~~~~~~al~yG~sf~~v~~~~dg-------------~~~i~~~sP~~~~~i~ 130 (409) T protein:vir:16 71 FE--NDDFTVNEIFE-----ENNPDIFFDSTVLSALIASCSFTYISKGEND-------------AVRLQVIEATNATGII 130 (409) T ss_pred cc--CcchHHHHHHH-----hcChhHHHHHHHHHHHHhCceeEEEecCCCC-------------ceEEEEEcccceEEEe Confidence 43 34566777763 4899999999999999999999999743322 4789999999988654 Q ss_pred eeccCCeeEEEEEEEEe----eeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchH- Q lcl|NC_020844. 161 SERQGAGERITYMRMWE----DAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPM- 235 (455) Q Consensus 161 ~~~~~~~~~l~~v~~~E----~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL- 235 (455) .+..+ +....+.++.. ......+|.++....+.. .++.|. ..+|++|+||+|||++++...++ .+.+.+ T Consensus 131 D~~~~-~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~-~~~~~~---~~~~~~g~vPvV~f~n~~~~~~~-~G~seI~ 204 (409) T protein:vir:16 131 DPITG-LLTEGYAVLERDENNNVVLEAHFLPDRTDYYYR-DSRNNI---SIANPTGNPLLVPIIHRPDAVRP-FGRSRIT 204 (409) T ss_pred ecccc-cceeeeEEEEecCCCceEEEEEEecCcEEEEEe-cCcccc---ceecCCCCcceEEeccccccccc-CCccccc Confidence 33333 22222222222 123445677775544333 334343 34799999999999987665443 466665 Q ss_pred HHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEeeCchhHHH Q lcl|NC_020844. 236 KDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKF 315 (455) Q Consensus 236 ~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~ 315 (455) .++..+..+..+..++...+.+|.++|++|+.|++.+. ...+......+++|.+|.+.+++ .+++.|+++.+++. T Consensus 205 ~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~---~~~~~~~~~~~~i~~~~~d~~g~--~~~v~q~~~~~l~~ 279 (409) T protein:vir:16 205 RSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDA---EPMETWKATVSSMLQFTKDEDGD--KPTLGQFTQPSMSP 279 (409) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCC---CccchhhhhhhHhhccCCCCCCC--CceEEecCCCChhH Confidence 34666777889999999999999999999999986432 12234566667899999887544 47777999999999 Q ss_pred HHHHHHHHHHHHHHhhhhhccC---CCcc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC----ce Q lcl|NC_020844. 316 LAEQIESVSKEIRELGRQPLTA---QSGN-LTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDT----NP 387 (455) Q Consensus 316 ~~~~l~~~~~~m~~~g~~~l~~---~~~~-~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~----~~ 387 (455) +.+.++.+..+++..+..+.+. ++.| .||+|++.....+......+...|+.+|++++|++....|..+. .. T Consensus 280 ~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~ 359 (409) T protein:vir:16 280 FTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYLREQFS 359 (409) T ss_pred HHHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchhhc Confidence 9999999999999887776543 3345 79999999888888888999999999999999999888764321 12 Q ss_pred EEEEecc--cCCC-CCCHHHHHHHHHHHHcCC--CCHHHHHHHHHhcCCCCCC Q lcl|NC_020844. 388 EADVYTD--FRAD-TGEEQTPGYLLTMRQNGD--ISREGLWMEFQRLGELSDN 435 (455) Q Consensus 388 ~v~v~~d--f~~~-~~~~~~~~al~~~~~~G~--is~et~~~~l~~~~vl~~~ 435 (455) .+++.+. +.+. ..-++.++++.|++++|. ...+++++.| |+-..+ T Consensus 360 ~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~---g~~~~d 409 (409) T protein:vir:16 360 KTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLT---GIKGAE 409 (409) T ss_pred cceEEecCCCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhc---cCCCCC Confidence 3333331 1111 112788999999999964 3457776655 564444 No 69 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=100.00 E-value=1.2e-29 Score=179.12 Aligned_cols=416 Identities=10% Similarity=-0.005 Sum_probs=243.2 Q ss_pred CC-CCC-------C-CccCHHHHHHHHHHHHHHHHhcCHHHH-HHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHH Q lcl|NC_020844. 1 MP-EQD-------P-TKRSADAEAMSDYLQTVDDVLEGVTGM-RRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLE 70 (455) Q Consensus 1 m~-~~~-------v-~~~~p~y~~~~~~w~~i~d~~~G~~~~-r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~ 70 (455) |= .+. + -..|++.......|+ .+|.|.+.. +.....+-+ ..+..| -.-.|+++.+|+ T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~---~yy~g~~~~~~~~~~~~~~-------~~~~~~---~~~~n~~k~i~~ 82 (496) T protein:vir:38 16 MGLLKALKDVKDHKKVNANDEDYKYIDMWK---RLYQGHYAEWHNLNYEHNG-------NPVNRR---QLSMNLPKVTAK 82 (496) T ss_pred hccchhhHHHHhcCCCcCCHHHHHHHHHHH---HHhcCCCchhhcchhccCC-------Cccccc---eeecchHHHHHH Confidence 10 000 1 112666666666764 568886442 211111100 011111 123599999999 Q ss_pred HHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEE Q lcl|NC_020844. 71 GLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQ 150 (455) Q Consensus 71 ~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~ 150 (455) .+++++|++||+++...+..+++++++-. .+++++.++.+++.++.+|.+|+.|+.+..+. |.+.. T Consensus 83 ~~a~~l~~~p~~i~~~d~~~~e~l~~~~~-~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~-------------~~i~~ 148 (496) T protein:vir:38 83 YMSKLLFNEKVKINIDDKAAEEFVLNVLK-TNGFTKNMERYIEYGEAMGGFVIKVYHDGNKN-------------VKVSF 148 (496) T ss_pred HHhhhhhCCcceEeeCChHHHHHHHHHHh-ccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCc-------------EEEEE Confidence 99999999999998777777788777653 36799999999999999999999998765443 67899 Q ss_pred echhhhccceeeccCCeeEEEEEEEEe-eeeEEEE-----EecCe----EEEEEEeCC---cEEE----Ee-----ccCC Q lcl|NC_020844. 151 VNHSAILEIQSERQGAGERITYMRMWE-DAETILV-----LRPDD----WERWRKNDA---GIWE----VD-----EFGR 208 (455) Q Consensus 151 ~~~~~ii~w~~~~~~~~~~l~~v~~~E-~~e~~~v-----~~~~~----~~~~~~~~~---g~~~----~~-----~~g~ 208 (455) ++|++++....+. +.....+++.... .-..++. |..+. +++|+...+ |..+ +. ...- T Consensus 149 v~~~~~~P~~~~~-~~~~~~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~~ 227 (496) T protein:vir:38 149 ATADCMYPLSNDS-ENVDECVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPVVPL 227 (496) T ss_pred EcccceEEEEecC-CcEEEEEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCccccccccccccccceee Confidence 9999998744433 3333333332111 1111111 11222 233443322 1111 11 1123 Q ss_pred CCCCceeEEEEEeccc---CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEE----Ee---cCCCccccccC Q lcl|NC_020844. 209 ITIGVVPMVPFITGRR---KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLS----AN---GVEPDTDSEGS 278 (455) Q Consensus 209 ~~l~~vP~V~f~~~~~---~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~----~~---G~~~~~~~~~~ 278 (455) +++++.||+.|.++.. ..+...|.+.|.++.++..+.....|++.+.++.....+++ +. +..+.....+. T Consensus 228 ~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~ 307 (496) T protein:vir:38 228 PDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTTQYFD 307 (496) T ss_pred cCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhhcccceecchHHhhccCCCCCccccCCC Confidence 4678888887754322 22233467777777777777777778887777764434433 11 11111111111 Q ss_pred ccceeeccceee-eccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhh---hc-cCCCcchhHHHHHHHHHH Q lcl|NC_020844. 279 PKPLDTGPDTVL-YAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQ---PL-TAQSGNLTRISAAQAASK 353 (455) Q Consensus 279 ~~~~~~g~~~~~-~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~---~l-~~~~~~~sa~a~~~~~~~ 353 (455) . ..... .+..+..+.+..+..++++- ..+.+.+.++.+.+++...... .+ ...++..||++...+... T Consensus 308 ~------~~~~~~~~~~~~~~~~~~i~~~~~~i-~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~ 380 (496) T protein:vir:38 308 S------TDEAFFLYQGDQDDNGKAIKDISVEI-RSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSE 380 (496) T ss_pred C------ccceEEEeecCCCcccccceeecccc-CHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHH Confidence 1 11111 11111211222233343332 2366777777777777554322 12 234567899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-------cCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHH Q lcl|NC_020844. 354 GNSAVQQWAAGLKDALENALYITNLW-------MDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEF 426 (455) Q Consensus 354 ~~s~L~~~a~~~~~a~~~~l~~~a~~-------~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l 426 (455) +.+....+.+.++.+|+++++.+..+ -|....+..+++..+........++++.+.+++++|+||++|++..+ T Consensus 381 l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS~et~l~~~ 460 (496) T protein:vir:38 381 TYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIPLKIALQRA 460 (496) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhc Confidence 99999999999999999998876543 23333334455554333333446778999999999999999987654 Q ss_pred HhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 427 QRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 427 ~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) -++ .+.+.++|++||++|....+.+ T Consensus 461 --~~~--~d~ea~~el~ri~~E~~~~~~~ 485 (496) T protein:vir:38 461 --WNI--TEAEADEWAEMLAKEKQAEMPN 485 (496) T ss_pred --CCC--ChHHHHHHHHHHHHhhhccCcc Confidence 122 2223556999999998765544 No 70 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=100.00 E-value=5.7e-29 Score=175.45 Aligned_cols=416 Identities=10% Similarity=0.002 Sum_probs=246.2 Q ss_pred CC----------CCCCCccCHHHHHHHHHHHHHHHHhcCHHH-HHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHH Q lcl|NC_020844. 1 MP----------EQDPTKRSADAEAMSDYLQTVDDVLEGVTG-MRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVL 69 (455) Q Consensus 1 m~----------~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~-~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v 69 (455) |- ..+|. .|+++......|+ .+|.|.+. ++... |-..... ...++ .-.|+++.+| T Consensus 16 ~~~~~~~~~~~~~~~i~-~~~~~~~~i~~~~---~~Y~g~~~~~~~~~--~~~~~~~-----~~~~~---~s~n~~~~iv 81 (499) T protein:vir:80 16 MGLLKSLKDVTDHKKVN-ANDEDYKYIDMWK---RLYQGNYAEWHNLN--YEHNGNP-----VNRRQ---LSMNLPKVTA 81 (499) T ss_pred hccccchhhhhcCCCCc-CCHHHHHHHHHHH---HHhcCCcchhhccc--cccCCCc-----cccce---eecchHHHHH Confidence 21 12222 4777888888886 45888643 22111 1000000 01111 2269999999 Q ss_pred HHHhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEE Q lcl|NC_020844. 70 EGLASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWS 149 (455) Q Consensus 70 ~~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~ 149 (455) +.+++++|++||+++..++..+++++++-. .+++...++.+++.|+.+|.+|+.|+++..+ +|-+. T Consensus 82 ~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~-~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~-------------~~~i~ 147 (499) T protein:vir:80 82 KYMSKLLFNEKVKINIDDETAEEFVLNVLK-TNGFTKNMERYIEYGEAMGGFVIKVYHDGNK-------------NVKVS 147 (499) T ss_pred HHHHHhhhCCcceEeeCCHHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCcEEEEEEECCCC-------------cEEEE Confidence 999999999999998777778888777653 3679999999999999999999999987544 36689 Q ss_pred EechhhhccceeeccCCeeEEEEEEEE--ee--eeEEEE--EecCe---E----EEEEEeCC---c-EEE---Eecc--- Q lcl|NC_020844. 150 QVNHSAILEIQSERQGAGERITYMRMW--ED--AETILV--LRPDD---W----ERWRKNDA---G-IWE---VDEF--- 206 (455) Q Consensus 150 ~~~~~~ii~w~~~~~~~~~~l~~v~~~--E~--~e~~~v--~~~~~---~----~~~~~~~~---g-~~~---~~~~--- 206 (455) .++|.+++....+. +....++++... +. +.++.. |.... + ++|+.... | .+. +.+. T Consensus 148 ~v~a~~~~Pi~~d~-~~~~~~~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~ 226 (499) T protein:vir:80 148 FATADCMYPLSNDS-ENVDECLIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEP 226 (499) T ss_pred EEcCCceEEEEecC-CCeEEEEEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCC Confidence 99999998632332 333334333211 11 111111 21111 1 22332211 1 111 1111 Q ss_pred --CCCCCCceeEEEEEeccc---CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEE-------EecCCCccc Q lcl|NC_020844. 207 --GRITIGVVPMVPFITGRR---KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLS-------ANGVEPDTD 274 (455) Q Consensus 207 --g~~~l~~vP~V~f~~~~~---~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~-------~~G~~~~~~ 274 (455) .-++++++||+.|.++-. +.+...+.+.|.++.++..+.....|.+.+.++.....+++ ..+.++... T Consensus 227 ~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~g~~~ 306 (499) T protein:vir:80 227 VVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTT 306 (499) T ss_pred ceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhcccceecchhhhhccCCCCCCcc Confidence 124688899998865322 22233467777777777777777788888877776656655 112222211 Q ss_pred cccCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhh---h-ccCCCcchhHHHHHHH Q lcl|NC_020844. 275 SEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQ---P-LTAQSGNLTRISAAQA 350 (455) Q Consensus 275 ~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~---~-l~~~~~~~sa~a~~~~ 350 (455) ..+..+...+ ..++.+..+.+..+..++++-. .+++.+.++.+.+++...... . -...++++||++...+ T Consensus 307 ~~~~~~~~~~-----~~~~~~~~~~~~~i~~~~~~ir-~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~ 380 (499) T protein:vir:80 307 QYFDSTDEAF-----FLYQGEQDDNGKAIKDISVEIR-STEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSE 380 (499) T ss_pred cCCCccccee-----eEeeccCCCCcCceeEecCcCC-hHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHH Confidence 1122111111 1122222222223444444432 255677777777766553221 1 2234567899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHc---C----CCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHH Q lcl|NC_020844. 351 ASKGNSAVQQWAAGLKDALENALYITNLWM---D----APDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLW 423 (455) Q Consensus 351 ~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~---g----~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~ 423 (455) .+.+.+....+...++.+|+++++.+..|. + .......+++.++........++++.+.+++++|++|++|++ T Consensus 381 ~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l 460 (499) T protein:vir:80 381 KSETYQTKNSHSQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMIPLKIAL 460 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHH Confidence 999999999999999999999988776542 2 122233455544433434446778899999999999999987 Q ss_pred HHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 424 MEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 424 ~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) ..+ -|+ .+.+.++|++||++|....+.+ T Consensus 461 ~~~--~~~--~d~ea~~el~~i~~E~~~~~~~ 488 (499) T protein:vir:80 461 QRA--WNI--TEAEADEWAEMLAKEKQAEIPN 488 (499) T ss_pred hhc--CCC--ChHHHHHHHHHHHHHhhcCCCC Confidence 654 122 2223557899999997665554 No 71 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=99.96 E-value=1.1e-28 Score=173.85 Aligned_cols=410 Identities=12% Similarity=0.069 Sum_probs=264.5 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGRE 80 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~ 80 (455) ||+. ..+.=.+.+..+++..|.+.|... +.=.+...++ +. =+|-++.|.. ++++.++ T Consensus 21 ~p~~----v~~~d~~Rl~aY~l~~~~y~n~~~-----~~~~~lrg~~--~~----~~r~~~~ps~--------~~~~~~~ 77 (527) T protein:vir:10 21 FPNA----VTDFDKARLASYRLYEDMYLTNTS-----DYQVILRGGD--EG----DQRPIYVPNG--------EKLIEAK 77 (527) T ss_pred Cccc----CCHHHHHHHHHHHHHHHHhcCchh-----heeeecCCcc--cc----ccceeeehhh--------HHhhCCc Confidence 6633 445555667888999999888522 1111111111 11 1233334444 4444444 Q ss_pred ceec---------cCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEE-eeCCCCcchhhhhHHhccCCceEEE Q lcl|NC_020844. 81 VEVT---------EGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRV-DYPAGQQFRNREEERQAGVRPYWSQ 150 (455) Q Consensus 81 p~i~---------~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lV-d~p~~~~~~t~ad~~~~~~rPy~~~ 150 (455) -.|. ...+.++..+.+.. +-++|++++.+.-+.++..|-+.++| .-|. +..+-||.++. T Consensus 78 ~~~~~~g~~~~~~~~~e~v~~~lr~~~-~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~----------k~~~~R~~v~~ 146 (527) T protein:vir:10 78 MRFLGQGLKWEFSKKDAKVDDAIKVLF-DRENWEQKFESLKRWTEIRGDYVLLLIGDDE----------KDEGSRLSLHE 146 (527) T ss_pred ceeeccCccccccchhHHHHHHHHHHH-HHhhhHHHHHHHHHhhhhhcceeEEEeeccC----------CCcCCCceEee Confidence 3332 22344566555433 44789999999999999999655444 3222 12355899999 Q ss_pred echhhhccceeeccCCeeEEEEEE--EEeeee-----E-EE-------------------------EEecCeEEEEEEe- Q lcl|NC_020844. 151 VNHSAILEIQSERQGAGERITYMR--MWEDAE-----T-IL-------------------------VLRPDDWERWRKN- 196 (455) Q Consensus 151 ~~~~~ii~w~~~~~~~~~~l~~v~--~~E~~e-----~-~~-------------------------v~~~~~~~~~~~~- 196 (455) ++|.++..|......+.-.=.+++ |+...+ . .| .|+.+.|.--... T Consensus 147 ~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p 226 (527) T protein:vir:10 147 VDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESP 226 (527) T ss_pred cCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccc Confidence 999999988544333322222223 322111 0 11 1111111100000 Q ss_pred -CCcE------EEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecC Q lcl|NC_020844. 197 -DAGI------WEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGV 269 (455) Q Consensus 197 -~~g~------~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~ 269 (455) .+.. -+..+..++++++||+|.|.+.+..++ ..+.+.|.++..+..+..++.||+.-++.+++.|+.+++|+ T Consensus 227 ~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~-~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~ 305 (527) T protein:vir:10 227 LEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNA-MFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSA 305 (527) T ss_pred cchhhhhhhcCceeeecccCCCCccceEeecCCCcccc-ccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeeccc Confidence 0010 123456788999999998855455444 45999999999999999999999999999999999999999 Q ss_pred CCccccccCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhccC-----CCcchhH Q lcl|NC_020844. 270 EPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLTA-----QSGNLTR 344 (455) Q Consensus 270 ~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~-----~~~~~sa 344 (455) ...+ ..++.+++.+|++.+|.||.+ +++..|. ....++.+++.++.+.++|+.++..|... .+++.|| T Consensus 306 ~~vd-~~G~~~~~~VgPG~iweL~e~-----ak~~~v~-~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG 378 (527) T protein:vir:10 306 PPRD-SRGNMVPWTISPLGMVEHGQN-----NKIYRVN-GVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAVAESG 378 (527) T ss_pred cccc-ccCCcCccccCCceeEecCCC-----cceeecc-chhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHH Confidence 9886 457778899999999999854 4777775 44578889999999999999998877433 2457899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH----cCCC--CCc--eEEEEecccCCCCCCHHHHHHHHHHHHcC Q lcl|NC_020844. 345 ISAAQAASKGNSAVQQWAAGLKDALENALY-ITNLW----MDAP--DTN--PEADVYTDFRADTGEEQTPGYLLTMRQNG 415 (455) Q Consensus 345 ~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~-~~a~~----~g~~--~~~--~~v~v~~df~~~~~~~~~~~al~~~~~~G 415 (455) .+..+..+.+-+..+....-++....|... |+-+| .+.. +.+ ..+.+.+........++.+.++.+++++| T Consensus 379 ~ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aG 458 (527) T protein:vir:10 379 IALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNSEKRFNQLLQLWEAG 458 (527) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcC Confidence 999998888877776666656666655433 33233 3332 222 23444432333344578899999999999 Q ss_pred CCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 416 DISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 416 ~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) +||.+|..+.|.+.+- -.|+|.|+++|.++....... T Consensus 459 i~S~~tAv~~L~~~~g---~eD~E~E~~~I~~era~~a~a 495 (527) T protein:vir:10 459 LIPAKKLTEELSKIMG---FELTEEDFKQATEDKKTQGIA 495 (527) T ss_pred chhHHHHHHHHHhccC---CCChHHHHHHHHHHHHHHhHH Confidence 9999999999977643 347889988888875543333 No 72 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=99.96 E-value=1.3e-28 Score=173.47 Aligned_cols=410 Identities=12% Similarity=0.069 Sum_probs=264.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGRE 80 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~ 80 (455) ||+. ..+.=.+.+..+++..|.+.|... +.=.+...++ +. =+|-++.|.. ++++.++ T Consensus 21 ~p~~----v~~~d~~Rl~aY~l~~~~y~n~~~-----~~~~~lrg~~--~~----~~r~~~~ps~--------~~~~~~~ 77 (527) T protein:vir:10 21 FPNA----VTDFDKARLASYRLYEDMYLTNTS-----DYQVILRGGD--EG----DQRPIYVPNG--------EKLIEAK 77 (527) T ss_pred Cccc----CCHHHHHHHHHHHHHHHHhcCchh-----heeeecCCcc--cc----ccceeeehhh--------HHhhCCc Confidence 6633 445555667888999999888522 1111111111 11 1233334444 4444444 Q ss_pred ceec---------cCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEE-eeCCCCcchhhhhHHhccCCceEEE Q lcl|NC_020844. 81 VEVT---------EGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRV-DYPAGQQFRNREEERQAGVRPYWSQ 150 (455) Q Consensus 81 p~i~---------~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lV-d~p~~~~~~t~ad~~~~~~rPy~~~ 150 (455) -.|. ...+.++..+.+.. +-++|++++.+.-+.++..|-+.++| .-|. +..+-||.++. T Consensus 78 ~~~~~~g~~~~~~~~~e~v~~~lr~~~-~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~----------k~~~~R~~v~~ 146 (527) T protein:vir:10 78 MRFLGQGLKWEFSKKDAKVDDAIRVLF-DRENWEQKFESLKRWTEIRGDYVLLLIGDDE----------KDEGSRLSLHE 146 (527) T ss_pred ceeeccCccccccchhHHHHHHHHHHH-HHhhhHHHHHHHHHhhhhhcceeEEEeeccC----------CCcCCCceEee Confidence 3332 22344566554432 44789999999999999999655444 3222 12355899999 Q ss_pred echhhhccceeeccCCeeEEEEEE--EEeeee-----E-EE-------------------------EEecCeEEEEEEe- Q lcl|NC_020844. 151 VNHSAILEIQSERQGAGERITYMR--MWEDAE-----T-IL-------------------------VLRPDDWERWRKN- 196 (455) Q Consensus 151 ~~~~~ii~w~~~~~~~~~~l~~v~--~~E~~e-----~-~~-------------------------v~~~~~~~~~~~~- 196 (455) ++|.++..|......+.-.=.+++ |+...+ . .| .|+.+.|.--... T Consensus 147 ~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p 226 (527) T protein:vir:10 147 VDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESP 226 (527) T ss_pred cCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccc Confidence 999999988544333322222223 322111 0 11 1111111100000 Q ss_pred -CCcE------EEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecC Q lcl|NC_020844. 197 -DAGI------WEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGV 269 (455) Q Consensus 197 -~~g~------~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~ 269 (455) .+.. -+..+..++++++||+|.|.+.+..++ ..+.+.|.++..+..+..++.||+.-++.+++.|+.+++|+ T Consensus 227 ~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~-~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~ 305 (527) T protein:vir:10 227 LEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNA-MFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSA 305 (527) T ss_pred cchhhhhhhcCceeeecccCCCCccceEeecCCCcccc-ccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeeccc Confidence 0010 123456788999999998855455444 45999999999999999999999999999999999999999 Q ss_pred CCccccccCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhccC-----CCcchhH Q lcl|NC_020844. 270 EPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLTA-----QSGNLTR 344 (455) Q Consensus 270 ~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~-----~~~~~sa 344 (455) ...+ ..++.+++.+|++.+|.||.+ +++..|. ....++.+++.++.+.++|+.++..|... .+++.|| T Consensus 306 ~~vd-~~G~~~~~~VgPG~iweL~e~-----ak~~~v~-~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG 378 (527) T protein:vir:10 306 PPRD-SRGNMVPWTISPLGMVEHGQN-----NKIYRVN-GVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAVAESG 378 (527) T ss_pred cccc-ccCCcCccccCCceeEecCCC-----cceeecc-chhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHH Confidence 9886 457778899999999999854 4777775 44578889999999999999998877433 2457899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH----cCCC--CCc--eEEEEecccCCCCCCHHHHHHHHHHHHcC Q lcl|NC_020844. 345 ISAAQAASKGNSAVQQWAAGLKDALENALY-ITNLW----MDAP--DTN--PEADVYTDFRADTGEEQTPGYLLTMRQNG 415 (455) Q Consensus 345 ~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~-~~a~~----~g~~--~~~--~~v~v~~df~~~~~~~~~~~al~~~~~~G 415 (455) .+..+..+.+-+..+....-++....|... |+-+| .+.. +.+ ..+.+.+........++.++++.+++++| T Consensus 379 ~ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aG 458 (527) T protein:vir:10 379 IALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNNEKRFAQLLELWEAG 458 (527) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcC Confidence 999998888877777666656666655433 33233 3332 222 23444432233344578899999999999 Q ss_pred CCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 416 DISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 416 ~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) +||.+|..+.|.+.+- -.|+|.|+++|.++....... T Consensus 459 iiS~etAv~~L~~~~g---~eD~E~E~~~I~~era~~a~a 495 (527) T protein:vir:10 459 LIPAKKLTEELSKIMG---FELTEEDFRQATEDKKTQGIA 495 (527) T ss_pred chhHHHHHHHHHhccC---CCchHHHHHHHHHHHHHHhHH Confidence 9999999999977643 347888888888875543333 No 73 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=99.95 E-value=1.5e-26 Score=162.18 Aligned_cols=417 Identities=13% Similarity=0.054 Sum_probs=242.7 Q ss_pred CCC--------CCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHH Q lcl|NC_020844. 1 MPE--------QDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGL 72 (455) Q Consensus 1 m~~--------~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~ 72 (455) |-. .+|+- .|+|.+....|+ .+|.|.... |-.... . ...+.|...+ .|+++.+++.+ T Consensus 20 ~~~~~~~i~d~~~i~~-~~~~~~~i~~~~---~~Y~g~~~~-------l~~~~~-~-~~~~~~~~~s--lnl~~~i~~~~ 84 (505) T protein:vir:79 20 MTKSLGQIIDDPRINL-PADEVERIARDK---RYYMDDFKQ-------VTHKNS-Y-GDTQKHELQS--VNVTKLASAKL 84 (505) T ss_pred chhhhhhhhcccCCCC-CHHHHHHHHHHH---HHhcCCCcc-------cccccc-C-CCccccceee--cchHHHHHHHH Confidence 110 01222 466777777775 458885331 111000 0 1111122222 59999999999 Q ss_pred hhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEec Q lcl|NC_020844. 73 ASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVN 152 (455) Q Consensus 73 ~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~ 152 (455) ++++|+++|+|+...+..+++++.+- +.++|...+..++..++.+|.+++.++.+. + +|-+..++ T Consensus 85 A~ll~~e~~~i~~~d~~~~e~l~~i~-~~n~f~~~~~~~~e~a~a~G~~~~k~~~D~-~-------------~~~i~~v~ 149 (505) T protein:vir:79 85 ASLIFNEQCQVTVSDETANDFLDDVF-QQNDFYTTFEEKLEEWIALGSGCVRPYVDS-G-------------KIKLAWAT 149 (505) T ss_pred HhhhcCCCceeecCChHHHHHHHHHH-HhccHHHHHHHHHHHHhhcCCeEEEEEEeC-C-------------ceEEEEEc Confidence 99999999999866666777776653 346799999999999999999999998753 2 24577889 Q ss_pred hhhhccceeeccCCeeEEEEEE-EEeee----eEEEEEe-----cCe----EEEEEEeCC---cE---------EE-Eec Q lcl|NC_020844. 153 HSAILEIQSERQGAGERITYMR-MWEDA----ETILVLR-----PDD----WERWRKNDA---GI---------WE-VDE 205 (455) Q Consensus 153 ~~~ii~w~~~~~~~~~~l~~v~-~~E~~----e~~~v~~-----~~~----~~~~~~~~~---g~---------~~-~~~ 205 (455) |++++....+. ++...+.++. +.... ..|+.++ .+. .++|+.... |. |. +.+ T Consensus 150 ad~~~P~~~d~-~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~ 228 (505) T protein:vir:79 150 ADQVYPLQADT-NQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEP 228 (505) T ss_pred CCeeEEEEEcC-CCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhcccccccCc Confidence 99988744443 3343443332 21110 1122221 111 234543322 21 11 011 Q ss_pred c-CCCCCCceeEEEEEeccc---CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecC------CCcccc Q lcl|NC_020844. 206 F-GRITIGVVPMVPFITGRR---KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGV------EPDTDS 275 (455) Q Consensus 206 ~-g~~~l~~vP~V~f~~~~~---~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~------~~~~~~ 275 (455) + ..+++++.+|+.|.++-. +.....+.+.+.++.++..+.....|.+.+.+......+.+=..+ ...... T Consensus 229 ~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~~ 308 (505) T protein:vir:79 229 QVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWLKTGSSYGGQAS 308 (505) T ss_pred ceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcccCCCCcccc Confidence 1 124567777877653211 112235788888888888888888888888887666665551111 111111 Q ss_pred ccCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhh----hccCCCcchhHHHHHHHH Q lcl|NC_020844. 276 EGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQ----PLTAQSGNLTRISAAQAA 351 (455) Q Consensus 276 ~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~----~l~~~~~~~sa~a~~~~~ 351 (455) +..++ ....+.....+..+.+++..+..++++-. .+++.+.++.+.+++...+.. +-...++.+||++...+. T Consensus 309 ~~~~~--~fd~~~~~y~~~~~~~~~~~i~~~~~~ir-~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~ 385 (505) T protein:vir:79 309 ETHPP--MFDPDETVYQAMYGDASEVGFHDATSPIR-VADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNN 385 (505) T ss_pred ccccc--CCCccceeeeeccCCCCCCceEEecccCC-HHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHH Confidence 11111 11112222222223333444555555432 255677777777766543221 222344678999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC-------------CCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCC Q lcl|NC_020844. 352 SKGNSAVQQWAAGLKDALENALYITNLWMDA-------------PDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDIS 418 (455) Q Consensus 352 ~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~-------------~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is 418 (455) +.+.+....+...++++|+++++.++.+... ...+..++++++-.......++++...+++++|++| T Consensus 386 ~~l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~Gi~s 465 (505) T protein:vir:79 386 SQTYQTRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAVQAQVMP 465 (505) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHHcCCCC Confidence 9999999999999999999999987665311 111233444332222233466788899999999999 Q ss_pred HHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 419 REGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 419 ~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) +++++..+ -|+ .+-+.++|++||++|..+...| T Consensus 466 ~e~~l~~~--~~~--~eeea~~el~ri~~E~~~~~p~ 498 (505) T protein:vir:79 466 KKQFLMRN--YGL--DEEEADEWLAQIDAENSTAEPE 498 (505) T ss_pred HHHHHHhc--CCC--ChHHHHHHHHHHHHhccccCCC Confidence 99988655 222 1223567899999997665555 No 74 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=99.93 E-value=1.8e-24 Score=150.76 Aligned_cols=416 Identities=11% Similarity=0.011 Sum_probs=242.4 Q ss_pred CCC------CCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPE------QDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~------~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g 74 (455) ++. .+| ...|++......|+.+ |.|..... .|-+ .. ...+.|+.. -.|+++.+++.+++ T Consensus 22 ~~~~~~~~~~~i-~~~~~~~~ri~~~~~~---y~g~~~~~----~~~~---~~--~~~~~~~~~--sln~~~~i~~~~A~ 86 (508) T protein:vir:15 22 GSLSKITDDPRI-SIDPDEYVRIQTDLDY---YSDKLQYI----HYQA---SD--GIKKKRLKN--TINMAKTAARRIAS 86 (508) T ss_pred cchHHhhccccc-ccCHHHHHHHHHHHHH---hcCCCccc----cccc---CC--CCcccccee--ecchHHHHHHHHHh Confidence 221 123 2467777777777665 88854311 1211 11 111122221 25999999999999 Q ss_pred hhhcCCceecc-CCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEech Q lcl|NC_020844. 75 KPFGREVEVTE-GTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNH 153 (455) Q Consensus 75 ~lf~k~p~i~~-~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~ 153 (455) ++|+.+|+++. +++..+++++.+- +.++|...+...+..++.+|.+++.++.+.. +|-+..++| T Consensus 87 lv~~e~~~i~v~~~~~~~e~l~~il-~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~--------------~~~i~~v~a 151 (508) T protein:vir:15 87 VVFNEKAEIHVKDNNEADKFLNDVL-EDNDFKNKFEEALEKGVALGGFAMRPYIDGN--------------HIKIAWVRA 151 (508) T ss_pred hhhCCCceEEeCCchHHHHHHHHHH-HhccHHHHHHHHHHHHhhcCceEEEEEEeCC--------------eeEEEEEcC Confidence 99999999874 4455556665543 2367899999999999999999999987532 245778899 Q ss_pred hhhccceeeccCCeeEEEEEE-E-Eee---eeEEEEEe--------cCe--EEEEEEeCC---cEEEE---ecc------ Q lcl|NC_020844. 154 SAILEIQSERQGAGERITYMR-M-WED---AETILVLR--------PDD--WERWRKNDA---GIWEV---DEF------ 206 (455) Q Consensus 154 ~~ii~w~~~~~~~~~~l~~v~-~-~E~---~e~~~v~~--------~~~--~~~~~~~~~---g~~~~---~~~------ 206 (455) .+++....+. ++....+++. + +.+ ...|+.++ ++. .++|+.... |.... +++ T Consensus 152 d~~~P~~~d~-~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~ 230 (508) T protein:vir:15 152 DQFYPLQSNT-NDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAP 230 (508) T ss_pred CeeEEEEEcC-CCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCCCc Confidence 9988654433 3333333322 1 111 11222221 111 234443221 21111 111 Q ss_pred --CCCCCCceeEEEEEeccc---CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccc Q lcl|NC_020844. 207 --GRITIGVVPMVPFITGRR---KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKP 281 (455) Q Consensus 207 --g~~~l~~vP~V~f~~~~~---~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~ 281 (455) ..+++.++||+.|.++-. +.....+.+.+.++.++..+.....|.+.+.++.....+.+-..+-.. +.++.+ . T Consensus 231 ~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~-d~~~~~-~ 308 (508) T protein:vir:15 231 QVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRLGQKHIAVQPGMLRF-DDEHKP-T 308 (508) T ss_pred ceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcC-CCCCcc-c Confidence 124677788887754211 112335788888888888888888888988887555555553232211 111111 1 Q ss_pred eeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhh----ccCCCcchhHHHHHHHHHHHHHH Q lcl|NC_020844. 282 LDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQP----LTAQSGNLTRISAAQAASKGNSA 357 (455) Q Consensus 282 ~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~----l~~~~~~~sa~a~~~~~~~~~s~ 357 (455) +..+.+-...++.+. ..+..+..++++-. .+.+.+.++.+.+++....... -...++.+||++...+.+.+.+. T Consensus 309 ~~~~~~~~~~~~~~~-~~~~~i~~~~~~ir-~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~~~~t 386 (508) T protein:vir:15 309 FDTEQNVYVGVLSDD-NNGLGVKDMTTPIR-TVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNNSMTYQT 386 (508) T ss_pred cCCCCeeEEeccCCC-CCCCceeEeecccC-hHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHHHHHHHH Confidence 111111122233222 23334666665532 2456777777776664432111 12234568999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHc---CCC------------CCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHH Q lcl|NC_020844. 358 VQQWAAGLKDALENALYITNLWM---DAP------------DTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGL 422 (455) Q Consensus 358 L~~~a~~~~~a~~~~l~~~a~~~---g~~------------~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~ 422 (455) ...+...++.+|+++++.+..+. +.- ..+..++++++-.......++++.+.+++.+|++|++++ T Consensus 387 ~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aGi~s~e~~ 466 (508) T protein:vir:15 387 RSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIGALSKQTF 466 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHH Confidence 99999999999999988765542 211 112334444333333444678889999999999999998 Q ss_pred HHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 423 WMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 423 ~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) +..+ -|+ .+-+.++|++||++|.++.... T Consensus 467 i~~~--~g~--~deea~~el~ri~~E~~~~~~~ 495 (508) T protein:vir:15 467 LQRN--YGM--TDEQAAEELAKIQSEAPTDTFE 495 (508) T ss_pred HHhc--CCC--ChHHHHHHHHHHHHhccccCcc Confidence 8654 222 1223567899999997654333 No 75 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=99.93 E-value=4.7e-24 Score=148.45 Aligned_cols=419 Identities=10% Similarity=0.002 Sum_probs=239.2 Q ss_pred CCCC---------CCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHH Q lcl|NC_020844. 1 MPEQ---------DPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEG 71 (455) Q Consensus 1 m~~~---------~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~ 71 (455) |-.+ +|. ..|++.+....|.. +|.|.... + ++.... ...+.+... -.|+++.+++. T Consensus 18 ~~~~~~~~~~~~~~i~-~~~~~~~~i~~~~~---~Y~g~~~~-------~-~~~~~~-~~~~~~~~~--slnl~~~i~~~ 82 (500) T protein:vir:30 18 MTTQSLTNITDHPKIA-ISKLEYDRITTNLK---YYKSDWDS-------V-LYLNTD-GETKKRDLN--HLPIARTAAKK 82 (500) T ss_pred hhcchhhhhhcccccc-CCHHHHHHHHHHHH---HhcCCCCC-------c-ccccCC-CCcccCcee--ecchHHHHHHH Confidence 1111 122 35666666666644 57774210 1 111111 011122222 25999999999 Q ss_pred HhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEe Q lcl|NC_020844. 72 LASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQV 151 (455) Q Consensus 72 ~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~ 151 (455) +++++|+.+|+++...+..+++++++-. .+++...+..++..++..|.+++.++.+. .+|.+..+ T Consensus 83 ~A~lv~~e~~~i~~~d~~~~~~l~~il~-~n~f~~~~~~~~e~a~a~G~~~~k~~~d~--------------~~~~I~~v 147 (500) T protein:vir:30 83 IASLVFNEQAEIKVDDDAANEFISETLK-NDRFNKNFERYLESCLALGGLAMRPYVDG--------------DKVRVAFV 147 (500) T ss_pred HhhhhcCCcceEecCChHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCCEEEEEEEeC--------------CceEEEEE Confidence 9999999999998766777777776642 36799999999999999999999987642 13678889 Q ss_pred chhhhccceeeccCCeeEEEEEEEEeee---e-EEEE-----EecCe-----EEEEEEeCC---cE-------EE-Eecc Q lcl|NC_020844. 152 NHSAILEIQSERQGAGERITYMRMWEDA---E-TILV-----LRPDD-----WERWRKNDA---GI-------WE-VDEF 206 (455) Q Consensus 152 ~~~~ii~w~~~~~~~~~~l~~v~~~E~~---e-~~~v-----~~~~~-----~~~~~~~~~---g~-------~~-~~~~ 206 (455) +|.+++.+..+..+........+...+. + .|+. |+.+. .++|+.... |. |. +.++ T Consensus 148 ~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~ 227 (500) T protein:vir:30 148 QAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDE 227 (500) T ss_pred cCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcc Confidence 9999987665444332222222211111 1 1211 33222 134443211 11 11 0011 Q ss_pred -CCCCCCceeEEEEEec---ccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcc----ccc-c Q lcl|NC_020844. 207 -GRITIGVVPMVPFITG---RRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDT----DSE-G 277 (455) Q Consensus 207 -g~~~l~~vP~V~f~~~---~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~----~~~-~ 277 (455) ..++++..||+.|..+ ..+.+...|.+.+.++.++..+.....|.+.+.+......+.+-..+-..+ ..+ . T Consensus 228 ~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~ 307 (500) T protein:vir:30 228 AKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDVV 307 (500) T ss_pred eEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCcccc Confidence 1245666667665432 222223347777777777777777778888888876555555422221110 001 0 Q ss_pred CccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhh----hhccCCCcchhHHHHHHHHHH Q lcl|NC_020844. 278 SPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGR----QPLTAQSGNLTRISAAQAASK 353 (455) Q Consensus 278 ~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~----~~l~~~~~~~sa~a~~~~~~~ 353 (455) .+........-...++.+ .+.+..+..++++- ..+.+.+.++.+.+++...++ .+-...++.+||++...+.+. T Consensus 308 ~~~~~d~~~~~~~~~~~~-~~~~~~i~~~~~~i-r~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~ 385 (500) T protein:vir:30 308 PRPRFESDQNVYIRMGGR-DLDSSAIQDLTTPI-RADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSD 385 (500) T ss_pred CCcccCCCcceEEEcCCC-CCcCcceeEecccc-ChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHH Confidence 111111111111123322 12233455554443 235566666666666533221 122234567899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHc-------CCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHH Q lcl|NC_020844. 354 GNSAVQQWAAGLKDALENALYITNLWM-------DAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEF 426 (455) Q Consensus 354 ~~s~L~~~a~~~~~a~~~~l~~~a~~~-------g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l 426 (455) +.+....+...++.+|+++++.+..+. +..+.+..+.++++-.......++++.+.+++++|++|+++++..+ T Consensus 386 ~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~ 465 (500) T protein:vir:30 386 TYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKV 465 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhc Confidence 999999999999999999998776442 2223334455554433333346778999999999999999987543 Q ss_pred HhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 427 QRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 427 ~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) -|+ .+.+.++|++||++|.+...+. T Consensus 466 --~g~--~eeea~~~l~~i~~E~~~~~~~ 490 (500) T protein:vir:30 466 --LNV--TEEKAQEIAAEINTGIVDEINQ 490 (500) T ss_pred --CCC--CHHHHHHHHHHHHHhccccCCC Confidence 233 2223567889999987766655 No 76 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=99.93 E-value=4.7e-24 Score=148.45 Aligned_cols=419 Identities=10% Similarity=0.002 Sum_probs=239.2 Q ss_pred CCCC---------CCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHH Q lcl|NC_020844. 1 MPEQ---------DPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEG 71 (455) Q Consensus 1 m~~~---------~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~ 71 (455) |-.+ +|. ..|++.+....|.. +|.|.... + ++.... ...+.+... -.|+++.+++. T Consensus 18 ~~~~~~~~~~~~~~i~-~~~~~~~~i~~~~~---~Y~g~~~~-------~-~~~~~~-~~~~~~~~~--slnl~~~i~~~ 82 (500) T protein:vir:98 18 MTTQSLTNITDHPKIA-ISKLEYDRITTNLK---YYKSDWDS-------V-LYLNTD-GETKKRDLN--HLPIARTAAKK 82 (500) T ss_pred hhcchhhhhhcccccc-CCHHHHHHHHHHHH---HhcCCCCC-------c-ccccCC-CCcccCcee--ecchHHHHHHH Confidence 1111 122 35666666666644 57774210 1 111111 011122222 25999999999 Q ss_pred HhhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEe Q lcl|NC_020844. 72 LASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQV 151 (455) Q Consensus 72 ~~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~ 151 (455) +++++|+.+|+++...+..+++++++-. .+++...+..++..++..|.+++.++.+. .+|.+..+ T Consensus 83 ~A~lv~~e~~~i~~~d~~~~~~l~~il~-~n~f~~~~~~~~e~a~a~G~~~~k~~~d~--------------~~~~I~~v 147 (500) T protein:vir:98 83 IASLVFNEQAEIKVDDDAANEFISETLK-NDRFNKNFERYLESCLALGGLAMRPYVDG--------------DKVRVAFV 147 (500) T ss_pred HhhhhcCCcceEecCChHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCCEEEEEEEeC--------------CceEEEEE Confidence 9999999999998766777777776642 36799999999999999999999987642 13678889 Q ss_pred chhhhccceeeccCCeeEEEEEEEEeee---e-EEEE-----EecCe-----EEEEEEeCC---cE-------EE-Eecc Q lcl|NC_020844. 152 NHSAILEIQSERQGAGERITYMRMWEDA---E-TILV-----LRPDD-----WERWRKNDA---GI-------WE-VDEF 206 (455) Q Consensus 152 ~~~~ii~w~~~~~~~~~~l~~v~~~E~~---e-~~~v-----~~~~~-----~~~~~~~~~---g~-------~~-~~~~ 206 (455) +|.+++.+..+..+........+...+. + .|+. |+.+. .++|+.... |. |. +.++ T Consensus 148 ~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~ 227 (500) T protein:vir:98 148 QAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDE 227 (500) T ss_pred cCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcc Confidence 9999987665444332222222211111 1 1211 33222 134443211 11 11 0011 Q ss_pred -CCCCCCceeEEEEEec---ccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcc----ccc-c Q lcl|NC_020844. 207 -GRITIGVVPMVPFITG---RRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDT----DSE-G 277 (455) Q Consensus 207 -g~~~l~~vP~V~f~~~---~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~----~~~-~ 277 (455) ..++++..||+.|..+ ..+.+...|.+.+.++.++..+.....|.+.+.+......+.+-..+-..+ ..+ . T Consensus 228 ~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~ 307 (500) T protein:vir:98 228 AKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDVV 307 (500) T ss_pred eEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCcccc Confidence 1245666667665432 222223347777777777777777778888888876555555422221110 001 0 Q ss_pred CccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhh----hhccCCCcchhHHHHHHHHHH Q lcl|NC_020844. 278 SPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGR----QPLTAQSGNLTRISAAQAASK 353 (455) Q Consensus 278 ~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~----~~l~~~~~~~sa~a~~~~~~~ 353 (455) .+........-...++.+ .+.+..+..++++- ..+.+.+.++.+.+++...++ .+-...++.+||++...+.+. T Consensus 308 ~~~~~d~~~~~~~~~~~~-~~~~~~i~~~~~~i-r~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~ 385 (500) T protein:vir:98 308 PRPRFESDQNVYIRMGGR-DLDSSAIQDLTTPI-RADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSD 385 (500) T ss_pred CCcccCCCcceEEEcCCC-CCcCcceeEecccc-ChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHH Confidence 111111111111123322 12233455554443 235566666666666533221 122234567899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHc-------CCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHH Q lcl|NC_020844. 354 GNSAVQQWAAGLKDALENALYITNLWM-------DAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEF 426 (455) Q Consensus 354 ~~s~L~~~a~~~~~a~~~~l~~~a~~~-------g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l 426 (455) +.+....+...++.+|+++++.+..+. +..+.+..+.++++-.......++++.+.+++++|++|+++++..+ T Consensus 386 ~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~ 465 (500) T protein:vir:98 386 TYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKV 465 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhc Confidence 999999999999999999998776442 2223334455554433333346778999999999999999987543 Q ss_pred HhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 427 QRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 427 ~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) -|+ .+.+.++|++||++|.+...+. T Consensus 466 --~g~--~eeea~~~l~~i~~E~~~~~~~ 490 (500) T protein:vir:98 466 --LNV--TEEKAQEIAAEINTGIVDEINQ 490 (500) T ss_pred --CCC--CHHHHHHHHHHHHHhccccCCC Confidence 233 2223567889999987766655 No 77 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=99.92 E-value=1.7e-25 Score=156.37 Aligned_cols=422 Identities=12% Similarity=0.109 Sum_probs=260.2 Q ss_pred CCCCC------CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQD------PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~------v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g 74 (455) ||+.- =+-..+.-.+.+.++++..|.|.|... .+=+-.++++ .+-++.|..+.+|++ +. T Consensus 9 ~p~~~~fp~~~a~wV~~~D~~RlaaY~ly~d~y~n~~~------el~~il~G~d--------r~~~~~ps~r~~V~~-~~ 73 (563) T protein:vir:74 9 DPAKPFLRGGDDNIVDENDKNRVRAYDLYENIYLNSAE------TLKLVLRGDD--------SVPILMPSGRKIVEA-VH 73 (563) T ss_pred CCCcccccccccccCCHHHHHHHHHHHHHHHhhcCchh------hhhhhcCCCc--------eeeeccchHHHHHHH-HH Confidence 44321 233455556677888888888988532 1111122332 345566788999999 55 Q ss_pred hhhcCCceecc-----C---CHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEE-eeCCCCcchhhhhHHhccCC Q lcl|NC_020844. 75 KPFGREVEVTE-----G---TGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRV-DYPAGQQFRNREEERQAGVR 145 (455) Q Consensus 75 ~lf~k~p~i~~-----~---~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lV-d~p~~~~~~t~ad~~~~~~r 145 (455) ++++++..++- + ...++.++.+.. +.++|...+.+.-+.++..|-+.++| .-| ++..+.| T Consensus 74 ~~Lg~~~~~~Ve~~~~de~~~~avq~~Lr~~~-~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp----------~K~~g~R 142 (563) T protein:vir:74 74 RFLGVGFDYLVEPDMGDEGIRQSLNAYFRTTF-KREAIKAKFTSNKRWGLIRGDAHFYIHADP----------NKKAGER 142 (563) T ss_pred HhcCCCcEEecCccccCcchHHHHHHHHHHHH-HHhhhHHHHHHHHHhhhhhcceeEEEeecc----------ccccCCC Confidence 77799988841 1 234666666654 34789999999999999999655544 322 1234568 Q ss_pred ceEEEechhhhccceeeccC-CeeEEEEEEE--Ee--e----eeEE---------------------EEEecCeEE---- Q lcl|NC_020844. 146 PYWSQVNHSAILEIQSERQG-AGERITYMRM--WE--D----AETI---------------------LVLRPDDWE---- 191 (455) Q Consensus 146 Py~~~~~~~~ii~w~~~~~~-~~~~l~~v~~--~E--~----~e~~---------------------~v~~~~~~~---- 191 (455) |-++.|+|.++..|...... |.. ++.+.. ++ + .-++ ..|+.+.|. T Consensus 143 ~rv~~vDP~~~fp~~dpd~v~g~~-~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae~w~lg~wd~r~~ 221 (563) T protein:vir:74 143 ISVDEVDPRQIFLIEDGSTVVGFH-MVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELTHWTLGNWDDRGA 221 (563) T ss_pred ceEeecCCceeeeccCCCCcccce-eeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccchhccccccccCc Confidence 88999999988877654322 111 111111 00 0 0000 112222110 Q ss_pred ---EEEEeCCcEEE-----EeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCce Q lcl|NC_020844. 192 ---RWRKNDAGIWE-----VDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPM 263 (455) Q Consensus 192 ---~~~~~~~g~~~-----~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~ 263 (455) .+....++.+. .++.-+++++.||+|-|.+.+..++++ +.+.|.++..+..+++++.+|..-++.+++.|+ T Consensus 222 ~~~~~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~W-G~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi 300 (563) T protein:vir:74 222 ISDEQARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSW-GTSQLEGMETLAYALNQSLTDEDATIVFQGLGM 300 (563) T ss_pred cchhhhcccchhhhhhhhchhhhccccccCccEEEcCCCCCccccc-chhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCe Confidence 11112222221 112236789999998775555555555 899999999999999999999999999999999 Q ss_pred EEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHH-HHHHhhhhhcc-----C Q lcl|NC_020844. 264 LSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSK-EIRELGRQPLT-----A 337 (455) Q Consensus 264 ~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~-~m~~~g~~~l~-----~ 337 (455) .++.|..+.+..-++.....+|++.+|.+|.+. +.+.+..|. ....++.++..|+.+.. .|+.++..+-. . T Consensus 301 ~vl~~~~p~d~~~g~~~~w~vgpG~i~El~~~~--~~g~l~~v~-g~~~l~~~q~Hm~~l~eral~~~s~tPavA~G~vD 377 (563) T protein:vir:74 301 YVTNASAPVDPNTGELTDWNIGPMQIVEIAGNR--NDNYFERVS-GVQDVSPFQDHMKWIDEKGIAEGSGTPEVAIGRVD 377 (563) T ss_pred EEeccccccccccccccccccCCceeEeccCCc--cccceeeec-chhhhHHHHHHHHHHHHHHHHhhccCcceeecccc Confidence 999998877655555556779999999998664 334555554 22455666666666655 66666554422 1 Q ss_pred CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHH---------HHHcCCCCCceEEEEecccCCC-- Q lcl|NC_020844. 338 QSGNLTRISAAQAASKGNSAVQQWAAGLKDALEN--------ALYIT---------NLWMDAPDTNPEADVYTDFRAD-- 398 (455) Q Consensus 338 ~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~--------~l~~~---------a~~~g~~~~~~~v~v~~df~~~-- 398 (455) .+.++||.|..+....+.|.+..+...+...+.+ .|.++ +.|.|..+......++--|.+. T Consensus 378 ~~~~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~~v~ivf~p~~P 457 (563) T protein:vir:74 378 VTSAESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNECSVVCIFADPMP 457 (563) T ss_pred cccccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCceEEEEEeCCCCC Confidence 2457999999999998888766666555555544 22222 2345554433332223223322 Q ss_pred CCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcc---------------------cCCCC Q lcl|NC_020844. 399 TGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELP---------------------GGIEE 455 (455) Q Consensus 399 ~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~---------------------~~l~~ 455 (455) ..-++-+..++.++++|+||++|..+.|.+.|..-++ .|.|.++|+.+.= +|..| T Consensus 458 ~d~~~vv~~~~tl~~aGiiSretAv~~L~~~g~~~pd--ae~e~~~ie~~~i~~~~~a~a~ad~~~~~~a~~~~g~~~ 533 (563) T protein:vir:74 458 VNKTQVTQDTLLLQQAHLILRKMAVAKLRSIGWEYPE--VDDQGNALTDDDIADMLLAEAEADASLGLSAMDNGGAGE 533 (563) T ss_pred ccHHHHHHHHHHHHHcCchhHHHHHHHHHhCCCCCCc--HHHHHhhcCHHHHHHHHHHHhhccCcccceecccCCCCc Confidence 2235678889999999999999999999988764443 4555554443311 11111 No 78 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=99.90 E-value=6.8e-22 Score=136.63 Aligned_cols=420 Identities=10% Similarity=-0.008 Sum_probs=234.6 Q ss_pred CCCC---CCCc-----cCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHH Q lcl|NC_020844. 1 MPEQ---DPTK-----RSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGL 72 (455) Q Consensus 1 m~~~---~v~~-----~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~ 72 (455) |.++ ++.. .|+++......|+ .+|.|...- .+|. ... .....|... -.|+++.+++.+ T Consensus 18 ~~~~~~~~i~~~~~i~~~~~~~~~i~~~~---~~y~g~~~~----~~~~---~~~--~~~~~~~~~--slnl~~~i~~~~ 83 (522) T protein:vir:47 18 MQTSNLNSILEHPKIAVTQEEYDRIKRNL---VYYQSKWDD----VQYK---NTD--GDIKSRPMN--HLPIARTASKKI 83 (522) T ss_pred hhcccchhccccCCCCCCHHHHHHHHHHH---HHhcCCccc----cccc---ccC--cchhcccce--ecchHHHHHHHH Confidence 2222 2221 2777666666664 458774220 1111 111 111112211 259999999999 Q ss_pred hhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEec Q lcl|NC_020844. 73 ASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVN 152 (455) Q Consensus 73 ~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~ 152 (455) ++++|..+++++..++..+++++.+-. -+.+...+...+..++..|.+++.++.+. + +|.+..++ T Consensus 84 A~lv~~e~~~i~v~d~~~~~~l~~~l~-~n~f~~~~~~~~e~a~a~G~~a~k~~~d~-~-------------~~~i~~v~ 148 (522) T protein:vir:47 84 ASLVYNEQATITTKNEILQKFLDDMLT-NDRFNKNFERYLESCLALGGLAMRPYIDG-D-------------KVRVAFIQ 148 (522) T ss_pred hhhhcCCcceeecCChHHHHHHHHHHh-hcchHHHHHHHHHHhhccCCEEEEEEEcC-C-------------ceEEEEEc Confidence 999999999998666777777766543 36789999999999999999999987653 2 25577788 Q ss_pred hhhhccceeeccCCeeEEEEEEEEe---eee-EEEEEec-------------------C--eEEEEEEeCC---cE---- Q lcl|NC_020844. 153 HSAILEIQSERQGAGERITYMRMWE---DAE-TILVLRP-------------------D--DWERWRKNDA---GI---- 200 (455) Q Consensus 153 ~~~ii~w~~~~~~~~~~l~~v~~~E---~~e-~~~v~~~-------------------~--~~~~~~~~~~---g~---- 200 (455) |.+++.-..+..+-.+...+.+... ... .+..++. + ..+.|+.... |. T Consensus 149 ad~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l 228 (522) T protein:vir:47 149 APVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNL 228 (522) T ss_pred CCceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCccccc Confidence 8888764333222122111111110 000 1111211 1 1123432211 11 Q ss_pred -----EEE-ec-cCCCCCCceeEEEEEecc---cCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCC Q lcl|NC_020844. 201 -----WEV-DE-FGRITIGVVPMVPFITGR---RKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVE 270 (455) Q Consensus 201 -----~~~-~~-~g~~~l~~vP~V~f~~~~---~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~ 270 (455) |.- .+ ..-+++.+.+|+.|..+- .+.+...+.+.+.++.++..+....-+.+.+.+......+.+=..+- T Consensus 229 ~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l 308 (522) T protein:vir:47 229 SELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEHLT 308 (522) T ss_pred cccccccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeecchHHh Confidence 110 00 012356667777765321 11123346777777777777777777777777776665555522211 Q ss_pred C----ccccc-cCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHh-h--h-hhccCCCcc Q lcl|NC_020844. 271 P----DTDSE-GSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIREL-G--R-QPLTAQSGN 341 (455) Q Consensus 271 ~----~~~~~-~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~-g--~-~~l~~~~~~ 341 (455) . ..... ..+.....+.+-...++.+. +.+.++..++|.-.. +.+.+.++.+.+.+-.. | . .+-...++. T Consensus 309 ~~~~~~~~g~~~~~~~fd~~~~~f~~~~~~~-~~~~~i~~~~~~ir~-e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~ 386 (522) T protein:vir:47 309 QRQYQRPDGTIDFRPRFDVEQNVYMQIGGSS-MDAGGITDLTSPIRA-NDYILAISEGLKLFEMQIGVSSGMFTFDGQGM 386 (522) T ss_pred ccCCCCCCcccccccccCcccceEeecCCCC-CCCCcceeeccccCh-HHHHHHHHHHHHHHHHHhCCCccccCcccccc Confidence 1 00000 00001111111111222221 223346666655432 45566666655555332 2 1 122234567 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC-------CCCCceEEEEecccCCCCCCHHHHHHHHHHHHc Q lcl|NC_020844. 342 LTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMD-------APDTNPEADVYTDFRADTGEEQTPGYLLTMRQN 414 (455) Q Consensus 342 ~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g-------~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~ 414 (455) +||++...+.+.+.+....+...++.||+++++.++.+.. .......++++++-........+++.+.+++++ T Consensus 387 kTAtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~a 466 (522) T protein:vir:47 387 KTATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELDYWAKMVAA 466 (522) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHHHHHHHHhc Confidence 8999999999999999999999999999999998876642 223344555555433334446778899999999 Q ss_pred CCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 415 GDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 415 G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) |++|+++++..+ -|+ .+-+.++|++||++|..+.... T Consensus 467 G~~s~e~~i~~~--~g~--~eeea~~el~ri~~E~~~~~~~ 503 (522) T protein:vir:47 467 GFSTKKRAIGKT--LNI--SGVEAEKELNAINSELLPMNDA 503 (522) T ss_pred CCCCHHHHHHhc--CCC--ChHHHHHHHHHHHHhhccCCCC Confidence 999999987644 233 1223567999999986543332 No 79 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=99.86 E-value=7.1e-20 Score=125.56 Aligned_cols=417 Identities=8% Similarity=-0.023 Sum_probs=222.5 Q ss_pred CCCCCCCccCHHHHH-------------HHHHHHHHHHHhcCHH-HHH-------HhhhhcCCCCCCCCHHHHHHHhhcc Q lcl|NC_020844. 1 MPEQDPTKRSADAEA-------------MSDYLQTVDDVLEGVT-GMR-------RNFKRYVKQFPHEQNKRYDLRKSVA 59 (455) Q Consensus 1 m~~~~v~~~~p~y~~-------------~~~~w~~i~d~~~G~~-~~r-------~~g~~yLpk~~~e~~~~Y~~rl~ra 59 (455) |- |...-..+.. ..+.|.. ++.|.. ..+ -++..|- ++.. .|. T Consensus 1 ~~---~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~w~~~~~---~~~~--------~~~ 63 (518) T protein:vir:78 1 MG---VWSVMTRFIKGWLNGKPNGSEPELIPKYLP---LVPDNQKEWSKDSYLTSLWAQGYV---PTVH--------DKL 63 (518) T ss_pred Cc---chhhHHHHHHHhhcCCCCccchhccHHHhh---hcccchhhhhhhhhhhhhcccCCC---Cccc--------ccc Confidence 22 2221111111 1111111 122210 000 0112221 1111 122 Q ss_pred ccCChHHHHHHHHhhhhhcCCceecc------CCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcc Q lcl|NC_020844. 60 KMTNVFRDVLEGLASKPFGREVEVTE------GTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQF 133 (455) Q Consensus 60 ~~~n~~~~~v~~~~g~lf~k~p~i~~------~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~ 133 (455) .-.|.++.+++.++.++|+.+++|+- +++.++.+++.+- +.+.++..+...+..++..|.+++-++... + T Consensus 64 ~~~~l~~~i~~~~A~ll~~e~~~i~v~~~~~~d~e~~~~~l~~il-~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~-- 139 (518) T protein:vir:78 64 MNSGTGNEIVVVAAEYISGKPLSIDVTGVNGSKDENLTKQLKEAL-RIDNFDSKSVKIVELAGGSGVSAVKINILN-G-- 139 (518) T ss_pred ccCChHHHHHHHHHHhhcCCCceEEecCccccCcHHHHHHHHHHH-HhccHHHHHHHHHHHhhccCceEEEEEEEC-C-- Confidence 33689999999999999999998852 3566666666643 346799999999999999999998776532 1 Q ss_pred hhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeeee---EEEEEe---------------cC--eEEEE Q lcl|NC_020844. 134 RNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAE---TILVLR---------------PD--DWERW 193 (455) Q Consensus 134 ~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e---~~~v~~---------------~~--~~~~~ 193 (455) +|-+..++|.+++.... .|.....+++......+ .++.++ ++ .+++| T Consensus 140 -----------~~~i~~v~ad~~~P~~~--~g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly 206 (518) T protein:vir:78 140 -----------RPSISVHSSSQFWIDFK--NNEPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVI 206 (518) T ss_pred -----------eeEEEEEcCCeeEEEee--cCcEEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEe Confidence 35577788888776322 12222233322111101 122111 11 12344 Q ss_pred EEeCCcE--------EE-----EeccC------CCCCCceeEEEEEecccCC----CcccCcchHHHHHHHHHHHHHhHH Q lcl|NC_020844. 194 RKNDAGI--------WE-----VDEFG------RITIGVVPMVPFITGRRKG----KKWVFHPPMKDALDLQVALFRKEC 250 (455) Q Consensus 194 ~~~~~g~--------~~-----~~~~g------~~~l~~vP~V~f~~~~~~~----~~~~~~~pL~dla~l~~~hy~~~s 250 (455) +.+.+.. +. ....+ ..+....||++|+.+...| +-..+.+.+.++.++..+.....+ T Consensus 207 ~~~~~~~v~~~~~~~~~~l~~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s 286 (518) T protein:vir:78 207 KIDGDKTTPISAERLPEQITSYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFT 286 (518) T ss_pred eecCcccccccccccccccccccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHH Confidence 4321110 00 00000 1133457777775443222 222478888888888888888889 Q ss_pred HHHHHHHHcCCceEEEecCCCcccccc---CccceeeccceeeeccCCC-Cccccc--eeEEeeCchhHHHHHHHHHHHH Q lcl|NC_020844. 251 ALEFAEMMTAFPMLSANGVEPDTDSEG---SPKPLDTGPDTVLYAPPNA-EGQHGS--WNFVEPATESLKFLAEQIESVS 324 (455) Q Consensus 251 d~~~~l~~~~~p~~~~~G~~~~~~~~~---~~~~~~~g~~~~~~lp~~~-~~~~~~--~~~le~~~~~l~~~~~~l~~~~ 324 (455) .+.+.+......+.+-..+-......+ ....+..+.+....++.+. .+++++ +..++|.-.. +.+.+.++.+. T Consensus 287 ~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~-e~~~~~~~~~l 365 (518) T protein:vir:78 287 VYMREGEKTKTKIAASERMFRKKVNKSTDKEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRD-GSYRETMEYFA 365 (518) T ss_pred HHHHHHHhCCceeeechhHhccCCCCCCCccccccCCCCceEEEecCcCCCCCccccceeeeecccCh-HHHHHHHHHHH Confidence 999998775555544322221110000 0011222222222232221 122222 3344444332 45666666666 Q ss_pred HHHHHh---hhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC---------CCCceEEEEe Q lcl|NC_020844. 325 KEIREL---GRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDA---------PDTNPEADVY 392 (455) Q Consensus 325 ~~m~~~---g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~---------~~~~~~v~v~ 392 (455) +++... +...+...++.+||++...+.+.+.+.+..+...++.+++++++.+++++.. ......++++ T Consensus 366 ~~~~~~~G~s~~tfg~~~~~~TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~ 445 (518) T protein:vir:78 366 QKAVSKSGYNPATFNLGNREVKATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIE 445 (518) T ss_pred HHHHHhhCCChhhcCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEE Confidence 665332 2222333456799999999999999999999999999999999877655321 1122344444 Q ss_pred cccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCC-CC Q lcl|NC_020844. 393 TDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGI-EE 455 (455) Q Consensus 393 ~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l-~~ 455 (455) ++-.......++++.+.+++++|++|+++.++.+. .++ .+-++++|.+||++|..... -| T Consensus 446 f~D~i~~D~~~~~~~~~~~v~aGimS~e~~i~~~~-~~~--~deea~~e~~ri~~E~~~~~~~~ 506 (518) T protein:vir:78 446 FPDPMSVNLNELSSTLNNMNSALAMSVEEKVKLIH-PKW--EDEEIQAEVKRIYLENAIGEVPD 506 (518) T ss_pred eCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHHhC-CCC--CHHHHHHHHHHHHHHhcccCCCC Confidence 33233333456677788899999999999877541 122 22346788999999854321 11 No 80 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=99.82 E-value=7.3e-19 Score=120.00 Aligned_cols=417 Identities=10% Similarity=-0.011 Sum_probs=226.1 Q ss_pred CCCC---------CCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHH Q lcl|NC_020844. 1 MPEQ---------DPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEG 71 (455) Q Consensus 1 m~~~---------~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~ 71 (455) |..+ +| ...+++.+...+|+.+ |.|.... +.....+ ...+.|... -.|+++.+++. T Consensus 18 ~~~~~~~~~~~~~~i-~~~~~~~~~I~~w~~~---Y~g~~~~-------~~~~~~~--~~~~~~~~~--sl~~~~~i~~~ 82 (517) T protein:vir:98 18 LSGQTLKSINDHEKI-NIDPNELARIERNLRQ---YEGDYPQ-------VEYINSQ--GKIQERDYM--TLNLRKLSADV 82 (517) T ss_pred hcccchhHhhcCCce-ecCHHHHHHHHHHHHH---hcCCCcc-------ccccccc--cccccccee--ecCcHHHHHHH Confidence 2222 22 2356777777888554 7775321 1111111 111111111 25999999999 Q ss_pred HhhhhhcCCceeccCC-----------HHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHH Q lcl|NC_020844. 72 LASKPFGREVEVTEGT-----------GPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEER 140 (455) Q Consensus 72 ~~g~lf~k~p~i~~~~-----------~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~ 140 (455) +++++|..+++|+... ...+++++.+- +.++|...+...+..++..|.+++-++.+.. T Consensus 83 ~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~-~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~---------- 151 (517) T protein:vir:98 83 LSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVF-QHNKFIKNLSDYLEPTFALGGLTVRPYVDNG---------- 151 (517) T ss_pred hhhhhcCCcceEEecccccccccccchhHHHHHHHHHH-HhccHHHHHHHHHHHHhhhCCEEEEEEEeCC---------- Confidence 9999999999885321 12345554443 3467999999999999999999998876532 Q ss_pred hccCCceEEEechhhhccceeeccCCeeEEEEE-EEEeee----eEEEEEec---C-------eE----EEEEEeCC--- Q lcl|NC_020844. 141 QAGVRPYWSQVNHSAILEIQSERQGAGERITYM-RMWEDA----ETILVLRP---D-------DW----ERWRKNDA--- 198 (455) Q Consensus 141 ~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v-~~~E~~----e~~~v~~~---~-------~~----~~~~~~~~--- 198 (455) ++-+..++|.+++.+..+..+ ....+.+ +..... ..|+.++. + .+ ++|+.... T Consensus 152 ----~~~I~~v~ad~~~Pl~~~~~~-v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~l 226 (517) T protein:vir:98 152 ----EIEFSWALANAFYPLRSNSNG-ISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEI 226 (517) T ss_pred ----eeEEEEEcCCeeEEEEecCCC-eEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCccc Confidence 255888899999876654433 2222111 111111 11222221 1 11 23432211 Q ss_pred cEEEE--------ecc-CCCCCCceeEEEEEe---cccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEE Q lcl|NC_020844. 199 GIWEV--------DEF-GRITIGVVPMVPFIT---GRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSA 266 (455) Q Consensus 199 g~~~~--------~~~-g~~~l~~vP~V~f~~---~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~ 266 (455) |.... .+. ..+++.+.+|+.|.+ |..+.....|.+.+.++.++-.+....-+.+.+.+......+.+= T Consensus 227 G~~v~L~~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~~i~vp 306 (517) T protein:vir:98 227 GKRIPLEELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQRTVFVS 306 (517) T ss_pred cccccccccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCcceecC Confidence 11110 011 113455544554432 222222334777777777777777777777777777655555542 Q ss_pred ecCCCcc-ccccCccceeeccceeeeccCCCCccccceeEEeeCchh-HHHHHHHHHHHHHHHHHhhh----hhccCCCc Q lcl|NC_020844. 267 NGVEPDT-DSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATES-LKFLAEQIESVSKEIRELGR----QPLTAQSG 340 (455) Q Consensus 267 ~G~~~~~-~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~-l~~~~~~l~~~~~~m~~~g~----~~l~~~~~ 340 (455) ..+-..+ +..+...+.......-++.+-+.. .++.++-++++.- .+.+.+.++.+.+++...+. .+-..+.+ T Consensus 307 ~~~l~~~~~~~g~~~~~~~d~~~~~y~~~~~~--~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~ 384 (517) T protein:vir:98 307 DVMLRTVPDESGMPPPQVFDPDVNVYKSIRMG--TDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRS 384 (517) T ss_pred hhhhccccCCCCcccCCCCCcccceeeeccCC--CCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccc Confidence 2221111 111111111111111111111111 1122333344322 24566666666665543221 12223345 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc------C-CCCCceEEEEecccCCCCCCHHHHHHHHHHHH Q lcl|NC_020844. 341 NLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWM------D-APDTNPEADVYTDFRADTGEEQTPGYLLTMRQ 413 (455) Q Consensus 341 ~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~------g-~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~ 413 (455) .+||++...+.+.+.+....+...++.+|+++++.+..+. + ..+.+..++++++=.......+++..+.+++. T Consensus 385 ~kTATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~ 464 (517) T protein:vir:98 385 MKTATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRSALLRFYGQAKT 464 (517) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHHHHHHHHHHHHh Confidence 6899999999999999999999999999999988775542 1 11233445555433333334678888999999 Q ss_pred cCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 414 NGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 414 ~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) +|++|.++++..+ .|+ .+-+.++|+.||++|..+ .+. T Consensus 465 aG~ms~~~~i~~~--~g~--~eeeA~~e~~~i~~E~~~-~~~ 501 (517) T protein:vir:98 465 FGFIPTVEAIQRI--FKV--PKKTAEQWLEEIRKDQIE-LDP 501 (517) T ss_pred cCCCCHHHHHHHh--CCC--ChHHHHHHHHHHHHhccc-cCC Confidence 9999999987654 344 233467888999988543 221 No 81 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=99.30 E-value=1.5e-10 Score=74.50 Aligned_cols=428 Identities=12% Similarity=0.090 Sum_probs=189.7 Q ss_pred CCCCCCCccC--HHHHH----HHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCH-----HHHHHHhhccccCChHHHHH Q lcl|NC_020844. 1 MPEQDPTKRS--ADAEA----MSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQN-----KRYDLRKSVAKMTNVFRDVL 69 (455) Q Consensus 1 m~~~~v~~~~--p~y~~----~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~-----~~Y~~rl~ra~~~n~~~~~v 69 (455) |.++ ++.++ |+=.. ....|..+.....+...+|+...+-+-.+.+..+ ...+.+-.-.+.+|.++.+| T Consensus 1 ~~~~-~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v 79 (714) T protein:vir:10 1 MKNE-INTTAMKNDHGSTPRFSQRQLLSLCSDIDSQPLWRDAANKACAYYDGDQLAPEVIQVLKDRGQPMTIHNLIAPTV 79 (714) T ss_pred CCcC-cCcccCCCcchhhhhhhHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHH Confidence 7764 33311 11100 1122333444456666666443322222233333 33445555566799999999 Q ss_pred HHHhhhhhcCCceecc-----C------CHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhh Q lcl|NC_020844. 70 EGLASKPFGREVEVTE-----G------TGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREE 138 (455) Q Consensus 70 ~~~~g~lf~k~p~i~~-----~------~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad 138 (455) +..+|+--...+.+.- + .+.+..++..+- .-++.+.-+..++..++.+|++|+=|....... T Consensus 80 ~~v~g~~~~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~-~~~~~~~~~s~af~~~~~~G~G~~~~~~d~d~~------ 152 (714) T protein:vir:10 80 DGVLGMEAKTRTDLIVMSDDPNDETEKLAEAINAEFADAC-RLGNMNKARSDAYAEQIKAGLSWVEVRRNSEPF------ 152 (714) T ss_pred HHHHHHHHhCCcceEEecCCCChhhHHHHHHHHHHHHHHH-HhhchhHHHHHHHHHhhhcccceEEeeeccCCC------ Confidence 9999999777665531 1 122344443332 345678888999999999999999553322110 Q ss_pred HHhccCCceEEEechhhhc-cceeecc---CCeeEE-------------------------------------------- Q lcl|NC_020844. 139 ERQAGVRPYWSQVNHSAIL-EIQSERQ---GAGERI-------------------------------------------- 170 (455) Q Consensus 139 ~~~~~~rPy~~~~~~~~ii-~w~~~~~---~~~~~l-------------------------------------------- 170 (455) +..+++..++|.+|+ ++..... +-++.+ T Consensus 153 ----~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~~ 228 (714) T protein:vir:10 153 ----GPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPL 228 (714) T ss_pred ----CCCeEEEecChhheeeccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhhcccc Confidence 123455566666554 3321110 011111 Q ss_pred -------------------------EE-EEEEeeeeEEEEEecCeEEEEEEeC----------CcE------------EE Q lcl|NC_020844. 171 -------------------------TY-MRMWEDAETILVLRPDDWERWRKND----------AGI------------WE 202 (455) Q Consensus 171 -------------------------~~-v~~~E~~e~~~v~~~~~~~~~~~~~----------~g~------------~~ 202 (455) .. ..+.+......++.+...++...+. .|. +. T Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~ 308 (714) T protein:vir:10 229 MSAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREA 308 (714) T ss_pred cccchhhcccccccccccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccceeeEEEE Confidence 00 0011111111111111111110000 000 00 Q ss_pred ------EeccC--CCCCCceeEEEEEeccc--CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCc Q lcl|NC_020844. 203 ------VDEFG--RITIGVVPMVPFITGRR--KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPD 272 (455) Q Consensus 203 ------~~~~g--~~~l~~vP~V~f~~~~~--~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~ 272 (455) +...+ +.+.+.+|+|||+.... .+-+. .-+..+.+.+...+...|-+.+++. +.-+.+..|.... T Consensus 309 ~~~g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~---G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~~~~~gav~~ 383 (714) T protein:vir:10 309 WFVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPY---GLISRAIPAQDEVNFRRIKLTWLLQ--AKRVIMDEDATQL 383 (714) T ss_pred EEecchhhhcCCCCCCCCceeeEEecceeeeccCccc---eehhhhhhHHHHHHHHHHHHHHHHh--CCceeeccccccc Confidence 01223 34556789988753322 22122 2233444444444445555666663 3333344454444 Q ss_pred cccccCccceeeccceeeec-cCCCCcc--ccceeEEeeCchhHHHHHHHHHHHHHHHHHhhh---hhccCCCcchhHHH Q lcl|NC_020844. 273 TDSEGSPKPLDTGPDTVLYA-PPNAEGQ--HGSWNFVEPATESLKFLAEQIESVSKEIRELGR---QPLTAQSGNLTRIS 346 (455) Q Consensus 273 ~~~~~~~~~~~~g~~~~~~l-p~~~~~~--~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~---~~l~~~~~~~sa~a 346 (455) ++.+...+.-.- +.++.+ |....+. +.++... ++..--..+...++...+.|..+++ .++...+++.||+| T Consensus 384 ~d~~~~e~~~rp--~~vi~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvA 460 (714) T protein:vir:10 384 SDNDLMEQLERP--DGIIKLNPVRKNQKSVADVFRVE-QDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVA 460 (714) T ss_pred cHHHHHHhccCC--CCeEEecccccccCCcccccccc-CCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCcchhHHHH Confidence 333332211111 122222 2111111 1223322 2222234556667777777766643 23444455689999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHcCCC---------CCc---eEEEEec--------------ccC Q lcl|NC_020844. 347 AAQAASKGNSAVQQWAAGLKDALE----NALYITNLWMDAP---------DTN---PEADVYT--------------DFR 396 (455) Q Consensus 347 ~~~~~~~~~s~L~~~a~~~~~a~~----~~l~~~a~~~g~~---------~~~---~~v~v~~--------------df~ 396 (455) +..+..+....|..+..++..+.. .+|.++.+|++.. +.. ..+.+|. .|+ T Consensus 461 I~~r~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~~d 540 (714) T protein:vir:10 461 ISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTH 540 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCccccccceeeeEE Confidence 887777766667767777666554 4666777776431 110 1233332 111 Q ss_pred C------C--CCCHHHHHHHHHHHHcC-----CCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCC-C Q lcl|NC_020844. 397 A------D--TGEEQTPGYLLTMRQNG-----DISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIE-E 455 (455) Q Consensus 397 ~------~--~~~~~~~~al~~~~~~G-----~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~-~ 455 (455) . . ....+.+++++++.... .+....+++.+ + .-..++..++|.+.....-. + T Consensus 541 v~i~~~p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~------d-~p~~~ei~~~ir~~~~~~~~~~ 606 (714) T protein:vir:10 541 IALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLL------D-VPQKQEFVERIRAALGTPKSPD 606 (714) T ss_pred EEEeeccCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhc------C-CcCHHHHHHHHHHHcCCCCCcc Confidence 1 0 11123455666665431 12222222221 1 11234455555444221100 0 No 82 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.25 E-value=6.9e-11 Score=76.29 Aligned_cols=413 Identities=9% Similarity=-0.022 Sum_probs=192.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHh--hhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRN--FKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~--g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |...|=...|. ....+....-+-.-+ |...-|.. ...|....--.-...+..|.. ..+.+++|+..++..++ T Consensus 1 ~~~~~~a~~~~-~~~~a~~~~~~~~~~-g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~----~~l~r~iVd~~a~d~~r 74 (461) T protein:vir:80 1 MYSIDKAKQAK-IDSKIVNRNDFMVGH-GKANSRDKLTRQTPGNGQKLDLKACENLYAS----NSIAMNIVDIISEDMVR 74 (461) T ss_pred Cccchhhhhhh-hhhhhhhhhHHHhhc-CCcchhhhhhccccCcccccCHHHHHHHHHh----CCccchhhccchHHhhc Confidence 88765333222 122222111111111 11111111 112222111111222344443 46778899999999999 Q ss_pred CCceeccCCHH----HHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCc---chhh-hhHHhccCCceEEE Q lcl|NC_020844. 79 REVEVTEGTGP----SEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQ---FRNR-EEERQAGVRPYWSQ 150 (455) Q Consensus 79 k~p~i~~~~~~----l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~---~~t~-ad~~~~~~rPy~~~ 150 (455) +++.|++..+. ++.+++. ..+...++.+.+.+..+|.+++++....... ..+. -+...-+...|+.. T Consensus 75 ~g~~i~~~~~~~~~~~~~~~~~-----l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl~~~~~~~~~~l~~ 149 (461) T protein:vir:80 75 AGWSLKTDNKEMKKNIESKWRK-----LKTKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLSTAIDPKTIKSIPYINT 149 (461) T ss_pred CCeeeecCCHHHHHHHHHHHHH-----hhHHHHHHHHHHhhcccccEEEEEEeecCCccccCccCCcccccccceeEEEe Confidence 99999765544 3444433 3578889999999999999999986422111 0000 00000011112222 Q ss_pred echhhhcc--ceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccC---CCCCCceeEEEEEecccC Q lcl|NC_020844. 151 VNHSAILE--IQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFG---RITIGVVPMVPFITGRRK 225 (455) Q Consensus 151 ~~~~~ii~--w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g---~~~l~~vP~V~f~~~~~~ 225 (455) +.+.+|.. |..+..+ -.+..|..|++......+.....+.. ...+..-+++.|.+.+.. T Consensus 150 ~~~~~i~~~~~~~dp~s----------------p~fg~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SRii~~~~~~~~ 213 (461) T protein:vir:80 150 FNTQKVTQLYLNQDMFS----------------EHFGEVEFFEVNRVSQLGEEILSGTTASTSEQIHRSRIIHEQGLRFE 213 (461) T ss_pred ccccccchhhhcccCcC----------------cccccceEEEEeccccccccccccccCccceEEccccEEEecCCCCC Confidence 22222211 0000000 00122222222211111111111111 112444566666544443 Q ss_pred CCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccc-cC--c-cceeeccceeeeccCCCCcccc Q lcl|NC_020844. 226 GKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSE-GS--P-KPLDTGPDTVLYAPPNAEGQHG 301 (455) Q Consensus 226 ~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~-~~--~-~~~~~g~~~~~~lp~~~~~~~~ 301 (455) .. .-+.|-|..+.+...+.-+....-.++++...++++.+.|+.....+. .. . -....+...+..+..+ . T Consensus 214 ~~-~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~~~~~~~~g~~~~d~~-----e 287 (461) T protein:vir:80 214 GE-TKGRSIFESLYDIITVMDTSLWSVGQILYDFAFKVYKTDDIDALNKDDKANLTAMLDFMFRTEALAIIKGD-----E 287 (461) T ss_pred cc-ccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecchHHhhhchHHHHHHHHHHHhcCCceEEEEcCC-----c Confidence 32 336666666666554444444455677888888888887765321111 10 0 0111122233334322 2 Q ss_pred ceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhcc---CC--CcchhHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHH Q lcl|NC_020844. 302 SWNFVEPATESLKFLAEQIESVSKEIRELGRQPLT---AQ--SGNLTRISAAQAASKGNSAVQQWAA-GLKDALENALYI 375 (455) Q Consensus 302 ~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~---~~--~~~~sa~a~~~~~~~~~s~L~~~a~-~~~~a~~~~l~~ 375 (455) ++..++.+.++ +.+.++...++|......|.. ++ +++.||. .+...-+..+..+.. .+...+++++++ T Consensus 288 ~~e~~~~~lsg---l~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asge---~D~~~yyd~i~~~qe~~l~p~le~l~~~ 361 (461) T protein:vir:80 288 QLTKESTNVSG---MKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGAQ---YDVMNYYARVSSIQENRLRPQLEYLTRL 361 (461) T ss_pred ceEEEecCcCC---HHHHHHHHHHHHhhhhcCCeeeeecccCCccccch---HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 45555555444 456677777777776665543 22 2333433 233445566666664 467788888887 Q ss_pred HHHHcCC-----CCCceEEEEecccCCCCCCHHH--------HHHHHHHHHcCCCCHHHHHHHHHhc-CCCCC----CCC Q lcl|NC_020844. 376 TNLWMDA-----PDTNPEADVYTDFRADTGEEQT--------PGYLLTMRQNGDISREGLWMEFQRL-GELSD----NFD 437 (455) Q Consensus 376 ~a~~~g~-----~~~~~~v~v~~df~~~~~~~~~--------~~al~~~~~~G~is~et~~~~l~~~-~vl~~----~~d 437 (455) +..-++. ++....+++.++-.. .+++.+ ++++.++.++|+||.++.++.|+.+ ++.+. ..+ T Consensus 362 i~~s~~~~~~~~~p~~~~~~i~f~~L~-~~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~~~~~~~~~~~~~~~~ 440 (461) T protein:vir:80 362 LMWASDDCGPSIDPDSFEWAIEFNPLW-NLDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRFGRFGLENSSKFSGDS 440 (461) T ss_pred HHHHhcccccccCccccceEEEeCCCC-CCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCCCCccCCCCC Confidence 7543321 221122333321111 122222 3678888999999999999998643 33222 233 Q ss_pred HHHHHHHHHhhcc-cCCCC Q lcl|NC_020844. 438 PEEESERLISELP-GGIEE 455 (455) Q Consensus 438 ~e~e~~ri~~e~~-~~l~~ 455 (455) ++ .+.++++.+ +.-+| T Consensus 441 ~~--~~~~~~~~~~~~~~e 457 (461) T protein:vir:80 441 AE--IDKLAKLVYDAYAKK 457 (461) T ss_pred ch--hhhhhhhcccccccc Confidence 33 223332211 22233 No 83 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.23 E-value=2.2e-10 Score=73.54 Aligned_cols=426 Identities=13% Similarity=0.107 Sum_probs=188.0 Q ss_pred CCCCC-CCccCH-------HHHHH-HHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCH-----HHHHHHhhccccCChHH Q lcl|NC_020844. 1 MPEQD-PTKRSA-------DAEAM-SDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQN-----KRYDLRKSVAKMTNVFR 66 (455) Q Consensus 1 m~~~~-v~~~~p-------~y~~~-~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~-----~~Y~~rl~ra~~~n~~~ 66 (455) ||+.+ +...++ +..+. ..-++.++..+...+.+|+....-+-.+.+..+ ...+.+-...+.+|.++ T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~ 102 (776) T protein:vir:93 23 SPGEDAAQREKPANPLDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQWSQDEIDELKERGQAPTVYNVIS 102 (776) T ss_pred CCCCcccchhcccCCCCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHHHHHHhcCCceEEecchH Confidence 45443 222232 11111 111222333334444444322111111233333 33444555567799999 Q ss_pred HHHHHHhhhhhcCCceec-----c----CCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEE--eeCCCCcchh Q lcl|NC_020844. 67 DVLEGLASKPFGREVEVT-----E----GTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRV--DYPAGQQFRN 135 (455) Q Consensus 67 ~~v~~~~g~lf~k~p~i~-----~----~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lV--d~p~~~~~~t 135 (455) .+|+..+|+.....+.+. . ..+.+..++..+. .-++.+..+..++..++.+|++|+=| |+...+... T Consensus 103 ~~i~~v~g~~~~nr~~~~~~p~~~~d~~~Ae~l~~~~~~~~-~~~~~~~~~~~af~d~~~~G~G~~~v~~d~~~~~~~~- 180 (776) T protein:vir:93 103 QSVNWIIGSEKRGRSDFKVLPRRKDGGKAAERKTALLKYLS-DVNHTPFERSMAFEETTKAGIGWLESQVQDENDGEPI- 180 (776) T ss_pred HHHHHHHHHHHhCCcceEEecCChhHHHHHHHHHHHHHHHH-HhhcHHHHHHHHHHHhhhcCcceEEEEeeccCCCCce- Confidence 999999999987765553 1 1233555665553 45678888999999999999999876 433322211 Q ss_pred hhhHHhccCCceEEEechhhhc-cceeecc---CCeeEEEE--------------------------------------- Q lcl|NC_020844. 136 REEERQAGVRPYWSQVNHSAIL-EIQSERQ---GAGERITY--------------------------------------- 172 (455) Q Consensus 136 ~ad~~~~~~rPy~~~~~~~~ii-~w~~~~~---~~~~~l~~--------------------------------------- 172 (455) +...++|.+|+ ++..... +-++.+.. T Consensus 181 -----------~~~~~~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~ 249 (776) T protein:vir:93 181 -----------YAGAESWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDD 249 (776) T ss_pred -----------EeeccChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhcccccc Confidence 11112222222 1111000 00000000 Q ss_pred -EE---------------EEeeeeEEEEEec---------------CeE--EEEEEe--------CCcE----------- Q lcl|NC_020844. 173 -MR---------------MWEDAETILVLRP---------------DDW--ERWRKN--------DAGI----------- 200 (455) Q Consensus 173 -v~---------------~~E~~e~~~v~~~---------------~~~--~~~~~~--------~~g~----------- 200 (455) .. ......+++|.+. +.+ .+|... ..|. T Consensus 250 ~~~~~~~~~~~~~~~~~~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v 329 (776) T protein:vir:93 250 AMDSPEYERSMNSVTAGAVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRM 329 (776) T ss_pred cccccccccccccccccccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehheeeeee Confidence 00 0000011222110 000 000000 0000 Q ss_pred -EE------Ee--ccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCC Q lcl|NC_020844. 201 -WE------VD--EFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEP 271 (455) Q Consensus 201 -~~------~~--~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~ 271 (455) |. +. ...+.+.+.+|||||+........ .+..-+..+.+.+..++...|-+.+++ .+.++.+-.|... T Consensus 330 ~~~~~~g~~~l~~~~~p~~~~~~Pfv~~~~~~~~~~~-~~~G~v~~~~d~Q~~~N~~~s~~~~~l--~~~~~~~~~gav~ 406 (776) T protein:vir:93 330 HCAIMTTRDLMWAGPSPYRHNRYPFTPIWGFRRARDG-MPYGVIRFMRGMQDDVNKRLSKALYIL--STNKVLMEEGAVD 406 (776) T ss_pred EEEEEecchhhhccCCCCCCCccceEEecCceecccc-cccchHHhhhHHHHHHHHHHHHHHHhh--cCCceeecccccc Confidence 10 01 112345689999998765554433 344555677777777776667777765 3445555555332 Q ss_pred ccccccCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhh---hccCCCcchhHHHHH Q lcl|NC_020844. 272 DTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQ---PLTAQSGNLTRISAA 348 (455) Q Consensus 272 ~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~---~l~~~~~~~sa~a~~ 348 (455) .. ++..... .-++.++.+-.++ .+...+.. ...-...+.+.++...+.|..+++. ++...+++.||+++. T Consensus 407 ~~-d~~~~~~--~rp~~vi~~~~~~---~~~~~~~~-~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~ 479 (776) T protein:vir:93 407 DI-DEFRREA--ARPDAVMTVKNGK---LGAVKMDV-DRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTNAVSGVAIQ 479 (776) T ss_pred ch-HHHHHhc--ccCCceeeeCCcc---cccccccc-CcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCcchhhHHHHH Confidence 22 1111111 1123333331111 12333333 2222244566667777777666432 233344558998888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHcCC---------CCCceEEEEec----------ccCCC------- Q lcl|NC_020844. 349 QAASKGNSAVQQWAAGLKDALEN----ALYITNLWMDA---------PDTNPEADVYT----------DFRAD------- 398 (455) Q Consensus 349 ~~~~~~~s~L~~~a~~~~~a~~~----~l~~~a~~~g~---------~~~~~~v~v~~----------df~~~------- 398 (455) ....+....+..+..++..++.. +|.++.+|++. ++..-.|.||. .|+.. T Consensus 480 ~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~~~~ 559 (776) T protein:vir:93 480 ARQEQGSVATNKLFDNLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEAEWR 559 (776) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeecccc Confidence 87777767777777776666554 55667777642 11111334431 12110 Q ss_pred -CCCHHHHHHHHHHHHcCCCCHHHH---HHH-HHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 399 -TGEEQTPGYLLTMRQNGDISREGL---WME-FQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 399 -~~~~~~~~al~~~~~~G~is~et~---~~~-l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) ....+...+++++... +..+.. ... +...++ -..++-.+++++.....-.+ T Consensus 560 ~s~r~~~~~~l~ql~~~--~~p~~~~~~~~~~~e~~d~----p~~~e~~~~l~~~~~~~~p~ 615 (776) T protein:vir:93 560 ATMRQAAVAELMEVIGK--MPPEIALTMLDLLVENMDI----PNRDELVKRIRAVNGQKDPD 615 (776) T ss_pred hhHHHHHHHHHHHHHhh--cChhhHHHHHHHHHHhcCc----cchHHHHHHHHHhhcccccc Confidence 1112234444444332 222111 111 111111 12334444554432211111 No 84 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=99.21 E-value=5.8e-10 Score=71.24 Aligned_cols=427 Identities=12% Similarity=0.066 Sum_probs=191.9 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCC-----CCCCCCHHHHHHHhhccccCChHHHHHHHHhhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVK-----QFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASK 75 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLp-----k~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~ 75 (455) -|++.=-..|.+ .|..+.....+...+|+...+-+- +++.+..+..+.+-.-.+.+|.++.+|+..+|+ T Consensus 14 ~~~~~~~~~~~~------~~~~~~~~~~~q~~~r~~a~~d~~fy~G~QW~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~ 87 (772) T protein:vir:10 14 LPPAGDTPLTVD------EYADINYEIEDQPAWRAVADKEMDYADGNQLDTELLRRQQALGIPPAVEDLIGPALLSLQGY 87 (772) T ss_pred CCcccccccCHH------HHHHHHHHHhccHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEEcchHHHHHHHHHH Confidence 121111122222 244455556666666654433222 223333344555655667799999999999999 Q ss_pred hhcCCceecc------C----CHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCC Q lcl|NC_020844. 76 PFGREVEVTE------G----TGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVR 145 (455) Q Consensus 76 lf~k~p~i~~------~----~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~r 145 (455) --...+.+.- . .+.+..++..+- .-++.+.-+..++.+++++|++|+=|++..+... .. T Consensus 88 ~~~nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~-~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d~~~----------~~ 156 (772) T protein:vir:10 88 EAVTRTDWRVTPNGDVGGQEVADALNYRLNTAE-RQSGADRACSEAFRPQIACGIGWVEVSRESDPFK----------FP 156 (772) T ss_pred HHhcCcceEEecCCCchHHHHHHHHHHHHHHHH-HhcChHHHHHHHHHHhhhcCceeEEeccccCCCC----------CC Confidence 9877765531 1 122444444332 3467888899999999999999998876543211 11 Q ss_pred ceEEEechhhhc-cceeec--cCCeeE----------------------------------------------------- Q lcl|NC_020844. 146 PYWSQVNHSAIL-EIQSER--QGAGER----------------------------------------------------- 169 (455) Q Consensus 146 Py~~~~~~~~ii-~w~~~~--~~~~~~----------------------------------------------------- 169 (455) .++..++|.+|+ ++.... .+-++. T Consensus 157 i~i~~v~p~~v~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (772) T protein:vir:10 157 YRCRPIRRDEIHWDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTSTGLHNAW 236 (772) T ss_pred eEEEeeCcccceecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCccccccccccccccccccc Confidence 234444444432 222111 000000 Q ss_pred --------------------EEEEE-EEeeeeEEEEEec--CeEE---------------------------EEEEeCCc Q lcl|NC_020844. 170 --------------------ITYMR-MWEDAETILVLRP--DDWE---------------------------RWRKNDAG 199 (455) Q Consensus 170 --------------------l~~v~-~~E~~e~~~v~~~--~~~~---------------------------~~~~~~~g 199 (455) ++.+. |.+...+..++.+ +++. +++..-.| T Consensus 237 ~~~~~~~~~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~~~rv~~~~~~g 316 (772) T protein:vir:10 237 NEARAWTVQEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVTVSRVRRSYWLG 316 (772) T ss_pred chhhccccccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheeeeeEEEEEEEec Confidence 00000 0000001111111 1100 11100011 Q ss_pred EEEEe-ccCCCCCCceeEEEEEeccc--CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccc Q lcl|NC_020844. 200 IWEVD-EFGRITIGVVPMVPFITGRR--KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSE 276 (455) Q Consensus 200 ~~~~~-~~g~~~l~~vP~V~f~~~~~--~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~ 276 (455) ...+. ...+.+-+.+|+|||+.... .+-+. .-+.++.+.+...+...|-+.+++.-.+ +..-.|.....+.+ T Consensus 317 ~~~L~~~~~p~~~~~fP~vP~~g~r~~~~g~~~---G~vr~~kd~Qr~~N~~~S~~~~~l~~~~--~~~~~gav~~~d~~ 391 (772) T protein:vir:10 317 PHCLHDGPTPYTHRHFPYVPFFGFREDATGIPY---GYVRGMKYAQDSLNSGVSKLRWGMSVAR--VERTKGAVAMTDAQ 391 (772) T ss_pred ceeeccCCCCCCCCccceEEEeeeEeccCCccc---chhhhhhhHHHHHHHHHHHHHHHHhccc--ccccCCCccchhHH Confidence 11121 12334567799999864332 22222 2244566666666666777777765433 33333443333322 Q ss_pred cCccceeeccceeeeccCCC-CccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhh---hhccCCCcchhHHHHHHHHH Q lcl|NC_020844. 277 GSPKPLDTGPDTVLYAPPNA-EGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGR---QPLTAQSGNLTRISAAQAAS 352 (455) Q Consensus 277 ~~~~~~~~g~~~~~~lp~~~-~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~---~~l~~~~~~~sa~a~~~~~~ 352 (455) +....-. ++.++.+-.++ ...+.++...+ +..--..+.+.|+...+.|..+++ -++...+++.||+|+..+.. T Consensus 392 ~~e~~ar--p~~vi~~~~~~~~~~~~~~~~~~-~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~na~SGvAi~~rq~ 468 (772) T protein:vir:10 392 FRRQIAR--PDADIVLDENHMAKPGARFDVKR-DYTLTDQHFQMLQDNRATIERVSNITAGFQGRKGTATSGIQEQQQIE 468 (772) T ss_pred HHHhccC--CCCeEEeCCccccCCCCCccccC-CccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCcchhhHHHHHHHHH Confidence 2211111 12333332221 11234455443 232334566677777777777643 23333455689999887777 Q ss_pred HHHHHHHHHHHHHHHHH----HHHHHHHHHHcCCC---------CC--ceEEEEec--------------ccCCC----- Q lcl|NC_020844. 353 KGNSAVQQWAAGLKDAL----ENALYITNLWMDAP---------DT--NPEADVYT--------------DFRAD----- 398 (455) Q Consensus 353 ~~~s~L~~~a~~~~~a~----~~~l~~~a~~~g~~---------~~--~~~v~v~~--------------df~~~----- 398 (455) +....|..+-.|+..+. +.+|.++.+|++.. +. .-.+.+|. |.... T Consensus 469 qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~yDv~ 548 (772) T protein:vir:10 469 QSNQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRIKVA 548 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccceeccceeeeEEEE Confidence 66666666666666554 55677888887531 10 11133331 10000 Q ss_pred --------CCCHHHHHHHHHHHHcCCCCHHHHHHHHHh-cCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 399 --------TGEEQTPGYLLTMRQNGDISREGLWMEFQR-LGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 399 --------~~~~~~~~al~~~~~~G~is~et~~~~l~~-~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) ....+.+++++++... ++.+..-.-+-- .+..+ .-..++-.++|.+.....--| T Consensus 549 i~~~p~~~t~r~~~~~~m~ql~~~--~~P~~~~~~~~~~le~~D-~p~~~ei~~~ir~~~~~~~pe 611 (772) T protein:vir:10 549 LEDVPSTNSYRGQQLNAMSEAVKS--MPPQYQAAVLPFLVSLMD-VPFKRDVVEAIRAVDQQQTPE 611 (772) T ss_pred eeccccchHHHHHHHHHHHHHHhc--cChhHHHHHHHHHHhhcC-CCChHHHHHHHHHHhccCChH Confidence 0112334455554322 233321110000 00000 011233444444332221111 No 85 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=99.18 E-value=8.1e-10 Score=70.42 Aligned_cols=426 Identities=11% Similarity=0.100 Sum_probs=189.6 Q ss_pred CCCCCCCcc-CHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCC-----CHHHHHHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQDPTKR-SADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHE-----QNKRYDLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~v~~~-~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e-----~~~~Y~~rl~ra~~~n~~~~~v~~~~g 74 (455) |++..-+-. ...+...+.. +...+.....+|+....-+-.+.+. .....+.+-.-.+.+|.++.+|+..+| T Consensus 8 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g 84 (714) T protein:vir:10 8 MATKNDNGATPRFSQRQLQA---LCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLG 84 (714) T ss_pred ccCCCCcchhHHHHHHHHHH---HHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHh Confidence 665532211 1233333332 3333444555554332211112232 223344555556679999999999999 Q ss_pred hhhcCCceecc-----C------CHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhcc Q lcl|NC_020844. 75 KPFGREVEVTE-----G------TGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAG 143 (455) Q Consensus 75 ~lf~k~p~i~~-----~------~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~ 143 (455) +--...+.+.- + .+.+..++..+- .-++.+.-...++..++.+|++|+=|....... + T Consensus 85 ~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~-~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~----------~ 153 (714) T protein:vir:10 85 MEAKTRTDLVVMSDEPDDETEKLAEAINAEFADAC-RLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPF----------G 153 (714) T ss_pred HHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHH-HhhchhHHHHHHHHHhhhcCcceEEeccccCCC----------C Confidence 99776665531 1 122344444333 245688888999999999999997765433211 1 Q ss_pred CCceEEEechhhhc-cceeec---cCCeeEEEE----------------------------------------------- Q lcl|NC_020844. 144 VRPYWSQVNHSAIL-EIQSER---QGAGERITY----------------------------------------------- 172 (455) Q Consensus 144 ~rPy~~~~~~~~ii-~w~~~~---~~~~~~l~~----------------------------------------------- 172 (455) ..+++..++|.+|+ ++.... .+-++.+.. T Consensus 154 ~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:10 154 PEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred CCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 22456666666654 332211 111121110 Q ss_pred -----------------------EEEEeeeeEEEEEecCeEEEEEEeCC----------c--------------EEE--- Q lcl|NC_020844. 173 -----------------------MRMWEDAETILVLRPDDWERWRKNDA----------G--------------IWE--- 202 (455) Q Consensus 173 -----------------------v~~~E~~e~~~v~~~~~~~~~~~~~~----------g--------------~~~--- 202 (455) ..+.+...++.++.+...++...... | .|. T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~ 313 (714) T protein:vir:10 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGP 313 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecC Confidence 00000001111111110000000000 0 000 Q ss_pred -EeccC--CCCCCceeEEEEEeccc--CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcccccc Q lcl|NC_020844. 203 -VDEFG--RITIGVVPMVPFITGRR--KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEG 277 (455) Q Consensus 203 -~~~~g--~~~l~~vP~V~f~~~~~--~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~ 277 (455) +...+ +.+-+.+|+|||+.... .+-+.. -+.++.+.+...+...|-+.+++ .+.-+.+..|-...++.+. T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G---~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~~~ 388 (714) T protein:vir:10 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG---LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceee---hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHHHH Confidence 11122 34456799999864432 222221 22344444444444455566665 3333444445444433332 Q ss_pred Cccceeeccceeeec-cCCCCcc--ccceeEEeeCchhHHHHHHHHHHHHHHHHHhhh---hhccCCCcchhHHHHHHHH Q lcl|NC_020844. 278 SPKPLDTGPDTVLYA-PPNAEGQ--HGSWNFVEPATESLKFLAEQIESVSKEIRELGR---QPLTAQSGNLTRISAAQAA 351 (455) Q Consensus 278 ~~~~~~~g~~~~~~l-p~~~~~~--~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~---~~l~~~~~~~sa~a~~~~~ 351 (455) ....-.- +.++.+ |....+. ...+.... +..--..+.+.++...+.|..+++ .++...+++.||+++..+. T Consensus 389 ~e~~arp--~~vi~~~p~~~~~~~~~~~~~~~~-~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq 465 (714) T protein:vir:10 389 MEQIERP--DGIIKLNPVRKNQKSVADVFRVEQ-DFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLV 465 (714) T ss_pred HHhccCC--CCceeecccccccCCCCccccccC-CCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHH Confidence 2111111 122222 3211111 12233222 222334556667777777766643 2344445568999888877 Q ss_pred HHHHHHHHHHHHHHHHHHHH----HHHHHHHHcCCC---------CCc---eEEEEec----------------ccCCCC Q lcl|NC_020844. 352 SKGNSAVQQWAAGLKDALEN----ALYITNLWMDAP---------DTN---PEADVYT----------------DFRADT 399 (455) Q Consensus 352 ~~~~s~L~~~a~~~~~a~~~----~l~~~a~~~g~~---------~~~---~~v~v~~----------------df~~~~ 399 (455) .+....|..+-.|+..+..+ +|.++.+|++.. +.. -.+.+|. |+.... T Consensus 466 ~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~ 545 (714) T protein:vir:10 466 EQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAP 545 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEee Confidence 77666666666666665544 566777877531 111 0233331 111111 Q ss_pred C------CHHHHHHHHHHHHc-----CCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCC-C Q lcl|NC_020844. 400 G------EEQTPGYLLTMRQN-----GDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIE-E 455 (455) Q Consensus 400 ~------~~~~~~al~~~~~~-----G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~-~ 455 (455) . -.+...+++++.+. +.+....+++.+ + .-..++-.++|.+.....-. + T Consensus 546 ~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~------d-~p~~~el~~~ir~~~~~~~~~~ 606 (714) T protein:vir:10 546 VQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLL------D-VPQKQEFVERIRAALGTPKSPD 606 (714) T ss_pred ccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhc------C-CCCHHHHHHHHHHHcCCCCCcc Confidence 1 13445666666543 112222232222 1 12344555555444221110 0 No 86 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=99.18 E-value=8.1e-10 Score=70.42 Aligned_cols=426 Identities=11% Similarity=0.100 Sum_probs=189.6 Q ss_pred CCCCCCCcc-CHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCC-----CHHHHHHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQDPTKR-SADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHE-----QNKRYDLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~v~~~-~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e-----~~~~Y~~rl~ra~~~n~~~~~v~~~~g 74 (455) |++..-+-. ...+...+.. +...+.....+|+....-+-.+.+. .....+.+-.-.+.+|.++.+|+..+| T Consensus 8 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g 84 (714) T protein:vir:32 8 MATKNDNGATPRFSQRQLQA---LCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLG 84 (714) T ss_pred ccCCCCcchhHHHHHHHHHH---HHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHh Confidence 665532211 1233333332 3333444555554332211112232 223344555556679999999999999 Q ss_pred hhhcCCceecc-----C------CHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhcc Q lcl|NC_020844. 75 KPFGREVEVTE-----G------TGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAG 143 (455) Q Consensus 75 ~lf~k~p~i~~-----~------~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~ 143 (455) +--...+.+.- + .+.+..++..+- .-++.+.-...++..++.+|++|+=|....... + T Consensus 85 ~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~-~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~----------~ 153 (714) T protein:vir:32 85 MEAKTRTDLVVMSDEPDDETEKLAEAINAEFADAC-RLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPF----------G 153 (714) T ss_pred HHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHH-HhhchhHHHHHHHHHhhhcCcceEEeccccCCC----------C Confidence 99776665531 1 122344444333 245688888999999999999997765433211 1 Q ss_pred CCceEEEechhhhc-cceeec---cCCeeEEEE----------------------------------------------- Q lcl|NC_020844. 144 VRPYWSQVNHSAIL-EIQSER---QGAGERITY----------------------------------------------- 172 (455) Q Consensus 144 ~rPy~~~~~~~~ii-~w~~~~---~~~~~~l~~----------------------------------------------- 172 (455) ..+++..++|.+|+ ++.... .+-++.+.. T Consensus 154 ~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:32 154 PEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred CCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 22456666666654 332211 111121110 Q ss_pred -----------------------EEEEeeeeEEEEEecCeEEEEEEeCC----------c--------------EEE--- Q lcl|NC_020844. 173 -----------------------MRMWEDAETILVLRPDDWERWRKNDA----------G--------------IWE--- 202 (455) Q Consensus 173 -----------------------v~~~E~~e~~~v~~~~~~~~~~~~~~----------g--------------~~~--- 202 (455) ..+.+...++.++.+...++...... | .|. T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~ 313 (714) T protein:vir:32 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGP 313 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecC Confidence 00000001111111110000000000 0 000 Q ss_pred -EeccC--CCCCCceeEEEEEeccc--CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcccccc Q lcl|NC_020844. 203 -VDEFG--RITIGVVPMVPFITGRR--KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEG 277 (455) Q Consensus 203 -~~~~g--~~~l~~vP~V~f~~~~~--~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~ 277 (455) +...+ +.+-+.+|+|||+.... .+-+.. -+.++.+.+...+...|-+.+++ .+.-+.+..|-...++.+. T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G---~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~~~ 388 (714) T protein:vir:32 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG---LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceee---hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHHHH Confidence 11122 34456799999864432 222221 22344444444444455566665 3333444445444433332 Q ss_pred Cccceeeccceeeec-cCCCCcc--ccceeEEeeCchhHHHHHHHHHHHHHHHHHhhh---hhccCCCcchhHHHHHHHH Q lcl|NC_020844. 278 SPKPLDTGPDTVLYA-PPNAEGQ--HGSWNFVEPATESLKFLAEQIESVSKEIRELGR---QPLTAQSGNLTRISAAQAA 351 (455) Q Consensus 278 ~~~~~~~g~~~~~~l-p~~~~~~--~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~---~~l~~~~~~~sa~a~~~~~ 351 (455) ....-.- +.++.+ |....+. ...+.... +..--..+.+.++...+.|..+++ .++...+++.||+++..+. T Consensus 389 ~e~~arp--~~vi~~~p~~~~~~~~~~~~~~~~-~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq 465 (714) T protein:vir:32 389 MEQIERP--DGIIKLNPVRKNQKSVADVFRVEQ-DFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLV 465 (714) T ss_pred HHhccCC--CCceeecccccccCCCCccccccC-CCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHH Confidence 2111111 122222 3211111 12233222 222334556667777777766643 2344445568999888877 Q ss_pred HHHHHHHHHHHHHHHHHHHH----HHHHHHHHcCCC---------CCc---eEEEEec----------------ccCCCC Q lcl|NC_020844. 352 SKGNSAVQQWAAGLKDALEN----ALYITNLWMDAP---------DTN---PEADVYT----------------DFRADT 399 (455) Q Consensus 352 ~~~~s~L~~~a~~~~~a~~~----~l~~~a~~~g~~---------~~~---~~v~v~~----------------df~~~~ 399 (455) .+....|..+-.|+..+..+ +|.++.+|++.. +.. -.+.+|. |+.... T Consensus 466 ~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~ 545 (714) T protein:vir:32 466 EQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAP 545 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEee Confidence 77666666666666665544 566777877531 111 0233331 111111 Q ss_pred C------CHHHHHHHHHHHHc-----CCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCC-C Q lcl|NC_020844. 400 G------EEQTPGYLLTMRQN-----GDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIE-E 455 (455) Q Consensus 400 ~------~~~~~~al~~~~~~-----G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~-~ 455 (455) . -.+...+++++.+. +.+....+++.+ + .-..++-.++|.+.....-. + T Consensus 546 ~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~------d-~p~~~el~~~ir~~~~~~~~~~ 606 (714) T protein:vir:32 546 VQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLL------D-VPQKQEFVERIRAALGTPKSPD 606 (714) T ss_pred ccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhc------C-CCCHHHHHHHHHHHcCCCCCcc Confidence 1 13445666666543 112222232222 1 12344555555444221110 0 No 87 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=99.18 E-value=8.1e-10 Score=70.42 Aligned_cols=426 Identities=11% Similarity=0.100 Sum_probs=189.6 Q ss_pred CCCCCCCcc-CHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCC-----CHHHHHHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQDPTKR-SADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHE-----QNKRYDLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~v~~~-~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e-----~~~~Y~~rl~ra~~~n~~~~~v~~~~g 74 (455) |++..-+-. ...+...+.. +...+.....+|+....-+-.+.+. .....+.+-.-.+.+|.++.+|+..+| T Consensus 8 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g 84 (714) T protein:vir:81 8 MATKNDNGATPRFSQRQLQA---LCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLG 84 (714) T ss_pred ccCCCCcchhHHHHHHHHHH---HHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHh Confidence 665532211 1233333332 3333444555554332211112232 223344555556679999999999999 Q ss_pred hhhcCCceecc-----C------CHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhcc Q lcl|NC_020844. 75 KPFGREVEVTE-----G------TGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAG 143 (455) Q Consensus 75 ~lf~k~p~i~~-----~------~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~ 143 (455) +--...+.+.- + .+.+..++..+- .-++.+.-...++..++.+|++|+=|....... + T Consensus 85 ~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~-~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~----------~ 153 (714) T protein:vir:81 85 MEAKTRTDLVVMSDEPDDETEKLAEAINAEFADAC-RLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPF----------G 153 (714) T ss_pred HHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHH-HhhchhHHHHHHHHHhhhcCcceEEeccccCCC----------C Confidence 99776665531 1 122344444333 245688888999999999999997765433211 1 Q ss_pred CCceEEEechhhhc-cceeec---cCCeeEEEE----------------------------------------------- Q lcl|NC_020844. 144 VRPYWSQVNHSAIL-EIQSER---QGAGERITY----------------------------------------------- 172 (455) Q Consensus 144 ~rPy~~~~~~~~ii-~w~~~~---~~~~~~l~~----------------------------------------------- 172 (455) ..+++..++|.+|+ ++.... .+-++.+.. T Consensus 154 ~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:81 154 PEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred CCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 22456666666654 332211 111121110 Q ss_pred -----------------------EEEEeeeeEEEEEecCeEEEEEEeCC----------c--------------EEE--- Q lcl|NC_020844. 173 -----------------------MRMWEDAETILVLRPDDWERWRKNDA----------G--------------IWE--- 202 (455) Q Consensus 173 -----------------------v~~~E~~e~~~v~~~~~~~~~~~~~~----------g--------------~~~--- 202 (455) ..+.+...++.++.+...++...... | .|. T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~ 313 (714) T protein:vir:81 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGP 313 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecC Confidence 00000001111111110000000000 0 000 Q ss_pred -EeccC--CCCCCceeEEEEEeccc--CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcccccc Q lcl|NC_020844. 203 -VDEFG--RITIGVVPMVPFITGRR--KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEG 277 (455) Q Consensus 203 -~~~~g--~~~l~~vP~V~f~~~~~--~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~ 277 (455) +...+ +.+-+.+|+|||+.... .+-+.. -+.++.+.+...+...|-+.+++ .+.-+.+..|-...++.+. T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G---~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~~~ 388 (714) T protein:vir:81 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG---LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceee---hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHHHH Confidence 11122 34456799999864432 222221 22344444444444455566665 3333444445444433332 Q ss_pred Cccceeeccceeeec-cCCCCcc--ccceeEEeeCchhHHHHHHHHHHHHHHHHHhhh---hhccCCCcchhHHHHHHHH Q lcl|NC_020844. 278 SPKPLDTGPDTVLYA-PPNAEGQ--HGSWNFVEPATESLKFLAEQIESVSKEIRELGR---QPLTAQSGNLTRISAAQAA 351 (455) Q Consensus 278 ~~~~~~~g~~~~~~l-p~~~~~~--~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~---~~l~~~~~~~sa~a~~~~~ 351 (455) ....-.- +.++.+ |....+. ...+.... +..--..+.+.++...+.|..+++ .++...+++.||+++..+. T Consensus 389 ~e~~arp--~~vi~~~p~~~~~~~~~~~~~~~~-~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq 465 (714) T protein:vir:81 389 MEQIERP--DGIIKLNPVRKNQKSVADVFRVEQ-DFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLV 465 (714) T ss_pred HHhccCC--CCceeecccccccCCCCccccccC-CCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHH Confidence 2111111 122222 3211111 12233222 222334556667777777766643 2344445568999888877 Q ss_pred HHHHHHHHHHHHHHHHHHHH----HHHHHHHHcCCC---------CCc---eEEEEec----------------ccCCCC Q lcl|NC_020844. 352 SKGNSAVQQWAAGLKDALEN----ALYITNLWMDAP---------DTN---PEADVYT----------------DFRADT 399 (455) Q Consensus 352 ~~~~s~L~~~a~~~~~a~~~----~l~~~a~~~g~~---------~~~---~~v~v~~----------------df~~~~ 399 (455) .+....|..+-.|+..+..+ +|.++.+|++.. +.. -.+.+|. |+.... T Consensus 466 ~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~ 545 (714) T protein:vir:81 466 EQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAP 545 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEee Confidence 77666666666666665544 566777877531 111 0233331 111111 Q ss_pred C------CHHHHHHHHHHHHc-----CCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCC-C Q lcl|NC_020844. 400 G------EEQTPGYLLTMRQN-----GDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIE-E 455 (455) Q Consensus 400 ~------~~~~~~al~~~~~~-----G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~-~ 455 (455) . -.+...+++++.+. +.+....+++.+ + .-..++-.++|.+.....-. + T Consensus 546 ~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~------d-~p~~~el~~~ir~~~~~~~~~~ 606 (714) T protein:vir:81 546 VQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLL------D-VPQKQEFVERIRAALGTPKSPD 606 (714) T ss_pred ccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhc------C-CCCHHHHHHHHHHHcCCCCCcc Confidence 1 13445666666543 112222232222 1 12344555555444221110 0 No 88 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=99.18 E-value=8.1e-10 Score=70.42 Aligned_cols=426 Identities=11% Similarity=0.100 Sum_probs=189.6 Q ss_pred CCCCCCCcc-CHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCC-----CHHHHHHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQDPTKR-SADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHE-----QNKRYDLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~v~~~-~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e-----~~~~Y~~rl~ra~~~n~~~~~v~~~~g 74 (455) |++..-+-. ...+...+.. +...+.....+|+....-+-.+.+. .....+.+-.-.+.+|.++.+|+..+| T Consensus 8 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g 84 (714) T protein:vir:27 8 MATKNDNGATPRFSQRQLQA---LCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLG 84 (714) T ss_pred ccCCCCcchhHHHHHHHHHH---HHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHh Confidence 665532211 1233333332 3333444555554332211112232 223344555556679999999999999 Q ss_pred hhhcCCceecc-----C------CHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhcc Q lcl|NC_020844. 75 KPFGREVEVTE-----G------TGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAG 143 (455) Q Consensus 75 ~lf~k~p~i~~-----~------~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~ 143 (455) +--...+.+.- + .+.+..++..+- .-++.+.-...++..++.+|++|+=|....... + T Consensus 85 ~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~-~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~----------~ 153 (714) T protein:vir:27 85 MEAKTRTDLVVMSDEPDDETEKLAEAINAEFADAC-RLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPF----------G 153 (714) T ss_pred HHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHH-HhhchhHHHHHHHHHhhhcCcceEEeccccCCC----------C Confidence 99776665531 1 122344444333 245688888999999999999997765433211 1 Q ss_pred CCceEEEechhhhc-cceeec---cCCeeEEEE----------------------------------------------- Q lcl|NC_020844. 144 VRPYWSQVNHSAIL-EIQSER---QGAGERITY----------------------------------------------- 172 (455) Q Consensus 144 ~rPy~~~~~~~~ii-~w~~~~---~~~~~~l~~----------------------------------------------- 172 (455) ..+++..++|.+|+ ++.... .+-++.+.. T Consensus 154 ~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:27 154 PEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred CCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 22456666666654 332211 111121110 Q ss_pred -----------------------EEEEeeeeEEEEEecCeEEEEEEeCC----------c--------------EEE--- Q lcl|NC_020844. 173 -----------------------MRMWEDAETILVLRPDDWERWRKNDA----------G--------------IWE--- 202 (455) Q Consensus 173 -----------------------v~~~E~~e~~~v~~~~~~~~~~~~~~----------g--------------~~~--- 202 (455) ..+.+...++.++.+...++...... | .|. T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~ 313 (714) T protein:vir:27 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGP 313 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecC Confidence 00000001111111110000000000 0 000 Q ss_pred -EeccC--CCCCCceeEEEEEeccc--CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcccccc Q lcl|NC_020844. 203 -VDEFG--RITIGVVPMVPFITGRR--KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEG 277 (455) Q Consensus 203 -~~~~g--~~~l~~vP~V~f~~~~~--~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~ 277 (455) +...+ +.+-+.+|+|||+.... .+-+.. -+.++.+.+...+...|-+.+++ .+.-+.+..|-...++.+. T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G---~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~~~ 388 (714) T protein:vir:27 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG---LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceee---hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHHHH Confidence 11122 34456799999864432 222221 22344444444444455566665 3333444445444433332 Q ss_pred Cccceeeccceeeec-cCCCCcc--ccceeEEeeCchhHHHHHHHHHHHHHHHHHhhh---hhccCCCcchhHHHHHHHH Q lcl|NC_020844. 278 SPKPLDTGPDTVLYA-PPNAEGQ--HGSWNFVEPATESLKFLAEQIESVSKEIRELGR---QPLTAQSGNLTRISAAQAA 351 (455) Q Consensus 278 ~~~~~~~g~~~~~~l-p~~~~~~--~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~---~~l~~~~~~~sa~a~~~~~ 351 (455) ....-.- +.++.+ |....+. ...+.... +..--..+.+.++...+.|..+++ .++...+++.||+++..+. T Consensus 389 ~e~~arp--~~vi~~~p~~~~~~~~~~~~~~~~-~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq 465 (714) T protein:vir:27 389 MEQIERP--DGIIKLNPVRKNQKSVADVFRVEQ-DFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLV 465 (714) T ss_pred HHhccCC--CCceeecccccccCCCCccccccC-CCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHH Confidence 2111111 122222 3211111 12233222 222334556667777777766643 2344445568999888877 Q ss_pred HHHHHHHHHHHHHHHHHHHH----HHHHHHHHcCCC---------CCc---eEEEEec----------------ccCCCC Q lcl|NC_020844. 352 SKGNSAVQQWAAGLKDALEN----ALYITNLWMDAP---------DTN---PEADVYT----------------DFRADT 399 (455) Q Consensus 352 ~~~~s~L~~~a~~~~~a~~~----~l~~~a~~~g~~---------~~~---~~v~v~~----------------df~~~~ 399 (455) .+....|..+-.|+..+..+ +|.++.+|++.. +.. -.+.+|. |+.... T Consensus 466 ~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~ 545 (714) T protein:vir:27 466 EQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAP 545 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEee Confidence 77666666666666665544 566777877531 111 0233331 111111 Q ss_pred C------CHHHHHHHHHHHHc-----CCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCC-C Q lcl|NC_020844. 400 G------EEQTPGYLLTMRQN-----GDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIE-E 455 (455) Q Consensus 400 ~------~~~~~~al~~~~~~-----G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~-~ 455 (455) . -.+...+++++.+. +.+....+++.+ + .-..++-.++|.+.....-. + T Consensus 546 ~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~------d-~p~~~el~~~ir~~~~~~~~~~ 606 (714) T protein:vir:27 546 VQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLL------D-VPQKQEFVERIRAALGTPKSPD 606 (714) T ss_pred ccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhc------C-CCCHHHHHHHHHHHcCCCCCcc Confidence 1 13445666666543 112222232222 1 12344555555444221110 0 No 89 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=99.18 E-value=8.1e-10 Score=70.42 Aligned_cols=426 Identities=11% Similarity=0.100 Sum_probs=189.6 Q ss_pred CCCCCCCcc-CHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCC-----CHHHHHHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQDPTKR-SADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHE-----QNKRYDLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~v~~~-~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e-----~~~~Y~~rl~ra~~~n~~~~~v~~~~g 74 (455) |++..-+-. ...+...+.. +...+.....+|+....-+-.+.+. .....+.+-.-.+.+|.++.+|+..+| T Consensus 8 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g 84 (714) T protein:vir:99 8 MATKNDNGATPRFSQRQLQA---LCSDIDSQPKWRDAANKACAYYDGDQLPPEVLQVLKDRGQPMTIHNLIAPTVDGVLG 84 (714) T ss_pred ccCCCCcchhHHHHHHHHHH---HHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCcEEeccHHHHHHHHHh Confidence 665532211 1233333332 3333444555554332211112232 223344555556679999999999999 Q ss_pred hhhcCCceecc-----C------CHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhcc Q lcl|NC_020844. 75 KPFGREVEVTE-----G------TGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAG 143 (455) Q Consensus 75 ~lf~k~p~i~~-----~------~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~ 143 (455) +--...+.+.- + .+.+..++..+- .-++.+.-...++..++.+|++|+=|....... + T Consensus 85 ~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~-~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~----------~ 153 (714) T protein:vir:99 85 MEAKTRTDLVVMSDEPDDETEKLAEAINAEFADAC-RLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPF----------G 153 (714) T ss_pred HHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHH-HhhchhHHHHHHHHHhhhcCcceEEeccccCCC----------C Confidence 99776665531 1 122344444333 245688888999999999999997765433211 1 Q ss_pred CCceEEEechhhhc-cceeec---cCCeeEEEE----------------------------------------------- Q lcl|NC_020844. 144 VRPYWSQVNHSAIL-EIQSER---QGAGERITY----------------------------------------------- 172 (455) Q Consensus 144 ~rPy~~~~~~~~ii-~w~~~~---~~~~~~l~~----------------------------------------------- 172 (455) ..+++..++|.+|+ ++.... .+-++.+.. T Consensus 154 ~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:99 154 PEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred CCeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchh Confidence 22456666666654 332211 111121110 Q ss_pred -----------------------EEEEeeeeEEEEEecCeEEEEEEeCC----------c--------------EEE--- Q lcl|NC_020844. 173 -----------------------MRMWEDAETILVLRPDDWERWRKNDA----------G--------------IWE--- 202 (455) Q Consensus 173 -----------------------v~~~E~~e~~~v~~~~~~~~~~~~~~----------g--------------~~~--- 202 (455) ..+.+...++.++.+...++...... | .|. T Consensus 234 ~~~~~~~~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~ 313 (714) T protein:vir:99 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGP 313 (714) T ss_pred hhccccccccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecC Confidence 00000001111111110000000000 0 000 Q ss_pred -EeccC--CCCCCceeEEEEEeccc--CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcccccc Q lcl|NC_020844. 203 -VDEFG--RITIGVVPMVPFITGRR--KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEG 277 (455) Q Consensus 203 -~~~~g--~~~l~~vP~V~f~~~~~--~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~ 277 (455) +...+ +.+-+.+|+|||+.... .+-+.. -+.++.+.+...+...|-+.+++ .+.-+.+..|-...++.+. T Consensus 314 ~~L~~~~~p~p~~~fp~vp~~g~~~~~~g~~~G---~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~~~ 388 (714) T protein:vir:99 314 HFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYG---LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDNDL 388 (714) T ss_pred cccccCCCCCCCCceeEEEEeeeeeeccCceee---hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHHHH Confidence 11122 34456799999864432 222221 22344444444444455566665 3333444445444433332 Q ss_pred Cccceeeccceeeec-cCCCCcc--ccceeEEeeCchhHHHHHHHHHHHHHHHHHhhh---hhccCCCcchhHHHHHHHH Q lcl|NC_020844. 278 SPKPLDTGPDTVLYA-PPNAEGQ--HGSWNFVEPATESLKFLAEQIESVSKEIRELGR---QPLTAQSGNLTRISAAQAA 351 (455) Q Consensus 278 ~~~~~~~g~~~~~~l-p~~~~~~--~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~---~~l~~~~~~~sa~a~~~~~ 351 (455) ....-.- +.++.+ |....+. ...+.... +..--..+.+.++...+.|..+++ .++...+++.||+++..+. T Consensus 389 ~e~~arp--~~vi~~~p~~~~~~~~~~~~~~~~-~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq 465 (714) T protein:vir:99 389 MEQIERP--DGIIKLNPVRKNQKSVADVFRVEQ-DFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLV 465 (714) T ss_pred HHhccCC--CCceeecccccccCCCCccccccC-CCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHH Confidence 2111111 122222 3211111 12233222 222334556667777777766643 2344445568999888877 Q ss_pred HHHHHHHHHHHHHHHHHHHH----HHHHHHHHcCCC---------CCc---eEEEEec----------------ccCCCC Q lcl|NC_020844. 352 SKGNSAVQQWAAGLKDALEN----ALYITNLWMDAP---------DTN---PEADVYT----------------DFRADT 399 (455) Q Consensus 352 ~~~~s~L~~~a~~~~~a~~~----~l~~~a~~~g~~---------~~~---~~v~v~~----------------df~~~~ 399 (455) .+....|..+-.|+..+..+ +|.++.+|++.. +.. -.+.+|. |+.... T Consensus 466 ~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~ 545 (714) T protein:vir:99 466 EQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAP 545 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEee Confidence 77666666666666665544 566777877531 111 0233331 111111 Q ss_pred C------CHHHHHHHHHHHHc-----CCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCC-C Q lcl|NC_020844. 400 G------EEQTPGYLLTMRQN-----GDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIE-E 455 (455) Q Consensus 400 ~------~~~~~~al~~~~~~-----G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~-~ 455 (455) . -.+...+++++.+. +.+....+++.+ + .-..++-.++|.+.....-. + T Consensus 546 ~p~~~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~------d-~p~~~el~~~ir~~~~~~~~~~ 606 (714) T protein:vir:99 546 VQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLL------D-VPQKQEFVERIRAALGTPKSPD 606 (714) T ss_pred ccCchHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhc------C-CCCHHHHHHHHHHHcCCCCCcc Confidence 1 13445666666543 112222232222 1 12344555555444221110 0 No 90 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=99.12 E-value=1.5e-09 Score=69.00 Aligned_cols=434 Identities=11% Similarity=0.030 Sum_probs=185.0 Q ss_pred CCCCCCCccCHHHHHH---------HHHHHHHHHHhcC-HHHHHHhhhhc----------CCCCC---CCCHHHHHHHhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAM---------SDYLQTVDDVLEG-VTGMRRNFKRY----------VKQFP---HEQNKRYDLRKS 57 (455) Q Consensus 1 m~~~~v~~~~p~y~~~---------~~~w~~i~d~~~G-~~~~r~~g~~y----------Lpk~~---~e~~~~Y~~rl~ 57 (455) ||.+-|..+-+.=..+ +.+|+..++-..= +..+++...-| .|+-. +......+. T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~--- 81 (641) T protein:vir:94 5 MPTPIIEDKESAKRKLSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWRH--- 81 (641) T ss_pred CCcccccCCcchhhcCCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcccc--- Confidence 6665554333322221 2333333221100 11122222222 22221 111111211 Q ss_pred ccccCChHHHHHHHHhhhhhc----CCceec------cCCH---HHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEE Q lcl|NC_020844. 58 VAKMTNVFRDVLEGLASKPFG----REVEVT------EGTG---PSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIR 124 (455) Q Consensus 58 ra~~~n~~~~~v~~~~g~lf~----k~p~i~------~~~~---~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~l 124 (455) .++.+.+..+++.++..+.+ .++-+. ++.+ .++.++.+.-.+ +++.......+++++.+|.+++- T Consensus 82 -ki~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l~~-~~~~~~~~~~~~d~~~~g~~iv~ 159 (641) T protein:vir:94 82 -RINTGHTFEVVETLVAYFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKLEA-ASIRDIFETYVRNLVLYGVSTYR 159 (641) T ss_pred -cccchhHHHHHHHHhhHHhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHHhh-cchHHHHHHHHHHHhhcCceEEE Confidence 35666666666666555533 222221 1111 133444443323 34455556899999999999888 Q ss_pred EeeCCC----------Ccchhhh-----hHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeee---------- Q lcl|NC_020844. 125 VDYPAG----------QQFRNRE-----EERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDA---------- 179 (455) Q Consensus 125 Vd~p~~----------~~~~t~a-----d~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~---------- 179 (455) |+.... +...... +.......+.+..++|.+|+. +...+...-++++++++. T Consensus 160 ~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~---dps~~~~~~~f~~~r~t~~t~~~l~~eg 236 (641) T protein:vir:94 160 LGWDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWL---DTSGGKNTGTFVRLRHTREELHELVTSG 236 (641) T ss_pred eehhhHHHHhhhhhcccchhhcccccccceecccceeeEEecchhheee---cCCCCcccccceehhhhHHHHHHHHhcC Confidence 764310 0000000 000011123344445555431 111111111111222100 Q ss_pred ----eEEEE-------------------EecCeEEEEEE----------------eCCcEEEEeccCCCCCCceeEEEEE Q lcl|NC_020844. 180 ----ETILV-------------------LRPDDWERWRK----------------NDAGIWEVDEFGRITIGVVPMVPFI 220 (455) Q Consensus 180 ----e~~~v-------------------~~~~~~~~~~~----------------~~~g~~~~~~~g~~~l~~vP~V~f~ 220 (455) +.+.+ -....|++|+. ...|...+..++...++..||+.++ T Consensus 237 ~~~~d~v~~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g~~il~~~~~~~~d~~Pf~~~r 316 (641) T protein:vir:94 237 YYDLDLTQVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGSPFVTTT 316 (641) T ss_pred CCChhhcchhhcccccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEeCCEEeecccccccCcCCeEEec Confidence 00000 00111222211 1122223334444557788999887 Q ss_pred ecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccc Q lcl|NC_020844. 221 TGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQH 300 (455) Q Consensus 221 ~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~ 300 (455) +.....+. -|.+|..++...........-...+.++.+..|.+.+.. +.-.++..+..+++.++... .. T Consensus 317 ~~~~~~~~-YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~-----~~~~~~~~l~~~PG~ii~~~-----~~ 385 (641) T protein:vir:94 317 LLPDRDSV-YGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVE-----DGILKREDVKAKPGAVFKVA-----QH 385 (641) T ss_pred ceecCCcc-cCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeecc-----ccccccceeeccCCcceeeC-----CC Confidence 76666555 478888888888777777777777777888888876532 11123344677777776542 12 Q ss_pred cceeEEeeCchhHHHHHHHHHHHHHHHHHh-hhh-hc---cCCCc-chhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH Q lcl|NC_020844. 301 GSWNFVEPATESLKFLAEQIESVSKEIREL-GRQ-PL---TAQSG-NLTRISAAQAASKGNSAVQQWAAGLKD-ALENAL 373 (455) Q Consensus 301 ~~~~~le~~~~~l~~~~~~l~~~~~~m~~~-g~~-~l---~~~~~-~~sa~a~~~~~~~~~s~L~~~a~~~~~-a~~~~l 373 (455) +.+..+.+....+......++.++..+... +.. ++ ...++ +.||++.+.........|..+.+.|++ .+..+| T Consensus 386 ~~v~pl~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll 465 (641) T protein:vir:94 386 GSLQPIDMGRQDFVVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLL 465 (641) T ss_pred CcceeecCCccccchhHHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 346666544434444455566666655432 222 11 22222 468998888888888888888888875 444344 Q ss_pred ----HHHHHHcC-------------------CCCCceEEEEecccCCCCC-----CHHHHHHHHHHHHc-CCCC------ Q lcl|NC_020844. 374 ----YITNLWMD-------------------APDTNPEADVYTDFRADTG-----EEQTPGYLLTMRQN-GDIS------ 418 (455) Q Consensus 374 ----~~~a~~~g-------------------~~~~~~~v~v~~df~~~~~-----~~~~~~al~~~~~~-G~is------ 418 (455) .++.+..- .++.+. +.+.++.+... .++.++.+..+.+. |..+ T Consensus 466 ~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L--~~~~~iv~l~~~q~~~~~~~i~~l~~~~~~~a~~P~v~d~~ 543 (641) T protein:vir:94 466 NKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEYL--HYPYKFLALGANYVVERERMVTDLLQLLDISGRVPQIGQSL 543 (641) T ss_pred HHHHHHHHHhccchhhhhhhchhhhcccCCCCCccce--eeeeeEeecchhHHHHHHHHHHHHHHHHHHhhcChhhhhcC Confidence 33333211 111111 22222221111 12334444444443 2222 Q ss_pred --HHHHHHHHHhcCCCCC-------CCCHHHHHHHHHhhccc----------CCCC Q lcl|NC_020844. 419 --REGLWMEFQRLGELSD-------NFDPEEESERLISELPG----------GIEE 455 (455) Q Consensus 419 --~et~~~~l~~~~vl~~-------~~d~e~e~~ri~~e~~~----------~l~~ 455 (455) ...+...++..|+-.+ +..++....+.++..+. +.++ T Consensus 544 d~~~~~~~~~~~~g~~~p~~~ir~~~~~~~~~~~~~~~~q~~~~~~a~~~~~~~~~ 599 (641) T protein:vir:94 544 DYALILEDLLRQMRFTDPMRYIKKAEAPPAAPPIAPAEPGALPPEMMNSVGGGLND 599 (641) T ss_pred CHHHHHHHHHHHhCCCCchhhccCccCchhHHHHHHHHHHHHHHHHHHHHHhhhHH Confidence 1122333333332111 11111100011100000 1111 No 91 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=99.03 E-value=4.5e-09 Score=66.35 Aligned_cols=429 Identities=11% Similarity=0.053 Sum_probs=213.0 Q ss_pred CCCCCCCcc-------CHHHHHHHHHHHHHHHHhc-CHHHHHHhhhhcCCCCCCCCHHHHHH-----------Hhhcccc Q lcl|NC_020844. 1 MPEQDPTKR-------SADAEAMSDYLQTVDDVLE-GVTGMRRNFKRYVKQFPHEQNKRYDL-----------RKSVAKM 61 (455) Q Consensus 1 m~~~~v~~~-------~p~y~~~~~~w~~i~d~~~-G~~~~r~~g~~yLpk~~~e~~~~Y~~-----------rl~ra~~ 61 (455) |++.++... |.--.-.+.+|+..++... -...++++.+.|..+ .+....|.. ....+++ T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~--~~~~~y~~~~~~~~~~~~~~~~rs~~~ 80 (651) T protein:vir:80 3 LATTTTDKNRQTYDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLST--PEAQDYLRDQVLRSVGDVNADWRHKIT 80 (651) T ss_pred ccccccchhhhhhhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhccc--HHHHHhhccccccccCCCCCCCCcccc Confidence 776554442 2211223467777776542 234455555555443 111111211 1234578 Q ss_pred CChHHHHHHHHhhhhhcC----Cceec-----cCC------HHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEe Q lcl|NC_020844. 62 TNVFRDVLEGLASKPFGR----EVEVT-----EGT------GPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVD 126 (455) Q Consensus 62 ~n~~~~~v~~~~g~lf~k----~p~i~-----~~~------~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd 126 (455) +|.++.+|+.+...+++. +..+. +.. +.++.++.+- ....++......+...++.+|.+.+-|. T Consensus 81 ~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~-l~~~~~~~~~~~~~~d~l~~G~~i~kv~ 159 (651) T protein:vir:80 81 TGKAFEAIETIHAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDK-LTEGKFRAAYANFLRQLLITGNSVLALP 159 (651) T ss_pred ChhHHHHHHHHHHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHH-hhccCcHHHHHHHHHhhcccCceEEEEe Confidence 999999999888777542 22222 111 1134444321 1345688888899999999999988775 Q ss_pred eCCCCc---chhhh---------------hHHhccCCceEEEechhhhc-cceeecc-CCeeEEEEEEE--Ee------- Q lcl|NC_020844. 127 YPAGQQ---FRNRE---------------EERQAGVRPYWSQVNHSAIL-EIQSERQ-GAGERITYMRM--WE------- 177 (455) Q Consensus 127 ~p~~~~---~~t~a---------------d~~~~~~rPy~~~~~~~~ii-~w~~~~~-~~~~~l~~v~~--~E------- 177 (455) ....-. ..+.. ........|.+..++|.+++ +-....+ +..... +.++ .+ T Consensus 160 we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~~~d~~~v~-~~~~t~~~l~~l~~~ 238 (651) T protein:vir:80 160 WRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTDPNRGAFIR-KLTKTKADILNLLSE 238 (651) T ss_pred ecceeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHeeecCCCcCccccceee-eeeeeHHHHHHHHhc Confidence 431100 00000 01112245778888888876 2111111 111111 1110 00 Q ss_pred ----e------e-----------------------------eEEEEEecCeEEEEE---EeCCcEEE----------Eec Q lcl|NC_020844. 178 ----D------A-----------------------------ETILVLRPDDWERWR---KNDAGIWE----------VDE 205 (455) Q Consensus 178 ----~------~-----------------------------e~~~v~~~~~~~~~~---~~~~g~~~----------~~~ 205 (455) . + ..+.| |+.|. ..+.+.|. +.. T Consensus 239 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v-----~E~~~~~d~e~~~~~~~~v~~~g~~il~~ 313 (651) T protein:vir:80 239 GYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVEL-----LEYWGDIHLENKTYHDVVVTIMGNEVLRF 313 (651) T ss_pred ccccchhhHHHHhhhccccccCCccccccccCCCccccccccceEE-----EEEEEEeeccCCceEEEEEEEcCcEEecc Confidence 0 0 01111 12221 12222221 111 Q ss_pred cCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEe-c-CCCccccccCcccee Q lcl|NC_020844. 206 FGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSAN-G-VEPDTDSEGSPKPLD 283 (455) Q Consensus 206 ~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~-G-~~~~~~~~~~~~~~~ 283 (455) +....+...||+.++..+..+.+ -+.++...+++...........+.+.+..++.|+..+. + +. ++..+. T Consensus 314 ~~~~~~~~~Pf~~~~~~~~~~~~-yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~-------~~~~l~ 385 (651) T protein:vir:80 314 EQNPYWCGRPFVIGTYIPTARQP-YAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLL-------QPEDVY 385 (651) T ss_pred cccCCCCCCCeeeecceecCccc-cCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccc-------cHHHhh Confidence 12223456799988777666555 48889999999988888888888999999999998774 2 22 122234 Q ss_pred eccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhh--hccC----CCcchhHHHHHHHHHHHHHH Q lcl|NC_020844. 284 TGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQ--PLTA----QSGNLTRISAAQAASKGNSA 357 (455) Q Consensus 284 ~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~--~l~~----~~~~~sa~a~~~~~~~~~s~ 357 (455) .+++.++... ..+++..+++....+......++.++..|....+. ...+ ..++.||++.+......... T Consensus 386 ~~pg~vi~~~-----~~~~~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~ 460 (651) T protein:vir:80 386 TEPGKVFLVS-----DHGDLQPLANQSSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNR 460 (651) T ss_pred cCCCceEEec-----CCCCceeeccCcccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHH Confidence 4555665442 12356667765555555566778888777665332 2222 12457888888888888899 Q ss_pred HHHHHHHHHHH-----HHHHHHHHHHHcCCCC------Cc----eEE-----EEecccCCCCCC-------HHHHHHHHH Q lcl|NC_020844. 358 VQQWAAGLKDA-----LENALYITNLWMDAPD------TN----PEA-----DVYTDFRADTGE-------EQTPGYLLT 410 (455) Q Consensus 358 L~~~a~~~~~a-----~~~~l~~~a~~~g~~~------~~----~~v-----~v~~df~~~~~~-------~~~~~al~~ 410 (455) |..+.++|++. ++.+|.++.++...+. .. ..+ .++.+|....+. .+.++.+.. T Consensus 461 l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~l~~ 540 (651) T protein:vir:80 461 LSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIEDRLT 540 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeeeeeeccHHHHHHHHHHHHHHHH Confidence 99999998874 3456666666542210 00 000 122222221111 133444555 Q ss_pred HHHc-CCCC--------HHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 411 MRQN-GDIS--------REGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 411 ~~~~-G~is--------~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) +.+. +..+ ...+.+.++..|+-.++ .-+...+.+.+....+ T Consensus 541 ~~q~~~~~p~~~~~~~~~~~~~~l~~~~g~~~~~----~~l~~~~q~~~~~~~~ 590 (651) T protein:vir:80 541 FIQAVAQVPEMGQLVDYKRILVDLLQHWGFEEPE----AYLKQQDQQAPANPQE 590 (651) T ss_pred HHHhhccCCccchhhhHHHHHHHHHHHcCCCCcH----HhcCCCccchhhhhhH Confidence 4443 2222 12233334445552211 1111000111000000 No 92 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.00 E-value=6.7e-09 Score=65.38 Aligned_cols=412 Identities=10% Similarity=0.003 Sum_probs=187.0 Q ss_pred CCCCCCC-ccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCC--HHHHHH-Hhhcc--c--cCChHHHHHHHH Q lcl|NC_020844. 1 MPEQDPT-KRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQ--NKRYDL-RKSVA--K--MTNVFRDVLEGL 72 (455) Q Consensus 1 m~~~~v~-~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~--~~~Y~~-rl~ra--~--~~n~~~~~v~~~ 72 (455) |.+..+. ..-|+ .+.++ +.+...++|..........+.|..-.-+ ...... ...|| . =.++.+..|+.+ T Consensus 1 ~~~p~~~~~~~~~--~~~~~-~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~ 77 (533) T protein:vir:34 1 MKTPTIPTLLGPD--GMTSL-REYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLH 77 (533) T ss_pred CCCchhhhhhccc--ccchH-HHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 4322221 11111 11222 2223333332111111122444322111 111111 11111 1 267889999999 Q ss_pred hhhhhcCCceeccCC----------------HH----HHHHHHh----hccCC-CCHHHHHHHHHHHHHhhCcEEEEEee Q lcl|NC_020844. 73 ASKPFGREVEVTEGT----------------GP----SEDLLED----IDGRG-NNLTTFAGQLFFQGVADSISWIRVDY 127 (455) Q Consensus 73 ~g~lf~k~p~i~~~~----------------~~----l~~~~~d----~d~~G-~~l~~~~~~~~~~al~~G~~~~lVd~ 127 (455) +..+.+..+++...| .. ++.|.++ +|-.| .++.++.+.+.+..+..|-|++..-+ T Consensus 78 ~~nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~ 157 (533) T protein:vir:34 78 QDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATW 157 (533) T ss_pred HHHhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeee Confidence 999988766654211 12 3333332 34444 48999999999999999999998755 Q ss_pred CCCCcchhhhhHHhccCCc-eEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEecc Q lcl|NC_020844. 128 PAGQQFRNREEERQAGVRP-YWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEF 206 (455) Q Consensus 128 p~~~~~~t~ad~~~~~~rP-y~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~ 206 (455) .+.+... -| -+-+++|+.|=+......++ . +.. .++.=..-.+-.|.+++....+.. T Consensus 158 ~~~~g~~----------~~~~lq~ie~d~l~~~~~~~~~~-~-i~~-----GIe~d~~Gr~~aY~i~~~~~~~~~----- 215 (533) T protein:vir:34 158 DTSSSRL----------FRTQFRMVSPKRISNPNNTGDSR-N-CRA-----GVQINDSGAALGYYVSEDGYPGWM----- 215 (533) T ss_pred ccCCCCc----------cceEEEEechhhcCCCCCCCCCC-c-eEe-----eeEECCCCCeEEEEEeecCCCCcc----- Confidence 4332210 12 26788898886544322222 1 111 111000011223444443322211 Q ss_pred CCCCCCce------e---EEEEEecccCCCcccCcchHHHHHHHHHHHHH-hHHHHHHHHHHcCCceEEEe-cCCCcc-- Q lcl|NC_020844. 207 GRITIGVV------P---MVPFITGRRKGKKWVFHPPMKDALDLQVALFR-KECALEFAEMMTAFPMLSAN-GVEPDT-- 273 (455) Q Consensus 207 g~~~l~~v------P---~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~-~~sd~~~~l~~~~~p~~~~~-G~~~~~-- 273 (455) ...+..+ | |+-++ .....+-.-+.|.|..++..-..+-. ..+.+..+. .++.-..+|+ ...... T Consensus 216 -~~~~~~~~~~~~v~a~~VlH~f-~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~-i~A~~a~fi~~~~~~~~~~ 292 (533) T protein:vir:34 216 -PQKWTWIPRELPGGRASFIHVF-EPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAI-VKAMYAATIESELDTQSAM 292 (533) T ss_pred -ccccceeeeeeccChhHeeeec-cccCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHH-HhhhheeeeecCCCccccc Confidence 1112222 2 22122 33334444577777766553222221 123333333 3333344444 211100 Q ss_pred ----------------------ccccCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHH-h Q lcl|NC_020844. 274 ----------------------DSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRE-L 330 (455) Q Consensus 274 ----------------------~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~-~ 330 (455) ........+.++++++..|+.+ -+++|++++-.+ ..+...++.+...|.. + T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG-----e~i~~~~~~~p~-~~~~~f~~~~lr~iAagl 366 (533) T protein:vir:34 293 DFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPG-----DSLNLQTAQDTD-NGYSVFEQSLLRYIAAGL 366 (533) T ss_pred ccccCCCcccccccccccchhhhhccCcceeeccCceeeecCCC-----CeeeecCCCCCC-CCHHHHHHHHHHHHHhhc Confidence 0011123467899999988754 379999866322 2234444555555533 2 Q ss_pred --hhhhccCC--Ccc-hhHHHHHHHHHHHHHHHHH-HHHHH-HHHHHHHHHHHHHHcCCCC--CceE--EE------Eec Q lcl|NC_020844. 331 --GRQPLTAQ--SGN-LTRISAAQAASKGNSAVQQ-WAAGL-KDALENALYITNLWMDAPD--TNPE--AD------VYT 393 (455) Q Consensus 331 --g~~~l~~~--~~~-~sa~a~~~~~~~~~s~L~~-~a~~~-~~a~~~~l~~~a~~~g~~~--~~~~--v~------v~~ 393 (455) +...+.+. ..| .|+-+...++.......+. ++..| .-.++..|+ .|...|.-+ .+.. +. ++- T Consensus 367 Gi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~-~ail~G~i~~p~~~~~~~~~~~~~~~~~ 445 (533) T protein:vir:34 367 GVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLE-EAIVRRVVTLPSKARFSFQEARSAWGNC 445 (533) T ss_pred CCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHcCcccCCCccCCCchhhHHhhhce Confidence 22345444 234 4444444555444444433 22211 112222222 222334311 1100 00 011 Q ss_pred ccCCC----CCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcc----cCCCC Q lcl|NC_020844. 394 DFRAD----TGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELP----GGIEE 455 (455) Q Consensus 394 df~~~----~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~----~~l~~ 455 (455) +|... ..+..++++...++.+|+.|.+....+. ..||++.++.++.|.+ -||.- T Consensus 446 ~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~--------G~D~~ev~~q~a~e~~~~~~~gl~~ 507 (533) T protein:vir:34 446 DWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKR--------GDDYQEIFAQQVRETMERRAAGLKP 507 (533) T ss_pred eeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc--------CCCHHHHHHHHHHHHHHHHhcCCCC Confidence 22211 2246889999999999999999766542 3478888887777743 23322 No 93 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=98.92 E-value=1.5e-08 Score=63.53 Aligned_cols=432 Identities=12% Similarity=0.090 Sum_probs=186.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhh---hhcC----CCCCCCCHHHHHHHhhc----cccCChHHHHH Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNF---KRYV----KQFPHEQNKRYDLRKSV----AKMTNVFRDVL 69 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g---~~yL----pk~~~e~~~~Y~~rl~r----a~~~n~~~~~v 69 (455) |++.+- . -+...+.+ ++.+....+..|+.. +.|. -+++.+..+..+.|.+- ..-+|.++.+| T Consensus 1 m~~~~~-~---~~~~~~~~---~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v 73 (708) T protein:vir:10 1 MAETLE-K---KHERIMLR---FDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATEL 73 (708) T ss_pred CchhHH-H---HHHHHHHH---HHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHHH Confidence 885422 1 22222222 333333333334332 1232 13444444445444331 34589999999 Q ss_pred HHHhhhhhcCCceec------cC----CHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEE--eeCCCCcchhhh Q lcl|NC_020844. 70 EGLASKPFGREVEVT------EG----TGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRV--DYPAGQQFRNRE 137 (455) Q Consensus 70 ~~~~g~lf~k~p~i~------~~----~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lV--d~p~~~~~~t~a 137 (455) +..+|+--...+.+. .. .+.+..++..+- +-++.+.-+..++.+++.+|++|+-| ||-....+... T Consensus 74 ~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~-~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~d~~~~- 151 (708) T protein:vir:10 74 NRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADY-EETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDD- 151 (708) T ss_pred HHHHHHHHhCCcceEEEcCCCCchHHHHHHHHHHHHHHH-HhcCchHHHHHHHHhhhhcccceeeeeeccccccCCCCC- Confidence 999999876665552 11 123444444433 34668888999999999999999866 32111110000 Q ss_pred hHHhccCCceEEEech-hhh-ccceeecc---CCeeEE-------------------------------------EEEEE Q lcl|NC_020844. 138 EERQAGVRPYWSQVNH-SAI-LEIQSERQ---GAGERI-------------------------------------TYMRM 175 (455) Q Consensus 138 d~~~~~~rPy~~~~~~-~~i-i~w~~~~~---~~~~~l-------------------------------------~~v~~ 175 (455) . .+. |+...+.| .+| ++|..... +-++.+ -.+++ T Consensus 152 -~--~~i-~i~~~~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~d~v~v 227 (708) T protein:vir:10 152 -R--QRI-AIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYI 227 (708) T ss_pred -c--ccc-ceEEeecchhhcccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCCCceEE Confidence 0 011 11111211 222 12211110 000100 00111 Q ss_pred EeeeeE------E-EEEec--CeEEEEE-------------------------------EeCCcEEEEeccCCCCCCcee Q lcl|NC_020844. 176 WEDAET------I-LVLRP--DDWERWR-------------------------------KNDAGIWEVDEFGRITIGVVP 215 (455) Q Consensus 176 ~E~~e~------~-~v~~~--~~~~~~~-------------------------------~~~~g~~~~~~~g~~~l~~vP 215 (455) .|..++ + .+-+| +.+..|. ..-.|...+....+.+.+.+| T Consensus 228 ~ey~~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~~p~~~fP 307 (708) T protein:vir:10 228 AKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIP 307 (708) T ss_pred EEeeeEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCCCCCCcee Confidence 111111 0 00011 1111111 111111112234566788899 Q ss_pred EEEEEeccc--CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEe-----cCCCccccccCcccee----- Q lcl|NC_020844. 216 MVPFITGRR--KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSAN-----GVEPDTDSEGSPKPLD----- 283 (455) Q Consensus 216 ~V~f~~~~~--~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~-----G~~~~~~~~~~~~~~~----- 283 (455) +|||+.... ++.. ....-+.++.+.+..++...|-+.+++-.++..++++. |+...+......+... T Consensus 308 ~vP~~g~r~~~d~~~-~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 386 (708) T protein:vir:10 308 LIPVYGKRWFIDDIE-RVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLRE 386 (708) T ss_pred eEEEeeeeeccCCCc-ccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccchhhhcccc Confidence 999864322 2111 00111334444444555556666777666666554442 2222222111111100 Q ss_pred eccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhh--hccCCCcchhHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 284 TGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQ--PLTAQSGNLTRISAAQAASKGNSAVQQW 361 (455) Q Consensus 284 ~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~--~l~~~~~~~sa~a~~~~~~~~~s~L~~~ 361 (455) ++.......+ ....++.+++.. --..+.+.++...+.|..+++. .+.+..+|.||+++..+..+....|..+ T Consensus 387 ~~~~~G~~~~-----~~~~~~~~q~~~-~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~sn~SG~aI~~rq~qg~~~l~~~ 460 (708) T protein:vir:10 387 VRDKSGNIIA-----GATPAGYTQPAV-MNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIY 460 (708) T ss_pred cccccccccc-----ccCCccccCCcc-chHHHHHHHHHHHHHHHHHhCcChhHccCccchHHHHHHHHHHHHHHHHHHH Confidence 1111111111 111344455332 2234455566666677666432 2334455689999998888777777777 Q ss_pred HHHHHHHHHH----HHHHHHHHcCC---------CCCceEEEEec---------------------cc------CCCCCC Q lcl|NC_020844. 362 AAGLKDALEN----ALYITNLWMDA---------PDTNPEADVYT---------------------DF------RADTGE 401 (455) Q Consensus 362 a~~~~~a~~~----~l~~~a~~~g~---------~~~~~~v~v~~---------------------df------~~~~~~ 401 (455) -.|+..+... +|.++.+|++. ++..-.+.+|. |+ ...... T Consensus 461 ~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r 540 (708) T protein:vir:10 461 LDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARR 540 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchhHH Confidence 7777766554 66677777743 11111122221 11 111122 Q ss_pred HHHHHHHHHHHHcCCCCH-HH--HHHHHHhcCCCCCCCCHHHHHHHHHhhccc-CCCC Q lcl|NC_020844. 402 EQTPGYLLTMRQNGDISR-EG--LWMEFQRLGELSDNFDPEEESERLISELPG-GIEE 455 (455) Q Consensus 402 ~~~~~al~~~~~~G~is~-et--~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~-~l~~ 455 (455) .+.+++|+++........ .+ +...+ ...+ ..-..++-.++|..+.+. +..+ T Consensus 541 ~~~~~~l~qll~~~~p~~~~~~~~~~~~--l~~~-D~p~~~ei~erir~~~~~~~~~~ 595 (708) T protein:vir:10 541 DATVSVLTNVLSSMLPTDPMRPAIQGII--LDNI-DGEGLDDFKEYNRNQLLISGIAK 595 (708) T ss_pred HHHHHHHHHHHHhcCCCchhhHHHHHHH--HHhc-CCcChHHHHHHHHHhhccccccc Confidence 456677777766543211 11 11000 0011 111133444444443221 1111 No 94 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=98.88 E-value=2e-08 Score=62.79 Aligned_cols=389 Identities=12% Similarity=0.004 Sum_probs=192.2 Q ss_pred CCCCCCCcc---CHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhh Q lcl|NC_020844. 1 MPEQDPTKR---SADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPF 77 (455) Q Consensus 1 m~~~~v~~~---~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf 77 (455) ||.-++-.. .+.-......+...-+...+. .+.......|.+. +.+-..|+... + ..++...++.-...+. T Consensus 7 ~~~g~~~~~~~~~~~~~~~ia~~~~~~~~~~~~-~~~p~~~~il~~~-~~~~~~y~~m~-~---D~~i~s~l~~Rk~av~ 80 (491) T protein:vir:79 7 VSPTEFVKFGEPDKSLSSQIATRARSIDFFALG-MYLPNPDPVLKAL-GKDIRVYRELR-A---DAHVGGCVRRRKAAVK 80 (491) T ss_pred CCCCCcccccccchhHHHHHhhhcccccccccc-ccCcchhHHHhhc-cCCHHHHHHHh-h---ChHHHHHHHHHHHHHh Confidence 443332211 122222222222111111110 0000001122211 22345677655 3 5777788888888888 Q ss_pred cCCceec--cCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhh Q lcl|NC_020844. 78 GREVEVT--EGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSA 155 (455) Q Consensus 78 ~k~p~i~--~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ 155 (455) +.+++|. +.++...+++... ..+.+++.++..+. .++-||.+.+=+- T Consensus 81 ~~~w~i~~~~~~~~~a~~i~e~-l~~~~~~~~i~~~l-da~~~G~s~~Ei~----------------------------- 129 (491) T protein:vir:79 81 ALEWGLDRGKAKSRVAKSIADV-FADLDLSRIATEML-DAVLYGYQPMEIT----------------------------- 129 (491) T ss_pred CCCcEEecCCCCHHHHHHHHHH-HhcCCHHHHHHHHH-HhhhhcceeEEEE----------------------------- Confidence 8888884 2222222333222 12347899998886 5778887765432 Q ss_pred hccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCc-EEEEeccCCCCCCceeEEEEEecccCCCcccCcch Q lcl|NC_020844. 156 ILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAG-IWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPP 234 (455) Q Consensus 156 ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g-~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~p 234 (455) |+. .+|.+.+..+..+.. +.+. +++..--+++..+++ .-.. -.+.+ ||-++..+..+. ..+.+. T Consensus 130 ---w~~--~~g~~~~~~l~~r~~-~~f~-~d~~~~l~l~~~~~~~~g~~----lp~~k---~i~~~~~~~~g~-p~g~gL 194 (491) T protein:vir:79 130 ---WGK--VGNYIVPIDVVGKPA-DWFV-YDPENQLRFRSKEHWVQGEE----LPARK---FLVPRQEATYLN-PYGFPD 194 (491) T ss_pred ---Eee--cCCeeeEEeeeeecc-ccee-eccCCceEEeecCCCCCcee----ecCCC---eEEEEecCCCCC-cccchh Confidence 222 245555544443322 1111 111111112211110 0000 00122 333333343333 346666 Q ss_pred HHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcccccc---CccceeeccceeeeccCCCCccccceeEEeeCc- Q lcl|NC_020844. 235 MKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEG---SPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPAT- 310 (455) Q Consensus 235 L~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~---~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~- 310 (455) |..++..-+---....+...-++.-+.|+++.+=-....+++- ......+|.+.+..+|.+. +++|+|..+ T Consensus 195 l~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~~a~~~ek~~l~~al~~~~~~a~~viP~~~-----~ie~~ea~~~ 269 (491) T protein:vir:79 195 LSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSASDAETNLLLDRLEDMVQDAVAVIPDDS-----SIEIKEAAGK 269 (491) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHHHHHHhcCeEEEecCCc-----eeEEEeccCC Confidence 6676664443334567788888899999988872111122211 1223457888999999765 799999764 Q ss_pred -hhHHHHHHHHHHHHHHHHHh--hhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCce Q lcl|NC_020844. 311 -ESLKFLAEQIESVSKEIREL--GRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDTNP 387 (455) Q Consensus 311 -~~l~~~~~~l~~~~~~m~~~--g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~~ 387 (455) .+...+++.++...++|... |..+-...+ .|..............+...+..+.+++++++++++.|.+-+.... T Consensus 270 ~g~~~~y~~li~~~d~~Isk~iLGqtlTt~~~--gs~a~~~vh~~v~~~i~~~D~~~i~~tln~li~~l~~~N~~~~~~p 347 (491) T protein:vir:79 270 SGSADVYERLLHFCRGEVSIALLGQNQTTEAT--STRASAQAGLEVTDDIRDGDKAIVVEAMNMLIRWICDLNFDGAARP 347 (491) T ss_pred CCChhHHHHHHHHHHHHHHHHHhhhhhccCcc--cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcc Confidence 34556788888888888653 544333222 2223333444555667788888899999999999999877444334 Q ss_pred EEEEecccCCCCCCHHHHHHHHHHHHcCC-CCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcc--cCCCC Q lcl|NC_020844. 388 EADVYTDFRADTGEEQTPGYLLTMRQNGD-ISREGLWMEFQRLGELSDNFDPEEESERLISELP--GGIEE 455 (455) Q Consensus 388 ~v~v~~df~~~~~~~~~~~al~~~~~~G~-is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~--~~l~~ 455 (455) .+.+. .....+...++.+.++.+.|. |+.+-+. ++.|+-.+..+ ++....-..+.. +...+ T Consensus 348 ~f~~~---e~ee~~~~~a~~~~~L~~~G~~i~~~~~~---e~~Gip~~~~~-e~~~~~~~~~~~~~~~~~~ 411 (491) T protein:vir:79 348 VFDMW---EQEQVDEIQAGRDEKLTRAGARFTPAYFK---RAYNLQDGDLD-ERPLPVSAVDAVGAASFAE 411 (491) T ss_pred eEeec---CcCchhHHHHHHHHHHHhCCCccCHHHHH---HHhCCCCCCCC-ccccCcCcccccccccccc Confidence 43322 122222334667888888888 7765444 34566433322 111100000000 00000 No 95 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=98.88 E-value=2e-08 Score=62.77 Aligned_cols=407 Identities=10% Similarity=0.018 Sum_probs=187.5 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHH---HHHHHhhcc--c--cCChHHHHHHHHh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNK---RYDLRKSVA--K--MTNVFRDVLEGLA 73 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~---~Y~~rl~ra--~--~~n~~~~~v~~~~ 73 (455) |-.-.+..+++ .+..+.++..+.|..........|.|....-+.+ .......|| . =.++.+..|+.++ T Consensus 1 ~~~~~~~~~~~-----~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 75 (530) T protein:vir:38 1 MKIPSLVGPDG-----KTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQ 75 (530) T ss_pred CccceeecCcc-----ccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 44334444332 1112223333433211111223344533221111 111111111 1 2568888888888 Q ss_pred hhhhcCCceeccCC----------------HHH----HHHHHh----hccCC-CCHHHHHHHHHHHHHhhCcEEEEEeeC Q lcl|NC_020844. 74 SKPFGREVEVTEGT----------------GPS----EDLLED----IDGRG-NNLTTFAGQLFFQGVADSISWIRVDYP 128 (455) Q Consensus 74 g~lf~k~p~i~~~~----------------~~l----~~~~~d----~d~~G-~~l~~~~~~~~~~al~~G~~~~lVd~p 128 (455) ..+.+..++....| ..+ +.|.++ +|-.| .++.++.+.+.+..+..|-|++..-+. T Consensus 76 ~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~ 155 (530) T protein:vir:38 76 DHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWD 155 (530) T ss_pred HHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEeeec Confidence 88887765543211 223 333332 34444 499999999999999999999988654 Q ss_pred CCCcchhhhhHHhccCCc-eEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEe---cCeEEEEEEeCCcEEEEe Q lcl|NC_020844. 129 AGQQFRNREEERQAGVRP-YWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLR---PDDWERWRKNDAGIWEVD 204 (455) Q Consensus 129 ~~~~~~t~ad~~~~~~rP-y~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~---~~~~~~~~~~~~g~~~~~ 204 (455) +.+... -| -+.+++|+.|=+......++ . +.. . |.+-. +-.|.++...-.+.. T Consensus 156 ~~~g~~----------~~~~lq~ie~d~l~~~~~~~~~~-~-i~~-----G---Ie~d~~Gr~~aY~i~~~~~~~~~--- 212 (530) T protein:vir:38 156 SDSTRL----------FRTQFKMVSPKRVSNPNNIGDTR-N-CRA-----G---VKINDSGAALGYYVSDDGYPGWM--- 212 (530) T ss_pred cCCCCc----------cceEEEEechhhcCCCCCCCCCC-e-eEe-----e---eEECCCCceEEEEEeeccCCCcc--- Confidence 332110 12 26778888886544322222 1 111 1 11111 223344433211111 Q ss_pred ccCCCCCCcee---------EEEEEecccCCCcccCcchHHHHHHHHHHHHH-hHHHHHHHHHHcCCceEEEecCCCc-- Q lcl|NC_020844. 205 EFGRITIGVVP---------MVPFITGRRKGKKWVFHPPMKDALDLQVALFR-KECALEFAEMMTAFPMLSANGVEPD-- 272 (455) Q Consensus 205 ~~g~~~l~~vP---------~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~-~~sd~~~~l~~~~~p~~~~~G~~~~-- 272 (455) ...+..|| |+-+ ......+-.-+.|.|..++..-..+-. ..+.+..+ ..++.-..+|+.-.+. T Consensus 213 ---~~~~~~~~~~~~v~a~~vlH~-f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a-~i~A~~a~fi~~~~~~~~ 287 (530) T protein:vir:38 213 ---AQNWTYIPRELPGGRPSFIHV-FEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSA-IVKAMYAATIESELDTQS 287 (530) T ss_pred ---ccccceeeeeeccChhHeEee-ccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHH-HHhhhheeeeeccCCccc Confidence 11222222 2222 234444445677777776553222221 12333333 3333334455421111 Q ss_pred -----------cc------------cccCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 273 -----------TD------------SEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRE 329 (455) Q Consensus 273 -----------~~------------~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~ 329 (455) +. +..+...+.++++++..|+.+ -+++|++++-.+ ..+...++.+.+.|.. T Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG-----e~i~~~~p~~p~-~~~~~f~~~~lr~iaa 361 (530) T protein:vir:38 288 AMDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPG-----DSLNLQSAQDTD-NGYSTFEQSLLRYIAA 361 (530) T ss_pred cccccccCCcccccccccccchhhhhcccccceeccCceeeecCCC-----CeeeeeCCCCCC-CCHHHHHHHHHHHHHh Confidence 00 001223467889999888744 379999876322 2344445555555533 Q ss_pred -h--hhhhccCC--Ccc-hhHHHHHHHHHHHHHHHHHH-HHH-HHHHHHHHHHHHHHHcCCCCC--ceEEE--------E Q lcl|NC_020844. 330 -L--GRQPLTAQ--SGN-LTRISAAQAASKGNSAVQQW-AAG-LKDALENALYITNLWMDAPDT--NPEAD--------V 391 (455) Q Consensus 330 -~--g~~~l~~~--~~~-~sa~a~~~~~~~~~s~L~~~-a~~-~~~a~~~~l~~~a~~~g~~~~--~~~v~--------v 391 (455) + +...+.+. ..| .|+-+...++.......+.. +.. |.-.++..|+ .|...|.-+. ...+. + T Consensus 362 glGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~-~av~~G~i~~p~~~~~~~~~~~~a~~ 440 (530) T protein:vir:38 362 GLGVSYEQLSRNYSQMSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLE-EAIVRRVVTLPSKARFSFQEARTAWG 440 (530) T ss_pred hcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHH-HHHHcCCccCCCCCCCCchhhHHhhh Confidence 2 22335443 234 44444444444444333321 111 1122222222 1233343210 01000 0 Q ss_pred ecccCC----CCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcc----cCCCC Q lcl|NC_020844. 392 YTDFRA----DTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELP----GGIEE 455 (455) Q Consensus 392 ~~df~~----~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~----~~l~~ 455 (455) .-.|.. ...+..++++...++.+|+.|.+....+ + ..||++.++.++.|.. -||.- T Consensus 441 ~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~---~-----G~D~~~v~~q~a~e~~~~~~~Gl~~ 504 (530) T protein:vir:38 441 NANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAK---R-----GDDYQEIFAQQVRESMERRAAGLNP 504 (530) T ss_pred ceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHH---c-----CCCHHHHHHHHHHHHHHHHHcCCCC Confidence 112211 1224579999999999999999976553 2 3477888877777643 23433 No 96 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=98.88 E-value=2.1e-08 Score=62.62 Aligned_cols=420 Identities=12% Similarity=0.078 Sum_probs=199.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGR 79 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k 79 (455) |++ .+...+-.....+|+.+++--.- ...+++..+-.||..-..+...=..+.. -.|-+...+.++.++..+++- T Consensus 1 m~~---~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~-~~~dst~~~a~~~Laa~l~~~ 76 (536) T protein:vir:21 1 MAE---KRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQ-TPWQAVGARGLNNLASKLMLA 76 (536) T ss_pred Ccc---hhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccccccc-ccccccHHHHHHHHHHHHHHh Confidence 996 45566666777777776553322 4556777777777543211111111222 245566666666665555310 Q ss_pred --Cce----ec-cC---------C---HHHHHHHHhhc------cCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcch Q lcl|NC_020844. 80 --EVE----VT-EG---------T---GPSEDLLEDID------GRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFR 134 (455) Q Consensus 80 --~p~----i~-~~---------~---~~l~~~~~d~d------~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~ 134 (455) |+. +. .+ + ..++.|++.+. ...++++.-+..+.++...+|-+-++++-+..... T Consensus 77 ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~- 155 (536) T protein:vir:21 77 LFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNY- 155 (536) T ss_pred hcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCce- Confidence 210 00 00 0 12334443321 12356888888999999999999999874433221 Q ss_pred hhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEe------------e----------eeEEEEE-----ec Q lcl|NC_020844. 135 NREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWE------------D----------AETILVL-----RP 187 (455) Q Consensus 135 t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E------------~----------~e~~~v~-----~~ 187 (455) -++..|+-.++.- .+...-.+.+.+|-.+ . .+.+.++ ++ T Consensus 156 -----------~~f~~~pl~~~~v---~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~ 221 (536) T protein:vir:21 156 -----------NPMKLYRLSSYVV---QRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDE 221 (536) T ss_pred -----------eeEEEEEcCeEEE---eeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEec Confidence 1344555444332 2212222333222110 0 0112222 11 Q ss_pred C--eEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEE Q lcl|NC_020844. 188 D--DWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLS 265 (455) Q Consensus 188 ~--~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~ 265 (455) + .|.+|. ...|...+.++|.+++...|++++++....++.+ +.+|..+...-.......+-..-.....+..|.+. T Consensus 222 ~~~~~~~~~-e~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~Y-Grgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~l 299 (536) T protein:vir:21 222 DSGEYLRYE-EVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESY-GRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGL 299 (536) T ss_pred CCCcEEEEe-ccCCeeeccccCccccccCCeeeeeeeecCCCcc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc Confidence 1 233332 2233333445667778999999998888777776 66777666554444444443444444444444443 Q ss_pred E-ecCCCccccccCccc-eeeccceeeeccCCCCccccceeEEee-CchhHHHHHHHHHHHHHHHHHh-hhhhc-cCCCc Q lcl|NC_020844. 266 A-NGVEPDTDSEGSPKP-LDTGPDTVLYAPPNAEGQHGSWNFVEP-ATESLKFLAEQIESVSKEIREL-GRQPL-TAQSG 340 (455) Q Consensus 266 ~-~G~~~~~~~~~~~~~-~~~g~~~~~~lp~~~~~~~~~~~~le~-~~~~l~~~~~~l~~~~~~m~~~-g~~~l-~~~~~ 340 (455) + .+-. .++.. +..|++.+ +|.. .++...++. .+..+....+.|+++++.|... =..++ ..++. T Consensus 300 v~p~g~------~~~~~~~~~~~g~~--v~g~----~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~ 367 (536) T protein:vir:21 300 VNPAGI------TQPRRLTKAQTGDF--VTGR----PEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGE 367 (536) T ss_pred cCcccc------cchhhhccCCCcce--ecCC----cccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccCCCC Confidence 3 2111 11111 22233333 2211 123444443 3566777888899988888552 11222 34455 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHcCC----CCCceEEEEecccCCCCCCHHHHHHHHHH Q lcl|NC_020844. 341 NLTRISAAQAASKGNSAVQQWAAGLKDA-----LENALYITNLWMDA----PDTNPEADVYTDFRADTGEEQTPGYLLTM 411 (455) Q Consensus 341 ~~sa~a~~~~~~~~~s~L~~~a~~~~~a-----~~~~l~~~a~~~g~----~~~~~~v~v~~df~~~~~~~~~~~al~~~ 411 (455) ..||++...+.......|.-.-..++.- +++++.++-+ .|. +++.+.+++.. ....---.++++.++.. T Consensus 368 r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r-~g~lP~~p~~~v~~~~vs-~l~~l~r~~~~~~l~~~ 445 (536) T protein:vir:21 368 RVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQA-TQQIPELPKEAVEPTIST-GLEAIGRGQDLDKLERC 445 (536) T ss_pred CccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-CCCCCCCChhhccceEEe-cHHHHHHHHHHHHHHHH Confidence 6899999999888888777666665543 2233333312 232 22222333321 11100012333333333 Q ss_pred HH--cCC--------CCHHHHHHHH-HhcCCCC-CCCCHHHHHHHHHhhcccC--CCC Q lcl|NC_020844. 412 RQ--NGD--------ISREGLWMEF-QRLGELS-DNFDPEEESERLISELPGG--IEE 455 (455) Q Consensus 412 ~~--~G~--------is~et~~~~l-~~~~vl~-~~~d~e~e~~ri~~e~~~~--l~~ 455 (455) .+ +++ |....+.+.+ ...||.+ .-+..++|.+.+.+|.... +.. T Consensus 446 ~~~la~~~Pe~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~ 503 (536) T protein:vir:21 446 VTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDN 503 (536) T ss_pred HHHHHhhchhhhcccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHH Confidence 22 121 2222233322 3356633 2333455555555443211 111 No 97 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=98.87 E-value=2.3e-08 Score=62.49 Aligned_cols=434 Identities=12% Similarity=0.116 Sum_probs=192.5 Q ss_pred CCCCCCCccCHHHHHHH----------------HHHHHHHHHhcCHHHHHHhhh---hcCC--CCCCCCHHHHHHHhhcc Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMS----------------DYLQTVDDVLEGVTGMRRNFK---RYVK--QFPHEQNKRYDLRKSVA 59 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~----------------~~w~~i~d~~~G~~~~r~~g~---~yLp--k~~~e~~~~Y~~rl~ra 59 (455) |+-+.-.++-|+.-+.. ..-..++....+....|+... .|.- +++.+....-+.+-.-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~~g~p~ 80 (711) T protein:vir:10 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRTERELEQRPC 80 (711) T ss_pred CCcccccccccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCCCHHHHHHHHhcCCCc Confidence 65443333333221111 011122233344444443222 2321 12222223344555556 Q ss_pred ccCChHHHHHHHHhhhhhcCCceeccC-------------------------------CHHHHHHHHhhccCCCCHHHHH Q lcl|NC_020844. 60 KMTNVFRDVLEGLASKPFGREVEVTEG-------------------------------TGPSEDLLEDIDGRGNNLTTFA 108 (455) Q Consensus 60 ~~~n~~~~~v~~~~g~lf~k~p~i~~~-------------------------------~~~l~~~~~d~d~~G~~l~~~~ 108 (455) +.+|.++.+|++.+|+--...+.+.-. .+.+..++..+- .-++.+.-+ T Consensus 81 ~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~-~~~~~~~~~ 159 (711) T protein:vir:10 81 LVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIE-YNCDAETEY 159 (711) T ss_pred EEEcchHHHHHHHhhhHhhCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHH-HhcChhHHH Confidence 679999999999999998766655211 122444443332 345778888 Q ss_pred HHHHHHHHhhCcEEEEE--eeCCCCcchhhhhHHhccCCceEEEe-chhhhc-cceeecc---CCeeEEEEEEEEe---- Q lcl|NC_020844. 109 GQLFFQGVADSISWIRV--DYPAGQQFRNREEERQAGVRPYWSQV-NHSAIL-EIQSERQ---GAGERITYMRMWE---- 177 (455) Q Consensus 109 ~~~~~~al~~G~~~~lV--d~p~~~~~~t~ad~~~~~~rPy~~~~-~~~~ii-~w~~~~~---~~~~~l~~v~~~E---- 177 (455) ..++..++.+|++|+=| ||-....+ ...|-+..+ +|.+|+ ++..... +-++.+ ..++.. T Consensus 160 s~af~d~~~~G~G~~ev~~d~~~~d~~---------~~e~~i~~v~~p~~v~~Dp~a~~~D~sDar~~~-~~~~~~~~~~ 229 (711) T protein:vir:10 160 DIAFQGAVESGMGYLRVRSDYLADDSF---------EQDLIIEAIQNQFSVTIDPDAKKRDRSDMNWCL-IDDTMSKEKF 229 (711) T ss_pred HHHHHHhhhcCcceEEEEecccCCCCC---------CCCeEEeeecChhheeeCccccccChhhhccee-eeecCCHHHH Confidence 89999999999999754 43211111 011334344 355543 3322111 112221 111100 Q ss_pred ----------e--------------eeEEEE---E------------ecCeE---------------------------- Q lcl|NC_020844. 178 ----------D--------------AETILV---L------------RPDDW---------------------------- 190 (455) Q Consensus 178 ----------~--------------~e~~~v---~------------~~~~~---------------------------- 190 (455) . .+++|| | ..+.+ T Consensus 230 ~~~yp~~a~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 309 (711) T protein:vir:10 230 KALYPDATAEPVYEDSVADYDTWFTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKT 309 (711) T ss_pred HHhCCchhhhhhhcccccccCcccCcceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhce Confidence 0 011222 1 11100 Q ss_pred -EEEEEeCCcEEEEeccCCCCCCceeEEEEEeccc--CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEE- Q lcl|NC_020844. 191 -ERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRR--KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSA- 266 (455) Q Consensus 191 -~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~--~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~- 266 (455) ++|+....|...+.+..+.+.+.+|||||+.... ++. .....-+.++.+.+...+...|-+.+++..++.+..++ T Consensus 310 ~~v~~~~~~G~~~L~~~~p~~~~~~P~vp~~g~r~~~d~~-~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~ 388 (711) T protein:vir:10 310 FKTYWRKITGANVLEGPVEIPSTTIPVIPVWGKSLIIKKK-EIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGS 388 (711) T ss_pred eeEEEEEEecceeecCCCCCCCCcccEEEEeeeeeccccc-cccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceeec Confidence 1111111111122233345668899999864322 111 11122244566666667777788888888888876665 Q ss_pred ecCCCccccccCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhh---hhccCCCcchh Q lcl|NC_020844. 267 NGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGR---QPLTAQSGNLT 343 (455) Q Consensus 267 ~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~---~~l~~~~~~~s 343 (455) .|.-...++..+.. ...++.++.+-.++. ..+.+.+..+..-+ ..+...++...+.|..+++ .++...+++.| T Consensus 389 ~gai~~~~~~~~e~--~~~~~~vi~~~~~~~-~~~~~~~~~~~~~~-~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~S 464 (711) T protein:vir:10 389 EGNVEGREDEWEQA--NTKNFSLLTYIPQYQ-GDPGPRRQPPAAVP-AAELTLGQNSVEKIKSTMGMYDASLGAMGNETS 464 (711) T ss_pred CcccCChHHHHHhc--cccCCCeeEeccccc-CcCCccccCCCCCC-HHHHHHHHHHHHHHHHHhCCChHHcCCCccchH Confidence 34333222212111 112233333311221 22356665543332 3455666666666655543 22334455689 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHcCC---------CCCceEEEEecc---------------- Q lcl|NC_020844. 344 RISAAQAASKGNSAVQQWAAGLKDA----LENALYITNLWMDA---------PDTNPEADVYTD---------------- 394 (455) Q Consensus 344 a~a~~~~~~~~~s~L~~~a~~~~~a----~~~~l~~~a~~~g~---------~~~~~~v~v~~d---------------- 394 (455) |++...+..+....|..+..++..+ .+.+|.++.+++.. ++..-.|.||.. T Consensus 465 g~ai~~~q~qg~~~l~~~~dn~~~~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~ 544 (711) T protein:vir:10 465 GRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLN 544 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccc Confidence 9998888877766666666666655 45566677777632 111112334321 Q ss_pred ---cC--------CCCCCHHHHHHHHHHHHcCCCCHHH--HHHHHHhcCCCCCCCCHHHHHHHHHhhccc-CCCC Q lcl|NC_020844. 395 ---FR--------ADTGEEQTPGYLLTMRQNGDISREG--LWMEFQRLGELSDNFDPEEESERLISELPG-GIEE 455 (455) Q Consensus 395 ---f~--------~~~~~~~~~~al~~~~~~G~is~et--~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~-~l~~ 455 (455) |+ ......+.+.+|.++.. .++.-. +...+- ..+ +.-..++-.++|+...+. +..+ T Consensus 545 ~g~~Dv~i~~~p~~~s~r~~~~~~l~ql~~--~~p~~~~~~~~~il--~~~-d~p~~~el~e~lr~~~~~~~~~~ 614 (711) T protein:vir:10 545 VQKYDVVVTTGPAFATQRIEAAEAMIQFAQ--AVPSAAAVMADLIA--QNM-DWPGADVIAERLKKIVPPNVLSK 614 (711) T ss_pred eeeeEEEEeeccCchhHHHHHHHHHHHHHh--hcchhhhHHHHHHH--Hhc-CCCCHHHHHHHHHhhcCcccCcc Confidence 11 00111223344444433 222110 000000 000 111233333344333221 1111 No 98 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=98.87 E-value=2.4e-08 Score=62.36 Aligned_cols=412 Identities=10% Similarity=-0.004 Sum_probs=192.7 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCC-CCCHHHH-HHHhhcc----ccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFP-HEQNKRY-DLRKSVA----KMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~-~e~~~~Y-~~rl~ra----~~~n~~~~~v~~~~g 74 (455) |.=-+.....|......+.+..- |.|-..-+.. ...|... ......+ .....|| .=.++.+..|+.++. T Consensus 1 m~~~~~~~~a~~~~~~~~~~~~~---y~aa~~~~~~--~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~ 75 (495) T protein:vir:10 1 MNMTPSGYQSLASGLLVPVGASA---YEGASGGHRW--QDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVA 75 (495) T ss_pred CCcccccccccchhhhhHHHhhh---hhccccCccc--CCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHH Confidence 54334444445444444443322 3332222211 1223222 1111111 1111111 126788888888888 Q ss_pred hhhcCCceeccC----------CHHHHHHHHhhccCCC-CHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhcc Q lcl|NC_020844. 75 KPFGREVEVTEG----------TGPSEDLLEDIDGRGN-NLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAG 143 (455) Q Consensus 75 ~lf~k~p~i~~~----------~~~l~~~~~d~d~~G~-~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~ 143 (455) .+.+..++.+.. ...++.|.+++|-.|. +++.+.+.+.+..+..|-|++..-+.....+. T Consensus 76 ~vVG~Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~--------- 146 (495) T protein:vir:10 76 AAVGNGLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGL--------- 146 (495) T ss_pred hhcCCCcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCC--------- Confidence 887766555311 1335677788888875 99999999999999999999877554322110 Q ss_pred CCc-eEEEechhhhccceeec--cCCeeEEEE-EEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCcee---E Q lcl|NC_020844. 144 VRP-YWSQVNHSAILEIQSER--QGAGERITY-MRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVP---M 216 (455) Q Consensus 144 ~rP-y~~~~~~~~ii~w~~~~--~~~~~~l~~-v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP---~ 216 (455) .-| -+-+++|+.|-+..... .++.. +.. |.+.+. -.+-.|.+++...+..+. ......+..|| | T Consensus 147 ~~~~~lqliepd~l~~~~~~~~~~~g~~-i~~GIe~d~~------Gr~vaY~i~~~hpgd~~~--~~~~~~~~rvpA~~v 217 (495) T protein:vir:10 147 SVPLQLQIIEPDMLASDIPDETLPSGGY-VKGGIRFSNG------GKRKAYCFYRNHPAESSL--IGDPVDTVWIKAEHV 217 (495) T ss_pred ccceEEEEechhhcCCCCCCCCCCCCCE-EEeceEECCC------CceEEEEEeecCCCcccc--cccccceeeechhhe Confidence 012 27788999885432221 11111 111 111111 111123343333221111 11111233455 2 Q ss_pred EEEEecccCCCcccCcchHHHHHHHHHHHHHhH--HHHHHHHHHcCCceEEEecCCC-c---------cccccCccceee Q lcl|NC_020844. 217 VPFITGRRKGKKWVFHPPMKDALDLQVALFRKE--CALEFAEMMTAFPMLSANGVEP-D---------TDSEGSPKPLDT 284 (455) Q Consensus 217 V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~--sd~~~~l~~~~~p~~~~~G~~~-~---------~~~~~~~~~~~~ 284 (455) +-++ ... .+-.-+.|-|-.+.. +.++... +.+..+.- ++.-..+|+.-.+ . ..+........+ T Consensus 218 lH~f-~~r-~gQ~RGis~la~i~~--l~~l~~y~dael~~a~i-~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~l 292 (495) T protein:vir:10 218 LHVT-VLT-VRSDAGAPWFQLLLR--LNELDQYEDAELVRKKT-AALFAAFIQEATADSTGGPTIGQPKRSKGGKRITGL 292 (495) T ss_pred Eecc-ccC-CCcccCcchhHHHHH--HHHhhHHHHHHHHHHHH-hhhheeeeecCCCccccccccCccccccCcccceec Confidence 2222 222 233334554433322 2222222 33343333 3333455542111 1 111112234568 Q ss_pred ccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHH---hhhhhccCC--Ccc-hhHHHHHHHHHHHHHHH Q lcl|NC_020844. 285 GPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRE---LGRQPLTAQ--SGN-LTRISAAQAASKGNSAV 358 (455) Q Consensus 285 g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~---~g~~~l~~~--~~~-~sa~a~~~~~~~~~s~L 358 (455) +++++..|+.+ -+++|++++..+ ..+...++.+...|.. ++...+.+. +.| .|+-+...++......+ T Consensus 293 ~pG~i~~L~pG-----e~i~~~~p~~p~-~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~~~~ 366 (495) T protein:vir:10 293 NPGTLQYLQPG-----QEVKFSNPADVG-TTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRRLCQQV 366 (495) T ss_pred CCceeeecCCC-----CeeeeeCCCCCC-CCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHH Confidence 89999988743 379999876221 2234445555555543 222334443 234 44444445555555444 Q ss_pred H--HHHHHHHHH-HHHHHHHHHHHcCCCCC-ce-E---EEEecccCCC----CCCHHHHHHHHHHHHcCCCCHHHHHHHH Q lcl|NC_020844. 359 Q--QWAAGLKDA-LENALYITNLWMDAPDT-NP-E---ADVYTDFRAD----TGEEQTPGYLLTMRQNGDISREGLWMEF 426 (455) Q Consensus 359 ~--~~a~~~~~a-~~~~l~~~a~~~g~~~~-~~-~---v~v~~df~~~----~~~~~~~~al~~~~~~G~is~et~~~~l 426 (455) + .++..|-.- ++..|. .|...|.-.. +. . .-+.-+|... ..+..++++...++.+|+.|.+....+ T Consensus 367 q~~~~~~~~~~pi~~~~l~-~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~- 444 (495) T protein:vir:10 367 QHHMIIHQFCRPVGRWFMD-FAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAE- 444 (495) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHH- Confidence 3 233333332 222222 2333453110 00 0 0011233222 234688999999999999999976553 Q ss_pred HhcCCCCCCCCHHHHHHHHHhhcc----cCCC---C Q lcl|NC_020844. 427 QRLGELSDNFDPEEESERLISELP----GGIE---E 455 (455) Q Consensus 427 ~~~~vl~~~~d~e~e~~ri~~e~~----~~l~---~ 455 (455) + ..|+++.++.++.|.. -||- + T Consensus 445 --~-----G~D~~~v~~q~a~e~~~~~~~Gl~~~~~ 473 (495) T protein:vir:10 445 --R-----GYDMEELFDMISDANQLIDEYDLRLDSD 473 (495) T ss_pred --c-----CCCHHHHHHHHHHHHHHHHHcCCCCCCC Confidence 3 3477777777777643 2331 1 No 99 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=98.86 E-value=2.5e-08 Score=62.23 Aligned_cols=381 Identities=12% Similarity=0.017 Sum_probs=192.3 Q ss_pred CCCC-------CCC---ccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCC------CCCHHHHHHHhhccccCCh Q lcl|NC_020844. 1 MPEQ-------DPT---KRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFP------HEQNKRYDLRKSVAKMTNV 64 (455) Q Consensus 1 m~~~-------~v~---~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~------~e~~~~Y~~rl~ra~~~n~ 64 (455) |+.. -|. .+.|.....+.+ ..|.+... .-.++|+.. +.+...|+... + ..+ T Consensus 1 m~~~i~~~~g~p~~~~~~~~~~~~~ia~~-------~~~~~~~~--~~~~~~~~~~iLr~~~~~~~~y~~m~-~---D~~ 67 (491) T protein:vir:10 1 MSKGLWVSPTEFVTFGEPDKSLSSQIATR-------ARSIDFFA--LGMYLPNPDPVLKALGKDIRVYRELR-A---DAH 67 (491) T ss_pred CCCceeCCCCCccCcccCChHHHHHHHhh-------hccccccc--ccCCccchHHHHHhcCCCHHHHHHHh-h---ChH Confidence 4322 111 111222221111 11111100 001222211 12446777664 3 578 Q ss_pred HHHHHHHHhhhhhcCCceec--cCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhc Q lcl|NC_020844. 65 FRDVLEGLASKPFGREVEVT--EGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQA 142 (455) Q Consensus 65 ~~~~v~~~~g~lf~k~p~i~--~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~ 142 (455) +...++.....+.+.++.|. +.++...+++... ..+.+++.++..+. .++-||.+.+=+- T Consensus 68 i~s~l~~Rk~av~~~~w~i~~~~~~~~~~e~v~e~-l~~~~~~~~l~~~l-da~~~G~s~~Ei~---------------- 129 (491) T protein:vir:10 68 VGGCVRRRKAAVKALEWGLDRGKAKSRVAKSIADV-FADLDLSRIVTEML-DAVLYGYQPMEIT---------------- 129 (491) T ss_pred HHHHHHHHHHHHhCCCcEEecCCCCHHHHHHHHHH-HhcCCHHHHHHHHH-HhhhhcceeEEEE---------------- Confidence 88888888888888898885 2233333333322 12457999999887 6788887765331 Q ss_pred cCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCc-EEEEeccCCCCCCceeEEEEEe Q lcl|NC_020844. 143 GVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAG-IWEVDEFGRITIGVVPMVPFIT 221 (455) Q Consensus 143 ~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g-~~~~~~~g~~~l~~vP~V~f~~ 221 (455) |+. .+|.+.+..+..+.. ..+. ++++.--+++..+++ +-.. -.+.+ ||-++. T Consensus 130 ----------------w~~--~~g~~~~~~l~~r~~-~~f~-~d~~~~l~~~~~~~~~~g~~----l~~~k---~i~~~~ 182 (491) T protein:vir:10 130 ----------------WGK--VGNYIVPIDVVGKPA-DWFV-YDPENQLRFRSKDHWMQGEE----LPARK---FLVPRQ 182 (491) T ss_pred ----------------Eee--cCCeeEEEEeeeecc-ccee-eccCCceEEecCCCCCCcce----ecCCC---EEEEEe Confidence 222 244555544443322 1111 121111112211110 0000 01111 333333 Q ss_pred cccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcccccc---CccceeeccceeeeccCCCCc Q lcl|NC_020844. 222 GRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEG---SPKPLDTGPDTVLYAPPNAEG 298 (455) Q Consensus 222 ~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~---~~~~~~~g~~~~~~lp~~~~~ 298 (455) .+..+ -..+.+-|..++..-+--.....+...-++.-+.|+++.+--....+++- ......+|.+.+..+|.+. T Consensus 183 ~~~~~-~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~l~~al~~~~~~a~~viP~~~-- 259 (491) T protein:vir:10 183 EATYL-NPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSASDGEKNLLLDCLEDMVQDAVAVVPDDS-- 259 (491) T ss_pred cCCCC-CcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHHHHHHhcCcEEEecCCc-- Confidence 44333 33466666666665444444567778888899999998873121122221 1223557888899999765 Q ss_pred cccceeEEeeCc--hhHHHHHHHHHHHHHHHHHh--hhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 299 QHGSWNFVEPAT--ESLKFLAEQIESVSKEIREL--GRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALY 374 (455) Q Consensus 299 ~~~~~~~le~~~--~~l~~~~~~l~~~~~~m~~~--g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~ 374 (455) +++|+|..+ .+...+++.++...++|... |..+-...+ .|-.............+...+..++++++++++ T Consensus 260 ---~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~~~--gs~a~~~vh~~v~~di~~~D~~~i~~tln~li~ 334 (491) T protein:vir:10 260 ---SIEIKEAAGKTGSADVYERLLHFCRGEVSIALLGQNQTTEAT--STRASAQAGLEVTDDIRDGDKAVVSEAMNMLIR 334 (491) T ss_pred ---eeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhhhhcccCcc--cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 799999865 33456888888888888663 544333222 233333344445566677788889999999999 Q ss_pred HHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCC-CCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCC Q lcl|NC_020844. 375 ITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGD-ISREGLWMEFQRLGELSDNFDPEEESERLISELPGGI 453 (455) Q Consensus 375 ~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~-is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l 453 (455) +++.+.+-+.....|.+.. ....+...++.+.++.+.|. |+.+-+. ++.|+-.+..+ ++..-.......+. T Consensus 335 ~l~~~N~~~~~~p~f~~~~---~~e~~~~~a~~~~~L~~~G~~i~~~~i~---e~~Gip~~~~~--~~~~~~~~~~~~~~ 406 (491) T protein:vir:10 335 WICDLNFDGADRPVFDMWE---QEQVDEIQAGRDQKLTQAGARFTPAYFK---RAYNLQDGDLD--ERPLPVSAVDTVGA 406 (491) T ss_pred HHHHhcCCCCCcceEEecC---cCchhHHHHHHHHHHHhCCCcCCHHHHH---HHhCCCCCCcC--ccccccCCCCCccc Confidence 8888876444344454432 22233445777888888888 7765444 34566333321 11000000000000 Q ss_pred CC Q lcl|NC_020844. 454 EE 455 (455) Q Consensus 454 ~~ 455 (455) .. T Consensus 407 ~~ 408 (491) T protein:vir:10 407 AS 408 (491) T ss_pred cc Confidence 00 No 100 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=98.84 E-value=3e-08 Score=61.80 Aligned_cols=432 Identities=13% Similarity=0.066 Sum_probs=182.8 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhh---cCC--CCCCCCHHHHHHHhhccccCChHHHHHHHHhhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKR---YVK--QFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASK 75 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~---yLp--k~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~ 75 (455) ||++ ...|-. .+.+ ++.+......+|+.... |.- +++.+..+..+. ..|-+ +|.++.+|+..+|+ T Consensus 1 m~d~--~~~~~~---~~~~---~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~-q~rp~-~N~i~~~i~~v~g~ 70 (725) T protein:vir:77 1 MADN--ENRLES---ILSR---FDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTL-QYRGQ-FDVVRPVVRKLVSE 70 (725) T ss_pred CCch--HHHHHH---HHHH---HHHHHHhhHHHHHHHHHHHHhhCCCCCCHHHHHHHHh-cCCCc-cccHHHHHHHHHhh Confidence 9964 323332 2222 33344444444443322 211 222222222222 33444 59999999999998 Q ss_pred hhcCCceec-----c----CCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEE--eeCCCCcchhhhhHHhccC Q lcl|NC_020844. 76 PFGREVEVT-----E----GTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRV--DYPAGQQFRNREEERQAGV 144 (455) Q Consensus 76 lf~k~p~i~-----~----~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lV--d~p~~~~~~t~ad~~~~~~ 144 (455) --...+.+. . ..+.+..++..+- .-++.+.-...++..++.+|.+|+=| ||.....+.. + ..+ T Consensus 71 ~~~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~-~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~--~---~~i 144 (725) T protein:vir:77 71 MRQNPIDVLYRPKDGARPDAADVLMGMYRTDM-RHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSN--N---QVI 144 (725) T ss_pred HHhCCcceEEecCCccHHHHHHHHHHHHHHHH-HhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCC--c---eee Confidence 866655443 1 1223444444442 34677888889999999999999654 4422111100 0 000 Q ss_pred CceEEEechhhhc-cceeecc---------------------------------------C-------CeeEEEEEEEEe Q lcl|NC_020844. 145 RPYWSQVNHSAIL-EIQSERQ---------------------------------------G-------AGERITYMRMWE 177 (455) Q Consensus 145 rPy~~~~~~~~ii-~w~~~~~---------------------------------------~-------~~~~l~~v~~~E 177 (455) +.+....++.+|+ +|..... + ....+..+.+++ T Consensus 145 ~~~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~E~~~ 224 (725) T protein:vir:77 145 RREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYE 224 (725) T ss_pred EEeecccChhhceeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeEEEEEEE Confidence 0000000111110 1110000 0 000111111111 Q ss_pred --eee-EEEEE-ec--CeE-------------------------------EEEEEeCCcEEEEeccCCCCCCceeEEEEE Q lcl|NC_020844. 178 --DAE-TILVL-RP--DDW-------------------------------ERWRKNDAGIWEVDEFGRITIGVVPMVPFI 220 (455) Q Consensus 178 --~~e-~~~v~-~~--~~~-------------------------------~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~ 220 (455) .+. .+..+ ++ +.+ ++|.....|...+.+..+.+-+.+|||||+ T Consensus 225 r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~~~P~vP~~ 304 (725) T protein:vir:77 225 VVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVF 304 (725) T ss_pred EEEEeeEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCCccceEEEe Confidence 110 01011 00 111 111111223222334445566789999986 Q ss_pred eccc--CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEE-ecCC-CccccccCccceeeccceeeeccCCC Q lcl|NC_020844. 221 TGRR--KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSA-NGVE-PDTDSEGSPKPLDTGPDTVLYAPPNA 296 (455) Q Consensus 221 ~~~~--~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~-~G~~-~~~~~~~~~~~~~~g~~~~~~lp~~~ 296 (455) .... ++.+. ...-+-++.+...-.+...|-+.+++..+..-...+ .|.. .......+++...+.....+....++ T Consensus 305 g~r~~~~g~~~-~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~ 383 (725) T protein:vir:77 305 GEWGFVEDKEV-YEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENSGD 383 (725) T ss_pred eeeeccCCccc-ccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHHHHHhccCCceecccccccCCCc Confidence 4332 22211 112234555555555555666666665443322221 1111 11000011122211111111111111 Q ss_pred CccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhh---hhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-- Q lcl|NC_020844. 297 EGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGR---QPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALEN-- 371 (455) Q Consensus 297 ~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~---~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~-- 371 (455) . ..+.+..+++.. --..+.+.|+...+.|..+++ .++...+++.||.++..+..+....|..+-.|+..+... T Consensus 384 ~-~~~~i~~~~~~~-lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g 461 (725) T protein:vir:77 384 L-PTQPLAYYENPE-VPQANAYMLEAATSAVKEVATLGVDTEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATAMRRDG 461 (725) T ss_pred c-cccCccccCCCC-chHHHHHHHHHHHHHHHHHhCCCHHHhCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 112233333222 223455667777777766533 334445556899999998888777777777777776665 Q ss_pred --HHHHHHHHcC---------CCCCceEEEEec------------------ccCCC--C------CCHHHHHHHHHHHHc Q lcl|NC_020844. 372 --ALYITNLWMD---------APDTNPEADVYT------------------DFRAD--T------GEEQTPGYLLTMRQN 414 (455) Q Consensus 372 --~l~~~a~~~g---------~~~~~~~v~v~~------------------df~~~--~------~~~~~~~al~~~~~~ 414 (455) +|.++.++++ .++..-.+.+|. .|+.. . ...+.+.+++++..+ T Consensus 462 ~~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~ 541 (725) T protein:vir:77 462 EIYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGK 541 (725) T ss_pred HHHHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHHHHh Confidence 5677788873 222222344441 13211 0 112345555555543 Q ss_pred CC-CCH---HHHHHHHHhcCCCCCCCCHHHHHHHHHhhccc-CCCC Q lcl|NC_020844. 415 GD-ISR---EGLWMEFQRLGELSDNFDPEEESERLISELPG-GIEE 455 (455) Q Consensus 415 G~-is~---et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~-~l~~ 455 (455) .- +.. .++...+ .+++ .--.++..+++..+.+. .+.+ T Consensus 542 ~~~~~~~~~~~l~~~~---~l~d-~~~~~e~~erirkq~~~~~~~q 583 (725) T protein:vir:77 542 TPQGTPEYQLLLLQYF---TLLD-GKGVEMMRDYANKQLIQMGVKK 583 (725) T ss_pred ccccchhHHHHHHHhh---cccc-chHHHHHHHHHHhhhhhhhccC Confidence 21 111 1111111 1111 11124555666665332 2222 No 101 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=98.82 E-value=3.7e-08 Score=61.32 Aligned_cols=410 Identities=10% Similarity=0.056 Sum_probs=201.1 Q ss_pred CC--CCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHH---HHHH-Hhhcc--c--cCChHHHHHH Q lcl|NC_020844. 1 MP--EQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNK---RYDL-RKSVA--K--MTNVFRDVLE 70 (455) Q Consensus 1 m~--~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~---~Y~~-rl~ra--~--~~n~~~~~v~ 70 (455) |. ++-|....|...+...........|.|...-|.. ..-|.. .-.+. .+.. ...|| . =.++.+..|+ T Consensus 1 mn~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~--~~~~~~-~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~ 77 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTH--KARREN-RTADQLSQYGAVSLREQARYLDNNHDLVIGVFD 77 (502) T ss_pred CchHhhHHhhcChHHHHHHHhhHHHHhhccccCccccc--CCCCCC-CChHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Confidence 54 3347788898888777777766666654332222 122211 11111 1111 11111 1 2568888888 Q ss_pred HHhhhhhcC-Ccee----ccC--------C----HHHHHHHHhhccCCC-CHHHHHHHHHHHHHhhCcEEEEEeeCCCCc Q lcl|NC_020844. 71 GLASKPFGR-EVEV----TEG--------T----GPSEDLLEDIDGRGN-NLTTFAGQLFFQGVADSISWIRVDYPAGQQ 132 (455) Q Consensus 71 ~~~g~lf~k-~p~i----~~~--------~----~~l~~~~~d~d~~G~-~l~~~~~~~~~~al~~G~~~~lVd~p~~~~ 132 (455) .++..+.+. .+++ ... . ..++.|.+++|-.|. +++++.+.+.+..+..|-+++..-+.+... T Consensus 78 ~~~~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~ 157 (502) T protein:vir:79 78 KLEERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINS 157 (502) T ss_pred HHHHhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCc Confidence 888888764 2222 111 1 235667778888875 999999999999999999999886544221 Q ss_pred chhhhhHHhccCCc-eEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEe---cCeEEEEEEeCCcEEEEeccCC Q lcl|NC_020844. 133 FRNREEERQAGVRP-YWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLR---PDDWERWRKNDAGIWEVDEFGR 208 (455) Q Consensus 133 ~~t~ad~~~~~~rP-y~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~---~~~~~~~~~~~~g~~~~~~~g~ 208 (455) .. ....-| -+-+++|+.|=+-. .++.. + +.. |++-. +-.|.+++...+. ... T Consensus 158 ~~------~g~~~~l~lq~iepd~l~~~~---~~~~~-i-----~~G---Ve~d~~Gr~~aY~i~~~hPgd------~~~ 213 (502) T protein:vir:79 158 LT------PSAGVHFWLEALEPDFIPMTS---DESNR-L-----NQG---VFVDDWGRPEKYLVYKSRPVS------GRQ 213 (502) T ss_pred cC------CCcccceEEEEecchhcCCCC---CCCCe-e-----Eee---eEECCCCceEEEEEeecCCCC------Ccc Confidence 10 001112 36778888874322 12211 1 111 22111 1123344332211 112 Q ss_pred CCCCcee---EEEEEecccCCCcccCcchHHHHHHHHHHHHH-hHHHHHHHHHHcCCceEEEecCCCcc-----ccccCc Q lcl|NC_020844. 209 ITIGVVP---MVPFITGRRKGKKWVFHPPMKDALDLQVALFR-KECALEFAEMMTAFPMLSANGVEPDT-----DSEGSP 279 (455) Q Consensus 209 ~~l~~vP---~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~-~~sd~~~~l~~~~~p~~~~~G~~~~~-----~~~~~~ 279 (455) ..+..|| |+-++ .....+-.-+.|.|..++..-..+-. ..+.+..+.-.+.+...+-++..... .+.... T Consensus 214 ~~~~rvpA~~vlH~f-~~~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~ 292 (502) T protein:vir:79 214 METKEVDAERMLHLK-FVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDGNGSKENE 292 (502) T ss_pred cceeEechhheEEee-cccCCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcccccccCCCCCcc Confidence 2344566 44333 33334445577888776653322211 12344444444444444333222111 122233 Q ss_pred cceeeccceeee-ccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHH-hh--hhhccCC-Ccc-hhHHHHHHHHHH Q lcl|NC_020844. 280 KPLDTGPDTVLY-APPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRE-LG--RQPLTAQ-SGN-LTRISAAQAASK 353 (455) Q Consensus 280 ~~~~~g~~~~~~-lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~-~g--~~~l~~~-~~~-~sa~a~~~~~~~ 353 (455) ..+.+++++++. |+.+ -+++|++++..+ ..+...++.+..+|.. +| ...+.+. ++| .|+-+...++.. T Consensus 293 ~~~~l~pG~i~~~L~pG-----e~i~~~~p~~p~-~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~nySs~R~~~~e~~r 366 (502) T protein:vir:79 293 RELTIQPGIIYDDLKPG-----EEIGMVKSDRPN-PNLETFRNGQLRAVAAGSRLSFSSTARNYNGTYSAQRQELVESTD 366 (502) T ss_pred ccccccCCccccccCCC-----ceeeeeCCCCCC-CCHHHHHHHHHHHHHhhcCCCHHHHhccccchHHHHHHHHHHHHH Confidence 456788887654 6533 379998865322 2345555555555543 22 2334443 223 333333344444 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHH---HHHHcCCCC--C--ceEEEEecccCCC----CCCHHHHHHHHHHHHcCCCCHHH Q lcl|NC_020844. 354 GNSAVQQWAAGLKDALEN-ALYI---TNLWMDAPD--T--NPEADVYTDFRAD----TGEEQTPGYLLTMRQNGDISREG 421 (455) Q Consensus 354 ~~s~L~~~a~~~~~a~~~-~l~~---~a~~~g~~~--~--~~~v~v~~df~~~----~~~~~~~~al~~~~~~G~is~et 421 (455) .....+. .|...+-+ +++. .|...|.-+ . ....-+.-+|... ..+..++++...++.+|+.|++. T Consensus 367 ~~~~~q~---~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~ 443 (502) T protein:vir:79 367 GYLILQD---WFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESD 443 (502) T ss_pred HHHHHHH---HHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHHH Confidence 3333332 22222222 2221 223334311 0 0000111223222 23468899999999999999997 Q ss_pred HHHHHHhcCCCCCCCCHHHHHHHHHhhcc----cCCCC Q lcl|NC_020844. 422 LWMEFQRLGELSDNFDPEEESERLISELP----GGIEE 455 (455) Q Consensus 422 ~~~~l~~~~vl~~~~d~e~e~~ri~~e~~----~~l~~ 455 (455) ...+. ..||++.++.++.|.+ -||.- T Consensus 444 ~~a~~--------G~D~~~v~~q~a~e~~~~~~~Gl~~ 473 (502) T protein:vir:79 444 WVRAG--------GRNPDDVKRRRKAEIDENRKLDLVF 473 (502) T ss_pred HHHHc--------CCCHHHHHHHHHHHHHHHHHcCCCC Confidence 66542 3478888887777743 12211 No 102 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=98.78 E-value=5.1e-08 Score=60.58 Aligned_cols=389 Identities=11% Similarity=0.035 Sum_probs=186.2 Q ss_pred CCCC------CCC---ccCHHHHHHHHHHHHHHHHhc-----C------HHHHHHhhhhcCCCCCCCCHHHHHHHhhccc Q lcl|NC_020844. 1 MPEQ------DPT---KRSADAEAMSDYLQTVDDVLE-----G------VTGMRRNFKRYVKQFPHEQNKRYDLRKSVAK 60 (455) Q Consensus 1 m~~~------~v~---~~~p~y~~~~~~w~~i~d~~~-----G------~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~ 60 (455) |+.- .|. .+.|. .+...-+++.+. | -..+|++..-.+++ .-+-|+....| T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~----~~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~~gd~~~----~~~L~e~m~e~-- 70 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQLREPQ----TSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQA----QAELFMDMEER-- 70 (526) T ss_pred CCeeECCCCCccccccccchh----hhhhhhhhhhhcccCcCCCCHHHHHHHHHhhhCCCHHH----HHHHHHHHHhh-- Confidence 4321 010 01111 111111122111 1 01122221111110 01123444433 Q ss_pred cCChHHHHHHHHhhhhhcCCceec---cCCH---HHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcch Q lcl|NC_020844. 61 MTNVFRDVLEGLASKPFGREVEVT---EGTG---PSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFR 134 (455) Q Consensus 61 ~~n~~~~~v~~~~g~lf~k~p~i~---~~~~---~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~ 134 (455) ..++.-.++.-...+.+.+++|. ...+ ...++++..-....+++.++..+. .|+-||.+.+=+- T Consensus 71 -D~~i~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-da~~~G~s~~Eiv-------- 140 (526) T protein:vir:99 71 -DAHLFAEMSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIELE-------- 140 (526) T ss_pred -ChHHHHHHHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcccCHHHHHHHHH-HhhhhcceeEEEE-------- Confidence 56777778888888888888874 1111 222333322212235888888887 5778886655331 Q ss_pred hhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCce Q lcl|NC_020844. 135 NREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVV 214 (455) Q Consensus 135 t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~v 214 (455) |+. .+|.+.+..+..+.. ..+.+...+..++ ...+.+..+...- T Consensus 141 ------------------------w~~--~~g~~~~~~l~~r~~-~~f~~~~~~~~~l---------~~~~~~~~g~~l~ 184 (526) T protein:vir:99 141 ------------------------WAL--QGREWMPLAFHHRPQ-SWFQLNPEDQNEL---------RLRDNSPAGEALQ 184 (526) T ss_pred ------------------------Eee--cCCceeEEEeeeecc-cceeeccCCCcEE---------EecCCCCCceeec Confidence 322 244454444333321 1111111111111 1111111111101 Q ss_pred e--EEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEe---cCCCccccccCccceeecccee Q lcl|NC_020844. 215 P--MVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSAN---GVEPDTDSEGSPKPLDTGPDTV 289 (455) Q Consensus 215 P--~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~---G~~~~~~~~~~~~~~~~g~~~~ 289 (455) | +|.+...+. .+...+.+.|..++..-+--.....+...-++.-+.|+++.+ |.+.++..........+|.+.+ T Consensus 185 ~~k~i~~~~~~~-~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~L~~av~~i~~d~~ 263 (526) T protein:vir:99 185 PFGWIIHRPRAR-SGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPGTADEEKATLLRAVTGLGHAAA 263 (526) T ss_pred CCCeEEEeecCC-cCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCCCCHHHHHHHHHHHHHHhhCcE Confidence 1 222232333 333456666667765544333466778888888999999987 2222221122222355788899 Q ss_pred eeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHh--hhhhccCCC--cchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 290 LYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIREL--GRQPLTAQS--GNLTRISAAQAASKGNSAVQQWAAGL 365 (455) Q Consensus 290 ~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~--g~~~l~~~~--~~~sa~a~~~~~~~~~s~L~~~a~~~ 365 (455) ..+|.+. .++|++..+.+-..++..++...++|..+ |..+-...+ +.-|-.............+..-+..+ T Consensus 264 ~iiP~~~-----~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i 338 (526) T protein:vir:99 264 GIIPETM-----AIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHNEVRHDLLASDARQL 338 (526) T ss_pred EEecCCc-----eeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhHHHHHHHHHHHHHHHHHHHH Confidence 9999765 69999987767677888889999998663 544432211 11232233344455566777888899 Q ss_pred HHHHH-HHHHHHHHHcCCCCCce--EEEEecccCCCCCCHHHHHHHHHHHHcCC-CCHHHHHHHHHhcCCCCCCCCHHHH Q lcl|NC_020844. 366 KDALE-NALYITNLWMDAPDTNP--EADVYTDFRADTGEEQTPGYLLTMRQNGD-ISREGLWMEFQRLGELSDNFDPEEE 441 (455) Q Consensus 366 ~~a~~-~~l~~~a~~~g~~~~~~--~v~v~~df~~~~~~~~~~~al~~~~~~G~-is~et~~~~l~~~~vl~~~~d~e~e 441 (455) .++++ ++++.++.|-+-..... ...+..++.........++.+.++...|. ||.+.+.+.+ |+-.+. +.|+. T Consensus 339 ~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~---Gip~~~-~~e~~ 414 (526) T protein:vir:99 339 AATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYDKL---GIPQPA-KNEPV 414 (526) T ss_pred HHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCccCHHHHHHHh---CCCCCC-Ccccc Confidence 99996 59999988854322111 11222222222112346778888989987 8887665544 552221 11111 Q ss_pred HHHHHhhcccCCCC Q lcl|NC_020844. 442 SERLISELPGGIEE 455 (455) Q Consensus 442 ~~ri~~e~~~~l~~ 455 (455) +.......+..... T Consensus 415 l~~~~~~~~~~~~~ 428 (526) T protein:vir:99 415 LRSAAQPAILSRQH 428 (526) T ss_pred cCCCCCCccccccc Confidence 11110000000000 No 103 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=98.78 E-value=5.2e-08 Score=60.53 Aligned_cols=431 Identities=13% Similarity=0.086 Sum_probs=182.7 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhh---hhcC--C--CCCCCCHHHHHHHhh----ccccCChHHHHH Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNF---KRYV--K--QFPHEQNKRYDLRKS----VAKMTNVFRDVL 69 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g---~~yL--p--k~~~e~~~~Y~~rl~----ra~~~n~~~~~v 69 (455) |++++- -.+... +..++.........|... +.|. . +++.+..+..+.+-+ -.+.+|.++.+| T Consensus 1 m~e~~~----~~~~~~---~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v 73 (706) T protein:vir:10 1 MAESRQ----KQHERV---MLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVATEL 73 (706) T ss_pred CCcchH----HHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHHHH Confidence 996422 122222 233444444444433332 2233 1 344444444443322 246689999999 Q ss_pred HHHhhhhhcCCceec------cC----CHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEE--eeCCCCcchhhh Q lcl|NC_020844. 70 EGLASKPFGREVEVT------EG----TGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRV--DYPAGQQFRNRE 137 (455) Q Consensus 70 ~~~~g~lf~k~p~i~------~~----~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lV--d~p~~~~~~t~a 137 (455) +..+|+--...+.+. .. .+.+..++..+- .-++.+.-+..++.+++.+|++|+=| ||...+.+.+.- T Consensus 74 ~~v~g~~~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~-~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~~d~~~~~ 152 (706) T protein:vir:10 74 NRIISEYRNNRISVKFRPGDNAASEELANKLNGLFRADY-EETDGGEACDNAFDDAATGGFGCFRLTTSFVNEYDPMDER 152 (706) T ss_pred HHHhhHHHhCCCceEEecCCCCchHHHHHHHHHHHHHHH-HhcCchHHHHHHHHHHhhcCcceEEeeeccccccCCCCCC Confidence 999999876665442 11 123445555443 34578888999999999999999755 433222111100 Q ss_pred hHHhccCCceEE-Eechh-hh-ccceeecc---CCeeEEEE--------------------------------------- Q lcl|NC_020844. 138 EERQAGVRPYWS-QVNHS-AI-LEIQSERQ---GAGERITY--------------------------------------- 172 (455) Q Consensus 138 d~~~~~~rPy~~-~~~~~-~i-i~w~~~~~---~~~~~l~~--------------------------------------- 172 (455) .++.+. ..+|. +| ++|..... +-++.+.. T Consensus 153 ------~~i~i~~v~~p~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d~~~~ 226 (706) T protein:vir:10 153 ------QRIAVEPIYDPARSVWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPDVVYI 226 (706) T ss_pred ------ccceeeeeccchhceecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCCCccee Confidence 001111 11222 22 22221111 11111100 Q ss_pred -------------EEEEeeee-EEEEEecCe-----------------------EEEEEEeCCcEEEEeccCCCCCCcee Q lcl|NC_020844. 173 -------------MRMWEDAE-TILVLRPDD-----------------------WERWRKNDAGIWEVDEFGRITIGVVP 215 (455) Q Consensus 173 -------------v~~~E~~e-~~~v~~~~~-----------------------~~~~~~~~~g~~~~~~~g~~~l~~vP 215 (455) +.+++.+. .++++.++. ++++.....|...+.+..+.+.+.+| T Consensus 227 ~eyy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p~~~~~~P 306 (706) T protein:vir:10 227 AKYYEVRKESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRRIPGEHIP 306 (706) T ss_pred cccccccceeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCCCCCCccc Confidence 00000000 000011000 01111111111112233345568999 Q ss_pred EEEEEeccc--CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEe-----cCCCccccccCc--cce---e Q lcl|NC_020844. 216 MVPFITGRR--KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSAN-----GVEPDTDSEGSP--KPL---D 283 (455) Q Consensus 216 ~V~f~~~~~--~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~-----G~~~~~~~~~~~--~~~---~ 283 (455) ||||+.... ++. .....-+-++.+.....+...|-+.+++..++.-..+.. |+...+...... ..+ . T Consensus 307 ~vP~~g~r~~~d~~-~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~l~~~~ 385 (706) T protein:vir:10 307 LIPVYGKRWFIDDV-ERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQIRGLEQHWEGRNRKRPAFLPLRT 385 (706) T ss_pred eEEEeecccccccc-CcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccccchhHHHHHHHHhhhcccccccchhccc Confidence 999864332 111 011111234444444444455555665543333222211 011111100000 001 1 Q ss_pred eccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhh--hccCCCcchhHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 284 TGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQ--PLTAQSGNLTRISAAQAASKGNSAVQQW 361 (455) Q Consensus 284 ~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~--~l~~~~~~~sa~a~~~~~~~~~s~L~~~ 361 (455) +|......++ ....+++++++- --.++.+.++...+.|..+++. .+.+..+|.||.++..+..+....|..+ T Consensus 386 ~~~~~g~i~~-----~~~~~~~~~~~~-~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~sn~SG~Ai~~rq~qg~~~~~~~ 459 (706) T protein:vir:10 386 VTDKTGNVVA-----PANVAGYTQAPV-LNQALAALLQQTSADIQEVTGSSQAMQQMPSNVARETVNSLLNRSDMASFIY 459 (706) T ss_pred ccCCCCcccc-----cccccccCCCcc-hHHHHHHHHHHHHHHHHHHhCCCHHHcCCccchHHHHHHHHHHHHHHHHHHH Confidence 1111111111 011344444332 2234455566666777666432 2334455789999998888777777777 Q ss_pred HHHHHHHHHHH----HHHHHHHcCC---------CCCceEEEEec---------------------ccC------CCCCC Q lcl|NC_020844. 362 AAGLKDALENA----LYITNLWMDA---------PDTNPEADVYT---------------------DFR------ADTGE 401 (455) Q Consensus 362 a~~~~~a~~~~----l~~~a~~~g~---------~~~~~~v~v~~---------------------df~------~~~~~ 401 (455) -.|+..+..++ |.++.+|++. ++....+.+|. |+. ..... T Consensus 460 ~Dnl~~~~~~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r 539 (706) T protein:vir:10 460 LDNMAKSLKRAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARR 539 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchHH Confidence 77777776664 7778887743 11111122221 111 11112 Q ss_pred HHHHHHHHHHHHcCC-CCHHH--HHHHHHhcCCCCCCCCHHHHHHHHHhhcc-cCCCC Q lcl|NC_020844. 402 EQTPGYLLTMRQNGD-ISREG--LWMEFQRLGELSDNFDPEEESERLISELP-GGIEE 455 (455) Q Consensus 402 ~~~~~al~~~~~~G~-is~et--~~~~l~~~~vl~~~~d~e~e~~ri~~e~~-~~l~~ 455 (455) .+..++++++..++. ....+ +...+ ...+ +.-..++..++|..+.+ .+..+ T Consensus 540 ~~~~~~m~el~~~~~p~~~~~~~l~~~~--~~~~-d~p~~~e~~e~irk~~~~q~~~~ 594 (706) T protein:vir:10 540 DATVNALTQLLQGMLPQDPMRPALMGII--IDNM-EGEGLDDFKAFNRRQLLTQGIVK 594 (706) T ss_pred HHHHHHHHHHHHhcCCcchhhHHHHHHH--Hhhc-CccchHHHHHHHHHhhcccCCcc Confidence 345667777766543 11112 11111 0011 11122444555554332 12222 No 104 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=98.77 E-value=5.6e-08 Score=60.36 Aligned_cols=400 Identities=11% Similarity=0.040 Sum_probs=194.5 Q ss_pred CCCCC-CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCC-CCCHHHHHHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQD-PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFP-HEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~-v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~-~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |.+.+ +.++.|+-.....+= +..|..-+.+. ...|-.. ...-+-|+....+ -.++...++.....+.+ T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~-----~~~~~~~~~~~--e~~~~lr~~~~~~ly~~m~e~---D~~i~s~l~~rk~av~~ 70 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSG-----VVDGWTVWDPF--EQTPELQWPQSVAVYSRMDNE---DSRVTSLLEAISLPIRS 70 (469) T ss_pred CCCcccCCCCccchhhhhhcc-----cccchhhcccc--ccccccccccchHHHHHHHhh---ChHHHHHHHHHHHHHhc Confidence 87764 566655544421110 00000001100 0001000 2334567766655 57888888888888888 Q ss_pred CCceec--cCCHH----HHHHHHh-hcc-----------CCCCHHHHHHHHHHHHHhhCcEEE-EEeeCCCCcchhhhhH Q lcl|NC_020844. 79 REVEVT--EGTGP----SEDLLED-IDG-----------RGNNLTTFAGQLFFQGVADSISWI-RVDYPAGQQFRNREEE 139 (455) Q Consensus 79 k~p~i~--~~~~~----l~~~~~d-~d~-----------~G~~l~~~~~~~~~~al~~G~~~~-lVd~p~~~~~~t~ad~ 139 (455) .+++|. ..++. +...+.+ +.+ ...++..++..+...++.||.+.+ +|+........ T Consensus 71 ~~w~v~p~~~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~d----- 145 (469) T protein:vir:10 71 TPWRIRANGASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPD----- 145 (469) T ss_pred CCceEecCCCCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCC----- Confidence 888884 12222 2222211 111 134577788888888988998776 44322211000 Q ss_pred HhccCCc-eEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCc--ee- Q lcl|NC_020844. 140 RQAGVRP-YWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGV--VP- 215 (455) Q Consensus 140 ~~~~~rP-y~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~--vP- 215 (455) ....| -+...+++.|--|.++..++...++... ..+..... ......+. || T Consensus 146 --G~~~~~~l~~rp~~~i~~~~~~~~~~l~~~~~~~----------------------~~~~~~~~-~~~~~~~~~~lp~ 200 (469) T protein:vir:10 146 --GRFWLRKLAPRPQWTISKFNVAPDGGLESIEQIA----------------------PPARTRGS-LYVANIAPPEIPV 200 (469) T ss_pred --CceeeeeeeecCcccceeeeeccCCceeeeeecC----------------------cccccccc-cccCCCCcccccc Confidence 00000 1122222222233333333222111100 00000000 00011122 22 Q ss_pred --EEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccC---cc--ceeeccce Q lcl|NC_020844. 216 --MVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGS---PK--PLDTGPDT 288 (455) Q Consensus 216 --~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~---~~--~~~~g~~~ 288 (455) ||.+...+..+ -..+.+.|..++..-+---....+...-++.-+.|+++.+--...++++-+ .. .+..|.+. T Consensus 201 ~k~i~~~~~~~~g-~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~~a~~~ek~~l~~a~~~~~~g~~a 279 (469) T protein:vir:10 201 NRLVVYTRNKRPG-QWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASSATDEDEVRKMAALARSVRGGINA 279 (469) T ss_pred CcEEEEEecCCCC-CcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCCCCCHHHHHHHHHHHHHHhcCCce Confidence 34444444433 345677777777654433345677788888889999988742222222211 11 12347778 Q ss_pred eeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHh--hhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 289 VLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIREL--GRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLK 366 (455) Q Consensus 289 ~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~--g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~ 366 (455) +..+|.+. ++.|++.++++ ..+...++...++|..+ |..+ +.++..-|-.............+...+..+. T Consensus 280 ~~iip~~~-----~ie~~ea~g~~-~~~~~li~~~d~~Isk~iLG~tl-Ts~~~gGS~a~~~vh~ev~~d~~~sDa~~i~ 352 (469) T protein:vir:10 280 GVGLAQGQ-----ILELLGVSGNL-PDIRRAIEGHDRSIALSGLAHFL-NLDGKGGSYALASVLEDPFTQAVHAYATSIC 352 (469) T ss_pred EEEccCCc-----eEEEeecCCCc-hHHHHHHHHHHHHHHHHHhcccc-cccCccchhhHHHHHHHHHHHHHHHHHHHHH Confidence 88898664 79999988765 56888899998998663 4343 3222112222233444445567788888999 Q ss_pred HHHH-HHHHHHHHHc-CCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCC-----HHHHHHHHHhcCCCCCCCCHH Q lcl|NC_020844. 367 DALE-NALYITNLWM-DAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDIS-----REGLWMEFQRLGELSDNFDPE 439 (455) Q Consensus 367 ~a~~-~~l~~~a~~~-g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is-----~et~~~~l~~~~vl~~~~d~e 439 (455) ++++ +++++++.+- |....-..+.+.. ....+...++++.++.+.|.+. .+-++ ++.|+-.+... + T Consensus 353 ~tln~~li~~l~~lN~g~~~~~P~~~~~~---~e~~~~~~a~~i~~l~~~G~~~~~~~~~~~~~---e~~gip~~~~~-~ 425 (469) T protein:vir:10 353 RIANQHIIEDLVDINFGVDTPAPVLTFDP---IGSRQDLTAAAVKLLYDAGVFDDDPAVKRAIR---QRFNLPSELND-T 425 (469) T ss_pred HHHHHHHHHHHHHhcCCCCCCccEEEecC---CCCcHHHHHHHHHHHHhcCCccCccccHHHHH---HHhCCCCCCCC-c Confidence 9996 5888887764 3222223554431 1122345577788888999843 32222 44566333222 2 Q ss_pred HHHHHHHhhc---ccCCCC Q lcl|NC_020844. 440 EESERLISEL---PGGIEE 455 (455) Q Consensus 440 ~e~~ri~~e~---~~~l~~ 455 (455) ......+... +....+ T Consensus 426 ~~~~~~~~~~~~~~~~~~~ 444 (469) T protein:vir:10 426 PSAEPEEPAAVPNQSAAPA 444 (469) T ss_pred ccccchhcccCCCCCcccc Confidence 2111111111 000111 No 105 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=98.76 E-value=6.1e-08 Score=60.14 Aligned_cols=387 Identities=11% Similarity=0.034 Sum_probs=187.6 Q ss_pred CCCC-C----CCccCHHHHHHHHHHHHHHHHhc-----C------HHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCCh Q lcl|NC_020844. 1 MPEQ-D----PTKRSADAEAMSDYLQTVDDVLE-----G------VTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNV 64 (455) Q Consensus 1 m~~~-~----v~~~~p~y~~~~~~w~~i~d~~~-----G------~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~ 64 (455) |+.- | +-...+.-....+.-.-+++.+. | -..+|++..-++++ --+-|+....| ..+ T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~----~~~L~~~m~e~---D~~ 73 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQQLRKQQTAHLAGLAKEFANHPAKGLTPAKLAHILIEAEQGHLQA----QAELFMDMEER---DAH 73 (528) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHH----HHHHHHHHHhh---ChH Confidence 4321 0 00000000001111111222211 1 01122221111110 01124444433 577 Q ss_pred HHHHHHHHhhhhhcCCceecc---CCH---HHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhh Q lcl|NC_020844. 65 FRDVLEGLASKPFGREVEVTE---GTG---PSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREE 138 (455) Q Consensus 65 ~~~~v~~~~g~lf~k~p~i~~---~~~---~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad 138 (455) +...++.-...+.+.+++|.- +++ ...++++..-..-.+++.++..+. .++-||.+.+=+- T Consensus 74 i~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~i~~~l-da~~~G~s~~Ei~------------ 140 (528) T protein:vir:10 74 LFAEMSKRKRAVLGLDWTIEPPRNASAAEKADAEYLHELLLDLEGIEDLMLDCM-DGVGHGYSAIELD------------ 140 (528) T ss_pred HHHHHHHHHHHHhcCCceEecCCCCCHHHHHHHHHHHHHHhCCccHHHHHHHHH-hhhhhcceeEEEE------------ Confidence 888888888888888888841 111 222333322111124888887776 4777887665331 Q ss_pred HHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCcee--- Q lcl|NC_020844. 139 ERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVP--- 215 (455) Q Consensus 139 ~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP--- 215 (455) |+. .+|.+.+..+..+.. ..+.+...+..++ +.. + +..+.+| T Consensus 141 --------------------w~~--~~g~~~~~~~~~r~~-~~f~~~~~~~~~l-~~~--------~---~~~~g~~l~~ 185 (528) T protein:vir:10 141 --------------------WSL--QGREWLPQAFDHRPQ-SWFQLNPDDQDEL-RLR--------D---NSIAGEVLQP 185 (528) T ss_pred --------------------Eee--cCCceeEEEeeeecc-cceeeccCCCcEE-ecc--------C---CCCCceeecC Confidence 222 234444433333221 1111111111111 110 1 1111122 Q ss_pred --EEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEe---cCCCccccccCccceeeccceee Q lcl|NC_020844. 216 --MVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSAN---GVEPDTDSEGSPKPLDTGPDTVL 290 (455) Q Consensus 216 --~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~---G~~~~~~~~~~~~~~~~g~~~~~ 290 (455) |+-+...+.. +...+.+.|..++..-+--.....+...-++.-+.|+++.+ |.+.++..........+|.+.+. T Consensus 186 ~k~iv~~~~~~~-g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~L~~al~~i~~~~~~ 264 (528) T protein:vir:10 186 FGWIMHKPRSRS-GYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGKYPPGTPDEEKVTLLRAVTGLGHAAAG 264 (528) T ss_pred CCeEEEeecCCC-CCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHHHHHHhhCcEE Confidence 2222223332 33346666667766554444456788888899999999987 22222222222223557888899 Q ss_pred eccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHh--hhhhccCCCc--chhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 291 YAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIREL--GRQPLTAQSG--NLTRISAAQAASKGNSAVQQWAAGLK 366 (455) Q Consensus 291 ~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~--g~~~l~~~~~--~~sa~a~~~~~~~~~s~L~~~a~~~~ 366 (455) .+|.+. .++|++..+.+...++..++...++|..+ |..+-...+. .-|..............+..-+..++ T Consensus 265 iiP~~~-----~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~Alg~vh~~v~~di~~aDa~~i~ 339 (528) T protein:vir:10 265 IIPESM-----SIDFQEASKGSAEPFMAMMRWCDDSMSKAILGGTLTSQTSESGGGAYALGQVHNEVRHDLLAADARQLA 339 (528) T ss_pred EecCCc-----eeEEeecCCCChhHHHHHHHHHHHHHHHHHhhhhhhccccccccchhhhHHHHHHHHHHHHHHHHHHHH Confidence 999765 69999987777778888899999998663 5444222111 12222333444455667778888999 Q ss_pred HHHH-HHHHHHHHHcCCCCCc--eEEEEecccCCCCCCHHHHHHHHHHHHcCC-CCHHHHHHHHHhcCCCCCCCCHHHHH Q lcl|NC_020844. 367 DALE-NALYITNLWMDAPDTN--PEADVYTDFRADTGEEQTPGYLLTMRQNGD-ISREGLWMEFQRLGELSDNFDPEEES 442 (455) Q Consensus 367 ~a~~-~~l~~~a~~~g~~~~~--~~v~v~~df~~~~~~~~~~~al~~~~~~G~-is~et~~~~l~~~~vl~~~~d~e~e~ 442 (455) ++++ +++++++.|-.-.... ....+..++.........++++.++...|. ||.+.+.+. .|+-.+. +.|+.+ T Consensus 340 ~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~---~gip~p~-~~e~~~ 415 (528) T protein:vir:10 340 ATLSRDLLWPLLVLNRSGNLDARRAPRLVFDLKDRADLAAMATSLPPLVKLGVQVPVNWVQEQ---LGIPLPA-NGEAVL 415 (528) T ss_pred HHHHHHHHHHHHHhCCCCCCCccccceEEecCCCcccHHHHHHHHHHHHhCCCCCCHHHHHHH---hCCCCCC-CCcccc Confidence 9997 5889888885432211 111222222222112345778888989998 887755544 4652222 212211 Q ss_pred HHHHhhcccCCCC Q lcl|NC_020844. 443 ERLISELPGGIEE 455 (455) Q Consensus 443 ~ri~~e~~~~l~~ 455 (455) ..+.++.-.. T Consensus 416 ---~~~~~~~~~~ 425 (528) T protein:vir:10 416 ---GDQAGAGIAQ 425 (528) T ss_pred ---cCCCcccccc Confidence 1111110000 No 106 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=98.74 E-value=6.9e-08 Score=59.84 Aligned_cols=421 Identities=13% Similarity=0.076 Sum_probs=203.6 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCCC-CCHHHHHHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFPH-EQNKRYDLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~~-e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |++.. +.--.......+|+.+++--.- ...+++..+-.||..-. +.... ..... -.|-+...+.++.++..+++ T Consensus 1 ~~~~~--~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~-~~~~~-~~~dst~~~a~~~Laa~l~~ 76 (543) T protein:vir:88 1 MAETK--REGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSDNS-STDYT-TPWQAVGARGLNNLSAKVML 76 (543) T ss_pred Ccccc--cCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCCcc-ccccc-ccccchHHHHHHHHHHHHHH Confidence 99632 2222233344556555443221 34456666666775321 21111 11111 14555556666666555531 Q ss_pred C--Cce----ecc-------------CCHHHHHHHHhhcc------CCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcc Q lcl|NC_020844. 79 R--EVE----VTE-------------GTGPSEDLLEDIDG------RGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQF 133 (455) Q Consensus 79 k--~p~----i~~-------------~~~~l~~~~~d~d~------~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~ 133 (455) - |++ +.- ....++.|++.+.. ..++++.-+..+.++...+|-+-++++-+..... T Consensus 77 ~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~~ 156 (543) T protein:vir:88 77 ALFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPPPDASSN 156 (543) T ss_pred hhcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeccCccccc Confidence 0 221 000 01224455433211 2356888888999999999999999975443221 Q ss_pred hhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEee----------------------eeEEEEEe----- Q lcl|NC_020844. 134 RNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWED----------------------AETILVLR----- 186 (455) Q Consensus 134 ~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~----------------------~e~~~v~~----- 186 (455) + .+| +..|+-.+.. +..+..|.. .+.++ ++. .+.+.|++ T Consensus 157 ~---------~~~-~~~~pl~~y~-v~~d~~G~v--~~i~r-~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~~~V~pr 222 (543) T protein:vir:88 157 S---------YNP-MKLYTLHNHV-VQRDAFGNV--LQIVT-LDKVAYAALPEDVRNSLSGGQEYKPEQELEVYTHIYID 222 (543) T ss_pred e---------ecc-eEEeEcceEE-EeeCCCCCe--eeeee-eeeccHHHHhHHhhHHHHHHhhcCCccceEEEEEEEee Confidence 1 112 2223322211 222222222 21111 111 11233322 Q ss_pred c--CeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceE Q lcl|NC_020844. 187 P--DDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPML 264 (455) Q Consensus 187 ~--~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~ 264 (455) + +.|..| ..-.|.+...+++..+++..|++++++....++.+ +.+|..+...-...+...+-..-..++.+..|.+ T Consensus 223 ~~~~~~~~~-~~~~~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~Y-Grgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~ 300 (543) T protein:vir:88 223 DESGDFLSY-QEIEGVEVDGSDGQYPQDALPWIAVRWTKRDGEHY-GRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVG 300 (543) T ss_pred cCCCccccc-ccccCeeeecCCCccccccCCceeeeeeecCCCcc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 1 112222 12233444456667778889999998887777766 7777777666555555566666777778888886 Q ss_pred EEe--cCCCccccccCccceeeccceeeeccCCCCccccceeEEeeC-chhHHHHHHHHHHHHHHHHHh-hhhhc-cCCC Q lcl|NC_020844. 265 SAN--GVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPA-TESLKFLAEQIESVSKEIREL-GRQPL-TAQS 339 (455) Q Consensus 265 ~~~--G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~-~~~l~~~~~~l~~~~~~m~~~-g~~~l-~~~~ 339 (455) .+. |.. ++..+.-|... ..+|. ..+++..+++. +..+....+.|+++++.|.+. =..++ ..++ T Consensus 301 ~v~~~g~~-------~~~~~~~~~~g-~~v~g----~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~ 368 (543) T protein:vir:88 301 LVNPNGIT-------QVRRLVKAQTG-DFVAG----RKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLNSAVQRSG 368 (543) T ss_pred eecccccc-------chhhcccCCCc-eeecC----CCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCCC Confidence 653 111 11112222111 12331 12345555543 457888889999999998652 12222 3445 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHH-H----HHHHHHHHHHcCC----CCCceEEEEecccCCCCCCHHHHHHHHH Q lcl|NC_020844. 340 GNLTRISAAQAASKGNSAVQQWAAGLKDA-L----ENALYITNLWMDA----PDTNPEADVYTDFRADTGEEQTPGYLLT 410 (455) Q Consensus 340 ~~~sa~a~~~~~~~~~s~L~~~a~~~~~a-~----~~~l~~~a~~~g~----~~~~~~v~v~~df~~~~~~~~~~~al~~ 410 (455) .+.||++.+.+.......|.-.-..++.- + ++++.++-+ .|. +++.+++++..-.... --.+.++.+.. T Consensus 369 ~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r-~g~lP~~p~~~v~~~~vs~l~~l-~r~~~~~~l~~ 446 (543) T protein:vir:88 369 ERVTAEEIRYVASELEDTLGGVYSILSQELQLPIVRVLLNQLQA-TQQIPNLPQEAVEPTVTTGAEAL-GRGQDLDKLTQ 446 (543) T ss_pred CcccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCchhceeeeEEecHHHH-HHHHHHHHHHH Confidence 56899999999988888887766665543 2 233333322 122 3333344432211110 01233333333 Q ss_pred HHH-cCCCCHH---------HHHH-HHHhcCCCCC-CCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 411 MRQ-NGDISRE---------GLWM-EFQRLGELSD-NFDPEEESERLISELPGGIEE 455 (455) Q Consensus 411 ~~~-~G~is~e---------t~~~-~l~~~~vl~~-~~d~e~e~~ri~~e~~~~l~~ 455 (455) ..+ .|.|+.. .+.+ .....||.+. -+..++|.+.+.+|.....-. T Consensus 447 ~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~~~~i~r~~~e~~~~~~q~~~q~~~ 503 (543) T protein:vir:88 447 FLNAVATVSQLNGDPDLNVNNIKLRLANAIGIDTAGLLLTEAEKAQAQSQEMLKQGG 503 (543) T ss_pred HHHHHHhccchhhhccCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHH Confidence 333 1333322 2222 2334677433 335566666665553211111 No 107 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=98.71 E-value=9.4e-08 Score=59.10 Aligned_cols=393 Identities=11% Similarity=0.035 Sum_probs=184.7 Q ss_pred CCCC-CCCccCHHHHH----HHHHHHHHHHHhc-----CH------HHHHHhhhhcCCCCCCCCHHHHHHHhhccccCCh Q lcl|NC_020844. 1 MPEQ-DPTKRSADAEA----MSDYLQTVDDVLE-----GV------TGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNV 64 (455) Q Consensus 1 m~~~-~v~~~~p~y~~----~~~~w~~i~d~~~-----G~------~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~ 64 (455) |+.- |+.-+-+...+ ..+...-+++... |- ..+|++..-.+.+ .-+=|+....| ..+ T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~----~~~L~edm~e~---D~~ 73 (526) T protein:vir:79 1 MAQIVDVYGNPIRPQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQA----QAELFMDMEER---DAH 73 (526) T ss_pred CCeeeCCCCCccCccccchhhhhhhhhhhhhcccCCCCCcCHHHHHHHHHHhhCCCHHH----HHHHHHHHHhh---ChH Confidence 4421 00000000000 0011111111111 10 1111111100000 01223433333 467 Q ss_pred HHHHHHHHhhhhhcCCceec---cC-C--HHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhh Q lcl|NC_020844. 65 FRDVLEGLASKPFGREVEVT---EG-T--GPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREE 138 (455) Q Consensus 65 ~~~~v~~~~g~lf~k~p~i~---~~-~--~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad 138 (455) +.-.++.-...+.+.+++|. .+ + ....++++..-....+++.++..+.. |+-||.+.+=+ T Consensus 74 i~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~ld-A~~~G~s~~Ei------------- 139 (526) T protein:vir:79 74 LFAEMSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDALD-GIGHGYSCIEL------------- 139 (526) T ss_pred HHHHHHHHHHHHhCCCceEecCCCCChHHHHHHHHHHHHHhcccCHHHHHHHHHh-hhhhcceeEEE------------- Confidence 77777888888888888774 11 1 12223333222122358888888775 77788665433 Q ss_pred HHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCcee--E Q lcl|NC_020844. 139 ERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVP--M 216 (455) Q Consensus 139 ~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP--~ 216 (455) .|+. .+|.+.+..+..+.. ..+.+...+.+++ ...+....+...-| + T Consensus 140 -------------------~w~~--~~g~~~~~~l~~r~~-~~F~~~~~~~~~l---------~~~~~~~~g~~l~~~k~ 188 (526) T protein:vir:79 140 -------------------EWAL--QGREWMPLAFHHRPQ-SWFQLNPEDQNEL---------RLRDNSPAGEALQPFGW 188 (526) T ss_pred -------------------EEee--cCCceeEEEeeeecc-cceEeccCCCcEE---------EecCCCCCceeecCCce Confidence 1332 244454444333321 1111111111111 11111111111111 2 Q ss_pred EEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEe---cCCCccccccCccceeeccceeeecc Q lcl|NC_020844. 217 VPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSAN---GVEPDTDSEGSPKPLDTGPDTVLYAP 293 (455) Q Consensus 217 V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~---G~~~~~~~~~~~~~~~~g~~~~~~lp 293 (455) |.+...+. .+...+.+.|..++..-+--.....+...-++.-+.|+++.+ |.+.++..........+|.+.+..+| T Consensus 189 iv~~~~~~-~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky~~~a~~~ek~~L~~av~~i~~da~~iiP 267 (526) T protein:vir:79 189 IIHRPRAR-SGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPGTADEEKATLLRAVTGLGHAAAGIIP 267 (526) T ss_pred EEEeecCC-cCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCCCCHHHHHHHHHHHHHHhcCcEEEec Confidence 22222233 233345565666665443333356777888888999999987 22222221222223558888999999 Q ss_pred CCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHH--hhhhhccCCC--cchhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 294 PNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRE--LGRQPLTAQS--GNLTRISAAQAASKGNSAVQQWAAGLKDAL 369 (455) Q Consensus 294 ~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~--~g~~~l~~~~--~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~ 369 (455) .+. .++|++..+.+-..++..++...++|.. +|..+-...+ +.-|-.............+..-+..+.+++ T Consensus 268 ~~~-----~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tl 342 (526) T protein:vir:79 268 ETM-----AIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHNEVRHDILASDARQLAATL 342 (526) T ss_pred CCc-----eeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 765 6999998766667788888988999866 3554433211 112223333444555667788889999999 Q ss_pred H-HHHHHHHHHcCCCCCce--EEEEecccCCCCCCHHHHHHHHHHHHcCC-CCHHHHHHHHHhcCCCCCCCCHHHHHHHH Q lcl|NC_020844. 370 E-NALYITNLWMDAPDTNP--EADVYTDFRADTGEEQTPGYLLTMRQNGD-ISREGLWMEFQRLGELSDNFDPEEESERL 445 (455) Q Consensus 370 ~-~~l~~~a~~~g~~~~~~--~v~v~~df~~~~~~~~~~~al~~~~~~G~-is~et~~~~l~~~~vl~~~~d~e~e~~ri 445 (455) + ++++.++.|-+-..... .-.+..++.........++.+.++...|. ||.+.+.+ +.|+-.+. +.++.+... T Consensus 343 n~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e---~~gip~~~-~~e~~l~~~ 418 (526) T protein:vir:79 343 SRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYD---KLGIPQPA-KNEPVLRPA 418 (526) T ss_pred HHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCcCCHHHHHH---HhCCCCCC-Cchhhcccc Confidence 6 59999988864322111 11222222222112345777888888888 88765554 34662222 212211111 Q ss_pred Hhhccc--CCCC Q lcl|NC_020844. 446 ISELPG--GIEE 455 (455) Q Consensus 446 ~~e~~~--~l~~ 455 (455) ....+. .... T Consensus 419 ~~~~~~~~~~~~ 430 (526) T protein:vir:79 419 AQPAILSRQHGQ 430 (526) T ss_pred CCcccccccccc Confidence 111000 0000 No 108 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=98.69 E-value=1e-07 Score=58.86 Aligned_cols=418 Identities=11% Similarity=0.071 Sum_probs=200.1 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhc- Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFG- 78 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~- 78 (455) |++.+ -=+=...+.+|+.++.--.= ...+|+..+-.||..-..+...=..+..+ .|-+...+.++.++..+++ T Consensus 1 ~~~~~----~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~-~~dst~~~a~~~Las~l~~~ 75 (522) T protein:vir:94 1 MAERE----GFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTT-PWQAVGARCLNNLAAKLMLA 75 (522) T ss_pred Ccccc----hhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccc-cccccHHHHHHHHHHHHHhh Confidence 98643 11122334455554332111 34455555555664321111111112222 4666666777766665532 Q ss_pred CCcee-----c-------------cCCHHHHHHHHhhc------cCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcch Q lcl|NC_020844. 79 REVEV-----T-------------EGTGPSEDLLEDID------GRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFR 134 (455) Q Consensus 79 k~p~i-----~-------------~~~~~l~~~~~d~d------~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~ 134 (455) -.|.. . .....++.|++.+. ...++++.-+..+.++...+|-+.++++-+..+... T Consensus 76 ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~ 155 (522) T protein:vir:94 76 LFPQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEPEQGTYS 155 (522) T ss_pred cCCCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeeccCCCcee Confidence 11211 0 01123555655431 123568888889999999999999998644333211 Q ss_pred hhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEee---------------------eeEEEEEe-----cC Q lcl|NC_020844. 135 NREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWED---------------------AETILVLR-----PD 188 (455) Q Consensus 135 t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~---------------------~e~~~v~~-----~~ 188 (455) .+..|+-.+.. +..+..|. +.+.++ ++. .+.+.|++ .+ T Consensus 156 ------------~~~~~pl~~y~-v~~d~~G~--vd~i~r-~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~ 219 (522) T protein:vir:94 156 ------------PMRMYRLVSYV-VQRDAFGN--ILQIVT-IDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIYRQDD 219 (522) T ss_pred ------------eEEEEEcceEE-EeeCCCcC--eEEEee-eeeccHHhcchHHHHHHhcccCCccceEEEEEEEEeeCC Confidence 24445544432 22222222 222221 111 11233321 22 Q ss_pred eEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEe- Q lcl|NC_020844. 189 DWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSAN- 267 (455) Q Consensus 189 ~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~- 267 (455) +|..|. ...|......++..++...|++++++....++.+ +..|..+...-...+...+-......+.+..|.+.+. T Consensus 220 ~~~~~~-~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~Y-Grgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~ 297 (522) T protein:vir:94 220 EYLRYE-EVEGIEVTGTDGSYPLTACPYIPVRMVRLDGEDY-GRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNP 297 (522) T ss_pred ceeEEe-eccCceecccCCCCccccCCceeeeeeecCCCcc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecc Confidence 333222 2233333445566678899999998888777776 7778777666656666666667777788888887663 Q ss_pred -cCCCccccccCccceeeccceeeeccCCCCccccceeEEee-CchhHHHHHHHHHHHHHHHHHhh-hhhc-cCCCcchh Q lcl|NC_020844. 268 -GVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEP-ATESLKFLAEQIESVSKEIRELG-RQPL-TAQSGNLT 343 (455) Q Consensus 268 -G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~-~~~~l~~~~~~l~~~~~~m~~~g-~~~l-~~~~~~~s 343 (455) |..... .....|++.+ +|.. .++++.++. ++..+....+.|+++++.|...= ..++ ..++.+.| T Consensus 298 ~g~~~~~------~~~~~~~g~~--v~g~----~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~T 365 (522) T protein:vir:94 298 NGITQPR------RLNKAATGEF--VAGR----VEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLNSAVQRNAERVT 365 (522) T ss_pred cccccch------heeccCCcee--ecCC----cccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCCCcccc Confidence 211111 0112222222 3311 223444542 34577888888999999886631 1222 23455689 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-H----HHHHHHHHHHcCC----CCCceEEEEecccCCCCCCHHHHHHHHHHHHc Q lcl|NC_020844. 344 RISAAQAASKGNSAVQQWAAGLKDA-L----ENALYITNLWMDA----PDTNPEADVYTDFRADTGEEQTPGYLLTMRQN 414 (455) Q Consensus 344 a~a~~~~~~~~~s~L~~~a~~~~~a-~----~~~l~~~a~~~g~----~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~ 414 (455) |++.+.+.......|.-.-..++.- + ++++.++-+ .|. +++.+++++..-.. .---.+.++.+....+. T Consensus 366 AtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r-~g~lP~~p~~~v~v~~~s~La-~~qr~~~~~~l~~~~~~ 443 (522) T protein:vir:94 366 AEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQS-AGMIPDLPKEAVEPTVSTGLE-ALGRGQDLEKLTQAVNM 443 (522) T ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCCcccEEeeEecHHH-HHHHHHHHHHHHHHHHH Confidence 9999999888888887766665542 2 333333322 232 23333333321110 00012333333333321 Q ss_pred -CCCCHHHH----------HHHHHhcCCCCC-CCCHHHHHHHHHhhcccC--CCC Q lcl|NC_020844. 415 -GDISREGL----------WMEFQRLGELSD-NFDPEEESERLISELPGG--IEE 455 (455) Q Consensus 415 -G~is~et~----------~~~l~~~~vl~~-~~d~e~e~~ri~~e~~~~--l~~ 455 (455) ..+..+.+ .......||.+. -+-.++|.+.+.+|.... +-. T Consensus 444 ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~~~ivr~~ee~~~~~~q~~~~~~~~~ 498 (522) T protein:vir:94 444 MTGLQPLSQDPDINLPTLKLRLLNALGIDTAGLLLTQDEKIQRMAEQSSQQAVVQ 498 (522) T ss_pred HHhccchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHH Confidence 11222221 223344666332 222344444333331110 000 No 109 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=98.67 E-value=1.2e-07 Score=58.51 Aligned_cols=432 Identities=13% Similarity=0.063 Sum_probs=178.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhh---cCC--CCCCCCHHHHHHHhhccccCChHHHHHHHHhhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKR---YVK--QFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASK 75 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~---yLp--k~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~ 75 (455) ||++ ...|-. .+.+ ++.+.......|+.... |.- +++.+..+..+. ..|-+ +|.++.+|+..+|+ T Consensus 1 m~d~--~~~~~~---~~~~---~~~~~~~~~~~R~~a~~d~~fy~G~QW~~~~~~~l~~-q~rp~-~N~i~~~v~~v~g~ 70 (725) T protein:vir:10 1 MADN--ENRLES---ILSR---FDADWTASDEARREAKNDLFFSRVSQWDDWLSQYTTL-QYRGQ-FDVVRPVVRKLVSE 70 (725) T ss_pred CCch--HHHHHH---HHHH---HHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHh-cCCCc-ccchHHHHHHHHhh Confidence 9964 323332 2222 23334444444443322 211 222222222232 33433 69999999999999 Q ss_pred hhcCCceec-----c----CCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEE--eeCCCCcchhhhhHHhccC Q lcl|NC_020844. 76 PFGREVEVT-----E----GTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRV--DYPAGQQFRNREEERQAGV 144 (455) Q Consensus 76 lf~k~p~i~-----~----~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lV--d~p~~~~~~t~ad~~~~~~ 144 (455) --...+.+. . ..+.+..++..+- .-++.+.-+..++..++.+|++|+=| ||.....+.. + ..+ T Consensus 71 e~~nr~d~~v~p~~~~d~~~Ae~l~~~~~~~~-~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~--~---~~i 144 (725) T protein:vir:10 71 MRQNPIDVLYRPKDGASPDAADVLMGMYRTDM-RHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSN--N---QVI 144 (725) T ss_pred HHhCCcceEEecCCcchHHHHHHHHHHHHHHH-HhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCC--c---eee Confidence 766555442 1 1223444444442 34667888889999999999999755 4422111100 0 000 Q ss_pred CceEEEechhhhc-cceeeccC---CeeEEE----------------------------------------EEEEEeeee Q lcl|NC_020844. 145 RPYWSQVNHSAIL-EIQSERQG---AGERIT----------------------------------------YMRMWEDAE 180 (455) Q Consensus 145 rPy~~~~~~~~ii-~w~~~~~~---~~~~l~----------------------------------------~v~~~E~~e 180 (455) +-+....++.+|+ +|.....+ -++.+. .+++-|... T Consensus 145 ~~~~i~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~vrv~E~~~ 224 (725) T protein:vir:10 145 RREPIHSACSHVIWDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTIQIAEFYE 224 (725) T ss_pred eeeecccCHhHcccCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeEEEEEEEE Confidence 0000011122221 11110000 000000 011111100 Q ss_pred ------EEEEE-ec--CeE-------------------------------EEEEEeCCcEEEEeccCCCCCCceeEEEEE Q lcl|NC_020844. 181 ------TILVL-RP--DDW-------------------------------ERWRKNDAGIWEVDEFGRITIGVVPMVPFI 220 (455) Q Consensus 181 ------~~~v~-~~--~~~-------------------------------~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~ 220 (455) .+.++ ++ +.+ +++.....|...+.+..+.+-+.+|||||+ T Consensus 225 r~~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~fP~vP~~ 304 (725) T protein:vir:10 225 VVEKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVF 304 (725) T ss_pred EEEEeeEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCCceeEEEEE Confidence 00001 01 011 111111111111223334455779999986 Q ss_pred eccc--CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEe-c-CCCccccccCccceeeccceeeeccCCC Q lcl|NC_020844. 221 TGRR--KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSAN-G-VEPDTDSEGSPKPLDTGPDTVLYAPPNA 296 (455) Q Consensus 221 ~~~~--~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~-G-~~~~~~~~~~~~~~~~g~~~~~~lp~~~ 296 (455) .... ++.+. ...-+-++.+...-++...|-+.+++..++.-..... + ++.......+++...+..-+......++ T Consensus 305 g~r~~~~g~~~-~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~~~~~~~~~~~~~g~ 383 (725) T protein:vir:10 305 GEWGFVEDKEV-YEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGE 383 (725) T ss_pred eeeeccCCcce-eeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCceeeecccccccCcc Confidence 4332 22211 0122334444444445555666666654433222221 1 1111111111111111100000000000 Q ss_pred CccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhh---hhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-- Q lcl|NC_020844. 297 EGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGR---QPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALEN-- 371 (455) Q Consensus 297 ~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~---~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~-- 371 (455) . ....+.+.++.. --.++...|+...+.|..+++ .++...+++.||.++..+..+....|..+-.|+..+... T Consensus 384 ~-~~~~i~~~~~~~-~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g 461 (725) T protein:vir:10 384 M-PTQPLAYYENPE-VPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDG 461 (725) T ss_pred c-ccccCcccCCCC-chHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 111233443222 223455566777777766543 344445556889999988887777777777777666554 Q ss_pred --HHHHHHHHcCC---------CCCceEEEEec------------------ccCCC--------CCCHHHHHHHHHHHHc Q lcl|NC_020844. 372 --ALYITNLWMDA---------PDTNPEADVYT------------------DFRAD--------TGEEQTPGYLLTMRQN 414 (455) Q Consensus 372 --~l~~~a~~~g~---------~~~~~~v~v~~------------------df~~~--------~~~~~~~~al~~~~~~ 414 (455) +|.++.++++. ++..-.+.||. .|+.. ....+.+.+|+++..+ T Consensus 462 ~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~ 541 (725) T protein:vir:10 462 EIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILELLGK 541 (725) T ss_pred HHHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHHHHh Confidence 56677787743 11112334442 12111 0012445555555543 Q ss_pred C-CCCH---HHHHHHHHhcCCCCCCCCHHHHHHHHHhhccc-CCCC Q lcl|NC_020844. 415 G-DISR---EGLWMEFQRLGELSDNFDPEEESERLISELPG-GIEE 455 (455) Q Consensus 415 G-~is~---et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~-~l~~ 455 (455) - -+.. .++...+ ..+ ..-..++..++|..+.+. ++.+ T Consensus 542 ~~~~~~~~~~~l~~~~---~~~-d~~~~~e~~erirkq~~~~~~~~ 583 (725) T protein:vir:10 542 TPQGTPEYQLLLLQYF---TLL-DGKGVEMMRDYANKQLIQMGVKK 583 (725) T ss_pred ccccchhHHHHHHHHh---hcC-CchhHHHHHHHHHhhhhhhccCC Confidence 1 1111 1121111 111 111234556667665432 2222 No 110 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=98.66 E-value=1.4e-07 Score=58.22 Aligned_cols=415 Identities=12% Similarity=0.085 Sum_probs=195.0 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGR 79 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k 79 (455) |++.....-.- .....+|+.++.-..= ...+++..+-.||..-..+...=..+..+ .|-+...+.++.++..+++- T Consensus 1 m~~~~~~~~~~--~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~-~~dst~~~a~~~Laa~l~~~ 77 (535) T protein:vir:15 1 MADSKRTGLGE--DGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTT-PWQAVGARGLNNLASKLMLA 77 (535) T ss_pred CCccchhccch--HHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccc-cccccHHHHHHHHHHHHHHh Confidence 99643221111 1223455555442111 44556666666665321111111112222 35555556666665555210 Q ss_pred --Cce----ecc-------------CCHHHHHHHHhhc------cCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcch Q lcl|NC_020844. 80 --EVE----VTE-------------GTGPSEDLLEDID------GRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFR 134 (455) Q Consensus 80 --~p~----i~~-------------~~~~l~~~~~d~d------~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~ 134 (455) |++ +.- ....++.|++.+. ...++++.-+..++++.+.+|-+-++++-+..+.. T Consensus 78 ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~- 156 (535) T protein:vir:15 78 LFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSYN- 156 (535) T ss_pred hcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCce- Confidence 211 100 0123455555442 13467888889999999999999888874432221 Q ss_pred hhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeee-----------------------eEEEEEecCeEE Q lcl|NC_020844. 135 NREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDA-----------------------ETILVLRPDDWE 191 (455) Q Consensus 135 t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~-----------------------e~~~v~~~~~~~ 191 (455) .+..|+-.++. ..+...-.+.+.+|. +.. +.+.+++ . T Consensus 157 ------------~f~~~pl~~~~---v~~d~~G~vd~i~r~-~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~----~ 216 (535) T protein:vir:15 157 ------------PMKLYRLSSYV---VQRDAYGNVLQIVTR-DQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYT----H 216 (535) T ss_pred ------------eeEEEEcCeeE---EeeCCCCCeeEEEEe-EeecHHHHHHHHhHhhhccccccCCCCceeEEE----E Confidence 13344433322 222222222222221 110 1122221 1 Q ss_pred EEEEeCCcEEEE----------eccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCC Q lcl|NC_020844. 192 RWRKNDAGIWEV----------DEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAF 261 (455) Q Consensus 192 ~~~~~~~g~~~~----------~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~ 261 (455) +|...+++.|.. ...+..+++..|++++++....++.+ +.+|..+...-...+...+-......+.+.. T Consensus 217 v~~~~~~~~~~~~~e~~g~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~Y-Grgp~~~~l~D~k~L~~l~~~~l~~~~~~~~ 295 (535) T protein:vir:15 217 VYLDEESGDYLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESY-GRSYCEEYLGDLRSLENLQEAIVKMSMISAK 295 (535) T ss_pred EEEecCCCcEEEEEEeeCccccccccccccccCCceeeeeeecCCCcc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 111222222221 12244567789999998887777766 7777777666555666666667777788888 Q ss_pred ceEEEe--cCCCccccccCccce-eeccceeeeccCCCCccccceeEEeeC-chhHHHHHHHHHHHHHHHHHh--hhhhc Q lcl|NC_020844. 262 PMLSAN--GVEPDTDSEGSPKPL-DTGPDTVLYAPPNAEGQHGSWNFVEPA-TESLKFLAEQIESVSKEIREL--GRQPL 335 (455) Q Consensus 262 p~~~~~--G~~~~~~~~~~~~~~-~~g~~~~~~lp~~~~~~~~~~~~le~~-~~~l~~~~~~l~~~~~~m~~~--g~~~l 335 (455) |.+.+. |.. ++..+ .-|.+.+ +|. ..+++..+++. +..+....+.|+++++.|... ...+. T Consensus 296 p~~lv~~~g~~-------~~~~l~~~~~g~~--v~g----~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~ 362 (535) T protein:vir:15 296 VIGLVNPAGIT-------QPRRLTKAQTGDF--VPG----RREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNSAV 362 (535) T ss_pred Cceeecccccc-------cchhcccCCceee--ecC----CcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcc Confidence 886653 111 11111 1222222 221 12345555533 457888889999999998663 22222 Q ss_pred cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHcCC----CCCceEEEEecccCCCCCCHHHHH Q lcl|NC_020844. 336 TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDA-----LENALYITNLWMDA----PDTNPEADVYTDFRADTGEEQTPG 406 (455) Q Consensus 336 ~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a-----~~~~l~~~a~~~g~----~~~~~~v~v~~df~~~~~~~~~~~ 406 (455) ..++.+.||++.+.+.......|.-.-..++.- +++++.++-+ .|. +...+++++..-..... -.++++ T Consensus 363 ~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r-~g~lP~~p~~~v~~~yis~La~aq-r~~~~~ 440 (535) T protein:vir:15 363 QRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQA-TSQIPELPKEAVEPTISTGLEAIG-RGQDLD 440 (535) T ss_pred cCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCCccceeEEEecHHHHHH-HHHHHH Confidence 344556899999999988888887766665542 2333333322 122 23333343322111100 112333 Q ss_pred HHHHHHHc-CCCCHH---------HHH-HHHHhcCCCCC-CCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 407 YLLTMRQN-GDISRE---------GLW-MEFQRLGELSD-NFDPEEESERLISELPGGIEE 455 (455) Q Consensus 407 al~~~~~~-G~is~e---------t~~-~~l~~~~vl~~-~~d~e~e~~ri~~e~~~~l~~ 455 (455) .+....+. ..+..+ .+. ......||-+. -+..++|.+.+.+|..+..-. T Consensus 441 ~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~eev~~~~~q~~~~~~~ 501 (535) T protein:vir:15 441 KLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGI 501 (535) T ss_pred HHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHH Confidence 33332221 112222 222 23344666432 233444444444332211111 No 111 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=98.63 E-value=1.7e-07 Score=57.76 Aligned_cols=421 Identities=10% Similarity=0.021 Sum_probs=194.1 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhh-- Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPF-- 77 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf-- 77 (455) |++-..+... =.....+|+.++.--.= ...+++..+-.||..-..+...=..+..+ .|-+...+.++.++..+. T Consensus 1 m~~~~~~~~~--~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~-~~dst~~~a~~~LAa~L~~~ 77 (532) T protein:vir:99 1 MAEVEKTGFA--ADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTT-PWQSIGARGLNNLASKLMLA 77 (532) T ss_pred Ccchhhcccc--HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCcchhhccc-cccchHHHHHHHHHHHHHHh Confidence 9863221111 12333455554332110 34456666666665421111111223333 456666666666666553 Q ss_pred --c--CCc-eecc-------------CCHHHHHHHHhh------ccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcc Q lcl|NC_020844. 78 --G--REV-EVTE-------------GTGPSEDLLEDI------DGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQF 133 (455) Q Consensus 78 --~--k~p-~i~~-------------~~~~l~~~~~d~------d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~ 133 (455) . +|. .+.- ....++.|++.+ -...++++.-+..+.++...+|-+-++++-++.... T Consensus 78 ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~ 157 (532) T protein:vir:99 78 LFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEG 157 (532) T ss_pred hcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEecccccccC Confidence 2 111 1110 012255665442 112467888889999999999999999874432211 Q ss_pred hhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEee-----------------------eeEEEEEe---- Q lcl|NC_020844. 134 RNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWED-----------------------AETILVLR---- 186 (455) Q Consensus 134 ~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~-----------------------~e~~~v~~---- 186 (455) ....+..|+-.+.. +..+..|.. .+.++ ++. .+.+.|++ T Consensus 158 ----------~~~~f~~~pl~~y~-v~~d~~G~v--~~ivr-r~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~~~v~~ 223 (532) T protein:vir:99 158 ----------QSNAPKLYKLHNFV-VERDAYDNV--LQIVT-EDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTHVYR 223 (532) T ss_pred ----------cccceEEEEcCeEE-EeeCCCCCe--eeEee-eeeecHHhcChHHHHHhhccccccCCCcceEEEEEEEe Confidence 11234455544432 222333322 22221 111 01222221 Q ss_pred -cCe--EEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCce Q lcl|NC_020844. 187 -PDD--WERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPM 263 (455) Q Consensus 187 -~~~--~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~ 263 (455) ++. |..|... .|.......+..++...|++++++....++.+ +.+|-.+...-.......+-..-...+.+..|. T Consensus 224 ~~~~~~~~~~~~~-~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~Y-Grgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~ 301 (532) T protein:vir:99 224 DPEAMVFRSYQEI-DGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDY-GRSFVEEYLGDLKSLENLYEAIVKMSMISSKVL 301 (532) T ss_pred cCCCCeeEEEEee-cCceecccccccccccCCceeeeeeecCCCcc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCC Confidence 111 2222211 22222333455568889999998887777666 667766655544444444444555556666666 Q ss_pred EEEe-cCCCccccccCccce-eeccceeeeccCCCCccccceeEEeeC-chhHHHHHHHHHHHHHHHHHh--hhhhccCC Q lcl|NC_020844. 264 LSAN-GVEPDTDSEGSPKPL-DTGPDTVLYAPPNAEGQHGSWNFVEPA-TESLKFLAEQIESVSKEIREL--GRQPLTAQ 338 (455) Q Consensus 264 ~~~~-G~~~~~~~~~~~~~~-~~g~~~~~~lp~~~~~~~~~~~~le~~-~~~l~~~~~~l~~~~~~m~~~--g~~~l~~~ 338 (455) +.+. +- -.++..+ ..|++.+ +|.. .++++.++.. +..+....+.|+++++.|... ...+...+ T Consensus 302 ~lv~p~g------~~~~~~~~~~~~g~~--v~g~----~~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~d 369 (532) T protein:vir:99 302 FFVNPNG------VTQIRRVAKANTGDF--VAGR----KQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRG 369 (532) T ss_pred ceecccc------ccchhhhccCCCcce--ecCC----cccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCC Confidence 5542 11 1111111 2233332 3311 1234555433 457888888899999988652 11222344 Q ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHH-H----HHHHHHHHHHcCC----CCCceEEEEecccCCCCC-CHHHHHHH Q lcl|NC_020844. 339 SGNLTRISAAQAASKGNSAVQQWAAGLKDA-L----ENALYITNLWMDA----PDTNPEADVYTDFRADTG-EEQTPGYL 408 (455) Q Consensus 339 ~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a-~----~~~l~~~a~~~g~----~~~~~~v~v~~df~~~~~-~~~~~~al 408 (455) +...||++...+.......|.-.-..++.- + ++++.++-+ .|. +++.+...+.. +.. .+ -++.++.+ T Consensus 370 ~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-~g~lP~~p~~~~~~~iv~-~is-~Laraq~~~~l 446 (532) T protein:vir:99 370 GDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQA-TSKIPNLPKEAVEPAIAT-GLE-ALGRGHDLNKL 446 (532) T ss_pred CCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCChhhcccceee-cch-HHHHHHHHHHH Confidence 556899999999888888877666655442 2 333333322 232 12111222211 211 11 12333333 Q ss_pred HHHHHc-----C----CCCHHHHHH-HHHhcCCCC-CCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 409 LTMRQN-----G----DISREGLWM-EFQRLGELS-DNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 409 ~~~~~~-----G----~is~et~~~-~l~~~~vl~-~~~d~e~e~~ri~~e~~~~l~~ 455 (455) ....+. + .|.-..+.+ .....||.+ .-+..++|.+.+.+|....... T Consensus 447 ~~~~~~laq~~p~~~d~id~d~~~~~~a~~~GV~~~~i~r~~ee~~~~~~q~~~~~~~ 504 (532) T protein:vir:99 447 NVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGM 504 (532) T ss_pred HHHHHHHHhhcchhhhhCCHHHHHHHHHHHhCCChhhccCCHHHHHHHHHHHHHHHHH Confidence 222221 1 122222233 333457733 2334455555554432211111 No 112 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=98.59 E-value=2.3e-07 Score=56.99 Aligned_cols=401 Identities=9% Similarity=0.011 Sum_probs=183.7 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCC-CCCHH-HHHHHhhccccCChHHHHHHHHhhhhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFP-HEQNK-RYDLRKSVAKMTNVFRDVLEGLASKPF 77 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~-~e~~~-~Y~~rl~ra~~~n~~~~~v~~~~g~lf 77 (455) |.- ..+|+.++.-..= ...+++..+-.||..- ..... .-..+.. ..|-+...+.++.++..+. T Consensus 1 m~~-------------~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~~-~~~dstg~~a~~~LAa~l~ 66 (522) T protein:vir:10 1 MKA-------------RERYNQLTTARQMFLDKAVECSELTLPYLIDDDISSRPNHKSLT-VPWQSVGAKCCVTLAAKLM 66 (522) T ss_pred Cch-------------HHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCCCccccccc-ccccchHHHHHHHHHHHHH Confidence 541 2233322221100 2334444444555322 11111 1122332 2555666666666665552 Q ss_pred ----c--CCc-eec--------c-CC---HHHHHHHHhhc------cCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCc Q lcl|NC_020844. 78 ----G--REV-EVT--------E-GT---GPSEDLLEDID------GRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQ 132 (455) Q Consensus 78 ----~--k~p-~i~--------~-~~---~~l~~~~~d~d------~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~ 132 (455) . .|. .+. . .+ ..++.|++.+. ...++++.-+..+.++...+|-+-++++.+. T Consensus 67 ~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~~--- 143 (522) T protein:vir:10 67 LAVLPPQTSFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIFMGKDG--- 143 (522) T ss_pred HhhcCCCCccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEEEcCCC--- Confidence 1 111 111 0 11 12455555443 2356788889999999999999999887431 Q ss_pred chhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEE-EEee-----------------------eeEEEEEecC Q lcl|NC_020844. 133 FRNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMR-MWED-----------------------AETILVLRPD 188 (455) Q Consensus 133 ~~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~-~~E~-----------------------~e~~~v~~~~ 188 (455) ++.|+-.+.. +..+..| .+.+.++ ++=+ .+.+.|++ T Consensus 144 ---------------~~~~pl~~y~-v~~d~~G--~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~-- 203 (522) T protein:vir:10 144 ---------------LKTFPLTRYV-INRDGDG--NVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYT-- 203 (522) T ss_pred ---------------ceEEEcceEE-EeeCCCC--CeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEE-- Confidence 2233333321 1222222 2222111 1100 01122221 Q ss_pred eEEEEEEeCCcEEEE----------eccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHH Q lcl|NC_020844. 189 DWERWRKNDAGIWEV----------DEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMM 258 (455) Q Consensus 189 ~~~~~~~~~~g~~~~----------~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~ 258 (455) .+|.+.+.+.|.. ...+..++...|++++++....++.+ |.+|-.+...-.......+-..-...+. T Consensus 204 --~v~p~~~~~~~~~~~~~~~~~~~~~~s~~g~~~~P~~~~Rw~~~~ge~Y-Grgp~~~~l~D~k~L~~l~~~~~~~~~~ 280 (522) T protein:vir:10 204 --YVKLDKSSGRWVWHQEAFDKIIPDSRSTAPKNASPWLPLRFNTVDGEDY-GRGRVEEFLGDLKSLDGLSQSLIEGAAA 280 (522) T ss_pred --EEEeeccCCceEEEEccCCccccccccccccccCCceeeeeeecCCCcc-ccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111212221211 12234577889999998887777666 7777776665555555555566777778 Q ss_pred cCCceEEEe--cCCCccccccCccceeeccceeeeccCCCCccccceeEEeeC-chhHHHHHHHHHHHHHHHHHhhhhhc Q lcl|NC_020844. 259 TAFPMLSAN--GVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPA-TESLKFLAEQIESVSKEIRELGRQPL 335 (455) Q Consensus 259 ~~~p~~~~~--G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~-~~~l~~~~~~l~~~~~~m~~~g~~~l 335 (455) +..|.+.+. |..... .+.-|....+ ++. ..+++..++.. +..+....+.++++++.|+..=..+. T Consensus 281 a~~p~~lv~~~~~~~~~-------~l~~~~~~~~-v~g----~~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl~~~ 348 (522) T protein:vir:10 281 ASKVVFLVSPSSTTKPA-------TIAKAGNGAI-VQG----RPEDVAVIQVGKTADFSTAANMATAIEKRLLEAFLVMN 348 (522) T ss_pred hcCCceeeccccccccc-------cccCCCCcce-ecC----CCccceeecccccccchHHHHHHHHHHHHHHHHHhhcc Confidence 888887663 222111 1111222222 221 12345555543 45677778888999888877533333 Q ss_pred cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHcCC---CCCce-EEEEecccCCCCCCHHHHH Q lcl|NC_020844. 336 TAQSGNLTRISAAQAASKGNSAVQQWAAGLKD-----ALENALYITNLWMDA---PDTNP-EADVYTDFRADTGEEQTPG 406 (455) Q Consensus 336 ~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~-----a~~~~l~~~a~~~g~---~~~~~-~v~v~~df~~~~~~~~~~~ 406 (455) ..++.+.||++...+.......|.-.-..++. -+++++.++.+ .|. .+.+. ...+ -.|...---++.++ T Consensus 349 ~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-~g~lP~~p~~~~~~~~-v~~is~Laraq~~~ 426 (522) T protein:vir:10 349 VRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTLLVLQR-SNQIPKLPKDIVRPTI-VAGVNALGRGQDRE 426 (522) T ss_pred CCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCCcccccccc-ccchhHHHHHHHHH Confidence 44456789999999998888877766665543 23444444433 232 12111 1111 12221111133333 Q ss_pred HHHHHHHc--CCCCHHHH---------HH-HHHhcCCCCC-CCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 407 YLLTMRQN--GDISREGL---------WM-EFQRLGELSD-NFDPEEESERLISELPGGIEE 455 (455) Q Consensus 407 al~~~~~~--G~is~et~---------~~-~l~~~~vl~~-~~d~e~e~~ri~~e~~~~l~~ 455 (455) .+....+. .++..+++ .+ .....||-.. -+..++|.+.++++..+..-. T Consensus 427 ~l~~~~~~i~~~~~p~~~~~~id~d~~~~~~a~~~Gvp~~~ivrt~eev~~~~q~~q~~~~~ 488 (522) T protein:vir:10 427 SLTAFVGTIAQTLGPEALMQYLNPLEAIKRLAAAQGIDVLNLVKTEQQLAEEQQAAQQQAAQ 488 (522) T ss_pred HHHHHHHHHHHhhCchhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHH Confidence 33332221 11112222 22 2233565322 222344434333332211111 No 113 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=98.57 E-value=2.6e-07 Score=56.73 Aligned_cols=390 Identities=9% Similarity=-0.015 Sum_probs=179.8 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCH-HHHHHHhhccccCChHHHHHHHHhhhhhcC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQN-KRYDLRKSVAKMTNVFRDVLEGLASKPFGR 79 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~-~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k 79 (455) |...| . | .-++++|.. +.+|-..+..-.- .-+..|.. ..+++++|+..+.-.+++ T Consensus 1 ~~~~D--~----~---------~n~~~gg~~-----~~~~~~~~~~~~~~~l~a~Y~~----~~l~~~~Vd~~aed~~r~ 56 (422) T protein:vir:10 1 MVKTD--S----Y---------ANIFLGGSD-----GSEIYGSLQNQAPTILASLYAD----NALVRRIIDTIPETALAA 56 (422) T ss_pred Cccch--h----h---------HHHHcCCCC-----CccccCcccccCHHHHHHHHHh----ChhhHHHHhhhhHHHhcC Confidence 65322 1 1 112334432 1223222222221 22233443 467888999999999999 Q ss_pred CceeccCCH--HHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhc Q lcl|NC_020844. 80 EVEVTEGTG--PSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAIL 157 (455) Q Consensus 80 ~p~i~~~~~--~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii 157 (455) ..+|+.+.+ .++.-++. ..+.+.++.+.+.+..||.+++++..........-.+ ..+...++..+++.+|. T Consensus 57 g~~i~~~~~~~~~~~~~~~-----l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~--~~g~~~~l~v~d~~~i~ 129 (422) T protein:vir:10 57 GFHIDGIDDEPAFWSRWDD-----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVR--EGAELETVRVYDRTQVK 129 (422) T ss_pred CccccCCCHHHHHHHHHHH-----hhHHHHHHHHHHhhccccceEEEEEecCCCCcccccc--ccCceeeEEeecccccc Confidence 999976543 33333332 3678888899999999999999997533221110000 01112233333333321 Q ss_pred cceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHH Q lcl|NC_020844. 158 EIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKD 237 (455) Q Consensus 158 ~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~d 237 (455) .-.. ..+...-.+..|..|++--...+..+.+|.+.-..+..-| +|..... ...+ -+.++|.. T Consensus 130 ~~~~--------------~~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~-~p~~~~~-~~~~-~G~S~l~~ 192 (422) T protein:vir:10 130 VQTR--------------EENPRNARFGEPLTYRITTNESDMFYDVHYSRIHIIDGER-IPNVMRR-QNDG-WGRSVLSS 192 (422) T ss_pred chhc--------------ccCccccccCcceEEEEecCCCCcceeeccceeEEeCCCC-chhhhcc-cCCc-ccchhHHH Confidence 1000 0000000011122222210000001122211111111111 0111111 1222 36788887 Q ss_pred HH-HHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccc-cccC--------ccceeeccceeeeccCCCCccccceeEEe Q lcl|NC_020844. 238 AL-DLQVALFRKECALEFAEMMTAFPMLSANGVEPDTD-SEGS--------PKPLDTGPDTVLYAPPNAEGQHGSWNFVE 307 (455) Q Consensus 238 la-~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~-~~~~--------~~~~~~g~~~~~~lp~~~~~~~~~~~~le 307 (455) ++ +.....-+++..-..+++...+.+..+.|+...-. .+.. ......|.+..+.+.. ++.++..+. T Consensus 193 ~~~~~i~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~----~~e~~e~~~ 268 (422) T protein:vir:10 193 DILDSIKDYTNCERLATQLLKRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDA----ESEEYSVLN 268 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHhcCCccchHHHHHHHHHHHHhcCCccceeEec----CCcceEEEe Confidence 54 44333334444557778888888888877542211 1110 0001112222333321 122465566 Q ss_pred eCchhHHHHHHHHHHHHHHHHHhhhhhccC---CC---cchhHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHc Q lcl|NC_020844. 308 PATESLKFLAEQIESVSKEIRELGRQPLTA---QS---GNLTRISAAQAASKGNSAVQQWAA-GLKDALENALYITNLWM 380 (455) Q Consensus 308 ~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~---~~---~~~sa~a~~~~~~~~~s~L~~~a~-~~~~a~~~~l~~~a~~~ 380 (455) .+.+++ .+.++...++|......|++. ++ -|.|+. .+...-+..+..+.. .+...++++++++.. T Consensus 269 ~~lsgl---~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glnatgd---~d~~~yyd~i~~~Qe~~l~p~l~~l~~~i~~-- 340 (422) T protein:vir:10 269 SDIGGI---DAFLDKKFDRIVALSGIHEIILKNKNVGGVSSSQN---TALETFHKLVDRKRNAELLPILEFLIPFIVN-- 340 (422) T ss_pred cccCCh---HHHHHHHHHHHHhhhCCCeeeeccCCcccccccch---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-- Confidence 555554 555666677777766655432 22 122332 233445555555554 467788888887643 Q ss_pred CCCCCceEEEEecccCCCCC-----CHHHHHHHHHHHHcCCCCHHHHHHHHHh----cCCCCCCCCHHHHHHHHHhhccc Q lcl|NC_020844. 381 DAPDTNPEADVYTDFRADTG-----EEQTPGYLLTMRQNGDISREGLWMEFQR----LGELSDNFDPEEESERLISELPG 451 (455) Q Consensus 381 g~~~~~~~v~v~~df~~~~~-----~~~~~~al~~~~~~G~is~et~~~~l~~----~~vl~~~~d~e~e~~ri~~e~~~ 451 (455) .++.+|++|.-..+... ....+++..++.++|+|+.+..++.|+. .++.+. ..+ ++++..+.+.++ T Consensus 341 ---s~~~~~~f~pL~~~sekekaei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~-~~~-~~~~~~~~~~~~ 415 (422) T protein:vir:10 341 ---AEEWSVEFNPLAQESSKDKAEILEKNVNSIAALIAAGAMDIDEARDTLRTIAPEVKINDG-SVE-TEVTISETSNDP 415 (422) T ss_pred ---cCCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHhhhhcccccCCCC-CCc-cccchhhcCCCC Confidence 23566766643322211 1133577788889999999999999976 333332 222 222222222222 Q ss_pred CCCC Q lcl|NC_020844. 452 GIEE 455 (455) Q Consensus 452 ~l~~ 455 (455) .-+. T Consensus 416 ~~~~ 419 (422) T protein:vir:10 416 LEVP 419 (422) T ss_pred CCCC Confidence 1111 No 114 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=98.57 E-value=2.6e-07 Score=56.72 Aligned_cols=420 Identities=12% Similarity=0.066 Sum_probs=197.5 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGR 79 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k 79 (455) |++ .+...+-.....+|+.++.--.= ...+++..+-.||..-..+...=..+.. -.|-+...+.++.++..+++- T Consensus 1 m~~---~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~-~~~dst~~~a~~~Laa~l~~~ 76 (536) T protein:vir:10 1 MAE---KRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQ-TPWQAVGARGLNNLASKLMLA 76 (536) T ss_pred Ccc---hhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccccccc-ccccccHHHHHHHHHHHHHhh Confidence 996 45566666777777776553211 4456677676777543221111111222 245555666666665555210 Q ss_pred --Cce----ec-cC---------C---HHHHHHHHhhc------cCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcch Q lcl|NC_020844. 80 --EVE----VT-EG---------T---GPSEDLLEDID------GRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFR 134 (455) Q Consensus 80 --~p~----i~-~~---------~---~~l~~~~~d~d------~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~ 134 (455) |+. +. .+ + ..++.|++.+. ...++++.-+..+.++...+|-+-++++-+..... T Consensus 77 ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~- 155 (536) T protein:vir:10 77 LFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNY- 155 (536) T ss_pred hcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCce- Confidence 210 00 00 0 12334443321 12356888888999999999999999874433221 Q ss_pred hhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEe------------e----------eeEEEEEe------ Q lcl|NC_020844. 135 NREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWE------------D----------AETILVLR------ 186 (455) Q Consensus 135 t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E------------~----------~e~~~v~~------ 186 (455) -++..|+-.++.-. .+..| .+.+.+|-.+ . .+.+.|++ T Consensus 156 -----------~~~~~~pl~~~~v~-~d~~G--~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~ 221 (536) T protein:vir:10 156 -----------NPMKLYRLSSYVVQ-RDAFG--NVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDE 221 (536) T ss_pred -----------eeEEEEEcCeEEEe-eCCCC--CeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEEec Confidence 13455554444321 12222 2333222110 0 01122221 Q ss_pred -cCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEE Q lcl|NC_020844. 187 -PDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLS 265 (455) Q Consensus 187 -~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~ 265 (455) .+.|.+|. .-.|...+.++|.+++...|++++++....++.+ +.+|..+...-.......+-..-.....+..|.+. T Consensus 222 ~~~~~~~~~-e~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~Y-Grgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~l 299 (536) T protein:vir:10 222 ASGEYLRYE-EVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESY-GRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGL 299 (536) T ss_pred CCCcEEEEE-eecCccccccccccccccCCceeeeeeecCCCcc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc Confidence 11233222 2223233344566678899999998888777776 66777666554444444444444444444444443 Q ss_pred E-ecCCCccccccCccc-eeeccceeeeccCCCCccccceeEEee-CchhHHHHHHHHHHHHHHHHHh-hhhhc-cCCCc Q lcl|NC_020844. 266 A-NGVEPDTDSEGSPKP-LDTGPDTVLYAPPNAEGQHGSWNFVEP-ATESLKFLAEQIESVSKEIREL-GRQPL-TAQSG 340 (455) Q Consensus 266 ~-~G~~~~~~~~~~~~~-~~~g~~~~~~lp~~~~~~~~~~~~le~-~~~~l~~~~~~l~~~~~~m~~~-g~~~l-~~~~~ 340 (455) + .+-. .++.. +..|++.+ +|.. .++...++. .+..+....+.|+++++.|... =..++ ..++. T Consensus 300 v~p~g~------~~~~~~~~~~~g~~--v~g~----~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~ 367 (536) T protein:vir:10 300 VNPAGI------TQPRRLTKAQTGDF--VTGR----PEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGE 367 (536) T ss_pred cCcccc------cchhhhccCCCcce--ecCC----cccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccCCCC Confidence 3 2111 11111 22233333 2211 123444443 3566777888899988888552 11222 34455 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHcCC----CCCceEEEEecccCCCCCCHHHHHHHHHH Q lcl|NC_020844. 341 NLTRISAAQAASKGNSAVQQWAAGLKDA-----LENALYITNLWMDA----PDTNPEADVYTDFRADTGEEQTPGYLLTM 411 (455) Q Consensus 341 ~~sa~a~~~~~~~~~s~L~~~a~~~~~a-----~~~~l~~~a~~~g~----~~~~~~v~v~~df~~~~~~~~~~~al~~~ 411 (455) ..||++.+.+.......|.-.-..++.- +++++.++-+ .|. +++.+.+++.. ....---.++++.++.. T Consensus 368 r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r-~g~lP~~p~~~v~~~~vs-~l~~l~r~~~~~~l~~~ 445 (536) T protein:vir:10 368 RVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQA-TQQIPELPKEAVEPTIST-GLEAIGRGQDLDKLERC 445 (536) T ss_pred CccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-CCCCCCCChhhccceEEe-cHHHHHHHHHHHHHHHH Confidence 6899999999888888777666665543 2233333312 232 22222333321 11100012333333332 Q ss_pred HH--cCC--------CCHHHHHHH-HHhcCCCC-CCCCHHHHHHHHHhhcccC--CCC Q lcl|NC_020844. 412 RQ--NGD--------ISREGLWME-FQRLGELS-DNFDPEEESERLISELPGG--IEE 455 (455) Q Consensus 412 ~~--~G~--------is~et~~~~-l~~~~vl~-~~~d~e~e~~ri~~e~~~~--l~~ 455 (455) .+ +++ |....+.+. ....||.+ .-+..++|.+.+.+|.... +.. T Consensus 446 ~~~la~~~P~~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~ 503 (536) T protein:vir:10 446 VTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDN 503 (536) T ss_pred HHHHHhhchhhhcccCCHHHHHHHHHHHcCCCchhhcCCHHHHHHHHHHHHHHHHHHH Confidence 22 121 222222222 23356633 2333455555555443221 111 No 115 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=98.56 E-value=2.8e-07 Score=56.55 Aligned_cols=417 Identities=12% Similarity=0.056 Sum_probs=195.0 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCC-CCCHHHHHHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFP-HEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~-~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |++-..+.... .....+|+.+++-..= ...+++..+-.||..- .++.. =..+.. -.|-+...+.++.++..+++ T Consensus 1 m~~~~~~~~~~--~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~-~~~~~~-~~~dst~~~a~~~Laa~l~~ 76 (535) T protein:vir:33 1 MADSKRTGLGE--DGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDN-ESTDYT-TPWQAVGARGLNNLASKLML 76 (535) T ss_pred CChhhhhccCh--hHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCc-cccccc-ccccccHHHHHHHHHHHHHH Confidence 99543221111 1234455555442211 4455666666667532 21111 011111 13455555666666555521 Q ss_pred C--Cce--ec-cC--------------CHHHHHHHHhhc------cCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcc Q lcl|NC_020844. 79 R--EVE--VT-EG--------------TGPSEDLLEDID------GRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQF 133 (455) Q Consensus 79 k--~p~--i~-~~--------------~~~l~~~~~d~d------~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~ 133 (455) - |++ +. .. ...++.|++.+. ...++++.-+..+.++.+.+|-+-++++-+..+. T Consensus 77 ~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~- 155 (535) T protein:vir:33 77 ALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSY- 155 (535) T ss_pred hhcCCCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCc- Confidence 0 211 00 00 112445544331 1245688888999999999999999987443221 Q ss_pred hhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEE-EEeee---------------------eEEEEEecCeEE Q lcl|NC_020844. 134 RNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMR-MWEDA---------------------ETILVLRPDDWE 191 (455) Q Consensus 134 ~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~-~~E~~---------------------e~~~v~~~~~~~ 191 (455) ..+..|+-.++.-. .+..| .+.+.+| ++=++ +.+.+++ . T Consensus 156 ------------~~f~~~pl~~~~v~-~d~~G--~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~----~ 216 (535) T protein:vir:33 156 ------------NPMKLYRLSSYVVQ-RDAYG--NVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYT----H 216 (535) T ss_pred ------------eeeEEEEcCeeEEe-eCCCC--CeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEE----E Confidence 12444554443321 22222 2222222 11000 0011110 1 Q ss_pred EEEEeCCcEEEE----------eccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCC Q lcl|NC_020844. 192 RWRKNDAGIWEV----------DEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAF 261 (455) Q Consensus 192 ~~~~~~~g~~~~----------~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~ 261 (455) +|+...++.|.. ...+..+++..|++++++....++.+ +.+|..+...-...+...+-..-...+.+.. T Consensus 217 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~Y-Grgp~~~~l~D~k~L~~l~~~~l~~~~~~~~ 295 (535) T protein:vir:33 217 VYLDEESGDYLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESY-GRSYCEEYLGDLRSLENLQEAIVKMSMISAK 295 (535) T ss_pred EEeeCCCCcEEEEEEEeCccccccccccccccCCceeeeeeecCCCcc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 122222222221 22334467889999998887777766 7777777666555666666667777788888 Q ss_pred ceEEEecCCCccccccCccce-eeccceeeeccCCCCccccceeEEeeC-chhHHHHHHHHHHHHHHHHHh--hhhhccC Q lcl|NC_020844. 262 PMLSANGVEPDTDSEGSPKPL-DTGPDTVLYAPPNAEGQHGSWNFVEPA-TESLKFLAEQIESVSKEIREL--GRQPLTA 337 (455) Q Consensus 262 p~~~~~G~~~~~~~~~~~~~~-~~g~~~~~~lp~~~~~~~~~~~~le~~-~~~l~~~~~~l~~~~~~m~~~--g~~~l~~ 337 (455) |.+.+.- +.-.++..+ .-|.+.+ +|. ..+++..+++. +..+....+.|+++++.|.+. ...+... T Consensus 296 p~~lv~~-----~g~~~~~~~~~~~~g~~--v~g----~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~ 364 (535) T protein:vir:33 296 VIGLVNP-----AGITQPRRLTKAQTGDF--VPG----RREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNSAVQR 364 (535) T ss_pred Cceeecc-----ccccchhhcccCCceee--ecC----CcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccC Confidence 8866531 011111111 1222222 221 12345555533 457888889999999998663 2222234 Q ss_pred CCcchhHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHcCC----CCCceEEEEecccCCCCCCHHHHHHH Q lcl|NC_020844. 338 QSGNLTRISAAQAASKGNSAVQQWAAGLKDA-----LENALYITNLWMDA----PDTNPEADVYTDFRADTGEEQTPGYL 408 (455) Q Consensus 338 ~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a-----~~~~l~~~a~~~g~----~~~~~~v~v~~df~~~~~~~~~~~al 408 (455) ++.+.||++.+.+.......|.-.-..++.- +++++.++-+ .|. +...+++++..-..... -.++++.+ T Consensus 365 ~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r-~g~lP~~p~~~v~~~yis~La~aq-r~~~~~~l 442 (535) T protein:vir:33 365 TGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQA-TSQIPELPKEAVEPTISTGLEAIG-RGQDLDKL 442 (535) T ss_pred CCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCCccceeEEEecHHHHHH-HHHHHHHH Confidence 4556899999999988888887766665542 2333333322 122 23333343322111100 11233333 Q ss_pred HHHHHc-CCCCHH---------HH-HHHHHhcCCCCC-CCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 409 LTMRQN-GDISRE---------GL-WMEFQRLGELSD-NFDPEEESERLISELPGGIEE 455 (455) Q Consensus 409 ~~~~~~-G~is~e---------t~-~~~l~~~~vl~~-~~d~e~e~~ri~~e~~~~l~~ 455 (455) ....+. ..+..+ .+ .......||-+. -+..++|.+.+.+|..+..-. T Consensus 443 ~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~ee~~~~~~q~~~~~~~ 501 (535) T protein:vir:33 443 ERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGV 501 (535) T ss_pred HHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHhHhcCCHHHHHHHHHHHHHHHHH Confidence 322221 112222 22 223344666322 233444555444442211111 No 116 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=98.55 E-value=3.1e-07 Score=56.30 Aligned_cols=435 Identities=10% Similarity=-0.019 Sum_probs=186.5 Q ss_pred CCCCC-CCcc--CHHHHHHHHHHHHHHHHhcCHH--HHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhh Q lcl|NC_020844. 1 MPEQD-PTKR--SADAEAMSDYLQTVDDVLEGVT--GMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASK 75 (455) Q Consensus 1 m~~~~-v~~~--~p~y~~~~~~w~~i~d~~~G~~--~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~ 75 (455) |+-.. .... +-...-....++.....+.|.- ..+++-+-|+=...+.. .+=...++.+.+..+|+.+... T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~~-----~~~~s~~~~~~v~~~v~~~~~~ 75 (705) T protein:vir:88 1 MAKRRKIKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGNE-----RPGKSGIVSRDVQETVDWIMPS 75 (705) T ss_pred CCcccccccCCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCcc-----cCCCCccccHHHHHHHHHHHHH Confidence 55321 1111 1111112222223333333331 22333344543322211 1113457788899899988887 Q ss_pred hhc----CCceec------cCC---HHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcch-------- Q lcl|NC_020844. 76 PFG----REVEVT------EGT---GPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFR-------- 134 (455) Q Consensus 76 lf~----k~p~i~------~~~---~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~-------- 134 (455) +.+ .+..+. ++. +.+..++.-+....+.....+..+++.+|.+|.+++=|.+....... T Consensus 76 l~~~~~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~~~e~~~~~~ 155 (705) T protein:vir:88 76 LMKVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFSGLS 155 (705) T ss_pred HHHhhcCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccchhhhhhccCC Confidence 742 232221 111 12333443333344566778889999999999998877553210000 Q ss_pred ------hhhhHH---------------------hccCCceEEEechhhhc-cce-eeccCCeeEEEEEEEE--ee----- Q lcl|NC_020844. 135 ------NREEER---------------------QAGVRPYWSQVNHSAIL-EIQ-SERQGAGERITYMRMW--ED----- 178 (455) Q Consensus 135 ------t~ad~~---------------------~~~~rPy~~~~~~~~ii-~w~-~~~~~~~~~l~~v~~~--E~----- 178 (455) ...|.. ....++-+..++|++++ +-. .+..+..+.. +..+. ++ T Consensus 156 ~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~a~~~~d~~~~~-~~~~~t~~dl~~~g 234 (705) T protein:vir:88 156 EDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLC-HREKYTVSDLRLLG 234 (705) T ss_pred hhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCCCCCcccCcEEE-EEEeccHHHHHhhc Confidence 000100 01134566667888775 111 0111112211 11110 00 Q ss_pred -----e--------------------------eEEEE---EecC---eE---EEEEE---eCCcE--EEE-eccCC---- Q lcl|NC_020844. 179 -----A--------------------------ETILV---LRPD---DW---ERWRK---NDAGI--WEV-DEFGR---- 208 (455) Q Consensus 179 -----~--------------------------e~~~v---~~~~---~~---~~~~~---~~~g~--~~~-~~~g~---- 208 (455) + .+..+ +... .+ +.|.. .++|. |.. .-.|. T Consensus 235 ~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~~~d~~~d~~~~~~~~~~~g~~il~ 314 (705) T protein:vir:88 235 VPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISELRRILYVGDYIIS 314 (705) T ss_pred CChhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeeeEecccCCcceeeEEEEEeCccccc Confidence 0 00000 0000 01 12211 22232 111 11122 Q ss_pred -CCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEe-cCCCccccccCccceeecc Q lcl|NC_020844. 209 -ITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSAN-GVEPDTDSEGSPKPLDTGP 286 (455) Q Consensus 209 -~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~-G~~~~~~~~~~~~~~~~g~ 286 (455) .+++.+||+.+...+....+ .+.+....+++++..+....+-+.++++.+..|...+. |.-.. ...+...+ T Consensus 315 ~~~~~~~PF~~~~~~p~~~~~-~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~g~v~~------~d~~~~~p 387 (705) T protein:vir:88 315 NEPWDCRPFADLNAYRIAHKF-HGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQVNL------EDLLTNEA 387 (705) T ss_pred cccCCCCCEEEecceeecCcc-ccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceeccccccCc------ccccccCC Confidence 26788999877555555444 46666777788887777777888888999999877663 21111 11223334 Q ss_pred ceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhh--ccC---C--CcchhHHHHHHHHHHHHHHHH Q lcl|NC_020844. 287 DTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQP--LTA---Q--SGNLTRISAAQAASKGNSAVQ 359 (455) Q Consensus 287 ~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~--l~~---~--~~~~sa~a~~~~~~~~~s~L~ 359 (455) +.++.+. .++.+.++.++.-+ ......++.+++.|..+++.. ..+ . .++.|+++.+.........+. T Consensus 388 g~vv~~~-----~~~~i~~~~~~~~~-~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~ 461 (705) T protein:vir:88 388 AGIVRVK-----SMNSITPLETPQLS-GEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQID 461 (705) T ss_pred CeeEEec-----CCCccccccCCcCc-HHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHHHHHHH Confidence 4444432 11246666544322 233444566666666654332 221 1 245788888887777788888 Q ss_pred HHHHHHHH-H----HHHHHHHHHHHcCCCCC----ceEEEEe-----cccC--------CCCCC--HHHHHHHHHHHHc- Q lcl|NC_020844. 360 QWAAGLKD-A----LENALYITNLWMDAPDT----NPEADVY-----TDFR--------ADTGE--EQTPGYLLTMRQN- 414 (455) Q Consensus 360 ~~a~~~~~-a----~~~~l~~~a~~~g~~~~----~~~v~v~-----~df~--------~~~~~--~~~~~al~~~~~~- 414 (455) .++.+|.+ . ++.++.++..|+..... +-.+.++ .+|. ..... .+.+..++++.+. T Consensus 462 ~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~~~~~~~eq~~a~l~~ll~~~q~l 541 (705) T protein:vir:88 462 LIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWEMAQAV 541 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeeccchhccchHhhccCCceEEeeccccchHHHHHHHHHHHHHHHHHh Confidence 88888753 3 46667777787653210 0011111 1111 11110 1122233332221 Q ss_pred ---C----CCCHHHHHHH----HHhcCCCCC------CCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 415 ---G----DISREGLWME----FQRLGELSD------NFDPEEESERLISELPGGIEE 455 (455) Q Consensus 415 ---G----~is~et~~~~----l~~~~vl~~------~~d~e~e~~ri~~e~~~~l~~ 455 (455) + .++...+.+. ++..++-.. -...+....+...+.. .+.. T Consensus 542 ~~~~~~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~-e~~~ 598 (705) T protein:vir:88 542 VGGGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQK-EAQP 598 (705) T ss_pred hcccchhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhh-hhhH Confidence 1 1111111111 111111000 0001111111111100 0000 No 117 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=98.55 E-value=3.1e-07 Score=56.26 Aligned_cols=417 Identities=12% Similarity=-0.001 Sum_probs=192.6 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCC---CCHHHHHHHhhccccCChHHHHHHHHhhhhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPH---EQNKRYDLRKSVAKMTNVFRDVLEGLASKPF 77 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~---e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf 77 (455) |.+.+-..-.-.|..+. .-|.-+ +..+++..+-.||.... .+...-..+. .-.|-+...+.++.++..++ T Consensus 1 m~~~~~~~l~~r~~~l~----~~R~~~--e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~-~~~~dst~~~a~~~Las~l~ 73 (559) T protein:vir:95 1 MAETTKERLNKQFAQLE----SERQSF--EPHWRELSDYINPRGSRFLTSEVNRNDRRN-TRIIDSTGTMAARTLASGMM 73 (559) T ss_pred CChhhHHHHHHHHHHHH----HHhhHH--HHHHHHHHHHhccccCCcCCCCCCcccccc-cccccchHHHHHHHHHHHHH Confidence 98654332222222222 222223 34455655666665432 1111111122 22466666777777766663 Q ss_pred cC--Cce-----e--cc----CCHHHHHHHHhhc------cCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhh Q lcl|NC_020844. 78 GR--EVE-----V--TE----GTGPSEDLLEDID------GRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREE 138 (455) Q Consensus 78 ~k--~p~-----i--~~----~~~~l~~~~~d~d------~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad 138 (455) +- ||. + .+ .....+.|++.+. ...+++..-+..+.++...+|-+-++++.+.... T Consensus 74 ~~ltpp~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~d~~~~------ 147 (559) T protein:vir:95 74 SGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLDDDEDI------ 147 (559) T ss_pred HhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeecCCCce------ Confidence 10 111 1 10 1123455544321 1235688888899999999999999987433222 Q ss_pred HHhccCCceEEEechhhhccceeeccCCeeEEEEEE-EEee--------------------------eeEEEEE----e- Q lcl|NC_020844. 139 ERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMR-MWED--------------------------AETILVL----R- 186 (455) Q Consensus 139 ~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~-~~E~--------------------------~e~~~v~----~- 186 (455) ..+..|+..++. ..+.....+.+.++ ++=+ .+.+.++ . T Consensus 148 -------~r~~~~~l~~~~---v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr 217 (559) T protein:vir:95 148 -------IRTMPFPIGSYY---LANSPRGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPN 217 (559) T ss_pred -------eEEEEeecCeEE---EeeCCCCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEEEecc Confidence 124445444433 22222222233222 1100 0112221 1 Q ss_pred ----cCe-------EE-EEEE-eCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcc-hHHHHHHHHHHHHHhHHHH Q lcl|NC_020844. 187 ----PDD-------WE-RWRK-NDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHP-PMKDALDLQVALFRKECAL 252 (455) Q Consensus 187 ----~~~-------~~-~~~~-~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~-pL~dla~l~~~hy~~~sd~ 252 (455) ++. |. +|-. ...+...+.++ ++...|++++++...+++.+ +.. |-.+...-...+...+-.. T Consensus 218 ~~~~~~~~~~~~~pf~s~~~e~~~~~~~~l~es---g~~e~P~~~~Rw~~~~ge~Y-Grg~P~~~al~d~k~L~~l~~~~ 293 (559) T protein:vir:95 218 IDRDTSKLDSKNKPFKSVYYEVGGDNDKLLRES---GFDEFPIMAPRWEVNGEDVY-GSSCPGMLALGPVKALQLLQKRK 293 (559) T ss_pred ccccccccccccceEEEEEEEecCCCceeeecC---CcccCCccceeeeecCCccc-cccchHHHhhHHHHHHHHHHHHH Confidence 000 00 1111 11222222222 34667888887777776665 444 5555444444444455556 Q ss_pred HHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHh-h Q lcl|NC_020844. 253 EFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIREL-G 331 (455) Q Consensus 253 ~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~-g 331 (455) ..+.+.+..|.+.+.+ ++....+.+.++..+..+.........+.+ +.++ ++......|+++++.|... - T Consensus 294 l~~~~~~~~pp~~v~~-------~~~~~~~~l~pgg~~~~~~~~~~~~i~p~~-~~~~-~~~~~~~~i~~~~~rI~~af~ 364 (559) T protein:vir:95 294 SQLIDKATNPPMVAPT-------SLKNQRASLLPGDITYIDQITGQDGFRPAY-LVNP-STADLVADIQDTRQIINSAYF 364 (559) T ss_pred HHHHHHHhcCceeccc-------cccccceeeeccceeeeCCCCCcccceeec-cccc-chHHHHHHHHHHHHHHHHHhh Confidence 7777888888666542 123335666677666654322111112222 1232 3455566678888888553 1 Q ss_pred hh---h-ccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHcCC---CCCceE-EEEecccCCC Q lcl|NC_020844. 332 RQ---P-LTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDA-----LENALYITNLWMDA---PDTNPE-ADVYTDFRAD 398 (455) Q Consensus 332 ~~---~-l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a-----~~~~l~~~a~~~g~---~~~~~~-v~v~~df~~~ 398 (455) .. + ...++.+.||++...+.......|.-.-..++.- +++++.++-+. |. .+.... -.++-+|.. T Consensus 365 ~d~~~~l~~r~~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~-g~lP~~p~~l~~~~i~v~~is- 442 (559) T protein:vir:95 365 VDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRK-NMLPPPPDVMEGMPLKVEYIS- 442 (559) T ss_pred hhhHHHhhcCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCcccccCcceEEEeec- Confidence 11 2 2344566799999999888877777766666442 34444444332 32 111110 112222322 Q ss_pred CCC-HH---HHHHHHH---HHHc--CC-------CCHHH-HHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 399 TGE-EQ---TPGYLLT---MRQN--GD-------ISREG-LWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 399 ~~~-~~---~~~al~~---~~~~--G~-------is~et-~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) .+. ++ .++++.. .... ++ |.... +.......||-...+-.++|.+.+.+|....--. T Consensus 443 ~La~aqk~~~~~~i~~~~~~~~~laq~~Pevld~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~rqqr~~~qq~ 516 (559) T protein:vir:95 443 VMAQAQKSIGLSSLASTVNFIGQLAQVKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQ 516 (559) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccChhhhhcCCHHHHHHHHHHHhCCchhhcCCHHHHHHHHHHHHHHHHH Confidence 121 11 1111111 1111 11 22222 2334455777655556677767665553321111 No 118 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=98.54 E-value=3.2e-07 Score=56.21 Aligned_cols=416 Identities=10% Similarity=-0.011 Sum_probs=194.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCCC-CCHHHH--HHHhhccccCChHHHHHHHHhhhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFPH-EQNKRY--DLRKSVAKMTNVFRDVLEGLASKP 76 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~~-e~~~~Y--~~rl~ra~~~n~~~~~v~~~~g~l 76 (455) |.+.+.. ....+|+.+..-..= +..+++..+-.||.... ..+... ..|.. -.|-+...+.++.++..+ T Consensus 1 m~~~~~~-------~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~-~~~dst~~~a~~~Las~l 72 (556) T protein:vir:73 1 MAETEKE-------RLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNT-KIVDPTGSMAQRILSSGM 72 (556) T ss_pred CChhhHH-------HHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcchhhcC-ccccchHHHHHHHHHHHH Confidence 8854332 333344433322111 44556666666774421 121111 12222 256666677777776666 Q ss_pred hcC--Cce-----e--cc----CCHHHHHHHHhhc------cCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhh Q lcl|NC_020844. 77 FGR--EVE-----V--TE----GTGPSEDLLEDID------GRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNRE 137 (455) Q Consensus 77 f~k--~p~-----i--~~----~~~~l~~~~~d~d------~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~a 137 (455) ++- ||. + .+ .....+.|++.+. ...+++..-+..+.++...+|-+-++++.+..+.. T Consensus 73 ~~~ltpp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~---- 148 (556) T protein:vir:73 73 MSGITSPARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVMEDDQDVI---- 148 (556) T ss_pred HHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeeecCCceE---- Confidence 311 111 1 10 1123445554422 12356888888999999999999999985543322 Q ss_pred hHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEe-e--------------------------eeEEEEE----e Q lcl|NC_020844. 138 EERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWE-D--------------------------AETILVL----R 186 (455) Q Consensus 138 d~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E-~--------------------------~e~~~v~----~ 186 (455) .+..|+..++. ..+.....+.+.+|..+ + .+.+.++ . T Consensus 149 ---------r~~~~~l~~~~---~~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~V~p 216 (556) T protein:vir:73 149 ---------RTMPFPIGSYY---LANSPRGSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHCITP 216 (556) T ss_pred ---------EEEEeecceeE---EeeCCCCCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEEEec Confidence 13334443332 12212112222222100 0 0112221 1 Q ss_pred -----cCe-------EE-EE-EEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcc-hHHHHHHHHHHHHHhHHH Q lcl|NC_020844. 187 -----PDD-------WE-RW-RKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHP-PMKDALDLQVALFRKECA 251 (455) Q Consensus 187 -----~~~-------~~-~~-~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~-pL~dla~l~~~hy~~~sd 251 (455) ++. |. +| ....++...+.++ ++...|++++++....++.+ +.. |-.+...-...+...+-. T Consensus 217 r~~~~~~~~~~~~~p~~s~~~~~~~~~~~vl~es---g~~e~P~~~~Rw~~~~ge~Y-Grg~P~~~~lgD~k~L~~l~~~ 292 (556) T protein:vir:73 217 NVNRDSGKMDSKNKPYRSVYFESGGDSDKLLRES---GFDEFPILAPRWEVNGEDVY-ASSCPGMLALGQVKALQVEQKR 292 (556) T ss_pred cccccccccCcccceEEEEEEEecCCCceecccC---CcccCCceeeeeeecCCccc-ccCccHHHhHHHHHHHHHHHHH Confidence 000 10 11 1111222222222 35667888888777766665 544 555554444444445555 Q ss_pred HHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhh Q lcl|NC_020844. 252 LEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELG 331 (455) Q Consensus 252 ~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g 331 (455) .-...+.+..|.+.+.. +++...+.+.++.++..+.... ...+.-+.....++....+.|+++++.|...= T Consensus 293 ~l~~~~~~~~pp~~v~~-------~~~~~~~~~~pgg~~~~~~~~~--~~~i~p~~~~~~d~~~~~~~i~~~~~rI~~af 363 (556) T protein:vir:73 293 KAQLIDKATNPPMVAPT-------SLKNQRVSLLPGDVTYLDVISG--QDGFKPAYLVNPNTADLLADIQDTRQTINSAY 363 (556) T ss_pred HHHHHHHHhcCceeccc-------cccccceeeccCccccccCCCC--ccceeeeccccccHHHHHHHHHHHHHHHHHHh Confidence 67777888888776542 1233345666665554432221 12344333222346777777888888886531 Q ss_pred -h---hh-ccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHcCC---CCCce-EEEEecccCC Q lcl|NC_020844. 332 -R---QP-LTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDA-----LENALYITNLWMDA---PDTNP-EADVYTDFRA 397 (455) Q Consensus 332 -~---~~-l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a-----~~~~l~~~a~~~g~---~~~~~-~v~v~~df~~ 397 (455) . .+ ...++.+.||++.+.+.......|.-.-..++.- +++++.++-+ .|. .+... .-.|+-+|.. T Consensus 364 ~~d~~~~l~~~~~~r~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r-~g~lP~~P~~l~~~~i~v~yis 442 (556) T protein:vir:73 364 FVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMAR-KNMLPEPPDVLQGMPLRIEYIS 442 (556) T ss_pred hcchhhhhccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCchhhcCceeEEEeec Confidence 1 12 2334556899999999888888877766666442 3344444433 232 11111 1122223322 Q ss_pred CCCC----HHHHHHHHH---HHHc--CC-------CCH-HHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccC-------- Q lcl|NC_020844. 398 DTGE----EQTPGYLLT---MRQN--GD-------ISR-EGLWMEFQRLGELSDNFDPEEESERLISELPGG-------- 452 (455) Q Consensus 398 ~~~~----~~~~~al~~---~~~~--G~-------is~-et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~-------- 452 (455) . +. ...+.++.. .... ++ |.. +.+.......||-...+-.++|.+.+.++.... T Consensus 443 ~-La~aqk~~~~~~i~~~~~~~~~laq~~Pe~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~r~~~qq~~~~~~ 521 (556) T protein:vir:73 443 V-MAQAQKSIGLTSLSQTVGFIGQLAQFKPEALDKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIREERAKQAQAAQAMA 521 (556) T ss_pred H-HHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCChhhcCCHHHHHHHHHHHHHHHHHHHHHH Confidence 1 21 111111111 1111 11 222 233344555777665566677766665442211 Q ss_pred -----------CCC Q lcl|NC_020844. 453 -----------IEE 455 (455) Q Consensus 453 -----------l~~ 455 (455) |.+ T Consensus 522 ~~~~a~~~~~~~~~ 535 (556) T protein:vir:73 522 MGQAAAQGAKTLSE 535 (556) T ss_pred HHHHHHHHHHHhhh Confidence 000 No 119 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=98.54 E-value=1.9e-07 Score=57.45 Aligned_cols=401 Identities=10% Similarity=0.010 Sum_probs=181.6 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGRE 80 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~ 80 (455) |+.+.-.-+. ... +.+.+.|.........-+.+... -..-+..|.. ..+++++|+..+.-.+++. T Consensus 5 m~~~~~~~~~------~D~---~~~~~~~~~g~~~~~~~~~~~~~--~~~l~~~Y~~----~~l~~~~Vd~~aed~~r~g 69 (435) T protein:vir:79 5 MSDKVKAITK------EDG---YNEIFGSKDGTFRPNAFYMQRAA--FKALSQFYEE----DGMARRIVDVIPEEMVTPG 69 (435) T ss_pred cccccccchh------hcc---hhhhhcccccccccCcccCCcCC--HHHHHHHHhc----CchhhhhhccchHHhhcCC Confidence 7755210000 011 11222322111111111111111 1122233443 4678889999999999999 Q ss_pred ceeccCC--HHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhcc Q lcl|NC_020844. 81 VEVTEGT--GPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILE 158 (455) Q Consensus 81 p~i~~~~--~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~ 158 (455) ..|+... +.++..++.. .+.+.++.+.+.+..+|.++|++..........-.+ ..+...++..+++.+|.. T Consensus 70 ~~i~g~~~~~~~~~~~~~l-----~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~--~~g~i~~i~v~d~~~i~~ 142 (435) T protein:vir:79 70 FKVDGVKNEKSFKSRWDEL-----RLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVK--PGAQLEDIRVYDRYQITI 142 (435) T ss_pred ceecCCChHHHHHHHHHHh-----hHHHHHHHHHHhhhccccEEEEEEecCCCCcccccc--cCCceeeEEeechhhccc Confidence 9997532 3345444432 577888899999999999999997543322111000 012233455555544432 Q ss_pred ceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHH- Q lcl|NC_020844. 159 IQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKD- 237 (455) Q Consensus 159 w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~d- 237 (455) -... .+ ...-.+..|..|++--..+.....+|.+--..+..-|+ |... .....+ -+.+||.. T Consensus 143 ~~~~-~d-------------p~sp~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~-p~~~-~~~~~~-~G~S~l~e~ 205 (435) T protein:vir:79 143 HERE-TN-------------ARSVRYGEPKLYKISPGGDIPEFFVHYSRICIIDGERV-SNEK-RRQNDG-WGASILNKR 205 (435) T ss_pred hhhc-cC-------------CcccccCcceEEEEecCCCCCceEEcceeEEEecCCcc-hhhh-ccccCc-ccchHHHHH Confidence 0000 00 00000112222222100111122222222111211111 1111 112233 36677754 Q ss_pred HHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccc-cCcc--------ceeeccceeeeccCCCCccccceeEEee Q lcl|NC_020844. 238 ALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSE-GSPK--------PLDTGPDTVLYAPPNAEGQHGSWNFVEP 308 (455) Q Consensus 238 la~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~-~~~~--------~~~~g~~~~~~lp~~~~~~~~~~~~le~ 308 (455) +.+.....-++...-.++++...++++.++|+...-..+ .... ...-+.+..+.+.. ++.++..+.. T Consensus 206 ~~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~i~~----~~e~~e~~~~ 281 (435) T protein:vir:79 206 LIEAIVDYNYCQELATQLLRRKQQAVWKARDLALMCDDEEGRYAARLRLAQVDDESGVGKAIGIDA----TDEEYEVLNS 281 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCccccchhHHHhhcCccchHHHHHHHHHHHHhcCCCCceeEec----CCcceEEEec Confidence 445444444445556777888888888887754221111 1100 01122233333321 1224555554 Q ss_pred CchhHHHHHHHHHHHHHHHHHhhhhhccC---CC---cchhHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHcC Q lcl|NC_020844. 309 ATESLKFLAEQIESVSKEIRELGRQPLTA---QS---GNLTRISAAQAASKGNSAVQQWAA-GLKDALENALYITNLWMD 381 (455) Q Consensus 309 ~~~~l~~~~~~l~~~~~~m~~~g~~~l~~---~~---~~~sa~a~~~~~~~~~s~L~~~a~-~~~~a~~~~l~~~a~~~g 381 (455) +-++ +.+.++...++|......|+.. ++ -|.|+.. +...-+..++.+.. .+...++++++++.. T Consensus 282 ~lsg---l~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~---d~~~yyd~i~~~Qe~~l~p~l~~l~~li~~--- 352 (435) T protein:vir:79 282 DVSG---VPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNT---ALETFYKLIDRKRVEDYKPILEFLLPFMIS--- 352 (435) T ss_pred ccCC---HHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhhc--- Confidence 4444 4556677777777766655432 22 1333332 33334444444443 355667777776532 Q ss_pred CCCCceEEEEecccCCCCC-----CHHHHHHHHHHHHcCCCCHHHHHHHHHh----cCCCCCCCCH--HHHHHHHHhhcc Q lcl|NC_020844. 382 APDTNPEADVYTDFRADTG-----EEQTPGYLLTMRQNGDISREGLWMEFQR----LGELSDNFDP--EEESERLISELP 450 (455) Q Consensus 382 ~~~~~~~v~v~~df~~~~~-----~~~~~~al~~~~~~G~is~et~~~~l~~----~~vl~~~~d~--e~e~~ri~~e~~ 450 (455) . .+.+|+++.=+.+..- ....+++..++.+.|+|+.++.++.|+. .|+.+...+. +.+..--+.+.+ T Consensus 353 -s-~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~d~~~~~~~e 430 (435) T protein:vir:79 353 -E-TEWSIEFEPLSVPSDKDKAEIMAKNVESVVKLKAEQAINLKETRDTLRSICPDLKIMDNDNIELPEPEDLDPEPGQE 430 (435) T ss_pred -C-CCCeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhccccCCCCcccccCCccccCCCCCCCC Confidence 1 3566665532211111 1233567778889999999999998864 2332222111 111111122344 Q ss_pred cCCCC Q lcl|NC_020844. 451 GGIEE 455 (455) Q Consensus 451 ~~l~~ 455 (455) +|.|+ T Consensus 431 ~g~~~ 435 (435) T protein:vir:79 431 GGLNK 435 (435) T ss_pred CCCCC Confidence 56666 No 120 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=98.52 E-value=3.7e-07 Score=55.86 Aligned_cols=432 Identities=13% Similarity=0.074 Sum_probs=179.7 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhh---h--hcCC--CCCCCCHHHHHHHhh---c-cccCChHHHHH Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNF---K--RYVK--QFPHEQNKRYDLRKS---V-AKMTNVFRDVL 69 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g---~--~yLp--k~~~e~~~~Y~~rl~---r-a~~~n~~~~~v 69 (455) |++.+- ..|-. .+.+++.++......|+.. + .|.. +++.+..+..+.|-+ | ...+|.++.+| T Consensus 1 ma~~~~-~~~~~------~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~i 73 (708) T protein:vir:17 1 MAETLE-KKHER------IMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATEL 73 (708) T ss_pred CchhHH-HHHHH------HHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcchHHHH Confidence 996533 23322 2333444444444433322 1 1322 233333334443332 1 34589999999 Q ss_pred HHHhhhhhcCCceec------c----CCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEE--eeCCCCcchhhh Q lcl|NC_020844. 70 EGLASKPFGREVEVT------E----GTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRV--DYPAGQQFRNRE 137 (455) Q Consensus 70 ~~~~g~lf~k~p~i~------~----~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lV--d~p~~~~~~t~a 137 (455) +..+|+==...+.+. . ..+.+..++..+-. -++.+.-+..++..++.+|++|+=| ||-....+... T Consensus 74 ~~v~g~e~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~-~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~d~~~~- 151 (708) T protein:vir:17 74 NRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYE-ETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDD- 151 (708) T ss_pred HHHHhhHhhCCcceEEecCCCcchHHHHHHHHHHHHHHHH-hcCchhHHhHHHHHhhhcccceeeeeecccccCCCCCC- Confidence 999999655444332 1 12234445544432 3567888899999999999999855 22111000000 Q ss_pred hHHhccCCceEEEe-chhhh---------------------------------------------ccceeeccCC-eeEE Q lcl|NC_020844. 138 EERQAGVRPYWSQV-NHSAI---------------------------------------------LEIQSERQGA-GERI 170 (455) Q Consensus 138 d~~~~~~rPy~~~~-~~~~i---------------------------------------------i~w~~~~~~~-~~~l 170 (455) . .+. |+...+ ++.+| -+|.++..+. +..+ T Consensus 152 -~--~~i-~i~~~~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~~~~d~vrv 227 (708) T protein:vir:17 152 -R--QRI-AIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDWFDADVIYI 227 (708) T ss_pred -c--ccc-ceEeeccchhheecCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccccCCCeEEE Confidence 0 000 011111 11222 1121111111 1111 Q ss_pred EE--EEEEeeeeEEEEEec--CeEEEEE-------------------------------EeCCcEEEEeccCCCCCCcee Q lcl|NC_020844. 171 TY--MRMWEDAETILVLRP--DDWERWR-------------------------------KNDAGIWEVDEFGRITIGVVP 215 (455) Q Consensus 171 ~~--v~~~E~~e~~~v~~~--~~~~~~~-------------------------------~~~~g~~~~~~~g~~~l~~vP 215 (455) +. .+..+....+.+-++ +.+..|. ..-.|...+....+.+.+.+| T Consensus 228 ~e~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~~~~~p~~~fP 307 (708) T protein:vir:17 228 AKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIP 307 (708) T ss_pred EEEEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccCCCCCCCCccc Confidence 10 111111111111111 1222221 111121122233456778899 Q ss_pred EEEEEeccc--CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEE-----ecCCCccccccCccce-----e Q lcl|NC_020844. 216 MVPFITGRR--KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSA-----NGVEPDTDSEGSPKPL-----D 283 (455) Q Consensus 216 ~V~f~~~~~--~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~-----~G~~~~~~~~~~~~~~-----~ 283 (455) +|||+.... ++.. ....-+-++.+....++...|-+.+.+..++.-++++ .|+...+......+.. . T Consensus 308 ~vP~~g~r~~~d~~~-~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~~~~~ 386 (708) T protein:vir:17 308 LIPVYGKRWFIDDIE-RVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLRE 386 (708) T ss_pred eEEEecccccccCCC-cccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhhhhhhhc Confidence 999864322 1111 0011123444555445545555555554444433322 2444333322222110 0 Q ss_pred eccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhh--hhccCCCcchhHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 284 TGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGR--QPLTAQSGNLTRISAAQAASKGNSAVQQW 361 (455) Q Consensus 284 ~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~--~~l~~~~~~~sa~a~~~~~~~~~s~L~~~ 361 (455) +.....+..+ ++ .....+++.. --.++.+.++.....|..+++ ....+.+.|.||.++..+..+....+..+ T Consensus 387 ~~~~~g~v~~-~a----~~~~~~~~~~-~~~~~~~llq~~~~~i~~~tGi~d~~~G~~sn~SG~Ai~~rq~qg~~~~~~~ 460 (708) T protein:vir:17 387 VRDKYGNIIA-GA----TPAGYTQPAV-MNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIY 460 (708) T ss_pred cCCccccccc-cc----CCcccCCCcc-ccHHHHHHHHHHHHHHHHhcCCChHHccCccchHHHHHHHHHHHHHHHHHHH Confidence 1111111111 11 1222333322 223455667777777766643 22334455789999988887776666666 Q ss_pred HHHHHHH----HHHHHHHHHHHcCC---------CCCceEEEEec---------------------ccCC------CCCC Q lcl|NC_020844. 362 AAGLKDA----LENALYITNLWMDA---------PDTNPEADVYT---------------------DFRA------DTGE 401 (455) Q Consensus 362 a~~~~~a----~~~~l~~~a~~~g~---------~~~~~~v~v~~---------------------df~~------~~~~ 401 (455) -.|+..+ .+.+|.++.++++. +.....+.+|. |+.. .... T Consensus 461 ~Dnl~~~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r 540 (708) T protein:vir:17 461 LDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARR 540 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCchhHH Confidence 6666555 45567778888743 11111222221 1111 1112 Q ss_pred HHHHHHHHHHHHcCCCCH-HH--HHHHHHhcCCCCCCCCHHHHHHHHHhhccc-CCCC Q lcl|NC_020844. 402 EQTPGYLLTMRQNGDISR-EG--LWMEFQRLGELSDNFDPEEESERLISELPG-GIEE 455 (455) Q Consensus 402 ~~~~~al~~~~~~G~is~-et--~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~-~l~~ 455 (455) .+..++++++.....-.. .+ +...+- ..+ +.-..++-.++|..+.+. +..+ T Consensus 541 ~~~~~~l~qll~~~~~~~~~~~~~~~l~l--~~~-D~p~~~ei~e~ir~~~~~~~~~~ 595 (708) T protein:vir:17 541 DATVSVLTNVLSSMLPADPMRPAIQGIIL--DNI-DGEGLDDFKEYNRNQLLISGIAK 595 (708) T ss_pred HHHHHHHHHHHHhcCCccchhHHHHHHHH--Hhc-CCCChHHHHHHHHHHhhcccccc Confidence 344556666655432110 01 111110 010 111123444444433221 1111 No 121 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=98.52 E-value=3.8e-07 Score=55.80 Aligned_cols=381 Identities=8% Similarity=-0.068 Sum_probs=188.6 Q ss_pred ccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCC------CCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCCc Q lcl|NC_020844. 8 KRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFP------HEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGREV 81 (455) Q Consensus 8 ~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~------~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~p 81 (455) -++|+......--..++|...+- .. ..++|..+ +-+-..|+.... -.++...++.....+.+.++ T Consensus 1 v~~~~l~~e~at~~~~~d~~~~~----~~-~l~~~~~~il~~a~~g~~~~y~~l~~----D~~i~s~l~~rk~av~~~~w 71 (488) T protein:vir:99 1 MEKPALGREIATSGDGRDITRPF----IS-GLQVPNDSILQRRGGNDLRVYEEILS----DAQVKTVWGQRQLAVVSREW 71 (488) T ss_pred CCccchhHHHHHHHhhhhhhccc----cC-CCCCCChHHHHhhccCCHHHHHHHhh----ChHHHHHHHHHHHHHhcCCc Confidence 33444333222112223332210 00 11233211 112355666543 56788888888888888888 Q ss_pred eecc--CCH---HHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhh Q lcl|NC_020844. 82 EVTE--GTG---PSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAI 156 (455) Q Consensus 82 ~i~~--~~~---~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~i 156 (455) .|.. +++ ...+++... ..+.+++.+++.+. .++-||.+.+=+- T Consensus 72 ~i~p~~~~~~~~~~ae~v~~~-l~~~~~~~~l~~~l-da~~~G~s~~Ei~------------------------------ 119 (488) T protein:vir:99 72 KVEAGGDRPIDQAAAEHLEQQ-LQRVGWDRVTSKML-FGVFYGYAVSELI------------------------------ 119 (488) T ss_pred eEEcCCCChHHHHHHHHHHHH-HhCCCHHHHHHHHH-hhhhhcceeEEEE------------------------------ Confidence 8841 111 122333222 12347999998887 5788887765432 Q ss_pred ccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeE--EEEEecccCCCcccCcch Q lcl|NC_020844. 157 LEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPM--VPFITGRRKGKKWVFHPP 234 (455) Q Consensus 157 i~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~--V~f~~~~~~~~~~~~~~p 234 (455) |+. .+|.+.+..+..+.. ..+.+...+..++......+ ....+. .|+ +..+..+.. +-..+.+. T Consensus 120 --w~~--~~g~~~~~~l~~r~~-~~f~~d~~~~l~~~~~~~~~-------~g~~lp-~~~~~i~~~~~~~~-g~p~g~gL 185 (488) T protein:vir:99 120 --YGR--DDRYITLEAIKVRNR-RRFRYDQDGGLRLLTPNNMF-------EGEPCP-APYFWHFSTGADND-DEPYGLGL 185 (488) T ss_pred --Eee--cCCeeeEeeeeeecc-cceeecCCCceEEeccCCCC-------Cccccc-cCceEEEEeecCCC-CCcccchH Confidence 221 244444444333221 11111111111111011000 000110 121 211223332 33345666 Q ss_pred HHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCC-Ccccccc---CccceeeccceeeeccCCCCccccceeEEeeCc Q lcl|NC_020844. 235 MKDALDLQVALFRKECALEFAEMMTAFPMLSANGVE-PDTDSEG---SPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPAT 310 (455) Q Consensus 235 L~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~-~~~~~~~---~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~ 310 (455) |..++..-+--.....+...-++.-+.|+++.+--. ..+.++. ......+|.+.+..+|.+. .++|++..+ T Consensus 186 l~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~~~ek~~l~~av~~~~~~~~~viP~~~-----~ie~~ea~~ 260 (488) T protein:vir:99 186 AHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTATPEDKAKLLAALHAIQTDSAIIMPAGM-----QAELLEAGR 260 (488) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCCHHHHHHHHHHHHHHhcCcEEEecCCc-----eeEEeecCC Confidence 666666544333456777777888899999887211 1111111 1223457888999999765 699999877 Q ss_pred hhHHHHHHHHHHHHHHHHHh--hhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHcCCCCCce Q lcl|NC_020844. 311 ESLKFLAEQIESVSKEIREL--GRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALE-NALYITNLWMDAPDTNP 387 (455) Q Consensus 311 ~~l~~~~~~l~~~~~~m~~~--g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~-~~l~~~a~~~g~~~~~~ 387 (455) .+-..++..++...++|..+ |..+ +...+.-|-.............+...+..+.++++ +++++++.|-.-..... T Consensus 261 ~~~~~~~~li~~~d~~Isk~iLGqtl-ts~~~~Gs~a~~~vh~~v~~d~~~aDa~~i~~tln~~li~~l~~~N~~~~~~p 339 (488) T protein:vir:99 261 SGTADYKTLHDTMDATIAKVGLGQVA-STQGTPGRLGNDDLQADVRLDLVKADADLICESFNLGPARWLTEWNFPGAQPP 339 (488) T ss_pred CChHHHHHHHHHHHHHHHHHHhhhhh-cccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcCCcCCc Confidence 77677888999999998664 5443 33322222223334445556677888888999996 58898888754222112 Q ss_pred EEEEecccCCCCCCHHHHHHHHHHHHc-CC-CCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 388 EADVYTDFRADTGEEQTPGYLLTMRQN-GD-ISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 388 ~v~v~~df~~~~~~~~~~~al~~~~~~-G~-is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) .+.+ ++.........++.+.++.+. |. |+.+-+. ++.|+-.+... ++...--.....+.-++ T Consensus 340 ~~~~--~~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~---e~~Gip~~~~~-~~~~~~~~~~~~~~~~~ 403 (488) T protein:vir:99 340 RVYR--VIEEPEDITAKAERDEKVFRMSGFRPTRGYVQ---ETYGVEVESTQ-AEATAPTPSTEFAEGDQ 403 (488) T ss_pred eeEe--cCCCcccHHHHHHHHHHHHhhcCCCCCHHHHH---HHcCCCCcccc-cccccCCCcccCCCCCC Confidence 2332 333222224557778888875 76 6765444 34566433221 11111000000011010 No 122 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=98.51 E-value=3.9e-07 Score=55.72 Aligned_cols=417 Identities=12% Similarity=0.028 Sum_probs=197.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCC---CCCHHHHHHHhhccccCChHHHHHHHHhhhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFP---HEQNKRYDLRKSVAKMTNVFRDVLEGLASKP 76 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~---~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~l 76 (455) |.+... + .....+|+.++.--.= +..+++..+-.||... ..+...=..|.. -.|-+...+.++.++..+ T Consensus 1 M~~~~~---~---~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~-~~~dst~~~a~~~LAa~L 73 (555) T protein:vir:10 1 MAEQTE---R---KLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHN-NILDNTGTRALRVLAAGM 73 (555) T ss_pred CCCccc---H---HHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhccc-ccccccHHHHHHHHHHHH Confidence 986532 2 2333444444332211 4456666666777621 111111122332 256666777777776666 Q ss_pred h-----------cCCceecc--CCHHHHHHHHhhc------cCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhh Q lcl|NC_020844. 77 F-----------GREVEVTE--GTGPSEDLLEDID------GRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNRE 137 (455) Q Consensus 77 f-----------~k~p~i~~--~~~~l~~~~~d~d------~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~a 137 (455) . +-.+.-.. .....+.|++.+. ...+++..-+..+.++...+|-+-++++.+..+.. T Consensus 74 ~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~---- 149 (555) T protein:vir:10 74 MAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVV---- 149 (555) T ss_pred HHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceE---- Confidence 3 11111000 1123555655421 13467888888999999999999999874433221 Q ss_pred hHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEe-e-------------------------ee-EEEEEecCeE Q lcl|NC_020844. 138 EERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWE-D-------------------------AE-TILVLRPDDW 190 (455) Q Consensus 138 d~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E-~-------------------------~e-~~~v~~~~~~ 190 (455) .+..|+..+.. ..+...-.+.+.+|-.+ + .+ .+.|++ T Consensus 150 ---------rf~~~pl~~~~---v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~---- 213 (555) T protein:vir:10 150 ---------YHHSLTAGEYA---IAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIH---- 213 (555) T ss_pred ---------EEEEeecceeE---EeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEE---- Confidence 24445544433 12222222233222100 0 01 122211 Q ss_pred EEEEEeC---Cc---------EEEEecc--C-----CCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHH Q lcl|NC_020844. 191 ERWRKND---AG---------IWEVDEF--G-----RITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECA 251 (455) Q Consensus 191 ~~~~~~~---~g---------~~~~~~~--g-----~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd 251 (455) -+|-+.+ ++ .|....+ + .-++...|++++++....++.+ +..|-.+...-.......+-. T Consensus 214 ~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~Y-Grgp~~~~lgD~k~L~~l~~~ 292 (555) T protein:vir:10 214 AIEPRADRDPSKRDDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIY-GNSPAMEALGDVRQLQHEQLR 292 (555) T ss_pred EEeeccCcCcCCCCccccceEEEEEEeccCCccccccCCcccCCceeeeeeecCCCcc-ccchHHHHHHHHHHHHHHHHH Confidence 0110000 00 1222111 1 1135678999888887777666 666766555544444444444 Q ss_pred HHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhh Q lcl|NC_020844. 252 LEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELG 331 (455) Q Consensus 252 ~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g 331 (455) .-.+++.+..|.+.+.- ++...++.+.++....++.+..+....+ .++. +..+....+.|+++++.|...= T Consensus 293 ~l~~~~~~~~pp~~v~~-------~~~~~~~~~~pgg~~~v~~g~~~d~~~~-~~~~-~~d~~~~~~~i~~~~~rI~~af 363 (555) T protein:vir:10 293 KAQAIDYKSNPPLQLPV-------SAKNQDISTVPGGLSYVDAAAPNGGIRT-AFEV-NLDLSHLLADIVDVRERIKASF 363 (555) T ss_pred HHHHHHHHhcCceeecc-------ccccccceeccccccccccCCCCcceec-cccc-ccchHHHHHHHHHHHHHHHHHh Confidence 56667777777666531 1222234555555544433221111111 2222 2357777888999999886643 Q ss_pred h----hhc-cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHcCC-C--CCce-EEEEecccCC Q lcl|NC_020844. 332 R----QPL-TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDA-----LENALYITNLWMDA-P--DTNP-EADVYTDFRA 397 (455) Q Consensus 332 ~----~~l-~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a-----~~~~l~~~a~~~g~-~--~~~~-~v~v~~df~~ 397 (455) . .++ ..++.+.||++...+.......|.-.-..++.- +++++.++-+ .|. + +... ...|+-+|.. T Consensus 364 ~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r-~g~lP~~P~~l~~~~i~v~yis 442 (555) T protein:vir:10 364 YADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVE-ANILPPPPQEMQGVDLNVEFVS 442 (555) T ss_pred hcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCchhhcCceeEEEecc Confidence 2 122 234556899999999888888877766665442 3334444433 232 1 1111 1122223332 Q ss_pred CCC---CHHHHHH---HHHHHH--cCC-------CCHH-HHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 398 DTG---EEQTPGY---LLTMRQ--NGD-------ISRE-GLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 398 ~~~---~~~~~~a---l~~~~~--~G~-------is~e-t~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) .-- ....+.+ +++... +++ |... .+.......||-...+..++|.+.+.+|+...--. T Consensus 443 ~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~ 516 (555) T protein:vir:10 443 MLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQA 516 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHH Confidence 110 1111212 111111 121 2222 22334455777666666677777666653321111 No 123 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=98.51 E-value=3.9e-07 Score=55.72 Aligned_cols=417 Identities=12% Similarity=0.028 Sum_probs=197.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCC---CCCHHHHHHHhhccccCChHHHHHHHHhhhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFP---HEQNKRYDLRKSVAKMTNVFRDVLEGLASKP 76 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~---~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~l 76 (455) |.+... + .....+|+.++.--.= +..+++..+-.||... ..+...=..|.. -.|-+...+.++.++..+ T Consensus 1 M~~~~~---~---~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~-~~~dst~~~a~~~LAa~L 73 (555) T protein:vir:98 1 MAEQTE---R---KLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHN-NILDNTGTRALRVLAAGM 73 (555) T ss_pred CCCccc---H---HHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhccc-ccccccHHHHHHHHHHHH Confidence 986532 2 2333444444332211 4456666666777621 111111122332 256666777777776666 Q ss_pred h-----------cCCceecc--CCHHHHHHHHhhc------cCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhh Q lcl|NC_020844. 77 F-----------GREVEVTE--GTGPSEDLLEDID------GRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNRE 137 (455) Q Consensus 77 f-----------~k~p~i~~--~~~~l~~~~~d~d------~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~a 137 (455) . +-.+.-.. .....+.|++.+. ...+++..-+..+.++...+|-+-++++.+..+.. T Consensus 74 ~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~---- 149 (555) T protein:vir:98 74 MAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVV---- 149 (555) T ss_pred HHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceE---- Confidence 3 11111000 1123555655421 13467888888999999999999999874433221 Q ss_pred hHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEe-e-------------------------ee-EEEEEecCeE Q lcl|NC_020844. 138 EERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWE-D-------------------------AE-TILVLRPDDW 190 (455) Q Consensus 138 d~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E-~-------------------------~e-~~~v~~~~~~ 190 (455) .+..|+..+.. ..+...-.+.+.+|-.+ + .+ .+.|++ T Consensus 150 ---------rf~~~pl~~~~---v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~---- 213 (555) T protein:vir:98 150 ---------YHHSLTAGEYA---IAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIH---- 213 (555) T ss_pred ---------EEEEeecceeE---EeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEE---- Confidence 24445544433 12222222233222100 0 01 122211 Q ss_pred EEEEEeC---Cc---------EEEEecc--C-----CCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHH Q lcl|NC_020844. 191 ERWRKND---AG---------IWEVDEF--G-----RITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECA 251 (455) Q Consensus 191 ~~~~~~~---~g---------~~~~~~~--g-----~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd 251 (455) -+|-+.+ ++ .|....+ + .-++...|++++++....++.+ +..|-.+...-.......+-. T Consensus 214 ~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~Y-Grgp~~~~lgD~k~L~~l~~~ 292 (555) T protein:vir:98 214 AIEPRADRDPSKRDDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIY-GNSPAMEALGDVRQLQHEQLR 292 (555) T ss_pred EEeeccCcCcCCCCccccceEEEEEEeccCCccccccCCcccCCceeeeeeecCCCcc-ccchHHHHHHHHHHHHHHHHH Confidence 0110000 00 1222111 1 1135678999888887777666 666766555544444444444 Q ss_pred HHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhh Q lcl|NC_020844. 252 LEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELG 331 (455) Q Consensus 252 ~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g 331 (455) .-.+++.+..|.+.+.- ++...++.+.++....++.+..+....+ .++. +..+....+.|+++++.|...= T Consensus 293 ~l~~~~~~~~pp~~v~~-------~~~~~~~~~~pgg~~~v~~g~~~d~~~~-~~~~-~~d~~~~~~~i~~~~~rI~~af 363 (555) T protein:vir:98 293 KAQAIDYKSNPPLQLPV-------SAKNQDISTVPGGLSYVDAAAPNGGIRT-AFEV-NLDLSHLLADIVDVRERIKASF 363 (555) T ss_pred HHHHHHHHhcCceeecc-------ccccccceeccccccccccCCCCcceec-cccc-ccchHHHHHHHHHHHHHHHHHh Confidence 56667777777666531 1222234555555544433221111111 2222 2357777888999999886643 Q ss_pred h----hhc-cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHcCC-C--CCce-EEEEecccCC Q lcl|NC_020844. 332 R----QPL-TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDA-----LENALYITNLWMDA-P--DTNP-EADVYTDFRA 397 (455) Q Consensus 332 ~----~~l-~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a-----~~~~l~~~a~~~g~-~--~~~~-~v~v~~df~~ 397 (455) . .++ ..++.+.||++...+.......|.-.-..++.- +++++.++-+ .|. + +... ...|+-+|.. T Consensus 364 ~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r-~g~lP~~P~~l~~~~i~v~yis 442 (555) T protein:vir:98 364 YADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVE-ANILPPPPQEMQGVDLNVEFVS 442 (555) T ss_pred hcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCchhhcCceeEEEecc Confidence 2 122 234556899999999888888877766665442 3334444433 232 1 1111 1122223332 Q ss_pred CCC---CHHHHHH---HHHHHH--cCC-------CCHH-HHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 398 DTG---EEQTPGY---LLTMRQ--NGD-------ISRE-GLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 398 ~~~---~~~~~~a---l~~~~~--~G~-------is~e-t~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) .-- ....+.+ +++... +++ |... .+.......||-...+..++|.+.+.+|+...--. T Consensus 443 ~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~ 516 (555) T protein:vir:98 443 MLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQA 516 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHH Confidence 110 1111212 111111 121 2222 22334455777666666677777666653321111 No 124 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=98.51 E-value=3.9e-07 Score=55.72 Aligned_cols=417 Identities=12% Similarity=0.028 Sum_probs=197.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCC---CCCHHHHHHHhhccccCChHHHHHHHHhhhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFP---HEQNKRYDLRKSVAKMTNVFRDVLEGLASKP 76 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~---~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~l 76 (455) |.+... + .....+|+.++.--.= +..+++..+-.||... ..+...=..|.. -.|-+...+.++.++..+ T Consensus 1 M~~~~~---~---~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~-~~~dst~~~a~~~LAa~L 73 (555) T protein:vir:10 1 MAEQTE---R---KLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHN-NILDNTGTRALRVLAAGM 73 (555) T ss_pred CCCccc---H---HHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhccc-ccccccHHHHHHHHHHHH Confidence 986532 2 2333444444332211 4456666666777621 111111122332 256666777777776666 Q ss_pred h-----------cCCceecc--CCHHHHHHHHhhc------cCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhh Q lcl|NC_020844. 77 F-----------GREVEVTE--GTGPSEDLLEDID------GRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNRE 137 (455) Q Consensus 77 f-----------~k~p~i~~--~~~~l~~~~~d~d------~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~a 137 (455) . +-.+.-.. .....+.|++.+. ...+++..-+..+.++...+|-+-++++.+..+.. T Consensus 74 ~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~---- 149 (555) T protein:vir:10 74 MAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVV---- 149 (555) T ss_pred HHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceE---- Confidence 3 11111000 1123555655421 13467888888999999999999999874433221 Q ss_pred hHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEe-e-------------------------ee-EEEEEecCeE Q lcl|NC_020844. 138 EERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWE-D-------------------------AE-TILVLRPDDW 190 (455) Q Consensus 138 d~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E-~-------------------------~e-~~~v~~~~~~ 190 (455) .+..|+..+.. ..+...-.+.+.+|-.+ + .+ .+.|++ T Consensus 150 ---------rf~~~pl~~~~---v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~---- 213 (555) T protein:vir:10 150 ---------YHHSLTAGEYA---IAADNQGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIH---- 213 (555) T ss_pred ---------EEEEeecceeE---EeeCCCCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEE---- Confidence 24445544433 12222222233222100 0 01 122211 Q ss_pred EEEEEeC---Cc---------EEEEecc--C-----CCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHH Q lcl|NC_020844. 191 ERWRKND---AG---------IWEVDEF--G-----RITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECA 251 (455) Q Consensus 191 ~~~~~~~---~g---------~~~~~~~--g-----~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd 251 (455) -+|-+.+ ++ .|....+ + .-++...|++++++....++.+ +..|-.+...-.......+-. T Consensus 214 ~V~pr~~~~~~~~~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~Y-Grgp~~~~lgD~k~L~~l~~~ 292 (555) T protein:vir:10 214 AIEPRADRDPSKRDDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIY-GNSPAMEALGDVRQLQHEQLR 292 (555) T ss_pred EEeeccCcCcCCCCccccceEEEEEEeccCCccccccCCcccCCceeeeeeecCCCcc-ccchHHHHHHHHHHHHHHHHH Confidence 0110000 00 1222111 1 1135678999888887777666 666766555544444444444 Q ss_pred HHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhh Q lcl|NC_020844. 252 LEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELG 331 (455) Q Consensus 252 ~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g 331 (455) .-.+++.+..|.+.+.- ++...++.+.++....++.+..+....+ .++. +..+....+.|+++++.|...= T Consensus 293 ~l~~~~~~~~pp~~v~~-------~~~~~~~~~~pgg~~~v~~g~~~d~~~~-~~~~-~~d~~~~~~~i~~~~~rI~~af 363 (555) T protein:vir:10 293 KAQAIDYKSNPPLQLPV-------SAKNQDISTVPGGLSYVDAAAPNGGIRT-AFEV-NLDLSHLLADIVDVRERIKASF 363 (555) T ss_pred HHHHHHHHhcCceeecc-------ccccccceeccccccccccCCCCcceec-cccc-ccchHHHHHHHHHHHHHHHHHh Confidence 56667777777666531 1222234555555544433221111111 2222 2357777888999999886643 Q ss_pred h----hhc-cCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHcCC-C--CCce-EEEEecccCC Q lcl|NC_020844. 332 R----QPL-TAQSGNLTRISAAQAASKGNSAVQQWAAGLKDA-----LENALYITNLWMDA-P--DTNP-EADVYTDFRA 397 (455) Q Consensus 332 ~----~~l-~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a-----~~~~l~~~a~~~g~-~--~~~~-~v~v~~df~~ 397 (455) . .++ ..++.+.||++...+.......|.-.-..++.- +++++.++-+ .|. + +... ...|+-+|.. T Consensus 364 ~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r-~g~lP~~P~~l~~~~i~v~yis 442 (555) T protein:vir:10 364 YADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVE-ANILPPPPQEMQGVDLNVEFVS 442 (555) T ss_pred hcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCchhhcCceeEEEecc Confidence 2 122 234556899999999888888877766665442 3334444433 232 1 1111 1122223332 Q ss_pred CCC---CHHHHHH---HHHHHH--cCC-------CCHH-HHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 398 DTG---EEQTPGY---LLTMRQ--NGD-------ISRE-GLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 398 ~~~---~~~~~~a---l~~~~~--~G~-------is~e-t~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) .-- ....+.+ +++... +++ |... .+.......||-...+..++|.+.+.+|+...--. T Consensus 443 ~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~ 516 (555) T protein:vir:10 443 MLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQA 516 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHH Confidence 110 1111212 111111 121 2222 22334455777666666677777666653321111 No 125 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=98.51 E-value=4.2e-07 Score=55.57 Aligned_cols=406 Identities=9% Similarity=-0.014 Sum_probs=176.8 Q ss_pred CCCCCC--------CccCHHHHHHHHH---HHHHHHHhcCHHHHHHhhhhcCCC-CCCCCHHHHHHHhhccccCChHHHH Q lcl|NC_020844. 1 MPEQDP--------TKRSADAEAMSDY---LQTVDDVLEGVTGMRRNFKRYVKQ-FPHEQNKRYDLRKSVAKMTNVFRDV 68 (455) Q Consensus 1 m~~~~v--------~~~~p~y~~~~~~---w~~i~d~~~G~~~~r~~g~~yLpk-~~~e~~~~Y~~rl~ra~~~n~~~~~ 68 (455) |+...+ +.-||........ ..+. .+.|+..-.....-|.+. +.+ .+-+..|.+ ...++++ T Consensus 48 ~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~--~~l~a~Y~~----~~l~r~i 119 (537) T protein:vir:10 48 MAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFS--AYANPNLSEGLVLWYAQQAFIG--HQMCALIAT----HWLVNKA 119 (537) T ss_pred CCCCCccCcccccccccccchhccccccchhhhh--hhccccccchhhhhccccCCcc--HHHHHHHHh----Cchhhhh Confidence 221111 1112221100000 0000 011111111111112221 121 122233333 4788999 Q ss_pred HHHHhhhhhcCCceeccCC------HHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhh----- Q lcl|NC_020844. 69 LEGLASKPFGREVEVTEGT------GPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNRE----- 137 (455) Q Consensus 69 v~~~~g~lf~k~p~i~~~~------~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~a----- 137 (455) |+..+.-.++++..++... +..+.|..-.+ ...+.+.++.+.+.+-.||.+++++.-..... .... T Consensus 120 Vd~~A~d~~r~~~~i~~~~~~~~~~~~~~~l~~~~~--~l~~~~~l~~a~~~~rlyG~~~i~i~v~~~D~-~~~~~Pl~~ 196 (537) T protein:vir:10 120 CSQMPRDAMRKGYKIISDDGNELDPKDAKFIDRYDR--AFNIKKHAIQFVRKGRIFGIRIALFKVDSPDP-YYYEKPFNI 196 (537) T ss_pred hhhhhHHhhcCCceeecCCcccccHHHHHHHHHHHH--HhhHHHHHHHHHHhcccccceEEEEeecCcCC-ccccccccc Confidence 9999999999999886422 22222222111 23577888888888888999888875432111 0000 Q ss_pred hHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEE Q lcl|NC_020844. 138 EERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMV 217 (455) Q Consensus 138 d~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V 217 (455) +-...+..-++..++|..+-.-..+.. .++...-.++.|..|++ . | .--|+--.|.|+ T Consensus 197 ~~i~kg~~k~l~vidp~~~~~~~~~~~-----------~~dp~sp~fg~P~~y~v---~--g------~~iH~SRli~f~ 254 (537) T protein:vir:10 197 DGVMPGAYKGIVQIDPYWCAPLLDAQA-----------SSNPVSMHFYEPTYWLI---N--G------KKYHRSHLAIYI 254 (537) T ss_pred ccccccceeEEEEechhhcccccchhh-----------hccCCccccCCceeeee---c--C------eEecceeEEEec Confidence 000000001222222222110000000 00000000111222221 1 1 111221112111 Q ss_pred ----EEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccc-----eeeccce Q lcl|NC_020844. 218 ----PFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKP-----LDTGPDT 288 (455) Q Consensus 218 ----~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~-----~~~g~~~ 288 (455) |-+..+. .. ..+.|.|..+.+.....-++...-..+++....+++.+.|.....+.+...+- ..-+... T Consensus 255 g~~~p~~~~~~-~~-~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~~~~~~~r~~~~~~~r~n~g 332 (537) T protein:vir:10 255 NDEVVDFLKPS-YI-YGGVPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLANKQQFDETMSWWTATRDNYQ 332 (537) T ss_pred CCCCchhhhcc-cC-cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcCHHHHHHHHHHHHhhcCCcc Confidence 1111111 11 23677677776665555555556677888888888888775432222111000 0012233 Q ss_pred eeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhcc---CCC---cchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 289 VLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLT---AQS---GNLTRISAAQAASKGNSAVQQWA 362 (455) Q Consensus 289 ~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~---~~~---~~~sa~a~~~~~~~~~s~L~~~a 362 (455) ++.++.+. -++..+..+-++ +.+.++...++|......|+. +++ -+.||. .+...-+..+..+. T Consensus 333 ~~~id~e~----e~~e~~~~~lsg---l~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe---~D~~~yyd~I~~~Q 402 (537) T protein:vir:10 333 VRVVDKDN----EDVVQIDTTLND---LDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTGD---YEEASYHEECESTQ 402 (537) T ss_pred eeEecCCC----ceeEEEeccCCC---HHHHHHHHHHHHHhhhCCCceeeccCCccccccchh---HHHHHHHHHHHHHH Confidence 45554321 234444444444 455666666677666555543 222 123333 34445566666666 Q ss_pred HHHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHH--------HHHHHHHHHHcCCCCHHHHHHHHHhc----- Q lcl|NC_020844. 363 AGLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQ--------TPGYLLTMRQNGDISREGLWMEFQRL----- 429 (455) Q Consensus 363 ~~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~--------~~~al~~~~~~G~is~et~~~~l~~~----- 429 (455) ..+.-.++.+++++..-....+.+.+|+++. .. .+++. .+++..++++.|+|+.+.+++.|+.- T Consensus 403 e~l~p~l~~l~~ll~~~~~~~~~~~~i~f~p--L~-~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~ 479 (537) T protein:vir:10 403 DDMRPLIDRHHQLVCRSHLRKRIRVKVEFPP--MD-APKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGF 479 (537) T ss_pred HHHHHHHHHHHHHHHHhcCCCCcceEEEeCC--CC-CCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCcccc Confidence 6778888888887765443333334443331 11 12221 23668888999999999999999863 Q ss_pred -CCCCCCCCHHHHHH-HHHhhcc-cCCCC Q lcl|NC_020844. 430 -GELSDNFDPEEESE-RLISELP-GGIEE 455 (455) Q Consensus 430 -~vl~~~~d~e~e~~-ri~~e~~-~~l~~ 455 (455) ++ .+..+.+++.+ .+..|.. ....+ T Consensus 480 ~~l-~~~~~~ed~e~~~~~~~~~~~~~~~ 507 (537) T protein:vir:10 480 TSI-TPAMRPTDAEDIDVDDEGKPVRIIE 507 (537) T ss_pred ccc-cCCCChhhhhcccCCccCCcCCCCC Confidence 33 23334333322 2222211 11111 No 126 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=98.48 E-value=4.8e-07 Score=55.23 Aligned_cols=416 Identities=11% Similarity=0.006 Sum_probs=194.7 Q ss_pred CCCCC--CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCC-HH---HHHH-Hhhcc--c--cCChHHHHH Q lcl|NC_020844. 1 MPEQD--PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQ-NK---RYDL-RKSVA--K--MTNVFRDVL 69 (455) Q Consensus 1 m~~~~--v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~-~~---~Y~~-rl~ra--~--~~n~~~~~v 69 (455) |.--| |. |.+...........-.|.|...-|. ...+..+.+.-+ ++ .+.. ...|| . =.++.+..| T Consensus 8 ~~~~dr~i~---~~~~~~~~~~~~~~~~y~aa~~~r~-~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av 83 (505) T protein:vir:96 8 PSLAQRMVN---WAWYRYVEPQKNAARAFEAARRDRL-GKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYAKRFY 83 (505) T ss_pred cchhhcccc---hhhhhhHHHHHHhhhhcccccCCCc-cccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHH Confidence 33222 33 3333222222333334544322121 122332222211 11 1111 11111 1 256778888 Q ss_pred HHHhhhhhc-CCceeccC--------C----HH----HHHHHH--hhccCC-CCHHHHHHHHHHHHHhhCcEEEEEeeCC Q lcl|NC_020844. 70 EGLASKPFG-REVEVTEG--------T----GP----SEDLLE--DIDGRG-NNLTTFAGQLFFQGVADSISWIRVDYPA 129 (455) Q Consensus 70 ~~~~g~lf~-k~p~i~~~--------~----~~----l~~~~~--d~d~~G-~~l~~~~~~~~~~al~~G~~~~lVd~p~ 129 (455) +.++..+.+ ..+++... + .. ++.|.+ ++|-.| .+++++.+.+.+..+..|-|++..-+.. T Consensus 84 ~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~ 163 (505) T protein:vir:96 84 QLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVLVREHRGY 163 (505) T ss_pred HHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceEEEEeecC Confidence 888888876 34433210 1 12 334433 356555 4899999999999999999988764432 Q ss_pred CCcchhhhhHHhccCCce-EEEechhhhccceeec-cCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccC Q lcl|NC_020844. 130 GQQFRNREEERQAGVRPY-WSQVNHSAILEIQSER-QGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFG 207 (455) Q Consensus 130 ~~~~~t~ad~~~~~~rPy-~~~~~~~~ii~w~~~~-~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g 207 (455) .. ..|+ +.+++|+.|=+..... .++. .+.. .++.=..-.+-.|.+++...+..+...... T Consensus 164 ~~------------~~~~~lqliepd~l~~~~n~~~~~~~-~i~~-----GIe~d~~Gr~~aY~i~~~hPgd~~~~~~~~ 225 (505) T protein:vir:96 164 PN------------KWGYALQILECDRLDLNYNADLQNGN-RIRM-----SIELDAWERPVAYHLLVNHPGDNSYCYHYA 225 (505) T ss_pred CC------------CcceEEEEechhhcCCCCCcccCCcC-eEEe-----ceEECCCCceEEEEEeecCCCccccccccc Confidence 21 1242 6788898886543221 1111 1111 111000011223344443322211111112 Q ss_pred CCCCCceeE---EEEEecccCCCcccCcchHHHHHHHHHHHHHh-HHHHHHHHHHcCCceEEEecCC----CccccccCc Q lcl|NC_020844. 208 RITIGVVPM---VPFITGRRKGKKWVFHPPMKDALDLQVALFRK-ECALEFAEMMTAFPMLSANGVE----PDTDSEGSP 279 (455) Q Consensus 208 ~~~l~~vP~---V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~-~sd~~~~l~~~~~p~~~~~G~~----~~~~~~~~~ 279 (455) ...+.+||- +-++ .+...+-.-+.|.|..++..-..+-.. .+.+..+ ..++.-..+|+.-. ....+.... T Consensus 226 ~~~~~rvpa~~vlH~f-~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a-~i~A~~a~fi~~~~~~~~~~~~~~~~~ 303 (505) T protein:vir:96 226 GQTYERVPADEIIHTF-VPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAA-ELGAKKVGFYEQDPEAYDQPPEDDQGE 303 (505) T ss_pred cccccccCHhHhhhhh-cccCCccccCcchHHHHHHHHHHHhHHHHHHHHHH-HHhhhheeeeecCCccCCCccccccCc Confidence 234556662 2222 334444455777777766543322221 2333333 33444445555321 112222233 Q ss_pred cceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHH-hh--hhhccCC--Ccc-hhHHHHHHHHHH Q lcl|NC_020844. 280 KPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRE-LG--RQPLTAQ--SGN-LTRISAAQAASK 353 (455) Q Consensus 280 ~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~-~g--~~~l~~~--~~~-~sa~a~~~~~~~ 353 (455) ....++++++..|+.+ -+++|++++-.+ ..+...++.+.++|.. +| ...+.+. ..| .|+-+...++.. T Consensus 304 ~~~~l~pG~i~~L~pG-----e~i~~~~~~~p~-~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nYSS~R~~~~e~~r 377 (505) T protein:vir:96 304 IVEEVEAGTYQLLPYG-----IRFKEHKIDHPH-TNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNFSSLRSGELDERD 377 (505) T ss_pred cccccCCceeeecCCC-----CeeeeeCCCCCC-CCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHH Confidence 4567889999888743 379999876321 2344455555555533 22 2334443 234 444444444444 Q ss_pred HHHHHHHH-HHH-HHHHHHHHHHHHHHHcCCCC---CceEEEEecccCCC----CCCHHHHHHHHHHHHcCCCCHHHHHH Q lcl|NC_020844. 354 GNSAVQQW-AAG-LKDALENALYITNLWMDAPD---TNPEADVYTDFRAD----TGEEQTPGYLLTMRQNGDISREGLWM 424 (455) Q Consensus 354 ~~s~L~~~-a~~-~~~a~~~~l~~~a~~~g~~~---~~~~v~v~~df~~~----~~~~~~~~al~~~~~~G~is~et~~~ 424 (455) ....++.. +.. |+-.++..|+ .|...|.-+ .....-++-.|... ..+..++++...++.+|+.|.+.... T Consensus 378 ~~~~~q~~~~~~~~~pi~~~~l~-~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a 456 (505) T protein:vir:96 378 LYKLLQFFVVTELLERVAGNLIS-MSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNRTRSRSSIIR 456 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHH Confidence 44433322 111 2223333332 233344311 11000012223222 23468899999999999999997655 Q ss_pred HHHhcCCCCCCCCHHHHHHHHHhhcc----cCCCC Q lcl|NC_020844. 425 EFQRLGELSDNFDPEEESERLISELP----GGIEE 455 (455) Q Consensus 425 ~l~~~~vl~~~~d~e~e~~ri~~e~~----~~l~~ 455 (455) + + ..||++.++.|+.|.. -||.. T Consensus 457 ~---~-----G~D~~~v~~q~a~e~~~~~~~Gl~~ 483 (505) T protein:vir:96 457 A---A-----GDDPEDVFDEIAWEEQLMRDKGVNP 483 (505) T ss_pred H---c-----CCCHHHHHHHHHHHHHHHHHcCCCC Confidence 4 2 3478888887777743 34432 No 127 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=98.48 E-value=4.9e-07 Score=55.16 Aligned_cols=435 Identities=13% Similarity=0.053 Sum_probs=203.5 Q ss_pred CCCCCC-----CccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQDP-----TKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~v-----~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g 74 (455) |+..+. -.+|-.-.-.+..|+...+...= ...+++-.. |+-...-.....-+...+..+|.|.+...++.++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~~-y~~a~~~~~~~~~~~~~r~~~~~~k~~~~~~~i~~ 79 (584) T protein:vir:95 1 MSVKVAELNSLLVRDSSAQWVAYLWDRFNNQRRQKIEEWKELRN-YVFATDTTTTSNQGLPWKNSTTLPKLCQIRDNLHS 79 (584) T ss_pred CCcchhhhhhhccccchHHHHHHHHHHHHhhhchhhccCHHHHH-HHHhhhhhhhhhcccccccccchhHHHHHHHHHHH Confidence 776542 22344333445566666554422 111222111 11111111111112233456688888888887777 Q ss_pred hhhc----CC------ceeccCC-----HHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhH Q lcl|NC_020844. 75 KPFG----RE------VEVTEGT-----GPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEE 139 (455) Q Consensus 75 ~lf~----k~------p~i~~~~-----~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~ 139 (455) +++. .. +.+.++. +.++.+.+|==-+ .++..-++.+++.++.+|-|.+-|..-..-.-....+. T Consensus 80 ~l~~~~Fp~~~w~~~v~~~~~~~~~~~~~ai~~~i~dkl~e-~~~~~~~~~~i~d~~~~G~~~~k~~~~~~~~e~~e~~~ 158 (584) T protein:vir:95 80 NYFSSLFPNDDWLRWVGYGKGDSTKTKAKAIQAYMSNKCRE-SHFRTEVSKLIYDYIDYGNAFATVSFEAKYKEMTDGTL 158 (584) T ss_pred HHHHhhcCccceeeeecCCCchhhHHHHHHHHHHHhhhhhh-ccHHHHHHHHHHhhccCCceEEEEeEeecceeeecccc Confidence 7643 22 1111111 2244444321112 26888889999999999999998864322110011112 Q ss_pred HhccCCceEEEechhhhccceeec--cC-CeeEEEEEEE-E--------e---------------------------eee Q lcl|NC_020844. 140 RQAGVRPYWSQVNHSAILEIQSER--QG-AGERITYMRM-W--------E---------------------------DAE 180 (455) Q Consensus 140 ~~~~~rPy~~~~~~~~ii~w~~~~--~~-~~~~l~~v~~-~--------E---------------------------~~e 180 (455) .....+|.+..++|.+++ |.-.. ++ +...+ ..++ + + .++ T Consensus 159 v~~~~~prieriSP~d~~-~Dpsa~~i~d~~fiv-rs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~~~~~~~~~~ 236 (584) T protein:vir:95 159 VPDYIGPRLVRISPLDIV-FNPLATSISDTFKIV-RSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHLGGYSVEDFD 236 (584) T ss_pred ccccccceEEeeChhhee-ecCCCCCccchhhhh-hhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCCCCCcccccc Confidence 223457999999999887 33211 11 11011 0000 0 0 000 Q ss_pred ------------EEEEEecCeEEEEEEeC-------Cc--EEE---------E--eccCCCCCCceeEEEEEecccCCCc Q lcl|NC_020844. 181 ------------TILVLRPDDWERWRKND-------AG--IWE---------V--DEFGRITIGVVPMVPFITGRRKGKK 228 (455) Q Consensus 181 ------------~~~v~~~~~~~~~~~~~-------~g--~~~---------~--~~~g~~~l~~vP~V~f~~~~~~~~~ 228 (455) .+..+.++.|+++.+.+ ++ .+. + .+..+++.+.+||+-+.+-+..-++ T Consensus 237 ~~~~~~~d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g~~iIR~~~np~~~~~~PF~~~~~~p~~~s~ 316 (584) T protein:vir:95 237 KAAGFDVDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVVDRSTEVRNESIPTWFGSAPIYHVGWRFRPDNL 316 (584) T ss_pred cccccccccccccccccCCceeEEEeecccccccccCCCcccceEEEEeccEEEEeeecCCCCCCCCEEEEcceeeeccc Confidence 11122233344443211 11 111 1 1234567799999876555544343 Q ss_pred ccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEee Q lcl|NC_020844. 229 WVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEP 308 (455) Q Consensus 229 ~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~ 308 (455) .+.+++.-+.+++....-..--.-+.+..+..|++...+... ....+++..+..- ..+.+.++++ T Consensus 317 -yG~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~pv~k~~~~~~---------~~~~~pg~~~~~~-----~~~~~q~~~p 381 (584) T protein:vir:95 317 -WAMGPLDNLVGMQYRIDHLENAKADAVDLIIQPPLKIIGEVE---------EFVWGPGAEIHLD-----QGGDVQEIAK 381 (584) T ss_pred -cCCCchhhhhhHHHHHhHHHHHHHHHHHHhcCcceeeccccc---------hhcccCCceeecC-----CCCCcceecC Confidence 466666655555555444333334445666778666554321 1245566666542 2345788888 Q ss_pred CchhHHHHHHHHHHHHHHHHHhhhhhccC----CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHc--C Q lcl|NC_020844. 309 ATESLKFLAEQIESVSKEIRELGRQPLTA----QSGNLTRISAAQAASKGNSAVQQWAAGLKDAL-ENALYITNLWM--D 381 (455) Q Consensus 309 ~~~~l~~~~~~l~~~~~~m~~~g~~~l~~----~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~-~~~l~~~a~~~--g 381 (455) +...+-.....|+-++..|-.+.+.+-.. ..+++|++..+.-..+.+..+..++..|.+.+ ++++.++=+|. . T Consensus 382 ~a~~~~s~~~~lq~~e~~me~~sGvp~~~~G~~~~~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~n 461 (584) T protein:vir:95 382 NVNYIINADNQIQMLEDRMELYAGAPREAMGIRTPGEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLETATRN 461 (584) T ss_pred chhhhhHHHHHHHHHHHHHHhhhCCChhhcccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 76665555566888888888776554332 23567887777766677788888999888876 67555443331 0 Q ss_pred CCCC-ceEE----------------EEecccCCCCCC-------HHHHHHHHHHHH-------cCCCCHHHHHHHHHhcC Q lcl|NC_020844. 382 APDT-NPEA----------------DVYTDFRADTGE-------EQTPGYLLTMRQ-------NGDISREGLWMEFQRLG 430 (455) Q Consensus 382 ~~~~-~~~v----------------~v~~df~~~~~~-------~~~~~al~~~~~-------~G~is~et~~~~l~~~~ 430 (455) .+.. .+.+ ++..+|...... ++..+.+....+ .+.++...+...+...+ T Consensus 462 md~~~~vr~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~~~~~~i~p~~~~~~l~~~ladl~ 541 (584) T protein:vir:95 462 MDGSDVIRVMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNSQIGQMILPHTSGKALATFVDDVT 541 (584) T ss_pred ccccCceeeeccccccccccccChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHhhhhhhccccchHHHHHHHHHHHh Confidence 1111 1111 122233211111 232333333333 22334444444343322 Q ss_pred CCCC------CCCHHH---HHHHHH---------hhcc-cCCC Q lcl|NC_020844. 431 ELSD------NFDPEE---ESERLI---------SELP-GGIE 454 (455) Q Consensus 431 vl~~------~~d~e~---e~~ri~---------~e~~-~~l~ 454 (455) -++. +...++ .+.++. +|.+ +|.- T Consensus 542 ~~p~~~~~~~~~~~~~Q~~~q~~~~~~q~~~~~~~~~~~~~~~ 584 (584) T protein:vir:95 542 GLQGYEIFRPNVAVAEQAETQSLVAQAQEDLQLQAQMPAEGAI 584 (584) T ss_pred CCCcccccCCCcccchhHHHHhhhHHHHHHHHHHHhhhhccCC Confidence 2221 111111 111111 1100 0000 No 128 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=98.44 E-value=6.5e-07 Score=54.52 Aligned_cols=417 Identities=10% Similarity=0.027 Sum_probs=194.9 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCC------CCCHHHHHHHhhccccCChHHHHHHHHh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFP------HEQNKRYDLRKSVAKMTNVFRDVLEGLA 73 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~------~e~~~~Y~~rl~ra~~~n~~~~~v~~~~ 73 (455) |.+.+ ..-......+|+.++.--.= +..+++..+--||... ..+...=..|.. -.|-+...+.++.++ T Consensus 1 m~~d~----~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~-~~~dstg~~a~~~LA 75 (549) T protein:vir:10 1 MTNDD----AKILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQ-KMFDSTAPLALRNFV 75 (549) T ss_pred CCcch----HHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCccccccc-ccccchHHHHHHHHH Confidence 98643 23233333344433222111 3445555555666531 112111112222 245555566666666 Q ss_pred hhhh-----------cCCceeccC--CHHHHHHHHhhc--------cCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCc Q lcl|NC_020844. 74 SKPF-----------GREVEVTEG--TGPSEDLLEDID--------GRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQ 132 (455) Q Consensus 74 g~lf-----------~k~p~i~~~--~~~l~~~~~d~d--------~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~ 132 (455) ..+. +-.+.-... ....+.|++.+. ...++++.-+..+.++...+|-+-++++-...+. T Consensus 76 s~l~~~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~~~~~ 155 (549) T protein:vir:10 76 AAMDSMITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDVGKG 155 (549) T ss_pred HHHHhhccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeecCCCe Confidence 5552 211111111 112345555321 2346788888889999999999999997433222 Q ss_pred chhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEE-EE-------e------------------eeeEEEEEe Q lcl|NC_020844. 133 FRNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMR-MW-------E------------------DAETILVLR 186 (455) Q Consensus 133 ~~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~-~~-------E------------------~~e~~~v~~ 186 (455) . .+..|+-.++.-.+. ..| + +.+.+| ++ + ..+.+.|++ T Consensus 156 ~-------------~f~~~pl~~~~v~~d-~~G-~-vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~ 219 (549) T protein:vir:10 156 I-------------VYRNVPMQRLWFAEN-NSG-L-IDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYH 219 (549) T ss_pred e-------------EEEEEEcCeEEEeeC-CCC-C-eEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEE Confidence 1 244455555442222 222 2 222222 11 0 012233321 Q ss_pred ---cCe-------------EE-EEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhH Q lcl|NC_020844. 187 ---PDD-------------WE-RWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKE 249 (455) Q Consensus 187 ---~~~-------------~~-~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~ 249 (455) |.. |. +|. ..++...+.++ ++...|++++++....++.+ |.+|-.+...-...+...+ T Consensus 220 ~V~pr~~~~~~~~~~~~~pf~sv~~-e~~~~~il~es---g~~e~P~~~~Rw~~~~ge~Y-Grgp~~~~l~D~k~L~~l~ 294 (549) T protein:vir:10 220 AVEPRADRDPRKLDGRNMQFASYWL-DEGRDRIVQNS---GFRTFPFAIGRFYVGTDDVY-GGSPAYDAMPDVRMANDMA 294 (549) T ss_pred EeecCCCCCccccccccCceEEEEE-EecCCEeeccC---CcccCCcceeeeeecCCCcc-ccchHHHHHHHHHHHHHHH Confidence 110 11 121 11222222222 35668888888877777666 7777776666555555566 Q ss_pred HHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEee-CchhHHHHHHHHHHHHHHHH Q lcl|NC_020844. 250 CALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEP-ATESLKFLAEQIESVSKEIR 328 (455) Q Consensus 250 sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~-~~~~l~~~~~~l~~~~~~m~ 328 (455) -..-.+.+.+..|.+.+.-- +..+++.+.++.....-.+ .+++..+..+ .+..+....+.|+++++.|. T Consensus 295 ~~~l~~~~~~~~p~~~v~~~-------g~~~~~~l~pgg~~~~~~~---~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~ 364 (549) T protein:vir:10 295 KTNIRGAQKLVDPPLLANED-------GVLDGFDLRSGALNWGGLN---DKGEEMVKPLLTGKQAQIGIEFAQDTRQTIN 364 (549) T ss_pred HHHHHHHHHHhcCceeeccc-------cccccceeccCCccccccC---CCCccceeeeccccchhHHHHHHHHHHHHHH Confidence 66777788888888876411 1111222222222221111 1122222221 34567777888888888886 Q ss_pred Hh--hhh-hccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHcCC-C--CCc---eEEEEecc Q lcl|NC_020844. 329 EL--GRQ-PLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKD-----ALENALYITNLWMDA-P--DTN---PEADVYTD 394 (455) Q Consensus 329 ~~--g~~-~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~-----a~~~~l~~~a~~~g~-~--~~~---~~v~v~~d 394 (455) .. .-. .+..++.+.||++...+.......|.-.-..++. -+++++.++-+ .|. + +.+ ..+.++-. T Consensus 365 ~af~~d~~~~~~~~~~~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r-~g~lP~~p~~l~~~~~~~~i~ 443 (549) T protein:vir:10 365 QWFYVTLFQILVDSGDMTATEVLQRAQEKGVLLAPTLGRTQSELLGPMIAREVDILAE-AGQLPDMPQELIDAGADVDVE 443 (549) T ss_pred HHHhhhhhhhhcCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCChhhhcCCceeEEE Confidence 63 111 1233566789999999888887777666665543 23444444433 243 1 111 12233333 Q ss_pred cCCCCC----CHHHHHHHHHHHH-------cC-----CCCH-HHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccC----- Q lcl|NC_020844. 395 FRADTG----EEQTPGYLLTMRQ-------NG-----DISR-EGLWMEFQRLGELSDNFDPEEESERLISELPGG----- 452 (455) Q Consensus 395 f~~~~~----~~~~~~al~~~~~-------~G-----~is~-et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~----- 452 (455) |.. .+ ..+.+.++....+ .| .|.. +.+.......||-...+-.++|.+.+.++.... T Consensus 444 yis-~La~aq~~~~~~~i~~~~~~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~~~~~qqq~~~ 522 (549) T protein:vir:10 444 YDS-PLNKAMRAGEGAAILQWLQQLGIVSQFDPAAAKVPNGARIARLLADYGGVPVEAMSTDEELQAQQAAEAQAAQMQQ 522 (549) T ss_pred eec-HHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhcCCHHHHHHHHHHhcCCCccccCCHHHHHHHHHHHHHHHHHHH Confidence 432 11 1122222222221 11 1222 233344555777655555667666554432211 Q ss_pred CC-----------C Q lcl|NC_020844. 453 IE-----------E 455 (455) Q Consensus 453 l~-----------~ 455 (455) +- + T Consensus 523 ~~~~a~~a~~~a~~ 536 (549) T protein:vir:10 523 MLAAAPVAAGAIKD 536 (549) T ss_pred HHHHHHHHHHHHHh Confidence 10 1 No 129 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=98.37 E-value=1e-06 Score=53.48 Aligned_cols=383 Identities=10% Similarity=0.005 Sum_probs=175.1 Q ss_pred CCCC------CC---CccCHHHHHHHHHHHHHHHH-hcCH------HHHHHhhhhcCCCCCCCCHHHHHHHhhccccCCh Q lcl|NC_020844. 1 MPEQ------DP---TKRSADAEAMSDYLQTVDDV-LEGV------TGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNV 64 (455) Q Consensus 1 m~~~------~v---~~~~p~y~~~~~~w~~i~d~-~~G~------~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~ 64 (455) |+.- .+ ..+.|+-.....-|+...+. ..|- ..+|++..-.++.. -+-|.....+ ..+ T Consensus 1 m~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~~~----~~L~~dm~~~---D~h 73 (512) T protein:vir:19 1 MGRILDISGQPFDFDDEMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDLTAQ----ADLAFDMEEK---DTH 73 (512) T ss_pred CcceeCCCCCccccccccccccchhcccchhhccccccCCCHHHHHHHHHHhhCCCHHHH----HHHHHHHHhh---ChH Confidence 2210 00 00001100000011111110 0010 11111111111100 0112222222 567 Q ss_pred HHHHHHHHhhhhhcCCceec---cCCH---H----HHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcch Q lcl|NC_020844. 65 FRDVLEGLASKPFGREVEVT---EGTG---P----SEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFR 134 (455) Q Consensus 65 ~~~~v~~~~g~lf~k~p~i~---~~~~---~----l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~ 134 (455) +.-.++.-...+.+.+++|. +.++ . ++.++.++ -+++.++..+. .|+-||.+.+=+- T Consensus 74 i~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~----~~f~~~~~~ll-dA~~~G~s~~Ei~-------- 140 (512) T protein:vir:19 74 LFSELSKRRLAIQALEWRIAPARDASAQEKKDADMLNEYLHDA----AWFEDALFDAG-DAILKGYSMQEIE-------- 140 (512) T ss_pred HHHHHHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcC----CCHHHHHHHHH-hhhhhcceeeeeE-------- Confidence 77778888888888888884 1122 2 23333322 25888888887 5778887665331 Q ss_pred hhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCce Q lcl|NC_020844. 135 NREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVV 214 (455) Q Consensus 135 t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~v 214 (455) |+. .+|.+.+..+..+.. ..+.+...+..++ + ..+...++.. + T Consensus 141 ------------------------w~~--~~g~~~~~~~~~r~~-~~f~~~~~~~~~l-r--------~~~~~~~G~~-l 183 (512) T protein:vir:19 141 ------------------------WGW--LGKMRVPVALHHRDP-ALFCANPDNLNEL-R--------LRDASYHGLE-L 183 (512) T ss_pred ------------------------eee--eCCceeeeeeeeecc-ccceeccCCCcEE-E--------ecCCCCCcee-e Confidence 222 234444433332221 0111111111111 1 1111111110 1 Q ss_pred e---EEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEe---cCCCccccccCccceeeccce Q lcl|NC_020844. 215 P---MVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSAN---GVEPDTDSEGSPKPLDTGPDT 288 (455) Q Consensus 215 P---~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~---G~~~~~~~~~~~~~~~~g~~~ 288 (455) | +|.+...+.. +...+...|..++..-+--.....+...-++.-+.|+++.+ |.+.++..........+|.+. T Consensus 184 ~~~k~i~~~~~~~~-g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~L~~al~~~~~~a 262 (512) T protein:vir:19 184 QPFGWFMHRAKSRT-GYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGKYPTGSTNREKATLMQAVMDIGRRA 262 (512) T ss_pred cCCceEEEeccCCC-CCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEecCCCCCHHHHHHHHHHHHHHhhCc Confidence 1 3323333332 33345665666665544444556778888889999998877 222222111122334578889 Q ss_pred eeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHH--hhhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 289 VLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRE--LGRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLK 366 (455) Q Consensus 289 ~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~--~g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~ 366 (455) +..+|.+. .++|++..+.+-..++..++...++|.. +|..+-...+..-|-.............+...+..+. T Consensus 263 ~~iiP~~~-----~ie~~ea~~~~~~~y~~li~~~d~~Isk~iLGqtlTs~~g~~Gs~a~~~vh~ev~~di~~aDa~~i~ 337 (512) T protein:vir:19 263 GGIIPMGM-----TLDFQSAADGQSDPFMAMIGWAEKAISKAILGGTLTTEAGDKGARSLGEVHDEVRREIRNADVGQLA 337 (512) T ss_pred EEEecCCc-----eEEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHH Confidence 99999765 6999998776667788888888899865 3554433222122222334445556667788889999 Q ss_pred HHHH-HHHHHHHHHcCCCCCc----eEEEEecccCCCCCC-HHHHHHHHHHHHcCC-CCHHHHHHHHHhcCCCCCCCCHH Q lcl|NC_020844. 367 DALE-NALYITNLWMDAPDTN----PEADVYTDFRADTGE-EQTPGYLLTMRQNGD-ISREGLWMEFQRLGELSDNFDPE 439 (455) Q Consensus 367 ~a~~-~~l~~~a~~~g~~~~~----~~v~v~~df~~~~~~-~~~~~al~~~~~~G~-is~et~~~~l~~~~vl~~~~d~e 439 (455) ++++ ++++.++.|-.-.... ..+.+ +... ..+ ...++.+.++. .|. ||.+.+.+ +.|+-.+.. + T Consensus 338 ~tln~~li~~l~~~N~~~~~~~~~~p~~~f--~~~e-~eDl~~~a~~~~~l~-~G~~i~~~~i~e---~~Gip~~~~--~ 408 (512) T protein:vir:19 338 RSINRDLIYPLLALNSDSTIDINRLPGIVF--DTSE-AGDITALSDAIPKLA-AGMRIPVSWIQE---KLHIPQPVG--D 408 (512) T ss_pred HHHHHHHHHHHHHhCCCCCCCccccceEEe--cCCC-hhhHHHHHHHHHHHh-cCCCCCHHHHHH---HhCCCCCCC--c Confidence 9996 6999988876432221 22322 2211 122 23344444443 565 77665544 446522221 1 Q ss_pred HHHHHHHhhccc--CCCC Q lcl|NC_020844. 440 EESERLISELPG--GIEE 455 (455) Q Consensus 440 ~e~~ri~~e~~~--~l~~ 455 (455) ++........+. .... T Consensus 409 e~~~~~~~~~~~~~~~~~ 426 (512) T protein:vir:19 409 EAVFTIQPVVPDNGSQKE 426 (512) T ss_pred cccccCCCcccccccccc Confidence 111111111000 0000 No 130 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=98.27 E-value=1.8e-06 Score=52.02 Aligned_cols=277 Identities=11% Similarity=0.077 Sum_probs=134.9 Q ss_pred h--ccceeeccCCeeEEEEEEEEee--eeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCcee-----EEEEEecccCC Q lcl|NC_020844. 156 I--LEIQSERQGAGERITYMRMWED--AETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVP-----MVPFITGRRKG 226 (455) Q Consensus 156 i--i~w~~~~~~~~~~l~~v~~~E~--~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP-----~V~f~~~~~~~ 226 (455) | |.|+. .+|.+.+..+.++.. +.++.+.. +++...+...+.++.+.+| ||.+.+.+..+ T Consensus 1 v~Eivw~~--~~g~~~~~~l~~r~~~~~~~f~~~~----------~~~l~~~~~~~~~g~~~~~lp~~kfi~~~~~~~~g 68 (355) T protein:vir:78 1 MFEQVYRI--ENGRARLGKLAWRPPRTISRFDVAP----------DGGLVAIEQWGVFGKATVRIPVDRLVVFVNEREGA 68 (355) T ss_pred CeEEEEEe--eCCeEEEeeeeecCccceeeeeecc----------CCceeEEEecCCCCCCcceeccCCEEEEEeCCCCC Confidence 2 23543 356666655554432 12222111 1222222233334333333 34444454433 Q ss_pred CcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccc--ccc-------------C---ccceeeccce Q lcl|NC_020844. 227 KKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTD--SEG-------------S---PKPLDTGPDT 288 (455) Q Consensus 227 ~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~--~~~-------------~---~~~~~~g~~~ 288 (455) . ..+.+.|..++..-+---....+...-++.-+.|+++.+|-.+... .+. . -..+..|... T Consensus 69 ~-p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a 147 (355) T protein:vir:78 69 N-WLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEAA 147 (355) T ss_pred C-ccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcce Confidence 3 3466666666654432223345555666666668888887432211 000 0 0113457677 Q ss_pred eeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHh--hhhhccCCC-cchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 289 VLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIREL--GRQPLTAQS-GNLTRISAAQAASKGNSAVQQWAAGL 365 (455) Q Consensus 289 ~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~--g~~~l~~~~-~~~sa~a~~~~~~~~~s~L~~~a~~~ 365 (455) +..+|.+. ++.|++..+... .+.+.++...++|... |..+-...+ +.-|-.............+...+..+ T Consensus 148 ~~iip~g~-----~ie~~ea~g~~~-~~~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~Alg~vh~~v~~~~~~aD~~~i 221 (355) T protein:vir:78 148 GGYIPHGA-----NFTLTGVQGKLP-EMDGPIRYHDEQIARAVLAHFLTLGGDKSTGSYALGDTFASFFTGSLNAVMKHI 221 (355) T ss_pred eEeecCCc-----eEEEeecCCCcc-cHHHHHHHHHHHHHHHHhhhhhccccCCccchhhHHHHHHHHHHHHHHHHHHHH Confidence 88888765 699999877554 4667788888888553 433322111 11222223344455566677788888 Q ss_pred HHHHH-HHHHHHHHHcC-CCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCC-CHHHHHHHHH-hcCCCCCCCCHHHH Q lcl|NC_020844. 366 KDALE-NALYITNLWMD-APDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDI-SREGLWMEFQ-RLGELSDNFDPEEE 441 (455) Q Consensus 366 ~~a~~-~~l~~~a~~~g-~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~i-s~et~~~~l~-~~~vl~~~~d~e~e 441 (455) .++++ +++++++.+-. ....-..|.+. ... ..+...++++.++...|.+ +.+....+++ +.|+-.+... +++ T Consensus 222 ~~~ln~~li~~l~~lN~~~~~~~P~~~~~--~~~-~~~~~~a~~~~~l~~~G~~~~~~~~~~~~~e~~gip~p~~~-~~~ 297 (355) T protein:vir:78 222 ADVTQQHVVEDLVDQNWGPEEPAPRLVPA--QLG-KEQPVTAEAIRALVECGAFTADPELEKDLRARYGLPAPAER-DDG 297 (355) T ss_pred HHHHHHHHHHHHHHhcCCCCCCCCEEEec--CcC-hhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCC-Ccc Confidence 89996 58888888752 22212344432 112 2223456778888888885 4333333333 4666333222 222 Q ss_pred HHHHHhhcccCCC----C Q lcl|NC_020844. 442 SERLISELPGGIE----E 455 (455) Q Consensus 442 ~~ri~~e~~~~l~----~ 455 (455) ..-.+...+.... + T Consensus 298 ~~~~~~~~~~~~~~~~~~ 315 (355) T protein:vir:78 298 ADAAAAKAAGRRRAKRLP 315 (355) T ss_pred cCCccccccccccccccC Confidence 1111100000000 0 No 131 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=98.24 E-value=2.2e-06 Score=51.65 Aligned_cols=435 Identities=11% Similarity=0.097 Sum_probs=170.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhh---hhcC----CCCCCCCHHHHHH---Hhhcc-ccCChHHHHH Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNF---KRYV----KQFPHEQNKRYDL---RKSVA-KMTNVFRDVL 69 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g---~~yL----pk~~~e~~~~Y~~---rl~ra-~~~n~~~~~v 69 (455) |++.+- ..|-. .. ..++.........|+.. +.|. -+++.+..+..+. +..|- +.+|.++.+| T Consensus 1 ma~~~~-~~l~~---~~---~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~v 73 (720) T protein:vir:35 1 MAETLQ-KRHEQ---IM---RKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTEL 73 (720) T ss_pred CchHHH-HHHHH---HH---HHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHHH Confidence 986432 22222 22 22333333334444322 1221 1333333332221 22333 4469999999 Q ss_pred HHHhhhhhcCCceec------cC----CHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEE--eeCCCCcchhhh Q lcl|NC_020844. 70 EGLASKPFGREVEVT------EG----TGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRV--DYPAGQQFRNRE 137 (455) Q Consensus 70 ~~~~g~lf~k~p~i~------~~----~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lV--d~p~~~~~~t~a 137 (455) +..+|+--...+.+. .. .+.+..++..+- .-++.+.-+..++..++.+|++|+=| ||.....+... T Consensus 74 ~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~-~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~~~~- 151 (720) T protein:vir:35 74 NRIISEYRHNRITVKFRPGDKTASEALANKLNGLFRADY-EETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALDPMDE- 151 (720) T ss_pred HHHHhHHHhCCCceEEEcCCCcchHHHHHHHHHHHHHHH-HhcCchHHHhHHHHHhhhccceeEEeeecccccCCCCcc- Confidence 999999865555442 11 122444444433 23567888899999999999999977 33222111100 Q ss_pred hHHhccCC------------ceEEEech--------------hhh---cc-------------ceeeccCCeeEEEEEEE Q lcl|NC_020844. 138 EERQAGVR------------PYWSQVNH--------------SAI---LE-------------IQSERQGAGERITYMRM 175 (455) Q Consensus 138 d~~~~~~r------------Py~~~~~~--------------~~i---i~-------------w~~~~~~~~~~l~~v~~ 175 (455) ...-.++ |....++. +++ ++ |..+-.. ...+..+.+ T Consensus 152 -~~~i~i~~v~~~~~~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~~-~~~v~i~E~ 229 (720) T protein:vir:35 152 -RQRICLEPIYDPARSVWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWYD-VDVVYIAKY 229 (720) T ss_pred -cceeeEecccCchhheeecccccccChhhhhhhhhhcCCCHHHHHHhCCCccccccccccccccccccC-CCceEEEEe Confidence 0000000 11111110 000 00 0000000 001111111 Q ss_pred --Eeeee-EEEEE-ec--Ce-------------------------------EEEEEEeCCcEEEEeccCCCCCCceeEEE Q lcl|NC_020844. 176 --WEDAE-TILVL-RP--DD-------------------------------WERWRKNDAGIWEVDEFGRITIGVVPMVP 218 (455) Q Consensus 176 --~E~~e-~~~v~-~~--~~-------------------------------~~~~~~~~~g~~~~~~~g~~~l~~vP~V~ 218 (455) ++.+. ++.++ .+ +. +++|...-+|...+....+.+.+.+|+|| T Consensus 230 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP 309 (720) T protein:vir:35 230 YEVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIP 309 (720) T ss_pred eEEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCCCccceEE Confidence 11110 11111 00 11 11111111222222333455778899999 Q ss_pred EEeccc--CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccC-- Q lcl|NC_020844. 219 FITGRR--KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPP-- 294 (455) Q Consensus 219 f~~~~~--~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~-- 294 (455) |+..+. ++... ...-+-++.+.....+...|-+.+++- ..|+..-.|-.... ..+...--.-+..+...|+. T Consensus 310 ~~g~r~~~d~~~~-~~G~vr~~kd~Q~~~N~~~s~~~~~~~--~~~~~~~~~a~~~~-~~~~~~~a~~~~~~~~~l~~~~ 385 (720) T protein:vir:35 310 VYGKRWFIDDIER-VEGHIAKAMDAQRLYNLQVSMLADSAT--QDTGSIPIVGKSQI-KTLEKYWANRNKNRPAFLPLNE 385 (720) T ss_pred EEeeeeccCCCcc-cceeeecchhHHHHHHHHHHHHHHHHH--cCCccccccCcchH-HHHHHHhhcccccccccccccc Confidence 874332 21110 011122333444344445566666653 33333222211111 00100000000111111111 Q ss_pred -----CCC-ccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhh--ccCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 295 -----NAE-GQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQP--LTAQSGNLTRISAAQAASKGNSAVQQWAAGLK 366 (455) Q Consensus 295 -----~~~-~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~--l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~ 366 (455) ++. ....++++.++.. -..++.+.|+.-...|..+++.- +.+..+|.||++...+..+....|..+-.|+. T Consensus 386 ~~~~~G~~~~~~~~~~~~~~~~-~~~~~~~llq~~~~~i~~vsGi~~~~lG~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~ 464 (720) T protein:vir:35 386 IVDKQGNIIAPPTPVGYTQPQP-LNQAMAALLQQTGADIQEVTGSSQAMQPMPSNIAKETVNHLMHRSDMSSFIYLDNMA 464 (720) T ss_pred ccccCcccccCCCcccccCCCC-CchHHHHHHHHHHHHHHHHhCCChHHcCcccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000 0112455555432 22344555666666666654322 23334468999998888777666666666666 Q ss_pred HHHH----HHHHHHHHHcCC---------CCCceEEEEec---------------------ccC----C--CCCCHHHHH Q lcl|NC_020844. 367 DALE----NALYITNLWMDA---------PDTNPEADVYT---------------------DFR----A--DTGEEQTPG 406 (455) Q Consensus 367 ~a~~----~~l~~~a~~~g~---------~~~~~~v~v~~---------------------df~----~--~~~~~~~~~ 406 (455) .+.. .+|.++.++++. ++....+.+|. |+. + .....+..+ T Consensus 465 ~~~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~ 544 (720) T protein:vir:35 465 KSLKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVS 544 (720) T ss_pred HHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHHHH Confidence 6554 466677777742 11111122210 111 1 111234455 Q ss_pred HHHHHHHcCCCCHHH----HHHHHHhcCCCCCCCCHHHHHHHHHhhccc-CCCC Q lcl|NC_020844. 407 YLLTMRQNGDISREG----LWMEFQRLGELSDNFDPEEESERLISELPG-GIEE 455 (455) Q Consensus 407 al~~~~~~G~is~et----~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~-~l~~ 455 (455) ++.++... +.+... +...+- ..+ +.-..++..+|+....+. +.-+ T Consensus 545 ~m~qll~~-~~p~~~~~~~~~~~il--e~~-d~p~~~e~~erirk~~~~~~~~~ 594 (720) T protein:vir:35 545 VLTNLLAG-MLPQDPMRQVLQGIIL--DNM-EGEGLDEFKEYNRKQLLTQGVVK 594 (720) T ss_pred HHHHHHHh-cCCCchhHHHHHHHHH--Hhc-CchhHHHHHHHHHhhcchhcccC Confidence 66655432 111111 100000 000 011123333444443221 1111 No 132 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=98.19 E-value=2.8e-06 Score=51.01 Aligned_cols=390 Identities=9% Similarity=-0.004 Sum_probs=180.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGRE 80 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~ 80 (455) |+ ..++.-|. ++++|.. .|..|-+....-.-+-+..|.. ...++++|+..+.-..++. T Consensus 1 ~~----~~~~d~~~----------~~~~~~~----~~~~~~~~~~~~~~~l~a~Y~~----~~l~~~~Vd~~aed~~r~g 58 (427) T protein:vir:10 1 MK----IVKHDGYN----------DIFNGGA----DGSPKPFFMSDASYHVGSFYND----NATAKRIVDVIPEEMVTAG 58 (427) T ss_pred CC----ccccchHH----------HHhhcCC----CCcccCccccCchHHHHHHHHc----CchhhhhhccchHHhhcCC Confidence 55 23333332 2244421 1222322111111122233443 4677888999999999999 Q ss_pred ceeccCCH--HHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhcc Q lcl|NC_020844. 81 VEVTEGTG--PSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILE 158 (455) Q Consensus 81 p~i~~~~~--~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~ 158 (455) ++|+.+.+ .++.-++. ..+...++.+.+.+..||.+++++............ ...+...++..+++.+|.. T Consensus 59 ~~i~g~~~~~~~~~~~~~-----l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~--~~~g~l~~l~v~d~~~~~~ 131 (427) T protein:vir:10 59 FKMSGVKDEKEFKSLWDS-----YKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQA--KPGAKLEGVRVYDRFAITV 131 (427) T ss_pred ccccCccHHHHHHHHHHH-----hhHHHHHHHHHHhccccceeEEEEEecCCCcccccc--CCCcceeEEEEechhcccc Confidence 99976432 33433332 357888889999999999999998643221110000 0011122333333333211 Q ss_pred ceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCc---EEEEeccCCCCCCceeEEEEEecccCCCcccCcchH Q lcl|NC_020844. 159 IQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAG---IWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPM 235 (455) Q Consensus 159 w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g---~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL 235 (455) -... .+...-.+..| +.|...+++ .+.+|.+--..+..-|+ |....+ ..++ .+.++| T Consensus 132 ~~~~--------------~dp~s~~fg~P---~~y~v~~~~~~~~~~iH~SRli~~~g~~~-p~~~~~-~~~~-~G~S~l 191 (427) T protein:vir:10 132 EKRV--------------TNARSPRYGEP---EIYKVSPGDNMQPYLIHHSRVFIADGERV-AQQARK-QNQG-WGASVL 191 (427) T ss_pred cccc--------------cCccccccCcc---eEEEEecCCCCcceEEccccEEEecCCCc-hhhhcc-cCCc-ccchhh Confidence 0000 00000001111 223222211 12222222111211111 111111 2222 356777 Q ss_pred HH-HHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcccc-ccC----cc----ceeeccceeeeccCCCCccccceeE Q lcl|NC_020844. 236 KD-ALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDS-EGS----PK----PLDTGPDTVLYAPPNAEGQHGSWNF 305 (455) Q Consensus 236 ~d-la~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~-~~~----~~----~~~~g~~~~~~lp~~~~~~~~~~~~ 305 (455) .. +.+.....-++...-..+++...+.++-+.|+...-.. +.. .. ....+.+..+.+.. ++.++.. T Consensus 192 ~~~~~~~i~~~~~~~~~~~~l~~k~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~----~~e~~e~ 267 (427) T protein:vir:10 192 NKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDA----ETEEYDV 267 (427) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHHhcCccchHHHHHHHHHHHHhcCcccceeeec----CCCceeE Confidence 64 44544443345555677888888888888776422110 000 00 01122233333321 1224555 Q ss_pred EeeCchhHHHHHHHHHHHHHHHHHhhhhhccC---CC---cchhHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH Q lcl|NC_020844. 306 VEPATESLKFLAEQIESVSKEIRELGRQPLTA---QS---GNLTRISAAQAASKGNSAVQQWAA-GLKDALENALYITNL 378 (455) Q Consensus 306 le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~---~~---~~~sa~a~~~~~~~~~s~L~~~a~-~~~~a~~~~l~~~a~ 378 (455) +..+-+++ .+.++...++|......|++. ++ -|.||.. +...-+..++.+.. .+...++++++++.. T Consensus 268 ~~~~lsgl---~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~---D~~nyyd~i~~~Qe~~l~p~l~~l~~~i~~ 341 (427) T protein:vir:10 268 LNSDISGV---PEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNT---ALETFYKLVDRKREEDYRPLLEFLLPFIVD 341 (427) T ss_pred EecccCCh---HHHHHHHHHHHHhhhCCCeeeeccCCccccccchhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 55444454 555666677777766655432 21 1333332 34445555555543 366777877777642 Q ss_pred HcCCCCCceEEEEecccCCCCCC-----HHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCC---H-HHHHHHHHhhc Q lcl|NC_020844. 379 WMDAPDTNPEADVYTDFRADTGE-----EQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFD---P-EEESERLISEL 449 (455) Q Consensus 379 ~~g~~~~~~~v~v~~df~~~~~~-----~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d---~-e~e~~ri~~e~ 449 (455) ..+.+++++.-+.....+ ...+++..++.++|+|+.+..++.|+..+......+ . .++.+.-+ +. T Consensus 342 -----s~~~~~~f~pL~~~s~kEkaei~~~~a~a~~~~~~~gvi~~~e~r~~L~~~~~~~~~~~~~~~~~e~~~~~~-e~ 415 (427) T protein:vir:10 342 -----EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETT-EP 415 (427) T ss_pred -----CCCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhhhccccCCCCccccccccchhc-CC Confidence 235667666433322111 123567888889999999999999975433322111 1 11222221 22 Q ss_pred ccCCCC Q lcl|NC_020844. 450 PGGIEE 455 (455) Q Consensus 450 ~~~l~~ 455 (455) +.+.+| T Consensus 416 ~p~~~e 421 (427) T protein:vir:10 416 EPGLGE 421 (427) T ss_pred CCCCCC Confidence 333333 No 133 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=98.18 E-value=3.1e-06 Score=50.80 Aligned_cols=409 Identities=11% Similarity=0.037 Sum_probs=183.6 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGR 79 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k 79 (455) |=+. ...+|+.++.--.= ...+|+..+-.||..-..+...=..+. ...|-+...+.++.++..+++- T Consensus 1 m~~~-----------~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~-~~~~dst~~~a~~~Laa~l~~~ 68 (555) T protein:vir:17 1 MKHS-----------AQAKYMMLRADREDYLDSGRQSARLTLPYILTDEGHVQGGYL-PTPWQSVGSKGVNVLASKLMLS 68 (555) T ss_pred ChhH-----------HHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccc-cccccccHHHHHHHHHHHHHHh Confidence 3311 12333333221100 233455555556643221111111122 2346666667777766665321 Q ss_pred --Cce-----ec-c---------CCH---HHHHHHHhhcc------CCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcc Q lcl|NC_020844. 80 --EVE-----VT-E---------GTG---PSEDLLEDIDG------RGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQF 133 (455) Q Consensus 80 --~p~-----i~-~---------~~~---~l~~~~~d~d~------~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~ 133 (455) ||. +. . +++ .++.+++.+.. ..++++.-+..+.++...+|.+-+++|.+. T Consensus 69 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~~---- 144 (555) T protein:vir:17 69 LFPVNTSFFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALLYQGKKN---- 144 (555) T ss_pred hcCCCCcccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEecCCc---- Confidence 111 11 0 011 13333332211 135688888999999999999988886321 Q ss_pred hhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEE-EEee----eeEE-------------------------- Q lcl|NC_020844. 134 RNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMR-MWED----AETI-------------------------- 182 (455) Q Consensus 134 ~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~-~~E~----~e~~-------------------------- 182 (455) +..|+-.++. +..+..| .+.+.+| ++=+ ++++ T Consensus 145 --------------~~~~pl~~y~-v~~d~~G--~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~ 207 (555) T protein:vir:17 145 --------------LKLYPLDRFV-VSRDGEG--NVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTA 207 (555) T ss_pred --------------eeEEEcCeEE-EeeCCCc--CeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhh Confidence 2223322211 1222222 2222111 1100 0000 Q ss_pred -E---EEecCeEEEEE--EeCCcEEEE----------eccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHH Q lcl|NC_020844. 183 -L---VLRPDDWERWR--KNDAGIWEV----------DEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALF 246 (455) Q Consensus 183 -~---v~~~~~~~~~~--~~~~g~~~~----------~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy 246 (455) . ...+..+.+|. .-..+.|.. ...+..++...|++++++....++.+ +.+|..+...-...+. T Consensus 208 ~~~~~~~~~~~~~v~t~~~~~~~~~~~~~e~~~~~v~~~l~e~g~~e~P~i~~Rw~~~~ge~Y-Grgp~~~~l~D~k~L~ 286 (555) T protein:vir:17 208 PGGRDKGKSNDALVYTYVCRKDGQVKWHQECDGKVIPGSNSSAPYTHNPWIPLRFNIVDGEAY-GRGRVEEFMGDLKSLE 286 (555) T ss_pred hcccccCCCcceeEeecccccCCeeEEEEecCceeccccccccCcccCCeeeeeeeecCCCcc-ccchHHHHHHHHHHHH Confidence 0 00111122221 011111111 11234567788999998887777776 7777777666555555 Q ss_pred HhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEeeC-chhHHHHHHHHHHHHH Q lcl|NC_020844. 247 RKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPA-TESLKFLAEQIESVSK 325 (455) Q Consensus 247 ~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~-~~~l~~~~~~l~~~~~ 325 (455) ..+-..-..++.+..|.+.+.- +.-.++..+.-|....+ +|.. .++++.++.. +..+....+.++++++ T Consensus 287 ~l~~~~l~~~~~~~~pp~lv~~-----~g~~~~~~l~~~~~g~v-~~g~----~~~v~~~~~~~~~~~~~~~~~i~~~~~ 356 (555) T protein:vir:17 287 ALSQAMVEGSAASAKVVFMVSP-----SATTKPQNLALAANGAI-IQGR----PDDVSVVQANKAADFRTVLEMIQKLEQ 356 (555) T ss_pred HHHHHHHHHHHHHhCCceeecc-----ccccCcceeecCCCcee-ecCC----cccceeeeccccchhhHHHHHHHHHHH Confidence 5555666777777777766530 01111112233333332 2311 1235445432 3457777888888888 Q ss_pred HHHHhhhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHcCC----CCCceEEEEecccC Q lcl|NC_020844. 326 EIRELGRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKD-----ALENALYITNLWMDA----PDTNPEADVYTDFR 396 (455) Q Consensus 326 ~m~~~g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~-----a~~~~l~~~a~~~g~----~~~~~~v~v~~df~ 396 (455) .|..+=.-+...++.+.||++.+.+.......|.-.-..++. -+++++.++.+ .|. +++.+.+++...+. T Consensus 357 ~I~~aFm~~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r-~g~lP~~p~~~v~~~i~~~l~ 435 (555) T protein:vir:17 357 RISDAFLMLQVRQSERTTATEVQATVQELNEQIGGIYSNLTTELLQPYLARKLHLLQK-QRKLPQLPKDLVQPTVVAGLW 435 (555) T ss_pred HHHHHHhhcCCCCcccchHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHh-CCCCCCCCHhhhccceeehHH Confidence 886542222233455689999999988888888776666653 33444444444 232 12222333333221 Q ss_pred CCCCCHHHHHHHHHHH----HcC-------CCCHHHHHH-HHHhcCCCCC-CCCHHHHHHHHHhhccc------CCCC Q lcl|NC_020844. 397 ADTGEEQTPGYLLTMR----QNG-------DISREGLWM-EFQRLGELSD-NFDPEEESERLISELPG------GIEE 455 (455) Q Consensus 397 ~~~~~~~~~~al~~~~----~~G-------~is~et~~~-~l~~~~vl~~-~~d~e~e~~ri~~e~~~------~l~~ 455 (455) .. .-.+.++.++... +.+ .|....+.+ .....||.+. .+..++|++.+.++... .+.. T Consensus 436 ~l-~r~~~~~~l~~~~~~laq~~~~p~~~d~id~d~~~~~~a~~~Gv~p~~ivrs~eev~~~rq~~~~~~~q~~~~~q 512 (555) T protein:vir:17 436 GV-GRGQDKQQLMEFITTLAQTMGPEIAMKYINPTEFIKRLAAAQGIDTLQLINSPETMKQLGDQQKQDMVQASLINQ 512 (555) T ss_pred HH-HHHHHHHHHHHHHHHHHhhcCchhHhhcCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHH Confidence 10 0122333332222 221 122222222 3344666332 23345555555433221 1111 No 134 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=98.17 E-value=3.2e-06 Score=50.73 Aligned_cols=417 Identities=11% Similarity=0.043 Sum_probs=188.8 Q ss_pred CCCC--CCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHH---HHHH-Hhhccc----cCChHHHHHH Q lcl|NC_020844. 1 MPEQ--DPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNK---RYDL-RKSVAK----MTNVFRDVLE 70 (455) Q Consensus 1 m~~~--~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~---~Y~~-rl~ra~----~~n~~~~~v~ 70 (455) |--- -+....|.........+.....|.|...=|. ....|. +.-.+. .+.. ...||- =.++.+..|+ T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~--~~~~~~-~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~ 77 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAIQAYEAARPGRT--HKAKRQ-PLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLD 77 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHhccccccCcccc--ccccCC-CCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Confidence 5422 2445566666555555555555555332121 112222 221211 1111 111111 1456777777 Q ss_pred HHhhhhhcC-C----ceec-cCC-----------HHHHHHHHhhccCCC-CHHHHHHHHHHHHHhhCcEEEEEeeCCCCc Q lcl|NC_020844. 71 GLASKPFGR-E----VEVT-EGT-----------GPSEDLLEDIDGRGN-NLTTFAGQLFFQGVADSISWIRVDYPAGQQ 132 (455) Q Consensus 71 ~~~g~lf~k-~----p~i~-~~~-----------~~l~~~~~d~d~~G~-~l~~~~~~~~~~al~~G~~~~lVd~p~~~~ 132 (455) .++..+.+. . |+.- .++ ..++.|.+++|-.|. +++++.+.+.+..+..|-|++..-+...+. T Consensus 78 ~~~~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~ 157 (548) T protein:vir:95 78 RLEERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPN 157 (548) T ss_pred HHHHhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeeccccc Confidence 766666542 1 2211 111 135677788887765 899999999999999999998775443221 Q ss_pred chhhhhHHhccCCc-eEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCC Q lcl|NC_020844. 133 FRNREEERQAGVRP-YWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITI 211 (455) Q Consensus 133 ~~t~ad~~~~~~rP-y~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l 211 (455) .. ....-| -+-+++|+.|=+=... .++ .+ +..++.=..-.+-.|.+++...+..+.. .+...+ T Consensus 158 ~~------~g~~~~~~lqliepd~l~~~~~~-~~~--~i-----~~GIE~D~~Grp~aY~i~~~hPgd~~~~--~~~~~~ 221 (548) T protein:vir:95 158 YT------FATSVPFALELLEPDYLPFSYNN-LSK--GI-----VQGIERDTWRRKRAYHLLKDHPGNLQTL--GGSLAV 221 (548) T ss_pred cc------CCcccceEEEEechhhcCCCCCC-CCC--ce-----eeeeEECCCCceEEEEEeecCCCccccc--ccccce Confidence 10 001113 2677888887431111 111 11 1111100001112334444332221111 111223 Q ss_pred CceeE---EEEEecccCCCcccCcchHHHHHHHHHHHHHh-HHHHHHHHHHcCCceEEEe-cCCCc----cccccCccce Q lcl|NC_020844. 212 GVVPM---VPFITGRRKGKKWVFHPPMKDALDLQVALFRK-ECALEFAEMMTAFPMLSAN-GVEPD----TDSEGSPKPL 282 (455) Q Consensus 212 ~~vP~---V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~-~sd~~~~l~~~~~p~~~~~-G~~~~----~~~~~~~~~~ 282 (455) ..||- +-++ .+...+-.-+.|.|..+...-..+-.. .+.+..+.-.+++ ..+|+ +.... ..+......+ T Consensus 222 ~rvpA~~VlHif-~~~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~-a~fi~~~~~~~~~~~~~~~~~~~~~ 299 (548) T protein:vir:95 222 KRVEAERIIHIA-YRKRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAAL-AMYIKKGNPDSYTVEPGKDRKNRTI 299 (548) T ss_pred eeechhHheecc-cccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhh-eeeeecCCCccccCCCCcccccccc Confidence 44552 2222 333344445777777665532222221 2334433333344 44444 22211 1122223446 Q ss_pred eeccceeee-ccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHH---hhhhhccCC-Ccc-hhHHHHHHHHHHHHH Q lcl|NC_020844. 283 DTGPDTVLY-APPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRE---LGRQPLTAQ-SGN-LTRISAAQAASKGNS 356 (455) Q Consensus 283 ~~g~~~~~~-lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~---~g~~~l~~~-~~~-~sa~a~~~~~~~~~s 356 (455) .+++++++. |+.+ -+++|++++-. -..+...++.+...|.. +....+.+. ++| .|+-+...++..... T Consensus 300 ~~~pG~iv~~L~pG-----e~i~~~~p~~p-~~~~~~f~~~~lr~IAaglGipYe~ltgD~s~nYSS~R~~l~e~~r~~~ 373 (548) T protein:vir:95 300 PIAPGMVFDDLEPG-----EDVGMIESNRP-NPFLEGFRNGQLRMIGAGTRSTYSSVSRAYDGTYSAQRQELVEGWLGYD 373 (548) T ss_pred cccCCccccccCCC-----ceeeecCCCCC-CCCHHHHHHHHHHHHHhhcCCCHHHHhcccchhHHHHHHHHHHHHHHHH Confidence 788888653 6533 37999886521 12334444444445433 222334443 223 343344444444443 Q ss_pred HHHHHHHHHHHHHHH-HHHHH---HHHcCCCC--C--ceEEEEecccCCC----CCCHHHHHHHHHHHHcCCCCHHHHHH Q lcl|NC_020844. 357 AVQQWAAGLKDALEN-ALYIT---NLWMDAPD--T--NPEADVYTDFRAD----TGEEQTPGYLLTMRQNGDISREGLWM 424 (455) Q Consensus 357 ~L~~~a~~~~~a~~~-~l~~~---a~~~g~~~--~--~~~v~v~~df~~~----~~~~~~~~al~~~~~~G~is~et~~~ 424 (455) ..+. .|...+-+ +++++ |...|.-+ . ....-++-+|... ..+..++++...++.+|+.|.+.... T Consensus 374 ~~q~---~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a 450 (548) T protein:vir:95 374 LLQH---EFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEAEVAR 450 (548) T ss_pred HHHH---HHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHH Confidence 3322 22222222 33221 22234211 0 1011122233222 23468899999999999999986655 Q ss_pred HHHhcCCCCCCCCHHHHHHHHHhhcc----cCCC---C Q lcl|NC_020844. 425 EFQRLGELSDNFDPEEESERLISELP----GGIE---E 455 (455) Q Consensus 425 ~l~~~~vl~~~~d~e~e~~ri~~e~~----~~l~---~ 455 (455) + + ..||++.++.++.|.. -||. + T Consensus 451 ~---~-----G~D~~ev~~q~a~E~~~~~~~GL~~~~~ 480 (548) T protein:vir:95 451 A---R-----GRDPRELKKSRETEIKANRAAGLVFSSD 480 (548) T ss_pred H---h-----CCCHHHHHHHHHHHHHHHHHcCCCCCCc Confidence 3 3 3477887777777642 1221 0 No 135 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=98.14 E-value=3.8e-06 Score=50.30 Aligned_cols=434 Identities=12% Similarity=0.027 Sum_probs=176.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCH----HHHHHHhhccccCChHHHHHHHHhhhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQN----KRYDLRKSVAKMTNVFRDVLEGLASKP 76 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~----~~Y~~rl~ra~~~n~~~~~v~~~~g~l 76 (455) ||++ +-.+...+.+ ++.+......+|+....-.-.+.+..+ ..+..-..|-+ +|.++.+|++.+|+- T Consensus 1 m~d~-----~~~~~~~~~~---~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~l~~q~rp~-~N~i~~~i~~v~g~e 71 (725) T protein:vir:92 1 MADN-----ENRLESILSR---FDADWTASDEARREAKNDLFFSRISQWDDWLSQYTTLQYRGQ-FDVVRPVVRKLVSEM 71 (725) T ss_pred CCch-----HHHHHHHHHH---HHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHHHHhcCCCc-ccchHHHHHHHHhhH Confidence 9963 2233333333 334444444444433221111223222 22322233444 699999999999987 Q ss_pred hcCCceec-----c----CCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEE--eeCCCCcch------hhh-- Q lcl|NC_020844. 77 FGREVEVT-----E----GTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRV--DYPAGQQFR------NRE-- 137 (455) Q Consensus 77 f~k~p~i~-----~----~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lV--d~p~~~~~~------t~a-- 137 (455) -...+.+. . ..+.+..++..+- .-++.+.-...++..++.+|.+|+=| ||.....+. ... T Consensus 72 ~~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~-~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~ 150 (725) T protein:vir:92 72 RQNPIDVLYRPKDGASPDAADVLMGMYRTDM-RHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSNNQVIRREPIH 150 (725) T ss_pred HhCCcceEEecCCccHHHHHHHHHHHHHHHH-HhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeecc Confidence 65554442 1 1233444554442 34677888889999999999999654 442111100 000 Q ss_pred h-HHhccCCceEEE--------------ech--------------hhhc------cceeeccCCeeEEEEEEEEeeeeEE Q lcl|NC_020844. 138 E-ERQAGVRPYWSQ--------------VNH--------------SAIL------EIQSERQGAGERITYMRMWEDAETI 182 (455) Q Consensus 138 d-~~~~~~rPy~~~--------------~~~--------------~~ii------~w~~~~~~~~~~l~~v~~~E~~e~~ 182 (455) + ...--.-|..+. ++. .++. +|..+-.. ... +++-|...+. T Consensus 151 ~~~~~V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~d~---vrv~e~~~r~ 226 (725) T protein:vir:92 151 SACSHVIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLT-QDT---IQIAEFYEVV 226 (725) T ss_pred CChhhcccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccC-CCe---EEEEEEEEEE Confidence 0 000000000000 000 0000 01000000 001 1111111110 Q ss_pred E----E--E-ec--CeE-------------------------------EEEEEeCCcEEEEeccCCCCCCceeEEEEEec Q lcl|NC_020844. 183 L----V--L-RP--DDW-------------------------------ERWRKNDAGIWEVDEFGRITIGVVPMVPFITG 222 (455) Q Consensus 183 ~----v--~-~~--~~~-------------------------------~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~ 222 (455) . + + ++ +.+ ++|.....|...+.+..+.+-+.+|||||+.. T Consensus 227 ~~~~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~ 306 (725) T protein:vir:92 227 EKKETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGE 306 (725) T ss_pred EEeeeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCceeeEEEEee Confidence 0 0 0 01 111 11111112222223333455677999998643 Q ss_pred cc--CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEe-c-CCCccccccCccceeeccceeeeccCCCCc Q lcl|NC_020844. 223 RR--KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSAN-G-VEPDTDSEGSPKPLDTGPDTVLYAPPNAEG 298 (455) Q Consensus 223 ~~--~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~-G-~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~ 298 (455) .. ++.+. ...-+-++.+...-.+...|-+.+++..++.-..... | ++.......+++...+...+......++. T Consensus 307 r~~~~g~~~-~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~- 384 (725) T protein:vir:92 307 WGFVEDKEV-YEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEM- 384 (725) T ss_pred eeccCCccc-ccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHhccCccceeeccccccccccc- Confidence 32 22211 1122344445554555555666666654433222211 1 11100000111111111000000000000 Q ss_pred cccceeEEeeCchhHHHHHHHHHHHHHHHHHhhh---hhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Q lcl|NC_020844. 299 QHGSWNFVEPATESLKFLAEQIESVSKEIRELGR---QPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALEN---- 371 (455) Q Consensus 299 ~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~---~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~---- 371 (455) ....+.+.++.. --.++...|+...+.|..+++ .++...+++.||.+...+..+....|..+-.|+..+... T Consensus 385 ~~~~i~~~~~~~-~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~ 463 (725) T protein:vir:92 385 PTQPLAYYENPE-VPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEI 463 (725) T ss_pred cccCCcccCCCC-chHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111233443322 223455666777777766533 344445566899999988877777777777776666554 Q ss_pred HHHHHHHHcCC---------CCCceEEEEec------------------ccCCC------C--CCHHHHHHHHHHHHc-C Q lcl|NC_020844. 372 ALYITNLWMDA---------PDTNPEADVYT------------------DFRAD------T--GEEQTPGYLLTMRQN-G 415 (455) Q Consensus 372 ~l~~~a~~~g~---------~~~~~~v~v~~------------------df~~~------~--~~~~~~~al~~~~~~-G 415 (455) +|.++.++++. ++....+.+|. .|+.. . .-.+.+.++.++..+ + T Consensus 464 lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql~~~~~ 543 (725) T protein:vir:92 464 YQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTP 543 (725) T ss_pred HHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHHHHHHHHHhcc Confidence 56677777743 11222334442 12211 0 012334555555443 1 Q ss_pred CCCHH---HHHHHHHhcCCCCCCCCHHHHHHHHHhhcc-cCCCC Q lcl|NC_020844. 416 DISRE---GLWMEFQRLGELSDNFDPEEESERLISELP-GGIEE 455 (455) Q Consensus 416 ~is~e---t~~~~l~~~~vl~~~~d~e~e~~ri~~e~~-~~l~~ 455 (455) -+... ++...+ .+++ .--.++..+++..+.+ ....+ T Consensus 544 ~~~~~~~~~l~~~~---~~~d-~~~~~e~~erirkq~~~~~~~~ 583 (725) T protein:vir:92 544 QGTPEYQLLLLQYF---TLLD-GKGVEMMRDYANKQLIQMGVKK 583 (725) T ss_pred cchhHHHHHHHHHh---hccc-chHHHHHHHHHHhhhchhccCC Confidence 11111 111111 1100 0112444556655432 22222 No 136 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=98.13 E-value=4e-06 Score=50.15 Aligned_cols=413 Identities=11% Similarity=0.069 Sum_probs=196.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCCC---C-CHHHHHH-HhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFPH---E-QNKRYDL-RKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~~---e-~~~~Y~~-rl~ra~~~n~~~~~v~~~~g 74 (455) |..+ ....+|+.++.--.= +..+++..+-.||.... . ....+.. +...-.|-+...+.|+.++. T Consensus 1 ~~~~----------~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las 70 (547) T protein:vir:10 1 MENS----------KIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSS 70 (547) T ss_pred CCHH----------HHHHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHH Confidence 4422 223333333221100 33344555555665421 1 1112211 12223566777777777776 Q ss_pred hhhcC--Cce-----ec------cCCHHHHHHHHhhc------cCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchh Q lcl|NC_020844. 75 KPFGR--EVE-----VT------EGTGPSEDLLEDID------GRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRN 135 (455) Q Consensus 75 ~lf~k--~p~-----i~------~~~~~l~~~~~d~d------~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t 135 (455) .+++- ||. ++ ......+.|++.+. ...+++..-+..+.++.+.+|-+-+++..+..... T Consensus 71 ~L~~~ltPp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~~~~~-- 148 (547) T protein:vir:10 71 SLHGSLTSPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDEDEEG-- 148 (547) T ss_pred HHHHhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCCCCCC-- Confidence 66321 111 10 11123555555421 12356888888999999999999888864322110 Q ss_pred hhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEe-----------------ee------------eEEEEEe Q lcl|NC_020844. 136 REEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWE-----------------DA------------ETILVLR 186 (455) Q Consensus 136 ~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E-----------------~~------------e~~~v~~ 186 (455) .+.+..|+..++.- ..+..| .+.+.+|-.+ .+ ..+.+++ T Consensus 149 ---------~~r~~~~pl~~~~v-~~d~~G--~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~ 216 (547) T protein:vir:10 149 ---------SVVFQSSPIQDSYF-EEDSRG--QVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVM 216 (547) T ss_pred ---------ceeEEEeecceEEE-eeCCCc--CeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEE Confidence 12355555554331 122222 2222222100 00 0111110 Q ss_pred ----c-----C------------eE-EEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHH Q lcl|NC_020844. 187 ----P-----D------------DW-ERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVA 244 (455) Q Consensus 187 ----~-----~------------~~-~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~ 244 (455) . + .| .+|...+++...+.++ ++...|++++++....++.+ +..|-.+...-... T Consensus 217 ~v~~~~~~~~~~~~~~~~~~~~~p~~s~~~e~~~~~~~l~es---g~~e~P~~~~Rw~~~~ge~Y-Grgp~~~~l~D~k~ 292 (547) T protein:vir:10 217 CVFTRYDKKQNRNAGTVLAPTERPFGKKWILKEGAVQLGEEG---GYYEMPAYAIRWRKSAGSQW-GFGPSHLALPDVLT 292 (547) T ss_pred EEeeccCCCCCccccceeeccccceeEEEEEecCceeeeecC---CcccCCeeeeeeeecCCccc-ccchHHHHHHHHHH Confidence 0 0 00 1222222222222222 35668999888877776665 67777666655555 Q ss_pred HHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHH Q lcl|NC_020844. 245 LFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVS 324 (455) Q Consensus 245 hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~ 324 (455) +...+-..-..++.+..|.+.+. +++..+++..+++.++.. .+ ..+++=++. +..+......|++++ T Consensus 293 L~~l~~~~l~~~~~~~~pp~~v~-------~~g~~~~~~~~pgg~~~~-~~----~~~v~pl~~-~~~~~~~~~~i~~~~ 359 (547) T protein:vir:10 293 ANRYVELVLRSSEKVIDPAIMVT-------ERGLISDIDLGASGLTVV-RD----MESMKPFES-RARFDVSSIQLTDLR 359 (547) T ss_pred HHHHHHHHHHHHHHHhcCceecc-------cccccccceecCCeeeec-CC----cccceeeec-ccchHHHHHHHHHHH Confidence 55555566777788888887653 111123466677766643 11 123443553 456777788899998 Q ss_pred HHHHHh--hhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHcCC-C--CCce----EEE Q lcl|NC_020844. 325 KEIREL--GRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKD-----ALENALYITNLWMDA-P--DTNP----EAD 390 (455) Q Consensus 325 ~~m~~~--g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~-----a~~~~l~~~a~~~g~-~--~~~~----~v~ 390 (455) +.|... .-.+...++.+.||++.+.+.......|.-.-..++. -+++++.++.+ .|. + +.+. ... T Consensus 360 ~rI~~af~~d~~~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r-~g~lP~~p~~l~~~~~~~ 438 (547) T protein:vir:10 360 SAVRRIYYVDQLQMKDSPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFR-AGKLGELPSKLLESGKAA 438 (547) T ss_pred HHHHHHhhhhhhhcCCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCchhhhccCcce Confidence 888653 2223334556789999999998888887776665543 23444444433 233 1 1111 012 Q ss_pred EecccCCCCC---CHHHHHHHHHHHH-----cCC-------CCHHHH-HHHHHhcCCCCCCCCHHHHHHHHHhhcccC-- Q lcl|NC_020844. 391 VYTDFRADTG---EEQTPGYLLTMRQ-----NGD-------ISREGL-WMEFQRLGELSDNFDPEEESERLISELPGG-- 452 (455) Q Consensus 391 v~~df~~~~~---~~~~~~al~~~~~-----~G~-------is~et~-~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~-- 452 (455) ++-.|...-- ....+.++....+ +++ |.-..+ .......||-...+-.++|.+.+.+|.+.. T Consensus 439 ~~v~~is~Laraq~~~~~~~i~~~~~~v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q 518 (547) T protein:vir:10 439 MDIVYTGPLSRAQKIDQAASIERWAGSTAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQTLMRPKAKVTSIRKNRSQTQQ 518 (547) T ss_pred EEEEeccHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhhcCCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHH Confidence 2222322111 1122222222222 121 222233 334455777655566677766555553221 Q ss_pred ------CCC Q lcl|NC_020844. 453 ------IEE 455 (455) Q Consensus 453 ------l~~ 455 (455) +.+ T Consensus 519 ~~~qaa~~~ 527 (547) T protein:vir:10 519 KAEQAAIAE 527 (547) T ss_pred HHHHHHHHH Confidence 111 No 137 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=98.12 E-value=4.2e-06 Score=50.07 Aligned_cols=397 Identities=10% Similarity=0.008 Sum_probs=173.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhh-hhcCCC-CCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNF-KRYVKQ-FPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g-~~yLpk-~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |..-+.+..+ +.+.-+ +.-.|..++++.. .-|.+. +.+ .+-+..|.+ ...++++|+..+.-..+ T Consensus 71 ~ds~~~~~~~-------~~~~~~-~~~~~~~~~~~~~~~~~~~~~f~g--yql~alY~~----~~l~rkiVd~pAeDa~R 136 (765) T protein:vir:96 71 MDSAYGDGPT-------PAAKAA-AGGQNPYVVPTMLQDWYNSQGFIG--YQACAIISQ----HWLVDKACSMSGEDAAR 136 (765) T ss_pred cccccccccc-------chHHHh-hhccCccchhhHHHhhhcccCCcc--HHHHHHHHh----CchhhhhhhcchHHhhc Confidence 2211111111 111111 1011122222221 112221 111 122233443 46778889999999999 Q ss_pred CCceeccCCH----HHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechh Q lcl|NC_020844. 79 REVEVTEGTG----PSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHS 154 (455) Q Consensus 79 k~p~i~~~~~----~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~ 154 (455) +...|+.+.. .....++.... -..+.+.++.+.+.+-.||.+++++....... .... .|+ .++ T Consensus 137 ~g~~I~~~~~e~~~~~~~~l~~~~~-rl~v~~~l~ea~~~~RlyGga~i~i~i~~~D~-~~l~-------~PL----~~~ 203 (765) T protein:vir:96 137 NGWELKSDGRKLSDEQSALIARRDM-EFRVKDNLVELNRFKNVFGVRIALFVVESDDP-DYYE-------KPF----NPD 203 (765) T ss_pred CCceeecCccccCHHHHHHHHHHHH-HhhHHHHHHHHHHHhhhceeeEEEEEecccCc-chhh-------ccc----ccc Confidence 9998864321 11122222111 12578888888888888999988876421110 0000 011 111 Q ss_pred hhccceeeccCCeeEEEEE-EEEeeee----------EEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeE----EEE Q lcl|NC_020844. 155 AILEIQSERQGAGERITYM-RMWEDAE----------TILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPM----VPF 219 (455) Q Consensus 155 ~ii~w~~~~~~~~~~l~~v-~~~E~~e----------~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~----V~f 219 (455) .|. .+....+..+ +++.... .-.++.|. .|...+. . -|.--.|.| ||. T Consensus 204 ~I~------kg~~kgl~vldp~~~~~~~v~e~~~Dp~sp~fg~P~---~y~i~g~---~-----IH~SRli~~~g~~lpd 266 (765) T protein:vir:96 204 GIA------PGSYKGISQIDPYWAMPQLTAESTADPSAEHFYEPD---FWIISGK---K-----YHRSHLVVVRGPQPPD 266 (765) T ss_pred ccc------cceeeEEEEechhhcccccchhccccccccccCcce---eeeecCc---e-----eccceEEEecCCCchh Confidence 110 0111111100 0000000 00011122 2221111 1 122112222 111 Q ss_pred EecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccc-cccCcc----ceeeccceeeeccC Q lcl|NC_020844. 220 ITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTD-SEGSPK----PLDTGPDTVLYAPP 294 (455) Q Consensus 220 ~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~-~~~~~~----~~~~g~~~~~~lp~ 294 (455) +..+. ..+ -+.|-|.-+.+.....-++...-.++++...+.++.+.+...-.. +..... ....+...+..+.. T Consensus 267 ~lk~~-~~~-~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~v~k~~~~~~l~~~~~l~~r~~~~~~~r~n~g~~~id~ 344 (765) T protein:vir:96 267 ILKPT-YIF-GGIPLTQRIYERVYAAERTANEAPLLAMSKRTSTIHVDVEKAIANEDAFNARLAFWIANRDNHGVKVIGI 344 (765) T ss_pred hhccc-cCc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeeechHhhhccHHHHHHHHHHHHHhcCCceeEEecC Confidence 11111 122 355555555555444445555667788888888887766532211 111100 00112223444432 Q ss_pred CCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhcc---CCC---cchhHHHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_020844. 295 NAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLT---AQS---GNLTRISAAQAASKGNSAVQQWAA-GLKD 367 (455) Q Consensus 295 ~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~---~~~---~~~sa~a~~~~~~~~~s~L~~~a~-~~~~ 367 (455) + .++..++.+-+++ .+.++...++|......|++ +++ -|.||.. +...-+..+..+.. .+.. T Consensus 345 e-----e~~e~~s~~lsgl---~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~---D~~nYyD~I~s~Qe~~l~p 413 (765) T protein:vir:96 345 D-----ETMEQFDTNLSDF---DSVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEH---ETISYHEELESIQEHIFDP 413 (765) T ss_pred C-----cceeEEecccCCH---HHHHHHHHHHHHhhhCCCeeeeccCCcccccCcchH---HHHHHHHHHHHHHHHHHHH Confidence 2 2455555554554 55666667777766655532 221 2344443 34445555555553 4677 Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEecccCCCCCC-----HHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHH Q lcl|NC_020844. 368 ALENALYITNLWMDAPDTNPEADVYTDFRADTGE-----EQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEES 442 (455) Q Consensus 368 a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~-----~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~ 442 (455) .++++++++.+..+.++ +.+|+++.=+....-+ ...+++...+++.|+|+.+..++.|+..+......-.+++. T Consensus 414 ~le~L~~li~~s~~i~~-d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~ 492 (765) T protein:vir:96 414 LLERHYLLLAKSESIDV-QLEIVWNPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRSGYNRLTDDQA 492 (765) T ss_pred HHHHHHHHHHHhcCCCC-cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccCCCCCCCcccc Confidence 89999999988755543 4555544211111111 12246778888999999999999998755433221111211 Q ss_pred H---HHHhhcccCCCC Q lcl|NC_020844. 443 E---RLISELPGGIEE 455 (455) Q Consensus 443 ~---ri~~e~~~~l~~ 455 (455) + -+..+..+.+.+ T Consensus 493 e~~~~~~pe~~~~~~~ 508 (765) T protein:vir:96 493 ETEPGMSPENLAELEK 508 (765) T ss_pred ccccCCCccccccccC Confidence 1 111100000110 No 138 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=98.04 E-value=6.3e-06 Score=49.08 Aligned_cols=391 Identities=9% Similarity=0.004 Sum_probs=178.5 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCC-CHHHH-HHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHE-QNKRY-DLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e-~~~~Y-~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |- .... ......|... ++....|-|..+.. +-..+ ..|.. ..+++++|+..+.-.++ T Consensus 1 ~~-------------~~D~---~~~~~~~~g~-~~~~~~~~~~~~~~~~~~~l~a~Y~~----~~l~~~~vd~~a~d~~r 59 (437) T protein:vir:52 1 MK-------------FFDG---IKSLALKLGS-KQEQTYYSPSLSLTDDLVQLEALWRD----NWIANKVCIKRPEDMVR 59 (437) T ss_pred Cc-------------hhhh---hHhHHhcCCC-ccccceeecCccccccHHHHHHHHHh----CchhhHHhhcchHHhhc Confidence 11 0011 1111222111 11112232322211 11222 33443 47788899999999999 Q ss_pred CCceeccC--CH----HHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhc-cCCceEEEe Q lcl|NC_020844. 79 REVEVTEG--TG----PSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQA-GVRPYWSQV 151 (455) Q Consensus 79 k~p~i~~~--~~----~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~-~~rPy~~~~ 151 (455) ++..|... ++ .++..++. ..+.+.++.+.+.+-.||.+++++...... .. +.... +.--++..+ T Consensus 60 ~~~~i~~~d~~~~~~~~~~~~~~~-----l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~-~~---~pl~~~~~~~~~~v~ 130 (437) T protein:vir:52 60 NWREIYSNDLNSKQLDLFTKFERS-----LKLRETLTKALQWSSLYGSVGLLVVTDSQN-TS---APLKPTERLKRLIIL 130 (437) T ss_pred CCceEecCCCCHHHHHHHHHHHHh-----hcHHHHHHHHHHhcccccceEEEEEecCCC-cc---cccccCCceeEEEEe Confidence 99999542 22 23444433 256777777777777899999998643211 10 00000 000012222 Q ss_pred chhhhccceeeccCCeeEEEEEEEEeeeeEEEEEe--cCeEEEEEEeCCc-EEEEeccCCCCCCceeEEEEEecccCCCc Q lcl|NC_020844. 152 NHSAILEIQSERQGAGERITYMRMWEDAETILVLR--PDDWERWRKNDAG-IWEVDEFGRITIGVVPMVPFITGRRKGKK 228 (455) Q Consensus 152 ~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~--~~~~~~~~~~~~g-~~~~~~~g~~~l~~vP~V~f~~~~~~~~~ 228 (455) ++.+|-. +... .. -... .+....|+..+++ ...+ |+--.|.|.-.. .+..... T Consensus 131 ~~~~v~~--------------~~~~-~~---dp~s~~fg~p~~y~v~~~~~~~~i-----H~SRii~~~~~~-~~~~~~~ 186 (437) T protein:vir:52 131 PKWKISP--------------TGTK-DD---DVLSPNFGRYSEYSILGGSQSITV-----HHSRLIILNAND-APLSDND 186 (437) T ss_pred chhhccc--------------cccc-cc---cccccccCcceEEEEecCCcceeE-----ccceeEEecCcc-CCCcccc Confidence 2211110 0000 00 0000 0112223222221 1111 222223331110 0112233 Q ss_pred ccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccC------ccce--eeccceeeeccCCCCccc Q lcl|NC_020844. 229 WVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGS------PKPL--DTGPDTVLYAPPNAEGQH 300 (455) Q Consensus 229 ~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~------~~~~--~~g~~~~~~lp~~~~~~~ 300 (455) ..+.|.|.-+.+.....-++...-..+++...++++-+.|+...-...+. -..+ .-+...+..+..+ T Consensus 187 ~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~----- 261 (437) T protein:vir:52 187 IWGVSDLEKIIDVLKRFDSASVNVGDLIFESKIDIFKIAGLSDKIAAGMENEVASVISAVQEIKSATNSLLLDAE----- 261 (437) T ss_pred ccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCCceecchHHHHhcCCcHHHHHHHHHHHHHhcCCCceEEEcCC----- Confidence 34788887777765555555555577788888888888775321111000 0000 1122344555432 Q ss_pred cceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhccCCCcc-hhHHH-HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_020844. 301 GSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLTAQSGN-LTRIS-AAQAASKGNSAVQQWAA-GLKDALENALYITN 377 (455) Q Consensus 301 ~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~~~~~-~sa~a-~~~~~~~~~s~L~~~a~-~~~~a~~~~l~~~a 377 (455) .++..++.+.+++ .+.++...++|......|++.-.+. .+|-+ ...+...-+..+..+.. .+...++.+++++. T Consensus 262 ~~~e~~~~~~sgl---~~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge~D~~~yyd~i~~~Qe~~l~p~le~l~~~i~ 338 (437) T protein:vir:52 262 NEYDRKELTFTGL---KDLLTEFRNAVAGAADMPVTILFGQSVSGLASGDEDIQNYHEAIRRLQETRLRPIFEIIDPLIC 338 (437) T ss_pred cceEEEecCcCCH---HHHHHHHHHHHHHHhcCchhhhcCcCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2566666555554 4556666777776655554321111 11111 12344455555666654 46777888887665 Q ss_pred HHc-CCCCCceEEEEecccCCCCCC-----HHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHh-hcc Q lcl|NC_020844. 378 LWM-DAPDTNPEADVYTDFRADTGE-----EQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLIS-ELP 450 (455) Q Consensus 378 ~~~-g~~~~~~~v~v~~df~~~~~~-----~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~-e~~ 450 (455) .-. |.-+.+.+|+++.=.....-+ ...+++..++.++|+|+.+..++.|+..|+.+. ++.++ ++..+. +.+ T Consensus 339 ~~~~g~~~~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~g~i~~~e~r~~L~~~g~~~~-i~~~~-~~~~~~~~~~ 416 (437) T protein:vir:52 339 NELFGGLPADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQNGVLNEYQIANELRESGLFAN-ISAEH-IEELKNADEF 416 (437) T ss_pred HHhcCCCCCcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCC-CCccc-cccccCCCCC Confidence 432 322335555554211111111 123456778889999999999999999888652 33222 111111 111 Q ss_pred cCCCC Q lcl|NC_020844. 451 GGIEE 455 (455) Q Consensus 451 ~~l~~ 455 (455) ++-.+ T Consensus 417 ~~~~~ 421 (437) T protein:vir:52 417 AGNFE 421 (437) T ss_pred CCccC Confidence 10000 No 139 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=98.03 E-value=6.7e-06 Score=48.96 Aligned_cols=412 Identities=10% Similarity=0.050 Sum_probs=188.3 Q ss_pred CCCCC---CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCH--HHHHH-Hhhcc----ccCChHHHHHH Q lcl|NC_020844. 1 MPEQD---PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQN--KRYDL-RKSVA----KMTNVFRDVLE 70 (455) Q Consensus 1 m~~~~---v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~--~~Y~~-rl~ra----~~~n~~~~~v~ 70 (455) |.... +....|........+.. . -|.|-.........+-|..-.-+. ..... ...|| .=.++.+..|+ T Consensus 1 m~~~~~r~~~~~a~~~~~~~~~~~~-~-~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~ 78 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPEQSASLGG-G-GLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVG 78 (553) T ss_pred Ccchhhhhhcccccccchhhhhhhc-c-cccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Confidence 55443 22222222233333332 2 344321111111234443221111 11111 11111 12678888888 Q ss_pred HHhhhhhcCCceeccCC-----------------H----HHHHHHHh----hccCC-CCHHHHHHHHHHHHHhhCcEEEE Q lcl|NC_020844. 71 GLASKPFGREVEVTEGT-----------------G----PSEDLLED----IDGRG-NNLTTFAGQLFFQGVADSISWIR 124 (455) Q Consensus 71 ~~~g~lf~k~p~i~~~~-----------------~----~l~~~~~d----~d~~G-~~l~~~~~~~~~~al~~G~~~~l 124 (455) .++..+.+..++....| . .++.|.++ +|-.| .+++++.+.+.+..+..|-|++. T Consensus 79 ~~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~ 158 (553) T protein:vir:63 79 YQRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLAT 158 (553) T ss_pred HHHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEE Confidence 88888887766553211 1 13444432 24444 48999999999999999999988 Q ss_pred EeeCCCCcchhhhhHHhccCCc-eEEEechhhhccceeeccCCeeEEEE-EEEEeeeeEEEEEecCeEEEEEEeCCc--- Q lcl|NC_020844. 125 VDYPAGQQFRNREEERQAGVRP-YWSQVNHSAILEIQSERQGAGERITY-MRMWEDAETILVLRPDDWERWRKNDAG--- 199 (455) Q Consensus 125 Vd~p~~~~~~t~ad~~~~~~rP-y~~~~~~~~ii~w~~~~~~~~~~l~~-v~~~E~~e~~~v~~~~~~~~~~~~~~g--- 199 (455) .-+.+..... -| -+-+++|+.|=+......++ . +.. |.+.+.- .+-.|.+++...+. T Consensus 159 ~~~~~~~~~~----------~~~~lq~ie~drl~~~~~~~~~~-~-i~~GVE~d~~G------r~vaY~i~~~hPgd~~~ 220 (553) T protein:vir:63 159 AEWDRAANRP----------YATCFQMVSTDRLSNPYQQLDTP-T-LRRGVQYDKRG------RPQGYWIQVAHPGDLYQ 220 (553) T ss_pred eeeccCCCCc----------ccceEEEechhhcCCCCCCCCCC-e-eEeeeEECCCC------ceEEEEeeccCCCcccc Confidence 7554322210 12 26778888886544332222 1 111 1111110 11123333322211 Q ss_pred ------EEEEec-cCCCCC-CceeEEEEEecccCCCcccCcchHHHHHHHHHHHHH-hHHHHHHHHHHcCCceEEEecCC Q lcl|NC_020844. 200 ------IWEVDE-FGRITI-GVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFR-KECALEFAEMMTAFPMLSANGVE 270 (455) Q Consensus 200 ------~~~~~~-~g~~~l-~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~-~~sd~~~~l~~~~~p~~~~~G~~ 270 (455) .|..+. ....+- .+|.+ + .....+-.-+.|.|..++..-..+.. ..+.+..+.--+++...+-++.. T Consensus 221 ~~~~~~~~~r~~~~~~v~a~~vlH~---f-~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~ 296 (553) T protein:vir:63 221 MAPDMYKWKFVQQSKPWGRRQVIHI---L-EPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELP 296 (553) T ss_pred ccccccceeeeccccccChhHheec---c-cccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCC Confidence 121111 111111 11222 2 33334445677777776653322222 12444444444444444433321 Q ss_pred Ccccc--------------------------ccCccceeeccceeeeccCCCCccccceeEEeeC-c-hhHHHHHHHHHH Q lcl|NC_020844. 271 PDTDS--------------------------EGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPA-T-ESLKFLAEQIES 322 (455) Q Consensus 271 ~~~~~--------------------------~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~-~-~~l~~~~~~l~~ 322 (455) .+... ......+.++++++..|+.+ -+++|++++ + .++ ...++. T Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG-----e~i~~~~p~~p~~~~---~~F~~~ 368 (553) T protein:vir:63 297 PEFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPG-----TKLNLKPMGTPGGVG---SEFEAS 368 (553) T ss_pred hhhhhhhcccccccccccccccccccccccccccccceeecCceeeecCCC-----CeeeecCCCCCCCCH---HHHHHH Confidence 11000 00122467889999988744 379998876 2 344 444455 Q ss_pred HHHHHHH-h--hhhhccCC--Ccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH---HHHHcCC---CCCceEE Q lcl|NC_020844. 323 VSKEIRE-L--GRQPLTAQ--SGN-LTRISAAQAASKGNSAVQQWAAGLKDALEN-ALYI---TNLWMDA---PDTNPEA 389 (455) Q Consensus 323 ~~~~m~~-~--g~~~l~~~--~~~-~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~-~l~~---~a~~~g~---~~~~~~v 389 (455) +...|.. + +...+.+. ..| .|+-+...++.......+.+ |...+-+ ++++ .|.+.|. +.....+ T Consensus 369 ~lr~iaaglGi~Ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~---~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~ 445 (553) T protein:vir:63 369 LNRHLASAFGMSYEEFTRDFSKANYSSIQAGIAMTRRFLEGRKKM---CADRLATEFFTLWLEEAIAAGEVPMPPGQTRD 445 (553) T ss_pred HHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHcCCccCCCcccch Confidence 5555543 2 22334443 234 45545555555544444332 2222222 2222 2233342 1100000 Q ss_pred E----------EecccCCC----CCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcc----c Q lcl|NC_020844. 390 D----------VYTDFRAD----TGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELP----G 451 (455) Q Consensus 390 ~----------v~~df~~~----~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~----~ 451 (455) . +.-.|... ..+..++++...++.+|+.|.+....+. ..||++.++.++.|.. . T Consensus 446 ~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a~~--------G~D~~~v~~q~a~e~~~~~~~ 517 (553) T protein:vir:63 446 LFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDAGLSTYEREIARL--------GGDFRKSFAQRAREDALLKKY 517 (553) T ss_pred hhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHh--------CCCHHHHHHHHHHHHHHHHHc Confidence 0 01112211 1246889999999999999999765543 3477787777777642 2 Q ss_pred CCCC Q lcl|NC_020844. 452 GIEE 455 (455) Q Consensus 452 ~l~~ 455 (455) ||-- T Consensus 518 Gl~~ 521 (553) T protein:vir:63 518 GLTF 521 (553) T ss_pred CCCC Confidence 2311 No 140 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=98.02 E-value=6.7e-06 Score=48.93 Aligned_cols=415 Identities=9% Similarity=0.042 Sum_probs=185.8 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCC-CCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQ-FPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk-~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |-...=...-=.-.....+|+.++.-..- +..+++..+-.||- .+.+... .+.++ .|-+...+.++.++..+.+ T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~---~~~~~-~~dstg~~a~~~LAa~l~~ 76 (516) T protein:vir:10 1 MKQSTDLEYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPYLMNDKGDN---ETSQN-GWQGVGAQATNHLANKLAQ 76 (516) T ss_pred CCchhhHhhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccCCCCCc---ccccc-cccchHHHHHHHHHHHHHh Confidence 55321111111112334455544322211 34456666666663 2222211 12222 4555556666666655521 Q ss_pred C--Cce-----ecc-------------CCHHHHHHHHhhc------cCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCc Q lcl|NC_020844. 79 R--EVE-----VTE-------------GTGPSEDLLEDID------GRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQ 132 (455) Q Consensus 79 k--~p~-----i~~-------------~~~~l~~~~~d~d------~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~ 132 (455) - ||. ++- ....++.|++.+. ...++++.-+..+.++...+|.+.+++|.+ +. T Consensus 77 ~ltpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~--~~ 154 (516) T protein:vir:10 77 VLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKPSK--GA 154 (516) T ss_pred hhcCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEecCC--CC Confidence 0 111 110 1112566666542 235678888889999999999998887632 11 Q ss_pred chhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeee-------------------------eEEEEE-- Q lcl|NC_020844. 133 FRNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDA-------------------------ETILVL-- 185 (455) Q Consensus 133 ~~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~-------------------------e~~~v~-- 185 (455) ++.|+-.+.. +..+..|+ +++.+ +++.. +.+.++ T Consensus 155 ---------------~~~~pl~~y~-v~~d~~G~--v~~iv-rr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~ 215 (516) T protein:vir:10 155 ---------------ISAIPMHHYV-VNRDTNGD--LLDII-LLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTH 215 (516) T ss_pred ---------------eEEEEcCeEE-EeeCCCCC--eEEEe-eeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEE Confidence 2333333322 22222222 22211 11110 112222 Q ss_pred ---ecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCc Q lcl|NC_020844. 186 ---RPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFP 262 (455) Q Consensus 186 ---~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p 262 (455) .++.+..|....++. .+..++..++...|++++++....++.+ |.+|-.+...-.......+-.+-.....+..| T Consensus 216 v~~~~~~~~~~~~~~d~~-~~~~~s~~~~~e~P~~~~Rw~~~~ge~Y-Grgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~ 293 (516) T protein:vir:10 216 AKYLGEGFWELKQSADDI-PVGKVSKIKSEKLPFIPLTWKRSYGEDW-GRPLAEDYSGDLFVIQFLSEAVARGAALMADI 293 (516) T ss_pred EEecCCCceEEEEeeCce-eeccccccccccCCeeeeeeeecCCCCc-ccchHHHhhHHHHHHHHHHHHHHHHHHHhcCC Confidence 222222222222222 2344556667789999998877776665 55564443332223333333344444555655 Q ss_pred eEEEe--cCCCccccccCccceeeccceeeeccCCCCccccceeEEeeC-chhHHHHHHHHHHHHHHHHHh--hhhhccC Q lcl|NC_020844. 263 MLSAN--GVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPA-TESLKFLAEQIESVSKEIREL--GRQPLTA 337 (455) Q Consensus 263 ~~~~~--G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~-~~~l~~~~~~l~~~~~~m~~~--g~~~l~~ 337 (455) .+.+. |.. ++..+.-|....+ +|.. ..+++.++.. +..+....+.|+++++.|... ...+... T Consensus 294 ~~lv~p~g~~-------~~~~l~~~~~g~~-~~g~----~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~r 361 (516) T protein:vir:10 294 KYLIRPGAQT-------DVDHFVNSGTGEV-VTGV----EEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRR 361 (516) T ss_pred CcccCccccc-------chhhhccCCCcee-ecCC----cccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhhhhcc Confidence 55542 221 1112222222222 3321 2245556644 346788888899999988663 2223344 Q ss_pred CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH-HHHHHcCCCCCc-eEEEEecccCCCCCCHHHHHHHHHHHHc Q lcl|NC_020844. 338 QSGNLTRISAAQAASKGNSAVQQWAAGLKDAL-ENALY-ITNLWMDAPDTN-PEADVYTDFRADTGEEQTPGYLLTMRQN 414 (455) Q Consensus 338 ~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~-~~~l~-~~a~~~g~~~~~-~~v~v~~df~~~~~~~~~~~al~~~~~~ 414 (455) ++.+.||++...+.......|.-.-..++.=| .-++. .....+...+.+ +.+++ ..|...---++.++.+....+. T Consensus 362 d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~~~p~~P~~lv~~~~-v~~i~~L~raq~~~~i~~~~q~ 440 (516) T protein:vir:10 362 DAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGDSFTSDLVDPVI-ITGIEALGRMAELDKLANFAQY 440 (516) T ss_pred CCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhCCCCChhhcCcce-ehhHHHHHHHHHHHHHHHHHHH Confidence 45568999999988887777766555544332 22222 222222222221 11222 1121110112233333222220 Q ss_pred -C---CCCHH---------HHHHHHHhcCCCCCCCCHHHHHHHHHhhccc---C--------------CCC Q lcl|NC_020844. 415 -G---DISRE---------GLWMEFQRLGELSDNFDPEEESERLISELPG---G--------------IEE 455 (455) Q Consensus 415 -G---~is~e---------t~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~---~--------------l~~ 455 (455) | -++.+ .+.......|+-...+..++|.+.+.+|... . +.+ T Consensus 441 i~~~~q~~p~v~d~id~d~~~~~~a~~~gvp~~~irs~eev~~~r~~~~~~q~~~~~~~~~~~~~~~~~~~ 511 (516) T protein:vir:10 441 MSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMEQEQEAQMQAQQAQMLEEGVAKAVPGVIQQ 511 (516) T ss_pred HHHHhcCChHHHhhcCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhcccchhhh Confidence 1 12221 1222344566655555666666666444311 1 111 No 141 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=98.01 E-value=7.1e-06 Score=48.81 Aligned_cols=403 Identities=10% Similarity=-0.058 Sum_probs=180.2 Q ss_pred CCCCCCCc-cCHHHHHHHHH----HHHHHHHhcCHHHH-HHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQDPTK-RSADAEAMSDY----LQTVDDVLEGVTGM-RRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~v~~-~~p~y~~~~~~----w~~i~d~~~G~~~~-r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g 74 (455) |- |.. +.|+..+.... +.+.+-.+ +.+.+ |+.|. ...++.+-|+....+ ..++...+++--. T Consensus 3 ~~---~~~~p~~~~~~~~~~~~~~~~~~~g~~-~~D~~lr~~gg-----~~~~~~~l~~~m~e~---D~~v~s~l~~Rk~ 70 (446) T protein:vir:98 3 ME---VRNAPTPAIRRRTIYAMEHLGLATSYL-SEDGGYKRAGK-----PTYQQLSAWDEAAQT---EPIIAQGLDSIAL 70 (446) T ss_pred cc---ccCCCchhhhhhhhhccccchhhcccC-CcchHhhhcCC-----ChHHHHHHHHHHHhc---chHHHHHHHHHHH Confidence 44 433 66765544321 11111101 11111 11110 000122445555443 5788888888888 Q ss_pred hhhcCCceeccCCHHH----HHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCC-CCcch-hhhhHHhccCCceE Q lcl|NC_020844. 75 KPFGREVEVTEGTGPS----EDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPA-GQQFR-NREEERQAGVRPYW 148 (455) Q Consensus 75 ~lf~k~p~i~~~~~~l----~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~-~~~~~-t~ad~~~~~~rPy~ 148 (455) .+.+.+++++-.++.. ++++.+. .++..+.. ...++.||.+.+=+.+-. .+... .+..... T Consensus 71 av~~~~w~V~p~~~~~a~~v~~~l~~~-----~~~~~~~~-~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~------- 137 (446) T protein:vir:98 71 SVLNKVGPYQHGDKRIKKFIDDQLRNR-----AKTWISHC-VKSIMTYGFSLSEQIYAHGARDNMPATVLDDI------- 137 (446) T ss_pred HhhcCCceecCccHHHHHHHHHHHhhc-----CchhHHHH-HHHHHhhCceeeeEEEeecccccccchhhccc------- Confidence 8899999886433333 4444443 34555544 458888997776443322 22110 0000000 Q ss_pred EEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCcee---EEEEEecccC Q lcl|NC_020844. 149 SQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVP---MVPFITGRRK 225 (455) Q Consensus 149 ~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP---~V~f~~~~~~ 225 (455) ..+.|..+. |..+..+....-.. ....++..........++... ........+.+ -.|| ||.+...+.. T Consensus 138 ~~~~~~~~r-~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~g~~--~~iP~~kfi~~~~~~~~ 209 (446) T protein:vir:98 138 VNYHPLQVM-LIANDNGRIVDGDT----VTASQYKSGYWVPLPPYRIGD-PPKKVDVVGSH--VRLPSHKRLFINYNTKG 209 (446) T ss_pred cccccccce-eeeccCCccccccc----cchhhcccccccCcccchhhh-hhhhcccCccc--ccccccceEEEEecCCC Confidence 001111110 22222111000000 000000000000000000000 00000000111 1245 3444455444 Q ss_pred CCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEe---cCCCccccc-------------cCccceeecccee Q lcl|NC_020844. 226 GKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSAN---GVEPDTDSE-------------GSPKPLDTGPDTV 289 (455) Q Consensus 226 ~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~---G~~~~~~~~-------------~~~~~~~~g~~~~ 289 (455) + ...+.+.|.-++..-+=-....-+...-++.-+.|+++.+ |.+..+.+. .......++.+.+ T Consensus 210 ~-~p~G~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~da~ 288 (446) T protein:vir:98 210 N-NPWGTSCLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRLSTDSG 288 (446) T ss_pred C-CccccchHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhccccce Confidence 3 3456666666665443333345566777888899999987 333222110 1111234566666 Q ss_pred eeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHh--hhhhccCC--CcchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 290 LYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIREL--GRQPLTAQ--SGNLTRISAAQAASKGNSAVQQWAAGL 365 (455) Q Consensus 290 ~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~--g~~~l~~~--~~~~sa~a~~~~~~~~~s~L~~~a~~~ 365 (455) ..+|....+.+..++|++..+++-..++..++...++|... |..+..++ +..-|.........-..-.+..-+..+ T Consensus 289 ~ii~~~~~P~g~eie~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala~vh~~V~~d~~~aDa~~i 368 (446) T protein:vir:98 289 LVLTQLSKEQPVQVGALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRASEIQLELFDGKINSIFDTV 368 (446) T ss_pred eeeecccCCCCceEEeeccccCChhhHHHHHHHHHHHHHHHHhcccccccccccccchhhhHHHHHHHHHHHHHHHHHHH Confidence 66644433455689999988776556788888888888653 33222121 112222223333333445567788888 Q ss_pred HHHHH-HHHHHHHHHcCCCCCceEEEE-----ecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHH Q lcl|NC_020844. 366 KDALE-NALYITNLWMDAPDTNPEADV-----YTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPE 439 (455) Q Consensus 366 ~~a~~-~~l~~~a~~~g~~~~~~~v~v-----~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e 439 (455) .++++ +++++++.|-+-.. .....+ ..+|.....-...++++.++.+.|.+-..+--...++.|+-+.+-| T Consensus 369 ~~tln~~Li~~l~~lNf~~~-~~~~~~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~~~~~ire~~giP~~~~~-- 445 (446) T protein:vir:98 369 IHAFTEQVIGNLIRLNFDPA-LYPLASNTGYITRLPGRATDLAALVEAIKQMHDMGFLVDGDKDHIRSITGLPDAISS-- 445 (446) T ss_pred HHHHHHHHHHHHHHhCCCcc-ccccccccccceeccCChhhHHHHHHHHHHHHhCCccccccHHHHHHHhCcCCCCCC-- Confidence 88897 68899988764322 221211 1222221111334667777888887532111112345566332222 Q ss_pred H Q lcl|NC_020844. 440 E 440 (455) Q Consensus 440 ~ 440 (455) . T Consensus 446 ~ 446 (446) T protein:vir:98 446 T 446 (446) T ss_pred C Confidence 2 No 142 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=98.01 E-value=7.2e-06 Score=48.78 Aligned_cols=413 Identities=9% Similarity=0.027 Sum_probs=185.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCC-CCCHHHHHHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFP-HEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~-~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |.++-=...-=+-.....+|+.+++-..- ...+++..+-.||..- .+.. ..+.+| .|-+...+.++.++..+.+ T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~---~~~~~~-~~dstg~~a~~~LAa~l~~ 76 (516) T protein:vir:96 1 MKQSIDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPYLMNDKGD---NETSQN-GWQGVGAQATNHLANKLAQ 76 (516) T ss_pred CcchhhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhcccccCCCCC---ccccCC-cccchHHHHHHHHHHHHHh Confidence 76331111111112334455555443322 3455666666666422 1111 122222 4566666666666655521 Q ss_pred C--Cc-----eecc-------------CCHHHHHHHHhhcc------CCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCc Q lcl|NC_020844. 79 R--EV-----EVTE-------------GTGPSEDLLEDIDG------RGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQ 132 (455) Q Consensus 79 k--~p-----~i~~-------------~~~~l~~~~~d~d~------~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~ 132 (455) - || .++- ....++.|++.+.. ..++++.-+..+.++...+|.+.+++|.+ +. T Consensus 77 ~ltpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~--~~ 154 (516) T protein:vir:96 77 VLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKPSK--GA 154 (516) T ss_pred hhcCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEecCC--CC Confidence 0 11 1110 11135555544432 34578888889999999999988887632 21 Q ss_pred chhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEee-------------------------eeEEEEE-- Q lcl|NC_020844. 133 FRNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWED-------------------------AETILVL-- 185 (455) Q Consensus 133 ~~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~-------------------------~e~~~v~-- 185 (455) ++.|+-.+.. +..+..|+ +++.+ +++. .+.+.|+ T Consensus 155 ---------------~~~~pl~~y~-v~~d~~G~--v~~i~-rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~ 215 (516) T protein:vir:96 155 ---------------ISAIPMHHYV-VNRDTNGD--LLDII-LLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTH 215 (516) T ss_pred ---------------EEEEEcCeEE-EeeCCCCC--eeeeh-hhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEe Confidence 2223322211 11222221 11111 1110 0112222 Q ss_pred ---ecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCc Q lcl|NC_020844. 186 ---RPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFP 262 (455) Q Consensus 186 ---~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p 262 (455) .++.+..|....++. .+..++..++...|++++++....++.+ |.+|-.+...-.......+-.+-.....+..| T Consensus 216 v~~~~~~~~~~~~~~d~~-~~~~es~~~~~e~P~~~~Rw~~~~ge~Y-Grgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~ 293 (516) T protein:vir:96 216 AKYLGDGFWELKQSADDI-PVGKVSKIKSEKLPFIPLTWKRSYGEDW-GRPLAEDYSGDLFVIQFLSEAVARGAALMADI 293 (516) T ss_pred eeeeCCceeEEEEEeCce-eeccccccccccCCeeeeeeeecCCCCc-ccchHHHhhHHHHHHHHHHHHHHHHHHHhcCC Confidence 222222222222222 2344566677789999998877776665 55564444433333333333445555666666 Q ss_pred eEEEe--cCCCccccccCccceeeccceeeeccCCCCccccceeEEeeC-chhHHHHHHHHHHHHHHHHHh--hhhhccC Q lcl|NC_020844. 263 MLSAN--GVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPA-TESLKFLAEQIESVSKEIREL--GRQPLTA 337 (455) Q Consensus 263 ~~~~~--G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~-~~~l~~~~~~l~~~~~~m~~~--g~~~l~~ 337 (455) .+.+. |+. ++..+.-|....+ +|.. ..+++.++.. +..+....+.|+++++.|... ...+... T Consensus 294 ~~lv~p~g~~-------~~~~l~~~~~g~i-~~g~----~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~r 361 (516) T protein:vir:96 294 KYLIRPGAQT-------DVDHFVNSGTGEV-VTGV----EEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRR 361 (516) T ss_pred ccccCccccc-------chhhhccCCCcee-ecCC----cccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhccC Confidence 65553 221 1222233332222 3421 2245556544 346788888899999888663 2223333 Q ss_pred CCcchhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHcCCCCCceEEEEecccCCCCCC----HHHHHHHHHHH Q lcl|NC_020844. 338 QSGNLTRISAAQAASKGNSAVQQWAAGLKD-ALENALYITNLWMDAPDTNPEADVYTDFRADTGE----EQTPGYLLTMR 412 (455) Q Consensus 338 ~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~-a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~----~~~~~al~~~~ 412 (455) ++.+.||++...+.......|.-.-..++. -+.-++..+..-++-...+.. ++.++.. .+. ++.+..+.... T Consensus 362 ~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~~p~lp~~~--v~~~~vs-~l~~l~r~~~~~~i~~~~ 438 (516) T protein:vir:96 362 DAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGESFTSDL--VDPVIIT-GIEALGRMAELDKLANFA 438 (516) T ss_pred CCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcCCCCcccc--ccceeec-hHHHHHHHHHHHHHHHHH Confidence 455689999999888877777665555443 222333333222332211111 2222221 111 12222222222 Q ss_pred Hc-CC---CCH--------HHH-HHHHHhcCCCCCCCCHHHHHHHHHhhc-ccCCCC Q lcl|NC_020844. 413 QN-GD---ISR--------EGL-WMEFQRLGELSDNFDPEEESERLISEL-PGGIEE 455 (455) Q Consensus 413 ~~-G~---is~--------et~-~~~l~~~~vl~~~~d~e~e~~ri~~e~-~~~l~~ 455 (455) +. |. ++. ..+ .......||-...+..++|.+.+.++. +....+ T Consensus 439 ~~i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~~~~~~~~q~~~ 495 (516) T protein:vir:96 439 QYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMAQEQEAQMQAQQAQ 495 (516) T ss_pred HHHHHHhcCChhHHhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHH Confidence 21 11 111 122 233445676554545555555333331 111111 No 143 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=97.92 E-value=1.1e-05 Score=47.82 Aligned_cols=437 Identities=13% Similarity=0.029 Sum_probs=169.9 Q ss_pred CCCCCCCc--cCHHHHHHHHHHHHHHHH---hcCHHHHHHhhhhcCCCCCCCCHHHHHHH-hhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQDPTK--RSADAEAMSDYLQTVDDV---LEGVTGMRRNFKRYVKQFPHEQNKRYDLR-KSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~v~~--~~p~y~~~~~~w~~i~d~---~~G~~~~r~~g~~yLpk~~~e~~~~Y~~r-l~ra~~~n~~~~~v~~~~g 74 (455) .|-++|+. +.|+|....-.=++..|| ......++-+...+|-.+.++.+..-... =..+++++-++.+|+.+.+ T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~grs~vv~~~v~~~ve~~~~ 88 (763) T protein:vir:95 9 VPLPDPSQATKLTSWKNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKAKPPKVKGRSQVQPKLVRRQAEWRYS 88 (763) T ss_pred CCCccccchhcCCCCCChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccCcccccCCCccccCHHHHHHHHHHHH Confidence 23233322 223332211111112222 22222222222223332222221110000 1356788989999999988 Q ss_pred hhhc----CCceec------cCCHH---HHHHHHh-hccCCCCHHHHHHHHHHHHHhhCcEEEEEeeC----CCCcchhh Q lcl|NC_020844. 75 KPFG----REVEVT------EGTGP---SEDLLED-IDGRGNNLTTFAGQLFFQGVADSISWIRVDYP----AGQQFRNR 136 (455) Q Consensus 75 ~lf~----k~p~i~------~~~~~---l~~~~~d-~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p----~~~~~~t~ 136 (455) .|.+ .+..+. .+.+. ...++.= ++.+ ++=..++..+++.+|.+|.+.+=|+.. ........ T Consensus 89 ~l~~~f~~~~~~~~~~P~~~~D~~~A~q~t~~~n~~~~~~-~~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~~~~~~~~~ 167 (763) T protein:vir:95 89 ALTEPFLGSNKLFKVTPVTWEDVQGARQNELVLNYQFRTK-LNRVSFIDNYVRSVVDDGTGIVRVGWNREIRKEKQEVPV 167 (763) T ss_pred HHHHhhcCCCcEEEEecCCcchHHHHHHHHHHHHHHHhhc-CchhhHHHHHHHHHhhcCcceEEEeeeeeeeeeeeeehh Confidence 8843 222221 11111 1111111 2223 233455568999999999885554321 10000000 Q ss_pred h---------------h---------------------H-------------------------HhccCCceEEEechhh Q lcl|NC_020844. 137 E---------------E---------------------E-------------------------RQAGVRPYWSQVNHSA 155 (455) Q Consensus 137 a---------------d---------------------~-------------------------~~~~~rPy~~~~~~~~ 155 (455) . + + .....+|.+..++|++ T Consensus 168 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie~V~p~d 247 (763) T protein:vir:95 168 FSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEMLNPEN 247 (763) T ss_pred hhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEEEeecHHH Confidence 0 0 0 0012477888899998 Q ss_pred hcc---ceeeccCCeeEEEEEEE--------Eeee----------------------------------eEEEEEecCeE Q lcl|NC_020844. 156 ILE---IQSERQGAGERITYMRM--------WEDA----------------------------------ETILVLRPDDW 190 (455) Q Consensus 156 ii~---w~~~~~~~~~~l~~v~~--------~E~~----------------------------------e~~~v~~~~~~ 190 (455) ++- ++.+..+.......++. ...+ .+|+|+ T Consensus 248 ~~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~V~v~----- 322 (763) T protein:vir:95 248 IIIDPSCQGDINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVVAY----- 322 (763) T ss_pred heecCCCCCchhhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCCCcccceEEEE----- Confidence 762 22221222221111110 0000 011111 Q ss_pred EEEEE---eCCcEEEE------------eccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHH Q lcl|NC_020844. 191 ERWRK---NDAGIWEV------------DEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFA 255 (455) Q Consensus 191 ~~~~~---~~~g~~~~------------~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~ 255 (455) +.|.. .++|.+.+ ....+++.+.+|||.|...+....+ .+.+.+..+.+++..++...+-..++ T Consensus 323 E~y~~~d~~gdg~~~~~~v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~-~G~gi~~~~~d~Qr~~N~~~~~~~d~ 401 (763) T protein:vir:95 323 EYWGFWDIEGNGVLEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDM-YGEPDAELLGDNQAVLGAVMRGMIDL 401 (763) T ss_pred EeeeeeccCCcceeEEEEEEEEcCeeeecccccccCCCcCEEEecceeecCcc-cCCchHHHhhHHHHHHHHHHHHHHHH Confidence 22222 22332211 1223445688999977655554443 46666677778887877778888888 Q ss_pred HHHcCCceEEEe-cCCCccccccCccceeeccceeeeccCCCCccccceeEEeeC--chhHHHHHHHHHHHHHHHHHhhh Q lcl|NC_020844. 256 EMMTAFPMLSAN-GVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPA--TESLKFLAEQIESVSKEIRELGR 332 (455) Q Consensus 256 l~~~~~p~~~~~-G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~--~~~l~~~~~~l~~~~~~m~~~g~ 332 (455) +..++.|...+. |.-...+ .....++.++.+-.++... ....+.+++ +.+.......++...+++.-+.. T Consensus 402 l~~~~~~~~~v~~gav~~~d------~~~~~pg~v~~v~~g~~~~-~~~~~~~~p~~~~~~~~~l~~~~~~~e~~TGv~~ 474 (763) T protein:vir:95 402 LGRSANGQRGMPKGMLDALN------SRRYREGEDYEYNPTQNPA-QMIIEHKFPELPQSALTMATLQNQEAESLTGVKA 474 (763) T ss_pred HHhhcCCcEEeecccccchh------hhcccCCceEEeeCCCChh-hhcccccCCCCcchHHHHHHHHHHHHHHhhCcch Confidence 888888865443 3221111 1112223333221111111 112222222 22333333333333232222221 Q ss_pred hh--ccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHcCCCC-----CceEEEEe-----cccC Q lcl|NC_020844. 333 QP--LTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDAL----ENALYITNLWMDAPD-----TNPEADVY-----TDFR 396 (455) Q Consensus 333 ~~--l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~----~~~l~~~a~~~g~~~-----~~~~v~v~-----~df~ 396 (455) .. +.+...+.|+++...........+..+..++.+++ +.+|.++.+|++... .+-.+.++ .+|+ T Consensus 475 ~~~G~~~~~~~~tat~v~~l~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~~~~~~~~~D 554 (763) T protein:vir:95 475 FAGGVTGESYGDVAAGIRGVLDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIKREDLKGNFD 554 (763) T ss_pred hhcCcCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEeCCccccccHHHhcCCcc Confidence 11 12222244555444444444455666677776665 445666777765321 00012222 1232 Q ss_pred C------CCCCHHHHHHHHHHHHc--CCCCHHHHHHHHHh-cCCCCCCCCHHHHHHHHHhhccc--CCCC Q lcl|NC_020844. 397 A------DTGEEQTPGYLLTMRQN--GDISREGLWMEFQR-LGELSDNFDPEEESERLISELPG--GIEE 455 (455) Q Consensus 397 ~------~~~~~~~~~al~~~~~~--G~is~et~~~~l~~-~~vl~~~~d~e~e~~ri~~e~~~--~l~~ 455 (455) . ...+.+.+..+..+.+. ..++......-|.+ .++.. ..+..++++...+. -.-+ T Consensus 555 V~V~~~~as~~~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~----~~~~~~~lr~~q~~~d~~~q 620 (763) T protein:vir:95 555 LEVDISTAEVDNQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKR----MPKLAHDLRTWQPQPDPVQE 620 (763) T ss_pred eEEecccchHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhc----hhhhHHHHHhcCCCccchhh Confidence 1 11122323334443332 22332211111111 01111 11111222222111 0000 No 144 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=97.91 E-value=1.1e-05 Score=47.72 Aligned_cols=402 Identities=9% Similarity=0.009 Sum_probs=170.8 Q ss_pred CCCCCCCccCHHH------HHHHHHHHHHHHHhcCHHHHHHhhhhcCCCC---CCCCHHHHHHHhhccccCChHHHHHHH Q lcl|NC_020844. 1 MPEQDPTKRSADA------EAMSDYLQTVDDVLEGVTGMRRNFKRYVKQF---PHEQNKRYDLRKSVAKMTNVFRDVLEG 71 (455) Q Consensus 1 m~~~~v~~~~p~y------~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~---~~e~~~~Y~~rl~ra~~~n~~~~~v~~ 71 (455) |-. +.+.. ....++|+-+.++ .||.. +.++...- .++.+ .|-+...+.++. T Consensus 1 m~~-----~~~~l~~k~~R~~~e~~w~e~a~~-------------~lP~~~~~~~~~~~~~-~~~~~-~~dstg~~a~~~ 60 (514) T protein:vir:80 1 MRQ-----QASAMWAEYRDSTAIRKAEDFAKF-------------TIASLMVDPLDKTHQA-EVVEY-DFQSAGAFLVNN 60 (514) T ss_pred Ccc-----chHHHHHHhhcchHHHHHHHHHHH-------------hcccccCCCCCCcccc-ccccc-ccchhHHHHHHH Confidence 332 11111 1133455444444 44432 22222211 12222 234444456555 Q ss_pred HhhhhhcC--Cc-----eec--c--------C---CHHHHHHHHhhcc------CCCCHHHHHHHHHHHHHhhCcEEEEE Q lcl|NC_020844. 72 LASKPFGR--EV-----EVT--E--------G---TGPSEDLLEDIDG------RGNNLTTFAGQLFFQGVADSISWIRV 125 (455) Q Consensus 72 ~~g~lf~k--~p-----~i~--~--------~---~~~l~~~~~d~d~------~G~~l~~~~~~~~~~al~~G~~~~lV 125 (455) ++..+.+- || .++ + . ...++.|++.+.. ..++++.-+..+.++...+|.+.+++ T Consensus 61 LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~ 140 (514) T protein:vir:80 61 LTAKLALTLFPPGRPSFQIELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYR 140 (514) T ss_pred HHHHHHhhhcCCCCcccccccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEE Confidence 55544210 11 111 0 0 0124555544432 34678888899999999999998888 Q ss_pred eeCCCCcchhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEE-------ee--------------eeEEEE Q lcl|NC_020844. 126 DYPAGQQFRNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMW-------ED--------------AETILV 184 (455) Q Consensus 126 d~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~-------E~--------------~e~~~v 184 (455) +- ..+. ++.|+-.+.. +..+..|+.-.+ +.+.+ |+ .+.+.+ T Consensus 141 ~~-~~~~---------------~~~~pl~~y~-v~~d~~G~v~~i-~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v 202 (514) T protein:vir:80 141 EP-GTGK---------------MLVWTMQSYT-VRRTSHGDPAVV-VLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDL 202 (514) T ss_pred ec-CCCc---------------EEEEEcCeEE-EeeCCCcCeEEE-EeeeeecHHHhhhhhhhhhhhhhccCCCCCceEE Confidence 62 1111 3344433322 222222222111 11111 00 011222 Q ss_pred Ee-----cCe----EEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHH Q lcl|NC_020844. 185 LR-----PDD----WERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFA 255 (455) Q Consensus 185 ~~-----~~~----~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~ 255 (455) ++ ++. |-+|... +|.. +..++..++...|++++++....++.+ |.+|-.+...-...+...+-..-.. T Consensus 203 ~~~v~~~~~~~~~~~sv~~e~-~g~~-i~~es~y~~~e~P~i~~Rw~~~~ge~Y-Grgp~~~al~D~k~L~~l~~~~l~~ 279 (514) T protein:vir:80 203 YTVIEWQPTPNGKRCAVWHEL-EGKR-VGPESSYPAHLCPYVPVAWNVPDGEHY-GRGYVEEYSGDFARLSILSERLGLY 279 (514) T ss_pred EEEEEeecCCCCeEEEEEEec-ccee-ecccCccccccCCeeeeeeEecCCCCc-ccchHHHHHHHHHHHHHHHHHHHHH Confidence 21 110 2222111 1111 233445556779999998887777666 6666555544333444444444555 Q ss_pred HHHcCCceEEEe--cCCCccccccCccceeeccceeeeccCCCCccccceeEEeeC-chhHHHHHHHHHHHHHHHHHhhh Q lcl|NC_020844. 256 EMMTAFPMLSAN--GVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPA-TESLKFLAEQIESVSKEIRELGR 332 (455) Q Consensus 256 l~~~~~p~~~~~--G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~-~~~l~~~~~~l~~~~~~m~~~g~ 332 (455) ...+..|.+.+. |.. ++..+.-|.... .+|.. ..+++.++.. ...+....+.|+++++.|...=. T Consensus 280 ~~~a~~~~~~v~~~g~~-------~~~~l~~~~~g~-~v~g~----~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aFm 347 (514) T protein:vir:80 280 EFEALSLLNLVDEAKGG-------AVDDYRDAETGD-FVPGQ----VGSVASYERGDYNKIAQASASVESIVMRLNRAFM 347 (514) T ss_pred HHHhcCCCceeCccccc-------chhhhcccCCce-eecCC----CccceeeecCcccchHHHHHHHHHHHHHHHHHHh Confidence 556665665442 211 122223232222 23321 2346666644 45788888889999999865311 Q ss_pred hh-ccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHH-cCC----CCCceEEEEecccC--CCC Q lcl|NC_020844. 333 QP-LTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDAL-----ENALYITNLW-MDA----PDTNPEADVYTDFR--ADT 399 (455) Q Consensus 333 ~~-l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~-----~~~l~~~a~~-~g~----~~~~~~v~v~~df~--~~~ 399 (455) -. ...++.+.||++...+.......|.-.-..++.=| ++++.++-+- .|. +++.+.+++..-+. .+. T Consensus 348 l~~~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~~~g~lP~~p~~l~~~~~vs~la~l~r~ 427 (514) T protein:vir:80 348 YTGQVRDAERVTVEEIRTVAEEAENLLGGVYSLLAETLQAPLAYLTMYEASRGNGGMLLGIAQGVYRPSIITGIPALTRN 427 (514) T ss_pred hhccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCchhhcceeeecHHHHHHH Confidence 11 12344557999999988887777766555544322 2233332211 122 22222222221111 111 Q ss_pred CCHHHHHHHHHHHHc--CC-------CCHHHHH-HHHHhcCCCCC-CCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 400 GEEQTPGYLLTMRQN--GD-------ISREGLW-MEFQRLGELSD-NFDPEEESERLISELPGGIEE 455 (455) Q Consensus 400 ~~~~~~~al~~~~~~--G~-------is~et~~-~~l~~~~vl~~-~~d~e~e~~ri~~e~~~~l~~ 455 (455) .+.+.+...++..+. +. |....+. ......||... -...+++.+..+++..+.--+ T Consensus 428 ~~~~~l~~~~~~i~~l~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~i~~~~e~~~~~~~~~~~~~~~ 494 (514) T protein:vir:80 428 IETANILRATQEASAIVPALVQLSKRFDPEKLVERIFANNSVDLSTLSKDPDVVAAEAEQEAALAQQ 494 (514) T ss_pred HHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHhCCCHhhccCCHHHHHHHHHHHHHHHHH Confidence 112222222222211 11 1122222 23344666432 233333333222221111111 No 145 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=97.71 E-value=2.6e-05 Score=45.68 Aligned_cols=422 Identities=13% Similarity=0.090 Sum_probs=188.4 Q ss_pred CCCCCCC--ccCHHHHHHHHHHHH-HHHHhcCHHHHHHhh----hhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHh Q lcl|NC_020844. 1 MPEQDPT--KRSADAEAMSDYLQT-VDDVLEGVTGMRRNF----KRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLA 73 (455) Q Consensus 1 m~~~~v~--~~~p~y~~~~~~w~~-i~d~~~G~~~~r~~g----~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~ 73 (455) |+++.+. .-||+- ...+|.. |...-.+-+.++..+ +.|.--....+.. .. -+|.+--++.++. T Consensus 1 m~~~~~~~~~~tpe~--la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~~~~~~-----~~---r~nl~~sni~~i~ 70 (663) T protein:vir:34 1 MNESQPTDFADTPQG--WAQRWQEEMSAAREPLEKWHTQGKEIVKRYRDERDSAHDA-----ET---RWNLFSTNIQTQM 70 (663) T ss_pred CCccccccchhcchh--HHHHHHHHHHHHHhccchHHHHHHHHHHHhhccccCCCcc-----cc---ccchhhhhHHHHh Confidence 9986543 446765 5668865 666666655555444 4453211111111 11 3799999999999 Q ss_pred hhhhcCCceec------cC----CHHHHHHHHh-----hccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCC----Ccch Q lcl|NC_020844. 74 SKPFGREVEVT------EG----TGPSEDLLED-----IDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAG----QQFR 134 (455) Q Consensus 74 g~lf~k~p~i~------~~----~~~l~~~~~d-----~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~----~~~~ 134 (455) =-+++++|.++ +. .....++++. +..+-..|+..++.+.+.+|.+|++-+-|=|-.. .... T Consensus 71 P~iYar~P~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Ye~~~~~~~~~~ 150 (663) T protein:vir:34 71 ASLYGQTPKVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRYEVEWEEVAGVD 150 (663) T ss_pred hhhhcCCCcceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEeecccchhcccc Confidence 99999998774 21 1122333333 3333345999999999999999988888866221 0000 Q ss_pred hh----hh-HHhccCCceEEEechhhhc----cceeeccC--CeeE-EEEEE---EEe----------eeeE---EEEE- Q lcl|NC_020844. 135 NR----EE-ERQAGVRPYWSQVNHSAIL----EIQSERQG--AGER-ITYMR---MWE----------DAET---ILVL- 185 (455) Q Consensus 135 t~----ad-~~~~~~rPy~~~~~~~~ii----~w~~~~~~--~~~~-l~~v~---~~E----------~~e~---~~v~- 185 (455) .. .+ +.+.+. |-.-.+..|.|. .|+.=-++ .+|. +.++. +.. +.++ +.+= T Consensus 151 ~~~D~~~~~~~a~~~-~~~e~~a~E~v~id~v~~~dfl~~pAr~W~ev~wva~r~~mtk~e~~~rf~~~~~~~~~a~~~~ 229 (663) T protein:vir:34 151 AILDEATGAELAAAV-PPTQRKAYECVETDYLHWQDVLWSPARVWHEVRWLAFRNLLDMREFNARFDADGSRNLWASVPK 229 (663) T ss_pred ccCCCccccchhccc-ccchhhcccceeeeeechhhcccchhhccccccceeeeccCCHHHHHHhhcCChhhhhhhhccC Confidence 00 01 111111 111111112111 11100000 1111 00110 000 0000 0000 Q ss_pred -----------------ecCeEEEEEEeCCcEEE-------EeccCCCCCC---c--eeEEEEEecccCCCcccCcchHH Q lcl|NC_020844. 186 -----------------RPDDWERWRKNDAGIWE-------VDEFGRITIG---V--VPMVPFITGRRKGKKWVFHPPMK 236 (455) Q Consensus 186 -----------------~~~~~~~~~~~~~g~~~-------~~~~g~~~l~---~--vP~V~f~~~~~~~~~~~~~~pL~ 236 (455) .-..|++|-+.++.++- +-+.++.+|| + ||+ |.++-...+..+..|++. T Consensus 230 ~~~~~~~~~~~~~~~~~~a~VwEIWdK~~~~V~w~~eg~~~~L~~~~p~lgl~~ffPcPr--pl~~~~~~ds~ipvpd~~ 307 (663) T protein:vir:34 230 VGKPKDGKDGQSCHPWDRAEVWEIWDKGGRKVDWYVEGYSAVLDTQPDPLGLESFFPCPK--PLLANWTTDKVVPRPDFV 307 (663) T ss_pred cCCccccCCCCCcchhcCcceeEEEecCCcEEEEEEcCcceecccCCCCCCCCCCCCCcc--cccceecCCCeecCCcHH Confidence 00123444333333211 2233444443 3 344 222333334556677766 Q ss_pred HHH-HHHHHHHHhHHHHHHHHHHcCCceEEEe-cCCCcc---ccccCccceeeccceeeeccCCCCccccceeEEeeC-- Q lcl|NC_020844. 237 DAL-DLQVALFRKECALEFAEMMTAFPMLSAN-GVEPDT---DSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPA-- 309 (455) Q Consensus 237 dla-~l~~~hy~~~sd~~~~l~~~~~p~~~~~-G~~~~~---~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~-- 309 (455) +. ++..+|.. .++--+.+..+-.|..++. |....- ..+...+. +.+-.-|.++.+..+-.+...++-.+ T Consensus 308 -~y~~~~~E~n~-~t~Rin~l~d~ikv~gvy~~~~g~~i~~~l~~a~~n~--lvpV~~~~~~~~~gg~~k~I~~~pi~~~ 383 (663) T protein:vir:34 308 -LAQDLYKEIDL-VSTRITLLERAIRVVGVYDKSSGLTIGRLLSEAAQND--LIPVENWLTFADKGGLRGVVDWFPLEPV 383 (663) T ss_pred -HHHHHHHHHHH-HHHHHHHHHhhhhhceeeccccchhHHHHHHHhhCCC--ceecchhhhhhhhcCccchhhcccchhH Confidence 44 44445543 4454555555555555553 222111 11111111 11111222222221111223343333 Q ss_pred chhHHHHHHHHHHHHHHHHHhhhhh--ccC-CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCc Q lcl|NC_020844. 310 TESLKFLAEQIESVSKEIRELGRQP--LTA-QSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDTN 386 (455) Q Consensus 310 ~~~l~~~~~~l~~~~~~m~~~g~~~--l~~-~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~ 386 (455) -..|..+...-..+...++++.+.. +-+ ...++|+++..+......-.+..+...+++....+.++.|+++-.. T Consensus 384 ~~aI~~l~~~r~qir~d~~qITGiaDi~Rga~~a~ETatAQ~IKsq~gS~RIqe~qdevqR~arDi~ql~AEIl~~~--- 460 (663) T protein:vir:34 384 VAALTSLRDYRRELVDALHQVTGMADIMRGASDPRETAMAQGVKAKFGSIRLQRLQDEVARFASDIQRLKAEVIAEH--- 460 (663) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhHHHHhhcccCcchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHh--- Confidence 3456666666666777777765433 333 2367999999999988888899999999999999999999987421 Q ss_pred eEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCC----------HHHHHHHHHhhcc--cCCC Q lcl|NC_020844. 387 PEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFD----------PEEESERLISELP--GGIE 454 (455) Q Consensus 387 ~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d----------~e~e~~ri~~e~~--~~l~ 454 (455) |. .+.+..++.+..-....-+-.+.+|+...+-...++ ..++.+...+-.+ +.+- T Consensus 461 --------~~-----~etl~~m~~~elp~~~ei~~~~~~L~n~~~r~~~ldIe~dsT~~~D~~~eK~~~~E~l~~i~~~~ 527 (663) T protein:vir:34 461 --------YD-----VASILAQANAEFTFDKELAPKAAELIKSRFSMYRVEVKPEAVSLQDFAALRNEKMEVLSGIASFM 527 (663) T ss_pred --------cC-----HHHHHHHhcCCCCcccchhHHHHHHhcCCCcceeeeeccCCCCcCChHHHHHHHHHHHHHHHHHH Confidence 10 111111111101111112222333332222111110 1111111111100 0000 Q ss_pred C Q lcl|NC_020844. 455 E 455 (455) Q Consensus 455 ~ 455 (455) . T Consensus 528 q 528 (663) T protein:vir:34 528 Q 528 (663) T ss_pred H Confidence 0 No 146 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=97.68 E-value=3e-05 Score=45.40 Aligned_cols=418 Identities=12% Similarity=0.068 Sum_probs=189.4 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGR 79 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k 79 (455) |+..+-.. .=.=...+.+|+.++.--.= ...+++..+-.||..-..+...=..+.. -.|-+...+.++.++..+++- T Consensus 1 ~~~~~~~~-~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~-~~~dst~~~a~~~Laa~l~~~ 78 (535) T protein:vir:94 1 MASSQKRE-GFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYT-TPWQAVGARGLNNLASKLMLA 78 (535) T ss_pred CCchhhhh-hHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCccccccC-CcccccHHHHHHHHHHHHHhh Confidence 98543211 11112233456555442211 4455666666666532211111011122 145555556666655554210 Q ss_pred --Cce--ec------------cCC---HHHHHHHHhhc------cCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcch Q lcl|NC_020844. 80 --EVE--VT------------EGT---GPSEDLLEDID------GRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFR 134 (455) Q Consensus 80 --~p~--i~------------~~~---~~l~~~~~d~d------~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~ 134 (455) |++ +. ..+ ..++.|++.+. ...++++.-+..+.++...+|-+-++++-+..... T Consensus 79 ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~- 157 (535) T protein:vir:94 79 LFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIPEPEGTYN- 157 (535) T ss_pred hcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeccCcCccc- Confidence 211 11 001 12555665431 13567888889999999999999999875432221 Q ss_pred hhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeee----------------------eEEEEEecCeEEE Q lcl|NC_020844. 135 NREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDA----------------------ETILVLRPDDWER 192 (455) Q Consensus 135 t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~----------------------e~~~v~~~~~~~~ 192 (455) .+..|+-.++. +..+..|. +.+.++ ++.+ +.+.+++ .+ T Consensus 158 ------------~f~~~pl~~y~-v~~d~~G~--vd~i~r-~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~----~v 217 (535) T protein:vir:94 158 ------------PMKLYRLSSYV-VQRDAFGT--VLQIVT-LDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYT----HI 217 (535) T ss_pred ------------ceEEEEcCeEE-EeeCCCCC--eEEEEe-eeeccHHHhhHHHHHHHHhccccCCCceeEEEE----EE Confidence 13445444432 22222222 222221 1110 1122221 12 Q ss_pred EEEeCCcEEEEe----------ccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCc Q lcl|NC_020844. 193 WRKNDAGIWEVD----------EFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFP 262 (455) Q Consensus 193 ~~~~~~g~~~~~----------~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p 262 (455) |+..+++.|..+ ..+..++...|++++++....++.+ +..|-.+...-...+...+-..-.....+..| T Consensus 218 ~~~~~~~~~~~~~e~~g~~~~~~~~~~g~~~~P~~~~Rw~~~~ge~Y-Grgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~ 296 (535) T protein:vir:94 218 YLDEESGEYLKYEEIDGVEVEGTDASYPVDACPYIPVRMVRIDGESY-GRSYCEEYLGDLRSLENLQEAIVKMSMISAKV 296 (535) T ss_pred EeeCCCCcEEEEEEecCeeeccccccCccccCCceeeeeeecCCCcc-ccchHHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 223333333221 2345578889999998887777666 66776665554444444444444455555555 Q ss_pred eEEEecCCCccccccCcccee-eccceeeeccCCCCccccceeEEeeC-chhHHHHHHHHHHHHHHHHHh--hhhhccCC Q lcl|NC_020844. 263 MLSANGVEPDTDSEGSPKPLD-TGPDTVLYAPPNAEGQHGSWNFVEPA-TESLKFLAEQIESVSKEIREL--GRQPLTAQ 338 (455) Q Consensus 263 ~~~~~G~~~~~~~~~~~~~~~-~g~~~~~~lp~~~~~~~~~~~~le~~-~~~l~~~~~~l~~~~~~m~~~--g~~~l~~~ 338 (455) .+.+. ++.-.++..+. .|++.+ +|.. .+++..++.. +..+....+.|+++++.|... -..+...+ T Consensus 297 ~~lv~-----p~g~~~~~~~~~~~~g~~--v~g~----~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~d 365 (535) T protein:vir:94 297 IGLVN-----PAGITQVRRLTKAQTGDF--VSGR----PEDISFLQLEKAADFSVARAVSEQIEGRLSYAFMLNSAVQRT 365 (535) T ss_pred Ccccc-----cccccchhhcccCCCcee--ecCC----cccceeeecccccchhHHHHHHHHHHHHHHHHHhHhhhccCC Confidence 54442 01111111122 223332 3321 2234455433 467788888899988888652 11222344 Q ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHH-H----HHHHHHHHHHcCC----CCCceEEEEecccCCCCCCHHHHHHHH Q lcl|NC_020844. 339 SGNLTRISAAQAASKGNSAVQQWAAGLKDA-L----ENALYITNLWMDA----PDTNPEADVYTDFRADTGEEQTPGYLL 409 (455) Q Consensus 339 ~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a-~----~~~l~~~a~~~g~----~~~~~~v~v~~df~~~~~~~~~~~al~ 409 (455) +.+.||++...+.......|.-.-..++.- + ++++.++-+ .|. +++.+.+++.. ....---.++++.++ T Consensus 366 ~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r-~g~lP~~p~~~v~~~~vs-~la~l~r~~~~~~l~ 443 (535) T protein:vir:94 366 GERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLLKQLQA-TNQIPELPKEAVEPTIST-GMEALGRGQDLDKLE 443 (535) T ss_pred CCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHh-CCCCCCCChhhccceEee-hHHHHHHHHHHHHHH Confidence 556899999999888888877766665542 2 333333322 133 22222232221 110000123333333 Q ss_pred HHHH--cCC--------CCHHHHHHH-HHhcCCCCC-CCCHHHHHHHHHhhccc------CCCC Q lcl|NC_020844. 410 TMRQ--NGD--------ISREGLWME-FQRLGELSD-NFDPEEESERLISELPG------GIEE 455 (455) Q Consensus 410 ~~~~--~G~--------is~et~~~~-l~~~~vl~~-~~d~e~e~~ri~~e~~~------~l~~ 455 (455) ...+ +++ |....+.+. ....||-.. -+..++|.+.+.+|... ++.. T Consensus 444 ~~~~~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~rs~eev~~~~~q~~~~~~~~~~~~~ 507 (535) T protein:vir:94 444 RCIAAWSALAPMQGDPDINIATIKLRIANAIGIDTSGILKTPEEKQQEMAEAAQGTAMQNAAAS 507 (535) T ss_pred HHHHHHHhhChHHhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHH Confidence 3332 121 222222222 233555322 22334444433333221 1111 No 147 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=97.60 E-value=3.9e-05 Score=44.78 Aligned_cols=415 Identities=9% Similarity=-0.053 Sum_probs=166.7 Q ss_pred CCCCC--CC-------ccCH-HHHHHHH-HHHHHHHHhc----CHHHHHHhhhhc-CCCCC-------CCC-HHHHHHHh Q lcl|NC_020844. 1 MPEQD--PT-------KRSA-DAEAMSD-YLQTVDDVLE----GVTGMRRNFKRY-VKQFP-------HEQ-NKRYDLRK 56 (455) Q Consensus 1 m~~~~--v~-------~~~p-~y~~~~~-~w~~i~d~~~----G~~~~r~~g~~y-Lpk~~-------~e~-~~~Y~~rl 56 (455) |...+ +. ..|. .|..+.. .-.++.|-++ |...-+. ...| +++.- .-. -.-+..|. T Consensus 66 ~~~~~~~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~-~s~y~~~~~~~~~~~~~~f~gyql~alY~ 144 (862) T protein:vir:99 66 VEISDSVNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGK-QSSYAVPEALQDWYLSQGFIGHQACALIA 144 (862) T ss_pred ccccccccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhcccccc-ccccccchhccccccccCcccHHHHHHHH Confidence 11000 00 0000 0111111 1112222111 1100001 1111 11110 001 11123343 Q ss_pred hccccCChHHHHHHHHhhhhhcCCceeccC-------CHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCC Q lcl|NC_020844. 57 SVAKMTNVFRDVLEGLASKPFGREVEVTEG-------TGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPA 129 (455) Q Consensus 57 ~ra~~~n~~~~~v~~~~g~lf~k~p~i~~~-------~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~ 129 (455) . ...++++|+..+.-..++.+.|... ++..+.+..-.+ ...+...++.+.+.+-.||++++++.... T Consensus 145 ~----~~larkiVd~pAeDatR~g~~I~~~~d~~e~~~e~~~~ie~~~~--rL~v~~~l~eair~~RLyGga~ililv~~ 218 (862) T protein:vir:99 145 Q----HWLVDKACSLAGEDAIRNGWHLKSLGEGEEIDEESLEKFKAIDV--EFKVKENLIEFNRFKNVFGIRVAIFVVDS 218 (862) T ss_pred h----CchhhhhhhhhhHHHhhCCceEeecCcccccCHHHHHHHHHHHH--HhhHHHHHHHHHHhcccccceEEEEEecC Confidence 3 4778899999999999999999642 122222222111 12466666666666667888777654221 Q ss_pred CCcchhhhhH-----HhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEe Q lcl|NC_020844. 130 GQQFRNREEE-----RQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVD 204 (455) Q Consensus 130 ~~~~~t~ad~-----~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~ 204 (455) .. +....+. ...+..=+|..++|..+-...... +..+...=.++.| ..|...+. .++ T Consensus 219 ~D-~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v~~-----------~~~Dp~sp~yGkP---~~y~I~g~---~IH 280 (862) T protein:vir:99 219 ED-PDYYEKPFNPDGITPGSYRGISQIDPYWMMPMLTAE-----------STADPSSQFFYEP---EFWIISGQ---KYH 280 (862) T ss_pred cC-chhhhcCcCcccccccceeEEEEechhhhccccccc-----------ccccccccccCCc---eeeeecCe---eec Confidence 11 1000000 000000011111111111000000 0000000000111 22221110 111 Q ss_pred ccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcccccc-Ccc--c Q lcl|NC_020844. 205 EFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEG-SPK--P 281 (455) Q Consensus 205 ~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~-~~~--~ 281 (455) .+--..+..-|+ |....+. .. ..+.+-|.-+.+.....-++...-..+++...+.++-+.|+.....++. ... . T Consensus 281 ~SRliif~g~~v-pd~lk~a-y~-f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~l~v~ktd~l~~l~~ed~l~~r~~~ 357 (862) T protein:vir:99 281 RSHLIIARGPQP-ADILKPT-YI-FGGIPLVQRIYERVYAAERTANEAPLLAMNKRTTAIHTDTAKAIANEDKFIQRLMF 357 (862) T ss_pred cceeEEecCCCc-hhhhhcc-CC-ccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeechhHhhhccHHHHHHHHHH Confidence 110001111111 0010111 11 2366666666565444444555557778888888877777643221111 100 0 Q ss_pred ee--eccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhcc---CC---CcchhHHHHHHHHHH Q lcl|NC_020844. 282 LD--TGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLT---AQ---SGNLTRISAAQAASK 353 (455) Q Consensus 282 ~~--~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~---~~---~~~~sa~a~~~~~~~ 353 (455) +. -+...+..+..+ .++..++.+-+++ ...+....++|......|+. ++ +-+.|+.. +... T Consensus 358 ~~~~rdN~Gi~liD~e-----Ee~e~ls~slSGL---~dll~~~~q~IAaas~IP~tiLfGqspaGlnATGE~---D~~n 426 (862) T protein:vir:99 358 WVRYRDNHAVKVLGTD-----ETMEQFDTSLADF---DAVIMGQYQLVASIAKTPATKLLGTAPKGFNSTGEF---ETIS 426 (862) T ss_pred HHhccCcceeEEecCC-----CceeEEecccCCh---HHHHHHHHHHHHhhhCCCceeecccCcccccCchHH---HHHH Confidence 11 111234444322 2566666655555 44556666677666555543 22 22344443 3344 Q ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHH-----HHHHHHHHHHcCCCCHHHHHHHHH Q lcl|NC_020844. 354 GNSAVQQWA-AGLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQ-----TPGYLLTMRQNGDISREGLWMEFQ 427 (455) Q Consensus 354 ~~s~L~~~a-~~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~-----~~~al~~~~~~G~is~et~~~~l~ 427 (455) -+..+..+. ..+...+++++.++..-+|.++ +.+|+++.=+.....+-. .+++...+.++|+||.+..+..|+ T Consensus 427 YyD~I~s~QE~~L~P~LerL~~li~~~lg~~~-d~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~sGvispdEvR~~L~ 505 (862) T protein:vir:99 427 YHEELESIQEHVYMPFLQRHYLISRLSLGIQH-EIDVVMEPVASMTAQQQADLNKTKAEGGKVLIDGGVISPDEERNRIR 505 (862) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHH Confidence 455555554 3477788888887766567543 556655431111111111 235567788899999999999998 Q ss_pred hcCCCCCCCCHHHHHH-----------HHHhhcccCCCC Q lcl|NC_020844. 428 RLGELSDNFDPEEESE-----------RLISELPGGIEE 455 (455) Q Consensus 428 ~~~vl~~~~d~e~e~~-----------ri~~e~~~~l~~ 455 (455) ..+......-.+++.+ .+++.+.+..++ T Consensus 506 ~~~~~g~~~l~ded~E~d~~~~~e~~~~~e~~g~a~~~a 544 (862) T protein:vir:99 506 DDKRSGYNRLTKEDAEETPGASPENLAAYQKAGAAQETA 544 (862) T ss_pred hcCCcCCCCCCcccccccCCCCcccccccccCCcccccc Confidence 7665332211111111 111110000000 No 148 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=97.60 E-value=3.9e-05 Score=44.73 Aligned_cols=417 Identities=10% Similarity=0.037 Sum_probs=189.0 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCC-CCCHHHHHHHhhccccCChHHHHHHHHhhhh-- Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFP-HEQNKRYDLRKSVAKMTNVFRDVLEGLASKP-- 76 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~-~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~l-- 76 (455) |-+.+- .---+-.....+|+.++.--.- ...+++..+-.||-.- .+... .+.++ .|-+...+.++.++..+ T Consensus 1 ~~~~~~-~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~~~~~~~~~---~~~~~-~~dstg~~a~~~LAa~l~~ 75 (515) T protein:vir:70 1 MQDTIL-EYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYLMNNKGDN---ETSQN-GWQGVGAQATNHLANKLAQ 75 (515) T ss_pred Ccchhh-hhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccccCCCCCc---ccccc-cccchHHHHHHHHHHHHHH Confidence 554321 1111222334455544322211 3344555555666321 11111 12222 45555556666665554 Q ss_pred --hc--CCc-eec----------cCC---HHHHHHHHhhc------cCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCc Q lcl|NC_020844. 77 --FG--REV-EVT----------EGT---GPSEDLLEDID------GRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQ 132 (455) Q Consensus 77 --f~--k~p-~i~----------~~~---~~l~~~~~d~d------~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~ 132 (455) |. .|. .++ ..+ ..++.|++.+. ...++++.-+..+.++...+|.+.+++|-+ +. T Consensus 76 ~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~--~~ 153 (515) T protein:vir:70 76 VLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPSK--GA 153 (515) T ss_pred hhcCCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEEEEeCC--CC Confidence 22 110 111 001 12344444332 124578888889999999999999988722 11 Q ss_pred chhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEee-----------------------eeEEEEE---- Q lcl|NC_020844. 133 FRNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWED-----------------------AETILVL---- 185 (455) Q Consensus 133 ~~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~-----------------------~e~~~v~---- 185 (455) +..|+-.+.. +..+..|+.-. ++.+.+=+ .+.+.++ T Consensus 154 ---------------~~~~pl~~y~-v~~d~~G~v~~-i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~ 216 (515) T protein:vir:70 154 ---------------MSAVPMHHYV-VNRDTNGDLMD-VILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQ 216 (515) T ss_pred ---------------eEEEEcCeEE-EeeCCCcCeeE-EEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEE Confidence 2233333321 22222222211 11111100 0112222 Q ss_pred -ecC-eEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCce Q lcl|NC_020844. 186 -RPD-DWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPM 263 (455) Q Consensus 186 -~~~-~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~ 263 (455) .++ .|.+|... ++. .+..++..++...|++++++....++.+ |.+|-.+...-...+...+-..-.....+..|. T Consensus 217 ~~~~~~~~~~~e~-d~~-~~~~es~y~~~e~P~~~~Rw~~~~ge~Y-Grgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~ 293 (515) T protein:vir:70 217 YAGEGFWKINQSA-DDI-PVGKESRIKSEKLPFIPLTWKRSYGEDW-GRPLAEDYSGDLFVIQFLSEAMARGAALMADIK 293 (515) T ss_pred ecCCCceEEEEec-Cce-eeccccccccccCCceeeeeeecCCCCc-ccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCC Confidence 112 23333222 222 2345566678889999998887777666 666766555444444444455566667777777 Q ss_pred EEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEeeC-chhHHHHHHHHHHHHHHHHHhhh--hhccCCCc Q lcl|NC_020844. 264 LSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPA-TESLKFLAEQIESVSKEIRELGR--QPLTAQSG 340 (455) Q Consensus 264 ~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~-~~~l~~~~~~l~~~~~~m~~~g~--~~l~~~~~ 340 (455) +.+.- +.-.++..+.-|.... .+|. ..++++.++.. +..+....+.|+++++.|...=. .+...++. T Consensus 294 ~lv~~-----~g~~~~~~l~~~~~g~-iv~g----~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~ 363 (515) T protein:vir:70 294 YLIRP-----GSQTDVDHFVNSGTGE-VITG----VAEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAE 363 (515) T ss_pred eeeCc-----ccccchhhccccCCce-eecC----CcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhhccCCc Confidence 66630 1111222223333222 2332 12345556643 34678888889999998866322 23344455 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH-HHHHHHcCCCCC-ceEEEEecccCCC---CCCHHHHHHHHHHHH- Q lcl|NC_020844. 341 NLTRISAAQAASKGNSAVQQWAAGLKDALEN-AL-YITNLWMDAPDT-NPEADVYTDFRAD---TGEEQTPGYLLTMRQ- 413 (455) Q Consensus 341 ~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~-~l-~~~a~~~g~~~~-~~~v~v~~df~~~---~~~~~~~~al~~~~~- 413 (455) +.||++...+.......|.-.-..++.=|-. ++ +.........+. .+.+.+- .+... ....+.+..+++... T Consensus 364 rvTAtEV~~r~~E~~~~LGpv~srL~~Ell~Pli~r~~~~~~p~~P~~~v~~~~v-s~l~~L~r~q~~~~i~~~~q~i~~ 442 (515) T protein:vir:70 364 RVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIV-TGIEALGRMAELDKLANFAQYMSL 442 (515) T ss_pred cccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhhCCCCChhhccccee-hhHHHHHHHHHHHHHHHHHHHHHH Confidence 6899999998887777776555555443222 22 222332222222 1222221 12111 001222323233222 Q ss_pred cCCCCHH--------HHHH-HHHhcCCCCCCCCHHHHHHHHHhhcc-----cCCCC Q lcl|NC_020844. 414 NGDISRE--------GLWM-EFQRLGELSDNFDPEEESERLISELP-----GGIEE 455 (455) Q Consensus 414 ~G~is~e--------t~~~-~l~~~~vl~~~~d~e~e~~ri~~e~~-----~~l~~ 455 (455) ...++.+ .+.+ .....|+-...+..++|.+.+.+|.. +.+.+ T Consensus 443 ~~~~~p~~~~~id~d~~~~~~a~~~g~p~~~~rs~eev~~~r~q~~~~~~~~~~~~ 498 (515) T protein:vir:70 443 PQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNE 498 (515) T ss_pred HhccChhHHhhCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHH Confidence 1112221 1122 33345555555555666665555522 11111 No 149 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=97.41 E-value=7.3e-05 Score=43.26 Aligned_cols=410 Identities=11% Similarity=-0.015 Sum_probs=182.1 Q ss_pred CCCCC--CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCC-----CCCHHHHHHHhhccccCChHHHHHHHHh Q lcl|NC_020844. 1 MPEQD--PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFP-----HEQNKRYDLRKSVAKMTNVFRDVLEGLA 73 (455) Q Consensus 1 m~~~~--v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~-----~e~~~~Y~~rl~ra~~~n~~~~~v~~~~ 73 (455) |++.+ .+--.|...+-. .++.. +.....|+..+. +..-+-|+... + -.++.-.++.-. T Consensus 1 ~~~~~~~~~gl~p~rl~~i-----~~~~~------~~~~~~~~~~~~~~Lr~~~~~~ly~~m~-~---D~hi~s~l~~Rk 65 (488) T protein:vir:95 1 MADITETQESLPPFRMGEV-----GSLGL------KVKNGRIYEEPRQALRFPESIKTFQLMM-R---DPAVAASVNIIK 65 (488) T ss_pred CCCccccCCCCCHHHHHHH-----HHHhh------ccccchhhccchhhhcccchHHHHHHHh-h---ChHHHHHHHHHH Confidence 88654 333445432221 12222 222223332111 22345676655 2 478888888888 Q ss_pred hhhhcCCceecc---CCH--H---HHHHHHhhccC-CCCHHHHHHHHHHHHHhhCcEEE-EEeeCCCCcchhhhhHHhcc Q lcl|NC_020844. 74 SKPFGREVEVTE---GTG--P---SEDLLEDIDGR-GNNLTTFAGQLFFQGVADSISWI-RVDYPAGQQFRNREEERQAG 143 (455) Q Consensus 74 g~lf~k~p~i~~---~~~--~---l~~~~~d~d~~-G~~l~~~~~~~~~~al~~G~~~~-lVd~p~~~~~~t~ad~~~~~ 143 (455) ..+.+.++.|.- .++ . ..++++.+-.. -.++.+++..+. .|+-+|.+.+ +|+....+.. T Consensus 66 ~av~~~~w~v~p~~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~---------- 134 (488) T protein:vir:95 66 MFVRKVNWRFVPPKGKEQDPKMLERADFFNSLMDDMEHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKK---------- 134 (488) T ss_pred HHHhcCCceEecCCCCchhHHHHHHHHHHHHHHhccCccHHHHHHHHH-Hhhcccceeeeeeeecccccc---------- Confidence 888888888841 111 1 12333332222 235888888887 6889997765 3432221110 Q ss_pred CCceEEEechhhhccceeeccCCeeEEEEEEEEeee--eEEEEEecCeEEEEEEeCCcEEE-----EeccCCC-CCCcee Q lcl|NC_020844. 144 VRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDA--ETILVLRPDDWERWRKNDAGIWE-----VDEFGRI-TIGVVP 215 (455) Q Consensus 144 ~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~--e~~~v~~~~~~~~~~~~~~g~~~-----~~~~g~~-~l~~vP 215 (455) ++ +. ....+|++.+..+..|... .++.+ +++.-.+++........ ....+.. .--.|| T Consensus 135 -~~----------~~--~~~~dg~~~~~~i~~Rpq~~~~~f~~-d~d~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP 200 (488) T protein:vir:95 135 -GK----------YQ--SKFDDGLIGWAKLPIRNQSTLDKWYF-DEDFRRVTGVRQNLRNVSHIAGAINLGERPLTRKLP 200 (488) T ss_pred -cc----------cc--ccccCCeeeeeeeeecCcccccceee-ccCCCceeeccccccccccccccccccccccccccc Confidence 00 11 1123445544444333221 22221 11111111111000000 0000111 111255 Q ss_pred ---EEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCC---Ccc-ccccC---cc----- Q lcl|NC_020844. 216 ---MVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVE---PDT-DSEGS---PK----- 280 (455) Q Consensus 216 ---~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~---~~~-~~~~~---~~----- 280 (455) ||-+...+..+. ..+...|..++..-+---....+...-++.-+.|++++.|.. ... +.+.. .. T Consensus 201 ~~kfi~~~~~~~~g~-p~g~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~ 279 (488) T protein:vir:95 201 RAKFMLFKYDDEYGN-PEGRSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVV 279 (488) T ss_pred ccceEEEeecCCCCc-cchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHH Confidence 343444444333 345666666655432211233344444555567888877632 111 11110 00 Q ss_pred -ceeeccceeeeccCCCCccc----cceeEEeeCchhHHHHHHHHHHHHHHHHH--hhhhhccCCCcchhHHHHHHHHHH Q lcl|NC_020844. 281 -PLDTGPDTVLYAPPNAEGQH----GSWNFVEPATESLKFLAEQIESVSKEIRE--LGRQPLTAQSGNLTRISAAQAASK 353 (455) Q Consensus 281 -~~~~g~~~~~~lp~~~~~~~----~~~~~le~~~~~l~~~~~~l~~~~~~m~~--~g~~~l~~~~~~~sa~a~~~~~~~ 353 (455) .+..|+.+...+|.+...+. -.+..++..+.+...++..++...++|.. +|..+-...+..-|-......... T Consensus 280 ~~~~~~~~ag~iiP~g~~~~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~iLGqtLT~~~~~~Gs~Al~~vh~ev 359 (488) T protein:vir:95 280 NDMIANDRAGLIWPRYIDPDTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAFMSDVLAMGQSKYGSFSLADSKTSL 359 (488) T ss_pred HHhhccchhheeeccccccccchhhhhhhccccccCCchhHHHHHHHHHHHHHHHHhccccccccCcchhhhHHHHHHHH Confidence 11234445667776542111 01223444444445567777877788755 344332222211233333444555 Q ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHcC-CCCCceEEEEecccCCCCCC-HHHHHHHHHHHHcCC-CCHHHHHHHH-Hh Q lcl|NC_020844. 354 GNSAVQQWAAGLKDALE-NALYITNLWMD-APDTNPEADVYTDFRADTGE-EQTPGYLLTMRQNGD-ISREGLWMEF-QR 428 (455) Q Consensus 354 ~~s~L~~~a~~~~~a~~-~~l~~~a~~~g-~~~~~~~v~v~~df~~~~~~-~~~~~al~~~~~~G~-is~et~~~~l-~~ 428 (455) ...++...+..+.++++ ++++.++.+-. ....-..|.+ +... ..+ ...++++.++.+.|. ++...+.+++ ++ T Consensus 360 ~~~i~~aDa~~i~~tln~~li~~l~~~Nfg~~~~~P~~~~--~~~e-~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~ 436 (488) T protein:vir:95 360 LAMSVDILLKQIKNVINRDLVAQTYALNMWDDEEHVQITY--DDIE-TPDLEAIGSYIQKTVAVGALEVDKELSNKLREH 436 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEe--cCcC-hhhHHHHHHHHHHHHhCCCccccHHHHHHHHHH Confidence 56677788888889996 58888877752 2221233433 2211 122 344667888888888 4433233333 35 Q ss_pred cCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 429 LGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 429 ~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) .|+-.+..+ +........+.+....+ T Consensus 437 ~gip~~~~~-e~~~~~~~~~~~~~~~~ 462 (488) T protein:vir:95 437 IGLPPADES-QPVSEKLSPNSQSRSGD 462 (488) T ss_pred hCCCCCCCC-ccccccCCCCCCCCCCc Confidence 667444322 22222221111111111 No 150 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=97.36 E-value=8.5e-05 Score=42.91 Aligned_cols=407 Identities=9% Similarity=-0.013 Sum_probs=175.8 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHH-HhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDD-VLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGR 79 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d-~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k 79 (455) |= + ....+|+..++ .+ +..+++..+-.||..-..+...-..++.+ .|-+...+.++.++..+.+- T Consensus 1 mk-------~----~~~~~~~~lkr~~~--e~~w~e~a~~tlP~~~~~~~~~~~~~~~~-~~dstg~~a~~~LAa~l~~~ 66 (510) T protein:vir:78 1 MK-------S----TAAMLWEKLRDGSV--EQRAIEFAKTTLPYLMVDPMSGSRGVVEH-DFQSAGALLVNNLAAKLARS 66 (510) T ss_pred Ch-------h----HHHHHHHHHhccch--HHHHHHHHHhhccccccCCCCcccccccC-cccchHHHHHHHHHHHHHHh Confidence 22 0 11234444332 11 34455555556663322111111222333 35555566666666555210 Q ss_pred --Cce-----ec-------------cCCHHHHHHHHhhc------cCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcc Q lcl|NC_020844. 80 --EVE-----VT-------------EGTGPSEDLLEDID------GRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQF 133 (455) Q Consensus 80 --~p~-----i~-------------~~~~~l~~~~~d~d------~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~ 133 (455) ||. +. .....++.|++.+. ...++++.-+..+.++...+|.+.++++-+. + T Consensus 67 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~-~-- 143 (510) T protein:vir:78 67 LFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSDE-A-- 143 (510) T ss_pred hcCCCCcccccCCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEeCCC-C-- Confidence 111 11 00113555555442 1345788888899999999999888886321 1 Q ss_pred hhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEee---------------------eeEEEEEe-----c Q lcl|NC_020844. 134 RNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWED---------------------AETILVLR-----P 187 (455) Q Consensus 134 ~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~---------------------~e~~~v~~-----~ 187 (455) .+..|+-.+.. +..+..|+.-. ++.+++=+ .+.+.+++ + T Consensus 144 -------------~~~~~pl~~y~-v~~d~~G~vd~-i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~V~~~~ 208 (510) T protein:vir:78 144 -------------TVVAWSLRSYA-VRRDATGRWMD-IVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRK 208 (510) T ss_pred -------------eEEEEEcceeE-EeeCCCcCeeE-EEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEEEEEeec Confidence 13444433322 22222232211 11111100 01122221 1 Q ss_pred Ce----EEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCce Q lcl|NC_020844. 188 DD----WERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPM 263 (455) Q Consensus 188 ~~----~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~ 263 (455) +. |.+|-..++. .+...+..++...|++++++....++.+ |.+|-.+...-...+...+-..-.....+..|. T Consensus 209 ~~~~~~~sv~~e~dg~--~i~~~~~~~~~e~P~~~~Rw~~~~ge~Y-Grgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~ 285 (510) T protein:vir:78 209 GTAMDYAEMYHEIDGV--RVGETGRWPIHLCPYIVPTWNLAPGEHY-GRGHVEDYIGDFAKLSLLSEKLGLYELESLEVL 285 (510) T ss_pred CCCCcEEEEEEEecCe--eeccccccccccCCeeeeeeeecCCCcc-ccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 10 2222221111 1234456667889999998877776665 555655444333333333323333333334443 Q ss_pred -EEEecCCCccccccCccceeeccceeeeccCCCCccccceeEEeeC-chhHHHHHHHHHHHHHHHHHhhhh-hccCCCc Q lcl|NC_020844. 264 -LSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPA-TESLKFLAEQIESVSKEIRELGRQ-PLTAQSG 340 (455) Q Consensus 264 -~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~-~~~l~~~~~~l~~~~~~m~~~g~~-~l~~~~~ 340 (455) ++-.+- -.++..+.-|....+ +|... .+++.++.. +..+....+.|+++++.|...=.. +...++. T Consensus 286 ~lv~p~g------~~~~~~l~~~~~g~~-v~g~~----~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF~~~l~~~~~~ 354 (510) T protein:vir:78 286 NLVDEAK------GAVVDDYQDAEMGDY-VPGGA----EAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQRDAE 354 (510) T ss_pred cccCCcc------ccchhhhccCCCcee-ecCCc----ccccccccCcccchHHHHHHHHHHHHHHHHHHhhccccCCCC Confidence 332211 111222222222222 34221 235555533 356777788888888888663211 1223344 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHcCC---CCCceEEEEecccCCCCCCHHHHH---HHH Q lcl|NC_020844. 341 NLTRISAAQAASKGNSAVQQWAAGLKDA-----LENALYITNLWMDA---PDTNPEADVYTDFRADTGEEQTPG---YLL 409 (455) Q Consensus 341 ~~sa~a~~~~~~~~~s~L~~~a~~~~~a-----~~~~l~~~a~~~g~---~~~~~~v~v~~df~~~~~~~~~~~---al~ 409 (455) +.||++...+.......|.-.-..++.- +++++.++-+ .|. ++..+...+ -.|...---++.+. ++. T Consensus 355 rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-~gl~p~p~~~~~~~~-v~~is~Laraq~~~~l~~~~ 432 (510) T protein:vir:78 355 RVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAI-ETGLPALSRSAAVQSMLNAS 432 (510) T ss_pred CcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCCCccccccee-eecccHHHHHHHHHHHHHHH Confidence 5799999998888877776665554432 2333333322 232 222222111 11221111122222 222 Q ss_pred HHHH-cCC-------CCHHHH-HHHHHhcCCCCC-CCCHHHHHHHHHhhcccCC--------------CC Q lcl|NC_020844. 410 TMRQ-NGD-------ISREGL-WMEFQRLGELSD-NFDPEEESERLISELPGGI--------------EE 455 (455) Q Consensus 410 ~~~~-~G~-------is~et~-~~~l~~~~vl~~-~~d~e~e~~ri~~e~~~~l--------------~~ 455 (455) +..+ .|. |....+ .......||.+. -+..++|.+.+.+|..... .+ T Consensus 433 q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~~q~~~~~~~~~a~~~~~~~ 502 (510) T protein:vir:78 433 QVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASD 502 (510) T ss_pred HHHHHhcChhhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 2222 122 222222 333445676442 3334555555544422111 11 No 151 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=97.24 E-value=0.00012 Score=42.07 Aligned_cols=410 Identities=9% Similarity=0.009 Sum_probs=172.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCC-CCCCHHHHHHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQF-PHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~-~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |.=. .. -+-.....+|+.++.--.- ...+++..+-.||.. +.++.. .+.++ .|-+...+.++.++..+.+ T Consensus 1 ~~~~-~~---~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~---~~~~~-~~dstg~~a~~~LAa~l~~ 72 (517) T protein:vir:10 1 MDMR-FA---GNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDD---LSSQN-AWQDDGASATNFLSNKLSQ 72 (517) T ss_pred Cccc-cc---ccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCCC---ccccc-cccchHHHHHHHHHHHHHH Confidence 4311 00 0111333344444221111 334455555555632 111111 12222 4555566666666655521 Q ss_pred C--Cce-----ecc-------------CCHHHHHHHHhhc------cCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCc Q lcl|NC_020844. 79 R--EVE-----VTE-------------GTGPSEDLLEDID------GRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQ 132 (455) Q Consensus 79 k--~p~-----i~~-------------~~~~l~~~~~d~d------~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~ 132 (455) - ||. +.- ....++.|++.+. ...++++.-+..+.++...+|.+.++++ +... T Consensus 73 ~ltpp~~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~--~~~~ 150 (517) T protein:vir:10 73 VLFPAQRSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMYHP--DKTS 150 (517) T ss_pred hhcCCCCccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEe--CCCC Confidence 0 111 110 0123455554441 2345788888999999999999877764 2111 Q ss_pred chhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeee-------------------------eEEEEEe- Q lcl|NC_020844. 133 FRNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDA-------------------------ETILVLR- 186 (455) Q Consensus 133 ~~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~-------------------------e~~~v~~- 186 (455) .+..|+-.+.. +..+..|+ +.+.++ ++.. +.+.+++ T Consensus 151 --------------~~~~~pl~~y~-v~~d~~G~--v~~ivr-r~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~ 212 (517) T protein:vir:10 151 --------------PIQAVPLHHYC-VRRDNNGT--VLDIVF-LQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTH 212 (517) T ss_pred --------------cEEEEEcCeEE-EeeCCCcC--eEEEEe-eeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEE Confidence 13444433322 22222222 222111 1110 1122221 Q ss_pred ----cCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCc Q lcl|NC_020844. 187 ----PDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFP 262 (455) Q Consensus 187 ----~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p 262 (455) ++....|....++.. +..++..++...|++++++....++.+ +.+|-.+...-.......+-.+......+..| T Consensus 213 v~~~~~~~~~~~~~~d~~~-~~~~s~y~~~e~P~~~~Rw~~~~ge~Y-Grgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~ 290 (517) T protein:vir:10 213 AKRTKDGKYLIRQSADDVP-VGKESTVTEDKSPFLILTWKRSYGEDY-GRGMAEDHAGAFFVIQFLSEALARGMALMADV 290 (517) T ss_pred EEEeCCCceEEEEEeCcee-eccccccccccCCeeeeeeeecCCCCc-ccchHHHhHHHHHHHHHHHHHHHHHHHHhccC Confidence 111111111212221 234455567889999998887777665 56665444433333333333334444445545 Q ss_pred eEEEe--cCCCccccccCccceeeccceeeeccCCCCccccceeEEeeC-chhHHHHHHHHHHHHHHHHHh--hhhhccC Q lcl|NC_020844. 263 MLSAN--GVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPA-TESLKFLAEQIESVSKEIREL--GRQPLTA 337 (455) Q Consensus 263 ~~~~~--G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~-~~~l~~~~~~l~~~~~~m~~~--g~~~l~~ 337 (455) .+.+. |... +..+.-|.. ...+|.. .+++..++.. +..+....+.|+++++.|... ...+... T Consensus 291 ~~lv~~~~~~~-------~~~l~~~~~-g~~~~g~----~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~~ 358 (517) T protein:vir:10 291 KYLVKPGSYTD-------INQFVEGGS-GAVLHGV----EGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMMEAMTRR 358 (517) T ss_pred CcccCcccccc-------hhhccCCCc-cccccCC----cccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhhcc Confidence 44442 2211 111111211 1123311 1234445533 445777788888888888663 1122333 Q ss_pred CCcchhHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHH Q lcl|NC_020844. 338 QSGNLTRISAAQAASKGNSAVQQWAAGLKDA-----LENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMR 412 (455) Q Consensus 338 ~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a-----~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~ 412 (455) ++.+.||++...+.......|.-.-..++.= ++++|..+-.- + +...+.+++-.-.... --.+.++.+..+. T Consensus 359 ~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l~~~-l-~~~~v~~~~~s~la~l-~r~~~~~~i~~~~ 435 (517) T protein:vir:10 359 DAERVTAYEIQRDAMLVEQSLGGVYSLFATTFQGPLARWFMNGISSI-L-TSKNVSPTILTGIEAL-GRMAELDKLGTFN 435 (517) T ss_pred CCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhhh-c-CCCCccceeeccHHHH-HHHHHHHHHHHHH Confidence 4456899999988887777776655554432 22222222111 1 1222333322111000 0012222222221 Q ss_pred H---c-CCCC--------HHH-HHHHHHhcCCCCCCCCHHHHHHHHHhhccc------CCCC Q lcl|NC_020844. 413 Q---N-GDIS--------REG-LWMEFQRLGELSDNFDPEEESERLISELPG------GIEE 455 (455) Q Consensus 413 ~---~-G~is--------~et-~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~------~l~~ 455 (455) + . .-++ .+. +.......||-...+-.++|.++..++... .++. T Consensus 436 ~~i~~~a~~~~~~~~~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~~~~~~~~~~~~~~~~~ 497 (517) T protein:vir:10 436 GYVSMTAQWPEPLQQAIKWPDFTDWVQGQISANFPFFKTQDELNAEAQAQQEQEATKYAAEQ 497 (517) T ss_pred HHHHHhhcCChHHHhcCCHHHHHHHHHHHhCCChhhcCCHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 1 1111 111 222334456644444445554443333110 1111 No 152 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=97.17 E-value=0.00014 Score=41.64 Aligned_cols=403 Identities=12% Similarity=0.048 Sum_probs=179.9 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhh-- Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPF-- 77 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf-- 77 (455) |=+ ....+|+.+++-..= ...+++..+-.||..-..+...=..+..+ .|-+...+.++.++..++ T Consensus 1 mk~-----------~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~-~~dstg~~a~~~Laa~l~~~ 68 (542) T protein:vir:78 1 MKG-----------LAQARYSAMRADREDFLDMARRCAALTLPYLLTEDGHASGGRLQQ-PYQSLGSKGVNALSSKLMLS 68 (542) T ss_pred Chh-----------HHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCcccccccc-cccchHHHHHHHHHHHHHHh Confidence 220 011233333331100 33344555555664321111111122322 456666677777666653 Q ss_pred --c--CCc-eec----------c-CC---HHHHHHHHhh------ccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCc Q lcl|NC_020844. 78 --G--REV-EVT----------E-GT---GPSEDLLEDI------DGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQ 132 (455) Q Consensus 78 --~--k~p-~i~----------~-~~---~~l~~~~~d~------d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~ 132 (455) . +|. .+. . ++ ..++.|++.+ -...++++.-+..+.++...+|-+.+++|.+. T Consensus 69 ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~--- 145 (542) T protein:vir:78 69 LFPIQTSFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFAGKKT--- 145 (542) T ss_pred hcCCCCccccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEecCCC--- Confidence 2 111 111 0 11 1244444332 11245788888899999999999988887331 Q ss_pred chhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEE-EEEee-------------------------eeEEEEE- Q lcl|NC_020844. 133 FRNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYM-RMWED-------------------------AETILVL- 185 (455) Q Consensus 133 ~~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v-~~~E~-------------------------~e~~~v~- 185 (455) ++.|+-.+.. +..+..| .+.+.+ +++=+ ...+.+. T Consensus 146 ---------------~~~~pl~~y~-v~~d~~G--~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~ 207 (542) T protein:vir:78 146 ---------------LKVYPLDRYV-IERDGDG--NVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQ 207 (542) T ss_pred ---------------ceEEecceeE-EeeCCCC--CeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEE Confidence 2233322211 1112111 111111 11100 0011111 Q ss_pred ---ecCeEEEEE--EeCCcEEEEe----------ccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHH Q lcl|NC_020844. 186 ---RPDDWERWR--KNDAGIWEVD----------EFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKEC 250 (455) Q Consensus 186 ---~~~~~~~~~--~~~~g~~~~~----------~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~s 250 (455) ......+|. ....+.|..+ .....++...|++++++....++.+ +.+|..+...-...+...+- T Consensus 208 ~v~pr~~~~~~~~~~~~~~~~s~~~e~~g~~v~~~~~e~g~~~~P~i~~Rw~~~~ge~Y-Grgp~~~~l~D~k~L~~l~~ 286 (542) T protein:vir:78 208 GKGGRNDAEVFTCCKLVDGQHRWHQECDGKEIKGSRSSSPLKHSPWLPLRFNVVDGESY-GRGRVEEFFGDLSSLDALTR 286 (542) T ss_pred EeecccCCccccccccCCCeEEEEEEeccccccccccccccccCCceeeeeeecCCCcc-ccchHHHHHHHHHHHHHHHH Confidence 111111110 0011112111 1224467788999988887777666 77787776665555556666 Q ss_pred HHHHHHHHcCCceEEEe--cCCCccccccCccce-eeccceeeeccCCCCccccceeEEee-CchhHHHHHHHHHHHHHH Q lcl|NC_020844. 251 ALEFAEMMTAFPMLSAN--GVEPDTDSEGSPKPL-DTGPDTVLYAPPNAEGQHGSWNFVEP-ATESLKFLAEQIESVSKE 326 (455) Q Consensus 251 d~~~~l~~~~~p~~~~~--G~~~~~~~~~~~~~~-~~g~~~~~~lp~~~~~~~~~~~~le~-~~~~l~~~~~~l~~~~~~ 326 (455) ......+.+..|.+.+. |.. ++..+ ..|++.+ +|. ..++++.++. ++..+....+.|+++++. T Consensus 287 ~~l~~~~~a~~pp~lv~~~g~~-------~~~~~~~~~~g~i--v~g----~~~~v~~~~~~~~~~~~~~~~~i~~~~~r 353 (542) T protein:vir:78 287 SLIEGSAAAAKVVFMVSPSATT-------KPQSLARAGTGAI--IQG----RAEDVSVVQANKGADFRTVQEMIRDLSQR 353 (542) T ss_pred HHHHHHHHHhcCceeecccccc-------chhhcccCCCcee--ecC----CccceeeeecccccchhHHHHHHHHHHHH Confidence 66777777888876653 211 11111 2232222 221 1234555553 355788888999999999 Q ss_pred HHHhhhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHcCC-C--CCceEEEEecccCCC Q lcl|NC_020844. 327 IRELGRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKD-----ALENALYITNLWMDA-P--DTNPEADVYTDFRAD 398 (455) Q Consensus 327 m~~~g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~-----a~~~~l~~~a~~~g~-~--~~~~~v~v~~df~~~ 398 (455) |...=.-....++...||++.+.+.......|.-.-..++. -+++++.++.+ .|. + +.+. ++.+|.. T Consensus 354 I~~aFl~~~~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r-~g~lP~~p~~l---v~~~~~s- 428 (542) T protein:vir:78 354 ISDAFLILNVRQSERTTATEVREVQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQR-SKQLPSLPKGL---VMPTVVA- 428 (542) T ss_pred HHHHhcccccCCcccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCchhc---eeeeeec- Confidence 96531111122344579999999888888888776666643 33444444433 232 1 2111 2333322 Q ss_pred CCC----HHHHHHHHHHHH---c--C------CCCHHHHHH-HHHhcCCCCC-CCCHHHHHHHHHhhcc-cCCCC Q lcl|NC_020844. 399 TGE----EQTPGYLLTMRQ---N--G------DISREGLWM-EFQRLGELSD-NFDPEEESERLISELP-GGIEE 455 (455) Q Consensus 399 ~~~----~~~~~al~~~~~---~--G------~is~et~~~-~l~~~~vl~~-~~d~e~e~~ri~~e~~-~~l~~ 455 (455) .+. .++++++....+ + | .|....+.+ .....||-.. -+..++|++..++|.. ...-+ T Consensus 429 ~La~~~r~~~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~~q~q~~~~~~ 503 (542) T protein:vir:78 429 GLGGVGRGEDRAALIEFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQQQMTA 503 (542) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHH Confidence 121 122233222111 1 1 121222222 2333566422 2232333443333311 11111 No 153 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=97.03 E-value=0.0002 Score=40.85 Aligned_cols=408 Identities=10% Similarity=0.037 Sum_probs=171.1 Q ss_pred CCCCCCCccCHH--HHHHHHHHHHHHHHhcC--------HHHHHHhhhhcCCCCC------------------------- Q lcl|NC_020844. 1 MPEQDPTKRSAD--AEAMSDYLQTVDDVLEG--------VTGMRRNFKRYVKQFP------------------------- 45 (455) Q Consensus 1 m~~~~v~~~~p~--y~~~~~~w~~i~d~~~G--------~~~~r~~g~~yLpk~~------------------------- 45 (455) |++.++ +++|+ |... ..=+++|.-.+= .+++. ...|.|+.- T Consensus 1 ~~~~~~-~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~ 76 (532) T protein:vir:94 1 MADTDP-TPRPEITYATL-QQAQRVDAKRATHTSLGLATAHEID--PTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSF 76 (532) T ss_pred CCCCCC-CCCcceehhhh-hhHhhhhhhhhhhhhhhhhhhhhhc--ccccccccccccccccccccccCccccccccccc Confidence 887765 34444 2221 111222221110 00100 011222110 Q ss_pred --CCCHHHH---HHHhhccccCChHHHHHHHHhhhhhcCCceeccCC-----HHHHHHHHhhccCCCCHHHHHHHHHHHH Q lcl|NC_020844. 46 --HEQNKRY---DLRKSVAKMTNVFRDVLEGLASKPFGREVEVTEGT-----GPSEDLLEDIDGRGNNLTTFAGQLFFQG 115 (455) Q Consensus 46 --~e~~~~Y---~~rl~ra~~~n~~~~~v~~~~g~lf~k~p~i~~~~-----~~l~~~~~d~d~~G~~l~~~~~~~~~~a 115 (455) ...-..| ..|. ....++++|+..+.-.+++.+++.+.. +.....++... ....+.+.++.+.+.+ T Consensus 77 ~~~~~~~~~~l~a~Y~----~~~l~r~~Vd~~aed~~r~~~~i~~~~~~~~~~~~~~~i~~~~-~~l~v~~~l~~a~~~~ 151 (532) T protein:vir:94 77 VEATSWPGFPTLALLA----QLPEYRTMHETPADECVRAWGKITCSSKDELAADKATRITQKL-EQYNVRTLVRTVVIHD 151 (532) T ss_pred ccccccchHHHHHHHH----cCchhhhhhccchHHHhhCCceEeeCCccccchHHHHHHHHHH-HhhhHHHHHHHHHHhh Confidence 0000111 1222 145668999999999999999995421 12222222221 1135778888888888 Q ss_pred HhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEE-EEEEeeeeEEEEEec-----Ce Q lcl|NC_020844. 116 VADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITY-MRMWEDAETILVLRP-----DD 189 (455) Q Consensus 116 l~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~-v~~~E~~e~~~v~~~-----~~ 189 (455) ..||.+++++.....+... -.+ .|.. ++++.|- .+....+.. -.++-....+...+| +. T Consensus 152 rlyG~a~i~i~v~~~~~~~-~~~------~p~~--l~~~~I~------~g~~~~l~vld~~~v~p~~~~~~dp~sp~fg~ 216 (532) T protein:vir:94 152 QAYGGAHVFPHLKMDGDSV-PAD------APLL--LSPSFVQ------RGCLIGFATIEPMWLSPNAYNATDPTLPSFYK 216 (532) T ss_pred hcccceEEEEEeccCCccc-ccc------cccc--ccccccc------cceeeEEEeechheecccccccccccccccCC Confidence 8999999888643222100 000 0000 1111110 000011100 000000000111110 11 Q ss_pred EEEEEEeCCcEEEEeccCCCCCCceeEE----EEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEE Q lcl|NC_020844. 190 WERWRKNDAGIWEVDEFGRITIGVVPMV----PFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLS 265 (455) Q Consensus 190 ~~~~~~~~~g~~~~~~~g~~~l~~vP~V----~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~ 265 (455) -+.|....+. . -|+--.|.|+ |-+..+. .. ..+.|-|..+.+.....-++...-..+++...+.++. T Consensus 217 P~~y~v~~g~--~-----iH~SRli~f~g~~~p~~~~~~-~~-~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k 287 (532) T protein:vir:94 217 PDSWIATSGK--K-----IHSSRIHTVVGRPVGDMLKAA-YS-FRGVSISQLAMPYVDNWLRTRQSVSDTVKQFSMTNLA 287 (532) T ss_pred ceeEEEccCe--e-----eccceEEEecCCCchhhhccc-cc-cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceee Confidence 1122211111 1 1211112211 0000011 11 2356656666555444444444456677777777765 Q ss_pred EecCCCccccccCcc----c----eeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhcc- Q lcl|NC_020844. 266 ANGVEPDTDSEGSPK----P----LDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLT- 336 (455) Q Consensus 266 ~~G~~~~~~~~~~~~----~----~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~- 336 (455) + ++...-..++... . ..-+...++.++.+. .++..+..+-+++ .+.++...++|......|+. T Consensus 288 ~-~~a~~ls~~~~~~~~~r~~~~~~~~~n~g~~~id~~~----e~~e~~~~~lsgl---~~~l~~~~~~iAaa~~IP~t~ 359 (532) T protein:vir:94 288 T-DMAQLLAPGGAQSLDARLQLFNLYRDNRNIGALDKGT----EEIQQTNTPLSGL---DSLQAQSQEQMAAVSHIPLVK 359 (532) T ss_pred e-chHHhhcchhHHHHHHHHHHHHhhcCCccceEEcCCC----ceeEEEecccCCH---HHHHHHHHHHHHhHhCCCeee Confidence 4 4322111111100 0 011223455555322 2455555444454 55666667777766655543 Q ss_pred --CCC---cchhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH-cCCCCCceEEEEecccCCCCCCHH------ Q lcl|NC_020844. 337 --AQS---GNLTRISAAQAASKGNSAVQQWAAG-LKDALENALYITNLW-MDAPDTNPEADVYTDFRADTGEEQ------ 403 (455) Q Consensus 337 --~~~---~~~sa~a~~~~~~~~~s~L~~~a~~-~~~a~~~~l~~~a~~-~g~~~~~~~v~v~~df~~~~~~~~------ 403 (455) +++ -|.||.. +...-+..+..+..+ +.-.++.+++++..- .|..+.+.+|+++. .. .++.. T Consensus 360 LfG~sp~GlnstGe~---D~~~yyd~I~s~Qe~~l~p~le~l~~~l~~s~~g~~~~d~~~~f~p-L~--~~s~kEkAei~ 433 (532) T protein:vir:94 360 LLGITPNGLNASSDG---EIRVWYDFIAGYQATNLTPLMEWIIDLIQLSEYGQIDPGLAWEWSP-LM--ELDDKELAEVR 433 (532) T ss_pred eecCCcccccccchH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCceEEeCC-CC--CCCHHHHHHHH Confidence 222 1333332 344456666666543 566778888777532 34444456666553 11 12222 Q ss_pred --HHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCC--HHH---HHHHHHhhccc-------CCCC Q lcl|NC_020844. 404 --TPGYLLTMRQNGDISREGLWMEFQRLGELSDNFD--PEE---ESERLISELPG-------GIEE 455 (455) Q Consensus 404 --~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d--~e~---e~~ri~~e~~~-------~l~~ 455 (455) .+++..++++.|+||.+.+++.|+........-+ .++ +.+.+.++... +..+ T Consensus 434 ~~~a~a~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (532) T protein:vir:94 434 QLNASTDSTLMELGVIDAKMVQQRLAADPTSGYAGALGERDELDDVEEIAKQLMAAALNPPATAPQ 499 (532) T ss_pred HHHHHHHHHHHhcCCCCHHHHHHHHhcCCccccccccccccccccccchhhhhcccccCCCCCCCC Confidence 2456677889999999999998876443211100 001 11122222111 1111 No 154 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=96.58 E-value=0.0005 Score=38.70 Aligned_cols=386 Identities=7% Similarity=-0.069 Sum_probs=170.2 Q ss_pred CCCCCCCccC----HHHH----HHHHHHHHHHHHhcCHHHHHHhhhhcCCCC-----CCCCHHHHHHHhhccccCChHHH Q lcl|NC_020844. 1 MPEQDPTKRS----ADAE----AMSDYLQTVDDVLEGVTGMRRNFKRYVKQF-----PHEQNKRYDLRKSVAKMTNVFRD 67 (455) Q Consensus 1 m~~~~v~~~~----p~y~----~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~-----~~e~~~~Y~~rl~ra~~~n~~~~ 67 (455) |+-.+ .++ |... ...+.|..++....+... .|- .+++. .+..-+-|+.... -.++.- T Consensus 1 m~kk~--~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~g~-~~~~~~~iLr~~~~~~ly~~m~~----D~hi~s 70 (448) T protein:vir:77 1 MAKRG--RKPKELVPGPGSIDPSDVPKLEGASVPVMSTSY---DVV-VDREFDELLQGKDGLLVYHKMLS----DGTVKN 70 (448) T ss_pred CCCCC--CCCcccCCcccccchhhhhhhccchhhhccccc---ccc-cccchhHhhccccchHHHHHHhh----ChHHHH Confidence 98764 222 2222 233455555554332111 010 01111 1223355766653 466777 Q ss_pred HHHHHhhhhhcCCceec--cC-CH------HHHHHHHhhcc--CCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhh Q lcl|NC_020844. 68 VLEGLASKPFGREVEVT--EG-TG------PSEDLLEDIDG--RGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNR 136 (455) Q Consensus 68 ~v~~~~g~lf~k~p~i~--~~-~~------~l~~~~~d~d~--~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ 136 (455) .++.--..+.+.++++. +. +. .+..++...+. ...+++.++..+. .|+-||.+.+=+-+ T Consensus 71 ~l~~Rk~av~~~~w~v~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~l-da~~~G~s~~Eivw--------- 140 (448) T protein:vir:77 71 ALNYIFGRIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIYE-NAYIYGMAAGEIVL--------- 140 (448) T ss_pred HHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHHH-HhhhhcceeEEEEE--------- Confidence 77777777777787774 11 11 13333332211 1346888888885 78899977763322 Q ss_pred hhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeee--eEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCce Q lcl|NC_020844. 137 EEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDA--ETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVV 214 (455) Q Consensus 137 ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~--e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~v 214 (455) +.. .+|++.+..+..+... ..+. ++++.--+++...+ .+.-...+.. --.| T Consensus 141 -----------------------~~~-~dg~~~~~~l~~r~~~~~~~f~-~~~~~~l~~~~~~~-~~~~~~~~~~-~~~l 193 (448) T protein:vir:77 141 -----------------------TLG-ADGKLILDKIVPIHPFNIDEVL-YDEEGGPKALKLSG-EVKGGSQFVN-GLEI 193 (448) T ss_pred -----------------------eec-CCCceeeccccccCCCccceee-eecCCceEEEecCC-cccccccCCC-cccc Confidence 111 1222222222211111 1111 11111111111111 1100001110 1124 Q ss_pred eEEEEE--ecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccc--cccC--c---cceeec Q lcl|NC_020844. 215 PMVPFI--TGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTD--SEGS--P---KPLDTG 285 (455) Q Consensus 215 P~V~f~--~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~--~~~~--~---~~~~~g 285 (455) |+=.|+ ..+.. +-..+..-|.-++..-+--.....+...-++.-+.|+++.+--..... .+.. . ..+..| T Consensus 194 P~~~~i~~~~~~~-g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g 272 (448) T protein:vir:77 194 PIWKTVVFLHNDD-GSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQK 272 (448) T ss_pred ccceEEEEecCCc-CCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeEEecCCCCCCCHHHHHHHHHHHHHHhcC Confidence 533332 12222 333455555566654444344567778888999999999883222111 1110 0 112457 Q ss_pred cceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHh--hhhhccCCCcchhHHHHHHH-HHHHHHHHHHHH Q lcl|NC_020844. 286 PDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIREL--GRQPLTAQSGNLTRISAAQA-ASKGNSAVQQWA 362 (455) Q Consensus 286 ~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~--g~~~l~~~~~~~sa~a~~~~-~~~~~s~L~~~a 362 (455) +..+..+|.+. .++|+|..+.+- .+.+.++...++|..+ |.. ++...+..++.++.-. ..-....+...+ T Consensus 273 ~~a~~iiP~g~-----~ie~~ea~~~~~-~~~~~i~~~d~~Isk~iLGqt-lTs~~~~g~~~~~~~~~~~v~~~~~~aDa 345 (448) T protein:vir:77 273 PRHGIILPDDW-----KFDTVDLKSAMP-DAIPYLTYHDAGIARALGIDF-NTVQLNMGVQAVNIGEFVSLTQQTIISLQ 345 (448) T ss_pred CceEEEecCCc-----eEEEEecCCCcc-CHHHHHHHHHHHHHHHHhccc-cccccccchhhhhhhhHHHHHHHHHHHHH Confidence 78888899765 699999776553 4667788888888553 433 3332222222222211 122334456677 Q ss_pred HHHHHHHH-HHHHHHHHHc-CCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHH-hcCCCCCCCCHH Q lcl|NC_020844. 363 AGLKDALE-NALYITNLWM-DAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQ-RLGELSDNFDPE 439 (455) Q Consensus 363 ~~~~~a~~-~~l~~~a~~~-g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~-~~~vl~~~~d~e 439 (455) ..+.+.++ +++++++.+- |....-..+.+. +. +++++.++.+.... +....+ +.|+ |...+.. T Consensus 346 ~~i~~tln~~Li~~l~~lNfg~~~~~P~~~f~--~~----e~eDl~~~a~~~~~-------l~~~~~~~~~i-p~~~~~~ 411 (448) T protein:vir:77 346 REFASAVNLYLIPKLVLPNWPGATRFPRLTFE--ME----ERNDFSAAANLMGM-------LINAVKDSEDI-PTELKAL 411 (448) T ss_pred HHHHHHHHHHHHHHHHHhcCCCCCCCCEEEec--CC----ChhhHHHHHHHhHH-------HHHHHHHHhcC-CccCCcC Confidence 77888887 4778777764 321111244332 21 33444443332211 111122 2344 2111100 Q ss_pred HHHHHHHhhcccCCCC Q lcl|NC_020844. 440 EESERLISELPGGIEE 455 (455) Q Consensus 440 ~e~~ri~~e~~~~l~~ 455 (455) -+...=+.+.+.++.+ T Consensus 412 ~~~~~~~~~~~~~~~~ 427 (448) T protein:vir:77 412 IDALPSKMRRALGVVD 427 (448) T ss_pred CCCCchhcccccCCCC Confidence 0000001112222333 No 155 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=96.07 E-value=0.0011 Score=36.91 Aligned_cols=365 Identities=13% Similarity=0.120 Sum_probs=153.1 Q ss_pred CCCCCCCccCHH-HHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcC Q lcl|NC_020844. 1 MPEQDPTKRSAD-AEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGR 79 (455) Q Consensus 1 m~~~~v~~~~p~-y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k 79 (455) |+==+..+++.. ..-.-+.| +...-.|... .|+. .+..++.+.++..+..+.+++++. T Consensus 1 M~~f~~~~~~~~~~~~~~~~~--~~~~~~~~~~------~~v~---------~~~al~~~~V~~~v~~ia~~ia~~---- 59 (397) T protein:vir:38 1 MPLLKLNKSHSQGFSLNDPDW--VNFLTGGEAQ------KYVS---------ADTALKNSDIFSLIMQLSGDLAMV---- 59 (397) T ss_pred CcchhhhhcccCcccCCchhh--hhhhcCCcCC------ceec---------hHHhhccHHHHHHHHHHHHHHhhC---- Confidence 662111010000 00000122 1111111100 0111 112233333333334444444443 Q ss_pred CceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhccc Q lcl|NC_020844. 80 EVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILEI 159 (455) Q Consensus 80 ~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w 159 (455) |.... .+.+..|+.... ...+..+|.+.+....+.+|-||+++.++..+.+. -+..++|..|--+ T Consensus 60 p~~~~--~~~~~~l~~~PN-~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~------------~l~~l~~~~v~i~ 124 (397) T protein:vir:38 60 RYTSE--SDRSQSIISNPS-VTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDL------------SWEYLRPSQVQPM 124 (397) T ss_pred ccccc--ccHHHHHHhcCC-CCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEE------------EEEEEcCceeEEE Confidence 43332 344566666554 35688999999999999999999998766544321 2455555554211 Q ss_pred eeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEe----CCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchH Q lcl|NC_020844. 160 QSERQGAGERITYMRMWEDAETILVLRPDDWERWRKN----DAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPM 235 (455) Q Consensus 160 ~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~----~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL 235 (455) .... +.. ..|+.. ..|....... -. ++-|..... .+...+.+|+ T Consensus 125 ~~~~--~~~----------------------~~y~~~~~~~~~~~~~~~~~----~e---iih~~~~~~-~~~~~G~s~i 172 (397) T protein:vir:38 125 LLQD--GSG----------------------LIYNINFDEPAIGYMENVPA----AD---VIHIRLLSK-NGGKTGISPL 172 (397) T ss_pred EcCC--Cce----------------------EEEEEEeccccccceeEecC----cc---EEEecCCCC-CCccccccHH Confidence 1111 100 011110 0111111111 11 222332222 2334577887 Q ss_pred HHHHHHHHHHHHhHHHHHHHH-HHcCCceEEEecCCCccccccCc-----cceeec--cceeeeccCCCCccccceeEEe Q lcl|NC_020844. 236 KDALDLQVALFRKECALEFAE-MMTAFPMLSANGVEPDTDSEGSP-----KPLDTG--PDTVLYAPPNAEGQHGSWNFVE 307 (455) Q Consensus 236 ~dla~l~~~hy~~~sd~~~~l-~~~~~p~~~~~G~~~~~~~~~~~-----~~~~~g--~~~~~~lp~~~~~~~~~~~~le 307 (455) ..+.. .+.......++.... ...+.|..+++--.....+.... +...-| ++.++.++. +.+|.+ T Consensus 173 ~~~~~-~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~-------g~~~~~ 244 (397) T protein:vir:38 173 SALIN-EQQIKDASNELTLKALKQSVTASAVLTIQKGGLLDAETRIARSKEISKQIHNSDGPVVIDA-------LEDYKP 244 (397) T ss_pred HHHHH-HHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceecCC-------CceEEe Confidence 66554 445555555554444 44567777776322221111000 001111 233455542 355555 Q ss_pred eCchhHH-HHHHHHHHHHHHHHHh-h--hhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCC Q lcl|NC_020844. 308 PATESLK-FLAEQIESVSKEIREL-G--RQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAP 383 (455) Q Consensus 308 ~~~~~l~-~~~~~l~~~~~~m~~~-g--~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~ 383 (455) ++.++-. .+.+..+...++|+.+ | ..++.....+.+..+.. .......|.-+...++++++..|- + T Consensus 245 l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~e~~--~~~~~~~l~P~~~~ie~~ln~~l~--------~ 314 (397) T protein:vir:38 245 LEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQSSITQI--SGQYAKSLNRYVQAIVGELNDKLH--------A 314 (397) T ss_pred cCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHH--HHHHHHHHHHHHHHHHHHHHHhcc--------C Confidence 4433222 2345556666666554 2 23343322222222111 111223455566666655554221 1 Q ss_pred CCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCC-HHHHHHHHH--hhcccCCCC Q lcl|NC_020844. 384 DTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFD-PEEESERLI--SELPGGIEE 455 (455) Q Consensus 384 ~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d-~e~e~~ri~--~e~~~~l~~ 455 (455) . . +++..|.......+.++++.++.+.|+++...+++.+..-.+.+.+.- .+....... .+..++-++ T Consensus 315 ~--~--~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~~~~~~~~~~~~~g~~~ 385 (397) T protein:vir:38 315 N--I--SANIRFAIDAMGDQYASTISSSVKGGTIAGNQARFILQNSGYLAKDLPDPEKEPQQAIQLIQQEGGEND 385 (397) T ss_pred h--h--cccccccccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCccccccccccccccccccccCCCC Confidence 1 1 122233333344566788889999999999999887755444332211 111110110 000011111 No 156 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=96.01 E-value=0.0011 Score=36.74 Aligned_cols=404 Identities=9% Similarity=-0.012 Sum_probs=171.1 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHH-HhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDD-VLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGR 79 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d-~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k 79 (455) |= .+ ...+|+..++ -+ +..+++..+-.||..-..+...-..++.+ .|-+...+.++.++..+.+- T Consensus 1 mk----~~-------~~~~~~~lkR~~~--e~~w~e~a~~tlP~~~~~~~~~~~~~~~~-~~dstg~~a~~~LAa~l~~~ 66 (510) T protein:vir:63 1 MK----TT-------AAMLWEKLRDGSV--EQRAIEFAKTTLPYLMVDPMSGSRGVVEH-DFQSAGALLVNNLAAKLARS 66 (510) T ss_pred Ch----hH-------HHHHHHHHhccch--HHHHHHHHHhhccccCCCCCCccccccCC-CccchHHHHHHHHHHHHHhh Confidence 22 11 2234433321 11 33445555555663221111111122333 35566666666666555210 Q ss_pred --Cce-----ec-------------cCCHHHHHHHHhhcc------CCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcc Q lcl|NC_020844. 80 --EVE-----VT-------------EGTGPSEDLLEDIDG------RGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQF 133 (455) Q Consensus 80 --~p~-----i~-------------~~~~~l~~~~~d~d~------~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~ 133 (455) ||. +. .....++.|++.+.. ..++++.-+..+.++...+|.+.+++| +.+. T Consensus 67 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~~~--~~~~- 143 (510) T protein:vir:63 67 LFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRD--SDAA- 143 (510) T ss_pred hcCCCCcccccCCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEc--CCCc- Confidence 111 11 001124554433321 245788888899999999999988876 2211 Q ss_pred hhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEE-------ee--------------eeEEEEEecCeEEE Q lcl|NC_020844. 134 RNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMW-------ED--------------AETILVLRPDDWER 192 (455) Q Consensus 134 ~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~-------E~--------------~e~~~v~~~~~~~~ 192 (455) .+..|+-.+.. +..+..|+.-. ++.+++ |. .+.+.+++ .+ T Consensus 144 -------------~~~~~pl~~y~-v~~d~~G~vd~-i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~----~V 204 (510) T protein:vir:63 144 -------------TVVAWSLRSYA-VRRDATGRWMD-IVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYT----HV 204 (510) T ss_pred -------------EEEEEEcceeE-EeeCCCcCeeE-EEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEE----EE Confidence 13445444422 22222232221 111211 10 01122221 12 Q ss_pred EEEeCCc--EEE---------EeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCC Q lcl|NC_020844. 193 WRKNDAG--IWE---------VDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAF 261 (455) Q Consensus 193 ~~~~~~g--~~~---------~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~ 261 (455) ++..+.+ .|. +...+..++...|++++++....++.+ |.+|-.+...-...+...+-..-.....+.. T Consensus 205 ~~~~~~~~~~~sv~~e~dg~~~~~~~~~~~~e~P~~~~Rw~~~~ge~Y-Grgp~~~~l~D~k~L~~l~~~~l~~a~~a~~ 283 (510) T protein:vir:63 205 QRKKGTAMEYAELYHEIDGVRVGKEGRWPIHLCPYIVPTWNLAPGEHY-GRGHVEDYIGDFAKLSLLSEKLGLYELESLE 283 (510) T ss_pred EeecCCCceEEEEEEEecCceeccccccccccCceeeeeeeecCCCcc-ccchHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 2222211 111 123445567789999998877776665 5556554443333333333333333333344 Q ss_pred ce-EEEe-cCCCccccccCccceeeccceeeeccCCCCccccceeEEeeC-chhHHHHHHHHHHHHHHHHHhhhh-hccC Q lcl|NC_020844. 262 PM-LSAN-GVEPDTDSEGSPKPLDTGPDTVLYAPPNAEGQHGSWNFVEPA-TESLKFLAEQIESVSKEIRELGRQ-PLTA 337 (455) Q Consensus 262 p~-~~~~-G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~-~~~l~~~~~~l~~~~~~m~~~g~~-~l~~ 337 (455) |. ++-. |+ .++..+.-|....+ +|.. ..+++.++.. +..+....+.|+++++.|...=.. +... T Consensus 284 ~~~lv~p~g~-------~~~~~~~~~~~g~~-v~g~----~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~l~~~ 351 (510) T protein:vir:63 284 VLNLVDEAKG-------AVVDDYQDAEMGDY-VPGG----AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYGANQR 351 (510) T ss_pred CCcccCcccc-------cchhhhccCCCcee-ecCC----cccceeeecCcccchHHHHHHHHHHHHHHHHHHHhhcccC Confidence 44 3332 21 11122222222111 3321 1235555533 356777788888888888764211 2233 Q ss_pred CCcchhHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHcCC---CCCceEEEEecccCCCCCCHHH---HH Q lcl|NC_020844. 338 QSGNLTRISAAQAASKGNSAVQQWAAGLKDA-----LENALYITNLWMDA---PDTNPEADVYTDFRADTGEEQT---PG 406 (455) Q Consensus 338 ~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a-----~~~~l~~~a~~~g~---~~~~~~v~v~~df~~~~~~~~~---~~ 406 (455) .+.+.||++...+.......|.-.-..++.- +++++.++-+ .|+ ++..+...+- .|...---++. +. T Consensus 352 ~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r-~gl~p~p~~~~~~~~v-~~is~Laraq~~~~l~ 429 (510) T protein:vir:63 352 DAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIE-TGLPALSRSAAVQSML 429 (510) T ss_pred CCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCCCchhccccee-cchhHHHHHHHHHHHH Confidence 3455799999998888777776665554432 2333333322 233 2222222111 12111001222 22 Q ss_pred HHHHHHH-cCCCC-------HHHH-HHHHHhcCCCCC-CCCHHHHHHHHHhh-ccc---------CCCC Q lcl|NC_020844. 407 YLLTMRQ-NGDIS-------REGL-WMEFQRLGELSD-NFDPEEESERLISE-LPG---------GIEE 455 (455) Q Consensus 407 al~~~~~-~G~is-------~et~-~~~l~~~~vl~~-~~d~e~e~~ri~~e-~~~---------~l~~ 455 (455) ++.+..+ .|.+. ...+ .......||.+. -+-.++|.+.+.+| ..+ .+.+ T Consensus 430 ~~~q~l~~~~~~aq~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~qq~~~~~~~~~~~~~ 498 (510) T protein:vir:63 430 NASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEQQRQQAAQAQAAQETLLE 498 (510) T ss_pred HHHHHHHHhcCchhhhccCCHHHHHHHHHHHhCCChhHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222222 12222 2222 233444666432 22233444443332 111 1111 No 157 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=95.75 E-value=0.0015 Score=36.04 Aligned_cols=375 Identities=11% Similarity=0.012 Sum_probs=152.3 Q ss_pred HHHHHHHHHHhcCHHHHHHhh-hhcCCCC------------CCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCCcee Q lcl|NC_020844. 17 SDYLQTVDDVLEGVTGMRRNF-KRYVKQF------------PHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGREVEV 83 (455) Q Consensus 17 ~~~w~~i~d~~~G~~~~r~~g-~~yLpk~------------~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~p~i 83 (455) ..-|.+++...+...+ ... ..++... .+..-.. +..++.+. +...|+.+++-+-+-|..+ T Consensus 1 M~~~~r~~~~~~~~~r--~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~-~~al~~~~----v~~~i~~ia~~ia~lp~~~ 73 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKR--QTSQVIELNKDDEKLLEWLGISPSTISVKG-KNALKVAT----VFACIKILSESVSKLPLKI 73 (432) T ss_pred CChHHHHHHhcCcccc--CcccccccCCchHHHHHHhCCCcCccccch-hhhhccHH----HHHHHHHHHHhhccCceEE Confidence 2344444444332110 000 0001000 0000000 11222222 2333444444333333332 Q ss_pred ---------ccCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEech Q lcl|NC_020844. 84 ---------TEGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNH 153 (455) Q Consensus 84 ---------~~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~ 153 (455) ......+..++. .-+ ...+-.+|.+.+....+.+|-+|+++..+..+.+. -+.+++| T Consensus 74 ~~~~~~~~~~~~~~~l~~lL~~~PN-~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~------------~L~~i~~ 140 (432) T protein:vir:10 74 YQEDEYGIQRGTKHYLNNLLRLRPN-PYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQ------------ALWPIDA 140 (432) T ss_pred EEecCCceeeccccHHHHHHHhhcc-CCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE------------EEEEEcC Confidence 112233555553 333 45678899999999999999999999766544321 2445555 Q ss_pred hhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEE-EEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCc Q lcl|NC_020844. 154 SAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWE-RWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFH 232 (455) Q Consensus 154 ~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~-~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~ 232 (455) ..|--.. +..+ . +. .... .|....+|....... -. ++-|.++.. .+...+. T Consensus 141 ~~v~v~~-d~~~---~---------------~~-~~~~~~y~~~~~g~~~~~~~----~e---iih~r~~~~-~~~~~G~ 192 (432) T protein:vir:10 141 SKVTVYI-DDVG---L---------------LN-SKTKMWYVVNTGGQQRVLKP----EE---ILHFKNGIT-LDGLVGV 192 (432) T ss_pred ceeEEEE-cCcc---c---------------cc-ccceEEEEEecCCeEEEEcc----cc---EEEecCCCC-CCCcccc Confidence 5542100 0000 0 00 0011 111222222111111 11 222322221 2334578 Q ss_pred chHHHHHHHHHHHHHhHHHHH-HHHHHcCCceEEEecCC---CccccccCcc--ceeec---cceeeeccCCCCccccce Q lcl|NC_020844. 233 PPMKDALDLQVALFRKECALE-FAEMMTAFPMLSANGVE---PDTDSEGSPK--PLDTG---PDTVLYAPPNAEGQHGSW 303 (455) Q Consensus 233 ~pL~dla~l~~~hy~~~sd~~-~~l~~~~~p~~~~~G~~---~~~~~~~~~~--~~~~g---~~~~~~lp~~~~~~~~~~ 303 (455) +|+.-+... +........+. ......+.|-.+++--. ++........ ...-| ++.++.+|. +.++ T Consensus 193 s~~~~~~~~-i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~-----g~~~ 266 (432) T protein:vir:10 193 PTMEYLKST-LENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPV-----GYQF 266 (432) T ss_pred cHHHHHHHH-HHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecCC-----CceE Confidence 887766543 34444444443 33455567877776311 1111111110 11122 344566653 2345 Q ss_pred eEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccC-CCcchhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 304 NFVEPATESLKFLAEQIESVSKEIREL---GRQPLTA-QSGNLTR-ISAAQAASKGNSAVQQWAAGLKDALENALYITNL 378 (455) Q Consensus 304 ~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~-~~~~~sa-~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~ 378 (455) .-+..++...+ +.+..+...++|+.+ ...++.. ..++-+. .+... .-....|.-++..++++++..|-.-.. T Consensus 267 ~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~--~~~~~~l~P~~~~ie~~ln~kLl~~~~ 343 (432) T protein:vir:10 267 QPISLNMSDAQ-FLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQ--QFYTDTLQATLTMYEQEMTYKLFLDSE 343 (432) T ss_pred EEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH--HHHHHHHHHHHHHHHHHHHHhhcChhh Confidence 54554444433 233444444555443 3333421 1222221 11111 113344555666666666543321111 Q ss_pred HcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCC--CC--------CCHHHHHHHHHhh Q lcl|NC_020844. 379 WMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELS--DN--------FDPEEESERLISE 448 (455) Q Consensus 379 ~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~--~~--------~d~e~e~~ri~~e 448 (455) + ..+..|+++.+-.........++++.++...|+++.-.+++.+ |+-| .. ..+-++...-..+ T Consensus 344 ~----~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~---g~~pi~ggD~~~~~~n~~~~~~~~~~~~k 416 (432) T protein:vir:10 344 L----DKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKE---DLPPEAGGDRLLVNGNMLPIDMAGQAYLK 416 (432) T ss_pred c----CCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCCCCCCCeEeecccccchhhccccccC Confidence 1 2234455554323333335567788889999999998888766 3322 11 1111111110000 Q ss_pred -cc--cCCC----C Q lcl|NC_020844. 449 -LP--GGIE----E 455 (455) Q Consensus 449 -~~--~~l~----~ 455 (455) ++ +..+ | T Consensus 417 ~~~~~~~~~~~~~~ 430 (432) T protein:vir:10 417 GGDTNGEVSKEGNE 430 (432) T ss_pred CCCCCCCCCCCCCC Confidence 00 0000 1 No 158 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=95.75 E-value=0.0015 Score=36.04 Aligned_cols=375 Identities=11% Similarity=0.012 Sum_probs=152.3 Q ss_pred HHHHHHHHHHhcCHHHHHHhh-hhcCCCC------------CCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCCcee Q lcl|NC_020844. 17 SDYLQTVDDVLEGVTGMRRNF-KRYVKQF------------PHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGREVEV 83 (455) Q Consensus 17 ~~~w~~i~d~~~G~~~~r~~g-~~yLpk~------------~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~p~i 83 (455) ..-|.+++...+...+ ... ..++... .+..-.. +..++.+. +...|+.+++-+-+-|..+ T Consensus 1 M~~~~r~~~~~~~~~r--~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~-~~al~~~~----v~~~i~~ia~~ia~lp~~~ 73 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKR--QTSQVIELNKDDEKLLEWLGISPSTISVKG-KNALKVAT----VFACIKILSESVSKLPLKI 73 (432) T ss_pred CChHHHHHHhcCcccc--CcccccccCCchHHHHHHhCCCcCccccch-hhhhccHH----HHHHHHHHHHhhccCceEE Confidence 2344444444332110 000 0001000 0000000 11222222 2333444444333333332 Q ss_pred ---------ccCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEech Q lcl|NC_020844. 84 ---------TEGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNH 153 (455) Q Consensus 84 ---------~~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~ 153 (455) ......+..++. .-+ ...+-.+|.+.+....+.+|-+|+++..+..+.+. -+.+++| T Consensus 74 ~~~~~~~~~~~~~~~l~~lL~~~PN-~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~------------~L~~i~~ 140 (432) T protein:vir:10 74 YQEDEYGIQRGTKHYLNNLLRLRPN-PYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQ------------ALWPIDA 140 (432) T ss_pred EEecCCceeeccccHHHHHHHhhcc-CCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE------------EEEEEcC Confidence 112233555553 333 45678899999999999999999999766544321 2445555 Q ss_pred hhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEE-EEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCc Q lcl|NC_020844. 154 SAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWE-RWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFH 232 (455) Q Consensus 154 ~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~-~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~ 232 (455) ..|--.. +..+ . +. .... .|....+|....... -. ++-|.++.. .+...+. T Consensus 141 ~~v~v~~-d~~~---~---------------~~-~~~~~~y~~~~~g~~~~~~~----~e---iih~r~~~~-~~~~~G~ 192 (432) T protein:vir:10 141 SKVTVYI-DDVG---L---------------LN-SKTKMWYVVNTGGQQRVLKP----EE---ILHFKNGIT-LDGLVGV 192 (432) T ss_pred ceeEEEE-cCcc---c---------------cc-ccceEEEEEecCCeEEEEcc----cc---EEEecCCCC-CCCcccc Confidence 5542100 0000 0 00 0011 111222222111111 11 222322221 2334578 Q ss_pred chHHHHHHHHHHHHHhHHHHH-HHHHHcCCceEEEecCC---CccccccCcc--ceeec---cceeeeccCCCCccccce Q lcl|NC_020844. 233 PPMKDALDLQVALFRKECALE-FAEMMTAFPMLSANGVE---PDTDSEGSPK--PLDTG---PDTVLYAPPNAEGQHGSW 303 (455) Q Consensus 233 ~pL~dla~l~~~hy~~~sd~~-~~l~~~~~p~~~~~G~~---~~~~~~~~~~--~~~~g---~~~~~~lp~~~~~~~~~~ 303 (455) +|+.-+... +........+. ......+.|-.+++--. ++........ ...-| ++.++.+|. +.++ T Consensus 193 s~~~~~~~~-i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~-----g~~~ 266 (432) T protein:vir:10 193 PTMEYLKST-LENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPV-----GYQF 266 (432) T ss_pred cHHHHHHHH-HHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecCC-----CceE Confidence 887766543 34444444443 33455567877776311 1111111110 11122 344566653 2345 Q ss_pred eEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccC-CCcchhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 304 NFVEPATESLKFLAEQIESVSKEIREL---GRQPLTA-QSGNLTR-ISAAQAASKGNSAVQQWAAGLKDALENALYITNL 378 (455) Q Consensus 304 ~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~-~~~~~sa-~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~ 378 (455) .-+..++...+ +.+..+...++|+.+ ...++.. ..++-+. .+... .-....|.-++..++++++..|-.-.. T Consensus 267 ~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~--~~~~~~l~P~~~~ie~~ln~kLl~~~~ 343 (432) T protein:vir:10 267 QPISLNMSDAQ-FLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQ--QFYTDTLQATLTMYEQEMTYKLFLDSE 343 (432) T ss_pred EEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH--HHHHHHHHHHHHHHHHHHHHhhcChhh Confidence 54554444433 233444444555443 3333421 1222221 11111 113344555666666666543321111 Q ss_pred HcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCC--CC--------CCHHHHHHHHHhh Q lcl|NC_020844. 379 WMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELS--DN--------FDPEEESERLISE 448 (455) Q Consensus 379 ~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~--~~--------~d~e~e~~ri~~e 448 (455) + ..+..|+++.+-.........++++.++...|+++.-.+++.+ |+-| .. ..+-++...-..+ T Consensus 344 ~----~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~---g~~pi~ggD~~~~~~n~~~~~~~~~~~~k 416 (432) T protein:vir:10 344 L----DKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKE---DLPPEAGGDRLLVNGNMLPIDMAGQAYLK 416 (432) T ss_pred c----CCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCCCCCCCeEeecccccchhhccccccC Confidence 1 2234455554323333335567788889999999998888766 3322 11 1111111110000 Q ss_pred -cc--cCCC----C Q lcl|NC_020844. 449 -LP--GGIE----E 455 (455) Q Consensus 449 -~~--~~l~----~ 455 (455) ++ +..+ | T Consensus 417 ~~~~~~~~~~~~~~ 430 (432) T protein:vir:10 417 GGDTNGEVSKEGNE 430 (432) T ss_pred CCCCCCCCCCCCCC Confidence 00 0000 1 No 159 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=95.75 E-value=0.0015 Score=36.04 Aligned_cols=375 Identities=11% Similarity=0.012 Sum_probs=152.3 Q ss_pred HHHHHHHHHHhcCHHHHHHhh-hhcCCCC------------CCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCCcee Q lcl|NC_020844. 17 SDYLQTVDDVLEGVTGMRRNF-KRYVKQF------------PHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGREVEV 83 (455) Q Consensus 17 ~~~w~~i~d~~~G~~~~r~~g-~~yLpk~------------~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~p~i 83 (455) ..-|.+++...+...+ ... ..++... .+..-.. +..++.+. +...|+.+++-+-+-|..+ T Consensus 1 M~~~~r~~~~~~~~~r--~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~-~~al~~~~----v~~~i~~ia~~ia~lp~~~ 73 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKR--QTSQVIELNKDDEKLLEWLGISPSTISVKG-KNALKVAT----VFACIKILSESVSKLPLKI 73 (432) T ss_pred CChHHHHHHhcCcccc--CcccccccCCchHHHHHHhCCCcCccccch-hhhhccHH----HHHHHHHHHHhhccCceEE Confidence 2344444444332110 000 0001000 0000000 11222222 2333444444333333332 Q ss_pred ---------ccCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEech Q lcl|NC_020844. 84 ---------TEGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNH 153 (455) Q Consensus 84 ---------~~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~ 153 (455) ......+..++. .-+ ...+-.+|.+.+....+.+|-+|+++..+..+.+. -+.+++| T Consensus 74 ~~~~~~~~~~~~~~~l~~lL~~~PN-~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~------------~L~~i~~ 140 (432) T protein:vir:10 74 YQEDEYGIQRGTKHYLNNLLRLRPN-PYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQ------------ALWPIDA 140 (432) T ss_pred EEecCCceeeccccHHHHHHHhhcc-CCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE------------EEEEEcC Confidence 112233555553 333 45678899999999999999999999766544321 2445555 Q ss_pred hhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEE-EEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCc Q lcl|NC_020844. 154 SAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWE-RWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFH 232 (455) Q Consensus 154 ~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~-~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~ 232 (455) ..|--.. +..+ . +. .... .|....+|....... -. ++-|.++.. .+...+. T Consensus 141 ~~v~v~~-d~~~---~---------------~~-~~~~~~y~~~~~g~~~~~~~----~e---iih~r~~~~-~~~~~G~ 192 (432) T protein:vir:10 141 SKVTVYI-DDVG---L---------------LN-SKTKMWYVVNTGGQQRVLKP----EE---ILHFKNGIT-LDGLVGV 192 (432) T ss_pred ceeEEEE-cCcc---c---------------cc-ccceEEEEEecCCeEEEEcc----cc---EEEecCCCC-CCCcccc Confidence 5542100 0000 0 00 0011 111222222111111 11 222322221 2334578 Q ss_pred chHHHHHHHHHHHHHhHHHHH-HHHHHcCCceEEEecCC---CccccccCcc--ceeec---cceeeeccCCCCccccce Q lcl|NC_020844. 233 PPMKDALDLQVALFRKECALE-FAEMMTAFPMLSANGVE---PDTDSEGSPK--PLDTG---PDTVLYAPPNAEGQHGSW 303 (455) Q Consensus 233 ~pL~dla~l~~~hy~~~sd~~-~~l~~~~~p~~~~~G~~---~~~~~~~~~~--~~~~g---~~~~~~lp~~~~~~~~~~ 303 (455) +|+.-+... +........+. ......+.|-.+++--. ++........ ...-| ++.++.+|. +.++ T Consensus 193 s~~~~~~~~-i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~-----g~~~ 266 (432) T protein:vir:10 193 PTMEYLKST-LENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPV-----GYQF 266 (432) T ss_pred cHHHHHHHH-HHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecCC-----CceE Confidence 887766543 34444444443 33455567877776311 1111111110 11122 344566653 2345 Q ss_pred eEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccC-CCcchhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 304 NFVEPATESLKFLAEQIESVSKEIREL---GRQPLTA-QSGNLTR-ISAAQAASKGNSAVQQWAAGLKDALENALYITNL 378 (455) Q Consensus 304 ~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~-~~~~~sa-~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~ 378 (455) .-+..++...+ +.+..+...++|+.+ ...++.. ..++-+. .+... .-....|.-++..++++++..|-.-.. T Consensus 267 ~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~--~~~~~~l~P~~~~ie~~ln~kLl~~~~ 343 (432) T protein:vir:10 267 QPISLNMSDAQ-FLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQ--QFYTDTLQATLTMYEQEMTYKLFLDSE 343 (432) T ss_pred EEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH--HHHHHHHHHHHHHHHHHHHHhhcChhh Confidence 54554444433 233444444555443 3333421 1222221 11111 113344555666666666543321111 Q ss_pred HcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCC--CC--------CCHHHHHHHHHhh Q lcl|NC_020844. 379 WMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELS--DN--------FDPEEESERLISE 448 (455) Q Consensus 379 ~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~--~~--------~d~e~e~~ri~~e 448 (455) + ..+..|+++.+-.........++++.++...|+++.-.+++.+ |+-| .. ..+-++...-..+ T Consensus 344 ~----~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~---g~~pi~ggD~~~~~~n~~~~~~~~~~~~k 416 (432) T protein:vir:10 344 L----DKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKE---DLPPEAGGDRLLVNGNMLPIDMAGQAYLK 416 (432) T ss_pred c----CCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCCCCCCCeEeecccccchhhccccccC Confidence 1 2234455554323333335567788889999999998888766 3322 11 1111111110000 Q ss_pred -cc--cCCC----C Q lcl|NC_020844. 449 -LP--GGIE----E 455 (455) Q Consensus 449 -~~--~~l~----~ 455 (455) ++ +..+ | T Consensus 417 ~~~~~~~~~~~~~~ 430 (432) T protein:vir:10 417 GGDTNGEVSKEGNE 430 (432) T ss_pred CCCCCCCCCCCCCC Confidence 00 0000 1 No 160 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=95.56 E-value=0.0018 Score=35.58 Aligned_cols=426 Identities=11% Similarity=0.098 Sum_probs=186.7 Q ss_pred CCCCC---------CCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCC----CCCHHHHHHHhhccccCChHH Q lcl|NC_020844. 1 MPEQD---------PTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFP----HEQNKRYDLRKSVAKMTNVFR 66 (455) Q Consensus 1 m~~~~---------v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~----~e~~~~Y~~rl~ra~~~n~~~ 66 (455) |+.+. -+..|-=-......|+...+.-.= .+.+++.. .|+-... +.+...++ .++..|.+- T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~-~yi~~~~tr~t~~~~~~w~----~s~t~~k~~ 75 (599) T protein:vir:31 1 MSTDIKTLQKMLEGRDDDRAFIDELVVLFTNMENARAQKDREDKELM-DYIDATDTRKTSNSKLPFK----NSTTINKLA 75 (599) T ss_pred CccchHHHHHHhhccCchHHHHHHHHHHHHhhhhhhhhhhcccHHHH-HHHhhhcccccccCCCCcc----cccchHHHH Confidence 55331 233444444556677777665321 11122111 1222111 11111122 233466777 Q ss_pred HHHHHHhhhhhcCC-ce-----ec----c-----CCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCC-- Q lcl|NC_020844. 67 DVLEGLASKPFGRE-VE-----VT----E-----GTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPA-- 129 (455) Q Consensus 67 ~~v~~~~g~lf~k~-p~-----i~----~-----~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~-- 129 (455) .+++.+..++|.-- |+ +. + ..+.++.+..|== .-.++..-...++.+-+.+|-|+.-|++-+ T Consensus 76 ~~~~~l~a~~~~~~fp~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl-~e~~~~~~~~~~v~d~i~~G~~vat~~~er~~ 154 (599) T protein:vir:31 76 HLHLMITTSYMEHLLPNRNWVDFVGFDNDSVNAEKREIARSYVRGKV-EASNLEGVIERMVDDFAVRGFCVAHTRHVKRM 154 (599) T ss_pred HHHHHHHHHHHhhhcCCccceEeeecCCchhHHHHHHHHHHHhhhhh-hhcchHHHHHHHHhhhcccCceeEeeeEEEcc Confidence 77777777765422 11 10 1 1122444443321 123566667788889999999999988432 Q ss_pred --CCcchhhhhHHhccCCceEEEechhhhc-cceeeccCCeeEEEEEEEEee-----------e---------------- Q lcl|NC_020844. 130 --GQQFRNREEERQAGVRPYWSQVNHSAIL-EIQSERQGAGERITYMRMWED-----------A---------------- 179 (455) Q Consensus 130 --~~~~~t~ad~~~~~~rPy~~~~~~~~ii-~w~~~~~~~~~~l~~v~~~E~-----------~---------------- 179 (455) .... .-....+.|-+..++|.+|+ +-.-..++.-..+ +|...+ . T Consensus 155 ~~~~d~----~v~~~~~~P~~ervsP~Di~~Dp~A~si~d~~fi--vRs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~~ 228 (599) T protein:vir:31 155 TVTAEN----QVIKNYSGTVTERLSPSDVFWDVTADSLPKAAKC--IRQLYTLGSLKREIEEGTFPLMSMEDFQKLREER 228 (599) T ss_pred eeeccc----ccccccccceEEeecccceeeCCCCCCCCcceee--eehhhhHHHHHHHhccCCccccchHHHHHHHhhc Confidence 1111 11234556889999998886 2222222211111 122110 0 Q ss_pred --------eE---------------EEE---EecCeEE-------EEEEeCCcEE-----EEe--------ccCCCCCCc Q lcl|NC_020844. 180 --------ET---------------ILV---LRPDDWE-------RWRKNDAGIW-----EVD--------EFGRITIGV 213 (455) Q Consensus 180 --------e~---------------~~v---~~~~~~~-------~~~~~~~g~~-----~~~--------~~g~~~l~~ 213 (455) +. .++ +.++-++ +|...+++.| ++. +..+.+.|. T Consensus 229 ~~~~~~~~d~~~~~~g~D~~~~d~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~~g~ 308 (599) T protein:vir:31 229 RTIREALADGYNGRRKFDSLHKKGYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTWDGS 308 (599) T ss_pred cCCCccccchhhhhhhccccccccccchhhhcccchhhhhhhhhhhhcccCCccccceEEEEecCcEEeecccCCCCCCC Confidence 00 011 1112111 2223333433 111 233567788 Q ss_pred eeEEEEEecccCCCcccCcchHHHHHHHHHHH---HHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceee Q lcl|NC_020844. 214 VPMVPFITGRRKGKKWVFHPPMKDALDLQVAL---FRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVL 290 (455) Q Consensus 214 vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~h---y~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~ 290 (455) .|||...+-+...+. .+..||..+..+.-.+ .|..-|. +.....|++...|.-.. ..+..+|+.+| T Consensus 309 ~Pyvv~~~~P~~~~~-yG~G~l~~~~gaQ~~lN~~~Ng~iD~---~~~~l~p~l~~~~dl~~-------eD~~~~P~~v~ 377 (599) T protein:vir:31 309 QNLHIAVYEFQKDTL-CPIGPLHRLTGMQYKLDKRENFREDL---HDRFLHPSLKKVGDVRE-------KGMRGGPNHVF 377 (599) T ss_pred CCeEEEEeeeecccc-CCCCCchhcchHHHHHHHHHHHhhhh---hhhhhcccccccccccc-------cCccCCCCcce Confidence 999866555544333 3444554444433322 3333333 33444677776654211 22344567777 Q ss_pred eccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhccC----CCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 291 YAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLTA----QSGNLTRISAAQAASKGNSAVQQWAAGLK 366 (455) Q Consensus 291 ~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~----~~~~~sa~a~~~~~~~~~s~L~~~a~~~~ 366 (455) .+- ..+++.++.++.+... ....+...+.+|-.+++.+... ..+.+||...+.-..+.+.........++ T Consensus 378 ~~~-----d~~~vq~~~p~s~~~~-a~~~is~~e~~mee~sGvp~~~~G~~~ag~~TA~~is~l~naa~~~~~~~vr~~e 451 (599) T protein:vir:31 378 EVE-----ETGDVQYMTPPAEVLQ-PDNQLSITLQLMEDLSGAPKESIGQRTAGEKTKFEVQLLDQGQNKVFRRKVKKFE 451 (599) T ss_pred eec-----CCCccccccCchhhhh-HHHHHHHHHHHHHHhhccchhhcCCcccchhhHHHHHHHHhhhhhhHHHHHHHHH Confidence 653 3346766666655433 3445777788887776555422 23457888887777777777777877777 Q ss_pred HHHHH-HHH----HHHHHcCCCC------Cc----eEEEEecccCCC-----CCC-------HHHHHHHHHHHHc----C Q lcl|NC_020844. 367 DALEN-ALY----ITNLWMDAPD------TN----PEADVYTDFRAD-----TGE-------EQTPGYLLTMRQN----G 415 (455) Q Consensus 367 ~a~~~-~l~----~~a~~~g~~~------~~----~~v~v~~df~~~-----~~~-------~~~~~al~~~~~~----G 415 (455) +.+-. +++ +..+++..++ ++ .=++|.+++... .+. ++.++-+.+..++ + T Consensus 452 ~~~lepll~~l~e~~~~f~D~~~tiri~~~e~~~~~f~~i~redl~~~~~~v~~Ga~~v~ere~~~q~l~~il~~~~~q~ 531 (599) T protein:vir:31 452 RELLTPVLNDYLEQGRNHLDASDTIKTFNSELGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQNLNAILGGPLGAA 531 (599) T ss_pred HHHHHHHHHHHHHHHHhhcccccceeeecccccceeeEEeehhhhhCCeeeeechhhHHHHHHHHHHHHHHHhcccCCCc Confidence 76544 443 3333332211 00 113444433211 111 1112223222211 1 Q ss_pred C---CCHHHHHHHH------HhcCCCCCCCCH-HHHHH--H----HHhhcccCCCC Q lcl|NC_020844. 416 D---ISREGLWMEF------QRLGELSDNFDP-EEESE--R----LISELPGGIEE 455 (455) Q Consensus 416 ~---is~et~~~~l------~~~~vl~~~~d~-e~e~~--r----i~~e~~~~l~~ 455 (455) + ++...+...+ +.-++-.+.+-. +.+.. . ++++.+..|.. T Consensus 532 ~~P~~~~k~l~~~l~~~~~l~~~~~~~~~va~~eqq~~~~m~Q~~lq~~~~~~~~~ 587 (599) T protein:vir:31 532 LAPHMSRTKLFNAVEYLGDLDAYGIFTFGIGVQEDQQLARMAQKSTQQTEETALTQ 587 (599) T ss_pred cchhhHHHHHHHHHHHHHhccccccCCCchhHHHHHHHHHHHHHHHHHhHhhhhhh Confidence 1 2222222111 112222222221 11111 1 11111111111 No 161 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=95.49 E-value=0.002 Score=35.43 Aligned_cols=367 Identities=8% Similarity=0.022 Sum_probs=153.2 Q ss_pred CCCCCCCccC----HHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCC-CCCCHHHHHHHhhccccCChHHHHHHHHhhh Q lcl|NC_020844. 1 MPEQDPTKRS----ADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQF-PHEQNKRYDLRKSVAKMTNVFRDVLEGLASK 75 (455) Q Consensus 1 m~~~~v~~~~----p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~-~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~ 75 (455) |. ...+. .......+.|--+-+ ..+.+.. .++.-.. ...++ .+.+..+|+-+++- T Consensus 1 M~---~f~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~v~~-~~al~----~~~v~~~i~~ia~~ 60 (386) T protein:vir:49 1 MP---IFNITNLATESPPINQESFFDIAD------------SDFLASLNSSEWVSA-ENALK----NSDLFSIISQLSND 60 (386) T ss_pred Cc---hhhhhccCCCCcccchhhhhhhhh------------ccccccccCCceech-hhhhc----cHHHHHHHHHHHHH Confidence 44 11110 000000111110100 0011110 0110000 11122 23344566666776 Q ss_pred hhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhh Q lcl|NC_020844. 76 PFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSA 155 (455) Q Consensus 76 lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ 155 (455) +-+-|+.+.. .....++.... ...+-.+|.+.+....+.+|-+|+++..+..+.+. -+..++|.. T Consensus 61 ia~~p~~~~~--~~~~~l~~~PN-~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~------------~l~~i~~~~ 125 (386) T protein:vir:49 61 LATAKITTSR--KQLQGIVDNPS-NNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDM------------KWEYLRPSQ 125 (386) T ss_pred hhhCceeecc--chhhhhhhccC-CCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEE------------EEEEecCce Confidence 6666666642 22345555443 24578999999999999999999999765544321 244455544 Q ss_pred hccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeC--CcEEEEeccCCCCCCceeEEEEEecccCCCcccCcc Q lcl|NC_020844. 156 ILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKND--AGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHP 233 (455) Q Consensus 156 ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~--~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~ 233 (455) |--...+. + ..+. +.+...+ ++....... =. |+-|..... .+...+.+ T Consensus 126 v~v~~~~~-~--~~~~-------------------y~~~~~~~~~~~~~~~~~----~e---vih~~~~~~-~~~~~G~s 175 (386) T protein:vir:49 126 VSFNRLDN-Q--NGLY-------------------YNITFDDPHIAPKQHVPQ----ND---ILHFRLLSV-DGGLTSVS 175 (386) T ss_pred eEEEEcCC-C--ceEE-------------------EEEEEcCccccceeEEcc----cc---EEEecCCCC-CCcccccc Confidence 42111100 0 0000 0111110 011001110 01 232332222 24445788 Q ss_pred hHHHHHHHHHHHHHhHHHHHHH-HHHcCCceEEEecCCCcccccc---Cc--cceeeccceeeeccCCCCccccceeEEe Q lcl|NC_020844. 234 PMKDALDLQVALFRKECALEFA-EMMTAFPMLSANGVEPDTDSEG---SP--KPLDTGPDTVLYAPPNAEGQHGSWNFVE 307 (455) Q Consensus 234 pL~dla~l~~~hy~~~sd~~~~-l~~~~~p~~~~~G~~~~~~~~~---~~--~~~~~g~~~~~~lp~~~~~~~~~~~~le 307 (455) |+.-+... +.......++... ....+.|-.+++--.....++. .. ....-.++.++.+|.+ .++.-++ T Consensus 176 ~l~~~~~~-i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~~g-----~~~~~l~ 249 (386) T protein:vir:49 176 PLMALGRE-FNIQKASDKLTISALKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDL-----EDFTPLE 249 (386) T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHHHHHHHHHHHHhccCCCCceecCCC-----ceEEEcc Confidence 87766553 3444444454444 4445668777752111111000 00 0111223455666532 2343344 Q ss_pred eCchhHHHHHHHHHHHHHHHHHh---hhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCC Q lcl|NC_020844. 308 PATESLKFLAEQIESVSKEIREL---GRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPD 384 (455) Q Consensus 308 ~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~ 384 (455) .++...+ +.+..+-..++|+.+ ...++.....+.+ .+...+. .....+.-+...++..+++.| +. T Consensus 250 ~~~~d~~-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~-~~~~~~~-~~~~~i~~~l~~i~~~~~~~l-------~~-- 317 (386) T protein:vir:49 250 IKSNVAQ-LLSQADWTTGQFAKVYGIPESIVGGDGDQQS-SLEMIYN-IYFKSVSRYLRPFVSEMSKKL-------SC-- 317 (386) T ss_pred CChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCccc-hHHHHHH-HHHHHHHHHHHHHHHHHHHHh-------cc-- Confidence 3333332 344455556666554 3333432222211 1111111 122334444444444444433 21 Q ss_pred CceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 385 TNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 385 ~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) .+.+.+. .............+.++..+|+++.-.+++.+.+.|+.+.+....+....... ++|=++ T Consensus 318 -~~~~~~~--~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~~~~~~~~~~~~~~~--~gGd~~ 383 (386) T protein:vir:49 318 -EVDVDIS--PAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILPKELPDGKNPNRTSL--KGGEIN 383 (386) T ss_pred -hhcccch--hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCCCcCcchhccCCCCC--CCCCCC Confidence 1222222 11111223445667778889999999999999888887755432221111111 111111 No 162 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=95.29 E-value=0.0024 Score=34.98 Aligned_cols=394 Identities=7% Similarity=-0.074 Sum_probs=173.2 Q ss_pred CCCCCCCc----cCH--HHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCC-----CCCCHHHHHHHhhccccCChHHHHH Q lcl|NC_020844. 1 MPEQDPTK----RSA--DAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQF-----PHEQNKRYDLRKSVAKMTNVFRDVL 69 (455) Q Consensus 1 m~~~~v~~----~~p--~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~-----~~e~~~~Y~~rl~ra~~~n~~~~~v 69 (455) |+...-.- +.| .-....|.|..++....+... .|- ..++. .+..-+-|+.... -.++...+ T Consensus 1 m~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~g~-~~~~~~~iLr~~~~~~ly~~m~~----D~hi~s~l 72 (448) T protein:vir:79 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSY---DVV-VDREFDELLQGKDGLLVYHKMLS----DGTVKNAL 72 (448) T ss_pred CCCCCCCCccccCcccccccccchhhhhhhhhhccccc---ccc-cccchhHhhccccchHHHHHHhh----ChHHHHHH Confidence 66442110 111 111124555555554333211 010 01110 1223456776553 47777778 Q ss_pred HHHhhhhhcCCceec--cCCHH-------HHHHHHhhcc--CCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhh Q lcl|NC_020844. 70 EGLASKPFGREVEVT--EGTGP-------SEDLLEDIDG--RGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREE 138 (455) Q Consensus 70 ~~~~g~lf~k~p~i~--~~~~~-------l~~~~~d~d~--~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad 138 (455) +.-...+.+.++.|. ++++. +..++...+. .-.++++++..+. .|+-||.+.+=+-+. T Consensus 73 ~~Rk~av~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~~~~l-da~~~G~s~~Eivw~---------- 141 (448) T protein:vir:79 73 NYIFGRIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIYE-NAYIYGMAAGEIVLT---------- 141 (448) T ss_pred HHHHHHHhcCCceEecCCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHHHHHH-HhhhhcceeEEEEee---------- Confidence 888888888888884 12211 2233322111 1235777777666 577888777543221 Q ss_pred HHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEee--eeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeE Q lcl|NC_020844. 139 ERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWED--AETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPM 216 (455) Q Consensus 139 ~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~--~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~ 216 (455) .. .+|++.+..+..+.. ...+.+-..+...+ +...+. +.-...+..+ -.+|+ T Consensus 142 ----------------------~~-~~g~~~~~~l~~r~~~~~~~f~~~~d~~l~~-~~~~~~-~~~~~~~~~~-~~lP~ 195 (448) T protein:vir:79 142 ----------------------LG-ADGKLILDKIVPIHPFNIDEVLYDEEGGPKA-LKLSGE-VKGGSQFVSG-LEIPI 195 (448) T ss_pred ----------------------ec-CCCceecccccccCCccccceeeecCCceEE-eecCCc-ccccccCCCc-ccccc Confidence 10 122222222221111 01111111111111 111110 0000000011 12343 Q ss_pred EEEE--ecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcc--ccccC--cc---ceeeccc Q lcl|NC_020844. 217 VPFI--TGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDT--DSEGS--PK---PLDTGPD 287 (455) Q Consensus 217 V~f~--~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~--~~~~~--~~---~~~~g~~ 287 (455) =+|+ ..+.. +-..+.+-|..++..-+---....+...-++.-+.|+++.+--.... ..+.. .. .+..|.. T Consensus 196 ~~~i~~~~~~~-g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~ 274 (448) T protein:vir:79 196 WKTVVFLHNDD-GSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPR 274 (448) T ss_pred ceEEEEecCcc-CCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCCCCCcCHHHHHHHHHHHHHHhcCCc Confidence 2222 22332 33345565666665444334456777888899999999888322211 11110 11 1335788 Q ss_pred eeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHh--hhhhccCCCcchhHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_020844. 288 TVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIREL--GRQPLTAQSGNLTRISAA-QAASKGNSAVQQWAAG 364 (455) Q Consensus 288 ~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~--g~~~l~~~~~~~sa~a~~-~~~~~~~s~L~~~a~~ 364 (455) .+..+|.+. ++.|++..+++. .+.+.++...++|..+ |.. ++.+.+..++.+.. ....-....+..-+.. T Consensus 275 a~~iiP~~~-----~ie~~ea~~~~~-~~~~~i~~~d~~Isk~iLGqt-lTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~ 347 (448) T protein:vir:79 275 HGIILPDDW-----KFDTVDLKSAMP-DAIPYLTYHDAGIARALGIDF-NTVQLNMGVQAINIGEFVSLTQQTIISLQRE 347 (448) T ss_pred eEEEecCCc-----eEEEEecCCCcc-cHHHHHHHHHHHHHHHHhhhh-hccccccchhhhhhhhHHHHHHHHHHHHHHH Confidence 888899765 799999877654 4566788888888553 433 33333222222222 1122233445667788 Q ss_pred HHHHHHH-HHHHHHHHc-CCCCCceEEEEecccCCCCCC-HHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHH Q lcl|NC_020844. 365 LKDALEN-ALYITNLWM-DAPDTNPEADVYTDFRADTGE-EQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEE 441 (455) Q Consensus 365 ~~~a~~~-~l~~~a~~~-g~~~~~~~v~v~~df~~~~~~-~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e 441 (455) +.+++++ ++++++.+- |....-..+.+ +.... .| ...++.+.++...+.++ +.+.. .+.++..+ .+.++ T Consensus 348 i~~tln~~li~~l~~lNfg~~~~~P~~~f--~~~e~-~Dl~~~a~~~~~l~~~~~~~-~~~~~--~~~~~p~~--~~~~~ 419 (448) T protein:vir:79 348 FASAVNLYLIPKLVLPNWPSATRFPRLTF--EMEER-NDFSAAANLMGMLINAVKDS-EDIPT--ELKALIDA--LPSKM 419 (448) T ss_pred HHHHHHHHHHHHHHHhcCCCcCCCcEEEe--cCCCh-HHHHHHHHHhhhhhccchhh-HHHHH--HhhcCCCC--CCCcc Confidence 8888874 888877764 32111123333 22211 11 22334444554444333 33322 22344222 22222 Q ss_pred HHHHHhhccc-----CCCC Q lcl|NC_020844. 442 SERLISELPG-----GIEE 455 (455) Q Consensus 442 ~~ri~~e~~~-----~l~~ 455 (455) ......+.+. +-.+ T Consensus 420 ~~a~~~~~~~~~~~~~~~~ 438 (448) T protein:vir:79 420 RRALGVVDEVREAVRQPAD 438 (448) T ss_pred ccccCCCCcccccccCCcc Confidence 2111111110 1111 No 163 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=95.00 E-value=0.003 Score=34.42 Aligned_cols=365 Identities=10% Similarity=0.023 Sum_probs=153.8 Q ss_pred CCCCCCCccCHH------HHHH--HHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHH Q lcl|NC_020844. 1 MPEQDPTKRSAD------AEAM--SDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGL 72 (455) Q Consensus 1 m~~~~v~~~~p~------y~~~--~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~ 72 (455) |+--+..++.+. +... ...+.-+-....|. .++ .+.. ..|.-.+.+...|+.+ T Consensus 3 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~g~---~v~~--~~al~~~~v~~~v~~i 63 (392) T protein:vir:74 3 LPILNFINQTNDPPEAGSVQSYFPDGNDAQIMESLLGD--------------NNE---WVSA--RAALRNSDLFSIILQL 63 (392) T ss_pred chhhhhhhcccCcccccccccccccCchhhhhhhccCC--------------CCc---ccch--hhhhcchHHHHHHHHH Confidence 332111111000 0000 00001111111110 000 0000 0111124455566666 Q ss_pred hhhhhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEec Q lcl|NC_020844. 73 ASKPFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVN 152 (455) Q Consensus 73 ~g~lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~ 152 (455) ++-+-+-|..+... ....+++.-+ ...+-.+|.+.+....+.+|-+|+++.++..+.+. -+..++ T Consensus 64 a~~ia~lp~~~~~~--~~~~l~~~PN-~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~------------~L~~i~ 128 (392) T protein:vir:74 64 SSDLAIVKINAEKK--KNQGIIDNPS-TNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADM------------KWEYLR 128 (392) T ss_pred HHhhccCceeeccc--hhhhhhhhcC-CCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEE------------EEEEEc Confidence 66666656655321 2234555443 34678999999999999999999999766544321 244444 Q ss_pred hhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEe--CC-c-EEEEeccCCCCCCceeEEEEEecccCCCc Q lcl|NC_020844. 153 HSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKN--DA-G-IWEVDEFGRITIGVVPMVPFITGRRKGKK 228 (455) Q Consensus 153 ~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~--~~-g-~~~~~~~g~~~l~~vP~V~f~~~~~~~~~ 228 (455) |..|--. .+..+ ....|+.. ++ + ...... -+. |+.|..... .+. T Consensus 129 ~~~v~v~-~~~~~-----------------------~~~~y~~~~~~~~~~~~~~~~-----~~e--vih~~~~~~-~~~ 176 (392) T protein:vir:74 129 PSQVNTY-YFEYE-----------------------NGMYYNITFDDPKIEPILQAP-----QSD--LIHMKLLSI-DGG 176 (392) T ss_pred CceeEEE-EcCCC-----------------------ceEEEEEEecCCccceeEEEc-----Ccc--EEEecCCCC-CCc Confidence 4444110 00001 00111111 10 0 000010 111 222332222 244 Q ss_pred ccCcchHHHHHHHHHHHHHhHHHH-HHHHHHcCCceEEEecCCC-ccccccCc---cce--eeccceeeeccCCCCcccc Q lcl|NC_020844. 229 WVFHPPMKDALDLQVALFRKECAL-EFAEMMTAFPMLSANGVEP-DTDSEGSP---KPL--DTGPDTVLYAPPNAEGQHG 301 (455) Q Consensus 229 ~~~~~pL~dla~l~~~hy~~~sd~-~~~l~~~~~p~~~~~G~~~-~~~~~~~~---~~~--~~g~~~~~~lp~~~~~~~~ 301 (455) ..+.+|+..+... +........+ .......+.|-.+++=-.. ...++... ... .-.++.++.|+. +. T Consensus 177 ~~G~s~i~~~~~~-i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~~-----g~ 250 (392) T protein:vir:74 177 KTGISPLYSLRRE-SKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLDD-----LE 250 (392) T ss_pred cccccHHHHHHHH-HHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecCC-----Cc Confidence 5688888766654 3333333443 3344566777777651111 11111110 000 012234556653 22 Q ss_pred ceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 302 SWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNL 378 (455) Q Consensus 302 ~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~ 378 (455) ++.-++.++...+ +.+..+-...+|+.+ ...++...+.+.+..+... +-....|.-+...++++++..| T Consensus 251 ~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~e~~~--~~~~~~l~p~~~~ie~~l~~~l----- 322 (392) T protein:vir:74 251 EFTALEIKSNVAQ-LLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQIS--GMYASALNRYLRPAISELEYKL----- 322 (392) T ss_pred eEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHH--HHHHHHHHHHHHHHHHHHHHhc----- Confidence 3433443433322 344455555666554 2233422222222222111 1223445556666666665543 Q ss_pred HcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 379 WMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 379 ~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) . + ...+.+..-|. .........+.++...|+++....++.+.+.|+.+.+.-..+ -+.....+.-+| T Consensus 323 --~-~--~~~~~~~~~~~--~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pne~r~~e---nl~~~~~Gd~~~ 389 (392) T protein:vir:74 323 --S-D--HISVNMRPAID--PLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAPE---NTNKKTTGQSNE 389 (392) T ss_pred --c-c--hhcccchhhhc--CCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCccccchhc---CCCCCCCCCCCC Confidence 1 1 12222222221 223445677888899999999999999999998754332111 111111111222 No 164 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=94.93 E-value=0.0032 Score=34.30 Aligned_cols=392 Identities=9% Similarity=-0.018 Sum_probs=143.6 Q ss_pred CCCCC----CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhh Q lcl|NC_020844. 1 MPEQD----PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKP 76 (455) Q Consensus 1 m~~~~----v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~l 76 (455) ||-.. -.++-|.+.+..-.-+-+.+.-+|.+. + -.+|. +-..+..... -.++.+++|+.++..+ T Consensus 43 ~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~g~~~-------~-~epp~-d~~~l~~l~~---~np~V~~aI~iia~~i 110 (648) T protein:vir:79 43 MPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGGGGRD-------F-EEPEF-DFNEITSAYN---TEGYVRQAVDKYIEMM 110 (648) T ss_pred cCCCCcccccccccchhHHHHHhHHHHHhhcCCccc-------c-ccCCc-CHHHHHHHHh---cChHHHHHHHHHHHHH Confidence 43211 022333322221111112222222211 1 11111 2233322221 1456777888888777 Q ss_pred hcCCceeccC-----CH-HHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEE Q lcl|NC_020844. 77 FGREVEVTEG-----TG-PSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQ 150 (455) Q Consensus 77 f~k~p~i~~~-----~~-~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~ 150 (455) .+-+..+... .. ....++..-. ...+..+|.+.+..+.+.+|-+|+.+.++..+.. |.... T Consensus 111 a~l~~~i~~~~~~~~~~~~~~~ll~rPn-~~~t~~~f~~~l~~~lll~GNAYveiiRd~~G~~------------~~~l~ 177 (648) T protein:vir:79 111 FKADWDFVSKNPNAVEYIRMRFTLMAEA-TQIPTNQLFIEIAEDLVKYCNVVIAKSRAKDALP------------FQGMN 177 (648) T ss_pred hhCcceEEecCCccchhhHHHHHhhccC-CCCCHHHHHHHHHHHHHhcCCeEEEEEecCCCcc------------chhhh Confidence 7776655311 11 1122222111 2457889999999999999999999987665431 11100 Q ss_pred -echhhhccceeeccCCeeEEEEEEEEeeeeEEEEE--ecCeEE--EEEEeCCcEEEEeccCCCCCCceeEEEEEecccC Q lcl|NC_020844. 151 -VNHSAILEIQSERQGAGERITYMRMWEDAETILVL--RPDDWE--RWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRK 225 (455) Q Consensus 151 -~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~--~~~~~~--~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~ 225 (455) +....+ ..+.+.+.+ . .+++++. ..+... +|...+++....... =. ++-|..+... T Consensus 178 ~~~~~~~-----~~v~~l~pl-----~--p~~v~v~~d~~g~~~~Y~y~~~g~~~~~~~~~----~d---IIHik~~~~~ 238 (648) T protein:vir:79 178 VMGVGDS-----MPVAGYFPL-----N--LASMKVKRDKFGMIKGWQQEQEGQDKPQKFKP----ED---IVHIYYKREK 238 (648) T ss_pred hhhhccc-----cceeeeEee-----c--CceeEEEEcCCCceeeeEEEecCCceeEEecC----cc---EEEEccCCCC Confidence 000000 001111111 0 0111111 111111 222222221111110 11 3333322222 Q ss_pred CCcccCcchHHHHHHHHHHHHHhHHHH-HHHHHHcCCceEEEec-CCCccccccC--ccceeeccceeeeccCCCCcccc Q lcl|NC_020844. 226 GKKWVFHPPMKDALDLQVALFRKECAL-EFAEMMTAFPMLSANG-VEPDTDSEGS--PKPLDTGPDTVLYAPPNAEGQHG 301 (455) Q Consensus 226 ~~~~~~~~pL~dla~l~~~hy~~~sd~-~~~l~~~~~p~~~~~G-~~~~~~~~~~--~~~~~~g~~~~~~lp~~~~~~~~ 301 (455) +...+.||+..++.. +...+...++ .......+.|..+++- ......+..+ .+.+.-+-++....+ .+. T Consensus 239 -d~~~GlSpi~~a~~a-I~l~~aa~~~~~~fF~NGa~P~gil~~~~~~~~~e~~k~~~e~~~~~~~~~~i~g-----g~v 311 (648) T protein:vir:79 239 -GRAFGTPWLLPALDD-IRALRQVEENVLRLVYRNLHPLWHVKVGLEQEGFGAEEGEVDLVRGEVENMDVEG-----GMV 311 (648) T ss_pred -CCceeccHHHHHHHH-HHHHHHHHHHHHHHHhccCCccEEEEeCCCccchHHHHHHHHHHHHhcccccccc-----ccc Confidence 334588888776654 3333434443 3445567788777751 1111111000 000100111111111 111 Q ss_pred ceeEEeeCc----hhHHHHHHHHHHHHHHHHHh---hhhhcc-CCCcch-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 302 SWNFVEPAT----ESLKFLAEQIESVSKEIREL---GRQPLT-AQSGNL-TRISAAQAASKGNSAVQQWAAGLKDALENA 372 (455) Q Consensus 302 ~~~~le~~~----~~l~~~~~~l~~~~~~m~~~---g~~~l~-~~~~~~-sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~ 372 (455) +++.+.++. ..++ +.+..+-..++|+.+ ...++. ..+.+. ++.+....+ ...+..+...+...++.. T Consensus 312 ~~~~~~i~~~~s~~dlq-fle~rk~~~~eIa~aFgVPP~lLG~~~~ss~stae~~~~~~---~~~i~~l~~~i~~~le~~ 387 (648) T protein:vir:79 312 TTERVNISSIASNQIID-AKEYLKHFEQRAFTVLGVSELMMGRGGTASRSTGDNLSSDF---KDRIKALQKVMATFINEF 387 (648) T ss_pred ccceeeccccCCHHHHH-HHHHHHHHHHHHHHHhCCCHhHcccCCCccchHHHHHHHHH---HHHHHHHHHHHHHHHHHH Confidence 223222221 1221 333344445555443 223332 112222 222221111 222222333333333221 Q ss_pred H-H---H---HHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHH Q lcl|NC_020844. 373 L-Y---I---TNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERL 445 (455) Q Consensus 373 l-~---~---~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri 445 (455) + + + +..|+.. +..+.|.++ +. .+......++.+.+++++|++|..+.++.+ |+- |.-+.+.. ..+ T Consensus 388 ~~~~ll~e~~l~~~l~~-d~~ieF~~~-~L-lr~D~~~~a~~~~~l~~~GilT~NEaR~~l---Glp-Pi~~g~~~-~~l 459 (648) T protein:vir:79 388 MVKEILMEGGFDPVLNP-DDKVEFRFN-EI-DMDSKIKLENQAVFLYEHNAISEDEMRELI---GRD-PVDDGEGR-AKM 459 (648) T ss_pred HHHHHhhhhhccccccc-cceEEEeec-cc-chhhHHHHHHHHHHHHhCCCcCHHHHHHHh---CCC-CCCCCCCc-ccc Confidence 1 1 1 1222221 112333332 11 111123445667888899999999888765 442 22111100 011 Q ss_pred Hhhc---------------ccCCCC Q lcl|NC_020844. 446 ISEL---------------PGGIEE 455 (455) Q Consensus 446 ~~e~---------------~~~l~~ 455 (455) ..+. +.+-+. T Consensus 460 ~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (648) T protein:vir:79 460 HLQMVTIAQATALAALAPTPAGGSS 484 (648) T ss_pred ccccccchhccccccCCCCCCCCCC Confidence 1110 000000 No 165 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=94.81 E-value=0.0034 Score=34.09 Aligned_cols=388 Identities=11% Similarity=0.008 Sum_probs=168.4 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHH--hhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRR--NFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~--~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |+..+-.+ |..+..... ...+.. .++-|-|..+ ...+.+-+. -..+....|+.++..+-+ T Consensus 6 ~~~~~~~~-----------~~~~~~~~~-~~~~~~~~~~~~~~pp~~---~~~La~~~~---~n~~v~scI~~ia~~ia~ 67 (540) T protein:vir:41 6 LSIKSLEK-----------YRAIKGDTD-SQALKEDRFEEYVEPKVH---PLVLLSLLQ---VNPYHASACSIKANDILR 67 (540) T ss_pred cChhhccc-----------hhhhhcccc-ccccccCCCCccccCCCC---HHHHHHHHH---hcHHHHHHHHHHHHHHhc Confidence 66433332 222222111 111111 1233444332 233322221 235668888888888888 Q ss_pred CCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhcc Q lcl|NC_020844. 79 REVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILE 158 (455) Q Consensus 79 k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~ 158 (455) -+..+......+..++.|.. .+..+|.+.+....+.+|-+|+++-++..|.+. -+..++|..|- T Consensus 68 ~~~~i~~~~~~~~~~lpN~~---~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~------------~L~~i~~~~V~- 131 (540) T protein:vir:41 68 TGYLIDGDDGGVEELLRACR---PSFEFILLQALEDLQVFNYCTLEVVRDDQGEPV------------RLDYIPAHTVR- 131 (540) T ss_pred CCceEecCccchhhhccCCC---CCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEE------------EEEEeCCcceE- Confidence 88877655556777777654 468899999999999999999999776655432 24555565552 Q ss_pred ceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEe-ccCC--CCCCceeEEEEEecccCCCcccCcchH Q lcl|NC_020844. 159 IQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVD-EFGR--ITIGVVPMVPFITGRRKGKKWVFHPPM 235 (455) Q Consensus 159 w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~-~~g~--~~l~~vP~V~f~~~~~~~~~~~~~~pL 235 (455) ....+.. +.....+...+|....+..-... ..+. ..+..=-++.|.... ..+...+.+|+ T Consensus 132 --v~~~~~~--------------~~~~~d~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~eViHir~~~-~~~~~~G~Spi 194 (540) T protein:vir:41 132 --VHRDGSR--------------YMQTWDGIHVTYFKDYRYEGEVNPDNGEDQDGVGANEIIFIHLPS-PICSYYGVPRY 194 (540) T ss_pred --EeEcCce--------------eEeeecCceeeeeecccccceeeccccccceeecccceEEecCCC-CCCCcccccHH Confidence 1111111 11111121222211111000000 0111 011111133333322 22445688898 Q ss_pred HHHHHHHHHHHHhHHHHHHH-HHHcCCceEEEe--cC-CCcccc----------ccC----cc--ceeeccceeeeccCC Q lcl|NC_020844. 236 KDALDLQVALFRKECALEFA-EMMTAFPMLSAN--GV-EPDTDS----------EGS----PK--PLDTGPDTVLYAPPN 295 (455) Q Consensus 236 ~dla~l~~~hy~~~sd~~~~-l~~~~~p~~~~~--G~-~~~~~~----------~~~----~~--~~~~g~~~~~~lp~~ 295 (455) ..++.. +..-....++... ....+.|-.+++ |. .++... ..+ .+ ...-.++.++.|... T Consensus 195 ~~~~~~-i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~ 273 (540) T protein:vir:41 195 LSAAPS-ILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIP 273 (540) T ss_pred HHHHHH-HHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecC Confidence 866553 3333333444333 345667877665 22 111100 000 00 001112334444321 Q ss_pred CCccccceeEEeeC--chhHHHHHHHHHHHHHHHHHh---hhhhcc---CCCcchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 296 AEGQHGSWNFVEPA--TESLKFLAEQIESVSKEIREL---GRQPLT---AQSGNLTRISAAQAASKGNSAVQQWAAGLKD 367 (455) Q Consensus 296 ~~~~~~~~~~le~~--~~~l~~~~~~l~~~~~~m~~~---g~~~l~---~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~ 367 (455) . +....++|..+. ....+ +.+..+...++|+.+ ...++. .++.+.|..+ .....-....|.-+...++. T Consensus 274 ~-~~~~g~~~~pl~~~~~d~q-fle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~e-q~~~~f~~~tL~P~~~~ie~ 350 (540) T protein:vir:41 274 G-GDTVEVTFTPLNTSQKELS-FREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAE-VARRTYYESVVRPQQEIVSS 350 (540) T ss_pred C-CcccceeEEecccchhHHH-HHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHH-HHHHHHHHHHHHHHHHHHHH Confidence 1 112245555443 33322 334444455555443 233331 1222222211 22222334567777777888 Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCH--------- Q lcl|NC_020844. 368 ALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDP--------- 438 (455) Q Consensus 368 a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~--------- 438 (455) ++++.|. .. . +.+..|+++.+= .+.......+.++.+.|+++.-..++.| .++-+ ..++ T Consensus 351 ~ln~~L~--~~-~---~~~~~i~f~~~~---ll~~D~~~~~~~lv~~G~lT~NE~Re~L--~g~e~-gdd~~l~p~n~~~ 418 (540) T protein:vir:41 351 VLTDFIQ--LK-L---DPGARFVFNEEI---LMESEFVHNYALLVQCGVLTPSEVREKL--FGLDG-GPDMFMVPSSIGK 418 (540) T ss_pred HHHHhhh--hc-c---CCceEEEecchh---hcchHHHHHHHHHHhCCCCCHHHHHHHh--CcCcC-CCccccccccccc Confidence 8876442 11 1 223455544311 1223345556678899999998888765 23321 1110 Q ss_pred HH--------------HHHHHHhhccc----CCCC Q lcl|NC_020844. 439 EE--------------ESERLISELPG----GIEE 455 (455) Q Consensus 439 e~--------------e~~ri~~e~~~----~l~~ 455 (455) .+ +.+++.++.+. .+++ T Consensus 419 ~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~ 453 (540) T protein:vir:41 419 SAMKRQKRNYEKNQINEIKRTYAKYKPRIQEIISS 453 (540) T ss_pred ccccccccccCCCCccccccccchhcccccCcccc Confidence 00 00000000000 0000 No 166 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=94.67 E-value=0.0038 Score=33.86 Aligned_cols=364 Identities=14% Similarity=0.077 Sum_probs=153.6 Q ss_pred CCCCCC-CccCHHHHHHHHHHH--HHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhh Q lcl|NC_020844. 1 MPEQDP-TKRSADAEAMSDYLQ--TVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPF 77 (455) Q Consensus 1 m~~~~v-~~~~p~y~~~~~~w~--~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf 77 (455) |--=+. .....+.......|. .+....+|.. ..|+- . +..++ ...+..+|+.+++-+- T Consensus 1 Mg~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~------~~~v~------~---~~~l~----~~~v~~~i~~ia~~ia 61 (383) T protein:vir:10 1 MGLLTPKNFSKRNAKNMVYPSNPAFFTTTVGGMQ------LSYVS------A---LSALQ----NTNVYSVINRIASDVS 61 (383) T ss_pred CCcccccccccccccccccccchhhhhhhccCcc------ccccc------h---hHhhc----chHHHHHHHHHHHhhc Confidence 442110 011111111111110 0111111110 00100 0 11222 2345556777777666 Q ss_pred cCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhc Q lcl|NC_020844. 78 GREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAIL 157 (455) Q Consensus 78 ~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii 157 (455) +-|..+.. .....++...+ ...+-.+|.+.+....+.+|-+|++++... +..+++..+ T Consensus 62 ~~~~~~~~--~~~~~ll~~PN-~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~~------------------~~~~p~~~~- 119 (383) T protein:vir:10 62 SAHFKTEN--TATLNRLESPS-SLIGRFSFWQGALMQLCLSGNDYIPLVGQN------------------LEHIPNSDV- 119 (383) T ss_pred cCceeecc--cchhhhhhCCC-CCCCHHHHHHHHHHHhhhcCCeEEEEEcCc------------------eeEeecCcc- Confidence 66666643 23455666554 356889999999999999999999986221 111111111 Q ss_pred cceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEe-cccCCCcccCcchHH Q lcl|NC_020844. 158 EIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFIT-GRRKGKKWVFHPPMK 236 (455) Q Consensus 158 ~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~-~~~~~~~~~~~~pL~ 236 (455) + +.+... ..+.+..+....++...... .-. ++-|+. +........+.+|+. T Consensus 120 --~------------v~~~~~-------~~~~~~~~~~~~~~~~~~~~----~~e---vih~r~~~~~~~~~~~G~s~l~ 171 (383) T protein:vir:10 120 --Q------------INYLPG-------NMGIVYTVLESNDRPKMVLR----QDQ---MLHFRLMPDPQYRYLIGRSPLE 171 (383) T ss_pred --e------------EEEEEc-------CCceEEEEEEcCCceEEEEc----ccc---eEEeccCCCCcccccccccHHH Confidence 0 000000 00011111111112111111 111 222322 121122345778887 Q ss_pred HHHHHHHHHHHhHHHHHHHHHHcCCceEEEec---CCCccccc-cCc--cceeec--cceeeeccCCCCccccceeEEee Q lcl|NC_020844. 237 DALDLQVALFRKECALEFAEMMTAFPMLSANG---VEPDTDSE-GSP--KPLDTG--PDTVLYAPPNAEGQHGSWNFVEP 308 (455) Q Consensus 237 dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G---~~~~~~~~-~~~--~~~~~g--~~~~~~lp~~~~~~~~~~~~le~ 308 (455) -+......+.....-........+.|-.+++- +..++..+ .+. +...-| ++.++.++. +.++.-++. T Consensus 172 ~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~-----g~~~~~l~~ 246 (383) T protein:vir:10 172 SLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLMVLPD-----GFDYTQLEM 246 (383) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCccccCC-----CceEEecCC Confidence 66554333333222233444555677766662 21111111 100 011112 234555653 234555555 Q ss_pred CchhHHHHHHHHHHHHHHHHHh---hhhhccCC-Ccc---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_020844. 309 ATESLKFLAEQIESVSKEIREL---GRQPLTAQ-SGN---LTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMD 381 (455) Q Consensus 309 ~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~-~~~---~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g 381 (455) ++..++.+.+..+...++|+.+ ...++... .++ .+..+.... -...|.-++..+++.++..| .+ T Consensus 247 ~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~~~~---~~~~l~P~~~~ie~~l~~~l------~~ 317 (383) T protein:vir:10 247 KTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKAT---YLANLNSYVNPIVDELRLKM------NA 317 (383) T ss_pred ChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHHHHHH---HHHHHHHHHHHHHHHHHHhh------CC Confidence 5555544445555556666553 22333321 111 122222211 12235555555655555432 22 Q ss_pred CCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 382 APDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 382 ~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) ..++++.+-.........++++.++.+.|+++...+++.+..-++.+.+.. +. .-.....++|=+| T Consensus 318 -----~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~-~~--~~~~~~~~gGd~e 383 (383) T protein:vir:10 318 -----PDLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLP-EF--KPLTNETKGGDDK 383 (383) T ss_pred -----ceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcccCCccc-cc--CCCcccCCCCCCC Confidence 124444322233334566788999999999999999988765555433321 11 1111222233333 No 167 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=93.46 E-value=0.0075 Score=32.24 Aligned_cols=379 Identities=11% Similarity=-0.039 Sum_probs=153.2 Q ss_pred CCCCC-CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHH-HHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQD-PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRY-DLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~-v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y-~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |==.+ =....|+-.+..+ -+++.+.|.-+. | .+--. .-.-+ ..+.+. +.+...|+-++.-+-+ T Consensus 1 ~~~~~~~~~~~p~~~~~~~---~~~~~~~~~~~~---g---~~~~~--~~~~~~~~~~~~----~~V~acV~~IA~~iA~ 65 (518) T protein:vir:78 1 MLLANGQTLSAPAMAELSP---QMQDSYYYAPAV---G---MQLER--QFSLYGGIYKNQ----PWVRTVIAKRAQALAR 65 (518) T ss_pred CcccCceeeccchhhhhhh---hhhhccccccee---c---eeccc--ccchhhHHhhhh----HHHHHHHHHHHHhhcc Confidence 21100 0111222222222 155555542110 0 00000 00011 112222 2334444444444433 Q ss_pred CCcee---------ccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEE Q lcl|NC_020844. 79 REVEV---------TEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWS 149 (455) Q Consensus 79 k~p~i---------~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~ 149 (455) -|..+ ...+..+..|+..-. ...+-.+|.+.+....+.+|-+|+++..+..+.+. -+. T Consensus 66 lp~~l~~~~~~~~~~~~~~~~~~Ll~~PN-~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~------------~L~ 132 (518) T protein:vir:78 66 LPVKCMFTSGDTETEEHDTGYAKLLADPC-EYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPE------------KLM 132 (518) T ss_pred CceEEEEEcCCccccccchHHHHHHhCCC-CCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEE------------EEE Confidence 33222 112334555666554 34677899999999999999999999765544321 144 Q ss_pred EechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeC--CcEEEEeccCCCCCCceeEEEEEecccCCC Q lcl|NC_020844. 150 QVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKND--AGIWEVDEFGRITIGVVPMVPFITGRRKGK 227 (455) Q Consensus 150 ~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~--~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~ 227 (455) .++|..|.-+.... ++.. +..|.... ++..... +-+. |+.|..... .+ T Consensus 133 ~l~p~~Vtv~~~~~-~~~~---------------------~y~~~~~~~~~~~~~~~-----~~~e--IiHir~~~~-dg 182 (518) T protein:vir:78 133 PMHPSRVAIKRNSR-TGRY---------------------EYYFQAGAGVGTQLVSF-----ADDE--VVPIRFFNP-DG 182 (518) T ss_pred EECCCceEEEEcCC-CCEE---------------------EEEEEecCCccceeEEe-----cCCc--EEEecCCCC-Cc Confidence 44554442211110 0000 00000000 0010000 0111 122222111 23 Q ss_pred cccCcchHHHHHHHHHHHHHhHHHHH-HHHHHcCCceEEEecCCCcccc---ccCc--cceeec---cceeeeccCCCCc Q lcl|NC_020844. 228 KWVFHPPMKDALDLQVALFRKECALE-FAEMMTAFPMLSANGVEPDTDS---EGSP--KPLDTG---PDTVLYAPPNAEG 298 (455) Q Consensus 228 ~~~~~~pL~dla~l~~~hy~~~sd~~-~~l~~~~~p~~~~~G~~~~~~~---~~~~--~~~~~g---~~~~~~lp~~~~~ 298 (455) ...+.+|+.-++.. +.......++. ......+.|-.+++.-..-.++ ..+. +...-| ++.++.|+. T Consensus 183 ~~~G~Spi~~~~~~-i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~ls~e~~~~~k~~~~~~~~G~~nag~~~vL~~---- 257 (518) T protein:vir:78 183 LERGLSLMESLKST-IFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSPEAQQRLREQFDRAHAGSSNTGKTMVVEE---- 257 (518) T ss_pred ccccccHHHHHHHH-HHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhcCcccCCceeEcCC---- Confidence 34577887766553 34444444443 3345567787777632111111 1110 011122 234566653 Q ss_pred cccceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCC-CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 299 QHGSWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQ-SGNLTRISAAQAASKGNSAVQQWAAGLKDALENALY 374 (455) Q Consensus 299 ~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~-~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~ 374 (455) +.++.-++.++...+ +.+..+-...+|+.+ ...++.-. .++-|..+ .....-....|.-+...++.+++..|- T Consensus 258 -G~~~~~l~~~~~d~q-~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e-~~~~~f~~~tL~P~~~~ie~eln~~L~ 334 (518) T protein:vir:78 258 -GMEPIPLQLTAVEMQ-FIEARQLNREEVCGVYDIAPPIVHILDRATFSNIS-AQMRAFYRDTMAIPIARIQSAMDKYVG 334 (518) T ss_pred -CceEEeccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 223443444433322 233333344444332 22223211 12222111 111222344566677777777765441 Q ss_pred HHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCH-HHH------HHHHHh Q lcl|NC_020844. 375 ITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDP-EEE------SERLIS 447 (455) Q Consensus 375 ~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~-e~e------~~ri~~ 447 (455) .. .+ .+..++++.+-..+......++++.++++.|+++.-.+++.+ |+-+ .-++ -++ +..+.. T Consensus 335 --~~-~~---~~~~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~---gl~p-ie~~~gD~~~v~~n~~pl~~ 404 (518) T protein:vir:78 335 --QY-WV---RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIM---GLPR-SDDPKADELYANSALQPLGA 404 (518) T ss_pred --cc-cc---CcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCC-CCCCCCceeeecccceeccc Confidence 11 11 234455554333333345668889999999999998888766 3322 1110 000 001100 Q ss_pred hcc-------------------cCCCC Q lcl|NC_020844. 448 ELP-------------------GGIEE 455 (455) Q Consensus 448 e~~-------------------~~l~~ 455 (455) ... +..++ T Consensus 405 ~~~~~~~g~~~~~~~~~~~~~~~~~~~ 431 (518) T protein:vir:78 405 TPDGAVEGEEAPAPKRPASTPVASLDQ 431 (518) T ss_pred ccccccCCCCCCCCCCCCccccccccc Confidence 000 00000 No 168 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=93.23 E-value=0.0083 Score=31.99 Aligned_cols=363 Identities=12% Similarity=0.114 Sum_probs=133.5 Q ss_pred HHhhhhhcCCceeccCCHHHHHHHHhhccC-----CC--------CHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhh Q lcl|NC_020844. 71 GLASKPFGREVEVTEGTGPSEDLLEDIDGR-----GN--------NLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNRE 137 (455) Q Consensus 71 ~~~g~lf~k~p~i~~~~~~l~~~~~d~d~~-----G~--------~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~a 137 (455) -+.+.+|+|.-...........++.+..+. |. ....+.+-+-.-|-..+.+-+.|+....+...... T Consensus 1 m~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~Ia~~ia~l~~~~~~~~~~~~~~~~ 80 (416) T protein:vir:12 1 MLLERMFEKRSGSSDHEDGFNNILLNMFGGRKTASGERVSESNSLVQPDIFACVNVLSDDIAKLPIHTYKRTDGGIERKP 80 (416) T ss_pred CccchhcccccCccccCccchhHHHHhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcccccc Confidence 222233322211111111111122111111 11 11222233333444445555555433222111110 Q ss_pred h---HHhccCCc--eEEEechhhhccceeeccCCeeEEEEEEEEe--eeeEEEEEecCeEEEEEEeCCcE-EEEec-cCC Q lcl|NC_020844. 138 E---ERQAGVRP--YWSQVNHSAILEIQSERQGAGERITYMRMWE--DAETILVLRPDDWERWRKNDAGI-WEVDE-FGR 208 (455) Q Consensus 138 d---~~~~~~rP--y~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E--~~e~~~v~~~~~~~~~~~~~~g~-~~~~~-~g~ 208 (455) + ......|| +.+...=-+.+-+..--.|.-. .++.... .+.....+.++.++++....++. +..+. .|. T Consensus 81 ~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~--~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~g~ 158 (416) T protein:vir:12 81 EHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAY--SYIQFGSHGYPEALFPLRPDYTNAYVHPTTGMLWYQTVLNGK 158 (416) T ss_pred ccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeE--EEEEECCCCcEEEEEEECCcceEEEEeCCCcEEEEEEecCCe Confidence 0 01112233 2222211111111111122211 1111111 13344555666666665444443 22221 111 Q ss_pred C-CCCceeEEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHH-HHHHHcCCceEEEecCCCcccccc----Cc--c Q lcl|NC_020844. 209 I-TIGVVPMVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALE-FAEMMTAFPMLSANGVEPDTDSEG----SP--K 280 (455) Q Consensus 209 ~-~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~-~~l~~~~~p~~~~~G~~~~~~~~~----~~--~ 280 (455) . .+..=-++.|..... +...+.+|+.-++.. +........+. ......+.|-.+++- .....++. .. + T Consensus 159 ~~~~~~~eiih~~~~~~--~~~~G~s~i~~~~~~-i~~~~~~~~~~~~~~~ng~~p~~il~~-~~~~~~e~~~~~~~~~~ 234 (416) T protein:vir:12 159 AIELYDYEVLHFKGLST--DGIHGKSPIGVVREH-IGAQAAATKYNAKLYKNEATPRGILKV-PAFLDEKPKENVRKEWK 234 (416) T ss_pred EEEecCccEEEecCcCC--CCcccccHHHHHHHH-HHHHHHHHHHHHHHHhcCCCCceEEec-CCCCCHHHHHHHHHHHH Confidence 0 111111233332222 335678887766653 33334344443 344556778777762 11111111 10 0 Q ss_pred ceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHH---hhhhhccCC-Ccchh-HHHHHHHHHHHH Q lcl|NC_020844. 281 PLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRE---LGRQPLTAQ-SGNLT-RISAAQAASKGN 355 (455) Q Consensus 281 ~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~---~g~~~l~~~-~~~~s-a~a~~~~~~~~~ 355 (455) .. .+.+.+..+|.+ .++.-++.++...+. .+..+-...+|+. +...++... .++-+ .++.. ..-.. T Consensus 235 ~~-~~~~~~~vl~~g-----~~~~~l~~~~~d~q~-~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~--~~f~~ 305 (416) T protein:vir:12 235 RV-NKVENIAIIDYG-----LEYQSISMPLQEAQF-VESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQS--IEYVR 305 (416) T ss_pred HH-hcCCCeeecCCC-----ceEEEccCChhhHHH-HHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHH--HHHHH Confidence 11 234556666632 345445544444332 2333444444433 223334221 12211 12221 11234 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCC-- Q lcl|NC_020844. 356 SAVQQWAAGLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELS-- 433 (455) Q Consensus 356 s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~-- 433 (455) ..|.-++..++++++.-|- .... ...+..|+++.+=.........++++.++...|+++.-.+++.+ |+-| T Consensus 306 ~~l~P~~~~ie~~l~~~l~--~~~~--~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~---gl~Pi~ 378 (416) T protein:vir:12 306 NTLQPWIVNFEQELNVKLF--LDHD--QKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELL---ERNPIE 378 (416) T ss_pred HHHHHHHHHHHHHHHHhhc--Cchh--hcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCCCC Confidence 4555666666666554221 1110 11234455553222223335567788889999999999888866 3322 Q ss_pred CC--------CCH---HHHHHHHHhhcc----cCCCC Q lcl|NC_020844. 434 DN--------FDP---EEESERLISELP----GGIEE 455 (455) Q Consensus 434 ~~--------~d~---e~e~~ri~~e~~----~~l~~ 455 (455) +. ..+ .++.+..++... ..-+| T Consensus 379 ggd~~~~~~n~~~~~~~~~~~~~~~~~~~~gge~~~~ 415 (416) T protein:vir:12 379 NGDKYISSLNYVFLDFLEEYQRLKAGGAMKGGDNKNE 415 (416) T ss_pred CcceeeeccccccccccchhhccccccccCCCCCcCC Confidence 11 000 111111111110 01111 No 169 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=92.80 E-value=0.0099 Score=31.57 Aligned_cols=365 Identities=11% Similarity=-0.012 Sum_probs=148.0 Q ss_pred HHHHhhccccCChHHHHHHHHhhhhhcCCceec---------cCCHHH---HHHHHhhcc---------CCCCHHHHHHH Q lcl|NC_020844. 52 YDLRKSVAKMTNVFRDVLEGLASKPFGREVEVT---------EGTGPS---EDLLEDIDG---------RGNNLTTFAGQ 110 (455) Q Consensus 52 Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~p~i~---------~~~~~l---~~~~~d~d~---------~G~~l~~~~~~ 110 (455) .+..++ -.+++...|+.++..+-+-|..+. ...... ..++....- ...++..|++. T Consensus 1 l~~l~~---~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~ 77 (467) T protein:vir:31 1 MAELLE---HNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQT 77 (467) T ss_pred Chhhhh---cCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHH Confidence 112222 246677777777777766665441 011111 222221110 01256788899 Q ss_pred HHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeE Q lcl|NC_020844. 111 LFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDW 190 (455) Q Consensus 111 ~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~ 190 (455) +....+.+|-+|+++-++..+.+. -+..++|..|.-.+. +...+ .+... ..++....+.. T Consensus 78 ~~~~l~l~Gn~~i~~~r~~~G~~~------------~l~~l~~~~v~~~~d----~~~~~---~~~~~-~~~~~~~~~~~ 137 (467) T protein:vir:31 78 AWTDYEAIGWLTIEILTQTDGTPT------------GLAYVPGHTIRKRMD----ERGFV---QLLEE-KEKYFGVAGDR 137 (467) T ss_pred HHHHHHhcCCeEEEEEECCCCcEE------------EEEEeCCceeEeeee----cceeE---eecCC-ceeeEEecccc Confidence 999999999999988765544322 255555655532111 00000 00000 00000000000 Q ss_pred EEEEEeCCcE--EE--EeccCCCCCC-cee---EEEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHH-HHHcCC Q lcl|NC_020844. 191 ERWRKNDAGI--WE--VDEFGRITIG-VVP---MVPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFA-EMMTAF 261 (455) Q Consensus 191 ~~~~~~~~g~--~~--~~~~g~~~l~-~vP---~V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~-l~~~~~ 261 (455) .. ....+. .. ....+..... .+| ++.|+... ..+...+.+|+.-+.... ........+... ....+. T Consensus 138 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~diih~r~~~-~~~~~~G~s~~~~~~~~i-~~~~~~~~~~~~~f~ng~~ 213 (467) T protein:vir:31 138 YQ--TNGNGDLDPVFVDADDGSTGTSVSNPANELIFKRNHS-PLYPHYGAPDIIPAVKTI-RGDSAAQDYNIDFFENDGV 213 (467) T ss_pred ce--eecccceeeeeeeeccccccceeEeccccEEEecCCC-CCCCcccccHHHHHHHHH-HHHHHHHHHHHHHHhccCC Confidence 00 011110 00 0111111111 011 23333222 224446888888766643 333334444443 345667 Q ss_pred ceEEEe--cCC--CccccccC-------ccce---e------eccceeeeccCCCCccccceeEEeeCch-h-HHHHHHH Q lcl|NC_020844. 262 PMLSAN--GVE--PDTDSEGS-------PKPL---D------TGPDTVLYAPPNAEGQHGSWNFVEPATE-S-LKFLAEQ 319 (455) Q Consensus 262 p~~~~~--G~~--~~~~~~~~-------~~~~---~------~g~~~~~~lp~~~~~~~~~~~~le~~~~-~-l~~~~~~ 319 (455) |-.++. |-. .+.....+ .+.. . -..+....++.+.......++|...+.. . -..+.+. T Consensus 214 p~gil~~~~~~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~ 293 (467) T protein:vir:31 214 PRIAIIVKGAELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEF 293 (467) T ss_pred CceEEEecCcCCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHH Confidence 766653 422 11110000 0000 0 0112334444333222223333332211 1 1122333 Q ss_pred HHHHHHHHHH---hhhhhcc-CCCcch-hH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEec Q lcl|NC_020844. 320 IESVSKEIRE---LGRQPLT-AQSGNL-TR-ISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDTNPEADVYT 393 (455) Q Consensus 320 l~~~~~~m~~---~g~~~l~-~~~~~~-sa-~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~ 393 (455) .+...++|+. +...++. ..+++. |. ++... .-....|.-+...++++++..| +..+.+. .+..++++. T Consensus 294 ~~~~~~~Ia~~fgVpp~~lG~~~~~~~~s~~e~~~~--~f~~~~l~P~~~~ie~~ln~~l--~~~~~~~--~~~~i~f~~ 367 (467) T protein:vir:31 294 RGRNEHDILKVHDVPPVIAGVVESGAFSTDAEEQRK--EFAEETIQPKQHDFGELLYELV--HKQGLDA--PDWTIEFEL 367 (467) T ss_pred HHHHHHHHHHHhCCCHHHcccCCCCCcccCHHHHHH--HHHHHHHHHHHHHHHHHHHHhh--cchhhcc--CCceEEEec Confidence 4444444544 3333331 112221 22 22222 2234456667777777777643 2233322 234455554 Q ss_pred ccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHH------------------------------- Q lcl|NC_020844. 394 DFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEES------------------------------- 442 (455) Q Consensus 394 df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~------------------------------- 442 (455) +-.........++++..+++.|++|.-.+++.+ |+ +|. +++++ T Consensus 368 ~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~---Gl-~pi--~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (467) T protein:vir:31 368 AKPDTKLQDVEIASQRVQAMQGLLTVNELRDEF---GF-EPF--PEEHVYGGETLVAEVTGGSGPGGGIGDQIEQLVEDR 441 (467) T ss_pred chhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CC-CCC--CcccccCCcccccccccccCCCCcccCcCCCCCCCc Confidence 333333345567788889999999998888765 33 221 11111 Q ss_pred -----HHHHhhcc-cCCCC Q lcl|NC_020844. 443 -----ERLISELP-GGIEE 455 (455) Q Consensus 443 -----~ri~~e~~-~~l~~ 455 (455) +-++++.+ +++-| T Consensus 442 ~~~~~~~~~~~~~~~~~~~ 460 (467) T protein:vir:31 442 ADEIIDSYQADLETEQLIE 460 (467) T ss_pred ccchHhhhhhccccchhhh Confidence 10000000 00000 No 170 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=91.73 E-value=0.014 Score=30.67 Aligned_cols=381 Identities=9% Similarity=0.003 Sum_probs=150.8 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhh-h---------cCCCCCCCCHH--HHHHHhhccccCChHHHH Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFK-R---------YVKQFPHEQNK--RYDLRKSVAKMTNVFRDV 68 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~-~---------yLpk~~~e~~~--~Y~~rl~ra~~~n~~~~~ 68 (455) ||++.. ...|.++++.......+...+. . .+.-.+..... .....++.+.++-....+ T Consensus 1 ~~~~~~----------mg~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~I 70 (432) T protein:vir:81 1 MPDEKK----------LGLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLV 70 (432) T ss_pred CCchhh----------cchhhhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHHH Confidence 885421 3344444443332222111000 0 00000000000 012233333333334444 Q ss_pred HHHHhhhhhc---CCc--eeccCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhc Q lcl|NC_020844. 69 LEGLASKPFG---REV--EVTEGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQA 142 (455) Q Consensus 69 v~~~~g~lf~---k~p--~i~~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~ 142 (455) -+.++...|. +.. ........+-.++. .-+ ...+-.+|.+.+....+.+|-+|+++.... +.+. T Consensus 71 a~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN-~~~t~~~f~~~l~~~lll~Gnayv~i~~~~-g~~~-------- 140 (432) T protein:vir:81 71 SQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPN-STQTAFDFWQVVVTRLLLDGTAYVRKVVTD-GRIE-------- 140 (432) T ss_pred HHhhhhCceeeEEecCCcceecccchHHHHHHhccc-ccCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcEE-------- Confidence 4444444332 110 00111223444442 222 235678899999999999999999986532 2211 Q ss_pred cCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEE-eCCcEEEEeccCCCCCCceeEEEEEe Q lcl|NC_020844. 143 GVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRK-NDAGIWEVDEFGRITIGVVPMVPFIT 221 (455) Q Consensus 143 ~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~-~~~g~~~~~~~g~~~l~~vP~V~f~~ 221 (455) -+..++|..|.-+.. . +|. ..|+. ..+|...... .-..+.+ .. T Consensus 141 ----~L~~l~~~~v~v~~~-~-~g~-----------------------~~y~~~~~~g~~~~~~----~~~iih~---r~ 184 (432) T protein:vir:81 141 ----SLQYLANDRLTITTD-P-KGN-----------------------TAYRYRRTDGQMIDIP----KQQIWKI---MG 184 (432) T ss_pred ----EEEEEcCCceEEEEC-C-CCc-----------------------EEEEEEecCceEEEEc----cccEEEe---cC Confidence 234445544422111 0 110 01111 0111111110 0112222 22 Q ss_pred cccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHH-HHcCCceEEEecCCCccccccCc---ccee--eccceeeeccCC Q lcl|NC_020844. 222 GRRKGKKWVFHPPMKDALDLQVALFRKECALEFAE-MMTAFPMLSANGVEPDTDSEGSP---KPLD--TGPDTVLYAPPN 295 (455) Q Consensus 222 ~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l-~~~~~p~~~~~G~~~~~~~~~~~---~~~~--~g~~~~~~lp~~ 295 (455) .+. +...+.+||.-++.. +.......++...+ ...+.|-.++.- ...-.++..+ +.+. ..++.++.||.+ T Consensus 185 ~~~--dg~~G~spi~~~~~~-i~~~~~~~~~~~~~f~ng~~~~gil~~-~~~l~~e~~~~~~~~~~~~~nag~~~vl~~g 260 (432) T protein:vir:81 185 YSL--DGENGLSAIRYGAQI-FGTAIAAEAQAARAFRNGQLQSVYYQI-DRFLTDDQYDSFAKKVSGSVEAGRAPLLEGG 260 (432) T ss_pred CCC--CCcccccHHHHHHHH-HHHHHHHHHHHHHHHhcCCCcceEEec-CCCCCHHHHHHHHHHHhhhhcCCCceecCCC Confidence 222 224578887765543 44444444444433 334566555552 2111111100 0111 123456667632 Q ss_pred CCccccceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCCCcchhHHHHHHHH---HHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 296 AEGQHGSWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQSGNLTRISAAQAA---SKGNSAVQQWAAGLKDAL 369 (455) Q Consensus 296 ~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~~~~~sa~a~~~~~---~~~~s~L~~~a~~~~~a~ 369 (455) .++.-++.++...+ +.+..+-...+|+.+ ...++.....+.++..+..+. .-....|.-++..++.++ T Consensus 261 -----~~~~~l~~~~~d~q-~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~f~~~tl~P~~~~ie~~l 334 (432) T protein:vir:81 261 -----MDVKSLGLNPVDAQ-LLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMTLSPWLRRIEQSI 334 (432) T ss_pred -----ceEEEccCCHHHHH-HHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 24444444444433 233344444555442 223332211112111111111 112345666666666666 Q ss_pred HHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCC--CCCCHH------HH Q lcl|NC_020844. 370 ENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELS--DNFDPE------EE 441 (455) Q Consensus 370 ~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~--~~~d~e------~e 441 (455) +.-|- .. ....+..|+++.+=..+....+.++++.++.+.|+++.-..++.+ |+-| .+.+.- .- T Consensus 335 ~~kLl--~~---~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~---glpp~~g~~~~~~~~~~~~p 406 (432) T protein:vir:81 335 ALNLL--SP---AERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIE---GLPKLGGNAAVLTVQSAMVP 406 (432) T ss_pred Hhhcc--Cc---cccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh---CCCCCCCCcceEeecCcccc Confidence 65221 11 011234455554322333335567788889999999999888876 3322 110000 00 Q ss_pred HHHHHhhcc---cC--CCC Q lcl|NC_020844. 442 SERLISELP---GG--IEE 455 (455) Q Consensus 442 ~~ri~~e~~---~~--l~~ 455 (455) ++.+.++.+ ++ -++ T Consensus 407 l~~~~~~~~~~~~~~~~n~ 425 (432) T protein:vir:81 407 LDSIGLQASPEPASGLGNQ 425 (432) T ss_pred hhhhccCCCCCCCCCCCCc Confidence 111211111 11 111 No 171 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=91.62 E-value=0.015 Score=30.59 Aligned_cols=391 Identities=11% Similarity=0.003 Sum_probs=163.0 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGRE 80 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~ 80 (455) |+..++..+-+.-..-. ..+-+-.... +..|-|. -+...+.+-+.. .++....|+.++..+-+-| T Consensus 6 ~~i~s~~~~~~i~~~~~-~s~~~~~~~~--------~~~~~pp---~~~~~la~l~~~---n~~v~scI~~ia~~IA~l~ 70 (542) T protein:vir:41 6 LSIRSLEKYKAIKREEV-ESQALGETRF--------EEYVEPK---VNPLVLLSLLQV---NPYHASACSIKANDIIRTG 70 (542) T ss_pred ccccccccchhhhhccc-cccccccccC--------CccccCC---CCHHHHHHHHhh---cHHHHHHHHHHHHHHhhCc Confidence 55555544333322211 1111100000 1122222 233444432222 3456788888888888888 Q ss_pred ceeccC-CHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhccc Q lcl|NC_020844. 81 VEVTEG-TGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILEI 159 (455) Q Consensus 81 p~i~~~-~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w 159 (455) ..+... ...+..++.|.+ .+..+|.+.+....+.+|-+|+++.++..+.+. -+..++|..|.- T Consensus 71 ~~~~~~~~~~l~~~lpN~~---~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~------------~L~~l~~~~v~v- 134 (542) T protein:vir:41 71 YILEGDDEGVVDEFIRACK---PSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPI------------RFEYIPSHTIRV- 134 (542) T ss_pred eeeecccchhhhhhcCCCC---CCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEE------------EEEEEcCcceEE- Confidence 777532 234556666643 578999999999999999999999876655432 144455554421 Q ss_pred eeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccC----CCCCCceeEEEEEecccCCCcccCcchH Q lcl|NC_020844. 160 QSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFG----RITIGVVPMVPFITGRRKGKKWVFHPPM 235 (455) Q Consensus 160 ~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g----~~~l~~vP~V~f~~~~~~~~~~~~~~pL 235 (455) ...++... .+......++ +..|..... + ....| ..+-.. ++.|.... ..+...+.+|+ T Consensus 135 --~~d~~~~~----~~~~~~~~~~------~~~y~~~~~--~-~~~~g~~~~~~~~~e--IiHir~~~-~~~~~~Glspi 196 (542) T protein:vir:41 135 --HKDGSRYR----QTWDGVNITH------FKDYRYEGE--I-NPETGEDQDSVGANE--LVFIHIPS-PVCSYYGVPRY 196 (542) T ss_pred --EEcCCeeE----eeecCCccee------EEeeccccc--c-cccccccccccCccc--EEEecCCC-CCCCcccccHH Confidence 11111000 0000000000 001100000 0 00000 011111 23333222 23444678888 Q ss_pred HHHHHHHHHHHHhHHHHHHH-HHHcCCceEEEe--cCCCccccc---------------cCcc--ceeeccceeeeccCC Q lcl|NC_020844. 236 KDALDLQVALFRKECALEFA-EMMTAFPMLSAN--GVEPDTDSE---------------GSPK--PLDTGPDTVLYAPPN 295 (455) Q Consensus 236 ~dla~l~~~hy~~~sd~~~~-l~~~~~p~~~~~--G~~~~~~~~---------------~~~~--~~~~g~~~~~~lp~~ 295 (455) ..++.. +..-.....+... ....+.|-.+++ |...+.... .+.. +..-..+.++.|+.. T Consensus 197 ~~~~~~-i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~ 275 (542) T protein:vir:41 197 VSAAPA-ILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIP 275 (542) T ss_pred HHHHHH-HHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeecc Confidence 776653 3333333444333 444566765553 322111000 0000 000011234444321 Q ss_pred CCccccceeEEeeCchhHH-HHHHHHHHHHHHHHHh---hhhhcc---CCCcchhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 296 AEGQHGSWNFVEPATESLK-FLAEQIESVSKEIREL---GRQPLT---AQSGNLTRISAAQAASKGNSAVQQWAAGLKDA 368 (455) Q Consensus 296 ~~~~~~~~~~le~~~~~l~-~~~~~l~~~~~~m~~~---g~~~l~---~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a 368 (455) . +..+.++|...+.++-. .+.+..+-..++|..+ ...++. .++.+.|..+ .....-....|.-+...++.+ T Consensus 276 ~-~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~E-q~~~~f~~~tL~P~~~~ie~~ 353 (542) T protein:vir:41 276 G-GDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAE-VTRRTYYESVVRPQQNIISSI 353 (542) T ss_pred C-CcccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHH-HHHHHHHHHHHHHHHHHHHHH Confidence 1 11234556554433222 2334444455555443 222231 1122222211 122223445566666777777 Q ss_pred HHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCH---------- Q lcl|NC_020844. 369 LENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDP---------- 438 (455) Q Consensus 369 ~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~---------- 438 (455) +++.|- -. ...+..++++.+- .+.....+++..+.+.|+++...++++|. ++- +..++ T Consensus 354 ln~~L~--~~----~~~~~~~~f~~~~---ll~~d~~~~~~~~v~~GilT~NE~Re~L~--g~~-pgdd~~l~p~~~~~~ 421 (542) T protein:vir:41 354 LTDFFQ--VK----FNPKTRFKFNDET---LLESDSVRNCALLVQSGVLTPAEARERLF--GLD-GGPDIFMVPSKGAAK 421 (542) T ss_pred HHhhcc--cc----cCCceEEEecchh---hcchHHHHHHHHHHhCCCCCHHHHHHhhC--CCC-CCCcccccccccccc Confidence 765331 11 1123445444211 12233455667788999999988887662 332 21111 Q ss_pred -------------HHHHHHHHhhcccCCCC Q lcl|NC_020844. 439 -------------EEESERLISELPGGIEE 455 (455) Q Consensus 439 -------------e~e~~ri~~e~~~~l~~ 455 (455) ..+.+...+....+.++ T Consensus 422 ~~~~~~~n~~~~~~~~~~k~~~k~~~~~~~ 451 (542) T protein:vir:41 422 SVKRQERNYEKNQIREIRKIYAKYRPRFNE 451 (542) T ss_pred ccccCCcCCCCCchhhhhhcccccCccccc Confidence 00111110000011111 No 172 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=91.19 E-value=0.017 Score=30.28 Aligned_cols=377 Identities=11% Similarity=-0.014 Sum_probs=150.2 Q ss_pred CCC-CCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHH-HHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPE-QDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRY-DLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~-~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y-~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) .++ +.++ .|+-.+..+. +++.+.|.... +. -+.. .-.-| ..+.+. +.+...|+-++.-+-+ T Consensus 3 ~~~~~~~~--~p~~~e~~~~---~~~~~~~~~~~---~~-~~~~----~~~~~~~~a~~~----~~V~acV~~IA~~iA~ 65 (518) T protein:vir:10 3 LANGQTLS--APAMAELSPQ---MQDSYYYAPAV---GM-QLER----QFSLYGGIYKNQ----PWVRTVIAKRAQALAR 65 (518) T ss_pred ccCceeec--Cchhhhhhhh---hhccccccccc---ce-eccc----ccchhhHHHhhh----HHHHHHHHHHHHhhcc Confidence 110 0111 1221111111 33444332110 00 0000 00111 112222 2333444444444432 Q ss_pred CCcee---------ccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEE Q lcl|NC_020844. 79 REVEV---------TEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWS 149 (455) Q Consensus 79 k~p~i---------~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~ 149 (455) -|..+ ......+..|+..-. ...+-.+|.+.+....+.+|-+|+++..+..+.+. -+. T Consensus 66 lpl~l~~~~~~~~~~~~~~~~~~Ll~~PN-~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~------------~L~ 132 (518) T protein:vir:10 66 LPVKCMFTSGDTETEESDTGYAKLLADPC-EYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPE------------KLM 132 (518) T ss_pred CceEEEEEcCCCceeccchHHHHHHcCCC-CCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE------------EEE Confidence 22222 112334556666554 35677899999999999999999999765544321 144 Q ss_pred EechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeC--CcEEEEeccCCCCCCceeEEEEEecccCCC Q lcl|NC_020844. 150 QVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKND--AGIWEVDEFGRITIGVVPMVPFITGRRKGK 227 (455) Q Consensus 150 ~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~--~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~ 227 (455) .++|..|.-..... ++.. +..|.... ++..... +-+. |+-|.+.. ..+ T Consensus 133 ~l~p~~v~v~~~~~-~~~~---------------------~y~~~~~~~~~~~~~~~-----~~~e--ViHir~~s-~dg 182 (518) T protein:vir:10 133 PMHPSRVAIKRNSR-TGRY---------------------EYYFQAGAGVGTQLVSF-----ADDE--VVPIRFFN-PDG 182 (518) T ss_pred EECCCceEEEEcCC-CCEE---------------------EEEEEecCCccceEEEe-----cCCc--EEEecCCC-CCc Confidence 44454442111000 0000 00000000 0000000 0011 12222222 223 Q ss_pred cccCcchHHHHHHHHHHHHHhHHHHHHH-HHHcCCceEEEecCCCcccc---ccCc--cceeecc---ceeeeccCCCCc Q lcl|NC_020844. 228 KWVFHPPMKDALDLQVALFRKECALEFA-EMMTAFPMLSANGVEPDTDS---EGSP--KPLDTGP---DTVLYAPPNAEG 298 (455) Q Consensus 228 ~~~~~~pL~dla~l~~~hy~~~sd~~~~-l~~~~~p~~~~~G~~~~~~~---~~~~--~~~~~g~---~~~~~lp~~~~~ 298 (455) ...+.+|+.-+.. .+.......++... ....+.|-.+++--..-+++ ..+. +...-|. +.+..|+. T Consensus 183 ~~~G~spi~~a~~-~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~k~~~~~~~~G~~nag~v~vL~~---- 257 (518) T protein:vir:10 183 LERGLSLMESLKS-TIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEE---- 257 (518) T ss_pred ccccccHHHHHHH-HHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhcCccccCcceEcCC---- Confidence 4457788765554 34444444444333 45556787777632211111 1110 0111222 23555653 Q ss_pred cccceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccC-CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 299 QHGSWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTA-QSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALY 374 (455) Q Consensus 299 ~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~-~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~ 374 (455) +.++.-++.++...+ +.+..+-..++|+.+ ...++.. ..++-+..+ .....-.+..|.-+...++.++++.|- T Consensus 258 -G~~~~~l~~s~~D~q-~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~e-q~~~~f~~~tL~P~l~~ie~~ln~~L~ 334 (518) T protein:vir:10 258 -GMEPIPLQLTAVEMQ-FIEARQLNREEVCGVYDIAPPIVHILDRATFSNIS-AQMRAFYRDTMAIPIARIQSAMDKYVG 334 (518) T ss_pred -CceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 223433444443332 233344444555432 2233321 112222211 111122334566667777777766441 Q ss_pred HHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCH-HHH------HHHHHh Q lcl|NC_020844. 375 ITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDP-EEE------SERLIS 447 (455) Q Consensus 375 ~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~-e~e------~~ri~~ 447 (455) .. .+ .+..++++.+-..+....+.+.++.++++.|++|.-.+++.+ |+-+ .-++ -++ +..|.. T Consensus 335 --~~-~~---~~~~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~---Gl~p-ie~~~gD~~~~~~n~~pl~~ 404 (518) T protein:vir:10 335 --QY-WV---RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIM---GLPR-SDDPKADELYANSALQPLGA 404 (518) T ss_pred --cc-cc---CCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCC-CCCCCCCeeeecccceeccc Confidence 11 11 234455554333333345667888899999999998887765 3322 1110 010 011110 Q ss_pred hcccC-------------------CCC Q lcl|NC_020844. 448 ELPGG-------------------IEE 455 (455) Q Consensus 448 e~~~~-------------------l~~ 455 (455) ..... .++ T Consensus 405 ~~~~~~~g~~~~~~~~~~~~~~~~~~~ 431 (518) T protein:vir:10 405 TPDGAVEGEEAPAPKRPASTPVASLDQ 431 (518) T ss_pred ccccccCCCCCCCCCCCCccccccccc Confidence 00000 000 No 173 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=90.84 E-value=0.019 Score=30.05 Aligned_cols=367 Identities=10% Similarity=0.019 Sum_probs=152.0 Q ss_pred CCCCCCC-ccCH-HHHHHHHHHHHHHHHhcCH-HHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhh Q lcl|NC_020844. 1 MPEQDPT-KRSA-DAEAMSDYLQTVDDVLEGV-TGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPF 77 (455) Q Consensus 1 m~~~~v~-~~~p-~y~~~~~~w~~i~d~~~G~-~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf 77 (455) |+=-+.- +..+ .-.+....|. ..|. ..+.. .+....++.-.. +.-++ .+.+...|+.+++-+- T Consensus 3 m~~f~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~----~~~~~~~~~v~~-~~al~----~~~v~~~i~~ia~~ia 68 (392) T protein:vir:39 3 LPILNFINQTNDPPEVGSVQSYF-----PDGNDAQIME----SLLGDNNEWVSA-RAALR----NSDLFSIILQLSSDLA 68 (392) T ss_pred chhhhhhhccccccccccccccc-----ccCchhhhhh----hhcCCCCceech-HHhhc----cHHHHHHHHHHHHhhc Confidence 3321110 0000 0000000000 0000 00000 000001110000 01111 2445556666666666 Q ss_pred cCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhc Q lcl|NC_020844. 78 GREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAIL 157 (455) Q Consensus 78 ~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii 157 (455) +-|..+... ....+++.-+ ...+-.+|.+.+....+.+|-+|+++.++..+.+. -+..++|..|- T Consensus 69 ~lp~~~~~~--~~~~l~~~PN-~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~------------~L~~l~~~~v~ 133 (392) T protein:vir:39 69 IVKINAEKK--KNQGIIDNPS-TNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADM------------KWEYLRPSQVN 133 (392) T ss_pred cCceeeccc--hhhhHhhcCC-CCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEE------------EEEEEcCceeE Confidence 666665432 2234555443 34678999999999999999999999766544321 13344444331 Q ss_pred cceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEE--eC-Cc-EEEEeccCCCCCCceeEEEEEecccCCCcccCcc Q lcl|NC_020844. 158 EIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRK--ND-AG-IWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHP 233 (455) Q Consensus 158 ~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~--~~-~g-~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~ 233 (455) -.. ...+....|+. .+ .+ ..... +-+. ++.|..... .+...+.+ T Consensus 134 ~~~------------------------~~~~~~~~y~~~~~~~~~~~~~~~-----~~~e--iih~~~~~~-~~~~~G~s 181 (392) T protein:vir:39 134 TYY------------------------FEYENGMYYNITFDDPKIEPILQA-----PQSD--LIHMKLLSI-DGGKTGIS 181 (392) T ss_pred EEE------------------------cCCCceEEEEEEecCcccceeEEE-----cccc--EEEecCCCC-CCcccccc Confidence 000 00010001111 11 11 00000 0111 222332222 24456888 Q ss_pred hHHHHHHHHHHHHHhHHHHHH-HHHHcCCceEEEe--cCCCccccccC----cc-ceeeccceeeeccCCCCccccceeE Q lcl|NC_020844. 234 PMKDALDLQVALFRKECALEF-AEMMTAFPMLSAN--GVEPDTDSEGS----PK-PLDTGPDTVLYAPPNAEGQHGSWNF 305 (455) Q Consensus 234 pL~dla~l~~~hy~~~sd~~~-~l~~~~~p~~~~~--G~~~~~~~~~~----~~-~~~~g~~~~~~lp~~~~~~~~~~~~ 305 (455) |+..+... +.......++.. .....+.|-.+++ +-. ...++.. .. .-...++.++.+|. ..+| T Consensus 182 ~i~~~~~~-i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~-~~~~~~~~~~~~~~~~~~~~g~~~vl~~-------g~~~ 252 (392) T protein:vir:39 182 PLYSLRRE-SKIQRASDRLTISSLNSSLNVPGVLTVKGGG-LLSDKDKASRSRSFMKRSRSGGPVVLDD-------LEEF 252 (392) T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHhccCCCceEEEeCCCC-CchHHHHHHHHHHHhccccCCCeeecCC-------CceE Confidence 88766653 333333344433 3445567776664 211 1111100 00 00112334556653 2445 Q ss_pred EeeC--chhHHHHHHHHHHHHHHHHHh-h--hhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020844. 306 VEPA--TESLKFLAEQIESVSKEIREL-G--RQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWM 380 (455) Q Consensus 306 le~~--~~~l~~~~~~l~~~~~~m~~~-g--~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~ 380 (455) .+++ +...+ +.+..+-..++|+.+ | ..++...+.+.+..+.. .+-....|.-++..++++++..| T Consensus 253 ~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~--~~f~~~~l~P~~~~ie~~l~~~L------- 322 (392) T protein:vir:39 253 TALEIKSNVAQ-LLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQI--SGMYASALNRYLRPAISELEYKL------- 322 (392) T ss_pred EEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHH--HHHHHHHHHHHHHHHHHHHHHhc------- Confidence 4444 33322 345555556666554 2 23332222222222211 11233445556666666665533 Q ss_pred CCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 381 DAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 381 g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) . + ...+.+..-|. .........+.++...|+++....++.+.+.|+.+.++-.. +-+.....+.-+| T Consensus 323 ~-~--~~~~d~~~~~~--~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~~---e~l~~~~~Gd~~~ 389 (392) T protein:vir:39 323 S-D--HISVNMRPAID--PLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAP---ENTNKKTTGQSNE 389 (392) T ss_pred c-c--cccccchhhhc--cCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccchh---cCCCCCCCCCCCC Confidence 1 1 11222222221 12234456777888999999999999999999876433211 1111111111222 No 174 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=90.84 E-value=0.019 Score=30.05 Aligned_cols=367 Identities=10% Similarity=0.019 Sum_probs=152.0 Q ss_pred CCCCCCC-ccCH-HHHHHHHHHHHHHHHhcCH-HHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhh Q lcl|NC_020844. 1 MPEQDPT-KRSA-DAEAMSDYLQTVDDVLEGV-TGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPF 77 (455) Q Consensus 1 m~~~~v~-~~~p-~y~~~~~~w~~i~d~~~G~-~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf 77 (455) |+=-+.- +..+ .-.+....|. ..|. ..+.. .+....++.-.. +.-++ .+.+...|+.+++-+- T Consensus 3 m~~f~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~----~~~~~~~~~v~~-~~al~----~~~v~~~i~~ia~~ia 68 (392) T protein:vir:10 3 LPILNFINQTNDPPEVGSVQSYF-----PDGNDAQIME----SLLGDNNEWVSA-RAALR----NSDLFSIILQLSSDLA 68 (392) T ss_pred chhhhhhhccccccccccccccc-----ccCchhhhhh----hhcCCCCceech-HHhhc----cHHHHHHHHHHHHhhc Confidence 3321110 0000 0000000000 0000 00000 000001110000 01111 2445556666666666 Q ss_pred cCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhc Q lcl|NC_020844. 78 GREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAIL 157 (455) Q Consensus 78 ~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii 157 (455) +-|..+... ....+++.-+ ...+-.+|.+.+....+.+|-+|+++.++..+.+. -+..++|..|- T Consensus 69 ~lp~~~~~~--~~~~l~~~PN-~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~------------~L~~l~~~~v~ 133 (392) T protein:vir:10 69 IVKINAEKK--KNQGIIDNPS-TNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADM------------KWEYLRPSQVN 133 (392) T ss_pred cCceeeccc--hhhhHhhcCC-CCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEE------------EEEEEcCceeE Confidence 666665432 2234555443 34678999999999999999999999766544321 13344444331 Q ss_pred cceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEE--eC-Cc-EEEEeccCCCCCCceeEEEEEecccCCCcccCcc Q lcl|NC_020844. 158 EIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRK--ND-AG-IWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHP 233 (455) Q Consensus 158 ~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~--~~-~g-~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~ 233 (455) -.. ...+....|+. .+ .+ ..... +-+. ++.|..... .+...+.+ T Consensus 134 ~~~------------------------~~~~~~~~y~~~~~~~~~~~~~~~-----~~~e--iih~~~~~~-~~~~~G~s 181 (392) T protein:vir:10 134 TYY------------------------FEYENGMYYNITFDDPKIEPILQA-----PQSD--LIHMKLLSI-DGGKTGIS 181 (392) T ss_pred EEE------------------------cCCCceEEEEEEecCcccceeEEE-----cccc--EEEecCCCC-CCcccccc Confidence 000 00010001111 11 11 00000 0111 222332222 24456888 Q ss_pred hHHHHHHHHHHHHHhHHHHHH-HHHHcCCceEEEe--cCCCccccccC----cc-ceeeccceeeeccCCCCccccceeE Q lcl|NC_020844. 234 PMKDALDLQVALFRKECALEF-AEMMTAFPMLSAN--GVEPDTDSEGS----PK-PLDTGPDTVLYAPPNAEGQHGSWNF 305 (455) Q Consensus 234 pL~dla~l~~~hy~~~sd~~~-~l~~~~~p~~~~~--G~~~~~~~~~~----~~-~~~~g~~~~~~lp~~~~~~~~~~~~ 305 (455) |+..+... +.......++.. .....+.|-.+++ +-. ...++.. .. .-...++.++.+|. ..+| T Consensus 182 ~i~~~~~~-i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~-~~~~~~~~~~~~~~~~~~~~g~~~vl~~-------g~~~ 252 (392) T protein:vir:10 182 PLYSLRRE-SKIQRASDRLTISSLNSSLNVPGVLTVKGGG-LLSDKDKASRSRSFMKRSRSGGPVVLDD-------LEEF 252 (392) T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHhccCCCceEEEeCCCC-CchHHHHHHHHHHHhccccCCCeeecCC-------CceE Confidence 88766653 333333344433 3445567776664 211 1111100 00 00112334556653 2445 Q ss_pred EeeC--chhHHHHHHHHHHHHHHHHHh-h--hhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020844. 306 VEPA--TESLKFLAEQIESVSKEIREL-G--RQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWM 380 (455) Q Consensus 306 le~~--~~~l~~~~~~l~~~~~~m~~~-g--~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~ 380 (455) .+++ +...+ +.+..+-..++|+.+ | ..++...+.+.+..+.. .+-....|.-++..++++++..| T Consensus 253 ~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~--~~f~~~~l~P~~~~ie~~l~~~L------- 322 (392) T protein:vir:10 253 TALEIKSNVAQ-LLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQI--SGMYASALNRYLRPAISELEYKL------- 322 (392) T ss_pred EEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHH--HHHHHHHHHHHHHHHHHHHHHhc------- Confidence 4444 33322 345555556666554 2 23332222222222211 11233445556666666665533 Q ss_pred CCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 381 DAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 381 g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) . + ...+.+..-|. .........+.++...|+++....++.+.+.|+.+.++-.. +-+.....+.-+| T Consensus 323 ~-~--~~~~d~~~~~~--~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~~---e~l~~~~~Gd~~~ 389 (392) T protein:vir:10 323 S-D--HISVNMRPAID--PLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDLPAP---ENTNKKTTGQSNE 389 (392) T ss_pred c-c--cccccchhhhc--cCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccccchh---cCCCCCCCCCCCC Confidence 1 1 11222222221 12234456777888999999999999999999876433211 1111111111222 No 175 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=90.57 E-value=0.02 Score=29.88 Aligned_cols=396 Identities=9% Similarity=0.006 Sum_probs=151.9 Q ss_pred CCCCCCCccCHHHHHHHH-----H-HHHHHHHhcC-HHHHHH-------hhhhcCCCCCCCCHHHHHHHhhccccCChHH Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSD-----Y-LQTVDDVLEG-VTGMRR-------NFKRYVKQFPHEQNKRYDLRKSVAKMTNVFR 66 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~-----~-w~~i~d~~~G-~~~~r~-------~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~ 66 (455) |-+.+....|+.-++... . -..+.-...| +.++.. ...-|-+++...+-...++.++..--.++.+ T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~~~~~~npiv~ 90 (547) T protein:vir:63 11 GVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVLKKFGGNIILN 90 (547) T ss_pred cCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHHHHHHHhhcCHHHH Confidence 211112222332211110 0 0000000111 000000 0122444444434333444443222246777 Q ss_pred HHHHHHhhhhhc--CC-----------c-------eeccC-C---HHHHHHHHhhccCC----CCHHHHHHHHHHHHHhh Q lcl|NC_020844. 67 DVLEGLASKPFG--RE-----------V-------EVTEG-T---GPSEDLLEDIDGRG----NNLTTFAGQLFFQGVAD 118 (455) Q Consensus 67 ~~v~~~~g~lf~--k~-----------p-------~i~~~-~---~~l~~~~~d~d~~G----~~l~~~~~~~~~~al~~ 118 (455) .+|+.++..+-. .+ + +.+.. . ..++.|+....... .+..+|++.+....+.+ T Consensus 91 ~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~~f~~~lv~d~ll~ 170 (547) T protein:vir:63 91 AIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYMY 170 (547) T ss_pred HHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchHHHHHHHHHHHHHhh Confidence 777766654421 10 0 11110 1 12456666554432 26778999999999999 Q ss_pred CcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCC Q lcl|NC_020844. 119 SISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDA 198 (455) Q Consensus 119 G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~ 198 (455) |-+|+.+-++..+.+. -+..++|..|--+... ++. ......+.++ ...+ T Consensus 171 Gn~~~~i~rd~~G~~~------------~L~~l~p~~V~~~~~~--~g~----------------~~~~~~~y~~-~~~~ 219 (547) T protein:vir:63 171 DQVNFEKVFNRNQSMV------------RFVAKDPTTIFFATTA--DGK----------------IPDNGNRFVQ-VIDQ 219 (547) T ss_pred CCEEEEEEECCCCcEE------------EEEEecCceeEEEECC--ccc----------------cccCceEEEE-EcCC Confidence 9999998765544321 1344444443211100 000 0000000011 1111 Q ss_pred cEEEEeccCCCCCCceeEEEEEecccC--CCcccCcchHHHHHHHHHHHHHhHHHHHHHHH-HcCCceEEE--ecC---C Q lcl|NC_020844. 199 GIWEVDEFGRITIGVVPMVPFITGRRK--GKKWVFHPPMKDALDLQVALFRKECALEFAEM-MTAFPMLSA--NGV---E 270 (455) Q Consensus 199 g~~~~~~~g~~~l~~vP~V~f~~~~~~--~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~-~~~~p~~~~--~G~---~ 270 (455) +..... +-.. ++-|..++.. .....+.+||..++... ........+....+ ..+.|-.++ .|- + T Consensus 220 ~~~~~~-----~~~e--iih~r~n~~~~~~~~~~G~Spi~~~~~~i-~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls 291 (547) T protein:vir:63 220 KIVATF-----NARE--MAFAVRNPRSDIYATGYGYPELEIALKQF-IAHENTEAFNDRFFSHGGTTRGILQIKAAQQQS 291 (547) T ss_pred cEEEEe-----cccc--EEEecccCCCCcccccccccHHHHHHHHH-HHHHHHHHHHHHHHHcCCCcceEEEecCCCCCC Confidence 111000 0011 2222222221 11234788887666543 33444444444444 445677555 342 2 Q ss_pred CccccccCc--cceeecccee---eeccCCCCccccceeEEeeCchhHHH-HHHHHHHHHHHHHHh-h--hhhccC--C- Q lcl|NC_020844. 271 PDTDSEGSP--KPLDTGPDTV---LYAPPNAEGQHGSWNFVEPATESLKF-LAEQIESVSKEIREL-G--RQPLTA--Q- 338 (455) Q Consensus 271 ~~~~~~~~~--~~~~~g~~~~---~~lp~~~~~~~~~~~~le~~~~~l~~-~~~~l~~~~~~m~~~-g--~~~l~~--~- 338 (455) .+....... +...-|+.+. ..++. ++++|.++..+.-.+ +.+..+-..++|+.+ | ..++.- . T Consensus 292 ~e~~~~lk~~~~~~~~G~~nagk~~vl~~------~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~ 365 (547) T protein:vir:63 292 QHALEIFKREWKNSLSGINGSWQIPVVSA------EDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNG 365 (547) T ss_pred HHHHHHHHHHHHHHhcCcccccccccccC------CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCccccc Confidence 211111111 1122333322 22321 236666655433222 334444455555443 2 222210 0 Q ss_pred --------CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHH Q lcl|NC_020844. 339 --------SGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLT 410 (455) Q Consensus 339 --------~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~ 410 (455) +.+.+..+ .....-....|.-+...++.+++..|- -. .+ ....|.++ ... ..+..+...+.+ T Consensus 366 ~~~~~~~~s~t~sn~e-~~~~~~~~~tL~P~~~~ie~~ln~~L~--~~-~~---~~~~~~f~--~~~-~~~~~~~~~~~~ 435 (547) T protein:vir:63 366 GATGSKGGSLNEGNSA-EKNQASKNKGLQPLLGFIEDFINKHIV--AE-FG---DKYTFQFV--GGD-IKSELESVKILA 435 (547) T ss_pred ccccccccccchhhHH-HHHHHHHHHHHHHHHHHHHHHHHhhcc--cc-cC---CceEEEee--ccc-cccHHHHHHHHH Confidence 00111111 111122345566666667776666441 11 11 23344433 211 223444455667 Q ss_pred HHHcCCCCHHHHHHHHHhcCCCCCCCCHHHH--------------------HHHHHhhcc------cCCCC Q lcl|NC_020844. 411 MRQNGDISREGLWMEFQRLGELSDNFDPEEE--------------------SERLISELP------GGIEE 455 (455) Q Consensus 411 ~~~~G~is~et~~~~l~~~~vl~~~~d~e~e--------------------~~ri~~e~~------~~l~~ 455 (455) +..+|+++.-.+++.+ |+-| ....-++ .++.++..+ ++.+. T Consensus 436 ~~~~g~lT~NE~R~~~---gl~P-~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (547) T protein:vir:63 436 EKAKVAMTVNEVRKEL---NLPG-DVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVS 502 (547) T ss_pred HHhCCCcCHHHHHHHh---CCCC-CCCCCceeecccccccccccccccCCccccchhhccccccccCCCCC Confidence 8888999998888755 4422 1110000 000000000 00000 No 176 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=90.56 E-value=0.02 Score=29.87 Aligned_cols=367 Identities=11% Similarity=0.051 Sum_probs=154.7 Q ss_pred CCCCCCCcc-CHHHHHHH-HHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQDPTKR-SADAEAMS-DYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~v~~~-~p~y~~~~-~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |.==+..++ .++..+.. ..|...+....+ .. ..|. | -+ -+..++. +.+...|+-+++-+-. T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~-~~~~-~------v~---~~~~~~~----~~v~~~i~~ia~~ia~ 63 (386) T protein:vir:48 1 MPIFNITNLATESPPISQGGFFDITDPDFLS--TL-NGSE-W------VS---AESALRN----SDLFSIINQLSNDLAT 63 (386) T ss_pred Ccccccccccccccccccccccccccchhcc--cc-cCCc-e------ec---hhhhhcc----hHHHHHHHHHHHhhcc Confidence 552111110 00000100 011111100000 00 0000 0 00 0112222 3344555555555555 Q ss_pred CCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhcc Q lcl|NC_020844. 79 REVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILE 158 (455) Q Consensus 79 k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~ 158 (455) -|..+.. .....++.... ...+-.+|.+.+....+.+|-+|+++..+..+.+. -+..++|..|-- T Consensus 64 ~p~~~~~--~~~~~l~~~pN-~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~------------~L~~l~~~~v~v 128 (386) T protein:vir:48 64 VKLTASR--KQLQGIIDNPS-NNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDM------------KWEYLRPSQVSF 128 (386) T ss_pred Cceeecc--chhHHHhhcCC-CCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEE------------EEEEecCceeEE Confidence 5555532 33455666555 35678899999999999999999999765543321 244444444421 Q ss_pred ceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEe--CC--cEEEEeccCCCCCCceeEEEEEecccCCCcccCcch Q lcl|NC_020844. 159 IQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKN--DA--GIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPP 234 (455) Q Consensus 159 w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~--~~--g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~p 234 (455) ... ..+ .-..|+.. +. +...... -+. |+-|..... .+...+.+| T Consensus 129 ~~~-~~~-----------------------~~~~y~~~~~~~~~~~~~~~~-----~~e--vih~~~~~~-~~~~~G~s~ 176 (386) T protein:vir:48 129 NRL-DNK-----------------------DGIYYNITFDDPRIPPKQHVP-----QGD--VLHFKLLSV-DGGLTSVSP 176 (386) T ss_pred EEc-CCC-----------------------ceEEEEEEecCccccceeEec-----Ccc--EEEecCCCC-CCceeeccH Confidence 100 000 00011111 00 0000010 111 222332222 234457888 Q ss_pred HHHHHHHHHHHHHhHHHH-HHHHHHcCCceEEEecCCCcccccc---C--ccceeeccceeeeccCCCCccccceeEEee Q lcl|NC_020844. 235 MKDALDLQVALFRKECAL-EFAEMMTAFPMLSANGVEPDTDSEG---S--PKPLDTGPDTVLYAPPNAEGQHGSWNFVEP 308 (455) Q Consensus 235 L~dla~l~~~hy~~~sd~-~~~l~~~~~p~~~~~G~~~~~~~~~---~--~~~~~~g~~~~~~lp~~~~~~~~~~~~le~ 308 (455) +.-++.. +.......++ .......+.|-.+++--.....+.. . .....-.++.++.|+. +.++.-+.. T Consensus 177 i~~~~~~-i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~~e~~~~~~~~~~~~~~n~g~~~vl~~-----g~~~~~l~~ 250 (386) T protein:vir:48 177 LMALSRE-LNIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKLSRSRQAMKQMQGGPLVLDD-----LEEFTPLEI 250 (386) T ss_pred HHHHHHH-HHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHhhcCCCCceecCC-----CceEEEcCC Confidence 8766543 3333334444 3444556778877763221111110 0 0011122344555653 223333333 Q ss_pred CchhHHHHHHHHHHHHHHHHHh-h--hhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC Q lcl|NC_020844. 309 ATESLKFLAEQIESVSKEIREL-G--RQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDT 385 (455) Q Consensus 309 ~~~~l~~~~~~l~~~~~~m~~~-g--~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~ 385 (455) ++...+ +.+..+...++|+.+ | ..++...+...+..+....+ ....|.-+...++..+++.|- . T Consensus 251 ~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~e~~~~~~--~~~~l~P~~~~ie~~l~~~l~--------~-- 317 (386) T protein:vir:48 251 KSNVSQ-LLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEMSLDL--YNKAVSRYLRPFLSELSQKLS--------C-- 317 (386) T ss_pred ChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHH--HHHHHHHHHHHHHHHHHHhhc--------c-- Confidence 333322 344455555666553 2 23342222222323322222 344466666666666665441 0 Q ss_pred ceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 386 NPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 386 ~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) ...+.+...|. .........+-++..+|+++.-..++.+.+.++.+.+. ++ .+........|-++ T Consensus 318 ~~~~~~~~~~~--~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~~~-~~--~~~~~~~~~~gGd~ 382 (386) T protein:vir:48 318 DVDADILPAVD--PTGSNSVSRINSMVKSGTLAQNQGLYILQQAEILPKEL-PE--GENPNKTTLKGGEI 382 (386) T ss_pred hhhcchhhhhc--cChHHHHHHHHHHHhCCCcCHHHHHHHhhcCCCCCccc-hh--hcCCCCCccCCCCC Confidence 11111211121 12233455666788899999999999888777765332 11 11111112223233 No 177 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=89.46 E-value=0.026 Score=29.25 Aligned_cols=366 Identities=8% Similarity=-0.059 Sum_probs=147.2 Q ss_pred CC---------CC---CCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHH Q lcl|NC_020844. 1 MP---------EQ---DPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDV 68 (455) Q Consensus 1 m~---------~~---~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~ 68 (455) |- .. .++...|.. ....+|. .+-+ +..++. ..+... T Consensus 1 MG~~~~~~~~~~~~~~~~~~~~~~~----------~~~~g~~--------~~~~----------~~al~~----~~V~~~ 48 (411) T protein:vir:81 1 MGWWSRLTRFFRPRNETVDMTNPLL----------LQWLGVD--------PDTP----------RNQLSE----ATYFAC 48 (411) T ss_pred CchHHHHHhhccCcccccccchHHH----------HHHhcCc--------ccCh----------hhhhcc----HHHHHH Confidence 22 10 011111111 0111111 0000 111111 223344 Q ss_pred HHHHhhhhhcCCcee---------ccCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhh Q lcl|NC_020844. 69 LEGLASKPFGREVEV---------TEGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREE 138 (455) Q Consensus 69 v~~~~g~lf~k~p~i---------~~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad 138 (455) |+-+++-+-+-|..+ ......+..++. .-+ ...+-.+|.+.+....+.+|-+|+++.... +.+. T Consensus 49 v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN-~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g~~~---- 122 (411) T protein:vir:81 49 LKILSESLGKLPLKMYQKTERGIVKSDREELYNLLKLRPN-PYMTSSVFWSTVEMNRNHYGNAYVWCQYSG-PQLQ---- 122 (411) T ss_pred HHHHHHhHhhCceeEEEecCCceeeecccHHHHHHhhccC-CCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CceE---- Confidence 444444444333332 111233444443 222 356889999999999999999999987542 2211 Q ss_pred HHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEE Q lcl|NC_020844. 139 ERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVP 218 (455) Q Consensus 139 ~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~ 218 (455) . +..++|..|--... . ++.. ......+..|....+|.-... +-+. ++- T Consensus 123 -------~-l~~l~~~~v~~~~~-~-~~~~---------------~~~~~~~~~~~~~~~g~~~~~-----~~~e--iih 170 (411) T protein:vir:81 123 -------A-LWILPSQYVTIVVD-D-RGLL---------------GEKNAIWYRYNDPYDGKMYVF-----RNDE--ILH 170 (411) T ss_pred -------E-EEEECCceEEEEEc-C-cccc---------------cccceEEEEEEecCCceEEEE-----cccc--EEE Confidence 1 33444544421110 0 0000 000001111211112211111 1111 222 Q ss_pred EEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHH-HHcCCceEEEecCCCcccc---ccCcc--ceeecc---cee Q lcl|NC_020844. 219 FITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAE-MMTAFPMLSANGVEPDTDS---EGSPK--PLDTGP---DTV 289 (455) Q Consensus 219 f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l-~~~~~p~~~~~G~~~~~~~---~~~~~--~~~~g~---~~~ 289 (455) |..+... +...+.+|+.-++. .+.......++.... ...+.|-.+++.-..-.++ ..... ...-|. +.+ T Consensus 171 ~k~~~~~-~~~~G~s~~~~~~~-~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~ 248 (411) T protein:vir:81 171 FKTSVTF-DGITGLSVRDVLKH-TVDGALESQKFMNNLYKTGLTGKAVLEYTGDLNQEARDRLVKGFEQFANGSKNAGKI 248 (411) T ss_pred EcCCCCC-CCcccccHHHHHHH-HHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCCc Confidence 3322222 23457788776665 345555555555444 4556788887642211111 11100 111222 335 Q ss_pred eeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHH---hhhhhccCC-Ccc-hhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 290 LYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRE---LGRQPLTAQ-SGN-LTRISAAQAASKGNSAVQQWAAG 364 (455) Q Consensus 290 ~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~---~g~~~l~~~-~~~-~sa~a~~~~~~~~~s~L~~~a~~ 364 (455) +.++.+ .++.-++.++...+ +.+..+....+|+. +...++... .++ .+..+... .-....|.-++.. T Consensus 249 ~vl~~g-----~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~--~f~~~~l~P~~~~ 320 (411) T protein:vir:81 249 IPVPLG-----MKLVPLDIKLTDSQ-FFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQNL--AFYVDTLLYVLKQ 320 (411) T ss_pred eecCCC-----ceEEEccCCHHHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHHH--HHHHHHHHHHHHH Confidence 556532 23433443333332 22334444555544 333334221 122 12222211 2233445666666 Q ss_pred HHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHH---- Q lcl|NC_020844. 365 LKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEE---- 440 (455) Q Consensus 365 ~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~---- 440 (455) ++++++.-|---.++ ..+..|+++.+=.........++++.++.+.|+++.-.+++.+ |+-| .-.-++ T Consensus 321 ie~~l~~~ll~~~~~----~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~---gl~p-~~ggD~~~~~ 392 (411) T protein:vir:81 321 YEEEITYKILSNDLI----SQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYL---DMPA-DDYGNNLMAN 392 (411) T ss_pred HHHHHHhhcCChhhc----CCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCC-CCCCCeeeec Confidence 666665432111111 1234455543222223335567888899999999998888765 4422 210000 Q ss_pred ----HHHHHHhhcc-cCCC Q lcl|NC_020844. 441 ----ESERLISELP-GGIE 454 (455) Q Consensus 441 ----e~~ri~~e~~-~~l~ 454 (455) -++.+.++.. +|=+ T Consensus 393 ~n~~pl~~~~~~~~kgGd~ 411 (411) T protein:vir:81 393 GNYIPLSMLGANYGKGGDS 411 (411) T ss_pred cCccchhhhhhhhccCCCC Confidence 0122222211 1111 No 178 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=87.55 E-value=0.038 Score=28.37 Aligned_cols=375 Identities=9% Similarity=-0.012 Sum_probs=149.4 Q ss_pred HHHHHHHHHHHhcCHHHHHHhh--hhcC--CCCCCCCHH-------------HHHHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 16 MSDYLQTVDDVLEGVTGMRRNF--KRYV--KQFPHEQNK-------------RYDLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 16 ~~~~w~~i~d~~~G~~~~r~~g--~~yL--pk~~~e~~~-------------~Y~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |+-+|-.---.=+|...+..+- ++=+ |.-+...+. .-+..++.+.++..+..+-+..++..|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~al~~~~v~~cv~~Ia~~iA~lp~~ 80 (424) T protein:vir:45 1 MLYCWWAHWLWPEGGRVLLDALFRSKSLENPSTPITGDAVDTDGLFRADVYVSPETAMKLAAVYSCIYVLSSSLAQMPLH 80 (424) T ss_pred CeeEeeeceecCcchhHHHHhhccccCCCCCccccchhhhhhhccccCCceechHHhhccHHHHHHHHHHHHHHhhCceE Confidence 2111110000002222221110 0000 000000000 0012222222333333344444443331 Q ss_pred ---CCc-eec-cCCHHHHHHH-HhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEec Q lcl|NC_020844. 79 ---REV-EVT-EGTGPSEDLL-EDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVN 152 (455) Q Consensus 79 ---k~p-~i~-~~~~~l~~~~-~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~ 152 (455) +.- ... .....+..++ ..-. ...+-.+|.+.+....+.+|-+|+++..+..+.+. .+..++ T Consensus 81 v~~~~~~~~~~~~~~~l~~lL~~~PN-~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~------------~L~~l~ 147 (424) T protein:vir:45 81 VMRRHKGKVEPARDHPAFYLVHDEPN-TWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVI------------SLDCCM 147 (424) T ss_pred EEEecCCceeecccchHHHHHHhhcc-cCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEE------------EEEEec Confidence 110 000 1122344444 2232 34678899999999999999999999765544321 244444 Q ss_pred hhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEe-CCcEEEEeccCCCCCCceeEEEEEecccCCCcccC Q lcl|NC_020844. 153 HSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKN-DAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVF 231 (455) Q Consensus 153 ~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~-~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~ 231 (455) |..+. ....++ -.+|+.. .++...+.. =.+|.+ ..... +...+ T Consensus 148 ~~~v~---i~~~~~-----------------------~~~y~~~~~~~~~~~~~-----~eVih~---r~~~~--d~~~G 191 (424) T protein:vir:45 148 PWETT---LMNTGG-----------------------RYTYGLYNEYGAFAISP-----DDMIHI---RALGN--NQKMG 191 (424) T ss_pred CceEE---EEEcCC-----------------------eEEEEEEecCceEEECc-----ccEEEe---cCcCC--CCccc Confidence 44431 011111 0111111 112111111 122332 32222 33468 Q ss_pred cchHHHHHHHHHHHHHhHHHHHHH-HHHcCCceEEEec---CCCccccccCc--cceeec----cceeeeccCCCCcccc Q lcl|NC_020844. 232 HPPMKDALDLQVALFRKECALEFA-EMMTAFPMLSANG---VEPDTDSEGSP--KPLDTG----PDTVLYAPPNAEGQHG 301 (455) Q Consensus 232 ~~pL~dla~l~~~hy~~~sd~~~~-l~~~~~p~~~~~G---~~~~~~~~~~~--~~~~~g----~~~~~~lp~~~~~~~~ 301 (455) .+|+.-+.. .+.......++... ....+.|-.+++- ++++....... +...-| ++.++.++.+ . T Consensus 192 ~spi~~~~~-~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~~n~g~~~vl~~g-----~ 265 (424) T protein:vir:45 192 LSPIMQHAE-TIGMGMSGQKYTESFFSGNARPAGIVSVKSGLNKESWGWLKDQWQKASQALRRQENKTMLLPAD-----L 265 (424) T ss_pred ccHHHHHHH-HHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhccccccCCceeEcCCC-----c Confidence 888875554 35555555555444 4456778777762 22211111111 011112 2345666532 2 Q ss_pred ceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCC-CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 302 SWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQ-SGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITN 377 (455) Q Consensus 302 ~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~-~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a 377 (455) ++.=++.++...+ +.+..+-...+|+.+ ...++... +++-|. ..+....-....|.-+...++++++.-|---. T Consensus 266 ~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn-~eq~~~~f~~~tL~P~~~~ie~~ln~kLl~~~ 343 (424) T protein:vir:45 266 DYKALTVSPVDAQ-IIDMMKLNRSMIAGIFNIPAHMINDLEKATFSN-ISAQAIQFVRYTMMPWVTNWEQELNRRLFTRA 343 (424) T ss_pred eEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCChh Confidence 3443444433322 233344444455432 23333211 122121 11122222345566666666666665332111 Q ss_pred HHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHH------HHhhccc Q lcl|NC_020844. 378 LWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESER------LISELPG 451 (455) Q Consensus 378 ~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~r------i~~e~~~ 451 (455) ++ ..+..|+++.+-..+......++++.++.+.|+++.-..++.+ |+- |.-.-++-+-- ..+..+. T Consensus 344 e~----~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~---gl~-pi~ggD~~~~~~n~~~~~~~~~~~ 415 (424) T protein:vir:45 344 EL----AAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFE---DMN-PVEGLDEMLVSVNAANPAGDFKPP 415 (424) T ss_pred hh----cCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCC-CCCCcceeeecccccccccccCCC Confidence 11 1234455554333333335567888889999999998888755 332 21111110000 0011111 Q ss_pred CCCC Q lcl|NC_020844. 452 GIEE 455 (455) Q Consensus 452 ~l~~ 455 (455) .-++ T Consensus 416 ~~~~ 419 (424) T protein:vir:45 416 KNDE 419 (424) T ss_pred CCCC Confidence 1111 No 179 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=86.74 E-value=0.043 Score=28.05 Aligned_cols=358 Identities=14% Similarity=0.091 Sum_probs=151.3 Q ss_pred CCCCCC-----CccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhh Q lcl|NC_020844. 1 MPEQDP-----TKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASK 75 (455) Q Consensus 1 m~~~~v-----~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~ 75 (455) |--=+. ..........-+.| ......|... .++ +. +..++ ...+..+|+.+++- T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~------~~v------~~---~~al~----~~~v~~~i~~ia~~ 59 (385) T protein:vir:10 1 MGLLTPRNFNKRKAKNMVYPSNPAF--FTTTVGGMQL------SYV------SA---LSALQ----NTNVYSVINRIASD 59 (385) T ss_pred Cccccchhcccccccccccccchhh--hhhhccccCc------ccc------CH---HHhhc----cHHHHHHHHHHHHH Confidence 331110 00011111111111 1111111100 000 11 11222 23455677777777 Q ss_pred hhcCCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhh Q lcl|NC_020844. 76 PFGREVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSA 155 (455) Q Consensus 76 lf~k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ 155 (455) +-+-|..+.. .....++.... ...+-.+|.+.+....+.+|-+|+++.... +..+++.. T Consensus 60 ia~~p~~v~~--~~~~~ll~~PN-~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~~------------------~~~~p~~~ 118 (385) T protein:vir:10 60 VASAHFKTEN--TATLNRLESPS-SLIGRFSFWQGALMQLCLSGNDYIPLVGQN------------------LEHIPNSD 118 (385) T ss_pred HhhCceeeec--cchhhhhhcCC-CCCCHHHHHHHHHHHhhhcCCeEEEEEcCc------------------eeEeecCC Confidence 7777777643 33455665443 346789999999999999999999985321 11111111 Q ss_pred hccceeeccCCeeEEEEEEEEeeeeEEEEEecC--eEEEEEEeCCcEEEEeccCCCCCCceeEEEEEe-cccCCCcccCc Q lcl|NC_020844. 156 ILEIQSERQGAGERITYMRMWEDAETILVLRPD--DWERWRKNDAGIWEVDEFGRITIGVVPMVPFIT-GRRKGKKWVFH 232 (455) Q Consensus 156 ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~--~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~-~~~~~~~~~~~ 232 (455) .. +.+...+ .+..+....++....... -. ++.|.+ +....+...+. T Consensus 119 ~~------------------------v~~~~~~~~~~~~~~~~~~~~~~~~~~----~e---iihik~~~~~~~~~~~G~ 167 (385) T protein:vir:10 119 VQ------------------------INYLPGNMGIVYTVLESNDRPQMVLRQ----DQ---MLHFRLMPDPQYRYLIGR 167 (385) T ss_pred ce------------------------EEEEEcCCceEEEEEEcCCceEEEEcc----cc---EEEeccCCCCcccccccc Confidence 10 0000000 000111111111111110 11 222322 22222334577 Q ss_pred chHHHHHHHHHHHHHhHHHHHHHHH-HcCCceEEEecCCCcccccc----Cc--cceeec--cceeeeccCCCCccccce Q lcl|NC_020844. 233 PPMKDALDLQVALFRKECALEFAEM-MTAFPMLSANGVEPDTDSEG----SP--KPLDTG--PDTVLYAPPNAEGQHGSW 303 (455) Q Consensus 233 ~pL~dla~l~~~hy~~~sd~~~~l~-~~~~p~~~~~G~~~~~~~~~----~~--~~~~~g--~~~~~~lp~~~~~~~~~~ 303 (455) +|+..+... +...+...++....+ ..+.|-.+++--....+++. +. +...-| ++.++.+|. +.++ T Consensus 168 s~i~~~~~~-i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~-----g~~~ 241 (385) T protein:vir:10 168 SPLESLQNA-LNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLMVLPD-----GFDY 241 (385) T ss_pred cHHHHHHHH-HHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCccccCC-----CceE Confidence 887665553 344444445444443 34567777762211111111 00 111122 223555653 3345 Q ss_pred eEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCC-Ccch--h-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 304 NFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQ-SGNL--T-RISAAQAASKGNSAVQQWAAGLKDALENALYIT 376 (455) Q Consensus 304 ~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~-~~~~--s-a~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~ 376 (455) .-++.++..++.+.+..+-..++|+.+ ...++... .++. | .++...-+ ...|.-+...++++++..| T Consensus 242 ~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~~~---~~~l~P~~~~ie~~l~~~l--- 315 (385) T protein:vir:10 242 TQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATY---LANLNSYVNPIVDELRLKM--- 315 (385) T ss_pred EecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHHHHH---HHHHHHHHHHHHHHHHHhh--- Confidence 555555555444444455555555443 33334321 1111 1 12221111 2245566666666666533 Q ss_pred HHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 377 NLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 377 a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) .+ . .++++.+=..+......++++.++++.|+++.-..++.+..-++.+.+.+ +...-..+. .+-++ T Consensus 316 ---~~--~---~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~~~~~---~~~~~~~~~-~~g~~ 382 (385) T protein:vir:10 316 ---NA--P---DLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLP---EFKPLTTQV-KGGDE 382 (385) T ss_pred ---CC--c---eEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccCCCCCc---cccCccccc-CCCCC Confidence 12 1 24444222222333556788888999999999999888766555433221 110001111 11111 No 180 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=85.61 E-value=0.052 Score=27.65 Aligned_cols=376 Identities=9% Similarity=0.009 Sum_probs=151.5 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHH-----HHHHhhccccCChHHHHHHHHhhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKR-----YDLRKSVAKMTNVFRDVLEGLASK 75 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~-----Y~~rl~ra~~~n~~~~~v~~~~g~ 75 (455) ||- ++.++.. ....-|..-+.-.. ....+.. ....|. +..+... |.....++.-.+.+...|+-+++- T Consensus 1 ~~~--~~~~~~~--~~m~~F~~~~~~~~-~~~~~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~~cI~~ia~~ 73 (413) T protein:vir:96 1 MPG--VSEIRKD--KNLKFFNNKRSPTE-ESKAKDE-IPKAPQ-VVMTLPNFFKELISDGYTKLSDSPEVRMAVDCIADL 73 (413) T ss_pred CCc--cchhhhh--hcCCccccCCCcch-hhhhhcc-cccccc-ccccchhhHhhhccchhHHHhhchHHHHHHHHHHHh Confidence 441 1111110 00001100000000 0000000 001111 0011111 111111111145666777777777 Q ss_pred hhcCCceec--------cCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCc- Q lcl|NC_020844. 76 PFGREVEVT--------EGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRP- 146 (455) Q Consensus 76 lf~k~p~i~--------~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rP- 146 (455) +-+-|..+- .....+..++..---...+-.+|.+.+....+.+|-+|+++..+..+. +| T Consensus 74 ia~~~~~~~~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~------------~~~ 141 (413) T protein:vir:96 74 VSNMTIQLMQNGETGDKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGD------------KII 141 (413) T ss_pred hccCceEEEEecCCCccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCC------------ceE Confidence 766555441 112234444422122346788999999999999999999998765432 12 Q ss_pred eEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEe-CCcEEEEeccCCCCCCceeEEEEEecccC Q lcl|NC_020844. 147 YWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKN-DAGIWEVDEFGRITIGVVPMVPFITGRRK 225 (455) Q Consensus 147 y~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~-~~g~~~~~~~g~~~l~~vP~V~f~~~~~~ 225 (455) -+..++|..|.-.. .. +.+ .|... .++++. .-.+|. |..+... T Consensus 142 ~L~~l~~~~v~~~~---~~----------------------~~~-~y~~~~~~~~~~-------~~evih---~k~~~~~ 185 (413) T protein:vir:96 142 GLTPISPYKVTFNV---SD----------------------DDL-DYSITFDNKEYD-------PSTLLH---FVLNPSI 185 (413) T ss_pred EEEEecCceeEEEE---cC----------------------CeE-EEEEeecCcEEc-------hhhEEE---EeccCCC Confidence 24445555442111 01 100 11111 111110 112232 3333333 Q ss_pred CCcccCcchHHHHHHHHHHHHHhHHHHH-HHHHHcCCceEEEecCC---CccccccCc--cceeec---cceeeeccCCC Q lcl|NC_020844. 226 GKKWVFHPPMKDALDLQVALFRKECALE-FAEMMTAFPMLSANGVE---PDTDSEGSP--KPLDTG---PDTVLYAPPNA 296 (455) Q Consensus 226 ~~~~~~~~pL~dla~l~~~hy~~~sd~~-~~l~~~~~p~~~~~G~~---~~~~~~~~~--~~~~~g---~~~~~~lp~~~ 296 (455) .+...+.+|+.-+.. .+.......++. ......+.|-.+++--. ++....... +...-| ++.++.++.+. T Consensus 186 ~~~~~G~s~~~~~~~-~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~ 264 (413) T protein:vir:96 186 ERPFIGTGYKVALKD-IVGNLKQASVTKKGFMASEYMPNLIVSVDSDSDELSDEEGRENFEEMYLKRKEAGKPWIIPEGM 264 (413) T ss_pred CCccccccHHHHHHH-HHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcCccccCceeeecCCc Confidence 344567887766544 344444445543 34455667877776311 111111110 011122 23344555332 Q ss_pred CccccceeEE-eeCchhHHHHHHHHHHHHHHHHH---hhhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 297 EGQHGSWNFV-EPATESLKFLAEQIESVSKEIRE---LGRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENA 372 (455) Q Consensus 297 ~~~~~~~~~l-e~~~~~l~~~~~~l~~~~~~m~~---~g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~ 372 (455) . ++.-+ ..+....+ +.+..+...++|+. +...++....++ .+.... -....|.-++..++++++.. T Consensus 265 ~----~~~~~~~~~~~d~q-~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~-~~~~~~----~~~~~l~P~~~~ie~~ln~~ 334 (413) T protein:vir:96 265 V----NVQQIKPLTLNDLA-INDAVTLDKKTVAGIFGVPAFLLGVGTYN-KDEFNN----FINTKIMSIAQVIQQTYNKL 334 (413) T ss_pred c----cccccccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHcCCCcch-HHHHHH----HHHHHHHHHHHHHHHHHHHh Confidence 1 11111 12223322 23333444445543 233334222211 111111 13344666666666666553 Q ss_pred HHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHH--------HHH Q lcl|NC_020844. 373 LYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEE--------SER 444 (455) Q Consensus 373 l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e--------~~r 444 (455) | +. .+..++++.+=.......+.++++.++...|+++.-.+++.+ |+-| .-.-++- ++. T Consensus 335 l------l~---~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~---g~~p-~~~gd~~~~~~n~~~~~~ 401 (413) T protein:vir:96 335 I------VE---EDMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNEFRNWV---GMPP-DAEMDDLLVLENYLQQKD 401 (413) T ss_pred h------CC---CCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCC-CCCcceeeecccccchhh Confidence 3 11 234555543222222334567888899999999999888765 4422 2110000 111 Q ss_pred HHhhcccCCCC Q lcl|NC_020844. 445 LISELPGGIEE 455 (455) Q Consensus 445 i~~e~~~~l~~ 455 (455) +.++....-+| T Consensus 402 ~~~~~~~~~~d 412 (413) T protein:vir:96 402 LVNQKKLIQDE 412 (413) T ss_pred cccccCCCCCC Confidence 11222222222 No 181 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=85.52 E-value=0.052 Score=27.62 Aligned_cols=384 Identities=12% Similarity=0.040 Sum_probs=148.6 Q ss_pred CCCCC-C-C-ccC--HHHHHH-HHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQD-P-T-KRS--ADAEAM-SDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~-v-~-~~~--p~y~~~-~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g 74 (455) |.--+ . + .++ +..... .+.+.+ .+..++.. ++..-.. +..++.+.++-....+-+.+++ T Consensus 1 M~~~~~~f~~~~r~~~~~~~~~~~~~~~-~~~~g~~~-------------~~~~v~~-~~al~~~~v~~~i~~ia~~ia~ 65 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQVIELNKDDEKL-LEWLGISP-------------STISVKG-KNALKVATVFACIKILSESVSK 65 (429) T ss_pred CchhhhhhcccccCcccccccCCChHHH-HHHhcCCC-------------Ccceech-hhhhccHHHHHHHHHHHHhhcc Confidence 32100 0 0 000 000000 000111 11222110 0000000 0112222222333333333333 Q ss_pred hhhc---CC-cee-ccCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceE Q lcl|NC_020844. 75 KPFG---RE-VEV-TEGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYW 148 (455) Q Consensus 75 ~lf~---k~-p~i-~~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~ 148 (455) .-|. +. ... ......+..++. .-+ .+.+-.+|.+.+....+.+|-+|+++.+...+.+. -+ T Consensus 66 l~~~~~~~~~~~~~~~~~~~l~~lL~~~PN-~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~------------~L 132 (429) T protein:vir:10 66 LPLKIYQEDEYGIQRGTKHYLNNLLRLRPN-PYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQ------------AL 132 (429) T ss_pred CceEEEEecCCceeeccccHHHHHHHhhcc-CCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE------------EE Confidence 3332 11 000 111223555543 222 45678899999999999999999999866544321 24 Q ss_pred EEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEE-EEEeCCcEEEEeccCCCCCCceeEEEEEecccCCC Q lcl|NC_020844. 149 SQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWER-WRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGK 227 (455) Q Consensus 149 ~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~-~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~ 227 (455) ..++|..|--... ..+. .. ..+.. |....+|....... -. |+-|..+. ..+ T Consensus 133 ~~i~~~~v~v~~~-~~~~------------------~~-~~~~~~~~~~~~g~~~~~~~----~e---vih~~~~~-~~~ 184 (429) T protein:vir:10 133 WPIDASKVTVYID-DVGL------------------LN-SKTKMWYVVNTGGQQRVLKP----EE---ILHFKNGI-TLD 184 (429) T ss_pred EEEcCceeEEEEc-Cccc------------------cc-ccceEEEEEccCCeEEEEcc----cc---EEEecCCC-CCC Confidence 4555554421100 0000 00 00111 11122222111111 11 22232222 123 Q ss_pred cccCcchHHHHHHHHHHHHHhHHHHHHH-HHHcCCceEEEecCC---CccccccCc--cceeec---cceeeeccCCCCc Q lcl|NC_020844. 228 KWVFHPPMKDALDLQVALFRKECALEFA-EMMTAFPMLSANGVE---PDTDSEGSP--KPLDTG---PDTVLYAPPNAEG 298 (455) Q Consensus 228 ~~~~~~pL~dla~l~~~hy~~~sd~~~~-l~~~~~p~~~~~G~~---~~~~~~~~~--~~~~~g---~~~~~~lp~~~~~ 298 (455) ...+.+|+..++.. +........+... ....+.|-.+++.-. ++....... +...-| ++.+..+|.+ T Consensus 185 ~~~G~s~i~~~~~~-i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g--- 260 (429) T protein:vir:10 185 GLVGVPTMEYLKST-LENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPVG--- 260 (429) T ss_pred CcccccHHHHHHHH-HHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhccccccCceeecCCC--- Confidence 34578888766654 3444444444333 445567777766321 111111111 011122 3356666532 Q ss_pred cccceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccC-CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 299 QHGSWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTA-QSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALY 374 (455) Q Consensus 299 ~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~-~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~ 374 (455) .++.-++.++...+ +.+..+...++|+.+ ...++.. .+++-+..+ .....-....|.-++..++++++.-|- T Consensus 261 --~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e-~~~~~f~~~~l~P~~~~ie~~ln~kl~ 336 (429) T protein:vir:10 261 --YQFQPISLNMSDAQ-FLENTELTIRQIATAFGIKMHQLNDLSKATLNNIE-QQQQQFYTDTLQATLTMYEQEMTYKLF 336 (429) T ss_pred --ceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHH-HHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 24444444444433 233344445555443 3333421 122222111 111112344555666666666654321 Q ss_pred HHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCC-C---------CCHHHHHHH Q lcl|NC_020844. 375 ITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSD-N---------FDPEEESER 444 (455) Q Consensus 375 ~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~-~---------~d~e~e~~r 444 (455) --.. ...+..|+++.+-.........++++.++...|+++...+++.+ |+-|- + ..+-++... T Consensus 337 ~~~~----~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~---gl~p~~ggD~~~~~~n~~~~d~~~~ 409 (429) T protein:vir:10 337 LDSE----LDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKE---DLPPEAGGDRLLVNGNMLPIDMAGQ 409 (429) T ss_pred Chhh----cCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCCCCCcCeeeecccccchhhccc Confidence 1111 12234566554333333345567788889999999998888766 33221 0 111111111 Q ss_pred HHhhc-c--cCCC----C Q lcl|NC_020844. 445 LISEL-P--GGIE----E 455 (455) Q Consensus 445 i~~e~-~--~~l~----~ 455 (455) .+.++ + +..+ | T Consensus 410 ~~~k~g~~~~~~~~~~~e 427 (429) T protein:vir:10 410 AYLKGGDTNGEVSKEGNE 427 (429) T ss_pred cccCCCCCCCCCCCCCCC Confidence 00000 0 0000 1 No 182 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=85.50 E-value=0.052 Score=27.61 Aligned_cols=392 Identities=11% Similarity=-0.012 Sum_probs=141.2 Q ss_pred CCCCCCCccCHHHH--HHHHHHHHHHHHhc------CHHHHHHhhhhcCCCCCCCCHHHH-HHHhhccccCChHHHHHHH Q lcl|NC_020844. 1 MPEQDPTKRSADAE--AMSDYLQTVDDVLE------GVTGMRRNFKRYVKQFPHEQNKRY-DLRKSVAKMTNVFRDVLEG 71 (455) Q Consensus 1 m~~~~v~~~~p~y~--~~~~~w~~i~d~~~------G~~~~r~~g~~yLpk~~~e~~~~Y-~~rl~ra~~~n~~~~~v~~ 71 (455) |+++- .-+.- ....+-.+.||-+. |+.+-+...+-+.|+. -+.+++ ..|.. ....+.+|+. T Consensus 1 ~~~~~----~~~~~~~~~~~~~~~~rd~l~~~~~glg~~r~~~~~~~g~~~~--~~~~~l~~~Yr~----~~ia~~iVd~ 70 (449) T protein:vir:10 1 MTDKL----TLAVNHALNDARMARARMGLMVPTMGLDNKRHSAWCEYGFPEL--VTYENLYSLYRR----GGIAHGAVEK 70 (449) T ss_pred Cchhh----HHHHhhhcchhHHHHHHHHHHHHHhcCCcccchhhhhcCCccc--CCHHHHHHHHhc----CchhHHHHHh Confidence 88651 11111 11123344555332 5555555544444432 233332 33332 2355677787 Q ss_pred HhhhhhcCCceeccC---CH-----HHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhcc Q lcl|NC_020844. 72 LASKPFGREVEVTEG---TG-----PSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAG 143 (455) Q Consensus 72 ~~g~lf~k~p~i~~~---~~-----~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~ 143 (455) .+....++-|.+.+. .+ .++.-++.... +.+...+..+.+-+..+|.+++++..........-.+ ..+ T Consensus 71 ~~d~~~~~~~~i~~g~~~~~~~~~~~~e~~~~~l~~--~~~~~~l~ea~~~~rl~Gga~i~i~v~d~~~l~~Pl~--~~~ 146 (449) T protein:vir:10 71 LVGKCWQTNPEIIEGDDADDSEDETSWEKKSKQVFT--NRLWRSFAEADRRRLVGRYAGILLHIRDEKDWNLPAT--KGR 146 (449) T ss_pred hhhhhhhcCcccccCccccchhhhHHHHHHHHHHHH--HHHHHHHHHHHHhhhccCcEEEEEEecCCCCCCcccc--cCc Confidence 777666666655311 11 12222221110 1233445566666777898888876432211100000 000 Q ss_pred CCceEEEechhhhc--cceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCc---EEEEeccCCCCCCceeEEE Q lcl|NC_020844. 144 VRPYWSQVNHSAIL--EIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAG---IWEVDEFGRITIGVVPMVP 218 (455) Q Consensus 144 ~rPy~~~~~~~~ii--~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g---~~~~~~~g~~~l~~vP~V~ 218 (455) ...++.++...+|. .|..+.. .=.+..|..|++-....++ .+. -|.-- ++. T Consensus 147 ~i~~i~v~~~~~i~~~~~~~dp~----------------sp~yg~P~~y~v~~~~~g~~~~~~~-----iH~SR---l~~ 202 (449) T protein:vir:10 147 GLQKVSVSWAGSLKVAEWDTGIN----------------SKTYGQPKLWKYTERLPNGSSRRVD-----IHPDR---VFI 202 (449) T ss_pred ceeeEEeeccccCChhhhhcCCC----------------CCCCCCceEEEEeeeccCCCcccee-----eccce---eEe Confidence 11111111111110 0000000 0001122222221110010 001 11111 122 Q ss_pred EEecccCCCcccCcchHHHHHHHHHH-----------HHHhHHHHHHHHHHcCCceEEEecCCCc---cccc----cCcc Q lcl|NC_020844. 219 FITGRRKGKKWVFHPPMKDALDLQVA-----------LFRKECALEFAEMMTAFPMLSANGVEPD---TDSE----GSPK 280 (455) Q Consensus 219 f~~~~~~~~~~~~~~pL~dla~l~~~-----------hy~~~sd~~~~l~~~~~p~~~~~G~~~~---~~~~----~~~~ 280 (455) |... +..+.+-|..+.+--+. .+++..-... +++... .-+.|+... ...+ ++.. T Consensus 203 ~~~~-----~~~g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~-~~~~~~--~~~~~l~~~~~~~~e~~~~~~~~~ 274 (449) T protein:vir:10 203 LGDY-----SEDAIGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLN-VNFEKE--IDFTNLASLYGVSIDELQDKFNEV 274 (449) T ss_pred ecCC-----CCCChhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHh-hhhhhh--hhhhhhhHHhhCCchHHHHHHHHH Confidence 2111 11233333333321111 1111111000 011000 001121111 0000 0000 Q ss_pred --ceeeccceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhc---cCCC-cchhHHHHHHHHHHH Q lcl|NC_020844. 281 --PLDTGPDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPL---TAQS-GNLTRISAAQAASKG 354 (455) Q Consensus 281 --~~~~g~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l---~~~~-~~~sa~a~~~~~~~~ 354 (455) .+.-|-+.. .+. .+.++..++.+.+++. +.++...+++......|+ .+++ +--+++ -+...- T Consensus 275 ~~~~~~~~~~~-~i~-----~~~d~~~~~~~~sgl~---d~l~~~~q~iaaa~~IP~t~L~Gqsp~glnst---~D~~ny 342 (449) T protein:vir:10 275 AGEINRGNDVL-MTT-----QGATVTPLVTSVADPT---ATYNVNLQTAAAGVDIPTRILIGNQQAERSST---EDQKYF 342 (449) T ss_pred HHHHhccchhe-eec-----CCcceEEEecccCChh---HHHHHHHHHHHHHhCCCeeeeeccCccccccc---hhHHHH Confidence 011122222 221 2235666665555554 345555566655544443 3332 111222 134455 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHc-CCCCCceEEEEecccCCCCCCHHH-----HHHHHHHHHcC---CCCHHHHHHH Q lcl|NC_020844. 355 NSAVQQWAAGLKDALENALYITNLWM-DAPDTNPEADVYTDFRADTGEEQT-----PGYLLTMRQNG---DISREGLWME 425 (455) Q Consensus 355 ~s~L~~~a~~~~~a~~~~l~~~a~~~-g~~~~~~~v~v~~df~~~~~~~~~-----~~al~~~~~~G---~is~et~~~~ 425 (455) +..++.....+.-.+++++.++.+-. |..+.+.+|+++.=+....-+-.+ +++..++.++| +++.+.+++. T Consensus 343 yd~i~~~Q~~l~p~le~l~~~l~~s~~g~~~~d~~i~f~pL~~~t~kEkAei~k~~A~a~~~~~~ag~~~~~~~~EiR~~ 422 (449) T protein:vir:10 343 NARCQSRRVDLSFEIEDFCDKLIELKIIDAVAKKAVIWDDLNEQTGTEKLTNAKTMGEINQTMLGSGDNPAFSREEIRTA 422 (449) T ss_pred HHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCceeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHccccCCcCHHHHHHH Confidence 66666666677788888887765432 333445555555322221111111 34555555555 8888888765 Q ss_pred HHhcCCCCCCCCH--HHHHHHHHhhcccCC Q lcl|NC_020844. 426 FQRLGELSDNFDP--EEESERLISELPGGI 453 (455) Q Consensus 426 l~~~~vl~~~~d~--e~e~~ri~~e~~~~l 453 (455) + |+-++.-++ +++.+....+.++.. T Consensus 423 ~---~~~~~~~~~~~~e~~de~~~~~d~~a 449 (449) T protein:vir:10 423 A---GYDNDDEEPLGEEDGDEEDKATDSAA 449 (449) T ss_pred h---cccCCCCCCCCCCCCccccccCCcCC Confidence 5 333322211 111111111111111 No 183 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=85.17 E-value=0.055 Score=27.50 Aligned_cols=391 Identities=10% Similarity=0.033 Sum_probs=147.2 Q ss_pred CCC-----C-------CCCccCHHHHHHHHH---HHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChH Q lcl|NC_020844. 1 MPE-----Q-------DPTKRSADAEAMSDY---LQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVF 65 (455) Q Consensus 1 m~~-----~-------~v~~~~p~y~~~~~~---w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~ 65 (455) |.+ . +.....++-...... =...+...++... ..-|-+|+...+-..-.+.++.---.+++ T Consensus 18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~~~----~~~~~~r~~~~~~~~l~~~~~~~~~npiv 93 (551) T protein:vir:80 18 KSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSA----NPGFKTKPSIRNNQDLHGVLKKFGGNIIL 93 (551) T ss_pred hhhcccccccccceeeecccccHHHHHHhhccCcceeecccccceec----CcccccCccccChhHHHHHHHHhhcCHHH Confidence 000 0 001111111111000 0001111111111 01122222222222222222221124667 Q ss_pred HHHHHHHhhhhhc--C---------Ccee---------cc-CC---HHHHHHHHhhccCC----CCHHHHHHHHHHHHHh Q lcl|NC_020844. 66 RDVLEGLASKPFG--R---------EVEV---------TE-GT---GPSEDLLEDIDGRG----NNLTTFAGQLFFQGVA 117 (455) Q Consensus 66 ~~~v~~~~g~lf~--k---------~p~i---------~~-~~---~~l~~~~~d~d~~G----~~l~~~~~~~~~~al~ 117 (455) +.+|+..+..+-. . +..+ +. .. ..++.|+....... .+..+|++.+....+. T Consensus 94 ~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s~~~f~~~lv~dlll 173 (551) T protein:vir:80 94 NAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYM 173 (551) T ss_pred HHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCccchHHHHHHHHHHHHHh Confidence 7777777655431 0 0011 00 01 12455555443321 3678899999999999 Q ss_pred hCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeC Q lcl|NC_020844. 118 DSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKND 197 (455) Q Consensus 118 ~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~ 197 (455) +|-+|+++-.+..|.+. -+..++|..|--... . +|. ......+.++ ... T Consensus 174 ~Gnay~~i~rd~~G~~~------------~L~~l~p~~V~v~~~-~-~g~----------------~~~~~~~y~~-~~~ 222 (551) T protein:vir:80 174 YDQVNFEKVFNRNQSMV------------RFVAKDPTTIFFATT-A-DGK----------------IPDNGNRFVQ-VID 222 (551) T ss_pred cCCEEEEEEECCCCcEE------------EEEEeCCceeEEEEC-C-ccc----------------cccCceEEEE-EeC Confidence 99999988765544321 144445544421110 0 000 0000000111 111 Q ss_pred CcEEEEeccCCCCCCceeEEEEEecccC--CCcccCcchHHHHHHHHHHHHHhHHHHHHH-HHHcCCceEEEe--cC--- Q lcl|NC_020844. 198 AGIWEVDEFGRITIGVVPMVPFITGRRK--GKKWVFHPPMKDALDLQVALFRKECALEFA-EMMTAFPMLSAN--GV--- 269 (455) Q Consensus 198 ~g~~~~~~~g~~~l~~vP~V~f~~~~~~--~~~~~~~~pL~dla~l~~~hy~~~sd~~~~-l~~~~~p~~~~~--G~--- 269 (455) ++....... =..|. |..+... .....+.+|+.-++.....+ +....+... +...+.|-.+++ |- T Consensus 223 g~~~~~~~~----~eiiH---~~~n~~~~~~~~~~G~spi~~a~~~i~~~-~a~~~~~~~~f~Ng~~p~giL~~~~~~~l 294 (551) T protein:vir:80 223 QKIVATFNA----REMAF---AVRNPRSDIYATGYGYPELEIALKQFIAH-ENTEAFNDRFFSHGGTTRGILQIKAAQQQ 294 (551) T ss_pred CcEEEEEcc----cceEE---ecccCCCCcccccccccHHHHHHHHHHHH-HHHHHHHHHHHHcCCCcceEEEEcCCCCC Confidence 111100000 01222 2222221 12234788887666544333 334444444 344567876553 32 Q ss_pred CCccccccCc--cceeecccee---eeccCCCCccccceeEEeeCc--hhHHHHHHHHHHHHHHHHHh---hhhhccC-- Q lcl|NC_020844. 270 EPDTDSEGSP--KPLDTGPDTV---LYAPPNAEGQHGSWNFVEPAT--ESLKFLAEQIESVSKEIREL---GRQPLTA-- 337 (455) Q Consensus 270 ~~~~~~~~~~--~~~~~g~~~~---~~lp~~~~~~~~~~~~le~~~--~~l~~~~~~l~~~~~~m~~~---g~~~l~~-- 337 (455) +.+.....+. +...-|+++. ..+.. + +++|.+... ..++ +.+..+-..++|+.+ ...++.- T Consensus 295 t~e~~~~lk~~~~~~~~G~~nag~~~vl~~----~--g~~~~~l~~~~~D~q-fle~~~~~~~~Ia~aFgVPp~~lG~~~ 367 (551) T protein:vir:80 295 SQHALEIFKREWKNSLSGINGSWQIPVVSA----E--DVKFVNMTPSARDME-FEKWLNYLINVISALYGIDPAEINIPN 367 (551) T ss_pred CHHHHHHHHHHHHHHhcCccccCccccccC----C--CceEEEccCChhHHH-HHHHHHHHHHHHHHHhcCCHHHcCccc Confidence 2111111111 1122333322 22321 1 355555443 3332 334444455555443 2222211 Q ss_pred CC---c------chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHH Q lcl|NC_020844. 338 QS---G------NLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYL 408 (455) Q Consensus 338 ~~---~------~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al 408 (455) .+ + +.+. .......-....|.-++..++.+++..|- -. .+ ....|.++. . ...+..+...+ T Consensus 368 ~~~~~~~~~~s~t~sn-~e~~~~~f~~~tL~P~~~~ie~~ln~~L~--~~-~~---~~~~f~f~~--~-~~~~~~~~~~~ 437 (551) T protein:vir:80 368 NGGATGSKGGSLNEGN-SAEKNQASKNKGLQPLLGFIEDFINKHIV--AE-FG---DKYTFQFVG--G-DIKSELESVKI 437 (551) T ss_pred ccccccccccccchhh-HHHHHHHHHHHHHHHHHHHHHHHHHhhhc--cc-cC---CceEEEeec--c-ChhhHHHHHHH Confidence 00 0 1111 11222233455677777777777776442 11 12 233343331 1 12233444456 Q ss_pred HHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHH--------------------HHHHHHhhcccCCCC Q lcl|NC_020844. 409 LTMRQNGDISREGLWMEFQRLGELSDNFDPEE--------------------ESERLISELPGGIEE 455 (455) Q Consensus 409 ~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~--------------------e~~ri~~e~~~~l~~ 455 (455) .++..+|++|.-.+++.+ |+-| ....-+ +.++.++........ T Consensus 438 ~~~~~~g~lT~NE~R~~~---gl~P-~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (551) T protein:vir:80 438 LAEKAKVAMTVNEVRKEL---NLPG-DVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQ 500 (551) T ss_pred HHHHhcCCcCHHHHHHHh---CCCC-CCCCCceeecccccccccccccccCcchhhhhhccccccCc Confidence 677788999998888755 4422 111000 000000000000000 No 184 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=83.80 E-value=0.066 Score=27.07 Aligned_cols=364 Identities=11% Similarity=0.063 Sum_probs=151.4 Q ss_pred CCCCCC-CccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcC Q lcl|NC_020844. 1 MPEQDP-TKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGR 79 (455) Q Consensus 1 m~~~~v-~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k 79 (455) |.=-+- ..+.|. ....|..+- +..++........-.-+..++. +.+..+|+.++.-+-+- T Consensus 1 Mg~f~~~~~~~~~---~~~~~~~~~------------~~~~~~~~~~~~~v~~~~~l~~----~~v~~~i~~ia~~ia~~ 61 (382) T protein:vir:48 1 MPIFNLATESPPD---NQGGFFDVV------------DSDFLASLKGNEWVSAETALRN----SDLFSIINQLSNDLATV 61 (382) T ss_pred CccccccccCCcc---cccccccch------------hhhccccccCCcccchHhhhcc----HHHHHHHHHHHHhhccC Confidence 551110 000010 000000000 0001111100000000122332 23445566666666666 Q ss_pred CceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhccc Q lcl|NC_020844. 80 EVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILEI 159 (455) Q Consensus 80 ~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w 159 (455) |..+-... ...++..-+ ...+-.+|.+.+....+.+|-+|+++..+..+.+. -+..++|..|--. T Consensus 62 ~~~~~~~~--~~~L~~~PN-~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~------------~l~~i~~~~v~v~ 126 (382) T protein:vir:48 62 KLITSRKK--LQGIVDNPS-NNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDM------------KWEYLRPSQVSFN 126 (382) T ss_pred ceeeecch--hhhhhhhcC-CCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEE------------EEEEEcCceeEEE Confidence 66553222 234444443 24578999999999999999999999765544321 2444445544211 Q ss_pred eeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCC--cEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHH Q lcl|NC_020844. 160 QSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDA--GIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKD 237 (455) Q Consensus 160 ~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~--g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~d 237 (455) ..+ .++. + ++.+...+. +..... +-.. |+-|.... ..+...+.+|+.- T Consensus 127 ~~~-~~~~--~-------------------~y~~~~~~~~~~~~~~~-----~~~e--vih~~~~~-~~~~~~G~s~l~~ 176 (382) T protein:vir:48 127 RLD-NKDG--I-------------------YYNITFDDPRIPPKQHV-----PQND--VLHFRLLS-VDGGMTSVSPLMA 176 (382) T ss_pred EcC-CCCe--E-------------------EEEEEecCccccceeEE-----cCcc--EEEecCCC-CCCccccccHHHH Confidence 100 0000 0 000111110 000000 0111 22233222 2344568888776 Q ss_pred HHHHHHHHHHhHHHHHHHH-HHcCCceEEEecCCCccccccC---c--cceeeccceeeeccCCCCccccceeEEee--C Q lcl|NC_020844. 238 ALDLQVALFRKECALEFAE-MMTAFPMLSANGVEPDTDSEGS---P--KPLDTGPDTVLYAPPNAEGQHGSWNFVEP--A 309 (455) Q Consensus 238 la~l~~~hy~~~sd~~~~l-~~~~~p~~~~~G~~~~~~~~~~---~--~~~~~g~~~~~~lp~~~~~~~~~~~~le~--~ 309 (455) +... +...+....+.... ...+.|-.+++--.....++.. . ....-.++.++.++. ..+|.+. + T Consensus 177 ~~~~-i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~n~g~~~vl~~-------g~~~~~l~~~ 248 (382) T protein:vir:48 177 LSRE-LDIQKASGNLTINSLKNALNANGILKIKGGGLLDFKTKLSRSRQAMKQMQGGPLVLDD-------LEDFTPLEIK 248 (382) T ss_pred HHHH-HHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhhccCCCCeeEcCC-------CceEEEccCC Confidence 6654 34445555554444 4456787777521111111100 0 001112345566653 2444444 3 Q ss_pred chhHHHHHHHHHHHHHHHHHh---hhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCc Q lcl|NC_020844. 310 TESLKFLAEQIESVSKEIREL---GRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDTN 386 (455) Q Consensus 310 ~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~ 386 (455) +...+ ..+..+...++|+.+ ...++.....+.+..+.. ..-....|.-+...+++.+++.| . +..+ T Consensus 249 ~~d~q-~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~~~~--~~~~~~~l~p~~~~i~~~l~~~l-------~-~~~~ 317 (382) T protein:vir:48 249 SNVSQ-LLKQADWTTGQFAKVYGIPDNVVGGQGDQQSSLEMS--SDLYSKAVSRYLRPFLSELSQKL-------S-CDVD 317 (382) T ss_pred hhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHH--HHHHHHHHHHHHHHHHHHHHHHh-------c-Chhh Confidence 33322 244455556666553 223332222122211111 12234445555666666665543 1 1111 Q ss_pred eEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 387 PEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 387 ~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) +.+...+... .......+.++..+|++++-.+++-|.+.|+++.+. ++ .+...... .|-+| T Consensus 318 --~~~~~~~~~~--~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~-~~--~~~~~~~~-~GGd~ 378 (382) T protein:vir:48 318 --ADIFPAVDPT--GSNYISRINSLVKTGTLAQNQGLYILQQAEILPKEL-PN--GENPNSTL-KGGEE 378 (382) T ss_pred --hhhhhhhccc--hhHHHHHHHHHhhcCccCHHHHHHHHhhCCCCCcch-hh--hhcCCCCC-CCCCC Confidence 1122222211 123334455677889999999999998888876432 11 11221212 23333 No 185 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=83.43 E-value=0.069 Score=26.96 Aligned_cols=374 Identities=11% Similarity=0.019 Sum_probs=149.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHH-HhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhc- Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMR-RNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFG- 78 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r-~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~- 78 (455) |=-.+.......=..--+.|..+ .+|...-. ..|. ++. . +..++.+.++-....+-+..++.-|. T Consensus 1 m~~~~~~~~~~~~~s~~~~w~~~---~~~~~~~~~~~g~-~vt------~---~~al~~~~v~~~i~~Ia~~iA~lp~~~ 67 (421) T protein:vir:10 1 MFIPQMFEGKKRSVSGGGFWEAM---LGGVRSSHSKAGV-MIT------P---ETALALSAVRACVTLLAESVAQLPVEL 67 (421) T ss_pred CCCcchhcccccccCcchhhHHH---hhhhccCcccCCc-eec------h---HHhhccHHHHHHHHHHHHhhccCceEE Confidence 54333332222212222333322 21110000 0000 000 0 11222222233333344444443332 Q ss_pred --CC--ceec-cCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEec Q lcl|NC_020844. 79 --RE--VEVT-EGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVN 152 (455) Q Consensus 79 --k~--p~i~-~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~ 152 (455) +. .... .....+..++. .-+ ...+-.+|.+.+....+.+|-+|+++.++..+.+. -+..++ T Consensus 68 ~~~~~~g~~~~~~~~~l~~lL~~~PN-~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~------------~L~~l~ 134 (421) T protein:vir:10 68 YRRDKNGGRQRATDHPIYDLIHSQPN-KKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPK------------ELIPIN 134 (421) T ss_pred EEEcCCCceeecccchHHHHHhhccc-CCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEE------------EEEEec Confidence 11 1111 11223444442 222 34678889999999999999999999866544321 133444 Q ss_pred hhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCc Q lcl|NC_020844. 153 HSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFH 232 (455) Q Consensus 153 ~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~ 232 (455) |..|.-.. ..+....|+....|.... ... ++-+.+... +...+. T Consensus 135 ~~~v~v~~-------------------------~~~g~~~y~~~~~g~~~~-------~~e--iih~~~~~~--d~~~G~ 178 (421) T protein:vir:10 135 PKKVIVLK-------------------------GPDGMPYYEIPEIGETLP-------MRM--MHHVKVFSL--DGYIGS 178 (421) T ss_pred CceEEEEE-------------------------CCCceEEEEEcCCCcEEc-------hhh--EEEecCcCC--CCcccc Confidence 44432111 011111222222222111 111 122332332 234578 Q ss_pred chHHHHHHHHHHHHHhHHHHHHHH-HHcCCceEEEec---CCCccccccC----c--cceeecc---ceeeeccCCCCcc Q lcl|NC_020844. 233 PPMKDALDLQVALFRKECALEFAE-MMTAFPMLSANG---VEPDTDSEGS----P--KPLDTGP---DTVLYAPPNAEGQ 299 (455) Q Consensus 233 ~pL~dla~l~~~hy~~~sd~~~~l-~~~~~p~~~~~G---~~~~~~~~~~----~--~~~~~g~---~~~~~lp~~~~~~ 299 (455) +|+.-+.. .+.......++...+ ...+.|-.+++- ......++.. . +...-|. +.++.||. T Consensus 179 spi~~~~~-~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~----- 252 (421) T protein:vir:10 179 SPIQTNAD-VLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQE----- 252 (421) T ss_pred cHHHHHHH-HHHHHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHHHHHHHHHHhcCccccCcceecCC----- Confidence 88775554 344444455554444 445678777762 1111011110 0 0111222 34566653 Q ss_pred ccceeEEeeCchhHHH-HHHHHHHHHHHHHHh---hhhhccC-CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 300 HGSWNFVEPATESLKF-LAEQIESVSKEIREL---GRQPLTA-QSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALY 374 (455) Q Consensus 300 ~~~~~~le~~~~~l~~-~~~~l~~~~~~m~~~---g~~~l~~-~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~ 374 (455) ..+|.+++.++-+. +.+..+-..++|+.+ ...++.. ..++-+..+ .....-....|.-++..++++++..|- T Consensus 253 --g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e-~~~~~f~~~tl~P~~~~ie~~ln~kL~ 329 (421) T protein:vir:10 253 --GMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNNIE-HQGLQFVMYTLLAWLKRHEGALQRDLL 329 (421) T ss_pred --CceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCccccHH-HHHHHHHHHHHHHHHHHHHHHHhhhcc Confidence 24554444333222 234444455555443 2233321 112222111 111112334556666666666665321 Q ss_pred HHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHH------HHHHhh Q lcl|NC_020844. 375 ITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEES------ERLISE 448 (455) Q Consensus 375 ~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~------~ri~~e 448 (455) ......+..|+++.+=..+......++++.++.+.|+++.-.+++.+ |+-| . ..-++. ..+.+- T Consensus 330 -----~~~~~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~---gl~p-~-~ggD~~~~~~n~~~~~~~ 399 (421) T protein:vir:10 330 -----LPSERRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRME---NLPP-I-AGGDKYLTPLNMVDSAQI 399 (421) T ss_pred -----CccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCC-C-CCcceeeecccccccccc Confidence 11111234455553322222234557788889999999999988765 3322 1 111110 000000 Q ss_pred ----------cccCCCC Q lcl|NC_020844. 449 ----------LPGGIEE 455 (455) Q Consensus 449 ----------~~~~l~~ 455 (455) .++..|+ T Consensus 400 ~~~~~~~~~~~~~e~d~ 416 (421) T protein:vir:10 400 IPGDKKPTAQQMAEIDT 416 (421) T ss_pred ccCCCCcccccCccccc Confidence 0111111 No 186 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=83.22 E-value=0.07 Score=26.91 Aligned_cols=381 Identities=9% Similarity=-0.012 Sum_probs=152.9 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhh-hhcCCCC-----CCC--CH--HH--HHHHhhccccCChHHHH Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNF-KRYVKQF-----PHE--QN--KR--YDLRKSVAKMTNVFRDV 68 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g-~~yLpk~-----~~e--~~--~~--Y~~rl~ra~~~n~~~~~ 68 (455) || +.+. ...+.+++.+..+...+-..+ ..-.|.. .+. +. .. -...++.+.++-....+ T Consensus 1 ~~---~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~i~~I 70 (432) T protein:vir:10 1 MP---DEKK-------LGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLV 70 (432) T ss_pred CC---CCcc-------cchhhhhHhhcCCccccccccccccccCcchhhhhcccccccCcccchhhhhcchHHHHHHHHH Confidence 76 2222 334555666655443321111 0001100 000 00 00 01223323333333344 Q ss_pred HHHHhhhhhc---CCce--eccCCHHHHHHH-HhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhc Q lcl|NC_020844. 69 LEGLASKPFG---REVE--VTEGTGPSEDLL-EDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQA 142 (455) Q Consensus 69 v~~~~g~lf~---k~p~--i~~~~~~l~~~~-~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~ 142 (455) -+.+++..|. +... .......+-.++ ..-+ ...+-.+|.+.+....+.+|-+|+++.... +.+. T Consensus 71 a~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN-~~~t~~~f~~~l~~~lll~Gnay~~~~~~~-g~~~-------- 140 (432) T protein:vir:10 71 SQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPN-STQTAFDFWQVVVTRLLLDGTAYVRKVVTD-GRIE-------- 140 (432) T ss_pred HHhhhhCceeEEEecCCCcccccccHHHHHHHhccc-ccCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcEE-------- Confidence 4444444332 1110 001122233333 2222 235788999999999999999999986532 2211 Q ss_pred cCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEec Q lcl|NC_020844. 143 GVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITG 222 (455) Q Consensus 143 ~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~ 222 (455) -+..++|..|--+. +. +|. +. |+ |.. .+|...... .-..+.+ .+. T Consensus 141 ----~L~~l~~~~v~v~~-~~-~g~--~~----------y~---------~~~-~~g~~~~~~----~~~iih~---~~~ 185 (432) T protein:vir:10 141 ----SLQYLANDRLTITT-DT-KGN--TA----------YR---------YRR-TDGQMIDIP----KQQIWKI---MGY 185 (432) T ss_pred ----EEEEEcCCceEEEE-cC-CCc--EE----------EE---------EEe-cCceEEEEc----CccEEEe---cCC Confidence 13334444442111 01 111 00 00 111 112111110 1122322 222 Q ss_pred ccCCCcccCcchHHHHHHHHHHHHHhHHHHHHH-HHHcCCceEEEecCCCcc---ccccCcccee--eccceeeeccCCC Q lcl|NC_020844. 223 RRKGKKWVFHPPMKDALDLQVALFRKECALEFA-EMMTAFPMLSANGVEPDT---DSEGSPKPLD--TGPDTVLYAPPNA 296 (455) Q Consensus 223 ~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~-l~~~~~p~~~~~G~~~~~---~~~~~~~~~~--~g~~~~~~lp~~~ 296 (455) +. +...+.+|+.-++. .+.......++... ....+.|-.+++.-.... .+.... .+. ..++.++.||. T Consensus 186 ~~--dg~~G~spi~~~~~-~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~-~~~~~~nag~~~vl~~-- 259 (432) T protein:vir:10 186 SL--DGENGLSAIRYGAQ-IFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFAK-KVSGSVEAGRAPLLEG-- 259 (432) T ss_pred CC--CCcccccHHHHHHH-HHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHHH-HHhhhhhCCCceecCC-- Confidence 22 22457788776554 34444444444333 444567777776311111 111111 011 12244666653 Q ss_pred CccccceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCCCcchhHHHHHHH---HHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 297 EGQHGSWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQSGNLTRISAAQA---ASKGNSAVQQWAAGLKDALE 370 (455) Q Consensus 297 ~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~~~~~sa~a~~~~---~~~~~s~L~~~a~~~~~a~~ 370 (455) +.++.-++.++...+ +.+..+-...+|+.+ ...++.....+.++.....+ ..-....|.-+...++.+++ T Consensus 260 ---g~~~~~l~~~~~d~q-~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~~~~f~~~tl~P~~~~ie~~ln 335 (432) T protein:vir:10 260 ---GMDVKSLGLNPVDAQ-LLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLSMTLSPWLRRIEQSIA 335 (432) T ss_pred ---CceEEEccCChHHHH-HHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 234544555444433 234444455555443 22333221111111111111 11123456666666666666 Q ss_pred HHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHH--------- Q lcl|NC_020844. 371 NALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEE--------- 441 (455) Q Consensus 371 ~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e--------- 441 (455) .-|- .. ....+..|+++.+=..+....+.++++.++.+.|+++.-..++.+ |+-|-+ .-++. T Consensus 336 ~kL~--~~---~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~---glppi~-g~~~~~~~~~~~~p 406 (432) T protein:vir:10 336 LNLL--SP---AERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIE---GLPKLG-GNAAVLTVQSAMVP 406 (432) T ss_pred hhhc--Cc---cccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh---CCCCCC-CCcceEeecCcccc Confidence 5321 11 011234455553322223334557788889999999999888765 332211 10010 Q ss_pred HHHHHhhcc---cCCCC Q lcl|NC_020844. 442 SERLISELP---GGIEE 455 (455) Q Consensus 442 ~~ri~~e~~---~~l~~ 455 (455) ++.+.++.+ ++-++ T Consensus 407 l~~~~~~~~~~~~~~~~ 423 (432) T protein:vir:10 407 LDSIGLQASPEPASGLG 423 (432) T ss_pred hhhhcccCCCCCCCCCC Confidence 122222211 11111 No 187 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=82.78 E-value=0.074 Score=26.79 Aligned_cols=369 Identities=11% Similarity=-0.012 Sum_probs=144.1 Q ss_pred CCCCC-CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcC Q lcl|NC_020844. 1 MPEQD-PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGR 79 (455) Q Consensus 1 m~~~~-v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k 79 (455) |.=-+ +. +.+.-......|.-+ ..........|. .-.++..++.+ .+...|+.+++-+-+- T Consensus 1 Mgl~~~~f-~~~~~~~~~~~~~~~---~~~~~~~~~~g~----------~v~~~~al~~~----~v~~~v~~ia~~iA~l 62 (409) T protein:vir:84 1 MSLFTRIF-SGPSEERTLTKISGI---PSPAEDWAMHGD----------RPGANSAMTLG----AFYACVTLLADTVASL 62 (409) T ss_pred Cchhhhhh-cCCCccccccccccc---ccccchhhccCc----------ccchhhhhccH----HHHHHHHHHHHhhhhC Confidence 44211 10 000000000000000 000000000000 00112223322 3344445444444433 Q ss_pred Ccee-------ccCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCC-CCcchhhhhHHhccCCceEEE Q lcl|NC_020844. 80 EVEV-------TEGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPA-GQQFRNREEERQAGVRPYWSQ 150 (455) Q Consensus 80 ~p~i-------~~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~-~~~~~t~ad~~~~~~rPy~~~ 150 (455) |..+ ......+..++. ... .+.+-.+|.+.+....+.+|-+|+++.... .+.+. -+.. T Consensus 63 p~~~~~~~~~~~~~~~~l~~lL~~~PN-~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~------------~L~~ 129 (409) T protein:vir:84 63 SIDAYRKKDNVRIPVSPAPKLLESTPY-PGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPT------------AIMP 129 (409) T ss_pred ceEEEEecCCcccccchHHHHhhccCC-CCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceE------------EEEE Confidence 3322 011123444443 333 567899999999999999999999875322 22111 1344 Q ss_pred echhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCccc Q lcl|NC_020844. 151 VNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWV 230 (455) Q Consensus 151 ~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~ 230 (455) ++|..|.--.....++ .+..+....+|.... .-..|. |.....+ +... T Consensus 130 l~p~~v~v~~~~~~~~----------------------~~~~~~~~~~g~~~~------~~dvih---~~~~~~~-~~~~ 177 (409) T protein:vir:84 130 IHPDCIHVTDAKDEDG----------------------DWIEPVYRIDGKVVP------NHRIMH---IKRYPVA-GCAL 177 (409) T ss_pred EcCceeEEEEcCCCcc----------------------eEEEEEecCCceEEc------hhhEEE---ecCCCCC-cccc Confidence 4444331000000000 010011111121100 111222 2222222 3345 Q ss_pred CcchHHHHHHHHHHHHHhHHHHHHH-HHHcCCceEEEecC---CCccccccCc--cceeeccceeeeccCCCCcccccee Q lcl|NC_020844. 231 FHPPMKDALDLQVALFRKECALEFA-EMMTAFPMLSANGV---EPDTDSEGSP--KPLDTGPDTVLYAPPNAEGQHGSWN 304 (455) Q Consensus 231 ~~~pL~dla~l~~~hy~~~sd~~~~-l~~~~~p~~~~~G~---~~~~~~~~~~--~~~~~g~~~~~~lp~~~~~~~~~~~ 304 (455) +.+|+.-+... +.......++... ....+.|-.+++.- +++....... ....-+++.++.++.+ .++. T Consensus 178 G~s~i~~~~~~-i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~n~g~~~vl~~g-----~~~~ 251 (409) T protein:vir:84 178 GMSPIEKAASA-IGLGLAAERYGLRWFRDSANPSGILSSDADLTPDQVKQTQKQWIQSHHNRRLPAVMSAG-----IKWQ 251 (409) T ss_pred cccHHHHHHHH-HHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhccCCCeeecCCC-----ceEE Confidence 78888765544 3333334444333 34456787777631 1111111111 0111244556666532 2333 Q ss_pred EEeeCchhHHHHHHHHHHHHHHHHH---hhhhhccCC-Ccch--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 305 FVEPATESLKFLAEQIESVSKEIRE---LGRQPLTAQ-SGNL--TRISAAQAASKGNSAVQQWAAGLKDALENALYITNL 378 (455) Q Consensus 305 ~le~~~~~l~~~~~~l~~~~~~m~~---~g~~~l~~~-~~~~--sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~ 378 (455) -++.++...+ +.+..+...++|+. +...++... .++. |.++.. ...-....|.-++..+++++++.| T Consensus 252 ~~~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~-~~~f~~~~l~P~~~~ie~~l~~~L----- 324 (409) T protein:vir:84 252 SVSITPNESQ-FLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQ-GINFVRHTLLPWLRCIEQALDTFL----- 324 (409) T ss_pred EccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHH-HHHHHHHHHHHHHHHHHHHHHHhc----- Confidence 3333333332 23444445555544 333334221 1222 211111 111123445666666777766543 Q ss_pred HcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHH------HHHHHhh---- Q lcl|NC_020844. 379 WMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEE------SERLISE---- 448 (455) Q Consensus 379 ~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e------~~ri~~e---- 448 (455) . .+..|+++.+=.........++++.++++.|+++.-.+++.+ |+-| .-. -++ +..+... T Consensus 325 --~---~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~---g~~p-~~g-gD~~~~~~n~~~~~~~~~~~ 394 (409) T protein:vir:84 325 --P---RGQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVRAWE---DAPP-IPE-GDIHLQPMNFVPLGYVPPEE 394 (409) T ss_pred --c---CCCeEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCC-CCC-cceeeecccccccccCCccc Confidence 1 133444443222222335567888899999999998888765 3321 100 000 0111111 Q ss_pred -----cccCCCC Q lcl|NC_020844. 449 -----LPGGIEE 455 (455) Q Consensus 449 -----~~~~l~~ 455 (455) .+.+-++ T Consensus 395 ~~~~~~~~~~~~ 406 (409) T protein:vir:84 395 PAQEPQPNSATE 406 (409) T ss_pred cCcCCCCCCccC Confidence 0111111 No 188 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=82.37 E-value=0.077 Score=26.67 Aligned_cols=379 Identities=9% Similarity=-0.032 Sum_probs=155.1 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGRE 80 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~ 80 (455) |-=.....++.+-.. ..+..+-+..++... ..|+- .+..++.+ .+...|+.+++-+-+-| T Consensus 1 m~f~~~~~~~~~~~~--~~~~~~~~~~g~~~~-----~~~v~---------~~~al~~~----~v~~~i~~ia~~ia~lp 60 (409) T protein:vir:10 1 MLFRKGFKNQSQEIS--IDDKKILEWLGINPS-----ETYVN---------GKSCLKQA----TVFGCIRILSDNISKLP 60 (409) T ss_pred CcccccccCcCCCCC--CChHHHHHHhcCCcC-----cceec---------hhhhhccH----HHHHHHHHHHHhhhhCc Confidence 765555555543111 111112223332210 01110 01122222 22333444433333333 Q ss_pred cee--------ccCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEe Q lcl|NC_020844. 81 VEV--------TEGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQV 151 (455) Q Consensus 81 p~i--------~~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~ 151 (455) ..+ ......+..++. .-+ ...+-.+|.+.+....+.+|-+|+++.....+.+. .+..+ T Consensus 61 ~~~~~~~~~~~~~~~~~l~~lL~~~PN-~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~------------~L~~i 127 (409) T protein:vir:10 61 IKIYQKKDGIKRVPDHYLEYLLKLRPN-PYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIK------------GLYPL 127 (409) T ss_pred eEEEEecCCeeeccCchHHHHHhhccC-CCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEE------------EEEEE Confidence 222 011223444443 222 45678899999999999999999999876655432 35556 Q ss_pred chhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccC Q lcl|NC_020844. 152 NHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVF 231 (455) Q Consensus 152 ~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~ 231 (455) +|..|--... ..+.... ....+..+. ...|...... .-..|. |..... +...+ T Consensus 128 ~~~~V~v~~~-~~~~~~~----------------~~~~~y~~~-~~~g~~~~~~----~~evih---~r~~~~--d~~~G 180 (409) T protein:vir:10 128 KSDGMKIFVD-DTGLLNS----------------ENNVWYLYT-DDLGQRHKFM----SDEILH---FKGLTA--DGLAG 180 (409) T ss_pred cCCceEEEEc-CCccccc----------------cceEEEEEE-eCCceeEEec----cccEEE---ecCcCC--CCccc Confidence 6665531110 0000000 000000111 1111111111 111232 222222 23457 Q ss_pred cchHHHHHHHHHHHHHhHHHHHHH-HHHcCCceEEEec---CCCccccccCc--cceeec---cceeeeccCCCCccccc Q lcl|NC_020844. 232 HPPMKDALDLQVALFRKECALEFA-EMMTAFPMLSANG---VEPDTDSEGSP--KPLDTG---PDTVLYAPPNAEGQHGS 302 (455) Q Consensus 232 ~~pL~dla~l~~~hy~~~sd~~~~-l~~~~~p~~~~~G---~~~~~~~~~~~--~~~~~g---~~~~~~lp~~~~~~~~~ 302 (455) .+|+..+... +.......++... +...+.|-.+++- ++++....... +...-| ++.+..++.+ .+ T Consensus 181 ~s~i~~~~~~-i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g-----~~ 254 (409) T protein:vir:10 181 LSVIELLNHL-IENGKSSETYLNNFFKNGLQVKGLVQYAGDLNPEAEEVFKENFERMSSGLKNAHRIAMLPIG-----YK 254 (409) T ss_pred ccHHHHHHHH-HHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCHHHHHHHHHHHHHHhccccccCCceecCCC-----ce Confidence 8887666654 4444444454443 4455667666652 11111111110 011222 2345556532 23 Q ss_pred eeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCC-Ccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 303 WNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQ-SGN-LTRISAAQAASKGNSAVQQWAAGLKDALENALYITN 377 (455) Q Consensus 303 ~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~-~~~-~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a 377 (455) +.=++.++...+ +.+..+...++|+.+ ...++... .++ .+.++... .-....|.-++..++++++.-|-.-. T Consensus 255 ~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~~--~f~~~~l~P~~~~ie~~ln~kL~~~~ 331 (409) T protein:vir:10 255 FEPISQKLVDAQ-FLENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQNR--EFYIDTLQSILNMYELEINYKLFLIS 331 (409) T ss_pred EEEccCChhhHH-HHHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHHHH--HHHHHHHHHHHHHHHHHHHHhhcCch Confidence 333344443333 233334444455443 23333211 122 22222221 12334455566666666654331111 Q ss_pred HHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCC--CCCCH----H-HHHHHHHhhcc Q lcl|NC_020844. 378 LWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELS--DNFDP----E-EESERLISELP 450 (455) Q Consensus 378 ~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~--~~~d~----e-~e~~ri~~e~~ 450 (455) + ...+..++++.+=.........++++.++++.|+++.-..++.+ |+-| ..... . .-++.+.++.. T Consensus 332 ~----~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~l---gl~p~~ggD~~~~~~n~~~~~~~~~~~~ 404 (409) T protein:vir:10 332 E----IKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELE---EDEPLEGGDVLLINGNMIPVKMAGEQYS 404 (409) T ss_pred h----ccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCCCCCcCeeeeccCccchhhcccccc Confidence 1 12234455543222233335567788899999999999888766 3322 11000 0 00111122222 Q ss_pred cCCCC Q lcl|NC_020844. 451 GGIEE 455 (455) Q Consensus 451 ~~l~~ 455 (455) .|-++ T Consensus 405 kgGe~ 409 (409) T protein:vir:10 405 KGGEK 409 (409) T ss_pred ccCCC Confidence 22222 No 189 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=81.19 E-value=0.088 Score=26.37 Aligned_cols=381 Identities=9% Similarity=-0.036 Sum_probs=160.5 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHH-hhhhcCCCCC-----CCCHHHHHHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRR-NFKRYVKQFP-----HEQNKRYDLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~-~g~~yLpk~~-----~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g 74 (455) |-+.++. -+..-..--|..++....|.+.... .+...-|-.. +..-.. +..++.+. +...|+-+++ T Consensus 1 ~~~~~~~---~~~~~~~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~-~~al~~~~----v~~cv~~Ia~ 72 (424) T protein:vir:18 1 MEEPKYT---IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSIND-ERILQIST----VWRCVSLIST 72 (424) T ss_pred CCCCccc---cccCCCCchHHHHHhhccccccccccchhhccccccccccccccccH-HHhhccHH----HHHHHHHHHH Confidence 6543331 1122223567777776655422111 0000001000 011011 22233333 3334444444 Q ss_pred hhhcCCcee---c--------cCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhcc Q lcl|NC_020844. 75 KPFGREVEV---T--------EGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAG 143 (455) Q Consensus 75 ~lf~k~p~i---~--------~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~ 143 (455) -+-+-|..+ . .....+..++..-=-...+-.+|.+.+....+.+|-+|+++..+..+.+. T Consensus 73 ~iA~lp~~vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~--------- 143 (424) T protein:vir:18 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVI--------- 143 (424) T ss_pred hhccCceEEEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE--------- Confidence 443333322 0 11233444443211235678899999999999999999999765544321 Q ss_pred CCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecc Q lcl|NC_020844. 144 VRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGR 223 (455) Q Consensus 144 ~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~ 223 (455) -+.+++|..|.-. ..++. + +|+...+|....... =..|. |.+.. T Consensus 144 ---~L~~l~~~~v~v~---~~~~~--~---------------------~y~~~~~g~~~~~~~----~eVih---ir~~~ 187 (424) T protein:vir:18 144 ---SLLPLQSANMDVK---LVGKK--V---------------------VYRYQRDSEYADFSQ----KEIFH---LKGFG 187 (424) T ss_pred ---EEEEecCcceEEE---EcCCe--E---------------------EEEEEeCCeEEEecc----ccEEE---ecCcC Confidence 2444555554211 11111 0 111111221111110 01222 22222 Q ss_pred cCCCcccCcchHHHHHHHHHHHHHhHHHHHHHH-HHcCCceEEEecCCCccccccCc------cceeec--cceeeeccC Q lcl|NC_020844. 224 RKGKKWVFHPPMKDALDLQVALFRKECALEFAE-MMTAFPMLSANGVEPDTDSEGSP------KPLDTG--PDTVLYAPP 294 (455) Q Consensus 224 ~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l-~~~~~p~~~~~G~~~~~~~~~~~------~~~~~g--~~~~~~lp~ 294 (455) . +...+.+|+.-+.. .+.......++...+ ...+.|-.+++--.....++... ....-| ++.+..|+. T Consensus 188 ~--dg~~G~spi~~~~~-~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~nag~~~vl~~ 264 (424) T protein:vir:18 188 F--TGLVGLSPIAFACK-SAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA 264 (424) T ss_pred C--CCcccccHHHHHHH-HHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCCHHHHHHHHHHHHHHhCCcccCCceeccC Confidence 2 33567888765544 455555556665444 44456877776322111111110 011112 223555653 Q ss_pred CCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccC-CCcchhHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 295 NAEGQHGSWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTA-QSGNLTRISA-AQAASKGNSAVQQWAAGLKDAL 369 (455) Q Consensus 295 ~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~-~~~~~sa~a~-~~~~~~~~s~L~~~a~~~~~a~ 369 (455) +.++.-++.++...+ +.+..+-..++|+.+ ...++.. ..++.++... +....-....|.-++..+++++ T Consensus 265 -----g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~tl~P~~~~ie~~l 338 (424) T protein:vir:18 265 -----GFSTSAIGVTPQDAE-MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSI 338 (424) T ss_pred -----CceEEecCCChhHHH-HHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 234444544444433 233344444555443 2233321 1222211111 1111222445566666666666 Q ss_pred HHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHH-------- Q lcl|NC_020844. 370 ENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEE-------- 441 (455) Q Consensus 370 ~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e-------- 441 (455) ++-|- ......+..|+++.+=..+......++++.++.+.|+++.-.+++.+ |+- |. ..-++ T Consensus 339 n~~L~-----~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~---gl~-pi-~ggD~~~~~~n~~ 408 (424) T protein:vir:18 339 QRWLI-----PSKDVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTD---NMP-PL-PGGDVAMRQAQYV 408 (424) T ss_pred HhhcC-----CccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCC-CC-CCcCeeeeccCcc Confidence 65331 01111234455553322333335567888889999999998888765 332 21 11111 Q ss_pred -HHHHHhhc---ccCC Q lcl|NC_020844. 442 -SERLISEL---PGGI 453 (455) Q Consensus 442 -~~ri~~e~---~~~l 453 (455) ++.+.++. +.|. T Consensus 409 ~l~~~~~~~~~~~n~a 424 (424) T protein:vir:18 409 PITDLGTNKEPRNNGA 424 (424) T ss_pred chhhhhccCCccccCC Confidence 01111111 1112 No 190 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=80.69 E-value=0.092 Score=26.25 Aligned_cols=373 Identities=11% Similarity=0.006 Sum_probs=150.9 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGRE 80 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~ 80 (455) |=-.+-.++.|.= ....|.-+ .+|- .-.+...+..-.. +..++.+ .+...|+-++.-+-+-| T Consensus 1 m~~~~~~~~~~~~--~~~~~~~~---~~~~--------~~~~~~~g~~v~~-~~al~~~----~v~~~i~~ia~~ia~lp 62 (419) T protein:vir:57 1 MFIPQFWKGRPSE--NRVNWQVV---PGGM--------RSSSSQAGVIITP-ETALALS----AVRACVTLLAESVAQLP 62 (419) T ss_pred CcchhhhccCCcc--cccccccc---cccc--------ccccccCCceech-HHhhccH----HHHHHHHHHHHhhccCc Confidence 5433333333221 11112111 1110 0011111111111 1222222 23444444444443333 Q ss_pred cee---------cc-CCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEE Q lcl|NC_020844. 81 VEV---------TE-GTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWS 149 (455) Q Consensus 81 p~i---------~~-~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~ 149 (455) ..+ .. ....+..++. .-. ...+-.+|.+.+....+.+|-+|+++..+..|.+. -+. T Consensus 63 ~~~~~~~~~g~~~~~~~~~l~~lL~~~PN-~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~------------~L~ 129 (419) T protein:vir:57 63 CVLYRRTENGGREIAFDHPLHDLIRYQPN-RKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDIT------------ELI 129 (419) T ss_pred eEEEEEcCCCceeccccchHHHHHhhccc-cCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE------------EEE Confidence 332 11 1223555543 333 34678889999999999999999999766544321 244 Q ss_pred EechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcc Q lcl|NC_020844. 150 QVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKW 229 (455) Q Consensus 150 ~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~ 229 (455) .++|..|.-... .+....|+....|... +...| +-+..... +.. T Consensus 130 pl~~~~v~v~~~-------------------------~~g~~~y~~~~~~~~~-------~~~~v--ih~r~~~~--d~~ 173 (419) T protein:vir:57 130 PINPHKVIVLKG-------------------------PDGMPYYDIPSIGEIL-------PMRMV--HHIKSFSL--DGY 173 (419) T ss_pred EEcCcceEEEEC-------------------------CCceEEEEEcCCceEE-------chhhE--EEecCcCC--CCc Confidence 455544421000 0011112222222111 11111 11222222 234 Q ss_pred cCcchHHHHHHHHHHHHHhHHHHHHH-HHHcCCceEEEec--C-CCcccccc----Cc--cceeec---cceeeeccCCC Q lcl|NC_020844. 230 VFHPPMKDALDLQVALFRKECALEFA-EMMTAFPMLSANG--V-EPDTDSEG----SP--KPLDTG---PDTVLYAPPNA 296 (455) Q Consensus 230 ~~~~pL~dla~l~~~hy~~~sd~~~~-l~~~~~p~~~~~G--~-~~~~~~~~----~~--~~~~~g---~~~~~~lp~~~ 296 (455) .+.+|+.-++.. +.......++... ....+.|-.+++- . .....++. .. ....-| ++.++.++. T Consensus 174 ~G~s~i~~~~~~-i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~nag~~~vl~~-- 250 (419) T protein:vir:57 174 IGTSPIQTNPDV-LGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAILAKWTERYGGVRNAFSVGMLQE-- 250 (419) T ss_pred ccccHHHHHHHH-HHHHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHHHHHHHHhccccccccceecCC-- Confidence 578887766553 3433444444443 4455778767652 1 11111111 00 001122 234555653 Q ss_pred CccccceeEEeeCchhHHH-HHHHHHHHHHHHHHh---hhhhccCC-CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 297 EGQHGSWNFVEPATESLKF-LAEQIESVSKEIREL---GRQPLTAQ-SGNLTRISAAQAASKGNSAVQQWAAGLKDALEN 371 (455) Q Consensus 297 ~~~~~~~~~le~~~~~l~~-~~~~l~~~~~~m~~~---g~~~l~~~-~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~ 371 (455) ..+|.+++.+.-+. +.+..+...++|+.+ ...++... .++-+..+ .....-....|.-+...++++++. T Consensus 251 -----g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e-~~~~~f~~~~l~P~~~~ie~~l~~ 324 (419) T protein:vir:57 251 -----GMTYKQLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNIE-HQGLQYVIYTMLAILKRHESAMMR 324 (419) T ss_pred -----CceEEEcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccccHH-HHHHHHHHHHHHHHHHHHHHHHHh Confidence 24554444332221 234444445555443 22333221 12222111 111112344455566666666554 Q ss_pred HHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCC---HH--HHHHHHH Q lcl|NC_020844. 372 ALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFD---PE--EESERLI 446 (455) Q Consensus 372 ~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d---~e--~e~~ri~ 446 (455) -|-. .....+..|+++.+=..+......++++.++.+.|+++.-..++.+..-- ++.... +- ...+.+. T Consensus 325 ~ll~-----~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p-~~ggD~~~~~~n~~~~~~~~ 398 (419) T protein:vir:57 325 DLLL-----PSERRDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLTP-IPGGDKYLTPLNMVDSKALT 398 (419) T ss_pred hccC-----ccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC-CCCcCeeeeccccccccccc Confidence 2211 10112345565543222323345577788899999999999887663211 111110 00 0011111 Q ss_pred h---hcccCCCC Q lcl|NC_020844. 447 S---ELPGGIEE 455 (455) Q Consensus 447 ~---e~~~~l~~ 455 (455) + ..|....+ T Consensus 399 ~~~~~~~~~~~~ 410 (419) T protein:vir:57 399 GIGKATPQQLKD 410 (419) T ss_pred cccCCCcccCcc Confidence 1 22223333 No 191 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=80.39 E-value=0.095 Score=26.18 Aligned_cols=369 Identities=11% Similarity=0.032 Sum_probs=143.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGRE 80 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~ 80 (455) |- ..+..+. ...+.|..- +-. .-+.|-..+.--. -.+...-++| ..|+.+++-+-+-| T Consensus 1 m~---~~~~~~~--~~~~~~~~~---------~~~--~~~~~~~~g~~~~-~~Al~~~~V~-----~cv~~ia~~iA~lp 58 (417) T protein:vir:38 1 MK---LFRGLAT--EVDPHWADH---------LLD--SGVIPSFRGGYLG-ISALRNSDVL-----TAVSIVSGDVSRFP 58 (417) T ss_pred Cc---ccccccc--CCCccchhh---------hcc--cccccccCCceec-hhhcccHHHH-----HHHHHHHHhhccCe Confidence 43 2111111 111222100 000 0122222211000 0122222222 33444444443334 Q ss_pred ceec-------cCCHHHHHHHHh-hccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCce-EEEe Q lcl|NC_020844. 81 VEVT-------EGTGPSEDLLED-IDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPY-WSQV 151 (455) Q Consensus 81 p~i~-------~~~~~l~~~~~d-~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy-~~~~ 151 (455) ..+- .....+..++.. -. ...+-.+|.+.+....+.+|-+|+++..+..+. +|. +..+ T Consensus 59 ~~~~~~~~~~~~~~~~~~~lL~~~PN-~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~------------~~~~l~~l 125 (417) T protein:vir:38 59 LVITDSSTDEVIDLANIEYLMNTKVN-KRLSAYQWKFPMMVNAILTGNAYSRIVRDPITN------------EPAMFEFY 125 (417) T ss_pred eEEEEcCCcceeccchHHHHHhcccC-cCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCC------------EEEEEEEe Confidence 3320 112234444432 22 346788899999999999999999997654322 121 3344 Q ss_pred chhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeE-EEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCccc Q lcl|NC_020844. 152 NHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDW-ERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWV 230 (455) Q Consensus 152 ~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~-~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~ 230 (455) +|..|.-... ..+.+ ..|....++....... =.+|. |...+.+ ... T Consensus 126 ~p~~v~v~~~------------------------~~~~~~y~~~~~~~~~~~~~~~----~dviH---~r~~~~d--~~~ 172 (417) T protein:vir:38 126 APSQTQVDTS------------------------DPDNIIYRFTPYNSSMQKVCGF----EDVIH---WKFFSYD--TIM 172 (417) T ss_pred CCceEEEEEc------------------------CCCeEEEEEEEcCCcEEEEecC----cceEE---ecCCCCC--Ccc Confidence 4444421110 00110 0111111121111111 11222 3333332 346 Q ss_pred CcchHHHHHHHHHHHHHhHHHHH-HHHHHcCCceEEEec---CCCccccccCcc--ceeec--cceeeeccCCCCccccc Q lcl|NC_020844. 231 FHPPMKDALDLQVALFRKECALE-FAEMMTAFPMLSANG---VEPDTDSEGSPK--PLDTG--PDTVLYAPPNAEGQHGS 302 (455) Q Consensus 231 ~~~pL~dla~l~~~hy~~~sd~~-~~l~~~~~p~~~~~G---~~~~~~~~~~~~--~~~~g--~~~~~~lp~~~~~~~~~ 302 (455) +.+|+.-++.. +........+. ......+.|-.+++- ++.+..+..+.. ...-| ++.++.|+. +.+ T Consensus 173 G~s~l~~~~~~-i~~~~~~~~~~~~~f~ng~~p~~il~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~-----g~~ 246 (417) T protein:vir:38 173 GRSPLLSLGDE-IGLQESGVSTLQKFFKSGLKGSIIKAKESRLSAEARQKIREDFERAQAGADAGSPIIVDA-----TMD 246 (417) T ss_pred ccCHHHHHHHH-HHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceeccC-----Cce Confidence 88888766653 22233333332 333445667666542 111111111100 11222 334555653 224 Q ss_pred eeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 303 WNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLW 379 (455) Q Consensus 303 ~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~ 379 (455) +.=++.++..++ +.+..+-...+|+.+ ...++.......+.++.... -....|.-++..++++++..|- ... T Consensus 247 ~~~l~~~~~d~q-~le~~~~~~~~Ia~~fgVPp~~lg~~~~~s~~e~~~~~--~~~~tl~P~~~~ie~~l~~~Ll--~~~ 321 (417) T protein:vir:38 247 YQPLEVDTNVLN-LINSNNYSTAQIAKALRVPAYRLAQNSPNQSVKQLADD--YIRNDLPFYFEPITSEFELKLL--DDA 321 (417) T ss_pred EEEccCCHHHHH-HHHHHHhhHHHHHHHhCCCHHHhCCCCcchhHHHHHHH--HHHHHHHHHHHHHHHHHHhhhc--Chh Confidence 443444554433 233333344445432 22334222222222222222 2334666677777777665441 110 Q ss_pred cCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCC---------HHHHHHHHHhhcc Q lcl|NC_020844. 380 MDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFD---------PEEESERLISELP 450 (455) Q Consensus 380 ~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d---------~e~e~~ri~~e~~ 450 (455) . ..+..|+ |+...+......++.++++.|+++.-.+++.+..--+-.++.| +.+..+..+.... T Consensus 322 ~---~~~~~~~----fd~~~l~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~~~n~~~~d~~~~~~~~~~ 394 (417) T protein:vir:38 322 Q---RHQYCIG----FDTKSVNGLPIADVNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQSTLNTVFLDQKEAYQAEHA 394 (417) T ss_pred h---cccceEE----echhhhhHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeecccccccccccccccccc Confidence 0 1122333 3333344444566778889999999988876632111111111 1111111111111 Q ss_pred cCCC----C Q lcl|NC_020844. 451 GGIE----E 455 (455) Q Consensus 451 ~~l~----~ 455 (455) ..++ . T Consensus 395 ~~~kgg~~~ 403 (417) T protein:vir:38 395 AELKGGDTN 403 (417) T ss_pred cccCCCCCC Confidence 1110 0 No 192 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=77.66 E-value=0.12 Score=25.58 Aligned_cols=368 Identities=8% Similarity=-0.050 Sum_probs=145.7 Q ss_pred HHHHh-cCHHHHHHhhhhcCCCCCCCCHHH-HHH-------------Hh--hccccCChHHHHHHHHhhhhhcCCcee-- Q lcl|NC_020844. 23 VDDVL-EGVTGMRRNFKRYVKQFPHEQNKR-YDL-------------RK--SVAKMTNVFRDVLEGLASKPFGREVEV-- 83 (455) Q Consensus 23 i~d~~-~G~~~~r~~g~~yLpk~~~e~~~~-Y~~-------------rl--~ra~~~n~~~~~v~~~~g~lf~k~p~i-- 83 (455) |=|.. -+...-|.. +......+.. +.. ++ ..|.-...+...|+-+++-+-+-|..+ T Consensus 1 ~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~g~~~~g~~v~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~ 75 (454) T protein:vir:93 1 MWNLLRRTRKNQKSG-----RDVREAGWTSLFQAVAEPFAGAWQQGVKADPEAVLSFHAVFACISLISQDIAKMRLRLMQ 75 (454) T ss_pred CCCccccCccccccc-----ccccchhhhhhhhhhhhhhcchhhcCcccChHHhhccHHHHHHHHHHHHhhccCceEEEE Confidence 11111 110000000 0000000000 000 00 001111223334555544444444332 Q ss_pred -c-------cCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhh Q lcl|NC_020844. 84 -T-------EGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSA 155 (455) Q Consensus 84 -~-------~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ 155 (455) + .....+..|+..-. ...+-.+|.+.+....+.+|-+|+++.++..+.+. -+..++|.. T Consensus 76 ~~~~g~~~~~~~~~~~~L~~~PN-~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~------------~L~~i~~~~ 142 (454) T protein:vir:93 76 TDAQGIRRETRRGDIARLCRRPN-AQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIK------------ELRILDWNR 142 (454) T ss_pred eccCCccchhhhHHHHHHHhcCC-CCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEE------------EEEEEcCcc Confidence 0 11122344544333 24577899999999999999999999765544321 244555555 Q ss_pred hccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchH Q lcl|NC_020844. 156 ILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPM 235 (455) Q Consensus 156 ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL 235 (455) |--... .+|. +.+ ++..... ...+..... +-+. |+-|..+.. .+...+.+|+ T Consensus 143 v~v~~~--~~g~--~~y----------~~~~~~~------~~~~~~~~~-----~~~e--ViH~k~~~~-~~~~~G~sp~ 194 (454) T protein:vir:93 143 VEPLVA--DDGE--VFY----------RITPDRN------CGITEAVTV-----PARE--VIHDRFNCF-FHPLIGLPPV 194 (454) T ss_pred eEEEEc--CCCc--EEE----------EEEeccc------cccceeEEe-----cCcc--eEEeccCCC-CCCceeccHH Confidence 421111 0110 000 0000000 000000000 0111 222222222 2334578887 Q ss_pred HHHHHHHHHHHHhHHHHHHHH-HHcCCceEEEec---CCCccccccCc--cceeec--cceeeeccCCCCccccceeEEe Q lcl|NC_020844. 236 KDALDLQVALFRKECALEFAE-MMTAFPMLSANG---VEPDTDSEGSP--KPLDTG--PDTVLYAPPNAEGQHGSWNFVE 307 (455) Q Consensus 236 ~dla~l~~~hy~~~sd~~~~l-~~~~~p~~~~~G---~~~~~~~~~~~--~~~~~g--~~~~~~lp~~~~~~~~~~~~le 307 (455) ..+... +.......++...+ ...+.|-.+++- ++++..+..+. +...-| ++.++.|+. +.++.-++ T Consensus 195 ~~~~~~-i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~-----g~~~~~l~ 268 (454) T protein:vir:93 195 YAAGLA-ATQGHHIQENSTSFFRNGGRPSGVIEIPGSITEENAKKLKSNWDSGYTGENAGKTAILSN-----GAKYNPTT 268 (454) T ss_pred HHHHHH-HHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHHHHHHHHHhcccccCCceeccC-----CceEEEcc Confidence 665543 44444445554443 344567666652 11111111110 011122 234555653 23444445 Q ss_pred eCchhHHHHHHHHHHHHHHHHHh---hhhhccC-CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCC Q lcl|NC_020844. 308 PATESLKFLAEQIESVSKEIREL---GRQPLTA-QSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAP 383 (455) Q Consensus 308 ~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~-~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~ 383 (455) .++...+ +.+..+-...+|+.+ ...++.. .+.+-+ ........-....|.-++..++.+++..|- . T Consensus 269 ~~~~d~q-~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~s-n~e~~~~~f~~~~l~P~~~~ie~~ln~~L~-----~--- 338 (454) T protein:vir:93 269 FSPVDSQ-TVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSD-NVEALEQQYYSQCLQTLIESIELLLDEALE-----T--- 338 (454) T ss_pred cChhHHH-HHHHHHHHHHHHHHHhCCCHHHcCCCCCCcch-hHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-----C--- Confidence 4444433 234444455555443 2223321 112221 112222223455677777777777765431 1 Q ss_pred CCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCC--C--------CHHHHHHHHH-hhcccC Q lcl|NC_020844. 384 DTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDN--F--------DPEEESERLI-SELPGG 452 (455) Q Consensus 384 ~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~--~--------d~e~e~~ri~-~e~~~~ 452 (455) ..+..++++.+=..+......+.++.++.+.|+++.-.+++.+ |+-|-. . -+-+....-. .+.+.. T Consensus 339 ~~~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~---gl~pi~ggD~~~~~~~~~~~~~~~~~~~~~~~~~ 415 (454) T protein:vir:93 339 GENESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRE---NLPPLAGGDALYLQQQNYSLEALSRRDAREDPFA 415 (454) T ss_pred CCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCCCCCCCeeeeccCccchHhhhccCcccCCCC Confidence 2234455543222222235557788899999999998888765 332210 0 0011111100 001111 Q ss_pred CC-C Q lcl|NC_020844. 453 IE-E 455 (455) Q Consensus 453 l~-~ 455 (455) -+ + T Consensus 416 ~~~~ 419 (454) T protein:vir:93 416 SSGK 419 (454) T ss_pred CCcc Confidence 11 1 No 193 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=76.28 E-value=0.14 Score=25.31 Aligned_cols=380 Identities=8% Similarity=-0.043 Sum_probs=160.6 Q ss_pred CCCCCCCccCHHHH----HHHHHHHHHHHHhcCHHHHHH-hhhhcCCCCC-----CCCHHHHHHHhhccccCChHHHHHH Q lcl|NC_020844. 1 MPEQDPTKRSADAE----AMSDYLQTVDDVLEGVTGMRR-NFKRYVKQFP-----HEQNKRYDLRKSVAKMTNVFRDVLE 70 (455) Q Consensus 1 m~~~~v~~~~p~y~----~~~~~w~~i~d~~~G~~~~r~-~g~~yLpk~~-----~e~~~~Y~~rl~ra~~~n~~~~~v~ 70 (455) |- .|.|. -...-|..+.....|...... .+...-|-.. +..- .-+..++.+.++-....+-+ T Consensus 1 ~~-------~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v-~~~~al~~~~v~~cv~~Ia~ 72 (424) T protein:vir:18 1 ME-------EPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSI-NDERILQISTVWRCVSLIST 72 (424) T ss_pred CC-------CCcceEeecCCCchHHHHHhhhcccccccccccccccccccccccccccc-cHHHhhccHHHHHHHHHHHH Confidence 44 23322 223567777776655432211 1111111000 1000 11233333333333344444 Q ss_pred HHhhhhhc---CCc-----eeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhc Q lcl|NC_020844. 71 GLASKPFG---REV-----EVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQA 142 (455) Q Consensus 71 ~~~g~lf~---k~p-----~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~ 142 (455) ..++..|. +.. .+ .....+-.++..-=-...+-.+|.+.+....+.+|-+|+++..+..+.+. T Consensus 73 ~iA~lp~~~~~~~~~~~~~~~-~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~-------- 143 (424) T protein:vir:18 73 LTACLPLDVFETDQNDNRKKV-DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVI-------- 143 (424) T ss_pred hhccCceEEEEeecCCceeee-ccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE-------- Confidence 44443332 110 01 11223445443222245678899999999999999999999876554321 Q ss_pred cCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEec Q lcl|NC_020844. 143 GVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITG 222 (455) Q Consensus 143 ~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~ 222 (455) -+..++|..|.-+. .++. + +|+...+|........ ..|. |.+. T Consensus 144 ----~L~pl~~~~V~v~~---~~~~--~---------------------~y~~~~~g~~~~~~~~----eIih---~r~~ 186 (424) T protein:vir:18 144 ----SLLPLQSANMDVKL---VGKK--V---------------------VYRYQRDSEYADFSQK----EIFH---LKGF 186 (424) T ss_pred ----EEEEecCcceEEEE---cCCe--E---------------------EEEEEeCCeEEEeccc----cEEE---ecCc Confidence 24445555542111 1111 0 1111112211111110 1222 2222 Q ss_pred ccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHH-HHcCCceEEEecCCCccccccC----c--cceeec--cceeeecc Q lcl|NC_020844. 223 RRKGKKWVFHPPMKDALDLQVALFRKECALEFAE-MMTAFPMLSANGVEPDTDSEGS----P--KPLDTG--PDTVLYAP 293 (455) Q Consensus 223 ~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l-~~~~~p~~~~~G~~~~~~~~~~----~--~~~~~g--~~~~~~lp 293 (455) .. +...+.+|+.-+.+ .+.......++...+ ...+.|-.+++--.....++.. . +...-| ++.+..|+ T Consensus 187 ~~--dg~~G~spi~~~~~-~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~~g~nag~~~vl~ 263 (424) T protein:vir:18 187 GF--TGLVGLSPIAFACK-SAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE 263 (424) T ss_pred CC--CCcccccHHHHHHH-HHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHHHHHHHHHHHHhCCcccCCceecc Confidence 22 33568888766554 344444555554443 4556787777532211111111 0 011212 23355665 Q ss_pred CCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCC-Ccch--hHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 294 PNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQ-SGNL--TRISAAQAASKGNSAVQQWAAGLKD 367 (455) Q Consensus 294 ~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~-~~~~--sa~a~~~~~~~~~s~L~~~a~~~~~ 367 (455) . +.++.-++.++...+ +.+..+-..++|+.+ ...++... .++. |..+. ....-....|.-++..+++ T Consensus 264 ~-----g~~~~~l~~~~~d~q-~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq-~~~~f~~~tl~P~~~~ie~ 336 (424) T protein:vir:18 264 A-----GFSTSAIGVTPQDAE-MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ-QNLGFLQYTLQPYISRWEN 336 (424) T ss_pred C-----CceEEecCCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHH-HHHHHHHHHHHHHHHHHHH Confidence 3 334554554444433 233344444555443 23333211 1222 22111 1112234456666666666 Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHH------ Q lcl|NC_020844. 368 ALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEE------ 441 (455) Q Consensus 368 a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e------ 441 (455) +++..|- ......+..|+++.+=..+......++++.++.+.|+++.-.+++.+ |+- |. ..-++ T Consensus 337 ~l~~~L~-----~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~---gl~-pi-~gGD~~~~~~n 406 (424) T protein:vir:18 337 SIQRWLI-----PAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTD---NLP-PL-PGGDVAMRQSQ 406 (424) T ss_pred HHHhhcC-----CccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCC-CC-CCcCeeeeccC Confidence 6665431 01111234455553322333335567888889999999988887765 332 21 10011 Q ss_pred ---HHHHHhhcccCCCC Q lcl|NC_020844. 442 ---SERLISELPGGIEE 455 (455) Q Consensus 442 ---~~ri~~e~~~~l~~ 455 (455) ++.+.++. ..-++ T Consensus 407 ~~~l~~~~~~~-~p~~~ 422 (424) T protein:vir:18 407 YVPITDLGTNK-EPRNN 422 (424) T ss_pred ccchHhhhccC-CCccC Confidence 01111111 11111 No 194 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=74.27 E-value=0.16 Score=24.94 Aligned_cols=185 Identities=13% Similarity=0.001 Sum_probs=79.8 Q ss_pred EEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCccceeeccceeeeccCCCC Q lcl|NC_020844. 218 PFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSPKPLDTGPDTVLYAPPNAE 297 (455) Q Consensus 218 ~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~~~~~~g~~~~~~lp~~~~ 297 (455) -| ...-|.+++.-..+..+....+-...+ |+ ...+.+- + T Consensus 1 V~-----------k~~~l~~~~~~~~~~~~~r~~~~~~~~----------~~-----------------~~~~~ld--~- 39 (201) T protein:vir:10 1 MW-----------KAKGLADLCDDSDGAARLRLAQVDNNS----------GV-----------------GQAIGID--A- 39 (201) T ss_pred Cc-----------cchHHHHHhcCChHHHHHHHHHHHHhh----------hh-----------------hhhheee--c- Confidence 00 011122222111111110000000000 00 0011111 0 Q ss_pred ccccceeEEeeCchhHHHHHHHHHHHHHHHHHhhhhhccC---CC---cchhHHHHHHHHHHHHHHHHHHH-HHHHHHHH Q lcl|NC_020844. 298 GQHGSWNFVEPATESLKFLAEQIESVSKEIRELGRQPLTA---QS---GNLTRISAAQAASKGNSAVQQWA-AGLKDALE 370 (455) Q Consensus 298 ~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~g~~~l~~---~~---~~~sa~a~~~~~~~~~s~L~~~a-~~~~~a~~ 370 (455) .+-++..++.+-+++ .+.+....++|...+..|++. ++ -|.||. .+.+.-+..++.+. ..+.-.++ T Consensus 40 -~~e~~e~~~~~lsGl---~d~l~~~~~~iaa~s~iP~t~LfG~sp~Glnatge---~d~~nyyd~i~~~Qe~~l~p~le 112 (201) T protein:vir:10 40 -DSEEYNVLNSDIGGI---DTFLSQKFDRIVALSGIHEIILKGKNVGGVSASQN---TALETFYGYVDRKRKAELLPLLE 112 (201) T ss_pred -CCcceeeeecCcCCh---HHHHHHHHHHHHhHhcCchhhhcCCCCccccccch---hHHHHHHHHHHHHHHHHHHHHHH Confidence 011344444444454 455666677777766666433 22 133443 23333445555554 33556666 Q ss_pred HHHHHHHHHcCCCCCceEEEEecccCCCCC-----CHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCC--CCCHHHHHH Q lcl|NC_020844. 371 NALYITNLWMDAPDTNPEADVYTDFRADTG-----EEQTPGYLLTMRQNGDISREGLWMEFQRLGELSD--NFDPEEESE 443 (455) Q Consensus 371 ~~l~~~a~~~g~~~~~~~v~v~~df~~~~~-----~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~--~~d~e~e~~ 443 (455) +++++. .. +.+.+|+++.=+....- ....+++..++.++|+||.+..++.|+.-+.... +-..+++++ T Consensus 113 ~l~~~~----~~-~~~~~~~f~pL~~~s~kekAei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~ 187 (201) T protein:vir:10 113 FLLPFI----VT-EQEWSVEFNPLSQVSDKDKSEILEKNVNSVAALIAAGIIDADEARDTLRAISTEVKIGEGSIQTEVV 187 (201) T ss_pred HHHHhh----cC-CCCceEeeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCcCCCCCCCCCcccc Confidence 666643 33 34566666643322211 1233577888889999999999999988554222 112233333 Q ss_pred HHHhhcccC--CCC Q lcl|NC_020844. 444 RLISELPGG--IEE 455 (455) Q Consensus 444 ri~~e~~~~--l~~ 455 (455) --+++.|.. -|+ T Consensus 188 ~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 188 INESEDPLDVSANN 201 (201) T ss_pred ccccCCCCCCCCCC Confidence 333444443 333 No 195 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=74.22 E-value=0.16 Score=24.93 Aligned_cols=371 Identities=11% Similarity=0.022 Sum_probs=149.0 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhc-- Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFG-- 78 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~-- 78 (455) +..+...+ .+.. ..| +.++.+|... ..|. ++ +. +..++.+.++...+.+-+++++..|. T Consensus 7 ~~~~~~~~-~~~~----~~~--~~~~~g~~~s--~~~~-~v------t~---~~al~~~~v~~~v~~ia~~iA~lp~~~~ 67 (419) T protein:vir:14 7 LLSNLGQT-QMSA----GGW--VSALLGSSRS--DSGQ-VV------TP---ASALALTVLQNCVTLLAESIAQLPIELY 67 (419) T ss_pred cccccccc-ccCc----chh--hHHhhcCCCc--cCCc-cc------ch---HHhhccHHHHHHHHHHHHhhccCceEEE Confidence 22221111 1111 122 3334443211 0111 10 00 12233333344444444444444332 Q ss_pred -CCce--eccCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechh Q lcl|NC_020844. 79 -REVE--VTEGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHS 154 (455) Q Consensus 79 -k~p~--i~~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~ 154 (455) +... ....+..+..++. .-+ ...+-.+|.+.+....+.+|-+|+++.++..+.+. -+..++|. T Consensus 68 ~~~~~~~~~~~~~~l~~lL~~~PN-~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~------------~l~pl~~~ 134 (419) T protein:vir:14 68 ERSGEDRKPATDHPLYSILKYEPN-SWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQ------------GLYPLDNE 134 (419) T ss_pred EecCCccccccccHHHHHHHhhcc-cCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE------------EEEEecCc Confidence 1100 0011223444443 222 24578889999999999999999999766544321 14445554 Q ss_pred hhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcch Q lcl|NC_020844. 155 AILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPP 234 (455) Q Consensus 155 ~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~p 234 (455) .|--+.. .+...+|+...... .+-..| +.+.+.+. +...+.+| T Consensus 135 ~v~v~~~-------------------------~~~~~~y~~~~~~~--------~~~~~i--~h~~~~~~--dg~~G~s~ 177 (419) T protein:vir:14 135 AVTVMRG-------------------------SDLKPVYRVRGSDP--------MPQRLV--HHVRWMSI--NGYTGLSP 177 (419) T ss_pred eEEEEEC-------------------------CCceEEEEEccCcc--------cchhhe--eEecCcCC--CCcccccH Confidence 4421110 00111222211110 011122 21222232 23457788 Q ss_pred HHHHHHHHHHHHHhHHHHHHHH-HHcCCceEEEec---CCCccccccCcc------ceeecc---ceeeeccCCCCcccc Q lcl|NC_020844. 235 MKDALDLQVALFRKECALEFAE-MMTAFPMLSANG---VEPDTDSEGSPK------PLDTGP---DTVLYAPPNAEGQHG 301 (455) Q Consensus 235 L~dla~l~~~hy~~~sd~~~~l-~~~~~p~~~~~G---~~~~~~~~~~~~------~~~~g~---~~~~~lp~~~~~~~~ 301 (455) +.-++. .+.......++.... ...+.|-.+++- .....+++..++ ...-|+ +.++.++. T Consensus 178 i~~~~~-~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~------- 249 (419) T protein:vir:14 178 VLLHAN-AIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQE------- 249 (419) T ss_pred HHHHHH-HHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCC------- Confidence 766654 345555555554443 445678777652 111111111100 112232 33555653 Q ss_pred ceeEEeeCchhHHH-HHHHHHHHHHHHHHh---hhhhccC-CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 302 SWNFVEPATESLKF-LAEQIESVSKEIREL---GRQPLTA-QSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYIT 376 (455) Q Consensus 302 ~~~~le~~~~~l~~-~~~~l~~~~~~m~~~---g~~~l~~-~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~ 376 (455) ..+|.+.+-++.+. +.+..+-...+|+.+ ...++.. .+++-++.+ +....-....|.-++..++++++..|- T Consensus 250 g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~E-~~~~~f~~~~L~P~~~~ie~~l~~kll-- 326 (419) T protein:vir:14 250 GMTFRPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIE-HQSLQFVIYTLLPWVKRHEQAKTRDLL-- 326 (419) T ss_pred CceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHH-HHHHHHHHHHHHHHHHHHHHHHhhhcc-- Confidence 24444443333221 223333444455443 2233321 122222211 111222334566666666666665331 Q ss_pred HHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCC--------CHHHHHHHH--- Q lcl|NC_020844. 377 NLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNF--------DPEEESERL--- 445 (455) Q Consensus 377 a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~--------d~e~e~~ri--- 445 (455) ......+..|+++.+=..+......++++.++.+.|+++.-..++.+..--+ +... ......+.. T Consensus 327 ---~~~~~~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~-~gGD~~~~~~n~~~~~~~~~~~~~ 402 (419) T protein:vir:14 327 ---LPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPV-KGGDIYLSPMNMVDASKPQQLPVG 402 (419) T ss_pred ---CccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCcCeeeeccccccccccccccCC Confidence 1111123445555322223333556778888999999999988876522111 1110 000000000 Q ss_pred Hhh-cccCCCC Q lcl|NC_020844. 446 ISE-LPGGIEE 455 (455) Q Consensus 446 ~~e-~~~~l~~ 455 (455) +.+ .+++++| T Consensus 403 ~~~~~~~~~~e 413 (419) T protein:vir:14 403 KSEPTKAAIDE 413 (419) T ss_pred CCCCccccccc Confidence 000 1122222 No 196 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=73.81 E-value=0.17 Score=24.86 Aligned_cols=321 Identities=10% Similarity=-0.012 Sum_probs=123.0 Q ss_pred Hhhhh---hcCCceeccCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCce Q lcl|NC_020844. 72 LASKP---FGREVEVTEGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPY 147 (455) Q Consensus 72 ~~g~l---f~k~p~i~~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy 147 (455) ++... +++.- .....+..++. .-+ ...+-.+|.+.+....+.+|-+|+++.++..|.+. - T Consensus 1 ia~lp~~~~~~~~---~~~~~l~~lL~~~PN-~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~------------~ 64 (348) T protein:vir:93 1 MASLPLKMYEDYK---VVNTEVSDLLTVSPN-NSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPS------------K 64 (348) T ss_pred CcccceEeEecCc---CcccHHHHHHHhCCC-CCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE------------E Confidence 11111 22211 12233555553 222 23477899999999999999999999766544321 1 Q ss_pred EEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEe-CCcEEEEeccCCCCCCceeEEEEEecccCC Q lcl|NC_020844. 148 WSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKN-DAGIWEVDEFGRITIGVVPMVPFITGRRKG 226 (455) Q Consensus 148 ~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~-~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~ 226 (455) +.+++|..|--+.. ..+..-.|+.. .+|.-... +-.. ++.|.... .. T Consensus 65 L~~l~~~~v~~~~~------------------------~~~~~~~y~~~~~~g~~~~~-----~~~e--iih~r~~~-~~ 112 (348) T protein:vir:93 65 LFLLNPDVVEMLIE------------------------NQSRELYYSIHAATGNKLIV-----HNMD--MLHFKHIV-AS 112 (348) T ss_pred EEEEcCCceEEEEe------------------------CCCcEEEEEEEcCCCeEEEE-----cccc--EEEecCCC-CC Confidence 33344433311100 00000011111 11111001 0111 22232221 22 Q ss_pred CcccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCcccccc----Cc--cceeeccceeeeccCCCCccc Q lcl|NC_020844. 227 KKWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEG----SP--KPLDTGPDTVLYAPPNAEGQH 300 (455) Q Consensus 227 ~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~----~~--~~~~~g~~~~~~lp~~~~~~~ 300 (455) +...+.+|+.-+.. .+........+ .+...+.|-.++.-.+....++. .. ....-+++.+..+|. + T Consensus 113 ~~~~G~s~~~~~~~-~i~~~~~~~~~--~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~n~~~~~vl~~-----g 184 (348) T protein:vir:93 113 NMVQGISPIDVLKN-TTDFDNAVRTF--NLTEMQKPDSFMLKYGSNVSTEKRQQVLEDFKQYYEENGGILFQEP-----G 184 (348) T ss_pred CceeeccHHHHHHH-HHHHHHHHHHH--HHHhcCCCceeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCC-----C Confidence 34457777655544 23322222222 23333333223221111111110 00 011123344555553 2 Q ss_pred cceeEEeeCchhHHHHHHHHHHHHHHHHH---hhhhhccCC-Ccchh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 301 GSWNFVEPATESLKFLAEQIESVSKEIRE---LGRQPLTAQ-SGNLT-RISAAQAASKGNSAVQQWAAGLKDALENALYI 375 (455) Q Consensus 301 ~~~~~le~~~~~l~~~~~~l~~~~~~m~~---~g~~~l~~~-~~~~s-a~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~ 375 (455) .++.-+..++..++ +.+..+-...+|+. +...++... +++-+ ..+... .-....|.-++..++++++.-|-- T Consensus 185 ~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~~~--~~~~~~l~P~~~~ie~~l~~~l~~ 261 (348) T protein:vir:93 185 VEIEPLPKKYVSED-IVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNR--FYLQHTLLPIVKQYEEEFNRKLLT 261 (348) T ss_pred ceEEEcCCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH--HHHHHHHHHHHHHHHHHHHHhhCC Confidence 23333333333322 22333333444433 333334322 22222 122111 122344555555566555543211 Q ss_pred HHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCC-------CCCCCCHHHHHHHHH-- Q lcl|NC_020844. 376 TNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGE-------LSDNFDPEEESERLI-- 446 (455) Q Consensus 376 ~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~v-------l~~~~d~e~e~~ri~-- 446 (455) -.+ ...+..|+++.+=.......+.++++.+++++|+++.-.+++.+..--+ ++....+-+.....+ T Consensus 262 ~~~----~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~ 337 (348) T protein:vir:93 262 KTD----REKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPLELRKS 337 (348) T ss_pred ccc----ccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcCeEeecccccccccchhhccc Confidence 101 1123445544322222223555778888999999999999887632111 000111100000010 Q ss_pred -hhcccCCCC Q lcl|NC_020844. 447 -SELPGGIEE 455 (455) Q Consensus 447 -~e~~~~l~~ 455 (455) .=++.+-+| T Consensus 338 ~~gg~~n~~~ 347 (348) T protein:vir:93 338 LKGGDKNVNE 347 (348) T ss_pred ccCCCCCcCC Confidence 111122222 No 197 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=70.81 E-value=0.2 Score=24.37 Aligned_cols=374 Identities=8% Similarity=-0.017 Sum_probs=148.4 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhh-hhcCCC------------CCCCCHHHHHHHhhccccCChHHH Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNF-KRYVKQ------------FPHEQNKRYDLRKSVAKMTNVFRD 67 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g-~~yLpk------------~~~e~~~~Y~~rl~ra~~~n~~~~ 67 (455) || +.+. ...|.+++.+..+.+.+-..+ ..--|. ..+..- .-+..++. ..+.. T Consensus 1 ~~---~~~~-------~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v-~~~~a~~~----~aV~~ 65 (432) T protein:vir:97 1 MP---DEKK-------LGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAV-NADAIMRL----DAVAA 65 (432) T ss_pred CC---Cccc-------CchhhhhHhhcCCccccccccccccccCchhhhhhcccccccCccc-chHhhhcc----hHHHH Confidence 66 2222 334455555544332211110 000110 000000 00122222 23333 Q ss_pred HHHHHhhhhhcCCcee---------ccCCHHHHHHH-HhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhh Q lcl|NC_020844. 68 VLEGLASKPFGREVEV---------TEGTGPSEDLL-EDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNRE 137 (455) Q Consensus 68 ~v~~~~g~lf~k~p~i---------~~~~~~l~~~~-~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~a 137 (455) .|+-+++-+-+-|..+ ......+-.++ ..-+ ...+-.+|.+.+....+.+|-+|+++.... +.+. T Consensus 66 ~v~~Ia~~ia~lp~~~y~~~~~g~~~~~~~pl~~lL~~~PN-~~~t~~~f~~~l~~~lll~Gnay~~~~~~~-g~~~--- 140 (432) T protein:vir:97 66 CVKLVSQAVAAMPLMMYMRTPDGRKEAVNHPLYTLLLDGPN-STQTAFDFWQVVVTRLLLDGTAYVRKVVTD-GRIE--- 140 (432) T ss_pred HHHHHHHhhccCceEEEEecCCCcccccccHHHHHHHhccc-ccCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcEE--- Confidence 4444444443333322 11122333443 2222 235778899999999999999999986542 2211 Q ss_pred hHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEE-eCCcEEEEeccCCCCCCceeE Q lcl|NC_020844. 138 EERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRK-NDAGIWEVDEFGRITIGVVPM 216 (455) Q Consensus 138 d~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~-~~~g~~~~~~~g~~~l~~vP~ 216 (455) -+..++|..|--.. +. +|. + .|+. ..+|...... .-..|. T Consensus 141 ---------~L~~l~p~~v~v~~-~~-~g~--~---------------------~y~~~~~~g~~~~~~----~~~iih- 181 (432) T protein:vir:97 141 ---------SLQYLANDRLTITT-DT-KGN--T---------------------AYRYRRTDGQMIDIP----RQQIWK- 181 (432) T ss_pred ---------EEEEEcCcceEEEE-cC-CCc--E---------------------EEEEEecCceEEEEc----cccEEE- Confidence 13334444332111 00 110 0 1111 1112111110 011122 Q ss_pred EEEEecccCCCcccCcchHHHHHHHHHHHHHhHHHHHH-HHHHcCCceEEEecCC---CccccccCcccee--eccceee Q lcl|NC_020844. 217 VPFITGRRKGKKWVFHPPMKDALDLQVALFRKECALEF-AEMMTAFPMLSANGVE---PDTDSEGSPKPLD--TGPDTVL 290 (455) Q Consensus 217 V~f~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~-~l~~~~~p~~~~~G~~---~~~~~~~~~~~~~--~g~~~~~ 290 (455) |.+.+. +...+.+|+.-++.. +.......++.. .....+.|-.+++--. .+..+..+. .+. ..++.+. T Consensus 182 --~r~~~~--dg~~G~spi~~~~~~-i~~~~a~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~-~~~~~~nag~~~ 255 (432) T protein:vir:97 182 --IMGYSL--DGENGLSAIRYGAQI-FGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFSK-KVSGSVEAGRAP 255 (432) T ss_pred --ecCcCC--CCcccccHHHHHHHH-HHHHHHHHHHHHHHHhccCCcceeEecCCCCCHHHHHHHHH-HHhhhhcCCCce Confidence 222222 234577887766543 333333444433 3344567766665211 111111111 011 1223466 Q ss_pred eccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCCCcchhHHHHHHHH---HHHHHHHHHHHHH Q lcl|NC_020844. 291 YAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQSGNLTRISAAQAA---SKGNSAVQQWAAG 364 (455) Q Consensus 291 ~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~~~~~sa~a~~~~~---~~~~s~L~~~a~~ 364 (455) .|+. +.++.-+..++...+ +.+..+-...+|+.+ ...++.....+.++....++. .-....|.-++.. T Consensus 256 vl~~-----g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~~~~f~~~tl~P~~~~ 329 (432) T protein:vir:97 256 LLEG-----GMDVKSLGLNPVDAQ-LLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMTLSPWLRR 329 (432) T ss_pred ecCC-----CceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHHHHHHHHHHHHHHHHHH Confidence 6653 234444444444432 234444445555442 233332111111111111111 1123455556666 Q ss_pred HHHHHHHHHHHHHHHcCCC-CCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHH-- Q lcl|NC_020844. 365 LKDALENALYITNLWMDAP-DTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEE-- 441 (455) Q Consensus 365 ~~~a~~~~l~~~a~~~g~~-~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e-- 441 (455) ++.+++.-| +... ..+..++++.+=.........++++.++...|++|.-..++.+ |+ ++.-.-++. T Consensus 330 ie~~ln~kL------l~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~---gl-pp~~g~~~~~~ 399 (432) T protein:vir:97 330 IEQSIALNL------LTPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIE---GL-PKLGGNAAVLT 399 (432) T ss_pred HHHHHhhhc------cCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh---CC-CCCCCCcceEe Confidence 666666522 1111 1233455543222233334567888899999999998888765 33 221110000 Q ss_pred -------HHHHHhhcc---cCCCC Q lcl|NC_020844. 442 -------SERLISELP---GGIEE 455 (455) Q Consensus 442 -------~~ri~~e~~---~~l~~ 455 (455) ++.+..+.. ++-++ T Consensus 400 ~~~~~~pl~~~~~~~~~~~~~~~~ 423 (432) T protein:vir:97 400 VQSAMVPLDSIGLQASPEPASGLG 423 (432) T ss_pred ecccccchhhhcccCCCCCCCCCC Confidence 112222211 11111 No 198 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=70.02 E-value=0.21 Score=24.25 Aligned_cols=387 Identities=12% Similarity=0.046 Sum_probs=144.4 Q ss_pred CCCCC--CCccCHHHH--HHHHHHHHHHHH--hcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQD--PTKRSADAE--AMSDYLQTVDDV--LEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~--v~~~~p~y~--~~~~~w~~i~d~--~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g 74 (455) |-=-+ ..+.++... .....|.-.... ..|... ..| +.-. -+..++.+.++-..+.+-+++++ T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~g---------~~v~-~~~al~~~~v~~~i~~ia~~iA~ 68 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAWEPYDPSIYNLGATA--SSG---------ERVT-PHDALQVSAVFASVRLLSETIAT 68 (457) T ss_pred Cchhhhhhccccccccccccccccccchhhhhhccccc--cCC---------ceec-hHHhhccHHHHHHHHHHHHhHhh Confidence 33000 000000000 000000000000 000000 000 0000 01223333333334444444444 Q ss_pred hhhc---CCc-ee-ccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEE Q lcl|NC_020844. 75 KPFG---REV-EV-TEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWS 149 (455) Q Consensus 75 ~lf~---k~p-~i-~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~ 149 (455) ..|. +.- .. ......+..++..-. ...+-.+|.+.+....+.+|-+|++|... .+.+. -+. T Consensus 69 lp~~~~~~~~~~~~~~~~~~~~~ll~~pn-~~~t~~~f~~~~~~~l~l~Gna~~~i~~~-~g~~~------------~l~ 134 (457) T protein:vir:62 69 LPLSTYSKRGGTRKEIDTPEWLDFPNAEP-GGMGRIDILSQTVLSLLLQGNAFLAVRWA-GPNIA------------GLD 134 (457) T ss_pred CceEEEEecCCccccccchHHHHhccccC-CCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEE------------EEE Confidence 4332 110 00 012233444443322 24578999999999999999999998533 22211 123 Q ss_pred EechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcc Q lcl|NC_020844. 150 QVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKW 229 (455) Q Consensus 150 ~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~ 229 (455) .+.|..|.-.+....+. ....+..|+....|.-..... -+.-. ++-|..... .+.. T Consensus 135 ~l~p~~v~v~~~~~~~~-------------------~~~~~~~y~~~~~g~~~~~~~-~~~~e---iih~r~~~~-~~~~ 190 (457) T protein:vir:62 135 VLDPTKIHVHMVMVDGL-------------------RRKVFEAYDIDADGNEVLLGW-FTPRD---VLHIPGMML-PGDF 190 (457) T ss_pred EEcCcceEEEEeccCCc-------------------cceeEEEEEEccCCceeEEEe-eCccc---eEEecCCCC-CCce Confidence 33344432111110000 011122333222221100000 00011 222332222 2334 Q ss_pred cCcchHHHHHHHHHHHHHhHHHHHHH-HHHcCCceEEEecCCCccccccCcc------ceeecc---ceeeeccCCCCcc Q lcl|NC_020844. 230 VFHPPMKDALDLQVALFRKECALEFA-EMMTAFPMLSANGVEPDTDSEGSPK------PLDTGP---DTVLYAPPNAEGQ 299 (455) Q Consensus 230 ~~~~pL~dla~l~~~hy~~~sd~~~~-l~~~~~p~~~~~G~~~~~~~~~~~~------~~~~g~---~~~~~lp~~~~~~ 299 (455) .+.+|+.-++. .+.......++... ....+.|-.+++- .....++.... ...-|. +.++.|+. T Consensus 191 ~G~sp~~~~~~-~i~~~~~~~~~~~~~f~ng~~p~gil~~-~~~ls~e~~~~~~~~~~~~~~G~~nag~~~vl~~----- 263 (457) T protein:vir:62 191 VGCSPISYARE-SIGLALAAQKYGAHFFRNGAMPGAVVEV-PGTMSEEGLARAREAWRAANSGVDNAHRVALLTE----- 263 (457) T ss_pred ecccHHHHHHH-HHHHHHHHHHHHHHHHhccCCcceEEEc-CCCCCHHHHHHHHHHHHHHhcCccccCcceecCC----- Confidence 67888766554 34444545555443 4455777776662 11111111110 112232 34566653 Q ss_pred ccceeEEeeCchhHHHHHHHHHHHHHHHHH---hhhhhccCC-Ccch--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 300 HGSWNFVEPATESLKFLAEQIESVSKEIRE---LGRQPLTAQ-SGNL--TRISAAQAASKGNSAVQQWAAGLKDALENAL 373 (455) Q Consensus 300 ~~~~~~le~~~~~l~~~~~~l~~~~~~m~~---~g~~~l~~~-~~~~--sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l 373 (455) +.++.-+..++...+ +.+..+-...+|+. +...++... .++. |.++... .+-....|.-+...++.+++..| T Consensus 264 g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~-~~f~~~~l~P~~~~ie~~ln~~L 341 (457) T protein:vir:62 264 GAKFSKVAMSPDEAQ-FLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQN-IAFTMFSLRPWLERIEAGFNRLL 341 (457) T ss_pred CceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHH-HHHHHHHHHHHHHHHHHHHHhhh Confidence 224444444443332 23333444444443 233334211 1222 2221111 11223445666666666666533 Q ss_pred HHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCH--HHHH-----HHHH Q lcl|NC_020844. 374 YITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDP--EEES-----ERLI 446 (455) Q Consensus 374 ~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~--e~e~-----~ri~ 446 (455) - .... ..+..++++.+=..+......+.++.++++.|+++.-.+++.+ |+- |.-+. ++-. ..+. T Consensus 342 ~--~~~~---~~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~---gl~-pi~~g~~D~~~~~~n~~~~~ 412 (457) T protein:vir:62 342 F--AETA---DRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAE---DMT-PLPDGLGEKYRVPLNLGEIG 412 (457) T ss_pred c--Cccc---cCceEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCC-CCCCCCcceeeecccccccc Confidence 1 1111 1223344443222333335567778889999999999888765 331 11111 0000 0000 Q ss_pred hh---cc------------cCCCC Q lcl|NC_020844. 447 SE---LP------------GGIEE 455 (455) Q Consensus 447 ~e---~~------------~~l~~ 455 (455) .+ .+ ...++ T Consensus 413 ~~~~~~~~~~~~~~~~~~~~~~~~ 436 (457) T protein:vir:62 413 EEPEPEPAPAPPAIDPPAEEPADD 436 (457) T ss_pred ccccccccCCCccCCCCccCCCCC Confidence 00 00 00000 No 199 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=68.46 E-value=0.24 Score=24.01 Aligned_cols=363 Identities=12% Similarity=0.019 Sum_probs=139.1 Q ss_pred CCCCCCCccCH-HHH-HHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQDPTKRSA-DAE-AMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~v~~~~p-~y~-~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |---+.-++-+ .-. ...+.+ ..-...|. +. .... -.+...++ ..++..+|+.++.-+-+ T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~--~~~~~~~~---------~~-~~~~---~~~~~~~~----~~~v~~~i~~ia~~ia~ 61 (406) T protein:vir:95 1 MGLFDRWRRTKRKSKIRADTGY--VGLFMSGE---------DV-SFLV---PGYVRLSD----NPEVRMAVHKIADLISS 61 (406) T ss_pred Ccchhhhccccccccccccchh--hhhhccCc---------cc-Cccc---cCHHHHhh----cHHHHHHHHHHHHhhcc Confidence 44211100000 000 000000 00000000 00 0000 01222222 24556667777776666 Q ss_pred CCceec---------cCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcE--EEEEeeCCCCcchhhhhHHhccCCce Q lcl|NC_020844. 79 REVEVT---------EGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSIS--WIRVDYPAGQQFRNREEERQAGVRPY 147 (455) Q Consensus 79 k~p~i~---------~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~--~~lVd~p~~~~~~t~ad~~~~~~rPy 147 (455) -|..+- ..++....|+..-. ...+-.+|.+.+....+.+|.+ |+++.+...+.+. - T Consensus 62 ~~~~~~~~~~~~~~~~~~~~~~~l~~~PN-~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~------------~ 128 (406) T protein:vir:95 62 MTIYLMQNTEDGDIRIRNELSRKIDITPY-SLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLID------------E 128 (406) T ss_pred CceEEEEecCCcceeecchHHHHHhhccC-CCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEE------------E Confidence 555441 11223334444333 3467889999999999998765 4445443333211 1 Q ss_pred EEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCC Q lcl|NC_020844. 148 WSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGK 227 (455) Q Consensus 148 ~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~ 227 (455) +.+++|..|--.. +.+.+++ .. ++.- . ..-. ++-|..+....+ T Consensus 129 l~~i~~~~v~~~~-------------------------~~~~~~~-~~--~~~~--~----~~~e---vih~~~~~~~~~ 171 (406) T protein:vir:95 129 LVPLTPSKVNFLD-------------------------TPDGYQV-LY--GGQT--F----NYDE---VLHFIYNPDPER 171 (406) T ss_pred EEEEcCceeEEEE-------------------------cCCeEEE-Ee--ccEE--E----chhH---EEEeeccCCCCC Confidence 2333333321100 0011111 00 1100 0 0111 222333333334 Q ss_pred cccCcchHHHHHHHHHHHHHhHHHHHH-HHHHcCCceEEEec---CCCccccccCcc--ceeecc---ceeeeccCCCCc Q lcl|NC_020844. 228 KWVFHPPMKDALDLQVALFRKECALEF-AEMMTAFPMLSANG---VEPDTDSEGSPK--PLDTGP---DTVLYAPPNAEG 298 (455) Q Consensus 228 ~~~~~~pL~dla~l~~~hy~~~sd~~~-~l~~~~~p~~~~~G---~~~~~~~~~~~~--~~~~g~---~~~~~lp~~~~~ 298 (455) ...+.+|+.-+... +........+.. .....+.|..+++- ++++........ ...-|. +....++.+. T Consensus 172 ~~~G~s~i~~~~~~-i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~v~~~~~-- 248 (406) T protein:vir:95 172 PYIGRGYRVVLKDI-ADNLKQATATKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFKKYLQATEAGQPWIIPAEL-- 248 (406) T ss_pred CccccCHHHHHHHH-HHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhccccccCCceeecCCC-- Confidence 44577887655443 333333444433 34455667666653 222222111111 111222 2233444221 Q ss_pred ccccee-EEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 299 QHGSWN-FVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALY 374 (455) Q Consensus 299 ~~~~~~-~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~ 374 (455) .... +...++...+ +.+..+....+|+.+ ...++...+ +...+... -....|.-++..++++++..| T Consensus 249 --~~~~~~~~~~~~d~q-~~e~~~~~~~~Ia~~fgVp~~~lg~~~-~~~~~~~~----~~~~~l~P~~~~ie~~l~~~l- 319 (406) T protein:vir:95 249 --LEVEQVKPLSLKDIA-INEAVELDKRTVAGMFGVPAFLLGIGE-FNRDEYNN----FINSTILPIAKGIEQELTRKL- 319 (406) T ss_pred --ccccccccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHcCCCC-chHHHHHH----HHHHHHHHHHHHHHHHHHHhc- Confidence 1111 1122333322 234444445555443 333342111 11111111 123344555555555544322 Q ss_pred HHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCC----------CCCCHHHHHHH Q lcl|NC_020844. 375 ITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELS----------DNFDPEEESER 444 (455) Q Consensus 375 ~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~----------~~~d~e~e~~r 444 (455) +. +.+..|+++.+=.........++.+.++...|+++...+++.+ |+-| ....+-+.... T Consensus 320 -----~~--~~~~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~---gl~p~~~gd~~~~~~n~~~~~~~~~ 389 (406) T protein:vir:95 320 -----LI--SPDLYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWL---GLSPKEGLSELVILENYIPLDKIGD 389 (406) T ss_pred -----CC--CCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCCCCCcceeeeccCccchhhccc Confidence 11 2334455543222222234567788899999999999888765 3321 11111111111 Q ss_pred HHhhcccCCCC Q lcl|NC_020844. 445 LISELPGGIEE 455 (455) Q Consensus 445 i~~e~~~~l~~ 455 (455) -+.+ ++|=++ T Consensus 390 ~~~~-k~g~~~ 399 (406) T protein:vir:95 390 QSKL-KGGDNS 399 (406) T ss_pred cccc-CCCCCC Confidence 1111 111111 No 200 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=67.77 E-value=0.25 Score=23.91 Aligned_cols=383 Identities=8% Similarity=-0.066 Sum_probs=149.3 Q ss_pred CCCCC-CCcc-------CHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHH Q lcl|NC_020844. 1 MPEQD-PTKR-------SADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGL 72 (455) Q Consensus 1 m~~~~-v~~~-------~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~ 72 (455) |-=-+ ...+ ...+..-. .+. ..+...+...|. .+........+ ++ .+.+...|+-+ T Consensus 1 MG~f~~lf~~~~~~~~~~~~~~~~~-~~~-----~~~~~~~~~~g~-----~~~~~v~~~~a-l~----~~~v~~ci~~i 64 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRSNYDEDI-GID-----ISDSNFWEKFGI-----KLNFSVRGKRA-LK----ENTVYVCTKIR 64 (422) T ss_pred CchhhhhhhccCCccchhhhhhhcc-ccc-----cCcchhhhhccc-----cCCcccchhhh-hc----cHHHHHHHHHH Confidence 22100 0000 00000000 000 000000000000 00000000111 11 12333445555 Q ss_pred hhhhhcCCcee-----ccCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCc Q lcl|NC_020844. 73 ASKPFGREVEV-----TEGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRP 146 (455) Q Consensus 73 ~g~lf~k~p~i-----~~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rP 146 (455) ++-+-+-|..+ ......+..++. ... ...+-.+|.+.+....+.+|-+|+++.+...+.+. T Consensus 65 a~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN-~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~------------ 131 (422) T protein:vir:13 65 AESIGKLSLKIYKDKEEYKEHELYYLLRYKPN-PLMSSINFWKCLETQRTLKGNAYAYIERDRKGKII------------ 131 (422) T ss_pred HHhhhhCceEEEecCcccccchHHHHHhhhcc-cCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE------------ Confidence 55544444443 111223444443 222 23567899999999999999999999776544321 Q ss_pred eEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCC Q lcl|NC_020844. 147 YWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKG 226 (455) Q Consensus 147 y~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~ 226 (455) -+..++|..|.-+..+ .+...... ..+.++....+....+. +-..|.+ ..+ ... T Consensus 132 ~L~~i~~~~v~~~~~~-~~~~~~~~----------------~~~y~~~~~~g~~~~~~-----~~eiih~---~~~-~~~ 185 (422) T protein:vir:13 132 GLYPINSDNVTKIIDD-DNFLSSLS----------------KVWYVVTDKNGKEHKLL-----PDEMLHF---IGD-ITL 185 (422) T ss_pred EEEEECCcceEEEEcC-Ccceeccc----------------eEEEEEEeCCCeEEEEc-----ccceEEE---cCC-CCC Confidence 3556666665422211 11000000 00111111111111110 1122222 211 122 Q ss_pred CcccCcchHHHHHHHHHHHHHhHHHHHHHH-HHcCCceEEEecC---CCccccccCcc--ceeec---cceeeeccCCCC Q lcl|NC_020844. 227 KKWVFHPPMKDALDLQVALFRKECALEFAE-MMTAFPMLSANGV---EPDTDSEGSPK--PLDTG---PDTVLYAPPNAE 297 (455) Q Consensus 227 ~~~~~~~pL~dla~l~~~hy~~~sd~~~~l-~~~~~p~~~~~G~---~~~~~~~~~~~--~~~~g---~~~~~~lp~~~~ 297 (455) +...+.+|+.-+... +.......++...+ ...+.|-.+++-- +++........ ...-| .+.+..++.+ T Consensus 186 ~~~~G~s~~~~~~~~-i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g-- 262 (422) T protein:vir:13 186 DGLIGIKPLDYLRCT-IENGRATQEFINKFFKNGLSIKGIVQYVGDLDEKAKKIFKKEFESMSNGLENAHSISLLPFG-- 262 (422) T ss_pred CCcccccHHHHHHHH-HHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCC-- Confidence 344578887766653 44455455554443 4445788777631 11111111110 01112 2345556532 Q ss_pred ccccceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccC-CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 298 GQHGSWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTA-QSGNLTRISAAQAASKGNSAVQQWAAGLKDALENAL 373 (455) Q Consensus 298 ~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~-~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l 373 (455) .++.=++.++...+ +.+..+-...+|+.+ ...++.. ..++-+..+. ....-....|.-++.+++++++.-| T Consensus 263 ---~~~~~l~~~~~d~q-~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~-~~~~f~~~~l~P~~~~ie~~l~~~L 337 (422) T protein:vir:13 263 ---YQFQPISLSMADAQ-FLENSKLTKRELAATFGMKSYHLNDLERATFNNLTE-QQKDFYVTTLQSSLTVYEQEIQDKL 337 (422) T ss_pred ---ceeeeccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHH-HHHHHHHHHHHHHHHHHHHHHHHhh Confidence 23433444444333 223333344444432 3333322 1222121111 1111234445666666666665432 Q ss_pred -HHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHH--------HHHHH Q lcl|NC_020844. 374 -YITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPE--------EESER 444 (455) Q Consensus 374 -~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e--------~e~~r 444 (455) .-... ..+..|+++.+-..+......++++.++++.|++|.-.+++.+ |+-| .-.-+ ..++. T Consensus 338 l~~~~~-----~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~---gl~p-~~ggD~~~~~~n~~~l~~ 408 (422) T protein:vir:13 338 FSQYET-----LQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRE---NLPP-VEGGDRLLVNGNMIPIEM 408 (422) T ss_pred CChhhh-----cCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCC-CCCcCeeeeccCccchhh Confidence 11111 1234455543322222234557778889999999999888765 3322 10000 01122 Q ss_pred HHhhcccCCCC Q lcl|NC_020844. 445 LISELPGGIEE 455 (455) Q Consensus 445 i~~e~~~~l~~ 455 (455) +.++...+-++ T Consensus 409 ~~~~~~~~g~~ 419 (422) T protein:vir:13 409 AGEQYKKGGEK 419 (422) T ss_pred cccccccCCCc Confidence 22222222222 No 201 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=67.44 E-value=0.25 Score=23.86 Aligned_cols=369 Identities=12% Similarity=0.005 Sum_probs=146.6 Q ss_pred CCCCCCCccCHHHHHH---HH-HHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHH--HHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAM---SD-YLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRY--DLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~v~~~~p~y~~~---~~-~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y--~~rl~ra~~~n~~~~~v~~~~g 74 (455) |- +.++. +-... .. -+..+..+.+ +.+.....| ...++.+.++..+..+-+++++ T Consensus 1 Mg---~f~~~-~~r~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~ 61 (416) T protein:vir:81 1 MG---IFYKN-EKRDLQYNEDDLQMMVQTLPG---------------FQGTKLRQYKDIEAIRHSDIFTAVMMIASDLAR 61 (416) T ss_pred CC---ccccc-ccccccCCCcchhHHHHHhcc---------------ccccCccccchhhhhcchHHHHHHHHHHHhhcc Confidence 55 22111 00000 00 0111211111 111111111 0112222222233333344444 Q ss_pred hhhcCCceecc-----CCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceE Q lcl|NC_020844. 75 KPFGREVEVTE-----GTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYW 148 (455) Q Consensus 75 ~lf~k~p~i~~-----~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~ 148 (455) . |..+.. ....+..++. .-. ...+-.+|.+.+....+.+|-+|+++..+..+.+. -+ T Consensus 62 ~----p~~~~~~~~~~~~~~~~~lL~~~PN-~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~------------~L 124 (416) T protein:vir:81 62 M----PIRVTVNGQINYSDRIVNLLNTRPN-PMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPM------------NL 124 (416) T ss_pred C----ceEEecCccccccchHHHHHhcccc-cCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE------------EE Confidence 3 333311 1122334432 222 24567899999999999999999999766544321 14 Q ss_pred EEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCc Q lcl|NC_020844. 149 SQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKK 228 (455) Q Consensus 149 ~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~ 228 (455) ..++|..|.-+.. . +|+ +.+ +.......+.... ...+...| +-|.+.+. +. T Consensus 125 ~~i~~~~v~v~~~-~-~g~--~~~------------------~~~~~~~~~~~~~---~~~~~~ev--ihir~~~~--d~ 175 (416) T protein:vir:81 125 TFRKTSEIELKSD-A-RGR--LYY------------------FHQRIDSNGNNIE---RNVKFEDM--LDIKFYSL--DG 175 (416) T ss_pred EEEcCceeEEEEC-C-Ccc--EEE------------------EEEEecCCCceeE---EEEccccE--EEeccCCC--CC Confidence 4555555522111 1 111 000 0001111110000 00011111 22232222 23 Q ss_pred ccCcchHHHHHHHHHHHHHhHHHHHHHH-HHcCCceEEEecCCCccccccC----c--cceeecc---ceeeeccCCCCc Q lcl|NC_020844. 229 WVFHPPMKDALDLQVALFRKECALEFAE-MMTAFPMLSANGVEPDTDSEGS----P--KPLDTGP---DTVLYAPPNAEG 298 (455) Q Consensus 229 ~~~~~pL~dla~l~~~hy~~~sd~~~~l-~~~~~p~~~~~G~~~~~~~~~~----~--~~~~~g~---~~~~~lp~~~~~ 298 (455) ..+.+|+.-+.. .+.......++...+ ...+.|-.+++=-....+++.. . +...-|. +.++.|+. T Consensus 176 ~~G~s~i~~~~~-~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~---- 250 (416) T protein:vir:81 176 INGLSLLDTLSR-TIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDE---- 250 (416) T ss_pred ccccCHHHHHHH-HHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecCC---- Confidence 467788766554 445555555655544 4456677776511111111111 0 1112232 23555653 Q ss_pred cccceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 299 QHGSWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYI 375 (455) Q Consensus 299 ~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~ 375 (455) +.++.-++.++..++ +.+..+....+|+.+ ...++.....+.+.++....+ . ..|.-++.+++.+++..| T Consensus 251 -g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~--~-~~l~P~~~~ie~~ln~~l-- 323 (416) T protein:vir:81 251 -SMTFDQLEVDTEVLK-LIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDY--L-STLKPYITCVCAELNFKF-- 323 (416) T ss_pred -CceeEeccCCHHHHH-HHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHHHHHHH--H-HHHHHHHHHHHHHHhhhc-- Confidence 223443444444333 233344444455443 223332222222323332222 2 246666667777666543 Q ss_pred HHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHH----------HHHH Q lcl|NC_020844. 376 TNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEE----------SERL 445 (455) Q Consensus 376 ~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e----------~~ri 445 (455) ..+ ..+..++++.+=.........+.++.++.+.|+++.-.+++.+ |+-| .-+.+.. ++.+ T Consensus 324 ~~~-----~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~---gl~p-~~~gd~~~~~~~~n~~~~~~~ 394 (416) T protein:vir:81 324 NDE-----YVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRD---GLAP-IPGGNGSIHRVDLNHVNIELV 394 (416) T ss_pred ccc-----ccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCC-CCCCCcceEeecccccccccc Confidence 111 1233444443222222335567778889999999999988876 3422 1111110 0111 Q ss_pred Hhh--cccCC--------CC Q lcl|NC_020844. 446 ISE--LPGGI--------EE 455 (455) Q Consensus 446 ~~e--~~~~l--------~~ 455 (455) .+. ...+. ++ T Consensus 395 ~~~~~~~~~~~~~~~kgGe~ 414 (416) T protein:vir:81 395 DEYQMNKSRATDKKLKGGEE 414 (416) T ss_pred cccCcccccccccccCCCCC Confidence 100 00111 11 No 202 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=67.44 E-value=0.25 Score=23.86 Aligned_cols=369 Identities=12% Similarity=0.005 Sum_probs=146.6 Q ss_pred CCCCCCCccCHHHHHH---HH-HHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHH--HHHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAM---SD-YLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRY--DLRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~v~~~~p~y~~~---~~-~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y--~~rl~ra~~~n~~~~~v~~~~g 74 (455) |- +.++. +-... .. -+..+..+.+ +.+.....| ...++.+.++..+..+-+++++ T Consensus 1 Mg---~f~~~-~~r~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~ 61 (416) T protein:vir:45 1 MG---IFYKN-EKRDLQYNEDDLQMMVQTLPG---------------FQGTKLRQYKDIEAIRHSDIFTAVMMIASDLAR 61 (416) T ss_pred CC---ccccc-ccccccCCCcchhHHHHHhcc---------------ccccCccccchhhhhcchHHHHHHHHHHHhhcc Confidence 55 22111 00000 00 0111211111 111111111 0112222222233333344444 Q ss_pred hhhcCCceecc-----CCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceE Q lcl|NC_020844. 75 KPFGREVEVTE-----GTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYW 148 (455) Q Consensus 75 ~lf~k~p~i~~-----~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~ 148 (455) . |..+.. ....+..++. .-. ...+-.+|.+.+....+.+|-+|+++..+..+.+. -+ T Consensus 62 ~----p~~~~~~~~~~~~~~~~~lL~~~PN-~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~------------~L 124 (416) T protein:vir:45 62 M----PIRVTVNGQINYSDRIVNLLNTRPN-PMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPM------------NL 124 (416) T ss_pred C----ceEEecCccccccchHHHHHhcccc-cCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE------------EE Confidence 3 333311 1122334432 222 24567899999999999999999999766544321 14 Q ss_pred EEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCc Q lcl|NC_020844. 149 SQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKK 228 (455) Q Consensus 149 ~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~ 228 (455) ..++|..|.-+.. . +|+ +.+ +.......+.... ...+...| +-|.+.+. +. T Consensus 125 ~~i~~~~v~v~~~-~-~g~--~~~------------------~~~~~~~~~~~~~---~~~~~~ev--ihir~~~~--d~ 175 (416) T protein:vir:45 125 TFRKTSEIELKSD-A-RGR--LYY------------------FHQRIDSNGNNIE---RNVKFEDM--LDIKFYSL--DG 175 (416) T ss_pred EEEcCceeEEEEC-C-Ccc--EEE------------------EEEEecCCCceeE---EEEccccE--EEeccCCC--CC Confidence 4555555522111 1 111 000 0001111110000 00011111 22232222 23 Q ss_pred ccCcchHHHHHHHHHHHHHhHHHHHHHH-HHcCCceEEEecCCCccccccC----c--cceeecc---ceeeeccCCCCc Q lcl|NC_020844. 229 WVFHPPMKDALDLQVALFRKECALEFAE-MMTAFPMLSANGVEPDTDSEGS----P--KPLDTGP---DTVLYAPPNAEG 298 (455) Q Consensus 229 ~~~~~pL~dla~l~~~hy~~~sd~~~~l-~~~~~p~~~~~G~~~~~~~~~~----~--~~~~~g~---~~~~~lp~~~~~ 298 (455) ..+.+|+.-+.. .+.......++...+ ...+.|-.+++=-....+++.. . +...-|. +.++.|+. T Consensus 176 ~~G~s~i~~~~~-~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~---- 250 (416) T protein:vir:45 176 INGLSLLDTLSR-TIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDE---- 250 (416) T ss_pred ccccCHHHHHHH-HHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecCC---- Confidence 467788766554 445555555655544 4456677776511111111111 0 1112232 23555653 Q ss_pred cccceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 299 QHGSWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYI 375 (455) Q Consensus 299 ~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~ 375 (455) +.++.-++.++..++ +.+..+....+|+.+ ...++.....+.+.++....+ . ..|.-++.+++.+++..| T Consensus 251 -g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~--~-~~l~P~~~~ie~~ln~~l-- 323 (416) T protein:vir:45 251 -SMTFDQLEVDTEVLK-LIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDY--L-STLKPYITCVCAELNFKF-- 323 (416) T ss_pred -CceeEeccCCHHHHH-HHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHHHHHHH--H-HHHHHHHHHHHHHHhhhc-- Confidence 223443444444333 233344444455443 223332222222323332222 2 246666667777666543 Q ss_pred HHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHH----------HHHH Q lcl|NC_020844. 376 TNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEE----------SERL 445 (455) Q Consensus 376 ~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e----------~~ri 445 (455) ..+ ..+..++++.+=.........+.++.++.+.|+++.-.+++.+ |+-| .-+.+.. ++.+ T Consensus 324 ~~~-----~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~---gl~p-~~~gd~~~~~~~~n~~~~~~~ 394 (416) T protein:vir:45 324 NDE-----YVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRD---GLAP-IPGGNGSIHRVDLNHVNIELV 394 (416) T ss_pred ccc-----ccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCC-CCCCCcceEeecccccccccc Confidence 111 1233444443222222335567778889999999999988876 3422 1111110 0111 Q ss_pred Hhh--cccCC--------CC Q lcl|NC_020844. 446 ISE--LPGGI--------EE 455 (455) Q Consensus 446 ~~e--~~~~l--------~~ 455 (455) .+. ...+. ++ T Consensus 395 ~~~~~~~~~~~~~~~kgGe~ 414 (416) T protein:vir:45 395 DEYQMNKSRATDKKLKGGEE 414 (416) T ss_pred cccCcccccccccccCCCCC Confidence 100 00111 11 No 203 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=66.24 E-value=0.27 Score=23.70 Aligned_cols=390 Identities=8% Similarity=0.010 Sum_probs=137.0 Q ss_pred CCC-----------CCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHH Q lcl|NC_020844. 1 MPE-----------QDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVL 69 (455) Q Consensus 1 m~~-----------~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v 69 (455) |.+ .+....++.+..-...-+-+...+.|.+..+. ++.+......+..++..---.++++.+| T Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~~------~p~~~~~~~~~~~~l~~~~~npiv~~~I 100 (576) T protein:vir:96 27 IDDGLQANIRNIEEKSKELNKSLYGKQQAYAEPFLEVMDTNPEFRT------KRSYMKNSDNLHDVLKQFGNNPILNAII 100 (576) T ss_pred cccChhHHHHHhhhhhhhhccccCCccchhhcceeeeeecCCCccc------cCcchhhhhhhHHHHHHhhcCHHHHHHH Confidence 211 00111111111111110111111222222111 1111111122223332110134556666 Q ss_pred HHHhhhhhc------------------CCceecc-C-----CHHHHHHHHhhccC----CCCHHHHHHHHHHHHHhhCcE Q lcl|NC_020844. 70 EGLASKPFG------------------REVEVTE-G-----TGPSEDLLEDIDGR----GNNLTTFAGQLFFQGVADSIS 121 (455) Q Consensus 70 ~~~~g~lf~------------------k~p~i~~-~-----~~~l~~~~~d~d~~----G~~l~~~~~~~~~~al~~G~~ 121 (455) +.++.-+-. +...... . ...+..++.+.... -.+..+|++.+....+.+|-+ T Consensus 101 ~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~dlll~Gna 180 (576) T protein:vir:96 101 LTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIVRDTYTYDQV 180 (576) T ss_pred HHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHHHHHHhcCCe Confidence 665543321 0111100 0 01123333322211 136789999999999999999 Q ss_pred EEEEeeCCCCcchhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEE-eCCcE Q lcl|NC_020844. 122 WIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRK-NDAGI 200 (455) Q Consensus 122 ~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~-~~~g~ 200 (455) |+++-+...+.+... -+..++|..|.-..... +.. +.. ...|.. ..++. T Consensus 181 ~~~i~~~rd~~g~~~----------~L~pl~p~~V~v~~~~d--g~~----------------~~~--~~~~~~~~~~~~ 230 (576) T protein:vir:96 181 NFEKVFNKKNATTMD----------KFIAVDPSTIFYATDKN--GKI----------------IKG--GKRFVQVINKKV 230 (576) T ss_pred EEEEEEecCCCCceE----------EEEEeCCceeEEEECCC--Cce----------------eee--eeEEEEecCCce Confidence 998755443322110 14444554442111100 000 000 001111 11111 Q ss_pred EEEeccCCCCCCceeEEEEEecccC--CCcccCcchHHHHHHHHHHHHHhHHHHHHH-HHHcCCceEEEe--cC-CCcc- Q lcl|NC_020844. 201 WEVDEFGRITIGVVPMVPFITGRRK--GKKWVFHPPMKDALDLQVALFRKECALEFA-EMMTAFPMLSAN--GV-EPDT- 273 (455) Q Consensus 201 ~~~~~~g~~~l~~vP~V~f~~~~~~--~~~~~~~~pL~dla~l~~~hy~~~sd~~~~-l~~~~~p~~~~~--G~-~~~~- 273 (455) ....... ..|- ++.+... .....+.+||..++... ........+... ....+.|-.+++ |- ...+ T Consensus 231 ~~~~~~~----dii~---~~~~~~~d~~~~~~G~Spi~~a~~~i-~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e 302 (576) T protein:vir:96 231 VASFTSR----EMAM---GIRNPRTELSSSGYGLSEVEIAMKQF-IAYNNTETFNDRFFSHGGTTRGILQIKSEQQQSQR 302 (576) T ss_pred EEEeccc----ceEE---EeecCCCCcccCcccccHHHHHHHHH-HHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHH Confidence 1111100 1122 2222221 11335788887666543 334444444443 345567876664 32 1111 Q ss_pred -ccccCc--cceeeccce----eeeccCCCCccccceeEEeeCchhHH-HHHHHHHHHHHHHHHh---hhhhccC-CCcc Q lcl|NC_020844. 274 -DSEGSP--KPLDTGPDT----VLYAPPNAEGQHGSWNFVEPATESLK-FLAEQIESVSKEIREL---GRQPLTA-QSGN 341 (455) Q Consensus 274 -~~~~~~--~~~~~g~~~----~~~lp~~~~~~~~~~~~le~~~~~l~-~~~~~l~~~~~~m~~~---g~~~l~~-~~~~ 341 (455) ....+. +...-|+++ ++.++. .++|..++.++-. .+.+..+-..++|+.+ ...++.- ..++ T Consensus 303 ~~~~lr~~~~~~~~G~~nag~~p~vl~~-------G~~~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~ 375 (576) T protein:vir:96 303 ALENFKREWKSSFSGINGSWQVPVVMAD-------DIKFVNMTPTANDMQFEKWLTYLINIISALYGIDPAEIGFPNRGG 375 (576) T ss_pred HHHHHHHHHHHHhccccccccceeecCC-------CceEEeccCChhhHHHHHHHHHhHHHHHHHhCCCHHHcccccccc Confidence 111111 111223322 234442 2555554433222 2244445555555443 2222311 1111 Q ss_pred hh----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCC-HHHHHHHHH Q lcl|NC_020844. 342 LT----------RISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGE-EQTPGYLLT 410 (455) Q Consensus 342 ~s----------a~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~-~~~~~al~~ 410 (455) .+ +........-....|.-+...++.+++..| +..+ + ....++ |...... ..+..++.. T Consensus 376 ~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~L--l~~~-~---~~~~~~----f~r~d~~~~~e~~~~~~ 445 (576) T protein:vir:96 376 ATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDLINTHI--ISEY-S---DKYVFQ----FVGGDTKSELDKIKILQ 445 (576) T ss_pred ccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhh--chhc-c---CceEEE----eccCCHHHHHHHHHHHH Confidence 10 111111112233446666666666666533 1111 1 223333 3322221 122334445 Q ss_pred HHHcCCCCHHHHHHHHHhcCCCCCCCC-----------------------HHHHHHHHHhhcc--cCCCC Q lcl|NC_020844. 411 MRQNGDISREGLWMEFQRLGELSDNFD-----------------------PEEESERLISELP--GGIEE 455 (455) Q Consensus 411 ~~~~G~is~et~~~~l~~~~vl~~~~d-----------------------~e~e~~ri~~e~~--~~l~~ 455 (455) +...|+++.-.+++.+ |+-| .-. .+...+++..-.+ .+-+. T Consensus 446 ~~~~G~lT~NE~R~~~---gl~p-iegGD~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~ 511 (576) T protein:vir:96 446 EEVKTYKTVNEARKEK---GLKP-IEGGDVLLDGSFIQSMSLNTQKEQYEDTKQKERFDMIQQFLNSPDD 511 (576) T ss_pred HHhcCccCHHHHHHHh---CCCC-CCCcceeccccccccccccccCCCCCCccccccccccccccCCCCC Confidence 5667999998887765 3321 100 0000111100000 00000 No 204 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=63.63 E-value=0.31 Score=23.34 Aligned_cols=262 Identities=9% Similarity=-0.014 Sum_probs=103.7 Q ss_pred Hhhh---hhcCCceeccCCHHHHHHH-HhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCce Q lcl|NC_020844. 72 LASK---PFGREVEVTEGTGPSEDLL-EDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPY 147 (455) Q Consensus 72 ~~g~---lf~k~p~i~~~~~~l~~~~-~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy 147 (455) +++. ++++... .. ..+..++ ..-+ ...+-.+|.+.+....+.+|-+|+++..+..|.+. - T Consensus 1 ia~l~~~~~~~~~~--~~-~~l~~lL~~~PN-~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~------------~ 64 (278) T protein:vir:78 1 MASLPLKMYEDYKV--VN-TEVSDLLTVSPN-NSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPS------------K 64 (278) T ss_pred CccceeEEEecCcc--cc-cHHHHHHHhcCC-CCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEE------------E Confidence 2222 1222221 12 2344444 2222 34678889999999999999999999766544321 2 Q ss_pred EEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCC Q lcl|NC_020844. 148 WSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGK 227 (455) Q Consensus 148 ~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~ 227 (455) +..++|..|.- ..+..++ .+ ++.+.. .+|...... -.. ++-|.... ..+ T Consensus 65 l~~l~~~~v~v-~~~~~~~--~~-------------------~y~~~~-~~g~~~~~~-----~~e--vih~~~~~-~~~ 113 (278) T protein:vir:78 65 LFLLNPDVVEM-LIENQSR--EL-------------------YYSIHA-ATGNKLIVH-----NMD--MLHFKHIV-ASN 113 (278) T ss_pred EEEECCceeEE-EEcCCCc--eE-------------------EEEEEc-CCceEEEEc-----ccc--EEEECCCC-CCC Confidence 44455554421 0000000 00 011111 111111111 111 22232221 223 Q ss_pred cccCcchHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccCc---c---ceeeccceeeeccCCCCcccc Q lcl|NC_020844. 228 KWVFHPPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGSP---K---PLDTGPDTVLYAPPNAEGQHG 301 (455) Q Consensus 228 ~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~~---~---~~~~g~~~~~~lp~~~~~~~~ 301 (455) ...+.+|+.-+.... ...+....+ ....+...|..++.. ...-.++... . ...-+++.++.+|. +. T Consensus 114 ~~~G~s~~~~~~~~i-~~~~~~~~~-~~~~~~~~~~~i~~~-~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~-----g~ 185 (278) T protein:vir:78 114 MVQGISPIDVLKNTT-DFDNAVRTF-NLTEMQKPDSFMLKY-GSNVGKEKRQQVLEDFKQYYEENGGILFQEP-----GV 185 (278) T ss_pred CeeeccHHHHHHHHH-HHHHHHHHH-HHHHhcCCCcEEEEe-CCCCCHHHHHHHHHHHHHHhccCCCceecCC-----Cc Confidence 345778876665432 222222222 233444445444432 1111111100 0 01123345566653 23 Q ss_pred ceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCC-CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 302 SWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQ-SGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITN 377 (455) Q Consensus 302 ~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~-~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a 377 (455) ++.-+..++...+ ..+.++...++|+.. ...++... .++-+..+ .....-....|.-++..++++++..| T Consensus 186 ~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~-~~~~~~~~~~l~P~~~~i~~~ln~~L---- 259 (278) T protein:vir:78 186 EIEPLPKKYVSED-IVASENLTRERVANVFQLPSVFLNARSNTNFAKNE-ELNRFYLQHTLLPIVKQYEEEFNRKL---- 259 (278) T ss_pred eEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHH-HHHHHHHHHHHHHHHHHHHHHHHhhc---- Confidence 4444444444332 234444455555442 22233221 12222111 11112233446666666666666543 Q ss_pred HHcCCC--CCceEEEEecccCCCCC Q lcl|NC_020844. 378 LWMDAP--DTNPEADVYTDFRADTG 400 (455) Q Consensus 378 ~~~g~~--~~~~~v~v~~df~~~~~ 400 (455) +... ..+..|+++ .+.+ T Consensus 260 --~~~~e~~~g~~~~f~----~~~l 278 (278) T protein:vir:78 260 --LTKTDREKIGILNLT----LNLI 278 (278) T ss_pred --CChhHhcCCceEEEe----cccC Confidence 1111 122334433 3333 No 205 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=60.62 E-value=0.37 Score=22.96 Aligned_cols=370 Identities=13% Similarity=0.042 Sum_probs=146.0 Q ss_pred CCCCCCCccCHHHHHHH----HHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHH--HHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMS----DYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYD--LRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~----~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~--~rl~ra~~~n~~~~~v~~~~g 74 (455) |---.+.++. +-.... +....+.. +|-+.+.+-..|. .-++.+.++-.+..+-+++++ T Consensus 23 ~~~~~lf~~~-e~R~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~al~~~~V~~cv~~Ia~~iA~ 86 (441) T protein:vir:94 23 LVVVGIFYKN-EKRDLQYNEDDLQMMVQT---------------LPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLAR 86 (441) T ss_pred hhcccccccc-ccccccCCCcchHHHHHH---------------hcccCcccccccchhhhhccHHHHHHHHHHHHhhcc Confidence 1100011000 000000 00000111 1111111111111 112222222223333333333 Q ss_pred hhhcCCceec-----cCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceE Q lcl|NC_020844. 75 KPFGREVEVT-----EGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYW 148 (455) Q Consensus 75 ~lf~k~p~i~-----~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~ 148 (455) . |..+. .....+-.++. .-. ...+-.+|.+.+....+.+|-+|+++.....|.+. -+ T Consensus 87 l----p~~~~~~~~~~~~~~~~~lL~~~PN-~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~------------~L 149 (441) T protein:vir:94 87 M----PIRVTVNGQINYSDRIVNLLNTRPN-PMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPM------------NL 149 (441) T ss_pred C----ceeeecCccccccchHHHHHhcccC-cCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE------------EE Confidence 3 33321 11222334442 222 23567889999999999999999999765544321 14 Q ss_pred EEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCc--EEEEeccCCCCCCceeEEEEEecccCC Q lcl|NC_020844. 149 SQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAG--IWEVDEFGRITIGVVPMVPFITGRRKG 226 (455) Q Consensus 149 ~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g--~~~~~~~g~~~l~~vP~V~f~~~~~~~ 226 (455) ..++|..|.-+.. ..+ . +.+ ..+.....+ ..... ..-.+|. |...+. T Consensus 150 ~~i~~~~v~v~~d-~~g-~--~~~------------------~~~~~~~~~~~~~~~~----~~~dvih---~k~~~~-- 198 (441) T protein:vir:94 150 TFRKTSEIELKSD-ARG-R--LYY------------------FHQRIDSNGNNIERNV----KFEDMLD---IKFYSL-- 198 (441) T ss_pred EEEcCceeEEEEC-CCc-c--EEE------------------EEEEeccCCceeEEEE----ccccEEE---eccCCC-- Confidence 4555555532111 111 0 000 001111110 00000 0111222 222222 Q ss_pred CcccCcchHHHHHHHHHHHHHhHHHHHHHH-HHcCCceEEEe--cCC-Cccccc-cCc--cceeecc---ceeeeccCCC Q lcl|NC_020844. 227 KKWVFHPPMKDALDLQVALFRKECALEFAE-MMTAFPMLSAN--GVE-PDTDSE-GSP--KPLDTGP---DTVLYAPPNA 296 (455) Q Consensus 227 ~~~~~~~pL~dla~l~~~hy~~~sd~~~~l-~~~~~p~~~~~--G~~-~~~~~~-~~~--~~~~~g~---~~~~~lp~~~ 296 (455) +...+.+||.-+.. .+.......++...+ ...+.|..+++ |-. ++...+ .+. +...-|. +.++.||. T Consensus 199 dg~~G~spl~~~~~-~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~-- 275 (441) T protein:vir:94 199 DGINGLSLLDTLSR-TIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDE-- 275 (441) T ss_pred CCccccCHHHHHHH-HHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCC-- Confidence 23467888766654 445555555554444 45567877776 321 111101 010 1122232 34566653 Q ss_pred CccccceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 297 EGQHGSWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENAL 373 (455) Q Consensus 297 ~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l 373 (455) +.++.-++.++..++ +.+..+....+|+.+ ...++.....+.+.++....+. ..|.-+...++.+++..| T Consensus 276 ---G~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~~~~---~tl~P~~~~ie~eln~kl 348 (441) T protein:vir:94 276 ---SMTFDQLEVDTEVLK-LIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDYL---STLKPYITCVCAELNFKF 348 (441) T ss_pred ---CceEEEccCChhHHH-HHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHHHHH---HHHHHHHHHHHHHHhhhc Confidence 224444444444433 233344444555443 2233422222233333333222 246667777777776543 Q ss_pred HHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHH-------------H Q lcl|NC_020844. 374 YITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPE-------------E 440 (455) Q Consensus 374 ~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e-------------~ 440 (455) ..+ ..+..++++.+=.........++++.++.+.|+++....++.+ |+-| .-+.+ + T Consensus 349 --~~~-----~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~---gl~P-i~ggd~~~~~~~~n~~~~~ 417 (441) T protein:vir:94 349 --NDE-----YVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRD---GLAP-IPGGNGSIHRVDLNHVNIE 417 (441) T ss_pred --ccc-----ccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCC-CCCCCcceEeecccccccc Confidence 111 1233455443222333335567778888999999999888765 3322 11111 1 Q ss_pred HHHHHHhhc----c---cCCCC Q lcl|NC_020844. 441 ESERLISEL----P---GGIEE 455 (455) Q Consensus 441 e~~ri~~e~----~---~~l~~ 455 (455) ..+..+... + .|-++ T Consensus 418 ~~~~~~~~~~~~~~~~~kgGe~ 439 (441) T protein:vir:94 418 LVDEYQMNKSRATDKKLKGGEE 439 (441) T ss_pred cccccccccccccccccCCCCC Confidence 010000000 0 01111 No 206 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=60.62 E-value=0.37 Score=22.96 Aligned_cols=370 Identities=13% Similarity=0.042 Sum_probs=146.0 Q ss_pred CCCCCCCccCHHHHHHH----HHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHH--HHhhccccCChHHHHHHHHhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMS----DYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYD--LRKSVAKMTNVFRDVLEGLAS 74 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~----~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~--~rl~ra~~~n~~~~~v~~~~g 74 (455) |---.+.++. +-.... +....+.. +|-+.+.+-..|. .-++.+.++-.+..+-+++++ T Consensus 23 ~~~~~lf~~~-e~R~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~al~~~~V~~cv~~Ia~~iA~ 86 (441) T protein:vir:79 23 LVVVGIFYKN-EKRDLQYNEDDLQMMVQT---------------LPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLAR 86 (441) T ss_pred hhcccccccc-ccccccCCCcchHHHHHH---------------hcccCcccccccchhhhhccHHHHHHHHHHHHhhcc Confidence 1100011000 000000 00000111 1111111111111 112222222223333333333 Q ss_pred hhhcCCceec-----cCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceE Q lcl|NC_020844. 75 KPFGREVEVT-----EGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYW 148 (455) Q Consensus 75 ~lf~k~p~i~-----~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~ 148 (455) . |..+. .....+-.++. .-. ...+-.+|.+.+....+.+|-+|+++.....|.+. -+ T Consensus 87 l----p~~~~~~~~~~~~~~~~~lL~~~PN-~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~------------~L 149 (441) T protein:vir:79 87 M----PIRVTVNGQINYSDRIVNLLNTRPN-PMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPM------------NL 149 (441) T ss_pred C----ceeeecCccccccchHHHHHhcccC-cCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE------------EE Confidence 3 33321 11222334442 222 23567889999999999999999999765544321 14 Q ss_pred EEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCc--EEEEeccCCCCCCceeEEEEEecccCC Q lcl|NC_020844. 149 SQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAG--IWEVDEFGRITIGVVPMVPFITGRRKG 226 (455) Q Consensus 149 ~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g--~~~~~~~g~~~l~~vP~V~f~~~~~~~ 226 (455) ..++|..|.-+.. ..+ . +.+ ..+.....+ ..... ..-.+|. |...+. T Consensus 150 ~~i~~~~v~v~~d-~~g-~--~~~------------------~~~~~~~~~~~~~~~~----~~~dvih---~k~~~~-- 198 (441) T protein:vir:79 150 TFRKTSEIELKSD-ARG-R--LYY------------------FHQRIDSNGNNIERNV----KFEDMLD---IKFYSL-- 198 (441) T ss_pred EEEcCceeEEEEC-CCc-c--EEE------------------EEEEeccCCceeEEEE----ccccEEE---eccCCC-- Confidence 4555555532111 111 0 000 001111110 00000 0111222 222222 Q ss_pred CcccCcchHHHHHHHHHHHHHhHHHHHHHH-HHcCCceEEEe--cCC-Cccccc-cCc--cceeecc---ceeeeccCCC Q lcl|NC_020844. 227 KKWVFHPPMKDALDLQVALFRKECALEFAE-MMTAFPMLSAN--GVE-PDTDSE-GSP--KPLDTGP---DTVLYAPPNA 296 (455) Q Consensus 227 ~~~~~~~pL~dla~l~~~hy~~~sd~~~~l-~~~~~p~~~~~--G~~-~~~~~~-~~~--~~~~~g~---~~~~~lp~~~ 296 (455) +...+.+||.-+.. .+.......++...+ ...+.|..+++ |-. ++...+ .+. +...-|. +.++.||. T Consensus 199 dg~~G~spl~~~~~-~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~-- 275 (441) T protein:vir:79 199 DGINGLSLLDTLSR-TIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDE-- 275 (441) T ss_pred CCccccCHHHHHHH-HHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCC-- Confidence 23467888766654 445555555554444 45567877776 321 111101 010 1122232 34566653 Q ss_pred CccccceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 297 EGQHGSWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENAL 373 (455) Q Consensus 297 ~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l 373 (455) +.++.-++.++..++ +.+..+....+|+.+ ...++.....+.+.++....+. ..|.-+...++.+++..| T Consensus 276 ---G~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~~~~---~tl~P~~~~ie~eln~kl 348 (441) T protein:vir:79 276 ---SMTFDQLEVDTEVLK-LIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDYL---STLKPYITCVCAELNFKF 348 (441) T ss_pred ---CceEEEccCChhHHH-HHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHHHHH---HHHHHHHHHHHHHHhhhc Confidence 224444444444433 233344444555443 2233422222233333333222 246667777777776543 Q ss_pred HHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHH-------------H Q lcl|NC_020844. 374 YITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPE-------------E 440 (455) Q Consensus 374 ~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e-------------~ 440 (455) ..+ ..+..++++.+=.........++++.++.+.|+++....++.+ |+-| .-+.+ + T Consensus 349 --~~~-----~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~---gl~P-i~ggd~~~~~~~~n~~~~~ 417 (441) T protein:vir:79 349 --NDE-----YVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRD---GLAP-IPGGNGSIHRVDLNHVNIE 417 (441) T ss_pred --ccc-----ccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCC-CCCCCcceEeecccccccc Confidence 111 1233455443222333335567778888999999999888765 3322 11111 1 Q ss_pred HHHHHHhhc----c---cCCCC Q lcl|NC_020844. 441 ESERLISEL----P---GGIEE 455 (455) Q Consensus 441 e~~ri~~e~----~---~~l~~ 455 (455) ..+..+... + .|-++ T Consensus 418 ~~~~~~~~~~~~~~~~~kgGe~ 439 (441) T protein:vir:79 418 LVDEYQMNKSRATDKKLKGGEE 439 (441) T ss_pred cccccccccccccccccCCCCC Confidence 010000000 0 01111 No 207 >protein:vir:105154 Length: 525 # NCBI annotation: conserved phage-related protein # Family: family:all:6660 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398597;genbank:gi:80159853;genbank:GeneID:3772992 Probab=56.79 E-value=0.45 Score=22.49 Aligned_cols=411 Identities=13% Similarity=0.100 Sum_probs=169.9 Q ss_pred CCC-----------CCCCccCHHHHHHHHH----HHHHHHHh----cCH-HHHHHhhhh------cCCCCCCCCHHHHH- Q lcl|NC_020844. 1 MPE-----------QDPTKRSADAEAMSDY----LQTVDDVL----EGV-TGMRRNFKR------YVKQFPHEQNKRYD- 53 (455) Q Consensus 1 m~~-----------~~v~~~~p~y~~~~~~----w~~i~d~~----~G~-~~~r~~g~~------yLpk~~~e~~~~Y~- 53 (455) |.- +.++-..-+|.+|.-. +...+|+. .|= .-|-.+|.. -|--+-. ..+.|. T Consensus 1 ~~~~~~~~~~~~t~~k~~~~~e~~~~~~n~~~~~y~ty~~~~~~f~~gfv~~~~~ng~i~~v~~~~l~~~f~-npd~~~~ 79 (525) T protein:vir:10 1 MTRTKGSKNKSTTIEKQSLQIEQLQEHINELERQYNTYDDVVDAFIDGFVMDLCNNGKIKTVNLDTLQLWFN-NPDKYIN 79 (525) T ss_pred CCCCcCCcccccchhhhhhhHHHHHHHHhhhhhhcchhhhHHHHHHHHHHHHhhcCCceeeeeHHHHHhhhc-ChHHHHH Confidence 331 1244455667777665 33333321 110 111111110 0111111 223453 Q ss_pred HHhhccccCChHHHHHHHHhhhhhcCCc---eeccCCHHHHHHHHhhccCCCCHHHH--HHHHHHHHHh-hCcEEEEEee Q lcl|NC_020844. 54 LRKSVAKMTNVFRDVLEGLASKPFGREV---EVTEGTGPSEDLLEDIDGRGNNLTTF--AGQLFFQGVA-DSISWIRVDY 127 (455) Q Consensus 54 ~rl~ra~~~n~~~~~v~~~~g~lf~k~p---~i~~~~~~l~~~~~d~d~~G~~l~~~--~~~~~~~al~-~G~~~~lVd~ 127 (455) +-...+.||-+.---|.++-..+|+-|| +|... ...+..-+++..-+.+|+.. -+++.++.|. ...++-||-. T Consensus 80 ~i~~l~~y~yi~~~~v~ql~~li~~lp~l~y~i~~~-~~~k~~~~~~s~~n~~l~k~i~hk~ltrdll~q~a~~gtlig~ 158 (525) T protein:vir:10 80 NIVNLLTYYYIIDGNVFQLYDLIFSLPPLDYQIKVL-KRDKDYKEDLSTINLYLEKKIQHKQLTRDLLVQLAHSGTLIGT 158 (525) T ss_pred HHHHHHHHhhhhcchHHHHHHHHHhcCCcceeehhh-hhccchhhHHHHHHHHHHHhHHHHHHHHHHHHHhhccCceeEe Confidence 3444555555555568889999999886 23211 11111112221122344441 1234443332 3344444421 Q ss_pred CCCCcchhhhhHHhccCCceEEEechhhhccceeeccCCeeEEEE-EEEE-eee--eE------EE-EEecCeEEEEEEe Q lcl|NC_020844. 128 PAGQQFRNREEERQAGVRPYWSQVNHSAILEIQSERQGAGERITY-MRMW-EDA--ET------IL-VLRPDDWERWRKN 196 (455) Q Consensus 128 p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~-v~~~-E~~--e~------~~-v~~~~~~~~~~~~ 196 (455) |......||+-++.--.-+ +-+.+.+|.|+.+- ..+- |.. ++ .. +.+...|+-|..- T Consensus 159 -----------wlg~~~~py~~vf~~~kyv-fp~~r~~g~~v~vid~~~f~~~~~~~r~~~~~~lsp~i~~~~y~~~~~~ 226 (525) T protein:vir:10 159 -----------WLGSKREPYFNVFNNLKYV-FPYGRAKGKMVAVIDLQWFDEMSELERKLTFENLSPLITENKYKKWKEY 226 (525) T ss_pred -----------eecCCCCcchhhhhhhhhh-ccccccCCceEEEEehHHhhhhhHHHHHHHHHhhchhhhhhhhhHHhhc Confidence 1122334776655432211 22345566665421 1111 100 00 10 1122333333321 Q ss_pred CCcEEEEeccCCCCCCceeE---EEEEecc----cCCCcccCcchHHHHHHHHHHHHHhHHHHHHHHH---HcCCceEEE Q lcl|NC_020844. 197 DAGIWEVDEFGRITIGVVPM---VPFITGR----RKGKKWVFHPPMKDALDLQVALFRKECALEFAEM---MTAFPMLSA 266 (455) Q Consensus 197 ~~g~~~~~~~g~~~l~~vP~---V~f~~~~----~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~---~~~~p~~~~ 266 (455) .+ .++....+-..|+ ++.+.+. .+.|..-..|.|.|+. |-+..-|+++++. ..++.++.+ T Consensus 227 ~~-----~~~~~~r~i~LP~e~t~~lr~~tl~rnqrlG~s~vtp~l~dI~-----hk~klrd~EqsIA~kii~a~avLk~ 296 (525) T protein:vir:10 227 NG-----ENEDALRYIMLPISKTLVARIHTLSRNQRLGIPYGTQTLFDIQ-----HKQKLRDLEQSIADKIIKAMAVLKF 296 (525) T ss_pred cc-----ccchhheeeecccceeEEeeecccccCcccCcchhhhHHHHHH-----HHHHHHHHHHHHHHHhhhhheeeee Confidence 11 1111122222232 3333321 2223344566677764 5555666666543 234445555 Q ss_pred ecCCCccccccC--------------ccceeecc-ceeeeccCCCCccccceeEEeeCc--hhHHHHH-HHHHHHHHHHH Q lcl|NC_020844. 267 NGVEPDTDSEGS--------------PKPLDTGP-DTVLYAPPNAEGQHGSWNFVEPAT--ESLKFLA-EQIESVSKEIR 328 (455) Q Consensus 267 ~G~~~~~~~~~~--------------~~~~~~g~-~~~~~lp~~~~~~~~~~~~le~~~--~~l~~~~-~~l~~~~~~m~ 328 (455) .|-.+.+...-. .+....-+ -.++.+|.=+ .+.|=+... .++...+ +.++.-+.-.+ T Consensus 297 gg~~gn~mk~p~~~kqkil~gVk~aleK~~kdK~Gi~vi~~Pdfa-----~~efp~ik~~~~glDg~K~d~I~~DI~~A~ 371 (525) T protein:vir:10 297 RGKDDNDSKVKESAKRKVLAGVKRALEKGVKDKNGIACIAMPDFA-----TFEFPEIKNGDKTLDPKKYDSIDNDITNAT 371 (525) T ss_pred ccccCccccCchHHHHHHHHHHHHHHhcccccccCeEEEecccee-----ecccccccCcccCCCchhhhhhhhhhhhhh Confidence 554433211100 00011100 1233344221 222222221 3333222 23333333334 Q ss_pred HhhhhhccCCCcc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHH Q lcl|NC_020844. 329 ELGRQPLTAQSGN-LTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGY 407 (455) Q Consensus 329 ~~g~~~l~~~~~~-~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~a 407 (455) -++..++.+.++| .|+ +......+..+..|-+..+++.++++.|+ ++. +.+..+-+|-|-+...--..-++. T Consensus 372 GlS~sL~nGdggNyAta---slnld~fykkigVm~e~Iee~y~kL~d~V---l~~-~k~~nyifnydkd~pi~~kkk~d~ 444 (525) T protein:vir:10 372 GISQVLTNGTKGNYASA---KLNLDVFYKKIGVMLEIIEEIYNQLIDII---LGE-EKGCNYIFQYNKDTPIEREKKLDT 444 (525) T ss_pred ccceeeecCCCCceeee---eeeHHHHHHHHHHHHHHHHHHHHHHHhhh---cCc-ccCcceEEecCCCchhhhhhhhhh Confidence 4555566666666 332 12222345566677777888888888877 553 334554444222222112445889 Q ss_pred HHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcccCCCC Q lcl|NC_020844. 408 LLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELPGGIEE 455 (455) Q Consensus 408 l~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~~~l~~ 455 (455) |+++...|.+++-.+ +-.|| +++-+.|+-+-.++.- .|.| T Consensus 445 LIkL~d~g~s~k~vl----dl~gi-s~e~y~E~s~yEtE~l---kl~E 484 (525) T protein:vir:10 445 LIKLEAQGYSAKYVL----DILGI-SSEEYFEESIYEIEKL---KLRE 484 (525) T ss_pred hhhhhccchhhhhhh----hhhcc-CcchHHHHHHHHHHHH---HHhh Confidence 999999999887543 22344 3443444322222211 1333 No 208 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=55.00 E-value=0.49 Score=22.28 Aligned_cols=386 Identities=9% Similarity=0.020 Sum_probs=147.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHH-HHhhhhcCCCCCCCCHHHH--HHHhhccccCChHHHHHHHHhhhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGM-RRNFKRYVKQFPHEQNKRY--DLRKSVAKMTNVFRDVLEGLASKP 76 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~-r~~g~~yLpk~~~e~~~~Y--~~rl~ra~~~n~~~~~v~~~~g~l 76 (455) |++- +.+--++ .+..-.+ .+.+ +--|..+ ...+. +...+ ...++ .+.+-..|+.+++-+ T Consensus 1 ~~~~-~~~~~~~----------~~~~~~~~~~~~~~~~g~~~-~~~~~-~~~~~~~~~a~~----~~~v~~~v~~ia~~i 63 (460) T protein:vir:10 1 MANR-IIRALRE----------LTGLDNKFNDAFIKYIGQTF-TKYDN-NGKTYLEQGYNI----NPDVYSCISQMAAKT 63 (460) T ss_pred Cchh-HHHHHhh----------hhccCCCchHHHHHhhcccc-CCCcc-chhhhhHHHHhc----chHHHHHHHHHHHhh Confidence 5521 0000000 0000000 0000 0001111 11111 11111 11222 234444555555555 Q ss_pred hcCCceec---c----------------------------------CCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhC Q lcl|NC_020844. 77 FGREVEVT---E----------------------------------GTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADS 119 (455) Q Consensus 77 f~k~p~i~---~----------------------------------~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G 119 (455) -+-|..+- . .......|+..-+ ...+-.+|.+.+....+.+| T Consensus 64 A~lp~~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN-~~~t~~~f~~~~~~~lll~G 142 (460) T protein:vir:10 64 VAVPYTIKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPN-PTQTWADIYSLYKTYMRLNG 142 (460) T ss_pred hhCceEEEeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCC-CCCCHHHHHHHHHHHHhhcC Confidence 54444331 0 0111223333322 23578899999999999999 Q ss_pred cEEEEEeeCCCCcchhhhhHHhccCCc-eEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCC Q lcl|NC_020844. 120 ISWIRVDYPAGQQFRNREEERQAGVRP-YWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDA 198 (455) Q Consensus 120 ~~~~lVd~p~~~~~~t~ad~~~~~~rP-y~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~ 198 (455) -+|+++..+..+.. ..+| -+.+++|..|.-+..+. + ..+.+ ...+..|....+ T Consensus 143 nay~~i~r~~~~~~---------~G~~~~L~~l~~~~v~v~~~~~--~-~~~~~--------------~~~~~~~~~~~~ 196 (460) T protein:vir:10 143 NCYFYLMSPDDGIN---------AGVPSQMYVLPAHLIKIVLKDD--I-NLLST--------------DSPIKSYMLIQG 196 (460) T ss_pred CeEEEEEecCCCcc---------CceeEEEEEEcCceEEEEEcCC--C-ceeee--------------eeeeeEEEEecC Confidence 99999976543321 0112 13444455443211111 0 00000 001111111112 Q ss_pred cEEEEeccCCCCCCceeEEEEEeccc----CCCcccCcchHHHHHHHHHHHHHhHHHHHHHHHHc-CCceEEEecCCCcc Q lcl|NC_020844. 199 GIWEVDEFGRITIGVVPMVPFITGRR----KGKKWVFHPPMKDALDLQVALFRKECALEFAEMMT-AFPMLSANGVEPDT 273 (455) Q Consensus 199 g~~~~~~~g~~~l~~vP~V~f~~~~~----~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~l~~~-~~p~~~~~G~~~~~ 273 (455) |.-.... .-..|. |..... .++...+.+|+.-++. .+.......++....+.. +.|-.+++ ..... T Consensus 197 g~~~~~~----~~evih---~r~~~~~~~~~~~~~~G~sp~~~~~~-~i~~~~~~~~~~~~~f~ng~~~~~i~~-~~~~l 267 (460) T protein:vir:10 197 DQFIEFN----EDEVIH---TKYANPNFDLQGSHLYGMSPIRAILR-NINSQNSTIDNNVKTMQNGGVFGFIHG-GSTGL 267 (460) T ss_pred ceeEEec----ccceEE---EecCCCCcccccCccccccHHHHHHH-HHHHHHHHHHHHHHHHhcCCCcceeee-cCCCC Confidence 2111110 011222 222222 2234567888876654 344444455555554444 44544443 22111 Q ss_pred ccc----cCc--cceeec---cceeeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCC--- Q lcl|NC_020844. 274 DSE----GSP--KPLDTG---PDTVLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQ--- 338 (455) Q Consensus 274 ~~~----~~~--~~~~~g---~~~~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~--- 338 (455) .++ .+. +...-| ++.++.|+. +.++.-++.++...+ +.+..+...++|+.+ ...++... T Consensus 268 ~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~-----g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 341 (460) T protein:vir:10 268 TQPQADSLKQRLTEMDKSPDRLSQIAGASG-----EIAFTKISLNTDELK-PFDYLKYDQKAICNALGWSDKLLNNNEGG 341 (460) T ss_pred CHHHHHHHHHHHHHHhcCccccCCceecCC-----CceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCC Confidence 111 110 001122 233555642 223433443333322 344455556666553 22333211 Q ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCC Q lcl|NC_020844. 339 SGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDIS 418 (455) Q Consensus 339 ~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is 418 (455) +.+-|..+ .....-....|.-++..++++++.-| +-.+. ...+..|++ ||...........++.++++.|+++ T Consensus 342 t~~~sn~e-~~~~~f~~~~l~P~~~~ie~~ln~kl--~~~~~--~~~~~~i~~--d~~~l~~l~~d~~~~~~~~~~g~~T 414 (460) T protein:vir:10 342 GLNTGNLE-EERKRVVTDNIQPDLVILKQAFDKKF--IKRFK--GYENAVIEW--DISELPEMQTDMVAMASWLNTIPVT 414 (460) T ss_pred CCccccHH-HHHHHHHHHHHHHHHHHHHHHHHHhh--cCccc--ccCCceEEe--ecchhhhHHHHHHHHHHHHhCCCCC Confidence 11112111 11112233456666666766666532 11111 122334444 3332211234566777889999999 Q ss_pred HHHHHHHHHhcCCCCCCCCHHHH------HHHHHhh----cccCCCC Q lcl|NC_020844. 419 REGLWMEFQRLGELSDNFDPEEE------SERLISE----LPGGIEE 455 (455) Q Consensus 419 ~et~~~~l~~~~vl~~~~d~e~e------~~ri~~e----~~~~l~~ 455 (455) ...+++.+ |+-|-....-++ ...+++- .+++-|+ T Consensus 415 ~NE~R~~~---g~~pi~~~~gD~~~~~~n~~~~~~~~~~~~~~~~nq 458 (460) T protein:vir:10 415 PNEIRIAM---KYETLNQDGMDIVFMPSNKVRIDDVSNNLIDSAFNQ 458 (460) T ss_pred HHHHHHHh---CCCCCCCCCCCeeeecccccchhhcccccCCCcccC Confidence 99888766 443211110110 0111111 1122222 No 209 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=52.11 E-value=0.56 Score=21.95 Aligned_cols=380 Identities=9% Similarity=-0.055 Sum_probs=146.1 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhh---h Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKP---F 77 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~l---f 77 (455) |-=.....++.+-... .+.-+-+.+++...- ..|..+ +. +..++.+.++-....+-+.+++.. + T Consensus 1 ~~f~~~f~r~~~~~~~--~~~~~~~~~~~~~~~-~~g~~v-------~~---~~~l~~~~v~~~i~~Ia~~iA~~p~~~~ 67 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVT--TPAELAEAIGLSYDT-YTGKRI-------SS---QRAMRLTAVYSCVRVLAESVGMLPCSLY 67 (413) T ss_pred CccchhhccCccCCcc--chHHHHHhhhcCccc-ccCcee-------ch---hhhhccHHHHHHHHHHHHhhhhCceEEE Confidence 3322222222211111 111222233321100 011111 11 223333333333333334444322 2 Q ss_pred cCCcee--ccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhh Q lcl|NC_020844. 78 GREVEV--TEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSA 155 (455) Q Consensus 78 ~k~p~i--~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ 155 (455) ++.... ...+..+..++..-=-...+-.+|.+.+....+.+|-+|+++.... +.+. -+..++|.. T Consensus 68 ~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~-g~~~------------~L~~l~~~~ 134 (413) T protein:vir:48 68 KISGTLKTRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKAL-GEVV------------ELLPIDPGC 134 (413) T ss_pred EecCCcceeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeCC-CcEE------------EEEEEcCce Confidence 222111 0113345566532111356788999999999999999999986431 2110 133444444 Q ss_pred hccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEE-eCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcch Q lcl|NC_020844. 156 ILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRK-NDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPP 234 (455) Q Consensus 156 ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~-~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~p 234 (455) |.-.. ++... .+|+. ...|...... .-.+|.+ ..... +...+.+| T Consensus 135 v~~~~----~~~~~---------------------~~y~~~~~~g~~~~~~----~~evih~---~~~~~--d~~~G~s~ 180 (413) T protein:vir:48 135 VEPKL----NSQWQ---------------------PVYQVTFPDGSVDVLT----QDEIWHV---RTLTL--DGLVGLNP 180 (413) T ss_pred EEEEE----cCCce---------------------EEEEEEecCceEEEEc----cccEEEe---cCcCC--CCcccccH Confidence 32100 00000 01111 0111111110 1112222 22222 23457788 Q ss_pred HHHHHHHHHHHHHhHHHHHHHHH-HcCCceEEEecCC---CccccccCc--cceeecc---ceeeeccCCCCccccceeE Q lcl|NC_020844. 235 MKDALDLQVALFRKECALEFAEM-MTAFPMLSANGVE---PDTDSEGSP--KPLDTGP---DTVLYAPPNAEGQHGSWNF 305 (455) Q Consensus 235 L~dla~l~~~hy~~~sd~~~~l~-~~~~p~~~~~G~~---~~~~~~~~~--~~~~~g~---~~~~~lp~~~~~~~~~~~~ 305 (455) +.-++.. +.......++...++ ..+.|-.+++.-. .+..+.... +...-|. +.++.++.+ .++.= T Consensus 181 i~~~~~~-i~~~~~~~~~~~~~~~ng~~p~gil~~~~~~~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g-----~~~~~ 254 (413) T protein:vir:48 181 IAYAREA-ISLAAATEEHGARLFGNGAVTSGVLRTEQKLTPDAYERLKKDFEERHTGLGNAHRPMILEMG-----LDWKS 254 (413) T ss_pred HHHHHHH-HHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCC-----ceEEe Confidence 7766654 444445555555444 4567777776422 111111111 1112232 234555532 23443 Q ss_pred EeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCC-Ccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020844. 306 VEPATESLKFLAEQIESVSKEIREL---GRQPLTAQ-SGN-LTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWM 380 (455) Q Consensus 306 le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~-~~~-~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~ 380 (455) +..++...+ +.+..+....+|+.+ ...++... .++ .+.++... .-....|.-++..++++++.-|- - T Consensus 255 l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~--~f~~~~i~P~~~~ie~~l~~~L~--~--- 326 (413) T protein:vir:48 255 MALNAEDSQ-FLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGL--GFINYSLVPYLTRIEQRINTGLV--R--- 326 (413) T ss_pred ccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHHH--HHHHHHHHHHHHHHHHHHHhhcc--C--- Confidence 443333332 233344444444432 33333221 112 12122221 11233455555566655554221 0 Q ss_pred CCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCH--HHHHHH---HHhh----cc- Q lcl|NC_020844. 381 DAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDP--EEESER---LISE----LP- 450 (455) Q Consensus 381 g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~--e~e~~r---i~~e----~~- 450 (455) .....+..|+++.+=.........++++.++.+.|+++.-.+++.+..-- +++.... .-.... ..++ .+ T Consensus 327 ~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p-~~ggD~~~~~~n~~~~~~~~~~~~~~~~~ 405 (413) T protein:vir:48 327 ESKQGKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNP-RPGGDVYLTPMNMTTSPSAGDDNGKKKES 405 (413) T ss_pred ccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC-CCCcceeeccccccccccccccCCCCCCC Confidence 11112344555432222222345577888899999999998887663211 1111000 000000 0000 00 Q ss_pred cCCCC Q lcl|NC_020844. 451 GGIEE 455 (455) Q Consensus 451 ~~l~~ 455 (455) +.-+| T Consensus 406 ~~~~~ 410 (413) T protein:vir:48 406 GDADK 410 (413) T ss_pred CCccc Confidence 01111 No 210 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=51.01 E-value=0.59 Score=21.83 Aligned_cols=373 Identities=13% Similarity=0.049 Sum_probs=147.2 Q ss_pred CCCCCCCccCHHHHHH--HHHH-HHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAM--SDYL-QTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPF 77 (455) Q Consensus 1 m~~~~v~~~~p~y~~~--~~~w-~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf 77 (455) |.=-.+.++ ++-.+. -..| ...-+.+.|-.. ..++.|-+ ...++.+.++-.+..+-+++++. T Consensus 23 ~~~~~~f~~-~e~r~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~----------~~al~~~~V~acv~~Ia~~iA~l-- 87 (441) T protein:vir:98 23 LVVVGIFYK-NEKRDLQYNEDDLQMMVQTLPGFQG--TKLRQYKD----------IEAIRHSDIFTAVMMIASDLARM-- 87 (441) T ss_pred hhccccccc-cccccccCCCcchHHHHHHhhcccc--cCccccch----------hhhhccHHHHHHHHHHHHhhccC-- Confidence 111012111 000000 0000 000011111000 00111111 11122222223333344444443 Q ss_pred cCCceecc-----CCHHHHHHH-HhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEe Q lcl|NC_020844. 78 GREVEVTE-----GTGPSEDLL-EDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQV 151 (455) Q Consensus 78 ~k~p~i~~-----~~~~l~~~~-~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~ 151 (455) |..+.. ....+-.++ ..-. ...+-.+|.+.+....+.+|-+|+++.+...+.+. -+..+ T Consensus 88 --pl~~~~~~~~~~~~~~~~lL~~~PN-~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~------------~L~~i 152 (441) T protein:vir:98 88 --PIRVTVNGQINYSDRIVNLLNTRPN-PMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPM------------NLTFR 152 (441) T ss_pred --ceEEecCCcccccchHHHHHhcccc-cCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEE------------EEEEE Confidence 333311 122233333 2233 34577889999999999999999999766544321 24555 Q ss_pred chhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeC--CcEEEEeccCCCCCCceeEEEEEecccCCCcc Q lcl|NC_020844. 152 NHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKND--AGIWEVDEFGRITIGVVPMVPFITGRRKGKKW 229 (455) Q Consensus 152 ~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~--~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~ 229 (455) +|..|.-.. +..+ .+.+ +.+.... .+...... .-.+|. |...+. +.. T Consensus 153 ~~~~v~v~~-~~~g---~~~~------------------~~~~~~~~~~~~~~~~~----~~dviH---ir~~~~--dg~ 201 (441) T protein:vir:98 153 KTSEIELKL-DARG---RLYY------------------FHQRIDSNGNNIERNVK----FEDMLD---IKFYSL--DGI 201 (441) T ss_pred cCceeEEEE-CCCC---cEEE------------------EEEEeccCcceeeEEEc----cccEEE---eccCCC--CCc Confidence 565553211 1111 0100 0000000 00000000 011222 222222 234 Q ss_pred cCcchHHHHHHHHHHHHHhHHHHHHH-HHHcCCceEEEe--cC-CCccccc-cCc--cceeecc---ceeeeccCCCCcc Q lcl|NC_020844. 230 VFHPPMKDALDLQVALFRKECALEFA-EMMTAFPMLSAN--GV-EPDTDSE-GSP--KPLDTGP---DTVLYAPPNAEGQ 299 (455) Q Consensus 230 ~~~~pL~dla~l~~~hy~~~sd~~~~-l~~~~~p~~~~~--G~-~~~~~~~-~~~--~~~~~g~---~~~~~lp~~~~~~ 299 (455) .+.+|+.-+.. .+.......++... ....+.|-.+++ |. .++...+ .+. ....-|. +.++.|+. T Consensus 202 ~G~spi~~~~~-~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~G~~nag~~~vl~~----- 275 (441) T protein:vir:98 202 NGLSLLDTLSR-TIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDE----- 275 (441) T ss_pred cccCHHHHHHH-HHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCC----- Confidence 67888766554 34444445555444 445566777775 32 1111111 110 1112232 33566653 Q ss_pred ccceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 300 HGSWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYIT 376 (455) Q Consensus 300 ~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~ 376 (455) +.++.-++.++..++ +.+..+-...+|+.+ ...++.....+.+.++....+. + .|.-+..+++++++..| T Consensus 276 g~~~~~l~~~~~d~q-~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~~y~--~-tl~P~~~~ie~~ln~~L--- 348 (441) T protein:vir:98 276 SMTFDQLEVDTEVLK-LIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDYL--S-TLKPYITCVCAELNFKF--- 348 (441) T ss_pred CceEEEccCChhHHH-HHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHHHHH--H-HHHHHHHHHHHHHHhhc--- Confidence 224444444444433 233344444455443 3333432222333333333222 2 46666667777666543 Q ss_pred HHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHH-------------HHHH Q lcl|NC_020844. 377 NLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPE-------------EESE 443 (455) Q Consensus 377 a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e-------------~e~~ 443 (455) .....+..|+++.+=.........++++.++.+.|+++...+++.+ |+-| .-+.+ +..+ T Consensus 349 ----~~~~~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~R~~~---gl~p-i~gGd~~~~~~~~n~~~~~~~~ 420 (441) T protein:vir:98 349 ----NDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRD---GLAP-IPGGNGSIHRVDLNHVNIELVD 420 (441) T ss_pred ----cccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCC-CCCCCcceEeeccccccccccc Confidence 1112234455443222333335567778889999999999888765 3322 11111 1111 Q ss_pred HHHhhcc-------cCCCC Q lcl|NC_020844. 444 RLISELP-------GGIEE 455 (455) Q Consensus 444 ri~~e~~-------~~l~~ 455 (455) ..+.... .|-++ T Consensus 421 ~~q~~~~~~~~~~~kgGe~ 439 (441) T protein:vir:98 421 EYQMNKSRATDKKLKGGEE 439 (441) T ss_pred ccccccccccccccCCCCC Confidence 1111100 11111 No 211 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=49.73 E-value=0.63 Score=21.68 Aligned_cols=364 Identities=11% Similarity=-0.008 Sum_probs=146.6 Q ss_pred CCCCC-----CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhh Q lcl|NC_020844. 1 MPEQD-----PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASK 75 (455) Q Consensus 1 m~~~~-----v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~ 75 (455) |==+. .....|.- ..| +.++.+|... ..|. +. .. +.-++.+.++- .|+-+++- T Consensus 1 m~~~~~~~~~~~~~~~~~----~~~--~~~~~g~~~s--~~~~-~v--------~~-~~al~~~~v~~----cv~~ia~~ 58 (419) T protein:vir:80 1 MFFSRQLLSNLGQTQPGS----GGW--VSALLGSARS--EAGQ-VV--------TP-ASALSLTVLQN----CVTLLAES 58 (419) T ss_pred CCcccccccccCcCCCCc----chh--hHHhhccccc--ccCc-cc--------Ch-HHhhccHHHHH----HHHHHHHh Confidence 43111 11111111 111 1222322111 0111 00 00 12222333333 44444444 Q ss_pred hhcCCcee---c------cCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCC Q lcl|NC_020844. 76 PFGREVEV---T------EGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVR 145 (455) Q Consensus 76 lf~k~p~i---~------~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~r 145 (455) +-+-|..+ + .....+..++. .-+ ...+-.+|.+.+....+.+|-+|+++..+..|.+. T Consensus 59 ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN-~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~----------- 126 (419) T protein:vir:80 59 IAQLPVELYERSGDDRKPATDHPLYSILKYEPN-PWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQ----------- 126 (419) T ss_pred hccCceEEEEecCCCcccccccHHHHHHHhhcc-cCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE----------- Confidence 43333332 0 11223445443 222 34577899999999999999999999765544321 Q ss_pred ceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccC Q lcl|NC_020844. 146 PYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRK 225 (455) Q Consensus 146 Py~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~ 225 (455) -+..++|..|--.. ..+.+.+|+..+.. ..+-..| +.+.+.+. T Consensus 127 -~L~~i~~~~v~i~~-------------------------~~~~~~~y~~~~~~--------~~~~~~i--~h~~~~~~- 169 (419) T protein:vir:80 127 -GLYPLDNEAVTVMK-------------------------GPDLKPMYRVAGAD--------PLPQRLV--HHVRWMSI- 169 (419) T ss_pred -EEEEecCceEEEEE-------------------------CCCceEEEEEcCcc--------ccchhhe--EEecCCCC- Confidence 14444454442100 01111223222111 0111122 22222333 Q ss_pred CCcccCcchHHHHHHHHHHHHHhHHHHHHH-HHHcCCceEEEe--cCC-CccccccCc------cceeecc---ceeeec Q lcl|NC_020844. 226 GKKWVFHPPMKDALDLQVALFRKECALEFA-EMMTAFPMLSAN--GVE-PDTDSEGSP------KPLDTGP---DTVLYA 292 (455) Q Consensus 226 ~~~~~~~~pL~dla~l~~~hy~~~sd~~~~-l~~~~~p~~~~~--G~~-~~~~~~~~~------~~~~~g~---~~~~~l 292 (455) +...+.+|+.-++. .+.......++... ....+.|-.+++ +.. ...+.+..+ +...-|+ +.++.+ T Consensus 170 -d~~~G~s~i~~~~~-~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl 247 (419) T protein:vir:80 170 -NGYTGLSPVLLHAN-AIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRITDGWNAKFGGSGNAKKVALL 247 (419) T ss_pred -CCcccccHHHHHHH-HHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceec Confidence 23457888766554 34444444454433 344567877765 211 111111100 0111232 345666 Q ss_pred cCCCCccccceeEEeeCchhHHH-HHHHHHHHHHHHHHh---hhhhccC-CCcchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 293 PPNAEGQHGSWNFVEPATESLKF-LAEQIESVSKEIREL---GRQPLTA-QSGNLTRISAAQAASKGNSAVQQWAAGLKD 367 (455) Q Consensus 293 p~~~~~~~~~~~~le~~~~~l~~-~~~~l~~~~~~m~~~---g~~~l~~-~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~ 367 (455) +. ..+|.+.+-++.+. +.+..+-..++|+.+ ...++.. .+++-+..+ .....-....|.-+..++++ T Consensus 248 ~~-------g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e-~~~~~f~~~~l~P~~~~ie~ 319 (419) T protein:vir:80 248 QE-------GMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIE-HQSLQFVIYTLLPWVKRHEQ 319 (419) T ss_pred CC-------CceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHH-HHHHHHHHHHHHHHHHHHHH Confidence 53 24444444322221 233344445555443 2233321 122222211 11112234456666677777 Q ss_pred HHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHH----- Q lcl|NC_020844. 368 ALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEES----- 442 (455) Q Consensus 368 a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~----- 442 (455) +++.-|- ......+..|+++.+=..+......++++.++.+.|+++.-..++.+ |+-| .-. -++. T Consensus 320 ~l~~kll-----~~~~~~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~---g~~p-~~g-GD~~~~~~n 389 (419) T protein:vir:80 320 AKTRDLL-----LPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLE---NMPP-VKG-GDIYLSPMN 389 (419) T ss_pred HHhhhcc-----CccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCC-CCC-cceeeeccc Confidence 7765331 11111234455543222222234557778888999999999888765 3321 100 0010 Q ss_pred -HHHHhhcccCCCC Q lcl|NC_020844. 443 -ERLISELPGGIEE 455 (455) Q Consensus 443 -~ri~~e~~~~l~~ 455 (455) ..+....+....+ T Consensus 390 ~~~~~~~~~~~~~~ 403 (419) T protein:vir:80 390 MVDASKPQPIPMGK 403 (419) T ss_pred cccccccccccCCC Confidence 0111111100011 No 212 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=46.53 E-value=0.73 Score=21.33 Aligned_cols=347 Identities=11% Similarity=0.062 Sum_probs=137.7 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGRE 80 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~ 80 (455) |.--++..++ .-..-...|..+ ..|. ..+-...- +.. .+ ++. ..+-..|+.+++-+-+.| T Consensus 1 M~~~~~f~~r-~~~~~~~~~~~~---~~~~-------~~~~~~~v--~~~--~a-l~~----~av~~cv~~ia~~ia~~p 60 (359) T protein:vir:10 1 MSILNPFERR-SSITPNNYYPFM---VQNG-------SIVPNSLV--DAT--EA-LKN----SDLYAVTSLISSDIAGTR 60 (359) T ss_pred Ccccchhhcc-ccCCCCcchhhh---hccc-------cccCCccc--CHH--Hh-hcc----hHHHHHHHHHHHhhhcCc Confidence 7743322111 000000111111 1111 00000000 000 11 111 222345555555555544 Q ss_pred ceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhccce Q lcl|NC_020844. 81 VEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILEIQ 160 (455) Q Consensus 81 p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~w~ 160 (455) .+ ..+.+..|+..-.. ..+-.+|.+.+....+.+|-+|++|..+..+.+. -+..++|..|.- T Consensus 61 ~~---~~~~~~~L~~~PN~-~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~------------~l~~l~~~~v~i-- 122 (359) T protein:vir:10 61 FI---GNQVFTSVLNNPSH-LTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMK------------ELRLIPSNAITI-- 122 (359) T ss_pred cc---cchHHHHHhhcccc-cCCHHHHHHHHHHhccccCceEEEEEECCCCeEE------------EEEEeCCceEEE-- Confidence 42 23445666655443 3577889999999999999999999765544321 133334433321 Q ss_pred eeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHHHHHH Q lcl|NC_020844. 161 SERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMKDALD 240 (455) Q Consensus 161 ~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~dla~ 240 (455) ...++. + .+++... .++...... .=.+|.|--+..+....+...+.+|+.-+.. T Consensus 123 -~~~~~~--~----------~y~~~~~---------~~~~~~~~~----~~evih~~~~~~~~~~~dg~~G~spi~~~~~ 176 (359) T protein:vir:10 123 -DLTDDT--L----------TYEVNQF---------DDYPSAKYN----ASEMIHVKIMAYGVDTLHNLVGHSPLESLTS 176 (359) T ss_pred -EEcCCe--E----------EEEEEec---------CCceEEEEc----ccceEEeccCCCCCCccCccccccHHHHHHH Confidence 000100 0 0111111 111111110 1122332111111112234568888876555 Q ss_pred HHHHHHHhHHHHHHHH-HHcCCceEEEecCC----CccccccCc--cceeecc--ceeeeccCCCCccccceeEEeeCch Q lcl|NC_020844. 241 LQVALFRKECALEFAE-MMTAFPMLSANGVE----PDTDSEGSP--KPLDTGP--DTVLYAPPNAEGQHGSWNFVEPATE 311 (455) Q Consensus 241 l~~~hy~~~sd~~~~l-~~~~~p~~~~~G~~----~~~~~~~~~--~~~~~g~--~~~~~lp~~~~~~~~~~~~le~~~~ 311 (455) . +.......++.... ...+.|-.+++--. .+..+..+. +...-|. +.+..|+. +.++.-++.++. T Consensus 177 ~-i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~n~g~~~vl~~-----g~~~~~l~~~~~ 250 (359) T protein:vir:10 177 E-IGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSIRKEFEKANGGNNSGRVMVLDQ-----SADFSTVSINAD 250 (359) T ss_pred H-HHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCceecCC-----CcceeeecCCHH Confidence 3 33334444444443 44456777775211 111111110 1111121 23555653 234554555444 Q ss_pred hHHHHHHHHHHHHHHHHHh---hhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCceE Q lcl|NC_020844. 312 SLKFLAEQIESVSKEIREL---GRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDTNPE 388 (455) Q Consensus 312 ~l~~~~~~l~~~~~~m~~~---g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~~~ 388 (455) ..+ +.+..+....+|+.+ ...++.... ..++.....+.... ..|......+...+++.|-- . +..+. +. T Consensus 251 d~q-~le~~~~~~~~Ia~~fgVPp~~lg~~~-~~~~~~~~~e~~~~-~~l~~~l~p~~~~l~~~l~~--~-~~~~~-~~- 322 (359) T protein:vir:10 251 VAN-YLNSMNWGRTQIAKAFGVSDSYLNGTG-DQQSSLDQIKDLYV-NALNRFIEPLISELRIKCDS--S-IGVDM-SP- 322 (359) T ss_pred HHH-HHHHHHHHHHHHHHHhCCCHHHhCCCC-cccccHHHHHHHHH-HHHHHHHHHHHHHHHHHhhh--h-hcccc-hh- Confidence 433 234444444455432 333342221 22112222222211 22333333333334333211 0 11110 00 Q ss_pred EEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCC Q lcl|NC_020844. 389 ADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGEL 432 (455) Q Consensus 389 v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl 432 (455) -.+|+ +......+.++.+.|+++.-..++.+..-.|+ T Consensus 323 ---~~~~d----~~~~~~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 323 ---ITDYS----NSVFKADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred ---hhhcC----HHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 01121 23445567788899999999999888766666 No 213 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=45.27 E-value=0.78 Score=21.19 Aligned_cols=381 Identities=9% Similarity=-0.042 Sum_probs=141.7 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC--HHHHHHhh----hhcCCCC--CCCCHHHHHHHhhccccCChHHHHHHHH Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG--VTGMRRNF----KRYVKQF--PHEQNKRYDLRKSVAKMTNVFRDVLEGL 72 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G--~~~~r~~g----~~yLpk~--~~e~~~~Y~~rl~ra~~~n~~~~~v~~~ 72 (455) |..---. ...+.+...+-.+.| ...+.... ..++-.. .+..- ..+..++.+ .+-..|+.+ T Consensus 1 ~~~~l~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v-~~~~al~~~----~V~~~i~~i 68 (434) T protein:vir:43 1 MSKSLGK-------VLSSATSAPRSSLFGWGGKTIRLTDGAFWSQFLGRESSSGKKV-TVDKAMKLS----AVWACVRLI 68 (434) T ss_pred Cccchhh-------hhhhcccccchhhhcccccccccCchHHHHHHhcCCccCCcee-chhhhhccH----HHHHHHHHH Confidence 4421100 000011111110000 00000000 0000000 00000 011112222 222344444 Q ss_pred hhhhhcCCcee---------c-cCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhc Q lcl|NC_020844. 73 ASKPFGREVEV---------T-EGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQA 142 (455) Q Consensus 73 ~g~lf~k~p~i---------~-~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~ 142 (455) +.-+-.-|..+ . .....+..++..-=-...+-.+|.+.+....+.+|-+|+++... .+.+. T Consensus 69 a~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~-~G~~~-------- 139 (434) T protein:vir:43 69 STSVAGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA-AGRPA-------- 139 (434) T ss_pred HHhhhhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEE-------- Confidence 44444333332 1 11223455553211234578899999999999999999998643 22211 Q ss_pred cCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEE-eCCcEEEEeccCCCCCCceeEEEEEe Q lcl|NC_020844. 143 GVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRK-NDAGIWEVDEFGRITIGVVPMVPFIT 221 (455) Q Consensus 143 ~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~-~~~g~~~~~~~g~~~l~~vP~V~f~~ 221 (455) -+..++|..|--+. +. +| ...|+. ..+|...... .-.+|. |.. T Consensus 140 ----~L~~l~p~~v~~~~-~~-~g-----------------------~~~y~~~~~~g~~~~~~----~~eVih---~~~ 183 (434) T protein:vir:43 140 ----ALDFLLPSRVDLEC-DE-NG-----------------------RLKYFYTTKKGARREIE----RTNMLH---IPA 183 (434) T ss_pred ----EEEEEcCcceEEEE-cC-CC-----------------------eEEEEEEecCceEEEEc----cccEEE---ecC Confidence 13444454442111 00 00 011111 1112111111 111122 222 Q ss_pred cccCCCcccCcchHHHHHHHHHHHHHhHHHHH-HHHHHcCCceEEEecCCC---ccccccCcc-ceeec---cceeeecc Q lcl|NC_020844. 222 GRRKGKKWVFHPPMKDALDLQVALFRKECALE-FAEMMTAFPMLSANGVEP---DTDSEGSPK-PLDTG---PDTVLYAP 293 (455) Q Consensus 222 ~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~-~~l~~~~~p~~~~~G~~~---~~~~~~~~~-~~~~g---~~~~~~lp 293 (455) .+. +...+.+|+.-++.. +.......++. ......+.|-.+++--.. +..+..+.. .-..| ++.+..|| T Consensus 184 ~~~--dg~~G~spi~~~~~~-i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~r~~~~~~~g~~nag~~~vl~ 260 (434) T protein:vir:43 184 FTL--DGRIGLSAIRYGVDV-FGSVMSAEDAANGTFKNGLLPTVAFKVDRILQPAQREEFREYVKSVSGAMNSGRSPVLE 260 (434) T ss_pred cCC--CCccccCHHHHHHHH-HHHHHHHHHHHHHHHhccCCcceEEecCCCCCHHHHHHHHHHHHHhcCccccCCccccC Confidence 222 234578887666554 33333344443 334455678777753221 111111100 00122 23345554 Q ss_pred CCCCccccceeEEeeC--chhHHHHHHHHHHHHHHHHHh---hhhhccCC-CcchhHHHH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 294 PNAEGQHGSWNFVEPA--TESLKFLAEQIESVSKEIREL---GRQPLTAQ-SGNLTRISA-AQAASKGNSAVQQWAAGLK 366 (455) Q Consensus 294 ~~~~~~~~~~~~le~~--~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~-~~~~sa~a~-~~~~~~~~s~L~~~a~~~~ 366 (455) . ..+|.+++ +...+ +.+..+-...+|+.+ ...++... .++.++... ....+-....|.-++.+++ T Consensus 261 ~-------g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~~~f~~~~L~P~~~~ie 332 (434) T protein:vir:43 261 Q-------GITPETIGINPVDAQ-LLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQMLAFLTFSISSITNQIQ 332 (434) T ss_pred C-------CceEEEccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHHHHHHHHHHHHHHHHHH Confidence 2 24454444 33332 233344445555443 22223211 111111111 1111122344666667777 Q ss_pred HHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCH-----HHH Q lcl|NC_020844. 367 DALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDP-----EEE 441 (455) Q Consensus 367 ~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~-----e~e 441 (455) .+++.-|---..+ .+..|+++.+=..+......++++.++...|+++.-.+++.+..--+ +..... -.- T Consensus 333 ~~ln~kL~~~~~~-----~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p~-~ggD~~~~~~n~~~ 406 (434) T protein:vir:43 333 QCVNKRLLTAPER-----IRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRKENLPEL-PGGDILTVQSNLVP 406 (434) T ss_pred HHHHhhcCChhhh-----cCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC-CCCCeEeeccCccc Confidence 7766543111111 12345554322223333455788888999999999988876532111 110000 000 Q ss_pred HHHHHhhccc-C-------------CCC Q lcl|NC_020844. 442 SERLISELPG-G-------------IEE 455 (455) Q Consensus 442 ~~ri~~e~~~-~-------------l~~ 455 (455) ++.+.++.+. . ..| T Consensus 407 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 407 IDQLGQSNKSQAVRAALMNWFSQPEPQE 434 (434) T ss_pred hhhhhccCCCcchhhhhhccCCCCCCCC Confidence 1112111110 0 111 No 214 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=44.48 E-value=0.8 Score=21.10 Aligned_cols=362 Identities=11% Similarity=0.015 Sum_probs=138.5 Q ss_pred CCCC--------------C---CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCC Q lcl|NC_020844. 1 MPEQ--------------D---PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTN 63 (455) Q Consensus 1 m~~~--------------~---v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n 63 (455) -|.. + ...+..+..+....|...... +|-. ..+.+. .+..... +..++- . T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~------~~~~~~-~~~~~t~-~~~~~~----~ 79 (409) T protein:vir:83 13 IPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAW-SGYP------ESWATP-SWGSAQD-KLRTLI----D 79 (409) T ss_pred CCCcccccccccccCCCCceeeccCCCcchhhhhccccccccc-cccc------cccccc-Cccccch-hhHhhh----H Confidence 1100 0 122333334444444333221 1110 111111 1111111 122222 2 Q ss_pred hHHHHHHHHhhhhhcCCceec---cCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEE-eeCCCCcchhhhhH Q lcl|NC_020844. 64 VFRDVLEGLASKPFGREVEVT---EGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRV-DYPAGQQFRNREEE 139 (455) Q Consensus 64 ~~~~~v~~~~g~lf~k~p~i~---~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lV-d~p~~~~~~t~ad~ 139 (455) .+...|+-+++-+-+-|..+- ...+.+..++..-=-...+-.+|.+.+....+. |-+|+++ .....+.+. T Consensus 80 ~v~acV~~Ia~~iA~lpl~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~l~~~lll-Gnay~~~i~r~~~G~~~----- 153 (409) T protein:vir:83 80 VAWACIDLNASVLSSMPIYRMRNGRIIDSVAWMSNPDPEVYTSWQEFAKQLFWDFQL-GEAFVLPMAHGSDGYPI----- 153 (409) T ss_pred HHHHHHHHHHHhhccCceEEeeCCccccchhhhcccCCCCCCCHHHHHHHHHHHHhh-CCcEEEEEEECCCCcEE----- Confidence 333344444444433343221 111223333322111235778888888777655 8888763 333333221 Q ss_pred HhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEE Q lcl|NC_020844. 140 RQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPF 219 (455) Q Consensus 140 ~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f 219 (455) -+..++|..|- +. ...+....|+..+.. .+- .++-| T Consensus 154 -------~L~pl~p~~v~------------------------v~-~~~~g~~~y~~~~~~---------~~~---eiiHi 189 (409) T protein:vir:83 154 -------RFRVVPPWLVN------------------------VE-LKKGARREYRIGGLN---------VTD---EILHI 189 (409) T ss_pred -------EEEEECCcceE------------------------EE-EcCCceEEEEEcccc---------Ccc---ceEEe Confidence 13333333221 00 011111122221110 011 23322 Q ss_pred EecccCCCcccCcchHHHHHHHHHHHHHhHHHH-HHHHHHcCCceEEEec---CCCccccccCcc--ceeec-cceeeec Q lcl|NC_020844. 220 ITGRRKGKKWVFHPPMKDALDLQVALFRKECAL-EFAEMMTAFPMLSANG---VEPDTDSEGSPK--PLDTG-PDTVLYA 292 (455) Q Consensus 220 ~~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~-~~~l~~~~~p~~~~~G---~~~~~~~~~~~~--~~~~g-~~~~~~l 292 (455) ..... .+...+.+|+.-++.....+.. ..++ ...+...+.|..+++- ++++..+..... ....| ++..+.+ T Consensus 190 r~~~~-~~~~~G~spi~~~~~~i~~~~a-~~~~~~~~f~nga~p~gil~~~~~ls~e~~~~~~~~~~~~~~~nag~~~il 267 (409) T protein:vir:83 190 RYQGN-TADAHGHGPLESAAPRQVVIGL-LQKYVQNLAETGGVPLYWLGVERRLSETEAVDLMDRWIESRSKYAGHPALV 267 (409) T ss_pred CCCCC-CCCcccccHHHHHHHHHHHHHH-HHHHHHHHHhcCCCcceEeecCCCCCHHHHHHHHHHHHHhhCCccCcccee Confidence 22211 2334578888777765544443 3344 3334456788888762 122111111111 01111 2233444 Q ss_pred cCCCCccccce-eEEeeCchhHHHHHHHHHHHHHHHHH---hhhhhccCCCcchhHH---HHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 293 PPNAEGQHGSW-NFVEPATESLKFLAEQIESVSKEIRE---LGRQPLTAQSGNLTRI---SAAQAASKGNSAVQQWAAGL 365 (455) Q Consensus 293 p~~~~~~~~~~-~~le~~~~~l~~~~~~l~~~~~~m~~---~g~~~l~~~~~~~sa~---a~~~~~~~~~s~L~~~a~~~ 365 (455) .. ++++ +.+..++..++ +.+..+-...+|.. +...++.......+++ ......+-....|.-++..+ T Consensus 268 ~~-----g~~~~~~~~~s~~d~q-~le~r~~~~~eIa~~fgVPp~llg~~~~~~~~tysn~eq~~~~f~~~tL~P~~~~i 341 (409) T protein:vir:83 268 TG-----GATLNQAKSMSAQDLS-LMELTQFNEARIAILLGVPPFLVGLPGATGSLTYSNIEQLFSFHDRSSLRPKATAV 341 (409) T ss_pred cC-----CcccccccCCCHHHHH-HHHHHHhhHHHHHHHhCCCHHHccCCCCccccccccHHHHHHHHHHHHHHHHHHHH Confidence 32 2232 23444554443 23333333444433 2333332111111111 11111122334556666666 Q ss_pred HHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHH Q lcl|NC_020844. 366 KDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERL 445 (455) Q Consensus 366 ~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri 445 (455) ++++++-| +. . +..|+++.+=..+....+..+++.++.+.|++|.-+.++.+ |+ ++. +--+++ T Consensus 342 e~~l~~~L------l~--~-~~~~~f~~~~llr~d~~~r~~~~~~~~~~G~lT~NE~R~~~---gl-pp~----~ggd~l 404 (409) T protein:vir:83 342 MAALDRWA------LP--S-PQHLELNRDDYTRPSLVERATAYKIMIEAGVMEPNEARAME---RL-HSE----AAAVRL 404 (409) T ss_pred HHHHHHhh------CC--C-CcEEEeehhhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CC-CCC----CCCccc Confidence 66666422 11 1 22455554322232334556777788889999988776643 33 221 111222 Q ss_pred HhhcccCC Q lcl|NC_020844. 446 ISELPGGI 453 (455) Q Consensus 446 ~~e~~~~l 453 (455) . .+|+ T Consensus 405 ~---~~gv 409 (409) T protein:vir:83 405 S---GGGV 409 (409) T ss_pred C---CCCC Confidence 1 1233 No 215 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=43.20 E-value=0.85 Score=20.96 Aligned_cols=371 Identities=8% Similarity=-0.092 Sum_probs=142.1 Q ss_pred CCCCC-CCc-cCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQD-PTK-RSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~-v~~-~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |-=-+ +.. +.+.. ...+.-+.+..++... ...|..+ +. +..++.+ .+...|+-++.-+-+ T Consensus 1 Mg~f~~lf~r~~~~~---~~~~~~~~~~~~~~~~-~~~g~~v-------~~---~~al~~~----~v~~~i~~Ia~~ia~ 62 (414) T protein:vir:44 1 MVFFSGLFQRKSDAP---VTTPAELADAIGLSYD-TYTGKQI-------SS---QRAMRLT----AVFSCVRVLAESVGM 62 (414) T ss_pred CchhhhhhccCccCc---ccchhhHhHhhccCcc-ccCCcee-------ch---hhhhccH----HHHHHHHHHHHHhcc Confidence 33111 111 11110 0011112222222100 0011111 00 1223322 233344444444433 Q ss_pred CCcee---------ccCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceE Q lcl|NC_020844. 79 REVEV---------TEGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYW 148 (455) Q Consensus 79 k~p~i---------~~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~ 148 (455) -|..+ ......+..++. .-+ ...+-.+|.+.+....+.+|-+|+++-.. .+.+. -+ T Consensus 63 ~p~~~~~~~~~~~~~~~~~~~~~lL~~~PN-~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~g~~~------------~L 128 (414) T protein:vir:44 63 LPCNLYHLNGSLKQRATGERLHKLISTHPN-GYMTPQEFWELVVTCLCLRGNFYAYKVKA-FGEVA------------EL 128 (414) T ss_pred CceEEEEecCCceeecccchHHHHHHhhcc-cCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEE------------EE Confidence 33222 011233444442 233 34678889999999999999999988533 22211 24 Q ss_pred EEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEe-CCcEEEEeccCCCCCCceeEEEEEecccCCC Q lcl|NC_020844. 149 SQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKN-DAGIWEVDEFGRITIGVVPMVPFITGRRKGK 227 (455) Q Consensus 149 ~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~-~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~ 227 (455) ..++|..|..... . ++ .+ +|+.. .+|...... .-.+|. |.+... + T Consensus 129 ~~l~~~~v~~~~~-~-~~--~~---------------------~y~~~~~~g~~~~~~----~~evih---~~~~~~--d 174 (414) T protein:vir:44 129 LPVDPGCVVPKLN-S-SW--EP---------------------VYQVTFPDGSTDVLS----QEDIWH---VRTLTL--D 174 (414) T ss_pred EEEcCceEEEEEC-C-CC--cE---------------------EEEEEecCceEEEEc----cccEEE---ecCCCC--C Confidence 4455554421110 0 00 00 11111 111111110 111222 222222 2 Q ss_pred cccCcchHHHHHHHHHHHHHhHHHHHHHH-HHcCCceEEEec---CCCccccccCc--cceeecc---ceeeeccCCCCc Q lcl|NC_020844. 228 KWVFHPPMKDALDLQVALFRKECALEFAE-MMTAFPMLSANG---VEPDTDSEGSP--KPLDTGP---DTVLYAPPNAEG 298 (455) Q Consensus 228 ~~~~~~pL~dla~l~~~hy~~~sd~~~~l-~~~~~p~~~~~G---~~~~~~~~~~~--~~~~~g~---~~~~~lp~~~~~ 298 (455) ...+.+|+.-+.. .+.......++...+ ...+.|..+++- ++++....... +...-|. +.++.+|.+ T Consensus 175 ~~~G~s~i~~~~~-~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g--- 250 (414) T protein:vir:44 175 GLVGLNPIAYARE-AISLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKKDFEERHTGLGNAHRPMILEMG--- 250 (414) T ss_pred CcccccHHHHHHH-HHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCC--- Confidence 2457888765554 344444444554443 445667777763 22221111111 0111222 235556532 Q ss_pred cccceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCC-CcchhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 299 QHGSWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQ-SGNLTR-ISAAQAASKGNSAVQQWAAGLKDALENAL 373 (455) Q Consensus 299 ~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~-~~~~sa-~a~~~~~~~~~s~L~~~a~~~~~a~~~~l 373 (455) .++.-++.++...+ +.+..+-...+|+.+ ...++... +++-+. ++... .-....|.-++.+++++++..| T Consensus 251 --~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~~--~~~~~~l~P~~~~ie~~ln~~L 325 (414) T protein:vir:44 251 --LDWKSMALNAEDSQ-FLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGL--GFINYSLVPYLTRIEQRINTGL 325 (414) T ss_pred --ceEEEccCChHHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH--HHHHHHHHHHHHHHHHHHHhhc Confidence 23443443333333 233344444455443 22333221 122121 11111 1123445556566666665432 Q ss_pred HHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHH-----HH---- Q lcl|NC_020844. 374 YITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEES-----ER---- 444 (455) Q Consensus 374 ~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~-----~r---- 444 (455) +......+..|+++.+=..+......++++.++.+.|+++.-..++.+ |+-| .-.-+.-. .. T Consensus 326 -----~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~---gl~p-~~ggD~~~~~~n~~~~~~~ 396 (414) T protein:vir:44 326 -----VRKSKQGVFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLE---DMNP-RPGGDVYLTPMNMTTKPSD 396 (414) T ss_pred -----CCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCC-CCCcceecccccccccCCc Confidence 111111233455553322222234557778889999999998888755 4422 11001000 00 Q ss_pred ---HHhhcccCCCC Q lcl|NC_020844. 445 ---LISELPGGIEE 455 (455) Q Consensus 445 ---i~~e~~~~l~~ 455 (455) ..++.+.+=.| T Consensus 397 ~~~~~~~~~~~~~d 410 (414) T protein:vir:44 397 GSKAGKQKDNANAD 410 (414) T ss_pred cccCCCCCCCCCCC Confidence 00011100001 No 216 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=42.15 E-value=0.9 Score=20.84 Aligned_cols=379 Identities=8% Similarity=0.002 Sum_probs=142.3 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHH--HHHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKR--YDLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~--Y~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |...++-.+--. ..++...+++.. +. -.+.+ +.+.+-.. ...+++.+ .+...|+-+++-+-. T Consensus 1 ~~~~~~~~~~k~--------~~~~~~~~~~~~-~~--~~~~~-~~~~~~~~v~~~~a~~~~----~V~~ci~~ia~~ia~ 64 (409) T protein:vir:96 1 MAKENIVTRIKK--------KLIDNWIDQSAS-KL--YDFSP-WKNKSFWGVINNTLETNE----TIFSAITKLSNSMAS 64 (409) T ss_pred Cccccchhhhhh--------HHhhhhhccccc-cc--ccccc-ccCccccccchhhHhhhH----HHHHHHHHHHHhhhh Confidence 776554333111 112222322110 00 00111 01111000 11233322 233444444444444 Q ss_pred CCcee---c-cCCHHHHHHH-HhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEech Q lcl|NC_020844. 79 REVEV---T-EGTGPSEDLL-EDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNH 153 (455) Q Consensus 79 k~p~i---~-~~~~~l~~~~-~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~ 153 (455) -|..+ + .....+..++ .... ...+-.+|.+.+....+.+|-+|+++..+..+.+. -+..++| T Consensus 65 lp~~~~~~~~~~~~~l~~lL~~~PN-~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~------------~L~~l~~ 131 (409) T protein:vir:96 65 LPLKMYEDYKVVNTEVSDLLTVSPN-NSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPS------------KLFLLNP 131 (409) T ss_pred CceEEeecccccchhHHHHHhhhcc-cCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEE------------EEEEEcC Confidence 34333 1 1122344444 3333 34678899999999999999999999766544321 2444445 Q ss_pred hhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEE-eCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCc Q lcl|NC_020844. 154 SAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRK-NDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFH 232 (455) Q Consensus 154 ~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~-~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~ 232 (455) ..|--+..+ .+....|+. ..+|....... -..|. |..... .+...+. T Consensus 132 ~~v~v~~~~------------------------~~~~~~y~~~~~~g~~~~~~~----~evih---~r~~~~-~~~~~G~ 179 (409) T protein:vir:96 132 DVVEMLIEN------------------------QSRELYYSIHAATGNKLIVHN----MDMLH---FKHIVA-SNMVQGI 179 (409) T ss_pred ceeEEEEeC------------------------CCcEEEEEEEcCCceEEEEcc----ccEEE---eCCCCC-CCccccc Confidence 444211110 000001111 11111111111 11222 222111 2334577 Q ss_pred chHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccC----c--cceeeccceeeeccCCCCccccceeEE Q lcl|NC_020844. 233 PPMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGS----P--KPLDTGPDTVLYAPPNAEGQHGSWNFV 306 (455) Q Consensus 233 ~pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~----~--~~~~~g~~~~~~lp~~~~~~~~~~~~l 306 (455) +||.-+.+. +........+ .+...+.+-.++.-....-.++.. . ....-+++.+..++.+ .++.-+ T Consensus 180 s~l~~~~~~-i~~~~~~~~~--~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~n~g~~~vl~~g-----~~~~~l 251 (409) T protein:vir:96 180 SPIDVLKNT-TDFDNAVRTF--NLTEMQKPDSFMLKYGSNVSTEKRQQVLEDFKQYYEENGGILFQEPG-----VEIEPL 251 (409) T ss_pred cHHHHHHHH-HHHHHHHHHH--HHHhcCCCceeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCC-----ceEEEc Confidence 787655543 2222222222 233333332222211111111110 0 0011233445555532 233333 Q ss_pred eeCchhHHHHHHHHHHHHHHHHHh---hhhhccCC-Ccch-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_020844. 307 EPATESLKFLAEQIESVSKEIREL---GRQPLTAQ-SGNL-TRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMD 381 (455) Q Consensus 307 e~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~-~~~~-sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g 381 (455) +.++..++ +.+..+....+|+.+ ...++... .++- +.++.. ..-....|.-++..++++++.-|-- .. + T Consensus 252 ~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~--~~f~~~~l~P~~~~ie~~l~~~Ll~--~~-~ 325 (409) T protein:vir:96 252 PKKYVSED-IVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELN--RFYLQHTLLPIVKQYEEEFNRKLLT--KT-D 325 (409) T ss_pred CCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH--HHHHHHHHHHHHHHHHHHHHhhcCC--cc-c Confidence 33333322 233333344445432 22333221 1121 212211 1223344555666666655542210 00 1 Q ss_pred CCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCC-------CCCCCCHHHHHHHHHhh---ccc Q lcl|NC_020844. 382 APDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGE-------LSDNFDPEEESERLISE---LPG 451 (455) Q Consensus 382 ~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~v-------l~~~~d~e~e~~ri~~e---~~~ 451 (455) ...+..|+++.+=.........++++.++++.|+++.-..++.+..--+ ++....+-+.....+.. ++. T Consensus 326 -~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~~~~~~~gG~~ 404 (409) T protein:vir:96 326 -REKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPLELRKSLKGGDK 404 (409) T ss_pred -ccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcceeeecccccccccchhhcccccCCCC Confidence 1123445555322222223555778888999999999888886632111 00000011111111111 111 Q ss_pred CCCC Q lcl|NC_020844. 452 GIEE 455 (455) Q Consensus 452 ~l~~ 455 (455) +-+| T Consensus 405 n~~e 408 (409) T protein:vir:96 405 NVNE 408 (409) T ss_pred CcCC Confidence 2222 No 217 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=41.97 E-value=0.9 Score=20.82 Aligned_cols=378 Identities=7% Similarity=-0.054 Sum_probs=141.1 Q ss_pred CCCCC-CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHH-hhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQD-PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLR-KSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~-v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~r-l~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |-=-+ +..+.+.....-+.| + .|..+++....-........ ++. +.+...|+-+++-+-+ T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~--~------------~~~~~~~~~~~~~~~~~~~~~~~~----~~v~~~i~~ia~~ia~ 62 (423) T protein:vir:81 1 MGFLQKLGLAPSVVATPEPIE--L------------VGPIFESLKLSTKNMTVEQIWEDQ----PHLRTVTTFIARNVAS 62 (423) T ss_pred CchhHhhccccccccCccccc--c------------ccccccccccccchhhHHHHHHhh----hHHHHHHHHHHHhHhh Confidence 22000 000000000000111 0 11111111110001112221 222 3344455555555544 Q ss_pred CCcee---c-------cCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceE Q lcl|NC_020844. 79 REVEV---T-------EGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYW 148 (455) Q Consensus 79 k~p~i---~-------~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~ 148 (455) -|..+ . .....+..++..-+ ...+-.+|.+.+....+.+|-+|++|.....+.... .| + T Consensus 63 lp~~~~~~~~dg~~~~~~~~~~~~ll~~PN-~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~---------~~-l 131 (423) T protein:vir:81 63 LQLQAFERVEDGGRERVREGHLARVCKLAN-SDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPT---------LD-I 131 (423) T ss_pred CceEEEEEecCCceeeeccchHHHHhhcCC-CCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcce---------EE-E Confidence 44322 0 11233556665543 346889999999999999999999997543222110 00 1 Q ss_pred EEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecC----eEEEEEE-eCCcEEEEeccCCCCCCceeEEEEEecc Q lcl|NC_020844. 149 SQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPD----DWERWRK-NDAGIWEVDEFGRITIGVVPMVPFITGR 223 (455) Q Consensus 149 ~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~----~~~~~~~-~~~g~~~~~~~g~~~l~~vP~V~f~~~~ 223 (455) ..+++..+. ++.+..+ .|++... ..+|...... .-..|.+ . +. T Consensus 132 ~p~~~~~v~------------------------~~~~~~~~~~~~Y~~~~~~~~~g~~~~~~----~~evih~---r-~~ 179 (423) T protein:vir:81 132 RPIPVSWVQ------------------------RRAYKDGWGSLDYIIIESGDNDGRSVKVP----GERVIHR---H-GY 179 (423) T ss_pred eecccceee------------------------eeeccCCCcceEEEEEEecCCCceEEEEc----ccceEEe---c-CC Confidence 112121110 0001000 0111111 1122221111 0112222 2 11 Q ss_pred cCCCcccCcchHHHHHHHHHHHHHhHHHHHHH-HHHcCCceEEEecC--------CCccccccCcc---cee---eccce Q lcl|NC_020844. 224 RKGKKWVFHPPMKDALDLQVALFRKECALEFA-EMMTAFPMLSANGV--------EPDTDSEGSPK---PLD---TGPDT 288 (455) Q Consensus 224 ~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~~-l~~~~~p~~~~~G~--------~~~~~~~~~~~---~~~---~g~~~ 288 (455) ...+...+.+|+.-++. .+........+... ....+.|-.+++-- +.+..+..... ... -.++. T Consensus 180 ~~~~~~~G~spi~~~~~-~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~ 258 (423) T protein:vir:81 180 NPKTMKRGKSPVQSLRD-ILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMANLRASFSPKSSDVGG 258 (423) T ss_pred CCCCccccccHHHHHHH-HHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHHHHHHhccccccCCc Confidence 22234457788776654 34444444444333 34456787777521 11111111100 011 11345 Q ss_pred eeeccCCCCccccceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCC-CcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 289 VLYAPPNAEGQHGSWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQ-SGNLTRISAAQAASKGNSAVQQWAAG 364 (455) Q Consensus 289 ~~~lp~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~-~~~~sa~a~~~~~~~~~s~L~~~a~~ 364 (455) ++.|+.+ .++.-++.+....+ ..+..+-...+|+.+ ...++... .++-+..+ .....-....|.-++.. T Consensus 259 ~~vl~~g-----~~~~~l~~s~~d~q-~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn~e-~~~~~f~~~~L~P~~~~ 331 (423) T protein:vir:81 259 TLLLEDG-----MKAENFHTTSKDEQ-TVETTKLSLQTVAQVYGINPTMVGQLDNANYSNVR-EFRKALYGDNLGSWIRI 331 (423) T ss_pred ceecCCC-----ceEEeccCChhhHH-HHHHHHhhHHHHHHHhCCCHHHhcCCCCCCcccHH-HHHHHHHHHHHHHHHHH Confidence 6666532 23433333333322 222233334444443 22333211 11212211 11112223345556666 Q ss_pred HHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHH-HcCCCCHHHHHHHHHhcCCCCCCCCHHHHHH Q lcl|NC_020844. 365 LKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMR-QNGDISREGLWMEFQRLGELSDNFDPEEESE 443 (455) Q Consensus 365 ~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~-~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ 443 (455) ++++++..|- -. .+....+..++++.+-..+.......+++.++. +.|+++.-.+++.+ |+ +|.- .-++.- T Consensus 332 ie~~l~~~L~--~~-~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R~~~---gl-~p~~-gGD~~~ 403 (423) T protein:vir:81 332 IQDVMNLFLL--PR-VGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVRAMD---NL-PSID-GGDDLA 403 (423) T ss_pred HHHHHhhhhc--Cc-cccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHHHHh---CC-CCCC-Ccceee Confidence 6666665431 11 122233445666543333322334455555555 46999988887765 44 2221 111111 Q ss_pred HH-----HhhcccCCCC Q lcl|NC_020844. 444 RL-----ISELPGGIEE 455 (455) Q Consensus 444 ri-----~~e~~~~l~~ 455 (455) .= .++.++.-++ T Consensus 404 ~p~n~~~~~~~~~~~~~ 420 (423) T protein:vir:81 404 RPLNTEFGDSEDAPGEE 420 (423) T ss_pred cccccccCccCCCCCCC Confidence 00 0001111111 No 218 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=39.21 E-value=1 Score=20.52 Aligned_cols=373 Identities=10% Similarity=-0.010 Sum_probs=143.2 Q ss_pred CCC---CCCCccCHHHHHHHH------HHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHH Q lcl|NC_020844. 1 MPE---QDPTKRSADAEAMSD------YLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEG 71 (455) Q Consensus 1 m~~---~~v~~~~p~y~~~~~------~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~ 71 (455) |-. +-+++..+.+..... .-.+. +..+|. +-..++.-. .+..++.+. +...|+- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~~~-~~~~~~-----------~~~~g~~v~-~~~al~~~~----v~~ci~~ 63 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGVPISLTDGSFW-SAWGGM-----------GSSSGETVT-ADSALQLSA----VWSCVRL 63 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCCcccCCchhHH-Hhhccc-----------ccCCCceec-hHhhhccHH----HHHHHHH Confidence 320 001111111111000 00000 011110 000111000 112222222 2334444 Q ss_pred HhhhhhcCCcee---------c-cCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHH Q lcl|NC_020844. 72 LASKPFGREVEV---------T-EGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEER 140 (455) Q Consensus 72 ~~g~lf~k~p~i---------~-~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~ 140 (455) +++-+-+-|..+ . ..+..+..++. .-. ...+-.+|.+.+....+.+|-+|+++.... +.+. T Consensus 64 Ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN-~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-g~~~------ 135 (437) T protein:vir:10 64 IAETIATLPLNLYQTKPDGTRVLAKQHRLYTVIHSQPN-AENTAAEFWEVIVASMLLWGNGYARKLRSA-GVLI------ 135 (437) T ss_pred HHHHHhhCceeEEEEcCCCceeeccccHHHHHhhccCC-cCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcEE------ Confidence 444443333222 0 12233445443 222 346788999999999999999999997542 3221 Q ss_pred hccCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEE Q lcl|NC_020844. 141 QAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFI 220 (455) Q Consensus 141 ~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~ 220 (455) -+..++|..|--+.. .+|. +. ..|. ..+|...... .-..|. |. T Consensus 136 ------~L~~l~p~~v~i~~~--~~g~--~~-------------------y~~~-~~~g~~~~~~----~~dIih---~r 178 (437) T protein:vir:10 136 ------GLELMLPQRTTVKRL--TSGA--LQ-------------------YTYR-NVDGTVSTLA----EDDVFH---VR 178 (437) T ss_pred ------EEEEEcCcceEEEEC--CCCe--EE-------------------EEEE-ecCceEEEEc----cccEEE---ec Confidence 134444544321111 0010 00 0011 1112111111 011222 22 Q ss_pred ecccCCCcccCcchHHHHHHHHHHHHHhHHHHHH-HHHHcCCceEEEecCC---CccccccCcc--ceeecc---ceeee Q lcl|NC_020844. 221 TGRRKGKKWVFHPPMKDALDLQVALFRKECALEF-AEMMTAFPMLSANGVE---PDTDSEGSPK--PLDTGP---DTVLY 291 (455) Q Consensus 221 ~~~~~~~~~~~~~pL~dla~l~~~hy~~~sd~~~-~l~~~~~p~~~~~G~~---~~~~~~~~~~--~~~~g~---~~~~~ 291 (455) .... +...+.+|+.-++.. +.......++.. .....+.|-.+++--. ++........ ...-|. +.++. T Consensus 179 ~~~~--d~~~G~spi~~~~~~-i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~nag~~~v 255 (437) T protein:vir:10 179 GFSL--DGLMGLTPIQYAREV-LGNSTAANKTSASVFRNGLRPSGVLSTDQILQKEKRAEIRTDLAEQFGGAMQAGKTMV 255 (437) T ss_pred CcCC--CCcccccHHHHHHHH-HHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcCccccCccee Confidence 2222 234678887665543 333333444433 3445567877777322 1111111100 111222 34555 Q ss_pred ccCCCCccccceeEEeeC--chhHHHHHHHHHHHHHHHHHh---hhhhccCC-Ccch--hHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 292 APPNAEGQHGSWNFVEPA--TESLKFLAEQIESVSKEIREL---GRQPLTAQ-SGNL--TRISAAQAASKGNSAVQQWAA 363 (455) Q Consensus 292 lp~~~~~~~~~~~~le~~--~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~-~~~~--sa~a~~~~~~~~~s~L~~~a~ 363 (455) |+. .++|.+++ +..++ +.+..+....+|+.+ ...++... .++. |..+. ....-....|.-+.. T Consensus 256 l~~-------g~~~~~l~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~-~~~~f~~~tl~P~~~ 326 (437) T protein:vir:10 256 LEA-------GMKYQAITMNPGDVQ-LLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQ-QTLGFLTFTLRPWLT 326 (437) T ss_pred ccC-------CceEEeccCChhhHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHH-HHHHHHHHHHHHHHH Confidence 653 24554444 33332 233334444445443 23333211 1111 22111 111123444555666 Q ss_pred HHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCC--CCCCH--- Q lcl|NC_020844. 364 GLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELS--DNFDP--- 438 (455) Q Consensus 364 ~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~--~~~d~--- 438 (455) .++++++.-|- .. +. .....|+++.+=..+....+.++++.++...|+++.-.+++.+ |+-| .+.+. T Consensus 327 ~ie~~l~~kll--~~--~e-~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~---gl~pi~gg~~~~~~ 398 (437) T protein:vir:10 327 RIEQAARRSLL--RP--GE-RDQFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTRDECRAKE---NLPPMGGNAAVLTV 398 (437) T ss_pred HHHHHHHhhcc--Cc--cc-cCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCCCCCCcceEee Confidence 66666665331 11 11 1123455543222222235567788889999999999888766 3322 11100 Q ss_pred H---HHHHHHHhhccc-----CCCC Q lcl|NC_020844. 439 E---EESERLISELPG-----GIEE 455 (455) Q Consensus 439 e---~e~~ri~~e~~~-----~l~~ 455 (455) . .-++.+.++.+. +++. T Consensus 399 ~~~~~~~~~~~~~~~~~~~~~~~~~ 423 (437) T protein:vir:10 399 QSALLPIDKLGEHTTATAAQDALKA 423 (437) T ss_pred cCcccchhhccCcCCCcchhccccc Confidence 0 011222222111 1111 No 219 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=38.25 E-value=1.1 Score=20.41 Aligned_cols=382 Identities=7% Similarity=-0.003 Sum_probs=141.1 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcC-HHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEG-VTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGR 79 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G-~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k 79 (455) |. +-.+..-.....+. ++.....- +..+-. ...+.++ ....- ..+..++... +...|+-+++-+-.- T Consensus 1 m~---~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~-~~~~~~~-~~~~v-~~~~a~~~~~----v~~~i~~ia~~iA~l 68 (412) T protein:vir:26 1 MN---VIAKENIVTRIKKK--LIDNWIDQSTSKLYD-FSPWKNR-SFWGV-INNTLETNET----IFSAITKLSNSMASL 68 (412) T ss_pred Cc---cchhhhhhhhhhhh--Hhhhhhccccccccc-ccccCCc-ccccc-chhhhhccHH----HHHHHHHHHHhHhhC Confidence 66 33222222211111 11111110 000000 0011110 11110 1123333332 333344444444333 Q ss_pred Ccee----ccCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechh Q lcl|NC_020844. 80 EVEV----TEGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHS 154 (455) Q Consensus 80 ~p~i----~~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~ 154 (455) |..+ ......+..++. ... ...+-.+|.+.+....+.+|-+|+++.+...+.+. -+.+++|. T Consensus 69 p~~~~~~~~~~~~~~~~lL~~~PN-~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~------------~L~~l~~~ 135 (412) T protein:vir:26 69 PLKMYEDYKVVNTEVSDLLTVSPN-NSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPS------------KLFLLNPD 135 (412) T ss_pred ceeEeeccccccchHHHHHHhhcc-cCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEE------------EEEEEcCc Confidence 3332 112233444443 333 34688999999999999999999999766544321 13444444 Q ss_pred hhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEe-CCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcc Q lcl|NC_020844. 155 AILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKN-DAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHP 233 (455) Q Consensus 155 ~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~-~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~ 233 (455) .|--+.... + ..+ .|+.. .+|.-.... .-..|. |.+.... +...+.+ T Consensus 136 ~v~v~~~~~-~--~~~---------------------~y~~~~~~g~~~~~~----~~evih---~~~~~~~-~~~~G~s 183 (412) T protein:vir:26 136 VVEMLIENQ-S--REL---------------------YYSIHAATGNKLIVH----NMDMLH---FKHIVAS-NMVQGIS 183 (412) T ss_pred eeEEEEeCC-C--cEE---------------------EEEEEcCCceEEEEc----cccEEE---eCCCCCC-CCccccc Confidence 442111110 0 000 01111 111111111 011222 2221122 2344777 Q ss_pred hHHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecCCCccccccC---cc--ceeeccceeeeccCCCCccccceeEEee Q lcl|NC_020844. 234 PMKDALDLQVALFRKECALEFAEMMTAFPMLSANGVEPDTDSEGS---PK--PLDTGPDTVLYAPPNAEGQHGSWNFVEP 308 (455) Q Consensus 234 pL~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~~~~~~~~~~---~~--~~~~g~~~~~~lp~~~~~~~~~~~~le~ 308 (455) |+.-++. .+........+ ........|-.+++.-....++... .. ...-+++.++.++. +.++.-++. T Consensus 184 ~i~~~~~-~i~~~~a~~~~-~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~-----g~~~~~l~~ 256 (412) T protein:vir:26 184 PIDVLKN-TTDFDNAVRTF-NLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEP-----GVEIEPLPK 256 (412) T ss_pred HHHHHHH-HHHHHHHHHHH-HHHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCC-----CceEEEcCC Confidence 7765543 23333222222 2223333333333211111111110 00 01123344555653 223333333 Q ss_pred CchhHHHHHHHHHHHHHHHHHh---hhhhccCCC-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCC Q lcl|NC_020844. 309 ATESLKFLAEQIESVSKEIREL---GRQPLTAQS-GNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPD 384 (455) Q Consensus 309 ~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~~-~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~ 384 (455) ++...+ +.+..+-...+|+.+ ...++.... .+-+..+ .....-....|.-+...++++++.-|---..+ . T Consensus 257 ~~~d~q-~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e-~~~~~f~~~~l~P~~~~ie~~ln~kLl~~~~~----~ 330 (412) T protein:vir:26 257 KYVSED-IVASENLTRERVANVFQLPSVFLNARSNTNFAKNE-ELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDR----E 330 (412) T ss_pred ChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCcccc----c Confidence 333322 233333344555443 223343221 1211111 11111223445555555665555422111111 1 Q ss_pred CceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCC----------CCCHHHHHHHHH---hhccc Q lcl|NC_020844. 385 TNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSD----------NFDPEEESERLI---SELPG 451 (455) Q Consensus 385 ~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~----------~~d~e~e~~ri~---~e~~~ 451 (455) .+..|+++.+=.......+.++++.++..+|+++.-..++.+ |+-|- ...+-+.....+ +-++. T Consensus 331 ~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~---gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~gG~~ 407 (412) T protein:vir:26 331 KNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWE---DLPPVEGGDKPLISGDLYPIDTPLELRKSLKGGDK 407 (412) T ss_pred CcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCCCCCcCeeeecccccccccchhhcccccCCCC Confidence 123455543222222335567888889999999999888876 33221 100101111111 01122 Q ss_pred CCCC Q lcl|NC_020844. 452 GIEE 455 (455) Q Consensus 452 ~l~~ 455 (455) +-+| T Consensus 408 n~~e 411 (412) T protein:vir:26 408 NVNE 411 (412) T ss_pred CcCC Confidence 2222 No 220 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=37.39 E-value=1.1 Score=20.31 Aligned_cols=366 Identities=9% Similarity=0.017 Sum_probs=143.8 Q ss_pred CCCCCC-CccCHHHHHHHHHHHHH-HHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQDP-TKRSADAEAMSDYLQTV-DDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~v-~~~~p~y~~~~~~w~~i-~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |-==+. ....+.-......|--+ ....-| ... .+ .| .+.. ..++. ..+..+|+.++.-+-+ T Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~-~~-~~------v~~~---~al~~----~~V~~~i~~Ia~~ia~ 63 (384) T protein:vir:49 1 MPIFNITNLATESPPSNQDSFFDITDPEFLD--ALN-GS-EW------VSAE---TALKN----SDLFSIISQLSNDLAT 63 (384) T ss_pred CccccccccCcccccccchhhccccchhhcc--ccc-CC-ce------echh---hhhcc----HHHHHHHHHHHHHHhh Confidence 552110 01111111111112111 111101 000 00 11 1111 12332 3344556666666666 Q ss_pred CCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechhhhcc Q lcl|NC_020844. 79 REVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHSAILE 158 (455) Q Consensus 79 k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~~ii~ 158 (455) -|..+.. .....++..-+ ...+-.+|.+.+....+.+|-+|+++..+..+.+. -+.+++|..|-- T Consensus 64 l~~~~~~--~~~~~l~~~PN-~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~------------~L~~l~~~~v~v 128 (384) T protein:vir:49 64 AKITTSR--KQLQGIVDNPS-NNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDM------------KWEYLRPSQVSF 128 (384) T ss_pred Cceeeec--chhhhhhhccC-CCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEE------------EEEEEcCceeEE Confidence 5665532 22334554444 24578999999999999999999999766544321 244444444421 Q ss_pred ceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeC--CcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchHH Q lcl|NC_020844. 159 IQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKND--AGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPMK 236 (455) Q Consensus 159 w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~--~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL~ 236 (455) ...+ .++ .+ ++.+...+ .+....... =.+|. |..... .+...+.+|+. T Consensus 129 ~~~~-~~~--~~-------------------~y~~~~~~~~~~~~~~~~~----~eVih---~~~~~~-~~~~~G~s~i~ 178 (384) T protein:vir:49 129 NRLD-NQN--GL-------------------YYNITFDDPRIPPKQHVPQ----GDILH---FRLLSV-DGGLTSVSPLM 178 (384) T ss_pred EEcC-CCc--eE-------------------EEEEEecCccccceeEecC----ccEEE---ecCCCC-CCceeeccHHH Confidence 0000 000 00 00111111 010000100 11222 222222 24456788877 Q ss_pred HHHHHHHHHHHhHHHHH-HHHHHcCCceEEEecCCCccccccC-----ccceeeccceeeeccCCCCccccceeEEeeCc Q lcl|NC_020844. 237 DALDLQVALFRKECALE-FAEMMTAFPMLSANGVEPDTDSEGS-----PKPLDTGPDTVLYAPPNAEGQHGSWNFVEPAT 310 (455) Q Consensus 237 dla~l~~~hy~~~sd~~-~~l~~~~~p~~~~~G~~~~~~~~~~-----~~~~~~g~~~~~~lp~~~~~~~~~~~~le~~~ 310 (455) .+... +........+. ......+.|-.+++--.....++.. ...-.-.++.++.++. +.++.-++.++ T Consensus 179 ~~~~~-i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~-----g~~~~~l~~~~ 252 (384) T protein:vir:49 179 ALGRE-LNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKTKQSRSRQAMKQMQGGPLVLDD-----LEDFTPLEIKS 252 (384) T ss_pred HHHHH-HHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhcccCCccceecCC-----CceEEEccCCh Confidence 66553 33344444443 3445567777777521111111000 0001112234555643 22333333333 Q ss_pred hhHHHHHHHHHHHHHHHHHh---hhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCce Q lcl|NC_020844. 311 ESLKFLAEQIESVSKEIREL---GRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDTNP 387 (455) Q Consensus 311 ~~l~~~~~~l~~~~~~m~~~---g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~~ 387 (455) ...+ +.+..+...++|+.+ ...++.....+. ++....+.. ..+.+.....-+...+++.|.. .++ T Consensus 253 ~d~q-~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~-~~~~~~~~~-~~~~i~~~l~pi~~~i~~~l~~---~l~------ 320 (384) T protein:vir:49 253 NVAQ-LLSQADWTTGQFAKVYGIPESVVGGEGDKQ-SSLEMIYNI-YFKAVSRFLRPFVSELSKKLSC---EVD------ 320 (384) T ss_pred hhHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCcc-ccHHHHHHH-HHHHHHHHHHHHHHHHHHHhch---hhh------ Confidence 3332 345555566666553 333343222221 111122111 1122222223333333333210 011 Q ss_pred EEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHHHHHHHhhcc-cCCCC Q lcl|NC_020844. 388 EADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEESERLISELP-GGIEE 455 (455) Q Consensus 388 ~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e~~ri~~e~~-~~l~~ 455 (455) ..+...+... .......+..+..+|++++-.+++.+.+.|+++ .|..+++.-.+ .|-++ T Consensus 321 -~~~~~~~~~~--~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~------ne~r~~~~~~p~~gGd~ 380 (384) T protein:vir:49 321 -ADILPAVDPT--GSNYIGLINSMVKTGTLAQNQGLYVLQQAEILP------KDLPEGETDSTLKGGET 380 (384) T ss_pred -hhhhhhhhcc--chHHHHHHHHHhhcCcccHHHHHHHHhhCCCCC------hhHHHHcCCCCCCCCCC Confidence 0111111110 011122233456789999999998888887764 12334444333 23333 No 221 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=32.53 E-value=1.4 Score=19.75 Aligned_cols=371 Identities=11% Similarity=0.059 Sum_probs=138.9 Q ss_pred CCCCCCCccCH-HHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhc- Q lcl|NC_020844. 1 MPEQDPTKRSA-DAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFG- 78 (455) Q Consensus 1 m~~~~v~~~~p-~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~- 78 (455) |+ ...+.. ......+.|. ...+|... ..|.... +....||| -.+..+-+.++...|. T Consensus 1 m~---~f~~~~~~~~~~~~~~~---~~~~~~~~-----~~~~~~~---------Al~~~~V~-~~i~~Ia~~iA~lp~~~ 59 (406) T protein:vir:97 1 MS---FFQPLGTSKVSYDDYIS---SVLAGDVS-----QKYLGVS---------ALKNSDIL-TATSIIAGDIARFPLVK 59 (406) T ss_pred Cc---cccccCCCCCCcchHHH---HHhcCCCC-----cccccch---------hhccHHHH-HHHHHHHHhhhhCeeEE Confidence 65 221110 0111112222 22444211 1222211 11222222 2233333333333331 Q ss_pred --CCceeccCCHHHHHHHHhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCC-CCcchhhhhHHhccCCceEEEechhh Q lcl|NC_020844. 79 --REVEVTEGTGPSEDLLEDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPA-GQQFRNREEERQAGVRPYWSQVNHSA 155 (455) Q Consensus 79 --k~p~i~~~~~~l~~~~~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~-~~~~~t~ad~~~~~~rPy~~~~~~~~ 155 (455) +...+. ....+..++..-=-...+-.+|.+.+....+.+|-+|+++.... .+.+. -+..++|.+ T Consensus 60 ~~~~g~~~-~~~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~------------~L~~i~p~~ 126 (406) T protein:vir:97 60 KDVNGDII-HDEDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQAL------------QFQFYRPSE 126 (406) T ss_pred EecCcccc-ccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEE------------EEEEECCCe Confidence 111111 12335555532112456889999999999999999999997643 22211 244555555 Q ss_pred hccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcchH Q lcl|NC_020844. 156 ILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPPM 235 (455) Q Consensus 156 ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~pL 235 (455) |.-... .++. +. + .+....+|...... .-.+|. |...+. +...+.+|+ T Consensus 127 v~v~~~--~~~~--~~----------y---------~~~~~~~~~~~~~~----~~evih---~r~~~~--dg~~G~spi 174 (406) T protein:vir:97 127 TTVEET--DNHE--IV----------Y---------TFTDMLTAKQVKCF----AHDVIH---WKFFSH--DTILGRSPL 174 (406) T ss_pred eEEEEc--CCce--EE----------E---------EEEecCCceEEEEc----cccEEE---ecCCCC--CCcccccHH Confidence 421110 0110 00 0 01111111111110 011222 222222 234578887 Q ss_pred HHHHHHHHHHHHhHHHHHHHHHHc-CCceEEEe-cC--CCccccccCc--cceeecc--ceeeeccCCCCccccceeEEe Q lcl|NC_020844. 236 KDALDLQVALFRKECALEFAEMMT-AFPMLSAN-GV--EPDTDSEGSP--KPLDTGP--DTVLYAPPNAEGQHGSWNFVE 307 (455) Q Consensus 236 ~dla~l~~~hy~~~sd~~~~l~~~-~~p~~~~~-G~--~~~~~~~~~~--~~~~~g~--~~~~~lp~~~~~~~~~~~~le 307 (455) .-+.+ .+.......++....+.. +.|-.++. +- +.+..+..+. +...-|. +.++.|+. +.++.-++ T Consensus 175 ~~~~~-~i~~~~a~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~-----g~~~~~l~ 248 (406) T protein:vir:97 175 LSLGD-EIDLQTGGINTLIKFFKDGFSSGILTMKGAQLSGDARQRARQEFEKMREGSVGGSPLVFDS-----TMEYTPLE 248 (406) T ss_pred HHHHH-HHHHHHHHHHHHHHHHhccCCCceEEecCCCCCHHHHHHHHHHHHHHhcccccCceeecCC-----CceEEEcc Confidence 65544 233333344444333333 33433322 11 1111010000 0111222 23455542 22344344 Q ss_pred eCchhHHHHHHHHHHHHHHHHH---hhhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCC Q lcl|NC_020844. 308 PATESLKFLAEQIESVSKEIRE---LGRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPD 384 (455) Q Consensus 308 ~~~~~l~~~~~~l~~~~~~m~~---~g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~ 384 (455) .++..++. .+..+-...+|.. +...++.... .-+.++.... .-....|.-++..++++++.-|- ..... T Consensus 249 ~~~~d~q~-le~~~~~~~~Ia~afgVPp~~lg~~~-~~~~~e~~~~-~f~~~~l~P~~~~ie~~l~~kll-----~~~~~ 320 (406) T protein:vir:97 249 IDTNVLQL-ITSNNFSTAQIAKALRVPSYKLGVNS-PNQSVAQLME-DYVTNDLPFYFDAITSELGLKTL-----NDKDR 320 (406) T ss_pred CCHHHHHH-HHHHHhhHHHHHHHhCCCHHHcCCCC-CcchHHHHHH-HHHHHHHHHHHHHHHHHHhhhhc-----Chhhc Confidence 44444332 2333333444433 2223332211 1122221111 11234456666666666655331 11001 Q ss_pred CceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCC---------CCHHHHH----HHHHhhccc Q lcl|NC_020844. 385 TNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDN---------FDPEEES----ERLISELPG 451 (455) Q Consensus 385 ~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~---------~d~e~e~----~ri~~e~~~ 451 (455) ....+++ +...+....++++.++++.|+++....++.+..--+-.+. .-+-++. +.......+ T Consensus 321 ~~~~i~f----d~~~~~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~g 396 (406) T protein:vir:97 321 RLYHIEF----DTRSVTGRNVDEIVKLVNNQILTPNQGLVELGKQKSTDPNMDRYQSSLNYVFLDKKEEYQDKVGIKGKG 396 (406) T ss_pred cceeEEE----ecCccchhhHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeEeeccCccchhcccccccccccccCC Confidence 1233333 3334455667888899999999999988877321111111 0111111 111111111 Q ss_pred C---CCC Q lcl|NC_020844. 452 G---IEE 455 (455) Q Consensus 452 ~---l~~ 455 (455) | -++ T Consensus 397 g~~~~~~ 403 (406) T protein:vir:97 397 GEVNAEE 403 (406) T ss_pred CCCCCCC Confidence 1 111 No 222 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=31.80 E-value=1.5 Score=19.66 Aligned_cols=377 Identities=8% Similarity=0.016 Sum_probs=142.4 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHH--HHHHHhhccccCChHHHHHHHHhhhhhc Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNK--RYDLRKSVAKMTNVFRDVLEGLASKPFG 78 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~--~Y~~rl~ra~~~n~~~~~v~~~~g~lf~ 78 (455) |.-+++-.+-.. +....|. +.. ..+. ..+-+ +...+-. ....+++. ..+...|+-+++-+-. T Consensus 1 ~~~~~~~~~~k~--~~~~~~~------~~~-~~~~--~~~~~-~~~~~~~~v~~~~a~~~----~~v~~~i~~Ia~~ia~ 64 (409) T protein:vir:94 1 MAKENIVTRIKK--KLIDNWI------DQS-ASKL--YDFSP-WKNKSFWGVINNTLETN----ETIFSAITKLSNSMAS 64 (409) T ss_pred Ccccccchhhhh--HHhhhhh------cCC-cccc--ccccc-ccCccccccchhhhhcc----HHHHHHHHHHHHhhhh Confidence 886665333222 2233332 110 0000 00111 0011100 01123332 2334445555555544 Q ss_pred CCcee----ccCCHHHHHHH-HhhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEech Q lcl|NC_020844. 79 REVEV----TEGTGPSEDLL-EDIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNH 153 (455) Q Consensus 79 k~p~i----~~~~~~l~~~~-~d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~ 153 (455) -|..+ ......+..++ .... ...+-.+|.+.+....+.+|-+|+++.....+.+. -+..++| T Consensus 65 lp~~~~~~~~~~~~~~~~lL~~~PN-~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~------------~L~~l~~ 131 (409) T protein:vir:94 65 LPLKMYEDYKVVNTEVSDLLTVSPN-NSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPS------------KLFLLNP 131 (409) T ss_pred CceeEeecccccchhHHHHHhhhcc-cCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE------------EEEEEcC Confidence 44433 11223344444 2233 34688999999999999999999999765444321 1334444 Q ss_pred hhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEe-CCcEEEEeccCCCCCCceeEEEEEecccCCCcccCc Q lcl|NC_020844. 154 SAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKN-DAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFH 232 (455) Q Consensus 154 ~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~-~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~ 232 (455) ..|--+.... ++ .+ .|+.. .+|...... .-.+|. |.... ..+...+. T Consensus 132 ~~v~v~~~~~-~~--~~---------------------~y~~~~~~g~~~~~~----~~dvih---~r~~~-~~~~~~G~ 179 (409) T protein:vir:94 132 DVVEMLIENQ-SR--EL---------------------YYSIHAATGNKLIVH----NMDMLH---FKHIV-ASNMVQGI 179 (409) T ss_pred ceeEEEEeCC-Cc--EE---------------------EEEEEcCCceEEEEc----cccEEE---ecCCC-CCCccccc Confidence 4442111100 00 00 11111 111111010 011222 22111 12334577 Q ss_pred chHHHHHHHHHHHHHhHHHHHHHHHHcCCc-eEEEe-cCCCcc--ccccCcc--ceeeccceeeeccCCCCccccceeEE Q lcl|NC_020844. 233 PPMKDALDLQVALFRKECALEFAEMMTAFP-MLSAN-GVEPDT--DSEGSPK--PLDTGPDTVLYAPPNAEGQHGSWNFV 306 (455) Q Consensus 233 ~pL~dla~l~~~hy~~~sd~~~~l~~~~~p-~~~~~-G~~~~~--~~~~~~~--~~~~g~~~~~~lp~~~~~~~~~~~~l 306 (455) +||.-+++. +........+ .+...+.+ ..++. +-...+ ....... ...-+++.+..++. +.++.-+ T Consensus 180 s~l~~~~~~-i~~~~~~~~~--~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~-----g~~~~~l 251 (409) T protein:vir:94 180 SPIDVLKNT-TDFDNAVRTF--NLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEP-----GVEIEPL 251 (409) T ss_pred cHHHHHHHH-HHHHHHHHHH--HHHhcCCCCeeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCC-----CceEEEc Confidence 787655442 2222222222 23333333 33332 211111 0000000 11123334555542 2233333 Q ss_pred eeCchhHHHHHHHHHHHHHHHHHh---hhhhccCCC-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_020844. 307 EPATESLKFLAEQIESVSKEIREL---GRQPLTAQS-GNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDA 382 (455) Q Consensus 307 e~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~~-~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~ 382 (455) +.++...+ +.+..+-...+|+.+ ...++.... ++-+.. ......-....|.-++..++++++.-|---. + T Consensus 252 ~~~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~-e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~---~- 325 (409) T protein:vir:94 252 PKKYVSED-IVASENLTRERVANVFQLPSVFLNARSNTNFAKN-EELNRFYLQHTLLPIVKQYEEEFNRKLLTKT---D- 325 (409) T ss_pred CCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccH-HHHHHHHHHHHHHHHHHHHHHHHHHhhCCcc---c- Confidence 33333322 233333344444332 223332211 121111 1111112334455555555555554221100 1 Q ss_pred CCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCC----------CCCHHHHHHHHHhh---c Q lcl|NC_020844. 383 PDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSD----------NFDPEEESERLISE---L 449 (455) Q Consensus 383 ~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~----------~~d~e~e~~ri~~e---~ 449 (455) ...+..|+++.+=.......+.++++.++++.|+++.-..++.+ |+-|- ...+-+.....+.. + T Consensus 326 ~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~---g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~kGG 402 (409) T protein:vir:94 326 REKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWE---DLPPVEGGDKPLISGDLYPIDTPLELRKSLKGG 402 (409) T ss_pred ccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCCCCCcCeEeecccccccccchhhcccccCC Confidence 11234455543222222235567888889999999998888766 33111 00011111111111 1 Q ss_pred ccCCCC Q lcl|NC_020844. 450 PGGIEE 455 (455) Q Consensus 450 ~~~l~~ 455 (455) +.+-+| T Consensus 403 ~~n~~e 408 (409) T protein:vir:94 403 DKNVNE 408 (409) T ss_pred CCCcCC Confidence 112222 No 223 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=28.58 E-value=1.7 Score=19.27 Aligned_cols=353 Identities=10% Similarity=0.007 Sum_probs=133.9 Q ss_pred CCCC-CCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcC Q lcl|NC_020844. 1 MPEQ-DPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGR 79 (455) Q Consensus 1 m~~~-~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k 79 (455) |.=- .+..++.+. . ..+.+. ... .. .. +.+++. ..+..+|+.+++-+-+- T Consensus 1 Mg~f~~~f~~~~~~-------~---~~~~~~-~~~----~~-------~~---~~a~~~----~~v~~~i~~ia~~ia~~ 51 (385) T protein:vir:95 1 MGLFDSVFKRHSEL-------S---WMYDLE-FLQ----DK-------SK---KAYLKQ----IALNTVVEMVARTISQS 51 (385) T ss_pred CchhhhhhccCccc-------c---cccchh-hhh----cc-------ch---hhhhhh----HHHHHHHHHHHHHHccc Confidence 4410 011111110 0 001110 000 00 00 112222 23345556555555555 Q ss_pred Cceec----cCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchhhhhHHhccCCceEEEechh Q lcl|NC_020844. 80 EVEVT----EGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVADSISWIRVDYPAGQQFRNREEERQAGVRPYWSQVNHS 154 (455) Q Consensus 80 ~p~i~----~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~~~~~~ 154 (455) |..+- .....+..++. .-+ ...+-.+|.+.+....+.+|-+|++++...... |-.....+. T Consensus 52 p~~~~~~~~~~~~~l~~lL~~~PN-~~~t~~~f~~~~~~~l~l~Gna~i~~~~~~~~~-------------~~~~~~~~~ 117 (385) T protein:vir:95 52 EFRVMKNNTKEKGTLYYLLNVRPN-RNQNAVDFWQKFIFKLIMDNEVLVVKNDEGHFF-------------VADDFEKED 117 (385) T ss_pred ceeeeecCccccchHHHHHhcccC-cCCCHHHHHHHHHHHHhhcCceEEEEecCCCee-------------ecccccccc Confidence 54431 11233445543 222 345778999999999999999999886321100 000000000 Q ss_pred hhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcccCcch Q lcl|NC_020844. 155 AILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKWVFHPP 234 (455) Q Consensus 155 ~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~~~~~p 234 (455) .+ .....+ ...+.+ .+.+.-. ..+-.. ++-|..... .+...+.+| T Consensus 118 ~~---------~~~~~~-------~~~~~~-----------~~~~~~~-----~~~~~e--iih~~~~~~-~~~~~G~s~ 162 (385) T protein:vir:95 118 EL---------GLYSHR-------FTNVLV-----------NDFEFKR-----VFTMDD--VIYLKYNNQ-KLDAFSLGL 162 (385) T ss_pred cc---------cccccc-------ceeeee-----------cccceee-----eecccc--EEEecCCCC-CcccccchH Confidence 00 000000 000000 0000000 011111 122222222 234457777 Q ss_pred HHHHHHHHHHHHHhHHHHHHHHHHcCCceEEEecC-CCccccccC----c------cceeeccceeeeccCCCCccccce Q lcl|NC_020844. 235 MKDALDLQVALFRKECALEFAEMMTAFPMLSANGV-EPDTDSEGS----P------KPLDTGPDTVLYAPPNAEGQHGSW 303 (455) Q Consensus 235 L~dla~l~~~hy~~~sd~~~~l~~~~~p~~~~~G~-~~~~~~~~~----~------~~~~~g~~~~~~lp~~~~~~~~~~ 303 (455) +..+.... ... ..+..+.+.|-.++.-- .....++.. . .+..-+.+.++.++.+. ++ T Consensus 163 ~~~~~~~i-~~~------~~~~~~~~~~~g~l~~~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~i~~l~~g~-----~~ 230 (385) T protein:vir:95 163 FEDYGEIF-GRM------IDLQMLNNQIRGILKVDATKFYNKEKQKELQAYIDTLFDAFQNNTIAVVPLTEGL-----AY 230 (385) T ss_pred HHHHHHHH-HHH------HHHHHhcCCCceEEEeCCccCCCHHHHHHHHHHHHHHhhhhhhcCCceEEcCCCc-----ee Confidence 76665533 211 12334455555554321 111111110 0 01111223355565332 34 Q ss_pred eEEeeCch-----hHHHHHHHHHHHHHHHHH---hhhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 304 NFVEPATE-----SLKFLAEQIESVSKEIRE---LGRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYI 375 (455) Q Consensus 304 ~~le~~~~-----~l~~~~~~l~~~~~~m~~---~g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~ 375 (455) .-++.+.. ....+.+..+....+|+. +...++.++. .+.++ ....-....|.-++..++++++.-|-- T Consensus 231 ~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~--sn~e~--~~~~~~~~~l~P~~~~ie~~l~~~L~~ 306 (385) T protein:vir:95 231 EEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVLGEM--ADLEK--TIESYLQFCINPLLRKIEAELNSKFFY 306 (385) T ss_pred EeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhcCCC--cCHHH--HHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 43332221 112234444445555544 3334453222 11122 111223445666666666666653311 Q ss_pred HHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCHHHH------HHHHHhhc Q lcl|NC_020844. 376 TNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDPEEE------SERLISEL 449 (455) Q Consensus 376 ~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~e~e------~~ri~~e~ 449 (455) -..+ ....|+++.+=.......+.++++.++++.|+++.-.+++.+ |+-|-+...-++ ...+.+-. T Consensus 307 ~~~~-----~~~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~---g~~p~~~~~gd~~~~~~n~~~~~~~k 378 (385) T protein:vir:95 307 QDEY-----LNDDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMT---GEEPADDPELDKFIITKNLQSADAFK 378 (385) T ss_pred hhhc-----ccceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCCCCCCCCceeeecccceeccccc Confidence 1111 122344443222222234567888889999999999888876 443211101111 11222111 Q ss_pred ccCCCC Q lcl|NC_020844. 450 PGGIEE 455 (455) Q Consensus 450 ~~~l~~ 455 (455) .+.-++ T Consensus 379 gge~~~ 384 (385) T protein:vir:95 379 GGESNE 384 (385) T ss_pred CCCCCC Confidence 111111 No 224 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=23.55 E-value=2.3 Score=18.61 Aligned_cols=363 Identities=11% Similarity=0.020 Sum_probs=136.2 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCChHHHHHHHHhhhhhcCC Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTNVFRDVLEGLASKPFGRE 80 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n~~~~~v~~~~g~lf~k~ 80 (455) |.-=++..+-. .+..+. -+..+..+. . .. ......|...... +.+...|+.+++-+-+-| T Consensus 1 Mg~~~~f~~k~--~~~~~~--~~~~~~~~~-~-----~~------~~~~~~~~~~~~~----~~V~~~I~~ia~~iA~~p 60 (403) T protein:vir:80 1 MGLFNFFRRKT--RSEPTN--AISWFLTQE-A-----YD------TLAIPGYTRLSDN----PEVRMAVHKIAELISSMT 60 (403) T ss_pred Ccccccccccc--cccccc--hhhhhcccc-c-----cc------ccccchhhhhhhh----HHHHHHHHHHHHhhhhCc Confidence 55322211100 000000 000001110 0 00 0001112221111 223344555544443333 Q ss_pred cee---c-----cCCHHHHHHHH-hhccCCCCHHHHHHHHHHHHHhh--CcEEEEEeeCCCCcchhhhhHHhccCCceEE Q lcl|NC_020844. 81 VEV---T-----EGTGPSEDLLE-DIDGRGNNLTTFAGQLFFQGVAD--SISWIRVDYPAGQQFRNREEERQAGVRPYWS 149 (455) Q Consensus 81 p~i---~-----~~~~~l~~~~~-d~d~~G~~l~~~~~~~~~~al~~--G~~~~lVd~p~~~~~~t~ad~~~~~~rPy~~ 149 (455) ..+ . .....+..++. .-. ...+-.+|.+.+..+.+.. |-+|+++..+..+.+. -+. T Consensus 61 ~~~~~~~~~g~~~~~~~~~~lL~~~PN-~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~~------------~L~ 127 (403) T protein:vir:80 61 IHLMQNTDNGDIRIKNELSRKIDINPY-SLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLID------------ELI 127 (403) T ss_pred eEEEEecCCceeecCChHHHHHhccCC-cCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcEE------------EEE Confidence 322 0 01122334332 222 2346778999988888875 5578887655433221 122 Q ss_pred EechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcEEEEeccCCCCCCceeEEEEEecccCCCcc Q lcl|NC_020844. 150 QVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGIWEVDEFGRITIGVVPMVPFITGRRKGKKW 229 (455) Q Consensus 150 ~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~~~~~~~g~~~l~~vP~V~f~~~~~~~~~~ 229 (455) .++|..|--+ .+.+.+++|... ...+-+. ++.|..+....+.. T Consensus 128 ~l~p~~v~~~-------------------------~~~~g~~~~y~~----------~~~~~~e--iih~~~~~~~~~~~ 170 (403) T protein:vir:80 128 PLAPSKVSFV-------------------------DTDTGYQIWYQG----------KAYNYDE--VLHFIVNPDPEKPY 170 (403) T ss_pred EEcCCeeEEE-------------------------EcCCceEEEEee----------cccchhh--EEEEeccCCCcCcc Confidence 3333333100 011111221100 0111122 22233333333445 Q ss_pred cCcchHHHHHHHHHHHHHhHHHHHHHH-HHcCCceEEEecCC---CccccccCcc--ceeec---cceeeeccCCCCccc Q lcl|NC_020844. 230 VFHPPMKDALDLQVALFRKECALEFAE-MMTAFPMLSANGVE---PDTDSEGSPK--PLDTG---PDTVLYAPPNAEGQH 300 (455) Q Consensus 230 ~~~~pL~dla~l~~~hy~~~sd~~~~l-~~~~~p~~~~~G~~---~~~~~~~~~~--~~~~g---~~~~~~lp~~~~~~~ 300 (455) .+.+|+.-+.. .+........+.... ...+.|-.++.--. .+..++.... ....| ++..+.+|.+.. . T Consensus 171 ~G~s~~~~~~~-~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~- 247 (403) T protein:vir:80 171 MGRGYRVVLKD-IVNNLKQATTTKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFKKYLEASEAGQPWIIPAELL-D- 247 (403) T ss_pred ccccHHHHHHH-HHHHHHHHHHHHHHHHhccCCcceEEEeCCCCChHHHHHHHHHHHHHHhhhhhcCCeeeeccccc-c- Confidence 67787665544 444444444544333 34466777764211 1111111100 00111 223344553221 0 Q ss_pred cceeEEeeCchhHHHHHHHHHHHHHHHHHh---hhhhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020844. 301 GSWNFVEPATESLKFLAEQIESVSKEIREL---GRQPLTAQSGNLTRISAAQAASKGNSAVQQWAAGLKDALENALYITN 377 (455) Q Consensus 301 ~~~~~le~~~~~l~~~~~~l~~~~~~m~~~---g~~~l~~~~~~~sa~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a 377 (455) ...+...+...++ +.+..+-...+|+.+ ...++.... ..+++... -....|.-++..++++++.-| T Consensus 248 -~~~~~~l~~~d~q-~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~~~~~~----f~~~~l~P~~~~ie~~l~~kl---- 316 (403) T protein:vir:80 248 -VEQVKPLSLKDLA-IHETVELDKRTVAGIFGVPAFLLGVGK-YDKDEYNN----FINSTILPIAKGIEQELTRKL---- 316 (403) T ss_pred -cceeccCCHHHHH-HHHHHHHhHHHHHHHhCCCHHHcCCCC-ccHHHHHH----HHHHHHHHHHHHHHHHHHHhc---- Confidence 0122222333332 234444444555443 223332111 12222222 233445566666666555422 Q ss_pred HHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCC----------CCCHHH---HHHH Q lcl|NC_020844. 378 LWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSD----------NFDPEE---ESER 444 (455) Q Consensus 378 ~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~----------~~d~e~---e~~r 444 (455) + .+.+..++++.+=.......+.++++.++.+.|+++.-.+++.+ |+-|- ...+-+ +.+. T Consensus 317 --l--~~~~~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~---gl~p~~ggd~~~~~~n~~pl~~~~~~~~ 389 (403) T protein:vir:80 317 --L--ISPDLYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWL---GLSPKEGLSELVILENYIPLDKIGDQNK 389 (403) T ss_pred --c--CCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCCCCCCCeEeecccccchhhccchhh Confidence 1 12334454442211222234567888889999999999888765 33221 011111 1111 Q ss_pred HHhhccc----CCCC Q lcl|NC_020844. 445 LISELPG----GIEE 455 (455) Q Consensus 445 i~~e~~~----~l~~ 455 (455) .+ .++. +=+| T Consensus 390 ~k-~ge~~~~~~~~~ 403 (403) T protein:vir:80 390 LK-GGEKGGADGQTD 403 (403) T ss_pred cc-CCCCCCCCCCCC Confidence 11 0110 1111 No 225 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=22.71 E-value=2.4 Score=18.49 Aligned_cols=422 Identities=9% Similarity=-0.006 Sum_probs=158.4 Q ss_pred CCCCC-----------------CCccCHHHHHHHHHHHHHHHHhcCHHHHHHhhhhcCCCCCCCCHHHHHHHhhccccCC Q lcl|NC_020844. 1 MPEQD-----------------PTKRSADAEAMSDYLQTVDDVLEGVTGMRRNFKRYVKQFPHEQNKRYDLRKSVAKMTN 63 (455) Q Consensus 1 m~~~~-----------------v~~~~p~y~~~~~~w~~i~d~~~G~~~~r~~g~~yLpk~~~e~~~~Y~~rl~ra~~~n 63 (455) |++.. ...+.|+=.++. .| +...+++ |-++|. +.++.+..+. ..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~-----~~~~~~~---~~~p~~-~~~~L~~~~e---~~~ 62 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADLAKSPNSTQIP------DH-----RIQSHNV---GVNPPY-NPDRLAAFLE---LNE 62 (651) T ss_pred CCCccceeeeeEEEeecccccccccccccccccc------hh-----hhcccCC---CCCCCC-CHHHHHHHHh---cCh Confidence 55332 111122211110 00 1111111 222332 4455555554 468 Q ss_pred hHHHHHHHHhhhhhcCCceec------cC---CH---HHHHHHHhh----------ccCCCCHHHHHHHHHHHHHhhCcE Q lcl|NC_020844. 64 VFRDVLEGLASKPFGREVEVT------EG---TG---PSEDLLEDI----------DGRGNNLTTFAGQLFFQGVADSIS 121 (455) Q Consensus 64 ~~~~~v~~~~g~lf~k~p~i~------~~---~~---~l~~~~~d~----------d~~G~~l~~~~~~~~~~al~~G~~ 121 (455) +.+..|+.+...+-+-+..+. .. .+ ....++++. -.-+.+...+.+.+..+.+.+|-+ T Consensus 63 ~~~~~i~~~~~~iag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna 142 (651) T protein:vir:99 63 TLATGIRKKSRYEVGFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWL 142 (651) T ss_pred HHHHHHHHHhhhhhccCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhH Confidence 889999999888866654431 11 11 122233221 011246677877777777777766 Q ss_pred EEEEeeCCCCcchhhh-----------hHHhcc--CCceEEEechhhhc------cceeeccCCeeEEEEEEEEeeeeEE Q lcl|NC_020844. 122 WIRVDYPAGQQFRNRE-----------EERQAG--VRPYWSQVNHSAIL------EIQSERQGAGERITYMRMWEDAETI 182 (455) Q Consensus 122 ~~lVd~p~~~~~~t~a-----------d~~~~~--~rPy~~~~~~~~ii------~w~~~~~~~~~~l~~v~~~E~~e~~ 182 (455) ++=|-....+.+...+ +..... ..+.+...+-.... .|..-..+..+.+ .+.....++ T Consensus 143 ~ieiIrn~~g~pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~---~~g~~~~~~ 219 (651) T protein:vir:99 143 ALEMLTDIEGRPVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFG---EAGDRYRGQ 219 (651) T ss_pred hhhhhhcCccchhhhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEE---Eeeccccce Confidence 5533211111111000 000000 00000000000000 0000000001111 111111111 Q ss_pred EE------------EecCeEEEEE----EeCCcEEEEeccCCCCCCcee---EEEEEecccCCCcccCcchHHHHHHHHH Q lcl|NC_020844. 183 LV------------LRPDDWERWR----KNDAGIWEVDEFGRITIGVVP---MVPFITGRRKGKKWVFHPPMKDALDLQV 243 (455) Q Consensus 183 ~v------------~~~~~~~~~~----~~~~g~~~~~~~g~~~l~~vP---~V~f~~~~~~~~~~~~~~pL~dla~l~~ 243 (455) .. +..+.+..+. ....|.|...+.+ ....+| |+-|..... .+...+.+|+.-++. .+ T Consensus 220 ~~~~~~~~~~v~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~--~~~~~~~~eViHir~~~~-~~g~~G~spl~~a~~-~i 295 (651) T protein:vir:99 220 EVVIDESGDEPTIRYREDEESEREPIFVDRETGDVTTGDAN--GLENRPANELIFIPNPSI-LEDDYGVPDWVSAIR-TI 295 (651) T ss_pred eeeeccCCcceeEEeccCcceeeeeecccceeeeEEEcCCC--ceeEecccceEEecCCCC-CCCcccccHHHHHHH-HH Confidence 11 0111111110 1111223222221 111222 233332222 234468888887665 34 Q ss_pred HHHHhHHHHHHHHHHc-CCceEEEe--cCCCccc--cccCc--cceeeccceeeeccCCCCc----cccceeEEeeCchh Q lcl|NC_020844. 244 ALFRKECALEFAEMMT-AFPMLSAN--GVEPDTD--SEGSP--KPLDTGPDTVLYAPPNAEG----QHGSWNFVEPATES 312 (455) Q Consensus 244 ~hy~~~sd~~~~l~~~-~~p~~~~~--G~~~~~~--~~~~~--~~~~~g~~~~~~lp~~~~~----~~~~~~~le~~~~~ 312 (455) .......++...++.. +.|..+++ |-....+ ...+. +...-++++.+.|+.++.. .+.+++|..++-.+ T Consensus 296 ~~a~~a~~~~~~~f~NG~~p~gil~~~~~~ls~e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~ 375 (651) T protein:vir:99 296 SADEAAKDYNRDFFDNDTIPRMVIKVTGGELSEESKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGI 375 (651) T ss_pred HHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCcCc Confidence 5555566665555444 56777765 3211111 11100 1122345566666543211 12246666554322 Q ss_pred H--HHHHHHHHHHHHHHHHh---hhhhc-cCCCcchhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC Q lcl|NC_020844. 313 L--KFLAEQIESVSKEIREL---GRQPL-TAQSGNLTR-ISAAQAASKGNSAVQQWAAGLKDALENALYITNLWMDAPDT 385 (455) Q Consensus 313 l--~~~~~~l~~~~~~m~~~---g~~~l-~~~~~~~sa-~a~~~~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~ 385 (455) . ..+.+..+....+|..+ ...++ ...+++-|. ++... .-....|.-+...++++++..| +....+..+. T Consensus 376 ~~D~qfle~r~~~~~eIa~afgVPp~~lG~~~~~~~sn~E~~~~--~f~~~tL~P~~~~ie~eln~kL--l~~~e~~~~~ 451 (651) T protein:vir:99 376 SEEMDFRQFREKNEHEIAKVLEVPPVKIGVTDSANRSNSDQQDK--DFALEVIQPEQHTFAEWLYQII--HQQALGVTDW 451 (651) T ss_pred hhhHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCcccHHHHHH--HHHHHHHHHHHHHHHHHHHHhh--cCccccccCc Confidence 1 22334444445555443 22222 112223222 22222 2234456666666776666533 2222222222 Q ss_pred ceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCCCCCCCH--HHHHHHHHh----hcccCC-CC Q lcl|NC_020844. 386 NPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRLGELSDNFDP--EEESERLIS----ELPGGI-EE 455 (455) Q Consensus 386 ~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~~vl~~~~d~--e~e~~ri~~----e~~~~l-~~ 455 (455) ...|+++.+-..+......++++..+.+.|+++.-.+++.+ |+-+ .-+. ..-+..+.. +...+. ++ T Consensus 452 ~i~~ef~~~~llr~D~~~~~e~~~~~i~~G~~T~NE~R~~l---glpp-i~~~~gd~~l~~~~~~~~g~~~~gge~~ 524 (651) T protein:vir:99 452 TIEYELRGADQPKQEAQLAEQRVRAMRLAGVGLVDEAREEL---GLDP-LGEPYGEMTLSEFEAEVAGDVAGGGETE 524 (651) T ss_pred eEEEEeccchhhhccHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCC-CCCccccccccccccccccccccCCCCc Confidence 23344432222222334556777889999999998888765 4422 1110 000000100 000000 11 No 226 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=21.04 E-value=2.7 Score=18.25 Aligned_cols=380 Identities=12% Similarity=0.039 Sum_probs=146.7 Q ss_pred CCCCCCCccCHHHHHHHHHHHHHHHH--hcCHHHHHHhhhhcCCCCCCCCH----HHHH-HHhhccccCChHHHHHHHHh Q lcl|NC_020844. 1 MPEQDPTKRSADAEAMSDYLQTVDDV--LEGVTGMRRNFKRYVKQFPHEQN----KRYD-LRKSVAKMTNVFRDVLEGLA 73 (455) Q Consensus 1 m~~~~v~~~~p~y~~~~~~w~~i~d~--~~G~~~~r~~g~~yLpk~~~e~~----~~Y~-~rl~ra~~~n~~~~~v~~~~ 73 (455) |--+.+ ---|....|.|.. +. ..|+ ...++|.....+- ..+. ..++ ...+...|+.++ T Consensus 71 ~kk~~i---~~pfkkk~~~~~~--d~f~~s~e------s~s~vtsls~pdaf~~vnVs~~~Alk----nsaV~scI~~IA 135 (945) T protein:vir:10 71 LKKEKI---IVPYNHQEPPFKF--NLFEYSPE------SLMYLPSISDPDAFFLINLFRKYRFN----NDSKLIKVSEIP 135 (945) T ss_pred HHhhcc---cccccccccchhh--hhhhccCc------cceecccccCccceeeehhhhhhhhc----cHHHHHHHHHHH Confidence 211111 0012222333322 11 2222 1235553321110 0111 1111 233344455554 Q ss_pred hhhhcCCcee----c-----------cCCHHHHHHHH--hhccCCC-CHHHHHHHHHHHHHhhCcEEEEEeeCCCCcchh Q lcl|NC_020844. 74 SKPFGREVEV----T-----------EGTGPSEDLLE--DIDGRGN-NLTTFAGQLFFQGVADSISWIRVDYPAGQQFRN 135 (455) Q Consensus 74 g~lf~k~p~i----~-----------~~~~~l~~~~~--d~d~~G~-~l~~~~~~~~~~al~~G~~~~lVd~p~~~~~~t 135 (455) +-+-+-|..+ . .....+..|+. |.+..+. .+..|.+.+....+.+|-+|+++-++..|.+. T Consensus 136 ~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~G~ii- 214 (945) T protein:vir:10 136 KKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDEQGNLV- 214 (945) T ss_pred hhhccCceEEEEecccCcccccccccccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE- Confidence 4444444322 0 01233556664 3322222 22347778889999999999999765544321 Q ss_pred hhhHHhccCCceEEEechhhhccceeeccCCeeEEEEEEEEeeeeEEEEEecCeEEEEEEeCCcE-EEEeccCCCCCCce Q lcl|NC_020844. 136 REEERQAGVRPYWSQVNHSAILEIQSERQGAGERITYMRMWEDAETILVLRPDDWERWRKNDAGI-WEVDEFGRITIGVV 214 (455) Q Consensus 136 ~ad~~~~~~rPy~~~~~~~~ii~w~~~~~~~~~~l~~v~~~E~~e~~~v~~~~~~~~~~~~~~g~-~~~~~~g~~~l~~v 214 (455) -+..++|..|--... ..+ .. +..|+...+|. .... ..-+.| T Consensus 215 -----------~L~pLdPs~Vti~~d-dDG-~~---------------------~y~Yv~~idG~~~~~v----~a~DvI 256 (945) T protein:vir:10 215 -----------AITPVDGTTIKPILS-EDT-GI---------------------VVGYVQEVDGAIVAHF----DKRDVV 256 (945) T ss_pred -----------EEEEECCcceEEEEc-CCC-cE---------------------EEEEEEecCCceEEEe----cCCceE Confidence 244555554421110 001 00 01111111111 0000 011223 Q ss_pred eEEEEEecccCCC-cccCcchHHHHHHHHHHHHHhHHHHHHHHH--HcCCceEEEe--cCCC-----------ccccccC Q lcl|NC_020844. 215 PMVPFITGRRKGK-KWVFHPPMKDALDLQVALFRKECALEFAEM--MTAFPMLSAN--GVEP-----------DTDSEGS 278 (455) Q Consensus 215 P~V~f~~~~~~~~-~~~~~~pL~dla~l~~~hy~~~sd~~~~l~--~~~~p~~~~~--G~~~-----------~~~~~~~ 278 (455) -++ ++...++. ...+.+|+.-++. .+........+....+ ..+.|-.++. |-.. +.....+ T Consensus 257 lhi--rn~s~DG~~~GyGlSPIeaa~~-aI~~alAaek~aar~FskNGa~PsGILsvkg~~~~d~k~~~~LseEq~erlK 333 (945) T protein:vir:10 257 LFR--QNLTPDVYMYGYSLPPIEILYK-VILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYKEGDIYPQLSREQLESIQ 333 (945) T ss_pred EEe--ccCCCCcccccCCchHHHHHHH-HHHHHHHHHHHHHHHHHhCCCccceEEEecCccccccccccccCHHHHHHHH Confidence 222 22222221 2347788876664 3333344444433333 3457765553 2111 1000000 Q ss_pred c--cceeeccce--eeeccCCCCccccceeEEeeCchhHHH-HHHHHHHHHHHHHHh---hhhhcc-CCCcchhHHHHHH Q lcl|NC_020844. 279 P--KPLDTGPDT--VLYAPPNAEGQHGSWNFVEPATESLKF-LAEQIESVSKEIREL---GRQPLT-AQSGNLTRISAAQ 349 (455) Q Consensus 279 ~--~~~~~g~~~--~~~lp~~~~~~~~~~~~le~~~~~l~~-~~~~l~~~~~~m~~~---g~~~l~-~~~~~~sa~a~~~ 349 (455) . ....-|.++ .+.++. .++|.+.+.++-.+ +.+..+-...+|+.+ ...++. .++.+.|..+ .. T Consensus 334 e~wee~~sG~NnG~piVLde-------Gmef~pLs~s~~DaQfLEsrkfs~eeIArAFGVPP~lLG~~e~st~SNiE-qq 405 (945) T protein:vir:10 334 RQLQAIMMGDYTQVPILSGG-------KFTWIDFKGKRRDMQFKELAEFVARKICAVYQVSPQDVGILEGSNKATAE-VM 405 (945) T ss_pred HHHHHHhCCcccccceecCC-------CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcchHH-HH Confidence 0 001112222 233432 24554444333222 234445555555443 222231 2222222222 22 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecccCCCCCCHHHHHHHHHHHHcCCCCHHHHHHHHHhc Q lcl|NC_020844. 350 AASKGNSAVQQWAAGLKDALENALYITNLWMDAPDTNPEADVYTDFRADTGEEQTPGYLLTMRQNGDISREGLWMEFQRL 429 (455) Q Consensus 350 ~~~~~~s~L~~~a~~~~~a~~~~l~~~a~~~g~~~~~~~v~v~~df~~~~~~~~~~~al~~~~~~G~is~et~~~~l~~~ 429 (455) ...-....|.-+...++++++..|. ... ......++++. .......+.++++.++.+.|++|.-.+++.+ T Consensus 406 ~~~Fv~~tL~Pil~~IEqeLNrkLl---~~~--eg~~i~fdFd~--ldl~D~ksraEal~kli~sGiLTiNEvRe~l--- 475 (945) T protein:vir:10 406 ASLTKAKGLEPLMATISKGFDEVVS---EFR--NEKDIKLWFKE--DDLEKERDWWNIIQGQLNTGFRSINEARMEK--- 475 (945) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcc---ccc--cCceeEEEecc--hhccCHHHHHHHHHHHHhCCCcCHHHHHHHh--- Confidence 2223345577777778888776542 111 12233444433 2222224557778888999999998888765 Q ss_pred CCCCC-----------CCCHHHHHHHHH------------hhcc----cCCCC Q lcl|NC_020844. 430 GELSD-----------NFDPEEESERLI------------SELP----GGIEE 455 (455) Q Consensus 430 ~vl~~-----------~~d~e~e~~ri~------------~e~~----~~l~~ 455 (455) |+-|- ...+.++...-+ .+.+ ++-+| T Consensus 476 GLpPIeGGD~lli~~nn~~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGGe~dE 528 (945) T protein:vir:10 476 GLEPVPWGDVPFSGLRNWKPEDEQAKAQQGAMPPQLAQAMADQPSQQGGGVDE 528 (945) T ss_pred CCCCCCCcceeeeccccccccccccccccCCCCcccccCCCCCCCCCCCCCCC Confidence 32111 111111110000 0000 00011 Done!