Query lcl|NC_021326.1_cdsid_YP_008059004.1 [gene=M175_gp31] [protein=putative portal protein] [protein_id=YP_008059004.1] [location=complement(25642..26979)] Match_columns 445 No_of_seqs 128 out of 529 Neff 10.1 Searched_HMMs 1612 Date Thu Nov 7 19:00:02 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_31 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_31_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:94805 Length: 492 100.0 2E-106 1E-109 600.3 52.4 445 1-445 48-492 (492) 2 protein:vir:1236 Length: 483 # 100.0 2E-106 1E-109 600.1 52.5 445 1-445 39-483 (483) 3 protein:vir:97336 Length: 492 100.0 2E-106 2E-109 599.6 52.5 445 1-445 48-492 (492) 4 protein:vir:93747 Length: 472 100.0 2E-105 2E-108 594.1 52.6 445 1-445 28-472 (472) 5 protein:vir:96266 Length: 474 100.0 4E-104 2E-107 587.5 50.1 444 1-445 31-474 (474) 6 protein:vir:95899 Length: 474 100.0 4E-104 2E-107 587.5 50.1 444 1-445 31-474 (474) 7 protein:vir:97447 Length: 474 100.0 3E-103 2E-106 582.3 51.5 444 1-445 31-474 (474) 8 protein:vir:94498 Length: 474 100.0 3E-103 2E-106 582.3 51.5 444 1-445 31-474 (474) 9 protein:vir:95113 Length: 474 100.0 7E-102 4E-105 575.2 51.6 444 1-445 31-474 (474) 10 protein:vir:107112 Length: 478 100.0 4E-100 3E-103 565.3 51.2 443 1-445 30-478 (478) 11 protein:vir:105292 Length: 478 100.0 1.9E-99 1E-102 561.8 51.2 443 1-444 30-478 (478) 12 protein:vir:96839 Length: 474 100.0 4.8E-99 3E-102 559.6 50.8 440 1-441 30-474 (474) 13 protein:vir:96179 Length: 468 100.0 1.3E-97 8E-101 551.8 51.0 433 1-437 30-468 (468) 14 protein:vir:105461 Length: 470 100.0 4.3E-95 2.7E-98 537.9 50.0 436 1-443 9-470 (470) 15 protein:vir:102950 Length: 471 100.0 7.2E-95 4.4E-98 536.7 50.0 433 1-435 9-471 (471) 16 protein:vir:5961 Length: 503 # 100.0 3.1E-94 1.9E-97 533.3 50.4 443 1-445 33-495 (503) 17 protein:vir:97171 Length: 512 100.0 4.1E-94 2.5E-97 532.6 50.2 434 1-445 44-511 (512) 18 protein:vir:9306 Length: 511 # 100.0 7.6E-94 4.7E-97 531.1 49.9 434 1-445 44-510 (511) 19 protein:vir:102330 Length: 451 100.0 8.3E-94 5.2E-97 530.9 49.5 423 1-431 5-451 (451) 20 protein:vir:99781 Length: 511 100.0 7.9E-94 4.9E-97 531.0 48.9 434 1-445 44-511 (511) 21 protein:vir:96240 Length: 511 100.0 2.6E-93 1.6E-96 528.2 50.3 434 1-445 44-510 (511) 22 protein:vir:103951 Length: 511 100.0 3E-93 1.8E-96 527.9 50.4 434 1-445 44-510 (511) 23 protein:vir:96366 Length: 511 100.0 2.2E-93 1.4E-96 528.6 49.3 434 1-445 44-510 (511) 24 protein:vir:78805 Length: 511 100.0 2.2E-93 1.4E-96 528.6 49.3 434 1-445 44-510 (511) 25 protein:vir:106571 Length: 499 100.0 2.8E-93 1.7E-96 528.0 49.8 433 1-445 20-487 (499) 26 protein:vir:79043 Length: 479 100.0 8E-93 5E-96 525.5 49.4 435 1-436 23-479 (479) 27 protein:vir:3609 Length: 452 # 100.0 1.8E-92 1.1E-95 523.5 50.4 422 1-445 21-452 (452) 28 protein:vir:3964 Length: 453 # 100.0 9.9E-92 6.1E-95 519.5 49.6 422 1-443 21-453 (453) 29 protein:vir:2732 Length: 501 # 100.0 9E-92 5.6E-95 519.7 49.3 429 1-445 43-498 (501) 30 protein:vir:4898 Length: 502 # 100.0 1.6E-91 9.9E-95 518.4 49.2 428 1-445 44-498 (502) 31 protein:vir:96494 Length: 501 100.0 1.9E-91 1.2E-94 517.9 48.9 428 1-445 43-497 (501) 32 protein:vir:9871 Length: 429 # 100.0 6.4E-91 3.9E-94 515.1 49.8 416 1-441 5-429 (429) 33 protein:vir:94101 Length: 474 100.0 2.9E-90 1.8E-93 511.4 47.5 427 1-445 19-474 (474) 34 protein:vir:105889 Length: 474 100.0 2.9E-90 1.8E-93 511.4 47.5 427 1-445 19-474 (474) 35 protein:vir:99522 Length: 470 100.0 1.7E-89 1.1E-92 507.2 49.8 424 1-441 29-470 (470) 36 protein:vir:94546 Length: 506 100.0 7E-90 4.3E-93 509.4 47.2 429 1-445 26-503 (506) 37 protein:vir:733 Length: 453 # 100.0 1.1E-89 6.6E-93 508.4 48.0 417 1-437 21-453 (453) 38 protein:vir:106639 Length: 481 100.0 1.3E-88 8.1E-92 502.4 50.9 428 1-444 34-481 (481) 39 protein:vir:95806 Length: 440 100.0 1.9E-88 1.2E-91 501.5 47.6 421 1-438 1-440 (440) 40 protein:vir:78083 Length: 537 100.0 2.8E-88 1.8E-91 500.6 47.4 441 1-445 16-522 (537) 41 protein:vir:9922 Length: 489 # 100.0 3.8E-86 2.3E-89 488.9 47.2 428 1-442 19-489 (489) 42 protein:vir:78537 Length: 480 100.0 1.3E-75 7.9E-79 431.2 44.9 429 1-445 7-466 (480) 43 protein:vir:78227 Length: 480 100.0 2.3E-75 1.4E-78 429.8 44.8 429 1-445 7-466 (480) 44 protein:vir:4223 Length: 486 # 100.0 4.2E-75 2.6E-78 428.4 43.1 425 1-445 17-477 (486) 45 protein:vir:2500 Length: 501 # 100.0 2.2E-74 1.3E-77 424.5 43.7 432 1-445 31-494 (501) 46 protein:vir:2427 Length: 485 # 100.0 4.6E-74 2.9E-77 422.6 44.4 425 1-445 17-483 (485) 47 protein:vir:104082 Length: 485 100.0 1.4E-73 8.8E-77 420.0 45.2 425 1-445 17-478 (485) 48 protein:vir:2341 Length: 488 # 100.0 1.1E-73 7E-77 420.5 44.4 426 1-445 12-486 (488) 49 protein:vir:102602 Length: 456 100.0 4.2E-73 2.6E-76 417.4 43.4 424 1-434 9-456 (456) 50 protein:vir:105819 Length: 456 100.0 4.2E-73 2.6E-76 417.4 43.4 424 1-434 9-456 (456) 51 protein:vir:99072 Length: 479 100.0 3.1E-73 1.9E-76 418.1 42.4 425 1-445 18-470 (479) 52 protein:vir:80680 Length: 441 100.0 1.9E-72 1.2E-75 413.8 44.2 411 1-439 8-441 (441) 53 protein:vir:7768 Length: 484 # 100.0 1.2E-72 7.2E-76 415.0 42.1 425 1-445 16-481 (484) 54 protein:vir:7987 Length: 456 # 100.0 7.8E-72 4.9E-75 410.4 43.4 424 1-434 9-456 (456) 55 protein:vir:99916 Length: 504 100.0 3.7E-67 2.3E-70 384.8 42.3 423 1-445 25-492 (504) 56 protein:vir:98444 Length: 434 100.0 1.1E-65 6.6E-69 376.8 40.9 400 31-445 1-431 (434) 57 protein:vir:9751 Length: 422 # 100.0 1.2E-63 7.6E-67 365.5 37.8 390 1-418 5-422 (422) 58 protein:vir:94742 Length: 409 100.0 3.7E-62 2.3E-65 357.4 39.1 377 1-405 5-409 (409) 59 protein:vir:9568 Length: 410 # 100.0 3.3E-62 2.1E-65 357.6 37.8 384 6-421 1-410 (410) 60 protein:vir:8184 Length: 474 # 100.0 1.5E-61 9.2E-65 354.1 41.1 415 1-436 19-474 (474) 61 protein:vir:38 Length: 496 # N 100.0 3.1E-60 1.9E-63 346.8 44.3 428 1-436 21-496 (496) 62 protein:vir:1634 Length: 409 # 100.0 6E-61 3.7E-64 350.7 37.6 377 1-405 5-409 (409) 63 protein:vir:80959 Length: 499 100.0 1.1E-57 6.7E-61 332.9 45.9 428 1-435 8-499 (499) 64 protein:vir:79703 Length: 505 100.0 2.5E-52 1.6E-55 303.5 44.2 421 1-431 10-505 (505) 65 protein:vir:1587 Length: 508 # 100.0 3.8E-52 2.3E-55 302.5 41.3 426 1-445 10-508 (508) 66 protein:vir:4782 Length: 522 # 100.0 2E-49 1.3E-52 287.5 45.1 431 1-444 10-522 (522) 67 protein:vir:9815 Length: 500 # 100.0 1E-48 6.5E-52 283.6 42.0 422 1-445 10-500 (500) 68 protein:vir:3028 Length: 500 # 100.0 1E-48 6.5E-52 283.6 42.0 422 1-445 10-500 (500) 69 protein:vir:101494 Length: 527 100.0 3.1E-47 1.9E-50 275.6 37.0 437 1-445 20-523 (527) 70 protein:vir:102239 Length: 527 100.0 3.8E-47 2.4E-50 275.1 36.9 437 1-445 20-523 (527) 71 protein:vir:78907 Length: 518 100.0 3.8E-45 2.3E-48 264.1 41.5 429 1-439 6-518 (518) 72 protein:vir:98883 Length: 517 100.0 3.1E-42 1.9E-45 248.2 43.0 428 1-444 10-517 (517) 73 protein:vir:7430 Length: 563 # 100.0 1.2E-40 7.2E-44 239.5 35.4 434 1-445 18-542 (563) 74 protein:vir:94956 Length: 452 100.0 1.8E-31 1.1E-34 189.1 34.8 418 1-445 2-451 (452) 75 protein:vir:97265 Length: 513 100.0 8.6E-31 5.4E-34 185.4 37.1 429 1-445 8-501 (513) 76 protein:vir:80453 Length: 535 100.0 7.5E-28 4.6E-31 169.3 37.8 433 1-445 34-535 (535) 77 protein:vir:95149 Length: 501 99.9 7.1E-27 4.4E-30 163.9 38.2 430 1-443 3-501 (501) 78 protein:vir:78393 Length: 489 99.9 2.2E-26 1.4E-29 161.3 37.7 419 1-444 1-489 (489) 79 protein:vir:95014 Length: 491 99.9 2.9E-25 1.8E-28 155.1 34.0 421 1-438 1-491 (491) 80 protein:vir:96783 Length: 488 99.9 1E-23 6.3E-27 146.6 33.4 411 1-422 1-488 (488) 81 protein:vir:93630 Length: 776 99.8 1E-20 6.2E-24 130.2 31.8 433 1-445 46-675 (776) 82 protein:vir:108295 Length: 711 99.8 4.8E-19 3E-22 121.0 30.4 435 1-445 31-668 (711) 83 protein:vir:9950 Length: 714 # 99.8 4.6E-17 2.9E-20 110.1 33.7 417 1-445 20-613 (714) 84 protein:vir:3296 Length: 714 # 99.8 4.6E-17 2.9E-20 110.1 33.7 417 1-445 20-613 (714) 85 protein:vir:10117 Length: 714 99.8 4.6E-17 2.9E-20 110.1 33.7 417 1-445 20-613 (714) 86 protein:vir:817 Length: 714 # 99.8 4.6E-17 2.9E-20 110.1 33.7 417 1-445 20-613 (714) 87 protein:vir:2764 Length: 714 # 99.8 4.6E-17 2.9E-20 110.1 33.7 417 1-445 20-613 (714) 88 protein:vir:8846 Length: 705 # 99.7 1.1E-16 6.7E-20 108.1 33.2 426 1-445 18-629 (705) 89 protein:vir:80040 Length: 461 99.7 4.6E-17 2.9E-20 110.1 28.5 401 1-445 8-460 (461) 90 protein:vir:104437 Length: 714 99.7 7.3E-16 4.5E-19 103.6 35.0 424 1-445 24-620 (714) 91 protein:vir:80165 Length: 651 99.7 2.4E-15 1.5E-18 100.7 34.4 428 1-445 24-623 (651) 92 protein:vir:105619 Length: 772 99.7 5.5E-16 3.4E-19 104.2 30.3 431 1-445 26-650 (772) 93 protein:vir:5249 Length: 437 # 99.6 1.6E-14 1E-17 96.2 30.9 386 13-445 1-437 (437) 94 protein:vir:96738 Length: 505 99.6 6.1E-13 3.8E-16 87.6 37.8 425 1-445 18-504 (505) 95 protein:vir:77597 Length: 725 99.6 1.2E-13 7.2E-17 91.5 32.8 433 1-445 7-628 (725) 96 protein:vir:389 Length: 530 # 99.6 1.1E-12 6.5E-16 86.3 38.0 426 1-445 13-525 (530) 97 protein:vir:3420 Length: 533 # 99.6 1.2E-12 7.7E-16 85.9 37.5 424 1-445 11-529 (533) 98 protein:vir:100920 Length: 725 99.6 1.8E-14 1.1E-17 96.0 26.7 430 1-445 7-631 (725) 99 protein:vir:6382 Length: 553 # 99.5 2E-12 1.3E-15 84.7 36.8 425 1-445 17-553 (553) 100 protein:vir:9263 Length: 725 # 99.5 4.1E-14 2.5E-17 94.0 26.4 430 1-445 7-631 (725) 101 protein:vir:95449 Length: 584 99.5 1.3E-13 7.9E-17 91.3 28.3 414 1-431 20-584 (584) 102 protein:vir:79538 Length: 502 99.5 6.6E-12 4.1E-15 81.9 37.7 414 1-445 3-502 (502) 103 protein:vir:107742 Length: 537 99.4 5E-13 3.1E-16 88.0 26.6 395 1-445 57-527 (537) 104 protein:vir:95542 Length: 548 99.4 1.5E-11 9.2E-15 79.9 36.9 413 1-445 16-514 (548) 105 protein:vir:79647 Length: 435 99.4 2.9E-13 1.8E-16 89.3 24.5 385 2-445 1-433 (435) 106 protein:vir:10321 Length: 495 99.4 1.8E-11 1.1E-14 79.5 37.4 417 1-445 8-495 (495) 107 protein:vir:105429 Length: 708 99.4 2.7E-12 1.7E-15 84.0 29.5 437 1-445 8-636 (708) 108 protein:vir:94049 Length: 532 99.4 3.6E-12 2.2E-15 83.3 30.2 396 1-445 57-513 (532) 109 protein:vir:3139 Length: 599 # 99.4 1.3E-12 8.1E-16 85.7 27.3 423 1-437 21-599 (599) 110 protein:vir:3520 Length: 720 # 99.4 3.3E-12 2.1E-15 83.5 28.2 434 1-445 8-643 (720) 111 protein:vir:96068 Length: 765 99.4 3.6E-12 2.2E-15 83.3 26.3 391 1-445 79-539 (765) 112 protein:vir:104338 Length: 422 99.3 1.6E-11 9.9E-15 79.8 28.8 381 13-445 1-422 (422) 113 protein:vir:99563 Length: 862 99.3 1.5E-11 9.5E-15 79.9 28.0 400 1-445 94-565 (862) 114 protein:vir:95821 Length: 763 99.3 1.2E-10 7.4E-14 75.0 31.9 419 1-445 31-656 (763) 115 protein:vir:107662 Length: 427 99.3 8.5E-12 5.3E-15 81.3 25.2 376 6-445 1-426 (427) 116 protein:vir:105520 Length: 706 99.3 1.3E-10 7.8E-14 74.9 31.6 436 1-445 8-635 (706) 117 protein:vir:172 Length: 708 # 99.2 3.4E-10 2.1E-13 72.5 33.8 438 1-445 8-636 (708) 118 protein:vir:94599 Length: 641 99.2 2E-10 1.2E-13 73.8 29.0 437 1-445 28-612 (641) 119 protein:vir:103765 Length: 549 99.1 1.6E-09 9.9E-13 68.8 34.6 428 1-445 8-546 (549) 120 protein:vir:95315 Length: 559 99.0 5.7E-09 3.6E-12 65.8 33.7 429 1-445 5-542 (559) 121 protein:vir:7321 Length: 556 # 98.9 1.3E-08 8.2E-12 63.8 31.8 432 1-445 8-542 (556) 122 protein:vir:1538 Length: 535 # 98.9 1.6E-08 1E-11 63.3 36.0 428 1-445 13-525 (535) 123 protein:vir:3153 Length: 467 # 98.9 2.5E-08 1.6E-11 62.2 30.2 379 43-445 1-444 (467) 124 protein:vir:107822 Length: 555 98.8 3.5E-08 2.2E-11 61.5 34.8 429 1-445 6-543 (555) 125 protein:vir:107404 Length: 555 98.8 3.5E-08 2.2E-11 61.5 34.8 429 1-445 6-543 (555) 126 protein:vir:98506 Length: 555 98.8 3.5E-08 2.2E-11 61.5 34.8 429 1-445 6-543 (555) 127 protein:vir:102668 Length: 547 98.8 4E-08 2.5E-11 61.2 38.9 420 1-437 6-547 (547) 128 protein:vir:63755 Length: 547 98.7 8.5E-08 5.3E-11 59.4 29.7 398 1-445 3-527 (547) 129 protein:vir:3361 Length: 535 # 98.7 9.7E-08 6E-11 59.0 37.1 428 1-445 13-526 (535) 130 protein:vir:80644 Length: 551 98.7 1.2E-07 7.5E-11 58.5 30.1 375 1-445 58-534 (551) 131 protein:vir:105782 Length: 449 98.7 1.2E-07 7.6E-11 58.5 23.8 400 1-445 1-446 (449) 132 protein:vir:94709 Length: 522 98.5 3E-07 1.9E-10 56.3 36.3 423 1-443 11-522 (522) 133 protein:vir:8883 Length: 543 # 98.5 3.2E-07 2E-10 56.2 34.1 428 1-445 13-541 (543) 134 protein:vir:96579 Length: 576 98.5 3.2E-07 2E-10 56.2 26.5 383 1-445 95-526 (576) 135 protein:vir:2198 Length: 536 # 98.5 3.5E-07 2.2E-10 56.0 34.4 426 1-445 12-536 (536) 136 protein:vir:10447 Length: 536 98.5 3.9E-07 2.4E-10 55.7 34.5 426 1-445 12-536 (536) 137 protein:vir:79233 Length: 526 98.4 8.6E-07 5.3E-10 53.8 33.4 398 1-445 12-447 (526) 138 protein:vir:78696 Length: 542 98.4 8.8E-07 5.5E-10 53.8 34.3 424 1-445 4-540 (542) 139 protein:vir:102080 Length: 429 98.4 9E-07 5.6E-10 53.7 29.8 379 1-445 3-427 (429) 140 protein:vir:95599 Length: 563 98.4 1.1E-06 6.9E-10 53.2 26.5 374 1-445 96-528 (563) 141 protein:vir:99312 Length: 563 98.4 1.1E-06 6.9E-10 53.2 26.5 374 1-445 96-528 (563) 142 protein:vir:94572 Length: 535 98.4 1.1E-06 6.9E-10 53.2 33.0 423 1-441 14-535 (535) 143 protein:vir:3843 Length: 397 # 98.3 1.2E-06 7.4E-10 53.1 29.1 375 1-443 3-397 (397) 144 protein:vir:99232 Length: 526 98.3 1.4E-06 8.9E-10 52.6 34.6 397 1-445 4-447 (526) 145 protein:vir:103860 Length: 528 98.3 1.6E-06 9.7E-10 52.4 34.9 398 1-445 4-449 (528) 146 protein:vir:100039 Length: 522 98.3 2E-06 1.3E-09 51.8 33.7 418 1-443 2-522 (522) 147 protein:vir:1266 Length: 416 # 98.2 2.5E-06 1.6E-09 51.3 31.6 373 1-444 2-416 (416) 148 protein:vir:102727 Length: 945 98.2 3.3E-06 2.1E-09 50.6 30.6 398 1-445 66-535 (945) 149 protein:vir:99853 Length: 488 98.2 3.5E-06 2.2E-09 50.5 31.5 380 2-445 1-410 (488) 150 protein:vir:102855 Length: 432 98.1 3.6E-06 2.2E-09 50.4 32.6 379 1-445 3-430 (432) 151 protein:vir:107605 Length: 432 98.1 3.6E-06 2.2E-09 50.4 32.6 379 1-445 3-430 (432) 152 protein:vir:105002 Length: 432 98.1 3.6E-06 2.2E-09 50.4 32.6 379 1-445 3-430 (432) 153 protein:vir:1326 Length: 457 # 98.1 4E-06 2.5E-09 50.2 27.6 392 1-445 3-456 (457) 154 protein:vir:81095 Length: 416 98.1 4E-06 2.5E-09 50.2 30.6 381 1-445 3-416 (416) 155 protein:vir:4598 Length: 416 # 98.1 4E-06 2.5E-09 50.2 30.6 381 1-445 3-416 (416) 156 protein:vir:107880 Length: 491 98.1 4.3E-06 2.6E-09 50.0 34.9 382 1-445 17-421 (491) 157 protein:vir:106716 Length: 698 98.1 4.3E-06 2.7E-09 50.0 20.8 393 1-445 77-561 (698) 158 protein:vir:1785 Length: 555 # 98.1 4.6E-06 2.9E-09 49.8 32.0 423 1-445 4-555 (555) 159 protein:vir:1380 Length: 422 # 98.1 5.5E-06 3.4E-09 49.4 31.4 383 1-445 3-422 (422) 160 protein:vir:79063 Length: 491 98.1 5.9E-06 3.7E-09 49.3 34.6 380 1-445 17-421 (491) 161 protein:vir:3648 Length: 695 # 98.0 6.8E-06 4.2E-09 48.9 24.8 385 1-445 77-542 (695) 162 protein:vir:99672 Length: 532 98.0 7E-06 4.4E-09 48.8 34.6 422 1-444 13-532 (532) 163 protein:vir:101541 Length: 694 98.0 7.3E-06 4.5E-09 48.7 24.8 392 1-445 86-557 (694) 164 protein:vir:78589 Length: 695 98.0 7.6E-06 4.7E-09 48.7 24.9 384 1-445 77-542 (695) 165 protein:vir:80796 Length: 574 98.0 7.7E-06 4.7E-09 48.6 33.5 382 1-445 58-522 (574) 166 protein:vir:6240 Length: 457 # 97.9 1.2E-05 7.8E-09 47.5 28.8 397 1-445 3-449 (457) 167 protein:vir:9359 Length: 348 # 97.9 1.4E-05 8.5E-09 47.3 26.6 314 67-444 1-348 (348) 168 protein:vir:8418 Length: 409 # 97.9 1.4E-05 8.7E-09 47.2 30.4 378 1-445 3-409 (409) 169 protein:vir:93610 Length: 454 97.8 1.5E-05 9.4E-09 47.0 34.3 390 1-445 1-438 (454) 170 protein:vir:79984 Length: 441 97.8 1.6E-05 9.7E-09 46.9 31.8 373 20-445 1-441 (441) 171 protein:vir:9408 Length: 441 # 97.8 1.6E-05 9.7E-09 46.9 31.8 373 20-445 1-441 (441) 172 protein:vir:4952 Length: 386 # 97.8 1.7E-05 1.1E-08 46.7 31.1 362 1-445 3-386 (386) 173 protein:vir:6322 Length: 510 # 97.8 1.7E-05 1.1E-08 46.7 34.5 415 1-439 4-510 (510) 174 protein:vir:79772 Length: 648 97.8 1.9E-05 1.2E-08 46.5 33.7 393 1-445 53-503 (648) 175 protein:vir:103330 Length: 517 97.8 1.9E-05 1.2E-08 46.4 34.5 413 1-444 11-517 (517) 176 protein:vir:4156 Length: 542 # 97.8 2.2E-05 1.4E-08 46.1 28.0 398 1-445 1-472 (542) 177 protein:vir:102118 Length: 409 97.6 3.7E-05 2.3E-08 44.9 28.4 372 1-442 1-409 (409) 178 protein:vir:81152 Length: 411 97.6 3.7E-05 2.3E-08 44.9 29.8 367 1-445 3-410 (411) 179 protein:vir:4454 Length: 414 # 97.6 3.9E-05 2.4E-08 44.8 31.6 372 1-444 3-414 (414) 180 protein:vir:98396 Length: 441 97.6 4E-05 2.5E-08 44.7 31.7 377 20-445 1-441 (441) 181 protein:vir:2683 Length: 412 # 97.6 4.6E-05 2.9E-08 44.4 29.4 378 1-444 3-412 (412) 182 protein:vir:108215 Length: 469 97.5 5.3E-05 3.3E-08 44.0 31.0 404 2-445 1-462 (469) 183 protein:vir:78942 Length: 510 97.5 5.5E-05 3.4E-08 43.9 35.0 413 1-438 4-510 (510) 184 protein:vir:1986 Length: 512 # 97.5 5.9E-05 3.7E-08 43.8 36.3 381 1-445 17-445 (512) 185 protein:vir:100882 Length: 383 97.4 6.6E-05 4.1E-08 43.5 28.2 354 1-443 3-383 (383) 186 protein:vir:1023 Length: 392 # 97.4 6.8E-05 4.2E-08 43.4 29.2 367 1-440 1-392 (392) 187 protein:vir:3989 Length: 392 # 97.4 6.8E-05 4.2E-08 43.4 29.2 367 1-440 1-392 (392) 188 protein:vir:101648 Length: 518 97.4 7.7E-05 4.8E-08 43.1 31.5 388 1-445 1-450 (518) 189 protein:vir:7017 Length: 515 # 97.3 9.5E-05 5.9E-08 42.6 36.9 409 1-444 14-515 (515) 190 protein:vir:100691 Length: 535 97.3 0.0001 6.2E-08 42.5 34.3 389 1-445 53-529 (535) 191 protein:vir:93943 Length: 409 97.3 0.00011 7E-08 42.2 28.4 373 1-444 6-409 (409) 192 protein:vir:7407 Length: 392 # 97.3 0.00012 7.2E-08 42.2 30.9 367 1-440 1-392 (392) 193 protein:vir:7853 Length: 518 # 97.2 0.00013 7.9E-08 41.9 31.4 393 1-445 1-450 (518) 194 protein:vir:77981 Length: 448 97.2 0.00015 9E-08 41.6 29.2 389 1-445 11-439 (448) 195 protein:vir:94426 Length: 409 97.2 0.00015 9.1E-08 41.6 28.0 376 1-444 1-409 (409) 196 protein:vir:103219 Length: 201 97.2 2.3E-05 1.4E-08 46.0 11.5 175 243-445 1-201 (201) 197 protein:vir:96980 Length: 409 97.1 0.00016 1E-07 41.3 27.1 376 1-444 1-409 (409) 198 protein:vir:3868 Length: 417 # 97.1 0.00017 1E-07 41.3 31.0 366 20-445 1-415 (417) 199 protein:vir:105064 Length: 421 97.0 0.00021 1.3E-07 40.8 28.8 377 1-445 2-420 (421) 200 protein:vir:5737 Length: 419 # 97.0 0.00022 1.4E-07 40.6 29.9 364 17-445 1-414 (419) 201 protein:vir:4194 Length: 540 # 96.9 0.00025 1.6E-07 40.3 27.8 399 5-445 1-468 (540) 202 protein:vir:78161 Length: 355 96.7 0.00037 2.3E-07 39.4 24.3 292 110-445 1-335 (355) 203 protein:vir:101647 Length: 460 96.7 0.00041 2.5E-07 39.2 29.6 382 1-442 4-460 (460) 204 protein:vir:483 Length: 413 # 96.6 0.00045 2.8E-07 39.0 31.4 373 1-445 2-411 (413) 205 protein:vir:100187 Length: 385 96.6 0.00047 2.9E-07 38.9 28.9 355 1-445 3-384 (385) 206 protein:vir:4509 Length: 424 # 96.6 0.00048 3E-07 38.8 30.3 378 1-445 1-424 (424) 207 protein:vir:4854 Length: 386 # 96.6 0.00048 3E-07 38.8 31.1 351 20-445 1-386 (386) 208 protein:vir:96988 Length: 516 96.6 0.00049 3E-07 38.7 36.2 410 1-445 15-515 (516) 209 protein:vir:80211 Length: 514 96.6 0.00051 3.2E-07 38.6 35.2 420 1-441 1-514 (514) 210 protein:vir:4828 Length: 382 # 96.5 0.00053 3.3E-07 38.5 31.8 358 1-445 3-382 (382) 211 protein:vir:79511 Length: 448 96.4 0.00064 4E-07 38.1 31.5 379 1-445 11-438 (448) 212 protein:vir:100650 Length: 395 96.3 0.00075 4.7E-07 37.7 25.5 354 16-445 1-395 (395) 213 protein:vir:101289 Length: 395 96.3 0.00075 4.7E-07 37.7 25.5 354 16-445 1-395 (395) 214 protein:vir:9507 Length: 395 # 96.3 0.00075 4.7E-07 37.7 25.5 354 16-445 1-395 (395) 215 protein:vir:100150 Length: 437 96.2 0.00087 5.4E-07 37.4 32.5 384 2-445 1-436 (437) 216 protein:vir:4337 Length: 434 # 96.1 0.001 6.5E-07 36.9 30.3 374 1-445 4-433 (434) 217 protein:vir:81072 Length: 432 96.0 0.0011 7.1E-07 36.7 29.3 365 1-445 33-432 (432) 218 protein:vir:105641 Length: 516 96.0 0.0012 7.2E-07 36.7 35.6 409 1-445 15-515 (516) 219 protein:vir:104500 Length: 537 95.9 0.0012 7.6E-07 36.5 26.0 395 1-445 46-527 (537) 220 protein:vir:99452 Length: 651 95.9 0.0013 7.8E-07 36.5 22.8 417 1-445 18-537 (651) 221 protein:vir:10362 Length: 432 95.6 0.0018 1.1E-06 35.6 29.4 362 1-445 33-432 (432) 222 protein:vir:189 Length: 424 # 95.1 0.0029 1.8E-06 34.5 30.7 378 1-443 2-424 (424) 223 protein:vir:1884 Length: 424 # 95.0 0.0029 1.8E-06 34.5 31.3 378 1-443 2-424 (424) 224 protein:vir:98816 Length: 446 95.0 0.003 1.9E-06 34.4 28.3 377 1-408 13-446 (446) 225 protein:vir:103177 Length: 533 95.0 0.0031 1.9E-06 34.4 23.6 393 1-445 45-528 (533) 226 protein:vir:78641 Length: 278 95.0 0.0031 1.9E-06 34.4 26.3 252 67-372 1-278 (278) 227 protein:vir:97060 Length: 432 94.8 0.0034 2.1E-06 34.1 29.6 363 1-445 33-432 (432) 228 protein:vir:95378 Length: 406 94.8 0.0036 2.2E-06 34.0 28.9 365 1-445 3-406 (406) 229 protein:vir:5665 Length: 511 # 94.6 0.0039 2.4E-06 33.8 20.8 378 1-432 48-511 (511) 230 protein:vir:104259 Length: 403 94.1 0.0055 3.4E-06 33.0 29.2 366 1-445 3-403 (403) 231 protein:vir:106999 Length: 564 94.0 0.0057 3.5E-06 32.9 22.8 403 1-445 40-544 (564) 232 protein:vir:1431 Length: 419 # 94.0 0.0058 3.6E-06 32.8 31.9 370 1-445 2-413 (419) 233 protein:vir:4995 Length: 384 # 93.6 0.007 4.3E-06 32.4 29.9 358 1-445 3-382 (384) 234 protein:vir:104892 Length: 558 93.2 0.0084 5.2E-06 32.0 27.0 404 1-445 46-544 (558) 235 protein:vir:9702 Length: 406 # 92.9 0.0097 6E-06 31.6 29.8 366 20-443 1-406 (406) 236 protein:vir:80333 Length: 419 92.2 0.012 7.6E-06 31.1 32.8 370 1-445 1-413 (419) 237 protein:vir:101806 Length: 516 91.7 0.015 9.1E-06 30.6 22.8 377 1-445 57-515 (516) 238 protein:vir:101189 Length: 516 91.7 0.015 9.1E-06 30.6 22.8 377 1-445 57-515 (516) 239 protein:vir:5839 Length: 533 # 91.2 0.017 1E-05 30.3 22.0 390 1-445 19-524 (533) 240 protein:vir:1082 Length: 359 # 90.3 0.022 1.4E-05 29.7 30.0 330 1-405 3-359 (359) 241 protein:vir:960 Length: 413 # 90.1 0.022 1.4E-05 29.6 28.1 364 6-443 1-413 (413) 242 protein:vir:6210 Length: 394 # 90.1 0.023 1.4E-05 29.6 27.5 356 1-445 3-393 (394) 243 protein:vir:100598 Length: 516 89.8 0.024 1.5E-05 29.4 21.0 380 1-445 53-515 (516) 244 protein:vir:1661 Length: 378 # 89.6 0.026 1.6E-05 29.3 21.5 323 13-442 1-378 (378) 245 protein:vir:103458 Length: 524 89.1 0.028 1.7E-05 29.1 22.8 382 1-432 57-524 (524) 246 protein:vir:7208 Length: 524 # 89.1 0.028 1.7E-05 29.1 22.8 382 1-432 57-524 (524) 247 protein:vir:95254 Length: 488 88.7 0.031 1.9E-05 28.9 28.1 400 1-445 7-480 (488) 248 protein:vir:8100 Length: 466 # 88.6 0.031 2E-05 28.8 31.7 395 1-445 3-466 (466) 249 protein:vir:93867 Length: 378 85.4 0.053 3.3E-05 27.6 21.6 323 20-442 1-378 (378) 250 protein:vir:345 Length: 663 # 85.0 0.056 3.5E-05 27.4 33.2 406 1-445 16-589 (663) 251 protein:vir:81218 Length: 423 80.9 0.091 5.6E-05 26.3 31.8 370 1-445 3-422 (423) 252 protein:vir:95965 Length: 385 80.6 0.093 5.8E-05 26.2 29.1 342 1-442 3-385 (385) 253 protein:vir:6896 Length: 523 # 78.5 0.11 7E-05 25.8 23.7 380 1-432 57-523 (523) 254 protein:vir:98265 Length: 524 76.6 0.13 8.3E-05 25.4 23.9 382 1-445 59-523 (524) 255 protein:vir:8317 Length: 409 # 73.7 0.17 0.0001 24.8 28.3 356 1-430 3-409 (409) 256 protein:vir:106282 Length: 521 73.0 0.18 0.00011 24.7 24.8 377 1-445 56-520 (521) 257 protein:vir:100249 Length: 431 72.8 0.18 0.00011 24.7 32.9 379 1-439 3-431 (431) 258 protein:vir:94002 Length: 378 71.2 0.2 0.00012 24.4 23.0 329 20-443 1-378 (378) 259 protein:vir:80134 Length: 403 63.8 0.31 0.00019 23.4 28.9 358 1-444 3-403 (403) 260 protein:vir:4089 Length: 395 # 61.9 0.34 0.00021 23.1 27.0 356 1-445 3-394 (395) 261 protein:vir:78310 Length: 376 59.4 0.39 0.00024 22.8 30.1 340 1-444 3-376 (376) 262 protein:vir:6596 Length: 521 # 58.2 0.42 0.00026 22.7 26.9 378 1-445 60-520 (521) 263 protein:vir:94666 Length: 723 57.2 0.44 0.00027 22.5 33.8 369 29-445 1-443 (723) 264 protein:vir:78191 Length: 351 55.8 0.47 0.00029 22.4 22.5 295 18-379 1-351 (351) 265 protein:vir:98853 Length: 219 52.9 0.54 0.00034 22.0 15.3 196 148-376 1-219 (219) 266 protein:vir:858 Length: 378 # 50.6 0.6 0.00037 21.8 22.4 323 1-442 3-378 (378) 267 protein:vir:108049 Length: 524 49.1 0.65 0.0004 21.6 24.6 380 1-445 57-523 (524) 268 protein:vir:94869 Length: 378 47.2 0.71 0.00044 21.4 23.4 323 1-445 3-378 (378) 269 protein:vir:81017 Length: 521 46.5 0.73 0.00046 21.3 26.4 378 1-445 62-520 (521) 270 protein:vir:4698 Length: 251 # 42.8 0.87 0.00054 20.9 20.0 234 1-284 3-251 (251) 271 protein:vir:267 Length: 348 # 38.4 1.1 0.00066 20.4 24.6 298 24-379 1-348 (348) 272 protein:vir:98643 Length: 395 35.1 1.3 0.00078 20.0 27.9 354 1-445 3-394 (395) 273 protein:vir:79207 Length: 351 26.7 1.9 0.0012 19.0 22.6 294 18-385 1-351 (351) 274 protein:vir:79150 Length: 368 22.1 2.5 0.0016 18.4 22.3 316 5-386 1-368 (368) 275 protein:vir:101418 Length: 569 21.3 2.6 0.0016 18.3 23.3 402 1-433 76-569 (569) No 1 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=100.00 E-value=1.8e-106 Score=600.29 Aligned_cols=445 Identities=98% Similarity=1.452 Sum_probs=430.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|++||++|..+++++.++.+||+|+|+|+.++++.......+..++++|+++||+++||++.++|++|+|++++++++. T Consensus 48 ~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~G~p~~~~~~d~~ 127 (492) T protein:vir:94 48 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 127 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHHhhhcccCceeccCchH Confidence 99999999999999999999999999999999988888888888999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEE Q lcl|NC_021326. 81 VIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 160 (445) Q Consensus 81 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 160 (445) ..+.|+.|++|+++..+.+++++++++|++|+++|.|++|++++++++|.+++|+|+++..+++.+++|+|..++..+++ T Consensus 128 ~~~~l~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~~~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~ 207 (492) T protein:vir:94 128 VVKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 207 (492) T ss_pred HHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEE Confidence 99999999999999999999999999999999999999999999999999999999998889999999999999999999 Q ss_pred EEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021326. 161 YWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSN 240 (445) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~ 240 (445) +|+...+++|...............+.++.+..+|+||.||||+|+|+++|.|+|+++++|+|+||.++|++++.+++++ T Consensus 208 ~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~ 287 (492) T protein:vir:94 208 YWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSN 287 (492) T ss_pred EEecCeEEEEEEecCeeeeccccccccccccccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 99999999999998888888888888888899999999999999999999999999999999999999999999999999 Q ss_pred CCeeEEecCCcccchhHHHhhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHH Q lcl|NC_021326. 241 ELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA 320 (445) Q Consensus 241 ~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~A 320 (445) +|+++++|++.+...++...++..+++.++++++++|++++.+.++++.++++|++.|+.+|++|+++++++++++||+| T Consensus 288 ~p~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A 367 (492) T protein:vir:94 288 ELTYVLKNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA 367 (492) T ss_pred CceeeeecCCcccchhhHHHHhhccceecCCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCcCCCccccccCchHHH Confidence 99999999998888888888899999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCC Q lcl|NC_021326. 321 LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPF 400 (445) Q Consensus 321 i~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~et~l~~l~~ 400 (445) |++++++|..||.++++.|+.+|++++++++++++.+.+..+++|+|++++|+|.++.++++++++|++|+||+++++|+ T Consensus 368 l~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~e~~~~~~kl~giiS~et~~~~l~~ 447 (492) T protein:vir:94 368 LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPF 447 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccceeeEEecCCCCCCHHHHHHHHHHHhccCchHHHHHhCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 401 VEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 401 ~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++|+++|++||++|+++.++.+++..+.+.+++.+++++++.++| T Consensus 448 v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~e 492 (492) T protein:vir:94 448 VEDLQAELERIEQEQMEYNKQLPNLDDGGADSAQQQERSNNKESE 492 (492) T ss_pred CCCHHHHHHHHHHHHHHHHhhccccccccCCCCccccCCccccCC Confidence 999999999999999999999999999999999999999999999 No 2 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=100.00 E-value=2e-106 Score=600.06 Aligned_cols=445 Identities=99% Similarity=1.459 Sum_probs=430.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.++|++|..++++|.++.+||+|+|+|+.+++...........++++||++||+++||++.++|++|+|+++++++++ T Consensus 39 ~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~~~~d~~ 118 (483) T protein:vir:12 39 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 118 (483) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHhhhhcccCceeccCChH Confidence 99999999999999999999999999999999988888888889999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEE Q lcl|NC_021326. 81 VIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 160 (445) Q Consensus 81 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 160 (445) ..+.|++|++|++++.+.+++++++++|++|+++|.|++|++++++++|.+++|+|+++..+++.+++|+|..++..+++ T Consensus 119 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~ 198 (483) T protein:vir:12 119 VVKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 198 (483) T ss_pred HHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEEEEcCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeecceEEE Confidence 99999999999999999999999999999999999999999999999999999999998889999999999999999999 Q ss_pred EEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021326. 161 YWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSN 240 (445) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~ 240 (445) +|+...+++|...............+.++.+..+|+||.||||+|+|++.|.|+|+++++|||+||.++|++++.+++++ T Consensus 199 ~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~ 278 (483) T protein:vir:12 199 YWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSN 278 (483) T ss_pred EEecCeEEEEEEeCCeeeecccccccccccccccCCCCccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhc Confidence 99999999999998888888887888888899999999999999999999999999999999999999999999999999 Q ss_pred CCeeEEecCCcccchhHHHhhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHH Q lcl|NC_021326. 241 ELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA 320 (445) Q Consensus 241 ~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~A 320 (445) +|+++++|+..+..+++...++..+++.++++++++|++++.+.+++++++++|++.|+.+|++|+++++++++++||+| T Consensus 279 ~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A 358 (483) T protein:vir:12 279 ELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA 358 (483) T ss_pred CceeeeecCCcccchhHHHhhhhccccccCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHH Confidence 99999999998888888888899899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCC Q lcl|NC_021326. 321 LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPF 400 (445) Q Consensus 321 i~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~et~l~~l~~ 400 (445) |++++.+|..||.++++.|+.+|++++++++++++.+.++.+++|+|++++|+|.++.++++++++|++|+||+++++|+ T Consensus 359 l~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl~GiiS~et~~~~~~~ 438 (483) T protein:vir:12 359 LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPF 438 (483) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccceeeEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 401 VEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 401 ~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++|+++|++||++|+++.++.+++..+++.+++.++++++++++| T Consensus 439 v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~e~e 483 (483) T protein:vir:12 439 VEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNNKESE 483 (483) T ss_pred CCCHHHHHHHHHHHHHHHHhhcccccccccCCcccCCCCCcccCC Confidence 999999999999999999999999999999999999999999999 No 3 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=100.00 E-value=2.5e-106 Score=599.57 Aligned_cols=445 Identities=98% Similarity=1.448 Sum_probs=429.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.+||++|..++++|.++.+||+|+|+|+.++++..........++++|+++||+++||++.++|++|+|++++++++. T Consensus 48 ~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~ 127 (492) T protein:vir:97 48 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 127 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHHHhhhhcccCceeccCchH Confidence 89999999999999999999999999999999988888888889999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEE Q lcl|NC_021326. 81 VIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 160 (445) Q Consensus 81 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 160 (445) ..+.|++|++|++++.+.+++++++++|++|+++|.|++|++++++++|.+++|+|+++..+++.+++|+|..++..+++ T Consensus 128 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~~~~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~ 207 (492) T protein:vir:97 128 VVKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 207 (492) T ss_pred HHHHHHHHHhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEE Confidence 99999999999999999999999999999999999999999999999999999999998889999999999999999999 Q ss_pred EEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021326. 161 YWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSN 240 (445) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~ 240 (445) +|+...+++|....+..........+.++.+..+|+||.||||+|+|++.|+|+|+++++|||+||.++|++++.+++++ T Consensus 208 ~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~ 287 (492) T protein:vir:97 208 YWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSN 287 (492) T ss_pred EEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhc Confidence 99999999999998888877777788888899999999999999999999999999999999999999999999999999 Q ss_pred CCeeEEecCCcccchhHHHhhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHH Q lcl|NC_021326. 241 ELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA 320 (445) Q Consensus 241 ~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~A 320 (445) +|+++++|+..+...++...++..+++.++++++++|++++.+.+++++++++|+++|+.+|++|+++++++++++||+| T Consensus 288 ~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~A 367 (492) T protein:vir:97 288 ELTYVLKNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA 367 (492) T ss_pred cceeeeecCCcccchhHHHHHhhccceecCCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCcHHHH Confidence 99999999998888888888899999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCC Q lcl|NC_021326. 321 LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPF 400 (445) Q Consensus 321 i~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~et~l~~l~~ 400 (445) |++++++|..||.++++.|+.+|++++++++++++.+.+..+++|+|++++|+|.++.++++++++|++|+||+++++|+ T Consensus 368 l~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~G~iS~et~l~~l~~ 447 (492) T protein:vir:97 368 LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPF 447 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccceeeEEecCCCCCCHHHHHHHHHHHhccCchHHHHHhCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 401 VEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 401 ~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++|+++|++||++|+++.++.+++..+.+.+.+.+.++++++++| T Consensus 448 v~d~~~Eleri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 492 (492) T protein:vir:97 448 VEDLQAELERIEQEQTEYNKQLPNLDDGGADSAQQQERSNNKESE 492 (492) T ss_pred CCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCcccccccccccC Confidence 999999999999999999999999999999999999999999999 No 4 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=100.00 E-value=2.4e-105 Score=594.10 Aligned_cols=445 Identities=99% Similarity=1.459 Sum_probs=429.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.++|++|+.++++|.++++||+|+|+|+.++............++++|+++||+++||++.++|++|+|+++++++++ T Consensus 28 ~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~~~~~~~~d~~ 107 (472) T protein:vir:93 28 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 107 (472) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhccccccccccccccccchHHHHHHHHhhhhcccCeeeccCChH Confidence 99999999999999999999999999999999888888888888999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEE Q lcl|NC_021326. 81 VIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 160 (445) Q Consensus 81 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 160 (445) ..+.|+.|++|+++..+.+++++++++|++|++||.|++|++++++++|.+++|+|+++..+++.+++|+|..++..+++ T Consensus 108 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~ 187 (472) T protein:vir:93 108 VVKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 187 (472) T ss_pred HHHHHHHHHhccHHHHHHHHHHHHhhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeecceeEE Confidence 99999999999999999999999999999999999999999999999999999999988889999999999999999999 Q ss_pred EEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021326. 161 YWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSN 240 (445) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~ 240 (445) +|+...+++|...............+.++....+|+||.||||+|+|+++|+|+|+++++|+|+||.++|++++.+++++ T Consensus 188 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~ 267 (472) T protein:vir:93 188 YWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSN 267 (472) T ss_pred EEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhc Confidence 99999999999898888877777888888889999999999999999999999999999999999999999999999999 Q ss_pred CCeeEEecCCcccchhHHHhhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHH Q lcl|NC_021326. 241 ELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA 320 (445) Q Consensus 241 ~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~A 320 (445) +|+++++|++.+..+++...++..+++.++++++++|++++++.+++++++++|+++|+++|++|+++++.+++++||+| T Consensus 268 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~A 347 (472) T protein:vir:93 268 ELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA 347 (472) T ss_pred CceeEeecCCcccchhhHHHHhhccccccCCCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCchHHH Confidence 99999999998888888888899999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCC Q lcl|NC_021326. 321 LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPF 400 (445) Q Consensus 321 i~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~et~l~~l~~ 400 (445) |++++.+|..||.++++.|+.+|++++++++++++.+.+..+++|+|++++|+|.++.++++++++|++|+||+++++|+ T Consensus 348 l~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~~~~~~~~k~~giis~et~l~~l~~ 427 (472) T protein:vir:93 348 LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPF 427 (472) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 401 VEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 401 ~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++|+++|++||++|+++.++.+++..+.+++++.+.++++++++| T Consensus 428 ~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~e 472 (472) T protein:vir:93 428 VEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNNKESE 472 (472) T ss_pred CCCHHHHHHHHHHHHHHHHHhccCcCcccCCCCCCCCCCCcccCC Confidence 999999999999999999999999999999999999999999999 No 5 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=100.00 E-value=3.9e-104 Score=587.53 Aligned_cols=444 Identities=56% Similarity=0.948 Sum_probs=422.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.++|++|..+++++.++++||+|+|+|+.++.+..........++++||++||+++||++.++|++|+|++++++++. T Consensus 31 ~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~ 110 (474) T protein:vir:96 31 MIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAHDDDK 110 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcccCceeccCChH Confidence 99999999999999999999999999999999888777777788889999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEE Q lcl|NC_021326. 81 VIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 160 (445) Q Consensus 81 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 160 (445) ..+.++.|++|+++..+.+++++++++|++|+++|.|++|++++++++|.+++|+|+++..+++.+++|+|..+...+++ T Consensus 111 ~~~~l~~~~~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~ 190 (474) T protein:vir:96 111 VLDVIHQVLDTRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVE 190 (474) T ss_pred HHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEE Confidence 99999999999999999999999999999999999999999999999999999999998889999999999999989999 Q ss_pred EEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021326. 161 YWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSN 240 (445) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~ 240 (445) +|+...+++|...+...................+|++|.||||+|+|++.|.|+|+++++|||+||.++|++++.+++++ T Consensus 191 vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~ 270 (474) T protein:vir:96 191 YWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESV 270 (474) T ss_pred EEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 99999999999887776666656666777778899999999999999999999999999999999999999999999999 Q ss_pred CCeeEEecCCcccchhHHHhhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHH Q lcl|NC_021326. 241 ELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA 320 (445) Q Consensus 241 ~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~A 320 (445) +|+++++|+..+..+++...++..+++.++++++++|++++.+.+++++++++|+++|+.+|++|+++++++++++||+| T Consensus 271 ~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A 350 (474) T protein:vir:96 271 ELIYILRGYEGEDLSEFMEGLKYYKAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIA 350 (474) T ss_pred cchhhhcCCCcccccchhhhhhccceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHH Confidence 99999999988877788888888899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCC Q lcl|NC_021326. 321 LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPF 400 (445) Q Consensus 321 i~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~et~l~~l~~ 400 (445) |+++++++.+||.++++.|+.+|+++++++++++|...+..+|+++|++++|+|.++.++++++ +|++|+||+++++|+ T Consensus 351 lk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a~~~~~-~giiS~et~~~~lp~ 429 (474) T protein:vir:96 351 LKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIKLDAKEIEITFNFNVMVNDLEQSQIGAQ-SQYLSKETLVRHHPW 429 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCccCHHHHHHHHHH-cCCCChHHHHHhCCC Confidence 9999999999999999999999999999999999999999999999999999999999999877 599999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 401 VEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 401 ~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++|+++|++||++|+++.++.++...+++++++.++.+++++++| T Consensus 430 v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:96 430 VDDPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENNQSK 474 (474) T ss_pred CCCHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCccccC Confidence 999999999999999999999988999999999999999999999 No 6 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=100.00 E-value=3.9e-104 Score=587.53 Aligned_cols=444 Identities=56% Similarity=0.948 Sum_probs=422.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.++|++|..+++++.++++||+|+|+|+.++.+..........++++||++||+++||++.++|++|+|++++++++. T Consensus 31 ~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~ 110 (474) T protein:vir:95 31 MIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAHDDDK 110 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcccCceeccCChH Confidence 99999999999999999999999999999999888777777788889999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEE Q lcl|NC_021326. 81 VIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 160 (445) Q Consensus 81 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 160 (445) ..+.++.|++|+++..+.+++++++++|++|+++|.|++|++++++++|.+++|+|+++..+++.+++|+|..+...+++ T Consensus 111 ~~~~l~~~~~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~ 190 (474) T protein:vir:95 111 VLDVIHQVLDTRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVE 190 (474) T ss_pred HHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEE Confidence 99999999999999999999999999999999999999999999999999999999998889999999999999989999 Q ss_pred EEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021326. 161 YWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSN 240 (445) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~ 240 (445) +|+...+++|...+...................+|++|.||||+|+|++.|.|+|+++++|||+||.++|++++.+++++ T Consensus 191 vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~ 270 (474) T protein:vir:95 191 YWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESV 270 (474) T ss_pred EEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 99999999999887776666656666777778899999999999999999999999999999999999999999999999 Q ss_pred CCeeEEecCCcccchhHHHhhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHH Q lcl|NC_021326. 241 ELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA 320 (445) Q Consensus 241 ~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~A 320 (445) +|+++++|+..+..+++...++..+++.++++++++|++++.+.+++++++++|+++|+.+|++|+++++++++++||+| T Consensus 271 ~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A 350 (474) T protein:vir:95 271 ELIYILRGYEGEDLSEFMEGLKYYKAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIA 350 (474) T ss_pred cchhhhcCCCcccccchhhhhhccceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHH Confidence 99999999988877788888888899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCC Q lcl|NC_021326. 321 LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPF 400 (445) Q Consensus 321 i~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~et~l~~l~~ 400 (445) |+++++++.+||.++++.|+.+|+++++++++++|...+..+|+++|++++|+|.++.++++++ +|++|+||+++++|+ T Consensus 351 lk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a~~~~~-~giiS~et~~~~lp~ 429 (474) T protein:vir:95 351 LKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIKLDAKEIEITFNFNVMVNDLEQSQIGAQ-SQYLSKETLVRHHPW 429 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCccCHHHHHHHHHH-cCCCChHHHHHhCCC Confidence 9999999999999999999999999999999999999999999999999999999999999877 599999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 401 VEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 401 ~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++|+++|++||++|+++.++.++...+++++++.++.+++++++| T Consensus 430 v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:95 430 VDDPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENNQSK 474 (474) T ss_pred CCCHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCccccC Confidence 999999999999999999999988999999999999999999999 No 7 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=100.00 E-value=3.5e-103 Score=582.30 Aligned_cols=444 Identities=58% Similarity=0.969 Sum_probs=423.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.++|++|..++++|+++.+||+|+|+|+.++++....+.....++++||++||+++||++.++|++|+|++++++++. T Consensus 31 ~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~ 110 (474) T protein:vir:97 31 MIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDEN 110 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhcCCceeccCcHH Confidence 99999999999999999999999999999998888877778888899999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEE Q lcl|NC_021326. 81 VIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 160 (445) Q Consensus 81 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 160 (445) ..+.|+.|++|++...+.++++.++++|++|+++|.|++|++++++++|++++|+|+++..+++.+++|+|..++..+++ T Consensus 111 ~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~ 190 (474) T protein:vir:97 111 VLKVIHDVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVE 190 (474) T ss_pred HHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEE Confidence 99999999999999999999999999999999999999999999999999999999998889999999999999999999 Q ss_pred EEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021326. 161 YWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSN 240 (445) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~ 240 (445) +|+...+++|....+..................+|++|+||||+|+|++.|.|+|+++++|||+||.++|++++.+++++ T Consensus 191 ~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~ 270 (474) T protein:vir:97 191 FWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESV 270 (474) T ss_pred EEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 99999999998887766665555566677778899999999999999999999999999999999999999999999999 Q ss_pred CCeeEEecCCcccchhHHHhhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHH Q lcl|NC_021326. 241 ELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA 320 (445) Q Consensus 241 ~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~A 320 (445) +|+++++|++.+...++...++..+++.++++++++|++++.+.+++++++++|++.|+.+|++|+++++++++++||+| T Consensus 271 ~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A 350 (474) T protein:vir:97 271 ELIYILKGYEGEDLEEFMRGLKYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIA 350 (474) T ss_pred CceeeeecCCcccchhhhhhhhccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHH Confidence 99999999988877788888888899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCC Q lcl|NC_021326. 321 LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPF 400 (445) Q Consensus 321 i~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~et~l~~l~~ 400 (445) |+++++++.+||.++++.|+.+|++++++++++++.+.+..+++|+|++++|.|.++.+++++++ |++|+||+++++|+ T Consensus 351 l~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~d~~~i~v~f~~~~p~~~~e~a~~~~~~-g~iS~et~l~~l~~ 429 (474) T protein:vir:97 351 LKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLKTDVKDIEISFNFNRMMNDAEQSQIIAQS-QYLSRETLVKSSPL 429 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCcccCHHHHHHHHHHc-CCCCHHHHHHhCCC Confidence 99999999999999999999999999999999999999999999999999999999999999886 89999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 401 VEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 401 ~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++|+++|++||++|+++.++.++...+.+.+++.++++++++++| T Consensus 430 v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:97 430 VDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred CCCHHHHHHHHHHHHHHHHhhccccCCCCCCCcccCCCCcccccC Confidence 999999999999999999999999999999999999999999999 No 8 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=100.00 E-value=3.5e-103 Score=582.30 Aligned_cols=444 Identities=58% Similarity=0.969 Sum_probs=423.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.++|++|..++++|+++.+||+|+|+|+.++++....+.....++++||++||+++||++.++|++|+|++++++++. T Consensus 31 ~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~ 110 (474) T protein:vir:94 31 MIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDEN 110 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhcCCceeccCcHH Confidence 99999999999999999999999999999998888877778888899999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEE Q lcl|NC_021326. 81 VIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 160 (445) Q Consensus 81 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 160 (445) ..+.|+.|++|++...+.++++.++++|++|+++|.|++|++++++++|++++|+|+++..+++.+++|+|..++..+++ T Consensus 111 ~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~ 190 (474) T protein:vir:94 111 VLKVIHDVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVE 190 (474) T ss_pred HHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEE Confidence 99999999999999999999999999999999999999999999999999999999998889999999999999999999 Q ss_pred EEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021326. 161 YWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSN 240 (445) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~ 240 (445) +|+...+++|....+..................+|++|+||||+|+|++.|.|+|+++++|||+||.++|++++.+++++ T Consensus 191 ~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~ 270 (474) T protein:vir:94 191 FWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESV 270 (474) T ss_pred EEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 99999999998887766665555566677778899999999999999999999999999999999999999999999999 Q ss_pred CCeeEEecCCcccchhHHHhhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHH Q lcl|NC_021326. 241 ELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA 320 (445) Q Consensus 241 ~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~A 320 (445) +|+++++|++.+...++...++..+++.++++++++|++++.+.+++++++++|++.|+.+|++|+++++++++++||+| T Consensus 271 ~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~A 350 (474) T protein:vir:94 271 ELIYILKGYEGEDLEEFMRGLKYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIA 350 (474) T ss_pred CceeeeecCCcccchhhhhhhhccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccccHHHH Confidence 99999999988877788888888899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCC Q lcl|NC_021326. 321 LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPF 400 (445) Q Consensus 321 i~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~et~l~~l~~ 400 (445) |+++++++.+||.++++.|+.+|++++++++++++.+.+..+++|+|++++|.|.++.+++++++ |++|+||+++++|+ T Consensus 351 l~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~d~~~i~v~f~~~~p~~~~e~a~~~~~~-g~iS~et~l~~l~~ 429 (474) T protein:vir:94 351 LKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLKTDVKDIEISFNFNRMMNDAEQSQIIAQS-QYLSRETLVKSSPL 429 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCcccCHHHHHHHHHHc-CCCCHHHHHHhCCC Confidence 99999999999999999999999999999999999999999999999999999999999999886 89999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 401 VEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 401 ~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++|+++|++||++|+++.++.++...+.+.+++.++++++++++| T Consensus 430 v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:94 430 VDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred CCCHHHHHHHHHHHHHHHHhhccccCCCCCCCcccCCCCcccccC Confidence 999999999999999999999999999999999999999999999 No 9 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=100.00 E-value=7e-102 Score=575.15 Aligned_cols=444 Identities=59% Similarity=0.972 Sum_probs=421.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.++|++|..+++++.++.+||+|+|+|++++.+..........++++||++||++.||++.++|++|+|+++++++++ T Consensus 31 ~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~ 110 (474) T protein:vir:95 31 MIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTYSCEDES 110 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccCchhccccccccccccccccccceeccchHHHHHHHHHhhhccCCceeccCchH Confidence 99999999999999999999999999999999888888888888899999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEE Q lcl|NC_021326. 81 VIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 160 (445) Q Consensus 81 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 160 (445) ..+.|+.|++|+++..+.++++.++++|++|+++|.|++|++++++++|.+++|+|+++..+++.+++++|..++..+++ T Consensus 111 ~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~ 190 (474) T protein:vir:95 111 VLKIIHDVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVE 190 (474) T ss_pred HHHHHHHHHhccHHHHHHHHHHHHhhcCcEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEEcCeeEEE Confidence 99999999999999999999999999999999999999999999999999999999998889999999999999999999 Q ss_pred EEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021326. 161 YWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSN 240 (445) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~ 240 (445) +|+...+++|.......................+|++|.||||+|+|++.|.|+|+++++|||+||.++|++++.+++++ T Consensus 191 ~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~ 270 (474) T protein:vir:95 191 FWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESV 270 (474) T ss_pred EEeCCeEEEEEEcCCccccccccCcccccccccccCCCccceEeecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 99999999988877665555555555566677899999999999999999999999999999999999999999999999 Q ss_pred CCeeEEecCCcccchhHHHhhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHH Q lcl|NC_021326. 241 ELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVA 320 (445) Q Consensus 241 ~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~A 320 (445) +|+++++|++.+...++...++..+++.++++++++|++++.+.++++.+++.|.++|+..|++|+++++++++++||+| T Consensus 271 ~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~A 350 (474) T protein:vir:95 271 ELIYILKGYEGQDLEEFMRGLKYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGSAPSGIA 350 (474) T ss_pred CceeeeecCCcccchhhhhhhhccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHH Confidence 99999999998877788888888999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCC Q lcl|NC_021326. 321 LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPF 400 (445) Q Consensus 321 i~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~et~l~~l~~ 400 (445) |+++++++..||.++++.|+.+|++++++|++++|...+..+++|+|++++|.|.++.+++++++ |++|+||+++++|+ T Consensus 351 lk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~~d~~~i~v~f~~~~p~d~~e~a~~~~~~-g~iS~et~i~~l~~ 429 (474) T protein:vir:95 351 LKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNLKMDVKDIEISFNFNRMMNDAEQSQIIAQS-QYLSRETLVKSSPL 429 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCCCcCHHHHHHHHHhc-CCCchHHHHHhCCC Confidence 99999999999999999999999999999999999999999999999999999999999999885 99999999999999 Q ss_pred CCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 401 VEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 401 ~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++|+++|++||++|+++.++.++...+.++++..++++++++++| T Consensus 430 v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 474 (474) T protein:vir:95 430 VDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNDKESE 474 (474) T ss_pred CCCHHHHHHHHHHHHHHHHhcccccccccCCCCcCCCCCccCCCC Confidence 999999999999999999999999999999999999999999999 No 10 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=100.00 E-value=4.3e-100 Score=565.32 Aligned_cols=443 Identities=49% Similarity=0.882 Sum_probs=404.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.+++++|..++++++++.+||+|+|+|+.++............++++||++||++.||++.++|+||+||++++++++ T Consensus 30 ~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~ivd~~~~yl~g~p~~~~~~~~~ 109 (478) T protein:vir:10 30 MILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDK 109 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhcccccccccccceeccchHHHHHHHHhhhhcccCceeecCChH Confidence 99999999999999999999999999999999888877778888899999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEE Q lcl|NC_021326. 81 VIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 160 (445) Q Consensus 81 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 160 (445) ..+.|+.+++|++...+.++++.++++|++|++||.|++|++++++++|.+++|+|+++..+++.+++|+|..++..+++ T Consensus 110 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~~~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~ 189 (478) T protein:vir:10 110 ALKQIQHTLNHKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFRVPAEQAVPIWTNKERDELQAFIRVYELDGAERVE 189 (478) T ss_pred HHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeeeCceEEE Confidence 99999999999999999999999999999999999999999999999999999999998889999999999999999999 Q ss_pred EEecceEEEEEEecceeeeccc----ccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 161 YWDKITVNYYVYENGSLIPDYS----NNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTF 236 (445) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~ 236 (445) +|+...+++|............ ...........+|+||+||||+|+|++.|.|+|+++++|||+||.++|++++.+ T Consensus 190 ~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~S~~~~~~ 269 (478) T protein:vir:10 190 YWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTF 269 (478) T ss_pred EEeCCcEEEEEecCCeeeccccccccccccceecccccccCCcceEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999988877655432221 122233345678999999999999999999999999999999999999999999 Q ss_pred HHhcCCeeEEecCCcccchhHHHhhhhCceeecc--CCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccC Q lcl|NC_021326. 237 KDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVS--DNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGS 314 (445) Q Consensus 237 ~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~ 314 (445) +++++|+++++|++.+...++...++..+++.++ ++++++|++++.+.++++.++++|++.|+.+|++|+++++++++ T Consensus 270 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~ 349 (478) T protein:vir:10 270 DESVELIYILKGYEGEDMKDFMHNLKYYKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGN 349 (478) T ss_pred HHhhCcceeeecCCcccccchhhhhhhCceeEecCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCcccccc Confidence 9999999999999988878888888888888774 56889999999999999999999999999999999999999999 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHHhccCChHHH Q lcl|NC_021326. 315 APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETV 394 (445) Q Consensus 315 ~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~et~ 394 (445) ++||+||++++++|.+||.++++.|+.+|++++++++++++...+..+++|+|++++|+|.++.+++++++.|++|+||+ T Consensus 350 n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~d~~~i~i~f~~~~p~~~~e~~~~~~~~~g~iS~et~ 429 (478) T protein:vir:10 350 SPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRLDVRVQDIEITFNFNVMVNELENSQIAMNSTGLLSKETI 429 (478) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccceEEeCCCCCCCHHHHHHHHHHHhCCCChHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 395 LENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 395 l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++++|+++|+++|++||++|+++.++..++..++..+. +..+++|.++| T Consensus 430 i~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~--~~~~~~d~~~e 478 (478) T protein:vir:10 430 LGNHSWVQDPVAEMERIEQENIELNQQLPDIEEGLNDE--QQRQSEDNQSE 478 (478) T ss_pred HHhCCCCCCHHHHHHHHHHHHHHHHHhccccCCCCccc--ccccCcCCCCC Confidence 99999999999999999999999888776666555443 33444555555 No 11 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=100.00 E-value=1.9e-99 Score=561.81 Aligned_cols=443 Identities=49% Similarity=0.893 Sum_probs=402.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.++|++|+.++++|.++++||+|+|+++.++.+..........++++||++||+++||++.++|+||+|+++++++++ T Consensus 30 ~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~d~ 109 (478) T protein:vir:10 30 MILRLVREHKENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDK 109 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccccccccccccccccceeccchHHHHHHHHHhhhccCCeeeecCChH Confidence 99999999999999999999999999999998887777777788889999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEE Q lcl|NC_021326. 81 VIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 160 (445) Q Consensus 81 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 160 (445) ..+.|+.+++|++.+++.+++++++++|++|+++|.|++|++++++++|++++|+|+++..+++.+++|+|..++..+++ T Consensus 110 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~~g~~~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~ 189 (478) T protein:vir:10 110 ALKQIQHTLNHKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFRVPAEQAVPIWTNKERDELQAFIRVYELDGAERVE 189 (478) T ss_pred HHHHHHHHHhcCHHHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEE Confidence 99999999999999999999999999999999999999999999999999999999998889999999999999999999 Q ss_pred EEecceEEEEEEecceeeecc----cccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 161 YWDKITVNYYVYENGSLIPDY----SNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTF 236 (445) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~ 236 (445) +|+...+++|........... ............+|++|+||||+|+|++.|.|+|+++++|||+||.++|++++.+ T Consensus 190 ~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~sd~~~v~~liDa~~~~~S~~~~~~ 269 (478) T protein:vir:10 190 YWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTF 269 (478) T ss_pred EEeCCeEEEEEEcCCeeeccccccccccccceecccccccCCccceEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHH Confidence 999999999887765543322 1222333445678999999999999999999999999999999999999999999 Q ss_pred HHhcCCeeEEecCCcccchhHHHhhhhCceeecc--CCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccC Q lcl|NC_021326. 237 KDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVS--DNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGS 314 (445) Q Consensus 237 ~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~ 314 (445) +++++|+++++|++.+...++...++..+++.++ ++++++|++++++.+++++++++|++.|+.+|++|+++++++++ T Consensus 270 ~~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~ 349 (478) T protein:vir:10 270 DESVELIYILKGYEGEDMKDFMHNLKYYKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGN 349 (478) T ss_pred HHhhCceeeeecCCccccchhhhhhhhcceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCcccccc Confidence 9999999999999888777888788888888775 56889999999999999999999999999999999999999999 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHHhccCChHHH Q lcl|NC_021326. 315 APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETV 394 (445) Q Consensus 315 ~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~et~ 394 (445) |+||+||+++++.|.+|+.++++.|+.+|+++++++++++|...+..+++|+|++++|+|.++.++++++++|++|+||+ T Consensus 350 n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~g~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~ 429 (478) T protein:vir:10 350 SPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRLDVKVQDIEITFNFNVMVNELENSQIAMNSTGLLSKETI 429 (478) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccceEEecCCCCCCHHHHHHHHHHHhCCCChHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCC Q lcl|NC_021326. 395 LENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQS 444 (445) Q Consensus 395 l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 444 (445) ++++|+++|+++|++||++|+++..+...+...+ ..++.++++++++.. T Consensus 430 ~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 430 LSNHAWVEDPVAEMERIEQENIELNQQLPDIEEG-LNGEQQRQSENNQPE 478 (478) T ss_pred HHhCCCCCCHHHHHHHHHHHHHHHHhhccccccc-cCCCCCCCCCCCCCC Confidence 9999999999999999999999888877776543 333333333333333 No 12 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=100.00 E-value=4.8e-99 Score=559.61 Aligned_cols=440 Identities=52% Similarity=0.919 Sum_probs=398.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.++|++|+.+++++.++.+||+|+|+|+.++.+........+.++++||++||++.||++.++||+|+|+++++++++ T Consensus 30 ~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~ 109 (474) T protein:vir:96 30 MIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKPDWRMFTNYHQNLVDQKVAYAVANPVTFSSDDDK 109 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhcccccccccccchhcccchHHHHHHhhhhhhcccCceeecCchH Confidence 99999999999999999999999999999999988888888888899999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEE Q lcl|NC_021326. 81 VIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 160 (445) Q Consensus 81 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 160 (445) ..+.+++|++|+++..+.++++.++++|++|+++|.|++|++++++++|++++|+|+++..+++.+++|+|..+...+++ T Consensus 110 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~~~~~~~ 189 (474) T protein:vir:96 110 SLKTIQEVLNHKWDDKLVDILTAASNKGIEWLQPYIDENGEFKTFRVPAEQAIPIWTNKERDTLKAFIRYYRLDGAERVE 189 (474) T ss_pred HHHHHHHHHhcCHHHHHHHHHHHHHhcCeeEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCceEEE Confidence 99999999999999999999999999999999999999999999999999999999998889999999999999999999 Q ss_pred EEecceEEEEEEecceeeeccccc----ccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 161 YWDKITVNYYVYENGSLIPDYSNN----LENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTF 236 (445) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~ 236 (445) +|+...+++|.............. .........+|++|+||||+|+|++.|.|+|+++++|||+||.++|++++.+ T Consensus 190 ~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~ 269 (474) T protein:vir:96 190 YWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKRVSWGRVPFIPFKNNPQEMSDLFMYKTIIDAMDKRLSDTQNTF 269 (474) T ss_pred EEeCCeEEEEEecCCceeeccccccccccccccccccccCCCceeEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHH Confidence 999999999887766544332221 1223334578999999999999999999999999999999999999999999 Q ss_pred HHhcCCeeEEecCCcccchhHHHhhhhCceeeccC-CCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCc Q lcl|NC_021326. 237 KDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSD-NGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSA 315 (445) Q Consensus 237 ~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~ 315 (445) +++++|+++++|+..+...++...++..+++.+++ +++++|++++.+.++++.++++++++|+.+|++|++++++++++ T Consensus 270 ~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n 349 (474) T protein:vir:96 270 DESTELIYILKGYEGQDLDEFMRNLKYYKAINVDGDGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKFGNS 349 (474) T ss_pred HHhccceeeeecCCcccccchhhhhhcCceEEecCCCCceeEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccc Confidence 99999999999998877777777888888998874 67899999999999999999999999999999999999999999 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHHhccCChHHHH Q lcl|NC_021326. 316 PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVL 395 (445) Q Consensus 316 ~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~et~l 395 (445) +||+|+++++++|.+|+.++++.|+.+|++++++++++++...+..+++|+|++++|.|.++.++++++ +|++|+||++ T Consensus 350 ~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~i~i~f~~~~p~~~~e~~~~~~~-ag~iS~et~~ 428 (474) T protein:vir:96 350 PSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLNIKVQDVEITFNFNVMVNELEQSQIGVQ-SQYLSKETVV 428 (474) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCCCcCHHHHHHHHHh-cCCCchHHHH Confidence 999999999999999999999999999999999999999999999999999999999999999998765 5999999999 Q ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCC Q lcl|NC_021326. 396 ENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSND 441 (445) Q Consensus 396 ~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d 441 (445) +++|+++|+++|++||++|+++..+..++.......+..+++++.| T Consensus 429 ~~~~~v~d~~~E~~ri~~E~~e~~~~~~~~~~~~~~~~~d~~~e~~ 474 (474) T protein:vir:96 429 TNHPWVDDPVAELERIEQDNIDFNKQLPPLEGDANGRAQDNESETN 474 (474) T ss_pred HhCCCCCCHHHHHHHHHHHHHHHHhcccccccccccccCCCcccCC Confidence 9999999999999999999988887766654433222222222222 No 13 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=100.00 E-value=1.3e-97 Score=551.81 Aligned_cols=433 Identities=48% Similarity=0.872 Sum_probs=393.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.++|++|..++++|.++.+||+|+|+++.++..........+.++++||++||++.||++.++|++|+|+++++++++ T Consensus 30 ~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~ 109 (468) T protein:vir:96 30 MILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWRMYTNYHQNLVDQKVAYAVANPVTYGTEDEK 109 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhccCCceeccCChH Confidence 89999999999999999999999999999998888777777788889999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEE Q lcl|NC_021326. 81 VIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 160 (445) Q Consensus 81 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 160 (445) ..+.|++|++|++++.+.+++++++++|++|++||.|++|++++++++|.+++|+|+++..+++.+++|+|..++..+++ T Consensus 110 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~~~~~~~~~~~~ir~~~~~~~~~~~ 189 (468) T protein:vir:96 110 SLKTIQEVLNHKWDDKLVDILTAASNKGVEWIQPYVDEQGEFKTFRVPAEQAIPIWTNKERDELKAFIRLYELDGGERVE 189 (468) T ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEE Confidence 99999999999999999999999999999999999999999999999999999999998889999999999999999999 Q ss_pred EEecceEEEEEEecceeeecccc----cccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 161 YWDKITVNYYVYENGSLIPDYSN----NLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTF 236 (445) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~ 236 (445) +|+...+++|............. ..........+|+||+||||+|+|++.|.|+|+++++|+|+||.++|++++.+ T Consensus 190 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~ 269 (468) T protein:vir:96 190 YWTANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIPFKNNPQEVSDLFMYKTIIDAMDKRLSDTQNTF 269 (468) T ss_pred EEeCCeEEEEEEcCCceeecccccccccccceeeccccccCCcccEEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHH Confidence 99999999988776654433221 22234455678999999999999999999999999999999999999999999 Q ss_pred HHhcCCeeEEecCCcccchhHHHhhhhCceeeccC--CCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccC Q lcl|NC_021326. 237 KDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSD--NGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGS 314 (445) Q Consensus 237 ~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~ 314 (445) +++++|+++++|+..+....+...++..+++.+++ +++++|++++++.++++.++++|+++|+.+|++|+++++++++ T Consensus 270 ~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~d~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~ 349 (468) T protein:vir:96 270 DEATELIYVLKGYEGEDLEEFMYNLKYYKAINVDGDGSGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQDKFGN 349 (468) T ss_pred HHhcCceeeeecCCccccchhhhhhhcCceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccccccccc Confidence 99999999999998877777777788888888763 4779999999999999999999999999999999999999999 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHHhccCChHHH Q lcl|NC_021326. 315 APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETV 394 (445) Q Consensus 315 ~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~et~ 394 (445) ++||+|++++++++.+|+.++++.|+++|+++++++++++|...+..+++|+|++++|.|.++.|++++++ |++|+||+ T Consensus 350 n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~~d~~~i~i~f~~~~p~d~~e~a~~~~~~-g~iS~et~ 428 (468) T protein:vir:96 350 SPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLSIKVQDVEITFNFNVMVNELEQSQIGVNS-QYLSKETV 428 (468) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCCcCHHHHHHHHHhc-CCCchHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999988764 99999999 Q ss_pred HHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCC Q lcl|NC_021326. 395 LENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKE 437 (445) Q Consensus 395 l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 437 (445) ++++|+++|+++|++||++|+++..+..+.+.+.+.+ ++. T Consensus 429 i~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~---~~~ 468 (468) T protein:vir:96 429 VTNHPWVDDPVAEMERIDQEELALPSIEEGLNGKENN---EPT 468 (468) T ss_pred HHhCCCCCCHHHHHHHHHHHHHHHHHHhhccCCCCCC---CCC Confidence 9999999999999999999998877765543332111 111 No 14 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=100.00 E-value=4.3e-95 Score=537.91 Aligned_cols=436 Identities=28% Similarity=0.445 Sum_probs=380.9 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccc----ccccccccccccccccchHHHHHHHHHhhhhccCeeecc Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVD----ATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH 76 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~----~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~ 76 (445) +|.+++.+|+.++++|.++.+||+|+|+|+.++.... ........++++||++||++.||++.++|++|+||++++ T Consensus 9 ~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~G~p~~~~~ 88 (470) T protein:vir:10 9 LIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVASVFPDIDV 88 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhheeccceeeec Confidence 8888888899999999999999999999988865432 122334567899999999999999999999999999999 Q ss_pred CchHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecc Q lcl|NC_021326. 77 TDDEVIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE 156 (445) Q Consensus 77 ~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~ 156 (445) +++...+.+++++++++.+.+.+++++++++|++|+++|+|++|++++++++|.+++|+|+++..+++.+++|+|...+. T Consensus 89 ~d~~~~~~l~~~~~~~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~~~~~~~p~~~~~v~d~~~~~~~~a~ir~y~~~~~ 168 (470) T protein:vir:10 89 GKDADNKKIIDVLGDDRALTLNGLLVDSSNAGRAWLHYWIDEDGNFRYGIIQPDQITPIYATTLDNKLLGILRSYKQLDP 168 (470) T ss_pred CchHHHHHHHHHHhhhHHHHHHHHHHHHhhcCeeEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeec Confidence 99999999999999999999999999999999999999999999999999999999999999988999999999976543 Q ss_pred ------eeEEEEecceEEEEEEecceeeecc----------cccccccccccccccccccceEEecCCCCcCccHHHHHH Q lcl|NC_021326. 157 ------TKVEYWDKITVNYYVYENGSLIPDY----------SNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKT 220 (445) Q Consensus 157 ------~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~ 220 (445) ..+++|+...+++|........... ....+.......+|+||.||||+|+|++.|.|+|+++++ T Consensus 169 ~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~ 248 (470) T protein:vir:10 169 DSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSKNKYRLPELNKYKG 248 (470) T ss_pred CCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccCCCeeeEEEeecCCCCCCchhHHHH Confidence 2467888888888876654332211 112234445567899999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeeccC-----CCceeeEeccCChHHHHHHHHHHH Q lcl|NC_021326. 221 LIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSD-----NGGVDTIQVEVPVENSKKYLDELY 295 (445) Q Consensus 221 lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~l~~~~~~~~~~~~i~~l~ 295 (445) |||+||.++|++++.++++++|+++++|+..+..+++...++..+++.++. +++++|++++.+.++++.++++|+ T Consensus 249 liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~ 328 (470) T protein:vir:10 249 LIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMNDLRKYKSIKINNTGNGDNSGVDKLQIDIPVEARDDALKITR 328 (470) T ss_pred HHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhhhhhhcCeEeccCCCCCcCceeEEEeecCChHHHHHHHHHHH Confidence 999999999999999999999999999998888788888888888888754 467999999999999999999999 Q ss_pred HHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-CCCcceEEEEeCCCCCCC Q lcl|NC_021326. 296 QKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI-KGEHKDVDISFNYNKVAN 374 (445) Q Consensus 296 ~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~i~v~f~~~~p~d 374 (445) ++|+.+|++|++++..+ |++||+||++++++|.+||+++++.|+++|++++++|+++++. ..+..+++|+|++++|.| T Consensus 329 ~~I~~~s~~p~~~~~~~-gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~~~~d~~~i~i~f~~~~p~d 407 (470) T protein:vir:10 329 KNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVED 407 (470) T ss_pred HHHHHHhCCCCCCcccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccceeeEEeccCCCCC Confidence 99999999999998887 6899999999999999999999999999999999999999886 446789999999999999 Q ss_pred HHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcC Q lcl|NC_021326. 375 TELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQ 443 (445) Q Consensus 375 ~~~~~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 443 (445) .++.++++++++|++|+||+++++|+++|+++|++||++|+++..+..+...+..+ .+.+|++ T Consensus 408 ~~e~~~~~~~~~g~iS~et~l~~~p~v~D~~~E~eri~~E~~e~~~~~~~~~~~~~------~~~dde~ 470 (470) T protein:vir:10 408 SLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDKEENDPYSNQADELNG------KGVNDEQ 470 (470) T ss_pred HHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhccccccCC------CCCCCCC Confidence 99999999999999999999999999999999999999999887776655433221 1222222 No 15 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=100.00 E-value=7.2e-95 Score=536.72 Aligned_cols=433 Identities=29% Similarity=0.493 Sum_probs=382.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccc--------cccccccccccccccchHHHHHHHHHhhhhccCe Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDA--------TGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPI 72 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~--------~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~ 72 (445) +|.+++++|++++++|.++++||+|+|+|+.++..... .......++++||++||++.||++.++|++|+|+ T Consensus 9 ~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~ 88 (471) T protein:vir:10 9 IISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKKAYALTYPP 88 (471) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhhhhhcccCc Confidence 88999999999999999999999999999987655322 1222345678999999999999999999999999 Q ss_pred eeccCchHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECC-CCcEEEEEEccceeEEEEcCCCCCceEEEEEEE Q lcl|NC_021326. 73 AFKHTDDEVIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDE-EGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMY 151 (445) Q Consensus 73 ~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~-~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~ 151 (445) ++++++++..+.|+.|++|++++.+.++++.++++|++|+++|.++ +|++++.+++|.+++|+|+++..+++.+++|+| T Consensus 89 ~~~~~~~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~~~~~~~~p~~~~~i~d~~~~~~~~~~ir~~ 168 (471) T protein:vir:10 89 TFDVDDKKVNDMIVDVLGDDYERISKQLCVNAGNAGIAWLHVWKDASDNSFRYACVDSKEVIPIYSKSLDKKSIGVLRVY 168 (471) T ss_pred eeccCChHHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEEEeeCCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEE Confidence 9999999999999999999999999999999999999999999985 699999999999999999998888999999999 Q ss_pred eeec------ceeEEEEecceEEEEEEecceeeeccc----------ccccccccccccccccccceEEecCCCCcCccH Q lcl|NC_021326. 152 KLEN------ETKVEYWDKITVNYYVYENGSLIPDYS----------NNLENSKTHFSTGSWGKIPFIPFKNNDLEISDI 215 (445) Q Consensus 152 ~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~ 215 (445) .... ...+++|+...+++|............ ..+........+|+||.||||+|+|+..|.|+| T Consensus 169 ~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~sd~ 248 (471) T protein:vir:10 169 SSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPFKNNEIETNDL 248 (471) T ss_pred EeeccCCCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCCceeEEEeccCCCCCCch Confidence 7642 235788888888888876654333221 123344555678999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeeccC-----CCceeeEeccCChHHHHHH Q lcl|NC_021326. 216 FMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSD-----NGGVDTIQVEVPVENSKKY 290 (445) Q Consensus 216 ~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~l~~~~~~~~~~~~ 290 (445) +++++|||+||.++|++++.++++++|+++++|++.+..+++...++..+++.+++ +++++|++++.+.++++.+ T Consensus 249 e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 328 (471) T protein:vir:10 249 KPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFLEDLKRYKMIKMDNDGMGDQSGVTTIAIDIPTEARNLI 328 (471) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhHHHhhcCCeEEecCCCCccCccceEEeecCChHHHHHH Confidence 99999999999999999999999999999999998887788888888888888754 3589999999999999999 Q ss_pred HHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCC Q lcl|NC_021326. 291 LDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYN 370 (445) Q Consensus 291 i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~ 370 (445) +++|+++|+.+|++|+++++++ |++||+||++++++|..||.++++.|+.+|++++++++++++.. +..+++|+|+++ T Consensus 329 ~~~l~~~I~~~s~tp~~~~~~~-gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~-d~~~i~i~f~~~ 406 (471) T protein:vir:10 329 LERTKKQIFISGQGVNPETDKL-GNSSGVALKFLYSLLELKAGNMETQFRSGYATLVKMILKHLGLS-DKLKIKQTWTRN 406 (471) T ss_pred HHHHHHHHHHHhCCcCCCcccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-CCceeEEEeCCC Confidence 9999999999999999998887 57999999999999999999999999999999999999999875 467899999999 Q ss_pred CCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCC Q lcl|NC_021326. 371 KVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQ 435 (445) Q Consensus 371 ~p~d~~~~~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 435 (445) +|.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++.++..++..+..++...+ T Consensus 407 ~p~n~~e~~~~~~kl~g~iS~et~~~~~p~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~e~~ 471 (471) T protein:vir:10 407 SINNDTEMAQVVSTLATITSRENVAKSNPIVEDWQDELRLQKAEQEGRSEKLYDMEEVEHESEVE 471 (471) T ss_pred CCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccccCCCCCccccC Confidence 99999999999999999999999999999999999999999999998877777665554444444 No 16 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=100.00 E-value=3.1e-94 Score=533.25 Aligned_cols=443 Identities=35% Similarity=0.585 Sum_probs=383.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccc---cccccccccccccccchHHHHHHHHHhhhhccCeeeccC Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDA---TGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHT 77 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~---~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~ 77 (445) +|.++|++| +.++++++++||.|+|+|+.++..... .......++++|+++||+++||++.++|++|+|++++++ T Consensus 33 ~i~~~i~~~--~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~yl~g~~~~~~~~ 110 (503) T protein:vir:59 33 MIQKLIDEH--NPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTNNRTSHAWHKLFVDQKTQYLVGEPVTFTSD 110 (503) T ss_pred HHHHHHHhh--cHHHHHHHHHHhccccchhhccchhcccccccccccccccceeecchHHHHHHHHHhhhhcCCeeeccC Confidence 788888887 468899999999999999888765432 223445678899999999999999999999999999999 Q ss_pred chHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecc- Q lcl|NC_021326. 78 DDEVIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE- 156 (445) Q Consensus 78 d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~- 156 (445) +++..+.|+.|++|+++.++.+++++++++|++|++||.|++|++++++++|.+++|+|++...+++.+++|+|..... T Consensus 111 d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~ 190 (503) T protein:vir:59 111 NKTLLEYVNELADDDFDDILNETVKNMSNKGIEYWHPFVDEEGEFDYVIFPAEEMIVVYKDNTRRDILFALRYYSYKGIM 190 (503) T ss_pred cHHHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEEeecCCCceEEEEEccceeEEEEeCCCCCceEEEEEEEEEecCC Confidence 9999999999999999999999999999999999999999999999999999999999999888999999999976543 Q ss_pred ----eeEEEEecceEEEEEEecceeeeccc----ccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHH Q lcl|NC_021326. 157 ----TKVEYWDKITVNYYVYENGSLIPDYS----NNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRR 228 (445) Q Consensus 157 ----~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~ 228 (445) .++++|+...+++|............ ...........+|++++||||+|+|++.|.|+|+++++|||+||++ T Consensus 191 ~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~~nn~~~~sd~~~~~~liDa~d~~ 270 (503) T protein:vir:59 191 GEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRVPIIPFKNNEEMVSDLKFYKDLIDNYDSI 270 (503) T ss_pred CceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceeccCCccceEEecCCCCCCcchhhhHHHHHHHHHH Confidence 36789999999888766544322211 1111223445789999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccc Q lcl|NC_021326. 229 LSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFS 308 (445) Q Consensus 229 ~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~ 308 (445) +|++++.++++++|+++++|.+.+...++...++..+++.++++++++|++++++.++++.+++.|++.|+.+|++|+++ T Consensus 271 ~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~ 350 (503) T protein:vir:59 271 TSSTMDSFSDFQQIVYVLKNYDGENPKEFTANLRYHSVIKVSGDGGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNS 350 (503) T ss_pred HHHHHHHHHHhcCCeeEeecCCccccchhhhhhhcccceeccCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCC Confidence 99999999999999999999998888888888899999999999999999999999999999999999999999999999 Q ss_pred cccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-----CcceEEEEeCCCCCCCHHHHHHHHH Q lcl|NC_021326. 309 SDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG-----EHKDVDISFNYNKVANTELQVQTAQ 383 (445) Q Consensus 309 ~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~-----~~~~i~v~f~~~~p~d~~~~~~~~~ 383 (445) ++.+++++||+|+++++++|.++|+++++.|+.+|++++++++++++..+ ...+++|+|++++|+|.++.+++++ T Consensus 351 ~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~~~~~~ 430 (503) T protein:vir:59 351 PETIGGGATGPALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPDKELTMTFTRTRIQNDSEIVQSLV 430 (503) T ss_pred cccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccceeEEeCCCCCCCHHHHHHHHH Confidence 99999999999999999999999999999999999999999999887643 3457899999999999999999999 Q ss_pred HH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCc-CCC Q lcl|NC_021326. 384 QS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDK-QSE 445 (445) Q Consensus 384 ~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~-~~~ 445 (445) ++ +|++|+||+++++|+++|+++|++||++|+++.++..+...+.......++++.+.+ +.+ T Consensus 431 kl~~~GiiS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 495 (503) T protein:vir:59 431 QGVTGGIMSKETAVARNPFVQDPEEELARIEEEMNQYAEMQGNLLDDEGGDDDLEEDDPNAGAAE 495 (503) T ss_pred HHHhCCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhccccCccCCCCCCCcCCCCCCccc Confidence 98 589999999999999999999999999999888877666554433222222211111 111 No 17 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=100.00 E-value=4.1e-94 Score=532.57 Aligned_cols=434 Identities=18% Similarity=0.279 Sum_probs=365.8 Q ss_pred ChHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLE-KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~-~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) +|.++|++|.. +.++|+++.+||+|+|+++.++... ....++++|+++||+++||++.++|++|+|++++++++ T Consensus 44 ~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~-----~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~ 118 (512) T protein:vir:97 44 EVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-----KEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDK 118 (512) T ss_pred HHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc-----cccccCcceeecchHHHHHHHHhhhhcccCceeccCCh Confidence 78899999875 5789999999999999987765432 33456788999999999999999999999999999999 Q ss_pred HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecc-- Q lcl|NC_021326. 80 EVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE-- 156 (445) Q Consensus 80 ~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~-- 156 (445) +..+.|++|++ |+++.++.+++++++++|++|+++|.|++|++++++++|.+++|+|++...+++.+++|+|..... T Consensus 119 ~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~~i~~~~p~~~~~iyd~~~~~~~~~~vr~~~~~~~~~ 198 (512) T protein:vir:97 119 DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDK 198 (512) T ss_pred HHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccc Confidence 99999999996 689999999999999999999999999999999999999999999999888899999999976432 Q ss_pred ------eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHH Q lcl|NC_021326. 157 ------TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLS 230 (445) Q Consensus 157 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s 230 (445) .++++|+...+++|........ .....+....+|+||.||||+|+|++.|.|+|+++++|||+||.++| T Consensus 199 ~~~~~~~~~~vyt~~~i~~~~~~~~~~~-----~~~~~~~~~~~~~~g~vPvv~~~nn~~~~gd~e~v~~liDa~d~~~S 273 (512) T protein:vir:97 199 TDEDEVFTVDLFTSHGVYRYLTSRTNGL-----KLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAES 273 (512) T ss_pred cccceEEEEEEEeCCcEEEEEecCCCcc-----cccccccccccccCcccceEeecCCCCCCCchhhhHHHHHHHHHHHH Confidence 3467888888887765543321 22344556789999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCcee--------------eccCCCceeeEeccCChHHHHHHHHHHHH Q lcl|NC_021326. 231 DLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAI--------------KVSDNGGVDTIQVEVPVENSKKYLDELYQ 296 (445) Q Consensus 231 ~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~l~~~~~~~~~~~~i~~l~~ 296 (445) ++++.++++++|+++++|.......++.. .+..+++ ..+++++++|++++.+.+++++++++|.+ T Consensus 274 ~~~~~~~~~~~~~lv~~G~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~ 352 (512) T protein:vir:97 274 DTANYMSDLNDAMLLIKGNLNLDPVEVRK-QKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNS 352 (512) T ss_pred HHHHHHHHhcCceeeeecCccCCchhhhh-hhhcccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHH Confidence 99999999999999999976655444332 1222222 12456889999999999999999999999 Q ss_pred HHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC------CcceEEEEeCCC Q lcl|NC_021326. 297 KIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG------EHKDVDISFNYN 370 (445) Q Consensus 297 ~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~------~~~~i~v~f~~~ 370 (445) .|+.+|++|+++++.+++++||+||++++++|.+++.++++.|+.+|++++++|+++++... +..+++++|+++ T Consensus 353 ~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~ 432 (512) T protein:vir:97 353 DIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRN 432 (512) T ss_pred HHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCC Confidence 99999999999999999999999999999999999999999999999999999999876432 445789999999 Q ss_pred CCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCc----CCC Q lcl|NC_021326. 371 KVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDK----QSE 445 (445) Q Consensus 371 ~p~d~~~~~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~----~~~ 445 (445) +|+|.++.++++++++|++|+||+++++|+++|+++|++||++|+++.++..........++..++++++++ ..| T Consensus 433 ~p~~~~e~~~~~~kl~giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (512) T protein:vir:97 433 LPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 511 (512) T ss_pred CCcCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcccccccc Confidence 999999999999999999999999999999999999999999999887766544333222222222111111 111 No 18 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=100.00 E-value=7.6e-94 Score=531.10 Aligned_cols=434 Identities=18% Similarity=0.269 Sum_probs=365.5 Q ss_pred ChHHHHHHHH-HHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHL-EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~-~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) .|.++|++|. .+.++|+++++||+|+|+++.++... ....++++||++||+++||++.++|++|+|++++++++ T Consensus 44 ~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~-----~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~ 118 (511) T protein:vir:93 44 EVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-----KEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDK 118 (511) T ss_pred HHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcC-----cccccCcceeecchHHHHHHHHhhhhcccCeeeccCCh Confidence 6999999997 46789999999999999988765443 33456788999999999999999999999999999999 Q ss_pred HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecc-- Q lcl|NC_021326. 80 EVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE-- 156 (445) Q Consensus 80 ~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~-- 156 (445) +..+.|+.|++ |+++.++.+++++++++|++|++||.|++|++++++++|.+++|+|++...+++.+++|+|..... T Consensus 119 ~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~~ 198 (511) T protein:vir:93 119 DVLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDK 198 (511) T ss_pred HHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccc Confidence 99999999986 689999999999999999999999999999999999999999999999888899999999975422 Q ss_pred ------eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHH Q lcl|NC_021326. 157 ------TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLS 230 (445) Q Consensus 157 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s 230 (445) .++++|+...+++|........ .....+....+|++|.||||+|+|++.|.|+|+++++|||+||.++| T Consensus 199 ~~~~~~~~~~iyt~~~i~~~~~~~~~~~-----~~~~~~~~~~~~~~g~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S 273 (511) T protein:vir:93 199 TDEDEVFTVDLFTSHGVYRYLTSRTNGL-----KLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAES 273 (511) T ss_pred cccceEEEEEEEeCCcEEEEEecCCCcc-----ccccccccccccCCCccceEEecCCCCCCCchhhHHHHHHHHHHHHH Confidence 3467888888887765543321 22334456678999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCcee-------------eccCCCceeeEeccCChHHHHHHHHHHHHH Q lcl|NC_021326. 231 DLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAI-------------KVSDNGGVDTIQVEVPVENSKKYLDELYQK 297 (445) Q Consensus 231 ~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~ 297 (445) ++++.++++++|+++++|.......+... .+..+++ ...++++++|++++.+.++++.++++|.+. T Consensus 274 ~~~~~~~~~~~~~lv~~G~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~ 352 (511) T protein:vir:93 274 DTANYMSDLNDAMLLIKGNLNLDPVEVRK-QKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSD 352 (511) T ss_pred HHHHHHHHhhCcceeeecCcccCchhhcc-cccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHH Confidence 99999999999999999976554443322 1222222 235678899999999999999999999999 Q ss_pred HHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC------CcceEEEEeCCCC Q lcl|NC_021326. 298 IMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG------EHKDVDISFNYNK 371 (445) Q Consensus 298 i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~------~~~~i~v~f~~~~ 371 (445) |+.+|++|+++++++++++||+||+++++++.+||.++++.|+.+|++++++++++++... +..+++++|++++ T Consensus 353 I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~ 432 (511) T protein:vir:93 353 IHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNL 432 (511) T ss_pred HHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccceEEeCCCC Confidence 9999999999999999999999999999999999999999999999999999999876543 4457899999999 Q ss_pred CCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCC----CCCCCCcCCC Q lcl|NC_021326. 372 VANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQ----KERSNDKQSE 445 (445) Q Consensus 372 p~d~~~~~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~----~~~~~d~~~~ 445 (445) |.|.++.+++++++.|++|+||+++++|+++|+++|++||++|+++.++..........++..+ +++.+.+.+| T Consensus 433 p~n~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (511) T protein:vir:93 433 PKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 510 (511) T ss_pred CCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCCCCCccccccccc Confidence 9999999999999999999999999999999999999999999987776554433222222221 1111111111 No 19 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=100.00 E-value=8.3e-94 Score=530.88 Aligned_cols=423 Identities=29% Similarity=0.496 Sum_probs=376.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCc-h Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTD-D 79 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d-~ 79 (445) .|.+||++|+.++++|.++++||+|+|+|+.++.....+......++++||++||+++||++.++|++|+|++|++++ + T Consensus 5 ~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~~~~~~ 84 (451) T protein:vir:10 5 KIRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFTYPVLFDIDNNK 84 (451) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheecccceeecCCcH Confidence 999999999999999999999999999999988777777777788899999999999999999999999999998755 5 Q ss_pred HHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCC--------CcEEEEEEccceeEEEEcCCCCCceEEEEEEE Q lcl|NC_021326. 80 EVIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEE--------GEFKLFRVPAEQGIPIWTDKEHEELEAFIRMY 151 (445) Q Consensus 80 ~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~--------g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~ 151 (445) +..+.|+.|++|+++..+.+++++++++|.+|+++|+|++ |++++++++|++++|+|+++..+++.+++|+| T Consensus 85 ~~~~~~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~vydd~~~~~~~~~ir~~ 164 (451) T protein:vir:10 85 ELNEKVTDVLGNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPIYRNGIERELEAVIRYY 164 (451) T ss_pred HHHHHHHHHhccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEEEcCCCCCceEEEEEEE Confidence 6778899999999999999999999999999999999975 78999999999999999998889999999999 Q ss_pred eeecc----------eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHH Q lcl|NC_021326. 152 KLENE----------TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTL 221 (445) Q Consensus 152 ~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~l 221 (445) ..... .++++|+...++.|...... ...........+|+||+||||+|+|++.|.|+|+++++| T Consensus 165 ~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~------~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~~d~e~v~~l 238 (451) T protein:vir:10 165 IQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVS------CCGSQIEHITVQHRFNSVPFVEFSNNIKKQSDLSKYKKI 238 (451) T ss_pred EeeecccccccceEEEEEEEEeCCeEEEEEecccC------ccccccccccccCCCCeeeEEEeccCCCCCCchhhHHHH Confidence 65433 24667777777766544332 222334455678999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeecc-----CCCceeeEeccCChHHHHHHHHHHHH Q lcl|NC_021326. 222 IDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVS-----DNGGVDTIQVEVPVENSKKYLDELYQ 296 (445) Q Consensus 222 id~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~l~~~~~~~~~~~~i~~l~~ 296 (445) ||+||.++|++++.++++++|+++++|+..+...++...++..+++.++ ++++++|++++.+.+++++++++|.+ T Consensus 239 iDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~ 318 (451) T protein:vir:10 239 LDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFLKELKRYKTIKTETDSEGDSGGLKTMQIEIPTEARKIILEILKK 318 (451) T ss_pred HHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhHHHHhhCCeEEecCcCCccCCcceEEeecCCHHHHHHHHHHHHH Confidence 9999999999999999999999999999888888888888888888775 35789999999999999999999999 Q ss_pred HHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHH Q lcl|NC_021326. 297 KIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTE 376 (445) Q Consensus 297 ~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~ 376 (445) .|+.+|++|+++++++ ||+||+||++++.+|.+||.++++.|+++|++++++++++++.. +..+++++|++++|.|.+ T Consensus 319 ~I~~~s~~p~~~~~~~-gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~-d~~~i~i~f~~~~p~n~~ 396 (451) T protein:vir:10 319 QIYESGQGLQQDTENF-GNASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLGVT-DYKKIQQTYTRNMMSNDL 396 (451) T ss_pred HHHHHhCccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-CccceeEEecCCCCCCHH Confidence 9999999999998887 58999999999999999999999999999999999999999865 578899999999999999 Q ss_pred HHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCC Q lcl|NC_021326. 377 LQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGAD 431 (445) Q Consensus 377 ~~~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 431 (445) +.++++++++|++|+||+++++|+++|+++|++++++|+++..++..+..+.-.+ T Consensus 397 e~~~~~~kl~g~iS~et~~~~~p~v~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~ 451 (451) T protein:vir:10 397 EDADIATKSVGIIPTKIILRHHPWVDDVEEAEKLYLEEKKIQASKVSDDYNNFTE 451 (451) T ss_pred HHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCCC Confidence 9999999999999999999999999999999999998887766655443332222 No 20 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=100.00 E-value=7.9e-94 Score=531.01 Aligned_cols=434 Identities=18% Similarity=0.274 Sum_probs=365.4 Q ss_pred ChHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLE-KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~-~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) .|.++|++|.. +.++|+++++||.|+|++++++... ....++++||++||+++||++.++|++|+|++++++++ T Consensus 44 ~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~-----~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~ 118 (511) T protein:vir:99 44 EVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-----KEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDK 118 (511) T ss_pred HHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc-----cccccCcceeecchHHHHHHHHHhhhcccCceeecCch Confidence 68899999975 6789999999999999988765432 34456788999999999999999999999999999999 Q ss_pred HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecc-- Q lcl|NC_021326. 80 EVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE-- 156 (445) Q Consensus 80 ~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~-- 156 (445) +..+.|+.|++ |+++.++.+++++++++|++|+++|.|++|++++++++|.+++|+|+++..+++.+++|+|..... T Consensus 119 ~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~~~~~~ 198 (511) T protein:vir:99 119 DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDK 198 (511) T ss_pred HHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccc Confidence 99999999985 689999999999999999999999999999999999999999999999888899999999976422 Q ss_pred ------eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHH Q lcl|NC_021326. 157 ------TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLS 230 (445) Q Consensus 157 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s 230 (445) .++++|+...++.|....... ......+....+|+||.||||+|+|++.|.|+|+++++|||+||.++| T Consensus 199 ~~~~~~~~~~vyt~~~i~~~~~~~~~~-----~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S 273 (511) T protein:vir:99 199 TDEDEVFTVDLFTSHGVYRYLTSRTNG-----LKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAES 273 (511) T ss_pred CccceEEEEEEEeCCcEEEEEecCCcc-----ccccccccccccCCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHH Confidence 246788888888776554322 122344556779999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceee-------------ccCCCceeeEeccCChHHHHHHHHHHHHH Q lcl|NC_021326. 231 DLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIK-------------VSDNGGVDTIQVEVPVENSKKYLDELYQK 297 (445) Q Consensus 231 ~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~l~~~~~~~~~~~~i~~l~~~ 297 (445) ++++.++++++|+++++|.......+... .+..+++. ..++++++|++++.+.+++++++++|.+. T Consensus 274 ~~~~~~~~~~~~~lv~~G~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~ 352 (511) T protein:vir:99 274 DTANYMSDLNDAMLLIKGNLNLDPVEVRK-QKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSD 352 (511) T ss_pred HHHHHHHHhhchhhhhccCcccCchhhcc-cccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHH Confidence 99999999999999999975544333221 22222222 35578899999999999999999999999 Q ss_pred HHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC------CcceEEEEeCCCC Q lcl|NC_021326. 298 IMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG------EHKDVDISFNYNK 371 (445) Q Consensus 298 i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~------~~~~i~v~f~~~~ 371 (445) |+.+|++|+++++++++++||+||++++++|..||.++++.|+.+|++++++++++++... +..+++++|++++ T Consensus 353 I~~~s~~P~~~~~~~~gn~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~ 432 (511) T protein:vir:99 353 IHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNL 432 (511) T ss_pred HHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCC Confidence 9999999999999999999999999999999999999999999999999999999987543 3457899999999 Q ss_pred CCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCC---CCC--CCCCcCCC Q lcl|NC_021326. 372 VANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQ---QKE--RSNDKQSE 445 (445) Q Consensus 372 p~d~~~~~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~---~~~--~~~d~~~~ 445 (445) |.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++.++......+...++.. +++ +..++++| T Consensus 433 p~n~~e~~~~~~kl~GiiS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~e 511 (511) T protein:vir:99 433 PKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKNMYQDPRNINDDEQDDSTKDSIDKKE 511 (511) T ss_pred CcCHHHHHHHHHHHhccCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCCCCCCCCcCcccccC Confidence 999999999999999999999999999999999999999999998877665443332222211 111 11122222 No 21 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=100.00 E-value=2.6e-93 Score=528.16 Aligned_cols=434 Identities=18% Similarity=0.276 Sum_probs=364.7 Q ss_pred ChHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLE-KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~-~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) .|.++|++|.. +.++|+++++||.|+|+++.++... ....++++||++||+++||++.++|++|+||+++++++ T Consensus 44 ~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~-----~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~ 118 (511) T protein:vir:96 44 EVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-----KEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDK 118 (511) T ss_pred HHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcC-----cccccCcceeecchHHHHHHHHHhhhccCCceeecCch Confidence 69999999975 6789999999999999988765432 34456788999999999999999999999999999999 Q ss_pred HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecc-- Q lcl|NC_021326. 80 EVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE-- 156 (445) Q Consensus 80 ~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~-- 156 (445) +.++.|+.|++ |+++..+.+++++++++|++|+++|.|++|++++++++|.+++|+|+++..+++++++|+|..... T Consensus 119 ~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~ 198 (511) T protein:vir:96 119 DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDK 198 (511) T ss_pred HHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccc Confidence 99999999996 689999999999999999999999999999999999999999999999888899999999976422 Q ss_pred ------eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHH Q lcl|NC_021326. 157 ------TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLS 230 (445) Q Consensus 157 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s 230 (445) .++++|+...++.|........ .....+....+|+||.||||+|+|++.|.|+|+++++|||+||.++| T Consensus 199 ~~~~~~~~~~iyt~~~i~~~~~~~~~~~-----~~~~~~~~~~~~~~~~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S 273 (511) T protein:vir:96 199 TDEDEVFTVDLFTSHGVYRYLTSRTNGL-----KLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAES 273 (511) T ss_pred cccceEEEEEEEeCCcEEEEEecCCCcc-----cccccccccccccCCceeeEEecCCCCCCCchhhhHHHHHHHHHHHH Confidence 2467888888887765543221 22334456679999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCcee-------------eccCCCceeeEeccCChHHHHHHHHHHHHH Q lcl|NC_021326. 231 DLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAI-------------KVSDNGGVDTIQVEVPVENSKKYLDELYQK 297 (445) Q Consensus 231 ~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~ 297 (445) ++++.++++++|+++++|.......+... .+..+++ ...++++++|++++.+.+++++++++|.+. T Consensus 274 ~~~~~~~~~~~~~lv~~g~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~ 352 (511) T protein:vir:96 274 DTANYMSDLNDAMLLIKGNLNLDPVEVRK-QKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSD 352 (511) T ss_pred HHHHHHHHhhCceeeeecCccCCchhhcc-cccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHH Confidence 99999999999999999976544433322 1112222 234578899999999999999999999999 Q ss_pred HHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC------CcceEEEEeCCCC Q lcl|NC_021326. 298 IMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG------EHKDVDISFNYNK 371 (445) Q Consensus 298 i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~------~~~~i~v~f~~~~ 371 (445) |+.+|++|+++++++++++||+||+++++++.+++.++++.|+.+|++++++++++++... +..+++++|++++ T Consensus 353 I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~ 432 (511) T protein:vir:96 353 IHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNL 432 (511) T ss_pred HHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccccccceEEeCCCC Confidence 9999999999999999999999999999999999999999999999999999999876543 4467899999999 Q ss_pred CCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCC----CCCCCCCCCcCCC Q lcl|NC_021326. 372 VANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADG----AQQKERSNDKQSE 445 (445) Q Consensus 372 p~d~~~~~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~----~~~~~~~~d~~~~ 445 (445) |.|.++.+++++++.|++|+||+++++|+++|+++|++||++|+++..+..........++ ..++++.+.+++| T Consensus 433 p~n~~e~~~~~~kl~G~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (511) T protein:vir:96 433 PKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 510 (511) T ss_pred CCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCccccccccc Confidence 9999999999999999999999999999999999999999999987766554333222222 2222222222222 No 22 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=100.00 E-value=3e-93 Score=527.86 Aligned_cols=434 Identities=18% Similarity=0.272 Sum_probs=364.0 Q ss_pred ChHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLE-KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~-~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) .|.++|++|.. +.++|+++.+||+|+|+++.++... ....++++|+++||+++||++.++|++|+|++++++++ T Consensus 44 ~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~-----~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~ 118 (511) T protein:vir:10 44 EVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-----KEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDK 118 (511) T ss_pred HHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc-----cccccCcceeecchHHHHHHHHhhhhcccCceeecCch Confidence 69999999975 5799999999999999998765442 34456788999999999999999999999999999999 Q ss_pred HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecc-- Q lcl|NC_021326. 80 EVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE-- 156 (445) Q Consensus 80 ~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~-- 156 (445) +..+.|+.|++ |+++..+.+++++++++|++|+++|.|++|++++++++|.+++|+|+++..+++.+++|+|..... T Consensus 119 ~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~dedg~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~ 198 (511) T protein:vir:10 119 DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDK 198 (511) T ss_pred HHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccc Confidence 99999999985 689999999999999999999999999999999999999999999999888899999999976432 Q ss_pred ------eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHH Q lcl|NC_021326. 157 ------TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLS 230 (445) Q Consensus 157 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s 230 (445) .++++|+...++.|........ .....+....+|+||.||||+|+|++.|.|+|+++++|||+||.++| T Consensus 199 ~~~~~~~~~~iyt~~~i~~~~~~~~~~~-----~~~~~~~~~~~~~~~~vPvv~f~nn~~g~gd~e~v~~liDa~d~~~S 273 (511) T protein:vir:10 199 TDEDEVFTVDLFTSHGVYRYLTSRTNGL-----KLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAES 273 (511) T ss_pred CccceEEEEEEEeCCcEEEEEecCCCcc-----cccccccccccccCcceeEEEecCCCCCCCchhhhHHHHHHHHHHHH Confidence 2467888888877765543221 12334456678999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceee-------------ccCCCceeeEeccCChHHHHHHHHHHHHH Q lcl|NC_021326. 231 DLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIK-------------VSDNGGVDTIQVEVPVENSKKYLDELYQK 297 (445) Q Consensus 231 ~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~l~~~~~~~~~~~~i~~l~~~ 297 (445) ++++.++++++|+++++|.......+... .+..+++. ..++++++|++++.+.+++++++++|.+. T Consensus 274 ~~~~~~~~~~~~~lv~~g~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~ 352 (511) T protein:vir:10 274 DTANYMSDLNDAMLLIKGNLNLDPVEVRK-QKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSD 352 (511) T ss_pred HHHHHHHHhhCceeeeeccccCCchhhcc-chhccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHH Confidence 99999999999999999976544433322 22222222 24568899999999999999999999999 Q ss_pred HHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC------CcceEEEEeCCCC Q lcl|NC_021326. 298 IMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG------EHKDVDISFNYNK 371 (445) Q Consensus 298 i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~------~~~~i~v~f~~~~ 371 (445) |+.+|++|+++++++++++||+||+++++++.+|+.++++.|+.+|++++++++++++... +..+++++|++++ T Consensus 353 I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~ 432 (511) T protein:vir:10 353 IHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNL 432 (511) T ss_pred HHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccccceeeEEeCCCC Confidence 9999999999999999999999999999999999999999999999999999999876532 4567899999999 Q ss_pred CCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCC----CCCCCCCCCCCCcCCC Q lcl|NC_021326. 372 VANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGG----ADGAQQKERSNDKQSE 445 (445) Q Consensus 372 p~d~~~~~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~----~~~~~~~~~~~d~~~~ 445 (445) |+|.++.++++++++|++|+||+++++|+++|+++|++||++|+++..+......... .++..++++.+++++| T Consensus 433 p~d~~~~~~~~~kl~G~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (511) T protein:vir:10 433 PKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 510 (511) T ss_pred CcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCCCCCcccCccccc Confidence 9999999999999999999999999999999999999999999987766543322211 1222222222222222 No 23 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=100.00 E-value=2.2e-93 Score=528.57 Aligned_cols=434 Identities=18% Similarity=0.277 Sum_probs=364.2 Q ss_pred ChHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLE-KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~-~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) .|.++|++|.. +.++|+++.+||+|+|+++.++... ....++++||++||+++||++.++|++|+||+++++++ T Consensus 44 ~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~-----~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~ 118 (511) T protein:vir:96 44 EVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-----KEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDK 118 (511) T ss_pred HHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcc-----cccccCcceeecchHHHHHHHHhhhhcccCceeecCch Confidence 68999999974 6789999999999999987665432 34556788999999999999999999999999999999 Q ss_pred HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecc-- Q lcl|NC_021326. 80 EVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE-- 156 (445) Q Consensus 80 ~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~-- 156 (445) +..+.|+.|++ |+++.++.++++.++++|++|+++|.|++|++++++++|.+++|+|++...+++.+++|+|..... T Consensus 119 ~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~ 198 (511) T protein:vir:96 119 DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDK 198 (511) T ss_pred HHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccc Confidence 99999999995 789999999999999999999999999999999999999999999999888899999999976432 Q ss_pred ------eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHH Q lcl|NC_021326. 157 ------TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLS 230 (445) Q Consensus 157 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s 230 (445) .++++|+...++.|........ ..........+|++|.||||+|+|++.|.|+|+++++|||+||.++| T Consensus 199 ~~~~~~~~~~vyt~~~i~~~~~~~~~~~-----~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S 273 (511) T protein:vir:96 199 TDEDEVFTVDLFTSHGVYRYLTNRTNGL-----KLTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAES 273 (511) T ss_pred cccceEEEEEEEeCCcEEEEEecCCCcc-----cccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHHHHHHH Confidence 2467888888877765543221 22234556789999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceee-------------ccCCCceeeEeccCChHHHHHHHHHHHHH Q lcl|NC_021326. 231 DLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIK-------------VSDNGGVDTIQVEVPVENSKKYLDELYQK 297 (445) Q Consensus 231 ~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~l~~~~~~~~~~~~i~~l~~~ 297 (445) ++++.++++++|+++++|.......+... ....+++. ..++++++|++++.+.+++++++++|.+. T Consensus 274 ~~~~~~~~~~~~~lv~~G~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~ 352 (511) T protein:vir:96 274 DTANYMSDLNDAMLLIKGNLNLDPVEVRK-QKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSD 352 (511) T ss_pred HHHHHHHHhhcchhheecCccCCchhhcc-cccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHH Confidence 99999999999999999976544433322 22222222 24467899999999999999999999999 Q ss_pred HHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC------CcceEEEEeCCCC Q lcl|NC_021326. 298 IMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG------EHKDVDISFNYNK 371 (445) Q Consensus 298 i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~------~~~~i~v~f~~~~ 371 (445) |+.+|++|+++++++++++||+||++++++|.+|+.++++.|+.+|++++++++++++... +..+++++|++++ T Consensus 353 I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~ 432 (511) T protein:vir:96 353 IHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNL 432 (511) T ss_pred HHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCC Confidence 9999999999999999999999999999999999999999999999999999999887532 4457899999999 Q ss_pred CCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCC----CCcCCC Q lcl|NC_021326. 372 VANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERS----NDKQSE 445 (445) Q Consensus 372 p~d~~~~~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~----~d~~~~ 445 (445) |.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++..+..........++..+++++ ++..+| T Consensus 433 p~n~~e~~d~~~kl~G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 510 (511) T protein:vir:96 433 PKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 510 (511) T ss_pred CcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCCccCccccc Confidence 99999999999999999999999999999999999999999999887665544332222222222111 111111 No 24 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=100.00 E-value=2.2e-93 Score=528.57 Aligned_cols=434 Identities=18% Similarity=0.277 Sum_probs=364.2 Q ss_pred ChHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLE-KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~-~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) .|.++|++|.. +.++|+++.+||+|+|+++.++... ....++++||++||+++||++.++|++|+||+++++++ T Consensus 44 ~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~-----~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~ 118 (511) T protein:vir:78 44 EVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR-----KEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDK 118 (511) T ss_pred HHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcc-----cccccCcceeecchHHHHHHHHhhhhcccCceeecCch Confidence 68999999974 6789999999999999987665432 34556788999999999999999999999999999999 Q ss_pred HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecc-- Q lcl|NC_021326. 80 EVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE-- 156 (445) Q Consensus 80 ~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~-- 156 (445) +..+.|+.|++ |+++.++.++++.++++|++|+++|.|++|++++++++|.+++|+|++...+++.+++|+|..... T Consensus 119 ~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~ 198 (511) T protein:vir:78 119 DVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTKPIDK 198 (511) T ss_pred HHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccc Confidence 99999999995 789999999999999999999999999999999999999999999999888899999999976432 Q ss_pred ------eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHH Q lcl|NC_021326. 157 ------TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLS 230 (445) Q Consensus 157 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s 230 (445) .++++|+...++.|........ ..........+|++|.||||+|+|++.|.|+|+++++|||+||.++| T Consensus 199 ~~~~~~~~~~vyt~~~i~~~~~~~~~~~-----~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~liDa~~~~~S 273 (511) T protein:vir:78 199 TDEDEVFTVDLFTSHGVYRYLTNRTNGL-----KLTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAES 273 (511) T ss_pred cccceEEEEEEEeCCcEEEEEecCCCcc-----cccccccccccCcCcccceEEecCCCCCCCchhhhHHHHHHHHHHHH Confidence 2467888888877765543221 22234556789999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceee-------------ccCCCceeeEeccCChHHHHHHHHHHHHH Q lcl|NC_021326. 231 DLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIK-------------VSDNGGVDTIQVEVPVENSKKYLDELYQK 297 (445) Q Consensus 231 ~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~l~~~~~~~~~~~~i~~l~~~ 297 (445) ++++.++++++|+++++|.......+... ....+++. ..++++++|++++.+.+++++++++|.+. T Consensus 274 ~~~~~~~~~~~~~lv~~G~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~ 352 (511) T protein:vir:78 274 DTANYMSDLNDAMLLIKGNLNLDPVEVRK-QKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSD 352 (511) T ss_pred HHHHHHHHhhcchhheecCccCCchhhcc-cccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHH Confidence 99999999999999999976544433322 22222222 24467899999999999999999999999 Q ss_pred HHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC------CcceEEEEeCCCC Q lcl|NC_021326. 298 IMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG------EHKDVDISFNYNK 371 (445) Q Consensus 298 i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~------~~~~i~v~f~~~~ 371 (445) |+.+|++|+++++++++++||+||++++++|.+|+.++++.|+.+|++++++++++++... +..+++++|++++ T Consensus 353 I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~ 432 (511) T protein:vir:78 353 IHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNL 432 (511) T ss_pred HHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceEEeCCCC Confidence 9999999999999999999999999999999999999999999999999999999887532 4457899999999 Q ss_pred CCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCC----CCcCCC Q lcl|NC_021326. 372 VANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERS----NDKQSE 445 (445) Q Consensus 372 p~d~~~~~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~----~d~~~~ 445 (445) |.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++..+..........++..+++++ ++..+| T Consensus 433 p~n~~e~~d~~~kl~G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 510 (511) T protein:vir:78 433 PKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 510 (511) T ss_pred CcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCCCCCCccCccccc Confidence 99999999999999999999999999999999999999999999887665544332222222222111 111111 No 25 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=100.00 E-value=2.8e-93 Score=527.99 Aligned_cols=433 Identities=16% Similarity=0.181 Sum_probs=371.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.++|++|..+.++|+++.+||+|+|+|++++.. ...++++||++||++.||++.++|+||+|+++++++++ T Consensus 20 ~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~-------~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~~~~ 92 (499) T protein:vir:10 20 AINYAIRELQNRKKRLDKLSDYYNGKQEIEKHEFD-------NATVEAANVMVNHAKYITDMNVGFMTGNPVKYVAEKGK 92 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccchhcCCcC-------cCCCCcceeecchHHHHHHHHhhhhcccCceeecCChh Confidence 79999999999999999999999999999876543 33467889999999999999999999999999999999 Q ss_pred HHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCC-----------------cEEEEEEccceeEEEEcCCCCC Q lcl|NC_021326. 81 VIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEG-----------------EFKLFRVPAEQGIPIWTDKEHE 142 (445) Q Consensus 81 ~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g-----------------~~~i~~~~p~~~~~v~d~~~~~ 142 (445) ..+.++++++ |+++.++.++++.++++|++|+++|.+++| +++++.++|.+++++|++.... T Consensus 93 ~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~ 172 (499) T protein:vir:10 93 NIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNTELKIEVIDPRATVVVCDDTVEH 172 (499) T ss_pred HHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEecccccccccccccccccccccceEEEEEcccceEEEecCCCCc Confidence 8888988885 689999999999999999999999999887 4678999999999999998888 Q ss_pred ceEEEEEEEeeecc------eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHH Q lcl|NC_021326. 143 ELEAFIRMYKLENE------TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIF 216 (445) Q Consensus 143 ~~~~~v~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~ 216 (445) ++.+++|+|...+. .++++|++..++.|........ ..+..+....+|+||.||||+|.|++.|.|+|+ T Consensus 173 ~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~-----~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~d~e 247 (499) T protein:vir:10 173 DPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEV-----SANDPIVYDGENLFGAVPIIEFRNNEERQGDFE 247 (499) T ss_pred ceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccc-----cCcceecccccCCCCccceEEecCCCCCCCchH Confidence 89999999876532 3578899999888876654322 233455667789999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeec--cCCCceeeEeccCChHHHHHHHHHH Q lcl|NC_021326. 217 MYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKV--SDNGGVDTIQVEVPVENSKKYLDEL 294 (445) Q Consensus 217 ~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~l~~~~~~~~~~~~i~~l 294 (445) ++++|||+||.++|++++.++++++|+++++|+..+...+....++...++.+ +++++++|++++.+.+++++++++| T Consensus 248 ~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l 327 (499) T protein:vir:10 248 QLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDKDDIQRLKRGAIEAPPREEGADIEWLTKSFDETQVNLLSQSI 327 (499) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccchhhhhhhcceeccCCCCCCcceEEeccCCHHHHHHHHHHH Confidence 99999999999999999999999999999999887766665555555555554 4667899999999999999999999 Q ss_pred HHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC---CcceEEEEeCCCC Q lcl|NC_021326. 295 YQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG---EHKDVDISFNYNK 371 (445) Q Consensus 295 ~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~---~~~~i~v~f~~~~ 371 (445) .+.|+.+|++|+++++.+++++||+|+++++++|.+|+.++++.|+.+|++++++++++++..+ ++.+++++|++++ T Consensus 328 ~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~i~f~~~~ 407 (499) T protein:vir:10 328 ENDIHKISYVPNMNDEKFMGNVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIKGANDDASGCKISLVANI 407 (499) T ss_pred HHHHHHHhCcccCCchhhcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCC Confidence 9999999999999999999999999999999999999999999999999999999999987654 5568899999999 Q ss_pred CCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCC------CCCCCCCCCCCCcCCC Q lcl|NC_021326. 372 VANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGG------ADGAQQKERSNDKQSE 445 (445) Q Consensus 372 p~d~~~~~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~------~~~~~~~~~~~d~~~~ 445 (445) |.|.++.+++++++.|++|+||+++++|+++|+++|++||++|+++.++.......+. .++..++.+++++++. T Consensus 408 p~n~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 487 (499) T protein:vir:10 408 PSNLSDVVNNVKNADGIIPRKYTYSWLPDVDNPQDVIDEMNQQDAETIKKNQEALRGQDPDRLELEDKQDDSSENDKEAG 487 (499) T ss_pred CCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCCCcccCCCCCCCc Confidence 9999999999999999999999999999999999999999999987665443322111 1222222222222222 No 26 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=100.00 E-value=8e-93 Score=525.49 Aligned_cols=435 Identities=39% Similarity=0.710 Sum_probs=377.2 Q ss_pred ChHHHHHHHH--HHHHHHHHHHHHhcCCCccccccccccccc--cccccccccccccchHHHHHHHHHhhhhccCeeecc Q lcl|NC_021326. 1 MIVRYIKQHL--EKLPEISIGQEYYEQRPDIVKEPKPVDATG--AVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH 76 (445) Q Consensus 1 ~l~~~i~~~~--~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~--~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~ 76 (445) -+.++|+.|. .++++|+++++||+|+|++++++....... .....++++|+++||++.||++.++|++|+|+++++ T Consensus 23 ~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~~~~~Ivd~~~~~l~g~p~~~~~ 102 (479) T protein:vir:79 23 NLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINNYHKLLVDQKVGYSVGNPIVFNA 102 (479) T ss_pred HHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccccccCcceeecchHHHHHHHHHhhhhcCCceecc Confidence 3444455542 267899999999999999999877654333 344567889999999999999999999999999999 Q ss_pred CchHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeec- Q lcl|NC_021326. 77 TDDEVIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN- 155 (445) Q Consensus 77 ~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~- 155 (445) +++...+.|+.|++|+++..+.++++.++++|++|+++|.|++|++++++++|.+++|+|++...+++.+++|+|.... T Consensus 103 ~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~y~~~~~ 182 (479) T protein:vir:79 103 DDDNLTKLLNDLLGEEFDDTITELYLNASNKGVEWLHPYINRKGEFKYVIIPAEEAIPIWDSKRQRELVAFIRFYYIEDI 182 (479) T ss_pred CCHHHHHHHHHHHhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEccceeEEEEeCCCCCceEEEEEEEEEeec Confidence 9999999999999999999999999999999999999999999999999999999999999988889999999997643 Q ss_pred ----ceeEEEEecceEEEEEEecceeeeccc---------ccccccccccccccccccceEEecCCCCcCccHHHHHHHH Q lcl|NC_021326. 156 ----ETKVEYWDKITVNYYVYENGSLIPDYS---------NNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLI 222 (445) Q Consensus 156 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~li 222 (445) ..++++|+...+++|............ ...........+|+||.||||+|+|++.|.|+|+++++|| T Consensus 183 ~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~sd~~~v~~li 262 (479) T protein:vir:79 183 DGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFKNNEKCVSDLTFYKSLI 262 (479) T ss_pred CCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCcccEEEecCCCCCCcchhhhHHHH Confidence 236788999998888766544322111 1122334456789999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHh Q lcl|NC_021326. 223 DAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFG 302 (445) Q Consensus 223 d~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s 302 (445) |+||.++|++++.++++++|+++++|.+.+..+++...++..+++.++++++++|++++.+.++++++++.|++.|+.+| T Consensus 263 Da~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s 342 (479) T protein:vir:79 263 DIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEFIDNIRYYKSIKVDGGGGVDKLEINIPVEAKKELLDRLEKNIIIFG 342 (479) T ss_pred HHHHHHHHHHHHHHHHhhCceeeeecCCccccccchhhhhhccceecCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHh Confidence 99999999999999999999999999987777777778888899999999999999999999999999999999999999 Q ss_pred CccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC----CcceEEEEeCCCCCCCHHHH Q lcl|NC_021326. 303 QAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG----EHKDVDISFNYNKVANTELQ 378 (445) Q Consensus 303 ~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~----~~~~i~v~f~~~~p~d~~~~ 378 (445) ++|+++++.+ |++||+|+++++++|.++|..+++.|+.+|++++++++++++..+ +..+++|+|++++|.|.++. T Consensus 343 ~~p~~~~~~~-gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~i~f~~~~p~~~~~~ 421 (479) T protein:vir:79 343 QGVNPESQNT-GDKSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISGNKSYDYKTVQITFNHSMIINEAEK 421 (479) T ss_pred Cccccccccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHH Confidence 9999988876 679999999999999999999999999999999999999987654 55788999999999999999 Q ss_pred HHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCC Q lcl|NC_021326. 379 VQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQK 436 (445) Q Consensus 379 ~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 436 (445) ++++++++|++|+||+++++|+++|+++|++||++|+++..+......+......++. T Consensus 422 a~~~~kl~g~iS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~e~ 479 (479) T protein:vir:79 422 IDMAAKSTGIVSDETIVSNHPWVEDVNDELERLKKQEDTQKEYDDLIPNNQDGVIDET 479 (479) T ss_pred HHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhccCcccCCCcCcC Confidence 9999999999999999999999999999999999999876665544433222222222 No 27 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=100.00 E-value=1.8e-92 Score=523.52 Aligned_cols=422 Identities=21% Similarity=0.273 Sum_probs=366.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.+||++|..+++||.++++||+|+|+|++++.. ...++++|+++||+++||++.++|++|+|++++++++. T Consensus 21 ~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~-------~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~d~~ 93 (452) T protein:vir:36 21 VVTKFMEKHKLEVARYEYLKNMYLGIMAIDDEPAK-------DSWKPDNRLAVNFTKYIVDTFTGYFNGIPVKKSHSDKE 93 (452) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccccccCccc-------cccCccceeecchHHHHHHHHhhhhcccCceeecCChh Confidence 99999999999999999999999999999876543 34467889999999999999999999999999999999 Q ss_pred HHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeec-cee Q lcl|NC_021326. 81 VIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN-ETK 158 (445) Q Consensus 81 ~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~-~~~ 158 (445) ..+.|+.+++ |+++..+.+++++++++|++|+++|.|++|++++++++|.+++|+|++...+++.+++|+|...+ ..+ T Consensus 94 ~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~ 173 (452) T protein:vir:36 94 ILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDEDTQTNVVYNSPENMFMVYDDTVKQEPLFAVRYGVDEDKKLQ 173 (452) T ss_pred HHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEE Confidence 9999999886 78999999999999999999999999999999999999999999999988889999999987544 446 Q ss_pred EEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 159 VEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKD 238 (445) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~ 238 (445) +++|+...++.+..... ++......+|+||+||||+|+|++.|.|+|+++++|+|+||+++|++++.+++ T Consensus 174 ~~vyt~~~i~~~~~~~~----------~~~~~~~~~~~~g~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~ 243 (452) T protein:vir:36 174 GEVYTLLETIKISGEND----------EISFGEGTYNPYPDLPVVEFYFNEERMSIFESVISLVNAFNKAISEKANDVDY 243 (452) T ss_pred EEEEecCeEEEEEEcCC----------ceEEecceeccCCcccEEEecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHH Confidence 78898888877754432 33445567899999999999999999999999999999999999999999999 Q ss_pred hcCCeeEEecCCcccchhHHHhhhhCceeeccCC-----CceeeEeccCChHHHHHHHHHHHHHHHHHhCcccccccccc Q lcl|NC_021326. 239 SNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDN-----GGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFG 313 (445) Q Consensus 239 ~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~ 313 (445) +++|+++++|+..+. +....++..+++.++.+ ++++|++++.+.++++++++.+.++|+.+|++|+++++++ T Consensus 244 ~~~p~~~~~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~- 320 (452) T protein:vir:36 244 FSDQYLTFLGAAVEE--EDLKNIRSNRVINYYADGEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESF- 320 (452) T ss_pred hcCceeEeecCCcCc--hhhhhhhhcceEEecCCCCccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccccCcccc- Confidence 999999999987644 33445566667776543 4689999999999999999999999999999999988887 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC---CcceEEEEeCCCCCCCHHHHHHHHHHHhccCC Q lcl|NC_021326. 314 SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG---EHKDVDISFNYNKVANTELQVQTAQQSMGIVS 390 (445) Q Consensus 314 ~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~---~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s 390 (445) |++||+||++++++|.+||.++++.|+.+|++++++++++++..+ +..+++|+|++++|.|.++.++++++++|++| T Consensus 321 gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~a~~~~k~~g~iS 400 (452) T protein:vir:36 321 GSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVSNKDSWKDIEYTFTRNEPKDIKEQAETANILMGITS 400 (452) T ss_pred cCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCcCHHHHHHHHHHHhccCC Confidence 578999999999999999999999999999999999999887654 56688999999999999999999999999999 Q ss_pred hHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 391 HETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 391 ~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) +||+++++|+++|+++|++||++|+++..+...+...+ .++ .++..++|++ | T Consensus 401 ~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~-~~~-~~~~~~~~~~-e 452 (452) T protein:vir:36 401 QETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPS-EKG-TDTVVSETNE-E 452 (452) T ss_pred hHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccCC-CCc-ccccCccccC-C Confidence 99999999999999999999999998876655443222 222 2222222222 2 No 28 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=100.00 E-value=9.9e-92 Score=519.50 Aligned_cols=422 Identities=21% Similarity=0.261 Sum_probs=364.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.+||++|..+++|++++++||+|+|+|+.++.. ...++++|+++||+++||++.++|++|+|+++++++++ T Consensus 21 ~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~-------~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~d~~ 93 (453) T protein:vir:39 21 VVTKFMEKHRLEVARYEYLKNMYRGIMAIDAEPTK-------DLWKPDNRLTVNFTKYIVDTFTGYFNGIPVKKSHSDKE 93 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCchhcCCCc-------cccCccceeecchHHHHHHHHhhhhcccCceeccCChH Confidence 99999999999999999999999999999877643 34467889999999999999999999999999999999 Q ss_pred HHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeec-cee Q lcl|NC_021326. 81 VIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN-ETK 158 (445) Q Consensus 81 ~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~-~~~ 158 (445) ..+.|+++|. |+++..+.+++++++++|++|++||.|++|++++++++|.+++|+|++...+++.+++|++...+ ... T Consensus 94 ~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~ 173 (453) T protein:vir:39 94 TLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEETQTNVIYNTPENMFMVYDDTIKQEPLFAVRYGYDDDYKLY 173 (453) T ss_pred HHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEecCCCCCeEEEEEEEEEeCCeEEE Confidence 9999998886 78999999999999999999999999999999999999999999999888888999999986544 345 Q ss_pred EEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 159 VEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKD 238 (445) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~ 238 (445) +++|+...++++..... .+......+|++|.||||+|+|++.|+|+|+++++|||+||+++|++++.+++ T Consensus 174 ~~~yt~~~i~~~~~~~~----------~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~ 243 (453) T protein:vir:39 174 GEVYTKETTYALNGTMG----------FYNMTEQAPNPFDDLPVVEFYFNEERMSIFESVISLVNAFNKAISEKANDVDY 243 (453) T ss_pred EEEEeCCeEEEEEecCC----------ceeeecccccCCCceeEEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHH Confidence 78899888877654322 23334557899999999999999999999999999999999999999999999 Q ss_pred hcCCeeEEecCCcccchhHHHhhhhCceeec------cCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccc Q lcl|NC_021326. 239 SNELTYVLTNYDDQELPEFKRLLRYYGAIKV------SDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKF 312 (445) Q Consensus 239 ~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~ 312 (445) +++|+++++|...+. +....++..+++.+ +++++++|++++.+.+++++++++|++.|+.+|++|+++++.+ T Consensus 244 ~~~p~~~~~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~ 321 (453) T protein:vir:39 244 FSDQYLTFLGAAVEE--EDLKNIRSNRVINYYGESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESF 321 (453) T ss_pred hhCceeeeecCCCCc--hhhhhhhhcceeeecCCCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc Confidence 999999999976543 22334455555554 3568899999999999999999999999999999999988877 Q ss_pred cCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC---CcceEEEEeCCCCCCCHHHHHHHHHHHhccC Q lcl|NC_021326. 313 GSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG---EHKDVDISFNYNKVANTELQVQTAQQSMGIV 389 (445) Q Consensus 313 ~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~---~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~ 389 (445) |++||+||++++++|..|+.++++.|+.+|++++++++++++..+ +..+++|+|++++|.|.++.++++++++|++ T Consensus 322 -gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl~g~i 400 (453) T protein:vir:39 322 -GSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVSNKEAWKDIEYTFTRNEPKDIKEQAETANILMGIT 400 (453) T ss_pred -cCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCcCHHHHHHHHHHHhccC Confidence 578999999999999999999999999999999999999987654 5568899999999999999999999999999 Q ss_pred ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcC Q lcl|NC_021326. 390 SHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQ 443 (445) Q Consensus 390 s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 443 (445) |+||+++++|+++|+++|++||++|+++..+..+....+... ..+.....++| T Consensus 401 s~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~-~~~~~~~~~~e 453 (453) T protein:vir:39 401 SQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPSEKG-TDTVVPETNEE 453 (453) T ss_pred ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCCCCC-CCCCCCCcCCC Confidence 999999999999999999999999998877665544333222 22222222222 No 29 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=100.00 E-value=9e-92 Score=519.73 Aligned_cols=429 Identities=19% Similarity=0.224 Sum_probs=361.2 Q ss_pred ChHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLE-KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~-~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) +|+++|++|.. +.+||+++.+||+|+|+.+..+. ......++++|+++||+++||++.++|++|+||+++++++ T Consensus 43 ~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~-----~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~ 117 (501) T protein:vir:27 43 LLKNFINHHKLRQAPRIQELLDYARGENHDVLQFG-----RRKDREMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDN 117 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccC-----ccCccccccceeccchHHHHHHHHhhhhcccCeeEecCCc Confidence 79999999975 57899999999999864332221 1234556788999999999999999999999999998764 Q ss_pred ----HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeee Q lcl|NC_021326. 80 ----EVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLE 154 (445) Q Consensus 80 ----~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~ 154 (445) .+.+.|++|+. |+++..+.+++++++++|++|++||.+++|++++++++|.+++|+|++...+++.+++|+|..+ T Consensus 118 ~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~ 197 (501) T protein:vir:27 118 DNNSQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEYDETRIKRLNPLETFVIYDNSLEDNSIAAVRYYNRG 197 (501) T ss_pred cchHHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCCCceEEEEEccceeEEEecCCCCCceEEEEEEEEee Confidence 45566777774 7899999999999999999999999999999999999999999999998888999999999864 Q ss_pred cc----eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHH Q lcl|NC_021326. 155 NE----TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLS 230 (445) Q Consensus 155 ~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s 230 (445) .. ..+++|+...++++.... ++++....+|+||+||||+|+|++.|.|+|+++++|||+||.++| T Consensus 198 ~~~~~~~~~~vyt~~~v~~~~~~~-----------~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S 266 (501) T protein:vir:27 198 TLQNAKDVVEIYTNEHIYTLDASD-----------DFNEISVTTHAFGTVPITEFLNNVDGIGDYETELYLIDLYDSAES 266 (501) T ss_pred ecCCcEEEEEEEeCCeEEEEEeCC-----------ceeeccccccCCCcccEEEecCCCCCCCchhhhHHHHHHHHHHHH Confidence 43 356788887777665332 334556688999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeecc---------CCCceeeEeccCChHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 231 DLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVS---------DNGGVDTIQVEVPVENSKKYLDELYQKIMLF 301 (445) Q Consensus 231 ~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~ 301 (445) ++++.++++++|+++++|...+..++....++..+++.+. .+++++|++++.+.++++.++++|++.|+.+ T Consensus 267 ~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~ 346 (501) T protein:vir:27 267 DTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIF 346 (501) T ss_pred HHHHHHHHhcCceeeeecCccCCcccchhhhhhcCceeecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHH Confidence 9999999999999999998777666666666666666653 2457899999999999999999999999999 Q ss_pred hCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-----CcceEEEEeCCCCCCCHH Q lcl|NC_021326. 302 GQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG-----EHKDVDISFNYNKVANTE 376 (445) Q Consensus 302 s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~-----~~~~i~v~f~~~~p~d~~ 376 (445) |++|+++++++++++||+||++++++|.+|+..+++.|+.+|++++++++++++..+ +..+++|+|++++|.|.+ T Consensus 347 s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~ 426 (501) T protein:vir:27 347 TNIPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLN 426 (501) T ss_pred hCCcccCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHH Confidence 999999999999999999999999999999999999999999999999999987653 345789999999999999 Q ss_pred HHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHh--hhccccCCCCCCC-CCCCCCCcCCC Q lcl|NC_021326. 377 LQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQ--LPNLDDGGADGAQ-QKERSNDKQSE 445 (445) Q Consensus 377 ~~~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~--~~~~~~~~~~~~~-~~~~~~d~~~~ 445 (445) +.++++++++|++|+||+++++|+++|+++|++||++|+++.... ..+..+..++..+ +++..+|+.++ T Consensus 427 e~ad~~~kl~g~iS~et~l~~l~~v~D~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~d~~~~~~~d~~e~ 498 (501) T protein:vir:27 427 EQVSILTGLGGQVSQETALSLSGLVESPNEELDKINKEVSEIDFKGYSNDFNEHVGKYTDEVKETHTDDFER 498 (501) T ss_pred HHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhHhhhcCccccccccccCCCCCCccccccc Confidence 999999999999999999999999999999999999998764433 2223332222222 22233333222 No 30 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=100.00 E-value=1.6e-91 Score=518.35 Aligned_cols=428 Identities=18% Similarity=0.231 Sum_probs=364.9 Q ss_pred ChHHHHHHHHH-HHHHHHHHHHHhcCCC-ccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCc Q lcl|NC_021326. 1 MIVRYIKQHLE-KLPEISIGQEYYEQRP-DIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTD 78 (445) Q Consensus 1 ~l~~~i~~~~~-~~~~~~~~~~yy~G~~-~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d 78 (445) +|+++|++|+. +.+||+++.+||+|+| ++..+.. .....++++|+++|||++||++.++|++|+|+++++++ T Consensus 44 ~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~------~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d 117 (502) T protein:vir:48 44 LLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGR------RKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDD 117 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc------ccccccccceeecchHHHHHHHHhhhhcccCeeEecCC Confidence 79999999985 5789999999999985 4544332 23445678899999999999999999999999999865 Q ss_pred h----HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEee Q lcl|NC_021326. 79 D----EVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKL 153 (445) Q Consensus 79 ~----~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~ 153 (445) + ...+.|+++|. |+++..+.+++++++++|++|+++|.|++|++++++++|.+++|+|+++..+++.+++|+|.. T Consensus 118 ~~~~~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~~vydd~~~~~~~~~ir~~~~ 197 (502) T protein:vir:48 118 NEDNSQNDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNR 197 (502) T ss_pred ccchhHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEE Confidence 4 35566777774 789999999999999999999999999999999999999999999999888899999999976 Q ss_pred ecc----eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHH Q lcl|NC_021326. 154 ENE----TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRL 229 (445) Q Consensus 154 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~ 229 (445) ... ..+++|+...++++... ..++.....+|+||.||||+|+|++.|.|+|+++++|||+||.++ T Consensus 198 ~~~~~~~~~~~iyt~~~i~~~~~~-----------~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~ 266 (502) T protein:vir:48 198 GTLQNAKDVVEIYTNQHIYTLDAS-----------DSFNEISVTPHAFGTVPITEFLNNADGIGDYETELYLIDLYDSAE 266 (502) T ss_pred eecCCcEEEEEEEeCCeEEEEEeC-----------CceeeccceecCCCccceEEecCCCCCCCchhhhHHHHHHHHHHH Confidence 433 35678888877766432 233455667899999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeecc---------CCCceeeEeccCChHHHHHHHHHHHHHHHH Q lcl|NC_021326. 230 SDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVS---------DNGGVDTIQVEVPVENSKKYLDELYQKIML 300 (445) Q Consensus 230 s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~ 300 (445) |++++.+++++.|+++++|......++....++..+++.+. ++++++|++++.+.+++++++++|.++|+. T Consensus 267 S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~ 346 (502) T protein:vir:48 267 SDTANHMSDMADAILAIYGDLALPQGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHV 346 (502) T ss_pred HHHHHHHHHhcCceeeeecCcccccccchhhhhhcceeeccccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHH Confidence 99999999999999999998776666666666666666553 357899999999999999999999999999 Q ss_pred HhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-----CcceEEEEeCCCCCCCH Q lcl|NC_021326. 301 FGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG-----EHKDVDISFNYNKVANT 375 (445) Q Consensus 301 ~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~-----~~~~i~v~f~~~~p~d~ 375 (445) +|++|+++++.+++++||+||++++++|.+|+.++++.|+.+|++++++++++++..+ +..+++++|++++|+|. T Consensus 347 ~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~ 426 (502) T protein:vir:48 347 FTNTPDMSDNHFSGNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESRLKITFTPNLPKSL 426 (502) T ss_pred HhCCCCcCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCH Confidence 9999999999999999999999999999999999999999999999999999987543 45578999999999999 Q ss_pred HHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHH-h-hhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 376 ELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNK-Q-LPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 376 ~~~~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~-~-~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++.++++++++|++|+||+++++|+++|+++|++||++|+++... . .+...+..+.+.++..+++.+++| T Consensus 427 ~e~a~~~~kl~g~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~d~~~e~~~~~~~ 498 (502) T protein:vir:48 427 YEQVSILNDLGGQVSQETALSLSGLVENPTEELDKINEESSKIDFKGYPSYFYDNVGKYTDEVKETHTDDFE 498 (502) T ss_pred HHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhhhcccccccccccccCCCccCCCCcCcC Confidence 999999999999999999999999999999999999999875332 2 222334444454554555555555 No 31 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=100.00 E-value=1.9e-91 Score=517.94 Aligned_cols=428 Identities=18% Similarity=0.227 Sum_probs=360.8 Q ss_pred ChHHHHHHHHH-HHHHHHHHHHHhcCCC-ccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCc Q lcl|NC_021326. 1 MIVRYIKQHLE-KLPEISIGQEYYEQRP-DIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTD 78 (445) Q Consensus 1 ~l~~~i~~~~~-~~~~~~~~~~yy~G~~-~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d 78 (445) +|+++|++|.. +.++|+++.+||.|+| .+..+.. .....++++|+++||+++||++.++|++|+|+++++++ T Consensus 43 ~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~~~------~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~~ 116 (501) T protein:vir:96 43 LLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGR------RKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDD 116 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCccc------cCccccccceeecchHHHHHHHHhhhhcccCeeEeeCC Confidence 79999999986 4689999999999985 4443322 23345678899999999999999999999999998865 Q ss_pred ----hHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEee Q lcl|NC_021326. 79 ----DEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKL 153 (445) Q Consensus 79 ----~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~ 153 (445) +...+.|+.+|+ |+++..+.+++++++++|++|+++|.|++|++++++++|.+++|+|++...+++.+++++|.. T Consensus 117 ~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~ 196 (501) T protein:vir:96 117 NDDNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNR 196 (501) T ss_pred ccchhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEe Confidence 345667777775 789999999999999999999999999999999999999999999999888999999999976 Q ss_pred ecc----eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHH Q lcl|NC_021326. 154 ENE----TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRL 229 (445) Q Consensus 154 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~ 229 (445) ... ..+++|+...++.+... .+.++....+|+||+||||+|+|++.|.|+|+++++|||+||+++ T Consensus 197 ~~~~~~~~~~~vyt~~~i~~~~~~-----------~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~ 265 (501) T protein:vir:96 197 GTLQSAKDVVEIYTDEHIYTLDAS-----------DDFNEISVTTHAFGTVPITEYLNNIDGIGDYETELYLIDLYDSAE 265 (501) T ss_pred ecCCCcEEEEEEEcCCcEEEEeeC-----------CCceeccccccCCCccceEEecCCccCCCchhhhHHHHHHHHHHH Confidence 443 35677877777766433 233455667899999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeecc---------CCCceeeEeccCChHHHHHHHHHHHHHHHH Q lcl|NC_021326. 230 SDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVS---------DNGGVDTIQVEVPVENSKKYLDELYQKIML 300 (445) Q Consensus 230 s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~ 300 (445) |++++.++++++|+++++|+.....+++...++..+++.++ .+++++|++++.+.++++.++++|++.|+. T Consensus 266 s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~ 345 (501) T protein:vir:96 266 SDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHI 345 (501) T ss_pred HHHHHHHHHhcCceeeeecccccCcccchhhhhhcCeeeecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHH Confidence 99999999999999999999877766666667776666653 345789999999999999999999999999 Q ss_pred HhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-----CcceEEEEeCCCCCCCH Q lcl|NC_021326. 301 FGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG-----EHKDVDISFNYNKVANT 375 (445) Q Consensus 301 ~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~-----~~~~i~v~f~~~~p~d~ 375 (445) +|++|+++++++++++||+||++++++|.+|+..+++.|+.+|++++++++++++..+ +..+++|+|++++|.|. T Consensus 346 ~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~n~ 425 (501) T protein:vir:96 346 FTNTPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSL 425 (501) T ss_pred HhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccceEEeCCCCCcCH Confidence 9999999999999999999999999999999999999999999999999999987643 45578999999999999 Q ss_pred HHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhh--hccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 376 ELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQL--PNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 376 ~~~~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~--~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++.++++++++|++|+||+++++|+++|+++|++||++|+++..... .+..+..+....+..+.+.+++| T Consensus 426 ~e~ad~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~e~~~d~~e 497 (501) T protein:vir:96 426 NEQVSILTGLGGQVSQETALSLSGLVESPNEELDKINKEMSEIDFKGYSNDFNEHVGKYTDEVKETHTDDFE 497 (501) T ss_pred HHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHhhccccccchhhcccccCCcCCCCCCCccc Confidence 99999999999999999999999999999999999999998754322 12222222222222223333333 No 32 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=100.00 E-value=6.4e-91 Score=515.07 Aligned_cols=416 Identities=20% Similarity=0.243 Sum_probs=362.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.++|++|..+.++|+++.+||+|+|+|+.++.+ ...++++||++||+++||++.++|++|+|++++++++. T Consensus 5 ~l~~~i~~~~~~~~r~~~l~~yy~g~~~il~~~~~-------~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~ 77 (429) T protein:vir:98 5 LLSELIQKHRSFNLSYSAYKQLYEGDHAILQQKQK-------EQYKPDNRLVVNFAKYIVDTFNGYFIGVPVQTSHENKQ 77 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccccccccccc-------ccCCCcceeecchHHHHHHHHhhhhcccCceeecCChH Confidence 99999999999999999999999999999876543 34567889999999999999999999999999999999 Q ss_pred HHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecc-ee Q lcl|NC_021326. 81 VIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE-TK 158 (445) Q Consensus 81 ~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~-~~ 158 (445) .++.|+.|++ |+++..+.+++++++++|++|+++|.+++|++++++++|.+++|+|++...+++.+++|+|..++. .. T Consensus 78 ~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~~~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~ 157 (429) T protein:vir:98 78 VSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDENAEAGITYLTPLEAFIVYDDSIRQKPLFAVRYFYNKGGVLE 157 (429) T ss_pred HHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCcEEEEEEcccceEEEEeCCCCCceEEEEEEEEecCceEE Confidence 9999999985 689999999999999999999999999999999999999999999998888889999999976544 34 Q ss_pred EEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 159 VEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKD 238 (445) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~ 238 (445) ..+|+...+++|.... .++......+|++|+||||+|+|+++|+|+|+++++|+|+||+++|++++.+++ T Consensus 158 ~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liD~~d~~~s~~~~~~~~ 227 (429) T protein:vir:98 158 GSYSDASNITYFKDGE----------KGIEIGESEPHPFDGVPMIEYVENEERQSLLASVVTLINAFNKAISEKANDVEY 227 (429) T ss_pred EEEEeCceEEEEEecC----------CceEecccccccCCccceEEecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5667776666654322 223444567899999999999999999999999999999999999999999999 Q ss_pred hcCCeeEEecCCcccchhHHHhhhhCceeeccC----CCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccC Q lcl|NC_021326. 239 SNELTYVLTNYDDQELPEFKRLLRYYGAIKVSD----NGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGS 314 (445) Q Consensus 239 ~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~ 314 (445) +++|+++++|...+. ++...++..+++.+++ +++++|++++.+.++++++++.|.+.|+.+|++|+++++++ | T Consensus 228 ~~~p~~~i~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g 304 (429) T protein:vir:98 228 FADAYLKILGAELDD--ETLKSLRDTRIINLKDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTAMVANISDESF-G 304 (429) T ss_pred hcCceeeeecCCCCc--chhhhHhhCceeeccCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCcccc-c Confidence 999999999987643 4455667778888754 35789999999999999999999999999999999998877 6 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC---CcceEEEEeCCCCCCCHHHHHHHHHHHhccCCh Q lcl|NC_021326. 315 APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG---EHKDVDISFNYNKVANTELQVQTAQQSMGIVSH 391 (445) Q Consensus 315 ~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~---~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~ 391 (445) |+||+||++++++|.+|+.++++.|+.+|++++++++++++..+ +..+++|+|++++|.|.++.++++++++|++|+ T Consensus 305 n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~v~f~~~~p~~~~~~a~~~~kl~g~is~ 384 (429) T protein:vir:98 305 TASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIGPKDWIGIKYKFTRNLPANLLEESQIAGNLAGIVSE 384 (429) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHHHHHHHHHhccCch Confidence 79999999999999999999999999999999999999987654 445789999999999999999999999999999 Q ss_pred HHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCC Q lcl|NC_021326. 392 ETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSND 441 (445) Q Consensus 392 et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d 441 (445) ||+++++|+++|+++|++||++|+++..+..+...... +.+++.| T Consensus 385 et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~-----~~~~~~~ 429 (429) T protein:vir:98 385 ETQVGVLSIVENPQKEIERKNSDKSTLISRQAGGLNGQ-----NTTTILE 429 (429) T ss_pred HHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhcCC-----CCCCCCC Confidence 99999999999999999999999988766543322211 1111111 No 33 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=100.00 E-value=2.9e-90 Score=511.44 Aligned_cols=427 Identities=22% Similarity=0.287 Sum_probs=364.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCc---cccccc-------cccccccccccccccccccchHHHHHHHHHhhhhcc Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPD---IVKEPK-------PVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK 70 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~---i~~~~~-------~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~ 70 (445) +|+++|+.|+.+..++..+.+||+|.++ +..++. ..+........++++||++||++.||++.++|++|+ T Consensus 19 ~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~yl~g~ 98 (474) T protein:vir:94 19 HIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIVDTRVGYLHGV 98 (474) T ss_pred HHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHHHHHhHhhheecc Confidence 8999999999999999999999999754 222221 112334556677899999999999999999999999 Q ss_pred CeeeccC-----chHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCce Q lcl|NC_021326. 71 PIAFKHT-----DDEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEEL 144 (445) Q Consensus 71 ~~~~~~~-----d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~ 144 (445) |++++++ ++...+.|++|++ |+++.++.+++++++++|++|+++|.|++|++++++++|.+++|+|++ ..++ T Consensus 99 pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~~~~i~p~~~~~v~d~--~~~~ 176 (474) T protein:vir:94 99 PVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDIRIKNIDPYNVIFVGDN--ILEP 176 (474) T ss_pred ceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCeeEEEEEcccceEEEEcC--CCce Confidence 9999874 4567788888885 689999999999999999999999999999999999999999999986 4568 Q ss_pred EEEEEEEeeecce------eEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHH Q lcl|NC_021326. 145 EAFIRMYKLENET------KVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMY 218 (445) Q Consensus 145 ~~~v~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v 218 (445) .+++++|...... .+++|+...++.|.... ...+......+|+||.||||+|+|++.|.|+|+++ T Consensus 177 ~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~---------~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v 247 (474) T protein:vir:94 177 TYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEG---------IDALQEVGRYEHLFDYNPLFGVPNNKEMIGDAEKV 247 (474) T ss_pred EEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecC---------CCcccccccccCCCCccceEEecCCCCCCCchHHH Confidence 8899998765432 35566666655554321 22344555678999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeec-cCCCceeeEeccCChHHHHHHHHHHHHH Q lcl|NC_021326. 219 KTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKV-SDNGGVDTIQVEVPVENSKKYLDELYQK 297 (445) Q Consensus 219 ~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~~~~~~~~~~~~i~~l~~~ 297 (445) ++|||+||.++|++++.++++++|+++++|+..+. +....++..+++.+ +++++++|++++.+.+++++++++|+++ T Consensus 248 ~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~--~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~ 325 (474) T protein:vir:94 248 IHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSE--EMIQETQKSGAFELFDKDMDVKYLTKDVNDTMIENHLDRIEKN 325 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCc--hhhhhhhhcceeEecCCCCceeEEeccCCHHHHHHHHHHHHHH Confidence 99999999999999999999999999999987654 33444566666655 7789999999999999999999999999 Q ss_pred HHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC------CcceEEEEeCCCC Q lcl|NC_021326. 298 IMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG------EHKDVDISFNYNK 371 (445) Q Consensus 298 i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~------~~~~i~v~f~~~~ 371 (445) |+.+|++|+++++++++++||+||++++++|.+||.++++.|+.+|++++++++++++..+ ++.+++++|++++ T Consensus 326 I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~ 405 (474) T protein:vir:94 326 IMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNI 405 (474) T ss_pred HHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCC Confidence 9999999999999999999999999999999999999999999999999999999987642 4567899999999 Q ss_pred CCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 372 VANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 372 p~d~~~~~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) |.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++..+..++...++.++..++ .++| T Consensus 406 p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~~-----~~s~ 474 (474) T protein:vir:94 406 PVNKLEESQVLINLKGQVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDIDEGDANDKSQN-----NQSE 474 (474) T ss_pred CCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccCCCcCCCCcc-----ccCC Confidence 99999999999999999999999999999999999999999999998888777665554444433 3333 No 34 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=100.00 E-value=2.9e-90 Score=511.44 Aligned_cols=427 Identities=22% Similarity=0.287 Sum_probs=364.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCc---cccccc-------cccccccccccccccccccchHHHHHHHHHhhhhcc Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPD---IVKEPK-------PVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK 70 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~---i~~~~~-------~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~ 70 (445) +|+++|+.|+.+..++..+.+||+|.++ +..++. ..+........++++||++||++.||++.++|++|+ T Consensus 19 ~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~yl~g~ 98 (474) T protein:vir:10 19 HIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIVDTRVGYLHGV 98 (474) T ss_pred HHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHHHHHhHhhheecc Confidence 8999999999999999999999999754 222221 112334556677899999999999999999999999 Q ss_pred CeeeccC-----chHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCce Q lcl|NC_021326. 71 PIAFKHT-----DDEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEEL 144 (445) Q Consensus 71 ~~~~~~~-----d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~ 144 (445) |++++++ ++...+.|++|++ |+++.++.+++++++++|++|+++|.|++|++++++++|.+++|+|++ ..++ T Consensus 99 pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~~~~i~p~~~~~v~d~--~~~~ 176 (474) T protein:vir:10 99 PVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDIRIKNIDPYNVIFVGDN--ILEP 176 (474) T ss_pred ceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCeeEEEEEcccceEEEEcC--CCce Confidence 9999874 4567788888885 689999999999999999999999999999999999999999999986 4568 Q ss_pred EEEEEEEeeecce------eEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHH Q lcl|NC_021326. 145 EAFIRMYKLENET------KVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMY 218 (445) Q Consensus 145 ~~~v~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v 218 (445) .+++++|...... .+++|+...++.|.... ...+......+|+||.||||+|+|++.|.|+|+++ T Consensus 177 ~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~---------~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v 247 (474) T protein:vir:10 177 TYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEG---------IDALQEVGRYEHLFDYNPLFGVPNNKEMIGDAEKV 247 (474) T ss_pred EEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecC---------CCcccccccccCCCCccceEEecCCCCCCCchHHH Confidence 8899998765432 35566666655554321 22344555678999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeec-cCCCceeeEeccCChHHHHHHHHHHHHH Q lcl|NC_021326. 219 KTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKV-SDNGGVDTIQVEVPVENSKKYLDELYQK 297 (445) Q Consensus 219 ~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~~~~~~~~~~~~i~~l~~~ 297 (445) ++|||+||.++|++++.++++++|+++++|+..+. +....++..+++.+ +++++++|++++.+.+++++++++|+++ T Consensus 248 ~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~--~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~ 325 (474) T protein:vir:10 248 IHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSE--EMIQETQKSGAFELFDKDMDVKYLTKDVNDTMIENHLDRIEKN 325 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCc--hhhhhhhhcceeEecCCCCceeEEeccCCHHHHHHHHHHHHHH Confidence 99999999999999999999999999999987654 33444566666655 7789999999999999999999999999 Q ss_pred HHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC------CcceEEEEeCCCC Q lcl|NC_021326. 298 IMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG------EHKDVDISFNYNK 371 (445) Q Consensus 298 i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~------~~~~i~v~f~~~~ 371 (445) |+.+|++|+++++++++++||+||++++++|.+||.++++.|+.+|++++++++++++..+ ++.+++++|++++ T Consensus 326 I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~ 405 (474) T protein:vir:10 326 IMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNI 405 (474) T ss_pred HHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCC Confidence 9999999999999999999999999999999999999999999999999999999987642 4567899999999 Q ss_pred CCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 372 VANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 372 p~d~~~~~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) |.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++..+..++...++.++..++ .++| T Consensus 406 p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~~-----~~s~ 474 (474) T protein:vir:10 406 PVNKLEESQVLINLKGQVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDIDEGDANDKSQN-----NQSE 474 (474) T ss_pred CCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccCCCcCCCCcc-----ccCC Confidence 99999999999999999999999999999999999999999999998888777665554444433 3333 No 35 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=100.00 E-value=1.7e-89 Score=507.24 Aligned_cols=424 Identities=18% Similarity=0.203 Sum_probs=356.6 Q ss_pred ChHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLE-KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~-~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) +|.++|++|.. +.++|+++.+||+|+|+|++++.. ..++++|+++||+++||++.++|++|+|++++++++ T Consensus 29 ~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~--------~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~~d 100 (470) T protein:vir:99 29 ELLGFIAYNETVLKPRYRENMKLYLGKHKILTAPEK--------ETGADNRIVVNSAKYVVDVYNGYFCGIEPKLALLND 100 (470) T ss_pred HHHHHHHHHHHhhHHHHHHHHHHhccccccccCccc--------ccCCcceeecchHHHHHHHHhhhhccCCeeEeeCCc Confidence 89999999976 558999999999999998876532 346788999999999999999999999999988654 Q ss_pred -HHHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecce Q lcl|NC_021326. 80 -EVIKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENET 157 (445) Q Consensus 80 -~~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~ 157 (445) ...+.+++++ .|+++..+.+++++++++|++|+++|.+++|++++++++|.+++|+|++....++.+++|+|..+... T Consensus 101 ~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~ 180 (470) T protein:vir:99 101 SSKIDEIARWNRQENFFDTINEISKQCDIFGRSIASIYQGEDARPHLMYSSPNHAFIIYDDTVQRQPLAFVHYQIDNSNN 180 (470) T ss_pred hhHHHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeEEEEEEccceeEEEEcCCCCcceEEEEEEEEEecCC Confidence 4556677766 57899999999999999999999999999999999999999999999998888899999999876554 Q ss_pred e----EEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 158 K----VEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLS 233 (445) Q Consensus 158 ~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~ 233 (445) . ..+|+...++.+.... ...........+|++|.||||+|+|++.|+|+|+++++|||+||+++|+++ T Consensus 181 ~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~s~~~ 252 (470) T protein:vir:99 181 WTDAYGVIQYADKFYKFKGYD--------IEEDTNAAGYAINPYGLVPAVEFFENEERQGIFDSIKTLINALDKVISQKA 252 (470) T ss_pred eeEEEEEEEecCeEEEEEecc--------cccccccccccccCCCccceEeecCCCCCCcchHhHHHHHHHHHHHHHHHH Confidence 3 3445555444433221 122333445678999999999999999999999999999999999999999 Q ss_pred HHHHHhcCCeeEEecCCccc--chhHHHhhhhCceeecc-----CCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccc Q lcl|NC_021326. 234 NTFKDSNELTYVLTNYDDQE--LPEFKRLLRYYGAIKVS-----DNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVD 306 (445) Q Consensus 234 ~~~~~~~~~~l~~~g~~~~~--~~~~~~~~~~~~~~~~~-----~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~ 306 (445) +.++++++|+++++|+..+. .+++...++..+++.++ ++++++|++++.+.+.+++++++|.+.|+.+|++|+ T Consensus 253 ~~~~~~~~~~~~i~g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~ 332 (470) T protein:vir:99 253 NQVEYFDNAYMYMIGFKLPEDDEGNPKFDFKNNRVLYVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPN 332 (470) T ss_pred HHHHHhcCceeeeecCCcccccccchhhhhhhcceeeecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCcc Confidence 99999999999999986443 23445556666666553 467899999999999999999999999999999999 Q ss_pred cccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC----CcceEEEEeCCCCCCCHHHHHHHH Q lcl|NC_021326. 307 FSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG----EHKDVDISFNYNKVANTELQVQTA 382 (445) Q Consensus 307 ~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~----~~~~i~v~f~~~~p~d~~~~~~~~ 382 (445) ++++++++++||+||++++++|.+|+.++++.|+.+|++++++++++++... +..+++++|++++|+|.++.++++ T Consensus 333 ~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~ 412 (470) T protein:vir:99 333 IQDKNFAGNSSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQELWSELDFKFTRNLPEDMASAIDNA 412 (470) T ss_pred ccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccceEEeCCCCCcCHHHHHHHH Confidence 9999999999999999999999999999999999999999999999987653 456889999999999999999999 Q ss_pred HHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCC Q lcl|NC_021326. 383 QQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSND 441 (445) Q Consensus 383 ~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d 441 (445) ++++|++|+||+++++|++ |+++|++||++|+++.++...............++.+++ T Consensus 413 ~kl~giis~et~l~~l~~v-d~~~E~eri~~E~~~~~~~~~~~~~~~d~~~~d~~~ee~ 470 (470) T protein:vir:99 413 KNAEGIVSKKTQLGMIPDI-EPDAEMKQIAKEKADAIKQTQQLSMPIDILKRDNNAEEE 470 (470) T ss_pred HHHhccCCHHHHHHhCCCC-CHHHHHHHHHHHHHHHHHHHHhhcCCCCcCCCCCCccCC Confidence 9999999999999999999 799999999999988776655544433332222222222 No 36 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=100.00 E-value=7e-90 Score=509.37 Aligned_cols=429 Identities=18% Similarity=0.240 Sum_probs=357.5 Q ss_pred ChHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLE-KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~-~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) .|.+||++|.. +.++|+++.+||+|+|+++.++.. ......++++|+++||++.||++.++|++|+|++++++++ T Consensus 26 ~i~~li~~~~~~~~~r~~~l~~YY~g~~~~i~~~~~----~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~G~p~~~~~~d~ 101 (506) T protein:vir:94 26 KIMKFITHHFNYQRPRLEMLDDYYQGYNLKILDKQS----RRHEDGKADHRATHSFAKYIADFQTSYSVGNPINVKLPDD 101 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc----ccccccCCcceeecchHHHHHHHhhhhhcccCceeecCcc Confidence 68889999865 678999999999999976544322 2334557889999999999999999999999999999999 Q ss_pred HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeeccee Q lcl|NC_021326. 80 EVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETK 158 (445) Q Consensus 80 ~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~ 158 (445) ..++.|+.|++ |+++..+.++++.++++|++|++||+|++|++++++++|.+++|+|++...+++.+++|+|....... T Consensus 102 ~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~~~~i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~~~~~ 181 (506) T protein:vir:94 102 GSNSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGEDNEEHLAKLDPLDTFVIYSTDVDPKPIMAVRYHQIELVDD 181 (506) T ss_pred hHHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEecCCCCCceEEEEEEEeeeeccC Confidence 99999999985 78999999999999999999999999999999999999999999999988889999999997654332 Q ss_pred ---------EEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHH Q lcl|NC_021326. 159 ---------VEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRL 229 (445) Q Consensus 159 ---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~ 229 (445) .++|+...+..+. ....++......+|+||.||||+|+|++.|.|+|+++++|||+||.++ T Consensus 182 ~~~~~~~~~~~~yt~~~~~~~~----------~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~~~~liDa~d~~~ 251 (506) T protein:vir:94 182 NQVSTINYVPETWTADTYTLYN----------PTPIMGKMQVDTTKPITTFPVVEFKNSNFRLGDFENVLPLIDLYDAAQ 251 (506) T ss_pred CceeEEEEEEEEEeCceEEEec----------cccCccceeccccccCCccceEEecCCCCCCCchhhhHHHHHHHHHHH Confidence 2233333333221 122233445567899999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCeeEEecCCcc------------------------cchhHHHhhhhCceeeccC---------CCcee Q lcl|NC_021326. 230 SDLSNTFKDSNELTYVLTNYDDQ------------------------ELPEFKRLLRYYGAIKVSD---------NGGVD 276 (445) Q Consensus 230 s~~~~~~~~~~~~~l~~~g~~~~------------------------~~~~~~~~~~~~~~~~~~~---------~~~~~ 276 (445) |++++.+++++.|+++++|.... ....+...++..+++.+++ +++++ T Consensus 252 S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 331 (506) T protein:vir:94 252 SDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAK 331 (506) T ss_pred HHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHHHhhhhhcCeeeecccccccCccccccce Confidence 99999999999999999986422 1222334455666666654 35799 Q ss_pred eEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_021326. 277 TIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI 356 (445) Q Consensus 277 ~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~ 356 (445) |++++.+.+++++++++|.+.|+.+|++|+++++++++++||+||++++.+|.+||.++++.|+++|++++++++++++. T Consensus 332 ~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~ 411 (506) T protein:vir:94 332 YINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFASNSSGVAMQYKVLGTVELASTKRRMFERGLYARYQIISDIENS 411 (506) T ss_pred eeeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999875 Q ss_pred CC-----CcceEEEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCC Q lcl|NC_021326. 357 KG-----EHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGAD 431 (445) Q Consensus 357 ~~-----~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 431 (445) .. +..+++|+|++++|.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++..+........+. T Consensus 412 ~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~lp~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~- 490 (506) T protein:vir:94 412 IHGDWTFDPQELTFTFRDNLPADNISQIKALVQAGATLPQKYLYQQLPGVTNPQDIVDMMKEQSANGDYSFDQNGVISN- 490 (506) T ss_pred cCCccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHhhcchhhcCCCc- Confidence 32 455789999999999999999999999999999999999999999999999999999876555433322222 Q ss_pred CCCCCCCCCCcCCC Q lcl|NC_021326. 432 GAQQKERSNDKQSE 445 (445) Q Consensus 432 ~~~~~~~~~d~~~~ 445 (445) ..+..+.++++.| T Consensus 491 -~~~~~~~~~~~~~ 503 (506) T protein:vir:94 491 -DGQTNTTATQTDE 503 (506) T ss_pred -ccCcccccccccc Confidence 2233344444444 No 37 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=100.00 E-value=1.1e-89 Score=508.38 Aligned_cols=417 Identities=20% Similarity=0.227 Sum_probs=358.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) .|.++|++|..++++|+++.+||+|+|+|++++.. ...++++|+++||+++||++.++|++|+|++++++++. T Consensus 21 ~i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~-------~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~d~~ 93 (453) T protein:vir:73 21 VVNDFMKKHQEEVERYEYLGNMYKGIMEISSQKAK-------DSWKPDNRLTNNFAKYIVDTFVGYFNGIPIKKTHDDKS 93 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccchhcCCCC-------CccCccceeecchHHHHHHHhhhhhcccCceeecCChH Confidence 89999999999999999999999999998765432 34567889999999999999999999999999999999 Q ss_pred HHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeee-ccee Q lcl|NC_021326. 81 VIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLE-NETK 158 (445) Q Consensus 81 ~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~-~~~~ 158 (445) .++.|+.|++ |++...+.+++++++++|++|+++|.|++|.+++++++|.+++|+|+++....+.++++++... ...+ T Consensus 94 ~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~ 173 (453) T protein:vir:73 94 VLEAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNESTESEVIYCSPLNVFMVYDDSIKQKPLFAVYYGFDEEGNLS 173 (453) T ss_pred HHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccceEEEEeCCCCceeEEEEEEEEecCceEE Confidence 9999999985 7899999999999999999999999999999999999999999999998888889999877543 3456 Q ss_pred EEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 159 VEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKD 238 (445) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~ 238 (445) +++|+...++++..... ++......+|+||.||||+|+|++.|.|+|+++++|||+||+++|++++.+++ T Consensus 174 ~~vyt~~~i~~~~~~~~----------~~~~~~~~~~~~g~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~ 243 (453) T protein:vir:73 174 GTVYTLLETISITGKAG----------EVKFGESTYNVYSDLPIVEYNFNEERQSIFEPVHSLINSYNKVTSEKANDVEY 243 (453) T ss_pred EEEEeCCeEEEEEecCC----------ceEEccceeccCCceeEEEecCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHH Confidence 78898888777654322 23444567899999999999999999999999999999999999999999999 Q ss_pred hcCCeeEEecCCcccchhHHHhhhhCceee-----------ccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCcccc Q lcl|NC_021326. 239 SNELTYVLTNYDDQELPEFKRLLRYYGAIK-----------VSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDF 307 (445) Q Consensus 239 ~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~ 307 (445) +++|+++++|+..+. +....++..+++. .+.+++++|++++.+.++++++++.|.+.|+.+|++|++ T Consensus 244 ~~~~~l~~~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~ 321 (453) T protein:vir:73 244 FSDQYLVFLGAEVDE--EDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAANI 321 (453) T ss_pred hccceeeeecCCCCc--hhhhcccccccccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCccc Confidence 999999999986543 2222233333222 234577999999999999999999999999999999999 Q ss_pred ccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC---CcceEEEEeCCCCCCCHHHHHHHHHH Q lcl|NC_021326. 308 SSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG---EHKDVDISFNYNKVANTELQVQTAQQ 384 (445) Q Consensus 308 ~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~---~~~~i~v~f~~~~p~d~~~~~~~~~~ 384 (445) +++.+ |++||+|+++++.+|.+||+++++.|+.+|++++++++++++..+ +..+++|+|++++|.|.++.++++++ T Consensus 322 ~~~~~-gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~k 400 (453) T protein:vir:73 322 SDENF-GNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNASNKDAWKDIEYTFTRNEPKDIKEQAETANI 400 (453) T ss_pred Ccccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCCCHHHHHHHHHH Confidence 98887 679999999999999999999999999999999999999876544 56788999999999999999999999 Q ss_pred HhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCC Q lcl|NC_021326. 385 SMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKE 437 (445) Q Consensus 385 ~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 437 (445) ++|++|+||+++++|+++|+++|++||++|+++..+..........+....+= T Consensus 401 ~~giis~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 453 (453) T protein:vir:73 401 LKGITSEETALSVISVIPDVQAEMEKIKKKKLLQLSLTRTSNLVRMKQMRGNL 453 (453) T ss_pred HhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCCcchhhhcCC Confidence 99999999999999999999999999999999877665543221111111111 No 38 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=100.00 E-value=1.3e-88 Score=502.41 Aligned_cols=428 Identities=18% Similarity=0.281 Sum_probs=364.1 Q ss_pred ChHHHHHHHH-HHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHL-EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~-~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) .|+++|++|. .++++|+++.+||+|+|+.+..+.. .......++++|+++||++.||++.++|++|+|++++++++ T Consensus 34 ~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~---~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~d~ 110 (481) T protein:vir:10 34 NLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGER---RLQKYGDKADHRAVHNYAKYVSRFIVGYLTGNPITITHQDN 110 (481) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCcc---ccccccccccceeecchHHHHHHHHHhhhccCCceEecCCh Confidence 8999999986 6789999999999999865433222 22344556788999999999999999999999999999999 Q ss_pred HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecc-- Q lcl|NC_021326. 80 EVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE-- 156 (445) Q Consensus 80 ~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~-- 156 (445) ...+.++++|+ |+++.++.++++.++++|++|+++|.+++|++++++++|++++|+|++...+++.+++++|...+. T Consensus 111 ~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~ 190 (481) T protein:vir:10 111 QTNDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDFEDRDTFKVLDPKSTFVVYDQTLDKKVVAGVRYFEKQDKDK 190 (481) T ss_pred hHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCeEEEEEEcccceEEEEcCCCCCceEEEEEEEEEeeCCC Confidence 99999999885 689999999999999999999999999999999999999999999999888899999999976543 Q ss_pred ---eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 157 ---TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLS 233 (445) Q Consensus 157 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~ 233 (445) .++++|+...++++..... ++......+|+||+||||+|+|++.|+|+|+++++|||+||+++|+++ T Consensus 191 ~~~~~~~~y~~~~i~~~~~~~~----------~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~lida~~~~~s~~~ 260 (481) T protein:vir:10 191 VPVQHVEVYTTDKIYYIEIKGG----------TYHRVEEVEHYYNDVPIIEYLNDQFKQGDFENVIALIDLYDSAQSDTA 260 (481) T ss_pred ceEEEEEEEecCeEEEEEecCC----------ceeecccccccCCceeEEEeecCCCCCCchhhHHHHHHHHHHHHHHHH Confidence 3567888888777654432 233445678999999999999999999999999999999999999999 Q ss_pred HHHHHhcCCeeEEecCCcccchhHHHhhhhCceeec---------cCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCc Q lcl|NC_021326. 234 NTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKV---------SDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQA 304 (445) Q Consensus 234 ~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~ 304 (445) +.+++++.|+++++|......+. ...++..+++.+ +++++++|++++.+.+++++++++|++.|+++|++ T Consensus 261 ~~~~~~~~~~~~~~g~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~ 339 (481) T protein:vir:10 261 NYMTDLNDAMLAIIGNVDLDSED-AKAFRDANMIHLEPGTNANGSEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNT 339 (481) T ss_pred HHHHHhcCceeEeecCcCCCccc-hhhhhhccceeccccccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCC Confidence 99999999999999975443332 222333333332 34578999999999999999999999999999999 Q ss_pred cccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC----CcceEEEEeCCCCCCCHHHHHH Q lcl|NC_021326. 305 VDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG----EHKDVDISFNYNKVANTELQVQ 380 (445) Q Consensus 305 p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~----~~~~i~v~f~~~~p~d~~~~~~ 380 (445) |+++++.+++++||+|+++++++|..|++++++.|+.+|++++++++++++... +..+++++|++++|+|.++.++ T Consensus 340 p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~a~ 419 (481) T protein:vir:10 340 PDLNDEQFSGVQSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQHNYAELTITFTPNLPKSMMESIN 419 (481) T ss_pred ccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccceeeEEeCCCCCcCHHHHHH Confidence 999999999999999999999999999999999999999999999999987654 4567899999999999999999 Q ss_pred HHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCC Q lcl|NC_021326. 381 TAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQS 444 (445) Q Consensus 381 ~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 444 (445) ++++++|++|.||+++++|+++|+++|++||++|+++..+..+....+ +.....++.+|.+| T Consensus 420 ~~~kl~g~is~et~~~~l~~i~d~~~E~~ri~~E~~~~~~~~~~~~~~--~~~~~~~~~dd~~g 481 (481) T protein:vir:10 420 AFNALSGGVSESTRLSLLDFIDNPKEELEKMQEEEAQREKQADKRGYG--EAFENHLNVDDSNG 481 (481) T ss_pred HHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhhhccCC--ccCCCCCCCCCCCC Confidence 999999999999999999999999999999999998877765433222 23333444455555 No 39 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=100.00 E-value=1.9e-88 Score=501.47 Aligned_cols=421 Identities=17% Similarity=0.225 Sum_probs=353.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch- Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD- 79 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~- 79 (445) ||..++ .++.+||+++.+||+|+|+++.++.. .....++++||++||++.||++.++|++|+|++++++++ T Consensus 1 ~~~~~~---~~~~~r~~~l~~yy~g~~~~~~~~~~-----~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~ 72 (440) T protein:vir:95 1 MLAAFL---GSQKQRLAILASYAQGDNFSILSGHR-----RLDDEKADYRVRHKWGGYISSFATGYVIGNPVSIGVMEGG 72 (440) T ss_pred ChhhHH---HHHHHHHHHHHHHhccCCcccccccc-----cccccCCcceeecchHHHHHHhhhhheeccCceEeeCCCc Confidence 443333 34678899999999999998765433 234456788999999999999999999999999987553 Q ss_pred --HHHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecc Q lcl|NC_021326. 80 --EVIKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE 156 (445) Q Consensus 80 --~~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~ 156 (445) +..+.|+++| .|+++..+.+++++++++|++|+++|.|++|++++++++|++++|+|++...+++.+++++|..++. T Consensus 73 ~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~i~~~~~~~~ 152 (440) T protein:vir:95 73 SADQLSTIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDKVDRVVLISPLEMFVIRDLTVEQNIIAAVHLPIYADK 152 (440) T ss_pred cHHHHHHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEecCc Confidence 4455777776 5789999999999999999999999999999999999999999999999888899999999999999 Q ss_pred eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 157 TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTF 236 (445) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~ 236 (445) .++++|+...+++|..... ...+.......+|+||.||||+|+|++.|.|+|+++++|||+||.++|++++.+ T Consensus 153 ~~~~vyt~~~~~~~~~~~~-------~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~lida~~~~~s~~~~~~ 225 (440) T protein:vir:95 153 VNMTVYTKDKVITYKPYSN-------NSVRLVVDDVKKHSYNDVPVVEWWNNRFRMGDYESEISLIDAYDAGQSDTANYM 225 (440) T ss_pred eEEEEEeCCeEEEEEEecC-------CccceeecceeeccCceeeEEEeeCCCCCCCchhhhHHHHHHHHHHHHHHHHHH Confidence 9999999999887764432 123445556678999999999999999999999999999999999999999999 Q ss_pred HHhcCCeeEEecCCccc--chhHHHhhhhCceeec---------cCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCcc Q lcl|NC_021326. 237 KDSNELTYVLTNYDDQE--LPEFKRLLRYYGAIKV---------SDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAV 305 (445) Q Consensus 237 ~~~~~~~l~~~g~~~~~--~~~~~~~~~~~~~~~~---------~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p 305 (445) +++++|+++++|..... .++....++..+++.+ +++++++|++++.+.+++++++++|.+.|+.+|++| T Consensus 226 ~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p 305 (440) T protein:vir:95 226 SDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLFLKTGISTTGQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIP 305 (440) T ss_pred HHhhcceeeeecccccCCCCccchhhhhhccceecccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCc Confidence 99999999999964321 2233333444443332 456789999999999999999999999999999999 Q ss_pred ccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC----CcceEEEEeCCCCCCCHHHHHHH Q lcl|NC_021326. 306 DFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG----EHKDVDISFNYNKVANTELQVQT 381 (445) Q Consensus 306 ~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~----~~~~i~v~f~~~~p~d~~~~~~~ 381 (445) +++++.+++++||+||++++++|.+|+.++++.|+++|++++++++++++... +..+++++|++++|+|.++.+++ T Consensus 306 ~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~i~f~~~~p~~~~~~ad~ 385 (440) T protein:vir:95 306 NLDDDRFNSTSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAINGPVIEANKLTFTFHPNIPQDVWTEIKA 385 (440) T ss_pred ccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccceEEeCCCCCCCHHHHHHH Confidence 99999999999999999999999999999999999999999999999876543 56789999999999999999999 Q ss_pred HHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCC Q lcl|NC_021326. 382 AQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKER 438 (445) Q Consensus 382 ~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 438 (445) +++++|++|+||+++++|+++ +++|++||++|+++.+.......+.. +++.++.+ T Consensus 386 ~~kl~g~iS~et~~~~l~~~d-~~~E~~ri~~E~~~~~~~~~~~~~~~-~~~~~~~e 440 (440) T protein:vir:95 386 YIEAGGEISQETLMENASFTD-YKTEHSRILKQGGSSDLEIGQIVGDA-DVGQADTE 440 (440) T ss_pred HHHHhccCcHHHHHHhCCCCC-cHHHHHHHHHHHHHhhhhHHhhccCC-CCCCcCCC Confidence 999999999999999999985 46899999999987666654433322 22222111 No 40 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=100.00 E-value=2.8e-88 Score=500.55 Aligned_cols=441 Identities=21% Similarity=0.289 Sum_probs=355.4 Q ss_pred ChHHHHHHH--HHHHHHHHHHHHHhcCCCcccccccccc---ccccccccccccccccchHHHHHHHHHhhhhccCeeec Q lcl|NC_021326. 1 MIVRYIKQH--LEKLPEISIGQEYYEQRPDIVKEPKPVD---ATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFK 75 (445) Q Consensus 1 ~l~~~i~~~--~~~~~~~~~~~~yy~G~~~i~~~~~~~~---~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~ 75 (445) +|.+.+..| ..+++++.++++||+|+|+|+.++.... .....+..++++||++||++.||++.++||+|+||+|+ T Consensus 16 ~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~Ivd~~~~yl~G~Pv~~~ 95 (537) T protein:vir:78 16 LLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTELVDQLAQYLLSNGVEVK 95 (537) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHHHHHHhhhhcccCceee Confidence 555544444 3578999999999999999998886643 23345667899999999999999999999999999999 Q ss_pred cCch---HHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEe Q lcl|NC_021326. 76 HTDD---EVIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYK 152 (445) Q Consensus 76 ~~d~---~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~ 152 (445) ++++ +..+.|+.+++++++..+.+++++++++|++|+++|.|++|++++++++|.++||+|++ ..++.+++|+|. T Consensus 96 ~~d~~~~e~~~~l~~~~~~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~~~~~i~p~~~~pv~d~--~~~~~~~~~~y~ 173 (537) T protein:vir:78 96 VKDEDNTQLDEILQEYFDEDFQATIDTLVTNASKKGFEGIFARTTSEGKLKFQTVDGLTLIPVFDD--YGVLKMIIRWYS 173 (537) T ss_pred cCcchhHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeeEEEeeecCCCceEEEEEccceeEEEEcC--CCCceeEEEEEe Confidence 8654 45667788888899999999999999999999999999999999999999999999997 467888888875 Q ss_pred ee----------cceeEEEEecceEEEEEEecceeeec----------------------cccccccccccccccccccc Q lcl|NC_021326. 153 LE----------NETKVEYWDKITVNYYVYENGSLIPD----------------------YSNNLENSKTHFSTGSWGKI 200 (445) Q Consensus 153 ~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~----------------------~~~~~~~~~~~~~~~~~g~i 200 (445) .. ...++++|+...+++|.......... .............+|+||.| T Consensus 174 ~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~i 253 (537) T protein:vir:78 174 EIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKF 253 (537) T ss_pred eeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeeccccccccccccccccccccCCcce Confidence 32 22468899999999988765432211 11122334455678999999 Q ss_pred ceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeecc-CCCceeeEe Q lcl|NC_021326. 201 PFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVS-DNGGVDTIQ 279 (445) Q Consensus 201 Pvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~ 279 (445) |||+|+|++.|.|+|+++++|||+||.++|++++.++++++|+++++|+..+..+++...++..+++.++ ++++|+|++ T Consensus 254 Pvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~~~~~l~~~~~i~v~~d~~~v~~l~ 333 (537) T protein:vir:78 254 PFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDKLRQNIKAKKMIGVNGDNAGMEIQT 333 (537) T ss_pred eEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCccchhHHHHHhhcCceeecCCCCceeEEE Confidence 9999999999999999999999999999999999999999999999999888888888889988888886 578899999 Q ss_pred ccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC- Q lcl|NC_021326. 280 VEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG- 358 (445) Q Consensus 280 ~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~- 358 (445) ++.+.++++.++++|++.||.+|++|+.+. .++||+||+||++++++|.+||..+++.|+++|++++++|+++++..+ T Consensus 334 ~~~~~~~~e~~ld~L~~~I~~~s~~~~~~~-~~~gn~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~ 412 (537) T protein:vir:78 334 VSIPYEARKAKMDIDVENIYRSGMGFNSTA-VGDGNVTNVVIKSRYTLLAMKARKMETSLRKVLRWCADMVVSDIALRGL 412 (537) T ss_pred ecCCHHHHHHHHHHHHHHHHHhcCCCCCcc-ccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 999999999999999999999999999765 456789999999999999999999999999999999999999987653 Q ss_pred ---CcceEEEEeCCCCCCCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhc--------c Q lcl|NC_021326. 359 ---EHKDVDISFNYNKVANTELQVQTAQQSM--GIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPN--------L 425 (445) Q Consensus 359 ---~~~~i~v~f~~~~p~d~~~~~~~~~~~~--g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~--------~ 425 (445) +..+|+++|++++|.|.++.+++++++. |++|+||+++++|+++|++.| +++++|.+.......+ . T Consensus 413 ~~~d~~~i~i~f~~~~P~n~~e~a~~~~~l~~~giiS~eT~l~~~p~vdd~e~e-k~~~ee~~~~~~~~~~~~~~~~~~~ 491 (537) T protein:vir:78 413 GEYDSNDICFEIEPHVLANELDIATTRKTEAETEALKIGNIMTVAPRIGDDETL-KLIAEELDLDYNELKDALAEQDAQS 491 (537) T ss_pred cccccceeeEEeccCCCCCHHHHHHHHHHHHhcCcchHHHHHHhCCCCCCHHHH-HHHHHHHHhhhhhhhhhhhhhcccc Confidence 5678999999999999999999999874 899999999999999998433 3333332211111000 0 Q ss_pred ----cc--CCCC-CCCCCCCCCCc----CCC Q lcl|NC_021326. 426 ----DD--GGAD-GAQQKERSNDK----QSE 445 (445) Q Consensus 426 ----~~--~~~~-~~~~~~~~~d~----~~~ 445 (445) .+ ...+ ...++..++++ +|| T Consensus 492 ~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 522 (537) T protein:vir:78 492 LDVSPDVQAMLDGLPVNANQPPVDPNQPVAD 522 (537) T ss_pred cCcCcchhhhcCCCCCCCCCCCCCccCCCCC Confidence 00 0000 01111111122 222 No 41 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=100.00 E-value=3.8e-86 Score=488.91 Aligned_cols=428 Identities=22% Similarity=0.273 Sum_probs=353.3 Q ss_pred ChHHHHHHHH-HHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHL-EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~-~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) -|.++|++|. .+.++|+++.+||+|+|+++.++... ...++++|+++||+++||++.++|++|+|++++++++ T Consensus 19 ~~~~~i~~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~------~~~~~~~ki~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 92 (489) T protein:vir:99 19 QLKNYISRFKAEQLERLKELKRYYLGDNNIKYRPAKT------DKYAADNRIASDFAKYITVFEQGYMLGVPVEYKNENK 92 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccc------cccCCcceeecchHHHHHHHHhhhhccCCceeecCCh Confidence 5888999886 57799999999999999998776442 3345678999999999999999999999999999999 Q ss_pred HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEE----CCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeee Q lcl|NC_021326. 80 EVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYL----DEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLE 154 (445) Q Consensus 80 ~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~----d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~ 154 (445) ...+.|+.+++ |+++..+.++++.++++|++|+++|. |++|++++++++|.+++|+|++...+++.+++++|..+ T Consensus 93 ~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~ 172 (489) T protein:vir:99 93 DLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTEVKLYQLPAEQTFVIYDDTYQRNSLMAVHFYDID 172 (489) T ss_pred hHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcceEEEEEcccceEEEEcCCCCCceEEEEEEEEEe Confidence 99999999985 78999999999999999999999986 56789999999999999999988788999999999765 Q ss_pred cc-----eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHH Q lcl|NC_021326. 155 NE-----TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRL 229 (445) Q Consensus 155 ~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~ 229 (445) .. ..+++|+...++.|..... ...+.......+|+||.||||+|+|++.|.|+|+++++|+|+||.++ T Consensus 173 ~~~~~~~~~~~~y~~~~i~~~~~~~~-------~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~s~~~~v~~liDa~d~~~ 245 (489) T protein:vir:99 173 YGSGKRKQIIKAYTSDTIYTYEDYNL-------ETKGMRLKDYEGHFFKGVPVNEYANNEERTGAYESVLDNIDAYDLSQ 245 (489) T ss_pred cCCCceEEEEEEEeCCcEEEEEecCC-------CcccceecccccccCCceeEEEeecCCCCCCchhhhHHHHHHHHHHH Confidence 43 2567788777766654321 12233344567899999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCeeEEecCCcccchhHH--H--------------hhhhCceeeccC-------CCceeeEeccCChHH Q lcl|NC_021326. 230 SDLSNTFKDSNELTYVLTNYDDQELPEFK--R--------------LLRYYGAIKVSD-------NGGVDTIQVEVPVEN 286 (445) Q Consensus 230 s~~~~~~~~~~~~~l~~~g~~~~~~~~~~--~--------------~~~~~~~~~~~~-------~~~~~~l~~~~~~~~ 286 (445) |++++.++++++|+++++|......+... . ..+..+++.+.+ +.+++|++++.+.++ T Consensus 246 s~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 325 (489) T protein:vir:99 246 SELANFQQDSVNALLVIAGNAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAG 325 (489) T ss_pred HHHHHHHHHhhhhhhhhccCCcccccchhhhhhcccccccccccccccccceeeeeccccCccccccceeeeeecCChHH Confidence 99999999999999999997544322111 1 111222333322 357899999999999 Q ss_pred HHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC------- Q lcl|NC_021326. 287 SKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGE------- 359 (445) Q Consensus 287 ~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~------- 359 (445) +++++++|.+.|+.+|++|+++++++++++||+||+++++++.+|+.++++.|+.+|++++++++++++..+. T Consensus 326 ~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~ 405 (489) T protein:vir:99 326 SEAYKNRLVADILRFTFTPDTQDMKFSGVQSGESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNEATTYSL 405 (489) T ss_pred HHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccc Confidence 9999999999999999999999999999999999999999999999999999999999999999999876432 Q ss_pred cceEEEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCC--CHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCC Q lcl|NC_021326. 360 HKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVE--DLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKE 437 (445) Q Consensus 360 ~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~et~l~~l~~~~--d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 437 (445) ..+++|+|++++|.|.++.++++++++|++|+||+++++|+++ |+++|++||++|+++.++..+....+..++ ++.. T Consensus 406 ~~~i~v~f~~~~p~d~~~~~~~~~kl~giis~et~~~~l~~v~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~-~~~~ 484 (489) T protein:vir:99 406 VNDTSIVFTPNLPQNDNEIVTAAQNLYGIVSDQTIFEILNTVTGVDAEAELKRLKEEADKKQSLPEPRLVGDASG-QEEP 484 (489) T ss_pred cccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhcCCCCchhHHHHHHHHHHHHHHHhccccccccCCCCC-CcCC Confidence 3468999999999999999999999999999999999999986 788999999999887665433322222211 1111 Q ss_pred CCCCc Q lcl|NC_021326. 438 RSNDK 442 (445) Q Consensus 438 ~~~d~ 442 (445) .+..+ T Consensus 485 ~~~~p 489 (489) T protein:vir:99 485 TAEKP 489 (489) T ss_pred CCCCC Confidence 11111 No 42 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=100.00 E-value=1.3e-75 Score=431.19 Aligned_cols=429 Identities=12% Similarity=0.101 Sum_probs=328.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.+|+++|..+.++++++.+||+|+|++.+.+.... ....++|+++||+++||++.++|++++++... ++++ T Consensus 7 ~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~------~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~-~d~~ 79 (480) T protein:vir:78 7 HVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAP------PELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDSE 79 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccc------hhhhhhhhhcchHHHHHHHHHhhhccCceecC-CCch Confidence 9999999999999999999999999998755433321 11235678999999999999999999998654 4556 Q ss_pred HHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEE------CCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEee Q lcl|NC_021326. 81 VIKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYL------DEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKL 153 (445) Q Consensus 81 ~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~------d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~ 153 (445) ..+.++.+| .|+++.++.+++++++++|+||++||. |++|.+++++++|.+++++||+...+++.+++++|.. T Consensus 80 ~~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~~~~p~~~~~i~D~~~~~~~~~~i~~~~~ 159 (480) T protein:vir:78 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) T ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEEEEcccceEEEEcCCCccceEEEEEEEEe Confidence 677777777 488999999999999999999999996 4678999999999999999999888999999999865 Q ss_pred ecc----eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCC-----CcCccHHH-HHHHHH Q lcl|NC_021326. 154 ENE----TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISDIFM-YKTLID 223 (445) Q Consensus 154 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~g~s~~~~-v~~lid 223 (445) ... ..+++|+...+++|........ ....+....+|+||+||||+|.|+. +|.|++++ |++|+| T Consensus 160 ~d~~~~~~~~~~y~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i~~l~D 233 (480) T protein:vir:78 160 RDDVAVPDRATLYLPDETVPLRRNGGLND------QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) T ss_pred ecCCcceEEEEEEeCCeEEEEEecCCCcc------cccccccccccCCCCcceEEeecccccCCccCccchhHHHHHHHH Confidence 443 3467788777776665433211 1122345578999999999999874 58999986 899999 Q ss_pred HHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHH---hhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHH Q lcl|NC_021326. 224 AYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKR---LLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIML 300 (445) Q Consensus 224 ~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~ 300 (445) +||+++|++++.++++++|+++++|.+.+...+... .....+.+...+++++++.+++. ...+++++.++..|+. T Consensus 234 a~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~l~~~i~~ 311 (480) T protein:vir:78 234 AASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKA--AELRNFAEEMEVFRKE 311 (480) T ss_pred HHHHHHHHHHHHHHhhcchhhhhhCCCccccccccccchhhhhhhhhccCCCCCceEEecCc--cCHHHHHHHHHHHHHH Confidence 999999999999999999999999987544322111 11122334445667888887664 3456667777777777 Q ss_pred HhCccccccccccC----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC--CcceEEEEeCCCCCCC Q lcl|NC_021326. 301 FGQAVDFSSDKFGS----APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG--EHKDVDISFNYNKVAN 374 (445) Q Consensus 301 ~s~~p~~~~~~~~~----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~--~~~~i~v~f~~~~p~d 374 (445) ++++|+++...+++ ++||+||++++.+|..||.++++.|+.+|++++++++.+.+... +...++++|+++.|+| T Consensus 312 ~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s 391 (480) T protein:vir:78 312 AASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPT 391 (480) T ss_pred HhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeeeEEecCCCCCC Confidence 77666665555543 37999999999999999999999999999999999999988654 5568999999999999 Q ss_pred HHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHh-hhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 375 TELQVQTAQQSM----GIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQ-LPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 375 ~~~~~~~~~~~~----g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~-~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) .++.+++++|++ |++|++|+++++|+++|+.+|++++++++.+.... +.....+.++. .+....++.+.| T Consensus 392 ~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 466 (480) T protein:vir:78 392 VAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADA-TPKPTVTETKTE 466 (480) T ss_pred HHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccCCCcc-ccCCCCCCCCCc Confidence 999999998873 47999999999999999888988887777654432 22222222221 111111111112 No 43 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=100.00 E-value=2.3e-75 Score=429.81 Aligned_cols=429 Identities=11% Similarity=0.100 Sum_probs=327.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.+|+++|..+.++++++.+||+|+|++.+.+.... ....++|+++||+++||++.++|+.+++++.. ++++ T Consensus 7 ~i~~L~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~------~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~-~d~~ 79 (480) T protein:vir:78 7 HVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAP------PELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDSE 79 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc------hhHhhhhhhcchHHHHHHHHHhhhccCceecC-CCch Confidence 9999999999999999999999999998755433221 22235678999999999999999999998654 4555 Q ss_pred HHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEE------CCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEee Q lcl|NC_021326. 81 VIKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYL------DEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKL 153 (445) Q Consensus 81 ~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~------d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~ 153 (445) ..+.++++| .|+++.++.+++++++++|+||++||. |++|.+++++++|.+++++||+...+++.+++++|.. T Consensus 80 ~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~~p~~~~~~~D~~~~~~~~~~i~~~~~ 159 (480) T protein:vir:78 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) T ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEEcccceEEEEcCCCccceEEEEEEEEe Confidence 667777777 488999999999999999999999996 4578999999999999999999888999999999865 Q ss_pred ecc----eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCC-----CcCccHHH-HHHHHH Q lcl|NC_021326. 154 ENE----TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISDIFM-YKTLID 223 (445) Q Consensus 154 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~g~s~~~~-v~~lid 223 (445) ... ..+++|+...+.+|........ ......+..+|+||+||||+|.|++ +|.|+|++ |++|+| T Consensus 160 ~~~~~~~~~~~~y~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~D 233 (480) T protein:vir:78 160 RDDVAVPDRATLYLPDETVPLRRNGGLND------QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) T ss_pred ecCCCceEEEEEEeCCeEEEEEecCCCcc------ccccccccccCCCCCcceEEeecccccCCccCcccchhhHHHHHH Confidence 443 3467788877777665433211 1122335578999999999999864 68999986 999999 Q ss_pred HHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHH---hhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHH Q lcl|NC_021326. 224 AYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKR---LLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIML 300 (445) Q Consensus 224 ~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~ 300 (445) +||+++|++++.++++++|+++++|.+.+...+... .....+.+...+++++++.+++. ...+++++.++..|++ T Consensus 234 a~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~l~~~i~~ 311 (480) T protein:vir:78 234 AASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKA--AELRNFAEEMEVFRKE 311 (480) T ss_pred HHHHHHHHHHHHHHhhcchhhhhhcCCccccccccccchhhhhhhhhccCCCCCceEEecCc--cCHHHHHHHHHHHHHH Confidence 999999999999999999999999987654332111 11223344445677888887664 3456666777777777 Q ss_pred HhCcccccccccc----CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC--CcceEEEEeCCCCCCC Q lcl|NC_021326. 301 FGQAVDFSSDKFG----SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG--EHKDVDISFNYNKVAN 374 (445) Q Consensus 301 ~s~~p~~~~~~~~----~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~--~~~~i~v~f~~~~p~d 374 (445) ++++++++...++ +++||+||++++..|..|+.++++.|+.+|++++++++.+.|... +...++++|+++.++| T Consensus 312 ~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~g~~~~~~~~~i~v~f~~~~~~s 391 (480) T protein:vir:78 312 AASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPT 391 (480) T ss_pred HhcccCCChHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeeeEEecCCCCCC Confidence 7666655554444 347999999999999999999999999999999999999988543 5578999999999999 Q ss_pred HHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHH-HhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 375 TELQVQTAQQSM----GIVSHETVLENHPFVEDLQAELERIEQEQMEYN-KQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 375 ~~~~~~~~~~~~----g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~-~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) .++.+++++|+. |++|++|+++++|+++|+.++|+++++|+.+.. ..+.......+..... +..++..+| T Consensus 392 ~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 466 (480) T protein:vir:78 392 VAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPK-PTVTETKTE 466 (480) T ss_pred HHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccccCCCCCC-CCCCCCCCc Confidence 999999998863 479999999999999988888888777665533 2232222222222111 111111122 No 44 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=100.00 E-value=4.2e-75 Score=428.35 Aligned_cols=425 Identities=12% Similarity=0.098 Sum_probs=327.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.+|+++|..+++|++++.+||+|+|++.+.+...+.. ..+.++++|||++||++.++|+...+++... ++. T Consensus 17 ~~~~l~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~~~~------~~~~~~v~n~~~~iVd~~~~~l~~~g~~~~~-~~~ 89 (486) T protein:vir:42 17 VREEMISAFEDASKDLASNTSYYDAERRPEAIGVTVPRE------MQQLLAHVGYPRLYVDSVAERQAVEGFRLGD-ADE 89 (486) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccCcchhcccccchh------HhhhhhccchHHHHHHHHHhhhcccceecCC-Cch Confidence 899999999999999999999999999886654332211 1244678899999999999999888876543 344 Q ss_pred HHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECC--------CCcEEEEEEccceeEEEEcCCCCCceEEEEEEE Q lcl|NC_021326. 81 VIKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDE--------EGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMY 151 (445) Q Consensus 81 ~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~--------~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~ 151 (445) ..+.++.+| .|+++....++++++++||++|++||.++ ++.+++++++|.+++++||+. .+++.+++++| T Consensus 90 ~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~i~d~~-~~~~~~~~~~~ 168 (486) T protein:vir:42 90 ADEELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPTRMHAEIDPR-INRVSKAIRVA 168 (486) T ss_pred hHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEecccceEEEEeCC-CCCeEEEEEEE Confidence 445556666 58899999999999999999999999875 456899999999999999976 46788999888 Q ss_pred eeecc---eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCC-----CcCccHHH-HHHHH Q lcl|NC_021326. 152 KLENE---TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISDIFM-YKTLI 222 (445) Q Consensus 152 ~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~g~s~~~~-v~~li 222 (445) ...+. ..+.+|+...++++....+ .+......+|+||+||||+|.|++ +|.|+|++ |++|| T Consensus 169 ~~~~~~~~~~~~~y~~~~~~~~~~~~~----------~~~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~li 238 (486) T protein:vir:42 169 YDKEGNEIQAATLYTPMETIGWFRADG----------EWAEWFNVPHGLGVVPVVPLPNRTRLSDLYGTSEITPELRSMT 238 (486) T ss_pred EecCCCeEEEEEEEcCCcEEEEEecCC----------cEEeecceecCCCCceEEEeccccccCCCCCcccchhhHHHHH Confidence 75443 2466788777776654432 222334568999999999999974 58999985 89999 Q ss_pred HHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhH----HHhh-hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHH Q lcl|NC_021326. 223 DAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEF----KRLL-RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQK 297 (445) Q Consensus 223 d~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~----~~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~ 297 (445) |+||+++|++++..+++++|+++++|++.+..... ...+ ...+.++..+++++++.+++ ..+.+++++.++.. T Consensus 239 Da~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~--~~~~e~~~~~l~~~ 316 (486) T protein:vir:42 239 DAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDSETGQTLFDAYLARILAFEDAEGKIQQFS--AAELANFTNALDQI 316 (486) T ss_pred HHHHHHHHHHHHHHHhhcchHHHhhcCCccccccccccccchhhhhhchhcccCCCCceEEeec--ccCHHHHHHHHHHH Confidence 99999999999999999999999999865432211 1111 12234445566778886654 45678899999999 Q ss_pred HHHHhCccccccccccCc----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC---CCcceEEEEeCCC Q lcl|NC_021326. 298 IMLFGQAVDFSSDKFGSA----PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK---GEHKDVDISFNYN 370 (445) Q Consensus 298 i~~~s~~p~~~~~~~~~~----~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~---~~~~~i~v~f~~~ 370 (445) |+.++.+|++++..++++ +||+||++++.+|.+|++++++.|+.+|++++++++++.+.. .+..++++.|+++ T Consensus 317 i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~w~~~ 396 (486) T protein:vir:42 317 AKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKGGDVPPDMLRMETVWRDP 396 (486) T ss_pred HHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeeEEecCC Confidence 999998888877766643 799999999999999999999999999999999999987753 3557899999999 Q ss_pred CCCCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhc-cccCCCC-CCCCCCCCCCcCC Q lcl|NC_021326. 371 KVANTELQVQTAQQSM----GIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPN-LDDGGAD-GAQQKERSNDKQS 444 (445) Q Consensus 371 ~p~d~~~~~~~~~~~~----g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~-~~~~~~~-~~~~~~~~~d~~~ 444 (445) .|+|.++.+++++|++ |++|++|+++++|+++|+.+||+|+++|+.+......+ ..+.... ++++...+++..+ T Consensus 397 ~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 476 (486) T protein:vir:42 397 STPTYAAKADAATKLYGNGQGVIPRERARIDMGYSVKEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPTAPPKPQ 476 (486) T ss_pred CCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCCCCCCC Confidence 9999999999999975 78999999999999999999999999888654443322 2221111 1111111111111 Q ss_pred C Q lcl|NC_021326. 445 E 445 (445) Q Consensus 445 ~ 445 (445) + T Consensus 477 ~ 477 (486) T protein:vir:42 477 P 477 (486) T ss_pred c Confidence 1 No 45 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=100.00 E-value=2.2e-74 Score=424.45 Aligned_cols=432 Identities=12% Similarity=0.042 Sum_probs=324.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccccccccccccc-ccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKP-DDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~-~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) ++.+|++.|..++++++++.+||+|+|++...+..... ..+. +.++++|||++||++.++|++.++++ +++. T Consensus 31 l~~~l~~~~~~~~~rl~~l~~YY~G~~~~~~~~~~~~~-----~~~~~~~~~v~n~~~~ivd~~a~~l~~~gf~--~~d~ 103 (501) T protein:vir:25 31 LVADMWRLHISERQWLDRIYEYTKGLRGRPEVPEGASD-----EVKELAKLSVKNVLSLVRDSFAQNLSVVGYR--NALA 103 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccCCh-----hhhhhHhhhhcChHHHHHHHHHhhhccccee--cCCc Confidence 79999999999999999999999999987654433322 2222 23467899999999999999877755 4444 Q ss_pred HHHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCC-CCCceEEEEEEEeeecc- Q lcl|NC_021326. 80 EVIKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDK-EHEELEAFIRMYKLENE- 156 (445) Q Consensus 80 ~~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~-~~~~~~~~v~~~~~~~~- 156 (445) ...+.++.+| .|+++....++++++++||++|++||.+++| ++|++++|.+++++|+++ ....+.+++++|..... T Consensus 104 ~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~-~~i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~~~~~ 182 (501) T protein:vir:25 104 KENDPAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTDEG-PVFRTRSPRQILAVYADPSVDAWPQYALETWVAQKDA 182 (501) T ss_pred cchHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCC-CeEEEeccccEEEEEecCCCCcceeEEEEEEeecccc Confidence 4456666776 5889999999999999999999999999888 589999999999999654 45568899998865432 Q ss_pred ---eeEEEEecceEEEEEEecceeeeccc-----------ccccccccccccccccccceEEecCC----CCcCccHHHH Q lcl|NC_021326. 157 ---TKVEYWDKITVNYYVYENGSLIPDYS-----------NNLENSKTHFSTGSWGKIPFIPFKNN----DLEISDIFMY 218 (445) Q Consensus 157 ---~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~g~iPvv~~~n~----~~g~s~~~~v 218 (445) ..+.+|+...++.+............ ...+..+....+|+||.||||+|.|+ .+|+|+|+++ T Consensus 183 ~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~~~~~g~sdie~v 262 (501) T protein:vir:25 183 KPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGRDADDMIVGEVAPL 262 (501) T ss_pred CcceeEEEecCeeEEEEecCceeeeeccccccccccccccccccccccccccCCccceeeEeccCccccCccccchhhhh Confidence 34667777666655433221111110 11222333456899999999999994 4589999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeeccCCCceeeEeccC-ChHHHHHHHHHHHHH Q lcl|NC_021326. 219 KTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEV-PVENSKKYLDELYQK 297 (445) Q Consensus 219 ~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~i~~l~~~ 297 (445) ++|+|+||+++|++++..+++++|+++++|++.+..+.+. ....+++ ..+++++++.+++. +.+.+.+.++.+..+ T Consensus 263 ~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~--~~~~~i~-~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~ 339 (501) T protein:vir:25 263 ILLQQAINSVNFDRLIVSRFGANPQRVISGWTGSKAEVLK--ASALRVW-TFEDPEVKAQAFPPASVEPYNLILEEMLQH 339 (501) T ss_pred HHHHHHHHHHHHHHHHHHHhhccHHHHHhCCCCCccchhh--hccccee-ccCCCCceEEEecccChHHHHHHHHHHHHH Confidence 9999999999999999999999999999999876655433 3344444 44567788887664 557777888888888 Q ss_pred HHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC--CcceEEEEeCCCCCCCH Q lcl|NC_021326. 298 IMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG--EHKDVDISFNYNKVANT 375 (445) Q Consensus 298 i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~--~~~~i~v~f~~~~p~d~ 375 (445) |+..|.+|+.+++.+++|+||+||++++.+|.+++.++++.|+.+|++++++++.+.+... ...++++.|+++.|+|. T Consensus 340 i~~~s~~P~~~~~~~~~N~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~ 419 (501) T protein:vir:25 340 VAMVAQISPAQVTGKMINVSAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDDDPDTAADSGAEVLWRDTEARSF 419 (501) T ss_pred HHhhcCCChhhhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccceeeeEEecCCCCCCH Confidence 8888999988888888899999999999999999999999999999999999999998654 34678999999999999 Q ss_pred HHHHHHHHHHhcc-CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHH--hhhc----cccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 376 ELQVQTAQQSMGI-VSHETVLENHPFVEDLQAELERIEQEQMEYNK--QLPN----LDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 376 ~~~~~~~~~~~g~-~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~--~~~~----~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++.+++++|++|+ +|.+|++.++|++++++ ++++++++++... .... .......+..+.....+.+++ T Consensus 420 ~~~ada~~kl~~~gis~et~~~~~~g~~~~~--ie~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (501) T protein:vir:25 420 GAVVDGITKLASAGIPIEHLLSMVPGMTQQT--IQAIKDSLRGGEVKSLVDKLLSNEPAPVPPPPPQAAAQALNEGG 494 (501) T ss_pred HHHHHHHHHHHhcCCCHHHHHHHcCCCCHHH--HHHHHHHHHHHhHHHHHHHhhccCcCCCCCCCCCCCcccccccc Confidence 9999999999876 89999999999998644 3444433322111 1111 111111121222222222222 No 46 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=100.00 E-value=4.6e-74 Score=422.63 Aligned_cols=425 Identities=12% Similarity=0.084 Sum_probs=324.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) ++..|+++|..+++|++++.+||+|+|++.+.+...... ..++++++||+++||++.++|+++++++.. +++. T Consensus 17 ~~~~L~~~~~~~~~r~~~~~~YY~G~~~i~~~~~~~~~~------~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~-~~~~ 89 (485) T protein:vir:24 17 ARDEMVSAFEDQNQNLRSNTSYYEAERRPEAIGVTVPVQ------MQSLLAHVGYPRLYVDSIAERQAVEGFRLG-DADE 89 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCchhhcCcccchh------hhhhhhccchHHHHHHHHhhhhccCceecC-CCch Confidence 778899999999999999999999999886554333222 235678899999999999999999998744 3445 Q ss_pred HHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCC--------CcEEEEEEccceeEEEEcCCCCCceEEEEEEE Q lcl|NC_021326. 81 VIKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEE--------GEFKLFRVPAEQGIPIWTDKEHEELEAFIRMY 151 (445) Q Consensus 81 ~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~--------g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~ 151 (445) ..+.++++| .|+++..+.++++++++||++|++||.+++ +.++|+.++|.+++++||+.. +++.++++++ T Consensus 90 ~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~i~D~~~-~~~~~~~~~~ 168 (485) T protein:vir:24 90 ADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPTRMYAEIDPRI-GRPAKAIRVA 168 (485) T ss_pred hHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccceeEEEeeCCc-CceeEEEEEE Confidence 556667777 488999999999999999999999999865 567899999999999999875 5677776666 Q ss_pred eeecc---eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCC-----CcCccHH-HHHHHH Q lcl|NC_021326. 152 KLENE---TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISDIF-MYKTLI 222 (445) Q Consensus 152 ~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~g~s~~~-~v~~li 222 (445) ..... ..+.+|+...++++....+ .+......+|+||.||||+|.|++ +|.|+++ .+++|| T Consensus 169 ~~~~~~~~~~~~~y~~~~~~~~~~~~~----------~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~li 238 (485) T protein:vir:24 169 YDAEGNEIQAATLYTPNETFGWFRAEG----------EWVEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSEITPELRSMT 238 (485) T ss_pred EeecCCeEEEEEEEcCCcEEEEEecCC----------ceEeecccccCCCcccEEEeccCcccCCcCCcccchhhHHHHH Confidence 54433 3456777776666654332 222334568999999999999874 6899998 499999 Q ss_pred HHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhH----HHhh-hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHH Q lcl|NC_021326. 223 DAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEF----KRLL-RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQK 297 (445) Q Consensus 223 d~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~----~~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~ 297 (445) |+||+++|++++.++++++|+++++|.+.+..... ...+ ...+.++..+++++++.++ +..+++++++.++.. T Consensus 239 Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~--~~~~~e~~~~~l~~~ 316 (485) T protein:vir:24 239 DAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARILAFEDAEGKIQQF--SAAELANFTNALDQI 316 (485) T ss_pred HHHHHHHHHHHHHHHhhcchhhhhccCCccccccccccccchhhhcccceeccCCCCceEEee--cccchHHHHHHHHHH Confidence 99999999999999999999999999865432211 1111 1233455556677887654 456678899999999 Q ss_pred HHHHhCccccccccccC----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC---CCcceEEEEeCCC Q lcl|NC_021326. 298 IMLFGQAVDFSSDKFGS----APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK---GEHKDVDISFNYN 370 (445) Q Consensus 298 i~~~s~~p~~~~~~~~~----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~---~~~~~i~v~f~~~ 370 (445) |++++++|++++..+++ ++||+||++++.+|.+||+++++.|+.+|++++++++.+.+.. .+..+++++|+++ T Consensus 317 i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~f~~~ 396 (485) T protein:vir:24 317 AKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKGGDVPPDMLRMETVWRDP 396 (485) T ss_pred HHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCccccceeeEEecCC Confidence 99999888887776664 3799999999999999999999999999999999999986543 3567899999999 Q ss_pred CCCCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhh-hccccCCC-CCCC-CCCCCCC-- Q lcl|NC_021326. 371 KVANTELQVQTAQQSM----GIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQL-PNLDDGGA-DGAQ-QKERSND-- 441 (445) Q Consensus 371 ~p~d~~~~~~~~~~~~----g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~-~~~~~~~~-~~~~-~~~~~~d-- 441 (445) .|+|.++.+++++|++ |++|+||+++++|+++|+.+|++++++|+.+..... ....+... .+++ ++.+.++ T Consensus 397 ~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~ 476 (485) T protein:vir:24 397 STPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPNPTPAPKPQ 476 (485) T ss_pred CCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHHHHHHHHhhhhhhHHHhhcccCCCCCCCCCCCCCCCCc Confidence 9999999999999874 589999999999999988899999988876533322 11111111 1111 1111111 Q ss_pred --cC-CC Q lcl|NC_021326. 442 --KQ-SE 445 (445) Q Consensus 442 --~~-~~ 445 (445) .+ ++ T Consensus 477 ~~~~~~~ 483 (485) T protein:vir:24 477 PAIEGGD 483 (485) T ss_pred cCCCCCC Confidence 11 11 No 47 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=100.00 E-value=1.4e-73 Score=419.98 Aligned_cols=425 Identities=12% Similarity=0.080 Sum_probs=324.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|++|+++|..+++|++++.+||+|+|++.+.+....... .++++++|||++||++.++||++++++.. ++++ T Consensus 17 ~~~~l~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~------~~~~~~~n~~~~ivd~~~~~l~~~g~~~~-~~~~ 89 (485) T protein:vir:10 17 ARDEMVSAFEDSTQNLKTNTSYYEAERRPEAIGVTVPIQM------QSLLAHVGYPRLYVDSIAERQAVEGFRFG-DADE 89 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCCCCChhh------hhhhhhcCcHHHHHHHHHhhhcccceecC-CCch Confidence 8899999999999999999999999998866554432221 24567789999999999999998887653 4445 Q ss_pred HHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECC--------CCcEEEEEEccceeEEEEcCCCCCceEEEEEEE Q lcl|NC_021326. 81 VIKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDE--------EGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMY 151 (445) Q Consensus 81 ~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~--------~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~ 151 (445) ..+.++++| .|+++.+..++++++++||+||++||.++ ++.++|++++|.+++++||+.. +++.++++++ T Consensus 90 ~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~~~D~~~-~~~~~~~~~~ 168 (485) T protein:vir:10 90 ADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVEPPTRMYAEIDPRI-GRVSKAIRVA 168 (485) T ss_pred hHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEEccceeEEEEcCCC-CceeEEEEEE Confidence 556666666 58899999999999999999999999975 4678899999999999998764 5566666666 Q ss_pred eeecc---eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCC-----CcCccHHH-HHHHH Q lcl|NC_021326. 152 KLENE---TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISDIFM-YKTLI 222 (445) Q Consensus 152 ~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~g~s~~~~-v~~li 222 (445) ..... ..+.+|+...++++..... ++......+|++|+||||+|+|+. +|.|+|++ |++|| T Consensus 169 ~~~~~~~~~~~~~y~~~~~~~~~~~~~----------~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~li 238 (485) T protein:vir:10 169 YDAEGNEIQAATLYTPNDIFGWYRVEN----------EWQEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSEITPELRSMT 238 (485) T ss_pred EeeCCCeEEEEEEEeCCeEEEEEEcCC----------ceEEeccccCCCCcccEEEeccccccCCCCCccchhHHHHHHH Confidence 54433 2466788877776654322 233334568999999999999874 58999985 89999 Q ss_pred HHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhH----HHhhh-hCceeeccCCCceeeEeccCChHHHHHHHHHHHHH Q lcl|NC_021326. 223 DAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEF----KRLLR-YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQK 297 (445) Q Consensus 223 d~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~----~~~~~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~ 297 (445) |+||+++|++++.++++++|+++++|...+..... ...++ ..+.++..++++++|.+++ ..+++++++.++.. T Consensus 239 Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~d~k~~q~~--~~~~~~~~~~l~~~ 316 (485) T protein:vir:10 239 DAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARILAFEDAEGKIQQFS--AAELANFTNALDQI 316 (485) T ss_pred HHHHHHHHHHHHHHHhhcchHHHHhcCCcccccccccccchhhhhcccceeccCCCCceEEeec--ccchHHHHHHHHHH Confidence 99999999999999999999999999765432111 11111 2234455566788887654 44577888888888 Q ss_pred HHHHhCccccccccccC----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC---CCcceEEEEeCCC Q lcl|NC_021326. 298 IMLFGQAVDFSSDKFGS----APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK---GEHKDVDISFNYN 370 (445) Q Consensus 298 i~~~s~~p~~~~~~~~~----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~---~~~~~i~v~f~~~ 370 (445) |++++.+|++++..+++ ++||+||++++.+|.+|++++++.|+.+|++++++++.+.+.. .+..+++|.|+++ T Consensus 317 i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~ 396 (485) T protein:vir:10 317 AKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRMMKGGDVPPDMLRMETVWRDP 396 (485) T ss_pred HHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCcccceeeeEEecCC Confidence 88888777766655543 4799999999999999999999999999999999999887643 3456889999999 Q ss_pred CCCCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhh---hccccCCCCCCCCCCCCCCcC Q lcl|NC_021326. 371 KVANTELQVQTAQQSM----GIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQL---PNLDDGGADGAQQKERSNDKQ 443 (445) Q Consensus 371 ~p~d~~~~~~~~~~~~----g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~---~~~~~~~~~~~~~~~~~~d~~ 443 (445) .|+|.++.+++++|++ |++|+||+++++|++++..++++++++|+.+..... ......+.+++.+.++++++. T Consensus 397 ~~~~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 476 (485) T protein:vir:10 397 STPTYAAKADAASKLYNGGTGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLIGTMVDPNPTVPGSPSPAPAPKPA 476 (485) T ss_pred CCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCCccccccCc Confidence 9999999999999984 489999999999999888889999887775433221 112223333333333333333 Q ss_pred CC Q lcl|NC_021326. 444 SE 445 (445) Q Consensus 444 ~~ 445 (445) ++ T Consensus 477 ~~ 478 (485) T protein:vir:10 477 AL 478 (485) T ss_pred CC Confidence 32 No 48 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=100.00 E-value=1.1e-73 Score=420.51 Aligned_cols=426 Identities=11% Similarity=0.053 Sum_probs=324.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeec----- Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFK----- 75 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~----- 75 (445) +|.+|+++|..+.+|++++.+||+|+|++.+.+..... . ..++|+++|||++||++.+++++.+++.+. T Consensus 12 ~i~~L~~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~-----~-~~~~~~~~n~~~~ivd~~a~~l~~~Gf~~~~~~~~ 85 (488) T protein:vir:23 12 LRDQLLDAFENKQNELKSSKAYYDAERRPDAIGLAVPL-----D-MRKYLAHVGYPRTYVDAIAERQELEGFRIPSANGE 85 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcccchhhcCcccch-----h-hhhhhhhcchHHHHHHHHHHhhhccceeccCCccc Confidence 99999999999999999999999999988765543322 1 235678999999999999987766655432 Q ss_pred ----cCchHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC--------CCCcEEEEEEccceeEEEEcCCCCC Q lcl|NC_021326. 76 ----HTDDEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLD--------EEGEFKLFRVPAEQGIPIWTDKEHE 142 (445) Q Consensus 76 ----~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d--------~~g~~~i~~~~p~~~~~v~d~~~~~ 142 (445) +++++..+.++++|+ |+++.+..++++++++||++|++|+.+ +++.++|++++|.+++++||+. .+ T Consensus 86 ~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~~d~~-~~ 164 (488) T protein:vir:23 86 EPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVPLIRVEPPTALYAEVDPR-TR 164 (488) T ss_pred ccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcceEEEeccceeEEEEecC-CC Confidence 345566677777774 889999999999999999999999874 4567899999999999999975 46 Q ss_pred ceEEEEEEEeeecce---eEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCC-----CcCcc Q lcl|NC_021326. 143 ELEAFIRMYKLENET---KVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISD 214 (445) Q Consensus 143 ~~~~~v~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~g~s~ 214 (445) ++.+++++|...+.. ...+|+...++++.... .++......+|+||+||||+|.|++ +|+|+ T Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~----------~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~ 234 (488) T protein:vir:23 165 KVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAE----------GEWEAPTSTPHGLEMVPVIPISNRTRLSDLYGTSE 234 (488) T ss_pred ceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecC----------CceEeccccccCCCCcceEEeccccccCCcCCccc Confidence 788888877654432 34566666665554322 2233445678999999999999865 58899 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhH------HHhhhhCceeeccCCCceeeEeccCChHHH Q lcl|NC_021326. 215 IFM-YKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEF------KRLLRYYGAIKVSDNGGVDTIQVEVPVENS 287 (445) Q Consensus 215 ~~~-v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 287 (445) +++ |++|+|+||+++|++++.++++++|+++++|+..+..... ........++.+++++++++.+++ ..++ T Consensus 235 i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~q~~--~~~~ 312 (488) T protein:vir:23 235 ISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAETGQRMFDAYMARILAFEGGEGAHAEQFS--AAEL 312 (488) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCcccccccccccchhhhhhhhhhccCCCCCCceeEecC--CCCh Confidence 974 8999999999999999999999999999999865432211 111223445566777788887655 4456 Q ss_pred HHHHHHHHHHHHHHhCccccccccccC----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC---Cc Q lcl|NC_021326. 288 KKYLDELYQKIMLFGQAVDFSSDKFGS----APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG---EH 360 (445) Q Consensus 288 ~~~i~~l~~~i~~~s~~p~~~~~~~~~----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~---~~ 360 (445) +++++.++..|+.++.++++++..+++ ++||+||++++++|.+|++++++.|+.+|++++++++++.+... +. T Consensus 313 ~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~ 392 (488) T protein:vir:23 313 RNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRLAYKMVKGGDIPTEY 392 (488) T ss_pred HHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcchhh Confidence 778888888888887777666555543 47999999999999999999999999999999999999887543 55 Q ss_pred ceEEEEeCCCCCCCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHH-Hhhhc----cccCCCC Q lcl|NC_021326. 361 KDVDISFNYNKVANTELQVQTAQQSM----GIVSHETVLENHPFVEDLQAELERIEQEQMEYN-KQLPN----LDDGGAD 431 (445) Q Consensus 361 ~~i~v~f~~~~p~d~~~~~~~~~~~~----g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~-~~~~~----~~~~~~~ 431 (445) .+++++|+++.|+|.++.+++++|++ |++|+||+++++|+++++.+|++++++|+.+.. ..+.. ....+.. T Consensus 393 ~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (488) T protein:vir:23 393 YRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVEREQMRQWLEQDQKQGLGLIGSLYGASTPEGKP 472 (488) T ss_pred ccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccC Confidence 68999999999999999999999873 579999999999999999999998876654322 22211 1122222 Q ss_pred CCCCCCCCCCcCCC Q lcl|NC_021326. 432 GAQQKERSNDKQSE 445 (445) Q Consensus 432 ~~~~~~~~~d~~~~ 445 (445) +..+.++.+|++.. T Consensus 473 ~~~~~~~~~~~e~~ 486 (488) T protein:vir:23 473 GEAPVGEPPAPEPD 486 (488) T ss_pred CCCCCCCCCCCCCC Confidence 22333333333333 No 49 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=100.00 E-value=4.2e-73 Score=417.36 Aligned_cols=424 Identities=11% Similarity=0.039 Sum_probs=317.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCc-h Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTD-D 79 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d-~ 79 (445) +|++|+++|..+++|++++.+||+|+|++++.+...... ....++|+++|||++||++.++|++|+|+++.+.+ . T Consensus 9 ~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~----~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~~~d~ 84 (456) T protein:vir:10 9 WLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAA----WRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADS 84 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChh----hhhhhhhhhcchHHHHHHHHHhhhccCCeecCCCCCc Confidence 999999999999999999999999999886655443322 22235789999999999999999999999997643 3 Q ss_pred HHHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeeccee Q lcl|NC_021326. 80 EVIKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETK 158 (445) Q Consensus 80 ~~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~ 158 (445) +..+.++++| .|+++....++++++++||++|++||.+++|.+++++++|.+++++||+...+++.+++++|...+... T Consensus 85 ~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~~d~~~ 164 (456) T protein:vir:10 85 DLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRAAMRWWRDLDAES 164 (456) T ss_pred chHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEEEcCCCCcceEEEEEEEEecCCce Confidence 4445565666 588999999999999999999999999999999999999999999999988889999999997654432 Q ss_pred --EEEEecceEE-EEEE----ecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 159 --VEYWDKITVN-YYVY----ENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSD 231 (445) Q Consensus 159 --~~~~~~~~~~-~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~ 231 (445) ..+|....+. .+.. ..............+......+|.+|.|||+++. |++|.|+|+++++++|+||+++|+ T Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~-N~~g~gd~e~vi~liDa~~~~~s~ 243 (456) T protein:vir:10 165 DFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQ-NPDGMGEVEPHIDIINRINRAELQ 243 (456) T ss_pred eEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEec-CCCCCchhhhhHHHHHHHHHHHHH Confidence 2333322222 1110 0111111111122223334468899999998875 578999999999999999999999 Q ss_pred HHHHHHHhcCCeeEEecCCcccch--hHH------Hhh--hhCceeeccCCCceeeEecc-CChHHHHHHHHHHHHHHHH Q lcl|NC_021326. 232 LSNTFKDSNELTYVLTNYDDQELP--EFK------RLL--RYYGAIKVSDNGGVDTIQVE-VPVENSKKYLDELYQKIML 300 (445) Q Consensus 232 ~~~~~~~~~~~~l~~~g~~~~~~~--~~~------~~~--~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~i~~l~~~i~~ 300 (445) +++..+++++|+++++|....... +.. ..+ ....++..+++.+ +.+.+ .+.+.+.+.++.+...|+. T Consensus 244 ~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~--~~q~~~~~~~~~~~~l~~~i~~~~~ 321 (456) T protein:vir:10 244 LLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVD--IWESQANDFTPMLSAIKEHIRQLSS 321 (456) T ss_pred HHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhccccccCCCCcc--eEEecccChhHHHHHHHHHHHHHHh Confidence 999999999999999997533211 110 011 1222333444444 43333 4556677777777777777 Q ss_pred HhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHH Q lcl|NC_021326. 301 FGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQ 380 (445) Q Consensus 301 ~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~ 380 (445) .+++|...++..++|+||+||++++.+|.+|+..+++.|+.+|++++++++++.|... ..+++++|+++.|+|.++.++ T Consensus 322 ~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~~-~~~~~v~w~~~~~~~~~~~ad 400 (456) T protein:vir:10 322 ATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV-EDTVDVSFESPDRVTLGEKYS 400 (456) T ss_pred ccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-ccceeEEecCCCCcCHHHHHH Confidence 7788887777777789999999999999999999999999999999999999887543 457999999999999999999 Q ss_pred HHHHHh--ccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHhhhccccCCCCCCC Q lcl|NC_021326. 381 TAQQSM--GIVSHETVLENHPFVED--LQAELERIEQEQMEYNKQLPNLDDGGADGAQ 434 (445) Q Consensus 381 ~~~~~~--g~~s~et~l~~l~~~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 434 (445) +++|+. |++|.++++++++++++ .++|++|+++|+........+..+ .++.. T Consensus 401 a~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~--~~~~~ 456 (456) T protein:vir:10 401 AASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQ--EDGSR 456 (456) T ss_pred HHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCC--CCCCC Confidence 999985 78999999999988654 345788887777543332222211 22222 No 50 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=100.00 E-value=4.2e-73 Score=417.36 Aligned_cols=424 Identities=11% Similarity=0.039 Sum_probs=317.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCc-h Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTD-D 79 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d-~ 79 (445) +|++|+++|..+++|++++.+||+|+|++++.+...... ....++|+++|||++||++.++|++|+|+++.+.+ . T Consensus 9 ~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~----~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~~~d~ 84 (456) T protein:vir:10 9 WLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAA----WRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADS 84 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChh----hhhhhhhhhcchHHHHHHHHHhhhccCCeecCCCCCc Confidence 999999999999999999999999999886655443322 22235789999999999999999999999997643 3 Q ss_pred HHHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeeccee Q lcl|NC_021326. 80 EVIKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETK 158 (445) Q Consensus 80 ~~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~ 158 (445) +..+.++++| .|+++....++++++++||++|++||.+++|.+++++++|.+++++||+...+++.+++++|...+... T Consensus 85 ~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~~d~~~ 164 (456) T protein:vir:10 85 DLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRAAMRWWRDLDAES 164 (456) T ss_pred chHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEEEcCCCCcceEEEEEEEEecCCce Confidence 4445565666 588999999999999999999999999999999999999999999999988889999999997654432 Q ss_pred --EEEEecceEE-EEEE----ecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 159 --VEYWDKITVN-YYVY----ENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSD 231 (445) Q Consensus 159 --~~~~~~~~~~-~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~ 231 (445) ..+|....+. .+.. ..............+......+|.+|.|||+++. |++|.|+|+++++++|+||+++|+ T Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~-N~~g~gd~e~vi~liDa~~~~~s~ 243 (456) T protein:vir:10 165 DFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQ-NPDGMGEVEPHIDIINRINRAELQ 243 (456) T ss_pred eEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEec-CCCCCchhhhhHHHHHHHHHHHHH Confidence 2333322222 1110 0111111111122223334468899999998875 578999999999999999999999 Q ss_pred HHHHHHHhcCCeeEEecCCcccch--hHH------Hhh--hhCceeeccCCCceeeEecc-CChHHHHHHHHHHHHHHHH Q lcl|NC_021326. 232 LSNTFKDSNELTYVLTNYDDQELP--EFK------RLL--RYYGAIKVSDNGGVDTIQVE-VPVENSKKYLDELYQKIML 300 (445) Q Consensus 232 ~~~~~~~~~~~~l~~~g~~~~~~~--~~~------~~~--~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~i~~l~~~i~~ 300 (445) +++..+++++|+++++|....... +.. ..+ ....++..+++.+ +.+.+ .+.+.+.+.++.+...|+. T Consensus 244 ~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~--~~q~~~~~~~~~~~~l~~~i~~~~~ 321 (456) T protein:vir:10 244 LLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVD--IWESQANDFTPMLSAIKEHIRQLSS 321 (456) T ss_pred HHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhccccccCCCCcc--eEEecccChhHHHHHHHHHHHHHHh Confidence 999999999999999997533211 110 011 1222333444444 43333 4556677777777777777 Q ss_pred HhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHH Q lcl|NC_021326. 301 FGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQ 380 (445) Q Consensus 301 ~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~ 380 (445) .+++|...++..++|+||+||++++.+|.+|+..+++.|+.+|++++++++++.|... ..+++++|+++.|+|.++.++ T Consensus 322 ~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~~-~~~~~v~w~~~~~~~~~~~ad 400 (456) T protein:vir:10 322 ATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV-EDTVDVSFESPDRVTLGEKYS 400 (456) T ss_pred ccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-ccceeEEecCCCCcCHHHHHH Confidence 7788887777777789999999999999999999999999999999999999887543 457999999999999999999 Q ss_pred HHHHHh--ccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHhhhccccCCCCCCC Q lcl|NC_021326. 381 TAQQSM--GIVSHETVLENHPFVED--LQAELERIEQEQMEYNKQLPNLDDGGADGAQ 434 (445) Q Consensus 381 ~~~~~~--g~~s~et~l~~l~~~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 434 (445) +++|+. |++|.++++++++++++ .++|++|+++|+........+..+ .++.. T Consensus 401 a~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~--~~~~~ 456 (456) T protein:vir:10 401 AASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQ--EDGSR 456 (456) T ss_pred HHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCC--CCCCC Confidence 999985 78999999999988654 345788887777543332222211 22222 No 51 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=100.00 E-value=3.1e-73 Score=418.10 Aligned_cols=425 Identities=11% Similarity=0.079 Sum_probs=310.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) |+.+|+++|..+++|++++.+||+|+|++++.+..... ....+..+++++|||++||++.++|++.++++ +++.+ T Consensus 18 ~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~---~~~~~~~~~~~~n~~~~iVd~~~~~l~~~gf~--~~d~~ 92 (479) T protein:vir:99 18 LETKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKN---KEREVLQQLSRKPWMGLMVNSFAQQLIVDGYR--KTGTN 92 (479) T ss_pred HHHHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCC---hhHHHHHHHhhcCcHHHHHHHHHhhccccccc--CCCch Confidence 66789999999999999999999999998766543221 22223344567899999999999999877754 45555 Q ss_pred HHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEE-----CCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeee Q lcl|NC_021326. 81 VIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYL-----DEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLE 154 (445) Q Consensus 81 ~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~-----d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~ 154 (445) ..+.++++|+ |+++....++++++++||++|++||. |++|.+++++++|.+++++|++...+. ..+..+... T Consensus 93 ~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~~i~~~~p~~~~~iydd~~~~~--~~~~~~~~~ 170 (479) T protein:vir:99 93 ENAKGWDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVARIKCIDPRDAFAIWEDPYWDE--WPKYLLERQ 170 (479) T ss_pred hhHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCceEEEEechhheEEEecCCcccc--eeeEEEeec Confidence 5566666765 88999999999999999999999995 667899999999999999998765443 222333344 Q ss_pred cceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCC----CCcCccHHHHHHHHHHHHHHHH Q lcl|NC_021326. 155 NETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNN----DLEISDIFMYKTLIDAYNRRLS 230 (445) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~----~~g~s~~~~v~~lid~~~~~~s 230 (445) ......+|+...++.+.... ..+......+|+||+||||+|.|+ ++|.|+|+++++|||+||+++| T Consensus 171 ~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~g~sd~e~v~~liDa~~~~~s 240 (479) T protein:vir:99 171 PNGQYWWWTEEDYSIFEFKQ----------GKFIYRETVSHDYGHIPFVRYVNVMDLRGVCYGDVEPLVTVAKAIDKTGL 240 (479) T ss_pred CceeEEEEecceEEEEEecC----------CceeeccccccCCCCcceEEeecCCCcCcCCcchhHHHHHHHHHHHHHHH Confidence 44556667666655544322 233444567899999999999998 5789999999999999999999 Q ss_pred HHHHHHHHhcCCeeEEecCCcccchhHH---HhhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCcccc Q lcl|NC_021326. 231 DLSNTFKDSNELTYVLTNYDDQELPEFK---RLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDF 307 (445) Q Consensus 231 ~~~~~~~~~~~~~l~~~g~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~ 307 (445) ++++.++++++|+++++|.......... ......+++. .+++++++.+++ ...++++++.++..|+.+++.+++ T Consensus 241 ~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~i~~-~~~~~~~~~q~~--~~~~~~~~~~l~~~i~~i~~~t~~ 317 (479) T protein:vir:99 241 DILLVQHHQSFQIRWATGLMLPEGANADQEKMRFAQESMLI-SQNEKASFGAIP--AAPLDGLLNAYKESLLEFLALAQL 317 (479) T ss_pred HHHHHHHHhhchhhhhcCCCcccccccchhcccccccccee-ecCCCceEEEec--ccchHHHHHHHHHHHHHHhccCCC Confidence 9999999999999999998654433222 1233344444 456678887655 344566666666666666554444 Q ss_pred ccccc--cCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC--cceEEEEeCCCCCCCHHHHHHHHH Q lcl|NC_021326. 308 SSDKF--GSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGE--HKDVDISFNYNKVANTELQVQTAQ 383 (445) Q Consensus 308 ~~~~~--~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~--~~~i~v~f~~~~p~d~~~~~~~~~ 383 (445) +...+ .+|+||+||++++.+|..++.++++.|+.+|++++++++.+.+.... ..+++++|+++.++|.++.+++++ T Consensus 318 p~~~~g~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~~~~~~~~~i~~~w~~~~~~s~~~~ad~~~ 397 (479) T protein:vir:99 318 PPHIAGQIVNVAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGRTEEATDLDFTITWQDVTIQSLAQFADAWA 397 (479) T ss_pred CHHHcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHH Confidence 33333 46799999999999999999999999999999999999999987654 346889999999999999999999 Q ss_pred HH--hccCChHHHHHhCCCCCCHHHH-HHHHHHHHHHHHHhhhccc--------cCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 384 QS--MGIVSHETVLENHPFVEDLQAE-LERIEQEQMEYNKQLPNLD--------DGGADGAQQKERSNDKQSE 445 (445) Q Consensus 384 ~~--~g~~s~et~l~~l~~~~d~~~E-~~ri~~E~~~~~~~~~~~~--------~~~~~~~~~~~~~~d~~~~ 445 (445) |+ +|++|.||+++++|++++++.| +++.++++.+..+...... .++.++..+.++...++++ T Consensus 398 kl~~ag~is~et~l~~l~gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (479) T protein:vir:99 398 KMVESLKIPAEGVWDMIPNLDQSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNMQQANNKTGE 470 (479) T ss_pred HHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCCCCCCCCCcc Confidence 97 4799999999999999875522 3334333333222222211 1122222233333333333 No 52 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=100.00 E-value=1.9e-72 Score=413.77 Aligned_cols=411 Identities=14% Similarity=0.110 Sum_probs=315.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|++|+++|..+.++++++.+||+|+|++...+..... ...++|+++|||++||++.++|+.+++++ ++++ T Consensus 8 ~i~~l~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~------~~~~~k~~~n~~~~ivd~~~~~l~~~g~~--~~d~- 78 (441) T protein:vir:80 8 LIEGMYDRIQRLSSWHCCIEGYYEGSNRVRDLGVAIPP------ELQRVQTVVSWPGIAVDALEERLDWLGWT--NGDG- 78 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCcccch------hhhhhhhhcchHHHHHHHHHhhhcccccc--CCCh- Confidence 89999999999999999999999999987554433221 22467899999999999999999776654 4443 Q ss_pred HHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeec-cee Q lcl|NC_021326. 81 VIKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN-ETK 158 (445) Q Consensus 81 ~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~-~~~ 158 (445) +.++++| .|+++..+.+++++++++|+||++||.|++|.+++++++|.+++++||+.......++++++..++ ..+ T Consensus 79 --~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~ 156 (441) T protein:vir:80 79 --YGLDGVYAANRLATASCDVHLDALIFGLSFVAIIPHGDGTVSVRPQSPKNCTGKFSADGSRLDAGLVVQQTCDPEVVE 156 (441) T ss_pred --HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCCCceEEEEEccceEEEEEeCCCCceeEEEEEEEEecCceEE Confidence 3455655 489999999999999999999999999999999999999999999999876554555455544332 245 Q ss_pred EEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCC-----CcCccHHH-HHHHHHHHHHHHHHH Q lcl|NC_021326. 159 VEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISDIFM-YKTLIDAYNRRLSDL 232 (445) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~g~s~~~~-v~~lid~~~~~~s~~ 232 (445) +++|+...+.++..... ..+......+|+||+||||+|.|++ +|.|+++. +++|||+||.++|++ T Consensus 157 ~~vy~~~~~~~~~~~~~---------~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~ 227 (441) T protein:vir:80 157 AELLLPDVIVQVERRGS---------REWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTLLGQ 227 (441) T ss_pred EEEEecCeEEEEEEcCC---------cceeeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHHHHH Confidence 67787777666543322 2344456689999999999999875 48899864 999999999999999 Q ss_pred HHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeeccCCCc---eeeEeccCChHHHHHHHHHHHHHHHHHhCcccccc Q lcl|NC_021326. 233 SNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGG---VDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSS 309 (445) Q Consensus 233 ~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~ 309 (445) ++.++++++|+++++|+..+............+++.++.+.+ +++.+ .+....+.+++.++..|+.++.++++++ T Consensus 228 ~~~~~~~~~~~~~i~G~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~i~~~~~~~~~p~ 305 (441) T protein:vir:80 228 SVNRDFYAYPQRWVTGVSADEFSQPGWVLSMASVWAVDKDDDGDTPNVGS--FPVNSPTPYSDQMRLLAQLTAGEAAVPE 305 (441) T ss_pred HHHHHhhcCceeeeecCCccccccchhhhcccccccCCCCCCCCcceeEe--cCccchHHHHHHHHHHHHHHhcccCCCH Confidence 999999999999999987766555444555666777665543 44444 4455677788888888888877776655 Q ss_pred ccccC----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC----cceEEEEeCCCCCCCHHHHHHH Q lcl|NC_021326. 310 DKFGS----APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGE----HKDVDISFNYNKVANTELQVQT 381 (445) Q Consensus 310 ~~~~~----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~----~~~i~v~f~~~~p~d~~~~~~~ 381 (445) ..+++ ++||+||++++.+|..++.++++.|+.+|++++++++++++...+ ..+++++|+++.|+|.++.+++ T Consensus 306 ~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~~i~~~f~~~~~~~~~e~ad~ 385 (441) T protein:vir:80 306 RYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDEADFFGDVGLRWRDASTPTRAATADA 385 (441) T ss_pred HHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccceeeeEEeCCCCCcCHHHHHHH Confidence 54442 369999999999999999999999999999999999999886553 4578999999999999999999 Q ss_pred HHHHh--cc--CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCC Q lcl|NC_021326. 382 AQQSM--GI--VSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERS 439 (445) Q Consensus 382 ~~~~~--g~--~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~ 439 (445) ++|++ |+ +|++++++++|+++ +|++|+++|+++..+.+....+ ....++++. T Consensus 386 ~~kl~~~g~~~~s~~~~~~~l~~~~---~e~~~~~~e~~e~~~~~~~~~~---~~~~~~~~~ 441 (441) T protein:vir:80 386 VTKLVGAGILPADSRTVLEMLGLDD---VQVEAVMRHRAESSDPLAVLAG---AISRQTNEV 441 (441) T ss_pred HHHHHhcCcccccHHHHHHhCCCCH---HHHHHHHHHHHHHHHHHHHHhh---hhhcccccC Confidence 99875 43 68899999999875 4555666665554443332222 122222233 No 53 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=100.00 E-value=1.2e-72 Score=414.97 Aligned_cols=425 Identities=12% Similarity=0.079 Sum_probs=324.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) ++.++++.+..+.++++++.+||+|+|++.+.+...... ..+.++++|||++||++.++++++++++...+ +. T Consensus 16 ~~~~l~~~~~~~~~rl~~l~~Yy~G~~~i~~~~~~~~~~------~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~-~~ 88 (484) T protein:vir:77 16 AREEMLNLFTERTQDLGDNTAYYESERRPDAVGVTVPQQ------MQKLLAHVGYPRLYIDAIAARQELEGFRLGGA-DK 88 (484) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccchh------HHhhhhhcCcHHHHHHHHHhhhccCceecCCc-ch Confidence 788888888888899999999999999876544332211 12345788999999999999999999886543 34 Q ss_pred HHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCCc--------EEEEEEccceeEEEEcCCCCCceEEEEEEE Q lcl|NC_021326. 81 VIKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGE--------FKLFRVPAEQGIPIWTDKEHEELEAFIRMY 151 (445) Q Consensus 81 ~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~--------~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~ 151 (445) ..+.++++| .|+++....++++++++||++|++||.+++|. ++|++++|.+++++||+. .+++.+++++| T Consensus 89 ~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~~D~~-~~~~~~a~~~~ 167 (484) T protein:vir:77 89 ADEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPTNLYAQIDPR-TRQVMRAIRAI 167 (484) T ss_pred hHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEeccceeEEEecCC-CCceEEEEEEE Confidence 445555665 58899999999999999999999999998874 579999999999999976 57899999998 Q ss_pred eeecce---eEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCC-----CcCccHHH-HHHHH Q lcl|NC_021326. 152 KLENET---KVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISDIFM-YKTLI 222 (445) Q Consensus 152 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~g~s~~~~-v~~li 222 (445) ..+... ++.+|+...+.++.... ..+......+|+||+||||+|.|+. +|.|+|++ |++|+ T Consensus 168 ~~~~~~~~~~~~~y~~~~~~~~~~~~----------~~~~~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s~i~~~v~~L~ 237 (484) T protein:vir:77 168 EDEEGNEVIGATLYLPNNTVIWNRED----------GQWVQVANVAHNLEMVPVIPIPNRTRLSDLYGTTEITPELRSVT 237 (484) T ss_pred EeecCCcEEEEEEEecCeEEEEEecC----------CceEeeccccCCCCCcceEEeccccccCccCCcccchHHHHHHH Confidence 776543 34566666655554332 2233334568999999999999865 58999985 99999 Q ss_pred HHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhH----HHhhh-hCceeeccCCCceeeEeccCChHHHHHHHHHHHHH Q lcl|NC_021326. 223 DAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEF----KRLLR-YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQK 297 (445) Q Consensus 223 d~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~----~~~~~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~ 297 (445) |+||+++|++++..+++++|+++++|.+.+..... ...++ ..+.+...+++++++.+++ ..+.+++++.++.. T Consensus 238 Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~--~~~~e~~~~~l~~~ 315 (484) T protein:vir:77 238 DAAARTLMLMQATAELMGVPQRLLFGVKGEELGVDPETGQTLFDAYLARILAFEDHESKAQQFS--AAELRNFVDALDAL 315 (484) T ss_pred HHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhcccccccchhhhhhhhhhcccCCCCceeEeec--CCChHHHHHHHHHH Confidence 99999999999999999999999999865432211 11111 1223444456678876554 45567888888888 Q ss_pred HHHHhCccccccccccC----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC---CcceEEEEeCCC Q lcl|NC_021326. 298 IMLFGQAVDFSSDKFGS----APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG---EHKDVDISFNYN 370 (445) Q Consensus 298 i~~~s~~p~~~~~~~~~----~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~---~~~~i~v~f~~~ 370 (445) |+.++.+|++++..+++ ++||+||++++.+|.+|+.++++.|+.+|++++++++.+.+... +..+++++|+++ T Consensus 316 i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~ 395 (484) T protein:vir:77 316 DRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKVMNGGDIPPEYYRMESIWRDP 395 (484) T ss_pred HHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccccccceEEecCC Confidence 99888877776665553 37999999999999999999999999999999999999876543 456789999999 Q ss_pred CCCCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhc-ccc------CCCCCCCCCCCC Q lcl|NC_021326. 371 KVANTELQVQTAQQSM----GIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPN-LDD------GGADGAQQKERS 439 (445) Q Consensus 371 ~p~d~~~~~~~~~~~~----g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~-~~~------~~~~~~~~~~~~ 439 (445) .|+|.++.+++++|++ |++|++|+++++|+++++.+||+++++|+.+......+ ..+ +.++..++++.. T Consensus 396 ~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 475 (484) T protein:vir:77 396 STPTYAAKADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNPDNPETPEPQ 475 (484) T ss_pred CCCCHHHHHHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCCCCccccc Confidence 9999999999999974 58999999999999999889999998887654432221 111 112222233344 Q ss_pred CCcCCC Q lcl|NC_021326. 440 NDKQSE 445 (445) Q Consensus 440 ~d~~~~ 445 (445) ++.+++ T Consensus 476 ~~~~~~ 481 (484) T protein:vir:77 476 PNPAEE 481 (484) T ss_pred CCCccc Confidence 444444 No 54 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=100.00 E-value=7.8e-72 Score=410.42 Aligned_cols=424 Identities=11% Similarity=0.043 Sum_probs=318.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCc-h Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTD-D 79 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d-~ 79 (445) +|++|+++|..+.++++++.+||+|+|++.+.+...... ....++++++||+++||++.++|++|+|+++.+.+ . T Consensus 9 ~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~----~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~~d~ 84 (456) T protein:vir:79 9 WLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAA----WRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADS 84 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCChhhcCcccChh----hchhhhhhhcchHHHHHHHHHhhhccCCeecCCCCCc Confidence 999999999999999999999999999987654433221 12233467889999999999999999999987644 3 Q ss_pred HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeeccee Q lcl|NC_021326. 80 EVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETK 158 (445) Q Consensus 80 ~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~ 158 (445) +..+.++++|+ |+++....+++++++++|++|+++|.+++|.+++++++|.+++++||+....++.+++++|...+... T Consensus 85 ~~~~~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~d~~~ 164 (456) T protein:vir:79 85 DLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAES 164 (456) T ss_pred cHHHHHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEeccceeEEEEcCCCCCceEEEEEEEEecCCce Confidence 45566667764 78999999999999999999999999999999999999999999999988889999999997655433 Q ss_pred --EEEEecceEEEEEEe-----cceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 159 --VEYWDKITVNYYVYE-----NGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSD 231 (445) Q Consensus 159 --~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~ 231 (445) ..+|....+..+... .............+......+|.++.|||++|. |+.|.|+|+++++|||+||+++|+ T Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~-N~~~~gd~e~v~~liD~~~~~~s~ 243 (456) T protein:vir:79 165 DFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQ-NPDGMGEVEPHIDIINRINRAELQ 243 (456) T ss_pred eEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCceeEEEec-CCCCCchhhhhHHHHHHHHHHHHH Confidence 344444433332211 111111111122223334568999999999985 588999999999999999999999 Q ss_pred HHHHHHHhcCCeeEEecCCcccch-----hH---HHhh--hhCceeeccCCCceeeEec-cCChHHHHHHHHHHHHHHHH Q lcl|NC_021326. 232 LSNTFKDSNELTYVLTNYDDQELP-----EF---KRLL--RYYGAIKVSDNGGVDTIQV-EVPVENSKKYLDELYQKIML 300 (445) Q Consensus 232 ~~~~~~~~~~~~l~~~g~~~~~~~-----~~---~~~~--~~~~~~~~~~~~~~~~l~~-~~~~~~~~~~i~~l~~~i~~ 300 (445) +++.++++++|+++++|....... .. ...+ ....++..+++. ++.+. +.+.+.+.+.++.+..+|+. T Consensus 244 ~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~~~~~~~~~~~~~~~~--~~~q~~~~~~~~~~~~l~~~i~~i~~ 321 (456) T protein:vir:79 244 LLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASIFEAAPGALWELPPGV--DIWESQTNDFTPMLSAIKEHIRQLSS 321 (456) T ss_pred HHHHHHHHhhHHHHHhcCCcccccccccccccchhhhhhhhccccccCCCCc--ceeeecccChHHHHHHHHHHHHHHHh Confidence 999999999999999997543211 00 0111 122233344444 44333 24556666666666667777 Q ss_pred HhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHH Q lcl|NC_021326. 301 FGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQ 380 (445) Q Consensus 301 ~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~ 380 (445) .+++|...++.+++|+||+||++++.+|.+|++.+++.|+++|++++++++++.|.. +..+++++|+++.|+|.++.++ T Consensus 322 ~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~g~~-~~~~i~v~w~~~~~~s~~~~ad 400 (456) T protein:vir:79 322 ATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES-VEDTVDVSFESPDRVTLGEKYS 400 (456) T ss_pred hcCCChhHhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-ccccceEEeCCCCCcCHHHHHH Confidence 777787777777778999999999999999999999999999999999999998865 3467999999999999999999 Q ss_pred HHHHHh--ccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHhhhccccCCCCCCC Q lcl|NC_021326. 381 TAQQSM--GIVSHETVLENHPFVED--LQAELERIEQEQMEYNKQLPNLDDGGADGAQ 434 (445) Q Consensus 381 ~~~~~~--g~~s~et~l~~l~~~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 434 (445) +++|++ |++|.++++++++++++ .++|++|+++|.......+.+ .+..+++. T Consensus 401 a~~kl~~~G~~~~~~~~~~lg~~~~~i~~~e~~r~~~e~~~~~~~~~~--~~~~~~~~ 456 (456) T protein:vir:79 401 AASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQ--RPQEDGSR 456 (456) T ss_pred HHHHHHhcCCChHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhhhHhh--cCCCCCCC Confidence 999974 78999999999888654 346777777776543322221 22222222 No 55 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=100.00 E-value=3.7e-67 Score=384.81 Aligned_cols=423 Identities=13% Similarity=0.106 Sum_probs=301.9 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.+|+++|..++++++++.+||+|+|++.+.+...+.. .+ ++++++||+++||+++++++..++++... ++. T Consensus 25 ~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~p~~-----~~-~~~~v~n~~~~iVd~~a~rl~~~Gf~~~d-~~~ 97 (504) T protein:vir:99 25 KVNGLYQQLVDRTPRNLLRASFYDGKYAIRQIGNLIPPE-----YL-RTATVLGWSAKAVDTLARRCNLESFVWPD-GDY 97 (504) T ss_pred HHHHHHHHHHHHhHHHHHHHHHHhccccchhccccccHH-----HH-HHhhccCcHHHHHHHHHhhhccceeeCCC-CCh Confidence 899999999999999999999999999876654333222 12 44678999999999999999999987543 344 Q ss_pred HHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCCc--EEEEEEccceeEEEEcCCCCCceEEEEEEEeeecce Q lcl|NC_021326. 81 VIKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGE--FKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENET 157 (445) Q Consensus 81 ~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~--~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~ 157 (445) ....++++| .|+++....++++++++||++|++||.+++|+ +.|++++|.+++++||+. .+++.++++++..+... T Consensus 98 ~~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~sP~~~~~iyD~~-~~~~~~a~~~~~~d~~g 176 (504) T protein:vir:99 98 GSIGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVKSAMQATGEWNSR-RNAMDSLLSITSRDAEG 176 (504) T ss_pred hhHHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceeEEEEeCC-CCceeEEEEEEEecCCC Confidence 455566666 58899999999999999999999999998876 568999999999999975 46777778777554443 Q ss_pred ---eEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCC-----CCcCccHH-HHHHHHHHHHHH Q lcl|NC_021326. 158 ---KVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNN-----DLEISDIF-MYKTLIDAYNRR 228 (445) Q Consensus 158 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~-----~~g~s~~~-~v~~lid~~~~~ 228 (445) .+.+|+...++++..... +.+..+..+|++| ||||+|.|+ ++|.|++. ++++|+|++|++ T Consensus 177 ~~~~~~~y~~~~~~~~~~~~~----------~~~~~~~~~~~~g-vPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~~~~ 245 (504) T protein:vir:99 177 HPTGIALYEDGVTVTADMDDD----------GDWHADVRTHKLG-VPVEVLPYKPREDRPLGSSRITRPVMSLQQRALKG 245 (504) T ss_pred eEEEEEEEcCCcEEEEEEcCC----------ceeeeccccCCCC-cceEEecccccCccccCcccchhhHHHHHHHHHHH Confidence 245777777666554322 2233456789998 999999986 36899886 799999999999 Q ss_pred HHHHHHHHHHhcCCeeEEecCCcccchh----H--HHhhhhCceeeccCCC--------ceeeEeccC-ChHHHHHHHHH Q lcl|NC_021326. 229 LSDLSNTFKDSNELTYVLTNYDDQELPE----F--KRLLRYYGAIKVSDNG--------GVDTIQVEV-PVENSKKYLDE 293 (445) Q Consensus 229 ~s~~~~~~~~~~~~~l~~~g~~~~~~~~----~--~~~~~~~~~~~~~~~~--------~~~~l~~~~-~~~~~~~~i~~ 293 (445) ++++++..+++++|+++++|++.+...+ . .......+++.++++. ++++.+.+. +.+.+.+.++. T Consensus 246 ~~~~~~~~e~~a~p~r~i~G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~~~l~~~~~~l~~ 325 (504) T protein:vir:99 246 CIRMDGHADVYSFPQLILLGADAKNFRNKDGSMKPAWQIALARVFALPDDEDEPDAARARADVKQFPASSPQPHIEMLEQ 325 (504) T ss_pred HHHHHHHHHHhcchhhhhccCCccccccccccccchhhhhhhhhhcCCCccccccccCccceeeecCCCChHHHHHHHHH Confidence 9999999999999999999986543211 1 1122234455565443 355544432 33444444444 Q ss_pred HHHHHHHHhCccccccc--cccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----CCCcceEEEEe Q lcl|NC_021326. 294 LYQKIMLFGQAVDFSSD--KFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI----KGEHKDVDISF 367 (445) Q Consensus 294 l~~~i~~~s~~p~~~~~--~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~----~~~~~~i~v~f 367 (445) +..+|...|++|..+++ +..+++||+||++++.+|..|+.++++.|+.+|++++++++.+.+. ..+..++++.| T Consensus 326 ~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~~~~v~w 405 (504) T protein:vir:99 326 IAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEWKTIDSKF 405 (504) T ss_pred HHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceeEe Confidence 44445555677765554 2346899999999999999999999999999999999999988764 33457789999 Q ss_pred CCCCCCCHHHHHHHHHHHhcc-----CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHH--hhhccccC----CCCCCCCC Q lcl|NC_021326. 368 NYNKVANTELQVQTAQQSMGI-----VSHETVLENHPFVEDLQAELERIEQEQMEYNK--QLPNLDDG----GADGAQQK 436 (445) Q Consensus 368 ~~~~p~d~~~~~~~~~~~~g~-----~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~--~~~~~~~~----~~~~~~~~ 436 (445) +++.++|.++.+++++|+++. .+.+++++++++.+ +|++|+++|+++... ........ +..+..++ T Consensus 406 ~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~~---~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 482 (504) T protein:vir:99 406 RSPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLGLTP---QQAKRALAERRRASSVSIIEALNRRQQEAATAGEDQD 482 (504) T ss_pred cCCCccCHHHHHHHHHHHHhhccccccchHHHHhhcCCCH---HHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCCCC Confidence 999999999999999998642 23688999997743 566666655433211 11111110 01111111 Q ss_pred CCCCCcCC-C Q lcl|NC_021326. 437 ERSNDKQS-E 445 (445) Q Consensus 437 ~~~~d~~~-~ 445 (445) +.+.++.. + T Consensus 483 ~~~~e~a~~~ 492 (504) T protein:vir:99 483 QGAGEPPANE 492 (504) T ss_pred cCCCCCCCCC Confidence 11111111 1 No 56 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=100.00 E-value=1.1e-65 Score=376.78 Aligned_cols=400 Identities=11% Similarity=0.036 Sum_probs=286.4 Q ss_pred cccccccccccccccccccc-cccchHHHHHHHHHhhhhccCeeeccCchHHHHHHHHHhc-cCHHHHHHHHHHHHHhcC Q lcl|NC_021326. 31 KEPKPVDATGAVDPLKPDDR-MITNFHANLVDQKVSYIVGKPIAFKHTDDEVIKRIDEVLG-NRFDDKLHSVLTGASNKG 108 (445) Q Consensus 31 ~~~~~~~~~~~~~~~~~~~r-i~~n~~~~iv~~~~~~l~g~~~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G 108 (445) +.+.... ...+..+| +++|||++||++.++++.+++++ ++|.+.++.++++|+ |+++....++++++++|| T Consensus 1 ~l~~~~~-----~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~--~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G 73 (434) T protein:vir:98 1 MLPKNAE-----QAFLDFQRKARTNFCGLIANASVHRLLALGVT--GPDGEPDTRASRWWQANRLDSRQKLVWRMAMAQS 73 (434) T ss_pred CCCCCcc-----HHHHHhhhhhhccchHHHHHHHHhhhccCcee--cCCCchHHHHHHHHHhcChhHHHHHHHHHHhhcC Confidence 1122211 22223333 57899999999999999888765 556666777777774 889999999999999999 Q ss_pred eEEEEEEECCCC-------cEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecce--eEEEEecceEEEEEE--eccee Q lcl|NC_021326. 109 IEWLHPYLDEEG-------EFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENET--KVEYWDKITVNYYVY--ENGSL 177 (445) Q Consensus 109 ~~~~~v~~d~~g-------~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~--~~~~~~~~~~~~~~~--~~~~~ 177 (445) ++|++||.++++ .+.|++++|.+++++||+.. +++.+++++|..+... ...+|.......+.. ..... T Consensus 74 ~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~-~~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (434) T protein:vir:98 74 AGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPET-GEPLVGLKVWHNDIDGFGYARVFFDDTSFPYRTRERTGAR 152 (434) T ss_pred ceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCC-CceEEEEEEEEeccCCceEEEEEEeCcEEEEEEeeccccc Confidence 999999987654 46799999999999999764 5799999988765443 233443333332222 11111 Q ss_pred eeccc--ccccccccccccccccccceEEecCC----CCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCc Q lcl|NC_021326. 178 IPDYS--NNLENSKTHFSTGSWGKIPFIPFKNN----DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDD 251 (445) Q Consensus 178 ~~~~~--~~~~~~~~~~~~~~~g~iPvv~~~n~----~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~ 251 (445) ..... ...........+|+||+||||+|.|+ .+|.|+|++++++||+||+++|++++..+++++|+++++|.+. T Consensus 153 ~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~ 232 (434) T protein:vir:98 153 LPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGEDPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHKF 232 (434) T ss_pred cccccccceecccccccccCCCCccceEEeccCCCcCcCCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCc Confidence 11111 11223344567899999999999998 5789999999999999999999999999999999999999876 Q ss_pred ccchhHHH--------hhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccc---cccccCcchHHH Q lcl|NC_021326. 252 QELPEFKR--------LLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFS---SDKFGSAPSGVA 320 (445) Q Consensus 252 ~~~~~~~~--------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~---~~~~~~~~Sg~A 320 (445) +...+... .....+++.+.+++++++.+.+ ....+++++.++..|+.++..++++ ++...+|+||+| T Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~--~~~~~~~~~~l~~~i~~~~~~~~~p~~~~~~~~~n~Sg~A 310 (434) T protein:vir:98 233 AKRTDPATGMTVVDQPFVPSPSAVWASEGENTQFGQLD--ATDLSGFLKEHASDVRDMLTISQTPTYLYATDLVNISADT 310 (434) T ss_pred ccccccccccchhhhhhhccccccccCCCCCceEEEec--CcchHHHHHHHHHHHHHHhcccCCCHHHhccccCChHHHH Confidence 54332211 1123334556667788886654 3345556666666666665555444 443346899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHHhcc-CChHHHHHhCC Q lcl|NC_021326. 321 LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGI-VSHETVLENHP 399 (445) Q Consensus 321 i~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~-~s~et~l~~l~ 399 (445) |++++.+|..|+.++++.|+.+|++++++++.+.|...+..+++++|+++.|+|.++.+++++|+.|+ +|.+++++++| T Consensus 311 l~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~~~~~~~~~v~w~~~~~~s~~~~ada~~kl~~~g~~~e~~~~~lg 390 (434) T protein:vir:98 311 IGALDILHVAKVREHIASFSEGLESVLALAAAQAGVPEDYTEAEVRWANPAHVTMAVKADAATKLKSIGYPLDVIAEELD 390 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCChhheeeeEEecCCCCCCHHHHHHHHHHHHhcCCcHHHHHHhCC Confidence 99999999999999999999999999999999999888888999999999999999999999999876 89999999998 Q ss_pred CCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 400 FVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 400 ~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) +.+ +|++|+++|.++............ +....+..+++++- T Consensus 391 ~~~---~e~~r~~~e~~~~~~~~~~~~~~~--~~~~~g~~~~~~~~ 431 (434) T protein:vir:98 391 ESP---ARVRRIVAGAASQALLAASLLPAP--GAPSAGNVPDSGGA 431 (434) T ss_pred CCH---HHHHHHHHHHHHHHHHHHhhhccC--CCCCCCCCCcccCC Confidence 853 688888877655333221111100 00111111111111 No 57 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=100.00 E-value=1.2e-63 Score=365.49 Aligned_cols=390 Identities=13% Similarity=0.034 Sum_probs=297.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) .|.+|++++..+.+++.++.+||+|+|++.+.+..... ..+..+++++|||+++|+++++++.-++++ ++|.. T Consensus 5 ~i~~L~~~~~~~~~r~~~~~~yy~g~~~~~~~~~~~p~-----~~~~~~~~v~nw~~~~Vd~~a~rl~~~Gf~--~~d~~ 77 (422) T protein:vir:97 5 GMGYLRRKLALFKTGVDKRYRYYAMDDRDDTRSIVMPN-----NVREMYRSVLEWTAKGVDSLADRIIFREFT--NDDFN 77 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCChhhcCccccH-----HHHHHHHhhcchhHHHHHHHHhccccceee--CCchh Confidence 89999999999999999999999999988665544322 233445677899999999999998777764 45543 Q ss_pred HHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECC-CCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeE Q lcl|NC_021326. 81 VIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDE-EGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKV 159 (445) Q Consensus 81 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~-~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~ 159 (445) .++.|..|+++....++++++++||+||++|+.++ +|.++|++++|.+++++||+. .+++.++++.|..+..... T Consensus 78 ---l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p~i~~~sp~~~~~i~D~~-~~~~~~a~~~~~~~~~~~~ 153 (422) T protein:vir:97 78 ---AWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAEDGLPKMQVIEASKATGILDPT-TFLLTEGYAILESDSNGNP 153 (422) T ss_pred ---HHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCCCCeeEEEEechhhEEEEEeCC-CCcceeeEEEEEecCCCcE Confidence 34444469999999999999999999999999986 688999999999999999876 4567777777765544332 Q ss_pred ---EEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCC-----CcCccH-HHHHHHHHHHHHHHH Q lcl|NC_021326. 160 ---EYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISDI-FMYKTLIDAYNRRLS 230 (445) Q Consensus 160 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~g~s~~-~~v~~lid~~~~~~s 230 (445) .+++...++++.. .+.....+|++|+||+|+|.|++ +|.|++ +++++|+|++|++++ T Consensus 154 ~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~ 219 (422) T protein:vir:97 154 TLEAYFTDKDIWYYPK--------------KGKPYNIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTLE 219 (422) T ss_pred EEEEEEcCceEEEEcC--------------CCccccccCCCCCcceEEecccCCCccccCccccchhHHHHHHHHHHHHH Confidence 2333333322221 11122358999999999999864 689998 689999999999999 Q ss_pred HHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeeccCCC---ceeeEeccCChHHHHHHHHHHHHHHHHH---hCc Q lcl|NC_021326. 231 DLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNG---GVDTIQVEVPVENSKKYLDELYQKIMLF---GQA 304 (445) Q Consensus 231 ~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~l~~~~~~~~~~~~i~~l~~~i~~~---s~~ 304 (445) ++....+++++|+++++|++.+............+++.++++. ++++.+ .+..+..+|++.++..++.+ |++ T Consensus 220 ~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~de~~~~~~v~q--~~~~~l~~~~~~l~~~~~~~a~~s~l 297 (422) T protein:vir:97 220 RAEVTAEFYSFPQKYVLGMDPDAKPMEKWRATVSTLLEISKDEDGDKPTVGQ--FTTASMAPFMEHLKMYASLFAGGSGL 297 (422) T ss_pred HHHHHHHHhcchhhhhcccCcccccCchhhhhhhhhhccCCCCCCCcceeee--cCCCChhHHHHHHHHHHHHHhcccCC Confidence 9999999999999999999765543333333344666776543 356544 34444455555555555555 566 Q ss_pred cccccccccC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC----CcceEEEEeCCCCCCC---HH Q lcl|NC_021326. 305 VDFSSDKFGS-APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG----EHKDVDISFNYNKVAN---TE 376 (445) Q Consensus 305 p~~~~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~----~~~~i~v~f~~~~p~d---~~ 376 (445) |..+++..+. ++||+||++++.+|..|+.++++.|+.+|++++++++.+.+... ...++++.|+++.|.+ .+ T Consensus 298 P~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~~~w~p~~~~~~~s~a 377 (422) T protein:vir:97 298 TLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPYLRNQFMDTVIKWEPLFEADANMLT 377 (422) T ss_pred CHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccchhhccceEEEccCCCCChHHHH Confidence 6555554443 47999999999999999999999999999999999999877543 4557899999888887 67 Q ss_pred HHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_021326. 377 LQVQTAQQSM----GIVSHETVLENHPFVEDLQAELERIEQEQMEY 418 (445) Q Consensus 377 ~~~~~~~~~~----g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~ 418 (445) +.+++++|+. |+++.+++++++|+ ++++.|..|+++++.+- T Consensus 378 ~~aDa~~Kl~~a~~~~~~~~~~~~~lg~-~~~~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 378 LVGDGAIKLNQAIPGFMDADVIRDLTGV-KGADKPIPAITEVTTDG 422 (422) T ss_pred HHHHHHHHHHhhccccccHHHHHHHcCC-CchhHHHHHHHhhhccC Confidence 7788888874 68999999999988 67888999998886543 No 58 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=100.00 E-value=3.7e-62 Score=357.40 Aligned_cols=377 Identities=12% Similarity=0.010 Sum_probs=288.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.+|++++..+.+++.++.+||+|+|++.+.....+ ...+.++|+++|||++||+++++++.-++++ ++|.. T Consensus 5 ~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p-----~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~--~~d~~ 77 (409) T protein:vir:94 5 GIGYLRFKLSVHKRRAEMRYDQYAMKYVDRFKGITIP-----QALSQQYRSILGWCAKGVDSLADRLVFREFE--NDDFT 77 (409) T ss_pred HHHHHHHHHHHHhHHHHHHHHHhcccCchhhcChhhh-----HHHHHHHhhhcchhHHHHHHhHhhcccCccc--CCchH Confidence 9999999999999999999999999998765443322 2233456788899999999999998777754 55543 Q ss_pred HHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeeccee- Q lcl|NC_021326. 81 VIKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETK- 158 (445) Q Consensus 81 ~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~- 158 (445) ++++| .|+++....++++++++||++|++|+.+++|+++|++++|.+++.+||+. .+++.++++++..+...+ T Consensus 78 ----l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~~i~~~sp~~~~~i~D~~-~~~~~~a~~~~~~d~~~~~ 152 (409) T protein:vir:94 78 ----VNEIFEENNPDIFFDSAVLSSLIASCSFTYISKGENDAVRLQVIEAVNATGIIDPI-TGLLTEGYAVLERDENNNV 152 (409) T ss_pred ----HHHHHHhcChhHHHHHHHHHHHHhcceeEEEecCCCCceEEEEeccceEEEEEecC-CCceeeeEEEEEecCCCce Confidence 34555 58899999999999999999999999999999999999999999999885 577999999887654443 Q ss_pred --EEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCC-----CcCccH-HHHHHHHHHHHHHHH Q lcl|NC_021326. 159 --VEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISDI-FMYKTLIDAYNRRLS 230 (445) Q Consensus 159 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~g~s~~-~~v~~lid~~~~~~s 230 (445) ..+|....+..+...... ....+|++|+||+|+|.|++ +|.|++ +++++|+|++|++++ T Consensus 153 ~~~~~~~~~~~~~~~~~~~~-------------~~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~ 219 (409) T protein:vir:94 153 VLEAHFLPDRTDYYYRDSRN-------------NISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLE 219 (409) T ss_pred EEEEEEecCcEEEEEecCce-------------eEeeeCCCCCcceEEeccccccccccCccccchhHHHHHHHHHHHHH Confidence 235555555444322211 12357999999999999864 689998 579999999999999 Q ss_pred HHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeeccCC---CceeeEeccCChHHHHHHHHHHHHHHHH---HhCc Q lcl|NC_021326. 231 DLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDN---GGVDTIQVEVPVENSKKYLDELYQKIML---FGQA 304 (445) Q Consensus 231 ~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~l~~~~~~~~~~~~i~~l~~~i~~---~s~~ 304 (445) ++....+++++|+++++|++.+............+++.++++ .++++.+.+ ..+.++|++.++..++. .|++ T Consensus 220 ~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~d~dg~~~~v~q~~--~~~l~~~~~~l~~~~~~~a~~t~l 297 (409) T protein:vir:94 220 RADVTAEFYSFPQKYVTGLSDDAEPMETWKATVSSMLQFTKDEDGDKPTLGQFT--QPSMSPFTEQLRTAAAGFAGETGL 297 (409) T ss_pred HHHHHHHHhcChhheeEecCCCCcccchhhhhHHHhhcCCCCCCCCCceEEecC--CCChhHHHHHHHHHHHHHhhhcCC Confidence 999999999999999999876543332222333456666543 446665543 33344555555555555 4556 Q ss_pred ccccccccc-CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC----CCcceEEEEeCCCCCCC---HH Q lcl|NC_021326. 305 VDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK----GEHKDVDISFNYNKVAN---TE 376 (445) Q Consensus 305 p~~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~----~~~~~i~v~f~~~~p~d---~~ 376 (445) |..+++..+ +++||+||++++.+|..++.++++.|+.+|++++++++.+.+.. .+..++++.|.|..|.+ .+ T Consensus 298 P~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~~~~~~~~a 377 (409) T protein:vir:94 298 TLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPYLREQFRKTKPKWEPLFEADASMLS 377 (409) T ss_pred CHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccccccceEEeccCCCcchHHHH Confidence 655554333 35899999999999999999999999999999999999987753 34567999999777666 56 Q ss_pred HHHHHHHHHh--c--cCChHHHHHhCCCCCCHH Q lcl|NC_021326. 377 LQVQTAQQSM--G--IVSHETVLENHPFVEDLQ 405 (445) Q Consensus 377 ~~~~~~~~~~--g--~~s~et~l~~l~~~~d~~ 405 (445) +.|++++|++ | +.+.++++.++|+.++ + T Consensus 378 ~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~-d 409 (409) T protein:vir:94 378 LIGDGAIKLNQAIPEFINKDTIRDLTGIEGG-E 409 (409) T ss_pred HHHHHHHHHHHhcccccchhHHHHHcCCCCC-C Confidence 7788899985 3 5678999999998653 2 No 59 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=100.00 E-value=3.3e-62 Score=357.64 Aligned_cols=384 Identities=11% Similarity=0.029 Sum_probs=289.2 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchHHHHHH Q lcl|NC_021326. 6 IKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVIKRI 85 (445) Q Consensus 6 i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~~~~~l 85 (445) ++.| .+|+.++.+||+|+|++.+.+...+. ..+.++|+++|||+++|+++++++.-++++ .+|.. + T Consensus 1 l~~~---~~r~~~~~~yY~g~~~~~~~~~~~p~-----~~~~~~~~v~nw~~~~Vds~a~rl~~~Gf~--~~d~~----l 66 (410) T protein:vir:95 1 MNLY---QSRVNLRYKHYAMQHYEAPTGITIPA-----HIRAKYQAVLGWAAKGVDSLADRLIFRAFA--NDDFN----V 66 (410) T ss_pred CCcc---hhhHHHHHHHhcCCCCccccchhccH-----HHHhHHHhhcchhHHHHHHhHhhhcccccc--CCCch----H Confidence 4444 56788889999999988655443322 234556778899999999999999888764 44443 3 Q ss_pred HHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecce---eEEE Q lcl|NC_021326. 86 DEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENET---KVEY 161 (445) Q Consensus 86 ~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~---~~~~ 161 (445) +++| .|+++....++++++++||+||++|+.+++|.++|++++|.+++++||+ ..+++.++++++..++.. ...+ T Consensus 67 ~~i~~~N~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~~i~~~sP~~~~~i~Dp-~~~~~~~al~~~~~~~~~~~~~~~~ 145 (410) T protein:vir:95 67 TEIFDRNNPDIFFDSAILSALIGSCSFVYISKGEDDEVRLQVIESSNATGVIDP-ITGLLVEGYAVLARDDYNRPTLEAY 145 (410) T ss_pred HHHHhhcChHHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEEeC-CCCceEEEEEEEEecCCCeEEEEEE Confidence 4444 5899999999999999999999999999999999999999999999998 467899999888765543 3456 Q ss_pred EecceEEEEEEecceeeecccccccccccccccccccccceEEecCCC-----CcCccH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 162 WDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISDI-FMYKTLIDAYNRRLSDLSNT 235 (445) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~g~s~~-~~v~~lid~~~~~~s~~~~~ 235 (445) |++..+.++..... ....+|++|.||+|+|.|++ +|.|++ +++++++|++|++++++... T Consensus 146 ~~~~~~~~~~~~~~--------------~~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~ 211 (410) T protein:vir:95 146 FEPNATHFIPKDGE--------------PYSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADIT 211 (410) T ss_pred EeCCcEEEEeeCCc--------------cccccCCCCCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHH Confidence 66666655543211 12357999999999999753 589988 57999999999999999999 Q ss_pred HHHhcCCeeEEecCCcccchhHHHhhhhCceeeccCCC---ceeeEecc-CChHHHHHHHHHHHHHHHHHhCcccccccc Q lcl|NC_021326. 236 FKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNG---GVDTIQVE-VPVENSKKYLDELYQKIMLFGQAVDFSSDK 311 (445) Q Consensus 236 ~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~l~~~-~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~ 311 (445) .+++++|+++++|++.+............+++.++++. .+++.+.+ .+.+.+.+.++.+..++...|++|...++. T Consensus 212 ~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~~~~~~~~~v~q~~~~~l~~~~~~l~~l~~~~a~~s~lP~~~lg~ 291 (410) T protein:vir:95 212 AEFYSWPQKYILGLDPDAEPMEKWKATVSSLLTISSSDKGVKPSVGQFTTASMSPFTEQLRTAAAGFAGEMGLTLDDLGF 291 (410) T ss_pred HHHhcchhheeeccCCCCCcCchhhhhhhhheeccCCCCCCcceEEecCCCChHHHHHHHHHHHHHHhhhcCCCHHHhcc Confidence 99999999999999765544433334445677776543 35665544 244445455555555555556677665554 Q ss_pred ccC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC----CCcceEEEEeCC---CCCCCHHHHHHHHH Q lcl|NC_021326. 312 FGS-APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK----GEHKDVDISFNY---NKVANTELQVQTAQ 383 (445) Q Consensus 312 ~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~----~~~~~i~v~f~~---~~p~d~~~~~~~~~ 383 (445) .+. ++||+||++++.+|..|+.++++.|+.+|++++++++.+.+.. .+..++++.|.+ +...+.++.+++++ T Consensus 292 ~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~d~~~~s~a~~aDa~~ 371 (410) T protein:vir:95 292 VSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVRTAVKWEPLFEADANTMTMIGDGVV 371 (410) T ss_pred ccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccceeeEEeeecCCcchhhHHHHHHHHH Confidence 333 5899999999999999999999999999999999999987643 345678999984 45568899999998 Q ss_pred HHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHh Q lcl|NC_021326. 384 QSM----GIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQ 421 (445) Q Consensus 384 ~~~----g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~ 421 (445) |+. |+++.++++++||++++ +..|++.|...+..+ T Consensus 372 Kl~~a~~g~~~~~~~~~~lg~~~~---~~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 372 KLNQALPGYINAETIRDLTGIAGD---MSAKPVVSEGGSNGE 410 (410) T ss_pred HHHHhccCCccHHHHHHhcCCChH---HHHHHHHHHHHhCCC Confidence 873 68999999999999653 333444333332222 No 60 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=100.00 E-value=1.5e-61 Score=354.08 Aligned_cols=415 Identities=14% Similarity=0.109 Sum_probs=300.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.+|++++..+.+++.++.+||+|+|++.+.+...+... + +.++++|||+++|+++++++..++++...+++ T Consensus 19 ~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~-----r-~~~~v~nw~~~~Vd~~a~rl~~~Gf~~~d~~~- 91 (474) T protein:vir:81 19 LINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQY-----F-NLGLVLGWTGKAVDALARRCNLEGFVWPDGDL- 91 (474) T ss_pred HHHHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHH-----H-HHHhhcChHHHHHHHHHhhhcccceECCCCCc- Confidence 8999999999999999999999999998766554433222 2 33578999999999999999999987543333 Q ss_pred HHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCCc--EEEEEEccceeEEEEcCCCCCceEEEEEEEeeecce Q lcl|NC_021326. 81 VIKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGE--FKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENET 157 (445) Q Consensus 81 ~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~--~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~ 157 (445) ....++++| .|+++....++++++++||++|++|+.+++|. ++|++++|.+++++||+.. +++.+++..+..+... T Consensus 92 ~~~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~~~~~~D~~~-~~~~~al~~~~~~~~g 170 (474) T protein:vir:81 92 DSLGGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASEATGEWNRRR-RGLNNLLSIIDKDKEG 170 (474) T ss_pred cchHHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceEEEEEeCCC-CcceeeeEEEEEcCCC Confidence 333445555 69999999999999999999999999977664 7899999999999998864 5677777666554443 Q ss_pred e---EEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCC-----CcCccH-HHHHHHHHHHHHH Q lcl|NC_021326. 158 K---VEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISDI-FMYKTLIDAYNRR 228 (445) Q Consensus 158 ~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~g~s~~-~~v~~lid~~~~~ 228 (445) + ..+|.+..++++..... ...+..+..+|++| ||+|+|.|++ +|+|++ +++++|+|++|++ T Consensus 171 ~~~~~~ly~~~~~~~~~~~~~---------~~~w~~~~~~~~~g-vPvV~~~n~~~~~~~~G~s~i~e~v~~l~da~~r~ 240 (474) T protein:vir:81 171 KVLSLALYLDNETVTAQRDKA---------TLKWQVDRDEHVYG-VPAQVLPYKPAPKRPFGQSRITKPMMGLQDAGVRE 240 (474) T ss_pred cEEEEEEEeCCcEEEEEEcCc---------cceeeeccCCCCCC-cceEEecccccccCcCCccccchhHHHHHHHHHHH Confidence 2 34666666655443221 22234456789998 7999999864 689988 5899999999999 Q ss_pred HHHHHHHHHHhcCCeeEEecCCcccchh----HH--HhhhhCceeeccCCCceee------EeccCChHHHHHHHHHHHH Q lcl|NC_021326. 229 LSDLSNTFKDSNELTYVLTNYDDQELPE----FK--RLLRYYGAIKVSDNGGVDT------IQVEVPVENSKKYLDELYQ 296 (445) Q Consensus 229 ~s~~~~~~~~~~~~~l~~~g~~~~~~~~----~~--~~~~~~~~~~~~~~~~~~~------l~~~~~~~~~~~~i~~l~~ 296 (445) ++++....+++++|+++++|++.+...+ .. ......+++.++++.+.+. ..++.+..++++|++.++. T Consensus 241 ~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~~a~l~~~~~~l~~ 320 (474) T protein:vir:81 241 LARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFPAASPDAHWSDING 320 (474) T ss_pred HHHHHHHHHHhcchhheeecCChhhcccccccccchhhhhHHHHhcCCCcccccccccccccccccCCCChhHHHHHHHH Confidence 9999999999999999999987643221 11 1112233455555543221 1244444455555555555 Q ss_pred HHH---HHhCccccccc--cccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC------CCcceEEE Q lcl|NC_021326. 297 KIM---LFGQAVDFSSD--KFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK------GEHKDVDI 365 (445) Q Consensus 297 ~i~---~~s~~p~~~~~--~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~------~~~~~i~v 365 (445) .+. ..|++|..+++ ++.+++||+||++.+.+|..|+.++++.|+.+|++++++++.+.+.. .+..++++ T Consensus 321 ~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v 400 (474) T protein:vir:81 321 LAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDEIPDEWKSIDA 400 (474) T ss_pred HHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccchhhcccee Confidence 544 45677765554 34567999999999999999999999999999999999999998743 23568899 Q ss_pred EeCCCCCCCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHH--HHHhhhccccCCCCCCCCC Q lcl|NC_021326. 366 SFNYNKVANTELQVQTAQQSM----GIVSHETVLENHPFVEDLQAELERIEQEQME--YNKQLPNLDDGGADGAQQK 436 (445) Q Consensus 366 ~f~~~~p~d~~~~~~~~~~~~----g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~--~~~~~~~~~~~~~~~~~~~ 436 (445) .|.++..++.++.+++++|+. |+.+.+++++++++. +++++|+++++.+ ....+......+..++..+ T Consensus 401 ~W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t---~~~i~~~~~~~~~~~~~~~~~~l~~~~~~~~~aq 474 (474) T protein:vir:81 401 KWRDPRYLSKSAQADAGMKQLAAVPWLAETEVGLELIGLT---PQQARRAMADKRRVQGRGTLQALIDRSNNGATAQ 474 (474) T ss_pred EecCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcCCC---HHHHHHHHHHHHHHhHHHHHHHHHhcCCCCCCCC Confidence 999999999999999999975 456678888888775 3566666655432 2222222222221211111 No 61 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=100.00 E-value=3.1e-60 Score=346.85 Aligned_cols=428 Identities=11% Similarity=0.086 Sum_probs=300.5 Q ss_pred ChHHHHHH-----HHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeec Q lcl|NC_021326. 1 MIVRYIKQ-----HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFK 75 (445) Q Consensus 1 ~l~~~i~~-----~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~ 75 (445) -|++++.+ +.+++.++.++++||+|+|+++.++.... ....+.++++++|||+.||++.++|++|+|++++ T Consensus 21 ~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~----~~~~~~~~~~~~n~~k~i~~~~a~~l~~~p~~i~ 96 (496) T protein:vir:38 21 ALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEH----NGNPVNRRQLSMNLPKVTAKYMSKLLFNEKVKIN 96 (496) T ss_pred hhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhcc----CCCccccceeecchHHHHHHHHhhhhhCCcceEe Confidence 23333322 33566789999999999998876543322 1223345678899999999999999999999999 Q ss_pred cCchHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeee Q lcl|NC_021326. 76 HTDDEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLE 154 (445) Q Consensus 76 ~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~ 154 (445) ++++..++.|+++++ |+|...+.+++..++++|.+|+.+|.|++|++++.+++|.+++|+|++...-...++++.+..+ T Consensus 97 ~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~~~~~~P~~~~~~~~~~~~f~~~~~~~ 176 (496) T protein:vir:38 97 IDDKAAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFATADCMYPLSNDSENVDECVIANSFHKN 176 (496) T ss_pred eCChHHHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcEEEEEEcccceEEEEecCCcEEEEEEEEEEEeC Confidence 999999999999986 6799999999999999999999999999999999999999999998875322224445444443 Q ss_pred cce--eEEEEe----cceEEE--EEEecceeeec---ccccccccccccccccccccceEEecCC---------CCcCcc Q lcl|NC_021326. 155 NET--KVEYWD----KITVNY--YVYENGSLIPD---YSNNLENSKTHFSTGSWGKIPFIPFKNN---------DLEISD 214 (445) Q Consensus 155 ~~~--~~~~~~----~~~~~~--~~~~~~~~~~~---~~~~~~~~~~~~~~~~~g~iPvv~~~n~---------~~g~s~ 214 (445) +.. .+++|+ .+.+.+ |.......... .....+...+....+++.++||++|+++ +.|.|+ T Consensus 177 ~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd 256 (496) T protein:vir:38 177 NKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISV 256 (496) T ss_pred CeEEEEEEEEEEeCceEEEEEEEEecCCccccCccccccccccccccceeecCCCcceEEEecCCcccccccCCcCCCch Confidence 322 133332 112222 21111111100 0011122223344567788899988653 458999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEe-------cCCcccchhHHHhhhhCceeeccCCC---ceeeEeccCCh Q lcl|NC_021326. 215 IFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLT-------NYDDQELPEFKRLLRYYGAIKVSDNG---GVDTIQVEVPV 284 (445) Q Consensus 215 ~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~-------g~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~l~~~~~~ 284 (445) |+++++++|+||.++|++++.++....++.+-. +..+.....+....+...+.....++ .++.++.++.. T Consensus 257 ~~~~~~lid~ld~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~ 336 (496) T protein:vir:38 257 YANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTTQYFDSTDEAFFLYQGDQDDNGKAIKDISVEIRS 336 (496) T ss_pred HhhHHHHHHHHHHHHHHHHHHHhhcccceecchHHhhccCCCCCccccCCCCccceEEEeecCCCcccccceeeccccCH Confidence 999999999999999999999998776666621 11222112222222222233322222 34555567788 Q ss_pred HHHHHHHHHHHHHHHHHhCccccccc-cccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-------c Q lcl|NC_021326. 285 ENSKKYLDELYQKIMLFGQAVDFSSD-KFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD-------I 356 (445) Q Consensus 285 ~~~~~~i~~l~~~i~~~s~~p~~~~~-~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~-------~ 356 (445) +++...++.+.+.|...++++...++ ..+++.||+|++++++.|.+++..+++.|+.+|++++++++.+.. . T Consensus 337 e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~ 416 (496) T protein:vir:38 337 TEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGE 416 (496) T ss_pred HHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 88999999988888888877654333 334677899999999999999999999999999999998876532 3 Q ss_pred CCCcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHH--HHHHHHHHHHHHHHHhhhccccCCCCC Q lcl|NC_021326. 357 KGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQ--AELERIEQEQMEYNKQLPNLDDGGADG 432 (445) Q Consensus 357 ~~~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~--~E~~ri~~E~~~~~~~~~~~~~~~~~~ 432 (445) ..+..+++++|++++|.|..+.+++++++ +|++|++|+++.+++++|.+ +|++|+++|+++... ..+.++.+ T Consensus 417 ~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS~et~l~~~~~~~d~ea~~el~ri~~E~~~~~~----~~d~~~~~ 492 (496) T protein:vir:38 417 VVELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIPLKIALQRAWNITEAEADEWAEMLAKEKQAEMP----NNDMNGIF 492 (496) T ss_pred CCCccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCChHHHHHHHHHHHHhhhccCc----cccccCCC Confidence 34556789999999999999999999987 59999999999999998755 588899888754321 11111111 Q ss_pred CCCC Q lcl|NC_021326. 433 AQQK 436 (445) Q Consensus 433 ~~~~ 436 (445) ++++ T Consensus 493 ~~~e 496 (496) T protein:vir:38 493 GEEE 496 (496) T ss_pred CCCC Confidence 1111 No 62 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=100.00 E-value=6e-61 Score=350.72 Aligned_cols=377 Identities=13% Similarity=0.020 Sum_probs=286.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|.+|++++..+.+++.++.+||+|+|++.+...... ...+.++|+++|||++||+++++++.-++++ ++|.. T Consensus 5 ~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p-----~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~--~~d~~ 77 (409) T protein:vir:16 5 GIGYLRFKLSVHKRRAEMRYEQYAMKHVDRFKGITIP-----QALSQQYRSILGWCAKGVDSLADRLVFREFE--NDDFT 77 (409) T ss_pred HHHHHHHHHHHHhHHHHHHHHHHhccCchhhcchhhh-----HHHHHHHhhhcChhHHHHHHhHhhccccccc--CcchH Confidence 9999999999999999999999999998765443322 2233455678899999999999998777754 45543 Q ss_pred HHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeE Q lcl|NC_021326. 81 VIKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKV 159 (445) Q Consensus 81 ~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~ 159 (445) ++++| .|+++....++++++++||++|++|+.+++|+++|++++|.+++++||+. .+++.+++++|..+...+. T Consensus 78 ----l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~~i~~~sP~~~~~i~D~~-~~~~~~a~~~~~~d~~~~~ 152 (409) T protein:vir:16 78 ----VNEIFEENNPDIFFDSTVLSALIASCSFTYISKGENDAVRLQVIEATNATGIIDPI-TGLLTEGYAVLERDENNNV 152 (409) T ss_pred ----HHHHHHhcChhHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEeecc-cccceeeeEEEEecCCCce Confidence 34455 58999999999999999999999999999999999999999999999875 6788888888876544432 Q ss_pred ---EEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCC-----CcCccH-HHHHHHHHHHHHHHH Q lcl|NC_021326. 160 ---EYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISDI-FMYKTLIDAYNRRLS 230 (445) Q Consensus 160 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~g~s~~-~~v~~lid~~~~~~s 230 (445) .+|..+.+..+..... .....+|++|+||+|+|.|++ +|.|++ +++++++|++|++++ T Consensus 153 ~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~~ 219 (409) T protein:vir:16 153 VLEAHFLPDRTDYYYRDSR-------------NNISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLE 219 (409) T ss_pred EEEEEEecCcEEEEEecCc-------------cccceecCCCCcceEEecccccccccCCccccchhHHHHHHHHHHHHH Confidence 3444444433332111 112357999999999999864 689998 579999999999999 Q ss_pred HHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeeccCC---CceeeEeccCChHHHHHHHHHHHH---HHHHHhCc Q lcl|NC_021326. 231 DLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDN---GGVDTIQVEVPVENSKKYLDELYQ---KIMLFGQA 304 (445) Q Consensus 231 ~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~l~~~~~~~~~~~~i~~l~~---~i~~~s~~ 304 (445) ++....+++++|+++++|++.+............+++.++++ .++++.+. +..++++|++.++. +++..|++ T Consensus 220 ~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~d~~g~~~~v~q~--~~~~l~~~~~~l~~~~~~~a~~s~l 297 (409) T protein:vir:16 220 RADVTAEFYSFPQKYVTGLSDDAEPMETWKATVSSMLQFTKDEDGDKPTLGQF--TQPSMSPFTEQLRTAAAGFAGETGL 297 (409) T ss_pred HHHHHHHHhcChhheeEecCCCCCccchhhhhhhHhhccCCCCCCCCceEEec--CCCChhHHHHHHHHHHHHHhhhcCC Confidence 999999999999999999976543332223334556666543 34565443 44444455555555 45555566 Q ss_pred cccccccccC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC----CcceEEEEeCCCCCCC---HH Q lcl|NC_021326. 305 VDFSSDKFGS-APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG----EHKDVDISFNYNKVAN---TE 376 (445) Q Consensus 305 p~~~~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~----~~~~i~v~f~~~~p~d---~~ 376 (445) |..+++..+. ++||+||++++.+|..|+.++++.|+.+|++++++++.+.+... ...++++.|.++.+.+ .+ T Consensus 298 P~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~v~W~~~~~~~~~s~a 377 (409) T protein:vir:16 298 TLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYLREQFSKTKPKWEPLFEADASMLS 377 (409) T ss_pred CHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchhhccceEEecCCCCcchhhHH Confidence 6555554333 47999999999999999999999999999999999999977543 3467899999877554 78 Q ss_pred HHHHHHHHHhc----cCChHHHHHhCCCCCCHH Q lcl|NC_021326. 377 LQVQTAQQSMG----IVSHETVLENHPFVEDLQ 405 (445) Q Consensus 377 ~~~~~~~~~~g----~~s~et~l~~l~~~~d~~ 405 (445) +.+++++|+.+ +...+++++++++..+ + T Consensus 378 ~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~-d 409 (409) T protein:vir:16 378 LIGDGAIKLNQAIPEFINKDTIRDLTGIKGA-E 409 (409) T ss_pred HHHHHHHHHHhhcccccchhHHHHhccCCCC-C Confidence 88999999853 4567899999988642 2 No 63 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=100.00 E-value=1.1e-57 Score=332.89 Aligned_cols=428 Identities=12% Similarity=0.100 Sum_probs=300.8 Q ss_pred ChHHHHHH------------------HHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHH Q lcl|NC_021326. 1 MIVRYIKQ------------------HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQ 62 (445) Q Consensus 1 ~l~~~i~~------------------~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~ 62 (445) .|++++++ +.+.+.++.++++||+|+|+.+...... .....+.++++++|+++.||++ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~----~~~~~~~~~~~s~n~~~~iv~~ 83 (499) T protein:vir:80 8 GVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYE----HNGNPVNRRQLSMNLPKVTAKY 83 (499) T ss_pred HHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccc----cCCCccccceeecchHHHHHHH Confidence 22233322 3345678999999999999866543221 1122334567899999999999 Q ss_pred HHhhhhccCeeeccCchHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCC Q lcl|NC_021326. 63 KVSYIVGKPIAFKHTDDEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEH 141 (445) Q Consensus 63 ~~~~l~g~~~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~ 141 (445) .|+|++|+|++++++++..++.|+++++ |+|...+.+++..++.+|.+|+.+|.|++|++++.+++|.+++|+|.+... T Consensus 84 ~a~~l~~ep~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~a~~~~Pi~~d~~~ 163 (499) T protein:vir:80 84 MSKLLFNEKVKINIDDETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFATADCMYPLSNDSEN 163 (499) T ss_pred HHHhhhCCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcEEEEEEcCCceEEEEecCCC Confidence 9999999999999999999999999996 679999999999999999999999999999999999999999998776432 Q ss_pred CceEEEEEEEeeecce--eEEEEe--c-----ceEEE--EEEecceeeec-c--cccccccccccccccccccceEEecC Q lcl|NC_021326. 142 EELEAFIRMYKLENET--KVEYWD--K-----ITVNY--YVYENGSLIPD-Y--SNNLENSKTHFSTGSWGKIPFIPFKN 207 (445) Q Consensus 142 ~~~~~~v~~~~~~~~~--~~~~~~--~-----~~~~~--~~~~~~~~~~~-~--~~~~~~~~~~~~~~~~g~iPvv~~~n 207 (445) -...+++..+..++.. .++++. . ++++. |.......... . .......++....++++++||++|++ T Consensus 164 ~~~~~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~p~f~~~~~ 243 (499) T protein:vir:80 164 VDECLIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVPLPSLTRPTFIYIKP 243 (499) T ss_pred eEEEEEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCceeecCCCccceEeecC Confidence 2234445444443321 122221 1 11111 11111110000 0 00112223333445688999999875 Q ss_pred C---------CCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEE-------ecCCcccchhHHHhhhhCceeeccC Q lcl|NC_021326. 208 N---------DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL-------TNYDDQELPEFKRLLRYYGAIKVSD 271 (445) Q Consensus 208 ~---------~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~-------~g~~~~~~~~~~~~~~~~~~~~~~~ 271 (445) + +.|.|+|+++++++|+||.++|++++.++....++.+. .+.++.....+....+...++.... T Consensus 244 ~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~ 323 (499) T protein:vir:80 244 NIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTTQYFDSTDEAFFLYQGEQ 323 (499) T ss_pred CccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhcccceecchhhhhccCCCCCCcccCCCcccceeeEeeccC Confidence 3 45899999999999999999999999999988888772 2222222223333334444443322 Q ss_pred -CC--ceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccc-cccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 272 -NG--GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSD-KFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELL 347 (445) Q Consensus 272 -~~--~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~-~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~ 347 (445) ++ .++.++.++..+++...++.+.+.|...++++.-.++ ..+++.||++++++.+.+..++..+++.|+.+|++++ T Consensus 324 ~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~~~~~~~~~~~l~~l~ 403 (499) T protein:vir:80 324 DDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMI 403 (499) T ss_pred CCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22 3566667788899999999999999888877654333 3346678999999999999999999999999999999 Q ss_pred HHHHHHhc-------cCCCcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHH--HHHHHHHHHHH Q lcl|NC_021326. 348 WFVFEHFD-------IKGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQ--AELERIEQEQM 416 (445) Q Consensus 348 ~~~~~~~~-------~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~--~E~~ri~~E~~ 416 (445) ++++.+.. ...+..++++.|++.++.|..+.+++.+++ +|++|++|++.++++++|.+ +|++||++|++ T Consensus 404 ~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d~ea~~el~~i~~E~~ 483 (499) T protein:vir:80 404 VSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMIPLKIALQRAWNITEAEADEWAEMLAKEKQ 483 (499) T ss_pred HHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHhhcCCCChHHHHHHHHHHHHHhh Confidence 99886532 123456799999999999999999999886 59999999999999988755 57888888875 Q ss_pred HHHHhhhccccCCCCCCCC Q lcl|NC_021326. 417 EYNKQLPNLDDGGADGAQQ 435 (445) Q Consensus 417 ~~~~~~~~~~~~~~~~~~~ 435 (445) ... +..+.++..+..+ T Consensus 484 ~~~---~~~d~~g~~ge~e 499 (499) T protein:vir:80 484 AEI---PNNDMTGIFGEEE 499 (499) T ss_pred cCC---CCCCccccCCCCC Confidence 432 1111111111111 No 64 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=100.00 E-value=2.5e-52 Score=303.47 Aligned_cols=421 Identities=15% Similarity=0.180 Sum_probs=292.3 Q ss_pred ChHHHHHH------------------HHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHH Q lcl|NC_021326. 1 MIVRYIKQ------------------HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQ 62 (445) Q Consensus 1 ~l~~~i~~------------------~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~ 62 (445) ++++++.+ -.+.+.++.++.+||.|+++.+..... ....+.++++++|+++.||+. T Consensus 10 ~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~------~~~~~~~~~~slnl~~~i~~~ 83 (505) T protein:vir:79 10 LFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNS------YGDTQKHELQSVNVTKLASAK 83 (505) T ss_pred HHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCcccccccc------CCCccccceeecchHHHHHHH Confidence 33333221 124667888899999999986643321 112233456788999999999 Q ss_pred HHhhhhccCeeeccCchHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCC Q lcl|NC_021326. 63 KVSYIVGKPIAFKHTDDEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEH 141 (445) Q Consensus 63 ~~~~l~g~~~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~ 141 (445) .|+|++|+|++++++++..++.|+++++ |+|...+.+++..++..|.+++.+|+| .|+++|.+++|++++|++.+... T Consensus 84 ~A~ll~~e~~~i~~~d~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D-~~~~~i~~v~ad~~~P~~~d~~~ 162 (505) T protein:vir:79 84 LASLIFNEQCQVTVSDETANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVD-SGKIKLAWATADQVYPLQADTNQ 162 (505) T ss_pred HHhhhcCCCceeecCChHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEe-CCceEEEEEcCCeeEEEEEcCCC Confidence 9999999999999999999999999886 679999999999999999999999998 47899999999999998654433 Q ss_pred CceEEEEEEEeeeccee------EEEEe----cceEEEEEEe--cceeeecc----c-ccccccccccccccccccceEE Q lcl|NC_021326. 142 EELEAFIRMYKLENETK------VEYWD----KITVNYYVYE--NGSLIPDY----S-NNLENSKTHFSTGSWGKIPFIP 204 (445) Q Consensus 142 ~~~~~~v~~~~~~~~~~------~~~~~----~~~~~~~~~~--~~~~~~~~----~-~~~~~~~~~~~~~~~g~iPvv~ 204 (445) ....+++..|...+... +++|+ .+++....+. ....+... . ......++....+++.++||++ T Consensus 163 ~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~~p~f~~ 242 (505) T protein:vir:79 163 VNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLKHPLFAF 242 (505) T ss_pred eEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhcccccccCcceeecCCCcceEEE Confidence 33344444443322222 22232 2222222221 11100000 0 0011112223335677778888 Q ss_pred ecC----C-----CCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecC---C----cccchh----HHHhhhhC Q lcl|NC_021326. 205 FKN----N-----DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNY---D----DQELPE----FKRLLRYY 264 (445) Q Consensus 205 ~~n----~-----~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~---~----~~~~~~----~~~~~~~~ 264 (445) |++ + +.|.|+|+++++++|++|.++|+++++++....++.+-..+ + +..... +......+ T Consensus 243 ~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~~~~~~~~fd~~~~~y 322 (505) T protein:vir:79 243 YRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWLKTGSSYGGQASETHPPMFDPDETVY 322 (505) T ss_pred ecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcccCCCCcccccccccCCCccceee Confidence 754 2 35899999999999999999999999999988888773221 1 000000 11111112 Q ss_pred ceeecc-CCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccc-cccCcchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 265 GAIKVS-DNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSD-KFGSAPSGVALEFLYTNLNLKADKLARKAKVA 342 (445) Q Consensus 265 ~~~~~~-~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~-~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 342 (445) ..+..+ .++.++.+++++..+++...++.+.+.|...++.+.-.++ ...+..||+++++..+.+.++++++++.|+.+ T Consensus 323 ~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~l~~t~~~~~~~~~~a 402 (505) T protein:vir:79 323 QAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQTYQTRSSYITQVEKT 402 (505) T ss_pred eeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhHHHHHHHHHHHHHHHH Confidence 222222 2334666677788899999999999999988877543332 23456789999999999999999999999999 Q ss_pred HHHHHHHHHHHhccC-------------CCcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCC--HH Q lcl|NC_021326. 343 IQELLWFVFEHFDIK-------------GEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVED--LQ 405 (445) Q Consensus 343 l~~~~~~~~~~~~~~-------------~~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d--~~ 405 (445) |+++++.++.+.... ....+++|.|++.++.|..+.++..+++ +|++|++++++.+++++| ++ T Consensus 403 l~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~Gi~s~e~~l~~~~~~~eeea~ 482 (505) T protein:vir:79 403 IKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAVQAQVMPKKQFLMRNYGLDEEEAD 482 (505) T ss_pred HHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCChHHHH Confidence 999999998764321 1234688999999999999999988876 599999999999999976 66 Q ss_pred HHHHHHHHHHHHHHHhhhccccCCCC Q lcl|NC_021326. 406 AELERIEQEQMEYNKQLPNLDDGGAD 431 (445) Q Consensus 406 ~E~~ri~~E~~~~~~~~~~~~~~~~~ 431 (445) +|++||++|+...+ +...+.+++ T Consensus 483 ~el~ri~~E~~~~~---p~~~~~gg~ 505 (505) T protein:vir:79 483 EWLAQIDAENSTAE---PEFNQFGGD 505 (505) T ss_pred HHHHHHHHhccccC---CCchhccCC Confidence 78999999986532 333333323 No 65 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=100.00 E-value=3.8e-52 Score=302.51 Aligned_cols=426 Identities=16% Similarity=0.193 Sum_probs=289.4 Q ss_pred ChHHHHHH------------------HHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHH Q lcl|NC_021326. 1 MIVRYIKQ------------------HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQ 62 (445) Q Consensus 1 ~l~~~i~~------------------~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~ 62 (445) ++++.+++ -.+++.+|.++.+||+|+|+.+..... ....+...++++|+++.||+. T Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~------~~~~~~~~~~sln~~~~i~~~ 83 (508) T protein:vir:15 10 LFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQAS------DGIKKKRLKNTINMAKTAARR 83 (508) T ss_pred HHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccC------CCCccccceeecchHHHHHHH Confidence 22222222 124667899999999999875432211 111223346789999999999 Q ss_pred HHhhhhccCeeecc-CchHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEE-EcCC Q lcl|NC_021326. 63 KVSYIVGKPIAFKH-TDDEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPI-WTDK 139 (445) Q Consensus 63 ~~~~l~g~~~~~~~-~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v-~d~~ 139 (445) .|++++|+|+++++ +++..++.|+++++ |+|...+.+++..++..|.+++.+|+|. ++++|.+++|++++|+ |+.+ T Consensus 84 ~A~lv~~e~~~i~v~~~~~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~i~~v~ad~~~P~~~d~~ 162 (508) T protein:vir:15 84 IASVVFNEKAEIHVKDNNEADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYIDG-NHIKIAWVRADQFYPLQSNTN 162 (508) T ss_pred HHhhhhCCCceEEeCCchHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEeC-CeeEEEEEcCCeeEEEEEcCC Confidence 99999999999998 55667778998885 6799999999999999999999999985 6799999999999997 4543 Q ss_pred CCCceEEEEEEEeeecceeEEEEe-----------cceEEEEEEecce--eeeccc-----ccccccccccccccccccc Q lcl|NC_021326. 140 EHEELEAFIRMYKLENETKVEYWD-----------KITVNYYVYENGS--LIPDYS-----NNLENSKTHFSTGSWGKIP 201 (445) Q Consensus 140 ~~~~~~~~v~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~--~~~~~~-----~~~~~~~~~~~~~~~g~iP 201 (445) .... .++++.+...+..+..+|+ .+.+.+..+.... .+.... ......++....+++.++| T Consensus 163 ~~~~-~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~g~~~p~ 241 (508) T protein:vir:15 163 DISE-AAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQVTISGLQRPL 241 (508) T ss_pred CeEE-EEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCCCcceEecCCCcce Confidence 2222 2233333222222222222 2333322222111 000000 0011112233345677888 Q ss_pred eEEecC---------CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEE---ecCCcccchhHHHhhhhCceeec Q lcl|NC_021326. 202 FIPFKN---------NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL---TNYDDQELPEFKRLLRYYGAIKV 269 (445) Q Consensus 202 vv~~~n---------~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~---~g~~~~~~~~~~~~~~~~~~~~~ 269 (445) |++|++ ++.|.|+|+++++++|++|.++|+++++++....++.+. ...+......+....+.+..+.. T Consensus 242 f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~d~~~~~~~~~~~~~~~~~~~ 321 (508) T protein:vir:15 242 FAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRLGQKHIAVQPGMLRFDDEHKPTFDTEQNVYVGVLS 321 (508) T ss_pred eEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcCCCCCccccCCCCeeEEeccC Confidence 988865 246899999999999999999999999998887777773 33333332222222222333333 Q ss_pred cCC--CceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccc-cccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 270 SDN--GGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSD-KFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQEL 346 (445) Q Consensus 270 ~~~--~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~-~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~ 346 (445) +++ ..++.+++++..+.+.+.++.+.+.|...++.+.-.++ ..++..||+++++..+.+.+++.++++.|+.+|+++ T Consensus 322 ~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~~~~t~~~~~~~~~~al~~l 401 (508) T protein:vir:15 322 DDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNNSMTYQTRSSYLTMVEKAIDEL 401 (508) T ss_pred CCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 333 34677778889999999999999999888877643333 233567999999999999999999999999999999 Q ss_pred HHHHHHHhccC---------------CCcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCC--HHHH Q lcl|NC_021326. 347 LWFVFEHFDIK---------------GEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVED--LQAE 407 (445) Q Consensus 347 ~~~~~~~~~~~---------------~~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d--~~~E 407 (445) +++++.+.... ....+++|.|++.++.|..++++..+++ +|++|+++++++++++++ +++| T Consensus 402 v~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aGi~s~e~~i~~~~g~~deea~~e 481 (508) T protein:vir:15 402 CQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIGALSKQTFLQRNYGMTDEQAAEE 481 (508) T ss_pred HHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCChHHHHHH Confidence 99988765421 1234688999999999999999998886 599999999999999876 5678 Q ss_pred HHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 408 LERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 408 ~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++||++|+.+....-+. ..+. ++..|| T Consensus 482 l~ri~~E~~~~~~~~~~---------~~~~--~g~~ge 508 (508) T protein:vir:15 482 LAKIQSEAPTDTFEGGR---------SAIL--NGGDGE 508 (508) T ss_pred HHHHHHhccccCccccc---------cccC--CCCCCC Confidence 99999997542211111 1111 111111 No 66 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=100.00 E-value=2e-49 Score=287.51 Aligned_cols=431 Identities=15% Similarity=0.151 Sum_probs=294.4 Q ss_pred ChHHHHHH-----------------HHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHH Q lcl|NC_021326. 1 MIVRYIKQ-----------------HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQK 63 (445) Q Consensus 1 ~l~~~i~~-----------------~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~ 63 (445) +|+++..+ +.+++.+|.++.+||+|+++....... ......++++++|+++.||+.. T Consensus 10 ~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~------~~~~~~~~~~slnl~~~i~~~~ 83 (522) T protein:vir:47 10 FFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNT------DGDIKSRPMNHLPIARTASKKI 83 (522) T ss_pred HHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCccccccccc------CcchhcccceecchHHHHHHHH Confidence 33333322 557789999999999998754321111 1112234578889999999999 Q ss_pred HhhhhccCeeeccCchHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCC Q lcl|NC_021326. 64 VSYIVGKPIAFKHTDDEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHE 142 (445) Q Consensus 64 ~~~l~g~~~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~ 142 (445) |++++++|++++++++..++.|+.+++ |+|...+.+++..++..|.+++.+|+| .+++++.+++|++++|+..++. + T Consensus 84 A~lv~~e~~~i~v~d~~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d-~~~~~i~~v~ad~~~P~~~~~~-~ 161 (522) T protein:vir:47 84 ASLVYNEQATITTKNEILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYID-GDKVRVAFIQAPVFFPLESNTQ-D 161 (522) T ss_pred hhhhcCCcceeecCChHHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEc-CCceEEEEEcCCceEEEEEcCC-c Confidence 999999999999999999999999986 679899999999999999999999998 5789999999999999854432 3 Q ss_pred ceEEEE--EEEeeecceeEEEEec----------------------ceEEEEEEec--ceeeec----cc-ccccccccc Q lcl|NC_021326. 143 ELEAFI--RMYKLENETKVEYWDK----------------------ITVNYYVYEN--GSLIPD----YS-NNLENSKTH 191 (445) Q Consensus 143 ~~~~~v--~~~~~~~~~~~~~~~~----------------------~~~~~~~~~~--~~~~~~----~~-~~~~~~~~~ 191 (445) ...+++ +.+..... +..+|+. +.+....+.. ...+.. .+ .......+. T Consensus 162 ~~e~a~~~~~~~~~~~-~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~ 240 (522) T protein:vir:47 162 VSSAAILTKTIKSEGR-KNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSELDKYKNLEPV 240 (522) T ss_pred eEEEEEEEEEEeeccc-ceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCccccccccccccCCCCc Confidence 333333 22222222 1222222 1222111111 000000 00 001111223 Q ss_pred cccccccccceEEecCC---------CCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCC-----ccc---- Q lcl|NC_021326. 192 FSTGSWGKIPFIPFKNN---------DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD-----DQE---- 253 (445) Q Consensus 192 ~~~~~~g~iPvv~~~n~---------~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~-----~~~---- 253 (445) ....++.+++|++|+++ +.|.|+|++.++++|++|.++|+++++++....++.+...+. ... T Consensus 241 ~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~ 320 (522) T protein:vir:47 241 TVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEHLTQRQYQRPDGTID 320 (522) T ss_pred eEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeecchHHhccCCCCCCcccc Confidence 33456677778887652 469999999999999999999999999999998887733221 100 Q ss_pred -chhHHHhhhhCceeec--cCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccc-cccccccCcchHHHHHHHHHHHH Q lcl|NC_021326. 254 -LPEFKRLLRYYGAIKV--SDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVD-FSSDKFGSAPSGVALEFLYTNLN 329 (445) Q Consensus 254 -~~~~~~~~~~~~~~~~--~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~-~~~~~~~~~~Sg~Ai~~~~~~l~ 329 (445) ...+...-..+..+.. .++.+++.+++++..+.+.+.++.+.+.|...++... ......++..||+++++..+.+. T Consensus 321 ~~~~fd~~~~~f~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~kTAtEi~s~~~~~~ 400 (522) T protein:vir:47 321 FRPRFDVEQNVYMQIGGSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMKTATEIVSENSDTY 400 (522) T ss_pred cccccCcccceEeecCCCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccccccHHHHHHHHHHHH Confidence 0011111111222222 2334577777888888888888888887777666543 22223345678999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcc-------CCCcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC Q lcl|NC_021326. 330 LKADKLARKAKVAIQELLWFVFEHFDI-------KGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPF 400 (445) Q Consensus 330 ~k~~~~~~~~~~~l~~~~~~~~~~~~~-------~~~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~ 400 (445) ++++++++.++.+|+++++.++.+... .....+++|.|++.++.|..++++..+++ +|++|++++++++++ T Consensus 401 ~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~aG~~s~e~~i~~~~g 480 (522) T protein:vir:47 401 QMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELDYWAKMVAAGFSTKKRAIGKTLN 480 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCC Confidence 999999999999999999999876532 23456789999999999999999998885 599999999998877 Q ss_pred CCC--HHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCC Q lcl|NC_021326. 401 VED--LQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQS 444 (445) Q Consensus 401 ~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 444 (445) +++ +++|++||++|+.+.. +...+...+..+.++..|++| T Consensus 481 ~~eeea~~el~ri~~E~~~~~----~~~~~~~~~~~~~~~~~d~~~ 522 (522) T protein:vir:47 481 ISGVEAEKELNAINSELLPMN----DAELAIYGMHDQNEEKADDKG 522 (522) T ss_pred CChHHHHHHHHHHHHhhccCC----CCCCCCCCCCCcccccCCCCC Confidence 765 5678999998875432 212222223355556666666 No 67 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=100.00 E-value=1e-48 Score=283.64 Aligned_cols=422 Identities=15% Similarity=0.128 Sum_probs=281.2 Q ss_pred ChHHHHHH-----------------HHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHH Q lcl|NC_021326. 1 MIVRYIKQ-----------------HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQK 63 (445) Q Consensus 1 ~l~~~i~~-----------------~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~ 63 (445) ++++.+.+ -.+++.+|.++.+||+|+++.+..... ....+.++++++|+++.||+.. T Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~------~~~~~~~~~~slnl~~~i~~~~ 83 (500) T protein:vir:98 10 LVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNT------DGETKKRDLNHLPIARTAAKKI 83 (500) T ss_pred HHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccC------CCCcccCceeecchHHHHHHHH Confidence 22221110 125678999999999999764322111 1122345577889999999999 Q ss_pred HhhhhccCeeeccCchHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCC Q lcl|NC_021326. 64 VSYIVGKPIAFKHTDDEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHE 142 (445) Q Consensus 64 ~~~l~g~~~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~ 142 (445) |++++|+|++++++++..++.|+++++ |+|...+.+++..++..|.+++.+|+|. ++++|.+++|++++|+..+... T Consensus 84 A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~I~~v~ad~~~P~~~d~~~- 161 (500) T protein:vir:98 84 ASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DKVRVAFVQAPVFLPLQSNTQD- 161 (500) T ss_pred hhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeeEEEEEcCCC- Confidence 999999999999999999999999986 5799999999999999999999999985 7899999999999998655433 Q ss_pred ceEEEEEE--Eeeecce-----eEEEEe-----cceEEEEEEecc------eeeecccccccccccccccccccccceEE Q lcl|NC_021326. 143 ELEAFIRM--YKLENET-----KVEYWD-----KITVNYYVYENG------SLIPDYSNNLENSKTHFSTGSWGKIPFIP 204 (445) Q Consensus 143 ~~~~~v~~--~~~~~~~-----~~~~~~-----~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~g~iPvv~ 204 (445) ...+++.+ +...... .++.|+ .+.++...+... ..+... ......++.....++.++||++ T Consensus 162 ~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~-~~~~~l~~~~~~~~~~~p~f~~ 240 (500) T protein:vir:98 162 VSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLS-EVYKDLKDEAKVTDVTRPIFTY 240 (500) T ss_pred eEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccc-cccCCcCcceEeccCCCccEEE Confidence 33333322 2222211 123332 223332222211 111000 0112222333445677777877 Q ss_pred ecC---------CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCC--------cccchhHHHhhh--hCc Q lcl|NC_021326. 205 FKN---------NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD--------DQELPEFKRLLR--YYG 265 (445) Q Consensus 205 ~~n---------~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~--------~~~~~~~~~~~~--~~~ 265 (445) |++ .+.|.|+|+++++++|++|.++|+++++++....++.+...+- ++....+.-... .+. T Consensus 241 ~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~~~~~~ 320 (500) T protein:vir:98 241 LKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDVVPRPRFESDQNVYI 320 (500) T ss_pred ecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCccccCCcccCCCcceEE Confidence 753 2458999999999999999999999999999888777743321 111111111111 111 Q ss_pred eeeccCC--CceeeEeccCChHHHHHHHHHHHHHHHHHhCcccc-ccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 266 AIKVSDN--GGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDF-SSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVA 342 (445) Q Consensus 266 ~~~~~~~--~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~-~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 342 (445) .+...++ ..++.+++++..+++...++.+.+.|...++.+.- .+...++..||+++++..+.+..++.++++.|+.+ T Consensus 321 ~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~~~~a 400 (500) T protein:vir:98 321 RMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIVALVEQS 400 (500) T ss_pred EcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222222 33566667778888888888887777766665432 22233466789999999999999999999999999 Q ss_pred HHHHHHHHHHHhcc-------CCCcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCC--HHHHHHHH Q lcl|NC_021326. 343 IQELLWFVFEHFDI-------KGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVED--LQAELERI 411 (445) Q Consensus 343 l~~~~~~~~~~~~~-------~~~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d--~~~E~~ri 411 (445) |++++++++.+... .....++.|.|++.++.|..+.++..+++ +|++|++++++++.++++ ++++++|| T Consensus 401 l~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~~g~~eeea~~~l~~i 480 (500) T protein:vir:98 401 LKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKVLNVTEEKAQEIAAEI 480 (500) T ss_pred HHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCCHHHHHHHHHHH Confidence 99999999876431 22345689999999999999999998886 599999999988866654 44567777 Q ss_pred HHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 412 EQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 412 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++|+....... ....|.-|| T Consensus 481 ~~E~~~~~~~~--------------~~~~~~~g~ 500 (500) T protein:vir:98 481 NTGIVDEINQQ--------------RTDTHLYGE 500 (500) T ss_pred HHhccccCCCC--------------CccccccCC Confidence 76642211111 112222222 No 68 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=100.00 E-value=1e-48 Score=283.64 Aligned_cols=422 Identities=15% Similarity=0.128 Sum_probs=281.2 Q ss_pred ChHHHHHH-----------------HHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHH Q lcl|NC_021326. 1 MIVRYIKQ-----------------HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQK 63 (445) Q Consensus 1 ~l~~~i~~-----------------~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~ 63 (445) ++++.+.+ -.+++.+|.++.+||+|+++.+..... ....+.++++++|+++.||+.. T Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~------~~~~~~~~~~slnl~~~i~~~~ 83 (500) T protein:vir:30 10 LVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNT------DGETKKRDLNHLPIARTAAKKI 83 (500) T ss_pred HHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccC------CCCcccCceeecchHHHHHHHH Confidence 22221110 125678999999999999764322111 1122345577889999999999 Q ss_pred HhhhhccCeeeccCchHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCC Q lcl|NC_021326. 64 VSYIVGKPIAFKHTDDEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHE 142 (445) Q Consensus 64 ~~~l~g~~~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~ 142 (445) |++++|+|++++++++..++.|+++++ |+|...+.+++..++..|.+++.+|+|. ++++|.+++|++++|+..+... T Consensus 84 A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~I~~v~ad~~~P~~~d~~~- 161 (500) T protein:vir:30 84 ASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DKVRVAFVQAPVFLPLQSNTQD- 161 (500) T ss_pred hhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeeEEEEEcCCC- Confidence 999999999999999999999999986 5799999999999999999999999985 7899999999999998655433 Q ss_pred ceEEEEEE--Eeeecce-----eEEEEe-----cceEEEEEEecc------eeeecccccccccccccccccccccceEE Q lcl|NC_021326. 143 ELEAFIRM--YKLENET-----KVEYWD-----KITVNYYVYENG------SLIPDYSNNLENSKTHFSTGSWGKIPFIP 204 (445) Q Consensus 143 ~~~~~v~~--~~~~~~~-----~~~~~~-----~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~g~iPvv~ 204 (445) ...+++.+ +...... .++.|+ .+.++...+... ..+... ......++.....++.++||++ T Consensus 162 ~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~-~~~~~l~~~~~~~~~~~p~f~~ 240 (500) T protein:vir:30 162 VSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLS-EVYKDLKDEAKVTDVTRPIFTY 240 (500) T ss_pred eEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccc-cccCCcCcceEeccCCCccEEE Confidence 33333322 2222211 123332 223332222211 111000 0112222333445677777877 Q ss_pred ecC---------CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCC--------cccchhHHHhhh--hCc Q lcl|NC_021326. 205 FKN---------NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD--------DQELPEFKRLLR--YYG 265 (445) Q Consensus 205 ~~n---------~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~--------~~~~~~~~~~~~--~~~ 265 (445) |++ .+.|.|+|+++++++|++|.++|+++++++....++.+...+- ++....+.-... .+. T Consensus 241 ~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~~~~~~ 320 (500) T protein:vir:30 241 LKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDVVPRPRFESDQNVYI 320 (500) T ss_pred ecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCccccCCcccCCCcceEE Confidence 753 2458999999999999999999999999999888777743321 111111111111 111 Q ss_pred eeeccCC--CceeeEeccCChHHHHHHHHHHHHHHHHHhCcccc-ccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 266 AIKVSDN--GGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDF-SSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVA 342 (445) Q Consensus 266 ~~~~~~~--~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~-~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 342 (445) .+...++ ..++.+++++..+++...++.+.+.|...++.+.- .+...++..||+++++..+.+..++.++++.|+.+ T Consensus 321 ~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~~~~a 400 (500) T protein:vir:30 321 RMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIVALVEQS 400 (500) T ss_pred EcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222222 33566667778888888888887777766665432 22233466789999999999999999999999999 Q ss_pred HHHHHHHHHHHhcc-------CCCcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCC--HHHHHHHH Q lcl|NC_021326. 343 IQELLWFVFEHFDI-------KGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVED--LQAELERI 411 (445) Q Consensus 343 l~~~~~~~~~~~~~-------~~~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d--~~~E~~ri 411 (445) |++++++++.+... .....++.|.|++.++.|..+.++..+++ +|++|++++++++.++++ ++++++|| T Consensus 401 l~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~~g~~eeea~~~l~~i 480 (500) T protein:vir:30 401 LKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKVLNVTEEKAQEIAAEI 480 (500) T ss_pred HHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCCHHHHHHHHHHH Confidence 99999999876431 22345689999999999999999998886 599999999988866654 44567777 Q ss_pred HHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 412 EQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 412 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++|+....... ....|.-|| T Consensus 481 ~~E~~~~~~~~--------------~~~~~~~g~ 500 (500) T protein:vir:30 481 NTGIVDEINQQ--------------RTDTHLYGE 500 (500) T ss_pred HHhccccCCCC--------------CccccccCC Confidence 76642211111 112222222 No 69 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=100.00 E-value=3.1e-47 Score=275.57 Aligned_cols=437 Identities=13% Similarity=0.123 Sum_probs=305.2 Q ss_pred ChHHHHHHH-HHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCee--eccC Q lcl|NC_021326. 1 MIVRYIKQH-LEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA--FKHT 77 (445) Q Consensus 1 ~l~~~i~~~-~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~--~~~~ 77 (445) =+-+....| +.|+..|..+.+||.|.+.-+....+... ..+ .-.+.++..++|++..-.|+ +.+.. .+.. T Consensus 20 ~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~--~~~----~r~~~~ps~~~~~~~~~~~~-~~g~~~~~~~~ 92 (527) T protein:vir:10 20 NFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGD--EGD----QRPIYVPNGEKLIEAKMRFL-GQGLKWEFSKK 92 (527) T ss_pred cCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCcc--ccc----cceeeehhhHHhhCCcceee-ccCccccccch Confidence 111123444 56889999999999998643332222111 111 11367777788887765554 44444 3445 Q ss_pred chHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC----CcEEEEEEccceeEEEEcCCCCCceEEE--EEE Q lcl|NC_021326. 78 DDEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE----GEFKLFRVPAEQGIPIWTDKEHEELEAF--IRM 150 (445) Q Consensus 78 d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~----g~~~i~~~~p~~~~~v~d~~~~~~~~~~--v~~ 150 (445) ++.+.+.++.|.+ +++..++.+..+++.+.|++++++-+|+. ++++++.+||.+.||+.|+...+.+..+ +.- T Consensus 93 ~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~~~ 172 (527) T protein:vir:10 93 DAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYLVDE 172 (527) T ss_pred hHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcceeeeeecCCCCCceeeEEEeee Confidence 6788899988885 78899999999999999998877776642 4799999999999999887655555544 323 Q ss_pred EeeecceeEEEE-----------------ecceEEEEEEecceeeeccc-------------cccccccccccccccccc Q lcl|NC_021326. 151 YKLENETKVEYW-----------------DKITVNYYVYENGSLIPDYS-------------NNLENSKTHFSTGSWGKI 200 (445) Q Consensus 151 ~~~~~~~~~~~~-----------------~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~g~i 200 (445) |......+..++ .......| .+..|....+. ......+.+..++++++| T Consensus 173 ~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~y-t~~~w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fi 251 (527) T protein:vir:10 173 YPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKY-TEELYEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQITTL 251 (527) T ss_pred ccCCccccccceehhhhhhhhhcCcccccccCcceee-eeceeeccccccccccccchhhhhhhcCceeeecccCCCCcc Confidence 443322221111 00111111 22222221111 112344456789999999 Q ss_pred ceEEecCC-----CCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccc-hhHH-HhhhhCceeeccCCC Q lcl|NC_021326. 201 PFIPFKNN-----DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQEL-PEFK-RLLRYYGAIKVSDNG 273 (445) Q Consensus 201 Pvv~~~n~-----~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~-~~~~-~~~~~~~~~~~~~~~ 273 (445) |+|+|+|- .+|+|+++++++++|++|.++|+.+.++.+...|+.+++|+...+. ++.. ..+....++.+++++ T Consensus 252 PvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~~~~VgPG~iweL~e~a 331 (527) T protein:vir:10 252 PVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGNMVPWTISPLGMVEHGQNN 331 (527) T ss_pred ceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCCcCccccCCceeEecCCCc Confidence 99999763 4799999999999999999999999999999999999999754321 1111 124455677899999 Q ss_pred ceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccc--cccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH Q lcl|NC_021326. 274 GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSD--KFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLW-FV 350 (445) Q Consensus 274 ~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~--~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~-~~ 350 (445) ++..+......+.++.|++.|.+.|+..|++|.+.++ ..++++||.|++..+++|.+++.+++..++-..++..+ ++ T Consensus 332 k~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~ 411 (527) T protein:vir:10 332 KIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLV 411 (527) T ss_pred ceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhH Confidence 9888776668889999999999999999999999988 44578999999999999999999999888888876543 22 Q ss_pred ----HHHhcc--CC--CcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHH Q lcl|NC_021326. 351 ----FEHFDI--KG--EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENH---PFVEDLQAELERIEQEQME 417 (445) Q Consensus 351 ----~~~~~~--~~--~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l---~~~~d~~~E~~ri~~E~~~ 417 (445) ..+.+. .+ ....+.++|.+.+|+|.++.++.+.++ +|++|.+||+++| ++++|+++|+++|.+|++. T Consensus 412 ~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGi~S~~tAv~~L~~~~g~eD~E~E~~~I~~era~ 491 (527) T protein:vir:10 412 TQWLPAYEGVGIDDADKKLTVTITFRDPKPVNSEKRFNQLLQLWEAGLIPAKKLTEELSKIMGFELTEEDFKQATEDKKT 491 (527) T ss_pred HHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcCchhHHHHHHHHHhccCCCChHHHHHHHHHHHHH Confidence 222222 22 234679999999999999999999887 5999999998887 7899999999999999887 Q ss_pred HHHhhhccccCCCCCCCCCCCCCC----cCCC Q lcl|NC_021326. 418 YNKQLPNLDDGGADGAQQKERSND----KQSE 445 (445) Q Consensus 418 ~~~~~~~~~~~~~~~~~~~~~~~d----~~~~ 445 (445) ..+...+..+..+-.+...+.-++ +.+- T Consensus 492 ~a~a~a~A~~~~~a~~~~~~g~~~~~~d~~~~ 523 (527) T protein:vir:10 492 QGIAQAEAADPFGAQMAAEQGIPDEEDDQALN 523 (527) T ss_pred HhHHhhhhcCchhhhhccccCCCCCCcccccC Confidence 666655544433222122222222 2222 No 70 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=100.00 E-value=3.8e-47 Score=275.07 Aligned_cols=437 Identities=12% Similarity=0.122 Sum_probs=305.3 Q ss_pred ChHHHHHHH-HHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCee--eccC Q lcl|NC_021326. 1 MIVRYIKQH-LEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA--FKHT 77 (445) Q Consensus 1 ~l~~~i~~~-~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~--~~~~ 77 (445) =+-+....| +.|+..|..+.+||.|.+.-+....+... ..+ .-.+.++..++|++..-.|+ +.+.. .+.. T Consensus 20 ~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~--~~~----~r~~~~ps~~~~~~~~~~~~-~~g~~~~~~~~ 92 (527) T protein:vir:10 20 NFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGD--EGD----QRPIYVPNGEKLIEAKMRFL-GQGLKWEFSKK 92 (527) T ss_pred cCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCcc--ccc----cceeeehhhHHhhCCcceee-ccCccccccch Confidence 111123444 56889999999999998643332222111 111 11367777788877765554 44444 3345 Q ss_pred chHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC----CcEEEEEEccceeEEEEcCCCCCceEEE--EEE Q lcl|NC_021326. 78 DDEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE----GEFKLFRVPAEQGIPIWTDKEHEELEAF--IRM 150 (445) Q Consensus 78 d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~----g~~~i~~~~p~~~~~v~d~~~~~~~~~~--v~~ 150 (445) ++.+.+.++.|.+ +++..++.+..+++.+.|++++++-+|+. ++++++.+||.+.||+.|+...+.+..+ +.- T Consensus 93 ~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~~~ 172 (527) T protein:vir:10 93 DAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYLVDE 172 (527) T ss_pred hHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcceeeeeecCCCCCceeeEEEeee Confidence 6788889988875 78899999999999999998877776642 4799999999999999887655555544 323 Q ss_pred EeeecceeEEEE-----------------ecceEEEEEEecceeeeccc-------------cccccccccccccccccc Q lcl|NC_021326. 151 YKLENETKVEYW-----------------DKITVNYYVYENGSLIPDYS-------------NNLENSKTHFSTGSWGKI 200 (445) Q Consensus 151 ~~~~~~~~~~~~-----------------~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~g~i 200 (445) |......+..++ .......| .+..|....+. ......+.+..++++++| T Consensus 173 ~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~y-t~~~w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fi 251 (527) T protein:vir:10 173 YPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKY-TEELYEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQITTL 251 (527) T ss_pred ccCCccccccceehhhhhhhhhcCcccccccCcceee-eeceeeccccccccccccchhhhhhhcCceeeecccCCCCcc Confidence 443322221111 00111111 22222221111 112344456789999999 Q ss_pred ceEEecCC-----CCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccc-hhHH-HhhhhCceeeccCCC Q lcl|NC_021326. 201 PFIPFKNN-----DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQEL-PEFK-RLLRYYGAIKVSDNG 273 (445) Q Consensus 201 Pvv~~~n~-----~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~-~~~~-~~~~~~~~~~~~~~~ 273 (445) |+|+|+|- .+|+|+++++++++|++|.++|+.+.++.+...|+.+++|+...+. ++.. ..+....++.+++++ T Consensus 252 PvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~~~~VgPG~iweL~e~a 331 (527) T protein:vir:10 252 PVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGNMVPWTISPLGMVEHGQNN 331 (527) T ss_pred ceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCCcCccccCCceeEecCCCc Confidence 99999763 4799999999999999999999999999999999999999754321 1111 124455677899999 Q ss_pred ceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccc--cccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH Q lcl|NC_021326. 274 GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSD--KFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLW-FV 350 (445) Q Consensus 274 ~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~--~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~-~~ 350 (445) ++..+......+.++.|++.|.+.|+..|++|.+.++ ..++++||.|++..+++|.+++.+++..++-..++..+ ++ T Consensus 332 k~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~ 411 (527) T protein:vir:10 332 KIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLV 411 (527) T ss_pred ceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhH Confidence 9888776668889999999999999999999999988 44578999999999999999999999888888876543 22 Q ss_pred ----HHHhcc--CC--CcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHH Q lcl|NC_021326. 351 ----FEHFDI--KG--EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENH---PFVEDLQAELERIEQEQME 417 (445) Q Consensus 351 ----~~~~~~--~~--~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l---~~~~d~~~E~~ri~~E~~~ 417 (445) ..+.+. .+ ....+.++|.+.+|+|.++.++.+.++ +|++|.+||+++| ++++|+++|+++|.+|++. T Consensus 412 ~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGiiS~etAv~~L~~~~g~eD~E~E~~~I~~era~ 491 (527) T protein:vir:10 412 TQWLPAYEGVGIDDADKKLTVTITFRDPKPVNNEKRFAQLLELWEAGLIPAKKLTEELSKIMGFELTEEDFRQATEDKKT 491 (527) T ss_pred HHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcCchhHHHHHHHHHhccCCCchHHHHHHHHHHHHH Confidence 222222 22 234679999999999999999999887 5999999998887 7899999999999999987 Q ss_pred HHHhhhccccCCCCCCCCCCCCCC----cCCC Q lcl|NC_021326. 418 YNKQLPNLDDGGADGAQQKERSND----KQSE 445 (445) Q Consensus 418 ~~~~~~~~~~~~~~~~~~~~~~~d----~~~~ 445 (445) ..+...+..+..+-.+...+.-++ +.+- T Consensus 492 ~a~a~a~a~~~~~a~~~~~~g~~~~~~d~~~~ 523 (527) T protein:vir:10 492 QGIAQAEAADPFGAQMAAEQGIPDEEDDQALN 523 (527) T ss_pred HhHHhhhhcCchhhhhccccCCCCCCcccccC Confidence 666655544433222122222222 2222 No 71 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=100.00 E-value=3.8e-45 Score=264.13 Aligned_cols=429 Identities=17% Similarity=0.108 Sum_probs=280.2 Q ss_pred ChHHHHHHHHH------HHHHHHHHHHHhcCCCccccccc---cccccccccccccccccccchHHHHHHHHHhhhhccC Q lcl|NC_021326. 1 MIVRYIKQHLE------KLPEISIGQEYYEQRPDIVKEPK---PVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKP 71 (445) Q Consensus 1 ~l~~~i~~~~~------~~~~~~~~~~yy~G~~~i~~~~~---~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~ 71 (445) .|+++|+-+-. ++.++....++|.++..-..+.. ..+.+. ..+.....++++|+++.|++..|++++|++ T Consensus 6 ~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~-~~~~~~~~~~~~~l~~~i~~~~A~ll~~e~ 84 (518) T protein:vir:78 6 VMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQG-YVPTVHDKLMNSGTGNEIVVVAAEYISGKP 84 (518) T ss_pred hHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccC-CCCccccccccCChHHHHHHHHHHhhcCCC Confidence 78888888752 33455555566666542211111 111111 122334457889999999999999999999 Q ss_pred eeecc------CchHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCce Q lcl|NC_021326. 72 IAFKH------TDDEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEEL 144 (445) Q Consensus 72 ~~~~~------~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~ 144 (445) ++++. +++..++.|+.+++ |+|...+.+.+..++..|.+++.++++ +|+++|.+++|++++|+|+++. + T Consensus 85 ~~i~v~~~~~~d~e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d-~~~~~i~~v~ad~~~P~~~~g~---~ 160 (518) T protein:vir:78 85 LSIDVTGVNGSKDENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINIL-NGRPSISVHSSSQFWIDFKNNE---P 160 (518) T ss_pred ceEEecCccccCcHHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEE-CCeeEEEEEcCCeeEEEeecCc---E Confidence 98864 46778889999885 679999999999999999999999997 5899999999999999998754 3 Q ss_pred EEEEEEEeeecceeEEEEe------------------cceEEEEEEe--cceeeecc-----ccc-----cccccccccc Q lcl|NC_021326. 145 EAFIRMYKLENETKVEYWD------------------KITVNYYVYE--NGSLIPDY-----SNN-----LENSKTHFST 194 (445) Q Consensus 145 ~~~v~~~~~~~~~~~~~~~------------------~~~~~~~~~~--~~~~~~~~-----~~~-----~~~~~~~~~~ 194 (445) ..++.........+..+|+ .+.+++..+. ....+... ... .......... T Consensus 161 ~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~~e~~~~ 240 (518) T protein:vir:78 161 FRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTNDIQLNHSV 240 (518) T ss_pred EEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCcccccccccccccccccccccccCccceee Confidence 3333222222122211221 1222222211 11100000 000 0000001111 Q ss_pred ccccccceEEe-cC----C-----CCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCc-----c---cchh Q lcl|NC_021326. 195 GSWGKIPFIPF-KN----N-----DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDD-----Q---ELPE 256 (445) Q Consensus 195 ~~~g~iPvv~~-~n----~-----~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~-----~---~~~~ 256 (445) ......|+++| +| + +.|.|+|+++++++|++|.++|+++++++....++.+...+-. . .... T Consensus 241 ~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~~~~~ 320 (518) T protein:vir:78 241 SIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEKTKTKIAASERMFRKKVNKSTDKEEWS 320 (518) T ss_pred ccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHhCCceeeechhHhccCCCCCCCccccc Confidence 12234566555 32 2 3499999999999999999999999999987777777433311 0 0011 Q ss_pred HHHhhhhCceeeccC--CCc----eeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHH Q lcl|NC_021326. 257 FKRLLRYYGAIKVSD--NGG----VDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNL 330 (445) Q Consensus 257 ~~~~~~~~~~~~~~~--~~~----~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~ 330 (445) +......+..+.... +++ ++.+++++..+++...++.+.+.|...++.+.-.++..++..||+++++..+.+.+ T Consensus 321 fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~~~TATei~s~~~~~~~ 400 (518) T protein:vir:78 321 MNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNREVKATEIWSLQDATVR 400 (518) T ss_pred cCCCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcccccccHHHHHHHHHHHHH Confidence 111122232333222 222 45666778889999999999999988887765445444457899999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccC---------CCcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHh-C Q lcl|NC_021326. 331 KADKLARKAKVAIQELLWFVFEHFDIK---------GEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLEN-H 398 (445) Q Consensus 331 k~~~~~~~~~~~l~~~~~~~~~~~~~~---------~~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~-l 398 (445) ++.+++..++.+|+++++.++.++..- .+...+.|.|++.++.|..+.+++.+++ +|++|+++++++ . T Consensus 401 t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~aGimS~e~~i~~~~ 480 (518) T protein:vir:78 401 KIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLNNMNSALAMSVEEKVKLIH 480 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHHhC Confidence 999999999999999999998875532 1234688999999999999999998875 599999999986 4 Q ss_pred CCCCC--HHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCC Q lcl|NC_021326. 399 PFVED--LQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERS 439 (445) Q Consensus 399 ~~~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~ 439 (445) ++++| +++|++||++|+.......++ +.++.++.+. T Consensus 481 ~~~~deea~~e~~ri~~E~~~~~~~~p~-----~~~g~~~~~g 518 (518) T protein:vir:78 481 PKWEDEEIQAEVKRIYLENAIGEVPDPE-----AIGGMETKGG 518 (518) T ss_pred CCCCHHHHHHHHHHHHHHhcccCCCCCc-----cccCCCCCCC Confidence 66654 567899999998654322221 1111111111 No 72 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=100.00 E-value=3.1e-42 Score=248.17 Aligned_cols=428 Identities=14% Similarity=0.108 Sum_probs=278.8 Q ss_pred ChHHHHHH-----------------HHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHH Q lcl|NC_021326. 1 MIVRYIKQ-----------------HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQK 63 (445) Q Consensus 1 ~l~~~i~~-----------------~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~ 63 (445) +++++..+ -.+...++.++.+||+|+++-+... ... ...+..+++++|+++.|+..+ T Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~-~~~-----~~~~~~~~~sl~~~~~i~~~~ 83 (517) T protein:vir:98 10 FFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYI-NSQ-----GKIQERDYMTLNLRKLSADVL 83 (517) T ss_pred HHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccc-ccc-----ccccccceeecCcHHHHHHHh Confidence 33333221 2245678889999999998754321 111 111223467889999999999 Q ss_pred HhhhhccCeeeccCch-----------HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccce Q lcl|NC_021326. 64 VSYIVGKPIAFKHTDD-----------EVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQ 131 (445) Q Consensus 64 ~~~l~g~~~~~~~~d~-----------~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~ 131 (445) ++++++++++++.++. ..++.|+.+++ |+|...+.+.+..++..|.+++.+|+|. |.++|.+++|++ T Consensus 84 A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~-~~~~I~~v~ad~ 162 (517) T protein:vir:98 84 SGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVDN-GEIEFSWALANA 162 (517) T ss_pred hhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEeC-CeeEEEEEcCCe Confidence 9999999999987652 35788888886 5799999999999999999999999984 789999999999 Q ss_pred eEEEEcCCCCCceEEEEEE--Eeeeccee-----EEEEecc---------eEEE--EEEecceeeecc---ccccccccc Q lcl|NC_021326. 132 GIPIWTDKEHEELEAFIRM--YKLENETK-----VEYWDKI---------TVNY--YVYENGSLIPDY---SNNLENSKT 190 (445) Q Consensus 132 ~~~v~d~~~~~~~~~~v~~--~~~~~~~~-----~~~~~~~---------~~~~--~~~~~~~~~~~~---~~~~~~~~~ 190 (445) ++|+-.+. .+...+++.+ +...+... .+.+... +++. |.......+... ........+ T Consensus 163 ~~Pl~~~~-~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~~~~e~l~~ 241 (517) T protein:vir:98 163 FYPLRSNS-NGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRIPLEELYEGMQE 241 (517) T ss_pred eEEEEecC-CCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCccccccccccccccCCCc Confidence 99964332 3444444322 22222211 1222111 1111 111111100000 000111222 Q ss_pred ccccccccccceEEecC---------CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCC---cccchhH- Q lcl|NC_021326. 191 HFSTGSWGKIPFIPFKN---------NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEF- 257 (445) Q Consensus 191 ~~~~~~~g~iPvv~~~n---------~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~- 257 (445) ...-.++.+++|++|++ .+.|.|+|+++++++|++|.++|+++++++....++.+...+- .+..+.. T Consensus 242 ~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~~i~vp~~~l~~~~~~~g~~~ 321 (517) T protein:vir:98 242 KTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQRTVFVSDVMLRTVPDESGMPP 321 (517) T ss_pred ceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCcceecChhhhccccCCCCccc Confidence 23334556666667654 2469999999999999999999999999999888887744432 1111100 Q ss_pred ----HHhhhhCceeecc-CCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccc-cccCcchHHHHHHHHHHHHHH Q lcl|NC_021326. 258 ----KRLLRYYGAIKVS-DNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSD-KFGSAPSGVALEFLYTNLNLK 331 (445) Q Consensus 258 ----~~~~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~-~~~~~~Sg~Ai~~~~~~l~~k 331 (445) ......+..+..+ ++..++..++++..+++.+.++.+.+.|...++.+.-.++ ...+..+|+++++..+.+.++ T Consensus 322 ~~~~d~~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~kTATEi~s~~~~~~~t 401 (517) T protein:vir:98 322 PQVFDPDVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSMKTATEIVSENDLTYRT 401 (517) T ss_pred CCCCCcccceeeeccCCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccccccHHHHHHHHHHHHHH Confidence 0011112222222 2233455556677889999999999988888887644333 223556899999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcc-------CCCcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCC Q lcl|NC_021326. 332 ADKLARKAKVAIQELLWFVFEHFDI-------KGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVE 402 (445) Q Consensus 332 ~~~~~~~~~~~l~~~~~~~~~~~~~-------~~~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~ 402 (445) ++++++.++.+|++++++++.+... .....++.|.|.+.++.|..+.++..+++ +|++|+++++.++.+++ T Consensus 402 ~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~aG~ms~~~~i~~~~g~~ 481 (517) T protein:vir:98 402 RNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRSALLRFYGQAKTFGFIPTVEAIQRIFKVP 481 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHHhCCCC Confidence 9999999999999999998765422 12345789999999999999999998885 59999999998876665 Q ss_pred C--HHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCC Q lcl|NC_021326. 403 D--LQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQS 444 (445) Q Consensus 403 d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 444 (445) + +++|+.||++|+.+. .+ ......+.+...++++ T Consensus 482 eeeA~~e~~~i~~E~~~~----~~----~~~~~~~~~~~~gd~e 517 (517) T protein:vir:98 482 KKTAEQWLEEIRKDQIEL----DP----VTISQRAQKRMFGDEE 517 (517) T ss_pred hHHHHHHHHHHHHhcccc----CC----CCccccccCCCCCCCC Confidence 4 456788888887532 11 1111122222222222 No 73 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=100.00 E-value=1.2e-40 Score=239.53 Aligned_cols=434 Identities=10% Similarity=0.067 Sum_probs=277.4 Q ss_pred ChHHHHHH-HHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccC-- Q lcl|NC_021326. 1 MIVRYIKQ-HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHT-- 77 (445) Q Consensus 1 ~l~~~i~~-~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~-- 77 (445) ........ .+.|+.+|..+.+||.|++--+..... + ...+ -+..|+++++|++.+.| +|.++.|+.+ T Consensus 18 ~~a~wV~~~D~~RlaaY~ly~d~y~n~~~el~~il~--G----~dr~---~~~~ps~r~~V~~~~~~-Lg~~~~~~Ve~~ 87 (563) T protein:vir:74 18 GDDNIVDENDKNRVRAYDLYENIYLNSAETLKLVLR--G----DDSV---PILMPSGRKIVEAVHRF-LGVGFDYLVEPD 87 (563) T ss_pred cccccCCHHHHHHHHHHHHHHHhhcCchhhhhhhcC--C----Ccee---eeccchHHHHHHHHHHh-cCCCcEEecCcc Confidence 33333444 456899999999999998742211111 0 1111 25566888999996654 5999999652 Q ss_pred --ch----HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC----CcEEEEEEccceeEEEEcCCCCCceEE Q lcl|NC_021326. 78 --DD----EVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE----GEFKLFRVPAEQGIPIWTDKEHEELEA 146 (445) Q Consensus 78 --d~----~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~----g~~~i~~~~p~~~~~v~d~~~~~~~~~ 146 (445) ++ .++..|..|.+ +++..++.+..+++.+.|++++++-.|++ +++++..+||.+.||+-+++.. .-.+ T Consensus 88 ~~de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~~g~R~rv~~vDP~~~fp~~dpd~v-~g~~ 166 (563) T protein:vir:74 88 MGDEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKKAGERISVDEVDPRQIFLIEDGSTV-VGFH 166 (563) T ss_pred ccCcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccccCCCceEeecCCceeeeccCCCCc-ccce Confidence 22 24455566664 67888899999999999998877776642 5899999999999995443322 1111 Q ss_pred EEE---EEeeecceeEEEEecce-----------EEEEEEecc-eeeeccc-----------ccc------ccccccccc Q lcl|NC_021326. 147 FIR---MYKLENETKVEYWDKIT-----------VNYYVYENG-SLIPDYS-----------NNL------ENSKTHFST 194 (445) Q Consensus 147 ~v~---~~~~~~~~~~~~~~~~~-----------~~~~~~~~~-~~~~~~~-----------~~~------~~~~~~~~~ 194 (445) .++ -|.........+.-..+ ...+.+... |....+. ... ...+...-| T Consensus 167 ~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae~w~lg~wd~r~~~~~~~~~~~~~~~~~~~d~e~~~LP 246 (563) T protein:vir:74 167 MVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELTHWTLGNWDDRGAISDEQARRKEQVRSAQHDEEEEELP 246 (563) T ss_pred eeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccchhccccccccCccchhhhcccchhhhhhhhchhhhcc Confidence 112 22222111100000000 000000000 1100000 000 111334458 Q ss_pred ccccccceEEecCC-----CCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCccc--chhH-HHhhhhCce Q lcl|NC_021326. 195 GSWGKIPFIPFKNN-----DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQE--LPEF-KRLLRYYGA 266 (445) Q Consensus 195 ~~~g~iPvv~~~n~-----~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~--~~~~-~~~~~~~~~ 266 (445) ++++.||+|+|+|- .+|.|.+++++++++++|.++++.+.++.+...|+.++.|..... .++. ..++....+ T Consensus 247 ~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~~~~p~d~~~g~~~~w~vgpG~i 326 (563) T protein:vir:74 247 EPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMYVTNASAPVDPNTGELTDWNIGPMQI 326 (563) T ss_pred ccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeEEeccccccccccccccccccCCcee Confidence 89999999998763 479999999999999999999999999999999999988653211 1111 123566677 Q ss_pred eeccCCCceee---EeccCChHHHHHHHHHHHH-HHHHHhCccccccc--cccCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 267 IKVSDNGGVDT---IQVEVPVENSKKYLDELYQ-KIMLFGQAVDFSSD--KFGSAPSGVALEFLYTNLNLKADKLARKAK 340 (445) Q Consensus 267 ~~~~~~~~~~~---l~~~~~~~~~~~~i~~l~~-~i~~~s~~p~~~~~--~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~ 340 (445) +.++++..+.+ +....+.+.++.|++.+.. .|+..|++|...++ ..+...||.|++..+.+|.+++.+++..+. T Consensus 327 ~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s~tPavA~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~ 406 (563) T protein:vir:74 327 VEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGSGTPEVAIGRVDVTSAESGISLELQLKPLLAANEEKELEMI 406 (563) T ss_pred EeccCCccccceeeecchhhhHHHHHHHHHHHHHHHHhhccCcceeecccccccccchhhhhhhhhHHHHhhhhhHHHHH Confidence 78887655433 3333466889999988776 88999999999888 556789999999999999999999998777 Q ss_pred HHHHH----HHHHHHHHh-c-----cCC---------CcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhC- Q lcl|NC_021326. 341 VAIQE----LLWFVFEHF-D-----IKG---------EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENH- 398 (445) Q Consensus 341 ~~l~~----~~~~~~~~~-~-----~~~---------~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l- 398 (445) ..+++ .+.+++... + ... .-..+.++|.+.+|.|.++.++.++.+ +|++|.+||+.+| T Consensus 407 ~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~~v~ivf~p~~P~d~~~vv~~~~tl~~aGiiSretAv~~L~ 486 (563) T protein:vir:74 407 VVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNECSVVCIFADPMPVNKTQVTQDTLLLQQAHLILRKMAVAKLR 486 (563) T ss_pred HHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCceEEEEEeCCCCCccHHHHHHHHHHHHHcCchhHHHHHHHHH Confidence 76666 344444221 1 100 112478899999999999999988776 6999999998887 Q ss_pred --CCC-CCHHHHHHHHHHHHHHHHHhh---hc---cccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 399 --PFV-EDLQAELERIEQEQMEYNKQL---PN---LDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 399 --~~~-~d~~~E~~ri~~E~~~~~~~~---~~---~~~~~~~~~~~~~~~~d~~~~ 445 (445) +|. +|++.|+++|+.++=..+..+ .+ .....++++.++++.+|..+- T Consensus 487 ~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~~~a~~~~g~~~~~~dd~g~p 542 (563) T protein:vir:74 487 SIGWEYPEVDDQGNALTDDDIADMLLAEAEADASLGLSAMDNGGAGEQQFDDQGNP 542 (563) T ss_pred hCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccCcccceecccCCCCcccccccCCc Confidence 654 477888888876653332111 11 112223444444444444322 No 74 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=100.00 E-value=1.8e-31 Score=189.13 Aligned_cols=418 Identities=11% Similarity=0.055 Sum_probs=269.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCc--------cccccccccccccccccccccccccchHHHHHHHHHhhhhccCe Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPD--------IVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPI 72 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~--------i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~ 72 (445) =|..--..|....++++...+.|.|... ++..+.+....+.....++ +-.|+++.+++.++|++|.++| T Consensus 2 ~V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA---~~~n~~~~t~~~~~G~vf~k~p 78 (452) T protein:vir:94 2 PIETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRA---LFYSITSKTLSALSGMVLDQPP 78 (452) T ss_pred CCCCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhc---cCCchHHHHHHHHhchhhcCCc Confidence 1111123466777888999999988543 2222222223333223332 3469999999999999999999 Q ss_pred eeccCchHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCC-cEEEEEEccceeEEEEcCCCCCceEEEE-EE Q lcl|NC_021326. 73 AFKHTDDEVIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEG-EFKLFRVPAEQGIPIWTDKEHEELEAFI-RM 150 (445) Q Consensus 73 ~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g-~~~i~~~~p~~~~~v~d~~~~~~~~~~v-~~ 150 (445) +++..+.- .....+-..++++.+...+...++.+|+++++|..+..| +|.+..++|.+++- |+.+..+.+..++ |. T Consensus 79 ~~~~p~~l-~~~~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~~~~~~~Ii~-W~~~~~g~l~~v~lre 156 (452) T protein:vir:94 79 VITHPDAM-SKYFEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYISVYTTENILN-WEEDEDGRLLMVVLRE 156 (452) T ss_pred eecccHHH-HHHHhcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEEEechhhhcC-ccccccCCeeEEEEEE Confidence 98765332 122112234789999999999999999999999988765 79999999999875 5544445554432 22 Q ss_pred Eeee--cce-----eEE---EEe--cceEEEEEEecceeeecccccccccccccccccccccceEEecCCC----CcCcc Q lcl|NC_021326. 151 YKLE--NET-----KVE---YWD--KITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND----LEISD 214 (445) Q Consensus 151 ~~~~--~~~-----~~~---~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~----~g~s~ 214 (445) .... ..+ .++ ++. ++.+..++++...... -.......+....|+++.||||++.... .+.+. T Consensus 157 ~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~--~~~~~~~~~~~~~~~l~~IP~v~~~~~~~~~~~~~pP 234 (452) T protein:vir:94 157 FYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKV--WELAKTSTIQNVGVTMDYIPFFCITPSGLSMTPAKPP 234 (452) T ss_pred EEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCce--eeeccceeecCCCcccceeEEEEEcCCCCCCCCCccc Confidence 2111 111 111 111 1222222211110000 0012234445567899999999886543 24667 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeeccC-CCceeeEeccCC-hHHHHHHHH Q lcl|NC_021326. 215 IFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSD-NGGVDTIQVEVP-VENSKKYLD 292 (445) Q Consensus 215 ~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~l~~~~~-~~~~~~~i~ 292 (445) +.++..+.-++.+..|++.+.++..++|++++.|.+.... ..++...++.+++ +++++|++.+.+ .+..+..++ T Consensus 235 Ll~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~~~~~~----i~iG~~~~~~lpe~~~~~~yie~~g~~i~~~~~~l~ 310 (452) T protein:vir:94 235 MIDIVDINYSHYRTSADLEHGRHFTGLPTPWITGAESQST----MHIGSTKAWVIPEVAAKVGFLEFTGQGLQSLEKALS 310 (452) T ss_pred hHHHHHHHHHHhcchhHHHHHHHHcccceeEeecCcCCCc----eEecccccccCCCCCCcceEEccCchhHHHHHHHHH Confidence 8888889889999999999999999999999999765321 2345666788886 888999998864 477889999 Q ss_pred HHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCC Q lcl|NC_021326. 293 ELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKV 372 (445) Q Consensus 293 ~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p 372 (445) .+++.+...+.- +-.....++.|++|.........+........+..++++++++++.++|...+ ..+++.-.-..+ T Consensus 311 ~le~~m~~~Ga~--ll~~~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al~~~l~~~a~w~g~~~~-~~v~~n~dF~~~ 387 (452) T protein:vir:94 311 EKQAQLASLSAR--LIDNSTRGSEATETVKLRYMSETASLKSVTRAVEALLNKAYSCIMDMESMGGT-LNIKLNSAFLDS 387 (452) T ss_pred HHHHHHHHHHHH--hhccCCCcchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCc-eEEEeccccccc Confidence 999998877642 22223345778888877777777788888888999999999999999997532 233332111222 Q ss_pred CCHHHHHHHHHHH--hccCChHHHHHhC--CCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 373 ANTELQVQTAQQS--MGIVSHETVLENH--PFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 373 ~d~~~~~~~~~~~--~g~~s~et~l~~l--~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) .-..+.++++.++ .|.+|++|++..| .++-|+++|.+++..|...... .+.++.+++.|+ T Consensus 388 ~~~~~~~~al~~~~~~G~is~~t~~~~L~~~gvl~~~~e~~~i~~E~~~~~~-------------~~~~~~~~~~~~ 451 (452) T protein:vir:94 388 KLTAAELKAWVEAYLSGGISKEIYIHALKVGKVLPPPGESMGVIPDPPAPEP-------------SPSNTPPNPSSK 451 (452) T ss_pred cCCHHHHHHHHHHHhcCCCcHHHHHHHHHhCCCCCCccCHHHHHHHhhccCc-------------ccCCCCCCCccC Confidence 2234566666654 6899999998766 4556777888888877543211 111112222222 No 75 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=100.00 E-value=8.6e-31 Score=185.41 Aligned_cols=429 Identities=11% Similarity=0.081 Sum_probs=274.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCC--------ccccccccccccccccccccccccccchHHHHHHHHHhhhhccCe Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRP--------DIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPI 72 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~--------~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~ 72 (445) .+..=-..|....++++.+.+.|.|.. -++..+.+....+.....++ +..|+++.+++.+++++|.++| T Consensus 8 ~v~~~h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA---~~~n~~~~tl~~l~G~vf~k~p 84 (513) T protein:vir:97 8 SPATTSGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASA---VLLNMVEQTLDTLSGKPFSEPI 84 (513) T ss_pred CCCcCCHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcc---cCCChHHHHHHHHhhhhhhcCc Confidence 333333456678899999999998853 23333444444444333333 4579999999999999999999 Q ss_pred eeccCchH-HHH-HHHHHh--ccCHHHHHHHHHHHHHhcCeEEEEEEECCCC------------------cEEEEEEccc Q lcl|NC_021326. 73 AFKHTDDE-VIK-RIDEVL--GNRFDDKLHSVLTGASNKGIEWLHPYLDEEG------------------EFKLFRVPAE 130 (445) Q Consensus 73 ~~~~~d~~-~~~-~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g------------------~~~i~~~~p~ 130 (445) ++..+... ..+ +++++. .++++.++..+.+.++.+|+++++|..+..+ +|.+..++|. T Consensus 85 ~~~~~~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~~~~~rPy~~~~~~e 164 (513) T protein:vir:97 85 KLNEDVPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLADDRREGLRPYWVMIKPE 164 (513) T ss_pred ccCcCchHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHHHHHhhccCceEEEecHh Confidence 88654333 222 334444 3689999999999999999999999876432 4889999999 Q ss_pred eeEEEEcC-CCCC--ceEEEEEEEe--eeccee-------EEEEecceEEEEEEecceeeeccccccccccccccccccc Q lcl|NC_021326. 131 QGIPIWTD-KEHE--ELEAFIRMYK--LENETK-------VEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWG 198 (445) Q Consensus 131 ~~~~v~d~-~~~~--~~~~~v~~~~--~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 198 (445) +++- |+. .+.+ .+.. +++.. .+.+.. +.+++.+.+..|+...... ....++.......|+++ T Consensus 165 ~Iin-W~~~~v~G~~~L~~-v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~~~~~~----~~~~e~~~~~~g~~~l~ 238 (513) T protein:vir:97 165 CLLF-ARSEVINGVEVLQH-VRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEPVKKSN----AQKEEWALADEWATGLN 238 (513) T ss_pred hhcC-cceeccCcceeeee-EEEEEEEeecCCCcceEEEEEEEEeCceEEEEEeecCCC----ccccceEEecCCCCcCC Confidence 9865 432 1222 2333 33322 121110 1123333333322211110 11123444455678999 Q ss_pred ccceEEecCCC----CcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeeccC-CC Q lcl|NC_021326. 199 KIPFIPFKNND----LEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSD-NG 273 (445) Q Consensus 199 ~iPvv~~~n~~----~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~-~~ 273 (445) .||||++.... .+.+-|.++..+..++-+..|++.++++..++|++++.|.+.+..+. ..+....++.+++ ++ T Consensus 239 ~IP~v~~~~~~~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G~~~~~~~~--i~iG~~~~~~lpe~~~ 316 (513) T protein:vir:97 239 YVPLVTFYADRQGFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACSGASGEDSDP--VVVGPNKVLYNPDPAG 316 (513) T ss_pred ceeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeeecCCcCCCCc--eEeeccccccCCCCCC Confidence 99999987433 24667888899988888999999999999999999999986653222 1244555677785 78 Q ss_pred ceeeEeccCC-hHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 274 GVDTIQVEVP-VENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFE 352 (445) Q Consensus 274 ~~~~l~~~~~-~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~ 352 (445) +++|++.+.+ .+..+..++.+++.|...+..+- ...+++.||+|.+.......+........+..++++++++++. T Consensus 317 ~~~yie~~g~~i~~~~~~l~~le~qm~~~Ga~ll---~~~~~~~Ta~a~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~ 393 (513) T protein:vir:97 317 RFYYVEHTGQAIAAGRTDLKDLEEQMAGYGAEFL---KRKTGGQTATARALDSAEATSDLSAMTGLFEDALAQALDITAD 393 (513) T ss_pred cceeeccCchhHHHHHHHHHHHHHHHHHHHHHhh---ccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 8999999865 46778889999999988775431 2234579999999999999999999999999999999999999 Q ss_pred HhccCCCcceEEE--EeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhC---CCC---CCHHHHHHHHHHHHHHHHHhh Q lcl|NC_021326. 353 HFDIKGEHKDVDI--SFNYNKVANTELQVQTAQQS--MGIVSHETVLENH---PFV---EDLQAELERIEQEQMEYNKQL 422 (445) Q Consensus 353 ~~~~~~~~~~i~v--~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l---~~~---~d~~~E~~ri~~E~~~~~~~~ 422 (445) ++|...+...+++ .|..... ..+.++++.++ .|.+|.+|.++.| +.+ .|.+++.++++++.++..... T Consensus 394 wlg~~~~~~~v~in~dF~~~~~--~~~~~~al~~a~~~G~is~~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~~~~~~~~ 471 (513) T protein:vir:97 394 WLRLGPNGGTVELVKDYDLEEM--DAPGLQALQVAREKRDISRKTYLNGLRLRGVLPEDFDEDEDWEELMEEISEAMGRA 471 (513) T ss_pred HhCCCCCccEEEeccccCcccC--CHHHHHHHHHHHhCCCCCHHHHHHHHHhccCCCccCCHHHHHHHHHHhhhhccCCC Confidence 9997654333333 2432111 23455665554 6899999987655 222 145566777776654433211 Q ss_pred ----hccccC---CCCCCCCCCCCCCcCCC Q lcl|NC_021326. 423 ----PNLDDG---GADGAQQKERSNDKQSE 445 (445) Q Consensus 423 ----~~~~~~---~~~~~~~~~~~~d~~~~ 445 (445) ...... +..+..+++.++.+.+| T Consensus 472 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (513) T protein:vir:97 472 GLDLDPAQKNPPEGGEGEGEGEGEGGEGGE 501 (513) T ss_pred CccccccCCCCCCCCCCCCCCCCCCCCCCC Confidence 111111 11111111111111111 No 76 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=99.96 E-value=7.5e-28 Score=169.31 Aligned_cols=433 Identities=11% Similarity=0.063 Sum_probs=260.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccc-----cccc--------cccccccccccccccccccchHHHHHHHHHhhh Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIV-----KEPK--------PVDATGAVDPLKPDDRMITNFHANLVDQKVSYI 67 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~-----~~~~--------~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l 67 (445) =|..==..|....++++...+.+.|...+- +.|. +....+.....++ +..|+++.+++.+++++ T Consensus 34 dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA---~~~n~~~~tl~~l~G~v 110 (535) T protein:vir:80 34 NVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRA---IFYNVTARTLDGMMGQV 110 (535) T ss_pred CCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhc---cCCChhHHHHHHHhchh Confidence 122222335667788889999998863211 1111 1112233333332 45799999999999999 Q ss_pred hccCeeeccCchHHHHHHHHHh--ccCHHHHHHHHHHHHHhcCeEEEEEEECCCC-------------cEEEEEEcccee Q lcl|NC_021326. 68 VGKPIAFKHTDDEVIKRIDEVL--GNRFDDKLHSVLTGASNKGIEWLHPYLDEEG-------------EFKLFRVPAEQG 132 (445) Q Consensus 68 ~g~~~~~~~~d~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g-------------~~~i~~~~p~~~ 132 (445) |.++++++.. +....+++++. .++++.++..+.+.++.+|+++++|.....+ +|.+..++|.++ T Consensus 111 frk~p~~~~p-~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~ade~~~~~rPy~~~y~ae~I 189 (535) T protein:vir:80 111 FSRDPIRQLP-PALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVLEQKLGLYRPTITLVHPTSI 189 (535) T ss_pred hcCCcceecc-HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHHHHHhcCCCcEEEEechhhc Confidence 9999988643 33444444443 2589999999999999999999999876554 488999999997 Q ss_pred EEEEcCC-CC--CceEEEE-EEEe-eecce----eEE---EEec-----ceEEEEEEecceeeecccccccccccccccc Q lcl|NC_021326. 133 IPIWTDK-EH--EELEAFI-RMYK-LENET----KVE---YWDK-----ITVNYYVYENGSLIPDYSNNLENSKTHFSTG 195 (445) Q Consensus 133 ~~v~d~~-~~--~~~~~~v-~~~~-~~~~~----~~~---~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 195 (445) +- |+.. +. ..+..++ +-.. ..++. .++ +... +.++.|..+... ...........+....| T Consensus 190 in-W~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~~~G~y~v~~~~~~~~~--~~~~~~~~~~~~~~g~~ 266 (535) T protein:vir:80 190 IN-WRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLNAEGNYQVERWRRETQE--EMYYSYSKHVPTDGNGN 266 (535) T ss_pred cC-ccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEecCCceEEEEEEEeecCC--ccccccceeecccCCCc Confidence 65 4432 22 2344332 2221 11111 111 1111 111111111110 00111112223345678 Q ss_pred cccccceEEecCCC----CcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhH----HHhhhhCcee Q lcl|NC_021326. 196 SWGKIPFIPFKNND----LEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEF----KRLLRYYGAI 267 (445) Q Consensus 196 ~~g~iPvv~~~n~~----~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~----~~~~~~~~~~ 267 (445) +++.||||++.... .+.+-|.++..+.-++-+.-|++.+.++..++|++++.|.+.....+. ...+....++ T Consensus 267 ~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~il~~~~~P~l~i~G~~~~~~~~~~~~~~i~iG~~~~~ 346 (535) T protein:vir:80 267 PFKEIPFQFIGPLDNNADIDHPPLLDLCEVNIGHYRNSADYEEMAFVAGQPTAFFTGLTKDWVEDVFKDFKVHLGSRAII 346 (535) T ss_pred ccCeeEEEEeecCCCCCCCCccchHHHHHHHHHHhhchhHHHHHHHHhcCceeeeecCchhhhhcCCCCcceEecCcccc Confidence 99999999885332 245667788888778888889999999999999999999865432211 1224455677 Q ss_pred eccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 268 KVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELL 347 (445) Q Consensus 268 ~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~ 347 (445) .++++++++|++.+.+.-+. ..++.+.+.+..+....- ....++.++++.+...+...+........+..++.+++ T Consensus 347 ~lP~~~~~~~~e~~~~~~a~-~~l~~~e~qM~~lGa~ll---~~~~~~~Ta~~a~~~~~~~~S~L~~~a~~le~al~~aL 422 (535) T protein:vir:80 347 PLPQGATAGILQITPNSVPF-EAMTHKESQMIAMGANLL---VKSGGNRTFGEAQQEEASEQSILSACTKNVSMAFRKAL 422 (535) T ss_pred cCCCCCCcceeeeccchhHH-HHHHHHHHHHHHHHHHhh---ccCcccccHHHHHHHHHHHhHHHHHHHHHHHHHHHHHH Confidence 89999999999887654443 457777777777654331 22234555555566667777788888999999999999 Q ss_pred HHHHHHhccCCCcceEEEEeCC-CCCCC-HHHHHHHHHHH--hccCChHHHHHhC---CCC---CCHHHHHHHHHHHHHH Q lcl|NC_021326. 348 WFVFEHFDIKGEHKDVDISFNY-NKVAN-TELQVQTAQQS--MGIVSHETVLENH---PFV---EDLQAELERIEQEQME 417 (445) Q Consensus 348 ~~~~~~~~~~~~~~~i~v~f~~-~~p~d-~~~~~~~~~~~--~g~~s~et~l~~l---~~~---~d~~~E~~ri~~E~~~ 417 (445) ++++.++|...+..+++|..++ ....+ ..+.++.+.++ .|.+|.+|++..| +.+ .+.++|+.||+.|..+ T Consensus 423 ~~~A~w~G~~~~~~~~~i~~n~dF~~~~ld~~~~~all~~~~~G~Is~et~~~~L~r~gvl~~~~~~eee~~ri~~E~~~ 502 (535) T protein:vir:80 423 RWANQFQTGIVNDETVEYNLNTDFPAARLTPNERAELILEWQQGAITFKEMRAGLRRAGVASEDDAKAETEGKATVEFIA 502 (535) T ss_pred HHHHHHcCCccCCCceEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcccchHHHHHHHHhhhhh Confidence 9999999976554555554332 12222 23455556555 6899999997655 333 2356778888888655 Q ss_pred HHHhhhccccCCCCCCCC------CCCCCCcCCC Q lcl|NC_021326. 418 YNKQLPNLDDGGADGAQQ------KERSNDKQSE 445 (445) Q Consensus 418 ~~~~~~~~~~~~~~~~~~------~~~~~d~~~~ 445 (445) .........+ .+.++.+ .+...++.+. T Consensus 503 ~~~~~g~~~d-~~~~g~~~~~~~~~~~~~~~~~~ 535 (535) T protein:vir:80 503 KTAAAGKVGD-AASGGTNKAKLNNGNGGGNQAGN 535 (535) T ss_pred ccccCCCCCC-CCCCCCCcCcccCCccccccCCC Confidence 4333222121 1111121 1111222222 No 77 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=99.95 E-value=7.1e-27 Score=163.93 Aligned_cols=430 Identities=10% Similarity=0.049 Sum_probs=258.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCcc-----ccccc--------cccccccccccccccccccchHHHHHHHHHhhh Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDI-----VKEPK--------PVDATGAVDPLKPDDRMITNFHANLVDQKVSYI 67 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i-----~~~~~--------~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l 67 (445) =|..==..|....++++...+.+.|...+ .+.|. +....+.....++ +-.|+++.+++.+++++ T Consensus 3 ~V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA---~~~n~~~~t~~~l~G~v 79 (501) T protein:vir:95 3 NVSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRA---VFYNVARRTLFGLVGQV 79 (501) T ss_pred CCCCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhcc---ccCchHHHHHHHHhhhh Confidence 01111124567778899999999986432 11111 1112222222322 45799999999999999 Q ss_pred hccCeeeccCchHHHHHHHHHh--ccCHHHHHHHHHHHHHhcCeEEEEEEECCCC---------------cEEEEEEccc Q lcl|NC_021326. 68 VGKPIAFKHTDDEVIKRIDEVL--GNRFDDKLHSVLTGASNKGIEWLHPYLDEEG---------------EFKLFRVPAE 130 (445) Q Consensus 68 ~g~~~~~~~~d~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g---------------~~~i~~~~p~ 130 (445) |.++|+++.+ +....+++++. .++++.++..+.+.++.+|+++++|..+..+ +|.+..++|. T Consensus 80 f~k~p~~~~p-~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~~~~rPy~~~~~~~ 158 (501) T protein:vir:95 80 FMRDPVVKVP-ALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEAGRIRPTLYVYSPT 158 (501) T ss_pred hcCCcceeCc-HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHhccCCcEEEEecHh Confidence 9999998643 33444455543 3589999999999999999999999875432 4889999999 Q ss_pred eeEEEEcCC-CCC--ceEEEE-EEEeeecce-----eEEE---Eec---c--eEEEEEEeccee-----eecc--ccccc Q lcl|NC_021326. 131 QGIPIWTDK-EHE--ELEAFI-RMYKLENET-----KVEY---WDK---I--TVNYYVYENGSL-----IPDY--SNNLE 186 (445) Q Consensus 131 ~~~~v~d~~-~~~--~~~~~v-~~~~~~~~~-----~~~~---~~~---~--~~~~~~~~~~~~-----~~~~--~~~~~ 186 (445) +++- |+.. +.+ .+..++ +-...+.+. .++. ... . .++.|+...... +... ..... T Consensus 159 ~Iin-W~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~~~~~~~~~~~~~~ 237 (501) T protein:vir:95 159 EIIN-WRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKADGSKIPKGNYQQYVV 237 (501) T ss_pred hhcC-cceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCcccCcceecCCcccccce Confidence 9765 4422 222 343332 211111111 1111 111 1 112222111110 0000 00111 Q ss_pred ccccccccccccccceEEecCCCC----cCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhH---HH Q lcl|NC_021326. 187 NSKTHFSTGSWGKIPFIPFKNNDL----EISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEF---KR 259 (445) Q Consensus 187 ~~~~~~~~~~~g~iPvv~~~n~~~----g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~---~~ 259 (445) +.......|+++.||||++..... +.+.+.++..+.-++-+.-|++.+.++..++|+++++|.+.+..... .. T Consensus 238 ~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~lA~lni~hy~~ssd~~~~l~~~~~P~l~i~G~~~~~~~~~~~~~i 317 (501) T protein:vir:95 238 YKPTDAQGKRLTEIPFMFIGSENNDSNPDNPNFYDLASLNMAHYRNSADYEESCYIVGQPTPVLIGLTEEWVTNVLKGSV 317 (501) T ss_pred eeeeccCCCcCCeeeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhHHHHHHHHcccceeeeeCCcccccccCCCCce Confidence 222234568999999998854332 35567677777666666778999999999999999999876432211 12 Q ss_pred hhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 260 LLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKA 339 (445) Q Consensus 260 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~ 339 (445) .+....++.+|++++++|++.+.+.- .+..++.+++.+....... . ...+++.||+|.+.......+........+ T Consensus 318 ~~G~~~~~~lP~~~~~~~ie~~~~~i-~~~~l~~l~~~m~~~Ga~l--l-~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~l 393 (501) T protein:vir:95 318 NFGSRGGIPLPVGADAKLLQASENTM-LKEAMDTKERQMVALGAKL--V-EQKEVQRTATEAELEAASEGSTLSSATKNV 393 (501) T ss_pred eecccccccCCCCCceeEEecChhhH-HHHHHHHHHHHHHHHHHhh--c-cCCccchhHHHHHHHHHHHhHHHHHHHHHH Confidence 23445577889999999999765443 3667888888887775332 1 233457899999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhccCCCcceEEEEeCCCC-CCC-HHHHHHHHHHH--hccCChHHHHHhC---CCCC-CHHHHHHHH Q lcl|NC_021326. 340 KVAIQELLWFVFEHFDIKGEHKDVDISFNYNK-VAN-TELQVQTAQQS--MGIVSHETVLENH---PFVE-DLQAELERI 411 (445) Q Consensus 340 ~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~-p~d-~~~~~~~~~~~--~g~~s~et~l~~l---~~~~-d~~~E~~ri 411 (445) ..++.+++++++.++|...+...+++ ++.. +.. ..+.++++.++ .|.+|.+|+++.| +.++ +.+.|.++| T Consensus 394 e~al~~~l~~~a~w~g~~~~~~~v~i--~~df~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~~v~~~~~~~e~e~i 471 (501) T protein:vir:95 394 SAAFEWALKWAARWVGQADSGVKFEL--NTDFDIARMTPDERRSLVEEWQKGAITFEEMRTGLRKAGVATEDDSKAKEKI 471 (501) T ss_pred HHHHHHHHHHHHHHcCCCCCceEEEE--ecccccccCCHHHHHHHHHHHhCCCCcHHHHHHHHHhCCCCChhHHHHHHHH Confidence 99999999999999998654444443 3322 222 34445666654 6899999996654 4443 455677777 Q ss_pred HHHHHHHHHhhhccccCCCCCCCCCCCCCCcC Q lcl|NC_021326. 412 EQEQMEYNKQLPNLDDGGADGAQQKERSNDKQ 443 (445) Q Consensus 412 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 443 (445) ..|..+.... +..+.+...+.......+.| T Consensus 472 ~~~~~~~~~~--~~~~~~~~~~~gg~~~~~~~ 501 (501) T protein:vir:95 472 AKDTAEAMAL--ATPANVPGDGSGGDNVGNSE 501 (501) T ss_pred HhhhcCcccc--cccCCCCCCCcccccccCCC Confidence 7665432111 11111111111111112222 No 78 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=99.95 E-value=2.2e-26 Score=161.27 Aligned_cols=419 Identities=12% Similarity=0.064 Sum_probs=257.2 Q ss_pred ChHH---------HHHHHHHHHHHHHHHHHHhcCCCc-------ccccccccc-ccccccccccccccccchHHHHHHHH Q lcl|NC_021326. 1 MIVR---------YIKQHLEKLPEISIGQEYYEQRPD-------IVKEPKPVD-ATGAVDPLKPDDRMITNFHANLVDQK 63 (445) Q Consensus 1 ~l~~---------~i~~~~~~~~~~~~~~~yy~G~~~-------i~~~~~~~~-~~~~~~~~~~~~ri~~n~~~~iv~~~ 63 (445) ||.. ==..|....++++...+.|.|... .+..+.... ..+.....++ +..|+++.+++.+ T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA---~~~n~~~~tl~~l 77 (489) T protein:vir:78 1 MLTENGQGSGVKTKHREWLHYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEARQAEYEAGG---IVYNFTRRTLSGM 77 (489) T ss_pred CccCCCccCCCCccCHHHHHHHHHHHHHHHHhcCcccccccCCCCCCCCCCCChHHHHHHHhcc---ccCChHHHHHHHH Confidence 3321 123456777889999999999531 111111111 1122222222 3579999999999 Q ss_pred HhhhhccCeeeccCchHHHHHHHHHh--ccCHHHHHHHHHHHHHhcCeEEEEEEECCCC------------cEEEEEEcc Q lcl|NC_021326. 64 VSYIVGKPIAFKHTDDEVIKRIDEVL--GNRFDDKLHSVLTGASNKGIEWLHPYLDEEG------------EFKLFRVPA 129 (445) Q Consensus 64 ~~~l~g~~~~~~~~d~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g------------~~~i~~~~p 129 (445) +|++|.++|+++.++ ....+++++. .++++.++..+.+.++.+|+++++|..+..+ +|.+..++| T Consensus 78 ~G~vfrk~p~~~~p~-~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~~~~rPy~~~~~~ 156 (489) T protein:vir:78 78 VGSVMRKEPEINIPK-ELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNAGLLNPTIAFYTT 156 (489) T ss_pred hchhhcCCcceeccH-HHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHHhcCCcEEEEech Confidence 999999999986543 2444555444 2689999999999999999999999987655 689999999 Q ss_pred ceeEEEEcCCCC--CceEEEEEEEe--e-eccee--------EEEEecc-----eEEEEEEecceeeecccccccc---c Q lcl|NC_021326. 130 EQGIPIWTDKEH--EELEAFIRMYK--L-ENETK--------VEYWDKI-----TVNYYVYENGSLIPDYSNNLEN---S 188 (445) Q Consensus 130 ~~~~~v~d~~~~--~~~~~~v~~~~--~-~~~~~--------~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~---~ 188 (445) .+++-.-...+. ..+..++..-. . +.... +.+++.. ++..|+.... ...... . T Consensus 157 ~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~------g~~~~~~~~~ 230 (489) T protein:vir:78 157 ENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAE------GGAQEDVVEI 230 (489) T ss_pred hhhcCceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecC------CcccceeeEE Confidence 998653222222 23444322211 1 11111 1111111 1111111111 111111 1 Q ss_pred ccccccccccccceEEecCCCC----cCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHH------ Q lcl|NC_021326. 189 KTHFSTGSWGKIPFIPFKNNDL----EISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFK------ 258 (445) Q Consensus 189 ~~~~~~~~~g~iPvv~~~n~~~----g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~------ 258 (445) .+....|+++.||||++..... +.+-|.++..+.-++-+.-|++.+.++..++|++++.|.+....+... T Consensus 231 ~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~G~d~~~~~~~~~~~~~~ 310 (489) T protein:vir:78 231 YPDLGESLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGENLTPQAFKEANPNG 310 (489) T ss_pred eccCCCCccCeeeEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCccCCcccccccCccc Confidence 1234568899999998864322 356677888887777788899999999999999999997643322111 Q ss_pred HhhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHh-CccccccccccCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 259 RLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFG-QAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLAR 337 (445) Q Consensus 259 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s-~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~ 337 (445) ..+.....+.++.+++++|++.+.+.. .+..++.+++.+..+. .+. .. +++.||++.+.......+....... T Consensus 311 i~~g~~~~~~lp~~~~~~~ie~~~~~~-~r~~l~~le~qm~~lGa~l~----~~-~~~~Ta~~~~~~~~~~~S~L~~~a~ 384 (489) T protein:vir:78 311 IKFGSRRGHNLGYGGSAQLIQAGENNL-ARQNMLDKEQQAIQIGAQLI----TP-TQQITAQSARIQRGADTSVMATIAR 384 (489) T ss_pred eeeCCcccccCCCCCCcceeccCcchH-HHHHHHHHHHHHHHHhhhhc----cC-CcchhHHHHHHHHHHhhHHHHHHHH Confidence 112344467788899999998875443 3666777777777654 332 22 3468999998888888999999999 Q ss_pred HHHHHHHHHHHHHHHHhccCCCc-ceEE--EEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhC--CCCCCHHHHHHH Q lcl|NC_021326. 338 KAKVAIQELLWFVFEHFDIKGEH-KDVD--ISFNYNKVANTELQVQTAQQS--MGIVSHETVLENH--PFVEDLQAELER 410 (445) Q Consensus 338 ~~~~~l~~~~~~~~~~~~~~~~~-~~i~--v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l--~~~~d~~~E~~r 410 (445) .+..++.+++++++.++|.+.+. ..++ ..|... +. ..+.++++.++ .|.+|.+|.+..| ..+-|+ ..++ T Consensus 385 ~~e~al~~~l~~~a~w~G~~~~~~~~i~~n~dF~~~-~~-d~~~~~al~~~~~~G~is~~t~~~~L~~~gv~d~--~~e~ 460 (489) T protein:vir:78 385 NVSQAYTDALRWVAVMLGKPEDTEVEFRLNMDFFLE-PM-TAQDRAAWMADINAGLLPATAYYAALRKAGVTDW--TDAD 460 (489) T ss_pred HHHHHHHHHHHHHHHHcCCCCCCceEEEeecccCcc-cC-CHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCc--cHHH Confidence 99999999999999999986432 2332 234321 12 23456666654 6899999988755 233332 2233 Q ss_pred HHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCC Q lcl|NC_021326. 411 IEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQS 444 (445) Q Consensus 411 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 444 (445) ++.|.++. ....+..+++..+++.++++. T Consensus 461 ~~~ei~~~-----~~~~~~~~~g~~~~~~q~~~~ 489 (489) T protein:vir:78 461 IKDAVADQ-----PLPVATEVQGEIPQSAQQQEK 489 (489) T ss_pred HHHHHhhc-----CCCcccCCcccCCCCcccccC Confidence 33333221 112223344455555555555 No 79 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=99.93 E-value=2.9e-25 Score=155.12 Aligned_cols=421 Identities=12% Similarity=0.072 Sum_probs=252.5 Q ss_pred ChH---------HHHHHHHHHHHHHHHHHHHhcCCCc-------cccccccccc-cccccccccccccccchHHHHHHHH Q lcl|NC_021326. 1 MIV---------RYIKQHLEKLPEISIGQEYYEQRPD-------IVKEPKPVDA-TGAVDPLKPDDRMITNFHANLVDQK 63 (445) Q Consensus 1 ~l~---------~~i~~~~~~~~~~~~~~~yy~G~~~-------i~~~~~~~~~-~~~~~~~~~~~ri~~n~~~~iv~~~ 63 (445) ||. .==..|....++++...+.|.|..- ++..+..... .+.....++ +..|+++.+++.+ T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA---~~~n~~~~tl~~l 77 (491) T protein:vir:95 1 MLTANGQGSGVKTKHREWLHYAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGG---IVYNFTRRTLSGM 77 (491) T ss_pred CcccCCccCCCCccCHHHHHHHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcc---cCCChHHHHHHHH Confidence 222 1223456777889999999998431 1111111111 122222222 4569999999999 Q ss_pred HhhhhccCeeeccCchHHHHHHHHHh--ccCHHHHHHHHHHHHHhcCeEEEEEEECCCC------------cEEEEEEcc Q lcl|NC_021326. 64 VSYIVGKPIAFKHTDDEVIKRIDEVL--GNRFDDKLHSVLTGASNKGIEWLHPYLDEEG------------EFKLFRVPA 129 (445) Q Consensus 64 ~~~l~g~~~~~~~~d~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g------------~~~i~~~~p 129 (445) +|++|.++|+++.++ ....+++++. .++++.++..+.+.++.+|+++++|..+..+ +|.+..++| T Consensus 78 ~G~vfrk~p~~~~p~-~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ade~~~~~rPy~~~~~~ 156 (491) T protein:vir:95 78 VGSVMRKEPEINIPK-ELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAEQNAGLLNPTIAFYTT 156 (491) T ss_pred hchhhcCCceeeccH-HHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHHHHHhcCCcEEEEech Confidence 999999999986543 3444555444 2689999999999999999999999987654 589999999 Q ss_pred ceeEEEEcCCC--CCceEEEEEEEee---ec-cee-----EE---EEe-----cceEEEEEEecceeeeccccccccccc Q lcl|NC_021326. 130 EQGIPIWTDKE--HEELEAFIRMYKL---EN-ETK-----VE---YWD-----KITVNYYVYENGSLIPDYSNNLENSKT 190 (445) Q Consensus 130 ~~~~~v~d~~~--~~~~~~~v~~~~~---~~-~~~-----~~---~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 190 (445) .+++-.-...+ ...+..+ ++... .+ ... ++ ++. .+.+..|+..... .......+..+ T Consensus 157 ~~IinW~~~~v~g~~~L~~v-~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~g---~~~~~~~~~~~ 232 (491) T protein:vir:95 157 ENIVNWRLTRVGSVNRVTMV-VLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQRLFRFDAEG---GAQEEVVEIYP 232 (491) T ss_pred hhhcCceeeeeCCceeeeEE-EEEEeEEeecCCCCcccceEEEEEEEeecCCCceEEEEEEEcCCC---cceeeeeeeee Confidence 99865321111 2234443 22221 11 110 11 111 0111111111000 00011122223 Q ss_pred ccccccccccceEEecCCCC----cCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHH------h Q lcl|NC_021326. 191 HFSTGSWGKIPFIPFKNNDL----EISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKR------L 260 (445) Q Consensus 191 ~~~~~~~g~iPvv~~~n~~~----g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~------~ 260 (445) ....|+++.||||++..... +.+-|.++..+.-++-+.-|++.+.++..++|++++.|.+....+.... . T Consensus 233 ~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~G~d~~~~~~~~~~~~~~i~ 312 (491) T protein:vir:95 233 DLGESLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGDNLTPQSFKEANPNGIK 312 (491) T ss_pred cCCCcccCeeEEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCcccCcchhhccCcceeE Confidence 44578999999998864322 3556778888877777788999999999999999999976533222111 1 Q ss_pred hhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHh-CccccccccccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 261 LRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFG-QAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKA 339 (445) Q Consensus 261 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s-~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~ 339 (445) +.....+.+|.+++++|++.+.+.- .+..++.++..+.... .+. .. +++.||++.........+........+ T Consensus 313 ~g~~~~~~lP~~~~~~~ie~~~~~~-~~~~l~~~e~qm~~~Ga~l~----~~-~~~~Ta~~~~~~~~~~~S~L~~~a~~~ 386 (491) T protein:vir:95 313 FGSRCGHNLGYGGSAQLIQAGENNL-ARQNMLDKEQQAIQIGAQLI----TP-SQQITAESARIQRGADTSVMATIARNV 386 (491) T ss_pred ecCcCCcCCCCCCccceeecCcchH-HHHHHHHHHHHHHHHHHHhc----cC-CcchhHHHHHHHHHHhhHHHHHHHHHH Confidence 2333456678899999999875543 4666777777666653 332 12 346899999999899999999999999 Q ss_pred HHHHHHHHHHHHHHhccCCCc-ceEE--EEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCC--CCC--CHHHHHHH Q lcl|NC_021326. 340 KVAIQELLWFVFEHFDIKGEH-KDVD--ISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHP--FVE--DLQAELER 410 (445) Q Consensus 340 ~~~l~~~~~~~~~~~~~~~~~-~~i~--v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~--~~~--d~~~E~~r 410 (445) ..++.+++++++.++|.+.+. ..++ ..|... +. ..+.++++.++ .|.+|.+|.+..|- .+. +.+++.++ T Consensus 387 e~al~~~l~~~a~w~G~~~~~~v~i~~n~dF~~~-~~-~~~~~~all~~~~~G~is~~t~~~~L~~~~vl~~~~e~~~~~ 464 (491) T protein:vir:95 387 SQAYTDALRWVAMMLGKPEDSEVEFQLNMDFFLQ-PM-TAQDRAAWMADINAGLLPATAYYAALRKAGVTDWTDEDILNA 464 (491) T ss_pred HHHHHHHHHHHHHHcCCCCCCceEEEeecccccc-cC-CHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCccHHHHHHH Confidence 999999999999999976442 2332 334321 12 23456666654 68999999887552 332 33455555 Q ss_pred HHHHHHHHHHhhhccccCCCCCCCCCCC Q lcl|NC_021326. 411 IEQEQMEYNKQLPNLDDGGADGAQQKER 438 (445) Q Consensus 411 i~~E~~~~~~~~~~~~~~~~~~~~~~~~ 438 (445) |++|.-.. .......+..+..++++++ T Consensus 465 ie~~~~~~-~~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 465 IEDAPLPS-GAVTQVAGEIPQAAQQQQE 491 (491) T ss_pred HHhcCCCC-CccccccccchhhhhhccC Confidence 54332100 0001111222222222222 No 80 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=99.91 E-value=1e-23 Score=146.64 Aligned_cols=411 Identities=12% Similarity=0.065 Sum_probs=232.2 Q ss_pred ChHHHH--------------HHHHHHHHHHHHHHHHhcC------CCcccccc-----ccccccccccccccc-----c- Q lcl|NC_021326. 1 MIVRYI--------------KQHLEKLPEISIGQEYYEQ------RPDIVKEP-----KPVDATGAVDPLKPD-----D- 49 (445) Q Consensus 1 ~l~~~i--------------~~~~~~~~~~~~~~~yy~G------~~~i~~~~-----~~~~~~~~~~~~~~~-----~- 49 (445) ||+-|+ ..|....+++++..+-+.| ..-+|..+ .+....+.....+.+ . T Consensus 1 ~~~~~~~~~~~~~m~V~~~hp~y~a~~~~W~~~~d~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~~ 80 (488) T protein:vir:96 1 MLKCLYIKHRGFFMLTPIYHPDYLVNAPQWLRNLDCVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDLT 80 (488) T ss_pred CceeEEEeecceeecccccCHHHHHHhhhhhHhhhhhhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchhhhHhhh Confidence 222221 1222333333333222221 11111100 000001111111000 0 Q ss_pred --c-cccchHHHHHHHHHhhhhccCeeeccCc-hHHHHHHHHHh--ccCHHHHHHHHHHHHHhcCeEEEEEEECCCC--- Q lcl|NC_021326. 50 --R-MITNFHANLVDQKVSYIVGKPIAFKHTD-DEVIKRIDEVL--GNRFDDKLHSVLTGASNKGIEWLHPYLDEEG--- 120 (445) Q Consensus 50 --r-i~~n~~~~iv~~~~~~l~g~~~~~~~~d-~~~~~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g--- 120 (445) | +-.|+++.+++.++|++|.++|+++.++ +....+++++. .++++.++..+.+.++.+|+++++|..++.+ T Consensus 81 ~~rA~~~n~~~~tl~~l~G~vfrk~p~~~~~~~~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~T~ 160 (488) T protein:vir:96 81 WRLANYVNIVNPTMNAITGAVMRREPEFDTMDNPVLIGLRDNIDGKGNGIDQECKQALNALQWGSRCGWLVRSHPESATM 160 (488) T ss_pred hhccccCchhHHHHHHhcchhhccCceeccCCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCcCCH Confidence 1 3459999999999999999999998654 45656666554 3689999999999999999999999987654 Q ss_pred --------cEEEEEEccceeEEEEcCCCCC--ceEEEEEEEe-eecceeEEEEecceEEEEEEecc-eeee---cccccc Q lcl|NC_021326. 121 --------EFKLFRVPAEQGIPIWTDKEHE--ELEAFIRMYK-LENETKVEYWDKITVNYYVYENG-SLIP---DYSNNL 185 (445) Q Consensus 121 --------~~~i~~~~p~~~~~v~d~~~~~--~~~~~v~~~~-~~~~~~~~~~~~~~~~~~~~~~~-~~~~---~~~~~~ 185 (445) +|.+..++|.+++-.-...+.+ .+..+ ++.. ....+....+....+..+....+ +.+. ...... T Consensus 161 ade~~~~~rPy~~~~~a~~IinW~~~~v~G~~~L~~v-~lrE~~~~~D~~~~~~~~~~~~~~l~~g~~~v~~~~~~~~~~ 239 (488) T protein:vir:96 161 ADWNKGKKLPTAAFYDALHIIDWEVEYIDGEEKLTYL-SLLEDYQERDGGTYVSKQRLINHRLVDGLCEFQEVTDDEYSD 239 (488) T ss_pred HHHHHhcCCcEEEEechhhhcCcceeccCCceeeEEE-EEEEEEEeccCCCcccceEEEEEEEECcEEEEEEEecCCccc Confidence 5899999999987532222222 34433 2222 11111111111222222222221 1111 111111 Q ss_pred cccccccccccccccceEEecCCC----CcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccc--hhHH- Q lcl|NC_021326. 186 ENSKTHFSTGSWGKIPFIPFKNND----LEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQEL--PEFK- 258 (445) Q Consensus 186 ~~~~~~~~~~~~g~iPvv~~~n~~----~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~--~~~~- 258 (445) ++.......|+++.||||++.... .+.+-+.++..+.-++-+.-|++.+.++..++|++++.+.+.... .... T Consensus 240 e~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~~~~~p~lv~~~~~~~~~~~~~~~~ 319 (488) T protein:vir:96 240 EWTPVLINSKQSDTIPFFLASSQSNEWCIDSTPLTSLAEISLSIYVMNAYSNKAMILANEAKWMVDMGDMNKTMASEMNP 319 (488) T ss_pred ceEeecCCCcccCeeEEEEEecCCCCCCCCCCchHHHHHHHHHHHhhhhHHHHHHHhcCCceeeeccCCCCccccccccc Confidence 222223457799999999986433 245667788888778888889999999999999998643322211 1110 Q ss_pred Hhhhh-CceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhC-ccccccccccCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 259 RLLRY-YGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQ-AVDFSSDKFGSAPSGVALEFLYTNLNLKADKLA 336 (445) Q Consensus 259 ~~~~~-~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~-~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~ 336 (445) ..+.. .......+.|+++|++.+.+.- .+..++.+++++...+. ++ .. +++.||++.........+...... T Consensus 320 ~g~~~~~~~~~~~~~g~~~~~e~~~~~l-~~~~l~~l~~qm~~~Ga~l~----~~-~~~~Ta~~~~~~~~~~~S~L~~~a 393 (488) T protein:vir:96 320 LGFTLAGRMPYYVKNGDVKVIQAQFSPE-TENKVEKLFEQAVKVGASLF----TQ-QSNETATGAAIRSGSSTASMATLG 393 (488) T ss_pred ceeeecccccccccCCceeecCCchhHH-HHHHHHHHHHHHHHHhHhhc----cC-CCcchHHHHHHHHHHhhHHHHHHH Confidence 00111 1111223467788877654432 36678888888776553 22 12 346789999888888899999999 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCc---ceEEEEeCCC-CCCC-HHHHHHHHHHH--hccCChHHHHHhC--CCCC----C Q lcl|NC_021326. 337 RKAKVAIQELLWFVFEHFDIKGEH---KDVDISFNYN-KVAN-TELQVQTAQQS--MGIVSHETVLENH--PFVE----D 403 (445) Q Consensus 337 ~~~~~~l~~~~~~~~~~~~~~~~~---~~i~v~f~~~-~p~d-~~~~~~~~~~~--~g~~s~et~l~~l--~~~~----d 403 (445) ..+..+++++++++++++|...+. ..++|..++. .... ..+.++++.++ .|.+|++|.+..| .++- + T Consensus 394 ~~le~al~~~l~~~A~w~g~~~~~~~~~~~~~~in~dF~~~~ld~~~~~al~~~~~~G~Is~~t~~~~L~~~gvl~~d~~ 473 (488) T protein:vir:96 394 NNVEDTVRNMLRFIMRYFEGTNLYVNPDELVFKLNRDYFDVEVNPQMLQVAYAAMMEGNLPQVSWFELLKRARVVRGDMS 473 (488) T ss_pred HHHHHHHHHHHHHHHHHcCCCCCCcCccceEEEeccCCCCccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCcCCccCC Confidence 999999999999999999987643 2344443322 2211 24456666665 6899999987655 2332 2 Q ss_pred HHHHHHHHHHHHHHHHHhh Q lcl|NC_021326. 404 LQAELERIEQEQMEYNKQL 422 (445) Q Consensus 404 ~~~E~~ri~~E~~~~~~~~ 422 (445) .++|.+||+++ .--+ T Consensus 474 ~e~~~~~ie~~----g~~~ 488 (488) T protein:vir:96 474 KEEFDEHIAEL----GFGM 488 (488) T ss_pred HHHHHHHHhhc----CCCC Confidence 34444444421 0000 No 81 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.85 E-value=1e-20 Score=130.23 Aligned_cols=433 Identities=15% Similarity=0.129 Sum_probs=210.6 Q ss_pred ChHHHHHHHHHH-------HHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCe- Q lcl|NC_021326. 1 MIVRYIKQHLEK-------LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPI- 72 (445) Q Consensus 1 ~l~~~i~~~~~~-------~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~- 72 (445) ++.+++..++.. +....+..+||.|+|= . .. .........++ .+++|.++.+|+..+++...+.+ T Consensus 46 ~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw--~--~~-~~~~l~~~g~p--~~~~N~i~~~i~~v~g~~~~nr~~ 118 (776) T protein:vir:93 46 LHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQW--S--QD-EIDELKERGQA--PTVYNVISQSVNWIIGSEKRGRSD 118 (776) T ss_pred HHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCC--C--HH-HHHHHHhcCCc--eEEecchHHHHHHHHHHHHhCCcc Confidence 555555544332 2344556789999861 1 01 01111222233 47899999999999999877654 Q ss_pred -eecc---CchHHHHH----HHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC--C-cEEEEEEccceeEEEEcCCC Q lcl|NC_021326. 73 -AFKH---TDDEVIKR----IDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE--G-EFKLFRVPAEQGIPIWTDKE 140 (445) Q Consensus 73 -~~~~---~d~~~~~~----l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~--g-~~~i~~~~p~~~~~v~d~~~ 140 (445) .+.. ++.+..+. ++.+++ |+.......+..+++++|.||+-|+.+.+ + .+++.+++|.+++ ||+.. T Consensus 119 ~~~~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~d~~~~~~~~~~~~~~p~~i~--~Dp~a 196 (776) T protein:vir:93 119 FKVLPRRKDGGKAAERKTALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQVQDENDGEPIYAGAESWRNIL--WDSTY 196 (776) T ss_pred eEEecCChhHHHHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEeeccCCCCceEeeccChhhee--ecccc Confidence 4433 23333343 344443 67888899999999999999999988653 3 4556788888875 44321 Q ss_pred CC----ceEEE-EEEEeeec----------------------------c---------------------------e--- Q lcl|NC_021326. 141 HE----ELEAF-IRMYKLEN----------------------------E---------------------------T--- 157 (445) Q Consensus 141 ~~----~~~~~-v~~~~~~~----------------------------~---------------------------~--- 157 (445) .. ...++ .+.|...+ . . T Consensus 197 ~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 276 (776) T protein:vir:93 197 RRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDAMDSPEYERSMNSVTAGAVAYARKRVR 276 (776) T ss_pred ccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhcccccccccccccccccccccccccccCCCeEE Confidence 10 11111 11110000 0 0 Q ss_pred eEEEEecceEEEEEEecc----eeeeccc-----------c----------------ccccccc--ccccccccccceEE Q lcl|NC_021326. 158 KVEYWDKITVNYYVYENG----SLIPDYS-----------N----------------NLENSKT--HFSTGSWGKIPFIP 204 (445) Q Consensus 158 ~~~~~~~~~~~~~~~~~~----~~~~~~~-----------~----------------~~~~~~~--~~~~~~~g~iPvv~ 204 (445) .+++|+...+...+.... ....... + ..+.... ...+.+.+++|+|+ T Consensus 277 v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~~g~~~l~~~~~p~~~~~~Pfv~ 356 (776) T protein:vir:93 277 MIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRMHCAIMTTRDLMWAGPSPYRHNRYPFTP 356 (776) T ss_pred EEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehheeeeeeEEEEEecchhhhccCCCCCCCccceEE Confidence 012222111110000000 0000000 0 0000011 11234457889987 Q ss_pred ecCC-----CCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHH-hhhhCceeeccCCC--cee Q lcl|NC_021326. 205 FKNN-----DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKR-LLRYYGAIKVSDNG--GVD 276 (445) Q Consensus 205 ~~n~-----~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~-~~~~~~~~~~~~~~--~~~ 276 (445) ++.. -.|.|.+..+++.++.+|..+|.+.+.+. +.++.+-.|..... ++... ..+.+.++.+..+. ..+ T Consensus 357 ~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l~--~~~~~~~~gav~~~-d~~~~~~~rp~~vi~~~~~~~~~~~ 433 (776) T protein:vir:93 357 IWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYILS--TNKVLMEEGAVDDI-DEFRREAARPDAVMTVKNGKLGAVK 433 (776) T ss_pred ecCceecccccccchHHhhhHHHHHHHHHHHHHHHhhc--CCceeeccccccch-HHHHHhcccCCceeeeCCccccccc Confidence 7642 24789999999999999999999988763 45555555544332 23222 23455666665554 334 Q ss_pred eEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_021326. 277 TIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI 356 (445) Q Consensus 277 ~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~ 356 (445) +.....-..++...+..+...|...|++.+.+.+..+++.||+|+..+............+.|..+++++.++++.+... T Consensus 434 ~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~~~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~ 513 (776) T protein:vir:93 434 MDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTNAVSGVAIQARQEQGSVATNKLFDNLRLAFQQHGEKELSLIEQ 513 (776) T ss_pred cccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 43322234567777888899999999998888887778899999988777766666666666666666666655544321 Q ss_pred C----------C---C--cc----------------eEEEEeCCCCCCCHHHHHHHHHHHhccCChHH-------HHHhC Q lcl|NC_021326. 357 K----------G---E--HK----------------DVDISFNYNKVANTELQVQTAQQSMGIVSHET-------VLENH 398 (445) Q Consensus 357 ~----------~---~--~~----------------~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~et-------~l~~l 398 (445) . + . +. +|.|.=.+..+.-..+..+.++.+.+.+..+. +++.. T Consensus 514 ~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~~~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~~e~~ 593 (776) T protein:vir:93 514 YMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEAEWRATMRQAAVAELMEVIGKMPPEIALTMLDLLVENM 593 (776) T ss_pred hcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeecccchhHHHHHHHHHHHHHhhcChhhHHHHHHHHHHhc Confidence 1 1 0 10 11111122222212223333333333222221 12221 Q ss_pred --CCCCCHHHHHHHHH---------------------HHHHHHHHhhhccc--cCCC-------CCCC---CCCCCCCcC Q lcl|NC_021326. 399 --PFVEDLQAELERIE---------------------QEQMEYNKQLPNLD--DGGA-------DGAQ---QKERSNDKQ 443 (445) Q Consensus 399 --~~~~d~~~E~~ri~---------------------~E~~~~~~~~~~~~--~~~~-------~~~~---~~~~~~d~~ 443 (445) +..++..+.+++.. .+..+..+...... ...+ .... +-.....+. T Consensus 594 d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~~~~q~q~~~~~a~~~~~qa~a~~~~aea~~~~aqa~~~~~~a 673 (776) T protein:vir:93 594 DIPNRDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQQQQQYNDALAIATLEEQQAKARKAAAEAQVAEAKAKHISRMA 673 (776) T ss_pred CccchHHHHHHHHHhhcccccchhhcchhHHHHHHHhhHHHHHHHHHhhhhhhHhhHHHHHHHHHHHHHhhhhhhhhhcc Confidence 11111111111110 00000000000000 0000 0000 000000000 Q ss_pred CC Q lcl|NC_021326. 444 SE 445 (445) Q Consensus 444 ~~ 445 (445) .. T Consensus 674 ~~ 675 (776) T protein:vir:93 674 IR 675 (776) T ss_pred hh Confidence 00 No 82 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.80 E-value=4.8e-19 Score=121.00 Aligned_cols=435 Identities=13% Similarity=0.061 Sum_probs=222.0 Q ss_pred ChHHHHHHHH-------HHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCee Q lcl|NC_021326. 1 MIVRYIKQHL-------EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA 73 (445) Q Consensus 1 ~l~~~i~~~~-------~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~ 73 (445) ++.++...+. +.+....+-.+||.|+|= . ... ........++ .+.+|.++.+|+..+|+-..+.+. T Consensus 31 ~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw-~---~~~-~~~l~~~g~p--~~~~N~i~~~v~~v~g~~~~nr~~ 103 (711) T protein:vir:10 31 LLATARERARDGATYWKDNWEAAEDDLKFLGGEQW-P---SQV-RTERELEQRP--CLVNNVLPTFVDQVLGDQRQNRPA 103 (711) T ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCC-C---HHH-HHHHHhcCCC--cEEEcchHHHHHHHhhhHhhCCcc Confidence 6666666543 233444566899999861 1 010 0111222222 478999999999999998776655 Q ss_pred e--cc-------------------------CchHHHHHHHH----Hhc-cCHHHHHHHHHHHHHhcCeEEEEEEECC--- Q lcl|NC_021326. 74 F--KH-------------------------TDDEVIKRIDE----VLG-NRFDDKLHSVLTGASNKGIEWLHPYLDE--- 118 (445) Q Consensus 74 ~--~~-------------------------~d~~~~~~l~~----~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~--- 118 (445) + .+ ++.+..+.+.. +.+ ++.......+..+++++|.||+-++.|. T Consensus 104 ~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~af~d~~~~G~G~~ev~~d~~~~ 183 (711) T protein:vir:10 104 IKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLAD 183 (711) T ss_pred eEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHhcChhHHHHHHHHHhhhcCcceEEEEecccCC Confidence 4 22 22344444433 333 5677888899999999999999887542 Q ss_pred ---CCcEEEEEE-ccceeEEEEcCCCC----CceE-EEEEEEeeecc------------------------------eeE Q lcl|NC_021326. 119 ---EGEFKLFRV-PAEQGIPIWTDKEH----EELE-AFIRMYKLENE------------------------------TKV 159 (445) Q Consensus 119 ---~g~~~i~~~-~p~~~~~v~d~~~~----~~~~-~~v~~~~~~~~------------------------------~~~ 159 (445) +|+++|+.+ +|.+++ ||+... .... ++.+.|...+. ... T Consensus 184 d~~~~e~~i~~v~~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~yp~~a~~~~~~~~~~~~~~~~~~~~vrv~ 261 (711) T protein:vir:10 184 DSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVS 261 (711) T ss_pred CCCCCCeEEeeecChhhee--eCccccccChhhhcceeeeecCCHHHHHHhCCchhhhhhhcccccccCcccCcceeeEE Confidence 478888777 798864 554211 1111 22222221000 012 Q ss_pred EEEecceEEEEEE--ecceeeeccc------------------------------ccccccccccccccccccceEEecC Q lcl|NC_021326. 160 EYWDKITVNYYVY--ENGSLIPDYS------------------------------NNLENSKTHFSTGSWGKIPFIPFKN 207 (445) Q Consensus 160 ~~~~~~~~~~~~~--~~~~~~~~~~------------------------------~~~~~~~~~~~~~~~g~iPvv~~~n 207 (445) ++|....+..... ..+....... ..+........+.+.+++|+|+|.- T Consensus 262 E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~G~~~L~~~~p~~~~~~P~vp~~g 341 (711) T protein:vir:10 262 EYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWG 341 (711) T ss_pred EEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeEEEEEEecceeecCCCCCCCCcccEEEEee Confidence 3333222111110 0000000000 0000011112334557788887642 Q ss_pred C-------CCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEE-ecCCcccchhHHHh-hhhCceeeccCCC----c Q lcl|NC_021326. 208 N-------DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL-TNYDDQELPEFKRL-LRYYGAIKVSDNG----G 274 (445) Q Consensus 208 ~-------~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~-~g~~~~~~~~~~~~-~~~~~~~~~~~~~----~ 274 (445) . ..+.|.+..+++.++.+|...|.+...+...+.+.+++ .|......+..... .+.++++.+..+. . T Consensus 342 ~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~gai~~~~~~~~e~~~~~~~vi~~~~~~~~~~~ 421 (711) T protein:vir:10 342 KSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEGREDEWEQANTKNFSLLTYIPQYQGDPG 421 (711) T ss_pred eeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCcccCChHHHHHhccccCCCeeEecccccCcCC Confidence 1 22467889999999999999999999999988876665 44433222223333 3455666665443 3 Q ss_pred eeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_021326. 275 VDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEH- 353 (445) Q Consensus 275 ~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~- 353 (445) +.++..+.-..++...++.....|...|++.+.+.+..+++.||+||......-..........+..+.+++.++++.+ T Consensus 422 ~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~q~qg~~~l~~~~dn~~~~~~~~g~~ll~li 501 (711) T protein:vir:10 422 PRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMI 501 (711) T ss_pred ccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4555444455777888898999999999998877777788899999988877766666666666666666665555543 Q ss_pred ---hccC------C---Ccc---------------------------eEEEEeCCCCCCCHHHHHHHHHHHhccCChH-- Q lcl|NC_021326. 354 ---FDIK------G---EHK---------------------------DVDISFNYNKVANTELQVQTAQQSMGIVSHE-- 392 (445) Q Consensus 354 ---~~~~------~---~~~---------------------------~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~e-- 392 (445) +... + ... +|.|.=.+..+.-..+.+..++.+.+.+|.- T Consensus 502 ~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~p~~~s~r~~~~~~l~ql~~~~p~~~~ 581 (711) T protein:vir:10 502 PHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQAVPSAAA 581 (711) T ss_pred HHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccceeeeEEEEeeccCchhHHHHHHHHHHHHHhhcchhhh Confidence 3210 0 000 1122223344443444444445544444321 Q ss_pred ----HHHHhCCCCCCHHHHHHHH--------------------HHHHHHHHHhh----hcccc--CCCCCCCCCC----- Q lcl|NC_021326. 393 ----TVLENHPFVEDLQAELERI--------------------EQEQMEYNKQL----PNLDD--GGADGAQQKE----- 437 (445) Q Consensus 393 ----t~l~~l~~~~d~~~E~~ri--------------------~~E~~~~~~~~----~~~~~--~~~~~~~~~~----- 437 (445) .+++.+++ .+.++-.+++ .+|++...... ..... .......... T Consensus 582 ~~~~~il~~~d~-p~~~el~e~lr~~~~~~~~~~~~~~~~qq~~~e~qq~~~~~q~~~~~~q~~~~qa~ae~~~Aqae~~ 660 (711) T protein:vir:10 582 VMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADML 660 (711) T ss_pred HHHHHHHHhcCC-CCHHHHHHHHHhhcCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12232221 1111111111 11111000000 00000 0000000000 Q ss_pred CCCCcCCC Q lcl|NC_021326. 438 RSNDKQSE 445 (445) Q Consensus 438 ~~~d~~~~ 445 (445) +...+... T Consensus 661 qa~~e~~~ 668 (711) T protein:vir:10 661 KAQLETEE 668 (711) T ss_pred HHHHHHHH Confidence 00000000 No 83 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=99.75 E-value=4.6e-17 Score=110.15 Aligned_cols=417 Identities=14% Similarity=0.101 Sum_probs=211.9 Q ss_pred ChHHHHHHHH-------HHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCee Q lcl|NC_021326. 1 MIVRYIKQHL-------EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA 73 (445) Q Consensus 1 ~l~~~i~~~~-------~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~ 73 (445) +-.+++..+. +-+....+..+||.|+| .. ...........++ .+++|.++.+|+..+|+--.+.+. T Consensus 20 ~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Q--w~---~~~~~~l~~~g~p--~~~~N~i~~~v~~v~g~~~~nr~~ 92 (714) T protein:vir:99 20 FSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQ--LP---PEVLQVLKDRGQP--MTIHNLIAPTVDGVLGMEAKTRTD 92 (714) T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCC--CC---HHHHHHHHhcCCC--cEEeccHHHHHHHHHhHHHhCCcc Confidence 2233333322 22344456789999976 11 0111111222233 478999999999999998776654 Q ss_pred e--ccC--ch---HHHHH----HHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC---CcEEEEEEccceeEEEEcC Q lcl|NC_021326. 74 F--KHT--DD---EVIKR----IDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE---GEFKLFRVPAEQGIPIWTD 138 (445) Q Consensus 74 ~--~~~--d~---~~~~~----l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~---g~~~i~~~~p~~~~~v~d~ 138 (445) + .+. ++ +..+. ++.+++ ++.......+..+++++|.||+-++.+.+ +.++|..++|.+++ ||+ T Consensus 93 ~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p~~v~--~Dp 170 (714) T protein:vir:99 93 LVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSRNEVF--WDW 170 (714) T ss_pred eEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecchhhee--ecc Confidence 4 331 22 23333 444444 57888889999999999999999988654 57899999999975 443 Q ss_pred CCC----Cce-EEEEEEEeeecce-------------------------------------------------------- Q lcl|NC_021326. 139 KEH----EEL-EAFIRMYKLENET-------------------------------------------------------- 157 (445) Q Consensus 139 ~~~----~~~-~~~v~~~~~~~~~-------------------------------------------------------- 157 (445) ... ... .++.+.|...+.. T Consensus 171 ~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 250 (714) T protein:vir:99 171 LSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRER 250 (714) T ss_pred ccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccc Confidence 211 111 1222222110000 Q ss_pred -e---EEEEecceEEEEEEe--cceeeeccccc---------------------------cccccccc--ccccccccce Q lcl|NC_021326. 158 -K---VEYWDKITVNYYVYE--NGSLIPDYSNN---------------------------LENSKTHF--STGSWGKIPF 202 (445) Q Consensus 158 -~---~~~~~~~~~~~~~~~--~~~~~~~~~~~---------------------------~~~~~~~~--~~~~~g~iPv 202 (445) . +++|+.......+.. ++..+...... .+...... .|.+.+++|+ T Consensus 251 ~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~ 330 (714) T protein:vir:99 251 RRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPL 330 (714) T ss_pred cEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeE Confidence 0 011111111000000 00000000000 00000111 2333356777 Q ss_pred EEecCCC---Cc--CccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHh-hhhCceeeccCCC--- Q lcl|NC_021326. 203 IPFKNND---LE--ISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRL-LRYYGAIKVSDNG--- 273 (445) Q Consensus 203 v~~~n~~---~g--~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~-~~~~~~~~~~~~~--- 273 (445) |++.-.. .| .|.+..+++.++.+|...|.+...+ .+...++..|........+... .+.++++.+..+. T Consensus 331 vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~ 408 (714) T protein:vir:99 331 VPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQ 408 (714) T ss_pred EEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHHHHHHhccCCCCceeeccccccc Confidence 7654322 22 4778889999999999999988766 3555555555544433333332 2344455553221 Q ss_pred -----ceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 274 -----GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLW 348 (445) Q Consensus 274 -----~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~ 348 (445) .+++.....-...+...+......|...|++.+.+.+..++..||+||......-..........+..+.+++.+ T Consensus 409 ~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~ 488 (714) T protein:vir:99 409 KSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGR 488 (714) T ss_pred CCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 123222222345667778888888999999888777777788999999877665554444455555555555544 Q ss_pred ----HHHHHhccC------C--Cc--------------------------ceEEEEeCCCCCCCHHHHHHHHHHHhccCC Q lcl|NC_021326. 349 ----FVFEHFDIK------G--EH--------------------------KDVDISFNYNKVANTELQVQTAQQSMGIVS 390 (445) Q Consensus 349 ----~~~~~~~~~------~--~~--------------------------~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s 390 (445) ++..++... + +. .+|.|.=.+..|....+.++.++.+.+.++ T Consensus 489 ~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~ 568 (714) T protein:vir:99 489 LLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLP 568 (714) T ss_pred HHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcC Confidence 444443211 0 00 011222233444444555666666654444 Q ss_pred h-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 391 H-------ETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 391 ~-------et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) . ..+++.+++ ++.++-+++|.+-. +. .+ ..+ ...++.. T Consensus 569 p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~-------~~-~~-~~~-------~~~~e~q 613 (714) T protein:vir:99 569 PQVQAVVLDLWVNLLDV-PQKQEFVERIRAAL-------GT-PK-SPD-------EMTPEEQ 613 (714) T ss_pred chhhhhHHHHHHHhcCC-CCHHHHHHHHHHHc-------CC-CC-Ccc-------ccchhhH Confidence 3 344555543 44444455554311 00 00 000 0000111 No 84 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=99.75 E-value=4.6e-17 Score=110.15 Aligned_cols=417 Identities=14% Similarity=0.101 Sum_probs=211.9 Q ss_pred ChHHHHHHHH-------HHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCee Q lcl|NC_021326. 1 MIVRYIKQHL-------EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA 73 (445) Q Consensus 1 ~l~~~i~~~~-------~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~ 73 (445) +-.+++..+. +-+....+..+||.|+| .. ...........++ .+++|.++.+|+..+|+--.+.+. T Consensus 20 ~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Q--w~---~~~~~~l~~~g~p--~~~~N~i~~~v~~v~g~~~~nr~~ 92 (714) T protein:vir:32 20 FSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQ--LP---PEVLQVLKDRGQP--MTIHNLIAPTVDGVLGMEAKTRTD 92 (714) T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCC--CC---HHHHHHHHhcCCC--cEEeccHHHHHHHHHhHHHhCCcc Confidence 2233333322 22344456789999976 11 0111111222233 478999999999999998776654 Q ss_pred e--ccC--ch---HHHHH----HHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC---CcEEEEEEccceeEEEEcC Q lcl|NC_021326. 74 F--KHT--DD---EVIKR----IDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE---GEFKLFRVPAEQGIPIWTD 138 (445) Q Consensus 74 ~--~~~--d~---~~~~~----l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~---g~~~i~~~~p~~~~~v~d~ 138 (445) + .+. ++ +..+. ++.+++ ++.......+..+++++|.||+-++.+.+ +.++|..++|.+++ ||+ T Consensus 93 ~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p~~v~--~Dp 170 (714) T protein:vir:32 93 LVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSRNEVF--WDW 170 (714) T ss_pred eEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecchhhee--ecc Confidence 4 331 22 23333 444444 57888889999999999999999988654 57899999999975 443 Q ss_pred CCC----Cce-EEEEEEEeeecce-------------------------------------------------------- Q lcl|NC_021326. 139 KEH----EEL-EAFIRMYKLENET-------------------------------------------------------- 157 (445) Q Consensus 139 ~~~----~~~-~~~v~~~~~~~~~-------------------------------------------------------- 157 (445) ... ... .++.+.|...+.. T Consensus 171 ~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 250 (714) T protein:vir:32 171 LSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRER 250 (714) T ss_pred ccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccc Confidence 211 111 1222222110000 Q ss_pred -e---EEEEecceEEEEEEe--cceeeeccccc---------------------------cccccccc--ccccccccce Q lcl|NC_021326. 158 -K---VEYWDKITVNYYVYE--NGSLIPDYSNN---------------------------LENSKTHF--STGSWGKIPF 202 (445) Q Consensus 158 -~---~~~~~~~~~~~~~~~--~~~~~~~~~~~---------------------------~~~~~~~~--~~~~~g~iPv 202 (445) . +++|+.......+.. ++..+...... .+...... .|.+.+++|+ T Consensus 251 ~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~ 330 (714) T protein:vir:32 251 RRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPL 330 (714) T ss_pred cEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeE Confidence 0 011111111000000 00000000000 00000111 2333356777 Q ss_pred EEecCCC---Cc--CccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHh-hhhCceeeccCCC--- Q lcl|NC_021326. 203 IPFKNND---LE--ISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRL-LRYYGAIKVSDNG--- 273 (445) Q Consensus 203 v~~~n~~---~g--~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~-~~~~~~~~~~~~~--- 273 (445) |++.-.. .| .|.+..+++.++.+|...|.+...+ .+...++..|........+... .+.++++.+..+. T Consensus 331 vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~ 408 (714) T protein:vir:32 331 VPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQ 408 (714) T ss_pred EEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHHHHHHhccCCCCceeeccccccc Confidence 7654322 22 4778889999999999999988766 3555555555544433333332 2344455553221 Q ss_pred -----ceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 274 -----GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLW 348 (445) Q Consensus 274 -----~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~ 348 (445) .+++.....-...+...+......|...|++.+.+.+..++..||+||......-..........+..+.+++.+ T Consensus 409 ~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~ 488 (714) T protein:vir:32 409 KSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGR 488 (714) T ss_pred CCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 123222222345667778888888999999888777777788999999877665554444455555555555544 Q ss_pred ----HHHHHhccC------C--Cc--------------------------ceEEEEeCCCCCCCHHHHHHHHHHHhccCC Q lcl|NC_021326. 349 ----FVFEHFDIK------G--EH--------------------------KDVDISFNYNKVANTELQVQTAQQSMGIVS 390 (445) Q Consensus 349 ----~~~~~~~~~------~--~~--------------------------~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s 390 (445) ++..++... + +. .+|.|.=.+..|....+.++.++.+.+.++ T Consensus 489 ~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~ 568 (714) T protein:vir:32 489 LLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLP 568 (714) T ss_pred HHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcC Confidence 444443211 0 00 011222233444444555666666654444 Q ss_pred h-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 391 H-------ETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 391 ~-------et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) . ..+++.+++ ++.++-+++|.+-. +. .+ ..+ ...++.. T Consensus 569 p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~-------~~-~~-~~~-------~~~~e~q 613 (714) T protein:vir:32 569 PQVQAVVLDLWVNLLDV-PQKQEFVERIRAAL-------GT-PK-SPD-------EMTPEEQ 613 (714) T ss_pred chhhhhHHHHHHHhcCC-CCHHHHHHHHHHHc-------CC-CC-Ccc-------ccchhhH Confidence 3 344555543 44444455554311 00 00 000 0000111 No 85 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=99.75 E-value=4.6e-17 Score=110.15 Aligned_cols=417 Identities=14% Similarity=0.101 Sum_probs=211.9 Q ss_pred ChHHHHHHHH-------HHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCee Q lcl|NC_021326. 1 MIVRYIKQHL-------EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA 73 (445) Q Consensus 1 ~l~~~i~~~~-------~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~ 73 (445) +-.+++..+. +-+....+..+||.|+| .. ...........++ .+++|.++.+|+..+|+--.+.+. T Consensus 20 ~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Q--w~---~~~~~~l~~~g~p--~~~~N~i~~~v~~v~g~~~~nr~~ 92 (714) T protein:vir:10 20 FSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQ--LP---PEVLQVLKDRGQP--MTIHNLIAPTVDGVLGMEAKTRTD 92 (714) T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCC--CC---HHHHHHHHhcCCC--cEEeccHHHHHHHHHhHHHhCCcc Confidence 2233333322 22344456789999976 11 0111111222233 478999999999999998776654 Q ss_pred e--ccC--ch---HHHHH----HHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC---CcEEEEEEccceeEEEEcC Q lcl|NC_021326. 74 F--KHT--DD---EVIKR----IDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE---GEFKLFRVPAEQGIPIWTD 138 (445) Q Consensus 74 ~--~~~--d~---~~~~~----l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~---g~~~i~~~~p~~~~~v~d~ 138 (445) + .+. ++ +..+. ++.+++ ++.......+..+++++|.||+-++.+.+ +.++|..++|.+++ ||+ T Consensus 93 ~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p~~v~--~Dp 170 (714) T protein:vir:10 93 LVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSRNEVF--WDW 170 (714) T ss_pred eEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecchhhee--ecc Confidence 4 331 22 23333 444444 57888889999999999999999988654 57899999999975 443 Q ss_pred CCC----Cce-EEEEEEEeeecce-------------------------------------------------------- Q lcl|NC_021326. 139 KEH----EEL-EAFIRMYKLENET-------------------------------------------------------- 157 (445) Q Consensus 139 ~~~----~~~-~~~v~~~~~~~~~-------------------------------------------------------- 157 (445) ... ... .++.+.|...+.. T Consensus 171 ~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 250 (714) T protein:vir:10 171 LSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRER 250 (714) T ss_pred ccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccc Confidence 211 111 1222222110000 Q ss_pred -e---EEEEecceEEEEEEe--cceeeeccccc---------------------------cccccccc--ccccccccce Q lcl|NC_021326. 158 -K---VEYWDKITVNYYVYE--NGSLIPDYSNN---------------------------LENSKTHF--STGSWGKIPF 202 (445) Q Consensus 158 -~---~~~~~~~~~~~~~~~--~~~~~~~~~~~---------------------------~~~~~~~~--~~~~~g~iPv 202 (445) . +++|+.......+.. ++..+...... .+...... .|.+.+++|+ T Consensus 251 ~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~ 330 (714) T protein:vir:10 251 RRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPL 330 (714) T ss_pred cEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeE Confidence 0 011111111000000 00000000000 00000111 2333356777 Q ss_pred EEecCCC---Cc--CccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHh-hhhCceeeccCCC--- Q lcl|NC_021326. 203 IPFKNND---LE--ISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRL-LRYYGAIKVSDNG--- 273 (445) Q Consensus 203 v~~~n~~---~g--~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~-~~~~~~~~~~~~~--- 273 (445) |++.-.. .| .|.+..+++.++.+|...|.+...+ .+...++..|........+... .+.++++.+..+. T Consensus 331 vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~ 408 (714) T protein:vir:10 331 VPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQ 408 (714) T ss_pred EEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHHHHHHhccCCCCceeeccccccc Confidence 7654322 22 4778889999999999999988766 3555555555544433333332 2344455553221 Q ss_pred -----ceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 274 -----GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLW 348 (445) Q Consensus 274 -----~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~ 348 (445) .+++.....-...+...+......|...|++.+.+.+..++..||+||......-..........+..+.+++.+ T Consensus 409 ~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~ 488 (714) T protein:vir:10 409 KSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGR 488 (714) T ss_pred CCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 123222222345667778888888999999888777777788999999877665554444455555555555544 Q ss_pred ----HHHHHhccC------C--Cc--------------------------ceEEEEeCCCCCCCHHHHHHHHHHHhccCC Q lcl|NC_021326. 349 ----FVFEHFDIK------G--EH--------------------------KDVDISFNYNKVANTELQVQTAQQSMGIVS 390 (445) Q Consensus 349 ----~~~~~~~~~------~--~~--------------------------~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s 390 (445) ++..++... + +. .+|.|.=.+..|....+.++.++.+.+.++ T Consensus 489 ~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~ 568 (714) T protein:vir:10 489 LLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLP 568 (714) T ss_pred HHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcC Confidence 444443211 0 00 011222233444444555666666654444 Q ss_pred h-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 391 H-------ETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 391 ~-------et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) . ..+++.+++ ++.++-+++|.+-. +. .+ ..+ ...++.. T Consensus 569 p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~-------~~-~~-~~~-------~~~~e~q 613 (714) T protein:vir:10 569 PQVQAVVLDLWVNLLDV-PQKQEFVERIRAAL-------GT-PK-SPD-------EMTPEEQ 613 (714) T ss_pred chhhhhHHHHHHHhcCC-CCHHHHHHHHHHHc-------CC-CC-Ccc-------ccchhhH Confidence 3 344555543 44444455554311 00 00 000 0000111 No 86 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=99.75 E-value=4.6e-17 Score=110.15 Aligned_cols=417 Identities=14% Similarity=0.101 Sum_probs=211.9 Q ss_pred ChHHHHHHHH-------HHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCee Q lcl|NC_021326. 1 MIVRYIKQHL-------EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA 73 (445) Q Consensus 1 ~l~~~i~~~~-------~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~ 73 (445) +-.+++..+. +-+....+..+||.|+| .. ...........++ .+++|.++.+|+..+|+--.+.+. T Consensus 20 ~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Q--w~---~~~~~~l~~~g~p--~~~~N~i~~~v~~v~g~~~~nr~~ 92 (714) T protein:vir:81 20 FSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQ--LP---PEVLQVLKDRGQP--MTIHNLIAPTVDGVLGMEAKTRTD 92 (714) T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCC--CC---HHHHHHHHhcCCC--cEEeccHHHHHHHHHhHHHhCCcc Confidence 2233333322 22344456789999976 11 0111111222233 478999999999999998776654 Q ss_pred e--ccC--ch---HHHHH----HHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC---CcEEEEEEccceeEEEEcC Q lcl|NC_021326. 74 F--KHT--DD---EVIKR----IDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE---GEFKLFRVPAEQGIPIWTD 138 (445) Q Consensus 74 ~--~~~--d~---~~~~~----l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~---g~~~i~~~~p~~~~~v~d~ 138 (445) + .+. ++ +..+. ++.+++ ++.......+..+++++|.||+-++.+.+ +.++|..++|.+++ ||+ T Consensus 93 ~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p~~v~--~Dp 170 (714) T protein:vir:81 93 LVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSRNEVF--WDW 170 (714) T ss_pred eEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecchhhee--ecc Confidence 4 331 22 23333 444444 57888889999999999999999988654 57899999999975 443 Q ss_pred CCC----Cce-EEEEEEEeeecce-------------------------------------------------------- Q lcl|NC_021326. 139 KEH----EEL-EAFIRMYKLENET-------------------------------------------------------- 157 (445) Q Consensus 139 ~~~----~~~-~~~v~~~~~~~~~-------------------------------------------------------- 157 (445) ... ... .++.+.|...+.. T Consensus 171 ~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 250 (714) T protein:vir:81 171 LSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRER 250 (714) T ss_pred ccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccc Confidence 211 111 1222222110000 Q ss_pred -e---EEEEecceEEEEEEe--cceeeeccccc---------------------------cccccccc--ccccccccce Q lcl|NC_021326. 158 -K---VEYWDKITVNYYVYE--NGSLIPDYSNN---------------------------LENSKTHF--STGSWGKIPF 202 (445) Q Consensus 158 -~---~~~~~~~~~~~~~~~--~~~~~~~~~~~---------------------------~~~~~~~~--~~~~~g~iPv 202 (445) . +++|+.......+.. ++..+...... .+...... .|.+.+++|+ T Consensus 251 ~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~ 330 (714) T protein:vir:81 251 RRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPL 330 (714) T ss_pred cEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeE Confidence 0 011111111000000 00000000000 00000111 2333356777 Q ss_pred EEecCCC---Cc--CccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHh-hhhCceeeccCCC--- Q lcl|NC_021326. 203 IPFKNND---LE--ISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRL-LRYYGAIKVSDNG--- 273 (445) Q Consensus 203 v~~~n~~---~g--~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~-~~~~~~~~~~~~~--- 273 (445) |++.-.. .| .|.+..+++.++.+|...|.+...+ .+...++..|........+... .+.++++.+..+. T Consensus 331 vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~ 408 (714) T protein:vir:81 331 VPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQ 408 (714) T ss_pred EEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHHHHHHhccCCCCceeeccccccc Confidence 7654322 22 4778889999999999999988766 3555555555544433333332 2344455553221 Q ss_pred -----ceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 274 -----GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLW 348 (445) Q Consensus 274 -----~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~ 348 (445) .+++.....-...+...+......|...|++.+.+.+..++..||+||......-..........+..+.+++.+ T Consensus 409 ~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~ 488 (714) T protein:vir:81 409 KSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGR 488 (714) T ss_pred CCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 123222222345667778888888999999888777777788999999877665554444455555555555544 Q ss_pred ----HHHHHhccC------C--Cc--------------------------ceEEEEeCCCCCCCHHHHHHHHHHHhccCC Q lcl|NC_021326. 349 ----FVFEHFDIK------G--EH--------------------------KDVDISFNYNKVANTELQVQTAQQSMGIVS 390 (445) Q Consensus 349 ----~~~~~~~~~------~--~~--------------------------~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s 390 (445) ++..++... + +. .+|.|.=.+..|....+.++.++.+.+.++ T Consensus 489 ~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~ 568 (714) T protein:vir:81 489 LLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLP 568 (714) T ss_pred HHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcC Confidence 444443211 0 00 011222233444444555666666654444 Q ss_pred h-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 391 H-------ETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 391 ~-------et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) . ..+++.+++ ++.++-+++|.+-. +. .+ ..+ ...++.. T Consensus 569 p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~-------~~-~~-~~~-------~~~~e~q 613 (714) T protein:vir:81 569 PQVQAVVLDLWVNLLDV-PQKQEFVERIRAAL-------GT-PK-SPD-------EMTPEEQ 613 (714) T ss_pred chhhhhHHHHHHHhcCC-CCHHHHHHHHHHHc-------CC-CC-Ccc-------ccchhhH Confidence 3 344555543 44444455554311 00 00 000 0000111 No 87 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=99.75 E-value=4.6e-17 Score=110.15 Aligned_cols=417 Identities=14% Similarity=0.101 Sum_probs=211.9 Q ss_pred ChHHHHHHHH-------HHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCee Q lcl|NC_021326. 1 MIVRYIKQHL-------EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA 73 (445) Q Consensus 1 ~l~~~i~~~~-------~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~ 73 (445) +-.+++..+. +-+....+..+||.|+| .. ...........++ .+++|.++.+|+..+|+--.+.+. T Consensus 20 ~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Q--w~---~~~~~~l~~~g~p--~~~~N~i~~~v~~v~g~~~~nr~~ 92 (714) T protein:vir:27 20 FSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQ--LP---PEVLQVLKDRGQP--MTIHNLIAPTVDGVLGMEAKTRTD 92 (714) T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCC--CC---HHHHHHHHhcCCC--cEEeccHHHHHHHHHhHHHhCCcc Confidence 2233333322 22344456789999976 11 0111111222233 478999999999999998776654 Q ss_pred e--ccC--ch---HHHHH----HHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC---CcEEEEEEccceeEEEEcC Q lcl|NC_021326. 74 F--KHT--DD---EVIKR----IDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE---GEFKLFRVPAEQGIPIWTD 138 (445) Q Consensus 74 ~--~~~--d~---~~~~~----l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~---g~~~i~~~~p~~~~~v~d~ 138 (445) + .+. ++ +..+. ++.+++ ++.......+..+++++|.||+-++.+.+ +.++|..++|.+++ ||+ T Consensus 93 ~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p~~v~--~Dp 170 (714) T protein:vir:27 93 LVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSRNEVF--WDW 170 (714) T ss_pred eEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecchhhee--ecc Confidence 4 331 22 23333 444444 57888889999999999999999988654 57899999999975 443 Q ss_pred CCC----Cce-EEEEEEEeeecce-------------------------------------------------------- Q lcl|NC_021326. 139 KEH----EEL-EAFIRMYKLENET-------------------------------------------------------- 157 (445) Q Consensus 139 ~~~----~~~-~~~v~~~~~~~~~-------------------------------------------------------- 157 (445) ... ... .++.+.|...+.. T Consensus 171 ~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 250 (714) T protein:vir:27 171 LSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRER 250 (714) T ss_pred ccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccccccccccccc Confidence 211 111 1222222110000 Q ss_pred -e---EEEEecceEEEEEEe--cceeeeccccc---------------------------cccccccc--ccccccccce Q lcl|NC_021326. 158 -K---VEYWDKITVNYYVYE--NGSLIPDYSNN---------------------------LENSKTHF--STGSWGKIPF 202 (445) Q Consensus 158 -~---~~~~~~~~~~~~~~~--~~~~~~~~~~~---------------------------~~~~~~~~--~~~~~g~iPv 202 (445) . +++|+.......+.. ++..+...... .+...... .|.+.+++|+ T Consensus 251 ~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~ 330 (714) T protein:vir:27 251 RRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPL 330 (714) T ss_pred cEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCCCCCceeE Confidence 0 011111111000000 00000000000 00000111 2333356777 Q ss_pred EEecCCC---Cc--CccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHh-hhhCceeeccCCC--- Q lcl|NC_021326. 203 IPFKNND---LE--ISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRL-LRYYGAIKVSDNG--- 273 (445) Q Consensus 203 v~~~n~~---~g--~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~-~~~~~~~~~~~~~--- 273 (445) |++.-.. .| .|.+..+++.++.+|...|.+...+ .+...++..|........+... .+.++++.+..+. T Consensus 331 vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~~p~~~~~ 408 (714) T protein:vir:27 331 VPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDNDLMEQIERPDGIIKLNPVRKNQ 408 (714) T ss_pred EEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHHHHHHhccCCCCceeeccccccc Confidence 7654322 22 4778889999999999999988766 3555555555544433333332 2344455553221 Q ss_pred -----ceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 274 -----GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLW 348 (445) Q Consensus 274 -----~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~ 348 (445) .+++.....-...+...+......|...|++.+.+.+..++..||+||......-..........+..+.+++.+ T Consensus 409 ~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~ 488 (714) T protein:vir:27 409 KSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGR 488 (714) T ss_pred CCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 123222222345667778888888999999888777777788999999877665554444455555555555544 Q ss_pred ----HHHHHhccC------C--Cc--------------------------ceEEEEeCCCCCCCHHHHHHHHHHHhccCC Q lcl|NC_021326. 349 ----FVFEHFDIK------G--EH--------------------------KDVDISFNYNKVANTELQVQTAQQSMGIVS 390 (445) Q Consensus 349 ----~~~~~~~~~------~--~~--------------------------~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s 390 (445) ++..++... + +. .+|.|.=.+..|....+.++.++.+.+.++ T Consensus 489 ~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~~~ 568 (714) T protein:vir:27 489 LLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLP 568 (714) T ss_pred HHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhhcC Confidence 444443211 0 00 011222233444444555666666654444 Q ss_pred h-------HHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 391 H-------ETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 391 ~-------et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) . ..+++.+++ ++.++-+++|.+-. +. .+ ..+ ...++.. T Consensus 569 p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~-------~~-~~-~~~-------~~~~e~q 613 (714) T protein:vir:27 569 PQVQAVVLDLWVNLLDV-PQKQEFVERIRAAL-------GT-PK-SPD-------EMTPEEQ 613 (714) T ss_pred chhhhhHHHHHHHhcCC-CCHHHHHHHHHHHc-------CC-CC-Ccc-------ccchhhH Confidence 3 344555543 44444455554311 00 00 000 0000111 No 88 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=99.73 E-value=1.1e-16 Score=108.13 Aligned_cols=426 Identities=15% Similarity=0.120 Sum_probs=203.1 Q ss_pred ChHHHHHH----HHHHHH-HHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhh----cc- Q lcl|NC_021326. 1 MIVRYIKQ----HLEKLP-EISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIV----GK- 70 (445) Q Consensus 1 ~l~~~i~~----~~~~~~-~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~----g~- 70 (445) +|...++. |...+. ...+..+||.|+..-. ..+.+ .+++.+.....|+.....|. +. T Consensus 18 ~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~-----------~~~~~--s~~~~~~v~~~v~~~~~~l~~~~~~~~ 84 (705) T protein:vir:88 18 HLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGN-----------ERPGK--SGIVSRDVQETVDWIMPSLMKVFTSGG 84 (705) T ss_pred HHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCc-----------ccCCC--CccccHHHHHHHHHHHHHHHHhhcCCC Confidence 44444433 333443 5567889999985211 11112 35677777777787777653 32 Q ss_pred -Ceeecc---CchHHHHHHHHHh------ccCHHHHHHHHHHHHHhcCeEEEEEEECCC--------------------- Q lcl|NC_021326. 71 -PIAFKH---TDDEVIKRIDEVL------GNRFDDKLHSVLTGASNKGIEWLHPYLDEE--------------------- 119 (445) Q Consensus 71 -~~~~~~---~d~~~~~~l~~~~------~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~--------------------- 119 (445) .+.+.+ +|....+.++.+. .|+....+...+++++++|.+++.||++.. T Consensus 85 ~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~~~e~~~~~~~~~l~~~~~ 164 (705) T protein:vir:88 85 QVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFSGLSEDMVADILS 164 (705) T ss_pred ceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccchhhhhhccCChhhhhhhhh Confidence 345544 3444444433322 255667788999999999999999887431 Q ss_pred ---------------------------CcEEEEEEccceeEEEEcCCCCCceE-EEEEEEeeeccee------------- Q lcl|NC_021326. 120 ---------------------------GEFKLFRVPAEQGIPIWTDKEHEELE-AFIRMYKLENETK------------- 158 (445) Q Consensus 120 ---------------------------g~~~i~~~~p~~~~~v~d~~~~~~~~-~~v~~~~~~~~~~------------- 158 (445) |++++..|+|.++++-.+........ .+.+++.+..+.. T Consensus 165 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~a~~~~d~~~~~~~~~~t~~dl~~~g~~~~~~~~~~ 244 (705) T protein:vir:88 165 DPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHREKYTVSDLRLLGVPEDVIEELP 244 (705) T ss_pred hhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCCCCCcccCcEEEEEEeccHHHHHhhcCChhHhhhhh Confidence 56888899999886433211111111 1222222110000 Q ss_pred ---------------------------EEEEecceEE-EEEEecceeee-ccccc--------ccccccccccccccccc Q lcl|NC_021326. 159 ---------------------------VEYWDKITVN-YYVYENGSLIP-DYSNN--------LENSKTHFSTGSWGKIP 201 (445) Q Consensus 159 ---------------------------~~~~~~~~~~-~~~~~~~~~~~-~~~~~--------~~~~~~~~~~~~~g~iP 201 (445) ...++..... .+.+++..... ...+. .+.... ...+++++| T Consensus 245 ~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~~~d~~~d~~~~~~~~~~~g~~il--~~~~~~~~P 322 (705) T protein:vir:88 245 YDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISELRRILYVGDYII--SNEPWDCRP 322 (705) T ss_pred cccccchhhhhhhccccccccccccccccccccCCceeEEEEEeeeEecccCCcceeeEEEEEeCcccc--ccccCCCCC Confidence 0000000000 01111111100 00000 000000 112456778 Q ss_pred eEEe-----cCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeeccCCCcee Q lcl|NC_021326. 202 FIPF-----KNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVD 276 (445) Q Consensus 202 vv~~-----~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (445) |+.+ +...+|.|.++.+.++++.+|..++.+.+.+...+.|.+.+...... ..+.. ..+.++++.+...+.+. T Consensus 323 F~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~g~v~-~~d~~-~~~pg~vv~~~~~~~i~ 400 (705) T protein:vir:88 323 FADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQVN-LEDLL-TNEAAGIVRVKSMNSIT 400 (705) T ss_pred EEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceeccccccC-ccccc-ccCCCeeEEecCCCccc Confidence 8754 34557899999999999999999999999999999987665322111 11111 13445566666666677 Q ss_pred eEeccCChHHHHHHHHHHHHHHHHHhCccccccc----cccCcchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH Q lcl|NC_021326. 277 TIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSD----KFGSAPSGVALEFLYTNLNLKADKLARKAKV-AIQELLWFVF 351 (445) Q Consensus 277 ~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~----~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~-~l~~~~~~~~ 351 (445) ++..+.-.......++.+...+...|++++...+ ..+++.|+.|+..+......+.....+.|.. +++.++++++ T Consensus 401 ~~~~~~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~ 480 (705) T protein:vir:88 401 PLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLH 480 (705) T ss_pred cccCCcCcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7766555566777889999999999999987665 2344678888888877777777777777753 5555555554 Q ss_pred HHhccCCC----------cc-----------eEEEEeCCCCCCCHHHHHHHHHHHh----c---------cCChHHH--- Q lcl|NC_021326. 352 EHFDIKGE----------HK-----------DVDISFNYNKVANTELQVQTAQQSM----G---------IVSHETV--- 394 (445) Q Consensus 352 ~~~~~~~~----------~~-----------~i~v~f~~~~p~d~~~~~~~~~~~~----g---------~~s~et~--- 394 (445) .++....+ +. ++.+.-... ..+..+....+..+. . +++.... T Consensus 481 ~li~~~~~~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~~~-~~~~eq~~a~l~~ll~~~q~l~~~~~~~~~~~~~~~~~~ 559 (705) T protein:vir:88 481 DHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVGIG-NMNKDQQMLHLMRIWEMAQAVVGGGGLGVLVSEQNLYNI 559 (705) T ss_pred HHHHHhCCCceEEeeccchhccchHhhccCCceEEeeccc-cchHHHHHHHHHHHHHHHHHhhcccchhhhcChHHHHHH Confidence 44322211 11 112221111 111222211111110 1 1111111 Q ss_pred ----HHhCCCCCCHHH------HHH----HHHHHHHHHH------HhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 395 ----LENHPFVEDLQA------ELE----RIEQEQMEYN------KQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 395 ----l~~l~~~~d~~~------E~~----ri~~E~~~~~------~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) .+.. .+.++++ .++ +.+.++.+.. +..........+...+..+..-.+.| T Consensus 560 ~~el~e~~-~~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~~q~e~q~~q~E 629 (705) T protein:vir:88 560 LKEVTENA-GYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAEAQMKQVE 629 (705) T ss_pred HHHHHHhh-hhhhHHHHhhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0001 0000000 000 0000000000 00000000000000000000000000 No 89 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.71 E-value=4.6e-17 Score=110.15 Aligned_cols=401 Identities=10% Similarity=0.058 Sum_probs=204.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCC-Ccccccccc-ccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCc Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQR-PDIVKEPKP-VDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTD 78 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~-~~i~~~~~~-~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d 78 (445) -..+...+ ...+..+. ..+=.|. .+....... ..........+.-+ -.+.+++.+|+..++.++.+++.+++++ T Consensus 8 ~~~~~~~~-a~~~~~~~--~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY-~~~~l~r~iVd~~a~d~~r~g~~i~~~~ 83 (461) T protein:vir:80 8 KQAKIDSK-IVNRNDFM--VGHGKANSRDKLTRQTPGNGQKLDLKACENLY-ASNSIAMNIVDIISEDMVRAGWSLKTDN 83 (461) T ss_pred hhhhhhhh-hhhhhHHH--hhcCCcchhhhhhccccCcccccCHHHHHHHH-HhCCccchhhccchHHhhcCCeeeecCC Confidence 33332211 11111111 0000111 000000000 00000000000001 1357889999999999999999999999 Q ss_pred hHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCc---EEEEEEccceeEEEEcCCCCCceEEEEEEEee- Q lcl|NC_021326. 79 DEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGE---FKLFRVPAEQGIPIWTDKEHEELEAFIRMYKL- 153 (445) Q Consensus 79 ~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~---~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~- 153 (445) ++..+.++.+|+ -+....+.++.+.+.+||.+++++-...... ....-+.|.. .+.+.....+|.. T Consensus 84 ~~~~~~~~~~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl~~~~---------~~~~~~l~~~~~~~ 154 (461) T protein:vir:80 84 KEMKKNIESKWRKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLSTAIDPKT---------IKSIPYINTFNTQK 154 (461) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHhhcccccEEEEEEeecCCccccCccCCccccc---------ccceeEEEeccccc Confidence 888888888775 4677889999999999999998886532110 0011111111 0001000000000 Q ss_pred -------ecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHH Q lcl|NC_021326. 154 -------ENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTL 221 (445) Q Consensus 154 -------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~l 221 (445) .+-..-.++.+ ..|........................|. -++++|.+ ...|.|.++.+.+. T Consensus 155 i~~~~~~~dp~sp~fg~P---~~y~i~~~~~~~~~~~~~~~~~~~~~iH~---SRii~~~~~~~~~~~~G~S~le~~~~~ 228 (461) T protein:vir:80 155 VTQLYLNQDMFSEHFGEV---EFFEVNRVSQLGEEILSGTTASTSEQIHR---SRIIHEQGLRFEGETKGRSIFESLYDI 228 (461) T ss_pred cchhhhcccCcCcccccc---eEEEEeccccccccccccccCccceEEcc---ccEEEecCCCCCccccCcchHHHHHHH Confidence 00000011111 11111110000000000000111111232 23555533 33689999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCeeEEecCCcc---cchhHHHhh----hhCceeeccCCCceeeEeccCChHHHHHHHHHH Q lcl|NC_021326. 222 IDAYNRRLSDLSNTFKDSNELTYVLTNYDDQ---ELPEFKRLL----RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDEL 294 (445) Q Consensus 222 id~~~~~~s~~~~~~~~~~~~~l~~~g~~~~---~~~~~~~~~----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l 294 (445) +.+++.+.-.....+.....+.+...+...- ........+ ...+++.++.+.+ +.+.+.+.+.+...++.+ T Consensus 229 l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~~~~~~~~g~~~~d~~e~--~e~~~~~lsgl~~~l~~~ 306 (461) T protein:vir:80 229 ITVMDTSLWSVGQILYDFAFKVYKTDDIDALNKDDKANLTAMLDFMFRTEALAIIKGDEQ--LTKESTNVSGMKDLLDYG 306 (461) T ss_pred HHHHHHHHHHHHHHHHHhCCCceecchHHhhhchHHHHHHHHHHHhcCCceEEEEcCCcc--eEEEecCcCCHHHHHHHH Confidence 9999999988888888888777766553221 111111111 2334555666555 444556677889999999 Q ss_pred HHHHHHHhCcccc--ccccccCcchHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhcc-----CCCcceEEEE Q lcl|NC_021326. 295 YQKIMLFGQAVDF--SSDKFGSAPSGVALEFLYTNLNLKADKLA-RKAKVAIQELLWFVFEHFDI-----KGEHKDVDIS 366 (445) Q Consensus 295 ~~~i~~~s~~p~~--~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~-~~~~~~l~~~~~~~~~~~~~-----~~~~~~i~v~ 366 (445) ...|...+++|-. .+...|++.||..=... ...+++.++ ..++..+++++.+++...+. +.+..+++++ T Consensus 307 ~~~iaa~s~iP~t~L~G~s~g~~asge~D~~~---yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~ 383 (461) T protein:vir:80 307 WDYLAGAVRMPKTVLKGQEAGTLTGAQYDVMN---YYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIE 383 (461) T ss_pred HHHHhhhhcCCeeeeecccCCccccchHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccceEEE Confidence 9999999999974 33344666777654333 444555555 56899999999988764432 2234578999 Q ss_pred eCCCCCCCHHHHHHHHHHH---------hccCChHHHHHhC----CCCCC-----HHHHHHHHHHHHHHHHHhhhccccC Q lcl|NC_021326. 367 FNYNKVANTELQVQTAQQS---------MGIVSHETVLENH----PFVED-----LQAELERIEQEQMEYNKQLPNLDDG 428 (445) Q Consensus 367 f~~~~p~d~~~~~~~~~~~---------~g~~s~et~l~~l----~~~~d-----~~~E~~ri~~E~~~~~~~~~~~~~~ 428 (445) |++-.+.+..+.|++..+. .|++|.+++.+.+ +..++ ..+|.+.+.++..+ T Consensus 384 f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------- 452 (461) T protein:vir:80 384 FNPLWNLDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLVYD----------- 452 (461) T ss_pred eCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhccc----------- Confidence 9999999999998865442 3778877765433 11110 01122221111100 Q ss_pred CCCCCCCCCCCCCcCCC Q lcl|NC_021326. 429 GADGAQQKERSNDKQSE 445 (445) Q Consensus 429 ~~~~~~~~~~~~d~~~~ 445 (445) ...+++++ T Consensus 453 ---------~~~~e~~~ 460 (461) T protein:vir:80 453 ---------AYAKKNAD 460 (461) T ss_pred ---------cccccCCC Confidence 00001111 No 90 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=99.71 E-value=7.3e-16 Score=103.56 Aligned_cols=424 Identities=14% Similarity=0.102 Sum_probs=210.0 Q ss_pred ChHHHHHHH---HHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeee--c Q lcl|NC_021326. 1 MIVRYIKQH---LEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF--K 75 (445) Q Consensus 1 ~l~~~i~~~---~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~--~ 75 (445) ++.++.... .+-.....+-.+||.|+| .. ...........++ .+++|.++.+|+..+++.-.+.+.+ . T Consensus 24 ~l~~~~~~~~~~~~~r~~a~~d~~fy~G~Q--w~---~~~~~~l~~~g~p--~~~~N~i~~~v~~v~g~~~~nr~~~~v~ 96 (714) T protein:vir:10 24 QLLSLCSDIDSQPLWRDAANKACAYYDGDQ--LA---PEVIQVLKDRGQP--MTIHNLIAPTVDGVLGMEAKTRTDLIVM 96 (714) T ss_pred HHHHHHHHHhhhHHHHHHHHHHHHhhcCCC--CC---HHHHHHHHhcCCC--cEEeccHHHHHHHHHHHHHhCCcceEEe Confidence 333333322 222344557789999986 11 1111122222333 4789999999999999987776554 3 Q ss_pred cC--ch---HHHHHH----HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC---CcEEEEEEccceeEEEEcCCCC- Q lcl|NC_021326. 76 HT--DD---EVIKRI----DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE---GEFKLFRVPAEQGIPIWTDKEH- 141 (445) Q Consensus 76 ~~--d~---~~~~~l----~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~---g~~~i~~~~p~~~~~v~d~~~~- 141 (445) +. ++ +..+.+ +.+++ ++.......+..+++++|.+|+-++.+.+ +.++|..++|.+++ ||+... T Consensus 97 pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~d~d~~~~~i~i~~v~p~~v~--~Dp~a~~ 174 (714) T protein:vir:10 97 SDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSEPFGPEFKVSTVSRNEVF--WDWLSRE 174 (714) T ss_pred cCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEEeeeccCCCCCCeEEEecChhhee--ecccccc Confidence 31 22 234443 34443 57888889999999999999999988754 67999999999875 443211 Q ss_pred ---CceEEE-EEEEee--------------------------------------------------e-------cc---e Q lcl|NC_021326. 142 ---EELEAF-IRMYKL--------------------------------------------------E-------NE---T 157 (445) Q Consensus 142 ---~~~~~~-v~~~~~--------------------------------------------------~-------~~---~ 157 (445) ....++ .+.|.. . +. . T Consensus 175 ~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rV~ 254 (714) T protein:vir:10 175 ADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQRERRRVL 254 (714) T ss_pred CChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhhcccccccchhhcccccccccccccCcceEE Confidence 011111 111100 0 00 0 Q ss_pred eEEEEecceEEEEEEe--cceeeeccccc---------------------------cccccccc--ccccccccceEEec Q lcl|NC_021326. 158 KVEYWDKITVNYYVYE--NGSLIPDYSNN---------------------------LENSKTHF--STGSWGKIPFIPFK 206 (445) Q Consensus 158 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~---------------------------~~~~~~~~--~~~~~g~iPvv~~~ 206 (445) .+++|+.......... ++..+...... .+...... .|.+.+++|+|++. T Consensus 255 v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vP~~ 334 (714) T protein:vir:10 255 LQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFW 334 (714) T ss_pred EEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccceeeEEEEEEecchhhhcCCCCCCCCceeeEEec Confidence 1122222211111110 00000000000 00000111 23344567777664 Q ss_pred CCC---C--cCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhh-hhCceeeccCC----C--- Q lcl|NC_021326. 207 NND---L--EISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLL-RYYGAIKVSDN----G--- 273 (445) Q Consensus 207 n~~---~--g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~-~~~~~~~~~~~----~--- 273 (445) -.. . ..|.+..+++.++.+|...|.+...+. +...++..|............. +.++++.+... . T Consensus 335 g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~~~~~gav~~~d~~~~e~~~rp~~vi~~~~~~~~~~~~~ 412 (714) T protein:vir:10 335 GYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLLQ--AKRVIMDEDATQLSDNDLMEQLERPDGIIKLNPVRKNQKSVA 412 (714) T ss_pred ceeeeccCccceehhhhhhHHHHHHHHHHHHHHHHh--CCceeeccccccccHHHHHHhccCCCCeEEecccccccCCcc Confidence 321 2 357888899999999999999887663 4444554555443333333332 33445555321 1 Q ss_pred -ceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_021326. 274 -GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVF- 351 (445) Q Consensus 274 -~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~- 351 (445) .++......-...+...++.....|...|++.+.+.+..+++.||+||..+...-..........+..+.+++.++++ T Consensus 413 ~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAI~~r~~qg~~~l~~~~dnl~~~~~~~g~~ll~ 492 (714) T protein:vir:10 413 DVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLA 492 (714) T ss_pred ccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 123222222345677778888899999999888777777788999999877665555555555555555555555444 Q ss_pred ---HHhccCC--------Cc----c----------------------eEEEEeCCCCCCCHHHHHHHHHHHhccCCh--- Q lcl|NC_021326. 352 ---EHFDIKG--------EH----K----------------------DVDISFNYNKVANTELQVQTAQQSMGIVSH--- 391 (445) Q Consensus 352 ---~~~~~~~--------~~----~----------------------~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~--- 391 (445) .++.... +. . +|.|.=.+..+.-..+.++.++++.+.++. T Consensus 493 li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~~dv~i~~~p~~~s~r~~~~~~l~ql~~~~~p~~~ 572 (714) T protein:vir:10 493 YLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQ 572 (714) T ss_pred HHHHHcCCCcEEEEeccCCCcccceeEeeccccCCccccccceeeeEEEEEeeccCcHHHHHHHHHHHHHHHhhcCchhh Confidence 4332110 00 0 011111223333344444555554433332 Q ss_pred ----HHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 392 ----ETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 392 ----et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ...++.+.+ ...++-+++|.+-. +. .+ ..+...+.+.......+ T Consensus 573 ~~~~~~~le~~d~-p~~~ei~~~ir~~~-------~~-~~-~~~~~~~e~q~~q~~~~ 620 (714) T protein:vir:10 573 AVVLDLWVNLLDV-PQKQEFVERIRAAL-------GT-PK-SPDEMTPEEQEVAAQQQ 620 (714) T ss_pred hhHHHHHHHhcCC-cCHHHHHHHHHHHc-------CC-CC-CccccCcchhHHHHHHH Confidence 234455433 34444455554221 00 00 00000000000000000 No 91 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=99.68 E-value=2.4e-15 Score=100.72 Aligned_cols=428 Identities=11% Similarity=0.047 Sum_probs=201.3 Q ss_pred ChHHHHHHHHHHH----HHHHH----------HHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhh Q lcl|NC_021326. 1 MIVRYIKQHLEKL----PEISI----------GQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY 66 (445) Q Consensus 1 ~l~~~i~~~~~~~----~~~~~----------~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~ 66 (445) +|.+..+++.+.. +++.. +.+||.|...- .......+.| ++++.|.....|+..... T Consensus 24 ~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~-------~~~~~~~~~r--s~~~~~~v~~~ve~~~~~ 94 (651) T protein:vir:80 24 YVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLR-------SVGDVNADWR--HKITTGKAFEAIETIHAY 94 (651) T ss_pred HHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhcccccc-------ccCCCCCCCC--ccccChhHHHHHHHHHHH Confidence 4555555544322 23333 34455553210 0111111223 368889999999888777 Q ss_pred hhcc----C--eeecc-CchH----HHHHHHHHhc-----cCHHHHHHHHHHHHHhcCeEEEEEEECC------------ Q lcl|NC_021326. 67 IVGK----P--IAFKH-TDDE----VIKRIDEVLG-----NRFDDKLHSVLTGASNKGIEWLHPYLDE------------ 118 (445) Q Consensus 67 l~g~----~--~~~~~-~d~~----~~~~l~~~~~-----n~~~~~~~~~~~~~~~~G~~~~~v~~d~------------ 118 (445) |+.. + +.+.. .+++ ..+.+..++. .++.....++..+++++|.+++.|+++. T Consensus 95 l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~kv~we~~~~~~~~~~~~~ 174 (651) T protein:vir:80 95 LMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLALPWRVETAEVKKKVQVR 174 (651) T ss_pred HHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEEEeecceeeeeehheecc Confidence 6432 2 23322 2222 2333544443 4577777788999999999999987652 Q ss_pred -------------------CCcEEEEEEccceeEEEEcCCCC--CceEEEEEEEeeecc--------------------- Q lcl|NC_021326. 119 -------------------EGEFKLFRVPAEQGIPIWTDKEH--EELEAFIRMYKLENE--------------------- 156 (445) Q Consensus 119 -------------------~g~~~i~~~~p~~~~~v~d~~~~--~~~~~~v~~~~~~~~--------------------- 156 (445) .|.|++..++|.++++ |++.. ....++++.+.+... T Consensus 175 ~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~--dp~a~~~~d~~~v~~~~~t~~~l~~l~~~g~~~~~~~~~~~~~ 252 (651) T protein:vir:80 175 TPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFY--DPNVTDPNRGAFIRKLTKTKADILNLLSEGYYYGVDPLDVVEH 252 (651) T ss_pred ccccccccceeeeccceeeeceeEEEEecHHHeee--cCCCcCccccceeeeeeeeHHHHHHHHhcccccchhhHHHHhh Confidence 1567899999999875 43221 122233333221100 Q ss_pred ---------------------------eeEEEEecceEEEEEEecceeeecccccccccccccccccc-cccceEEec-- Q lcl|NC_021326. 157 ---------------------------TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSW-GKIPFIPFK-- 206 (445) Q Consensus 157 ---------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-g~iPvv~~~-- 206 (445) ..+++|.. +..+..++...........+........+++ ..+|++.++ T Consensus 253 ~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~--~~~~d~e~~~~~~~~v~~~g~~il~~~~~~~~~~~Pf~~~~~~ 330 (651) T protein:vir:80 253 KCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEY--WGDIHLENKTYHDVVVTIMGNEVLRFEQNPYWCGRPFVIGTYI 330 (651) T ss_pred hccccccCCccccccccCCCccccccccceEEEEE--EEEeeccCCceEEEEEEEcCcEEecccccCCCCCCCeeeecce Confidence 00011100 0000001100000000000111111122322 245776543 Q ss_pred ---CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeeccCCCceeeEecc-C Q lcl|NC_021326. 207 ---NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVE-V 282 (445) Q Consensus 207 ---n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~ 282 (445) ...+|.|..+.+.+.+..+|.....+.+.+...+.|.+.+......+..+. ....++++.++..+++.++... . T Consensus 331 ~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~l--~~~pg~vi~~~~~~~~~~l~~~~~ 408 (651) T protein:vir:80 331 PTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDV--YTEPGKVFLVSDHGDLQPLANQSS 408 (651) T ss_pred ecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHHh--hcCCCceEEecCCCCceeeccCcc Confidence 345799999999999999999999999999999999987753322222222 2345667777878888887654 3 Q ss_pred ChHHHHHHHHHHHHHHHHHhCcccccccc---ccCcchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhccCC Q lcl|NC_021326. 283 PVENSKKYLDELYQKIMLFGQAVDFSSDK---FGSAPSGVALEFLYTNLNLKADKLARKAKV-AIQELLWFVFEHFDIKG 358 (445) Q Consensus 283 ~~~~~~~~i~~l~~~i~~~s~~p~~~~~~---~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~-~l~~~~~~~~~~~~~~~ 358 (445) +.......++.+...+...++++.+..+. ..++.+|.++......+........+.|.. +++.+++.++.++.... T Consensus 409 ~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~ 488 (651) T protein:vir:80 409 NFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFT 488 (651) T ss_pred cchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 45566778999999999999998765442 223557777777767777766666666655 44544444444332211 Q ss_pred Cc---------------------ceEEEEeCCCCCCCH---HHHHHHHHHH------hccCC---h-----HH---HHHh Q lcl|NC_021326. 359 EH---------------------KDVDISFNYNKVANT---ELQVQTAQQS------MGIVS---H-----ET---VLEN 397 (445) Q Consensus 359 ~~---------------------~~i~v~f~~~~p~d~---~~~~~~~~~~------~g~~s---~-----et---~l~~ 397 (445) +. .++.+.+.- .+... .+..+.+.++ .+..+ . .. .++. T Consensus 489 ~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~i-v~~g~~~~~~r~~~~~~l~~~~q~~~~~p~~~~~~~~~~~~~~l~~~ 567 (651) T protein:vir:80 489 DQPGMVRVAGDEAGAYEYYELDVEDLQKEVRL-VPIGSDHVIERKQYIEDRLTFIQAVAQVPEMGQLVDYKRILVDLLQH 567 (651) T ss_pred CcccceeecccccccccccccCccceeeeeee-eeccHHHHHHHHHHHHHHHHHHHhhccCCccchhhhHHHHHHHHHHH Confidence 10 122222211 11121 1222222221 11111 1 01 1222 Q ss_pred CCCCCCHHHHHH-----HHHHHHHHHHHhhhcccc--CCCCCCCCCCCCC--CcCCC Q lcl|NC_021326. 398 HPFVEDLQAELE-----RIEQEQMEYNKQLPNLDD--GGADGAQQKERSN--DKQSE 445 (445) Q Consensus 398 l~~~~d~~~E~~-----ri~~E~~~~~~~~~~~~~--~~~~~~~~~~~~~--d~~~~ 445 (445) ++ +.++..=+. .....+.+...++..... .......+.+... ..+.| T Consensus 568 ~g-~~~~~~~l~~~~q~~~~~~~~~~~~q~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 623 (651) T protein:vir:80 568 WG-FEEPEAYLKQQDQQAPANPQEALLSQAKDVGGQAMSNMLQNQLQADGGTQMMSE 623 (651) T ss_pred cC-CCCcHHhcCCCccchhhhhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22 222221000 000000000000000000 0000000000000 00000 No 92 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=99.68 E-value=5.5e-16 Score=104.25 Aligned_cols=431 Identities=12% Similarity=0.094 Sum_probs=203.9 Q ss_pred ChHHHH---HHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeee--c Q lcl|NC_021326. 1 MIVRYI---KQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF--K 75 (445) Q Consensus 1 ~l~~~i---~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~--~ 75 (445) .+..+. ..+.+-+....+..+||.|+| ... ..........++ .+++|.++.+|+..+++...+.+.+ . T Consensus 26 ~~~~~~~~~~~q~~~r~~a~~d~~fy~G~Q--W~~---~~~~~l~~~g~p--~~~~N~i~~~v~~v~g~~~~nr~d~~v~ 98 (772) T protein:vir:10 26 EYADINYEIEDQPAWRAVADKEMDYADGNQ--LDT---ELLRRQQALGIP--PAVEDLIGPALLSLQGYEAVTRTDWRVT 98 (772) T ss_pred HHHHHHHHHhccHHHHHHHHHHHHhhcCCC--CCH---HHHHHHHhcCCC--cEEEcchHHHHHHHHHHHHhcCcceEEe Confidence 222222 223344455566788999986 111 111111222233 4789999999999999987776544 3 Q ss_pred cC----chHHHHHH----HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC---CcEEEEEEccceeEEEEcCCCCCc Q lcl|NC_021326. 76 HT----DDEVIKRI----DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE---GEFKLFRVPAEQGIPIWTDKEHEE 143 (445) Q Consensus 76 ~~----d~~~~~~l----~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~---g~~~i~~~~p~~~~~v~d~~~~~~ 143 (445) +. +.+..+.+ +.+++ ++.......+..+++++|.+|+-++.+.+ +.++|..++|.+++ ||+..... T Consensus 99 Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d~~~~~i~i~~v~p~~v~--~Dp~a~~D 176 (772) T protein:vir:10 99 PNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIACGIGWVEVSRESDPFKFPYRCRPIRRDEIH--WDMKCGDD 176 (772) T ss_pred cCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEeccccCCCCCCeEEEeeCcccce--ecCCCCCC Confidence 32 22334433 34443 67888899999999999999999888754 46889999999865 55533222 Q ss_pred e---EEEE-EEEee----------------------------------ec---------------------------ce- Q lcl|NC_021326. 144 L---EAFI-RMYKL----------------------------------EN---------------------------ET- 157 (445) Q Consensus 144 ~---~~~v-~~~~~----------------------------------~~---------------------------~~- 157 (445) + .+++ ..|.. ++ .. T Consensus 177 ~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r 256 (772) T protein:vir:10 177 WEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTSTGLHNAWNEARAWTVQEDHWYNPTSKE 256 (772) T ss_pred HHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCcccccccccccccccccccchhhccccccccccccCCce Confidence 1 1111 10100 00 00 Q ss_pred --eEEEEecceEEEEEEe--cceeeecc---------------------------cccccccccc--cccccccccceEE Q lcl|NC_021326. 158 --KVEYWDKITVNYYVYE--NGSLIPDY---------------------------SNNLENSKTH--FSTGSWGKIPFIP 204 (445) Q Consensus 158 --~~~~~~~~~~~~~~~~--~~~~~~~~---------------------------~~~~~~~~~~--~~~~~~g~iPvv~ 204 (445) -+++|+.......+.. .+..+... ..-.+..... ..|.+.+.+|+|+ T Consensus 257 Vrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~~~~~fP~vP 336 (772) T protein:vir:10 257 ICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVTVSRVRRSYWLGPHCLHDGPTPYTHRHFPYVP 336 (772) T ss_pred EEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheeeeeEEEEEEEecceeeccCCCCCCCCccceEE Confidence 1122222111111000 00000000 0000111111 1233445678776 Q ss_pred ecCC---CC--cCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhh-hhCceeeccCCC----c Q lcl|NC_021326. 205 FKNN---DL--EISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLL-RYYGAIKVSDNG----G 274 (445) Q Consensus 205 ~~n~---~~--g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~-~~~~~~~~~~~~----~ 274 (445) +.-. .. ..|.+..+++.++.+|...|.+...+...+ +..=.|........+.... +.+.++.+..+. + T Consensus 337 ~~g~r~~~~g~~~G~vr~~kd~Qr~~N~~~S~~~~~l~~~~--~~~~~gav~~~d~~~~e~~arp~~vi~~~~~~~~~~~ 414 (772) T protein:vir:10 337 FFGFREDATGIPYGYVRGMKYAQDSLNSGVSKLRWGMSVAR--VERTKGAVAMTDAQFRRQIARPDADIVLDENHMAKPG 414 (772) T ss_pred EeeeEeccCCcccchhhhhhhHHHHHHHHHHHHHHHHhccc--ccccCCCccchhHHHHHhccCCCCeEEeCCccccCCC Confidence 6532 12 357888999999999999999888775543 2222333332222333333 334455554431 2 Q ss_pred --eeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HH Q lcl|NC_021326. 275 --VDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQEL----LW 348 (445) Q Consensus 275 --~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~----~~ 348 (445) ++......-..++...+......|...|++.+.+.+..++..||+||...-..-..........+..+.+++ +. T Consensus 415 ~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~ 494 (772) T protein:vir:10 415 ARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRKGTATSGIQEQQQIEQSNQSIGRIMDNFRAGRTLVGELLLA 494 (772) T ss_pred CCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222222223456777788888889999998887777777889999998765554444445555555555555 44 Q ss_pred HHHHHhccC-------CCcc------------------------eE-----EEEe--CCCCCCCHHHHHHHHHHHhccCC Q lcl|NC_021326. 349 FVFEHFDIK-------GEHK------------------------DV-----DISF--NYNKVANTELQVQTAQQSMGIVS 390 (445) Q Consensus 349 ~~~~~~~~~-------~~~~------------------------~i-----~v~f--~~~~p~d~~~~~~~~~~~~g~~s 390 (445) +|..++... .+.. +| +|+. .+..+.=..+.++.++++.+.++ T Consensus 495 li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~yDv~i~~~p~~~t~r~~~~~~m~ql~~~~~ 574 (772) T protein:vir:10 495 MIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRIKVALEDVPSTNSYRGQQLNAMSEAVKSMP 574 (772) T ss_pred HHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccceeccceeeeEEEEeeccccchHHHHHHHHHHHHHHhccC Confidence 444443211 0100 11 1111 11111112333444445544444 Q ss_pred hHHHH-------HhCCCCCCHHHHHHHHHHHH----------------HHHHH-hhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 391 HETVL-------ENHPFVEDLQAELERIEQEQ----------------MEYNK-QLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 391 ~et~l-------~~l~~~~d~~~E~~ri~~E~----------------~~~~~-~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) .+... +.+ .....++-.+++++-. +...+ ...+...-.... +-...+.+..- T Consensus 575 P~~~~~~~~~~le~~-D~p~~~ei~~~ir~~~~~~~peq~~~~~~q~~qq~~~~~~~el~~~q~~a--~~~~~~A~a~~ 650 (772) T protein:vir:10 575 PQYQAAVLPFLVSLM-DVPFKRDVVEAIRAVDQQQTPEQIQQQIDQAVQDALAKAGNDIKLRELEI--KERKADSEISG 650 (772) T ss_pred hhHHHHHHHHHHhhc-CCCChHHHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHH Confidence 43322 222 1122222233333211 00000 000000000000 00000000000 No 93 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.61 E-value=1.6e-14 Score=96.18 Aligned_cols=386 Identities=9% Similarity=0.003 Sum_probs=199.5 Q ss_pred HHHHHHHHHHhcCCCccccccccccccccccccccccc-----cccchHHHHHHHHHhhhhccCeeeccCch--HHHHHH Q lcl|NC_021326. 13 LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDR-----MITNFHANLVDQKVSYIVGKPIAFKHTDD--EVIKRI 85 (445) Q Consensus 13 ~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~r-----i~~n~~~~iv~~~~~~l~g~~~~~~~~d~--~~~~~l 85 (445) ....+-+...--|-- ...+................ -.+.+++.+|+..+.-++.+++.+++++. +..+.+ T Consensus 1 ~~~~D~~~~~~~~~g---~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~d~~~~~~~~~ 77 (437) T protein:vir:52 1 MKFFDGIKSLALKLG---SKQEQTYYSPSLSLTDDLVQLEALWRDNWIANKVCIKRPEDMVRNWREIYSNDLNSKQLDLF 77 (437) T ss_pred CchhhhhHhHHhcCC---CccccceeecCccccccHHHHHHHHHhCchhhHHhhcchHHhhcCCceEecCCCCHHHHHHH Confidence 111111222221110 00000000000000000000 13578899999999999999999988643 333456 Q ss_pred HHHhcc-CHHHHHHHHHHHHHhcCeEEEEEEECCC---------CcEE-EEEEccceeEEEEcC-CCCCceEEE-EEEEe Q lcl|NC_021326. 86 DEVLGN-RFDDKLHSVLTGASNKGIEWLHPYLDEE---------GEFK-LFRVPAEQGIPIWTD-KEHEELEAF-IRMYK 152 (445) Q Consensus 86 ~~~~~n-~~~~~~~~~~~~~~~~G~~~~~v~~d~~---------g~~~-i~~~~p~~~~~v~d~-~~~~~~~~~-v~~~~ 152 (445) +..++. ++...+.++.+.+-.||.+++++-.|.. |.++ +.+++|.++.|.... .+...+.++ ..+|. T Consensus 78 ~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~~dp~s~~fg~p~~y~ 157 (437) T protein:vir:52 78 TKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKDDDVLSPNFGRYSEYS 157 (437) T ss_pred HHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhccccccccccccccccCcceEEE Confidence 666654 6788888999999999999999877642 3333 667777666543211 111111111 11111 Q ss_pred eecceeEEEEecceEEEEEEecceeeeccccccccccccccccccc-ccceEEecCCCCcCccHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 153 LENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWG-KIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSD 231 (445) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~ 231 (445) +........+.+.++ .|.-+ .+| ...++-+|.|.++.+.+-+.+++.+.-. T Consensus 158 v~~~~~~~~iH~SRi--------------------------i~~~~~~~~--~~~~~~~G~s~le~~~~~i~~~~~~~~~ 209 (437) T protein:vir:52 158 ILGGSQSITVHHSRL--------------------------IILNANDAP--LSDNDIWGVSDLEKIIDVLKRFDSASVN 209 (437) T ss_pred EecCCcceeEcccee--------------------------EEecCccCC--CccccccCCchHHHHHHHHHHHHHHHHH Confidence 111100000000010 11111 122 1224456999999999999999999888 Q ss_pred HHHHHHHhcCCeeEEecCCc----ccchhH---HH---h-hhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHH Q lcl|NC_021326. 232 LSNTFKDSNELTYVLTNYDD----QELPEF---KR---L-LRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIML 300 (445) Q Consensus 232 ~~~~~~~~~~~~l~~~g~~~----~~~~~~---~~---~-~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~ 300 (445) ....+.....+.+.+.|... ...... .. . ....+++.++.+.+ |-+.+.+...+...++.....|.. T Consensus 210 ~~~l~~~~~~~v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~--~e~~~~~~sgl~~~l~~~~~~iaa 287 (437) T protein:vir:52 210 VGDLIFESKIDIFKIAGLSDKIAAGMENEVASVISAVQEIKSATNSLLLDAENE--YDRKELTFTGLKDLLTEFRNAVAG 287 (437) T ss_pred HHHHHHHcCCCceecchHHHHhcCCcHHHHHHHHHHHHHhcCCCceEEEcCCcc--eEEEecCcCCHHHHHHHHHHHHHH Confidence 88778777777776665311 111111 11 1 12345666666554 444556677788889999999999 Q ss_pred HhCcccccc--ccccCcchHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHH Q lcl|NC_021326. 301 FGQAVDFSS--DKFGSAPSGVALEFLYTNLNLKADKLA-RKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTEL 377 (445) Q Consensus 301 ~s~~p~~~~--~~~~~~~Sg~Ai~~~~~~l~~k~~~~~-~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~ 377 (445) .+++|-.-+ ...+|-.||..-...+. ..++..+ ..+.+.+++++.+++....... ..++.++|++-...+..+ T Consensus 288 a~~iP~t~L~G~s~~Glasge~D~~~yy---d~i~~~Qe~~l~p~le~l~~~i~~~~~g~~-~~~~~~~f~pL~~~s~ke 363 (437) T protein:vir:52 288 AADMPVTILFGQSVSGLASGDEDIQNYH---EAIRRLQETRLRPIFEIIDPLICNELFGGL-PADWWFEFVPLTTVKQEQ 363 (437) T ss_pred HhcCchhhhcCcCcccccccHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-CCcceEEeCCcCCcCHHH Confidence 999996433 22233345554443333 3444444 5688899999998875433222 246889999998889888 Q ss_pred HHHHHHHH---------hccCChHHHHHhC------CCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCC-CCCCCCC Q lcl|NC_021326. 378 QVQTAQQS---------MGIVSHETVLENH------PFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQ-QKERSND 441 (445) Q Consensus 378 ~~~~~~~~---------~g~~s~et~l~~l------~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~-~~~~~~d 441 (445) .+++..+. .|+++.+.+.+.| +.+++ +++ +..+......+....+.. +.....+ T Consensus 364 kae~~~~~a~a~~~~~~~g~i~~~e~r~~L~~~g~~~~i~~--~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (437) T protein:vir:52 364 QINMLNTFATAANTLIQNGVLNEYQIANELRESGLFANISA--EHI--------EELKNADEFAGNFEEPEKMEGAQVQN 433 (437) T ss_pred HHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCCCCc--ccc--------ccccCCCCCCCccCCCCCCCCCCCCC Confidence 88764332 3677777666543 22221 100 001111111111111111 1111111 Q ss_pred cCCC Q lcl|NC_021326. 442 KQSE 445 (445) Q Consensus 442 ~~~~ 445 (445) ++.+ T Consensus 434 ~~~~ 437 (437) T protein:vir:52 434 SEDQ 437 (437) T ss_pred CCCC Confidence 1111 No 94 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.59 E-value=6.1e-13 Score=87.55 Aligned_cols=425 Identities=9% Similarity=0.020 Sum_probs=225.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHH-----HhcCCCcccccccc-ccccccccccccccc-cccchHHHHHHHHHhhhhc-cCe Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQE-----YYEQRPDIVKEPKP-VDATGAVDPLKPDDR-MITNFHANLVDQKVSYIVG-KPI 72 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~-----yy~G~~~i~~~~~~-~~~~~~~~~~~~~~r-i~~n~~~~iv~~~~~~l~g-~~~ 72 (445) ...............|+-... -+.+. +.....+. ..........|+..- ..++|++-+|+..+..++| .++ T Consensus 18 ~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~-~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG~~Gi 96 (505) T protein:vir:96 18 AWYRYVEPQKNAARAFEAARRDRLGKAWLRR-ASRLSADEEIYADLASLVQRAREQSINNPYAKRFYQLLKNNVIGPKGM 96 (505) T ss_pred hhhhhHHHHHHhhhhcccccCCCccccccCC-CCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhcCCCcc Confidence 222222222333334443210 01000 00000000 000111111111110 1357999999999999999 588 Q ss_pred eecc--------CchHHHHHHHHHhc----c---------CHHHHHHHHHHHHHhcCeEEEEEEECCCC--cEEEEEEcc Q lcl|NC_021326. 73 AFKH--------TDDEVIKRIDEVLG----N---------RFDDKLHSVLTGASNKGIEWLHPYLDEEG--EFKLFRVPA 129 (445) Q Consensus 73 ~~~~--------~d~~~~~~l~~~~~----n---------~~~~~~~~~~~~~~~~G~~~~~v~~d~~g--~~~i~~~~p 129 (445) ++.+ .+++.++.++..|. . +|......+.+..+..|.+|+.......+ .+++..++| T Consensus 97 ~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~~lqliep 176 (505) T protein:vir:96 97 TFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVLVREHRGYPNKWGYALQILEC 176 (505) T ss_pred eeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceEEEEeecCCCCcceEEEEech Confidence 7754 25555555544432 1 24445556777888999999887665433 268999999 Q ss_pred ceeEEEEcC--CCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc---eEE Q lcl|NC_021326. 130 EQGIPIWTD--KEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIP 204 (445) Q Consensus 130 ~~~~~v~d~--~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~ 204 (445) +.+-.-++. .....+..+|.+ +..+. .+.|+............ .......+.+|| |+| T Consensus 177 d~l~~~~n~~~~~~~~i~~GIe~---d~~Gr-------~~aY~i~~~hPgd~~~~-------~~~~~~~~~rvpa~~vlH 239 (505) T protein:vir:96 177 DRLDLNYNADLQNGNRIRMSIEL---DAWER-------PVAYHLLVNHPGDNSYC-------YHYAGQTYERVPADEIIH 239 (505) T ss_pred hhcCCCCCcccCCcCeEEeceEE---CCCCc-------eEEEEEeecCCCccccc-------cccccccccccCHhHhhh Confidence 886322211 112334555543 11111 12222222111000000 000112244555 344 Q ss_pred ecC-----CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcc-------cchhHHHhhhhCceeeccCC Q lcl|NC_021326. 205 FKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQ-------ELPEFKRLLRYYGAIKVSDN 272 (445) Q Consensus 205 ~~n-----~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~-------~~~~~~~~~~~~~~~~~~~~ 272 (445) +.. ...|.|.|.+++..+..++...........-.+.-..+++..... ..+.....+....+..++.+ T Consensus 240 ~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG 319 (505) T protein:vir:96 240 TFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQPPEDDQGEIVEEVEAGTYQLLPYG 319 (505) T ss_pred hhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCccccccCccccccCCceeeecCCC Confidence 332 336899999999999888887766666666655545555542111 11111223444556667888 Q ss_pred CceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH Q lcl|NC_021326. 273 GGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKV-AIQELLWFVF 351 (445) Q Consensus 273 ~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~-~l~~~~~~~~ 351 (445) .++++.+.+.+...+..+...+.+.|..-.++|--...+.-++.|-.+.+..+..........+..|.. .++.+++..+ T Consensus 320 e~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l 399 (505) T protein:vir:96 320 IRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLI 399 (505) T ss_pred CeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 899999888888999999999999999988888433322223345556777767666666666665554 3333333322 Q ss_pred HH---hcc-C--CCc--ceEEEEeCC-CCC-CCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHH Q lcl|NC_021326. 352 EH---FDI-K--GEH--KDVDISFNY-NKV-ANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYN 419 (445) Q Consensus 352 ~~---~~~-~--~~~--~~i~v~f~~-~~p-~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~ 419 (445) +. .|. + ... .-..+.|.. ..+ .|....+++.... .|+.|.+.++...+. |+++.++++.+|++... T Consensus 400 ~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~G~--D~~~v~~q~a~e~~~~~ 477 (505) T protein:vir:96 400 SMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNRTRSRSSIIRAAGD--DPEDVFDEIAWEEQLMR 477 (505) T ss_pred HHHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCC--CHHHHHHHHHHHHHHHH Confidence 22 121 1 100 113566743 222 5777777766554 699999999998865 89999999999987654 Q ss_pred Hhh-hccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 420 KQL-PNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 420 ~~~-~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) +.- ....+.........+++++.++| T Consensus 478 ~~Gl~~~~~~~~~~~~~~~~~~~~~~d 504 (505) T protein:vir:96 478 DKGVNPTPPEQESKDATTDEEDDSASD 504 (505) T ss_pred HcCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 432 11122222222233333333333 No 95 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=99.58 E-value=1.2e-13 Score=91.50 Aligned_cols=433 Identities=8% Similarity=0.026 Sum_probs=194.5 Q ss_pred ChHHHHHH-------HHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCee Q lcl|NC_021326. 1 MIVRYIKQ-------HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA 73 (445) Q Consensus 1 ~l~~~i~~-------~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~ 73 (445) .+.++... ..+-......-.+||.|+|= .. . .....+...|..+|.++.+|+..+|+---+.+. T Consensus 7 ~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw--~~--~-----~~~~l~~q~rp~~N~i~~~i~~v~g~~~~nr~d 77 (725) T protein:vir:77 7 RLESILSRFDADWTASDEARREAKNDLFFSRVSQW--DD--W-----LSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPID 77 (725) T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCCCC--CH--H-----HHHHHHhcCCCccccHHHHHHHHHhhHHhCCcc Confidence 34443333 23334456667899999861 10 0 001111222446799999999999987666544 Q ss_pred --ecc---CchHHHHHH----HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC---C---CCcEEEEEE----ccceeE Q lcl|NC_021326. 74 --FKH---TDDEVIKRI----DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLD---E---EGEFKLFRV----PAEQGI 133 (445) Q Consensus 74 --~~~---~d~~~~~~l----~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d---~---~g~~~i~~~----~p~~~~ 133 (445) +.+ ++.+..+.+ +.+.+ ++.....+.+..+++++|.||+-|+.| + ++.++|... +|.+++ T Consensus 78 ~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~~~~~~~~v~ 157 (725) T protein:vir:77 78 VLYRPKDGARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHVI 157 (725) T ss_pred eEEecCCccHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeecccChhhce Confidence 333 333344443 33333 578888889999999999999888643 2 234444433 344443 Q ss_pred EEEcCCCCC-ce----EEEEEEEee----------------------------------ecceeEEEEecceEEEEEE-- Q lcl|NC_021326. 134 PIWTDKEHE-EL----EAFIRMYKL----------------------------------ENETKVEYWDKITVNYYVY-- 172 (445) Q Consensus 134 ~v~d~~~~~-~~----~~~v~~~~~----------------------------------~~~~~~~~~~~~~~~~~~~-- 172 (445) ||+.... .+ .++++.|.. +...-+++|+...+..... T Consensus 158 --~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~E~~~r~~~~~~~~~~ 235 (725) T protein:vir:77 158 --WDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIY 235 (725) T ss_pred --eCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeEEEEEEEEEEEeeEEEEe Confidence 3432110 00 011111110 0001133444333221111 Q ss_pred ec---ceeeecc-------------c-------------------ccccccccccccccccccceEEecC-------CCC Q lcl|NC_021326. 173 EN---GSLIPDY-------------S-------------------NNLENSKTHFSTGSWGKIPFIPFKN-------NDL 210 (445) Q Consensus 173 ~~---~~~~~~~-------------~-------------------~~~~~~~~~~~~~~~g~iPvv~~~n-------~~~ 210 (445) .. +...... . ..+........+.+.+.+|+|+|.- .+. T Consensus 236 ~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~~g~~~ 315 (725) T protein:vir:77 236 QDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEV 315 (725) T ss_pred cCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCCccceEEEeeeeeccCCccc Confidence 11 0000000 0 0000000111233345677776532 122 Q ss_pred cCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEE-ecCCcccchhHHHhhhhCce-----eeccCC----CceeeEec Q lcl|NC_021326. 211 EISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL-TNYDDQELPEFKRLLRYYGA-----IKVSDN----GGVDTIQV 280 (445) Q Consensus 211 g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~-~g~~~~~~~~~~~~~~~~~~-----~~~~~~----~~~~~l~~ 280 (445) +.|.+.++++.++.+|...|.+...+-..+.-..++ .|.. +.............. +...++ +.+..... T Consensus 316 ~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~ 394 (725) T protein:vir:77 316 YEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYPYYLLNRTDENSGDLPTQPLAYYEN 394 (725) T ss_pred ccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhh-hHHHHHHHhccCCceecccccccCCCcccccCccccCC Confidence 348888999999999999999988776655433322 2211 111111111111101 111111 11222222 Q ss_pred cCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHhcc Q lcl|NC_021326. 281 EVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVF----EHFDI 356 (445) Q Consensus 281 ~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~----~~~~~ 356 (445) +.=...+..+++.....|...|++.+.+.+..++.+||+|+...-......+......+..+.+++.++++ .++.. T Consensus 395 ~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~~~~ 474 (725) T protein:vir:77 395 PEVPQANAYMLEAATSAVKEVATLGVDTEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDV 474 (725) T ss_pred CCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC Confidence 22224555678888889999998887777777778999999888776666666666666666666654444 44321 Q ss_pred C---------CC--c------------------------ceEEEEeCCCCCCCHHHHHHHHHHHhccCCh------HHHH Q lcl|NC_021326. 357 K---------GE--H------------------------KDVDISFNYNKVANTELQVQTAQQSMGIVSH------ETVL 395 (445) Q Consensus 357 ~---------~~--~------------------------~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~------et~l 395 (445) . +. . .++.|.=.|..+.=..+.+..++.+...++. .++. T Consensus 475 ~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~~~~~~l~ 554 (725) T protein:vir:77 475 PRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLL 554 (725) T ss_pred CcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHHHHhccccchhHHHHHH Confidence 1 00 0 1112222222222223334444444333321 1222 Q ss_pred HhCCCC--CCHHHHHHHHHHHHHHHHHhhhccc----------------------cCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 396 ENHPFV--EDLQAELERIEQEQMEYNKQLPNLD----------------------DGGADGAQQKERSNDKQSE 445 (445) Q Consensus 396 ~~l~~~--~d~~~E~~ri~~E~~~~~~~~~~~~----------------------~~~~~~~~~~~~~~d~~~~ 445 (445) ..++.. +..++.++++.++........+... ...........+..--..| T Consensus 555 ~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~e~q~~~~~qq~~~~q~~~e~~q~q~~~~~~qa~~~kaq~e 628 (725) T protein:vir:77 555 QYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQ 628 (725) T ss_pred HhhccccchHHHHHHHHHHhhhhhhhccCCCChhhHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222221 2223445555543322111000000 0000000000000000000 No 96 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.57 E-value=1.1e-12 Score=86.25 Aligned_cols=426 Identities=11% Similarity=-0.017 Sum_probs=214.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc-ccccccccccccccc-cccchHHHHHHHHHhhhhccCeeeccC- Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKP-VDATGAVDPLKPDDR-MITNFHANLVDQKVSYIVGKPIAFKHT- 77 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~-~~~~~~~~~~~~~~r-i~~n~~~~iv~~~~~~l~g~~~~~~~~- 77 (445) ...+....+..-.... .+-..|..+...-.+. ..........|+..- ..++|++.+|+..+..++|.+++..+. T Consensus 13 ~~~~~~~~~~~~a~~~---~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~Gi~~~~~p 89 (530) T protein:vir:38 13 TSLREYAGYHGGGGGF---GGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQDHIVGSFFRLSYRP 89 (530) T ss_pred cchHHHhhhhcccCCC---CCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhhCCCceeeecc Confidence 1222222221000000 0000111100000000 000111111111110 135799999999999999999877542 Q ss_pred -----------chHHHHH----HHHHhcc-----------CHHHHHHHHHHHHHhcCeEEEEEEECCCC----cEEEEEE Q lcl|NC_021326. 78 -----------DDEVIKR----IDEVLGN-----------RFDDKLHSVLTGASNKGIEWLHPYLDEEG----EFKLFRV 127 (445) Q Consensus 78 -----------d~~~~~~----l~~~~~n-----------~~~~~~~~~~~~~~~~G~~~~~v~~d~~g----~~~i~~~ 127 (445) +++.++. |+.|-.+ +|.....-+.+..++.|.+|+.+..++++ .+++..+ T Consensus 90 ~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~g~~~~~~lq~i 169 (530) T protein:vir:38 90 SWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWDSDSTRLFRTQFKMV 169 (530) T ss_pred chhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEeeeccCCCCccceEEEEe Confidence 2333344 3333221 34455556677889999999988766543 2689999 Q ss_pred ccceeEEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC Q lcl|NC_021326. 128 PAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN 207 (445) Q Consensus 128 ~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n 207 (445) +|+.+---++......+..+|.+ +..+. .+.|+........ .....+.........+.--|+|+.. T Consensus 170 e~d~l~~~~~~~~~~~i~~GIe~---d~~Gr-------~~aY~i~~~~~~~----~~~~~~~~~~~~~~v~a~~vlH~f~ 235 (530) T protein:vir:38 170 SPKRVSNPNNIGDTRNCRAGVKI---NDSGA-------ALGYYVSDDGYPG----WMAQNWTYIPRELPGGRPSFIHVFE 235 (530) T ss_pred chhhcCCCCCCCCCCeeEeeeEE---CCCCc-------eEEEEEeeccCCC----ccccccceeeeeeccChhHeEeecc Confidence 99886432322233445565543 11111 1222222111000 0000010000011122222555543 Q ss_pred -----CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCc-------------ccchhH------------ Q lcl|NC_021326. 208 -----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDD-------------QELPEF------------ 257 (445) Q Consensus 208 -----~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~-------------~~~~~~------------ 257 (445) ...|.|.|.+++..+..++.............+.-..+++.... ...... T Consensus 236 ~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (530) T protein:vir:38 236 PMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYY 315 (530) T ss_pred ccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCcccccccccccchhhhhcc Confidence 33689999999988888887765555555554444444442111 000000 Q ss_pred ---HHhhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 258 ---KRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADK 334 (445) Q Consensus 258 ---~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~ 334 (445) ...+....+..+..+.++++.+.+.+...+..+...+.+.|....++|--.+.+.-+..|-.+.+..+......... T Consensus 316 ~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~ 395 (530) T protein:vir:38 316 SAAPVRLGGARVPHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSYSTARASANESWAYFMG 395 (530) T ss_pred cccceeccCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHH Confidence 00133444556778888999988888889999999999999998888843332211234555666666666666666 Q ss_pred HHHHHHHH-HHHHHHHHHH--Hh-cc---CC----Cc-----ceEEEEeCCC--CCCCHHHHHHHHHHH--hccCChHHH Q lcl|NC_021326. 335 LARKAKVA-IQELLWFVFE--HF-DI---KG----EH-----KDVDISFNYN--KVANTELQVQTAQQS--MGIVSHETV 394 (445) Q Consensus 335 ~~~~~~~~-l~~~~~~~~~--~~-~~---~~----~~-----~~i~v~f~~~--~p~d~~~~~~~~~~~--~g~~s~et~ 394 (445) .+..+... ++.+++..++ ++ |. +. ++ .-..+.|..+ .-.|....+++.... .|+.|.+.+ T Consensus 396 ~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~ 475 (530) T protein:vir:38 396 RRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKE 475 (530) T ss_pred HHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHH Confidence 66655443 3333332222 21 11 11 11 0134566432 235777777666554 699999999 Q ss_pred HHhCCCCCCHHHHHHHHHHHHHHHHHh-hhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 395 LENHPFVEDLQAELERIEQEQMEYNKQ-LPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 395 l~~l~~~~d~~~E~~ri~~E~~~~~~~-~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) +...+. |+++.++++++|++...+. +....+.....+......+.+++| T Consensus 476 ~a~~G~--D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~d 525 (530) T protein:vir:38 476 CAKRGD--DYQEIFAQQVRESMERRAAGLNPPAWAAAAFEAGVKKSNEEEQD 525 (530) T ss_pred HHHcCC--CHHHHHHHHHHHHHHHHHcCCCCCCCcccccCCCCCCCCCCCCC Confidence 988865 8999999999998776554 211111111111111111111111 No 97 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.56 E-value=1.2e-12 Score=85.86 Aligned_cols=424 Identities=10% Similarity=-0.021 Sum_probs=215.2 Q ss_pred ChHHHHHHHHHHHHHHHHH----HHHhcCCCcccccccc-ccccccccccccccc-cccchHHHHHHHHHhhhhccCeee Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIG----QEYYEQRPDIVKEPKP-VDATGAVDPLKPDDR-MITNFHANLVDQKVSYIVGKPIAF 74 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~----~~yy~G~~~i~~~~~~-~~~~~~~~~~~~~~r-i~~n~~~~iv~~~~~~l~g~~~~~ 74 (445) -+..+... .....|... .+-..|.++...-++. ..........|+..- ..++|++-.|+..+++++|.+++. T Consensus 11 ~~~~~~~~--~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~Gi~~ 88 (533) T protein:vir:34 11 GPDGMTSL--REYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQDHIVGSFFRL 88 (533) T ss_pred cccccchH--HHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhhCCCcee Confidence 11111111 111122211 1111121111111100 000111111111111 135799999999999999999887 Q ss_pred ccC------------chHHHHH----HHHHhcc-----------CHHHHHHHHHHHHHhcCeEEEEEEECCCC----cEE Q lcl|NC_021326. 75 KHT------------DDEVIKR----IDEVLGN-----------RFDDKLHSVLTGASNKGIEWLHPYLDEEG----EFK 123 (445) Q Consensus 75 ~~~------------d~~~~~~----l~~~~~n-----------~~~~~~~~~~~~~~~~G~~~~~v~~d~~g----~~~ 123 (445) .+. +++.++. |+.|.++ +|......+++..++.|.+|+.....+.+ .++ T Consensus 89 ~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~g~~~~~~ 168 (533) T protein:vir:34 89 SHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDTSSSRLFRTQ 168 (533) T ss_pred eeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeeeccCCCCccceE Confidence 652 2233333 4444322 34455556677889999999998766543 368 Q ss_pred EEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc-- Q lcl|NC_021326. 124 LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP-- 201 (445) Q Consensus 124 i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP-- 201 (445) +..++|+.+-.-++......+..+|.+ +..+. .+.|+........ .....++.. -....+| T Consensus 169 lq~ie~d~l~~~~~~~~~~~i~~GIe~---d~~Gr-------~~aY~i~~~~~~~----~~~~~~~~~---~~~~~v~a~ 231 (533) T protein:vir:34 169 FRMVSPKRISNPNNTGDSRNCRAGVQI---NDSGA-------ALGYYVSEDGYPG----WMPQKWTWI---PRELPGGRA 231 (533) T ss_pred EEEechhhcCCCCCCCCCCceEeeeEE---CCCCC-------eEEEEEeecCCCC----cccccccee---eeeeccChh Confidence 899999886433332233445555543 11111 1222222111000 000000000 0011223 Q ss_pred -eEEecC-----CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcc-------------cc-hhHH--- Q lcl|NC_021326. 202 -FIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQ-------------EL-PEFK--- 258 (445) Q Consensus 202 -vv~~~n-----~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~-------------~~-~~~~--- 258 (445) |+|+.. ...|.|.|.+++..+..++.............+.-..+++..... .. +... T Consensus 232 ~VlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (533) T protein:vir:34 232 SFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWI 311 (533) T ss_pred HeeeeccccCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCcccccccccccc Confidence 444432 346899999999988888877665555555554444444422110 00 0000 Q ss_pred -----------HhhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHH Q lcl|NC_021326. 259 -----------RLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTN 327 (445) Q Consensus 259 -----------~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~ 327 (445) ..+....+..+..+.++++.+.+.+...+..+...+.+.|....++|--...+.-++.|-.+.+..+.. T Consensus 312 ~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e 391 (533) T protein:vir:34 312 GEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANE 391 (533) T ss_pred hhhhhccCcceeeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHH Confidence 013444456678888999998888888999999999999998888874332222123455556666666 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHH--Hh-cc---CC----Ccc-----eEEEEeCCC--CCCCHHHHHHHHHHH--hc Q lcl|NC_021326. 328 LNLKADKLARKAKVAI-QELLWFVFE--HF-DI---KG----EHK-----DVDISFNYN--KVANTELQVQTAQQS--MG 387 (445) Q Consensus 328 l~~k~~~~~~~~~~~l-~~~~~~~~~--~~-~~---~~----~~~-----~i~v~f~~~--~p~d~~~~~~~~~~~--~g 387 (445) ........+..|...+ +.+++..++ ++ |. +. ++. -..+.|..+ .-.|....+++.... .| T Consensus 392 ~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G 471 (533) T protein:vir:34 392 SWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAG 471 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcC Confidence 6666666665554432 223332221 11 11 11 110 134667432 235777777766654 69 Q ss_pred cCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHh-hhccccCCCCCC-CCCCCCCCcCCC Q lcl|NC_021326. 388 IVSHETVLENHPFVEDLQAELERIEQEQMEYNKQ-LPNLDDGGADGA-QQKERSNDKQSE 445 (445) Q Consensus 388 ~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~-~~~~~~~~~~~~-~~~~~~~d~~~~ 445 (445) +.|.+.++...+. |+++.++++++|.+...+. +....+...... ...+++++++.+ T Consensus 472 ~~s~~~~~a~~G~--D~~ev~~q~a~e~~~~~~~gl~~~~~~~~~~~s~~~~~~~~~~~~ 529 (533) T protein:vir:34 472 LSTYEKECAKRGD--DYQEIFAQQVRETMERRAAGLKPPAWAAAAFESGLRQSTEEEKSD 529 (533) T ss_pred CCCHHHHHHHcCC--CHHHHHHHHHHHHHHHHhcCCCCCCCCCcCccCCCCCCCCCCccc Confidence 9999999998865 8999999999998776554 211111111111 111112222222 No 98 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=99.56 E-value=1.8e-14 Score=95.97 Aligned_cols=430 Identities=9% Similarity=0.042 Sum_probs=196.0 Q ss_pred ChHHHHHHH-------HHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCee Q lcl|NC_021326. 1 MIVRYIKQH-------LEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA 73 (445) Q Consensus 1 ~l~~~i~~~-------~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~ 73 (445) .+.++...+ .+-.....+-.+||.|+|= .. . .....+...|..+|.++.+|+..+|+---+.+. T Consensus 7 ~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~QW--~~--~-----~~~~l~~q~rp~~N~i~~~v~~v~g~e~~nr~d 77 (725) T protein:vir:10 7 RLESILSRFDADWTASDEARREAKNDLFFSRVSQW--DD--W-----LSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPID 77 (725) T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCC--CH--H-----HHHHHHhcCCCcccchHHHHHHHHhhHHhCCcc Confidence 444444333 3334456667889999861 10 0 001111222345799999999999997665544 Q ss_pred e--cc---CchHHHHHH----HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC---CC---CcEEEEEE----ccceeE Q lcl|NC_021326. 74 F--KH---TDDEVIKRI----DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLD---EE---GEFKLFRV----PAEQGI 133 (445) Q Consensus 74 ~--~~---~d~~~~~~l----~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d---~~---g~~~i~~~----~p~~~~ 133 (445) + .+ ++.+..+.+ +.+.+ ++.....+.+..+++++|.||+-|..| ++ +.++|... +|.+++ T Consensus 78 ~~v~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~~~~v~ 157 (725) T protein:vir:10 78 VLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHVI 157 (725) T ss_pred eEEecCCcchHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCCceeeeeeecccCHhHcc Confidence 3 33 333444443 33333 578888889999999999999887543 22 34444433 344444 Q ss_pred EEEcCCCC-Cc---eE-EEEEEEeeec-------------------------------c---eeEEEEecceEEEEE--E Q lcl|NC_021326. 134 PIWTDKEH-EE---LE-AFIRMYKLEN-------------------------------E---TKVEYWDKITVNYYV--Y 172 (445) Q Consensus 134 ~v~d~~~~-~~---~~-~~v~~~~~~~-------------------------------~---~~~~~~~~~~~~~~~--~ 172 (445) ||+... .. .. ++++.|.... . ..+++|....+.... . T Consensus 158 --~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~~~~ 235 (725) T protein:vir:10 158 --WDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIY 235 (725) T ss_pred --cCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeEEEEEEEEEEEEeeEEEEe Confidence 443210 00 11 1111111100 0 012333322221111 1 Q ss_pred ec---ceeeecccc--------------------------------cccccccccccccccccceEEecC-------CCC Q lcl|NC_021326. 173 EN---GSLIPDYSN--------------------------------NLENSKTHFSTGSWGKIPFIPFKN-------NDL 210 (445) Q Consensus 173 ~~---~~~~~~~~~--------------------------------~~~~~~~~~~~~~~g~iPvv~~~n-------~~~ 210 (445) .+ +........ .+........+.+.+.+|+|+|.- .+. T Consensus 236 ~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~fP~vP~~g~r~~~~g~~~ 315 (725) T protein:vir:10 236 QDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEV 315 (725) T ss_pred ccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCCceeEEEEEeeeeccCCcce Confidence 10 110000000 000000111233335577776532 122 Q ss_pred cCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeec---c-CCC-----ceeeEecc Q lcl|NC_021326. 211 EISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKV---S-DNG-----GVDTIQVE 281 (445) Q Consensus 211 g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~---~-~~~-----~~~~l~~~ 281 (445) +.|.+.++++.++.+|...|.+...+...+.-.+.+.....+..............+.. . .+| .+.+...+ T Consensus 316 ~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~ 395 (725) T protein:vir:10 316 YEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYENP 395 (725) T ss_pred eeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCceeeecccccccCcccccccCcccCCC Confidence 34889999999999999999999887665544333322111111111111111111111 1 111 12222222 Q ss_pred CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHhccC Q lcl|NC_021326. 282 VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFV----FEHFDIK 357 (445) Q Consensus 282 ~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~----~~~~~~~ 357 (445) .-..++...++.....|...|++.+.+.+..+++.||+|+.................+..+.+++.+++ ..+++.. T Consensus 396 ~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~lI~~~~~~e 475 (725) T protein:vir:10 396 EVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVP 475 (725) T ss_pred CchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCC Confidence 223466668888889999999988777777778899999988876666665556666666665554444 4443211 Q ss_pred ------C-C----c------------------------ceEEEEeCCCCCCCHHHHHHHHHHHhccCCh------HHHHH Q lcl|NC_021326. 358 ------G-E----H------------------------KDVDISFNYNKVANTELQVQTAQQSMGIVSH------ETVLE 396 (445) Q Consensus 358 ------~-~----~------------------------~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~------et~l~ 396 (445) + + . .++.|.-.|..+.-..+.+..++.+.+.++. ..++. T Consensus 476 r~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~~~~~~l~~ 555 (725) T protein:vir:10 476 RNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILELLGKTPQGTPEYQLLLLQ 555 (725) T ss_pred cEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHHHHhccccchhHHHHHHH Confidence 0 0 0 1122222333332233444444454433331 22333 Q ss_pred hCCC--CCCHHHHHHHHHHHHHHHH----------------------Hhhhcc-------ccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 397 NHPF--VEDLQAELERIEQEQMEYN----------------------KQLPNL-------DDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 397 ~l~~--~~d~~~E~~ri~~E~~~~~----------------------~~~~~~-------~~~~~~~~~~~~~~~d~~~~ 445 (445) .++. .+..++..++|.++..... +..... .....+..+ ...+... T Consensus 556 ~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e~~q~~~~~~~~qae~~k----a~aE~~k 631 (725) T protein:vir:10 556 YFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAK----AQNQTLS 631 (725) T ss_pred HhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHH----HHHHHHH Confidence 3322 2223333444443221100 000000 000000000 0000000 No 99 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.54 E-value=2e-12 Score=84.70 Aligned_cols=425 Identities=8% Similarity=-0.025 Sum_probs=213.5 Q ss_pred ChHHHHHHHHHHHHHHHHH---HHHhcCCCcccccccc-ccccccccccccccc-cccchHHHHHHHHHhhhhccCeeec Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIG---QEYYEQRPDIVKEPKP-VDATGAVDPLKPDDR-MITNFHANLVDQKVSYIVGKPIAFK 75 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~---~~yy~G~~~i~~~~~~-~~~~~~~~~~~~~~r-i~~n~~~~iv~~~~~~l~g~~~~~~ 75 (445) -..+.... ...|.-. .+...|-.+...-.+. ..........|+..- -.++|++-+|+..++.++|.+++.. T Consensus 17 ~~~~~~~~----~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG~Gi~~~ 92 (553) T protein:vir:63 17 PEQSASLG----GGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGYQRDSIVGAQYRLN 92 (553) T ss_pred chhhhhhh----cccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhccCCceee Confidence 11111111 1122211 0111111110000000 000011111111110 1247999999999999999998875 Q ss_pred cC-------------chHHHH----HHHHHhcc-----------CHHHHHHHHHHHHHhcCeEEEEEEECCCC----cEE Q lcl|NC_021326. 76 HT-------------DDEVIK----RIDEVLGN-----------RFDDKLHSVLTGASNKGIEWLHPYLDEEG----EFK 123 (445) Q Consensus 76 ~~-------------d~~~~~----~l~~~~~n-----------~~~~~~~~~~~~~~~~G~~~~~v~~d~~g----~~~ 123 (445) +. ++..++ .|+.|.++ +|......+++..++.|.+|+.....+.+ .++ T Consensus 93 ~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~~~~~~~~ 172 (553) T protein:vir:63 93 SMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLATAEWDRAANRPYATC 172 (553) T ss_pred eccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEEeeeccCCCCcccce Confidence 42 122233 34444322 34445555677889999999887665432 357 Q ss_pred EEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecc--eeeecccccccccccccccccccccc Q lcl|NC_021326. 124 LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENG--SLIPDYSNNLENSKTHFSTGSWGKIP 201 (445) Q Consensus 124 i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~g~iP 201 (445) +..++|+.+-.-+.......+..+|.+ +..+. .+.|+..... ...........+. .. -.+.++| T Consensus 173 lq~ie~drl~~~~~~~~~~~i~~GVE~---d~~Gr-------~vaY~i~~~hPgd~~~~~~~~~~~~-r~---~~~~~v~ 238 (553) T protein:vir:63 173 FQMVSTDRLSNPYQQLDTPTLRRGVQY---DKRGR-------PQGYWIQVAHPGDLYQMAPDMYKWK-FV---QQSKPWG 238 (553) T ss_pred EEEechhhcCCCCCCCCCCeeEeeeEE---CCCCc-------eEEEEeeccCCCcccccccccccee-ee---ccccccC Confidence 899999887543433334455666543 11111 1122222111 1000000000000 00 0111222 Q ss_pred ---eEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcc-cch----------------- Q lcl|NC_021326. 202 ---FIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQ-ELP----------------- 255 (445) Q Consensus 202 ---vv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~-~~~----------------- 255 (445) |+|+. ....|.|.|.+++..+..++.............+.-..+++..... ... T Consensus 239 a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (553) T protein:vir:63 239 RRQVIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELPPEFIHSQMSGGSPNADMVGIFG 318 (553) T ss_pred hhHheecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCChhhhhhhccccccccccccccc Confidence 33332 2346899999999998888877666555555555444444422110 000 Q ss_pred ------------hHHHhhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHH Q lcl|NC_021326. 256 ------------EFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEF 323 (445) Q Consensus 256 ------------~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~ 323 (445) .....+....+..+..+.++++.+.+.+...+..+...+.+.|....++|--...+.-+..|-.+.+. T Consensus 319 ~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R~ 398 (553) T protein:vir:63 319 KYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANYSSIQA 398 (553) T ss_pred ccccccccccccccceeecCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHH Confidence 00112334445567788899999888888899999999999999888887432222112344455666 Q ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHH--Hh-c---cCCCc------------ceEEEEeCCCC--CCCHHHHHHHH Q lcl|NC_021326. 324 LYTNLNLKADKLARKAKVA-IQELLWFVFE--HF-D---IKGEH------------KDVDISFNYNK--VANTELQVQTA 382 (445) Q Consensus 324 ~~~~l~~k~~~~~~~~~~~-l~~~~~~~~~--~~-~---~~~~~------------~~i~v~f~~~~--p~d~~~~~~~~ 382 (445) .+..........+..|... ++-+++..++ ++ | .+... .-..+.|..+- -.|....+++. T Consensus 399 ~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~ 478 (553) T protein:vir:63 399 GIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAA 478 (553) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHHHHH Confidence 6666666665555555443 2333332222 22 1 11110 11346674433 25777777766 Q ss_pred HHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccc------------cCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 383 QQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLD------------DGGADGAQQKERSNDKQSE 445 (445) Q Consensus 383 ~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~------------~~~~~~~~~~~~~~d~~~~ 445 (445) ... .|+.|.+.++...+. |+++.++++++|.+...+.--... ........+....+.+++| T Consensus 479 ~~~i~~G~~t~~~~~a~~G~--D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 479 VMRIDAGLSTYEREIARLGG--DFRKSFAQRAREDALLKKYGLTFNLSAKRSLGDGRDAATGIAEDPAAAQTSQQGE 553 (553) T ss_pred HHHHHcCCCCHHHHHHHhCC--CHHHHHHHHHHHHHHHHHcCCCCCCCCccccCCCcccCCCCCCCCCCCCcccccC Confidence 654 699999999998865 899999999999876554311000 0011112222223333333 No 100 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=99.53 E-value=4.1e-14 Score=93.98 Aligned_cols=430 Identities=9% Similarity=0.027 Sum_probs=190.6 Q ss_pred ChHHHHHHH-------HHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCee Q lcl|NC_021326. 1 MIVRYIKQH-------LEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA 73 (445) Q Consensus 1 ~l~~~i~~~-------~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~ 73 (445) .+.++...+ .+-.....+-.+||.|+|= .. . .....+...|..+|.++.+|+..+|+---+.+. T Consensus 7 ~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw--~~--~-----~~~~l~~q~rp~~N~i~~~i~~v~g~e~~nr~d 77 (725) T protein:vir:92 7 RLESILSRFDADWTASDEARREAKNDLFFSRISQW--DD--W-----LSQYTTLQYRGQFDVVRPVVRKLVSEMRQNPID 77 (725) T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCC--CH--H-----HHHHHHhcCCCcccchHHHHHHHHhhHHhCCcc Confidence 444444332 3334456667889999861 10 0 001111222346799999999999987655444 Q ss_pred --ecc---CchHHHHHHH----HHhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC---C---CCcEEEEEE---ccc-eeE Q lcl|NC_021326. 74 --FKH---TDDEVIKRID----EVLG-NRFDDKLHSVLTGASNKGIEWLHPYLD---E---EGEFKLFRV---PAE-QGI 133 (445) Q Consensus 74 --~~~---~d~~~~~~l~----~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d---~---~g~~~i~~~---~p~-~~~ 133 (445) +.+ ++....+.+. .+.+ ++.....+.+..+++++|.||+-|+.| + ++.++|... +|. +++ T Consensus 78 ~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~~~~V~ 157 (725) T protein:vir:92 78 VLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHVI 157 (725) T ss_pred eEEecCCccHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeeccCChhhcc Confidence 333 3334444433 3333 578888899999999999999887643 2 244554432 333 232 Q ss_pred EEEcCCCCC-ce----EEEEEEEee----------------------------------ecceeEEEEecceEEEEEE-- Q lcl|NC_021326. 134 PIWTDKEHE-EL----EAFIRMYKL----------------------------------ENETKVEYWDKITVNYYVY-- 172 (445) Q Consensus 134 ~v~d~~~~~-~~----~~~v~~~~~----------------------------------~~~~~~~~~~~~~~~~~~~-- 172 (445) ||+.... .+ .++++.|.. +...-+++|+...+..... T Consensus 158 --~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r~~~~~~~~~~ 235 (725) T protein:vir:92 158 --WDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIY 235 (725) T ss_pred --cCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeEEEEEEEEEEEEeeeEEee Confidence 3332110 00 011111110 0000123333322221111 Q ss_pred ec---ceeeeccc--------------------------------ccccccccccccccccccceEEecC-------CCC Q lcl|NC_021326. 173 EN---GSLIPDYS--------------------------------NNLENSKTHFSTGSWGKIPFIPFKN-------NDL 210 (445) Q Consensus 173 ~~---~~~~~~~~--------------------------------~~~~~~~~~~~~~~~g~iPvv~~~n-------~~~ 210 (445) .+ +....... ..+........+.+.+.+|+|+|.- .+. T Consensus 236 ~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~~g~~~ 315 (725) T protein:vir:92 236 QDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEV 315 (725) T ss_pred cCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCceeeEEEEeeeeccCCccc Confidence 00 11100000 0000000111233345577776532 122 Q ss_pred cCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeec---c-CCC-----ceeeEecc Q lcl|NC_021326. 211 EISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKV---S-DNG-----GVDTIQVE 281 (445) Q Consensus 211 g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~---~-~~~-----~~~~l~~~ 281 (445) +.|.+.++++.++.+|...|.+...+-..+.-.+++.-...+..............+.. . .+| .+++...+ T Consensus 316 ~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~~ 395 (725) T protein:vir:92 316 YEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYENP 395 (725) T ss_pred ccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHhccCccceeeccccccccccccccCCcccCCC Confidence 34888999999999999999998877655543333221111111111111111111111 1 111 12222222 Q ss_pred CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHhccC Q lcl|NC_021326. 282 VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLW----FVFEHFDIK 357 (445) Q Consensus 282 ~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~----~~~~~~~~~ 357 (445) .-..++..+++.....|...+++.+...+..+++.||+|+..+-.............+..+.+.+.+ +|..+++.. T Consensus 396 ~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~lI~~~~~~~ 475 (725) T protein:vir:92 396 EVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYDVP 475 (725) T ss_pred CchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 2335666788888899999999877777777788999999887666555555555555555555544 444443211 Q ss_pred ------C-Cc----------------------------ceEEEEeCCCCCCCHHHHHHHHHHHhccCCh------HHHHH Q lcl|NC_021326. 358 ------G-EH----------------------------KDVDISFNYNKVANTELQVQTAQQSMGIVSH------ETVLE 396 (445) Q Consensus 358 ------~-~~----------------------------~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~------et~l~ 396 (445) + +. .++.|.-.|..+.-..+.+..++.+.+.++. .+++. T Consensus 476 r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql~~~~~~~~~~~~~~l~~ 555 (725) T protein:vir:92 476 RNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQ 555 (725) T ss_pred cEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHHHHHHHHHhcccchhHHHHHHHH Confidence 0 00 1111222222222233334444444333321 11222 Q ss_pred hCC--CCCCHHHHHHHHHHHHHHHHH----------------------hhhcc-------ccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 397 NHP--FVEDLQAELERIEQEQMEYNK----------------------QLPNL-------DDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 397 ~l~--~~~d~~~E~~ri~~E~~~~~~----------------------~~~~~-------~~~~~~~~~~~~~~~d~~~~ 445 (445) .++ ..+..++.++++.++...... ..... ....++..+ .+-+... T Consensus 556 ~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~~qa~~~~~qae~~k----aqaE~~k 631 (725) T protein:vir:92 556 YFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAK----AQNQTLS 631 (725) T ss_pred HhhcccchHHHHHHHHHHhhhchhccCCccchhhhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHH----HHHHHHH Confidence 221 111122333444322111000 00000 000000000 0000000 No 101 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=99.52 E-value=1.3e-13 Score=91.29 Aligned_cols=414 Identities=12% Similarity=0.053 Sum_probs=220.0 Q ss_pred ChHHHHHHH----HHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc----C- Q lcl|NC_021326. 1 MIVRYIKQH----LEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK----P- 71 (445) Q Consensus 1 ~l~~~i~~~----~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~----~- 71 (445) .|.+....+ +....++.++++||.+-.. + .+. ..+.+. .+++.+|.+.-+++..+.+++.- . T Consensus 20 ~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~---~--~~~--~~~~~~--r~~~~~~k~~~~~~~i~~~l~~~~Fp~~~ 90 (584) T protein:vir:95 20 WVAYLWDRFNNQRRQKIEEWKELRNYVFATDT---T--TTS--NQGLPW--KNSTTLPKLCQIRDNLHSNYFSSLFPNDD 90 (584) T ss_pred HHHHHHHHHHhhhchhhccCHHHHHHHHhhhh---h--hhh--hccccc--ccccchhHHHHHHHHHHHHHHHhhcCccc Confidence 122222222 2333455778899887431 1 111 111122 34788899888888888776432 1 Q ss_pred -eee---ccCch--HHHHHHHHHhcc-----CHHHHHHHHHHHHHhcCeEEEEEEECCC-------------CcEEEEEE Q lcl|NC_021326. 72 -IAF---KHTDD--EVIKRIDEVLGN-----RFDDKLHSVLTGASNKGIEWLHPYLDEE-------------GEFKLFRV 127 (445) Q Consensus 72 -~~~---~~~d~--~~~~~l~~~~~n-----~~~~~~~~~~~~~~~~G~~~~~v~~d~~-------------g~~~i~~~ 127 (445) +.+ ..++. ...+.++.+..| ++.....++.+++.++|.|+..+.+... .++++.-+ T Consensus 91 w~~~v~~~~~~~~~~~~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~~~~~e~~e~~~v~~~~~prieri 170 (584) T protein:vir:95 91 WLRWVGYGKGDSTKTKAKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFEAKYKEMTDGTLVPDYIGPRLVRI 170 (584) T ss_pred eeeeecCCCchhhHHHHHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEeecceeeeccccccccccceEEee Confidence 222 22222 335667766543 6777888999999999999999877432 26899999 Q ss_pred ccceeEEEEcCCC--CCceEEEEEEEeeecc------------------------------------ee----------- Q lcl|NC_021326. 128 PAEQGIPIWTDKE--HEELEAFIRMYKLENE------------------------------------TK----------- 158 (445) Q Consensus 128 ~p~~~~~v~d~~~--~~~~~~~v~~~~~~~~------------------------------------~~----------- 158 (445) +|..+| ||++. ....-+.+|.+.+... .+ T Consensus 171 SP~d~~--~Dpsa~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 248 (584) T protein:vir:95 171 SPLDIV--FNPLATSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHLGGYSVEDFDKAAGFDVDGFGN 248 (584) T ss_pred Chhhee--ecCCCCCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCCCCCcccccccccccccccccc Confidence 999987 55443 2222222222211000 00 Q ss_pred -EEEEecceEEEEEEecceee---------eccccccccccc--ccccccccccceEEecC-----CCCcCccHHHHHHH Q lcl|NC_021326. 159 -VEYWDKITVNYYVYENGSLI---------PDYSNNLENSKT--HFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTL 221 (445) Q Consensus 159 -~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~--~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~l 221 (445) .+++...++..+.+.+.... .......+.... ...+.+++++|++..+. .-+|.|+...+.++ T Consensus 249 ~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g~~iIR~~~np~~~~~~PF~~~~~~p~~~s~yG~gi~~ll~d~ 328 (584) T protein:vir:95 249 LYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVVDRSTEVRNESIPTWFGSAPIYHVGWRFRPDNLWAMGPLDNLVGM 328 (584) T ss_pred cccccCCceeEEEeecccccccccCCCcccceEEEEeccEEEEeeecCCCCCCCCEEEEcceeeeccccCCCchhhhhhH Confidence 01111111221111000000 000000011111 23446679999976643 34799999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeeccCCCceeeEeccC-ChHHHHHHHHHHHHHHHH Q lcl|NC_021326. 222 IDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEV-PVENSKKYLDELYQKIML 300 (445) Q Consensus 222 id~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~i~~l~~~i~~ 300 (445) ++.+|.+.-.+.+.+..+..|.+...+...+. ..+.+..+.....+++.++.++. +..+..+.+..+...+.+ T Consensus 329 Q~~lna~~r~~iDnl~l~~~pv~k~~~~~~~~------~~~pg~~~~~~~~~~~q~~~p~a~~~~s~~~~lq~~e~~me~ 402 (584) T protein:vir:95 329 QYRIDHLENAKADAVDLIIQPPLKIIGEVEEF------VWGPGAEIHLDQGGDVQEIAKNVNYIINADNQIQMLEDRMEL 402 (584) T ss_pred HHHHhHHHHHHHHHHHHhcCcceeeccccchh------cccCCceeecCCCCCcceecCchhhhhHHHHHHHHHHHHHHh Confidence 99999999999999999999976655443221 13456677888888889988774 445555668888888899 Q ss_pred HhCccccccccc-cCcchHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCCCcc----------------- Q lcl|NC_021326. 301 FGQAVDFSSDKF-GSAPSGVALEFLYTNLNLKADKLARKAKVAI-QELLWFVFEHFDIKGEHK----------------- 361 (445) Q Consensus 301 ~s~~p~~~~~~~-~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l-~~~~~~~~~~~~~~~~~~----------------- 361 (445) .|++|..+.+.. .++.++..+...+..+..-...+...|...+ ++++.++.++....-+.. T Consensus 403 ~sGvp~~~~G~~~~~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd~~~~vr~~n~e~~~~~f~~ 482 (584) T protein:vir:95 403 YAGAPREAMGIRTPGEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLETATRNMDGSDVIRVMDTDLGVKEFMS 482 (584) T ss_pred hhCCChhhcccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCceeeeccccccccccc Confidence 999998766532 3456666777778888877888888888766 888888877643221111 Q ss_pred ----eEEEEe--CCCCCCCHHHHHHHHHHHh------------ccCChHHH---H---HhCCC----CCC----HHHHHH Q lcl|NC_021326. 362 ----DVDISF--NYNKVANTELQVQTAQQSM------------GIVSHETV---L---ENHPF----VED----LQAELE 409 (445) Q Consensus 362 ----~i~v~f--~~~~p~d~~~~~~~~~~~~------------g~~s~et~---l---~~l~~----~~d----~~~E~~ 409 (445) ++.-.| ..--..-..+.++..+.+. +.++.... + ..+|. ..+ .+.|.+ T Consensus 483 i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~~~~~~i~p~~~~~~l~~~ladl~~~p~~~~~~~~~~~~~Q~~~q 562 (584) T protein:vir:95 483 VTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNSQIGQMILPHTSGKALATFVDDVTGLQGYEIFRPNVAVAEQAETQ 562 (584) T ss_pred cChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHhhhhhhccccchHHHHHHHHHHHhCCCcccccCCCcccchhHHHH Confidence 111111 1000111122222222211 11222221 1 12231 111 122233 Q ss_pred HHHHHHHHHHHhhhccccCCCC Q lcl|NC_021326. 410 RIEQEQMEYNKQLPNLDDGGAD 431 (445) Q Consensus 410 ri~~E~~~~~~~~~~~~~~~~~ 431 (445) ....+.++.-.......-.++- T Consensus 563 ~~~~~~q~~~~~~~~~~~~~~~ 584 (584) T protein:vir:95 563 SLVAQAQEDLQLQAQMPAEGAI 584 (584) T ss_pred hhhHHHHHHHHHHHhhhhccCC Confidence 2222222221222222211221 No 102 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.48 E-value=6.6e-12 Score=81.88 Aligned_cols=414 Identities=10% Similarity=-0.006 Sum_probs=226.1 Q ss_pred ChHHHHHHHH----HHHHHHHHHHHHhcCCCcccccccc---cc--c------cccccccccccc-cccchHHHHHHHHH Q lcl|NC_021326. 1 MIVRYIKQHL----EKLPEISIGQEYYEQRPDIVKEPKP---VD--A------TGAVDPLKPDDR-MITNFHANLVDQKV 64 (445) Q Consensus 1 ~l~~~i~~~~----~~~~~~~~~~~yy~G~~~i~~~~~~---~~--~------~~~~~~~~~~~r-i~~n~~~~iv~~~~ 64 (445) +|.++|.=+. -+........+-|+|-.. .+... .. . .......|+..- ...+|++..|+..+ T Consensus 3 ~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~--~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 80 (502) T protein:vir:79 3 ILDDVIGVFSPGWKAARLRSRAVIQAYEAVKT--TRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLE 80 (502) T ss_pred hHhhHHhhcChHHHHHHHhhHHHHhhccccCc--ccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 8888776543 122222223344666431 11000 00 0 000001111110 13579999999999 Q ss_pred hhhhcc-Ceeecc----Cc----hHHHHHHHHHh----cc-------CHHHHHHHHHHHHHhcCeEEEEEEECCCCc--- Q lcl|NC_021326. 65 SYIVGK-PIAFKH----TD----DEVIKRIDEVL----GN-------RFDDKLHSVLTGASNKGIEWLHPYLDEEGE--- 121 (445) Q Consensus 65 ~~l~g~-~~~~~~----~d----~~~~~~l~~~~----~n-------~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~--- 121 (445) +.++|. ++.+.+ .+ ++.++.++..| ++ +|......+.+..+..|.+|+.+..++++. T Consensus 81 ~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~~ 160 (502) T protein:vir:79 81 ERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLTP 160 (502) T ss_pred HhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCccCC Confidence 999996 555432 22 33444444333 22 455566667788899999999987765432 Q ss_pred -----EEEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeeccccccccccccccccc Q lcl|NC_021326. 122 -----FKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGS 196 (445) Q Consensus 122 -----~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 196 (445) +++..++|+.+- ...+ ....+..+|.+ +..+. .+.|+........ ..... T Consensus 161 g~~~~l~lq~iepd~l~-~~~~-~~~~i~~GVe~---d~~Gr-------~~aY~i~~~hPgd-------------~~~~~ 215 (502) T protein:vir:79 161 SAGVHFWLEALEPDFIP-MTSD-ESNRLNQGVFV---DDWGR-------PEKYLVYKSRPVS-------------GRQME 215 (502) T ss_pred CcccceEEEEecchhcC-CCCC-CCCeeEeeeEE---CCCCc-------eEEEEEeecCCCC-------------Ccccc Confidence 589999998863 2222 23445555543 11111 1222222111100 00112 Q ss_pred ccccc---eEEecC-----CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcc---------cchhHHH Q lcl|NC_021326. 197 WGKIP---FIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQ---------ELPEFKR 259 (445) Q Consensus 197 ~g~iP---vv~~~n-----~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~---------~~~~~~~ 259 (445) +.+|| |+|+.. ...|.|.|.+++..+..++.............+....+++....+ ....... T Consensus 216 ~~rvpA~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~ 295 (502) T protein:vir:79 216 TKEVDAERMLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDGNGSKENEREL 295 (502) T ss_pred eeEechhheEEeecccCCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcccccccCCCCCccccc Confidence 23555 555543 346899999999999888877766655555555544555432211 0111111 Q ss_pred hhhhCceee-ccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccc-cccCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 260 LLRYYGAIK-VSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSD-KFGSAPSGVALEFLYTNLNLKADKLAR 337 (445) Q Consensus 260 ~~~~~~~~~-~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~-~~~~~~Sg~Ai~~~~~~l~~k~~~~~~ 337 (445) .+..+.++. ++.|.++++.+.+.+...+..++..+.+.|....++|--.+. .++ . |-.+++..+..........+. T Consensus 296 ~l~pG~i~~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s-~-nySs~R~~~~e~~r~~~~~q~ 373 (502) T protein:vir:79 296 TIQPGIIYDDLKPGEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYN-G-TYSAQRQELVESTDGYLILQD 373 (502) T ss_pred cccCCccccccCCCceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc-c-hHHHHHHHHHHHHHHHHHHHH Confidence 233333454 678889999988888889999999999999999998842222 232 2 555677777777777766666 Q ss_pred HHHHH-HHHHHHHHHH--Hh-cc---CC---CcceEEEEeCCCC--CCCHHHHHHHHHHH--hccCChHHHHHhCCCCCC Q lcl|NC_021326. 338 KAKVA-IQELLWFVFE--HF-DI---KG---EHKDVDISFNYNK--VANTELQVQTAQQS--MGIVSHETVLENHPFVED 403 (445) Q Consensus 338 ~~~~~-l~~~~~~~~~--~~-~~---~~---~~~~i~v~f~~~~--p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d 403 (445) .|... ++.+++..++ ++ |. +. ...-..+.|..+- -.|....+++...+ +|+.|.+..+...+. | T Consensus 374 ~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a~~G~--D 451 (502) T protein:vir:79 374 WFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRAGGR--N 451 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCC--C Confidence 55543 3333332222 11 21 11 1122356774332 25777777766554 699999999998865 8 Q ss_pred HHHHHHHHHHHHHHHHHhhhccc---------cCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 404 LQAELERIEQEQMEYNKQLPNLD---------DGGADGAQQKERSNDKQSE 445 (445) Q Consensus 404 ~~~E~~ri~~E~~~~~~~~~~~~---------~~~~~~~~~~~~~~d~~~~ 445 (445) +++.++++++|++...+.--... .......+++++.+++.+| T Consensus 452 ~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 452 PDDVKRRRKAEIDENRKLDLVFDTDPASDKGGSSAATKRQEPQHTDDQSEE 502 (502) T ss_pred HHHHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 99999999999876544321111 0111111223333333333 No 103 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.45 E-value=5e-13 Score=88.01 Aligned_cols=395 Identities=11% Similarity=0.028 Sum_probs=199.2 Q ss_pred ChHHHH-------HHH-HHHHHHH---------HHH-HHHhcCCCccccccccccccccccccccccccccchHHHHHHH Q lcl|NC_021326. 1 MIVRYI-------KQH-LEKLPEI---------SIG-QEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQ 62 (445) Q Consensus 1 ~l~~~i-------~~~-~~~~~~~---------~~~-~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~ 62 (445) ++...- ... ......+ ..+ ..||.... .......-+ -.+.+++.+|+. T Consensus 57 ~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~l~a~Y-~~~~l~r~iVd~ 122 (537) T protein:vir:10 57 MMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAF-------------IGHQMCALI-ATHWLVNKACSQ 122 (537) T ss_pred ccccccccccchhccccccchhhhhhhccccccchhhhhccccCC-------------ccHHHHHHH-HhCchhhhhhhh Confidence 111100 000 0000000 001 11222111 000000001 135788999999 Q ss_pred HHhhhhccCeeeccCch-----HHHHHHHHHh-ccCHHHHHHHHHHHHHhcCeEEEEEEECC-CCc-------------- Q lcl|NC_021326. 63 KVSYIVGKPIAFKHTDD-----EVIKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDE-EGE-------------- 121 (445) Q Consensus 63 ~~~~l~g~~~~~~~~d~-----~~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~-~g~-------------- 121 (445) .+.-++.+++.+++++. +..+.|+..+ +-+....+.++.+.+..||.+++++..+. ++. T Consensus 123 ~A~d~~r~~~~i~~~~~~~~~~~~~~~l~~~~~~l~~~~~l~~a~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i~kg 202 (537) T protein:vir:10 123 MPRDAMRKGYKIISDDGNELDPKDAKFIDRYDRAFNIKKHAIQFVRKGRIFGIRIALFKVDSPDPYYYEKPFNIDGVMPG 202 (537) T ss_pred hhHHhhcCCceeecCCcccccHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEeecCcCCccccccccccccccc Confidence 99999999999988543 2334444444 33677888889999999999988876532 221 Q ss_pred -E-EEEEEccceeEEEEcC---CCCCceEEE-EEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccc Q lcl|NC_021326. 122 -F-KLFRVPAEQGIPIWTD---KEHEELEAF-IRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTG 195 (445) Q Consensus 122 -~-~i~~~~p~~~~~v~d~---~~~~~~~~~-v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 195 (445) + .+.+++|..+.|.... .+...+.++ -.+|.+... .+.+.++..+ T Consensus 203 ~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P~~y~v~g~----~iH~SRli~f------------------------- 253 (537) T protein:vir:10 203 AYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEPTYWLINGK----KYHRSHLAIY------------------------- 253 (537) T ss_pred ceeEEEEechhhcccccchhhhccCCccccCCceeeeecCe----EecceeEEEe------------------------- Confidence 1 2556666665543211 011111110 011111000 0000111000 Q ss_pred ccc-ccceEEe-cCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchh-HHHhh-------hhCc Q lcl|NC_021326. 196 SWG-KIPFIPF-KNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPE-FKRLL-------RYYG 265 (445) Q Consensus 196 ~~g-~iPvv~~-~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~-~~~~~-------~~~~ 265 (445) .| .+|-..- .++-+|.|.++.+.+-+..++.+.-..+..+.....+++.+.+...-..++ ....+ ...+ T Consensus 254 -~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~~~~~~~r~~~~~~~r~n~g 332 (537) T protein:vir:10 254 -INDEVVDFLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLANKQQFDETMSWWTATRDNYQ 332 (537) T ss_pred -cCCCCchhhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcCHHHHHHHHHHHHhhcCCcc Confidence 00 0111111 123469999999999999999998888888888888887766543211111 11111 2234 Q ss_pred eeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCcccccc-c-cc-cCcchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 266 AIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSS-D-KF-GSAPSGVALEFLYTNLNLKADKLARKAKVA 342 (445) Q Consensus 266 ~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~-~-~~-~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 342 (445) ++.++.++ .+|...+.+...+...++...+.|...+++|-.-+ + .. |-+.||..-...+...+ +.++..+... T Consensus 333 ~~~id~e~-e~~e~~~~~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe~D~~~yyd~I---~~~Qe~l~p~ 408 (537) T protein:vir:10 333 VRVVDKDN-EDVVQIDTTLNDLDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTGDYEEASYHEEC---ESTQDDMRPL 408 (537) T ss_pred eeEecCCC-ceeEEEeccCCCHHHHHHHHHHHHHhhhCCCceeeccCCccccccchhHHHHHHHHHH---HHHHHHHHHH Confidence 55555543 24555556777788899999999999999996522 2 22 23566776555544444 4444457999 Q ss_pred HHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHH-------HH--hccCChHHHHHhCCCCCCH-HHHH-HHH Q lcl|NC_021326. 343 IQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQ-------QS--MGIVSHETVLENHPFVEDL-QAEL-ERI 411 (445) Q Consensus 343 l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~-------~~--~g~~s~et~l~~l~~~~d~-~~E~-~ri 411 (445) +++++.+++...... ..+++++|++-...|..+.|++.. ++ .|++|.+++++.|..-.+. ...+ ..+ T Consensus 409 l~~l~~ll~~~~~~~--~~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l~~~~ 486 (537) T protein:vir:10 409 IDRHHQLVCRSHLRK--RIRVKVEFPPMDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGFTSITPAM 486 (537) T ss_pred HHHHHHHHHHhcCCC--CcceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCccccccccCCC Confidence 999999988655433 357899999999999999887533 22 3799998888776321100 0000 000 Q ss_pred HHHHHHHHH--h--hh-ccccCCCCC--CCCCCCCCCcCCC Q lcl|NC_021326. 412 EQEQMEYNK--Q--LP-NLDDGGADG--AQQKERSNDKQSE 445 (445) Q Consensus 412 ~~E~~~~~~--~--~~-~~~~~~~~~--~~~~~~~~d~~~~ 445 (445) ..|..+... . .+ ...+..+++ ..+...++++..+ T Consensus 487 ~~ed~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (537) T protein:vir:10 487 RPTDAEDIDVDDEGKPVRIIEDQPAPSEMFGATSSGESAND 527 (537) T ss_pred ChhhhhcccCCccCCcCCCCCCCCCccccCCCCccccccCC Confidence 111111000 0 00 000000000 0011111111111 No 104 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.44 E-value=1.5e-11 Score=79.94 Aligned_cols=413 Identities=9% Similarity=-0.026 Sum_probs=210.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCc-----cccccccc----cccccccccccccc-cccchHHHHHHHHHhhhhcc Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPD-----IVKEPKPV----DATGAVDPLKPDDR-MITNFHANLVDQKVSYIVGK 70 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~-----i~~~~~~~----~~~~~~~~~~~~~r-i~~n~~~~iv~~~~~~l~g~ 70 (445) =++++..+... .. |+|-.. -...+... .........|+..= -.++|++-+|+..++.++|. T Consensus 16 a~~R~~ar~~~--~~-------y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG~ 86 (548) T protein:vir:95 16 VARRLAAREAI--QA-------YEAARPGRTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLEERVVGG 86 (548) T ss_pred HHHHHHhHHHh--cc-------ccccCccccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhccCc Confidence 22222222211 11 333210 00000000 00000011111100 13579999999999999983 Q ss_pred -Ceeecc----Cch----HHHH----HHHHHhc-------cCHHHHHHHHHHHHHhcCeEEEEEEECCCC--------cE Q lcl|NC_021326. 71 -PIAFKH----TDD----EVIK----RIDEVLG-------NRFDDKLHSVLTGASNKGIEWLHPYLDEEG--------EF 122 (445) Q Consensus 71 -~~~~~~----~d~----~~~~----~l~~~~~-------n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g--------~~ 122 (445) ++.+.+ .++ +.++ .|+.|-. .+|......+++..++.|.+|+....++.+ -+ T Consensus 87 ~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~~g~~~~~ 166 (548) T protein:vir:95 87 SGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNYTFATSVPF 166 (548) T ss_pred cccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeecccccccCCcccce Confidence 444432 222 2333 3444432 246666666788889999999988765432 15 Q ss_pred EEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc- Q lcl|NC_021326. 123 KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP- 201 (445) Q Consensus 123 ~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP- 201 (445) ++..++|+.+-.-++ .....+..+|.+ +..+. .+.|+............ .....+-+|| T Consensus 167 ~lqliepd~l~~~~~-~~~~~i~~GIE~---D~~Gr-------p~aY~i~~~hPgd~~~~---------~~~~~~~rvpA 226 (548) T protein:vir:95 167 ALELLEPDYLPFSYN-NLSKGIVQGIER---DTWRR-------KRAYHLLKDHPGNLQTL---------GGSLAVKRVEA 226 (548) T ss_pred EEEEechhhcCCCCC-CCCCceeeeeEE---CCCCc-------eEEEEEeecCCCccccc---------ccccceeeech Confidence 899999988632222 223445555543 11111 12222222111000000 0001122333 Q ss_pred --eEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCccc--------chhHHHhhhhCce Q lcl|NC_021326. 202 --FIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQE--------LPEFKRLLRYYGA 266 (445) Q Consensus 202 --vv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~--------~~~~~~~~~~~~~ 266 (445) |+|+. ....|.|.|.+++..+..++...........-.+.-..+++....+. .......+..+.+ T Consensus 227 ~~VlHif~~~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~pG~i 306 (548) T protein:vir:95 227 ERIIHIAYRKRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDSYTVEPGKDRKNRTIPIAPGMV 306 (548) T ss_pred hHheecccccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccCCCCcccccccccccCCcc Confidence 33332 23468999999999888888777666665665555455555321110 0011112233333 Q ss_pred ee-ccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_021326. 267 IK-VSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAI-Q 344 (445) Q Consensus 267 ~~-~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l-~ 344 (445) +. +..+.++++.+.+.+...+..+...+.+.|..-.++|.-...+..+ .|-.+.+..+..........+..|...+ + T Consensus 307 v~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s-~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~ 385 (548) T protein:vir:95 307 FDDLEPGEDVGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYD-GTYSAQRQELVEGWLGYDLLQHEFIDYWCR 385 (548) T ss_pred ccccCCCceeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 43 6778889999888888899999999999999988888422222112 2556677766666666666655554333 2 Q ss_pred HHHHHHHHH--h-cc---CC---CcceEEEEeCC-CCC-CCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHH Q lcl|NC_021326. 345 ELLWFVFEH--F-DI---KG---EHKDVDISFNY-NKV-ANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERI 411 (445) Q Consensus 345 ~~~~~~~~~--~-~~---~~---~~~~i~v~f~~-~~p-~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri 411 (445) .+.+..++. + |. +. ...-+.+.|.. ..+ .|....+++...+ +|+.|.+.++...+. |+++.++++ T Consensus 386 Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~~G~--D~~ev~~q~ 463 (548) T protein:vir:95 386 PVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEAEVARARGR--DPRELKKSR 463 (548) T ss_pred HHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCC--CHHHHHHHH Confidence 233332222 1 11 11 11235677843 333 6777777776654 599999999998865 899999999 Q ss_pred HHHHHHHHHhhhcccc--------CCCCCCCCCCC---------CCCcCCC Q lcl|NC_021326. 412 EQEQMEYNKQLPNLDD--------GGADGAQQKER---------SNDKQSE 445 (445) Q Consensus 412 ~~E~~~~~~~~~~~~~--------~~~~~~~~~~~---------~~d~~~~ 445 (445) .+|.+...+.--.+.. ++.++..++++ ..|+..| T Consensus 464 a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (548) T protein:vir:95 464 ETEIKANRAAGLVFSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEARE 514 (548) T ss_pred HHHHHHHHHcCCCCCCcccccccccccCCCCchhhhccccccccccchhHH Confidence 9998765443211111 11111111110 1111111 No 105 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.44 E-value=2.9e-13 Score=89.33 Aligned_cols=385 Identities=11% Similarity=0.060 Sum_probs=190.7 Q ss_pred hHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc-ccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 2 IVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKP-VDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 2 l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~-~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) ..=+..+.+.+...-+-+...+.+.-.. .++.. ............-+ -.+.+++.+|+..+.-++.+++.++++++. T Consensus 1 ~~~~m~~~~~~~~~~D~~~~~~~~~~g~-~~~~~~~~~~~~~~~l~~~Y-~~~~l~~~~Vd~~aed~~r~g~~i~g~~~~ 78 (435) T protein:vir:79 1 MGVFMSDKVKAITKEDGYNEIFGSKDGT-FRPNAFYMQRAAFKALSQFY-EEDGMARRIVDVIPEEMVTPGFKVDGVKNE 78 (435) T ss_pred CCcccccccccchhhcchhhhhcccccc-cccCcccCCcCCHHHHHHHH-hcCchhhhhhccchHHhhcCCceecCCChH Confidence 1111111122222222232333332110 00000 00000000000000 135788999999999999999999876532 Q ss_pred HHHHHHHHhcc-CHHHHHHHHHHHHHhcCeEEEEEEECC----------CCcE-EEEEEccceeEEEEcCCCCCceEEE- Q lcl|NC_021326. 81 VIKRIDEVLGN-RFDDKLHSVLTGASNKGIEWLHPYLDE----------EGEF-KLFRVPAEQGIPIWTDKEHEELEAF- 147 (445) Q Consensus 81 ~~~~l~~~~~n-~~~~~~~~~~~~~~~~G~~~~~v~~d~----------~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~- 147 (445) +.++..++. +....+.++.+.+..||.+++++-... +|.+ .+.+++|.++.|-.-+.+...+.++ T Consensus 79 --~~~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~~~i~~~~~~~dp~sp~fg~ 156 (435) T protein:vir:79 79 --KSFKSRWDELRLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDRYQITIHERETNARSVRYGE 156 (435) T ss_pred --HHHHHHHHHhhHHHHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeechhhccchhhccCCcccccCc Confidence 344444433 677888999999999999988876522 1222 3555666555432111111111110 Q ss_pred EEEEeeecceeEEEEecceEEEEEEecceeeeccccccccccccccccccc-------ccceE-EecCCCCcCccH-HHH Q lcl|NC_021326. 148 IRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWG-------KIPFI-PFKNNDLEISDI-FMY 218 (445) Q Consensus 148 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-------~iPvv-~~~n~~~g~s~~-~~v 218 (445) ..+|.+.... .......|.-- .+|-. ...++.+|.|.+ +.+ T Consensus 157 P~~y~v~~~~------------------------------~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~e~~ 206 (435) T protein:vir:79 157 PKLYKISPGG------------------------------DIPEFFVHYSRICIIDGERVSNEKRRQNDGWGASILNKRL 206 (435) T ss_pred ceEEEEecCC------------------------------CCCceEEcceeEEEecCCcchhhhccccCcccchHHHHHH Confidence 0111111000 00000111111 11211 112455688876 678 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCeeEEecCC-----cccchhHH-H------hhhhCceeecc-CCCceeeEeccCChH Q lcl|NC_021326. 219 KTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD-----DQELPEFK-R------LLRYYGAIKVS-DNGGVDTIQVEVPVE 285 (445) Q Consensus 219 ~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~-----~~~~~~~~-~------~~~~~~~~~~~-~~~~~~~l~~~~~~~ 285 (445) .+-+..++.+.......+.....+.+.+.|.. ........ + .....+.+.+. ++.+ |-+.+.+.. T Consensus 207 ~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~i~~~~e~--~e~~~~~ls 284 (435) T protein:vir:79 207 IEAIVDYNYCQELATQLLRRKQQAVWKARDLALMCDDEEGRYAARLRLAQVDDESGVGKAIGIDATDEE--YEVLNSDVS 284 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCccccchhHHHhhcCccchHHHHHHHHHHHHhcCCCCceeEecCCcc--eEEEecccC Confidence 88889999988888888877777777665531 11111111 1 01123334443 3333 444456677 Q ss_pred HHHHHHHHHHHHHHHHhCccccc--cccccC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcce Q lcl|NC_021326. 286 NSKKYLDELYQKIMLFGQAVDFS--SDKFGS-APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKD 362 (445) Q Consensus 286 ~~~~~i~~l~~~i~~~s~~p~~~--~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 362 (445) .+...++.....|...+++|-.- +...+| +.||..-...+...+... .+..+.+.+++++.+++.- .+ T Consensus 285 gl~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~d~~~yyd~i~~~--Qe~~l~p~l~~l~~li~~s-------~d 355 (435) T protein:vir:79 285 GVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNTALETFYKLIDRK--RVEDYKPILEFLLPFMISE-------TE 355 (435) T ss_pred CHHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHHH--HHHHHHHHHHHHHHHhhcC-------CC Confidence 88999999999999999999632 223333 466766555555554433 2466788888888886531 46 Q ss_pred EEEEeCCCCCCCHHHHHHHHHHHh---------ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCC Q lcl|NC_021326. 363 VDISFNYNKVANTELQVQTAQQSM---------GIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGA 433 (445) Q Consensus 363 i~v~f~~~~p~d~~~~~~~~~~~~---------g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 433 (445) +.++|+|-...++.+.|++..+.+ |+++.+++.+.| ...- ............-++.. T Consensus 356 ~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g~i~~~e~r~~L-------------~~~~-~~~~~~~~~~~~~~~~~ 421 (435) T protein:vir:79 356 WSIEFEPLSVPSDKDKAEIMAKNVESVVKLKAEQAINLKETRDTL-------------RSIC-PDLKIMDNDNIELPEPE 421 (435) T ss_pred CeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHH-------------HHhc-cccCCCCcccccCCccc Confidence 789999999999999888655432 555555544333 1000 00000000111111111 Q ss_pred CCCCCCCCcCCC Q lcl|NC_021326. 434 QQKERSNDKQSE 445 (445) Q Consensus 434 ~~~~~~~d~~~~ 445 (445) ..+.+.+.+.+| T Consensus 422 d~~~~~~~e~g~ 433 (435) T protein:vir:79 422 DLDPEPGQEGGL 433 (435) T ss_pred cCCCCCCCCCCC Confidence 111222222222 No 106 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.43 E-value=1.8e-11 Score=79.51 Aligned_cols=417 Identities=11% Similarity=0.003 Sum_probs=208.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCC------cccccc-cc-ccccccccccccccc-cccchHHHHHHHHHhhhhccC Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRP------DIVKEP-KP-VDATGAVDPLKPDDR-MITNFHANLVDQKVSYIVGKP 71 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~------~i~~~~-~~-~~~~~~~~~~~~~~r-i~~n~~~~iv~~~~~~l~g~~ 71 (445) ++. .-.....+.. . .-|+|-. ...... +. ..........|+..- ..++|++-.|+..++.++|.+ T Consensus 8 ~~a-~~~~~~~~~~--~---~~y~aa~~~~~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~~vVG~G 81 (495) T protein:vir:10 8 YQS-LASGLLVPVG--A---SAYEGASGGHRWQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVAAAVGNG 81 (495) T ss_pred ccc-cchhhhhHHH--h---hhhhccccCcccCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCCC Confidence 110 0000000000 0 0122311 111000 00 000001111111110 135799999999999999998 Q ss_pred eeecc--CchHHHHHHHHHh----cc-------CHHHHHHHHHHHHHhcCeEEEEEEECC--CC---cEEEEEEccceeE Q lcl|NC_021326. 72 IAFKH--TDDEVIKRIDEVL----GN-------RFDDKLHSVLTGASNKGIEWLHPYLDE--EG---EFKLFRVPAEQGI 133 (445) Q Consensus 72 ~~~~~--~d~~~~~~l~~~~----~n-------~~~~~~~~~~~~~~~~G~~~~~v~~d~--~g---~~~i~~~~p~~~~ 133 (445) ++..+ ++++.++.++..| .+ +|......+++..++.|.+|+.+...+ +| .+++..++|+.+- T Consensus 82 i~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~lqliepd~l~ 161 (495) T protein:vir:10 82 LTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQLQIIEPDMLA 161 (495) T ss_pred cccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccceEEEEechhhcC Confidence 87764 5666665555444 22 455566667888899999998776543 33 3689999999973 Q ss_pred -EEEc--CCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc---eEEe-- Q lcl|NC_021326. 134 -PIWT--DKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIPF-- 205 (445) Q Consensus 134 -~v~d--~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~-- 205 (445) |.-. ......+..+|.+ +..+. .+.|+........... ......+.+|| |+|+ T Consensus 162 ~~~~~~~~~~g~~i~~GIe~---d~~Gr-------~vaY~i~~~hpgd~~~---------~~~~~~~~rvpA~~vlH~f~ 222 (495) T protein:vir:10 162 SDIPDETLPSGGYVKGGIRF---SNGGK-------RKAYCFYRNHPAESSL---------IGDPVDTVWIKAEHVLHVTV 222 (495) T ss_pred CCCCCCCCCCCCEEEeceEE---CCCCc-------eEEEEEeecCCCcccc---------cccccceeeechhheEeccc Confidence 3211 1222345666654 11111 1122221111100000 00001122333 2333 Q ss_pred --cCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCccc-------------chhHHHhhhhCceeecc Q lcl|NC_021326. 206 --KNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQE-------------LPEFKRLLRYYGAIKVS 270 (445) Q Consensus 206 --~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~-------------~~~~~~~~~~~~~~~~~ 270 (445) +....|.|.+.+++.| ..++.............+.-..+++....+. .......+....+..+. T Consensus 223 ~r~gQ~RGis~la~i~~l-~~l~~y~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~ 301 (495) T protein:vir:10 223 LTVRSDAGAPWFQLLLRL-NELDQYEDAELVRKKTAALFAAFIQEATADSTGGPTIGQPKRSKGGKRITGLNPGTLQYLQ 301 (495) T ss_pred cCCCcccCcchhHHHHHH-HHhhHHHHHHHHHHHHhhhheeeeecCCCccccccccCccccccCcccceecCCceeeecC Confidence 2334689988877764 4555444443444444444344444221110 00011223444556678 Q ss_pred CCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHH-HHHH-HHHHHHH Q lcl|NC_021326. 271 DNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLAR-KAKV-AIQELLW 348 (445) Q Consensus 271 ~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~-~~~~-~l~~~~~ 348 (445) .+.++++.+.+.+...+..+...+.+.|..-.++|--...+.-++.|-.+++..+..........+. .+.. .++.+++ T Consensus 302 pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~ 381 (495) T protein:vir:10 302 PGQEVKFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGR 381 (495) T ss_pred CCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 8889999988888889999999999999988888743222222234455566666666666655443 3333 3333433 Q ss_pred HHHHH---hcc-C-CCc-----ceEEEEeCCCC--CCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHH Q lcl|NC_021326. 349 FVFEH---FDI-K-GEH-----KDVDISFNYNK--VANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQE 414 (445) Q Consensus 349 ~~~~~---~~~-~-~~~-----~~i~v~f~~~~--p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E 414 (445) ..++. .|. . .++ .-..+.|..+- -.|....+++.... +|+.|.+.++...+. |+++.++++++| T Consensus 382 ~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~G~--D~~~v~~q~a~e 459 (495) T protein:vir:10 382 WFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAERGY--DMEELFDMISDA 459 (495) T ss_pred HHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCC--CHHHHHHHHHHH Confidence 33221 121 1 111 11356674332 35777777776654 699999999998865 899999999998 Q ss_pred HHHHHHhhhccc-c----CCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 415 QMEYNKQLPNLD-D----GGADGAQQKERSNDKQSE 445 (445) Q Consensus 415 ~~~~~~~~~~~~-~----~~~~~~~~~~~~~d~~~~ 445 (445) ++...+.--.+. + .+.....++.+.+++++| T Consensus 460 ~~~~~~~Gl~~~~~p~~~~~~~~~~~~~~~~~~~~e 495 (495) T protein:vir:10 460 NQLIDEYDLRLDSDPRYVNGSGAEQKSVMEAALNNE 495 (495) T ss_pred HHHHHHcCCCCCCCCCcCCCccCCCCCCCCCCCCCC Confidence 876544311111 0 111111222233333333 No 107 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.43 E-value=2.7e-12 Score=84.02 Aligned_cols=437 Identities=11% Similarity=0.028 Sum_probs=191.1 Q ss_pred ChHHHHHHHHH-------HHHHHHHHHHHh--cCCCcccccccc-ccccccccccccccccccchHHHHHHHHHhhhhcc Q lcl|NC_021326. 1 MIVRYIKQHLE-------KLPEISIGQEYY--EQRPDIVKEPKP-VDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK 70 (445) Q Consensus 1 ~l~~~i~~~~~-------~~~~~~~~~~yy--~G~~~i~~~~~~-~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~ 70 (445) ++.++...+.. -+.+...-.+|| .|+| ...... .-........+| .+++|.++.+|+..+|+-..+ T Consensus 8 ~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~Q--W~~~~~~~l~~~~q~~grP--~~~~N~i~~~v~~v~g~~~~n 83 (708) T protein:vir:10 8 KHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQ--WEGATAAGTKLDEQFEKYP--KFEINKVATELNRIIAEYRNN 83 (708) T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCC--CCHHHHHHHHHhhhhcCCC--ceEEcchHHHHHHHHHHHHhC Confidence 44444443322 112222223455 4665 110000 000000001122 377899999999999998777 Q ss_pred Ceee--ccC----chHHHHHH----HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC---------CcEEE-EEEcc Q lcl|NC_021326. 71 PIAF--KHT----DDEVIKRI----DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE---------GEFKL-FRVPA 129 (445) Q Consensus 71 ~~~~--~~~----d~~~~~~l----~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~---------g~~~i-~~~~p 129 (445) .+.+ .+. +.+..+.+ +.+.+ ++.......+..+++++|.||+.+..|.. .++++ .+.+| T Consensus 84 r~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~d~~~~~~~i~i~~~~~p 163 (708) T protein:vir:10 84 RITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPIYDP 163 (708) T ss_pred CcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeeccccccCCCCCccccceEEeecc Confidence 6554 322 23334443 34443 67888899999999999999998865421 12333 23344 Q ss_pred c-eeEEEEcCCCC-Cc----eEEEEEEEeee-------------------------------cceeEEEEecceEEEE-- Q lcl|NC_021326. 130 E-QGIPIWTDKEH-EE----LEAFIRMYKLE-------------------------------NETKVEYWDKITVNYY-- 170 (445) Q Consensus 130 ~-~~~~v~d~~~~-~~----~~~~v~~~~~~-------------------------------~~~~~~~~~~~~~~~~-- 170 (445) . .++ ||+... .. ..++++.|... ...-.++|....+... T Consensus 164 ~~~v~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~d~v~v~ey~~r~~~~~~~~ 241 (708) T protein:vir:10 164 SRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESVDVI 241 (708) T ss_pred hhhcc--cCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCCCceEEEEeeeEEEEEEEEE Confidence 3 332 442210 00 11111111100 0001222222111110 Q ss_pred EEe---cceeeecccc--------------------------------cccccccccccccccccceEEecCC------- Q lcl|NC_021326. 171 VYE---NGSLIPDYSN--------------------------------NLENSKTHFSTGSWGKIPFIPFKNN------- 208 (445) Q Consensus 171 ~~~---~~~~~~~~~~--------------------------------~~~~~~~~~~~~~~g~iPvv~~~n~------- 208 (445) ... ++........ .+........+-+++.+|+|+|.-. T Consensus 242 ~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~~p~~~fP~vP~~g~r~~~d~~ 321 (708) T protein:vir:10 242 SYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDI 321 (708) T ss_pred EEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCCCCCCceeeEEEeeeeeccCCC Confidence 000 0100000000 0000111234456677888876421 Q ss_pred CCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccc-hhHHHhhhhCcee-e---c-cCCCc-------e Q lcl|NC_021326. 209 DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQEL-PEFKRLLRYYGAI-K---V-SDNGG-------V 275 (445) Q Consensus 209 ~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~-~~~~~~~~~~~~~-~---~-~~~~~-------~ 275 (445) +...|.+.++++.++.+|...|.+...+-.......++........ ............. . . ...|. . T Consensus 322 ~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~~~~ 401 (708) T protein:vir:10 322 ERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPA 401 (708) T ss_pred cccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccchhhhccccccccccccccccCCc Confidence 2235778899999999999999998887766655444322111110 0000000000000 0 0 11111 1 Q ss_pred eeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Q lcl|NC_021326. 276 DTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVF---- 351 (445) Q Consensus 276 ~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~---- 351 (445) ..+..+.-..++..+++.....|...|++.+...+. .+|.||+||...-..-..........+..+.+++.++++ T Consensus 402 ~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~-~sn~SG~aI~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~ 480 (708) T protein:vir:10 402 GYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAR 480 (708) T ss_pred cccCCccchHHHHHHHHHHHHHHHHHhCcChhHccC-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 122222334556777788888888999887766664 456899999887666666666666666666665555444 Q ss_pred HHhccC------C-Cc----------------------c-------eEEEEeCCCCCCCHHHHHHHHHHHhccCChH--- Q lcl|NC_021326. 352 EHFDIK------G-EH----------------------K-------DVDISFNYNKVANTELQVQTAQQSMGIVSHE--- 392 (445) Q Consensus 352 ~~~~~~------~-~~----------------------~-------~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~e--- 392 (445) .+++.. + +. . +|.|.=.|..+.-..+.++.++++.+.++.. T Consensus 481 ~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~qll~~~~p~~~~ 560 (708) T protein:vir:10 481 EVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPM 560 (708) T ss_pred HHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHHHHHHhcCCCchh Confidence 433211 0 00 0 1222223444444455566666654433321 Q ss_pred H------HHHhCCCCCCHHHHHHHHHHHHHHH-------------HHhhhccccCCCCCCC-----CCCCCCCcCCC Q lcl|NC_021326. 393 T------VLENHPFVEDLQAELERIEQEQMEY-------------NKQLPNLDDGGADGAQ-----QKERSNDKQSE 445 (445) Q Consensus 393 t------~l~~l~~~~d~~~E~~ri~~E~~~~-------------~~~~~~~~~~~~~~~~-----~~~~~~d~~~~ 445 (445) + +++.+ ..+..++-+++|++..... ....+.......+... +....+.+... T Consensus 561 ~~~~~~~~l~~~-D~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~q~~~~~~e~qa~~~~~qAe~~k 636 (708) T protein:vir:10 561 RPAIQGIILDNI-DGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQK 636 (708) T ss_pred hHHHHHHHHHhc-CCcChHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 22332 2233334444554322100 0000000000000000 00000000000 No 108 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.43 E-value=3.6e-12 Score=83.31 Aligned_cols=396 Identities=9% Similarity=0.012 Sum_probs=194.7 Q ss_pred ChHHHHHHH-HHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQH-LEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~-~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) ++.-.+ -. ..+-.+. ...||.... .. .+ ....-.+ .+.+++.+|+..+.=++-++++++++++ T Consensus 57 ~~a~~~-g~~~~~~~~~--~~~~~~~~~-~~---~~--------~l~a~Y~-~~~l~r~~Vd~~aed~~r~~~~i~~~~~ 120 (532) T protein:vir:94 57 AMAMDY-GLQTGRNGRN--ALSFVEATS-WP---GF--------PTLALLA-QLPEYRTMHETPADECVRAWGKITCSSK 120 (532) T ss_pred cccccc-ccCccccccc--ccccccccc-cc---hH--------HHHHHHH-cCchhhhhhccchHHHhhCCceEeeCCc Confidence 111000 00 0000000 001221110 00 00 0000000 2467799999999999999999987432 Q ss_pred -----HHHHHHHHHhcc-CHHHHHHHHHHHHHhcCeEEEEEEECCCCc-------------------E-EEEEEccceeE Q lcl|NC_021326. 80 -----EVIKRIDEVLGN-RFDDKLHSVLTGASNKGIEWLHPYLDEEGE-------------------F-KLFRVPAEQGI 133 (445) Q Consensus 80 -----~~~~~l~~~~~n-~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~-------------------~-~i~~~~p~~~~ 133 (445) +....++..+.. +....+.++.+.+..||.+++++-.+.+|. + .+.+++|.++. T Consensus 121 ~~~~~~~~~~i~~~~~~l~v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~~g~~~~l~vld~~~v~ 200 (532) T protein:vir:94 121 DELAADKATRITQKLEQYNVRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLLLSPSFVQRGCLIGFATIEPMWLS 200 (532) T ss_pred cccchHHHHHHHHHHHhhhHHHHHHHHHHhhhcccceEEEEEeccCCccccccccccccccccccceeeEEEeechheec Confidence 233344443332 567788889999999999988876643321 1 24555565554 Q ss_pred EE-EcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccc-cccceEEe-cCCCC Q lcl|NC_021326. 134 PI-WTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSW-GKIPFIPF-KNNDL 210 (445) Q Consensus 134 ~v-~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-g~iPvv~~-~n~~~ 210 (445) |- |+..+...+.+ ...++|.. ..+. ........|.- ..+|-... .++-+ T Consensus 201 p~~~~~~dp~sp~f----------g~P~~y~v--------~~g~----------~iH~SRli~f~g~~~p~~~~~~~~~~ 252 (532) T protein:vir:94 201 PNAYNATDPTLPSF----------YKPDSWIA--------TSGK----------KIHSSRIHTVVGRPVGDMLKAAYSFR 252 (532) T ss_pred cccccccccccccc----------CCceeEEE--------ccCe----------eeccceEEEecCCCchhhhccccccc Confidence 43 11111111111 01111100 0000 00000000100 01221111 12346 Q ss_pred cCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCc----ccchhHHHhh-------hhCceeeccCCCceeeEe Q lcl|NC_021326. 211 EISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDD----QELPEFKRLL-------RYYGAIKVSDNGGVDTIQ 279 (445) Q Consensus 211 g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~----~~~~~~~~~~-------~~~~~~~~~~~~~~~~l~ 279 (445) |.|.+..+.+-+..++.+.-..+..+..+....+.. +... +........+ ...+++.++.+. -+|-+ T Consensus 253 G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~-~~a~~ls~~~~~~~~~r~~~~~~~~~n~g~~~id~~~-e~~e~ 330 (532) T protein:vir:94 253 GVSISQLAMPYVDNWLRTRQSVSDTVKQFSMTNLAT-DMAQLLAPGGAQSLDARLQLFNLYRDNRNIGALDKGT-EEIQQ 330 (532) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeee-chHHhhcchhHHHHHHHHHHHHhhcCCccceEEcCCC-ceeEE Confidence 899999999999999998888877777777666654 3211 1111111111 123345555332 23444 Q ss_pred ccCChHHHHHHHHHHHHHHHHHhCccccc--cccccC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_021326. 280 VEVPVENSKKYLDELYQKIMLFGQAVDFS--SDKFGS-APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI 356 (445) Q Consensus 280 ~~~~~~~~~~~i~~l~~~i~~~s~~p~~~--~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~ 356 (445) .+.+.+.+...++...+.|...+++|-.- +...+| +.||..-...+...+. ...+..+...+++++++++..... T Consensus 331 ~~~~lsgl~~~l~~~~~~iAaa~~IP~t~LfG~sp~GlnstGe~D~~~yyd~I~--s~Qe~~l~p~le~l~~~l~~s~~g 408 (532) T protein:vir:94 331 TNTPLSGLDSLQAQSQEQMAAVSHIPLVKLLGITPNGLNASSDGEIRVWYDFIA--GYQATNLTPLMEWIIDLIQLSEYG 408 (532) T ss_pred EecccCCHHHHHHHHHHHHHhHhCCCeeeeecCCcccccccchHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhcC Confidence 55667778889999999999999999652 222222 4566654444444433 223356788999999888754322 Q ss_pred CCCcceEEEEeCCCCCCCHHHHHHHHHH-------H--hccCChHHHHHhCCCCC------C--HHHHHHHHHHHHHHHH Q lcl|NC_021326. 357 KGEHKDVDISFNYNKVANTELQVQTAQQ-------S--MGIVSHETVLENHPFVE------D--LQAELERIEQEQMEYN 419 (445) Q Consensus 357 ~~~~~~i~v~f~~~~p~d~~~~~~~~~~-------~--~g~~s~et~l~~l~~~~------d--~~~E~~ri~~E~~~~~ 419 (445) ..+ .++.++|++-...+..+.+++..+ + .|++|.+.+.+++..-. + ..++++....+..+.. T Consensus 409 ~~~-~d~~~~f~pL~~~s~kEkAei~~~~a~a~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 487 (532) T protein:vir:94 409 QID-PGLAWEWSPLMELDDKELAEVRQLNASTDSTLMELGVIDAKMVQQRLAADPTSGYAGALGERDELDDVEEIAKQLM 487 (532) T ss_pred CCC-CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhcCCccccccccccccccccccchhhhhc Confidence 221 368899999888888888775432 2 37899998887763211 1 1122222222222222 Q ss_pred HhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 420 KQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 420 ~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ....+..+.+..+...+.++.+++.+ T Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~d~~~ 513 (532) T protein:vir:94 488 AAALNPPATAPQTPNPQPDSEDDQTD 513 (532) T ss_pred ccccCCCCCCCCCCCCCCCCCCCCCC Confidence 22222222222222222222222222 No 109 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=99.42 E-value=1.3e-12 Score=85.73 Aligned_cols=423 Identities=11% Similarity=0.013 Sum_probs=222.8 Q ss_pred ChHHHHHHH-------HHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccC-- Q lcl|NC_021326. 1 MIVRYIKQH-------LEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKP-- 71 (445) Q Consensus 1 ~l~~~i~~~-------~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~-- 71 (445) ++.+++.++ ......+.++++|-.-- + .+.+... ..+.| |++.+|-..-+++.+..++++-- T Consensus 21 ~~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~~-~----tr~t~~~--~~~w~--~s~t~~k~~~~~~~l~a~~~~~~fp 91 (599) T protein:vir:31 21 FIDELVVLFTNMENARAQKDREDKELMDYIDAT-D----TRKTSNS--KLPFK--NSTTINKLAHLHLMITTSYMEHLLP 91 (599) T ss_pred HHHHHHHHHHhhhhhhhhhhcccHHHHHHHhhh-c----ccccccC--CCCcc--cccchHHHHHHHHHHHHHHHhhhcC Confidence 333333332 23334555666663211 0 1111111 11222 46777877788888888776532 Q ss_pred ----eeecc---C--chHHHHHHHHHhcc-----CHHHHHHHHHHHHHhcCeEEEEEEEC------CCC-------cEEE Q lcl|NC_021326. 72 ----IAFKH---T--DDEVIKRIDEVLGN-----RFDDKLHSVLTGASNKGIEWLHPYLD------EEG-------EFKL 124 (445) Q Consensus 72 ----~~~~~---~--d~~~~~~l~~~~~n-----~~~~~~~~~~~~~~~~G~~~~~v~~d------~~g-------~~~i 124 (445) +.+.+ + ..+..+.++.+.++ ++......+..+.+.+|-|+..+.+. +|| .|++ T Consensus 92 ~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat~~~er~~~~~~d~~v~~~~~~P~~ 171 (599) T protein:vir:31 92 NRNWVDFVGFDNDSVNAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAHTRHVKRMTVTAENQVIKNYSGTVT 171 (599) T ss_pred CccceEeeecCCchhHHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEeeeEEEcceeecccccccccccceE Confidence 22222 2 23455667777654 56677777888999999998877632 122 4789 Q ss_pred EEEccceeEEEEcCCCCCceEEEEEEEeeeccee---------------------------------------------- Q lcl|NC_021326. 125 FRVPAEQGIPIWTDKEHEELEAFIRMYKLENETK---------------------------------------------- 158 (445) Q Consensus 125 ~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~---------------------------------------------- 158 (445) ..++|..+++-.+-+......+.+|.+.++.+.. T Consensus 172 ervsP~Di~~Dp~A~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~~~~~~~~~~d~~~~~~g~D~~~~d 251 (599) T protein:vir:31 172 ERLSPSDVFWDVTADSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMEDFQKLREERRTIREALADGYNGRRKFDSLHKK 251 (599) T ss_pred EeecccceeeCCCCCCCCcceeeeehhhhHHHHHHHhccCCccccchHHHHHHHhhccCCCccccchhhhhhhccccccc Confidence 9999999875443333333344455544211100 Q ss_pred -----EEEEecceEEEEEEec--------cee---eecccccccccccccccccccccceEEec-----CCCCcCccHHH Q lcl|NC_021326. 159 -----VEYWDKITVNYYVYEN--------GSL---IPDYSNNLENSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFM 217 (445) Q Consensus 159 -----~~~~~~~~~~~~~~~~--------~~~---~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~ 217 (445) .++|.+..+..+.+-+ ... +.++.+..-.......|.++|..|++... ..-+|.|.+.. T Consensus 252 ~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~~g~~Pyvv~~~~P~~~~~yG~G~l~~ 331 (599) T protein:vir:31 252 GYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTWDGSQNLHIAVYEFQKDTLCPIGPLHR 331 (599) T ss_pred cccchhhhcccchhhhhhhhhhhhcccCCccccceEEEEecCcEEeecccCCCCCCCCCeEEEEeeeeccccCCCCCchh Confidence 0000000010000000 000 00000000000112234567778887543 34578999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHH Q lcl|NC_021326. 218 YKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQK 297 (445) Q Consensus 218 v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~ 297 (445) +.++++.+|.+...+.+....+..|+++..|......-+ .....++.+.+.+++.++.++++.......+..+... T Consensus 332 ~~gaQ~~lN~~~Ng~iD~~~~~l~p~l~~~~dl~~eD~~----~~P~~v~~~~d~~~vq~~~p~s~~~~a~~~is~~e~~ 407 (599) T protein:vir:31 332 LTGMQYKLDKRENFREDLHDRFLHPSLKKVGDVREKGMR----GGPNHVFEVEETGDVQYMTPPAEVLQPDNQLSITLQL 407 (599) T ss_pred cchHHHHHHHHHHHhhhhhhhhhcccccccccccccCcc----CCCCcceeecCCCccccccCchhhhhHHHHHHHHHHH Confidence 999999999999999999999999988877763322111 2245678889999999999988888888889988888 Q ss_pred HHHHhCcccccccc-ccCcchHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCCCc--------------- Q lcl|NC_021326. 298 IMLFGQAVDFSSDK-FGSAPSGVALEFLYTNLNLKADKLARKAKVAI-QELLWFVFEHFDIKGEH--------------- 360 (445) Q Consensus 298 i~~~s~~p~~~~~~-~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l-~~~~~~~~~~~~~~~~~--------------- 360 (445) +-+.|+.|..+.+. ..+..++..+..+.........++.+.|...+ +.+++.+.+......+. T Consensus 408 mee~sGvp~~~~G~~~ag~~TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~l~e~~~~f~D~~~tiri~~~e~~~~~ 487 (599) T protein:vir:31 408 MEDLSGAPKESIGQRTAGEKTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLNDYLEQGRNHLDASDTIKTFNSELGTAT 487 (599) T ss_pred HHHhhccchhhcCCcccchhhHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeeccccccee Confidence 99999999876653 33566888888888888888888888887765 44666555543321111 Q ss_pred -ceEE-------EEeCCCCCCCHHHHHHHHHHHh---------ccCC---hHHH---HH---hC--CCC--CCH-----H Q lcl|NC_021326. 361 -KDVD-------ISFNYNKVANTELQVQTAQQSM---------GIVS---HETV---LE---NH--PFV--EDL-----Q 405 (445) Q Consensus 361 -~~i~-------v~f~~~~p~d~~~~~~~~~~~~---------g~~s---~et~---l~---~l--~~~--~d~-----~ 405 (445) .+|. +.+.+--..-..+..+.++++. ++.| ++.. +. .+ ..+ ..+ + T Consensus 488 f~~i~redl~~~~~~v~~Ga~~v~ere~~~q~l~~il~~~~~q~~~P~~~~k~l~~~l~~~~~l~~~~~~~~~va~~eqq 567 (599) T protein:vir:31 488 FLDITADDLNLNGQMVAQGATLFAEKANTLQNLNAILGGPLGAALAPHMSRTKLFNAVEYLGDLDAYGIFTFGIGVQEDQ 567 (599) T ss_pred eEEeehhhhhCCeeeeechhhHHHHHHHHHHHHHHHhcccCCCccchhhHHHHHHHHHHHHHhccccccCCCchhHHHHH Confidence 1111 1111111122233444444432 2333 2221 11 11 011 111 1 Q ss_pred HHHHHHHHHHHHHH-HhhhccccCCCCCCCCCC Q lcl|NC_021326. 406 AELERIEQEQMEYN-KQLPNLDDGGADGAQQKE 437 (445) Q Consensus 406 ~E~~ri~~E~~~~~-~~~~~~~~~~~~~~~~~~ 437 (445) .+....+++.++.. .++..-.-++++. ++.+ T Consensus 568 ~~~~m~Q~~lq~~~~~~~~~~~~~~~~~-~~~~ 599 (599) T protein:vir:31 568 QLARMAQKSTQQTEETALTQEEVGGPTT-DTGQ 599 (599) T ss_pred HHHHHHHHHHHHhHhhhhhhhhcCCCCc-ccCC Confidence 22222222222111 1222212222221 1111 No 110 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=99.40 E-value=3.3e-12 Score=83.50 Aligned_cols=434 Identities=10% Similarity=0.058 Sum_probs=185.2 Q ss_pred ChHHHHHHHHHH-------HHHHHHHHHHhc--CCCcccccccccccc-ccccccccccccccchHHHHHHHHHhhhhcc Q lcl|NC_021326. 1 MIVRYIKQHLEK-------LPEISIGQEYYE--QRPDIVKEPKPVDAT-GAVDPLKPDDRMITNFHANLVDQKVSYIVGK 70 (445) Q Consensus 1 ~l~~~i~~~~~~-------~~~~~~~~~yy~--G~~~i~~~~~~~~~~-~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~ 70 (445) +|.++..++... ......-.+||. |+| .........+ ......+| .+.+|.++.+|+..+|+---+ T Consensus 8 ~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~Q--W~~~~~~~~~~~l~~~~~P--~~~~N~i~~~v~~v~g~~~~n 83 (720) T protein:vir:35 8 RHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQ--WEGATAAGSELGKHFEKYP--KFEINKISTELNRIISEYRHN 83 (720) T ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCC--CCHHHHHHHHHHHhhCCCC--eEEEccHHHHHHHHHhHHHhC Confidence 444444443321 122333456665 554 1111100000 11111223 367899999999999998665 Q ss_pred Cee--eccC----chHHHHHH----HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECC----C-----CcEEEEEE-cc Q lcl|NC_021326. 71 PIA--FKHT----DDEVIKRI----DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDE----E-----GEFKLFRV-PA 129 (445) Q Consensus 71 ~~~--~~~~----d~~~~~~l----~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~----~-----g~~~i~~~-~p 129 (445) .+. +.+. +....+.+ +.+.+ ++.....+.+..+++++|.||+-|+.|- + +.+++..+ +| T Consensus 84 r~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~~~~~~~i~i~~v~~~ 163 (720) T protein:vir:35 84 RITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALDPMDERQRICLEPIYDP 163 (720) T ss_pred CCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecccccCCCCcccceeeEecccCc Confidence 544 4332 23334443 33333 6788888999999999999999987642 1 13344332 23 Q ss_pred -ceeEEEEcCCCCC-c---e-EEEEEEEeeec------------------------------ceeEEEEecceEE--EEE Q lcl|NC_021326. 130 -EQGIPIWTDKEHE-E---L-EAFIRMYKLEN------------------------------ETKVEYWDKITVN--YYV 171 (445) Q Consensus 130 -~~~~~v~d~~~~~-~---~-~~~v~~~~~~~------------------------------~~~~~~~~~~~~~--~~~ 171 (445) .+++ ||+.... . - .++++.|-..+ ..-+++|....+. .+. T Consensus 164 ~~~v~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~~~~~v~i~E~~~~~~~~~~~~~ 241 (720) T protein:vir:35 164 ARSVW--FDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWYDVDVVYIAKYYEVKKESVDVVS 241 (720) T ss_pred hhhee--ecccccccChhhhhhhhhhcCCCHHHHHHhCCCccccccccccccccccccCCCceEEEEeeEEEEEEEEEEE Confidence 3332 3332110 0 0 11111111000 0012233222211 111 Q ss_pred Ee---cceeeecccc--------------------------------cccccccccccccccccceEEecCC---CC--- Q lcl|NC_021326. 172 YE---NGSLIPDYSN--------------------------------NLENSKTHFSTGSWGKIPFIPFKNN---DL--- 210 (445) Q Consensus 172 ~~---~~~~~~~~~~--------------------------------~~~~~~~~~~~~~~g~iPvv~~~n~---~~--- 210 (445) .. ++........ .+........+-+++.+|+|+|.-. .. T Consensus 242 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~~d~~~ 321 (720) T protein:vir:35 242 FQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIPVYGKRWFIDDIE 321 (720) T ss_pred eecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCCCccceEEEEeeeeccCCCc Confidence 00 0111000000 0000111123455677888876421 12 Q ss_pred -cCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCc----ee-ec----cCCC------- Q lcl|NC_021326. 211 -EISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYG----AI-KV----SDNG------- 273 (445) Q Consensus 211 -g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~----~~-~~----~~~~------- 273 (445) ..|.+.++++.++.+|...|.+...+-. .+...-.|.... ............ .. .+ ...| T Consensus 322 ~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~--~~~~~~~~a~~~-~~~~~~~~a~~~~~~~~~l~~~~~~~~~G~~~~~~~ 398 (720) T protein:vir:35 322 RVEGHIAKAMDAQRLYNLQVSMLADSATQ--DTGSIPIVGKSQ-IKTLEKYWANRNKNRPAFLPLNEIVDKQGNIIAPPT 398 (720) T ss_pred ccceeeecchhHHHHHHHHHHHHHHHHHc--CCccccccCcch-HHHHHHHhhccccccccccccccccccCcccccCCC Confidence 2577888999999999999999988753 334433342221 111221111110 00 00 1111 Q ss_pred ceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHH Q lcl|NC_021326. 274 GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQEL----LWF 349 (445) Q Consensus 274 ~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~----~~~ 349 (445) ...+...+.-......+++.-...|...|++.+..++..+ |+||+||...-..-..........+..+.+++ +.+ T Consensus 399 ~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~s-n~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~l 477 (720) T protein:vir:35 399 PVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMPS-NIAKETVNHLMHRSDMSSFIYLDNMAKSLKRAGEVWLSM 477 (720) T ss_pred cccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2233333333345566677777888888888776666554 58999998765554444444555555555554 444 Q ss_pred HHHHhccC------C-Cc----------------------c-------eEEEEeCCCCCCCHHHHHHHHHHHhccCChHH Q lcl|NC_021326. 350 VFEHFDIK------G-EH----------------------K-------DVDISFNYNKVANTELQVQTAQQSMGIVSHET 393 (445) Q Consensus 350 ~~~~~~~~------~-~~----------------------~-------~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~et 393 (445) |..+++.. + +. . +|.+.=.|..+.-..+.++.++++.+.++... T Consensus 478 I~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~m~qll~~~~p~~ 557 (720) T protein:vir:35 478 AREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSVLTNLLAGMLPQD 557 (720) T ss_pred HHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHHHHHHHHHHHhcCCCc Confidence 44443211 0 00 0 11122233333334444555555544333221 Q ss_pred ---------HHHhCCCCCCHHHHHHHHHHHHHHH--------------------HH-hhhccccCCCCC-----CCCCCC Q lcl|NC_021326. 394 ---------VLENHPFVEDLQAELERIEQEQMEY--------------------NK-QLPNLDDGGADG-----AQQKER 438 (445) Q Consensus 394 ---------~l~~l~~~~d~~~E~~ri~~E~~~~--------------------~~-~~~~~~~~~~~~-----~~~~~~ 438 (445) .++.+++ +..++-.+++.+..... .+ .........+.. ..+..+ T Consensus 558 ~~~~~~~~~ile~~d~-p~~~e~~erirk~~~~~~~~~~~~~e~qq~~a~~qq~~qq~~~e~~~aqa~l~qaqae~~kaq 636 (720) T protein:vir:35 558 PMRQVLQGIILDNMEG-EGLDEFKEYNRKQLLTQGVVKPRNTEEEQMVAQMIQQAQQPNAELVAAQGVLMQGQAEVQKAK 636 (720) T ss_pred hhHHHHHHHHHHhcCc-hhHHHHHHHHHhhcchhcccCccChhHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHHHHHHH Confidence 2222211 11222233332211100 00 000000000000 000000 Q ss_pred CCCcCCC Q lcl|NC_021326. 439 SNDKQSE 445 (445) Q Consensus 439 ~~d~~~~ 445 (445) ....... T Consensus 637 a~~~~~q 643 (720) T protein:vir:35 637 NEELAIQ 643 (720) T ss_pred HHHHHHH Confidence 0000000 No 111 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.37 E-value=3.6e-12 Score=83.32 Aligned_cols=391 Identities=10% Similarity=0.032 Sum_probs=195.6 Q ss_pred ChHHHHHHH---HHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccC Q lcl|NC_021326. 1 MIVRYIKQH---LEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHT 77 (445) Q Consensus 1 ~l~~~i~~~---~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~ 77 (445) -+.-+..-. ...... ..+..||....- ..+ ... .-+ -.+.+++.+|+..+.-++.+++.++++ T Consensus 79 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~f-~gy----------ql~-alY-~~~~l~rkiVd~pAeDa~R~g~~I~~~ 144 (765) T protein:vir:96 79 PTPAAKAAAGGQNPYVVP-TMLQDWYNSQGF-IGY----------QAC-AII-SQHWLVDKACSMSGEDAARNGWELKSD 144 (765) T ss_pred ccchHHHhhhccCccchh-hHHHhhhcccCC-ccH----------HHH-HHH-HhCchhhhhhhcchHHhhcCCceeecC Confidence 011111111 000011 112233322110 000 000 000 135788999999999999999999885 Q ss_pred chH----HHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECC-CC---------------cE-EEEEEccceeEEE Q lcl|NC_021326. 78 DDE----VIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDE-EG---------------EF-KLFRVPAEQGIPI 135 (445) Q Consensus 78 d~~----~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~-~g---------------~~-~i~~~~p~~~~~v 135 (445) +++ ..+.++..++ =++...+.++.+.+-.||.+++++-.+. ++ .+ .+.+++|..+.+. T Consensus 145 ~~e~~~~~~~~l~~~~~rl~v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l~~PL~~~~I~kg~~kgl~vldp~~~~~~ 224 (765) T protein:vir:96 145 GRKLSDEQSALIARRDMEFRVKDNLVELNRFKNVFGVRIALFVVESDDPDYYEKPFNPDGIAPGSYKGISQIDPYWAMPQ 224 (765) T ss_pred ccccCHHHHHHHHHHHHHhhHHHHHHHHHHHhhhceeeEEEEEecccCcchhhccccccccccceeeEEEEechhhcccc Confidence 432 3344444433 2677888999999999999988776542 21 11 2444555444432 Q ss_pred Ec---CCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccc--cceEE-ecCCC Q lcl|NC_021326. 136 WT---DKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGK--IPFIP-FKNND 209 (445) Q Consensus 136 ~d---~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~--iPvv~-~~n~~ 209 (445) -. ..+..... ++.+ ..|..... ........ .|.. +|-+. -.++- T Consensus 225 ~v~e~~~Dp~sp~---------------fg~P---~~y~i~g~-----------~IH~SRli-~~~g~~lpd~lk~~~~~ 274 (765) T protein:vir:96 225 LTAESTADPSAEH---------------FYEP---DFWIISGK-----------KYHRSHLV-VVRGPQPPDILKPTYIF 274 (765) T ss_pred cchhccccccccc---------------cCcc---eeeeecCc-----------eeccceEE-EecCCCchhhhccccCc Confidence 10 00000000 1111 00100000 00000000 0111 12111 11234 Q ss_pred CcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCccc-chhHHH-------hhhhCceeeccCCCceeeEecc Q lcl|NC_021326. 210 LEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQE-LPEFKR-------LLRYYGAIKVSDNGGVDTIQVE 281 (445) Q Consensus 210 ~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~-~~~~~~-------~~~~~~~~~~~~~~~~~~l~~~ 281 (445) +|.|.++.+.+-+..++.+.......+......++.+.+...-. ...... .....+++.++.+.+ |-+.+ T Consensus 275 ~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~v~k~~~~~~l~~~~~l~~r~~~~~~~r~n~g~~~id~ee~--~e~~s 352 (765) T protein:vir:96 275 GGIPLTQRIYERVYAAERTANEAPLLAMSKRTSTIHVDVEKAIANEDAFNARLAFWIANRDNHGVKVIGIDET--MEQFD 352 (765) T ss_pred cCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeeechHhhhccHHHHHHHHHHHHHhcCCceeEEecCCcc--eeEEe Confidence 69999999999999999998888888888877777665442211 111111 112334566666554 44455 Q ss_pred CChHHHHHHHHHHHHHHHHHhCcccc--cccc-ccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC Q lcl|NC_021326. 282 VPVENSKKYLDELYQKIMLFGQAVDF--SSDK-FGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG 358 (445) Q Consensus 282 ~~~~~~~~~i~~l~~~i~~~s~~p~~--~~~~-~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~ 358 (445) .+...+...++...+.|...+++|-. .+.. .|-+.||..-...+...+.. ..+..+...+++++.+++...+.. T Consensus 353 ~~lsgl~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~D~~nYyD~I~s--~Qe~~l~p~le~L~~li~~s~~i~- 429 (765) T protein:vir:96 353 TNLSDFDSVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEHETISYHEELES--IQEHIFDPLLERHYLLLAKSESID- 429 (765) T ss_pred cccCCHHHHHHHHHHHHHhhhCCCeeeeccCCcccccCcchHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHhcCCC- Confidence 67778899999999999999999963 2222 23356777544444444332 223678899999999988664433 Q ss_pred CcceEEEEeCCCCCCCHHHHHHHHHH-------H--hccCChHHHHHhCC------C--CCCHHHHHH-HHHHHHHHHHH Q lcl|NC_021326. 359 EHKDVDISFNYNKVANTELQVQTAQQ-------S--MGIVSHETVLENHP------F--VEDLQAELE-RIEQEQMEYNK 420 (445) Q Consensus 359 ~~~~i~v~f~~~~p~d~~~~~~~~~~-------~--~g~~s~et~l~~l~------~--~~d~~~E~~-ri~~E~~~~~~ 420 (445) .+++++|++-...+..+.|++..+ + .|++|.+.+++.|. + ++|-+.|.+ -+..|. .+ T Consensus 430 --~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~~~~~pe~---~~ 504 (765) T protein:vir:96 430 --VQLEIVWNPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRSGYNRLTDDQAETEPGMSPEN---LA 504 (765) T ss_pred --CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccCCCCCCCccccccccCCCccc---cc Confidence 368999999999999998875433 2 48899988888762 1 122111100 010000 00 Q ss_pred hhhccccCC----CCCCCCC---C---CCCCcCCC Q lcl|NC_021326. 421 QLPNLDDGG----ADGAQQK---E---RSNDKQSE 445 (445) Q Consensus 421 ~~~~~~~~~----~~~~~~~---~---~~~d~~~~ 445 (445) ......... .....++ . ..++.++. T Consensus 505 ~~~~~~~~~~~~~~e~~~~~a~p~~~eg~~~~~~~ 539 (765) T protein:vir:96 505 ELEKAGAQSAKAKGEAERAEAQAGAVEGAGDPVPA 539 (765) T ss_pred cccCCCcccccccCccccccCCCCccCCCCccccc Confidence 000000000 0000000 0 00000000 No 112 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.35 E-value=1.6e-11 Score=79.77 Aligned_cols=381 Identities=9% Similarity=0.031 Sum_probs=188.5 Q ss_pred HHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchHHHHHHHHHhccC Q lcl|NC_021326. 13 LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVIKRIDEVLGNR 92 (445) Q Consensus 13 ~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~~~~~l~~~~~n~ 92 (445) ..+.+-+...+-|-++--..... ...........-. -.+.+++.+|+..+.-++.++++++++++.. .....|.+=+ T Consensus 1 ~~~~D~~~n~~~gg~~~~~~~~~-~~~~~~~~l~a~Y-~~~~l~~~~Vd~~aed~~r~g~~i~~~~~~~-~~~~~~~~l~ 77 (422) T protein:vir:10 1 MVKTDSYANIFLGGSDGSEIYGS-LQNQAPTILASLY-ADNALVRRIIDTIPETALAAGFHIDGIDDEP-AFWSRWDDLE 77 (422) T ss_pred CccchhhHHHHcCCCCCccccCc-ccccCHHHHHHHH-HhChhhHHHHhhhhHHHhcCCccccCCCHHH-HHHHHHHHhh Confidence 22222223334342211000000 0000000000001 1357889999999999999999999876532 2222332336 Q ss_pred HHHHHHHHHHHHHhcCeEEEEEEECC----------CCcE-EEEEEccceeEEEEcCCCCCceEEE-EEEEeeecceeEE Q lcl|NC_021326. 93 FDDKLHSVLTGASNKGIEWLHPYLDE----------EGEF-KLFRVPAEQGIPIWTDKEHEELEAF-IRMYKLENETKVE 160 (445) Q Consensus 93 ~~~~~~~~~~~~~~~G~~~~~v~~d~----------~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~-v~~~~~~~~~~~~ 160 (445) ....+.++.+.+..||.+++++-... +|.+ .+.+++|.++.|..-+.+...+.++ -.+|.+..... T Consensus 78 ~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~~dp~s~~fg~P~~y~v~~~~~-- 155 (422) T protein:vir:10 78 MTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQTREENPRNARFGEPLTYRITTNES-- 155 (422) T ss_pred HHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccchhcccCccccccCcceEEEEecCCC-- Confidence 77888899999999999998887632 2222 3556666655443111111111111 01111111000 Q ss_pred EEecceEEEEEEecceeeeccccccccccccccccccc-ccce-EEecCCCCcCccHHH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 161 YWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWG-KIPF-IPFKNNDLEISDIFM-YKTLIDAYNRRLSDLSNTFK 237 (445) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~iPv-v~~~n~~~g~s~~~~-v~~lid~~~~~~s~~~~~~~ 237 (445) .......+....+.-| .+|- ....++.+|.|.+.. +.+-+..++.+.......+. T Consensus 156 ----------------------~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~ 213 (422) T protein:vir:10 156 ----------------------DMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLK 213 (422) T ss_pred ----------------------CcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000000000000000 1221 123345578998876 67888888888888888787 Q ss_pred HhcCCeeEEecCC-----cccchhHH-H---h---hhhCceeec-cCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCc Q lcl|NC_021326. 238 DSNELTYVLTNYD-----DQELPEFK-R---L---LRYYGAIKV-SDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQA 304 (445) Q Consensus 238 ~~~~~~l~~~g~~-----~~~~~~~~-~---~---~~~~~~~~~-~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~ 304 (445) ......+.+.|.. ........ + . ....+.+.+ +++.+ |-+.+.+.+.+...++.....|...+++ T Consensus 214 ~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~--~e~~~~~lsgl~~~~~~~~~~iaaa~~I 291 (422) T protein:vir:10 214 RKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEE--YSVLNSDIGGIDAFLDKKFDRIVALSGI 291 (422) T ss_pred HhccccccchhHHHhcCCccchHHHHHHHHHHHHhcCCccceeEecCCcc--eEEEecccCChHHHHHHHHHHHHhhhCC Confidence 7777777666521 11111100 0 0 012223333 33343 4445566778889999999999999999 Q ss_pred ccccc--ccccC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHH Q lcl|NC_021326. 305 VDFSS--DKFGS-APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQT 381 (445) Q Consensus 305 p~~~~--~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~ 381 (445) |-.-+ ...+| |.||..-...+...+.. ..+..+.+.+++++.+++. ..++.++|+|-...++.+.|++ T Consensus 292 P~t~L~G~s~~Glnatgd~d~~~yyd~i~~--~Qe~~l~p~l~~l~~~i~~-------s~~~~~~f~pL~~~sekekaei 362 (422) T protein:vir:10 292 HEIILKNKNVGGVSSSQNTALETFHKLVDR--KRNAELLPILEFLIPFIVN-------AEEWSVEFNPLAQESSKDKAEI 362 (422) T ss_pred CeeeeccCCcccccccchHHHHHHHHHHHH--HHHHHHHHHHHHHHHHhcc-------cCCcEEEeCCCCCCCHHHHHHH Confidence 96422 22222 35566555444444332 2245678899998888763 1467899999999999998887 Q ss_pred HHHHh---------ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCC-CCCCCCCCCcCCC Q lcl|NC_021326. 382 AQQSM---------GIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADG-AQQKERSNDKQSE 445 (445) Q Consensus 382 ~~~~~---------g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~-~~~~~~~~d~~~~ 445 (445) ..+.+ |+++.+.+.+.|.. . .......+...+...+. ..++...+++++| T Consensus 363 ~~~~a~a~~~~~~~g~i~~~e~r~~L~~-------------~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 422 (422) T protein:vir:10 363 LEKNVNSIAALIAAGAMDIDEARDTLRT-------------I-APEVKINDGSVETEVTISETSNDPLEVPTDD 422 (422) T ss_pred HHHHHHHHHHHHhcCCCCHHHHHHHhhh-------------h-cccccCCCCCCccccchhhcCCCCCCCCCCC Confidence 55432 55666555544411 0 00000000000000000 0001111222222 No 113 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.34 E-value=1.5e-11 Score=79.88 Aligned_cols=400 Identities=8% Similarity=-0.026 Sum_probs=194.6 Q ss_pred ChHHHHHHHHHHHHHHHH-------HHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCee Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISI-------GQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA 73 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~-------~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~ 73 (445) .+..++.. .+..+.. ...|+..+-. .................+ -.+.+++.+|+..+.-++.+++. T Consensus 94 ~~~~~~~D---gl~n~~~~lG~~~~~s~y~~~~~~---~~~~~~~~f~gyql~alY-~~~~larkiVd~pAeDatR~g~~ 166 (862) T protein:vir:99 94 AITGFAMD---DGGGAPVPIGAEGKQSSYAVPEAL---QDWYLSQGFIGHQACALI-AQHWLVDKACSLAGEDAIRNGWH 166 (862) T ss_pred hhhhhhhh---cchhhhhhccccccccccccchhc---cccccccCcccHHHHHHH-HhCchhhhhhhhhhHHHhhCCce Confidence 11111111 0000000 0011111000 000000000000000001 13578899999999999999999 Q ss_pred eccCc------hHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECC-CC---------------cE-EEEEEcc Q lcl|NC_021326. 74 FKHTD------DEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDE-EG---------------EF-KLFRVPA 129 (445) Q Consensus 74 ~~~~d------~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~-~g---------------~~-~i~~~~p 129 (445) +.+.+ ++..+.++..+. -++...+.++.+.+-.||.+++++-.+. ++ .+ .+.+++| T Consensus 167 I~~~~d~~e~~~e~~~~ie~~~~rL~v~~~l~eair~~RLyGga~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp 246 (862) T protein:vir:99 167 LKSLGEGEEIDEESLEKFKAIDVEFKVKENLIEFNRFKNVFGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDP 246 (862) T ss_pred EeecCcccccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEEecCcCchhhhcCcCcccccccceeEEEEech Confidence 98732 233445555443 3677788888888888998877665432 22 11 2555666 Q ss_pred ceeEEEE---cCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEE-e Q lcl|NC_021326. 130 EQGIPIW---TDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIP-F 205 (445) Q Consensus 130 ~~~~~v~---d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~-~ 205 (445) ..+.|.- ...+...+.+ ...++|...... -..+++....+ ..+|-.. - T Consensus 247 ~w~~p~~v~~~~~Dp~sp~y----------GkP~~y~I~g~~----IH~SRliif~g--------------~~vpd~lk~ 298 (862) T protein:vir:99 247 YWMMPMLTAESTADPSSQFF----------YEPEFWIISGQK----YHRSHLIIARG--------------PQPADILKP 298 (862) T ss_pred hhhccccccccccccccccc----------CCceeeeecCee----eccceeEEecC--------------CCchhhhhc Confidence 5554421 0001111111 001111100000 00000000000 0111110 0 Q ss_pred cCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccc-hhHHHh-------hhhCceeeccCCCceee Q lcl|NC_021326. 206 KNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQEL-PEFKRL-------LRYYGAIKVSDNGGVDT 277 (445) Q Consensus 206 ~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~-~~~~~~-------~~~~~~~~~~~~~~~~~ 277 (445) .++.+|.|.++.+.+.+..++.+.......+..+...++.+.+...-.. ...... ....+++.++.+.+ | T Consensus 299 ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~l~v~ktd~l~~l~~ed~l~~r~~~~~~~rdN~Gi~liD~eEe--~ 376 (862) T protein:vir:99 299 TYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKRTTAIHTDTAKAIANEDKFIQRLMFWVRYRDNHAVKVLGTDET--M 376 (862) T ss_pred cCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeechhHhhhccHHHHHHHHHHHHhccCcceeEEecCCCc--e Confidence 2334799999999999999999988888888888877776655432111 111111 11234566665544 4 Q ss_pred EeccCChHHHHHHHHHHHHHHHHHhCccccc-cc-c-ccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021326. 278 IQVEVPVENSKKYLDELYQKIMLFGQAVDFS-SD-K-FGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF 354 (445) Q Consensus 278 l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~-~~-~-~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~ 354 (445) -+.+.+.+.+...++...+.|...+++|-.- ++ . .|-+.||..=...+...+... .+..+.+.|++++.++..-+ T Consensus 377 e~ls~slSGL~dll~~~~q~IAaas~IP~tiLfGqspaGlnATGE~D~~nYyD~I~s~--QE~~L~P~LerL~~li~~~l 454 (862) T protein:vir:99 377 EQFDTSLADFDAVIMGQYQLVASIAKTPATKLLGTAPKGFNSTGEFETISYHEELESI--QEHVYMPFLQRHYLISRLSL 454 (862) T ss_pred eEEecccCChHHHHHHHHHHHHhhhCCCceeecccCcccccCchHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHhc Confidence 4455677788889999999999999999652 22 2 234567775444454444433 24568889998888776555 Q ss_pred ccCCCcceEEEEeCCCCCCCHHHHHHHHH-------HH--hccCChHHHHHhC--------CCCCCHHHHH-HHHHHHHH Q lcl|NC_021326. 355 DIKGEHKDVDISFNYNKVANTELQVQTAQ-------QS--MGIVSHETVLENH--------PFVEDLQAEL-ERIEQEQM 416 (445) Q Consensus 355 ~~~~~~~~i~v~f~~~~p~d~~~~~~~~~-------~~--~g~~s~et~l~~l--------~~~~d~~~E~-~ri~~E~~ 416 (445) +.. .+++++|++-...+..+.|++.. ++ .|++|.+.++.+| +.+++.+.|- .-+..|.. T Consensus 455 g~~---~d~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~sGvispdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~ 531 (862) T protein:vir:99 455 GIQ---HEIDVVMEPVASMTAQQQADLNKTKAEGGKVLIDGGVISPDEERNRIRDDKRSGYNRLTKEDAEETPGASPENL 531 (862) T ss_pred CCC---CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCcCCCCCCcccccccCCCCcccc Confidence 533 46899999999999999887643 22 4889988887753 2222221110 00111111 Q ss_pred HHHHhhhccccCCCC-------CCCCCC-CCCCcCCC Q lcl|NC_021326. 417 EYNKQLPNLDDGGAD-------GAQQKE-RSNDKQSE 445 (445) Q Consensus 417 ~~~~~~~~~~~~~~~-------~~~~~~-~~~d~~~~ 445 (445) .. ..+..+.... ++.+.. .+.+.+-+ T Consensus 532 ~~---~e~~g~a~~~ap~de~~aga~~~~~e~d~~~~ 565 (862) T protein:vir:99 532 AA---YQKAGAAQETASAKETQAGAAVTTAEGDQPNV 565 (862) T ss_pred cc---cccCCcccccccccccccccCCccccCCcccc Confidence 00 0000000000 000000 00000000 No 114 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=99.32 E-value=1.2e-10 Score=74.99 Aligned_cols=419 Identities=9% Similarity=0.036 Sum_probs=173.5 Q ss_pred ChHHHHHH----HHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhh----hhccC- Q lcl|NC_021326. 1 MIVRYIKQ----HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY----IVGKP- 71 (445) Q Consensus 1 ~l~~~i~~----~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~----l~g~~- 71 (445) .|++.+.. |...+.+...+.+||.|.-+.. +... ..|+ +++.+.....|+..... +++.+ T Consensus 31 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-------~grs--~vv~~~v~~~ve~~~~~l~~~f~~~~~ 99 (763) T protein:vir:95 31 ALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKAK--PPKV-------KGRS--QVQPKLVRRQAEWRYSALTEPFLGSNK 99 (763) T ss_pred HHHHHHHhhhcchhHHHHHHHHHHHhhhccccCc--cccc-------CCCc--cccCHHHHHHHHHHHHHHHHhhcCCCc Confidence 33333332 3344455555666654432211 1111 1122 45566555555555443 34432 Q ss_pred -eeecc---CchHHH----HHHHHHh--ccCHHHHHHHHHHHHHhcCeEEEEEEECC----------------------- Q lcl|NC_021326. 72 -IAFKH---TDDEVI----KRIDEVL--GNRFDDKLHSVLTGASNKGIEWLHPYLDE----------------------- 118 (445) Q Consensus 72 -~~~~~---~d~~~~----~~l~~~~--~n~~~~~~~~~~~~~~~~G~~~~~v~~d~----------------------- 118 (445) |.+.+ +|.+.. ..++..+ .|+-.......+++++++|.+++.||++. T Consensus 100 ~~~~~P~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~ 179 (763) T protein:vir:95 100 LFKVTPVTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWNREIRKEKQEVPVFSLFPIQTQEQA 179 (763) T ss_pred EEEEecCCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeeeeeeeeeeeeehhhhhccccchhHH Confidence 45543 232222 2344433 35667778899999999999999987630 Q ss_pred -------------------------------------------------------CCcEEEEEEccceeEEEEcCC-CCC Q lcl|NC_021326. 119 -------------------------------------------------------EGEFKLFRVPAEQGIPIWTDK-EHE 142 (445) Q Consensus 119 -------------------------------------------------------~g~~~i~~~~p~~~~~v~d~~-~~~ 142 (445) .++|+|..|+|.++++-++.. +.. T Consensus 180 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie~V~p~d~~iDp~a~sD~~ 259 (763) T protein:vir:95 180 DALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEMLNPENIIIDPSCQGDIN 259 (763) T ss_pred HHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEEEeecHHHheecCCCCCchh Confidence 124577789999987533211 111 Q ss_pred ceEE-EEEEEeeecce-------------------eE-------------EEEe--cceEEEEEEecceee--------- Q lcl|NC_021326. 143 ELEA-FIRMYKLENET-------------------KV-------------EYWD--KITVNYYVYENGSLI--------- 178 (445) Q Consensus 143 ~~~~-~v~~~~~~~~~-------------------~~-------------~~~~--~~~~~~~~~~~~~~~--------- 178 (445) ...+ +.+.+.+..+. .. ...+ ...+. ++++.... T Consensus 260 Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~V~--v~E~y~~~d~~gdg~~~ 337 (763) T protein:vir:95 260 KAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVV--AYEYWGFWDIEGNGVLE 337 (763) T ss_pred hCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCCCcccceEE--EEEeeeeeccCCcceeE Confidence 1122 22222211100 00 0000 00111 11111000 Q ss_pred -eccccccccccccc--ccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEE-ecC Q lcl|NC_021326. 179 -PDYSNNLENSKTHF--STGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL-TNY 249 (445) Q Consensus 179 -~~~~~~~~~~~~~~--~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~-~g~ 249 (445) ..... .+....+. .+.+.+++||+.++ ...+|.|.+..++++++.+|..++.+.+.+...+.|.+.+ .|. T Consensus 338 ~~~v~~-~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~~~~d~Qr~~N~~~~~~~d~l~~~~~~~~~v~~ga 416 (763) T protein:vir:95 338 PIVATW-IGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAELLGDNQAVLGAVMRGMIDLLGRSANGQRGMPKGM 416 (763) T ss_pred EEEEEE-EcCeeeecccccccCCCcCEEEecceeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhhcCCcEEeeccc Confidence 00000 01111112 23344677877553 3457899999999999999999999999999988876544 333 Q ss_pred CcccchhHHHhhhhCceeeccCCCce----eeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccC--cchHHHHHH Q lcl|NC_021326. 250 DDQELPEFKRLLRYYGAIKVSDNGGV----DTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGS--APSGVALEF 323 (445) Q Consensus 250 ~~~~~~~~~~~~~~~~~~~~~~~~~~----~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~--~~Sg~Ai~~ 323 (445) .. ..+.. ..+.++++.+..+... .++..+.........+..+...+-..|+++..+.+..++ +.++.++.. T Consensus 417 v~-~~d~~--~~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~l~~~~~~~e~~TGv~~~~~G~~~~~~~~tat~v~~ 493 (763) T protein:vir:95 417 LD-ALNSR--RYREGEDYEYNPTQNPAQMIIEHKFPELPQSALTMATLQNQEAESLTGVKAFAGGVTGESYGDVAAGIRG 493 (763) T ss_pred cc-chhhh--cccCCceEEeeCCCChhhhcccccCCCCcchHHHHHHHHHHHHHHhhCcchhhcCcCcccccchhHHHHH Confidence 21 11111 1234445555443332 233333333455556666666677778777665432221 122233433 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-----------Ccce---------EEEEeCCCCCCCH-HHHHHHH Q lcl|NC_021326. 324 LYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG-----------EHKD---------VDISFNYNKVANT-ELQVQTA 382 (445) Q Consensus 324 ~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~-----------~~~~---------i~v~f~~~~p~d~-~~~~~~~ 382 (445) .............+.|..+++.+.+.+++++-... ++.. .+|+-... +.+. .+.+..+ T Consensus 494 l~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~~~~~~~~~DV~V~~~-~as~~~q~~~~l 572 (763) T protein:vir:95 494 VLDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIKREDLKGNFDLEVDIS-TAEVDNQKSQDL 572 (763) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEeCCccccccHHHhcCCcceEEecc-cchHHHHHHHHH Confidence 33444444445555665666655555554432211 1111 11111111 1111 1112212 Q ss_pred HHH---h-ccCChHH---HH----HhCCC---C---------CCHH----H--HHHHHHHHHHHHHHhhhccccCCCCCC Q lcl|NC_021326. 383 QQS---M-GIVSHET---VL----ENHPF---V---------EDLQ----A--ELERIEQEQMEYNKQLPNLDDGGADGA 433 (445) Q Consensus 383 ~~~---~-g~~s~et---~l----~~l~~---~---------~d~~----~--E~~ri~~E~~~~~~~~~~~~~~~~~~~ 433 (445) ..+ . ..++... .+ +.... + .++. + ++++.+.+.+.....+. -. T Consensus 573 ~~ll~~l~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~~d~~~q~qaqle~~~~q~e~~~~~akaq--------~~ 644 (763) T protein:vir:95 573 GFMLQTIGPNVDQQITLNILAEIADLKRMPKLAHDLRTWQPQPDPVQEQLKQLAVEKAQLENEELRSKIR--------LN 644 (763) T ss_pred HHHHHHhccccChHHHHHHHHHHHhhhchhhhHHHHHhcCCCccchhhhHHHHHHHHHHHHHHHHHHHHH--------HH Confidence 211 1 1122111 01 00000 0 0000 0 00000000000000000 00 Q ss_pred CCCCCCCCcCCC Q lcl|NC_021326. 434 QQKERSNDKQSE 445 (445) Q Consensus 434 ~~~~~~~d~~~~ 445 (445) +........+.+ T Consensus 645 qaqa~~~~aq~e 656 (763) T protein:vir:95 645 DAQAQKAMAERD 656 (763) T ss_pred HHHHHHHHHHHH Confidence 000000000000 No 115 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.31 E-value=8.5e-12 Score=81.28 Aligned_cols=376 Identities=11% Similarity=0.048 Sum_probs=186.5 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchHHHHHH Q lcl|NC_021326. 6 IKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVIKRI 85 (445) Q Consensus 6 i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~~~~~l 85 (445) ++.++ -+-+.++.-|.++-..++...... ..... .-. -.+.+++.+|+..+.-++.+++.++++++. +.+ T Consensus 1 ~~~~~-----~d~~~~~~~~~~~~~~~~~~~~~~-~~~l~-a~Y-~~~~l~~~~Vd~~aed~~r~g~~i~g~~~~--~~~ 70 (427) T protein:vir:10 1 MKIVK-----HDGYNDIFNGGADGSPKPFFMSDA-SYHVG-SFY-NDNATAKRIVDVIPEEMVTAGFKMSGVKDE--KEF 70 (427) T ss_pred CCccc-----cchHHHHhhcCCCCcccCccccCc-hHHHH-HHH-HcCchhhhhhccchHHhhcCCccccCccHH--HHH Confidence 22111 111122233332211111110000 00000 000 135778999999999999999999986543 334 Q ss_pred HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC----------CcE-EEEEEccceeEEEEcCCCCCceEEEEEEEee Q lcl|NC_021326. 86 DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE----------GEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKL 153 (445) Q Consensus 86 ~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~----------g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~ 153 (445) +..|+ =+....+.++.+.+..||.+++++-.+.+ |.+ .+.+++|.++.|-..+.+...+.+ T Consensus 71 ~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~~~~dp~s~~f------- 143 (427) T protein:vir:10 71 KSLWDSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVEKRVTNARSPRY------- 143 (427) T ss_pred HHHHHHhhHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhcccccccccCcccccc------- Confidence 44443 26778889999999999999998866432 221 244555544433211111111100 Q ss_pred ecceeEEEEecceEEEEEEecceeeeccccccccccccccccccc-------ccceEE-ecCCCCcCccHH-HHHHHHHH Q lcl|NC_021326. 154 ENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWG-------KIPFIP-FKNNDLEISDIF-MYKTLIDA 224 (445) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-------~iPvv~-~~n~~~g~s~~~-~v~~lid~ 224 (445) +.+ ..|...... .......|+-- .+|-.. ..++.+|.|.+. .+.+-+.. T Consensus 144 --------g~P---~~y~v~~~~-----------~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~ 201 (427) T protein:vir:10 144 --------GEP---EIYKVSPGD-----------NMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSLIDAICD 201 (427) T ss_pred --------Ccc---eEEEEecCC-----------CCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHHHHHHHH Confidence 000 011100000 00000112111 112111 124557888875 57788888 Q ss_pred HHHHHHHHHHHHHHhcCCeeEEecCC-----cccchhH-HH------hhhhCceeecc-CCCceeeEeccCChHHHHHHH Q lcl|NC_021326. 225 YNRRLSDLSNTFKDSNELTYVLTNYD-----DQELPEF-KR------LLRYYGAIKVS-DNGGVDTIQVEVPVENSKKYL 291 (445) Q Consensus 225 ~~~~~s~~~~~~~~~~~~~l~~~g~~-----~~~~~~~-~~------~~~~~~~~~~~-~~~~~~~l~~~~~~~~~~~~i 291 (445) ++.+.......+.......+.+.|.. ....... .+ .....+.+.+. ++. +|-+.+.+...+...+ T Consensus 202 ~~~~~~~~~~l~~k~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e--~~e~~~~~lsgl~~~~ 279 (427) T protein:vir:10 202 YDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETE--EYDVLNSDISGVPEFL 279 (427) T ss_pred HHHHHHHHHHHHHHhccccccchhHHHHhcCccchHHHHHHHHHHHHhcCcccceeeecCCC--ceeEEecccCChHHHH Confidence 88888888887888777777665532 1111111 11 01123333333 333 3444556777788899 Q ss_pred HHHHHHHHHHhCcccccc--ccccC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeC Q lcl|NC_021326. 292 DELYQKIMLFGQAVDFSS--DKFGS-APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFN 368 (445) Q Consensus 292 ~~l~~~i~~~s~~p~~~~--~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~ 368 (445) +...+.|...+++|-.-+ ...+| |.||..-...+...+.. ..+..+.+.+++++.+++.- .++.++|+ T Consensus 280 ~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~D~~nyyd~i~~--~Qe~~l~p~l~~l~~~i~~s-------~~~~~~f~ 350 (427) T protein:vir:10 280 SSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDR--KREEDYRPLLEFLLPFIVDE-------EEWSIEFE 350 (427) T ss_pred HHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHhhcC-------CCcEEEeC Confidence 999999999999996422 22232 46666655555554442 23456888899888887621 46789999 Q ss_pred CCCCCCHHHHHHHHHHH---------hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHh--hhccccCC--CCCCCC Q lcl|NC_021326. 369 YNKVANTELQVQTAQQS---------MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQ--LPNLDDGG--ADGAQQ 435 (445) Q Consensus 369 ~~~p~d~~~~~~~~~~~---------~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~--~~~~~~~~--~~~~~~ 435 (445) |-...+..+.+++..+. .|+++.+.+.+.| ...- ..... ..+..... ...+.+ T Consensus 351 pL~~~s~kEkaei~~~~a~a~~~~~~~gvi~~~e~r~~L-------------~~~~-~~~~~~~~~~~~~e~~~~~~e~~ 416 (427) T protein:vir:10 351 PLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTL-------------RSIA-PEFKLKDGNNINIREPEETTEPE 416 (427) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHH-------------Hhhh-ccccCCCCccccccccchhcCCC Confidence 99999999998765443 2566666655433 1100 00000 00000000 000011 Q ss_pred CCCCCCcCCC Q lcl|NC_021326. 436 KERSNDKQSE 445 (445) Q Consensus 436 ~~~~~d~~~~ 445 (445) +..+++.+.| T Consensus 417 p~~~e~~~d~ 426 (427) T protein:vir:10 417 PGLGEKLEDE 426 (427) T ss_pred CCCCCCCCCC Confidence 1111111111 No 116 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=99.31 E-value=1.3e-10 Score=74.86 Aligned_cols=436 Identities=10% Similarity=0.030 Sum_probs=190.0 Q ss_pred ChHHHHHHHH-------HHHHHHHHHHHHh--cCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccC Q lcl|NC_021326. 1 MIVRYIKQHL-------EKLPEISIGQEYY--EQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKP 71 (445) Q Consensus 1 ~l~~~i~~~~-------~~~~~~~~~~~yy--~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~ 71 (445) ++.++...+. +...+...-.+|| .|+| ........-........++ .+++|.++.+|+..+++.--+. T Consensus 8 ~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~Q-W~~~~~~~l~~~~q~~grP--~~~~N~i~~~v~~v~g~~~~nr 84 (706) T protein:vir:10 8 QHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQ-WEGATVAGTKLDEQFEKYP--KFEINKVATELNRIISEYRNNR 84 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcc-CCHHHHHHHHhhhhhcCCC--ceEecchHHHHHHHhhHHHhCC Confidence 4444444433 3333444455676 4554 1100000000000001222 4788999999999999987665 Q ss_pred eee--cc----CchHHHHHH----HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECC---------CCcEEEEEE-ccc Q lcl|NC_021326. 72 IAF--KH----TDDEVIKRI----DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDE---------EGEFKLFRV-PAE 130 (445) Q Consensus 72 ~~~--~~----~d~~~~~~l----~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~---------~g~~~i~~~-~p~ 130 (445) +.+ .+ ++.+..+.+ +.+.+ ++.......+..+++++|.||+-+..|- ++++.+..+ +|. T Consensus 85 ~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~~d~~~~~~~i~i~~v~~p~ 164 (706) T protein:vir:10 85 ISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFVNEYDPMDERQRIAVEPIYDPA 164 (706) T ss_pred CceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeeccccccCCCCCCccceeeeeccch Confidence 443 33 123344443 33333 6788889999999999999999886542 124444433 565 Q ss_pred eeEEEEcCCC----CCce-EEEEEEEeeec------------------------------ceeEEEEecceE--E--EEE Q lcl|NC_021326. 131 QGIPIWTDKE----HEEL-EAFIRMYKLEN------------------------------ETKVEYWDKITV--N--YYV 171 (445) Q Consensus 131 ~~~~v~d~~~----~~~~-~~~v~~~~~~~------------------------------~~~~~~~~~~~~--~--~~~ 171 (445) ..+ +||+.. .... .++.+.|...+ ....++|..... . .|. T Consensus 165 ~~v-~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d~~~~~eyy~~~~~~~~~~~~~ 243 (706) T protein:vir:10 165 RSV-WFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPDVVYIAKYYEVRKESVDVISYR 243 (706) T ss_pred hce-ecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCCCcceecccccccceeEEEEEee Confidence 422 355421 1111 12222121100 001112221110 0 000 Q ss_pred Ee-cceeee-cc---------ccccc----------------------ccccccccccccccceEEecCCC---C----c Q lcl|NC_021326. 172 YE-NGSLIP-DY---------SNNLE----------------------NSKTHFSTGSWGKIPFIPFKNND---L----E 211 (445) Q Consensus 172 ~~-~~~~~~-~~---------~~~~~----------------------~~~~~~~~~~~g~iPvv~~~n~~---~----g 211 (445) .. ...... .. ....+ .......+.+.+++|+|+|.-.. + . T Consensus 244 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p~~~~~~P~vP~~g~r~~~d~~~~~ 323 (706) T protein:vir:10 244 QPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDVERV 323 (706) T ss_pred ccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCCCCCCccceEEEeeccccccccCcc Confidence 00 000000 00 00000 00001123344778888764322 2 3 Q ss_pred CccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhC-----ceeecc----CCCc-------e Q lcl|NC_021326. 212 ISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYY-----GAIKVS----DNGG-------V 275 (445) Q Consensus 212 ~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~-----~~~~~~----~~~~-------~ 275 (445) .|.+.++++.++.+|..+|.+.+.+-.... ..-.|..+. ........... ..+.+. .+|. . T Consensus 324 ~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~--~~~~~~~~~-i~~~~~~~~~~~~~~~~~l~~~~~~~~~g~i~~~~~~~ 400 (706) T protein:vir:10 324 EGHIAKAMDPQRLYNLQVSMLADAAAQDPG--QTPIVDMEQ-IRGLEQHWEGRNRKRPAFLPLRTVTDKTGNVVAPANVA 400 (706) T ss_pred cceeccchhhHHHHHHHHHHHHHHHHhcCC--cccccchhH-HHHHHHHhhhcccccccchhcccccCCCCccccccccc Confidence 578889999999999999999887644433 222222111 11111000000 001110 1111 1 Q ss_pred eeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--- Q lcl|NC_021326. 276 DTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFE--- 352 (445) Q Consensus 276 ~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~--- 352 (445) .++..+.-..++...++.....|...|++.+.+.+.. +|.||+||...-.............+..+.+++-+++++ T Consensus 401 ~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~-sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~li~ 479 (706) T protein:vir:10 401 GYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMP-SNVARETVNSLLNRSDMASFIYLDNMAKSLKRAGEIWLSMAR 479 (706) T ss_pred ccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222222334566677888888999998877666654 468999998886666666666666666676666544444 Q ss_pred -HhccC---------CC---------------c-----ceE-----EEEe--CCCCCCCHHHHHHHHHHHhcc-CCh--H Q lcl|NC_021326. 353 -HFDIK---------GE---------------H-----KDV-----DISF--NYNKVANTELQVQTAQQSMGI-VSH--E 392 (445) Q Consensus 353 -~~~~~---------~~---------------~-----~~i-----~v~f--~~~~p~d~~~~~~~~~~~~g~-~s~--e 392 (445) ++... +. + .+| +|+. .+..+.-..+..+.++.+.+. .+. . T Consensus 480 ~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~el~~~~~p~~~~ 559 (706) T protein:vir:10 480 EIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQLLQGMLPQDPM 559 (706) T ss_pred HHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchHHHHHHHHHHHHHHhcCCcchh Confidence 33211 00 0 011 2222 334444455555666655432 221 1 Q ss_pred H------HHHhCCCCCCHHHHHHHHHHHHH-------------HHHH---hhhccc-cCC-CCCCCCCCCCCCcCCC Q lcl|NC_021326. 393 T------VLENHPFVEDLQAELERIEQEQM-------------EYNK---QLPNLD-DGG-ADGAQQKERSNDKQSE 445 (445) Q Consensus 393 t------~l~~l~~~~d~~~E~~ri~~E~~-------------~~~~---~~~~~~-~~~-~~~~~~~~~~~d~~~~ 445 (445) + +++.+.+ +..++-.++|++... +... ++.... +.. .....+-.+.+.+... T Consensus 560 ~~~l~~~~~~~~d~-p~~~e~~e~irk~~~~q~~~~~~~~~eq~~~~q~qq~q~~q~~~~~~~~~aq~~~~qA~~~k 635 (706) T protein:vir:10 560 RPALMGIIIDNMEG-EGLDDFKAFNRRQLLTQGIVKPRNQQEQAIVQQAQQAQATQPDPNMLLAQAQMVVAQAEAQK 635 (706) T ss_pred hHHHHHHHHhhcCc-cchHHHHHHHHHhhcccCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 2333321 222222334432110 0000 000000 000 0000000000000000 No 117 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=99.25 E-value=3.4e-10 Score=72.51 Aligned_cols=438 Identities=11% Similarity=0.035 Sum_probs=183.5 Q ss_pred ChHHHHHHHHH-------HHHHH--HHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccC Q lcl|NC_021326. 1 MIVRYIKQHLE-------KLPEI--SIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKP 71 (445) Q Consensus 1 ~l~~~i~~~~~-------~~~~~--~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~ 71 (445) ++.++..++.. ...+. +.-..||.|+|= .......-.....-..+| .+++|.++.+|+..+|+---+. T Consensus 8 ~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw-~~~~~~~l~~~~q~~~rP--~~~~N~i~~~i~~v~g~e~~nr 84 (708) T protein:vir:17 8 KHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQW-EGATAAGTKLDEQFEKYP--KFEINKVATELNRIIAEYRNNR 84 (708) T ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCC-CHHHHHHHHhhhhhcCCC--ceEEcchHHHHHHHHhhHhhCC Confidence 33333333221 11112 111368999761 000000000000001122 4778999999999999976555 Q ss_pred ee--eccC----chHHHHHH----HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC---CC------CcEEEEEE--cc Q lcl|NC_021326. 72 IA--FKHT----DDEVIKRI----DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLD---EE------GEFKLFRV--PA 129 (445) Q Consensus 72 ~~--~~~~----d~~~~~~l----~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d---~~------g~~~i~~~--~p 129 (445) +. +.+. +.+..+.+ +.+.+ ++.....+.+..+++++|.||+-+..| ++ .++.|..+ ++ T Consensus 85 ~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~d~~~~~~~i~i~~~~~~~ 164 (708) T protein:vir:17 85 ITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPIYDPS 164 (708) T ss_pred cceEEecCCCcchHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeeeecccccCCCCCCccccceEeeccch Confidence 44 4332 23334444 33333 678888899999999999999877442 21 23444333 33 Q ss_pred ceeEEEEcCCCCC-c---eE-EEEEEEeeec-------------------------------ceeEEEEecceEE--EEE Q lcl|NC_021326. 130 EQGIPIWTDKEHE-E---LE-AFIRMYKLEN-------------------------------ETKVEYWDKITVN--YYV 171 (445) Q Consensus 130 ~~~~~v~d~~~~~-~---~~-~~v~~~~~~~-------------------------------~~~~~~~~~~~~~--~~~ 171 (445) .+++ ||+.... . .. ++++.|...+ .-.+++|...... .+. T Consensus 165 ~~v~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r~~~~~~~~~ 242 (708) T protein:vir:17 165 RSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDWFDADVIYIAKYYEVRKESVDVIS 242 (708) T ss_pred hhee--cCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccccCCCeEEEEEEEEEeeeeeEEEE Confidence 4543 5543210 0 01 1111111000 0001222111110 000 Q ss_pred Ee---cceeeecc--------------------------------cccccccccccccccccccceEEecCC---CCc-- Q lcl|NC_021326. 172 YE---NGSLIPDY--------------------------------SNNLENSKTHFSTGSWGKIPFIPFKNN---DLE-- 211 (445) Q Consensus 172 ~~---~~~~~~~~--------------------------------~~~~~~~~~~~~~~~~g~iPvv~~~n~---~~g-- 211 (445) .. ++...... ...+........+-+++.+|+|+|.-. ..| T Consensus 243 ~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~~d~~~ 322 (708) T protein:vir:17 243 YRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIE 322 (708) T ss_pred EecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccCCCCCCCCccceEEEecccccccCCC Confidence 00 00000000 000111111234455667888876422 122 Q ss_pred --CccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEe-----cCCccc----chhH--HHhhhh-CceeeccCCCc-ee Q lcl|NC_021326. 212 --ISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLT-----NYDDQE----LPEF--KRLLRY-YGAIKVSDNGG-VD 276 (445) Q Consensus 212 --~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~-----g~~~~~----~~~~--~~~~~~-~~~~~~~~~~~-~~ 276 (445) .|.+.++++.++.+|...|.+...+-.......++. |..... .... ...... ...-.+..++. .. T Consensus 323 ~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~~a~~~~ 402 (708) T protein:vir:17 323 RVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKYGNIIAGATPAG 402 (708) T ss_pred cccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhhhhhhhccCCcccccccccCCcc Confidence 467779999999999999999887766655444322 111100 0000 000000 00111111111 11 Q ss_pred eEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHH Q lcl|NC_021326. 277 TIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQEL----LWFVFE 352 (445) Q Consensus 277 ~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~----~~~~~~ 352 (445) ....+.-...+...++.....|...|++.+.+.+. .+|+||+|+...-..-..........+..+.+++ +.+|.. T Consensus 403 ~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~-~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~ 481 (708) T protein:vir:17 403 YTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMARE 481 (708) T ss_pred cCCCccccHHHHHHHHHHHHHHHHhcCCChHHccC-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12222223566777888888899999888776665 4568999998776655555555555555555554 445554 Q ss_pred HhccC------C-Cc----------------------ceE-----EEEe--CCCCCCCHHHHHHHHHHHhccCChH---H Q lcl|NC_021326. 353 HFDIK------G-EH----------------------KDV-----DISF--NYNKVANTELQVQTAQQSMGIVSHE---T 393 (445) Q Consensus 353 ~~~~~------~-~~----------------------~~i-----~v~f--~~~~p~d~~~~~~~~~~~~g~~s~e---t 393 (445) +++.. + +. .++ +|.. .+..+.-..+..+.++++.+.++.. + T Consensus 482 ~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r~~~~~~l~qll~~~~~~~~~~ 561 (708) T protein:vir:17 482 VYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPADPMR 561 (708) T ss_pred HcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCchhHHHHHHHHHHHHHHhcCCccchh Confidence 44211 0 00 011 1111 2222222333344455544332211 1 Q ss_pred ------HHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccc-------------cCCCCCC-----CCCCCCCCcCCC Q lcl|NC_021326. 394 ------VLENHPFVEDLQAELERIEQEQMEYNKQLPNLD-------------DGGADGA-----QQKERSNDKQSE 445 (445) Q Consensus 394 ------~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~-------------~~~~~~~-----~~~~~~~d~~~~ 445 (445) +++.++ .+..++-.++|.+.........+... ....... .+....+.+... T Consensus 562 ~~~~~l~l~~~D-~p~~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~q~q~~~~~~eaqa~~~~~qAe~~k 636 (708) T protein:vir:17 562 PAIQGIILDNID-GEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQK 636 (708) T ss_pred HHHHHHHHHhcC-CCChHHHHHHHHHHhhccccccCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 223322 22333333444332211000000000 0000000 000000000000 No 118 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=99.24 E-value=2e-10 Score=73.77 Aligned_cols=437 Identities=11% Similarity=0.096 Sum_probs=193.1 Q ss_pred ChHHHHHHHHHHH----HHHHHHHHHhcCCCcccc--cccccccccc-ccccccccccccchHHHHHHHHHhhhhc---- Q lcl|NC_021326. 1 MIVRYIKQHLEKL----PEISIGQEYYEQRPDIVK--EPKPVDATGA-VDPLKPDDRMITNFHANLVDQKVSYIVG---- 69 (445) Q Consensus 1 ~l~~~i~~~~~~~----~~~~~~~~yy~G~~~i~~--~~~~~~~~~~-~~~~~~~~ri~~n~~~~iv~~~~~~l~g---- 69 (445) .|.+.++..++.. ++++.+++||........ .+........ .... .+++..+.+...++.+++-+++ T Consensus 28 ~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--r~ki~~~~~~~~~~~l~s~Lm~~~~p 105 (641) T protein:vir:94 28 VVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADW--RHRINTGHTFEVVETLVAYFKGATFP 105 (641) T ss_pred HHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcc--cccccchhHHHHHHHHhhHHhhhhcC Confidence 3555555554322 456667777765432111 0111111111 1111 2367888888888888766644 Q ss_pred cC--eeec---cCchHHHHHHHHHh-----ccCHHHHHHHHHHHHHhcCeEEEEEEECC------------C-------- Q lcl|NC_021326. 70 KP--IAFK---HTDDEVIKRIDEVL-----GNRFDDKLHSVLTGASNKGIEWLHPYLDE------------E-------- 119 (445) Q Consensus 70 ~~--~~~~---~~d~~~~~~l~~~~-----~n~~~~~~~~~~~~~~~~G~~~~~v~~d~------------~-------- 119 (445) .+ +.+. .++.+..+.++.++ .+++.....+..++++.+|.+++.++++. . T Consensus 106 ~~~wf~~~p~~~ed~~~A~~~~~~~~~~l~~~~~~~~~~~~~~d~~~~g~~iv~~~w~~~~~~~~~~~~~~~~~~~~~~~ 185 (641) T protein:vir:94 106 SDDWFDLKGMVPELADAARVVKQLTKTKLEAASIRDIFETYVRNLVLYGVSTYRLGWDTSMERQFKRTFVETGDIFGGWE 185 (641) T ss_pred CCceEEEecCCCChHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHhhcCceEEEeehhhHHHHhhhhhcccchhhccccc Confidence 22 3332 24445445555544 35667777888999999999998886531 1 Q ss_pred --------CcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeec-------c--------------e----------eEE Q lcl|NC_021326. 120 --------GEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN-------E--------------T----------KVE 160 (445) Q Consensus 120 --------g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~-------~--------------~----------~~~ 160 (445) ..+++..++|.+++ +|++....-..++++..... + . .+. T Consensus 186 ~~~v~~~~~~~r~~~v~~~di~--~dps~~~~~~~f~~~r~t~~t~~~l~~eg~~~~d~v~~~~~~~~~~~~~d~~~d~~ 263 (641) T protein:vir:94 186 DVAVNRQRSELRIEPLSPYDVW--LDTSGGKNTGTFVRLRHTREELHELVTSGYYDLDLTQVEQYVDYKFADPDTPKDVN 263 (641) T ss_pred ccceecccceeeEEecchhhee--ecCCCCcccccceehhhhHHHHHHHHhcCCCChhhcchhhcccccccccccccccc Confidence 23456666776654 34332221111222221110 0 0 000 Q ss_pred EEecceEEEEEEec-----c-eeeecccccccccccccccc-cccccceEEecC-----CCCcCccHHHHHHHHHHHHHH Q lcl|NC_021326. 161 YWDKITVNYYVYEN-----G-SLIPDYSNNLENSKTHFSTG-SWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRR 228 (445) Q Consensus 161 ~~~~~~~~~~~~~~-----~-~~~~~~~~~~~~~~~~~~~~-~~g~iPvv~~~n-----~~~g~s~~~~v~~lid~~~~~ 228 (445) ..+...+..+...+ + ..........+........+ .+...|++.++. .-+|.|....+.+.+..+|.. T Consensus 264 ~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g~~il~~~~~~~~d~~Pf~~~r~~~~~~~~YG~gp~~~~l~dqk~ln~l 343 (641) T protein:vir:94 264 GTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGSPFVTTTLLPDRDSVYGMSVLHPNLGALHVLNVL 343 (641) T ss_pred cccccccceeeeeeeeccCCCceeeEEEEEeCCEEeecccccccCcCCeEEecceecCCcccCCChHHHHHHHHHHHHHH Confidence 00000001110000 0 00000000011111111122 245668876543 457999999999999999999 Q ss_pred HHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCceeeccCCCceeeEecc-CChHHHHHHHHHHHHHHHHHhCcccc Q lcl|NC_021326. 229 LSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVE-VPVENSKKYLDELYQKIMLFGQAVDF 307 (445) Q Consensus 229 ~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~i~~l~~~i~~~s~~p~~ 307 (445) ...+.+.+.....|.+.+.....-..... ....++++..+..++++++... .+.......++.+...+....++..+ T Consensus 344 ~r~~ld~~~~~~~p~~~~~~~~~~~~~~l--~~~PG~ii~~~~~~~v~pl~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~ 421 (641) T protein:vir:94 344 TNGRLDNLVLHINKMWTLVEDGILKREDV--KAKPGAVFKVAQHGSLQPIDMGRQDFVVTYQEAQVQESSVYRNTSTGPL 421 (641) T ss_pred HHHHHHHHHHHhCCeeeecccccccccee--eccCCcceeeCCCCcceeecCCccccchhHHHHHHHHHHHHHhhhhhhh Confidence 99999999999999886543211111111 1234455666666777877543 33344445566666555554444332 Q ss_pred ccc---cccCcchHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhccCC---------------------Ccce Q lcl|NC_021326. 308 SSD---KFGSAPSGVALEFLYTNLNLKADKLARKAKV-AIQELLWFVFEHFDIKG---------------------EHKD 362 (445) Q Consensus 308 ~~~---~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~-~l~~~~~~~~~~~~~~~---------------------~~~~ 362 (445) ..+ ..+.+.+|+++..+......+.....+.|.. ++..+++-++.++.... ...+ T Consensus 422 ~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~ 501 (641) T protein:vir:94 422 IGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEY 501 (641) T ss_pred hcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCccc Confidence 221 1122457777777777777777777777763 55555554444332111 1122 Q ss_pred EEEEeCCCCCCCHHH---HHHHHHHHhc------cCCh-----------HHHHHhCCCCCCHH-------HHHHHHH--- Q lcl|NC_021326. 363 VDISFNYNKVANTEL---QVQTAQQSMG------IVSH-----------ETVLENHPFVEDLQ-------AELERIE--- 412 (445) Q Consensus 363 i~v~f~~~~p~d~~~---~~~~~~~~~g------~~s~-----------et~l~~l~~~~d~~-------~E~~ri~--- 412 (445) +...|.- .|....+ .++.+..+.+ ..|. +.+++.++. .++. .+-+... T Consensus 502 L~~~~~i-v~l~~~q~~~~~~~i~~l~~~~~~~a~~P~v~d~~d~~~~~~~~~~~~g~-~~p~~~ir~~~~~~~~~~~~~ 579 (641) T protein:vir:94 502 LHYPYKF-LALGANYVVERERMVTDLLQLLDISGRVPQIGQSLDYALILEDLLRQMRF-TDPMRYIKKAEAPPAAPPIAP 579 (641) T ss_pred eeeeeeE-eecchhHHHHHHHHHHHHHHHHHHhhcChhhhhcCCHHHHHHHHHHHhCC-CCchhhccCccCchhHHHHHH Confidence 3333221 2333222 2333332222 1221 112222221 1111 0111111 Q ss_pred HHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 413 QEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 413 ~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) +|.++..........+...+.....--+.+.+. T Consensus 580 ~~~q~~~~~~a~~~~~~~~~~a~~~~~~~~~~~ 612 (641) T protein:vir:94 580 AEPGALPPEMMNSVGGGLNDQAIAGMTPEDVSD 612 (641) T ss_pred HHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHHH Confidence 111111111111001111110000000111111 No 119 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=99.13 E-value=1.6e-09 Score=68.82 Aligned_cols=428 Identities=8% Similarity=-0.023 Sum_probs=199.6 Q ss_pred ChHHHHHHHH----HHHHHHHHHHHHhcCCCccccccccc--cccccccccccccccccchHHHHHHHHHhhhhcc--C- Q lcl|NC_021326. 1 MIVRYIKQHL----EKLPEISIGQEYYEQRPDIVKEPKPV--DATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P- 71 (445) Q Consensus 1 ~l~~~i~~~~----~~~~~~~~~~~yy~G~~~i~~~~~~~--~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~- 71 (445) +++++.+++. +|.+....+.++|+=- +|.+.... .......-.+.+.++..+-+...++.+++.|++- | T Consensus 8 ~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~--lP~~~~~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAs~l~~~ltpp 85 (549) T protein:vir:10 8 ILQALNADHGRMKEKRQSYEAVWNDVIDYL--MPRLDKFGQLPRPDSEKGRERSQKMFDSTAPLALRNFVAAMDSMITPA 85 (549) T ss_pred HHHHHHHHHHHHHHHhhhHHHHHHHHHHHh--ccccccccccCCCCCCcccccccccccchHHHHHHHHHHHHHhhccCC Confidence 5555555443 2333334444444321 11111000 0000001111123456677788888888777643 2 Q ss_pred ----eeeccCch------HHHHHHH-------HHh---ccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccce Q lcl|NC_021326. 72 ----IAFKHTDD------EVIKRID-------EVL---GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQ 131 (445) Q Consensus 72 ----~~~~~~d~------~~~~~l~-------~~~---~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~ 131 (445) +++...++ ++...+. ..+ ..||.....++.++..++|.+.+++..+..+.++++.++-.+ T Consensus 86 ~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~~~~~~~f~~~pl~~ 165 (549) T protein:vir:10 86 TQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDVGKGIVYRNVPMQR 165 (549) T ss_pred CCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeecCCCeeEEEEEEcCe Confidence 23333332 2222222 211 357888899999999999999999887776777888888888 Q ss_pred eEEEEcCCCCCceEEEEEEEeee------------------------cceeEEEEecceEEEEEEeccee---------- Q lcl|NC_021326. 132 GIPIWTDKEHEELEAFIRMYKLE------------------------NETKVEYWDKITVNYYVYENGSL---------- 177 (445) Q Consensus 132 ~~~v~d~~~~~~~~~~v~~~~~~------------------------~~~~~~~~~~~~~~~~~~~~~~~---------- 177 (445) ++.--|. .+++..++|.++.. ....++++. .+.-.... T Consensus 166 ~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~------~V~pr~~~~~~~~~~~~~ 237 (549) T protein:vir:10 166 LWFAENN--SGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYH------AVEPRADRDPRKLDGRNM 237 (549) T ss_pred EEEeeCC--CCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEE------EeecCCCCCccccccccC Confidence 6654443 57777776654321 111122221 11100000 Q ss_pred ---eecccccccccccccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecC Q lcl|NC_021326. 178 ---IPDYSNNLENSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNY 249 (445) Q Consensus 178 ---~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~ 249 (445) .......... -....+|..+|++.+. ++.+|+|-.....+-+..+|...-......+....|.+.+.-. T Consensus 238 pf~sv~~e~~~~~---il~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~ 314 (549) T protein:vir:10 238 QFASYWLDEGRDR---IVQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPPLLANED 314 (549) T ss_pred ceEEEEEEecCCE---eeccCCcccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccc Confidence 0000000010 1112344556765543 3568999999999999999999888999999999998876321 Q ss_pred CcccchhHHHhhhhCcee--ecc--CCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHH Q lcl|NC_021326. 250 DDQELPEFKRLLRYYGAI--KVS--DNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLY 325 (445) Q Consensus 250 ~~~~~~~~~~~~~~~~~~--~~~--~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~ 325 (445) ...+..+ +..++.. ... ++..+..+....+.......++.++..|...-....+.....+...|++.+.... T Consensus 315 g~~~~~~----l~pgg~~~~~~~~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~~TAtEV~~r~ 390 (549) T protein:vir:10 315 GVLDGFD----LRSGALNWGGLNDKGEEMVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDMTATEVLQRA 390 (549) T ss_pred cccccce----eccCCccccccCCCCccceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCccHHHHHHHH Confidence 1111111 1222211 112 2233555544456677777788887766664432221112223456777776544 Q ss_pred HHHHHHHH----HH-HHHHHHHHHHHHHHHHHHhccCC-------CcceEEEEeCCCCCCCH-HHHHH-------HHHHH Q lcl|NC_021326. 326 TNLNLKAD----KL-ARKAKVAIQELLWFVFEHFDIKG-------EHKDVDISFNYNKVANT-ELQVQ-------TAQQS 385 (445) Q Consensus 326 ~~l~~k~~----~~-~~~~~~~l~~~~~~~~~~~~~~~-------~~~~i~v~f~~~~p~d~-~~~~~-------~~~~~ 385 (445) ..+..... +. ...+.+-+.+++.++.+..-.+. ....++|++..++-+.. .+.++ .+..+ T Consensus 391 ~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~~~~~~~i~yis~La~aq~~~~~~~i~~~~~~~~~l 470 (549) T protein:vir:10 391 QEKGVLLAPTLGRTQSELLGPMIAREVDILAEAGQLPDMPQELIDAGADVDVEYDSPLNKAMRAGEGAAILQWLQQLGIV 470 (549) T ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhcCCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44333322 21 23444555666665555322221 23456666654443311 11111 11222 Q ss_pred hcc-------CChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHhh--hccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 386 MGI-------VSHETVLENH---PFVE----DLQAELERIEQEQMEYNKQL--PNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 386 ~g~-------~s~et~l~~l---~~~~----d~~~E~~ri~~E~~~~~~~~--~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) +++ +....++..+ -+++ -.++|++++.++.++..+.. .......+..++.-.+..+..+. T Consensus 471 aq~~Pe~ld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~~~~~qqq~~~~~~~a~~a~~~a~~~~~~~ta~~~ 546 (549) T protein:vir:10 471 SQFDPAAAKVPNGARIARLLADYGGVPVEAMSTDEELQAQQAAEAQAAQMQQMLAAAPVAAGAIKDLSDAQTAAQT 546 (549) T ss_pred hccChhHHhcCCHHHHHHHHHHhcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCcc Confidence 222 2223333322 1222 13466655554332211111 11011111111111222222222 No 120 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=99.01 E-value=5.7e-09 Score=65.76 Aligned_cols=429 Identities=10% Similarity=0.019 Sum_probs=196.2 Q ss_pred ChHHHHHHH---H-HHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C--- Q lcl|NC_021326. 1 MIVRYIKQH---L-EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P--- 71 (445) Q Consensus 1 ~l~~~i~~~---~-~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~--- 71 (445) +.+++.+++ + +|.+-...+.++|+= -+|.+....... .....+-+.++..+-+...++++++.|++- | T Consensus 5 ~~~~l~~r~~~l~~~R~~~e~~w~e~~~~--~lP~~~~~~~~~-~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltpp~~ 81 (559) T protein:vir:95 5 TKERLNKQFAQLESERQSFEPHWRELSDY--INPRGSRFLTSE-VNRNDRRNTRIIDSTGTMAARTLASGMMSGITSPAR 81 (559) T ss_pred hHHHHHHHHHHHHHHhhHHHHHHHHHHHH--hccccCCcCCCC-CCcccccccccccchHHHHHHHHHHHHHHhhcCCCC Confidence 333333332 2 222222333333321 111111111000 001111233566677888888888777643 2 Q ss_pred --eeeccCc------hHHHHHHH--------HHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEE Q lcl|NC_021326. 72 --IAFKHTD------DEVIKRID--------EVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPI 135 (445) Q Consensus 72 --~~~~~~d------~~~~~~l~--------~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v 135 (445) +++...+ .++.+.|. .+..+||...+.++.++..++|.+.+++..+..+.++++.++..+++.. T Consensus 82 ~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~d~~~~~r~~~~~l~~~~v~ 161 (559) T protein:vir:95 82 PWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLDDDEDIIRTMPFPIGSYYLA 161 (559) T ss_pred cccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeecCCCceeEEEEeecCeEEEe Confidence 2333322 23333332 2224678888999999999999999988777666788999999887655 Q ss_pred EcCCCCCceEEEEEEEeeec-------------------------ceeEEEEecceEEEEEEecceee------------ Q lcl|NC_021326. 136 WTDKEHEELEAFIRMYKLEN-------------------------ETKVEYWDKITVNYYVYENGSLI------------ 178 (445) Q Consensus 136 ~d~~~~~~~~~~v~~~~~~~-------------------------~~~~~~~~~~~~~~~~~~~~~~~------------ 178 (445) -|. .+++..++|.++..- ...++++ +.++...... T Consensus 162 ~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~------~~V~pr~~~~~~~~~~~~~pf~ 233 (559) T protein:vir:95 162 NSP--RGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVM------HSVYPNIDRDTSKLDSKNKPFK 233 (559) T ss_pred eCC--CCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEE------EEEeccccccccccccccceEE Confidence 443 577777776543211 0011211 1111000000 Q ss_pred -ecccccccccccccccccccccceEEec-----CCCCcCcc-HHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCc Q lcl|NC_021326. 179 -PDYSNNLENSKTHFSTGSWGKIPFIPFK-----NNDLEISD-IFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDD 251 (445) Q Consensus 179 -~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~-~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~ 251 (445) ..+....+.. .-....+|..+|++.++ +..+|+|. .....+-+..+|...-......+....|.+.+.+... T Consensus 234 s~~~e~~~~~~-~~l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~ 312 (559) T protein:vir:95 234 SVYYEVGGDND-KLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTSLK 312 (559) T ss_pred EEEEEecCCCc-eeeecCCcccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCceecccccc Confidence 0000000000 00112334456665443 34578985 8889999999999988899999999999877643221 Q ss_pred ccchhHHHhhhhCceeeccCCC---ceeeE-eccCChHHHHHHHHHHHHHHHHHhCccc--cccccccCcchHHHHHHHH Q lcl|NC_021326. 252 QELPEFKRLLRYYGAIKVSDNG---GVDTI-QVEVPVENSKKYLDELYQKIMLFGQAVD--FSSDKFGSAPSGVALEFLY 325 (445) Q Consensus 252 ~~~~~~~~~~~~~~~~~~~~~~---~~~~l-~~~~~~~~~~~~i~~l~~~i~~~s~~p~--~~~~~~~~~~Sg~Ai~~~~ 325 (445) .. . ..+..+++......+ .++.+ +.+.+...+...++.++..|...-.... ......+...|++.+.... T Consensus 313 ~~---~-~~l~pgg~~~~~~~~~~~~i~p~~~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~rvTAtEV~~r~ 388 (559) T protein:vir:95 313 NQ---R-ASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRSMPVEAVIEMK 388 (559) T ss_pred cc---c-eeeeccceeeeCCCCCcccceeecccccchHHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCCCCHHHHHHHH Confidence 11 1 112333333333222 23333 2234455556667777666655443211 1112233456777776654 Q ss_pred HHHHHHHHH-----HHHHHHHHHHHHHHHHHHHhccCC-----CcceEEEEeCCCCCCCH-HHHHH-------HHHHHhc Q lcl|NC_021326. 326 TNLNLKADK-----LARKAKVAIQELLWFVFEHFDIKG-----EHKDVDISFNYNKVANT-ELQVQ-------TAQQSMG 387 (445) Q Consensus 326 ~~l~~k~~~-----~~~~~~~~l~~~~~~~~~~~~~~~-----~~~~i~v~f~~~~p~d~-~~~~~-------~~~~~~g 387 (445) ..+.....- ....+.+-+.+++.++.+..-.+. ...+++|++..++.... ...++ .+..+++ T Consensus 389 ~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~i~v~~is~La~aqk~~~~~~i~~~~~~~~~laq 468 (559) T protein:vir:95 389 EEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEGMPLKVEYISVMAQAQKSIGLSSLASTVNFIGQLAQ 468 (559) T ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccCcceEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 444333222 122344455555665555322111 23456777755443311 11111 2222222 Q ss_pred c-------CChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHhhhcccc--CCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 388 I-------VSHETVLENH---PFVE----DLQAELERIEQEQMEYNKQLPNLDD--GGADGAQQKERSNDKQSE 445 (445) Q Consensus 388 ~-------~s~et~l~~l---~~~~----d~~~E~~ri~~E~~~~~~~~~~~~~--~~~~~~~~~~~~~d~~~~ 445 (445) + +....++..+ -+++ -.++|++++++++++.++.++...- ..+...+.-.+.+....+ T Consensus 469 ~~Pevld~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~rqqr~~~qq~~q~~~~~~~aa~~~~~~~~~~~~~~~ 542 (559) T protein:vir:95 469 VKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQQQMMAMGMAAAQGVKTLSEAKTSDPS 542 (559) T ss_pred cChhhhhcCCHHHHHHHHHHHhCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCCChh Confidence 2 2233333322 1222 1246666665554433322111000 000111111111111111 No 121 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=98.93 E-value=1.3e-08 Score=63.77 Aligned_cols=432 Identities=9% Similarity=0.037 Sum_probs=194.9 Q ss_pred ChHHHHHHHHH-HH---HHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C--- Q lcl|NC_021326. 1 MIVRYIKQHLE-KL---PEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P--- 71 (445) Q Consensus 1 ~l~~~i~~~~~-~~---~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~--- 71 (445) -|++..+..+. |. .+++.+.+|. +|.+........ ....+-..++..+-+...++++++.|++- | T Consensus 8 ~l~~r~~~l~~~R~~~e~~w~e~~~~~-----lP~~~~~~~~~~-~~~~~~~~~~~dst~~~a~~~Las~l~~~ltpp~~ 81 (556) T protein:vir:73 8 RLLKQLAQLKNERTSFESHWLDLSDFI-----NPRGSRFLTSDV-NRDDRRNTKIVDPTGSMAQRILSSGMMSGITSPAR 81 (556) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHh-----ccccCCcCCCCC-CcchhhcCccccchHHHHHHHHHHHHHHhhcCCCC Confidence 22222333332 32 3334444442 111111110000 00011123566677888888888777543 2 Q ss_pred --eeeccCc------hHHHHHHH-------H-HhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEE Q lcl|NC_021326. 72 --IAFKHTD------DEVIKRID-------E-VLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPI 135 (445) Q Consensus 72 --~~~~~~d------~~~~~~l~-------~-~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v 135 (445) +++...+ .++.+.|. . +..+||...+.++.++..++|.+.+++..+..+-++++.++..+++.- T Consensus 82 ~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~r~~~~~l~~~~~~ 161 (556) T protein:vir:73 82 PWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVMEDDQDVIRTMPFPIGSYYLA 161 (556) T ss_pred cccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeeecCCceEEEEEeecceeEEe Confidence 2333322 22333322 2 223678888999999999999999998888777788889988887654 Q ss_pred EcCCCCCceEEEEEEEeeecc--------e-----------eEEEEecceEEEEEEecceee-------------ecccc Q lcl|NC_021326. 136 WTDKEHEELEAFIRMYKLENE--------T-----------KVEYWDKITVNYYVYENGSLI-------------PDYSN 183 (445) Q Consensus 136 ~d~~~~~~~~~~v~~~~~~~~--------~-----------~~~~~~~~~~~~~~~~~~~~~-------------~~~~~ 183 (445) -| ..+++..++|.++..-. . ....-....+.+.++...... ..+.. T Consensus 162 ~d--~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~V~pr~~~~~~~~~~~~~p~~s~~~~~ 239 (556) T protein:vir:73 162 NS--PRGSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHCITPNVNRDSGKMDSKNKPYRSVYFES 239 (556) T ss_pred eC--CCCCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEEEeccccccccccCcccceEEEEEEEe Confidence 44 35777777765543210 0 000000011111111100000 00000 Q ss_pred cccccccccccccccccceEEec-----CCCCcCcc-HHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhH Q lcl|NC_021326. 184 NLENSKTHFSTGSWGKIPFIPFK-----NNDLEISD-IFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEF 257 (445) Q Consensus 184 ~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~-~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~ 257 (445) ..+.. .-...-+|..+|++.+. ++.+|+|. .....+-+..+|...-......+....|.+.+....... . T Consensus 240 ~~~~~-~vl~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~~---~ 315 (556) T protein:vir:73 240 GGDSD-KLLRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPPMVAPTSLKNQ---R 315 (556) T ss_pred cCCCc-eecccCCcccCCceeeeeeecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCceecccccccc---c Confidence 00000 00112234556766543 35679995 888999999999998888999999999987764422111 0 Q ss_pred HHhhhhCceeec--cCC-CceeeEe-ccCChHHHHHHHHHHHHHHHHHhCccc--cccccccCcchHHHHHHHHHHHHHH Q lcl|NC_021326. 258 KRLLRYYGAIKV--SDN-GGVDTIQ-VEVPVENSKKYLDELYQKIMLFGQAVD--FSSDKFGSAPSGVALEFLYTNLNLK 331 (445) Q Consensus 258 ~~~~~~~~~~~~--~~~-~~~~~l~-~~~~~~~~~~~i~~l~~~i~~~s~~p~--~~~~~~~~~~Sg~Ai~~~~~~l~~k 331 (445) ..+..+++... +.+ .+++.+. ...+.......++.++..|...-.... ......+...|++.+......+... T Consensus 316 -~~~~pgg~~~~~~~~~~~~i~p~~~~~~d~~~~~~~i~~~~~rI~~af~~d~~~~l~~~~~~r~TAtEv~~r~~E~~~~ 394 (556) T protein:vir:73 316 -VSLLPGDVTYLDVISGQDGFKPAYLVNPNTADLLADIQDTRQTINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLM 394 (556) T ss_pred -eeeccCccccccCCCCccceeeeccccccHHHHHHHHHHHHHHHHHHhhcchhhhhccCCCCCccHHHHHHHHHHHHHH Confidence 11222332222 222 2345442 334556666777777777754433221 1111223456777776554444333 Q ss_pred HHH----H-HHHHHHHHHHHHHHHHHHhccCC-----CcceEEEEeCCCCCCCHH-HHHH-------HHHHHhcc----- Q lcl|NC_021326. 332 ADK----L-ARKAKVAIQELLWFVFEHFDIKG-----EHKDVDISFNYNKVANTE-LQVQ-------TAQQSMGI----- 388 (445) Q Consensus 332 ~~~----~-~~~~~~~l~~~~~~~~~~~~~~~-----~~~~i~v~f~~~~p~d~~-~~~~-------~~~~~~g~----- 388 (445) ..- . ...+.+-+.+++.++.+..-.+. ...+++|++..++-.... ..++ .+..++++ T Consensus 395 LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aqk~~~~~~i~~~~~~~~~laq~~Pe~~ 474 (556) T protein:vir:73 395 LGPVLERLNDEALNPLIDRVFSIMARKNMLPEPPDVLQGMPLRIEYISVMAQAQKSIGLTSLSQTVGFIGQLAQFKPEAL 474 (556) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhH Confidence 222 1 22334445555555554322111 234577777554432111 1111 12222222 Q ss_pred --CChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHhhhcc--ccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 389 --VSHETVLENH---PFVE----DLQAELERIEQEQMEYNKQLPNL--DDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 389 --~s~et~l~~l---~~~~----d~~~E~~ri~~E~~~~~~~~~~~--~~~~~~~~~~~~~~~d~~~~ 445 (445) +..+.++..+ -+++ -.++|++.+.+++++..+.++.. ....+.+++.-.+.+.+..+ T Consensus 475 d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~r~~~qq~~~~~~~~~~a~~~~~~~~~~~~~~~~ 542 (556) T protein:vir:73 475 DKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIREERAKQAQAAQAMAMGQAAAQGAKTLSETQTSDPS 542 (556) T ss_pred hcCCHHHHHHHHHHHcCCChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCHH Confidence 2233333322 1222 12455555544432222111100 00000000000111111111 No 122 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=98.91 E-value=1.6e-08 Score=63.31 Aligned_cols=428 Identities=9% Similarity=0.004 Sum_probs=198.5 Q ss_pred ChHHHHHHHHHH----HHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C--- Q lcl|NC_021326. 1 MIVRYIKQHLEK----LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P--- 71 (445) Q Consensus 1 ~l~~~i~~~~~~----~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~--- 71 (445) ..++..+.++.+ ..+++.+.+|..- ..-..... ...+...++..+-+...++++++.|++- | T Consensus 13 ~~k~r~~~l~~~R~~~e~~w~e~~~~~lP-----~~~~~~~~----~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~ 83 (535) T protein:vir:15 13 GAKATYDRLTNDRRAYETRAENCAQYTIP-----SLFPKESD----NESTDYTTPWQAVGARGLNNLASKLMLALFPMQS 83 (535) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHhcc-----cccCCCCC----cccccccccccccHHHHHHHHHHHHHHhhcCCCc Confidence 344444444433 3444444444322 11000000 0111112455566777788887776542 2 Q ss_pred -eeeccCch-------------HHHHHHH--------HHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEcc Q lcl|NC_021326. 72 -IAFKHTDD-------------EVIKRID--------EVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPA 129 (445) Q Consensus 72 -~~~~~~d~-------------~~~~~l~--------~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p 129 (445) +++...+. ++...+. .+..+||...+.++.++..++|.+.+++..+..+.++++.++- T Consensus 84 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl 163 (535) T protein:vir:15 84 WMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYRL 163 (535) T ss_pred ccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCceeeEEEEc Confidence 22222221 1222222 2234688999999999999999998888777667778888877 Q ss_pred ceeEEEEcCCCCCceEEEEEEEeee--------------------cceeEEEEecceEEEEEEecceeeecccccccccc Q lcl|NC_021326. 130 EQGIPIWTDKEHEELEAFIRMYKLE--------------------NETKVEYWDKITVNYYVYENGSLIPDYSNNLENSK 189 (445) Q Consensus 130 ~~~~~v~d~~~~~~~~~~v~~~~~~--------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 189 (445) .+++..-| ..+++...+|.++.. ....+++|+... ...+................ T Consensus 164 ~~~~v~~d--~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~---~~~~~~~~~~~~e~~g~~~~ 238 (535) T protein:vir:15 164 SSYVVQRD--AYGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVY---LDEESGDYLKYEEVEDVEID 238 (535) T ss_pred CeeEEeeC--CCCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEE---EecCCCcEEEEEEeeCcccc Confidence 66554433 357777777665431 111223332111 11111111111000111111 Q ss_pred cccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhC Q lcl|NC_021326. 190 THFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYY 264 (445) Q Consensus 190 ~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 264 (445) ......++..+|++.+. ++.+|+|-..+..+-+..+|...-......+....|.+.+.-.......... .... T Consensus 239 ~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~l~--~~~~ 316 (535) T protein:vir:15 239 GSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLT--KAQT 316 (535) T ss_pred ccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccchhcc--cCCc Confidence 11233456667776553 3568999999999999999999888999999999998765311111111111 1122 Q ss_pred ceeeccCCCceeeEecc--CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHH-----HHH Q lcl|NC_021326. 265 GAIKVSDNGGVDTIQVE--VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADK-----LAR 337 (445) Q Consensus 265 ~~~~~~~~~~~~~l~~~--~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~-----~~~ 337 (445) +.+..+..++++.+... .+.......++.++..|...-.... .....+...+|+.+......+.....- ... T Consensus 317 g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E 395 (535) T protein:vir:15 317 GDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNS-AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQE 395 (535) T ss_pred eeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh-cccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHH Confidence 33444455666666433 4667778888888877765432211 111223456777776543333333221 112 Q ss_pred HHHHHHHHHHHHHHHHhccCC-CcceEEEEeCCCCCCC-HHHHHHHHH----HHhcc--------CChHHHHHhC---CC Q lcl|NC_021326. 338 KAKVAIQELLWFVFEHFDIKG-EHKDVDISFNYNKVAN-TELQVQTAQ----QSMGI--------VSHETVLENH---PF 400 (445) Q Consensus 338 ~~~~~l~~~~~~~~~~~~~~~-~~~~i~v~f~~~~p~d-~~~~~~~~~----~~~g~--------~s~et~l~~l---~~ 400 (445) .+.+-+.+++.++.+..-.+. ....++++|.-++... ..+.++.+. .++++ +....++..+ -+ T Consensus 396 ll~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~G 475 (535) T protein:vir:15 396 LQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIG 475 (535) T ss_pred HHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcC Confidence 223333333443332211111 1234667765444321 111222222 22221 2223333222 12 Q ss_pred CC-----CHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 401 VE-----DLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 401 ~~-----d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++ -.++|++++.+++.+.....+.....+...+.+....++.-+- T Consensus 476 vp~~~i~~~~eev~~~~~q~~~~~~~~~~a~~~g~~~~~~~~~~p~~~~~ 525 (535) T protein:vir:15 476 IDTSGILLTDEQKQALMMQDAAQTGIENAAATGGAGVGALATSSPEAMQG 525 (535) T ss_pred CChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccchhccChHHHHH Confidence 22 2346666655554433322222111111111111111111000 No 123 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=98.86 E-value=2.5e-08 Score=62.25 Aligned_cols=379 Identities=12% Similarity=0.084 Sum_probs=157.4 Q ss_pred cccccccccccchHHHHHHHHHhhhhccCeeeccC--------chHHHHHHHHHhc----c-----------CHHHHHHH Q lcl|NC_021326. 43 DPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHT--------DDEVIKRIDEVLG----N-----------RFDDKLHS 99 (445) Q Consensus 43 ~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~--------d~~~~~~l~~~~~----n-----------~~~~~~~~ 99 (445) -+..+. ..+....+|+..++.+.+-|+.+... .....+.+..++. | .+...+.. T Consensus 1 l~~l~~---~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~ 77 (467) T protein:vir:31 1 MAELLE---HNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQT 77 (467) T ss_pred Chhhhh---cCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHH Confidence 111111 24678889999999998888776321 1122233333332 2 12344556 Q ss_pred HHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceee Q lcl|NC_021326. 100 VLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLI 178 (445) Q Consensus 100 ~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (445) +..+.+.+|.+|+.+..+..|++ .+..++|..+.+..+.. . +..........+......+.... .+... T Consensus 78 ~~~~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~---~------~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 147 (467) T protein:vir:31 78 AWTDYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDER---G------FVQLLEEKEKYFGVAGDRYQTNG-NGDLD 147 (467) T ss_pred HHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecc---e------eEeecCCceeeEEeccccceeec-cccee Confidence 77889999999999989988886 47888888877654431 0 11111111000000000000000 00000 Q ss_pred ecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEE--ecCCc Q lcl|NC_021326. 179 PDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL--TNYDD 251 (445) Q Consensus 179 ~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~--~g~~~ 251 (445) ............ ....+..=-|+|++. ...|.|.+......++....+..-....+...+.|-.++ +|... T Consensus 148 ~~~~~~~~~~~~--~~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~l 225 (467) T protein:vir:31 148 PVFVDADDGSTG--TSVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKGAEL 225 (467) T ss_pred eeeeeecccccc--ceeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCC Confidence 000000000000 000111111455542 235777777665555544433333333445555665443 45322 Q ss_pred --ccchhHHHhhhh-------------------CceeeccCCCceeeEe-----cc---CChHHHHHHHHHHHHHHHHHh Q lcl|NC_021326. 252 --QELPEFKRLLRY-------------------YGAIKVSDNGGVDTIQ-----VE---VPVENSKKYLDELYQKIMLFG 302 (445) Q Consensus 252 --~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~l~-----~~---~~~~~~~~~i~~l~~~i~~~s 302 (445) +.....+..+.. ...+.++.+.+.+.+. .. .....+.+..+...+.|...- T Consensus 226 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~f 305 (467) T protein:vir:31 226 TEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVH 305 (467) T ss_pred CHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHHHHHHHHh Confidence 111112211110 1122233333332221 11 123445666677777788888 Q ss_pred CccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-Hhc--cCCCcceEEEEeCCCCCCCHHHHH Q lcl|NC_021326. 303 QAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFE-HFD--IKGEHKDVDISFNYNKVANTELQV 379 (445) Q Consensus 303 ~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~-~~~--~~~~~~~i~v~f~~~~p~d~~~~~ 379 (445) ++|....+-..++..+..++......... .+.+-++.+-..+-. ++. .......+++.+...+..|..+.+ T Consensus 306 gVpp~~lG~~~~~~~~s~~e~~~~~f~~~------~l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~l~~~d~~~~~ 379 (467) T protein:vir:31 306 DVPPVIAGVVESGAFSTDAEEQRKEFAEE------TIQPKQHDFGELLYELVHKQGLDAPDWTIEFELAKPDTKLQDVEI 379 (467) T ss_pred CCCHHHcccCCCCCcccCHHHHHHHHHHH------HHHHHHHHHHHHHHHhhcchhhccCCceEEEecchhhccCHHHHH Confidence 88854332211111111121111111111 111111111111111 111 111223466777788889999999 Q ss_pred HHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 380 QTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 380 ~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) +++.++ .|+++.-.++++++.-+-++.++.-. .--......+........+ +..+..+++.++ T Consensus 380 ~~~~~~~~~G~~T~NE~R~~~Gl~pi~d~~~~~~--~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 444 (467) T protein:vir:31 380 ASQRVQAMQGLLTVNELRDEFGFEPFPEEHVYGG--ETLVAEVTGGSGPGGGIGD-QIEQLVEDRADE 444 (467) T ss_pred HHHHHHHhCCCcCHHHHHHHhCCCCCCcccccCC--cccccccccccCCCCcccC-cCCCCCCCcccc Confidence 988876 58999999999886522111111000 0000000000111111111 111111111111 No 124 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=98.82 E-value=3.5e-08 Score=61.48 Aligned_cols=429 Identities=9% Similarity=-0.013 Sum_probs=196.6 Q ss_pred ChHHHH---HHHH-HHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C--- Q lcl|NC_021326. 1 MIVRYI---KQHL-EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P--- 71 (445) Q Consensus 1 ~l~~~i---~~~~-~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~--- 71 (445) ..+++. +..+ +|.+-...+.++|+ +-+|.+.......... ..+...++..+-+...++++++.|++- | T Consensus 6 ~~~~l~~r~~~l~~~R~~~e~~w~e~~~--~~lP~~~~~~~~~~~~-~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp~~ 82 (555) T protein:vir:10 6 ERKLLLSRWGQLRTERESWMSHWKEISD--YLLPRAGRFFVQDRNR-GEKRHNNILDNTGTRALRVLAAGMMAGMTSPAR 82 (555) T ss_pred cHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhCcccccccCCCCCc-chhcccccccccHHHHHHHHHHHHHHhhcCCCC Confidence 222222 2332 23222333444442 1122211111111111 111234566777888888888777643 2 Q ss_pred --eeeccCch------HHHHHHH-------H-HhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEE Q lcl|NC_021326. 72 --IAFKHTDD------EVIKRID-------E-VLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPI 135 (445) Q Consensus 72 --~~~~~~d~------~~~~~l~-------~-~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v 135 (445) +++...+. ++.+.|. . +..+||...+.++.++..++|.+.+++..|..+.+++..++..+++.- T Consensus 83 ~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~~~v~ 162 (555) T protein:vir:10 83 PWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGEYAIA 162 (555) T ss_pred cccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecceeEEe Confidence 23333221 2333322 2 223678888999999999999999988888777788888988887654 Q ss_pred EcCCCCCceEEEEEEEeeec-------------------------ceeEEEEecceEEEEEEecceee------------ Q lcl|NC_021326. 136 WTDKEHEELEAFIRMYKLEN-------------------------ETKVEYWDKITVNYYVYENGSLI------------ 178 (445) Q Consensus 136 ~d~~~~~~~~~~v~~~~~~~-------------------------~~~~~~~~~~~~~~~~~~~~~~~------------ 178 (445) -|. .+++..++|.++..- +..++++ +.++-..... T Consensus 163 ~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~------~~V~pr~~~~~~~~~~~~~p~~ 234 (555) T protein:vir:10 163 ADN--QGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVI------HAIEPRADRDPSKRDDRNMAWK 234 (555) T ss_pred eCC--CCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEE------EEEeeccCcCcCCCCccccceE Confidence 443 577777776543211 0111111 1111000000 Q ss_pred -ecccccccccccccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcc Q lcl|NC_021326. 179 -PDYSNNLENSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQ 252 (445) Q Consensus 179 -~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~ 252 (445) ..+....+.. .-...-+|..+|++.+. .+.+|+|-.....+-+..+|...-.....++....|.+.+...... T Consensus 235 s~~~~~~~d~~-~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~ 313 (555) T protein:vir:10 235 SVYFEPGADET-RTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKN 313 (555) T ss_pred EEEEEeccCCc-cccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccc Confidence 0000000000 00122234566776543 3567999999999999999998777888889888887765332111 Q ss_pred cchhHHHhhhhCceeecc--CCCc--eeeEeccCChHHHHHHHHHHHHHHHHHhCcc--ccccccccCcchHHHHHHHHH Q lcl|NC_021326. 253 ELPEFKRLLRYYGAIKVS--DNGG--VDTIQVEVPVENSKKYLDELYQKIMLFGQAV--DFSSDKFGSAPSGVALEFLYT 326 (445) Q Consensus 253 ~~~~~~~~~~~~~~~~~~--~~~~--~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p--~~~~~~~~~~~Sg~Ai~~~~~ 326 (445) . . ..+..+++..+. ..++ ........+.......++.++..|...-... .......+...||+.+..... T Consensus 314 ~---~-~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~ 389 (555) T protein:vir:10 314 Q---D-ISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHE 389 (555) T ss_pred c---c-ceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHH Confidence 1 0 112222222222 2222 1222334466777788888888776544322 111112234567777765443 Q ss_pred HHHHHHHH-----HHHHHHHHHHHHHHHHHHHhccCC-----CcceEEEEeCCCCCCCHHH-HH-------HHHHHHhcc Q lcl|NC_021326. 327 NLNLKADK-----LARKAKVAIQELLWFVFEHFDIKG-----EHKDVDISFNYNKVANTEL-QV-------QTAQQSMGI 388 (445) Q Consensus 327 ~l~~k~~~-----~~~~~~~~l~~~~~~~~~~~~~~~-----~~~~i~v~f~~~~p~d~~~-~~-------~~~~~~~g~ 388 (445) .+.....- ....+.+-+.+++.++.+..-.+. ....++|.+..++-+.... .+ +.+..++++ T Consensus 390 E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~ 469 (555) T protein:vir:10 390 EKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGI 469 (555) T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 33333222 123333444455555444221111 2345667765544332111 11 122222232 Q ss_pred -------CChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHhhhc--cccCCCCCCCCC-CCCCCcCCC Q lcl|NC_021326. 389 -------VSHETVLENH---PFVE----DLQAELERIEQEQMEYNKQLPN--LDDGGADGAQQK-ERSNDKQSE 445 (445) Q Consensus 389 -------~s~et~l~~l---~~~~----d~~~E~~ri~~E~~~~~~~~~~--~~~~~~~~~~~~-~~~~d~~~~ 445 (445) +....++..+ -+++ -.++|++++.+++++..+.++. .........+.- +..-+.++. T Consensus 470 ~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~ 543 (555) T protein:vir:10 470 KPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNA 543 (555) T ss_pred ChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchh Confidence 2223333221 1222 1346666666554432222111 111111111100 111122223 No 125 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=98.82 E-value=3.5e-08 Score=61.48 Aligned_cols=429 Identities=9% Similarity=-0.013 Sum_probs=196.6 Q ss_pred ChHHHH---HHHH-HHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C--- Q lcl|NC_021326. 1 MIVRYI---KQHL-EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P--- 71 (445) Q Consensus 1 ~l~~~i---~~~~-~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~--- 71 (445) ..+++. +..+ +|.+-...+.++|+ +-+|.+.......... ..+...++..+-+...++++++.|++- | T Consensus 6 ~~~~l~~r~~~l~~~R~~~e~~w~e~~~--~~lP~~~~~~~~~~~~-~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp~~ 82 (555) T protein:vir:10 6 ERKLLLSRWGQLRTERESWMSHWKEISD--YLLPRAGRFFVQDRNR-GEKRHNNILDNTGTRALRVLAAGMMAGMTSPAR 82 (555) T ss_pred cHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhCcccccccCCCCCc-chhcccccccccHHHHHHHHHHHHHHhhcCCCC Confidence 222222 2332 23222333444442 1122211111111111 111234566777888888888777643 2 Q ss_pred --eeeccCch------HHHHHHH-------H-HhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEE Q lcl|NC_021326. 72 --IAFKHTDD------EVIKRID-------E-VLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPI 135 (445) Q Consensus 72 --~~~~~~d~------~~~~~l~-------~-~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v 135 (445) +++...+. ++.+.|. . +..+||...+.++.++..++|.+.+++..|..+.+++..++..+++.- T Consensus 83 ~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~~~v~ 162 (555) T protein:vir:10 83 PWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGEYAIA 162 (555) T ss_pred cccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecceeEEe Confidence 23333221 2333322 2 223678888999999999999999988888777788888988887654 Q ss_pred EcCCCCCceEEEEEEEeeec-------------------------ceeEEEEecceEEEEEEecceee------------ Q lcl|NC_021326. 136 WTDKEHEELEAFIRMYKLEN-------------------------ETKVEYWDKITVNYYVYENGSLI------------ 178 (445) Q Consensus 136 ~d~~~~~~~~~~v~~~~~~~-------------------------~~~~~~~~~~~~~~~~~~~~~~~------------ 178 (445) -|. .+++..++|.++..- +..++++ +.++-..... T Consensus 163 ~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~------~~V~pr~~~~~~~~~~~~~p~~ 234 (555) T protein:vir:10 163 ADN--QGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVI------HAIEPRADRDPSKRDDRNMAWK 234 (555) T ss_pred eCC--CCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEE------EEEeeccCcCcCCCCccccceE Confidence 443 577777776543211 0111111 1111000000 Q ss_pred -ecccccccccccccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcc Q lcl|NC_021326. 179 -PDYSNNLENSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQ 252 (445) Q Consensus 179 -~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~ 252 (445) ..+....+.. .-...-+|..+|++.+. .+.+|+|-.....+-+..+|...-.....++....|.+.+...... T Consensus 235 s~~~~~~~d~~-~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~ 313 (555) T protein:vir:10 235 SVYFEPGADET-RTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKN 313 (555) T ss_pred EEEEEeccCCc-cccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccc Confidence 0000000000 00122234566776543 3567999999999999999998777888889888887765332111 Q ss_pred cchhHHHhhhhCceeecc--CCCc--eeeEeccCChHHHHHHHHHHHHHHHHHhCcc--ccccccccCcchHHHHHHHHH Q lcl|NC_021326. 253 ELPEFKRLLRYYGAIKVS--DNGG--VDTIQVEVPVENSKKYLDELYQKIMLFGQAV--DFSSDKFGSAPSGVALEFLYT 326 (445) Q Consensus 253 ~~~~~~~~~~~~~~~~~~--~~~~--~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p--~~~~~~~~~~~Sg~Ai~~~~~ 326 (445) . . ..+..+++..+. ..++ ........+.......++.++..|...-... .......+...||+.+..... T Consensus 314 ~---~-~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~ 389 (555) T protein:vir:10 314 Q---D-ISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHE 389 (555) T ss_pred c---c-ceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHH Confidence 1 0 112222222222 2222 1222334466777788888888776544322 111112234567777765443 Q ss_pred HHHHHHHH-----HHHHHHHHHHHHHHHHHHHhccCC-----CcceEEEEeCCCCCCCHHH-HH-------HHHHHHhcc Q lcl|NC_021326. 327 NLNLKADK-----LARKAKVAIQELLWFVFEHFDIKG-----EHKDVDISFNYNKVANTEL-QV-------QTAQQSMGI 388 (445) Q Consensus 327 ~l~~k~~~-----~~~~~~~~l~~~~~~~~~~~~~~~-----~~~~i~v~f~~~~p~d~~~-~~-------~~~~~~~g~ 388 (445) .+.....- ....+.+-+.+++.++.+..-.+. ....++|.+..++-+.... .+ +.+..++++ T Consensus 390 E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~ 469 (555) T protein:vir:10 390 EKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGI 469 (555) T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 33333222 123333444455555444221111 2345667765544332111 11 122222232 Q ss_pred -------CChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHhhhc--cccCCCCCCCCC-CCCCCcCCC Q lcl|NC_021326. 389 -------VSHETVLENH---PFVE----DLQAELERIEQEQMEYNKQLPN--LDDGGADGAQQK-ERSNDKQSE 445 (445) Q Consensus 389 -------~s~et~l~~l---~~~~----d~~~E~~ri~~E~~~~~~~~~~--~~~~~~~~~~~~-~~~~d~~~~ 445 (445) +....++..+ -+++ -.++|++++.+++++..+.++. .........+.- +..-+.++. T Consensus 470 ~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~ 543 (555) T protein:vir:10 470 KPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNA 543 (555) T ss_pred ChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchh Confidence 2223333221 1222 1346666666554432222111 111111111100 111122223 No 126 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=98.82 E-value=3.5e-08 Score=61.48 Aligned_cols=429 Identities=9% Similarity=-0.013 Sum_probs=196.6 Q ss_pred ChHHHH---HHHH-HHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C--- Q lcl|NC_021326. 1 MIVRYI---KQHL-EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P--- 71 (445) Q Consensus 1 ~l~~~i---~~~~-~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~--- 71 (445) ..+++. +..+ +|.+-...+.++|+ +-+|.+.......... ..+...++..+-+...++++++.|++- | T Consensus 6 ~~~~l~~r~~~l~~~R~~~e~~w~e~~~--~~lP~~~~~~~~~~~~-~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp~~ 82 (555) T protein:vir:98 6 ERKLLLSRWGQLRTERESWMSHWKEISD--YLLPRAGRFFVQDRNR-GEKRHNNILDNTGTRALRVLAAGMMAGMTSPAR 82 (555) T ss_pred cHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhCcccccccCCCCCc-chhcccccccccHHHHHHHHHHHHHHhhcCCCC Confidence 222222 2332 23222333444442 1122211111111111 111234566777888888888777643 2 Q ss_pred --eeeccCch------HHHHHHH-------H-HhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEE Q lcl|NC_021326. 72 --IAFKHTDD------EVIKRID-------E-VLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPI 135 (445) Q Consensus 72 --~~~~~~d~------~~~~~l~-------~-~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v 135 (445) +++...+. ++.+.|. . +..+||...+.++.++..++|.+.+++..|..+.+++..++..+++.- T Consensus 83 ~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~~~v~ 162 (555) T protein:vir:98 83 PWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGEYAIA 162 (555) T ss_pred cccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecceeEEe Confidence 23333221 2333322 2 223678888999999999999999988888777788888988887654 Q ss_pred EcCCCCCceEEEEEEEeeec-------------------------ceeEEEEecceEEEEEEecceee------------ Q lcl|NC_021326. 136 WTDKEHEELEAFIRMYKLEN-------------------------ETKVEYWDKITVNYYVYENGSLI------------ 178 (445) Q Consensus 136 ~d~~~~~~~~~~v~~~~~~~-------------------------~~~~~~~~~~~~~~~~~~~~~~~------------ 178 (445) -|. .+++..++|.++..- +..++++ +.++-..... T Consensus 163 ~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~------~~V~pr~~~~~~~~~~~~~p~~ 234 (555) T protein:vir:98 163 ADN--QGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVI------HAIEPRADRDPSKRDDRNMAWK 234 (555) T ss_pred eCC--CCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEE------EEEeeccCcCcCCCCccccceE Confidence 443 577777776543211 0111111 1111000000 Q ss_pred -ecccccccccccccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcc Q lcl|NC_021326. 179 -PDYSNNLENSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQ 252 (445) Q Consensus 179 -~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~ 252 (445) ..+....+.. .-...-+|..+|++.+. .+.+|+|-.....+-+..+|...-.....++....|.+.+...... T Consensus 235 s~~~~~~~d~~-~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~~~~ 313 (555) T protein:vir:98 235 SVYFEPGADET-RTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPVSAKN 313 (555) T ss_pred EEEEEeccCCc-cccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccc Confidence 0000000000 00122234566776543 3567999999999999999998777888889888887765332111 Q ss_pred cchhHHHhhhhCceeecc--CCCc--eeeEeccCChHHHHHHHHHHHHHHHHHhCcc--ccccccccCcchHHHHHHHHH Q lcl|NC_021326. 253 ELPEFKRLLRYYGAIKVS--DNGG--VDTIQVEVPVENSKKYLDELYQKIMLFGQAV--DFSSDKFGSAPSGVALEFLYT 326 (445) Q Consensus 253 ~~~~~~~~~~~~~~~~~~--~~~~--~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p--~~~~~~~~~~~Sg~Ai~~~~~ 326 (445) . . ..+..+++..+. ..++ ........+.......++.++..|...-... .......+...||+.+..... T Consensus 314 ~---~-~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TAtEV~~r~~ 389 (555) T protein:vir:98 314 Q---D-ISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTATEVAERHE 389 (555) T ss_pred c---c-ceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccHHHHHHHHH Confidence 1 0 112222222222 2222 1222334466777788888888776544322 111112234567777765443 Q ss_pred HHHHHHHH-----HHHHHHHHHHHHHHHHHHHhccCC-----CcceEEEEeCCCCCCCHHH-HH-------HHHHHHhcc Q lcl|NC_021326. 327 NLNLKADK-----LARKAKVAIQELLWFVFEHFDIKG-----EHKDVDISFNYNKVANTEL-QV-------QTAQQSMGI 388 (445) Q Consensus 327 ~l~~k~~~-----~~~~~~~~l~~~~~~~~~~~~~~~-----~~~~i~v~f~~~~p~d~~~-~~-------~~~~~~~g~ 388 (445) .+.....- ....+.+-+.+++.++.+..-.+. ....++|.+..++-+.... .+ +.+..++++ T Consensus 390 E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~l~~i~~laq~ 469 (555) T protein:vir:98 390 EKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRFVGNLGAVAGI 469 (555) T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 33333222 123333444455555444221111 2345667765544332111 11 122222232 Q ss_pred -------CChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHhhhc--cccCCCCCCCCC-CCCCCcCCC Q lcl|NC_021326. 389 -------VSHETVLENH---PFVE----DLQAELERIEQEQMEYNKQLPN--LDDGGADGAQQK-ERSNDKQSE 445 (445) Q Consensus 389 -------~s~et~l~~l---~~~~----d~~~E~~ri~~E~~~~~~~~~~--~~~~~~~~~~~~-~~~~d~~~~ 445 (445) +....++..+ -+++ -.++|++++.+++++..+.++. .........+.- +..-+.++. T Consensus 470 ~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~~~~~ 543 (555) T protein:vir:98 470 KPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTSKQNA 543 (555) T ss_pred ChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCcchh Confidence 2223333221 1222 1346666666554432222111 111111111100 111122223 No 127 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=98.81 E-value=4e-08 Score=61.17 Aligned_cols=420 Identities=11% Similarity=0.068 Sum_probs=207.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccc--ccccccccccccccccchHHHHHHHHHhhhhcc--C----- Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVD--ATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P----- 71 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~--~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~----- 71 (445) |++++=.-..+|.+-...+.+||+= -+|.+..... ........+.+.++..+-+...++++++.|++- | T Consensus 6 l~~r~~~l~~~R~~~e~~w~e~~~~--~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~ltPp~~~W 83 (547) T protein:vir:10 6 IVKRLDFLKTDRKNVEQIWDCIRKY--IMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLTSPATKW 83 (547) T ss_pred HHHHHHHHHHHhhHHHHHHHHHHHH--hcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhcCCCCcc Confidence 3333222222343333444455432 1222211110 011111122345677788888888888877653 2 Q ss_pred eeeccCc------hHHHHHHHH--------HhccCHHHHHHHHHHHHHhcCeEEEEEEECC--CCcEEEEEEccceeEEE Q lcl|NC_021326. 72 IAFKHTD------DEVIKRIDE--------VLGNRFDDKLHSVLTGASNKGIEWLHPYLDE--EGEFKLFRVPAEQGIPI 135 (445) Q Consensus 72 ~~~~~~d------~~~~~~l~~--------~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~--~g~~~i~~~~p~~~~~v 135 (445) +++...| .++...+.. +...||...+.++.++..++|.+.+++..|+ .+.++++.++..+++.- T Consensus 84 F~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~~~pl~~~~v~ 163 (547) T protein:vir:10 84 FELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQSSPIQDSYFE 163 (547) T ss_pred cccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEEEeecceEEEe Confidence 2333322 223333322 2235788889999999999999988887654 35688999998887655 Q ss_pred EcCCCCCceEEEEEEEeeecc------------------------e---eEEEEecceEEEEEEeccee----------- Q lcl|NC_021326. 136 WTDKEHEELEAFIRMYKLENE------------------------T---KVEYWDKITVNYYVYENGSL----------- 177 (445) Q Consensus 136 ~d~~~~~~~~~~v~~~~~~~~------------------------~---~~~~~~~~~~~~~~~~~~~~----------- 177 (445) -|. .+++..++|.++..-. . .++++ +.++-.... T Consensus 164 ~d~--~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~------~~v~~~~~~~~~~~~~~~~~ 235 (547) T protein:vir:10 164 EDS--RGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVV------MCVFTRYDKKQNRNAGTVLA 235 (547) T ss_pred eCC--CcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEE------EEEeeccCCCCCccccceee Confidence 443 5677776665432100 0 11111 111000000 Q ss_pred -------eecccccccccccccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeE Q lcl|NC_021326. 178 -------IPDYSNNLENSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYV 245 (445) Q Consensus 178 -------~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~ 245 (445) ....... + ...-....+|..+|++.+. ++.+|+|-.....+-+..+|...-......+....|.+. T Consensus 236 ~~~~p~~s~~~e~~-~-~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~ 313 (547) T protein:vir:10 236 PTERPFGKKWILKE-G-AVQLGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPAIM 313 (547) T ss_pred ccccceeEEEEEec-C-ceeeeecCCcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 0000000 0 0001122345567776553 356799999999999999999988899999999999886 Q ss_pred EecCCcccchhHHHhhhhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHH Q lcl|NC_021326. 246 LTNYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLY 325 (445) Q Consensus 246 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~ 325 (445) +.-..... . ..+..++++..++..+++.+....+.......++.++..|...-....+ ....+...+++.+.... T Consensus 314 v~~~g~~~---~-~~~~pgg~~~~~~~~~v~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~-~~~~~~~~TAtEV~~r~ 388 (547) T protein:vir:10 314 VTERGLIS---D-IDLGASGLTVVRDMESMKPFESRARFDVSSIQLTDLRSAVRRIYYVDQL-QMKDSPAMTATEVQVRY 388 (547) T ss_pred cccccccc---c-ceecCCeeeecCCcccceeeecccchHHHHHHHHHHHHHHHHHhhhhhh-hcCCCccccHHHHHHHH Confidence 53211111 1 1234455555566667787777777777788888888877654332221 12223456777776554 Q ss_pred HHHHHHHHHH-----HHHHHHHHHHHHHHHHHHhccCC--------CcceEEEEeCCCCCCCHHH-HHH-------HHHH Q lcl|NC_021326. 326 TNLNLKADKL-----ARKAKVAIQELLWFVFEHFDIKG--------EHKDVDISFNYNKVANTEL-QVQ-------TAQQ 384 (445) Q Consensus 326 ~~l~~k~~~~-----~~~~~~~l~~~~~~~~~~~~~~~--------~~~~i~v~f~~~~p~d~~~-~~~-------~~~~ 384 (445) ..+.....-. ...+.+-+.+++.++.+..-.+. ....++|++..++-+.... .+. .+.. T Consensus 389 ~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~~~~i~~~~~~v~~ 468 (547) T protein:vir:10 389 ELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGSTAQ 468 (547) T ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEEeccHHHHHHHHHHHHHHHHHHHHHHH Confidence 4443332221 12334445555555544322221 2345677776555443211 111 2222 Q ss_pred Hhcc-------CChHHHHHhC---CCCC----CHHHHHHHHHHHHHHHHHhhh---------cc---ccCCCCCCCCCC Q lcl|NC_021326. 385 SMGI-------VSHETVLENH---PFVE----DLQAELERIEQEQMEYNKQLP---------NL---DDGGADGAQQKE 437 (445) Q Consensus 385 ~~g~-------~s~et~l~~l---~~~~----d~~~E~~ri~~E~~~~~~~~~---------~~---~~~~~~~~~~~~ 437 (445) ++++ +....++..+ -+++ -.++|++.+.+++++..+.+. .. .+.++...++.+ T Consensus 469 laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~qaa~~~~~g~~m~~~~~~~a~~~~~~ 547 (547) T protein:vir:10 469 LAEINPEVLDIPDWDEMVRMLGSLLGAPQTLMRPKAKVTSIRKNRSQTQQKAEQAAIAEAEGNAMEAQGKGQAALKENQ 547 (547) T ss_pred hhccChhhhhcCCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccchhccC Confidence 2232 2223333322 1232 124666666555433222211 11 111111112222 No 128 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=98.72 E-value=8.5e-08 Score=59.35 Aligned_cols=398 Identities=9% Similarity=0.005 Sum_probs=155.2 Q ss_pred ChHHHHHHHH------------------HHHHHHHHHHHHhcCCCcc-----cccccccccccccccccc------ccc- Q lcl|NC_021326. 1 MIVRYIKQHL------------------EKLPEISIGQEYYEQRPDI-----VKEPKPVDATGAVDPLKP------DDR- 50 (445) Q Consensus 1 ~l~~~i~~~~------------------~~~~~~~~~~~yy~G~~~i-----~~~~~~~~~~~~~~~~~~------~~r- 50 (445) +..++..... .+......+.++=.++..- ........+-......++ ..+ T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~~~ 82 (547) T protein:vir:63 3 LFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVLKK 82 (547) T ss_pred hhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHHHHHHH Confidence 1111111100 0000001122222221110 000000000000000000 000 Q ss_pred -cccchHHHHHHHHHhhhhc-----------cCeeec---------cCchHHHHHHHHHhcc----------CHHHHHHH Q lcl|NC_021326. 51 -MITNFHANLVDQKVSYIVG-----------KPIAFK---------HTDDEVIKRIDEVLGN----------RFDDKLHS 99 (445) Q Consensus 51 -i~~n~~~~iv~~~~~~l~g-----------~~~~~~---------~~d~~~~~~l~~~~~n----------~~~~~~~~ 99 (445) ...+....+|++.++.+.+ -++.+. ..+......+..++.+ .+..++.. T Consensus 83 ~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~~f~~~ 162 (547) T protein:vir:63 83 FGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKK 162 (547) T ss_pred hhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchHHHHHHH Confidence 1123455555555543321 111211 1222233345554421 23455666 Q ss_pred HHHHHHhcCeEEEEEEECCCCcEE-EEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceee Q lcl|NC_021326. 100 VLTGASNKGIEWLHPYLDEEGEFK-LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLI 178 (445) Q Consensus 100 ~~~~~~~~G~~~~~v~~d~~g~~~-i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (445) +..+.+.+|.+|+.+..+..|++. +..++|..+.++.++.- ......++|+..........+... T Consensus 163 lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~~~~~~~~~~~~------------- 228 (547) T protein:vir:63 163 IVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADG-KIPDNGNRFVQVIDQKIVATFNAR------------- 228 (547) T ss_pred HHHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCcc-ccccCceEEEEEcCCcEEEEeccc------------- Confidence 778899999999999999999865 78899998877755421 111111111111111000000000 Q ss_pred ecccccccccccccccccccccceEEecC--------CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeE--Eec Q lcl|NC_021326. 179 PDYSNNLENSKTHFSTGSWGKIPFIPFKN--------NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYV--LTN 248 (445) Q Consensus 179 ~~~~~~~~~~~~~~~~~~~g~iPvv~~~n--------~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~--~~g 248 (445) . |+|++. ...|.|.++.+...+.....+..-....+...+.|-.+ +.| T Consensus 229 --------------------e--iih~r~n~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~ 286 (547) T protein:vir:63 229 --------------------E--MAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKA 286 (547) T ss_pred --------------------c--EEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecC Confidence 0 222221 23478888777777766665555555556666666544 344 Q ss_pred C---CcccchhHHHhhh-------hC-ceeeccCCCceeeEeccC--ChHHHHHHHHHHHHHHHHHhCccccccccccCc Q lcl|NC_021326. 249 Y---DDQELPEFKRLLR-------YY-GAIKVSDNGGVDTIQVEV--PVENSKKYLDELYQKIMLFGQAVDFSSDKFGSA 315 (445) Q Consensus 249 ~---~~~~~~~~~~~~~-------~~-~~~~~~~~~~~~~l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~ 315 (445) . +.+....++..+. .. ++..+ .+++++|..... ....+.+..+...+.|...-++|....+-...+ T Consensus 287 ~~~ls~e~~~~lk~~~~~~~~G~~nagk~~vl-~~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~ 365 (547) T protein:vir:63 287 AQQQSQHALEIFKREWKNSLSGINGSWQIPVV-SAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNG 365 (547) T ss_pred CCCCCHHHHHHHHHHHHHHhcCcccccccccc-cCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCccccc Confidence 3 2222222332221 11 12222 233455555444 334455666667777777778876544321111 Q ss_pred ---------chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---CCCcceEEEEeCCCCCCCHHHHHHHHH Q lcl|NC_021326. 316 ---------PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI---KGEHKDVDISFNYNKVANTELQVQTAQ 383 (445) Q Consensus 316 ---------~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~---~~~~~~i~v~f~~~~p~d~~~~~~~~~ 383 (445) .+...++... ...+...|.-++..+...+.. ......+.+.|......+..+.+.... T Consensus 366 ~~~~~~~~s~t~sn~e~~~----------~~~~~~tL~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~~~~~~~~~~~~~~ 435 (547) T protein:vir:63 366 GATGSKGGSLNEGNSAEKN----------QASKNKGLQPLLGFIEDFINKHIVAEFGDKYTFQFVGGDIKSELESVKILA 435 (547) T ss_pred ccccccccccchhhHHHHH----------HHHHHHHHHHHHHHHHHHHHhhcccccCCceEEEeeccccccHHHHHHHHH Confidence 1111111111 111222222222222211111 111235678888777777777665443 Q ss_pred H-HhccCChHHHHHhCCC---CCCHHHH-----H----HHHHHHH-----H-HHHHhhhccc--cCCCCCCCCC------ Q lcl|NC_021326. 384 Q-SMGIVSHETVLENHPF---VEDLQAE-----L----ERIEQEQ-----M-EYNKQLPNLD--DGGADGAQQK------ 436 (445) Q Consensus 384 ~-~~g~~s~et~l~~l~~---~~d~~~E-----~----~ri~~E~-----~-~~~~~~~~~~--~~~~~~~~~~------ 436 (445) . ..|+++.-.++++++. ++.-+.- + +..++++ . +......... ..+.++..++ T Consensus 436 ~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (547) T protein:vir:63 436 EKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTT 515 (547) T ss_pred HHhCCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCCccccchhhccccccccCCCCCCCCCCCCCCcccC Confidence 2 3589999888887744 2110000 0 0011111 0 0000000000 0001111111 Q ss_pred CCCCCcCC---C Q lcl|NC_021326. 437 ERSNDKQS---E 445 (445) Q Consensus 437 ~~~~d~~~---~ 445 (445) ++.+++.+ + T Consensus 516 ~~~~~d~~~~~~ 527 (547) T protein:vir:63 516 GDIGKDGQRKDK 527 (547) T ss_pred CCcCccccccCc Confidence 10000000 0 No 129 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=98.70 E-value=9.7e-08 Score=59.03 Aligned_cols=428 Identities=9% Similarity=0.018 Sum_probs=198.4 Q ss_pred ChHHHHHHHHHH----HHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C--- Q lcl|NC_021326. 1 MIVRYIKQHLEK----LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P--- 71 (445) Q Consensus 1 ~l~~~i~~~~~~----~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~--- 71 (445) -+++..+.++.+ ..+++.+.+|..- ..-..... ...+...++..+-+...++++++.|++- | T Consensus 13 ~~~~r~~~l~~~R~~~e~~w~e~~~~~lP-----~~~~~~~~----~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~ 83 (535) T protein:vir:33 13 GAKATYDRLTNDRRAYETRAENCAQYTIP-----SLFPKESD----NESTDYTTPWQAVGARGLNNLASKLMLALFPMQS 83 (535) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHhcc-----cccCCCCC----cccccccccccccHHHHHHHHHHHHHHhhcCCCc Confidence 344444444432 3444444444322 11000000 0011112344566677777777776542 2 Q ss_pred -eeeccCch-------------HHHHHHH--------HHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEcc Q lcl|NC_021326. 72 -IAFKHTDD-------------EVIKRID--------EVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPA 129 (445) Q Consensus 72 -~~~~~~d~-------------~~~~~l~--------~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p 129 (445) +++...+. ++...+. .+..+||...+.++.++..++|.+.+++..+..+.++++.++- T Consensus 84 WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl 163 (535) T protein:vir:33 84 WMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYRL 163 (535) T ss_pred ccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCceeeEEEEc Confidence 22222221 1222221 2234688999999999999999999988777666778888876 Q ss_pred ceeEEEEcCCCCCceEEEEEEEeee--------------------cceeEEEEecceEEEEEEecceeeecccccccccc Q lcl|NC_021326. 130 EQGIPIWTDKEHEELEAFIRMYKLE--------------------NETKVEYWDKITVNYYVYENGSLIPDYSNNLENSK 189 (445) Q Consensus 130 ~~~~~v~d~~~~~~~~~~v~~~~~~--------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 189 (445) .+++.. .+. .+++...+|.++.. ....+++|+.. +...+................ T Consensus 164 ~~~~v~-~d~-~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v---~~~~~~~~~~~~~~~~~~~~~ 238 (535) T protein:vir:33 164 SSYVVQ-RDA-YGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHV---YLDEESGDYLKYEEVEDVEID 238 (535) T ss_pred CeeEEe-eCC-CCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEEE---EeeCCCCcEEEEEEEeCcccc Confidence 665443 333 57777776665431 01112222111 011111111100000011111 Q ss_pred cccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhC Q lcl|NC_021326. 190 THFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYY 264 (445) Q Consensus 190 ~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 264 (445) ......++..+|++.+. ++.+|+|-..+..+-+..+|...-......+....|.+.+.-.......... .... T Consensus 239 ~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~~~--~~~~ 316 (535) T protein:vir:33 239 GSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRLT--KAQT 316 (535) T ss_pred ccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcc--cCCc Confidence 11222346667776553 3568999999999999999999888999999999998765321111111111 1122 Q ss_pred ceeeccCCCceeeEecc--CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHH-----HHH Q lcl|NC_021326. 265 GAIKVSDNGGVDTIQVE--VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADK-----LAR 337 (445) Q Consensus 265 ~~~~~~~~~~~~~l~~~--~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~-----~~~ 337 (445) +.+..+..++++.+... .+.......++.++..|...-.... .....+...+|+.+......+.....- ... T Consensus 317 g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~-~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E 395 (535) T protein:vir:33 317 GDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNS-AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQE 395 (535) T ss_pred eeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh-cccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHH Confidence 33444455666666433 4667778888888877765432211 111223456777776543333333221 112 Q ss_pred HHHHHHHHHHHHHHHHhccCC-CcceEEEEeCCCCCCC-HHHHHHHHH----HHhcc--------CChHHHHHhC---CC Q lcl|NC_021326. 338 KAKVAIQELLWFVFEHFDIKG-EHKDVDISFNYNKVAN-TELQVQTAQ----QSMGI--------VSHETVLENH---PF 400 (445) Q Consensus 338 ~~~~~l~~~~~~~~~~~~~~~-~~~~i~v~f~~~~p~d-~~~~~~~~~----~~~g~--------~s~et~l~~l---~~ 400 (445) .+.+-+.+++.++.+..-.+. ....++++|.-++... ..+.++.+. .++++ +....++..+ -+ T Consensus 396 ll~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~G 475 (535) T protein:vir:33 396 LQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIG 475 (535) T ss_pred HHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcC Confidence 223333333443332211111 1234667765444321 111222222 22221 2223333222 12 Q ss_pred CC-----CHHHHHHHHHHHHHHHHHhhhccc-cCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 401 VE-----DLQAELERIEQEQMEYNKQLPNLD-DGGADGAQQKERSNDKQSE 445 (445) Q Consensus 401 ~~-----d~~~E~~ri~~E~~~~~~~~~~~~-~~~~~~~~~~~~~~d~~~~ 445 (445) ++ -.++|++++.+++.+.....+... -+++.++......++.++= T Consensus 476 vp~~~i~~~~ee~~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 526 (535) T protein:vir:33 476 IDTSGILLTDEQKQALMMQDAAQTGVENAAAAGGAGVGALATSSPEAMQGA 526 (535) T ss_pred CCHhHhcCCHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhhcCChhHHHH Confidence 22 234666666655544333322211 1122222222221111111 No 130 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=98.67 E-value=1.2e-07 Score=58.50 Aligned_cols=375 Identities=13% Similarity=0.055 Sum_probs=150.6 Q ss_pred ChH--HHHHHHHHH-----HHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhh------ Q lcl|NC_021326. 1 MIV--RYIKQHLEK-----LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYI------ 67 (445) Q Consensus 1 ~l~--~~i~~~~~~-----~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l------ 67 (445) +|. .+-..+..| ...+..+.+-| ++++++.. -...|++..+.+. T Consensus 58 ~~~~~~~~~~~~~r~~~~~~~~l~~~~~~~-~~npiv~~----------------------~I~~ia~~IA~~~~~~~~~ 114 (551) T protein:vir:80 58 VIGSMSANPGFKTKPSIRNNQDLHGVLKKF-GGNIILNA----------------------IINTRSNQVSMYCKPARHS 114 (551) T ss_pred cccceecCcccccCccccChhHHHHHHHHh-hcCHHHHH----------------------HHHHHHHHHhhhhhhhhhh Confidence 221 000111110 01111111222 11222110 0122222222211 Q ss_pred -hccCeeec---------cCchHHHHHHHHHhc--c--------CHHHHHHHHHHHHHhcCeEEEEEEECCCCcEE-EEE Q lcl|NC_021326. 68 -VGKPIAFK---------HTDDEVIKRIDEVLG--N--------RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFK-LFR 126 (445) Q Consensus 68 -~g~~~~~~---------~~d~~~~~~l~~~~~--n--------~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~-i~~ 126 (445) -|-++.+. ..+......+..++. | .+..++..+..+.+.+|.+|+.+..+..|+|. +.. T Consensus 115 ~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s~~~f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~ 194 (551) T protein:vir:80 115 EKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVA 194 (551) T ss_pred cCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCccchHHHHHHHHHHHHHhcCCEEEEEEECCCCcEEEEEE Confidence 01122221 112222233444432 1 23345566778889999999999899999865 788 Q ss_pred EccceeEEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec Q lcl|NC_021326. 127 VPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK 206 (445) Q Consensus 127 ~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~ 206 (445) ++|..+.++.++.- ......++|+..........+... . |+|++ T Consensus 195 l~p~~V~v~~~~~g-~~~~~~~~y~~~~~g~~~~~~~~~---------------------------------e--iiH~~ 238 (551) T protein:vir:80 195 KDPTTIFFATTADG-KIPDNGNRFVQVIDQKIVATFNAR---------------------------------E--MAFAV 238 (551) T ss_pred eCCceeEEEECCcc-ccccCceEEEEEeCCcEEEEEccc---------------------------------c--eEEec Confidence 99999887765431 111111122111111100000000 0 22222 Q ss_pred C--------CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEE--ecCC---cccchhHHHhhh-------h-Cc Q lcl|NC_021326. 207 N--------NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL--TNYD---DQELPEFKRLLR-------Y-YG 265 (445) Q Consensus 207 n--------~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~--~g~~---~~~~~~~~~~~~-------~-~~ 265 (445) . ...|.|.++.+...+.....+.......+...+.|-.++ .|.. .+....++..+. . ++ T Consensus 239 ~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~ 318 (551) T protein:vir:80 239 RNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQ 318 (551) T ss_pred ccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCc Confidence 1 124778787777777666655555555666667776443 4432 222222232221 1 11 Q ss_pred eeeccCCCceeeEeccC--ChHHHHHHHHHHHHHHHHHhCccccccccccCc---------chHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 266 AIKVSDNGGVDTIQVEV--PVENSKKYLDELYQKIMLFGQAVDFSSDKFGSA---------PSGVALEFLYTNLNLKADK 334 (445) Q Consensus 266 ~~~~~~~~~~~~l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~---------~Sg~Ai~~~~~~l~~k~~~ 334 (445) +..+. +++++|..... ....+.+..+...+.|...-++|....+-.+.+ .+..-++.. T Consensus 319 ~~vl~-~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~---------- 387 (551) T protein:vir:80 319 IPVVS-AEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEK---------- 387 (551) T ss_pred ccccc-CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHH---------- Confidence 22222 33455554443 444566667777778888888886544311110 111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHhcc---CCCcceEEEEeCCCCCCCHHHHHHHHHH-HhccCChHHHHHhCCC---CCCHHH- Q lcl|NC_021326. 335 LARKAKVAIQELLWFVFEHFDI---KGEHKDVDISFNYNKVANTELQVQTAQQ-SMGIVSHETVLENHPF---VEDLQA- 406 (445) Q Consensus 335 ~~~~~~~~l~~~~~~~~~~~~~---~~~~~~i~v~f~~~~p~d~~~~~~~~~~-~~g~~s~et~l~~l~~---~~d~~~- 406 (445) ....+...|.-++..+...+.. ......+.+.|......+..+.+..... ..|+++.-.++++++. ++.-+. T Consensus 388 ~~~f~~~tL~P~~~~ie~~ln~~L~~~~~~~~~f~f~~~~~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~P~~egGD~~ 467 (551) T protein:vir:80 388 NQASKNKGLQPLLGFIEDFINKHIVAEFGDKYTFQFVGGDIKSELESVKILAEKAKVAMTVNEVRKELNLPGDVIGGDIP 467 (551) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhccccCCceEEEeeccChhhHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCcee Confidence 1111222222222222111110 1122457788887777777776664432 2588999888887754 211000 Q ss_pred --------HHHHHHHHHHHHHHh---hhcc-----ccCCCCCCCCCCCC------------CCcCCC Q lcl|NC_021326. 407 --------ELERIEQEQMEYNKQ---LPNL-----DDGGADGAQQKERS------------NDKQSE 445 (445) Q Consensus 407 --------E~~ri~~E~~~~~~~---~~~~-----~~~~~~~~~~~~~~------------~d~~~~ 445 (445) ..+..+.++.+..++ .... .+.+.+...++..+ .+++++ T Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~ 534 (551) T protein:vir:80 468 LNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQRKDKDNA 534 (551) T ss_pred ecccccccccccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCccccCCCccccccccCcccc Confidence 001111111000000 0000 00111111111110 000110 No 131 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=98.66 E-value=1.2e-07 Score=58.47 Aligned_cols=400 Identities=10% Similarity=-0.012 Sum_probs=155.2 Q ss_pred ChHHHH--HHHHHHHHHHHHHHHHhc-CCCccccccccccccccccccc------cccccccchHHHHHHHHHhhhhccC Q lcl|NC_021326. 1 MIVRYI--KQHLEKLPEISIGQEYYE-QRPDIVKEPKPVDATGAVDPLK------PDDRMITNFHANLVDQKVSYIVGKP 71 (445) Q Consensus 1 ~l~~~i--~~~~~~~~~~~~~~~yy~-G~~~i~~~~~~~~~~~~~~~~~------~~~ri~~n~~~~iv~~~~~~l~g~~ 71 (445) |=+|+- -.|..+..++.+..+-|. +--..-.........+...... .-+| ....++.||+..++-++-+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~rd~l~~~~~glg~~r~~~~~~~g~~~~~~~~~l~~~Yr-~~~ia~~iVd~~~d~~~~~~ 79 (449) T protein:vir:10 1 MTDKLTLAVNHALNDARMARARMGLMVPTMGLDNKRHSAWCEYGFPELVTYENLYSLYR-RGGIAHGAVEKLVGKCWQTN 79 (449) T ss_pred CchhhHHHHhhhcchhHHHHHHHHHHHHHhcCCcccchhhhhcCCcccCCHHHHHHHHh-cCchhHHHHHhhhhhhhhcC Confidence 333321 112222223322222221 1000000011111111000000 0011 13456789999988775544 Q ss_pred eeec-cCchH-------HHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCc Q lcl|NC_021326. 72 IAFK-HTDDE-------VIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEE 143 (445) Q Consensus 72 ~~~~-~~d~~-------~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~ 143 (445) +.+. ..+.+ ....++....+.+...+.++.+++..+|.+++++-.+ +|+.--.-+++. +. T Consensus 80 ~~i~~g~~~~~~~~~~~~e~~~~~l~~~~~~~~l~ea~~~~rl~Gga~i~i~v~-d~~~l~~Pl~~~-----------~~ 147 (449) T protein:vir:10 80 PEIIEGDDADDSEDETSWEKKSKQVFTNRLWRSFAEADRRRLVGRYAGILLHIR-DEKDWNLPATKG-----------RG 147 (449) T ss_pred cccccCccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcEEEEEEec-CCCCCCcccccC-----------cc Confidence 4343 22211 1122334444566677888888989999988877663 333211101110 11 Q ss_pred eEEEEEEEee--------ecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccH Q lcl|NC_021326. 144 LEAFIRMYKL--------ENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDI 215 (445) Q Consensus 144 ~~~~v~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~ 215 (445) +..+.-+|.. .+-..-.++.+.. |..... . .+........|.--.+.++.. ...|.|.+ T Consensus 148 i~~i~v~~~~~i~~~~~~~dp~sp~yg~P~~---y~v~~~-----~---~g~~~~~~~iH~SRl~~~~~~--~~~g~~~L 214 (449) T protein:vir:10 148 LQKVSVSWAGSLKVAEWDTGINSKTYGQPKL---WKYTER-----L---PNGSSRRVDIHPDRVFILGDY--SEDAIGFL 214 (449) T ss_pred eeeEEeeccccCChhhhhcCCCCCCCCCceE---EEEeee-----c---cCCCccceeeccceeEeecCC--CCCChhHH Confidence 1111111110 0000001111111 110000 0 000111112333322222211 12356666 Q ss_pred HHHHHHHHHHHHHHHH-----HHHHHHHhcCC----e-----eEEecCCcccchh-H---HHhh-hhCceeeccCCCcee Q lcl|NC_021326. 216 FMYKTLIDAYNRRLSD-----LSNTFKDSNEL----T-----YVLTNYDDQELPE-F---KRLL-RYYGAIKVSDNGGVD 276 (445) Q Consensus 216 ~~v~~lid~~~~~~s~-----~~~~~~~~~~~----~-----l~~~g~~~~~~~~-~---~~~~-~~~~~~~~~~~~~~~ 276 (445) +++-+-+-.++.+.-. +.+..+..... + .-+.+.+.+...+ + ...+ +....+.+.++.+ T Consensus 215 ~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~i~~~~d-- 292 (449) T protein:vir:10 215 EPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLYGVSIDELQDKFNEVAGEINRGNDVLMTTQGAT-- 292 (449) T ss_pred HHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHhhCCchHHHHHHHHHHHHHhccchheeecCCcc-- Confidence 6544333222222111 11111111100 0 0011222221111 1 1111 2222334455554 Q ss_pred eEeccCChHHHHHHHHHHHHHHHHHhCcccc--ccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021326. 277 TIQVEVPVENSKKYLDELYQKIMLFGQAVDF--SSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF 354 (445) Q Consensus 277 ~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~--~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~ 354 (445) |.+.+.+.......++...+.+...+++|-. .+...++..|..-++ . -...++.++..+.+.|++++.+++... T Consensus 293 ~~~~~~~~sgl~d~l~~~~q~iaaa~~IP~t~L~Gqsp~glnst~D~~-n---yyd~i~~~Q~~l~p~le~l~~~l~~s~ 368 (449) T protein:vir:10 293 VTPLVTSVADPTATYNVNLQTAAAGVDIPTRILIGNQQAERSSTEDQK-Y---FNARCQSRRVDLSFEIEDFCDKLIELK 368 (449) T ss_pred eEEEecccCChhHHHHHHHHHHHHHhCCCeeeeeccCccccccchhHH-H---HHHHHHHHHHhhhHHHHHHHHHHHHhh Confidence 4455566777888888888889999999853 222333322322232 2 333444445568999999998876543 Q ss_pred ccCCCcceEEEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCC Q lcl|NC_021326. 355 DIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQ 434 (445) Q Consensus 355 ~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 434 (445) ..... .++.|+|+|-...+..+.|++..+.+...+........+-+ +. .|+. +..... ...+..... T Consensus 369 ~g~~~-~d~~i~f~pL~~~t~kEkAei~k~~A~a~~~~~~ag~~~~~-~~-~EiR-------~~~~~~---~~~~~~~~~ 435 (449) T protein:vir:10 369 IIDAV-AKKAVIWDDLNEQTGTEKLTNAKTMGEINQTMLGSGDNPAF-SR-EEIR-------TAAGYD---NDDEEPLGE 435 (449) T ss_pred cCCCC-CceeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHccccCCc-CH-HHHH-------HHhccc---CCCCCCCCC Confidence 32222 47999999999999999998776654322211100011111 11 1111 000000 000111111 Q ss_pred CCCCCCCcCCC Q lcl|NC_021326. 435 QKERSNDKQSE 445 (445) Q Consensus 435 ~~~~~~d~~~~ 445 (445) +...+.+++.+ T Consensus 436 e~~de~~~~~d 446 (449) T protein:vir:10 436 EDGDEEDKATD 446 (449) T ss_pred CCCccccccCC Confidence 11111222222 No 132 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=98.55 E-value=3e-07 Score=56.31 Aligned_cols=423 Identities=10% Similarity=0.009 Sum_probs=192.7 Q ss_pred ChHHHHHHHHHH----HHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C--- Q lcl|NC_021326. 1 MIVRYIKQHLEK----LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P--- 71 (445) Q Consensus 1 ~l~~~i~~~~~~----~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~--- 71 (445) =+++..+.++.+ ..+++.+.+|..- ..-...... ......++..+-+...++.+++.|++- | T Consensus 11 ~~~~r~~~l~~~R~~~e~~w~e~~~y~lP-----~~~~~~~~~----~~~~~~~~~dst~~~a~~~Las~l~~~ltP~~~ 81 (522) T protein:vir:94 11 GAKAVYDRLKNGRQPYETRAQNCAAVTIP-----SLFPKESDN----SSTEYTTPWQAVGARCLNNLAAKLMLALFPQSP 81 (522) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHhcc-----cccCCCCCc----ccccccccccccHHHHHHHHHHHHHhhcCCCCc Confidence 244445554433 3444445555322 111110010 111122455677777888888777542 2 Q ss_pred -eeeccCc-------------hHHHHHHH--------HHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEc Q lcl|NC_021326. 72 -IAFKHTD-------------DEVIKRID--------EVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVP 128 (445) Q Consensus 72 -~~~~~~d-------------~~~~~~l~--------~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~ 128 (445) +++...+ .++.+.+. .+..+||...+.++.++..++|.+++++..+.+|.+ .++.++ T Consensus 82 WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~~~p 161 (522) T protein:vir:94 82 WMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEPEQGTYSPMRMYR 161 (522) T ss_pred ccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeeccCCCceeeEEEEE Confidence 1222111 11222222 223468899999999999999999988776665544 466776 Q ss_pred cceeEEEEcCCCCCceEEEEEEEeeec------------------ceeEEEEecceEEEEEEecceeeeccccccccccc Q lcl|NC_021326. 129 AEQGIPIWTDKEHEELEAFIRMYKLEN------------------ETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKT 190 (445) Q Consensus 129 p~~~~~v~d~~~~~~~~~~v~~~~~~~------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 190 (445) -.+. .+-.+. .+++...+|.++..- ...+++++... .....+.. ........... T Consensus 162 l~~y-~v~~d~-~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~----~~~~~~~~-~~~~~g~~~~~ 234 (522) T protein:vir:94 162 LVSY-VVQRDA-FGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIY----RQDDEYLR-YEEVEGIEVTG 234 (522) T ss_pred cceE-EEeeCC-CcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEEEE----eeCCceeE-EeeccCceecc Confidence 6664 343333 577766666554311 11233322110 11111111 00101111111 Q ss_pred ccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhCc Q lcl|NC_021326. 191 HFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYG 265 (445) Q Consensus 191 ~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~ 265 (445) .....+|..+|++.+. ++.+|+|-.....+-+..+|...-......+....|.+.+.-.......... ....+ T Consensus 235 ~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~~~--~~~~g 312 (522) T protein:vir:94 235 TDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGITQPRRLN--KAATG 312 (522) T ss_pred cCCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccccchhee--ccCCc Confidence 1222356677876553 3468999999999999999999999999999999998776311111111111 11223 Q ss_pred eeeccCCCceeeEecc--CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHH-----HHHH Q lcl|NC_021326. 266 AIKVSDNGGVDTIQVE--VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADK-----LARK 338 (445) Q Consensus 266 ~~~~~~~~~~~~l~~~--~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~-----~~~~ 338 (445) .+..+..++++.+... .+.......++.++..|...-....+. ...+...||+.+......+.....- .... T Consensus 313 ~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~-~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~ 391 (522) T protein:vir:94 313 EFVAGRVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLNSAV-QRNAERVTAEEIRYVAGELEATLGGVYSVQSQEL 391 (522) T ss_pred eeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhc-cCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHH Confidence 3444445556655433 366777788888887776654322211 1223456777776544433333322 1222 Q ss_pred HHHHHHHHHHHHHHHhccCC-CcceEEEEeCCCCCC-CHHHHHHHHHHH----hcc--------CChHHHHH----hCCC Q lcl|NC_021326. 339 AKVAIQELLWFVFEHFDIKG-EHKDVDISFNYNKVA-NTELQVQTAQQS----MGI--------VSHETVLE----NHPF 400 (445) Q Consensus 339 ~~~~l~~~~~~~~~~~~~~~-~~~~i~v~f~~~~p~-d~~~~~~~~~~~----~g~--------~s~et~l~----~l~~ 400 (445) +.+-+.+++.++.+..-.+. ....+++++.-++.. ...+.++.+... +++ +....++. .+ + T Consensus 392 l~Pli~r~~~il~r~g~lP~~p~~~v~v~~~s~La~~qr~~~~~~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~-G 470 (522) T protein:vir:94 392 QLPIVRVLMNQLQSAGMIPDLPKEAVEPTVSTGLEALGRGQDLEKLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNAL-G 470 (522) T ss_pred HHHHHHHHHHHHHhcCCCCCCCcccEEeeEecHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHc-C Confidence 23333444444433222111 112356666433322 111122222211 111 12222222 22 2 Q ss_pred CC-----CHHHHHHHHHHHHHHHHHh--hhccccCC--CCCCCCCCCCCCcC Q lcl|NC_021326. 401 VE-----DLQAELERIEQEQMEYNKQ--LPNLDDGG--ADGAQQKERSNDKQ 443 (445) Q Consensus 401 ~~-----d~~~E~~ri~~E~~~~~~~--~~~~~~~~--~~~~~~~~~~~d~~ 443 (445) |+ -.++|++.+.+++.+.... .......+ +..+.+..++-..+ T Consensus 471 v~~~~ivr~~ee~~~~~~q~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 522 (522) T protein:vir:94 471 IDTAGLLLTQDEKIQRMAEQSSQQAVVQGASAAGANMGAAVGQGAGEDMAQA 522 (522) T ss_pred CChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcccchhhhcC Confidence 21 1245665555554322211 11111111 11111111111111 No 133 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=98.54 E-value=3.2e-07 Score=56.21 Aligned_cols=428 Identities=10% Similarity=0.035 Sum_probs=193.1 Q ss_pred ChHHHHHHHHHH----HHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--Ce-- Q lcl|NC_021326. 1 MIVRYIKQHLEK----LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--PI-- 72 (445) Q Consensus 1 ~l~~~i~~~~~~----~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~~-- 72 (445) .+++..+.++.+ ..+++.+.+|..- ..-.. .. .. ......++..+-+...++++++.|++- |. T Consensus 13 ~~~~r~~~l~~~R~~~e~~w~e~~~y~lP-----~~~~~-~~-~~--~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~ 83 (543) T protein:vir:88 13 GAKAVYERLKNDRVPYETRAENCAKVTIP-----SLFPK-DS-DN--SSTDYTTPWQAVGARGLNNLSAKVMLALFPLQS 83 (543) T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHhcc-----ccCCC-CC-Cc--ccccccccccchHHHHHHHHHHHHHHhhcCCCc Confidence 444444554432 2344444444322 11000 00 00 011112455666777888888776542 22 Q ss_pred --eeccCch-------------HHHHHHH--------HHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEE---EEE Q lcl|NC_021326. 73 --AFKHTDD-------------EVIKRID--------EVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFK---LFR 126 (445) Q Consensus 73 --~~~~~d~-------------~~~~~l~--------~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~---i~~ 126 (445) ++...+. ++...|. .+..+||...+.++.++..++|.+.+++..+....++ ++. T Consensus 84 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~~~~~~~~~ 163 (543) T protein:vir:88 84 WMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPPPDASSNSYNPMKL 163 (543) T ss_pred ccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeccCccccceecceEE Confidence 2222221 1222221 2223688899999999999999998877655432222 334 Q ss_pred EccceeEEEEcCCCCCceEEEEEEEeee-------------------cceeEEEEecceEEEEEEecceeeecccccccc Q lcl|NC_021326. 127 VPAEQGIPIWTDKEHEELEAFIRMYKLE-------------------NETKVEYWDKITVNYYVYENGSLIPDYSNNLEN 187 (445) Q Consensus 127 ~~p~~~~~v~d~~~~~~~~~~v~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 187 (445) ++- .-|.+..+. .+++...+|.++.. ....+++++... .+.+.........-.+.. T Consensus 164 ~pl-~~y~v~~d~-~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~~~V~---pr~~~~~~~~~~~~~~~~ 238 (543) T protein:vir:88 164 YTL-HNHVVQRDA-FGNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVYTHIY---IDDESGDFLSYQEIEGVE 238 (543) T ss_pred eEc-ceEEEeeCC-CCCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEEEEEEE---eecCCCcccccccccCee Confidence 433 334444444 57777776665421 111233332111 011111101111111111 Q ss_pred cccccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhh Q lcl|NC_021326. 188 SKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLR 262 (445) Q Consensus 188 ~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~ 262 (445) ........++..+|++.+. ++.+|+|-.....+-+..+|...-......+....|.+.+.-.......... .. T Consensus 239 v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~~~--~~ 316 (543) T protein:vir:88 239 VDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVNPNGITQVRRLV--KA 316 (543) T ss_pred eecCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhcc--cC Confidence 2222233445677876553 3468999999999999999999888999999999998775321111111111 11 Q ss_pred hCceeeccCCCceeeEecc--CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHH-----H Q lcl|NC_021326. 263 YYGAIKVSDNGGVDTIQVE--VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADK-----L 335 (445) Q Consensus 263 ~~~~~~~~~~~~~~~l~~~--~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~-----~ 335 (445) ..+.+..+..+++..+... .+.......++.++..|...-....+ ....+...+|+.+......+.....- . T Consensus 317 ~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~-~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~ 395 (543) T protein:vir:88 317 QTGDFVAGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLNSA-VQRSGERVTAEEIRYVASELEDTLGGVYSILS 395 (543) T ss_pred CCceeecCCCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhhhh-ccCCCCcccHHHHHHHHHHHHHHHhHHHHHHH Confidence 2233444455666665433 46777888888888877654332221 11223456777776543333333222 1 Q ss_pred HHHHHHHHHHHHHHHHHHhccCC-CcceEEEEeCCC-CCCCHHHHHHHHHHH---hccCC---------hHHHHHhC--- Q lcl|NC_021326. 336 ARKAKVAIQELLWFVFEHFDIKG-EHKDVDISFNYN-KVANTELQVQTAQQS---MGIVS---------HETVLENH--- 398 (445) Q Consensus 336 ~~~~~~~l~~~~~~~~~~~~~~~-~~~~i~v~f~~~-~p~d~~~~~~~~~~~---~g~~s---------~et~l~~l--- 398 (445) ...+.+-+.+++.++.+..-.+. ....+++.+.-+ -+......++.+... .|.++ .+.++..+ T Consensus 396 ~E~l~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~v~~~~~p~vld~id~d~~~~~~a~~ 475 (543) T protein:vir:88 396 QELQLPIVRVLLNQLQATQQIPNLPQEAVEPTVTTGAEALGRGQDLDKLTQFLNAVATVSQLNGDPDLNVNNIKLRLANA 475 (543) T ss_pred HHHHHHHHHHHHHHHHhcCCCCCCchhceeeeEEecHHHHHHHHHHHHHHHHHHHHHhccchhhhccCCHHHHHHHHHHH Confidence 22223333444444433221111 112345555322 222222333332221 22222 23333322 Q ss_pred CCCC-----CHHHHHHHHHHHHHHHHHhhhccc-cCCCCCCC-------------CCCCCCCcCCC Q lcl|NC_021326. 399 PFVE-----DLQAELERIEQEQMEYNKQLPNLD-DGGADGAQ-------------QKERSNDKQSE 445 (445) Q Consensus 399 ~~~~-----d~~~E~~ri~~E~~~~~~~~~~~~-~~~~~~~~-------------~~~~~~d~~~~ 445 (445) -+++ -.++|++++.+++++......... .+++.... +-+....+.+- T Consensus 476 ~Gv~~~~i~r~~~e~~~~~~q~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 541 (543) T protein:vir:88 476 IGIDTAGLLLTEAEKAQAQSQEMLKQGGLNAAAGIGSGVAAQATASPEAMESAMDTAGVQPGPIAT 541 (543) T ss_pred hCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhccChHHHHHHhhhcCCCCCCCCC Confidence 1331 134667777665543222211111 11100000 01111111111 No 134 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=98.54 E-value=3.2e-07 Score=56.18 Aligned_cols=383 Identities=10% Similarity=0.082 Sum_probs=149.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +|..+|+.....+..+.....+..+.-.. +..... .+.. .......-......++.--.+.. ++ T Consensus 95 iv~~~I~~ia~~vA~~~~~~~~~~~~~~~---~i~lk~--------~~~~-~~~~~~~~~~~l~~~l~~~~~~~---~p- 158 (576) T protein:vir:96 95 ILNAIILTRSNQVAMYCQPSRYNERGLGF---EVRMRD--------LDAE-PGKKEKEEIKRIENFILNTGRDK---DI- 158 (576) T ss_pred HHHHHHHHHHHHHHhhhhhhhhccccccc---eeEEec--------CcCc-cchhhhHhhhhHHhhHhhccCCC---CC- Confidence 45555555544444443222222111000 000000 0000 00000000011111110000000 00 Q ss_pred HHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCC--CcE-EEEEEccceeEEEEcCCCCCceEEEEEEEeeecce Q lcl|NC_021326. 81 VIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEE--GEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENET 157 (445) Q Consensus 81 ~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~--g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~ 157 (445) +...+..+...+..+.+.+|.+|+.+..+.+ |++ .+..++|..+.++.+... ..+....+++...... T Consensus 159 --------~~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~~g~~~~L~pl~p~~V~v~~~~dg-~~~~~~~~~~~~~~~~ 229 (576) T protein:vir:96 159 --------DRDSFQSFCRKIVRDTYTYDQVNFEKVFNKKNATTMDKFIAVDPSTIFYATDKNG-KIIKGGKRFVQVINKK 229 (576) T ss_pred --------ccccHHHHHHHHHHHHHhcCCeEEEEEEecCCCCceEEEEEeCCceeEEEECCCC-ceeeeeeEEEEecCCc Confidence 0113455667788889999999998876554 444 478899999888766531 1111111121111111 Q ss_pred eEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 158 KVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFK 237 (445) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~ 237 (445) ....+....+.+ |... |..-......|.|.+..+...+.....+..-....+. T Consensus 230 ~~~~~~~~dii~-------------------------~~~~--~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ 282 (576) T protein:vir:96 230 VVASFTSREMAM-------------------------GIRN--PRTELSSSGYGLSEVEIAMKQFIAYNNTETFNDRFFS 282 (576) T ss_pred eEEEecccceEE-------------------------Eeec--CCCCcccCcccccHHHHHHHHHHHHHHHHHHHHHHHh Confidence 100000000000 0000 0000001234777777766666666655555555566 Q ss_pred HhcCCeeEE--ecC---CcccchhHHHhhh--------hCc-eeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhC Q lcl|NC_021326. 238 DSNELTYVL--TNY---DDQELPEFKRLLR--------YYG-AIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQ 303 (445) Q Consensus 238 ~~~~~~l~~--~g~---~~~~~~~~~~~~~--------~~~-~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~ 303 (445) ..+.|-.++ .|. +.+....++..+. .++ .+.++++.+.+.++.......+.+..+...+.|+..-+ T Consensus 283 Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~afg 362 (576) T protein:vir:96 283 HGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGINGSWQVPVVMADDIKFVNMTPTANDMQFEKWLTYLINIISALYG 362 (576) T ss_pred ccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEeccCChhhHHHHHHHHHhHHHHHHHhC Confidence 666776544 342 2222222232221 122 24456665555555444556677778888888888888 Q ss_pred cccccccccc-Ccc---------hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---CCCcceEEEEeCCC Q lcl|NC_021326. 304 AVDFSSDKFG-SAP---------SGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI---KGEHKDVDISFNYN 370 (445) Q Consensus 304 ~p~~~~~~~~-~~~---------Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~---~~~~~~i~v~f~~~ 370 (445) +|....+-.. ++. +...++.... ..+...|.-++..+...+.. ......+.+.|.+. T Consensus 363 VPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~----------~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~f~r~ 432 (576) T protein:vir:96 363 IDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQ----------QSQNKGLQPLLRFIEDLINTHIISEYSDKYVFQFVGG 432 (576) T ss_pred CCHHHccccccccccccccccccccccHHHHHH----------HHHHHHHHHHHHHHHHHHHhhhchhccCceEEEeccC Confidence 8864433211 101 1111111111 11222222222222211110 11124466778776 Q ss_pred CCCCHHHHHHHHHH-HhccCChHHHHHhCCC--CCCHHHH-----HHHHH-----------HHHHHHHHhhhccccCCCC Q lcl|NC_021326. 371 KVANTELQVQTAQQ-SMGIVSHETVLENHPF--VEDLQAE-----LERIE-----------QEQMEYNKQLPNLDDGGAD 431 (445) Q Consensus 371 ~p~d~~~~~~~~~~-~~g~~s~et~l~~l~~--~~d~~~E-----~~ri~-----------~E~~~~~~~~~~~~~~~~~ 431 (445) .+.+..+...+... ..|+++.-.++++++. +++-+.- +..+. ++++...+.......+... T Consensus 433 d~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~ 512 (576) T protein:vir:96 433 DTKSELDKIKILQEEVKTYKTVNEARKEKGLKPIEGGDVLLDGSFIQSMSLNTQKEQYEDTKQKERFDMIQQFLNSPDDE 512 (576) T ss_pred CHHHHHHHHHHHHHHhcCccCHHHHHHHhCCCCCCCcceeccccccccccccccCCCCCCccccccccccccccCCCCCC Confidence 66666555544433 2599998888887743 2110000 00000 0000000000000000011 Q ss_pred CCCCCCCCCCcCCC Q lcl|NC_021326. 432 GAQQKERSNDKQSE 445 (445) Q Consensus 432 ~~~~~~~~~d~~~~ 445 (445) ...++...+..+++ T Consensus 513 ~~~~~s~~~~~~g~ 526 (576) T protein:vir:96 513 EPQQESTEDKVDGR 526 (576) T ss_pred CCCCCCCCCccccc Confidence 00111001111111 No 135 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=98.53 E-value=3.5e-07 Score=55.97 Aligned_cols=426 Identities=9% Similarity=-0.002 Sum_probs=186.6 Q ss_pred ChHHHHHHHHHH----HHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C--- Q lcl|NC_021326. 1 MIVRYIKQHLEK----LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P--- 71 (445) Q Consensus 1 ~l~~~i~~~~~~----~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~--- 71 (445) -+++..+.++.+ ..+++.+.+|..- ..-.. .+.. ..+...++..+-+...++++++.|++- | T Consensus 12 ~~~~r~~~lk~~R~~~e~~w~e~~~~~lP-----~~~~~-~~~~---~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~ 82 (536) T protein:vir:21 12 GAKSVYERLKNDRAPYETRAQNCAQYTIP-----SLFPK-DSDN---ASTDYQTPWQAVGARGLNNLASKLMLALFPMQT 82 (536) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHhcc-----cccCC-CCCc---ccccccccccccHHHHHHHHHHHHHHhhcCCCc Confidence 444445555433 3344444444321 11111 0100 111123466677777888888776542 2 Q ss_pred -eeeccCchH-------------HHHHH-------H-HHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEc Q lcl|NC_021326. 72 -IAFKHTDDE-------------VIKRI-------D-EVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVP 128 (445) Q Consensus 72 -~~~~~~d~~-------------~~~~l-------~-~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~ 128 (445) +++...+.. +...+ . .+..+||...+.++.++..++|.+.+++..+..+.+ .++.++ T Consensus 83 WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~f~~~p 162 (536) T protein:vir:21 83 WMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYR 162 (536) T ss_pred ccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEEEE Confidence 222222211 11112 1 222468889999999999999999987765544333 466777 Q ss_pred cceeEEEEcCCCCCceEEEEEEEeeec--------------------ceeEEEEecceEEEEEEecceeeeccccccccc Q lcl|NC_021326. 129 AEQGIPIWTDKEHEELEAFIRMYKLEN--------------------ETKVEYWDKITVNYYVYENGSLIPDYSNNLENS 188 (445) Q Consensus 129 p~~~~~v~d~~~~~~~~~~v~~~~~~~--------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 188 (445) -.+++..-| . .+++..++|.++..- ...+++++... ...+........+...... T Consensus 163 l~~~~v~~d-~-~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~---~~~~~~~~~~~~e~~g~~v 237 (536) T protein:vir:21 163 LSSYVVQRD-A-FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIY---LDEDSGEYLRYEEVEGMEV 237 (536) T ss_pred cCeEEEeeC-C-CCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEE---EecCCCcEEEEeccCCeee Confidence 666544333 3 577777766554211 11222222111 0111111111111111111 Q ss_pred ccccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEE-ecCCcccchhHHHhhh Q lcl|NC_021326. 189 KTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL-TNYDDQELPEFKRLLR 262 (445) Q Consensus 189 ~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~-~g~~~~~~~~~~~~~~ 262 (445) .......+|..+|++.+. .+.+|+|-..+..+-+..+|...-...........|.+.+ .+... ....... T Consensus 238 ~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~----~~~~~~~ 313 (536) T protein:vir:21 238 QGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGIT----QPRRLTK 313 (536) T ss_pred ccccCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCccccc----chhhhcc Confidence 122334456778887654 3457999999999999999988777777667766655443 22111 1111111 Q ss_pred -hCceeeccCCCceeeEe--ccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHH----- Q lcl|NC_021326. 263 -YYGAIKVSDNGGVDTIQ--VEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADK----- 334 (445) Q Consensus 263 -~~~~~~~~~~~~~~~l~--~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~----- 334 (445) ..+.+..+..+++..+. ...+.......++.++..|...-....+. ...+...||+.+......+.....- T Consensus 314 ~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~-~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl 392 (536) T protein:vir:21 314 AQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAV-QRTGERVTAEEIRYVASELEDTLGGVYSIL 392 (536) T ss_pred CCCcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcc-cCCCCCccHHHHHHHHHHHHHHhhHHHHHH Confidence 22233323334454443 33466667777888877775543322211 1223456777776654444433222 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCC-CcceEEEEeCCCCC-CCHHHHHHHHHH----Hhcc--------CChHHHHH---- Q lcl|NC_021326. 335 LARKAKVAIQELLWFVFEHFDIKG-EHKDVDISFNYNKV-ANTELQVQTAQQ----SMGI--------VSHETVLE---- 396 (445) Q Consensus 335 ~~~~~~~~l~~~~~~~~~~~~~~~-~~~~i~v~f~~~~p-~d~~~~~~~~~~----~~g~--------~s~et~l~---- 396 (445) ....+.+-+.+++.++.+..-.+. ...-+++.+.-++. ....+.++.+.. ++++ +....++. T Consensus 393 ~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~Pe~ld~~id~d~~~~~~a~ 472 (536) T protein:vir:21 393 SQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIAN 472 (536) T ss_pred HHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHH Confidence 112223333444444332211111 11123444432222 122222222221 2221 22233332 Q ss_pred hCCCCC----CHHHHHHHHHHHHHHHHHhhhc------ccc-----CCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 397 NHPFVE----DLQAELERIEQEQMEYNKQLPN------LDD-----GGADGAQQKERSNDKQSE 445 (445) Q Consensus 397 ~l~~~~----d~~~E~~ri~~E~~~~~~~~~~------~~~-----~~~~~~~~~~~~~d~~~~ 445 (445) .++..+ -.++|++.+.+++++..+..+. ... +.....+.....+-+++= T Consensus 473 ~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:21 473 AIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) T ss_pred HcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhHHhhhhccccCCCC Confidence 232211 1346776666554332221111 011 111111111111111111 No 136 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=98.51 E-value=3.9e-07 Score=55.72 Aligned_cols=426 Identities=9% Similarity=0.003 Sum_probs=186.0 Q ss_pred ChHHHHHHHHHH----HHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C--- Q lcl|NC_021326. 1 MIVRYIKQHLEK----LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P--- 71 (445) Q Consensus 1 ~l~~~i~~~~~~----~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~--- 71 (445) -+++..+.++.+ ..+++.+.+|..- ..-.. .+.. ..+...++..+-+...++++++.|++- | T Consensus 12 ~~~~r~~~l~~~R~~~e~~w~e~~~~~lP-----~~~~~-~~~~---~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~ 82 (536) T protein:vir:10 12 GAKSVYERLKNDRAPYETRAQNCAQYTIP-----SLFPK-DSDN---ASTDYQTPWQAVGARGLNNLASKLMLALFPMQT 82 (536) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHhcc-----cccCC-CCCc---ccccccccccccHHHHHHHHHHHHHhhhcCCCc Confidence 444444554433 3344445555321 11111 0110 111123456677777888888776542 2 Q ss_pred -eeeccCchH-------------HHHHH-------H-HHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEc Q lcl|NC_021326. 72 -IAFKHTDDE-------------VIKRI-------D-EVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVP 128 (445) Q Consensus 72 -~~~~~~d~~-------------~~~~l-------~-~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~ 128 (445) +++...+.. +...+ . .+..+||...+.++.++..++|.+.+++..+..+.+ .++.++ T Consensus 83 WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~~~~~p 162 (536) T protein:vir:10 83 WMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYR 162 (536) T ss_pred ccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEEEE Confidence 222222211 11112 1 222468889999999999999999987765544333 466777 Q ss_pred cceeEEEEcCCCCCceEEEEEEEeeec--------------------ceeEEEEecceEEEEEEecceeeeccccccccc Q lcl|NC_021326. 129 AEQGIPIWTDKEHEELEAFIRMYKLEN--------------------ETKVEYWDKITVNYYVYENGSLIPDYSNNLENS 188 (445) Q Consensus 129 p~~~~~v~d~~~~~~~~~~v~~~~~~~--------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 188 (445) -.+++.--| . .+++..++|.++..- ...+++++... ...+.........-..... T Consensus 163 l~~~~v~~d-~-~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~---~~~~~~~~~~~~e~~g~~v 237 (536) T protein:vir:10 163 LSSYVVQRD-A-FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIY---LDEASGEYLRYEEVEGMEV 237 (536) T ss_pred cCeEEEeeC-C-CCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEE---EecCCCcEEEEEeecCccc Confidence 666544333 3 577777776654310 11122222110 0101111111111111111 Q ss_pred ccccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEE-ecCCcccchhHHHhhh Q lcl|NC_021326. 189 KTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL-TNYDDQELPEFKRLLR 262 (445) Q Consensus 189 ~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~-~g~~~~~~~~~~~~~~ 262 (445) .......+|..+|++.+. .+.+|+|-..+..+-+..+|...-...........|.+.+ .+... ....... T Consensus 238 ~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~----~~~~~~~ 313 (536) T protein:vir:10 238 QGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGIT----QPRRLTK 313 (536) T ss_pred cccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCccccc----chhhhcc Confidence 122233456677876654 3467999999999999999988777777667766655443 22111 1111111 Q ss_pred -hCceeeccCCCceeeEe--ccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHH----- Q lcl|NC_021326. 263 -YYGAIKVSDNGGVDTIQ--VEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADK----- 334 (445) Q Consensus 263 -~~~~~~~~~~~~~~~l~--~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~----- 334 (445) ..+.+..+..+++..+. ...+.......++.++..|...-....+. ...+...||+.+......+.....- T Consensus 314 ~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~-~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl 392 (536) T protein:vir:10 314 AQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAV-QRTGERVTAEEIRYVASELEDTLGGVYSIL 392 (536) T ss_pred CCCcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcc-cCCCCCccHHHHHHHHHHHHHHhhHHHHHH Confidence 22233323334454443 33466667777888877775543322211 1223456777776654444433222 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCC-CcceEEEEeCCCCC-CCHHHHHHHHH----HHhcc--------CChHHHHH---- Q lcl|NC_021326. 335 LARKAKVAIQELLWFVFEHFDIKG-EHKDVDISFNYNKV-ANTELQVQTAQ----QSMGI--------VSHETVLE---- 396 (445) Q Consensus 335 ~~~~~~~~l~~~~~~~~~~~~~~~-~~~~i~v~f~~~~p-~d~~~~~~~~~----~~~g~--------~s~et~l~---- 396 (445) ....+.+-+.+++.++.+..-.+. ...-+++.+.-++. ....+.++.+. .++++ +....++. T Consensus 393 ~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~ 472 (536) T protein:vir:10 393 SQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIAN 472 (536) T ss_pred HHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHH Confidence 112223333444444332211111 11123444432222 12222222222 12221 22233332 Q ss_pred hCCCCC----CHHHHHHHHHHHHHHHHHhhhc------cccCCCCCC-CCCCCCCCcCC----C Q lcl|NC_021326. 397 NHPFVE----DLQAELERIEQEQMEYNKQLPN------LDDGGADGA-QQKERSNDKQS----E 445 (445) Q Consensus 397 ~l~~~~----d~~~E~~ri~~E~~~~~~~~~~------~~~~~~~~~-~~~~~~~d~~~----~ 445 (445) .++..+ -.++|++.+.+++++..+..+. .....+..+ +..+...+..| = T Consensus 473 ~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:10 473 AIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVGLQPGI 536 (536) T ss_pred HcCCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhHHhhhhccccCCCC Confidence 232211 1356777666554332221111 011011111 11111111111 1 No 137 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=98.39 E-value=8.6e-07 Score=53.84 Aligned_cols=398 Identities=11% Similarity=0.011 Sum_probs=185.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccccccccccccccc----cc------ccchHHHHHHHHHhhhhcc Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDD----RM------ITNFHANLVDQKVSYIVGK 70 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~----ri------~~n~~~~iv~~~~~~l~g~ 70 (445) +=...+.+. ..+++.-..+-|.+ |+...................+. .| ......-.+.+....+.+. T Consensus 12 ~~~~~~~~~--~~~~~~~~~~~~~~-~~~~gltp~~l~~il~~a~~gd~~~~~~L~edm~e~D~~i~s~l~~Rk~av~~~ 88 (526) T protein:vir:79 12 IRPQQLREP--QTSRLAGLAKEFAQ-HPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMSKRKRAILGL 88 (526) T ss_pred cCccccchh--hhhhhhhhhhhccc-CCCCCcCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHHHHHHHHHhCC Confidence 111111111 11122222222222 11111000000000000000000 01 1244555666667777888 Q ss_pred CeeeccC------chHHHHHHHHHhcc--CHHHHHHHHHHHHHhcCeEE-EEEEECCCCcEE---EEEEccceeEEEEcC Q lcl|NC_021326. 71 PIAFKHT------DDEVIKRIDEVLGN--RFDDKLHSVLTGASNKGIEW-LHPYLDEEGEFK---LFRVPAEQGIPIWTD 138 (445) Q Consensus 71 ~~~~~~~------d~~~~~~l~~~~~n--~~~~~~~~~~~~~~~~G~~~-~~v~~d~~g~~~---i~~~~p~~~~~v~d~ 138 (445) +..+... +....+++++++.+ ++...+..+ .++.-+|.++ .++|...+|... +.+.+|..+. |++ T Consensus 89 ~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~-ldA~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~F~--~~~ 165 (526) T protein:vir:79 89 DWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDA-LDGIGHGYSCIELEWALQGREWMPLAFHHRPQSWFQ--LNP 165 (526) T ss_pred CceEecCCCCChHHHHHHHHHHHHHhcccCHHHHHHHH-HhhhhhcceeEEEEEeecCCceeEEEeeeecccceE--ecc Confidence 8887642 23455677888764 455554444 4578888864 556655455433 3344443221 222 Q ss_pred CCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec--CCCCcCccHH Q lcl|NC_021326. 139 KEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK--NNDLEISDIF 216 (445) Q Consensus 139 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~--n~~~g~s~~~ 216 (445) ... ... ++. .....+. ...+++.|-.++-. .++.|.|.+. T Consensus 166 ~~~--~~l--~~~-~~~~~g~---------------------------------~l~~~k~iv~~~~~~~g~p~g~gLlr 207 (526) T protein:vir:79 166 EDQ--NEL--RLR-DNSPAGE---------------------------------ALQPFGWIIHRPRARSGYVARSGLFR 207 (526) T ss_pred CCC--cEE--Eec-CCCCCce---------------------------------eecCCceEEEeecCCcCCccccchHH Confidence 111 100 000 0000000 01123332222211 3567888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhH------HHhhhhCceeeccCCCceeeEecc-CChHHHHH Q lcl|NC_021326. 217 MYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEF------KRLLRYYGAIKVSDNGGVDTIQVE-VPVENSKK 289 (445) Q Consensus 217 ~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~ 289 (445) .+....--=+..+.+++.-++.+..|+++.+=......++. ...+....+..++.+..++|++.. .+...++. T Consensus 208 ~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky~~~a~~~ek~~L~~av~~i~~da~~iiP~~~~ie~~ea~~~~~~~f~~ 287 (526) T protein:vir:79 208 VLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPGTADEEKATLLRAVTGLGHAAAGIIPETMAIDFQQAAQGSSEPFLA 287 (526) T ss_pred HHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCCCCHHHHHHHHHHHHHHhcCcEEEecCCceeEEeecCCCCHHHHHH Confidence 76555554555788888889999999998763222222222 234455667888999999999854 45567899 Q ss_pred HHHHHHHHHHHHhCccccccccccCcchHHHH-HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhccCCC--cceEEE Q lcl|NC_021326. 290 YLDELYQKIMLFGQAVDFSSDKFGSAPSGVAL-EFLYTNLNLKADKLARKAKVAIQ-ELLWFVFEHFDIKGE--HKDVDI 365 (445) Q Consensus 290 ~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai-~~~~~~l~~k~~~~~~~~~~~l~-~~~~~~~~~~~~~~~--~~~i~v 365 (445) +++.+.+.|...--.--++.+...+..+.-|+ +....-....+..-.+.+...+. ++++.++.+...... ..-..+ T Consensus 288 li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~ 367 (526) T protein:vir:79 288 MMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHNEVRHDILASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRL 367 (526) T ss_pred HHHHHHHHHHHHHhhhhhccccccCcchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceE Confidence 99999888876531111111111111111111 11122233344555667777774 578877776543322 123567 Q ss_pred EeCCCCCCCHHHHHHHHHHHh--cc-CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCc Q lcl|NC_021326. 366 SFNYNKVANTELQVQTAQQSM--GI-VSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDK 442 (445) Q Consensus 366 ~f~~~~p~d~~~~~~~~~~~~--g~-~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 442 (445) .|....+.|..+.++.+.+++ |+ +|.+.+.+.++. +.++. -+.+.+...... ..+..................+ T Consensus 368 ~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~gi-p~~~~-~e~~l~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 444 (526) T protein:vir:79 368 VFDLREQADITSMAQSIPALVNVGLEIPSAWVYDKLGI-PQPAK-NEPVLRPAAQPA-ILSRQHGQRVAALATIVGPRYG 444 (526) T ss_pred EeCCCCcccHHHHHHHHHHHHhCCCcCCHHHHHHHhCC-CCCCC-chhhccccCCcc-ccccccccccccccccccccCc Confidence 888888999999999998875 66 999999998854 32221 111111000000 0000000000000011111111 Q ss_pred CCC Q lcl|NC_021326. 443 QSE 445 (445) Q Consensus 443 ~~~ 445 (445) +.+ T Consensus 445 ~~~ 447 (526) T protein:vir:79 445 DQQ 447 (526) T ss_pred hhh Confidence 111 No 138 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=98.39 E-value=8.8e-07 Score=53.77 Aligned_cols=424 Identities=10% Similarity=0.033 Sum_probs=186.4 Q ss_pred ChHHHHHHHHHH----HHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C--- Q lcl|NC_021326. 1 MIVRYIKQHLEK----LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P--- 71 (445) Q Consensus 1 ~l~~~i~~~~~~----~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~--- 71 (445) +.++..+.++.+ ..+++.+.+|.. |.+-...... ......++..+-+...++++++.|++- | T Consensus 4 ~a~~r~~~l~~~R~~~e~~w~e~~~y~l-----P~~~~~~~~~----~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~ 74 (542) T protein:vir:78 4 LAQARYSAMRADREDFLDMARRCAALTL-----PYLLTEDGHA----SGGRLQQPYQSLGSKGVNALSSKLMLSLFPIQT 74 (542) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHhc-----cccCCCCCCc----ccccccccccchHHHHHHHHHHHHHHhhcCCCC Confidence 444455555443 234444444432 1111110000 011123455677778888888777643 2 Q ss_pred --eeeccCc----------hH----HHHHH--------HHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEE Q lcl|NC_021326. 72 --IAFKHTD----------DE----VIKRI--------DEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRV 127 (445) Q Consensus 72 --~~~~~~d----------~~----~~~~l--------~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~ 127 (445) +++...+ ++ +...+ ..+..+||...+.++.++..++|.+.+++ ++++ ++++ T Consensus 75 ~WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~--~~~~---~~~~ 149 (542) T protein:vir:78 75 SFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFA--GKKT---LKVY 149 (542) T ss_pred ccccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEe--cCCC---ceEE Confidence 2333222 11 11111 23335788999999999999999987654 5543 4555 Q ss_pred ccceeEEEEcCCCCCceEEEEEEEeeecce--------e----------EEEEecceEEEEEEe--c-----------ce Q lcl|NC_021326. 128 PAEQGIPIWTDKEHEELEAFIRMYKLENET--------K----------VEYWDKITVNYYVYE--N-----------GS 176 (445) Q Consensus 128 ~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~--------~----------~~~~~~~~~~~~~~~--~-----------~~ 176 (445) +-.+ |.+-.+. .+++..++|.++..-.. . .+-.....+.+.+.. . .. T Consensus 150 pl~~-y~v~~d~-~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~~~~~~~ 227 (542) T protein:vir:78 150 PLDR-YVIERDG-DGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCKLVDGQ 227 (542) T ss_pred ecce-eEEeeCC-CCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccccccccCCCe Confidence 5444 3444443 57777776666432100 0 000000000000000 0 00 Q ss_pred eeecccccccccccccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCc Q lcl|NC_021326. 177 LIPDYSNNLENSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDD 251 (445) Q Consensus 177 ~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~ 251 (445) .....+-............+|..+|++.+. .+.+|+|-.....+-+..+|...-......+....|.+.+.-... T Consensus 228 ~s~~~e~~g~~v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~~~g~ 307 (542) T protein:vir:78 228 HRWHQECDGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVSPSAT 307 (542) T ss_pred EEEEEEeccccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccc Confidence 000000000000111223456667776543 346899999999999999999988899999999999876532111 Q ss_pred ccchhHHHhhhhCceeeccCCCceeeEe--ccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHH Q lcl|NC_021326. 252 QELPEFKRLLRYYGAIKVSDNGGVDTIQ--VEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLN 329 (445) Q Consensus 252 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~ 329 (445) ....... ....+.+..+..++++.+. ...+.......++.++..|...-.+-+. ..+...+|+.+......+. T Consensus 308 ~~~~~~~--~~~~g~iv~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~~~~---~d~~rvTAtEV~~r~~E~~ 382 (542) T protein:vir:78 308 TKPQSLA--RAGTGAIIQGRAEDVSVVQANKGADFRTVQEMIRDLSQRISDAFLILNV---RQSERTTATEVREVQMELD 382 (542) T ss_pred cchhhcc--cCCCceeecCCccceeeeecccccchhHHHHHHHHHHHHHHHHhccccc---CCcccccHHHHHHHHHHHH Confidence 1111111 1122344444455666554 3346677788888888877655332211 2233467777665433333 Q ss_pred HHHHH-----HHHHHHHHHHHHHHHHHHHhccCCC-cceEEEEeCCCCCC-CHHHHHHHH-------HHHhc------cC Q lcl|NC_021326. 330 LKADK-----LARKAKVAIQELLWFVFEHFDIKGE-HKDVDISFNYNKVA-NTELQVQTA-------QQSMG------IV 389 (445) Q Consensus 330 ~k~~~-----~~~~~~~~l~~~~~~~~~~~~~~~~-~~~i~v~f~~~~p~-d~~~~~~~~-------~~~~g------~~ 389 (445) ....- ....+.+-+.+++.++.+..-.+.- ..-+++.+.-++.. -....++.+ .++.| .+ T Consensus 383 ~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~lv~~~~~s~La~~~r~~~~~~l~~~~~~i~~~~~p~~l~~~i 462 (542) T protein:vir:78 383 RQLSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSLPKGLVMPTVVAGLGGVGRGEDRAALIEFMQTVGQAMGPEALQQFI 462 (542) T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhceeeeeechHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcC Confidence 33221 1223333444444544432222211 11255655443321 111112222 12111 12 Q ss_pred ChHHHHHhC---CCCC-----CHHHHHHHHHHHHHHHHHhh------hcccc--------CCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 390 SHETVLENH---PFVE-----DLQAELERIEQEQMEYNKQL------PNLDD--------GGADGAQQKERSNDKQSE 445 (445) Q Consensus 390 s~et~l~~l---~~~~-----d~~~E~~ri~~E~~~~~~~~------~~~~~--------~~~~~~~~~~~~~d~~~~ 445 (445) ....++..+ -+++ ..++|+++.+++.++.+... ..... ...+...+.+..+-.+|| T Consensus 463 d~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~~q~q~~~~~~al~~~a~~~a~~~~~~~~~~~~~a~~~~~~~~~~~~~ 540 (542) T protein:vir:78 463 DPTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQQQMTASLMGQAGQLAKSPIGEKMMQQINAPGQEAPAGPQTGE 540 (542) T ss_pred CHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhhcCCCCcCCCCCCcccc Confidence 222233222 1332 12345544444432211111 11010 011111111222223344 No 139 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=98.39 E-value=9e-07 Score=53.72 Aligned_cols=379 Identities=9% Similarity=0.050 Sum_probs=168.8 Q ss_pred ChHHHHHHHHHHHH-------HHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCee Q lcl|NC_021326. 1 MIVRYIKQHLEKLP-------EISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA 73 (445) Q Consensus 1 ~l~~~i~~~~~~~~-------~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~ 73 (445) +++++.+-.+.... ....+.++.-+... .. ......-+.+.-...+|+..++-+.+-|+. T Consensus 3 ~~~~~f~~~~r~~~~~~~~~~~~~~~~~~~g~~~~-----------~~--~v~~~~al~~~~v~~~i~~ia~~ia~l~~~ 69 (429) T protein:vir:10 3 SVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPS-----------TI--SVKGKNALKVATVFACIKILSESVSKLPLK 69 (429) T ss_pred hhhhhhcccccCcccccccCCChHHHHHHhcCCCC-----------cc--eechhhhhccHHHHHHHHHHHHhhccCceE Confidence 55554432211000 00111111110000 00 000000012223344666666665555665 Q ss_pred ec--cCc---hHHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCC Q lcl|NC_021326. 74 FK--HTD---DEVIKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEH 141 (445) Q Consensus 74 ~~--~~d---~~~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~ 141 (445) +- .++ ......+...+. | ........+..+.+.+|.+|+++..+..|++ .+..++|..+.+..++. T Consensus 70 ~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~-- 147 (429) T protein:vir:10 70 IYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDV-- 147 (429) T ss_pred EEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCc-- Confidence 41 111 111111222221 2 3456677788899999999999999998986 57888998887765542 Q ss_pred CceEEEEE-EEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccH Q lcl|NC_021326. 142 EELEAFIR-MYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDI 215 (445) Q Consensus 142 ~~~~~~v~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~ 215 (445) +.+....+ +|.......... +..=-|+|++. ...|.|.+ T Consensus 148 ~~~~~~~~~~~~~~~~g~~~~-----------------------------------~~~~evih~~~~~~~~~~~G~s~i 192 (429) T protein:vir:10 148 GLLNSKTKMWYVVNTGGQQRV-----------------------------------LKPEEILHFKNGITLDGLVGVPTM 192 (429) T ss_pred ccccccceEEEEEccCCeEEE-----------------------------------EccccEEEecCCCCCCCcccccHH Confidence 11111111 111100000000 00001444432 23477888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecC---CcccchhHHHhhh--------hCceeeccCCCceeeEeccCCh Q lcl|NC_021326. 216 FMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNY---DDQELPEFKRLLR--------YYGAIKVSDNGGVDTIQVEVPV 284 (445) Q Consensus 216 ~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~---~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~l~~~~~~ 284 (445) ..+...++.......-....+...+.|-.++... +.+........+. ..+++.++++.+.+.+...... T Consensus 193 ~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d 272 (429) T protein:vir:10 193 EYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSD 272 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhccccccCceeecCCCceEEEccCChhH Confidence 7777777766655555555666666776665532 2222222222221 2345556666555555433334 Q ss_pred HHHHHHHHHHHHHHHHHhCcccccccccc-CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----cCC Q lcl|NC_021326. 285 ENSKKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD-----IKG 358 (445) Q Consensus 285 ~~~~~~i~~l~~~i~~~s~~p~~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~-----~~~ 358 (445) ..+.+..+...+.|+..-++|....+... ++-|+ ++... ...+...|.-++..+...+. ... T Consensus 273 ~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn--~e~~~----------~~f~~~~l~P~~~~ie~~ln~kl~~~~~ 340 (429) T protein:vir:10 273 AQFLENTELTIRQIATAFGIKMHQLNDLSKATLNN--IEQQQ----------QQFYTDTLQATLTMYEQEMTYKLFLDSE 340 (429) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcChhh Confidence 45556667777888888888865443222 11111 11111 11222233333333322221 111 Q ss_pred --CcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCC Q lcl|NC_021326. 359 --EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQ 434 (445) Q Consensus 359 --~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 434 (445) ....+++.+...+..|..+.++.+.++ .|+++...++++++.-.- +..++.---. ....+........++++ T Consensus 341 ~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~--~ggD~~~~~~--n~~~~d~~~~~~~k~g~ 416 (429) T protein:vir:10 341 LDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPE--AGGDRLLVNG--NMLPIDMAGQAYLKGGD 416 (429) T ss_pred cCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCeeeecc--cccchhhccccccCCCC Confidence 112344445566678999999998887 489999999888754211 1111000000 00000000011112222 Q ss_pred CCCCCCCcCCC Q lcl|NC_021326. 435 QKERSNDKQSE 445 (445) Q Consensus 435 ~~~~~~d~~~~ 445 (445) +.++..++.+| T Consensus 417 ~~~~~~~~~~e 427 (429) T protein:vir:10 417 TNGEVSKEGNE 427 (429) T ss_pred CCCCCCCCCCC Confidence 33333333333 No 140 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=98.35 E-value=1.1e-06 Score=53.23 Aligned_cols=374 Identities=12% Similarity=0.110 Sum_probs=141.9 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhc-cCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVG-KPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g-~~~~~~~~d~ 79 (445) ++..+|+........+.-...+ ....+++-+. .....+..+. T Consensus 96 i~~~~I~t~~~~vA~~~~~~~~-------------------------------------~~~~~~~~i~l~~~~~~~~~~ 138 (563) T protein:vir:95 96 ILNAIILTRSNQVAMYCQPARY-------------------------------------SEKGLGFEVRLRDLDAEPGRK 138 (563) T ss_pred HHHHHHHHHHHHHHHHhhhhhh-------------------------------------hcccccceeEEeecCCCcchh Confidence 2222222222222222111111 1111111000 0000000000 Q ss_pred --HHHHHHHHHhc----------cCHHHHHHHHHHHHHhcCeEEEEEE--ECCCCcE-EEEEEccceeEEEEcCCCCCce Q lcl|NC_021326. 80 --EVIKRIDEVLG----------NRFDDKLHSVLTGASNKGIEWLHPY--LDEEGEF-KLFRVPAEQGIPIWTDKEHEEL 144 (445) Q Consensus 80 --~~~~~l~~~~~----------n~~~~~~~~~~~~~~~~G~~~~~v~--~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~ 144 (445) .....+..++. ..+..+...+..+.+.+|.+|+.+. .+..|++ .+..++|..+.+..++. +.+ T Consensus 139 ~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~--g~~ 216 (563) T protein:vir:95 139 EKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKK--GKI 216 (563) T ss_pred hhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCC--Cce Confidence 00011111110 1344566678889999999988765 4555765 47889999988776643 221 Q ss_pred E-EEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHH Q lcl|NC_021326. 145 E-AFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLID 223 (445) Q Consensus 145 ~-~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid 223 (445) . ...+++..........+....+- -|... |-........|.|.+..+...+. T Consensus 217 ~~~~~~y~~~~~g~~~~~~~~~evI-------------------------~~~~~--~~~d~~~~~~G~Spi~~a~~~i~ 269 (563) T protein:vir:95 217 IKGGKRFVQVVDKRVVASFTSRELA-------------------------MGIRN--PRTELSSSGYGLSEVEIAMKEFI 269 (563) T ss_pred eccceeEEEEeCCceeEEecCcceE-------------------------EEecc--CCCCcccCcccchHHHHHHHHHH Confidence 1 11111111111110000000000 00000 00000012357777776666666 Q ss_pred HHHHHHHHHHHHHHHhcCCeeEE--ecCC---cccchhHHHhhh--------hCce-eeccCCCceeeEeccCChHHHHH Q lcl|NC_021326. 224 AYNRRLSDLSNTFKDSNELTYVL--TNYD---DQELPEFKRLLR--------YYGA-IKVSDNGGVDTIQVEVPVENSKK 289 (445) Q Consensus 224 ~~~~~~s~~~~~~~~~~~~~l~~--~g~~---~~~~~~~~~~~~--------~~~~-~~~~~~~~~~~l~~~~~~~~~~~ 289 (445) ....+..-..+.+...+.|-.++ .|.. .+........+. .+++ +.++++.+.+-++.+.....+.+ T Consensus 270 ~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle 349 (563) T protein:vir:95 270 AYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEK 349 (563) T ss_pred HHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHH Confidence 55555555555566667776554 3432 222222222221 1222 44555555555544444556677 Q ss_pred HHHHHHHHHHHHhCccccccccc-----cCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---CCCcc Q lcl|NC_021326. 290 YLDELYQKIMLFGQAVDFSSDKF-----GSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI---KGEHK 361 (445) Q Consensus 290 ~i~~l~~~i~~~s~~p~~~~~~~-----~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~---~~~~~ 361 (445) ..+...+.|+..-++|....+-. .+...|..+... .+. ......+...|.-++..+...+.. ..... T Consensus 350 ~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~s--n~e---~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~~ 424 (563) T protein:vir:95 350 WLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEA--DPG---KKQQQSQNKGLQPLLRFIEDLVNRHIISEYGD 424 (563) T ss_pred HHHHHHHHHHHHhCCCHHHccccccccccccccccchhhc--cHH---HHHHHHHHHHHHHHHHHHHHHHHhhhchhccc Confidence 77788888888888886443311 111111111100 000 011112222222222222211111 11123 Q ss_pred eEEEEeCCCCCCCHHHHHHHHH-HHhccCChHHHHHhCCCC--CCHHH-----------HHHHH-HHHHHHHHHhhhccc Q lcl|NC_021326. 362 DVDISFNYNKVANTELQVQTAQ-QSMGIVSHETVLENHPFV--EDLQA-----------ELERI-EQEQMEYNKQLPNLD 426 (445) Q Consensus 362 ~i~v~f~~~~p~d~~~~~~~~~-~~~g~~s~et~l~~l~~~--~d~~~-----------E~~ri-~~E~~~~~~~~~~~~ 426 (445) .+.+.|.+..+.+..+...+.. ...|+++.-.++++++.- +.-+. ....- ..+.+.......... T Consensus 425 ~~~~~f~r~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (563) T protein:vir:95 425 KYTFQFVGGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMM 504 (563) T ss_pred ccEEEeccCCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcc Confidence 4567787765555555444322 125899988888776442 21000 00000 000000000000011 Q ss_pred cCCCCCCCCCCC-CC----CcCCC Q lcl|NC_021326. 427 DGGADGAQQKER-SN----DKQSE 445 (445) Q Consensus 427 ~~~~~~~~~~~~-~~----d~~~~ 445 (445) .....+..+++. .+ +++++ T Consensus 505 ~~~~~~~~~~~~~~~~~~~~~~~~ 528 (563) T protein:vir:95 505 SLLEGDNDDSEEGQSTDSSNDDKE 528 (563) T ss_pred cccCCCCCCCCCCCCCCCCCCccc Confidence 111111111111 11 11111 No 141 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=98.35 E-value=1.1e-06 Score=53.23 Aligned_cols=374 Identities=12% Similarity=0.110 Sum_probs=141.9 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhc-cCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVG-KPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g-~~~~~~~~d~ 79 (445) ++..+|+........+.-...+ ....+++-+. .....+..+. T Consensus 96 i~~~~I~t~~~~vA~~~~~~~~-------------------------------------~~~~~~~~i~l~~~~~~~~~~ 138 (563) T protein:vir:99 96 ILNAIILTRSNQVAMYCQPARY-------------------------------------SEKGLGFEVRLRDLDAEPGRK 138 (563) T ss_pred HHHHHHHHHHHHHHHHhhhhhh-------------------------------------hcccccceeEEeecCCCcchh Confidence 2222222222222222111111 1111111000 0000000000 Q ss_pred --HHHHHHHHHhc----------cCHHHHHHHHHHHHHhcCeEEEEEE--ECCCCcE-EEEEEccceeEEEEcCCCCCce Q lcl|NC_021326. 80 --EVIKRIDEVLG----------NRFDDKLHSVLTGASNKGIEWLHPY--LDEEGEF-KLFRVPAEQGIPIWTDKEHEEL 144 (445) Q Consensus 80 --~~~~~l~~~~~----------n~~~~~~~~~~~~~~~~G~~~~~v~--~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~ 144 (445) .....+..++. ..+..+...+..+.+.+|.+|+.+. .+..|++ .+..++|..+.+..++. +.+ T Consensus 139 ~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~--g~~ 216 (563) T protein:vir:99 139 EKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKK--GKI 216 (563) T ss_pred hhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCC--Cce Confidence 00011111110 1344566678889999999988765 4555765 47889999988776643 221 Q ss_pred E-EEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHH Q lcl|NC_021326. 145 E-AFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLID 223 (445) Q Consensus 145 ~-~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid 223 (445) . ...+++..........+....+- -|... |-........|.|.+..+...+. T Consensus 217 ~~~~~~y~~~~~g~~~~~~~~~evI-------------------------~~~~~--~~~d~~~~~~G~Spi~~a~~~i~ 269 (563) T protein:vir:99 217 IKGGKRFVQVVDKRVVASFTSRELA-------------------------MGIRN--PRTELSSSGYGLSEVEIAMKEFI 269 (563) T ss_pred eccceeEEEEeCCceeEEecCcceE-------------------------EEecc--CCCCcccCcccchHHHHHHHHHH Confidence 1 11111111111110000000000 00000 00000012357777776666666 Q ss_pred HHHHHHHHHHHHHHHhcCCeeEE--ecCC---cccchhHHHhhh--------hCce-eeccCCCceeeEeccCChHHHHH Q lcl|NC_021326. 224 AYNRRLSDLSNTFKDSNELTYVL--TNYD---DQELPEFKRLLR--------YYGA-IKVSDNGGVDTIQVEVPVENSKK 289 (445) Q Consensus 224 ~~~~~~s~~~~~~~~~~~~~l~~--~g~~---~~~~~~~~~~~~--------~~~~-~~~~~~~~~~~l~~~~~~~~~~~ 289 (445) ....+..-..+.+...+.|-.++ .|.. .+........+. .+++ +.++++.+.+-++.+.....+.+ T Consensus 270 ~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle 349 (563) T protein:vir:99 270 AYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEK 349 (563) T ss_pred HHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHH Confidence 55555555555566667776554 3432 222222222221 1222 44555555555544444556677 Q ss_pred HHHHHHHHHHHHhCccccccccc-----cCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---CCCcc Q lcl|NC_021326. 290 YLDELYQKIMLFGQAVDFSSDKF-----GSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI---KGEHK 361 (445) Q Consensus 290 ~i~~l~~~i~~~s~~p~~~~~~~-----~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~---~~~~~ 361 (445) ..+...+.|+..-++|....+-. .+...|..+... .+. ......+...|.-++..+...+.. ..... T Consensus 350 ~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~s--n~e---~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~~ 424 (563) T protein:vir:99 350 WLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEA--DPG---KKQQQSQNKGLQPLLRFIEDLVNRHIISEYGD 424 (563) T ss_pred HHHHHHHHHHHHhCCCHHHccccccccccccccccchhhc--cHH---HHHHHHHHHHHHHHHHHHHHHHHhhhchhccc Confidence 77788888888888886443311 111111111100 000 011112222222222222211111 11123 Q ss_pred eEEEEeCCCCCCCHHHHHHHHH-HHhccCChHHHHHhCCCC--CCHHH-----------HHHHH-HHHHHHHHHhhhccc Q lcl|NC_021326. 362 DVDISFNYNKVANTELQVQTAQ-QSMGIVSHETVLENHPFV--EDLQA-----------ELERI-EQEQMEYNKQLPNLD 426 (445) Q Consensus 362 ~i~v~f~~~~p~d~~~~~~~~~-~~~g~~s~et~l~~l~~~--~d~~~-----------E~~ri-~~E~~~~~~~~~~~~ 426 (445) .+.+.|.+..+.+..+...+.. ...|+++.-.++++++.- +.-+. ....- ..+.+.......... T Consensus 425 ~~~~~f~r~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (563) T protein:vir:99 425 KYTFQFVGGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMM 504 (563) T ss_pred ccEEEeccCCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcc Confidence 4567787765555555444322 125899988888776442 21000 00000 000000000000011 Q ss_pred cCCCCCCCCCCC-CC----CcCCC Q lcl|NC_021326. 427 DGGADGAQQKER-SN----DKQSE 445 (445) Q Consensus 427 ~~~~~~~~~~~~-~~----d~~~~ 445 (445) .....+..+++. .+ +++++ T Consensus 505 ~~~~~~~~~~~~~~~~~~~~~~~~ 528 (563) T protein:vir:99 505 SLLEGDNDDSEEGQSTDSSNDDKE 528 (563) T ss_pred cccCCCCCCCCCCCCCCCCCCccc Confidence 111111111111 11 11111 No 142 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=98.35 E-value=1.1e-06 Score=53.21 Aligned_cols=423 Identities=9% Similarity=-0.000 Sum_probs=183.9 Q ss_pred ChHHHHHHHHH----HHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C--- Q lcl|NC_021326. 1 MIVRYIKQHLE----KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P--- 71 (445) Q Consensus 1 ~l~~~i~~~~~----~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~--- 71 (445) =+++..+.++. -..+++.+.+|..- ..-..... . ......++..+-+...++.+++.|++- | T Consensus 14 ~~~~r~~~l~~~R~~~e~~w~e~~~y~lP-----~~~~~~~~--~--~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~ 84 (535) T protein:vir:94 14 GAKAVYDALKNDRNSYETRAENCAKYTIP-----SLFPKDSD--N--ASTDYTTPWQAVGARGLNNLASKLMLALFPMQT 84 (535) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHhcc-----ccCCCCCC--c--cccccCCcccccHHHHHHHHHHHHHhhhcCCCC Confidence 22222333332 23344444444321 11101000 0 111223456667777777777776542 2 Q ss_pred -eeeccCch-------------HHHHHHHH--------HhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEcc Q lcl|NC_021326. 72 -IAFKHTDD-------------EVIKRIDE--------VLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPA 129 (445) Q Consensus 72 -~~~~~~d~-------------~~~~~l~~--------~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p 129 (445) +++...+. ++.+.+.. +..+||...+.++.++..++|.+.+++..+.....+++.++- T Consensus 85 WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl 164 (535) T protein:vir:94 85 WMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIPEPEGTYNPMKLYRL 164 (535) T ss_pred ccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeccCcCcccceEEEEc Confidence 22222221 12223322 234689999999999999999998887666544456777765 Q ss_pred ceeEEEEcCCCCCceEEEEEEEeeec-------------------ceeEEEEecceEEEEEEecceeeeccccccccccc Q lcl|NC_021326. 130 EQGIPIWTDKEHEELEAFIRMYKLEN-------------------ETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKT 190 (445) Q Consensus 130 ~~~~~v~d~~~~~~~~~~v~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 190 (445) .+. .+-.+. .+++...+|.++..- ...+++++... ...+...........+..... T Consensus 165 ~~y-~v~~d~-~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~~v~---~~~~~~~~~~~~e~~g~~~~~ 239 (535) T protein:vir:94 165 SSY-VVQRDA-FGTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTHIY---LDEESGEYLKYEEIDGVEVEG 239 (535) T ss_pred CeE-EEeeCC-CCCeEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEEEEE---eeCCCCcEEEEEEecCeeecc Confidence 554 344443 577767666554211 11223332110 011111111000001111111 Q ss_pred ccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhh-hhC Q lcl|NC_021326. 191 HFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLL-RYY 264 (445) Q Consensus 191 ~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~-~~~ 264 (445) .....+|..+|++.+. .+.+|+|-..+..+-+..+|...-...........|.+.+.-... ....... ... T Consensus 240 ~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~---~~~~~~~~~~~ 316 (535) T protein:vir:94 240 TDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGI---TQVRRLTKAQT 316 (535) T ss_pred ccccCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccc---cchhhcccCCC Confidence 1234466677876654 346799999999999999998877777777777777655421001 1111111 122 Q ss_pred ceeeccCCCceeeEecc--CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHH-----HHH Q lcl|NC_021326. 265 GAIKVSDNGGVDTIQVE--VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADK-----LAR 337 (445) Q Consensus 265 ~~~~~~~~~~~~~l~~~--~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~-----~~~ 337 (445) +.+..+..+++.++... .+.+.....++.++..|...-.... .....+...+|+.+......+.....- ... T Consensus 317 g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E 395 (535) T protein:vir:94 317 GDFVSGRPEDISFLQLEKAADFSVARAVSEQIEGRLSYAFMLNS-AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQE 395 (535) T ss_pred ceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhHhh-hccCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHH Confidence 34443444556555433 4566777778887777765442222 111223446777776543333333221 122 Q ss_pred HHHHHHHHHHHHHHHHhccCC-CcceEEEEeCCCCC-CCHHHHHHHH----HHHhcc--------CChHHHHHhC---CC Q lcl|NC_021326. 338 KAKVAIQELLWFVFEHFDIKG-EHKDVDISFNYNKV-ANTELQVQTA----QQSMGI--------VSHETVLENH---PF 400 (445) Q Consensus 338 ~~~~~l~~~~~~~~~~~~~~~-~~~~i~v~f~~~~p-~d~~~~~~~~----~~~~g~--------~s~et~l~~l---~~ 400 (445) .+.+-+.+++.++.+..-.+. ...-+++.+.-++. ....+.++.+ ..++++ +....++..+ .+ T Consensus 396 lL~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~la~l~r~~~~~~l~~~~~~laq~~P~~ld~~id~d~~~~~~a~~~G 475 (535) T protein:vir:94 396 LQLPMVRVLLKQLQATNQIPELPKEAVEPTISTGMEALGRGQDLDKLERCIAAWSALAPMQGDPDINIATIKLRIANAIG 475 (535) T ss_pred HHHHHHHHHHHHHHhCCCCCCCChhhccceEeehHHHHHHHHHHHHHHHHHHHHHhhChHHhhhcCCHHHHHHHHHHHhC Confidence 223333444444333211111 11123444422221 1111122222 112222 2223333222 12 Q ss_pred CC-----CHHHHHHHHHHHHHHHHHhhhc------cccCCC--CCCC------CCCCCCC Q lcl|NC_021326. 401 VE-----DLQAELERIEQEQMEYNKQLPN------LDDGGA--DGAQ------QKERSND 441 (445) Q Consensus 401 ~~-----d~~~E~~ri~~E~~~~~~~~~~------~~~~~~--~~~~------~~~~~~d 441 (445) ++ -.++|++++.+++++..+.... ..++.+ ++.. +-+-.++ T Consensus 476 vp~~~i~rs~eev~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~ 535 (535) T protein:vir:94 476 IDTSGILKTPEEKQQEMAEAAQGTAMQNAAASAGAGAGTMATASPENMKAAAAQAGMAPN 535 (535) T ss_pred CChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccChHHHHHHHHHhccCCC Confidence 22 1345666555444332221111 111000 0000 0000111 No 143 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=98.34 E-value=1.2e-06 Score=53.05 Aligned_cols=375 Identities=12% Similarity=0.003 Sum_probs=159.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +.++.-...+.....-..+..++.|... +.... .+.-+.++-...+|+..++-+.+-|+. .+++. T Consensus 3 ~f~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~v~--~~~al~~~~V~~~v~~ia~~ia~~p~~--~~~~~ 67 (397) T protein:vir:38 3 LLKLNKSHSQGFSLNDPDWVNFLTGGEA-----------QKYVS--ADTALKNSDIFSLIMQLSGDLAMVRYT--SESDR 67 (397) T ss_pred chhhhhcccCcccCCchhhhhhhcCCcC-----------Cceec--hHHhhccHHHHHHHHHHHHHHhhCccc--ccccH Confidence 2221110000000000111111111100 00000 000011122233455555544444543 33333 Q ss_pred HHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEEEEeeeccee Q lcl|NC_021326. 81 VIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETK 158 (445) Q Consensus 81 ~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~ 158 (445) ...++..-.. .........+..+.+.+|.+|+.+-.+.+|++ .+..++|..+.+..+.+ .+.+.+.+ ....... T Consensus 68 ~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~~-~~~~~y~~---~~~~~~~ 143 (397) T protein:vir:38 68 SQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQD-GSGLIYNI---NFDEPAI 143 (397) T ss_pred HHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCC-CceEEEEE---Eeccccc Confidence 3222221111 13456677788899999999999988888876 57889999887765542 12211110 0000000 Q ss_pred EEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 159 VEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLS 233 (445) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~lid~~~~~~s~~~ 233 (445) .... .-+... |+|++. ...|.|.+......+.....+..-.. T Consensus 144 -------~~~~------------------------~~~~~e--iih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~ 190 (397) T protein:vir:38 144 -------GYME------------------------NVPAAD--VIHIRLLSKNGGKTGISPLSALINEQQIKDASNELTL 190 (397) T ss_pred -------ccee------------------------EecCcc--EEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHH Confidence 0000 000001 333322 13588888877777776666655566 Q ss_pred HHHHHhcCCeeEEecCC---cccchhHHHhh------h-hCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhC Q lcl|NC_021326. 234 NTFKDSNELTYVLTNYD---DQELPEFKRLL------R-YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQ 303 (445) Q Consensus 234 ~~~~~~~~~~l~~~g~~---~~~~~~~~~~~------~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~ 303 (445) +.+...+.|-.++.-.. .+........+ . ..+.+.++++.+++-+........+.+..+.....|+..-+ T Consensus 191 ~~f~ng~~~~~il~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afg 270 (397) T protein:vir:38 191 KALKQSVTASAVLTIQKGGLLDAETRIARSKEISKQIHNSDGPVVIDALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYG 270 (397) T ss_pred HHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceecCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhC Confidence 66677777766655322 11111111111 1 22344555555544444444555667778888888888888 Q ss_pred ccccccccccC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHH Q lcl|NC_021326. 304 AVDFSSDKFGS-APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTA 382 (445) Q Consensus 304 ~p~~~~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~ 382 (445) +|....+...+ +.+..+.+. .+...|.-++..+...+..+-- ..+++.+...+-.|..+.++.+ T Consensus 271 Vp~~~lg~~~~~~~~~e~~~~--------------~~~~~l~P~~~~ie~~ln~~l~-~~~~~~~~~~~~~d~~~~~~~~ 335 (397) T protein:vir:38 271 VPDSYLNGQGDQQSSITQISG--------------QYAKSLNRYVQAIVGELNDKLH-ANISANIRFAIDAMGDQYASTI 335 (397) T ss_pred CCHHHhCCCCCcccHHHHHHH--------------HHHHHHHHHHHHHHHHHHHhcc-ChhcccccccccCCHHHHHHHH Confidence 88654433222 222211111 1222333333332222111110 1122223333445777888887 Q ss_pred HHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcC Q lcl|NC_021326. 383 QQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQ 443 (445) Q Consensus 383 ~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 443 (445) .++ .|+++...+++.++.-.-...++-........ ........++..++.+.++..+|++ T Consensus 336 ~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~~~~-~~~~~~~~~g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 336 SSSVKGGTIAGNQARFILQNSGYLAKDLPDPEKEPQQ-AIQLIQQEGGENDGNNSDERGSDPE 397 (397) T ss_pred HHHHhCCCcCHHHHHHHhCCCCCCCCccccccccccc-cccccccccCCCCCCCCCCCCCCCC Confidence 776 48999999988774321000000000000000 0001111111122222222222322 No 144 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=98.31 E-value=1.4e-06 Score=52.62 Aligned_cols=397 Identities=11% Similarity=0.024 Sum_probs=186.4 Q ss_pred ChH--------HHHHHHHHHHHHHHHHHHHhcCCCcccccccccccccccccccccc----cc------ccchHHHHHHH Q lcl|NC_021326. 1 MIV--------RYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDD----RM------ITNFHANLVDQ 62 (445) Q Consensus 1 ~l~--------~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~----ri------~~n~~~~iv~~ 62 (445) +|. ..+.+. ...+..-..+.|.+ |+...................+. .+ ......-.+.+ T Consensus 4 ~~d~~g~p~~~~~~~~~--~~~~~~~~~~~~~~-~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e~m~e~D~~i~s~l~~ 80 (526) T protein:vir:99 4 IVDVYGNPIRTQQLREP--QTSRLAGLAKEFAQ-HPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMSK 80 (526) T ss_pred eECCCCCccccccccch--hhhhhhhhhhhhcc-cCcCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHHH Confidence 110 000000 11222222233322 21111100000000000000000 00 12344555666 Q ss_pred HHhhhhccCeeeccC------chHHHHHHHHHhcc--CHHHHHHHHHHHHHhcCeEE-EEEEECCCCcEE---EEEEccc Q lcl|NC_021326. 63 KVSYIVGKPIAFKHT------DDEVIKRIDEVLGN--RFDDKLHSVLTGASNKGIEW-LHPYLDEEGEFK---LFRVPAE 130 (445) Q Consensus 63 ~~~~l~g~~~~~~~~------d~~~~~~l~~~~~n--~~~~~~~~~~~~~~~~G~~~-~~v~~d~~g~~~---i~~~~p~ 130 (445) ...-+.+.+..+... +....+++++++.+ ++...+..+. ++.-+|.++ .++|...+|... +.+.+|. T Consensus 81 Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~ 159 (526) T protein:vir:99 81 RKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIELEWALQGREWMPLAFHHRPQS 159 (526) T ss_pred HHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcccCHHHHHHHHH-HhhhhcceeEEEEEeecCCceeEEEeeeeccc Confidence 666777888777642 22455677888764 4665555544 688888864 556665455433 3444443 Q ss_pred eeEEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec--CC Q lcl|NC_021326. 131 QGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK--NN 208 (445) Q Consensus 131 ~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~--n~ 208 (445) .+. |+.... ... ++. .....+. ...+++.|-.++-. .+ T Consensus 160 ~f~--~~~~~~--~~l--~~~-~~~~~g~---------------------------------~l~~~k~i~~~~~~~~g~ 199 (526) T protein:vir:99 160 WFQ--LNPEDQ--NEL--RLR-DNSPAGE---------------------------------ALQPFGWIIHRPRARSGY 199 (526) T ss_pred cee--eccCCC--cEE--Eec-CCCCCce---------------------------------eecCCCeEEEeecCCcCC Confidence 321 222111 100 000 0000000 01123332222211 35 Q ss_pred CCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhH------HHhhhhCceeeccCCCceeeEecc- Q lcl|NC_021326. 209 DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEF------KRLLRYYGAIKVSDNGGVDTIQVE- 281 (445) Q Consensus 209 ~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~l~~~- 281 (445) +.|.|.+..+....--=+..+.+++.-++.++.|+++.+=......++. ...+....+..++.+..++|++.. T Consensus 200 p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~L~~av~~i~~d~~~iiP~~~~ie~~ea~~ 279 (526) T protein:vir:99 200 VARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPGTADEEKATLLRAVTGLGHAAAGIIPETMAIDFQQAAQ 279 (526) T ss_pred ccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCCCCHHHHHHHHHHHHHHhhCcEEEecCCceeEEeecCC Confidence 6788888876555555555788888889999999998763222222221 234455667888999999999854 Q ss_pred CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHH-HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhccCCC Q lcl|NC_021326. 282 VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVAL-EFLYTNLNLKADKLARKAKVAIQ-ELLWFVFEHFDIKGE 359 (445) Q Consensus 282 ~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai-~~~~~~l~~k~~~~~~~~~~~l~-~~~~~~~~~~~~~~~ 359 (445) .+...++.+++.+.+.|...--.--++.+...+..+.-|+ +....-....+..-.+.+...+. ++++.++.+...... T Consensus 280 ~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~ 359 (526) T protein:vir:99 280 GSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHNEVRHDLLASDARQLAATLSRDLLWPLLVLNRPGSP 359 (526) T ss_pred CCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcC Confidence 4556789999999888876531111121111111111121 22222233444555667777774 588887777543221 Q ss_pred --cceEEEEeCCCCCCCHHHHHHHHHHHh--cc-CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccc-cCCCCCC Q lcl|NC_021326. 360 --HKDVDISFNYNKVANTELQVQTAQQSM--GI-VSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLD-DGGADGA 433 (445) Q Consensus 360 --~~~i~v~f~~~~p~d~~~~~~~~~~~~--g~-~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~-~~~~~~~ 433 (445) ..-..+.|....+.|..+.++.+.+++ |+ +|.+.+.+.++. +.+... +.+...... ...+... ....... T Consensus 360 ~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~Gi-p~~~~~-e~~l~~~~~--~~~~~~~~~~~~~~~ 435 (526) T protein:vir:99 360 DVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYDKLGI-PQPAKN-EPVLRSAAQ--PAILSRQHGQRVAAL 435 (526) T ss_pred CccccceEEeCCCCcccHHHHHHHHHHHHhCCCccCHHHHHHHhCC-CCCCCc-ccccCCCCC--Ccccccccccccccc Confidence 224577888889999999999998875 66 999999998855 222110 000000000 0000000 0000000 Q ss_pred CCCCCCCCcCCC Q lcl|NC_021326. 434 QQKERSNDKQSE 445 (445) Q Consensus 434 ~~~~~~~d~~~~ 445 (445) ........++.+ T Consensus 436 ~~~~~~~~~~~~ 447 (526) T protein:vir:99 436 ATIVGPRYGDQQ 447 (526) T ss_pred cccccccCcchh Confidence 000000001111 No 145 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=98.30 E-value=1.6e-06 Score=52.42 Aligned_cols=398 Identities=10% Similarity=-0.022 Sum_probs=186.3 Q ss_pred Ch--------HHHHHHHHHHHHHHHHHHHHhcCCCcccccccccccccccccccccc----cc------ccchHHHHHHH Q lcl|NC_021326. 1 MI--------VRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDD----RM------ITNFHANLVDQ 62 (445) Q Consensus 1 ~l--------~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~----ri------~~n~~~~iv~~ 62 (445) +| ...+.+ ....+..-..+.|.+ |+...................+. ++ ......-.+.+ T Consensus 4 ~~d~~g~p~~~~~~~~--~~~~~~~~~~~~~~~-~~~~gltp~~l~~il~~a~~gd~~~~~~L~~~m~e~D~~i~s~l~~ 80 (528) T protein:vir:10 4 IVDIYGNPLRTQQLRK--QQTAHLAGLAKEFAN-HPAKGLTPAKLAHILIEAEQGHLQAQAELFMDMEERDAHLFAEMSK 80 (528) T ss_pred eECCCCCccccccccc--hhhhhhhhhhhhhcc-cCCCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHHH Confidence 11 000000 011222222233322 11111100000000000000000 01 13455566777 Q ss_pred HHhhhhccCeeeccC------chHHHHHHHHHhcc--CHHHHHHHHHHHHHhcCeEE-EEEEECCCCcEE---EEEEccc Q lcl|NC_021326. 63 KVSYIVGKPIAFKHT------DDEVIKRIDEVLGN--RFDDKLHSVLTGASNKGIEW-LHPYLDEEGEFK---LFRVPAE 130 (445) Q Consensus 63 ~~~~l~g~~~~~~~~------d~~~~~~l~~~~~n--~~~~~~~~~~~~~~~~G~~~-~~v~~d~~g~~~---i~~~~p~ 130 (445) ....+.+.+.++... +....+++++++.+ ++.. +..-..++..+|.++ +++|...+|... +.+++|. T Consensus 81 Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~-~i~~~lda~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~~ 159 (528) T protein:vir:10 81 RKRAVLGLDWTIEPPRNASAAEKADAEYLHELLLDLEGIED-LMLDCMDGVGHGYSAIELDWSLQGREWLPQAFDHRPQS 159 (528) T ss_pred HHHHHhcCCceEecCCCCCHHHHHHHHHHHHHHhCCccHHH-HHHHHHhhhhhcceeEEEEEeecCCceeEEEeeeeccc Confidence 777788888888642 12345567777754 3544 344455678888875 556654455433 3334443 Q ss_pred eeEEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEe--cCC Q lcl|NC_021326. 131 QGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPF--KNN 208 (445) Q Consensus 131 ~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~--~n~ 208 (445) .+. |+.. +.+..-.+ .....+.. -.+++.+=.++. ..+ T Consensus 160 ~f~--~~~~--~~~~l~~~---~~~~~g~~---------------------------------l~~~k~iv~~~~~~~g~ 199 (528) T protein:vir:10 160 WFQ--LNPD--DQDELRLR---DNSIAGEV---------------------------------LQPFGWIMHKPRSRSGY 199 (528) T ss_pred cee--eccC--CCcEEecc---CCCCCcee---------------------------------ecCCCeEEEeecCCCCC Confidence 221 2211 11110000 00000000 012222111111 134 Q ss_pred CCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecC---CcccchhHH---HhhhhCceeeccCCCceeeEecc- Q lcl|NC_021326. 209 DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNY---DDQELPEFK---RLLRYYGAIKVSDNGGVDTIQVE- 281 (445) Q Consensus 209 ~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~---~~~~~~~~~---~~~~~~~~~~~~~~~~~~~l~~~- 281 (445) +.|.|.+..+....---+..+.+++.-++.++.|+++.+=. +.++-.... ..+....+..++.+..++|++.. T Consensus 200 p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~L~~al~~i~~~~~~iiP~~~~ie~~ea~~ 279 (528) T protein:vir:10 200 VARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGKYPPGTPDEEKVTLLRAVTGLGHAAAGIIPESMSIDFQEASK 279 (528) T ss_pred ccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHHHHHHhhCcEEEecCCceeEEeecCC Confidence 56888888776666666677888899999999999887632 222211121 23455667778999999999865 Q ss_pred CChHHHHHHHHHHHHHHHHHhCccccccc-cccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhccCC- Q lcl|NC_021326. 282 VPVENSKKYLDELYQKIMLFGQAVDFSSD-KFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQ-ELLWFVFEHFDIKG- 358 (445) Q Consensus 282 ~~~~~~~~~i~~l~~~i~~~s~~p~~~~~-~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~~~~~~~~~~~- 358 (445) .+...++.+++.+.+.|...--.--++.+ ..++..|...-+....-....+..-.+.+...+. +++..++.+..... T Consensus 280 ~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~Alg~vh~~v~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~ 359 (528) T protein:vir:10 280 GSAEPFMAMMRWCDDSMSKAILGGTLTSQTSESGGGAYALGQVHNEVRHDLLAADARQLAATLSRDLLWPLLVLNRSGNL 359 (528) T ss_pred CChhHHHHHHHHHHHHHHHHHhhhhhhccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCC Confidence 45667889999988887765422111111 1111111111112222233444555667777774 57777777654332 Q ss_pred -CcceEEEEeCCCCCCCHHHHHHHHHHHh--cc-CChHHHHHhCCCCCCHH--HHHHHHHHHHHHHHHhhhccccCCCCC Q lcl|NC_021326. 359 -EHKDVDISFNYNKVANTELQVQTAQQSM--GI-VSHETVLENHPFVEDLQ--AELERIEQEQMEYNKQLPNLDDGGADG 432 (445) Q Consensus 359 -~~~~i~v~f~~~~p~d~~~~~~~~~~~~--g~-~s~et~l~~l~~~~d~~--~E~~ri~~E~~~~~~~~~~~~~~~~~~ 432 (445) ...-..+.|....+.|..+.++.+.+++ |+ +|.+.+.+.++. +.++ +++..- +.................. T Consensus 360 ~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~gi-p~p~~~e~~~~~--~~~~~~~~~~~~~~~~~~~ 436 (528) T protein:vir:10 360 DARRAPRLVFDLKDRADLAAMATSLPPLVKLGVQVPVNWVQEQLGI-PLPANGEAVLGD--QAGAGIAQLSRRPGPRIAA 436 (528) T ss_pred CccccceEEecCCCcccHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCCcccccC--CCcccccccCccccccccc Confidence 1233578888889999999999998875 66 999999998854 2221 111100 0000000000000000000 Q ss_pred CCCCCCCCCcCCC Q lcl|NC_021326. 433 AQQKERSNDKQSE 445 (445) Q Consensus 433 ~~~~~~~~d~~~~ 445 (445) ..+......++.+ T Consensus 437 ~~~~~~~~~~~~~ 449 (528) T protein:vir:10 437 LAQVIGPRYRDQE 449 (528) T ss_pred ccccccccccccc Confidence 0011111111111 No 146 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=98.25 E-value=2e-06 Score=51.80 Aligned_cols=418 Identities=10% Similarity=0.056 Sum_probs=186.8 Q ss_pred ChHHHHHHHHH-H---HHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C--- Q lcl|NC_021326. 1 MIVRYIKQHLE-K---LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P--- 71 (445) Q Consensus 1 ~l~~~i~~~~~-~---~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~--- 71 (445) =+++..+.++. | ..+++.+.+|.. |.+-... ....... +...++..+-+...++.+++-|++- | T Consensus 2 ~~~~r~~~L~~~R~~~e~~w~e~~~~tl-----P~~~~~~-~~~~~~~-~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~ 74 (522) T protein:vir:10 2 KARERYNQLTTARQMFLDKAVECSELTL-----PYLIDDD-ISSRPNH-KSLTVPWQSVGAKCCVTLAAKLMLAVLPPQT 74 (522) T ss_pred chHHHHHHHHHHhhHHHHHHHHHHHHhh-----hcccCCC-CCCCccc-ccccccccchHHHHHHHHHHHHHHhhcCCCC Confidence 23333333333 2 333344444432 1111110 0011111 1223466677778888888777542 2 Q ss_pred --eeeccCchH------------HHHHH--------HHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEcc Q lcl|NC_021326. 72 --IAFKHTDDE------------VIKRI--------DEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPA 129 (445) Q Consensus 72 --~~~~~~d~~------------~~~~l--------~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p 129 (445) +++...+.. +.+.+ ..+..+||...+.++.++..++|.+.++ .++++ +++++- T Consensus 75 ~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly--~~~~~---~~~~pl 149 (522) T protein:vir:10 75 SFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIF--MGKDG---LKTFPL 149 (522) T ss_pred ccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEE--EcCCC---ceEEEc Confidence 233332211 12221 1223468899999999999999998865 45554 445554 Q ss_pred ceeEEEEcCCCCCceEEEEEEEeee----------------------cceeEEEEecceEEEEEEecceeeecccccccc Q lcl|NC_021326. 130 EQGIPIWTDKEHEELEAFIRMYKLE----------------------NETKVEYWDKITVNYYVYENGSLIPDYSNNLEN 187 (445) Q Consensus 130 ~~~~~v~d~~~~~~~~~~v~~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 187 (445) .+ |.+-.+. .+++..++|.++.. ....+++++... .. .............+. T Consensus 150 ~~-y~v~~d~-~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~---p~-~~~~~~~~~~~~~~~ 223 (522) T protein:vir:10 150 TR-YVINRDG-DGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVK---LD-KSSGRWVWHQEAFDK 223 (522) T ss_pred ce-EEEeeCC-CCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEE---ee-ccCCceEEEEccCCc Confidence 45 3333443 57777666655421 111123222111 01 111111111111111 Q ss_pred -cccccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhh Q lcl|NC_021326. 188 -SKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLL 261 (445) Q Consensus 188 -~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~ 261 (445) .......-++..+|++.+. .+.+|+|-.....+-+..++...-......+....|.+.+.-.......... . T Consensus 224 ~~~~~~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~~~l~--~ 301 (522) T protein:vir:10 224 IIPDSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKPATIA--K 301 (522) T ss_pred cccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeecccccccccccc--C Confidence 1111123466677776553 3468999999999999999999888888889999998776321111111111 1 Q ss_pred hhCceeeccCCCceeeEecc--CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHH----- Q lcl|NC_021326. 262 RYYGAIKVSDNGGVDTIQVE--VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADK----- 334 (445) Q Consensus 262 ~~~~~~~~~~~~~~~~l~~~--~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~----- 334 (445) ...+.+..+..+++..+... .+.......++.++..|...-.+. ....+...+++.+......+.+...- T Consensus 302 ~~~~~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl~~---~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl 378 (522) T protein:vir:10 302 AGNGAIVQGRPEDVAVIQVGKTADFSTAANMATAIEKRLLEAFLVM---NVRNAERVTAEEVRLTQLELEQQLGGIFSLL 378 (522) T ss_pred CCCcceecCCCccceeecccccccchHHHHHHHHHHHHHHHHHhhc---cCCCCCCCCHHHHHHHHHHHHHHhhHHHHHH Confidence 22233444555666665433 456677888888888777653221 12223456787776654443333221 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCC---Cc-ceEEEEeCCCCCCCHHHHHHHHHH----HhccCChHH---------HHHh Q lcl|NC_021326. 335 LARKAKVAIQELLWFVFEHFDIKG---EH-KDVDISFNYNKVANTELQVQTAQQ----SMGIVSHET---------VLEN 397 (445) Q Consensus 335 ~~~~~~~~l~~~~~~~~~~~~~~~---~~-~~i~v~f~~~~p~d~~~~~~~~~~----~~g~~s~et---------~l~~ 397 (445) ....+.+-+.+++.++.+..-.+. +. ....|++-.++- ..+.++.+.. ++.++..+. ++.. T Consensus 379 ~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~~v~~is~La--raq~~~~l~~~~~~i~~~~~p~~~~~~id~d~~~~~ 456 (522) T protein:vir:10 379 VIEFLIPYLNRTLLVLQRSNQIPKLPKDIVRPTIVAGVNALG--RGQDRESLTAFVGTIAQTLGPEALMQYLNPLEAIKR 456 (522) T ss_pred HHHHHHHHHHHHHHHHHhcCCCCCCCccccccccccchhHHH--HHHHHHHHHHHHHHHHHhhCchhhhhcCCHHHHHHH Confidence 123334444555555443221111 11 112233333222 2222222221 111221222 2222 Q ss_pred C---CCCC-----CHHHHHHHHHHHHHHHHHh---hhccc---cCC-CCC-----CCCCCCCCCcC Q lcl|NC_021326. 398 H---PFVE-----DLQAELERIEQEQMEYNKQ---LPNLD---DGG-ADG-----AQQKERSNDKQ 443 (445) Q Consensus 398 l---~~~~-----d~~~E~~ri~~E~~~~~~~---~~~~~---~~~-~~~-----~~~~~~~~d~~ 443 (445) + -+|+ -.++|++.+++++++.... ++... +.. .++ +-+..+++.+| T Consensus 457 ~a~~~Gvp~~~ivrt~eev~~~~q~~q~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (522) T protein:vir:10 457 LAAAQGIDVLNLVKTEQQLAEEQQAAQQQAAQQSLVDQAGQMTGSPLMDPTKNPQLMDEEQPPMEE 522 (522) T ss_pred HHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCccccHHHHHHhCCCCCC Confidence 1 1222 1245555444443322221 11111 111 111 11222333333 No 147 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=98.21 E-value=2.5e-06 Score=51.28 Aligned_cols=373 Identities=12% Similarity=0.070 Sum_probs=176.2 Q ss_pred ChHHHHHHHHHHH----HHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeee-c Q lcl|NC_021326. 1 MIVRYIKQHLEKL----PEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF-K 75 (445) Q Consensus 1 ~l~~~i~~~~~~~----~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~-~ 75 (445) +++++.++..... .-...+..+|-|..-.. +... -...-+..+....+|+..++-+..-|+.+ . T Consensus 2 ~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~v----~~~~al~~~~v~~~i~~Ia~~ia~l~~~~~~ 70 (416) T protein:vir:12 2 LLERMFEKRSGSSDHEDGFNNILLNMFGGRKTAS-------GERV----SESNSLVQPDIFACVNVLSDDIAKLPIHTYK 70 (416) T ss_pred ccchhcccccCccccCccchhHHHHhhcCccccc-------Ccee----chhhhhccHHHHHHHHHHHHhhhhCceEEEE Confidence 5666655543211 11233445554431000 0000 00011223444557777777666667654 1 Q ss_pred cCch---HH--HHHHHHHhc--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCce Q lcl|NC_021326. 76 HTDD---EV--IKRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEEL 144 (445) Q Consensus 76 ~~d~---~~--~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~ 144 (445) .++. .. ......++. | ........+..+.+.+|.+|+++..+..|.+ .+..++|..+.++.++. .+.+ T Consensus 71 ~~~~~~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~-~~~~ 149 (416) T protein:vir:12 71 RTDGGIERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRPDYTNAYVHPT-TGML 149 (416) T ss_pred ecCCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEeCC-CcEE Confidence 1111 10 112222221 2 3456677788899999999999999888876 47889999888766543 2221 Q ss_pred EEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCcCccHHHHHH Q lcl|NC_021326. 145 EAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLEISDIFMYKT 220 (445) Q Consensus 145 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g~s~~~~v~~ 220 (445) +|..........+... .+++++ +...|.|.+..+.. T Consensus 150 -----~~~~~~~g~~~~~~~~-----------------------------------eiih~~~~~~~~~~G~s~i~~~~~ 189 (416) T protein:vir:12 150 -----WYQTVLNGKAIELYDY-----------------------------------EVLHFKGLSTDGIHGKSPIGVVRE 189 (416) T ss_pred -----EEEEecCCeEEEecCc-----------------------------------cEEEecCcCCCCcccccHHHHHHH Confidence 1111000000000000 012221 22357788877777 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCeeEEecCC---cccchhHHHh----hhhCceeeccCCCceeeEeccCChHHHHHHHHH Q lcl|NC_021326. 221 LIDAYNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRL----LRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDE 293 (445) Q Consensus 221 lid~~~~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~----~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~ 293 (445) .++..........+.+...+.|-.++.-.. .+....+... ....+++.++++.+.+.++.......+.+..+. T Consensus 190 ~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~ 269 (416) T protein:vir:12 190 HIGAQAAATKYNAKLYKNEATPRGILKVPAFLDEKPKENVRKEWKRVNKVENIAIIDYGLEYQSISMPLQEAQFVESMKF 269 (416) T ss_pred HHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHhcCCCeeecCCCceEEEccCChhhHHHHHHHHH Confidence 777766665555666677777766655322 1112222221 134556666766666655544444556666777 Q ss_pred HHHHHHHHhCcccccccccc-CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-----C--CCcceEEE Q lcl|NC_021326. 294 LYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI-----K--GEHKDVDI 365 (445) Q Consensus 294 l~~~i~~~s~~p~~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~-----~--~~~~~i~v 365 (445) ....|...-++|....+... ++-|. ++... ...+...|.-++..+...+.. . .....+++ T Consensus 270 ~~~~Ia~~fgVPp~~lg~~~~~t~sn--~e~~~----------~~f~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~f 337 (416) T protein:vir:12 270 NKAQISMIYKVPLHKLNELDKATFSN--IEHQS----------IEYVRNTLQPWIVNFEQELNVKLFLDHDQKSGHYVKF 337 (416) T ss_pred HHHHHHHHhCCCHHHhCCccCCCccc--HHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcCchhhcCCceEEe Confidence 77888888888865443222 22111 11111 112222333333333222211 1 11234555 Q ss_pred EeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHH---HHHHHHHHHHhhhccccCCCCCCCCCCC Q lcl|NC_021326. 366 SFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPF--VEDLQAELER---IEQEQMEYNKQLPNLDDGGADGAQQKER 438 (445) Q Consensus 366 ~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~--~~d~~~E~~r---i~~E~~~~~~~~~~~~~~~~~~~~~~~~ 438 (445) .+..-+..|..+.++.+.++ .|+++.-.++++++. +++-+.-+.. +..+.....+. ...++ ..++. T Consensus 338 d~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~n~~~~~~~~~~~~---~~~~~----~~~gg 410 (416) T protein:vir:12 338 NIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIENGDKYISSLNYVFLDFLEEYQR---LKAGG----AMKGG 410 (416) T ss_pred echhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccccchhhc---ccccc----ccCCC Confidence 56667788999999988876 489999999888754 2221110000 00000000000 00111 11111 Q ss_pred CCCcCC Q lcl|NC_021326. 439 SNDKQS 444 (445) Q Consensus 439 ~~d~~~ 444 (445) ++..+| T Consensus 411 e~~~~g 416 (416) T protein:vir:12 411 DNKNEG 416 (416) T ss_pred CCcCCC Confidence 122222 No 148 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=98.16 E-value=3.3e-06 Score=50.63 Aligned_cols=398 Identities=13% Similarity=0.059 Sum_probs=154.7 Q ss_pred ChHHHHHH------HHHHHHHHHHHHHHhcCCCcccccccccc-ccccccccccccccccchHHHHHHHHHhhhhccCee Q lcl|NC_021326. 1 MIVRYIKQ------HLEKLPEISIGQEYYEQRPDIVKEPKPVD-ATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA 73 (445) Q Consensus 1 ~l~~~i~~------~~~~~~~~~~~~~yy~G~~~i~~~~~~~~-~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~ 73 (445) =..+.+++ ++.+.+......--+.|... ...+.... ..+.......+......-...+|+..++-+.+-|+. T Consensus 66 ~~~~~~kk~~i~~pfkkk~~~~~~d~f~~s~es~-s~vtsls~pdaf~~vnVs~~~AlknsaV~scI~~IA~sIAsLPlk 144 (945) T protein:vir:10 66 RKNQVLKKEKIIVPYNHQEPPFKFNLFEYSPESL-MYLPSISDPDAFFLINLFRKYRFNNDSKLIKVSEIPKKLTSKELE 144 (945) T ss_pred hhhhHHHhhcccccccccccchhhhhhhccCccc-eecccccCccceeeehhhhhhhhccHHHHHHHHHHHhhhccCceE Confidence 01111111 22222211110001222211 00000000 000000000011111223344677777777677776 Q ss_pred e--ccCchH---------HHHHHHHHhc--cCH-------HHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEcccee Q lcl|NC_021326. 74 F--KHTDDE---------VIKRIDEVLG--NRF-------DDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQG 132 (445) Q Consensus 74 ~--~~~d~~---------~~~~l~~~~~--n~~-------~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~ 132 (445) + ..++.. ....+..++. |.. ......+..+.+.+|.+|+.+..+.+|++ .+..++|..+ T Consensus 145 lYrr~edG~~~~~~kk~~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~G~ii~L~pLdPs~V 224 (945) T protein:vir:10 145 IYKHIEDKHVNYYLKRIRDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDEQGNLVAITPVDGTTI 224 (945) T ss_pred EEEecccCcccccccccccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcce Confidence 4 111110 1112223332 211 12344567889999999999999999987 4888999998 Q ss_pred EEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC----- Q lcl|NC_021326. 133 IPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN----- 207 (445) Q Consensus 133 ~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n----- 207 (445) .+..++. +.... ++....+......+... .. ++++++ T Consensus 225 ti~~ddD--G~~~y--~Yv~~idG~~~~~v~a~--------------------------------Dv--Ilhirn~s~DG 266 (945) T protein:vir:10 225 KPILSED--TGIVV--GYVQEVDGAIVAHFDKR--------------------------------DV--VLFRQNLTPDV 266 (945) T ss_pred EEEEcCC--CcEEE--EEEEecCCceEEEecCC--------------------------------ce--EEEeccCCCCc Confidence 8766542 11111 11101000000000000 00 111111 Q ss_pred --CCCcCccHHHHHHHHHHHHHHHHHHHHHHH-HhcCCe--eEEecCC-----------cccchhHHHhhh-------hC Q lcl|NC_021326. 208 --NDLEISDIFMYKTLIDAYNRRLSDLSNTFK-DSNELT--YVLTNYD-----------DQELPEFKRLLR-------YY 264 (445) Q Consensus 208 --~~~g~s~~~~v~~lid~~~~~~s~~~~~~~-~~~~~~--l~~~g~~-----------~~~~~~~~~~~~-------~~ 264 (445) ...|.|.++.+...+.....+.....+.+. ..+.|- +.+.+.. .+....++..+. .. T Consensus 267 ~~~GyGlSPIeaa~~aI~~alAaek~aar~FskNGa~PsGILsvkg~~~~d~k~~~~LseEq~erlKe~wee~~sG~NnG 346 (945) T protein:vir:10 267 YMYGYSLPPIEILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQAIMMGDYTQ 346 (945) T ss_pred ccccCCchHHHHHHHHHHHHHHHHHHHHHHHHhCCCccceEEEecCccccccccccccCHHHHHHHHHHHHHHhCCcccc Confidence 012555555444433333222222222222 234554 3333321 111112222111 12 Q ss_pred ceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHH-HHHHHHHHHHHHH Q lcl|NC_021326. 265 GAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNL-KADKLARKAKVAI 343 (445) Q Consensus 265 ~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~-k~~~~~~~~~~~l 343 (445) +.+.++++.+.+.++.......+.+..+.....|+..-++|....+...+ .++..++........ .+.-....+...+ T Consensus 347 ~piVLdeGmef~pLs~s~~DaQfLEsrkfs~eeIArAFGVPP~lLG~~e~-st~SNiEqq~~~Fv~~tL~Pil~~IEqeL 425 (945) T protein:vir:10 347 VPILSGGKFTWIDFKGKRRDMQFKELAEFVARKICAVYQVSPQDVGILEG-SNKATAEVMASLTKAKGLEPLMATISKGF 425 (945) T ss_pred cceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccCCC-CCcchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23344555554444444445556677777778888888888654432221 222222222222211 1222222222222 Q ss_pred HHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHH---HHHH--HH Q lcl|NC_021326. 344 QELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFV--EDLQAEL---ERIE--QE 414 (445) Q Consensus 344 ~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~--~d~~~E~---~ri~--~E 414 (445) .+.+ -.......+.+.|+.....+..+.++++.++ .|+++.-.++++++.- +.-+.-+ ..+. .+ T Consensus 426 NrkL-------l~~~eg~~i~fdFd~ldl~D~ksraEal~kli~sGiLTiNEvRe~lGLpPIeGGD~lli~~nn~~P~d~ 498 (945) T protein:vir:10 426 DEVV-------SEFRNEKDIKLWFKEDDLEKERDWWNIIQGQLNTGFRSINEARMEKGLEPVPWGDVPFSGLRNWKPEDE 498 (945) T ss_pred HHhc-------cccccCceeEEEecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecccccccccc Confidence 2111 1122345678888777677888888888876 5899999998887432 1100100 0000 00 Q ss_pred HHHHHH-hhhcc-ccCCC-CCCCCCC---CCCCcCCC Q lcl|NC_021326. 415 QMEYNK-QLPNL-DDGGA-DGAQQKE---RSNDKQSE 445 (445) Q Consensus 415 ~~~~~~-~~~~~-~~~~~-~~~~~~~---~~~d~~~~ 445 (445) .....+ ..++. ....+ .+..+++ +..+..+| T Consensus 499 ~~ka~~ga~p~q~aq~~~dqp~~kGGe~dEns~~psE 535 (945) T protein:vir:10 499 QAKAQQGAMPPQLAQAMADQPSQQGGGVDENSSVPSE 535 (945) T ss_pred ccccccCCCCcccccCCCCCCCCCCCCCCCCCCCCCc Confidence 000000 00000 00000 0000000 00111111 No 149 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=98.15 E-value=3.5e-06 Score=50.48 Aligned_cols=380 Identities=8% Similarity=-0.047 Sum_probs=185.8 Q ss_pred hHH--HHHHHHHHHHHHHHHHHHhcCCCccccccccccccccc--cccccccccccchHHHHHHHHHhhhhccCeeeccC Q lcl|NC_021326. 2 IVR--YIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAV--DPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHT 77 (445) Q Consensus 2 l~~--~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~--~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~ 77 (445) |++ +.++......-.+.+..|.-|-.+ ++ +.-....... ..... . .......-.+.+....+.+.+..+.+. T Consensus 1 v~~~~l~~e~at~~~~~d~~~~~~~~l~~-~~-~~il~~a~~g~~~~y~~-l-~~D~~i~s~l~~rk~av~~~~w~i~p~ 76 (488) T protein:vir:99 1 MEKPALGREIATSGDGRDITRPFISGLQV-PN-DSILQRRGGNDLRVYEE-I-LSDAQVKTVWGQRQLAVVSREWKVEAG 76 (488) T ss_pred CCccchhHHHHHHHhhhhhhccccCCCCC-CC-hHHHHhhccCCHHHHHH-H-hhChHHHHHHHHHHHHHhcCCceEEcC Confidence 332 111111111111111223222110 00 0000000000 00000 0 123556667778888888998888753 Q ss_pred c-----hHHHHHHHHHhcc-CHHHHHHHHHHHHHhcCeEE-EEEEECCCCcEE---EEEEccceeEEEEcCCCCCceEEE Q lcl|NC_021326. 78 D-----DEVIKRIDEVLGN-RFDDKLHSVLTGASNKGIEW-LHPYLDEEGEFK---LFRVPAEQGIPIWTDKEHEELEAF 147 (445) Q Consensus 78 d-----~~~~~~l~~~~~n-~~~~~~~~~~~~~~~~G~~~-~~v~~d~~g~~~---i~~~~p~~~~~v~d~~~~~~~~~~ 147 (445) + ....+++++++++ ++...+..+. ++..+|.++ .++|...+|.+. +.+++|..+ .|+.. +.+.. T Consensus 77 ~~~~~~~~~ae~v~~~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f--~~d~~--~~l~~- 150 (488) T protein:vir:99 77 GDRPIDQAAAEHLEQQLQRVGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKVRNRRRF--RYDQD--GGLRL- 150 (488) T ss_pred CCChHHHHHHHHHHHHHhCCCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeeeecccce--eecCC--CceEE- Confidence 3 2344677777765 5666655554 678899875 556654456543 344444432 22321 11110 Q ss_pred EEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEe--cCCCCcCccHHHHHHHHHHH Q lcl|NC_021326. 148 IRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPF--KNNDLEISDIFMYKTLIDAY 225 (445) Q Consensus 148 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~--~n~~~g~s~~~~v~~lid~~ 225 (445) ... ... ......+.+++.+-.++. ..++.|.|.+..+....-.- T Consensus 151 -----------------------~~~--------~~~---~~g~~lp~~~~~i~~~~~~~~g~p~g~gLl~~~~w~~~fK 196 (488) T protein:vir:99 151 -----------------------LTP--------NNM---FEGEPCPAPYFWHFSTGADNDDEPYGLGLAHWLYWPVFFK 196 (488) T ss_pred -----------------------ecc--------CCC---CCccccccCceEEEEeecCCCCCcccchHHHHHHHHHHHH Confidence 000 000 000011122222211111 23567889998876665555 Q ss_pred HHHHHHHHHHHHHhcCCeeEEecCC-cccchhH------HHhhhhCceeeccCCCceeeEecc-CChHHHHHHHHHHHHH Q lcl|NC_021326. 226 NRRLSDLSNTFKDSNELTYVLTNYD-DQELPEF------KRLLRYYGAIKVSDNGGVDTIQVE-VPVENSKKYLDELYQK 297 (445) Q Consensus 226 ~~~~s~~~~~~~~~~~~~l~~~g~~-~~~~~~~------~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~i~~l~~~ 297 (445) +..+.+++.-++.++.|+++.+-.. ..+..+. ...+....+..++.+.++++++.. .+...++.+++.+.+. T Consensus 197 ~~~~~~w~~f~E~yG~P~~igky~~~~a~~~ek~~l~~av~~~~~~~~~viP~~~~ie~~ea~~~~~~~~~~li~~~d~~ 276 (488) T protein:vir:99 197 RNGIKFWLIFLDKFGMPTAVGRYDDKTATPEDKAKLLAALHAIQTDSAIIMPAGMQAELLEAGRSGTADYKTLHDTMDAT 276 (488) T ss_pred HhhHHHHHHHHHHcCCceeeeecCCCCCCHHHHHHHHHHHHHHhcCcEEEecCCceeEEeecCCCChHHHHHHHHHHHHH Confidence 6678888888999999998876322 1111121 234456667788999999999865 4556688999999888 Q ss_pred HHHHhCcccccccc-ccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhccCCCcceEEEEeCCCCCCCH Q lcl|NC_021326. 298 IMLFGQAVDFSSDK-FGSAPSGVALEFLYTNLNLKADKLARKAKVAIQ-ELLWFVFEHFDIKGEHKDVDISFNYNKVANT 375 (445) Q Consensus 298 i~~~s~~p~~~~~~-~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~ 375 (445) |...--.--++.+. .|+...|.. ...-....+..-.+.+...+. +++..++.+..-.. .-..+.|....+.|. T Consensus 277 Isk~iLGqtlts~~~~Gs~a~~~v---h~~v~~d~~~aDa~~i~~tln~~li~~l~~~N~~~~--~~p~~~~~~~e~edl 351 (488) T protein:vir:99 277 IAKVGLGQVASTQGTPGRLGNDDL---QADVRLDLVKADADLICESFNLGPARWLTEWNFPGA--QPPRVYRVIEEPEDI 351 (488) T ss_pred HHHHHhhhhhcccccccchhhHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcCCc--CCceeEecCCCcccH Confidence 76653111112222 122222222 222334445566677777774 47777777653222 224567777788888 Q ss_pred HHHHHHHHHHh---cc-CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 376 ELQVQTAQQSM---GI-VSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 376 ~~~~~~~~~~~---g~-~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) .+.++.+.+++ |+ ++.+.+.+.++. +.++.+ +.. ..+......+ +.....+.... T Consensus 352 ~~~a~~~~~l~~~~G~~i~~~~i~e~~Gi-p~~~~~--------~~~--~~~~~~~~~~----~~~~~~~~~~~ 410 (488) T protein:vir:99 352 TAKAERDEKVFRMSGFRPTRGYVQETYGV-EVESTQ--------AEA--TAPTPSTEFA----EGDQPSDPAAA 410 (488) T ss_pred HHHHHHHHHHHhhcCCCCCHHHHHHHcCC-CCcccc--------ccc--ccCCCcccCC----CCCCCCCchHH Confidence 88888888763 66 888888888754 221110 000 0000000000 11111111111 No 150 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=98.15 E-value=3.6e-06 Score=50.44 Aligned_cols=379 Identities=9% Similarity=0.030 Sum_probs=166.6 Q ss_pred ChHHHHHHHH--HHHH--H------HHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc Q lcl|NC_021326. 1 MIVRYIKQHL--EKLP--E------ISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK 70 (445) Q Consensus 1 ~l~~~i~~~~--~~~~--~------~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~ 70 (445) |+.++.+-+. .+.. . ...+..+.-+... .. ......-+.++-...+|+..++-+.+- T Consensus 3 ~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~-----------~~--~v~~~~al~~~~v~~~i~~ia~~ia~l 69 (432) T protein:vir:10 3 IVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPS-----------TI--SVKGKNALKVATVFACIKILSESVSKL 69 (432) T ss_pred hHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcC-----------cc--ccchhhhhccHHHHHHHHHHHHhhccC Confidence 4444322211 0000 0 0001111100000 00 000000011222334566666666566 Q ss_pred Ceeec--cCc---hHHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcC Q lcl|NC_021326. 71 PIAFK--HTD---DEVIKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTD 138 (445) Q Consensus 71 ~~~~~--~~d---~~~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~ 138 (445) |+.+- .++ ......+...+. | .-......+..+.+.+|.+|+++..+..|++ .+..++|..+.++.++ T Consensus 70 p~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~ 149 (432) T protein:vir:10 70 PLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDD 149 (432) T ss_pred ceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcC Confidence 76641 111 111111222221 2 3456677788899999999999999988986 5788899888776654 Q ss_pred CCCCceEEEEE-EEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcC Q lcl|NC_021326. 139 KEHEELEAFIR-MYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEI 212 (445) Q Consensus 139 ~~~~~~~~~v~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~ 212 (445) .. .+..-.+ +|..........+ +.. -|+|+++ ...|. T Consensus 150 ~~--~~~~~~~~~y~~~~~g~~~~~---------------------------------~~~--eiih~r~~~~~~~~~G~ 192 (432) T protein:vir:10 150 VG--LLNSKTKMWYVVNTGGQQRVL---------------------------------KPE--EILHFKNGITLDGLVGV 192 (432) T ss_pred cc--cccccceEEEEEecCCeEEEE---------------------------------ccc--cEEEecCCCCCCCcccc Confidence 21 1111000 1111000000000 000 1344432 23578 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCC---cccchhHHHhhh--------hCceeeccCCCceeeEecc Q lcl|NC_021326. 213 SDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLLR--------YYGAIKVSDNGGVDTIQVE 281 (445) Q Consensus 213 s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~~--------~~~~~~~~~~~~~~~l~~~ 281 (445) |.+......++....+..-....+...+.|-.++.... .+........+. ..+++.++.+.+.+.+..+ T Consensus 193 s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~ 272 (432) T protein:vir:10 193 PTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLN 272 (432) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccCC Confidence 88877777777666655555566677777776655322 221222222221 2345566666655555443 Q ss_pred CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----c Q lcl|NC_021326. 282 VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD-----I 356 (445) Q Consensus 282 ~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~-----~ 356 (445) .....+.+..+...+.|+..-++|....+.... .+...++... ...+...|+-++..+...+. . T Consensus 273 ~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~-~~~s~~e~~~----------~~~~~~~l~P~~~~ie~~ln~kLl~~ 341 (432) T protein:vir:10 273 MSDAQFLENTELTIRQIATAFGIKMHQLNDLSK-ATLNNIEQQQ----------QQFYTDTLQATLTMYEQEMTYKLFLD 341 (432) T ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-CCcccHHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcCh Confidence 344455666777778888888888654432211 1111122111 11222233333333322211 1 Q ss_pred CC--CcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHH-HHHHHHhhhccccCCCC Q lcl|NC_021326. 357 KG--EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQE-QMEYNKQLPNLDDGGAD 431 (445) Q Consensus 357 ~~--~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E-~~~~~~~~~~~~~~~~~ 431 (445) .. ....+++.+...+..|..+.++++.++ .|+++.-.++++++.-+-+ ..++..-- +......... ...+ T Consensus 342 ~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~--ggD~~~~~~n~~~~~~~~~---~~~k 416 (432) T protein:vir:10 342 SELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEA--GGDRLLVNGNMLPIDMAGQ---AYLK 416 (432) T ss_pred hhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CCCeEeecccccchhhccc---cccC Confidence 11 122345555667778999999998887 4899999998887542210 00000000 0000000000 0011 Q ss_pred CCCCCCCCCCcCCC Q lcl|NC_021326. 432 GAQQKERSNDKQSE 445 (445) Q Consensus 432 ~~~~~~~~~d~~~~ 445 (445) ++++.++...+.+| T Consensus 417 ~~~~~~~~~~~~~~ 430 (432) T protein:vir:10 417 GGDTNGEVSKEGNE 430 (432) T ss_pred CCCCCCCCCCCCCC Confidence 11111111111122 No 151 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=98.15 E-value=3.6e-06 Score=50.44 Aligned_cols=379 Identities=9% Similarity=0.030 Sum_probs=166.6 Q ss_pred ChHHHHHHHH--HHHH--H------HHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc Q lcl|NC_021326. 1 MIVRYIKQHL--EKLP--E------ISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK 70 (445) Q Consensus 1 ~l~~~i~~~~--~~~~--~------~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~ 70 (445) |+.++.+-+. .+.. . ...+..+.-+... .. ......-+.++-...+|+..++-+.+- T Consensus 3 ~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~-----------~~--~v~~~~al~~~~v~~~i~~ia~~ia~l 69 (432) T protein:vir:10 3 IVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPS-----------TI--SVKGKNALKVATVFACIKILSESVSKL 69 (432) T ss_pred hHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcC-----------cc--ccchhhhhccHHHHHHHHHHHHhhccC Confidence 4444322211 0000 0 0001111100000 00 000000011222334566666666566 Q ss_pred Ceeec--cCc---hHHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcC Q lcl|NC_021326. 71 PIAFK--HTD---DEVIKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTD 138 (445) Q Consensus 71 ~~~~~--~~d---~~~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~ 138 (445) |+.+- .++ ......+...+. | .-......+..+.+.+|.+|+++..+..|++ .+..++|..+.++.++ T Consensus 70 p~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~ 149 (432) T protein:vir:10 70 PLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDD 149 (432) T ss_pred ceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcC Confidence 76641 111 111111222221 2 3456677788899999999999999988986 5788899888776654 Q ss_pred CCCCceEEEEE-EEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcC Q lcl|NC_021326. 139 KEHEELEAFIR-MYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEI 212 (445) Q Consensus 139 ~~~~~~~~~v~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~ 212 (445) .. .+..-.+ +|..........+ +.. -|+|+++ ...|. T Consensus 150 ~~--~~~~~~~~~y~~~~~g~~~~~---------------------------------~~~--eiih~r~~~~~~~~~G~ 192 (432) T protein:vir:10 150 VG--LLNSKTKMWYVVNTGGQQRVL---------------------------------KPE--EILHFKNGITLDGLVGV 192 (432) T ss_pred cc--cccccceEEEEEecCCeEEEE---------------------------------ccc--cEEEecCCCCCCCcccc Confidence 21 1111000 1111000000000 000 1344432 23578 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCC---cccchhHHHhhh--------hCceeeccCCCceeeEecc Q lcl|NC_021326. 213 SDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLLR--------YYGAIKVSDNGGVDTIQVE 281 (445) Q Consensus 213 s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~~--------~~~~~~~~~~~~~~~l~~~ 281 (445) |.+......++....+..-....+...+.|-.++.... .+........+. ..+++.++.+.+.+.+..+ T Consensus 193 s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~ 272 (432) T protein:vir:10 193 PTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLN 272 (432) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccCC Confidence 88877777777666655555566677777776655322 221222222221 2345566666655555443 Q ss_pred CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----c Q lcl|NC_021326. 282 VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD-----I 356 (445) Q Consensus 282 ~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~-----~ 356 (445) .....+.+..+...+.|+..-++|....+.... .+...++... ...+...|+-++..+...+. . T Consensus 273 ~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~-~~~s~~e~~~----------~~~~~~~l~P~~~~ie~~ln~kLl~~ 341 (432) T protein:vir:10 273 MSDAQFLENTELTIRQIATAFGIKMHQLNDLSK-ATLNNIEQQQ----------QQFYTDTLQATLTMYEQEMTYKLFLD 341 (432) T ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-CCcccHHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcCh Confidence 344455666777778888888888654432211 1111122111 11222233333333322211 1 Q ss_pred CC--CcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHH-HHHHHHhhhccccCCCC Q lcl|NC_021326. 357 KG--EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQE-QMEYNKQLPNLDDGGAD 431 (445) Q Consensus 357 ~~--~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E-~~~~~~~~~~~~~~~~~ 431 (445) .. ....+++.+...+..|..+.++++.++ .|+++.-.++++++.-+-+ ..++..-- +......... ...+ T Consensus 342 ~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~--ggD~~~~~~n~~~~~~~~~---~~~k 416 (432) T protein:vir:10 342 SELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEA--GGDRLLVNGNMLPIDMAGQ---AYLK 416 (432) T ss_pred hhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CCCeEeecccccchhhccc---cccC Confidence 11 122345555667778999999998887 4899999998887542210 00000000 0000000000 0011 Q ss_pred CCCCCCCCCCcCCC Q lcl|NC_021326. 432 GAQQKERSNDKQSE 445 (445) Q Consensus 432 ~~~~~~~~~d~~~~ 445 (445) ++++.++...+.+| T Consensus 417 ~~~~~~~~~~~~~~ 430 (432) T protein:vir:10 417 GGDTNGEVSKEGNE 430 (432) T ss_pred CCCCCCCCCCCCCC Confidence 11111111111122 No 152 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=98.15 E-value=3.6e-06 Score=50.44 Aligned_cols=379 Identities=9% Similarity=0.030 Sum_probs=166.6 Q ss_pred ChHHHHHHHH--HHHH--H------HHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc Q lcl|NC_021326. 1 MIVRYIKQHL--EKLP--E------ISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK 70 (445) Q Consensus 1 ~l~~~i~~~~--~~~~--~------~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~ 70 (445) |+.++.+-+. .+.. . ...+..+.-+... .. ......-+.++-...+|+..++-+.+- T Consensus 3 ~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~-----------~~--~v~~~~al~~~~v~~~i~~ia~~ia~l 69 (432) T protein:vir:10 3 IVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPS-----------TI--SVKGKNALKVATVFACIKILSESVSKL 69 (432) T ss_pred hHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcC-----------cc--ccchhhhhccHHHHHHHHHHHHhhccC Confidence 4444322211 0000 0 0001111100000 00 000000011222334566666666566 Q ss_pred Ceeec--cCc---hHHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcC Q lcl|NC_021326. 71 PIAFK--HTD---DEVIKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTD 138 (445) Q Consensus 71 ~~~~~--~~d---~~~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~ 138 (445) |+.+- .++ ......+...+. | .-......+..+.+.+|.+|+++..+..|++ .+..++|..+.++.++ T Consensus 70 p~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~ 149 (432) T protein:vir:10 70 PLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDD 149 (432) T ss_pred ceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcC Confidence 76641 111 111111222221 2 3456677788899999999999999988986 5788899888776654 Q ss_pred CCCCceEEEEE-EEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcC Q lcl|NC_021326. 139 KEHEELEAFIR-MYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEI 212 (445) Q Consensus 139 ~~~~~~~~~v~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~ 212 (445) .. .+..-.+ +|..........+ +.. -|+|+++ ...|. T Consensus 150 ~~--~~~~~~~~~y~~~~~g~~~~~---------------------------------~~~--eiih~r~~~~~~~~~G~ 192 (432) T protein:vir:10 150 VG--LLNSKTKMWYVVNTGGQQRVL---------------------------------KPE--EILHFKNGITLDGLVGV 192 (432) T ss_pred cc--cccccceEEEEEecCCeEEEE---------------------------------ccc--cEEEecCCCCCCCcccc Confidence 21 1111000 1111000000000 000 1344432 23578 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCC---cccchhHHHhhh--------hCceeeccCCCceeeEecc Q lcl|NC_021326. 213 SDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLLR--------YYGAIKVSDNGGVDTIQVE 281 (445) Q Consensus 213 s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~~--------~~~~~~~~~~~~~~~l~~~ 281 (445) |.+......++....+..-....+...+.|-.++.... .+........+. ..+++.++.+.+.+.+..+ T Consensus 193 s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~ 272 (432) T protein:vir:10 193 PTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLN 272 (432) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccCC Confidence 88877777777666655555566677777776655322 221222222221 2345566666655555443 Q ss_pred CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----c Q lcl|NC_021326. 282 VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD-----I 356 (445) Q Consensus 282 ~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~-----~ 356 (445) .....+.+..+...+.|+..-++|....+.... .+...++... ...+...|+-++..+...+. . T Consensus 273 ~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~-~~~s~~e~~~----------~~~~~~~l~P~~~~ie~~ln~kLl~~ 341 (432) T protein:vir:10 273 MSDAQFLENTELTIRQIATAFGIKMHQLNDLSK-ATLNNIEQQQ----------QQFYTDTLQATLTMYEQEMTYKLFLD 341 (432) T ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-CCcccHHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcCh Confidence 344455666777778888888888654432211 1111122111 11222233333333322211 1 Q ss_pred CC--CcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHH-HHHHHHhhhccccCCCC Q lcl|NC_021326. 357 KG--EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQE-QMEYNKQLPNLDDGGAD 431 (445) Q Consensus 357 ~~--~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E-~~~~~~~~~~~~~~~~~ 431 (445) .. ....+++.+...+..|..+.++++.++ .|+++.-.++++++.-+-+ ..++..-- +......... ...+ T Consensus 342 ~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~--ggD~~~~~~n~~~~~~~~~---~~~k 416 (432) T protein:vir:10 342 SELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEA--GGDRLLVNGNMLPIDMAGQ---AYLK 416 (432) T ss_pred hhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CCCeEeecccccchhhccc---cccC Confidence 11 122345555667778999999998887 4899999998887542210 00000000 0000000000 0011 Q ss_pred CCCCCCCCCCcCCC Q lcl|NC_021326. 432 GAQQKERSNDKQSE 445 (445) Q Consensus 432 ~~~~~~~~~d~~~~ 445 (445) ++++.++...+.+| T Consensus 417 ~~~~~~~~~~~~~~ 430 (432) T protein:vir:10 417 GGDTNGEVSKEGNE 430 (432) T ss_pred CCCCCCCCCCCCCC Confidence 11111111111122 No 153 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=98.13 E-value=4e-06 Score=50.16 Aligned_cols=392 Identities=11% Similarity=0.028 Sum_probs=161.7 Q ss_pred ChHHHHHHHHHHH-HHHHHHHHHhcCCCcccccccccccc--ccccccccccccccchHHHHHHHHHhhhhccCeeecc- Q lcl|NC_021326. 1 MIVRYIKQHLEKL-PEISIGQEYYEQRPDIVKEPKPVDAT--GAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH- 76 (445) Q Consensus 1 ~l~~~i~~~~~~~-~~~~~~~~yy~G~~~i~~~~~~~~~~--~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~- 76 (445) ++.++..+..... ..... .+..+... ....... ......-+..-+.+.=.-.+|+..++-+.+-|+.+-- T Consensus 3 ~~~~l~~r~~~~~~~~~~~-----~~~~~~~~-~~~~~~~~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~ 76 (457) T protein:vir:13 3 FWSALFGRGHSPALDGIEA-----RAWEPYDP-SIYNLGAVAASGETVTPHDALQVSAVFASVRLLSETIATLPLSTYSK 76 (457) T ss_pred hhhhhhccccccccccccc-----ccccccch-HHHhhcccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEe Confidence 4444433322110 00000 00000000 0000000 0000000000011111223566666666566666421 Q ss_pred C--c--hHHHHHHHHHhc---cC--HHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEE Q lcl|NC_021326. 77 T--D--DEVIKRIDEVLG---NR--FDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEA 146 (445) Q Consensus 77 ~--d--~~~~~~l~~~~~---n~--~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~ 146 (445) . . +.....+-..+. |. .......+..+.+.+|.+|+.+..+ .|++ .+..++|..+.++.+...... .. T Consensus 77 ~~~~~~~~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~-~~ 154 (457) T protein:vir:13 77 RGGSRKEIVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQ-GPNIVGLDVLDPTKIHVHMVMVDGLR-RK 154 (457) T ss_pred cCCcccccccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEccCceEEEEecCCCcc-ce Confidence 1 1 111112223332 11 3355666778889999999988665 4664 577888888876544322111 11 Q ss_pred EEEEEeeeccee-E--EEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHH Q lcl|NC_021326. 147 FIRMYKLENETK-V--EYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMY 218 (445) Q Consensus 147 ~v~~~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v 218 (445) ..+.|....... . ..+... -|+++++ ...|.|.+... T Consensus 155 ~~~~y~~~~~~~~~~~~~~~~~-----------------------------------diih~~~~~~~~~~~G~s~i~~~ 199 (457) T protein:vir:13 155 VFEAYDIDADGNEVLLGWFTPR-----------------------------------DVLHIPGMMLPGDFVGCSPISYA 199 (457) T ss_pred eEEEEEEecCCceeeEEeeCcc-----------------------------------ceEEecCCCCCCccccccHHHHH Confidence 111122111110 0 000000 1222221 23578888777 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCeeEEecCC---cccchhHHHhhh--------hCceeeccCCCceeeEeccCChHHH Q lcl|NC_021326. 219 KTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLLR--------YYGAIKVSDNGGVDTIQVEVPVENS 287 (445) Q Consensus 219 ~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~~--------~~~~~~~~~~~~~~~l~~~~~~~~~ 287 (445) ...|.....+..-....+...+.|-.++.-.. .+.....+..+. ..+++.++++.+.+.+..+.....+ T Consensus 200 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~ 279 (457) T protein:vir:13 200 RESIGLALAAQKYGSKFFANGAMPGAVVEVPGTMSEEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQF 279 (457) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHH Confidence 77666666555555555667777766655322 111122222111 2345666766666655544444455 Q ss_pred HHHHHHHHHHHHHHhCcccccccccc-CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhccC-CCc Q lcl|NC_021326. 288 KKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFE-----HFDIK-GEH 360 (445) Q Consensus 288 ~~~i~~l~~~i~~~s~~p~~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~-----~~~~~-~~~ 360 (445) .+..+.....|...-++|....+... ++.++..++....... ...|.-++..+.. ++... ... T Consensus 280 ~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~----------~~tl~P~~~~ie~~ln~~L~~~~~~~~ 349 (457) T protein:vir:13 280 LQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFT----------MFSLRPWLERIEAGFNRLLFAETADRF 349 (457) T ss_pred HHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHH----------HHHHHHHHHHHHHHHHHhhcCccccCc Confidence 56666777778888888865443322 2222222332222111 1122222222211 11111 122 Q ss_pred ceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHH-------HHHHHHHHH-HHHHhhhccccC Q lcl|NC_021326. 361 KDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFV--EDLQAE-------LERIEQEQM-EYNKQLPNLDDG 428 (445) Q Consensus 361 ~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~--~d~~~E-------~~ri~~E~~-~~~~~~~~~~~~ 428 (445) ..+++.++..+..|..+.++.+.++ +|+++.-.++++++.- ++.... +..+.+.-+ +.....+..... T Consensus 350 ~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~~~~ 429 (457) T protein:vir:13 350 RFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEVGEEPEPEPAPAPPAIEPP 429 (457) T ss_pred eeEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceeeccccccccccccccccCCCCCCCCC Confidence 3355556677778999999988876 5899998888877442 221101 111111000 000000001111 Q ss_pred CCCCCCCCC----------CCCCcCCC Q lcl|NC_021326. 429 GADGAQQKE----------RSNDKQSE 445 (445) Q Consensus 429 ~~~~~~~~~----------~~~d~~~~ 445 (445) ..++..+++ .+.+++.| T Consensus 430 ~~~~~~~~~~~g~~d~~~~~~~~~~~~ 456 (457) T protein:vir:13 430 AEEPDEEPEPEGKPDDEGATEEDDEDD 456 (457) T ss_pred ccccCCCCCCCCCCccccCCCCccccc Confidence 111111111 11111111 No 154 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=98.13 E-value=4e-06 Score=50.15 Aligned_cols=381 Identities=10% Similarity=0.036 Sum_probs=163.4 Q ss_pred ChHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLE-KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~-~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) +..+.-++-.. .......+....-|..... . .......-+.+.=.-.+|+..++-+.+-|+.+..+.. T Consensus 3 ~f~~~~~r~~~~~~~~~~~~~~~~~~~~~~~---------~--~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~ 71 (416) T protein:vir:81 3 IFYKNEKRDLQYNEDDLQMMVQTLPGFQGTK---------L--RQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQ 71 (416) T ss_pred cccccccccccCCCcchhHHHHHhccccccC---------c--cccchhhhhcchHHHHHHHHHHHhhccCceEEecCcc Confidence 32221100000 0000011111111110000 0 0000000011111122566666666666776643222 Q ss_pred H-HHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEEEE Q lcl|NC_021326. 80 E-VIKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMY 151 (445) Q Consensus 80 ~-~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~ 151 (445) . ....+-..+. | ....+...+....+.+|.+|+.+..+..|++ .+..++|..+.++.+. .+.+.+.... T Consensus 72 ~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~--~g~~~~~~~~- 148 (416) T protein:vir:81 72 INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDA--RGRLYYFHQR- 148 (416) T ss_pred ccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECC--CccEEEEEEE- Confidence 1 1111222221 2 2345666778888999999999999998986 4788899998877654 2333221111 Q ss_pred eeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCcCccHHHHHHHHHHHHH Q lcl|NC_021326. 152 KLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLEISDIFMYKTLIDAYNR 227 (445) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g~s~~~~v~~lid~~~~ 227 (445) ....... ....+ +... |++++ +...|.|.+......++.... T Consensus 149 -~~~~~~~------~~~~~-------------------------~~~e--vihir~~~~d~~~G~s~i~~~~~~i~~~~~ 194 (416) T protein:vir:81 149 -IDSNGNN------IERNV-------------------------KFED--MLDIKFYSLDGINGLSLLDTLSRTIESDNN 194 (416) T ss_pred -ecCCCce------eEEEE-------------------------cccc--EEEeccCCCCCccccCHHHHHHHHHHHHHH Confidence 1111000 00000 0000 22222 123577888777777766555 Q ss_pred HHHHHHHHHHHhcCCeeEEe--cCCcc-cc-hhHHHhh----h----hCceeeccCCCceeeEeccCChHHHHHHHHHHH Q lcl|NC_021326. 228 RLSDLSNTFKDSNELTYVLT--NYDDQ-EL-PEFKRLL----R----YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELY 295 (445) Q Consensus 228 ~~s~~~~~~~~~~~~~l~~~--g~~~~-~~-~~~~~~~----~----~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~ 295 (445) ...-..+.+...+.|-.++. |...+ .. +..+..+ . .++++.++++.+.+.++.+.....+.+..+..+ T Consensus 195 ~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~ 274 (416) T protein:vir:81 195 GKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSST 274 (416) T ss_pred HHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHH Confidence 54555555666777766654 33221 11 1111111 1 223556666655555544444445666677777 Q ss_pred HHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hccCCCcceEEEEeCCCC Q lcl|NC_021326. 296 QKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEH----FDIKGEHKDVDISFNYNK 371 (445) Q Consensus 296 ~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~----~~~~~~~~~i~v~f~~~~ 371 (445) +.|+..-++|....+...++.|.+..... |...|.-++..+... +........+++.+...+ T Consensus 275 ~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~--------------~~~~l~P~~~~ie~~ln~~l~~~~~~~~~~f~~~~l~ 340 (416) T protein:vir:81 275 REIAGVFGIPLHKFGIETANMSITDANLD--------------YLSTLKPYITCVCAELNFKFNDEYVNREFKFDTTEIR 340 (416) T ss_pred HHHHHHhCCCHHHcCCCCCCccHHHHHHH--------------HHHHHHHHHHHHHHHHhhhccccccCceEEEechhhh Confidence 88888888886443321122221211111 111222222222211 111112234555555556 Q ss_pred CCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 372 VANTELQVQTAQQS--MGIVSHETVLENHPF--VEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 372 p~d~~~~~~~~~~~--~g~~s~et~l~~l~~--~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) -.|..+.++.+.++ .|+++.-.++++++. +++.+...-.+...-.. .+............. ...-.++++.| T Consensus 341 ~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~-~~~~~~~~~~~~~~~-~~~~kgGe~n~ 416 (416) T protein:vir:81 341 VVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVN-IELVDEYQMNKSRAT-DKKLKGGEENE 416 (416) T ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccc-cccccccCccccccc-ccccCCCCCCC Confidence 67888899888876 589999999988744 22322211111100000 000000000000100 11111222222 No 155 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=98.13 E-value=4e-06 Score=50.15 Aligned_cols=381 Identities=10% Similarity=0.036 Sum_probs=163.4 Q ss_pred ChHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLE-KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~-~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) +..+.-++-.. .......+....-|..... . .......-+.+.=.-.+|+..++-+.+-|+.+..+.. T Consensus 3 ~f~~~~~r~~~~~~~~~~~~~~~~~~~~~~~---------~--~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~ 71 (416) T protein:vir:45 3 IFYKNEKRDLQYNEDDLQMMVQTLPGFQGTK---------L--RQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQ 71 (416) T ss_pred cccccccccccCCCcchhHHHHHhccccccC---------c--cccchhhhhcchHHHHHHHHHHHhhccCceEEecCcc Confidence 32221100000 0000011111111110000 0 0000000011111122566666666666776643222 Q ss_pred H-HHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEEEE Q lcl|NC_021326. 80 E-VIKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMY 151 (445) Q Consensus 80 ~-~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~ 151 (445) . ....+-..+. | ....+...+....+.+|.+|+.+..+..|++ .+..++|..+.++.+. .+.+.+.... T Consensus 72 ~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~--~g~~~~~~~~- 148 (416) T protein:vir:45 72 INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDA--RGRLYYFHQR- 148 (416) T ss_pred ccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECC--CccEEEEEEE- Confidence 1 1111222221 2 2345666778888999999999999998986 4788899998877654 2333221111 Q ss_pred eeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCcCccHHHHHHHHHHHHH Q lcl|NC_021326. 152 KLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLEISDIFMYKTLIDAYNR 227 (445) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g~s~~~~v~~lid~~~~ 227 (445) ....... ....+ +... |++++ +...|.|.+......++.... T Consensus 149 -~~~~~~~------~~~~~-------------------------~~~e--vihir~~~~d~~~G~s~i~~~~~~i~~~~~ 194 (416) T protein:vir:45 149 -IDSNGNN------IERNV-------------------------KFED--MLDIKFYSLDGINGLSLLDTLSRTIESDNN 194 (416) T ss_pred -ecCCCce------eEEEE-------------------------cccc--EEEeccCCCCCccccCHHHHHHHHHHHHHH Confidence 1111000 00000 0000 22222 123577888777777766555 Q ss_pred HHHHHHHHHHHhcCCeeEEe--cCCcc-cc-hhHHHhh----h----hCceeeccCCCceeeEeccCChHHHHHHHHHHH Q lcl|NC_021326. 228 RLSDLSNTFKDSNELTYVLT--NYDDQ-EL-PEFKRLL----R----YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELY 295 (445) Q Consensus 228 ~~s~~~~~~~~~~~~~l~~~--g~~~~-~~-~~~~~~~----~----~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~ 295 (445) ...-..+.+...+.|-.++. |...+ .. +..+..+ . .++++.++++.+.+.++.+.....+.+..+..+ T Consensus 195 ~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~ 274 (416) T protein:vir:45 195 GKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSST 274 (416) T ss_pred HHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHH Confidence 54555555666777766654 33221 11 1111111 1 223556666655555544444445666677777 Q ss_pred HHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hccCCCcceEEEEeCCCC Q lcl|NC_021326. 296 QKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEH----FDIKGEHKDVDISFNYNK 371 (445) Q Consensus 296 ~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~----~~~~~~~~~i~v~f~~~~ 371 (445) +.|+..-++|....+...++.|.+..... |...|.-++..+... +........+++.+...+ T Consensus 275 ~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~--------------~~~~l~P~~~~ie~~ln~~l~~~~~~~~~~f~~~~l~ 340 (416) T protein:vir:45 275 REIAGVFGIPLHKFGIETANMSITDANLD--------------YLSTLKPYITCVCAELNFKFNDEYVNREFKFDTTEIR 340 (416) T ss_pred HHHHHHhCCCHHHcCCCCCCccHHHHHHH--------------HHHHHHHHHHHHHHHHhhhccccccCceEEEechhhh Confidence 88888888886443321122221211111 111222222222211 111112234555555556 Q ss_pred CCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 372 VANTELQVQTAQQS--MGIVSHETVLENHPF--VEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 372 p~d~~~~~~~~~~~--~g~~s~et~l~~l~~--~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) -.|..+.++.+.++ .|+++.-.++++++. +++.+...-.+...-.. .+............. ...-.++++.| T Consensus 341 ~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~-~~~~~~~~~~~~~~~-~~~~kgGe~n~ 416 (416) T protein:vir:45 341 VVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVN-IELVDEYQMNKSRAT-DKKLKGGEENE 416 (416) T ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccc-cccccccCccccccc-ccccCCCCCCC Confidence 67888899888876 589999999988744 22322211111100000 000000000000100 11111222222 No 156 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=98.12 E-value=4.3e-06 Score=50.03 Aligned_cols=382 Identities=10% Similarity=0.004 Sum_probs=184.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccc-cccc-ccccccccccccchHHHHHHHHHhhhhccCeeecc-- Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVD-ATGA-VDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH-- 76 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~-~~~~-~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~-- 76 (445) .+++-.. .++.--....++.-+- -.+.+....- .... ..... .. .......-.+++...-+.+.+..+.+ T Consensus 17 ~~~~~~~---~~ia~~~~~~~~~~~~-~~~~~~~~iLr~~~~~~~~y~-~m-~~D~~i~s~l~~Rk~av~~~~w~i~~~~ 90 (491) T protein:vir:10 17 EPDKSLS---SQIATRARSIDFFALG-MYLPNPDPVLKALGKDIRVYR-EL-RADAHVGGCVRRRKAAVKALEWGLDRGK 90 (491) T ss_pred cCChHHH---HHHHhhhccccccccc-CCccchHHHHHhcCCCHHHHH-HH-hhChHHHHHHHHHHHHHhCCCcEEecCC Confidence 1111111 1111000011111000 0011111110 0000 00000 00 12455666777777888899988875 Q ss_pred CchHHHHHHHHHhcc-CHHHHHHHHHHHHHhcCeEE-EEEEECCCCcEE---EEEEccceeEEEEcCCCCCceEEEEEEE Q lcl|NC_021326. 77 TDDEVIKRIDEVLGN-RFDDKLHSVLTGASNKGIEW-LHPYLDEEGEFK---LFRVPAEQGIPIWTDKEHEELEAFIRMY 151 (445) Q Consensus 77 ~d~~~~~~l~~~~~n-~~~~~~~~~~~~~~~~G~~~-~~v~~d~~g~~~---i~~~~p~~~~~v~d~~~~~~~~~~v~~~ 151 (445) +++...+++.++++. ++...+..+ .++..||.++ .++|...+|... +.+++|..+ .|+.. +.+.. T Consensus 91 ~~~~~~e~v~e~l~~~~~~~~l~~~-lda~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f--~~d~~--~~l~~----- 160 (491) T protein:vir:10 91 AKSRVAKSIADVFADLDLSRIVTEM-LDAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWF--VYDPE--NQLRF----- 160 (491) T ss_pred CCHHHHHHHHHHHhcCCHHHHHHHH-HHhhhhcceeEEEEEeecCCeeEEEEeeeecccce--eeccC--CceEE----- Confidence 344566778887764 566666655 4788899975 556655455543 444555433 23321 11111 Q ss_pred eeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEe--cCCCCcCccHHHHHHHHHHHHHHH Q lcl|NC_021326. 152 KLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPF--KNNDLEISDIFMYKTLIDAYNRRL 229 (445) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~--~n~~~g~s~~~~v~~lid~~~~~~ 229 (445) ...... .. . ....+++.|-.++- ..++.|.|.+..+....-.-+..+ T Consensus 161 -------------------~~~~~~--------~~-g---~~l~~~k~i~~~~~~~~~~p~g~gLl~~~~w~~~fK~~~~ 209 (491) T protein:vir:10 161 -------------------RSKDHW--------MQ-G---EELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGL 209 (491) T ss_pred -------------------ecCCCC--------CC-c---ceecCCCEEEEEecCCCCCcccchhHHHHHHHHHHHHHHH Confidence 000000 00 0 00112222211211 135678999988777777777788 Q ss_pred HHHHHHHHHhcCCeeEEecCCcccchhH------HHhhhhCceeeccCCCceeeEeccC---ChHHHHHHHHHHHHHHHH Q lcl|NC_021326. 230 SDLSNTFKDSNELTYVLTNYDDQELPEF------KRLLRYYGAIKVSDNGGVDTIQVEV---PVENSKKYLDELYQKIML 300 (445) Q Consensus 230 s~~~~~~~~~~~~~l~~~g~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~l~~~~---~~~~~~~~i~~l~~~i~~ 300 (445) .+++.-++.++.|+++.+-......++. ...+....+..++.+.++++++... +...++.+++.+.+.|.. T Consensus 210 ~~w~~f~E~yG~P~~igky~~~a~~~ek~~l~~al~~~~~~a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk 289 (491) T protein:vir:10 210 KFWVQFTEKYGSPMLVGKHPRSASDGEKNLLLDCLEDMVQDAVAVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSI 289 (491) T ss_pred HHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHHHHHHhcCcEEEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHH Confidence 8999999999999988764322222221 2334556677889999999998653 345688888888887766 Q ss_pred HhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHH Q lcl|NC_021326. 301 FGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQ 380 (445) Q Consensus 301 ~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~ 380 (445) .--.-.++.+..|+...|..- ..-....+..-.+.+...+.++++-++.+...... ...+.|.... .+....++ T Consensus 290 ~iLGqtlTt~~~gs~a~~~vh---~~v~~di~~~D~~~i~~tln~li~~l~~~N~~~~~--~p~f~~~~~~-e~~~~~a~ 363 (491) T protein:vir:10 290 ALLGQNQTTEATSTRASAQAG---LEVTDDIRDGDKAVVSEAMNMLIRWICDLNFDGAD--RPVFDMWEQE-QVDEIQAG 363 (491) T ss_pred HHhhhhcccCcccchhHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC--cceEEecCcC-chhHHHHH Confidence 431111122222222222221 12233334444567777788888877777655433 3445665432 33356778 Q ss_pred HHHHHh--cc-CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 381 TAQQSM--GI-VSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 381 ~~~~~~--g~-~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) .+.+++ |+ ++.+.+.+.++. +.++.+-. .....+......... .+....+.++-+ T Consensus 364 ~~~~L~~~G~~i~~~~i~e~~Gi-p~~~~~~~--------~~~~~~~~~~~~~~~-~~~~~~~~~~~d 421 (491) T protein:vir:10 364 RDQKLTQAGARFTPAYFKRAYNL-QDGDLDER--------PLPVSAVDTVGAASF-AEFEAPDQDALD 421 (491) T ss_pred HHHHHHhCCCcCCHHHHHHHhCC-CCCCcCcc--------ccccCCCCCcccccc-cccCCCCCCchH Confidence 787765 66 888888888854 32211100 000000000000000 111111111111 No 157 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=98.11 E-value=4.3e-06 Score=49.98 Aligned_cols=393 Identities=8% Similarity=0.012 Sum_probs=179.8 Q ss_pred ChHHHHHHHHH-------HHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCee Q lcl|NC_021326. 1 MIVRYIKQHLE-------KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA 73 (445) Q Consensus 1 ~l~~~i~~~~~-------~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~ 73 (445) +=.--+++.+. .-.-++-+ .+|.|-.= +.++. ..... .++..+.++.+.+..+..+-+. T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~~~~~F-~Gy~~-----la~la-------Q~~eyr~~~~~ia~e~~R~w~~ 142 (698) T protein:vir:10 77 VSNYTPRERRAASYALDFNGTSMDAL-SFVTSSGF-PGFPT-----LVLLA-------QLPEYRAMHEVLADECIRTWGE 142 (698) T ss_pred cccCCccccchhhhhhcccccccccc-hhhhccCc-chHHH-----HHHHh-------hccchhhHHHHHHHHhhcccce Confidence 00000000000 00111111 23333210 00000 00000 0122233333433333222222 Q ss_pred ec-------------------c-CchHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCc----------- Q lcl|NC_021326. 74 FK-------------------H-TDDEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGE----------- 121 (445) Q Consensus 74 ~~-------------------~-~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~----------- 121 (445) ++ . .+.+..+.|..-++ =+....+.+..+.+-.||.+..++-++.++. T Consensus 143 ~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eai~~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~ 222 (698) T protein:vir:10 143 AIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPY 222 (698) T ss_pred eccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEEeecCccccccccccccc Confidence 11 1 12234445544433 3678888999999999999987776644331 Q ss_pred ------E-EEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEE-Eecceeeecccccccccccccc Q lcl|NC_021326. 122 ------F-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYV-YENGSLIPDYSNNLENSKTHFS 193 (445) Q Consensus 122 ------~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 193 (445) + .+.+++|..+.|-.-+. .++ ..-.+|.+..+...- ....++... T Consensus 223 ~I~kGslKGL~ViDp~~vtP~~~n~--~dP------------~spdfgkP~~y~V~G~~IH~SRL~~------------- 275 (698) T protein:vir:10 223 TVPKGSFQGLRVVEPYWVTPNNYNS--INP------------VADDFYKPSTWWMIGSEVHATRLHT------------- 275 (698) T ss_pred cccCccceeeeeecccccccchhhh--ccc------------hhhccCCCceEEEecceecceeEEE------------- Confidence 1 15666666665521110 000 000111111111000 000000000 Q ss_pred cccccccceEE-ecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecC----CcccchhHH------Hhh- Q lcl|NC_021326. 194 TGSWGKIPFIP-FKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNY----DDQELPEFK------RLL- 261 (445) Q Consensus 194 ~~~~g~iPvv~-~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~----~~~~~~~~~------~~~- 261 (445) ..-..+|-.. -..+..|.|....+.+-+++.+++.-.....+..+....+. +++ ......+.. ... T Consensus 276 -~vg~pvpd~LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~~~~l~-~dla~aL~~g~~~~l~~R~eli~~~R 353 (698) T protein:vir:10 276 -IVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGIL-MDLAQALTPGANVDLSMRAELINRYR 353 (698) T ss_pred -ecCCCchhhhcchhccCCccHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHH-HHHHHhcCChhhHHHHHHHHHHHHhc Confidence 0000112111 01234688888888888888888776666655444333321 111 000111111 111 Q ss_pred hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCcccccccc--ccC-cchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 262 RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDK--FGS-APSGVALEFLYTNLNLKADKLARK 338 (445) Q Consensus 262 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~--~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~ 338 (445) ...+++.++ +.+=+|.+.+.+...+...+.+...+|...+++|-.-+.+ -+| |.||++-...+.+.+. ...+.. T Consensus 354 sn~G~~llD-k~~Eefeq~st~lSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~--s~Qe~~ 430 (698) T protein:vir:10 354 DNRNILFLD-KATEEFFQFNTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVR--AYQRNA 430 (698) T ss_pred CccceEEEe-cCCcceEEEecCcCCHHHHHHHHHHHHHhhhcCchhhhhccCCcccCccchhhHHHHHHHHH--HHHHHH Confidence 233445554 3334677888899999999999999999999999643332 223 6888887777777765 344788 Q ss_pred HHHHHHHHHHHHHHHh-ccCCCcceEEEEeCCCCCCCHHHHHHHHHH---------HhccCChHHHHHhC------CCC- Q lcl|NC_021326. 339 AKVAIQELLWFVFEHF-DIKGEHKDVDISFNYNKVANTELQVQTAQQ---------SMGIVSHETVLENH------PFV- 401 (445) Q Consensus 339 ~~~~l~~~~~~~~~~~-~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~---------~~g~~s~et~l~~l------~~~- 401 (445) +...+++++.++..-. |.. + .++.++|+|-...++.+.|++-.+ ..|+++...+..+| +|. T Consensus 431 L~p~L~rl~~ii~rS~~G~i-d-p~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~ 508 (698) T protein:vir:10 431 LQQLMNDVIVMIQLSLFGAV-D-PSIKWQWNALRELDDLEVAEARYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAG 508 (698) T ss_pred HHHHHHHHHHHHHHHhcCCC-C-CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhccCCCcccc Confidence 9999999999876543 443 2 368899999999999998886443 24778877766655 221 Q ss_pred -CCHHHHH-----HHHHHHHHHHHHhhhccccCCCC-------CCCCCCCCCCcCCC Q lcl|NC_021326. 402 -EDLQAEL-----ERIEQEQMEYNKQLPNLDDGGAD-------GAQQKERSNDKQSE 445 (445) Q Consensus 402 -~d~~~E~-----~ri~~E~~~~~~~~~~~~~~~~~-------~~~~~~~~~d~~~~ 445 (445) .|.+.+- ..++.+.. ..+...-+++. ++.+....+-...+ T Consensus 509 ~~d~~d~p~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 561 (698) T protein:vir:10 509 KLDANDDPGAPADDDIDGVLT----YVQRMAEGGDTGAPTAPGGARAGATAPPAAAN 561 (698) T ss_pred ccCCcccCCCCCCCcchHHHh----hhcCCcCCCCcccccccccccCCCCCCccccc Confidence 1111110 01111110 00000000000 00011111111111 No 158 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=98.10 E-value=4.6e-06 Score=49.82 Aligned_cols=423 Identities=8% Similarity=0.024 Sum_probs=180.7 Q ss_pred ChHHHHHHHHHHH----HHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C--- Q lcl|NC_021326. 1 MIVRYIKQHLEKL----PEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P--- 71 (445) Q Consensus 1 ~l~~~i~~~~~~~----~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~--- 71 (445) .+++..+.++.+. .+++.+.+|.. |.+-.. .+. ...+...++..+-+...++++++.|++- | T Consensus 4 ~~~~r~~~l~~~R~~~e~~w~e~~~y~l-----P~~~~~-~~~---~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~ 74 (555) T protein:vir:17 4 SAQAKYMMLRADREDYLDSGRQSARLTL-----PYILTD-EGH---VQGGYLPTPWQSVGSKGVNVLASKLMLSLFPVNT 74 (555) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHhc-----ccccCC-CCC---cccccccccccccHHHHHHHHHHHHHHhhcCCCC Confidence 4445555554332 33334444432 111111 010 1111223456677778888888777642 2 Q ss_pred --eeeccCc---------hHHHHHHHH------------HhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEc Q lcl|NC_021326. 72 --IAFKHTD---------DEVIKRIDE------------VLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVP 128 (445) Q Consensus 72 --~~~~~~d---------~~~~~~l~~------------~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~ 128 (445) +++...+ +.....++. +..+||...+.++.++..++|.+.++ .++++ +++++ T Consensus 75 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly--~~~~~---~~~~p 149 (555) T protein:vir:17 75 SFFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALLY--QGKKN---LKLYP 149 (555) T ss_pred cccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEE--ecCCc---eeEEE Confidence 2333332 122222222 22368889999999999999998764 45554 44454 Q ss_pred cceeEEEEcCCCCCceEEEEEEEeeecce------------------------eE---------EEEecceEEEEE---E Q lcl|NC_021326. 129 AEQGIPIWTDKEHEELEAFIRMYKLENET------------------------KV---------EYWDKITVNYYV---Y 172 (445) Q Consensus 129 p~~~~~v~d~~~~~~~~~~v~~~~~~~~~------------------------~~---------~~~~~~~~~~~~---~ 172 (445) -.+ |.+-.|. .+++..++|.++..-.. .. .......+..|. . T Consensus 150 l~~-y~v~~d~-~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t~~~~ 227 (555) T protein:vir:17 150 LDR-FVVSRDG-EGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTYVCR 227 (555) T ss_pred cCe-EEEeeCC-CcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcceeEeecccc Confidence 444 3343433 57777766655421100 00 000000111110 0 Q ss_pred ecceeeecccccccccccccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEe Q lcl|NC_021326. 173 ENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLT 247 (445) Q Consensus 173 ~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~ 247 (445) ..+......+............-+|..+|++.+. ++.+|+|-..+..+-+..+|...-......+....|.+.+. T Consensus 228 ~~~~~~~~~e~~~~~v~~~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~lv~ 307 (555) T protein:vir:17 228 KDGQVKWHQECDGKVIPGSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVVFMVS 307 (555) T ss_pred cCCeeEEEEecCceeccccccccCcccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec Confidence 0000000000000000001123456667776553 35689999999999999999998889999999999987652 Q ss_pred cCCcccchhHHHhhhhCceeeccCCCceeeEecc--CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHH Q lcl|NC_021326. 248 NYDDQELPEFKRLLRYYGAIKVSDNGGVDTIQVE--VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLY 325 (445) Q Consensus 248 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~--~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~ 325 (445) -.......+.. ....+.+..+..++++.+... .+.......++.++..|...-.+- + ...+...+|+.+.... T Consensus 308 ~~g~~~~~~l~--~~~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~~--~-~~d~~r~TAtEV~~r~ 382 (555) T protein:vir:17 308 PSATTKPQNLA--LAANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLML--Q-VRQSERTTATEVQATV 382 (555) T ss_pred cccccCcceee--cCCCceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHHHhhc--C-CCCcccchHHHHHHHH Confidence 11111111111 111223333344456655433 355666777777777665543221 1 1223456777776544 Q ss_pred HHHHHHHHHH-----HHHHHHHHHHHHHHHHHHhccC---CCcceEEEEeCCCCCCCHHHHHH----HHHHHhcc----- Q lcl|NC_021326. 326 TNLNLKADKL-----ARKAKVAIQELLWFVFEHFDIK---GEHKDVDISFNYNKVANTELQVQ----TAQQSMGI----- 388 (445) Q Consensus 326 ~~l~~k~~~~-----~~~~~~~l~~~~~~~~~~~~~~---~~~~~i~v~f~~~~p~d~~~~~~----~~~~~~g~----- 388 (445) ..+.....-. ...+.+-+.+++.++.+..-.+ .+...+++.- +.......+.++ .++.++.+ T Consensus 383 ~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~v~~~i~~-~l~~l~r~~~~~~l~~~~~~laq~~~~p~ 461 (555) T protein:vir:17 383 QELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRKLPQLPKDLVQPTVVA-GLWGVGRGQDKQQLMEFITTLAQTMGPEI 461 (555) T ss_pred HHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCHhhhccceee-hHHHHHHHHHHHHHHHHHHHHHhhcCchh Confidence 4333333221 2333344444555444422221 1222222221 111111111122 12222222 Q ss_pred ----CChHHHHH----hCCC----CCCHHHHHHHHHHHHHHHHHhhhccc------cC----CC--CC------------ Q lcl|NC_021326. 389 ----VSHETVLE----NHPF----VEDLQAELERIEQEQMEYNKQLPNLD------DG----GA--DG------------ 432 (445) Q Consensus 389 ----~s~et~l~----~l~~----~~d~~~E~~ri~~E~~~~~~~~~~~~------~~----~~--~~------------ 432 (445) +....++. .++. +-..++|++++++++++.+++..... .. .+ .+ T Consensus 462 ~~d~id~d~~~~~~a~~~Gv~p~~ivrs~eev~~~rq~~~~~~~q~~~~~qa~~~~~~~~~~~~~~~~~~~~~~a~~~~~ 541 (555) T protein:vir:17 462 AMKYINPTEFIKRLAAAQGIDTLQLINSPETMKQLGDQQKQDMVQASLINQAGQLAKTPMAEQAMQLIQQQQEGAQDAGA 541 (555) T ss_pred HhhcCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHhccccchhhhhHHHH Confidence 22233332 3322 11234666665544332221111100 00 00 00 Q ss_pred -CCCCCCCCCcCCC Q lcl|NC_021326. 433 -AQQKERSNDKQSE 445 (445) Q Consensus 433 -~~~~~~~~d~~~~ 445 (445) ..+....+.-++- T Consensus 542 a~~~~~~~~~~~~~ 555 (555) T protein:vir:17 542 AESETSSAEAQAGA 555 (555) T ss_pred HHhhcCCcccccCC Confidence 0000000000000 No 159 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=98.06 E-value=5.5e-06 Score=49.42 Aligned_cols=383 Identities=9% Similarity=0.013 Sum_probs=170.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccc---cccccccccccccchHHHHHHHHHhhhhccCeeeccC Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATG---AVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHT 77 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~---~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~ 77 (445) ++.++.++.+..-. ....+.....+.-......... .........-+..+-...+|+..++-+.+-|+.+--. T Consensus 3 ~f~~lf~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~al~~~~v~~ci~~ia~~iA~lp~~~~~~ 78 (422) T protein:vir:13 3 FLRGLFNKKNNNDE----KRSNYDEDIGIDISDSNFWEKFGIKLNFSVRGKRALKENTVYVCTKIRAESIGKLSLKIYKD 78 (422) T ss_pred hhhhhhhccCCccc----hhhhhhhccccccCcchhhhhccccCCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEec Confidence 55555444321100 0000000000000000000000 0000000000222333456666666666667765221 Q ss_pred -----chHHHHHHHHHhcc---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEE Q lcl|NC_021326. 78 -----DDEVIKRIDEVLGN---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFI 148 (445) Q Consensus 78 -----d~~~~~~l~~~~~n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v 148 (445) +......|.. .-| ........+..+.+.+|.+|+.+..+..|++ .+..++|..+.++.+++.......-+ T Consensus 79 ~~~~~~~~~~~lL~~-~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~~~~~~~~~~~~~~~~ 157 (422) T protein:vir:13 79 KEEYKEHELYYLLRY-KPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNVTKIIDDDNFLSSLSKV 157 (422) T ss_pred CcccccchHHHHHhh-hcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCcceeccceE Confidence 2222222221 112 2346677788899999999999999988886 47889999998887653211000000 Q ss_pred EEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec-----CCCCcCccHHHHHHHHH Q lcl|NC_021326. 149 RMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLID 223 (445) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid 223 (445) +| .. ....+.... . ... -++++. +...|.|.+..+...++ T Consensus 158 ~y-~~-----------------~~~~g~~~~--------~-------~~~--eiih~~~~~~~~~~~G~s~~~~~~~~i~ 202 (422) T protein:vir:13 158 WY-VV-----------------TDKNGKEHK--------L-------LPD--EMLHFIGDITLDGLIGIKPLDYLRCTIE 202 (422) T ss_pred EE-EE-----------------EeCCCeEEE--------E-------ccc--ceEEEcCCCCCCCcccccHHHHHHHHHH Confidence 00 00 000000000 0 000 122222 12357888887777777 Q ss_pred HHHHHHHHHHHHHHHhcCCeeEEecC---CcccchhHHHhhh--------hCceeeccCCCceeeEeccCChHHHHHHHH Q lcl|NC_021326. 224 AYNRRLSDLSNTFKDSNELTYVLTNY---DDQELPEFKRLLR--------YYGAIKVSDNGGVDTIQVEVPVENSKKYLD 292 (445) Q Consensus 224 ~~~~~~s~~~~~~~~~~~~~l~~~g~---~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~ 292 (445) ....+..-....+...+.|-.++.-. +.+........+. ..+++.++++.+++.+........+.+..+ T Consensus 203 ~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~ 282 (422) T protein:vir:13 203 NGRATQEFINKFFKNGLSIKGIVQYVGDLDEKAKKIFKKEFESMSNGLENAHSISLLPFGYQFQPISLSMADAQFLENSK 282 (422) T ss_pred HHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceeeeccCChhHHHHHHHHH Confidence 66666555566666667776665432 2222222222221 223555666666555554444455666677 Q ss_pred HHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hccCCC--cceEEE Q lcl|NC_021326. 293 ELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEH-----FDIKGE--HKDVDI 365 (445) Q Consensus 293 ~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~-----~~~~~~--~~~i~v 365 (445) .....|+..-++|....+...+ .+...++.... ..+...|.-+++.+... +....- ...+++ T Consensus 283 ~~~~~Ia~~fgVpp~~lg~~~~-~~~sn~e~~~~----------~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~f 351 (422) T protein:vir:13 283 LTKRELAATFGMKSYHLNDLER-ATFNNLTEQQK----------DFYVTTLQSSLTVYEQEIQDKLFSQYETLQDVKAEF 351 (422) T ss_pred HHHHHHHHHhCCCHHHhCCCCC-CCcccHHHHHH----------HHHHHHHHHHHHHHHHHHHHhhCChhhhcCCceEEe Confidence 7778888888888754432211 11112221111 11222232222222221 111111 123444 Q ss_pred EeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcC Q lcl|NC_021326. 366 SFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQ 443 (445) Q Consensus 366 ~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 443 (445) .+...+..|..+.++.+.++ .|+++.-.++++++.-. -+..+++-. . .+.. .-...+++..+.+++. T Consensus 352 d~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p--~~ggD~~~~------~--~n~~-~l~~~~~~~~~~g~~~ 420 (422) T protein:vir:13 352 NVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENLPP--VEGGDRLLV------N--GNMI-PIEMAGEQYKKGGEKG 420 (422) T ss_pred echhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC--CCCcCeeee------c--cCcc-chhhcccccccCCCcC Confidence 44556667889999998876 48999999998875421 111111000 0 0000 0001111112222222 Q ss_pred CC Q lcl|NC_021326. 444 SE 445 (445) Q Consensus 444 ~~ 445 (445) ++ T Consensus 421 g~ 422 (422) T protein:vir:13 421 GK 422 (422) T ss_pred CC Confidence 22 No 160 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=98.05 E-value=5.9e-06 Score=49.25 Aligned_cols=380 Identities=9% Similarity=0.004 Sum_probs=180.9 Q ss_pred Ch-HHHHHHHHHHHHHHHHHHHHhcCCCcccccccccc-cccc-ccccccccccccchHHHHHHHHHhhhhccCeeecc- Q lcl|NC_021326. 1 MI-VRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVD-ATGA-VDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH- 76 (445) Q Consensus 1 ~l-~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~-~~~~-~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~- 76 (445) .. +.++.+ +.-..+-.+.+-+. .+...+...- .... ..... .. .......-.+.+...-+.+.+..+.+ T Consensus 17 ~~~~~~~~~----ia~~~~~~~~~~~~-~~~p~~~~il~~~~~~~~~y~-~m-~~D~~i~s~l~~Rk~av~~~~w~i~~~ 89 (491) T protein:vir:79 17 EPDKSLSSQ----IATRARSIDFFALG-MYLPNPDPVLKALGKDIRVYR-EL-RADAHVGGCVRRRKAAVKALEWGLDRG 89 (491) T ss_pred ccchhHHHH----Hhhhcccccccccc-ccCcchhHHHhhccCCHHHHH-HH-hhChHHHHHHHHHHHHHhCCCcEEecC Confidence 11 111111 11111101111111 0111111100 0000 00000 00 12455666777777778888988875 Q ss_pred -CchHHHHHHHHHhcc-CHHHHHHHHHHHHHhcCeEE-EEEEECCCCcE---EEEEEccceeEEEEcCCCCCceEEEEEE Q lcl|NC_021326. 77 -TDDEVIKRIDEVLGN-RFDDKLHSVLTGASNKGIEW-LHPYLDEEGEF---KLFRVPAEQGIPIWTDKEHEELEAFIRM 150 (445) Q Consensus 77 -~d~~~~~~l~~~~~n-~~~~~~~~~~~~~~~~G~~~-~~v~~d~~g~~---~i~~~~p~~~~~v~d~~~~~~~~~~v~~ 150 (445) +++...+++.+++++ ++...+..+ .++..+|.++ .++|...+|.. ++.+++|..+. |+.. +.+.. T Consensus 90 ~~~~~~a~~i~e~l~~~~~~~~i~~~-lda~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~--~d~~--~~l~l---- 160 (491) T protein:vir:79 90 KAKSRVAKSIADVFADLDLSRIATEM-LDAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFV--YDPE--NQLRF---- 160 (491) T ss_pred CCCHHHHHHHHHHHhcCCHHHHHHHH-HHhhhhcceeEEEEEeecCCeeeEEeeeeeccccee--eccC--CceEE---- Confidence 334456778887765 455555544 5688899875 55665555654 35555554432 3321 11111 Q ss_pred EeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEe--cCCCCcCccHHHHHHHHHHHHHH Q lcl|NC_021326. 151 YKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPF--KNNDLEISDIFMYKTLIDAYNRR 228 (445) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~--~n~~~g~s~~~~v~~lid~~~~~ 228 (445) ..... ..+. ....+++.|-+.+. ..++.|.|.+..+....-.-+.. T Consensus 161 --------------------~~~~~--------~~~g----~~lp~~k~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~ 208 (491) T protein:vir:79 161 --------------------RSKEH--------WVQG----EELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGG 208 (491) T ss_pred --------------------eecCC--------CCCc----eeecCCCeEEEEecCCCCCcccchhHHHHHHHHHHHHhh Confidence 00000 0000 00112333222221 13567889998877666666667 Q ss_pred HHHHHHHHHHhcCCeeEEecC---CcccchhH---HHhhhhCceeeccCCCceeeEeccC---ChHHHHHHHHHHHHHHH Q lcl|NC_021326. 229 LSDLSNTFKDSNELTYVLTNY---DDQELPEF---KRLLRYYGAIKVSDNGGVDTIQVEV---PVENSKKYLDELYQKIM 299 (445) Q Consensus 229 ~s~~~~~~~~~~~~~l~~~g~---~~~~~~~~---~~~~~~~~~~~~~~~~~~~~l~~~~---~~~~~~~~i~~l~~~i~ 299 (445) +.+++.-++.++.|+++.+=. +.++-... ...+....+..++.+.++++++... +...++.+++.+.+.|. T Consensus 209 ~~~w~~f~E~~G~P~~igky~~~a~~~ek~~l~~al~~~~~~a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Is 288 (491) T protein:vir:79 209 LKFWVQFTEKYGSPMLVGKHPRSASDAETNLLLDRLEDMVQDAVAVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVS 288 (491) T ss_pred HHHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHHHHHHhcCeEEEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHH Confidence 888999999999999887632 22211111 2334556677889999999997542 34568888888888776 Q ss_pred HHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCH-HHH Q lcl|NC_021326. 300 LFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANT-ELQ 378 (445) Q Consensus 300 ~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~-~~~ 378 (445) ...-.--++.++.|+...|.. ...-....+..-.+.+...+.++++-++.+.+... ..+.+.|.+ +.+. ... T Consensus 289 k~iLGqtlTt~~~gs~a~~~v---h~~v~~~i~~~D~~~i~~tln~li~~l~~~N~~~~--~~p~f~~~e--~ee~~~~~ 361 (491) T protein:vir:79 289 IALLGQNQTTEATSTRASAQA---GLEVTDDIRDGDKAIVVEAMNMLIRWICDLNFDGA--ARPVFDMWE--QEQVDEIQ 361 (491) T ss_pred HHHhhhhhccCcccchhhHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC--CcceEeecC--cCchhHHH Confidence 643110112222233322322 22223344455567777888888888887765432 233444443 3333 446 Q ss_pred HHHHHHHh--cc-CChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 379 VQTAQQSM--GI-VSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 379 ~~~~~~~~--g~-~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++.+.+++ |+ ++.+.+.+.++. +.++.+-+ .. ....+.... .. ...+....+.++.+ T Consensus 362 a~~~~~L~~~G~~i~~~~~~e~~Gi-p~~~~~e~-~~------~~~~~~~~~-~~-~~~~~~~~~~~~~d 421 (491) T protein:vir:79 362 AGRDEKLTRAGARFTPAYFKRAYNL-QDGDLDER-PL------PVSAVDAVG-AA-SFAEFEAPDQDALD 421 (491) T ss_pred HHHHHHHHhCCCccCHHHHHHHhCC-CCCCCCcc-cc------CcCcccccc-cc-cccccCCCCCcchH Confidence 77777764 66 888888888854 32211100 00 000000000 00 00011111111111 No 161 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=98.02 E-value=6.8e-06 Score=48.93 Aligned_cols=385 Identities=8% Similarity=-0.006 Sum_probs=179.1 Q ss_pred ChHHHHHHHH------H-HHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCee Q lcl|NC_021326. 1 MIVRYIKQHL------E-KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA 73 (445) Q Consensus 1 ~l~~~i~~~~------~-~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~ 73 (445) +=.--.++.+ + .-.-++-+ .+|.|-.= +.++. ..... .++..+.++.+.+..+..+-+. T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~~~~~F-~Gy~~-----la~la-------Q~~eyr~~~~~ia~e~~R~w~~ 142 (695) T protein:vir:36 77 VSNYTPRERRAASYALDFNGTSMDAL-SFVTSSGF-PGFPT-----LVLLA-------QLPEYRAMHEVLADECIRTWGE 142 (695) T ss_pred ccccCccccchhhhhhcccccccccc-hhhhccCc-chHHH-----HHHHh-------hccchhhHHHHHHHHhhcccce Confidence 0000000000 0 00111111 23333210 00000 00000 0122233333433333222222 Q ss_pred ec-------------------c-CchHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCc----------- Q lcl|NC_021326. 74 FK-------------------H-TDDEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGE----------- 121 (445) Q Consensus 74 ~~-------------------~-~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~----------- 121 (445) ++ . .+.+..+.|+.-++ =+....+.+..+.+-.||.+..++-++.++. T Consensus 143 ~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~ 222 (695) T protein:vir:36 143 AIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPY 222 (695) T ss_pred ecccchhhhhhccccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCccccccccccccc Confidence 11 1 12234455554443 3677888999999999999987776654331 Q ss_pred ------EE-EEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEE-Eecceeeecccccccccccccc Q lcl|NC_021326. 122 ------FK-LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYV-YENGSLIPDYSNNLENSKTHFS 193 (445) Q Consensus 122 ------~~-i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 193 (445) ++ +.+++|..+.|-.-+. .++. .-.+|.+..+...- .-..++. T Consensus 223 ~I~kGslKGl~ViDp~~vtP~~~n~--~dP~------------spdfgkP~~y~V~G~kIH~SRL--------------- 273 (695) T protein:vir:36 223 TVPKGSFQGLRVVEPYWVTPNNYNS--INPV------------ADDFYKPSTWWMIGTEVHATRL--------------- 273 (695) T ss_pred cccCcceeeeEeecccccccchhhh--ccch------------hhccCCCceEEEeceEEeeeeE--------------- Confidence 11 5666777666632110 0000 00111111111000 0000000 Q ss_pred ccccc--ccceEE-ecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeE------EecCCcccchh---HHHhh Q lcl|NC_021326. 194 TGSWG--KIPFIP-FKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYV------LTNYDDQELPE---FKRLL 261 (445) Q Consensus 194 ~~~~g--~iPvv~-~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~------~~g~~~~~~~~---~~~~~ 261 (445) +.+. .+|-+. -..+..|.|....+.+-+++.+++.-.....+..+....+. +.+....+... ..... T Consensus 274 -~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk~dla~aL~~g~~~~l~~R~eli~~~ 352 (695) T protein:vir:36 274 -HTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGILMDLAQALMPGANVDLSMRAELINRY 352 (695) T ss_pred -EEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhHHHHHHHHHHhhcChhHHHHHHHHHHHHHh Confidence 0010 111110 01234688888888888888888876666665544433221 11111000000 00111 Q ss_pred -hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccc--cC-cchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 262 -RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKF--GS-APSGVALEFLYTNLNLKADKLAR 337 (445) Q Consensus 262 -~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~--~~-~~Sg~Ai~~~~~~l~~k~~~~~~ 337 (445) ...+++.++ +.+=+|.+.+.+...+...+.+....|...+++|-.-+.+. +| |.||++-...+.+.+. ...+. T Consensus 353 Rsn~G~~llD-k~~Eefeq~stslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~--s~Qe~ 429 (695) T protein:vir:36 353 RDNRNILFLD-KATEEFFQFNTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVR--AYQRN 429 (695) T ss_pred cCccceEEEe-cCCcceEEEecccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHH--HHHHH Confidence 233445554 33346778888999999999999999999999996533322 23 6888887777777765 34478 Q ss_pred HHHHHHHHHHHHHHHH-hccCCCcceEEEEeCCCCCCCHHHHHHHHHHH---------hccCChHHHHHhCC------CC Q lcl|NC_021326. 338 KAKVAIQELLWFVFEH-FDIKGEHKDVDISFNYNKVANTELQVQTAQQS---------MGIVSHETVLENHP------FV 401 (445) Q Consensus 338 ~~~~~l~~~~~~~~~~-~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~---------~g~~s~et~l~~l~------~~ 401 (445) .+...+++++.++..- +|.. + .++.++|+|--..++.+.|++-.+- .|+++...+..+|. +. T Consensus 430 ~L~p~L~rl~~ii~rS~~G~i-d-pdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~ 507 (695) T protein:vir:36 430 ALQQLMNDVIVMIQLSLFGAV-D-PSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYA 507 (695) T ss_pred HHHHHHHHHHHHHHHHhcCCC-C-CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccc Confidence 8999999999987654 3443 2 3688999999988999988864431 37777777666641 21 Q ss_pred --CCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 402 --EDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 402 --~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) -|...+=-...+++..- ..+.-+...+.+++++ T Consensus 508 ~~~D~~d~p~~~~~~~~~~-----------~~~~~~~~~~~~~~~~ 542 (695) T protein:vir:36 508 GKLDANDDPGVPADDDIDG-----------VLTYVQRLAEGGDTGA 542 (695) T ss_pred cccccccCCCcCccchhhh-----------hHhhhcCcccccccCC Confidence 01000000000000000 0000011111111111 No 162 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=98.02 E-value=7e-06 Score=48.84 Aligned_cols=422 Identities=11% Similarity=0.049 Sum_probs=185.5 Q ss_pred ChHHHHHHHHHH----HHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C--- Q lcl|NC_021326. 1 MIVRYIKQHLEK----LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P--- 71 (445) Q Consensus 1 ~l~~~i~~~~~~----~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~--- 71 (445) -+++..+.++.+ ..+++.+.+|.. +.+-..... ...+...++..+-+...++++++.|++- | T Consensus 13 ~~~~r~~~l~~~R~~~e~~w~e~~~~~l-----P~~~~~~~~----~~~~~~~~~~dst~~~a~~~LAa~L~~~ltpp~~ 83 (532) T protein:vir:99 13 GAAAAYNRLKNDRGAYETRAEDCATYTI-----PSVFPSATA----DGSTSYTTPWQSIGARGLNNLASKLMLALFPVGS 83 (532) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHhh-----hcccCCCCC----cchhhccccccchHHHHHHHHHHHHHHhhcCCCC Confidence 334444444433 233344444432 111111111 1111223566677788888888777543 2 Q ss_pred --eeeccCchH-------------HHHHH--------HHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCC---CcEEEE Q lcl|NC_021326. 72 --IAFKHTDDE-------------VIKRI--------DEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEE---GEFKLF 125 (445) Q Consensus 72 --~~~~~~d~~-------------~~~~l--------~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~---g~~~i~ 125 (445) +++...+.. +...| ..+..+||...+.++.++..++|.+.+++..++. ...+++ T Consensus 84 ~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~f~ 163 (532) T protein:vir:99 84 SFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPK 163 (532) T ss_pred ccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEecccccccCcccceE Confidence 233333221 22222 2223468999999999999999999998866432 345677 Q ss_pred EEccceeEEEEcCCCCCceEEEEEEEeee--------------------cceeEEEEecceEEEEEEecceeeecccc-c Q lcl|NC_021326. 126 RVPAEQGIPIWTDKEHEELEAFIRMYKLE--------------------NETKVEYWDKITVNYYVYENGSLIPDYSN-N 184 (445) Q Consensus 126 ~~~p~~~~~v~d~~~~~~~~~~v~~~~~~--------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 184 (445) .++-.+. .+-.+. .+++...+|..+.. ....+++|+.... ............ . T Consensus 164 ~~pl~~y-~v~~d~-~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~~~v~~----~~~~~~~~~~~~~~ 237 (532) T protein:vir:99 164 LYKLHNF-VVERDA-YDNVLQIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTHVYR----DPEAMVFRSYQEID 237 (532) T ss_pred EEEcCeE-EEeeCC-CCCeeeEeeeeeecHHhcChHHHHHhhccccccCCCcceEEEEEEEe----cCCCCeeEEEEeec Confidence 7776664 333333 56776666544321 1112333221110 011100000000 0 Q ss_pred ccccccccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEe-cCCcccchhHH Q lcl|NC_021326. 185 LENSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLT-NYDDQELPEFK 258 (445) Q Consensus 185 ~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~-g~~~~~~~~~~ 258 (445) ...........++..+|++.+. .+.+|+|-.....+-+..+|...-...........|.+.+. +... ... T Consensus 238 g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~----~~~ 313 (532) T protein:vir:99 238 GEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVT----QIR 313 (532) T ss_pred CceecccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCceecccccc----chh Confidence 1111111122335567776553 34679999999999999999887777777788888776543 1111 111 Q ss_pred Hhh-hhCceeeccCCCceeeEecc--CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHH- Q lcl|NC_021326. 259 RLL-RYYGAIKVSDNGGVDTIQVE--VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADK- 334 (445) Q Consensus 259 ~~~-~~~~~~~~~~~~~~~~l~~~--~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~- 334 (445) ... ...+.+..+..++++++... .+.......++.++..|...-.... .....+...+|+.+......+.....- T Consensus 314 ~~~~~~~g~~v~g~~~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-~~~~d~~r~TAtEV~~r~~E~~~~LGpv 392 (532) T protein:vir:99 314 RVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNS-AVQRGGDRVTAEEIRYVAGELEDTLGGV 392 (532) T ss_pred hhccCCCcceecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh-cccCCCCcccHHHHHHHHHHHHHHhhHH Confidence 111 12233433344455555433 4567777788888777765432221 111223446777776543333333221 Q ss_pred ----HHHHHHHHHHHHHHHHHHHhccC---CCcceEEE-EeCCCCCCCHHHHHHHH----HHHhcc-------CChHHHH Q lcl|NC_021326. 335 ----LARKAKVAIQELLWFVFEHFDIK---GEHKDVDI-SFNYNKVANTELQVQTA----QQSMGI-------VSHETVL 395 (445) Q Consensus 335 ----~~~~~~~~l~~~~~~~~~~~~~~---~~~~~i~v-~f~~~~p~d~~~~~~~~----~~~~g~-------~s~et~l 395 (445) ....+.+-+.+++.++.+-.-.+ .+...+.+ ++-. |-...+.++.+ +.++.+ +....++ T Consensus 393 ~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~iv~~is--~Laraq~~~~l~~~~~~laq~~p~~~d~id~d~~~ 470 (532) T protein:vir:99 393 YSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLE--ALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVK 470 (532) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhcccceeecch--HHHHHHHHHHHHHHHHHHHhhcchhhhhCCHHHHH Confidence 12222333344444443321111 11212222 2211 22222222222 222222 2223333 Q ss_pred H----hCCC----CCCHHHHHHHHHHHHHHHHHhhhc-----cccCCCCCCCCCCCCCCcCC Q lcl|NC_021326. 396 E----NHPF----VEDLQAELERIEQEQMEYNKQLPN-----LDDGGADGAQQKERSNDKQS 444 (445) Q Consensus 396 ~----~l~~----~~d~~~E~~ri~~E~~~~~~~~~~-----~~~~~~~~~~~~~~~~d~~~ 444 (445) . .++. +--.++|++.+.++++........ ...+.+.......+..-+++ T Consensus 471 ~~~a~~~GV~~~~i~r~~ee~~~~~~q~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (532) T protein:vir:99 471 MRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMMQQQAGMPTQ 532 (532) T ss_pred HHHHHHhCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhHHhhcCCCCC Confidence 2 2221 112345555555443222211111 11111121122222222222 No 163 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=98.01 E-value=7.3e-06 Score=48.74 Aligned_cols=392 Identities=8% Similarity=0.023 Sum_probs=179.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeec----- Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFK----- 75 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~----- 75 (445) -..... .+ .-.-++-+ .+|.|-.= +.++. .+... .++..+.++.+.+..+..+-+.++ T Consensus 86 ~~~~~~-~~--~~~~~~~l-~~~~~~~F-~Gy~~-----la~la-------Q~~eyr~~~~~ia~e~~R~w~~~~~~~~e 148 (694) T protein:vir:10 86 AASYAL-DF--NGTSMDAL-SFVTSSGF-PGFPT-----LVLLA-------QLPEYRAMHEVLADECIRTWGEAIGGTKE 148 (694) T ss_pred hhhhhh-cc--Ccccccch-hhhhccCc-chHHH-----HHHHh-------hccchhhHHHHHHHHhhcccceeccccch Confidence 000000 00 00111112 23433210 00000 00000 012223333344433322222211 Q ss_pred --------------c-CchHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCc-----------------E Q lcl|NC_021326. 76 --------------H-TDDEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGE-----------------F 122 (445) Q Consensus 76 --------------~-~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~-----------------~ 122 (445) . .+.+..+.|..-++ =+....+.++.+.+-.||.+..++-++.++. + T Consensus 149 ~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eaik~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGsl 228 (694) T protein:vir:10 149 KADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSF 228 (694) T ss_pred hhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeecCccccccccccccccccCcce Confidence 1 12234445544433 3677888999999999999987776644331 1 Q ss_pred E-EEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEE-Eecceeeeccccccccccccccccccc-- Q lcl|NC_021326. 123 K-LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYV-YENGSLIPDYSNNLENSKTHFSTGSWG-- 198 (445) Q Consensus 123 ~-i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~g-- 198 (445) + +.+++|..+.|-.-+. .++. .-.+|.+..+...- .-..++. +.+. T Consensus 229 KGl~ViDp~~vtP~~~n~--~dP~------------spdfgkP~~y~V~G~~IH~SRL----------------~~f~g~ 278 (694) T protein:vir:10 229 QGLRVVEPYWVTPNNYNS--INPV------------ADDFYKPSTWWMIGTEVHATRL----------------HTIVSR 278 (694) T ss_pred eeeEeecccccccchhhh--ccch------------hhccCCCceEEEeceEEeeeeE----------------EEecCC Confidence 1 5666777666632110 0000 00111111111000 0000000 0010 Q ss_pred ccceEE-ecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecC-----CcccchhHH-----Hhh-hhCce Q lcl|NC_021326. 199 KIPFIP-FKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNY-----DDQELPEFK-----RLL-RYYGA 266 (445) Q Consensus 199 ~iPvv~-~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~-----~~~~~~~~~-----~~~-~~~~~ 266 (445) .+|-+. -..+..|.|....+.+-+++.+++.......+..+....+. +++ ......-.. ... ...++ T Consensus 279 plPd~LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk-~dla~~L~~g~~~~l~~R~eli~~~Rsn~G~ 357 (694) T protein:vir:10 279 PVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGIL-MDLAQALMPGANVDLSMRAELINRYRDNRNI 357 (694) T ss_pred CchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhhHHHH-HHHHHhhcChhHHHHHHHHHHHHHhcCccce Confidence 111110 01234688988888888888888876666666544433221 110 011101011 111 23344 Q ss_pred eeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccc--cC-cchHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 267 IKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKF--GS-APSGVALEFLYTNLNLKADKLARKAKVAI 343 (445) Q Consensus 267 ~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~--~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l 343 (445) +.++ +.+=+|.+.+.+...+...+.+....|...+++|-.-+.+. +| |.||++-...+.+.+. ...+..+...+ T Consensus 358 ~llD-k~~Eefeq~stslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~--s~Qe~~L~p~L 434 (694) T protein:vir:10 358 LFLD-KATEEFFQFNTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVR--AYQRNALQQLM 434 (694) T ss_pred EEEe-cCCcceEEEecccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHH--HHHHHHHHHHH Confidence 5554 33346778888999999999999999999999996533322 23 6888887777777765 34478899999 Q ss_pred HHHHHHHHHH-hccCCCcceEEEEeCCCCCCCHHHHHHHHHHH---------hccCChHHHHHhC------CCC--CCHH Q lcl|NC_021326. 344 QELLWFVFEH-FDIKGEHKDVDISFNYNKVANTELQVQTAQQS---------MGIVSHETVLENH------PFV--EDLQ 405 (445) Q Consensus 344 ~~~~~~~~~~-~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~---------~g~~s~et~l~~l------~~~--~d~~ 405 (445) ++++.++..- +|.. + .++.++|+|--..++.+.|++-.+- .|+++...+..+| ++. -|.. T Consensus 435 ~rl~~ii~rS~~G~i-d-p~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~ 512 (694) T protein:vir:10 435 NDVIVMIQLSLFGAV-D-PSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDAN 512 (694) T ss_pred HHHHHHHHHHhcCCC-C-CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccc Confidence 9999987654 3443 2 3688999999888999988864431 3777777766664 121 0100 Q ss_pred HHHHHHHHHHH----HHHHhhhccccCCCCCCC-CCCCCCCcCCC Q lcl|NC_021326. 406 AELERIEQEQM----EYNKQLPNLDDGGADGAQ-QKERSNDKQSE 445 (445) Q Consensus 406 ~E~~ri~~E~~----~~~~~~~~~~~~~~~~~~-~~~~~~d~~~~ 445 (445) .+=-...+++. ...+......+.+..++. +....+-.... T Consensus 513 d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~ 557 (694) T protein:vir:10 513 DDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGATAPPTVAN 557 (694) T ss_pred cCCCcCccchhhhhHhhhcCcccccccCCCCcccccccCCCcccc Confidence 00000000000 000000000100000000 00000000000 No 164 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=98.00 E-value=7.6e-06 Score=48.66 Aligned_cols=384 Identities=7% Similarity=-0.018 Sum_probs=179.7 Q ss_pred ChHHHHHHHHH-------HHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCee Q lcl|NC_021326. 1 MIVRYIKQHLE-------KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA 73 (445) Q Consensus 1 ~l~~~i~~~~~-------~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~ 73 (445) +=.--+++.+. .-.-++-+ .+|.|-.= +.++. ..... .++..+.++.+.+..+..+-+. T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~~~~~F-~Gy~~-----la~la-------Q~~eyr~~~~~ia~e~~R~w~~ 142 (695) T protein:vir:78 77 VSNYTPRERRAASYALDFNGTSMDAL-SFVTSSGF-PGFPT-----LVLLA-------QLPEYRAMHEVLADECIRTWGE 142 (695) T ss_pred cccCCccccchhhhhhcccccccccc-hhhhccCc-chHHH-----HHHHh-------hccchhhHHHHHHHHhhcccce Confidence 00000000000 00111112 23333210 00000 00000 0122233333444333222222 Q ss_pred ec-------------------c-CchHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCc----------- Q lcl|NC_021326. 74 FK-------------------H-TDDEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGE----------- 121 (445) Q Consensus 74 ~~-------------------~-~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~----------- 121 (445) ++ . .+.+..+.|..-++ =+....+.+..+.+-.||.+..++-++.++. T Consensus 143 ~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~ 222 (695) T protein:vir:78 143 AIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPY 222 (695) T ss_pred eccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCccccccccccccc Confidence 11 1 12234445544433 3677888999999999999987776654331 Q ss_pred ------EE-EEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEE-Eecceeeecccccccccccccc Q lcl|NC_021326. 122 ------FK-LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYV-YENGSLIPDYSNNLENSKTHFS 193 (445) Q Consensus 122 ------~~-i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 193 (445) ++ +.+++|..+.|-.-+. .++. .-.+|.+..+...- .-..++. T Consensus 223 ~I~kGslKGl~ViDp~~vtP~~~n~--~dP~------------spdfgkP~~y~V~G~kIH~SRL--------------- 273 (695) T protein:vir:78 223 TVPKGSFQGLRVVEPYWVTPNNYNS--INPV------------ADDFYKPSTWWMIGTEVHATRL--------------- 273 (695) T ss_pred cccCcceeeeEeecccccccchhhh--ccch------------hhccCCCceEEEeceEEeeeeE--------------- Confidence 11 5666777666632110 0000 00111111111000 0000000 Q ss_pred ccccc--ccceEE-ecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecC-----CcccchhHH-----Hh Q lcl|NC_021326. 194 TGSWG--KIPFIP-FKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNY-----DDQELPEFK-----RL 260 (445) Q Consensus 194 ~~~~g--~iPvv~-~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~-----~~~~~~~~~-----~~ 260 (445) +.+. .+|-+. -..+..|.|....+.+-+++.+++.-.....+..+....+. +++ ......-.. .. T Consensus 274 -~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk-~dla~~L~~g~~~~l~~R~eli~~ 351 (695) T protein:vir:78 274 -HTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGIL-MDLAQALMPGANVDLSMRAELINR 351 (695) T ss_pred -EEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhhHHHH-HHHHHhhcChhHHHHHHHHHHHHH Confidence 0010 111110 01234688988888888888888876666666544433321 110 011101011 11 Q ss_pred h-hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccc--cC-cchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 261 L-RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKF--GS-APSGVALEFLYTNLNLKADKLA 336 (445) Q Consensus 261 ~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~--~~-~~Sg~Ai~~~~~~l~~k~~~~~ 336 (445) . ...+++.++ +.+=+|.+.+.+...+...+.+....|...+++|-.-+.+. +| |.||++-...+.+.+. ...+ T Consensus 352 ~Rsn~G~~llD-k~~Eefeq~stslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~--s~Qe 428 (695) T protein:vir:78 352 YRDNRNILFLD-KATEEFFQFNTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVR--AYQR 428 (695) T ss_pred hcCccceEEEe-cCCcceEEEecccCCHHHHHHHHHHHHHhhhcCchhhhhccCCccccccchhhHHHHHHHHH--HHHH Confidence 1 233445554 33346778888999999999999999999999996533322 23 6888887777777765 3447 Q ss_pred HHHHHHHHHHHHHHHHH-hccCCCcceEEEEeCCCCCCCHHHHHHHHHHH---------hccCChHHHHHhCC------C Q lcl|NC_021326. 337 RKAKVAIQELLWFVFEH-FDIKGEHKDVDISFNYNKVANTELQVQTAQQS---------MGIVSHETVLENHP------F 400 (445) Q Consensus 337 ~~~~~~l~~~~~~~~~~-~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~---------~g~~s~et~l~~l~------~ 400 (445) ..+...+++++.++..- +|.. + .++.++|+|--..++.+.|++-.+- .|+++...+..+|. + T Consensus 429 ~~L~p~L~rl~~ii~rS~~G~i-d-pdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y 506 (695) T protein:vir:78 429 NALQQLMNDVIVMIQLSLFGAV-D-PSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPY 506 (695) T ss_pred HHHHHHHHHHHHHHHHHhcCCC-C-CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCccc Confidence 88999999999987654 3443 2 3688999999988999988864431 37777777666641 2 Q ss_pred C--CCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 401 V--EDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 401 ~--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) . -|...+=-...+++..-. .+.-+...+.+++++ T Consensus 507 ~~~~D~~d~p~~~~~~~~~~~-----------~~~~~~~~~~~~~~~ 542 (695) T protein:vir:78 507 AGKLDANDDPGVPADDDIDGV-----------LTYVQRLAEGGDTGA 542 (695) T ss_pred ccccccccCCCcCccchhhhh-----------HhhhcCcccccccCC Confidence 1 010000000000000000 000011111111111 No 165 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=98.00 E-value=7.7e-06 Score=48.63 Aligned_cols=382 Identities=11% Similarity=0.074 Sum_probs=148.8 Q ss_pred ChHHHH------HHHHHH-----HHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhc Q lcl|NC_021326. 1 MIVRYI------KQHLEK-----LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVG 69 (445) Q Consensus 1 ~l~~~i------~~~~~~-----~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g 69 (445) .-.+++ ..+..+ -..+..+.+-|. ..++...-... +. ..+..+++.|.+..++ T Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~iv~~~i~~---------~~--~~V~~~~~~i~~~ia~---- 121 (574) T protein:vir:80 58 YMQPIIGEMSVNPGYKTKPSIRNSQDLHKTLKKFG-NNIILNAIINT---------RS--NQVSMYCKPARNSETG---- 121 (574) T ss_pred ccchhhhhccccccccCcCccCCcccHHHHHHhhc-cChhHHHHHHH---------HH--HHHHHHHHHHHhhhcc---- Confidence 111111 111100 001111112221 12222111000 00 0011223333333222 Q ss_pred cCeeec---cC------chHHHHHHHHHhcc----------CHHHHHHHHHHHHHhcCeEEEEEEECCCCcEE-EEEEcc Q lcl|NC_021326. 70 KPIAFK---HT------DDEVIKRIDEVLGN----------RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFK-LFRVPA 129 (445) Q Consensus 70 ~~~~~~---~~------d~~~~~~l~~~~~n----------~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~-i~~~~p 129 (445) -|+.+- .+ .......+..++.+ .+..+...+..+.+.+|.+|+.+..+.+|+|. +..++| T Consensus 122 lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~G~~~~L~pl~p 201 (574) T protein:vir:80 122 VGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYDQVNFEKVFDKDGNFIKFDTVDP 201 (574) T ss_pred CceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEEcC Confidence 222221 00 00111122233211 23446667788889999999998888889864 788899 Q ss_pred ceeEEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec--- Q lcl|NC_021326. 130 EQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK--- 206 (445) Q Consensus 130 ~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~--- 206 (445) ..+.+..+.... ......+|+..........+.... |++++ T Consensus 202 ~~V~v~~d~~~~-~~~~~~~y~~~~~g~~~~~~~~~e-----------------------------------iih~~~~~ 245 (574) T protein:vir:80 202 TTIFLATNGEGK-LIKNGERFVQVIDNRIVAKFNERE-----------------------------------LAFAVRNP 245 (574) T ss_pred ceeEEEEcCccc-cccCceEEEEEeCCceEEEEcccc-----------------------------------EEEEeccC Confidence 998877654210 000111122111111100000000 22221 Q ss_pred -----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEE--ecC---CcccchhHHHhhh-------h-Cce-e Q lcl|NC_021326. 207 -----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL--TNY---DDQELPEFKRLLR-------Y-YGA-I 267 (445) Q Consensus 207 -----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~--~g~---~~~~~~~~~~~~~-------~-~~~-~ 267 (445) ....|.|.+..+...++....+..-..+.+...+.|-.++ .+. +.+....++..+. . +++ + T Consensus 246 ~~~~~~~~~G~spi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~v 325 (574) T protein:vir:80 246 RADIEVGQYGYPELEIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPV 325 (574) T ss_pred CCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhcccccccccee Confidence 1224788887777777766666555556666667776444 343 2222222222221 1 122 2 Q ss_pred eccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCc-c--------hHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 268 KVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSA-P--------SGVALEFLYTNLNLKADKLARK 338 (445) Q Consensus 268 ~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~-~--------Sg~Ai~~~~~~l~~k~~~~~~~ 338 (445) .++++.+..-++.......+.+..+.....|...-++|....+-.... . +...++.... .. T Consensus 326 l~~~G~~~~~l~~s~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~----------~f 395 (574) T protein:vir:80 326 VSAEDVKFVNMTPSANDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQ----------AS 395 (574) T ss_pred ecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHHHH----------HH Confidence 334444444444333444556667777778888888886443321110 0 0011111111 11 Q ss_pred HHHHHHHHHHHHHHHhcc---CCCcceEEEEeCCCCCCCHHHHHHHHHH-HhccCChHHHHHhCCC--CCCHH------- Q lcl|NC_021326. 339 AKVAIQELLWFVFEHFDI---KGEHKDVDISFNYNKVANTELQVQTAQQ-SMGIVSHETVLENHPF--VEDLQ------- 405 (445) Q Consensus 339 ~~~~l~~~~~~~~~~~~~---~~~~~~i~v~f~~~~p~d~~~~~~~~~~-~~g~~s~et~l~~l~~--~~d~~------- 405 (445) +...|.-++..+...+.. ......+.+.|.+....+..+...+... ..|+++.-.++++++. ++.-+ T Consensus 396 ~~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~f~~~d~~~~~~~~~~~~~~~~G~lT~NE~R~~lgl~Pi~gGD~~~~~~n 475 (574) T protein:vir:80 396 QNKGLQPLLRFIEDTVNTYIVAEFGEKYQFQFRGGDLSAQLDKLKIIEQEGKVFRTVNEIRHDKGLEPIKGGDVILNGVH 475 (574) T ss_pred HHHHHHHHHHHHHHHHHhhhhhhcCCceEEEecccchhhHHHHHHHHHHHhCCccCHHHHHHHhCCCCCCCCCEeeeccc Confidence 111121111111111110 1112346778877666666655544332 3589999998887643 21100 Q ss_pred -HHHHHH------HHHHHHHHHhhhccccCCCCCCCC-CCCCCCcCCC Q lcl|NC_021326. 406 -AELERI------EQEQMEYNKQLPNLDDGGADGAQQ-KERSNDKQSE 445 (445) Q Consensus 406 -~E~~ri------~~E~~~~~~~~~~~~~~~~~~~~~-~~~~~d~~~~ 445 (445) ..+... ..+.++. ...+.....+.+++.+ +.+..+.+.+ T Consensus 476 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~p~~~~~d 522 (574) T protein:vir:80 476 IQAIGQALQEEQLEYQRSQD-RLNRLLELSGGDVEQPEPEEPKDSQND 522 (574) T ss_pred eeecccccccccCCccchhc-cccccccccCCCCCCCCCCCCCCcccc Confidence 000000 0001110 0111111111111111 1111111111 No 166 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=97.89 E-value=1.2e-05 Score=47.47 Aligned_cols=397 Identities=10% Similarity=-0.001 Sum_probs=159.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccc------cccchHHHHHHHHHhhhhccCeee Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDR------MITNFHANLVDQKVSYIVGKPIAF 74 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~r------i~~n~~~~iv~~~~~~l~g~~~~~ 74 (445) ++..+..+.......-.. ...|.. .............+.-. +.+.=...+|+..++-+.+-|+.+ T Consensus 3 ~~~~l~~~~~~~~~~~~~-~~~~~~--------~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~ 73 (457) T protein:vir:62 3 FWSALFGRGHSPALDAAE-GRAWEP--------YDPSIYNLGATASSGERVTPHDALQVSAVFASVRLLSETIATLPLST 73 (457) T ss_pred hhhhhhcccccccccccc-cccccc--------chhhhhhccccccCCceechHHhhccHHHHHHHHHHHHhHhhCceEE Confidence 333332211110000000 000000 00000000000000000 111111224555555555556654 Q ss_pred ccCc----hHH-HHHHHHHhc--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCc Q lcl|NC_021326. 75 KHTD----DEV-IKRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEE 143 (445) Q Consensus 75 ~~~d----~~~-~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~ 143 (445) --.+ ... ...+..++. | ....+...+..+.+.+|.+|+.+..+ .|++ .+..++|..+.+..+.... . T Consensus 74 ~~~~~~~~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~-~ 151 (457) T protein:vir:62 74 YSKRGGTRKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA-GPNIAGLDVLDPTKIHVHMVMVDG-L 151 (457) T ss_pred EEecCCccccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEeccCC-c Confidence 2111 111 112222221 2 24556677788899999999988554 4655 4677888887664432211 1 Q ss_pred eEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHH Q lcl|NC_021326. 144 LEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMY 218 (445) Q Consensus 144 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v 218 (445) .....+.|......... ....+ +-.. |+++++ ...|.|.+... T Consensus 152 ~~~~~~~y~~~~~g~~~-----~~~~~-------------------------~~~e--iih~r~~~~~~~~~G~sp~~~~ 199 (457) T protein:vir:62 152 RRKVFEAYDIDADGNEV-----LLGWF-------------------------TPRD--VLHIPGMMLPGDFVGCSPISYA 199 (457) T ss_pred cceeEEEEEEccCCcee-----EEEee-------------------------Cccc--eEEecCCCCCCceecccHHHHH Confidence 11111111111111000 00000 0000 233321 13577877776 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCeeEEecCC---cccchhHHHhh----h----hCceeeccCCCceeeEeccCChHHH Q lcl|NC_021326. 219 KTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLL----R----YYGAIKVSDNGGVDTIQVEVPVENS 287 (445) Q Consensus 219 ~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~----~----~~~~~~~~~~~~~~~l~~~~~~~~~ 287 (445) ...+.....+.......+...+.|-.++.-.. .+.....+..+ . ..+++.++.+.+.+.+..+.....+ T Consensus 200 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~ 279 (457) T protein:vir:62 200 RESIGLALAAQKYGAHFFRNGAMPGAVVEVPGTMSEEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQF 279 (457) T ss_pred HHHHHHHHHHHHHHHHHHhccCCcceEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHH Confidence 66666666555555566677777766554322 11112222211 1 1335666666665555544444456 Q ss_pred HHHHHHHHHHHHHHhCccccccccccC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-CCcceEEE Q lcl|NC_021326. 288 KKYLDELYQKIMLFGQAVDFSSDKFGS-APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK-GEHKDVDI 365 (445) Q Consensus 288 ~~~i~~l~~~i~~~s~~p~~~~~~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~-~~~~~i~v 365 (445) .+..+..+..|+..-++|....+...+ +.++..++.........+ ..-+-..|.+.+.. .++... .....+++ T Consensus 280 ~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~~---l~P~~~~ie~~ln~--~L~~~~~~~~~~i~f 354 (457) T protein:vir:62 280 LQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFS---LRPWLERIEAGFNR--LLFAETADRFRFVKF 354 (457) T ss_pred HHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHH---HHHHHHHHHHHHHh--hhcCccccCceEEEe Confidence 666777778888888888754433222 222333332222221111 01111112111111 111111 12233455 Q ss_pred EeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCH--HHH-----HHHHHHHHHHH-HHhhhc---cccCCC Q lcl|NC_021326. 366 SFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPF--VEDL--QAE-----LERIEQEQMEY-NKQLPN---LDDGGA 430 (445) Q Consensus 366 ~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~--~~d~--~~E-----~~ri~~E~~~~-~~~~~~---~~~~~~ 430 (445) .+...+..|..+.++.+.++ .|+++.-.++++++. +++. ++- +..+....+.. .+..++ ...... T Consensus 355 d~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (457) T protein:vir:62 355 NLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEIGEEPEPEPAPAPPAIDPPAEEPA 434 (457) T ss_pred echhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccccccccccccccccCCCccCCCCccCCC Confidence 55666677899999988876 589999999988754 2222 111 11111110000 000000 000111 Q ss_pred CCCCCCCCCCCcCCC Q lcl|NC_021326. 431 DGAQQKERSNDKQSE 445 (445) Q Consensus 431 ~~~~~~~~~~d~~~~ 445 (445) ++.++++.++++.++ T Consensus 435 ~~~~~~~~~~~~d~~ 449 (457) T protein:vir:62 435 DDEEPDNAEGDPDEG 449 (457) T ss_pred CCCCCCCCCCCCccc Confidence 111111111111111 No 167 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=97.87 E-value=1.4e-05 Score=47.25 Aligned_cols=314 Identities=14% Similarity=0.060 Sum_probs=139.5 Q ss_pred hhccCeeeccC----chHHHHHHHHHhcc---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcC Q lcl|NC_021326. 67 IVGKPIAFKHT----DDEVIKRIDEVLGN---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTD 138 (445) Q Consensus 67 l~g~~~~~~~~----d~~~~~~l~~~~~n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~ 138 (445) +..-|+.+.-. +......|.. .-| .-......+....+.+|.+|+++..+..|++ .+..++|..+.++.++ T Consensus 1 ia~lp~~~~~~~~~~~~~l~~lL~~-~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~l~~~~v~~~~~~ 79 (348) T protein:vir:93 1 MASLPLKMYEDYKVVNTEVSDLLTV-SPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIEN 79 (348) T ss_pred CcccceEeEecCcCcccHHHHHHHh-CCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCCceEEEEeC Confidence 22223333111 1122222210 012 2345566677888999999999999988987 4777888888776554 Q ss_pred CCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCc Q lcl|NC_021326. 139 KEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEIS 213 (445) Q Consensus 139 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s 213 (445) . .+.+.+ +++. .....+ .+ +.. -|+++++ ...|.| T Consensus 80 ~-~~~~~y--~~~~-~~g~~~---------~~-------------------------~~~--eiih~r~~~~~~~~~G~s 119 (348) T protein:vir:93 80 Q-SRELYY--SIHA-ATGNKL---------IV-------------------------HNM--DMLHFKHIVASNMVQGIS 119 (348) T ss_pred C-CcEEEE--EEEc-CCCeEE---------EE-------------------------ccc--cEEEecCCCCCCceeecc Confidence 2 121111 1100 000000 00 000 1334332 224777 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCC-eeEE-ecCCc--ccchhHHHhh----h-hCceeeccCCCceeeEeccCCh Q lcl|NC_021326. 214 DIFMYKTLIDAYNRRLSDLSNTFKDSNEL-TYVL-TNYDD--QELPEFKRLL----R-YYGAIKVSDNGGVDTIQVEVPV 284 (445) Q Consensus 214 ~~~~v~~lid~~~~~~s~~~~~~~~~~~~-~l~~-~g~~~--~~~~~~~~~~----~-~~~~~~~~~~~~~~~l~~~~~~ 284 (445) .++.+...++..+.+. .. .+..+..+ -.++ .+... +........+ . ..+++.++++.+++.+..+... T Consensus 120 ~~~~~~~~i~~~~~~~-~~--~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d 196 (348) T protein:vir:93 120 PIDVLKNTTDFDNAVR-TF--NLTEMQKPDSFMLKYGSNVSTEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVS 196 (348) T ss_pred HHHHHHHHHHHHHHHH-HH--HHHhcCCCceeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhH Confidence 7766655555443332 11 23333333 2222 22221 1111111111 1 2335555655555555444344 Q ss_pred HHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----cCC- Q lcl|NC_021326. 285 ENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD-----IKG- 358 (445) Q Consensus 285 ~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~-----~~~- 358 (445) ..+.+..+.....|+..-++|....+.. +..+...++.....+ +...|.-+++.+...+. ... T Consensus 197 ~q~~e~~~~~~~~Ia~~fgVP~~~lg~~-~~~~~~~~e~~~~~~----------~~~~l~P~~~~ie~~l~~~l~~~~~~ 265 (348) T protein:vir:93 197 EDIVASENLTRERVANVFQLPSIFLNAR-SNTNFAKNEELNRFY----------LQHTLLPIVKQYEEEFNRKLLTKTDR 265 (348) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhCCC-CCCCcccHHHHHHHH----------HHHHHHHHHHHHHHHHHHhhCCcccc Confidence 4566667777788888888886544322 122222222211111 12222222222222111 111 Q ss_pred -CcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHhhhccc-cCCCCC Q lcl|NC_021326. 359 -EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFV--EDLQAELERIEQEQMEYNKQLPNLD-DGGADG 432 (445) Q Consensus 359 -~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~--~d~~~E~~ri~~E~~~~~~~~~~~~-~~~~~~ 432 (445) ....+++.+...+..|..+.++++.++ +|+++.-.++++++.- ++-+.=+ + ........ ...... T Consensus 266 ~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~--~-------~~n~~~~~~~~~~~~ 336 (348) T protein:vir:93 266 EKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPL--I-------SGDLYPIDTPLELRK 336 (348) T ss_pred cCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcCeEe--e-------cccccccccchhhcc Confidence 123355555666677889999998887 4899999999888552 1100000 0 00000000 000111 Q ss_pred CCCCCCCCCcCC Q lcl|NC_021326. 433 AQQKERSNDKQS 444 (445) Q Consensus 433 ~~~~~~~~d~~~ 444 (445) ....++++++++ T Consensus 337 ~~~gg~~n~~~~ 348 (348) T protein:vir:93 337 SLKGGDKNVNES 348 (348) T ss_pred cccCCCCCcCCC Confidence 122333333333 No 168 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=97.86 E-value=1.4e-05 Score=47.18 Aligned_cols=378 Identities=12% Similarity=-0.001 Sum_probs=161.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeecc--Cc Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH--TD 78 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~--~d 78 (445) |+.++.++..... ... -+.|-.. +.... .......-.+.-+.++-...+|+..++-+.+-|+.+-- ++ T Consensus 3 l~~~~f~~~~~~~-~~~----~~~~~~~----~~~~~-~~~g~~v~~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~ 72 (409) T protein:vir:84 3 LFTRIFSGPSEER-TLT----KISGIPS----PAEDW-AMHGDRPGANSAMTLGAFYACVTLLADTVASLSIDAYRKKDN 72 (409) T ss_pred hhhhhhcCCCccc-ccc----ccccccc----ccchh-hccCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEEecCC Confidence 4444433321110 000 0000000 00000 00000000001112233455677777666666765421 11 Q ss_pred hH--HHHHHHHHhc--c---CHHHHHHHHHHHHHhcCeEEEEEE-ECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEE Q lcl|NC_021326. 79 DE--VIKRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPY-LDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIR 149 (445) Q Consensus 79 ~~--~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~-~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~ 149 (445) .. .....+-+.. | .-......+..+.+.+|.+|+++. .+..|++ .+..++|..+.+....+.... .... T Consensus 73 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~~~~~~~~~--~~~~ 150 (409) T protein:vir:84 73 VRIPVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDCIHVTDAKDEDGD--WIEP 150 (409) T ss_pred cccccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCceeEEEEcCCCcce--EEEE Confidence 11 1111111211 2 345667778889999999998764 5667775 478889988876543322111 1100 Q ss_pred EEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHHHH Q lcl|NC_021326. 150 MYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDA 224 (445) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~lid~ 224 (445) .+... +.. +.. . -|+++++ ...|.|.+..+...++. T Consensus 151 ~~~~~---g~~-~~~---------------------------------~--dvih~~~~~~~~~~~G~s~i~~~~~~i~~ 191 (409) T protein:vir:84 151 VYRID---GKV-VPN---------------------------------H--RIMHIKRYPVAGCALGMSPIEKAASAIGL 191 (409) T ss_pred EecCC---ceE-Ech---------------------------------h--hEEEecCCCCCcccccccHHHHHHHHHHH Confidence 11000 000 000 0 0233321 13578888777777776 Q ss_pred HHHHHHHHHHHHHHhcCCeeEEecCC---cccchhHHHhh-----hhCceeeccCCCceeeEeccCChHHHHHHHHHHHH Q lcl|NC_021326. 225 YNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLL-----RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQ 296 (445) Q Consensus 225 ~~~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~ 296 (445) ...+.....+.+...+.|-.+++... .+........+ ...+++.++++.+.+.+........+.+..+...+ T Consensus 192 ~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~ 271 (409) T protein:vir:84 192 GLAAERYGLRWFRDSANPSGILSSDADLTPDQVKQTQKQWIQSHHNRRLPAVMSAGIKWQSVSITPNESQFLETRSFQRS 271 (409) T ss_pred HHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHH Confidence 66555555566667777766665322 21112222111 12334555555554444333333455566667778 Q ss_pred HHHHHhCcccccccccc-CcchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCC Q lcl|NC_021326. 297 KIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKA-DKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVAN 374 (445) Q Consensus 297 ~i~~~s~~p~~~~~~~~-~~~Sg~Ai~~~~~~l~~k~-~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d 374 (445) .|+..-++|..-.+... ++.++..++.....+...+ .-....+...+.+. + .....+++.+...+..| T Consensus 272 ~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~-------L---~~g~~i~fd~~~l~~~d 341 (409) T protein:vir:84 272 EIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRHTLLPWLRCIEQALDTF-------L---PRGQFVKFNVDGLMRGD 341 (409) T ss_pred HHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHh-------c---cCCCeEEEechhhhccC Confidence 88888888865443222 2222333332222111111 11111111111111 1 12244666667777789 Q ss_pred HHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCC-CCCCcCCC Q lcl|NC_021326. 375 TELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKE-RSNDKQSE 445 (445) Q Consensus 375 ~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~-~~~d~~~~ 445 (445) ..+.++++.++ .|+++.-.++++++.-.- +..+..-.- ........... .++.++++ ....+.++ T Consensus 342 ~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~--~ggD~~~~~--~n~~~~~~~~~--~~~~~~~~~~~~~~gn~ 409 (409) T protein:vir:84 342 VTARFTAYQMGLQNGIWSVNEVRAWEDAPPI--PEGDIHLQP--MNFVPLGYVPP--EEPAQEPQPNSATEGNK 409 (409) T ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcceeeec--ccccccccCCc--cccCcCCCCCCccCCCC Confidence 99999988876 489999999988754221 111100000 00000000000 00111111 01111111 No 169 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=97.85 E-value=1.5e-05 Score=47.01 Aligned_cols=390 Identities=10% Similarity=0.048 Sum_probs=159.2 Q ss_pred ChHHHHHHHHHHHH----HHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeecc Q lcl|NC_021326. 1 MIVRYIKQHLEKLP----EISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH 76 (445) Q Consensus 1 ~l~~~i~~~~~~~~----~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~ 76 (445) |..-+-...+...+ +-.-+...+..-.....- ........-+..-+.+.=...+|+..++-+.+-|+.+-- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-----~~~~g~~v~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~ 75 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAG-----AWQQGVKADPEAVLSFHAVFACISLISQDIAKMRLRLMQ 75 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcc-----hhhcCcccChHHhhccHHHHHHHHHHHHhhccCceEEEE Confidence 11100000000000 000000111000000000 000000000000011111223566666555555666421 Q ss_pred -C-c---hHH-HHHHHHHhc--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCce Q lcl|NC_021326. 77 -T-D---DEV-IKRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEEL 144 (445) Q Consensus 77 -~-d---~~~-~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~ 144 (445) + + ... ...+..++. | .-......+..+.+.+|.+|+.+-.+..|++ .+..++|..+.++.++. +.+ T Consensus 76 ~~~~g~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~--g~~ 153 (454) T protein:vir:93 76 TDAQGIRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILDWNRVEPLVADD--GEV 153 (454) T ss_pred eccCCccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCcceEEEEcCC--CcE Confidence 1 1 111 112222332 3 2345667778899999999999999888886 58889999988877642 332 Q ss_pred EEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec-----CCCCcCccHHHHH Q lcl|NC_021326. 145 EAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYK 219 (445) Q Consensus 145 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~ 219 (445) .+.+ .. ...... ....... .. -|+|++ +...|.|.+.... T Consensus 154 ~y~~--~~-~~~~~~-------~~~~~~~-----------------------~~--eViH~k~~~~~~~~~G~sp~~~~~ 198 (454) T protein:vir:93 154 FYRI--TP-DRNCGI-------TEAVTVP-----------------------AR--EVIHDRFNCFFHPLIGLPPVYAAG 198 (454) T ss_pred EEEE--Ee-cccccc-------ceeEEec-----------------------Cc--ceEEeccCCCCCCceeccHHHHHH Confidence 2211 10 000000 0000000 00 023322 2235778777766 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCeeEEe--cC-CcccchhHHHhhh-------hCceeeccCCCceeeEeccCChHHHHH Q lcl|NC_021326. 220 TLIDAYNRRLSDLSNTFKDSNELTYVLT--NY-DDQELPEFKRLLR-------YYGAIKVSDNGGVDTIQVEVPVENSKK 289 (445) Q Consensus 220 ~lid~~~~~~s~~~~~~~~~~~~~l~~~--g~-~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~l~~~~~~~~~~~ 289 (445) ..+.....+.......+...+.|-.++. +. +.+.....+..+. .++++.++.+.+.+.+........+.+ T Consensus 199 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le 278 (454) T protein:vir:93 199 LAATQGHHIQENSTSFFRNGGRPSGVIEIPGSITEENAKKLKSNWDSGYTGENAGKTAILSNGAKYNPTTFSPVDSQTVE 278 (454) T ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHHHHHHHHHhcccccCCceeccCCceEEEcccChhHHHHHH Confidence 6666555554444555566666655544 22 1222222222111 233555666666555554444455566 Q ss_pred HHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeC Q lcl|NC_021326. 290 YLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNL-KADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFN 368 (445) Q Consensus 290 ~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~-k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~ 368 (445) ..+.....|+..-++|....+...+. +...++.....+.. .+.=....+...+. ..+- ......+++.+. T Consensus 279 ~~~~~~~~Ia~~fgVPp~~lg~~~~~-t~sn~e~~~~~f~~~~l~P~~~~ie~~ln-------~~L~-~~~~~~~~f~~~ 349 (454) T protein:vir:93 279 QLKMTAEIVCSVFRVPAYKIGVGQPP-SSDNVEALEQQYYSQCLQTLIESIELLLD-------EALE-TGENESTEFDVT 349 (454) T ss_pred HHHHHHHHHHHHhCCCHHHcCCCCCC-cchhHHHHHHHHHHHHHHHHHHHHHHHHH-------Hhhc-CCCCcEEEeech Confidence 66777778888888886544322211 11112111111111 11111111111111 1111 122234566666 Q ss_pred CCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHH--------HHHHHHHHHHHHHHhhhccccCCCCCCC-C Q lcl|NC_021326. 369 YNKVANTELQVQTAQQS--MGIVSHETVLENHPFV--EDLQA--------ELERIEQEQMEYNKQLPNLDDGGADGAQ-Q 435 (445) Q Consensus 369 ~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~--~d~~~--------E~~ri~~E~~~~~~~~~~~~~~~~~~~~-~ 435 (445) ..+..|..+.++.+.++ .|+++.-.++++++.- ++-++ -+..+.+.... ..+. ...+..... + T Consensus 350 ~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~~~~~~~~~~~~~~~~~~---~~~~-~~~~~~~~~~~ 425 (454) T protein:vir:93 350 TLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDALYLQQQNYSLEALSRRDAR---EDPF-ASSGKTASVPQ 425 (454) T ss_pred hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeeccCccchHhhhccCcc---cCCC-CCCccCCCCCC Confidence 77778999999988876 5899998888877542 11000 11111111100 0000 000000000 0 Q ss_pred CCCCCC---cCCC Q lcl|NC_021326. 436 KERSND---KQSE 445 (445) Q Consensus 436 ~~~~~d---~~~~ 445 (445) +..+.| +.+| T Consensus 426 ~~~~~d~~~~~~e 438 (454) T protein:vir:93 426 AVAASDGNKAITE 438 (454) T ss_pred CCCCCCCCCCccC Confidence 000011 1111 No 170 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=97.84 E-value=1.6e-05 Score=46.92 Aligned_cols=373 Identities=10% Similarity=0.021 Sum_probs=159.1 Q ss_pred HHHhcCCC----------c--------ccc----ccccccccc-----------cccc---cccccccccchHHHHHHHH Q lcl|NC_021326. 20 QEYYEQRP----------D--------IVK----EPKPVDATG-----------AVDP---LKPDDRMITNFHANLVDQK 63 (445) Q Consensus 20 ~~yy~G~~----------~--------i~~----~~~~~~~~~-----------~~~~---~~~~~ri~~n~~~~iv~~~ 63 (445) .++|.-+- + ++. +........ .... .....-+.++=.-.+|+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~I 80 (441) T protein:vir:79 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMI 80 (441) T ss_pred CccccCccccccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHHH Confidence 11221100 0 000 000000000 0000 0000000011111245555 Q ss_pred HhhhhccCeeeccCch-HHHHHHHHHh-c--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEE Q lcl|NC_021326. 64 VSYIVGKPIAFKHTDD-EVIKRIDEVL-G--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPI 135 (445) Q Consensus 64 ~~~l~g~~~~~~~~d~-~~~~~l~~~~-~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v 135 (445) ++-+..-|+.+.-+.. .....+-..+ . | .-..+...+..+.+.+|.+|+.+..+..|++ .+..++|..+.++ T Consensus 81 a~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~ 160 (441) T protein:vir:79 81 ASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELK 160 (441) T ss_pred HHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEE Confidence 5555555666532211 1111111222 1 3 2345566778889999999999999988986 4888999999887 Q ss_pred EcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC----CCCc Q lcl|NC_021326. 136 WTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN----NDLE 211 (445) Q Consensus 136 ~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n----~~~g 211 (445) .++. +.+.+.+.............+.. . -|++++. ...| T Consensus 161 ~d~~--g~~~~~~~~~~~~~~~~~~~~~~---------------------------------~--dvih~k~~~~dg~~G 203 (441) T protein:vir:79 161 SDAR--GRLYYFHQRIDSNGNNIERNVKF---------------------------------E--DMLDIKFYSLDGING 203 (441) T ss_pred ECCC--ccEEEEEEEeccCCceeEEEEcc---------------------------------c--cEEEeccCCCCCccc Confidence 7642 33332211111000000000000 0 0222221 2357 Q ss_pred CccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEe--cCCc-ccc-hhHHHhh----h----hCceeeccCCCceeeEe Q lcl|NC_021326. 212 ISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLT--NYDD-QEL-PEFKRLL----R----YYGAIKVSDNGGVDTIQ 279 (445) Q Consensus 212 ~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~--g~~~-~~~-~~~~~~~----~----~~~~~~~~~~~~~~~l~ 279 (445) .|.+..+...++.......-..+.+...+.|-.++. |... +.. +..+..+ . .++++.++++.+.+.++ T Consensus 204 ~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~ 283 (441) T protein:vir:79 204 LSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLE 283 (441) T ss_pred cCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEcc Confidence 787777666666555454445555667777766654 4321 111 1111111 1 13356666666655555 Q ss_pred ccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---- Q lcl|NC_021326. 280 VEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD---- 355 (445) Q Consensus 280 ~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~---- 355 (445) .+.....+.+..+.....|+..-++|....+...++.|....... |...|.-++..+...+. T Consensus 284 ~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~~--------------~~~tl~P~~~~ie~eln~kl~ 349 (441) T protein:vir:79 284 VDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLD--------------YLSTLKPYITCVCAELNFKFN 349 (441) T ss_pred CChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHHH--------------HHHHHHHHHHHHHHHHhhhcc Confidence 444445566667777778888888886544322222221111111 11112222222211111 Q ss_pred cCCCcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHhhhccccCCCC Q lcl|NC_021326. 356 IKGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPF--VEDLQAELERIEQEQMEYNKQLPNLDDGGAD 431 (445) Q Consensus 356 ~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~--~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 431 (445) .......+++.+...+-.|..+.++.+.++ .|+++...++++++. +++.+..+-.+...-. .....+.......+ T Consensus 350 ~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~~~n~~-~~~~~~~~~~~~~~ 428 (441) T protein:vir:79 350 DEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHV-NIELVDEYQMNKSR 428 (441) T ss_pred ccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeeccccc-cccccccccccccc Confidence 111123344444555667888889888876 589999999888754 2222221111100000 00000000101111 Q ss_pred CCCCCCCCCCcCCC Q lcl|NC_021326. 432 GAQQKERSNDKQSE 445 (445) Q Consensus 432 ~~~~~~~~~d~~~~ 445 (445) . .+....+++++| T Consensus 429 ~-~~~~~kgGe~~e 441 (441) T protein:vir:79 429 A-TDKKLKGGEENE 441 (441) T ss_pred c-cccccCCCCCCC Confidence 1 111112222222 No 171 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=97.84 E-value=1.6e-05 Score=46.92 Aligned_cols=373 Identities=10% Similarity=0.021 Sum_probs=159.1 Q ss_pred HHHhcCCC----------c--------ccc----ccccccccc-----------cccc---cccccccccchHHHHHHHH Q lcl|NC_021326. 20 QEYYEQRP----------D--------IVK----EPKPVDATG-----------AVDP---LKPDDRMITNFHANLVDQK 63 (445) Q Consensus 20 ~~yy~G~~----------~--------i~~----~~~~~~~~~-----------~~~~---~~~~~ri~~n~~~~iv~~~ 63 (445) .++|.-+- + ++. +........ .... .....-+.++=.-.+|+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~I 80 (441) T protein:vir:94 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMI 80 (441) T ss_pred CccccCccccccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHHH Confidence 11221100 0 000 000000000 0000 0000000011111245555 Q ss_pred HhhhhccCeeeccCch-HHHHHHHHHh-c--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEE Q lcl|NC_021326. 64 VSYIVGKPIAFKHTDD-EVIKRIDEVL-G--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPI 135 (445) Q Consensus 64 ~~~l~g~~~~~~~~d~-~~~~~l~~~~-~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v 135 (445) ++-+..-|+.+.-+.. .....+-..+ . | .-..+...+..+.+.+|.+|+.+..+..|++ .+..++|..+.++ T Consensus 81 a~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~ 160 (441) T protein:vir:94 81 ASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELK 160 (441) T ss_pred HHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEE Confidence 5555555666532211 1111111222 1 3 2345566778889999999999999988986 4888999999887 Q ss_pred EcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC----CCCc Q lcl|NC_021326. 136 WTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN----NDLE 211 (445) Q Consensus 136 ~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n----~~~g 211 (445) .++. +.+.+.+.............+.. . -|++++. ...| T Consensus 161 ~d~~--g~~~~~~~~~~~~~~~~~~~~~~---------------------------------~--dvih~k~~~~dg~~G 203 (441) T protein:vir:94 161 SDAR--GRLYYFHQRIDSNGNNIERNVKF---------------------------------E--DMLDIKFYSLDGING 203 (441) T ss_pred ECCC--ccEEEEEEEeccCCceeEEEEcc---------------------------------c--cEEEeccCCCCCccc Confidence 7642 33332211111000000000000 0 0222221 2357 Q ss_pred CccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEe--cCCc-ccc-hhHHHhh----h----hCceeeccCCCceeeEe Q lcl|NC_021326. 212 ISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLT--NYDD-QEL-PEFKRLL----R----YYGAIKVSDNGGVDTIQ 279 (445) Q Consensus 212 ~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~--g~~~-~~~-~~~~~~~----~----~~~~~~~~~~~~~~~l~ 279 (445) .|.+..+...++.......-..+.+...+.|-.++. |... +.. +..+..+ . .++++.++++.+.+.++ T Consensus 204 ~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~ 283 (441) T protein:vir:94 204 LSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLE 283 (441) T ss_pred cCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEcc Confidence 787777666666555454445555667777766654 4321 111 1111111 1 13356666666655555 Q ss_pred ccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---- Q lcl|NC_021326. 280 VEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD---- 355 (445) Q Consensus 280 ~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~---- 355 (445) .+.....+.+..+.....|+..-++|....+...++.|....... |...|.-++..+...+. T Consensus 284 ~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~~--------------~~~tl~P~~~~ie~eln~kl~ 349 (441) T protein:vir:94 284 VDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLD--------------YLSTLKPYITCVCAELNFKFN 349 (441) T ss_pred CChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHHH--------------HHHHHHHHHHHHHHHHhhhcc Confidence 444445566667777778888888886544322222221111111 11112222222211111 Q ss_pred cCCCcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHhhhccccCCCC Q lcl|NC_021326. 356 IKGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPF--VEDLQAELERIEQEQMEYNKQLPNLDDGGAD 431 (445) Q Consensus 356 ~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~--~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 431 (445) .......+++.+...+-.|..+.++.+.++ .|+++...++++++. +++.+..+-.+...-. .....+.......+ T Consensus 350 ~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~~~n~~-~~~~~~~~~~~~~~ 428 (441) T protein:vir:94 350 DEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHV-NIELVDEYQMNKSR 428 (441) T ss_pred ccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeeccccc-cccccccccccccc Confidence 111123344444555667888889888876 589999999888754 2222221111100000 00000000101111 Q ss_pred CCCCCCCCCCcCCC Q lcl|NC_021326. 432 GAQQKERSNDKQSE 445 (445) Q Consensus 432 ~~~~~~~~~d~~~~ 445 (445) . .+....+++++| T Consensus 429 ~-~~~~~kgGe~~e 441 (441) T protein:vir:94 429 A-TDKKLKGGEENE 441 (441) T ss_pred c-cccccCCCCCCC Confidence 1 111112222222 No 172 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=97.82 E-value=1.7e-05 Score=46.74 Aligned_cols=362 Identities=10% Similarity=-0.001 Sum_probs=155.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) +..++-...............+..+.. .. ....... .....-+..+-.-.+|+..++-+.+-|+.+.-.. T Consensus 3 ~f~~~~~~~~~~~~~~~~~~~~~~~~~----~~--~~~~~~~--v~~~~al~~~~v~~~i~~ia~~ia~~p~~~~~~~-- 72 (386) T protein:vir:49 3 IFNITNLATESPPINQESFFDIADSDF----LA--SLNSSEW--VSAENALKNSDLFSIISQLSNDLATAKITTSRKQ-- 72 (386) T ss_pred hhhhhccCCCCcccchhhhhhhhhccc----cc--cccCCce--echhhhhccHHHHHHHHHHHHHhhhCceeeccch-- Confidence 222221111000000000000000000 00 0000000 0000011122223455666666666666553222 Q ss_pred HHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEEEEeeecc-e Q lcl|NC_021326. 81 VIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE-T 157 (445) Q Consensus 81 ~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~-~ 157 (445) ....+..=.. .........+..+.+.+|.+|+.+..+..|++ .+..++|..+.++.++. .+.+.+.+.+ .... . T Consensus 73 ~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~~~~~-~~~~~y~~~~--~~~~~~ 149 (386) T protein:vir:49 73 LQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDN-QNGLYYNITF--DDPHIA 149 (386) T ss_pred hhhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEEEcCC-CceEEEEEEE--cCcccc Confidence 1122211111 13456677788899999999999988888886 57888998887765542 1222111100 0000 0 Q ss_pred eEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 158 KVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDL 232 (445) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~lid~~~~~~s~~ 232 (445) ....+ +... |+|++. ...|.|.+..+...++....+..-. T Consensus 150 ~~~~~---------------------------------~~~e--vih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~ 194 (386) T protein:vir:49 150 PKQHV---------------------------------PQND--ILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLT 194 (386) T ss_pred ceeEE---------------------------------cccc--EEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHH Confidence 00000 0001 333332 1257888887777777666555555 Q ss_pred HHHHHHhcCCeeEEec--CCcccc-hhHHHhh-----hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCc Q lcl|NC_021326. 233 SNTFKDSNELTYVLTN--YDDQEL-PEFKRLL-----RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQA 304 (445) Q Consensus 233 ~~~~~~~~~~~l~~~g--~~~~~~-~~~~~~~-----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~ 304 (445) .+.+...+.|-.++.- ...++. ....... ...+++.++++.+++-+..+.....+.+..+.....|+..-++ T Consensus 195 ~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgV 274 (386) T protein:vir:49 195 ISALKNALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGI 274 (386) T ss_pred HHHHHccCCccEEEEeCCCCChHHHHHHHHHHHHhccCCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCC Confidence 5666777777666543 222111 1111111 1233555565555555544444455667778888888888888 Q ss_pred ccccccccc-CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHH Q lcl|NC_021326. 305 VDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQ 383 (445) Q Consensus 305 p~~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~ 383 (445) |..-.+... +..++..++..+ ...++.+++.+...+...- ...+++.....+-.|..+.+..+. T Consensus 275 Pp~~lg~~~~~~~~~~~~~~~~--------------~~~i~~~l~~i~~~~~~~l-~~~~~~~~~~~~~~d~~~~~~~~~ 339 (386) T protein:vir:49 275 PESIVGGDGDQQSSLEMIYNIY--------------FKSVSRYLRPFVSEMSKKL-SCEVDVDISPAVDPTGSNYISLIN 339 (386) T ss_pred CHHHhCCCCCccchHHHHHHHH--------------HHHHHHHHHHHHHHHHHHh-cchhcccchhhhccCHHHHHHHHH Confidence 875543222 223333333221 1122222222211111000 012333334444456666777776 Q ss_pred HH--hccCChHHHHHhCC---CCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 384 QS--MGIVSHETVLENHP---FVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 384 ~~--~g~~s~et~l~~l~---~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++ +|+++.-.+++++. +..+ ++.+. . .......+++++ +++ + T Consensus 340 ~l~~~g~~t~nE~r~~l~~~~~~~~---~~~~~--------~---~~~~~~~~gGd~-----~~~-~ 386 (386) T protein:vir:49 340 SMVKSGTLAQNQGLYILQQAEILPK---ELPDG--------K---NPNRTSLKGGEI-----NEQ-D 386 (386) T ss_pred HHHhCCCcCHHHHHHHHhhCCCCCC---cCcch--------h---ccCCCCCCCCCC-----CCC-C Confidence 65 48899888887652 2221 11000 0 000000011111 111 1 No 173 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=97.82 E-value=1.7e-05 Score=46.72 Aligned_cols=415 Identities=10% Similarity=-0.004 Sum_probs=174.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C-----ee Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P-----IA 73 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~-----~~ 73 (445) .+.++.++++ |.+-...+.++++=- +|..-...... ......++..+-+...++++++.|++- | ++ T Consensus 4 ~~~~~~~~lk-R~~~e~~w~e~a~~t--lP~~~~~~~~~----~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 76 (510) T protein:vir:63 4 TAAMLWEKLR-DGSVEQRAIEFAKTT--LPYLMVDPMSG----SRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) T ss_pred HHHHHHHHHh-ccchHHHHHHHHHhh--ccccCCCCCCc----cccccCCCccchHHHHHHHHHHHHHhhhcCCCCcccc Confidence 3333333332 222223344444221 11111110000 111112345566777777777776542 2 22 Q ss_pred eccCch-------------HHHHHH--------HHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEcccee Q lcl|NC_021326. 74 FKHTDD-------------EVIKRI--------DEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQG 132 (445) Q Consensus 74 ~~~~d~-------------~~~~~l--------~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~ 132 (445) +...++ ++.+.+ ..+..+||...+.++.++..++|.+.++ .++++. +++.++-.+ T Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~--~~~~~~-~~~~~pl~~- 152 (510) T protein:vir:63 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLY--RDSDAA-TVVAWSLRS- 152 (510) T ss_pred cCCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEE--EcCCCc-EEEEEEcce- Confidence 333221 122222 2233468899999999999999997555 466654 566665555 Q ss_pred EEEEcCCCCCceEEEEEEEeee--------------------cceeEEEEecceEEEEEEecc--eeeeccccccccccc Q lcl|NC_021326. 133 IPIWTDKEHEELEAFIRMYKLE--------------------NETKVEYWDKITVNYYVYENG--SLIPDYSNNLENSKT 190 (445) Q Consensus 133 ~~v~d~~~~~~~~~~v~~~~~~--------------------~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~ 190 (445) |.+..|. .+++...+|.++.. ....+++++... ..... ..........+.... T Consensus 153 y~v~~d~-~G~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~V~----~~~~~~~~~~sv~~e~dg~~~~ 227 (510) T protein:vir:63 153 YAVRRDA-TGRWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTHVQ----RKKGTAMEYAELYHEIDGVRVG 227 (510) T ss_pred eEEeeCC-CcCeeEEEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEEEE----eecCCCceEEEEEEEecCceec Confidence 4444443 46666655554321 011222222111 00010 000000000001111 Q ss_pred ccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhh-hhC Q lcl|NC_021326. 191 HFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLL-RYY 264 (445) Q Consensus 191 ~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~-~~~ 264 (445) .....++..+|++.+. .+.+|+|-.....+-+..++...-...........|.+.+.- +.. ....... ... T Consensus 228 ~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p-~g~--~~~~~~~~~~~ 304 (510) T protein:vir:63 228 KEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE-AKG--AVVDDYQDAEM 304 (510) T ss_pred cccccccccCceeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCc-ccc--cchhhhccCCC Confidence 1223345567776553 345799999999999999998877777766776766644321 110 0111111 122 Q ss_pred ceeeccCCCceeeEec--cCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHH-----HHH Q lcl|NC_021326. 265 GAIKVSDNGGVDTIQV--EVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADK-----LAR 337 (445) Q Consensus 265 ~~~~~~~~~~~~~l~~--~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~-----~~~ 337 (445) +.+..+...+++.+.. ..+.......++.++..|...-.. ++. ...+...||+.+......+.....- ... T Consensus 305 g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~-~l~-~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E 382 (510) T protein:vir:63 305 GDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAEN 382 (510) T ss_pred ceeecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHHh-hcc-cCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHH Confidence 3343333445666543 245677777788877777664322 111 2223446777776543333333221 122 Q ss_pred HHHHHHHHHHHHHHHHhccCC---C-cceEEEEeCCCCCCCHHHHHHHH---HH----Hhc---c---CChHHHHHhC-- Q lcl|NC_021326. 338 KAKVAIQELLWFVFEHFDIKG---E-HKDVDISFNYNKVANTELQVQTA---QQ----SMG---I---VSHETVLENH-- 398 (445) Q Consensus 338 ~~~~~l~~~~~~~~~~~~~~~---~-~~~i~v~f~~~~p~d~~~~~~~~---~~----~~g---~---~s~et~l~~l-- 398 (445) .+.+-+.+++.++.+- +... + .....|++-..+-+ .+.++.+ .+ +.+ + +....++..+ T Consensus 383 ~l~Pli~r~~~il~r~-gl~p~p~~~~~~~~v~~is~Lar--aq~~~~l~~~~q~l~~~~~~aq~~~~id~d~~~~~~a~ 459 (510) T protein:vir:63 383 LQSPLAYVCLSEVDDA-LLQGLITKQHKPAIETGLPALSR--SAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWA 459 (510) T ss_pred HHHHHHHHHHHHHHhc-cCCCCCchhcccceecchhHHHH--HHHHHHHHHHHHHHHHhcCchhhhccCCHHHHHHHHHH Confidence 2333344444444331 2111 1 11112222222221 1121111 11 111 1 1223333322 Q ss_pred -CCCC-----CHHHHHHHHHHHHHHHH----HhhhccccCCCCCCCCCCCC Q lcl|NC_021326. 399 -PFVE-----DLQAELERIEQEQMEYN----KQLPNLDDGGADGAQQKERS 439 (445) Q Consensus 399 -~~~~-----d~~~E~~ri~~E~~~~~----~~~~~~~~~~~~~~~~~~~~ 439 (445) -+|+ -.++|++++.+++.+.. +.......+.+..+.....- T Consensus 460 ~~Gv~p~~ivrs~eev~a~~~~~~qq~~~~~~~~~~~~~~a~~~~~~~~g~ 510 (510) T protein:vir:63 460 AFSVDTSQFYKSADELQAEAEQQRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) T ss_pred HhCCChhHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC Confidence 1231 12455555544422111 11111111111111111111 No 174 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=97.79 E-value=1.9e-05 Score=46.51 Aligned_cols=393 Identities=9% Similarity=0.044 Sum_probs=159.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) ...+.-...-.|.. ......-+ |.+++..-+... ... .+ ..-..++...+|+..+.-+.+-|+.+...++. T Consensus 53 ~~~d~~~~~~~r~g-~~~~~~~~-g~~~~~epp~d~-~~l----~~--l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~ 123 (648) T protein:vir:79 53 AKRDPKMSLVKRIG-LAIMDGGG-GGRDFEEPEFDF-NEI----TS--AYNTEGYVRQAVDKYIEMMFKADWDFVSKNPN 123 (648) T ss_pred ccccchhHHHHHhH-HHHHhhcC-CccccccCCcCH-HHH----HH--HHhcChHHHHHHHHHHHHHhhCcceEEecCCc Confidence 11111000111110 11111112 222221111000 000 00 00124567778888888888878776654432 Q ss_pred -HHHH-HHH-H-hcc---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcEE----------------EEEEccceeEEEEc Q lcl|NC_021326. 81 -VIKR-IDE-V-LGN---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFK----------------LFRVPAEQGIPIWT 137 (445) Q Consensus 81 -~~~~-l~~-~-~~n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~----------------i~~~~p~~~~~v~d 137 (445) +... ... . .-| +...+...+..+.+.+|.+|+.+-.+.+|.+- +..++|..+.+..+ T Consensus 124 ~~~~~~~~~ll~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d 203 (648) T protein:vir:79 124 AVEYIRMRFTLMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRD 203 (648) T ss_pred cchhhHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeEEEEc Confidence 1111 111 1 112 45566777888999999999999998887421 11122222222211 Q ss_pred CCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec-----CCCCcC Q lcl|NC_021326. 138 DKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK-----NNDLEI 212 (445) Q Consensus 138 ~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~ 212 (445) + . ..+..|.+.... +.. ...|..=.|+|++ +...|. T Consensus 204 ~--~-----------------------g~~~~Y~y~~~g---------~~~-----~~~~~~~dIIHik~~~~~d~~~Gl 244 (648) T protein:vir:79 204 K--F-----------------------GMIKGWQQEQEG---------QDK-----PQKFKPEDIVHIYYKREKGRAFGT 244 (648) T ss_pred C--C-----------------------CceeeeEEEecC---------Cce-----eEEecCccEEEEccCCCCCCceec Confidence 1 0 011111111000 000 0001001244443 234688 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEec-CCcccchhHHH---hh-hhCceeeccC-CCceeeEecc--CC- Q lcl|NC_021326. 213 SDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTN-YDDQELPEFKR---LL-RYYGAIKVSD-NGGVDTIQVE--VP- 283 (445) Q Consensus 213 s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g-~~~~~~~~~~~---~~-~~~~~~~~~~-~~~~~~l~~~--~~- 283 (445) |.+..+...|+....+......-+...+.|-.++.- ......+.... .+ ...+...+.+ ..+.+.+..+ .. T Consensus 245 Spi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~~~~~~e~~k~~~e~~~~~~~~~~i~gg~v~~~~~~i~~~~s~ 324 (648) T protein:vir:79 245 PWLLPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGLEQEGFGAEEGEVDLVRGEVENMDVEGGMVTTERVNISSIASN 324 (648) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCccchHHHHHHHHHHHHhcccccccccccccceeeccccCCH Confidence 888877777776666555556667777888766542 11111111111 11 1111122222 2222322221 12 Q ss_pred -hHHHHHHHHHHHHHHHHHhCccccccccccC--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH---HHhcc Q lcl|NC_021326. 284 -VENSKKYLDELYQKIMLFGQAVDFSSDKFGS--APSGVALEFLYTNLNLKADKLARKAKVAIQEL-LWFVF---EHFDI 356 (445) Q Consensus 284 -~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~--~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~-~~~~~---~~~~~ 356 (445) ...+.+..+...+.|...-++|....+...+ ..++.+....+.. .+.-.+..+...+... ++.+. .+... T Consensus 325 ~dlqfle~rk~~~~eIa~aFgVPP~lLG~~~~ss~stae~~~~~~~~---~i~~l~~~i~~~le~~~~~~ll~e~~l~~~ 401 (648) T protein:vir:79 325 QIIDAKEYLKHFEQRAFTVLGVSELMMGRGGTASRSTGDNLSSDFKD---RIKALQKVMATFINEFMVKEILMEGGFDPV 401 (648) T ss_pred HHHHHHHHHHHHHHHHHHHhCCCHhHcccCCCccchHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 1235555666777888888888755432221 1223333322222 1212222222222211 11111 11111 Q ss_pred CCCcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHH-----HHHHH--HHh-hhc Q lcl|NC_021326. 357 KGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPF--VEDLQAELERIEQ-----EQMEY--NKQ-LPN 424 (445) Q Consensus 357 ~~~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~--~~d~~~E~~ri~~-----E~~~~--~~~-~~~ 424 (445) ......+++.|++....|....++.+.++ +|++|...++++++. +++.... ..+.. .++.. ... .+. T Consensus 402 l~~d~~ieF~~~~Llr~D~~~~a~~~~~l~~~GilT~NEaR~~lGlpPi~~g~~~-~~l~~~~~~~~~~~~~~~~~~~~~ 480 (648) T protein:vir:79 402 LNPDDKVEFRFNEIDMDSKIKLENQAVFLYEHNAISEDEMRELIGRDPVDDGEGR-AKMHLQMVTIAQATALAALAPTPA 480 (648) T ss_pred ccccceEEEeecccchhhHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCc-cccccccccchhccccccCCCCCC Confidence 11123467778877778888888888775 599999999988754 2221110 01110 00000 000 000 Q ss_pred ccc-CCCCCC-CCCCCCCCcCCC Q lcl|NC_021326. 425 LDD-GGADGA-QQKERSNDKQSE 445 (445) Q Consensus 425 ~~~-~~~~~~-~~~~~~~d~~~~ 445 (445) ... ..++.+ ...+..+++..| T Consensus 481 ~~~~~~a~~eg~~~e~~~~~~~~ 503 (648) T protein:vir:79 481 GGSSASASGDKKKKATDNKTKPT 503 (648) T ss_pred CCCCCCccccccccccCCCCCCC Confidence 000 000000 001111111111 No 175 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=97.78 E-value=1.9e-05 Score=46.41 Aligned_cols=413 Identities=10% Similarity=0.020 Sum_probs=174.8 Q ss_pred ChHHHHHHHHH-HHH---HHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C--- Q lcl|NC_021326. 1 MIVRYIKQHLE-KLP---EISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P--- 71 (445) Q Consensus 1 ~l~~~i~~~~~-~~~---~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~--- 71 (445) -|.+..+.++. |.+ +++.+.+|.. |.+-..... .....++..+-+...++++++.|++- | T Consensus 11 ~l~~r~~~Lk~~R~~~e~~w~e~~~~~l-----P~~~~~~~~------~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~ 79 (517) T protein:vir:10 11 KIPKLYEQLVGKRSPFLSRAENYSRFTL-----PYLMADVND------DLSSQNAWQDDGASATNFLSNKLSQVLFPAQR 79 (517) T ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHhc-----cccccCCCC------CccccccccchHHHHHHHHHHHHHHhhcCCCC Confidence 33444444433 322 3444444432 111111000 01122455667777788887776542 2 Q ss_pred --eeeccCch-------------HHHHHH--------HHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEc Q lcl|NC_021326. 72 --IAFKHTDD-------------EVIKRI--------DEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVP 128 (445) Q Consensus 72 --~~~~~~d~-------------~~~~~l--------~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~ 128 (445) +++...+. ++...+ ..+..+||...+.++.++..++|.+.++ .++ +...++.++ T Consensus 80 ~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly--~~~-~~~~~~~~p 156 (517) T protein:vir:10 80 SFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMY--HPD-KTSPIQAVP 156 (517) T ss_pred ccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEE--EeC-CCCcEEEEE Confidence 23333222 122222 2223468999999999999999998654 443 333566665 Q ss_pred cceeEEEEcCCCCCceEEEEEEEee----------------------ecceeEEEEecceEEEEEEecceeeeccccccc Q lcl|NC_021326. 129 AEQGIPIWTDKEHEELEAFIRMYKL----------------------ENETKVEYWDKITVNYYVYENGSLIPDYSNNLE 186 (445) Q Consensus 129 p~~~~~v~d~~~~~~~~~~v~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 186 (445) -.+ |.+..|. .+++...++..+. +....+++|+... ...+..+..... ..+ T Consensus 157 l~~-y~v~~d~-~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~---~~~~~~~~~~~~--~d~ 229 (517) T protein:vir:10 157 LHH-YCVRRDN-NGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAK---RTKDGKYLIRQS--ADD 229 (517) T ss_pred cCe-EEEeeCC-CcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEEE---EeCCCceEEEEE--eCc Confidence 555 4444443 4666555544321 1112233332111 111111111000 001 Q ss_pred ccccccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhh Q lcl|NC_021326. 187 NSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLL 261 (445) Q Consensus 187 ~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~ 261 (445) .........++..+|++.+. ++.+|+|-.....+-+..++...-...........|.+.+.-.......... . T Consensus 230 ~~~~~~s~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~~~~l~--~ 307 (517) T protein:vir:10 230 VPVGKESTVTEDKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTDINQFV--E 307 (517) T ss_pred eeeccccccccccCCeeeeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhcc--C Confidence 11111122234567776553 3468999999999999999988777777677777666654211111101100 1 Q ss_pred hhCceeeccCCCceeeEecc--CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 262 RYYGAIKVSDNGGVDTIQVE--VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKA 339 (445) Q Consensus 262 ~~~~~~~~~~~~~~~~l~~~--~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~ 339 (445) ...+.+..+..+++..+... .+.......++.++..|...-....+. ...+...||+.+... ..++...+ T Consensus 308 ~~~g~~~~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~-~~~~~rvTAtEV~~r-------~~E~~~~L 379 (517) T protein:vir:10 308 GGSGAVLHGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMMEAMT-RRDAERVTAYEIQRD-------AMLVEQSL 379 (517) T ss_pred CCccccccCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhh-ccCCccccHHHHHHH-------HHHHHHHh Confidence 11123333334455555432 356667777777777776554332211 122234677766543 34444445 Q ss_pred HHHHHH--------HHHHHHHHhccCCCcceEEEEeCCCCC-CCHHHHHHHHHHH---hc-c----------CChHHHHH Q lcl|NC_021326. 340 KVAIQE--------LLWFVFEHFDIKGEHKDVDISFNYNKV-ANTELQVQTAQQS---MG-I----------VSHETVLE 396 (445) Q Consensus 340 ~~~l~~--------~~~~~~~~~~~~~~~~~i~v~f~~~~p-~d~~~~~~~~~~~---~g-~----------~s~et~l~ 396 (445) +..+.+ ++..++..+....-...+++.+.-++. ......++.+... .| + +..+.++. T Consensus 380 Gpv~~rl~~Ell~Pli~r~~~~l~~~l~~~~v~~~~~s~la~l~r~~~~~~i~~~~~~i~~~a~~~~~~~~~id~d~~~~ 459 (517) T protein:vir:10 380 GGVYSLFATTFQGPLARWFMNGISSILTSKNVSPTILTGIEALGRMAELDKLGTFNGYVSMTAQWPEPLQQAIKWPDFTD 459 (517) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhhhhcCCCCccceeeccHHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHH Confidence 544333 222222222222222234444322221 1112222222221 11 1 11222332 Q ss_pred hC---CCCC----CHHHHHHHHHHHHHHHHHh---hhccccCCCCCCCCCCCCCCcCC Q lcl|NC_021326. 397 NH---PFVE----DLQAELERIEQEQMEYNKQ---LPNLDDGGADGAQQKERSNDKQS 444 (445) Q Consensus 397 ~l---~~~~----d~~~E~~ri~~E~~~~~~~---~~~~~~~~~~~~~~~~~~~d~~~ 444 (445) .+ -+++ -.++|+++.++++.+.... +.......+.....++..++-.+ T Consensus 460 ~~a~~~Gvp~~~irs~~ev~~~~~~~~~~~~~~~~~~~ag~~~~~~~~~~~~~~~~~~ 517 (517) T protein:vir:10 460 WVQGQISANFPFFKTQDELNAEAQAQQEQEATKYAAEQAGKAIPDMVKNGQINPQGGQ 517 (517) T ss_pred HHHHHhCCChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCCC Confidence 21 1222 1235555544443322111 11111111111111111111111 No 176 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=97.75 E-value=2.2e-05 Score=46.10 Aligned_cols=398 Identities=14% Similarity=0.086 Sum_probs=157.7 Q ss_pred ChHHHHHH-----HHHHH-H--HHHH-HHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccC Q lcl|NC_021326. 1 MIVRYIKQ-----HLEKL-P--EISI-GQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKP 71 (445) Q Consensus 1 ~l~~~i~~-----~~~~~-~--~~~~-~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~ 71 (445) |-...+.. .+... . ..+. -..++.+- . .+...... ..+..-..+....+|+..+..+.+-| T Consensus 1 ~~~~~~~i~s~~~~~~i~~~~~~s~~~~~~~~~~~---~-~pp~~~~~------la~l~~~n~~v~scI~~ia~~IA~l~ 70 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKREEVESQALGETRFEEY---V-EPKVNPLV------LLSLLQVNPYHASACSIKANDIIRTG 70 (542) T ss_pred CccccccccccccchhhhhccccccccccccCCcc---c-cCCCCHHH------HHHHHhhcHHHHHHHHHHHHHHhhCc Confidence 11111100 00000 0 0000 00000000 0 00000000 00000123456778888888888888 Q ss_pred eeeccCchHHHHHHHHHhcc---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcEE-EEEEccceeEEEEcCCCCCceEEE Q lcl|NC_021326. 72 IAFKHTDDEVIKRIDEVLGN---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFK-LFRVPAEQGIPIWTDKEHEELEAF 147 (445) Q Consensus 72 ~~~~~~d~~~~~~l~~~~~n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~-i~~~~p~~~~~v~d~~~~~~~~~~ 147 (445) +.+..++.. .+..++.| +.......+..+.+.+|.+|+.+..+..|++. +..++|..+.+..+... + T Consensus 71 ~~~~~~~~~---~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v~~d~~~------~ 141 (542) T protein:vir:41 71 YILEGDDEG---VVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRVHKDGSR------Y 141 (542) T ss_pred eeeecccch---hhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceEEEEcCCe------e Confidence 888655443 33444444 35566777888999999999999999889865 78888888766544321 1 Q ss_pred EEEEeeecceeEEEEecceEEEEEEecce-eeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHH Q lcl|NC_021326. 148 IRMYKLENETKVEYWDKITVNYYVYENGS-LIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTL 221 (445) Q Consensus 148 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~l 221 (445) +.++..... .++...... ......+. .. ..+..=-|+|+++ ...|.|.+...... T Consensus 142 ~~~~~~~~~-----------~~~~~y~~~~~~~~~~g~--~~------~~~~~~eIiHir~~~~~~~~~Glspi~~~~~~ 202 (542) T protein:vir:41 142 RQTWDGVNI-----------THFKDYRYEGEINPETGE--DQ------DSVGANELVFIHIPSPVCSYYGVPRYVSAAPA 202 (542) T ss_pred EeeecCCcc-----------eeEEeecccccccccccc--cc------cccCcccEEEecCCCCCCCcccccHHHHHHHH Confidence 111111111 011100000 00000000 00 0011111455542 23578877766665 Q ss_pred HHHHHHHHHHHHHHHHHhcCCeeE--EecCCccc-----------chhHHHhh----h----h-Cceeecc----CCCce Q lcl|NC_021326. 222 IDAYNRRLSDLSNTFKDSNELTYV--LTNYDDQE-----------LPEFKRLL----R----Y-YGAIKVS----DNGGV 275 (445) Q Consensus 222 id~~~~~~s~~~~~~~~~~~~~l~--~~g~~~~~-----------~~~~~~~~----~----~-~~~~~~~----~~~~~ 275 (445) +.....+.....+.+...+.|-.+ +.|...+. ...++..+ . . .+++.++ .++++ T Consensus 203 i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~ 282 (542) T protein:vir:41 203 ILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKV 282 (542) T ss_pred HHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCcccce Confidence 555444433344445555566444 44432211 11111111 1 1 2233332 23455 Q ss_pred eeEeccC--ChHHHHHHHHHHHHHHHHHhCccccccccccCcc-hHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 276 DTIQVEV--PVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAP-SGVALEFLYTNL-NLKADKLARKAKVAIQELLWFVF 351 (445) Q Consensus 276 ~~l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~-Sg~Ai~~~~~~l-~~k~~~~~~~~~~~l~~~~~~~~ 351 (445) +|..... ....+.+..+...+.|...-++|....+...++. ++.-++...... ...+.-....+...+.+ T Consensus 283 ~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~~~~f~~~tL~P~~~~ie~~ln~------ 356 (542) T protein:vir:41 283 TFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVTRRTYYESVVRPQQNIISSILTD------ 356 (542) T ss_pred eEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHh------ Confidence 6655443 3345556667777888888888865443222111 111111111111 11111112222222221 Q ss_pred HHhccCCCcceEEEEeCC--CCCCCHHHHHHHHHHHhccCChHHHHHhCCCCC---CHH-----HHHHHHHHH------- Q lcl|NC_021326. 352 EHFDIKGEHKDVDISFNY--NKVANTELQVQTAQQSMGIVSHETVLENHPFVE---DLQ-----AELERIEQE------- 414 (445) Q Consensus 352 ~~~~~~~~~~~i~v~f~~--~~p~d~~~~~~~~~~~~g~~s~et~l~~l~~~~---d~~-----~E~~ri~~E------- 414 (445) .+.... ...+.+.|+. .+..|..+.++. ....|+++...+++.|+.++ |+- ...+.++.. T Consensus 357 -~L~~~~-~~~~~~~f~~~~ll~~d~~~~~~~-~v~~GilT~NE~Re~L~g~~pgdd~~l~p~~~~~~~~~~~~~n~~~~ 433 (542) T protein:vir:41 357 -FFQVKF-NPKTRFKFNDETLLESDSVRNCAL-LVQSGVLTPAEARERLFGLDGGPDIFMVPSKGAAKSVKRQERNYEKN 433 (542) T ss_pred -hccccc-CCceEEEecchhhcchHHHHHHHH-HHhCCCCCHHHHHHhhCCCCCCCccccccccccccccccCCcCCCCC Confidence 111111 1244566643 333443433332 22368999888887664432 220 000111100 Q ss_pred -HHHHHHhhhcccc----CC---CCCCCCCCCCCCcCCC Q lcl|NC_021326. 415 -QMEYNKQLPNLDD----GG---ADGAQQKERSNDKQSE 445 (445) Q Consensus 415 -~~~~~~~~~~~~~----~~---~~~~~~~~~~~d~~~~ 445 (445) ..+..+......+ .. ..+.......+.+++| T Consensus 434 ~~~~~~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (542) T protein:vir:41 434 QIREIRKIYAKYRPRFNEIISSKLSAEEKKKKIDESLAE 472 (542) T ss_pred chhhhhhcccccCccccccccccccchhhcccccchhhh Confidence 0011111111110 00 0011111122222233 No 177 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=97.61 E-value=3.7e-05 Score=44.87 Aligned_cols=372 Identities=11% Similarity=0.013 Sum_probs=162.7 Q ss_pred Ch-HHHHHHHHHHHHHHHH-HHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeec-cC Q lcl|NC_021326. 1 MI-VRYIKQHLEKLPEISI-GQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFK-HT 77 (445) Q Consensus 1 ~l-~~~i~~~~~~~~~~~~-~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~-~~ 77 (445) |+ ++.-+++.+....... +-.+ -|..+. +... . ..+-+.+.-...+|+..++-+.+-|+.+- .. T Consensus 1 m~f~~~~~~~~~~~~~~~~~~~~~-~g~~~~--------~~~v---~-~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~ 67 (409) T protein:vir:10 1 MLFRKGFKNQSQEISIDDKKILEW-LGINPS--------ETYV---N-GKSCLKQATVFGCIRILSDNISKLPIKIYQKK 67 (409) T ss_pred CcccccccCcCCCCCCChHHHHHH-hcCCcC--------ccee---c-hhhhhccHHHHHHHHHHHHhhhhCceEEEEec Confidence 11 1111111111000000 0011 111000 0000 0 00001223334466666666655666541 11 Q ss_pred ch---HHHHHHHHHh-c--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEE Q lcl|NC_021326. 78 DD---EVIKRIDEVL-G--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAF 147 (445) Q Consensus 78 d~---~~~~~l~~~~-~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~ 147 (445) +. .....+...+ . | ........+..+.+.+|.+|+.+..+..|++ .+..++|..+.++.++........- T Consensus 68 ~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~~~ 147 (409) T protein:vir:10 68 DGIKRVPDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSDGMKIFVDDTGLLNSENN 147 (409) T ss_pred CCeeeccCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCCceEEEEcCCccccccce Confidence 11 1111122222 1 2 3456677788899999999999999988886 4778888888777654211100000 Q ss_pred EEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCcCccHHHHHHHHH Q lcl|NC_021326. 148 IRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLEISDIFMYKTLID 223 (445) Q Consensus 148 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g~s~~~~v~~lid 223 (445) +.|. +....+.... .+.. -|++++ +...|.|.+..+...++ T Consensus 148 ~~y~------------------~~~~~g~~~~---------------~~~~--evih~r~~~~d~~~G~s~i~~~~~~i~ 192 (409) T protein:vir:10 148 VWYL------------------YTDDLGQRHK---------------FMSD--EILHFKGLTADGLAGLSVIELLNHLIE 192 (409) T ss_pred EEEE------------------EEeCCceeEE---------------eccc--cEEEecCcCCCCcccccHHHHHHHHHH Confidence 0000 0000000000 0000 123332 22357788877777776 Q ss_pred HHHHHHHHHHHHHHHhcCCeeEEecCC---cccchhHHHhh-------h-hCceeeccCCCceeeEeccCChHHHHHHHH Q lcl|NC_021326. 224 AYNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLL-------R-YYGAIKVSDNGGVDTIQVEVPVENSKKYLD 292 (445) Q Consensus 224 ~~~~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~-------~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~ 292 (445) ....+.......+...+.|-.++.... .+..+.....+ . ..+++.++++.+.+.+..+.....+.+..+ T Consensus 193 ~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~ 272 (409) T protein:vir:10 193 NGKSSETYLNNFFKNGLQVKGLVQYAGDLNPEAEEVFKENFERMSSGLKNAHRIAMLPIGYKFEPISQKLVDAQFLENSQ 272 (409) T ss_pred HHHHHHHHHHHHHhccCCCcEEEEcCCCCCHHHHHHHHHHHHHHhccccccCCceecCCCceEEEccCChhhHHHHHHHH Confidence 666555555666677777766655322 22222222111 1 223555666665555544444455666677 Q ss_pred HHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----cCC--CcceEEE Q lcl|NC_021326. 293 ELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD-----IKG--EHKDVDI 365 (445) Q Consensus 293 ~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~-----~~~--~~~~i~v 365 (445) ...+.|+..-++|....+... ..++..++... ...+..+|+-+++.+...+. ... ....+++ T Consensus 273 ~~~~~Ia~~fgVPp~~lg~~~-~~~~~~~e~~~----------~~f~~~~l~P~~~~ie~~ln~kL~~~~~~~~~~~~~f 341 (409) T protein:vir:10 273 LTIRQIASVFGVKMHQLNDLD-RATHSNITEQN----------REFYIDTLQSILNMYELEINYKLFLISEIKNGFYSKF 341 (409) T ss_pred HHHHHHHHHhCCCHHHcCCCC-CCccccHHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcCchhccCCcEEEE Confidence 788888888888865443221 11222222111 11222223333332222211 111 1123455 Q ss_pred EeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCc Q lcl|NC_021326. 366 SFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDK 442 (445) Q Consensus 366 ~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 442 (445) .+...+-.|..+.++.+.++ +|+++.-.++++++.-.- +-.+++ .... +.. ...+.+++..+.+++ T Consensus 342 d~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~--~ggD~~------~~~~--n~~-~~~~~~~~~~kgGe~ 409 (409) T protein:vir:10 342 NVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPL--EGGDVL------LING--NMI-PVKMAGEQYSKGGEK 409 (409) T ss_pred echhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCee------eecc--Ccc-chhhccccccccCCC Confidence 55566667899999988887 589999898888754210 000000 0000 000 000111111111111 No 178 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=97.61 E-value=3.7e-05 Score=44.85 Aligned_cols=367 Identities=12% Similarity=0.037 Sum_probs=163.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccccccc-ccccccccccccchHHHHHHHHHhhhhccCeeec--cC Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGA-VDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFK--HT 77 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~-~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~--~~ 77 (445) +..++.+. +....+-............ .....+.+-+.++-...+|+..++-+.+-|+.+- .+ T Consensus 3 ~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~g~~~~~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~ 68 (411) T protein:vir:81 3 WWSRLTRF--------------FRPRNETVDMTNPLLLQWLGVDPDTPRNQLSEATYFACLKILSESLGKLPLKMYQKTE 68 (411) T ss_pred hHHHHHhh--------------ccCcccccccchHHHHHHhcCcccChhhhhccHHHHHHHHHHHHhHhhCceeEEEecC Confidence 44333222 2221111000000000000 0000011112222234466666666666666551 11 Q ss_pred -------chHHHHHHHHHhcc---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEE Q lcl|NC_021326. 78 -------DDEVIKRIDEVLGN---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEA 146 (445) Q Consensus 78 -------d~~~~~~l~~~~~n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~ 146 (445) +......|.. --| ........+..+.+.+|.+|+++..+ .|++ .+..++|..+.++.++........ T Consensus 69 ~~~~~~~~~~l~~lL~~-~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~-~g~~~~l~~l~~~~v~~~~~~~~~~~~~~ 146 (411) T protein:vir:81 69 RGIVKSDREELYNLLKL-RPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS-GPQLQALWILPSQYVTIVVDDRGLLGEKN 146 (411) T ss_pred CceeeecccHHHHHHhh-ccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCceEEEEEECCceEEEEEcCcccccccc Confidence 1112222211 112 34566677888899999999998887 4665 477899999888776532110000 Q ss_pred EEEEEeeecceeEEEEecceEEEEEEe-cceeeecccccccccccccccccccccceEEec-----CCCCcCccHHHHHH Q lcl|NC_021326. 147 FIRMYKLENETKVEYWDKITVNYYVYE-NGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKT 220 (445) Q Consensus 147 ~v~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~ 220 (445) .+ +| .+... .+... ..+.. -|++++ +...|.|.+..+.. T Consensus 147 ~~-~~-----------------~~~~~~~g~~~---------------~~~~~--eiih~k~~~~~~~~~G~s~~~~~~~ 191 (411) T protein:vir:81 147 AI-WY-----------------RYNDPYDGKMY---------------VFRND--EILHFKTSVTFDGITGLSVRDVLKH 191 (411) T ss_pred eE-EE-----------------EEEecCCceEE---------------EEccc--cEEEEcCCCCCCCcccccHHHHHHH Confidence 00 00 00000 00000 00000 133333 22357787777777 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCeeEEecCC---cccchhHHHhhh--------hCceeeccCCCceeeEeccCChHHHHH Q lcl|NC_021326. 221 LIDAYNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLLR--------YYGAIKVSDNGGVDTIQVEVPVENSKK 289 (445) Q Consensus 221 lid~~~~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~~--------~~~~~~~~~~~~~~~l~~~~~~~~~~~ 289 (445) .++.......-..+.+...+.|-.++.... .+.....+..+. ..+++.++++.+.+.+........+.+ T Consensus 192 ~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e 271 (411) T protein:vir:81 192 TVDGALESQKFMNNLYKTGLTGKAVLEYTGDLNQEARDRLVKGFEQFANGSKNAGKIIPVPLGMKLVPLDIKLTDSQFFE 271 (411) T ss_pred HHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCCHHHHHHHH Confidence 776666665555666677777877665432 121122222221 123455566655555543333445556 Q ss_pred HHHHHHHHHHHHhCcccccccccc-CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-----C--CCcc Q lcl|NC_021326. 290 YLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI-----K--GEHK 361 (445) Q Consensus 290 ~i~~l~~~i~~~s~~p~~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~-----~--~~~~ 361 (445) ..+.....|+..-++|....+... ++-+. ++.. ....+...|.-++..+...+.. . .... T Consensus 272 ~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n--~e~~----------~~~f~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~ 339 (411) T protein:vir:81 272 LKKYTALQIAAAFGIKPNQINDYEKSSYAS--AEAQ----------NLAFYVDTLLYVLKQYEEEITYKILSNDLISQGH 339 (411) T ss_pred HHHHHHHHHHHHhCCCHHHhCCCCCCCchh--HHHH----------HHHHHHHHHHHHHHHHHHHHHhhcCChhhcCCCc Confidence 677778888888888865443222 21111 1111 1122233333333333332221 1 1123 Q ss_pred eEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCC Q lcl|NC_021326. 362 DVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERS 439 (445) Q Consensus 362 ~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~ 439 (445) .+++.+...+..|..+.++++.++ .|+++.-.++++++.-..+ ..+..-.. ........ .+++.. T Consensus 340 ~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~~--ggD~~~~~-----~n~~pl~~----~~~~~~-- 406 (411) T protein:vir:81 340 YFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPADD--YGNNLMAN-----GNYIPLSM----LGANYG-- 406 (411) T ss_pred EEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CCCeeeec-----cCccchhh----hhhhhc-- Confidence 345555666778899999988886 4899998998888652211 00000000 00000000 000000 Q ss_pred CCcCCC Q lcl|NC_021326. 440 NDKQSE 445 (445) Q Consensus 440 ~d~~~~ 445 (445) +.|| T Consensus 407 --kgGd 410 (411) T protein:vir:81 407 --KGGD 410 (411) T ss_pred --cCCC Confidence 1111 No 179 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=97.60 E-value=3.9e-05 Score=44.77 Aligned_cols=372 Identities=9% Similarity=0.001 Sum_probs=163.3 Q ss_pred ChHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccC-- Q lcl|NC_021326. 1 MIVRYIKQHLE-KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHT-- 77 (445) Q Consensus 1 ~l~~~i~~~~~-~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~-- 77 (445) +++++.++... .......+...+.+..+ ... +... -+..-+.+.-...+|+..++-+.+-|+.+--. T Consensus 3 ~f~~lf~r~~~~~~~~~~~~~~~~~~~~~-----~~~-g~~v----~~~~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~ 72 (414) T protein:vir:44 3 FFSGLFQRKSDAPVTTPAELADAIGLSYD-----TYT-GKQI----SSQRAMRLTAVFSCVRVLAESVGMLPCNLYHLNG 72 (414) T ss_pred hhhhhhccCccCcccchhhHhHhhccCcc-----ccC-Ccee----chhhhhccHHHHHHHHHHHHHhccCceEEEEecC Confidence 55544443221 11111222222221110 000 0000 00000122233446666666666666654211 Q ss_pred -------chHHHHHHHHHhcc---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEE Q lcl|NC_021326. 78 -------DDEVIKRIDEVLGN---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEA 146 (445) Q Consensus 78 -------d~~~~~~l~~~~~n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~ 146 (445) +......|.. .-| ........+..+.+.+|.+|+++..+ .|++ .+..++|..+.+.++.. +.+.+ T Consensus 73 ~~~~~~~~~~~~~lL~~-~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~--~~~~y 148 (414) T protein:vir:44 73 SLKQRATGERLHKLIST-HPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-FGEVAELLPVDPGCVVPKLNSS--WEPVY 148 (414) T ss_pred CceeecccchHHHHHHh-hcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCceEEEEECCC--CcEEE Confidence 1111122211 112 34556777888899999999998776 5776 47888999988776642 22221 Q ss_pred EEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCcCccHHHHHHHH Q lcl|NC_021326. 147 FIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLEISDIFMYKTLI 222 (445) Q Consensus 147 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g~s~~~~v~~li 222 (445) .+ ... ......+... -|++++ +...|.|.+..+...+ T Consensus 149 ~~---~~~-~g~~~~~~~~-----------------------------------evih~~~~~~d~~~G~s~i~~~~~~i 189 (414) T protein:vir:44 149 QV---TFP-DGSTDVLSQE-----------------------------------DIWHVRTLTLDGLVGLNPIAYAREAI 189 (414) T ss_pred EE---Eec-CceEEEEccc-----------------------------------cEEEecCCCCCCcccccHHHHHHHHH Confidence 11 000 0000000000 022222 2235788887777666 Q ss_pred HHHHHHHHHHHHHHHHhcCCeeEEecC---CcccchhHHHhhh--------hCceeeccCCCceeeEeccCChHHHHHHH Q lcl|NC_021326. 223 DAYNRRLSDLSNTFKDSNELTYVLTNY---DDQELPEFKRLLR--------YYGAIKVSDNGGVDTIQVEVPVENSKKYL 291 (445) Q Consensus 223 d~~~~~~s~~~~~~~~~~~~~l~~~g~---~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~l~~~~~~~~~~~~i 291 (445) +..........+.+...+.|-.++.-. +.+....++..+. ..+++.++++.+.+.+..+.....+.+.. T Consensus 190 ~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~ 269 (414) T protein:vir:44 190 SLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKKDFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETR 269 (414) T ss_pred HHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChHHHHHHHHH Confidence 666655555556666677776655432 2222222222221 12345556555555444333344555666 Q ss_pred HHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----CC--CcceEEE Q lcl|NC_021326. 292 DELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI----KG--EHKDVDI 365 (445) Q Consensus 292 ~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~----~~--~~~~i~v 365 (445) +.....|+..-++|....+..++ .+...++... ...+...|.-+++.+...+.. .. ....+++ T Consensus 270 ~~~~~~Ia~~fgVpp~~l~~~~~-~t~~n~e~~~----------~~~~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~i~f 338 (414) T protein:vir:44 270 KFQLEEICRLFRVPLHMVQNTDR-ATFNNIEELG----------LGFINYSLVPYLTRIEQRINTGLVRKSKQGVFYAKF 338 (414) T ss_pred HHHHHHHHHHhCCCHHHhCCCCC-CCcccHHHHH----------HHHHHHHHHHHHHHHHHHHHhhcCCccccCceEEEE Confidence 77777888888888644332211 1111111111 111222333333322221111 11 1223444 Q ss_pred EeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhc-cccCCCCC--CCCCCCCC Q lcl|NC_021326. 366 SFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPN-LDDGGADG--AQQKERSN 440 (445) Q Consensus 366 ~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~-~~~~~~~~--~~~~~~~~ 440 (445) .+...+..|..+.++.+.++ +|+++.-.++++++.-. .+..+.. ....... ......+. ..++..++ T Consensus 339 d~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p--~~ggD~~------~~~~n~~~~~~~~~~~~~~~~~~~~d 410 (414) T protein:vir:44 339 NAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNP--RPGGDVY------LTPMNMTTKPSDGSKAGKQKDNANAD 410 (414) T ss_pred echhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC--CCCccee------cccccccccCCccccCCCCCCCCCCC Confidence 44566667889999998887 48999999998875421 1100000 0000000 00000111 11111111 Q ss_pred CcCC Q lcl|NC_021326. 441 DKQS 444 (445) Q Consensus 441 d~~~ 444 (445) .+++ T Consensus 411 ~~~~ 414 (414) T protein:vir:44 411 ETTS 414 (414) T ss_pred CCCC Confidence 1111 No 180 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=97.59 E-value=4e-05 Score=44.67 Aligned_cols=377 Identities=10% Similarity=0.016 Sum_probs=159.5 Q ss_pred HHHhcCCCc------------------ccccc--c--cccccc-----------cccc---cccccccccchHHHHHHHH Q lcl|NC_021326. 20 QEYYEQRPD------------------IVKEP--K--PVDATG-----------AVDP---LKPDDRMITNFHANLVDQK 63 (445) Q Consensus 20 ~~yy~G~~~------------------i~~~~--~--~~~~~~-----------~~~~---~~~~~ri~~n~~~~iv~~~ 63 (445) .++|.-+-- ++.+. + ...... .... ..+..-+.++=.-.+|+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~acv~~I 80 (441) T protein:vir:98 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMI 80 (441) T ss_pred CceecCccceeccccccchhhhhhccccccccccccccCCCcchHHHHHHhhcccccCccccchhhhhccHHHHHHHHHH Confidence 122221100 00000 0 000000 0000 0000000011111245555 Q ss_pred HhhhhccCeeeccCch-HHHH-HHHHHhc--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcEE-EEEEccceeEEE Q lcl|NC_021326. 64 VSYIVGKPIAFKHTDD-EVIK-RIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFK-LFRVPAEQGIPI 135 (445) Q Consensus 64 ~~~l~g~~~~~~~~d~-~~~~-~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~-i~~~~p~~~~~v 135 (445) ++-+..-|+.+.-+.. .... ....+.. | .-......+..+.+.+|.+|+.+..+.+|++. +..++|..+.+. T Consensus 81 a~~iA~lpl~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~ 160 (441) T protein:vir:98 81 ASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELK 160 (441) T ss_pred HHhhccCceEEecCCcccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCceeEEE Confidence 5555555666532211 1111 1222221 3 23456667788889999999999999888864 788999998887 Q ss_pred EcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCc Q lcl|NC_021326. 136 WTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLE 211 (445) Q Consensus 136 ~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g 211 (445) .++ .+.+.+.+.............+... -|++++ +...| T Consensus 161 ~~~--~g~~~~~~~~~~~~~~~~~~~~~~~-----------------------------------dviHir~~~~dg~~G 203 (441) T protein:vir:98 161 LDA--RGRLYYFHQRIDSNGNNIERNVKFE-----------------------------------DMLDIKFYSLDGING 203 (441) T ss_pred ECC--CCcEEEEEEEeccCcceeeEEEccc-----------------------------------cEEEeccCCCCCccc Confidence 764 2333332211100000000000000 022222 12347 Q ss_pred CccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEe--cCCc-ccc-hhHHHhhh--------hCceeeccCCCceeeEe Q lcl|NC_021326. 212 ISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLT--NYDD-QEL-PEFKRLLR--------YYGAIKVSDNGGVDTIQ 279 (445) Q Consensus 212 ~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~--g~~~-~~~-~~~~~~~~--------~~~~~~~~~~~~~~~l~ 279 (445) .|.+..+...++.......-....+...+.|-.++. +... +.. +..+..+. ..+++.++++.+.+.++ T Consensus 204 ~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~ 283 (441) T protein:vir:98 204 LSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLE 283 (441) T ss_pred cCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEcc Confidence 777777666666655554445555666677766654 4322 111 11222221 12356666666666555 Q ss_pred ccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC Q lcl|NC_021326. 280 VEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGE 359 (445) Q Consensus 280 ~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~ 359 (445) .+.....+.+..+.....|+..-++|....+...++.|.+.....+ ...+.- +-..|++.+... +.-... T Consensus 284 ~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~~y---~~tl~P----~~~~ie~~ln~~---L~~~~~ 353 (441) T protein:vir:98 284 VDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDY---LSTLKP----YITCVCAELNFK---FNDEYV 353 (441) T ss_pred CChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHHHH---HHHHHH----HHHHHHHHHHhh---cccccc Confidence 4444455666667777788888888865543222222222111111 011111 111111111111 111111 Q ss_pred cceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCC Q lcl|NC_021326. 360 HKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPF--VEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQ 435 (445) Q Consensus 360 ~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~--~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 435 (445) ...+++.....+-.|..+.++.+.++ .|+++.-.++++++. +++.+..+-.+...- ..............+.. + T Consensus 354 ~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGd~~~~~~~~n~-~~~~~~~~~q~~~~~~~-~ 431 (441) T protein:vir:98 354 NREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNH-VNIELVDEYQMNKSRAT-D 431 (441) T ss_pred CceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccc-ccccccccccccccccc-c Confidence 22344444555667888899988876 589999999888744 222211110000000 00000000011111111 1 Q ss_pred CCCCCCcCCC Q lcl|NC_021326. 436 KERSNDKQSE 445 (445) Q Consensus 436 ~~~~~d~~~~ 445 (445) ....+++++| T Consensus 432 ~~~kgGe~ne 441 (441) T protein:vir:98 432 KKLKGGEENE 441 (441) T ss_pred cccCCCCCCC Confidence 1112222223 No 181 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=97.55 E-value=4.6e-05 Score=44.36 Aligned_cols=378 Identities=11% Similarity=0.024 Sum_probs=153.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcC-CCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQ-RPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G-~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) .+.+ +....+...- ...++... -.............+.... .+.-+..+-...+|+..++-+..-|+.+--..+ T Consensus 3 ~~~~--~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~--~~~a~~~~~v~~~i~~ia~~iA~lp~~~~~~~~ 77 (412) T protein:vir:26 3 VIAK--ENIVTRIKKK-LIDNWIDQSTSKLYDFSPWKNRSFWGVI--NNTLETNETIFSAITKLSNSMASLPLKMYEDYK 77 (412) T ss_pred cchh--hhhhhhhhhh-HhhhhhcccccccccccccCCccccccc--hhhhhccHHHHHHHHHHHHhHhhCceeEeeccc Confidence 2211 1111111100 00111100 0000000000000111000 011122233444566666666556765522222 Q ss_pred HHHHHHHHHh-c--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEEEEe Q lcl|NC_021326. 80 EVIKRIDEVL-G--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYK 152 (445) Q Consensus 80 ~~~~~l~~~~-~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~ 152 (445) .....+...+ . | .-..+...+..+.+.+|.+|+.+..+..|++ .+..++|..+.+..++.. +.+.+ ++. T Consensus 78 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~-~~~~y--~~~- 153 (412) T protein:vir:26 78 VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQS-RELYY--SIH- 153 (412) T ss_pred cccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCC-cEEEE--EEE- Confidence 2222222222 1 3 2345566788899999999999999999986 578889998887766531 21111 110 Q ss_pred eecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHHHHHHH Q lcl|NC_021326. 153 LENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNR 227 (445) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~lid~~~~ 227 (445) ...... ..+.. .. |+|+++ ...|.|.+.-+...++..+. T Consensus 154 ~~~g~~-~~~~~---------------------------------~e--vih~~~~~~~~~~~G~s~i~~~~~~i~~~~a 197 (412) T protein:vir:26 154 AATGNK-LIVHN---------------------------------MD--MLHFKHIVASNMVQGISPIDVLKNTTDFDNA 197 (412) T ss_pred cCCceE-EEEcc---------------------------------cc--EEEeCCCCCCCCcccccHHHHHHHHHHHHHH Confidence 000000 00000 00 233322 23477777655555554333 Q ss_pred HHHHHHHHHHHhc-CCeeEEe-cCCcc--cchhHHHhh----h-hCceeeccCCCceeeEeccCChHHHHHHHHHHHHHH Q lcl|NC_021326. 228 RLSDLSNTFKDSN-ELTYVLT-NYDDQ--ELPEFKRLL----R-YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKI 298 (445) Q Consensus 228 ~~s~~~~~~~~~~-~~~l~~~-g~~~~--~~~~~~~~~----~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i 298 (445) +. .. + +.... .+-.++. +.... ........+ . ..+++.++++.+.+.+........+.+..+.....| T Consensus 198 ~~-~~-~-~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~I 274 (412) T protein:vir:26 198 VR-TF-N-LTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERV 274 (412) T ss_pred HH-HH-H-HHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHH Confidence 21 11 1 22322 2333332 22221 111112111 1 233555555555554443333445566666677788 Q ss_pred HHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----cCC--CcceEEEEeCCCC Q lcl|NC_021326. 299 MLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD-----IKG--EHKDVDISFNYNK 371 (445) Q Consensus 299 ~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~-----~~~--~~~~i~v~f~~~~ 371 (445) +..-++|..-.+..+ ..+...++... ...+...|.-.+..+...+. ... ....+++.+...+ T Consensus 275 a~afgVPp~~lg~~~-~~~~sn~e~~~----------~~f~~~~l~P~~~~ie~~ln~kLl~~~~~~~~~~~~fd~~~l~ 343 (412) T protein:vir:26 275 ANVFQLPSVFLNARS-NTNFAKNEELN----------RFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYL 343 (412) T ss_pred HHHhCCCHHHhCCCC-CCCcccHHHHH----------HHHHHHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhh Confidence 888888865443221 11111111111 11122223333333322211 111 1223444455666 Q ss_pred CCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCC-CCCCCCCCCCCCcCC Q lcl|NC_021326. 372 VANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGG-ADGAQQKERSNDKQS 444 (445) Q Consensus 372 p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~-~~~~~~~~~~~d~~~ 444 (445) ..|..+.++++.++ +|+++.-.++++++.-+- +..++.-- ........... .......++++.++| T Consensus 344 ~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~--~ggD~~~~-----~~n~~~~~~~~~~~~~~~gG~~n~~e~ 412 (412) T protein:vir:26 344 RADSATQAEVYFKAVRSGYYTINDIREWEDLPPV--EGGDKPLI-----SGDLYPIDTPLELRKSLKGGDKNVNES 412 (412) T ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCeeee-----cccccccccchhhcccccCCCCCcCCC Confidence 78899999998886 489999999988754211 10110000 00000000000 000111222222222 No 182 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=97.51 E-value=5.3e-05 Score=44.04 Aligned_cols=404 Identities=11% Similarity=0.042 Sum_probs=172.3 Q ss_pred hHHHHHH--HHHHHHHHHH-----HHHHhcC-CC-ccccccccccccccccccccccccccchHHHHHHHHHhhhhccCe Q lcl|NC_021326. 2 IVRYIKQ--HLEKLPEISI-----GQEYYEQ-RP-DIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPI 72 (445) Q Consensus 2 l~~~i~~--~~~~~~~~~~-----~~~yy~G-~~-~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~ 72 (445) ..+++.- ++-+..+.-- ..+.|+- ++ +.+..+ ..-....... -......-.+++....+.+-+. T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~~~~~~~~~~~~e~~~~lr~~---~~~~ly~~m~----e~D~~i~s~l~~rk~av~~~~w 73 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSGVVDGWTVWDPFEQTPELQWP---QSVAVYSRMD----NEDSRVTSLLEAISLPIRSTPW 73 (469) T ss_pred CCCcccCCCCccchhhhhhcccccchhhccccccccccccc---cchHHHHHHH----hhChHHHHHHHHHHHHHhcCCc Confidence 0111100 0011111100 0011110 00 000000 0000000000 0134455566666677778777 Q ss_pred eecc--CchHHHHHHHHHhc------------------cCHHHHHHHHHHHHHhcCeEE-EEEEECC----CCcEEEEEE Q lcl|NC_021326. 73 AFKH--TDDEVIKRIDEVLG------------------NRFDDKLHSVLTGASNKGIEW-LHPYLDE----EGEFKLFRV 127 (445) Q Consensus 73 ~~~~--~d~~~~~~l~~~~~------------------n~~~~~~~~~~~~~~~~G~~~-~~v~~d~----~g~~~i~~~ 127 (445) ++.. ++++..+++.+.+. ..+...+.++...++.||.++ +++|... +|...+.-+ T Consensus 74 ~v~p~~~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG~~~~~~l 153 (469) T protein:vir:10 74 RIRANGASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDGRFWLRKL 153 (469) T ss_pred eEecCCCCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCCceeeeee Confidence 7754 33333343333221 134566677777788889975 5666532 355443332 Q ss_pred c---cceeE-EEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceE Q lcl|NC_021326. 128 P---AEQGI-PIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFI 203 (445) Q Consensus 128 ~---p~~~~-~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 203 (445) . +..+. -.|++ .+.+.. ++....... +... ......+ .....+...|-.+ T Consensus 154 ~~rp~~~i~~~~~~~--~~~l~~-~~~~~~~~~----------~~~~---------~~~~~~~----~~~lp~~k~i~~~ 207 (469) T protein:vir:10 154 APRPQWTISKFNVAP--DGGLES-IEQIAPPAR----------TRGS---------LYVANIA----PPEIPVNRLVVYT 207 (469) T ss_pred eecCcccceeeeecc--CCceee-eeecCcccc----------cccc---------cccCCCC----ccccccCcEEEEE Confidence 2 21110 01111 111111 110000000 0000 0000000 0000111211111 Q ss_pred Ee--cCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHH------Hhhh--hCceeeccCCC Q lcl|NC_021326. 204 PF--KNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFK------RLLR--YYGAIKVSDNG 273 (445) Q Consensus 204 ~~--~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~------~~~~--~~~~~~~~~~~ 273 (445) +- ..++.|.|.+..+-...---+..+.+++.-++.+..|+++.+-......++.. ..+. ...++.++++. T Consensus 208 ~~~~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~~a~~~ek~~l~~a~~~~~~g~~a~~iip~~~ 287 (469) T protein:vir:10 208 RNKRPGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASSATDEDEVRKMAALARSVRGGINAGVGLAQGQ 287 (469) T ss_pred ecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCc Confidence 11 13567889998766665555667888999999999999987654332222221 1222 23355679999 Q ss_pred ceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcch-HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH Q lcl|NC_021326. 274 GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPS-GVALEFLYTNLNLKADKLARKAKVAIQ-ELLWFVF 351 (445) Q Consensus 274 ~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~S-g~Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~~~~ 351 (445) ++++++...+...+..+++.+.+.|...--..-++.+..+|.-+ |..- ..-....+..-.+.+...+. ++++-++ T Consensus 288 ~ie~~ea~g~~~~~~~li~~~d~~Isk~iLG~tlTs~~~gGS~a~~~vh---~ev~~d~~~sDa~~i~~tln~~li~~l~ 364 (469) T protein:vir:10 288 ILELLGVSGNLPDIRRAIEGHDRSIALSGLAHFLNLDGKGGSYALASVL---EDPFTQAVHAYATSICRIANQHIIEDLV 364 (469) T ss_pred eEEEeecCCCchHHHHHHHHHHHHHHHHHhcccccccCccchhhHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999988887889999999988887654322223222122111 2221 22233344455566667774 4677666 Q ss_pred HHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHHh--cc-----CChHHHHHhCCCCCCHHHHHHHHHHHHHHHH-Hhhh Q lcl|NC_021326. 352 EHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSM--GI-----VSHETVLENHPFVEDLQAELERIEQEQMEYN-KQLP 423 (445) Q Consensus 352 ~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~~--g~-----~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~-~~~~ 423 (445) .+.. .....-..+.|.... .+....++.+.+++ |+ ++.+.+.+.++. +.++.+ +.+....+... ...+ T Consensus 365 ~lN~-g~~~~~P~~~~~~~e-~~~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gi-p~~~~~-~~~~~~~~~~~~~~~~ 440 (469) T protein:vir:10 365 DINF-GVDTPAPVLTFDPIG-SRQDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNL-PSELND-TPSAEPEEPAAVPNQS 440 (469) T ss_pred HhcC-CCCCCccEEEecCCC-CcHHHHHHHHHHHHhcCCccCccccHHHHHHHhCC-CCCCCC-cccccchhcccCCCCC Confidence 6542 222233466775433 44566677777764 65 455667777743 322221 11111111100 0000 Q ss_pred ccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 424 NLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 424 ~~~~~~~~~~~~~~~~~d~~~~ 445 (445) .........+..+..+.+...+ T Consensus 441 ~~~~~~~~~~~~~~~~~~~~~~ 462 (469) T protein:vir:10 441 AAPARTRSSGNADARARAPKAD 462 (469) T ss_pred ccccccCCCCCcccccccCCCh Confidence 0111111111111111111111 No 183 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=97.50 E-value=5.5e-05 Score=43.93 Aligned_cols=413 Identities=10% Similarity=0.000 Sum_probs=175.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C-----ee Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P-----IA 73 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~-----~~ 73 (445) .++++.++++ |.+-...+.+|++=- +|......... ......++..+-+...++++++.|++- | ++ T Consensus 4 ~~~~~~~~lk-r~~~e~~w~e~a~~t--lP~~~~~~~~~----~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 76 (510) T protein:vir:78 4 TAAMLWEKLR-DGSVEQRAIEFAKTT--LPYLMVDPMSG----SRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFR 76 (510) T ss_pred HHHHHHHHHh-ccchHHHHHHHHHhh--ccccccCCCCc----ccccccCcccchHHHHHHHHHHHHHHhhcCCCCcccc Confidence 4444444442 222233344444221 11111110000 011112344556677777777766542 2 23 Q ss_pred eccCch-------------HHHHHHH--------HHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEcccee Q lcl|NC_021326. 74 FKHTDD-------------EVIKRID--------EVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQG 132 (445) Q Consensus 74 ~~~~d~-------------~~~~~l~--------~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~ 132 (445) +...+. ++.+.+. .+..+||...+.++.++..++|.+.+++ ++++. +++.++-.+ T Consensus 77 l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~--~~~~~-~~~~~pl~~- 152 (510) T protein:vir:78 77 SELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYR--NSDEA-TVVAWSLRS- 152 (510) T ss_pred cCCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEE--eCCCC-eEEEEEcce- Confidence 333221 1222221 2224688899999999999999986654 44443 456665555 Q ss_pred EEEEcCCCCCceEEEEEEEeee--------------------cceeEEEEecceEEEEEEecc--eeeeccccccccccc Q lcl|NC_021326. 133 IPIWTDKEHEELEAFIRMYKLE--------------------NETKVEYWDKITVNYYVYENG--SLIPDYSNNLENSKT 190 (445) Q Consensus 133 ~~v~d~~~~~~~~~~v~~~~~~--------------------~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~ 190 (445) |.+..|. .+++...+|.++.. ....+++++... ..... ..........+.... T Consensus 153 y~v~~d~-~G~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~V~----~~~~~~~~~~sv~~e~dg~~i~ 227 (510) T protein:vir:78 153 YAVRRDA-TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQ----RRKGTAMDYAEMYHEIDGVRVG 227 (510) T ss_pred eEEeeCC-CcCeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEEEEE----eecCCCCcEEEEEEEecCeeec Confidence 4444443 56666665554421 111222222111 00000 000000000011111 Q ss_pred ccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhh-hhC Q lcl|NC_021326. 191 HFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLL-RYY 264 (445) Q Consensus 191 ~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~-~~~ 264 (445) .....++..+|++.+. .+.+|+|-.....+-+..+|...-...........|.+.+.- +.- -...... ... T Consensus 228 ~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p-~g~--~~~~~l~~~~~ 304 (510) T protein:vir:78 228 ETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE-AKG--AVVDDYQDAEM 304 (510) T ss_pred cccccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCC-ccc--cchhhhccCCC Confidence 1223334556776553 345799999999999999998876666666666666544321 110 0111111 122 Q ss_pred ceeeccCCCceeeEecc--CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHH-----HHH Q lcl|NC_021326. 265 GAIKVSDNGGVDTIQVE--VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADK-----LAR 337 (445) Q Consensus 265 ~~~~~~~~~~~~~l~~~--~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~-----~~~ 337 (445) +.+..+...+++.+... .+.......++.++..|...-.+ ++. ...+...||+.+......+.....- ... T Consensus 305 g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF~~-~l~-~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E 382 (510) T protein:vir:78 305 GDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GAN-QRDAERVTAEEVRITAEEAENTLGGTYSLLAEN 382 (510) T ss_pred ceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHHhh-ccc-cCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHH Confidence 33433344456665432 45666777788877777654322 221 2223446777776543333333221 122 Q ss_pred HHHHHHHHHHHHHHHHhccCC----CcceEEEEeCCCCCCCHHHHHHHH---HH-Hh--c----c---CChHHHHH---- Q lcl|NC_021326. 338 KAKVAIQELLWFVFEHFDIKG----EHKDVDISFNYNKVANTELQVQTA---QQ-SM--G----I---VSHETVLE---- 396 (445) Q Consensus 338 ~~~~~l~~~~~~~~~~~~~~~----~~~~i~v~f~~~~p~d~~~~~~~~---~~-~~--g----~---~s~et~l~---- 396 (445) .+.+-+.+++.++.+. +... ......|++-..+-+ .+.++.+ .+ +. + + +....++. T Consensus 383 ~l~Pli~r~~~il~r~-gl~p~p~~~~~~~~v~~is~Lar--aq~~~~l~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~ 459 (510) T protein:vir:78 383 LQSPLAYVCLSEVDDA-LLQGLITKQHKPAIETGLPALSR--SAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWA 459 (510) T ss_pred HHHHHHHHHHHHHHhc-cCCCCCcccccceeeecccHHHH--HHHHHHHHHHHHHHHHhcChhhhhhcCCHHHHHHHHHH Confidence 2333444444444331 2111 112223333322222 2222211 11 11 1 1 22233332 Q ss_pred hCCCCC-----CHHHHHHHHHHHHHHHHHhhh----c-cccCCCCCCCCCCC Q lcl|NC_021326. 397 NHPFVE-----DLQAELERIEQEQMEYNKQLP----N-LDDGGADGAQQKER 438 (445) Q Consensus 397 ~l~~~~-----d~~~E~~ri~~E~~~~~~~~~----~-~~~~~~~~~~~~~~ 438 (445) .++ |+ -.++|++.+.+++++.+.+.. . ....+.-++...+= T Consensus 460 ~~G-v~p~~ivrs~eev~a~~~~~~~q~~~~~~~~~a~~~~~~~~~~~~~g~ 510 (510) T protein:vir:78 460 AFS-VDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) T ss_pred HhC-CChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccCCCC Confidence 232 31 135666666655432221111 1 11111112222211 No 184 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=97.48 E-value=5.9e-05 Score=43.75 Aligned_cols=381 Identities=9% Similarity=-0.040 Sum_probs=183.5 Q ss_pred ChHHH----------HHHHHHH---HHHHHH-HHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhh Q lcl|NC_021326. 1 MIVRY----------IKQHLEK---LPEISI-GQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY 66 (445) Q Consensus 1 ~l~~~----------i~~~~~~---~~~~~~-~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~ 66 (445) +.+.- ...|-.+ -.++.. +..--.|+.. .+. ....+-. -......-.+.+.... T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~-----~~~--~L~~dm~-----~~D~hi~s~l~~Rk~a 84 (512) T protein:vir:19 17 EMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDLT-----AQA--DLAFDME-----EKDTHLFSELSKRRLA 84 (512) T ss_pred ccccccchhcccchhhccccccCCCHHHHHHHHHHhhCCCHH-----HHH--HHHHHHH-----hhChHHHHHHHHHHHH Confidence 11000 0000000 001101 0001111100 000 0000000 0134455566677777 Q ss_pred hhccCeeeccC---c---hHHHHHHHHHhcc--CHHHHHHHHHHHHHhcCeEE-EEEEECCCCcEE---EEEEccceeEE Q lcl|NC_021326. 67 IVGKPIAFKHT---D---DEVIKRIDEVLGN--RFDDKLHSVLTGASNKGIEW-LHPYLDEEGEFK---LFRVPAEQGIP 134 (445) Q Consensus 67 l~g~~~~~~~~---d---~~~~~~l~~~~~n--~~~~~~~~~~~~~~~~G~~~-~~v~~d~~g~~~---i~~~~p~~~~~ 134 (445) +.+.+.++... + ....+++++++.+ ++...+..+ .++..+|.++ .++|.-.+|... +.+++|..+ T Consensus 85 v~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~~~~l-ldA~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~~~f-- 161 (512) T protein:vir:19 85 IQALEWRIAPARDASAQEKKDADMLNEYLHDAAWFEDALFDA-GDAILKGYSMQEIEWGWLGKMRVPVALHHRDPALF-- 161 (512) T ss_pred HhCCCceEecCCCCCHHHHHHHHHHHHHHhcCCCHHHHHHHH-HhhhhhcceeeeeEeeeeCCceeeeeeeeeccccc-- Confidence 78888888642 1 2455677888764 465555544 4688889875 555653344432 445555433 Q ss_pred EEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEe--cCCCCcC Q lcl|NC_021326. 135 IWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPF--KNNDLEI 212 (445) Q Consensus 135 v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~--~n~~~g~ 212 (445) .|+......+ ++. .. ... . ....+++.|-.++- ..++.|. T Consensus 162 ~~~~~~~~~l----r~~----------------------~~--------~~~-G---~~l~~~k~i~~~~~~~~g~p~g~ 203 (512) T protein:vir:19 162 CANPDNLNEL----RLR----------------------DA--------SYH-G---LELQPFGWFMHRAKSRTGYVGTN 203 (512) T ss_pred eeccCCCcEE----Eec----------------------CC--------CCC-c---eeecCCceEEEeccCCCCCcccc Confidence 2222111111 000 00 000 0 00112332222221 2356788 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhH------HHhhhhCceeeccCCCceeeEecc-CChH Q lcl|NC_021326. 213 SDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEF------KRLLRYYGAIKVSDNGGVDTIQVE-VPVE 285 (445) Q Consensus 213 s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~l~~~-~~~~ 285 (445) |.+..+....-.-+..+.+++.-++.++.|+++.+=......++. ...+....+..++.+..+++++.. .+.. T Consensus 204 gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~L~~al~~~~~~a~~iiP~~~~ie~~ea~~~~~~ 283 (512) T protein:vir:19 204 GLVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGKYPTGSTNREKATLMQAVMDIGRRAGGIIPMGMTLDFQSAADGQSD 283 (512) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEecCCCCCHHHHHHHHHHHHHHhhCcEEEecCCceEEEeecCCCCHH Confidence 999887666666677888999999999999988663222222221 234456667788999999999864 4556 Q ss_pred HHHHHHHHHHHHHHHHhCcccccccc--ccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhccCCCc-- Q lcl|NC_021326. 286 NSKKYLDELYQKIMLFGQAVDFSSDK--FGSAPSGVALEFLYTNLNLKADKLARKAKVAIQ-ELLWFVFEHFDIKGEH-- 360 (445) Q Consensus 286 ~~~~~i~~l~~~i~~~s~~p~~~~~~--~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~~~~~~~~~~~~~-- 360 (445) .++.+++.+.+.|...--.-.++.+. .|+...| +....-....+..-.+.+...+. ++++-++.+....... T Consensus 284 ~y~~li~~~d~~Isk~iLGqtlTs~~g~~Gs~a~~---~vh~ev~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~ 360 (512) T protein:vir:19 284 PFMAMIGWAEKAISKAILGGTLTTEAGDKGARSLG---EVHDEVRREIRNADVGQLARSINRDLIYPLLALNSDSTIDIN 360 (512) T ss_pred HHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCcc Confidence 68899999888877653111112221 1212212 11222344445556677777774 5778777665443322 Q ss_pred ceEEEEeCCCCCCCHHHHHHHHHHHh-cc-CChHHHHHhCCCCCCHHHH--HHHHHHHHHHHHHhhhcccc-CCCCCCCC Q lcl|NC_021326. 361 KDVDISFNYNKVANTELQVQTAQQSM-GI-VSHETVLENHPFVEDLQAE--LERIEQEQMEYNKQLPNLDD-GGADGAQQ 435 (445) Q Consensus 361 ~~i~v~f~~~~p~d~~~~~~~~~~~~-g~-~s~et~l~~l~~~~d~~~E--~~ri~~E~~~~~~~~~~~~~-~~~~~~~~ 435 (445) .-..+.|...-+.|....++.+.+++ |+ +|.+.+.+.++. +.++.+ ..... ...+.... .......+ T Consensus 361 ~~p~~~f~~~e~eDl~~~a~~~~~l~~G~~i~~~~i~e~~Gi-p~~~~~e~~~~~~-------~~~~~~~~~~~~~~~~~ 432 (512) T protein:vir:19 361 RLPGIVFDTSEAGDITALSDAIPKLAAGMRIPVSWIQEKLHI-PQPVGDEAVFTIQ-------PVVPDNGSQKEAALSAE 432 (512) T ss_pred ccceEEecCCChhhHHHHHHHHHHHhcCCCCCHHHHHHHhCC-CCCCCccccccCC-------Ccccccccccccccccc Confidence 23567888888999999998887764 76 888888888854 322211 11000 00000000 00000000 Q ss_pred CC---CCCCcCCC Q lcl|NC_021326. 436 KE---RSNDKQSE 445 (445) Q Consensus 436 ~~---~~~d~~~~ 445 (445) .. ..-|...+ T Consensus 433 ~~~~~~~~d~~~~ 445 (512) T protein:vir:19 433 DIPQEDDIDRMGV 445 (512) T ss_pred CCCchhhHhHHhh Confidence 00 00011111 No 185 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=97.44 E-value=6.6e-05 Score=43.51 Aligned_cols=354 Identities=7% Similarity=-0.025 Sum_probs=149.5 Q ss_pred ChHHHH-HHHHHHH--HH-HHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeecc Q lcl|NC_021326. 1 MIVRYI-KQHLEKL--PE-ISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH 76 (445) Q Consensus 1 ~l~~~i-~~~~~~~--~~-~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~ 76 (445) ++.+.. .+...+. .. -..+....-|.-. ... .-...-+.++-...+|+..++-+..-|+++. T Consensus 3 ~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~---------~~~----v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~- 68 (383) T protein:vir:10 3 LLTPKNFSKRNAKNMVYPSNPAFFTTTVGGMQ---------LSY----VSALSALQNTNVYSVINRIASDVSSAHFKTE- 68 (383) T ss_pred cccccccccccccccccccchhhhhhhccCcc---------ccc----cchhHhhcchHHHHHHHHHHHhhccCceeec- Confidence 332211 0000000 00 0000011100000 000 0000011122233455666555555566553 Q ss_pred CchHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeec Q lcl|NC_021326. 77 TDDEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN 155 (445) Q Consensus 77 ~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~ 155 (445) +......+..-.. .........+..+.+.+|.+|+.+..+. ..+..++|.++.+..+.. .+.. .+.... T Consensus 69 -~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~~---~~~~p~~~~~v~~~~~~~---~~~~---~~~~~~ 138 (383) T protein:vir:10 69 -NTATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN---LEHIPNSDVQINYLPGNM---GIVY---TVLESN 138 (383) T ss_pred -ccchhhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc---eeEeecCcceEEEEEcCC---ceEE---EEEEcC Confidence 2222222321111 1345566778888899999999876542 222233333333222211 1110 000000 Q ss_pred ceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-------CCCcCccHHHHHHHHHHHHHH Q lcl|NC_021326. 156 ETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-------NDLEISDIFMYKTLIDAYNRR 228 (445) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-------~~~g~s~~~~v~~lid~~~~~ 228 (445) ......+ ..=-|+|+++ ...|.|.+......++....+ T Consensus 139 ~~~~~~~-----------------------------------~~~evih~r~~~~~~~~~~~G~s~l~~~~~~i~~~~~~ 183 (383) T protein:vir:10 139 DRPKMVL-----------------------------------RQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKA 183 (383) T ss_pred CceEEEE-----------------------------------cccceEEeccCCCCcccccccccHHHHHHHHHHHHHHH Confidence 0000000 0001344432 124788888777777777666 Q ss_pred HHHHHHHHHHhcCCeeEEe--cCCc--ccchhHHHhhh-------hCceeeccCCCceeeEeccCChHH-HHHHHHHHHH Q lcl|NC_021326. 229 LSDLSNTFKDSNELTYVLT--NYDD--QELPEFKRLLR-------YYGAIKVSDNGGVDTIQVEVPVEN-SKKYLDELYQ 296 (445) Q Consensus 229 ~s~~~~~~~~~~~~~l~~~--g~~~--~~~~~~~~~~~-------~~~~~~~~~~~~~~~l~~~~~~~~-~~~~i~~l~~ 296 (445) ..-....+...+.|-.++. |... +........+. ..+++.++++.+++.+..+..... +.+..+...+ T Consensus 184 ~~~~~~~f~ng~~~~~il~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~ 263 (383) T protein:vir:10 184 SKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSAD 263 (383) T ss_pred HHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHH Confidence 6666666677777755544 3221 11112222221 223455565555555544333333 3456677788 Q ss_pred HHHHHhCcccccccc-ccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCH Q lcl|NC_021326. 297 KIMLFGQAVDFSSDK-FGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANT 375 (445) Q Consensus 297 ~i~~~s~~p~~~~~~-~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~ 375 (445) .|+..-++|....+. ..++.++..++.. ...|...|+-+++.+...+..+--...+++.+...+..|. T Consensus 264 ~Ia~afgVPp~~lg~~~~~~~~~sn~eq~-----------~~~~~~~l~P~~~~ie~~l~~~l~~~~~~f~~~~l~~~d~ 332 (383) T protein:vir:10 264 QISKAFGVPSDILGGGTSTESQHSNIDQI-----------KATYLANLNSYVNPIVDELRLKMNAPDLELDIKDMLDVDD 332 (383) T ss_pred HHHHHhCCCHHHcCCccCCCCccccHHHH-----------HHHHHHHHHHHHHHHHHHHHHhhCCceEEeechhhhccCH Confidence 888888888644332 1112222112211 0111122222323222222111112356777778888999 Q ss_pred HHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcC Q lcl|NC_021326. 376 ELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQ 443 (445) Q Consensus 376 ~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 443 (445) .+.++++.++ .|+++...++++++.-.-+..++ ........+.+.+|+| T Consensus 333 ~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~-------------------~~~~~~~~~~~gGd~e 383 (383) T protein:vir:10 333 SILINQVSNLAKSGVLGAEQAQFILTRSGFLPDNL-------------------PEFKPLTNETKGGDDK 383 (383) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCcccCCcc-------------------cccCCCcccCCCCCCC Confidence 9999998887 48999999988875421000000 0000001111122222 No 186 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=97.44 E-value=6.8e-05 Score=43.44 Aligned_cols=367 Identities=12% Similarity=0.043 Sum_probs=157.3 Q ss_pred ChHHHHHHHHHHH--HHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCc Q lcl|NC_021326. 1 MIVRYIKQHLEKL--PEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTD 78 (445) Q Consensus 1 ~l~~~i~~~~~~~--~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d 78 (445) |+-.+.+...... +.......++-...+..-.+......... .....-+.++-...+|+..++-+..-|+.+.-.+ T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~ 78 (392) T protein:vir:10 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEW--VSARAALRNSDLFSIILQLSSDLAIVKINAEKKK 78 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcCCCCce--echHHhhccHHHHHHHHHHHHhhccCceeeccch Confidence 3333333222110 00000001110000000000000000000 0000001122234466666666655566543222 Q ss_pred hHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEEEEeeecc Q lcl|NC_021326. 79 DEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE 156 (445) Q Consensus 79 ~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~ 156 (445) . ...+..=.. .........+..+.+.+|.+|+.+..+..|++ .+..++|..+.+..+.. .+.+.+ ++...... T Consensus 79 ~--~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~-~~~~~y--~~~~~~~~ 153 (392) T protein:vir:10 79 N--QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEY-ENGMYY--NITFDDPK 153 (392) T ss_pred h--hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCC-CceEEE--EEEecCcc Confidence 1 112211111 12355667778899999999999999998986 57888898887765532 121111 11000000 Q ss_pred e-eEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHHHHHHHHHH Q lcl|NC_021326. 157 T-KVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLS 230 (445) Q Consensus 157 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~lid~~~~~~s 230 (445) . ....+. ... |+|++. ...|.|.+..+...++....+.. T Consensus 154 ~~~~~~~~---------------------------------~~e--iih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~ 198 (392) T protein:vir:10 154 IEPILQAP---------------------------------QSD--LIHMKLLSIDGGKTGISPLYSLRRESKIQRASDR 198 (392) T ss_pred cceeEEEc---------------------------------ccc--EEEecCCCCCCccccccHHHHHHHHHHHHHHHHH Confidence 0 000000 001 333322 13578888777777766655555 Q ss_pred HHHHHHHHhcCCeeEEe--cCCcccchh---HHHhh----hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 231 DLSNTFKDSNELTYVLT--NYDDQELPE---FKRLL----RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLF 301 (445) Q Consensus 231 ~~~~~~~~~~~~~l~~~--g~~~~~~~~---~~~~~----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~ 301 (445) -....+...+.|-.++. +........ +.... ...+++.++++.+.+.+........+.+..+..++.|+.. T Consensus 199 ~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~ 278 (392) T protein:vir:10 199 LTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKV 278 (392) T ss_pred HHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHH Confidence 55555666777755542 322111111 11111 2234556666655555544444556667777788888888 Q ss_pred hCccccccccccCcchH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHH Q lcl|NC_021326. 302 GQAVDFSSDKFGSAPSG-VALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQ 380 (445) Q Consensus 302 s~~p~~~~~~~~~~~Sg-~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~ 380 (445) -++|....+..+.+.|. .+.+ ..+...|.-+++.+...+...- ...+++......-.|..+.+. T Consensus 279 fgVpp~~lg~~~~~~~~~~~~~--------------~f~~~~l~P~~~~ie~~l~~~L-~~~~~~d~~~~~~~d~~~~~~ 343 (392) T protein:vir:10 279 YGLPDSYIGGQGDQQSSIQQIS--------------GMYASALNRYLRPAISELEYKL-SDHISVNMRPAIDPLGDNYLS 343 (392) T ss_pred hCCCHHHhCCCCCcccHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhc-cccccccchhhhccCHHHHHH Confidence 88886544432222222 1111 1222223333332222211110 011223333333456677777 Q ss_pred HHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCC Q lcl|NC_021326. 381 TAQQS--MGIVSHETVLENH---PFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSN 440 (445) Q Consensus 381 ~~~~~--~g~~s~et~l~~l---~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (445) .+.++ .|+++...+.+++ ++..+ |+.+ ...++...++ ++.+..+ T Consensus 344 ~~~~l~~~g~~t~nE~r~~l~~~g~~p~---e~r~--------~e~l~~~~~G-----d~~~p~p 392 (392) T protein:vir:10 344 TISTATRWGALAENQATFVLQEAGYIPK---DLPA--------PENTNKKTTG-----QSNEPVP 392 (392) T ss_pred HHHHHHhCCCcCHHHHHHHHHhcCCCcc---ccch--------hcCCCCCCCC-----CCCCCCC Confidence 77776 5889988877654 55432 2111 0112222111 1111112 No 187 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=97.44 E-value=6.8e-05 Score=43.44 Aligned_cols=367 Identities=12% Similarity=0.043 Sum_probs=157.3 Q ss_pred ChHHHHHHHHHHH--HHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCc Q lcl|NC_021326. 1 MIVRYIKQHLEKL--PEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTD 78 (445) Q Consensus 1 ~l~~~i~~~~~~~--~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d 78 (445) |+-.+.+...... +.......++-...+..-.+......... .....-+.++-...+|+..++-+..-|+.+.-.+ T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~ 78 (392) T protein:vir:39 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEW--VSARAALRNSDLFSIILQLSSDLAIVKINAEKKK 78 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcCCCCce--echHHhhccHHHHHHHHHHHHhhccCceeeccch Confidence 3333333222110 00000001110000000000000000000 0000001122234466666666655566543222 Q ss_pred hHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEEEEeeecc Q lcl|NC_021326. 79 DEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE 156 (445) Q Consensus 79 ~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~ 156 (445) . ...+..=.. .........+..+.+.+|.+|+.+..+..|++ .+..++|..+.+..+.. .+.+.+ ++...... T Consensus 79 ~--~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~-~~~~~y--~~~~~~~~ 153 (392) T protein:vir:39 79 N--QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEY-ENGMYY--NITFDDPK 153 (392) T ss_pred h--hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCC-CceEEE--EEEecCcc Confidence 1 112211111 12355667778899999999999999998986 57888898887765532 121111 11000000 Q ss_pred e-eEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHHHHHHHHHH Q lcl|NC_021326. 157 T-KVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLS 230 (445) Q Consensus 157 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~lid~~~~~~s 230 (445) . ....+. ... |+|++. ...|.|.+..+...++....+.. T Consensus 154 ~~~~~~~~---------------------------------~~e--iih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~ 198 (392) T protein:vir:39 154 IEPILQAP---------------------------------QSD--LIHMKLLSIDGGKTGISPLYSLRRESKIQRASDR 198 (392) T ss_pred cceeEEEc---------------------------------ccc--EEEecCCCCCCccccccHHHHHHHHHHHHHHHHH Confidence 0 000000 001 333322 13578888777777766655555 Q ss_pred HHHHHHHHhcCCeeEEe--cCCcccchh---HHHhh----hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 231 DLSNTFKDSNELTYVLT--NYDDQELPE---FKRLL----RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLF 301 (445) Q Consensus 231 ~~~~~~~~~~~~~l~~~--g~~~~~~~~---~~~~~----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~ 301 (445) -....+...+.|-.++. +........ +.... ...+++.++++.+.+.+........+.+..+..++.|+.. T Consensus 199 ~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~ 278 (392) T protein:vir:39 199 LTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKV 278 (392) T ss_pred HHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHH Confidence 55555666777755542 322111111 11111 2234556666655555544444556667777788888888 Q ss_pred hCccccccccccCcchH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHH Q lcl|NC_021326. 302 GQAVDFSSDKFGSAPSG-VALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQ 380 (445) Q Consensus 302 s~~p~~~~~~~~~~~Sg-~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~ 380 (445) -++|....+..+.+.|. .+.+ ..+...|.-+++.+...+...- ...+++......-.|..+.+. T Consensus 279 fgVpp~~lg~~~~~~~~~~~~~--------------~f~~~~l~P~~~~ie~~l~~~L-~~~~~~d~~~~~~~d~~~~~~ 343 (392) T protein:vir:39 279 YGLPDSYIGGQGDQQSSIQQIS--------------GMYASALNRYLRPAISELEYKL-SDHISVNMRPAIDPLGDNYLS 343 (392) T ss_pred hCCCHHHhCCCCCcccHHHHHH--------------HHHHHHHHHHHHHHHHHHHHhc-cccccccchhhhccCHHHHHH Confidence 88886544432222222 1111 1222223333332222211110 011223333333456677777 Q ss_pred HHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCC Q lcl|NC_021326. 381 TAQQS--MGIVSHETVLENH---PFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSN 440 (445) Q Consensus 381 ~~~~~--~g~~s~et~l~~l---~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (445) .+.++ .|+++...+.+++ ++..+ |+.+ ...++...++ ++.+..+ T Consensus 344 ~~~~l~~~g~~t~nE~r~~l~~~g~~p~---e~r~--------~e~l~~~~~G-----d~~~p~p 392 (392) T protein:vir:39 344 TISTATRWGALAENQATFVLQEAGYIPK---DLPA--------PENTNKKTTG-----QSNEPVP 392 (392) T ss_pred HHHHHHhCCCcCHHHHHHHHHhcCCCcc---ccch--------hcCCCCCCCC-----CCCCCCC Confidence 77776 5889988877654 55432 2111 0112222111 1111112 No 188 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=97.39 E-value=7.7e-05 Score=43.14 Aligned_cols=388 Identities=10% Similarity=0.033 Sum_probs=159.7 Q ss_pred ChHH---HHHHH--HHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeee- Q lcl|NC_021326. 1 MIVR---YIKQH--LEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF- 74 (445) Q Consensus 1 ~l~~---~i~~~--~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~- 74 (445) ||.- .+..- .++.+.+ ...|.+.... ..+.... .......-..++....+|+..++-+-.-|+.+ T Consensus 1 ~~~~~~~~~~~p~~~e~~~~~---~~~~~~~~~~-~~~~~~~-----~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~ 71 (518) T protein:vir:10 1 MLLANGQTLSAPAMAELSPQM---QDSYYYAPAV-GMQLERQ-----FSLYGGIYKNQPWVRTVIAKRAQALARLPVKCM 71 (518) T ss_pred CcccCceeecCchhhhhhhhh---hccccccccc-ceecccc-----cchhhHHHhhhHHHHHHHHHHHHhhccCceEEE Confidence 1100 00000 0011111 1111111000 0000000 00000000112333556666666665556554 Q ss_pred --ccCc--hHHHHHHHHHhc--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCce Q lcl|NC_021326. 75 --KHTD--DEVIKRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEEL 144 (445) Q Consensus 75 --~~~d--~~~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~ 144 (445) +.+. +.....+..++. | .-..+...+..+.+.+|.+|+.+..+.+|++ .+..++|..+.+..+.. .+.+ T Consensus 72 ~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v~~~~~-~~~~ 150 (518) T protein:vir:10 72 FTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSR-TGRY 150 (518) T ss_pred EEcCCCceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEEEEcCC-CCEE Confidence 1111 111122233332 3 2345666778889999999999999999986 47889999888776642 2222 Q ss_pred EEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC----C-CCcCccHHHHH Q lcl|NC_021326. 145 EAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN----N-DLEISDIFMYK 219 (445) Q Consensus 145 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n----~-~~g~s~~~~v~ 219 (445) .+ ++. ....... ....+ . -.. |+|+++ . ..|.|.+.... T Consensus 151 ~y--~~~-~~~~~~~------~~~~~--~-----------------------~~e--ViHir~~s~dg~~~G~spi~~a~ 194 (518) T protein:vir:10 151 EY--YFQ-AGAGVGT------QLVSF--A-----------------------DDE--VVPIRFFNPDGLERGLSLMESLK 194 (518) T ss_pred EE--EEE-ecCCccc------eEEEe--c-----------------------CCc--EEEecCCCCCcccccccHHHHHH Confidence 21 111 1100000 00000 0 000 233321 1 24777777666 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCeeEEecCC---cccchhHHHhhh--------hCceeeccCCCceeeEeccCChHHHH Q lcl|NC_021326. 220 TLIDAYNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLLR--------YYGAIKVSDNGGVDTIQVEVPVENSK 288 (445) Q Consensus 220 ~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~~--------~~~~~~~~~~~~~~~l~~~~~~~~~~ 288 (445) ..+.....+.......+...+.|-.++.... .+....++..+. .++++.++++.+.+.++.......+. T Consensus 195 ~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~k~~~~~~~~G~~nag~v~vL~~G~~~~~l~~s~~D~q~l 274 (518) T protein:vir:10 195 STIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFI 274 (518) T ss_pred HHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhcCccccCcceEcCCCceEEEccCChhHHHHH Confidence 6666555555555555666777766654322 222222222221 12355666666655555444444566 Q ss_pred HHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hccCCCcceE Q lcl|NC_021326. 289 KYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEH-----FDIKGEHKDV 363 (445) Q Consensus 289 ~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~-----~~~~~~~~~i 363 (445) +..+.....|...-++|....+...+ .+...++...... +..+|.-++..+... +........+ T Consensus 275 e~r~~~~~eIa~afgVPp~~lg~~~~-~t~sn~eq~~~~f----------~~~tL~P~l~~ie~~ln~~L~~~~~~~~~~ 343 (518) T protein:vir:10 275 EARQLNREEVCGVYDIAPPIVHILDR-ATFSNISAQMRAF----------YRDTMAIPIARIQSAMDKYVGQYWVRKNRM 343 (518) T ss_pred HHHHHHHHHHHHHhCCCHHHhccCCC-CCchhHHHHHHHH----------HHHHHHHHHHHHHHHHHHhhcccccCCceE Confidence 66677777888888888654432211 1111122111111 111222222222111 1111112234 Q ss_pred EEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHH-H------HHHHHHHHHHH--HHhhhccccCCC Q lcl|NC_021326. 364 DISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPF--VEDLQA-E------LERIEQEQMEY--NKQLPNLDDGGA 430 (445) Q Consensus 364 ~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~--~~d~~~-E------~~ri~~E~~~~--~~~~~~~~~~~~ 430 (445) ++.....+..|..+.++++.++ +|+++.-.++++++. ++++.. + +..+..-.... .+..+...+... T Consensus 344 ~fd~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~~~gD~~~~~~n~~pl~~~~~~~~~g~~~~~~~~~~~ 423 (518) T protein:vir:10 344 KFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPAS 423 (518) T ss_pred EEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeecccceecccccccccCCCCCCCCCCCCc Confidence 5555666778999999988876 489999889888754 222111 0 11111000000 000000000000 Q ss_pred ----CCCCC-CCC----CCCcC---CC Q lcl|NC_021326. 431 ----DGAQQ-KER----SNDKQ---SE 445 (445) Q Consensus 431 ----~~~~~-~~~----~~d~~---~~ 445 (445) ...++ ++. +.+.+ +| T Consensus 424 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (518) T protein:vir:10 424 TPVASLDQSPPTSVPGLSPTNSDRSTD 450 (518) T ss_pred cccccccccccccCCCCCccccccccc Confidence 00000 000 00000 00 No 189 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=97.32 E-value=9.5e-05 Score=42.64 Aligned_cols=409 Identities=11% Similarity=0.073 Sum_probs=180.3 Q ss_pred ChHHHHHHHHHHH-H---HHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C--- Q lcl|NC_021326. 1 MIVRYIKQHLEKL-P---EISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P--- 71 (445) Q Consensus 1 ~l~~~i~~~~~~~-~---~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~--- 71 (445) -|.+..+.++.+. + +++.+.+|..- ..-. ... . .+...++..+-+...++++++.|++- | T Consensus 14 ~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP-----~~~~---~~~--~-~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~ 82 (515) T protein:vir:70 14 KIPKLWEKFSKKRSPYLDRAKHFAKLTLP-----YLMN---NKG--D-NETSQNGWQGVGAQATNHLANKLAQVLFPAQR 82 (515) T ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHHhcc-----cccC---CCC--C-cccccccccchHHHHHHHHHHHHHHhhcCCCC Confidence 4445555554333 2 33444444322 1100 000 0 01112345566677777777766542 2 Q ss_pred --eeeccCch---------H----HHHHH--------HHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEc Q lcl|NC_021326. 72 --IAFKHTDD---------E----VIKRI--------DEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVP 128 (445) Q Consensus 72 --~~~~~~d~---------~----~~~~l--------~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~ 128 (445) +++...+. . +.+.+ ..+..+||...+.++.++..++|.+.+++ |+++. +++++ T Consensus 83 ~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~--d~~~~--~~~~p 158 (515) T protein:vir:70 83 SFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK--PSKGA--MSAVP 158 (515) T ss_pred cccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEEEE--eCCCC--eEEEE Confidence 23332211 1 11111 12334688899999999999999987655 66665 44455 Q ss_pred cceeEEEEcCCCCCceEEEEEEEee----------------------ecceeEEEEecceEEEEEEecceeeeccccccc Q lcl|NC_021326. 129 AEQGIPIWTDKEHEELEAFIRMYKL----------------------ENETKVEYWDKITVNYYVYENGSLIPDYSNNLE 186 (445) Q Consensus 129 p~~~~~v~d~~~~~~~~~~v~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 186 (445) -.+ |.+..+. .+++...+|.++. +....+++|+... ..-...+.... ...+ T Consensus 159 l~~-y~v~~d~-~G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~---~~~~~~~~~~~--e~d~ 231 (515) T protein:vir:70 159 MHH-YVVNRDT-NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQ---YAGEGFWKINQ--SADD 231 (515) T ss_pred cCe-EEEeeCC-CcCeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEE---ecCCCceEEEE--ecCc Confidence 444 4444443 5666666554432 1111233332111 01011011100 0011 Q ss_pred ccccccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhh Q lcl|NC_021326. 187 NSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLL 261 (445) Q Consensus 187 ~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~ 261 (445) .........++..+|++.+. ++.+|+|-.....+-+..+|...-...........|.+.+.-......... .. T Consensus 232 ~~~~~es~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~l--~~ 309 (515) T protein:vir:70 232 IPVGKESRIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTDVDHF--VN 309 (515) T ss_pred eeeccccccccccCCceeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccchhhc--cc Confidence 11111222234566766543 346899999999999999998888888888888888876532111111110 01 Q ss_pred hhCceeeccCCCceeeEecc--CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 262 RYYGAIKVSDNGGVDTIQVE--VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKA 339 (445) Q Consensus 262 ~~~~~~~~~~~~~~~~l~~~--~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~ 339 (445) ...+.+..+..++++.+... .+.......++.++..|...-....+ ....+...+++.+.. +..++...+ T Consensus 310 ~~~g~iv~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l-~~rd~~rvTAtEV~~-------r~~E~~~~L 381 (515) T protein:vir:70 310 SGTGEVITGVAEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETM-TRRDAERVTAVEIQR-------DALEIEQNM 381 (515) T ss_pred cCCceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhh-hccCCccccHHHHHH-------HHHHHHHHh Confidence 22234444445566666533 35677778888888777654332211 111123457766654 445555566 Q ss_pred HHHHHHH--------HHHHHHHhccCCCcceEEEEeCCCCCCCHHHH---HHHHHHH---hc-----------cCChHHH Q lcl|NC_021326. 340 KVAIQEL--------LWFVFEHFDIKGEHKDVDISFNYNKVANTELQ---VQTAQQS---MG-----------IVSHETV 394 (445) Q Consensus 340 ~~~l~~~--------~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~---~~~~~~~---~g-----------~~s~et~ 394 (445) +..+.++ +.++..-..-..-...+.+.+.. +.+.+.. ++.+... .+ .+....+ T Consensus 382 Gpv~srL~~Ell~Pli~r~~~~~~p~~P~~~v~~~~vs--~l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~~~~id~d~~ 459 (515) T protein:vir:70 382 GGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVT--GIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDY 459 (515) T ss_pred hHHHHHHHHHHHHHHHHHHHHhhCCCCChhhcccceeh--hHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhhCCHHHH Confidence 6655553 22221111111111113333211 2222222 2222111 11 1111112 Q ss_pred H----HhCCC---CCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCC Q lcl|NC_021326. 395 L----ENHPF---VEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQS 444 (445) Q Consensus 395 l----~~l~~---~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 444 (445) + ..++. +--.++|++.+.+++++..+...- ....+..+...-.+.-+++ T Consensus 460 ~~~~a~~~g~p~~~~rs~eev~~~r~q~~~~~~~~~~-~~~~~~a~~~~~~~~~~~~ 515 (515) T protein:vir:70 460 MDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAML-NEGVAKAVPGVIQQEMKEG 515 (515) T ss_pred HHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHH-HHhhhhhcccchhhhhccC Confidence 2 22211 112346676666554443322111 1111111111111111122 No 190 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=97.31 E-value=0.0001 Score=42.52 Aligned_cols=389 Identities=13% Similarity=0.084 Sum_probs=145.2 Q ss_pred ChHHHHHHHHH-----HHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeec Q lcl|NC_021326. 1 MIVRYIKQHLE-----KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFK 75 (445) Q Consensus 1 ~l~~~i~~~~~-----~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~ 75 (445) +....- -++. ....+..+.+.|.+ +++...-.. .+.+ .+.-|+ ++...+.-+.+=|+.+. T Consensus 53 ~~~~~~-g~~~~~~~~~~~~~~~l~~~~~~-~~~~~~~i~---------t~~~--~va~~~--~i~~~s~~~~~~~i~l~ 117 (535) T protein:vir:10 53 ADGNVA-GQYSVASISDVLSTKKLLKAYAD-NDIVQAIIR---------TRTN--QVLTYS--NPSRYNRNGVGFKVELK 117 (535) T ss_pred ccCCcc-cccccCccccccCHHHHHHHhcc-ChhHHHHHH---------HHHH--HHHHHH--HHHHHhcccCcceeEEE Confidence 110000 0000 00111122222222 122111000 0110 111122 11222222222233221 Q ss_pred -----cCch--HHHHHHHHHhc---cC-------HHHHHHHHHHHHHhcC-eEEEEEEECCCCcEE-EEEEccceeEEEE Q lcl|NC_021326. 76 -----HTDD--EVIKRIDEVLG---NR-------FDDKLHSVLTGASNKG-IEWLHPYLDEEGEFK-LFRVPAEQGIPIW 136 (445) Q Consensus 76 -----~~d~--~~~~~l~~~~~---n~-------~~~~~~~~~~~~~~~G-~~~~~v~~d~~g~~~-i~~~~p~~~~~v~ 136 (445) .+.. .....+..++. |. +...+..+..+.+.+| .+|+.+..+..|++. +..++|..+.+.. T Consensus 118 ~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv~d~l~~~g~ay~~i~r~~~G~~~~L~~l~p~~V~v~~ 197 (535) T protein:vir:10 118 DATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDTFPRLLTKIINDMYVQDQINIERIFKNDSNELDHFNAVDASKVVISY 197 (535) T ss_pred eccCCCcchhhhhhhHHHHHHHhCCCCCCChhHHHHHHHHHHHHHHHhhCCceEEEEEECCCCcEEEEEEeCCceeEEEE Confidence 1111 11112222332 21 1234455666777765 579988888889875 8889999988776 Q ss_pred cCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC--------C Q lcl|NC_021326. 137 TDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN--------N 208 (445) Q Consensus 137 d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n--------~ 208 (445) +......... ++..........+... . |++++. . T Consensus 198 d~~~~~~~~~---~~~~~~~~~~~~~~~~---------------------------------e--iih~~~~~~~~~~~~ 239 (535) T protein:vir:10 198 SPRSKDQPRK---FEQFVSETKSVKFSER---------------------------------N--LTFINYWNLSDTDRR 239 (535) T ss_pred cCccccCceE---EEEEecCceeEEECcc---------------------------------c--EEEEeccCCCCcccc Confidence 6432211111 1111111000000000 0 223221 2 Q ss_pred CCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEe--cCCc-----ccchhHHHhhh-------hCceeeccCCCc Q lcl|NC_021326. 209 DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLT--NYDD-----QELPEFKRLLR-------YYGAIKVSDNGG 274 (445) Q Consensus 209 ~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~--g~~~-----~~~~~~~~~~~-------~~~~~~~~~~~~ 274 (445) ..|.|.++.+...+.....+..-..+.+...+.|-.++. +... +....++..+. ..+.+.+-.+.+ T Consensus 240 ~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~lk~~~~~~~~G~~nag~~~vl~~~g 319 (535) T protein:vir:10 240 GYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQANQMMLAGIRRQWTSQGSGLGGAWKIPILAAKD 319 (535) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCcccccccccccCCC Confidence 247777877777776666555555555666677755443 3211 11222222221 111122223334 Q ss_pred eeeEeccCC--hHHHHHHHHHHHHHHHHHhCcccccccccc----CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 275 VDTIQVEVP--VENSKKYLDELYQKIMLFGQAVDFSSDKFG----SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLW 348 (445) Q Consensus 275 ~~~l~~~~~--~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~----~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~ 348 (445) ++|.....+ ...+.+..+...+.|...-++|....+-.. ++.++... ..+..... ......+...|.-++. T Consensus 320 ~~~~~l~~~~~D~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~-~~~~s~~E--~~~~~~~~~~L~P~l~ 396 (535) T protein:vir:10 320 AKFVNMTQNSRDMEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKS-VNEGSTAK--AKLESSKDKGLTPLLS 396 (535) T ss_pred ceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhh-hhhhhhHH--HHHHHHHHHHHHHHHH Confidence 566555543 344555566666777777788764433211 11111111 00000000 1111122223333333 Q ss_pred HHHHHhcc---CCCcceEEEEeCCCCCCCHHHHHHHHHH-HhccCChHHHHHhCCC--CCCHHHHHHHHHHHHH-----H Q lcl|NC_021326. 349 FVFEHFDI---KGEHKDVDISFNYNKVANTELQVQTAQQ-SMGIVSHETVLENHPF--VEDLQAELERIEQEQM-----E 417 (445) Q Consensus 349 ~~~~~~~~---~~~~~~i~v~f~~~~p~d~~~~~~~~~~-~~g~~s~et~l~~l~~--~~d~~~E~~ri~~E~~-----~ 417 (445) .+...+.. ......+.+.|......|..+.+++... ..|+++.-.++++++. ++.-+.-+-.+..+.- . T Consensus 397 ~ie~~ln~~Ll~~~~~~~~f~f~~l~~~d~~~r~~~~~~~~~g~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~ 476 (535) T protein:vir:10 397 FIEQVINDKIMRYVDTDYRFSFTLGDAQDKLQEEQVWKLKLANGYFINEYRKDHGLKTVDGLDVPGFIGSAENFINATGF 476 (535) T ss_pred HHHHHHhhhcccccCCeEEEEeccccccCHHHHHHHHHHHHcCCCCHHHHHHHhCCCCCCCccccccccchhhccccccc Confidence 22222211 1112357788887777888777666543 3577898888887743 2110000000000000 0 Q ss_pred HHHhhhccc-----------------------cCCCCCCCC--CCCCCCcCCC Q lcl|NC_021326. 418 YNKQLPNLD-----------------------DGGADGAQQ--KERSNDKQSE 445 (445) Q Consensus 418 ~~~~~~~~~-----------------------~~~~~~~~~--~~~~~d~~~~ 445 (445) .....++.. .+..++..+ +..++++.|. T Consensus 477 ~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 529 (535) T protein:vir:10 477 GQPNVPDSSDDSGSTLGERERQERIQHSKDYEKGKDDPKSPLPKPSESDDVSN 529 (535) T ss_pred ccccCCCCCCCccccCCccccCcccccccccccCCCCCCCCCCcCCCCCcccc Confidence 000000000 000000000 0000111110 No 191 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=97.26 E-value=0.00011 Score=42.23 Aligned_cols=373 Identities=12% Similarity=0.026 Sum_probs=151.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) ++.+.-+....+ +.+.--+ ................. .+.-+.+.-...+|+..++-+..-|+.+--..+. T Consensus 6 ~~~~~~~~~~~~------~~~~~~~--~~~~~~~~~~~~~~~v~--~~~~~~~~~V~~ci~~Ia~~ia~lp~~~~~~~~~ 75 (409) T protein:vir:93 6 IVTRIKKKLIDN------WIDQSTS--KLYDFSPWKNRSFWGVI--NNTLETNETIFSAITKLSNSMASLPLKMYEDYKV 75 (409) T ss_pred hhhhhhhhhhhh------hhccccc--cccccccccCccccccc--hhhhhccHHHHHHHHHHHHhhhhCceeEeecccc Confidence 222222221111 0000000 00000000000000000 0000122333445666666655556665322222 Q ss_pred HHHHHHHHh-c--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEEEEee Q lcl|NC_021326. 81 VIKRIDEVL-G--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKL 153 (445) Q Consensus 81 ~~~~l~~~~-~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~ 153 (445) ....+...+ . | .-..+...+..+.+.+|.+|+++..+..|++ .+..++|..+.+..++. .+.+. |.. T Consensus 76 ~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~~~~~~~-~~~~~-----y~~ 149 (409) T protein:vir:93 76 VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ-SRELY-----YSI 149 (409) T ss_pred ccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCC-CcEEE-----EEE Confidence 222222222 1 3 2455567778889999999999999988875 57888898887766542 12111 111 Q ss_pred ecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHHHHHHHH Q lcl|NC_021326. 154 ENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRR 228 (445) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~lid~~~~~ 228 (445) ....+.. + .+ . ... |+|+++ ...|.|.+..+...++..+.+ T Consensus 150 ~~~~g~~------~-~~--~-----------------------~~e--Vih~r~~~~~~~~~G~s~i~~~~~~i~~~~~~ 195 (409) T protein:vir:93 150 HAATGNK------L-IV--H-----------------------NMD--MLHFKHIVASNMVQGISPIDVLKNTTDFDNAV 195 (409) T ss_pred EcCCceE------E-EE--c-----------------------ccc--EEEeCCCCCCCccccccHHHHHHHHHHHHHHH Confidence 1000000 0 00 0 000 333332 124777776655555544332 Q ss_pred HHHHHHHHHHhcCC-eeEE-ecCCcc--cchhHHHhh----h-hCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHH Q lcl|NC_021326. 229 LSDLSNTFKDSNEL-TYVL-TNYDDQ--ELPEFKRLL----R-YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIM 299 (445) Q Consensus 229 ~s~~~~~~~~~~~~-~l~~-~g~~~~--~~~~~~~~~----~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~ 299 (445) .. . . +..+..+ -.++ .+.... ........+ . ..+++.++++.+++.+........+.+..+..+..|+ T Consensus 196 ~~-~-~-~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia 272 (409) T protein:vir:93 196 RT-F-N-LTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVA 272 (409) T ss_pred HH-H-H-HHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHH Confidence 11 1 1 2333332 2222 332221 111111111 1 2335555655555544433334455666666777888 Q ss_pred HHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----ccCC--CcceEEEEeCCCCC Q lcl|NC_021326. 300 LFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF-----DIKG--EHKDVDISFNYNKV 372 (445) Q Consensus 300 ~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~-----~~~~--~~~~i~v~f~~~~p 372 (445) ..-++|....+..+ +.+...++.... ..+...|.-++..+...+ .... ....+++.+..-+- T Consensus 273 ~~fgVPp~~lg~~~-~~~~sn~e~~~~----------~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~~~~~~fd~~~ll~ 341 (409) T protein:vir:93 273 NVFQLPSVFLNARS-NTNFAKNEELNR----------FYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLR 341 (409) T ss_pred HHhCCCHHHhCCCC-CCCcccHHHHHH----------HHHHHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhc Confidence 88888865443222 122112221111 112222333333222211 1111 11234444455556 Q ss_pred CCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCC-CCCCCCCCCCCcCC Q lcl|NC_021326. 373 ANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGA-DGAQQKERSNDKQS 444 (445) Q Consensus 373 ~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~-~~~~~~~~~~d~~~ 444 (445) .|..+.++++.++ +|+++.-.++++++.-+- +..++.-- ............ ......++.+.+|| T Consensus 342 ~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~--~ggD~~~~-----~~n~~~~~~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:93 342 ADSATQAEVYFKAVRSGYYTINDIREWEDLPPV--EGGDKPLI-----SGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCeeee-----cccccccccchhhcccccCCCCCcCCC Confidence 7888999988876 488999999888754211 10000000 000000000000 00011122222222 No 192 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=97.25 E-value=0.00012 Score=42.16 Aligned_cols=367 Identities=10% Similarity=-0.004 Sum_probs=159.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCcccccc-cccc-ccccccccccccccccchHHHHHHHHHhhhhccCeeeccCc Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEP-KPVD-ATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTD 78 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~-~~~~-~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d 78 (445) |+-.+.+.++....... . .-+.+...-...+ .... .........+..-+.++=...+|+..++-+.+-|+.+.-.. T Consensus 1 m~m~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~~~~~~ 78 (392) T protein:vir:74 1 MILPILNFINQTNDPPE-A-GSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKK 78 (392) T ss_pred CcchhhhhhhcccCccc-c-cccccccccCchhhhhhhccCCCCcccchhhhhcchHHHHHHHHHHHhhccCceeeccch Confidence 33333322221100000 0 0000000000000 0000 00000000000001122234466666666666666553222 Q ss_pred hHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEEEEeeecc Q lcl|NC_021326. 79 DEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENE 156 (445) Q Consensus 79 ~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~ 156 (445) . ...++.-.. ..-......+..+.+.+|.+|+.+..+.+|++ .+..++|..+.+..+.. .+.+.+ ++...... T Consensus 79 ~--~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~-~~~~~y--~~~~~~~~ 153 (392) T protein:vir:74 79 N--QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEY-ENGMYY--NITFDDPK 153 (392) T ss_pred h--hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCC-CceEEE--EEEecCCc Confidence 1 122221111 12355666778899999999999999988986 57888998887766542 122211 11000000 Q ss_pred -eeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHHHHHHHHHH Q lcl|NC_021326. 157 -TKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLS 230 (445) Q Consensus 157 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~lid~~~~~~s 230 (445) .....+.. .. |+|++. ...|.|.+..+...++....+.. T Consensus 154 ~~~~~~~~~---------------------------------~e--vih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~ 198 (392) T protein:vir:74 154 IEPILQAPQ---------------------------------SD--LIHMKLLSIDGGKTGISPLYSLRRESKIQRASDR 198 (392) T ss_pred cceeEEEcC---------------------------------cc--EEEecCCCCCCccccccHHHHHHHHHHHHHHHHH Confidence 00000000 00 222221 23578888877777766666655 Q ss_pred HHHHHHHHhcCCeeEEe--cCCcccchh---HHHhh----hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 231 DLSNTFKDSNELTYVLT--NYDDQELPE---FKRLL----RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLF 301 (445) Q Consensus 231 ~~~~~~~~~~~~~l~~~--g~~~~~~~~---~~~~~----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~ 301 (445) -....+...+.|-.+++ +........ +.... ...+++.++++.+.+.+........+.+..+.....|... T Consensus 199 ~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~ 278 (392) T protein:vir:74 199 LTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKV 278 (392) T ss_pred HHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHH Confidence 56666677777766554 321111111 11111 1234556666655555544444556667777777888888 Q ss_pred hCccccccccccCcch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHH Q lcl|NC_021326. 302 GQAVDFSSDKFGSAPS-GVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQ 380 (445) Q Consensus 302 s~~p~~~~~~~~~~~S-g~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~ 380 (445) -++|....+..+.+.| ..+.+. .+...|.-.++.+...+...- ...+++.+...+..|..+.++ T Consensus 279 fgVPp~~lg~~~~~~~~~e~~~~--------------~~~~~l~p~~~~ie~~l~~~l-~~~~~~~~~~~~~~d~~~~~~ 343 (392) T protein:vir:74 279 YGLPDSYIGGQGDQQSSIQQISG--------------MYASALNRYLRPAISELEYKL-SDHISVNMRPAIDPLGDNYLS 343 (392) T ss_pred hCCCHHHhCCCCCcccHHHHHHH--------------HHHHHHHHHHHHHHHHHHHhc-cchhcccchhhhcCCHHHHHH Confidence 8888654443322222 222221 122222222222222111110 011233333444456777777 Q ss_pred HHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCC Q lcl|NC_021326. 381 TAQQS--MGIVSHETVLENH---PFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSN 440 (445) Q Consensus 381 ~~~~~--~g~~s~et~l~~l---~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (445) .+.++ .|+++...+.+++ ++..+ |+.+ ...++...++ ++.+..+ T Consensus 344 ~~~~l~~~g~~t~near~~~~~~g~~pn---e~r~--------~enl~~~~~G-----d~~~p~p 392 (392) T protein:vir:74 344 TISTATRWGALAENQATFVLQEAGYIPK---DLPA--------PENTNKKTTG-----QSNEPVP 392 (392) T ss_pred HHHHHHhCCCcCHHHHHHHHHhCCCCcc---ccch--------hcCCCCCCCC-----CCCCCCC Confidence 77776 4899998887664 43321 1111 0112221111 1111111 No 193 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=97.22 E-value=0.00013 Score=41.94 Aligned_cols=393 Identities=10% Similarity=0.036 Sum_probs=159.7 Q ss_pred ChHHHHHHHHHHHHH----HHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeecc Q lcl|NC_021326. 1 MIVRYIKQHLEKLPE----ISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH 76 (445) Q Consensus 1 ~l~~~i~~~~~~~~~----~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~ 76 (445) ||. .+-..---+. .....+-|-|.... ..+..... ......-..++....+|+..+.-+-+-|+.+-- T Consensus 1 ~~~--~~~~~~~~p~~~~~~~~~~~~~~~~~~~-g~~~~~~~-----~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~~ 72 (518) T protein:vir:78 1 MLL--ANGQTLSAPAMAELSPQMQDSYYYAPAV-GMQLERQF-----SLYGGIYKNQPWVRTVIAKRAQALARLPVKCMF 72 (518) T ss_pred Ccc--cCceeeccchhhhhhhhhhhccccccee-ceeccccc-----chhhHHhhhhHHHHHHHHHHHHhhccCceEEEE Confidence 110 0000000000 01111111111000 00000000 000000011233455667666666666665411 Q ss_pred --Cch---HHHHHHHHHhc--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceE Q lcl|NC_021326. 77 --TDD---EVIKRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELE 145 (445) Q Consensus 77 --~d~---~~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~ 145 (445) .+. .....+..++. | .-..+...+..+.+.+|.+|+.+..+..|++ .+..++|..+.+..+... +.+. T Consensus 73 ~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv~~~~~~-~~~~ 151 (518) T protein:vir:78 73 TSGDTETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT-GRYE 151 (518) T ss_pred EcCCccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEEEEcCCC-CEEE Confidence 111 11112222332 3 2345666778889999999999999998886 478889988887766421 2221 Q ss_pred EEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC----C-CCcCccHHHHHH Q lcl|NC_021326. 146 AFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN----N-DLEISDIFMYKT 220 (445) Q Consensus 146 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n----~-~~g~s~~~~v~~ 220 (445) + ++ ........ ....+ +-.. |+++++ . ..|.|.+..... T Consensus 152 y--~~-~~~~~~~~------~~~~~-------------------------~~~e--IiHir~~~~dg~~~G~Spi~~~~~ 195 (518) T protein:vir:78 152 Y--YF-QAGAGVGT------QLVSF-------------------------ADDE--VVPIRFFNPDGLERGLSLMESLKS 195 (518) T ss_pred E--EE-EecCCccc------eeEEe-------------------------cCCc--EEEecCCCCCcccccccHHHHHHH Confidence 1 11 11100000 00000 0001 233321 1 246777776666 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCeeEEecCC---cccchhHHHhh--------hhCceeeccCCCceeeEeccCChHHHHH Q lcl|NC_021326. 221 LIDAYNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLL--------RYYGAIKVSDNGGVDTIQVEVPVENSKK 289 (445) Q Consensus 221 lid~~~~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~--------~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 289 (445) .+.....+.....+.+...+.|-.++.... .+....++..+ ..++++.++++.+.+.++.+.....+.+ T Consensus 196 ~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~ls~e~~~~~k~~~~~~~~G~~nag~~~vL~~G~~~~~l~~~~~d~q~le 275 (518) T protein:vir:78 196 TIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSPEAQQRLREQFDRAHAGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIE 275 (518) T ss_pred HHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhcCcccCCceeEcCCCceEEeccCChhHHHHHH Confidence 666555555555555677777766665322 11122222222 1233566666665555554434445556 Q ss_pred HHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeC Q lcl|NC_021326. 290 YLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLK-ADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFN 368 (445) Q Consensus 290 ~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~ 368 (445) ..+.....|...-++|....+...+ .+...++.....+... +.-....+...+.+ .+ +........+++... T Consensus 276 ~r~~~~~eIa~afgVPp~~lg~~~~-st~sn~e~~~~~f~~~tL~P~~~~ie~eln~---~L---~~~~~~~~~~~fd~~ 348 (518) T protein:vir:78 276 ARQLNREEVCGVYDIAPPIVHILDR-ATFSNISAQMRAFYRDTMAIPIARIQSAMDK---YV---GQYWVRKNRMKFDID 348 (518) T ss_pred HHHHHHHHHHHHhCCCHHHhccCCC-CCchhHHHHHHHHHHHHHHHHHHHHHHHHHH---hh---cccccCcceEEeech Confidence 6666677788877888644332211 1111122111111111 11111111111111 11 111112223455555 Q ss_pred CCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHH-------HHHHHHHHHHH--HHhhhccccCCC----C Q lcl|NC_021326. 369 YNKVANTELQVQTAQQS--MGIVSHETVLENHPF--VEDLQAE-------LERIEQEQMEY--NKQLPNLDDGGA----D 431 (445) Q Consensus 369 ~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~--~~d~~~E-------~~ri~~E~~~~--~~~~~~~~~~~~----~ 431 (445) ..+..|..+.++++.++ .|+++.-.++++++. ++++... +..+..-.... .+..+...+... . T Consensus 349 ~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~~~gD~~~v~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~ 428 (518) T protein:vir:78 349 DVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVAS 428 (518) T ss_pred hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccceecccccccccCCCCCCCCCCCCcccccc Confidence 67778999999988876 489999889888754 3321110 01111000000 000000000000 0 Q ss_pred C-----CCCCCCCCCcC---CC Q lcl|NC_021326. 432 G-----AQQKERSNDKQ---SE 445 (445) Q Consensus 432 ~-----~~~~~~~~d~~---~~ 445 (445) . +..+.-+++.+ +| T Consensus 429 ~~~~~~~~~~~~~~~~~~~~~~ 450 (518) T protein:vir:78 429 LDQSPPASVPGLSPTNSDRSTD 450 (518) T ss_pred cccCccccCCCCCccccccccc Confidence 0 00000000000 00 No 194 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=97.16 E-value=0.00015 Score=41.62 Aligned_cols=389 Identities=10% Similarity=-0.038 Sum_probs=163.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHH-----hcCCCcccccccccc-ccccccccccccccccchHHHHHHHHHhhhhccCeee Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEY-----YEQRPDIVKEPKPVD-ATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF 74 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~y-----y~G~~~i~~~~~~~~-~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~ 74 (445) ++..-........+.+.....- |.|-.+ .+....- ........+. . .......-.+.+....+.+.+.++ T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~--~~~~~iLr~~~~~~ly~~-m-~~D~hi~s~l~~Rk~av~~~~w~v 86 (448) T protein:vir:77 11 LVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVD--REFDELLQGKDGLLVYHK-M-LSDGTVKNALNYIFGRIRSAKWYV 86 (448) T ss_pred cCCcccccchhhhhhhccchhhhcccccccccc--cchhHhhccccchHHHHH-H-hhChHHHHHHHHHHHHHhcCCceE Confidence 1111111111111111111111 111100 0000000 0000000000 0 012445556666667778888888 Q ss_pred ccCc-----hHHHHHHHHHhcc--------CHHHHHHHHHHHHHhcCeEE-EEEEE-CCCCcEEEEEE---cccee-EEE Q lcl|NC_021326. 75 KHTD-----DEVIKRIDEVLGN--------RFDDKLHSVLTGASNKGIEW-LHPYL-DEEGEFKLFRV---PAEQG-IPI 135 (445) Q Consensus 75 ~~~d-----~~~~~~l~~~~~n--------~~~~~~~~~~~~~~~~G~~~-~~v~~-d~~g~~~i~~~---~p~~~-~~v 135 (445) .+.+ ....+++.+++.. +|.+.+..+ .++..+|.++ +++|. ..+|...+..+ +|... ... T Consensus 87 ~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~-lda~~~G~s~~Eivw~~~~dg~~~~~~l~~r~~~~~~~f~ 165 (448) T protein:vir:77 87 EPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTLGADGKLILDKIVPIHPFNIDEVL 165 (448) T ss_pred ecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHH-HHhhhhcceeEEEEEeecCCCceeeccccccCCCccceee Confidence 6422 2344566666542 355554444 6889999975 46664 45676543222 22211 011 Q ss_pred EcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC----CCCc Q lcl|NC_021326. 136 WTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN----NDLE 211 (445) Q Consensus 136 ~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n----~~~g 211 (445) |+.. +.+.. .+ ..... ..........+-+++. ++++.. ++.| T Consensus 166 ~~~~--~~l~~---------------~~--------~~~~~-------~~~~~~~~~~~lP~~~--~i~~~~~~~g~p~g 211 (448) T protein:vir:77 166 YDEE--GGPKA---------------LK--------LSGEV-------KGGSQFVNGLEIPIWK--TVVFLHNDDGSFTG 211 (448) T ss_pred eecC--CceEE---------------Ee--------cCCcc-------cccccCCCccccccce--EEEEecCCcCCccc Confidence 1111 11110 00 00000 0000000011123333 233332 4567 Q ss_pred CccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCccc--chh------HHHhhh--hCceeeccCCCceeeEecc Q lcl|NC_021326. 212 ISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQE--LPE------FKRLLR--YYGAIKVSDNGGVDTIQVE 281 (445) Q Consensus 212 ~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~--~~~------~~~~~~--~~~~~~~~~~~~~~~l~~~ 281 (445) .|.+..+.-..--=+..+.+++.-++.++.|+++.+-..... ..+ ...++. ...+..++++.++++++.. T Consensus 212 ~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~g~~ie~~ea~ 291 (448) T protein:vir:77 212 QSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILPDDWKFDTVDLK 291 (448) T ss_pred chHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeEEecCCCCCCCHHHHHHHHHHHHHHhcCCceEEEecCCceEEEEecC Confidence 888877655544456677888999999999999877432211 111 112222 2335568999999999977 Q ss_pred CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhccCCCc Q lcl|NC_021326. 282 VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQ-ELLWFVFEHFDIKGEH 360 (445) Q Consensus 282 ~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~~~~~~~~~~~~~ 360 (445) .+...+..+++.+.+.|...--.--++.+..+ ..++.+......-....+..-.+.+...+. +++.-++.+. ..... T Consensus 292 ~~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~-g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~lN-fg~~~ 369 (448) T protein:vir:77 292 SAMPDAIPYLTYHDAGIARALGIDFNTVQLNM-GVQAVNIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPN-WPGAT 369 (448) T ss_pred CCccCHHHHHHHHHHHHHHHHhcccccccccc-chhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCC Confidence 66666778888888887665422112222222 222223222111112222333444555554 4666555543 22222 Q ss_pred ceEEEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCC Q lcl|NC_021326. 361 KDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSN 440 (445) Q Consensus 361 ~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (445) .-..+.|....+.|..+.++.+.++++.+ .+.+ ++..+..+. -......+.... +..+.+..... T Consensus 370 ~~P~~~f~~~e~eDl~~~a~~~~~l~~~~-----~~~~-~ip~~~~~~-------~~~~~~~~~~~~--~~~~~~~~~~~ 434 (448) T protein:vir:77 370 RFPRLTFEMEERNDFSAAANLMGMLINAV-----KDSE-DIPTELKAL-------IDALPSKMRRAL--GVVDEVREAVR 434 (448) T ss_pred CCCEEEecCCChhhHHHHHHHhHHHHHHH-----HHHh-cCCccCCcC-------CCCCchhccccc--CCCCCCCchhh Confidence 23467888888888888888888776531 1111 111100000 000000000000 00000011111 Q ss_pred CcCCC Q lcl|NC_021326. 441 DKQSE 445 (445) Q Consensus 441 d~~~~ 445 (445) ++... T Consensus 435 ~~~~~ 439 (448) T protein:vir:77 435 QPADS 439 (448) T ss_pred cchhh Confidence 11111 No 195 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=97.16 E-value=0.00015 Score=41.61 Aligned_cols=376 Identities=11% Similarity=0.023 Sum_probs=151.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCC-ccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRP-DIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~-~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) |.++=|... ++. ..+.++..... ............+..... +.-+.++-...+|+..++-+..-|+.+--..+ T Consensus 1 ~~~~~~~~~---~k~-~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~--~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~ 74 (409) T protein:vir:94 1 MAKENIVTR---IKK-KLIDNWIDQSASKLYDFSPWKNKSFWGVIN--NTLETNETIFSAITKLSNSMASLPLKMYEDYK 74 (409) T ss_pred Ccccccchh---hhh-HHhhhhhcCCcccccccccccCccccccch--hhhhccHHHHHHHHHHHHhhhhCceeEeeccc Confidence 322222111 111 00111111100 000000000001110000 00011222344556555555555665422111 Q ss_pred HHHHHHHHHh-c--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEEEEe Q lcl|NC_021326. 80 EVIKRIDEVL-G--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYK 152 (445) Q Consensus 80 ~~~~~l~~~~-~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~ 152 (445) .....+...+ . | +-..+...+..+.+.+|.+|+++..+..|++ .+..++|..+.++.++.. +.+.+ ++ . T Consensus 75 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~-~~~~y--~~-~ 150 (409) T protein:vir:94 75 VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQS-RELYY--SI-H 150 (409) T ss_pred ccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEeCCC-cEEEE--EE-E Confidence 1111122222 1 2 2345556678889999999999999988886 578889988877766431 21111 00 0 Q ss_pred eecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHHHHHHH Q lcl|NC_021326. 153 LENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNR 227 (445) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~lid~~~~ 227 (445) ......+. +.. .. |+|+++ ...|.|.+......++.... T Consensus 151 ~~~g~~~~-~~~---------------------------------~d--vih~r~~~~~~~~~G~s~l~~~~~~i~~~~~ 194 (409) T protein:vir:94 151 AATGNKLI-VHN---------------------------------MD--MLHFKHIVASNMVQGISPIDVLKNTTDFDNA 194 (409) T ss_pred cCCceEEE-Ecc---------------------------------cc--EEEecCCCCCCccccccHHHHHHHHHHHHHH Confidence 00000000 000 00 233321 22477777666555554443 Q ss_pred HHHHHHHHHHHhcC-CeeEE-ecCCc--ccchhHHHhh-----hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHH Q lcl|NC_021326. 228 RLSDLSNTFKDSNE-LTYVL-TNYDD--QELPEFKRLL-----RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKI 298 (445) Q Consensus 228 ~~s~~~~~~~~~~~-~~l~~-~g~~~--~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i 298 (445) +.. . .+..+.. +-.++ .+... +........+ ...+++.++++.+++.+........+.+..+.....| T Consensus 195 ~~~-~--~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~I 271 (409) T protein:vir:94 195 VRT-F--NLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERV 271 (409) T ss_pred HHH-H--HHHhcCCCCeeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHH Confidence 321 1 1233332 22333 23222 1111112111 1233555665555555443333445556666677788 Q ss_pred HHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hccCC--CcceEEEEeCCCC Q lcl|NC_021326. 299 MLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEH-----FDIKG--EHKDVDISFNYNK 371 (445) Q Consensus 299 ~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~-----~~~~~--~~~~i~v~f~~~~ 371 (445) +..-++|....+..+ ..+...++.....+ +...|.-++..+... +...+ ....+++....-+ T Consensus 272 a~~fgVPp~~lg~~~-~~~~sn~e~~~~~f----------~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~i~fd~~~ll 340 (409) T protein:vir:94 272 ANVFQLPSVFLNARS-NTNFAKNEELNRFY----------LQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYL 340 (409) T ss_pred HHHhCCCHHHhCCCC-CCCcccHHHHHHHH----------HHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhh Confidence 888888865443221 22222222211111 122222222222211 11111 1122444445556 Q ss_pred CCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCC--CCCCCCCCCCcCC Q lcl|NC_021326. 372 VANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGAD--GAQQKERSNDKQS 444 (445) Q Consensus 372 p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~--~~~~~~~~~d~~~ 444 (445) ..|..+.++++.++ +|+++.-.+++.++.-+- +-.+.+-- ....... +.... .....++++.+|| T Consensus 341 ~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~--~ggD~~~~-----~~n~~~~-~~~~~~~~~~kGG~~n~~e~ 409 (409) T protein:vir:94 341 RADSATQAEVYFKAVRSGYYTINDIREWEDLPPV--EGGDKPLI-----SGDLYPI-DTPLELRKSLKGGDKNVNES 409 (409) T ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCeEee-----ccccccc-ccchhhcccccCCCCCcCCC Confidence 67888999988887 589998888887754211 00000000 0000000 00000 0111222222222 No 196 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=97.15 E-value=2.3e-05 Score=46.00 Aligned_cols=175 Identities=10% Similarity=0.130 Sum_probs=89.8 Q ss_pred eeEEecCC---cccchhHHHhh------h-hCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccc--cc Q lcl|NC_021326. 243 TYVLTNYD---DQELPEFKRLL------R-YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFS--SD 310 (445) Q Consensus 243 ~l~~~g~~---~~~~~~~~~~~------~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~--~~ 310 (445) ++.+.|.. ..........+ + ...++.+.++ +-+|-+.+.+.+.+...+......|...+++|-.- +. T Consensus 1 V~k~~~l~~~~~~~~~~~~~r~~~~~~~~~~~~~~~ld~~-~e~~e~~~~~lsGl~d~l~~~~~~iaa~s~iP~t~LfG~ 79 (201) T protein:vir:10 1 MWKAKGLADLCDDSDGAARLRLAQVDNNSGVGQAIGIDAD-SEEYNVLNSDIGGIDTFLSQKFDRIVALSGIHEIILKGK 79 (201) T ss_pred CccchHHHHHhcCChHHHHHHHHHHHHhhhhhhhheeecC-CcceeeeecCcCChHHHHHHHHHHHHhHhcCchhhhcCC Confidence 11111110 00000110000 1 0122222222 23466677888899999999999999999999542 22 Q ss_pred cccC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHH---- Q lcl|NC_021326. 311 KFGS-APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQS---- 385 (445) Q Consensus 311 ~~~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~---- 385 (445) ..+| |.||..-...|...+... .+..+.+.+++++.+++. ..++.++|+|-...+..+.|++..+. T Consensus 80 sp~Glnatge~d~~nyyd~i~~~--Qe~~l~p~le~l~~~~~~-------~~~~~~~f~pL~~~s~kekAei~~~~a~a~ 150 (201) T protein:vir:10 80 NVGGVSASQNTALETFYGYVDRK--RKAELLPLLEFLLPFIVT-------EQEWSVEFNPLSQVSDKDKSEILEKNVNSV 150 (201) T ss_pred CCccccccchhHHHHHHHHHHHH--HHHHHHHHHHHHHHhhcC-------CCCceEeeCCCCCCCHHHHHHHHHHHHHHH Confidence 3233 457776655555554432 346788888888876542 24688999999999999998876553 Q ss_pred -----hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcC----CC Q lcl|NC_021326. 386 -----MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQ----SE 445 (445) Q Consensus 386 -----~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~----~~ 445 (445) .|++|.+.+.+.| .+.- ...-..++..+...+..++.|++ .| T Consensus 151 ~~~~~~g~i~~~e~r~~L-------------~~~~-----~~~~~~~~~~~~~~~~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 151 AALIAAGIIDADEARDTL-------------RAIS-----TEVKIGEGSIQTEVVINESEDPLDVSANN 201 (201) T ss_pred HHHHHcCCCCHHHHHHHH-------------HhcC-----CcCCCCCCCCCccccccccCCCCCCCCCC Confidence 2566665555443 1110 00011111111111111111222 11 No 197 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=97.12 E-value=0.00016 Score=41.33 Aligned_cols=376 Identities=11% Similarity=0.021 Sum_probs=151.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCC--ccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCc Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRP--DIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTD 78 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~--~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d 78 (445) |.++=|-. |++. ..+.++- |.. ............+..... +.-+..+-...+|+..++-+..-|+.+--.. T Consensus 1 ~~~~~~~~---~~k~-~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~--~~a~~~~~V~~ci~~ia~~ia~lp~~~~~~~ 73 (409) T protein:vir:96 1 MAKENIVT---RIKK-KLIDNWI-DQSASKLYDFSPWKNKSFWGVIN--NTLETNETIFSAITKLSNSMASLPLKMYEDY 73 (409) T ss_pred Cccccchh---hhhh-HHhhhhh-ccccccccccccccCccccccch--hhHhhhHHHHHHHHHHHHhhhhCceEEeecc Confidence 22221111 1111 0011111 111 000000000011110000 0001122234455555555555565542222 Q ss_pred hHHHHHHHHHh-c--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEEEE Q lcl|NC_021326. 79 DEVIKRIDEVL-G--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMY 151 (445) Q Consensus 79 ~~~~~~l~~~~-~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~ 151 (445) +.....+...+ . | +-..+...++.+.+.+|.+|+.+..+..|++ .+..++|..+.++.++.. +.+. | T Consensus 74 ~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~-~~~~-----y 147 (409) T protein:vir:96 74 KVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQS-RELY-----Y 147 (409) T ss_pred cccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCC-cEEE-----E Confidence 21112222222 1 2 2345566778889999999999999888875 477788888877765431 1111 1 Q ss_pred eeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec-----CCCCcCccHHHHHHHHHHHH Q lcl|NC_021326. 152 KLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYN 226 (445) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~ 226 (445) ......+. .+... .. -|++++ +...|.|.+..+...++..+ T Consensus 148 ~~~~~~g~---------~~~~~-----------------------~~--evih~r~~~~~~~~~G~s~l~~~~~~i~~~~ 193 (409) T protein:vir:96 148 SIHAATGN---------KLIVH-----------------------NM--DMLHFKHIVASNMVQGISPIDVLKNTTDFDN 193 (409) T ss_pred EEEcCCce---------EEEEc-----------------------cc--cEEEeCCCCCCCccccccHHHHHHHHHHHHH Confidence 11000000 00000 00 133332 12347777766666555443 Q ss_pred HHHHHHHHHHHHhcCC-eeEE-ecCCcc--cchhHHHhh----h-hCceeeccCCCceeeEeccCChHHHHHHHHHHHHH Q lcl|NC_021326. 227 RRLSDLSNTFKDSNEL-TYVL-TNYDDQ--ELPEFKRLL----R-YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQK 297 (445) Q Consensus 227 ~~~s~~~~~~~~~~~~-~l~~-~g~~~~--~~~~~~~~~----~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~ 297 (445) .+. .. .+..+..+ -.++ .+.... ........+ . ..+++.++++.+++.+..+.....+.+..+..... T Consensus 194 ~~~-~~--~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~ 270 (409) T protein:vir:96 194 AVR-TF--NLTEMQKPDSFMLKYGSNVSTEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRER 270 (409) T ss_pred HHH-HH--HHHhcCCCceeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHH Confidence 322 21 12233332 2222 232221 111222211 1 23455566666655554443444556666777778 Q ss_pred HHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hccCCC--cceEEEEeCCC Q lcl|NC_021326. 298 IMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEH-----FDIKGE--HKDVDISFNYN 370 (445) Q Consensus 298 i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~-----~~~~~~--~~~i~v~f~~~ 370 (445) |+..-++|....+... ..+...++... ...+...|.-++..+... +..... ...+++....- T Consensus 271 Ia~~fgVPp~~lg~~~-~~~~s~~e~~~----------~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~l 339 (409) T protein:vir:96 271 VANVFQLPSIFLNARS-NTNFAKNEELN----------RFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSY 339 (409) T ss_pred HHHHhCCCHHHhCCCC-CCCcccHHHHH----------HHHHHHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechhh Confidence 8888888865443221 11111122111 112222232222222221 111111 12344444555 Q ss_pred CCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCC-CCCCCCCCCCCCcCC Q lcl|NC_021326. 371 KVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGG-ADGAQQKERSNDKQS 444 (445) Q Consensus 371 ~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~-~~~~~~~~~~~d~~~ 444 (445) +-.|..+.++++.++ +|+++.-.++++++.-+- +-.+.+-- ........... .....+.++++.++| T Consensus 340 l~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi--~ggD~~~~-----~~n~~~~~~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:96 340 LRADSATQAEVYFKAVRSGYYTINDIREWEDLPPV--EGGDKPLI-----SGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred hccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCC--CCcceeee-----cccccccccchhhcccccCCCCCcCCC Confidence 667889999998887 489999888888754210 00000000 00000000000 001112222222222 No 198 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=97.11 E-value=0.00017 Score=41.28 Aligned_cols=366 Identities=12% Similarity=0.047 Sum_probs=150.3 Q ss_pred HHHhcCCCcccc-----ccc-----cc-cccccccccccccccccchHHHHHHHHHhhhhccCeeec-c-CchHH-HHHH Q lcl|NC_021326. 20 QEYYEQRPDIVK-----EPK-----PV-DATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFK-H-TDDEV-IKRI 85 (445) Q Consensus 20 ~~yy~G~~~i~~-----~~~-----~~-~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~-~-~d~~~-~~~l 85 (445) .++|++...... +.. .. .+.+.. ..+ .++.. .-.+|+..++-+..-|+.+. . .+... ...+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~--~~A-l~~~~--V~~cv~~ia~~iA~lp~~~~~~~~~~~~~~~~~ 75 (417) T protein:vir:38 1 MKLFRGLATEVDPHWADHLLDSGVIPSFRGGYLG--ISA-LRNSD--VLTAVSIVSGDVSRFPLVITDSSTDEVIDLANI 75 (417) T ss_pred CccccccccCCCccchhhhcccccccccCCceec--hhh-cccHH--HHHHHHHHHHhhccCeeEEEEcCCcceeccchH Confidence 344444331100 000 00 000000 000 11111 12356666666666666542 1 11111 1112 Q ss_pred HHHh-c--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCC-cEE-EEEEccceeEEEEcCCCCCceEEEEEEEeeecce Q lcl|NC_021326. 86 DEVL-G--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEG-EFK-LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENET 157 (445) Q Consensus 86 ~~~~-~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g-~~~-i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~ 157 (445) ...+ . | ....+...+..+.+.+|.+|+.+..+..| .+. +..++|..+.+..++. +++.+. +...... T Consensus 76 ~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~--~~~~y~---~~~~~~~ 150 (417) T protein:vir:38 76 EYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDP--DNIIYR---FTPYNSS 150 (417) T ss_pred HHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCC--CeEEEE---EEEcCCc Confidence 2222 1 2 23456667788899999999999988764 343 5668888876654432 222211 0000000 Q ss_pred eEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC----CCCcCccHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 158 KVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN----NDLEISDIFMYKTLIDAYNRRLSDLS 233 (445) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n----~~~g~s~~~~v~~lid~~~~~~s~~~ 233 (445) ..... +... |+|++. ...|.|.+.-+...+.....+..-.. T Consensus 151 ~~~~~---------------------------------~~~d--viH~r~~~~d~~~G~s~l~~~~~~i~~~~~~~~~~~ 195 (417) T protein:vir:38 151 MQKVC---------------------------------GFED--VIHWKFFSYDTIMGRSPLLSLGDEIGLQESGVSTLQ 195 (417) T ss_pred EEEEe---------------------------------cCcc--eEEecCCCCCCccccCHHHHHHHHHHHHHHHHHHHH Confidence 00000 0000 233321 23477777766666655554444445 Q ss_pred HHHHHhcCCeeEEecC---CcccchhHHHhh-------hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhC Q lcl|NC_021326. 234 NTFKDSNELTYVLTNY---DDQELPEFKRLL-------RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQ 303 (445) Q Consensus 234 ~~~~~~~~~~l~~~g~---~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~ 303 (445) ..+...+.|-.++.-. +.+.....+..+ ..++.+.++++.+.+.++.......+.+..+.....|+..-+ T Consensus 196 ~~f~ng~~p~~il~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fg 275 (417) T protein:vir:38 196 KFFKSGLKGSIIKAKESRLSAEARQKIREDFERAQAGADAGSPIIVDATMDYQPLEVDTNVLNLINSNNYSTAQIAKALR 275 (417) T ss_pred HHHhccCCCcEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceeccCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhC Confidence 5556666776554422 111112222221 123345555555444443333333444555555667777777 Q ss_pred ccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----cCCCcceEEEEeCCCCCCCHHHHH Q lcl|NC_021326. 304 AVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD----IKGEHKDVDISFNYNKVANTELQV 379 (445) Q Consensus 304 ~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~----~~~~~~~i~v~f~~~~p~d~~~~~ 379 (445) +|....+..+...+...+. ...+...|.-++..+..-+. ...+.....+.|.... .+.+..+ T Consensus 276 VPp~~lg~~~~~s~~e~~~-------------~~~~~~tl~P~~~~ie~~l~~~Ll~~~~~~~~~~~fd~~~-l~~~~~~ 341 (417) T protein:vir:38 276 VPAYRLAQNSPNQSVKQLA-------------DDYIRNDLPFYFEPITSEFELKLLDDAQRHQYCIGFDTKS-VNGLPIA 341 (417) T ss_pred CCHHHhCCCCcchhHHHHH-------------HHHHHHHHHHHHHHHHHHHHhhhcChhhcccceEEechhh-hhHHHHH Confidence 7765443222222211111 11222233333333322211 1112233456664221 1222222 Q ss_pred HHHHHH--hccCChHHHHHhCCC--CCCHHH-H----HHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 380 QTAQQS--MGIVSHETVLENHPF--VEDLQA-E----LERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 380 ~~~~~~--~g~~s~et~l~~l~~--~~d~~~-E----~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) + +.++ .|+++.-.++++++. +++... + +.-+..+.....+........++++..+.++..+.+++ T Consensus 342 ~-~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~~~n~~~~d~~~~~~~~~~~~~kgg~~~~~~~~~~~~~~~ 415 (417) T protein:vir:38 342 D-VNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQSTLNTVFLDQKEAYQAEHAAELKGGDTNAKGNQNGSGTNA 415 (417) T ss_pred H-HHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeecccccccccccccccccccccCCCCCCCCCCCcCCCCcC Confidence 2 2232 589999999988754 332211 0 00000111111111112222233333344444444444 No 199 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=97.02 E-value=0.00021 Score=40.80 Aligned_cols=377 Identities=9% Similarity=0.016 Sum_probs=159.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeee-cc-Cc Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF-KH-TD 78 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~-~~-~d 78 (445) .+.++.+.-+.....-.-+.....|-.. .+. ..+ ...-+..-+.+.-...+|+..++-+..-|+.+ .. .+ T Consensus 2 ~~~~~~~~~~~~~s~~~~w~~~~~~~~~---~~~---~~g--~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~ 73 (421) T protein:vir:10 2 FIPQMFEGKKRSVSGGGFWEAMLGGVRS---SHS---KAG--VMITPETALALSAVRACVTLLAESVAQLPVELYRRDKN 73 (421) T ss_pred CCcchhcccccccCcchhhHHHhhhhcc---Ccc---cCC--ceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCC Confidence 2222222211111101111111111100 000 000 00000000112223346666666665666654 11 11 Q ss_pred h---HH--HHHHHHHhc--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcEE-EEEEccceeEEEEcCCCCCceEEE Q lcl|NC_021326. 79 D---EV--IKRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFK-LFRVPAEQGIPIWTDKEHEELEAF 147 (445) Q Consensus 79 ~---~~--~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~-i~~~~p~~~~~v~d~~~~~~~~~~ 147 (445) . .. ......+.. | ....+...+..+.+.+|.+|+.+..+.+|++. +..++|..+.++.++. +.+ T Consensus 74 g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~--g~~--- 148 (421) T protein:vir:10 74 GGRQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKELIPINPKKVIVLKGPD--GMP--- 148 (421) T ss_pred CceeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEECCC--ceE--- Confidence 1 11 112222221 2 24455666788999999999999999888864 7778888887655432 211 Q ss_pred EEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCcCccHHHHHHHHH Q lcl|NC_021326. 148 IRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLEISDIFMYKTLID 223 (445) Q Consensus 148 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g~s~~~~v~~lid 223 (445) +|....... .+. ... +++++ +...|.|.+..+...++ T Consensus 149 --~y~~~~~g~------------------~~~-----------------~~e--iih~~~~~~d~~~G~spi~~~~~~i~ 189 (421) T protein:vir:10 149 --YYEIPEIGE------------------TLP-----------------MRM--MHHVKVFSLDGYIGSSPIQTNADVLG 189 (421) T ss_pred --EEEEcCCCc------------------EEc-----------------hhh--EEEecCcCCCCcccccHHHHHHHHHH Confidence 111110000 000 000 22222 22347777777666666 Q ss_pred HHHHHHHHHHHHHHHhcCCeeEEecC---Ccccchh----HHHhh--------hhCceeeccCCCceeeEeccCChHHHH Q lcl|NC_021326. 224 AYNRRLSDLSNTFKDSNELTYVLTNY---DDQELPE----FKRLL--------RYYGAIKVSDNGGVDTIQVEVPVENSK 288 (445) Q Consensus 224 ~~~~~~s~~~~~~~~~~~~~l~~~g~---~~~~~~~----~~~~~--------~~~~~~~~~~~~~~~~l~~~~~~~~~~ 288 (445) ..........+.+...+.|-.++.-. .....++ ....+ ...+++.++++.+.+.+........+. T Consensus 190 ~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~ 269 (421) T protein:vir:10 190 LNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQEGMSYKQMSQDNEKAQLL 269 (421) T ss_pred HHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEecCCChhHHHHH Confidence 55555444555566677776665421 1111111 11111 123455666666555554444444566 Q ss_pred HHHHHHHHHHHHHhCcccccccccc-CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----CCCcc-- Q lcl|NC_021326. 289 KYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI----KGEHK-- 361 (445) Q Consensus 289 ~~i~~l~~~i~~~s~~p~~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~----~~~~~-- 361 (445) +..+...+.|+..-++|....+... ++-|. ++... ...+...|.-++..+...+.. ..+.. T Consensus 270 e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn--~e~~~----------~~f~~~tl~P~~~~ie~~ln~kL~~~~~~~~~ 337 (421) T protein:vir:10 270 QSRQWGVEEVCRLYKIPPHMVQMLAKATNNN--IEHQG----------LQFVMYTLLAWLKRHEGALQRDLLLPSERRDL 337 (421) T ss_pred HHHHHhHHHHHHHhCCCHHHcCCCcCCcccc--HHHHH----------HHHHHHHHHHHHHHHHHHHhhhccCccccCCe Confidence 6667777888888888865433222 11111 11111 111222222222222221111 11122 Q ss_pred eEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCC-CCCCCCC Q lcl|NC_021326. 362 DVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGAD-GAQQKER 438 (445) Q Consensus 362 ~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~-~~~~~~~ 438 (445) .+++.+...+..|..+.++.+.++ .|+++.-.++++++.-. -+--++.--- -..........+... .+.++.+ T Consensus 338 ~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p--~~ggD~~~~~--~n~~~~~~~~~~~~~~~~~~~~e 413 (421) T protein:vir:10 338 YIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLPP--IAGGDKYLTP--LNMVDSAQIIPGDKKPTAQQMAE 413 (421) T ss_pred EEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC--CCCcceeeec--cccccccccccCCCCcccccCcc Confidence 344444556667889999988876 58999999998875421 0001110000 000000011111111 1111111 Q ss_pred CCCcCCC Q lcl|NC_021326. 439 SNDKQSE 445 (445) Q Consensus 439 ~~d~~~~ 445 (445) +++=.++ T Consensus 414 ~d~~~~~ 420 (421) T protein:vir:10 414 IDTILSR 420 (421) T ss_pred ccccccc Confidence 1111111 No 200 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=96.99 E-value=0.00022 Score=40.60 Aligned_cols=364 Identities=7% Similarity=0.017 Sum_probs=163.6 Q ss_pred HHHHHHhcCCCcccccccc--------ccccccccccccccccccchHHHHHHHHHhhhhccCeee-c-cCch---H-HH Q lcl|NC_021326. 17 SIGQEYYEQRPDIVKEPKP--------VDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF-K-HTDD---E-VI 82 (445) Q Consensus 17 ~~~~~yy~G~~~i~~~~~~--------~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~-~-~~d~---~-~~ 82 (445) -.+.++++++... ..... ...........+.+-+.+.-...+|+..+.-+.+-|+.+ . .++. . .. T Consensus 1 m~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~~~~~~~ 79 (419) T protein:vir:57 1 MFIPQFWKGRPSE-NRVNWQVVPGGMRSSSSQAGVIITPETALALSAVRACVTLLAESVAQLPCVLYRRTENGGREIAFD 79 (419) T ss_pred CcchhhhccCCcc-ccccccccccccccccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCceecccc Confidence 1223333333110 00000 000000000001111222333556666666666666664 1 1111 1 11 Q ss_pred HHHHHHh-c--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEEEEeeec Q lcl|NC_021326. 83 KRIDEVL-G--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN 155 (445) Q Consensus 83 ~~l~~~~-~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~ 155 (445) ..+...+ . | ....+...+..+.+.+|.+|+.+..+..|++ .+..++|..+.+..+.. +. .+|.... T Consensus 80 ~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v~v~~~~~--g~-----~~y~~~~ 152 (419) T protein:vir:57 80 HPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPHKVIVLKGPD--GM-----PYYDIPS 152 (419) T ss_pred chHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcceEEEECCC--ce-----EEEEEcC Confidence 1122222 1 2 3455667788899999999999999988985 57788888887654432 11 1122211 Q ss_pred ceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCcCccHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 156 ETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLEISDIFMYKTLIDAYNRRLSD 231 (445) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g~s~~~~v~~lid~~~~~~s~ 231 (445) ... .+. ... |++++ +...|.|.+..+...++........ T Consensus 153 ~~~--~~~---------------------------------~~~--vih~r~~~~d~~~G~s~i~~~~~~i~~~~~~~~~ 195 (419) T protein:vir:57 153 IGE--ILP---------------------------------MRM--VHHIKSFSLDGYIGTSPIQTNPDVLGLGIAVEQH 195 (419) T ss_pred Cce--EEc---------------------------------hhh--EEEecCcCCCCcccccHHHHHHHHHHHHHHHHHH Confidence 110 000 000 12222 1235788887777777765555444 Q ss_pred HHHHHHHhcCCeeEEecC---Cc----ccchhHHHhhh--------hCceeeccCCCceeeEeccCChHHHHHHHHHHHH Q lcl|NC_021326. 232 LSNTFKDSNELTYVLTNY---DD----QELPEFKRLLR--------YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQ 296 (445) Q Consensus 232 ~~~~~~~~~~~~l~~~g~---~~----~~~~~~~~~~~--------~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~ 296 (445) ....+...+.|-.++.-. .. +..+.....+. .++++.++++.+++-+........+.+..+..++ T Consensus 196 ~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~ 275 (419) T protein:vir:57 196 AAQVFARGTTMSGVIERPFEAKAIASQAAVDAILAKWTERYGGVRNAFSVGMLQEGMTYKQLSQDNEKAQLLQSRQYTVN 275 (419) T ss_pred HHHHHHccCCccEEEEecCcCCcccCHHHHHHHHHHHHHHhccccccccceecCCCceEEEcCCChhhHHHHHHHHHHHH Confidence 555566667776555421 11 11112222111 2345566666555555444444556677777778 Q ss_pred HHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----CCC--cceEEEEeCCC Q lcl|NC_021326. 297 KIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI----KGE--HKDVDISFNYN 370 (445) Q Consensus 297 ~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~----~~~--~~~i~v~f~~~ 370 (445) .|+..-++|....+.... .+...++... ...+...|.-.+..+...+.. ..+ ...+++.+... T Consensus 276 ~Ia~~fgVPp~~lg~~~~-~t~sn~e~~~----------~~f~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~i~fd~~~l 344 (419) T protein:vir:57 276 EVCRLYKVPPHMIQDLQK-STNNNIEHQG----------LQYVIYTMLAILKRHESAMMRDLLLPSERRDFYIEFNVSSL 344 (419) T ss_pred HHHHHhCCCHHHhCCCCC-CccccHHHHH----------HHHHHHHHHHHHHHHHHHHHhhccCccccCCeEEEEechhh Confidence 888888888644432211 1111121111 112222333333333221111 111 22344444556 Q ss_pred CCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCC--CCCCCCCCcCCC Q lcl|NC_021326. 371 KVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGA--QQKERSNDKQSE 445 (445) Q Consensus 371 ~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~--~~~~~~~d~~~~ 445 (445) +..|..+.++.+.++ .|+++.-.++++++.-.- +.-+.+- .. ..........+. ..+++.++.++- T Consensus 345 l~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~--~ggD~~~------~~-~n~~~~~~~~~~~~~~~~~~~~~~~~ 414 (419) T protein:vir:57 345 LRGDQKSRYESYALGRQWGWLSVNDIRRMENLTPI--PGGDKYL------TP-LNMVDSKALTGIGKATPQQLKDIEAI 414 (419) T ss_pred hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCeee------ec-cccccccccccccCCCcccCcchhhh Confidence 677899999988876 589999999998855211 1000000 00 000011111111 112222222222 No 201 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=96.93 E-value=0.00025 Score=40.33 Aligned_cols=399 Identities=15% Similarity=0.082 Sum_probs=156.7 Q ss_pred HHHHHHH--HHHHHHHHH------HHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeecc Q lcl|NC_021326. 5 YIKQHLE--KLPEISIGQ------EYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH 76 (445) Q Consensus 5 ~i~~~~~--~~~~~~~~~------~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~ 76 (445) |-+-|.. .+.++.... ..+.+...-...|....... ++.--.......+|+..+..+.+-|+.+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~L------a~~~~~n~~v~scI~~ia~~ia~~~~~i~~ 74 (540) T protein:vir:41 1 MFNYHLSIKSLEKYRAIKGDTDSQALKEDRFEEYVEPKVHPLVL------LSLLQVNPYHASACSIKANDILRTGYLIDG 74 (540) T ss_pred CCCcccChhhccchhhhhccccccccccCCCCccccCCCCHHHH------HHHHHhcHHHHHHHHHHHHHHhcCCceEec Confidence 1111110 001111111 11111100000000000000 000012345677889999999899988877 Q ss_pred CchHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEEEEeeec Q lcl|NC_021326. 77 TDDEVIKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN 155 (445) Q Consensus 77 ~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~ 155 (445) .+.....++-..+ .+.......+..+.+.+|.+|+.+..+..|++ .+..++|..+.+..+.. +++...+ T Consensus 75 ~~~~~~~~lpN~~-~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~~~~~---------~~~~~~d 144 (540) T protein:vir:41 75 DDGGVEELLRACR-PSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVHRDGS---------RYMQTWD 144 (540) T ss_pred CccchhhhccCCC-CCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCcceEEeEcCc---------eeEeeec Confidence 7665544331111 13456677788899999999999999888876 47888888876654432 1111111 Q ss_pred ceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHHHHHHHHHH Q lcl|NC_021326. 156 ETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLS 230 (445) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~lid~~~~~~s 230 (445) .....++..+...... .... .. ....+..=-|+|+++ ...|.|.+......+.....+.. T Consensus 145 ~~~~~~~~~~~~~~~~-------~~~~--g~------~~~~~~~~eViHir~~~~~~~~~G~Spi~~~~~~i~~~~~~~~ 209 (540) T protein:vir:41 145 GIHVTYFKDYRYEGEV-------NPDN--GE------DQDGVGANEIIFIHLPSPICSYYGVPRYLSAAPSILAMQKIDE 209 (540) T ss_pred Cceeeeeeccccccee-------eccc--cc------cceeecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHH Confidence 1111111100000000 0000 00 000111112455543 23578877765555555444444 Q ss_pred HHHHHHHHhcCCeeEE--ecCCcccch-----------hHHHhh---------hhCceeecc----CCCceeeEeccC-- Q lcl|NC_021326. 231 DLSNTFKDSNELTYVL--TNYDDQELP-----------EFKRLL---------RYYGAIKVS----DNGGVDTIQVEV-- 282 (445) Q Consensus 231 ~~~~~~~~~~~~~l~~--~g~~~~~~~-----------~~~~~~---------~~~~~~~~~----~~~~~~~l~~~~-- 282 (445) -..+-+...+.|-.++ .|...+... .+...+ ...+++.++ .+++++|..... T Consensus 210 ~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~g~~~~pl~~~~ 289 (540) T protein:vir:41 210 YNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTVEVTFTPLNTSQ 289 (540) T ss_pred HHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCcccceeEEecccch Confidence 4444555666675554 343221110 011111 012233332 134456554443 Q ss_pred ChHHHHHHHHHHHHHHHHHhCccccccccc---cCc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC Q lcl|NC_021326. 283 PVENSKKYLDELYQKIMLFGQAVDFSSDKF---GSA-PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG 358 (445) Q Consensus 283 ~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~---~~~-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~ 358 (445) ....+.+..+...+.|...-++|....+-. +.+ .+.......+. ...+.-....+...+.+. +... T Consensus 290 ~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~f~--~~tL~P~~~~ie~~ln~~-------L~~~- 359 (540) T protein:vir:41 290 KELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRTYY--ESVVRPQQEIVSSVLTDF-------IQLK- 359 (540) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHHHH--HHHHHHHHHHHHHHHHHh-------hhhc- Confidence 344566667777778888888886544311 111 11222211111 111112222222222221 1111 Q ss_pred CcceEEEEeCCCCCC--CHHHHHHHHHHHhccCChHHHHHhCCCCC---CHH-----HHHHHHHHHHHHHHHhhhc-ccc Q lcl|NC_021326. 359 EHKDVDISFNYNKVA--NTELQVQTAQQSMGIVSHETVLENHPFVE---DLQ-----AELERIEQEQMEYNKQLPN-LDD 427 (445) Q Consensus 359 ~~~~i~v~f~~~~p~--d~~~~~~~~~~~~g~~s~et~l~~l~~~~---d~~-----~E~~ri~~E~~~~~~~~~~-~~~ 427 (445) ....+.+.|+..... |.++..+. ....|+++.-.+++.|+..+ |+- --+..++.......+..++ ... T Consensus 360 ~~~~~~i~f~~~~ll~~D~~~~~~~-lv~~G~lT~NE~Re~L~g~e~gdd~~l~p~n~~~~~~~~~~~~~~~~~~~~~~k 438 (540) T protein:vir:41 360 LDPGARFVFNEEILMESEFVHNYAL-LVQCGVLTPSEVREKLFGLDGGPDMFMVPSSIGKSAMKRQKRNYEKNQINEIKR 438 (540) T ss_pred cCCceEEEecchhhcchHHHHHHHH-HHhCCCCCHHHHHHHhCcCcCCCcccccccccccccccccccccCCCCcccccc Confidence 113456667543322 32222222 12358999888887553332 210 0001111111000000000 000 Q ss_pred --CCCCCCCCC--------CCCCCc--CCC Q lcl|NC_021326. 428 --GGADGAQQK--------ERSNDK--QSE 445 (445) Q Consensus 428 --~~~~~~~~~--------~~~~d~--~~~ 445 (445) ...++..+. ++.+++ +++ T Consensus 439 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 468 (540) T protein:vir:41 439 TYAKYKPRIQEIISSESPLEDKKKKIDEVL 468 (540) T ss_pred ccchhcccccCccccccccccccccccccc Confidence 000000000 000000 000 No 202 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=96.74 E-value=0.00037 Score=39.38 Aligned_cols=292 Identities=10% Similarity=-0.047 Sum_probs=129.9 Q ss_pred EEEEEEECCCCcEE---EEEEccceeE-EEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccc Q lcl|NC_021326. 110 EWLHPYLDEEGEFK---LFRVPAEQGI-PIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNL 185 (445) Q Consensus 110 ~~~~v~~d~~g~~~---i~~~~p~~~~-~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 185 (445) +++++|.-.+|... +.+.+|..+. -.+++ .+.+.. ++......... T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~~~~~~~f~~~~--~~~l~~-~~~~~~~g~~~--------------------------- 50 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRPPRTISRFDVAP--DGGLVA-IEQWGVFGKAT--------------------------- 50 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecCccceeeeeecc--CCceeE-EEecCCCCCCc--------------------------- Confidence 88888876666543 3334443221 01121 111111 10000000000 Q ss_pred cccccccccccccccceEEe--cCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccc--------- Q lcl|NC_021326. 186 ENSKTHFSTGSWGKIPFIPF--KNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQEL--------- 254 (445) Q Consensus 186 ~~~~~~~~~~~~g~iPvv~~--~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~--------- 254 (445) ....+.+.|-.++- ..++.|.|.+..+.-..---+..+.+++.-++.+..|+.+.+|...... T Consensus 51 ------~~lp~~kfi~~~~~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~ 124 (355) T protein:vir:78 51 ------VRIPVDRLVVFVNEREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAE 124 (355) T ss_pred ------ceeccCCEEEEEeCCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHH Confidence 00011111111111 2356788888876555555566778888889999888887776432110 Q ss_pred ---hhH-------HHhhh--hCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccc---cCcchHH Q lcl|NC_021326. 255 ---PEF-------KRLLR--YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKF---GSAPSGV 319 (445) Q Consensus 255 ---~~~-------~~~~~--~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~---~~~~Sg~ 319 (445) .+. ...+. ...+..++.+.++++++...+...+..+++.+.+.|...--..-++.+.. |+...|. T Consensus 125 ~~~~~~~~~l~~~~~~i~~g~~a~~iip~g~~ie~~ea~g~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~Alg~ 204 (355) T protein:vir:78 125 QWLNDQKEEGLQLAKEFRAGEAAGGYIPHGANFTLTGVQGKLPEMDGPIRYHDEQIARAVLAHFLTLGGDKSTGSYALGD 204 (355) T ss_pred HHHHHHHHHHHHHHHHhhCCcceeEeecCCceEEEeecCCCcccHHHHHHHHHHHHHHHHhhhhhccccCCccchhhHHH Confidence 000 11111 12355678999999998777666788888888888766542222222111 1111122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHHh--cc-CChH--- Q lcl|NC_021326. 320 ALEFLYTNLNLKADKLARKAKVAIQ-ELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSM--GI-VSHE--- 392 (445) Q Consensus 320 Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~~--g~-~s~e--- 392 (445) . ...-....+..-.+.+...+. ++++-++.+... ....-..+.|.. .+.+....++.+.++. |+ ++.+ T Consensus 205 v---h~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~-~~~~~P~~~~~~-~~~~~~~~a~~~~~l~~~G~~~~~~~~~ 279 (355) T protein:vir:78 205 T---FASFFTGSLNAVMKHIADVTQQHVVEDLVDQNWG-PEEPAPRLVPAQ-LGKEQPVTAEAIRALVECGAFTADPELE 279 (355) T ss_pred H---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CCCCCCEEEecC-cChhHHHHHHHHHHHHhCCCccccHHHH Confidence 2 222233344445566666674 577777665422 222234566754 4455566778887764 65 5533 Q ss_pred -HHHHhCCCCCCHHH---HHHHHHHHHHHHHHhhhcc--ccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 393 -TVLENHPFVEDLQA---ELERIEQEQMEYNKQLPNL--DDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 393 -t~l~~l~~~~d~~~---E~~ri~~E~~~~~~~~~~~--~~~~~~~~~~~~~~~d~~~~ 445 (445) .+.+.++. +.+.+ ++. -.++........... ....+.+..... +..+.++ T Consensus 280 ~~~~e~~gi-p~p~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~a~~~-~a~~~~~ 335 (355) T protein:vir:78 280 KDLRARYGL-PAPAERDDGAD-AAAAKAAGRRRAKRLPGQRQGAALPSRSP-RADPPRR 335 (355) T ss_pred HHHHHHhCC-CCCCCCCcccC-CccccccccccccccCCccccccccccCC-CCCChhh Confidence 34566643 32211 111 000100000000000 000111111111 1111122 No 203 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=96.69 E-value=0.00041 Score=39.16 Aligned_cols=382 Identities=12% Similarity=0.041 Sum_probs=152.9 Q ss_pred ChHHHHHHHHHHHH-HHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeecc--C Q lcl|NC_021326. 1 MIVRYIKQHLEKLP-EISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH--T 77 (445) Q Consensus 1 ~l~~~i~~~~~~~~-~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~--~ 77 (445) +|.+++++...... ....+.+++ |.. +.. .+..+.... ...-...+..-.+|+..++-+.+-|+.+-- . T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~-g~~--~~~---~~~~~~~~~--~~~a~~~~~v~~~v~~ia~~iA~lp~~v~~~~~ 75 (460) T protein:vir:10 4 RIIRALRELTGLDNKFNDAFIKYI-GQT--FTK---YDNNGKTYL--EQGYNINPDVYSCISQMAAKTVAVPYTIKVVKD 75 (460) T ss_pred hHHHHHhhhhccCCCchHHHHHhh-ccc--cCC---Cccchhhhh--HHHHhcchHHHHHHHHHHHhhhhCceEEEeccC Confidence 66666655432221 123333333 211 000 000000000 000112344455677777766666665421 1 Q ss_pred chH-------------------------------HHHHHHHHhc--c---CHHHHHHHHHHHHHhcCeEEEEEEECCC-- Q lcl|NC_021326. 78 DDE-------------------------------VIKRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEE-- 119 (445) Q Consensus 78 d~~-------------------------------~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~-- 119 (445) +.. ....+..++. | ....+...+..+.+.+|.+|+++..+.. T Consensus 76 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~~~ 155 (460) T protein:vir:10 76 TKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSPDDGI 155 (460) T ss_pred CccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCc Confidence 100 0011111221 2 2345566677789999999999887543 Q ss_pred --CcEE-EEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeeccccccccccccccccc Q lcl|NC_021326. 120 --GEFK-LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGS 196 (445) Q Consensus 120 --g~~~-i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 196 (445) |.+. +..++|..+.+..+++. .... + ...+..+.......... .+ T Consensus 156 ~~G~~~~L~~l~~~~v~v~~~~~~--~~~~----~------------~~~~~~~~~~~~g~~~~--------------~~ 203 (460) T protein:vir:10 156 NAGVPSQMYVLPAHLIKIVLKDDI--NLLS----T------------DSPIKSYMLIQGDQFIE--------------FN 203 (460) T ss_pred cCceeEEEEEEcCceEEEEEcCCC--ceee----e------------eeeeeEEEEecCceeEE--------------ec Confidence 5554 77888888877655432 1111 0 00011111000000000 00 Q ss_pred ccccceEEec----------CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecC---CcccchhHHHhhh- Q lcl|NC_021326. 197 WGKIPFIPFK----------NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNY---DDQELPEFKRLLR- 262 (445) Q Consensus 197 ~g~iPvv~~~----------n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~---~~~~~~~~~~~~~- 262 (445) .. -|+|++ ....|.|.+..+...+.............+...+.|-.++... +.+.....+..+. T Consensus 204 ~~--evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~ 281 (460) T protein:vir:10 204 ED--EVIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFGFIHGGSTGLTQPQADSLKQRLTE 281 (460) T ss_pred cc--ceEEEecCCCCcccccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceeeecCCCCCHHHHHHHHHHHHH Confidence 00 133332 1235677777766666665555544555556666665554322 1222222222121 Q ss_pred -------hCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCcccccccccc-CcchHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 263 -------YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADK 334 (445) Q Consensus 263 -------~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~ 334 (445) .++++.++++.+.+.+........+.+..+...+.|+..-++|....+... +..++..++.....+ T Consensus 282 ~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f------ 355 (460) T protein:vir:10 282 MDKSPDRLSQIAGASGEIAFTKISLNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGGGLNTGNLEEERKRV------ 355 (460) T ss_pred HhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCCccccHHHHHHHH------ Confidence 223555666655555544444455666677777888888888865443221 222222232221111 Q ss_pred HHHHHHHHHHHHHHHHHHHhc-----cCCCcceEEEEeCCCCCCCHHHHHHHHHH--HhccCChHHHHHhCCCC--CCHH Q lcl|NC_021326. 335 LARKAKVAIQELLWFVFEHFD-----IKGEHKDVDISFNYNKVANTELQVQTAQQ--SMGIVSHETVLENHPFV--EDLQ 405 (445) Q Consensus 335 ~~~~~~~~l~~~~~~~~~~~~-----~~~~~~~i~v~f~~~~p~d~~~~~~~~~~--~~g~~s~et~l~~l~~~--~d~~ 405 (445) +...|.-++..+...+. .........+.|.-...........+... ..|+++.-.++++++.- ++. T Consensus 356 ----~~~~l~P~~~~ie~~ln~kl~~~~~~~~~~~i~~d~~~l~~l~~d~~~~~~~~~~g~~T~NE~R~~~g~~pi~~~- 430 (460) T protein:vir:10 356 ----VTDNIQPDLVILKQAFDKKFIKRFKGYENAVIEWDISELPEMQTDMVAMASWLNTIPVTPNEIRIAMKYETLNQD- 430 (460) T ss_pred ----HHHHHHHHHHHHHHHHHHhhcCcccccCCceEEeecchhhhHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC- Confidence 11222222222221111 11111223344422111111122222222 25899999998887542 211 Q ss_pred HHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCc Q lcl|NC_021326. 406 AELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDK 442 (445) Q Consensus 406 ~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 442 (445) -.++.- ........+...+...+..+.+++ T Consensus 431 -~gD~~~------~~~n~~~~~~~~~~~~~~~~nq~~ 460 (460) T protein:vir:10 431 -GMDIVF------MPSNKVRIDDVSNNLIDSAFNQNQ 460 (460) T ss_pred -CCCeee------ecccccchhhcccccCCCcccCCC Confidence 000000 000000000001111111111111 No 204 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=96.64 E-value=0.00045 Score=38.96 Aligned_cols=373 Identities=9% Similarity=0.004 Sum_probs=164.4 Q ss_pred ChHHHHHHHHH-HHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeecc-Cc Q lcl|NC_021326. 1 MIVRYIKQHLE-KLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH-TD 78 (445) Q Consensus 1 ~l~~~i~~~~~-~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~-~d 78 (445) ++.++-++... .......+...+-+.... . .+... -.+.-+.++-...+|+..++-+.+-|+.+-- ++ T Consensus 2 ~f~~~f~r~~~~~~~~~~~~~~~~~~~~~~-----~---~g~~v--~~~~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~~ 71 (413) T protein:vir:48 2 FFSGLFQRKSDAPVTTPAELAEAIGLSYDT-----Y---TGKRI--SSQRAMRLTAVYSCVRVLAESVGMLPCSLYKISG 71 (413) T ss_pred ccchhhccCccCCccchHHHHHhhhcCccc-----c---cCcee--chhhhhccHHHHHHHHHHHHhhhhCceEEEEecC Confidence 44444433221 111222233333221100 0 00000 0000012233344666666666666665421 11 Q ss_pred h----HHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEE Q lcl|NC_021326. 79 D----EVIKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAF 147 (445) Q Consensus 79 ~----~~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~ 147 (445) + .....+..++. | ........+..+.+.+|.+|+++..+ .|++ .+..++|..+.+..+.. +.+.+. T Consensus 72 ~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~--~~~~y~ 148 (413) T protein:vir:48 72 TLKTRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA-LGEVVELLPIDPGCVEPKLNSQ--WQPVYQ 148 (413) T ss_pred CcceeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC-CCcEEEEEEEcCceEEEEEcCC--ceEEEE Confidence 1 11111222221 2 34566777888999999999998775 5765 46778888887776642 222211 Q ss_pred EEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCcCccHHHHHHHHH Q lcl|NC_021326. 148 IRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLEISDIFMYKTLID 223 (445) Q Consensus 148 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g~s~~~~v~~lid 223 (445) + ... ......+.... |++++ +...|.|.+..+...++ T Consensus 149 ~---~~~-~g~~~~~~~~e-----------------------------------vih~~~~~~d~~~G~s~i~~~~~~i~ 189 (413) T protein:vir:48 149 V---TFP-DGSVDVLTQDE-----------------------------------IWHVRTLTLDGLVGLNPIAYAREAIS 189 (413) T ss_pred E---Eec-CceEEEEcccc-----------------------------------EEEecCcCCCCcccccHHHHHHHHHH Confidence 0 000 00000000000 12221 23457888877777777 Q ss_pred HHHHHHHHHHHHHHHhcCCeeEEecCC---cccchhHHHhhh--------hCceeeccCCCceeeEeccCChHHHHHHHH Q lcl|NC_021326. 224 AYNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLLR--------YYGAIKVSDNGGVDTIQVEVPVENSKKYLD 292 (445) Q Consensus 224 ~~~~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~~--------~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~ 292 (445) ............+...+.|-.++.... .+..+.+...+. .++++.++++.+++.+........+.+..+ T Consensus 190 ~~~~~~~~~~~~~~ng~~p~gil~~~~~~~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~ 269 (413) T protein:vir:48 190 LAAATEEHGARLFGNGAVTSGVLRTEQKLTPDAYERLKKDFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRK 269 (413) T ss_pred HHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEeccCChhHHHHHHHHH Confidence 666555555555666677766655422 222222222221 123455555555555544334445566677 Q ss_pred HHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----cCCC--cceEEEE Q lcl|NC_021326. 293 ELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD----IKGE--HKDVDIS 366 (445) Q Consensus 293 ~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~----~~~~--~~~i~v~ 366 (445) .....|+..-++|....+..+. .+...++... ...+...|.-++..+...+. ...+ ...+++. T Consensus 270 ~~~~~Ia~~fgVPp~~lg~~~~-~t~~n~e~~~----------~~f~~~~i~P~~~~ie~~l~~~L~~~~~~~~~~~~fd 338 (413) T protein:vir:48 270 FQLEEICRLFRVPLHMVQNTDR-ATFNNIEELG----------LGFINYSLVPYLTRIEQRINTGLVRESKQGKFYAKFN 338 (413) T ss_pred HHHHHHHHHhCCCHHHhCCCcC-CCcccHHHHH----------HHHHHHHHHHHHHHHHHHHHhhccCccccCCeEEEEe Confidence 7778888888888644432211 1111111111 11111222222222221111 1111 2234444 Q ss_pred eCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHh-hhccccCCCCCCCCCCCCCCcC Q lcl|NC_021326. 367 FNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQ-LPNLDDGGADGAQQKERSNDKQ 443 (445) Q Consensus 367 f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~-~~~~~~~~~~~~~~~~~~~d~~ 443 (445) +...+-.|..+.++++.++ .|+++.-.++++++.-. -+..+..- ... ....... ++...++.+++|++ T Consensus 339 ~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p--~~ggD~~~------~~~n~~~~~~~-~~~~~~~~~~~~~~ 409 (413) T protein:vir:48 339 AGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNP--RPGGDVYL------TPMNMTTSPSA-GDDNGKKKESGDAD 409 (413) T ss_pred chhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC--CCCcceee------ccccccccccc-cccCCCCCCCCCcc Confidence 5566667889999998887 48999989888875421 11000000 000 0111111 11111111111111 Q ss_pred CC Q lcl|NC_021326. 444 SE 445 (445) Q Consensus 444 ~~ 445 (445) .. T Consensus 410 ~~ 411 (413) T protein:vir:48 410 KT 411 (413) T ss_pred cc Confidence 11 No 205 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=96.62 E-value=0.00047 Score=38.85 Aligned_cols=355 Identities=7% Similarity=-0.025 Sum_probs=143.6 Q ss_pred ChHHHHHHHHH-HHHHHHH---HHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeecc Q lcl|NC_021326. 1 MIVRYIKQHLE-KLPEISI---GQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH 76 (445) Q Consensus 1 ~l~~~i~~~~~-~~~~~~~---~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~ 76 (445) +..+......+ +..-+.. ...++.|.-. ....-...-+...-...+|+..++-+..-|+++. T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~v~- 68 (385) T protein:vir:10 3 LLTPRNFNKRKAKNMVYPSNPAFFTTTVGGMQ-------------LSYVSALSALQNTNVYSVINRIASDVASAHFKTE- 68 (385) T ss_pred cccchhcccccccccccccchhhhhhhccccC-------------ccccCHHHhhccHHHHHHHHHHHHHHhhCceeee- Confidence 33221111000 0000000 0011100000 0000000001122234466666666666676653 Q ss_pred CchHHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeec Q lcl|NC_021326. 77 TDDEVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLEN 155 (445) Q Consensus 77 ~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~ 155 (445) +......+..=.. .........+..+.+.+|.+|+.+..+. ..+..++|..+.+..+. ..+.+ ++.... T Consensus 69 -~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~~---~~~~p~~~~~v~~~~~~---~~~~~---~~~~~~ 138 (385) T protein:vir:10 69 -NTATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN---LEHIPNSDVQINYLPGN---MGIVY---TVLESN 138 (385) T ss_pred -ccchhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc---eeEeecCCceEEEEEcC---CceEE---EEEEcC Confidence 2222222221100 1234556667788889999999886542 12223333333322221 11100 000000 Q ss_pred ceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-------CCCcCccHHHHHHHHHHHHHH Q lcl|NC_021326. 156 ETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-------NDLEISDIFMYKTLIDAYNRR 228 (445) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-------~~~g~s~~~~v~~lid~~~~~ 228 (445) ......+ +.. -|++++. ...|.|.+......++..... T Consensus 139 ~~~~~~~---------------------------------~~~--eiihik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~ 183 (385) T protein:vir:10 139 DRPQMVL---------------------------------RQD--QMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKA 183 (385) T ss_pred CceEEEE---------------------------------ccc--cEEEeccCCCCcccccccccHHHHHHHHHHHHHHH Confidence 0000000 000 1333321 224778887777777666655 Q ss_pred HHHHHHHHHHhcCCeeEEe--cCCc-c-cchhHHHhhh-------hCceeeccCCCceeeEeccCChHH-HHHHHHHHHH Q lcl|NC_021326. 229 LSDLSNTFKDSNELTYVLT--NYDD-Q-ELPEFKRLLR-------YYGAIKVSDNGGVDTIQVEVPVEN-SKKYLDELYQ 296 (445) Q Consensus 229 ~s~~~~~~~~~~~~~l~~~--g~~~-~-~~~~~~~~~~-------~~~~~~~~~~~~~~~l~~~~~~~~-~~~~i~~l~~ 296 (445) ..-..+.+...+.|-.++. |... + ........+. ..+++.++++.+++.+........ +.+..+.... T Consensus 184 ~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~ 263 (385) T protein:vir:10 184 SKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSAD 263 (385) T ss_pred HHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHH Confidence 5555565666677766554 3221 1 1111222111 223455565555544443322223 2355666777 Q ss_pred HHHHHhCcccccccc-ccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCH Q lcl|NC_021326. 297 KIMLFGQAVDFSSDK-FGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANT 375 (445) Q Consensus 297 ~i~~~s~~p~~~~~~-~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~ 375 (445) .|+..-++|....+. ..++.++..++.... ... ..|.-.++.+...+..+--...+++.+.+.+..|. T Consensus 264 ~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~-~~~----------~~l~P~~~~ie~~l~~~l~~~~~~f~~~~ll~~d~ 332 (385) T protein:vir:10 264 QISKAFGVPSDILGGGTSTESQHSNIDQIKA-TYL----------ANLNSYVNPIVDELRLKMNAPDLELDIKDMLDVDD 332 (385) T ss_pred HHHHHhCCCHHHcCCccCCCcccccHHHHHH-HHH----------HHHHHHHHHHHHHHHHhhCCceEEeechhhhccCH Confidence 888888888654332 112222222221111 111 11111222111111111111246666677778899 Q ss_pred HHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 376 ELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 376 ~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) .+.++++.++ .|+++.-.+++.++.-.=+...+.+.. ....+ -.+++++| T Consensus 333 ~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~~~~~~~~------------------~~~~~--~~~g~~~d 384 (385) T protein:vir:10 333 SALINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPEFK------------------PLTTQ--VKGGDEGD 384 (385) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCccCCCCCcccc------------------Ccccc--cCCCCCCC Confidence 9999999887 489998888877643110001110000 00000 00111111 No 206 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=96.60 E-value=0.00048 Score=38.79 Aligned_cols=378 Identities=10% Similarity=-0.007 Sum_probs=157.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc-ccccc-cccccccc------cccccchHHHHHHHHHhhhhccCe Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKP-VDATG-AVDPLKPD------DRMITNFHANLVDQKVSYIVGKPI 72 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~-~~~~~-~~~~~~~~------~ri~~n~~~~iv~~~~~~l~g~~~ 72 (445) ||.=+-..+.---.-+..+..+|+.+.- ...... ..... ......+. +-+.+.=...+|+..++-+.+-|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~lf~~~~~-~~~~~~~~~~~~~~~~~~~~~~~vs~~~al~~~~v~~cv~~Ia~~iA~lp~ 79 (424) T protein:vir:45 1 MLYCWWAHWLWPEGGRVLLDALFRSKSL-ENPSTPITGDAVDTDGLFRADVYVSPETAMKLAAVYSCIYVLSSSLAQMPL 79 (424) T ss_pred CeeEeeeceecCcchhHHHHhhccccCC-CCCccccchhhhhhhccccCCceechHHhhccHHHHHHHHHHHHHHhhCce Confidence 1110000000000001111122222110 000000 00000 00000000 001111223356666666666676 Q ss_pred eecc-Cch---HH-HHHHHHHh-c--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCC Q lcl|NC_021326. 73 AFKH-TDD---EV-IKRIDEVL-G--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKE 140 (445) Q Consensus 73 ~~~~-~d~---~~-~~~l~~~~-~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~ 140 (445) .+-- ++. .+ ...+-..+ . | .-......+..+.+.+|.+|+.+-.+..|++ .+..++|..+.+..++ T Consensus 80 ~v~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~i~~~~-- 157 (424) T protein:vir:45 80 HVMRRHKGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDCCMPWETTLMNTG-- 157 (424) T ss_pred EEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEEcC-- Confidence 5411 111 11 01111222 1 2 2345566678889999999999999888986 4777888877654332 Q ss_pred CCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCcCccHH Q lcl|NC_021326. 141 HEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLEISDIF 216 (445) Q Consensus 141 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g~s~~~ 216 (445) +++.+. + ... ..... . .+ . -|++++ +...|.|.+. T Consensus 158 -~~~~y~--~---~~~-----------------~~~~~---------~----~~---~--eVih~r~~~~d~~~G~spi~ 196 (424) T protein:vir:45 158 -GRYTYG--L---YNE-----------------YGAFA---------I----SP---D--DMIHIRALGNNQKMGLSPIM 196 (424) T ss_pred -CeEEEE--E---Eec-----------------CceEE---------E----Cc---c--cEEEecCcCCCCcccccHHH Confidence 111110 0 000 00000 0 00 0 123332 2335777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecC---CcccchhHHHhh----h-----hCceeeccCCCceeeEeccCCh Q lcl|NC_021326. 217 MYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNY---DDQELPEFKRLL----R-----YYGAIKVSDNGGVDTIQVEVPV 284 (445) Q Consensus 217 ~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~---~~~~~~~~~~~~----~-----~~~~~~~~~~~~~~~l~~~~~~ 284 (445) .....++.......-..+.+...+.|-.++.-. +.+.....+..+ . .++++.++++.+.+.+..+... T Consensus 197 ~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~~n~g~~~vl~~g~~~~~l~~~~~d 276 (424) T protein:vir:45 197 QHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSGLNKESWGWLKDQWQKASQALRRQENKTMLLPADLDYKALTVSPVD 276 (424) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhccccccCCceeEcCCCceEEEccCChhH Confidence 666666554444444445556667776665532 222222222211 1 1235556666555555443333 Q ss_pred HHHHHHHHHHHHHHHHHhCcccccccccc-CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-----CC Q lcl|NC_021326. 285 ENSKKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI-----KG 358 (445) Q Consensus 285 ~~~~~~i~~l~~~i~~~s~~p~~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~-----~~ 358 (445) ..+.+..+.....|...-++|....+... ++-|+ ++.. ....+...|.-.++.+...+.. .. T Consensus 277 ~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn--~eq~----------~~~f~~~tL~P~~~~ie~~ln~kLl~~~e 344 (424) T protein:vir:45 277 AQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSN--ISAQ----------AIQFVRYTMMPWVTNWEQELNRRLFTRAE 344 (424) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHHH----------HHHHHHHHHHHHHHHHHHHHHHhcCChhh Confidence 44556667777788888888865443222 11121 1111 1112222333333333222211 11 Q ss_pred --CcceEEEEeCCCCCCCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCC Q lcl|NC_021326. 359 --EHKDVDISFNYNKVANTELQVQTAQQSM--GIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQ 434 (445) Q Consensus 359 --~~~~i~v~f~~~~p~d~~~~~~~~~~~~--g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 434 (445) ....+++.+...+..|..+.++.+.++. |+++.-.++++++.-. -+.-+.. .... +... ...... T Consensus 345 ~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~p--i~ggD~~------~~~~--n~~~-~~~~~~ 413 (424) T protein:vir:45 345 LAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNP--VEGLDEM------LVSV--NAAN-PAGDFK 413 (424) T ss_pred hcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC--CCCccee------eecc--cccc-cccccC Confidence 1123444455666678899999988874 8999999998875421 0000000 0000 1110 111222 Q ss_pred CCCCCCCcCCC Q lcl|NC_021326. 435 QKERSNDKQSE 445 (445) Q Consensus 435 ~~~~~~d~~~~ 445 (445) ++..++++++| T Consensus 414 ~~~~~~~~~~~ 424 (424) T protein:vir:45 414 PPKNDEGKTNE 424 (424) T ss_pred CCCCCCCCCCC Confidence 22333333333 No 207 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=96.60 E-value=0.00048 Score=38.78 Aligned_cols=351 Identities=12% Similarity=0.029 Sum_probs=146.4 Q ss_pred HHHhcC----CCcccccc----ccccccccccccccc------cccccchHHHHHHHHHhhhhccCeeeccCchHHHHHH Q lcl|NC_021326. 20 QEYYEQ----RPDIVKEP----KPVDATGAVDPLKPD------DRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVIKRI 85 (445) Q Consensus 20 ~~yy~G----~~~i~~~~----~~~~~~~~~~~~~~~------~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~~~~~l 85 (445) ..+|.. .+...... ........ ...... .-+.++-...+|+..++-+.+-|+.+. +......+ T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~p~~~~--~~~~~~l~ 77 (386) T protein:vir:48 1 MPIFNITNLATESPPISQGGFFDITDPDFL-STLNGSEWVSAESALRNSDLFSIINQLSNDLATVKLTAS--RKQLQGII 77 (386) T ss_pred Ccccccccccccccccccccccccccchhc-ccccCCceechhhhhcchHHHHHHHHHHHhhccCceeec--cchhHHHh Confidence 112211 00000000 00000000 000000 001112223455555555555565543 22222222 Q ss_pred HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEEEEe Q lcl|NC_021326. 86 DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWD 163 (445) Q Consensus 86 ~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~ 163 (445) ..-.. .........+..+.+.+|.+|+.+-.+..|++ .+..++|..+.+..+.. .+.+.+ ++ ....... T Consensus 78 ~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~-~~~~~y--~~-~~~~~~~----- 148 (386) T protein:vir:48 78 DNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDN-KDGIYY--NI-TFDDPRI----- 148 (386) T ss_pred hcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEEcCC-CceEEE--EE-EecCccc----- Confidence 22111 13455667788899999999999989888875 57788888887765532 111110 00 0000000 Q ss_pred cceEEEEEEecceeeecccccccccccccccccccccceEEecCC-----CCcCccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 164 KITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNN-----DLEISDIFMYKTLIDAYNRRLSDLSNTFKD 238 (445) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~-----~~g~s~~~~v~~lid~~~~~~s~~~~~~~~ 238 (445) ... . ..+-. -|+|+++. ..|.|.+......+.....+.......+.. T Consensus 149 --~~~-~-----------------------~~~~~--evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~n 200 (386) T protein:vir:48 149 --PPK-Q-----------------------HVPQG--DVLHFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKN 200 (386) T ss_pred --cce-e-----------------------EecCc--cEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 000 0 00000 13343321 247787877666666655555555556666 Q ss_pred hcCCeeEEecCCcccchhHH---Hhh-----hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccc Q lcl|NC_021326. 239 SNELTYVLTNYDDQELPEFK---RLL-----RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSD 310 (445) Q Consensus 239 ~~~~~l~~~g~~~~~~~~~~---~~~-----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~ 310 (445) .+.|-.++.-......+... ... ...+++.++++.+.+.+........+.+..+.....|+..-++|....+ T Consensus 201 g~~~~~ii~~~~~~~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg 280 (386) T protein:vir:48 201 ALNANGILKIKGGGLLDFKTKLSRSRQAMKQMQGGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVG 280 (386) T ss_pred cCCcceEEEeCCCCCHHHHHHHHHHHHHhhcCCCCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhC Confidence 77776666543222211111 111 1223445555555444433333445667777778888888888865443 Q ss_pred cccCcch--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHH--h Q lcl|NC_021326. 311 KFGSAPS--GVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQS--M 386 (445) Q Consensus 311 ~~~~~~S--g~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~--~ 386 (445) ..++..+ ...+.+. ...|.-++..+...+...- ...+++.+...+..+....+..+.++ . T Consensus 281 ~~~~~~~~e~~~~~~~---------------~~~l~P~~~~ie~~l~~~l-~~~~~~~~~~~~~~d~~~~~~~~~~l~~~ 344 (386) T protein:vir:48 281 GQGDQQSSLEMSLDLY---------------NKAVSRYLRPFLSELSQKL-SCDVDADILPAVDPTGSNSVSRINSMVKS 344 (386) T ss_pred CCCCcccHHHHHHHHH---------------HHHHHHHHHHHHHHHHHhh-cchhhcchhhhhccChHHHHHHHHHHHhC Confidence 2222222 1112111 1122222222221111100 01122222333334555566666665 5 Q ss_pred ccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 387 GIVSHETVLENHPF--VEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 387 g~~s~et~l~~l~~--~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) |+++.-.+++.++. +.. -|+.+. ........ ++++.+ ++| T Consensus 345 g~~t~nE~r~~lg~~~~~~--~~~~~~--------~~~~~~~~---~gGd~~------~~~ 386 (386) T protein:vir:48 345 GTLAQNQGLYILQQAEILP--KELPEG--------ENPNKTTL---KGGEIN------GED 386 (386) T ss_pred CCcCHHHHHHHhhcCCCCC--ccchhh--------cCCCCCcc---CCCCCC------CCC Confidence 89999888887632 221 111110 00001111 111111 111 No 208 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=96.58 E-value=0.00049 Score=38.72 Aligned_cols=410 Identities=10% Similarity=0.051 Sum_probs=179.4 Q ss_pred ChHHHHHHHHHHHHH-HHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C-----e Q lcl|NC_021326. 1 MIVRYIKQHLEKLPE-ISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P-----I 72 (445) Q Consensus 1 ~l~~~i~~~~~~~~~-~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~-----~ 72 (445) -|++..+.++.+... ...+.++++=-- |..- .+... .+...++..+-+...++++++-|++- | + T Consensus 15 ~l~~r~~~L~~~R~~~e~~w~e~a~~~l--P~~~--~~~~~----~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 86 (516) T protein:vir:96 15 KIPKLWEKFSNKRSSFLDRAKHYSKLTL--PYLM--NDKGD----NETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFF 86 (516) T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHhhc--cccc--CCCCC----ccccCCcccchHHHHHHHHHHHHHhhhcCCCCccc Confidence 555556655544322 233444432111 1110 00000 11122455666777788887776542 2 2 Q ss_pred eeccCch-------------HHHHHH-------H-HHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccce Q lcl|NC_021326. 73 AFKHTDD-------------EVIKRI-------D-EVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQ 131 (445) Q Consensus 73 ~~~~~d~-------------~~~~~l-------~-~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~ 131 (445) ++...+. ++.+.+ . .+..+||...+.++.++..++|.+.++ .++++. ++.++-.+ T Consensus 87 ~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~--~d~~~~--~~~~pl~~ 162 (516) T protein:vir:96 87 RVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLY--KPSKGA--ISAIPMHH 162 (516) T ss_pred ccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEE--ecCCCC--EEEEEcCe Confidence 3333221 122222 1 222468889999999999999998754 567665 44554444 Q ss_pred eEEEEcCCCCCceEEEEEEEe----------------------eecceeEEEEecceEEEEEEecceeeecccccccccc Q lcl|NC_021326. 132 GIPIWTDKEHEELEAFIRMYK----------------------LENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSK 189 (445) Q Consensus 132 ~~~v~d~~~~~~~~~~v~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 189 (445) |.+..+. .+++...++..+ .+....+++|+..... -+..+... ....+... T Consensus 163 -y~v~~d~-~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~---~~~~~~~~--~~~d~~~~ 235 (516) T protein:vir:96 163 -YVVNRDT-NGDLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYL---GDGFWELK--QSADDIPV 235 (516) T ss_pred -EEEeeCC-CCCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeee---CCceeEEE--EEeCceee Confidence 4444443 455544443221 0111223333221110 00000000 00011111 Q ss_pred cccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhC Q lcl|NC_021326. 190 THFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYY 264 (445) Q Consensus 190 ~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 264 (445) ......+|..+|++.+. .+.+|+|-.....+-+..+|...-.+.........|.+.+.-......... ..... T Consensus 236 ~~es~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l--~~~~~ 313 (516) T protein:vir:96 236 GKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHF--VNSGT 313 (516) T ss_pred ccccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccchhhh--ccCCC Confidence 11122334456766543 346899999999999999998877788888888877765521111111110 11222 Q ss_pred ceeeccCCCceeeEecc--CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 265 GAIKVSDNGGVDTIQVE--VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVA 342 (445) Q Consensus 265 ~~~~~~~~~~~~~l~~~--~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 342 (445) +.+..+..++++.+... .+.......++.++..|...-.+..+. ...+...||+.+.. +..++...++.. T Consensus 314 g~i~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~-~r~~~rvTAtEV~~-------r~~E~~~~LGpv 385 (516) T protein:vir:96 314 GEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMT-RRDAERVTAVEIQR-------DALEIEQNMGGV 385 (516) T ss_pred ceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhc-cCCCccccHHHHHH-------HHHHHHHHhhhH Confidence 34444444556665433 356777788888887776643321111 12233467777654 344455555555 Q ss_pred HHHH--------HHHHHHHhccCCCcceEEEEeCCCCCCCHHHHH---HHHH-------HHhcc-------CChHHHHHh Q lcl|NC_021326. 343 IQEL--------LWFVFEHFDIKGEHKDVDISFNYNKVANTELQV---QTAQ-------QSMGI-------VSHETVLEN 397 (445) Q Consensus 343 l~~~--------~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~---~~~~-------~~~g~-------~s~et~l~~ 397 (445) +.++ +..++..++-.--...+++.+..+ .+.+..+ +.+. .++++ +....++.. T Consensus 386 ~~rl~~Ell~Pli~r~l~~~~p~lp~~~v~~~~vs~--l~~l~r~~~~~~i~~~~~~i~~~~~~~p~v~d~id~d~~~~~ 463 (516) T protein:vir:96 386 YSLFATTMQSPVAMWGLLEAGESFTSDLVDPVIITG--IEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDW 463 (516) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCCccccccceeech--HHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHH Confidence 5443 222222222111112233333211 1122211 1111 11211 122333332 Q ss_pred C---CCCC----CHHHHHHHHHHHHHHHHHhhhccc-cCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 398 H---PFVE----DLQAELERIEQEQMEYNKQLPNLD-DGGADGAQQKERSNDKQSE 445 (445) Q Consensus 398 l---~~~~----d~~~E~~ri~~E~~~~~~~~~~~~-~~~~~~~~~~~~~~d~~~~ 445 (445) + -+++ -.++|++++.+++.+..+..+... -+.+.+ +..+++-.| T Consensus 464 ~a~~~Gvp~~~irs~eev~~~~~~~~~~q~~~~~a~~~~~~~~----~~~~~~~~~ 515 (516) T protein:vir:96 464 VRGQISAELPFLKSAEEMAQEQEAQMQAQQAQMLEEGVAKAVP----GVIQQELKE 515 (516) T ss_pred HHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhh----HHhhccccc Confidence 2 1221 234666655555444333221111 111111 111111122 No 209 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=96.56 E-value=0.00051 Score=38.64 Aligned_cols=420 Identities=9% Similarity=0.006 Sum_probs=167.4 Q ss_pred ChHHHHHHHH--HH---HHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C-- Q lcl|NC_021326. 1 MIVRYIKQHL--EK---LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P-- 71 (445) Q Consensus 1 ~l~~~i~~~~--~~---~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~-- 71 (445) |=.++-..+. +| ..+++.+.+|.- |..-....... .... ...+...+-+...++.+++-|++- | T Consensus 1 m~~~~~~l~~k~~R~~~e~~w~e~a~~~l-----P~~~~~~~~~~-~~~~-~~~~~~dstg~~a~~~LAa~l~~~ltpp~ 73 (514) T protein:vir:80 1 MRQQASAMWAEYRDSTAIRKAEDFAKFTI-----ASLMVDPLDKT-HQAE-VVEYDFQSAGAFLVNNLTAKLALTLFPPG 73 (514) T ss_pred CccchHHHHHHhhcchHHHHHHHHHHHhc-----ccccCCCCCCc-cccc-ccccccchhHHHHHHHHHHHHHhhhcCCC Confidence 2222211111 12 233334444431 11110000000 0000 011223445566677777666542 2 Q ss_pred ---eeeccCch-------------HHHHHHH--------HHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEE Q lcl|NC_021326. 72 ---IAFKHTDD-------------EVIKRID--------EVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRV 127 (445) Q Consensus 72 ---~~~~~~d~-------------~~~~~l~--------~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~ 127 (445) +++..+++ ++...+. .+..+||...+.++.++..++|.+.+++..+. + .++.+ T Consensus 74 ~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~-~--~~~~~ 150 (514) T protein:vir:80 74 RPSFQIELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYREPGT-G--KMLVW 150 (514) T ss_pred CcccccccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEecCC-C--cEEEE Confidence 23333221 1222221 22247899999999999999999877763322 2 35566 Q ss_pred ccceeEEEEcCCCCCceEEEEEEEee--------------------ecceeEEEEecceEEEEEEecceeeecccccccc Q lcl|NC_021326. 128 PAEQGIPIWTDKEHEELEAFIRMYKL--------------------ENETKVEYWDKITVNYYVYENGSLIPDYSNNLEN 187 (445) Q Consensus 128 ~p~~~~~v~d~~~~~~~~~~v~~~~~--------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 187 (445) +-.+ |.+-.|. .+++...+|..+. +....+++|+..... ...............+. T Consensus 151 pl~~-y~v~~d~-~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~--~~~~~~~~sv~~e~~g~ 226 (514) T protein:vir:80 151 TMQS-YTVRRTS-HGDPAVVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTVIEWQ--PTPNGKRCAVWHELEGK 226 (514) T ss_pred EcCe-EEEeeCC-CcCeEEEEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEEEEee--cCCCCeEEEEEEeccce Confidence 5444 4444443 4666555544431 111223333211100 00000000000000111 Q ss_pred cccccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhh Q lcl|NC_021326. 188 SKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLR 262 (445) Q Consensus 188 ~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~ 262 (445) ........++..+|++.+. .+.+|+|-.....+-+..+|...-...........|.+.+.-......... ... T Consensus 227 ~i~~es~y~~~e~P~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~~g~~~~~~l--~~~ 304 (514) T protein:vir:80 227 RVGPESSYPAHLCPYVPVAWNVPDGEHYGRGYVEEYSGDFARLSILSERLGLYEFEALSLLNLVDEAKGGAVDDY--RDA 304 (514) T ss_pred eecccCccccccCCeeeeeeEecCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceeCcccccchhhh--ccc Confidence 1111112223346766543 346799999999999999998877777777777777665421111111110 112 Q ss_pred hCceeeccCCCceeeEecc--CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHH----H-H Q lcl|NC_021326. 263 YYGAIKVSDNGGVDTIQVE--VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKAD----K-L 335 (445) Q Consensus 263 ~~~~~~~~~~~~~~~l~~~--~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~----~-~ 335 (445) ..+.+..+..++++.+... .+.......++.++..|...-.+.. . ...+...|++.+......+..... + . T Consensus 305 ~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aFml~~-~-~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~ 382 (514) T protein:vir:80 305 ETGDFVPGQVGSVASYERGDYNKIAQASASVESIVMRLNRAFMYTG-Q-VRDAERVTVEEIRTVAEEAENLLGGVYSLLA 382 (514) T ss_pred CCceeecCCCccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhc-c-CCCCCCCCHHHHHHHHHHHHHHhhHHHHHHH Confidence 2234444445556666543 4667777888888887765322211 1 122344677777654333333221 1 1 Q ss_pred HHHHHHHHHHHHHHHHHHh-cc----CCCcceEEEEeCCCC-CCCHHHHHHHH-------HHHhcc-------CChHHHH Q lcl|NC_021326. 336 ARKAKVAIQELLWFVFEHF-DI----KGEHKDVDISFNYNK-VANTELQVQTA-------QQSMGI-------VSHETVL 395 (445) Q Consensus 336 ~~~~~~~l~~~~~~~~~~~-~~----~~~~~~i~v~f~~~~-p~d~~~~~~~~-------~~~~g~-------~s~et~l 395 (445) ...+.+-+.+++.++.+.. |. ..+. +.+.+.-++ +......++.+ ..+.++ +....++ T Consensus 383 ~Ell~Pli~r~~~il~r~~~g~lP~~p~~l--~~~~~vs~la~l~r~~~~~~l~~~~~~i~~l~~~~p~v~d~id~d~~~ 460 (514) T protein:vir:80 383 ETLQAPLAYLTMYEASRGNGGMLLGIAQGV--YRPSIITGIPALTRNIETANILRATQEASAIVPALVQLSKRFDPEKLV 460 (514) T ss_pred HHHHHHHHHHHHHHHhhhccCCCCCCCchh--hcceeeecHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHH Confidence 2222333444444443211 11 1122 233332111 12222222222 222222 2233334 Q ss_pred HhC---CCC------CCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCC Q lcl|NC_021326. 396 ENH---PFV------EDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSND 441 (445) Q Consensus 396 ~~l---~~~------~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d 441 (445) ..+ -++ .+. ++++..+++.++.++++.........++.+.+--.- T Consensus 461 ~~~a~~~Gvp~~~i~~~~-e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (514) T protein:vir:80 461 ERIFANNSVDLSTLSKDP-DVVAAEAEQEAALAQQQLDVASGALAAETSAGVLTS 514 (514) T ss_pred HHHHHHhCCCHhhccCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccCC Confidence 332 122 222 222222222111111111111111111111000000 No 210 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=96.54 E-value=0.00053 Score=38.55 Aligned_cols=358 Identities=9% Similarity=-0.012 Sum_probs=149.9 Q ss_pred ChHHHHHHHHHHHHH-HHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLEKLPE-ISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~~~~~-~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) +++++.+.-...... ...+...+-+- ..... ..-...-+..+-...+|+..++-+-+-|+.+.-... T Consensus 3 ~f~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~--~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 70 (382) T protein:vir:48 3 IFNLATESPPDNQGGFFDVVDSDFLAS----------LKGNE--WVSAETALRNSDLFSIINQLSNDLATVKLITSRKKL 70 (382) T ss_pred cccccccCCcccccccccchhhhcccc----------ccCCc--ccchHhhhccHHHHHHHHHHHHhhccCceeeecchh Confidence 443332211100000 00000000000 00000 000000011222334566666665555665432221 Q ss_pred HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEEEEeeecce Q lcl|NC_021326. 80 EVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENET 157 (445) Q Consensus 80 ~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~ 157 (445) ...+..=.. .........+..+.+.+|.+|+.+..|..|++ .+..++|..+.++.++. .+.+.+ ++ ...... T Consensus 71 --~~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~-~~~~~y--~~-~~~~~~ 144 (382) T protein:vir:48 71 --QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDN-KDGIYY--NI-TFDDPR 144 (382) T ss_pred --hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCC-CCeEEE--EE-EecCcc Confidence 122221111 13456677788899999999999999988886 67888999987766542 111111 00 000000 Q ss_pred eEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 158 KVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDL 232 (445) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~lid~~~~~~s~~ 232 (445) . ..... .+.. -|+|++. ...|.|.+..+...++....+..-. T Consensus 145 ~------~~~~~-------------------------~~~~--evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~ 191 (382) T protein:vir:48 145 I------PPKQH-------------------------VPQN--DVLHFRLLSVDGGMTSVSPLMALSRELDIQKASGNLT 191 (382) T ss_pred c------cceeE-------------------------EcCc--cEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHH Confidence 0 00000 0000 1333331 2357788877777777666665556 Q ss_pred HHHHHHhcCCeeEEec--CCcc-cchhHHHhh-----hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCc Q lcl|NC_021326. 233 SNTFKDSNELTYVLTN--YDDQ-ELPEFKRLL-----RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQA 304 (445) Q Consensus 233 ~~~~~~~~~~~l~~~g--~~~~-~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~ 304 (445) .+.+...+.|-.++.- ...+ ......... ..++++.++++.+.+.+........+.+..+...+.|+..-++ T Consensus 192 ~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgV 271 (382) T protein:vir:48 192 INSLKNALNANGILKIKGGGLLDFKTKLSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGI 271 (382) T ss_pred HHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhhccCCCCeeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCC Confidence 6667777777666543 2211 111111111 1234556666655555544444455667777788888888888 Q ss_pred cccccccccCcch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHH Q lcl|NC_021326. 305 VDFSSDKFGSAPS-GVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQ 383 (445) Q Consensus 305 p~~~~~~~~~~~S-g~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~ 383 (445) |....+..+++.+ ..+.+ ..+...|.-+++.+...+...-- ..+........-.+.......+. T Consensus 272 p~~~lg~~~~~~~~~~~~~--------------~~~~~~l~p~~~~i~~~l~~~l~-~~~~~~~~~~~~~~~~~~~~~~~ 336 (382) T protein:vir:48 272 PDNVVGGQGDQQSSLEMSS--------------DLYSKAVSRYLRPFLSELSQKLS-CDVDADIFPAVDPTGSNYISRIN 336 (382) T ss_pred CHHHhCCCCCcccHHHHHH--------------HHHHHHHHHHHHHHHHHHHHHhc-ChhhhhhhhhhccchhHHHHHHH Confidence 8655443222222 11211 12222222222222211111000 00011001111122333344444 Q ss_pred HH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 384 QS--MGIVSHETVLENH---PFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 384 ~~--~g~~s~et~l~~l---~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++ +|+++.-.+++.+ ++..+...+. ....+. . +++++.+ +| T Consensus 337 ~l~~~g~~t~~e~r~~l~~~g~~~~~~~~~----------~~~~~~-~----~GGd~~~------~~ 382 (382) T protein:vir:48 337 SLVKTGTLAQNQGLYILQQAEILPKELPNG----------ENPNST-L----KGGEEDG------QD 382 (382) T ss_pred HHhhcCccCHHHHHHHHhhCCCCCcchhhh----------hcCCCC-C----CCCCCCC------CC Confidence 44 4888888887765 4433311110 000011 1 1111111 11 No 211 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=96.42 E-value=0.00064 Score=38.08 Aligned_cols=379 Identities=9% Similarity=-0.028 Sum_probs=159.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHH-----hcCCC-----ccccccccccccccccccccccccccchHHHHHHHHHhhhhcc Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEY-----YEQRP-----DIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK 70 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~y-----y~G~~-----~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~ 70 (445) ++..--.......+.+.....- |.|-- +|+..+ + . ....+. . .......-.+.+....+.+. T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~----~-~-~~ly~~-m-~~D~hi~s~l~~Rk~av~~~ 82 (448) T protein:vir:79 11 LVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGK----D-G-LLVYHK-M-LSDGTVKNALNYIFGRIRSA 82 (448) T ss_pred ccCcccccccccchhhhhhhhhhcccccccccccchhHhhccc----c-c-hHHHHH-H-hhChHHHHHHHHHHHHHhcC Confidence 1100000000000000000000 11100 000000 0 0 000000 0 01234555666777777888 Q ss_pred CeeeccCc-----hHHHHHHHHHhcc--------CHHHHHHHHHHHHHhcCeEE-EEEEE-CCCCcEEEEE---Ecccee Q lcl|NC_021326. 71 PIAFKHTD-----DEVIKRIDEVLGN--------RFDDKLHSVLTGASNKGIEW-LHPYL-DEEGEFKLFR---VPAEQG 132 (445) Q Consensus 71 ~~~~~~~d-----~~~~~~l~~~~~n--------~~~~~~~~~~~~~~~~G~~~-~~v~~-d~~g~~~i~~---~~p~~~ 132 (445) +..+.+.+ ....+++.+++.. +|.+ +..-..++..+|.++ +++|. ..+|+..+.. .+|... T Consensus 83 ~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~-~~~~~lda~~~G~s~~Eivw~~~~~g~~~~~~l~~r~~~~~ 161 (448) T protein:vir:79 83 KWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGR-LFAIYENAYIYGMAAGEIVLTLGADGKLILDKIVPIHPFNI 161 (448) T ss_pred CceEecCCCCHHHHHHHHHHHHHhhhhhhhhccCCHHH-HHHHHHHhhhhcceeEEEEeeecCCCceecccccccCCccc Confidence 88886422 2244455555532 2333 334456788889875 45664 4567653322 222211 Q ss_pred -EEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC---- Q lcl|NC_021326. 133 -IPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN---- 207 (445) Q Consensus 133 -~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n---- 207 (445) ...|+.. +.+.. .. ..... . .........+-+++. ++++.. T Consensus 162 ~~f~~~~d--~~l~~---------------~~--------~~~~~-----~--~~~~~~~~~~lP~~~--~i~~~~~~~g 207 (448) T protein:vir:79 162 DEVLYDEE--GGPKA---------------LK--------LSGEV-----K--GGSQFVSGLEIPIWK--TVVFLHNDDG 207 (448) T ss_pred cceeeecC--CceEE---------------ee--------cCCcc-----c--ccccCCCccccccce--EEEEecCccC Confidence 1112211 11111 00 00000 0 000000001112332 333332 Q ss_pred CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCccc--chh------HHHhhh--hCceeeccCCCceee Q lcl|NC_021326. 208 NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQE--LPE------FKRLLR--YYGAIKVSDNGGVDT 277 (445) Q Consensus 208 ~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~--~~~------~~~~~~--~~~~~~~~~~~~~~~ 277 (445) ++.|.|.+..+.-..---+..+.+++.-++.+..|+++.+-....+ ..+ ...++. ...+..++++.++++ T Consensus 208 ~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~~~~ie~ 287 (448) T protein:vir:79 208 SFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILPDDWKFDT 287 (448) T ss_pred CcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCCCCCcCHHHHHHHHHHHHHHhcCCceEEEecCCceEEE Confidence 3567888887666555566678889999999999999877432211 111 112222 233556899999999 Q ss_pred EeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcc Q lcl|NC_021326. 278 IQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQ-ELLWFVFEHFDI 356 (445) Q Consensus 278 l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~~~~~~~~~ 356 (445) ++...+...+..+++.+.+.|...--.--++.+..+ ..++.+......-....+..-.+.+...+. +++.-++.+.. T Consensus 288 ~ea~~~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~-g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~li~~l~~lNf- 365 (448) T protein:vir:79 288 VDLKSAMPDAIPYLTYHDAGIARALGIDFNTVQLNM-GVQAINIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNW- 365 (448) T ss_pred EecCCCcccHHHHHHHHHHHHHHHHhhhhhcccccc-chhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC- Confidence 988766666777888888777654311112222222 222222221111122222334455555564 36665555432 Q ss_pred CCCcceEEEEeCCCCCCCHHHHHHHHHHHhccCCh-HHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCC---C- Q lcl|NC_021326. 357 KGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSH-ETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGA---D- 431 (445) Q Consensus 357 ~~~~~~i~v~f~~~~p~d~~~~~~~~~~~~g~~s~-et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~---~- 431 (445) .....-..+.|....+.|..+.++.+.++++.... +........+. +...+.. . T Consensus 366 g~~~~~P~~~f~~~e~~Dl~~~a~~~~~l~~~~~~~~~~~~~~~~~p---------------------~~~~~~~~~a~~ 424 (448) T protein:vir:79 366 PSATRFPRLTFEMEERNDFSAAANLMGMLINAVKDSEDIPTELKALI---------------------DALPSKMRRALG 424 (448) T ss_pred CCcCCCcEEEecCCChHHHHHHHHHhhhhhccchhhHHHHHHhhcCC---------------------CCCCCccccccC Confidence 12222346778877788888888888777654211 11111111111 1111000 0 Q ss_pred CCCCCCCCCCcCCC Q lcl|NC_021326. 432 GAQQKERSNDKQSE 445 (445) Q Consensus 432 ~~~~~~~~~d~~~~ 445 (445) ...+..++.....+ T Consensus 425 ~~~~~~~~~~~~~~ 438 (448) T protein:vir:79 425 VVDEVREAVRQPAD 438 (448) T ss_pred CCCcccccccCCcc Confidence 00111111111111 No 212 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=96.32 E-value=0.00075 Score=37.71 Aligned_cols=354 Identities=9% Similarity=-0.021 Sum_probs=128.5 Q ss_pred HHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchHHHHHHHH-Hhc--c- Q lcl|NC_021326. 16 ISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVIKRIDE-VLG--N- 91 (445) Q Consensus 16 ~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~~~~~l~~-~~~--n- 91 (445) +-.+.+.+..+.+...-.. ..... ......-+.......+|+..++-+..-|+.+-..+......+-. +.. | T Consensus 1 Mg~f~~lf~~~~~~~~~~~--~~~~~--~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll~~~PN~ 76 (395) T protein:vir:10 1 MSILEKIFKTRKDITYMLD--LDMIE--DLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKLNIKPNT 76 (395) T ss_pred CchhhhhhccCcccccccc--chhcc--ccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHHHhccCc Confidence 2222233333221100000 00000 00001112233445566666666655666543332222222222 221 2 Q ss_pred --CHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEE--EEcCCCCCceEEEEEEEeeecceeEEEEecceE Q lcl|NC_021326. 92 --RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIP--IWTDKEHEELEAFIRMYKLENETKVEYWDKITV 167 (445) Q Consensus 92 --~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~--v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~ 167 (445) ........+..+.+..|.+|.++..+ +.+ ..+++....+ ++++. ...+.. . ...+ T Consensus 77 ~~t~~~f~~~~~~~lll~g~~~~~~~~~--~~~--~~~~~~~~~~~~~~~~~-----~~~~~~---~---------~~~~ 135 (395) T protein:vir:10 77 DLSSDSFWQQVIYKLIYDNEVLIVVSDS--KEL--LIADSFYREEYALYDDI-----FKDVTV---K---------DYTY 135 (395) T ss_pred CCCHHHHHHHHHHHHhhCCceEEEEecC--CCe--EecCCccceeEeecCcc-----eeEEEE---c---------Ccee Confidence 23445555666777778777655433 221 1122221111 11110 000000 0 0000 Q ss_pred EEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021326. 168 NYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNEL 242 (445) Q Consensus 168 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~ 242 (445) .. .+..=-|++++. ...|.|.++.....++.. ...+...+.+ T Consensus 136 ~~--------------------------~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~-------~~~~~~~~~~ 182 (395) T protein:vir:10 136 QR--------------------------TFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRM-------IGAQLKNYQI 182 (395) T ss_pred ee--------------------------eeccccEEEEccCCCCcccccchHHHHHHHHHHHH-------HHHHHhcCCC Confidence 00 000001333321 123555554444443322 2233444444 Q ss_pred eeEEecCCcccchh----HHHhhh-------hCc--eeeccCCCceeeEeccCC-----hHHHHHHHHHHHHHHHHHhCc Q lcl|NC_021326. 243 TYVLTNYDDQELPE----FKRLLR-------YYG--AIKVSDNGGVDTIQVEVP-----VENSKKYLDELYQKIMLFGQA 304 (445) Q Consensus 243 ~l~~~g~~~~~~~~----~~~~~~-------~~~--~~~~~~~~~~~~l~~~~~-----~~~~~~~i~~l~~~i~~~s~~ 304 (445) -.++.-......++ ....+. ..+ ++.++++.+.+.++.... ...+.+..+...+.|+..-++ T Consensus 183 ~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~V 262 (395) T protein:vir:10 183 RGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGI 262 (395) T ss_pred ceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCC Confidence 43332221111111 111111 111 222444444444432211 124555666677778888888 Q ss_pred cccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hccCCCcceEEEEeCCCCCCCHHHHH Q lcl|NC_021326. 305 VDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEH-----FDIKGEHKDVDISFNYNKVANTELQV 379 (445) Q Consensus 305 p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~-----~~~~~~~~~i~v~f~~~~p~d~~~~~ 379 (445) |........++.+..... .+...|.-++..+... +........+++.+...+..|..+.+ T Consensus 263 Pp~~l~~~~sn~e~~~~~---------------~~~~~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~ 327 (395) T protein:vir:10 263 PPGLIYGETADLEKNTLV---------------FEKFCLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYA 327 (395) T ss_pred CHHHhcCcccCHHHHHHH---------------HHHHHHHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHH Confidence 865443211111111111 1112222222222211 11111112245556666778888999 Q ss_pred HHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHH-HHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 380 QTAQQS--MGIVSHETVLENHPFV--EDLQAELERIEQEQ-MEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 380 ~~~~~~--~g~~s~et~l~~l~~~--~d~~~E~~ri~~E~-~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) +++.++ .|+++.-.++++++.- ++.. .++.---. -...+.. ...+.........+.++++.|+ T Consensus 328 ~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~--~d~~~~~~n~~~~~~~-~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 328 EAIDKLVSSGSFTRNEVRIMLGEEPSDNPE--LDEYLITKNYEKANSG-ENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred HHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--Cceeeecccccccccc-ccccCcccccccCCCCCCCCCC Confidence 988876 4889999988887542 2210 01000000 0000000 0011111111222333333344 No 213 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=96.32 E-value=0.00075 Score=37.71 Aligned_cols=354 Identities=9% Similarity=-0.021 Sum_probs=128.5 Q ss_pred HHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchHHHHHHHH-Hhc--c- Q lcl|NC_021326. 16 ISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVIKRIDE-VLG--N- 91 (445) Q Consensus 16 ~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~~~~~l~~-~~~--n- 91 (445) +-.+.+.+..+.+...-.. ..... ......-+.......+|+..++-+..-|+.+-..+......+-. +.. | T Consensus 1 Mg~f~~lf~~~~~~~~~~~--~~~~~--~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll~~~PN~ 76 (395) T protein:vir:10 1 MSILEKIFKTRKDITYMLD--LDMIE--DLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKLNIKPNT 76 (395) T ss_pred CchhhhhhccCcccccccc--chhcc--ccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHHHhccCc Confidence 2222233333221100000 00000 00001112233445566666666655666543332222222222 221 2 Q ss_pred --CHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEE--EEcCCCCCceEEEEEEEeeecceeEEEEecceE Q lcl|NC_021326. 92 --RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIP--IWTDKEHEELEAFIRMYKLENETKVEYWDKITV 167 (445) Q Consensus 92 --~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~--v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~ 167 (445) ........+..+.+..|.+|.++..+ +.+ ..+++....+ ++++. ...+.. . ...+ T Consensus 77 ~~t~~~f~~~~~~~lll~g~~~~~~~~~--~~~--~~~~~~~~~~~~~~~~~-----~~~~~~---~---------~~~~ 135 (395) T protein:vir:10 77 DLSSDSFWQQVIYKLIYDNEVLIVVSDS--KEL--LIADSFYREEYALYDDI-----FKDVTV---K---------DYTY 135 (395) T ss_pred CCCHHHHHHHHHHHHhhCCceEEEEecC--CCe--EecCCccceeEeecCcc-----eeEEEE---c---------Ccee Confidence 23445555666777778777655433 221 1122221111 11110 000000 0 0000 Q ss_pred EEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021326. 168 NYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNEL 242 (445) Q Consensus 168 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~ 242 (445) .. .+..=-|++++. ...|.|.++.....++.. ...+...+.+ T Consensus 136 ~~--------------------------~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~-------~~~~~~~~~~ 182 (395) T protein:vir:10 136 QR--------------------------TFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRM-------IGAQLKNYQI 182 (395) T ss_pred ee--------------------------eeccccEEEEccCCCCcccccchHHHHHHHHHHHH-------HHHHHhcCCC Confidence 00 000001333321 123555554444443322 2233444444 Q ss_pred eeEEecCCcccchh----HHHhhh-------hCc--eeeccCCCceeeEeccCC-----hHHHHHHHHHHHHHHHHHhCc Q lcl|NC_021326. 243 TYVLTNYDDQELPE----FKRLLR-------YYG--AIKVSDNGGVDTIQVEVP-----VENSKKYLDELYQKIMLFGQA 304 (445) Q Consensus 243 ~l~~~g~~~~~~~~----~~~~~~-------~~~--~~~~~~~~~~~~l~~~~~-----~~~~~~~i~~l~~~i~~~s~~ 304 (445) -.++.-......++ ....+. ..+ ++.++++.+.+.++.... ...+.+..+...+.|+..-++ T Consensus 183 ~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~V 262 (395) T protein:vir:10 183 RGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGI 262 (395) T ss_pred ceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCC Confidence 43332221111111 111111 111 222444444444432211 124555666677778888888 Q ss_pred cccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hccCCCcceEEEEeCCCCCCCHHHHH Q lcl|NC_021326. 305 VDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEH-----FDIKGEHKDVDISFNYNKVANTELQV 379 (445) Q Consensus 305 p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~-----~~~~~~~~~i~v~f~~~~p~d~~~~~ 379 (445) |........++.+..... .+...|.-++..+... +........+++.+...+..|..+.+ T Consensus 263 Pp~~l~~~~sn~e~~~~~---------------~~~~~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~ 327 (395) T protein:vir:10 263 PPGLIYGETADLEKNTLV---------------FEKFCLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYA 327 (395) T ss_pred CHHHhcCcccCHHHHHHH---------------HHHHHHHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHH Confidence 865443211111111111 1112222222222211 11111112245556666778888999 Q ss_pred HHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHH-HHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 380 QTAQQS--MGIVSHETVLENHPFV--EDLQAELERIEQEQ-MEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 380 ~~~~~~--~g~~s~et~l~~l~~~--~d~~~E~~ri~~E~-~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) +++.++ .|+++.-.++++++.- ++.. .++.---. -...+.. ...+.........+.++++.|+ T Consensus 328 ~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~--~d~~~~~~n~~~~~~~-~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 328 EAIDKLVSSGSFTRNEVRIMLGEEPSDNPE--LDEYLITKNYEKANSG-ENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred HHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--Cceeeecccccccccc-ccccCcccccccCCCCCCCCCC Confidence 988876 4889999988887542 2210 01000000 0000000 0011111111222333333344 No 214 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=96.32 E-value=0.00075 Score=37.71 Aligned_cols=354 Identities=9% Similarity=-0.021 Sum_probs=128.5 Q ss_pred HHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchHHHHHHHH-Hhc--c- Q lcl|NC_021326. 16 ISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVIKRIDE-VLG--N- 91 (445) Q Consensus 16 ~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~~~~~l~~-~~~--n- 91 (445) +-.+.+.+..+.+...-.. ..... ......-+.......+|+..++-+..-|+.+-..+......+-. +.. | T Consensus 1 Mg~f~~lf~~~~~~~~~~~--~~~~~--~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~~~~ll~~~PN~ 76 (395) T protein:vir:95 1 MSILEKIFKTRKDITYMLD--LDMIE--DLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKNDVYYKLNIKPNT 76 (395) T ss_pred CchhhhhhccCcccccccc--chhcc--ccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccccchHHHHHHhccCc Confidence 2222233333221100000 00000 00001112233445566666666655666543332222222222 221 2 Q ss_pred --CHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEE--EEcCCCCCceEEEEEEEeeecceeEEEEecceE Q lcl|NC_021326. 92 --RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIP--IWTDKEHEELEAFIRMYKLENETKVEYWDKITV 167 (445) Q Consensus 92 --~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~--v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~ 167 (445) ........+..+.+..|.+|.++..+ +.+ ..+++....+ ++++. ...+.. . ...+ T Consensus 77 ~~t~~~f~~~~~~~lll~g~~~~~~~~~--~~~--~~~~~~~~~~~~~~~~~-----~~~~~~---~---------~~~~ 135 (395) T protein:vir:95 77 DLSSDSFWQQVIYKLIYDNEVLIVVSDS--KEL--LIADSFYREEYALYDDI-----FKDVTV---K---------DYTY 135 (395) T ss_pred CCCHHHHHHHHHHHHhhCCceEEEEecC--CCe--EecCCccceeEeecCcc-----eeEEEE---c---------Ccee Confidence 23445555666777778777655433 221 1122221111 11110 000000 0 0000 Q ss_pred EEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021326. 168 NYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNEL 242 (445) Q Consensus 168 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~ 242 (445) .. .+..=-|++++. ...|.|.++.....++.. ...+...+.+ T Consensus 136 ~~--------------------------~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~-------~~~~~~~~~~ 182 (395) T protein:vir:95 136 QR--------------------------TFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRM-------IGAQLKNYQI 182 (395) T ss_pred ee--------------------------eeccccEEEEccCCCCcccccchHHHHHHHHHHHH-------HHHHHhcCCC Confidence 00 000001333321 123555554444443322 2233444444 Q ss_pred eeEEecCCcccchh----HHHhhh-------hCc--eeeccCCCceeeEeccCC-----hHHHHHHHHHHHHHHHHHhCc Q lcl|NC_021326. 243 TYVLTNYDDQELPE----FKRLLR-------YYG--AIKVSDNGGVDTIQVEVP-----VENSKKYLDELYQKIMLFGQA 304 (445) Q Consensus 243 ~l~~~g~~~~~~~~----~~~~~~-------~~~--~~~~~~~~~~~~l~~~~~-----~~~~~~~i~~l~~~i~~~s~~ 304 (445) -.++.-......++ ....+. ..+ ++.++++.+.+.++.... ...+.+..+...+.|+..-++ T Consensus 183 ~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~V 262 (395) T protein:vir:95 183 RGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGI 262 (395) T ss_pred ceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCC Confidence 43332221111111 111111 111 222444444444432211 124555666677778888888 Q ss_pred cccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hccCCCcceEEEEeCCCCCCCHHHHH Q lcl|NC_021326. 305 VDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEH-----FDIKGEHKDVDISFNYNKVANTELQV 379 (445) Q Consensus 305 p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~-----~~~~~~~~~i~v~f~~~~p~d~~~~~ 379 (445) |........++.+..... .+...|.-++..+... +........+++.+...+..|..+.+ T Consensus 263 Pp~~l~~~~sn~e~~~~~---------------~~~~~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~ 327 (395) T protein:vir:95 263 PPGLIYGETADLEKNTLV---------------FEKFCLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYA 327 (395) T ss_pred CHHHhcCcccCHHHHHHH---------------HHHHHHHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHH Confidence 865443211111111111 1112222222222211 11111112245556666778888999 Q ss_pred HHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHH-HHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 380 QTAQQS--MGIVSHETVLENHPFV--EDLQAELERIEQEQ-MEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 380 ~~~~~~--~g~~s~et~l~~l~~~--~d~~~E~~ri~~E~-~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) +++.++ .|+++.-.++++++.- ++.. .++.---. -...+.. ...+.........+.++++.|+ T Consensus 328 ~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~--~d~~~~~~n~~~~~~~-~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:95 328 EAIDKLVSSGSFTRNEVRIMLGEEPSDNPE--LDEYLITKNYEKANSG-ENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred HHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--Cceeeecccccccccc-ccccCcccccccCCCCCCCCCC Confidence 988876 4889999988887542 2210 01000000 0000000 0011111111222333333344 No 215 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=96.21 E-value=0.00087 Score=37.37 Aligned_cols=384 Identities=9% Similarity=0.021 Sum_probs=164.0 Q ss_pred hHHHHHHHHHHHHHHHHH-HHHhcCCCcccccccc--ccc---cccccccccccccccchHHHHHHHHHhhhhccCeeec Q lcl|NC_021326. 2 IVRYIKQHLEKLPEISIG-QEYYEQRPDIVKEPKP--VDA---TGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFK 75 (445) Q Consensus 2 l~~~i~~~~~~~~~~~~~-~~yy~G~~~i~~~~~~--~~~---~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~ 75 (445) .++- ...+..+.... ..|+ |..-....+.. ... ........++.-+.++-...+|+..++-+..-|+.+- T Consensus 1 ~~~~---~~~~~~~~~~~~~~~~-g~~~s~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~lp~~~~ 76 (437) T protein:vir:10 1 MKQG---KQRALGRIKSSFLKWL-GVPISLTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIATLPLNLY 76 (437) T ss_pred CCcc---hhhhhhhhHHhhhhhc-CCcccCCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhhCceeEE Confidence 1111 22222333221 2222 32110000000 000 0000000000011122233456666665555565531 Q ss_pred ---cCc--h-HHHH-HHHHHhc--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcEE-EEEEccceeEEEEcCCCCC Q lcl|NC_021326. 76 ---HTD--D-EVIK-RIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFK-LFRVPAEQGIPIWTDKEHE 142 (445) Q Consensus 76 ---~~d--~-~~~~-~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~-i~~~~p~~~~~v~d~~~~~ 142 (445) .+. . .... ....+.. | ....+...+..+.+.+|.+|+++..+. |++. +..++|..+.+..+.+ + T Consensus 77 ~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-g~~~~L~~l~p~~v~i~~~~~--g 153 (437) T protein:vir:10 77 QTKPDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSA-GVLIGLELMLPQRTTVKRLTS--G 153 (437) T ss_pred EEcCCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcEEEEEEEcCcceEEEECCC--C Confidence 111 0 0111 1222221 2 345566677888999999999998874 7754 7888898887765432 2 Q ss_pred ceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCcCccHHHH Q lcl|NC_021326. 143 ELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLEISDIFMY 218 (445) Q Consensus 143 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g~s~~~~v 218 (445) .+.+. + ... ......+... -|+|++ +...|.|.+..+ T Consensus 154 ~~~y~--~-~~~-~g~~~~~~~~-----------------------------------dIih~r~~~~d~~~G~spi~~~ 194 (437) T protein:vir:10 154 ALQYT--Y-RNV-DGTVSTLAED-----------------------------------DVFHVRGFSLDGLMGLTPIQYA 194 (437) T ss_pred eEEEE--E-Eec-CceEEEEccc-----------------------------------cEEEecCcCCCCcccccHHHHH Confidence 22110 0 000 0000000000 022222 223577877776 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCeeEEecCC---cccchhHHHhhh--------hCceeeccCCCceeeEeccCChHHH Q lcl|NC_021326. 219 KTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLLR--------YYGAIKVSDNGGVDTIQVEVPVENS 287 (445) Q Consensus 219 ~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~~--------~~~~~~~~~~~~~~~l~~~~~~~~~ 287 (445) ...++............+...+.|-.++.... .+.....+..+. .++++.++++.+.+.+........+ T Consensus 195 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~ 274 (437) T protein:vir:10 195 REVLGNSTAANKTSASVFRNGLRPSGVLSTDQILQKEKRAEIRTDLAEQFGGAMQAGKTMVLEAGMKYQAITMNPGDVQL 274 (437) T ss_pred HHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceeccCCceEEeccCChhhHHH Confidence 66666555555555666677777766665422 222222222221 1335556665555544443344455 Q ss_pred HHHHHHHHHHHHHHhCcccccccccc-CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----CCC--c Q lcl|NC_021326. 288 KKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI----KGE--H 360 (445) Q Consensus 288 ~~~i~~l~~~i~~~s~~p~~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~----~~~--~ 360 (445) .+..+.....|+..-++|....+... ++..+..++.....+ +...|.-.+..+...+.. ..+ . T Consensus 275 ~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f----------~~~tl~P~~~~ie~~l~~kll~~~e~~~ 344 (437) T protein:vir:10 275 LETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTLGF----------LTFTLRPWLTRIEQAARRSLLRPGERDQ 344 (437) T ss_pred HHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHH----------HHHHHHHHHHHHHHHHHhhccCccccCc Confidence 66667777788888888865443222 222222333221111 222222222222221111 111 1 Q ss_pred ceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHH------HHHHHHHHHHHHHhhhc-cccCC Q lcl|NC_021326. 361 KDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFV--EDLQAE------LERIEQEQMEYNKQLPN-LDDGG 429 (445) Q Consensus 361 ~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~--~d~~~E------~~ri~~E~~~~~~~~~~-~~~~~ 429 (445) ..+++.+...+..|..+.++++.++ +|+++.-.++++++.- +.-..- +..+++- .+..+. ....+ T Consensus 345 ~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gg~~~~~~~~~~~~~~~~----~~~~~~~~~~~~ 420 (437) T protein:vir:10 345 FYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTRDECRAKENLPPMGGNAAVLTVQSALLPIDKL----GEHTTATAAQDA 420 (437) T ss_pred eEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcceEeecCcccchhhc----cCcCCCcchhcc Confidence 2345555666778899999988876 4899999998877442 111000 1111110 011111 11111 Q ss_pred CCCCCCCCCCCCcCCC Q lcl|NC_021326. 430 ADGAQQKERSNDKQSE 445 (445) Q Consensus 430 ~~~~~~~~~~~d~~~~ 445 (445) ..++..+.+.....+| T Consensus 421 ~~~~~~~~~~~~~~~e 436 (437) T protein:vir:10 421 LKAWLYQEEKTRATQE 436 (437) T ss_pred ccccCCCCCCCCcccc Confidence 1111111111111122 No 216 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=96.07 E-value=0.001 Score=36.93 Aligned_cols=374 Identities=10% Similarity=0.013 Sum_probs=156.9 Q ss_pred ChHHHHHHHHHHH-HHH------------HHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhh Q lcl|NC_021326. 1 MIVRYIKQHLEKL-PEI------------SIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYI 67 (445) Q Consensus 1 ~l~~~i~~~~~~~-~~~------------~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l 67 (445) .|.+.+....... ..+ ......+-|.. .. .+.. .....-+.+.=.-.+|+..++-+ T Consensus 4 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~------~~---~g~~--v~~~~al~~~~V~~~i~~ia~~i 72 (434) T protein:vir:43 4 SLGKVLSSATSAPRSSLFGWGGKTIRLTDGAFWSQFLGRE------SS---SGKK--VTVDKAMKLSAVWACVRLISTSV 72 (434) T ss_pred chhhhhhhcccccchhhhcccccccccCchHHHHHHhcCC------cc---CCce--echhhhhccHHHHHHHHHHHHhh Confidence 4444443322111 000 00011111110 00 0000 00000011111223556666555 Q ss_pred hccCeee-c--cCc--hH-HHHHHHHHh-c--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEE Q lcl|NC_021326. 68 VGKPIAF-K--HTD--DE-VIKRIDEVL-G--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIP 134 (445) Q Consensus 68 ~g~~~~~-~--~~d--~~-~~~~l~~~~-~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~ 134 (445) -.-|+.+ . .+. .. ..-.+...+ . | .-..+...+..+.+.+|.+|+.+..+ .|++ .+..++|..+.+ T Consensus 73 a~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~-~G~~~~L~~l~p~~v~~ 151 (434) T protein:vir:43 73 AGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA-AGRPAALDFLLPSRVDL 151 (434) T ss_pred hhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEE Confidence 5556654 1 111 11 111222222 2 3 23556677788999999999888766 5775 467889988877 Q ss_pred EEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCC Q lcl|NC_021326. 135 IWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDL 210 (445) Q Consensus 135 v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~ 210 (445) +.+.. +.+.+ +..........+. .--|++++ +... T Consensus 152 ~~~~~--g~~~y----~~~~~~g~~~~~~-----------------------------------~~eVih~~~~~~dg~~ 190 (434) T protein:vir:43 152 ECDEN--GRLKY----FYTTKKGARREIE-----------------------------------RTNMLHIPAFTLDGRI 190 (434) T ss_pred EEcCC--CeEEE----EEEecCceEEEEc-----------------------------------cccEEEecCcCCCCcc Confidence 66542 22221 1111000000000 00122222 2234 Q ss_pred cCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCC---cccchhHHHhhh-------hCceeeccCCCceeeEec Q lcl|NC_021326. 211 EISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLLR-------YYGAIKVSDNGGVDTIQV 280 (445) Q Consensus 211 g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~~-------~~~~~~~~~~~~~~~l~~ 280 (445) |.|.+......+........-....+...+.|-.++.-.. .+..+.++..+. .++++.++++.+.+.++. T Consensus 191 G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~r~~~~~~~g~~nag~~~vl~~g~~~~~l~~ 270 (434) T protein:vir:43 191 GLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRILQPAQREEFREYVKSVSGAMNSGRSPVLEQGITPETIGI 270 (434) T ss_pred ccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCCCCHHHHHHHHHHHHHhcCccccCCccccCCCceEEEccC Confidence 6776766555555544443334444555666755554322 221222222221 233455555555444443 Q ss_pred cCChHHHHHHHHHHHHHHHHHhCcccccccccc-CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---- Q lcl|NC_021326. 281 EVPVENSKKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD---- 355 (445) Q Consensus 281 ~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~---- 355 (445) ......+.+..+.....|+..-++|....+... ++.++..++..... .+...|.-++..+...+. T Consensus 271 ~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~~~----------f~~~~L~P~~~~ie~~ln~kL~ 340 (434) T protein:vir:43 271 NPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQMLA----------FLTFSISSITNQIQQCVNKRLL 340 (434) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHHHH----------HHHHHHHHHHHHHHHHHHhhcC Confidence 334455666777778888888888865443222 22223333322111 122223322222222111 Q ss_pred -cCC-CcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHH-----HHHHHHHHHHHHHHhhhc Q lcl|NC_021326. 356 -IKG-EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFV--EDLQA-----ELERIEQEQMEYNKQLPN 424 (445) Q Consensus 356 -~~~-~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~--~d~~~-----E~~ri~~E~~~~~~~~~~ 424 (445) ... ....+++.+...+..|..+.++.+.++ .|+++.-.++++++.- ++-+. -+..+. .. .+ T Consensus 341 ~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~~--~~------~~ 412 (434) T protein:vir:43 341 TAPERIRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRKENLPELPGGDILTVQSNLVPID--QL------GQ 412 (434) T ss_pred ChhhhcCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeeccCccchh--hh------hc Confidence 111 112344555566678899999998876 4899999988876442 11000 000110 00 00 Q ss_pred cccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 425 LDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 425 ~~~~~~~~~~~~~~~~d~~~~ 445 (445) ........+..+..++.++-| T Consensus 413 ~~~~~~~~~~~~~~~~~~~~~ 433 (434) T protein:vir:43 413 SNKSQAVRAALMNWFSQPEPQ 433 (434) T ss_pred cCCCcchhhhhhccCCCCCCC Confidence 000000001111111111111 No 217 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=96.00 E-value=0.0011 Score=36.71 Aligned_cols=365 Identities=11% Similarity=0.042 Sum_probs=152.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeee--ccCc Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF--KHTD 78 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~--~~~d 78 (445) -+....... -...+..... ..... -+.++-...+|+..++-+..-|+.+ ...+ T Consensus 33 ~~~~~~~~~-----------~~~~~~~g~~----v~~~~----------al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~~ 87 (432) T protein:vir:81 33 PVNATARDL-----------GIIISDTGAA----VNADA----------IMRLDAVAACVKLVSQAIAAMPLTMYMRTPD 87 (432) T ss_pred cCccchhhh-----------cccccccCcc----cchHh----------hhccHHHHHHHHHHHHhhhhCceeeEEecCC Confidence 111111110 0000000000 00000 0111222334555555554556543 1111 Q ss_pred h---HHHHHHHHHh-c--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEE Q lcl|NC_021326. 79 D---EVIKRIDEVL-G--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFI 148 (445) Q Consensus 79 ~---~~~~~l~~~~-~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v 148 (445) . ....-+-..+ . | .-..+...+..+.+.+|.+|+.+..+ +|++ .+..++|..+.+..++. +.+.+ T Consensus 88 g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-~g~~~~L~~l~~~~v~v~~~~~--g~~~y-- 162 (432) T protein:vir:81 88 GRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDPK--GNTAY-- 162 (432) T ss_pred cceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEECCC--CcEEE-- Confidence 1 0111111222 1 2 23455667778889999999988775 4765 46778998888776642 22221 Q ss_pred EEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCcCccHHHHHHHHHH Q lcl|NC_021326. 149 RMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLEISDIFMYKTLIDA 224 (445) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g~s~~~~v~~lid~ 224 (445) ++... ......+... -|++++ +...|.|.+......++. T Consensus 163 ~~~~~--~g~~~~~~~~-----------------------------------~iih~r~~~~dg~~G~spi~~~~~~i~~ 205 (432) T protein:vir:81 163 RYRRT--DGQMIDIPKQ-----------------------------------QIWKIMGYSLDGENGLSAIRYGAQIFGT 205 (432) T ss_pred EEEec--CceEEEEccc-----------------------------------cEEEecCCCCCCcccccHHHHHHHHHHH Confidence 11110 0000000000 012221 123467777666665555 Q ss_pred HHHHHHHHHHHHHHhcCCeeEEecCC---cccchhHHHhh----hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHH Q lcl|NC_021326. 225 YNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLL----RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQK 297 (445) Q Consensus 225 ~~~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~ 297 (445) ...........+...+.|-.++.-.. .+....+...+ ...+++.++++.+.+.+........+.+..+..... T Consensus 206 ~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~ 285 (432) T protein:vir:81 206 AIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFAKKVSGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVES 285 (432) T ss_pred HHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHHHHHhhhhcCCCceecCCCceEEEccCCHHHHHHHHHHHHHHHH Confidence 54444444444555566655444221 11112222221 234466677666665555444445566667777788 Q ss_pred HHHHhCcccccccccc--CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----cCCCcceEEEEe--CC Q lcl|NC_021326. 298 IMLFGQAVDFSSDKFG--SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD----IKGEHKDVDISF--NY 369 (445) Q Consensus 298 i~~~s~~p~~~~~~~~--~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~----~~~~~~~i~v~f--~~ 369 (445) |+..-++|....+... +...+..++.....+. ...|.-.+..+...+. ...+.....++| .. T Consensus 286 Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~f~----------~~tl~P~~~~ie~~l~~kLl~~~~~~~~~~~fd~~~ 355 (432) T protein:vir:81 286 ICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFL----------TMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSA 355 (432) T ss_pred HHHHhCCCHHHcCCcCCccccccchHHHHHHHHH----------HHHHHHHHHHHHHHHHhhccCccccCceEEEeechh Confidence 8888888865443322 1222333432222221 1122222222211111 111112334444 55 Q ss_pred CCCCCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 370 NKVANTELQVQTAQQS--MGIVSHETVLENHPFV--EDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 370 ~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) .+..|..+.++.+.++ .|+++.-.++++++.- ++- ..+-.+..-..-............ .....++.+++-++ T Consensus 356 llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~g~-~~~~~~~~~~~pl~~~~~~~~~~~--~~~~~n~~~~~~~~ 432 (432) T protein:vir:81 356 LLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGN-AAVLTVQSAMVPLDSIGLQASPEP--ASGLGNQQQDKVSK 432 (432) T ss_pred hhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC-cceEeecCcccchhhhccCCCCCC--CCCCCCcccccccC Confidence 6677899999988876 5899999999887542 110 110000000000000000000000 01111111111111 No 218 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=95.99 E-value=0.0012 Score=36.67 Aligned_cols=409 Identities=11% Similarity=0.081 Sum_probs=177.0 Q ss_pred ChHHHHHHHHHHHHH-HHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhcc--C-----e Q lcl|NC_021326. 1 MIVRYIKQHLEKLPE-ISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK--P-----I 72 (445) Q Consensus 1 ~l~~~i~~~~~~~~~-~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~--~-----~ 72 (445) -|++..+.++.+... ...+.++++=- +|..-.. .. .. +...++..+-+...++.+++-|++- | + T Consensus 15 ~l~~r~~~L~~~R~~~e~~w~e~a~~~--lP~~~~~---~~--~~-~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 86 (516) T protein:vir:10 15 KIPKLWEKFSTKRSSFLDRAKHYSKLT--LPYLMND---KG--DN-ETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFF 86 (516) T ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHhh--cccccCC---CC--Cc-ccccccccchHHHHHHHHHHHHHhhhcCCCCccc Confidence 555555555433322 23344443211 1111100 00 01 1112455666777788877776542 2 2 Q ss_pred eeccCch-------------HHHHHHH--------HHhccCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccce Q lcl|NC_021326. 73 AFKHTDD-------------EVIKRID--------EVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQ 131 (445) Q Consensus 73 ~~~~~d~-------------~~~~~l~--------~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~ 131 (445) ++...+. ++.+.+. .+..+||...+.++.++..++|.+.+ |.|+++.+ +.++-.+ T Consensus 87 ~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--~~d~~~~~--~~~pl~~ 162 (516) T protein:vir:10 87 RVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCML--YKPSKGAI--SAIPMHH 162 (516) T ss_pred cccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeE--EecCCCCe--EEEEcCe Confidence 3333221 1222221 22246888999999999999999864 45776654 4454444 Q ss_pred eEEEEcCCCCCceEEEEEEEee----------------------ecceeEEEEecceEEEEEEecceeeecccccccccc Q lcl|NC_021326. 132 GIPIWTDKEHEELEAFIRMYKL----------------------ENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSK 189 (445) Q Consensus 132 ~~~v~d~~~~~~~~~~v~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 189 (445) |.+..|. .+++...++..+. +....+++|+.... .....+.... ...+... T Consensus 163 -y~v~~d~-~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v~~---~~~~~~~~~~--~~d~~~~ 235 (516) T protein:vir:10 163 -YVVNRDT-NGDLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHAKY---LGEGFWELKQ--SADDIPV 235 (516) T ss_pred -EEEeeCC-CCCeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEEEe---cCCCceEEEE--eeCceee Confidence 4444444 4566555543321 11122333321110 0011111100 0011111 Q ss_pred cccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhhhhC Q lcl|NC_021326. 190 THFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYY 264 (445) Q Consensus 190 ~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~ 264 (445) .....-+|..+|++.+. .+.+|+|-.....+-+..+|...-...........|.+.+.-......... ..... T Consensus 236 ~~~s~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~l--~~~~~ 313 (516) T protein:vir:10 236 GKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDHF--VNSGT 313 (516) T ss_pred ccccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccCcccccchhhh--ccCCC Confidence 11122334566766543 346899998999999999998877777777888877765521111111110 11122 Q ss_pred ceeeccCCCceeeEecc--CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 265 GAIKVSDNGGVDTIQVE--VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVA 342 (445) Q Consensus 265 ~~~~~~~~~~~~~l~~~--~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 342 (445) +.+..+..++++.+... .+.+.....++.++..|...-....+ ....+...|++.+.. +..++...++.. T Consensus 314 g~~~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l-~~rd~~rvTAtEV~~-------r~~E~~~~LGpv 385 (516) T protein:vir:10 314 GEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETM-TRRDAERVTAVEIQR-------DALEIEQNMGGV 385 (516) T ss_pred ceeecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhhh-hccCCccccHHHHHH-------HHHHHHHHhhhH Confidence 34444444556665433 35677777888888777654332211 111223467777654 444555555555 Q ss_pred HHHHH--------HHHHH-Hhc-cCCCcceEEEEeCCCCCCCHHHHHH---HHH-------HHhccCC-------h---- Q lcl|NC_021326. 343 IQELL--------WFVFE-HFD-IKGEHKDVDISFNYNKVANTELQVQ---TAQ-------QSMGIVS-------H---- 391 (445) Q Consensus 343 l~~~~--------~~~~~-~~~-~~~~~~~i~v~f~~~~p~d~~~~~~---~~~-------~~~g~~s-------~---- 391 (445) +.++- ..... .+. .+.....+++ ..+.+.+..++ .+. .++++-+ . T Consensus 386 ~~rl~~Ell~Pli~r~~~~~~p~~P~~lv~~~~----v~~i~~L~raq~~~~i~~~~q~i~~~~q~~p~v~d~id~d~~~ 461 (516) T protein:vir:10 386 YSLFATTMQSPVAMWGLLEAGDSFTSDLVDPVI----ITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYM 461 (516) T ss_pred HHHHHHHHHHHHHHHHHHhhCCCCChhhcCcce----ehhHHHHHHHHHHHHHHHHHHHHHHHhcCChHHHhhcCHHHHH Confidence 54431 11111 111 1111111111 11122222222 111 1111111 1 Q ss_pred HHHHHhCCC---CCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 392 ETVLENHPF---VEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 392 et~l~~l~~---~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) +.+...++- +--.++|++.+.+++.+..+..+.....+. ..++--+++--| T Consensus 462 ~~~a~~~gvp~~~irs~eev~~~r~~~~~~q~~~~~~~~~~~---~~~~~~~~~~~~ 515 (516) T protein:vir:10 462 DWVRGQISAELPFLKSAEEMEQEQEAQMQAQQAQMLEEGVAK---AVPGVIQQELKE 515 (516) T ss_pred HHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHHHHhhh---cccchhhhhhhc Confidence 122222211 112346666666555443332222111110 011111111111 No 219 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=95.94 E-value=0.0012 Score=36.54 Aligned_cols=395 Identities=14% Similarity=0.111 Sum_probs=162.9 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhh-hhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY-IVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~-l~g~~~~~~~~d~ 79 (445) +-.. ++.-.+.+.+|..+..+++-+.. ...||+..+-+ ...+|+.+..++- T Consensus 46 ~e~~-~~~~~eLI~~YR~ma~~pEvd~A---------------------------v~eIVneaiv~d~~~~pV~i~Ld~~ 97 (537) T protein:vir:10 46 FDGT-IRNDHELITRYREMVLNPECDSA---------------------------VDDVVNETICGNFDDVPISIDLHNL 97 (537) T ss_pred cccc-cchHHHHHHHHHHHhhccchhhH---------------------------HHHhhcceeEecCCCceEEEEeccc Confidence 1000 00112222333333333332221 12233332222 1234556554442 Q ss_pred HHH----HHH----HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC----CcEEEEEEccceeEEEEcCCCCCceEE Q lcl|NC_021326. 80 EVI----KRI----DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE----GEFKLFRVPAEQGIPIWTDKEHEELEA 146 (445) Q Consensus 80 ~~~----~~l----~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~----g~~~i~~~~p~~~~~v~d~~~~~~~~~ 146 (445) +.. +.+ +.++. =+|+...++..+...+.|+.|+...+|.+ |-..+..+||+.+-.|..-.. .... T Consensus 98 ~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fhKiid~k~pk~GI~ELr~lDPr~i~~vR~i~~--~~~~ 175 (537) T protein:vir:10 98 KQSEKIKKLIRSEFDEILRLLDFDNRAYEIFRRWYVDGRLFFHKVIDPKKPRQGLVELRYVDPRKIRKVTEYEA--KRPE 175 (537) T ss_pred ccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeeeCCccceeeEeecc--cCCc Confidence 222 222 22221 26788888999999999999999988743 667789999998865532110 0001 Q ss_pred EEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc---eEEec------CCCCcCccHHH Q lcl|NC_021326. 147 FIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIPFK------NNDLEISDIFM 217 (445) Q Consensus 147 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~------n~~~g~s~~~~ 217 (445) .++.... ...+.. ....+|.+....... .+..--+|| +++.. |+....|-+.. T Consensus 176 ~~~~~~~----~~~v~~-~~~eyf~ynp~g~~~-------------~~~~~vkI~~dAI~y~hSGl~d~n~~~i~syLhk 237 (537) T protein:vir:10 176 ALRTQDL----NQQLTQ-QSASYFLYNPKGLKN-------------STNQGMKIAPDSIAYCHSGIQDLNKNMVLSHLHK 237 (537) T ss_pred cceEEec----ceeeee-cccceeeeccccccc-------------cCCCceeccHhheeeecccceeCCCCeeeeeehh Confidence 1111000 000000 111112222111100 001111233 12221 22233455544 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccc-----hhHHHhh----h--------------------hCceee Q lcl|NC_021326. 218 YKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQEL-----PEFKRLL----R--------------------YYGAIK 268 (445) Q Consensus 218 v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~-----~~~~~~~----~--------------------~~~~~~ 268 (445) .+.....|- ++-|.+...+..+.|-+=+.-.+..+. +...+.+ + .-.-++ T Consensus 238 AiKp~NQLk-m~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyW 316 (537) T protein:vir:10 238 AIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFW 316 (537) T ss_pred hhHHHHhhH-HHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhhhhhc Confidence 332222221 345555556666666432221111111 1111111 1 000122 Q ss_pred cc--CCC-ceeeEeccC--ChHHHHHHHHHHHHHHHHHhCccccccccccC-cc-hHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 269 VS--DNG-GVDTIQVEV--PVENSKKYLDELYQKIMLFGQAVDFSSDKFGS-AP-SGVALEFLYTNLNLKADKLARKAKV 341 (445) Q Consensus 269 ~~--~~~-~~~~l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~-~~-Sg~Ai~~~~~~l~~k~~~~~~~~~~ 341 (445) ++ +|+ +.+.-|.++ +... ..-+.-+++.++..-.+|---.+.-++ +. -|..|-..+.....-+.+.+..|.. T Consensus 317 LPRReGgrgTEItTLpGgqnlge-m~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~ 395 (537) T protein:vir:10 317 LPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSE 395 (537) T ss_pred ccccCCCcccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHH Confidence 32 122 123333332 3333 333666677777777776321111111 11 1223444444455666777888888 Q ss_pred HHHHHHHHHHHHhccC--CCc----ceEEEEeCCCCCCCHHHHHHH-------HHHHh---c-cCChHHHHHhCCCCCC- Q lcl|NC_021326. 342 AIQELLWFVFEHFDIK--GEH----KDVDISFNYNKVANTELQVQT-------AQQSM---G-IVSHETVLENHPFVED- 403 (445) Q Consensus 342 ~l~~~~~~~~~~~~~~--~~~----~~i~v~f~~~~p~d~~~~~~~-------~~~~~---g-~~s~et~l~~l~~~~d- 403 (445) .+.++|+.=+-+-|+- .++ ..|.+.|...-.-.+...++. +..+. | .+|.+++++.+--.+| T Consensus 396 lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tDe 475 (537) T protein:vir:10 396 LFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQTES 475 (537) T ss_pred HHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccCHH Confidence 8888777544333432 222 457788865444334444433 33332 4 4799999987644443 Q ss_pred -HHHHHHHHHHHHHHHHHhhhcccc----CCCCCC-----CCCCCCCCcCCC Q lcl|NC_021326. 404 -LQAELERIEQEQMEYNKQLPNLDD----GGADGA-----QQKERSNDKQSE 445 (445) Q Consensus 404 -~~~E~~ri~~E~~~~~~~~~~~~~----~~~~~~-----~~~~~~~d~~~~ 445 (445) ..++-+.|++|..+-.-.-+.... +.+++. .+..+.++..++ T Consensus 476 eI~~~~k~I~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (537) T protein:vir:10 476 EIKEIDKEIKQEIADGVIMDPQAMQAMEMGIGDEEPVPEGGEEPQTDPNSAV 527 (537) T ss_pred HHHHHHHHHHHHhhCCCCCCcccccccccCCCCcccCCCCCCCcccCCccCC Confidence 445566777776542111111111 111111 011111111111 No 220 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=95.92 E-value=0.0013 Score=36.48 Aligned_cols=417 Identities=12% Similarity=0.023 Sum_probs=157.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeecc---- Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH---- 76 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~---- 76 (445) .-.+...+-+..-..+.. .++|.++. +.. |.+......... =.+++...+|+..+..+.|-++.+.. T Consensus 18 ~~~~~~~~~~~~~~~~~~-~~~~~~~~-~~~-p~~~~~~L~~~~------e~~~~~~~~i~~~~~~iag~g~~~~~~~~~ 88 (651) T protein:vir:99 18 LGGEADLAKSPNSTQIPD-HRIQSHNV-GVN-PPYNPDRLAAFL------ELNETLATGIRKKSRYEVGFGFDLVPAQGV 88 (651) T ss_pred ccccccccccccccccch-hhhcccCC-CCC-CCCCHHHHHHHH------hcChHHHHHHHHHhhhhhccCceeeecccC Confidence 000111000000011111 12333332 221 222111111110 12578899999999999888766532 Q ss_pred --C--chHHHHHHHHHhcc----------------CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEE Q lcl|NC_021326. 77 --T--DDEVIKRIDEVLGN----------------RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPI 135 (445) Q Consensus 77 --~--d~~~~~~l~~~~~n----------------~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v 135 (445) + .++-.+..+.+|.+ ........+..+...+|.+|+-+..+..|++ .+..++|..+- + T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~~g~pv~L~~lp~~~~R-v 167 (651) T protein:vir:99 89 DGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDIEGRPVGLAYVPARTVR-V 167 (651) T ss_pred CCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcCccchhhhhhcChhhee-e Confidence 1 22233344444432 2234445556677778888877766655543 22223332111 0 Q ss_pred EcCCC--------------CCceEE---------------EEEEEeeecceeEEEE--ecceEEEEEEecceeeecc--- Q lcl|NC_021326. 136 WTDKE--------------HEELEA---------------FIRMYKLENETKVEYW--DKITVNYYVYENGSLIPDY--- 181 (445) Q Consensus 136 ~d~~~--------------~~~~~~---------------~v~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~--- 181 (445) ..... ...+.. ++..+.........++ ....+.............. T Consensus 168 ~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~~~~~~d~~~~~~~~~~ 247 (651) T protein:vir:99 168 RRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEPTIRYREDEESEREPIFV 247 (651) T ss_pred ecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcceeEEeccCcceeeeeecc Confidence 00000 000000 0000000000000000 0000000000000000000 Q ss_pred cccccccccccccccccccc---eEEecC-----CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEE--ecCCc Q lcl|NC_021326. 182 SNNLENSKTHFSTGSWGKIP---FIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL--TNYDD 251 (445) Q Consensus 182 ~~~~~~~~~~~~~~~~g~iP---vv~~~n-----~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~--~g~~~ 251 (445) ........ .........+| |++|++ ...|.|.+..+...+........-..+-+...+.|-.++ +|... T Consensus 248 ~~~~g~~~-~~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i~~a~~a~~~~~~~f~NG~~p~gil~~~~~~l 326 (651) T protein:vir:99 248 DRETGDVT-TGDANGLENRPANELIFIPNPSILEDDYGVPDWVSAIRTISADEAAKDYNRDFFDNDTIPRMVIKVTGGEL 326 (651) T ss_pred cceeeeEE-EcCCCceeEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCC Confidence 00000000 00001111122 566653 235788887766666655555444555556666676664 34321 Q ss_pred --ccchhHHHhhh-----hCceeeccC---------CCceeeEeccC---ChHHHHHHHHHHHHHHHHHhCccccccccc Q lcl|NC_021326. 252 --QELPEFKRLLR-----YYGAIKVSD---------NGGVDTIQVEV---PVENSKKYLDELYQKIMLFGQAVDFSSDKF 312 (445) Q Consensus 252 --~~~~~~~~~~~-----~~~~~~~~~---------~~~~~~l~~~~---~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~ 312 (445) +........+. .++++.++. +.+++|..... ....+.+..+.....|...-++|....+.. T Consensus 327 s~e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfle~r~~~~~eIa~afgVPp~~lG~~ 406 (651) T protein:vir:99 327 SEESKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFRQFREKNEHEIAKVLEVPPVKIGVT 406 (651) T ss_pred CHHHHHHHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCcCchhhHHHHHHHHHHHHHHHHHhCCCHHHhccC Confidence 12222222121 223343332 23556655443 234556666777777888888886443322 Q ss_pred cCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----cCC--CcceEEEEeC--CCCCCCHHHHHHHHH Q lcl|NC_021326. 313 GSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD-----IKG--EHKDVDISFN--YNKVANTELQVQTAQ 383 (445) Q Consensus 313 ~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~-----~~~--~~~~i~v~f~--~~~p~d~~~~~~~~~ 383 (445) .. .+...++.... ..+...|.-+++.+...+. ... ....+.+.|+ ..+..|....++.+. T Consensus 407 ~~-~~~sn~E~~~~----------~f~~~tL~P~~~~ie~eln~kLl~~~e~~~~~~i~~ef~~~~llr~D~~~~~e~~~ 475 (651) T protein:vir:99 407 DS-ANRSNSDQQDK----------DFALEVIQPEQHTFAEWLYQIIHQQALGVTDWTIEYELRGADQPKQEAQLAEQRVR 475 (651) T ss_pred CC-CCcccHHHHHH----------HHHHHHHHHHHHHHHHHHHHhhcCccccccCceEEEEeccchhhhccHHHHHHHHH Confidence 11 11111111111 1111222222222222111 111 1224555664 345568888888877 Q ss_pred HH--hccCChHHHHHhCCC--CCCHHH--HHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 384 QS--MGIVSHETVLENHPF--VEDLQA--ELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 384 ~~--~g~~s~et~l~~l~~--~~d~~~--E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++ .|+++.-.++++++. +++... -+..++ .........++.+.+.++..+.++.++ T Consensus 476 ~~i~~G~~T~NE~R~~lglppi~~~~gd~~l~~~~------~~~~g~~~~gge~~~~~~~~~~~~~~~ 537 (651) T protein:vir:99 476 AMRLAGVGLVDEAREELGLDPLGEPYGEMTLSEFE------AEVAGDVAGGGETEAVHEPPEENKIGE 537 (651) T ss_pred HHHhCCCcCHHHHHHHhCCCCCCCccccccccccc------cccccccccCCCCcccccCcccccccc Confidence 75 589999999988754 322110 011110 000011111111111111111111111 No 221 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=95.58 E-value=0.0018 Score=35.61 Aligned_cols=362 Identities=11% Similarity=0.056 Sum_probs=153.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeee--ccCc Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF--KHTD 78 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~--~~~d 78 (445) -+.......-.. .. ..|.. +. .+.-+.++-...+|+..++-+.+-|+.+ ...+ T Consensus 33 ~~~~~~~~~~~~------~s--~~g~~--------v~---------~~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~~ 87 (432) T protein:vir:10 33 PVNATARDLGII------IS--DTGAA--------VN---------ADAIMRLDAVAACVKLVSQAIAAMPLTMYMRTPD 87 (432) T ss_pred cCcchhhhhccc------cc--ccCcc--------cc---------hhhhhcchHHHHHHHHHHHhhhhCceeEEEecCC Confidence 111111110000 00 01110 00 0000112233445566665555556653 1111 Q ss_pred h---HHHH-HHHHHhc--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEE Q lcl|NC_021326. 79 D---EVIK-RIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFI 148 (445) Q Consensus 79 ~---~~~~-~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v 148 (445) . .... ...-+.. | .-..+...+..+.+.+|.+|+.+..+ +|++ .+..++|..+.++.+.. +.+.+ T Consensus 88 g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~~~~v~v~~~~~--g~~~y-- 162 (432) T protein:vir:10 88 GRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTK--GNTAY-- 162 (432) T ss_pred CcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEEcCC--CcEEE-- Confidence 1 1111 1222221 2 23455667788899999999988775 4765 47778999988876642 22221 Q ss_pred EEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCcCccHHHHHHHHHH Q lcl|NC_021326. 149 RMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLEISDIFMYKTLIDA 224 (445) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g~s~~~~v~~lid~ 224 (445) ++... +.... .+ .. + . |++++ +...|.|.+......++. T Consensus 163 ~~~~~-~g~~~---------~~--~~----------------~-------~--iih~~~~~~dg~~G~spi~~~~~~i~~ 205 (432) T protein:vir:10 163 RYRRT-DGQMI---------DI--PK----------------Q-------Q--IWKIMGYSLDGENGLSAIRYGAQIFGT 205 (432) T ss_pred EEEec-CceEE---------EE--cC----------------c-------c--EEEecCCCCCCcccccHHHHHHHHHHH Confidence 11100 00000 00 00 0 0 12221 123477777665555554 Q ss_pred HHHHHHHHHHHHHHhcCCeeEEecCCc---ccchhHHHhh----hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHH Q lcl|NC_021326. 225 YNRRLSDLSNTFKDSNELTYVLTNYDD---QELPEFKRLL----RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQK 297 (445) Q Consensus 225 ~~~~~s~~~~~~~~~~~~~l~~~g~~~---~~~~~~~~~~----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~ 297 (445) ......-..+.+...+.|-.++..... +..+.+...+ ...+++.++++.+.+.++.......+.+..+..... T Consensus 206 ~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~ 285 (432) T protein:vir:10 206 AIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFAKKVSGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVES 285 (432) T ss_pred HHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHHHHHhhhhhCCCceecCCCceEEEccCChHHHHHHHHHHHHHHH Confidence 443433344445666677666653221 1122222222 234466666666655555444445566667777788 Q ss_pred HHHHhCcccccccccc--CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhccCCCcceEEEEe--C Q lcl|NC_021326. 298 IMLFGQAVDFSSDKFG--SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFE-----HFDIKGEHKDVDISF--N 368 (445) Q Consensus 298 i~~~s~~p~~~~~~~~--~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~-----~~~~~~~~~~i~v~f--~ 368 (445) |+..-++|....+... +...+..++.....+. ...|.-.+..+.. ++.. .+.....+.| . T Consensus 286 Ia~afgVPp~~lg~~~~~t~~~~sn~e~~~~~f~----------~~tl~P~~~~ie~~ln~kL~~~-~~~~~~~~~fd~~ 354 (432) T protein:vir:10 286 ICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFL----------SMTLSPWLRRIEQSIALNLLSP-AERRRYFADFDTS 354 (432) T ss_pred HHHHhCCCHHHcCCccCCcccccchHHHHHHHHH----------HHHHHHHHHHHHHHHHhhhcCc-cccCceEEEeech Confidence 8888888875544322 1222333332222221 1122222222211 1111 1122344455 4 Q ss_pred CCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHhhhccc-cCCCC-CCCCCCCCCCc Q lcl|NC_021326. 369 YNKVANTELQVQTAQQS--MGIVSHETVLENHPF--VEDLQAELERIEQEQMEYNKQLPNLD-DGGAD-GAQQKERSNDK 442 (445) Q Consensus 369 ~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~--~~d~~~E~~ri~~E~~~~~~~~~~~~-~~~~~-~~~~~~~~~d~ 442 (445) ..+..|..+.++.+.++ .|+++.-.++++++. +++- ..+-.+. ... ..+.... ....+ ....+++.+++ T Consensus 355 ~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi~g~-~~~~~~~---~~~-~pl~~~~~~~~~~~~~~~~~~~~~~ 429 (432) T protein:vir:10 355 ALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGN-AAVLTVQ---SAM-VPLDSIGLQASPEPASGLGNQQQDK 429 (432) T ss_pred hhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC-cceEeec---Ccc-cchhhhcccCCCCCCCCCCCccccc Confidence 55667888899888876 489999999988754 2210 0100000 000 0000000 00000 00011111111 Q ss_pred CCC Q lcl|NC_021326. 443 QSE 445 (445) Q Consensus 443 ~~~ 445 (445) -++ T Consensus 430 ~~~ 432 (432) T protein:vir:10 430 VSK 432 (432) T ss_pred ccC Confidence 111 No 222 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=95.05 E-value=0.0029 Score=34.52 Aligned_cols=378 Identities=8% Similarity=0.016 Sum_probs=162.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCcccccc-ccc----cccccc-cccccccccccchHHHHHHHHHhhhhccCeee Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEP-KPV----DATGAV-DPLKPDDRMITNFHANLVDQKVSYIVGKPIAF 74 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~-~~~----~~~~~~-~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~ 74 (445) ---++..+...+..-+.+++..+.|........ ... ...... ...-...-+.+.=...+|+..++-+.+-|+.+ T Consensus 2 ~~~~~~~~~~~~~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~v 81 (424) T protein:vir:18 2 EEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDV 81 (424) T ss_pred CCCccccccCCCCchHHHHHhhccccccccccchhhccccccccccccccccHHHhhccHHHHHHHHHHHHhhccCceEE Confidence 000000011112223334455565542111000 000 000000 00000000111223346666666666666654 Q ss_pred -cc-Cch---H--HHHHHHHHh-c--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCC Q lcl|NC_021326. 75 -KH-TDD---E--VIKRIDEVL-G--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKE 140 (445) Q Consensus 75 -~~-~d~---~--~~~~l~~~~-~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~ 140 (445) .. .+. . ....+-..+ . | .-..+...+..+.+.+|.+|+++..+..|++ .+..++|..+.+..+. T Consensus 82 y~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~-- 159 (424) T protein:vir:18 82 FETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG-- 159 (424) T ss_pred EEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEcC-- Confidence 11 111 1 111122222 1 2 3445666778899999999999998888885 4677888887765442 Q ss_pred CCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCcCccHH Q lcl|NC_021326. 141 HEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLEISDIF 216 (445) Q Consensus 141 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g~s~~~ 216 (445) +.+.+ + |... .....+... -|++++ +...|.|.+. T Consensus 160 -~~~~y--~-~~~~--g~~~~~~~~-----------------------------------eVihir~~~~dg~~G~spi~ 198 (424) T protein:vir:18 160 -KKVVY--R-YQRD--SEYADFSQK-----------------------------------EIFHLKGFGFTGLVGLSPIA 198 (424) T ss_pred -CeEEE--E-EEeC--CeEEEeccc-----------------------------------cEEEecCcCCCCcccccHHH Confidence 12211 1 1100 000000000 123332 1234667666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCc----ccchhHHHhhh-------hCceeeccCCCceeeEeccCChH Q lcl|NC_021326. 217 MYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDD----QELPEFKRLLR-------YYGAIKVSDNGGVDTIQVEVPVE 285 (445) Q Consensus 217 ~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~----~~~~~~~~~~~-------~~~~~~~~~~~~~~~l~~~~~~~ 285 (445) .....+...........+.+...+.|-.++.-... +..+..+..+. .++++.++++.+.+.+....... T Consensus 199 ~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~ 278 (424) T protein:vir:18 199 FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDA 278 (424) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCCHHHHHHHHHHHHHHhCCcccCCceeccCCceEEecCCChhHH Confidence 55555554444444444555666677655543221 11111111111 23355566655555554443445 Q ss_pred HHHHHHHHHHHHHHHHhCcccccccccc-CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----cCC- Q lcl|NC_021326. 286 NSKKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD-----IKG- 358 (445) Q Consensus 286 ~~~~~i~~l~~~i~~~s~~p~~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~-----~~~- 358 (445) .+.+..+.....|+..-++|....+... ++..+..++.....+. ...|.-+++.+..-+. ... T Consensus 279 q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~----------~~tl~P~~~~ie~~ln~~L~~~~~~ 348 (424) T protein:vir:18 279 EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL----------QYTLQPYISRWENSIQRWLIPSKDV 348 (424) T ss_pred HHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHH----------HHHHHHHHHHHHHHHHhhcCCcccc Confidence 5666677777788888888865443322 2232333443322222 2222222222222111 111 Q ss_pred CcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHh-hhccccCCCCCCCC Q lcl|NC_021326. 359 EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQ-LPNLDDGGADGAQQ 435 (445) Q Consensus 359 ~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~-~~~~~~~~~~~~~~ 435 (445) ....+++.+...+..|..+.++.+.++ .|+++.-.++++++.-+-+.. +.. .... .......+ ++ T Consensus 349 ~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gg--D~~------~~~~n~~~l~~~~----~~ 416 (424) T protein:vir:18 349 GRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNMPPLPGG--DVA------MRQAQYVPITDLG----TN 416 (424) T ss_pred CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--Cee------eeccCccchhhhh----cc Confidence 123355555667778999999988887 589999999988754211000 000 0000 00000000 00 Q ss_pred CCCCCCcC Q lcl|NC_021326. 436 KERSNDKQ 443 (445) Q Consensus 436 ~~~~~d~~ 443 (445) .+..++.+ T Consensus 417 ~~~~~n~a 424 (424) T protein:vir:18 417 KEPRNNGA 424 (424) T ss_pred CCccccCC Confidence 01111111 No 223 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=95.04 E-value=0.0029 Score=34.49 Aligned_cols=378 Identities=8% Similarity=-0.002 Sum_probs=164.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCcccccc-cc-----ccccccccccccccccccchHHHHHHHHHhhhhccCeee Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEP-KP-----VDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF 74 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~-~~-----~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~ 74 (445) ---++.-....+..-+.++...+.|........ .. ..............-+.+.-.-.+|+..++-+.+-|+.+ T Consensus 2 ~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~~ 81 (424) T protein:vir:18 2 EEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLDV 81 (424) T ss_pred CCCcceEeecCCCchHHHHHhhhcccccccccccccccccccccccccccccHHHhhccHHHHHHHHHHHHhhccCceEE Confidence 000000001112233344455555432110000 00 000000000000000111222346666666666666654 Q ss_pred -ccC-ch---H--HHH-HHHHHhc--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCC Q lcl|NC_021326. 75 -KHT-DD---E--VIK-RIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKE 140 (445) Q Consensus 75 -~~~-d~---~--~~~-~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~ 140 (445) ..+ +. . ... ....+.. | .-..+...+..+.+.+|.+|+++..+..|++ .+..++|..+.+..+. T Consensus 82 ~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~pl~~~~V~v~~~~-- 159 (424) T protein:vir:18 82 FETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG-- 159 (424) T ss_pred EEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEcC-- Confidence 111 11 1 111 2222221 2 2455666778899999999999999988986 4777888887765443 Q ss_pred CCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCcCccHH Q lcl|NC_021326. 141 HEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLEISDIF 216 (445) Q Consensus 141 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g~s~~~ 216 (445) +.+.+ + |... .....+.. . -|++++ +...|.|.+. T Consensus 160 -~~~~y--~-~~~~--g~~~~~~~---------------------------------~--eIih~r~~~~dg~~G~spi~ 198 (424) T protein:vir:18 160 -KKVVY--R-YQRD--SEYADFSQ---------------------------------K--EIFHLKGFGFTGLVGLSPIA 198 (424) T ss_pred -CeEEE--E-EEeC--CeEEEecc---------------------------------c--cEEEecCcCCCCcccccHHH Confidence 12111 0 1100 00000000 0 123332 2235777776 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCc----ccchhHHHhh-------hhCceeeccCCCceeeEeccCChH Q lcl|NC_021326. 217 MYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDD----QELPEFKRLL-------RYYGAIKVSDNGGVDTIQVEVPVE 285 (445) Q Consensus 217 ~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~----~~~~~~~~~~-------~~~~~~~~~~~~~~~~l~~~~~~~ 285 (445) .+...++..........+.+...+.|-.++.-... +.....+..+ ..++++.++++.+.+.+....... T Consensus 199 ~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~~g~nag~~~vl~~g~~~~~l~~~~~d~ 278 (424) T protein:vir:18 199 FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDA 278 (424) T ss_pred HHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHHHHHHHHHHHHhCCcccCCceeccCCceEEecCCChhHH Confidence 66666655444444445556677777666543221 1111112111 123355666665555554444445 Q ss_pred HHHHHHHHHHHHHHHHhCccccccccccCc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----CC-- Q lcl|NC_021326. 286 NSKKYLDELYQKIMLFGQAVDFSSDKFGSA-PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI----KG-- 358 (445) Q Consensus 286 ~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~----~~-- 358 (445) .+.+..+...+.|+..-++|....+...++ ..+..++.....+. ...|.-.++.+...+.. .. T Consensus 279 q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~----------~~tl~P~~~~ie~~l~~~L~~~~~~ 348 (424) T protein:vir:18 279 EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL----------QYTLQPYISRWENSIQRWLIPAKDV 348 (424) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHH----------HHHHHHHHHHHHHHHHhhcCCcccc Confidence 566667777788888888886554433222 22333333222221 22222222222221111 11 Q ss_pred CcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHh-hhccccCCCCCCCC Q lcl|NC_021326. 359 EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQ-LPNLDDGGADGAQQ 435 (445) Q Consensus 359 ~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~-~~~~~~~~~~~~~~ 435 (445) ....+++.+...+..|..+.++.+.++ .|+++.-.++++++.-.- +.-++. .... .......+ .+ T Consensus 349 ~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi--~gGD~~------~~~~n~~~l~~~~----~~ 416 (424) T protein:vir:18 349 GRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPL--PGGDVA------MRQSQYVPITDLG----TN 416 (424) T ss_pred CCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCee------eeccCccchHhhh----cc Confidence 122345555666778999999998887 589999888888754210 000000 0000 00000000 00 Q ss_pred CCCCCCcC Q lcl|NC_021326. 436 KERSNDKQ 443 (445) Q Consensus 436 ~~~~~d~~ 443 (445) .+..++-+ T Consensus 417 ~~p~~~ga 424 (424) T protein:vir:18 417 KEPRNNGA 424 (424) T ss_pred CCCccCCC Confidence 11111111 No 224 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=94.99 E-value=0.003 Score=34.41 Aligned_cols=377 Identities=12% Similarity=0.048 Sum_probs=169.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccccccccccccccccc--ccchHHHHHHHHHhhhhccCeeeccCc Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRM--ITNFHANLVDQKVSYIVGKPIAFKHTD 78 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri--~~n~~~~iv~~~~~~l~g~~~~~~~~d 78 (445) +.++..- .........-|. +.-+++.+. .+..... .+.--.+ ......-...+....+.+-+.++.+.+ T Consensus 13 ~~~~~~~----~~~~~~~~~g~~-~~D~~lr~~---gg~~~~~-~~l~~~m~e~D~~v~s~l~~Rk~av~~~~w~V~p~~ 83 (446) T protein:vir:98 13 IRRRTIY----AMEHLGLATSYL-SEDGGYKRA---GKPTYQQ-LSAWDEAAQTEPIIAQGLDSIALSVLNKVGPYQHGD 83 (446) T ss_pred hhhhhhh----ccccchhhcccC-CcchHhhhc---CCChHHH-HHHHHHHHhcchHHHHHHHHHHHHhhcCCceecCcc Confidence 2222221 111121111122 111111110 0000000 0000000 134455566666666778888888888 Q ss_pred hHHHHHHHHHhccCHHHHHHHHHHHHHhcCeEE-EEEEECCCCc-EEEEEE------ccceeEEEEcCCCCCceEEEEEE Q lcl|NC_021326. 79 DEVIKRIDEVLGNRFDDKLHSVLTGASNKGIEW-LHPYLDEEGE-FKLFRV------PAEQGIPIWTDKEHEELEAFIRM 150 (445) Q Consensus 79 ~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~-~~v~~d~~g~-~~i~~~------~p~~~~~v~d~~~~~~~~~~v~~ 150 (445) ++..+++.+++.+-...-......++..||.++ +++|.-..|. ...+++ .|...--.++.. ..+..+ T Consensus 84 ~~~a~~v~~~l~~~~~~~~~~~~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~~~~~~~~~r~~~~~~--~~~~~~--- 158 (446) T protein:vir:98 84 KRIKKFIDDQLRNRAKTWISHCVKSIMTYGFSLSEQIYAHGARDNMPATVLDDIVNYHPLQVMLIANDN--GRIVDG--- 158 (446) T ss_pred HHHHHHHHHHHhhcCchhHHHHHHHHHhhCceeeeEEEeecccccccchhhccccccccccceeeeccC--Cccccc--- Confidence 888899999987532233334467888899864 5677544332 111222 221111011111 110000 Q ss_pred EeeecceeEEEEecceEEEEEEecceeeeccccccccc---ccccccccccccce---EEe-c----CCCCcCccHHHHH Q lcl|NC_021326. 151 YKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENS---KTHFSTGSWGKIPF---IPF-K----NNDLEISDIFMYK 219 (445) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~g~iPv---v~~-~----n~~~g~s~~~~v~ 219 (445) .......+. ............... ......+ -.||+ +.+ . .++.|.|.+..+- T Consensus 159 ------------~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~g~~--~~iP~~kfi~~~~~~~~~~p~G~gLlr~~~ 222 (446) T protein:vir:98 159 ------------DTVTASQYK--SGYWVPLPPYRIGDPPKKVDVVGSH--VRLPSHKRLFINYNTKGNNPWGTSCLTSVL 222 (446) T ss_pred ------------cccchhhcc--cccccCcccchhhhhhhhcccCccc--ccccccceEEEEecCCCCCccccchHHHHH Confidence 000000000 000000000000000 0000000 01342 222 1 3467888887655 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCeeEEec---CCcccch-------------hHHHhh---hhCceeec-----cCCCce Q lcl|NC_021326. 220 TLIDAYNRRLSDLSNTFKDSNELTYVLTN---YDDQELP-------------EFKRLL---RYYGAIKV-----SDNGGV 275 (445) Q Consensus 220 ~lid~~~~~~s~~~~~~~~~~~~~l~~~g---~~~~~~~-------------~~~~~~---~~~~~~~~-----~~~~~~ 275 (445) -.----+..+-+++.-++.+..|+++.+- ....+.+ ...+.+ ....+..+ +++..+ T Consensus 223 w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~da~~ii~~~~~P~g~ei 302 (446) T protein:vir:98 223 DYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRLSTDSGLVLTQLSKEQPVQV 302 (446) T ss_pred HHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhccccceeeeecccCCCCceE Confidence 54444466777888889999999998763 3222111 111122 22333333 778889 Q ss_pred eeEeccCCh-HHHHHHHHHHHHHHHHHhCccccccccc-cCcchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Q lcl|NC_021326. 276 DTIQVEVPV-ENSKKYLDELYQKIMLFGQAVDFSSDKF-GSAPSGVALEFLYTNLNLKADKLARKAKVAIQ-ELLWFVFE 352 (445) Q Consensus 276 ~~l~~~~~~-~~~~~~i~~l~~~i~~~s~~p~~~~~~~-~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~~~~~ 352 (445) ++++...+. ..++.+++.+.+.|...--..-+..+.. ++..|...=+....-....++.-.+.+...+. ++++-++. T Consensus 303 e~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala~vh~~V~~d~~~aDa~~i~~tln~~Li~~l~~ 382 (446) T protein:vir:98 303 GALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRASEIQLELFDGKINSIFDTVIHAFTEQVIGNLIR 382 (446) T ss_pred EeeccccCChhhHHHHHHHHHHHHHHHHhcccccccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 999876543 4688999999988877543222222111 11112111111112222233444556666664 57777776 Q ss_pred HhccCCCc-ce---EEEEeCCCCCCCHHHHHHHHHHHh--cc-CC--hHHHHHhCCCCCCHHHHH Q lcl|NC_021326. 353 HFDIKGEH-KD---VDISFNYNKVANTELQVQTAQQSM--GI-VS--HETVLENHPFVEDLQAEL 408 (445) Q Consensus 353 ~~~~~~~~-~~---i~v~f~~~~p~d~~~~~~~~~~~~--g~-~s--~et~l~~l~~~~d~~~E~ 408 (445) +....... .. -.+.|....+.|....++.+.+++ |+ ++ .+.+.++++. ++.++.- T Consensus 383 lNf~~~~~~~~~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~~~~~ire~~gi-P~~~~~~ 446 (446) T protein:vir:98 383 LNFDPALYPLASNTGYITRLPGRATDLAALVEAIKQMHDMGFLVDGDKDHIRSITGL-PDAISST 446 (446) T ss_pred hCCCccccccccccccceeccCChhhHHHHHHHHHHHHhCCccccccHHHHHHHhCc-CCCCCCC Confidence 55432221 11 124566567788888899888874 65 44 4456666643 3211100 No 225 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=94.97 E-value=0.0031 Score=34.37 Aligned_cols=393 Identities=13% Similarity=0.141 Sum_probs=165.9 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhh-hhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY-IVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~-l~g~~~~~~~~d~ 79 (445) +-.. ++.-.+.+.+|..+..+++-+.. ...||+..+-+ ...+|+.+..++- T Consensus 45 ~e~~-~~~~~eLI~~YR~ma~~pEvd~A---------------------------v~eIVneaiv~d~~~~pV~i~Ld~~ 96 (533) T protein:vir:10 45 FDGQ-VRNEYQLISRYREMVLQPECDSA---------------------------VDDIVNETICGNFDDVPVSVELSNL 96 (533) T ss_pred cccc-cchHHHHHHHHHHHhhccchhhH---------------------------HHHhhcceeeecCCCceEEEEeccc Confidence 1110 11112223334333333332221 12233333222 1234566655443 Q ss_pred HHHHH----H----HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC----CcEEEEEEccceeEEEEcCCCCCceEE Q lcl|NC_021326. 80 EVIKR----I----DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE----GEFKLFRVPAEQGIPIWTDKEHEELEA 146 (445) Q Consensus 80 ~~~~~----l----~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~----g~~~i~~~~p~~~~~v~d~~~~~~~~~ 146 (445) ++.+. + +.++. =+|+...++..+...+.|+.|+..-+|.+ |-..+..+||+.+-+|..-. ....- T Consensus 97 ~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~ELr~lDPr~i~~vr~i~--~~~~~ 174 (533) T protein:vir:10 97 KVSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPDNPQGGLIELRYIDPRKIRKINETE--QKRPE 174 (533) T ss_pred ccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCCCccccceeeeeccccceeeeeeee--ccCCC Confidence 33322 2 22221 26788888999999999999998877643 55678999999886654210 00000 Q ss_pred EEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc---eEEecC------CCCcCccHHH Q lcl|NC_021326. 147 FIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIPFKN------NDLEISDIFM 217 (445) Q Consensus 147 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~n------~~~g~s~~~~ 217 (445) .++..... ..+ ......+|.+....... . . ..+ -+|| +++... +..-.|-+. T Consensus 175 ~~~~~~~~----~~v-~~~~~eyf~Ynp~g~~~-~-~----------~~~-vkI~~dAI~y~hSGl~d~~~~~i~syLh- 235 (533) T protein:vir:10 175 QLRGLPLN----QQL-SPKSAEYFLYDPKGLKN-S-T----------TQG-LKIAPDSICYVHSGIMDLNKNMTLSHLH- 235 (533) T ss_pred ccceeecc----hhh-hccceeeeeeccccccc-c-C----------CCc-eecchhheeeeeccceeCCCCceeccch- Confidence 01110000 000 11111222222211100 0 0 000 1222 222211 111123333 Q ss_pred HHHHHHHHHH--HHHHHHHHHHHhcCCeeEEecCCcccc-----hhHHHhh----h--------------------hCce Q lcl|NC_021326. 218 YKTLIDAYNR--RLSDLSNTFKDSNELTYVLTNYDDQEL-----PEFKRLL----R--------------------YYGA 266 (445) Q Consensus 218 v~~lid~~~~--~~s~~~~~~~~~~~~~l~~~g~~~~~~-----~~~~~~~----~--------------------~~~~ 266 (445) ..+..+|. ++-|.+...+..+.|-+=+.-.+..+. +...+.+ + .-.- T Consensus 236 --kAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlED 313 (533) T protein:vir:10 236 --KAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLED 313 (533) T ss_pred --HhHHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhh Confidence 33444443 345555666666666432221111111 1111111 1 0001 Q ss_pred eecc--CCC-ceeeEeccC--ChHHHHHHHHHHHHHHHHHhCccccccccccC-cc-hHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 267 IKVS--DNG-GVDTIQVEV--PVENSKKYLDELYQKIMLFGQAVDFSSDKFGS-AP-SGVALEFLYTNLNLKADKLARKA 339 (445) Q Consensus 267 ~~~~--~~~-~~~~l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~-~~-Sg~Ai~~~~~~l~~k~~~~~~~~ 339 (445) ++++ +|+ +.+.-|.++ +... ..-+.-+++.++..-.+|---.+.-++ +. -|..|-..+.....-+.+.+..| T Consensus 314 yWLPRReGgrgTEItTLpGgqnLge-m~DV~YF~kKLY~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rF 392 (533) T protein:vir:10 314 FWLPRREGGRGTEITTLPGGQNLGE-LEDVKYFQKKLYKSLNVPGSRLETETTFNVGRAAEITRDEVKFQKFVARLRKRF 392 (533) T ss_pred hcccccCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHH Confidence 2232 122 223333332 3333 333666677777777776321111111 11 12234444444556667778888 Q ss_pred HHHHHHHHHHHHHHhccC--CCc----ceEEEEeCCCCCCCHHHHHHH-------HHHH---hc-cCChHHHHHhCCCCC Q lcl|NC_021326. 340 KVAIQELLWFVFEHFDIK--GEH----KDVDISFNYNKVANTELQVQT-------AQQS---MG-IVSHETVLENHPFVE 402 (445) Q Consensus 340 ~~~l~~~~~~~~~~~~~~--~~~----~~i~v~f~~~~p~d~~~~~~~-------~~~~---~g-~~s~et~l~~l~~~~ 402 (445) ...+.++|+.=+-+-|+- .++ ..|.+.|...-.-.+...++. +..+ +| .+|++++++.+--.+ T Consensus 393 s~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~t 472 (533) T protein:vir:10 393 SELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQT 472 (533) T ss_pred HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccC Confidence 888888777544443432 222 457788865444334444433 3333 24 479999998764444 Q ss_pred C--HHHHHHHHHHHHHH--------HHHhhhcccc---CCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 403 D--LQAELERIEQEQME--------YNKQLPNLDD---GGADGAQQKERSNDKQSE 445 (445) Q Consensus 403 d--~~~E~~ri~~E~~~--------~~~~~~~~~~---~~~~~~~~~~~~~d~~~~ 445 (445) | ..++-+.|++|.++ .+...+...+ ++..+++...+.++++.| T Consensus 473 Deei~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 528 (533) T protein:vir:10 473 DVEMKEIDKQIESEMESGIIADPAAEMDPAMAAGDPDAGGAPAEEVAPEGPDPSDE 528 (533) T ss_pred HHHHHHHHHHHHHHHhCCCCCCCcchhhHHhcCCCCCcCCcccccCCCCCCCcchh Confidence 3 44556677777542 1111111111 111112223344444444 No 226 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=94.97 E-value=0.0031 Score=34.37 Aligned_cols=252 Identities=13% Similarity=0.031 Sum_probs=112.4 Q ss_pred hhccCeeeccCchHHHHHHHHHh-c--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCC Q lcl|NC_021326. 67 IVGKPIAFKHTDDEVIKRIDEVL-G--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDK 139 (445) Q Consensus 67 l~g~~~~~~~~d~~~~~~l~~~~-~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~ 139 (445) +..-|+.+...++.....+...+ . | ........+..+.+.+|.+|+.+..+.+|++ .+..++|..+.+..++. T Consensus 1 ia~l~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~~l~~~~v~v~~~~~ 80 (278) T protein:vir:78 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 80 (278) T ss_pred CccceeEEEecCcccccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEEEECCceeEEEEcCC Confidence 22233333222111111121121 1 2 3455677788899999999999999988975 57888898887766542 Q ss_pred CCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCcc Q lcl|NC_021326. 140 EHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISD 214 (445) Q Consensus 140 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~ 214 (445) . . ..+ |......+ ....+ +.. -|+++++ ...|.|. T Consensus 81 ~-~-~~~----y~~~~~~g-------~~~~~-------------------------~~~--evih~~~~~~~~~~~G~s~ 120 (278) T protein:vir:78 81 S-R-ELY----YSIHAATG-------NKLIV-------------------------HNM--DMLHFKHIVASNMVQGISP 120 (278) T ss_pred C-c-eEE----EEEEcCCc-------eEEEE-------------------------ccc--cEEEECCCCCCCCeeeccH Confidence 1 1 111 11100000 00000 001 1344432 2247787 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEe-cCC--cccchhHHHhh-----hhCceeeccCCCceeeEeccCChHH Q lcl|NC_021326. 215 IFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLT-NYD--DQELPEFKRLL-----RYYGAIKVSDNGGVDTIQVEVPVEN 286 (445) Q Consensus 215 ~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~-g~~--~~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~~~~~~ 286 (445) +..+...++........ +...+...|-.++. +.. .+.....+..+ ...+++.++++.+.+.+........ T Consensus 121 ~~~~~~~i~~~~~~~~~--~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~ 198 (278) T protein:vir:78 121 IDVLKNTTDFDNAVRTF--NLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSED 198 (278) T ss_pred HHHHHHHHHHHHHHHHH--HHHHhcCCCcEEEEeCCCCCHHHHHHHHHHHHHHhccCCCceecCCCceEEEccCChhHHH Confidence 77766666654443222 22222333433333 222 11111111111 1334566666665555544444455 Q ss_pred HHHHHHHHHHHHHHHhCcccccccccc-CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-----CCCc Q lcl|NC_021326. 287 SKKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI-----KGEH 360 (445) Q Consensus 287 ~~~~i~~l~~~i~~~s~~p~~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~-----~~~~ 360 (445) +.+..+...+.|+..-++|....+... ++-|. ++... ...+...+.-++..+...+.. ..-. T Consensus 199 ~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn--~~~~~----------~~~~~~~l~P~~~~i~~~ln~~L~~~~e~~ 266 (278) T protein:vir:78 199 IVASENLTRERVANVFQLPSVFLNARSNTNFAK--NEELN----------RFYLQHTLLPIVKQYEEEFNRKLLTKTDRE 266 (278) T ss_pred HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHHHH----------HHHHHHHHHHHHHHHHHHHHhhcCChhHhc Confidence 667777788888888888864443221 22111 11111 112222233333333222221 1111 Q ss_pred ceEEEEeCCCCC Q lcl|NC_021326. 361 KDVDISFNYNKV 372 (445) Q Consensus 361 ~~i~v~f~~~~p 372 (445) ....+.|+-+.- T Consensus 267 ~g~~~~f~~~~l 278 (278) T protein:vir:78 267 KIGILNLTLNLI 278 (278) T ss_pred CCceEEEecccC Confidence 223455543222 No 227 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=94.84 E-value=0.0034 Score=34.13 Aligned_cols=363 Identities=10% Similarity=0.042 Sum_probs=151.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeec--cCc Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFK--HTD 78 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~--~~d 78 (445) -+.-.....- -. .. ..|.+ + .... . +.+.=...+|+..++-+-.-|+.+- ..+ T Consensus 33 ~~~~~~~~~~----~~--~~--~~g~~-v------~~~~-------a---~~~~aV~~~v~~Ia~~ia~lp~~~y~~~~~ 87 (432) T protein:vir:97 33 PVNATARDLG----II--IS--DTGAA-V------NADA-------I---MRLDAVAACVKLVSQAVAAMPLMMYMRTPD 87 (432) T ss_pred cCchhhhhhc----cc--cc--ccCcc-c------chHh-------h---hcchHHHHHHHHHHHhhccCceEEEEecCC Confidence 1211111100 00 00 01111 0 0000 0 1111123344555554444465431 111 Q ss_pred ---hHHH-HHHHHHhc--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEE Q lcl|NC_021326. 79 ---DEVI-KRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFI 148 (445) Q Consensus 79 ---~~~~-~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v 148 (445) +... ....-+.. | .-..+...+..+.+.+|.+|+++..+ +|++ .+..++|..+.++.+.. +.+.+ T Consensus 88 g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~p~~v~v~~~~~--g~~~y-- 162 (432) T protein:vir:97 88 GRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTK--GNTAY-- 162 (432) T ss_pred CcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEEcCC--CcEEE-- Confidence 1111 11222211 2 23456667788899999999988776 4765 46778999888876542 32221 Q ss_pred EEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCcCccHHHHHHHHHH Q lcl|NC_021326. 149 RMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLEISDIFMYKTLIDA 224 (445) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g~s~~~~v~~lid~ 224 (445) ++. ..+..... +... . |++++ +...|.|.+......++. T Consensus 163 ~~~-~~~g~~~~-~~~~---------------------------------~--iih~r~~~~dg~~G~spi~~~~~~i~~ 205 (432) T protein:vir:97 163 RYR-RTDGQMID-IPRQ---------------------------------Q--IWKIMGYSLDGENGLSAIRYGAQIFGT 205 (432) T ss_pred EEE-ecCceEEE-Eccc---------------------------------c--EEEecCcCCCCcccccHHHHHHHHHHH Confidence 111 00000000 0000 0 12221 123477777666555555 Q ss_pred HHHHHHHHHHHHHHhcCCeeEEecCCc---ccchhHHHhh----hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHH Q lcl|NC_021326. 225 YNRRLSDLSNTFKDSNELTYVLTNYDD---QELPEFKRLL----RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQK 297 (445) Q Consensus 225 ~~~~~s~~~~~~~~~~~~~l~~~g~~~---~~~~~~~~~~----~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~ 297 (445) ...........+...+.|-.++.-... +....+...+ ..++++.++++.+.+.++.......+.+..+..... T Consensus 206 ~~a~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~ 285 (432) T protein:vir:97 206 AIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFSKKVSGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVES 285 (432) T ss_pred HHHHHHHHHHHHhccCCcceeEecCCCCCHHHHHHHHHHHhhhhcCCCceecCCCceEEEccCChhHHHHHHHHHHHHHH Confidence 444444444455666677555543221 1111222211 234466667666655554444445556667777788 Q ss_pred HHHHhCccccccccccC--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhccCC-CcceEEEEeCC Q lcl|NC_021326. 298 IMLFGQAVDFSSDKFGS--APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFE-----HFDIKG-EHKDVDISFNY 369 (445) Q Consensus 298 i~~~s~~p~~~~~~~~~--~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~-----~~~~~~-~~~~i~v~f~~ 369 (445) |+..-++|....+.... ...+..++....... ...|.-.++.+.. ++.... ....+++.+.. T Consensus 286 Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~~~~f~----------~~tl~P~~~~ie~~ln~kLl~~~e~~~~~~~fd~~~ 355 (432) T protein:vir:97 286 ICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFL----------TMTLSPWLRRIEQSIALNLLTPAERRRYFADFDTSA 355 (432) T ss_pred HHHHhCCCHHHcCCcCCcccccchhHHHHHHHHH----------HHHHHHHHHHHHHHHhhhccCccccCceEEEeechh Confidence 88888888654432221 112233332222221 1222222222211 111111 11234444455 Q ss_pred CCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHhhhccc-cCCCC-CCCCCCCCCCcC Q lcl|NC_021326. 370 NKVANTELQVQTAQQS--MGIVSHETVLENHPF--VEDLQAELERIEQEQMEYNKQLPNLD-DGGAD-GAQQKERSNDKQ 443 (445) Q Consensus 370 ~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~--~~d~~~E~~ri~~E~~~~~~~~~~~~-~~~~~-~~~~~~~~~d~~ 443 (445) .+..|..+.++.+.++ .|+++.-.++++++. +++ ...+-.+.. .. ..+.... ....+ ++..+++.+++- T Consensus 356 llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~~g-~~~~~~~~~---~~-~pl~~~~~~~~~~~~~~~~~~~~~~~ 430 (432) T protein:vir:97 356 LLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGG-NAAVLTVQS---AM-VPLDSIGLQASPEPASGLGNQQQDKV 430 (432) T ss_pred hhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC-CcceEeecc---cc-cchhhhcccCCCCCCCCCCCcccccc Confidence 6677899999998887 489999999888754 211 000000000 00 0000000 00000 000011111111 Q ss_pred CC Q lcl|NC_021326. 444 SE 445 (445) Q Consensus 444 ~~ 445 (445) ++ T Consensus 431 ~~ 432 (432) T protein:vir:97 431 SK 432 (432) T ss_pred cC Confidence 11 No 228 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=94.76 E-value=0.0036 Score=34.00 Aligned_cols=365 Identities=8% Similarity=-0.022 Sum_probs=156.0 Q ss_pred ChHHHHHHHHHHHHHH-HHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeec--cC Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEI-SIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFK--HT 77 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~-~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~--~~ 77 (445) ++.++-...+....+. .....++-+.... .... .....=...+....+|+..++-+..-|+.+- .+ T Consensus 3 ~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~--~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 71 (406) T protein:vir:95 3 LFDRWRRTKRKSKIRADTGYVGLFMSGEDV---------SFLV--PGYVRLSDNPEVRMAVHKIADLISSMTIYLMQNTE 71 (406) T ss_pred chhhhccccccccccccchhhhhhccCccc---------Cccc--cCHHHHhhcHHHHHHHHHHHHhhccCceEEEEecC Confidence 3322211000000000 0001111110000 0000 0000001234556678888877777777651 11 Q ss_pred c--hHH-HHHHHHHhc--c---CHHHHHHHHHHHHHhcCeEEE--EEEECCCCcE-EEEEEccceeEEEEcCCCCCceEE Q lcl|NC_021326. 78 D--DEV-IKRIDEVLG--N---RFDDKLHSVLTGASNKGIEWL--HPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEA 146 (445) Q Consensus 78 d--~~~-~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~--~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~ 146 (445) + ... ......+.. | ........+..+.+.+|.++. .+-.+..|++ .+..++|..+.++.+.. + T Consensus 72 ~~~~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l~~i~~~~v~~~~~~~--~---- 145 (406) T protein:vir:95 72 DGDIRIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDELVPLTPSKVNFLDTPD--G---- 145 (406) T ss_pred CcceeecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEEEEEcCceeEEEEcCC--e---- Confidence 1 111 112222221 2 345566777888888876644 4455666776 46778888877665542 0 Q ss_pred EEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC------CCCcCccHHHHHH Q lcl|NC_021326. 147 FIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN------NDLEISDIFMYKT 220 (445) Q Consensus 147 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n------~~~g~s~~~~v~~ 220 (445) .++. .. ... +..--|+|++. ...|.|.+..+.. T Consensus 146 -~~~~-----------~~----------~~~-------------------~~~~evih~~~~~~~~~~~~G~s~i~~~~~ 184 (406) T protein:vir:95 146 -YQVL-----------YG----------GQT-------------------FNYDEVLHFIYNPDPERPYIGRGYRVVLKD 184 (406) T ss_pred -EEEE-----------ec----------cEE-------------------EchhHEEEeeccCCCCCCccccCHHHHHHH Confidence 0000 00 000 00001344431 1247787777777 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCeeEEec---CCcccchhHHHhhh--------hCceeeccCCC-ceeeEe-ccCChHHH Q lcl|NC_021326. 221 LIDAYNRRLSDLSNTFKDSNELTYVLTN---YDDQELPEFKRLLR--------YYGAIKVSDNG-GVDTIQ-VEVPVENS 287 (445) Q Consensus 221 lid~~~~~~s~~~~~~~~~~~~~l~~~g---~~~~~~~~~~~~~~--------~~~~~~~~~~~-~~~~l~-~~~~~~~~ 287 (445) .++....+.......+...+.|-.++.- .+.+........+. .++.+.++.++ ..+-++ .......+ T Consensus 185 ~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d~q~ 264 (406) T protein:vir:95 185 IADNLKQATATKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFKKYLQATEAGQPWIIPAELLEVEQVKPLSLKDIAI 264 (406) T ss_pred HHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhccccccCCceeecCCCccccccccCChhHHHH Confidence 7766666655556666667777655543 22222222222221 12233344433 222222 22233445 Q ss_pred HHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---CCCcceEE Q lcl|NC_021326. 288 KKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI---KGEHKDVD 364 (445) Q Consensus 288 ~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~---~~~~~~i~ 364 (445) .+..+.....|+..-++|..-.+. +++.+... . ..+...|.-+++.+...+.. ......++ T Consensus 265 ~e~~~~~~~~Ia~~fgVp~~~lg~-~~~~~~~~-----~----------~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~ 328 (406) T protein:vir:95 265 NEAVELDKRTVAGMFGVPAFLLGI-GEFNRDEY-----N----------NFINSTILPIAKGIEQELTRKLLISPDLYFK 328 (406) T ss_pred HHHHHHHHHHHHHHhCCCHHHcCC-CCchHHHH-----H----------HHHHHHHHHHHHHHHHHHHHhcCCCCCcEEE Confidence 566777778888888888644321 22222111 1 12333344443333322221 11223455 Q ss_pred EEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCC-CCCCCCCCCCCC Q lcl|NC_021326. 365 ISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGG-ADGAQQKERSND 441 (445) Q Consensus 365 v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~-~~~~~~~~~~~d 441 (445) +.++..+..|..+.++.+.++ .|+++...++++++.-.. +..++..--. ............ ..++.. +++ + T Consensus 329 fd~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~--~~gd~~~~~~--n~~~~~~~~~~~~~k~g~~-~~~-~ 402 (406) T protein:vir:95 329 FNPRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPK--EGLSELVILE--NYIPLDKIGDQSKLKGGDN-SGA-D 402 (406) T ss_pred eechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcceeeecc--CccchhhcccccccCCCCC-CCC-C Confidence 556666678888899888776 589999999998865321 1111111000 000000000000 011111 111 1 Q ss_pred cCCC Q lcl|NC_021326. 442 KQSE 445 (445) Q Consensus 442 ~~~~ 445 (445) .+.| T Consensus 403 ~~~~ 406 (406) T protein:vir:95 403 GQTD 406 (406) T ss_pred CCCC Confidence 1111 No 229 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=94.61 E-value=0.0039 Score=33.77 Aligned_cols=378 Identities=13% Similarity=0.094 Sum_probs=160.9 Q ss_pred ChHHHHH-----HHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhh-hhccCeee Q lcl|NC_021326. 1 MIVRYIK-----QHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY-IVGKPIAF 74 (445) Q Consensus 1 ~l~~~i~-----~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~-l~g~~~~~ 74 (445) .....+. +-++-+.+|..+..+++-+.. ...||+..+-+ -..+|+.+ T Consensus 48 ~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~A---------------------------v~eIvne~iv~d~~~~pV~l 100 (511) T protein:vir:56 48 IIPSDAQSEGTIPVKELIKSYRALAEYHEVDDA---------------------------IQEIVDEAIVYENDKEVVWL 100 (511) T ss_pred eccccccccCccchHHHHHHHHHHhhccchhhH---------------------------HHHhhcceeEecCCCceEEE Confidence 0000000 001333444444444433322 12233322211 12345555 Q ss_pred ccCchHHHH--------HHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC-CcEEEEEEccceeEEEEcCCCCCce Q lcl|NC_021326. 75 KHTDDEVIK--------RIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE-GEFKLFRVPAEQGIPIWTDKEHEEL 144 (445) Q Consensus 75 ~~~d~~~~~--------~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~-g~~~i~~~~p~~~~~v~d~~~~~~~ 144 (445) ..++-++.+ ..+.++. =+|+...++..+...+.|+.|+..-.|++ |-..+..+||+.+-.|..- T Consensus 101 ~ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~k~GI~eLr~lDPr~i~~vr~i------ 174 (511) T protein:vir:56 101 NLDNTDFSENIKAKINEEFDRVVSLLQMRKHGYKWFRKWYVDSRIYFHKILDKDNNIIELRPLNPMKMELVREI------ 174 (511) T ss_pred EecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeccccceeehhhcCcccchhhhhh------ Confidence 544332222 2222221 26788888999999999999988877654 5456888899887554321 Q ss_pred EEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc---eEEec--------CCCCcCc Q lcl|NC_021326. 145 EAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIPFK--------NNDLEIS 213 (445) Q Consensus 145 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~--------n~~~g~s 213 (445) ..+...++.+.... ..+|.+..... .......... ....--+|| |++.. |+....| T Consensus 175 -------~~~~~~~~~v~~~~-~ey~~Y~~~~~-~~~~~~~~~~----~~~~~vkI~~daI~y~hSGL~d~~~~~g~i~s 241 (511) T protein:vir:56 175 -------QKETIDGVEVVKGT-LEYYVYKQSDY-KMPSWMSATN----RAQTSFRIPKDAIVFAHSGLMRGCADDPYIIG 241 (511) T ss_pred -------hcccccccccccce-eeeeEecCCCc-ccCccccccc----ccccceeechhheeeecccceeccCCCCeeec Confidence 11112222222211 22222222110 0000000000 001111233 22221 2222344 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCccc-----chhHHHhh------------------------hhC Q lcl|NC_021326. 214 DIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQE-----LPEFKRLL------------------------RYY 264 (445) Q Consensus 214 ~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~-----~~~~~~~~------------------------~~~ 264 (445) -+...+.....|- ++-|.+...+..+.|-+=+.-.+..+ .+...+.+ ..- T Consensus 242 yLhkAiKp~NQLk-m~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMl 320 (511) T protein:vir:56 242 YLDRAIKPANQLK-MLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQVKNTTNAMSML 320 (511) T ss_pred cchhhhHHHHhhH-HHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchhhhhhH Confidence 4544322222221 34555556666666643222111111 11111111 000 Q ss_pred ceeecc--CCC-ceeeEeccC--ChHHHHHHHHHHHHHHHHHhCccc--cccc----cccCcchHHHHHHHHHHHHHHHH Q lcl|NC_021326. 265 GAIKVS--DNG-GVDTIQVEV--PVENSKKYLDELYQKIMLFGQAVD--FSSD----KFGSAPSGVALEFLYTNLNLKAD 333 (445) Q Consensus 265 ~~~~~~--~~~-~~~~l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~--~~~~----~~~~~~Sg~Ai~~~~~~l~~k~~ 333 (445) .-+++| +|+ +.+.-|.++ +... ..-+.-+++.++..-.+|- +..+ .++.. -|..|-..+.....-+. T Consensus 321 EDyWLpRReGgrgTEItTLpGgqnlge-m~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~G-r~~EItRDEiKF~KFI~ 398 (511) T protein:vir:56 321 EDYYLPRREGSKGTEVSTLPGGQSLGD-IEDVLYFNRKLYKAMRIPTSRAASEDQTGGINFG-QGAEITRDELKFTKFVK 398 (511) T ss_pred hhhcccccCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCcccccCCCCccccccc-cchhhhHHHHHHHHHHH Confidence 012232 122 223333333 3333 3346667777788777773 2211 12101 23344444445556667 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccC--CCc----ceEEEEeCCCCCCCHHHHHHH-------HHHHh---c-cCChHHHHH Q lcl|NC_021326. 334 KLARKAKVAIQELLWFVFEHFDIK--GEH----KDVDISFNYNKVANTELQVQT-------AQQSM---G-IVSHETVLE 396 (445) Q Consensus 334 ~~~~~~~~~l~~~~~~~~~~~~~~--~~~----~~i~v~f~~~~p~d~~~~~~~-------~~~~~---g-~~s~et~l~ 396 (445) +.+..|...+.++|+.=+-+-|+- .++ ..|.+.|...-.-.+...++. +..+. | .+|++++++ T Consensus 399 RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~yi~k 478 (511) T protein:vir:56 399 RLQTKFETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYSHKYIQK 478 (511) T ss_pred HHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccchHHHHH Confidence 778888888888877544443432 222 457788865444334444433 33333 3 479999998 Q ss_pred hCCCCCC--HHHHHHHHHHHHHHHHHhhhccccCCCCC Q lcl|NC_021326. 397 NHPFVED--LQAELERIEQEQMEYNKQLPNLDDGGADG 432 (445) Q Consensus 397 ~l~~~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 432 (445) .+--.+| ..++.+.|++|..+..-+.+. . +. T Consensus 479 ~ILr~tDeei~~~~k~I~~E~k~~~~~~~e--~---~f 511 (511) T protein:vir:56 479 NILRLSDDQITAMQSEIDEEETNPRFQQDD--Q---GF 511 (511) T ss_pred HHhccCHHHHHHHHHHHHHhhcCCCCCCcc--c---CC Confidence 7644443 334445555554431111111 1 11 No 230 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=94.05 E-value=0.0055 Score=32.96 Aligned_cols=366 Identities=9% Similarity=0.025 Sum_probs=145.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeecc---- Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH---- 76 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~---- 76 (445) |...+..+...-. +..+.+ .++..++...... ..+.-.+ ..-...+|+..++-+..-|+++.. T Consensus 3 ~~~~~~~~~~~~~----~~~~~~---~~~~~~~~~~~~~----t~~~~~~--~~~v~~cv~~Ia~~ia~~p~~v~~~~~~ 69 (403) T protein:vir:10 3 FKSWITEKLNPGQ----RIIRDM---EPVSHRTNRKPFT----TGQAYSK--IEILNRTANMVIDSAAECSYTVGDKYNI 69 (403) T ss_pred chhhhhhccchhh----hhhhcc---cccccccCCcccc----cHHHHHH--HHHHHHHHHHHHHHHhhCceeEeecccc Confidence 4333333322100 000000 0111110000000 0000001 112223455555555455554421 Q ss_pred ---CchHHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEE Q lcl|NC_021326. 77 ---TDDEVIKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAF 147 (445) Q Consensus 77 ---~d~~~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~ 147 (445) .+......+...+. | ....+...+..+.+.+|.+|+.+ +. . .+..++|..+.+..+. ..... T Consensus 70 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~--~~--~-~l~~l~~~~~~v~~~~---~~~~~- 140 (403) T protein:vir:10 70 VTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYW--DG--T-SLYHVPAALMQVEADA---NKFIK- 140 (403) T ss_pred cccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEE--eC--c-eeEeecCcceEEEEcC---CceEE- Confidence 11111112222332 3 23455666778889999998764 22 1 2455666555433222 11111 Q ss_pred EEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEE-ecCCCCcCccHHHHHHHHHHHH Q lcl|NC_021326. 148 IRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIP-FKNNDLEISDIFMYKTLIDAYN 226 (445) Q Consensus 148 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~-~~n~~~g~s~~~~v~~lid~~~ 226 (445) .+.... .+. +. .....|. ....+++ ..+...|.|.+..+...++... T Consensus 141 --~~~~~~----------~~~-~~------------------~~eiih~-~~~~~~~~~~~~~~G~s~i~~~~~~i~~~~ 188 (403) T protein:vir:10 141 --KFIFNN----------QIN-YR------------------VDEIIFI-KDNSYVCGTNSQISGQSRVATVIDSLEKRS 188 (403) T ss_pred --EEEecC----------cee-ec------------------ccceEEe-cccccccCCCCCcccccHHHHHHHHHHHHH Confidence 000000 000 00 0001111 1111111 1234467788877777777666 Q ss_pred HHHHHHHHHHHHhcCCeeEEecCC---cccchhHHHhhh--------hCceeeccCCCceeeEeccCC--hHHHHHHHHH Q lcl|NC_021326. 227 RRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLLR--------YYGAIKVSDNGGVDTIQVEVP--VENSKKYLDE 293 (445) Q Consensus 227 ~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~~--------~~~~~~~~~~~~~~~l~~~~~--~~~~~~~i~~ 293 (445) .+..-..+.+...+.|-.++.... .+.....+..+. ..+++.++++.+.+.+....+ ...+.+..+. T Consensus 189 ~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~~~~~~~d~q~~e~~~~ 268 (403) T protein:vir:10 189 KMLNFKEKFLDNGTVIGLILETDEILNKKLRERKQEELQLDYNPSTGQSSVLILDGGMKAKPYSQISSFKDLDFKEDIEG 268 (403) T ss_pred HHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhCCcccCcceeecCCCceeEEecccCCHHHHHHHHHHHH Confidence 665555566666777766665422 222222222221 122455666666665554333 2344555666 Q ss_pred HHHHHHHHhCcccccccc-ccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCC-- Q lcl|NC_021326. 294 LYQKIMLFGQAVDFSSDK-FGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYN-- 370 (445) Q Consensus 294 l~~~i~~~s~~p~~~~~~-~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~-- 370 (445) ....|+..-++|....+. ..++.+.....+.. ..|.-.++.+...+...- ...+.+.++.. T Consensus 269 ~~~~Ia~~fgVPp~~lg~~~~sn~e~~~~~f~~---------------~tl~P~~~~ie~~l~~~L-~~~~~~d~~~~~~ 332 (403) T protein:vir:10 269 FNKSICLAFGVPQVLLDGGNNANIRPNIELFYY---------------MTIIPMLNKLTSSLTFFF-GYKITPNTKEVAA 332 (403) T ss_pred HHHHHHHHhCCCHHHcCCCCCcCHHHHHHHHHH---------------HHHHHHHHHHHHHHHHhc-Cceeeeccchhhh Confidence 777788877888654432 11122121111211 122222222221111111 12334444422 Q ss_pred CCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhh---ccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 371 KVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLP---NLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 371 ~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~---~~~~~~~~~~~~~~~~~d~~~~ 445 (445) +-.|..+.++++.++ .|+++.-.++++++.-.-+++...+.. ..... .....+++++++ ....+|| T Consensus 333 l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi~~~~~d~~~------~p~n~~~~~~~~~~~e~~~~---~~~~~g~ 403 (403) T protein:vir:10 333 LTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPLDDEQMNKIR------IPANVAGSATGVSGQEGGRP---KGSTEGD 403 (403) T ss_pred cccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCcccccccc------cccccccccccCCCCcCCCC---CCCcCCC Confidence 445777888887776 589999999988855321111111111 00000 011112222222 2222222 No 231 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=94.00 E-value=0.0057 Score=32.89 Aligned_cols=403 Identities=12% Similarity=0.118 Sum_probs=164.1 Q ss_pred ChHHH----HHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhh-hhccCeeec Q lcl|NC_021326. 1 MIVRY----IKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY-IVGKPIAFK 75 (445) Q Consensus 1 ~l~~~----i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~-l~g~~~~~~ 75 (445) .+.-. .+.-.+.+.+|..+..+++-+.. ...||+..+-+ -..+|+.+. T Consensus 40 ~v~~~g~~~~~n~~eLI~~YR~ma~~pEVd~A---------------------------v~eIVneaIv~d~~~~pV~vd 92 (564) T protein:vir:10 40 YVDTSGGQNSRNEYELIRRYRDMSLHPEVDSA---------------------------IDEIVNEFVVNDGDDKPVEVD 92 (564) T ss_pred eeecccccchhhHHHHHHHHHHHhhccchhhH---------------------------HHHhhcceeEecCCCceEEEE Confidence 00000 01112333444444433333221 12233332222 123566665 Q ss_pred cCchHHHHH--------HHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECC----CCcEEEEEEccceeEEEEcCCCC- Q lcl|NC_021326. 76 HTDDEVIKR--------IDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDE----EGEFKLFRVPAEQGIPIWTDKEH- 141 (445) Q Consensus 76 ~~d~~~~~~--------l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~----~g~~~i~~~~p~~~~~v~d~~~~- 141 (445) .++-++.+. .+.++. =+|+...++..+...+.|+.|+..-+|. +|=..+..+||+.+-.|+..-.. T Consensus 93 L~~~~~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~eLr~lDPr~i~~vr~i~~~~ 172 (564) T protein:vir:10 93 LQNLEIGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRSHYHKVIDLDNPKKGILELRYIDSLKIRKVRQKLKDV 172 (564) T ss_pred ecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeeCCChhhhhhhhhhhcccceeeeeeecccc Confidence 544333222 222221 2678888899999999999999887764 35456899999988777632111 Q ss_pred -CceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccce---EEec------CCCCc Q lcl|NC_021326. 142 -EELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPF---IPFK------NNDLE 211 (445) Q Consensus 142 -~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv---v~~~------n~~~g 211 (445) .....+++-+. ..+.+.....+|.+.............+. ....+..--+||. ++.. ++..- T Consensus 173 ~~~~~~v~k~~~------~~~~y~~~~Eyy~Ynp~~~~g~~~~~~~~--~~~~~~~~ikI~~daI~y~hSGL~d~~~~~i 244 (564) T protein:vir:10 173 DPNRKEIEKGTA------LQYDYGDFIEYYIYNPKGFAGNIPMVTGS--MDWSNQEGIKIASDAIAQSTSGLMDLNKKMT 244 (564) T ss_pred ccccceeeeeee------eeccccccccceeeccccccCcccccccc--cccccccceeechhhcceecccceeCCCCce Confidence 01111111110 00011111122222211000000000000 0000000012331 2211 11111 Q ss_pred CccHHHHHHHHHHHHH--HHHHHHHHHHHhcCCeeEEecCCcccc-----hhHHHhh----h------------------ Q lcl|NC_021326. 212 ISDIFMYKTLIDAYNR--RLSDLSNTFKDSNELTYVLTNYDDQEL-----PEFKRLL----R------------------ 262 (445) Q Consensus 212 ~s~~~~v~~lid~~~~--~~s~~~~~~~~~~~~~l~~~g~~~~~~-----~~~~~~~----~------------------ 262 (445) .|-+. ..+..+|. ++-|.+...+..+.|-+=+.=.+..+. +...+.+ + T Consensus 245 ~gyLh---kAIKp~NQLkmlEDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGevrddrk~ 321 (564) T protein:vir:10 245 LSFLH---KAIKSLNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSRYRNKLVYDGQTGEIRDDKKH 321 (564) T ss_pred eccch---hhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceecccchh Confidence 23333 33444443 345556666666666432221111111 1111111 1 Q ss_pred --hCceeecc--CCC-ceeeEeccC--ChHHHHHHHHHHHHHHHHHhCccc--ccccc--cc-CcchHHHHHHHHHHHHH Q lcl|NC_021326. 263 --YYGAIKVS--DNG-GVDTIQVEV--PVENSKKYLDELYQKIMLFGQAVD--FSSDK--FG-SAPSGVALEFLYTNLNL 330 (445) Q Consensus 263 --~~~~~~~~--~~~-~~~~l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~--~~~~~--~~-~~~Sg~Ai~~~~~~l~~ 330 (445) .-.-++++ +|+ +.+.-|.++ +.... .-+.-+++.++..-.+|- +..++ +. |.. ..|-..+..... T Consensus 322 msMlEDyWLPRReGgrgTEItTLpGgqnLgem-~DV~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~--~EItRDEiKF~K 398 (564) T protein:vir:10 322 MSMLEDFWLPRREGGRGTEITTLPGGQNLGEL-KDVEYFKKKLYNSLNLPPSRLTDDNKAFNLGKS--TEILRDELKFTK 398 (564) T ss_pred hhhHhhhcccccCCCcccceeeccccCCcchH-HHHHHHHHHHHHHhCCCcccccCCCceeecccc--cchhHHHHHHHH Confidence 00012232 122 123333332 33332 336666777777777763 22221 11 222 234333444555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccC--CCc----ceEEEEeCCCCCCCHHHHHHH-------HHHH---hc-cCChHH Q lcl|NC_021326. 331 KADKLARKAKVAIQELLWFVFEHFDIK--GEH----KDVDISFNYNKVANTELQVQT-------AQQS---MG-IVSHET 393 (445) Q Consensus 331 k~~~~~~~~~~~l~~~~~~~~~~~~~~--~~~----~~i~v~f~~~~p~d~~~~~~~-------~~~~---~g-~~s~et 393 (445) -+.+.+..|...+.++|+.=+-+-|+- .++ ..|.+.|...-.-.+...++. +..+ +| .+|+++ T Consensus 399 FI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dy 478 (564) T protein:vir:10 399 FIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTEY 478 (564) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHH Confidence 666778888888888777544333432 222 457788865444334444433 3333 24 479999 Q ss_pred HHHhCCCCCC--HHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCC-------------CCCcCCC Q lcl|NC_021326. 394 VLENHPFVED--LQAELERIEQEQMEYNKQLPNLDDGGADGAQQKER-------------SNDKQSE 445 (445) Q Consensus 394 ~l~~l~~~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~-------------~~d~~~~ 445 (445) +++.+--.+| ..++-+.|++|..+....-|.... ..++.+++++ ..+.+.+ T Consensus 479 i~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~-~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 544 (564) T protein:vir:10 479 IRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVN-MLDDMEKQNQAFAPELQAAQDDLAAEREIK 544 (564) T ss_pred HHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhh-cCCCccCCCCcCCcchhhhccccccccChh Confidence 9987644443 445666777776542211111000 0001111110 0011111 No 232 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=93.97 E-value=0.0058 Score=32.85 Aligned_cols=370 Identities=8% Similarity=-0.011 Sum_probs=157.6 Q ss_pred ChHHHHHHHH-HHHHHH-HHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeecc-C Q lcl|NC_021326. 1 MIVRYIKQHL-EKLPEI-SIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH-T 77 (445) Q Consensus 1 ~l~~~i~~~~-~~~~~~-~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~-~ 77 (445) +..+...... ...... ..+....-+.. .. .+... -.+.-+.+.-...+|+..++-+..-|+.+-- . T Consensus 2 ~~~r~~~~~~~~~~~~~~~~~~~~~g~~~------s~-~~~~v----t~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~ 70 (419) T protein:vir:14 2 FFSRQLLSNLGQTQMSAGGWVSALLGSSR------SD-SGQVV----TPASALALTVLQNCVTLLAESIAQLPIELYERS 70 (419) T ss_pred cccccccccccccccCcchhhHHhhcCCC------cc-CCccc----chHHhhccHHHHHHHHHHHHhhccCceEEEEec Confidence 0000000000 000000 00000110000 00 00000 0000012222344666666666556665421 1 Q ss_pred ch---H-HHHHHHHHh---cc---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcEE-EEEEccceeEEEEcCCCCCceEE Q lcl|NC_021326. 78 DD---E-VIKRIDEVL---GN---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFK-LFRVPAEQGIPIWTDKEHEELEA 146 (445) Q Consensus 78 d~---~-~~~~l~~~~---~n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~-i~~~~p~~~~~v~d~~~~~~~~~ 146 (445) ++ . ....+...+ -| .-..+...+....+.+|.+|+++..+.+|++. +..++|..+.+..+.. +.+.+ T Consensus 71 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~v~v~~~~~--~~~~y 148 (419) T protein:vir:14 71 GEDRKPATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEAVTVMRGSD--LKPVY 148 (419) T ss_pred CCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCC--ceEEE Confidence 11 0 011121222 12 23455666788899999999999999889864 7888998887665432 21111 Q ss_pred EEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCcCccHHHHHHHH Q lcl|NC_021326. 147 FIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLEISDIFMYKTLI 222 (445) Q Consensus 147 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g~s~~~~v~~li 222 (445) ....... . +. .. |++++ +...|.|.+..+...+ T Consensus 149 -----~~~~~~~-------------------~-----------~~------~~--i~h~~~~~~dg~~G~s~i~~~~~~i 185 (419) T protein:vir:14 149 -----RVRGSDP-------------------M-----------PQ------RL--VHHVRWMSINGYTGLSPVLLHANAI 185 (419) T ss_pred -----EEccCcc-------------------c-----------ch------hh--eeEecCcCCCCcccccHHHHHHHHH Confidence 1000000 0 00 00 12221 2335778887776666 Q ss_pred HHHHHHHHHHHHHHHHhcCCeeEEecC--C-cccchh----HHHhh--------hhCceeeccCCCceeeEeccCChHHH Q lcl|NC_021326. 223 DAYNRRLSDLSNTFKDSNELTYVLTNY--D-DQELPE----FKRLL--------RYYGAIKVSDNGGVDTIQVEVPVENS 287 (445) Q Consensus 223 d~~~~~~s~~~~~~~~~~~~~l~~~g~--~-~~~~~~----~~~~~--------~~~~~~~~~~~~~~~~l~~~~~~~~~ 287 (445) +....+.....+.+...+.|-.++.-. . ....++ +...+ ...+++.++++.+...+........+ T Consensus 186 ~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~ 265 (419) T protein:vir:14 186 GHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQEGMTFRPLSMTNVDAAL 265 (419) T ss_pred HHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhhHHH Confidence 665555545555566677776665422 1 111122 22211 11335556665555544433333345 Q ss_pred HHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----CC--Ccc Q lcl|NC_021326. 288 KKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI----KG--EHK 361 (445) Q Consensus 288 ~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~----~~--~~~ 361 (445) .+..+.....|+..-++|....+...+ .+...++..... .+...|.-.+..+...+.. .. ... T Consensus 266 ~e~~~~~~~~Ia~~fgVpp~~lg~~~~-~t~s~~E~~~~~----------f~~~~L~P~~~~ie~~l~~kll~~~~~~~~ 334 (419) T protein:vir:14 266 IDALRLSALDIARIYKIPAHMVNELER-ATFSNIEHQSLQ----------FVIYTLLPWVKRHEQAKTRDLLLPSERKQY 334 (419) T ss_pred HHHHHHHHHHHHHHhCCCHHHhcCCCC-CCcccHHHHHHH----------HHHHHHHHHHHHHHHHHhhhccCccccCCe Confidence 555666677888888888644432211 111112222111 1222222222222211111 11 122 Q ss_pred eEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHH-hhhccccCCCCCCCCCCC Q lcl|NC_021326. 362 DVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNK-QLPNLDDGGADGAQQKER 438 (445) Q Consensus 362 ~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~-~~~~~~~~~~~~~~~~~~ 438 (445) .+++.+...+..|..+.++++.++ .|+++.-.++++++.-.- +.-+.. ... ........+..+..+++. T Consensus 335 ~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~--~gGD~~------~~~~n~~~~~~~~~~~~~~~~~ 406 (419) T protein:vir:14 335 FIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPV--KGGDIY------LSPMNMVDASKPQQLPVGKSEP 406 (419) T ss_pred EEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCee------eeccccccccccccccCCCCCC Confidence 344444556667889999998887 589999999988754210 000000 000 000111111112222222 Q ss_pred CCCcCCC Q lcl|NC_021326. 439 SNDKQSE 445 (445) Q Consensus 439 ~~d~~~~ 445 (445) .+..+.| T Consensus 407 ~~~~~~e 413 (419) T protein:vir:14 407 TKAAIDE 413 (419) T ss_pred ccccccc Confidence 3333333 No 233 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=93.60 E-value=0.007 Score=32.40 Aligned_cols=358 Identities=10% Similarity=0.028 Sum_probs=144.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcC-CCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQ-RPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G-~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) +..+.-..-.. +.. .....+.. ...... .+.... ......-+...=...+|+..++-+.+-|+++.-. T Consensus 3 lf~~~~~~~~~--~~~-~~~~~~~~~~~~~~~----~~~~~~--~v~~~~al~~~~V~~~i~~Ia~~ia~l~~~~~~~-- 71 (384) T protein:vir:49 3 IFNITNLATES--PPS-NQDSFFDITDPEFLD----ALNGSE--WVSAETALKNSDLFSIISQLSNDLATAKITTSRK-- 71 (384) T ss_pred cccccccCccc--ccc-cchhhccccchhhcc----cccCCc--eechhhhhccHHHHHHHHHHHHHHhhCceeeecc-- Confidence 11110000000 000 00000000 000000 000000 0000000111222345666666666666665322 Q ss_pred HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEEEEeeecce Q lcl|NC_021326. 80 EVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENET 157 (445) Q Consensus 80 ~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~ 157 (445) .....+..=.. .........+..+.+.+|.+|+.+..+..|++ .+..++|..+.++.++. ...+.+. + ...+.. T Consensus 72 ~~~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~-~~~~~y~--~-~~~~~~ 147 (384) T protein:vir:49 72 QLQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDN-QNGLYYN--I-TFDDPR 147 (384) T ss_pred hhhhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCC-CceEEEE--E-EecCcc Confidence 11112211111 13456677788899999999999999988886 57888998887765542 1211111 1 000000 Q ss_pred eEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 158 KVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDL 232 (445) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~lid~~~~~~s~~ 232 (445) . .....+ +... |+|++. ...|.|.+..+...++....+.... T Consensus 148 ~------~~~~~~-------------------------~~~e--Vih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~ 194 (384) T protein:vir:49 148 I------PPKQHV-------------------------PQGD--ILHFRLLSVDGGLTSVSPLMALGRELNIQKASDKLT 194 (384) T ss_pred c------cceeEe-------------------------cCcc--EEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHH Confidence 0 000000 0001 333332 1347787777777776666555555 Q ss_pred HHHHHHhcCCeeEEecCCcccchhHHHhh--------hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCc Q lcl|NC_021326. 233 SNTFKDSNELTYVLTNYDDQELPEFKRLL--------RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQA 304 (445) Q Consensus 233 ~~~~~~~~~~~l~~~g~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~ 304 (445) .+.+...+.|-.++.-......++..... ...+++.++++.+.+.+........+.+..+...+.|+..-++ T Consensus 195 ~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgV 274 (384) T protein:vir:49 195 LNALKNALNANGILKIKGGGLLDFKTKQSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGI 274 (384) T ss_pred HHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhcccCCccceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCC Confidence 66667777776655432211111111111 1233555565555444443334455567777888888888888 Q ss_pred ccccccccc-CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHH Q lcl|NC_021326. 305 VDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQ 383 (445) Q Consensus 305 p~~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~ 383 (445) |....+..+ +..++..++..+...+.. ...-+...|.+. +..+ +.....+....+.......+. T Consensus 275 p~~~lg~~~~~~~~~~~~~~~~~~~i~~---~l~pi~~~i~~~-------l~~~-----l~~~~~~~~~~~~~~~~~~~~ 339 (384) T protein:vir:49 275 PESVVGGEGDKQSSLEMIYNIYFKAVSR---FLRPFVSELSKK-------LSCE-----VDADILPAVDPTGSNYIGLIN 339 (384) T ss_pred CHHHhCCCCCccccHHHHHHHHHHHHHH---HHHHHHHHHHHH-------hchh-----hhhhhhhhhhccchHHHHHHH Confidence 865544322 223444443322222111 111111111111 1111 000011111111112222222 Q ss_pred HH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 384 QS--MGIVSHETVLENH---PFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 384 ~~--~g~~s~et~l~~l---~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) .+ .|+.++..+++.+ |+.+ -|+.++ ...+.. + +++++| T Consensus 340 ~l~~~~~~t~~e~~~~l~~~g~~~---ne~r~~--------~~~~p~------------~-gGd~~~ 382 (384) T protein:vir:49 340 SMVKTGTLAQNQGLYVLQQAEILP---KDLPEG--------ETDSTL------------K-GGETNE 382 (384) T ss_pred HHhhcCcccHHHHHHHHhhCCCCC---hhHHHH--------cCCCCC------------C-CCCCCC Confidence 22 4677777776654 3332 222211 111111 1 111222 No 234 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=93.21 E-value=0.0084 Score=31.97 Aligned_cols=404 Identities=13% Similarity=0.098 Sum_probs=168.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhh-hhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY-IVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~-l~g~~~~~~~~d~ 79 (445) +-.. ++.-.+.+.+|..+..+++-+.. ...||+..+-+ ...+|+.+..++- T Consensus 46 ~~~~-~~~~~eLI~~YR~ma~~pEvd~A---------------------------v~eIVneaiv~d~~~~pV~i~Ld~~ 97 (558) T protein:vir:10 46 IEGA-YRSEYDLIRRYREMALHPEADGA---------------------------IEDVVNEAIVSDLYDSPVEVELSNL 97 (558) T ss_pred ccch-hhhHHHHHHHHHHHhhccchhhH---------------------------HHHhhcceeEecCCCceEEEEeccc Confidence 2111 12223344455555444433321 12233333222 1334555544432 Q ss_pred H----HHHHHHH----Hhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC----CcEEEEEEccceeEEEEcCCCCCceEE Q lcl|NC_021326. 80 E----VIKRIDE----VLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE----GEFKLFRVPAEQGIPIWTDKEHEELEA 146 (445) Q Consensus 80 ~----~~~~l~~----~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~----g~~~i~~~~p~~~~~v~d~~~~~~~~~ 146 (445) + ..+.+.+ ++. =+|+.+.++..+...+.|+.|+...+|.+ |-..+..+||+.+-.|..- ..+..- T Consensus 98 ~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRiyfHKiid~k~pk~GI~ELr~lDPr~i~~Vr~i--~~~~~~ 175 (558) T protein:vir:10 98 NASNTLKKKIREEFRYIKEMMDFDKKSHEIFRNWYVDGRVFYLKVIDTKNPQEGIQDLRYIDPLKIKFIRQE--KRKPGN 175 (558) T ss_pred CcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeeeCcccceeeeee--cccccc Confidence 2 3333322 221 26788889999999999999999988743 6677899999988665431 011110 Q ss_pred EEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc---eEEec------CCCCcCccHHH Q lcl|NC_021326. 147 FIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIPFK------NNDLEISDIFM 217 (445) Q Consensus 147 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~------n~~~g~s~~~~ 217 (445) ......+.....+ .+.+....+|.+.......... .+.. ...+++ +|| |++.. |...-.|-+. T Consensus 176 ~~~~~~~~~~~~~-~~~~~~~eyy~Y~~~~~~~~~~--~~~~---~~~~~v-kI~~dAI~y~hSGL~d~~~~~i~syLh- 247 (558) T protein:vir:10 176 QDPAIRVRSEQDV-VPNPEFEEFYIYTPKVQHPTGM--VGQM---GGKNSI-KIAKDSITMCTSGLVDRNKNRVLSYLH- 247 (558) T ss_pred ccceeeeecccce-eeccceeEeeeecCCccccccc--ceee---cCCCce-eechhheeeecccceecCCCeeeecch- Confidence 0111111221111 1122233333332221111100 0000 001111 333 22221 1111123333 Q ss_pred HHHHHHHHHH--HHHHHHHHHHHhcCCeeEEecCCcccch-----hHHHhh----h--------------------hCce Q lcl|NC_021326. 218 YKTLIDAYNR--RLSDLSNTFKDSNELTYVLTNYDDQELP-----EFKRLL----R--------------------YYGA 266 (445) Q Consensus 218 v~~lid~~~~--~~s~~~~~~~~~~~~~l~~~g~~~~~~~-----~~~~~~----~--------------------~~~~ 266 (445) ..+..+|. ++-|.+...+..+.|-+=+.-.+..+.+ ...+.+ + .-.- T Consensus 248 --kAIKp~NQLkmlEDAlVIYRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlED 325 (558) T protein:vir:10 248 --KAIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDANTGEVRDDRKFMSMMED 325 (558) T ss_pred --HhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhh Confidence 33334443 3455566666666664322211111111 111111 1 0001 Q ss_pred eecc--CCC-ceeeEeccC--ChHHHHHHHHHHHHHHHHHhCcccc--ccccccCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 267 IKVS--DNG-GVDTIQVEV--PVENSKKYLDELYQKIMLFGQAVDF--SSDKFGSAPSGVALEFLYTNLNLKADKLARKA 339 (445) Q Consensus 267 ~~~~--~~~-~~~~l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~~--~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~ 339 (445) +++| +|+ +.+.-|.++ +... ..-+.-+++.++..-.+|-- ..++.-+.--+..|-..+.....-+.+.+..| T Consensus 326 yWLpRReGgrgTEItTLpGgqnLge-m~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rF 404 (558) T protein:vir:10 326 FWLPRREGGRGTEITTLPGGQNLGE-LSDVDYFQKKLYRALGVPESRIAAEGGFNLGRSSEILRDELKFAKFVGRLRKRF 404 (558) T ss_pred hcccccCCCCccceeeccccCCcch-HHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHH Confidence 2232 122 123333332 3332 23366667777777777631 11111000112234334444555667778888 Q ss_pred HHHHHHHHHHHHHHhccCC--Cc----ceEEEEeCCCCCCCHHHHHH-------HHHHHh---c-cCChHHHHHhCCCCC Q lcl|NC_021326. 340 KVAIQELLWFVFEHFDIKG--EH----KDVDISFNYNKVANTELQVQ-------TAQQSM---G-IVSHETVLENHPFVE 402 (445) Q Consensus 340 ~~~l~~~~~~~~~~~~~~~--~~----~~i~v~f~~~~p~d~~~~~~-------~~~~~~---g-~~s~et~l~~l~~~~ 402 (445) ...+.++|+.=+-+-|+-. ++ ..|.+.|...-.-.+...++ ++..+. | .+|++++++.+--.+ T Consensus 405 s~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~t 484 (558) T protein:vir:10 405 AAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRKRVLRQT 484 (558) T ss_pred HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccC Confidence 8888887775444434321 22 45778886544433444443 333332 3 479999998764444 Q ss_pred C--HHHHHHHHHHHHHHHHHhhhcc---c---------cCC-CCCCCCCCCC--CCcCCC Q lcl|NC_021326. 403 D--LQAELERIEQEQMEYNKQLPNL---D---------DGG-ADGAQQKERS--NDKQSE 445 (445) Q Consensus 403 d--~~~E~~ri~~E~~~~~~~~~~~---~---------~~~-~~~~~~~~~~--~d~~~~ 445 (445) | ..++-+.|++|.++-.-..|+. . ++. .+.+.++... +..+.| T Consensus 485 DeeI~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 544 (558) T protein:vir:10 485 DMEIEEIDTQIEDEIQKGIIPDPSQIDPITGEPLPQEGDPAMEGMGEQPVDPDLEAQAQA 544 (558) T ss_pred HHHHHHHHHHHHHHHhCCCCCCccccChhhccccCccCCchhccCCCCCcccccccchhh Confidence 3 4455666776664311111110 0 000 0111111000 000011 No 235 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=92.86 E-value=0.0097 Score=31.62 Aligned_cols=366 Identities=11% Similarity=0.027 Sum_probs=136.5 Q ss_pred HHHhcCCCccccccccccccc-cccccccccc---cccchHHHHHHHHHhhhhccCeeeccCchH--HHHHHHHHhc--- Q lcl|NC_021326. 20 QEYYEQRPDIVKEPKPVDATG-AVDPLKPDDR---MITNFHANLVDQKVSYIVGKPIAFKHTDDE--VIKRIDEVLG--- 90 (445) Q Consensus 20 ~~yy~G~~~i~~~~~~~~~~~-~~~~~~~~~r---i~~n~~~~iv~~~~~~l~g~~~~~~~~d~~--~~~~l~~~~~--- 90 (445) ..+|+-......-........ ......+... +.+.-.-.+|+..++-+..=|+.+...+.+ ....+...+. T Consensus 1 m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Al~~~~V~~~i~~Ia~~iA~lp~~~~~~~g~~~~~~~~~~lL~~~P 80 (406) T protein:vir:97 1 MSFFQPLGTSKVSYDDYISSVLAGDVSQKYLGVSALKNSDILTATSIIAGDIARFPLVKKDVNGDIIHDEDINYLLNVKS 80 (406) T ss_pred CccccccCCCCCCcchHHHHHhcCCCCcccccchhhccHHHHHHHHHHHHhhhhCeeEEEecCccccccchHHHHhhccC Confidence 233322110000000000000 0000000000 011111124555554444445544322211 1112222231 Q ss_pred c---CHHHHHHHHHHHHHhcCeEEEEEEECC-CCcE-EEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEEEEecc Q lcl|NC_021326. 91 N---RFDDKLHSVLTGASNKGIEWLHPYLDE-EGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKI 165 (445) Q Consensus 91 n---~~~~~~~~~~~~~~~~G~~~~~v~~d~-~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~ 165 (445) | ....+...+..+.+.+|.+|+.+..+. .|++ .+..++|.++.+..++. +++.+. +.......... +.. T Consensus 81 N~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~~--~~~~y~--~~~~~~~~~~~-~~~- 154 (406) T protein:vir:97 81 TSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETDN--HEIVYT--FTDMLTAKQVK-CFA- 154 (406) T ss_pred CCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcCC--ceEEEE--EEecCCceEEE-Ecc- Confidence 3 345666778888999999999998875 4654 57788898887665532 222210 00000000000 000 Q ss_pred eEEEEEEecceeeecccccccccccccccccccccceEEecC----CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021326. 166 TVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN----NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNE 241 (445) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n----~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~ 241 (445) .. |+|++. ...|.|.+..+...++....+..-....++..+. T Consensus 155 --------------------------------~e--vih~r~~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~ 200 (406) T protein:vir:97 155 --------------------------------HD--VIHWKFFSHDTILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFS 200 (406) T ss_pred --------------------------------cc--EEEecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 00 222221 1236777766555555444333333334445454 Q ss_pred CeeE-EecCCc--ccchhHHHhhh-------hCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCcccccccc Q lcl|NC_021326. 242 LTYV-LTNYDD--QELPEFKRLLR-------YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDK 311 (445) Q Consensus 242 ~~l~-~~g~~~--~~~~~~~~~~~-------~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~ 311 (445) |-.+ .++... +.....+..+. ..+++.++++.+...++.......+.+..+.....|...-++|....+. T Consensus 201 ~~~i~~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~ 280 (406) T protein:vir:97 201 SGILTMKGAQLSGDARQRARQEFEKMREGSVGGSPLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGV 280 (406) T ss_pred CceEEecCCCCCHHHHHHHHHHHHHHhcccccCceeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCC Confidence 5333 333221 11122222121 1234555655555544433333334444555566677777777654432 Q ss_pred ccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----cCCCcceEEEEeCCCCCCCHHHHHHHHHHH-- Q lcl|NC_021326. 312 FGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD----IKGEHKDVDISFNYNKVANTELQVQTAQQS-- 385 (445) Q Consensus 312 ~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~----~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~-- 385 (445) .+. .| .+...... .+...|.-.++.+...+. ...+.....+.|. +..+....++++.++ T Consensus 281 ~~~-~~--~~e~~~~~----------f~~~~l~P~~~~ie~~l~~kll~~~~~~~~~i~fd--~~~~~~~~~~~~~~~~~ 345 (406) T protein:vir:97 281 NSP-NQ--SVAQLMED----------YVTNDLPFYFDAITSELGLKTLNDKDRRLYHIEFD--TRSVTGRNVDEIVKLVN 345 (406) T ss_pred CCC-cc--hHHHHHHH----------HHHHHHHHHHHHHHHHHhhhhcChhhccceeEEEe--cCccchhhHHHHHHHHh Confidence 111 11 11111111 122222222222222111 1111223345553 112333444555554 Q ss_pred hccCChHHHHHhCCCCC--CHHHHHHHHHHHH-HHHHHhhhcccc---CCCCCCCCCCCCCCcC Q lcl|NC_021326. 386 MGIVSHETVLENHPFVE--DLQAELERIEQEQ-MEYNKQLPNLDD---GGADGAQQKERSNDKQ 443 (445) Q Consensus 386 ~g~~s~et~l~~l~~~~--d~~~E~~ri~~E~-~~~~~~~~~~~~---~~~~~~~~~~~~~d~~ 443 (445) .|+++...+++.++.-. ++. .++..--. -...+...+..+ ...++++. +..+|+. T Consensus 346 ~g~~T~NE~R~~~g~~p~~~~~--gD~~~~~~n~~~~~~~~~~~~~~~~~~~gg~~-~~~~~~~ 406 (406) T protein:vir:97 346 NQILTPNQGLVELGKQKSTDPN--MDRYQSSLNYVFLDKKEEYQDKVGIKGKGGEV-NAEEDKS 406 (406) T ss_pred CCCcCHHHHHHHhCCCCCCCCC--CCeEeeccCccchhcccccccccccccCCCCC-CCCCCCC Confidence 48999999988875421 111 00000000 000000001111 11111111 1111111 No 236 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=92.24 E-value=0.012 Score=31.07 Aligned_cols=370 Identities=9% Similarity=-0.003 Sum_probs=154.4 Q ss_pred Ch-HHHHHH-HHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeecc-- Q lcl|NC_021326. 1 MI-VRYIKQ-HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH-- 76 (445) Q Consensus 1 ~l-~~~i~~-~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~-- 76 (445) |. .+...+ .....+.-.-+...+-|-.. . .... ..-...-+.+.-...+|+..++-+.+-|+.+-- T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-------s-~~~~--~v~~~~al~~~~v~~cv~~ia~~ia~lp~~~~~~~ 70 (419) T protein:vir:80 1 MFFSRQLLSNLGQTQPGSGGWVSALLGSAR-------S-EAGQ--VVTPASALSLTVLQNCVTLLAESIAQLPVELYERS 70 (419) T ss_pred CCcccccccccCcCCCCcchhhHHhhcccc-------c-ccCc--ccChHHhhccHHHHHHHHHHHHhhccCceEEEEec Confidence 10 100000 00000000000000001000 0 0000 000000122233344666666666666766411 Q ss_pred -Cch-H-HHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcEE-EEEEccceeEEEEcCCCCCceEE Q lcl|NC_021326. 77 -TDD-E-VIKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFK-LFRVPAEQGIPIWTDKEHEELEA 146 (445) Q Consensus 77 -~d~-~-~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~-i~~~~p~~~~~v~d~~~~~~~~~ 146 (445) +.. . ....+...+. | .-..+...+..+.+.+|.+|+.+..+..|++. +..++|..+.+..+.. +.+. T Consensus 71 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~~~v~i~~~~~--~~~~- 147 (419) T protein:vir:80 71 GDDRKPATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDNEAVTVMKGPD--LKPM- 147 (419) T ss_pred CCCcccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCC--ceEE- Confidence 111 0 0111222221 2 23455666778899999999999999889865 7888898887665432 1111 Q ss_pred EEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCcCccHHHHHHHH Q lcl|NC_021326. 147 FIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLEISDIFMYKTLI 222 (445) Q Consensus 147 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g~s~~~~v~~li 222 (445) |... ... ..+. .. |++++ +...|.|.+......+ T Consensus 148 ----y~~~-------------------~~~-----------~~~~------~~--i~h~~~~~~d~~~G~s~i~~~~~~i 185 (419) T protein:vir:80 148 ----YRVA-------------------GAD-----------PLPQ------RL--VHHVRWMSINGYTGLSPVLLHANAI 185 (419) T ss_pred ----EEEc-------------------Ccc-----------ccch------hh--eEEecCCCCCCcccccHHHHHHHHH Confidence 1100 000 0000 00 22222 2235778777666666 Q ss_pred HHHHHHHHHHHHHHHHhcCCeeEEe--cC-Ccccchh----HHHhh--------hhCceeeccCCCceeeEeccCChHHH Q lcl|NC_021326. 223 DAYNRRLSDLSNTFKDSNELTYVLT--NY-DDQELPE----FKRLL--------RYYGAIKVSDNGGVDTIQVEVPVENS 287 (445) Q Consensus 223 d~~~~~~s~~~~~~~~~~~~~l~~~--g~-~~~~~~~----~~~~~--------~~~~~~~~~~~~~~~~l~~~~~~~~~ 287 (445) +.......-..+.+...+.|-.++. +. .....++ .+..+ ..++++.++++.+.+-+..+.....+ T Consensus 186 ~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~ 265 (419) T protein:vir:80 186 GHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQEGMKFKPLSMTNVDAAL 265 (419) T ss_pred HHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEeccCChhhHHH Confidence 6555444444455566677766654 21 1111111 12111 12335666666555555443334456 Q ss_pred HHHHHHHHHHHHHHhCcccccccccc-CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----c-CCCc Q lcl|NC_021326. 288 KKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD-----I-KGEH 360 (445) Q Consensus 288 ~~~i~~l~~~i~~~s~~p~~~~~~~~-~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~-----~-~~~~ 360 (445) .+..+...+.|+..-++|....+... ++-++ ++.... ..+...|.-++..+...+. . .... T Consensus 266 ~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n--~e~~~~----------~f~~~~l~P~~~~ie~~l~~kll~~~~~~~ 333 (419) T protein:vir:80 266 IDALRLSALDIARIYKIPAHMVNELERATFSN--IEHQSL----------QFVIYTLLPWVKRHEQAKTRDLLLPSERKQ 333 (419) T ss_pred HHHHHHHHHHHHHHhCCCHHHhcCCCCCCccc--HHHHHH----------HHHHHHHHHHHHHHHHHHhhhccCccccCC Confidence 66667777888888888864443221 11111 111111 1112222222222211111 1 1111 Q ss_pred ceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHh-hhccccCCCCCCCCCC Q lcl|NC_021326. 361 KDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQ-LPNLDDGGADGAQQKE 437 (445) Q Consensus 361 ~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~-~~~~~~~~~~~~~~~~ 437 (445) ..+++.+...+..|..+.++.+.++ .|+++.-.++++++.-.- +-.+.. .... .............+++ T Consensus 334 ~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~--~gGD~~------~~~~n~~~~~~~~~~~~~~~~ 405 (419) T protein:vir:80 334 YFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPV--KGGDIY------LSPMNMVDASKPQPIPMGKTE 405 (419) T ss_pred eEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCccee------eeccccccccccccccCCCCC Confidence 2234444556667889999888876 589999999988754210 000000 0000 0000000000011111 Q ss_pred CCCCcCCC Q lcl|NC_021326. 438 RSNDKQSE 445 (445) Q Consensus 438 ~~~d~~~~ 445 (445) .......| T Consensus 406 ~~~~~~~~ 413 (419) T protein:vir:80 406 PTKAALDE 413 (419) T ss_pred chhhhHHH Confidence 11111111 No 237 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=91.69 E-value=0.015 Score=30.64 Aligned_cols=377 Identities=14% Similarity=0.102 Sum_probs=158.6 Q ss_pred Ch--HHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhh-hhccCeeeccC Q lcl|NC_021326. 1 MI--VRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY-IVGKPIAFKHT 77 (445) Q Consensus 1 ~l--~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~-l~g~~~~~~~~ 77 (445) .+ .--++--.+-+.+|..+..+++-+.. ...||+..+-+ -..+|+.+..+ T Consensus 57 ~~~~~~~~~~~~eLI~~YR~ma~~pEvd~A---------------------------v~eIVneaiv~d~~~~pV~l~L~ 109 (516) T protein:vir:10 57 FFGIDNNISGTKDLINTYRQLINNPEVERA---------------------------VANIVNEAIVYERGHKVVSLDLD 109 (516) T ss_pred eeccccccchHHHHHHHHHHHhhccchhhH---------------------------HHHhhcceeEecCCCceEEEEec Confidence 00 00111112233444444444433321 12233332222 23355666554 Q ss_pred chHHHHH----H----HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC--CCCcEEEEEEccceeEEEEcCCCCCceEE Q lcl|NC_021326. 78 DDEVIKR----I----DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLD--EEGEFKLFRVPAEQGIPIWTDKEHEELEA 146 (445) Q Consensus 78 d~~~~~~----l----~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d--~~g~~~i~~~~p~~~~~v~d~~~~~~~~~ 146 (445) +-++.+. + +.++. =+|+.+.++..+...+.|+.|+....| ++|-..+..+||+.+..+.- T Consensus 110 ~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~GI~Elr~lDPr~i~~vR~--------- 180 (516) T protein:vir:10 110 DTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMPNPKKGIAELRRLDPRFMEYYRE--------- 180 (516) T ss_pred ccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCccccceeeeeeCCcceeeEee--------- Confidence 4332222 2 22221 267888889999999999999886665 34656788899988765432 Q ss_pred EEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc---eEEec------CCCCcCccHHH Q lcl|NC_021326. 147 FIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIPFK------NNDLEISDIFM 217 (445) Q Consensus 147 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~------n~~~g~s~~~~ 217 (445) ..+++..++.+..... .+|.+..+.......+.. ..++.-=+|| |++.. |...-.|-+. T Consensus 181 ----i~~~~~~~~~v~~~~~-e~~~Y~~~~~~~~~~g~~------~~~~~~ikI~~dAI~y~hSGL~d~~~~~i~syLh- 248 (516) T protein:vir:10 181 ----IVTSDIGGTTIVKGYR-EFFIYTTGNEGYSYNGRI------FEPNTRIKIPRSAVVYASSGLMDCSDRGIIGYLH- 248 (516) T ss_pred ----ecccccccchhhhhhh-heeeeccCccccccccce------eCCCcceeechhheeeecccceeCCCCceeeeeh- Confidence 2222222222111111 111111111111111000 0000001233 22222 1111133333 Q ss_pred HHHHHHHHHH--HHHHHHHHHHHhcCCeeEEecCCcccc-----hhHHHhh----hh--------------------Cce Q lcl|NC_021326. 218 YKTLIDAYNR--RLSDLSNTFKDSNELTYVLTNYDDQEL-----PEFKRLL----RY--------------------YGA 266 (445) Q Consensus 218 v~~lid~~~~--~~s~~~~~~~~~~~~~l~~~g~~~~~~-----~~~~~~~----~~--------------------~~~ 266 (445) ..+..+|. ++-|.+...+..+.|-+=+.-.+..+. +...+.+ +. -.- T Consensus 249 --kAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlED 326 (516) T protein:vir:10 249 --NAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTED 326 (516) T ss_pred --hhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhh Confidence 33333443 345555666666666432221111111 1111111 00 001 Q ss_pred eecc--CCC-ceeeEeccC--ChHHHHHHHHHHHHHHHHHhCcccc--cccc-cc-CcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 267 IKVS--DNG-GVDTIQVEV--PVENSKKYLDELYQKIMLFGQAVDF--SSDK-FG-SAPSGVALEFLYTNLNLKADKLAR 337 (445) Q Consensus 267 ~~~~--~~~-~~~~l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~~--~~~~-~~-~~~Sg~Ai~~~~~~l~~k~~~~~~ 337 (445) +++| +|+ +.+.-|.++ +... ..-+.-+++.++..-.+|-- ..+. ++ +.--|..|-.-+.....-+.+.+. T Consensus 327 yWLpRReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~ 405 (516) T protein:vir:10 327 YWLMRRDGKSVTEVSSLPGAQTMGD-MDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQH 405 (516) T ss_pred hcccccCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHH Confidence 1232 122 223333333 3333 33466677777887777732 1111 11 001222333333445555667777 Q ss_pred HHHHHHHHHHHHHHHHhccC--CCc----ceEEEEeCCCCCCCHHHHHHH-------HHHH---hc-cCChHHHHHhCCC Q lcl|NC_021326. 338 KAKVAIQELLWFVFEHFDIK--GEH----KDVDISFNYNKVANTELQVQT-------AQQS---MG-IVSHETVLENHPF 400 (445) Q Consensus 338 ~~~~~l~~~~~~~~~~~~~~--~~~----~~i~v~f~~~~p~d~~~~~~~-------~~~~---~g-~~s~et~l~~l~~ 400 (445) .|...+.++|+.=+-+-|+- .++ ..|.+.|...-.-.+...++. +..+ +| .+|++++++.+-- T Consensus 406 rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr 485 (516) T protein:vir:10 406 DFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQ 485 (516) T ss_pred HHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhc Confidence 88888887777544333432 222 457788865444334444433 3333 24 6999999987644 Q ss_pred CCC--HHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 401 VED--LQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 401 ~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) .+| ..+|-+.|++|..+-.-.. ++++.+ T Consensus 486 ~tDeei~~e~k~I~~E~~~~~~~~-----------------p~~~~~ 515 (516) T protein:vir:10 486 MTEEQIAQEEKQIEQEAGIKRFQN-----------------PENEDD 515 (516) T ss_pred CCHhhHHHHHHHHHHhhhCCCCCC-----------------CCcccc Confidence 443 3344445554443211110 111111 No 238 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=91.69 E-value=0.015 Score=30.64 Aligned_cols=377 Identities=14% Similarity=0.102 Sum_probs=158.6 Q ss_pred Ch--HHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhh-hhccCeeeccC Q lcl|NC_021326. 1 MI--VRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY-IVGKPIAFKHT 77 (445) Q Consensus 1 ~l--~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~-l~g~~~~~~~~ 77 (445) .+ .--++--.+-+.+|..+..+++-+.. ...||+..+-+ -..+|+.+..+ T Consensus 57 ~~~~~~~~~~~~eLI~~YR~ma~~pEvd~A---------------------------v~eIVneaiv~d~~~~pV~l~L~ 109 (516) T protein:vir:10 57 FFGIDNNISGTKDLINTYRQLINNPEVERA---------------------------VANIVNEAIVYERGHKVVSLDLD 109 (516) T ss_pred eeccccccchHHHHHHHHHHHhhccchhhH---------------------------HHHhhcceeEecCCCceEEEEec Confidence 00 00111112233444444444433321 12233332222 23355666554 Q ss_pred chHHHHH----H----HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC--CCCcEEEEEEccceeEEEEcCCCCCceEE Q lcl|NC_021326. 78 DDEVIKR----I----DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLD--EEGEFKLFRVPAEQGIPIWTDKEHEELEA 146 (445) Q Consensus 78 d~~~~~~----l----~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d--~~g~~~i~~~~p~~~~~v~d~~~~~~~~~ 146 (445) +-++.+. + +.++. =+|+.+.++..+...+.|+.|+....| ++|-..+..+||+.+..+.- T Consensus 110 ~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~GI~Elr~lDPr~i~~vR~--------- 180 (516) T protein:vir:10 110 DTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMPNPKKGIAELRRLDPRFMEYYRE--------- 180 (516) T ss_pred ccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCccccceeeeeeCCcceeeEee--------- Confidence 4332222 2 22221 267888889999999999999886665 34656788899988765432 Q ss_pred EEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc---eEEec------CCCCcCccHHH Q lcl|NC_021326. 147 FIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIPFK------NNDLEISDIFM 217 (445) Q Consensus 147 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~------n~~~g~s~~~~ 217 (445) ..+++..++.+..... .+|.+..+.......+.. ..++.-=+|| |++.. |...-.|-+. T Consensus 181 ----i~~~~~~~~~v~~~~~-e~~~Y~~~~~~~~~~g~~------~~~~~~ikI~~dAI~y~hSGL~d~~~~~i~syLh- 248 (516) T protein:vir:10 181 ----IVTSDIGGTTIVKGYR-EFFIYTTGNEGYSYNGRI------FEPNTRIKIPRSAVVYASSGLMDCSDRGIIGYLH- 248 (516) T ss_pred ----ecccccccchhhhhhh-heeeeccCccccccccce------eCCCcceeechhheeeecccceeCCCCceeeeeh- Confidence 2222222222111111 111111111111111000 0000001233 22222 1111133333 Q ss_pred HHHHHHHHHH--HHHHHHHHHHHhcCCeeEEecCCcccc-----hhHHHhh----hh--------------------Cce Q lcl|NC_021326. 218 YKTLIDAYNR--RLSDLSNTFKDSNELTYVLTNYDDQEL-----PEFKRLL----RY--------------------YGA 266 (445) Q Consensus 218 v~~lid~~~~--~~s~~~~~~~~~~~~~l~~~g~~~~~~-----~~~~~~~----~~--------------------~~~ 266 (445) ..+..+|. ++-|.+...+..+.|-+=+.-.+..+. +...+.+ +. -.- T Consensus 249 --kAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlED 326 (516) T protein:vir:10 249 --NAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTED 326 (516) T ss_pred --hhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhh Confidence 33333443 345555666666666432221111111 1111111 00 001 Q ss_pred eecc--CCC-ceeeEeccC--ChHHHHHHHHHHHHHHHHHhCcccc--cccc-cc-CcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 267 IKVS--DNG-GVDTIQVEV--PVENSKKYLDELYQKIMLFGQAVDF--SSDK-FG-SAPSGVALEFLYTNLNLKADKLAR 337 (445) Q Consensus 267 ~~~~--~~~-~~~~l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~~--~~~~-~~-~~~Sg~Ai~~~~~~l~~k~~~~~~ 337 (445) +++| +|+ +.+.-|.++ +... ..-+.-+++.++..-.+|-- ..+. ++ +.--|..|-.-+.....-+.+.+. T Consensus 327 yWLpRReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~ 405 (516) T protein:vir:10 327 YWLMRRDGKSVTEVSSLPGAQTMGD-MDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQH 405 (516) T ss_pred hcccccCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHH Confidence 1232 122 223333333 3333 33466677777887777732 1111 11 001222333333445555667777 Q ss_pred HHHHHHHHHHHHHHHHhccC--CCc----ceEEEEeCCCCCCCHHHHHHH-------HHHH---hc-cCChHHHHHhCCC Q lcl|NC_021326. 338 KAKVAIQELLWFVFEHFDIK--GEH----KDVDISFNYNKVANTELQVQT-------AQQS---MG-IVSHETVLENHPF 400 (445) Q Consensus 338 ~~~~~l~~~~~~~~~~~~~~--~~~----~~i~v~f~~~~p~d~~~~~~~-------~~~~---~g-~~s~et~l~~l~~ 400 (445) .|...+.++|+.=+-+-|+- .++ ..|.+.|...-.-.+...++. +..+ +| .+|++++++.+-- T Consensus 406 rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr 485 (516) T protein:vir:10 406 DFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQ 485 (516) T ss_pred HHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhc Confidence 88888887777544333432 222 457788865444334444433 3333 24 6999999987644 Q ss_pred CCC--HHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 401 VED--LQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 401 ~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) .+| ..+|-+.|++|..+-.-.. ++++.+ T Consensus 486 ~tDeei~~e~k~I~~E~~~~~~~~-----------------p~~~~~ 515 (516) T protein:vir:10 486 MTEEQIAQEEKQIEQEAGIKRFQN-----------------PENEDD 515 (516) T ss_pred CCHhhHHHHHHHHHHhhhCCCCCC-----------------CCcccc Confidence 443 3344445554443211110 111111 No 239 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=91.23 E-value=0.017 Score=30.31 Aligned_cols=390 Identities=12% Similarity=0.044 Sum_probs=156.9 Q ss_pred ChHHHHHH---HH----HHH--------HHHHHHHHHhcCCCccccccccccccccccccccccccccchH----HHHHH Q lcl|NC_021326. 1 MIVRYIKQ---HL----EKL--------PEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFH----ANLVD 61 (445) Q Consensus 1 ~l~~~i~~---~~----~~~--------~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~----~~iv~ 61 (445) |+.++..- +. ..+ ...-....||-|... + ........|.. -+.++-+ ..||+ T Consensus 19 ~~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~~~--n------~~eLI~~YR~m-a~~~pEVd~AideIvn 89 (533) T protein:vir:58 19 FLSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYGGIEF--N------RFFLYDMYDRM-DYTDPLISTVLDIIAD 89 (533) T ss_pred hhchhhcccCccCCCCCccccCCCCcchhhhhhhhhhhccccc--c------HHHHHHHHHHh-hccCcchhhHHHhhhc Confidence 11111110 00 000 001111223433210 0 00000111110 0011222 22333 Q ss_pred HHHhh-hhccCeeeccCchHHHHHHHHHhc--cCHHHHHHHHHHHHHhcCeEEEEEEEC-CC-CcEEEEEEccceeEEEE Q lcl|NC_021326. 62 QKVSY-IVGKPIAFKHTDDEVIKRIDEVLG--NRFDDKLHSVLTGASNKGIEWLHPYLD-EE-GEFKLFRVPAEQGIPIW 136 (445) Q Consensus 62 ~~~~~-l~g~~~~~~~~d~~~~~~l~~~~~--n~~~~~~~~~~~~~~~~G~~~~~v~~d-~~-g~~~i~~~~p~~~~~v~ 136 (445) ..+-+ -...|+.+..++.++.+.+++.+. .+|+.+.++..+...+.|+.|+..-.+ ++ |=..+..+||..+-.++ T Consensus 90 eaiv~d~~~~pV~v~l~~~e~s~~iK~kI~~lldf~~~~~~~fR~WYVDGriy~Hkiik~~k~GI~elr~lDPr~i~~vr 169 (533) T protein:vir:58 90 ECTIPNENGNIVDVVTKDIELAKAILSYLDYVINIEKNAYPIIRNMIKYGDMFLHILEKGSDGTIEKFQVVSPYIFSKRY 169 (533) T ss_pred eeeEecCCCceeEeecccccccHHHHHHHHHHhcchhhhhHHHHhhhhcceeEEEeccCCcccchhhheecCCeeeEEEE Confidence 33222 245678887777666666655543 378889999999999999999988543 23 33468999999987776 Q ss_pred cCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc---eEEec------C Q lcl|NC_021326. 137 TDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIPFK------N 207 (445) Q Consensus 137 d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~------n 207 (445) .... +. + +|.+...+.. .. .++.--.|| |+++. + T Consensus 170 ~~~t--~~---------------e--------yyvy~~~~~~-~~-----------s~~~~~kI~~daI~y~~SGl~d~~ 212 (533) T protein:vir:58 170 NPET--DT---------------W--------YYVITDVYRN-VV-----------SGYFNEDIPEEDVIHFSHKIDTNF 212 (533) T ss_pred eecc--ce---------------E--------EEeecccccc-cc-----------cCccccccchhheeeeeeccccCC Confidence 5321 11 1 1111111100 00 000001233 33332 2 Q ss_pred CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCee---EE--ecCCcccchhHHHhh--------------------h Q lcl|NC_021326. 208 NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTY---VL--TNYDDQELPEFKRLL--------------------R 262 (445) Q Consensus 208 ~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l---~~--~g~~~~~~~~~~~~~--------------------~ 262 (445) .+.+.|-+...+.....|- ++-+.+...+..+.|-+ .+ -++.........+.+ + T Consensus 213 ~~~iisyLhkAiKp~NQLk-miEDAlVIYRisRAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddr 291 (533) T protein:vir:58 213 FPYGRSYLESARAIWNQLR-LMEDALMLYRVVRSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGID 291 (533) T ss_pred CCceehhhhHHHHHHHHHH-HHHHHHHHHhhcCChhheEEEEeecCCCccCHHHHHHHHHHhcccceEEeccCCeEeecc Confidence 2334555554333332222 24444555555555432 21 121111111111111 0 Q ss_pred hC-------ceeecc--CCC-ceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccc---cccCcchHHHHHHHHHHHH Q lcl|NC_021326. 263 YY-------GAIKVS--DNG-GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSD---KFGSAPSGVALEFLYTNLN 329 (445) Q Consensus 263 ~~-------~~~~~~--~~~-~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~---~~~~~~Sg~Ai~~~~~~l~ 329 (445) .. .-++++ +|+ +.+.-|.++..-.....+.-+++.++..-.+|---.. .+ |..| .|-....... T Consensus 292 k~m~~~sMlEDyWLpRReGgrgTEI~TLpGg~lgemeDV~YF~kkLy~ALnVP~sRl~~e~~f-gr~~--eItRDEiKF~ 368 (533) T protein:vir:58 292 NYFSIESILKDYFIPRRGDRRAVEIDILQGSKVDLAEDVEYMLNRLISALKVPKAFIGYEGDV-NAKN--TLATQDIKFN 368 (533) T ss_pred chhhhhhhHhhhcccccCCCccceeeecCCCCCCcHHHHHHHHHHHHHHhCCCeeecCCCCCC-ccch--hhhHHHHHHH Confidence 00 001222 111 2233344432223345577778888888888742211 22 2222 2322222233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHH-------HHHHHHhccCChHHHHHhC-CCC Q lcl|NC_021326. 330 LKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQV-------QTAQQSMGIVSHETVLENH-PFV 401 (445) Q Consensus 330 ~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~-------~~~~~~~g~~s~et~l~~l-~~~ 401 (445) .-+.+.+..|..-|++.|. +. ++- ...+..+.|...-.-.+...+ +++..+.+.++++++++.+ -.. T Consensus 369 KFI~rLR~rF~~ll~~qLi--lk--~ii-t~eew~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~t 443 (533) T protein:vir:58 369 NTIKRIQGFFVEELERMVR--MN--KEF-ADQDFRLVMNRSNSIVEGERFAVIEQRIGIAERLKGWVREDWIYSNILQIP 443 (533) T ss_pred HHHHHHHHHHHHHHhcccc--cc--cCc-chhheeeeeeccchHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHhcCC Confidence 3444455556665554332 11 111 112346777543333333333 3444556789999998865 334 Q ss_pred CCHHHHHHHHHHHHHHHHH-------hhhccccCC--CCCCC----------------------------CCCCCCCcCC Q lcl|NC_021326. 402 EDLQAELERIEQEQMEYNK-------QLPNLDDGG--ADGAQ----------------------------QKERSNDKQS 444 (445) Q Consensus 402 ~d~~~E~~ri~~E~~~~~~-------~~~~~~~~~--~~~~~----------------------------~~~~~~d~~~ 444 (445) +|..++.+.|++|..+-.- ........+ +++.+ +...+.|..+ T Consensus 444 dei~~q~e~ie~E~~~~~~~~~~~~~e~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~g~~ 523 (533) T protein:vir:58 444 YDLKPQEEVAEAAGGGGLFDTGGFGEETTPADFLGERGSPIESPRGRTEFDFGTEGGEELGGELNLGGAFEEFEEETGGG 523 (533) T ss_pred hhhhHHHHHHHHhhcCCCCCCCCcccccCCcccCccccCcccCCCChhhHhcccCCcccccccccccccchhhhhhcCCc Confidence 4444555566666432110 000110000 00000 0000000000 Q ss_pred C Q lcl|NC_021326. 445 E 445 (445) Q Consensus 445 ~ 445 (445) | T Consensus 524 ~ 524 (533) T protein:vir:58 524 E 524 (533) T ss_pred c Confidence 0 No 240 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=90.26 E-value=0.022 Score=29.69 Aligned_cols=330 Identities=10% Similarity=0.010 Sum_probs=135.5 Q ss_pred ChHHHHHHHHHHHH-HHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLEKLP-EISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~~~~-~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 79 (445) ++..+ +.+.. ....+.....+.. ... ..... ....+ +.+.=.-.+|+..++-+-+-|+. ++. T Consensus 3 ~~~~f----~~r~~~~~~~~~~~~~~~~-----~~~-~~~~v-~~~~a---l~~~av~~cv~~ia~~ia~~p~~---~~~ 65 (359) T protein:vir:10 3 ILNPF----ERRSSITPNNYYPFMVQNG-----SIV-PNSLV-DATEA---LKNSDLYAVTSLISSDIAGTRFI---GNQ 65 (359) T ss_pred ccchh----hccccCCCCcchhhhhccc-----ccc-CCccc-CHHHh---hcchHHHHHHHHHHHhhhcCccc---cch Confidence 22211 10000 0000000000000 000 00000 00000 11111123556666555555552 222 Q ss_pred HHHHHHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCCCcEE-EEEEccceeEEEEcCCCCCceEEEEEEEeeecce Q lcl|NC_021326. 80 EVIKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFK-LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENET 157 (445) Q Consensus 80 ~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~-i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~ 157 (445) ....++..=.. ..-..+...+....+.+|.+|+.+..+..|.+. +..++|..+.+..+++ .+.+ +++...... T Consensus 66 ~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v~i~~~~~---~~~y--~~~~~~~~~ 140 (359) T protein:vir:10 66 VFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIPSNAITIDLTDD---TLTY--EVNQFDDYP 140 (359) T ss_pred HHHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeCCceEEEEEcCC---eEEE--EEEecCCce Confidence 22222221110 123445566777888899999999888888754 6778887776644432 1111 111000000 Q ss_pred eEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec---------CCCCcCccHHHHHHHHHHHHHH Q lcl|NC_021326. 158 KVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK---------NNDLEISDIFMYKTLIDAYNRR 228 (445) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~---------n~~~g~s~~~~v~~lid~~~~~ 228 (445) ...+.. . -|+|++ +...|.|.++.+...+.....+ T Consensus 141 -~~~~~~---------------------------------~--evih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~ 184 (359) T protein:vir:10 141 -SAKYNA---------------------------------S--EMIHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEA 184 (359) T ss_pred -EEEEcc---------------------------------c--ceEEeccCCCCCCccCccccccHHHHHHHHHHHHHHH Confidence 000000 0 022221 1224777777666666666555 Q ss_pred HHHHHHHHHHhcCCeeEEecC----CcccchhHHHhhh-------hCceeeccCCCceeeEeccCChHHHHHHHHHHHHH Q lcl|NC_021326. 229 LSDLSNTFKDSNELTYVLTNY----DDQELPEFKRLLR-------YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQK 297 (445) Q Consensus 229 ~s~~~~~~~~~~~~~l~~~g~----~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~ 297 (445) .....+.+...+.|-.++.-. +.+.....+..+. .++++.++++.+.+.+........+.+..+..... T Consensus 185 ~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~ 264 (359) T protein:vir:10 185 NRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSIRKEFEKANGGNNSGRVMVLDQSADFSTVSINADVANYLNSMNWGRTQ 264 (359) T ss_pred HHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCceecCCCcceeeecCCHHHHHHHHHHHHHHHH Confidence 555555666667776665421 1111122222221 12355566665555554333333455666667777 Q ss_pred HHHHhCcccccccccc-CcchHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCCCH Q lcl|NC_021326. 298 IMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLN-LKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVANT 375 (445) Q Consensus 298 i~~~s~~p~~~~~~~~-~~~Sg~Ai~~~~~~l~-~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~ 375 (445) |+..-++|....+..+ .+.+...++..+.... ..+.- +...|.+. +.+- +.+......-.|. T Consensus 265 Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~~l~~~l~p----~~~~l~~~---l~~~---------~~~~~~~~~~~d~ 328 (359) T protein:vir:10 265 IAKAFGVSDSYLNGTGDQQSSLDQIKDLYVNALNRFIEP----LISELRIK---CDSS---------IGVDMSPITDYSN 328 (359) T ss_pred HHHHhCCCHHHhCCCCcccccHHHHHHHHHHHHHHHHHH----HHHHHHHH---hhhh---------hcccchhhhhcCH Confidence 8888888865443322 2233333433222111 11111 11111111 1100 1111111111223 Q ss_pred HHHHHHHHHH--hccCChHHHHHhCCCCCCHH Q lcl|NC_021326. 376 ELQVQTAQQS--MGIVSHETVLENHPFVEDLQ 405 (445) Q Consensus 376 ~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~ 405 (445) ......+.++ +|+++.-.++++++.-. +. T Consensus 329 ~~~~~~~~~~~~~G~~t~NE~R~~l~~~p-v~ 359 (359) T protein:vir:10 329 SVFKADILNWVKEGIIEPTEAKTLLESKG-II 359 (359) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCC-CC Confidence 3333334443 58999999888763210 11 No 241 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=90.13 E-value=0.022 Score=29.62 Aligned_cols=364 Identities=9% Similarity=-0.000 Sum_probs=161.2 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcccccccc---cc---------ccccccc-ccccccc-ccchHHHHHHHHHhhhhccC Q lcl|NC_021326. 6 IKQHLEKLPEISIGQEYYEQRPDIVKEPKP---VD---------ATGAVDP-LKPDDRM-ITNFHANLVDQKVSYIVGKP 71 (445) Q Consensus 6 i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~---~~---------~~~~~~~-~~~~~ri-~~n~~~~iv~~~~~~l~g~~ 71 (445) +.-..+. +=.+...+|..+......... .. ....... ....+++ ..+....+|+..++-+..-| T Consensus 1 ~~~~~~~--~~~~~m~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cI~~ia~~ia~~~ 78 (413) T protein:vir:96 1 MPGVSEI--RKDKNLKFFNNKRSPTEESKAKDEIPKAPQVVMTLPNFFKELISDGYTKLSDSPEVRMAVDCIADLVSNMT 78 (413) T ss_pred CCccchh--hhhhcCCccccCCCcchhhhhhccccccccccccchhhHhhhccchhHHHhhchHHHHHHHHHHHhhccCc Confidence 1111111 111122344443211100000 00 0000000 0000111 13455667777777776667 Q ss_pred eeecc--Cc--hHHHHHHHHHh-c--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCc-E-EEEEEccceeEEEEcCC Q lcl|NC_021326. 72 IAFKH--TD--DEVIKRIDEVL-G--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGE-F-KLFRVPAEQGIPIWTDK 139 (445) Q Consensus 72 ~~~~~--~d--~~~~~~l~~~~-~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~-~-~i~~~~p~~~~~v~d~~ 139 (445) +.+-- .+ ......+...+ . | ....+...+..+.+.+|.+|+.+..+.+|. + .+..++|..+.+.++++ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~~~~L~~l~~~~v~~~~~~~ 158 (413) T protein:vir:96 79 IQLMQNGETGDKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDKIIGLTPISPYKVTFNVSDD 158 (413) T ss_pred eEEEEecCCCccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCceEEEEEecCceeEEEEcCC Confidence 76521 11 11111222222 1 2 245667778889999999999999988874 3 57888998887766542 Q ss_pred CCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC------CCCcCc Q lcl|NC_021326. 140 EHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN------NDLEIS 213 (445) Q Consensus 140 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n------~~~g~s 213 (445) .+.+.+ .. .. ... ... -|+|++. .-.|.| T Consensus 159 ---~~~y~~-----~~-----------------~~-~~~-----------------~~~--evih~k~~~~~~~~~~G~s 193 (413) T protein:vir:96 159 ---DLDYSI-----TF-----------------DN-KEY-----------------DPS--TLLHFVLNPSIERPFIGTG 193 (413) T ss_pred ---eEEEEE-----ee-----------------cC-cEE-----------------chh--hEEEEeccCCCCCcccccc Confidence 111100 00 00 000 000 1344431 124777 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCC---cccchhHHHhhh--------hCceeeccCCCc-eeeEe-c Q lcl|NC_021326. 214 DIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLLR--------YYGAIKVSDNGG-VDTIQ-V 280 (445) Q Consensus 214 ~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~~--------~~~~~~~~~~~~-~~~l~-~ 280 (445) .+..+...+.............+...+.|-.++.... .+........+. .++++.+++++. .+-+. . T Consensus 194 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~~~~~ 273 (413) T protein:vir:96 194 YKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDSDSDELSDEEGRENFEEMYLKRKEAGKPWIIPEGMVNVQQIKPL 273 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcCccccCceeeecCCcccccccccC Confidence 7776666666555555555566677777766655322 111122222221 122344444432 22111 1 Q ss_pred cCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC--C Q lcl|NC_021326. 281 EVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK--G 358 (445) Q Consensus 281 ~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~--~ 358 (445) ......+.+..+.....|+..-++|....+. +.+.+..+.. .+...|.-+++.+...+... . T Consensus 274 ~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~-~~~~~~~~~~---------------~~~~~l~P~~~~ie~~ln~~ll~ 337 (413) T protein:vir:96 274 TLNDLAINDAVTLDKKTVAGIFGVPAFLLGV-GTYNKDEFNN---------------FINTKIMSIAQVIQQTYNKLIVE 337 (413) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHcCC-CcchHHHHHH---------------HHHHHHHHHHHHHHHHHHHhhCC Confidence 2233445555666677788777887644421 1112222221 22223333333333222211 1 Q ss_pred CcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCC Q lcl|NC_021326. 359 EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQK 436 (445) Q Consensus 359 ~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 436 (445) +...+++.++..+..|..+.++++.++ .|+++.-.++++++.-.. +..+.+. .. .+... ..+.+.+. T Consensus 338 ~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~--~~gd~~~------~~--~n~~~-~~~~~~~~ 406 (413) T protein:vir:96 338 EDMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNEFRNWVGMPPD--AEMDDLL------VL--ENYLQ-QKDLVNQK 406 (413) T ss_pred CCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcceee------ec--ccccc-hhhccccc Confidence 223455556677778899999988876 589999999998866321 1111000 00 00000 00111111 Q ss_pred CCCCCcC Q lcl|NC_021326. 437 ERSNDKQ 443 (445) Q Consensus 437 ~~~~d~~ 443 (445) ...++++ T Consensus 407 ~~~~~dt 413 (413) T protein:vir:96 407 KLIQDET 413 (413) T ss_pred CCCCCCC Confidence 1111111 No 242 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=90.09 E-value=0.023 Score=29.59 Aligned_cols=356 Identities=10% Similarity=0.059 Sum_probs=152.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCc-h Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTD-D 79 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d-~ 79 (445) |+.++-+........-.-....+.+.... .+.. .-..+-+.+.-...+|+..++-+..-|+.+...+ + T Consensus 3 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~--vt~~~al~~~~v~~~i~~Ia~~iA~lp~~v~~~~g~ 71 (394) T protein:vir:62 3 LRDRFSNYLFKKAEKRGYLDNVLGKSIRY---------SGVY--VTDSNILQSSDVYELLQDISNQMVLADIVVEDEFGN 71 (394) T ss_pred hhhhhhhhccCCCCchhhhhhhhhccccc---------Cccc--cChhhhhccHHHHHHHHHHHHhhcccceEEEcCCCc Confidence 55555433211111000111122121100 0000 0000112234456677777777766677653322 1 Q ss_pred HHH-HHHHHHhc--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEee Q lcl|NC_021326. 80 EVI-KRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKL 153 (445) Q Consensus 80 ~~~-~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~ 153 (445) ... ..+-.++. | ........+..+.+.+|.+|+++..+..+ . +..+.+..++. . ..+|.. T Consensus 72 ~~~~~~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~~~-----~--~~~~~~~~~~~--~-----~~~~~~ 137 (394) T protein:vir:62 72 EIKDDIALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPILNGAQIH-----L--ASNVFTELDDN--L-----VEHFNI 137 (394) T ss_pred ccchhhHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEecceee-----c--cccceEEECCc--e-----EEEEee Confidence 111 11222222 3 23456667788899999999987432211 1 12233332221 0 000000 Q ss_pred ecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCcCccHHHHHHHHHHHHHHH Q lcl|NC_021326. 154 ENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLEISDIFMYKTLIDAYNRRL 229 (445) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g~s~~~~v~~lid~~~~~~ 229 (445) .... |..=-|++++ +...|.|.+......++...... T Consensus 138 --------------------~~~~-------------------~~~~eiih~r~~~~d~~~G~s~~~~~~~~i~~~~~~~ 178 (394) T protein:vir:62 138 --------------------GGHE-------------------IPPCMIRHVKNIGADHLRGKGILDLGRDTLEGVMSAE 178 (394) T ss_pred --------------------CCEE-------------------echhheEEecCcCCCCccccChHHHHHHHHHHHHHHH Confidence 0000 0000133332 12346777776666666555555 Q ss_pred HHHHHHHHHhcCCeeEEe--cCCcccc---hhHHHhh--------hhCceeeccCCCceeeEeccCC--hHHHHHHHHHH Q lcl|NC_021326. 230 SDLSNTFKDSNELTYVLT--NYDDQEL---PEFKRLL--------RYYGAIKVSDNGGVDTIQVEVP--VENSKKYLDEL 294 (445) Q Consensus 230 s~~~~~~~~~~~~~l~~~--g~~~~~~---~~~~~~~--------~~~~~~~~~~~~~~~~l~~~~~--~~~~~~~i~~l 294 (445) ......+...+.|-.++. +....+. +..+..+ ..+++..++.+.+.++.....+ ...+.+..+.. T Consensus 179 ~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~ 258 (394) T protein:vir:62 179 KTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLINAILDQLESIDEARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNVY 258 (394) T ss_pred HHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHHHHHHHHhccccccCceeEeeCCCceeEEecCCCcchHHHHHHHHHH Confidence 455555566677755544 3322111 1122111 1133445666777777665543 33455566667 Q ss_pred HHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhccCCCcceEEEEeCC Q lcl|NC_021326. 295 YQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFE-----HFDIKGEHKDVDISFNY 369 (445) Q Consensus 295 ~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~-----~~~~~~~~~~i~v~f~~ 369 (445) ...|+..-++|....+... .++. +.. ....+...|.-++..+.. ++.. .+...+.+.|+. T Consensus 259 ~~~Ia~~fgVPp~~lg~~~-~sn~---e~~----------~~~~~~~~l~P~~~~ie~~l~~kll~~-~~~~~~~~~fd~ 323 (394) T protein:vir:62 259 KKDLGKFLGINVDTYTELI-KEDI---EKA----------MMYIHNKAVRPIMKNFEDHLSLLFYAQ-NSGKRIKFKINI 323 (394) T ss_pred HHHHHHHhCCCHHHcCCCC-CcCH---HHH----------HHHHHHHHHHHHHHHHHHHHhhhhcCc-cccCceEEEech Confidence 7788888888865443211 1111 111 111122223222222222 1222 223457788877 Q ss_pred CCCCCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 370 NKVANTELQVQTAQQS--MGIVSHETVLENHPFV--EDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 370 ~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ....+....++++.++ .|+++.-.++++++.- +++. .+++.- . .+...-+...+.++...+++++| T Consensus 324 ~~~~~~~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~~--gd~~~~------~--~n~~~~~~~~~~~~~~kgge~~e 393 (394) T protein:vir:62 324 LDFVTYSNKTNIGYNLVRTAITSPDNVADMLGFPKQNTKE--SQAIYI------S--NDVTEIGKKEATDGSLGGGEENE 393 (394) T ss_pred hhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--CCeeec------c--cccccccccccccccCCCCCCCC Confidence 6666677777877776 4899999999888552 2211 111100 0 00000001111111222222222 No 243 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=89.82 E-value=0.024 Score=29.45 Aligned_cols=380 Identities=14% Similarity=0.128 Sum_probs=156.2 Q ss_pred ChHHHH------HHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhh-hhccCee Q lcl|NC_021326. 1 MIVRYI------KQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY-IVGKPIA 73 (445) Q Consensus 1 ~l~~~i------~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~-l~g~~~~ 73 (445) +...++ +--.+-+.+|..+..+++-+.. ...||+..+-+ -..+|+. T Consensus 53 ~~~~~~d~~~~~~~~~~LI~~YR~ma~~pEvd~A---------------------------v~eIvneaiv~d~~~~pV~ 105 (516) T protein:vir:10 53 LMQQFFGIDNNISGTKDLINTYRQLTNNPEVERA---------------------------VANIVNEAVVYEKGHKVVS 105 (516) T ss_pred eeeeeecccCccccHHHHHHHHHHhhhccchhHH---------------------------HHHhhcceeEecCCCceEE Confidence 111111 0011223344444444433321 12233333222 2345666 Q ss_pred eccCchHHHHHH--------HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEEC--CCCcEEEEEEccceeEEEEcCCCCC Q lcl|NC_021326. 74 FKHTDDEVIKRI--------DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLD--EEGEFKLFRVPAEQGIPIWTDKEHE 142 (445) Q Consensus 74 ~~~~d~~~~~~l--------~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d--~~g~~~i~~~~p~~~~~v~d~~~~~ 142 (445) +..++-++.+.+ +.++. =+|+.+.++..+...+.|+.|+....| ++|-..+..+||+.+..+.- T Consensus 106 l~l~~~e~s~sik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~GI~elr~lDPr~i~~vR~----- 180 (516) T protein:vir:10 106 LDLDDTEFSSSIKDKILEEFDEICRLLDASRKLDTLFRRWYIDSRIFFHKIMPNPKEGIVELRRLDPRHVEYYRE----- 180 (516) T ss_pred EEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhhhhcceEEEEEEecCcccceeeeeeeCCcceeeEEe----- Confidence 655543333322 22221 167788889999999999999886665 34656788999988765432 Q ss_pred ceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc---eEEecCCC---CcCccHH Q lcl|NC_021326. 143 ELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIPFKNND---LEISDIF 216 (445) Q Consensus 143 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~n~~---~g~s~~~ 216 (445) ...+...+..+.... ..+|.+..+.......+.. ..++.--+|| +++...+- .+...+. T Consensus 181 --------i~~~~~~~~~v~~~~-~e~~~Y~~~~~~~~~~g~~------~~~~~~ikI~~daI~y~hSGl~d~~~~~i~s 245 (516) T protein:vir:10 181 --------IVTSDVGGTSVVKGY-REFFVYTTGNEGYAYNGRL------FEPNTRIKIPRSAIVYAHSGLQDCSDRGIVG 245 (516) T ss_pred --------eecccCcchhhhhce-eeeeeeecCccceeccccc------cCCCCceecchhheeeeecCcccCCCCceec Confidence 111111111111100 0111111111000000000 0111111333 22222111 0111122 Q ss_pred HHHHHHHHHHH--HHHHHHHHHHHhcCCeeEEecCCcccc-----hhHHHhh----hh--------------------Cc Q lcl|NC_021326. 217 MYKTLIDAYNR--RLSDLSNTFKDSNELTYVLTNYDDQEL-----PEFKRLL----RY--------------------YG 265 (445) Q Consensus 217 ~v~~lid~~~~--~~s~~~~~~~~~~~~~l~~~g~~~~~~-----~~~~~~~----~~--------------------~~ 265 (445) -+...+..+|. ++-|.+...+..+.|-+=+.-.+..+. +...+.+ +. -. T Consensus 246 yLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TGev~ddrk~msMlE 325 (516) T protein:vir:10 246 YLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTE 325 (516) T ss_pred eehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHh Confidence 23333444443 345556666666666432221121111 1111111 00 00 Q ss_pred eeecc--CCC-ceeeEeccC--ChHHHHHHHHHHHHHHHHHhCccc--ccccc-cc-CcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 266 AIKVS--DNG-GVDTIQVEV--PVENSKKYLDELYQKIMLFGQAVD--FSSDK-FG-SAPSGVALEFLYTNLNLKADKLA 336 (445) Q Consensus 266 ~~~~~--~~~-~~~~l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~--~~~~~-~~-~~~Sg~Ai~~~~~~l~~k~~~~~ 336 (445) -+++| +|+ +.+.-|.++ +... ..-+.-+++.++..-.+|- +..+. +. +.-.+..|-.-+.....-+.+.+ T Consensus 326 DyWLpRReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR 404 (516) T protein:vir:10 326 DYWLMRRDGKSVTEVTSLPGAQTMGE-MDDVRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDELDFRKFIVQLQ 404 (516) T ss_pred hhcccccCCCcccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHH Confidence 12232 122 223333333 3333 3346667777888777773 21111 11 01122334333333445556667 Q ss_pred HHHHHHHHHHHHHHHHHhccC--CCc----ceEEEEeCCCCCCCHHHHHHH-------HHHH---hc-cCChHHHHHhCC Q lcl|NC_021326. 337 RKAKVAIQELLWFVFEHFDIK--GEH----KDVDISFNYNKVANTELQVQT-------AQQS---MG-IVSHETVLENHP 399 (445) Q Consensus 337 ~~~~~~l~~~~~~~~~~~~~~--~~~----~~i~v~f~~~~p~d~~~~~~~-------~~~~---~g-~~s~et~l~~l~ 399 (445) ..|...+..+|+.=+-+-|+- .++ ..|.+.|...-.-.+...++. +..+ +| .+|++++++.+- T Consensus 405 ~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~~yi~k~IL 484 (516) T protein:vir:10 405 HNFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKYVSHDYVMKNIL 484 (516) T ss_pred HHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHh Confidence 777777777666433333322 222 356778855444334444433 3333 24 699999998764 Q ss_pred CCCC--HHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 400 FVED--LQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 400 ~~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) -.+| ..+|-+.|++|..+- -+.+ ++++.| T Consensus 485 r~tDeei~~~~k~I~~E~~~~-----~~~~------------p~~e~~ 515 (516) T protein:vir:10 485 QMTDEQIAQEEKQIEKEANVK-----RFQN------------PENEDD 515 (516) T ss_pred cCCHhHHHHHHHHHHHhhhCC-----CCCC------------CCcccc Confidence 4443 334444555444321 1111 111111 No 244 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=89.57 E-value=0.026 Score=29.31 Aligned_cols=323 Identities=12% Similarity=0.084 Sum_probs=123.3 Q ss_pred HHHHHHHHHHhcCCCcccccccccccccccccccccccc--ccchHHHHHHHHHhhhhccCeee-cc-Cc--------hH Q lcl|NC_021326. 13 LPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRM--ITNFHANLVDQKVSYIVGKPIAF-KH-TD--------DE 80 (445) Q Consensus 13 ~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri--~~n~~~~iv~~~~~~l~g~~~~~-~~-~d--------~~ 80 (445) ..=+.+...+-.+.-+- ..... .......+ .......+|+..++-+..-|+.+ .- .+ +. T Consensus 1 Mg~f~~~~~~~~~~~~~---~~~~~------~~~~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~~~~~ 71 (378) T protein:vir:16 1 MNLFGKVVSFSRGKLNN---DTQRV------TAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISM 71 (378) T ss_pred CccchhhhhhhcccccC---Cccee------eecccchhhHHHHHHHHHHHHHHhhhhhCceeEEEEccccccccccccc Confidence 00000011111110000 00000 00000011 12233445666666655556643 10 00 11 Q ss_pred HHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECC-CCcEEEEEEccceeEEEEcCCCCCceEEEEEEEee Q lcl|NC_021326. 81 VIKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLDE-EGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKL 153 (445) Q Consensus 81 ~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d~-~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~ 153 (445) ....+...+. | ....+...+..+.+.+|.+|++...+. .|++. .+-|.... T Consensus 72 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~--~l~~~~~~-------------------- 129 (378) T protein:vir:16 72 AGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELL--DLLFADDK-------------------- 129 (378) T ss_pred ccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEE--EEEecCCe-------------------- Confidence 1122333332 2 345566677888999999998754432 23321 11111000 Q ss_pred ecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 154 ENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLS 233 (445) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~ 233 (445) . .|. .. -|+|+++.-.+.....++..+.++++..++ T Consensus 130 -----~---------~~~-------------------------~~--diih~r~~~~~~~~~s~l~~~~~~i~~~~~--- 165 (378) T protein:vir:16 130 -----K---------EYK-------------------------PE--ELVRLTSPFYINEDTSILDNALASIQTKLE--- 165 (378) T ss_pred -----e---------Eec-------------------------cc--ceEEecCccCccchhHHHHHHHHHHHHHHh--- Confidence 0 000 00 033333222222223334444444443322 Q ss_pred HHHHHhcCC--eeEEecC-Ccccchh----HHHhh-------hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHH Q lcl|NC_021326. 234 NTFKDSNEL--TYVLTNY-DDQELPE----FKRLL-------RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIM 299 (445) Q Consensus 234 ~~~~~~~~~--~l~~~g~-~~~~~~~----~~~~~-------~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~ 299 (445) .+.+ ++...+. +.+...+ +...+ ...+++.++++.+.+.++.+.....+ ..++.+.+.|+ T Consensus 166 -----~~~~~g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia 239 (378) T protein:vir:16 166 -----QGKLRGLLKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELL 239 (378) T ss_pred -----cCccceeeEeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhH-HHHHHHHHHHH Confidence 1222 2222222 1111111 11111 12245666666665555443333333 44566777788 Q ss_pred HHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-------------CCcceEEEE Q lcl|NC_021326. 300 LFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK-------------GEHKDVDIS 366 (445) Q Consensus 300 ~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~-------------~~~~~i~v~ 366 (445) ..-++|..-. .+..+... ....+...|.-.+..+...+..+ ....++.+. T Consensus 240 ~~fgVPp~~l---~g~~~e~~--------------~~~f~~~tl~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~ 302 (378) T protein:vir:16 240 TGYFMNENIL---LGTASQEQ--------------QIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVD 302 (378) T ss_pred HHhCCCHHHh---cCCchHHH--------------HHHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeec Confidence 8878875332 12111111 11223334444443333322211 112345566 Q ss_pred eCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHH-----HHHHHHHHHHHHHHhhhccccCCCCCCCCCC Q lcl|NC_021326. 367 FNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFV--EDLQA-----ELERIEQEQMEYNKQLPNLDDGGADGAQQKE 437 (445) Q Consensus 367 f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~--~d~~~-----E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 437 (445) +......|..+.++.+.++ .|+++.-.++++++.- ++-+. -+..+. . ..+.. .+..++..++ T Consensus 303 ~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~--~------~~~~~-~~~~~~~~~~ 373 (378) T protein:vir:16 303 NQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVYIANLNAVAVK--N------LSDLQ-GSRKDVTSTD 373 (378) T ss_pred cchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeecccccccc--c------hhhhc-CccCCCCCCC Confidence 6777778899999988876 4899999998887542 11100 001110 0 00000 0000001111 Q ss_pred CCCCc Q lcl|NC_021326. 438 RSNDK 442 (445) Q Consensus 438 ~~~d~ 442 (445) +.+++ T Consensus 374 e~~ne 378 (378) T protein:vir:16 374 ETNNQ 378 (378) T ss_pred CCCCC Confidence 11111 No 245 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=89.14 E-value=0.028 Score=29.09 Aligned_cols=382 Identities=15% Similarity=0.130 Sum_probs=162.4 Q ss_pred ChHHH-------HHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhh-hhccCe Q lcl|NC_021326. 1 MIVRY-------IKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY-IVGKPI 72 (445) Q Consensus 1 ~l~~~-------i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~-l~g~~~ 72 (445) +-..+ ++.-.+.+.+|..+..+++-+.. ...||+..+-+ -..+|+ T Consensus 57 ~~~~~~g~~e~~~~~~~eLI~~YR~ma~~pEvd~A---------------------------v~eIVneaiv~d~~~~pV 109 (524) T protein:vir:10 57 AFQTIFGSYEPGMKTTRELIDTYRNLMNNYEVDNA---------------------------VSEIVSDAIVYEDDTEVV 109 (524) T ss_pred eeeehhcccccccchHHHHHHHHHHHhhccchhhH---------------------------HHHhhcceeEecCCCceE Confidence 11111 11223344455555444443322 12233332222 133556 Q ss_pred eeccCchHHHHH--------HHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC----CcEEEEEEccceeEEEEcCC Q lcl|NC_021326. 73 AFKHTDDEVIKR--------IDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE----GEFKLFRVPAEQGIPIWTDK 139 (445) Q Consensus 73 ~~~~~d~~~~~~--------l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~----g~~~i~~~~p~~~~~v~d~~ 139 (445) .+..++.++.+. .+.++. =+|+.+.++..+...+.|+.|+...+|.+ |-..+..+||+.+-.|.. T Consensus 110 ~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~-- 187 (524) T protein:vir:10 110 ALNLDKSKFSPKIKNMMLDEFNDVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVRE-- 187 (524) T ss_pred EEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEeeCCCccccceeeeeeCCccceeeee-- Confidence 665544332222 222221 16788888999999999999999888743 666788899988744322 Q ss_pred CCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc---eEEecCCC---CcCc Q lcl|NC_021326. 140 EHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIPFKNND---LEIS 213 (445) Q Consensus 140 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~n~~---~g~s 213 (445) ...+...+..+... ...+|.+..........+.. . .++.--+|| |++....- .+.- T Consensus 188 -----------i~~~~~~~~~vi~~-~~e~f~Y~~~~~~y~~~g~~--~----~~~~~ikI~~dAI~y~hSGL~d~~~~~ 249 (524) T protein:vir:10 188 -----------IITETEAGTKIVKG-YKEYFIYDTAHESYACDGRM--Y----EAGTKIKIPKAAIVYAHSGLVDCCGKN 249 (524) T ss_pred -----------eccCCCccchhhcc-hhhheeeccCccccccCccc--c----CCCcceecchhheeeeeccceeCCCCc Confidence 11111222211111 11122222221111111100 0 001111333 23322111 1111 Q ss_pred cHHHHHHHHHHHHH--HHHHHHHHHHHhcCCeeEEecCCcccc-----hhHHHhh----hh------------------- Q lcl|NC_021326. 214 DIFMYKTLIDAYNR--RLSDLSNTFKDSNELTYVLTNYDDQEL-----PEFKRLL----RY------------------- 263 (445) Q Consensus 214 ~~~~v~~lid~~~~--~~s~~~~~~~~~~~~~l~~~g~~~~~~-----~~~~~~~----~~------------------- 263 (445) .+.-+...+..+|. ++-|.+...+..+.|-+=+.-.+..+. +...+.+ +. T Consensus 250 i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~ms 329 (524) T protein:vir:10 250 IIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMS 329 (524) T ss_pred eeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhh Confidence 12223333444443 345556666666666432221111111 1111111 00 Q ss_pred -Cceeecc--CCC-ceeeEeccC--ChHHHHHHHHHHHHHHHHHhCccc--cccccccC-c-chHHHHHHHHHHHHHHHH Q lcl|NC_021326. 264 -YGAIKVS--DNG-GVDTIQVEV--PVENSKKYLDELYQKIMLFGQAVD--FSSDKFGS-A-PSGVALEFLYTNLNLKAD 333 (445) Q Consensus 264 -~~~~~~~--~~~-~~~~l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~--~~~~~~~~-~-~Sg~Ai~~~~~~l~~k~~ 333 (445) -.-+++| +|+ +.+.-|.++ +... ..-+.-+++.++..-.+|- +..+..++ + -.|..|-..+.....-+. T Consensus 330 MlEDyWLpRReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~ 408 (524) T protein:vir:10 330 MTEDYWLQRRDGKAVTEVDTLPGADNTGN-MEDVRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIR 408 (524) T ss_pred hHhhhcccccCCCcccceeeccccCCcCh-HHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHH Confidence 0011232 122 223333332 3333 3336667777777777763 21121111 0 123334344444556667 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccC--CCc----ceEEEEeCCCCCCCHHHHHHH-------HHHHh---c-cCChHHHHH Q lcl|NC_021326. 334 KLARKAKVAIQELLWFVFEHFDIK--GEH----KDVDISFNYNKVANTELQVQT-------AQQSM---G-IVSHETVLE 396 (445) Q Consensus 334 ~~~~~~~~~l~~~~~~~~~~~~~~--~~~----~~i~v~f~~~~p~d~~~~~~~-------~~~~~---g-~~s~et~l~ 396 (445) +.+..|...+.++|+.=+-+-|+- .++ ..|.+.|...-.-.+...++. +..+. | .+|++++++ T Consensus 409 rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k 488 (524) T protein:vir:10 409 ELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLTMAEPFIGKYISHRTAMK 488 (524) T ss_pred HHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHH Confidence 778888888888877544443432 222 457788865444334444433 33332 3 479999998 Q ss_pred hCCCCCC--HHHHHHHHHHHHHHHHHhhhccccCCCCC Q lcl|NC_021326. 397 NHPFVED--LQAELERIEQEQMEYNKQLPNLDDGGADG 432 (445) Q Consensus 397 ~l~~~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 432 (445) .+--.+| ..++-+.|++|..+- ..++......+. T Consensus 489 ~ILr~tDeei~~~~k~I~~E~k~~--~~~~~~~~~~~f 524 (524) T protein:vir:10 489 DILQMTDEEIEQEAKQIEEESKEA--RFQDPDQEQEDF 524 (524) T ss_pred HHhccCHHHHHHHHHHHHHHhhcC--CCCCCchhhhcC Confidence 7644443 233444444444321 111100000000 No 246 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=89.12 E-value=0.028 Score=29.08 Aligned_cols=382 Identities=15% Similarity=0.128 Sum_probs=162.5 Q ss_pred ChHHH-------HHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhh-hhccCe Q lcl|NC_021326. 1 MIVRY-------IKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY-IVGKPI 72 (445) Q Consensus 1 ~l~~~-------i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~-l~g~~~ 72 (445) +-..+ ++.-.+.+.+|..+..+++-+.. ...||+..+-+ -..+|+ T Consensus 57 ~~~~~~g~~e~~~~~~~eLI~~YR~ma~~pEvd~A---------------------------v~eIVneaiv~d~~~~pV 109 (524) T protein:vir:72 57 AFQTIFGSYEPGMKTTRELIDTYRNLMNNYEVDNA---------------------------VSEIVSDAIVYEDDTEVV 109 (524) T ss_pred eeeehhcccccccchHHHHHHHHHHHhhccchhhH---------------------------HHHhhcceeEecCCCceE Confidence 11111 11223344455555444443322 12233332222 133556 Q ss_pred eeccCchHHHHH--------HHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC----CcEEEEEEccceeEEEEcCC Q lcl|NC_021326. 73 AFKHTDDEVIKR--------IDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE----GEFKLFRVPAEQGIPIWTDK 139 (445) Q Consensus 73 ~~~~~d~~~~~~--------l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~----g~~~i~~~~p~~~~~v~d~~ 139 (445) .+..++.++.+. .+.++. =+|+.+.++..+...+.|+.|+...+|.+ |-..+..+||+.+-.|.. T Consensus 110 ~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~-- 187 (524) T protein:vir:72 110 ALNLDKSKFSPKIKNMMLDEFSDVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVRE-- 187 (524) T ss_pred EEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeeeCCccceeeee-- Confidence 665544332222 222221 16788888999999999999999888743 666788899988744322 Q ss_pred CCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc---eEEecCCC---CcCc Q lcl|NC_021326. 140 EHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIPFKNND---LEIS 213 (445) Q Consensus 140 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~n~~---~g~s 213 (445) ...+...+..+... ...+|.+..........+.. . .++.--+|| |++....- .+.- T Consensus 188 -----------i~~~~~~~~~vi~~-~~e~f~Y~~~~~~y~~~g~~--~----~~~~~ikI~~dAI~y~hSGL~d~~~~~ 249 (524) T protein:vir:72 188 -----------IITETEAGTKIVKG-YKEYFIYDTAHESYACDGRM--Y----EAGTKIKIPKAAVVYAHSGLVDCCGKN 249 (524) T ss_pred -----------eccCCCccchhhcc-hhhheeeccCccccccCccc--c----CCCcceecchhheeeeeccceeCCCCc Confidence 11111222211111 11122222221111111100 0 001111333 23322111 1111 Q ss_pred cHHHHHHHHHHHHH--HHHHHHHHHHHhcCCeeEEecCCcccc-----hhHHHhh----hh------------------- Q lcl|NC_021326. 214 DIFMYKTLIDAYNR--RLSDLSNTFKDSNELTYVLTNYDDQEL-----PEFKRLL----RY------------------- 263 (445) Q Consensus 214 ~~~~v~~lid~~~~--~~s~~~~~~~~~~~~~l~~~g~~~~~~-----~~~~~~~----~~------------------- 263 (445) .+.-+...+..+|. ++-|.+...+..+.|-+=+.=.+..+. +...+.+ +. T Consensus 250 i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~ms 329 (524) T protein:vir:72 250 IIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMS 329 (524) T ss_pred eeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhh Confidence 12223333444443 345556666666666432221111111 1111111 00 Q ss_pred -Cceeecc--CCC-ceeeEeccC--ChHHHHHHHHHHHHHHHHHhCccc--cccccccC-c-chHHHHHHHHHHHHHHHH Q lcl|NC_021326. 264 -YGAIKVS--DNG-GVDTIQVEV--PVENSKKYLDELYQKIMLFGQAVD--FSSDKFGS-A-PSGVALEFLYTNLNLKAD 333 (445) Q Consensus 264 -~~~~~~~--~~~-~~~~l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~--~~~~~~~~-~-~Sg~Ai~~~~~~l~~k~~ 333 (445) -.-+++| +|+ +.+.-|.++ +... ..-+.-+++.++..-.+|- +..+..++ + -.|..|-..+.....-+. T Consensus 330 MlEDyWLpRReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~ 408 (524) T protein:vir:72 330 MTEDYWLQRRDGKAVTEVDTLPGADNTGN-MEDIRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIR 408 (524) T ss_pred hHhhhcccccCCCcccceeeccccCCcCh-HHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHH Confidence 0011232 122 223333332 3333 3336667777777777763 21121111 0 123334344444556667 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccC--CCc----ceEEEEeCCCCCCCHHHHHHH-------HHHHh---c-cCChHHHHH Q lcl|NC_021326. 334 KLARKAKVAIQELLWFVFEHFDIK--GEH----KDVDISFNYNKVANTELQVQT-------AQQSM---G-IVSHETVLE 396 (445) Q Consensus 334 ~~~~~~~~~l~~~~~~~~~~~~~~--~~~----~~i~v~f~~~~p~d~~~~~~~-------~~~~~---g-~~s~et~l~ 396 (445) +.+..|...+.++|+.=+-+-|+- .++ ..|.+.|...-.-.+...++. +..+. | .+|++++++ T Consensus 409 rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k 488 (524) T protein:vir:72 409 ELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLTMAEPFIGKYISHRTAMK 488 (524) T ss_pred HHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHH Confidence 778888888888877544443432 222 457788865444334444433 33332 3 479999998 Q ss_pred hCCCCCC--HHHHHHHHHHHHHHHHHhhhccccCCCCC Q lcl|NC_021326. 397 NHPFVED--LQAELERIEQEQMEYNKQLPNLDDGGADG 432 (445) Q Consensus 397 ~l~~~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 432 (445) .+--.+| ..++-+.|++|..+- ..++......+. T Consensus 489 ~ILr~tDeei~~~~k~I~~E~k~~--~~~~~~~~~~~f 524 (524) T protein:vir:72 489 DILQMTDEEIEQEAKQIEEESKEA--RFQDPDQEQEDF 524 (524) T ss_pred HHhccCHHHHHHHHHHHHHHhhcC--CCCCCchhhhcC Confidence 7644443 233444444444321 111100000000 No 247 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=88.71 E-value=0.031 Score=28.88 Aligned_cols=400 Identities=12% Similarity=0.041 Sum_probs=153.8 Q ss_pred ChHHHHHHHHHHHHHHHH------HHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeee Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISI------GQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF 74 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~------~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~ 74 (445) +..=+-- .|+.++-+ .-.||+-.-+++. +......... .. ......-.+.+....+.+-+..+ T Consensus 7 ~~~gl~p---~rl~~i~~~~~~~~~~~~~~~~~~~Lr-----~~~~~~ly~~--m~-~D~hi~s~l~~Rk~av~~~~w~v 75 (488) T protein:vir:95 7 TQESLPP---FRMGEVGSLGLKVKNGRIYEEPRQALR-----FPESIKTFQL--MM-RDPAVAASVNIIKMFVRKVNWRF 75 (488) T ss_pred cCCCCCH---HHHHHHHHHhhccccchhhccchhhhc-----ccchHHHHHH--Hh-hChHHHHHHHHHHHHHhcCCceE Confidence 0000000 11111110 0112221101110 0000000000 00 13456667777777788888887 Q ss_pred ccCc-----h---HHHHHHHHHhcc---CHHHHHHHHHHHHHhcCeEE-EEEEECCCCcEEEEE-------EccceeEEE Q lcl|NC_021326. 75 KHTD-----D---EVIKRIDEVLGN---RFDDKLHSVLTGASNKGIEW-LHPYLDEEGEFKLFR-------VPAEQGIPI 135 (445) Q Consensus 75 ~~~d-----~---~~~~~l~~~~~n---~~~~~~~~~~~~~~~~G~~~-~~v~~d~~g~~~i~~-------~~p~~~~~v 135 (445) ...+ . +..+++++++.+ ++...+..+ .++..+|.++ +++|....+...... +.|..+.+ T Consensus 76 ~p~~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~-lda~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~~~~~i~~- 153 (488) T protein:vir:95 76 VPPKGKEQDPKMLERADFFNSLMDDMEHDWADFINSV-MSFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLIGWAKLPI- 153 (488) T ss_pred ecCCCCchhHHHHHHHHHHHHHHhccCccHHHHHHHH-HHhhcccceeeeeeeeccccccccccccccCCeeeeeeeee- Confidence 6421 1 134566777654 344555555 4788889865 566654322111100 11111111 Q ss_pred EcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeeccccccccccccccc-ccccccc----eEEec---- Q lcl|NC_021326. 136 WTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFST-GSWGKIP----FIPFK---- 206 (445) Q Consensus 136 ~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~g~iP----vv~~~---- 206 (445) .+. .-++.|..+.+....+.. ........ .......... ..-..+| +++.. T Consensus 154 -Rpq------~~~~~f~~d~d~~l~~~~-------~~~~~~~~------~~~~~~~~~~~~~~~~lP~~kfi~~~~~~~~ 213 (488) T protein:vir:95 154 -RNQ------STLDKWYFDEDFRRVTGV-------RQNLRNVS------HIAGAINLGERPLTRKLPRAKFMLFKYDDEY 213 (488) T ss_pred -cCc------ccccceeeccCCCceeec-------cccccccc------ccccccccccccccccccccceEEEeecCCC Confidence 100 000111111111100000 00000000 0000000000 0001234 22222 Q ss_pred CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCC---c-ccchhHH---Hhhh---------hCceeecc Q lcl|NC_021326. 207 NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD---D-QELPEFK---RLLR---------YYGAIKVS 270 (445) Q Consensus 207 n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~---~-~~~~~~~---~~~~---------~~~~~~~~ 270 (445) .++.|.|.+..+--.---=+..+..++.-++.+..|..+.+|.. . ....+.. ..+. ....+.++ T Consensus 214 g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag~iiP 293 (488) T protein:vir:95 214 GNPEGRSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRAGLIWP 293 (488) T ss_pred CccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchhheeec Confidence 34567888876544433334566777777888877877766631 1 1112111 1111 01123345 Q ss_pred CCCceeeE---------ecc-CChHHHHHHHHHHHHHHHHHhCcccccc--ccccCcchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 271 DNGGVDTI---------QVE-VPVENSKKYLDELYQKIMLFGQAVDFSS--DKFGSAPSGVALEFLYTNLNLKADKLARK 338 (445) Q Consensus 271 ~~~~~~~l---------~~~-~~~~~~~~~i~~l~~~i~~~s~~p~~~~--~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~ 338 (445) .+.++++. ... .+...+..+++.+.+.|...--.--++. ++.|+...|. ....-....+..-.+. T Consensus 294 ~g~~~~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~iLGqtLT~~~~~~Gs~Al~~---vh~ev~~~i~~aDa~~ 370 (488) T protein:vir:95 294 RYIDPDTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAFMSDVLAMGQSKYGSFSLAD---SKTSLLAMSVDILLKQ 370 (488) T ss_pred cccccccchhhhhhhccccccCCchhHHHHHHHHHHHHHHHHhccccccccCcchhhhHHH---HHHHHHHHHHHHHHHH Confidence 55444431 111 2334577778888777765432111111 1112111111 1122233333444555 Q ss_pred HHHHHH-HHHHHHHHHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHHh--cc-CC----hHHHHHhCCCCCCHHHHHHH Q lcl|NC_021326. 339 AKVAIQ-ELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSM--GI-VS----HETVLENHPFVEDLQAELER 410 (445) Q Consensus 339 ~~~~l~-~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~~--g~-~s----~et~l~~l~~~~d~~~E~~r 410 (445) +...+. +++.-++.+... ....-..+.|....+.|..+.++.+.+++ |+ ++ .+.+.+.++. +.++.. T Consensus 371 i~~tln~~li~~l~~~Nfg-~~~~~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gi-p~~~~~--- 445 (488) T protein:vir:95 371 IKNVINRDLVAQTYALNMW-DDEEHVQITYDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIGL-PPADES--- 445 (488) T ss_pred HHHHHHHHHHHHHHHhcCC-CCCCccEEEecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCC-CCCCCC--- Confidence 666664 466666555422 22233567888888888888888888874 65 55 3456666643 322110 Q ss_pred HHHHHHHHHHhhhc----cccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 411 IEQEQMEYNKQLPN----LDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 411 i~~E~~~~~~~~~~----~~~~~~~~~~~~~~~~d~~~~ 445 (445) |. ......+. ........+.......+.+.. T Consensus 446 ---e~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (488) T protein:vir:95 446 ---QP-VSEKLSPNSQSRSGDGYKTAGEGTAKTPSAKDP 480 (488) T ss_pred ---cc-ccccCCCCCCCCCCcccCCCcccCCcccccccc Confidence 00 00011000 000000011111111111111 No 248 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=88.58 E-value=0.031 Score=28.82 Aligned_cols=395 Identities=10% Similarity=-0.025 Sum_probs=157.5 Q ss_pred ChHHHHHHHHH----HHHHHHHHHH--------HhcCCCc--cc---c--ccccccccccccccccccccccchHHHHHH Q lcl|NC_021326. 1 MIVRYIKQHLE----KLPEISIGQE--------YYEQRPD--IV---K--EPKPVDATGAVDPLKPDDRMITNFHANLVD 61 (445) Q Consensus 1 ~l~~~i~~~~~----~~~~~~~~~~--------yy~G~~~--i~---~--~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~ 61 (445) ||.++...... ....+..... |..|-.. +. . ........+.... ...-+.++....+|+ T Consensus 3 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~g~~v~--~~~a~~~~~v~~~i~ 80 (466) T protein:vir:81 3 LIDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTELAPDTFVGLA--TQAYQANGPVFACML 80 (466) T ss_pred hhHHHhhccCcccccchhhhhhhhhhhhccccccccccccHHHHHhhccccccccCccccccc--hhhhhccHHHHHHHH Confidence 77777766531 1222211111 1111000 00 0 0000000010000 000122345566777 Q ss_pred HHHhhhhccCeeeccCch----HH-HHHHHHHhc--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCc---------E Q lcl|NC_021326. 62 QKVSYIVGKPIAFKHTDD----EV-IKRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGE---------F 122 (445) Q Consensus 62 ~~~~~l~g~~~~~~~~d~----~~-~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~---------~ 122 (445) ..+.-+..-|+.+.-.++ .. ...+..++. | ....+...+..+.+.+|.+|+.+..++.|. . T Consensus 81 ~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~~~g~~~ 160 (466) T protein:vir:81 81 VRQLVFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPDWVDVVV 160 (466) T ss_pred HHHHhhccCceEEEEecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCccccccccCccee Confidence 777777666776532111 11 111222332 2 244566677889999999999998877654 2 Q ss_pred EEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccce Q lcl|NC_021326. 123 KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPF 202 (445) Q Consensus 123 ~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv 202 (445) .+..++|..+.+..+..... ....++ ....... .. ....+..=-| T Consensus 161 ~l~~l~~~~v~~~~~~~~~~--~~~y~~-~~~~~~~-------~~-------------------------~~~~~~~~dv 205 (466) T protein:vir:81 161 EERMVRGGRGELGGGQLGWR--KVGYLY-TEGGRQS-------GN-------------------------ESVGFLAEDV 205 (466) T ss_pred EEEEecCcceEEEEcCCCce--EEEEEE-EecCccc-------cc-------------------------ceeeeccccE Confidence 35666676666555432111 111100 0000000 00 0000001113 Q ss_pred EEecC------CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecC---CcccchhHHHhhh--------hCc Q lcl|NC_021326. 203 IPFKN------NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNY---DDQELPEFKRLLR--------YYG 265 (445) Q Consensus 203 v~~~n------~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~---~~~~~~~~~~~~~--------~~~ 265 (445) +|++. ...|.|.+......++............+...+.|-.++.-. +.+....++..+. ..+ T Consensus 206 iHir~~~~~~d~~~G~s~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~ 285 (466) T protein:vir:81 206 VHFAPIPDPLASYRGMSWLTPILREIRADQAMSKHQAKFFDNGATVNLVIKHNPMADPAAVKKWADEVNSKHAGVDNAWK 285 (466) T ss_pred EEEcCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHHHHHHHHhcCcccccc Confidence 44432 124777777666666655555455555566777776665432 2222222222221 123 Q ss_pred eeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccccccc--cCcchHHHHHHHHHHHHHH-HHHHHHHHHHH Q lcl|NC_021326. 266 AIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKF--GSAPSGVALEFLYTNLNLK-ADKLARKAKVA 342 (445) Q Consensus 266 ~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~--~~~~Sg~Ai~~~~~~l~~k-~~~~~~~~~~~ 342 (445) ++.++++.+.+.++.......+.+..+.....|...-++|....+-. .+..++..++.....+... +.-....+... T Consensus 286 ~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~~~st~sn~eq~~~~f~~~tl~P~~~~ie~~ 365 (466) T protein:vir:81 286 NLNLYPGADADVVGSNLQEIDFKNVRGGGETRIAAAAGVPPVIVGLSEGLAAATYSNYGQARRRLADGTAHPLWQNLSGC 365 (466) T ss_pred ceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccCCCccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 55666666666665444455566677777888888888886544321 1122222232221111111 11112222222 Q ss_pred HHHHHHHHHHHhccCCCcceEEEEeC--CCCCCCHHHHHHH-------HHHH--hccCChHHHHHhCCCCCCHHHHHHHH Q lcl|NC_021326. 343 IQELLWFVFEHFDIKGEHKDVDISFN--YNKVANTELQVQT-------AQQS--MGIVSHETVLENHPFVEDLQAELERI 411 (445) Q Consensus 343 l~~~~~~~~~~~~~~~~~~~i~v~f~--~~~p~d~~~~~~~-------~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri 411 (445) +.+ . ++. ..+...+.+.|+ .-+-.|..+.+++ +..+ .|+ +...++..+++-+.. + + T Consensus 366 l~~---~---L~~-~~~~~~~~~~f~~~~llr~d~~~r~~~~~~~~~~~~~~~~~g~-t~nE~r~~~~~gd~~---~--~ 432 (466) T protein:vir:81 366 IGH---V---MPD-MGPDVRLWYDADDVPFLREDEKDAADIQKVRAETINTLITAGY-EPESVVAAVNSGDLR---L--L 432 (466) T ss_pred HHh---h---cCC-cccCcceEEEecchhhhccCHHHHHHHHHHHHHHHHHHHHcCC-ChhhccccccCCccc---c--c Confidence 211 1 111 112223445554 3444565555543 2222 243 444555443322110 0 0 Q ss_pred HHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 412 EQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 412 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ........+..+........+.+...+.+++.+. T Consensus 433 ~~~~~~~~~~~~~~~~~~~~~~~~~~~Gg~~ngn 466 (466) T protein:vir:81 433 KHTGLTSVQLLPPGVSASASSDTPTSGGADDNGN 466 (466) T ss_pred cCCCcchhhhcccccccccCCCCcccCCCCcCCC Confidence 0000000011111111111111111111111111 No 249 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=85.37 E-value=0.053 Score=27.56 Aligned_cols=323 Identities=13% Similarity=0.101 Sum_probs=122.9 Q ss_pred HHHhcCCCccccccccccccccccccccccccc--cchHHHHHHHHHhhhhccCeeec-c-Cc----h----HHHHHHHH Q lcl|NC_021326. 20 QEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMI--TNFHANLVDQKVSYIVGKPIAFK-H-TD----D----EVIKRIDE 87 (445) Q Consensus 20 ~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~--~n~~~~iv~~~~~~l~g~~~~~~-~-~d----~----~~~~~l~~ 87 (445) .-+|..-.. +.+. ................+. ......+|+..++-+..-|+.+- - .+ + .....+.. T Consensus 1 Mg~f~~~~~-f~~~-~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~~ 78 (378) T protein:vir:93 1 MNLFGKVVS-FSRG-KLNNDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLDE 78 (378) T ss_pred Cccchhhhh-hhcc-ccCCCcceeeecccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccccccccccccccchHHH Confidence 111111100 0000 000000000000001111 12334456666666666676531 1 10 0 01112223 Q ss_pred Hhc---c---CHHHHHHHHHHHHHhcCeEEEEEEEC-CCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEE Q lcl|NC_021326. 88 VLG---N---RFDDKLHSVLTGASNKGIEWLHPYLD-EEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 160 (445) Q Consensus 88 ~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d-~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 160 (445) .+. | ....+...+..+.+.+|.+|+++..+ ..|++.. +-| ++ . ..+ T Consensus 79 lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~~--l~~-------~~-~-----------------~~~ 131 (378) T protein:vir:93 79 VLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLD--LLF-------AD-D-----------------KKE 131 (378) T ss_pred HHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEE--EEe-------cC-C-----------------eeE Confidence 332 2 23455666788999999999865443 2233211 101 00 0 000 Q ss_pred EEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021326. 161 YWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSN 240 (445) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~ 240 (445) +. .. . |+++++.-.+.....++..+..+++...+ .+ T Consensus 132 ~~-~~---------------------------------d--iih~r~~~~~~~~~s~l~~~~~~i~~~~~--------~~ 167 (378) T protein:vir:93 132 YK-TE---------------------------------E--LVRLTSPFYINEDTSILDNALASIQTKLE--------QG 167 (378) T ss_pred ec-cc---------------------------------e--eEEecCccccchhhHHHHHHHHHHHHHHh--------cC Confidence 00 00 0 23333221222222233333344333222 12 Q ss_pred CCeeEE--ecCC-cccchhHH----Hhh-------hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccc Q lcl|NC_021326. 241 ELTYVL--TNYD-DQELPEFK----RLL-------RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVD 306 (445) Q Consensus 241 ~~~l~~--~g~~-~~~~~~~~----~~~-------~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~ 306 (445) .+-.++ .+.- .+...... ..+ ...+++.++++.+++.++.+.....+ ..++.+.+.|+..-++|. T Consensus 168 ~~~g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp 246 (378) T protein:vir:93 168 KLRGLLKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNE 246 (378) T ss_pred cccceeeeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCH Confidence 232222 2321 11111111 111 12245666666555555443333333 445667778888888875 Q ss_pred cccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-------------CCcceEEEEeCCCCCC Q lcl|NC_021326. 307 FSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK-------------GEHKDVDISFNYNKVA 373 (445) Q Consensus 307 ~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~-------------~~~~~i~v~f~~~~p~ 373 (445) .-. .+..|. .. ....+...|.-.++.+...+..+ ....++.+.+...+-. T Consensus 247 ~~l---~g~~~e----~~----------~~~f~~~tl~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~fd~~~l~~~ 309 (378) T protein:vir:93 247 NIL---LGTATQ----EQ----------QIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFA 309 (378) T ss_pred HHh---cCCcHH----HH----------HHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeeccchhhhc Confidence 332 121111 00 11233344444444443322211 1123355556667778 Q ss_pred CHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHH-----HHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCc Q lcl|NC_021326. 374 NTELQVQTAQQS--MGIVSHETVLENHPFV--EDLQA-----ELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDK 442 (445) Q Consensus 374 d~~~~~~~~~~~--~g~~s~et~l~~l~~~--~d~~~-----E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 442 (445) |..+.++.+.++ .|+++.-.++++++.- ++.+. -+..+.. ..+... +..+...+++.+++ T Consensus 310 d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~--------~~~~~~-~~~~~~~~~e~~n~ 378 (378) T protein:vir:93 310 TLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVYIANLNAVAVKN--------LSDLQG-SRKDVTSTDETNNQ 378 (378) T ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeeccccccccc--------hhhhcC-ccCCCCCCCCCCCC Confidence 899999998876 4899999998887542 11100 0000000 000000 00000111111111 No 250 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=84.98 E-value=0.056 Score=27.44 Aligned_cols=406 Identities=15% Similarity=0.130 Sum_probs=177.6 Q ss_pred ChHHHH-------HHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCee Q lcl|NC_021326. 1 MIVRYI-------KQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA 73 (445) Q Consensus 1 ~l~~~i-------~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~ 73 (445) +-+++. +++.+-..+...+.+-|.+...-. +. . +.| .|+.---|.++.--+.+.+|. T Consensus 16 la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~~~---~~---~--------~~r--~nl~~sni~~i~P~iYar~P~ 79 (663) T protein:vir:34 16 WAQRWQEEMSAAREPLEKWHTQGKEIVKRYRDERDSA---HD---A--------ETR--WNLFSTNIQTQMASLYGQTPK 79 (663) T ss_pred HHHHHHHHHHHHHhccchHHHHHHHHHHHhhccccCC---Cc---c--------ccc--cchhhhhHHHHhhhhhcCCCc Confidence 433333 333344445555666666543110 00 0 011 133322233333334444443 Q ss_pred ec------cCch----HHHHHHHHHh-------ccCHHHHHHHHHHHHHhcCeEEEEEEEC--------------CC-C- Q lcl|NC_021326. 74 FK------HTDD----EVIKRIDEVL-------GNRFDDKLHSVLTGASNKGIEWLHPYLD--------------EE-G- 120 (445) Q Consensus 74 ~~------~~d~----~~~~~l~~~~-------~n~~~~~~~~~~~~~~~~G~~~~~v~~d--------------~~-g- 120 (445) .+ .-+. .+.+.+...+ ++++...+....++++.+|++.+.+.+- +. + T Consensus 80 p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Ye~~~~~~~~~~~~~D~~~~~ 159 (663) T protein:vir:34 80 VSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRYEVEWEEVAGVDAILDEATGA 159 (663) T ss_pred ceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEeecccchhccccccCCCcccc Confidence 32 2222 2333343322 1346677777888999999887777651 11 0 Q ss_pred ---------------cEEEEEEccceeEEEEcCCCCC-ceEEEE-EEEee-----------------------------e Q lcl|NC_021326. 121 ---------------EFKLFRVPAEQGIPIWTDKEHE-ELEAFI-RMYKL-----------------------------E 154 (445) Q Consensus 121 ---------------~~~i~~~~p~~~~~v~d~~~~~-~~~~~v-~~~~~-----------------------------~ 154 (445) .++|..+.=.++ ++++.... +..++. |.|-+ . T Consensus 160 ~~a~~~~~~e~~a~E~v~id~v~~~df--l~~pAr~W~ev~wva~r~~mtk~e~~~rf~~~~~~~~~a~~~~~~~~~~~~ 237 (663) T protein:vir:34 160 ELAAAVPPTQRKAYECVETDYLHWQDV--LWSPARVWHEVRWLAFRNLLDMREFNARFDADGSRNLWASVPKVGKPKDGK 237 (663) T ss_pred chhcccccchhhcccceeeeeechhhc--ccchhhccccccceeeeccCCHHHHHHhhcCChhhhhhhhccCcCCccccC Confidence 123332221111 11111110 011110 11100 0 Q ss_pred ---------cceeEEEEecceEEEEEE-ecceeeecccccccccccccccccccccceEEecC----CCCcCccHHHHHH Q lcl|NC_021326. 155 ---------NETKVEYWDKITVNYYVY-ENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN----NDLEISDIFMYKT 220 (445) Q Consensus 155 ---------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n----~~~g~s~~~~v~~ 220 (445) .....++|++.....|+. ++...... ..++..+.-.|==||...+++ +-...++|.-... T Consensus 238 ~~~~~~~~~~a~VwEIWdK~~~~V~w~~eg~~~~L~------~~~p~lgl~~ffPcPrpl~~~~~~ds~ipvpd~~~y~~ 311 (663) T protein:vir:34 238 DGQSCHPWDRAEVWEIWDKGGRKVDWYVEGYSAVLD------TQPDPLGLESFFPCPKPLLANWTTDKVVPRPDFVLAQD 311 (663) T ss_pred CCCCcchhcCcceeEEEecCCcEEEEEEcCcceecc------cCCCCCCCCCCCCCcccccceecCCCeecCCcHHHHHH Confidence 001256777665555443 33222222 122222212221245544433 2235788988999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchh-HHHhhhhCceeec------cCCCc----eeeEeccCCh---HH Q lcl|NC_021326. 221 LIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPE-FKRLLRYYGAIKV------SDNGG----VDTIQVEVPV---EN 286 (445) Q Consensus 221 lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~-~~~~~~~~~~~~~------~~~~~----~~~l~~~~~~---~~ 286 (445) +++.+|.+ ++..+.+...-.+-.+..+...+.... +.... .+..+-+ .+.++ +.++-.+.-. .. T Consensus 312 ~~~E~n~~-t~Rin~l~d~ikv~gvy~~~~g~~i~~~l~~a~-~n~lvpV~~~~~~~~~gg~~k~I~~~pi~~~~~aI~~ 389 (663) T protein:vir:34 312 LYKEIDLV-STRITLLERAIRVVGVYDKSSGLTIGRLLSEAA-QNDLIPVENWLTFADKGGLRGVVDWFPLEPVVAALTS 389 (663) T ss_pred HHHHHHHH-HHHHHHHHhhhhhceeeccccchhHHHHHHHhh-CCCceecchhhhhhhhcCccchhhcccchhHHHHHHH Confidence 99999966 555566666555544432222211111 11111 1112212 22233 2333222222 23 Q ss_pred HHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-------- Q lcl|NC_021326. 287 SKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG-------- 358 (445) Q Consensus 287 ~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~-------- 358 (445) +-..-..++..++++|++.++.=+..-.+.++.|-..+-+.+..++.+++.....+.+.++++..+++..+. T Consensus 390 l~~~r~qir~d~~qITGiaDi~Rga~~a~ETatAQ~IKsq~gS~RIqe~qdevqR~arDi~ql~AEIl~~~~~~etl~~m 469 (663) T protein:vir:34 390 LRDYRRELVDALHQVTGMADIMRGASDPRETAMAQGVKAKFGSIRLQRLQDEVARFASDIQRLKAEVIAEHYDVASILAQ 469 (663) T ss_pred HHHHHHHHHHHHHHHHhHHHHhhcccCcchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHH Confidence 334556788899999998877655555567888888888899999999999999999999999887653221 Q ss_pred -----------------------CcceEEEEeCCCCCCCHHHHHHHHHHH-hcc--CCh-------------HHHHHhC- Q lcl|NC_021326. 359 -----------------------EHKDVDISFNYNKVANTELQVQTAQQS-MGI--VSH-------------ETVLENH- 398 (445) Q Consensus 359 -----------------------~~~~i~v~f~~~~p~d~~~~~~~~~~~-~g~--~s~-------------et~l~~l- 398 (445) ....++|.=.-..-.|..+.-+..++. .++ +++ ..+.+++ T Consensus 470 ~~~elp~~~ei~~~~~~L~n~~~r~~~ldIe~dsT~~~D~~~eK~~~~E~l~~i~~~~qq~~pl~~q~p~~~p~l~Ellk 549 (663) T protein:vir:34 470 ANAEFTFDKELAPKAAELIKSRFSMYRVEVKPEAVSLQDFAALRNEKMEVLSGIASFMQGVAPLAQQVPGSAPFLLQMLK 549 (663) T ss_pred hcCCCCcccchhHHHHHHhcCCCcceeeeeccCCCCcCChHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHH Confidence 111233333334444554444333322 121 111 1112221 Q ss_pred ----CCC--CCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 399 ----PFV--EDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 399 ----~~~--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) +|- .+.+.-++++..-.++..++ .....+.++.. T Consensus 550 ~~~~~f~~~~qie~ai~~~~~~~e~aa~~-------------~~~~~pa~~~~ 589 (663) T protein:vir:34 550 WSVSGLRGSSTIEGVLDKAIAAAEEAQKQ-------------AAQQSPAPQQP 589 (663) T ss_pred HHhhcCChhhhHHHHHHHHHhhhHHHhhc-------------cCCCCcccchh Confidence 110 11111122222111111110 01111111111 No 251 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=80.89 E-value=0.091 Score=26.30 Aligned_cols=370 Identities=8% Similarity=0.003 Sum_probs=144.9 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCC-cccccccccccccccccccccccc--ccchHHHHHHHHHhhhhccCeee--- Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRP-DIVKEPKPVDATGAVDPLKPDDRM--ITNFHANLVDQKVSYIVGKPIAF--- 74 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~-~i~~~~~~~~~~~~~~~~~~~~ri--~~n~~~~iv~~~~~~l~g~~~~~--- 74 (445) +++++-.+- ... -.... .+.... ........... ....+ .++....+|+..++-+..-|+.+ T Consensus 3 ~~~~~~~~~----~~~------~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~ 70 (423) T protein:vir:81 3 FLQKLGLAP----SVV------ATPEPIELVGPI-FESLKLSTKNM-TVEQIWEDQPHLRTVTTFIARNVASLQLQAFER 70 (423) T ss_pred hhHhhcccc----ccc------cCcccccccccc-ccccccccchh-hHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEE Confidence 333331000 000 00000 000000 00000000000 00000 12344567777777766667654 Q ss_pred ccCc--hHH-HHHHHHHhc--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEE---EcCCCCCc Q lcl|NC_021326. 75 KHTD--DEV-IKRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPI---WTDKEHEE 143 (445) Q Consensus 75 ~~~d--~~~-~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v---~d~~~~~~ 143 (445) +.+. +.. ...+..++. | ........+..+.+.+|.+|+++..+..+...+..+.|..+..+ ......+. T Consensus 71 ~~dg~~~~~~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~~~~~v~~~~~~~~~~~ 150 (423) T protein:vir:81 71 VEDGGRERVREGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDIRPIPVSWVQRRAYKDGWGS 150 (423) T ss_pred ecCCceeeeccchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEEeecccceeeeeeccCCCcc Confidence 1111 111 111222322 3 24556666778889999999998877543333333333222111 11000111 Q ss_pred eEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC----C-CCcCccHHHH Q lcl|NC_021326. 144 LEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN----N-DLEISDIFMY 218 (445) Q Consensus 144 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n----~-~~g~s~~~~v 218 (445) +.+.+ ... ....+... .+..=-|+|+++ . ..|.|.+..+ T Consensus 151 ~~Y~~--~~~-----------------~~~~g~~~-----------------~~~~~evih~r~~~~~~~~~G~spi~~~ 194 (423) T protein:vir:81 151 LDYII--IES-----------------GDNDGRSV-----------------KVPGERVIHRHGYNPKTMKRGKSPVQSL 194 (423) T ss_pred eEEEE--EEe-----------------cCCCceEE-----------------EEcccceEEecCCCCCCccccccHHHHH Confidence 11100 000 00000000 000001333332 1 2478877766 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCeeEEecC--------CcccchhHHHhhh---------hCceeeccCCCceeeEecc Q lcl|NC_021326. 219 KTLIDAYNRRLSDLSNTFKDSNELTYVLTNY--------DDQELPEFKRLLR---------YYGAIKVSDNGGVDTIQVE 281 (445) Q Consensus 219 ~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~--------~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~l~~~ 281 (445) ...++.......-....+...+.|-.++.-. +.+........+. .++++.++++.+.+.++.. T Consensus 195 ~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~s 274 (423) T protein:vir:81 195 RDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMANLRASFSPKSSDVGGTLLLEDGMKAENFHTT 274 (423) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHHHHHHhccccccCCcceecCCCceEEeccCC Confidence 6666655554444455556666776666421 1111111222111 1235556666555555443 Q ss_pred CChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----c Q lcl|NC_021326. 282 VPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD-----I 356 (445) Q Consensus 282 ~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~-----~ 356 (445) .....+.+..+.....|+..-++|....+...+ .+...++.....+ +...|.-.+..+...+. . T Consensus 275 ~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~-~t~sn~e~~~~~f----------~~~~L~P~~~~ie~~l~~~L~~~ 343 (423) T protein:vir:81 275 SKDEQTVETTKLSLQTVAQVYGINPTMVGQLDN-ANYSNVREFRKAL----------YGDNLGSWIRIIQDVMNLFLLPR 343 (423) T ss_pred hhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCC-CCcccHHHHHHHH----------HHHHHHHHHHHHHHHHhhhhcCc Confidence 333345555566667788888888654432221 1111122111111 11122222222221111 1 Q ss_pred ---CCCcceEEEEeCCCCCCCHHHHHHHHHHH---hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCC Q lcl|NC_021326. 357 ---KGEHKDVDISFNYNKVANTELQVQTAQQS---MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGA 430 (445) Q Consensus 357 ---~~~~~~i~v~f~~~~p~d~~~~~~~~~~~---~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~ 430 (445) ......+++.+..-+..|..+.++++.++ .|+++.-.+++.++.-.. +..+.+ .....- . T Consensus 344 ~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R~~~gl~p~--~gGD~~----------~~p~n~--~ 409 (423) T protein:vir:81 344 VGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVRAMDNLPSI--DGGDDL----------ARPLNT--E 409 (423) T ss_pred cccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHHHHhCCCCC--CCccee----------eccccc--c Confidence 11122234444555667888888877664 489998888888754221 111000 001000 0 Q ss_pred CCCCCCCCCCCcCCC Q lcl|NC_021326. 431 DGAQQKERSNDKQSE 445 (445) Q Consensus 431 ~~~~~~~~~~d~~~~ 445 (445) . .+..+.++++.| T Consensus 410 ~--~~~~~~~~~~~~ 422 (423) T protein:vir:81 410 F--GDSEDAPGEEVE 422 (423) T ss_pred c--CccCCCCCCCCC Confidence 0 011112222223 No 252 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=80.59 E-value=0.093 Score=26.23 Aligned_cols=342 Identities=10% Similarity=0.036 Sum_probs=133.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) ++.++..+.+.. ...|.+. .. .......-+.......+|+..++-+..-|+.+--.+.. T Consensus 3 ~f~~~f~~~~~~-------~~~~~~~--~~------------~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~ 61 (385) T protein:vir:95 3 LFDSVFKRHSEL-------SWMYDLE--FL------------QDKSKKAYLKQIALNTVVEMVARTISQSEFRVMKNNTK 61 (385) T ss_pred hhhhhhccCccc-------ccccchh--hh------------hccchhhhhhhHHHHHHHHHHHHHHcccceeeeecCcc Confidence 333332221110 0000000 00 00000001122334556777777666667665322222 Q ss_pred HHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcEE--EEEEccceeEEEEcCCCCCceEEEEEEEe Q lcl|NC_021326. 81 VIKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFK--LFRVPAEQGIPIWTDKEHEELEAFIRMYK 152 (445) Q Consensus 81 ~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~--i~~~~p~~~~~v~d~~~~~~~~~~v~~~~ 152 (445) ....+...+. | .-......+..+.+.+|.+|++... +|... ..++.|... .++.+ . ++. T Consensus 62 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~--~~~~~~~~~~~~~~~~-~~~~~------~----~~~ 128 (385) T protein:vir:95 62 EKGTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKND--EGHFFVADDFEKEDEL-GLYSH------R----FTN 128 (385) T ss_pred ccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEEec--CCCeeecccccccccc-ccccc------c----cee Confidence 2222222221 2 2355666778889999999976543 33321 111111110 00000 0 000 Q ss_pred eecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCC-----CCcCccHHHHHHHHHHHHH Q lcl|NC_021326. 153 LENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNN-----DLEISDIFMYKTLIDAYNR 227 (445) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~-----~~g~s~~~~v~~lid~~~~ 227 (445) . .+ ....+. ...+.. -|++++.. ..|.|.+......+ . T Consensus 129 ~------~~-~~~~~~------------------------~~~~~~--eiih~~~~~~~~~~~G~s~~~~~~~~i---~- 171 (385) T protein:vir:95 129 V------LV-NDFEFK------------------------RVFTMD--DVIYLKYNNQKLDAFSLGLFEDYGEIF---G- 171 (385) T ss_pred e------ee-ccccee------------------------eeeccc--cEEEecCCCCCcccccchHHHHHHHHH---H- Confidence 0 00 000000 000111 13444321 23555444333332 2 Q ss_pred HHHHHHHHHHHhcCC--eeEEecCC---cccchhHHHhh---------hhCceeeccCCCceeeEeccC------ChHHH Q lcl|NC_021326. 228 RLSDLSNTFKDSNEL--TYVLTNYD---DQELPEFKRLL---------RYYGAIKVSDNGGVDTIQVEV------PVENS 287 (445) Q Consensus 228 ~~s~~~~~~~~~~~~--~l~~~g~~---~~~~~~~~~~~---------~~~~~~~~~~~~~~~~l~~~~------~~~~~ 287 (445) .......+...| ++++.+.. .+.....+..+ ...+++.++++.+.+.++... ....+ T Consensus 172 ---~~~~~~~~~~~~~g~l~~~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~ 248 (385) T protein:vir:95 172 ---RMIDLQMLNNQIRGILKVDATKFYNKEKQKELQAYIDTLFDAFQNNTIAVVPLTEGLAYEEHSNRGAAQSAQQFSEL 248 (385) T ss_pred ---HHHHHHHhcCCCceEEEeCCccCCCHHHHHHHHHHHHHHhhhhhhcCCceEEcCCCceeEeecccccccCCHHHHHH Confidence 222233333333 33333321 11111112111 122245566666666554321 23456 Q ss_pred HHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC----C--Ccc Q lcl|NC_021326. 288 KKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK----G--EHK 361 (445) Q Consensus 288 ~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~----~--~~~ 361 (445) .+..+.....|+..-++|.....+..++.+. .....+...|.-++..+...+..+ . ... T Consensus 249 ~e~~~~~~~~Ia~~fgVpp~~l~~~~sn~e~---------------~~~~~~~~~l~P~~~~ie~~l~~~L~~~~~~~~~ 313 (385) T protein:vir:95 249 NELKKTVLTDVARMIGVPPSLVLGEMADLEK---------------TIESYLQFCINPLLRKIEAELNSKFFYQDEYLND 313 (385) T ss_pred HHHHHHHHHHHHHHhCCCHHHhcCCCcCHHH---------------HHHHHHHHHHHHHHHHHHHHHHhhcCChhhcccc Confidence 6667777778888888876443221111111 112223333333333333322211 1 112 Q ss_pred eEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCC Q lcl|NC_021326. 362 DVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERS 439 (445) Q Consensus 362 ~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~ 439 (445) .+++.+...+..|..+.++++.++ .|+++.-.++++++.-.-..+.-++.- ... +... -+..++++. T Consensus 314 ~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~~~gd~~~------~~~--n~~~---~~~~kgge~ 382 (385) T protein:vir:95 314 DMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMTGEEPADDPELDKFI------ITK--NLQS---ADAFKGGES 382 (385) T ss_pred eEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceee------ecc--ccee---cccccCCCC Confidence 455566677778899999998887 489999999988855210000000000 000 0000 000111111 Q ss_pred CCc Q lcl|NC_021326. 440 NDK 442 (445) Q Consensus 440 ~d~ 442 (445) +++ T Consensus 383 ~~e 385 (385) T protein:vir:95 383 NEE 385 (385) T ss_pred CCC Confidence 111 No 253 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=78.51 E-value=0.11 Score=25.76 Aligned_cols=380 Identities=16% Similarity=0.129 Sum_probs=162.3 Q ss_pred ChHHHHH-------HHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhh-hccCe Q lcl|NC_021326. 1 MIVRYIK-------QHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYI-VGKPI 72 (445) Q Consensus 1 ~l~~~i~-------~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l-~g~~~ 72 (445) +-..+.. .-.+.+.+|..+..+++-+.. ...||+..+-+= ...|+ T Consensus 57 ~~q~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd~A---------------------------v~eIVneaiv~d~~~~pV 109 (523) T protein:vir:68 57 MFQRMFGSQEPGLKSTRELIDTYRNLMTNYEVDNA---------------------------VSEIVSDAIVYEDDTEVV 109 (523) T ss_pred hhhhhhhccccccchHHHHHHHHHHHhhccchhhH---------------------------HHHhhcceeeecCCCceE Confidence 2222321 112333444444433333221 222444333222 34566 Q ss_pred eeccCchHHHHH--------HHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC----CcEEEEEEccceeEEEEcCC Q lcl|NC_021326. 73 AFKHTDDEVIKR--------IDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE----GEFKLFRVPAEQGIPIWTDK 139 (445) Q Consensus 73 ~~~~~d~~~~~~--------l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~----g~~~i~~~~p~~~~~v~d~~ 139 (445) .+..++-++.+. .+.++. =+|+.+.++..+...+.|+.|+...+|.+ |-..+..+||+.+-.|.. T Consensus 110 ~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~-- 187 (523) T protein:vir:68 110 SINLDNTKFSPNIKSMMLDEFNEVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVRE-- 187 (523) T ss_pred EEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhheeeeEEEEEEEeeCCCccccceeeeeeCCcceeEEEe-- Confidence 665544333322 222221 16788888999999999999999988743 666788899987644322 Q ss_pred CCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc---eEEecCCC---CcCc Q lcl|NC_021326. 140 EHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIPFKNND---LEIS 213 (445) Q Consensus 140 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~n~~---~g~s 213 (445) ...+...++.+... ...+|.+..........+... .++.--+|| |++....- .+.- T Consensus 188 -----------i~~~~~~g~~vi~~-~~e~f~Y~~~~~~~~~~g~~~------~~~~~ikI~~dAI~y~hSGL~d~~~~~ 249 (523) T protein:vir:68 188 -----------VITTTEAGVKIVKG-YKEYFIYDTSHESYACDGRIY------EAGTKIKIPKAAIVYAHSGLVDCCGKN 249 (523) T ss_pred -----------ecCCCCcchhhhhh-hhhheeecccccccccccccc------CCCcceecchhheeeeeccceeCCCCc Confidence 11112222221111 111222222211111111000 001111333 23322111 1111 Q ss_pred cHHHHHHHHHHHHH--HHHHHHHHHHHhcCCeeEEecCCccc-----chhHHHhh----hhC------------------ Q lcl|NC_021326. 214 DIFMYKTLIDAYNR--RLSDLSNTFKDSNELTYVLTNYDDQE-----LPEFKRLL----RYY------------------ 264 (445) Q Consensus 214 ~~~~v~~lid~~~~--~~s~~~~~~~~~~~~~l~~~g~~~~~-----~~~~~~~~----~~~------------------ 264 (445) .+.-+...+..+|. ++-|.+...+..+.|-+=+.-.+..+ .+...+.+ +.. T Consensus 250 i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~ms 329 (523) T protein:vir:68 250 IIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKNRIAYDATTGKIKNQQHIMS 329 (523) T ss_pred eeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhhcceeEEeccCCeeccchhhhh Confidence 12223333444443 34555666666666643222111111 11111111 100 Q ss_pred --ceeecc--CCC-ceeeEeccC--ChHHHHHHHHHHHHHHHHHhCccc--ccccc--cc-CcchHHHHHHHHHHHHHHH Q lcl|NC_021326. 265 --GAIKVS--DNG-GVDTIQVEV--PVENSKKYLDELYQKIMLFGQAVD--FSSDK--FG-SAPSGVALEFLYTNLNLKA 332 (445) Q Consensus 265 --~~~~~~--~~~-~~~~l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~--~~~~~--~~-~~~Sg~Ai~~~~~~l~~k~ 332 (445) .-+++| +|+ +.+.-|.++ +.... .-+.-+++.++..-.+|- +..+. +. |. |..|-..+.....-+ T Consensus 330 MlEDyWLpRReGgrgTEItTLpGgqnlgem-~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr--~~EItRDEikF~KFI 406 (523) T protein:vir:68 330 MTEDYWLQRRDGKAVTEVDTLPGADNTGNM-EDVRWFRNALYMALRIPITRIPSDQGGIQFDA--GTSITRDELSFGKFI 406 (523) T ss_pred hHhhhcccccCCCcccceeeccccCCcChH-HHHHHHHHHHHHHhCCcceeecCCCcceeccc--ccchhHHHHHHHHHH Confidence 011232 122 223333332 33333 336667777777777763 21111 11 11 224444444455666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccC--CCc----ceEEEEeCCCCCCCHHHHHHH-------HHHHh---c-cCChHHHH Q lcl|NC_021326. 333 DKLARKAKVAIQELLWFVFEHFDIK--GEH----KDVDISFNYNKVANTELQVQT-------AQQSM---G-IVSHETVL 395 (445) Q Consensus 333 ~~~~~~~~~~l~~~~~~~~~~~~~~--~~~----~~i~v~f~~~~p~d~~~~~~~-------~~~~~---g-~~s~et~l 395 (445) .+.+..|...+.++|+.=+-+-|+- .++ ..|.+.|...-.-.+...++. +..+. | .+|+++++ T Consensus 407 ~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~ 486 (523) T protein:vir:68 407 RELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMAEPFIGKYISHRTAM 486 (523) T ss_pred HHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHH Confidence 7778888888888877544443432 222 457788865444334444433 33332 4 47999999 Q ss_pred HhCCCCCC--HHHHHHHHHHHHHHHHHhhhccccCCCCC Q lcl|NC_021326. 396 ENHPFVED--LQAELERIEQEQMEYNKQLPNLDDGGADG 432 (445) Q Consensus 396 ~~l~~~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 432 (445) +.+--.+| ..++-+.|++|..+- ..++......+. T Consensus 487 k~ILr~tDeei~~~~kqI~~E~k~~--~~~~p~~e~~~f 523 (523) T protein:vir:68 487 KDILQMSDEEIEQEAKQIEEESKEA--RFQDPDQEQEDF 523 (523) T ss_pred HHHhccCHHHHHHHHHHHHHHhhcC--CCCCCchhhhcC Confidence 87644443 233444444444321 111100000000 No 254 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=76.56 E-value=0.13 Score=25.37 Aligned_cols=382 Identities=12% Similarity=0.100 Sum_probs=162.2 Q ss_pred ChHH-------HHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhh-hhccCe Q lcl|NC_021326. 1 MIVR-------YIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY-IVGKPI 72 (445) Q Consensus 1 ~l~~-------~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~-l~g~~~ 72 (445) +... -++.-.+.+.+|..+..+++-+.. ...||+..+-+ -..+|+ T Consensus 59 ~~~~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd~A---------------------------v~eIVneaIv~~~~~~pV 111 (524) T protein:vir:98 59 VFQQFYSGQDPAIQNKEQLINTYRGIMSYPEVENA---------------------------VSEIIDDAIVNEQGKDII 111 (524) T ss_pred eeeeeccccccccchHHHHHHHHHHHhhccchhhH---------------------------HHhhhcceeEecCCCceE Confidence 1111 112223344455555444443322 12233333222 134566 Q ss_pred eeccCchHHHHH--------HHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC---CcEEEEEEccceeEEEEcCCC Q lcl|NC_021326. 73 AFKHTDDEVIKR--------IDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE---GEFKLFRVPAEQGIPIWTDKE 140 (445) Q Consensus 73 ~~~~~d~~~~~~--------l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~---g~~~i~~~~p~~~~~v~d~~~ 140 (445) .+..++-++.+. .+.++. =+|+.+.++..+...+.|+.|+..-+|++ |-..+..+||+.+-.|..-- T Consensus 112 ~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~~~~kGI~ELr~lDPr~i~~vr~~~- 190 (524) T protein:vir:98 112 TMDLAKTNFSKAIQDKIVEEFDNVLNIYDFDNMGARLFRDWYVDSRIYFHKIMHKDESKGIRELRQLDPRCMELIRESI- 190 (524) T ss_pred EEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEEEEcCCCCcceeeeeeeCCccceeeeecc- Confidence 665544332222 222221 26788888999999999999999877643 44568889998875553211 Q ss_pred CCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc---eEEecCCC--CcCccH Q lcl|NC_021326. 141 HEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIPFKNND--LEISDI 215 (445) Q Consensus 141 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~n~~--~g~s~~ 215 (445) ...+..+++.+ . ....+|.+..........+.. ..++.--+|| |++....- .+...+ T Consensus 191 ~~~~~~~~~v~-----------~-~~~e~f~Y~~~~~~~~~~g~~------~~~~~~ikI~~dAIvy~hSGL~d~~~~ii 252 (524) T protein:vir:98 191 TETLDGGVKVF-----------R-GYREFFVYSAPKAGYTYNGQI------YQANQKIKIPRSAIVYAHSGLEDCSNNII 252 (524) T ss_pred ccccccchhhc-----------c-ceeeeeeeccCCCccccccce------ecCCCceeechhheeeeccCcccCCCCee Confidence 11111111111 0 011111111111100000000 0011111333 33332211 111122 Q ss_pred HHHHHHHHHHHH--HHHHHHHHHHHhcCCeeEEecCCcccch-----hHHHhh-hhCc---------------------- Q lcl|NC_021326. 216 FMYKTLIDAYNR--RLSDLSNTFKDSNELTYVLTNYDDQELP-----EFKRLL-RYYG---------------------- 265 (445) Q Consensus 216 ~~v~~lid~~~~--~~s~~~~~~~~~~~~~l~~~g~~~~~~~-----~~~~~~-~~~~---------------------- 265 (445) .-+...+..+|. ++-|.+...+..+.|-+=+.-.+..+.+ ...+.+ ...+ T Consensus 253 syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGevrddrk~msMl 332 (524) T protein:vir:98 253 GYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKNRVVYDARTGTVKNQQNNLSMT 332 (524) T ss_pred eehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccCceeeccccccchh Confidence 223344444443 3456666666666664332222221111 111111 0000 Q ss_pred -eeecc--CCC-ceeeEeccC--ChHHHHHHHHHHHHHHHHHhCccc--cc--ccccc-CcchHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 266 -AIKVS--DNG-GVDTIQVEV--PVENSKKYLDELYQKIMLFGQAVD--FS--SDKFG-SAPSGVALEFLYTNLNLKADK 334 (445) Q Consensus 266 -~~~~~--~~~-~~~~l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~--~~--~~~~~-~~~Sg~Ai~~~~~~l~~k~~~ 334 (445) -+++| +|+ +.+.-|.++ +... ..-+.-+++.++..-.+|- +. .+++. |. |..|-..+.....-+.+ T Consensus 333 EDyWLpRReGgrgTEItTLpggqnlge-m~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr--~~EItRDEiKF~KFI~r 409 (524) T protein:vir:98 333 EDYWLMRRDGKAITEVSTLPGGQNFSD-MDDIKWFNRKLYEALRVPLSRMPRDDGGMQIGG--GGEITRDELKFSKFIRT 409 (524) T ss_pred hhhcccccCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCceeccCCCCcccccc--ccchhHHHHHHHHHHHH Confidence 01222 122 223333332 3333 3336667777777777763 21 11111 11 22343344445556677 Q ss_pred HHHHHHHHHHHHHHHHHHHhccC--CCc----ceEEEEeCCCCCCCHHHHHH-------HHHHHh---c-cCChHHHHHh Q lcl|NC_021326. 335 LARKAKVAIQELLWFVFEHFDIK--GEH----KDVDISFNYNKVANTELQVQ-------TAQQSM---G-IVSHETVLEN 397 (445) Q Consensus 335 ~~~~~~~~l~~~~~~~~~~~~~~--~~~----~~i~v~f~~~~p~d~~~~~~-------~~~~~~---g-~~s~et~l~~ 397 (445) .+..|...+.++|+.=+-+-|+- .++ ..|.+.|...-.-.+...++ ++..+. | .+|++++++. T Consensus 410 LR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ 489 (524) T protein:vir:98 410 LQIQFSPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGVVGKYVSHKYIMKE 489 (524) T ss_pred HHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhccccccccchHHHHHH Confidence 78888888887776544333332 222 35778886544433444443 333332 4 6999999987 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 398 HPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 398 l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) +--.+| +|++..+++.++..+. +.+.+ ++++++| T Consensus 490 ILr~tD--eei~~~~k~I~~E~k~-~~~~~-----------p~~e~~~ 523 (524) T protein:vir:98 490 ILRMSD--EDIDEQAKLIEEESKE-ERFKN-----------PEAEEEN 523 (524) T ss_pred HhccCH--HHHHHHHHHHHHHHhC-CCCcC-----------Ccccccc Confidence 655444 4444333333332221 11110 1111111 No 255 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=73.68 E-value=0.17 Score=24.84 Aligned_cols=356 Identities=10% Similarity=0.043 Sum_probs=142.9 Q ss_pred ChHHHHHHHH-HHHHHHHHHHHHhcCCCcccccccccc--cccccc--------------cccccc----c--cccchHH Q lcl|NC_021326. 1 MIVRYIKQHL-EKLPEISIGQEYYEQRPDIVKEPKPVD--ATGAVD--------------PLKPDD----R--MITNFHA 57 (445) Q Consensus 1 ~l~~~i~~~~-~~~~~~~~~~~yy~G~~~i~~~~~~~~--~~~~~~--------------~~~~~~----r--i~~n~~~ 57 (445) ....+-.--. -..+.-.-..+|.-|+.++........ ...... ...+.. + +.++-.. T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t~~~~~~~~~v~ 82 (409) T protein:vir:83 3 FWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQDKLRTLIDVAW 82 (409) T ss_pred hhhhhcccccCCCcccccccccccCCCCceeeccCCCcchhhhhcccccccccccccccccccCccccchhhHhhhHHHH Confidence 0000000000 000000011233334333222211110 000000 000000 0 1122333 Q ss_pred HHHHHHHhhhhccCeeeccCchHHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEE-EEECCCCcE-EEEEEcc Q lcl|NC_021326. 58 NLVDQKVSYIVGKPIAFKHTDDEVIKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLH-PYLDEEGEF-KLFRVPA 129 (445) Q Consensus 58 ~iv~~~~~~l~g~~~~~~~~d~~~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~-v~~d~~g~~-~i~~~~p 129 (445) .+|+..++-+-+-|+.+--..... +.+...+. | ........+..+.+. |.+|+. +..+.+|.+ .+..++| T Consensus 83 acV~~Ia~~iA~lpl~~~~~~~~~-~~~~~ll~~~PN~~~t~~~f~~~l~~~lll-Gnay~~~i~r~~~G~~~~L~pl~p 160 (409) T protein:vir:83 83 ACIDLNASVLSSMPIYRMRNGRII-DSVAWMSNPDPEVYTSWQEFAKQLFWDFQL-GEAFVLPMAHGSDGYPIRFRVVPP 160 (409) T ss_pred HHHHHHHHhhccCceEEeeCCccc-cchhhhcccCCCCCCCHHHHHHHHHHHHhh-CCcEEEEEEECCCCcEEEEEEECC Confidence 456666666655566442211111 11111221 2 223344445555444 888865 567888876 4788899 Q ss_pred ceeEEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-- Q lcl|NC_021326. 130 EQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-- 207 (445) Q Consensus 130 ~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-- 207 (445) ..+.+..+.+ +.+ +|.... ... + -+|+|++. T Consensus 161 ~~v~v~~~~~--g~~-----~y~~~~-----------------~~~------------------~-----~eiiHir~~~ 193 (409) T protein:vir:83 161 WLVNVELKKG--ARR-----EYRIGG-----------------LNV------------------T-----DEILHIRYQG 193 (409) T ss_pred cceEEEEcCC--ceE-----EEEEcc-----------------ccC------------------c-----cceEEeCCCC Confidence 8877655542 111 111100 000 0 02444431 Q ss_pred ---CCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCC---cccchhHHHhhh------hCceeeccCCCce Q lcl|NC_021326. 208 ---NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLLR------YYGAIKVSDNGGV 275 (445) Q Consensus 208 ---~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~~------~~~~~~~~~~~~~ 275 (445) ...|.|.++.....++..+.......+.+...+.|-.++.-.. .+.....+..+. .++.+.+.++.+. T Consensus 194 ~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~nga~p~gil~~~~~ls~e~~~~~~~~~~~~~~~nag~~~il~~g~~~ 273 (409) T protein:vir:83 194 NTADAHGHGPLESAAPRQVVIGLLQKYVQNLAETGGVPLYWLGVERRLSETEAVDLMDRWIESRSKYAGHPALVTGGATL 273 (409) T ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEeecCCCCCHHHHHHHHHHHHHhhCCccCccceecCCccc Confidence 2357788877777777666554444555566677766654322 222222222221 1223444444432 Q ss_pred -eeEeccCChHHHHHHHHHHHHHHHHHhCccccccccccCcc--hHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 276 -DTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAP--SGVALEFLYTNLNL-KADKLARKAKVAIQELLWFVF 351 (445) Q Consensus 276 -~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~--Sg~Ai~~~~~~l~~-k~~~~~~~~~~~l~~~~~~~~ 351 (445) +.+........+.+..+.....|...-++|....+..+... +...++........ .+.-....+...+.+ . T Consensus 274 ~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp~llg~~~~~~~~tysn~eq~~~~f~~~tL~P~~~~ie~~l~~---~-- 348 (409) T protein:vir:83 274 NQAKSMSAQDLSLMELTQFNEARIAILLGVPPFLVGLPGATGSLTYSNIEQLFSFHDRSSLRPKATAVMAALDR---W-- 348 (409) T ss_pred ccccCCCHHHHHHHHHHHhhHHHHHHHhCCCHHHccCCCCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHH---h-- Confidence 23322222233444455556677777788865443211111 10112221111111 111111112222211 1 Q ss_pred HHhccCCCcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCC Q lcl|NC_021326. 352 EHFDIKGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGG 429 (445) Q Consensus 352 ~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~ 429 (445) ++. ....+++.+..-+-.|.++.++...++ .|+++.-.++++++.-.. + --.+...++ T Consensus 349 -Ll~---~~~~~~f~~~~llr~d~~~r~~~~~~~~~~G~lT~NE~R~~~glpp~--~--------------ggd~l~~~g 408 (409) T protein:vir:83 349 -ALP---SPQHLELNRDDYTRPSLVERATAYKIMIEAGVMEPNEARAMERLHSE--A--------------AAVRLSGGG 408 (409) T ss_pred -hCC---CCcEEEeehhhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--C--------------CCcccCCCC Confidence 111 123455555666667888888887775 488998888776533100 0 000011111 Q ss_pred C Q lcl|NC_021326. 430 A 430 (445) Q Consensus 430 ~ 430 (445) . T Consensus 409 v 409 (409) T protein:vir:83 409 V 409 (409) T ss_pred C Confidence 1 No 256 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=72.97 E-value=0.18 Score=24.72 Aligned_cols=377 Identities=10% Similarity=0.080 Sum_probs=159.6 Q ss_pred ChHHHH------HHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhh-hccCee Q lcl|NC_021326. 1 MIVRYI------KQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYI-VGKPIA 73 (445) Q Consensus 1 ~l~~~i------~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l-~g~~~~ 73 (445) +...++ +.-.+-+.+|..+..+++-+.. ...||+..+-+= ...|+. T Consensus 56 ~~~~~~~~~~~~~n~~eLI~~YR~ma~~pEvd~A---------------------------v~eIvneaiv~d~~~~pV~ 108 (521) T protein:vir:10 56 IVQSVLGYAPKIQNTKDLINQYRSLSKYHEVDNA---------------------------IDEIINDAIVQEDNRDTVY 108 (521) T ss_pred hhhhhhccccccchHHHHHHHHHHHhhccchhhH---------------------------HHhhhcceEEecCCCceEE Confidence 222221 1112223344443333332221 222444333222 345666 Q ss_pred eccCchHHHHH----H----HHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECC----CCcEEEEEEccceeEEEEcCCC Q lcl|NC_021326. 74 FKHTDDEVIKR----I----DEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDE----EGEFKLFRVPAEQGIPIWTDKE 140 (445) Q Consensus 74 ~~~~d~~~~~~----l----~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~----~g~~~i~~~~p~~~~~v~d~~~ 140 (445) +..++-+..+. + +.++. =+|+.+.++..+...+.|+.|+..-+|. +|-..+..+||+.+-.+..- T Consensus 109 i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i-- 186 (521) T protein:vir:10 109 LDLDKTDWNESVKEMVREEFRTILKLLKFEREGKRHFRRWYVDSRIYFHKMIDPARPKDGIKELRLLDPRNVEYYRVN-- 186 (521) T ss_pred EEecCcccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeeEEEEEEeeCCCccccceeeeeeCCcceeeeeee-- Confidence 65544332222 2 22221 1678888899999999999999887763 35567888999887544321 Q ss_pred CCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc---eEEec------CCCCc Q lcl|NC_021326. 141 HEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIPFK------NNDLE 211 (445) Q Consensus 141 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~------n~~~g 211 (445) ..+....+.+...+ ..+|.+....... ....+ .+...-+|| |++.. |.... T Consensus 187 -----------~k~~~~~~~v~~~~-~e~f~Y~~~~~~~-~~~~g-------~~~~~vkI~~daI~y~hSGL~d~~~~~i 246 (521) T protein:vir:10 187 -----------LKSNENGNDVYKGV-KEFFTYGATEDNR-YNISG-------NSNNLVQIPIDAIVYSHSGKVDIDGKTI 246 (521) T ss_pred -----------cCCCCCcchhhccc-eeeeeeccCCCce-ecCCC-------CCCcceeechhheeeecccceeCCCCce Confidence 11111111111111 1122221110000 00000 011111234 22222 22233 Q ss_pred CccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCCcccc-----hhHHHhh------------------------h Q lcl|NC_021326. 212 ISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQEL-----PEFKRLL------------------------R 262 (445) Q Consensus 212 ~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~-----~~~~~~~------------------------~ 262 (445) .|-+...+.....|- ++-|.+...+..+.|-+=+.-.+..+. +...+.+ . T Consensus 247 ~syLhkAiKp~NQLk-m~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~ms 325 (521) T protein:vir:10 247 VGYLHNVIKPANQLK-MLEDAMVIYRITRAPERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYDSSTGKVKNSSNNLA 325 (521) T ss_pred eccchhhhHhHHhhH-HHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchhhhh Confidence 444544322222221 345555556666666432221111111 1111111 0 Q ss_pred hCceeecc--CCC-ceeeEeccC--ChHHHHHHHHHHHHHHHHHhCccc--ccccc--ccCcchHHHHHHHHHHHHHHHH Q lcl|NC_021326. 263 YYGAIKVS--DNG-GVDTIQVEV--PVENSKKYLDELYQKIMLFGQAVD--FSSDK--FGSAPSGVALEFLYTNLNLKAD 333 (445) Q Consensus 263 ~~~~~~~~--~~~-~~~~l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~--~~~~~--~~~~~Sg~Ai~~~~~~l~~k~~ 333 (445) .-.-+++| +|+ +.+.-|.++ +... ..-+.-+++.++..-.+|- +..++ +.-+-| ..|-..+.....-+. T Consensus 326 MlEDyWLpRReGgrgTEI~TLpggqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr~-~EItRDEikF~KFI~ 403 (521) T protein:vir:10 326 MTEDYWLMRRDGKATTEVSTLPGAQSMGE-MDDVRWFNRKLYESMKIPLSRLPQEGAGVTFGAG-NDITRDELQFTKYIR 403 (521) T ss_pred hHhhhcccccCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCccccCCCCCceecccc-cchhHHHHHHHHHHH Confidence 00012232 122 223333332 3333 3336666777777777763 21221 111111 124334444555667 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccC--CCc----ceEEEEeCCCCCCCHHHHHH-------HHHHHhc------cCChHHH Q lcl|NC_021326. 334 KLARKAKVAIQELLWFVFEHFDIK--GEH----KDVDISFNYNKVANTELQVQ-------TAQQSMG------IVSHETV 394 (445) Q Consensus 334 ~~~~~~~~~l~~~~~~~~~~~~~~--~~~----~~i~v~f~~~~p~d~~~~~~-------~~~~~~g------~~s~et~ 394 (445) +.+..|...+.++|+.=+-+-|+- .++ ..|.+.|...-.-.+...++ ++.++.+ .+|++++ T Consensus 404 rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGky~s~dyi 483 (521) T protein:vir:10 404 GLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGKYLSHEYV 483 (521) T ss_pred HHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccccccccchHHH Confidence 778888888888777544333432 222 45778885544333444333 3344433 6999999 Q ss_pred HHhCCCCCC--HHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 395 LENHPFVED--LQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 395 l~~l~~~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) ++.+--.+| ..++-+.|++|..+- ..++ .+++++| T Consensus 484 ~k~ILr~tDeeik~~~k~I~~E~~~~--~~~~--------------p~~e~~d 520 (521) T protein:vir:10 484 MKNILRMSDEDIKTEREKIDGELKDS--VYKN--------------PEDPMEE 520 (521) T ss_pred HHHHhcCCHhHHHHHHHHHHHhhhCC--CCCC--------------Ccchhhc Confidence 987644443 334444454444321 1111 0111111 No 257 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=72.79 E-value=0.18 Score=24.69 Aligned_cols=379 Identities=9% Similarity=-0.018 Sum_probs=153.9 Q ss_pred ChHHHHHHHHH---HHHHHHHHH-----------HHhcCCCc-cccccccccccccccccccccccccchHHHHHHHHHh Q lcl|NC_021326. 1 MIVRYIKQHLE---KLPEISIGQ-----------EYYEQRPD-IVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVS 65 (445) Q Consensus 1 ~l~~~i~~~~~---~~~~~~~~~-----------~yy~G~~~-i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~ 65 (445) |+..+-.+-.. -.++.+--. +.+.|-.+ .+..-. ..............-+.+.-...+|+..++ T Consensus 3 l~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~-~~~~~~g~~v~~~~al~~~~V~~ci~~Ia~ 81 (431) T protein:vir:10 3 LFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYI-RRGELNGGTGRETRALRNMAVLRCVTLISG 81 (431) T ss_pred chhhhhcCcccccccccccccccccccccccccccccccccchHHHHhh-ccCccCcceechhhhhccHHHHHHHHHHHH Confidence 33322111000 000000000 00000000 000000 000000000000000112223345666666 Q ss_pred hhhccCeee-ccCch---HHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCc-EEEEEEccceeEE Q lcl|NC_021326. 66 YIVGKPIAF-KHTDD---EVIKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGE-FKLFRVPAEQGIP 134 (445) Q Consensus 66 ~l~g~~~~~-~~~d~---~~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~-~~i~~~~p~~~~~ 134 (445) -+-.-|+.+ ..++. .....+...+. | .-..+...+..+.+.+|.+|+.+..+. |. +.+..++|..+.+ T Consensus 82 ~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g~~~~L~pl~~~~v~~ 160 (431) T protein:vir:10 82 TIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSG-NRPIRLIPMDRGSAKG 160 (431) T ss_pred hhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-CceEEEEEEcCceeEE Confidence 555566654 11111 11112222221 2 234556677888999999999998875 54 4567788888776 Q ss_pred EEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCC Q lcl|NC_021326. 135 IWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDL 210 (445) Q Consensus 135 v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~ 210 (445) ..++. +.+.+ ++ ........ .+... -|+|++ +... T Consensus 161 ~~~~~--~~~~y--~~-~~~~g~~~-~~~~~-----------------------------------dViHir~~~~dg~~ 199 (431) T protein:vir:10 161 RLTST--WQIVY--DY-TTPTGDKI-ELPAR-----------------------------------EVFHLRDLSIDGVS 199 (431) T ss_pred EEcCC--CeEEE--EE-EeCCceEE-EEchh-----------------------------------hEEEecCcCCCCcc Confidence 65532 22211 01 00000000 00000 022222 2335 Q ss_pred cCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecC---CcccchhHHHhhh--------hCceeeccCCCceeeEe Q lcl|NC_021326. 211 EISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNY---DDQELPEFKRLLR--------YYGAIKVSDNGGVDTIQ 279 (445) Q Consensus 211 g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~---~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~l~ 279 (445) |.|.+.-....+........-..+.+...+.|-.++.-. +.+.....+..+. .++++.++++.+.+.++ T Consensus 200 G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~ 279 (431) T protein:vir:10 200 GVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVPKELSDNAYGRMKASVQENHTGSENAGSWMLLEEGATAKQFS 279 (431) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEcc Confidence 777777666666555544444455556666776555432 2222222222221 12455666665555554 Q ss_pred ccCChHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-cCC Q lcl|NC_021326. 280 VEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFD-IKG 358 (445) Q Consensus 280 ~~~~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~-~~~ 358 (445) .......+.+..+.....|+..-++|....+... ..++..++.....+...+ -.-+-..|++.+... ++. ... T Consensus 280 ~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~-~~t~sn~eq~~~~f~~~t---L~P~~~~ie~~ln~~--Ll~~~~~ 353 (431) T protein:vir:10 280 NTAASAQQIENRNHQIEEVARMYGVPRPLLMMDD-TSWGSGIEQLAIFFIQYG---LSHWFVSWEQAAARA--FLPEKML 353 (431) T ss_pred CChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCC-CCccccHHHHHHHHHHHH---HHHHHHHHHHHHHhh--ccChhhc Confidence 4434445556666677788888888865443221 122222322222221111 111111121111110 111 011 Q ss_pred CcceEEEEeCCCCCCCHHHHHHHHHHHh------ccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHhhhccccCCC Q lcl|NC_021326. 359 EHKDVDISFNYNKVANTELQVQTAQQSM------GIVSHETVLENHPF--VEDLQAELERIEQEQMEYNKQLPNLDDGGA 430 (445) Q Consensus 359 ~~~~i~v~f~~~~p~d~~~~~~~~~~~~------g~~s~et~l~~l~~--~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~ 430 (445) ....+++.+...+..|..+.++.+.++. |+++.-.++++++. ++++.. ++. ..+......+ T Consensus 354 ~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~lT~NE~R~~~gl~p~~~~~g--D~~---------~~p~n~~~~~ 422 (431) T protein:vir:10 354 GQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPWMKQNEVREMLDLPRADDPVA--DQL---------RNPMTQKQKG 422 (431) T ss_pred CCceEEEechhhhccCHHHHHHHHHHHHhcccccCccCHHHHHHHhCCCCCCCccc--cce---------ecccccccCC Confidence 1233455555566778888888887763 35888888887754 333221 110 0011111111 Q ss_pred CCCCCCCCC Q lcl|NC_021326. 431 DGAQQKERS 439 (445) Q Consensus 431 ~~~~~~~~~ 439 (445) .+.+++... T Consensus 423 ~~~~~p~~~ 431 (431) T protein:vir:10 423 SGDEPPATT 431 (431) T ss_pred CCCCCCCCC Confidence 111122222 No 258 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=71.19 E-value=0.2 Score=24.43 Aligned_cols=329 Identities=13% Similarity=0.090 Sum_probs=121.2 Q ss_pred HHHhcCCCccccccccccccccccccccccccc--cchHHHHHHHHHhhhhccCeee-c--cCch-------HHHHHHHH Q lcl|NC_021326. 20 QEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMI--TNFHANLVDQKVSYIVGKPIAF-K--HTDD-------EVIKRIDE 87 (445) Q Consensus 20 ~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~--~n~~~~iv~~~~~~l~g~~~~~-~--~~d~-------~~~~~l~~ 87 (445) .-+|..... ..+... ..............+. ......+|+..++-+..-|+.+ . ..+. .....+.. T Consensus 1 Mg~f~~~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~~ 78 (378) T protein:vir:94 1 MNLFGKVVS-FSRGKL-NNDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLDE 78 (378) T ss_pred CCccccchh-cccccc-cCCcceeeeeccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccCcccccccccccchHHH Confidence 111111000 000000 0000000000101111 1234446666666666666653 1 1110 01112222 Q ss_pred Hhc---c---CHHHHHHHHHHHHHhcCeEEEEE-EECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEE Q lcl|NC_021326. 88 VLG---N---RFDDKLHSVLTGASNKGIEWLHP-YLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 160 (445) Q Consensus 88 ~~~---n---~~~~~~~~~~~~~~~~G~~~~~v-~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 160 (445) .+. | ....+...+....+.+|.+|+++ +.+..|++. .+.|... ..+ T Consensus 79 lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~--~l~p~~~-------------------------~~~ 131 (378) T protein:vir:94 79 VLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELL--DLLFADD-------------------------KKE 131 (378) T ss_pred HHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEE--EEEecCC-------------------------eeE Confidence 221 3 23456667788899999999864 444444432 1111000 000 Q ss_pred EEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021326. 161 YWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSN 240 (445) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~ 240 (445) +.. . -|+|+++.-.+.....++..+..+++..++. ... T Consensus 132 ---------~~~-------------------------~--diiH~~~~~~~~~g~s~l~~~~~~i~~~~~~------~~~ 169 (378) T protein:vir:94 132 ---------YKP-------------------------E--ELVRLTSPFYINEDTSILDNALASIQTKLEQ------GKL 169 (378) T ss_pred ---------eee-------------------------e--eeEEecCcCCccchhHHHHHHHHHHHHHHhc------ccc Confidence 000 0 0233332111111222333334444333221 011 Q ss_pred CCeeEEecCCcc-cchhHH----Hhh-------hhCceeeccCCCceeeEeccCChHHHHHHHHHHHHHHHHHhCccccc Q lcl|NC_021326. 241 ELTYVLTNYDDQ-ELPEFK----RLL-------RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFS 308 (445) Q Consensus 241 ~~~l~~~g~~~~-~~~~~~----~~~-------~~~~~~~~~~~~~~~~l~~~~~~~~~~~~i~~l~~~i~~~s~~p~~~ 308 (445) .-++...+...+ ...+.. ..+ ...+++.++++.+.+.++.......+ ...+.+.+.|+..-++|..- T Consensus 170 ~gil~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVP~~~ 248 (378) T protein:vir:94 170 RGLLKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENI 248 (378) T ss_pred cceeeeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceecCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHH Confidence 112333332211 111111 111 12235566665555544433333333 44566677788887887543 Q ss_pred cccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-------------CCcceEEEEeCCCCCCCH Q lcl|NC_021326. 309 SDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK-------------GEHKDVDISFNYNKVANT 375 (445) Q Consensus 309 ~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~-------------~~~~~i~v~f~~~~p~d~ 375 (445) .. +..|.. . ....+...|.-.+..+...+..+ ....++.+.+...+-.|. T Consensus 249 l~---~~~se~----~----------~~~f~~~tL~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d~ 311 (378) T protein:vir:94 249 LL---GTASQE----Q----------QIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATL 311 (378) T ss_pred hc---CChHHH----H----------HHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeecchhhhhcCH Confidence 31 111111 0 11233334444443333222110 111235555566777889 Q ss_pred HHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHhhhcccc-CCCCCCCCCCCCCCcC Q lcl|NC_021326. 376 ELQVQTAQQS--MGIVSHETVLENHPFV--EDLQAELERIEQEQMEYNKQLPNLDD-GGADGAQQKERSNDKQ 443 (445) Q Consensus 376 ~~~~~~~~~~--~g~~s~et~l~~l~~~--~d~~~E~~ri~~E~~~~~~~~~~~~~-~~~~~~~~~~~~~d~~ 443 (445) .+.++.+.++ .|+++.-.++++++.- ++-+.=+ +... ......... ..+.....+.++++.+ T Consensus 312 ~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD~~~--~~~n----~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 312 KELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVYI--ANLN----AVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeee--eccc----ccccccchhhcCCcCCCCCCCCCCCC Confidence 9999998876 4899998888877542 2211000 0000 000000000 0000000111111111 No 259 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=63.83 E-value=0.31 Score=23.37 Aligned_cols=358 Identities=8% Similarity=0.034 Sum_probs=138.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCcc-ccccccc---cccccccccccccccc-cchHHHHHHHHHhhhhccCeeec Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDI-VKEPKPV---DATGAVDPLKPDDRMI-TNFHANLVDQKVSYIVGKPIAFK 75 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i-~~~~~~~---~~~~~~~~~~~~~ri~-~n~~~~iv~~~~~~l~g~~~~~~ 75 (445) |.. +|.-+..- +..+... .............++. ++-...+|+..++-+..-|+.+- T Consensus 3 ~~~------------------~f~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~V~~~I~~ia~~iA~~p~~~~ 64 (403) T protein:vir:80 3 LFN------------------FFRRKTRSEPTNAISWFLTQEAYDTLAIPGYTRLSDNPEVRMAVHKIAELISSMTIHLM 64 (403) T ss_pred ccc------------------cccccccccccchhhhhcccccccccccchhhhhhhhHHHHHHHHHHHHhhhhCceEEE Confidence 221 12111000 0000000 0000000000111111 12233566777766666666641 Q ss_pred --cCch--HHHH-HHHHHhc--c---CHHHHHHHHHHHHHhc--CeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCC Q lcl|NC_021326. 76 --HTDD--EVIK-RIDEVLG--N---RFDDKLHSVLTGASNK--GIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHE 142 (445) Q Consensus 76 --~~d~--~~~~-~l~~~~~--n---~~~~~~~~~~~~~~~~--G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~ 142 (445) .++. .... ....+.. | .-......++.+.+.. |.+|+.+..+..|++ .+..++|..+.++.+++. . T Consensus 65 ~~~~~g~~~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~~~L~~l~p~~v~~~~~~~g-~ 143 (403) T protein:vir:80 65 QNTDNGDIRIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLIDELIPLAPSKVSFVDTDTG-Y 143 (403) T ss_pred EecCCceeecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcEEEEEEEcCCeeEEEEcCCc-e Confidence 1111 1111 1222221 3 2234455566676665 667777777777876 467788888766555421 0 Q ss_pred ceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec-----CC-CCcCccHH Q lcl|NC_021326. 143 ELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK-----NN-DLEISDIF 216 (445) Q Consensus 143 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~-~~g~s~~~ 216 (445) . + ++... .+ +.. .|+||. .+ ..|.|.+. T Consensus 144 ~----~-------------~y~~~--~~-------------------------~~~--eiih~~~~~~~~~~~~G~s~~~ 177 (403) T protein:vir:80 144 Q----I-------------WYQGK--AY-------------------------NYD--EVLHFIVNPDPEKPYMGRGYRV 177 (403) T ss_pred E----E-------------EEeec--cc-------------------------chh--hEEEEeccCCCcCccccccHHH Confidence 0 0 00000 00 001 123322 11 23667666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCeeEEecCC---cccchhHHHhh--------hhCceeeccCCC-ce-eeEeccCC Q lcl|NC_021326. 217 MYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYD---DQELPEFKRLL--------RYYGAIKVSDNG-GV-DTIQVEVP 283 (445) Q Consensus 217 ~v~~lid~~~~~~s~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~--------~~~~~~~~~~~~-~~-~~l~~~~~ 283 (445) .+...+.............+...+.|-.++.-.. .+..++....+ ..++.+.++.++ +. ++...... T Consensus 178 ~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~ 257 (403) T protein:vir:80 178 VLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFKKYLEASEAGQPWIIPAELLDVEQVKPLSLK 257 (403) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCChHHHHHHHHHHHHHHhhhhhcCCeeeecccccccceeccCCHH Confidence 5555555555443434455555666666653321 11112222111 122333343332 22 22111222 Q ss_pred hHHHHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-CCcce Q lcl|NC_021326. 284 VENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK-GEHKD 362 (445) Q Consensus 284 ~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~-~~~~~ 362 (445) ...+.+..+.....|+..-++|..-.+ .+...+..... .+...|.-++..+...+..+ ....+ T Consensus 258 d~q~~e~~~~~~~~Ia~~fgVPp~~lg-~~~~~~~~~~~---------------f~~~~l~P~~~~ie~~l~~kll~~~~ 321 (403) T protein:vir:80 258 DLAIHETVELDKRTVAGIFGVPAFLLG-VGKYDKDEYNN---------------FINSTILPIAKGIEQELTRKLLISPD 321 (403) T ss_pred HHHHHHHHHHhHHHHHHHhCCCHHHcC-CCCccHHHHHH---------------HHHHHHHHHHHHHHHHHHHhccCCCC Confidence 234445566666777777777754332 12222222111 11222332322222211110 00122 Q ss_pred EEEEe--CCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhcccc-CCCCCCCCCC Q lcl|NC_021326. 363 VDISF--NYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDD-GGADGAQQKE 437 (445) Q Consensus 363 i~v~f--~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~-~~~~~~~~~~ 437 (445) +.+.| ..-+..|..+.++.+.++ .|+++.-.++++++.-+.+.. +++-.. ..... +..... ...++++ ++ T Consensus 322 ~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~~gg--d~~~~~-~n~~p-l~~~~~~~~~k~ge-~~ 396 (403) T protein:vir:80 322 LYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPKEGL--SELVIL-ENYIP-LDKIGDQNKLKGGE-KG 396 (403) T ss_pred cEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC--CeEeec-ccccc-hhhccchhhccCCC-CC Confidence 34455 456677888899888876 589999999888754221100 000000 00000 000000 0001111 11 Q ss_pred CCCCcCC Q lcl|NC_021326. 438 RSNDKQS 444 (445) Q Consensus 438 ~~~d~~~ 444 (445) .++++.+ T Consensus 397 ~~~~~~~ 403 (403) T protein:vir:80 397 GADGQTD 403 (403) T ss_pred CCCCCCC Confidence 1111111 No 260 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=61.93 E-value=0.34 Score=23.12 Aligned_cols=356 Identities=10% Similarity=-0.006 Sum_probs=129.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) ++.++...+........ .............. ...-+...-...+|+..++-+..-|+.+.-.++. T Consensus 3 ~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~-~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~~~~~ 67 (395) T protein:vir:40 3 FKSWVSGFFNEEQRTLN--------------LTDTVWCSIPSEKL-KELSIKKWAIDSCANKIANTLSCAEVLTYEKGEE 67 (395) T ss_pred hHHHHHhhhcccccccc--------------cccchhhccccccc-hhhhhhhHHHHHHHHHHHHHHhhCceeeccCCcc Confidence 44444333322111100 00000000000000 0001112223345555555555556665433333 Q ss_pred HHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEeee Q lcl|NC_021326. 81 VIKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYKLE 154 (445) Q Consensus 81 ~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~ 154 (445) ....+...+. | ........+..+.+.+|.+|+++..+. +. .|..+...... ..-.++... T Consensus 68 ~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~---~~----~~~~~~~~~~~------~~~~~~~~v- 133 (395) T protein:vir:40 68 VRKKNWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDEY---IY----VADSFTKNDKS------LYENTYTEV- 133 (395) T ss_pred ccchHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecCc---ee----ecCCccccccc------cccceeeee- Confidence 2232333332 3 234555667888899999997764432 11 11111000000 000000000 Q ss_pred cceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-CCCcCccHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 155 NETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-NDLEISDIFMYKTLIDAYNRRLSDLS 233 (445) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-~~~g~s~~~~v~~lid~~~~~~s~~~ 233 (445) .+..+... ...+... |+|++. +..+.+... .+...+....+... T Consensus 134 -----------~~~~~~~~-------------------~~~~~~e--vih~r~~~~~~~~~~~---~l~~~~~~~~~~~~ 178 (395) T protein:vir:40 134 -----------TLKDLTLK-------------------KEFKESE--VLHLTLNNESIKSIID---GFYLLYGDLLTAAV 178 (395) T ss_pred -----------eecCceee-------------------eeecccc--EEEeecCCCCccccch---hHHHHHHHHHHHHH Confidence 00000000 0000011 344432 222222222 22223333333333 Q ss_pred HHHHHh--cCCeeEEecCCc---ccchhHHHhh---------hhCceeeccCCCceeeEeccCChHHHHHH---HHHHHH Q lcl|NC_021326. 234 NTFKDS--NELTYVLTNYDD---QELPEFKRLL---------RYYGAIKVSDNGGVDTIQVEVPVENSKKY---LDELYQ 296 (445) Q Consensus 234 ~~~~~~--~~~~l~~~g~~~---~~~~~~~~~~---------~~~~~~~~~~~~~~~~l~~~~~~~~~~~~---i~~l~~ 296 (445) +...+. ..+.+++..... +.....+..+ ...+++.++++.+.+.+..+.....+.+. .+.+.+ T Consensus 179 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~ 258 (395) T protein:vir:40 179 NKYKKLNSRKIIVKLKAMFGQTPEAEEKLRLMLSERMKKFLAEGDSALPVEDGMEIDELAGDSKIAESRDIKKMIDDVFE 258 (395) T ss_pred HHHHhcCCCCceEEEecccCCCHHHHHHHHHHHHHHHHHhhccCCceeecCCCceEEeccCChhhhhHHHHHHHHHHHHH Confidence 333332 334455433221 1111111111 12224555666555544433333333332 233345 Q ss_pred HHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-----CC--CcceEEEEeCC Q lcl|NC_021326. 297 KIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI-----KG--EHKDVDISFNY 369 (445) Q Consensus 297 ~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~-----~~--~~~~i~v~f~~ 369 (445) .|...-++|........++.+... ...+...|.-++..+...+.. .. ....+++.+.. T Consensus 259 ~Ia~~fgVPp~~l~~~~sn~e~~~---------------~~f~~~~L~P~~~~ie~~l~~kLl~~~~~~~g~~i~fd~~~ 323 (395) T protein:vir:40 259 MVANSFNIPLGLAKGDTVGLSEQV---------------NSFLMFSINPIAEMFTDEGNRKFYGRDSVLERTYMKLDTTR 323 (395) T ss_pred HHHHHhCCCHHHhcCCCcCHHHHH---------------HHHHHHHHHHHHHHHHHHHHHhcCChhhhcCCceEEEechh Confidence 566666777544322112211111 222333344344333332221 11 12345566667 Q ss_pred CCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHhhhccccCC-CCCCCCCCCCCCcCC Q lcl|NC_021326. 370 NKVANTELQVQTAQQS--MGIVSHETVLENHPF--VEDLQAELERIEQEQMEYNKQLPNLDDGG-ADGAQQKERSNDKQS 444 (445) Q Consensus 370 ~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~--~~d~~~E~~ri~~E~~~~~~~~~~~~~~~-~~~~~~~~~~~d~~~ 444 (445) .+..|..+.++.+.++ .|+++.-.++++++. ++++.. ++.. .. .+....+ .......++.++.++ T Consensus 324 ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~~~~g--D~~~------~~--~n~~~~~~~~~~~kgge~~~~~~ 393 (395) T protein:vir:40 324 IKVQDIQEIASSMDVLFHIGVNTIDDNLRMIGREPVMSPET--QERF------VT--KNYAPLGENEEDLKGGDINENKG 393 (395) T ss_pred hhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCC--ceee------ec--cccccccccccccCCCCCCCCcC Confidence 7778999999998876 489999999988754 222211 0000 00 0001000 111111112222222 Q ss_pred C Q lcl|NC_021326. 445 E 445 (445) Q Consensus 445 ~ 445 (445) + T Consensus 394 ~ 394 (395) T protein:vir:40 394 D 394 (395) T ss_pred C Confidence 2 No 261 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=59.35 E-value=0.39 Score=22.80 Aligned_cols=340 Identities=8% Similarity=-0.054 Sum_probs=126.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) ++.++.++... ......+-.+ . ..-...-+...-...+|+..++-+..-|+.+--.+.. T Consensus 3 ~f~~l~~~~~~----~~~~~~~~~~---------------~--~~~~~~~l~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~ 61 (376) T protein:vir:78 3 FFSELFKRNKE----IEWMWDLDFL---------------E--DKTTKVYLKKMALNTCVKHIARTIAKSDFRLKNGETS 61 (376) T ss_pred hhhhhhccCCc----cccccchhhc---------------c--ccchhhhhhhHHHHHHHHHHHHhhcccceeecccccc Confidence 33333222110 0000000000 0 0000000112334456666666666666655322222 Q ss_pred HHHHHHH-Hhc--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEE-EEEccceeEEEEcCCCCCceEEEEEEEee Q lcl|NC_021326. 81 VIKRIDE-VLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKL-FRVPAEQGIPIWTDKEHEELEAFIRMYKL 153 (445) Q Consensus 81 ~~~~l~~-~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i-~~~~p~~~~~v~d~~~~~~~~~~v~~~~~ 153 (445) ....+.. +.. | ........+..+.+.+|.+|+++..+..|.+.- ..+.|..+.+ ..++. T Consensus 62 ~~~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~~~~~~~~~~~~-------------~~~~~- 127 (376) T protein:vir:78 62 VRDKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIADSYVRKEFAFFP-------------DVFEG- 127 (376) T ss_pred ccchHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeeccceeecccceee-------------eeeee- Confidence 2222222 221 3 245556677888899999999887776654321 1111111100 00000 Q ss_pred ecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-CCCcCccHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 154 ENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-NDLEISDIFMYKTLIDAYNRRLSDL 232 (445) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-~~~g~s~~~~v~~lid~~~~~~s~~ 232 (445) +.+. .+..... .+... |++++. ...+.+-. .++...+..+.... T Consensus 128 -----~~~~-~~~~~~~------------------------~~~~e--vih~~~~~~~~~~~~---~~~~~~~~~~~~~~ 172 (376) T protein:vir:78 128 -----VTVK-DYRYNRN------------------------FSMDD--VIFLEYGNERLSAFT---DGMFEDYGELFGKM 172 (376) T ss_pred -----eeee-cceeeee------------------------ecccc--EEEeccCCCCchhhh---hHHHHHHHHHHHHH Confidence 0000 0000000 00001 233321 11122222 23333344444443 Q ss_pred HHHHHH--hcCCeeEEecCC---cccchhHHHhh----h-----hCceeeccCCCceeeEeccC-----ChHHHHHHHHH Q lcl|NC_021326. 233 SNTFKD--SNELTYVLTNYD---DQELPEFKRLL----R-----YYGAIKVSDNGGVDTIQVEV-----PVENSKKYLDE 293 (445) Q Consensus 233 ~~~~~~--~~~~~l~~~g~~---~~~~~~~~~~~----~-----~~~~~~~~~~~~~~~l~~~~-----~~~~~~~~i~~ 293 (445) .....+ ...+.+++.... .+.....+..+ + ...++.++++.+.+.++... ....+.+..+. T Consensus 173 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~ 252 (376) T protein:vir:78 173 IRAQMRNFQIRGAVNFKMAGVADKDKQTKLQEYIDKVYASFNNNEIAIVPQLEGFNYEEFGTTSVNNSQSFDEVKKLRKE 252 (376) T ss_pred HHHHHhcCCCceeEEEccCCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceEEeeccCccccchhHHHHHHHHHH Confidence 333322 223444443211 11111111111 1 11133355555555444322 11245556666 Q ss_pred HHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---CCCcceEEEEeCCC Q lcl|NC_021326. 294 LYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI---KGEHKDVDISFNYN 370 (445) Q Consensus 294 l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~---~~~~~~i~v~f~~~ 370 (445) ..+.|+..-++|....+...++.+...+ ..+...|.-.+..+...+.. ......+...+... T Consensus 253 ~~~~Ia~~fgVPp~~l~~~~s~~e~~~~---------------~f~~~~l~P~~~~ie~~l~~kll~~~~~~~~~~~~~l 317 (376) T protein:vir:78 253 MIDYVASILGIPSSLLHGDMADLSNNMK---------------AYMEYCIDPLTKKLEDELNAKLFTFSEFLAGEHIKII 317 (376) T ss_pred HHHHHHHHhCCCHHHhCCCCCCHHHHHH---------------HHHHHHHHHHHHHHHHHHHhhhCCcccceecccchhh Confidence 6777777778876444321122222111 12222222222222221111 11112223333444 Q ss_pred CCCCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCC Q lcl|NC_021326. 371 KVANTELQVQTAQQS--MGIVSHETVLENHPFV--EDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQS 444 (445) Q Consensus 371 ~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 444 (445) +-.|..+.++++.++ .|+++.-.++++++.- ++... ++. .. ..+-.+-+..+++| T Consensus 318 l~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~p~~~g~~--d~~----------~~-------~~n~~~~~~~~e~g 376 (376) T protein:vir:78 318 HKKDIIENAEAVDKLVASGSFNRNEVRELLGAERVDNPEL--DKY----------LI-------TKNYQSADEGGEDG 376 (376) T ss_pred cccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC--cee----------ee-------ccCceehhccccCC Confidence 556888889988876 4889998988887542 11100 000 00 00001111111111 No 262 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=58.23 E-value=0.42 Score=22.67 Aligned_cols=378 Identities=11% Similarity=0.113 Sum_probs=160.4 Q ss_pred ChH--HHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhh-hhccCeeeccC Q lcl|NC_021326. 1 MIV--RYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY-IVGKPIAFKHT 77 (445) Q Consensus 1 ~l~--~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~-l~g~~~~~~~~ 77 (445) .+. .-++.-.+.+.+|..+..+++-+.. ...||+..+-+ -..+|+.+..+ T Consensus 60 ~~~~e~~~~~~~eLI~~YR~ma~~pEvd~A---------------------------v~eIVneaiv~d~~~~pV~l~L~ 112 (521) T protein:vir:65 60 FYSTDQKISTTKQLVNTYRGLMNNHEVENA---------------------------VQNIVNDAIVFEEGHEVVSLNLE 112 (521) T ss_pred eccccchhhhHHHHHHHHHHHhhccchhhH---------------------------HHHhhcceeEecCCCceEEEEec Confidence 111 0111223344455554444433322 12233332222 23356666554 Q ss_pred chHHHHH--------HHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC---CcEEEEEEccceeEEEEcCCCCCceE Q lcl|NC_021326. 78 DDEVIKR--------IDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE---GEFKLFRVPAEQGIPIWTDKEHEELE 145 (445) Q Consensus 78 d~~~~~~--------l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~---g~~~i~~~~p~~~~~v~d~~~~~~~~ 145 (445) +-++.+. .+.++. =+|+.+.++..+...+.|+.|+..-+|++ |-..+..+||+.+..+..... T Consensus 113 ~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~~pk~GI~ELr~lDPr~i~~vr~i~k----- 187 (521) T protein:vir:65 113 ATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIGKNPKDGIVELRQLDPRNLEYVREIIT----- 187 (521) T ss_pred ccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEEEEcCCccccceeeeeeCCcceeeeeeecc----- Confidence 4332222 222221 16788888999999999999999877643 556688999998866543211 Q ss_pred EEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc---eEEecC------CCCcCccHH Q lcl|NC_021326. 146 AFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIPFKN------NDLEISDIF 216 (445) Q Consensus 146 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~n------~~~g~s~~~ 216 (445) +......++.. ...+|.+..+.......+. . ..++.--+|| |++... +..-.|-+. T Consensus 188 --------~~~~~~~v~~~-~~e~f~Y~~~~~~~~~~g~--~----~~~~~~vkI~~dAI~y~hSGl~d~~~~~i~syLh 252 (521) T protein:vir:65 188 --------EDTPEGKIYKA-TKEYFIYTVGNSSYCAGGQ--V----FSPNSRVKIPRSAITYAHSGLMDCDDKYIIGYLH 252 (521) T ss_pred --------cccCCcceecc-eeeeeeeecCCcceeccce--e----ecCCcceeechhheeeeeccceeCCCCeeeecch Confidence 11111111111 1111111111100000000 0 0011111223 222211 111123343 Q ss_pred HHHHHHHHHHH--HHHHHHHHHHHhcCCeeEEecCCcccc-----hhHHHhh----hh-------------Cc------- Q lcl|NC_021326. 217 MYKTLIDAYNR--RLSDLSNTFKDSNELTYVLTNYDDQEL-----PEFKRLL----RY-------------YG------- 265 (445) Q Consensus 217 ~v~~lid~~~~--~~s~~~~~~~~~~~~~l~~~g~~~~~~-----~~~~~~~----~~-------------~~------- 265 (445) ..+..+|. ++-|.+...+..+.|-+=+.-.+..+. +...+.+ +. .+ T Consensus 253 ---kAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlE 329 (521) T protein:vir:65 253 ---RAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTE 329 (521) T ss_pred ---hhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccchhh Confidence 33344443 345666666666666432221121111 1111111 00 00 Q ss_pred eeecc--CCC-ceeeEeccC--ChHHHHHHHHHHHHHHHHHhCccccc--cccccC-c-chHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 266 AIKVS--DNG-GVDTIQVEV--PVENSKKYLDELYQKIMLFGQAVDFS--SDKFGS-A-PSGVALEFLYTNLNLKADKLA 336 (445) Q Consensus 266 ~~~~~--~~~-~~~~l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~~~--~~~~~~-~-~Sg~Ai~~~~~~l~~k~~~~~ 336 (445) -+++| +|+ +.+.-|.++ +... ..-+.-+++.++..-.+|--- .+..++ + --|..|-..+.....-+.+.+ T Consensus 330 DyWLpRReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EItRDEiKF~KFI~rLR 408 (521) T protein:vir:65 330 DYWLQRRDGKAITDVTTLPGASGMSD-IDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTLQ 408 (521) T ss_pred hhcccccCCCCccceeecccCCCcCh-HHHHHHHHHHHHHHhCCCceeccCCCCcceeccccchhhHHHHHHHHHHHHHH Confidence 01222 122 223333332 3333 333666777778777777321 121111 0 112234334444555667778 Q ss_pred HHHHHHHHHHHHHHHHHhccC--CCc----ceEEEEeCCCCCCCHHHHHH-------HHHHHh---c-cCChHHHHHhCC Q lcl|NC_021326. 337 RKAKVAIQELLWFVFEHFDIK--GEH----KDVDISFNYNKVANTELQVQ-------TAQQSM---G-IVSHETVLENHP 399 (445) Q Consensus 337 ~~~~~~l~~~~~~~~~~~~~~--~~~----~~i~v~f~~~~p~d~~~~~~-------~~~~~~---g-~~s~et~l~~l~ 399 (445) ..|...+.++|+.=+-+-|+- .++ ..|.+.|...-.-.+...++ ++..+. | .+|++++++.+- T Consensus 409 ~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~S~dyi~k~IL 488 (521) T protein:vir:65 409 SQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDIL 488 (521) T ss_pred HHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHh Confidence 888888888777544333332 222 35778886544433444443 333332 4 479999998764 Q ss_pred CCCC--HHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 400 FVED--LQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 400 ~~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) -.+| ..++.+.|++|..+- ..++ .++++++ T Consensus 489 r~tDeei~~~~k~I~~E~~~~--~~~~--------------p~~~~~~ 520 (521) T protein:vir:65 489 KYTDDQMDTEKKQIEEEANDP--RFKQ--------------TPDEIED 520 (521) T ss_pred ccCHHHHHHHHHHHHHhhhCC--CCCC--------------CcccccC Confidence 4443 223344444443321 1111 1111111 No 263 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=57.20 E-value=0.44 Score=22.54 Aligned_cols=369 Identities=10% Similarity=-0.022 Sum_probs=145.7 Q ss_pred cccccccccc--ccccccccc---ccccccchHHHHHHHHHhhhhccCeeeccCchH---HHHHHHHHhc--c---CHHH Q lcl|NC_021326. 29 IVKEPKPVDA--TGAVDPLKP---DDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE---VIKRIDEVLG--N---RFDD 95 (445) Q Consensus 29 i~~~~~~~~~--~~~~~~~~~---~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~---~~~~l~~~~~--n---~~~~ 95 (445) +-..|....+ .+....... ..-..++....+|+..++-+-+-|+.+--.+.+ .....+-+.. | +... T Consensus 1 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~~~~~~~~l~~lL~~~PN~~~t~~~ 80 (723) T protein:vir:94 1 MTTFPSGAGGWNAWSADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPDGELDELHPLSQLWNVMPNRAMPAQV 80 (723) T ss_pred CcccccCCCccccccccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCCCccchhhHHHHHHhhCCCCCCCHHH Confidence 1111111100 000000000 000112233445666666555567665322211 1112222221 3 2345 Q ss_pred HHHHHHHHHHhcCeEEEEEEECC---CCcE-EEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEEEEecceEEEEE Q lcl|NC_021326. 96 KLHSVLTGASNKGIEWLHPYLDE---EGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKITVNYYV 171 (445) Q Consensus 96 ~~~~~~~~~~~~G~~~~~v~~d~---~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 171 (445) +...+..+.+.+|.+|+.+..+. .|.| .+..++|..+.++..+....-.......|. +. T Consensus 81 f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~~~~~y~-----------------~~ 143 (723) T protein:vir:94 81 LKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVPQAQIIGYV-----------------IE 143 (723) T ss_pred HHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccceeeeeeEEE-----------------EE Confidence 55667778889999999987643 3444 356666665554443321100000000000 00 Q ss_pred EecceeeecccccccccccccccccccccceEEec-----CCCCcCccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeEE Q lcl|NC_021326. 172 YENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVL 246 (445) Q Consensus 172 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~g~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~~ 246 (445) ...+.... .+-. -|+|++ +...|.|.+......|.....+.......+...+.|-.++ T Consensus 144 ~~~G~~~~---------------~~~~--dIiHir~~~~~dg~~G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~giL 206 (723) T protein:vir:94 144 RTDGVRVP---------------VLAD--EMLWLRFSDPYDPLAVMAPWKAARAAVDADFYAATWQRQSFKNGARPGGVV 206 (723) T ss_pred ecCceeEE---------------eccc--ceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEE Confidence 00000000 0000 133332 2335788777666666655544444455556666776666 Q ss_pred ecCC--cccchhHHHhhh--------hCceeeccC--------CCceeeEeccCCh--HHHHHHHHHHHHHHHHHhCccc Q lcl|NC_021326. 247 TNYD--DQELPEFKRLLR--------YYGAIKVSD--------NGGVDTIQVEVPV--ENSKKYLDELYQKIMLFGQAVD 306 (445) Q Consensus 247 ~g~~--~~~~~~~~~~~~--------~~~~~~~~~--------~~~~~~l~~~~~~--~~~~~~i~~l~~~i~~~s~~p~ 306 (445) .-.. .+........+. .++.+.++. +.+.+|.....+. ..+.+..+.....|...-++|. T Consensus 207 ~~~~l~~e~~~~~~~~~~~~~~G~~Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp 286 (723) T protein:vir:94 207 NLGDMDEQTFTKTVAAFRSQVEGVQNAGRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRK 286 (723) T ss_pred EcCCCCHHHHHHHHHHHHHHhhchhhcCcceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCCh Confidence 5322 112212222221 122344432 1245555544443 3455555666667777778885 Q ss_pred cccccccCcch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---CCCcceEEEEeCC--CCCCCHHHHHH Q lcl|NC_021326. 307 FSSDKFGSAPS-GVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI---KGEHKDVDISFNY--NKVANTELQVQ 380 (445) Q Consensus 307 ~~~~~~~~~~S-g~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~---~~~~~~i~v~f~~--~~p~d~~~~~~ 380 (445) ......++..+ ..+.+. .+...|.-.+..+...+.. ......+.+.|+. .+..|..+.++ T Consensus 287 ~~i~~~st~sN~e~~~~~--------------f~~~tL~P~~~~ie~~ln~~Ll~~~g~~~~~~f~~~~lLr~D~~~r~~ 352 (723) T protein:vir:94 287 DALLGGSTYENQAEAKAA--------------VWTETLIPQMEVMASITDLQLLPDIGWTVEWDFNSVPALQEDLEAQAG 352 (723) T ss_pred hHcCCCCCcccHHHHHHH--------------HHHHHHHHHHHHHHHHHhHhhcccccCceEEeecchhhhhcCHHHHHH Confidence 43322111111 111111 1112222222222211111 1122356777764 45678888888 Q ss_pred HHHHH--hccCChHHHHHhCCC--CCCHHHHH--H-------------HHHHHHHHHHHh-----hhc--cccCCCCCCC Q lcl|NC_021326. 381 TAQQS--MGIVSHETVLENHPF--VEDLQAEL--E-------------RIEQEQMEYNKQ-----LPN--LDDGGADGAQ 434 (445) Q Consensus 381 ~~~~~--~g~~s~et~l~~l~~--~~d~~~E~--~-------------ri~~E~~~~~~~-----~~~--~~~~~~~~~~ 434 (445) .+.++ .|+++.-.+++.++. ++.-...+ . --.+|....... ..+ .....+.+.. T Consensus 353 ~~~~~v~~G~~T~NE~R~~lglpPi~gGd~~~~~~p~~~~~a~~~~~~p~~~e~~~~~~~~~~~~~~~~p~~~~~~~~~~ 432 (723) T protein:vir:94 353 RNQGYLVNDVLMVDEVRATIGLDPLPGGIGQMTLTPYRAQFAPAPAPAPAVEEGAARMLALLERVAADRPLPELPVRATT 432 (723) T ss_pred HHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceeccccccccCCCCCCccchhhhHhhhhhccccccccCcCCCCCCCCC Confidence 88775 489999888877633 22111110 0 000111111100 000 0111111111 Q ss_pred CCCCCCCcCCC Q lcl|NC_021326. 435 QKERSNDKQSE 445 (445) Q Consensus 435 ~~~~~~d~~~~ 445 (445) ..+.++.+..+ T Consensus 433 ~~~~~~~~~~~ 443 (723) T protein:vir:94 433 VLHHDPGPDPQ 443 (723) T ss_pred CCCCCcccCCc Confidence 11111221122 No 264 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=55.76 E-value=0.47 Score=22.37 Aligned_cols=295 Identities=10% Similarity=0.111 Sum_probs=101.4 Q ss_pred HHHHHhcCCCccccccccccccccccccccccccccch--HHHHHHH--HHhhh----hcc----Ceeecc------Cch Q lcl|NC_021326. 18 IGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNF--HANLVDQ--KVSYI----VGK----PIAFKH------TDD 79 (445) Q Consensus 18 ~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~--~~~iv~~--~~~~l----~g~----~~~~~~------~d~ 79 (445) +-++-+...+.-...+...... ....+. .+..| +..+.+. ..+|+ .|+ |+.... .+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~ 75 (351) T protein:vir:78 1 MSKRRSRAPRTFAAAPNPSAGS--AAPARA---EVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRAST 75 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhh--ccccee---EEEEcCCceeecCcchhhhhhhhhccCceecCCCCHHHHHHHHhhhH Confidence 0001111110000000000000 000000 00000 0000000 01111 122 111110 000 Q ss_pred HHHHHH--------HHHhccC-H-HHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEE Q lcl|NC_021326. 80 EVIKRI--------DEVLGNR-F-DDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFI 148 (445) Q Consensus 80 ~~~~~l--------~~~~~n~-~-~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v 148 (445) -....| ..+.-|. + ...+.+++.+.+.+|.+|+.+..+..|++ .+..++|..+.+..+.+ T Consensus 76 ~h~~~l~~k~n~l~~~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~~~~--------- 146 (351) T protein:vir:78 76 HHSSALFFKANVLASTFRPHRWLSRHAFERWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFS--------- 146 (351) T ss_pred hhhhhhhhhhhHHhhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCCEEEEEEecCcceEEeeeCC--------- Confidence 000011 1111122 1 22345677888999999999999888875 46666666654433221 Q ss_pred EEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHHH Q lcl|NC_021326. 149 RMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLID 223 (445) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~lid 223 (445) +++....... ...+ +-+ -|+++++ .-.|.|.+......+. T Consensus 147 ~~~~~~~~~~--------~~~~-------------------------~~~--eVihir~~~~~~~~yGl~~~~~a~~si~ 191 (351) T protein:vir:78 147 GFVYVNGWQE--------RHEF-------------------------APD--SVFQLVRPDINQEVYGLPEYLSSLHSAW 191 (351) T ss_pred eEEEEecCCe--------EEEE-------------------------ccc--cEEEEcCCCCCCCcccccHHHHHHHHHH Confidence 0110000000 0000 000 1333332 2357777765544444 Q ss_pred HHHHHHHHHHHHHHHhcCCeeE--EecCC--cccchhHHHhhhh-------CceeeccC---CCceeeEecc--CChHHH Q lcl|NC_021326. 224 AYNRRLSDLSNTFKDSNELTYV--LTNYD--DQELPEFKRLLRY-------YGAIKVSD---NGGVDTIQVE--VPVENS 287 (445) Q Consensus 224 ~~~~~~s~~~~~~~~~~~~~l~--~~g~~--~~~~~~~~~~~~~-------~~~~~~~~---~~~~~~l~~~--~~~~~~ 287 (445) .-+.+..-...-+...+.|-.+ ++|.. .+..+..+..++. .+++.+.. +.++++.-.. .....+ T Consensus 192 l~~~a~~~~~~~f~NGa~pggIl~~~~~~ls~e~~~~lr~~~~~~~G~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~qf 271 (351) T protein:vir:78 192 LNESSTLFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEF 271 (351) T ss_pred HHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceeeecCCCCccceeEEEcCCChhHHHH Confidence 3332222223334445555444 44532 2222222222221 11333322 2344554433 233456 Q ss_pred HHHHHHHHHHHHHHhCccccccccccCc------chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcc Q lcl|NC_021326. 288 KKYLDELYQKIMLFGQAVDFSSDKFGSA------PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHK 361 (445) Q Consensus 288 ~~~i~~l~~~i~~~s~~p~~~~~~~~~~------~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 361 (445) .+..+...+.|...-++|....+-..++ ....+..+... .|.-+++.+.++...-+. T Consensus 272 ~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f~~~---------------~l~P~~~~iee~n~~l~~-- 334 (351) T protein:vir:78 272 FNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFGRN---------------EIRPLQARFAELNDWLGD-- 334 (351) T ss_pred HHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHH---------------HHHHHHHHHHHHHhhcCc-- Confidence 6667777778888888886443322111 11111211111 111111111111111111 Q ss_pred eEEEEeCCCCCCCHHHHH Q lcl|NC_021326. 362 DVDISFNYNKVANTELQV 379 (445) Q Consensus 362 ~i~v~f~~~~p~d~~~~~ 379 (445) . -+.|++..-....+.+ T Consensus 335 ~-~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 335 E-VVRFDDYEIPPAPVAA 351 (351) T ss_pred c-ceecChhhhccccccC Confidence 1 1556543221111111 No 265 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=52.90 E-value=0.54 Score=22.04 Aligned_cols=196 Identities=9% Similarity=-0.002 Sum_probs=69.9 Q ss_pred EEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHH Q lcl|NC_021326. 148 IRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLI 222 (445) Q Consensus 148 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~li 222 (445) +|. .... .+|+......+...... ....-.. |+++++ .-.|.|.+......+ T Consensus 1 ~r~---~~dg--~~~y~~~~~~~~~~g~~----------------~~~~~~e--ilH~r~~~~~~~~~Glspi~~a~~~i 57 (219) T protein:vir:98 1 MRV---CKDG--NYKYLMKKSLYDTKSEI----------------YEYNKND--VIFIKLYDPMQQVYGSPDYVGGITSA 57 (219) T ss_pred Cce---eecC--eEEEEEecceecCCcee----------------EEecccc--EEEecCCCCCCCcceecHHHHHHHHH Confidence 111 1111 11111000000000000 0000011 233332 124777776544444 Q ss_pred HHHHHHHHHHHHHHHHhcCCeeEE--ecC--CcccchhHHHhhhh------Cce-eeccC---CCceeeEeccCCh--HH Q lcl|NC_021326. 223 DAYNRRLSDLSNTFKDSNELTYVL--TNY--DDQELPEFKRLLRY------YGA-IKVSD---NGGVDTIQVEVPV--EN 286 (445) Q Consensus 223 d~~~~~~s~~~~~~~~~~~~~l~~--~g~--~~~~~~~~~~~~~~------~~~-~~~~~---~~~~~~l~~~~~~--~~ 286 (445) ..-..+..-...-+...+.|-.++ +|. +.+..+.+...+.. .+. +.+.. +++++|.....+. .. T Consensus 58 ~~~~aa~~~~~~~f~Ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~q 137 (219) T protein:vir:98 58 LLNSDATIFRRRYYSNGAHMGFILYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGQKDE 137 (219) T ss_pred HHHHHHHHHHHHHHhcCCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEEccCCHHHHH Confidence 432222222233345566776554 343 22222222222221 122 22222 2346665544433 33 Q ss_pred HHHHHHHHHHHHHHHhCccccccccc-cCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-CCcceEE Q lcl|NC_021326. 287 SKKYLDELYQKIMLFGQAVDFSSDKF-GSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK-GEHKDVD 364 (445) Q Consensus 287 ~~~~i~~l~~~i~~~s~~p~~~~~~~-~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~-~~~~~i~ 364 (445) +.+..+.....|...-++|....+-. .+..++..++... ...+...|.-.+..+...++.. .-...+. T Consensus 138 fle~rk~~~~eIa~~fgVPp~~lG~~~~~~~~~sn~eq~~----------~~f~~~tL~P~~~~ie~~ln~~~~~~~~~~ 207 (219) T protein:vir:98 138 FANIKNISAQDVLTSHRFPPGLSGIIPVNTAGLGDPLKIR----------EAYQADEVLPLQEIIAESINSDYEIKSALK 207 (219) T ss_pred HHHHHHhhHHHHHHHhCCCHHHcccccCCCCCccCHHHHH----------HHHHHHHHHHHHHHHHHHhhhhhcCCCccE Confidence 44444555566777777776543311 1111111121111 1111222222222222222111 1123457 Q ss_pred EEeCCCCCCCHH Q lcl|NC_021326. 365 ISFNYNKVANTE 376 (445) Q Consensus 365 v~f~~~~p~d~~ 376 (445) +.|....+.|.. T Consensus 208 ~~F~~~~~~d~~ 219 (219) T protein:vir:98 208 VNFKQPEKRDKN 219 (219) T ss_pred EeecCcccccCC Confidence 788877777666 No 266 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=50.64 E-value=0.6 Score=21.78 Aligned_cols=323 Identities=12% Similarity=0.142 Sum_probs=118.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeec---cC Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFK---HT 77 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~---~~ 77 (445) ++.++. .++...-. ..+.... ...... ..........+|+..++-+..-|+.+- .+ T Consensus 3 ~f~k~~--------------~~~~~~~~--~~~~~~~-~~~~~~----~~~~~~~v~~~v~~ia~~iA~lp~~~~~~~~~ 61 (378) T protein:vir:85 3 LFGKVV--------------SFSRGKLN--NDTQRVT-AWQNEA----VEYTSAFVTNIHNKIANEITKVEFNHVKYKKS 61 (378) T ss_pred hhhhhh--------------hhhhcccc--cCCccee-eeeccc----hhhhhHHHHHHHHHHHHhHhhCceeEEEEecc Confidence 333332 12221110 0000000 000000 001122334466666666655565431 11 Q ss_pred c-------hHHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEE-EEECCCCcEEEEEEccceeEEEEcCCCCCc Q lcl|NC_021326. 78 D-------DEVIKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLH-PYLDEEGEFKLFRVPAEQGIPIWTDKEHEE 143 (445) Q Consensus 78 d-------~~~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~-v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~ 143 (445) + +....-+...+. | .-..+...+....+.+|.+|++ ++.+.+|++...+ |.++ T Consensus 62 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~---------~~~~---- 128 (378) T protein:vir:85 62 DVGSDTLISMAGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSETGELLDLL---------FAND---- 128 (378) T ss_pred ccccccccccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCCCceEEEEE---------ecCC---- Confidence 0 011122223332 2 2345556677888899999975 4445455432211 1100 Q ss_pred eEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCCCcCccHHHHHHHHH Q lcl|NC_021326. 144 LEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLID 223 (445) Q Consensus 144 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~v~~lid 223 (445) ... |... -++++.+.-.+.+....+..+.+ T Consensus 129 --------------~~~-~~~~-----------------------------------dvih~~~~~~~~~~~~~~~~a~~ 158 (378) T protein:vir:85 129 --------------KKE-YKPE-----------------------------------ELVRLVSPFYINEDTSILDNALA 158 (378) T ss_pred --------------CEE-Eccc-----------------------------------ceEEEecCcCccchhhHHHHHHH Confidence 000 0000 01122111011111111222223 Q ss_pred HHHHHHHHHHHHHHHhcCC--eeEEecCCccc-chhHH----Hhh-------hhCceeeccCCCceeeEeccCChHHHHH Q lcl|NC_021326. 224 AYNRRLSDLSNTFKDSNEL--TYVLTNYDDQE-LPEFK----RLL-------RYYGAIKVSDNGGVDTIQVEVPVENSKK 289 (445) Q Consensus 224 ~~~~~~s~~~~~~~~~~~~--~l~~~g~~~~~-~~~~~----~~~-------~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 289 (445) .++.. +. .+.+ ++.+.+...+. ..+.. ..+ ...+++.++++.+.+.++.+.....+ . T Consensus 159 ~~~~~-------~~-~~~~~g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~ 229 (378) T protein:vir:85 159 SIQTK-------LE-QGKLRGLLKINAFLDIDNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-D 229 (378) T ss_pred HHHHH-------Hh-cCCcceEEEeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceecCCCceEEeccCChhhhhH-H Confidence 22221 11 1222 22233321111 11111 111 12245566666655555433333333 3 Q ss_pred HHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-----------C Q lcl|NC_021326. 290 YLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK-----------G 358 (445) Q Consensus 290 ~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~-----------~ 358 (445) .++.+.+.|+..-++|..... +++... .. ...+...|.-.+..+...+..+ . T Consensus 230 ~~~~~~~~Ia~~fgVPp~~l~--~s~~e~-----~~----------~~f~~~tL~P~~~~ie~~l~~kLl~~~er~~~~~ 292 (378) T protein:vir:85 230 EIELIKSELLTGYFMNENILL--GTATQE-----QQ----------IYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKG 292 (378) T ss_pred HHHHHHHHHHHHhCCCHHHhc--CCchHH-----HH----------HHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhh Confidence 456667778888888764332 111111 10 1122333333333332222110 0 Q ss_pred --CcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHH-----HHHHHHHHHHHHHHhhhcccc Q lcl|NC_021326. 359 --EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPF--VEDLQA-----ELERIEQEQMEYNKQLPNLDD 427 (445) Q Consensus 359 --~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~--~~d~~~-----E~~ri~~E~~~~~~~~~~~~~ 427 (445) ...++.+.+...+-.|..+.++.+.++ .|+++.-.++++++. +++-+. -+..+.. ...... T Consensus 293 ~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~gGD~~~~~~N~~~~~~--------~~~~~~ 364 (378) T protein:vir:85 293 NLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDIYIANLNAVAVKN--------LSDLQG 364 (378) T ss_pred ccccceeeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeeccccccccc--------chhhcC Confidence 112344445566678889999988876 489999898888744 221100 0000000 000000 Q ss_pred CCCCCCCCCCCCCCc Q lcl|NC_021326. 428 GGADGAQQKERSNDK 442 (445) Q Consensus 428 ~~~~~~~~~~~~~d~ 442 (445) +..+....++.+++ T Consensus 365 -~~~~~~~~~e~~n~ 378 (378) T protein:vir:85 365 -SRKDVASTDETNNQ 378 (378) T ss_pred -ccCCCCCCCCCCCC Confidence 00000111111111 No 267 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=49.08 E-value=0.65 Score=21.61 Aligned_cols=380 Identities=15% Similarity=0.139 Sum_probs=161.5 Q ss_pred ChHHHH-------HHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhh-hhccCe Q lcl|NC_021326. 1 MIVRYI-------KQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY-IVGKPI 72 (445) Q Consensus 1 ~l~~~i-------~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~-l~g~~~ 72 (445) +-..+. +.-.+.+.+|..+..+++-+.. ...||+..+-+ -..+|+ T Consensus 57 ~~q~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd~A---------------------------v~eIVneaiv~d~~~~pV 109 (524) T protein:vir:10 57 LMQQMFGSNEPEVKNTRELIDTYRNLMNNYEVDNA---------------------------VQEIVSDAIVYEDDKEVV 109 (524) T ss_pred hhhhhhhcccchhhhHHHHHHHHHHHhhccchhhH---------------------------HHHhhcceeEecCCCceE Confidence 211111 1222334444444444433321 12233333222 233566 Q ss_pred eeccCchHHHH--------HHHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECC----CCcEEEEEEccceeEEEEcCC Q lcl|NC_021326. 73 AFKHTDDEVIK--------RIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDE----EGEFKLFRVPAEQGIPIWTDK 139 (445) Q Consensus 73 ~~~~~d~~~~~--------~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~----~g~~~i~~~~p~~~~~v~d~~ 139 (445) .+..++-++.+ ..+.++. =+|+.+.++..+...+.|+.|+..-+|. +|-..+..+||+.+-.|.. T Consensus 110 ~l~Ld~~~~s~siK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~-- 187 (524) T protein:vir:10 110 ALNLDGTDFSQSIKDKILAEFSEVLNLLNFQRKGTDHFQRWYVDSRIFFHKIINPKKMKDGVQELRRLDPRQVQYIRE-- 187 (524) T ss_pred EEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeceEEEEEEeeCCCccccceeeeeeCCccceeeee-- Confidence 66554433222 2222221 2678888899999999999999887763 3556788899988754432 Q ss_pred CCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc---eEEecCCC---CcCc Q lcl|NC_021326. 140 EHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIPFKNND---LEIS 213 (445) Q Consensus 140 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~n~~---~g~s 213 (445) ...+...++.+..... .+|.+..+.......+. . ..++.--+|| |++....- .+.- T Consensus 188 -----------i~~~~~~~~~vi~~~~-e~f~Y~~~~~~~~~~~~---~---~~~~~~ikI~~dAIvy~~SGL~d~~~~~ 249 (524) T protein:vir:10 188 -----------IVTRMEDGVKIVDGYR-EFFVYDTGHESYCADGR---I---YSAGTKVKIPRAAVVYAHSGLLDCCGKN 249 (524) T ss_pred -----------ecccCcccchhhcchh-hheeecCCCcccccCcc---e---ecCCcceecchhheeeeccCcccCCCCc Confidence 1112222222221111 12222211110000000 0 0011111344 33332211 1111 Q ss_pred cHHHHHHHHHHHHH--HHHHHHHHHHHhcCCeeEEecCCccc-----chhHHHhh----hhC------------------ Q lcl|NC_021326. 214 DIFMYKTLIDAYNR--RLSDLSNTFKDSNELTYVLTNYDDQE-----LPEFKRLL----RYY------------------ 264 (445) Q Consensus 214 ~~~~v~~lid~~~~--~~s~~~~~~~~~~~~~l~~~g~~~~~-----~~~~~~~~----~~~------------------ 264 (445) .+.-+...+..+|. ++-|.+...+..+.|-+=+.=.+..+ .+...+.+ +.. T Consensus 250 i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~ms 329 (524) T protein:vir:10 250 IIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTMKNRVVYDASTGKIKNQQHNMS 329 (524) T ss_pred eeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeccCCeeccchhhhh Confidence 12223333444443 34555666666666643222111111 11111111 000 Q ss_pred --ceeecc--CCC-ceeeEeccC--ChHHHHHHHHHHHHHHHHHhCccc--ccccc---ccCcchHHHHHHHHHHHHHHH Q lcl|NC_021326. 265 --GAIKVS--DNG-GVDTIQVEV--PVENSKKYLDELYQKIMLFGQAVD--FSSDK---FGSAPSGVALEFLYTNLNLKA 332 (445) Q Consensus 265 --~~~~~~--~~~-~~~~l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~--~~~~~---~~~~~Sg~Ai~~~~~~l~~k~ 332 (445) .-+++| +|+ +.+.-|.++ +... ..-+.-+++.++..-.+|- +..+. +..+ -+..|-..+.....-+ T Consensus 330 MlEDyWLpRReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~f~~g-r~~EItRDEiKF~KFI 407 (524) T protein:vir:10 330 MTEDYWLQRRDGKAVTEVDTMPGATGMSD-MDDVLYFRTALYRALRIPESRIPSESNSGVMFD-AGTAITRDELKFAKWI 407 (524) T ss_pred hHhhhcccccCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCchhccCCCCcccccc-ccchhhHHHHHHHHHH Confidence 011232 122 223333333 3333 3346667777787777773 21121 1111 2223433444455666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccC--CCc----ceEEEEeCCCCCCCHHHHHHH-------HHHHh---c-cCChHHHH Q lcl|NC_021326. 333 DKLARKAKVAIQELLWFVFEHFDIK--GEH----KDVDISFNYNKVANTELQVQT-------AQQSM---G-IVSHETVL 395 (445) Q Consensus 333 ~~~~~~~~~~l~~~~~~~~~~~~~~--~~~----~~i~v~f~~~~p~d~~~~~~~-------~~~~~---g-~~s~et~l 395 (445) .+.+..|...+.++|+.=+-+-|+- .++ ..|.+.|...-.-.+...++. +..+. | .+|+++++ T Consensus 408 ~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~ 487 (524) T protein:vir:10 408 RQLQNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLTMAEPFIGKYISHQTAM 487 (524) T ss_pred HHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHH Confidence 7778888888888777544443432 222 457788865444334444433 33332 3 47999999 Q ss_pred HhCCCCCC--HHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 396 ENHPFVED--LQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 396 ~~l~~~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) +.+--.+| ..++-+.|++|..+- ..++.. .++.| T Consensus 488 k~ILr~tDeei~~~~k~I~~E~k~~--~~~~~~--------------~~~~~ 523 (524) T protein:vir:10 488 KDFLQMTDEEINQEAKQIEEESKEA--RFQNPD--------------EEEED 523 (524) T ss_pred HHHhccCHHHHHHHHHHHHHHhhcC--CCCCCC--------------hhhhc Confidence 87644443 233344444443321 111100 01111 No 268 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=47.25 E-value=0.71 Score=21.41 Aligned_cols=323 Identities=11% Similarity=0.092 Sum_probs=119.9 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeee-c--cC Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF-K--HT 77 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~-~--~~ 77 (445) |..++. .+....-.....+... ...... -........+|+..++-+..-|+.+ . .. T Consensus 3 if~~~~--------------~~~~~~~~~~~~~~~~---~~~~~~----~~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~ 61 (378) T protein:vir:94 3 LFGKVV--------------SFSRGKLNNDTQRVTA---WQNEAV----EYTSAFVTNIHNKIANEITKVEFNHVKYKKS 61 (378) T ss_pred hhHHhH--------------hhhhcccccCcceeee---eecchh----hhhhHHHHHHHHHHHHhHhhCceeeeeeccc Confidence 333332 1211110000001000 000000 0112334556777776666666642 1 11 Q ss_pred c-------hHHHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEE-EEECCCCcEEEEEEccceeEEEEcCCCCCc Q lcl|NC_021326. 78 D-------DEVIKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLH-PYLDEEGEFKLFRVPAEQGIPIWTDKEHEE 143 (445) Q Consensus 78 d-------~~~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~-v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~ 143 (445) + +.....+...+. | .-..+...+..+.+..|.+|++ ++.+..|++...+ T Consensus 62 ~~~~~~~~~~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~----------------- 124 (378) T protein:vir:94 62 DVGSDTLISMAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETGELLDLL----------------- 124 (378) T ss_pred ccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCCcEEEEE----------------- Confidence 0 011112223331 2 2345556678888999999875 4444444432110 Q ss_pred eEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCC---CCcCccHHHHHH Q lcl|NC_021326. 144 LEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNN---DLEISDIFMYKT 220 (445) Q Consensus 144 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~---~~g~s~~~~v~~ 220 (445) +... ..+ +.. . .|+++.+. ..+.+.+. . T Consensus 125 -------~~~~---~~~-~~~---------------------------------~--dvih~~~~~~~~~~~~~~~---~ 155 (378) T protein:vir:94 125 -------FAND---KKE-YKP---------------------------------E--ELVRLTSPFYINEDTSILD---N 155 (378) T ss_pred -------EecC---cEE-ech---------------------------------h--ceeeecCcCCcccchhHHH---H Confidence 0000 000 000 0 12222210 11122222 2 Q ss_pred HHHHHHHHHHHHHHHHHHhc-CCeeEEecCCcc-cchh----HHHhh-------hhCceeeccCCCceeeEeccCChHHH Q lcl|NC_021326. 221 LIDAYNRRLSDLSNTFKDSN-ELTYVLTNYDDQ-ELPE----FKRLL-------RYYGAIKVSDNGGVDTIQVEVPVENS 287 (445) Q Consensus 221 lid~~~~~~s~~~~~~~~~~-~~~l~~~g~~~~-~~~~----~~~~~-------~~~~~~~~~~~~~~~~l~~~~~~~~~ 287 (445) +..+++... ..++ ..++...+...+ ...+ +...+ ...+++.++++.+.+.++.......+ T Consensus 156 ~~~~~~~~~-------~~~~~~g~l~~~~~l~~~~~~~~~e~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~~ 228 (378) T protein:vir:94 156 ALASIQTKL-------EQGKLRGLLKINAFLDIDNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK 228 (378) T ss_pred HHHHHHHHH-------hhCCcccceeeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceeccCCceEEEccCChHHhhH Confidence 222222221 1111 112333232111 1111 11111 11235666666655555433322333 Q ss_pred HHHHHHHHHHHHHHhCccccccccccCcch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---------- Q lcl|NC_021326. 288 KKYLDELYQKIMLFGQAVDFSSDKFGSAPS-GVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI---------- 356 (445) Q Consensus 288 ~~~i~~l~~~i~~~s~~p~~~~~~~~~~~S-g~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~---------- 356 (445) ..++.+.+.|+..-++|..-.. +..+ ...+ ..+...|.-++..+...+.. T Consensus 229 -~~~~~~~~~Ia~~fgvPp~~l~---g~~~e~~~~---------------~f~~~tl~P~~~~ie~~l~~~Ll~~~e~~~ 289 (378) T protein:vir:94 229 -DEIDLIKSELLTGYFMNENILL---GTATQEQQI---------------YFYNSTIIPLLIQLEKELTYKLISTNRRRV 289 (378) T ss_pred -HHHHHHHHHHHHHhCCCHHHhc---CCchHHHHH---------------HHHHHHHHHHHHHHHHHHHhhcCChhHhhh Confidence 5566677778887788753332 1111 1111 12222233333222221111 Q ss_pred ---CCCcceEEEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHHHHhhhccccCC Q lcl|NC_021326. 357 ---KGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPF--VEDLQAELERIEQEQMEYNKQLPNLDDGG 429 (445) Q Consensus 357 ---~~~~~~i~v~f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~--~~d~~~E~~ri~~E~~~~~~~~~~~~~~~ 429 (445) ......+.+.++.-+-.|..+.++.+.++ .|+++.-.++++++. +++-+.=+ +. .........+ T Consensus 290 g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~NE~R~~~g~~p~~ggd~~~--~~-------~n~~~~~~~~ 360 (378) T protein:vir:94 290 VKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVYI--AN-------LNAVAVKNLS 360 (378) T ss_pred hhhhcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeee--ec-------ccccchhcch Confidence 01123455566777788999999998886 489999999888744 22110000 00 0000000000 Q ss_pred CC-CCCCCCC-CCCcCCC Q lcl|NC_021326. 430 AD-GAQQKER-SNDKQSE 445 (445) Q Consensus 430 ~~-~~~~~~~-~~d~~~~ 445 (445) .. +++++.. .++..+| T Consensus 361 ~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 361 DLQGNRKDVTSTDETNNQ 378 (378) T ss_pred hcccccCCCCCCCCCCCC Confidence 00 0000000 0111111 No 269 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=46.46 E-value=0.73 Score=21.32 Aligned_cols=378 Identities=11% Similarity=0.107 Sum_probs=160.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhh-hhccCeeeccCch Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSY-IVGKPIAFKHTDD 79 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~-l~g~~~~~~~~d~ 79 (445) =+.--++.-.+.+.+|..+..+++-+.. ...||+..+-+ -..+|+.+..++. T Consensus 62 ~~e~~~~~~~eLI~~YR~ma~~pEvd~A---------------------------v~eIVneaiv~d~~~~pV~l~L~~~ 114 (521) T protein:vir:81 62 STDQKISTTKQLVNTYRGLMNNHEVENA---------------------------VQNIVNDAIVFEEGHEVVSLNLEAT 114 (521) T ss_pred ccccchhhHHHHHHHHHHHhhccchhhH---------------------------HHHhhcceeEecCCCceEEEEeccc Confidence 0111112223444555555544443322 12233332222 1335566554443 Q ss_pred HHHHH--------HHHHhc-cCHHHHHHHHHHHHHhcCeEEEEEEECCC---CcEEEEEEccceeEEEEcCCCCCceEEE Q lcl|NC_021326. 80 EVIKR--------IDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEE---GEFKLFRVPAEQGIPIWTDKEHEELEAF 147 (445) Q Consensus 80 ~~~~~--------l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~---g~~~i~~~~p~~~~~v~d~~~~~~~~~~ 147 (445) ++.+. .+.++. =+|+.+.++..+...+.|+.|+..-+|++ |-..+..+||+.+..+..... T Consensus 115 ~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~~pk~GI~Elr~lDPr~i~~vr~i~k------- 187 (521) T protein:vir:81 115 GFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIGKNPKDGIVELRQLDPRNLEYVREIIT------- 187 (521) T ss_pred ccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEEcCCccccceeeeeeCCcceeeeeeecc------- Confidence 22222 222221 16788888999999999999998877643 556688999998866543211 Q ss_pred EEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccc---eEEecC------CCCcCccHHHH Q lcl|NC_021326. 148 IRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIP---FIPFKN------NDLEISDIFMY 218 (445) Q Consensus 148 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~n------~~~g~s~~~~v 218 (445) +...++.+... ...+|.+..+.......+. . ..++.--+|| |++... +..-.|-+. T Consensus 188 ------~~~~~~~v~~~-~~e~f~Y~~~~~~~~~~g~--~----~~~~~~vkI~~dAI~y~hSGl~d~~~~~i~syLh-- 252 (521) T protein:vir:81 188 ------EDTPEGKIYKA-TKEYFIYTVGNSSYCAGGQ--V----FSPNSRVKIPRSAITYAHSGLMDCDDKYIIGYLH-- 252 (521) T ss_pred ------cccCccceecc-eeeeeeeecCCccccccce--e----ecCCcceeechhheeeeeccceeCCCCeeeecch-- Confidence 11111111111 1111111111110000000 0 0011111233 222221 111123343 Q ss_pred HHHHHHHHH--HHHHHHHHHHHhcCCeeEEecCCcccc-----hhHHHhh----hh-------------Cc-------ee Q lcl|NC_021326. 219 KTLIDAYNR--RLSDLSNTFKDSNELTYVLTNYDDQEL-----PEFKRLL----RY-------------YG-------AI 267 (445) Q Consensus 219 ~~lid~~~~--~~s~~~~~~~~~~~~~l~~~g~~~~~~-----~~~~~~~----~~-------------~~-------~~ 267 (445) ..+..+|. ++-|.+...+..+.|-+=+.-.+..+. +...+.+ +. .+ -+ T Consensus 253 -kAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDy 331 (521) T protein:vir:81 253 -RAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTEDY 331 (521) T ss_pred -hhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccchhhhh Confidence 33444443 345566666666666432221121111 1111111 00 00 01 Q ss_pred ecc--CCC-ceeeEeccC--ChHHHHHHHHHHHHHHHHHhCcccccc--ccccC--cchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 268 KVS--DNG-GVDTIQVEV--PVENSKKYLDELYQKIMLFGQAVDFSS--DKFGS--APSGVALEFLYTNLNLKADKLARK 338 (445) Q Consensus 268 ~~~--~~~-~~~~l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~~~~--~~~~~--~~Sg~Ai~~~~~~l~~k~~~~~~~ 338 (445) ++| +|+ +.+.-|.++ +... ..-+.-+++.++..-.+|---. +..++ .--|..|-..+.....-+.+.+.. T Consensus 332 WLpRReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~r 410 (521) T protein:vir:81 332 WLQRRDGKAITDVTTLPGASGMSD-IDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTRQSQ 410 (521) T ss_pred cccccCCCcccceeecccCCCCCh-HHHHHHHHHHHHHHhCCccccccCCCCcceeccccchhhHHHHHHHHHHHHHHHH Confidence 222 122 223333332 3333 3336667777777777773211 21111 012223433344455566777888 Q ss_pred HHHHHHHHHHHHHHHhccC--CCc----ceEEEEeCCCCCCCHHHHHHH-------HHHHh---c-cCChHHHHHhCCCC Q lcl|NC_021326. 339 AKVAIQELLWFVFEHFDIK--GEH----KDVDISFNYNKVANTELQVQT-------AQQSM---G-IVSHETVLENHPFV 401 (445) Q Consensus 339 ~~~~l~~~~~~~~~~~~~~--~~~----~~i~v~f~~~~p~d~~~~~~~-------~~~~~---g-~~s~et~l~~l~~~ 401 (445) |...+.++++.=+-+-|+- .++ ..|.+.|...-.-.+...++. +..+. | .+|++++++.+--. T Consensus 411 Fs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~ 490 (521) T protein:vir:81 411 FSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKY 490 (521) T ss_pred HHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcc Confidence 8888887776544333332 222 357788865444334444433 33332 4 47999999876444 Q ss_pred CC--HHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCcCCC Q lcl|NC_021326. 402 ED--LQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDKQSE 445 (445) Q Consensus 402 ~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (445) +| ..++-+.|++|..+- ..++.. +++++ T Consensus 491 tDeei~~~~k~I~~E~~~~--~~~~p~--------------~~~~~ 520 (521) T protein:vir:81 491 TDDQMDTEKKQIEEEANDP--RFKQTP--------------DEIED 520 (521) T ss_pred CHHHHHHHHHHHHHHhhCC--CCCCCc--------------ccccC Confidence 43 233444444444321 111111 11111 No 270 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=42.75 E-value=0.87 Score=20.91 Aligned_cols=234 Identities=12% Similarity=-0.013 Sum_probs=94.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeeccCchH Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDE 80 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~~d~~ 80 (445) |-.+.-++... ..-.....+... .+. .. ... ........-+..+-...+|+..++-+..-|+.+.-..+. T Consensus 3 lF~~~~~r~~~--~~~~~~~~~~~~---~~~---~~-~~~-~~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~ 72 (251) T protein:vir:46 3 IFYKNEKRDLQ--YNEDDLQMMVQT---LPS---FQ-GTK-LRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQI 72 (251) T ss_pred ccccccccccC--CCccchhhhhhh---hcc---cc-CcC-cceechhhhhccHHHHHHHHHHHHhHhhCceEEeeCccc Confidence 11110000000 000000000000 000 00 000 000000001122333456777777776667665432211 Q ss_pred -HH-HHHHHHhc--c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEEEEe Q lcl|NC_021326. 81 -VI-KRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYK 152 (445) Q Consensus 81 -~~-~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~ 152 (445) .. ....-+.. | ........+..+.+.+|.+|+.+..+.+|++ .+..++|..+.+..++. +.+.+.+.... T Consensus 73 ~~~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~--g~~~~~~~~~~ 150 (251) T protein:vir:46 73 NYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDAR--GRLYYFHQRID 150 (251) T ss_pred cccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCceEEEEECCC--CcEEEEEEEec Confidence 11 12222222 2 3455667788899999999999999999986 48899999998776642 33322111111 Q ss_pred eecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEec----CCCCcCccHHHHHHHHHHHHHH Q lcl|NC_021326. 153 LENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFK----NNDLEISDIFMYKTLIDAYNRR 228 (445) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~----n~~~g~s~~~~v~~lid~~~~~ 228 (445) .........+.... |++++ +...|.|.+......+...... T Consensus 151 ~~~~g~~~~~~~~d-----------------------------------iiH~r~~~~dg~~G~spi~~~~~~i~~~~~~ 195 (251) T protein:vir:46 151 SNGNNIERNVKFED-----------------------------------MLDIKFYSLDGINGLSLLDTLSRTIESDNNG 195 (251) T ss_pred cCCcceeEEECCcc-----------------------------------EEEecCcCCCCeeecCHHHHHHHHHHHHHHH Confidence 00000000000000 22222 1235788887777766666655 Q ss_pred HHHHHHHHHHhcCCeeEEecCCcccchhHHHhhh-hCceeec--cCCCceeeEeccCCh Q lcl|NC_021326. 229 LSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLR-YYGAIKV--SDNGGVDTIQVEVPV 284 (445) Q Consensus 229 ~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~-~~~~~~~--~~~~~~~~l~~~~~~ 284 (445) .....+.+...+.|-.++.-...-..++....++ .+..... ...+.+. .-.+. T Consensus 196 ~~~~~~~f~ng~~p~gil~~~~~l~~~e~~~~~~~~~~~~~~g~~n~g~~~---~gm~~ 251 (251) T protein:vir:46 196 KDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFPKVLVELNKLGKLS---YSMNQ 251 (251) T ss_pred HHHHHHHHHccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCcccccccc---cccCC Confidence 5556666677777765554321111111111111 1111100 1111111 00111 No 271 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=38.43 E-value=1.1 Score=20.43 Aligned_cols=298 Identities=10% Similarity=0.065 Sum_probs=100.1 Q ss_pred cCCCccccc-ccccccccccccc-ccc----cccccchHHHHHHHHHhhhhccCeeecc------CchHHHHHH------ Q lcl|NC_021326. 24 EQRPDIVKE-PKPVDATGAVDPL-KPD----DRMITNFHANLVDQKVSYIVGKPIAFKH------TDDEVIKRI------ 85 (445) Q Consensus 24 ~G~~~i~~~-~~~~~~~~~~~~~-~~~----~ri~~n~~~~iv~~~~~~l~g~~~~~~~------~d~~~~~~l------ 85 (445) ..++-.-.. .........+... .+. ++-...|...+-+...+| +--|+.... .+.-....| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~-~epp~~~~~La~l~~~n~~h~~~i~~k~N~ 79 (348) T protein:vir:26 1 MTEQLIHSHTTDGTESKSVYSFDPNPEPVDTNSWMTRYCELFYNDFDDY-WEPPISLKGLAEIANANGYHGSLLKARANY 79 (348) T ss_pred CCccccchhhccccCCceEEEecCCCeeecCcchHHHHHHHHhcCCCcc-ccCCCCHHHHHHHHhhhhhhhhhHhhhhhH Confidence 111100000 0000000000000 000 000011111111110001 001111110 010000001 Q ss_pred --HHHhccC-H-HHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEEEEEeeecceeEE Q lcl|NC_021326. 86 --DEVLGNR-F-DDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVE 160 (445) Q Consensus 86 --~~~~~n~-~-~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~ 160 (445) ..+.-|. + ...+.+++.+.+.+|.+|+.+..+..|++ .+..++|..+-.. .+ +.. ++....... T Consensus 80 l~~~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~v~~~-~d---~~~-----~~~~~~g~~-- 148 (348) T protein:vir:26 80 VAGRFMNGGGLPMYKMNSACWDYFGLGMSAFVKIRSYLKNVIALEPLPMVHMRKR-KN---GDF-----VQLLRNNEQ-- 148 (348) T ss_pred HhhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcCCCcEEEEEEecCceeEee-ec---CcE-----EEEEecCeE-- Confidence 1111232 1 23456677888999999999999888875 3666666554321 11 110 000000000 Q ss_pred EEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 161 YWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLIDAYNRRLSDLSNT 235 (445) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~lid~~~~~~s~~~~~ 235 (445) ..| .-+ -|+++++ ...|.|.+...+..+..-+.+..-...- T Consensus 149 -------~~f-------------------------~~~--dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~ 194 (348) T protein:vir:26 149 -------KVF-------------------------KAK--DVIFIPQYDPQQQIYGLPDYLGSIQSSLLNRDATLFRRRY 194 (348) T ss_pred -------EEE-------------------------cCc--cEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHH Confidence 000 000 1233321 2247777765444333222221112233 Q ss_pred HHHhcCCeeEE--ecCCc--ccchhHHHhhhh-------Cceeec-cC--CCceeeEecc--CChHHHHHHHHHHHHHHH Q lcl|NC_021326. 236 FKDSNELTYVL--TNYDD--QELPEFKRLLRY-------YGAIKV-SD--NGGVDTIQVE--VPVENSKKYLDELYQKIM 299 (445) Q Consensus 236 ~~~~~~~~l~~--~g~~~--~~~~~~~~~~~~-------~~~~~~-~~--~~~~~~l~~~--~~~~~~~~~i~~l~~~i~ 299 (445) ++..+.|-.++ ++... +..+.++..++. .+.+.+ ++ +.++++.... .....+.+..+..+..|. T Consensus 195 f~NGa~pg~Il~~~~~~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~Gi~~~pis~~~~d~qf~e~k~~t~~dIa 274 (348) T protein:vir:26 195 YLNGAHMGFIFYATDPNLSEADEKALKEKIASSKGIGNFRSMFVNIPNGKEKGIQLIPVGDIATKDEFERIKNITAQDIF 274 (348) T ss_pred HhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceeEEcCCCCccceeEEEccCChhHHHHHHHHHhhHHHHH Confidence 44455565544 44322 222222222221 112333 22 2234544332 233456666666677788 Q ss_pred HHhCccccccccc----c--CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEEeCCCCCC Q lcl|NC_021326. 300 LFGQAVDFSSDKF----G--SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVA 373 (445) Q Consensus 300 ~~s~~p~~~~~~~----~--~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~ 373 (445) ..-++|....+-. + ++....+..+....+. =....+..++. +.++... ...+++.|++..-. T Consensus 275 ~af~VPp~llGi~~~~~~~~sn~e~~~~~f~~~~l~----P~~~~ie~~ln-------~~l~~~~-~~~~~fdl~~~~e~ 342 (348) T protein:vir:26 275 VGHRFPAGMGGMLPQQGANVPDPLKVSQVYDFYEVI----PVCKRFMDAVN-------NDPEIPD-NLKLKFNLNPGVES 342 (348) T ss_pred HHhCCCHHHccccCCCCCccccHHHHHHHHHHHHHH----HHHHHHHHHHh-------hhhCCCC-ccEEEEecCccccc Confidence 8888886443211 1 1122212211111111 11111111111 1122221 22344444443322 Q ss_pred CHHHHH Q lcl|NC_021326. 374 NTELQV 379 (445) Q Consensus 374 d~~~~~ 379 (445) +....+ T Consensus 343 ~~~~a~ 348 (348) T protein:vir:26 343 ANGSAV 348 (348) T ss_pred chhhcC Confidence 222222 No 272 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=35.07 E-value=1.3 Score=20.05 Aligned_cols=354 Identities=12% Similarity=-0.027 Sum_probs=122.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeecc-Cch Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH-TDD 79 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~-~d~ 79 (445) |+.++..+..... ...+. ........ ...-+...-...+|+..++-+..-|+.+-. +++ T Consensus 3 lf~~~~~~~~~~~------~~~~~-------------~~~~~~~~-~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~ 62 (395) T protein:vir:98 3 ILDFFSFKKSGTL------SDDDS-------------GSTTSEKL-TNVVLKEDALYKCVNYLARIISKSTFRLKTPEKL 62 (395) T ss_pred chhhhcCCCcccc------ccccc-------------chhhhhhc-chhhhhhHHHHHHHHHHHHHHhhCceeEEecCCc Confidence 3333211100000 00000 00000000 000011223344566666666666665422 211 Q ss_pred H-HHHHHHHHhc---c---CHHHHHHHHHHHHHhcCeEEEEEEECCCCcEEEEEEccceeEEEEcCCCCCceEEEEEEEe Q lcl|NC_021326. 80 E-VIKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRMYK 152 (445) Q Consensus 80 ~-~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~ 152 (445) . ....+...+. | .-......+....+.+|.+|+++-.+..+ ++ |......+.. ... .++. T Consensus 63 ~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~~-----~~-~~~~~~~~~~--~~~-----~~~~ 129 (395) T protein:vir:98 63 TENQKDWLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKGI-----YV-ADSFTQDKKI--SGS-----QFKV 129 (395) T ss_pred ccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCce-----ec-CCcccccccc--cCc-----ccce Confidence 1 1112222221 3 23455667788899999999887665321 11 1111111000 000 0000 Q ss_pred eecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecCCC-----CcCccHHHHHHHHH-HHH Q lcl|NC_021326. 153 LENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISDIFMYKTLID-AYN 226 (445) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-----~g~s~~~~v~~lid-~~~ 226 (445) ... ..+.+.. ..... -|++++... .+.+.+.....++. .++ T Consensus 130 ~~~-------~~~~~~~------------------------~~~~~--evih~k~~~~~~~~~~~~~~~~~~~~~~~~~~ 176 (395) T protein:vir:98 130 SRV-------QGQTYEK------------------------TFTFD--QVIYLKNDNSDLMSKVESLWEEYGELLGHVIN 176 (395) T ss_pred eee-------cCceeee------------------------EecCc--cEEEecCCCCCccccccchhhhHHHHHHHHHH Confidence 000 0000000 00001 144443211 12222332222221 111 Q ss_pred HHH-HHHHHHHHHhcCCeeEEecCCc---ccchh----HHH----hhh--hCceeeccCCCceeeEecc------CChHH Q lcl|NC_021326. 227 RRL-SDLSNTFKDSNELTYVLTNYDD---QELPE----FKR----LLR--YYGAIKVSDNGGVDTIQVE------VPVEN 286 (445) Q Consensus 227 ~~~-s~~~~~~~~~~~~~l~~~g~~~---~~~~~----~~~----~~~--~~~~~~~~~~~~~~~l~~~------~~~~~ 286 (445) ... ..-......+..+...+.+... +...+ ... ... ..+++.++.+.+.+.+... ..... T Consensus 177 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~~q 256 (395) T protein:vir:98 177 NQKIANQIRFTMIPPKDKVRERAQENSDGGRQSKSDKDFFKRTVEKIRTESVVGIPVTANTNYEEYGSKNTGAVKSYVDD 256 (395) T ss_pred HHHHHHHHHHhhccccccccccccccCCcHHHHHHHHHHHHHHHhhhhcCCcceeecCCCceeEecccccccccChhHHH Confidence 111 1111111222233333222211 11111 111 111 1113334444444444321 11224 Q ss_pred HHHHHHHHHHHHHHHhCccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcceEEEE Q lcl|NC_021326. 287 SKKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDIS 366 (445) Q Consensus 287 ~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~i~v~ 366 (445) +.+..+.....|+..-++|........++.+...+.+....|.-.+.. +...+.+ .++....-...+.+. T Consensus 257 ~~e~~~~~~~~Ia~~fgVP~~~l~~~~sn~e~~~~~f~~~tl~P~~~~----ie~~l~~------kll~~~~~~~g~~f~ 326 (395) T protein:vir:98 257 IKKLKDQYMAEFAEMLGIPISLLHGDIADNQKNYELLLEGPIESLITN----IVDGLEY------AIFDKSETLQGSFIK 326 (395) T ss_pred HHHHHHHHHHHHHHHhCCCHHHhcCCcccHHHHHHHHHHHHHHHHHHH----HHHHHHH------hcCChhhhcCcceee Confidence 444455555667777777765443211122221121111111111111 1111111 011111112234567 Q ss_pred eCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHHHHHHHHHHHHHHHHhhhccccCCCCCCCCCCCCCCc Q lcl|NC_021326. 367 FNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFV--EDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQKERSNDK 442 (445) Q Consensus 367 f~~~~p~d~~~~~~~~~~~--~g~~s~et~l~~l~~~--~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 442 (445) |...+..|..+.++.+.++ .|+++.-.++++++.- ++. ..+++- .. .++.. .++.++++++ T Consensus 327 ~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~Pi~~~--~gD~~~------~~--~n~~~-----~~~~gge~~~ 391 (395) T protein:vir:98 327 VTGLKNYDLFSISNQADKLISSGFVFIDEVREEIGLPELPDG--LGKVLY------MT--KNYES-----VLERGGEVDE 391 (395) T ss_pred ehhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC--CCceee------ec--cccee-----cccccCCCCC Confidence 7788888999999999887 4899999999887542 221 011000 00 00000 0111222233 Q ss_pred CCC Q lcl|NC_021326. 443 QSE 445 (445) Q Consensus 443 ~~~ 445 (445) ++| T Consensus 392 ~~~ 394 (395) T protein:vir:98 392 EVE 394 (395) T ss_pred CCC Confidence 333 No 273 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=26.70 E-value=1.9 Score=19.03 Aligned_cols=294 Identities=9% Similarity=0.085 Sum_probs=100.1 Q ss_pred HHHHHhcCCCccccccccccccccccccccccccccch--HH------HHHHHHHhhhhcc----Ceeecc------Cch Q lcl|NC_021326. 18 IGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNF--HA------NLVDQKVSYIVGK----PIAFKH------TDD 79 (445) Q Consensus 18 ~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~--~~------~iv~~~~~~l~g~----~~~~~~------~d~ 79 (445) +-++-+...+.-...+...... ....+. .+..| +. .+.+-.--+..|+ |+.+.. .+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~ 75 (351) T protein:vir:79 1 MSKRRSRAPRTFAAAPNPSAGS--AAPARA---EVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRAST 75 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhh--ccccee---EEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhH Confidence 0001111110000000000000 000000 00000 00 0111000011122 121110 000 Q ss_pred HHHHHHH--------HHhccC-H-HHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEEcCCCCCceEEEE Q lcl|NC_021326. 80 EVIKRID--------EVLGNR-F-DDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFI 148 (445) Q Consensus 80 ~~~~~l~--------~~~~n~-~-~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~~~~~~v 148 (445) -....|. .+.-|. + ...+.+++.+.+.+|.+|+.+..+..|++ .+..++|..+.+..+.+ T Consensus 76 ~h~~~l~~k~n~l~~~~~Pnp~~t~~~f~~~v~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~~~~~~--------- 146 (351) T protein:vir:79 76 HHSSALFFKANVLASTFRPHRWLSRHAFERWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFS--------- 146 (351) T ss_pred hhhhhhhhhhhHHhhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCCEEEEEEeCCcceeeeecCC--------- Confidence 0000111 111122 1 12246677888999999999999888875 46677776654332221 Q ss_pred EEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCcCccHHHHHHHHH Q lcl|NC_021326. 149 RMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFMYKTLID 223 (445) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g~s~~~~v~~lid 223 (445) +++....... . +.. .-+ -|+++++ ...|.|.+......+. T Consensus 147 ~~~~~~~~g~--------~--~~~-----------------------~~~--eIihir~~~~~~~~yGl~~~~~a~~si~ 191 (351) T protein:vir:79 147 GFVYVNGWQE--------R--HEF-----------------------EPD--SVFQLVRPDINQEVYGLPEYLSSLHSAW 191 (351) T ss_pred eEEEEecCce--------E--EEE-----------------------cCc--cEEEeCCCCCCCCcccccHHHHHHHHHH Confidence 0111100000 0 000 001 1344432 2247777755444333 Q ss_pred HHHHHHHHH-HHHHHHhcCCe--eEEecCC--cccchhHHHhhhh-------CceeeccC---CCceeeEeccC--ChHH Q lcl|NC_021326. 224 AYNRRLSDL-SNTFKDSNELT--YVLTNYD--DQELPEFKRLLRY-------YGAIKVSD---NGGVDTIQVEV--PVEN 286 (445) Q Consensus 224 ~~~~~~s~~-~~~~~~~~~~~--l~~~g~~--~~~~~~~~~~~~~-------~~~~~~~~---~~~~~~l~~~~--~~~~ 286 (445) .-+ ....+ ..-+...+.|= +.++|.. .+..+..+..++. .+.+.+.. +.++++.-... .... T Consensus 192 l~~-~a~~~~~~~f~NGa~pg~il~~~~~~ls~e~~~~lk~~~~~~~G~~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~e 270 (351) T protein:vir:79 192 LNE-SSTLFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDE 270 (351) T ss_pred HHH-HHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHH Confidence 222 22222 22234455554 3445532 2222222222321 12333322 23345544332 3445 Q ss_pred HHHHHHHHHHHHHHHhCccccccccccC------cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Q lcl|NC_021326. 287 SKKYLDELYQKIMLFGQAVDFSSDKFGS------APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEH 360 (445) Q Consensus 287 ~~~~i~~l~~~i~~~s~~p~~~~~~~~~------~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 360 (445) +.+..+..++.|...-++|....+-..+ +....+..+....+.-.+. .|+++ ...++.. T Consensus 271 f~e~k~~s~~eI~~a~~VPp~llGi~~~~t~~~~n~e~~~~~f~~~~l~Pl~~--------~ie~l----n~~lg~~--- 335 (351) T protein:vir:79 271 FFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQA--------RFAEL----NDWLGDE--- 335 (351) T ss_pred HHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHH--------HHHHH----HhhcCcc--- Confidence 6677777788888888888644332111 1111112111111111111 11111 1122221 Q ss_pred ceEEEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_021326. 361 KDVDISFNYNKVANTELQVQTAQQS 385 (445) Q Consensus 361 ~~i~v~f~~~~p~d~~~~~~~~~~~ 385 (445) -+.|++..-..-.. ++ T Consensus 336 ---~~~F~~~~llr~d~------~a 351 (351) T protein:vir:79 336 ---VVTFDDYEIPPAPV------AA 351 (351) T ss_pred ---eeeeChhhhccccc------cC Confidence 15665432111111 11 No 274 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=22.08 E-value=2.5 Score=18.40 Aligned_cols=316 Identities=8% Similarity=0.044 Sum_probs=107.5 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCc------ccccc-ccccccccccccccc----cccccchHHHHHHHHHhhhhccCee Q lcl|NC_021326. 5 YIKQHLEKLPEISIGQEYYEQRPD------IVKEP-KPVDATGAVDPLKPD----DRMITNFHANLVDQKVSYIVGKPIA 73 (445) Q Consensus 5 ~i~~~~~~~~~~~~~~~yy~G~~~------i~~~~-~~~~~~~~~~~~~~~----~ri~~n~~~~iv~~~~~~l~g~~~~ 73 (445) |-++. ++.. .+-..+... ..... .............+. .+-..+|.... ..+..+..|+. T Consensus 1 m~~~~----~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~fg~p~~~~~~~~~~~~~~~~---~~~~~~~~pi~ 71 (368) T protein:vir:79 1 MSRNK----TRRA--ARAASAHVRTANTDAPTEHHTDRAAQAEVFSFGDPVEVLDRRELLDYVECM---RMGQWYEPPMP 71 (368) T ss_pred CCccc----cccc--hhccCcccccccccCcchhhccccCceEEEEcCCceeecchhhHHHHHHHH---hccchhccCcC Confidence 00000 0000 000000000 00000 000000000000000 00000111110 00111112222 Q ss_pred ecc------C----chHH---HHHHHHHhc-cC-H-HHHHHHHHHHHHhcCeEEEEEEECCCCcE-EEEEEccceeEEEE Q lcl|NC_021326. 74 FKH------T----DDEV---IKRIDEVLG-NR-F-DDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIW 136 (445) Q Consensus 74 ~~~------~----d~~~---~~~l~~~~~-n~-~-~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~ 136 (445) +.+ . .... ...+..... |. + ...+.+++.+.+.+|.+|+.+..+..|++ .+..++|..+-... T Consensus 72 ~~~la~~~~~~~~h~~~~~~~~n~l~l~~~Pn~~~t~~~f~~l~~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~~~ 151 (368) T protein:vir:79 72 WDGLARSFRAAAHHSSAVYVKRNILVSTFIPHPLLSRATFERLVLDWQVFGNAYLERRENVLGGTIRLDTPLAKYVRRGL 151 (368) T ss_pred HHHHHHHHhhccccchhhhhhcchhhhhcCCCcCCCHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEeCcccceeec Confidence 211 0 0000 001111111 22 1 12245677888999999999999888875 46667776653321 Q ss_pred cCCCCCceEEEEEEEeeecceeEEEEecceEEEEEEecceeeecccccccccccccccccccccceEEecC-----CCCc Q lcl|NC_021326. 137 TDKEHEELEAFIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLE 211 (445) Q Consensus 137 d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n-----~~~g 211 (445) +. + . +|....... .+.. +-+. |+++++ ...| T Consensus 152 ~~---~-~-----~~~~~~~~~----------~~~~-----------------------~~~d--Iihir~~~~~~~~yG 187 (368) T protein:vir:79 152 DL---N-T-----YFFVQNWQQ----------PYTF-----------------------AAGS--VFHLQEPDINQEVYG 187 (368) T ss_pred cC---C-E-----EEEEecCCe----------EEEE-----------------------cccc--EEEecCCCCCCCccc Confidence 11 1 0 110000000 0000 0011 344432 2357 Q ss_pred CccHHHHHHHHHHHHHHHHHHHHHHHHhcCCeeE--EecCC--cccchhHHHhhhh-------CceeeccC---CCceee Q lcl|NC_021326. 212 ISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYV--LTNYD--DQELPEFKRLLRY-------YGAIKVSD---NGGVDT 277 (445) Q Consensus 212 ~s~~~~v~~lid~~~~~~s~~~~~~~~~~~~~l~--~~g~~--~~~~~~~~~~~~~-------~~~~~~~~---~~~~~~ 277 (445) .|.+......++.-+.+..-...-++..+.|-.+ ++|.. .+..+..+..++. .+++.+.. ++++++ T Consensus 188 lsp~~~a~~si~l~~aa~~~~~~~~~NGa~~~gil~~~~~~l~~e~~~~lk~~~~~~~G~~N~g~~~vl~~~g~~~g~~~ 267 (368) T protein:vir:79 188 LPEYLSALNATWLNESATLFRRRYYKNGSHAGFILYMTDAAQKQEDVDTLREAMKSAKGPGNFRNLFMYAPNGKKDGIQL 267 (368) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCCcccCceeEecCCCCccceeE Confidence 7877665555544333322233334555556544 44532 2222222222321 12333322 344555 Q ss_pred EeccC--ChHHHHHHHHHHHHHHHHHhCccccccccccCcchH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021326. 278 IQVEV--PVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAPSG-VALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF 354 (445) Q Consensus 278 l~~~~--~~~~~~~~i~~l~~~i~~~s~~p~~~~~~~~~~~Sg-~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~~~~ 354 (445) ..... ....+.+..+...+.|...-++|....+-..+++++ ..++... ...+...|.-++..+.++. T Consensus 268 ~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~e~~~----------~~f~~~~l~Pl~~~ie~ln 337 (368) T protein:vir:79 268 LPVSEVAAKDEFWNIKNVTRDDQLAAHRVPPQLMGIIPNNTGGFGDVEKAA----------MVFARNEVKPLQDRLLAIN 337 (368) T ss_pred EEcCCCHHHHHHHHHHHHhHHHHHHHhCCCHHHccccCCCCCccccHHHHH----------HHHHHHHHHHHHHHHHHHH Confidence 54443 334566667777788888888886544322222211 1111110 1111122222222222221 Q ss_pred ccCCCcceEEEEeCCCC--CCCHHHHHHHHHHHh Q lcl|NC_021326. 355 DIKGEHKDVDISFNYNK--VANTELQVQTAQQSM 386 (445) Q Consensus 355 ~~~~~~~~i~v~f~~~~--p~d~~~~~~~~~~~~ 386 (445) ..-+. . .+.|++.. ..|....++...+.+ T Consensus 338 ~~l~~-e--~~rF~~~~l~~~D~~a~a~~~~rsa 368 (368) T protein:vir:79 338 DWIGD-E--VVRFAPYALGGHDQPAAAPGGQRSA 368 (368) T ss_pred hccCc-c--eeeechhHhhcccccccCCcccccC Confidence 11111 0 24454321 222222332222222 No 275 >protein:vir:101418 Length: 569 # NCBI annotation: Prt # Family: family:all:9458 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006480;genbank:gi:46401636;genbank:GeneID:2777482 Probab=21.33 E-value=2.6 Score=18.29 Aligned_cols=402 Identities=13% Similarity=0.127 Sum_probs=139.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhhccCeeecc---- Q lcl|NC_021326. 1 MIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH---- 76 (445) Q Consensus 1 ~l~~~i~~~~~~~~~~~~~~~yy~G~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv~~~~~~l~g~~~~~~~---- 76 (445) +|-...+--+.|+.+|.+|... -|...|- .-.+|++.. ..=-+.. .|.-+-+++ T Consensus 76 ~~~~~~~~pr~R~qiY~~~eeM-~~~p~Ia----------------~AlniHVta-ALggde~----TGd~vfI~p~~~~ 133 (569) T protein:vir:10 76 FIFDEVQLPEDRLQRYPLLEEM-AVYSTIA----------------TALNIHITH-ALSFDKK----TGQTFSIVPVHNG 133 (569) T ss_pred HHhhhccCchhHHHHHHHHHHH-hcCchhh----------------hhhhhhhhe-eeccccc----ccceEEEEeecCC Confidence 3333333334455555444322 1221110 000011000 0000111 223333321 Q ss_pred --CchHHHHHHHHHhccC----HHHHHHHHHHHHHhcCeEEEEEEECCC-CcEEE---EEEccceeEEEEcCCCCCceEE Q lcl|NC_021326. 77 --TDDEVIKRIDEVLGNR----FDDKLHSVLTGASNKGIEWLHPYLDEE-GEFKL---FRVPAEQGIPIWTDKEHEELEA 146 (445) Q Consensus 77 --~d~~~~~~l~~~~~n~----~~~~~~~~~~~~~~~G~~~~~v~~d~~-g~~~i---~~~~p~~~~~v~d~~~~~~~~~ 146 (445) .+.+..+.+..=++++ ++.....++.++.+||++|..+|.++. |-..+ .+.-|.-+-|.+. .++... T Consensus 134 ~~a~~daakai~~el~~dl~~~iNr~~~~lA~~~~aFGdsYaRiY~~~~~GV~dl~~s~yt~PsfIqpFE~---g~~tvG 210 (569) T protein:vir:10 134 NDSDYDAAQALCGELMNDIGRTINKEVAGWAFIMSVFGVAYVRPYAKEGIGITSFECSYYTLPSFIKEFEV---SGNLAG 210 (569) T ss_pred CCCcchHHHHHHHHHHHHHHHHHHHHhhHHHHHHHhhhhhheeeeccCCceeEEEEecccccccccchhhh---cCceEE Confidence 2333333443333333 466778899999999999999999864 32222 2223333333222 345555 Q ss_pred EEEEEeeecceeEEEEecceEEEEEEecceeeecccccc-cccccccccccccccceEEecCCCCcCccHHH----HHHH Q lcl|NC_021326. 147 FIRMYKLENETKVEYWDKITVNYYVYENGSLIPDYSNNL-ENSKTHFSTGSWGKIPFIPFKNNDLEISDIFM----YKTL 221 (445) Q Consensus 147 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~g~iPvv~~~n~~~g~s~~~~----v~~l 221 (445) |.-.|..+...+...-++.....++.-....+....... +........+.-.+.|+-+ ...|.|-++. -..| T Consensus 211 F~~~~~~~~~~ti~~l~p~qm~rmKmPrm~~i~q~~~v~~g~~~~~L~~d~~~~~Pi~p---sn~GgSFL~~ae~pf~~l 287 (569) T protein:vir:10 211 FSGDYLKDASGKMVFADPWAIIPMKIPYWRPKSNLMPVHTGHKAYSLLDNPEERTPIET---QNYGTSLLEYAYEPYMNL 287 (569) T ss_pred eecccCCccccceeeechhhhhhhcccceeeccccchhhhhhhheeecccccccccccc---hhhhhHHHHHHHhHHHHH Confidence 555554443333332111111111100000000000000 0000001122223445432 2345554443 3344 Q ss_pred HHHHHHHHHHHHHHHHHhcCCeeEEecCCcccchhHHHhh---------------hhCc--------eeeccCCCc---- Q lcl|NC_021326. 222 IDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLL---------------RYYG--------AIKVSDNGG---- 274 (445) Q Consensus 222 id~~~~~~s~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~---------------~~~~--------~~~~~~~~~---- 274 (445) +-++.-.-+...+....-+.-.+-..|++.....+..+.+ .... ++-+-+++. T Consensus 288 ~~Al~sL~~qri~dSv~~~~Itlnm~gM~p~qr~~y~r~lt~~LKr~~d~ie~a~~gg~~~~~~~~H~LPv~gekq~~~t 367 (569) T protein:vir:10 288 RSAIRSLKATRFNASKIDRIIGLAMNSLDPVKAADYSRTITQTLKRAADLMERRARGANNMPTVTNTLLPIMGDGKGQMT 367 (569) T ss_pred HHHHHhccchhhHHHHHhHHhhccccCCCHHHHhHHHHHHHHHHHHHHHHHHHHhccCccccccceeeeeeecCcccccc Confidence 4444444344333333333333444555543333322211 1111 111112222 Q ss_pred eeeEeccCChHHHHHHHHHHHHHHHHHh---CccccccccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021326. 275 VDTIQVEVPVENSKKYLDELYQKIMLFG---QAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVF 351 (445) Q Consensus 275 ~~~l~~~~~~~~~~~~i~~l~~~i~~~s---~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~~~ 351 (445) ++..+.+.+..+++..+-.++.....+. .+...++.-.||---|-+++..-+.. .+..-.+....+++.+++.+=+ T Consensus 368 vDt~~~~A~~~gIEdvM~~~R~LagaLGlD~SMlGwAD~LsGGLGeGG~frtSaQaa-~RS~~iRqa~~e~in~iidiH~ 446 (569) T protein:vir:10 368 IDTQTIQADINGIEDILTYMRQLAAALGLDYTLLGWADQMSGGLGEGGFLRTAIQAA-MRASWIQQGVEEFIQRAIDIHL 446 (569) T ss_pred ccccccccCcccHHHHHHHHHHHHhhhccchhHhhHHHHhcccccccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHh Confidence 1222233344455555554544433332 12222222223323455555432221 2333444555556666554433 Q ss_pred H--Hhcc-CCCcceEEEEeCCCCCCCHHHHHHHHHH-----------Hhcc-----CC-hHHHHHhC-C---CCCCHHHH Q lcl|NC_021326. 352 E--HFDI-KGEHKDVDISFNYNKVANTELQVQTAQQ-----------SMGI-----VS-HETVLENH-P---FVEDLQAE 407 (445) Q Consensus 352 ~--~~~~-~~~~~~i~v~f~~~~p~d~~~~~~~~~~-----------~~g~-----~s-~et~l~~l-~---~~~d~~~E 407 (445) . +..+ ..+.....|+|.-....-..|..++... +.++ +- .++++..+ . ..+. .- T Consensus 447 ~fKYgevf~~~drP~~V~F~s~~tAl~~E~~~n~~~raN~a~i~~Q~la~l~e~n~Lg~de~~m~y~l~d~~~~De--~~ 524 (569) T protein:vir:10 447 AFKYGKVYPEGDRPYKIEFHSVNTALQQEHNDNRDSQANYATIVTQILDAVSNNSVLANSDAFKRYLFSDVLEIDE--KI 524 (569) T ss_pred hhhcCcccCCCCcceEEEeccchHHHHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccHHHHHHHHHHHhhcch--hH Confidence 2 2222 1233456788876554433333222111 1111 00 12222211 0 1110 00 Q ss_pred HHHHHHHH-------------------HHHHHhhhccccCCCCCC Q lcl|NC_021326. 408 LERIEQEQ-------------------MEYNKQLPNLDDGGADGA 433 (445) Q Consensus 408 ~~ri~~E~-------------------~~~~~~~~~~~~~~~~~~ 433 (445) .+++-+|- ++..+.+......+.+.. T Consensus 525 ~e~l~ae~~akp~DEe~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 569 (569) T protein:vir:10 525 SEALVNELKAKSEDDDHLMDSIIKTPPQELAQILESVFKEGNDND 569 (569) T ss_pred HHHHHhhcCCCcchhHHHHHHHhcCChHHHHHHHHHHhhccCCCC Confidence 11222221 111111111111111111 Done!