Query lcl|NC_021072.1_cdsid_YP_007877943.1 [gene=CPRG_00180] [protein=portal protein] [protein_id=YP_007877943.1] [location=complement(124188..125789)] Match_columns 533 No_of_seqs 77 out of 89 Neff 4.2 Searched_HMMs 1612 Date Thu Nov 7 16:54:36 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_179 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_179_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:104500 Length: 537 100.0 2E-282 2E-285 1564.5 47.3 529 1-533 1-537 (537) 2 protein:vir:103177 Length: 533 100.0 1E-281 8E-285 1560.5 45.7 530 2-533 1-533 (533) 3 protein:vir:104892 Length: 558 100.0 1E-274 9E-278 1521.8 44.9 532 2-533 1-558 (558) 4 protein:vir:106999 Length: 564 100.0 1E-271 8E-275 1505.7 43.7 530 2-533 1-557 (564) 5 protein:vir:81017 Length: 521 100.0 7E-267 4E-270 1479.9 43.3 495 1-503 7-521 (521) 6 protein:vir:6896 Length: 523 # 100.0 2E-266 1E-269 1477.7 41.5 496 1-504 4-523 (523) 7 protein:vir:108049 Length: 524 100.0 5E-266 3E-269 1474.9 41.9 496 1-503 1-524 (524) 8 protein:vir:106282 Length: 521 100.0 2E-265 1E-268 1471.3 41.7 496 1-503 4-521 (521) 9 protein:vir:101189 Length: 516 100.0 8E-265 5E-268 1468.3 42.5 493 1-502 2-516 (516) 10 protein:vir:101806 Length: 516 100.0 8E-265 5E-268 1468.3 42.5 493 1-502 2-516 (516) 11 protein:vir:6596 Length: 521 # 100.0 1E-264 9E-268 1467.1 43.7 495 1-503 7-521 (521) 12 protein:vir:100598 Length: 516 100.0 3E-264 2E-267 1465.7 42.6 493 1-502 2-516 (516) 13 protein:vir:103458 Length: 524 100.0 6E-264 4E-267 1463.6 42.2 495 1-504 1-524 (524) 14 protein:vir:7208 Length: 524 # 100.0 6E-264 4E-267 1463.5 42.2 495 1-504 1-524 (524) 15 protein:vir:98265 Length: 524 100.0 2E-260 1E-263 1444.4 43.2 495 1-503 1-524 (524) 16 protein:vir:5665 Length: 511 # 100.0 4E-258 2E-261 1431.8 42.2 484 6-500 1-511 (511) 17 protein:vir:5839 Length: 533 # 100.0 2E-225 1E-228 1252.5 36.1 477 1-532 19-533 (533) 18 protein:vir:79538 Length: 502 99.5 1.3E-12 8.3E-16 85.7 33.7 463 1-519 1-502 (502) 19 protein:vir:96068 Length: 765 99.4 3.2E-12 2E-15 83.6 28.5 457 1-533 39-585 (765) 20 protein:vir:107742 Length: 537 99.4 4.2E-12 2.6E-15 82.9 29.0 452 1-530 20-537 (537) 21 protein:vir:94049 Length: 532 99.4 1.3E-11 8.3E-15 80.2 29.2 443 1-525 23-532 (532) 22 protein:vir:99563 Length: 862 99.3 4.9E-12 3.1E-15 82.6 26.0 460 1-533 39-585 (862) 23 protein:vir:10321 Length: 495 99.3 2.1E-10 1.3E-13 73.6 32.7 466 1-520 1-495 (495) 24 protein:vir:95542 Length: 548 99.3 3.1E-10 1.9E-13 72.7 34.7 465 13-533 1-516 (548) 25 protein:vir:5249 Length: 437 # 99.2 5.6E-10 3.5E-13 71.3 30.4 419 13-532 1-437 (437) 26 protein:vir:107662 Length: 427 99.1 5.3E-10 3.3E-13 71.4 26.2 400 23-520 1-427 (427) 27 protein:vir:104338 Length: 422 99.0 2.9E-09 1.8E-12 67.4 26.8 392 25-513 1-422 (422) 28 protein:vir:96738 Length: 505 99.0 5.2E-09 3.2E-12 66.0 34.1 456 1-520 1-505 (505) 29 protein:vir:101418 Length: 569 99.0 8.3E-10 5.1E-13 70.4 22.9 473 1-519 11-569 (569) 30 protein:vir:102727 Length: 945 98.9 1.7E-09 1.1E-12 68.6 21.0 442 1-533 55-544 (945) 31 protein:vir:6382 Length: 553 # 98.9 2.5E-08 1.6E-11 62.3 34.3 480 10-530 1-553 (553) 32 protein:vir:80644 Length: 551 98.8 3E-08 1.9E-11 61.8 32.4 454 1-533 24-545 (551) 33 protein:vir:80796 Length: 574 98.8 3.2E-08 2E-11 61.7 26.1 461 1-533 1-545 (574) 34 protein:vir:80040 Length: 461 98.8 3.2E-08 2E-11 61.7 25.7 422 1-522 1-461 (461) 35 protein:vir:3420 Length: 533 # 98.8 6.1E-08 3.8E-11 60.1 34.4 479 1-522 1-533 (533) 36 protein:vir:105782 Length: 449 98.7 3.8E-08 2.4E-11 61.3 22.8 407 1-527 1-449 (449) 37 protein:vir:78227 Length: 480 98.7 1E-07 6.2E-11 59.0 24.1 437 36-529 1-480 (480) 38 protein:vir:389 Length: 530 # 98.7 1.1E-07 7E-11 58.7 35.3 471 1-522 1-530 (530) 39 protein:vir:102080 Length: 429 98.6 1.9E-07 1.2E-10 57.4 23.9 414 1-519 1-429 (429) 40 protein:vir:7853 Length: 518 # 98.6 2.7E-07 1.7E-10 56.6 27.8 429 14-533 1-468 (518) 41 protein:vir:63755 Length: 547 98.5 3.7E-07 2.3E-10 55.8 29.6 456 1-533 1-541 (547) 42 protein:vir:7768 Length: 484 # 98.5 3.7E-07 2.3E-10 55.9 22.6 439 1-531 1-484 (484) 43 protein:vir:99312 Length: 563 98.5 4.6E-07 2.8E-10 55.3 26.8 464 2-533 1-561 (563) 44 protein:vir:95599 Length: 563 98.5 4.6E-07 2.8E-10 55.3 26.8 464 2-533 1-561 (563) 45 protein:vir:96579 Length: 576 98.5 4.7E-07 2.9E-10 55.3 30.2 457 1-533 27-573 (576) 46 protein:vir:2341 Length: 488 # 98.5 5.1E-07 3.1E-10 55.1 27.6 440 22-523 1-488 (488) 47 protein:vir:4194 Length: 540 # 98.5 5.2E-07 3.2E-10 55.0 23.0 436 1-533 6-471 (540) 48 protein:vir:3153 Length: 467 # 98.5 5.3E-07 3.3E-10 55.0 27.9 410 60-533 1-464 (467) 49 protein:vir:78537 Length: 480 98.5 5.4E-07 3.4E-10 54.9 25.1 439 36-529 1-480 (480) 50 protein:vir:96980 Length: 409 98.5 5.6E-07 3.4E-10 54.9 22.7 396 1-515 1-409 (409) 51 protein:vir:79772 Length: 648 98.4 7.5E-07 4.7E-10 54.2 29.8 446 1-533 12-521 (648) 52 protein:vir:3028 Length: 500 # 98.4 7.7E-07 4.8E-10 54.1 24.0 429 1-505 14-500 (500) 53 protein:vir:9815 Length: 500 # 98.4 7.7E-07 4.8E-10 54.1 24.0 429 1-505 14-500 (500) 54 protein:vir:78907 Length: 518 98.4 1.1E-06 6.6E-10 53.3 31.0 424 1-508 1-518 (518) 55 protein:vir:98883 Length: 517 98.3 1.4E-06 8.7E-10 52.7 23.3 448 1-510 14-517 (517) 56 protein:vir:93610 Length: 454 98.3 1.6E-06 9.7E-10 52.4 26.6 428 5-533 1-448 (454) 57 protein:vir:98444 Length: 434 98.3 1.7E-06 1E-09 52.2 29.3 403 39-515 1-434 (434) 58 protein:vir:79647 Length: 435 98.2 2.1E-06 1.3E-09 51.7 29.0 403 1-529 5-435 (435) 59 protein:vir:94426 Length: 409 98.2 2.1E-06 1.3E-09 51.7 24.4 397 1-515 1-409 (409) 60 protein:vir:107605 Length: 432 98.2 2.3E-06 1.4E-09 51.5 23.7 416 1-527 1-432 (432) 61 protein:vir:102855 Length: 432 98.2 2.3E-06 1.4E-09 51.5 23.7 416 1-527 1-432 (432) 62 protein:vir:105002 Length: 432 98.2 2.3E-06 1.4E-09 51.5 23.7 416 1-527 1-432 (432) 63 protein:vir:1326 Length: 457 # 98.2 2.7E-06 1.6E-09 51.2 27.8 431 1-529 1-457 (457) 64 protein:vir:79703 Length: 505 98.2 2.7E-06 1.7E-09 51.1 22.5 413 1-509 3-505 (505) 65 protein:vir:101648 Length: 518 98.2 3E-06 1.8E-09 50.9 27.7 434 14-533 1-468 (518) 66 protein:vir:4223 Length: 486 # 98.2 3.2E-06 2E-09 50.7 24.4 446 19-528 1-486 (486) 67 protein:vir:1587 Length: 508 # 98.2 3.3E-06 2E-09 50.7 22.9 434 1-509 14-508 (508) 68 protein:vir:2427 Length: 485 # 98.1 3.8E-06 2.4E-09 50.3 25.5 449 19-526 1-485 (485) 69 protein:vir:3843 Length: 397 # 98.1 4.2E-06 2.6E-09 50.0 25.3 388 1-529 1-397 (397) 70 protein:vir:1023 Length: 392 # 98.1 4.6E-06 2.8E-09 49.9 26.3 385 1-522 1-392 (392) 71 protein:vir:3989 Length: 392 # 98.1 4.6E-06 2.8E-09 49.9 26.3 385 1-522 1-392 (392) 72 protein:vir:106639 Length: 481 98.1 5E-06 3.1E-09 49.6 20.5 431 1-517 1-481 (481) 73 protein:vir:8418 Length: 409 # 98.0 6.4E-06 4E-09 49.1 26.2 399 1-532 1-409 (409) 74 protein:vir:99916 Length: 504 98.0 7E-06 4.3E-09 48.9 27.7 450 31-531 1-504 (504) 75 protein:vir:2683 Length: 412 # 98.0 7.1E-06 4.4E-09 48.8 24.2 398 1-515 1-412 (412) 76 protein:vir:7407 Length: 392 # 98.0 8.7E-06 5.4E-09 48.3 24.7 384 1-522 1-392 (392) 77 protein:vir:960 Length: 413 # 97.9 9.8E-06 6.1E-09 48.0 27.9 395 1-528 11-413 (413) 78 protein:vir:93943 Length: 409 97.9 1.1E-05 6.6E-09 47.8 22.3 393 1-515 1-409 (409) 79 protein:vir:2500 Length: 501 # 97.9 1.1E-05 7E-09 47.7 27.2 445 1-522 1-501 (501) 80 protein:vir:9922 Length: 489 # 97.9 1.4E-05 8.6E-09 47.2 29.0 445 16-521 1-489 (489) 81 protein:vir:38 Length: 496 # N 97.8 1.6E-05 9.6E-09 46.9 31.3 427 5-510 1-496 (496) 82 protein:vir:6240 Length: 457 # 97.8 1.6E-05 9.9E-09 46.9 25.8 429 1-533 3-456 (457) 83 protein:vir:1380 Length: 422 # 97.8 1.9E-05 1.2E-08 46.5 23.2 407 1-532 1-422 (422) 84 protein:vir:104082 Length: 485 97.8 2.2E-05 1.3E-08 46.2 25.1 438 28-530 1-485 (485) 85 protein:vir:9871 Length: 429 # 97.8 2.2E-05 1.4E-08 46.1 27.8 392 42-514 1-429 (429) 86 protein:vir:105461 Length: 470 97.7 2.3E-05 1.4E-08 46.0 31.0 403 42-515 1-470 (470) 87 protein:vir:78641 Length: 278 97.7 2.3E-05 1.4E-08 46.0 21.3 266 85-432 1-278 (278) 88 protein:vir:105889 Length: 474 97.7 3.1E-05 1.9E-08 45.3 30.3 428 1-516 1-474 (474) 89 protein:vir:94101 Length: 474 97.7 3.1E-05 1.9E-08 45.3 30.3 428 1-516 1-474 (474) 90 protein:vir:99522 Length: 470 97.6 3.4E-05 2.1E-08 45.1 32.2 428 1-514 1-470 (470) 91 protein:vir:4828 Length: 382 # 97.6 3.8E-05 2.3E-08 44.8 25.6 372 1-519 1-382 (382) 92 protein:vir:4782 Length: 522 # 97.6 4.1E-05 2.6E-08 44.6 26.9 445 1-515 14-522 (522) 93 protein:vir:483 Length: 413 # 97.5 5.2E-05 3.2E-08 44.1 26.6 401 5-523 1-413 (413) 94 protein:vir:97060 Length: 432 97.5 5.5E-05 3.4E-08 43.9 24.3 406 1-529 2-432 (432) 95 protein:vir:79063 Length: 491 97.5 6E-05 3.7E-08 43.7 25.9 425 1-533 1-456 (491) 96 protein:vir:3964 Length: 453 # 97.4 6.6E-05 4.1E-08 43.5 28.6 417 14-516 1-453 (453) 97 protein:vir:80680 Length: 441 97.4 6.7E-05 4.2E-08 43.5 23.1 408 23-515 1-441 (441) 98 protein:vir:80959 Length: 499 97.4 6.9E-05 4.3E-08 43.4 33.1 423 5-509 1-499 (499) 99 protein:vir:4156 Length: 542 # 97.4 7.6E-05 4.7E-08 43.2 30.1 428 1-533 1-473 (542) 100 protein:vir:4454 Length: 414 # 97.4 7.7E-05 4.8E-08 43.1 27.5 404 1-523 1-414 (414) 101 protein:vir:107880 Length: 491 97.4 8E-05 5E-08 43.0 26.0 424 1-533 1-456 (491) 102 protein:vir:95806 Length: 440 97.4 8.4E-05 5.2E-08 42.9 30.1 401 42-512 1-440 (440) 103 protein:vir:4995 Length: 384 # 97.4 8.7E-05 5.4E-08 42.8 22.0 378 1-493 1-384 (384) 104 protein:vir:101541 Length: 694 97.3 9.9E-05 6.2E-08 42.5 24.8 443 1-533 36-587 (694) 105 protein:vir:5961 Length: 503 # 97.3 0.00011 6.8E-08 42.3 29.4 436 1-529 11-503 (503) 106 protein:vir:100150 Length: 437 97.2 0.00012 7.7E-08 42.0 22.6 419 1-530 1-437 (437) 107 protein:vir:99072 Length: 479 97.2 0.00013 7.9E-08 41.9 25.4 440 19-528 1-479 (479) 108 protein:vir:4898 Length: 502 # 97.2 0.00013 8E-08 41.9 24.5 464 1-530 2-502 (502) 109 protein:vir:102118 Length: 409 97.2 0.00014 8.7E-08 41.7 26.5 400 5-528 1-409 (409) 110 protein:vir:100249 Length: 431 97.1 0.00016 9.6E-08 41.5 27.4 411 1-524 1-431 (431) 111 protein:vir:1266 Length: 416 # 97.1 0.00016 1E-07 41.4 25.5 400 4-515 1-416 (416) 112 protein:vir:81152 Length: 411 97.1 0.00016 1E-07 41.4 25.6 399 1-528 1-411 (411) 113 protein:vir:3609 Length: 452 # 97.1 0.00019 1.2E-07 41.0 31.8 421 1-519 1-452 (452) 114 protein:vir:9408 Length: 441 # 97.0 0.00021 1.3E-07 40.8 24.9 405 1-530 1-441 (441) 115 protein:vir:79984 Length: 441 97.0 0.00021 1.3E-07 40.8 24.9 405 1-530 1-441 (441) 116 protein:vir:96494 Length: 501 97.0 0.00021 1.3E-07 40.7 27.5 454 1-531 1-501 (501) 117 protein:vir:96179 Length: 468 97.0 0.00023 1.4E-07 40.5 28.3 413 1-512 1-468 (468) 118 protein:vir:95113 Length: 474 97.0 0.00024 1.5E-07 40.4 29.0 417 7-522 1-474 (474) 119 protein:vir:95378 Length: 406 96.8 0.0003 1.9E-07 39.9 24.8 393 1-532 1-406 (406) 120 protein:vir:9306 Length: 511 # 96.8 0.00032 2E-07 39.8 31.2 440 7-531 1-511 (511) 121 protein:vir:10362 Length: 432 96.7 0.00037 2.3E-07 39.4 25.2 403 1-529 1-432 (432) 122 protein:vir:93747 Length: 472 96.7 0.00043 2.7E-07 39.0 31.2 416 24-532 1-472 (472) 123 protein:vir:3648 Length: 695 # 96.6 0.00049 3E-07 38.7 25.2 448 1-533 37-588 (695) 124 protein:vir:4337 Length: 434 # 96.5 0.00056 3.4E-07 38.4 24.2 412 1-532 1-434 (434) 125 protein:vir:105292 Length: 478 96.5 0.00058 3.6E-07 38.3 29.5 419 1-520 1-478 (478) 126 protein:vir:81218 Length: 423 96.3 0.00075 4.6E-07 37.7 25.3 406 1-528 1-423 (423) 127 protein:vir:4952 Length: 386 # 96.3 0.00077 4.8E-07 37.7 25.1 374 1-519 1-386 (386) 128 protein:vir:2732 Length: 501 # 96.2 0.00083 5.2E-07 37.5 25.8 455 1-530 1-501 (501) 129 protein:vir:78589 Length: 695 96.2 0.00087 5.4E-07 37.4 25.0 438 1-533 46-573 (695) 130 protein:vir:1236 Length: 483 # 96.2 0.0009 5.6E-07 37.3 30.0 435 1-532 1-483 (483) 131 protein:vir:81095 Length: 416 96.2 0.00092 5.7E-07 37.2 25.2 398 1-530 1-416 (416) 132 protein:vir:4598 Length: 416 # 96.2 0.00092 5.7E-07 37.2 25.2 398 1-530 1-416 (416) 133 protein:vir:7987 Length: 456 # 96.2 0.00093 5.8E-07 37.2 24.1 406 22-506 1-456 (456) 134 protein:vir:106571 Length: 499 96.1 0.00098 6.1E-07 37.1 30.1 435 6-531 1-499 (499) 135 protein:vir:80134 Length: 403 96.1 0.00099 6.1E-07 37.1 24.0 390 1-532 1-403 (403) 136 protein:vir:101647 Length: 460 96.1 0.001 6.4E-07 37.0 28.5 424 1-528 1-460 (460) 137 protein:vir:99781 Length: 511 96.1 0.0011 6.6E-07 36.9 28.0 415 1-530 63-511 (511) 138 protein:vir:9359 Length: 348 # 96.1 0.0011 6.6E-07 36.9 26.3 335 85-515 1-348 (348) 139 protein:vir:94546 Length: 506 95.9 0.0012 7.5E-07 36.6 26.0 436 4-524 1-506 (506) 140 protein:vir:94805 Length: 492 95.9 0.0012 7.7E-07 36.5 29.0 441 1-532 1-492 (492) 141 protein:vir:97336 Length: 492 95.9 0.0013 8.1E-07 36.4 28.0 435 1-532 1-492 (492) 142 protein:vir:80333 Length: 419 95.8 0.0014 8.5E-07 36.3 25.7 405 6-533 1-417 (419) 143 protein:vir:96240 Length: 511 95.8 0.0014 8.9E-07 36.2 30.6 440 7-532 1-511 (511) 144 protein:vir:4854 Length: 386 # 95.8 0.0015 9E-07 36.1 27.0 376 3-519 1-386 (386) 145 protein:vir:105064 Length: 421 95.7 0.0016 1E-06 35.8 25.9 408 5-530 1-421 (421) 146 protein:vir:5737 Length: 419 # 95.6 0.0017 1.1E-06 35.7 26.1 403 1-531 1-419 (419) 147 protein:vir:4509 Length: 424 # 95.5 0.002 1.3E-06 35.3 29.1 397 1-519 18-424 (424) 148 protein:vir:97447 Length: 474 95.3 0.0023 1.4E-06 35.0 29.4 416 1-532 1-474 (474) 149 protein:vir:94498 Length: 474 95.3 0.0023 1.4E-06 35.0 29.4 416 1-532 1-474 (474) 150 protein:vir:100882 Length: 383 95.3 0.0024 1.5E-06 35.0 26.5 374 1-529 1-383 (383) 151 protein:vir:102950 Length: 471 95.2 0.0025 1.6E-06 34.8 33.2 388 44-514 1-471 (471) 152 protein:vir:107112 Length: 478 95.0 0.0029 1.8E-06 34.5 29.6 420 1-517 1-478 (478) 153 protein:vir:79043 Length: 479 94.9 0.0033 2E-06 34.2 28.0 409 5-515 1-479 (479) 154 protein:vir:106716 Length: 698 94.8 0.0034 2.1E-06 34.1 25.5 438 1-533 46-576 (698) 155 protein:vir:103951 Length: 511 94.8 0.0035 2.2E-06 34.1 29.4 409 1-532 63-511 (511) 156 protein:vir:81072 Length: 432 94.7 0.0037 2.3E-06 33.9 25.2 411 1-532 1-432 (432) 157 protein:vir:78805 Length: 511 94.6 0.004 2.5E-06 33.8 30.4 438 7-531 1-511 (511) 158 protein:vir:96366 Length: 511 94.6 0.004 2.5E-06 33.8 30.4 438 7-531 1-511 (511) 159 protein:vir:96266 Length: 474 94.5 0.0044 2.7E-06 33.5 29.9 418 1-532 1-474 (474) 160 protein:vir:95899 Length: 474 94.5 0.0044 2.7E-06 33.5 29.9 418 1-532 1-474 (474) 161 protein:vir:103219 Length: 201 94.4 0.0047 2.9E-06 33.4 12.5 188 265-532 1-201 (201) 162 protein:vir:97171 Length: 512 94.3 0.0047 2.9E-06 33.4 28.9 412 1-531 63-512 (512) 163 protein:vir:96839 Length: 474 94.1 0.0054 3.3E-06 33.0 28.3 418 7-519 1-474 (474) 164 protein:vir:105819 Length: 456 94.0 0.0057 3.6E-06 32.9 24.7 412 23-502 1-456 (456) 165 protein:vir:102602 Length: 456 94.0 0.0057 3.6E-06 32.9 24.7 412 23-502 1-456 (456) 166 protein:vir:9568 Length: 410 # 93.8 0.0062 3.9E-06 32.7 27.6 387 28-492 1-410 (410) 167 protein:vir:98396 Length: 441 93.8 0.0064 4E-06 32.6 25.3 416 1-530 1-441 (441) 168 protein:vir:100187 Length: 385 93.2 0.0083 5.1E-06 32.0 27.0 374 1-513 1-385 (385) 169 protein:vir:9751 Length: 422 # 92.9 0.0097 6E-06 31.6 27.3 386 28-491 1-422 (422) 170 protein:vir:1884 Length: 424 # 91.8 0.014 8.8E-06 30.7 27.8 405 1-526 1-424 (424) 171 protein:vir:8100 Length: 466 # 91.5 0.016 9.8E-06 30.5 26.8 435 1-527 1-466 (466) 172 protein:vir:1431 Length: 419 # 91.2 0.017 1.1E-05 30.3 26.6 402 5-533 1-417 (419) 173 protein:vir:9702 Length: 406 # 91.1 0.018 1.1E-05 30.2 26.3 387 1-530 1-406 (406) 174 protein:vir:8317 Length: 409 # 90.4 0.021 1.3E-05 29.8 24.6 379 1-515 3-409 (409) 175 protein:vir:94742 Length: 409 89.6 0.025 1.6E-05 29.3 23.9 377 28-476 1-409 (409) 176 protein:vir:733 Length: 453 # 89.4 0.026 1.6E-05 29.2 28.9 414 1-517 4-453 (453) 177 protein:vir:189 Length: 424 # 89.1 0.029 1.8E-05 29.1 27.5 404 1-527 1-424 (424) 178 protein:vir:78083 Length: 537 88.7 0.031 1.9E-05 28.9 28.7 442 1-533 1-536 (537) 179 protein:vir:1082 Length: 359 # 88.3 0.033 2.1E-05 28.7 22.8 342 1-456 1-359 (359) 180 protein:vir:8184 Length: 474 # 88.2 0.033 2.1E-05 28.7 24.9 423 15-510 1-474 (474) 181 protein:vir:94666 Length: 723 85.6 0.052 3.2E-05 27.6 28.1 408 20-533 1-446 (723) 182 protein:vir:1661 Length: 378 # 81.3 0.087 5.4E-05 26.4 17.5 359 1-518 1-378 (378) 183 protein:vir:100691 Length: 535 78.8 0.11 6.9E-05 25.8 28.5 449 1-528 13-535 (535) 184 protein:vir:99853 Length: 488 78.5 0.11 7.1E-05 25.8 28.0 419 10-533 1-443 (488) 185 protein:vir:100328 Length: 346 70.6 0.21 0.00013 24.3 22.9 326 1-435 1-346 (346) 186 protein:vir:93867 Length: 378 61.6 0.35 0.00022 23.1 21.5 357 1-518 1-378 (378) 187 protein:vir:1634 Length: 409 # 59.2 0.4 0.00025 22.8 25.0 367 28-476 1-409 (409) 188 protein:vir:4698 Length: 251 # 56.6 0.45 0.00028 22.5 17.0 240 1-313 1-251 (251) 189 protein:vir:102330 Length: 451 54.8 0.49 0.00031 22.3 28.7 388 44-514 1-451 (451) 190 protein:vir:108215 Length: 469 50.8 0.6 0.00037 21.8 27.9 439 15-533 1-466 (469) 191 protein:vir:99452 Length: 651 45.3 0.77 0.00048 21.2 29.2 470 1-533 1-552 (651) 192 protein:vir:95254 Length: 488 38.7 1.1 0.00065 20.5 29.0 455 13-532 1-488 (488) 193 protein:vir:3780 Length: 345 # 35.5 1.2 0.00076 20.1 22.0 316 1-435 18-345 (345) 194 protein:vir:267 Length: 348 # 26.8 1.9 0.0012 19.0 22.8 336 1-451 1-348 (348) 195 protein:vir:94002 Length: 378 24.5 2.2 0.0013 18.7 22.0 360 1-518 1-378 (378) 196 protein:vir:3743 Length: 345 # 23.5 2.3 0.0014 18.6 25.6 317 1-443 1-345 (345) 197 protein:vir:104259 Length: 403 21.6 2.6 0.0016 18.3 24.6 393 6-519 1-403 (403) No 1 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=100.00 E-value=2.5e-282 Score=1564.45 Aligned_cols=529 Identities=84% Similarity=1.306 Sum_probs=512.8 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |..+||||+|++++++++++|++||+++||++++++|+|+|++++++++++|+++||++||+||+|||||+||+|||||| T Consensus 1 ~~~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~g~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVnea 80 (537) T protein:vir:10 1 MAQQLFGFSLQRAKKVPKGPSFVQKDSLDGSQPIVGGGYFGYSVDFDGTIRNDHELITRYREMVLNPECDSAVDDVVNET 80 (537) T ss_pred CccccccceeecccccccCCcccCCCcccccceeecccccccccccccccchHHHHHHHHHHHhhccchhhHHHHhhcce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcCh Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDP 160 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP 160 (533) ||||++++||+|+|+++++|++||++|++||++|+++|+|+++||++||+||||||+|||||||++||++||++||+||| T Consensus 81 iv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fhKiid~k~pk~GI~ELr~lDP 160 (537) T protein:vir:10 81 ICGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILRLLDFDNRAYEIFRRWYVDGRLFFHKVIDPKKPRQGLVELRYVDP 160 (537) T ss_pred eEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeeeCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCccchhHHHHHH Q lcl|NC_021072. 161 RKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIK 240 (533) Q Consensus 161 ~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK 240 (533) |+|++||++.+++++..++..+++.++.+..+||+|||+|+++++++++|||.|||+||||||+|||+++|+||||+||| T Consensus 161 r~i~~vR~i~~~~~~~~~~~~~~~~v~~~~~eyf~ynp~g~~~~~~~~vkI~~dAI~y~hSGl~d~n~~~i~syLhkAiK 240 (537) T protein:vir:10 161 RKIRKVTEYEAKRPEALRTQDLNQQLTQQSASYFLYNPKGLKNSTNQGMKIAPDSIAYCHSGIQDLNKNMVLSHLHKAIK 240 (537) T ss_pred ccceeeEeecccCCccceEEecceeeeecccceeeeccccccccCCCceeccHhheeeecccceeCCCCeeeeeehhhhH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhccccc Q lcl|NC_021072. 241 AVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRR 320 (533) Q Consensus 241 ~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRR 320 (533) ||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+||+|||||||||||||| T Consensus 241 p~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRR 320 (537) T protein:vir:10 241 AVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRR 320 (537) T ss_pred HHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhhhhhccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 321 EGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDL 400 (533) Q Consensus 321 eggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~ 400 (533) ||||||||||||||||||||+||+||++|||+|||||+|||+++++|++||++||||||+||+|||.|||+||+.+|+++ T Consensus 321 eGgrgTEItTLpGgqnlgem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~ 400 (537) T protein:vir:10 321 EGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFVDL 400 (537) T ss_pred CCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHH Q lcl|NC_021072. 401 LKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEI 480 (533) Q Consensus 401 Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~ 480 (533) ||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||||||++||||+||+|||+||+++ T Consensus 401 Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tDeeI~~~ 480 (537) T protein:vir:10 401 LKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQTESEIKEI 480 (537) T ss_pred HHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccCHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhhcCCCCCCCcccccCCCCCC--------CCCCCCccccccccCCccccchhcC Q lcl|NC_021072. 481 DKQIDSEREAGLIVDPMAEMDPAMDPGN--------APPADDMSAQEGPAVDAGDAKRGEF 533 (533) Q Consensus 481 ~kqi~~E~~~~~~~~p~~~~~~~~~~~~--------~~~~~d~~~~~~~~~~~~~~~~~~~ 533 (533) +|||++|+++|+|++|++.++.+++.++ ..|..|++..++ +.+.+.+|+ T Consensus 481 ~k~I~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~ 537 (537) T protein:vir:10 481 DKEIKQEIADGVIMDPQAMQAMEMGIGDEEPVPEGGEEPQTDPNSAVS----PADQKRGEL 537 (537) T ss_pred HHHHHHHhhCCCCCCcccccccccCCCCcccCCCCCCCcccCCccCCC----CCCccCCCC Confidence 9999999999999999998777765443 333444444433 344556777 No 2 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=100.00 E-value=1.3e-281 Score=1560.48 Aligned_cols=530 Identities=85% Similarity=1.320 Sum_probs=509.7 Q ss_pred CccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhccee Q lcl|NC_021072. 2 SNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNETI 81 (533) Q Consensus 2 ~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvneai 81 (533) -++||||+|++.++.++++||+||+++||++++.+|+|+|++++++++++|+++||++||+||+|||||+||+||||||| T Consensus 1 m~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~i~~~~~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneai 80 (533) T protein:vir:10 1 MSQLFGFSLERAKKAPKGPSFVQKDNLDGSQPVSGGGYYGYTVDFDGQVRNEYQLISRYREMVLQPECDSAVDDIVNETI 80 (533) T ss_pred CccccccccccccccccCCCCCCCCcccccceeecccccceeeecccccchHHHHHHHHHHHhhccchhhHHHHhhccee Confidence 57999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcChh Q lcl|NC_021072. 82 CGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPR 161 (533) Q Consensus 82 v~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP~ 161 (533) |||++++||+|+|+++++|++||++|++||++|+++|+|+++||++||+||||||+|||||||++||++||++||+|||| T Consensus 81 v~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~ELr~lDPr 160 (533) T protein:vir:10 81 CGNFDDVPVSVELSNLKVSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPDNPQGGLIELRYIDPR 160 (533) T ss_pred eecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCCCccccceeeeecccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCccchhHHHHHHH Q lcl|NC_021072. 162 KIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKA 241 (533) Q Consensus 162 ~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~ 241 (533) +|++||++.++++++.++..++.+++++..+||+|||+|+.+++++++|||.|||+||||||+|||+++|+||||+|||| T Consensus 161 ~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~g~~~~~~~~vkI~~dAI~y~hSGl~d~~~~~i~syLhkAiKp 240 (533) T protein:vir:10 161 KIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLYDPKGLKNSTTQGLKIAPDSICYVHSGIMDLNKNMTLSHLHKAIKA 240 (533) T ss_pred ceeeeeeeeccCCCccceeecchhhhccceeeeeeccccccccCCCceecchhheeeeeccceeCCCCceeccchHhHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccC Q lcl|NC_021072. 242 VNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRRE 321 (533) Q Consensus 242 ~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRRe 321 (533) |||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+||+||||||||||||||| T Consensus 241 ~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRRe 320 (533) T protein:vir:10 241 VNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRRE 320 (533) T ss_pred HHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhcccccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 322 GGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLL 401 (533) Q Consensus 322 ggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~L 401 (533) |||||||||||||||||||+||+||++|||+|||||+|||+++++|++||++||||||+||+|||.|||+||+.+|+++| T Consensus 321 GgrgTEItTLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~L 400 (533) T protein:vir:10 321 GGRGTEITTLPGGQNLGELEDVKYFQKKLYKSLNVPGSRLETETTFNVGRAAEITRDEVKFQKFVARLRKRFSELFTDLL 400 (533) T ss_pred CCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHH Q lcl|NC_021072. 402 KTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEID 481 (533) Q Consensus 402 k~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~ 481 (533) |+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||||||++||||+||+|||+||++++ T Consensus 401 k~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~ 480 (533) T protein:vir:10 401 KTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQTDVEMKEID 480 (533) T ss_pred HHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccc--ccc-CCccccchhcC Q lcl|NC_021072. 482 KQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQE--GPA-VDAGDAKRGEF 533 (533) Q Consensus 482 kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~--~~~-~~~~~~~~~~~ 533 (533) |||++|+++|+|++|+++++|.+...... .++...+ +|. .++.+....+| T Consensus 481 kqI~~E~k~~~~~~p~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:10 481 KQIESEMESGIIADPAAEMDPAMAAGDPD--AGGAPAEEVAPEGPDPSDERKAEF 533 (533) T ss_pred HHHHHHHhCCCCCCCcchhhHHhcCCCCC--cCCcccccCCCCCCCcchhhccCC Confidence 99999999999999999999877654322 2222222 222 33334445566 No 3 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=100.00 E-value=1.5e-274 Score=1521.85 Aligned_cols=532 Identities=58% Similarity=0.954 Sum_probs=493.7 Q ss_pred Cccccceeeecc-ccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 2 SNQLFGFSLERA-KKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 2 ~~~~fg~~i~~~-~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) -++||||+|++. ++..+++||+||+++||+.++.+++|+|+++++++.++|+++||++||+|++|||||+||+|||||| T Consensus 1 m~~lfgf~~~~~~~~~~~~~s~~~p~~ddg~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVnea 80 (558) T protein:vir:10 1 MAKLFGFSIEETQKKSTSIISPVPKNNEDGVDNFISSGFYGQYVDIEGAYRSEYDLIRRYREMALHPEADGAIEDVVNEA 80 (558) T ss_pred CcchhcchhhhhhhhccCCccccCCCccccccceeccceeeeeecccchhhhHHHHHHHHHHHhhccchhhHHHHhhcce Confidence 569999999875 5667889999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcCh Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDP 160 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP 160 (533) ||||++++||+|+|+++++|++||++|++||++|+++|+|+++||++||+||||||+|||||||++||++||++||+||| T Consensus 81 iv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRiyfHKiid~k~pk~GI~ELr~lDP 160 (558) T protein:vir:10 81 IVSDLYDSPVEVELSNLNASNTLKKKIREEFRYIKEMMDFDKKSHEIFRNWYVDGRVFYLKVIDTKNPQEGIQDLRYIDP 160 (558) T ss_pred eEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeeeCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhceehhhccCCCcCceeEEeccc----eeeccchhceeccccccc-------cccCCcceeccchhhccccccccCCCC Q lcl|NC_021072. 161 RKIRKVTEYQQKRPEQLRGEDINT----QLTQKAAEYYLYNPKGLK-------NSTNQGMKIATDSVTYCHSGIQDLNKN 229 (533) Q Consensus 161 ~~i~~vr~~~~~~~~~~~~~~~~~----~~~~~~~e~~~y~p~~~~-------~~~~~~~kI~~dai~y~hsGl~d~~~~ 229 (533) |+|++||++.++..|+.......+ .+.+...+||+|+|++.. .++++++|||.|||+||||||+|||++ T Consensus 161 r~i~~Vr~i~~~~~~~~~~~~~~~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~~~~~~~~vkI~~dAI~y~hSGL~d~~~~ 240 (558) T protein:vir:10 161 LKIKFIRQEKRKPGNQDPAIRVRSEQDVVPNPEFEEFYIYTPKVQHPTGMVGQMGGKNSIKIAKDSITMCTSGLVDRNKN 240 (558) T ss_pred ccceeeeeeccccccccceeeeecccceeeccceeEeeeecCCcccccccceeecCCCceeechhheeeecccceecCCC Confidence 999999999999888866655443 245677899999997653 356778999999999999999999999 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccc Q lcl|NC_021072. 230 MTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFM 309 (533) Q Consensus 230 ~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~m 309 (533) +|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+||+||| T Consensus 241 ~i~syLhkAIKp~NQLkmlEDAlVIYRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~m 320 (558) T protein:vir:10 241 RVLSYLHKAIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDANTGEVRDDRKFM 320 (558) T ss_pred eeeecchHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHH Q lcl|NC_021072. 310 SMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARL 389 (533) Q Consensus 310 smlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rL 389 (533) |||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++++|++||++||||||+||+|||.|| T Consensus 321 sMlEDyWLpRReGgrgTEItTLpGgqnLgem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RL 400 (558) T protein:vir:10 321 SMMEDFWLPRREGGRGTEITTLPGGQNLGELSDVDYFQKKLYRALGVPESRIAAEGGFNLGRSSEILRDELKFAKFVGRL 400 (558) T ss_pred hhHhhhcccccCCCCccceeeccccCCcchHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHH Q lcl|NC_021072. 390 RKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQV 469 (533) Q Consensus 390 r~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~I 469 (533) |+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||||||++||||+| T Consensus 401 R~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~I 480 (558) T protein:vir:10 401 RKRFAAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRKRV 480 (558) T ss_pred HHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hCCCHHHHHHHHHHHHHhhhcCCCCCCCcc--c---------ccCCCCCCCCCCCC---ccccccccCCccccchhcC Q lcl|NC_021072. 470 LKQTDQEIKEIDKQIDSEREAGLIVDPMAE--M---------DPAMDPGNAPPADD---MSAQEGPAVDAGDAKRGEF 533 (533) Q Consensus 470 L~~tDeeI~e~~kqi~~E~~~~~~~~p~~~--~---------~~~~~~~~~~~~~d---~~~~~~~~~~~~~~~~~~~ 533 (533) |+|||+||++++|||++|+++|+|++|+++ + +++++..+.+|.++ ..++++.|.-..++...|+ T Consensus 481 Lr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 558 (558) T protein:vir:10 481 LRQTDMEIEEIDTQIEDEIQKGIIPDPSQIDPITGEPLPQEGDPAMEGMGEQPVDPDLEAQAQAVDAQYSKDTKKAEL 558 (558) T ss_pred hccCHHHHHHHHHHHHHHHhCCCCCCccccChhhccccCccCCchhccCCCCCcccccccchhhhhhhhhhhhhhhcC Confidence 999999999999999999999999999855 2 22233334444443 3333332222234445555 No 4 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=100.00 E-value=1.3e-271 Score=1505.75 Aligned_cols=530 Identities=58% Similarity=0.938 Sum_probs=478.8 Q ss_pred CccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhh--hhhHHHHHHHHHHhhhhcchhhhHHHHhhcc Q lcl|NC_021072. 2 SNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDG--TVRNEYELITRYREMVLQPECDSAVDDIVNE 79 (533) Q Consensus 2 ~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~--~~~~~~~LI~~YR~m~~~pEvd~AvdeIvne 79 (533) -++||||+|++ +...++.|++||++++|+..+ +|||+|++++++| .++|+++||++||+|++|||||+||+||||| T Consensus 1 m~~lfgf~i~~-~~~~~~~S~vpp~~~~~~~~i-~~g~~g~~v~~~g~~~~~n~~eLI~~YR~ma~~pEVd~Av~eIVne 78 (564) T protein:vir:10 1 MSQLFGFLINE-KEGQKGQSPVPPNDEASVSTV-AGGYFGTYVDTSGGQNSRNEYELIRRYRDMSLHPEVDSAIDEIVNE 78 (564) T ss_pred Ccchhcceeee-eccCCCCCcccCCcCCChhhh-hccccceeeecccccchhhHHHHHHHHHHHhhccchhhHHHHhhcc Confidence 57999999999 667789999999999997765 6779999999998 5789999999999999999999999999999 Q ss_pred eeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcC Q lcl|NC_021072. 80 TICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYID 159 (533) Q Consensus 80 aiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lD 159 (533) |||||++++||+|+|+++++|++||++|++||++|+++|+|+++||++||+||||||+|||||||++||++||++||+|| T Consensus 79 aIv~d~~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~eLr~lD 158 (564) T protein:vir:10 79 FVVNDGDDKPVEVDLQNLEIGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRSHYHKVIDLDNPKKGILELRYID 158 (564) T ss_pred eeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeeCCChhhhhhhhhhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhceehhhccCCC-cCceeEEeccc--eeeccchhceeccccccc-----------cccCCcceeccchhhcccccccc Q lcl|NC_021072. 160 PRKIRKVTEYQQKR-PEQLRGEDINT--QLTQKAAEYYLYNPKGLK-----------NSTNQGMKIATDSVTYCHSGIQD 225 (533) Q Consensus 160 P~~i~~vr~~~~~~-~~~~~~~~~~~--~~~~~~~e~~~y~p~~~~-----------~~~~~~~kI~~dai~y~hsGl~d 225 (533) ||+|++||++.++. .++..+..+.. ..+....+||+|||++.. +++++++|||.+||+||||||+| T Consensus 159 Pr~i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~~g~~~~~~~~~~~~~~~~ikI~~daI~y~hSGL~d 238 (564) T protein:vir:10 159 SLKIRKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGFAGNIPMVTGSMDWSNQEGIKIASDAIAQSTSGLMD 238 (564) T ss_pred ccceeeeeeeccccccccceeeeeeeeeccccccccceeeccccccCcccccccccccccccceeechhhcceeccccee Confidence 99999999999986 34444444443 345567899999998752 35677899999999999999999 Q ss_pred CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccc Q lcl|NC_021072. 226 LNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDD 305 (533) Q Consensus 226 ~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d 305 (533) ||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+|| T Consensus 239 ~~~~~i~gyLhkAIKp~NQLkmlEDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGevrdd 318 (564) T protein:vir:10 239 LNKKMTLSFLHKAIKSLNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSRYRNKLVYDGQTGEIRDD 318 (564) T ss_pred CCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccchhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCC-cccccchhhhhHHhhhHHH Q lcl|NC_021072. 306 KKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETET-TFNIGRAAEITRDEVKFQK 384 (533) Q Consensus 306 ~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~-~~~~g~~~eItRDElkF~K 384 (533) +||||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++ +|++||++||||||+||+| T Consensus 319 rk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~~EItRDEiKF~K 398 (564) T protein:vir:10 319 KKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELKDVEYFKKKLYNSLNLPPSRLTDDNKAFNLGKSTEILRDELKFTK 398 (564) T ss_pred chhhhhHhhhcccccCCCcccceeeccccCCcchHHHHHHHHHHHHHHhCCCcccccCCCceeecccccchhHHHHHHHH Confidence 999999999999999999999999999999999999999999999999999999999995 8999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHH Q lcl|NC_021072. 385 FIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDY 464 (533) Q Consensus 385 fi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~ 464 (533) ||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||||||++| T Consensus 399 FI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dy 478 (564) T protein:vir:10 399 FIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTEY 478 (564) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCC--CCCCC-CCCC-----Ccccccccc--CCccccchhcC Q lcl|NC_021072. 465 MRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAM--DPGNA-PPAD-----DMSAQEGPA--VDAGDAKRGEF 533 (533) Q Consensus 465 i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~--~~~~~-~~~~-----d~~~~~~~~--~~~~~~~~~~~ 533 (533) |||+||+|||+||++++|||++|+++|+|++|..+..... .+++. +|.+ |.+.+...+ ..|++.+++.= T Consensus 479 i~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~a~~~~~~~~ 557 (564) T protein:vir:10 479 IRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNMLDDMEKQNQAFAPELQAAQDDLAAEREIKKLNSAPKPPPSQQ 557 (564) T ss_pred HHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhcCCCccCCCCcCCcchhhhccccccccChhhhccCCCCCCCCC Confidence 9999999999999999999999999999999954432211 11221 2222 111110000 11111111111 No 5 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=100.00 E-value=6.6e-267 Score=1479.86 Aligned_cols=495 Identities=36% Similarity=0.656 Sum_probs=475.5 Q ss_pred CCccccceeeec--cccccccCCCCCCCCCccccee---------ecccccccccchhhhhhHHHHHHHHHHhhhhcchh Q lcl|NC_021072. 1 MSNQLFGFSLER--AKKVPKGPSFVQKDSMDGSQPI---------VGGGYYGYSVDFDGTVRNEYELITRYREMVLQPEC 69 (533) Q Consensus 1 ~~~~~fg~~i~~--~~~~~~~~s~~~~~~~dg~~~~---------~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEv 69 (533) |-++||||++++ +++++++.|++||+++||++.+ +.|+++++|++++++++|+++||++||+||+|||| T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEv 86 (521) T protein:vir:81 7 MLARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLVNTYRGLMNNHEV 86 (521) T ss_pred hhHhhcCchhhhHHhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHHHHHHHHhhccch Confidence 778999999988 4557899999999999999766 35789999999999999999999999999999999 Q ss_pred hhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCC Q lcl|NC_021072. 70 DSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPR 149 (533) Q Consensus 70 d~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~ 149 (533) |+||+||||||||||++++||+|+|+++++|++||++|+|||++|+++|+|+++||++||+||||||+||||||| +||+ T Consensus 87 d~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid-~~pk 165 (521) T protein:vir:81 87 ENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIG-KNPK 165 (521) T ss_pred hhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEEc-CCcc Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999 8999 Q ss_pred CCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceecccc-------ccccccCCcceeccchhhccccc Q lcl|NC_021072. 150 GGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPK-------GLKNSTNQGMKIATDSVTYCHSG 222 (533) Q Consensus 150 ~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~-------~~~~~~~~~~kI~~dai~y~hsG 222 (533) +||++||+|||++|++||++.+++.++. .++++..+||+|+|. |..+++++++|||.|||+||||| T Consensus 166 ~GI~Elr~lDPr~i~~vr~i~k~~~~~~-------~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~hSG 238 (521) T protein:vir:81 166 DGIVELRQLDPRNLEYVREIITEDTPEG-------KIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYAHSG 238 (521) T ss_pred ccceeeeeeCCcceeeeeeecccccCcc-------ceecceeeeeeeecCCccccccceeecCCcceeechhheeeeecc Confidence 9999999999999999999999876654 456677888888874 45668899999999999999999 Q ss_pred cccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCcc Q lcl|NC_021072. 223 IQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEI 302 (533) Q Consensus 223 l~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev 302 (533) |+|||+|+|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+| T Consensus 239 l~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvYDa~TGev 318 (521) T protein:vir:81 239 LMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKL 318 (521) T ss_pred ceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccC--CCCcccccchhhhhHHhh Q lcl|NC_021072. 303 KDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLE--TETTFNIGRAAEITRDEV 380 (533) Q Consensus 303 ~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~--~~~~~~~g~~~eItRDEl 380 (533) +||+|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+ ++++|++|+++||||||+ T Consensus 319 ~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEi 398 (521) T protein:vir:81 319 KNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDEL 398 (521) T ss_pred cccccccchhhhhcccccCCCcccceeecccCCCCChHHHHHHHHHHHHHHhCCccccccCCCCcceeccccchhhHHHH Confidence 999999999999999999999999999999999999999999999999999999999995 446899999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccc Q lcl|NC_021072. 381 KFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYF 460 (533) Q Consensus 381 kF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~ 460 (533) ||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||||| T Consensus 399 KF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~ 478 (521) T protein:vir:81 399 EFSKFIRTRQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYF 478 (521) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccC Q lcl|NC_021072. 461 SIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPA 503 (533) Q Consensus 461 S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~ 503 (533) |++||||+||+|||+||++++|||++|+++|+|++|+++++.= T Consensus 479 s~dyi~k~ILr~tDeei~~~~k~I~~E~~~~~~~~p~~~~~~f 521 (521) T protein:vir:81 479 SNQTVMRDILKYTDDQMDTEKKQIEEEANDPRFKQTPDEIEDF 521 (521) T ss_pred chHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCCcccccCC Confidence 9999999999999999999999999999999999999876543 No 6 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=100.00 E-value=1.7e-266 Score=1477.69 Aligned_cols=496 Identities=37% Similarity=0.670 Sum_probs=472.6 Q ss_pred CCccccceeeecc------ccccccCCCCCCCCCcccceeeccccccc----------ccchhhhhhHHHHHHHHHHhhh Q lcl|NC_021072. 1 MSNQLFGFSLERA------KKVPKGPSFVQKDSMDGSQPIVGGGYYGY----------SVDFDGTVRNEYELITRYREMV 64 (533) Q Consensus 1 ~~~~~fg~~i~~~------~~~~~~~s~~~~~~~dg~~~~~~~~~~~~----------~~~~~~~~~~~~~LI~~YR~m~ 64 (533) --.+||||.++.. +.+..+.|++||+++||+.++..++++++ |++++++++|+++||++||+|| T Consensus 4 ~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~ma 83 (523) T protein:vir:68 4 NILSLFAPWAKMDERDYKDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTRELIDTYRNLM 83 (523) T ss_pred chhhhhhhhhhhhhhhhhhhhhccCCCccccCCCCcceeeeccccccccccchhhhhhhhccccccchHHHHHHHHHHHh Confidence 2368999999854 34667789999999999999986666553 6789999999999999999999 Q ss_pred hcchhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeec Q lcl|NC_021072. 65 LQPECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVID 144 (533) Q Consensus 65 ~~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid 144 (533) +|||||+||+||||||||||++++||+|+|+++++|++||++|++||++|+++|+|+++||++||+||||||+||||||| T Consensus 84 ~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid 163 (523) T protein:vir:68 84 TNYEVDNAVSEIVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLNHLSFQRKGSDHFRRWYVDSRIFFHKIID 163 (523) T ss_pred hccchhhHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhheeeeEEEEEEEee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccc-------cccCCcceeccchhh Q lcl|NC_021072. 145 PKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLK-------NSTNQGMKIATDSVT 217 (533) Q Consensus 145 ~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~-------~~~~~~~kI~~dai~ 217 (533) ++||++||++||+|||++|++||++.++++.+. .++++..+||+|+|++.. +++++++|||.|||| T Consensus 164 ~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~g~-------~vi~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~ 236 (523) T protein:vir:68 164 PKRPKEGIKELRRLDPRQVQYVREVITTTEAGV-------KIVKGYKEYFIYDTSHESYACDGRIYEAGTKIKIPKAAIV 236 (523) T ss_pred CCCccccceeeeeeCCcceeEEEeecCCCCcch-------hhhhhhhhheeeccccccccccccccCCCcceecchhhee Confidence 999999999999999999999999999876543 456677899999987643 467899999999999 Q ss_pred ccccccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeC Q lcl|NC_021072. 218 YCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDA 297 (533) Q Consensus 218 y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~ 297 (533) ||||||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||+ T Consensus 237 y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNKlvYDa 316 (523) T protein:vir:68 237 YAHSGLVDCCGKNIIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKNRIAYDA 316 (523) T ss_pred eeeccceeCCCCceeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhhcceeEEec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCC-Ccccccchhhhh Q lcl|NC_021072. 298 NTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETE-TTFNIGRAAEIT 376 (533) Q Consensus 298 ~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~-~~~~~g~~~eIt 376 (533) +||+|+||+||||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++ ++|++|+++||| T Consensus 317 ~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EIt 396 (523) T protein:vir:68 317 TTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDVRWFRNALYMALRIPITRIPSDQGGIQFDAGTSIT 396 (523) T ss_pred cCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCcceeecCCCcceecccccchh Confidence 9999999999999999999999999999999999999999999999999999999999999999877 579999999999 Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_021072. 377 RDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYV 456 (533) Q Consensus 377 RDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~v 456 (533) |||+||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++||| T Consensus 397 RDEikF~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyv 476 (523) T protein:vir:68 397 RDELSFGKFIRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMAEPFI 476 (523) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCC Q lcl|NC_021072. 457 GKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAM 504 (533) Q Consensus 457 Gky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~ 504 (533) |||||++||||+||+|||+||++++|||++|+++|+|++|+++++- | T Consensus 477 Gky~s~~yi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~e~~~-f 523 (523) T protein:vir:68 477 GKYISHRTAMKDILQMSDEEIEQEAKQIEEESKEARFQDPDQEQED-F 523 (523) T ss_pred cccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhc-C Confidence 9999999999999999999999999999999999999999877543 2 No 7 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=100.00 E-value=5.3e-266 Score=1474.91 Aligned_cols=496 Identities=36% Similarity=0.645 Sum_probs=471.2 Q ss_pred CC-----ccccceeeeccc------cccccCCCCCCCCCcccceeeccccc--------ccccchhhhhhHHHHHHHHHH Q lcl|NC_021072. 1 MS-----NQLFGFSLERAK------KVPKGPSFVQKDSMDGSQPIVGGGYY--------GYSVDFDGTVRNEYELITRYR 61 (533) Q Consensus 1 ~~-----~~~fg~~i~~~~------~~~~~~s~~~~~~~dg~~~~~~~~~~--------~~~~~~~~~~~~~~~LI~~YR 61 (533) |. .+||||.+++.+ .++++.|++||+++||+++|.++.++ ++|++++++++|+++||++|| T Consensus 1 ~~~~~~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR 80 (524) T protein:vir:10 1 MANFNTILSFLKPWANEDEKEYKQQINNNLESVTAPKLDDGAREIETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTYR 80 (524) T ss_pred CCchhhHHHHhhhhhcchhhhhhhhhccCCCccccCCCCCCceeeccCcccccchhhhhhhhhcccchhhhHHHHHHHHH Confidence 32 589999998543 35788999999999999998866433 447789999999999999999 Q ss_pred hhhhcchhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeee Q lcl|NC_021072. 62 EMVLQPECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHK 141 (533) Q Consensus 62 ~m~~~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hk 141 (533) +||+|||||+||+||||||||||++++||+|+|+++++|++||++|+|||++|+++|+|+++||++||+||||||+|||| T Consensus 81 ~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHk 160 (524) T protein:vir:10 81 NLMNNYEVDNAVQEIVSDAIVYEDDKEVVALNLDGTDFSQSIKDKILAEFSEVLNLLNFQRKGTDHFQRWYVDSRIFFHK 160 (524) T ss_pred HHhhccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeceEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccc-------cccccCCcceeccc Q lcl|NC_021072. 142 VIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKG-------LKNSTNQGMKIATD 214 (533) Q Consensus 142 vid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~-------~~~~~~~~~kI~~d 214 (533) |||++||++||++||+|||++|++||++.+++.++. .+..+..+||+|+|.. ..+++++++|||.| T Consensus 161 iid~~~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~-------~vi~~~~e~f~Y~~~~~~~~~~~~~~~~~~~ikI~~d 233 (524) T protein:vir:10 161 IINPKKMKDGVQELRRLDPRQVQYIREIVTRMEDGV-------KIVDGYREFFVYDTGHESYCADGRIYSAGTKVKIPRA 233 (524) T ss_pred EeeCCCccccceeeeeeCCccceeeeeecccCcccc-------hhhcchhhheeecCCCcccccCcceecCCcceecchh Confidence 999999999999999999999999999999876654 3455667899988743 34588999999999 Q ss_pred hhhccccccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEE Q lcl|NC_021072. 215 SVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLV 294 (533) Q Consensus 215 ai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~v 294 (533) ||+||||||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+| T Consensus 234 AIvy~~SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~kNKlv 313 (524) T protein:vir:10 234 AVVYAHSGLLDCCGKNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTMKNRVV 313 (524) T ss_pred heeeeccCcccCCCCceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCC--Ccccccch Q lcl|NC_021072. 295 YDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETE--TTFNIGRA 372 (533) Q Consensus 295 Yd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~--~~~~~g~~ 372 (533) ||++||+|+||+||||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++ ++|++|++ T Consensus 314 YDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~f~~gr~ 393 (524) T protein:vir:10 314 YDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTMPGATGMSDMDDVLYFRTALYRALRIPESRIPSESNSGVMFDAG 393 (524) T ss_pred EeccCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCchhccCCCCcccccccc Confidence 9999999999999999999999999999999999999999999999999999999999999999999766 58999999 Q ss_pred hhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021072. 373 AEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTM 452 (533) Q Consensus 373 ~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~ 452 (533) +||||||+||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|+++++++ T Consensus 394 ~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~ 473 (524) T protein:vir:10 394 TAITRDELKFAKWIRQLQNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLTMA 473 (524) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccC Q lcl|NC_021072. 453 DPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPA 503 (533) Q Consensus 453 ~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~ 503 (533) +||||||||++||||+||+|||+||++++|||++|+++|+|++|+++++-= T Consensus 474 dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 474 EPFIGKYISHQTAMKDFLQMTDEEINQEAKQIEEESKEARFQNPDEEEEDF 524 (524) T ss_pred hhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCChhhhcC Confidence 999999999999999999999999999999999999999999998775433 No 8 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=100.00 E-value=2.4e-265 Score=1471.32 Aligned_cols=496 Identities=36% Similarity=0.695 Sum_probs=473.8 Q ss_pred CCccccceeeecc------ccccccCCCCCCCCCccccee--------ecccccccccchhhhhhHHHHHHHHHHhhhhc Q lcl|NC_021072. 1 MSNQLFGFSLERA------KKVPKGPSFVQKDSMDGSQPI--------VGGGYYGYSVDFDGTVRNEYELITRYREMVLQ 66 (533) Q Consensus 1 ~~~~~fg~~i~~~------~~~~~~~s~~~~~~~dg~~~~--------~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~ 66 (533) --.+||||.++.. ..++++.|+++|+++||+++| .++++++++++.++.++|+++||++||+||+| T Consensus 4 ~~l~lf~f~~k~~e~~~~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR~ma~~ 83 (521) T protein:vir:10 4 IFLKLLQPWMKDDEKRVQSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQNTKDLINQYRSLSKY 83 (521) T ss_pred chhHHhhhhhhhhhhHHhhhhccCccccccccCCCCceeeccCCCccccccchhhhhhccccccchHHHHHHHHHHHhhc Confidence 2357899999853 345789999999999999766 47788899999999999999999999999999 Q ss_pred chhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCC Q lcl|NC_021072. 67 PECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPK 146 (533) Q Consensus 67 pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~ 146 (533) ||||+||+||||||||||++++||+|+|+++++|++||++|++||++|+++|+|+++||++||+||||||+|||||||++ T Consensus 84 pEvd~Av~eIvneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~ 163 (521) T protein:vir:10 84 HEVDNAIDEIINDAIVQEDNRDTVYLDLDKTDWNESVKEMVREEFRTILKLLKFEREGKRHFRRWYVDSRIYFHKMIDPA 163 (521) T ss_pred cchhhHHHhhhcceEEecCCCceEEEEecCcccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeeEEEEEEeeCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccc-----cccccCCcceeccchhhcccc Q lcl|NC_021072. 147 NPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKG-----LKNSTNQGMKIATDSVTYCHS 221 (533) Q Consensus 147 ~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~-----~~~~~~~~~kI~~dai~y~hs 221 (533) ||++||++||+|||++|++||++.+++.++. .++.+..+||+|+|.+ ..++++++++||.|||||||| T Consensus 164 ~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~-------~v~~~~~e~f~Y~~~~~~~~~~~g~~~~~vkI~~daI~y~hS 236 (521) T protein:vir:10 164 RPKDGIKELRLLDPRNVEYYRVNLKSNENGN-------DVYKGVKEFFTYGATEDNRYNISGNSNNLVQIPIDAIVYSHS 236 (521) T ss_pred CccccceeeeeeCCcceeeeeeecCCCCCcc-------hhhccceeeeeeccCCCceecCCCCCCcceeechhheeeecc Confidence 9999999999999999999999999876654 3555668999998743 345677889999999999999 Q ss_pred ccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCc Q lcl|NC_021072. 222 GIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGE 301 (533) Q Consensus 222 Gl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGe 301 (533) ||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+ T Consensus 237 GL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYDa~TGe 316 (521) T protein:vir:10 237 GKVDIDGKTIVGYLHNVIKPANQLKMLEDAMVIYRITRAPERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYDSSTGK 316 (521) T ss_pred cceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCC-cccccchhhhhHHhh Q lcl|NC_021072. 302 IKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETET-TFNIGRAAEITRDEV 380 (533) Q Consensus 302 v~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~-~~~~g~~~eItRDEl 380 (533) |+||+||||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++++ ||++|+++||||||+ T Consensus 317 v~ddrk~msMlEDyWLpRReGgrgTEI~TLpggqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr~~EItRDEi 396 (521) T protein:vir:10 317 VKNSSNNLAMTEDYWLMRRDGKATTEVSTLPGAQSMGEMDDVRWFNRKLYESMKIPLSRLPQEGAGVTFGAGNDITRDEL 396 (521) T ss_pred eccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCCceecccccchhHHHH Confidence 9999999999999999999999999999999999999999999999999999999999999995 799999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhh--hccc Q lcl|NC_021072. 381 KFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDP--YVGK 458 (533) Q Consensus 381 kF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~--~vGk 458 (533) ||+|||.|||+||+.+|.++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|+++++++|| |||| T Consensus 397 kF~KFI~rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGk 476 (521) T protein:vir:10 397 QFTKYIRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGK 476 (521) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCcccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999 9999 Q ss_pred cccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccC Q lcl|NC_021072. 459 YFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPA 503 (533) Q Consensus 459 y~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~ 503 (533) |||++||||+||+|||+||++++|||++|+++|+|++|+++++-= T Consensus 477 y~s~dyi~k~ILr~tDeeik~~~k~I~~E~~~~~~~~p~~e~~df 521 (521) T protein:vir:10 477 YLSHEYVMKNILRMSDEDIKTEREKIDGELKDSVYKNPEDPMEEF 521 (521) T ss_pred ccchHHHHHHHhcCCHhHHHHHHHHHHHhhhCCCCCCCcchhhcC Confidence 999999999999999999999999999999999999999875433 No 9 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=100.00 E-value=8.4e-265 Score=1468.33 Aligned_cols=493 Identities=38% Similarity=0.676 Sum_probs=470.2 Q ss_pred CCccccceeeec------cccccccCCCCCCCCCcccceeec-------ccccccccchhhhhhHHHHHHHHHHhhhhcc Q lcl|NC_021072. 1 MSNQLFGFSLER------AKKVPKGPSFVQKDSMDGSQPIVG-------GGYYGYSVDFDGTVRNEYELITRYREMVLQP 67 (533) Q Consensus 1 ~~~~~fg~~i~~------~~~~~~~~s~~~~~~~dg~~~~~~-------~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~p 67 (533) --.+||||.++. +++++++.|++||+++||+.++.+ +|++|++.++++.++|+++||++||+||+|| T Consensus 2 ~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~p 81 (516) T protein:vir:10 2 KFLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINNP 81 (516) T ss_pred CchHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhcc Confidence 235799997764 345689999999999999998864 8899999999999999999999999999999 Q ss_pred hhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCC Q lcl|NC_021072. 68 ECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKN 147 (533) Q Consensus 68 Evd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~ 147 (533) |||+||+||||||||||++++||+|+|+++++|++||++|++||++|+++|+|+++||++||+||||||+||||||| | T Consensus 82 Evd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid--~ 159 (516) T protein:vir:10 82 EVERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMP--N 159 (516) T ss_pred chhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEec--C Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999997 8 Q ss_pred CCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceecccc-------ccccccCCcceeccchhhccc Q lcl|NC_021072. 148 PRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPK-------GLKNSTNQGMKIATDSVTYCH 220 (533) Q Consensus 148 ~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~-------~~~~~~~~~~kI~~dai~y~h 220 (533) |++||++||+|||++|++||++.+++.++.. +..+..+||+|+|. |..+++++++|||.||||||| T Consensus 160 ~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~-------v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~h 232 (516) T protein:vir:10 160 PKKGIAELRRLDPRFMEYYREIVTSDIGGTT-------IVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYAS 232 (516) T ss_pred ccccceeeeeeCCcceeeEeeecccccccch-------hhhhhhheeeeccCccccccccceeCCCcceeechhheeeec Confidence 9999999999999999999999888766543 44556677777764 344678899999999999999 Q ss_pred cccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCC Q lcl|NC_021072. 221 SGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTG 300 (533) Q Consensus 221 sGl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TG 300 (533) |||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++|| T Consensus 233 SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TG 312 (516) T protein:vir:10 233 SGLMDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTG 312 (516) T ss_pred ccceeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccc--cchhhhhHH Q lcl|NC_021072. 301 EIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNI--GRAAEITRD 378 (533) Q Consensus 301 ev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~--g~~~eItRD 378 (533) +|+||+||||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++++|++ |+++||||| T Consensus 313 ev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRD 392 (516) T protein:vir:10 313 TVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRD 392 (516) T ss_pred eeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHH Confidence 999999999999999999999999999999999999999999999999999999999999999999887 999999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_021072. 379 EVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGK 458 (533) Q Consensus 379 ElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGk 458 (533) |+||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||| T Consensus 393 EiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGk 472 (516) T protein:vir:10 393 ELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGK 472 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccc Q lcl|NC_021072. 459 YFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDP 502 (533) Q Consensus 459 y~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~ 502 (533) |||++||||+||+|||+||++++|||++|+++|+|++|+++++= T Consensus 473 y~s~~yi~k~ILr~tDeei~~e~k~I~~E~~~~~~~~p~~~~~f 516 (516) T protein:vir:10 473 YVSHDYVMKNILQMTEEQIAQEEKQIEQEAGIKRFQNPENEDDF 516 (516) T ss_pred ccchHHHHHHHhcCCHhhHHHHHHHHHHhhhCCCCCCCCccccC Confidence 99999999999999999999999999999999999999877543 No 10 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=100.00 E-value=8.4e-265 Score=1468.33 Aligned_cols=493 Identities=38% Similarity=0.676 Sum_probs=470.2 Q ss_pred CCccccceeeec------cccccccCCCCCCCCCcccceeec-------ccccccccchhhhhhHHHHHHHHHHhhhhcc Q lcl|NC_021072. 1 MSNQLFGFSLER------AKKVPKGPSFVQKDSMDGSQPIVG-------GGYYGYSVDFDGTVRNEYELITRYREMVLQP 67 (533) Q Consensus 1 ~~~~~fg~~i~~------~~~~~~~~s~~~~~~~dg~~~~~~-------~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~p 67 (533) --.+||||.++. +++++++.|++||+++||+.++.+ +|++|++.++++.++|+++||++||+||+|| T Consensus 2 ~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~p 81 (516) T protein:vir:10 2 KFLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINNP 81 (516) T ss_pred CchHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhcc Confidence 235799997764 345689999999999999998864 8899999999999999999999999999999 Q ss_pred hhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCC Q lcl|NC_021072. 68 ECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKN 147 (533) Q Consensus 68 Evd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~ 147 (533) |||+||+||||||||||++++||+|+|+++++|++||++|++||++|+++|+|+++||++||+||||||+||||||| | T Consensus 82 Evd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid--~ 159 (516) T protein:vir:10 82 EVERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMP--N 159 (516) T ss_pred chhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEec--C Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999997 8 Q ss_pred CCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceecccc-------ccccccCCcceeccchhhccc Q lcl|NC_021072. 148 PRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPK-------GLKNSTNQGMKIATDSVTYCH 220 (533) Q Consensus 148 ~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~-------~~~~~~~~~~kI~~dai~y~h 220 (533) |++||++||+|||++|++||++.+++.++.. +..+..+||+|+|. |..+++++++|||.||||||| T Consensus 160 ~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~-------v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~h 232 (516) T protein:vir:10 160 PKKGIAELRRLDPRFMEYYREIVTSDIGGTT-------IVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYAS 232 (516) T ss_pred ccccceeeeeeCCcceeeEeeecccccccch-------hhhhhhheeeeccCccccccccceeCCCcceeechhheeeec Confidence 9999999999999999999999888766543 44556677777764 344678899999999999999 Q ss_pred cccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCC Q lcl|NC_021072. 221 SGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTG 300 (533) Q Consensus 221 sGl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TG 300 (533) |||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++|| T Consensus 233 SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TG 312 (516) T protein:vir:10 233 SGLMDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTG 312 (516) T ss_pred ccceeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccc--cchhhhhHH Q lcl|NC_021072. 301 EIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNI--GRAAEITRD 378 (533) Q Consensus 301 ev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~--g~~~eItRD 378 (533) +|+||+||||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++++|++ |+++||||| T Consensus 313 ev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRD 392 (516) T protein:vir:10 313 TVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRD 392 (516) T ss_pred eeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHH Confidence 999999999999999999999999999999999999999999999999999999999999999999887 999999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_021072. 379 EVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGK 458 (533) Q Consensus 379 ElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGk 458 (533) |+||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||| T Consensus 393 EiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGk 472 (516) T protein:vir:10 393 ELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGK 472 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccc Q lcl|NC_021072. 459 YFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDP 502 (533) Q Consensus 459 y~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~ 502 (533) |||++||||+||+|||+||++++|||++|+++|+|++|+++++= T Consensus 473 y~s~~yi~k~ILr~tDeei~~e~k~I~~E~~~~~~~~p~~~~~f 516 (516) T protein:vir:10 473 YVSHDYVMKNILQMTEEQIAQEEKQIEQEAGIKRFQNPENEDDF 516 (516) T ss_pred ccchHHHHHHHhcCCHhhHHHHHHHHHHhhhCCCCCCCCccccC Confidence 99999999999999999999999999999999999999877543 No 11 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=100.00 E-value=1.4e-264 Score=1467.08 Aligned_cols=495 Identities=36% Similarity=0.659 Sum_probs=474.7 Q ss_pred CCccccceeeec--cccccccCCCCCCCCCcccceee---------cccccccccchhhhhhHHHHHHHHHHhhhhcchh Q lcl|NC_021072. 1 MSNQLFGFSLER--AKKVPKGPSFVQKDSMDGSQPIV---------GGGYYGYSVDFDGTVRNEYELITRYREMVLQPEC 69 (533) Q Consensus 1 ~~~~~fg~~i~~--~~~~~~~~s~~~~~~~dg~~~~~---------~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEv 69 (533) |-+.++++..++ ++.+++++|+++|+++||+.++. .+++++++++++++++|+++||++||+||+|||| T Consensus 7 ~~~~~~~~d~~~~~e~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEv 86 (521) T protein:vir:65 7 MLARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGLMNNHEV 86 (521) T ss_pred hhhhccCchhhHHHhhhccCCCcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHHHHHHHHhhccch Confidence 778889998866 57778999999999999999884 4689999999999999999999999999999999 Q ss_pred hhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCC Q lcl|NC_021072. 70 DSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPR 149 (533) Q Consensus 70 d~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~ 149 (533) |+||+||||||||||++++||+|+|+++++|++||++|+|||++|+++|+|+++||++||+||||||+||||||| +||+ T Consensus 87 d~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid-~~pk 165 (521) T protein:vir:65 87 ENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIG-KNPK 165 (521) T ss_pred hhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEEEEc-CCcc Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999 8999 Q ss_pred CCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccc-------cccccccCCcceeccchhhccccc Q lcl|NC_021072. 150 GGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNP-------KGLKNSTNQGMKIATDSVTYCHSG 222 (533) Q Consensus 150 ~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p-------~~~~~~~~~~~kI~~dai~y~hsG 222 (533) +||++||+|||++|++||++.+++.++. .++++..+||+|+| .|..+++++++|||.|||+||||| T Consensus 166 ~GI~ELr~lDPr~i~~vr~i~k~~~~~~-------~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~hSG 238 (521) T protein:vir:65 166 DGIVELRQLDPRNLEYVREIITEDTPEG-------KIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYAHSG 238 (521) T ss_pred ccceeeeeeCCcceeeeeeecccccCCc-------ceecceeeeeeeecCCcceeccceeecCCcceeechhheeeeecc Confidence 9999999999999999999999877654 45667788888865 344568899999999999999999 Q ss_pred cccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCcc Q lcl|NC_021072. 223 IQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEI 302 (533) Q Consensus 223 l~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev 302 (533) |+|||+|+|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+| T Consensus 239 l~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev 318 (521) T protein:vir:65 239 LMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKL 318 (521) T ss_pred ceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccC--CCCcccccchhhhhHHhh Q lcl|NC_021072. 303 KDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLE--TETTFNIGRAAEITRDEV 380 (533) Q Consensus 303 ~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~--~~~~~~~g~~~eItRDEl 380 (533) +||+|+||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+ ++++|++||++||||||+ T Consensus 319 ~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EItRDEi 398 (521) T protein:vir:65 319 KNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDEL 398 (521) T ss_pred cccccccchhhhhcccccCCCCccceeecccCCCcChHHHHHHHHHHHHHHhCCCceeccCCCCcceeccccchhhHHHH Confidence 999999999999999999999999999999999999999999999999999999999985 446899999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccc Q lcl|NC_021072. 381 KFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYF 460 (533) Q Consensus 381 kF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~ 460 (533) ||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||||| T Consensus 399 KF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~ 478 (521) T protein:vir:65 399 EFSKFIRTLQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYF 478 (521) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccC Q lcl|NC_021072. 461 SIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPA 503 (533) Q Consensus 461 S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~ 503 (533) |++||||+||+|||+||++++|||++|+++|+|++|+++++.= T Consensus 479 S~dyi~k~ILr~tDeei~~~~k~I~~E~~~~~~~~p~~~~~~f 521 (521) T protein:vir:65 479 SNQTVMRDILKYTDDQMDTEKKQIEEEANDPRFKQTPDEIEDF 521 (521) T ss_pred chHHHHHHHhccCHHHHHHHHHHHHHhhhCCCCCCCcccccCC Confidence 9999999999999999999999999999999999999876543 No 12 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=100.00 E-value=2.5e-264 Score=1465.72 Aligned_cols=493 Identities=39% Similarity=0.686 Sum_probs=468.7 Q ss_pred CCccccceeee------ccccccccCCCCCCCCCcccceeec-------ccccccccchhhhhhHHHHHHHHHHhhhhcc Q lcl|NC_021072. 1 MSNQLFGFSLE------RAKKVPKGPSFVQKDSMDGSQPIVG-------GGYYGYSVDFDGTVRNEYELITRYREMVLQP 67 (533) Q Consensus 1 ~~~~~fg~~i~------~~~~~~~~~s~~~~~~~dg~~~~~~-------~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~p 67 (533) --.+||||..+ ++++++++.|++||+++||++++.+ +|++|+++++++.++++++||++||+||+|| T Consensus 2 ~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~~LI~~YR~ma~~p 81 (516) T protein:vir:10 2 KFLDLFKFWDRVDQNEYDERLKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDNNISGTKDLINTYRQLTNNP 81 (516) T ss_pred CchHhcccccchhhHHHHhhhcCCCCcccCCCCccCceeeecCcccccccceeeeeecccCccccHHHHHHHHHHhhhcc Confidence 23578999433 3366789999999999999999964 6899999999999999999999999999999 Q ss_pred hhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCC Q lcl|NC_021072. 68 ECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKN 147 (533) Q Consensus 68 Evd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~ 147 (533) |||+||+||||||||||++++||+|+|+++++|++||++|++||++|+++|+|+++||++||+||||||+||||||| | T Consensus 82 Evd~Av~eIvneaiv~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid--~ 159 (516) T protein:vir:10 82 EVERAVANIVNEAVVYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICRLLDASRKLDTLFRRWYIDSRIFFHKIMP--N 159 (516) T ss_pred chhHHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhhhhcceEEEEEEec--C Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999997 8 Q ss_pred CCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceecc-------ccccccccCCcceeccchhhccc Q lcl|NC_021072. 148 PRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYN-------PKGLKNSTNQGMKIATDSVTYCH 220 (533) Q Consensus 148 ~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~-------p~~~~~~~~~~~kI~~dai~y~h 220 (533) |++||++||+|||++|++||++.+++.++..+..+. .+||+|. ..|..+++++++|||.|||+||| T Consensus 160 ~k~GI~elr~lDPr~i~~vR~i~~~~~~~~~v~~~~-------~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~daI~y~h 232 (516) T protein:vir:10 160 PKEGIVELRRLDPRHVEYYREIVTSDVGGTSVVKGY-------REFFVYTTGNEGYAYNGRLFEPNTRIKIPRSAIVYAH 232 (516) T ss_pred cccceeeeeeeCCcceeeEEeeecccCcchhhhhce-------eeeeeeecCccceeccccccCCCCceecchhheeeee Confidence 999999999999999999999999988876555444 4455553 33444678899999999999999 Q ss_pred cccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCC Q lcl|NC_021072. 221 SGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTG 300 (533) Q Consensus 221 sGl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TG 300 (533) |||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++|| T Consensus 233 SGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TG 312 (516) T protein:vir:10 233 SGLQDCSDRGIVGYLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTG 312 (516) T ss_pred cCcccCCCCceeceehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccc--cchhhhhHH Q lcl|NC_021072. 301 EIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNI--GRAAEITRD 378 (533) Q Consensus 301 ev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~--g~~~eItRD 378 (533) +|+||+||||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++++|++ |+++||||| T Consensus 313 ev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRD 392 (516) T protein:vir:10 313 TVKNQKRNLSMTEDYWLMRRDGKSVTEVTSLPGAQTMGEMDDVRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRD 392 (516) T ss_pred eeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHH Confidence 999999999999999999999999999999999999999999999999999999999999999999887 999999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_021072. 379 EVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGK 458 (533) Q Consensus 379 ElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGk 458 (533) |+||+|||.|||+|||.+|.++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||| T Consensus 393 EiKF~KFI~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGk 472 (516) T protein:vir:10 393 ELDFRKFIVQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGK 472 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccc Q lcl|NC_021072. 459 YFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDP 502 (533) Q Consensus 459 y~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~ 502 (533) |||++||||+||+|||+||++++|||++|+++|+|++|+++++= T Consensus 473 y~s~~yi~k~ILr~tDeei~~~~k~I~~E~~~~~~~~p~~e~~f 516 (516) T protein:vir:10 473 YVSHDYVMKNILQMTDEQIAQEEKQIEKEANVKRFQNPENEDDF 516 (516) T ss_pred ccchHHHHHHHhcCCHhHHHHHHHHHHHhhhCCCCCCCCccccC Confidence 99999999999999999999999999999999999999887554 No 13 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=100.00 E-value=6.1e-264 Score=1463.60 Aligned_cols=495 Identities=38% Similarity=0.687 Sum_probs=469.5 Q ss_pred CCccccceeeec----------cccccccCCCCCCCCCcccceee------cccccccccchhhh----hhHHHHHHHHH Q lcl|NC_021072. 1 MSNQLFGFSLER----------AKKVPKGPSFVQKDSMDGSQPIV------GGGYYGYSVDFDGT----VRNEYELITRY 60 (533) Q Consensus 1 ~~~~~fg~~i~~----------~~~~~~~~s~~~~~~~dg~~~~~------~~~~~~~~~~~~~~----~~~~~~LI~~Y 60 (533) |.-..+|| ++. .+.+.+..|++||+++|||..+. +++|+|++++++|+ ++|+++||++| T Consensus 1 m~~~~L~~-~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~Y 79 (524) T protein:vir:10 1 MKFNVLSL-FAPWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTY 79 (524) T ss_pred CCCchhhH-hhccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHH Confidence 87777776 322 24456888999999999998883 46899999887765 78999999999 Q ss_pred HhhhhcchhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeee Q lcl|NC_021072. 61 REMVLQPECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYH 140 (533) Q Consensus 61 R~m~~~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~h 140 (533) |+||+|||||+||+||||||||||++++||+|+|+++++|++||++|++||++|+++|+|+++||++||+||||||+||| T Consensus 80 R~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fh 159 (524) T protein:vir:10 80 RNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFNDVLNHLSFQRKGSDHFRRWYVDSRIFFH 159 (524) T ss_pred HHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceecccccc-------ccccCCcceecc Q lcl|NC_021072. 141 KVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGL-------KNSTNQGMKIAT 213 (533) Q Consensus 141 kvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~-------~~~~~~~~kI~~ 213 (533) ||||++||++||++||+|||++|++||++.+++.++. .+.++..+||+|+|.+. .+++++++|||. T Consensus 160 Kiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~-------~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~ 232 (524) T protein:vir:10 160 KIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGT-------KIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPK 232 (524) T ss_pred EEeeCCCccccceeeeeeCCccceeeeeeccCCCccc-------hhhcchhhheeeccCccccccCccccCCCcceecch Confidence 9999999999999999999999999999999887654 45567788999988653 346789999999 Q ss_pred chhhccccccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEE Q lcl|NC_021072. 214 DSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKL 293 (533) Q Consensus 214 dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~ 293 (533) ||||||||||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+ T Consensus 233 dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNkl 312 (524) T protein:vir:10 233 AAIVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRV 312 (524) T ss_pred hheeeeeccceeCCCCceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCcee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCC--Ccccccc Q lcl|NC_021072. 294 VYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETE--TTFNIGR 371 (533) Q Consensus 294 vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~--~~~~~g~ 371 (533) |||++||+|+||+||||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++ ++|++|+ T Consensus 313 vYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr 392 (524) T protein:vir:10 313 VYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDVRWFRQALYMALRVPLSRIPQDQQGGVMFDS 392 (524) T ss_pred EEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCchhhcCCCCCccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999776 6899999 Q ss_pred hhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 372 AAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNT 451 (533) Q Consensus 372 ~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~ 451 (533) ++||||||+||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++ T Consensus 393 ~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~ 472 (524) T protein:vir:10 393 GTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLTM 472 (524) T ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCC Q lcl|NC_021072. 452 MDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAM 504 (533) Q Consensus 452 ~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~ 504 (533) ++||||||||++||||+||+|||+||++++|||++|+++|+|++|+++++- | T Consensus 473 ~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~-f 524 (524) T protein:vir:10 473 AEPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEESKEARFQDPDQEQED-F 524 (524) T ss_pred hhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhc-C Confidence 999999999999999999999999999999999999999999999877543 2 No 14 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=100.00 E-value=6.4e-264 Score=1463.49 Aligned_cols=495 Identities=37% Similarity=0.686 Sum_probs=469.5 Q ss_pred CCccccceeeec----------cccccccCCCCCCCCCcccceee------cccccccccchhhh----hhHHHHHHHHH Q lcl|NC_021072. 1 MSNQLFGFSLER----------AKKVPKGPSFVQKDSMDGSQPIV------GGGYYGYSVDFDGT----VRNEYELITRY 60 (533) Q Consensus 1 ~~~~~fg~~i~~----------~~~~~~~~s~~~~~~~dg~~~~~------~~~~~~~~~~~~~~----~~~~~~LI~~Y 60 (533) |.-..+|| ++. .+.+.+..|++||+++|||..+. +++|+|++++++|+ ++|+++||++| T Consensus 1 m~~~~L~~-~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~Y 79 (524) T protein:vir:72 1 MKFNVLSL-FAPWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTY 79 (524) T ss_pred CCCchhhH-hhccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHH Confidence 87777776 322 24456888999999999998883 46899999887765 78999999999 Q ss_pred HhhhhcchhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeee Q lcl|NC_021072. 61 REMVLQPECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYH 140 (533) Q Consensus 61 R~m~~~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~h 140 (533) |+||+|||||+||+||||||||||++++||+|+|+++++|++||++|++||++|+++|+|+++||++||+||||||+||| T Consensus 80 R~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fh 159 (524) T protein:vir:72 80 RNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFSDVLNHLSFQRKGSDHFRRWYVDSRIFFH 159 (524) T ss_pred HHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceecccccc-------ccccCCcceecc Q lcl|NC_021072. 141 KVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGL-------KNSTNQGMKIAT 213 (533) Q Consensus 141 kvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~-------~~~~~~~~kI~~ 213 (533) ||||++||++||++||+|||++|++||++.+++.++. .+.++..+||+|+|.+. .+++++++|||. T Consensus 160 Kiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~-------~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~ 232 (524) T protein:vir:72 160 KIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGT-------KIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPK 232 (524) T ss_pred EEEeCCCccccceeeeeeCCccceeeeeeccCCCccc-------hhhcchhhheeeccCccccccCccccCCCcceecch Confidence 9999999999999999999999999999999887654 45567788999988653 346789999999 Q ss_pred chhhccccccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEE Q lcl|NC_021072. 214 DSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKL 293 (533) Q Consensus 214 dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~ 293 (533) ||||||||||+|||+++|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+ T Consensus 233 dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNkl 312 (524) T protein:vir:72 233 AAVVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRV 312 (524) T ss_pred hheeeeeccceeCCCCceeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCcee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCC--Ccccccc Q lcl|NC_021072. 294 VYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETE--TTFNIGR 371 (533) Q Consensus 294 vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~--~~~~~g~ 371 (533) |||++||+|+||+||||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++ ++|++|+ T Consensus 313 vYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr 392 (524) T protein:vir:72 313 VYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDIRWFRQALYMALRVPLSRIPQDQQGGVMFDS 392 (524) T ss_pred EEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCchhhcCCCCCccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999776 6899999 Q ss_pred hhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 372 AAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNT 451 (533) Q Consensus 372 ~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~ 451 (533) ++||||||+||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++ T Consensus 393 ~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~ 472 (524) T protein:vir:72 393 GTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLTM 472 (524) T ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCC Q lcl|NC_021072. 452 MDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAM 504 (533) Q Consensus 452 ~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~ 504 (533) ++||||||||++||||+||+|||+||++++|||++|+++|+|++|+++++- | T Consensus 473 ~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~-f 524 (524) T protein:vir:72 473 AEPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEESKEARFQDPDQEQED-F 524 (524) T ss_pred hhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhc-C Confidence 999999999999999999999999999999999999999999999877543 2 No 15 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=100.00 E-value=2e-260 Score=1444.35 Aligned_cols=495 Identities=38% Similarity=0.680 Sum_probs=462.5 Q ss_pred CCccccc-----eee--------eccccccccCCCCCCCCCcccceee--------cccccccccchhhhhhHHHHHHHH Q lcl|NC_021072. 1 MSNQLFG-----FSL--------ERAKKVPKGPSFVQKDSMDGSQPIV--------GGGYYGYSVDFDGTVRNEYELITR 59 (533) Q Consensus 1 ~~~~~fg-----~~i--------~~~~~~~~~~s~~~~~~~dg~~~~~--------~~~~~~~~~~~~~~~~~~~~LI~~ 59 (533) |.-.=|+ |+. .++++++++.|++||+++||+.++. +|.++++|++.++.++|+++||++ T Consensus 1 ~~~~~~~~~l~~~~~~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e~~~~~~~eLI~~ 80 (524) T protein:vir:98 1 MNFLGFGNVLSFFKNFAREDEIELEQQLKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQFYSGQDPAIQNKEQLINT 80 (524) T ss_pred CCCcchhhHHHHhhhhhhhhhhhHhhhhcCCcccccCCCCCCCceeecCCCCcceecceeeeeccccccccchHHHHHHH Confidence 3222111 111 1235778999999999999998885 566667789999999999999999 Q ss_pred HHhhhhcchhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceee Q lcl|NC_021072. 60 YREMVLQPECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFY 139 (533) Q Consensus 60 YR~m~~~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~ 139 (533) ||+||+|||||+||+|||||||||+++++||+|+|+++++|++||++|++||++|+++|+|+++||++||+||||||+|| T Consensus 81 YR~ma~~pEvd~Av~eIVneaIv~~~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~f 160 (524) T protein:vir:98 81 YRGIMSYPEVENAVSEIIDDAIVNEQGKDIITMDLAKTNFSKAIQDKIVEEFDNVLNIYDFDNMGARLFRDWYVDSRIYF 160 (524) T ss_pred HHHHhhccchhhHHHhhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccc-------cccccCCcceec Q lcl|NC_021072. 140 HKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKG-------LKNSTNQGMKIA 212 (533) Q Consensus 140 hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~-------~~~~~~~~~kI~ 212 (533) ||||| +++++||++||+|||++|++||++.++..+. +..++++..+||+|+|.+ ..+++++.+||| T Consensus 161 hkiid-~~~~kGI~ELr~lDPr~i~~vr~~~~~~~~~------~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~ 233 (524) T protein:vir:98 161 HKIMH-KDESKGIRELRQLDPRCMELIRESITETLDG------GVKVFRGYREFFVYSAPKAGYTYNGQIYQANQKIKIP 233 (524) T ss_pred EEEEc-CCCCcceeeeeeeCCccceeeeecccccccc------chhhccceeeeeeeccCCCccccccceecCCCceeec Confidence 99999 6677899999999999999999998887443 233566788999998754 445688999999 Q ss_pred cchhhccccccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccE Q lcl|NC_021072. 213 TDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNK 292 (533) Q Consensus 213 ~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk 292 (533) .|||+||||||+||+++ |+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++|||| T Consensus 234 ~dAIvy~hSGL~d~~~~-iisyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNk 312 (524) T protein:vir:98 234 RSAIVYAHSGLEDCSNN-IIGYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKNR 312 (524) T ss_pred hhheeeeccCcccCCCC-eeeehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCce Confidence 99999999999999976 67999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCC-CCcccccc Q lcl|NC_021072. 293 LVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLET-ETTFNIGR 371 (533) Q Consensus 293 ~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~-~~~~~~g~ 371 (533) +|||++||+||||+||||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||++ +++|++|| T Consensus 313 lvYDa~TGevrddrk~msMlEDyWLpRReGgrgTEItTLpggqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr 392 (524) T protein:vir:98 313 VVYDARTGTVKNQQNNLSMTEDYWLMRRDGKAITEVSTLPGGQNFSDMDDIKWFNRKLYEALRVPLSRMPRDDGGMQIGG 392 (524) T ss_pred eEeeccCceeeccccccchhhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCceeccCCCCcccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999985 57999999 Q ss_pred hhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 372 AAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNT 451 (533) Q Consensus 372 ~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~ 451 (533) ++||||||+||+|||.|||+||+.+|.++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++ T Consensus 393 ~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~ 472 (524) T protein:vir:98 393 GGEITRDELKFSKFIRTLQIQFSPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQ 472 (524) T ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccC Q lcl|NC_021072. 452 MDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPA 503 (533) Q Consensus 452 ~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~ 503 (533) ++||||||||++||||+||+|||+||++++|||++|.++|+|++|+++++-= T Consensus 473 ~dpyvGky~s~dyi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~p~~e~~~f 524 (524) T protein:vir:98 473 VEGVVGKYVSHKYIMKEILRMSDEDIDEQAKLIEEESKEERFKNPEAEEENF 524 (524) T ss_pred hccccccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCcCCccccccC Confidence 9999999999999999999999999999999999999999999999886543 No 16 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=100.00 E-value=3.9e-258 Score=1431.77 Aligned_cols=484 Identities=40% Similarity=0.729 Sum_probs=455.9 Q ss_pred cceeee------ccccccccCCCCCCCCCccccee--------ecccccccccchhhhhhHHHHHHHHHHhhhhcchhhh Q lcl|NC_021072. 6 FGFSLE------RAKKVPKGPSFVQKDSMDGSQPI--------VGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDS 71 (533) Q Consensus 6 fg~~i~------~~~~~~~~~s~~~~~~~dg~~~~--------~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~ 71 (533) |-|--+ +++++.++.|++||+++||++++ .+|+|+|++++.++.++++ +||++||+||+|||||+ T Consensus 1 ~~~w~~~de~~~~~~~~~~~~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~-eLI~~YR~ma~~pEvd~ 79 (511) T protein:vir:56 1 MKFWTKEEEQDIQKIEKNPVRSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTIPVK-ELIKSYRALAEYHEVDD 79 (511) T ss_pred CCCccchhhhhhhhhccCCcccccCCCCCCCceEEecccccceecceeccccccccCccchH-HHHHHHHHHhhccchhh Confidence 332211 12446788899999999999888 4777899999999999986 99999999999999999 Q ss_pred HHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCC Q lcl|NC_021072. 72 AVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGG 151 (533) Q Consensus 72 AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~g 151 (533) ||+||||||||||++++||+|+|+++++|++||++|++||++|+++|+|+++||++||+||||||+|||||||| ++| T Consensus 80 Av~eIvne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~---k~G 156 (511) T protein:vir:56 80 AIQEIVDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVVSLLQMRKHGYKWFRKWYVDSRIYFHKILDK---DNN 156 (511) T ss_pred HHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecc---ccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999986 569 Q ss_pred eEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceecccccccc--------ccCCcceeccchhhcccccc Q lcl|NC_021072. 152 LTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKN--------STNQGMKIATDSVTYCHSGI 223 (533) Q Consensus 152 I~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~--------~~~~~~kI~~dai~y~hsGl 223 (533) |++||+||||+|++||++.+++.++. .+.....+||+|+|++... .++++++||.+||||||||| T Consensus 157 I~eLr~lDPr~i~~vr~i~~~~~~~~-------~v~~~~~ey~~Y~~~~~~~~~~~~~~~~~~~~vkI~~daI~y~hSGL 229 (511) T protein:vir:56 157 IIELRPLNPMKMELVREIQKETIDGV-------EVVKGTLEYYVYKQSDYKMPSWMSATNRAQTSFRIPKDAIVFAHSGL 229 (511) T ss_pred eeehhhcCcccchhhhhhhccccccc-------ccccceeeeeEecCCCcccCcccccccccccceeechhheeeecccc Confidence 99999999999999999999876643 3455669999999976321 23578999999999999999 Q ss_pred ccCC--CCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCc Q lcl|NC_021072. 224 QDLN--KNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGE 301 (533) Q Consensus 224 ~d~~--~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGe 301 (533) +||| +|+|+||||+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+ T Consensus 230 ~d~~~~~g~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TGe 309 (511) T protein:vir:56 230 MRGCADDPYIIGYLDRAIKPANQLKMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQ 309 (511) T ss_pred eeccCCCCeeeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCce Confidence 9965 66799999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCC---CcccccchhhhhHH Q lcl|NC_021072. 302 IKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETE---TTFNIGRAAEITRD 378 (533) Q Consensus 302 v~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~---~~~~~g~~~eItRD 378 (533) |+||+||||||||||||||||||||||||||||||||||+||+||++|||+|||||+|||+++ ++|++||++||||| T Consensus 310 v~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRD 389 (511) T protein:vir:56 310 VKNTTNAMSMLEDYYLPRREGSKGTEVSTLPGGQSLGDIEDVLYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRD 389 (511) T ss_pred eccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCccccccccchhhhHH Confidence 999999999999999999999999999999999999999999999999999999999999977 48999999999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_021072. 379 EVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGK 458 (533) Q Consensus 379 ElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGk 458 (533) |+||+|||.|||+||+.+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||| T Consensus 390 EiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGk 469 (511) T protein:vir:56 390 ELKFTKFVKRLQTKFETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGK 469 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccc Q lcl|NC_021072. 459 YFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEM 500 (533) Q Consensus 459 y~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~ 500 (533) |||++||||+||+|||+||++++|||++|+++++|+.|+... T Consensus 470 y~S~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~e~~f 511 (511) T protein:vir:56 470 YYSHKYIQKNILRLSDDQITAMQSEIDEEETNPRFQQDDQGF 511 (511) T ss_pred ccchHHHHHHHhccCHHHHHHHHHHHHHhhcCCCCCCcccCC Confidence 999999999999999999999999999999999999887765 No 17 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=100.00 E-value=2e-225 Score=1252.46 Aligned_cols=477 Identities=22% Similarity=0.337 Sum_probs=423.6 Q ss_pred CCccccceeeeccccccccCCCCCCCCCc-ccceeecccccccccchhhhhhHHHHHHHHHHhhh-hcchhhhHHHHhhc Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMD-GSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMV-LQPECDSAVDDIVN 78 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~d-g~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~-~~pEvd~AvdeIvn 78 (533) +.+++|||+ .++.+++.|++|++.+. +++.+.+++||| |.++|+++||++||+|+ +|||||+||++||| T Consensus 19 ~~~~~~~~~---~p~~~dG~s~i~~~~~~~~~~~~~~~~~~g------g~~~n~~eLI~~YR~ma~~~pEVd~AideIvn 89 (533) T protein:vir:58 19 FLSPMYGMG---APHGAGGSSMIPINMYHPFATAGYASRFYG------GIEFNRFFLYDMYDRMDYTDPLISTVLDIIAD 89 (533) T ss_pred hhchhhccc---CccCCCCCccccCCCCcchhhhhhhhhhhc------cccccHHHHHHHHHHhhccCcchhhHHHhhhc Confidence 888999986 46678888999987554 456667777776 57889999999999997 58999999999999 Q ss_pred ceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEc Q lcl|NC_021072. 79 ETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYI 158 (533) Q Consensus 79 eaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~l 158 (533) |||||++++.||+|+|+++++|++||++| +++|+|+++||++||+||||||+||||++ +||++||++||+| T Consensus 90 eaiv~d~~~~pV~v~l~~~e~s~~iK~kI-------~~lldf~~~~~~~fR~WYVDGriy~Hkii--k~~k~GI~elr~l 160 (533) T protein:vir:58 90 ECTIPNENGNIVDVVTKDIELAKAILSYL-------DYVINIEKNAYPIIRNMIKYGDMFLHILE--KGSDGTIEKFQVV 160 (533) T ss_pred eeeEecCCCceeEeecccccccHHHHHHH-------HHHhcchhhhhHHHHhhhhcceeEEEecc--CCcccchhhheec Confidence 99999999999999999999999998765 57999999999999999999999999977 5799999999999 Q ss_pred ChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccc-cccCCcceeccchhhccccccccCCCCccchhHHH Q lcl|NC_021072. 159 DPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLK-NSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHK 237 (533) Q Consensus 159 DP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~-~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~ 237 (533) ||++|++||++.++ .+||+|+|.+.+ .++++.++||.+||+||||||+|||+++|+||||+ T Consensus 161 DPr~i~~vr~~~t~------------------~eyyvy~~~~~~~~s~~~~~kI~~daI~y~~SGl~d~~~~~iisyLhk 222 (533) T protein:vir:58 161 SPYIFSKRYNPETD------------------TWYYVITDVYRNVVSGYFNEDIPEEDVIHFSHKIDTNFFPYGRSYLES 222 (533) T ss_pred CCeeeEEEEeeccc------------------eEEEeecccccccccCccccccchhheeeeeeccccCCCCceehhhhH Confidence 99999999998765 488999998864 46778899999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccc---hhHhh Q lcl|NC_021072. 238 AIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFM---SMLED 314 (533) Q Consensus 238 AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~m---smlED 314 (533) |||||||||||||||||||+||||||||||||||||||.||+|||++||++||||+|||++||+|+||++|| ||||| T Consensus 223 AiKp~NQLkmiEDAlVIYRisRAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlED 302 (533) T protein:vir:58 223 ARAIWNQLRLMEDALMLYRVVRSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKD 302 (533) T ss_pred HHHHHHHHHHHHHHHHHHhhcCChhheEEEEeecCCCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhh Confidence 999999999999999999999999999999999999999999999999999999999999999999999998 99999 Q ss_pred hcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHH Q lcl|NC_021072. 315 FWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFS 394 (533) Q Consensus 315 ywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs 394 (533) ||||||+||||||||||||| |||+|+||+||++|||+|||||+|||+++++| ||++|||||||||+|||+|||++|+ T Consensus 303 yWLpRReGgrgTEI~TLpGg-~lgemeDV~YF~kkLy~ALnVP~sRl~~e~~f--gr~~eItRDEiKF~KFI~rLR~rF~ 379 (533) T protein:vir:58 303 YFIPRRGDRRAVEIDILQGS-KVDLAEDVEYMLNRLISALKVPKAFIGYEGDV--NAKNTLATQDIKFNNTIKRIQGFFV 379 (533) T ss_pred hcccccCCCccceeeecCCC-CCCcHHHHHHHHHHHHHHhCCCeeecCCCCCC--ccchhhhHHHHHHHHHHHHHHHHHH Confidence 99999999999999999998 59999999999999999999999999999987 9999999999999999999999998 Q ss_pred HHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCH Q lcl|NC_021072. 395 ELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTD 474 (533) Q Consensus 395 ~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tD 474 (533) ++| ++||+||||||++|| +|+|++||||+|+|++|||++|++++++++||||| +||||+||+||| T Consensus 380 ~ll----~~qLilk~iit~eew-------~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dpyvgk----~yi~k~ILr~td 444 (533) T protein:vir:58 380 EEL----ERMVRMNKEFADQDF-------RLVMNRSNSIVEGERFAVIEQRIGIAERLKGWVRE----DWIYSNILQIPY 444 (533) T ss_pred HHH----hcccccccCcchhhe-------eeeeeccchHHHHHHHHHHHHHHHHHHHhcchhhH----HHHHHHHhcCCh Confidence 776 559999999999999 69999999999999999999999999999999999 799999999998 Q ss_pred HHHHHHHHHHHHhhhcCCCCCCC--cccccCCCC--CC----------------CCC-------CCCccccccccCCc-- Q lcl|NC_021072. 475 QEIKEIDKQIDSEREAGLIVDPM--AEMDPAMDP--GN----------------APP-------ADDMSAQEGPAVDA-- 525 (533) Q Consensus 475 eeI~e~~kqi~~E~~~~~~~~p~--~~~~~~~~~--~~----------------~~~-------~~d~~~~~~~~~~~-- 525 (533) ||++++++|++|.++|+|+.|+ ++++|+--. .+ +++ ..+++.++.++.+. T Consensus 445 -ei~~q~e~ie~E~~~~~~~~~~~~~e~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~g~~ 523 (533) T protein:vir:58 445 -DLKPQEEVAEAAGGGGLFDTGGFGEETTPADFLGERGSPIESPRGRTEFDFGTEGGEELGGELNLGGAFEEFEEETGGG 523 (533) T ss_pred -hhhHHHHHHHHhhcCCCCCCCCcccccCCcccCccccCcccCCCChhhHhcccCCcccccccccccccchhhhhhcCCc Confidence 6777778999999999999874 333332110 00 110 11233333222222 Q ss_pred ---cccchhc Q lcl|NC_021072. 526 ---GDAKRGE 532 (533) Q Consensus 526 ---~~~~~~~ 532 (533) ...+++| T Consensus 524 ~~~~~~p~~~ 533 (533) T protein:vir:58 524 EEELPFPEEE 533 (533) T ss_pred ccCCCCCCCC Confidence 2333333 No 18 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.51 E-value=1.3e-12 Score=85.68 Aligned_cols=463 Identities=14% Similarity=0.116 Sum_probs=247.7 Q ss_pred CCccccceeeeccccccccCCCCCC-------------CCCcccceeecccccccccchhhhhh-HHHHHHHHHHhh-hh Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQK-------------DSMDGSQPIVGGGYYGYSVDFDGTVR-NEYELITRYREM-VL 65 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~-------------~~~dg~~~~~~~~~~~~~~~~~~~~~-~~~~LI~~YR~m-~~ 65 (533) |+ ..++.+++..| ...+|+-..-..++.....+.+..+. +...|..+-|.+ .+ T Consensus 1 mn------------~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rN 68 (502) T protein:vir:79 1 MA------------ILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNN 68 (502) T ss_pred Cc------------hHhhHHhhcChHHHHHHHhhHHHHhhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhc Confidence 21 11222222111 11222211111112222222233333 688899999999 59 Q ss_pred cchhhhHHHHhhcceeeecCCCceE--EEEeccCCCcHHHHHHHHHHHHHHHH------HhcchhhhhHHHHhhhhcCce Q lcl|NC_021072. 66 QPECDSAVDDIVNETICGNFDDVPV--EVELSNLKQSDKIKKLIREEFAEILR------LLDFENRSYEIFRRWYVDGRL 137 (533) Q Consensus 66 ~pEvd~AvdeIvneaiv~d~~~~~v--~v~l~~~~~S~~ik~~I~eeF~~i~~------lL~f~~~~~~~fR~WYvDGri 137 (533) +|-+.+||+-+++-+|=+ ++..+ .++..+.+..+.+.++|..+|+.-.+ .++|..--.-.+|.|.+||.. T Consensus 69 n~~a~~av~~~~~nvVG~--ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~ 146 (502) T protein:vir:79 69 HDLVIGVFDKLEERVVGK--NGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEV 146 (502) T ss_pred ChHHHHHHHHHHHhhccC--CceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCce Confidence 999999999999887721 12222 23345555678889999999987765 345555556689999999999 Q ss_pred eeeeeecCCCC-CCCe---EEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceecc Q lcl|NC_021072. 138 FYHKVIDPKNP-RGGL---TELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIAT 213 (533) Q Consensus 138 ~~hkvid~~~~-~~gI---~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~ 213 (533) |..++.++... +.|. ..|+.|+|..|.--+ ..++...-..-. .-.+.-..|+++...+.........+||. T Consensus 147 f~~~~~~~~~~~~~g~~~~l~lq~iepd~l~~~~----~~~~~i~~GVe~-d~~Gr~~aY~i~~~hPgd~~~~~~~rvpA 221 (502) T protein:vir:79 147 FAQMVSGRINSLTPSAGVHFWLEALEPDFIPMTS----DESNRLNQGVFV-DDWGRPEKYLVYKSRPVSGRQMETKEVDA 221 (502) T ss_pred EEEEeecccCccCCCcccceEEEEecchhcCCCC----CCCCeeEeeeEE-CCCCceEEEEEeecCCCCCcccceeEech Confidence 99999976432 2222 589999999984211 111111111111 12356678888866555445556678887 Q ss_pred chhhccccccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEE Q lcl|NC_021072. 214 DSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKL 293 (533) Q Consensus 214 dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~ 293 (533) ..|.-.... ..+.---++|.|..+++.+.+|.-.+||..+-...-|-.=-+..-+.|.-....+ T Consensus 222 ~~vlH~f~~-~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~--------------- 285 (502) T protein:vir:79 222 ERMLHLKFV-RRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDG--------------- 285 (502) T ss_pred hheEEeecc-cCCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccccc--------------- Confidence 654333332 2344556789999999999999999999999988888764444444333111000 Q ss_pred EeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCCCcccccch Q lcl|NC_021072. 294 VYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELED-VKYFQKKLYKALNVPSSRLETETTFNIGRA 372 (533) Q Consensus 294 vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~~~~~~g~~ 372 (533) .+.. .+....+|-.-==+ ..-.-|.+|..+.....-+..++ ++...+.+=.+|+||-.-|..+ |+ ++- T Consensus 286 -~~~~-----~~~~~~~l~pG~i~--~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D--~s-~ny 354 (502) T protein:vir:79 286 -NGSK-----ENERELTIQPGIIY--DDLKPGEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARN--YN-GTY 354 (502) T ss_pred -CCCC-----CccccccccCCccc--cccCCCceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcc--cc-chH Confidence 0000 11111111000000 00122455555555444444333 3344444667899999998776 33 244 Q ss_pred hhhhHHhhhHHHHHHHHHHHHHHHHHHHHH----HHHHhccCCCHhHHhhhhhceeEEE--eccchHHHHHHHHHHHHHH Q lcl|NC_021072. 373 AEITRDEVKFQKFIARLRKRFSELFMDLLK----TQLILKGVMSLEEWDEMKEHIQFDF--IADNYFTELKEIEIRNERM 446 (533) Q Consensus 373 ~eItRDElkF~Kfi~rLr~~fs~if~d~Lk----~qLilkgi~t~eew~~~~~~i~~~f--~~Dn~f~E~ke~Ei~~~R~ 446 (533) |.+.---+.|-+.+.++|+.|..-|..++- ...+|.|.+..-.|.+-.......| ..--+.-.+||+.-...++ T Consensus 355 Ss~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i 434 (502) T protein:vir:79 355 SAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQI 434 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHH Confidence 555556667999999999988887776533 3567888876544443333333334 3333345566655444443 Q ss_pred HHHHHhhhhccccccHHHHHHHHhCCCHHHHH-HHHHHHHHhhhcCCCCCCCcccccC--CC--CCCCCCCCCccccc Q lcl|NC_021072. 447 NQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIK-EIDKQIDSEREAGLIVDPMAEMDPA--MD--PGNAPPADDMSAQE 519 (533) Q Consensus 447 ~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~-e~~kqi~~E~~~~~~~~p~~~~~~~--~~--~~~~~~~~d~~~~~ 519 (533) + .-.-|.+-+..+ .+..-+++- |.+...+...+.|+..+....-.++ .. ...+++.++...++ T Consensus 435 ~---------~Gl~t~~~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 435 R---------GGAATESDWVRA-GGRNPDDVKRRRKAEIDENRKLDLVFDTDPASDKGGSSAATKRQEPQHTDDQSEE 502 (502) T ss_pred H---------cCCCCHHHHHHH-cCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 3 233455555555 455444433 3333333344456533222111111 11 11122222222222 No 19 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.41 E-value=3.2e-12 Score=83.60 Aligned_cols=457 Identities=14% Similarity=0.142 Sum_probs=206.1 Q ss_pred CCccccceeeeccc--cccccCCCCCCC---CCccc-----ceee---cccccccccch-------hhhhhHHHHHHHHH Q lcl|NC_021072. 1 MSNQLFGFSLERAK--KVPKGPSFVQKD---SMDGS-----QPIV---GGGYYGYSVDF-------DGTVRNEYELITRY 60 (533) Q Consensus 1 ~~~~~fg~~i~~~~--~~~~~~s~~~~~---~~dg~-----~~~~---~~~~~~~~~~~-------~~~~~~~~~LI~~Y 60 (533) -..+++++-.+.++ ......-+..|. .+|++ +... .++..++.+.. .+.+ .-++|...| T Consensus 39 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f-~gyql~alY 117 (765) T protein:vir:96 39 KLGKIRGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAGGQNPYVVPTMLQDWYNSQGF-IGYQACAII 117 (765) T ss_pred hHHHHhhcccccccCCCCCCCCcccCcccceeccccccccccchHHHhhhccCccchhhHHHhhhcccCC-ccHHHHHHH Confidence 01123332222211 111111111111 11221 1111 11111110000 0011 135677777 Q ss_pred HhhhhcchhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHH--hhhhcCcee Q lcl|NC_021072. 61 REMVLQPECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFR--RWYVDGRLF 138 (533) Q Consensus 61 R~m~~~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR--~WYvDGri~ 138 (533) ++ ++.+..+|+-++.+|+. ..+.|..+.-+.++...++|.. .++.|++..+..+.+| |.|-.|-++ T Consensus 118 ~~---~~l~rkiVd~pAeDa~R-----~g~~I~~~~~e~~~~~~~~l~~----~~~rl~v~~~l~ea~~~~RlyGga~i~ 185 (765) T protein:vir:96 118 SQ---HWLVDKACSMSGEDAAR-----NGWELKSDGRKLSDEQSALIAR----RDMEFRVKDNLVELNRFKNVFGVRIAL 185 (765) T ss_pred Hh---CchhhhhhhcchHHhhc-----CCceeecCccccCHHHHHHHHH----HHHHhhHHHHHHHHHHHhhhceeeEEE Confidence 65 89999999999999985 4466665543444444444444 4445577888888888 777777666 Q ss_pred eeeeecCCC-------------CCCCeEEEEEcChhhceehhh-ccCCCcCceeEEeccceeeccchhceeccccccccc Q lcl|NC_021072. 139 YHKVIDPKN-------------PRGGLTELRYIDPRKIRKVTE-YQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNS 204 (533) Q Consensus 139 ~hkvid~~~-------------~~~gI~elr~lDP~~i~~vr~-~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~ 204 (533) +- ++..+ .+++++.|+.+||..+..... ....++ ....|+.- ++|..+.. ..- T Consensus 186 i~--i~~~D~~~l~~PL~~~~I~kg~~kgl~vldp~~~~~~~v~e~~~Dp--------~sp~fg~P-~~y~i~g~--~IH 252 (765) T protein:vir:96 186 FV--VESDDPDYYEKPFNPDGIAPGSYKGISQIDPYWAMPQLTAESTADP--------SAEHFYEP-DFWIISGK--KYH 252 (765) T ss_pred EE--ecccCcchhhccccccccccceeeEEEEechhhcccccchhccccc--------cccccCcc-eeeeecCc--eec Confidence 54 32111 234677788888877754221 111111 11112211 11111110 000 Q ss_pred cCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHHHH--HHHHHHHHHhcCccceEEEccCCCC-chHHHHHH Q lcl|NC_021072. 205 TNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMI--EDSLVIYRLSRAPERRIFYIDVGNL-PKNKAEQY 281 (533) Q Consensus 205 ~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~--EDalVIyRi~RAPeRrvfyIDvGnl-pk~KAeqY 281 (533) +.-.+++.-..+.+. +.......+.|-|+++...+..+... +.+.++++-. . +++.+|.... ...++... T Consensus 253 ~SRli~~~g~~lpd~---lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~---~-~v~k~~~~~~l~~~~~l~~ 325 (765) T protein:vir:96 253 RSHLVVVRGPQPPDI---LKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKR---T-STIHVDVEKAIANEDAFNA 325 (765) T ss_pred cceEEEecCCCchhh---hccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhc---c-ceeeechHhhhccHHHHHH Confidence 000111111111111 12223344678888876555554322 3445566432 2 3566665532 11112111 Q ss_pred HHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHH-HHHHHHHHHhcCCCccc Q lcl|NC_021072. 282 LREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDV-KYFQKKLYKALNVPSSR 360 (533) Q Consensus 282 l~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV-~YF~~kLy~aL~VP~sR 360 (533) --+.++++|+- .|- +-+ +++-+++++. .+|+-++|+ ..|...+=.+.+||+.| T Consensus 326 r~~~~~~~r~n------~g~-------~~i-----------d~ee~~e~~s--~~lsgl~d~l~~~~~~iAaas~IP~t~ 379 (765) T protein:vir:96 326 RLAFWIANRDN------HGV-------KVI-----------GIDETMEQFD--TNLSDFDSVIMNQYQLVAAIAKTPATK 379 (765) T ss_pred HHHHHHHhcCC------cee-------EEe-----------cCCcceeEEe--cccCCHHHHHHHHHHHHHhhhCCCeee Confidence 11223334321 111 000 0112344443 356666774 67899999999999999 Q ss_pred cCCC--CcccccchhhhhHHhhhHHHHHHHHHHH-HHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHH Q lcl|NC_021072. 361 LETE--TTFNIGRAAEITRDEVKFQKFIARLRKR-FSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELK 437 (533) Q Consensus 361 l~~~--~~~~~g~~~eItRDElkF~Kfi~rLr~~-fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~k 437 (533) |-.+ .|+|-+.-.+ .-.|+.+|+.+|.. +..+...+++. |++-|. +.+.+.|.|..=..=+|.. T Consensus 380 LfGqsp~GlnATGe~D----~~nYyD~I~s~Qe~~l~p~le~L~~l-i~~s~~--------i~~d~~i~FnpL~~~sekE 446 (765) T protein:vir:96 380 LLGTSPKGFNATGEHE----TISYHEELESIQEHIFDPLLERHYLL-LAKSES--------IDVQLEIVWNPVDSTTSQQ 446 (765) T ss_pred eccCCcccccCcchHH----HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcC--------CCCcceEEeCCCCCCCHHH Confidence 9554 5886533333 33499999999965 45555554444 455454 2346889998877788888 Q ss_pred HHHHHHHHHHHHHHhhhhccccccHHHHHHHHh--------CCCHHHHHHHHHHHHHhhhcCCCCCCCcccc--cCCCCC Q lcl|NC_021072. 438 EIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVL--------KQTDQEIKEIDKQIDSEREAGLIVDPMAEMD--PAMDPG 507 (533) Q Consensus 438 e~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL--------~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~--~~~~~~ 507 (533) .+|+...+.++++.+-.- -.+|.+-++..+. .+++++++... .+..|.... +..|+++.. ++.+.. T Consensus 447 kAei~~k~Aea~~~~~~~--Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~~-~~~pe~~~~-~~~~~~~~~~~~~e~~~ 522 (765) T protein:vir:96 447 QAELNNKKAATDEIYINS--GVVSPDEVRERLRDDPRSGYNRLTDDQAETEP-GMSPENLAE-LEKAGAQSAKAKGEAER 522 (765) T ss_pred HHHHHHHHHHHHHHHHhc--CCCCHHHHHHHHhccccCCCCCCCcccccccc-CCCcccccc-ccCCCcccccccCcccc Confidence 999999999999887443 3678888887532 23455543211 111111111 111111100 000000 Q ss_pred CCCCCCC--cccccc---ccCCccccch--------------------------------hcC Q lcl|NC_021072. 508 NAPPADD--MSAQEG---PAVDAGDAKR--------------------------------GEF 533 (533) Q Consensus 508 ~~~~~~d--~~~~~~---~~~~~~~~~~--------------------------------~~~ 533 (533) +.++++. +..+.. |.......+. +++ T Consensus 523 ~~a~p~~~eg~~~~~~~~p~~~~p~~~~~~~~~g~~~~~p~~~~p~~~~~~~~~~~~~~~~~~ 585 (765) T protein:vir:96 523 AEAQAGAVEGAGDPVPAAPRGTKPLAKAAEEGAGEAATPPSRPNPRAELRNLLSDLLSKLEAL 585 (765) T ss_pred ccCCCCccCCCCcccccCCcccCCccccccccCccccCccccccccccchhcccchhhhhhcc Confidence 0000000 000000 0000000000 011 No 20 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.41 E-value=4.2e-12 Score=82.95 Aligned_cols=452 Identities=11% Similarity=0.104 Sum_probs=192.1 Q ss_pred CCcc---ccce-------eeeccccc----cccCCCCCCCCCcccceee-----------cccccccccch-------hh Q lcl|NC_021072. 1 MSNQ---LFGF-------SLERAKKV----PKGPSFVQKDSMDGSQPIV-----------GGGYYGYSVDF-------DG 48 (533) Q Consensus 1 ~~~~---~fg~-------~i~~~~~~----~~~~s~~~~~~~dg~~~~~-----------~~~~~~~~~~~-------~~ 48 (533) =+.+ .|+- .+...... .+.+.++-+...+|..... .+++.++.+.. -+ T Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 99 (537) T protein:vir:10 20 RIEPRVGIFGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQ 99 (537) T ss_pred ccccccCCCcccchhhHHHHHHhhhhccCCCCCccCcccccccccccchhccccccchhhhhhhccccccchhhhhcccc Confidence 0000 1110 11110000 0001111111222221111 11111111000 00 Q ss_pred hhhHHHHHHHHHHhhhhcchhhhHHHHhhcceeeecCCCceEEEEeccCC-CcHHHHHHHHHHHHHHHHHhcchhhhhHH Q lcl|NC_021072. 49 TVRNEYELITRYREMVLQPECDSAVDDIVNETICGNFDDVPVEVELSNLK-QSDKIKKLIREEFAEILRLLDFENRSYEI 127 (533) Q Consensus 49 ~~~~~~~LI~~YR~m~~~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~-~S~~ik~~I~eeF~~i~~lL~f~~~~~~~ 127 (533) .+. -++|...|+ .++++..||+-++++|+. ..+.|..++.+ .+....++ |...++.|++..+..+. T Consensus 100 ~~~-~~~l~a~Y~---~~~l~r~iVd~~A~d~~r-----~~~~i~~~~~~~~~~~~~~~----l~~~~~~l~~~~~l~~a 166 (537) T protein:vir:10 100 AFI-GHQMCALIA---THWLVNKACSQMPRDAMR-----KGYKIISDDGNELDPKDAKF----IDRYDRAFNIKKHAIQF 166 (537) T ss_pred CCc-cHHHHHHHH---hCchhhhhhhhhhHHhhc-----CCceeecCCcccccHHHHHH----HHHHHHHhhHHHHHHHH Confidence 111 246667775 699999999999999975 44555543322 22333344 44445556666666666 Q ss_pred HHhhhhcCceeeeeeecCCC-------------CCCCeEEEEEcChhhceehh-hccCCCcCceeEEeccceeeccchhc Q lcl|NC_021072. 128 FRRWYVDGRLFYHKVIDPKN-------------PRGGLTELRYIDPRKIRKVT-EYQQKRPEQLRGEDINTQLTQKAAEY 193 (533) Q Consensus 128 fR~WYvDGri~~hkvid~~~-------------~~~gI~elr~lDP~~i~~vr-~~~~~~~~~~~~~~~~~~~~~~~~e~ 193 (533) +|.=-+.|.=+.-..++..+ .+++++.|+.+||..+.+.. .....++... -|+.-.. T Consensus 167 ~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp--------~fg~P~~- 237 (537) T protein:vir:10 167 VRKGRIFGIRIALFKVDSPDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPVSM--------HFYEPTY- 237 (537) T ss_pred HHhcccccceEEEEeecCcCCcccccccccccccccceeEEEEechhhcccccchhhhccCCcc--------ccCCcee- Confidence 66423334433333332112 24468889999998886532 1222222221 1111111 Q ss_pred eeccccccccccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHHHHHH--HHHHHHHhcCccceEEEccCC Q lcl|NC_021072. 194 YLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIED--SLVIYRLSRAPERRIFYIDVG 271 (533) Q Consensus 194 ~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~ED--alVIyRi~RAPeRrvfyIDvG 271 (533) |..+ +...-+.-.+++.-..+.+. +..+.+..+.|-|+++...+.+...... +.++|+-. =+++.+|.. T Consensus 238 y~v~--g~~iH~SRli~f~g~~~p~~---~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~----~~v~k~~~~ 308 (537) T protein:vir:10 238 WLIN--GKKYHRSHLAIYINDEVVDF---LKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKR----QTVLKVDAA 308 (537) T ss_pred eeec--CeEecceeEEEecCCCCchh---hhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcC----CceeeechH Confidence 1100 01100001111111111111 1223344567888887666554433222 23344322 235566532 Q ss_pred C-CchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHH-HHHHHH Q lcl|NC_021072. 272 N-LPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDV-KYFQKK 349 (533) Q Consensus 272 n-lpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV-~YF~~k 349 (533) . |....+..-.-+.++++|+ ..|-+- + -.+ +-++.++- .+|+-++|+ ..|... T Consensus 309 ~~l~~~~~~~~r~~~~~~~r~------n~g~~~-------i-------d~e---~e~~e~~~--~~lsgl~~~l~~~~~~ 363 (537) T protein:vir:10 309 QVLANKQQFDETMSWWTATRD------NYQVRV-------V-------DKD---NEDVVQID--TTLNDLDKVIMNQYQL 363 (537) T ss_pred HhhcCHHHHHHHHHHHHhhcC------CcceeE-------e-------cCC---CceeEEEe--ccCCCHHHHHHHHHHH Confidence 1 1112111111122333332 111100 0 000 01122211 245556664 567777 Q ss_pred HHHhcCCCccccCCC--CcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEE Q lcl|NC_021072. 350 LYKALNVPSSRLETE--TTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDF 427 (533) Q Consensus 350 Ly~aL~VP~sRl~~~--~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f 427 (533) +=.+++||+.||-.+ +||+-..-.+ .+. |+.||+++|.++..++..+++. |.+.-..+ ...+.|.| T Consensus 364 iAa~~~IP~t~L~G~sp~GlnatGe~D-~~~---yyd~I~~~Qe~l~p~l~~l~~l--l~~~~~~~------~~~~~i~f 431 (537) T protein:vir:10 364 VCAIARTPAPKMLGTVPTGFNSTGDYE-EAS---YHEECESTQDDMRPLIDRHHQL--VCRSHLRK------RIRVKVEF 431 (537) T ss_pred HHhhhCCCceeeccCCccccccchhHH-HHH---HHHHHHHHHHHHHHHHHHHHHH--HHHhcCCC------CcceEEEe Confidence 888889999998443 5776422222 344 9999999999888877776643 33222222 13578888 Q ss_pred eccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCC-------------HHHHHHHHHHHHHhhhcCCCC Q lcl|NC_021072. 428 IADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQT-------------DQEIKEIDKQIDSEREAGLIV 494 (533) Q Consensus 428 ~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~t-------------DeeI~e~~kqi~~E~~~~~~~ 494 (533) ..=...++...+|+...+.++++.+-.- -.+|.+-++.. |+.. ++++++.. ++.|.+.+.. T Consensus 432 ~pL~~~s~kEkAei~~~~a~a~~~~~~~--G~i~~~Evr~~-L~~~~~~g~~~l~~~~~~ed~e~~~--~~~~~~~~~~- 505 (537) T protein:vir:10 432 PPMDAPKESERADTFLKKMQAAKLAFEM--GAVDGVDVNEY-LRMDPTLGFTSITPAMRPTDAEDID--VDDEGKPVRI- 505 (537) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHc--CCCCHHHHHHH-HhccCccccccccCCCChhhhhccc--CCccCCcCCC- Confidence 8777788988999999999999887543 26777777755 3331 11111110 0111111000 Q ss_pred CCCcccccCCCCCCCCCCCCccccccccCCccccch Q lcl|NC_021072. 495 DPMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDAKR 530 (533) Q Consensus 495 ~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 530 (533) ...++.+.+..++...+.....++...+-+++ T Consensus 506 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~ 537 (537) T protein:vir:10 506 ----IEDQPAPSEMFGATSSGESANDPRDSGAAFED 537 (537) T ss_pred ----CCCCCCccccCCCCccccccCCCccCccccCC Confidence 00011100000000000000000000011111 No 21 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.36 E-value=1.3e-11 Score=80.19 Aligned_cols=443 Identities=14% Similarity=0.177 Sum_probs=199.0 Q ss_pred CCccc---------cceeeeccccccccCCCCCCCCCcccceeecccccc-c--ccc---hhhhhhHHHHHHHHHHhhhh Q lcl|NC_021072. 1 MSNQL---------FGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYG-Y--SVD---FDGTVRNEYELITRYREMVL 65 (533) Q Consensus 1 ~~~~~---------fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~-~--~~~---~~~~~~~~~~LI~~YR~m~~ 65 (533) |+++- =++.++. .+..+...+...+......+ ..+ . ... .....-..++|...|+ . T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~a~~~g-~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~---~ 93 (532) T protein:vir:94 23 VDAKRATHTSLGLATAHEIDP-----TAYSPYERNAAQNAMAMDYG-LQTGRNGRNALSFVEATSWPGFPTLALLA---Q 93 (532) T ss_pred hhhhhhhhhhhhhhhhhhhcc-----cccccccccccccccccccc-cCcccccccccccccccccchHHHHHHHH---c Confidence 22210 0111111 11111111222221111101 100 0 000 1111224667878886 5 Q ss_pred cchhhhHHHHhhcceeeecCCCceEEEEeccCC-CcHHHHHHHHHHHHHHHHHhcchhhhhHHHHh--hhhcCceeeeee Q lcl|NC_021072. 66 QPECDSAVDDIVNETICGNFDDVPVEVELSNLK-QSDKIKKLIREEFAEILRLLDFENRSYEIFRR--WYVDGRLFYHKV 142 (533) Q Consensus 66 ~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~-~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~--WYvDGri~~hkv 142 (533) ++++..||+-++++|+. ..++|..++.+ ..+...++|..+++. |++..+..+.+|. .|=.|-++++.. T Consensus 94 ~~l~r~~Vd~~aed~~r-----~~~~i~~~~~~~~~~~~~~~i~~~~~~----l~v~~~l~~a~~~~rlyG~a~i~i~v~ 164 (532) T protein:vir:94 94 LPEYRTMHETPADECVR-----AWGKITCSSKDELAADKATRITQKLEQ----YNVRTLVRTVVIHDQAYGGAHVFPHLK 164 (532) T ss_pred CchhhhhhccchHHHhh-----CCceEeeCCccccchHHHHHHHHHHHh----hhHHHHHHHHHHhhhcccceEEEEEec Confidence 99999999999999996 55556543322 233455555555544 4566555555553 333334444421 Q ss_pred -------------ecCC-CCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCc Q lcl|NC_021072. 143 -------------IDPK-NPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQG 208 (533) Q Consensus 143 -------------id~~-~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~ 208 (533) ++++ =.+++++.|+.+||..+.+- ... .......-|+.-..|.+.. + T Consensus 165 ~~~~~~~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~-~~~--------~~dp~sp~fg~P~~y~v~~----------g 225 (532) T protein:vir:94 165 MDGDSVPADAPLLLSPSFVQRGCLIGFATIEPMWLSPN-AYN--------ATDPTLPSFYKPDSWIATS----------G 225 (532) T ss_pred cCCccccccccccccccccccceeeEEEeechheeccc-ccc--------cccccccccCCceeEEEcc----------C Confidence 1111 11335678888888877441 111 1111112222222222211 1 Q ss_pred ceeccchhh-ccc----cccccCCCCccchhHHHHHHHHHHHHHHHHH--HHHHHHhcCccceEEEccCCCCc-hHHHHH Q lcl|NC_021072. 209 MKIATDSVT-YCH----SGIQDLNKNMTLSHLHKAIKAVNQLRMIEDS--LVIYRLSRAPERRIFYIDVGNLP-KNKAEQ 280 (533) Q Consensus 209 ~kI~~dai~-y~h----sGl~d~~~~~i~syL~~AiK~~NqLrm~EDa--lVIyRi~RAPeRrvfyIDvGnlp-k~KAeq 280 (533) .+|+.+-+. |.. ..+.......+.|.|+++...+.+......+ .++|+.. =.|+.++..++- ....++ T Consensus 226 ~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~----~~v~k~~~a~~ls~~~~~~ 301 (532) T protein:vir:94 226 KKIHSSRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFS----MTNLATDMAQLLAPGGAQS 301 (532) T ss_pred eeeccceEEEecCCCchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcC----CceeeechHHhhcchhHHH Confidence 122222111 000 0111222334578898887777776554433 3355432 234444432221 111122 Q ss_pred HHHH--HHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCcc-ceeecCCCCCcchHHHH-HHHHHHHHHhcCC Q lcl|NC_021072. 281 YLRE--VMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGT-EISTLPGGQNLGELEDV-KYFQKKLYKALNV 356 (533) Q Consensus 281 Yl~~--im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgT-EIsTLpGg~nLgei~DV-~YF~~kLy~aL~V 356 (533) ..+. +++++|+ .+|-+ .+ .+++ +++++. .+|+-++|+ ..|...+-.+++| T Consensus 302 ~~~r~~~~~~~~~------n~g~~-----~i-------------d~~~e~~e~~~--~~lsgl~~~l~~~~~~iAaa~~I 355 (532) T protein:vir:94 302 LDARLQLFNLYRD------NRNIG-----AL-------------DKGTEEIQQTN--TPLSGLDSLQAQSQEQMAAVSHI 355 (532) T ss_pred HHHHHHHHHhhcC------Cccce-----EE-------------cCCCceeEEEe--cccCCHHHHHHHHHHHHHhHhCC Confidence 2111 1223321 11110 00 0111 233332 234545554 7889999999999 Q ss_pred CccccCCC--CcccccchhhhhHHhhhHHHHHHHHHHH-HHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchH Q lcl|NC_021072. 357 PSSRLETE--TTFNIGRAAEITRDEVKFQKFIARLRKR-FSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYF 433 (533) Q Consensus 357 P~sRl~~~--~~~~~g~~~eItRDElkF~Kfi~rLr~~-fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f 433 (533) |+.||-.+ +|||-+.-.+ .+. |+.||+++|.. +..++..+++.-+..... ...+.+.|.|..=... T Consensus 356 P~t~LfG~sp~GlnstGe~D-~~~---yyd~I~s~Qe~~l~p~le~l~~~l~~s~~g-------~~~~d~~~~f~pL~~~ 424 (532) T protein:vir:94 356 PLVKLLGITPNGLNASSDGE-IRV---WYDFIAGYQATNLTPLMEWIIDLIQLSEYG-------QIDPGLAWEWSPLMEL 424 (532) T ss_pred CeeeeecCCcccccccchHH-HHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-------CCCCCceEEeCCCCCC Confidence 99998443 5776422222 333 99999999955 566666665432222211 2234678899876667 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCH-----------HHHHHHHHHHHHhhhcCCCCCCC-cccc Q lcl|NC_021072. 434 TELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTD-----------QEIKEIDKQIDSEREAGLIVDPM-AEMD 501 (533) Q Consensus 434 ~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tD-----------eeI~e~~kqi~~E~~~~~~~~p~-~~~~ 501 (533) ++...+|+...+.++++.+-.- -.+|.+-++.. |++.. +++++...+..+..... ...|. ++.. T Consensus 425 s~kEkAei~~~~a~a~~~~~~~--Gvi~~~Evr~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 500 (532) T protein:vir:94 425 DDKELAEVRQLNASTDSTLMEL--GVIDAKMVQQR-LAADPTSGYAGALGERDELDDVEEIAKQLMAAA-LNPPATAPQT 500 (532) T ss_pred CHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHH-HhcCCccccccccccccccccccchhhhhcccc-cCCCCCCCCC Confidence 8888899999999998887443 36888888765 44332 23333333322222222 22221 1111 Q ss_pred --cCCCCCCCC----CCC--CccccccccCCc Q lcl|NC_021072. 502 --PAMDPGNAP----PAD--DMSAQEGPAVDA 525 (533) Q Consensus 502 --~~~~~~~~~----~~~--d~~~~~~~~~~~ 525 (533) |.-+..+.+ |+. |....+.|.-+. T Consensus 501 ~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 532 (532) T protein:vir:94 501 PNPQPDSEDDQTDNQPDAQADPAQNDQPVGNR 532 (532) T ss_pred CCCCCCCCCCCCCCccCCCccccccCCCcCCC Confidence 111110000 110 112222222111 No 22 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.35 E-value=4.9e-12 Score=82.57 Aligned_cols=460 Identities=12% Similarity=0.089 Sum_probs=197.4 Q ss_pred CCccccceeeeccccccc---cCCCCC--------C-----------------------CCCcccceee---c-----cc Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPK---GPSFVQ--------K-----------------------DSMDGSQPIV---G-----GG 38 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~---~~s~~~--------~-----------------------~~~dg~~~~~---~-----~~ 38 (533) |-..--|+-+++++..+- ..++++ . -..||-.... + +. T Consensus 39 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~ 118 (862) T protein:vir:99 39 LARTRQNWPVQKEKPNPIIRSVKDFPFVEISDSVNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSS 118 (862) T ss_pred HHhhcccCCcccccCCCCCCcccccccccccccccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccccccc Confidence 222222554444322210 001110 0 0001100000 0 00 Q ss_pred cccc---ccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcceeeecCCCceEEEEe--ccCCCcHHHHHHHHHHHHH Q lcl|NC_021072. 39 YYGY---SVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNETICGNFDDVPVEVEL--SNLKQSDKIKKLIREEFAE 113 (533) Q Consensus 39 ~~~~---~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvneaiv~d~~~~~v~v~l--~~~~~S~~ik~~I~eeF~~ 113 (533) |+.. +.....+.-..++|...|+ .++++..+|+-++++|+. ..++|.. ++-+..+...++|.++++ T Consensus 119 y~~~~~~~~~~~~~~f~gyql~alY~---~~~larkiVd~pAeDatR-----~g~~I~~~~d~~e~~~e~~~~ie~~~~- 189 (862) T protein:vir:99 119 YAVPEALQDWYLSQGFIGHQACALIA---QHWLVDKACSLAGEDAIR-----NGWHLKSLGEGEEIDEESLEKFKAIDV- 189 (862) T ss_pred cccchhccccccccCcccHHHHHHHH---hCchhhhhhhhhhHHHhh-----CCceEeecCcccccCHHHHHHHHHHHH- Confidence 1000 0000011112346666665 599999999999999996 4455553 222223344455555554 Q ss_pred HHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCC-------------CCCeEEEEEcChhhceehhh-ccCCCcCceeE Q lcl|NC_021072. 114 ILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNP-------------RGGLTELRYIDPRKIRKVTE-YQQKRPEQLRG 179 (533) Q Consensus 114 i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~-------------~~gI~elr~lDP~~i~~vr~-~~~~~~~~~~~ 179 (533) .|++..+..+.+|-=-+.|+-+.-.+++..++ ++.++.|+.|||..+.+... .+..++ T Consensus 190 ---rL~v~~~l~eair~~RLyGga~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v~~~~~Dp----- 261 (862) T protein:vir:99 190 ---EFKVKENLIEFNRFKNVFGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDPYWMMPMLTAESTADP----- 261 (862) T ss_pred ---HhhHHHHHHHHHHhcccccceEEEEEecCcCchhhhcCcCcccccccceeEEEEechhhhcccccccccccc----- Confidence 45555555554442122343223223332222 34578888999888765321 111111 Q ss_pred EeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHHHH--HHHHHHHHH Q lcl|NC_021072. 180 EDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMI--EDSLVIYRL 257 (533) Q Consensus 180 ~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~--EDalVIyRi 257 (533) ..+-|+.-. +|..+ +...-+.-.+++.-+.+.+. +....+..++|-|+++...+...-.. .-+.++++. T Consensus 262 ---~sp~yGkP~-~y~I~--g~~IH~SRliif~g~~vpd~---lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka 332 (862) T protein:vir:99 262 ---SSQFFYEPE-FWIIS--GQKYHRSHLIIARGPQPADI---LKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNK 332 (862) T ss_pred ---cccccCCce-eeeec--CeeeccceeEEecCCCchhh---hhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 111122211 11111 00000011122222222221 12333445678888765444333211 233445543 Q ss_pred hcCccceEEEccCCC-CchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCC Q lcl|NC_021072. 258 SRAPERRIFYIDVGN-LPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQN 336 (533) Q Consensus 258 ~RAPeRrvfyIDvGn-lpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~n 336 (533) . -+++.+|... |....+-..=.++++++|+- .|-+ .+ +++-+++++- .+ T Consensus 333 ~----l~v~ktd~l~~l~~ed~l~~r~~~~~~~rdN------~Gi~-----li-------------D~eEe~e~ls--~s 382 (862) T protein:vir:99 333 R----TTAIHTDTAKAIANEDKFIQRLMFWVRYRDN------HAVK-----VL-------------GTDETMEQFD--TS 382 (862) T ss_pred c----cceeechhHhhhccHHHHHHHHHHHHhccCc------ceeE-----Ee-------------cCCCceeEEe--cc Confidence 2 2344554432 22211111111234555431 1100 00 0111233332 34 Q ss_pred cchHHH-HHHHHHHHHHhcCCCccccCCC--CcccccchhhhhHHhhhHHHHHHHHHHH-HHHHHHHHHHHHHHhccCCC Q lcl|NC_021072. 337 LGELED-VKYFQKKLYKALNVPSSRLETE--TTFNIGRAAEITRDEVKFQKFIARLRKR-FSELFMDLLKTQLILKGVMS 412 (533) Q Consensus 337 Lgei~D-V~YF~~kLy~aL~VP~sRl~~~--~~~~~g~~~eItRDElkF~Kfi~rLr~~-fs~if~d~Lk~qLilkgi~t 412 (533) |+-++| +..|...+=.+.+||+.||-.+ .|++-+.-+++ + .|+.+|.++|.. +..++..++ .|+.... T Consensus 383 lSGL~dll~~~~q~IAaas~IP~tiLfGqspaGlnATGE~D~-~---nYyD~I~s~QE~~L~P~LerL~--~li~~~l-- 454 (862) T protein:vir:99 383 LADFDAVIMGQYQLVASIAKTPATKLLGTAPKGFNSTGEFET-I---SYHEELESIQEHVYMPFLQRHY--LISRLSL-- 454 (862) T ss_pred cCChHHHHHHHHHHHHhhhCCCceeecccCcccccCchHHHH-H---HHHHHHHHHHHHHHHHHHHHHH--HHHHHhc-- Confidence 444455 5778889999999999998443 57865433332 3 399999999964 444444333 2332222 Q ss_pred HhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHh--------CCCHHHHHHHHHHH Q lcl|NC_021072. 413 LEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVL--------KQTDQEIKEIDKQI 484 (533) Q Consensus 413 ~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL--------~~tDeeI~e~~kqi 484 (533) | +.+.+.|.|..=..=+|...+|+.....++++.+-.- -.+|.+-++.... .++++++++.... T Consensus 455 ---g--~~~d~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~s--GvispdEvR~~L~~~~~~g~~~l~ded~E~d~~~- 526 (862) T protein:vir:99 455 ---G--IQHEIDVVMEPVASMTAQQQADLNKTKAEGGKVLIDG--GVISPDEERNRIRDDKRSGYNRLTKEDAEETPGA- 526 (862) T ss_pred ---C--CCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHHHHhcCCcCCCCCCcccccccCCC- Confidence 1 2245788888777778888899999998888876432 3677788876521 1445555422111 Q ss_pred HHhhhcCCCCCCCcc--cccCCC---------CCCCCCCCCccccccccCCccccchhcC Q lcl|NC_021072. 485 DSEREAGLIVDPMAE--MDPAMD---------PGNAPPADDMSAQEGPAVDAGDAKRGEF 533 (533) Q Consensus 485 ~~E~~~~~~~~p~~~--~~~~~~---------~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 533 (533) ..|.... ...|++. +.|+.+ ..+.+++.+.+.+.+|+...+++..... T Consensus 527 ~~e~~~~-~e~~g~a~~~ap~de~~aga~~~~~e~d~~~~p~~~~~~~g~~~~~t~~~~a 585 (862) T protein:vir:99 527 SPENLAA-YQKAGAAQETASAKETQAGAAVTTAEGDQPNVQMVPSMKPGQMVGPEVGITA 585 (862) T ss_pred Ccccccc-cccCCcccccccccccccccCCccccCCcccccccCCCCCCCcccccccccc Confidence 1111111 1111110 011111 0011122222332223222222222111 No 23 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.28 E-value=2.1e-10 Score=73.63 Aligned_cols=466 Identities=12% Similarity=0.089 Sum_probs=235.1 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceee-cccccccccchhhhhhHHHHHHHHHHhh-hhcchhhhHHHHhhc Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIV-GGGYYGYSVDFDGTVRNEYELITRYREM-VLQPECDSAVDDIVN 78 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~-~~~~~~~~~~~~~~~~~~~~LI~~YR~m-~~~pEvd~AvdeIvn 78 (533) |..-==|+.=-...... ...-...+|+-..- ..++.+...+.+ ...+...|..+-|.+ .++|-+.+||+-+++ T Consensus 1 m~~~~~~~~a~~~~~~~----~~~~~~y~aa~~~~~~~~~~~~s~d~~-~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~ 75 (495) T protein:vir:10 1 MNMTPSGYQSLASGLLV----PVGASAYEGASGGHRWQDIGDYGPDTA-VASGIQTLRARSHHNVRNNPWATNAVATWVA 75 (495) T ss_pred CCcccccccccchhhhh----HHHhhhhhccccCcccCCCCCCChhHH-HHHHHHHHHHHHHHHHhcChHHHHHHHHHHH Confidence 22111111100000000 00001123221111 111222222222 224688899999999 788999999999998 Q ss_pred ceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHH------hcchhhhhHHHHhhhhcCceeeeeeecCCCCCC-C Q lcl|NC_021072. 79 ETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRL------LDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRG-G 151 (533) Q Consensus 79 eaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~l------L~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~-g 151 (533) .+|=. + +.+.-....+.+.++|..+|+.-.+- ++|-.--...+|.|.+||-.|.-+..++..+.. . T Consensus 76 ~vVG~---G----i~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~ 148 (495) T protein:vir:10 76 AAVGN---G----LTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSV 148 (495) T ss_pred hhcCC---C----cccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCcc Confidence 87731 2 22222334588999999999987764 344444556889999999999987776533322 2 Q ss_pred eEEEEEcChhhce-ehhhccCCCcCceeEEecc-ceeeccchhceeccccccc----cccCCcceeccchhhcccccccc Q lcl|NC_021072. 152 LTELRYIDPRKIR-KVTEYQQKRPEQLRGEDIN-TQLTQKAAEYYLYNPKGLK----NSTNQGMKIATDSVTYCHSGIQD 225 (533) Q Consensus 152 I~elr~lDP~~i~-~vr~~~~~~~~~~~~~~~~-~~~~~~~~e~~~y~p~~~~----~~~~~~~kI~~dai~y~hsGl~d 225 (533) -..|+.|+|..|. +.-+... +++..+..+. ..-.+.-..|+++...+.. ......++||..-|....- .. T Consensus 149 ~~~lqliepd~l~~~~~~~~~--~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~~rvpA~~vlH~f~--~r 224 (495) T protein:vir:10 149 PLQLQIIEPDMLASDIPDETL--PSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIGDPVDTVWIKAEHVLHVTV--LT 224 (495) T ss_pred ceEEEEechhhcCCCCCCCCC--CCCCEEEeceEECCCCceEEEEEeecCCCcccccccccceeeechhheEeccc--cC Confidence 3789999999994 3222111 2222221111 0124566788887543322 2333457788765544331 23 Q ss_pred CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccc Q lcl|NC_021072. 226 LNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDD 305 (533) Q Consensus 226 ~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d 305 (533) +.---++|.|+..+ .+++|.-.+||...-...-|-. +.+|-..+ |..-+.+-.. ..++-... T Consensus 225 ~gQ~RGis~la~i~-~l~~l~~y~dael~~a~i~A~~--~~fi~~~~-~~~~~~~~~~--------------~~~~~~~~ 286 (495) T protein:vir:10 225 VRSDAGAPWFQLLL-RLNELDQYEDAELVRKKTAALF--AAFIQEAT-ADSTGGPTIG--------------QPKRSKGG 286 (495) T ss_pred CCcccCcchhHHHH-HHHHhhHHHHHHHHHHHHhhhh--eeeeecCC-CccccccccC--------------ccccccCc Confidence 44444678998655 5899999999999999888855 33332221 1111100000 00111111 Q ss_pred cccchhHhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCC-CcccccchhhhhHHhhhHH Q lcl|NC_021072. 306 KKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELED-VKYFQKKLYKALNVPSSRLETE-TTFNIGRAAEITRDEVKFQ 383 (533) Q Consensus 306 ~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~-~~~~~g~~~eItRDElkF~ 383 (533) ....+| +.==++. =.-|.+|+++.....-+..++ ++...+.+=.+|+||-+-|..+ ++.|+ |.+.-.-+.|. T Consensus 287 ~~~~~l-~pG~i~~--L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nY---SS~R~~~~e~~ 360 (495) T protein:vir:10 287 KRITGL-NPGTLQY--LQPGQEVKFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNY---SSIRAGLLEFR 360 (495) T ss_pred ccceec-CCceeee--cCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccH---HHHHHHHHHHH Confidence 111111 0000000 013445555555444444444 5566666778899999988665 35554 33344455699 Q ss_pred HHHHHHHHH-HHHHHHHH-----HHHHHHhccCCC-HhHHhhhhhceeEEE--eccchHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_021072. 384 KFIARLRKR-FSELFMDL-----LKTQLILKGVMS-LEEWDEMKEHIQFDF--IADNYFTELKEIEIRNERMNQVNTMDP 454 (533) Q Consensus 384 Kfi~rLr~~-fs~if~d~-----Lk~qLilkgi~t-~eew~~~~~~i~~~f--~~Dn~f~E~ke~Ei~~~R~~~~~~~~~ 454 (533) +.+.++|.+ |..-|..+ |+ ..+|.|.++ ++-|+.-.......| -.--+--.+||+.-...+++ T Consensus 361 r~~~~~q~~~~~~~~~~pi~~~~l~-~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~------- 432 (495) T protein:vir:10 361 RLCQQVQHHMIIHQFCRPVGRWFMD-FAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVR------- 432 (495) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHH------- Confidence 999999875 55544443 33 456777665 443333222233344 33344556777665555444 Q ss_pred hccccccHHHHHHHHhCCCHHHHH-HHHHHHHHhhhcCCCCC--CCcccccCCCCCCCCCCCCcccccc Q lcl|NC_021072. 455 YVGKYFSIDYMRRQVLKQTDQEIK-EIDKQIDSEREAGLIVD--PMAEMDPAMDPGNAPPADDMSAQEG 520 (533) Q Consensus 455 ~vGky~S~~~i~k~IL~~tDeeI~-e~~kqi~~E~~~~~~~~--p~~~~~~~~~~~~~~~~~d~~~~~~ 520 (533) .-.-|.+-+..+ .+..-+|+- |.....+...+.|+..+ |.... +.+...++ ..+.+.++. T Consensus 433 --~G~~s~~~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~p~~~~--~~~~~~~~-~~~~~~~~e 495 (495) T protein:vir:10 433 --AGFAPISDKQAE-RGYDMEELFDMISDANQLIDEYDLRLDSDPRYVN--GSGAEQKS-VMEAALNNE 495 (495) T ss_pred --cCCCCHHHHHHH-cCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCcCC--CccCCCCC-CCCCCCCCC Confidence 244566666666 466555444 23333333444554322 22211 11111111 111111111 No 24 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.25 E-value=3.1e-10 Score=72.70 Aligned_cols=465 Identities=13% Similarity=0.115 Sum_probs=242.1 Q ss_pred cccccccCCCCCC-------------CCCcccceee-cccccccccchhhhh-hHHHHHHHHHHhh-hhcchhhhHHHHh Q lcl|NC_021072. 13 AKKVPKGPSFVQK-------------DSMDGSQPIV-GGGYYGYSVDFDGTV-RNEYELITRYREM-VLQPECDSAVDDI 76 (533) Q Consensus 13 ~~~~~~~~s~~~~-------------~~~dg~~~~~-~~~~~~~~~~~~~~~-~~~~~LI~~YR~m-~~~pEvd~AvdeI 76 (533) -...++.+++..| ...+|+-..- ..++... .+.+..+ .+...|..+-|.| .++|-+.+||+-+ T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~~~-~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~ 79 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAIQAYEAARPGRTHKAKRQP-LGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRL 79 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHhccccccCccccccccCCC-CChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 1222333322222 1233322211 1112111 2223333 3577899999999 7899999999999 Q ss_pred hcceeeecCCCceEEEEeccCCC----cHHHHHHHHHHHHHHHHH------hcchhhhhHHHHhhhhcCceeeeeeecCC Q lcl|NC_021072. 77 VNETICGNFDDVPVEVELSNLKQ----SDKIKKLIREEFAEILRL------LDFENRSYEIFRRWYVDGRLFYHKVIDPK 146 (533) Q Consensus 77 vneaiv~d~~~~~v~v~l~~~~~----S~~ik~~I~eeF~~i~~l------L~f~~~~~~~fR~WYvDGri~~hkvid~~ 146 (533) ++-+|=+. + +.+.-.-+.. .+.+-+.|...|+.-++- ++|-.--...+|.|.+||-+|..+..++. T Consensus 80 ~~nvVG~~--G--~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~ 155 (548) T protein:vir:95 80 EERVVGGS--G--IGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRV 155 (548) T ss_pred HHhccCcc--c--cceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeeccc Confidence 98877211 1 1222222333 345667788888877653 34444455688999999999999888753 Q ss_pred CC-CCC---eEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceecccccc--c--cccCCcceeccchhhc Q lcl|NC_021072. 147 NP-RGG---LTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGL--K--NSTNQGMKIATDSVTY 218 (533) Q Consensus 147 ~~-~~g---I~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~--~--~~~~~~~kI~~dai~y 218 (533) .. ..| -..|+.|+|..|.- -.+...+...-..-. .-.+.-..|+++.+... . ......++||...|.- T Consensus 156 ~~~~~g~~~~~~lqliepd~l~~---~~~~~~~~i~~GIE~-D~~Grp~aY~i~~~hPgd~~~~~~~~~~~rvpA~~VlH 231 (548) T protein:vir:95 156 PNYTFATSVPFALELLEPDYLPF---SYNNLSKGIVQGIER-DTWRRKRAYHLLKDHPGNLQTLGGSLAVKRVEAERIIH 231 (548) T ss_pred ccccCCcccceEEEEechhhcCC---CCCCCCCceeeeeEE-CCCCceEEEEEeecCCCcccccccccceeeechhHhee Confidence 21 112 25889999999841 111111111111111 12355678888764322 1 2234467888766544 Q ss_pred cccccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCC Q lcl|NC_021072. 219 CHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDAN 298 (533) Q Consensus 219 ~hsGl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~ 298 (533) ...- ..+.-.-++|.|+.+++.+.+|.-.+||..+-...-|-.=-+ |-.++ |.... .. T Consensus 232 if~~-~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~f--i~~~~-~~~~~------------------~~ 289 (548) T protein:vir:95 232 IAYR-KRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMY--IKKGN-PDSYT------------------VE 289 (548) T ss_pred cccc-cCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheee--eecCC-Ccccc------------------CC Confidence 4433 233355678999999999999999999999998888876333 33222 21100 00 Q ss_pred CCccccccccchhHhhhcccccCCCCccceeecCCCCCc---------chHHHHH-HHHHHHHHhcCCCccccCCCCccc Q lcl|NC_021072. 299 TGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNL---------GELEDVK-YFQKKLYKALNVPSSRLETETTFN 368 (533) Q Consensus 299 TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nL---------gei~DV~-YF~~kLy~aL~VP~sRl~~~~~~~ 368 (533) .+ -.+.....+| .-|+=|.+|+.|+.+ +..++.. -..+.+=.+|+||-+-|..+- + T Consensus 290 ~~-~~~~~~~~~~-----------~pG~iv~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~--s 355 (548) T protein:vir:95 290 PG-KDRKNRTIPI-----------APGMVFDDLEPGEDVGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAY--D 355 (548) T ss_pred CC-cccccccccc-----------cCCccccccCCCceeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccc--c Confidence 00 0011111111 123334455444333 3333322 233334567999999987763 3 Q ss_pred ccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHH----HHHhccCCCHhHHhhhhhceeEEEec--cchHHHHHHHHHH Q lcl|NC_021072. 369 IGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKT----QLILKGVMSLEEWDEMKEHIQFDFIA--DNYFTELKEIEIR 442 (533) Q Consensus 369 ~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~----qLilkgi~t~eew~~~~~~i~~~f~~--Dn~f~E~ke~Ei~ 442 (533) ++-|.+.-.-+.|.+.+.++|..|..-|..++-. ..+|+|.+..-.|.+-...+...|.. --+.-.+||+.-. T Consensus 356 -~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~ 434 (548) T protein:vir:95 356 -GTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAW 434 (548) T ss_pred -hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHH Confidence 2445555666779999999999988777776433 46788877543333333444555543 3344566776655 Q ss_pred HHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHH-HHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccccc Q lcl|NC_021072. 443 NERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIK-EIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGP 521 (533) Q Consensus 443 ~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~-e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~ 521 (533) ..+++ .-.-|.+-+..+ .+..-+|+. |.....+.-...|+..+......+..+..+.+.+++-..-+.. T Consensus 435 ~~~i~---------~Gl~T~~~~~a~-~G~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (548) T protein:vir:95 435 ELLVK---------AGFADEAEVARA-RGRDPRELKKSRETEIKANRAAGLVFSSDAYHQLVKSGMDPVEAVQKVYLGVG 504 (548) T ss_pred HHHHH---------cCCCCHHHHHHH-hCCCHHHHHHHHHHHHHHHHHcCCCCCCcccccccccccCCCCchhhhccccc Confidence 55444 234566666666 566655543 3333333333344322111111111111111111111111111 Q ss_pred cCCccccchhcC Q lcl|NC_021072. 522 AVDAGDAKRGEF 533 (533) Q Consensus 522 ~~~~~~~~~~~~ 533 (533) +.-++|..++|+ T Consensus 505 ~~~~~~~~~~~~ 516 (548) T protein:vir:95 505 KMLTADEARELV 516 (548) T ss_pred cccccchhHHhh Confidence 222344444444 No 25 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.21 E-value=5.6e-10 Score=71.31 Aligned_cols=419 Identities=14% Similarity=0.113 Sum_probs=196.1 Q ss_pred cccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcceeeecCCCceEEE Q lcl|NC_021072. 13 AKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNETICGNFDDVPVEV 92 (533) Q Consensus 13 ~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvneaiv~d~~~~~v~v 92 (533) -...|.-.+++. .-| +.-... ++.++ .+...+..+|...| +.++.+..+|+-++.+|+. ..+.| T Consensus 1 ~~~~D~~~~~~~---~~g-~~~~~~-~~~~~---~~~~~~~~~l~a~Y---~~~~l~~~~vd~~a~d~~r-----~~~~i 64 (437) T protein:vir:52 1 MKFFDGIKSLAL---KLG-SKQEQT-YYSPS---LSLTDDLVQLEALW---RDNWIANKVCIKRPEDMVR-----NWREI 64 (437) T ss_pred CchhhhhHhHHh---cCC-Cccccc-eeecC---ccccccHHHHHHHH---HhCchhhHHhhcchHHhhc-----CCceE Confidence 011111111111 000 000011 11111 11123455666666 4689999999999999996 44555 Q ss_pred EeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCC------CCCCCeEEEEEcChhhceeh Q lcl|NC_021072. 93 ELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPK------NPRGGLTELRYIDPRKIRKV 166 (533) Q Consensus 93 ~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~------~~~~gI~elr~lDP~~i~~v 166 (533) ..++ ..++-.+++. ..++.|++..+..+.+|-==+.|.-+.-.++|.. +++++++.++.+||..+.++ T Consensus 65 ~~~d--~~~~~~~~~~----~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~ 138 (437) T protein:vir:52 65 YSND--LNSKQLDLFT----KFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPT 138 (437) T ss_pred ecCC--CCHHHHHHHH----HHHHhhcHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhcccc Confidence 5322 2222223444 4445556666666655532245655555566543 24678999999999998764 Q ss_pred hhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhh-cccc-ccccCCCCccchhHHHHHHHHHH Q lcl|NC_021072. 167 TEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVT-YCHS-GIQDLNKNMTLSHLHKAIKAVNQ 244 (533) Q Consensus 167 r~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~-y~hs-Gl~d~~~~~i~syL~~AiK~~Nq 244 (533) -.... ++. .+-|+.-.. |.++.. .+.++|+.+-+. |+.. +-...+...++|.|+++...+.. T Consensus 139 ~~~~~-dp~--------s~~fg~p~~-y~v~~~------~~~~~iH~SRii~~~~~~~~~~~~~~~G~s~le~~~~~i~~ 202 (437) T protein:vir:52 139 GTKDD-DVL--------SPNFGRYSE-YSILGG------SQSITVHHSRLIILNANDAPLSDNDIWGVSDLEKIIDVLKR 202 (437) T ss_pred ccccc-ccc--------ccccCcceE-EEEecC------CcceeEccceeEEecCccCCCccccccCCchHHHHHHHHHH Confidence 33211 111 112222222 111111 112233333211 1100 01122344568999998777755 Q ss_pred HHHHHHH--HHHHHHhcCccceEEEccCC--CCch--HHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhccc Q lcl|NC_021072. 245 LRMIEDS--LVIYRLSRAPERRIFYIDVG--NLPK--NKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLP 318 (533) Q Consensus 245 Lrm~EDa--lVIyRi~RAPeRrvfyIDvG--nlpk--~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLp 318 (533) +...+.+ .++++ +... ++.++.- .|.. ..+..-..+.++++|+ ..| .+-| T Consensus 203 ~~~~~~~~~~l~~~---~~~~-v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~------~~~-------~~~~------- 258 (437) T protein:vir:52 203 FDSASVNVGDLIFE---SKID-IFKIAGLSDKIAAGMENEVASVISAVQEIKS------ATN-------SLLL------- 258 (437) T ss_pred HHHHHHHHHHHHHH---cCCC-ceecchHHHHhcCCcHHHHHHHHHHHHHhcC------CCc-------eEEE------- Confidence 5444432 33443 4333 4555420 1111 1111112222333322 111 1111 Q ss_pred ccCCCCccceeecCCCCCcchHHHH-HHHHHHHHHhcCCCccccCCC--CcccccchhhhhHHhhhHHHHHHHHHHH-HH Q lcl|NC_021072. 319 RREGGRGTEISTLPGGQNLGELEDV-KYFQKKLYKALNVPSSRLETE--TTFNIGRAAEITRDEVKFQKFIARLRKR-FS 394 (533) Q Consensus 319 RReggrgTEIsTLpGg~nLgei~DV-~YF~~kLy~aL~VP~sRl~~~--~~~~~g~~~eItRDElkF~Kfi~rLr~~-fs 394 (533) ..+-+++++. .+|+-++|+ ..|+..+=.+.+||+.+|-.+ +|++ .+.+=.|. |+.+|.++|.. +. T Consensus 259 ----d~~~~~e~~~--~~~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Gla--sge~D~~~---yyd~i~~~Qe~~l~ 327 (437) T protein:vir:52 259 ----DAENEYDRKE--LTFTGLKDLLTEFRNAVAGAADMPVTILFGQSVSGLA--SGDEDIQN---YHEAIRRLQETRLR 327 (437) T ss_pred ----cCCcceEEEe--cCcCCHHHHHHHHHHHHHHHhcCchhhhcCcCccccc--ccHHHHHH---HHHHHHHHHHHHHH Confidence 0112233332 245555664 688999999999999999544 5663 23332344 99999999964 56 Q ss_pred HHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCH Q lcl|NC_021072. 395 ELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTD 474 (533) Q Consensus 395 ~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tD 474 (533) .+...+++. |++. +|-.+.+.+.|.|..=..-++...+|+...+.++++.+-.- -.+|.+-+++. |+ T Consensus 328 p~le~l~~~-i~~~------~~g~~~~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~--g~i~~~e~r~~-L~--- 394 (437) T protein:vir:52 328 PIFEIIDPL-ICNE------LFGGLPADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQN--GVLNEYQIANE-LR--- 394 (437) T ss_pred HHHHHHHHH-HHHH------hcCCCCCcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHH-HH--- Confidence 666665552 2222 23334456889998777778888899999988888776433 13444444433 21 Q ss_pred HHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccccccCCccccchhc Q lcl|NC_021072. 475 QEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDAKRGE 532 (533) Q Consensus 475 eeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 532 (533) ..+.|+.=.++ ++....+..++.++..+.+......+...+.+ T Consensus 395 --------------~~g~~~~i~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (437) T protein:vir:52 395 --------------ESGLFANISAE-HIEELKNADEFAGNFEEPEKMEGAQVQNSEDQ 437 (437) T ss_pred --------------hcCCCCCCCcc-ccccccCCCCCCCccCCCCCCCCCCCCCCCCC Confidence 12444411111 11001111111111111111111111111111 No 26 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.12 E-value=5.3e-10 Score=71.44 Aligned_cols=400 Identities=14% Similarity=0.153 Sum_probs=185.4 Q ss_pred CCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcceeeecCCCceEEEEeccCCCcHH Q lcl|NC_021072. 23 VQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDK 102 (533) Q Consensus 23 ~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ 102 (533) .+.-+.||=..+.+++..++...... -+...+|...|| .++.+..+|+-++.+|+- .-++|..+. T Consensus 1 ~~~~~~d~~~~~~~~~~~~~~~~~~~-~~~~~~l~a~Y~---~~~l~~~~Vd~~aed~~r-----~g~~i~g~~------ 65 (427) T protein:vir:10 1 MKIVKHDGYNDIFNGGADGSPKPFFM-SDASYHVGSFYN---DNATAKRIVDVIPEEMVT-----AGFKMSGVK------ 65 (427) T ss_pred CCccccchHHHHhhcCCCCcccCccc-cCchHHHHHHHH---cCchhhhhhccchHHhhc-----CCccccCcc------ Confidence 23333333333333322222211111 113456666665 688899999999999994 224443211 Q ss_pred HHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeec-------CCCCCCCeEEEEEcChhhceehhhccCCCcC Q lcl|NC_021072. 103 IKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVID-------PKNPRGGLTELRYIDPRKIRKVTEYQQKRPE 175 (533) Q Consensus 103 ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid-------~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~ 175 (533) -+++|..+| +.|++..+..+.+|.=-+-|.-++-..++ +.+++++|+.++.+||..+.+- .+. .+ T Consensus 66 ~~~~~~~~~----~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~-~~~-~d-- 137 (427) T protein:vir:10 66 DEKEFKSLW----DSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVE-KRV-TN-- 137 (427) T ss_pred HHHHHHHHH----HHhhHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhccccc-ccc-cC-- Confidence 123444444 45677777777776533444433332232 3366889999999999888541 111 11 Q ss_pred ceeEEeccceeeccchhceeccccccccccCCcceeccchhh-cccccc----ccCCCCccchhHHHHHHHHHHHHHHHH Q lcl|NC_021072. 176 QLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVT-YCHSGI----QDLNKNMTLSHLHKAIKAVNQLRMIED 250 (533) Q Consensus 176 ~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~-y~hsGl----~d~~~~~i~syL~~AiK~~NqLrm~ED 250 (533) ...+-|+.-.. |..++.+ ..+.++|+++-+. +...-+ ...++....|.|.+++ ++.|+-+|- T Consensus 138 ------p~s~~fg~P~~-y~v~~~~----~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~--~~~i~~~~~ 204 (427) T protein:vir:10 138 ------ARSPRYGEPEI-YKVSPGD----NMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSL--IDAICDYDY 204 (427) T ss_pred ------ccccccCcceE-EEEecCC----CCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHH--HHHHHHHHH Confidence 12222222222 2222211 1122444443221 111111 1122333456666543 454444443 Q ss_pred -----HHHHHHHhcCccceEEEc-cCCCC---chH--HHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccc Q lcl|NC_021072. 251 -----SLVIYRLSRAPERRIFYI-DVGNL---PKN--KAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPR 319 (533) Q Consensus 251 -----alVIyRi~RAPeRrvfyI-DvGnl---pk~--KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpR 319 (533) +.++++- -.+ |+.+ +++++ +.. .+..-+. .+.+.|+ .+|-+. + | T Consensus 205 ~~~~~~~l~~k~---~~~-v~k~~~l~~~~~~~~~~~~~~~r~~-~~~~~~~------~~~~~~-------l--~----- 259 (427) T protein:vir:10 205 CESLATQILRRK---QQA-VWKVKGLAEMCDDDDAQYAARLRLA-QVDDNSG------VGRAIG-------I--D----- 259 (427) T ss_pred HHHHHHHHHHHh---ccc-cccchhHHHHhcCccchHHHHHHHH-HHHHhcC------ccccee-------e--e----- Confidence 3345543 222 3334 33221 111 1111111 1112211 111110 0 0 Q ss_pred cCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCC--CcccccchhhhhHHhhhHHHHHHHHHHH-HHH Q lcl|NC_021072. 320 REGGRGTEISTLPGGQNLGELED-VKYFQKKLYKALNVPSSRLETE--TTFNIGRAAEITRDEVKFQKFIARLRKR-FSE 395 (533) Q Consensus 320 ReggrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~--~~~~~g~~~eItRDElkF~Kfi~rLr~~-fs~ 395 (533) +.+-+++++. .+|+-++| +..|...+=.+.+||+.||-.+ +|+| +-+.+ |--.|+.+|+++|.. +.. T Consensus 260 ---~~~e~~e~~~--~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~sp~Gln-stgd~---D~~nyyd~i~~~Qe~~l~p 330 (427) T protein:vir:10 260 ---AETEEYDVLN--SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVS-ASQNT---ALETFYKLVDRKREEDYRP 330 (427) T ss_pred ---cCCCceeEEe--cccCChHHHHHHHHHHHHhhhCCCeeeeccCCccccc-cchhH---HHHHHHHHHHHHHHHHHHH Confidence 1111233221 23444555 4789999999999999999443 5775 22233 333499999999954 444 Q ss_pred HHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHH Q lcl|NC_021072. 396 LFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQ 475 (533) Q Consensus 396 if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDe 475 (533) +...+++ ||.. + +.+.|.|..-..=+|...+|+...+.++++.+-.- | .++.+-+++. T Consensus 331 ~l~~l~~--~i~~---s--------~~~~~~f~pL~~~s~kEkaei~~~~a~a~~~~~~~-g-vi~~~e~r~~------- 388 (427) T protein:vir:10 331 LLEFLLP--FIVD---E--------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITE-Q-IIDLEEARDT------- 388 (427) T ss_pred HHHHHHH--Hhhc---C--------CCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc-C-CCCHHHHHHH------- Confidence 4433333 2221 1 35779999888888988999999999988876443 1 3444444432 Q ss_pred HHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCcccccc Q lcl|NC_021072. 476 EIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEG 520 (533) Q Consensus 476 eI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~ 520 (533) ...+-.+..-.-+.+.. .++++......|+..+++..+. T Consensus 389 -----L~~~~~~~~~~~~~~~~-~e~~~~~~e~~p~~~e~~~d~~ 427 (427) T protein:vir:10 389 -----LRSIAPEFKLKDGNNIN-IREPEETTEPEPGLGEKLEDEN 427 (427) T ss_pred -----HHhhhccccCCCCcccc-ccccchhcCCCCCCCCCCCCCC Confidence 22221111111111111 1222222222223333222211 No 27 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.04 E-value=2.9e-09 Score=67.41 Aligned_cols=392 Identities=13% Similarity=0.147 Sum_probs=180.0 Q ss_pred CCCCcccceeec-----ccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcceeeecCCCceEEEEeccCCC Q lcl|NC_021072. 25 KDSMDGSQPIVG-----GGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNETICGNFDDVPVEVELSNLKQ 99 (533) Q Consensus 25 ~~~~dg~~~~~~-----~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~~ 99 (533) =...||-.+..+ +.++++.. -.+..+|...|+ .++.+..+|+-++.+|+- .-++|. T Consensus 1 ~~~~D~~~n~~~gg~~~~~~~~~~~-----~~~~~~l~a~Y~---~~~l~~~~Vd~~aed~~r-----~g~~i~------ 61 (422) T protein:vir:10 1 MVKTDSYANIFLGGSDGSEIYGSLQ-----NQAPTILASLYA---DNALVRRIIDTIPETALA-----AGFHID------ 61 (422) T ss_pred CccchhhHHHHcCCCCCccccCccc-----ccCHHHHHHHHH---hChhhHHHHhhhhHHHhc-----CCcccc------ Confidence 222233222221 12233221 124567777774 688999999999999973 223332 Q ss_pred cHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeee-cC------CCCCCCeEEEEEcChhhceehhhccCC Q lcl|NC_021072. 100 SDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVI-DP------KNPRGGLTELRYIDPRKIRKVTEYQQK 172 (533) Q Consensus 100 S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvi-d~------~~~~~gI~elr~lDP~~i~~vr~~~~~ 172 (533) ++.-++++.++|+. |++..+..+.+|.=-+-|.-++-..+ |. -++++.|+.++.+||..+.+. .+. . T Consensus 62 ~~~~~~~~~~~~~~----l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~-~~~-~ 135 (422) T protein:vir:10 62 GIDDEPAFWSRWDD----LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQ-TRE-E 135 (422) T ss_pred CCCHHHHHHHHHHH----hhHHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccch-hcc-c Confidence 12234456666654 56777777766543344433333334 32 246778999999999988642 111 1 Q ss_pred CcCceeEEeccceeeccchhceeccccccccccCCcceeccchh-hcccc----ccccCCCCccchhHHHH-HHHHHHHH Q lcl|NC_021072. 173 RPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSV-TYCHS----GIQDLNKNMTLSHLHKA-IKAVNQLR 246 (533) Q Consensus 173 ~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai-~y~hs----Gl~d~~~~~i~syL~~A-iK~~NqLr 246 (533) ++ ..+-|+.-.. |..++. +.....+|+++-+ ++... -+...+...+.|-|.++ ...+..+. T Consensus 136 dp--------~s~~fg~P~~-y~v~~~----~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~ 202 (422) T protein:vir:10 136 NP--------RNARFGEPLT-YRITTN----ESDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIKDYT 202 (422) T ss_pred Cc--------cccccCcceE-EEEecC----CCCcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHHHHH Confidence 11 1112222221 111111 1112233433321 11111 12222333456767654 23233222 Q ss_pred H-HHH-HHHHHHHhcCccceEEEccC-CCC---c--hHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhccc Q lcl|NC_021072. 247 M-IED-SLVIYRLSRAPERRIFYIDV-GNL---P--KNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLP 318 (533) Q Consensus 247 m-~ED-alVIyRi~RAPeRrvfyIDv-Gnl---p--k~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLp 318 (533) . .+- +.++++-. =+++.++. .++ + ...|.+-+ ..+.++|+. +|-+. +. T Consensus 203 ~~~~~~~~l~~~~~----~~v~~~~~l~~~~~~~~~~~~~~~r~-~~~~~~~~~------~~~~~-------l~------ 258 (422) T protein:vir:10 203 NCERLATQLLKRKQ----QAVWKAKGLAELCDDSEGFGAARLRL-AQVDNNSGV------GQAIG-------ID------ 258 (422) T ss_pred HHHHHHHHHHHHhc----cccccchhHHHhcCCccchHHHHHHH-HHHHHhcCC------cccee-------Ee------ Confidence 2 222 34455542 23455552 221 1 11111111 112222221 11111 00 Q ss_pred ccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCC--CcccccchhhhhHHhhhHHHHHHHHHHH-HH Q lcl|NC_021072. 319 RREGGRGTEISTLPGGQNLGELED-VKYFQKKLYKALNVPSSRLETE--TTFNIGRAAEITRDEVKFQKFIARLRKR-FS 394 (533) Q Consensus 319 RReggrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~--~~~~~g~~~eItRDElkF~Kfi~rLr~~-fs 394 (533) +.+-+++++. .+|+-++| +..|...+-.+.+||+.||-.+ +|||- -|.+-.|. |+.+|+++|.. +. T Consensus 259 ----~~~e~~e~~~--~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glna-tgd~d~~~---yyd~i~~~Qe~~l~ 328 (422) T protein:vir:10 259 ----AESEEYSVLN--SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGVSS-SQNTALET---FHKLVDRKRNAELL 328 (422) T ss_pred ----cCCcceEEEe--cccCChHHHHHHHHHHHHhhhCCCeeeeccCCcccccc-cchHHHHH---HHHHHHHHHHHHHH Confidence 0111233321 23444444 5789999999999999999444 56653 12222344 99999999964 45 Q ss_pred HHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCH Q lcl|NC_021072. 395 ELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTD 474 (533) Q Consensus 395 ~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tD 474 (533) .+...+++. |+ -. +.+.|.|..-..=+|...+|+...+.++++.+-.- -.+|.+-+++. |+... T Consensus 329 p~l~~l~~~--i~----~s-------~~~~~~f~pL~~~sekekaei~~~~a~a~~~~~~~--g~i~~~e~r~~-L~~~~ 392 (422) T protein:vir:10 329 PILEFLIPF--IV----NA-------EEWSVEFNPLAQESSKDKAEILEKNVNSIAALIAA--GAMDIDEARDT-LRTIA 392 (422) T ss_pred HHHHHHHHH--hc----cc-------CCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHH-hhhhc Confidence 554444433 22 11 35678898888888888899999988888776443 23444555433 22100 Q ss_pred HHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021072. 475 QEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPAD 513 (533) Q Consensus 475 eeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~ 513 (533) . ......++.++-.++.+-..++.+.|+.| T Consensus 393 ~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 422 (422) T protein:vir:10 393 P---------EVKINDGSVETEVTISETSNDPLEVPTDD 422 (422) T ss_pred c---------cccCCCCCCccccchhhcCCCCCCCCCCC Confidence 0 00001111110000000011111111111 No 28 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.02 E-value=5.2e-09 Score=66.01 Aligned_cols=456 Identities=13% Similarity=0.128 Sum_probs=235.7 Q ss_pred CCccccceeeeccccccccCCCCCCC----------CCccccee-eccccc--ccccchhhh-hhHHHHHHHHHHhh-hh Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKD----------SMDGSQPI-VGGGYY--GYSVDFDGT-VRNEYELITRYREM-VL 65 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~----------~~dg~~~~-~~~~~~--~~~~~~~~~-~~~~~~LI~~YR~m-~~ 65 (533) |...- .+....++.++++-+. ..+|+-.. .++++. +....-+.. ..+...|..+-|.| .+ T Consensus 1 ~~r~~-----~~~~~~dr~i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rN 75 (505) T protein:vir:96 1 MKRAE-----KKPSLAQRMVNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSIN 75 (505) T ss_pred CCCCc-----cccchhhcccchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhc Confidence 22211 1112233444433221 12222111 122231 111222232 33688899999999 58 Q ss_pred cchhhhHHHHhhcceeeecCCCceEEE--EeccCCCcHHHHHHHHHHHHHHHH--------HhcchhhhhHHHHhhhhcC Q lcl|NC_021072. 66 QPECDSAVDDIVNETICGNFDDVPVEV--ELSNLKQSDKIKKLIREEFAEILR--------LLDFENRSYEIFRRWYVDG 135 (533) Q Consensus 66 ~pEvd~AvdeIvneaiv~d~~~~~v~v--~l~~~~~S~~ik~~I~eeF~~i~~--------lL~f~~~~~~~fR~WYvDG 135 (533) +|-+..||+-+++-+|=+ .+..++. +-...+..+.+.++|..+|+.-++ .++|..--...+|.|.+|| T Consensus 76 n~~a~~av~~~~~nvVG~--~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dG 153 (505) T protein:vir:96 76 NPYAKRFYQLLKNNVIGP--KGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDG 153 (505) T ss_pred ChHHHHHHHHHHHHhcCC--CcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCC Confidence 999999999999887721 1233332 233455578899999999997654 2334444566899999999 Q ss_pred ceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEecc-ceeeccchhceecc--ccccc----cccCCc Q lcl|NC_021072. 136 RLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDIN-TQLTQKAAEYYLYN--PKGLK----NSTNQG 208 (533) Q Consensus 136 ri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~-~~~~~~~~e~~~y~--p~~~~----~~~~~~ 208 (533) ..|..++.... ..- -..|+.|+|..|.--.. ....++..+..+. ..-.+.-..|+++. |.... ...... T Consensus 154 E~f~~~~~~~~-~~~-~~~lqliepd~l~~~~n--~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~hPgd~~~~~~~~~~~~ 229 (505) T protein:vir:96 154 EVLVREHRGYP-NKW-GYALQILECDRLDLNYN--ADLQNGNRIRMSIELDAWERPVAYHLLVNHPGDNSYCYHYAGQTY 229 (505) T ss_pred ceEEEEeecCC-CCc-ceEEEEechhhcCCCCC--cccCCcCeEEeceEECCCCceEEEEEeecCCCccccccccccccc Confidence 99988776531 111 24689999999852111 1111211121111 01235667888875 33211 122345 Q ss_pred ceeccchhhccccccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHh Q lcl|NC_021072. 209 MKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGR 288 (533) Q Consensus 209 ~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~ 288 (533) .+||...|.....-. .+.---++|.|+.+++.+.+|.-.+||..+-...-|-.=-+..=|.+.+.... T Consensus 230 ~rvpa~~vlH~f~~~-r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~----------- 297 (505) T protein:vir:96 230 ERVPADEIIHTFVPW-RPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQPP----------- 297 (505) T ss_pred cccCHhHhhhhhccc-CCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCcc----------- Confidence 778765443333221 22234568999999999999999999999988877765444333433332110 Q ss_pred cccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCC---------cchHHHH-HHHHHHHHHhcCCCc Q lcl|NC_021072. 289 YRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQN---------LGELEDV-KYFQKKLYKALNVPS 358 (533) Q Consensus 289 ~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~n---------Lgei~DV-~YF~~kLy~aL~VP~ 358 (533) .| ..|+ ...+ -+.|| |.+|+.|+. -+..++. +-..+.+=.+|+||- T Consensus 298 ------~~-~~~~-----~~~~-----------l~pG~-i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~y 353 (505) T protein:vir:96 298 ------ED-DQGE-----IVEE-----------VEAGT-YQLLPYGIRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAY 353 (505) T ss_pred ------cc-ccCc-----cccc-----------cCCce-eeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCH Confidence 00 0011 0000 11233 555555553 3333333 333344556899998 Q ss_pred cccCCC-CcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHH----HHHHhccCCCHhHHhhh-hhceeEEEeccch Q lcl|NC_021072. 359 SRLETE-TTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLK----TQLILKGVMSLEEWDEM-KEHIQFDFIADNY 432 (533) Q Consensus 359 sRl~~~-~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk----~qLilkgi~t~eew~~~-~~~i~~~f~~Dn~ 432 (533) +-|..+ ++.|+ |.+.-.-+.|.+.+.++|..|..-|..++- ...+|.|.+..-.+... .-...|..-.--+ T Consensus 354 e~lt~D~s~~nY---SS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~ 430 (505) T protein:vir:96 354 NRLAHDLEGVNF---SSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDW 430 (505) T ss_pred HHHhcccccccH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccc Confidence 888665 44554 223334455999999999998875555422 25678887754333211 0122333333334 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHH-HHHHHHHHhhhcCCCCCCCcccccCCCCCCCCC Q lcl|NC_021072. 433 FTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIK-EIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPP 511 (533) Q Consensus 433 f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~-e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~ 511 (533) .-.+||+.-...+++. -.-|.+-+..+ .+..-+|+- |.+...+...+.|+-+.+.....+ ....+ T Consensus 431 iDP~Ke~~a~~~~i~~---------G~~t~~~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~----~~~~~ 496 (505) T protein:vir:96 431 VDPAKDSKAHSESIKN---------RTRSRSSIIRA-AGDDPEDVFDEIAWEEQLMRDKGVNPTPPEQESK----DATTD 496 (505) T ss_pred cChHHHHHHHHHHHHc---------CCCCHHHHHHH-cCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCC----CCCCC Confidence 5566776655554442 33455666655 455544443 222323333345543322111110 01111 Q ss_pred CCCcccccc Q lcl|NC_021072. 512 ADDMSAQEG 520 (533) Q Consensus 512 ~~d~~~~~~ 520 (533) +.+.+++|. T Consensus 497 ~~~~~~~d~ 505 (505) T protein:vir:96 497 EEDDSASDD 505 (505) T ss_pred CCCCCCCCC Confidence 111111111 No 29 >protein:vir:101418 Length: 569 # NCBI annotation: Prt # Family: family:all:9458 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006480;genbank:gi:46401636;genbank:GeneID:2777482 Probab=99.01 E-value=8.3e-10 Score=70.38 Aligned_cols=473 Identities=17% Similarity=0.163 Sum_probs=266.7 Q ss_pred CCccccceeeeccccc------------------cccCCCCCCCCCcccceeeccccccc--------ccchhhhhhHHH Q lcl|NC_021072. 1 MSNQLFGFSLERAKKV------------------PKGPSFVQKDSMDGSQPIVGGGYYGY--------SVDFDGTVRNEY 54 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~------------------~~~~s~~~~~~~dg~~~~~~~~~~~~--------~~~~~~~~~~~~ 54 (533) .+..+-| ..++.+.. +....|..-+.--+..+..++.+.-. ....+- -++|. T Consensus 11 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~p~~~~~~~~~~~~~~t~~~D~~~~g~~~~~~~~~~-pr~R~ 88 (569) T protein:vir:10 11 VRKALAG-VFKDNGERDNILLSALAVHGGSGYLFSRAGAPVQLSGFLGGKPGDSGMAGDGLVDGSRFIFDEVQL-PEDRL 88 (569) T ss_pred HHHHHhh-hhhcCCccchhhhhhheeecCcceEEeecCcchhhhhhhccCccccchhhhhHHHHHHHHhhhccC-chhHH Confidence 1111111 01111110 00000000000000011111100000 001111 26899 Q ss_pred HHHHHHHhhhhcchhhhHHHHhhcceeeecCC-CceEEEEe-ccCCCcH-HHHHHHHHHHHH-HHHHhcchhhhhHHHHh Q lcl|NC_021072. 55 ELITRYREMVLQPECDSAVDDIVNETICGNFD-DVPVEVEL-SNLKQSD-KIKKLIREEFAE-ILRLLDFENRSYEIFRR 130 (533) Q Consensus 55 ~LI~~YR~m~~~pEvd~AvdeIvneaiv~d~~-~~~v~v~l-~~~~~S~-~ik~~I~eeF~~-i~~lL~f~~~~~~~fR~ 130 (533) |++..|-+|+.+|-|.+|+..=|..|.-.++. +.+|.|.- .+.+.|+ ..+++|-+|... |..+ |++.++++.+. T Consensus 89 qiY~~~eeM~~~p~Ia~AlniHVtaALggde~TGd~vfI~p~~~~~~a~~daakai~~el~~dl~~~--iNr~~~~lA~~ 166 (569) T protein:vir:10 89 QRYPLLEEMAVYSTIATALNIHITHALSFDKKTGQTFSIVPVHNGNDSDYDAAQALCGELMNDIGRT--INKEVAGWAFI 166 (569) T ss_pred HHHHHHHHHhcCchhhhhhhhhhheeecccccccceEEEEeecCCCCCcchHHHHHHHHHHHHHHHH--HHHHhhHHHHH Confidence 99999999999999999999999999988874 78888863 4444443 555577777765 3333 67899999999 Q ss_pred hhhcCceeeeeeecCCCCCCCeEEEEE---cChhhceehhhccCCCcCceeEEeccceeeccchhce-ecccccc----c Q lcl|NC_021072. 131 WYVDGRLFYHKVIDPKNPRGGLTELRY---IDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYY-LYNPKGL----K 202 (533) Q Consensus 131 WYvDGri~~hkvid~~~~~~gI~elr~---lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~-~y~p~~~----~ 202 (533) --+=|.-|-.+ | -+.++||++|.- -=|.-|++. + ..+++..++ .|.+.+. . T Consensus 167 ~~aFGdsYaRi-Y--~~~~~GV~dl~~s~yt~PsfIqpF--------------E----~g~~tvGF~~~~~~~~~~ti~~ 225 (569) T protein:vir:10 167 MSVFGVAYVRP-Y--AKEGIGITSFECSYYTLPSFIKEF--------------E----VSGNLAGFSGDYLKDASGKMVF 225 (569) T ss_pred HHhhhhhheee-e--ccCCceeEEEEecccccccccchh--------------h----hcCceEEeecccCCccccceee Confidence 88888877663 4 346899998852 114444221 1 111111111 1322221 1 Q ss_pred cccCCc--ceeccc----hhhccccccc-----c-C-------CCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccc Q lcl|NC_021072. 203 NSTNQG--MKIATD----SVTYCHSGIQ-----D-L-------NKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPER 263 (533) Q Consensus 203 ~~~~~~--~kI~~d----ai~y~hsGl~-----d-~-------~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeR 263 (533) -+++|+ +|+|.- -+.=.|+|.. + . ..+.+-|||+.|-+||-+|..-=-+|--=|+-=|.-- T Consensus 226 l~p~qm~rmKmPrm~~i~q~~~v~~g~~~~~L~~d~~~~~Pi~psn~GgSFL~~ae~pf~~l~~Al~sL~~qri~dSv~~ 305 (569) T protein:vir:10 226 ADPWAIIPMKIPYWRPKSNLMPVHTGHKAYSLLDNPEERTPIETQNYGTSLLEYAYEPYMNLRSAIRSLKATRFNASKID 305 (569) T ss_pred echhhhhhhcccceeeccccchhhhhhhheeecccccccccccchhhhhHHHHHHHhHHHHHHHHHHhccchhhHHHHHh Confidence 244554 344433 2222333332 1 1 2556679999999999999998888888899999999 Q ss_pred eEEEccCCCCchHHHHHHHHHHHHhcc---cEEEeeCCCCccccccccchhHhhhcccccCCCC-ccceeecCCCCCcch Q lcl|NC_021072. 264 RIFYIDVGNLPKNKAEQYLREVMGRYR---NKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGR-GTEISTLPGGQNLGE 339 (533) Q Consensus 264 rvfyIDvGnlpk~KAeqYl~~im~~~r---nk~vYd~~TGev~~d~~~msmlEDywLpRReggr-gTEIsTLpGg~nLge 339 (533) |+.=+-.-.|||.++.+|+|.+-.-.| ..+.-=+..|+.--.+. ---||-=+.|+ +--|+|=.+-.++-- T Consensus 306 ~~Itlnm~gM~p~qr~~y~r~lt~~LKr~~d~ie~a~~gg~~~~~~~------~H~LPv~gekq~~~tvDt~~~~A~~~g 379 (569) T protein:vir:10 306 RIIGLAMNSLDPVKAADYSRTITQTLKRAADLMERRARGANNMPTVT------NTLLPIMGDGKGQMTIDTQTIQADING 379 (569) T ss_pred HHhhccccCCCHHHHhHHHHHHHHHHHHHHHHHHHHhccCccccccc------eeeeeeecCccccccccccccccCccc Confidence 999999999999999999998765444 33332223343221111 12256666666 346776666666778 Q ss_pred HHHHHHHHHHHHHhcCCCccccCCCCccc--ccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhc--cCCCHhH Q lcl|NC_021072. 340 LEDVKYFQKKLYKALNVPSSRLETETTFN--IGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILK--GVMSLEE 415 (533) Q Consensus 340 i~DV~YF~~kLy~aL~VP~sRl~~~~~~~--~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilk--gi~t~ee 415 (533) |+||...-+.|--||++-.|-|+--...+ +|-+.-+ |.-+.=..=-+-||.-.++.|..++-.++-.| ++..++| T Consensus 380 IEdvM~~~R~LagaLGlD~SMlGwAD~LsGGLGeGG~f-rtSaQaa~RS~~iRqa~~e~in~iidiH~~fKYgevf~~~d 458 (569) T protein:vir:10 380 IEDILTYMRQLAAALGLDYTLLGWADQMSGGLGEGGFL-RTAIQAAMRASWIQQGVEEFIQRAIDIHLAFKYGKVYPEGD 458 (569) T ss_pred HHHHHHHHHHHHhhhccchhHhhHHHHhcccccccHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCcccCCCC Confidence 99999999999999999999985321111 1221110 11011111135588888888988888877665 5655555 Q ss_pred HhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHh----hhh-----cc-ccccHHHHHHHHhCCCHHHHHHHHHHHH Q lcl|NC_021072. 416 WDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTM----DPY-----VG-KYFSIDYMRRQVLKQTDQEIKEIDKQID 485 (533) Q Consensus 416 w~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~----~~~-----vG-ky~S~~~i~k~IL~~tDeeI~e~~kqi~ 485 (533) +. -+++|...+.=.+-.+.+-..+|++.+.-+ +.. .| .==-..|+.+++|+| |+.+.+ .+- T Consensus 459 rP-----~~V~F~s~~tAl~~E~~~n~~~raN~a~i~~Q~la~l~e~n~Lg~de~~m~y~l~d~~~~-De~~~e---~l~ 529 (569) T protein:vir:10 459 RP-----YKIEFHSVNTALQQEHNDNRDSQANYATIVTQILDAVSNNSVLANSDAFKRYLFSDVLEI-DEKISE---ALV 529 (569) T ss_pred cc-----eEEEeccchHHHHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccHHHHHHHHHHHhhc-chhHHH---HHH Confidence 55 589999999999988888888998876544 111 11 100124566677777 332221 223 Q ss_pred HhhhcCCCCCCCcccccCCCCCCCCCC-----------CCccccc Q lcl|NC_021072. 486 SEREAGLIVDPMAEMDPAMDPGNAPPA-----------DDMSAQE 519 (533) Q Consensus 486 ~E~~~~~~~~p~~~~~~~~~~~~~~~~-----------~d~~~~~ 519 (533) .|. .+.|.+++. -|++.--.|+ .+++.++ T Consensus 530 ae~----~akp~DEe~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 569 (569) T protein:vir:10 530 NEL----KAKSEDDDH-LMDSIIKTPPQELAQILESVFKEGNDND 569 (569) T ss_pred hhc----CCCcchhHH-HHHHHhcCChHHHHHHHHHHhhccCCCC Confidence 333 233444422 1222111111 1222222 No 30 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=98.91 E-value=1.7e-09 Score=68.59 Aligned_cols=442 Identities=14% Similarity=0.110 Sum_probs=197.0 Q ss_pred CCccccce---eeeccccccccCCCCCCCCC--cccce-e--ecccccccccchhhhhhHHHHHHHHHHhh-hhcchhhh Q lcl|NC_021072. 1 MSNQLFGF---SLERAKKVPKGPSFVQKDSM--DGSQP-I--VGGGYYGYSVDFDGTVRNEYELITRYREM-VLQPECDS 71 (533) Q Consensus 1 ~~~~~fg~---~i~~~~~~~~~~s~~~~~~~--dg~~~-~--~~~~~~~~~~~~~~~~~~~~~LI~~YR~m-~~~pEvd~ 71 (533) -|.-...+ .+++.=++.+..+|--.... ++... . .+-.+.+..++ ...+-+++-++.+ +.++-|.. T Consensus 55 ~~~~~~~~~~~~~~~~~kk~~i~~pfkkk~~~~~~d~f~~s~es~s~vtsls~-----pdaf~~vnVs~~~AlknsaV~s 129 (945) T protein:vir:10 55 NSTVVYSIIIFRKNQVLKKEKIIVPYNHQEPPFKFNLFEYSPESLMYLPSISD-----PDAFFLINLFRKYRFNNDSKLI 129 (945) T ss_pred cceeeeeeeeehhhhHHHhhcccccccccccchhhhhhhccCccceecccccC-----ccceeeehhhhhhhhccHHHHH Confidence 22222333 33443233444433211111 01000 0 00001111110 0122234445555 56788999 Q ss_pred HHHHhhcceeeecCCCceEEEE--eccCCCcHHHHHHHHHHHHHHHHHh---cchhhhhH--------HHHhhhhcCcee Q lcl|NC_021072. 72 AVDDIVNETICGNFDDVPVEVE--LSNLKQSDKIKKLIREEFAEILRLL---DFENRSYE--------IFRRWYVDGRLF 138 (533) Q Consensus 72 AvdeIvneaiv~d~~~~~v~v~--l~~~~~S~~ik~~I~eeF~~i~~lL---~f~~~~~~--------~fR~WYvDGri~ 138 (533) ||+-|.+.+-- .|+.+- ..+......++. + .....++++| |-.-.+.+ +++.+++.|.-| T Consensus 130 cI~~IA~sIAs-----LPlklYrr~edG~~~~~~kk-~-~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAY 202 (945) T protein:vir:10 130 KVSEIPKKLTS-----KELEIYKHIEDKHVNYYLKR-I-RDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILTIDRGA 202 (945) T ss_pred HHHHHHhhhcc-----CceEEEEecccCcccccccc-c-ccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhhcCCeE Confidence 99999887542 455552 222222222221 1 1233455555 32223333 446678889999 Q ss_pred eeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceecccccc---ccccCCcceeccch Q lcl|NC_021072. 139 YHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGL---KNSTNQGMKIATDS 215 (533) Q Consensus 139 ~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~---~~~~~~~~kI~~da 215 (533) +.++-| ..+.+++|.++||.+++.++.- ++...+. |.+...+. .+.+.+.+.+.+. T Consensus 203 ieIiRd---~~G~ii~L~pLdPs~Vti~~dd-----DG~~~y~------------Yv~~idG~~~~~v~a~DvIlhirn- 261 (945) T protein:vir:10 203 IVKIRD---EQGNLVAITPVDGTTIKPILSE-----DTGIVVG------------YVQEVDGAIVAHFDKRDVVLFRQN- 261 (945) T ss_pred EEEEEC---CCCcEEEEEEECCcceEEEEcC-----CCcEEEE------------EEEecCCceEEEecCCceEEEecc- Confidence 997765 3456899999999999764332 1111110 11111110 1111122221111 Q ss_pred hhccccccccCCCCccchhHHHHHHHHHHHHHHHHH-HHHHHHhcCccceEEEccCCCC---------chHHHHHHHHHH Q lcl|NC_021072. 216 VTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDS-LVIYRLSRAPERRIFYIDVGNL---------PKNKAEQYLREV 285 (533) Q Consensus 216 i~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~EDa-lVIyRi~RAPeRrvfyIDvGnl---------pk~KAeqYl~~i 285 (533) +... +...+.++|-|..|.+.+.....++.. .-.|+--.|.-+-+..++.++. .+..+++ +++. T Consensus 262 --~s~D---G~~~GyGlSPIeaa~~aI~~alAaek~aar~FskNGa~PsGILsvkg~~~~d~k~~~~LseEq~er-lKe~ 335 (945) T protein:vir:10 262 --LTPD---VYMYGYSLPPIEILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYKEGDIYPQLSREQLES-IQRQ 335 (945) T ss_pred --CCCC---cccccCCchHHHHHHHHHHHHHHHHHHHHHHHHhCCCccceEEEecCccccccccccccCHHHHHH-HHHH Confidence 0011 112345678899999988776666554 4445545677788998887643 2222222 3333 Q ss_pred HHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCC Q lcl|NC_021072. 286 MGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETE 364 (533) Q Consensus 286 m~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~ 364 (533) +.+... |. .+... + ++ ..|.+++.|.....-.+ ++-..|..+...++.+||...|+.. T Consensus 336 wee~~s--------G~-NnG~p-i-VL----------deGmef~pLs~s~~DaQfLEsrkfs~eeIArAFGVPP~lLG~~ 394 (945) T protein:vir:10 336 LQAIMM--------GD-YTQVP-I-LS----------GGKFTWIDFKGKRRDMQFKELAEFVARKICAVYQVSPQDVGIL 394 (945) T ss_pred HHHHhC--------Cc-ccccc-e-ec----------CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccC Confidence 332211 21 11111 1 12 23577777754322222 3445667788999999999999765 Q ss_pred CcccccchhhhhHHhhhHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHH Q lcl|NC_021072. 365 TTFNIGRAAEITRDEVKFQKF-IARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRN 443 (533) Q Consensus 365 ~~~~~g~~~eItRDElkF~Kf-i~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~ 443 (533) ++.+.....+..+. |..+ +..+..++...+...|... . ....+.++|..+..-.. . T Consensus 395 e~st~SNiEqq~~~---Fv~~tL~Pil~~IEqeLNrkLl~~---------~----eg~~i~fdFd~ldl~D~-------k 451 (945) T protein:vir:10 395 EGSNKATAEVMASL---TKAKGLEPLMATISKGFDEVVSEF---------R----NEKDIKLWFKEDDLEKE-------R 451 (945) T ss_pred CCCCcchHHHHHHH---HHHHHHHHHHHHHHHHHHHhcccc---------c----cCceeEEEecchhccCH-------H Confidence 54443334443333 7654 7777777776666544211 1 13457888876654322 3 Q ss_pred HHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHH--------HHHh---hhcCCCCCCCcccccCCCCCCCCCC Q lcl|NC_021072. 444 ERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQ--------IDSE---REAGLIVDPMAEMDPAMDPGNAPPA 512 (533) Q Consensus 444 ~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kq--------i~~E---~~~~~~~~p~~~~~~~~~~~~~~~~ 512 (533) .|.++++.+-.- -++|.+-+++. +++.+-+ --++. ...+ .+.+.-+ |.....++.++...+.. T Consensus 452 sraEal~kli~s--GiLTiNEvRe~-lGLpPIe--GGD~lli~~nn~~P~d~~~ka~~ga~p-~q~aq~~~dqp~~kGGe 525 (945) T protein:vir:10 452 DWWNIIQGQLNT--GFRSINEARME-KGLEPVP--WGDVPFSGLRNWKPEDEQAKAQQGAMP-PQLAQAMADQPSQQGGG 525 (945) T ss_pred HHHHHHHHHHhC--CCcCHHHHHHH-hCCCCCC--CcceeeeccccccccccccccccCCCC-cccccCCCCCCCCCCCC Confidence 566655544322 47788887754 6765431 00000 0000 0001011 11000001000000000 Q ss_pred CCccccccccCCccccchhcC Q lcl|NC_021072. 513 DDMSAQEGPAVDAGDAKRGEF 533 (533) Q Consensus 513 ~d~~~~~~~~~~~~~~~~~~~ 533 (533) +|.+.. .| -++.+...+++ T Consensus 526 ~dEns~-~p-sE~kda~~e~~ 544 (945) T protein:vir:10 526 VDENSS-VP-SEQKNAGLEVL 544 (945) T ss_pred CCCCCC-CC-CcccchHHHHH Confidence 111000 00 01111111111 No 31 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=98.86 E-value=2.5e-08 Score=62.26 Aligned_cols=480 Identities=11% Similarity=0.098 Sum_probs=239.9 Q ss_pred eeccc-cccccCCC--------CCCCCCccccee--ecccccccccchhhhhh-HHHHHHHHHHhh-hhcchhhhHHHHh Q lcl|NC_021072. 10 LERAK-KVPKGPSF--------VQKDSMDGSQPI--VGGGYYGYSVDFDGTVR-NEYELITRYREM-VLQPECDSAVDDI 76 (533) Q Consensus 10 i~~~~-~~~~~~s~--------~~~~~~dg~~~~--~~~~~~~~~~~~~~~~~-~~~~LI~~YR~m-~~~pEvd~AvdeI 76 (533) ..+.. ..-...++ ......+|+-.. ..+++.+...+.+..+. +...|..+-|.+ .++|-+.+||+-+ T Consensus 1 m~~~~~r~~~~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~~ 80 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGYQ 80 (553) T ss_pred CcchhhhhhcccccccchhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 11100 00000011 111233333211 13344444444444444 577899999999 7899999999999 Q ss_pred hcceeeecCCCceEEEEecc-------CCCcHHHHHHHHHHHHHHH----------HHhcchhhhhHHHHhhhhcCceee Q lcl|NC_021072. 77 VNETICGNFDDVPVEVELSN-------LKQSDKIKKLIREEFAEIL----------RLLDFENRSYEIFRRWYVDGRLFY 139 (533) Q Consensus 77 vneaiv~d~~~~~v~v~l~~-------~~~S~~ik~~I~eeF~~i~----------~lL~f~~~~~~~fR~WYvDGri~~ 139 (533) ++-+|= .+..++-..+. -+..+.+.++|..+|..-. ..++|..--...+|.|.+||-.|. T Consensus 81 ~~nvVG---~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~ 157 (553) T protein:vir:63 81 RDSIVG---AQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLA 157 (553) T ss_pred HHhhcc---CCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEE Confidence 988873 13333333221 1335678888898898654 445565556678899999999999 Q ss_pred eeeecCCCCCCC--eEEEEEcChhhceehhhccCCCcCceeEEeccc-eeeccchhceecc--cccccccc---C----- Q lcl|NC_021072. 140 HKVIDPKNPRGG--LTELRYIDPRKIRKVTEYQQKRPEQLRGEDINT-QLTQKAAEYYLYN--PKGLKNST---N----- 206 (533) Q Consensus 140 hkvid~~~~~~g--I~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~-~~~~~~~e~~~y~--p~~~~~~~---~----- 206 (533) ..+..+. .++ -..|+.|||..|.--.. . +++..+..+.- .-.+.-..|++++ |....... . T Consensus 158 ~~~~~~~--~~~~~~~~lq~ie~drl~~~~~---~-~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~~~~~~~r~ 231 (553) T protein:vir:63 158 TAEWDRA--ANRPYATCFQMVSTDRLSNPYQ---Q-LDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMAPDMYKWKFV 231 (553) T ss_pred EeeeccC--CCCcccceEEEechhhcCCCCC---C-CCCCeeEeeeEECCCCceEEEEeeccCCCccccccccccceeee Confidence 9888642 222 25789999998853211 1 12212211110 1235667888875 33222111 1 Q ss_pred -Ccceeccchhhccccccc-cCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHH Q lcl|NC_021072. 207 -QGMKIATDSVTYCHSGIQ-DLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLRE 284 (533) Q Consensus 207 -~~~kI~~dai~y~hsGl~-d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~ 284 (533) ...+++...| .|--.. .+.---++|.|+.+++.+.+|.-.+||...-...-|-. +.+|-.+ .|...+-+.+.. T Consensus 232 ~~~~~v~a~~v--lH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~--a~fi~~~-~~~~~~~~~~~~ 306 (553) T protein:vir:63 232 QQSKPWGRRQV--IHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASY--AAAIESE-LPPEFIHSQMSG 306 (553) T ss_pred ccccccChhHh--eecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhh--eeeeecC-CChhhhhhhccc Confidence 1234554444 443332 23345578999999999999999999999998888866 3333332 243333322221 Q ss_pred HHHhcccEEEeeCCCCccccccccchhHhhhccccc----CCC------CccceeecCCCCCcch-HHHHHHHHHHHHHh Q lcl|NC_021072. 285 VMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRR----EGG------RGTEISTLPGGQNLGE-LEDVKYFQKKLYKA 353 (533) Q Consensus 285 im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRR----egg------rgTEIsTLpGg~nLge-i~DV~YF~~kLy~a 353 (533) --.. ...+|- ....+.-+..+.-..+ ++| -|.+|..+.....-+. .+=++...+.+=.+ T Consensus 307 ~~~~-------~~~~~~---~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaag 376 (553) T protein:vir:63 307 GSPN-------ADMVGI---FGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASA 376 (553) T ss_pred cccc-------cccccc---ccccccccccccccccceeecCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhh Confidence 0000 000000 0000100111110000 111 1333444443333333 23445556666678 Q ss_pred cCCCccccCCC-CcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHH----HHHHhccCCCHhHHh-h-------h- Q lcl|NC_021072. 354 LNVPSSRLETE-TTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLK----TQLILKGVMSLEEWD-E-------M- 419 (533) Q Consensus 354 L~VP~sRl~~~-~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk----~qLilkgi~t~eew~-~-------~- 419 (533) |+||-+-|..+ ++.|. |.+.-.-+.|-+.+.++|..|..-|..++- ...+|.|-|..-.+. . . T Consensus 377 lGi~Ye~lt~D~s~~nY---SS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~ 453 (553) T protein:vir:63 377 FGMSYEEFTRDFSKANY---SSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMK 453 (553) T ss_pred cCCCHHHHhhhcccccH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhh Confidence 99999988766 45555 233445566999999999988877776643 355788876432211 0 0 Q ss_pred hh--ceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHH-HHHHHHHHhhhcCCCCCC Q lcl|NC_021072. 420 KE--HIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIK-EIDKQIDSEREAGLIVDP 496 (533) Q Consensus 420 ~~--~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~-e~~kqi~~E~~~~~~~~p 496 (533) .. ...+..-.-.+.--+||+.-...+++. -+-|.+-+..+ ++..-+++. |.....+.-.+.|+..+. T Consensus 454 ~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~---------G~~t~~~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl~~~~ 523 (553) T protein:vir:63 454 EALSKCEWIGASQGQIDQLKETQAAVMRIDA---------GLSTYEREIAR-LGGDFRKSFAQRAREDALLKKYGLTFNL 523 (553) T ss_pred hhhhceeeecCCccccChHHHHHHHHHHHHc---------CCCCHHHHHHH-hCCCHHHHHHHHHHHHHHHHHcCCCCCC Confidence 01 123333344445566766655544432 23455555555 355554444 223333333445543321 Q ss_pred CcccccCCCCCCCCCCCCccccccccCCccccch Q lcl|NC_021072. 497 MAEMDPAMDPGNAPPADDMSAQEGPAVDAGDAKR 530 (533) Q Consensus 497 ~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 530 (533) ... ...+..........++.++-+..++.| T Consensus 524 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 524 SAK----RSLGDGRDAATGIAEDPAAAQTSQQGE 553 (553) T ss_pred CCc----cccCCCcccCCCCCCCCCCCCcccccC Confidence 111 111111111111112222222222222 No 32 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=98.84 E-value=3e-08 Score=61.79 Aligned_cols=454 Identities=11% Similarity=0.114 Sum_probs=197.3 Q ss_pred CCccccceeeecccccc-------ccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHH Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVP-------KGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAV 73 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~-------~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~Av 73 (533) -...-=+++|+..+... .+.+.+-..-..|+. .++.....-+.......|-+..+..+.+|.|..|| T Consensus 24 ~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~------~~~~~~~~r~~~~~~~~l~~~~~~~~~npiv~~~I 97 (551) T protein:vir:80 24 HIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSM------SANPGFKTKPSIRNNQDLHGVLKKFGGNIILNAII 97 (551) T ss_pred ccccccceeeecccccHHHHHHhhccCcceeecccccce------ecCcccccCccccChhHHHHHHHHhhcCHHHHHHH Confidence 00011122222211111 001111100111111 11111112233445556666666778899999999 Q ss_pred HHhhcceeee------cCCCceEEEEeccCC--CcHHHHHHHHHHHHHH---HHHhcch-----hhhhHHHHhhh----h Q lcl|NC_021072. 74 DDIVNETICG------NFDDVPVEVELSNLK--QSDKIKKLIREEFAEI---LRLLDFE-----NRSYEIFRRWY----V 133 (533) Q Consensus 74 deIvneaiv~------d~~~~~v~v~l~~~~--~S~~ik~~I~eeF~~i---~~lL~f~-----~~~~~~fR~WY----v 133 (533) +.|++.+-++ ..++.+..|.+.+.+ .++.-+. +++.| ++..+.. ....++++.|. + T Consensus 98 ~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~----~~~~i~~~l~~pn~~~~p~~~s~~~f~~~lv~dlll 173 (551) T protein:vir:80 98 NTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEA----TIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYM 173 (551) T ss_pred HHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHH----HHHHHHHHHHhcCCCCCCccchHHHHHHHHHHHHHh Confidence 9999874432 245566667664422 2222222 33333 3333433 24445665554 5 Q ss_pred cCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceecc Q lcl|NC_021072. 134 DGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIAT 213 (533) Q Consensus 134 DGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~ 213 (533) -|.-|+.++-|. .+-+++|.+|||.+|+.+....+.......++. .+.+.+ ..+.++. T Consensus 174 ~Gnay~~i~rd~---~G~~~~L~~l~p~~V~v~~~~~g~~~~~~~~y~-------------~~~~g~------~~~~~~~ 231 (551) T protein:vir:80 174 YDQVNFEKVFNR---NQSMVRFVAKDPTTIFFATTADGKIPDNGNRFV-------------QVIDQK------IVATFNA 231 (551) T ss_pred cCCEEEEEEECC---CCcEEEEEEeCCceeEEEECCccccccCceEEE-------------EEeCCc------EEEEEcc Confidence 699999988865 445999999999999875443332211111111 011110 1112222 Q ss_pred chhhccccc-ccc-CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccC-CCCchHHHHHHHHHHHHhcc Q lcl|NC_021072. 214 DSVTYCHSG-IQD-LNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDV-GNLPKNKAEQYLREVMGRYR 290 (533) Q Consensus 214 dai~y~hsG-l~d-~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDv-Gnlpk~KAeqYl~~im~~~r 290 (533) +-+.|.+.. +-+ ..+..++|-|+.|+..+.....++....=+----|--+-|..+.. ++|.+..+++.-+.+...|. T Consensus 232 ~eiiH~~~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~ 311 (551) T protein:vir:80 232 REMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLS 311 (551) T ss_pred cceEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhc Confidence 222222110 111 113346688999999999988888876544333344555666654 34555544444444433342 Q ss_pred cEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHH---HHHHHHHHHHhcCCCccccCCCC-- Q lcl|NC_021072. 291 NKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELED---VKYFQKKLYKALNVPSSRLETET-- 365 (533) Q Consensus 291 nk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~D---V~YF~~kLy~aL~VP~sRl~~~~-- 365 (533) -- ...|.+ .++. +.|.++..|. .+..++.- .+|..+..-++.+||...|+-.+ T Consensus 312 G~----~nag~~-------~vl~---------~~g~~~~~l~--~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~ 369 (551) T protein:vir:80 312 GI----NGSWQI-------PVVS---------AEDVKFVNMT--PSARDMEFEKWLNYLINVISALYGIDPAEINIPNNG 369 (551) T ss_pred Cc----cccCcc-------cccc---------CCCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhcCCHHHcCccccc Confidence 10 011211 1221 1134555553 23333333 45577889999999999997432 Q ss_pred cccccchhhhh-----HHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHH Q lcl|NC_021072. 366 TFNIGRAAEIT-----RDEVKFQK-FIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEI 439 (533) Q Consensus 366 ~~~~g~~~eIt-----RDElkF~K-fi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~ 439 (533) +..-..++.++ -....|.. -+.-+..++...|...|-+. ....+.|+|.....-.+ T Consensus 370 ~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~L~~~--------------~~~~~~f~f~~~~~~~~---- 431 (551) T protein:vir:80 370 GATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIVAE--------------FGDKYTFQFVGGDIKSE---- 431 (551) T ss_pred ccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc--------------cCCceEEEeeccChhhH---- Confidence 11100111111 11122433 35556665555555443221 12346777775543332 Q ss_pred HHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCH-HHH----------H-----HHHHHHHHhhhcCCCCC-------- Q lcl|NC_021072. 440 EIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTD-QEI----------K-----EIDKQIDSEREAGLIVD-------- 495 (533) Q Consensus 440 Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tD-eeI----------~-----e~~kqi~~E~~~~~~~~-------- 495 (533) .+|..+...+. .-++|..-++. .+++.. .+- . ...++.+.+........ T Consensus 432 ---~~~~~~~~~~~---~g~lT~NE~R~-~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 504 (551) T protein:vir:80 432 ---LESVKILAEKA---KVAMTVNEVRK-ELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNR 504 (551) T ss_pred ---HHHHHHHHHHh---cCCcCHHHHHH-HhCCCCCCCCCceeecccccccccccccccCcchhhhhhccccccCcCCCC Confidence 23444333322 23578888885 477754 110 0 00001111111111110 Q ss_pred --CCcccccCCCCCCCCCCCCccccccccCCcc-ccchhcC Q lcl|NC_021072. 496 --PMAEMDPAMDPGNAPPADDMSAQEGPAVDAG-DAKRGEF 533 (533) Q Consensus 496 --p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~-~~~~~~~ 533 (533) |..+..|+..+.....++|+...+....+++ +.-.+.+ T Consensus 505 ~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 545 (551) T protein:vir:80 505 VSTDVEDIPDGKDTTGDIGKDGQRKDKDNANAGKQGMKGDK 545 (551) T ss_pred CCCCCCCCCCccccCCCccccccccCccccchhhhhcCCCC Confidence 0111111110000111112222222222221 2222222 No 33 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=98.83 E-value=3.2e-08 Score=61.69 Aligned_cols=461 Identities=11% Similarity=0.129 Sum_probs=193.2 Q ss_pred CCc---ccccee----------------eeccccccccCCCCC----CCCCcccceeecccc-----cccccchhhhhhH Q lcl|NC_021072. 1 MSN---QLFGFS----------------LERAKKVPKGPSFVQ----KDSMDGSQPIVGGGY-----YGYSVDFDGTVRN 52 (533) Q Consensus 1 ~~~---~~fg~~----------------i~~~~~~~~~~s~~~----~~~~dg~~~~~~~~~-----~~~~~~~~~~~~~ 52 (533) |.. +.+|++ ....++.-...+... .+..++...+..+.+ ++......+..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (574) T protein:vir:80 1 MPKWLDKALGIEKSSIEETRNMENYKMHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRN 80 (574) T ss_pred CcchhhhhhccchhhHHHHHhhhhhccccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCcCccCC Confidence 111 111211 000000000000000 011111111111100 0011112233444 Q ss_pred HHHHHHHHHhhhhcchhhhHHHHhhcceeee------cCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhc-----ch Q lcl|NC_021072. 53 EYELITRYREMVLQPECDSAVDDIVNETICG------NFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLD-----FE 121 (533) Q Consensus 53 ~~~LI~~YR~m~~~pEvd~AvdeIvneaiv~------d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~-----f~ 121 (533) ...|-..-+..+..|.|..+|+.+++.+.+| +..+.|+.|.+.+.+.... .+...+...+.++|. ++ T Consensus 81 ~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~--~~~~~~~~~l~~ll~~~~~~~n 158 (574) T protein:vir:80 81 SQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPT--SHDIANIKRIESFLENTAQFRD 158 (574) T ss_pred cccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCcc--chhhhhhhHHHHHHhccCCCCC Confidence 4444444555677899999999998875544 3457888887644322111 112234555666552 11 Q ss_pred ---hhhhHHH----HhhhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhce Q lcl|NC_021072. 122 ---NRSYEIF----RRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYY 194 (533) Q Consensus 122 ---~~~~~~f----R~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~ 194 (533) ....+++ +.+++-|.-|+.++-+. .+-|++|.+|||.+|+.++............+. T Consensus 159 P~~~s~~ef~~~lv~~lll~Gnayi~i~r~~---~G~~~~L~pl~p~~V~v~~d~~~~~~~~~~~y~------------- 222 (574) T protein:vir:80 159 PNRDNFTTFCKKLVRATYMYDQVNFEKVFDK---DGNFIKFDTVDPTTIFLATNGEGKLIKNGERFV------------- 222 (574) T ss_pred CccccHHHHHHHHHHHHHhcCCeEEEEEECC---CCcEEEEEEEcCceeEEEEcCccccccCceEEE------------- Confidence 1223343 34567799999988865 456999999999999876654433222111111 Q ss_pred ecccccc--ccccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCC Q lcl|NC_021072. 195 LYNPKGL--KNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGN 272 (533) Q Consensus 195 ~y~p~~~--~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGn 272 (533) .+...+. .+.....++|+.... + ....++.++|-|+.|+.++.....+++...=+=---+--+-|..++.+. T Consensus 223 ~~~~g~~~~~~~~~eiih~~~~~~----~--~~~~~~~G~spi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~ 296 (574) T protein:vir:80 223 QVIDNRIVAKFNERELAFAVRNPR----A--DIEVGQYGYPELEIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQ 296 (574) T ss_pred EEeCCceEEEEccccEEEEeccCC----C--CcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC Confidence 1111110 011222233322110 0 0112345678899999999988888887655444446667777787654 Q ss_pred -CchHHHHHHHHHHHHh-cccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHH---HHHHHH Q lcl|NC_021072. 273 -LPKNKAEQYLREVMGR-YRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELE---DVKYFQ 347 (533) Q Consensus 273 -lpk~KAeqYl~~im~~-~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~---DV~YF~ 347 (533) |-+..++. +++-+.. |.- . ...|. . .++++ .|.++.-|. .+..++. --+|.. T Consensus 297 ~ls~e~~~~-lk~~~~~~~~G-~---~n~g~------~-~vl~~---------~G~~~~~l~--~s~~D~qfle~~~~~~ 353 (574) T protein:vir:80 297 QQSQQALDI-FRREWRSSLAG-I---NGSWQ------I-PVVSA---------EDVKFVNMT--PSANDMQFEKWLNYLI 353 (574) T ss_pred CCCHHHHHH-HHHHHHHHhcc-c---ccccc------c-eeecC---------CCceEEEcc--CChhHHHHHHHHHHHH Confidence 44443332 3333322 221 0 01111 1 12211 245566554 2333333 345588 Q ss_pred HHHHHhcCCCccccCCCCcccc-c-chhhhhHH--h---hhHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhh Q lcl|NC_021072. 348 KKLYKALNVPSSRLETETTFNI-G-RAAEITRD--E---VKFQKF-IARLRKRFSELFMDLLKTQLILKGVMSLEEWDEM 419 (533) Q Consensus 348 ~kLy~aL~VP~sRl~~~~~~~~-g-~~~eItRD--E---lkF~Kf-i~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~ 419 (533) +...++.+||...|+-.+.-.+ | .++..++. | +.|..+ +.-+..++...|...| + ++.+ T Consensus 354 ~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~L----l-----~~~~---- 420 (574) T protein:vir:80 354 NVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTYI----V-----AEFG---- 420 (574) T ss_pred HHHHHHhCCCHHHhcccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhh----h-----hhcC---- Confidence 8899999999999975432111 1 11111111 1 124443 3444444444443322 2 2211 Q ss_pred hhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHH----------HHHHHHH----- Q lcl|NC_021072. 420 KEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEI----------KEIDKQI----- 484 (533) Q Consensus 420 ~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI----------~e~~kqi----- 484 (533) ..+.+.|.....=+ +.+....+.....-++|..-++. .++|.+-+= .....+. T Consensus 421 -~~~~~~f~~~d~~~----------~~~~~~~~~~~~~G~lT~NE~R~-~lgl~Pi~gGD~~~~~~n~~~~~~~~~~~~~ 488 (574) T protein:vir:80 421 -EKYQFQFRGGDLSA----------QLDKLKIIEQEGKVFRTVNEIRH-DKGLEPIKGGDVILNGVHIQAIGQALQEEQL 488 (574) T ss_pred -CceEEEecccchhh----------HHHHHHHHHHHhCCccCHHHHHH-HhCCCCCCCCCEeeeccceeecccccccccC Confidence 34677777554322 12222112222234788888885 477754320 0000000 Q ss_pred HHhhhcCCCCCCCc------ccccCCCCCCCCCCCCccccccccCCcc--ccchhcC Q lcl|NC_021072. 485 DSEREAGLIVDPMA------EMDPAMDPGNAPPADDMSAQEGPAVDAG--DAKRGEF 533 (533) Q Consensus 485 ~~E~~~~~~~~p~~------~~~~~~~~~~~~~~~d~~~~~~~~~~~~--~~~~~~~ 533 (533) +.+...+....|.. +..+..++.+...+++.+++++++.-.+ +...+.| T Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 545 (574) T protein:vir:80 489 EYQRSQDRLNRLLELSGGDVEQPEPEEPKDSQNDTDVSFQDEQQGLNGKSKKVNGKV 545 (574) T ss_pred CccchhccccccccccCCCCCCCCCCCCCCccccccchhhhhhhhhccchhhhcCCc Confidence 00000010100000 0000000000111111222222111000 0000111 No 34 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=98.83 E-value=3.2e-08 Score=61.68 Aligned_cols=422 Identities=12% Similarity=0.102 Sum_probs=180.3 Q ss_pred CCccccceeeecccccc-ccCCCCCCCCCcc---cceee--cccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHH Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVP-KGPSFVQKDSMDG---SQPIV--GGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVD 74 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~-~~~s~~~~~~~dg---~~~~~--~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~Avd 74 (533) |+ +|++++... ...+..-.+-.-| .+.-. ..+++++ +..-+..++-..|+ .++.+..+|+ T Consensus 1 ~~------~~~~a~~~~~~~~a~~~~~~~~~~g~~~~~d~~~~~~~~~-----~~~~~~~~l~~lY~---~~~l~r~iVd 66 (461) T protein:vir:80 1 MY------SIDKAKQAKIDSKIVNRNDFMVGHGKANSRDKLTRQTPGN-----GQKLDLKACENLYA---SNSIAMNIVD 66 (461) T ss_pred Cc------cchhhhhhhhhhhhhhhhHHHhhcCCcchhhhhhccccCc-----ccccCHHHHHHHHH---hCCccchhhc Confidence 43 455543221 1111110000000 00000 1111111 11123445556665 7899999999 Q ss_pred HhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCC------ Q lcl|NC_021072. 75 DIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNP------ 148 (533) Q Consensus 75 eIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~------ 148 (533) -++.+|+. ..+.+..++ ++..++|.++|+. |++..+..+.+|.=-+.|.-++-..+...++ T Consensus 67 ~~a~d~~r-----~g~~i~~~~----~~~~~~~~~~~~~----l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~ 133 (461) T protein:vir:80 67 IISEDMVR-----AGWSLKTDN----KEMKKNIESKWRK----LKTKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLS 133 (461) T ss_pred cchHHhhc-----CCeeeecCC----HHHHHHHHHHHHH----hhHHHHHHHHHHhhcccccEEEEEEeecCCccccCcc Confidence 99999985 345665443 5666667777665 4556666666655444554333322321111 Q ss_pred ----CCCeEEEEEcCh---hhceehhhccCCCcCceeEEeccceeeccchhceeccccc-------cccccCCcceeccc Q lcl|NC_021072. 149 ----RGGLTELRYIDP---RKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKG-------LKNSTNQGMKIATD 214 (533) Q Consensus 149 ----~~gI~elr~lDP---~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~-------~~~~~~~~~kI~~d 214 (533) ++++..+.+|+| ..|.. ..+ ..++ ..+.++.-.. |..+... .........+|+.+ T Consensus 134 ~pl~~~~~~~~~~l~~~~~~~i~~-~~~-~~dp--------~sp~fg~P~~-y~i~~~~~~~~~~~~~~~~~~~~~iH~S 202 (461) T protein:vir:80 134 TAIDPKTIKSIPYINTFNTQKVTQ-LYL-NQDM--------FSEHFGEVEF-FEVNRVSQLGEEILSGTTASTSEQIHRS 202 (461) T ss_pred CCcccccccceeEEEeccccccch-hhh-cccC--------cCcccccceE-EEEeccccccccccccccCccceEEccc Confidence 233344444443 33321 111 1111 1112222221 1111110 01122234556665 Q ss_pred hhhccccccccCCCCccchhHHHHHHHHHHHHHHHH--HHHHHHHhcCccceEEEccCC-CCchHHHHHHHHHHHHhccc Q lcl|NC_021072. 215 SVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIED--SLVIYRLSRAPERRIFYIDVG-NLPKNKAEQYLREVMGRYRN 291 (533) Q Consensus 215 ai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~ED--alVIyRi~RAPeRrvfyIDvG-nlpk~KAeqYl~~im~~~rn 291 (533) -+.-...+- .++...+.|.|+++...+..+..... +.++++ +-.+ +|.+|.- .+......+. ...+..+++ T Consensus 203 Rii~~~~~~-~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~---~~~~-v~k~~~l~~~~~~~~~~~-~~~~~~~~~ 276 (461) T protein:vir:80 203 RIIHEQGLR-FEGETKGRSIFESLYDIITVMDTSLWSVGQILYD---FAFK-VYKTDDIDALNKDDKANL-TAMLDFMFR 276 (461) T ss_pred cEEEecCCC-CCccccCcchHHHHHHHHHHHHHHHHHHHHHHHH---hCCC-ceecchHHhhhchHHHHH-HHHHHHhcC Confidence 443332221 12223357899887766655543332 223333 4333 4555421 1111222222 222333221 Q ss_pred EEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCCC-cccc Q lcl|NC_021072. 292 KLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELED-VKYFQKKLYKALNVPSSRLETET-TFNI 369 (533) Q Consensus 292 k~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~~-~~~~ 369 (533) .+|-+- + + .+-+++++- .+|+-++| +..|...+-.+.+||+.+|-.++ |.+ T Consensus 277 ------~~g~~~-------------~----d-~~e~~e~~~--~~lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~- 329 (461) T protein:vir:80 277 ------TEALAI-------------I----K-GDEQLTKES--TNVSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTL- 329 (461) T ss_pred ------CceEEE-------------E----c-CCcceEEEe--cCcCCHHHHHHHHHHHHhhhhcCCeeeeecccCCcc- Confidence 122100 0 0 011122221 13445555 46888899999999999984432 332 Q ss_pred cchhhhhHHhhhHHHHHHHHHHH-HHHHHHHHHHHHHH----hccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHH Q lcl|NC_021072. 370 GRAAEITRDEVKFQKFIARLRKR-FSELFMDLLKTQLI----LKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNE 444 (533) Q Consensus 370 g~~~eItRDElkF~Kfi~rLr~~-fs~if~d~Lk~qLi----lkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~ 444 (533) .-+.+ |.-.|+.+|.++|.. +..+...+++.=+. +..++.++. ..+.|.|..=..-+|...+|++.. T Consensus 330 asge~---D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~-----~~~~i~f~~L~~~s~kekAe~~~~ 401 (461) T protein:vir:80 330 TGAQY---DVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDS-----FEWAIEFNPLWNLDSKTDAEVRKL 401 (461) T ss_pred ccchH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccc-----cceEEEeCCCCCCCHHHHHHHHHH Confidence 11221 223499999999964 55555555543211 111222221 246788877767788888999999 Q ss_pred HHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcc---cccCCCCCCCCCCCCccccccc Q lcl|NC_021072. 445 RMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAE---MDPAMDPGNAPPADDMSAQEGP 521 (533) Q Consensus 445 R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~---~~~~~~~~~~~~~~d~~~~~~~ 521 (533) +.++++.+-.- -. +|.+|+.+... ... + .+|+.. .+++-+.......+....+++. T Consensus 402 ~a~a~~~~~~~--g~------------is~~e~r~~l~---~~~--~--~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 460 (461) T protein:vir:80 402 TAEADQIYIVN--GV------------LDPDEVKETRF---GRF--G--LENSSKFSGDSAEIDKLAKLVYDAYAKKNAD 460 (461) T ss_pred HHHHHHHHHhc--CC------------CCHHHHHHHHH---Hhc--C--CCCCccCCCCCchhhhhhhhccccccccCCC Confidence 99888765332 13 44444432211 110 0 011111 0000000000000001111111 Q ss_pred c Q lcl|NC_021072. 522 A 522 (533) Q Consensus 522 ~ 522 (533) + T Consensus 461 g 461 (461) T protein:vir:80 461 G 461 (461) T ss_pred C Confidence 1 No 35 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=98.76 E-value=6.1e-08 Score=60.13 Aligned_cols=479 Identities=11% Similarity=0.085 Sum_probs=234.8 Q ss_pred CCccccceeeeccccccccCCCCCCCC-Ccccc--eeecccccccccchhhhhh-HHHHHHHHHHhh-hhcchhhhHHHH Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDS-MDGSQ--PIVGGGYYGYSVDFDGTVR-NEYELITRYREM-VLQPECDSAVDD 75 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~-~dg~~--~~~~~~~~~~~~~~~~~~~-~~~~LI~~YR~m-~~~pEvd~Avde 75 (533) |..+ |+.--. ..+...+...... ..|+. .-...++.+...+-+..+. +...|..+-|.| .++|-+..||+- T Consensus 1 ~~~p--~~~~~~--~~~~~~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~ 76 (533) T protein:vir:34 1 MKTP--TIPTLL--GPDGMTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQL 76 (533) T ss_pred CCCc--hhhhhh--cccccchHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHH Confidence 2222 111000 0000000000000 11110 0012233333333344333 678899999999 599999999999 Q ss_pred hhcceeeecCCCceEEEEe--ccC----CCcHHHHHHHHHHHHHHHH----------HhcchhhhhHHHHhhhhcCceee Q lcl|NC_021072. 76 IVNETICGNFDDVPVEVEL--SNL----KQSDKIKKLIREEFAEILR----------LLDFENRSYEIFRRWYVDGRLFY 139 (533) Q Consensus 76 Ivneaiv~d~~~~~v~v~l--~~~----~~S~~ik~~I~eeF~~i~~----------lL~f~~~~~~~fR~WYvDGri~~ 139 (533) +++-+|= .+-.++-.. .-+ +..+.+.++|..+|..-++ .++|-.--...+|.|.+||-.|. T Consensus 77 ~~~nvVG---~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~ 153 (533) T protein:vir:34 77 HQDHIVG---SFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFV 153 (533) T ss_pred HHHHhhC---CCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEE Confidence 9988772 133333221 112 2256788889999986553 34444445678999999999999 Q ss_pred eeeecCCCCCCC--eEEEEEcChhhceehhhccCCCcCceeEEeccc-eeeccchhceecc--ccccccccC--Ccceec Q lcl|NC_021072. 140 HKVIDPKNPRGG--LTELRYIDPRKIRKVTEYQQKRPEQLRGEDINT-QLTQKAAEYYLYN--PKGLKNSTN--QGMKIA 212 (533) Q Consensus 140 hkvid~~~~~~g--I~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~-~~~~~~~e~~~y~--p~~~~~~~~--~~~kI~ 212 (533) .+..++. .++ -..|+.|+|..|.--.. .+++..+..+.- .-.+.-..|+++. |.+.....+ ....++ T Consensus 154 ~~~~~~~--~g~~~~~~lq~ie~d~l~~~~~----~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~ 227 (533) T protein:vir:34 154 QATWDTS--SSRLFRTQFRMVSPKRISNPNN----TGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELP 227 (533) T ss_pred EeeeccC--CCCccceEEEEechhhcCCCCC----CCCCCceEeeeEECCCCCeEEEEEeecCCCCccccccceeeeeec Confidence 9988752 222 35789999999952111 122211211110 1235667888874 332221111 123444 Q ss_pred cchhhcccccccc-CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhccc Q lcl|NC_021072. 213 TDSVTYCHSGIQD-LNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRN 291 (533) Q Consensus 213 ~dai~y~hsGl~d-~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rn 291 (533) ..+-...|---.. +.-.-++|.|..+++.+.+|.-.+||...-...-|-.=-++.=+.+......+ + T Consensus 228 v~a~~VlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~---~--------- 295 (533) T protein:vir:34 228 GGRASFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDF---I--------- 295 (533) T ss_pred cChhHeeeeccccCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCccccccc---c--------- Confidence 4554555544433 33456799999999999999999999999888888664433333322111000 0 Q ss_pred EEEeeCCCCccc-cccccchhHhhhcccc---cCCC------CccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccc Q lcl|NC_021072. 292 KLVYDANTGEIK-DDKKFMSMLEDFWLPR---REGG------RGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSR 360 (533) Q Consensus 292 k~vYd~~TGev~-~d~~~msmlEDywLpR---Regg------rgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sR 360 (533) .....+.-. .-....+...++.-.+ =++| -|.+|+.+..+..-+. -+-++...+.+=.+|+||-+- T Consensus 296 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~ 372 (533) T protein:vir:34 296 ---LGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQ 372 (533) T ss_pred ---cCCCcccccccccccchhhhhccCcceeeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHH Confidence 000000000 0000111111111000 0111 1233333333322223 233444555666789999998 Q ss_pred cCCC-CcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHH----HHHhccCCCHh-----HHhhhhh-c--eeEEE Q lcl|NC_021072. 361 LETE-TTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKT----QLILKGVMSLE-----EWDEMKE-H--IQFDF 427 (533) Q Consensus 361 l~~~-~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~----qLilkgi~t~e-----ew~~~~~-~--i~~~f 427 (533) |..+ ++.|. |.+.-.-+.|.+++.++|..|..=|..++-. ..+|.|.++.- +|...+. . ..|.. T Consensus 373 lt~D~s~~nY---SS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~ 449 (533) T protein:vir:34 373 LSRNYAQMSY---STARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIG 449 (533) T ss_pred HhhhcccccH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeecc Confidence 8766 45554 3334455669999999999888655544333 45678877531 2222222 2 23333 Q ss_pred eccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHH-HHHHhhhcCCCCCCCcccccCCCC Q lcl|NC_021072. 428 IADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDK-QIDSEREAGLIVDPMAEMDPAMDP 506 (533) Q Consensus 428 ~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~k-qi~~E~~~~~~~~p~~~~~~~~~~ 506 (533) -.--+.--+||+.-...+++ .-.-|.+-+..+ .+..-+|.-++.+ ..+...+.++ +.|...- ..... T Consensus 450 p~~~~iDP~Ke~~a~~~~i~---------~G~~s~~~~~a~-~G~D~~ev~~q~a~e~~~~~~~gl-~~~~~~~-~~~~s 517 (533) T protein:vir:34 450 SGRMAIDGLKEVQEAVMLIE---------AGLSTYEKECAK-RGDDYQEIFAQQVRETMERRAAGL-KPPAWAA-AAFES 517 (533) T ss_pred CCccccChHHHHHHHHHHHH---------cCCCCHHHHHHH-cCCCHHHHHHHHHHHHHHHHhcCC-CCCCCCC-cCccC Confidence 44444556676665554443 234566666665 4655544443322 2222333343 3222110 11111 Q ss_pred CCCCCCCCcccccccc Q lcl|NC_021072. 507 GNAPPADDMSAQEGPA 522 (533) Q Consensus 507 ~~~~~~~d~~~~~~~~ 522 (533) +..+...+.+.+.+.+ T Consensus 518 ~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:34 518 GLRQSTEEEKSDSRAA 533 (533) T ss_pred CCCCCCCCCcccCCCC Confidence 1111111211111111 No 36 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=98.73 E-value=3.8e-08 Score=61.27 Aligned_cols=407 Identities=16% Similarity=0.144 Sum_probs=184.3 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceee-------cccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHH Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIV-------GGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAV 73 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~-------~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~Av 73 (533) |+.+|= +-++-.-. +....-..||-.+.. ...|.++.+ +...+-.+|...|| .+.....+| T Consensus 1 ~~~~~~-~~~~~~~~-----~~~~~~~rd~l~~~~~glg~~r~~~~~~~g~---~~~~~~~~l~~~Yr---~~~ia~~iV 68 (449) T protein:vir:10 1 MTDKLT-LAVNHALN-----DARMARARMGLMVPTMGLDNKRHSAWCEYGF---PELVTYENLYSLYR---RGGIAHGAV 68 (449) T ss_pred CchhhH-HHHhhhcc-----hhHHHHHHHHHHHHHhcCCcccchhhhhcCC---cccCCHHHHHHHHh---cCchhHHHH Confidence 887742 21111000 000000001111111 112333211 23345678888998 566778889 Q ss_pred HHhhcceeeecCCCceEEEE---eccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHH--hhhhcCceeeeeeec---- Q lcl|NC_021072. 74 DDIVNETICGNFDDVPVEVE---LSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFR--RWYVDGRLFYHKVID---- 144 (533) Q Consensus 74 deIvneaiv~d~~~~~v~v~---l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR--~WYvDGri~~hkvid---- 144 (533) +-++++|.. +-|..+. .+.......++.++.+-|.. .+-++..+..| |.|-.+-++++.- | T Consensus 69 d~~~d~~~~----~~~~i~~g~~~~~~~~~~~~e~~~~~l~~~-----~~~~~l~ea~~~~rl~Gga~i~i~v~-d~~~l 138 (449) T protein:vir:10 69 EKLVGKCWQ----TNPEIIEGDDADDSEDETSWEKKSKQVFTN-----RLWRSFAEADRRRLVGRYAGILLHIR-DEKDW 138 (449) T ss_pred Hhhhhhhhh----cCcccccCccccchhhhHHHHHHHHHHHHH-----HHHHHHHHHHHhhhccCcEEEEEEec-CCCCC Confidence 988888742 1122222 11222222344433332221 12233344443 3344444555421 2 Q ss_pred --CCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhh-cccc Q lcl|NC_021072. 145 --PKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVT-YCHS 221 (533) Q Consensus 145 --~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~-y~hs 221 (533) |.++++||+.+..++...|.+- ++.. +| ..+.|+.-..|. ++.. ..+++....+|+.+-+. ++.- T Consensus 139 ~~Pl~~~~~i~~i~v~~~~~i~~~-~~~~-dp--------~sp~yg~P~~y~-v~~~-~~g~~~~~~~iH~SRl~~~~~~ 206 (449) T protein:vir:10 139 NLPATKGRGLQKVSVSWAGSLKVA-EWDT-GI--------NSKTYGQPKLWK-YTER-LPNGSSRRVDIHPDRVFILGDY 206 (449) T ss_pred CcccccCcceeeEEeeccccCChh-hhhc-CC--------CCCCCCCceEEE-Eeee-ccCCCccceeeccceeEeecCC Confidence 2234567777777775555321 2211 11 112222222111 1100 00111223344443321 1110 Q ss_pred ccccCCCCccchhHHHHHHHHHHHHHHHHHHH------HHHHhcC----ccceEEEccCCCCchHHH------HHHHHHH Q lcl|NC_021072. 222 GIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLV------IYRLSRA----PERRIFYIDVGNLPKNKA------EQYLREV 285 (533) Q Consensus 222 Gl~d~~~~~i~syL~~AiK~~NqLrm~EDalV------IyRi~RA----PeRrvfyIDvGnlpk~KA------eqYl~~i 285 (533) .--..|+|+++ ||.|.-+|-+.. .-...|. -++ .+|+.+|...++ .+-+++. T Consensus 207 ------~~~g~~~L~~~---yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~---~~~~~~l~~~~~~~~e~~~~~~~~~ 274 (449) T protein:vir:10 207 ------SEDAIGFLEPA---YNAFVSLEKVEGGSGESFLKNAARQLNVNFEK---EIDFTNLASLYGVSIDELQDKFNEV 274 (449) T ss_pred ------CCCChhHHHHH---HHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhh---hhhhhhhhHHhhCCchHHHHHHHHH Confidence 01135788876 455544443321 1111111 112 245555543322 1112222 Q ss_pred ---HHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCcccc Q lcl|NC_021072. 286 ---MGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELED-VKYFQKKLYKALNVPSSRL 361 (533) Q Consensus 286 ---m~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl 361 (533) |++-.+-+..|. -+|| +..+| +++-++| +..|...+=-+.+||+.|| T Consensus 275 ~~~~~~~~~~~~i~~--------------~~d~----------~~~~~-----~~sgl~d~l~~~~q~iaaa~~IP~t~L 325 (449) T protein:vir:10 275 AGEINRGNDVLMTTQ--------------GATV----------TPLVT-----SVADPTATYNVNLQTAAAGVDIPTRIL 325 (449) T ss_pred HHHHhccchheeecC--------------Ccce----------EEEec-----ccCChhHHHHHHHHHHHHHhCCCeeee Confidence 222222111110 0122 22333 3444555 5678888999999999999 Q ss_pred CC--CCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHH Q lcl|NC_021072. 362 ET--ETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEI 439 (533) Q Consensus 362 ~~--~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~ 439 (533) =. .+|+|- ++ |.-.|+..|...|.++......++.. |+.-++..+. +.+.|.|..=..=+|...+ T Consensus 326 ~Gqsp~glns--t~----D~~nyyd~i~~~Q~~l~p~le~l~~~-l~~s~~g~~~------~d~~i~f~pL~~~t~kEkA 392 (449) T protein:vir:10 326 IGNQQAERSS--TE----DQKYFNARCQSRRVDLSFEIEDFCDK-LIELKIIDAV------AKKAVIWDDLNEQTGTEKL 392 (449) T ss_pred eccCcccccc--ch----hHHHHHHHHHHHHHhhhHHHHHHHHH-HHHhhcCCCC------CceeEEeCCCCCCCHHHHH Confidence 33 368873 22 44459999999999988888887764 6777766553 4589999998889999999 Q ss_pred HHHHHHHHHHHHhhhhccc-cccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCcccc Q lcl|NC_021072. 440 EIRNERMNQVNTMDPYVGK-YFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQ 518 (533) Q Consensus 440 Ei~~~R~~~~~~~~~~vGk-y~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~ 518 (533) ||.....++++.+-..++. .||. +|+.+.. .. .++-.+|. +.+|...+ T Consensus 393 ei~k~~A~a~~~~~~ag~~~~~~~------------~EiR~~~---~~---~~~~~~~~-------------~~e~~de~ 441 (449) T protein:vir:10 393 TNAKTMGEINQTMLGSGDNPAFSR------------EEIRTAA---GY---DNDDEEPL-------------GEEDGDEE 441 (449) T ss_pred HHHHHHHHHHHHHHHccccCCcCH------------HHHHHHh---cc---cCCCCCCC-------------CCCCCccc Confidence 9999999999887766432 3444 3333222 00 11011111 11110000 Q ss_pred ccccCCccc Q lcl|NC_021072. 519 EGPAVDAGD 527 (533) Q Consensus 519 ~~~~~~~~~ 527 (533) .++.+++. T Consensus 442 -~~~~d~~a 449 (449) T protein:vir:10 442 -DKATDSAA 449 (449) T ss_pred -cccCCcCC Confidence 01111111 No 37 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=98.69 E-value=1e-07 Score=58.97 Aligned_cols=437 Identities=12% Similarity=0.111 Sum_probs=193.1 Q ss_pred cccccccccchhhhhh---HHHHHHHHHHhhhhc-chhhhHHH-HhhcceeeecCCCce----------EEEEeccCCCc Q lcl|NC_021072. 36 GGGYYGYSVDFDGTVR---NEYELITRYREMVLQ-PECDSAVD-DIVNETICGNFDDVP----------VEVELSNLKQS 100 (533) Q Consensus 36 ~~~~~~~~~~~~~~~~---~~~~LI~~YR~m~~~-pEvd~Avd-eIvneaiv~d~~~~~----------v~v~l~~~~~S 100 (533) +++.--.-........ .+..+.+.|-+..+. +..-.++. +..|--+|.+.-... ..+.+ .+ . T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~--~~-d 77 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI--SE-D 77 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhhccCceec--CC-C Confidence 2211111111111111 122333333111100 00000000 000000111111111 11111 11 1 Q ss_pred HHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeec--CCCCCCCeEEEEEcChhhceehhhcc--CCCcCc Q lcl|NC_021072. 101 DKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVID--PKNPRGGLTELRYIDPRKIRKVTEYQ--QKRPEQ 176 (533) Q Consensus 101 ~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid--~~~~~~gI~elr~lDP~~i~~vr~~~--~~~~~~ 176 (533) +.. .+.+..|++.=+|+....++++.-.+.|+-|++.-.. ......|...+..+||+.+-.+..-. ++..-. T Consensus 78 ~~~----~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~~p~~~~~~~D~~~~~~~~~~ 153 (480) T protein:vir:78 78 SEG----LEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) T ss_pred chh----HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEEcccceEEEEcCCCccceEEE Confidence 122 2334556655578899999999999999988763221 11235788889999999986654321 111111 Q ss_pred eeEEeccceeeccchhceecccccccc-c----------------cCCcceeccchhhccccccccCCCCccchhHHHHH Q lcl|NC_021072. 177 LRGEDINTQLTQKAAEYYLYNPKGLKN-S----------------TNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAI 239 (533) Q Consensus 177 ~~~~~~~~~~~~~~~e~~~y~p~~~~~-~----------------~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~Ai 239 (533) .+++.-.+. .+......+|.|..... . +|..-+||.-. |+.-- +.....+.|=|.+.+ T Consensus 154 i~~~~~~~~-~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~--f~n~~--~~~~~~G~s~i~~~v 228 (480) T protein:vir:78 154 VRLYTTRDD-VAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVP--LTNDP--RLGNRYGRSEISPEL 228 (480) T ss_pred EEEEEeecC-CCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEE--eeccc--ccCCccCcccchhhH Confidence 222211111 11111112333322110 0 01111111111 11100 111123344444433 Q ss_pred HHH-HH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHh-hhc Q lcl|NC_021072. 240 KAV-NQ-LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLE-DFW 316 (533) Q Consensus 240 K~~-Nq-Lrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlE-Dyw 316 (533) +++ .. =+++-+.+++-...-.|.|-|.=.+....+..+. + .++-.+.. ..| T Consensus 229 ~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~---------------------~-----~~~~~~~~~~~~ 282 (480) T protein:vir:78 229 RKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE---------------------N-----TTLDIYYGRILT 282 (480) T ss_pred HHHHHHHHHHHHHHHHHHHhhcchhhhhhcCCccccccccc---------------------c-----chhhhhhhhhcc Confidence 322 11 2355566777777777777664222222111100 0 00111111 123 Q ss_pred ccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHH Q lcl|NC_021072. 317 LPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSE 395 (533) Q Consensus 317 LpRReggrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~ 395 (533) ++ |..+++.++++.. ++- ++-++-....++..-++|..=|+..+. |-..|..|.--+.....-+.++|+.|.. T Consensus 283 ~~----~~~~~~~~~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~-n~~Sg~Alk~~~~~l~~ka~~~~~~f~~ 356 (480) T protein:vir:78 283 LA----SEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSE-NPASAEAIIATDSRIVKMAERKGRIFGG 356 (480) T ss_pred CC----CCCceEEecCccC-HHHHHHHHHHHHHHHhcccCCChHHhccccC-cchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 32 3346777888653 333 344666667777778888877765332 2223344555566677778999999999 Q ss_pred HHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHH Q lcl|NC_021072. 396 LFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQ 475 (533) Q Consensus 396 if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDe 475 (533) -+.+.++.-+.+.|.-...+|.. |.+.|..-..=+. .+.++.+.++-.-++-.+|.+++... |+++++ T Consensus 357 ~l~~~~~l~~~~~g~~~~~~~~~----i~v~f~~~~~~s~-------~~~ad~~~kl~~~g~~~~s~et~~~~-lg~~~d 424 (480) T protein:vir:78 357 AWERAMRIAMQIMGREVTEEYTR----LETVWRDPSTPTV-------AAKADAVSKLYANGQGPIPKEQARID-LGYTAT 424 (480) T ss_pred HHHHHHHHHHHHcCCCcccccee----eeEEecCCCCCCH-------HHHHHHHHHHHHhccccCCHHHHHhc-CCCCHh Confidence 99999998887788544455543 6677753222122 24555556655544567899998865 899999 Q ss_pred HHHHHHHHHHHhhhcC--CCCCCCcccccCCCCCCCCCCCCccccccc-cCCccccc Q lcl|NC_021072. 476 EIKEIDKQIDSEREAG--LIVDPMAEMDPAMDPGNAPPADDMSAQEGP-AVDAGDAK 529 (533) Q Consensus 476 eI~e~~kqi~~E~~~~--~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~-~~~~~~~~ 529 (533) +++++++..+++..+. ...++..+..++.++.+.+ ++....+.+| +..-+.++ T Consensus 425 ~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 425 QREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT-ETKTETQTSPSGFNRTKTR 480 (480) T ss_pred HHHHHHHHHHHHHHHHHHHhhccccccCCCCCCCCCC-CCCCccccccCCCCcccCC Confidence 9998875444433221 1222211111111111110 0101111111 12222233 No 38 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=98.68 E-value=1.1e-07 Score=58.68 Aligned_cols=471 Identities=12% Similarity=0.071 Sum_probs=236.1 Q ss_pred CCc-cccceeeeccccccccCCCCCCCCCcccce--eecccccccccchhhhhh-HHHHHHHHHHhh-hhcchhhhHHHH Q lcl|NC_021072. 1 MSN-QLFGFSLERAKKVPKGPSFVQKDSMDGSQP--IVGGGYYGYSVDFDGTVR-NEYELITRYREM-VLQPECDSAVDD 75 (533) Q Consensus 1 ~~~-~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~--~~~~~~~~~~~~~~~~~~-~~~~LI~~YR~m-~~~pEvd~Avde 75 (533) |-- -++|-. .+........-..|+.. -...+|.+...+-+..+. +...|..+-|.+ .++|-+..||+- T Consensus 1 ~~~~~~~~~~-------~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~ 73 (530) T protein:vir:38 1 MKIPSLVGPD-------GKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQL 73 (530) T ss_pred CccceeecCc-------cccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHH Confidence 110 011100 00000000001111111 112334443333344433 688899999999 589999999999 Q ss_pred hhcceeeecCCCceEEEE--eccC----CCcHHHHHHHHHHHHHHHH----------HhcchhhhhHHHHhhhhcCceee Q lcl|NC_021072. 76 IVNETICGNFDDVPVEVE--LSNL----KQSDKIKKLIREEFAEILR----------LLDFENRSYEIFRRWYVDGRLFY 139 (533) Q Consensus 76 Ivneaiv~d~~~~~v~v~--l~~~----~~S~~ik~~I~eeF~~i~~----------lL~f~~~~~~~fR~WYvDGri~~ 139 (533) +++-+|= .+-.++-. ..-+ +..+.+.++|..+|..-++ .++|..--.-.+|.|.+||..|. T Consensus 74 ~~~nvVG---~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~ 150 (530) T protein:vir:38 74 HQDHIVG---SFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCV 150 (530) T ss_pred HHHHhhC---CCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEE Confidence 9988773 13223222 1112 2356788889999986553 34555555668899999999999 Q ss_pred eeeecCCCCCCC---eEEEEEcChhhceehhhccCCCcCceeEEeccc-eeeccchhceecc--ccccccccCC--ccee Q lcl|NC_021072. 140 HKVIDPKNPRGG---LTELRYIDPRKIRKVTEYQQKRPEQLRGEDINT-QLTQKAAEYYLYN--PKGLKNSTNQ--GMKI 211 (533) Q Consensus 140 hkvid~~~~~~g---I~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~-~~~~~~~e~~~y~--p~~~~~~~~~--~~kI 211 (533) -+..++ ..| -..|+.|+|..|.--.. .+++..+..+.- .-.+.-..|+++. |.+....... ...+ T Consensus 151 ~~~~~~---~~g~~~~~~lq~ie~d~l~~~~~----~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~ 223 (530) T protein:vir:38 151 QATWDS---DSTRLFRTQFKMVSPKRVSNPNN----IGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTYIPREL 223 (530) T ss_pred Eeeecc---CCCCccceEEEEechhhcCCCCC----CCCCCeeEeeeEECCCCceEEEEEeeccCCCccccccceeeeee Confidence 988875 233 26899999999852111 122222221111 1235567888874 3322211111 1233 Q ss_pred ccchhhcccccccc-CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHH------------ Q lcl|NC_021072. 212 ATDSVTYCHSGIQD-LNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKA------------ 278 (533) Q Consensus 212 ~~dai~y~hsGl~d-~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KA------------ 278 (533) ++.+....|--... +.---++|.|..+++.+++|.-.+||...-...-|-.=-++.=+.+......+ T Consensus 224 ~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~ 303 (530) T protein:vir:38 224 PGGRPSFIHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSK 303 (530) T ss_pred ccChhHeEeeccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCccccccc Confidence 44444556644433 34566789999999999999999999998888777553333222222110000 Q ss_pred -HHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCC Q lcl|NC_021072. 279 -EQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNV 356 (533) Q Consensus 279 -eqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~V 356 (533) ..+... +..+.+.-+..-..|.|. .-.-|-+|+.+..+.--+. -+=++...+.+=.+|+| T Consensus 304 ~~~~~~~-~~~~~~~~~~~l~pG~i~-----------------~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi 365 (530) T protein:vir:38 304 LTGWLGE-MAAYYSAAPVRLGGARVP-----------------HLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGV 365 (530) T ss_pred ccccchh-hhhcccccceeccCceee-----------------ecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCC Confidence 000000 111111111122222111 0012445555554432233 23445555666678999 Q ss_pred CccccCCC-CcccccchhhhhHHhhhHHHHHHHHHHHHHHHH-----HHHHHHHHHhccCCCH------hHHhhhhhcee Q lcl|NC_021072. 357 PSSRLETE-TTFNIGRAAEITRDEVKFQKFIARLRKRFSELF-----MDLLKTQLILKGVMSL------EEWDEMKEHIQ 424 (533) Q Consensus 357 P~sRl~~~-~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if-----~d~Lk~qLilkgi~t~------eew~~~~~~i~ 424 (533) |-+-|..+ ++.|+ |.+.-.-+.|-+.+.++|..|..=| ..-|+ ..++.|.+.. +.|........ T Consensus 366 ~ye~lt~D~s~~nY---SS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~-~av~~G~i~~p~~~~~~~~~~~~a~~~ 441 (530) T protein:vir:38 366 SYEQLSRNYSQMSY---STARASANESWAYFMGRRKFVASRQACQMFLCWLE-EAIVRRVVTLPSKARFSFQEARTAWGN 441 (530) T ss_pred CHHHHhcccccccH---HHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHH-HHHHcCCccCCCCCCCCchhhHHhhhc Confidence 99988665 45555 2233444569999999999887644 33444 4578887763 22332222222 Q ss_pred EEE--eccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHH-HHHHHHhhhcCCCCCCCcccc Q lcl|NC_021072. 425 FDF--IADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEI-DKQIDSEREAGLIVDPMAEMD 501 (533) Q Consensus 425 ~~f--~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~-~kqi~~E~~~~~~~~p~~~~~ 501 (533) ..| -.--+.-.+||+.-...+++. -.-|.+-+..+ .+..-+|+-++ ....+...+.|+ +.|..- . T Consensus 442 ~~w~~p~~~~iDP~Ke~~a~~~~i~~---------G~~s~~~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl-~~~~~~-~ 509 (530) T protein:vir:38 442 ANWIGSGRMAIDGLKEVQEAVMLIEA---------GLSTYEKECAK-RGDDYQEIFAQQVRESMERRAAGL-NPPAWA-A 509 (530) T ss_pred eeeecCCccccChHHHHHHHHHHHHc---------CCCCHHHHHHH-cCCCHHHHHHHHHHHHHHHHHcCC-CCCCCc-c Confidence 333 333344566776655555442 34566666665 45554444322 222222333443 322110 0 Q ss_pred cCCCCCCCCCCCCcccccccc Q lcl|NC_021072. 502 PAMDPGNAPPADDMSAQEGPA 522 (533) Q Consensus 502 ~~~~~~~~~~~~d~~~~~~~~ 522 (533) .....+..++.++....+.++ T Consensus 510 ~~~~~~~~~~~~~~~d~~~~a 530 (530) T protein:vir:38 510 AAFEAGVKKSNEEEQDGARAA 530 (530) T ss_pred cccCCCCCCCCCCCCCCCCCC Confidence 111111111111111111111 No 39 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=98.61 E-value=1.9e-07 Score=57.38 Aligned_cols=414 Identities=11% Similarity=0.131 Sum_probs=187.9 Q ss_pred CC--ccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhc Q lcl|NC_021072. 1 MS--NQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVN 78 (533) Q Consensus 1 ~~--~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvn 78 (533) |+ ..+|||.- + +.+...+....+... . .+.|... .++. ++ -+..+++|-|.+||+-|.+ T Consensus 1 M~~~~~~f~~~~-r-----~~~~~~~~~~~~~~~-~---~~~g~~~---~~~~-----v~-~~~al~~~~v~~~i~~ia~ 61 (429) T protein:vir:10 1 MDSVKKFFNFEK-R-----QTSQVIELNKDDEKL-L---EWLGISP---STIS-----VK-GKNALKVATVFACIKILSE 61 (429) T ss_pred Cchhhhhhcccc-c-----CcccccccCCChHHH-H---HHhcCCC---Ccce-----ec-hhhhhccHHHHHHHHHHHH Confidence 54 46788742 1 111111111111111 1 1111111 0110 00 0123568999999999988 Q ss_pred ceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcch----hhhhH----HHHhhhhcCceeeeeeecCCCCCC Q lcl|NC_021072. 79 ETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFE----NRSYE----IFRRWYVDGRLFYHKVIDPKNPRG 150 (533) Q Consensus 79 eaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~----~~~~~----~fR~WYvDGri~~hkvid~~~~~~ 150 (533) .+-. .|+.+--..-+..+.. .-..++++|+-. ..+.+ ++..+.+.|.-|+.++-|. .+ T Consensus 62 ~ia~-----l~~~~~~~~~~~~~~~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~---~G 127 (429) T protein:vir:10 62 SVSK-----LPLKIYQEDEYGIQRG------TKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDR---KG 127 (429) T ss_pred hhcc-----CceEEEEecCCceeec------cccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECC---CC Confidence 8664 4555432221111111 012344444322 23433 4445677899999988764 45 Q ss_pred CeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCc Q lcl|NC_021072. 151 GLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNM 230 (533) Q Consensus 151 gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~ 230 (533) .+++|.+|+|..+..++.-...... ...-+|.++..+. ...++.+-+.+...+. ..++-. T Consensus 128 ~~~~L~~i~~~~v~v~~~~~~~~~~-------------~~~~~~~~~~~g~------~~~~~~~evih~~~~~-~~~~~~ 187 (429) T protein:vir:10 128 KVQALWPIDASKVTVYIDDVGLLNS-------------KTKMWYVVNTGGQ------QRVLKPEEILHFKNGI-TLDGLV 187 (429) T ss_pred cEEEEEEEcCceeEEEEcCcccccc-------------cceEEEEEccCCe------EEEEccccEEEecCCC-CCCCcc Confidence 5999999999999754432211111 1112233333322 1223333222222111 122334 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccch Q lcl|NC_021072. 231 TLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMS 310 (533) Q Consensus 231 i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~ms 310 (533) ++|.|..|.+++.....++....=+----+.-+-+..++ +.|.+.++++..+.+...|..- ...|. .+ T Consensus 188 G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~-~~l~~e~~~~~~~~~~~~~~g~----~n~~~------~~- 255 (429) T protein:vir:10 188 GVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSGL----QNSHR------IA- 255 (429) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhccc----cccCc------ee- Confidence 579999999999998888887766655545556777776 5677777766655554444220 01121 11 Q ss_pred hHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HHH Q lcl|NC_021072. 311 MLEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IAR 388 (533) Q Consensus 311 mlEDywLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kf-i~r 388 (533) .++ .|.+++.|.-. ..+.-++-.++..+.+.++++||.+.|+...+-+....++..+. |.++ |.- T Consensus 256 vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~~~~---f~~~~l~P 322 (429) T protein:vir:10 256 LMP----------VGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQ---FYTDTLQA 322 (429) T ss_pred ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHHHHH Confidence 121 24556555311 11222333457788999999999999965433333333343333 4432 222 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHH Q lcl|NC_021072. 389 LRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQ 468 (533) Q Consensus 389 Lr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~ 468 (533) +-..+.. .| -+.++++.+|. ....+.|..+ .+...+ +..|++.++.+-.- -++|.+-++. T Consensus 323 ~~~~ie~----~l-----n~kl~~~~~~~---~g~~~~fd~~----~ll~~d-~~~~~~~~~~~~~~--G~~T~NE~R~- 382 (429) T protein:vir:10 323 TLTMYEQ----EM-----TYKLFLDSELD---KGFYSKFNVD----AILRAD-IKTRYEAYRTGIQG--GFLKPNEARS- 382 (429) T ss_pred HHHHHHH----HH-----HHhhcChhhcC---CCcEEEeech----hhhcCC-HHHHHHHHHHHHhC--CCcCHHHHHH- Confidence 2222222 22 22334555554 2234555532 232221 23456666555433 4677777764 Q ss_pred HhCCCHHHHHHHHHHHHHhhhcCCCCCCCcc---cccCCCCCCCCCCCCccccc Q lcl|NC_021072. 469 VLKQTDQEIKEIDKQIDSEREAGLIVDPMAE---MDPAMDPGNAPPADDMSAQE 519 (533) Q Consensus 469 IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~---~~~~~~~~~~~~~~d~~~~~ 519 (533) .+++.+. +.-++.+-. ... -|-+. .....++.+....++++..+ T Consensus 383 ~~gl~p~--~ggD~~~~~---~n~--~~~d~~~~~~~k~g~~~~~~~~~~~e~~ 429 (429) T protein:vir:10 383 KEDLPPE--AGGDRLLVN---GNM--LPIDMAGQAYLKGGDTNGEVSKEGNEGN 429 (429) T ss_pred HhCCCCC--CCcCeeeec---ccc--cchhhccccccCCCCCCCCCCCCCCCCC Confidence 3565431 111100000 000 00000 00001111111222222221 No 40 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=98.57 E-value=2.7e-07 Score=56.59 Aligned_cols=429 Identities=12% Similarity=0.133 Sum_probs=186.0 Q ss_pred ccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHH-HHHHhhhhcchhhhHHHHhhcceeeecCCCceEEE Q lcl|NC_021072. 14 KKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELI-TRYREMVLQPECDSAVDDIVNETICGNFDDVPVEV 92 (533) Q Consensus 14 ~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI-~~YR~m~~~pEvd~AvdeIvneaiv~d~~~~~v~v 92 (533) -++.-+.+++.|-.-+=...+...-+++.. ....+.+...+. .. .+++|-|.+||+-|.+.+-- .|+.+ T Consensus 1 ~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~--~g~~~~~~~~~~~~~---~~~~~~V~acV~~IA~~iA~-----lp~~l 70 (518) T protein:vir:78 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPA--VGMQLERQFSLYGGI---YKNQPWVRTVIAKRAQALAR-----LPVKC 70 (518) T ss_pred CcccCceeeccchhhhhhhhhhhcccccce--eceecccccchhhHH---hhhhHHHHHHHHHHHHhhcc-----CceEE Confidence 334444444444322222222222111111 111111111111 11 25789999999999987542 45555 Q ss_pred EeccCCCcHHHHHHHHHHHHHHHHHh----cchhhhhHHHHhhh----hcCceeeeeeecCCCCCCCeEEEEEcChhhce Q lcl|NC_021072. 93 ELSNLKQSDKIKKLIREEFAEILRLL----DFENRSYEIFRRWY----VDGRLFYHKVIDPKNPRGGLTELRYIDPRKIR 164 (533) Q Consensus 93 ~l~~~~~S~~ik~~I~eeF~~i~~lL----~f~~~~~~~fR~WY----vDGri~~hkvid~~~~~~gI~elr~lDP~~i~ 164 (533) --.. +. . ..++.+..+.+| |-...+.++.+.|+ +.|.-|+.++-|. .+.+++|.+|+|..++ T Consensus 71 ~~~~-~~-~-----~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~---~G~~~~L~~l~p~~Vt 140 (518) T protein:vir:78 71 MFTS-GD-T-----ETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK---SGTPEKLMPMHPSRVA 140 (518) T ss_pred EEEc-CC-c-----cccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC---CCcEEEEEEECCCceE Confidence 3221 11 1 112233333333 33346666666654 6699999977653 5569999999999886 Q ss_pred ehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCC-CccchhHHHHHHHHH Q lcl|NC_021072. 165 KVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNK-NMTLSHLHKAIKAVN 243 (533) Q Consensus 165 ~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~-~~i~syL~~AiK~~N 243 (533) ..+.. ...... |.|..... .....+.++.+-|.+.. ....++ ..++|-|..|.+++. T Consensus 141 v~~~~---~~~~~~---------------y~~~~~~~--~~~~~~~~~~~eIiHir--~~~~dg~~~G~Spi~~~~~~i~ 198 (518) T protein:vir:78 141 IKRNS---RTGRYE---------------YYFQAGAG--VGTQLVSFADDEVVPIR--FFNPDGLERGLSLMESLKSTIF 198 (518) T ss_pred EEEcC---CCCEEE---------------EEEEecCC--ccceeEEecCCcEEEec--CCCCCcccccccHHHHHHHHHH Confidence 43221 111111 11111100 00111223333222221 001111 134688999999888 Q ss_pred HHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCC Q lcl|NC_021072. 244 QLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGG 323 (533) Q Consensus 244 qLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRRegg 323 (533) ....+++...=+----+.-+-|...+ |.|.+..+++.-+.+...|+-- ...|.+ + .|+ T Consensus 199 ~~~aa~~~~~~~f~Ng~~p~gvl~~~-~~ls~e~~~~~k~~~~~~~~G~----~nag~~------~-vL~---------- 256 (518) T protein:vir:78 199 SEDSSRNATAAMWKNAGRPNLVLRHE-KRLSPEAQQRLREQFDRAHAGS----SNTGKT------M-VVE---------- 256 (518) T ss_pred HHHHHHHHHHHHHhcCCCccEEEecC-CCCCHHHHHHHHHHHHHHhcCc----ccCCce------e-EcC---------- Confidence 88888877544433345556677776 6676666655444444444310 011221 1 121 Q ss_pred CccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HHHHHHHHHHHHHH Q lcl|NC_021072. 324 RGTEISTLPGGQNLGE---LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IARLRKRFSELFMD 399 (533) Q Consensus 324 rgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kf-i~rLr~~fs~if~d 399 (533) .|.+++.|. .+.-+ ++-.+|....+.++++||...|+..++-+..+..+..+. |.++ |.-+-.++...|.. T Consensus 257 ~G~~~~~l~--~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~~~~---f~~~tL~P~~~~ie~eln~ 331 (518) T protein:vir:78 257 EGMEPIPLQ--LTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRA---FYRDTMAIPIARIQSAMDK 331 (518) T ss_pred CCceEEecc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHH---HHHHHHHHHHHHHHHHHHH Confidence 134455443 23333 344457789999999999999965443333334443333 6554 44455555554444 Q ss_pred HHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHH Q lcl|NC_021072. 400 LLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKE 479 (533) Q Consensus 400 ~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e 479 (533) .|-++ +.. .. .+.|. ..++.... +..|.+.+..+-.- -++|.+-++. .+++..-+=.. T Consensus 332 ~L~~~-----------~~~-~~--~~~fd----~~~Llr~D-~~~r~~~~~~~~~~--G~lT~NE~R~-~~gl~pie~~~ 389 (518) T protein:vir:78 332 YVGQY-----------WVR-KN--RMKFD----IDDVIQPD-WEAKSESTQKMVNS--GVATPNEGRE-IMGLPRSDDPK 389 (518) T ss_pred hhccc-----------ccC-cc--eEEee----chhhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH-HhCCCCCCCCC Confidence 43222 111 12 34444 22332222 24566666666433 4778888774 46775432000 Q ss_pred HHHH--------HH-------HhhhcCCCCCCCcc------cccCCCCCCCCCCCCc----cccccccCCccccchhcC Q lcl|NC_021072. 480 IDKQ--------ID-------SEREAGLIVDPMAE------MDPAMDPGNAPPADDM----SAQEGPAVDAGDAKRGEF 533 (533) Q Consensus 480 ~~kq--------i~-------~E~~~~~~~~p~~~------~~~~~~~~~~~~~~d~----~~~~~~~~~~~~~~~~~~ 533 (533) -++. +. +....+.-++|.+. +.+..+.++..++.++ .-++.+.-...-...+|| T Consensus 390 gD~~~v~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 468 (518) T protein:vir:78 390 ADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPASVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKES 468 (518) T ss_pred CceeeecccceecccccccccCCCCCCCCCCCCcccccccccCccccCCCCCcccccccccccccchhcccCCCCcccc Confidence 0000 00 00000000111100 0000000111111000 000000000011111222 No 41 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=98.52 E-value=3.7e-07 Score=55.83 Aligned_cols=456 Identities=13% Similarity=0.146 Sum_probs=185.9 Q ss_pred CCc-----cccc--------------eeeecccc----ccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHH Q lcl|NC_021072. 1 MSN-----QLFG--------------FSLERAKK----VPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELI 57 (533) Q Consensus 1 ~~~-----~~fg--------------~~i~~~~~----~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI 57 (533) |-. +-|+ +++.-.+. ..|+..- .+... .+..+..-.|.+.|. .-+..++..+|- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~-~~~~~-~~~~~~~~~~~~g~~-~~~~~~~~~~l~ 77 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNN-KEVAY-SQPVIGSMSANPGFK-TKPSIRNNQDLH 77 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcc-cchhh-hchhhheeecccccc-cCCccCChhHHH Confidence 100 0000 00000000 0111000 00000 111111111111111 112333333333 Q ss_pred HHHHhhhhcchhhhHHHHhhcceeee------cCCCceEEEEeccC--CCcHHHHHHHHHHHHHHHHHhcchh-----hh Q lcl|NC_021072. 58 TRYREMVLQPECDSAVDDIVNETICG------NFDDVPVEVELSNL--KQSDKIKKLIREEFAEILRLLDFEN-----RS 124 (533) Q Consensus 58 ~~YR~m~~~pEvd~AvdeIvneaiv~------d~~~~~v~v~l~~~--~~S~~ik~~I~eeF~~i~~lL~f~~-----~~ 124 (533) +.=+..+.+|.|..||+.|+|.+.++ ..++....+.+.+. +..+.-+.++. +.+.+++..+... .. T Consensus 78 ~l~~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~-~l~~~l~~pn~~~~p~~~s~ 156 (547) T protein:vir:63 78 GVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIK-RIESFIEKTGVDNDINRDSF 156 (547) T ss_pred HHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHH-HHHHHHHhhCCCCCCccchH Confidence 33334467899999999998864332 23344455554331 22222222221 2333444444432 34 Q ss_pred hHHHHhh----hhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccc Q lcl|NC_021072. 125 YEIFRRW----YVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKG 200 (533) Q Consensus 125 ~~~fR~W----YvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~ 200 (533) .++++.| ++-|.-|+.++-|. .+-+++|.+|||.+|+.+.......+.....+.. +...+ T Consensus 157 ~~f~~~lv~d~ll~Gn~~~~i~rd~---~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~-------------~~~~~ 220 (547) T protein:vir:63 157 SSFVKKIVRDTYMYDQVNFEKVFNR---NQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQ-------------VIDQK 220 (547) T ss_pred HHHHHHHHHHHHhhCCEEEEEEECC---CCcEEEEEEecCceeEEEECCccccccCceEEEE-------------EcCCc Confidence 4555555 46699999888764 4459999999999997754433322211111110 00100 Q ss_pred --cccccCCcceeccchhhccccccccC-CCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCC-CCchH Q lcl|NC_021072. 201 --LKNSTNQGMKIATDSVTYCHSGIQDL-NKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVG-NLPKN 276 (533) Q Consensus 201 --~~~~~~~~~kI~~dai~y~hsGl~d~-~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvG-nlpk~ 276 (533) ..+.+...++|.... +-+. .+.+++|-|..|++++.....++....=+=---|--+-|..+... +|.+. T Consensus 221 ~~~~~~~~eiih~r~n~-------~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e 293 (547) T protein:vir:63 221 IVATFNAREMAFAVRNP-------RSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQH 293 (547) T ss_pred EEEEeccccEEEecccC-------CCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCCHH Confidence 011122223332211 1111 134466889999999998888877665444444555666666644 34444 Q ss_pred HHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHH---HHHHHHHHHHh Q lcl|NC_021072. 277 KAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELED---VKYFQKKLYKA 353 (533) Q Consensus 277 KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~D---V~YF~~kLy~a 353 (533) .+++.-+.+...|. | ..+..+. .++.+ .|.++..|- .+..++.- .+|..+.+-++ T Consensus 294 ~~~~lk~~~~~~~~---------G-~~nagk~-~vl~~---------~g~~~~~l~--~~~~d~qfle~~~~~~~~Ia~a 351 (547) T protein:vir:63 294 ALEIFKREWKNSLS---------G-INGSWQI-PVVSA---------EDVKFVNMT--PSARDMEFEKWLNYLINVISAL 351 (547) T ss_pred HHHHHHHHHHHHhc---------C-ccccccc-ccccC---------CCceEEEcC--CChhHHHHHHHHHHHHHHHHHH Confidence 33333333333332 1 1111111 12211 134455553 33444444 34566889999 Q ss_pred cCCCccccCCCC--c--------ccccchhhhhHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhc Q lcl|NC_021072. 354 LNVPSSRLETET--T--------FNIGRAAEITRDEVKFQK-FIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEH 422 (533) Q Consensus 354 L~VP~sRl~~~~--~--------~~~g~~~eItRDElkF~K-fi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~ 422 (533) .+||...|+..+ . ++....++..+ .|.. -+.-+..++...|...|-.. .... T Consensus 352 fgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~---~~~~~tL~P~~~~ie~~ln~~L~~~--------------~~~~ 414 (547) T protein:vir:63 352 YGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQ---ASKNKGLQPLLGFIEDFINKHIVAE--------------FGDK 414 (547) T ss_pred hCCCHHHcCcccccccccccccccchhhHHHHHH---HHHHHHHHHHHHHHHHHHHhhcccc--------------cCCc Confidence 999999997432 1 11222222222 2433 35555555555554433211 1134 Q ss_pred eeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHH-H----------HHHH-----HHHHHH Q lcl|NC_021072. 423 IQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQ-E----------IKEI-----DKQIDS 486 (533) Q Consensus 423 i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDe-e----------I~e~-----~kqi~~ 486 (533) +.|.|.....-.+ .+|..+...+. .-++|..-++.. +++... + +... .++.+. T Consensus 415 ~~~~f~~~~~~~~-------~~~~~~~~~~~---~g~lT~NE~R~~-~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~ 483 (547) T protein:vir:63 415 YTFQFVGGDIKSE-------LESVKILAEKA---KVAMTVNEVRKE-LNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEH 483 (547) T ss_pred eEEEeeccccccH-------HHHHHHHHHHh---CCCcCHHHHHHH-hCCCCCCCCCceeecccccccccccccccCCcc Confidence 6777765443222 23333322221 235788888854 777541 1 0000 001111 Q ss_pred hhhcCCCCCC----C------cccccCCCCCCCCCCCCccccccccCCcc-ccchhcC Q lcl|NC_021072. 487 EREAGLIVDP----M------AEMDPAMDPGNAPPADDMSAQEGPAVDAG-DAKRGEF 533 (533) Q Consensus 487 E~~~~~~~~p----~------~~~~~~~~~~~~~~~~d~~~~~~~~~~~~-~~~~~~~ 533 (533) |........| . .++.|..........+|+...+....+++ +.-.+.+ T Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 541 (547) T protein:vir:63 484 EKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQRKDKDNANAGKQGMKGDK 541 (547) T ss_pred ccchhhccccccccCCCCCCCCCCCCCCcccCCCcCccccccCccccchhhhhcCCCC Confidence 1111111100 0 00000000000111122222222222222 1222222 No 42 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=98.51 E-value=3.7e-07 Score=55.85 Aligned_cols=439 Identities=15% Similarity=0.162 Sum_probs=193.4 Q ss_pred CCccccceeeeccccccccCCCCCCC-CCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhc---------chh- Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKD-SMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQ---------PEC- 69 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~-~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~---------pEv- 69 (533) |+.+| |-. +.+....+. . ... .-..-+.+.+.++.|-+-.+. |+. T Consensus 1 ~~~~~------------------~~~~~~~~~~~~~-~-l~~----~~~~~~~rl~~l~~Yy~G~~~i~~~~~~~~~~~~ 56 (484) T protein:vir:77 1 MTSPL------------------QKQENVDPEKARE-E-MLN----LFTERTQDLGDNTAYYESERRPDAVGVTVPQQMQ 56 (484) T ss_pred CCCcc------------------cccCCCCHHHHHH-H-HHH----HHHHHHHHHHHHHHHHhccccchhcccccchhHH Confidence 22111 100 011000000 0 000 000001122222333221100 111 Q ss_pred ---------hhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeee Q lcl|NC_021072. 70 ---------DSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYH 140 (533) Q Consensus 70 ---------d~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~h 140 (533) .-+|+-.+.-.+ .+ .+. . +.++. ..+++..|..--+|+....+.++.-++.|+-|++ T Consensus 57 ~~~~~~n~~~~ivd~~~~~l~---~~--g~~--~---~~~~~----~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~ 122 (484) T protein:vir:77 57 KLLAHVGYPRLYIDAIAARQE---LE--GFR--L---GGADK----ADEQLWDWWQANDLDIESTLGHTDSLVHGRSYIT 122 (484) T ss_pred hhhhhcCcHHHHHHHHHhhhc---cC--cee--c---CCcch----hHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEE Confidence 111111111110 01 111 1 11121 2345667777778999999999999999998888 Q ss_pred eeecCCCC----CCCeEEEEEcChhhceehhhccCCCcCc----eeEEeccceeeccchhceeccccccc---------- Q lcl|NC_021072. 141 KVIDPKNP----RGGLTELRYIDPRKIRKVTEYQQKRPEQ----LRGEDINTQLTQKAAEYYLYNPKGLK---------- 202 (533) Q Consensus 141 kvid~~~~----~~gI~elr~lDP~~i~~vr~~~~~~~~~----~~~~~~~~~~~~~~~e~~~y~p~~~~---------- 202 (533) .-.+.... ..+...++.++|+.+-.+.. ..... ++++.. +..+......+|.|.... T Consensus 123 v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~~D---~~~~~~~~a~~~~~~--~~~~~~~~~~~y~~~~~~~~~~~~~~~~ 197 (484) T protein:vir:77 123 ISKPDPNIDPGVDPEVPIIRVEPPTNLYAQID---PRTRQVMRAIRAIED--EEGNEVIGATLYLPNNTVIWNREDGQWV 197 (484) T ss_pred EecCCCCcccccccccceEEEeccceeEEEec---CCCCceEEEEEEEEe--ecCCcEEEEEEEecCeEEEEEecCCceE Confidence 55543221 23345688889988854432 11111 111111 111111111223222111 Q ss_pred ---cccCCcceeccchhhccccccccCCCCccchhHHHHHHHH-HHH-HHHHHHHHHHHHhcCccceEEEccCCCCchHH Q lcl|NC_021072. 203 ---NSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAV-NQL-RMIEDSLVIYRLSRAPERRIFYIDVGNLPKNK 277 (533) Q Consensus 203 ---~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~-NqL-rm~EDalVIyRi~RAPeRrvfyIDvGnlpk~K 277 (533) ..+|..-+||. +.|++.- +.....+.|=+.+.++++ ..+ +.+-+.+++-+.+-.|.|-|.-.+....+.. T Consensus 198 ~~~~~~~~~g~vPv--v~f~N~~--~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~- 272 (484) T protein:vir:77 198 QVANVAHNLEMVPV--IPIPNRT--RLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEELGVD- 272 (484) T ss_pred eeccccCCCCCcce--EEecccc--ccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhccc- Confidence 01222223332 2233211 111123345555544443 332 4555666777777677775543333332211 Q ss_pred HHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCC Q lcl|NC_021072. 278 AEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVP 357 (533) Q Consensus 278 AeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP 357 (533) ..+|...-+ ...-.+|..-. -++.+.++++.+-=+-++-++-.-.++....++| T Consensus 273 -------------------~~~~~~~~~----~~~~~~~~~~~---~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p 326 (484) T protein:vir:77 273 -------------------PETGQTLFD----AYLARILAFED---HESKAQQFSAAELRNFVDALDALDRKAAAYTGLP 326 (484) T ss_pred -------------------ccccchhhh----hhhhhhcccCC---CCceeEeecCCChHHHHHHHHHHHHHHhcccCCC Confidence 112211100 01113454321 2355777776541123344555556666777888 Q ss_pred ccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-CHhHHhhhhhceeEEEeccchHHHH Q lcl|NC_021072. 358 SSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVM-SLEEWDEMKEHIQFDFIADNYFTEL 436 (533) Q Consensus 358 ~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~-t~eew~~~~~~i~~~f~~Dn~f~E~ 436 (533) .+-|+..+. |...+..|.--+..+-.-+.+.|+.|..-+.+.++.-+.+.|.. ...+| ..|.+.|..-..-+. T Consensus 327 ~~~fg~~~~-n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~----~~i~v~w~~~~~~s~- 400 (484) T protein:vir:77 327 PYYLSFSSE-NPASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKVMNGGDIPPEY----YRMESIWRDPSTPTY- 400 (484) T ss_pred HHHhccccC-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccc----ccceEEecCCCCCCH- Confidence 888864332 33344456666666777789999999999999998777666542 12222 347788854332222 Q ss_pred HHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC-CCCCCcccccCCCCCCCCCCCCc Q lcl|NC_021072. 437 KEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGL-IVDPMAEMDPAMDPGNAPPADDM 515 (533) Q Consensus 437 ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~-~~~p~~~~~~~~~~~~~~~~~d~ 515 (533) .+.++.+.++..-.-..+|.++++.. |++++.+++++++..++|...+. ..++.....+ +.++.+ +. T Consensus 401 ------~~~ad~~~kl~~~g~gi~s~et~~~~-l~~~~~~~~e~~~~~~ee~~~~~~~~~~~~~~~~--~~~~~~---~~ 468 (484) T protein:vir:77 401 ------AAKADAATKLYNNGQGVIPKERARID-MGYSITEREEMRKWDEEEQAQGLGLMGTMFGTDP--SGGGNP---DN 468 (484) T ss_pred ------HHHHHHHHHHHhccCCCCCHHHHHhc-CCCChhHHHHHHHHHHHHHHHHHHHHhhhccccc--cCCCCC---CC Confidence 34556666665443357888998854 99999999987665555433221 0111111000 000100 11 Q ss_pred cccccccCCccccchh Q lcl|NC_021072. 516 SAQEGPAVDAGDAKRG 531 (533) Q Consensus 516 ~~~~~~~~~~~~~~~~ 531 (533) ..++.++.++++...+ T Consensus 469 ~~~~~~~~~~~~~~~~ 484 (484) T protein:vir:77 469 PETPEPQPNPAEEAAA 484 (484) T ss_pred CCcccccCCCccccCC Confidence 1111111222222222 No 43 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=98.49 E-value=4.6e-07 Score=55.34 Aligned_cols=464 Identities=13% Similarity=0.152 Sum_probs=188.9 Q ss_pred Ccccc-----c-------------------eeeeccccccccCCCCCCCCCccccee-----e--cccccccccchhhhh Q lcl|NC_021072. 2 SNQLF-----G-------------------FSLERAKKVPKGPSFVQKDSMDGSQPI-----V--GGGYYGYSVDFDGTV 50 (533) Q Consensus 2 ~~~~f-----g-------------------~~i~~~~~~~~~~s~~~~~~~dg~~~~-----~--~~~~~~~~~~~~~~~ 50 (533) -+-|| | ++|+.+++.+..-.... +..+|.... + ..+..|++.. .+-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~a~~~~~~~~~~~~~~~~~~-~~~~ 78 (563) T protein:vir:99 1 MADLFKQFRLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLT-KSLYGQQQAYAEPFIEMMDTNPEFRDK-RSYM 78 (563) T ss_pred ChhhhhhhhcccccccccccceeeccCChhhhHhhhhccchhHHHHH-hhhccCCCcchhhhHhhhccccccccc-ccCC Confidence 11111 1 11111111100000000 000000000 0 1111111111 1223 Q ss_pred hHHHHHHHHHHhhhhcchhhhHHHHhhccee--eec----CCCceEEEEeccCCC--cHHHHHHHHHHHHHHHHHhcchh Q lcl|NC_021072. 51 RNEYELITRYREMVLQPECDSAVDDIVNETI--CGN----FDDVPVEVELSNLKQ--SDKIKKLIREEFAEILRLLDFEN 122 (533) Q Consensus 51 ~~~~~LI~~YR~m~~~pEvd~AvdeIvneai--v~d----~~~~~v~v~l~~~~~--S~~ik~~I~eeF~~i~~lL~f~~ 122 (533) .+...|-..-|.++..|.|..+|+.+.+.+- ||. .++..+.|.|.+... ++.-...+. .....+..+..+. T Consensus 79 ~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~-~l~~~l~~~~~~~ 157 (563) T protein:vir:99 79 KNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMK-RIEDFIVNTGKDK 157 (563) T ss_pred CCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhH-HHHHHhhhcCCCC Confidence 3444444445566678999999999887632 332 344334555533332 221111111 1222222222221 Q ss_pred -----hhhHHHHh----hhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhc Q lcl|NC_021072. 123 -----RSYEIFRR----WYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEY 193 (533) Q Consensus 123 -----~~~~~fR~----WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~ 193 (533) ...++++. .++.|.-|+.+++. .+..+-+++|.+|||.+++.++...+.... .... T Consensus 158 ~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~-rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~--------------~~~~ 222 (563) T protein:vir:99 158 DVDRDSFQTFCKKIVRDTYIYDQVNFEKVFN-KNNKTKLEKFIAVDPSTIFYATDKKGKIIK--------------GGKR 222 (563) T ss_pred CCCcchHHHHHHHHHHHHHhcCCeEEEEEEE-ecCCCceEEEEEeCCceeEEEECCCCceec--------------ccee Confidence 34454443 56778999988886 345566999999999999765443222100 0011 Q ss_pred eeccccccccccCCcceeccchhhcccccc-cc-CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCC Q lcl|NC_021072. 194 YLYNPKGLKNSTNQGMKIATDSVTYCHSGI-QD-LNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVG 271 (533) Q Consensus 194 ~~y~p~~~~~~~~~~~kI~~dai~y~hsGl-~d-~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvG 271 (533) |+|-..+. ....++.+.+.|..-+. .| ..+..++|-|+.|++++.....+|+...=+=---+--+-|.-+..+ T Consensus 223 y~~~~~g~-----~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~ 297 (563) T protein:vir:99 223 FVQVVDKR-----VVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSD 297 (563) T ss_pred EEEEeCCc-----eeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCC Confidence 11111111 11122222222211111 11 1245577999999999999888888766554455667777778776 Q ss_pred C-CchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHH Q lcl|NC_021072. 272 N-LPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKK 349 (533) Q Consensus 272 n-lpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLge-i~DV~YF~~k 349 (533) . |.+..+++.-+.+-..|+.- ...|+ ..-.+ ..|.+++.|.-...-.+ ++-.+|..+. T Consensus 298 ~~ls~e~~~~~~~~~~~~~~G~----~nagk------~~~vl----------~~G~~~~~l~~~~~d~qfle~~~~~~~~ 357 (563) T protein:vir:99 298 QQQSQHALENFKREWKSSLSGI----NGSWQ------IPVVM----------ADDIKFVNMTPTANDMQFEKWLNYLINI 357 (563) T ss_pred CCCCHHHHHHHHHHHHHHhccc----ccccc------ceEEc----------CCCceEEeccCChhHHHHHHHHHHHHHH Confidence 4 55544444444433334320 00111 10011 12455555543322222 4555678899 Q ss_pred HHHhcCCCccccCCCC--cccc-cchhhhhHH---h--hhHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhh Q lcl|NC_021072. 350 LYKALNVPSSRLETET--TFNI-GRAAEITRD---E--VKFQKF-IARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMK 420 (533) Q Consensus 350 Ly~aL~VP~sRl~~~~--~~~~-g~~~eItRD---E--lkF~Kf-i~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~ 420 (533) +.++.+||...|+-.. ++.- ..++.+++. + +-|... +.-+..++...|.. .|+ +.. . T Consensus 358 Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~----~L~-----~~~-----~ 423 (563) T protein:vir:99 358 ISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNR----HII-----SEY-----G 423 (563) T ss_pred HHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHh----hhc-----hhc-----c Confidence 9999999999996432 2211 112223322 1 124332 34444444443333 222 211 1 Q ss_pred hceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHH---HH------------HHHHHHH Q lcl|NC_021072. 421 EHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQE---IK------------EIDKQID 485 (533) Q Consensus 421 ~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDee---I~------------e~~kqi~ 485 (533) ..+.++|.+...= .|.+.+....-...-++|..-++. .+++..-+ +- +..++.+ T Consensus 424 ~~~~~~f~r~D~~----------~~~e~~~~~~~~~~G~lT~NE~R~-~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~ 492 (563) T protein:vir:99 424 DKYTFQFVGGDTK----------SATDKLNILKLETQIFKTVNEARE-EQGKKPIEGGDIILDASFLQGTAQLQQDKQYN 492 (563) T ss_pred cccEEEeccCCHH----------HHHHHHHHHHHhcCCccCHHHHHH-HhCCCCCCCcceeecccccccccccccccCCC Confidence 3567788765432 233333222222234677777774 46775432 00 0000000 Q ss_pred HhhhcCCCCC-----CCcccccCCCCCC------------CCCCCCccccccccCCccccc----hhcC Q lcl|NC_021072. 486 SEREAGLIVD-----PMAEMDPAMDPGN------------APPADDMSAQEGPAVDAGDAK----RGEF 533 (533) Q Consensus 486 ~E~~~~~~~~-----p~~~~~~~~~~~~------------~~~~~d~~~~~~~~~~~~~~~----~~~~ 533 (533) .+..+..... +....+|..++.+ .++.++.++.....-..++.. -.+| T Consensus 493 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 561 (563) T protein:vir:99 493 DGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSNKGQGRKGEKSSDF 561 (563) T ss_pred ccccchhhhhcccccCCCCCCCCCCCCCCCCCCccccccccccccccccccccCccccccccCcCcccc Confidence 0000000000 0000000000000 001000000000000001100 1111 No 44 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=98.49 E-value=4.6e-07 Score=55.34 Aligned_cols=464 Identities=13% Similarity=0.152 Sum_probs=188.9 Q ss_pred Ccccc-----c-------------------eeeeccccccccCCCCCCCCCccccee-----e--cccccccccchhhhh Q lcl|NC_021072. 2 SNQLF-----G-------------------FSLERAKKVPKGPSFVQKDSMDGSQPI-----V--GGGYYGYSVDFDGTV 50 (533) Q Consensus 2 ~~~~f-----g-------------------~~i~~~~~~~~~~s~~~~~~~dg~~~~-----~--~~~~~~~~~~~~~~~ 50 (533) -+-|| | ++|+.+++.+..-.... +..+|.... + ..+..|++.. .+-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~a~~~~~~~~~~~~~~~~~~-~~~~ 78 (563) T protein:vir:95 1 MADLFKQFRLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLT-KSLYGQQQAYAEPFIEMMDTNPEFRDK-RSYM 78 (563) T ss_pred ChhhhhhhhcccccccccccceeeccCChhhhHhhhhccchhHHHHH-hhhccCCCcchhhhHhhhccccccccc-ccCC Confidence 11111 1 11111111100000000 000000000 0 1111111111 1223 Q ss_pred hHHHHHHHHHHhhhhcchhhhHHHHhhccee--eec----CCCceEEEEeccCCC--cHHHHHHHHHHHHHHHHHhcchh Q lcl|NC_021072. 51 RNEYELITRYREMVLQPECDSAVDDIVNETI--CGN----FDDVPVEVELSNLKQ--SDKIKKLIREEFAEILRLLDFEN 122 (533) Q Consensus 51 ~~~~~LI~~YR~m~~~pEvd~AvdeIvneai--v~d----~~~~~v~v~l~~~~~--S~~ik~~I~eeF~~i~~lL~f~~ 122 (533) .+...|-..-|.++..|.|..+|+.+.+.+- ||. .++..+.|.|.+... ++.-...+. .....+..+..+. T Consensus 79 ~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~-~l~~~l~~~~~~~ 157 (563) T protein:vir:95 79 KNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMK-RIEDFIVNTGKDK 157 (563) T ss_pred CCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhH-HHHHHhhhcCCCC Confidence 3444444445566678999999999887632 332 344334555533332 221111111 1222222222221 Q ss_pred -----hhhHHHHh----hhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhc Q lcl|NC_021072. 123 -----RSYEIFRR----WYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEY 193 (533) Q Consensus 123 -----~~~~~fR~----WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~ 193 (533) ...++++. .++.|.-|+.+++. .+..+-+++|.+|||.+++.++...+.... .... T Consensus 158 ~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~-rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~--------------~~~~ 222 (563) T protein:vir:95 158 DVDRDSFQTFCKKIVRDTYIYDQVNFEKVFN-KNNKTKLEKFIAVDPSTIFYATDKKGKIIK--------------GGKR 222 (563) T ss_pred CCCcchHHHHHHHHHHHHHhcCCeEEEEEEE-ecCCCceEEEEEeCCceeEEEECCCCceec--------------ccee Confidence 34454443 56778999988886 345566999999999999765443222100 0011 Q ss_pred eeccccccccccCCcceeccchhhcccccc-cc-CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCC Q lcl|NC_021072. 194 YLYNPKGLKNSTNQGMKIATDSVTYCHSGI-QD-LNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVG 271 (533) Q Consensus 194 ~~y~p~~~~~~~~~~~kI~~dai~y~hsGl-~d-~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvG 271 (533) |+|-..+. ....++.+.+.|..-+. .| ..+..++|-|+.|++++.....+|+...=+=---+--+-|.-+..+ T Consensus 223 y~~~~~g~-----~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~ 297 (563) T protein:vir:95 223 FVQVVDKR-----VVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSD 297 (563) T ss_pred EEEEeCCc-----eeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCC Confidence 11111111 11122222222211111 11 1245577999999999999888888766554455667777778776 Q ss_pred C-CchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHH Q lcl|NC_021072. 272 N-LPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKK 349 (533) Q Consensus 272 n-lpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLge-i~DV~YF~~k 349 (533) . |.+..+++.-+.+-..|+.- ...|+ ..-.+ ..|.+++.|.-...-.+ ++-.+|..+. T Consensus 298 ~~ls~e~~~~~~~~~~~~~~G~----~nagk------~~~vl----------~~G~~~~~l~~~~~d~qfle~~~~~~~~ 357 (563) T protein:vir:95 298 QQQSQHALENFKREWKSSLSGI----NGSWQ------IPVVM----------ADDIKFVNMTPTANDMQFEKWLNYLINI 357 (563) T ss_pred CCCCHHHHHHHHHHHHHHhccc----ccccc------ceEEc----------CCCceEEeccCChhHHHHHHHHHHHHHH Confidence 4 55544444444433334320 00111 10011 12455555543322222 4555678899 Q ss_pred HHHhcCCCccccCCCC--cccc-cchhhhhHH---h--hhHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhh Q lcl|NC_021072. 350 LYKALNVPSSRLETET--TFNI-GRAAEITRD---E--VKFQKF-IARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMK 420 (533) Q Consensus 350 Ly~aL~VP~sRl~~~~--~~~~-g~~~eItRD---E--lkF~Kf-i~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~ 420 (533) +.++.+||...|+-.. ++.- ..++.+++. + +-|... +.-+..++...|.. .|+ +.. . T Consensus 358 Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~----~L~-----~~~-----~ 423 (563) T protein:vir:95 358 ISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNR----HII-----SEY-----G 423 (563) T ss_pred HHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHh----hhc-----hhc-----c Confidence 9999999999996432 2211 112223322 1 124332 34444444443333 222 211 1 Q ss_pred hceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHH---HH------------HHHHHHH Q lcl|NC_021072. 421 EHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQE---IK------------EIDKQID 485 (533) Q Consensus 421 ~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDee---I~------------e~~kqi~ 485 (533) ..+.++|.+...= .|.+.+....-...-++|..-++. .+++..-+ +- +..++.+ T Consensus 424 ~~~~~~f~r~D~~----------~~~e~~~~~~~~~~G~lT~NE~R~-~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~ 492 (563) T protein:vir:95 424 DKYTFQFVGGDTK----------SATDKLNILKLETQIFKTVNEARE-EQGKKPIEGGDIILDASFLQGTAQLQQDKQYN 492 (563) T ss_pred cccEEEeccCCHH----------HHHHHHHHHHHhcCCccCHHHHHH-HhCCCCCCCcceeecccccccccccccccCCC Confidence 3567788765432 233333222222234677777774 46775432 00 0000000 Q ss_pred HhhhcCCCCC-----CCcccccCCCCCC------------CCCCCCccccccccCCccccc----hhcC Q lcl|NC_021072. 486 SEREAGLIVD-----PMAEMDPAMDPGN------------APPADDMSAQEGPAVDAGDAK----RGEF 533 (533) Q Consensus 486 ~E~~~~~~~~-----p~~~~~~~~~~~~------------~~~~~d~~~~~~~~~~~~~~~----~~~~ 533 (533) .+..+..... +....+|..++.+ .++.++.++.....-..++.. -.+| T Consensus 493 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 561 (563) T protein:vir:95 493 DGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSNKGQGRKGEKSSDF 561 (563) T ss_pred ccccchhhhhcccccCCCCCCCCCCCCCCCCCCccccccccccccccccccccCccccccccCcCcccc Confidence 0000000000 0000000000000 001000000000000001100 1111 No 45 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=98.49 E-value=4.7e-07 Score=55.26 Aligned_cols=457 Identities=12% Similarity=0.102 Sum_probs=185.3 Q ss_pred CCccccce-eee--ccccccccCCCCCC-CCCcccceee-ccccc--ccccchhhhhhHHHHHHHHHHhhhhcchhhhHH Q lcl|NC_021072. 1 MSNQLFGF-SLE--RAKKVPKGPSFVQK-DSMDGSQPIV-GGGYY--GYSVDFDGTVRNEYELITRYREMVLQPECDSAV 73 (533) Q Consensus 1 ~~~~~fg~-~i~--~~~~~~~~~s~~~~-~~~dg~~~~~-~~~~~--~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~Av 73 (533) |-..+=++ +.. +...+.|..--.+. ........+. +.+|. +++... ..+...+ -+.++.+|-|..+| T Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~~~p~~~~~---~~~~~~~---l~~~~~npiv~~~I 100 (576) T protein:vir:96 27 IDDGLQANIRNIEEKSKELNKSLYGKQQAYAEPFLEVMDTNPEFRTKRSYMKN---SDNLHDV---LKQFGNNPILNAII 100 (576) T ss_pred cccChhHHHHHhhhhhhhhccccCCccchhhcceeeeeecCCCccccCcchhh---hhhhHHH---HHHhhcCHHHHHHH Confidence 11111110 111 11111111100000 0000000000 00111 111111 1122222 34455689999999 Q ss_pred HHhhcceee------ecCCCceEEEEeccCCC--cHHHHHHHHHHHHHHHHHhcch----hhhhHHHHh----hhhcCce Q lcl|NC_021072. 74 DDIVNETIC------GNFDDVPVEVELSNLKQ--SDKIKKLIREEFAEILRLLDFE----NRSYEIFRR----WYVDGRL 137 (533) Q Consensus 74 deIvneaiv------~d~~~~~v~v~l~~~~~--S~~ik~~I~eeF~~i~~lL~f~----~~~~~~fR~----WYvDGri 137 (533) +-|.+.+-+ .+.++..+.|.+..... ++.-..++...-..+.+++... ....++++. +++-|.- T Consensus 101 ~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~dlll~Gna 180 (576) T protein:vir:96 101 LTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIVRDTYTYDQV 180 (576) T ss_pred HHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHHHHHHhcCCe Confidence 999876432 13444555555533322 2222222222112233333221 134555554 5678999 Q ss_pred eeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhh Q lcl|NC_021072. 138 FYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVT 217 (533) Q Consensus 138 ~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~ 217 (533) |+.++.+. +..+-+++|.+|||.+++.++...+. .+... .-++.+... .....++.+.+. T Consensus 181 ~~~i~~~r-d~~g~~~~L~pl~p~~V~v~~~~dg~-----~~~~~--------~~~~~~~~~------~~~~~~~~~dii 240 (576) T protein:vir:96 181 NFEKVFNK-KNATTMDKFIAVDPSTIFYATDKNGK-----IIKGG--------KRFVQVINK------KVVASFTSREMA 240 (576) T ss_pred EEEEEEec-CCCCceEEEEEeCCceeEEEECCCCc-----eeeee--------eEEEEecCC------ceEEEecccceE Confidence 99998874 33455999999999999775433211 11000 000000000 011223333322 Q ss_pred cccccc-cc-CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCC-CCchHHHHHHHHHHHHhcccEEE Q lcl|NC_021072. 218 YCHSGI-QD-LNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVG-NLPKNKAEQYLREVMGRYRNKLV 294 (533) Q Consensus 218 y~hsGl-~d-~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvG-nlpk~KAeqYl~~im~~~rnk~v 294 (533) +...+. .| ..++.++|-|+.|.+++.....++....=+=---|--+-|..+..+ .+.+..+++-.+.+-..|+-- T Consensus 241 ~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~-- 318 (576) T protein:vir:96 241 MGIRNPRTELSSSGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGI-- 318 (576) T ss_pred EEeecCCCCcccCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccc-- Confidence 211111 11 1244567899999999988887777665443344566777777764 455555544444443334311 Q ss_pred eeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCC-cchHHHHHHHHHHHHHhcCCCccccCCCC-c------ Q lcl|NC_021072. 295 YDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQN-LGELEDVKYFQKKLYKALNVPSSRLETET-T------ 366 (533) Q Consensus 295 Yd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~n-Lgei~DV~YF~~kLy~aL~VP~sRl~~~~-~------ 366 (533) ...|. ..-+++ .|.+++.|.-... +.-++-.+|..+.+.++++||...|+... + T Consensus 319 --~nag~------~p~vl~----------~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~ 380 (576) T protein:vir:96 319 --NGSWQ------VPVVMA----------DDIKFVNMTPTANDMQFEKWLTYLINIISALYGIDPAEIGFPNRGGATGGK 380 (576) T ss_pred --ccccc------ceeecC----------CCceEEeccCChhhHHHHHHHHHhHHHHHHHhCCCHHHccccccccccccc Confidence 00111 111111 1456666643222 22244556788999999999999996432 1 Q ss_pred ----ccccchhhhhHHhhhHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHH Q lcl|NC_021072. 367 ----FNIGRAAEITRDEVKFQKF-IARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEI 441 (533) Q Consensus 367 ----~~~g~~~eItRDElkF~Kf-i~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei 441 (533) ++.....+..+. |.++ +.-+..++...|.. .|+ +. ....+.++|.+...=+ T Consensus 381 ~~~s~t~sn~e~~~~~---f~~~tL~P~~~~ie~~ln~----~Ll-----~~-----~~~~~~~~f~r~d~~~------- 436 (576) T protein:vir:96 381 GGNTLNEADPGKKQQQ---SQNKGLQPLLRFIEDLINT----HII-----SE-----YSDKYVFQFVGGDTKS------- 436 (576) T ss_pred cccccccccHHHHHHH---HHHHHHHHHHHHHHHHHHh----hhc-----hh-----ccCceEEEeccCCHHH------- Confidence 222233333333 5443 44444444444443 222 11 1234677777654322 Q ss_pred HHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHH-----------------HHhhhcCCCCCCCcccccCC Q lcl|NC_021072. 442 RNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQI-----------------DSEREAGLIVDPMAEMDPAM 504 (533) Q Consensus 442 ~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi-----------------~~E~~~~~~~~p~~~~~~~~ 504 (533) |.+.++.......-++|..-++. .+++..-+ --++.+ +.+..+.....|.....++- T Consensus 437 ---~~e~~~~~~~~~~G~lT~NE~R~-~~gl~pie--gGD~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~ 510 (576) T protein:vir:96 437 ---ELDKIKILQEEVKTYKTVNEARK-EKGLKPIE--GGDVLLDGSFIQSMSLNTQKEQYEDTKQKERFDMIQQFLNSPD 510 (576) T ss_pred ---HHHHHHHHHHHhcCccCHHHHHH-HhCCCCCC--CcceeccccccccccccccCCCCCCccccccccccccccCCCC Confidence 22332222222223677777764 46765421 000000 00000000000000000000 Q ss_pred CCCCCCCCCCccccccccCCcc----------------------------------ccchhcC Q lcl|NC_021072. 505 DPGNAPPADDMSAQEGPAVDAG----------------------------------DAKRGEF 533 (533) Q Consensus 505 ~~~~~~~~~d~~~~~~~~~~~~----------------------------------~~~~~~~ 533 (533) +..+.++..+...+...+-+++ -++..+| T Consensus 511 ~~~~~~~s~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 573 (576) T protein:vir:96 511 DEEPQQESTEDKVDGRESNDPTKIDSPVGTDGQLKDQDNVKSQEGSNKGQGTKGKGNEKPSDF 573 (576) T ss_pred CCCCCCCCCCCcccccccccCCCCCCccccccccCCCCcccccccccccccccccCCCCcccc Confidence 0000000000000000000100 0111111 No 46 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=98.48 E-value=5.1e-07 Score=55.10 Aligned_cols=440 Identities=13% Similarity=0.106 Sum_probs=196.9 Q ss_pred CCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhh-------------------hcchhhhHHHHhhcceee Q lcl|NC_021072. 22 FVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMV-------------------LQPECDSAVDDIVNETIC 82 (533) Q Consensus 22 ~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~-------------------~~pEvd~AvdeIvneaiv 82 (533) .++..+.|-.. .+.. ....-..-+.+...+..|=... ..+=+.-+|+..++=.++ T Consensus 1 ~~~~~~~d~~~-~i~~-----L~~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~a~~l~~ 74 (488) T protein:vir:23 1 MAETESIDPEK-LRDQ-----LLDAFENKQNELKSSKAYYDAERRPDAIGLAVPLDMRKYLAHVGYPRTYVDAIAERQEL 74 (488) T ss_pred CCcccCCCHHH-HHHH-----HHHHHHHHHHHHHHHHHHHhcccchhhcCcccchhhhhhhhhcchHHHHHHHHHHhhhc Confidence 44444444321 1111 0001111112223333331110 001111122221111100 Q ss_pred ec-CCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecC----CCCCCCeEEEEE Q lcl|NC_021072. 83 GN-FDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDP----KNPRGGLTELRY 157 (533) Q Consensus 83 ~d-~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~----~~~~~gI~elr~ 157 (533) -- .-+.++.......+ .+...+++..|+..-+|+....+.++.+++.|+-|+..-.+. -....|...++. T Consensus 75 ~Gf~~~~~~~~~~~~~~-----d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~~i~~ 149 (488) T protein:vir:23 75 EGFRIPSANGEEPESGG-----ENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVPLIRV 149 (488) T ss_pred cceeccCCccccccccc-----chhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcceEEE Confidence 00 00111111211211 222334566677777899999999999999999887633211 113445566788 Q ss_pred cChhhceehhhccCCCcCceeEEeccc-eeeccchhceeccccccc-------------cccCCcceeccchhhcccccc Q lcl|NC_021072. 158 IDPRKIRKVTEYQQKRPEQLRGEDINT-QLTQKAAEYYLYNPKGLK-------------NSTNQGMKIATDSVTYCHSGI 223 (533) Q Consensus 158 lDP~~i~~vr~~~~~~~~~~~~~~~~~-~~~~~~~e~~~y~p~~~~-------------~~~~~~~kI~~dai~y~hsGl 223 (533) ++|+.+-.+..- .......+..... .-.+......+|.|.... ..+|..-++|. +.|.+..- T Consensus 150 ~~p~~~~~~~d~--~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPv--v~f~n~~~ 225 (488) T protein:vir:23 150 EPPTALYAEVDP--RTRKVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEWEAPTSTPHGLEMVPV--IPISNRTR 225 (488) T ss_pred eccceeEEEEec--CCCceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecCCceEeccccccCCCCcce--EEeccccc Confidence 899887554321 1111111111000 000111111233332211 11233333333 23333221 Q ss_pred ccCCCCccchhHHHHHHHH-H-HHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCc Q lcl|NC_021072. 224 QDLNKNMTLSHLHKAIKAV-N-QLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGE 301 (533) Q Consensus 224 ~d~~~~~i~syL~~AiK~~-N-qLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGe 301 (533) .....+.|-|.+.++++ . -=+++-+..+.-..+.-|.|-|.=.+....+.... +.+.+..+..| T Consensus 226 --~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-----------~~~~~~~~~~~- 291 (488) T protein:vir:23 226 --LSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAE-----------TGQRMFDAYMA- 291 (488) T ss_pred --cCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCccccccccc-----------ccchhhhhhhh- Confidence 12234456666555443 2 23455566666676767777554222111111000 00011111111 Q ss_pred cccccccchhHhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhh Q lcl|NC_021072. 302 IKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLG-ELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEV 380 (533) Q Consensus 302 v~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLg-ei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDEl 380 (533) ..|.- ++|-+.++-++++.. ++ -++-++=....++...++|..-|+..+. |-..+..|.--+. T Consensus 292 ------------~v~~~--~~g~~~~~~q~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~-n~~Sg~Al~~~~~ 355 (488) T protein:vir:23 292 ------------RILAF--EGGEGAHAEQFSAAE-LRNFVDALDALDRKAASYSGLPPQYLSSSSD-NPASAEAIKAAES 355 (488) T ss_pred ------------hhccC--CCCCCceeEecCCCC-hHHHHHHHHHHHHHHhcccCCCHHHhccccC-cchHHHHHHHHHH Confidence 22322 234445677777654 43 2333444455566678888777754321 2223445666666 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-HhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_021072. 381 KFQKFIARLRKRFSELFMDLLKTQLILKGVMS-LEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKY 459 (533) Q Consensus 381 kF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t-~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky 459 (533) .+-.-+.+.++.|..-+.+.++.-+.+.|... +.+| ..|.+.|..-..=+. .+.++.+.++..-+... T Consensus 356 ~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~----~~i~v~f~~~~~~s~-------~~~ada~~kl~~~g~~~ 424 (488) T protein:vir:23 356 RLVKKVERKNKIFGGAWEQAMRLAYKMVKGGDIPTEY----YRMETVWRDPSTPTY-------AAKADAAAKLFANGAGL 424 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcchhh----ccceEEecCCCCCCH-------HHHHHHHHHHHhccccc Confidence 67777888999999999999988776655432 2333 347888864332222 34555566665443367 Q ss_pred ccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCC------CCCCcccccCCCCCCCCCCCCccccccccC Q lcl|NC_021072. 460 FSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLI------VDPMAEMDPAMDPGNAPPADDMSAQEGPAV 523 (533) Q Consensus 460 ~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~------~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~ 523 (533) +|.++++.. |++++++++++++..++|..+... ....++..++.. +..+...++.+++ T Consensus 425 ~s~et~~~~-l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~e~~~a 488 (488) T protein:vir:23 425 IPRERGWVD-MGYTIVEREQMRQWLEQDQKQGLGLIGSLYGASTPEGKPGEA-----PVGEPPAPEPDAA 488 (488) T ss_pred CCHHHHHHh-CCCCchHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccCCCC-----CCCCCCCCCCCCC Confidence 899998866 799999998876654444332111 111111111111 1111111111111 No 47 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=98.47 E-value=5.2e-07 Score=55.02 Aligned_cols=436 Identities=12% Similarity=0.067 Sum_probs=180.5 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) ||..-++--- ..+....|.+-....- ..++..-.+ -..|- +.+..+|-|.+||+-|++.+ T Consensus 6 ~~~~~~~~~~---~~~~~~~~~~~~~~~~-------~~~~~pp~~-------~~~La---~~~~~n~~v~scI~~ia~~i 65 (540) T protein:vir:41 6 LSIKSLEKYR---AIKGDTDSQALKEDRF-------EEYVEPKVH-------PLVLL---SLLQVNPYHASACSIKANDI 65 (540) T ss_pred cChhhccchh---hhhccccccccccCCC-------CccccCCCC-------HHHHH---HHHHhcHHHHHHHHHHHHHH Confidence 4444333210 0111111221111100 112211111 11122 23356788899999998886 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHh-cchhhhhHHHH----hhhhcCceeeeeeecCCCCCCCeEEE Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLL-DFENRSYEIFR----RWYVDGRLFYHKVIDPKNPRGGLTEL 155 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL-~f~~~~~~~fR----~WYvDGri~~hkvid~~~~~~gI~el 155 (533) .. .+..+....- .+. +++ +....+.++++ .+++.|.-|+.++-+. .+.+++| T Consensus 66 a~-----~~~~i~~~~~----~~~-----------~~lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~---~G~~~~L 122 (540) T protein:vir:41 66 LR-----TGYLIDGDDG----GVE-----------ELLRACRPSFEFILLQALEDLQVFNYCTLEVVRDD---QGEPVRL 122 (540) T ss_pred hc-----CCceEecCcc----chh-----------hhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECC---CCcEEEE Confidence 53 3333332221 111 112 33334444444 4677899999998864 4569999 Q ss_pred EEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCccchhH Q lcl|NC_021072. 156 RYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHL 235 (533) Q Consensus 156 r~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL 235 (533) .+|||.+++..+.-. + ++...+.........|.|..............++.+-+.+.+.. -..++-.++|-| T Consensus 123 ~~i~~~~V~v~~~~~-----~--~~~~~d~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~eViHir~~-~~~~~~~G~Spi 194 (540) T protein:vir:41 123 DYIPAHTVRVHRDGS-----R--YMQTWDGIHVTYFKDYRYEGEVNPDNGEDQDGVGANEIIFIHLP-SPICSYYGVPRY 194 (540) T ss_pred EEeCCcceEEeEcCc-----e--eEeeecCceeeeeecccccceeeccccccceeecccceEEecCC-CCCCCcccccHH Confidence 999999997543211 1 11000000000000011111111111112223333333222111 012233566999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCch----HHHHHHHHHHHHhc-ccEEEeeCCCCccccccccch Q lcl|NC_021072. 236 HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPK----NKAEQYLREVMGRY-RNKLVYDANTGEIKDDKKFMS 310 (533) Q Consensus 236 ~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk----~KAeqYl~~im~~~-rnk~vYd~~TGev~~d~~~ms 310 (533) ..|.+++.....+++...=+----|--.-|..++.+-.++ .++.+-+++.+.++ .+.. .|-..+..+.| T Consensus 195 ~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~-----~g~~~nag~~~- 268 (540) T protein:vir:41 195 LSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNF-----KYLKEAPHTPL- 268 (540) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHh-----ccccccccceE- Confidence 9999988888777776543322234445566666433332 22233333322221 1111 01111111111 Q ss_pred hHhhhcccccCCCCccceeecCCCCCcchH---HHHHHHHHHHHHhcCCCccccCC--CCcccccchhhhhHHhhhHHHH Q lcl|NC_021072. 311 MLEDFWLPRREGGRGTEISTLPGGQNLGEL---EDVKYFQKKLYKALNVPSSRLET--ETTFNIGRAAEITRDEVKFQKF 385 (533) Q Consensus 311 mlEDywLpRReggrgTEIsTLpGg~nLgei---~DV~YF~~kLy~aL~VP~sRl~~--~~~~~~g~~~eItRDElkF~Kf 385 (533) .++.=. .+..|.+++.|. .+..++ +-.++..+.+.++++||...++. .++++.....+..+. |.+. T Consensus 269 vLe~~~----~~~~g~~~~pl~--~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~---f~~~ 339 (540) T protein:vir:41 269 VFSIPG----GDTVEVTFTPLN--TSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRT---YYES 339 (540) T ss_pred EEecCC----CcccceeEEecc--cchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHH---HHHH Confidence 222100 012345555553 233333 34457788899999999999963 245665566665555 6554 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHH Q lcl|NC_021072. 386 -IARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDY 464 (533) Q Consensus 386 -i~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~ 464 (533) +.-+..++...+...|.+++ ...+.+.|..+. +...++ ..+++.+- -.-++|.+- T Consensus 340 tL~P~~~~ie~~ln~~L~~~~--------------~~~~~i~f~~~~----ll~~D~-~~~~~~lv-----~~G~lT~NE 395 (540) T protein:vir:41 340 VVRPQQEIVSSVLTDFIQLKL--------------DPGARFVFNEEI----LMESEF-VHNYALLV-----QCGVLTPSE 395 (540) T ss_pred HHHHHHHHHHHHHHHhhhhcc--------------CCceEEEecchh----hcchHH-HHHHHHHH-----hCCCCCHHH Confidence 66777777776665553321 123566676443 333332 23333221 124677777 Q ss_pred HHHHHhCCCHHHHHHHH------HHHHHhhhcCCCCCC------CcccccCCCC--CCCCCCCCccccccccCCccccch Q lcl|NC_021072. 465 MRRQVLKQTDQEIKEID------KQIDSEREAGLIVDP------MAEMDPAMDP--GNAPPADDMSAQEGPAVDAGDAKR 530 (533) Q Consensus 465 i~k~IL~~tDeeI~e~~------kqi~~E~~~~~~~~p------~~~~~~~~~~--~~~~~~~d~~~~~~~~~~~~~~~~ 530 (533) ++.+++.+..-+-.-+. ..+.....+.--++| .++.+|+..+ ...++.++.+ .+.+... T Consensus 396 ~Re~L~g~e~gdd~~l~p~n~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~ 468 (540) T protein:vir:41 396 VREKLFGLDGGPDMFMVPSSIGKSAMKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSESPLEDKK-------KKIDEVL 468 (540) T ss_pred HHHHhCcCcCCCcccccccccccccccccccccCCCCccccccccchhcccccCcccccccccccc-------ccccccc Confidence 77554444321100000 000000000000000 0111111111 0011111100 0011111 Q ss_pred hcC Q lcl|NC_021072. 531 GEF 533 (533) Q Consensus 531 ~~~ 533 (533) ++| T Consensus 469 ~~~ 471 (540) T protein:vir:41 469 SDF 471 (540) T ss_pred ccc Confidence 111 No 48 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=98.47 E-value=5.3e-07 Score=54.99 Aligned_cols=410 Identities=12% Similarity=0.136 Sum_probs=170.3 Q ss_pred HHhhhh-cchhhhHHHHhhcceeeecCCCceEEEEecc-CCCcHHHHHHHHHHHHHHHHHhc-c------------hhhh Q lcl|NC_021072. 60 YREMVL-QPECDSAVDDIVNETICGNFDDVPVEVELSN-LKQSDKIKKLIREEFAEILRLLD-F------------ENRS 124 (533) Q Consensus 60 YR~m~~-~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~-~~~S~~ik~~I~eeF~~i~~lL~-f------------~~~~ 124 (533) -|.|+. +|-|..+|+-|.+.+. +.|+.|.... .+.. .+..++++.+...|. - .... T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia-----~~p~~i~~~~~~~~~----~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~ 71 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVA-----GFGINIIPHPEAEDP----DRDGEQYERVWDFWFGDDSNWQVGPMESERATA 71 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhh-----cCCeEEEEccCcccc----cchhhhhhhHHHHhhccCCCccccchhhHhhHH Confidence 667755 6999999999998875 3566665322 1111 112233444433221 1 1122 Q ss_pred hHHHHh----hhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCC---cCceeEEeccc-e-eeccchhcee Q lcl|NC_021072. 125 YEIFRR----WYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKR---PEQLRGEDINT-Q-LTQKAAEYYL 195 (533) Q Consensus 125 ~~~fR~----WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~---~~~~~~~~~~~-~-~~~~~~e~~~ 195 (533) .++++. .++.|.-|..++-+. .+.+++|.+|||.+++..+...+.. .....++..+. . ........+. T Consensus 72 ~~~~~~~~~~l~l~Gn~~i~~~r~~---~G~~~~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 148 (467) T protein:vir:31 72 TNVLQTAWTDYEAIGWLTIEILTQT---DGTPTGLAYVPGHTIRKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNGDLDP 148 (467) T ss_pred HHHHHHHHHHHHhcCCeEEEEEECC---CCcEEEEEEeCCceeEeeeecceeEeecCCceeeEEeccccceeecccceee Confidence 333333 345699999988754 5669999999999997543321110 00011110000 0 0000000000 Q ss_pred ccccccccccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcC-ccceEEEccCCCCc Q lcl|NC_021072. 196 YNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRA-PERRIFYIDVGNLP 274 (533) Q Consensus 196 y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RA-PeRrvfyIDvGnlp 274 (533) .-...........+.++.+.|.+... .-..++-.++|-+..|.+.+..-..++....=+ ..++ --+-+..+..+.+. T Consensus 149 ~~~~~~~~~~~~~~~~~~~diih~r~-~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~-f~ng~~p~gil~~~~~~l~ 226 (467) T protein:vir:31 149 VFVDADDGSTGTSVSNPANELIFKRN-HSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDF-FENDGVPRIAIIVKGAELT 226 (467) T ss_pred eeeeeccccccceeEeccccEEEecC-CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHH-HhccCCCceEEEecCcCCC Confidence 00001111233345555555543321 111233456789999988876665555443211 1122 22344555545555 Q ss_pred hHHHHHHHHHHHH-hcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCC----------------- Q lcl|NC_021072. 275 KNKAEQYLREVMG-RYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQN----------------- 336 (533) Q Consensus 275 k~KAeqYl~~im~-~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~n----------------- 336 (533) + ++.+-+++.+. .|++..- -.++.-.++..|-.+..|++|.. T Consensus 227 ~-e~~~~~~~~~~~~~~~~~~-------------------~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~ 286 (467) T protein:vir:31 227 E-KGREEMRNLIEDNNEDNHR-------------------TAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDE 286 (467) T ss_pred H-HHHHHHHHHHHhhhcchhh-------------------hhhhhhcccccccccccccCCCcccccceeEEeccccChh Confidence 4 44444444443 3332110 00111111112222333333321 Q ss_pred cchHHHH-HHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCHh Q lcl|NC_021072. 337 LGELEDV-KYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IARLRKRFSELFMDLLKTQLILKGVMSLE 414 (533) Q Consensus 337 Lgei~DV-~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kf-i~rLr~~fs~if~d~Lk~qLilkgi~t~e 414 (533) =.++-+. ++..+...++.+||.+.|+...+-+.+.+ +...-..|.++ +.-+..++...|...|-++ + T Consensus 287 d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~~s~--~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~----~----- 355 (467) T protein:vir:31 287 EASFLEFRGRNEHDILKVHDVPPVIAGVVESGAFSTD--AEEQRKEFAEETIQPKQHDFGELLYELVHKQ----G----- 355 (467) T ss_pred hHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcccC--HHHHHHHHHHHHHHHHHHHHHHHHHHhhcch----h----- Confidence 1122222 34556699999999999965433344322 22222236544 4555555555444333211 1 Q ss_pred HHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhh--hcCC Q lcl|NC_021072. 415 EWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSER--EAGL 492 (533) Q Consensus 415 ew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~--~~~~ 492 (533) + ......|+|++..--. .-+..|++.+..+- -.-++|..-+++. +++..- .+ .+. .... T Consensus 356 ~-~~~~~~i~f~~~~l~~-------~d~~~~~~~~~~~~--~~G~~T~NE~R~~-~Gl~pi--~d------~~~~~~~~~ 416 (467) T protein:vir:31 356 L-DAPDWTIEFELAKPDT-------KLQDVEIASQRVQA--MQGLLTVNELRDE-FGFEPF--PE------EHVYGGETL 416 (467) T ss_pred h-ccCCceEEEecchhhc-------cCHHHHHHHHHHHH--hCCCcCHHHHHHH-hCCCCC--Cc------ccccCCccc Confidence 1 1112335555543211 12234555555432 2247778877744 666431 10 000 0111 Q ss_pred CCCCCcccccCCCCC--CCCCCCCccccccccCCcccc-----chhcC Q lcl|NC_021072. 493 IVDPMAEMDPAMDPG--NAPPADDMSAQEGPAVDAGDA-----KRGEF 533 (533) Q Consensus 493 ~~~p~~~~~~~~~~~--~~~~~~d~~~~~~~~~~~~~~-----~~~~~ 533 (533) .+.+..+..|+-+.+ .+++.++...+.-....+... |++.= T Consensus 417 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 464 (467) T protein:vir:31 417 VAEVTGGSGPGGGIGDQIEQLVEDRADEIIDSYQADLETEQLIEIGAN 464 (467) T ss_pred ccccccccCCCCcccCcCCCCCCCcccchHhhhhhccccchhhhhccc Confidence 111112222211111 111111111111001111111 11111 No 49 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=98.47 E-value=5.4e-07 Score=54.94 Aligned_cols=439 Identities=13% Similarity=0.124 Sum_probs=193.1 Q ss_pred cccccccccchhhhh---hHHHHHHHHHHhhhhc---------ch------hhhHHHHhhcceeeecCCCceEEEEeccC Q lcl|NC_021072. 36 GGGYYGYSVDFDGTV---RNEYELITRYREMVLQ---------PE------CDSAVDDIVNETICGNFDDVPVEVELSNL 97 (533) Q Consensus 36 ~~~~~~~~~~~~~~~---~~~~~LI~~YR~m~~~---------pE------vd~AvdeIvneaiv~d~~~~~v~v~l~~~ 97 (533) +++.--.-....... ..+...++.|-+-.+. ++ |-+=..-||+..+-+ .. +..+.. T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~-l~--~~g~~~--- 74 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDR-LD--IEGFRI--- 74 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhh-hc--cCceec--- Confidence 111111111111111 1233344444222110 10 011111111111000 00 001111 Q ss_pred CCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeec--CCCCCCCeEEEEEcChhhceehhhcc--CCC Q lcl|NC_021072. 98 KQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVID--PKNPRGGLTELRYIDPRKIRKVTEYQ--QKR 173 (533) Q Consensus 98 ~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid--~~~~~~gI~elr~lDP~~i~~vr~~~--~~~ 173 (533) +..+.. .+....|++.-+|+....++++.-.+.|+-|++.--. ....+.|-..++.+||+.+-.+..-. ++. T Consensus 75 ~~d~~~----~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~~~~p~~~~~i~D~~~~~~~ 150 (480) T protein:vir:78 75 SEDSEG----LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRV 150 (480) T ss_pred CCCchh----HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEEEEcccceEEEEcCCCccce Confidence 111122 2334455655678899999999999999998773211 01135677789999999886644321 111 Q ss_pred cCceeEEeccceeeccchhceecccccccc-c--cCCcceeccchhhccc-ccccc---------CCCCccchhHHHHHH Q lcl|NC_021072. 174 PEQLRGEDINTQLTQKAAEYYLYNPKGLKN-S--TNQGMKIATDSVTYCH-SGIQD---------LNKNMTLSHLHKAIK 240 (533) Q Consensus 174 ~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~-~--~~~~~kI~~dai~y~h-sGl~d---------~~~~~i~syL~~AiK 240 (533) .-..+++...+. .+......+|.|..... . ......-.++..++.| .|-+. .....+.|=|.+.++ T Consensus 151 ~~~i~~~~~~d~-~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i~ 229 (480) T protein:vir:78 151 TRAVRLYTTRDD-VAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELR 229 (480) T ss_pred EEEEEEEEeecC-CcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEeecccccCCccCccchhHHHH Confidence 111222211111 11111112233322110 0 0000000001111111 12111 111234454554433 Q ss_pred HH-HH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCcccccccc-chhHhhhcc Q lcl|NC_021072. 241 AV-NQ-LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKF-MSMLEDFWL 317 (533) Q Consensus 241 ~~-Nq-Lrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~-msmlEDywL 317 (533) ++ .. =+++-+.+++-....-|.|-+.=.+....+..+ .+ .++ ..+-...|+ T Consensus 230 ~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~---------------------~~-----~~~~~~~~~~~~~ 283 (480) T protein:vir:78 230 KVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDG---------------------EN-----TTLDIYYGRILTL 283 (480) T ss_pred HHHHHHHHHHHHHHHHHHhhcchhhhhhCCCcccccccc---------------------cc-----chhhhhhhhhccC Confidence 22 11 135667777777777787765422221111000 00 001 111122344 Q ss_pred cccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHH Q lcl|NC_021072. 318 PRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSEL 396 (533) Q Consensus 318 pRReggrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~i 396 (533) + |-+.++-++++.. ++- ++-++-....++..-++|..-|+..+. |-..|..|.--+...-.-+.+.|+.|..- T Consensus 284 ~----~~~~~~~~~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~-n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~ 357 (480) T protein:vir:78 284 A----SEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSE-NPASAEAIIATDSRIVKMAERKGRIFGGA 357 (480) T ss_pred C----CCCceEEecCccC-HHHHHHHHHHHHHHHhcccCCCHHHhccccC-chhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3 2346787888754 333 333566666677778888777764321 21223345555556667789999999999 Q ss_pred HHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHH Q lcl|NC_021072. 397 FMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQE 476 (533) Q Consensus 397 f~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDee 476 (533) +.+.++.-+.+.|.-...+|. .|.+.|..-..=+. .+.++.+.++-.-++..+|.+++.. +|++++++ T Consensus 358 l~~~~rl~~~~~~~~~~~~~~----~i~v~w~~~~~~s~-------~~~ad~~~kl~~~g~~~~s~et~~~-~lg~~~d~ 425 (480) T protein:vir:78 358 WERAMRIAMQIMGREVTEEYT----RLETVWRDPSTPTV-------AAKADAVSKLYANGQGPIPKEQARI-DLGYTATQ 425 (480) T ss_pred HHHHHHHHHHHcCCCccccce----eeeEEecCCCCCCH-------HHHHHHHHHHHHhcccCCCHHHHHh-cCCCCHhH Confidence 999999877777753334443 46778754322222 2456666666555556788888874 58999999 Q ss_pred HHHHHHHHHHhhhc--CCCCCCCcccccCCCCCCCCCCCCccccccccCCccccc Q lcl|NC_021072. 477 IKEIDKQIDSEREA--GLIVDPMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDAK 529 (533) Q Consensus 477 I~e~~kqi~~E~~~--~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~ 529 (533) ++++.+..++|... +....|..+...+.++++.+..++...+...+..-+-++ T Consensus 426 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 426 REQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) T ss_pred HHHHHHHHHHHHHHHHHHhhccccCCCccccCCCCCCCCCccCCCcccCCCcCCC Confidence 99877544443221 112222211111111111111111111111111112222 No 50 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=98.46 E-value=5.6e-07 Score=54.88 Aligned_cols=396 Identities=13% Similarity=0.158 Sum_probs=175.1 Q ss_pred CC-ccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcc Q lcl|NC_021072. 1 MS-NQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNE 79 (533) Q Consensus 1 ~~-~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvne 79 (533) |+ ..+||- ++. ...++.. .++. .+... -.+ +.+ ....++. -....++|-|.+||+-|++. T Consensus 1 ~~~~~~~~~-~k~-~~~~~~~---~~~~-~~~~~-~~~-~~~---~~~~~v~--------~~~a~~~~~V~~ci~~ia~~ 61 (409) T protein:vir:96 1 MAKENIVTR-IKK-KLIDNWI---DQSA-SKLYD-FSP-WKN---KSFWGVI--------NNTLETNETIFSAITKLSNS 61 (409) T ss_pred Cccccchhh-hhh-HHhhhhh---cccc-ccccc-ccc-ccC---ccccccc--------hhhHhhhHHHHHHHHHHHHh Confidence 32 122221 000 0000000 0000 00000 000 111 0111111 11234678899999999988 Q ss_pred eeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcc----hhhhhHHH----HhhhhcCceeeeeeecCCCCCCC Q lcl|NC_021072. 80 TICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDF----ENRSYEIF----RRWYVDGRLFYHKVIDPKNPRGG 151 (533) Q Consensus 80 aiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f----~~~~~~~f----R~WYvDGri~~hkvid~~~~~~g 151 (533) +-. .|+.+- ...+..+ ..+..+|+. ...+.++. ..+++.|.-|+.++-|. .+- T Consensus 62 ia~-----lp~~~~-~~~~~~~----------~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~---~G~ 122 (409) T protein:vir:96 62 MAS-----LPLKMY-EDYKVVN----------TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI---YHQ 122 (409) T ss_pred hhh-----CceEEe-ecccccc----------hhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECC---CCc Confidence 654 344332 1211111 123344432 23454444 44677899988876553 445 Q ss_pred eEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceecccccc--ccccCCcceeccchhhccccccccCCCC Q lcl|NC_021072. 152 LTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGL--KNSTNQGMKIATDSVTYCHSGIQDLNKN 229 (533) Q Consensus 152 I~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~--~~~~~~~~kI~~dai~y~hsGl~d~~~~ 229 (533) +++|.+|+|..++.+..- ...... |.++...+. .+.....++|+... ..++- T Consensus 123 ~~~L~~l~~~~v~v~~~~---~~~~~~--------------y~~~~~~g~~~~~~~~evih~r~~~---------~~~~~ 176 (409) T protein:vir:96 123 PSKLFLLNPDVVEMLIEN---QSRELY--------------YSIHAATGNKLIVHNMDMLHFKHIV---------ASNMV 176 (409) T ss_pred EEEEEEEcCceeEEEEeC---CCcEEE--------------EEEEcCCceEEEEccccEEEeCCCC---------CCCcc Confidence 899999999999754321 111110 011111111 01112223332110 11122 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccc Q lcl|NC_021072. 230 MTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFM 309 (533) Q Consensus 230 ~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~m 309 (533) .+.|.|..|...+.....++.. ..+...+.| . +...-.+.|.+.+++...+.+.+.|.| .|.+ + T Consensus 177 ~G~s~l~~~~~~i~~~~~~~~~-~~~~~~~~~-~-~i~~~~~~l~~e~~~~~~~~~~~~~~n-------~g~~------~ 240 (409) T protein:vir:96 177 QGISPIDVLKNTTDFDNAVRTF-NLTEMQKPD-S-FMLKYGSNVSTEKRQQVLEDFKQYYEE-------NGGI------L 240 (409) T ss_pred ccccHHHHHHHHHHHHHHHHHH-HHHhcCCCc-e-eEEecCCCCCHHHHHHHHHHHHHHhhc-------CCCe------e Confidence 3457777777776666556554 355555544 2 344445777777776666665554432 2322 1 Q ss_pred hhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HH Q lcl|NC_021072. 310 SMLEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IA 387 (533) Q Consensus 310 smlEDywLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kf-i~ 387 (533) .++ -|.+++.|.-. +.+.-++-..|..+.+.++++||.+.|+..+.-+.+...+..+. |.++ |. T Consensus 241 -vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~~~---f~~~~l~ 306 (409) T protein:vir:96 241 -FQE----------PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRF---YLQHTLL 306 (409) T ss_pred -ecC----------CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHHHH Confidence 111 25667777421 22222333446678899999999999986655555555555555 6655 33 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHH Q lcl|NC_021072. 388 RLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRR 467 (533) Q Consensus 388 rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k 467 (533) -+-.++. +.|-.. ++++.++. ....+.|. ..++...+ +..|++++..+-.- -++|..-++. T Consensus 307 P~~~~ie----~~l~~~-----Ll~~~~~~---~g~~i~fd----~~~ll~~d-~~~~~e~~~~~~~~--G~~T~NE~R~ 367 (409) T protein:vir:96 307 PIVKQYE----EEFNRK-----LLTKTDRE---KNRYFKFN----VKSYLRAD-SATQAEVYFKAVRS--GYYTINDIRE 367 (409) T ss_pred HHHHHHH----HHHHhh-----cCCccccc---CcceEEee----chhhhccC-HHHHHHHHHHHHhC--CCCCHHHHHH Confidence 3333322 223333 33444443 23455665 33443333 34566666655333 3777777764 Q ss_pred HHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCc Q lcl|NC_021072. 468 QVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDM 515 (533) Q Consensus 468 ~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~ 515 (533) .+++.+-+ --++-+- .....|-...........||+...+++ T Consensus 368 -~~g~~pi~--ggD~~~~---~~n~~~~~~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:96 368 -WEDLPPVE--GGDKPLI---SGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred -HhCCCCCC--Ccceeee---cccccccccchhhcccccCCCCCcCCC Confidence 36664321 0000000 000000000000001111222222222 No 51 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=98.42 E-value=7.5e-07 Score=54.16 Aligned_cols=446 Identities=15% Similarity=0.161 Sum_probs=186.5 Q ss_pred CCccccce---------eeeccccccccCCCCCCCCCcccc---------------eeecccccccccch-hhhhhHHHH Q lcl|NC_021072. 1 MSNQLFGF---------SLERAKKVPKGPSFVQKDSMDGSQ---------------PIVGGGYYGYSVDF-DGTVRNEYE 55 (533) Q Consensus 1 ~~~~~fg~---------~i~~~~~~~~~~s~~~~~~~dg~~---------------~~~~~~~~~~~~~~-~~~~~~~~~ 55 (533) -+.-+|-- ..+..-.+-.+++..|+...-+.. .+..+++.+. ++ ++.+ +-.. T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~g~~--~~~epp~-d~~~ 88 (648) T protein:vir:79 12 SRISLMWRDEDDDKEPLVLEESMQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGGGGR--DFEEPEF-DFNE 88 (648) T ss_pred hhhhhhccCccccccccccccccccCCCccccCCCCcccccccccchhHHHHHhHHHHHhhcCCcc--ccccCCc-CHHH Confidence 01111100 000000001111221111111110 0011111111 11 1111 1222 Q ss_pred HHHHHHhhhhcchhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHh----h Q lcl|NC_021072. 56 LITRYREMVLQPECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRR----W 131 (533) Q Consensus 56 LI~~YR~m~~~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~----W 131 (533) |-+- ...+|.|..||+.|++.+..+ +..+.-.+-...+..+. ..++..-+.....+++++. + T Consensus 89 l~~l---~~~np~V~~aI~iia~~ia~l-----~~~i~~~~~~~~~~~~~------~~ll~rPn~~~t~~~f~~~l~~~l 154 (648) T protein:vir:79 89 ITSA---YNTEGYVRQAVDKYIEMMFKA-----DWDFVSKNPNAVEYIRM------RFTLMAEATQIPTNQLFIEIAEDL 154 (648) T ss_pred HHHH---HhcChHHHHHHHHHHHHHhhC-----cceEEecCCccchhhHH------HHHhhccCCCCCHHHHHHHHHHHH Confidence 2222 245899999999988776542 33333222111111111 1122222444555555554 5 Q ss_pred hhcCceeeeeeecCCCC------------CCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceecccc Q lcl|NC_021072. 132 YVDGRLFYHKVIDPKNP------------RGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPK 199 (533) Q Consensus 132 YvDGri~~hkvid~~~~------------~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~ 199 (533) .+-|.-|+.++-|.+.- ..-++++..++|.+++.++...+ + ..+|.|.+. T Consensus 155 ll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~~g-----~-------------~~~Y~y~~~ 216 (648) T protein:vir:79 155 VKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRDKFG-----M-------------IKGWQQEQE 216 (648) T ss_pred HhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeEEEEcCCC-----c-------------eeeeEEEec Confidence 58899999988764321 11256677777777754332111 0 112444332 Q ss_pred cc----ccccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHHHHHHHHH-HHHHhcCccceEEEccCCCCc Q lcl|NC_021072. 200 GL----KNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLV-IYRLSRAPERRIFYIDVGNLP 274 (533) Q Consensus 200 ~~----~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~EDalV-IyRi~RAPeRrvfyIDvGnlp 274 (533) +. .+.+...++|... .+.++-.++|-|..|+..+....-+++..- .++=.--| .-++.+..+... T Consensus 217 g~~~~~~~~~~dIIHik~~---------~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P-~gil~~~~~~~~ 286 (648) T protein:vir:79 217 GQDKPQKFKPEDIVHIYYK---------REKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHP-LWHVKVGLEQEG 286 (648) T ss_pred CCceeEEecCccEEEEccC---------CCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCc-cEEEEeCCCccc Confidence 21 1122233333311 123344567899999888877666665443 22222223 455555555555 Q ss_pred hHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCC-C--CCcchHHHHHHHHHHHH Q lcl|NC_021072. 275 KNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPG-G--QNLGELEDVKYFQKKLY 351 (533) Q Consensus 275 k~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpG-g--~nLgei~DV~YF~~kLy 351 (533) ...+++.++.+-..|++..+ .. |+...+...++- + +.+.-++=.++..+... T Consensus 287 ~e~~k~~~e~~~~~~~~~~i---~g----------------------g~v~~~~~~i~~~~s~~dlqfle~rk~~~~eIa 341 (648) T protein:vir:79 287 FGAEEGEVDLVRGEVENMDV---EG----------------------GMVTTERVNISSIASNQIIDAKEYLKHFEQRAF 341 (648) T ss_pred hHHHHHHHHHHHHhcccccc---cc----------------------cccccceeeccccCCHHHHHHHHHHHHHHHHHH Confidence 55566666666666655332 11 111222222221 1 12222333477889999 Q ss_pred HhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccc Q lcl|NC_021072. 352 KALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADN 431 (533) Q Consensus 352 ~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn 431 (533) ++.+||...|+..++-+...++..... |...|..|+..+..++...+-+.+++..-++ .|-.....++|+|..-. T Consensus 342 ~aFgVPP~lLG~~~~ss~stae~~~~~---~~~~i~~l~~~i~~~le~~~~~~ll~e~~l~--~~l~~d~~ieF~~~~Ll 416 (648) T protein:vir:79 342 TVLGVSELMMGRGGTASRSTGDNLSSD---FKDRIKALQKVMATFINEFMVKEILMEGGFD--PVLNPDDKVEFRFNEID 416 (648) T ss_pred HHhCCCHhHcccCCCccchHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc--ccccccceEEEeecccc Confidence 999999999975544444334433333 7778888888777777766555555544322 23322345666665221 Q ss_pred hHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHH----HHHHH---HH-HhhhcCCCCCCCcccccC Q lcl|NC_021072. 432 YFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIK----EIDKQ---ID-SEREAGLIVDPMAEMDPA 503 (533) Q Consensus 432 ~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~----e~~kq---i~-~E~~~~~~~~p~~~~~~~ 503 (533) .-.+. .|.+.+.++ +-+-++|.+-++. .+++..-+=. ....+ .. .....+..+.|..+... T Consensus 417 r~D~~-------~~a~~~~~l--~~~GilT~NEaR~-~lGlpPi~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~- 485 (648) T protein:vir:79 417 MDSKI-------KLENQAVFL--YEHNAISEDEMRE-LIGRDPVDDGEGRAKMHLQMVTIAQATALAALAPTPAGGSSA- 485 (648) T ss_pred hhhHH-------HHHHHHHHH--HhCCCcCHHHHHH-HhCCCCCCCCCCccccccccccchhccccccCCCCCCCCCCC- Confidence 11221 223333322 2345788888885 4787653200 00000 00 00001111111110000 Q ss_pred CCCCCCCCCCCcccccccc------C-CccccchhcC Q lcl|NC_021072. 504 MDPGNAPPADDMSAQEGPA------V-DAGDAKRGEF 533 (533) Q Consensus 504 ~~~~~~~~~~d~~~~~~~~------~-~~~~~~~~~~ 533 (533) ..+++++..+++..+.|. . ..+++...-. T Consensus 486 -~a~~eg~~~e~~~~~~~~~~~g~~~~~~~~~~~~~~ 521 (648) T protein:vir:79 486 -SASGDKKKKATDNKTKPTNQHGTKTSPKKQTNGRHV 521 (648) T ss_pred -CccccccccccCCCCCCCCCCCcCCCCccccchhhh Confidence 000111111111000000 0 0011111111 No 52 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=98.41 E-value=7.7e-07 Score=54.10 Aligned_cols=429 Identities=13% Similarity=0.148 Sum_probs=203.9 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeeccccccc-----ccchhhhhhHHHHHHHHHHhhhhcchhhhHHHH Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGY-----SVDFDGTVRNEYELITRYREMVLQPECDSAVDD 75 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~-----~~~~~~~~~~~~~LI~~YR~m~~~pEvd~Avde 75 (533) +-.+|||.++++.-...+. + .++..-..+.-....|-|- +.+..+... .+.+.++ +-....+++ T Consensus 14 ~~~~~~~~~~~~~~~~~~i-~--~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~-----~~~~~sl---nl~~~i~~~ 82 (500) T protein:vir:30 14 SKYVMTTQSLTNITDHPKI-A--ISKLEYDRITTNLKYYKSDWDSVLYLNTDGETK-----KRDLNHL---PIARTAAKK 82 (500) T ss_pred HHHHhhcchhhhhhccccc-c--CCHHHHHHHHHHHHHhcCCCCCcccccCCCCcc-----cCceeec---chHHHHHHH Confidence 5557788887764332222 1 1211111111111111111 011111111 1112222 111222222 Q ss_pred hhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEE Q lcl|NC_021072. 76 IVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTEL 155 (533) Q Consensus 76 Ivneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~el 155 (533) .++=+. + .|..+.+++ +..++.++.+++--+|.....+.+....+-|..+|+..+|... ..+ T Consensus 83 ~A~lv~----~-e~~~i~~~d--------~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~-----~~I 144 (500) T protein:vir:30 83 IASLVF----N-EQAEIKVDD--------DAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDGDK-----VRV 144 (500) T ss_pred Hhhhhc----C-CcceEecCC--------hHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCc-----eEE Confidence 222111 1 233344433 2345566777777789999999999999999999999998422 346 Q ss_pred EEcChhhceehhhccCCCcCceeEEe--c------cceeec------------cchhceeccccccccccCCcceeccch Q lcl|NC_021072. 156 RYIDPRKIRKVTEYQQKRPEQLRGED--I------NTQLTQ------------KAAEYYLYNPKGLKNSTNQGMKIATDS 215 (533) Q Consensus 156 r~lDP~~i~~vr~~~~~~~~~~~~~~--~------~~~~~~------------~~~e~~~y~p~~~~~~~~~~~kI~~da 215 (533) .+++|.++-+++--.+.. ...++. . ...++. +.-.+.+|.-... ...+ ..++... T Consensus 145 ~~v~ad~~~P~~~d~~~~--~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~-~~lG--~~v~l~~ 219 (500) T protein:vir:30 145 AFVQAPVFLPLQSNTQDV--SSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDK-AKVG--SRVPLSE 219 (500) T ss_pred EEEcCCeeEEEEEcCCCe--EEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccc-cccC--ccccccc Confidence 778888886643321111 111110 0 000000 0011122221100 0011 1111111 Q ss_pred hhccc-------ccc---------------ccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCC Q lcl|NC_021072. 216 VTYCH-------SGI---------------QDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNL 273 (533) Q Consensus 216 i~y~h-------sGl---------------~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnl 273 (533) + |.. .|+ .+..++.++|-++.|.-.+..|-..-+.+. |..|.=++|||. +..=+ T Consensus 220 ~-~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~--~e~~~g~~~i~v-~~~~l 295 (500) T protein:vir:30 220 V-YKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFM--WEVKMGQRRVAV-PESLT 295 (500) T ss_pred c-cCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHH--HHHHhCcceeee-chHHh Confidence 1 110 111 233456688999999999888887777765 788887778765 11110 Q ss_pred chHHHHHHHHHHHHhcccEEEeeCCCCccc-------cccccchhHhhhcccccCCCCccceeecCCCCCcch-HHHHHH Q lcl|NC_021072. 274 PKNKAEQYLREVMGRYRNKLVYDANTGEIK-------DDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKY 345 (533) Q Consensus 274 pk~KAeqYl~~im~~~rnk~vYd~~TGev~-------~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLge-i~DV~Y 345 (533) + ...+..+|+.. +++.|..|-.+ -++ +.-|+.+...--.++ ..-+.+ T Consensus 296 ~------------------~~~~~~~g~~~~~~~~d~~~~~~~~~~~~-----~~~--~~~i~~~~~~ir~e~~~~~l~~ 350 (500) T protein:vir:30 296 A------------------LTVRTTDGDVVPRPRFESDQNVYIRMGGR-----DLD--SSAIQDLTTPIRADDYIKAINE 350 (500) T ss_pred c------------------ccCCCCCccccCCcccCCCcceEEEcCCC-----CCc--CcceeEeccccChHHHHHHHHH Confidence 0 01111122110 12222221100 011 122555543221222 234566 Q ss_pred HHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccC---CCHhHHhhhhhc Q lcl|NC_021072. 346 FQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGV---MSLEEWDEMKEH 422 (533) Q Consensus 346 F~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi---~t~eew~~~~~~ 422 (533) +.+.+=.+.+++-+.|+.+++ ....++||.-.+-.-..-+.+.|+.|...+.++++.=|-+... ++. +-..... T Consensus 351 ~l~~i~~~~gls~~~~~~~~~-g~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~--~~~~~~~ 427 (500) T protein:vir:30 351 GLSLFEMQIGVSAGLFSFDGK-SMKTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQS--EVPSMDN 427 (500) T ss_pred HHHHHHHHhCCCccccccCcC-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC--CCCCCcc Confidence 777777788888888876654 2345677765555555557777777777777777765433211 100 0011224 Q ss_pred eeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccc Q lcl|NC_021072. 423 IQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDP 502 (533) Q Consensus 423 i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~ 502 (533) +.++|. |+.+.. +++++ +.+.++.-- | .+|..+.+.+..++||+|.+++.++|++|.... ...|+++.++ T Consensus 428 v~v~f~-d~i~~d-~~~~~-----~~~~~~v~a-G-i~s~~~~i~~~~g~~eeea~~~l~~i~~E~~~~-~~~~~~~~~~ 497 (500) T protein:vir:30 428 ISISLD-DGVFTD-RDAEL-----DYWIKVVNA-G-FGTREMAIQKVLNVTEEKAQEIAAEINTGIVDE-INQQRTDTHL 497 (500) T ss_pred eEEEeC-CCCCCC-HHHHH-----HHHHHHHHc-C-CCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccc-CCCCCccccc Confidence 778885 443333 22221 111122111 3 488888777778999999999999999885322 2223333222 Q ss_pred CCC Q lcl|NC_021072. 503 AMD 505 (533) Q Consensus 503 ~~~ 505 (533) .-+ T Consensus 498 ~g~ 500 (500) T protein:vir:30 498 YGE 500 (500) T ss_pred cCC Confidence 222 No 53 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=98.41 E-value=7.7e-07 Score=54.10 Aligned_cols=429 Identities=13% Similarity=0.148 Sum_probs=203.9 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeeccccccc-----ccchhhhhhHHHHHHHHHHhhhhcchhhhHHHH Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGY-----SVDFDGTVRNEYELITRYREMVLQPECDSAVDD 75 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~-----~~~~~~~~~~~~~LI~~YR~m~~~pEvd~Avde 75 (533) +-.+|||.++++.-...+. + .++..-..+.-....|-|- +.+..+... .+.+.++ +-....+++ T Consensus 14 ~~~~~~~~~~~~~~~~~~i-~--~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~-----~~~~~sl---nl~~~i~~~ 82 (500) T protein:vir:98 14 SKYVMTTQSLTNITDHPKI-A--ISKLEYDRITTNLKYYKSDWDSVLYLNTDGETK-----KRDLNHL---PIARTAAKK 82 (500) T ss_pred HHHHhhcchhhhhhccccc-c--CCHHHHHHHHHHHHHhcCCCCCcccccCCCCcc-----cCceeec---chHHHHHHH Confidence 5557788887764332222 1 1211111111111111111 011111111 1112222 111222222 Q ss_pred hhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEE Q lcl|NC_021072. 76 IVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTEL 155 (533) Q Consensus 76 Ivneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~el 155 (533) .++=+. + .|..+.+++ +..++.++.+++--+|.....+.+....+-|..+|+..+|... ..+ T Consensus 83 ~A~lv~----~-e~~~i~~~d--------~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~-----~~I 144 (500) T protein:vir:98 83 IASLVF----N-EQAEIKVDD--------DAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDGDK-----VRV 144 (500) T ss_pred Hhhhhc----C-CcceEecCC--------hHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCc-----eEE Confidence 222111 1 233344433 2345566777777789999999999999999999999998422 346 Q ss_pred EEcChhhceehhhccCCCcCceeEEe--c------cceeec------------cchhceeccccccccccCCcceeccch Q lcl|NC_021072. 156 RYIDPRKIRKVTEYQQKRPEQLRGED--I------NTQLTQ------------KAAEYYLYNPKGLKNSTNQGMKIATDS 215 (533) Q Consensus 156 r~lDP~~i~~vr~~~~~~~~~~~~~~--~------~~~~~~------------~~~e~~~y~p~~~~~~~~~~~kI~~da 215 (533) .+++|.++-+++--.+.. ...++. . ...++. +.-.+.+|.-... ...+ ..++... T Consensus 145 ~~v~ad~~~P~~~d~~~~--~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~-~~lG--~~v~l~~ 219 (500) T protein:vir:98 145 AFVQAPVFLPLQSNTQDV--SSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDK-AKVG--SRVPLSE 219 (500) T ss_pred EEEcCCeeEEEEEcCCCe--EEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccc-cccC--ccccccc Confidence 778888886643321111 111110 0 000000 0011122221100 0011 1111111 Q ss_pred hhccc-------ccc---------------ccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCC Q lcl|NC_021072. 216 VTYCH-------SGI---------------QDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNL 273 (533) Q Consensus 216 i~y~h-------sGl---------------~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnl 273 (533) + |.. .|+ .+..++.++|-++.|.-.+..|-..-+.+. |..|.=++|||. +..=+ T Consensus 220 ~-~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~--~e~~~g~~~i~v-~~~~l 295 (500) T protein:vir:98 220 V-YKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFM--WEVKMGQRRVAV-PESLT 295 (500) T ss_pred c-cCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHH--HHHHhCcceeee-chHHh Confidence 1 110 111 233456688999999999888887777765 788887778765 11110 Q ss_pred chHHHHHHHHHHHHhcccEEEeeCCCCccc-------cccccchhHhhhcccccCCCCccceeecCCCCCcch-HHHHHH Q lcl|NC_021072. 274 PKNKAEQYLREVMGRYRNKLVYDANTGEIK-------DDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKY 345 (533) Q Consensus 274 pk~KAeqYl~~im~~~rnk~vYd~~TGev~-------~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLge-i~DV~Y 345 (533) + ...+..+|+.. +++.|..|-.+ -++ +.-|+.+...--.++ ..-+.+ T Consensus 296 ~------------------~~~~~~~g~~~~~~~~d~~~~~~~~~~~~-----~~~--~~~i~~~~~~ir~e~~~~~l~~ 350 (500) T protein:vir:98 296 A------------------LTVRTTDGDVVPRPRFESDQNVYIRMGGR-----DLD--SSAIQDLTTPIRADDYIKAINE 350 (500) T ss_pred c------------------ccCCCCCccccCCcccCCCcceEEEcCCC-----CCc--CcceeEeccccChHHHHHHHHH Confidence 0 01111122110 12222221100 011 122555543221222 234566 Q ss_pred HHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccC---CCHhHHhhhhhc Q lcl|NC_021072. 346 FQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGV---MSLEEWDEMKEH 422 (533) Q Consensus 346 F~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi---~t~eew~~~~~~ 422 (533) +.+.+=.+.+++-+.|+.+++ ....++||.-.+-.-..-+.+.|+.|...+.++++.=|-+... ++. +-..... T Consensus 351 ~l~~i~~~~gls~~~~~~~~~-g~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~--~~~~~~~ 427 (500) T protein:vir:98 351 GLSLFEMQIGVSAGLFSFDGK-SMKTATEIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQS--EVPSMDN 427 (500) T ss_pred HHHHHHHHhCCCccccccCcC-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC--CCCCCcc Confidence 777777788888888876654 2345677765555555557777777777777777765433211 100 0011224 Q ss_pred eeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccc Q lcl|NC_021072. 423 IQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDP 502 (533) Q Consensus 423 i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~ 502 (533) +.++|. |+.+.. +++++ +.+.++.-- | .+|..+.+.+..++||+|.+++.++|++|.... ...|+++.++ T Consensus 428 v~v~f~-d~i~~d-~~~~~-----~~~~~~v~a-G-i~s~~~~i~~~~g~~eeea~~~l~~i~~E~~~~-~~~~~~~~~~ 497 (500) T protein:vir:98 428 ISISLD-DGVFTD-RDAEL-----DYWIKVVNA-G-FGTREMAIQKVLNVTEEKAQEIAAEINTGIVDE-INQQRTDTHL 497 (500) T ss_pred eEEEeC-CCCCCC-HHHHH-----HHHHHHHHc-C-CCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccc-CCCCCccccc Confidence 778885 443333 22221 111122111 3 488888777778999999999999999885322 2223333222 Q ss_pred CCC Q lcl|NC_021072. 503 AMD 505 (533) Q Consensus 503 ~~~ 505 (533) .-+ T Consensus 498 ~g~ 500 (500) T protein:vir:98 498 YGE 500 (500) T ss_pred cCC Confidence 222 No 54 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=98.36 E-value=1.1e-06 Score=53.33 Aligned_cols=424 Identities=17% Similarity=0.174 Sum_probs=201.5 Q ss_pred CCccccceeeecc-ccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhh--------------- Q lcl|NC_021072. 1 MSNQLFGFSLERA-KKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMV--------------- 64 (533) Q Consensus 1 ~~~~~fg~~i~~~-~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~--------------- 64 (533) |-. +-.|++. +---++ .|+ .. ..+||..|..++ T Consensus 1 ~~~---~~~~~~~i~~w~~~---~~~-~~------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 49 (518) T protein:vir:78 1 MGV---WSVMTRFIKGWLNG---KPN-GS------------------------EPELIPKYLPLVPDNQKEWSKDSYLTS 49 (518) T ss_pred Ccc---hhhHHHHHHHhhcC---CCC-cc------------------------chhccHHHhhhcccchhhhhhhhhhhh Confidence 110 0001110 000000 000 00 122333322221 Q ss_pred ---------------hcchhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHH Q lcl|NC_021072. 65 ---------------LQPECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFR 129 (533) Q Consensus 65 ---------------~~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR 129 (533) ..|--..++++.++=++ + .+++|.++..+.++ .+..++.++.+++-.+|.....+.+. T Consensus 50 ~w~~~~~~~~~~~~~~~~l~~~i~~~~A~ll~----~-e~~~i~v~~~~~~d--~e~~~~~l~~il~~n~f~~~~~~~~e 122 (518) T protein:vir:78 50 LWAQGYVPTVHDKLMNSGTGNEIVVVAAEYIS----G-KPLSIDVTGVNGSK--DENLTKQLKEALRIDNFDSKSVKIVE 122 (518) T ss_pred hcccCCCCccccccccCChHHHHHHHHHHhhc----C-CCceEEecCccccC--cHHHHHHHHHHHHhccHHHHHHHHHH Confidence 01111222333332221 1 23344443332211 12345566778888899999999999 Q ss_pred hhhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEE--e----ccc-eee--------------- Q lcl|NC_021072. 130 RWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGE--D----INT-QLT--------------- 187 (533) Q Consensus 130 ~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~--~----~~~-~~~--------------- 187 (533) .+..-|.++|+..+|.. + ..+..++|.++=++... + +...+. + .+. .++ T Consensus 123 ~a~a~G~~~~k~~~d~~---~--~~i~~v~ad~~~P~~~~-g---~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~ 193 (518) T protein:vir:78 123 LAGGSGVSAVKINILNG---R--PSISVHSSSQFWIDFKN-N---EPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEG 193 (518) T ss_pred HhhccCceEEEEEEECC---e--eEEEEEcCCeeEEEeec-C---cEEEEEEEEEeecCCcceeEEEEEeecccccccee Confidence 99999999999999742 2 36777888888664321 1 111110 0 000 000 Q ss_pred ----ccchhceeccccccccccCCccee-----------------------ccchhhcccc---ccccCCCCccchhHHH Q lcl|NC_021072. 188 ----QKAAEYYLYNPKGLKNSTNQGMKI-----------------------ATDSVTYCHS---GIQDLNKNMTLSHLHK 237 (533) Q Consensus 188 ----~~~~e~~~y~p~~~~~~~~~~~kI-----------------------~~dai~y~hs---Gl~d~~~~~i~syL~~ 237 (533) .+.-.+.+|.-......+.+.+.+ +.-.+.|..- -..+..++.++|-|+. T Consensus 194 ~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~ 273 (518) T protein:vir:78 194 KKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQ 273 (518) T ss_pred ecccceeEEEEEeeecCcccccccccccccccccccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhh Confidence 000111122111000001110000 0111111111 1233345668899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcc Q lcl|NC_021072. 238 AIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWL 317 (533) Q Consensus 238 AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywL 317 (533) |.-++..|-..=+ -+.|..|.=++|||.-+ .=|+ ...+..++... .. ..--.++|. T Consensus 274 ~~~~id~lD~~~s--~~~~e~~~g~~~i~v~~-~~l~------------------~~~~~~~~~~~--~~-fd~~~~~y~ 329 (518) T protein:vir:78 274 CTNYLFAVDYFFT--VYMREGEKTKTKIAASE-RMFR------------------KKVNKSTDKEE--WS-MNVDEDYFM 329 (518) T ss_pred hhHHHHHHHHHHH--HHHHHHHhCCceeeech-hHhc------------------cCCCCCCCccc--cc-cCCCCceEE Confidence 9988888877666 45688888777777621 1110 00011111000 00 000112232 Q ss_pred ccc----CCCC-ccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHH Q lcl|NC_021072. 318 PRR----EGGR-GTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRK 391 (533) Q Consensus 318 pRR----eggr-gTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~ 391 (533) +.. +|+. .+.|+++...=.-.+ ..-+..+.+.+..+.+++-+-|+.+++- --++||.+..-+-...+.+.|+ T Consensus 330 ~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~~--~TATei~s~~~~~~~t~~~~~~ 407 (518) T protein:vir:78 330 QFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNRE--VKATEIWSLQDATVRKIEKKKR 407 (518) T ss_pred EecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCccccc--ccHHHHHHHHHHHHHHHHHHHH Confidence 221 1222 234666554322222 3336777888888899988888654332 1356777665556667888888 Q ss_pred HHHHHHHHHHHHHHHh-ccCCCHhHHhhhh--hceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHH Q lcl|NC_021072. 392 RFSELFMDLLKTQLIL-KGVMSLEEWDEMK--EHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQ 468 (533) Q Consensus 392 ~fs~if~d~Lk~qLil-kgi~t~eew~~~~--~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~ 468 (533) .+...+.++++.-|-| +.......|.... -.+.++|.. +.+.. ++ +.++.++++... | .+|++..+++ T Consensus 408 ~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D-~i~~D-~~-----~~~~~~~~~v~a-G-imS~e~~i~~ 478 (518) T protein:vir:78 408 LIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPD-PMSVN-LN-----ELSSTLNNMNSA-L-AMSVEEKVKL 478 (518) T ss_pred HHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCC-CCCCC-HH-----HHHHHHHHHHhc-C-CCCHHHHHHH Confidence 8888888877764432 3222111122122 236666663 22222 11 122222222222 3 6899987776 Q ss_pred H-hCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCC-CC Q lcl|NC_021072. 469 V-LKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDP-GN 508 (533) Q Consensus 469 I-L~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~-~~ 508 (533) . -..||+|.+++-++|++|......++|.+- .+|++ || T Consensus 479 ~~~~~~deea~~e~~ri~~E~~~~~~~~p~~~--~g~~~~~g 518 (518) T protein:vir:78 479 IHPKWEDEEIQAEVKRIYLENAIGEVPDPEAI--GGMETKGG 518 (518) T ss_pred hCCCCCHHHHHHHHHHHHHHhcccCCCCCccc--cCCCCCCC Confidence 4 378999999999999999877655555433 23332 22 No 55 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=98.32 E-value=1.4e-06 Score=52.67 Aligned_cols=448 Identities=13% Similarity=0.126 Sum_probs=211.3 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccch-----hhhhhHHHHHHHHHHhhhhcchhhhHHHH Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDF-----DGTVRNEYELITRYREMVLQPECDSAVDD 75 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~-----~~~~~~~~~LI~~YR~m~~~pEvd~Avde 75 (533) |..++|+-+++.+-. ...+..|+.....+..-..-|-|..... ++.-+++ ...++ +--..+..+ T Consensus 14 ~~~~~~~~~~~~~~~---~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~-----~~~sl---~~~~~i~~~ 82 (517) T protein:vir:98 14 GGYALSGQTLKSIND---HEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQER-----DYMTL---NLRKLSADV 82 (517) T ss_pred HHHHhcccchhHhhc---CCceecCHHHHHHHHHHHHHhcCCCccccccccccccccc-----ceeec---CcHHHHHHH Confidence 666677655554222 2233334333333332222222222111 1111110 01111 011112222 Q ss_pred hhcceeeecCCCceEEEEeccCCCcH--HHHH-HHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCe Q lcl|NC_021072. 76 IVNETICGNFDDVPVEVELSNLKQSD--KIKK-LIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGL 152 (533) Q Consensus 76 Ivneaiv~d~~~~~v~v~l~~~~~S~--~ik~-~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI 152 (533) .++=+. ..+.+|.+++....+ +... .-++-++.++.--+|.....+.+.....-|-.+|-..+|. |- T Consensus 83 ~A~Ll~-----~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~-----~~ 152 (517) T protein:vir:98 83 LSGLVF-----NEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVDN-----GE 152 (517) T ss_pred hhhhhc-----CCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEeC-----Ce Confidence 221110 122334443333221 1112 2344566777777899999999999999999999888884 23 Q ss_pred EEEEEcChhhceehhhccCCCcCceeEEe------ccceeeccc----------------hhceeccccccccccCCcce Q lcl|NC_021072. 153 TELRYIDPRKIRKVTEYQQKRPEQLRGED------INTQLTQKA----------------AEYYLYNPKGLKNSTNQGMK 210 (533) Q Consensus 153 ~elr~lDP~~i~~vr~~~~~~~~~~~~~~------~~~~~~~~~----------------~e~~~y~p~~~~~~~~~~~k 210 (533) ..+..++|.++=|++-..+......-+.. ....++..- -.+.+|... ..... +.. T Consensus 153 ~~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~-~~~~l--G~~ 229 (517) T protein:vir:98 153 IEFSWALANAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSD-NEGEI--GKR 229 (517) T ss_pred eEEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecC-CCccc--ccc Confidence 45788888888664321111110000000 000010000 000111100 00011 111 Q ss_pred eccchh-------hcccccc---------------ccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEc Q lcl|NC_021072. 211 IATDSV-------TYCHSGI---------------QDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYI 268 (533) Q Consensus 211 I~~dai-------~y~hsGl---------------~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyI 268 (533) |+-..+ +|. .|+ .+..++.++|-++.|+-.+--|-..=+. +.|..|.=.+|||. T Consensus 230 v~L~~~~e~l~~~~~~-~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~--~~~e~~~g~~~i~v- 305 (517) T protein:vir:98 230 IPLEELYEGMQEKTYI-QGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQ--FWWEIKMGQRTVFV- 305 (517) T ss_pred ccccccccCCCcceeE-CCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHH--HHHHHHhCCcceec- Confidence 111000 000 111 2334677899999999888877755554 45888887777775 Q ss_pred cCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCc-chHHHHHHHH Q lcl|NC_021072. 269 DVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNL-GELEDVKYFQ 347 (533) Q Consensus 269 DvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nL-gei~DV~YF~ 347 (533) +..=|++.... +......++| .+++.|+.+.-+ .|+. -|+++.+.==. .-..-+.++. T Consensus 306 p~~~l~~~~~~-------~g~~~~~~~d------~~~~~y~~~~~~------~~~~--~i~~~~~~iR~e~~~~~~~~~L 364 (517) T protein:vir:98 306 SDVMLRTVPDE-------SGMPPPQVFD------PDVNVYKSIRMG------TDEE--FVKDVTHDIRTEQYKEAINQAL 364 (517) T ss_pred ChhhhccccCC-------CCcccCCCCC------cccceeeeccCC------CCCC--ceeeeccccchHHHHHHHHHHH Confidence 22211100000 0000001111 123334332211 1211 24444442111 2245567788 Q ss_pred HHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc---CCCHhHHhhhhhcee Q lcl|NC_021072. 348 KKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKG---VMSLEEWDEMKEHIQ 424 (533) Q Consensus 348 ~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkg---i~t~eew~~~~~~i~ 424 (533) +.+-...++|-+-|+.++. ....++||...+-.-..-+.+.|+.+...+.++++.-|.|.. +++..-+. ...+. T Consensus 365 ~~i~~~~Gls~~t~~~~~~-~~kTATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~--~~~v~ 441 (517) T protein:vir:98 365 RTLEMELKLSVGTFSFDGR-SMKTATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPS--AEHIG 441 (517) T ss_pred HHHHHHhCCCccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCC--CcceE Confidence 8888899999999987754 235678887777776677889999888888888888764432 11111111 12366 Q ss_pred EEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCC Q lcl|NC_021072. 425 FDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAM 504 (533) Q Consensus 425 ~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~ 504 (533) ++|. |+.+.. ++++. +...++.-- | .+|....+.+..++|++|.+++-.+|++|.... -+.|..+...+- T Consensus 442 v~f~-D~i~~D-~~~~~-----~~~~~~v~a-G-~ms~~~~i~~~~g~~eeeA~~e~~~i~~E~~~~-~~~~~~~~~~~~ 511 (517) T protein:vir:98 442 VDFD-DGVFQD-RSALL-----RFYGQAKTF-G-FIPTVEAIQRIFKVPKKTAEQWLEEIRKDQIEL-DPVTISQRAQKR 511 (517) T ss_pred EEcC-CCCCCC-HHHHH-----HHHHHHHhc-C-CCCHHHHHHHhCCCChHHHHHHHHHHHHhcccc-CCCCccccccCC Confidence 7775 333332 22222 112222111 3 388888777788999999999999999988643 122221111100 Q ss_pred CCCCCC Q lcl|NC_021072. 505 DPGNAP 510 (533) Q Consensus 505 ~~~~~~ 510 (533) -.|++. T Consensus 512 ~~gd~e 517 (517) T protein:vir:98 512 MFGDEE 517 (517) T ss_pred CCCCCC Confidence 011111 No 56 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=98.30 E-value=1.6e-06 Score=52.42 Aligned_cols=428 Identities=13% Similarity=0.131 Sum_probs=183.8 Q ss_pred ccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcceeeec Q lcl|NC_021072. 5 LFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNETICGN 84 (533) Q Consensus 5 ~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvneaiv~d 84 (533) +|.|--...+....+.+..+..-.+. ....+..|.|. ...+.. ++ -...+++|.|..||+-|.+.+-- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~g~---~~~g~~-----v~-~~~al~~~~V~~~v~~Ia~~iA~-- 68 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSL-FQAVAEPFAGA---WQQGVK-----AD-PEAVLSFHAVFACISLISQDIAK-- 68 (454) T ss_pred CCCccccCcccccccccccchhhhhh-hhhhhhhhcch---hhcCcc-----cC-hHHhhccHHHHHHHHHHHHhhcc-- Confidence 66664443222222222211111110 01111122221 111110 00 12345678899999999887542 Q ss_pred CCCceEEEEe-ccCCCcHHHHHHHHHHHHHHHHHh----cchhhhhHHH----HhhhhcCceeeeeeecCCCCCCCeEEE Q lcl|NC_021072. 85 FDDVPVEVEL-SNLKQSDKIKKLIREEFAEILRLL----DFENRSYEIF----RRWYVDGRLFYHKVIDPKNPRGGLTEL 155 (533) Q Consensus 85 ~~~~~v~v~l-~~~~~S~~ik~~I~eeF~~i~~lL----~f~~~~~~~f----R~WYvDGri~~hkvid~~~~~~gI~el 155 (533) .|+.|-= +..+..+.+ .+..+.+| |-...+.++. ..+++.|.-|..++-+. .+-+++| T Consensus 69 ---lp~~~~~~~~~g~~~~~-------~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~---~G~~~~L 135 (454) T protein:vir:93 69 ---MRLRLMQTDAQGIRRET-------RRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNA---RGQIKEL 135 (454) T ss_pred ---CceEEEEeccCCccchh-------hhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECC---CCcEEEE Confidence 4555531 121111112 12222222 3333444444 44678899999988764 3449999 Q ss_pred EEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCccchhH Q lcl|NC_021072. 156 RYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHL 235 (533) Q Consensus 156 r~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL 235 (533) .+++|..++.++.- ++..+ |.|.+.... .......++.+-|.+...+. ..++-.++|-+ T Consensus 136 ~~i~~~~v~v~~~~-----~g~~~--------------y~~~~~~~~-~~~~~~~~~~~eViH~k~~~-~~~~~~G~sp~ 194 (454) T protein:vir:93 136 RILDWNRVEPLVAD-----DGEVF--------------YRITPDRNC-GITEAVTVPAREVIHDRFNC-FFHPLIGLPPV 194 (454) T ss_pred EEEcCcceEEEEcC-----CCcEE--------------EEEEecccc-ccceeEEecCcceEEeccCC-CCCCceeccHH Confidence 99999999754321 11111 111111000 00112223333332222221 12333567999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhh Q lcl|NC_021072. 236 HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDF 315 (533) Q Consensus 236 ~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDy 315 (533) ..|.+.+.....+++...=+=---+--+-+..++ |.|.+..+++--+. .+....- ..+|.+- +++ T Consensus 195 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~-~~~~~~g----~n~g~~~-------vl~-- 259 (454) T protein:vir:93 195 YAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIP-GSITEENAKKLKSN-WDSGYTG----ENAGKTA-------ILS-- 259 (454) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCccEEEecC-CCCCHHHHHHHHHH-HHHHhcc----cccCCce-------ecc-- Confidence 9999999999999887653322223345566666 56665554433222 2221110 0122211 221 Q ss_pred cccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HHHHHHHH Q lcl|NC_021072. 316 WLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IARLRKRF 393 (533) Q Consensus 316 wLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kf-i~rLr~~f 393 (533) .|.+++.|.=. ..+.-++-.+|....+.++++||...|+..++-+....++..+. |.++ |.-+..++ T Consensus 260 --------~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~---f~~~~l~P~~~~i 328 (454) T protein:vir:93 260 --------NGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALEQQ---YYSQCLQTLIESI 328 (454) T ss_pred --------CCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHHHH---HHHHHHHHHHHHH Confidence 13444444321 11222344457778999999999999975544344344444443 5443 44455555 Q ss_pred HHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCC Q lcl|NC_021072. 394 SELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQT 473 (533) Q Consensus 394 s~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~t 473 (533) ...+...| + +..+ ..+.|. +.++...+ +..|++.+..+-.- -++|.+-++. .+++. T Consensus 329 e~~ln~~L----~-----~~~~-------~~~~f~----~~~ll~~D-~~~r~~~~~~~~~~--G~~T~NE~R~-~~gl~ 384 (454) T protein:vir:93 329 ELLLDEAL----E-----TGEN-------ESTEFD----VTTLLRMD-SERRMKTLGDAVKN--TLLTPNEARK-RENLP 384 (454) T ss_pred HHHHHHhh----c-----CCCC-------cEEEee----chhhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH-HhCCC Confidence 44444333 2 2221 234554 23332222 24667766665433 4778888874 47775 Q ss_pred HHHHHHHHHH--------HHHhhhcCCCCCC-CcccccCCCCCCCCCCCCccccccccCCccccchhcC Q lcl|NC_021072. 474 DQEIKEIDKQ--------IDSEREAGLIVDP-MAEMDPAMDPGNAPPADDMSAQEGPAVDAGDAKRGEF 533 (533) Q Consensus 474 DeeI~e~~kq--------i~~E~~~~~~~~p-~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 533 (533) +-+ --++- +..-.+...-++| .....++..+ ..++..|++. .++....|.-.+.| T Consensus 385 pi~--ggD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~d~~~--~~~e~~~d~~~~~~ 448 (454) T protein:vir:93 385 PLA--GGDALYLQQQNYSLEALSRRDAREDPFASSGKTASVP-QAVAASDGNK--AITETEHDAVKAMF 448 (454) T ss_pred CCC--CCCeeeeccCccchHhhhccCcccCCCCCCccCCCCC-CCCCCCCCCC--CccCCccchhhhhh Confidence 421 00000 0000000000000 0000111100 0011111111 11111122222222 No 57 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=98.28 E-value=1.7e-06 Score=52.23 Aligned_cols=403 Identities=11% Similarity=0.096 Sum_probs=189.4 Q ss_pred ccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHh Q lcl|NC_021072. 39 YYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLL 118 (533) Q Consensus 39 ~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL 118 (533) |-.. ..+ +-.+.=+.++-..-+.-+|+-.++-+.+ ++ +.+.+.+..+. ...|...= T Consensus 1 ~l~~------~~~---~~~~~~~~~~v~n~~~~ivd~~~~~l~~---~g----f~~~d~~~~~~--------~~~i~~~N 56 (434) T protein:vir:98 1 MLPK------NAE---QAFLDFQRKARTNFCGLIANASVHRLLA---LG----VTGPDGEPDTR--------ASRWWQAN 56 (434) T ss_pred CCCC------Ccc---HHHHHhhhhhhccchHHHHHHHHhhhcc---Cc----eecCCCchHHH--------HHHHHHhc Confidence 1111 111 0000111111122334455544443321 11 22223222222 33355555 Q ss_pred cchhhhhHHHHhhhhcCceeeeeeecCCCC---CCCeEEEEEcChhhceehhhccCCC-cCceeEEeccceeeccchhce Q lcl|NC_021072. 119 DFENRSYEIFRRWYVDGRLFYHKVIDPKNP---RGGLTELRYIDPRKIRKVTEYQQKR-PEQLRGEDINTQLTQKAAEYY 194 (533) Q Consensus 119 ~f~~~~~~~fR~WYvDGri~~hkvid~~~~---~~gI~elr~lDP~~i~~vr~~~~~~-~~~~~~~~~~~~~~~~~~e~~ 194 (533) +|+....+.++.-++.||-|+..-.++... .++-..++.+||+.+-.+..-.... .-..+++.....-. .....| T Consensus 57 ~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~~~-~~~~~~ 135 (434) T protein:vir:98 57 RLDSRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDIDGF-GYARVF 135 (434) T ss_pred ChhHHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEeccCCc-eEEEEE Confidence 788999999999999999999866653221 1223347778888775433211110 00011111000000 000111 Q ss_pred eccccc---------ccc----------------ccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHH-HHH Q lcl|NC_021072. 195 LYNPKG---------LKN----------------STNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQL-RMI 248 (533) Q Consensus 195 ~y~p~~---------~~~----------------~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqL-rm~ 248 (533) +|+... ... .+|..-++|. +.|.+.--.+ ..+.|=++..+-...-+ +++ T Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPv--v~f~N~~~~~---~~g~sd~e~vi~liDa~~~~~ 210 (434) T protein:vir:98 136 FDDTSFPYRTRERTGARLPWGPDSWVYTGTADSGDVHDLGGMQL--VEFARMPDLG---EDPEPEFAGVLDIQDRVNLGI 210 (434) T ss_pred EeCcEEEEEEeeccccccccccccceecccccccccCCCCccce--EEeccCCCcC---cCCcchhhhHHHHHHHHHHHH Confidence 111100 000 0000011111 1122211110 22345555444433332 345 Q ss_pred HHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccce Q lcl|NC_021072. 249 EDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEI 328 (533) Q Consensus 249 EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEI 328 (533) -+.+++-+.+-.|.|-+. |--+.. .-|...+.+..++.+..-....|+.- +-++++ T Consensus 211 s~~~~~~~~~a~p~~~i~----G~~~~~-----------------~~~~~~~~~~~~~~~~~~~~~i~~~~---~~~~~~ 266 (434) T protein:vir:98 211 LNRMAASRFSGFRQKWIK----GHKFAK-----------------RTDPATGMTVVDQPFVPSPSAVWASE---GENTQF 266 (434) T ss_pred HHHHHHHHHhcchhhhhc----CCCccc-----------------ccccccccchhhhhhhccccccccCC---CCCceE Confidence 577777777777766553 211110 00222233322222322223346543 234667 Q ss_pred eecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021072. 329 STLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLIL 407 (533) Q Consensus 329 sTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLil 407 (533) ..+++.. ++.. +-++-.-..+....++|.+-|+.+.+ | -.+..|.-.+.....-+.+.|+.|..-+.++++.-+.+ T Consensus 267 ~q~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~~~~~~~-n-~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~ 343 (434) T protein:vir:98 267 GQLDATD-LSGFLKEHASDVRDMLTISQTPTYLYATDLV-N-ISADTIGALDILHVAKVREHIASFSEGLESVLALAAAQ 343 (434) T ss_pred EEecCcc-hHHHHHHHHHHHHHHhcccCCCHHHhccccC-C-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 7888654 3333 33566677888889999888864211 1 12344555566677778889999999999999888888 Q ss_pred ccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHh Q lcl|NC_021072. 408 KGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSE 487 (533) Q Consensus 408 kgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E 487 (533) .|+ +. +|. .+.+.|..-..=+ +.+..+++.++..- | +|.++++ +.|+++++||+++.++.+++ T Consensus 344 ~g~-~~-~~~----~~~v~w~~~~~~s-------~~~~ada~~kl~~~-g--~~~e~~~-~~lg~~~~e~~r~~~e~~~~ 406 (434) T protein:vir:98 344 AGV-PE-DYT----EAEVRWANPAHVT-------MAVKADAATKLKSI-G--YPLDVIA-EELDESPARVRRIVAGAASQ 406 (434) T ss_pred cCC-Ch-hhe----eeeEEecCCCCCC-------HHHHHHHHHHHHhc-C--CcHHHHH-HhCCCCHHHHHHHHHHHHHH Confidence 886 33 332 3677786533222 23456666666542 2 5888776 55899999999877766665 Q ss_pred hhcCCCCCCCcccccCCCCCCCCCCCCc Q lcl|NC_021072. 488 REAGLIVDPMAEMDPAMDPGNAPPADDM 515 (533) Q Consensus 488 ~~~~~~~~p~~~~~~~~~~~~~~~~~d~ 515 (533) ........|.+.+.++-...+++...|| T Consensus 407 ~~~~~~~~~~~~~~~~g~~~~~~~~~dg 434 (434) T protein:vir:98 407 ALLAASLLPAPGAPSAGNVPDSGGAVDG 434 (434) T ss_pred HHHHHhhhccCCCCCCCCCCcccCCCCC Confidence 4433333332222111111122222233 No 58 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=98.24 E-value=2.1e-06 Score=51.67 Aligned_cols=403 Identities=15% Similarity=0.171 Sum_probs=174.0 Q ss_pred CCccccceeeeccccccccCCCCC-CCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcc Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQ-KDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNE 79 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~-~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvne 79 (533) |+.+ .+.....+|..- -...+|. ... ..|+++. -+-.+|-..|+ .++.+..+|+-++.+ T Consensus 5 m~~~--------~~~~~~~D~~~~~~~~~~g~-~~~-~~~~~~~-------~~~~~l~~~Y~---~~~l~~~~Vd~~aed 64 (435) T protein:vir:79 5 MSDK--------VKAITKEDGYNEIFGSKDGT-FRP-NAFYMQR-------AAFKALSQFYE---EDGMARRIVDVIPEE 64 (435) T ss_pred cccc--------cccchhhcchhhhhcccccc-ccc-CcccCCc-------CCHHHHHHHHh---cCchhhhhhccchHH Confidence 4433 111112222211 0111111 000 1111111 12335555554 579999999999999 Q ss_pred eeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeee-cC------CCCCCCe Q lcl|NC_021072. 80 TICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVI-DP------KNPRGGL 152 (533) Q Consensus 80 aiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvi-d~------~~~~~gI 152 (533) |+.+- +.|. ..+. ++++. ..++.|++..+..+.+|.=-+.|.-++-..+ |. -++++.| T Consensus 65 ~~r~g-----~~i~--g~~~----~~~~~----~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i 129 (435) T protein:vir:79 65 MVTPG-----FKVD--GVKN----EKSFK----SRWDELRLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQL 129 (435) T ss_pred hhcCC-----ceec--CCCh----HHHHH----HHHHHhhHHHHHHHHHHhhhccccEEEEEEecCCCCcccccccCCce Confidence 99632 3333 1111 23343 4444456777777766644444543332223 32 2567788 Q ss_pred EEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchh-hcccccc----ccCC Q lcl|NC_021072. 153 TELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSV-TYCHSGI----QDLN 227 (533) Q Consensus 153 ~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai-~y~hsGl----~d~~ 227 (533) +.++.+||..+.+- .+. .+ ...+-|+.-.. |..++.+. ....+|+.+-+ +++..-+ .... T Consensus 130 ~~i~v~d~~~i~~~-~~~-~d--------p~sp~fg~P~~-y~v~~~~~----~~~~~iH~SRli~~~g~~~p~~~~~~~ 194 (435) T protein:vir:79 130 EDIRVYDRYQITIH-ERE-TN--------ARSVRYGEPKL-YKISPGGD----IPEFFVHYSRICIIDGERVSNEKRRQN 194 (435) T ss_pred eeEEeechhhccch-hhc-cC--------CcccccCcceE-EEEecCCC----CCceEEcceeEEEecCCcchhhhcccc Confidence 99999999888541 111 11 12222333222 22232221 12334444432 2221111 1112 Q ss_pred CCccchhH-HHHHHHHHHHHHHHH--HHHHHHHhcCccceEEEcc-CCCC-----chHHHHHHHHHHHHhccc---EEEe Q lcl|NC_021072. 228 KNMTLSHL-HKAIKAVNQLRMIED--SLVIYRLSRAPERRIFYID-VGNL-----PKNKAEQYLREVMGRYRN---KLVY 295 (533) Q Consensus 228 ~~~i~syL-~~AiK~~NqLrm~ED--alVIyRi~RAPeRrvfyID-vGnl-----pk~KAeqYl~~im~~~rn---k~vY 295 (533) +....|-| +++...+.+...... +.++++-. . +|+.++ +.++ ....+..-+. .++++|+ -++- T Consensus 195 ~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~---~-~v~~~~~l~~~~~~~~~~~~~~~r~~-~~~~~~~~~~~~~i 269 (435) T protein:vir:79 195 DGWGASILNKRLIEAIVDYNYCQELATQLLRRKQ---Q-AVWKARDLALMCDDEEGRYAARLRLA-QVDDESGVGKAIGI 269 (435) T ss_pred CcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhc---C-ccccchhHHHhhcCccchHHHHHHHH-HHHHhcCCCCceeE Confidence 23334544 444443333322222 33444432 2 234442 2221 1111111111 1233322 1221 Q ss_pred eCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCC--Ccccccch Q lcl|NC_021072. 296 DANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELED-VKYFQKKLYKALNVPSSRLETE--TTFNIGRA 372 (533) Q Consensus 296 d~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~--~~~~~g~~ 372 (533) |+.+ -+++++. .+|+-++| +.+|...+-.+.+||+.||-.+ +|+| +-+ T Consensus 270 ~~~~--------------------------e~~e~~~--~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~gln-stg 320 (435) T protein:vir:79 270 DATD--------------------------EEYEVLN--SDVSGVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVS-ASQ 320 (435) T ss_pred ecCC--------------------------cceEEEe--cccCCHHHHHHHHHHHHHhhhCCCeeeeccCCccccc-cch Confidence 1111 1233332 23555555 4789999999999999998443 5665 222 Q ss_pred hhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021072. 373 AEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTM 452 (533) Q Consensus 373 ~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~ 452 (533) .+-.+. |+.+|+++|.. .+..+|++ |+.-.+.+ +.+.|.|..=..=+|...+|+...+.++++.+ T Consensus 321 d~d~~~---yyd~i~~~Qe~---~l~p~l~~-l~~li~~s--------~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~ 385 (435) T protein:vir:79 321 NTALET---FYKLIDRKRVE---DYKPILEF-LLPFMISE--------TEWSIEFEPLSVPSDKDKAEIMAKNVESVVKL 385 (435) T ss_pred hHHHHH---HHHHHHHHHHH---HHHHHHHH-HHHHhhcC--------CCCeEEeCCCCCCCHHHHHHHHHHHHHHHHHH Confidence 233344 99999999853 33333332 11111222 35678887766777888899999888888776 Q ss_pred hhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccccccCCccccc Q lcl|NC_021072. 453 DPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDAK 529 (533) Q Consensus 453 ~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~ 529 (533) -.- -.+|.+-+++. ...+-.+.. +..+...+ .++ +.+++...+ -..++.+ T Consensus 386 ~~~--g~i~~~e~r~~------------L~~~~~~~~--~~~~~~~~----~~~---~~d~~~~~~----~e~g~~~ 435 (435) T protein:vir:79 386 KAE--QAINLKETRDT------------LRSICPDLK--IMDNDNIE----LPE---PEDLDPEPG----QEGGLNK 435 (435) T ss_pred Hhc--CCCCHHHHHHH------------HHHhccccC--CCCccccc----CCc---cccCCCCCC----CCCCCCC Confidence 332 23444444433 221111111 11100000 000 000000000 0001111 No 59 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=98.24 E-value=2.1e-06 Score=51.66 Aligned_cols=397 Identities=14% Similarity=0.158 Sum_probs=174.4 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |+-+ ....+.-|-...+....+.+.. ..+.+.......++. .+...++|.|.+||+-|++.+ T Consensus 1 ~~~~---------~~~~~~k~~~~~~~~~~~~~~~-~~~~~~~~~~~~~v~--------~~~a~~~~~v~~~i~~Ia~~i 62 (409) T protein:vir:94 1 MAKE---------NIVTRIKKKLIDNWIDQSASKL-YDFSPWKNKSFWGVI--------NNTLETNETIFSAITKLSNSM 62 (409) T ss_pred Cccc---------ccchhhhhHHhhhhhcCCcccc-cccccccCccccccc--------hhhhhccHHHHHHHHHHHHhh Confidence 4321 1112211211111111111110 011111111111111 112346789999999999886 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcc----hhhhhH----HHHhhhhcCceeeeeeecCCCCCCCe Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDF----ENRSYE----IFRRWYVDGRLFYHKVIDPKNPRGGL 152 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f----~~~~~~----~fR~WYvDGri~~hkvid~~~~~~gI 152 (533) -- .|+.+- ...+..+ ..++.+|+- ...+.+ ++..+++.|.-|..++-|. .+-+ T Consensus 63 a~-----lp~~~~-~~~~~~~----------~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~---~G~~ 123 (409) T protein:vir:94 63 AS-----LPLKMY-EDYKVVN----------TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI---YHQP 123 (409) T ss_pred hh-----CceeEe-ecccccc----------hhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECC---CCcE Confidence 53 344332 1211111 123334432 234444 4455788899998876654 4448 Q ss_pred EEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccc--cccCCcceeccchhhccccccccCCCCc Q lcl|NC_021072. 153 TELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLK--NSTNQGMKIATDSVTYCHSGIQDLNKNM 230 (533) Q Consensus 153 ~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~--~~~~~~~kI~~dai~y~hsGl~d~~~~~ 230 (533) ++|.+|+|..+..+..- +.... . |.++.+.+.. +.....++|+.. -..++-. T Consensus 124 ~~L~~l~~~~v~v~~~~---~~~~~--~------------y~~~~~~g~~~~~~~~dvih~r~~---------~~~~~~~ 177 (409) T protein:vir:94 124 SKLFLLNPDVVEMLIEN---QSREL--Y------------YSIHAATGNKLIVHNMDMLHFKHI---------VASNMVQ 177 (409) T ss_pred EEEEEEcCceeEEEEeC---CCcEE--E------------EEEEcCCceEEEEccccEEEecCC---------CCCCccc Confidence 99999999999654321 11111 0 1111111111 111222333211 0112223 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccch Q lcl|NC_021072. 231 TLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMS 310 (533) Q Consensus 231 i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~ms 310 (533) ++|-|..|.+.......++.. -+....+.| .++..-.+.+.+.+++...+.+.+.|. ++|.+ + T Consensus 178 G~s~l~~~~~~i~~~~~~~~~-~~~~~~~~~--~~i~~~~~~l~~e~~~~~~~~~~~~~~-------~~g~~------~- 240 (409) T protein:vir:94 178 GISPIDVLKNTTDFDNAVRTF-NLTEMQKPD--SFMLKYGSNVGKEKRQQVLEDFKQYYE-------ENGGI------L- 240 (409) T ss_pred cccHHHHHHHHHHHHHHHHHH-HHHhcCCCC--eeEEecCCCCCHHHHHHHHHHHHHHhh-------cCCCe------e- Confidence 456677777766665555554 345555544 233334556666666555555544442 23322 1 Q ss_pred hHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHH-HH Q lcl|NC_021072. 311 MLEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFI-AR 388 (533) Q Consensus 311 mlEDywLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi-~r 388 (533) .++ .|.+++.|.-. +.+.-++-..|-.+.+.++++||...|+..+.-+.....+..+. |.+++ .- T Consensus 241 vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~---f~~~~l~P 307 (409) T protein:vir:94 241 FQE----------PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRF---YLQHTLLP 307 (409) T ss_pred ecC----------CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHHHHH Confidence 121 25777777532 22233444456678899999999999986655555555555555 66553 33 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHH Q lcl|NC_021072. 389 LRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQ 468 (533) Q Consensus 389 Lr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~ 468 (533) +-.++ .+.|-.. ++++.+|.. ...|.|. ..++...+ +..|++.+..+-.- -+++..-++. T Consensus 308 ~~~~i----e~~ln~~-----Ll~~~~~~~---~~~i~fd----~~~ll~~d-~~~~~~~~~~~~~~--G~~T~NE~R~- 367 (409) T protein:vir:94 308 IVKQY----EEEFNRK-----LLTKTDREK---NRYFKFN----VKSYLRAD-SATQAEVYFKAVRS--GYYTINDIRE- 367 (409) T ss_pred HHHHH----HHHHHHh-----hCCcccccC---cceEEee----chhhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH- Confidence 32322 2223333 334444432 2345565 33443333 35666666655332 4677777764 Q ss_pred HhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCc Q lcl|NC_021072. 469 VLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDM 515 (533) Q Consensus 469 IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~ 515 (533) .+++.+-+ --++-+- .....+-......+....||+...+++ T Consensus 368 ~~g~~p~~--ggD~~~~---~~n~~~~~~~~~~~~~~kGG~~n~~e~ 409 (409) T protein:vir:94 368 WEDLPPVE--GGDKPLI---SGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred HhCCCCCC--CcCeEee---cccccccccchhhcccccCCCCCcCCC Confidence 46654321 0000000 000000000000000111111111111 No 60 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=98.23 E-value=2.3e-06 Score=51.51 Aligned_cols=416 Identities=11% Similarity=0.120 Sum_probs=185.9 Q ss_pred CC-----ccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHH Q lcl|NC_021072. 1 MS-----NQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDD 75 (533) Q Consensus 1 ~~-----~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~Avde 75 (533) |+ ...|||.-.. .....+.+. ++... ..+.|... .++. ++ -+...++|.|.++|+- T Consensus 1 M~~~~r~~~~~~~~~r~---~~~~~~~~~----~~~~~---~~~~g~~~---~~~~-----v~-~~~al~~~~v~~~i~~ 61 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQ---TSQVIELNK----DDEKL---LEWLGISP---STIS-----VK-GKNALKVATVFACIKI 61 (432) T ss_pred CChHHHHHHhcCccccC---cccccccCC----chHHH---HHHhCCCc---Cccc-----cc-hhhhhccHHHHHHHHH Confidence 33 2356654221 111111111 11100 01111111 1111 11 1234668999999999 Q ss_pred hhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcc----hhhhhHHHHh----hhhcCceeeeeeecCCC Q lcl|NC_021072. 76 IVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDF----ENRSYEIFRR----WYVDGRLFYHKVIDPKN 147 (533) Q Consensus 76 Ivneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f----~~~~~~~fR~----WYvDGri~~hkvid~~~ 147 (533) |.+.+-. .|+.|.-..-+..+. .. -..++++|+. ...+.++++. +.+.|.-|+.++-|. T Consensus 62 ia~~ia~-----lp~~~~~~~~~~~~~---~~---~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-- 128 (432) T protein:vir:10 62 LSESVSK-----LPLKIYQEDEYGIQR---GT---KHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDR-- 128 (432) T ss_pred HHHhhcc-----CceEEEEecCCceee---cc---ccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECC-- Confidence 9887543 455543211111111 11 1234444432 2345554444 566799999988764 Q ss_pred CCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCC Q lcl|NC_021072. 148 PRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLN 227 (533) Q Consensus 148 ~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~ 227 (533) .+-+++|.+|+|.+++.++.-...... ...-+|.+...+.. ..++.+-|.+...+. ..+ T Consensus 129 -~G~~~~L~~i~~~~v~v~~d~~~~~~~-------------~~~~~y~~~~~g~~------~~~~~~eiih~r~~~-~~~ 187 (432) T protein:vir:10 129 -KGKVQALWPIDASKVTVYIDDVGLLNS-------------KTKMWYVVNTGGQQ------RVLKPEEILHFKNGI-TLD 187 (432) T ss_pred -CCcEEEEEEEcCceeEEEEcCcccccc-------------cceEEEEEecCCeE------EEEccccEEEecCCC-CCC Confidence 445999999999999754432111111 11122333333221 223333222222111 122 Q ss_pred CCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccc Q lcl|NC_021072. 228 KNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKK 307 (533) Q Consensus 228 ~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~ 307 (533) +-.++|.|..|++++.....++....=+----+.-+-|..++ +.|.+..+++..+.+...|..- ...|. T Consensus 188 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g~----~n~~~------ 256 (432) T protein:vir:10 188 GLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSGL----QNSHR------ 256 (432) T ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhccc----ccCCc------ Confidence 334679999999999999888887666654445556777776 4677666666555544444320 01121 Q ss_pred cchhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH- Q lcl|NC_021072. 308 FMSMLEDFWLPRREGGRGTEISTLPG-GQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF- 385 (533) Q Consensus 308 ~msmlEDywLpRReggrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kf- 385 (533) .+ .++ .|.++..|.- ...+.-++-.++..+.+.++++||...|+..+.-+.....+..+. |.++ T Consensus 257 ~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~---~~~~~ 322 (432) T protein:vir:10 257 IA-LMP----------VGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQ---FYTDT 322 (432) T ss_pred ce-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHH Confidence 11 221 2455555532 112222344567789999999999999965433333334443333 5432 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHH Q lcl|NC_021072. 386 IARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYM 465 (533) Q Consensus 386 i~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i 465 (533) |.-+-.++. +.|-.. ++++.+|. ..+.+.|. ++++.... +..|++++..+-.- -++|.+-+ T Consensus 323 l~P~~~~ie----~~ln~k-----Ll~~~~~~---~g~~~~fd----~~~l~~~d-~~~~~~~~~~~~~~--G~~t~NE~ 383 (432) T protein:vir:10 323 LQATLTMYE----QEMTYK-----LFLDSELD---KGFYSKFN----VDAILRAD-IKTRYEAYRTGIQG--GFLKPNEA 383 (432) T ss_pred HHHHHHHHH----HHHHHh-----hcChhhcC---CCcEEEee----chhhhcCC-HHHHHHHHHHHHhC--CCcCHHHH Confidence 222322222 222222 33444444 23345555 33333222 23566666555433 46777777 Q ss_pred HHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCC-cccccCCCCCCCCCCCCccccccccCCccc Q lcl|NC_021072. 466 RRQVLKQTDQEIKEIDKQIDSEREAGLIVDPM-AEMDPAMDPGNAPPADDMSAQEGPAVDAGD 527 (533) Q Consensus 466 ~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~-~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~ 527 (533) +. ++++.+- +--++-+-. ....+--. .+.....+..+.....+++.. + T Consensus 384 R~-~~g~~pi--~ggD~~~~~---~n~~~~~~~~~~~~k~~~~~~~~~~~~~~~--------~ 432 (432) T protein:vir:10 384 RS-KEDLPPE--AGGDRLLVN---GNMLPIDMAGQAYLKGGDTNGEVSKEGNEG--------N 432 (432) T ss_pred HH-HhCCCCC--CCCCeEeec---ccccchhhccccccCCCCCCCCCCCCCCCC--------C Confidence 64 4666431 100000000 00000000 000000000011111111111 1 No 61 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=98.23 E-value=2.3e-06 Score=51.51 Aligned_cols=416 Identities=11% Similarity=0.120 Sum_probs=185.9 Q ss_pred CC-----ccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHH Q lcl|NC_021072. 1 MS-----NQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDD 75 (533) Q Consensus 1 ~~-----~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~Avde 75 (533) |+ ...|||.-.. .....+.+. ++... ..+.|... .++. ++ -+...++|.|.++|+- T Consensus 1 M~~~~r~~~~~~~~~r~---~~~~~~~~~----~~~~~---~~~~g~~~---~~~~-----v~-~~~al~~~~v~~~i~~ 61 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQ---TSQVIELNK----DDEKL---LEWLGISP---STIS-----VK-GKNALKVATVFACIKI 61 (432) T ss_pred CChHHHHHHhcCccccC---cccccccCC----chHHH---HHHhCCCc---Cccc-----cc-hhhhhccHHHHHHHHH Confidence 33 2356654221 111111111 11100 01111111 1111 11 1234668999999999 Q ss_pred hhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcc----hhhhhHHHHh----hhhcCceeeeeeecCCC Q lcl|NC_021072. 76 IVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDF----ENRSYEIFRR----WYVDGRLFYHKVIDPKN 147 (533) Q Consensus 76 Ivneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f----~~~~~~~fR~----WYvDGri~~hkvid~~~ 147 (533) |.+.+-. .|+.|.-..-+..+. .. -..++++|+. ...+.++++. +.+.|.-|+.++-|. T Consensus 62 ia~~ia~-----lp~~~~~~~~~~~~~---~~---~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-- 128 (432) T protein:vir:10 62 LSESVSK-----LPLKIYQEDEYGIQR---GT---KHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDR-- 128 (432) T ss_pred HHHhhcc-----CceEEEEecCCceee---cc---ccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECC-- Confidence 9887543 455543211111111 11 1234444432 2345554444 566799999988764 Q ss_pred CCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCC Q lcl|NC_021072. 148 PRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLN 227 (533) Q Consensus 148 ~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~ 227 (533) .+-+++|.+|+|.+++.++.-...... ...-+|.+...+.. ..++.+-|.+...+. ..+ T Consensus 129 -~G~~~~L~~i~~~~v~v~~d~~~~~~~-------------~~~~~y~~~~~g~~------~~~~~~eiih~r~~~-~~~ 187 (432) T protein:vir:10 129 -KGKVQALWPIDASKVTVYIDDVGLLNS-------------KTKMWYVVNTGGQQ------RVLKPEEILHFKNGI-TLD 187 (432) T ss_pred -CCcEEEEEEEcCceeEEEEcCcccccc-------------cceEEEEEecCCeE------EEEccccEEEecCCC-CCC Confidence 445999999999999754432111111 11122333333221 223333222222111 122 Q ss_pred CCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccc Q lcl|NC_021072. 228 KNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKK 307 (533) Q Consensus 228 ~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~ 307 (533) +-.++|.|..|++++.....++....=+----+.-+-|..++ +.|.+..+++..+.+...|..- ...|. T Consensus 188 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g~----~n~~~------ 256 (432) T protein:vir:10 188 GLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSGL----QNSHR------ 256 (432) T ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhccc----ccCCc------ Confidence 334679999999999999888887666654445556777776 4677666666555544444320 01121 Q ss_pred cchhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH- Q lcl|NC_021072. 308 FMSMLEDFWLPRREGGRGTEISTLPG-GQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF- 385 (533) Q Consensus 308 ~msmlEDywLpRReggrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kf- 385 (533) .+ .++ .|.++..|.- ...+.-++-.++..+.+.++++||...|+..+.-+.....+..+. |.++ T Consensus 257 ~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~---~~~~~ 322 (432) T protein:vir:10 257 IA-LMP----------VGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQ---FYTDT 322 (432) T ss_pred ce-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHH Confidence 11 221 2455555532 112222344567789999999999999965433333334443333 5432 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHH Q lcl|NC_021072. 386 IARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYM 465 (533) Q Consensus 386 i~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i 465 (533) |.-+-.++. +.|-.. ++++.+|. ..+.+.|. ++++.... +..|++++..+-.- -++|.+-+ T Consensus 323 l~P~~~~ie----~~ln~k-----Ll~~~~~~---~g~~~~fd----~~~l~~~d-~~~~~~~~~~~~~~--G~~t~NE~ 383 (432) T protein:vir:10 323 LQATLTMYE----QEMTYK-----LFLDSELD---KGFYSKFN----VDAILRAD-IKTRYEAYRTGIQG--GFLKPNEA 383 (432) T ss_pred HHHHHHHHH----HHHHHh-----hcChhhcC---CCcEEEee----chhhhcCC-HHHHHHHHHHHHhC--CCcCHHHH Confidence 222322222 222222 33444444 23345555 33333222 23566666555433 46777777 Q ss_pred HHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCC-cccccCCCCCCCCCCCCccccccccCCccc Q lcl|NC_021072. 466 RRQVLKQTDQEIKEIDKQIDSEREAGLIVDPM-AEMDPAMDPGNAPPADDMSAQEGPAVDAGD 527 (533) Q Consensus 466 ~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~-~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~ 527 (533) +. ++++.+- +--++-+-. ....+--. .+.....+..+.....+++.. + T Consensus 384 R~-~~g~~pi--~ggD~~~~~---~n~~~~~~~~~~~~k~~~~~~~~~~~~~~~--------~ 432 (432) T protein:vir:10 384 RS-KEDLPPE--AGGDRLLVN---GNMLPIDMAGQAYLKGGDTNGEVSKEGNEG--------N 432 (432) T ss_pred HH-HhCCCCC--CCCCeEeec---ccccchhhccccccCCCCCCCCCCCCCCCC--------C Confidence 64 4666431 100000000 00000000 000000000011111111111 1 No 62 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=98.23 E-value=2.3e-06 Score=51.51 Aligned_cols=416 Identities=11% Similarity=0.120 Sum_probs=185.9 Q ss_pred CC-----ccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHH Q lcl|NC_021072. 1 MS-----NQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDD 75 (533) Q Consensus 1 ~~-----~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~Avde 75 (533) |+ ...|||.-.. .....+.+. ++... ..+.|... .++. ++ -+...++|.|.++|+- T Consensus 1 M~~~~r~~~~~~~~~r~---~~~~~~~~~----~~~~~---~~~~g~~~---~~~~-----v~-~~~al~~~~v~~~i~~ 61 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQ---TSQVIELNK----DDEKL---LEWLGISP---STIS-----VK-GKNALKVATVFACIKI 61 (432) T ss_pred CChHHHHHHhcCccccC---cccccccCC----chHHH---HHHhCCCc---Cccc-----cc-hhhhhccHHHHHHHHH Confidence 33 2356654221 111111111 11100 01111111 1111 11 1234668999999999 Q ss_pred hhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcc----hhhhhHHHHh----hhhcCceeeeeeecCCC Q lcl|NC_021072. 76 IVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDF----ENRSYEIFRR----WYVDGRLFYHKVIDPKN 147 (533) Q Consensus 76 Ivneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f----~~~~~~~fR~----WYvDGri~~hkvid~~~ 147 (533) |.+.+-. .|+.|.-..-+..+. .. -..++++|+. ...+.++++. +.+.|.-|+.++-|. T Consensus 62 ia~~ia~-----lp~~~~~~~~~~~~~---~~---~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-- 128 (432) T protein:vir:10 62 LSESVSK-----LPLKIYQEDEYGIQR---GT---KHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDR-- 128 (432) T ss_pred HHHhhcc-----CceEEEEecCCceee---cc---ccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECC-- Confidence 9887543 455543211111111 11 1234444432 2345554444 566799999988764 Q ss_pred CCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCC Q lcl|NC_021072. 148 PRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLN 227 (533) Q Consensus 148 ~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~ 227 (533) .+-+++|.+|+|.+++.++.-...... ...-+|.+...+.. ..++.+-|.+...+. ..+ T Consensus 129 -~G~~~~L~~i~~~~v~v~~d~~~~~~~-------------~~~~~y~~~~~g~~------~~~~~~eiih~r~~~-~~~ 187 (432) T protein:vir:10 129 -KGKVQALWPIDASKVTVYIDDVGLLNS-------------KTKMWYVVNTGGQQ------RVLKPEEILHFKNGI-TLD 187 (432) T ss_pred -CCcEEEEEEEcCceeEEEEcCcccccc-------------cceEEEEEecCCeE------EEEccccEEEecCCC-CCC Confidence 445999999999999754432111111 11122333333221 223333222222111 122 Q ss_pred CCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccc Q lcl|NC_021072. 228 KNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKK 307 (533) Q Consensus 228 ~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~ 307 (533) +-.++|.|..|++++.....++....=+----+.-+-|..++ +.|.+..+++..+.+...|..- ...|. T Consensus 188 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g~----~n~~~------ 256 (432) T protein:vir:10 188 GLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSGL----QNSHR------ 256 (432) T ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhccc----ccCCc------ Confidence 334679999999999999888887666654445556777776 4677666666555544444320 01121 Q ss_pred cchhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH- Q lcl|NC_021072. 308 FMSMLEDFWLPRREGGRGTEISTLPG-GQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF- 385 (533) Q Consensus 308 ~msmlEDywLpRReggrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kf- 385 (533) .+ .++ .|.++..|.- ...+.-++-.++..+.+.++++||...|+..+.-+.....+..+. |.++ T Consensus 257 ~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~---~~~~~ 322 (432) T protein:vir:10 257 IA-LMP----------VGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQ---FYTDT 322 (432) T ss_pred ce-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHH Confidence 11 221 2455555532 112222344567789999999999999965433333334443333 5432 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHH Q lcl|NC_021072. 386 IARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYM 465 (533) Q Consensus 386 i~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i 465 (533) |.-+-.++. +.|-.. ++++.+|. ..+.+.|. ++++.... +..|++++..+-.- -++|.+-+ T Consensus 323 l~P~~~~ie----~~ln~k-----Ll~~~~~~---~g~~~~fd----~~~l~~~d-~~~~~~~~~~~~~~--G~~t~NE~ 383 (432) T protein:vir:10 323 LQATLTMYE----QEMTYK-----LFLDSELD---KGFYSKFN----VDAILRAD-IKTRYEAYRTGIQG--GFLKPNEA 383 (432) T ss_pred HHHHHHHHH----HHHHHh-----hcChhhcC---CCcEEEee----chhhhcCC-HHHHHHHHHHHHhC--CCcCHHHH Confidence 222322222 222222 33444444 23345555 33333222 23566666555433 46777777 Q ss_pred HHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCC-cccccCCCCCCCCCCCCccccccccCCccc Q lcl|NC_021072. 466 RRQVLKQTDQEIKEIDKQIDSEREAGLIVDPM-AEMDPAMDPGNAPPADDMSAQEGPAVDAGD 527 (533) Q Consensus 466 ~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~-~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~ 527 (533) +. ++++.+- +--++-+-. ....+--. .+.....+..+.....+++.. + T Consensus 384 R~-~~g~~pi--~ggD~~~~~---~n~~~~~~~~~~~~k~~~~~~~~~~~~~~~--------~ 432 (432) T protein:vir:10 384 RS-KEDLPPE--AGGDRLLVN---GNMLPIDMAGQAYLKGGDTNGEVSKEGNEG--------N 432 (432) T ss_pred HH-HhCCCCC--CCCCeEeec---ccccchhhccccccCCCCCCCCCCCCCCCC--------C Confidence 64 4666431 100000000 00000000 000000000011111111111 1 No 63 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=98.20 E-value=2.7e-06 Score=51.16 Aligned_cols=431 Identities=15% Similarity=0.109 Sum_probs=187.2 Q ss_pred CC--ccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhc Q lcl|NC_021072. 1 MS--NQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVN 78 (533) Q Consensus 1 ~~--~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvn 78 (533) |. ..|||+.=.+ ......+-.-.+.+.......+....|. -|.. +..+++|.|.+||+-|.+ T Consensus 1 Mg~~~~l~~r~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~g~-------------~V~~-~~al~~~~V~~~v~~Ia~ 64 (457) T protein:vir:13 1 MGFWSALFGRGHSP--ALDGIEARAWEPYDPSIYNLGAVAASGE-------------TVTP-HDALQVSAVFASVRLLSE 64 (457) T ss_pred Cchhhhhhcccccc--cccccccccccccchHHHhhcccccCCc-------------eech-HHhhccHHHHHHHHHHHH Confidence 43 3566642222 1111111111111111111111111111 1111 244668999999999998 Q ss_pred ceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcch---hhhhHHHHhhh----hcCceeeeeeecCCCCCCC Q lcl|NC_021072. 79 ETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFE---NRSYEIFRRWY----VDGRLFYHKVIDPKNPRGG 151 (533) Q Consensus 79 eaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~---~~~~~~fR~WY----vDGri~~hkvid~~~~~~g 151 (533) .+-. .|+.+--..-+..+.++. ..++.+|+-. -.+.++++.++ +.|.-|+.+ .+ ..+. T Consensus 65 ~iA~-----lp~~~~~~~~~~~~~~~~------~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i-~~---~~g~ 129 (457) T protein:vir:13 65 TIAT-----LPLSTYSKRGGSRKEIVT------PEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAV-RW---QGPN 129 (457) T ss_pred hhcc-----CceEEEEecCCccccccc------chHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEE-Ee---cCCc Confidence 8554 355543222222222211 2344555532 34566666544 568888764 33 2467 Q ss_pred eEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCC-Cc Q lcl|NC_021072. 152 LTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNK-NM 230 (533) Q Consensus 152 I~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~-~~ 230 (533) +++|.+|+|..+..++...+.....+ +.... +.....+... ..+.+...++|+.-. .++ -+ T Consensus 130 ~~~l~~l~p~~v~v~~~~~~~~~~~~--~~~y~-~~~~~~~~~~-----~~~~~~diih~~~~~----------~~~~~~ 191 (457) T protein:vir:13 130 IVGLDVLDPTKIHVHMVMVDGLRRKV--FEAYD-IDADGNEVLL-----GWFTPRDVLHIPGMM----------LPGDFV 191 (457) T ss_pred EEEEEEEccCceEEEEecCCCcccee--EEEEE-EecCCceeeE-----EeeCccceEEecCCC----------CCCccc Confidence 99999999999976544322211111 10000 0000000000 011223344444321 111 24 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccch Q lcl|NC_021072. 231 TLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMS 310 (533) Q Consensus 231 i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~ms 310 (533) ++|-+..|.+++.....+++...=+=---+--+-|..++ |.|-+...++..+.+...|+.. +. .|.+ + T Consensus 192 G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~ls~e~~~~~~~~~~~~~~g~---~n-ag~~------~- 259 (457) T protein:vir:13 192 GCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVP-GTMSEEGLARAREAWRAANSGV---DN-AHRV------A- 259 (457) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcC-CCCCHHHHHHHHHHHHHHhcCc---cc-cCcc------e- Confidence 568899999988888888876554444444555666665 5666655555444444444320 11 1211 1 Q ss_pred hHhhhcccccCCCCccceeecCCCCCcchH---HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHH- Q lcl|NC_021072. 311 MLEDFWLPRREGGRGTEISTLPGGQNLGEL---EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFI- 386 (533) Q Consensus 311 mlEDywLpRReggrgTEIsTLpGg~nLgei---~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi- 386 (533) .++ .|.+++.|. .+..++ +=-+|....+.++++||...|+.-.+-+... +.+...-+.|.+++ T Consensus 260 vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~-sn~eq~~~~f~~~tl 326 (457) T protein:vir:13 260 LLT----------EGAKFSKVA--MSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWG-SGLAEQNIAFTMFSL 326 (457) T ss_pred ecC----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCccccc-chHHHHHHHHHHHHH Confidence 111 244555552 222332 3334778889999999999996543322211 22333333476653 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHH Q lcl|NC_021072. 387 ARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMR 466 (533) Q Consensus 387 ~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~ 466 (533) .-+..+ +.+.|-..|+.+ .+. ....+.|. +..+... -+..|.+++..+-.- -++|.+-++ T Consensus 327 ~P~~~~----ie~~ln~~L~~~-----~~~----~~~~i~fd----~~~l~~~-D~~~r~~~~~~~~~~--G~~T~NE~R 386 (457) T protein:vir:13 327 RPWLER----IEAGFNRLLFAE-----TAD----RFRFVKFN----LDEIKRG-APKERMELWSLGLQN--GIYSIDEVR 386 (457) T ss_pred HHHHHH----HHHHHHHhhcCc-----ccc----CceeEEee----chhhhcc-CHHHHHHHHHHHHhC--CCcCHHHHH Confidence 333333 333344444333 222 22344554 2233322 234566666655432 478888887 Q ss_pred HHHhCCCHHH-----HHHHH---HHHHHhhhcCCCCCCCcccccC----CCCCCCCCCCCccccccccCCccccc Q lcl|NC_021072. 467 RQVLKQTDQE-----IKEID---KQIDSEREAGLIVDPMAEMDPA----MDPGNAPPADDMSAQEGPAVDAGDAK 529 (533) Q Consensus 467 k~IL~~tDee-----I~e~~---kqi~~E~~~~~~~~p~~~~~~~----~~~~~~~~~~d~~~~~~~~~~~~~~~ 529 (533) . .++|.+-+ .--+. ..+.+..+....+.|.+...+. .+....+..||... ..-+..|+- T Consensus 387 ~-~~gl~Pi~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~d~~~~---~~~~~~~~~ 457 (457) T protein:vir:13 387 A-AEDMTPLPDGLGEKYRVPLNLGEVGEEPEPEPAPAPPAIEPPAEEPDEEPEPEGKPDDEGA---TEEDDEDDA 457 (457) T ss_pred H-HhCCCCCCCCcccceeeccccccccccccccccCCCCCCCCCccccCCCCCCCCCCccccC---CCCcccccC Confidence 4 47876421 00000 0011111111112222222221 11111111111111 111111111 No 64 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=98.20 E-value=2.7e-06 Score=51.10 Aligned_cols=413 Identities=14% Similarity=0.180 Sum_probs=194.7 Q ss_pred CCccc------------cceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhh--c Q lcl|NC_021072. 1 MSNQL------------FGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVL--Q 66 (533) Q Consensus 1 ~~~~~------------fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~--~ 66 (533) |-..+ ++-+++.+- ++ +-+..++. ....|+.+|.|.. + T Consensus 3 ~~~~ik~~~~~~~~~~~~~~~~~~i~--d~-~~i~~~~~-------------------------~~~~i~~~~~~Y~g~~ 54 (505) T protein:vir:79 3 FWDTLKNLFRKGSAAVGMTKSLGQII--DD-PRINLPAD-------------------------EVERIARDKRYYMDDF 54 (505) T ss_pred hHHHHHHHHHHhhhhhcchhhhhhhh--cc-cCCCCCHH-------------------------HHHHHHHHHHHhcCCC Confidence 11111 111111100 00 01111111 1122222333211 1 Q ss_pred chh--------------------hhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhH Q lcl|NC_021072. 67 PEC--------------------DSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYE 126 (533) Q Consensus 67 pEv--------------------d~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~ 126 (533) |.+ ..++++.++=++ + .|+.+.+++ +.-++-++.++.--+|.....+ T Consensus 55 ~~l~~~~~~~~~~~~~~~slnl~~~i~~~~A~ll~----~-e~~~i~~~d--------~~~~e~l~~i~~~n~f~~~~~~ 121 (505) T protein:vir:79 55 KQVTHKNSYGDTQKHELQSVNVTKLASAKLASLIF----N-EQCQVTVSD--------ETANDFLDDVFQQNDFYTTFEE 121 (505) T ss_pred ccccccccCCCccccceeecchHHHHHHHHHhhhc----C-CCceeecCC--------hHHHHHHHHHHHhccHHHHHHH Confidence 111 122222222111 1 233444443 1223445666666679999999 Q ss_pred HHHhhhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEec-----cc--eeec----------- Q lcl|NC_021072. 127 IFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDI-----NT--QLTQ----------- 188 (533) Q Consensus 127 ~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~-----~~--~~~~----------- 188 (533) .+..+..-|..+|+..+|.. -..+.+++|.++=|+.--.+.... ..+... .+ .++. T Consensus 122 ~~e~a~a~G~~~~k~~~D~~-----~~~i~~v~ad~~~P~~~d~~~~~~-~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~ 195 (505) T protein:vir:79 122 KLEEWIALGSGCVRPYVDSG-----KIKLAWATADQVYPLQADTNQVNE-LAIASRTTEVENHRTIYYTLLEFHQWDHGD 195 (505) T ss_pred HHHHHhhcCCeEEEEEEeCC-----ceEEEEEcCCeeEEEEEcCCCeEE-EEEEEEEEEecCCcceEEEEEEEEEecCce Confidence 99999999999999999842 245778888887664211111000 001100 00 0010 Q ss_pred cchhceeccccccccccCCcceeccch----------hhccccc--c-----------ccCCCCccchhHHHHHHHHHHH Q lcl|NC_021072. 189 KAAEYYLYNPKGLKNSTNQGMKIATDS----------VTYCHSG--I-----------QDLNKNMTLSHLHKAIKAVNQL 245 (533) Q Consensus 189 ~~~e~~~y~p~~~~~~~~~~~kI~~da----------i~y~hsG--l-----------~d~~~~~i~syL~~AiK~~NqL 245 (533) +.-.+.+|.-.. ..+. +..++... ++|.+.. + .+..++.++|-++.|.-.+..| T Consensus 196 ~~I~n~ly~~~~-~~~l--G~~v~l~~~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~l 272 (505) T protein:vir:79 196 YVITNELYRSEA-AETV--GINVPLNSLEQYEGLEPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAI 272 (505) T ss_pred EEEEEEEEecCC-CCcc--CcccchhhcccccccCcceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHH Confidence 000111121000 0000 11111111 1111100 0 1233556789999998888777 Q ss_pred HHHHHHHHHHHHhcCccceEEE----cc---CCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhccc Q lcl|NC_021072. 246 RMIEDSLVIYRLSRAPERRIFY----ID---VGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLP 318 (533) Q Consensus 246 rm~EDalVIyRi~RAPeRrvfy----ID---vGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLp 318 (533) -..=+ -+.|..|.=.+|||. +. .|+.+.... ..-++| .+++.+.++.-| T Consensus 273 D~~~s--~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~~~~------------~~~~fd------~~~~~y~~~~~~---- 328 (505) T protein:vir:79 273 NRTHD--QFVDEVKKGQRRLIVPAEWLKTGSSYGGQASET------------HPPMFD------PDETVYQAMYGD---- 328 (505) T ss_pred HHHHH--HHHHHHHhcccceeechHHhcccCCCCcccccc------------cccCCC------ccceeeeeccCC---- Confidence 75444 345777777777765 21 111110000 000011 122333333211 Q ss_pred ccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHH Q lcl|NC_021072. 319 RREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELF 397 (533) Q Consensus 319 RReggrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if 397 (533) +| +.-|+++.+.---.+ .+-+..+.+.+....+++-+-|+.++. ....++||....-.-..-+.+.|+.|...+ T Consensus 329 --~~--~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~-~~~TAtei~s~~~~l~~t~~~~~~~~~~al 403 (505) T protein:vir:79 329 --AS--EVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPS-GIQTATEVVTNNSQTYQTRSSYITQVEKTI 403 (505) T ss_pred --CC--CCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCcc-ccchHHHHHHHHhHHHHHHHHHHHHHHHHH Confidence 22 223777776432232 345777888889999999999986643 234567776665555556778888888888 Q ss_pred HHHHHHHHHhccCCC-------HhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHh Q lcl|NC_021072. 398 MDLLKTQLILKGVMS-------LEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVL 470 (533) Q Consensus 398 ~d~Lk~qLilkgi~t-------~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL 470 (533) .++++.=|-+..+.. ...++.-.-.+.++|... .+.. +++++- ..+.+++ - | .+|.++++++.. T Consensus 404 ~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~-i~~d-~~~~~~-~~~~~v~----~-G-i~s~e~~l~~~~ 474 (505) T protein:vir:79 404 KALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDG-VFVD-QESKRA-ADLQAVQ----A-Q-VMPKKQFLMRNY 474 (505) T ss_pred HHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCC-CCCC-HHHHHH-HHHHHHH----c-C-CCCHHHHHHhcC Confidence 887777654433211 111111112466666532 2222 222221 1222211 1 3 588899888889 Q ss_pred CCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCC Q lcl|NC_021072. 471 KQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNA 509 (533) Q Consensus 471 ~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~ 509 (533) +.||+|.+++-++|++|.... .|+|+ +-||+ T Consensus 475 ~~~eeea~~el~ri~~E~~~~-~p~~~-------~~gg~ 505 (505) T protein:vir:79 475 GLDEEEADEWLAQIDAENSTA-EPEFN-------QFGGD 505 (505) T ss_pred CCChHHHHHHHHHHHHhcccc-CCCch-------hccCC Confidence 999999999999999987542 23221 22222 No 65 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=98.18 E-value=3e-06 Score=50.89 Aligned_cols=434 Identities=11% Similarity=0.094 Sum_probs=178.1 Q ss_pred ccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHh-hhhcchhhhHHHHhhcceeeecCCCceEEE Q lcl|NC_021072. 14 KKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYRE-MVLQPECDSAVDDIVNETICGNFDDVPVEV 92 (533) Q Consensus 14 ~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~-m~~~pEvd~AvdeIvneaiv~d~~~~~v~v 92 (533) -++..+.+++.|-.-+=...+... |.+ .......+... ...|-. .+++|-|.+||+-|.+.+- ..|+.+ T Consensus 1 ~~~~~~~~~~~p~~~e~~~~~~~~-~~~-~~~~~~~~~~~---~~~~~~~a~~~~~V~acV~~IA~~iA-----~lpl~l 70 (518) T protein:vir:10 1 MLLANGQTLSAPAMAELSPQMQDS-YYY-APAVGMQLERQ---FSLYGGIYKNQPWVRTVIAKRAQALA-----RLPVKC 70 (518) T ss_pred CcccCceeecCchhhhhhhhhhcc-ccc-ccccceecccc---cchhhHHHhhhHHHHHHHHHHHHhhc-----cCceEE Confidence 344444444444322211111111 111 00011111110 111212 3578999999999998753 244444 Q ss_pred EeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhh----hcCceeeeeeecCCCCCCCeEEEEEcChhhceehhh Q lcl|NC_021072. 93 ELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWY----VDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTE 168 (533) Q Consensus 93 ~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WY----vDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~ 168 (533) --..-+.....+ ..-+..+++.=|-...+.++.+.|. +.|.-|..++-|. .+.+++|.+|+|..+...+. T Consensus 71 ~~~~~~~~~~~~---~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~---~G~~~~L~~l~p~~v~v~~~ 144 (518) T protein:vir:10 71 MFTSGDTETEES---DTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK---SGTPEKLMPMHPSRVAIKRN 144 (518) T ss_pred EEEcCCCceecc---chHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECC---CCcEEEEEEECCCceEEEEc Confidence 221111111111 1111122222234456666666544 6799999877653 45699999999998854322 Q ss_pred ccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCC-CccchhHHHHHHHHHHHHH Q lcl|NC_021072. 169 YQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNK-NMTLSHLHKAIKAVNQLRM 247 (533) Q Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~-~~i~syL~~AiK~~NqLrm 247 (533) - ......+..... .+....... +.....++|+..+ .++ ..++|-|..|.+++..... T Consensus 145 ~---~~~~~~y~~~~~--~~~~~~~~~-------~~~~eViHir~~s----------~dg~~~G~spi~~a~~~i~~~~a 202 (518) T protein:vir:10 145 S---RTGRYEYYFQAG--AGVGTQLVS-------FADDEVVPIRFFN----------PDGLERGLSLMESLKSTIFSEDS 202 (518) T ss_pred C---CCCEEEEEEEec--CCccceEEE-------ecCCcEEEecCCC----------CCcccccccHHHHHHHHHHHHHH Confidence 1 111111110000 000000011 1122333333221 122 1356889999888888888 Q ss_pred HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccc Q lcl|NC_021072. 248 IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTE 327 (533) Q Consensus 248 ~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTE 327 (533) +++...=+----+.-+-|.-++ +.|.+..+++--+.+-..|.- ..+.|.+ + .++ + |.+ T Consensus 203 ~~~~~~~~f~ng~~p~gil~~~-~~ls~e~~~~~k~~~~~~~~G----~~nag~v------~-vL~-------~---G~~ 260 (518) T protein:vir:10 203 SRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSG----SSNTGKT------M-VVE-------E---GME 260 (518) T ss_pred HHHHHHHHHhcCCCccEEEecC-CCCCHHHHHHHHHHHHHHhcC----ccccCcc------e-EcC-------C---Cce Confidence 7776433322234455566665 445555444433333333321 0111211 1 222 2 344 Q ss_pred eeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 328 ISTLPG-GQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IARLRKRFSELFMDLLKTQL 405 (533) Q Consensus 328 IsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kf-i~rLr~~fs~if~d~Lk~qL 405 (533) ++.|.= ...+.-++-.+|..+.+.++++||...|+..++-+..+.++..+. |..+ |.-+-.++...|...| T Consensus 261 ~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~eq~~~~---f~~~tL~P~l~~ie~~ln~~L---- 333 (518) T protein:vir:10 261 PIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRA---FYRDTMAIPIARIQSAMDKYV---- 333 (518) T ss_pred EEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHH---HHHHHHHHHHHHHHHHHHHhh---- Confidence 544431 122223445568889999999999999964433334344444444 5544 4444444444444333 Q ss_pred HhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHH-- Q lcl|NC_021072. 406 ILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQ-- 483 (533) Q Consensus 406 ilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kq-- 483 (533) ++..+ -.. .+.|. ..++.... +..|.+.+..+-.- -++|..-+++ .+++..-+-..-++. T Consensus 334 -----~~~~~---~~~--~~~fd----~~~llr~D-~~~r~~~~~~~~~~--G~lT~NE~R~-~~Gl~pie~~~gD~~~~ 395 (518) T protein:vir:10 334 -----GQYWV---RKN--RMKFD----IDDVIQPD-WEAKSESTQKMVNS--GVATPNEGRE-IMGLPRSDDPKADELYA 395 (518) T ss_pred -----ccccc---CCc--eEEEe----chhhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH-HhCCCCCCCCCCCeeee Confidence 22211 112 34443 23332222 24566666666443 4788888774 477754321000000 Q ss_pred ------HH-------HhhhcCCCCCCCcc------cccCCCCCCCCCCC-----CccccccccCCccccchhcC Q lcl|NC_021072. 484 ------ID-------SEREAGLIVDPMAE------MDPAMDPGNAPPAD-----DMSAQEGPAVDAGDAKRGEF 533 (533) Q Consensus 484 ------i~-------~E~~~~~~~~p~~~------~~~~~~~~~~~~~~-----d~~~~~~~~~~~~~~~~~~~ 533 (533) +. +....+.-++|.+. +.|....++..++. |.... .|.........+|| T Consensus 396 ~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 468 (518) T protein:vir:10 396 NSALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKT-EPRRLMQKPPPKES 468 (518) T ss_pred cccceecccccccccCCCCCCCCCCCCccccccccccccccCCCCCccccccccccccc-chhccccCCCcccc Confidence 00 00000000011100 00000000111110 00000 00000111111222 No 66 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=98.17 E-value=3.2e-06 Score=50.68 Aligned_cols=446 Identities=12% Similarity=0.087 Sum_probs=190.8 Q ss_pred cCCCCCCCCCc-ccceeecccccccccchhhhhhHHHHHHHHHHhhhhc-chhhhHHHH-hhcceeeecCCCc------- Q lcl|NC_021072. 19 GPSFVQKDSMD-GSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQ-PECDSAVDD-IVNETICGNFDDV------- 88 (533) Q Consensus 19 ~~s~~~~~~~d-g~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~-pEvd~Avde-Ivneaiv~d~~~~------- 88 (533) -+++.+--.+. ....++.. ... .-..-..+.+++..|=+-... +..-.++.. .-|--+|++.-.. T Consensus 1 ~~~~~~~~~e~~~~~~~~~~-l~~----~~~~~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~ 75 (486) T protein:vir:42 1 MTAPLPGMEEIEDPAVVREE-MIS----AFEDASKDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAE 75 (486) T ss_pred CCCCCCCCCCcccHHHHHHH-HHH----HHHHHHHHHHHHHHHhcccCcchhcccccchhHhhhhhccchHHHHHHHHHh Confidence 11221111000 00111110 000 000011122223333211100 000001100 0000011111111 Q ss_pred ---eEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCC----CCCCCeEEEEEcChh Q lcl|NC_021072. 89 ---PVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPK----NPRGGLTELRYIDPR 161 (533) Q Consensus 89 ---~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~----~~~~gI~elr~lDP~ 161 (533) ++...+ +.++.. .+++..|+..-+|+....++++.-++.|+-|.+.-.+.. ...+|...++.+||+ T Consensus 76 ~l~~~g~~~---~~~~~~----~~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~ 148 (486) T protein:vir:42 76 RQAVEGFRL---GDADEA----DEELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPT 148 (486) T ss_pred hhcccceec---CCCchh----HHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEeccc Confidence 111111 111211 234566666667899999999999999999888655431 234677788999999 Q ss_pred hceehhhccC-CCcCceeEEeccceeeccchhceecccccccc-------------ccCCcceeccchhhccccccccCC Q lcl|NC_021072. 162 KIRKVTEYQQ-KRPEQLRGEDINTQLTQKAAEYYLYNPKGLKN-------------STNQGMKIATDSVTYCHSGIQDLN 227 (533) Q Consensus 162 ~i~~vr~~~~-~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~-------------~~~~~~kI~~dai~y~hsGl~d~~ 227 (533) .+-.+..--. +..-..+++.. .-.+.....-+|.|..... .+|..-+||.= .|.+- .++. T Consensus 149 ~~~~i~d~~~~~~~~~~~~~~~--~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv--~~~n~--~~~~ 222 (486) T protein:vir:42 149 RMHAEIDPRINRVSKAIRVAYD--KEGNEIQAATLYTPMETIGWFRADGEWAEWFNVPHGLGVVPVV--PLPNR--TRLS 222 (486) T ss_pred ceEEEEeCCCCCeEEEEEEEEe--cCCCeEEEEEEEcCCcEEEEEecCCcEEeecceecCCCCceEE--Eeccc--cccC Confidence 8755443111 10111111110 0011111122344432211 11222222221 12211 0111 Q ss_pred CCccchhHHHH----HHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccc Q lcl|NC_021072. 228 KNMTLSHLHKA----IKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIK 303 (533) Q Consensus 228 ~~~i~syL~~A----iK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~ 303 (533) +..+.|-|.+. +..+|. .+-+..++-..+-.|.|-|.-.+....+.... +.+.+.++..| T Consensus 223 ~~~G~s~i~~~v~~liDa~~~--~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~-----------~~~~~~~~~~~--- 286 (486) T protein:vir:42 223 DLYGTSEITPELRSMTDAAAR--ILMLMQATAELMGVPQRLIFGIKPEEIGVDSE-----------TGQTLFDAYLA--- 286 (486) T ss_pred CCCCcccchhhHHHHHHHHHH--HHHHHHHHHHhhcchHHHhhcCCccccccccc-----------cccchhhhhhc--- Confidence 11233433332 333332 34455666666666666554333222110000 01111111122 Q ss_pred cccccchhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHH Q lcl|NC_021072. 304 DDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQ 383 (533) Q Consensus 304 ~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~ 383 (533) ..|..-.+ ..++..+|+..-=.-++-++=.-.++...-++|..-|+..+. |-..+..|.--+.... T Consensus 287 ----------~~~~~~~~---~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~-n~~Sg~Al~~~~~~l~ 352 (486) T protein:vir:42 287 ----------RILAFEDA---EGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAAD-NPASAEAIRAAESRLI 352 (486) T ss_pred ----------hhcccCCC---CceEEeecccCHHHHHHHHHHHHHHHhcccCCCHHHhccccC-chhHHHHHHHHHHHHH Confidence 22322111 234556665431123333444445566667888777754332 2223445677777788 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCC-HhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccH Q lcl|NC_021072. 384 KFIARLRKRFSELFMDLLKTQLILKGVMS-LEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSI 462 (533) Q Consensus 384 Kfi~rLr~~fs~if~d~Lk~qLilkgi~t-~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~ 462 (533) .-+.+.|..|..-+...++.-+.+.|... +.+| ..|.+.|.....=+. .+.++.+.++..-+...+|. T Consensus 353 ~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~----~~i~v~w~~~~~~s~-------~~~ad~~~kl~~~~~g~~s~ 421 (486) T protein:vir:42 353 KKVERKNLMFGGAWEEAMRIAYRIMKGGDVPPDM----LRMETVWRDPSTPTY-------AAKADAATKLYGNGQGVIPR 421 (486) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc----eeeeEEecCCCCCCH-------HHHHHHHHHHHhcccCCCCH Confidence 88899999999999999998777776532 2233 257888864433222 34555565555443357898 Q ss_pred HHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCC-C---CCcccccCCCCCCCCCCCCccccccccCCcccc Q lcl|NC_021072. 463 DYMRRQVLKQTDQEIKEIDKQIDSEREAGLIV-D---PMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDA 528 (533) Q Consensus 463 ~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~-~---p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 528 (533) ++++ ..|++++++++++++.-+++...+.-. + ......++.+..++++.++...+. ..++. T Consensus 422 et~~-~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~ 486 (486) T protein:vir:42 422 ERAR-IDMGYSVKEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPTAPPKPQPAIES----SGGDA 486 (486) T ss_pred HHHH-hcCCCChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCCCCCCCcccCC----CCCCC Confidence 9998 569999999998776444433322211 0 001111222222233333222111 11111 No 67 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=98.17 E-value=3.3e-06 Score=50.67 Aligned_cols=434 Identities=12% Similarity=0.112 Sum_probs=200.7 Q ss_pred CCccc-cceeeeccccccccCCCCCCCCCcccceeecccccccccch-----hhhhhHHHHHHHHHHhh-hhcchhhhHH Q lcl|NC_021072. 1 MSNQL-FGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDF-----DGTVRNEYELITRYREM-VLQPECDSAV 73 (533) Q Consensus 1 ~~~~~-fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~-----~~~~~~~~~LI~~YR~m-~~~pEvd~Av 73 (533) +..++ +..+|+.+-.. +.++. ++..-..+.--...|.|-...+ ++.-..+ .+.++ ....-|+..- T Consensus 14 ~~~~~~~~~~~~~~~~~-~~i~~--~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~-----~~~sln~~~~i~~~~A 85 (508) T protein:vir:15 14 GAAATGVTGSLSKITDD-PRISI--DPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKR-----LKNTINMAKTAARRIA 85 (508) T ss_pred HHHHhccccchHHhhcc-ccccc--CHHHHHHHHHHHHHhcCCCcccccccCCCCcccc-----ceeecchHHHHHHHHH Confidence 11122 23333331110 11111 1111000110011111110000 0100000 01111 1111122222 Q ss_pred HHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeE Q lcl|NC_021072. 74 DDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLT 153 (533) Q Consensus 74 deIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~ 153 (533) +-+.+| |+.+.++. ++.. .+-...++.--+|.....+.+....+-|..+|+..+|.. + . T Consensus 86 ~lv~~e---------~~~i~v~~---~~~~----~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~----~-~ 144 (508) T protein:vir:15 86 SVVFNE---------KAEIHVKD---NNEA----DKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYIDGN----H-I 144 (508) T ss_pred hhhhCC---------CceEEeCC---chHH----HHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEeCC----e-e Confidence 222233 33444322 1222 112355665567999999999999999999999999842 2 4 Q ss_pred EEEEcChhhceehhhccCCCcCceeEEec-------cceeec------------cchhceeccccccccccCCcceeccc Q lcl|NC_021072. 154 ELRYIDPRKIRKVTEYQQKRPEQLRGEDI-------NTQLTQ------------KAAEYYLYNPKGLKNSTNQGMKIATD 214 (533) Q Consensus 154 elr~lDP~~i~~vr~~~~~~~~~~~~~~~-------~~~~~~------------~~~e~~~y~p~~~~~~~~~~~kI~~d 214 (533) .+.+++|.++=+++--.+.-.. ..+... ...++. +.-.+.+|...... .. +..++.. T Consensus 145 ~i~~v~ad~~~P~~~d~~~~~~-~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~-~l--G~~v~l~ 220 (508) T protein:vir:15 145 KIAWVRADQFYPLQSNTNDISE-AAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPD-IV--GNQVPLS 220 (508) T ss_pred EEEEEcCCeeEEEEEcCCCeEE-EEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCch-hc--Ccccchh Confidence 5788888887554221111000 001100 000110 11112223221100 01 1222222 Q ss_pred hhh-c-------ccccc---------------ccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCC Q lcl|NC_021072. 215 SVT-Y-------CHSGI---------------QDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVG 271 (533) Q Consensus 215 ai~-y-------~hsGl---------------~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvG 271 (533) .+. | .-.|+ .+..++.++|-++.|+-.+..|-..-+. +.|..|.=.+|||.-+.- T Consensus 221 ~~~e~~~l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~--~~~e~~~~~~~i~v~~~~ 298 (508) T protein:vir:15 221 TLPVYKELAPQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQ--FIWEIRLGQKHIAVQPGM 298 (508) T ss_pred hcccccCCCcceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHH--HHHHHHhcccceeechHH Confidence 110 0 01122 1234566889999999888888766665 558889888898862210 Q ss_pred CCchHHHHHHHHHHHHhcccEEEeeCCCCccc--cccccchhHhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHH Q lcl|NC_021072. 272 NLPKNKAEQYLREVMGRYRNKLVYDANTGEIK--DDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLG-ELEDVKYFQK 348 (533) Q Consensus 272 nlpk~KAeqYl~~im~~~rnk~vYd~~TGev~--~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLg-ei~DV~YF~~ 348 (533) + -+|..+|..- +++.|..|-.| ...|.-|+.+...--.+ -.+-+..+.+ T Consensus 299 --------------l-------~~d~~~~~~~~~~~~~~~~~~~~-------~~~~~~i~~~~~~ir~e~~~~~~~~~l~ 350 (508) T protein:vir:15 299 --------------L-------RFDDEHKPTFDTEQNVYVGVLSD-------DNNGLGVKDMTTPIRTVQYKDAIDHFIK 350 (508) T ss_pred --------------h-------cCCCCCccccCCCCeeEEeccCC-------CCCCCceeEeecccChHHHHHHHHHHHH Confidence 0 0233333321 23333332111 12223365555432222 2455777888 Q ss_pred HHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhc-------cCCCH--hHHhhh Q lcl|NC_021072. 349 KLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILK-------GVMSL--EEWDEM 419 (533) Q Consensus 349 kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilk-------gi~t~--eew~~~ 419 (533) .+....+++.+-|+.+++ ....++||...+-.-..-+.+.|+.|...+.++++.=|-+- +.... ..+... T Consensus 351 ~~~~~~gls~~~f~~~~~-~~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~ 429 (508) T protein:vir:15 351 EFEVQIGLSTGTFSYSND-GVKTATEVVSNNSMTYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQ 429 (508) T ss_pred HHHHHhCCCchhcccccC-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccC Confidence 999999999988876643 33457777666655555577777777777777666644322 11110 112222 Q ss_pred hhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcc Q lcl|NC_021072. 420 KEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAE 499 (533) Q Consensus 420 ~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~ 499 (533) ...+.++|... -.+-+++++ +...++..- | .+|++..+++..+.||+|.+++-++|++|....... +.. T Consensus 430 ~~~v~v~f~D~--i~~d~~~~~-----~~~~~~v~a-G-i~s~e~~i~~~~g~~deea~~el~ri~~E~~~~~~~--~~~ 498 (508) T protein:vir:15 430 PLDIECHFDDG--VFVNKDKQL-----EEDAKVLAI-G-ALSKQTFLQRNYGMTDEQAAEELAKIQSEAPTDTFE--GGR 498 (508) T ss_pred CcceEEEeCCC--CCCCHHHHH-----HHHHHHHhc-C-CCCHHHHHHhcCCCChHHHHHHHHHHHHhccccCcc--ccc Confidence 23366666532 222233322 222222211 3 588888877778999999999999999997654322 222 Q ss_pred cccCCCCCCC Q lcl|NC_021072. 500 MDPAMDPGNA 509 (533) Q Consensus 500 ~~~~~~~~~~ 509 (533) ..|..+.-|+ T Consensus 499 ~~~~~g~~ge 508 (508) T protein:vir:15 499 SAILNGGDGE 508 (508) T ss_pred cccCCCCCCC Confidence 2222222222 No 68 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=98.14 E-value=3.8e-06 Score=50.29 Aligned_cols=449 Identities=12% Similarity=0.101 Sum_probs=185.8 Q ss_pred cCCCCCCCCCcc-cceeecccccccccchhhhhhHHHHHHHHHHhhhh-cchhhhHH-HHhhcceeeecCCCceEEEEec Q lcl|NC_021072. 19 GPSFVQKDSMDG-SQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVL-QPECDSAV-DDIVNETICGNFDDVPVEVELS 95 (533) Q Consensus 19 ~~s~~~~~~~dg-~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~-~pEvd~Av-deIvneaiv~d~~~~~v~v~l~ 95 (533) -+++++--.++- ...+... .... .... ..+....+.|=.-.+ .+....++ .+.-+--+|++.-..+|+--.+ T Consensus 1 ~~~~i~~~~~~~~~~~~~~~-L~~~---~~~~-~~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~ 75 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIARDE-MVSA---FEDQ-NQNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRLYVDSIAE 75 (485) T ss_pred CCCCCCCCCcccchHHHHHH-HHHH---HHHH-HHHHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHHHHHHHhh Confidence 122222111110 0000000 0000 0000 112222222211100 00000000 0000111111111111110000 Q ss_pred -------cCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCC----CCCCeEEEEEcChhhce Q lcl|NC_021072. 96 -------NLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKN----PRGGLTELRYIDPRKIR 164 (533) Q Consensus 96 -------~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~----~~~gI~elr~lDP~~i~ 164 (533) ..+.++... +++..|...=+|+....++++.-++.|+-|...-.|... ...|-..++.+||+.+- T Consensus 76 ~l~~~g~~~~~~~~~~----~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~ 151 (485) T protein:vir:24 76 RQAVEGFRLGDADEAD----EELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPTRMY 151 (485) T ss_pred hhccCceecCCCchhH----HHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccceeE Confidence 011122222 234455555578889999999999999998885544321 13355578888998775 Q ss_pred ehhhccCCCcCceeEEecc-ceeeccchhceeccccccc-------------cccCCcceeccchhhccccccccCCCCc Q lcl|NC_021072. 165 KVTEYQQKRPEQLRGEDIN-TQLTQKAAEYYLYNPKGLK-------------NSTNQGMKIATDSVTYCHSGIQDLNKNM 230 (533) Q Consensus 165 ~vr~~~~~~~~~~~~~~~~-~~~~~~~~e~~~y~p~~~~-------------~~~~~~~kI~~dai~y~hsGl~d~~~~~ 230 (533) .+..--. .....+.... +...+.....-+|.+.... ..+|..-+||.= .|.+.. ++.+.. T Consensus 152 ~i~D~~~--~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv--~f~n~~--~~~~~~ 225 (485) T protein:vir:24 152 AEIDPRI--GRPAKAIRVAYDAEGNEIQAATLYTPNETFGWFRAEGEWVEWFSDPHGLGAVPVV--PLPNRT--RLSDLY 225 (485) T ss_pred EEeeCCc--CceeEEEEEEEeecCCeEEEEEEEcCCcEEEEEecCCceEeecccccCCCcccEE--EeccCc--ccCCcC Confidence 4432111 1111111100 0000111111122222110 012222223221 222211 112223 Q ss_pred cchhHHHHHHHH-HH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCcccccccc Q lcl|NC_021072. 231 TLSHLHKAIKAV-NQ-LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKF 308 (533) Q Consensus 231 i~syL~~AiK~~-Nq-Lrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~ 308 (533) +.|-|.+.++++ .. -+++-+..++-..+-.|.|-+.=.+....+...- +...+.+...| T Consensus 226 G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-----------~~~~~~~~~~~-------- 286 (485) T protein:vir:24 226 GTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPE-----------TGQTLFDAYLA-------- 286 (485) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCccccccccc-----------cccchhhhccc-------- Confidence 445555544443 22 2556677777777777777554222111110000 00111111112 Q ss_pred chhHhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHH Q lcl|NC_021072. 309 MSMLEDFWLPRREGGRGTEISTLPGGQNLG-ELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIA 387 (533) Q Consensus 309 msmlEDywLpRReggrgTEIsTLpGg~nLg-ei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~ 387 (533) ..|+.--+ +.++..++... +. -++-++-.-..+...-++|..-|+..+. |-..+..|.--+...-.-+. T Consensus 287 -----~i~~~~~~---~~~~~q~~~~~-~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~-n~~Sg~Al~~~~~~l~~ka~ 356 (485) T protein:vir:24 287 -----RILAFEDA---EGKIQQFSAAE-LANFTNALDQIAKQVAAYTGLPPQYLSTAAD-NPASAEAIRAAESRLIKKVE 356 (485) T ss_pred -----ceeccCCC---CceEEeecccc-hHHHHHHHHHHHHHHhcccCCCHHHhccccC-cchHHHHHHHHHHHHHHHHH Confidence 23443222 23455565533 22 2222333333444455777777754332 21133456666677777889 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCC-CHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHH Q lcl|NC_021072. 388 RLRKRFSELFMDLLKTQLILKGVM-SLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMR 466 (533) Q Consensus 388 rLr~~fs~if~d~Lk~qLilkgi~-t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~ 466 (533) +.|..|..-+...++.-+.+.+-. ...+| ..|.+.|.....=+. .+.++.+.++..-+-..+|.++++ T Consensus 357 ~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~----~~i~v~f~~~~~~s~-------~~~ad~~~kl~~~g~~~~s~et~~ 425 (485) T protein:vir:24 357 RKNAIFGGAWEEAMRLAYRLMKGGDVPPDM----LRMETVWRDPSTPTY-------AAKADAATKLYGNGQGVIPRERAR 425 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCcccc----ceeeEEecCCCCCCH-------HHHHHHHHHHHhcccccCCHHHHH Confidence 999999999999988765554422 22233 357888864432222 345566666655433578999998 Q ss_pred HHHhCCCHHHHHHHHHHHHHhhhcCCCC-CCCccc---ccCCCCCCCCCCCCccccccccCCcc Q lcl|NC_021072. 467 RQVLKQTDQEIKEIDKQIDSEREAGLIV-DPMAEM---DPAMDPGNAPPADDMSAQEGPAVDAG 526 (533) Q Consensus 467 k~IL~~tDeeI~e~~kqi~~E~~~~~~~-~p~~~~---~~~~~~~~~~~~~d~~~~~~~~~~~~ 526 (533) + .|++++++++++++..++|...+.-. +..... .++.+.+++.+.+..+.+ +.+++ T Consensus 426 ~-~l~~~~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~---~~~~a 485 (485) T protein:vir:24 426 K-DMGYSIAEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPNPTPAPKPQPAIE---GGDSA 485 (485) T ss_pred h-hCCCCHhHHHHHHHHHHHHhhhhhhHHHhhcccCCCCCCCCCCCCCCCCccCCC---CCCCC Confidence 5 59999999998877655554433211 111111 111111111111111111 01111 No 69 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=98.12 E-value=4.2e-06 Score=50.04 Aligned_cols=388 Identities=15% Similarity=0.152 Sum_probs=172.0 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |. ||... +...+..|...+.- .....++-.+.++ . .+....+|.|.+||+-|++.+ T Consensus 1 M~--~f~~~----~~~~~~~~~~~~~~----~~~~~~~~~~~~v------~--------~~~al~~~~V~~~v~~ia~~i 56 (397) T protein:vir:38 1 MP--LLKLN----KSHSQGFSLNDPDW----VNFLTGGEAQKYV------S--------ADTALKNSDIFSLIMQLSGDL 56 (397) T ss_pred Cc--chhhh----hcccCcccCCchhh----hhhhcCCcCCcee------c--------hHHhhccHHHHHHHHHHHHHH Confidence 43 34321 11122223222110 1111111111111 1 122356899999999999886 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhh----hHHHHhhhhcCceeeeeeecCCCCCCCeEEEE Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRS----YEIFRRWYVDGRLFYHKVIDPKNPRGGLTELR 156 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~----~~~fR~WYvDGri~~hkvid~~~~~~gI~elr 156 (533) -. .|+.++ . ..+..++..=+=.-.+ +.+++.+++.|.-|+.++-|. .+.+++|. T Consensus 57 a~-----~p~~~~--~------------~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~---~g~~~~l~ 114 (397) T protein:vir:38 57 AM-----VRYTSE--S------------DRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNT---NGVDLSWE 114 (397) T ss_pred hh-----Cccccc--c------------cHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECC---CCcEEEEE Confidence 53 333322 1 0111222211222233 445556888999999988764 45699999 Q ss_pred EcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCC-ccchhH Q lcl|NC_021072. 157 YIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKN-MTLSHL 235 (533) Q Consensus 157 ~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~-~i~syL 235 (533) ++||..++.++.. +++...+..... .+ ..+....++.+-+.|.. ....++. .+.|-| T Consensus 115 ~l~~~~v~i~~~~---~~~~~~y~~~~~------------~~-----~~~~~~~~~~~eiih~~--~~~~~~~~~G~s~i 172 (397) T protein:vir:38 115 YLRPSQVQPMLLQ---DGSGLIYNINFD------------EP-----AIGYMENVPAADVIHIR--LLSKNGGKTGISPL 172 (397) T ss_pred EEcCceeEEEEcC---CCceEEEEEEec------------cc-----cccceeEecCccEEEec--CCCCCCccccccHH Confidence 9999998653322 111111110000 00 01111223333332222 1122222 356899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhh Q lcl|NC_021072. 236 HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDF 315 (533) Q Consensus 236 ~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDy 315 (533) ..|.+.+.....+++...-+--.-+.-+-++.++.+ +.+. +.+-+++.+..++.- + +.|. .+ .+ T Consensus 173 ~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~-~~~e-~~~~~~~~~~~~~~~---~-n~~~------~~-vl--- 236 (397) T protein:vir:38 173 SALINEQQIKDASNELTLKALKQSVTASAVLTIQKG-GLLD-AETRIARSKEISKQI---H-NSDG------PV-VI--- 236 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC-CCHH-HHHHHHHHHHHHhcc---c-ccCC------ce-ec--- Confidence 999999999999888877655556666777777765 3433 334444444332210 1 1111 11 11 Q ss_pred cccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHH Q lcl|NC_021072. 316 WLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFS 394 (533) Q Consensus 316 wLpRReggrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs 394 (533) ..|.+++.|.-..+-.+ ++=.++..+.+.++++||...|+...+-+ +.+.....-|.+-+.-+...+ T Consensus 237 -------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~----~~~e~~~~~~~~~l~P~~~~i- 304 (397) T protein:vir:38 237 -------DALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQ----SSITQISGQYAKSLNRYVQAI- 304 (397) T ss_pred -------CCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc----cHHHHHHHHHHHHHHHHHHHH- Confidence 12566666654333333 45567889999999999999997543321 222222222322233333333 Q ss_pred HHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCH Q lcl|NC_021072. 395 ELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTD 474 (533) Q Consensus 395 ~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tD 474 (533) .+.|-..| ++..+|+ +.+.+.. -+..|.+.++.+-. +.++|.+-+++ +|++.. T Consensus 305 ---e~~ln~~l-----~~~~~~~-----~~~~~~~-----------d~~~~~~~~~~~~~--~G~~t~nE~R~-~lg~~p 357 (397) T protein:vir:38 305 ---VGELNDKL-----HANISAN-----IRFAIDA-----------MGDQYASTISSSVK--GGTIAGNQARF-ILQNSG 357 (397) T ss_pred ---HHHHHHhc-----cChhccc-----ccccccC-----------CHHHHHHHHHHHHh--CCCcCHHHHHH-HhCCCC Confidence 33333332 2333332 1222211 13455555555422 24677777764 355533 Q ss_pred HHHHHHHHHHHHhhhcCCCCCCCcc---cccCCCCCCCCCCCCccccccccCCccccc Q lcl|NC_021072. 475 QEIKEIDKQIDSEREAGLIVDPMAE---MDPAMDPGNAPPADDMSAQEGPAVDAGDAK 529 (533) Q Consensus 475 eeI~e~~kqi~~E~~~~~~~~p~~~---~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~ 529 (533) -+ .+-.+.|... ..+.... .+.+++++..+.+ .+|.+ T Consensus 358 ~~-------------~~d~~~~~~~~~~~~~~~~~--~~g~~~~~~~~e~---~~~~~ 397 (397) T protein:vir:38 358 YL-------------AKDLPDPEKEPQQAIQLIQQ--EGGENDGNNSDER---GSDPE 397 (397) T ss_pred CC-------------CCcccccccccccccccccc--ccCCCCCCCCCCC---CCCCC Confidence 10 0001111100 0001110 0111111111111 11111 No 70 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=98.10 E-value=4.6e-06 Score=49.86 Aligned_cols=385 Identities=14% Similarity=0.137 Sum_probs=164.7 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |--.+|.|- ++.....+. +.+.....+|......+.+.+.. +. -|.. +..+++|-|.+||+-|.+.+ T Consensus 1 m~m~~f~~~-~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~----~~------~v~~-~~al~~~~v~~~i~~ia~~i 67 (392) T protein:vir:10 1 MILPILNFI-NQTNDPPEV-GSVQSYFPDGNDAQIMESLLGDN----NE------WVSA-RAALRNSDLFSIILQLSSDL 67 (392) T ss_pred Ccchhhhhh-hcccccccc-cccccccccCchhhhhhhhcCCC----Cc------eech-HHhhccHHHHHHHHHHHHhh Confidence 666666653 221221111 11122222222222222222210 00 0111 22346899999999999885 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhH----HHHhhhhcCceeeeeeecCCCCCCCeEEEE Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYE----IFRRWYVDGRLFYHKVIDPKNPRGGLTELR 156 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~----~fR~WYvDGri~~hkvid~~~~~~gI~elr 156 (533) -- .|+.+. +. .. ..+++.=|-.-.+.+ ++..+++.|.-|..++-|. .+.+++|. T Consensus 68 a~-----lp~~~~--~~-----~~-------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~---~g~~~~L~ 125 (392) T protein:vir:10 68 AI-----VKINAE--KK-----KN-------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNA---NGADMKWE 125 (392) T ss_pred cc-----Cceeec--cc-----hh-------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECC---CCcEEEEE Confidence 43 333332 11 00 112222333334444 4446788899999988764 45699999 Q ss_pred EcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCC-CccchhH Q lcl|NC_021072. 157 YIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNK-NMTLSHL 235 (533) Q Consensus 157 ~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~-~~i~syL 235 (533) +++|..++.++.. ......+...... +.......| .+...++|+.. ..++ -.++|.| T Consensus 126 ~l~~~~v~~~~~~---~~~~~~y~~~~~~--~~~~~~~~~-------~~~eiih~~~~----------~~~~~~~G~s~i 183 (392) T protein:vir:10 126 YLRPSQVNTYYFE---YENGMYYNITFDD--PKIEPILQA-------PQSDLIHMKLL----------SIDGGKTGISPL 183 (392) T ss_pred EEcCceeEEEEcC---CCceEEEEEEecC--cccceeEEE-------ccccEEEecCC----------CCCCccccccHH Confidence 9999999754322 1111111110000 000000111 11223333322 1222 2457999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhh Q lcl|NC_021072. 236 HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDF 315 (533) Q Consensus 236 ~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDy 315 (533) ..|...++....+++...-+=---+--+-+..++.+..+..++.+.++ ..|+.. ...|.+ + .+ T Consensus 184 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~---~~~~~~----~~~g~~------~-vl--- 246 (392) T protein:vir:10 184 YSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRS---RSFMKR----SRSGGP------V-VL--- 246 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHH---HHHhcc----ccCCCe------e-ec--- Confidence 999999999888887765433333455666667665545444433333 233321 111211 1 11 Q ss_pred cccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHH-HHHHHHH Q lcl|NC_021072. 316 WLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFI-ARLRKRF 393 (533) Q Consensus 316 wLpRReggrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi-~rLr~~f 393 (533) | .|++++.|.-...-.+ ++=.+|..+.+.++++||...|+..+..+ ...+-.+. |..++ .-+-.++ T Consensus 247 --~-----~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~--~~~~~~~~---f~~~~l~P~~~~i 314 (392) T protein:vir:10 247 --D-----DLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ--SSIQQISG---MYASALNRYLRPA 314 (392) T ss_pred --C-----CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc--cHHHHHHH---HHHHHHHHHHHHH Confidence 1 2567777754333333 55567888999999999999996533221 11111222 44322 2222222 Q ss_pred HHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCC Q lcl|NC_021072. 394 SELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQT 473 (533) Q Consensus 394 s~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~t 473 (533) .. .|...|. + .+.+++. .+.+... ..+...+..+- -+-.+ | T Consensus 315 e~----~l~~~L~-----~---------~~~~d~~------~~~~~d~-~~~~~~~~~l~--~~g~~------------t 355 (392) T protein:vir:10 315 IS----ELEYKLS-----D---------HISVNMR------PAIDPLG-DNYLSTISTAT--RWGAL------------A 355 (392) T ss_pred HH----HHHHhcc-----c---------cccccch------hhhccCH-HHHHHHHHHHH--hCCCc------------C Confidence 22 2222221 1 1111111 0000000 01111111110 01223 3 Q ss_pred HHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCcccccccc Q lcl|NC_021072. 474 DQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPA 522 (533) Q Consensus 474 DeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~ 522 (533) ..|.-++. ...|.. |+ +-+..+ +.+|..++..++... T Consensus 356 ~nE~r~~l------~~~g~~--p~--e~r~~e--~l~~~~~Gd~~~p~p 392 (392) T protein:vir:10 356 ENQATFVL------QEAGYI--PK--DLPAPE--NTNKKTTGQSNEPVP 392 (392) T ss_pred HHHHHHHH------HhcCCC--cc--ccchhc--CCCCCCCCCCCCCCC Confidence 33332221 123332 22 122222 233333332222111 No 71 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=98.10 E-value=4.6e-06 Score=49.86 Aligned_cols=385 Identities=14% Similarity=0.137 Sum_probs=164.7 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |--.+|.|- ++.....+. +.+.....+|......+.+.+.. +. -|.. +..+++|-|.+||+-|.+.+ T Consensus 1 m~m~~f~~~-~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~----~~------~v~~-~~al~~~~v~~~i~~ia~~i 67 (392) T protein:vir:39 1 MILPILNFI-NQTNDPPEV-GSVQSYFPDGNDAQIMESLLGDN----NE------WVSA-RAALRNSDLFSIILQLSSDL 67 (392) T ss_pred Ccchhhhhh-hcccccccc-cccccccccCchhhhhhhhcCCC----Cc------eech-HHhhccHHHHHHHHHHHHhh Confidence 666666653 221221111 11122222222222222222210 00 0111 22346899999999999885 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhH----HHHhhhhcCceeeeeeecCCCCCCCeEEEE Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYE----IFRRWYVDGRLFYHKVIDPKNPRGGLTELR 156 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~----~fR~WYvDGri~~hkvid~~~~~~gI~elr 156 (533) -- .|+.+. +. .. ..+++.=|-.-.+.+ ++..+++.|.-|..++-|. .+.+++|. T Consensus 68 a~-----lp~~~~--~~-----~~-------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~---~g~~~~L~ 125 (392) T protein:vir:39 68 AI-----VKINAE--KK-----KN-------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNA---NGADMKWE 125 (392) T ss_pred cc-----Cceeec--cc-----hh-------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECC---CCcEEEEE Confidence 43 333332 11 00 112222333334444 4446788899999988764 45699999 Q ss_pred EcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCC-CccchhH Q lcl|NC_021072. 157 YIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNK-NMTLSHL 235 (533) Q Consensus 157 ~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~-~~i~syL 235 (533) +++|..++.++.. ......+...... +.......| .+...++|+.. ..++ -.++|.| T Consensus 126 ~l~~~~v~~~~~~---~~~~~~y~~~~~~--~~~~~~~~~-------~~~eiih~~~~----------~~~~~~~G~s~i 183 (392) T protein:vir:39 126 YLRPSQVNTYYFE---YENGMYYNITFDD--PKIEPILQA-------PQSDLIHMKLL----------SIDGGKTGISPL 183 (392) T ss_pred EEcCceeEEEEcC---CCceEEEEEEecC--cccceeEEE-------ccccEEEecCC----------CCCCccccccHH Confidence 9999999754322 1111111110000 000000111 11223333322 1222 2457999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhh Q lcl|NC_021072. 236 HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDF 315 (533) Q Consensus 236 ~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDy 315 (533) ..|...++....+++...-+=---+--+-+..++.+..+..++.+.++ ..|+.. ...|.+ + .+ T Consensus 184 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~---~~~~~~----~~~g~~------~-vl--- 246 (392) T protein:vir:39 184 YSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRS---RSFMKR----SRSGGP------V-VL--- 246 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHH---HHHhcc----ccCCCe------e-ec--- Confidence 999999999888887765433333455666667665545444433333 233321 111211 1 11 Q ss_pred cccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHH-HHHHHHH Q lcl|NC_021072. 316 WLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFI-ARLRKRF 393 (533) Q Consensus 316 wLpRReggrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi-~rLr~~f 393 (533) | .|++++.|.-...-.+ ++=.+|..+.+.++++||...|+..+..+ ...+-.+. |..++ .-+-.++ T Consensus 247 --~-----~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~--~~~~~~~~---f~~~~l~P~~~~i 314 (392) T protein:vir:39 247 --D-----DLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ--SSIQQISG---MYASALNRYLRPA 314 (392) T ss_pred --C-----CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc--cHHHHHHH---HHHHHHHHHHHHH Confidence 1 2567777754333333 55567888999999999999996533221 11111222 44322 2222222 Q ss_pred HHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCC Q lcl|NC_021072. 394 SELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQT 473 (533) Q Consensus 394 s~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~t 473 (533) .. .|...|. + .+.+++. .+.+... ..+...+..+- -+-.+ | T Consensus 315 e~----~l~~~L~-----~---------~~~~d~~------~~~~~d~-~~~~~~~~~l~--~~g~~------------t 355 (392) T protein:vir:39 315 IS----ELEYKLS-----D---------HISVNMR------PAIDPLG-DNYLSTISTAT--RWGAL------------A 355 (392) T ss_pred HH----HHHHhcc-----c---------cccccch------hhhccCH-HHHHHHHHHHH--hCCCc------------C Confidence 22 2222221 1 1111111 0000000 01111111110 01223 3 Q ss_pred HHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCcccccccc Q lcl|NC_021072. 474 DQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPA 522 (533) Q Consensus 474 DeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~ 522 (533) ..|.-++. ...|.. |+ +-+..+ +.+|..++..++... T Consensus 356 ~nE~r~~l------~~~g~~--p~--e~r~~e--~l~~~~~Gd~~~p~p 392 (392) T protein:vir:39 356 ENQATFVL------QEAGYI--PK--DLPAPE--NTNKKTTGQSNEPVP 392 (392) T ss_pred HHHHHHHH------HhcCCC--cc--ccchhc--CCCCCCCCCCCCCCC Confidence 33332221 123332 22 122222 233333332222111 No 72 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=98.08 E-value=5e-06 Score=49.63 Aligned_cols=431 Identities=12% Similarity=0.100 Sum_probs=174.8 Q ss_pred CCcc---ccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhH-HHHHHHHHHhhhhcchhh------ Q lcl|NC_021072. 1 MSNQ---LFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRN-EYELITRYREMVLQPECD------ 70 (533) Q Consensus 1 ~~~~---~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~-~~~LI~~YR~m~~~pEvd------ 70 (533) |.-- +++...+. -....+..++..+ .+...- +..-++. ..+.+.+|+.+..+++-. T Consensus 1 ~~~~~~~~~~~~~~~----~~~~~~~~~~~~~---~~~~~~-------i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~ 66 (481) T protein:vir:10 1 MTVYTINNINTKFSP----LANDDFVVSDLAE---LLKEEN-------LRNFISRHQTEQVPRLEMLESYYLNRNTDILA 66 (481) T ss_pred CeeEeeehhchhccc----ccCceeeeecchh---hcCHHH-------HHHHHHHHHHHHHHHHHHHHHHhcCCCccccc Confidence 1110 00000000 0000011111000 000000 0000000 111222222222222111 Q ss_pred ------------------hHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhh Q lcl|NC_021072. 71 ------------------SAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWY 132 (533) Q Consensus 71 ------------------~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WY 132 (533) +-...||+...-+ .-+.|+.+.+++ +.. .+.+..++...+|+....+.++..+ T Consensus 67 ~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~-l~g~~~~~~~~d----~~~----~~~l~~~~~~n~~~~~~~~~~~~~~ 137 (481) T protein:vir:10 67 GERRLQKYGDKADHRAVHNYAKYVSRFIVGY-LTGNPITITHQD----NQT----NDKIIELNDLNDADEVNSDLALNLS 137 (481) T ss_pred CccccccccccccceeecchHHHHHHHHHhh-hccCCceEecCC----hhH----HHHHHHHHHhcChhHHHHHHHHHHH Confidence 1112223221111 124666666554 222 2345566777789999999999999 Q ss_pred hcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCcee---EEeccceeeccchhceecccccccc------ Q lcl|NC_021072. 133 VDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLR---GEDINTQLTQKAAEYYLYNPKGLKN------ 203 (533) Q Consensus 133 vDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~---~~~~~~~~~~~~~e~~~y~p~~~~~------ 203 (533) +-|+-|++.-+|. +|-..+..+||+.+-++..-... ..... ++.....-.....-..+|.+..... T Consensus 138 ~~G~~~~~~~~d~----dg~~~i~~~~p~~~~~v~d~~~~-~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~ 212 (481) T protein:vir:10 138 IYGRAYEIVYRDF----EDRDTFKVLDPKSTFVVYDQTLD-KKVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGG 212 (481) T ss_pred hcCeEEEEEEeCC----CCeEEEEEEcccceEEEEcCCCC-CceEEEEEEEEEeeCCCceEEEEEEEecCeEEEEEecCC Confidence 9999999987763 46778999999999776432211 11111 1111000000001111333322110 Q ss_pred -------ccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCch Q lcl|NC_021072. 204 -------STNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPK 275 (533) Q Consensus 204 -------~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLr-m~EDalVIyRi~RAPeRrvfyIDvGnlpk 275 (533) .+|.--+||. ++ -+++....|-++..+....-+. ++=+....-+-++.|-+-+. |..+ T Consensus 213 ~~~~~~~~~~~~g~vPv-----v~----~~n~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~----g~~~- 278 (481) T protein:vir:10 213 TYHRVEEVEHYYNDVPI-----IE----YLNDQFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAII----GNVD- 278 (481) T ss_pred ceeecccccccCCceeE-----EE----eecCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEee----cCcC- Confidence 1111112221 11 0122334455554444433332 22333334444555544332 1111 Q ss_pred HHHHHHHHHHHHhcccEEEeeCCCCc-cccccccchhHhhhcccccCCCCccceeecCCCCCcchHH-HHHHHHHHHHHh Q lcl|NC_021072. 276 NKAEQYLREVMGRYRNKLVYDANTGE-IKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELE-DVKYFQKKLYKA 353 (533) Q Consensus 276 ~KAeqYl~~im~~~rnk~vYd~~TGe-v~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~-DV~YF~~kLy~a 353 (533) .|.++|. ++.++........-+.+ .+.+.++..|....+...+. -+.-.++.+|.. T Consensus 279 -------------------~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~ 336 (481) T protein:vir:10 279 -------------------LDSEDAKAFRDANMIHLEPGTNANG---SEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKY 336 (481) T ss_pred -------------------CCccchhhhhhccceeccccccccC---CCCCcceeEEeecCCHHHHHHHHHHHHHHHHHH Confidence 0111111 11111111111111111 12223455554444443333 356667778898 Q ss_pred cCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchH Q lcl|NC_021072. 354 LNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYF 433 (533) Q Consensus 354 L~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f 433 (533) ..+|---++.-++ +. .|..|.........-+.+.|..|...+.++++.=+-+-++-...+++ ...+.+.|.....- T Consensus 337 s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~--~~~i~v~f~~~~~~ 412 (481) T protein:vir:10 337 TNTPDLNDEQFSG-VQ-SGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQHN--YAELTITFTPNLPK 412 (481) T ss_pred hCCcccccccccc-cc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccc--cceeeEEeCCCCCc Confidence 8988433322211 11 22234444444556688888888888888776543333332222222 23578888755444 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCH---HHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCC Q lcl|NC_021072. 434 TELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTD---QEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAP 510 (533) Q Consensus 434 ~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tD---eeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~ 510 (533) .+. +.++++.++. | .+|.+++++. |...+ +|++.++++-+++.+... .......++. .. T Consensus 413 ~~~-------~~a~~~~kl~---g-~is~et~~~~-l~~i~d~~~E~~ri~~E~~~~~~~~~----~~~~~~~~~~--~~ 474 (481) T protein:vir:10 413 SMM-------ESINAFNALS---G-GVSESTRLSL-LDFIDNPKEELEKMQEEEAQREKQAD----KRGYGEAFEN--HL 474 (481) T ss_pred CHH-------HHHHHHHHHh---c-cCChHHHHHh-CCCCCCHHHHHHHHHHHHHHHHhhhh----hccCCccCCC--CC Confidence 443 3444555553 4 4899999977 55543 344444443322222110 0001111111 11 Q ss_pred CCCCccc Q lcl|NC_021072. 511 PADDMSA 517 (533) Q Consensus 511 ~~~d~~~ 517 (533) .+||++- T Consensus 475 ~~dd~~g 481 (481) T protein:vir:10 475 NVDDSNG 481 (481) T ss_pred CCCCCCC Confidence 1233222 No 73 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=98.03 E-value=6.4e-06 Score=49.05 Aligned_cols=399 Identities=14% Similarity=0.124 Sum_probs=174.5 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeec-ccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcc Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVG-GGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNE 79 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~-~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvne 79 (533) |+. |.- +++.+...+... .. .|...... ..+.|.. -.+....++|-|.+||+-|.+. T Consensus 1 Mgl--~~~-~f~~~~~~~~~~--~~---~~~~~~~~~~~~~g~~--------------v~~~~al~~~~v~~~v~~ia~~ 58 (409) T protein:vir:84 1 MSL--FTR-IFSGPSEERTLT--KI---SGIPSPAEDWAMHGDR--------------PGANSAMTLGAFYACVTLLADT 58 (409) T ss_pred Cch--hhh-hhcCCCcccccc--cc---cccccccchhhccCcc--------------cchhhhhccHHHHHHHHHHHHh Confidence 543 331 122111111111 00 01110000 0011110 1233445689999999999998 Q ss_pred eeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcc----hhhhhHHHH----hhhhcCceeeeeeecCCCCCCC Q lcl|NC_021072. 80 TICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDF----ENRSYEIFR----RWYVDGRLFYHKVIDPKNPRGG 151 (533) Q Consensus 80 aiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f----~~~~~~~fR----~WYvDGri~~hkvid~~~~~~g 151 (533) +-. .|+.+--.. +..+ + +...++++|+- ...+.++++ .+++.|.-|..+.+ .+..+- T Consensus 59 iA~-----lp~~~~~~~-~~~~-~------~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~--~~~~g~ 123 (409) T protein:vir:84 59 VAS-----LSIDAYRKK-DNVR-I------PVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISA--RDEANR 123 (409) T ss_pred hhh-----CceEEEEec-CCcc-c------ccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEE--ECCCCc Confidence 653 344332111 1111 1 12345555542 234444444 57788998876554 345667 Q ss_pred eEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCcc Q lcl|NC_021072. 152 LTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMT 231 (533) Q Consensus 152 I~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i 231 (533) +++|++|+|..++.... ...+...+. +.|...+..+...+.++|...+..-. ..+ T Consensus 124 ~~~L~~l~p~~v~v~~~---~~~~~~~~~-------------~~~~~~g~~~~~~dvih~~~~~~~~~---------~~G 178 (409) T protein:vir:84 124 PTAIMPIHPDCIHVTDA---KDEDGDWIE-------------PVYRIDGKVVPNHRIMHIKRYPVAGC---------ALG 178 (409) T ss_pred eEEEEEEcCceeEEEEc---CCCcceEEE-------------EEecCCceEEchhhEEEecCCCCCcc---------ccc Confidence 99999999998853211 111111111 11111111122223344433221111 245 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchh Q lcl|NC_021072. 232 LSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSM 311 (533) Q Consensus 232 ~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msm 311 (533) +|-++.|.+.+.....+++...-+----+--+-+.-++ |+|.+..+++..+.....+.| .|.+ + . T Consensus 179 ~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~n-------~g~~------~-v 243 (409) T protein:vir:84 179 MSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSD-ADLTPDQVKQTQKQWIQSHHN-------RRLP------A-V 243 (409) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecC-CCCCHHHHHHHHHHHHHHhcc-------CCCe------e-e Confidence 68899998888888777776654333333345555554 578777777766666555432 2321 1 1 Q ss_pred HhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHH Q lcl|NC_021072. 312 LEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLR 390 (533) Q Consensus 312 lEDywLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr 390 (533) ++ .|.+++.|--. ..+.-++-.++..+.+.++++||.+.|+...+-+.. ++.+.-.-+.|..++ |+ T Consensus 244 l~----------~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~-~sn~e~~~~~f~~~~--l~ 310 (409) T protein:vir:84 244 MS----------AGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSW-GTGIEEQGINFVRHT--LL 310 (409) T ss_pred cC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccc-cchHHHHHHHHHHHH--HH Confidence 11 24445554321 122334555677899999999999999753322221 122222223355432 33 Q ss_pred HHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHh Q lcl|NC_021072. 391 KRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVL 470 (533) Q Consensus 391 ~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL 470 (533) --+. .+.+.|-..| ..| ..+.|. +..+...++ ..|++.+..+-.- -++|.+-++. ++ T Consensus 311 P~~~-~ie~~l~~~L-~~g-------------~~i~fd----~~~l~~~d~-~~~~~~~~~~~~~--G~~t~NE~R~-~~ 367 (409) T protein:vir:84 311 PWLR-CIEQALDTFL-PRG-------------QFVKFN----VDGLMRGDV-TARFTAYQMGLQN--GIWSVNEVRA-WE 367 (409) T ss_pred HHHH-HHHHHHHHhc-cCC-------------CeEEEe----chhhhccCH-HHHHHHHHHHHhC--CCcCHHHHHH-Hh Confidence 2222 2233333332 223 234454 334443332 4566666655433 3677777664 35 Q ss_pred CCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccccccCCccccchhc Q lcl|NC_021072. 471 KQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDAKRGE 532 (533) Q Consensus 471 ~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 532 (533) ++.+- +.-++-+ .+... .|-....+..+..+..|.+ +++..+ T Consensus 368 g~~p~--~ggD~~~---~~~n~--~~~~~~~~~~~~~~~~~~~-------------~~~gn~ 409 (409) T protein:vir:84 368 DAPPI--PEGDIHL---QPMNF--VPLGYVPPEEPAQEPQPNS-------------ATEGNK 409 (409) T ss_pred CCCCC--CCcceee---ecccc--cccccCCccccCcCCCCCC-------------ccCCCC Confidence 55431 1100000 00000 0000011111111111111 111111 No 74 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=98.02 E-value=7e-06 Score=48.86 Aligned_cols=450 Identities=11% Similarity=0.033 Sum_probs=207.3 Q ss_pred cceeecc--cccccccch-------hhhhhHH-HHHHHHHHhhhhcchhhhHHHHhh--------cceeeecCCCceEE- Q lcl|NC_021072. 31 SQPIVGG--GYYGYSVDF-------DGTVRNE-YELITRYREMVLQPECDSAVDDIV--------NETICGNFDDVPVE- 91 (533) Q Consensus 31 ~~~~~~~--~~~~~~~~~-------~~~~~~~-~~LI~~YR~m~~~pEvd~AvdeIv--------neaiv~d~~~~~v~- 91 (533) -++|..+ .|...-.++ -..+-++ ..-..+|+.+..+++-+..+..+- +--+|++--..+|. T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~e~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~~iVd~ 80 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDDVVDKVNGLYQQLVDRTPRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSAKAVDT 80 (504) T ss_pred CCccCCcccccccccCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHHhccccchhccccccHHHHHHhhccCcHHHHHHH Confidence 1112111 111110001 0111111 111123333433433333332210 00111111111110 Q ss_pred ------EE-eccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcChhhce Q lcl|NC_021072. 92 ------VE-LSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIR 164 (533) Q Consensus 92 ------v~-l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~ 164 (533) ++ ....+.++. .++...+...=+|+....+..+.-++.||-|. .|. ......+..-++.++|+..- T Consensus 81 ~a~rl~~~Gf~~~d~~~~-----~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~-~v~-~~~d~~~~~~I~~~sP~~~~ 153 (504) T protein:vir:99 81 LARRCNLESFVWPDGDYG-----SIGGPDVWDENFFATKANNAMVSSLIHGPAFL-INT-EGGAGEPDSLIHVKSAMQAT 153 (504) T ss_pred HHhhhccceeeCCCCChh-----hHHHHHHHHhcChhhHHHHHHHHHHhhCceeE-EEe-cCCCCCceeEEEEeccceeE Confidence 11 011111221 12345566666788889999999999999774 344 22234455667888998774 Q ss_pred ehhhccCCCcCceeEEecc--ceeeccchhceeccccccc-------------cccCCcceeccchhhccccccccCCCC Q lcl|NC_021072. 165 KVTEYQQKRPEQLRGEDIN--TQLTQKAAEYYLYNPKGLK-------------NSTNQGMKIATDSVTYCHSGIQDLNKN 229 (533) Q Consensus 165 ~vr~~~~~~~~~~~~~~~~--~~~~~~~~e~~~y~p~~~~-------------~~~~~~~kI~~dai~y~hsGl~d~~~~ 229 (533) -+. +...+...+.... ...-+......+|.|.... ..+|.. -+| .+.|++..-.+...| T Consensus 154 ~iy---D~~~~~~~~a~~~~~~d~~g~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~-gvP--vV~~~n~~~~~~~~G 227 (504) T protein:vir:99 154 GEW---NSRRNAMDSLLSITSRDAEGHPTGIALYEDGVTVTADMDDDGDWHADVRTHKL-GVP--VEVLPYKPREDRPLG 227 (504) T ss_pred EEE---eCCCCceeEEEEEEEecCCCeEEEEEEEcCCcEEEEEEcCCceeeeccccCCC-Ccc--eEEecccccCccccC Confidence 332 2222221111000 0000111111223332211 011111 133 455555433222211 Q ss_pred -c-cchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccc-ccc Q lcl|NC_021072. 230 -M-TLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIK-DDK 306 (533) Q Consensus 230 -~-i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~-~d~ 306 (533) . |.-.|-..+..+| +.+-+.++.=..+=.|.|-|+=.+-..++... |+.. .-+ T Consensus 228 ~sei~~~v~~l~Da~~--~~~~~~~~~~e~~a~p~r~i~G~~~~~~~~~d----------------------~~~~~~~~ 283 (504) T protein:vir:99 228 SSRITRPVMSLQQRAL--KGCIRMDGHADVYSFPQLILLGADAKNFRNKD----------------------GSMKPAWQ 283 (504) T ss_pred cccchhhHHHHHHHHH--HHHHHHHHHHHHhcchhhhhccCCcccccccc----------------------ccccchhh Confidence 1 1112233333333 45666677777777777766533322211111 1100 000 Q ss_pred ccchhHhhhcccccC-----CCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhh Q lcl|NC_021072. 307 KFMSMLEDFWLPRRE-----GGRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEV 380 (533) Q Consensus 307 ~~msmlEDywLpRRe-----ggrgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDEl 380 (533) ..++ .=+.||.-+ ++-++++..++++. |+-. +-++-.-..+....++|..-|+..+.-|-..+..|.-.+. T Consensus 284 ~~~~--~i~~~~~~~~~~~~~~~~~~~~q~~~~~-l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~ 360 (504) T protein:vir:99 284 IALA--RVFALPDDEDEPDAARARADVKQFPASS-PQPHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASRE 360 (504) T ss_pred hhhh--hhhcCCCccccccccCccceeeecCCCC-hHHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHH Confidence 0000 112234321 23356788888875 5433 3355555566666899988886543333334556777888 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC--HhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_021072. 381 KFQKFIARLRKRFSELFMDLLKTQLILKGVMS--LEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGK 458 (533) Q Consensus 381 kF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t--~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGk 458 (533) ...+-+.+.|++|..-+.+++|.-|.+.+... ..+|. .+.+.|..-..=+ +.++.+++..+..-+.. T Consensus 361 ~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~----~~~v~w~d~~~~s-------~a~~aDa~~Kl~~ag~~ 429 (504) T protein:vir:99 361 DLIAEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEWK----TIDSKFRSPLYLS-------KAAQADAGAKMLGAGPE 429 (504) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccc----cceeEecCCCccC-------HHHHHHHHHHHHhhccc Confidence 88888999999999999999999887766543 23343 4677775332222 25677778887776666 Q ss_pred cccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccc--cCCCCCCCCCCCCccccccccCCccccchh Q lcl|NC_021072. 459 YFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMD--PAMDPGNAPPADDMSAQEGPAVDAGDAKRG 531 (533) Q Consensus 459 y~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~--~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 531 (533) +++...+..+.|++|++||+.+....+++...+.+....+... ++.+..++.+..+.+.+..++..+..+-+| T Consensus 430 l~~~~~~l~~~lg~~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~e~a~~~~~~~~~~p~~~~ 504 (504) T protein:vir:99 430 WLKETEVGLELLGLTPQQAKRALAERRRASSVSIIEALNRRQQEAATAGEDQDQGAGEPPANEPPAALGRPTLVG 504 (504) T ss_pred cccchHHHHhhcCCCHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCCCCcCCCCCCCCCCCccCCCcccCC Confidence 7777666677789999999977776666554443221111110 011111122222223333333333333344 No 75 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=98.01 E-value=7.1e-06 Score=48.81 Aligned_cols=398 Identities=13% Similarity=0.136 Sum_probs=175.0 Q ss_pred CCccccceeeeccccccccC-CCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcc Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGP-SFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNE 79 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~-s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvne 79 (533) |+ .|+-+=.....+++.. ....+.... . ..+.+.......++ ..+...++|-|.+||+-|.+. T Consensus 1 m~--~~~~~~~~~~~~~~~~~~~~~~~~~~--~----~~~~~~~~~~~~~v--------~~~~a~~~~~v~~~i~~ia~~ 64 (412) T protein:vir:26 1 MN--VIAKENIVTRIKKKLIDNWIDQSTSK--L----YDFSPWKNRSFWGV--------INNTLETNETIFSAITKLSNS 64 (412) T ss_pred Cc--cchhhhhhhhhhhhHhhhhhcccccc--c----ccccccCCcccccc--------chhhhhccHHHHHHHHHHHHh Confidence 43 1211000001111110 111111000 0 01111111111111 123456889999999999988 Q ss_pred eeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHH-hcchhhhhHH----HHhhhhcCceeeeeeecCCCCCCCeEE Q lcl|NC_021072. 80 TICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRL-LDFENRSYEI----FRRWYVDGRLFYHKVIDPKNPRGGLTE 154 (533) Q Consensus 80 aiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~l-L~f~~~~~~~----fR~WYvDGri~~hkvid~~~~~~gI~e 154 (533) +-. .|+.+- ...+. . ......+++. =+-...+.++ +..+.+.|.-|+.++-+. .+.+++ T Consensus 65 iA~-----lp~~~~-~~~~~---~----~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~---~G~~~~ 128 (412) T protein:vir:26 65 MAS-----LPLKMY-EDYKV---V----NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI---YHQPSK 128 (412) T ss_pred Hhh-----CceeEe-ecccc---c----cchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECC---CCcEEE Confidence 664 344332 11111 1 1112223321 1222344444 445788899999877654 455899 Q ss_pred EEEcChhhceehhhccCCCcCceeEEeccceeeccchhceecc-cccc--ccccCCcceeccchhhccccccccCCCCcc Q lcl|NC_021072. 155 LRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYN-PKGL--KNSTNQGMKIATDSVTYCHSGIQDLNKNMT 231 (533) Q Consensus 155 lr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~-p~~~--~~~~~~~~kI~~dai~y~hsGl~d~~~~~i 231 (533) |.+|+|..++..+.- +..... |.|. ..+. .+.....++|+.. -..++-.+ T Consensus 129 L~~l~~~~v~v~~~~-----~~~~~~-------------y~~~~~~g~~~~~~~~evih~~~~---------~~~~~~~G 181 (412) T protein:vir:26 129 LFLLNPDVVEMLIEN-----QSRELY-------------YSIHAATGNKLIVHNMDMLHFKHI---------VASNMVQG 181 (412) T ss_pred EEEEcCceeEEEEeC-----CCcEEE-------------EEEEcCCceEEEEccccEEEeCCC---------CCCCCccc Confidence 999999999754332 111111 1111 1111 1122233333211 01122234 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchh Q lcl|NC_021072. 232 LSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSM 311 (533) Q Consensus 232 ~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msm 311 (533) +|-|..|.+.......+++. -++.-.+.| . +.....+.+-+.++++..+.+.+.+. ..|.+ + . T Consensus 182 ~s~i~~~~~~i~~~~a~~~~-~~~~~~~~~-~-~i~~~~~~l~~e~~~~~~~~~~~~~~-------~~g~~------~-v 244 (412) T protein:vir:26 182 ISPIDVLKNTTDFDNAVRTF-NLTEMQKPD-S-FMLKYGSNVGKEKRQQVLEDFKQYYE-------ENGGI------L-F 244 (412) T ss_pred ccHHHHHHHHHHHHHHHHHH-HHHhcCCCC-c-eEEecCCCCCHHHHHHHHHHHHHHhh-------cCCCe------e-e Confidence 56777777666655555555 344444443 3 33334567777777666665544332 22321 1 1 Q ss_pred HhhhcccccCCCCccceeecCCCCCcchH---HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HH Q lcl|NC_021072. 312 LEDFWLPRREGGRGTEISTLPGGQNLGEL---EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IA 387 (533) Q Consensus 312 lEDywLpRReggrgTEIsTLpGg~nLgei---~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kf-i~ 387 (533) + ..|.+++.|. .+..++ +=..|-...+.++++||...|+..++-+.+..++..+. |.++ |. T Consensus 245 l----------~~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~~~---f~~~~l~ 309 (412) T protein:vir:26 245 Q----------EPGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRF---YLQHTLL 309 (412) T ss_pred c----------CCCceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHHHH Confidence 1 1356777774 223333 33335668899999999999987666666667666665 6665 33 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHH Q lcl|NC_021072. 388 RLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRR 467 (533) Q Consensus 388 rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k 467 (533) -+..+ +.+.|-..| +++.+|. ....|.|. ..++.... +.+|++.+..+-.- -+++.+-++. T Consensus 310 P~~~~----ie~~ln~kL-----l~~~~~~---~~~~~~fd----~~~l~~~d-~~~~~~~~~~~~~~--G~~t~NE~R~ 370 (412) T protein:vir:26 310 PIVKQ----YEEEFNRKL-----LTKTDRE---KNRYFKFN----VKSYLRAD-SATQAEVYFKAVRS--GYYTINDIRE 370 (412) T ss_pred HHHHH----HHHHHHhhc-----CCccccc---CcceEEee----chhhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH Confidence 33332 233333333 3444443 22345555 33333332 34566666555332 4667777664 Q ss_pred HHhCCCHHHHHHHHHHHHHhhhcCCCCCCC-cccccCCCCCCCCCCCCc Q lcl|NC_021072. 468 QVLKQTDQEIKEIDKQIDSEREAGLIVDPM-AEMDPAMDPGNAPPADDM 515 (533) Q Consensus 468 ~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~-~~~~~~~~~~~~~~~~d~ 515 (533) +|++.+-+ --++-+- .....+ .+ ..+.+....||+...+++ T Consensus 371 -~~gl~p~~--ggD~~~~---~~n~~~-~~~~~~~~~~~~gG~~n~~e~ 412 (412) T protein:vir:26 371 -WEDLPPVE--GGDKPLI---SGDLYP-IDTPLELRKSLKGGDKNVNES 412 (412) T ss_pred -HhCCCCCC--CcCeeee---cccccc-cccchhhcccccCCCCCcCCC Confidence 35554321 0000000 000000 00 000001111111111111 No 76 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=97.97 E-value=8.7e-06 Score=48.33 Aligned_cols=384 Identities=13% Similarity=0.131 Sum_probs=164.2 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |.-.+|.|--.. ..... .+.+......+......+.+.+ ..+.. +. -+..+++|-|.+||+-|.+.+ T Consensus 1 m~m~~~~~~~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~----~~g~~------v~-~~~al~~~~v~~~v~~ia~~i 67 (392) T protein:vir:74 1 MILPILNFINQT-NDPPE-AGSVQSYFPDGNDAQIMESLLG----DNNEW------VS-ARAALRNSDLFSIILQLSSDL 67 (392) T ss_pred Ccchhhhhhhcc-cCccc-ccccccccccCchhhhhhhccC----CCCcc------cc-hhhhhcchHHHHHHHHHHHhh Confidence 777777652221 11111 1111111111111111111111 00110 11 123457899999999999885 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhh----HHHHhhhhcCceeeeeeecCCCCCCCeEEEE Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSY----EIFRRWYVDGRLFYHKVIDPKNPRGGLTELR 156 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~----~~fR~WYvDGri~~hkvid~~~~~~gI~elr 156 (533) -. .|+.+. .. .. . .+++.=|-.-.+. .++..+++.|.-|..++-|. .+.+++|. T Consensus 68 a~-----lp~~~~--~~----~~-~-------~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~---~G~~~~L~ 125 (392) T protein:vir:74 68 AI-----VKINAE--KK----KN-Q-------GIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNA---NGADMKWE 125 (392) T ss_pred cc-----Cceeec--cc----hh-h-------hhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECC---CCcEEEEE Confidence 43 344432 10 01 1 1222222222333 34456888899998877664 45699999 Q ss_pred EcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCC-CccchhH Q lcl|NC_021072. 157 YIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNK-NMTLSHL 235 (533) Q Consensus 157 ~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~-~~i~syL 235 (533) +|+|.+++.++.. .+....+...... +.......+. ....++|+.. ..++ -.++|-| T Consensus 126 ~i~~~~v~v~~~~---~~~~~~y~~~~~~--~~~~~~~~~~-------~~evih~~~~----------~~~~~~~G~s~i 183 (392) T protein:vir:74 126 YLRPSQVNTYYFE---YENGMYYNITFDD--PKIEPILQAP-------QSDLIHMKLL----------SIDGGKTGISPL 183 (392) T ss_pred EEcCceeEEEEcC---CCceEEEEEEecC--CccceeEEEc-------CccEEEecCC----------CCCCccccccHH Confidence 9999999654322 1111111100000 0000011111 1223333322 1222 2356999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhh Q lcl|NC_021072. 236 HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDF 315 (533) Q Consensus 236 ~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDy 315 (533) ..|+..+.....++....=+=---+--+-+..++-+..+..++.+.++ ..|... ...|. .+ .++ T Consensus 184 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~~~---~~~~~~----~n~g~------~~-vl~-- 247 (392) T protein:vir:74 184 YSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRS---RSFMKR----SRSGG------PV-VLD-- 247 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHH---HHHhcc----ccCCC------ee-ecC-- Confidence 999999999888887766555555556677777766555544433333 233221 11121 11 221 Q ss_pred cccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHH-HHHHHHHHH Q lcl|NC_021072. 316 WLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQK-FIARLRKRF 393 (533) Q Consensus 316 wLpRReggrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~K-fi~rLr~~f 393 (533) .|++++.|.-...-.| ++=.+|..+...++++||...++..+..+ ...+-.+. |.. .+.-+.+++ T Consensus 248 --------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~--~~~e~~~~---~~~~~l~p~~~~i 314 (392) T protein:vir:74 248 --------DLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ--SSIQQISG---MYASALNRYLRPA 314 (392) T ss_pred --------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc--cHHHHHHH---HHHHHHHHHHHHH Confidence 2566776643222222 45567888999999999999996533221 11111222 332 233333333 Q ss_pred HHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhc-cccccHHHHHHHHhCC Q lcl|NC_021072. 394 SELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYV-GKYFSIDYMRRQVLKQ 472 (533) Q Consensus 394 s~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~v-Gky~S~~~i~k~IL~~ 472 (533) ..-+ ...|. + .+.+++. .+-+... ..+... ++..+ +..+|..-++ T Consensus 315 e~~l----~~~l~-----~---------~~~~~~~------~~~~~d~-~~~~~~---~~~l~~~g~~t~near------ 360 (392) T protein:vir:74 315 ISEL----EYKLS-----D---------HISVNMR------PAIDPLG-DNYLST---ISTATRWGALAENQAT------ 360 (392) T ss_pred HHHH----HHhcc-----c---------hhcccch------hhhcCCH-HHHHHH---HHHHHhCCCcCHHHHH------ Confidence 3333 22221 1 1111111 0000000 011111 11111 1233333333 Q ss_pred CHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCcccccccc Q lcl|NC_021072. 473 TDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPA 522 (533) Q Consensus 473 tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~ 522 (533) ++. ...|.. |+ +-+..+ +.+|..++..++... T Consensus 361 ------~~~------~~~g~~--pn--e~r~~e--nl~~~~~Gd~~~p~p 392 (392) T protein:vir:74 361 ------FVL------QEAGYI--PK--DLPAPE--NTNKKTTGQSNEPVP 392 (392) T ss_pred ------HHH------HhCCCC--cc--ccchhc--CCCCCCCCCCCCCCC Confidence 221 123432 32 112221 233333333222111 No 77 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=97.94 E-value=9.8e-06 Score=48.04 Aligned_cols=395 Identities=11% Similarity=0.078 Sum_probs=179.9 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |.-.+|+.+-.... .++..... ....+.. .....+ +.+.. ..-|..++.+|-|.+||+-|.+.+ T Consensus 11 ~~m~~F~~~~~~~~--~~~~~~~~-~~~~~~~-~~~~~~---~~~~~---------~~~~~~~~~~~~v~~cI~~ia~~i 74 (413) T protein:vir:96 11 KNLKFFNNKRSPTE--ESKAKDEI-PKAPQVV-MTLPNF---FKELI---------SDGYTKLSDSPEVRMAVDCIADLV 74 (413) T ss_pred hcCCccccCCCcch--hhhhhccc-ccccccc-ccchhh---Hhhhc---------cchhHHHhhchHHHHHHHHHHHhh Confidence 22345555322211 10100000 0111000 000101 11111 122445678999999999999886 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcc----hhhhhHHH----HhhhhcCceeeeeeecCCCCCCCe Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDF----ENRSYEIF----RRWYVDGRLFYHKVIDPKNPRGGL 152 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f----~~~~~~~f----R~WYvDGri~~hkvid~~~~~~gI 152 (533) -- .|+.+--..-+..+.+ . ..++.+|+. .-.+.++. ..+...|.-|.-++-|. ...-+ T Consensus 75 a~-----~~~~~~~~~~~~~~~~----~---~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~--~g~~~ 140 (413) T protein:vir:96 75 SN-----MTIQLMQNGETGDKRI----K---NDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQV--SGDKI 140 (413) T ss_pred cc-----CceEEEEecCCCcccc----c---cHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcC--CCCce Confidence 43 3444421111111111 1 234444432 23455554 44567799998877653 22348 Q ss_pred EEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCccc Q lcl|NC_021072. 153 TELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTL 232 (533) Q Consensus 153 ~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~ 232 (533) ++|.++||..++.... ++... |.|...+..+.+.+.++|....-.. ++-.+. T Consensus 141 ~~L~~l~~~~v~~~~~-----~~~~~---------------y~~~~~~~~~~~~evih~k~~~~~~--------~~~~G~ 192 (413) T protein:vir:96 141 IGLTPISPYKVTFNVS-----DDDLD---------------YSITFDNKEYDPSTLLHFVLNPSIE--------RPFIGT 192 (413) T ss_pred EEEEEecCceeEEEEc-----CCeEE---------------EEEeecCcEEchhhEEEEeccCCCC--------Cccccc Confidence 8999999999965321 11111 1111122222223344443221100 111356 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhH Q lcl|NC_021072. 233 SHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSML 312 (533) Q Consensus 233 syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msml 312 (533) |.|..|.+++.....+++...=+--.-+.-+-++.++ ++|.+..+++..+.+...|..- ...|. .+ .+ T Consensus 193 s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g~----~n~g~------~~-vl 260 (413) T protein:vir:96 193 GYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVD-SDSDELSDEEGRENFEEMYLKR----KEAGK------PW-II 260 (413) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeC-CCCCHHHHHHHHHHHHHHhcCc----cccCc------ee-ee Confidence 9999999999988888887665555566667777877 5677777766555554444321 01111 11 11 Q ss_pred hhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHH Q lcl|NC_021072. 313 EDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKR 392 (533) Q Consensus 313 EDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~ 392 (533) + -+|...+++..+. -..+.-++-..|-.+.+.++++||...|+.. .+++-+. ..|.++ .|+- T Consensus 261 ~------~~~~~~~~~~~~~-~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~------~~~~~~~--~~~~~~--~l~P- 322 (413) T protein:vir:96 261 P------EGMVNVQQIKPLT-LNDLAINDAVTLDKKTVAGIFGVPAFLLGVG------TYNKDEF--NNFINT--KIMS- 322 (413) T ss_pred c------CCcccccccccCC-hhHHHHHHHHHHHHHHHHHHhCCCHHHcCCC------cchHHHH--HHHHHH--HHHH- Confidence 1 0111112333222 1233334555677899999999999999531 1111111 124332 2333 Q ss_pred HHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCC Q lcl|NC_021072. 393 FSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQ 472 (533) Q Consensus 393 fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~ 472 (533) +...+.+.|-..|+-+| ..+.|. +.++.... +.+|.+.+..+-.- -++|.+-++. .+++ T Consensus 323 ~~~~ie~~ln~~ll~~~-------------~~~~fd----~~~ll~~d-~~~~~~~~~~~~~~--G~~t~NE~R~-~~g~ 381 (413) T protein:vir:96 323 IAQVIQQTYNKLIVEED-------------MYFSLN----PRSLYNYS-LTEMVSAGAQMTQL--NALRRNEFRN-WVGM 381 (413) T ss_pred HHHHHHHHHHHhhCCCC-------------cEEEEe----chhhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH-HhCC Confidence 22334555555554332 345565 44444333 34566666554332 2455555542 2333 Q ss_pred CHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccccccCCcccc Q lcl|NC_021072. 473 TDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDA 528 (533) Q Consensus 473 tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 528 (533) .+ .|+.+ .=.-..+..|.++...+. ....+|| T Consensus 382 ~p--------------------~~~gd--~~~~~~n~~~~~~~~~~~--~~~~~dt 413 (413) T protein:vir:96 382 PP--------------------DAEMD--DLLVLENYLQQKDLVNQK--KLIQDET 413 (413) T ss_pred CC--------------------CCCcc--eeeecccccchhhccccc--CCCCCCC Confidence 22 12111 111112222333222221 1233444 No 78 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=97.92 E-value=1.1e-05 Score=47.83 Aligned_cols=393 Identities=14% Similarity=0.132 Sum_probs=170.8 Q ss_pred CCccccceeeeccccccccCCCCCCC----CCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHh Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKD----SMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDI 76 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~----~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeI 76 (533) |+- ++...+.-+....+ ...+ ... +.+.......++. .+...++|-|.+||+-| T Consensus 1 ~~~---------~~~~~~~~~~~~~~~~~~~~~~-~~~----~~~~~~~~~~~v~--------~~~~~~~~~V~~ci~~I 58 (409) T protein:vir:93 1 MAK---------ENIVTRIKKKLIDNWIDQSTSK-LYD----FSPWKNRSFWGVI--------NNTLETNETIFSAITKL 58 (409) T ss_pred CCc---------cchhhhhhhhhhhhhhcccccc-ccc----cccccCccccccc--------hhhhhccHHHHHHHHHH Confidence 432 22222111111000 0000 000 0010000000111 12345788999999999 Q ss_pred hcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcc----hhhhhHH----HHhhhhcCceeeeeeecCCCC Q lcl|NC_021072. 77 VNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDF----ENRSYEI----FRRWYVDGRLFYHKVIDPKNP 148 (533) Q Consensus 77 vneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f----~~~~~~~----fR~WYvDGri~~hkvid~~~~ 148 (533) .+.+-. .|+.+- ...+ ..+ . .+..+|+- ...+.++ +..+.++|.-|+.++-|. T Consensus 59 a~~ia~-----lp~~~~-~~~~---~~~----~---~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~--- 119 (409) T protein:vir:93 59 SNSMAS-----LPLKMY-EDYK---VVN----T---EVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI--- 119 (409) T ss_pred HHhhhh-----CceeEe-eccc---ccc----c---hHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECC--- Confidence 988653 344432 1111 111 1 23333432 2344444 445678899998877653 Q ss_pred CCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccc-ccc--ccccCCcceeccchhhcccccccc Q lcl|NC_021072. 149 RGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNP-KGL--KNSTNQGMKIATDSVTYCHSGIQD 225 (533) Q Consensus 149 ~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p-~~~--~~~~~~~~kI~~dai~y~hsGl~d 225 (533) .+-+++|.+|+|..++..+.- +..... |.+.. .+. .+.....++|+... . T Consensus 120 ~G~~~~L~~l~~~~v~~~~~~-----~~~~~~-------------y~~~~~~g~~~~~~~~eVih~r~~~---------~ 172 (409) T protein:vir:93 120 YHQPSKLFLLNPDVVEMLIEN-----QSRELY-------------YSIHAATGNKLIVHNMDMLHFKHIV---------A 172 (409) T ss_pred CCcEEEEEEEcCceeEEEEeC-----CCcEEE-------------EEEEcCCceEEEEccccEEEeCCCC---------C Confidence 445899999999999653321 111111 11111 111 01122223332110 1 Q ss_pred CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccc Q lcl|NC_021072. 226 LNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDD 305 (533) Q Consensus 226 ~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d 305 (533) .++-.+.|-|..|.+++.....++..- +.. ..+|-.-+ ..--+.|.+.+++...+.+.+.|. ..|.+ T Consensus 173 ~~~~~G~s~i~~~~~~i~~~~~~~~~~-~~~-~~~~~~~i-~~~~~~l~~e~~~~~~~~~~~~~~-------~~g~~--- 239 (409) T protein:vir:93 173 SNMVQGISPIDVLKNTTDFDNAVRTFN-LTE-MQKPDSFM-LKYGSNVGKEKRQQVLEDFKQYYE-------ENGGI--- 239 (409) T ss_pred CCccccccHHHHHHHHHHHHHHHHHHH-HHh-cCCCCceE-EecCCCCCHHHHHHHHHHHHHHhh-------cCCCe--- Confidence 111234466666666666555555542 333 33343323 334456666666655555444332 12321 Q ss_pred cccchhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHH Q lcl|NC_021072. 306 KKFMSMLEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQK 384 (533) Q Consensus 306 ~~~msmlEDywLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~K 384 (533) + .+ + .|.+++.|.-. ..+.-++-..|-...+.++++||...|+..++-+.+...+..+. |.. T Consensus 240 ---~-vl--------~--~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~---f~~ 302 (409) T protein:vir:93 240 ---L-FQ--------E--PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRF---YLQ 302 (409) T ss_pred ---e-ec--------C--CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHH Confidence 1 11 1 25667776421 11222333446778899999999999987665566666565554 666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHH Q lcl|NC_021072. 385 FIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDY 464 (533) Q Consensus 385 fi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~ 464 (533) ++ |+-.+. .+.+.|-..| +++.++. ....|.|. ..++.... +..|.+.+..+-.- -++|..- T Consensus 303 ~~--l~P~~~-~ie~~l~~~L-----l~~~~~~---~~~~~~fd----~~~ll~~d-~~~~~~~~~~~~~~--G~~T~NE 364 (409) T protein:vir:93 303 HT--LLPIVK-QYEEEFNRKL-----LTKTDRE---KNRYFKFN----VKSYLRAD-SATQAEVYFKAVRS--GYYTIND 364 (409) T ss_pred HH--HHHHHH-HHHHHHHhhc-----CCccccc---CcceEEee----chhhhccC-HHHHHHHHHHHHhC--CCcCHHH Confidence 53 333221 2333344443 3445554 23455665 33444333 35667666655433 4677777 Q ss_pred HHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCc Q lcl|NC_021072. 465 MRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDM 515 (533) Q Consensus 465 i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~ 515 (533) ++.. |++.+-+ --++-+- .....+-....+.+....||+...+++ T Consensus 365 ~R~~-~g~~p~~--ggD~~~~---~~n~~~~~~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:93 365 IREW-EDLPPVE--GGDKPLI---SGDLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred HHHH-hCCCCCC--CcCeeee---cccccccccchhhcccccCCCCCcCCC Confidence 7743 5554321 0000000 000000000000011111111111111 No 79 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=97.91 E-value=1.1e-05 Score=47.70 Aligned_cols=445 Identities=15% Similarity=0.121 Sum_probs=196.2 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceee-cccccccccchhhhhhHHHHHHHHHHhhhh---------cchhh Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIV-GGGYYGYSVDFDGTVRNEYELITRYREMVL---------QPECD 70 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~-~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~---------~pEvd 70 (533) |..+| +.+-..+-+.--.|....+...... ...+... ... -..+.+++..|=.-.+ .++-. T Consensus 1 ~~~~~-----~~~~~~~~~~~~~p~~~~~~~~~~~l~~~l~~~---~~~-~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~ 71 (501) T protein:vir:25 1 MTVPV-----DVIADAPAADVEFPEDSMSREQLGALVADMWRL---HIS-ERQWLDRIYEYTKGLRGRPEVPEGASDEVK 71 (501) T ss_pred Ccccc-----hhhhccCcccccCCcccCChHHHHHHHHHHHHH---HHH-HHHHHHHHHHHHhcCCCchhccccCChhhh Confidence 33322 1111111111111222222111000 0001110 011 1113333333311110 01100 Q ss_pred hHHHHhh-cc-eeeecCC-C--ceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecC Q lcl|NC_021072. 71 SAVDDIV-NE-TICGNFD-D--VPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDP 145 (533) Q Consensus 71 ~AvdeIv-ne-aiv~d~~-~--~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~ 145 (533) .-...+| |- ..+.|.- + .+....+.+.+.. ++...+...-+|+...++.++.-++-|+-|+..-.|. T Consensus 72 ~~~~~~v~n~~~~ivd~~a~~l~~~gf~~~d~~~~--------~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de 143 (501) T protein:vir:25 72 ELAKLSVKNVLSLVRDSFAQNLSVVGYRNALAKEN--------DPAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTD 143 (501) T ss_pred hhHhhhhcChHHHHHHHHHhhhcccceecCCccch--------HHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCC Confidence 0001111 21 1111100 0 0111112121111 1234455556688899999999999999887755543 Q ss_pred CCCCCCeEEEEEcChhhceehhhccCCCcCc------eeEEeccceeeccchhceeccccccc--------c-------- Q lcl|NC_021072. 146 KNPRGGLTELRYIDPRKIRKVTEYQQKRPEQ------LRGEDINTQLTQKAAEYYLYNPKGLK--------N-------- 203 (533) Q Consensus 146 ~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~------~~~~~~~~~~~~~~~e~~~y~p~~~~--------~-------- 203 (533) + | ..++.++|+..-.| +..+... .+++.-... .+.....-+|.+...+ . T Consensus 144 --~--~-~~i~~~sp~~~~~i---y~D~~~~~~~~~ai~~~~~~~~-~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~ 214 (501) T protein:vir:25 144 --E--G-PVFRTRSPRQILAV---YADPSVDAWPQYALETWVAQKD-AKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQA 214 (501) T ss_pred --C--C-CeEEEeccccEEEE---EecCCCCcceeEEEEEEeeccc-cCcceeEEEecCeeEEEEecCceeeeecccccc Confidence 2 2 24667788877432 1111111 111110000 0000001112221100 0 Q ss_pred ---ccCCcceeccchhhcccc--cccc----CC----CCccchhHHHHHHHHHHH-HHHHHHHHHHHHhcCccceEEEcc Q lcl|NC_021072. 204 ---STNQGMKIATDSVTYCHS--GIQD----LN----KNMTLSHLHKAIKAVNQL-RMIEDSLVIYRLSRAPERRIFYID 269 (533) Q Consensus 204 ---~~~~~~kI~~dai~y~hs--Gl~d----~~----~~~i~syL~~AiK~~NqL-rm~EDalVIyRi~RAPeRrvfyID 269 (533) ........++......+- |.+. +| .+.+.|-++..+...+-+ +++-+.+++-..+-.|.|-+.=.+ T Consensus 215 ~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G~~ 294 (501) T protein:vir:25 215 TQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGRDADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVISGWT 294 (501) T ss_pred ccccccccccccccccccccCCccceeeEeccCccccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHhCCC Confidence 000001111110000011 1110 11 233455555433222222 355677788888888887776544 Q ss_pred CCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHH Q lcl|NC_021072. 270 VGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKK 349 (533) Q Consensus 270 vGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~k 349 (533) ....+..++ +.+ ..|+. +| -+++|.++|+..-=+-++-++-.-.. T Consensus 295 ~~~~~~~~~----------~~~----------------------~i~~~--~~-~~~~~~q~~~~~~~~~~~~l~~~i~~ 339 (501) T protein:vir:25 295 GSKAEVLKA----------SAL----------------------RVWTF--ED-PEVKAQAFPPASVEPYNLILEEMLQH 339 (501) T ss_pred CCccchhhh----------ccc----------------------ceecc--CC-CCceEEEecccChHHHHHHHHHHHHH Confidence 333321111 111 12332 12 23567788875422233445555556 Q ss_pred HHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEec Q lcl|NC_021072. 350 LYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIA 429 (533) Q Consensus 350 Ly~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~ 429 (533) +...-++|..-|+..++ |. .+..|.--+....+-+.+.|+.|..-+.+++|.-+.++|.....+|. .|.+.|.. T Consensus 340 i~~~s~~P~~~~~~~~~-N~-Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~~~~~~~~----~i~v~w~~ 413 (501) T protein:vir:25 340 VAMVAQISPAQVTGKMI-NV-SAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDDDPDTAADS----GAEVLWRD 413 (501) T ss_pred HHhhcCCChhhhccccC-Ch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccce----eeeEEecC Confidence 66677889776664322 22 34456666667788899999999999999999999899864333332 46777753 Q ss_pred cchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccc-ccCCCCC- Q lcl|NC_021072. 430 DNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEM-DPAMDPG- 507 (533) Q Consensus 430 Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~-~~~~~~~- 507 (533) -..= -+.+..+++..+..- | +|.+++...++++|++||+++.++.++|...+++..+.... .|..+.. T Consensus 414 ~~~~-------s~~~~ada~~kl~~~-g--is~et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 483 (501) T protein:vir:25 414 TEAR-------SFGAVVDGITKLASA-G--IPIEHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDKLLSNEPAPVPPPPP 483 (501) T ss_pred CCCC-------CHHHHHHHHHHHHhc-C--CCHHHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHHhhccCcCCCCCCCC Confidence 3221 124566777776654 4 69999999999999999998888888877766544332211 1111111 Q ss_pred --CCCCCCCcccc-cccc Q lcl|NC_021072. 508 --NAPPADDMSAQ-EGPA 522 (533) Q Consensus 508 --~~~~~~d~~~~-~~~~ 522 (533) ..++.+.+..+ ++++ T Consensus 484 ~~~~~~~~~~~~~~~~g~ 501 (501) T protein:vir:25 484 QAAAQALNEGGVNGNGGA 501 (501) T ss_pred CCCccccccccCCCCCCC Confidence 11111111111 1111 No 80 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=97.87 E-value=1.4e-05 Score=47.22 Aligned_cols=445 Identities=11% Similarity=0.057 Sum_probs=179.4 Q ss_pred ccccC-CCCCC-CCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcch---------------------hhhH Q lcl|NC_021072. 16 VPKGP-SFVQK-DSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPE---------------------CDSA 72 (533) Q Consensus 16 ~~~~~-s~~~~-~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pE---------------------vd~A 72 (533) ..+.. +..+. .+.+... ....-... ......+|+.+..+.+ +.+- T Consensus 1 ~~~~~~~~~~~~~~~~~~~----------~~~~i~~~--~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~ 68 (489) T protein:vir:99 1 MLQEDFEAIDYESKLWIDQ----------LKNYISRF--KAEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDF 68 (489) T ss_pred CCccceeeeCCCCCCCHHH----------HHHHHHHH--HHHHHHHHHHHHHHhcccCccccccccccccCCcceeecch Confidence 00000 00000 0000000 00000000 1112222333332222 1112 Q ss_pred HHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCe Q lcl|NC_021072. 73 VDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGL 152 (533) Q Consensus 73 vdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI 152 (533) ..-||+-..-+ .-+.|+.+..++ +.+ .+.+..+.+.-+|+....+..+.+.+-|+-|....+.+....+|- T Consensus 69 ~~~iv~~~~~~-l~g~~~~~~~~d----~~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~ 139 (489) T protein:vir:99 69 AKYITVFEQGY-MLGVPVEYKNEN----KDL----QAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTE 139 (489) T ss_pred HHHHHHHHhhh-hccCCceeecCC----hhH----HHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcc Confidence 22223221100 124566666544 333 334555666668999999999999999999998888765567788 Q ss_pred EEEEEcChhhceehhhccCCCcCceeE---Eeccceeeccchhceecccccccc-c----cCCcceeccchhhccc-ccc Q lcl|NC_021072. 153 TELRYIDPRKIRKVTEYQQKRPEQLRG---EDINTQLTQKAAEYYLYNPKGLKN-S----TNQGMKIATDSVTYCH-SGI 223 (533) Q Consensus 153 ~elr~lDP~~i~~vr~~~~~~~~~~~~---~~~~~~~~~~~~e~~~y~p~~~~~-~----~~~~~kI~~dai~y~h-sGl 223 (533) ..+..+||+++-++..-... .....+ +.....-.....-..+|.+..... . .....++.. .+.| .|- T Consensus 140 ~~i~~~~p~~~~~v~dd~~~-~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~---~~~~~~g~ 215 (489) T protein:vir:99 140 VKLYQLPAEQTFVIYDDTYQ-RNSLMAVHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNLETKGMRLKD---YEGHFFKG 215 (489) T ss_pred eEEEEEcccceEEEEcCCCC-CceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEecCCCcccceecc---cccccCCc Confidence 99999999999776532111 111111 110000000111223444432210 0 000111100 0111 111 Q ss_pred cc----CCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCC Q lcl|NC_021072. 224 QD----LNKNMTLSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDAN 298 (533) Q Consensus 224 ~d----~~~~~i~syL~~AiK~~NqLrm-~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~ 298 (533) ++ +|+....|-++..+.....+.. +-+....-+..+.|-+-+.=..... ...-++....+.. -+.. T Consensus 216 vPvv~~~n~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~---~~~~~~~~~~~~~------~~~~ 286 (489) T protein:vir:99 216 VPVNEYANNEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTG---ADENDYLDDGRLN------PNGR 286 (489) T ss_pred eeEEEeecCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCccc---ccchhhhhhcccc------cccc Confidence 11 1333445666655555444422 2333333344445554443322111 1111111111100 0111 Q ss_pred CCccc-cccccchhHhhhcccccCC--CCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccch-- Q lcl|NC_021072. 299 TGEIK-DDKKFMSMLEDFWLPRREG--GRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRA-- 372 (533) Q Consensus 299 TGev~-~d~~~msmlEDywLpRReg--grgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~-- 372 (533) .+... .+.+.+ +++..... |.+..+.-|.-..+.+.. .-+.-+.+.+|+-.++|- +..++ |. |.. T Consensus 287 ~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~~~-~~-~n~Sg 357 (489) T protein:vir:99 287 LAISIGFKKAQV-----LILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPD--TQDMK-FS-GVQSG 357 (489) T ss_pred ccccccccccee-----eeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcc--ccccc-cc-ccchH Confidence 11100 111111 11111111 122234434333332222 234566778888889983 22221 11 222 Q ss_pred hhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhh-hhhceeEEEeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 373 AEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDE-MKEHIQFDFIADNYFTELKEIEIRNERMNQVNT 451 (533) Q Consensus 373 ~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~-~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~ 451 (533) ..|..-+..-..-+.+-|+.|...+.++++.=+-+-|+.....|.. ....|.+.|...---.+. +.++++.+ T Consensus 358 ~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~-------~~~~~~~k 430 (489) T protein:vir:99 358 ESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDN-------EIVTAAQN 430 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHH-------HHHHHHHH Confidence 2222222222333556666666666666654332223322122211 223477888544333332 23344445 Q ss_pred hhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccccc Q lcl|NC_021072. 452 MDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGP 521 (533) Q Consensus 452 ~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~ 521 (533) + +| .+|.+++++.+=..++++.+++-++|++|.....-. + + +..++..-.++.+..+.| T Consensus 431 l---~g-iis~et~~~~l~~v~~~d~~~E~~ri~~E~~~~~~~-~--~----~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 431 L---YG-IVSDQTIFEILNTVTGVDAEAELKRLKEEADKKQSL-P--E----PRLVGDASGQEEPTAEKP 489 (489) T ss_pred H---hc-cCCHHHHHHhcCCCCchhHHHHHHHHHHHHHHHhcc-c--c----ccccCCCCCCcCCCCCCC Confidence 4 35 389999998866677778888777888876543211 1 1 111111111111111222 No 81 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=97.84 E-value=1.6e-05 Score=46.95 Aligned_cols=427 Identities=12% Similarity=0.153 Sum_probs=192.3 Q ss_pred ccceeeeccccccccC--CCCCCCCCccc-ceeecccccccccchhhhhhHHHHHHHHHHhhhh--cchh---------- Q lcl|NC_021072. 5 LFGFSLERAKKVPKGP--SFVQKDSMDGS-QPIVGGGYYGYSVDFDGTVRNEYELITRYREMVL--QPEC---------- 69 (533) Q Consensus 5 ~fg~~i~~~~~~~~~~--s~~~~~~~dg~-~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~--~pEv---------- 69 (533) +|+.-+...+..-+.- .....+-.+.. +.+ -.+....|++++.+.. |+.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~ 65 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALKDVKDHKKVNA---------------NDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGN 65 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhHHHHhcCCCcC---------------CHHHHHHHHHHHHHhcCCCchhhcchhccCCC Confidence 4443333321111100 00000000000 000 0012233333333321 1111 Q ss_pred ------------hhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCce Q lcl|NC_021072. 70 ------------DSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRL 137 (533) Q Consensus 70 ------------d~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri 137 (533) ...++..++=+ =+.|+.+.+++ + ..++-+..+++--+|.+...+++....+-|.. T Consensus 66 ~~~~~~~~~n~~k~i~~~~a~~l-----~~~p~~i~~~d----~----~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~ 132 (496) T protein:vir:38 66 PVNRRQLSMNLPKVTAKYMSKLL-----FNEKVKINIDD----K----AAEEFVLNVLKTNGFTKNMERYIEYGEAMGGF 132 (496) T ss_pred ccccceeecchHHHHHHHHhhhh-----hCCcceEeeCC----h----HHHHHHHHHHhccCHHHHHHHHHHHHhhhCcE Confidence 11111111111 14566676654 2 23334566776677999999999999999999 Q ss_pred eeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEE---ecc-ceeec----------cchhceecccccccc Q lcl|NC_021072. 138 FYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGE---DIN-TQLTQ----------KAAEYYLYNPKGLKN 203 (533) Q Consensus 138 ~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~---~~~-~~~~~----------~~~e~~~y~p~~~~~ 203 (533) |++..+|.+ |=..+..++|.++=++....+.... .... ... ..++. +...+.+|.-... . T Consensus 133 ~~~~~~D~~----~~~~i~~v~~~~~~P~~~~~~~~~~-~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~-~ 206 (496) T protein:vir:38 133 VIKVYHDGN----KNVKVSFATADCMYPLSNDSENVDE-CVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDP-N 206 (496) T ss_pred EEEEEEcCC----CcEEEEEEcccceEEEEecCCcEEE-EEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCc-c Confidence 999999852 3457899999998665332221110 0011 000 00000 0011112211110 0 Q ss_pred ccCCccee----------------ccchhhccccc---cccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccce Q lcl|NC_021072. 204 STNQGMKI----------------ATDSVTYCHSG---IQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERR 264 (533) Q Consensus 204 ~~~~~~kI----------------~~dai~y~hsG---l~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRr 264 (533) ..++++.+ +.-.++|-.-- -.+..++.++|-|+.++.....|-..=..+ .+-++.-.+| T Consensus 207 ~~g~~v~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~--~~~~~~~~~~ 284 (496) T protein:vir:38 207 ELGTKVSLTLLFDDIEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSY--YQEFKLGKKK 284 (496) T ss_pred ccCccccccccccccccceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHH--HHHHhhcccc Confidence 11111111 00001110000 013335567889999988877776554443 3667776666 Q ss_pred EEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhccccc-CCCCccceeecCCCCCcch-HHH Q lcl|NC_021072. 265 IFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRR-EGGRGTEISTLPGGQNLGE-LED 342 (533) Q Consensus 265 vfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRR-eggrgTEIsTLpGg~nLge-i~D 342 (533) +|. +.. +++ . ..|..++.+. -++.-.+.|....- .++.|.-|+++.+.-...+ ..- T Consensus 285 i~v-~~~---------~l~----~-----~~~~~g~~~~---~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~ 342 (496) T protein:vir:38 285 VLV-PSS---------FVK----T-----AVNLDGSTTQ---YFDSTDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIES 342 (496) T ss_pred eec-chH---------Hhh----c-----cCCCCCcccc---CCCCccceEEEeecCCCcccccceeeccccCHHHHHHH Confidence 664 211 110 0 0111222111 01111111111111 1222234666555322221 334 Q ss_pred HHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHH-------HHHhccCCCHhH Q lcl|NC_021072. 343 VKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKT-------QLILKGVMSLEE 415 (533) Q Consensus 343 V~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~-------qLilkgi~t~ee 415 (533) +..+.+.+....++|-+-|+.+++-+ -.+++|.-....-..-+.+.++.|...+.++++. .+.++|.. T Consensus 343 l~~~l~~i~~~~g~~~~~f~~~~~g~-~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~---- 417 (496) T protein:vir:38 343 INAMLRIYAMQVGLSAGTFTFDENGL-KTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGEV---- 417 (496) T ss_pred HHHHHHHHHHhhCCChhhcCCCcccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC---- Confidence 67778899999999999887654221 1355664443333333444454454544444443 34445532 Q ss_pred HhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCC Q lcl|NC_021072. 416 WDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVD 495 (533) Q Consensus 416 w~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~ 495 (533) |+ ...+.+.|.. +.... . . +.++.+.++.. .| .+|.++++++....||+|.+++.++|++|.... .++ T Consensus 418 ~~--~~~i~v~f~d-~i~~d-~-~----~~~~~~~~~~~-~G-iiS~et~l~~~~~~~d~ea~~el~ri~~E~~~~-~~~ 485 (496) T protein:vir:38 418 VE--LDTITVDFDD-SIAQD-E-D----TTINRYTNAKN-QG-MIPLKIALQRAWNITEAEADEWAEMLAKEKQAE-MPN 485 (496) T ss_pred CC--ccceEEEeCC-CCCCC-H-H----HHHHHHHHHHh-cC-CCCHHHHHHhcCCCChHHHHHHHHHHHHhhhcc-Ccc Confidence 22 2457888873 32221 1 1 12222333321 24 589999988888999999999999999988755 221 Q ss_pred CCcccccCCCCCCCC Q lcl|NC_021072. 496 PMAEMDPAMDPGNAP 510 (533) Q Consensus 496 p~~~~~~~~~~~~~~ 510 (533) | +.+...|++. T Consensus 486 ~----d~~~~~~~~e 496 (496) T protein:vir:38 486 N----DMNGIFGEEE 496 (496) T ss_pred c----cccCCCCCCC Confidence 1 1111122222 No 82 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=97.83 E-value=1.6e-05 Score=46.87 Aligned_cols=429 Identities=15% Similarity=0.119 Sum_probs=179.3 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) +-+.|||..-. +..+.+.+..-...........++...|- -|. -+..+++|-|-+||+-|.+.+ T Consensus 3 ~~~~l~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-------------~v~-~~~al~~~~v~~~i~~ia~~i 66 (457) T protein:vir:62 3 FWSALFGRGHS--PALDAAEGRAWEPYDPSIYNLGATASSGE-------------RVT-PHDALQVSAVFASVRLLSETI 66 (457) T ss_pred hhhhhhccccc--cccccccccccccchhhhhhccccccCCc-------------eec-hHHhhccHHHHHHHHHHHHhH Confidence 33556763111 11111110000000000000000001110 011 133456899999999988775 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcc---hhhhhHHHHh----hhhcCceeeeeeecCCCCCCCeE Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDF---ENRSYEIFRR----WYVDGRLFYHKVIDPKNPRGGLT 153 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f---~~~~~~~fR~----WYvDGri~~hkvid~~~~~~gI~ 153 (533) -- .|+.|--..-+..+.++ =..++.+++- .-.+.++++. +.+.|.-|+-+ .+ ..++++ T Consensus 67 A~-----lp~~~~~~~~~~~~~~~------~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i-~~---~~g~~~ 131 (457) T protein:vir:62 67 AT-----LPLSTYSKRGGTRKEID------TPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAV-RW---AGPNIA 131 (457) T ss_pred hh-----CceEEEEecCCcccccc------chHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEE-Ee---CCCcEE Confidence 42 45555322211111111 1122333322 1245555555 56679988764 33 257899 Q ss_pred EEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccc-----cccccCCcceeccchhhccccccccCCC Q lcl|NC_021072. 154 ELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKG-----LKNSTNQGMKIATDSVTYCHSGIQDLNK 228 (533) Q Consensus 154 elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~-----~~~~~~~~~kI~~dai~y~hsGl~d~~~ 228 (533) +|.+|+|..+...+.......... +.. |.+...+ ..+.+...++|+.-+ .++ T Consensus 132 ~l~~l~p~~v~v~~~~~~~~~~~~--~~~-----------y~~~~~g~~~~~~~~~~~eiih~r~~~----------~~~ 188 (457) T protein:vir:62 132 GLDVLDPTKIHVHMVMVDGLRRKV--FEA-----------YDIDADGNEVLLGWFTPRDVLHIPGMM----------LPG 188 (457) T ss_pred EEEEEcCcceEEEEeccCCcccee--EEE-----------EEEccCCceeEEEeeCccceEEecCCC----------CCC Confidence 999999999965443322111110 000 1111111 011222334443221 112 Q ss_pred -CccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccc Q lcl|NC_021072. 229 -NMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKK 307 (533) Q Consensus 229 -~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~ 307 (533) -.++|-++.|++++.....+++...-+=---+--+-|..++ |.|-+..+++..+.+...|+.. .+.|. T Consensus 189 ~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~ls~e~~~~~~~~~~~~~~G~----~nag~------ 257 (457) T protein:vir:62 189 DFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVP-GTMSEEGLARAREAWRAANSGV----DNAHR------ 257 (457) T ss_pred ceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcC-CCCCHHHHHHHHHHHHHHhcCc----cccCc------ Confidence 24568999999999888888876654433334455677776 5676665555444444334311 01121 Q ss_pred cchhHhhhcccccCCCCccceeecCCCCCcchH---HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHH Q lcl|NC_021072. 308 FMSMLEDFWLPRREGGRGTEISTLPGGQNLGEL---EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQK 384 (533) Q Consensus 308 ~msmlEDywLpRReggrgTEIsTLpGg~nLgei---~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~K 384 (533) .+ .++ .|.+++.|. .+..++ +=-+|....+.++++||...|+..++-+... +.+...-+.|.+ T Consensus 258 ~~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~-sn~eq~~~~f~~ 323 (457) T protein:vir:62 258 VA-LLT----------EGAKFSKVA--MSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWG-SGLAEQNIAFTM 323 (457) T ss_pred ce-ecC----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCccccc-chHHHHHHHHHH Confidence 11 121 244555552 333332 3334677889999999999996544333211 122222233666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHH Q lcl|NC_021072. 385 FIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDY 464 (533) Q Consensus 385 fi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~ 464 (533) ++ |+--+ ..+.+.|-..|+ ++.+. ....+.|. +..+... -+..|++++..+-.- -++|..- T Consensus 324 ~~--l~P~~-~~ie~~ln~~L~-----~~~~~----~~~~i~fd----~~~l~~~-d~~~r~~~~~~~~~~--G~~T~NE 384 (457) T protein:vir:62 324 FS--LRPWL-ERIEAGFNRLLF-----AETAD----RFRFVKFN----LDEIKRG-APKERMELWSLGLQN--GIYSIDE 384 (457) T ss_pred HH--HHHHH-HHHHHHHHhhhc-----Ccccc----CceEEEee----chhhhcc-CHHHHHHHHHHHHhC--CCcCHHH Confidence 53 32211 233334444443 33332 23345555 2233221 234567766665432 4788888 Q ss_pred HHHHHhCCCHHHHHHHHH--------HHHHhhhcCCCCCCCcccccCCCC-CCCCCCCCccccccccCCccccchhcC Q lcl|NC_021072. 465 MRRQVLKQTDQEIKEIDK--------QIDSEREAGLIVDPMAEMDPAMDP-GNAPPADDMSAQEGPAVDAGDAKRGEF 533 (533) Q Consensus 465 i~k~IL~~tDeeI~e~~k--------qi~~E~~~~~~~~p~~~~~~~~~~-~~~~~~~d~~~~~~~~~~~~~~~~~~~ 533 (533) +++ +++|..-+=-..++ .+.........+.+.+...+..++ .+..++++... .+..+++.++= T Consensus 385 ~R~-~~gl~pi~~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~d~~~~~~~~~ 456 (457) T protein:vir:62 385 VRA-AEDMTPLPDGLGEKYRVPLNLGEIGEEPEPEPAPAPPAIDPPAEEPADDEEPDNAEGD-----PDEGETEDDDD 456 (457) T ss_pred HHH-HhCCCCCCCCCcceeeeccccccccccccccccCCCccCCCCccCCCCCCCCCCCCCC-----Ccccccccccc Confidence 885 46775421000000 000000000001111111100000 00001100000 00111111111 No 83 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=97.79 E-value=1.9e-05 Score=46.46 Aligned_cols=407 Identities=14% Similarity=0.149 Sum_probs=174.1 Q ss_pred CC--ccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhc Q lcl|NC_021072. 1 MS--NQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVN 78 (533) Q Consensus 1 ~~--~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvn 78 (533) |. ..||+-.=.. .+..+ ....+..+....+.+.+.. +...... +.. +...++|-|.+||+-|.+ T Consensus 1 MG~f~~lf~~~~~~----~~~~~---~~~~~~~~~~~~~~~~~~~-g~~~~~~-----v~~-~~al~~~~v~~ci~~ia~ 66 (422) T protein:vir:13 1 MGFLRGLFNKKNNN----DEKRS---NYDEDIGIDISDSNFWEKF-GIKLNFS-----VRG-KRALKENTVYVCTKIRAE 66 (422) T ss_pred CchhhhhhhccCCc----cchhh---hhhhccccccCcchhhhhc-cccCCcc-----cch-hhhhccHHHHHHHHHHHH Confidence 43 2333311111 11111 0111111111111111110 0001100 111 122457889999988877 Q ss_pred ceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcch----hhhhH----HHHhhhhcCceeeeeeecCCCCCC Q lcl|NC_021072. 79 ETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFE----NRSYE----IFRRWYVDGRLFYHKVIDPKNPRG 150 (533) Q Consensus 79 eaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~----~~~~~----~fR~WYvDGri~~hkvid~~~~~~ 150 (533) .+-- .|+.+- .+. +.+++ ..++++|+.. -.+.+ ++..+++.|.-|..++-|. .+ T Consensus 67 ~iA~-----lp~~~~-~~~---~~~~~------~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~---~G 128 (422) T protein:vir:13 67 SIGK-----LSLKIY-KDK---EEYKE------HELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDR---KG 128 (422) T ss_pred hhhh-----CceEEE-ecC---ccccc------chHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECC---CC Confidence 6442 444442 111 11111 1344444322 23334 4444778899999988764 44 Q ss_pred CeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceecc-ccccc--cccCCcceeccchhhccccccccCC Q lcl|NC_021072. 151 GLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYN-PKGLK--NSTNQGMKIATDSVTYCHSGIQDLN 227 (533) Q Consensus 151 gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~-p~~~~--~~~~~~~kI~~dai~y~hsGl~d~~ 227 (533) -+++|.+++|..++.+....+.... ...-+|.+. +.|.. ..+.+.+++... ...+ T Consensus 129 ~~~~L~~i~~~~v~~~~~~~~~~~~-------------~~~~~y~~~~~~g~~~~~~~~eiih~~~~---------~~~~ 186 (422) T protein:vir:13 129 KIIGLYPINSDNVTKIIDDDNFLSS-------------LSKVWYVVTDKNGKEHKLLPDEMLHFIGD---------ITLD 186 (422) T ss_pred cEEEEEEECCcceEEEEcCCcceec-------------cceEEEEEEeCCCeEEEEcccceEEEcCC---------CCCC Confidence 5999999999999764432211100 001112221 22211 122333444322 1122 Q ss_pred CCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccc Q lcl|NC_021072. 228 KNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKK 307 (533) Q Consensus 228 ~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~ 307 (533) +-.++|-|..|.+++.....+++...=+----|--+-+...+ ++|-+..+++..+.+...|.-. + ..| + T Consensus 187 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g~---~-n~~------~ 255 (422) T protein:vir:13 187 GLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYV-GDLDEKAKKIFKKEFESMSNGL---E-NAH------S 255 (422) T ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeC-CCCCHHHHHHHHHHHHHHhcCc---c-ccC------C Confidence 334578999999998888877766543333334566677776 4666665555555444443210 0 011 1 Q ss_pred cchhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH- Q lcl|NC_021072. 308 FMSMLEDFWLPRREGGRGTEISTLPG-GQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF- 385 (533) Q Consensus 308 ~msmlEDywLpRReggrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kf- 385 (533) .+ .++ + |++++.|.= ...+.-++-.++....+.++++||...|...+.-+....++..+. |..+ T Consensus 256 ~~-vl~-------~---g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~~~~---f~~~~ 321 (422) T protein:vir:13 256 IS-LLP-------F---GYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQQKD---FYVTT 321 (422) T ss_pred ce-ecC-------C---CceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHH Confidence 11 111 2 344443321 122223445567888999999999999975443333333332222 4332 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHH Q lcl|NC_021072. 386 IARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYM 465 (533) Q Consensus 386 i~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i 465 (533) |.-+-.++.. .|-.. ++++.+. .....+.|.. .++...+ +..|.+++..+-.- -++|.+-+ T Consensus 322 l~P~~~~ie~----~l~~~-----Ll~~~~~---~~g~~i~fd~----~~l~r~d-~~~~~~~~~~~~~~--G~~T~NE~ 382 (422) T protein:vir:13 322 LQSSLTVYEQ----EIQDK-----LFSQYET---LQDVKAEFNV----DTILRSD-IKTRYEAYRIGIQG--GFIEANEA 382 (422) T ss_pred HHHHHHHHHH----HHHHh-----hCChhhh---cCCceEEeec----hhhhcCC-HHHHHHHHHHHHhC--CCcCHHHH Confidence 2222222222 22222 2333322 2234455542 2332211 23455555544332 36777776 Q ss_pred HHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccccccCCccccchhc Q lcl|NC_021072. 466 RRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDAKRGE 532 (533) Q Consensus 466 ~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 532 (533) +. .+++.+-+ .-++ .+ -+.+..|.+....+.. ++..+-++ T Consensus 383 R~-~~gl~p~~--ggD~---------~~-----------~~~n~~~l~~~~~~~~----~~g~~~g~ 422 (422) T protein:vir:13 383 RR-RENLPPVE--GGDR---------LL-----------VNGNMIPIEMAGEQYK----KGGEKGGK 422 (422) T ss_pred HH-HhCCCCCC--CcCe---------ee-----------eccCccchhhcccccc----cCCCcCCC Confidence 64 35554321 0000 00 0111111111111100 11111111 No 84 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=97.76 E-value=2.2e-05 Score=46.15 Aligned_cols=438 Identities=14% Similarity=0.120 Sum_probs=187.2 Q ss_pred Ccccceeeccccccc--ccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHh--------hcceeeecCCCceE------- Q lcl|NC_021072. 28 MDGSQPIVGGGYYGY--SVDFDGTVRNEYELITRYREMVLQPECDSAVDDI--------VNETICGNFDDVPV------- 90 (533) Q Consensus 28 ~dg~~~~~~~~~~~~--~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeI--------vneaiv~d~~~~~v------- 90 (533) +.|.+....-.-... ...+......+. .+|+.+..+.+=+..+..+ .+--++++.-..+| T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~~~~---~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l 77 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFEDST---QNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAERQ 77 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhhh Confidence 222221110000000 000111111111 1122222221111111110 00000111101111 Q ss_pred ---EEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCC----CCCCCeEEEEEcChhhc Q lcl|NC_021072. 91 ---EVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPK----NPRGGLTELRYIDPRKI 163 (533) Q Consensus 91 ---~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~----~~~~gI~elr~lDP~~i 163 (533) .+.. +.++.. .+.+..|+..-+|+....++++.-++-|+-|...-.+.. .+.+|-..++.++|+.+ T Consensus 78 ~~~g~~~---~~~~~~----~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~ 150 (485) T protein:vir:10 78 AVEGFRF---GDADEA----DEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVEPPTRM 150 (485) T ss_pred cccceec---CCCchh----HHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEEcccee Confidence 1111 111111 223445565567889999999999999999987554432 22445567888899887 Q ss_pred eehhhccCC-CcCceeEEeccceeeccchhceeccccccc-------------cccCCcceeccchhhccccccccCCCC Q lcl|NC_021072. 164 RKVTEYQQK-RPEQLRGEDINTQLTQKAAEYYLYNPKGLK-------------NSTNQGMKIATDSVTYCHSGIQDLNKN 229 (533) Q Consensus 164 ~~vr~~~~~-~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~-------------~~~~~~~kI~~dai~y~hsGl~d~~~~ 229 (533) -.+..-... ..-..+++.... .+...-..+|.+.... ..+|..-+||. +.|++.. ++.+. T Consensus 151 ~~~~D~~~~~~~~~~~~~~~~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv--v~~~n~~--~~~~~ 224 (485) T protein:vir:10 151 YAEIDPRIGRVSKAIRVAYDAE--GNEIQAATLYTPNDIFGWYRVENEWQEWFNNPHGLGVVPV--VPIPNRT--RLSDL 224 (485) T ss_pred EEEEcCCCCceeEEEEEEEeeC--CCeEEEEEEEeCCeEEEEEEcCCceEEeccccCCCCcccE--EEecccc--ccCCC Confidence 554432111 111111111100 0001111223332211 11222223333 2233321 11122 Q ss_pred ccchhHHHHHHHH-HH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccc Q lcl|NC_021072. 230 MTLSHLHKAIKAV-NQ-LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKK 307 (533) Q Consensus 230 ~i~syL~~AiK~~-Nq-Lrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~ 307 (533) .+.|-+.+.+.++ .. -+++-+..++-..+-.|.|-+.=.+..+++.. ..+|...-+. T Consensus 225 ~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~--------------------~~~~~~~~~~- 283 (485) T protein:vir:10 225 YGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVD--------------------PETGQTLFDA- 283 (485) T ss_pred CCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCccccccc--------------------ccccchhhhh- Confidence 3344444332221 21 23566777777777777776553332222211 1111111000 Q ss_pred cchhHhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHH Q lcl|NC_021072. 308 FMSMLEDFWLPRREGGRGTEISTLPGGQNLG-ELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFI 386 (533) Q Consensus 308 ~msmlEDywLpRReggrgTEIsTLpGg~nLg-ei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi 386 (533) ..-..|+.- +-+.++-.+++.. ++ -++-++=.-..++..-++|.+-|+..+ -|-..+..|..-+..+..-+ T Consensus 284 ---~~~~i~~~~---~~d~k~~q~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~-~n~~Sg~Al~~~~~~l~~k~ 355 (485) T protein:vir:10 284 ---YLARILAFE---DAEGKIQQFSAAE-LANFTNALDQIAKQVAAYTGLPPQYLSTAA-DNPASAEAIRAAESRLIKKV 355 (485) T ss_pred ---cccceeccC---CCCceEEeecccc-hHHHHHHHHHHHHHHhcccCCCHHHhcccc-CchhHHHHHHHHHHHHHHHH Confidence 001224331 1123454555433 22 122233334445555777877775433 22223445777777788889 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCC-HhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHH Q lcl|NC_021072. 387 ARLRKRFSELFMDLLKTQLILKGVMS-LEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYM 465 (533) Q Consensus 387 ~rLr~~fs~if~d~Lk~qLilkgi~t-~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i 465 (533) .+.|..|..-+...++.-+.+.|... ..+| ..|.+.|..-..-+. .+..+++.++..-+-..+|.+++ T Consensus 356 ~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~----~~i~v~w~~~~~~~~-------~~~ada~~kl~~ag~~~~s~et~ 424 (485) T protein:vir:10 356 ERKNSIFGGAWEEAMRLAYRMMKGGDVPPDM----LRMETVWRDPSTPTY-------AAKADAASKLYNGGTGVIPRERA 424 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCCcccc----eeeeEEecCCCCCCH-------HHHHHHHHHHHhccccCCCHHHH Confidence 99999999999999987666655321 1222 357888865433332 34555566655443357899999 Q ss_pred HHHHhCCCHHHHHHHHHHHHHhhhcCC-----CCCCCcccccCCCCCCCCCCCCccccccccCCccccch Q lcl|NC_021072. 466 RRQVLKQTDQEIKEIDKQIDSEREAGL-----IVDPMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDAKR 530 (533) Q Consensus 466 ~k~IL~~tDeeI~e~~kqi~~E~~~~~-----~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 530 (533) +. .|++++++++++++..+++...+. ..+|....+.+.+.+ .+.+.|+-.++.+-+ T Consensus 425 ~~-~lg~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 425 RK-DMGYSIAEREEMRRWDEEEAAMGLGLIGTMVDPNPTVPGSPSPA--------PAPKPAALESGGDAA 485 (485) T ss_pred HH-hCCCCHhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCCcc--------ccccCcCCCCCCCCC Confidence 85 599999999988776565544322 222221110000000 000001111111111 No 85 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=97.75 E-value=2.2e-05 Score=46.10 Aligned_cols=392 Identities=13% Similarity=0.144 Sum_probs=180.2 Q ss_pred cccchhhhh-hHHHHHHHHHHhhhhcchhhh--------------------HHHHhhcceeeecCCCceEEEEeccCCCc Q lcl|NC_021072. 42 YSVDFDGTV-RNEYELITRYREMVLQPECDS--------------------AVDDIVNETICGNFDDVPVEVELSNLKQS 100 (533) Q Consensus 42 ~~~~~~~~~-~~~~~LI~~YR~m~~~pEvd~--------------------AvdeIvneaiv~d~~~~~v~v~l~~~~~S 100 (533) --...-..+ +.-..-+.+|+.+..+.+-+. -..-||+-.. .=.=+.|+.+..++ T Consensus 1 l~~~~l~~~i~~~~~~~~r~~~l~~yy~g~~~il~~~~~~~~~~~~ki~~n~~~~ivd~~~-~~l~g~~~~~~~~~---- 75 (429) T protein:vir:98 1 MTKDLLSELIQKHRSFNLSYSAYKQLYEGDHAILQQKQKEQYKPDNRLVVNFAKYIVDTFN-GYFIGVPVQTSHEN---- 75 (429) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccCCCcceeecchHHHHHHHHh-hhhcccCceeecCC---- Confidence 000000000 000111233444333333221 1112222111 11124566666544 Q ss_pred HHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCce--- Q lcl|NC_021072. 101 DKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQL--- 177 (533) Q Consensus 101 ~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~--- 177 (533) +.. ++.+..+.+--+|+....+.++.+++-|+-|++..+| ++|-..++.+||+.+-.+..-.... ... T Consensus 76 ~~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d----~~g~~~~~~~~p~~~~~v~dd~~~~-~~~~~i 146 (429) T protein:vir:98 76 KQV----SNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFND----ENAEAGITYLTPLEAFIVYDDSIRQ-KPLFAV 146 (429) T ss_pred hHH----HHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEec----CCCcEEEEEEcccceEEEEeCCCCC-ceEEEE Confidence 333 3345666666789999999999999999999887665 3577889999999997654322111 111 Q ss_pred eEEeccceeeccchhceeccccccc--cccCCcceec------cchhhccccccccCCCCccchhHHHHHHHHHHHHH-H Q lcl|NC_021072. 178 RGEDINTQLTQKAAEYYLYNPKGLK--NSTNQGMKIA------TDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRM-I 248 (533) Q Consensus 178 ~~~~~~~~~~~~~~e~~~y~p~~~~--~~~~~~~kI~------~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm-~ 248 (533) +++...+ ......+|.+.... .....+..+. ...+..++ -+|+....|-++..+.....+.. + T Consensus 147 ~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~----~~n~~~g~sd~e~v~~liD~~d~~~ 218 (429) T protein:vir:98 147 RYFYNKG----GVLEGSYSDASNITYFKDGEKGIEIGESEPHPFDGVPMIE----YVENEERQSLLASVVTLINAFNKAI 218 (429) T ss_pred EEEEecC----ceEEEEEEeCceEEEEEecCCceEecccccccCCccceEE----ecCCCCCCCcHHHHHHHHHHHHHHH Confidence 1111111 00111112211100 0001111110 11111111 12334455667776666665543 4 Q ss_pred HHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccce Q lcl|NC_021072. 249 EDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEI 328 (533) Q Consensus 249 EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEI 328 (533) -+....-+.++.|-+-+.=.+. .. +-+++++ . ++++ .+|- +||.+..+ T Consensus 219 s~~~~~~~~~~~p~~~i~g~~~---~~----~~~~~~~-~--~~~~---------------------~~~~-~~~~~~~~ 266 (429) T protein:vir:98 219 SEKANDVEYFADAYLKILGAEL---DD----ETLKSLR-D--TRII---------------------NLKD-TDAQQLTV 266 (429) T ss_pred HHHHHHHHHhcCceeeeecCCC---Cc----chhhhHh-h--Ccee---------------------eccC-CCCCCcce Confidence 4555556777778776653222 21 1111111 1 1111 1111 23444556 Q ss_pred eecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCCCcccccchh--hhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 329 STLPGGQNLGELED-VKYFQKKLYKALNVPSSRLETETTFNIGRAA--EITRDEVKFQKFIARLRKRFSELFMDLLKTQL 405 (533) Q Consensus 329 sTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~--eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qL 405 (533) ..|--..+.+.... ++-+.+.+|+...+|- +..++ +|++| .|.--+.....-+.+.|+.|..-+.++++.=+ T Consensus 267 ~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~~~---~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~ 341 (429) T protein:vir:98 267 EFLQKPDADATQEHLLDRLENLIFRTAMVAN--ISDES---FGTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIA 341 (429) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCccc--cCccc---cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 65544444544443 6788888899999993 33322 24333 23333333444566666666666666655433 Q ss_pred HhccCCC-HhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHH Q lcl|NC_021072. 406 ILKGVMS-LEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQI 484 (533) Q Consensus 406 ilkgi~t-~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi 484 (533) -+-++.. ..+| ..|.+.|.....-.+ .+.++++.++ +| .+|.++++.. |...++ -+++.++| T Consensus 342 ~~~~~~~~~~d~----~~i~v~f~~~~p~~~-------~~~a~~~~kl---~g-~is~et~~~~-l~~v~d-~~~E~~ri 404 (429) T protein:vir:98 342 SYPTSKIGPKDW----IGIKYKFTRNLPANL-------LEESQIAGNL---AG-IVSEETQVGV-LSIVEN-PQKEIERK 404 (429) T ss_pred HHhccCCCcccc----ccceEEeCCCCCcCH-------HHHHHHHHHH---hc-cCchHHHHHh-CCCCCC-HHHHHHHH Confidence 3323322 2222 247778865433333 2334455555 34 5999999976 565432 23445556 Q ss_pred HHhhhcCCCCCCCcccccCCCCCCCCCCCC Q lcl|NC_021072. 485 DSEREAGLIVDPMAEMDPAMDPGNAPPADD 514 (533) Q Consensus 485 ~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d 514 (533) ++|.... ++ ++.++....+.+.+.| T Consensus 405 ~~E~~~~-~~----~~~~~~~~~~~~~~~~ 429 (429) T protein:vir:98 405 NSDKSTL-IS----RQAGGLNGQNTTTILE 429 (429) T ss_pred HHHHHHH-HH----HHHhhhcCCCCCCCCC Confidence 6665532 11 1222222222223333 No 86 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=97.74 E-value=2.3e-05 Score=46.02 Aligned_cols=403 Identities=14% Similarity=0.112 Sum_probs=175.8 Q ss_pred cccch-----hhhhhHHHHHHHHHHhhhhcchhhhHH-------------------------------HHhhcceeeecC Q lcl|NC_021072. 42 YSVDF-----DGTVRNEYELITRYREMVLQPECDSAV-------------------------------DDIVNETICGNF 85 (533) Q Consensus 42 ~~~~~-----~~~~~~~~~LI~~YR~m~~~pEvd~Av-------------------------------deIvneaiv~d~ 85 (533) --... +-.+......+.+|+.+..+.+-..+| ..||+-.+-+ . T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~y-l 79 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGY-V 79 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhh-e Confidence 00011 011112345566666665554433321 1222221111 1 Q ss_pred CCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcChhhcee Q lcl|NC_021072. 86 DDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRK 165 (533) Q Consensus 86 ~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~ 165 (533) =+.||.+..++ +...++|.+-|.. +|+....++.+.|.+-|+-|.+.-+| ++|-..+..+||..+-+ T Consensus 80 ~G~p~~~~~~d----~~~~~~l~~~~~~-----~~~~~~~~l~~~~~~~G~a~~~~y~d----~~~~~~~~~~~p~~~~~ 146 (470) T protein:vir:10 80 ASVFPDIDVGK----DADNKKIIDVLGD-----DRALTLNGLLVDSSNAGRAWLHYWID----EDGNFRYGIIQPDQITP 146 (470) T ss_pred eccceeeecCc----hHHHHHHHHHHhh-----hHHHHHHHHHHHHhhcCeeEEEEEec----CCCceEEEEEcccceEE Confidence 25777776655 3333334333321 56777778899999999999998886 34678899999999877 Q ss_pred hhhccC--CCcCceeEEeccceeec-cchhceecccccccc-c---cCCcceeccc---------------hhhccc-cc Q lcl|NC_021072. 166 VTEYQQ--KRPEQLRGEDINTQLTQ-KAAEYYLYNPKGLKN-S---TNQGMKIATD---------------SVTYCH-SG 222 (533) Q Consensus 166 vr~~~~--~~~~~~~~~~~~~~~~~-~~~e~~~y~p~~~~~-~---~~~~~kI~~d---------------ai~y~h-sG 222 (533) |..-.. +..-..+++........ ...-+.+|.+..... . ......-+.. ..++.| -| T Consensus 147 v~d~~~~~~~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 226 (470) T protein:vir:10 147 IYATTLDNKLLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFG 226 (470) T ss_pred EEcCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccCCC Confidence 643321 11111122211110000 001112222222110 0 0000000000 000011 01 Q ss_pred ccc----CCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeC Q lcl|NC_021072. 223 IQD----LNKNMTLSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDA 297 (533) Q Consensus 223 l~d----~~~~~i~syL~~AiK~~NqLrm-~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~ 297 (533) .++ +++....|=|+..+.....+.. +=+....-+.+..|-.-+.-.+.-+++ +.+. -+.+++-- T Consensus 227 ~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~-----~~~~-~~~~~~~i----- 295 (470) T protein:vir:10 227 RVPFIEFSKNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLH-----QFMN-DLRKYKSI----- 295 (470) T ss_pred eeeEEEeecCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccc-----hhhh-hhhhcCeE----- Confidence 111 1233445667766665555533 333344445555555544432221211 1111 12222222 Q ss_pred CCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhh Q lcl|NC_021072. 298 NTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEIT 376 (533) Q Consensus 298 ~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eIt 376 (533) +++-.+.|.|..+..|--..+.... .-+.-+.+.+|+-..+|- +..++ +|.+|... T Consensus 296 ------------------~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~--~~~~~---~gn~Sg~A 352 (470) T protein:vir:10 296 ------------------KINNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGID--PANFE---SSNASGVA 352 (470) T ss_pred ------------------eccCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCC--CCccc---cccchHHH Confidence 2222233334445555554444433 334567778888889984 22222 24443322 Q ss_pred H--HhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_021072. 377 R--DEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDP 454 (533) Q Consensus 377 R--DElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~ 454 (533) . -...--.-+.+.+..|...|.++++.=+-+-|+. .-+| ..|.+.|...--=.+... +++++.+ T Consensus 353 lk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~~-~~d~----~~i~i~f~~~~p~d~~e~-------~~~~~~~-- 418 (470) T protein:vir:10 353 IKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-DADK----RHISQHWTRTKVEDSLTK-------AQIVSTV-- 418 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-Cccc----ceeeEEeccCCCCCHHHH-------HHHHHHH-- Confidence 1 1111223366677777777777666433223432 2233 356777765443333222 3334444 Q ss_pred hccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCc Q lcl|NC_021072. 455 YVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDM 515 (533) Q Consensus 455 ~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~ 515 (533) +| .+|.+++++. |...++ .+++-++|++|.....-..++. +..++++ ..|+- T Consensus 419 -~g-~iS~et~l~~-~p~v~D-~~~E~eri~~E~~e~~~~~~~~---~~~~~~~--~dde~ 470 (470) T protein:vir:10 419 -AN-YSSKEAVAKA-NPIVDD-WQQELKDLAKDKEENDPYSNQA---DELNGKG--VNDEQ 470 (470) T ss_pred -hc-cCcHHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHhhccc---cccCCCC--CCCCC Confidence 45 4899999977 666532 4445566666654431111111 1111110 00000 No 87 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=97.74 E-value=2.3e-05 Score=46.01 Aligned_cols=266 Identities=15% Similarity=0.204 Sum_probs=135.4 Q ss_pred CCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhc----chhhhh----HHHHhhhhcCceeeeeeecCCCCCCCeEEEE Q lcl|NC_021072. 85 FDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLD----FENRSY----EIFRRWYVDGRLFYHKVIDPKNPRGGLTELR 156 (533) Q Consensus 85 ~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~----f~~~~~----~~fR~WYvDGri~~hkvid~~~~~~gI~elr 156 (533) --..|+.+- ++.+. . ...++++|+ -.-.+. .+++.+++.|.-|+.++-+. .+.+++|. T Consensus 1 ia~l~~~~~-~~~~~---~-------~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~---~G~~~~l~ 66 (278) T protein:vir:78 1 MASLPLKMY-EDYKV---V-------NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI---YHQPSKLF 66 (278) T ss_pred CccceeEEE-ecCcc---c-------ccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECC---CCcEEEEE Confidence 122333332 12111 1 123344443 222344 44555778899999988764 44599999 Q ss_pred EcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCccchhHH Q lcl|NC_021072. 157 YIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLH 236 (533) Q Consensus 157 ~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~ 236 (533) +|+|..++..... ++...+ |.|... .+..+.++.+.+.+.. ..-..++-.+.|.+. T Consensus 67 ~l~~~~v~v~~~~-----~~~~~~-------------y~~~~~-----~g~~~~~~~~evih~~-~~~~~~~~~G~s~~~ 122 (278) T protein:vir:78 67 LLNPDVVEMLIEN-----QSRELY-------------YSIHAA-----TGNKLIVHNMDMLHFK-HIVASNMVQGISPID 122 (278) T ss_pred EECCceeEEEEcC-----CCceEE-------------EEEEcC-----CceEEEEccccEEEEC-CCCCCCCeeeccHHH Confidence 9999999653221 111111 111111 0112233333332221 111122334679999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhc Q lcl|NC_021072. 237 KAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFW 316 (533) Q Consensus 237 ~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDyw 316 (533) .|.+++.....++..- .+...+.| .-+++ .-++|.+..+++..+.+-..+ + ..|. .+ .++ T Consensus 123 ~~~~~i~~~~~~~~~~-~~~~~~~~-~~i~~-~~~~l~~e~~~~~~~~~~~~~------~-~~g~------~~-vl~--- 182 (278) T protein:vir:78 123 VLKNTTDFDNAVRTFN-LTEMQKPD-SFMLK-YGSNVGKEKRQQVLEDFKQYY------E-ENGG------IL-FQE--- 182 (278) T ss_pred HHHHHHHHHHHHHHHH-HHHhcCCC-cEEEE-eCCCCCHHHHHHHHHHHHHHh------c-cCCC------ce-ecC--- Confidence 9999999888887764 45555555 44443 446777766655444332222 1 2232 22 111 Q ss_pred ccccCCCCccceeecCCCCCcchHHH---HHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HHHHHHH Q lcl|NC_021072. 317 LPRREGGRGTEISTLPGGQNLGELED---VKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IARLRKR 392 (533) Q Consensus 317 LpRReggrgTEIsTLpGg~nLgei~D---V~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kf-i~rLr~~ 392 (533) .|+++..|. .+.-+++- .++..+.+.++++||.+.++..++-+....++..+. |..+ |..+..+ T Consensus 183 -------~g~~~~~l~--~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~~~---~~~~~l~P~~~~ 250 (278) T protein:vir:78 183 -------PGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRF---YLQHTLLPIVKQ 250 (278) T ss_pred -------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHHHHHHHHH Confidence 256777774 33334333 357889999999999999976555455444444433 5554 4444444 Q ss_pred HHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccch Q lcl|NC_021072. 393 FSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNY 432 (533) Q Consensus 393 fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~ 432 (533) +...|. ..| +++.|+. -..+|+|+... - T Consensus 251 i~~~ln----~~L-----~~~~e~~-~g~~~~f~~~~--l 278 (278) T protein:vir:78 251 YEEEFN----RKL-----LTKTDRE-KIGILNLTLNL--I 278 (278) T ss_pred HHHHHH----hhc-----CChhHhc-CCceEEEeccc--C Confidence 444433 333 4455544 22344444332 2 No 88 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=97.66 E-value=3.1e-05 Score=45.27 Aligned_cols=428 Identities=12% Similarity=0.092 Sum_probs=174.0 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcc------------- Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQP------------- 67 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~p------------- 67 (533) |.-.=|=..|+.. +.+. ..+.. +.. ........-..+.+.|+....++ T Consensus 1 ~~~~~~~~~~~~~-------------~~~~--e~i~~-~i~---~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~ 61 (474) T protein:vir:10 1 MTLYKLIDDIEAQ-------------GILP--KHIEA-LIE---SHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKED 61 (474) T ss_pred CchHHHHhhcccc-------------CCCH--HHHHH-HHH---HhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhh Confidence 1110000000000 0000 00000 000 00000000112223333221100 Q ss_pred -----------------hhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHh Q lcl|NC_021072. 68 -----------------ECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRR 130 (533) Q Consensus 68 -----------------Evd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~ 130 (533) -+-+-...||+-.+-+ .=+.||.+.++.. +...+.+++.+..+++--+|+....+.++. T Consensus 62 ~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~y-l~g~pv~~~~~~~---~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~ 137 (474) T protein:vir:10 62 FETGGNVRRLDVSVNNKLNNSFDSEIVDTRVGY-LHGVPVTYDLDEN---AEKNEKLKKFITNFAIRNSVDDEDSEIGKM 137 (474) T ss_pred hhhcccccccccCcccccccchHHHHHHhHhhh-eeccceeEeeCCC---CcchHHHHHHHHHHHhhcCHhHHHHHHHHH Confidence 1112222233321111 1267888887432 222344445556666666899999999999 Q ss_pred hhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeecc-chhceeccccccc-cc---c Q lcl|NC_021072. 131 WYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQK-AAEYYLYNPKGLK-NS---T 205 (533) Q Consensus 131 WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~-~~e~~~y~p~~~~-~~---~ 205 (533) ..+-|+-|.+.-+| ++|-..++.+||+.+-+|.....+.--..+++......-+. .....+|.+.... +. . T Consensus 138 ~~~~G~a~~~~~~d----~~~~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~ 213 (474) T protein:vir:10 138 AAICGYGARLAYID----TNGDIRIKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGI 213 (474) T ss_pred HhhcCeEEEEEEeC----CCCeeEEEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCC Confidence 99999999886665 34567899999999877654222211111111111100000 0011233332211 00 0 Q ss_pred CCcceeccchhhccc-ccccc----CCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHH Q lcl|NC_021072. 206 NQGMKIATDSVTYCH-SGIQD----LNKNMTLSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAE 279 (533) Q Consensus 206 ~~~~kI~~dai~y~h-sGl~d----~~~~~i~syL~~AiK~~NqLrm-~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAe 279 (533) .....+. .+.| .|.++ +++....|-|+..+.....+.. +-+....-+-++.|-+-+.-. .++. T Consensus 214 ~~~~~~~----~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~---~~~~---- 282 (474) T protein:vir:10 214 DALQEVG----RYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGM---GMSE---- 282 (474) T ss_pred Ccccccc----cccCCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccC---CCCc---- Confidence 0000000 1111 22211 2344556777776666665543 333334445555555443321 2221 Q ss_pred HHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCc Q lcl|NC_021072. 280 QYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLG-ELEDVKYFQKKLYKALNVPS 358 (533) Q Consensus 280 qYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLg-ei~DV~YF~~kLy~aL~VP~ 358 (533) +-+..+ .. + |. .|++ + .+..+..|--..+.. ...-+.-+.+.+|....+|- T Consensus 283 ~~~~~~-~~--~--------~~-------------i~~~--~--~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~ 334 (474) T protein:vir:10 283 EMIQET-QK--S--------GA-------------FELF--D--KDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVN 334 (474) T ss_pred hhhhhh-hh--c--------ce-------------eEec--C--CCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcc Confidence 111111 00 1 11 1111 0 112244443322322 23445667788888888884 Q ss_pred cccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCC----CHhHHhhhhhceeEEEeccchHH Q lcl|NC_021072. 359 SRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVM----SLEEWDEMKEHIQFDFIADNYFT 434 (533) Q Consensus 359 sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~----t~eew~~~~~~i~~~f~~Dn~f~ 434 (533) --.+.-+| |. .|..|..-......-+.+.+..|..-+...++.=+-+-++. .+.+| ..|.+.|...---. T Consensus 335 ~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~----~~i~~~f~~~~p~d 408 (474) T protein:vir:10 335 FNSDEFNG-NV-PIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSY----LNLIFKFTRNIPVN 408 (474) T ss_pred cccccccc-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcccc----ccceEEeCCCCCCC Confidence 21111111 11 12223322222333456666666666666665543322221 23334 35788887654444 Q ss_pred HHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCC Q lcl|NC_021072. 435 ELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADD 514 (533) Q Consensus 435 E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d 514 (533) +... ++++..+. | .+|.+++++. |...+ +.+++-++|++|..+..-..|+... ++ ..+.+..++ T Consensus 409 ~~e~-------a~~~~kl~---g-~iS~et~~~~-l~~v~-d~~~E~eri~~E~~e~~~~~~~~~~-~~--~~~~~~~~~ 472 (474) T protein:vir:10 409 KLEE-------SQVLINLK---G-QVSERTRLGQ-SQLVD-DVDYELDEMEKESLEFNDKLPDIDE-GD--ANDKSQNNQ 472 (474) T ss_pred HHHH-------HHHHHHHh---c-cCchHHHHHh-CCCCC-CHHHHHHHHHHHHHHHHhhcccccC-CC--cCCCCcccc Confidence 4333 34444442 4 4899999987 66544 3445555555555433222221110 00 000111111 Q ss_pred cc Q lcl|NC_021072. 515 MS 516 (533) Q Consensus 515 ~~ 516 (533) +. T Consensus 473 s~ 474 (474) T protein:vir:10 473 SE 474 (474) T ss_pred CC Confidence 00 No 89 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=97.66 E-value=3.1e-05 Score=45.27 Aligned_cols=428 Identities=12% Similarity=0.092 Sum_probs=174.0 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcc------------- Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQP------------- 67 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~p------------- 67 (533) |.-.=|=..|+.. +.+. ..+.. +.. ........-..+.+.|+....++ T Consensus 1 ~~~~~~~~~~~~~-------------~~~~--e~i~~-~i~---~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~ 61 (474) T protein:vir:94 1 MTLYKLIDDIEAQ-------------GILP--KHIEA-LIE---SHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKED 61 (474) T ss_pred CchHHHHhhcccc-------------CCCH--HHHHH-HHH---HhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhh Confidence 1110000000000 0000 00000 000 00000000112223333221100 Q ss_pred -----------------hhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHh Q lcl|NC_021072. 68 -----------------ECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRR 130 (533) Q Consensus 68 -----------------Evd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~ 130 (533) -+-+-...||+-.+-+ .=+.||.+.++.. +...+.+++.+..+++--+|+....+.++. T Consensus 62 ~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~y-l~g~pv~~~~~~~---~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~ 137 (474) T protein:vir:94 62 FETGGNVRRLDVSVNNKLNNSFDSEIVDTRVGY-LHGVPVTYDLDEN---AEKNEKLKKFITNFAIRNSVDDEDSEIGKM 137 (474) T ss_pred hhhcccccccccCcccccccchHHHHHHhHhhh-eeccceeEeeCCC---CcchHHHHHHHHHHHhhcCHhHHHHHHHHH Confidence 1112222233321111 1267888887432 222344445556666666899999999999 Q ss_pred hhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeecc-chhceeccccccc-cc---c Q lcl|NC_021072. 131 WYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQK-AAEYYLYNPKGLK-NS---T 205 (533) Q Consensus 131 WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~-~~e~~~y~p~~~~-~~---~ 205 (533) ..+-|+-|.+.-+| ++|-..++.+||+.+-+|.....+.--..+++......-+. .....+|.+.... +. . T Consensus 138 ~~~~G~a~~~~~~d----~~~~~~~~~i~p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~ 213 (474) T protein:vir:94 138 AAICGYGARLAYID----TNGDIRIKNIDPYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGI 213 (474) T ss_pred HhhcCeEEEEEEeC----CCCeeEEEEEcccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCC Confidence 99999999886665 34567899999999877654222211111111111100000 0011233332211 00 0 Q ss_pred CCcceeccchhhccc-ccccc----CCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHH Q lcl|NC_021072. 206 NQGMKIATDSVTYCH-SGIQD----LNKNMTLSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAE 279 (533) Q Consensus 206 ~~~~kI~~dai~y~h-sGl~d----~~~~~i~syL~~AiK~~NqLrm-~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAe 279 (533) .....+. .+.| .|.++ +++....|-|+..+.....+.. +-+....-+-++.|-+-+.-. .++. T Consensus 214 ~~~~~~~----~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~---~~~~---- 282 (474) T protein:vir:94 214 DALQEVG----RYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGM---GMSE---- 282 (474) T ss_pred Ccccccc----cccCCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccC---CCCc---- Confidence 0000000 1111 22211 2344556777776666665543 333334445555555443321 2221 Q ss_pred HHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCc Q lcl|NC_021072. 280 QYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLG-ELEDVKYFQKKLYKALNVPS 358 (533) Q Consensus 280 qYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLg-ei~DV~YF~~kLy~aL~VP~ 358 (533) +-+..+ .. + |. .|++ + .+..+..|--..+.. ...-+.-+.+.+|....+|- T Consensus 283 ~~~~~~-~~--~--------~~-------------i~~~--~--~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~ 334 (474) T protein:vir:94 283 EMIQET-QK--S--------GA-------------FELF--D--KDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVN 334 (474) T ss_pred hhhhhh-hh--c--------ce-------------eEec--C--CCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcc Confidence 111111 00 1 11 1111 0 112244443322322 23445667788888888884 Q ss_pred cccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCC----CHhHHhhhhhceeEEEeccchHH Q lcl|NC_021072. 359 SRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVM----SLEEWDEMKEHIQFDFIADNYFT 434 (533) Q Consensus 359 sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~----t~eew~~~~~~i~~~f~~Dn~f~ 434 (533) --.+.-+| |. .|..|..-......-+.+.+..|..-+...++.=+-+-++. .+.+| ..|.+.|...---. T Consensus 335 ~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~----~~i~~~f~~~~p~d 408 (474) T protein:vir:94 335 FNSDEFNG-NV-PIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSY----LNLIFKFTRNIPVN 408 (474) T ss_pred cccccccc-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcccc----ccceEEeCCCCCCC Confidence 21111111 11 12223322222333456666666666666665543322221 23334 35788887654444 Q ss_pred HHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCC Q lcl|NC_021072. 435 ELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADD 514 (533) Q Consensus 435 E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d 514 (533) +... ++++..+. | .+|.+++++. |...+ +.+++-++|++|..+..-..|+... ++ ..+.+..++ T Consensus 409 ~~e~-------a~~~~kl~---g-~iS~et~~~~-l~~v~-d~~~E~eri~~E~~e~~~~~~~~~~-~~--~~~~~~~~~ 472 (474) T protein:vir:94 409 KLEE-------SQVLINLK---G-QVSERTRLGQ-SQLVD-DVDYELDEMEKESLEFNDKLPDIDE-GD--ANDKSQNNQ 472 (474) T ss_pred HHHH-------HHHHHHHh---c-cCchHHHHHh-CCCCC-CHHHHHHHHHHHHHHHHhhcccccC-CC--cCCCCcccc Confidence 4333 34444442 4 4899999987 66544 3445555555555433222221110 00 000111111 Q ss_pred cc Q lcl|NC_021072. 515 MS 516 (533) Q Consensus 515 ~~ 516 (533) +. T Consensus 473 s~ 474 (474) T protein:vir:94 473 SE 474 (474) T ss_pred CC Confidence 00 No 90 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=97.64 E-value=3.4e-05 Score=45.08 Aligned_cols=428 Identities=14% Similarity=0.109 Sum_probs=181.9 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhH-HHHHHHHHHhhhhcchhh--------- Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRN-EYELITRYREMVLQPECD--------- 70 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~-~~~LI~~YR~m~~~pEvd--------- 70 (533) |..-=+| +.. ......+.-|.+.+=.... +..-++. ......+|+.+..+.+-+ T Consensus 1 ~~~~~~~----~~~-~~~~~~~~~~~~~~~~~~~-----------i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~ 64 (470) T protein:vir:99 1 MKDINYG----RDK-VTGNSSFIFPKGEKLTSNE-----------LLGFIAYNETVLKPRYRENMKLYLGKHKILTAPEK 64 (470) T ss_pred CccccCC----ccc-ccCCceEEeCCCCCcCHHH-----------HHHHHHHHHHhhHHHHHHHHHHhccccccccCccc Confidence 1110000 000 0000011111111000000 0000000 001112233333332221 Q ss_pred ----------hHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeee Q lcl|NC_021072. 71 ----------SAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYH 140 (533) Q Consensus 71 ----------~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~h 140 (533) +-...||+...-+ .=+.|+.+.+.+ .++ . .+.+..+..--+|+....++++...+-|+.|.+ T Consensus 65 ~~~~~~ki~~n~~~~Ivd~~~~~-l~g~p~~~~~~~--d~~-~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~ 136 (470) T protein:vir:99 65 ETGADNRIVVNSAKYVVDVYNGY-FCGIEPKLALLN--DSS-K----IDEIARWNRQENFFDTINEISKQCDIFGRSIAS 136 (470) T ss_pred ccCCcceeecchHHHHHHHHhhh-hccCCeeEeeCC--chh-H----HHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEE Confidence 1222233321111 124566665432 111 1 223455566668999999999999999999988 Q ss_pred eeecCCCCCCCeEEEEEcChhhceehhhccCCCcCcee---EEeccceeeccchhceeccccccc--------------- Q lcl|NC_021072. 141 KVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLR---GEDINTQLTQKAAEYYLYNPKGLK--------------- 202 (533) Q Consensus 141 kvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~---~~~~~~~~~~~~~e~~~y~p~~~~--------------- 202 (533) .-+|. .|-..+..+||+.+-++..-... ..... ++.....-. ...-..+|.+.... T Consensus 137 v~~d~----dg~~~i~~~~p~~~~~i~d~~~~-~~~~~~vr~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (470) T protein:vir:99 137 IYQGE----DARPHLMYSSPNHAFIIYDDTVQ-RQPLAFVHYQIDNSNNW-TDAYGVIQYADKFYKFKGYDIEEDTNAAG 210 (470) T ss_pred EEeCC----CCeEEEEEEccceeEEEEcCCCC-cceEEEEEEEEEecCCe-eEEEEEEEecCeEEEEEeccccccccccc Confidence 76653 46678999999999765432211 11111 111100000 00001122221110 Q ss_pred cccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHH Q lcl|NC_021072. 203 NSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQY 281 (533) Q Consensus 203 ~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLr-m~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqY 281 (533) ..+|..-+||.-.+ .++....|=++..+....-+. ++=+....-+.++.|.+-+.- ..++..+.-+- T Consensus 211 ~~~~~~g~vPvv~~---------~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g---~~~~~~~~g~~ 278 (470) T protein:vir:99 211 YAINPYGLVPAVEF---------FENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIG---FKLPEDDEGNP 278 (470) T ss_pred ccccCCCccceEee---------cCCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeec---CCcccccccch Confidence 11121122222111 123334455555444444433 455556666777777665532 22221111011 Q ss_pred HHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccc Q lcl|NC_021072. 282 LREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELED-VKYFQKKLYKALNVPSSR 360 (533) Q Consensus 282 l~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sR 360 (533) +..+ -.+++++ +|=.+++.|..+.+|....+...... +.-+.+.+|...++|-.- T Consensus 279 ~~~~---~~~~~~~---------------------~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~ 334 (470) T protein:vir:99 279 KFDF---KNNRVLY---------------------VSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQ 334 (470) T ss_pred hhhh---hhcceee---------------------ecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCcccc Confidence 1110 0112211 22223344556777876666665543 788889999999999432 Q ss_pred cCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHH Q lcl|NC_021072. 361 LETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIE 440 (533) Q Consensus 361 l~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~E 440 (533) ++..+| |. .+..|...+.....-+.+.+..|...+.++++.-+-+-+.....+++ ...|.+.|...-.-.+. T Consensus 335 ~~~~~~-n~-Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~--~~~i~v~f~~~~p~~~~---- 406 (470) T protein:vir:99 335 DKNFAG-NS-SGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQEL--WSELDFKFTRNLPEDMA---- 406 (470) T ss_pred cccccc-Cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccc--cccceEEeCCCCCcCHH---- Confidence 222111 11 23345555445555577777888888877776544333332222222 23578888544333332 Q ss_pred HHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCC--CCCCCCC Q lcl|NC_021072. 441 IRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPG--NAPPADD 514 (533) Q Consensus 441 i~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~--~~~~~~d 514 (533) +.++++..+. | .+|.++++.. |...|.+ ++.++|++|....+-. . .+..+..+.. +...++| T Consensus 407 ---e~a~~~~kl~---g-iis~et~l~~-l~~vd~~--~E~eri~~E~~~~~~~-~-~~~~~~~d~~~~d~~~ee~ 470 (470) T protein:vir:99 407 ---SAIDNAKNAE---G-IVSKKTQLGM-IPDIEPD--AEMKQIAKEKADAIKQ-T-QQLSMPIDILKRDNNAEEE 470 (470) T ss_pred ---HHHHHHHHHh---c-cCCHHHHHHh-CCCCCHH--HHHHHHHHHHHHHHHH-H-HhhcCCCCcCCCCCCccCC Confidence 3444455553 5 3899999987 5555422 2333444444322100 0 0111111111 1111111 No 91 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=97.61 E-value=3.8e-05 Score=44.84 Aligned_cols=372 Identities=12% Similarity=0.118 Sum_probs=161.0 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |+ ||....++ + +.+..+........+.++...... +.. +..+++|-|.+||+-|++.+ T Consensus 1 Mg--~f~~~~~~-----~------~~~~~~~~~~~~~~~~~~~~~~~~-v~~--------~~~l~~~~v~~~i~~ia~~i 58 (382) T protein:vir:48 1 MP--IFNLATES-----P------PDNQGGFFDVVDSDFLASLKGNEW-VSA--------ETALRNSDLFSIINQLSNDL 58 (382) T ss_pred Cc--cccccccC-----C------cccccccccchhhhccccccCCcc-cch--------HhhhccHHHHHHHHHHHHhh Confidence 43 23221111 0 111122222222222222111111 110 12257899999999999886 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHH----hhhhcCceeeeeeecCCCCCCCeEEEE Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFR----RWYVDGRLFYHKVIDPKNPRGGLTELR 156 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR----~WYvDGri~~hkvid~~~~~~gI~elr 156 (533) -- .|+.+. ... .. .|+..=|-.-.+.++.+ .+++.|.-|+.++-|. .+-+++|+ T Consensus 59 a~-----~~~~~~--~~~-----~~-------~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~---~G~~~~l~ 116 (382) T protein:vir:48 59 AT-----VKLITS--RKK-----LQ-------GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNE---NGRDMKWE 116 (382) T ss_pred cc-----Cceeee--cch-----hh-------hhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECC---CCcEEEEE Confidence 53 333332 111 11 12222333334455444 4677899999987764 34589999 Q ss_pred EcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCC-CccchhH Q lcl|NC_021072. 157 YIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNK-NMTLSHL 235 (533) Q Consensus 157 ~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~-~~i~syL 235 (533) +++|..++.++.. ..+...+....... ...... .+.....++|.... .++ -.+.|.| T Consensus 117 ~i~~~~v~v~~~~---~~~~~~y~~~~~~~--~~~~~~-------~~~~~evih~~~~~----------~~~~~~G~s~l 174 (382) T protein:vir:48 117 YLRPSQVSFNRLD---NKDGIYYNITFDDP--RIPPKQ-------HVPQNDVLHFRLLS----------VDGGMTSVSPL 174 (382) T ss_pred EEcCceeEEEEcC---CCCeEEEEEEecCc--ccccee-------EEcCccEEEecCCC----------CCCccccccHH Confidence 9999999754322 11211111000000 000000 11222333343221 111 2356899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhh Q lcl|NC_021072. 236 HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDF 315 (533) Q Consensus 236 ~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDy 315 (533) ..|++++.....++....=+--.-+--+-+..++.+ +.+..+++..+..-..++ ..|.+ + .+ T Consensus 175 ~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~-~~~e~~~~~~~~~~~~~~-------n~g~~------~-vl--- 236 (382) T protein:vir:48 175 MALSRELDIQKASGNLTINSLKNALNANGILKIKGG-GLLDFKTKLSRSRQAMKQ-------MQGGP------L-VL--- 236 (382) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC-CChHHHHHHHHHHHhhcc-------CCCCe------e-Ec--- Confidence 999999999998888877766666777888888754 444455554443322211 12321 1 11 Q ss_pred cccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HHHHHHHH Q lcl|NC_021072. 316 WLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IARLRKRF 393 (533) Q Consensus 316 wLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kf-i~rLr~~f 393 (533) .+ |.+++.|.-. ..+.-++-.+|..+.+.++++||...|+..+.-+ ..++-.+. |.++ |.-+-+.+ T Consensus 237 ----~~---g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~--~~~~~~~~---~~~~~l~p~~~~i 304 (382) T protein:vir:48 237 ----DD---LEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQQ--SSLEMSSD---LYSKAVSRYLRPF 304 (382) T ss_pred ----CC---CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc--cHHHHHHH---HHHHHHHHHHHHH Confidence 12 4455555422 2223355667888999999999999996433211 12222222 4332 23333333 Q ss_pred HHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCC Q lcl|NC_021072. 394 SELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQT 473 (533) Q Consensus 394 s~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~t 473 (533) ...|...| ++..+.+ +...+..+.+ -...+++.+-. +-.++..-++ ++| T Consensus 305 ~~~l~~~l---------~~~~~~~-----~~~~~~~~~~--------~~~~~~~~l~~-----~g~~t~~e~r-~~l--- 353 (382) T protein:vir:48 305 LSELSQKL---------SCDVDAD-----IFPAVDPTGS--------NYISRINSLVK-----TGTLAQNQGL-YIL--- 353 (382) T ss_pred HHHHHHHh---------cChhhhh-----hhhhhccchh--------HHHHHHHHHhh-----cCccCHHHHH-HHH--- Confidence 33332222 1111111 0111111111 11112211110 1222333332 111 Q ss_pred HHHHHHHHHHHHHhhhcCCCCCCCccc---ccCCCCCCCCCCCCccccc Q lcl|NC_021072. 474 DQEIKEIDKQIDSEREAGLIVDPMAEM---DPAMDPGNAPPADDMSAQE 519 (533) Q Consensus 474 DeeI~e~~kqi~~E~~~~~~~~p~~~~---~~~~~~~~~~~~~d~~~~~ 519 (533) ...|..++|-... .|..++|++ +.++ T Consensus 354 --------------~~~g~~~~~~~~~~~~~~~~~GGd~------~~~~ 382 (382) T protein:vir:48 354 --------------QQAEILPKELPNGENPNSTLKGGEE------DGQD 382 (382) T ss_pred --------------hhCCCCCcchhhhhcCCCCCCCCCC------CCCC Confidence 1334443322111 111122221 1111 No 92 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=97.59 E-value=4.1e-05 Score=44.62 Aligned_cols=445 Identities=14% Similarity=0.111 Sum_probs=196.4 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeeccccc-cc-----ccchhhhhhHHHHHHHHHHhh-hhcchhhhHH Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYY-GY-----SVDFDGTVRNEYELITRYREM-VLQPECDSAV 73 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~-~~-----~~~~~~~~~~~~~LI~~YR~m-~~~pEvd~Av 73 (533) |-.++++.+.+.+....+. ..++..-..+. ....+| |- +.+.++... .+.+.++ ...--|+... T Consensus 14 ~~~~~~~~~~~~i~~~~~i---~~~~~~~~~i~-~~~~~y~g~~~~~~~~~~~~~~~-----~~~~~slnl~~~i~~~~A 84 (522) T protein:vir:47 14 GRYYMQTSNLNSILEHPKI---AVTQEEYDRIK-RNLVYYQSKWDDVQYKNTDGDIK-----SRPMNHLPIARTASKKIA 84 (522) T ss_pred HHHHhhcccchhccccCCC---CCCHHHHHHHH-HHHHHhcCCcccccccccCcchh-----cccceecchHHHHHHHHh Confidence 4445554443332111110 00110000000 011111 10 000111111 1112222 1111122222 Q ss_pred HHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeE Q lcl|NC_021072. 74 DDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLT 153 (533) Q Consensus 74 deIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~ 153 (533) +-+.|| |+.+.+++ +..++.+..++.--+|.....+.+-.+..-|-.+|...+|. |-. T Consensus 85 ~lv~~e---------~~~i~v~d--------~~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~-----~~~ 142 (522) T protein:vir:47 85 SLVYNE---------QATITTKN--------EILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYIDG-----DKV 142 (522) T ss_pred hhhcCC---------cceeecCC--------hHHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEcC-----Cce Confidence 222233 33444433 24455667777777899999999999999999999999973 234 Q ss_pred EEEEcChhhceehhhccCCCcCceeEEe----ccc---eeec----------------------cchhceeccccccccc Q lcl|NC_021072. 154 ELRYIDPRKIRKVTEYQQKRPEQLRGED----INT---QLTQ----------------------KAAEYYLYNPKGLKNS 204 (533) Q Consensus 154 elr~lDP~~i~~vr~~~~~~~~~~~~~~----~~~---~~~~----------------------~~~e~~~y~p~~~~~~ 204 (533) .+.+++|.++-|++-..+......-+.. .+. .++. +.-.+.+|.-... .+ T Consensus 143 ~i~~v~ad~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~-~~ 221 (522) T protein:vir:47 143 RVAFIQAPVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVN-DV 221 (522) T ss_pred EEEEEcCCceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCC-cc Confidence 6888898888775321111100000000 000 0110 0001111211000 00 Q ss_pred cCCcceeccchh--------hcccccc---------------ccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_021072. 205 TNQGMKIATDSV--------TYCHSGI---------------QDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAP 261 (533) Q Consensus 205 ~~~~~kI~~dai--------~y~hsGl---------------~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAP 261 (533) .+ ..++-..+ ....+|+ .+..++.++|-++.|+-.+--|-..=+ .+.|.+|.= T Consensus 222 lG--~~v~l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s--~~~~e~~~g 297 (522) T protein:vir:47 222 LG--QRVNLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYD--EFMWEVRMG 297 (522) T ss_pred cC--ccccccccccccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHH--HHHHHHHhc Confidence 01 11111000 0011121 233466778999998887776664444 456777877 Q ss_pred cceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCC-CCccceeecCCCCCcchH Q lcl|NC_021072. 262 ERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREG-GRGTEISTLPGGQNLGEL 340 (533) Q Consensus 262 eRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReg-grgTEIsTLpGg~nLgei 340 (533) .+|||. |-.=++ ..-+..+|+...-..+- --+.+|.+-..+ +-|--|+++...---++. T Consensus 298 ~~~i~v-~~~~l~------------------~~~~~~~g~~~~~~~fd-~~~~~f~~~~~~~~~~~~i~~~~~~ir~e~~ 357 (522) T protein:vir:47 298 QRRVIV-PEHLTQ------------------RQYQRPDGTIDFRPRFD-VEQNVYMQIGGSSMDAGGITDLTSPIRANDY 357 (522) T ss_pred cceeec-chHHhc------------------cCCCCCCcccccccccC-cccceEeecCCCCCCCCcceeeccccChHHH Confidence 788775 111110 11122233211100000 001111111110 011125555443322222 Q ss_pred H-HHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcc---CCCHhHH Q lcl|NC_021072. 341 E-DVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKG---VMSLEEW 416 (533) Q Consensus 341 ~-DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkg---i~t~eew 416 (533) . =+..+.+.+=...+++-+-|+.+++. ...++||...+-.-..-+.+.|+.+...+.++++.=|-|-. .++..-+ T Consensus 358 ~~~~~~~l~~i~~~~gls~~tf~~~~~~-~kTAtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~ 436 (522) T protein:vir:47 358 ILAISEGLKLFEMQIGVSSGMFTFDGQG-MKTATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIP 436 (522) T ss_pred HHHHHHHHHHHHHHhCCCccccCccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCC Confidence 1 24555666666777777777766542 34567776666665556788888877777777776553321 1111111 Q ss_pred hhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCC Q lcl|NC_021072. 417 DEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDP 496 (533) Q Consensus 417 ~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p 496 (533) . ...|.++|. |+.+.. +++++-+.+ .+++ .| .+|....+.+..+.||+|.+++-++|++|.... .+| T Consensus 437 ~--~~~i~v~f~-D~i~~D-~~~~~~~~~-~~v~-----aG-~~s~e~~i~~~~g~~eeea~~el~ri~~E~~~~--~~~ 503 (522) T protein:vir:47 437 E--LDDISVNLD-DGVFTD-RHAELDYWA-KMVA-----AG-FSTKKRAIGKTLNISGVEAEKELNAINSELLPM--NDA 503 (522) T ss_pred C--cceeEEEcC-CCCCCC-HHHHHHHHH-HHHh-----cC-CCCHHHHHHhcCCCChHHHHHHHHHHHHhhccC--CCC Confidence 1 123677776 544444 222222211 1111 13 588888767778999999999999999986543 112 Q ss_pred CcccccCCCCCCCCCCCCc Q lcl|NC_021072. 497 MAEMDPAMDPGNAPPADDM 515 (533) Q Consensus 497 ~~~~~~~~~~~~~~~~~d~ 515 (533) .+...+++.....+.+.++ T Consensus 504 ~~~~~~~~~~~~~~~d~~~ 522 (522) T protein:vir:47 504 ELAIYGMHDQNEEKADDKG 522 (522) T ss_pred CCCCCCCCCcccccCCCCC Confidence 2222222111111111111 No 93 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=97.52 E-value=5.2e-05 Score=44.07 Aligned_cols=401 Identities=14% Similarity=0.123 Sum_probs=180.6 Q ss_pred ccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcceeeec Q lcl|NC_021072. 5 LFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNETICGN 84 (533) Q Consensus 5 ~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvneaiv~d 84 (533) .|--.+++. .....+..+.+-.+. ......++.|-.+ . -....++|-|.+||+-|.+.+-. T Consensus 1 ~~f~~~f~r--~~~~~~~~~~~~~~~-~~~~~~~~~g~~v------~--------~~~~l~~~~v~~~i~~Ia~~iA~-- 61 (413) T protein:vir:48 1 MFFSGLFQR--KSDAPVTTPAELAEA-IGLSYDTYTGKRI------S--------SQRAMRLTAVYSCVRVLAESVGM-- 61 (413) T ss_pred Cccchhhcc--CccCCccchHHHHHh-hhcCcccccCcee------c--------hhhhhccHHHHHHHHHHHHhhhh-- Confidence 222223221 111111111111111 0000001111111 0 02235688999999999998553 Q ss_pred CCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcc----hhhhhH----HHHhhhhcCceeeeeeecCCCCCCCeEEEE Q lcl|NC_021072. 85 FDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDF----ENRSYE----IFRRWYVDGRLFYHKVIDPKNPRGGLTELR 156 (533) Q Consensus 85 ~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f----~~~~~~----~fR~WYvDGri~~hkvid~~~~~~gI~elr 156 (533) .|+.+--..-+..+.+++ ..++++|+. .-.+.+ ++..+++.|.-|..++-+ .+.+++|. T Consensus 62 ---~p~~~~~~~~~~~~~~~~------~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~----~g~~~~L~ 128 (413) T protein:vir:48 62 ---LPCSLYKISGTLKTRVVD------ERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA----LGEVVELL 128 (413) T ss_pred ---CceEEEEecCCcceeecc------cHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC----CCcEEEEE Confidence 444443111111111110 234444432 234444 444567789998875432 45699999 Q ss_pred EcChhhceehhhccCCCcCceeEEeccceeeccchhceecccccc--ccccCCcceeccchhhccccccccCCCCccchh Q lcl|NC_021072. 157 YIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGL--KNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSH 234 (533) Q Consensus 157 ~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~--~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~sy 234 (533) +|+|..++..... +...++ .++.+.|. .+.+.+.++|..- ..++-.++|- T Consensus 129 ~l~~~~v~~~~~~-----~~~~~y-------------~~~~~~g~~~~~~~~evih~~~~----------~~d~~~G~s~ 180 (413) T protein:vir:48 129 PIDPGCVEPKLNS-----QWQPVY-------------QVTFPDGSVDVLTQDEIWHVRTL----------TLDGLVGLNP 180 (413) T ss_pred EEcCceEEEEEcC-----CceEEE-------------EEEecCceEEEEccccEEEecCc----------CCCCcccccH Confidence 9999999653221 111111 11111111 1122233333211 1223356789 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhh Q lcl|NC_021072. 235 LHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLED 314 (533) Q Consensus 235 L~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlED 314 (533) |+.|.+++.....+++...-+---.+.-+-++.++ +.+.+..+++-.+.+...|+.- ...|.+ | .+ T Consensus 181 i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~~~~e~~~~~~~~~~~~~~g~----~n~g~~------~-vl-- 246 (413) T protein:vir:48 181 IAYAREAISLAAATEEHGARLFGNGAVTSGVLRTE-QKLTPDAYERLKKDFEERHTGL----GNAHRP------M-IL-- 246 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeC-CCCCHHHHHHHHHHHHHHhcCc----cccCcc------e-ec-- Confidence 99999999888888877766555556667888887 5677776666666655555431 111211 1 11 Q ss_pred hcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHH Q lcl|NC_021072. 315 FWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRF 393 (533) Q Consensus 315 ywLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~f 393 (533) ..|.++..|.-. +.+.-++-.++....+.++++||..-|+..+.-+....++..+. |.++ .|+- + T Consensus 247 --------~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~---f~~~--~i~P-~ 312 (413) T protein:vir:48 247 --------EMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLG---FINY--SLVP-Y 312 (413) T ss_pred --------CCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHHHH---HHHH--HHHH-H Confidence 134566666321 22222445568889999999999999975443333333333322 5543 2221 1 Q ss_pred HHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCC Q lcl|NC_021072. 394 SELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQT 473 (533) Q Consensus 394 s~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~t 473 (533) ...+.+. +-++++++.++.. ..+.|. +.++...+ +..|.++++.+-. +-+++.+-++. ++++. T Consensus 313 ~~~ie~~-----l~~~L~~~~~~~~----~~~~fd----~~~l~~~d-~~~~~~~~~~~~~--~g~~T~NE~R~-~~g~~ 375 (413) T protein:vir:48 313 LTRIEQR-----INTGLVRESKQGK----FYAKFN----AGALLRGD-MKSRFEAYATGIN--WGIYSPNDCRD-LEDMN 375 (413) T ss_pred HHHHHHH-----HHhhccCccccCC----eEEEEe----chhhhccC-HHHHHHHHHHHHh--CCCcCHHHHHH-HhCCC Confidence 1122222 3334455555532 344454 23343321 2345555554322 24666676663 35554 Q ss_pred HHHHHHHHHHHHHhhhcCCCCCCCcccccCCCC-CCCCCCCCccccccccC Q lcl|NC_021072. 474 DQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDP-GNAPPADDMSAQEGPAV 523 (533) Q Consensus 474 DeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~-~~~~~~~d~~~~~~~~~ 523 (533) +-+ -.+..+. |. +..+.... ...++..+...+++.+. T Consensus 376 p~~-----------ggD~~~~-~~-n~~~~~~~~~~~~~~~~~~~~~~~~~ 413 (413) T protein:vir:48 376 PRP-----------GGDVYLT-PM-NMTTSPSAGDDNGKKKESGDADKTAS 413 (413) T ss_pred CCC-----------Ccceeec-cc-cccccccccccCCCCCCCCCccccCC Confidence 321 0111111 10 00111000 01111111111111111 No 94 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=97.50 E-value=5.5e-05 Score=43.92 Aligned_cols=406 Identities=15% Similarity=0.186 Sum_probs=168.7 Q ss_pred CCcc---ccce----eeecccccc-ccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhH Q lcl|NC_021072. 1 MSNQ---LFGF----SLERAKKVP-KGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSA 72 (533) Q Consensus 1 ~~~~---~fg~----~i~~~~~~~-~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~A 72 (533) .-.. +||. -....+... ...|+.|- +. ....+++...+....+ .-...+++|-|..| T Consensus 2 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~-~~------~~~~~~~~~~~~g~~v--------~~~~a~~~~aV~~~ 66 (432) T protein:vir:97 2 PDEKKLGLLGQLKAMFVPPDPVDIGGGQTFTPV-NA------TARDLGIIISDTGAAV--------NADAIMRLDAVAAC 66 (432) T ss_pred CCcccCchhhhhHhhcCCccccccccccccccC-ch------hhhhhcccccccCccc--------chHhhhcchHHHHH Confidence 1111 1221 111101000 00011110 00 0001111111111111 11234577999999 Q ss_pred HHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcc----hhhhhHHHH----hhhhcCceeeeeeec Q lcl|NC_021072. 73 VDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDF----ENRSYEIFR----RWYVDGRLFYHKVID 144 (533) Q Consensus 73 vdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f----~~~~~~~fR----~WYvDGri~~hkvid 144 (533) |+-|.+.+-. .|+.|--...+.. ++.+. .-++++|+. ...+.++.+ .+.+.|.-|..++-+ T Consensus 67 v~~Ia~~ia~-----lp~~~y~~~~~g~---~~~~~---~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~ 135 (432) T protein:vir:97 67 VKLVSQAVAA-----MPLMMYMRTPDGR---KEAVN---HPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT 135 (432) T ss_pred HHHHHHhhcc-----CceEEEEecCCCc---ccccc---cHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec Confidence 9999887543 4555532221111 11111 234455543 235555444 467889998887664 Q ss_pred CCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccc Q lcl|NC_021072. 145 PKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQ 224 (533) Q Consensus 145 ~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~ 224 (533) .+.+++|.+|+|..++.++... +..++ .++...+ ....++.+-|.+.. + . T Consensus 136 ----~g~~~~L~~l~p~~v~v~~~~~-----g~~~y-------------~~~~~~g------~~~~~~~~~iih~r-~-~ 185 (432) T protein:vir:97 136 ----DGRIESLQYLANDRLTITTDTK-----GNTAY-------------RYRRTDG------QMIDIPRQQIWKIM-G-Y 185 (432) T ss_pred ----CCcEEEEEEEcCcceEEEEcCC-----CcEEE-------------EEEecCc------eEEEEccccEEEec-C-c Confidence 2569999999999997643321 11111 0111111 11223322222211 0 0 Q ss_pred cCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCcccc Q lcl|NC_021072. 225 DLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKD 304 (533) Q Consensus 225 d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~ 304 (533) ..++-.++|.|..|.+++.....+++...=+=---+--.-|..+| +.|-+..++.. ++-+.... ..|.+ T Consensus 186 ~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~~~-~~~~~~~~-------nag~~-- 254 (432) T protein:vir:97 186 SLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQID-RFLTDDQYDSF-SKKVSGSV-------EAGRA-- 254 (432) T ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecC-CCCCHHHHHHH-HHHHhhhh-------cCCCc-- Confidence 122335679999999998877777765432221122234455555 44544443332 22111111 11221 Q ss_pred ccccchhHhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCCC--cccccchhhhhHHh Q lcl|NC_021072. 305 DKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGE---LEDVKYFQKKLYKALNVPSSRLETET--TFNIGRAAEITRDE 379 (533) Q Consensus 305 d~~~msmlEDywLpRReggrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~~--~~~~g~~~eItRDE 379 (533) + .++ .|.+++.|. .+..+ ++-.+|....+.++++||...|+... ..+.|.+ +.-.- T Consensus 255 ----~-vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~--~e~~~ 315 (432) T protein:vir:97 255 ----P-LLE----------GGMDVKSLG--LNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSG--IESQQ 315 (432) T ss_pred ----e-ecC----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchh--HHHHH Confidence 1 221 234455542 22223 33456888999999999999997543 3333322 32222 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_021072. 380 VKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKY 459 (533) Q Consensus 380 lkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky 459 (533) +.|.++ .|+--+. .+.+.|-. +++++.++. ...++|. +..+.... +..|.+.+..+-. +-+ T Consensus 316 ~~f~~~--tl~P~~~-~ie~~ln~-----kLl~~~e~~----~~~~~fd----~~~llr~d-~~~r~~~~~~~~~--~G~ 376 (432) T protein:vir:97 316 LGFLTM--TLSPWLR-RIEQSIAL-----NLLTPAERR----RYFADFD----TSALLRAD-SAARSSYYSQLVN--NGL 376 (432) T ss_pred HHHHHH--HHHHHHH-HHHHHHhh-----hccCccccC----ceEEEee----chhhhccC-HHHHHHHHHHHHh--CCC Confidence 236543 3333221 23333333 334555443 3456665 33333222 3467777666522 257 Q ss_pred ccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCC----CCCCCCCCccccccccCCccccc Q lcl|NC_021072. 460 FSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDP----GNAPPADDMSAQEGPAVDAGDAK 529 (533) Q Consensus 460 ~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~----~~~~~~~d~~~~~~~~~~~~~~~ 529 (533) +|.+-++. .++|...+ .. ...+. .+....|-... ...+..++.+.+ ++...+ T Consensus 377 ~T~NE~R~-~~glpp~~--g~---------~~~~~-~~~~~~pl~~~~~~~~~~~~~~~~~~~-----~~~~~~ 432 (432) T protein:vir:97 377 MTRDEARE-IEGLPKLG--GN---------AAVLT-VQSAMVPLDSIGLQASPEPASGLGNQQ-----QDKVSK 432 (432) T ss_pred CCHHHHHH-HhCCCCCC--CC---------cceEe-ecccccchhhhcccCCCCCCCCCCCcc-----cccccC Confidence 77777764 45664311 00 00000 00000010000 000011111111 000000 No 95 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=97.47 E-value=6e-05 Score=43.73 Aligned_cols=425 Identities=14% Similarity=0.043 Sum_probs=201.8 Q ss_pred CCccccceeeecccccccc---CCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhh Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKG---PSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIV 77 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~---~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIv 77 (533) |...|.|-.=+..+..... .+.+.... ..+..+.+.+-....+..++.+.-.++.|++|..++.|.++++-+. T Consensus 1 ~~~~i~~~~g~~~~~~~~~~~~~~~ia~~~----~~~~~~~~~~~~p~~~~il~~~~~~~~~y~~m~~D~~i~s~l~~Rk 76 (491) T protein:vir:79 1 MSKGLWVSPTEFVKFGEPDKSLSSQIATRA----RSIDFFALGMYLPNPDPVLKALGKDIRVYRELRADAHVGGCVRRRK 76 (491) T ss_pred CCCeeeCCCCCcccccccchhHHHHHhhhc----cccccccccccCcchhHHHhhccCCHHHHHHHhhChHHHHHHHHHH Confidence 8888876443332211100 00111000 0011222333334445555544445789999999999999999998 Q ss_pred cceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEE Q lcl|NC_021072. 78 NETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRY 157 (533) Q Consensus 78 neaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~ 157 (533) ..+... +..|... +.++.+.+.|++ .++-++|++-..+++--. .-|--+++++....+..-.+.++.. T Consensus 77 ~av~~~-----~w~i~~~--~~~~~~a~~i~e----~l~~~~~~~~i~~~lda~-~~G~s~~Ei~w~~~~g~~~~~~l~~ 144 (491) T protein:vir:79 77 AAVKAL-----EWGLDRG--KAKSRVAKSIAD----VFADLDLSRIATEMLDAV-LYGYQPMEITWGKVGNYIVPIDVVG 144 (491) T ss_pred HHHhCC-----CcEEecC--CCCHHHHHHHHH----HHhcCCHHHHHHHHHHhh-hhcceeEEEEEeecCCeeeEEeeee Confidence 776643 3344321 223344444444 444567777666665433 3688889998877555556678888 Q ss_pred cChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccc-hhhccccccccCCCCccchhHH Q lcl|NC_021072. 158 IDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATD-SVTYCHSGIQDLNKNMTLSHLH 236 (533) Q Consensus 158 lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~d-ai~y~hsGl~d~~~~~i~syL~ 236 (533) .+|+.+++-.+ .+..+... . ....+.-+|+. .+++.|. -...++...|-|+ T Consensus 145 r~~~~f~~d~~------~~l~l~~~------~--------------~~~~g~~lp~~k~i~~~~~--~~~g~p~g~gLl~ 196 (491) T protein:vir:79 145 KPADWFVYDPE------NQLRFRSK------E--------------HWVQGEELPARKFLVPRQE--ATYLNPYGFPDLS 196 (491) T ss_pred ecccceeeccC------CceEEeec------C--------------CCCCceeecCCCeEEEEec--CCCCCcccchhHH Confidence 88887754111 11111100 0 00112223332 2333331 1223355668899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHh-cCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhh Q lcl|NC_021072. 237 KAIKAVNQLRMIEDSLVIYRLS-RAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDF 315 (533) Q Consensus 237 ~AiK~~NqLrm~EDalVIyRi~-RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDy 315 (533) .|..+|-=.+......+.+=-. =.|-| +...|-|+-.+.|. ..++.+. ...+.- | -.+ T Consensus 197 ~~~w~~~fK~~~~~~w~~f~E~~G~P~~-igky~~~a~~~ek~-~l~~al~-~~~~~a------~------~vi------ 255 (491) T protein:vir:79 197 MCFWPTTFKKGGLKFWVQFTEKYGSPML-VGKHPRSASDAETN-LLLDRLE-DMVQDA------V------AVI------ 255 (491) T ss_pred HHHHHHHHHHhhHHHHHHHHHHcCCCeE-EEecCCCCCHHHHH-HHHHHHH-HHhcCe------E------EEe------ Confidence 9999888777776666655443 34554 44557776555443 2333222 222111 1 111 Q ss_pred cccccCCCCccceeecCCCCCcch----HHHHHHHHHHHHHhcCCCccc-cCCCC--cccccchhhhhHHhhhHHHHHHH Q lcl|NC_021072. 316 WLPRREGGRGTEISTLPGGQNLGE----LEDVKYFQKKLYKALNVPSSR-LETET--TFNIGRAAEITRDEVKFQKFIAR 388 (533) Q Consensus 316 wLpRReggrgTEIsTLpGg~nLge----i~DV~YF~~kLy~aL~VP~sR-l~~~~--~~~~g~~~eItRDElkF~Kfi~r 388 (533) | .|++|+.+.-+..-|. ..=++|..++.-+++- +- |.+++ +.++| ++- .| -+...++. T Consensus 256 --P-----~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iL---GqtlTt~~~gs~a~~---~vh-~~-v~~~i~~~ 320 (491) T protein:vir:79 256 --P-----DDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIALL---GQNQTTEATSTRASA---QAG-LE-VTDDIRDG 320 (491) T ss_pred --c-----CCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHHh---hhhhccCcccchhhH---HHH-HH-HHHHHHHH Confidence 1 4688888743222222 2338888888877762 11 22332 33332 221 12 25566777 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHH Q lcl|NC_021072. 389 LRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQ 468 (533) Q Consensus 389 Lr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~ 468 (533) .++..+..|.++++--+.+.+- ......|.|.. ..+ +...+.+.+..+-+. |-=++.+|++++ T Consensus 321 D~~~i~~tln~li~~l~~~N~~--------~~~~p~f~~~e------~ee--~~~~~a~~~~~L~~~-G~~i~~~~~~e~ 383 (491) T protein:vir:79 321 DKAIVVEAMNMLIRWICDLNFD--------GAARPVFDMWE------QEQ--VDEIQAGRDEKLTRA-GARFTPAYFKRA 383 (491) T ss_pred HHHHHHHHHHHHHHHHHHhcCC--------CCCcceEeecC------cCc--hhHHHHHHHHHHHhC-CCccCHHHHHHH Confidence 7777888888766655555553 11334455442 222 222334444444443 434899999866 Q ss_pred HhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCC--CCCCcccccccc-----------------CCccccc Q lcl|NC_021072. 469 VLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAP--PADDMSAQEGPA-----------------VDAGDAK 529 (533) Q Consensus 469 IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~--~~~d~~~~~~~~-----------------~~~~~~~ 529 (533) +++...+.++.-. + .+.|.+.....+.....+ .+.|...+..++ +.++++. T Consensus 384 -~Gip~~~~~e~~~--------~-~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~s~ 453 (491) T protein:vir:79 384 -YNLQDGDLDERPL--------P-VSAVDAVGAASFAEFEAPDQDALDAALNALSARDLNADAQALVAPLLKRIANGASA 453 (491) T ss_pred -hCCCCCCCCcccc--------C-cCcccccccccccccCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCH Confidence 7776543322110 0 001111100111111111 111111111000 0000000 Q ss_pred hhcC Q lcl|NC_021072. 530 RGEF 533 (533) Q Consensus 530 ~~~~ 533 (533) + |+ T Consensus 454 ~-e~ 456 (491) T protein:vir:79 454 D-EL 456 (491) T ss_pred H-HH Confidence 0 00 No 96 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=97.44 E-value=6.6e-05 Score=43.51 Aligned_cols=417 Identities=14% Similarity=0.162 Sum_probs=184.9 Q ss_pred ccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhh-----hcch---------hhhHHHHhhcc Q lcl|NC_021072. 14 KKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMV-----LQPE---------CDSAVDDIVNE 79 (533) Q Consensus 14 ~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~-----~~pE---------vd~AvdeIvne 79 (533) -+..+..-|.-|++.+-....+.. +.. ...... .+...+..|=+-. ..++ +.+-..-||+- T Consensus 1 ~~~~~~~~~~~p~d~~~~~~~l~~-~i~---~~~~~~-~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~ 75 (453) T protein:vir:39 1 MKYKPPKLMTFPKDEPITNEVVTK-FME---KHRLEV-ARYEYLKNMYRGIMAIDAEPTKDLWKPDNRLTVNFTKYIVDT 75 (453) T ss_pred CeecCCcceEcCCCCCCCHHHHHH-HHH---HHHHHH-HHHHHHHHHhhccCchhcCCCccccCccceeecchHHHHHHH Confidence 122233344444433321111111 000 001111 1222122221110 0000 00111122221 Q ss_pred eeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcC Q lcl|NC_021072. 80 TICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYID 159 (533) Q Consensus 80 aiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lD 159 (533) ..-+ .-+.|+.+..++ +. ..+.+..++.--+|+....+..+.+++-|+-|++.-.| ++|-..++.+| T Consensus 76 ~~~~-l~g~~~~~~~~d----~~----~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d----~~g~~~i~~~~ 142 (453) T protein:vir:39 76 FTGY-FNGIPVKKSHSD----KE----TLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQN----EETQTNVIYNT 142 (453) T ss_pred Hhhh-hcccCceeccCC----hH----HHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEec----CCCceEEEEEc Confidence 1100 124555554433 22 23456677777789999999999999999999887665 34667899999 Q ss_pred hhhceehhhccCCCcC--ceeEEeccceeeccchhceecccccccc-------------ccCCcceeccchhhccccccc Q lcl|NC_021072. 160 PRKIRKVTEYQQKRPE--QLRGEDINTQLTQKAAEYYLYNPKGLKN-------------STNQGMKIATDSVTYCHSGIQ 224 (533) Q Consensus 160 P~~i~~vr~~~~~~~~--~~~~~~~~~~~~~~~~e~~~y~p~~~~~-------------~~~~~~kI~~dai~y~hsGl~ 224 (533) |+.+-.+..-.....- ..+++..... ..-..+|.+..... .+|..-+||.=. | T Consensus 143 p~~~~~v~d~~~~~~~~~~ir~~~~~~~----~~~~~~yt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~--~------ 210 (453) T protein:vir:39 143 PENMFMVYDDTIKQEPLFAVRYGYDDDY----KLYGEVYTKETTYALNGTMGFYNMTEQAPNPFDDLPVVE--F------ 210 (453) T ss_pred ccceEEEecCCCCCeEEEEEEEEEeCCe----EEEEEEEeCCeEEEEEecCCceeeecccccCCCceeEEE--e------ Confidence 9999776543221110 0111111111 11112333332210 111111222111 1 Q ss_pred cCCCCccchhHHHHHHHHHHH-HHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcc-cEEEeeCCCCcc Q lcl|NC_021072. 225 DLNKNMTLSHLHKAIKAVNQL-RMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYR-NKLVYDANTGEI 302 (533) Q Consensus 225 d~~~~~i~syL~~AiK~~NqL-rm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~r-nk~vYd~~TGev 302 (533) +++....|=++..+....-+ +++-+....-+.++.|-+-+.-. +++... +.+.+ +++. . T Consensus 211 -~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~---~~~~~~--------~~~~~~~~~~-~------ 271 (453) T protein:vir:39 211 -YFNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGA---AVEEED--------LKNIRSNRVI-N------ 271 (453) T ss_pred -cCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecC---CCCchh--------hhhhhhccee-e------ Confidence 12344556665554444333 34556666667778886655432 233211 11111 1111 0 Q ss_pred ccccccchhHhhhccccc-CCCCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCCccccCCCCcccccchh--hhhHH Q lcl|NC_021072. 303 KDDKKFMSMLEDFWLPRR-EGGRGTEISTLPGGQNLGELE-DVKYFQKKLYKALNVPSSRLETETTFNIGRAA--EITRD 378 (533) Q Consensus 303 ~~d~~~msmlEDywLpRR-eggrgTEIsTLpGg~nLgei~-DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~--eItRD 378 (533) ++-. ..+.+.++.+|....+.+.+. -+.-+.+.+|....+|- +..+ .| |++| .|.-- T Consensus 272 --------------~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~-~~--gn~Sg~Al~~~ 332 (453) T protein:vir:39 272 --------------YYGESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVAN--ISDE-SF--GSSSGVSLAYK 332 (453) T ss_pred --------------ecCCCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcc--cccc-cc--cCChHHHHHHH Confidence 1100 112344567766555555444 35667777888888884 2222 22 3332 33333 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-CHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_021072. 379 EVKFQKFIARLRKRFSELFMDLLKTQLILKGVM-SLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVG 457 (533) Q Consensus 379 ElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~-t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vG 457 (533) +.....-+.+.|..|..-+..+++.=+-+-+.. ...+|. .|.+.|...-.=.+ .+.++++..+ +| T Consensus 333 ~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~----~i~v~f~~~~p~~~-------~~~a~~~~kl---~g 398 (453) T protein:vir:39 333 LQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVSNKEAWK----DIEYTFTRNEPKDI-------KEQAETANIL---MG 398 (453) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccc----cceEEeCCCCCcCH-------HHHHHHHHHH---hc Confidence 333445577777888887777777533332322 223333 46788864332222 2345556655 35 Q ss_pred ccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCcc Q lcl|NC_021072. 458 KYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMS 516 (533) Q Consensus 458 ky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~ 516 (533) .+|.+++++. |...++ .+++-++|++|.....-.++. .+.++.+..++.+.++-- T Consensus 399 -~is~et~l~~-l~~v~D-~~~E~~ri~~E~~~~~~~~~~-~~~~~~~~~~~~~~~~~e 453 (453) T protein:vir:39 399 -ITSQETALSV-ISVIPD-VQAEMEKIKKEEASTAIFDKD-KQPSEKGTDTVVPETNEE 453 (453) T ss_pred -cCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHHHHh-ccCCCCCCCCCCCCcCCC Confidence 4899999976 666542 334445566666543222111 111111111121111111 No 97 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=97.44 E-value=6.7e-05 Score=43.46 Aligned_cols=408 Identities=11% Similarity=0.082 Sum_probs=181.8 Q ss_pred CCCCCCcccceeecccccccccchhhhhhH-HHHHHHHHHhhhhcchhhhHHHHh--------hcceeeecCCCceEE-- Q lcl|NC_021072. 23 VQKDSMDGSQPIVGGGYYGYSVDFDGTVRN-EYELITRYREMVLQPECDSAVDDI--------VNETICGNFDDVPVE-- 91 (533) Q Consensus 23 ~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~-~~~LI~~YR~m~~~pEvd~AvdeI--------vneaiv~d~~~~~v~-- 91 (533) ..++..+ .-..+-+ -.....+|+.+..+.+=...+... -|--++.+.-..+|+ T Consensus 1 ~~~~~~~----------------~i~~l~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~ 64 (441) T protein:vir:80 1 MNSDELA----------------LIEGMYDRIQRLSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDAL 64 (441) T ss_pred CCccHHH----------------HHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHH Confidence 0000000 0000111 111122333333333222222111 011111111111110 Q ss_pred -----EEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcChhhceeh Q lcl|NC_021072. 92 -----VELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKV 166 (533) Q Consensus 92 -----v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~v 166 (533) ++--..+..+. .+.+.+.-+|+....+.++.-.+-|+-|.+.-.| ..|-..++.+||+.+-.| T Consensus 65 ~~~l~~~g~~~~d~~~--------l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d----~~g~~~i~~~~p~~~~~i 132 (441) T protein:vir:80 65 EERLDWLGWTNGDGYG--------LDGVYAANRLATASCDVHLDALIFGLSFVAIIPH----GDGTVSVRPQSPKNCTGK 132 (441) T ss_pred HhhhccccccCCChHH--------HHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeC----CCCceEEEEEccceEEEE Confidence 11001112222 3445555678899999999999999998876554 346678999999998665 Q ss_pred hhccCCCcCceeEEeccceeeccchhceecccccccc--------------ccCCcceeccchhhccccccccCCCCccc Q lcl|NC_021072. 167 TEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKN--------------STNQGMKIATDSVTYCHSGIQDLNKNMTL 232 (533) Q Consensus 167 r~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~--------------~~~~~~kI~~dai~y~hsGl~d~~~~~i~ 232 (533) ..-. ......+...............+|.+..... .+|.--++|. ++|....-.+ ...+. T Consensus 133 ~d~~--~~~~~~~~~~~~~~~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv--v~~~n~~~~~--~~~G~ 206 (441) T protein:vir:80 133 FSAD--GSRLDAGLVVQQTCDPEVVEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPL--VPIVNRRRTS--RIDGR 206 (441) T ss_pred EeCC--CCceeEEEEEEEEecCceEEEEEEecCeEEEEEEcCCcceeeccccccCCCceeE--EEeeccccCC--ccCCc Confidence 4321 1112211111111011111112333322110 1111111221 1121111111 11223 Q ss_pred hhHHHHHHHHH-H-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccch Q lcl|NC_021072. 233 SHLHKAIKAVN-Q-LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMS 310 (533) Q Consensus 233 syL~~AiK~~N-q-Lrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~ms 310 (533) |-|...++++- - -+++-+..++-+.+..|.|-+.=.+.+..+. +. .... T Consensus 207 s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~G~~~~~~~~--------~~--------------~~~~------- 257 (441) T protein:vir:80 207 SEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVTGVSADEFSQ--------PG--------------WVLS------- 257 (441) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeeecCCcccccc--------ch--------------hhhc------- Confidence 43433333321 1 2456677778888888877554211111110 00 0000 Q ss_pred hHhhhc-ccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHH Q lcl|NC_021072. 311 MLEDFW-LPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARL 389 (533) Q Consensus 311 mlEDyw-LpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rL 389 (533) + .-+| +|--++|.+.++..+|++.-=.-++-++=....++...++|.+-|+..+. +...|..|.--+...-.-+.+. T Consensus 258 ~-~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~-~~~Sg~Al~~~~~~l~~k~~~~ 335 (441) T protein:vir:80 258 M-ASVWAVDKDDDGDTPNVGSFPVNSPTPYSDQMRLLAQLTAGEAAVPERYFGFITS-NPPSGEALAAEESRLVKRAERR 335 (441) T ss_pred c-cccccCCCCCCCCcceeEecCccchHHHHHHHHHHHHHHhcccCCCHHHhccCCC-cchHHHHHHHHHHHHHHHHHHH Confidence 0 0122 34444555567777876431122333444556777778888777754332 2112333444444455667888 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHH Q lcl|NC_021072. 390 RKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQV 469 (533) Q Consensus 390 r~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~I 469 (533) ++.|..-+...++.=+-+.|. ..++......+.+.|...-.=. +.+.++.+.++..-+-...|.++++ .. T Consensus 336 ~~~f~~~l~~~~~l~~~~~~~--~~~~~~~~~~i~~~f~~~~~~~-------~~e~ad~~~kl~~~g~~~~s~~~~~-~~ 405 (441) T protein:vir:80 336 QTSFGQGWLSVGFLAAKALDS--RVDEADFFGDVGLRWRDASTPT-------RAATADAVTKLVGAGILPADSRTVL-EM 405 (441) T ss_pred HHHHHHHHHHHHHHHHHHhcC--CCcccccceeeeEEeCCCCCcC-------HHHHHHHHHHHHhcCcccccHHHHH-Hh Confidence 888888888888765555553 2334444456788887543222 2355666666655433456878877 56 Q ss_pred hCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCc Q lcl|NC_021072. 470 LKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDM 515 (533) Q Consensus 470 L~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~ 515 (533) |+++++|++++.+.-+++.. . +. +..+. -....+.. T Consensus 406 l~~~~~e~~~~~~e~~e~~~-~-~~----~~~~~----~~~~~~~~ 441 (441) T protein:vir:80 406 LGLDDVQVEAVMRHRAESSD-P-LA----VLAGA----ISRQTNEV 441 (441) T ss_pred CCCCHHHHHHHHHHHHHHHH-H-HH----HHhhh----hhcccccC Confidence 89999999977664333221 1 11 10000 00000011 No 98 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=97.43 E-value=6.9e-05 Score=43.39 Aligned_cols=423 Identities=12% Similarity=0.153 Sum_probs=194.2 Q ss_pred ccceeeecccccccc--CCCCCCCCCc-ccceeecccccccccchhhhhhHHHHHHHHHHhhhh--cchhh--------- Q lcl|NC_021072. 5 LFGFSLERAKKVPKG--PSFVQKDSMD-GSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVL--QPECD--------- 70 (533) Q Consensus 5 ~fg~~i~~~~~~~~~--~s~~~~~~~d-g~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~--~pEvd--------- 70 (533) +|.=-++..+..-+. ..-...+..+ -.+.+ -.....-|..+|.+.. +|.+. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~---------------~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~ 65 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVNA---------------NDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGN 65 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCcC---------------CHHHHHHHHHHHHHhcCCcchhhccccccCCC Confidence 222111111111110 0000000000 00000 0013344555555532 22111 Q ss_pred ---------hHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeee Q lcl|NC_021072. 71 ---------SAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHK 141 (533) Q Consensus 71 ---------~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hk 141 (533) +--..||++..-+=. +.|+.+.+++ +. .++.+..++.--+|.+...+.+..-..-|..++|. T Consensus 66 ~~~~~~~s~n~~~~iv~~~a~~l~-~ep~~i~~~d----~~----~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~ 136 (499) T protein:vir:80 66 PVNRRQLSMNLPKVTAKYMSKLLF-NEKVKINIDD----ET----AEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKV 136 (499) T ss_pred ccccceeecchHHHHHHHHHHhhh-CCcceEeeCC----HH----HHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEE Confidence 111222222110001 3466676655 22 23334445555569999999999999999999999 Q ss_pred eecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeE-E-e---cc-ceeec-------------cchhceeccccccc Q lcl|NC_021072. 142 VIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRG-E-D---IN-TQLTQ-------------KAAEYYLYNPKGLK 202 (533) Q Consensus 142 vid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~-~-~---~~-~~~~~-------------~~~e~~~y~p~~~~ 202 (533) .+|.. |=..+..++|.++=++.-- ......+ + + .. +.++. +.-.+.+|...... T Consensus 137 ~~D~~----~~~~i~~v~a~~~~Pi~~d---~~~~~~~~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~ 209 (499) T protein:vir:80 137 YHDGN----KNVKVSFATADCMYPLSND---SENVDECLIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPN 209 (499) T ss_pred EECCC----CcEEEEEEcCCceEEEEec---CCCeEEEEEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCcc Confidence 99852 3356899999998665321 1111111 1 0 00 00000 00011112111100 Q ss_pred cccCCcceeccchh------------------hccccc---cccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_021072. 203 NSTNQGMKIATDSV------------------TYCHSG---IQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAP 261 (533) Q Consensus 203 ~~~~~~~kI~~dai------------------~y~hsG---l~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAP 261 (533) .. +..++...+ .|-.-- -.++.++.++|-|+.|...+..|-..-+.++ |..+.= T Consensus 210 -~l--G~~v~l~~~~~~~~~~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~--~e~~~~ 284 (499) T protein:vir:80 210 -EL--GGKVSLKLLFNDIEPVVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYY--QEFKLG 284 (499) T ss_pred -cc--CcccchhhhccCcCCceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHH--HHHHhc Confidence 01 111111111 111000 0134456678999999998888887777654 778887 Q ss_pred cceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCcc----c-cccccchhHhhhcccccCCCCccceeecCCCCC Q lcl|NC_021072. 262 ERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEI----K-DDKKFMSMLEDFWLPRREGGRGTEISTLPGGQN 336 (533) Q Consensus 262 eRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev----~-~d~~~msmlEDywLpRReggrgTEIsTLpGg~n 336 (533) .+|+|. +..=++. .-|.. |+. . +++.+..+ +=..++.|--|+++.+.-. T Consensus 285 ~~~i~v-~~~~l~~------------------~~~~~-g~~~~~~~~~~~~~~~~------~~~~~~~~~~i~~~~~~ir 338 (499) T protein:vir:80 285 KKKVLV-PSSFVKT------------------AVNLD-GSTTQYFDSTDEAFFLY------QGEQDDNGKAIKDISVEIR 338 (499) T ss_pred ccceec-chhhhhc------------------cCCCC-CCcccCCCcccceeeEe------eccCCCCcCceeEecCcCC Confidence 777774 2111100 00011 111 1 22222111 0011122223677665443 Q ss_pred cch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHH-------Hhc Q lcl|NC_021072. 337 LGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQL-------ILK 408 (533) Q Consensus 337 Lge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qL-------ilk 408 (533) -.+ ..-+..+.+.+....|+|-+-|+.+++- ...++||.-....-..-+...++.|..-+.++++.=| .+. T Consensus 339 ~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g-~~TAtei~s~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~ 417 (499) T protein:vir:80 339 STEFIESINAMLRIYAMQVGLSAGTFTFDENG-LKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKLIKAYD 417 (499) T ss_pred hHHHHHHHHHHHHHHHHhcCCChhhcCCCccc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 332 3567788889999999998888765432 1235666544333222345555555555555444422 233 Q ss_pred cCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhh Q lcl|NC_021072. 409 GVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSER 488 (533) Q Consensus 409 gi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~ 488 (533) |. .|+ ...+.++|... .... .+. .++.+.++-- .| .+|.++++.+....||+|.+++.++|++|. T Consensus 418 ~~----~~~--~~~v~v~f~d~-i~~d-~~~-----~~~~~~~~~~-~G-i~S~et~l~~~~~~~d~ea~~el~~i~~E~ 482 (499) T protein:vir:80 418 GD----TVE--LDTITVDFDDS-IAQD-EDT-----TINRYTTAKN-QG-MIPLKIALQRAWNITEAEADEWAEMLAKEK 482 (499) T ss_pred CC----CCC--ccceEEEeCCC-CCCC-HHH-----HHHHHHHHHH-cC-CCCHHHHHhhcCCCChHHHHHHHHHHHHHh Confidence 32 232 24678888533 2222 111 1122222211 13 588899988888999999999999999997 Q ss_pred hcCCCCCCCcccccCCCCCCC Q lcl|NC_021072. 489 EAGLIVDPMAEMDPAMDPGNA 509 (533) Q Consensus 489 ~~~~~~~p~~~~~~~~~~~~~ 509 (533) ... .++|+ .. |..+..+ T Consensus 483 ~~~-~~~~d--~~-g~~ge~e 499 (499) T protein:vir:80 483 QAE-IPNND--MT-GIFGEEE 499 (499) T ss_pred hcC-CCCCC--cc-ccCCCCC Confidence 654 33221 11 1111111 No 99 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=97.40 E-value=7.6e-05 Score=43.15 Aligned_cols=428 Identities=12% Similarity=0.088 Sum_probs=175.2 Q ss_pred CCccccceeeecc--ccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhc Q lcl|NC_021072. 1 MSNQLFGFSLERA--KKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVN 78 (533) Q Consensus 1 ~~~~~fg~~i~~~--~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvn 78 (533) |-..+ |+|... ++.-+..+.. . ... .....+.|+.. .+ +...|-+ -+..+|-|.+||+-|.+ T Consensus 1 ~~~~~--~~i~s~~~~~~i~~~~~~--s-~~~-~~~~~~~~~~p------p~-~~~~la~---l~~~n~~v~scI~~ia~ 64 (542) T protein:vir:41 1 MFNYH--LSIRSLEKYKAIKREEVE--S-QAL-GETRFEEYVEP------KV-NPLVLLS---LLQVNPYHASACSIKAN 64 (542) T ss_pred Ccccc--ccccccccchhhhhcccc--c-ccc-ccccCCccccC------CC-CHHHHHH---HHhhcHHHHHHHHHHHH Confidence 33333 344431 1111111110 0 000 00001112211 11 1222222 22457889999999998 Q ss_pred ceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHh-cchhhhh----HHHHhhhhcCceeeeeeecCCCCCCCeE Q lcl|NC_021072. 79 ETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLL-DFENRSY----EIFRRWYVDGRLFYHKVIDPKNPRGGLT 153 (533) Q Consensus 79 eaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL-~f~~~~~----~~fR~WYvDGri~~hkvid~~~~~~gI~ 153 (533) .+-. .|+.+.-+..+ .+.+.+ +-.-.+. .+++.+++-|.-|++++-|. .+-++ T Consensus 65 ~IA~-----l~~~~~~~~~~--------------~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~---~G~~~ 122 (542) T protein:vir:41 65 DIIR-----TGYILEGDDEG--------------VVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDD---RGDPI 122 (542) T ss_pred HHhh-----Cceeeecccch--------------hhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC---CCcEE Confidence 7653 34444321110 111222 3333333 44555777899999988765 45699 Q ss_pred EEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhcccccccc-CCCCccc Q lcl|NC_021072. 154 ELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQD-LNKNMTL 232 (533) Q Consensus 154 elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d-~~~~~i~ 232 (533) +|++|||.+++..+.. +...........+ ....|.|..............++.+-+.+.. ..+ .++-.++ T Consensus 123 ~L~~l~~~~v~v~~d~-----~~~~~~~~~~~~~--~~~~y~~~~~~~~~~g~~~~~~~~~eIiHir--~~~~~~~~~Gl 193 (542) T protein:vir:41 123 RFEYIPSHTIRVHKDG-----SRYRQTWDGVNIT--HFKDYRYEGEINPETGEDQDSVGANELVFIH--IPSPVCSYYGV 193 (542) T ss_pred EEEEEcCcceEEEEcC-----CeeEeeecCCcce--eEEeecccccccccccccccccCcccEEEec--CCCCCCCcccc Confidence 9999999999754321 1111111100000 0000111110000011111222222221111 011 2233556 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCC---------CCchHHHHHHHHHHHHhcccEEEeeCCCCccc Q lcl|NC_021072. 233 SHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVG---------NLPKNKAEQYLREVMGRYRNKLVYDANTGEIK 303 (533) Q Consensus 233 syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvG---------nlpk~KAeqYl~~im~~~rnk~vYd~~TGev~ 303 (533) |-+..|+..+.....++....=+=--.+--+-|.++..+ .+-+...+..-+.+...|+.-. .+.|. T Consensus 194 spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~---~n~gk-- 268 (542) T protein:vir:41 194 PRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLK---EAPHT-- 268 (542) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhh---cccCc-- Confidence 899999998888777776654443333555667777543 2222333333333333332100 01111 Q ss_pred cccccchhHhhhcccccCC-CCccceeecCCCCCcchHH---HHHHHHHHHHHhcCCCccccCCCC--cccccchhhhhH Q lcl|NC_021072. 304 DDKKFMSMLEDFWLPRREG-GRGTEISTLPGGQNLGELE---DVKYFQKKLYKALNVPSSRLETET--TFNIGRAAEITR 377 (533) Q Consensus 304 ~d~~~msmlEDywLpRReg-grgTEIsTLpGg~nLgei~---DV~YF~~kLy~aL~VP~sRl~~~~--~~~~g~~~eItR 377 (533) .+ .++ .-.| ..|.+++.| +.+..++. -..+..+.+.++++||...|+... +++.....+..+ T Consensus 269 ----~~-vL~-----~~~~~~~g~~~~pl--~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~~~ 336 (542) T protein:vir:41 269 ----PL-VFS-----IPGGDTVKVTFTPL--NTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVTRR 336 (542) T ss_pred ----ee-Eee-----ccCCcccceeEEEc--CCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHHHH Confidence 11 111 1111 134455544 33333332 335567889999999999997543 343333344333 Q ss_pred HhhhHHH-HHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_021072. 378 DEVKFQK-FIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYV 456 (533) Q Consensus 378 DElkF~K-fi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~v 456 (533) . |.+ -|.-+++++...+...|-+. .+ ..+.+.|..+..... + +...+..+ .- T Consensus 337 ~---f~~~tL~P~~~~ie~~ln~~L~~~---------~~-----~~~~~~f~~~~ll~~----d----~~~~~~~~--v~ 389 (542) T protein:vir:41 337 T---YYESVVRPQQNIISSILTDFFQVK---------FN-----PKTRFKFNDETLLES----D----SVRNCALL--VQ 389 (542) T ss_pred H---HHHHHHHHHHHHHHHHHHhhcccc---------cC-----CceEEEecchhhcch----H----HHHHHHHH--Hh Confidence 3 543 34667777766666443322 22 234667764433322 1 11112211 12 Q ss_pred cccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCC-------------cccccCCC-CCCCCCCCC---c---- Q lcl|NC_021072. 457 GKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPM-------------AEMDPAMD-PGNAPPADD---M---- 515 (533) Q Consensus 457 Gky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~-------------~~~~~~~~-~~~~~~~~d---~---- 515 (533) +-++|.+-++.++.++..-+ ..+-.|. .+.++.-+ .....+.+| . T Consensus 390 ~GilT~NE~Re~L~g~~pgd--------------d~~l~p~~~~~~~~~~~~~n~~~~~~~~~~k~~~k~~~~~~~~~~~ 455 (542) T protein:vir:41 390 SGVLTPAEARERLFGLDGGP--------------DIFMVPSKGAAKSVKRQERNYEKNQIREIRKIYAKYRPRFNEIISS 455 (542) T ss_pred CCCCCHHHHHHhhCCCCCCC--------------ccccccccccccccccCCcCCCCCchhhhhhcccccCccccccccc Confidence 24566666654332332110 0000010 00000000 000001111 0 Q ss_pred cccccccCCccccchhcC Q lcl|NC_021072. 516 SAQEGPAVDAGDAKRGEF 533 (533) Q Consensus 516 ~~~~~~~~~~~~~~~~~~ 533 (533) ......+-...+.+++|| T Consensus 456 ~~~~~~~~~~~~~~~~~~ 473 (542) T protein:vir:41 456 KLSAEEKKKKIDESLAEF 473 (542) T ss_pred cccchhhcccccchhhhh Confidence 001111122345555666 No 100 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=97.39 E-value=7.7e-05 Score=43.13 Aligned_cols=404 Identities=14% Similarity=0.122 Sum_probs=180.9 Q ss_pred CCccccceeeeccccccccCCCCCCCC-CcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcc Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDS-MDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNE 79 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~-~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvne 79 (533) |.. |- .+.+.+. . .....+.. .++ .......+.|..+.- .....+|-|.+||+-|.+. T Consensus 1 Mg~--f~-~lf~r~~--~-~~~~~~~~~~~~-~~~~~~~~~g~~v~~--------------~~al~~~~v~~~i~~Ia~~ 59 (414) T protein:vir:44 1 MVF--FS-GLFQRKS--D-APVTTPAELADA-IGLSYDTYTGKQISS--------------QRAMRLTAVFSCVRVLAES 59 (414) T ss_pred Cch--hh-hhhccCc--c-CcccchhhHhHh-hccCccccCCceech--------------hhhhccHHHHHHHHHHHHH Confidence 432 21 1222111 1 11111111 110 000001111111110 1224688899999999887 Q ss_pred eeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcc----hhhh----hHHHHhhhhcCceeeeeeecCCCCCCC Q lcl|NC_021072. 80 TICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDF----ENRS----YEIFRRWYVDGRLFYHKVIDPKNPRGG 151 (533) Q Consensus 80 aiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f----~~~~----~~~fR~WYvDGri~~hkvid~~~~~~g 151 (533) +-- .|+.|--.. +..+ +.. .-..++++|+- ...+ +.++..+++.|.-|..++-+ .+. T Consensus 60 ia~-----~p~~~~~~~-~~~~---~~~--~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~----~g~ 124 (414) T protein:vir:44 60 VGM-----LPCNLYHLN-GSLK---QRA--TGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA----FGE 124 (414) T ss_pred hcc-----CceEEEEec-CCce---eec--ccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC----CCc Confidence 542 333332111 1111 100 11234444432 2233 34455577899998875432 477 Q ss_pred eEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCcc Q lcl|NC_021072. 152 LTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMT 231 (533) Q Consensus 152 I~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i 231 (533) +.+|.+|+|.++...... + +..+ |.+..+.| ....++.+.+.+... + ..++-.+ T Consensus 125 ~~~L~~l~~~~v~~~~~~---~--~~~~-------------y~~~~~~g------~~~~~~~~evih~~~-~-~~d~~~G 178 (414) T protein:vir:44 125 VAELLPVDPGCVVPKLNS---S--WEPV-------------YQVTFPDG------STDVLSQEDIWHVRT-L-TLDGLVG 178 (414) T ss_pred EEEEEEEcCceEEEEECC---C--CcEE-------------EEEEecCc------eEEEEccccEEEecC-C-CCCCccc Confidence 999999999998643221 1 1101 11111111 111233332222221 1 1233356 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchh Q lcl|NC_021072. 232 LSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSM 311 (533) Q Consensus 232 ~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msm 311 (533) +|-+..|..++.....+++...-+----+--+-++.++ ++|.+..+++..+.+...|+.- ...|. .+ . T Consensus 179 ~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g~----~n~~~------~~-v 246 (414) T protein:vir:44 179 LNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTE-QTLSDQAYERLKKDFEERHTGL----GNAHR------PM-I 246 (414) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC-CCCCHHHHHHHHHHHHHHhcCc----cccCc------ce-e Confidence 78899999988888888877665554455557788887 4677777666666655555420 01121 11 1 Q ss_pred HhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHH Q lcl|NC_021072. 312 LEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLR 390 (533) Q Consensus 312 lEDywLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr 390 (533) + ..|++++.|.-. ..+.-++-.++....+.++++||.+.|+..+.-+....++..+. |.+++ |+ T Consensus 247 l----------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~~~---~~~~~--l~ 311 (414) T protein:vir:44 247 L----------EMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLG---FINYS--LV 311 (414) T ss_pred c----------CCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHH--HH Confidence 1 124556665321 22223445667788899999999999976544444444444333 55432 32 Q ss_pred HHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHh Q lcl|NC_021072. 391 KRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVL 470 (533) Q Consensus 391 ~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL 470 (533) - +...+.+.|-.. ++++.++. .+.+.|..+ ++...+ +..|.+.++.+-. +-+++.+-++. ++ T Consensus 312 P-~~~~ie~~ln~~-----L~~~~~~~----~~~i~fd~~----~ll~~d-~~~~~~~~~~~~~--~G~~t~NE~R~-~~ 373 (414) T protein:vir:44 312 P-YLTRIEQRINTG-----LVRKSKQG----VFYAKFNAG----ALLRGD-MKSRFEAYATGIN--WGIYSPNDCRD-LE 373 (414) T ss_pred H-HHHHHHHHHHhh-----cCCccccC----ceEEEEech----hhhccC-HHHHHHHHHHHHh--CCCcCHHHHHH-Hh Confidence 2 222333444333 34555553 345566533 333222 2345555554432 25677777774 46 Q ss_pred CCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccccccC Q lcl|NC_021072. 471 KQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPAV 523 (533) Q Consensus 471 ~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~ 523 (533) ++.+-+ -.+..+..-+....|. .....++..|.+.++.+.. T Consensus 374 gl~p~~-----------ggD~~~~~~n~~~~~~-~~~~~~~~~~~~~~d~~~~ 414 (414) T protein:vir:44 374 DMNPRP-----------GGDVYLTPMNMTTKPS-DGSKAGKQKDNANADETTS 414 (414) T ss_pred CCCCCC-----------CcceecccccccccCC-ccccCCCCCCCCCCCCCCC Confidence 664311 0111111000000010 1111111112122222222 No 101 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=97.38 E-value=8e-05 Score=43.05 Aligned_cols=424 Identities=14% Similarity=0.043 Sum_probs=199.4 Q ss_pred CCccccceeeecccccc---ccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhh Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVP---KGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIV 77 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~---~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIv 77 (533) |...|.|-.=+..+... .-.+.+..... + ..+- .+..+....+..++..-..++.|++|..++.|.++++-.. T Consensus 1 m~~~i~~~~g~p~~~~~~~~~~~~~ia~~~~-~-~~~~--~~~~~~~~~~~iLr~~~~~~~~y~~m~~D~~i~s~l~~Rk 76 (491) T protein:vir:10 1 MSKGLWVSPTEFVTFGEPDKSLSSQIATRAR-S-IDFF--ALGMYLPNPDPVLKALGKDIRVYRELRADAHVGGCVRRRK 76 (491) T ss_pred CCCceeCCCCCccCcccCChHHHHHHHhhhc-c-cccc--cccCCccchHHHHHhcCCCHHHHHHHhhChHHHHHHHHHH Confidence 99988875544322111 00111111111 1 1111 1222334445555543334789999999999999999997 Q ss_pred cceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEE Q lcl|NC_021072. 78 NETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRY 157 (533) Q Consensus 78 neaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~ 157 (533) ..+.. .+..|.- -+.++.+.+.|++-| +.++|++-..+++-- ..-|--++++|....+..-.+.++.. T Consensus 77 ~av~~-----~~w~i~~--~~~~~~~~e~v~e~l----~~~~~~~~l~~~lda-~~~G~s~~Ei~w~~~~g~~~~~~l~~ 144 (491) T protein:vir:10 77 AAVKA-----LEWGLDR--GKAKSRVAKSIADVF----ADLDLSRIVTEMLDA-VLYGYQPMEITWGKVGNYIVPIDVVG 144 (491) T ss_pred HHHhC-----CCcEEec--CCCCHHHHHHHHHHH----hcCCHHHHHHHHHHh-hhhcceeEEEEEeecCCeeEEEEeee Confidence 76653 2333332 122344445555443 445777666666643 34688888998876544455668888 Q ss_pred cChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccc-hhhccccccccCCCCccchhHH Q lcl|NC_021072. 158 IDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATD-SVTYCHSGIQDLNKNMTLSHLH 236 (533) Q Consensus 158 lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~d-ai~y~hsGl~d~~~~~i~syL~ 236 (533) ++|+.+++-++ .+..+... .. ...+.-++.. .++++|.. ...+....|-|+ T Consensus 145 r~~~~f~~d~~------~~l~~~~~------~~--------------~~~g~~l~~~k~i~~~~~~--~~~~p~g~gLl~ 196 (491) T protein:vir:10 145 KPADWFVYDPE------NQLRFRSK------DH--------------WMQGEELPARKFLVPRQEA--TYLNPYGFPDLS 196 (491) T ss_pred ecccceeeccC------CceEEecC------CC--------------CCCcceecCCCEEEEEecC--CCCCcccchhHH Confidence 88887754111 11111100 00 0112223222 33344322 122355668899 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-hcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhh Q lcl|NC_021072. 237 KAIKAVNQLRMIEDSLVIYRL-SRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDF 315 (533) Q Consensus 237 ~AiK~~NqLrm~EDalVIyRi-~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDy 315 (533) .|..+|--.+...-..+.+=- .=.|-|-.- .|.|.-.+.|. ..++.+ ....+.-+ | . T Consensus 197 ~~~w~~~fK~~~~~~w~~f~E~yG~P~~igk-y~~~a~~~ek~-~l~~al-~~~~~~a~-----~-------v------- 254 (491) T protein:vir:10 197 MCFWPTTFKKGGLKFWVQFTEKYGSPMLVGK-HPRSASDGEKN-LLLDCL-EDMVQDAV-----A-------V------- 254 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCeEEEe-cCCCCCHHHHH-HHHHHH-HHHhcCcE-----E-------E------- Confidence 999998888866666555433 344555444 47665444432 233332 22222111 1 1 Q ss_pred cccccCCCCccceeecCCCCCcchH----HHHHHHHHHHHHhcCCCccccCCCC--cccccchhhhhHHhhhHHHHHHHH Q lcl|NC_021072. 316 WLPRREGGRGTEISTLPGGQNLGEL----EDVKYFQKKLYKALNVPSSRLETET--TFNIGRAAEITRDEVKFQKFIARL 389 (533) Q Consensus 316 wLpRReggrgTEIsTLpGg~nLgei----~DV~YF~~kLy~aL~VP~sRl~~~~--~~~~g~~~eItRDElkF~Kfi~rL 389 (533) +| .|++|+.+.-+.+-|.. .=++|..++.-+++-= .=|.+++ +.++| ++-.+ -+...++.. T Consensus 255 -iP-----~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLG--qtlTt~~~gs~a~~---~vh~~--v~~di~~~D 321 (491) T protein:vir:10 255 -VP-----DDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLG--QNQTTEATSTRASA---QAGLE--VTDDIRDGD 321 (491) T ss_pred -ec-----CCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhh--hhcccCcccchhHH---HHHHH--HHHHHHHHH Confidence 12 46889888543333332 3388888887776431 1122332 33332 22111 245566666 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHH Q lcl|NC_021072. 390 RKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQV 469 (533) Q Consensus 390 r~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~I 469 (533) ++..+..+..+++--+.+++. .....+|.|... . |..+.+.+.+..+.+. |==++.+|++++ T Consensus 322 ~~~i~~tln~li~~l~~~N~~--------~~~~p~f~~~~~------~--e~~~~~a~~~~~L~~~-G~~i~~~~i~e~- 383 (491) T protein:vir:10 322 KAVVSEAMNMLIRWICDLNFD--------GADRPVFDMWEQ------E--QVDEIQAGRDQKLTQA-GARFTPAYFKRA- 383 (491) T ss_pred HHHHHHHHHHHHHHHHHhcCC--------CCCcceEEecCc------C--chhHHHHHHHHHHHhC-CCcCCHHHHHHH- Confidence 777777777755554555543 123345555422 1 2334445555555544 434899999866 Q ss_pred hCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccC--CCCCCC--CCCCCcccccccc------C-----------Ccccc Q lcl|NC_021072. 470 LKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPA--MDPGNA--PPADDMSAQEGPA------V-----------DAGDA 528 (533) Q Consensus 470 L~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~--~~~~~~--~~~~d~~~~~~~~------~-----------~~~~~ 528 (533) +++...+.++. ..+.+.....++ +..... .++.|....+..+ + .++.+ T Consensus 384 ~Gip~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~s 452 (491) T protein:vir:10 384 YNLQDGDLDER-----------PLPVSAVDTVGAASFAEFEAPDQDALDAALNTLSARDLNADAQALVAPLLKRIANGAS 452 (491) T ss_pred hCCCCCCcCcc-----------ccccCCCCCcccccccccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 77765443321 111111111000 000000 0000110000000 0 00000 Q ss_pred chhcC Q lcl|NC_021072. 529 KRGEF 533 (533) Q Consensus 529 ~~~~~ 533 (533) . +|+ T Consensus 453 ~-~e~ 456 (491) T protein:vir:10 453 A-DEL 456 (491) T ss_pred H-HHH Confidence 0 000 No 102 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=97.37 E-value=8.4e-05 Score=42.93 Aligned_cols=401 Identities=10% Similarity=0.067 Sum_probs=183.0 Q ss_pred cccchhhhhhHHHHHHHHHHhhhhcchh-----------------hhHHHHhhcceeeecCCCceEEEEeccCCCcHHHH Q lcl|NC_021072. 42 YSVDFDGTVRNEYELITRYREMVLQPEC-----------------DSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIK 104 (533) Q Consensus 42 ~~~~~~~~~~~~~~LI~~YR~m~~~pEv-----------------d~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik 104 (533) .-.++-..-+.+.+.+.+|=.= .|+.+ -+-...||+-.. .=.-+.|+.+...+.+.++.+ T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g-~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~-~~l~g~~~~~~~~~~~~~~~~- 77 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQG-DNFSILSGHRRLDDEKADYRVRHKWGGYISSFAT-GYVIGNPVSIGVMEGGSADQL- 77 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhcc-CCcccccccccccccCCcceeecchHHHHHHhhh-hheeccCceEeeCCCccHHHH- Confidence 1111111111222222222110 01110 011112222211 001356667766554443332 Q ss_pred HHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccc Q lcl|NC_021072. 105 KLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINT 184 (533) Q Consensus 105 ~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~ 184 (533) +.+..++..-+|+....+..+.+++-|+-|.+.-+|. +|-..+..++|+.+-++..- ..++.-...... T Consensus 78 ----~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~----~~~~~i~~~~p~~~~~~~d~---~~~~~~~~~i~~ 146 (440) T protein:vir:95 78 ----STIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDK----DKVDRVVLISPLEMFVIRDL---TVEQNIIAAVHL 146 (440) T ss_pred ----HHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecC----CCceEEEEEcccceEEEEcC---CCCCceEEEEEE Confidence 2355666666899999999999999999999977763 35567888999999775432 222111111110 Q ss_pred eeeccchhceecccccccc----------------ccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHH-H Q lcl|NC_021072. 185 QLTQKAAEYYLYNPKGLKN----------------STNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLR-M 247 (533) Q Consensus 185 ~~~~~~~e~~~y~p~~~~~----------------~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLr-m 247 (533) ..........+|.+..... .+|.--+||.=.+ +|+....|=++..+.....+. + T Consensus 147 ~~~~~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~---------~n~~~g~sd~e~v~~lida~~~~ 217 (440) T protein:vir:95 147 PIYADKVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVEW---------WNNRFRMGDYESEISLIDAYDAG 217 (440) T ss_pred EEecCceEEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEEEe---------eCCCCCCCchhhhHHHHHHHHHH Confidence 0011111112344332210 0111111221111 223334455555555444443 3 Q ss_pred HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhccccc----CCC Q lcl|NC_021072. 248 IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRR----EGG 323 (533) Q Consensus 248 ~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRR----egg 323 (533) +-+.+..-+-.+.|-+-+.-.+.+.-.... ..+.+++.+. .|++.. .++ T Consensus 218 ~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e--------------------~~~~~~~~~~-------~~~~~~~~~~~~~ 270 (440) T protein:vir:95 218 QSDTANYMSDLNDAMLLVKGDLDGIKLSPE--------------------DAAKMKDANM-------LFLKTGISTTGQQ 270 (440) T ss_pred HHHHHHHHHHhhcceeeeecccccCCCCcc--------------------chhhhhhccc-------eecccccccccCC Confidence 344445556666666555432111100000 0011111111 232222 223 Q ss_pred CccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 324 RGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLK 402 (533) Q Consensus 324 rgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk 402 (533) .+..++.|-...+++.. .-+.-+.+.+|...++|-=-++.-++ |. .+..|.--+.....-+.+.|..|..-+.++++ T Consensus 271 ~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~ 348 (440) T protein:vir:95 271 TTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPNLDDDRFNS-TS-SGIALLYKMIGLEQVRKDKETYFTKALRRRYE 348 (440) T ss_pred CCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccc-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 34457777665665544 44777888899999998421111111 11 12223333333444577778888888888887 Q ss_pred HHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHH Q lcl|NC_021072. 403 TQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDK 482 (533) Q Consensus 403 ~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~k 482 (533) .=+-+-+++...+|+. ..+.+.|..--.-.+. +.++++..+ +| .+|.++++.. |...|.+ ++.+ T Consensus 349 li~~~~~~~~~~~~~~--~~v~i~f~~~~p~~~~-------~~ad~~~kl---~g-~iS~et~~~~-l~~~d~~--~E~~ 412 (440) T protein:vir:95 349 LISNIHKAINGPVIEA--NKLTFTFHPNIPQDVW-------TEIKAYIEA---GG-EISQETLMEN-ASFTDYK--TEHS 412 (440) T ss_pred HHHHHHhhcCCccccc--ccceEEeCCCCCCCHH-------HHHHHHHHH---hc-cCcHHHHHHh-CCCCCcH--HHHH Confidence 6554545555555553 3567777654443342 344445555 34 4999999987 5665543 3344 Q ss_pred HHHHhhhcCCCCCCCcccccCCCCCCCCCC Q lcl|NC_021072. 483 QIDSEREAGLIVDPMAEMDPAMDPGNAPPA 512 (533) Q Consensus 483 qi~~E~~~~~~~~p~~~~~~~~~~~~~~~~ 512 (533) +|++|......+ ..+..+....+++-++ T Consensus 413 ri~~E~~~~~~~--~~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 413 RILKQGGSSDLE--IGQIVGDADVGQADTE 440 (440) T ss_pred HHHHHHHHhhhh--HHhhccCCCCCCcCCC Confidence 555554432111 1111111111111111 No 103 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=97.35 E-value=8.7e-05 Score=42.84 Aligned_cols=378 Identities=11% Similarity=0.076 Sum_probs=158.7 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |+ ||.+. .... ..++...+........++.+...+.. .+. -+...++|-|.+||+-|.+.+ T Consensus 1 Mg--lf~~~----~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~--------~~~al~~~~V~~~i~~Ia~~i 61 (384) T protein:vir:49 1 MP--IFNIT----NLAT----ESPPSNQDSFFDITDPEFLDALNGSE-WVS--------AETALKNSDLFSIISQLSNDL 61 (384) T ss_pred Cc--ccccc----ccCc----ccccccchhhccccchhhcccccCCc-eec--------hhhhhccHHHHHHHHHHHHHH Confidence 43 34321 1111 11111111111122222222111100 011 022356889999999998875 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhH----HHHhhhhcCceeeeeeecCCCCCCCeEEEE Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYE----IFRRWYVDGRLFYHKVIDPKNPRGGLTELR 156 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~----~fR~WYvDGri~~hkvid~~~~~~gI~elr 156 (533) -- .|+.+. .. ....|+..-|=...+.+ ++..+++.|.-|.-++-|. .+-+++|. T Consensus 62 a~-----l~~~~~--~~------------~~~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~---~g~~~~L~ 119 (384) T protein:vir:49 62 AT-----AKITTS--RK------------QLQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNE---NGRDMKWE 119 (384) T ss_pred hh-----Cceeee--cc------------hhhhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECC---CCcEEEEE Confidence 43 333332 11 11223333333334444 4455778899999877764 34599999 Q ss_pred EcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCC-CccchhH Q lcl|NC_021072. 157 YIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNK-NMTLSHL 235 (533) Q Consensus 157 ~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~-~~i~syL 235 (533) +|+|..++.++.. +.+...+....+. ....... .....+.++|.... .++ -.++|.| T Consensus 120 ~l~~~~v~v~~~~---~~~~~~y~~~~~~--~~~~~~~-------~~~~~eVih~~~~~----------~~~~~~G~s~i 177 (384) T protein:vir:49 120 YLRPSQVSFNRLD---NQNGLYYNITFDD--PRIPPKQ-------HVPQGDILHFRLLS----------VDGGLTSVSPL 177 (384) T ss_pred EEcCceeEEEEcC---CCceEEEEEEecC--cccccee-------EecCccEEEecCCC----------CCCceeeccHH Confidence 9999999753321 1111111000000 0000000 11122233333221 111 1356899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhh Q lcl|NC_021072. 236 HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDF 315 (533) Q Consensus 236 ~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDy 315 (533) ..|+..++....++....=+--.-+--+-+..++.+..+..++.+ ...++... ...|.+ T Consensus 178 ~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~----~~~~~~~~----~n~~~~------------- 236 (384) T protein:vir:49 178 MALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKTKQ----SRSRQAMK----QMQGGP------------- 236 (384) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHH----HHHHHhcc----cCCccc------------- Confidence 999999998888888776544444566677777766655554433 23333211 112221 Q ss_pred cccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHH Q lcl|NC_021072. 316 WLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFS 394 (533) Q Consensus 316 wLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs 394 (533) +.=.+ |.++..|.-. ..+.-++-.++..+.+.++++||.+.|+..++-.. ..+.+ |-.+..|+.-.-.-+. T Consensus 237 -~vl~~---g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~-~~~~~---~~~~~~~i~~~l~pi~ 308 (384) T protein:vir:49 237 -LVLDD---LEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQS-SLEMI---YNIYFKAVSRFLRPFV 308 (384) T ss_pred -eecCC---CceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccc-cHHHH---HHHHHHHHHHHHHHHH Confidence 11112 4566666422 22222455678889999999999999975432111 11121 1123334333222222 Q ss_pred HHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCH Q lcl|NC_021072. 395 ELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTD 474 (533) Q Consensus 395 ~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tD 474 (533) ..+...|-..|.+. +....+. +...+.|. +.++-.+.+ ..|.++...+-.. | +.+.+ + ++.+.+.. T Consensus 309 ~~i~~~l~~~l~~~-~~~~~~~----~~~~~~~~----~~~l~~~~~-~t~~e~~~~l~~~-g-~~~ne-~-r~~~~~~p 374 (384) T protein:vir:49 309 SELSKKLSCEVDAD-ILPAVDP----TGSNYIGL----INSMVKTGT-LAQNQGLYVLQQA-E-ILPKD-L-PEGETDST 374 (384) T ss_pred HHHHHHhchhhhhh-hhhhhhc----cchHHHHH----HHHHhhcCc-ccHHHHHHHHhhC-C-CCChh-H-HHHcCCCC Confidence 22222222221110 0000000 00111111 222222222 2233333332211 1 23322 2 22333322 Q ss_pred HHHHHHHHHHHHhhhcCCC Q lcl|NC_021072. 475 QEIKEIDKQIDSEREAGLI 493 (533) Q Consensus 475 eeI~e~~kqi~~E~~~~~~ 493 (533) - +..-.+..| T Consensus 375 ~---------~gGd~~~~~ 384 (384) T protein:vir:49 375 L---------KGGETNEQY 384 (384) T ss_pred C---------CCCCCCCCC Confidence 1 111122223 No 104 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=97.31 E-value=9.9e-05 Score=42.53 Aligned_cols=443 Identities=16% Similarity=0.216 Sum_probs=174.0 Q ss_pred CCccccce------eeeccccccccCCCCCCCC---Ccccceeeccccc-------ccccchhhhhhHHHHH-------- Q lcl|NC_021072. 1 MSNQLFGF------SLERAKKVPKGPSFVQKDS---MDGSQPIVGGGYY-------GYSVDFDGTVRNEYEL-------- 56 (533) Q Consensus 1 ~~~~~fg~------~i~~~~~~~~~~s~~~~~~---~dg~~~~~~~~~~-------~~~~~~~~~~~~~~~L-------- 56 (533) -+.+-.-. +|+. .+...+++|.- ..-...+..++|. .|..++.+.....-.+ T Consensus 36 ~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~G 111 (694) T protein:vir:10 36 AAAQPVPADFARRGALNA----LDAAPVAEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPG 111 (694) T ss_pred cCCCcccCCccccccchh----hcccccCCCCcchhhhhhccccccCCCccccchhhhhhccCcccccchhhhhccCcch Confidence 11110000 1111 11222222211 0001122222221 2222222222211111 Q ss_pred HHHHHhhhhcchhhhHHHHhhcceeee------cCCCc----eEEEEeccCC--CcHHHHHHHHHHHHHHHHHhcchhhh Q lcl|NC_021072. 57 ITRYREMVLQPECDSAVDDIVNETICG------NFDDV----PVEVELSNLK--QSDKIKKLIREEFAEILRLLDFENRS 124 (533) Q Consensus 57 I~~YR~m~~~pEvd~AvdeIvneaiv~------d~~~~----~v~v~l~~~~--~S~~ik~~I~eeF~~i~~lL~f~~~~ 124 (533) +..--.|+|+||+++++.=|+.||+-. ..... -+.+.-+..+ .++.| ++|..||+. |++..+. T Consensus 112 y~~la~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi-~~L~~e~er----l~V~~~l 186 (694) T protein:vir:10 112 FPTLVLLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQL-KQINDEIER----LRIRDAV 186 (694) T ss_pred HHHHHHHhhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHH-HHHHHHHHH----HHHHHHH Confidence 233456899999999999999999532 11111 0222211222 22344 556666663 3333333 Q ss_pred hHHHHhhhhcCceeeeeeecC--------------CCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccc Q lcl|NC_021072. 125 YEIFRRWYVDGRLFYHKVIDP--------------KNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKA 190 (533) Q Consensus 125 ~~~fR~WYvDGri~~hkvid~--------------~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~ 190 (533) .+.++-=-+-|.-.....|+. +-.|+.++.|+.|||..+.+-- .. +...... T Consensus 187 ~eaik~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~-~n--------~~dP~sp----- 252 (694) T protein:vir:10 187 RTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNN-YN--------SINPVAD----- 252 (694) T ss_pred HHHHHhhccccceEEEEEeecCccccccccccccccccCcceeeeEeecccccccch-hh--------hccchhh----- Confidence 333322112222221222222 1236678889999998885511 00 0011111 Q ss_pred hhceeccccccccccCCcceeccchhhccc-----cccccCCCCccchhHHHHHHHHHH-HHHHHHHHHHHHHhcCccce Q lcl|NC_021072. 191 AEYYLYNPKGLKNSTNQGMKIATDSVTYCH-----SGIQDLNKNMTLSHLHKAIKAVNQ-LRMIEDSLVIYRLSRAPERR 264 (533) Q Consensus 191 ~e~~~y~p~~~~~~~~~~~kI~~dai~y~h-----sGl~d~~~~~i~syL~~AiK~~Nq-Lrm~EDalVIyRi~RAPeRr 264 (533) ..|.|....-. +.+|+.+-+.--. --|...-...++|.+..+..-..+ +++...+.=+ ++.+.-+ T Consensus 253 ---dfgkP~~y~V~---G~~IH~SRL~~f~g~plPd~LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~v~~L--i~~~~v~- 323 (694) T protein:vir:10 253 ---DFYKPSTWWMI---GTEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDI--VKQFSVS- 323 (694) T ss_pred ---ccCCCceEEEe---ceEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHH--HHhhhhH- Confidence 12333221111 1122211110000 001111122345655555544333 3333332221 1111111 Q ss_pred EEEccCC-CCchHHHHHHH--HHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHH Q lcl|NC_021072. 265 IFYIDVG-NLPKNKAEQYL--REVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELE 341 (533) Q Consensus 265 vfyIDvG-nlpk~KAeqYl--~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~ 341 (533) ++-.|.. -|.....++.. -+++++||+-. |-+--|+ =.|+|- +. ..+|+-++ T Consensus 324 ~lk~dla~~L~~g~~~~l~~R~eli~~~Rsn~------G~~llDk----~~Eefe-------------q~--stslSGLd 378 (694) T protein:vir:10 324 GILMDLAQALMPGANVDLSMRAELINRYRDNR------NILFLDK----ATEEFF-------------QF--NTPLSGLD 378 (694) T ss_pred HHHHHHHHhhcChhHHHHHHHHHHHHHhcCcc------ceEEEec----CCcceE-------------EE--ecccCCHH Confidence 0011211 01111122222 25556665211 1111000 013442 11 24788888 Q ss_pred HHH-HHHHHHHHhcCCCccccCC--CCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHH-----HHHhccCCCH Q lcl|NC_021072. 342 DVK-YFQKKLYKALNVPSSRLET--ETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKT-----QLILKGVMSL 413 (533) Q Consensus 342 DV~-YF~~kLy~aL~VP~sRl~~--~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~-----qLilkgi~t~ 413 (533) ||. =|..-+=-+.+||+.||=. -.|||--.-+ |.-.|...|..+|. ..+...|++ |+-+-| T Consensus 379 dVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~----D~rnYYD~I~s~Qe---~~L~p~L~rl~~ii~rS~~G---- 447 (694) T protein:vir:10 379 ALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEG----EIRVWYDYVRAYQR---NALQQLMNDVIVMIQLSLFG---- 447 (694) T ss_pred HHHHHHHHHHHhhhcCchhhhhccCcccccccchh----hHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHhcC---- Confidence 875 4888888899999999943 3688752222 33348888988885 334444443 333334 Q ss_pred hHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCC Q lcl|NC_021072. 414 EEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLI 493 (533) Q Consensus 414 eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~ 493 (533) .+...|.|.|+.=..-+|..-+||...+.+....+-.- | .++.+-|+ ..+..+...+ | T Consensus 448 ----~idp~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~-g-vI~~~evr---------------~rL~~d~~s~-Y 505 (694) T protein:vir:10 448 ----AVDPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQE-Q-VIRPDQVA---------------ARLNTEPDGP-Y 505 (694) T ss_pred ----CCCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHh-c-CCCHHHHH---------------HHHhcCCCcc-c Confidence 35567999999877788888888888887765543222 1 12222222 2222221111 2 Q ss_pred C-CCCcccccCCCCCC--------------CCCCCC--------------------ccccccccCCccccc-------hh Q lcl|NC_021072. 494 V-DPMAEMDPAMDPGN--------------APPADD--------------------MSAQEGPAVDAGDAK-------RG 531 (533) Q Consensus 494 ~-~p~~~~~~~~~~~~--------------~~~~~d--------------------~~~~~~~~~~~~~~~-------~~ 531 (533) - .-+++.+|+.+..+ .+...+ ++.+++++.+..-.- .+ T Consensus 506 ~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~~~~~~~~ag~~~~~~~~ag~v~~~~g 585 (694) T protein:vir:10 506 AGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGATAPPTVANVNANVNPREAGAQDAAMRAAGAVYVVDG 585 (694) T ss_pred ccccccccCCCcCccchhhhhHhhhcCcccccccCCCCcccccccCCCcccccccccCccccCCCCccceeeEEEEEeCC Confidence 1 11122222221110 000000 111111111110000 00 Q ss_pred cC Q lcl|NC_021072. 532 EF 533 (533) Q Consensus 532 ~~ 533 (533) ++ T Consensus 586 ~v 587 (694) T protein:vir:10 586 KV 587 (694) T ss_pred EE Confidence 00 No 105 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=97.27 E-value=0.00011 Score=42.29 Aligned_cols=436 Identities=12% Similarity=0.107 Sum_probs=171.5 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHH------- Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAV------- 73 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~Av------- 73 (533) .-..+.|+.....+.. .+.....+. .+- ..+-+.+|+.+..+++-+..| T Consensus 11 ~~~~~~~~~~~~~~~~-----------~~~~~~~i~------------~~i-~~~~~~~~~~~~~YY~g~~~i~~~~~~~ 66 (503) T protein:vir:59 11 HTEELNEIIVESAKEI-----------AEPDTTMIQ------------KLI-DEHNPEPLLKGVRYYMCENDIEKKRRTY 66 (503) T ss_pred hHHhHHHhhhhhhhhc-----------cchhHHHHH------------HHH-HhhcHHHHHHHHHHhccccchhhccchh Confidence 1111111111110000 000000000 000 001123344443333322211 Q ss_pred -----------------------HHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHh Q lcl|NC_021072. 74 -----------------------DDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRR 130 (533) Q Consensus 74 -----------------------deIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~ 130 (533) ..||+-..-+ .-+.|+.+..++ +.+.+.+. +| .+ -+|+....+..+. T Consensus 67 ~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~y-l~g~~~~~~~~d----~~~~~~l~-~~---~~-n~~~~~~~~~~~~ 136 (503) T protein:vir:59 67 YDAAGQQLVDDTKTNNRTSHAWHKLFVDQKTQY-LVGEPVTFTSDN----KTLLEYVN-EL---AD-DDFDDILNETVKN 136 (503) T ss_pred cccccccccccccccceeecchHHHHHHHHHhh-hhcCCeeeccCc----HHHHHHHH-HH---Hh-cCHHHHHHHHHHH Confidence 2222211111 124566665544 33333322 12 22 2788999999999 Q ss_pred hhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeE---Eeccceeeccchhceecccccccc-c-c Q lcl|NC_021072. 131 WYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRG---EDINTQLTQKAAEYYLYNPKGLKN-S-T 205 (533) Q Consensus 131 WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~---~~~~~~~~~~~~e~~~y~p~~~~~-~-~ 205 (533) .++-|+.|++.-+|. +|-..+..+||+.+-++..-... .....+ +.....-.....-..+|.+..... . . T Consensus 137 ~~~~G~~~~~v~~d~----dg~~~i~~~~p~~~~~i~d~~~~-~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~ 211 (503) T protein:vir:59 137 MSNKGIEYWHPFVDE----EGEFDYVIFPAEEMIVVYKDNTR-RDILFALRYYSYKGIMGEETQKAELYTDTHVYYYEKI 211 (503) T ss_pred HhhCCeEEEEEeecC----CCceEEEEEccceeEEEEeCCCC-CceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEEc Confidence 999999999977763 46678999999998775443211 111111 111000000001111333322110 0 0 Q ss_pred C---------------Ccceeccchhhcccccccc-CCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEc Q lcl|NC_021072. 206 N---------------QGMKIATDSVTYCHSGIQD-LNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYI 268 (533) Q Consensus 206 ~---------------~~~kI~~dai~y~hsGl~d-~~~~~i~syL~~AiK~~NqLr-m~EDalVIyRi~RAPeRrvfyI 268 (533) . ........+..|..--++. .++....|-|+.++....-+. ++-+.+..-+.++.|-+.+--. T Consensus 212 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~ 291 (503) T protein:vir:59 212 DGVYQMDYSYGENNPRPHMTKGGQAIGWGRVPIIPFKNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNY 291 (503) T ss_pred CCcccccccccccccccceeecceeccCCccceEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecC Confidence 0 0000001111111000111 133445666666666655554 3355555667777775543323 Q ss_pred cCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchH-HHHHHHH Q lcl|NC_021072. 269 DVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGEL-EDVKYFQ 347 (533) Q Consensus 269 DvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei-~DV~YF~ 347 (533) ++-+.+ +.... |..+ +++ .+| ++| .+..|-...+.+.. .-+.-++ T Consensus 292 ~~~~~~-----~~~~~-~~~~--~~~---------------------~~~--~~~---~~~~l~~~~~~~~~~~~~~~l~ 337 (503) T protein:vir:59 292 DGENPK-----EFTAN-LRYH--SVI---------------------KVS--GDG---GVDTLRAEIPVDSAAKELERIQ 337 (503) T ss_pred Cccccc-----hhhhh-hhcc--cce---------------------ecc--CCC---cceeEeccCCHHHHHHHHHHHH Confidence 222211 11111 1111 111 011 111 24444443333332 3446667 Q ss_pred HHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEE Q lcl|NC_021072. 348 KKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDF 427 (533) Q Consensus 348 ~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f 427 (533) +.+|+...+|---.+.-+| +. .+..|.........-+.+.+..|...+.++++.=+-+-++....++.+. ..|.+.| T Consensus 338 ~~i~~~s~~p~~~~~~~~~-~~-Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~-~~i~i~f 414 (503) T protein:vir:59 338 DELYKSAQAVDNSPETIGG-GA-TGPALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPD-KELTMTF 414 (503) T ss_pred HHHHHHhcccCCCcccccc-cc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc-cceeEEe Confidence 7788888888322111111 11 2223333333333446677777777777766653333344333433332 3478888 Q ss_pred eccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCH---HHHHHHHHHHHHhhhcCCCCCCCc-ccccC Q lcl|NC_021072. 428 IADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTD---QEIKEIDKQIDSEREAGLIVDPMA-EMDPA 503 (533) Q Consensus 428 ~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tD---eeI~e~~kqi~~E~~~~~~~~p~~-~~~~~ 503 (533) ...-.-.+. +.++++.++-.- | .+|++++++. |...+ +|++.+ ++|.....-..+.. ...++ T Consensus 415 ~~~~p~d~~-------~~~~~~~kl~~~-G-iiS~et~l~~-l~~v~d~~~E~~ri----~~E~~~~~~~~~~~~~~~~~ 480 (503) T protein:vir:59 415 TRTRIQNDS-------EIVQSLVQGVTG-G-IMSKETAVAR-NPFVQDPEEELARI----EEEMNQYAEMQGNLLDDEGG 480 (503) T ss_pred CCCCCCCHH-------HHHHHHHHHHhC-C-CCchHHHHHh-CCCCCCHHHHHHHH----HHHHHHHHhhhccccCccCC Confidence 644333332 344444444222 3 5899999977 55543 444444 33333221111110 11111 Q ss_pred CCCCCCCCCCCccccccccCCccccc Q lcl|NC_021072. 504 MDPGNAPPADDMSAQEGPAVDAGDAK 529 (533) Q Consensus 504 ~~~~~~~~~~d~~~~~~~~~~~~~~~ 529 (533) .+.++ .+++...+...-.++++- T Consensus 481 ~~~~~---~~~~~~~~~~~~~~g~~~ 503 (503) T protein:vir:59 481 DDDLE---EDDPNAGAAESGGAGQVS 503 (503) T ss_pred CCCCC---cCCCCCCcccCCCCCCcC Confidence 11110 111111111111111111 No 106 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=97.23 E-value=0.00012 Score=42.01 Aligned_cols=419 Identities=16% Similarity=0.142 Sum_probs=182.4 Q ss_pred CC---ccccceeeeccccccc-cCCCCCCCCC-cccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHH Q lcl|NC_021072. 1 MS---NQLFGFSLERAKKVPK-GPSFVQKDSM-DGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDD 75 (533) Q Consensus 1 ~~---~~~fg~~i~~~~~~~~-~~s~~~~~~~-dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~Avde 75 (533) |. .+.||.. +.+ ..++-+|-+. ++... ..+.+......-.+ .....+.+|-|.+||+- T Consensus 1 ~~~~~~~~~~~~------~~~~~~~~g~~~s~~~~~~~---~~~~~~~~~~g~~v--------~~~~al~~~~v~~ci~~ 63 (437) T protein:vir:10 1 MKQGKQRALGRI------KSSFLKWLGVPISLTDGSFW---SAWGGMGSSSGETV--------TADSALQLSAVWSCVRL 63 (437) T ss_pred CCcchhhhhhhh------HHhhhhhcCCcccCCchhHH---HhhcccccCCCcee--------chHhhhccHHHHHHHHH Confidence 21 2222210 000 0011111111 11110 11111111100001 12334678999999999 Q ss_pred hhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHH-HHHHHh----cchhhhhHHHHh----hhhcCceeeeeeecCC Q lcl|NC_021072. 76 IVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFA-EILRLL----DFENRSYEIFRR----WYVDGRLFYHKVIDPK 146 (533) Q Consensus 76 Ivneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~-~i~~lL----~f~~~~~~~fR~----WYvDGri~~hkvid~~ 146 (533) |.+.+-- .|+.+--...+..+.+ + .+ .++.+| |-...++++.+. +.+.|.-|+.++-| T Consensus 64 Ia~~ia~-----lp~~~~~~~~~g~~~~---~---~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~-- 130 (437) T protein:vir:10 64 IAETIAT-----LPLNLYQTKPDGTRVL---A---KQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRS-- 130 (437) T ss_pred HHHHHhh-----CceeEEEEcCCCceee---c---cccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEec-- Confidence 9987532 3433321111111100 0 11 233333 333455555444 56789988875543 Q ss_pred CCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccC Q lcl|NC_021072. 147 NPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDL 226 (533) Q Consensus 147 ~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~ 226 (533) .+.+++|.+|+|..+...+. .++... |.++.+.|. ...++.+-|.+.. + ... T Consensus 131 --~g~~~~L~~l~p~~v~i~~~-----~~g~~~-------------y~~~~~~g~------~~~~~~~dIih~r-~-~~~ 182 (437) T protein:vir:10 131 --AGVLIGLELMLPQRTTVKRL-----TSGALQ-------------YTYRNVDGT------VSTLAEDDVFHVR-G-FSL 182 (437) T ss_pred --CCcEEEEEEEcCcceEEEEC-----CCCeEE-------------EEEEecCce------EEEEccccEEEec-C-cCC Confidence 36799999999999864321 111111 111122211 1123333222221 0 112 Q ss_pred CCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCcccccc Q lcl|NC_021072. 227 NKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDK 306 (533) Q Consensus 227 ~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~ 306 (533) ++-.++|-|..|.+++.....+++...=+----+--+-|..++ +.|.+.++++..+.+-.+|+.- ...|.+ T Consensus 183 d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g~----~nag~~---- 253 (437) T protein:vir:10 183 DGLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTD-QILQKEKRAEIRTDLAEQFGGA----MQAGKT---- 253 (437) T ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhcCc----cccCcc---- Confidence 2335678899999999888888877665555556667777776 6788888777666655554320 011221 Q ss_pred ccchhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCccccc--chhhhhHHhhhHH Q lcl|NC_021072. 307 KFMSMLEDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIG--RAAEITRDEVKFQ 383 (533) Q Consensus 307 ~~msmlEDywLpRReggrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g--~~~eItRDElkF~ 383 (533) + .++ .|++++.|.-...-.+ ++=-++-.+.+.++++||...|+...+-+.. ..++..+. |. T Consensus 254 --~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~---f~ 317 (437) T protein:vir:10 254 --M-VLE----------AGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTLG---FL 317 (437) T ss_pred --e-ecc----------CCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHH---HH Confidence 1 121 2456666632221222 3333466788999999999999754332221 22333333 55 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHH Q lcl|NC_021072. 384 KFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSID 463 (533) Q Consensus 384 Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~ 463 (533) .+ .|+-.+. .+.+.|-+- ++++.++.. ..+.|. +..+.... +..|.+.+..+-.- -++|.+ T Consensus 318 ~~--tl~P~~~-~ie~~l~~k-----ll~~~e~~~----~~~~fd----~~~ll~~d-~~~r~~~~~~~~~~--G~~T~N 378 (437) T protein:vir:10 318 TF--TLRPWLT-RIEQAARRS-----LLRPGERDQ----FYAEFS----VEGLLRAD-SAGRAAFYSTMTQN--GLMTRD 378 (437) T ss_pred HH--HHHHHHH-HHHHHHHhh-----ccCccccCc----eEEEEe----chhhhccC-HHHHHHHHHHHHhC--CCcCHH Confidence 44 2332222 233333333 345555542 234454 33333322 45677776665332 467777 Q ss_pred HHHHHHhCCCHHHHHHHHHHHHHhhhcCCCC-CCCcccccCCCCCCCCCCCCccccccccCCccccch Q lcl|NC_021072. 464 YMRRQVLKQTDQEIKEIDKQIDSEREAGLIV-DPMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDAKR 530 (533) Q Consensus 464 ~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~-~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 530 (533) -++. +|++.+-+ .-+..+- .....++ +-..+..+.....+..+..|+..+ -++++.|+ T Consensus 379 E~R~-~~gl~pi~--gg~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~e~ 437 (437) T protein:vir:10 379 ECRA-KENLPPMG--GNAAVLT--VQSALLPIDKLGEHTTATAAQDALKAWLYQEE----KTRATQER 437 (437) T ss_pred HHHH-HhCCCCCC--CCcceEe--ecCcccchhhccCcCCCcchhccccccCCCCC----CCCccccC Confidence 7774 46664422 0000000 0000010 000000000000000011111111 11122222 No 107 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=97.21 E-value=0.00013 Score=41.93 Aligned_cols=440 Identities=9% Similarity=0.035 Sum_probs=177.8 Q ss_pred cCCCCCCCCCcccceeecccccccccchhhhhh---HHHHHHHHHHhhhh---------cchhhhHHHH--hhcc-eeee Q lcl|NC_021072. 19 GPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVR---NEYELITRYREMVL---------QPECDSAVDD--IVNE-TICG 83 (533) Q Consensus 19 ~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~---~~~~LI~~YR~m~~---------~pEvd~Avde--Ivne-aiv~ 83 (533) -+-+ |..+.+... +... -- .+...... .+.+....|=+-.. .++..+.+.. ++|= ..+. T Consensus 1 ~~~~-p~~~l~~~~-~~~~-~~---~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iV 74 (479) T protein:vir:99 1 MIDL-PDEDLSSEG-LAKY-LE---TKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMV 74 (479) T ss_pred CccC-CcccCChhH-HHHH-HH---HHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHH Confidence 1111 111122110 0000 00 00000111 11112222211110 0111111111 0111 0000 Q ss_pred cC---CCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecC--CCCCCCeEEEEEc Q lcl|NC_021072. 84 NF---DDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDP--KNPRGGLTELRYI 158 (533) Q Consensus 84 d~---~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~--~~~~~gI~elr~l 158 (533) |. .-.+..+.+.+ ++. .+++..|+..=+|+....+.++.-++-|+-|.. |.-. ....+|...++.+ T Consensus 75 d~~~~~l~~~gf~~~d---~~~-----~~~~~~i~~~N~~d~~~~~~~~~a~~~G~af~~-v~~~~~~~d~~g~~~i~~~ 145 (479) T protein:vir:99 75 NSFAQQLIVDGYRKTG---TNE-----NAKGWDTWRLNQMDKQQFWLNRAVLTFGYAFIK-VTSGISPLDGTTVARIKCI 145 (479) T ss_pred HHHHhhcccccccCCC---chh-----hHHHHHHHHhcChhHHHHHHHHHHhhcCceEEE-EecCCCCcCCCCceEEEEe Confidence 00 00111122111 111 223455555556888889999999999997664 3311 1235678889999 Q ss_pred ChhhceehhhccCCCcCce-eEEeccceeeccchhce------ecccc---cc--ccccCCcceeccchhhccccccccC Q lcl|NC_021072. 159 DPRKIRKVTEYQQKRPEQL-RGEDINTQLTQKAAEYY------LYNPK---GL--KNSTNQGMKIATDSVTYCHSGIQDL 226 (533) Q Consensus 159 DP~~i~~vr~~~~~~~~~~-~~~~~~~~~~~~~~e~~------~y~p~---~~--~~~~~~~~kI~~dai~y~hsGl~d~ 226 (533) ||+.+-.+. +...... ..+..... ......+| .|... +. ...+|..-++|. +.|++..-.+ T Consensus 146 ~p~~~~~iy---dd~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPv--v~f~n~~~~~- 218 (479) T protein:vir:99 146 DPRDAFAIW---EDPYWDEWPKYLLERQ-PNGQYWWWTEEDYSIFEFKQGKFIYRETVSHDYGHIPF--VRYVNVMDLR- 218 (479) T ss_pred chhheEEEe---cCCcccceeeEEEeec-CceeEEEEecceEEEEEecCCceeeccccccCCCCcce--EEeecCCCcC- Confidence 999986542 2221111 11111000 00001111 11111 11 112222222332 2233321111 Q ss_pred CCCccchhHHHHHHHHHHH-HHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccc Q lcl|NC_021072. 227 NKNMTLSHLHKAIKAVNQL-RMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDD 305 (533) Q Consensus 227 ~~~~i~syL~~AiK~~NqL-rm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d 305 (533) ..+.|=++..+.....+ +.+-+..++-.....|.|-++-. .++.... .-...+.-+.++ T Consensus 219 --~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~---~~~~~~~--~~~~~~~~~~~~------------- 278 (479) T protein:vir:99 219 --GVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGL---MLPEGAN--ADQEKMRFAQES------------- 278 (479) T ss_pred --cCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCC---Ccccccc--cchhcccccccc------------- Confidence 12334444433333322 34556666667777777765521 2211000 000000000011 Q ss_pred cccchhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH Q lcl|NC_021072. 306 KKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF 385 (533) Q Consensus 306 ~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kf 385 (533) .|.. .+-+.++-++|+..--.-++-++=....+...-++|..-|+..+ | -.+..|.--+...-.- T Consensus 279 ---------i~~~---~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~g~~~--n-~Sg~Al~~~~~~l~~k 343 (479) T protein:vir:99 279 ---------MLIS---QNEKASFGAIPAAPLDGLLNAYKESLLEFLALAQLPPHIAGQIV--N-VAADALAAGTRQTMQK 343 (479) T ss_pred ---------ceee---cCCCceEEEecccchHHHHHHHHHHHHHHhccCCCCHHHccccc--c-hHHHHHHHHHHHHHHH Confidence 1211 12245676777544222222233333344455577766554211 1 0122344444445667 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHH Q lcl|NC_021072. 386 IARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYM 465 (533) Q Consensus 386 i~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i 465 (533) +.+.|+.|..-+.+.|+.-+.+.|.--..++ -.|.+.|..-..=+ +.+..+.+.+|..- | .+|.+++ T Consensus 344 a~~~~~~f~~al~~~~~l~~~~~~~~~~~~~----~~i~~~w~~~~~~s-------~~~~ad~~~kl~~a-g-~is~et~ 410 (479) T protein:vir:99 344 LFEKQATWKASHNQTMRLVNKIEGRTEEATD----LDFTITWQDVTIQS-------LAQFADAWAKMVES-L-KIPAEGV 410 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCccccc----eeeeEEecCCCCCC-------HHHHHHHHHHHHhc-C-CCCHHHH Confidence 8899999999999999988888886332222 13666664221111 12455555555432 3 3999999 Q ss_pred HHHHhCCCHHHHHHHHHHHHHhhhcC-----CCCCCCcccccCCCCCCCCCCCCccccc-cccCCcccc Q lcl|NC_021072. 466 RRQVLKQTDQEIKEIDKQIDSEREAG-----LIVDPMAEMDPAMDPGNAPPADDMSAQE-GPAVDAGDA 528 (533) Q Consensus 466 ~k~IL~~tDeeI~e~~kqi~~E~~~~-----~~~~p~~~~~~~~~~~~~~~~~d~~~~~-~~~~~~~~~ 528 (533) ++.+.++|+.+++.+.+..+++...+ +...|......+...+..+...-.+..+ +.+++.+-. T Consensus 411 l~~l~gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (479) T protein:vir:99 411 WDMIPNLDQSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNMQQANNKTGEPASLNKSGA 479 (479) T ss_pred HHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCCCCCCCCCcchhccCCCCC Confidence 99988999999987766555543322 2222221111111111111111111111 112333222 No 108 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=97.21 E-value=0.00013 Score=41.91 Aligned_cols=464 Identities=11% Similarity=0.075 Sum_probs=175.5 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhH-----HHHHHHHHHhhhhc-----chhh Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRN-----EYELITRYREMVLQ-----PECD 70 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~-----~~~LI~~YR~m~~~-----pEvd 70 (533) |--.+|=-|=-..... ..-+...-+.. +....+-.-.......++. ......+|+.+..+ +.+- T Consensus 2 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~----~~~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~ 75 (502) T protein:vir:48 2 MEQTLFTDSTGQDLVL--NLRFHRESRIR----YRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVL 75 (502) T ss_pred ceeEEEEecchhHHHh--hcccChhHHhh----hcccchhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc Confidence 3333331111110000 00000000000 0000000000000000000 00111122222221 2211 Q ss_pred hH-----------------HHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhh Q lcl|NC_021072. 71 SA-----------------VDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYV 133 (533) Q Consensus 71 ~A-----------------vdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYv 133 (533) .. -.-||+. .+.=.=+.|+.+...+.+..+.+ .+-+..++..-+|+....+..+...+ T Consensus 76 ~~~~~~~~~~~~~ki~~n~~k~Ivd~-~~~yl~g~p~~~~~~d~~~~~~~----~~~l~~~~~~N~~~~~~~~~~~~~~~ 150 (502) T protein:vir:48 76 KSGRRKDNEMADKRAVHNYGRMISKF-KTGYLAGNPIRVEYDDNEDNSQN----DDAIKRIGRINDIDTHNRNLIRDLSQ 150 (502) T ss_pred ccccccccccccceeecchHHHHHHH-HhhhhcccCeeEecCCccchhHH----HHHHHHHHhhcCHhHHHHHHHHHHhh Confidence 10 0111111 11112366777777664444444 44456667777899999999999999 Q ss_pred cCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCC--CcCceeEEeccceeeccchhceecccccccc--ccCCcc Q lcl|NC_021072. 134 DGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQK--RPEQLRGEDINTQLTQKAAEYYLYNPKGLKN--STNQGM 209 (533) Q Consensus 134 DGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~--~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~--~~~~~~ 209 (533) -|+.|++.-+|. .|-..++.+||+..-.|..-... ..-.++++..... -....-.-+|.+...+. ..+... T Consensus 151 ~G~a~~~v~~de----dg~~~i~~~~p~~~~~vydd~~~~~~~~~ir~~~~~~~-~~~~~~~~iyt~~~i~~~~~~~~~~ 225 (502) T protein:vir:48 151 TGRAYEVIYRSE----YDETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTL-QNAKDVVEIYTNQHIYTLDASDSFN 225 (502) T ss_pred cCeEEEEEEeCC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEEEeec-CCcEEEEEEEeCCeEEEEEeCCcee Confidence 999999877763 45577899999999776432111 1111112211000 00011111333332210 001111 Q ss_pred eeccchhhccccccc-cCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHH Q lcl|NC_021072. 210 KIATDSVTYCHSGIQ-DLNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMG 287 (533) Q Consensus 210 kI~~dai~y~hsGl~-d~~~~~i~syL~~AiK~~NqLr-m~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~ 287 (533) .+....-.|..-=++ -.++....|=++.++....-+. ++=+....-+-++.|-+-+.=......+... .- +. T Consensus 226 ~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~--~~----~~ 299 (502) T protein:vir:48 226 EISVTPHAFGTVPITEFLNNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQA--SD----MK 299 (502) T ss_pred eccceecCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccch--hh----hh Confidence 111100000000000 0234455566665554444332 2233333444555554444322111111000 00 00 Q ss_pred hcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCCccccCCCCc Q lcl|NC_021072. 288 RYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELE-DVKYFQKKLYKALNVPSSRLETETT 366 (533) Q Consensus 288 ~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~-DV~YF~~kLy~aL~VP~sRl~~~~~ 366 (533) + .++++ +...... -+.+.+..+.+|-...+...+. -+.-+.+.+|...++|---++.-+| T Consensus 300 ~--~~~~~-------------~~~~~~~----~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~ 360 (502) T protein:vir:48 300 R--TRLMQ-------------LKPPKSA----DGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSG 360 (502) T ss_pred h--cceee-------------ccccccc----cccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccccc Confidence 0 11110 0000000 0112233455554433333333 3566677788888888422222111 Q ss_pred ccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-HhHHhhhhhceeEEEeccchHHHHHHHHHHHHH Q lcl|NC_021072. 367 FNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMS-LEEWDEMKEHIQFDFIADNYFTELKEIEIRNER 445 (533) Q Consensus 367 ~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t-~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R 445 (533) |. .|..|..-......-+.+.++.|..-+.+.++.=+-+-++.. ..+++ ...|.+.|..-..-.+ .+. T Consensus 361 -n~-Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d--~~~i~i~f~~~~p~d~-------~e~ 429 (502) T protein:vir:48 361 -NA-SGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFD--ESRLKITFTPNLPKSL-------YEQ 429 (502) T ss_pred -Cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--cccceEEeCCCCCcCH-------HHH Confidence 11 222343333334455666677777777776655332222211 11121 2447888864333223 234 Q ss_pred HHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCC-CC-CcccccCCCCCCCCCCCCccccccccC Q lcl|NC_021072. 446 MNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIV-DP-MAEMDPAMDPGNAPPADDMSAQEGPAV 523 (533) Q Consensus 446 ~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~-~p-~~~~~~~~~~~~~~~~~d~~~~~~~~~ 523 (533) ++++.++. | .+|.+++++. |..+++. +++.++|++|....-.. .| ......+. +++..+++... T Consensus 430 a~~~~kl~---g-~iS~et~l~~-l~~v~D~-~~E~~ri~~E~~~~~~~~~~~~~~~~~~~-~~d~~~e~~~~------- 495 (502) T protein:vir:48 430 VSILNDLG---G-QVSQETALSL-SGLVENP-TEELDKINEESSKIDFKGYPSYFYDNVGK-YTDEVKETHTD------- 495 (502) T ss_pred HHHHHHHh---c-cCcHHHHHHh-CCCCCCH-HHHHHHHHHHHHhhhhhcccccccccccc-cCCCccCCCCc------- Confidence 45556653 4 4899999987 6665432 33444555554432111 11 11111110 11111111100 Q ss_pred Cccccch Q lcl|NC_021072. 524 DAGDAKR 530 (533) Q Consensus 524 ~~~~~~~ 530 (533) +..+..| T Consensus 496 ~~~~~~~ 502 (502) T protein:vir:48 496 DFERVYE 502 (502) T ss_pred CcCCCCC Confidence 0111111 No 109 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=97.18 E-value=0.00014 Score=41.72 Aligned_cols=400 Identities=13% Similarity=0.126 Sum_probs=178.7 Q ss_pred ccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcceeeec Q lcl|NC_021072. 5 LFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNETICGN 84 (533) Q Consensus 5 ~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvneaiv~d 84 (533) .|..++++. +..+...++...+ ...++...+.++. -+..+++|-|.+||+-|.+.+-. T Consensus 1 m~f~~~~~~----~~~~~~~~~~~~~--~~~g~~~~~~~v~--------------~~~al~~~~v~~~i~~ia~~ia~-- 58 (409) T protein:vir:10 1 MLFRKGFKN----QSQEISIDDKKIL--EWLGINPSETYVN--------------GKSCLKQATVFGCIRILSDNISK-- 58 (409) T ss_pred CcccccccC----cCCCCCCChHHHH--HHhcCCcCcceec--------------hhhhhccHHHHHHHHHHHHhhhh-- Confidence 232233331 1111111111111 1111111111111 12345788899999999888553 Q ss_pred CCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcc----hhhhhHHHH----hhhhcCceeeeeeecCCCCCCCeEEEE Q lcl|NC_021072. 85 FDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDF----ENRSYEIFR----RWYVDGRLFYHKVIDPKNPRGGLTELR 156 (533) Q Consensus 85 ~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f----~~~~~~~fR----~WYvDGri~~hkvid~~~~~~gI~elr 156 (533) .|+.|--+ .+..+.. +-..++.+|+. ...+.++.+ .+.+.|.-|+.++-+. .+-+++|. T Consensus 59 ---lp~~~~~~-~~~~~~~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~---~G~~~~L~ 125 (409) T protein:vir:10 59 ---LPIKIYQK-KDGIKRV------PDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKK---NGEIKGLY 125 (409) T ss_pred ---CceEEEEe-cCCeeec------cCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC---CCcEEEEE Confidence 34444211 1111111 11234444432 234444444 4677899999987764 45599999 Q ss_pred EcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCccchhHH Q lcl|NC_021072. 157 YIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLH 236 (533) Q Consensus 157 ~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~ 236 (533) +|+|..++.+....+..... ..-.|.|.+.+ +....++.+-+.+.. ++ ..++-.++|-|+ T Consensus 126 ~i~~~~V~v~~~~~~~~~~~-------------~~~~y~~~~~~-----g~~~~~~~~evih~r-~~-~~d~~~G~s~i~ 185 (409) T protein:vir:10 126 PLKSDGMKIFVDDTGLLNSE-------------NNVWYLYTDDL-----GQRHKFMSDEILHFK-GL-TADGLAGLSVIE 185 (409) T ss_pred EEcCCceEEEEcCCcccccc-------------ceEEEEEEeCC-----ceeEEeccccEEEec-Cc-CCCCcccccHHH Confidence 99999997544321111100 01111221111 112233333332222 11 223335679999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhc Q lcl|NC_021072. 237 KAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFW 316 (533) Q Consensus 237 ~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDyw 316 (533) .|..++.....+++...=+=--.+.-+-|..++ +.|.+..+++ +++.+++...-. ...|. .+ +++ T Consensus 186 ~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~~-~~~~~~~~~~g~---~n~~~------~~-vl~--- 250 (409) T protein:vir:10 186 LLNHLIENGKSSETYLNNFFKNGLQVKGLVQYA-GDLNPEAEEV-FKENFERMSSGL---KNAHR------IA-MLP--- 250 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC-CCCCHHHHHH-HHHHHHHHhccc---cccCC------ce-ecC--- Confidence 999999888887776554433344556777776 4566555544 333333321110 01121 11 221 Q ss_pred ccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHH Q lcl|NC_021072. 317 LPRREGGRGTEISTLPG-GQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSE 395 (533) Q Consensus 317 LpRReggrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~ 395 (533) .|++++.|.- ...+.-++-.+|..+.+.++++||.+.|+..+.-+....++..+. |..+ .|+- +.. T Consensus 251 -------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~~~---f~~~--~l~P-~~~ 317 (409) T protein:vir:10 251 -------IGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQNRE---FYID--TLQS-ILN 317 (409) T ss_pred -------CCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHHHHH---HHHH--HHHH-HHH Confidence 2456666632 222334556678999999999999999975443333334444433 5543 2322 112 Q ss_pred HHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHH Q lcl|NC_021072. 396 LFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQ 475 (533) Q Consensus 396 if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDe 475 (533) .+.+.|-..|+ ++.+. .....+.|.. ..+...+ +..|.+.+..+-.- -+++.+-++. +|++.. T Consensus 318 ~ie~~ln~kL~-----~~~~~---~~~~~~~fd~----~~ll~~d-~~~~~~~~~~~~~~--G~~T~NE~R~-~lgl~p- 380 (409) T protein:vir:10 318 MYELEINYKLF-----LISEI---KNGFYSKFNV----DTILRAD-IKTRYESYKEAIQN--GFKTPNEIRE-LEEDEP- 380 (409) T ss_pred HHHHHHHHhhc-----Cchhc---cCCcEEEEec----hhhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH-HhCCCC- Confidence 23333333333 33332 2334455552 2332221 23455555443222 3566666653 344421 Q ss_pred HHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccccccCCcccc Q lcl|NC_021072. 476 EIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDA 528 (533) Q Consensus 476 eI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 528 (533) ++ +. +.-.-+.+..|.++...+. ...++. T Consensus 381 -----------------~~--gg--D~~~~~~n~~~~~~~~~~~---~kgGe~ 409 (409) T protein:vir:10 381 -----------------LE--GG--DVLLINGNMIPVKMAGEQY---SKGGEK 409 (409) T ss_pred -----------------CC--Cc--CeeeeccCccchhhccccc---cccCCC Confidence 11 10 0001122222332222111 112222 No 110 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=97.14 E-value=0.00016 Score=41.47 Aligned_cols=411 Identities=15% Similarity=0.127 Sum_probs=183.3 Q ss_pred CCccccceeeeccccccccCCC-CCCCCC-c-ccceeeccccccccc----chhhhhhHHHHHHHHHHhhhhcchhhhHH Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSF-VQKDSM-D-GSQPIVGGGYYGYSV----DFDGTVRNEYELITRYREMVLQPECDSAV 73 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~-~~~~~~-d-g~~~~~~~~~~~~~~----~~~~~~~~~~~LI~~YR~m~~~pEvd~Av 73 (533) |. ||.| +.+.+...+..++ ++|... . .+.+..+..+.+... +..+....----+ ......++|-|..|| T Consensus 1 Mg--l~d~-~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v-~~~~al~~~~V~~ci 76 (431) T protein:vir:10 1 MG--LFDF-IRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTG-RETRALRNMAVLRCV 76 (431) T ss_pred Cc--chhh-hhcCcccccccccccccccccccccccccccccccccchHHHHhhccCccCccee-chhhhhccHHHHHHH Confidence 76 6666 2221111011110 111100 0 011111111211100 0000000000001 123445789999999 Q ss_pred HHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcch----hhhhHH----HHhhhhcCceeeeeeecC Q lcl|NC_021072. 74 DDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFE----NRSYEI----FRRWYVDGRLFYHKVIDP 145 (533) Q Consensus 74 deIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~----~~~~~~----fR~WYvDGri~~hkvid~ 145 (533) +-|.+.+-- .|+.|- ...+..+.. . -..++.+|+.. -.+.++ +..+.+.|.-|..++-| T Consensus 77 ~~Ia~~iA~-----lp~~v~-~~~~~~~~~----~--~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~- 143 (431) T protein:vir:10 77 TLISGTIGM-----LPMNLI-SSDDSKQVL----T--DDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWS- 143 (431) T ss_pred HHHHHhhcc-----CceEEE-EecCceeee----c--cchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEc- Confidence 988877532 455542 111111111 1 12455555432 234444 44467889999887664 Q ss_pred CCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhcccccccc Q lcl|NC_021072. 146 KNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQD 225 (533) Q Consensus 146 ~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d 225 (533) .+++++|.++||..+..+... ++.-+ |.++.+.|. ...++.+-|.+.. +. . T Consensus 144 ---~g~~~~L~pl~~~~v~~~~~~-----~~~~~-------------y~~~~~~g~------~~~~~~~dViHir-~~-~ 194 (431) T protein:vir:10 144 ---GNRPIRLIPMDRGSAKGRLTS-----TWQIV-------------YDYTTPTGD------KIELPAREVFHLR-DL-S 194 (431) T ss_pred ---CCceEEEEEEcCceeEEEEcC-----CCeEE-------------EEEEeCCce------EEEEchhhEEEec-Cc-C Confidence 367899999999998653321 11000 111112211 1223333222211 11 2 Q ss_pred CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccc Q lcl|NC_021072. 226 LNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDD 305 (533) Q Consensus 226 ~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d 305 (533) .++-.++|.|+.|.+++.....+++...=+----|--+-|.-.+ ++|.+.++++.-+.+...|..- ...|. T Consensus 195 ~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~ls~e~~~~~~~~~~~~~~g~----~n~g~---- 265 (431) T protein:vir:10 195 IDGVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVP-KELSDNAYGRMKASVQENHTGS----ENAGS---- 265 (431) T ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecC-CCCCHHHHHHHHHHHHHHhcCc----cccCC---- Confidence 23345678999999999888888887765555555556666666 4677666655555544444320 11121 Q ss_pred cccchhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHH Q lcl|NC_021072. 306 KKFMSMLEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQK 384 (533) Q Consensus 306 ~~~msmlEDywLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~K 384 (533) .+ .+ + .|.+++.|.-. ..+.-++--+|-...+.++++||..-|+...+-+.-+..+..+. |.+ T Consensus 266 --~~-vl--------~--~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~---f~~ 329 (431) T protein:vir:10 266 --WM-LL--------E--EGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTSWGSGIEQLAIF---FIQ 329 (431) T ss_pred --ce-ec--------C--CCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCCccccHHHHHHH---HHH Confidence 11 11 1 14455555321 11222333345567899999999999976543222233344444 554 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhh--ccccccH Q lcl|NC_021072. 385 FIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPY--VGKYFSI 462 (533) Q Consensus 385 fi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~--vGky~S~ 462 (533) ++ |+--+. .+.+.|-.. +++++++. ...+.|. +.+|...+ +..|.+.+..+-.- -..++|. T Consensus 330 ~t--L~P~~~-~ie~~ln~~-----Ll~~~~~~----~~~~~fd----~~~llr~d-~~~r~~~~~~~~~~G~~~g~lT~ 392 (431) T protein:vir:10 330 YG--LSHWFV-SWEQAAARA-----FLPEKMLG----QRQFKFN----EGALLRGT-LNDQAAFFSKALGAGGQSPWMKQ 392 (431) T ss_pred HH--HHHHHH-HHHHHHHhh-----ccChhhcC----CceEEEe----chhhhccC-HHHHHHHHHHHHhcccccCccCH Confidence 42 333222 223333333 33444443 2345565 44443332 45677666655332 2345666 Q ss_pred HHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccc--ccCCCCCCCCCCCCccccccccCC Q lcl|NC_021072. 463 DYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEM--DPAMDPGNAPPADDMSAQEGPAVD 524 (533) Q Consensus 463 ~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~--~~~~~~~~~~~~~d~~~~~~~~~~ 524 (533) +-+++. ++| +.++.|..+. .|.+ ..+.++.++ +|+.- T Consensus 393 NE~R~~-~gl------------------~p~~~~~gD~~~~p~n----~~~~~~~~~--~p~~~ 431 (431) T protein:vir:10 393 NEVREM-LDL------------------PRADDPVADQLRNPMT----QKQKGSGDE--PPATT 431 (431) T ss_pred HHHHHH-hCC------------------CCCCCccccceecccc----cccCCCCCC--CCCCC Confidence 666642 433 2333333221 1111 111111111 12111 No 111 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=97.12 E-value=0.00016 Score=41.37 Aligned_cols=400 Identities=12% Similarity=0.082 Sum_probs=177.9 Q ss_pred cccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcceeee Q lcl|NC_021072. 4 QLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNETICG 83 (533) Q Consensus 4 ~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvneaiv~ 83 (533) -||.--+++ +. ..+...+ +........|.+..+.....+ .......+|.|.+||+-|.+.+-. T Consensus 1 m~~~~~f~~--~~--~~~~~~~----~~~~~~~~~~~~~~~~~~~~v--------~~~~al~~~~v~~~i~~Ia~~ia~- 63 (416) T protein:vir:12 1 MLLERMFEK--RS--GSSDHED----GFNNILLNMFGGRKTASGERV--------SESNSLVQPDIFACVNVLSDDIAK- 63 (416) T ss_pred Cccchhccc--cc--CccccCc----cchhHHHHhhcCcccccCcee--------chhhhhccHHHHHHHHHHHHhhhh- Confidence 233211111 11 1111111 111111122222111111111 122345789999999999888654 Q ss_pred cCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhc----chhhhhH----HHHhhhhcCceeeeeeecCCCCCCCeEEE Q lcl|NC_021072. 84 NFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLD----FENRSYE----IFRRWYVDGRLFYHKVIDPKNPRGGLTEL 155 (533) Q Consensus 84 d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~----f~~~~~~----~fR~WYvDGri~~hkvid~~~~~~gI~el 155 (533) .|+.+--..-+..+..+ ...++.+|. -...+.+ ++..+++.|.-|..++-|. .+-+.+| T Consensus 64 ----l~~~~~~~~~~~~~~~~------~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~---~G~~~~L 130 (416) T protein:vir:12 64 ----LPIHTYKRTDGGIERKP------EHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGS---HGYPEAL 130 (416) T ss_pred ----CceEEEEecCCcccccc------ccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECC---CCcEEEE Confidence 34443211111111110 112233332 2233334 4445677899999877653 3459999 Q ss_pred EEcChhhceehhhccCCCcCceeEEeccceeeccchhceecccccc--ccccCCcceeccchhhccccccccCCCCccch Q lcl|NC_021072. 156 RYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGL--KNSTNQGMKIATDSVTYCHSGIQDLNKNMTLS 233 (533) Q Consensus 156 r~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~--~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~s 233 (533) .+|||..++.+.... .... +|.+...|. .+.....++|... ..++-.++| T Consensus 131 ~~l~~~~v~v~~~~~-----~~~~-------------~~~~~~~g~~~~~~~~eiih~~~~----------~~~~~~G~s 182 (416) T protein:vir:12 131 FPLRPDYTNAYVHPT-----TGML-------------WYQTVLNGKAIELYDYEVLHFKGL----------STDGIHGKS 182 (416) T ss_pred EEECCcceEEEEeCC-----CcEE-------------EEEEecCCeEEEecCccEEEecCc----------CCCCccccc Confidence 999999996533211 1111 111111111 1112223333211 122335679 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHh Q lcl|NC_021072. 234 HLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLE 313 (533) Q Consensus 234 yL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlE 313 (533) .|+.|.+++......+....=+=-..+.-+-|..++ +.+.+..+++ +++-.++.. +.|.+. .++ T Consensus 183 ~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~-~~~~~~~~~-------~~~~~~-------vl~ 246 (416) T protein:vir:12 183 PIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVP-AFLDEKPKEN-VRKEWKRVN-------KVENIA-------IID 246 (416) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecC-CCCCHHHHHH-HHHHHHHHh-------cCCCee-------ecC Confidence 999999999988888887665544556667777776 3555544444 333333321 112211 121 Q ss_pred hhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HHHHHH Q lcl|NC_021072. 314 DFWLPRREGGRGTEISTLPG-GQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IARLRK 391 (533) Q Consensus 314 DywLpRReggrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kf-i~rLr~ 391 (533) .|++++.|.- ...+.-++-.+|....+.++++||.+-|+..+.-+....++..+. |..+ |.-+-. T Consensus 247 ----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~---f~~~~l~P~~~ 313 (416) T protein:vir:12 247 ----------YGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQSIE---YVRNTLQPWIV 313 (416) T ss_pred ----------CCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHHHH---HHHHHHHHHHH Confidence 1455555532 122333455677789999999999999975544444444454444 5433 333444 Q ss_pred HHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhC Q lcl|NC_021072. 392 RFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLK 471 (533) Q Consensus 392 ~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~ 471 (533) ++...|. ..|+ ++.++. ....+.|. +..+.... ...|.+.+..+-.- -++|.+-++. .|+ T Consensus 314 ~ie~~l~----~~l~-----~~~~~~---~g~~i~fd----~~~l~~~d-~~~~~~~~~~~~~~--G~~T~NE~R~-~~g 373 (416) T protein:vir:12 314 NFEQELN----VKLF-----LDHDQK---SGHYVKFN----IDSELRGD-SKTQAEYLKTLHET--GVLNKDEIRE-LLE 373 (416) T ss_pred HHHHHHH----Hhhc-----Cchhhc---CCceEEee----chhhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH-HhC Confidence 3333333 2322 222221 22345554 22332222 24566666665433 4777887775 366 Q ss_pred CCHHHHHHHHHHHHHhhhcCCCC-C-CCccccc--CCCCCCCCCCCCc Q lcl|NC_021072. 472 QTDQEIKEIDKQIDSEREAGLIV-D-PMAEMDP--AMDPGNAPPADDM 515 (533) Q Consensus 472 ~tDeeI~e~~kqi~~E~~~~~~~-~-p~~~~~~--~~~~~~~~~~~d~ 515 (533) +.+-+ --++-+. .....+ + .+..+.+ +...+|..+..+| T Consensus 374 l~Pi~--ggd~~~~---~~n~~~~~~~~~~~~~~~~~~~~gge~~~~g 416 (416) T protein:vir:12 374 RNPIE--NGDKYIS---SLNYVFLDFLEEYQRLKAGGAMKGGDNKNEG 416 (416) T ss_pred CCCCC--Ccceeee---ccccccccccchhhccccccccCCCCCcCCC Confidence 65421 0000000 000000 0 0000000 0000000011111 No 112 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=97.12 E-value=0.00016 Score=41.35 Aligned_cols=399 Identities=14% Similarity=0.146 Sum_probs=175.5 Q ss_pred CC--ccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhc Q lcl|NC_021072. 1 MS--NQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVN 78 (533) Q Consensus 1 ~~--~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvn 78 (533) |. ..+++|--.+ ....+... ... .++.|.... +-+....+|-|.+||+-|.+ T Consensus 1 MG~~~~~~~~~~~~----~~~~~~~~--------~~~-~~~~g~~~~-------------~~~~al~~~~V~~~v~~Ia~ 54 (411) T protein:vir:81 1 MGWWSRLTRFFRPR----NETVDMTN--------PLL-LQWLGVDPD-------------TPRNQLSEATYFACLKILSE 54 (411) T ss_pred CchHHHHHhhccCc----ccccccch--------HHH-HHHhcCccc-------------ChhhhhccHHHHHHHHHHHH Confidence 43 3355542111 01111110 001 112121100 11334578999999999998 Q ss_pred ceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcch----hhhhHHHH----hhhhcCceeeeeeecCCCCCC Q lcl|NC_021072. 79 ETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFE----NRSYEIFR----RWYVDGRLFYHKVIDPKNPRG 150 (533) Q Consensus 79 eaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~----~~~~~~fR----~WYvDGri~~hkvid~~~~~~ 150 (533) .+-. .|+.+--..-+....+ .-..++++|+.. ..+.++.+ .+.+.|.-|..++.| .+ T Consensus 55 ~iA~-----lp~~~~~~~~~~~~~~------~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~----~g 119 (411) T protein:vir:81 55 SLGK-----LPLKMYQKTERGIVKS------DREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS----GP 119 (411) T ss_pred hHhh-----CceeEEEecCCceeee------cccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec----CC Confidence 7542 3444421111110000 112344555432 24444444 456779999987765 46 Q ss_pred CeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceecc-ccccccccCCcceeccchhhccccccccCCCC Q lcl|NC_021072. 151 GLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYN-PKGLKNSTNQGMKIATDSVTYCHSGIQDLNKN 229 (533) Q Consensus 151 gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~-p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~ 229 (533) .+.+|.+++|..++.++.-......+. .-+|.|. +.+ +..+.++.+-+.+...+. ..++- T Consensus 120 ~~~~l~~l~~~~v~~~~~~~~~~~~~~-------------~~~~~~~~~~~-----g~~~~~~~~eiih~k~~~-~~~~~ 180 (411) T protein:vir:81 120 QLQALWILPSQYVTIVVDDRGLLGEKN-------------AIWYRYNDPYD-----GKMYVFRNDEILHFKTSV-TFDGI 180 (411) T ss_pred ceEEEEEECCceEEEEEcCcccccccc-------------eEEEEEEecCC-----ceEEEEccccEEEEcCCC-CCCCc Confidence 799999999999976544322111110 1111111 100 112223333332222111 22334 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccc Q lcl|NC_021072. 230 MTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFM 309 (533) Q Consensus 230 ~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~m 309 (533) .++|-+..|...+.....+++...=+--.-+--+-+..++ +.|.+..+++..+.+...|.-- .+.|. .+ T Consensus 181 ~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g~----~n~g~------~~ 249 (411) T protein:vir:81 181 TGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYT-GDLNQEARDRLVKGFEQFANGS----KNAGK------II 249 (411) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeC-CCCCHHHHHHHHHHHHHHhcCc----cccCC------ce Confidence 5678888888888888877776654443434445555565 4566665555444443333210 01121 11 Q ss_pred hhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHH Q lcl|NC_021072. 310 SMLEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIAR 388 (533) Q Consensus 310 smlEDywLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~r 388 (533) .++ .|.+++.|.-. ..+.-++-.++..+.+.++++||...|+...+-+...+.+..+. |.+++ T Consensus 250 -vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~---f~~~~-- 313 (411) T protein:vir:81 250 -PVP----------LGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQNLA---FYVDT-- 313 (411) T ss_pred -ecC----------CCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHHHH---HHHHH-- Confidence 111 24556655321 12222344568899999999999999965544455555444433 55442 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHH Q lcl|NC_021072. 389 LRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQ 468 (533) Q Consensus 389 Lr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~ 468 (533) |+-- ...+.+.|-..| +++.++. ....+.|..+ ++...+ ...|.+.+..+-.- -++|.+-++. T Consensus 314 l~P~-~~~ie~~l~~~l-----l~~~~~~---~~~~~~fd~~----~ll~~d-~~~~~~~~~~~~~~--g~~t~NE~R~- 376 (411) T protein:vir:81 314 LLYV-LKQYEEEITYKI-----LSNDLIS---QGHYFKFNVN----VILRAD-IKTQMDSLSTAVQN--GIMTPNEARD- 376 (411) T ss_pred HHHH-HHHHHHHHHhhc-----CChhhcC---CCcEEEeech----hhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH- Confidence 3221 122233333333 3444433 2334555522 332222 22344444443222 3455555542 Q ss_pred HhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccccccCCcccc Q lcl|NC_021072. 469 VLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDA 528 (533) Q Consensus 469 IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 528 (533) ++++.+ .|+.+ .-....+.-|.++...+. ...+|. T Consensus 377 ~~gl~p--------------------~~ggD--~~~~~~n~~pl~~~~~~~---~kgGd~ 411 (411) T protein:vir:81 377 YLDMPA--------------------DDYGN--NLMANGNYIPLSMLGANY---GKGGDS 411 (411) T ss_pred HhCCCC--------------------CCCCC--eeeeccCccchhhhhhhh---ccCCCC Confidence 333321 12110 000111111111111111 112222 No 113 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=97.06 E-value=0.00019 Score=41.03 Aligned_cols=421 Identities=15% Similarity=0.177 Sum_probs=182.0 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhh---------- Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECD---------- 70 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd---------- 70 (533) |. .++ .--+.-|.+.+=....+.. + ++.-..-+.+|+.+..+.+=. T Consensus 1 ~~-------~~~------~~~~~~~~~~~~~~~~i~~-~----------i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~ 56 (452) T protein:vir:36 1 MK-------YKP------PKLMTFSKDEPITVEVVTK-F----------MEKHKLEVARYEYLKNMYLGIMAIDDEPAKD 56 (452) T ss_pred Cc-------ccC------ceeEEcCCccCCCHHHHHH-H----------HHHHHHHHHHHHHHHHHhccccccccCcccc Confidence 11 111 0011111111111000000 0 000011112233322222211 Q ss_pred ----------hHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeee Q lcl|NC_021072. 71 ----------SAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYH 140 (533) Q Consensus 71 ----------~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~h 140 (533) +-..-||+-.. .=.-+.|+.+..++ +. ..+.+..+++--+|+....+.++.+.+-|+-|++ T Consensus 57 ~~~~~~ki~~n~~~~ivd~~~-~~l~g~~~~~~~~d----~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~ 127 (452) T protein:vir:36 57 SWKPDNRLAVNFTKYIVDTFT-GYFNGIPVKKSHSD----KE----ILTKLQEFDNLNDMEDEESELAKMACIYGRAFEF 127 (452) T ss_pred ccCccceeecchHHHHHHHHh-hhhcccCceeecCC----hh----HHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEE Confidence 11122222211 11124666665443 22 2344566777678999999999999999999999 Q ss_pred eeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccc-c-ccCCcceeccchhhc Q lcl|NC_021072. 141 KVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLK-N-STNQGMKIATDSVTY 218 (533) Q Consensus 141 kvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~-~-~~~~~~kI~~dai~y 218 (533) .-+|. +|-..+..+||+.+-+|...... .....+..-... .....-..+|.+.... + ....+..+... +. T Consensus 128 v~~d~----~g~~~i~~~~p~~~~~v~d~~~~-~~~~~~i~~~~~-~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~--~~ 199 (452) T protein:vir:36 128 LYQDE----DTQTNVVYNSPENMFMVYDDTVK-QEPLFAVRYGVD-EDKKLQGEVYTLLETIKISGENDEISFGEG--TY 199 (452) T ss_pred EEecC----CCeeEEEEEcccceEEEEcCCCC-CceEEEEEEEEe-cCceEEEEEEecCeEEEEEEcCCceEEecc--ee Confidence 87763 46678999999999776543211 111111110000 0011111234443321 0 01111111100 00 Q ss_pred cccccc---c-CCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEE Q lcl|NC_021072. 219 CHSGIQ---D-LNKNMTLSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKL 293 (533) Q Consensus 219 ~hsGl~---d-~~~~~i~syL~~AiK~~NqLrm-~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~ 293 (533) -..|-+ . .++..+.|-++..+.....+.. +=+....-+..+.|-+-+. .+.++... +.++ .. +++ T Consensus 200 ~~~g~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~---g~~~~~~~----~~~~-~~--~~~ 269 (452) T protein:vir:36 200 NPYPDLPVVEFYFNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFL---GAAVEEED----LKNI-RS--NRV 269 (452) T ss_pred ccCCcccEEEecCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEee---cCCcCchh----hhhh-hh--cce Confidence 011111 1 2234555666655555554433 3444455566777766553 22222211 1111 11 122 Q ss_pred EeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCCccccCCCCcccccch Q lcl|NC_021072. 294 VYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELE-DVKYFQKKLYKALNVPSSRLETETTFNIGRA 372 (533) Q Consensus 294 vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~-DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~ 372 (533) +.=..+| .+.|..+.+|....+.+.+. -+.-+.+.+|.-..+|- +..++.-|. .+ T Consensus 270 ~~~~~~~---------------------~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~~gn~-Sg 325 (452) T protein:vir:36 270 INYYADG---------------------EGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVAN--ISDESFGSS-SG 325 (452) T ss_pred EEecCCC---------------------CccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccc--cCcccccCC-cH Confidence 2111111 22334566666555544433 35667788888889984 332221111 22 Q ss_pred hhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-CHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 373 AEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVM-SLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNT 451 (533) Q Consensus 373 ~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~-t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~ 451 (533) ..|..-+.....-+.+.+..|...+..+++.=+-+.+.. ...+|. .|.+.|...---.+ .+.+++++. T Consensus 326 ~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~----~i~i~f~~~~p~d~-------~~~a~~~~k 394 (452) T protein:vir:36 326 VSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVSNKDSWK----DIEYTFTRNEPKDI-------KEQAETANI 394 (452) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccc----cceEEeCCCCCcCH-------HHHHHHHHH Confidence 234333334445567777777777777777544333322 233454 46777764433223 233445555 Q ss_pred hhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCC--CCCCCccccc Q lcl|NC_021072. 452 MDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNA--PPADDMSAQE 519 (533) Q Consensus 452 ~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~--~~~~d~~~~~ 519 (533) + +| .+|.+++++. |..+++ .+++.++|++|.....-. .+..+.++. ....+.+.++ T Consensus 395 ~---~g-~iS~et~~~~-~~~~~d-~~~E~~ri~~E~~~~~~~------~~~~~~~~~~~~~~~~~~~~e 452 (452) T protein:vir:36 395 L---MG-ITSQETALSV-ISVIPD-VQAEMEKIKKEEASTAIF------DKDKQPSEKGTDTVVSETNEE 452 (452) T ss_pred H---hc-cCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHH------HhhccCCCCcccccCccccCC Confidence 5 34 4899999976 566542 445555666665433111 111111111 1111111111 No 114 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=97.02 E-value=0.00021 Score=40.80 Aligned_cols=405 Identities=14% Similarity=0.131 Sum_probs=173.9 Q ss_pred CC-----ccccceeeeccccccccC----------CCCCCCCCc-ccceeecccccccccchhhhhhHHHHHHHHHH--h Q lcl|NC_021072. 1 MS-----NQLFGFSLERAKKVPKGP----------SFVQKDSMD-GSQPIVGGGYYGYSVDFDGTVRNEYELITRYR--E 62 (533) Q Consensus 1 ~~-----~~~fg~~i~~~~~~~~~~----------s~~~~~~~d-g~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR--~ 62 (533) |+ .-..||+-.+...+.-+. +...+.... ....... ++.+. .+ ..|. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~-~~~~~----~~---------~~~~~~~ 66 (441) T protein:vir:94 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLP-GFQGT----KL---------RQYKDIE 66 (441) T ss_pred CccccCccccccccccccchhhhhccccccccccccccCCCcchHHHHHHhc-ccCcc----cc---------cccchhh Confidence 43 345677666644443121 111111000 0000000 11111 00 1121 2 Q ss_pred hhhcchhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcch----hhhhHHH----Hhhhhc Q lcl|NC_021072. 63 MVLQPECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFE----NRSYEIF----RRWYVD 134 (533) Q Consensus 63 m~~~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~----~~~~~~f----R~WYvD 134 (533) -+.+|-|..||+-|.+.+-. .|+.+- .+.. .. .-..++.+|+-. -.+.++. ..+.+. T Consensus 67 al~~~~V~~cv~~Ia~~iA~-----lp~~~~-~~~~--~~-------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~ 131 (441) T protein:vir:94 67 AIRHSDIFTAVMMIASDLAR-----MPIRVT-VNGQ--IN-------YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLT 131 (441) T ss_pred hhccHHHHHHHHHHHHhhcc-----Cceeee-cCcc--cc-------ccchHHHHHhcccCcCCCHHHHHHHHHHHHhhc Confidence 35678899999999887543 455442 1111 01 123456666533 2344444 446778 Q ss_pred CceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccc--cccccCCcceec Q lcl|NC_021072. 135 GRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKG--LKNSTNQGMKIA 212 (533) Q Consensus 135 Gri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~--~~~~~~~~~kI~ 212 (533) |.-|+.++-|. .+-+++|.+|+|..+..++. .++.-++... .+.. +..+ ..+.+...++|. T Consensus 132 Gnay~~i~r~~---~G~~~~L~~i~~~~v~v~~d-----~~g~~~~~~~-~~~~--------~~~~~~~~~~~~dvih~k 194 (441) T protein:vir:94 132 SHGYIEITRDK---TGEPMNLTFRKTSEIELKSD-----ARGRLYYFHQ-RIDS--------NGNNIERNVKFEDMLDIK 194 (441) T ss_pred CCeEEEEEECC---CCcEEEEEEEcCceeEEEEC-----CCccEEEEEE-Eecc--------CCceeEEEEccccEEEec Confidence 99999977753 34489999999999965332 1111111000 0000 0000 011222233332 Q ss_pred cchhhccccccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccE Q lcl|NC_021072. 213 TDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNK 292 (533) Q Consensus 213 ~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk 292 (533) . ...++-.++|-|+.|.+++.....+++...=+=---|--+-|..++ |.+...+|.+=+++-+++.- T Consensus 195 ~----------~~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~e~~e~~r~~~~~~~-- 261 (441) T protein:vir:94 195 F----------YSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFHKSF-- 261 (441) T ss_pred c----------CCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC-CCCCCHHHHHHHHHHHHHHh-- Confidence 1 1122234678899999998888777777654433445566777777 45544444333443222211 Q ss_pred EEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCC-ccccc Q lcl|NC_021072. 293 LVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETET-TFNIG 370 (533) Q Consensus 293 ~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~-~~~~g 370 (533) +| ..+..+.+ .++ .|.+++.|.=. ..+.-++-.++..+.+.++++||.+.|+..+ +++ T Consensus 262 ------~G-~~nag~~~-vl~----------~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s-- 321 (441) T protein:vir:94 262 ------SG-TKQAGKVV-VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMS-- 321 (441) T ss_pred ------cC-ccccCcce-ecC----------CCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCcc-- Confidence 12 11111122 222 14455555321 1122234456778889999999999996432 221 Q ss_pred chhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 371 RAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVN 450 (533) Q Consensus 371 ~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~ 450 (533) .++. .+-|..-+.-+-.++...|.. .|. + ++. ...+.|. +.++...+ ...|.+.+. T Consensus 322 -~~q~---~~~~~~tl~P~~~~ie~eln~----kl~-----~--~~~----~~~~~fd----~~~llr~D-~~~~~~~~~ 377 (441) T protein:vir:94 322 -ITDA---NLDYLSTLKPYITCVCAELNF----KFN-----D--EYV----NREFKFD----TTEIRVVD-EKTQAEIDK 377 (441) T ss_pred -HHHH---HHHHHHHHHHHHHHHHHHHhh----hcc-----c--ccc----CceEEee----chhhhccC-HHHHHHHHH Confidence 1222 233444444444444333332 221 1 121 2344554 33333222 334566555 Q ss_pred HhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccc--ccCC-CC---CCCCCCCCccccccccCC Q lcl|NC_021072. 451 TMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEM--DPAM-DP---GNAPPADDMSAQEGPAVD 524 (533) Q Consensus 451 ~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~--~~~~-~~---~~~~~~~d~~~~~~~~~~ 524 (533) .+-.- -++|.+-++.. +++.+ ++.++... -+.+ -+ .++.+.......+ .... T Consensus 378 ~~i~~--G~~T~NE~R~~-~gl~P------------------i~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~-~~~k 435 (441) T protein:vir:94 378 INIDS--GKMNIDEIRQR-DGLAP------------------IPGGNGSIHRVDLNHVNIELVDEYQMNKSRATD-KKLK 435 (441) T ss_pred HHHhC--CCcCHHHHHHH-hCCCC------------------CCCCCcceEeecccccccccccccccccccccc-cccC Confidence 54322 35666666532 44432 33222111 0000 00 0000000000000 0011 Q ss_pred ccccch Q lcl|NC_021072. 525 AGDAKR 530 (533) Q Consensus 525 ~~~~~~ 530 (533) -+|.-+ T Consensus 436 gGe~~e 441 (441) T protein:vir:94 436 GGEENE 441 (441) T ss_pred CCCCCC Confidence 111111 No 115 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=97.02 E-value=0.00021 Score=40.80 Aligned_cols=405 Identities=14% Similarity=0.131 Sum_probs=173.9 Q ss_pred CC-----ccccceeeeccccccccC----------CCCCCCCCc-ccceeecccccccccchhhhhhHHHHHHHHHH--h Q lcl|NC_021072. 1 MS-----NQLFGFSLERAKKVPKGP----------SFVQKDSMD-GSQPIVGGGYYGYSVDFDGTVRNEYELITRYR--E 62 (533) Q Consensus 1 ~~-----~~~fg~~i~~~~~~~~~~----------s~~~~~~~d-g~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR--~ 62 (533) |+ .-..||+-.+...+.-+. +...+.... ....... ++.+. .+ ..|. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~-~~~~~----~~---------~~~~~~~ 66 (441) T protein:vir:79 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLP-GFQGT----KL---------RQYKDIE 66 (441) T ss_pred CccccCccccccccccccchhhhhccccccccccccccCCCcchHHHHHHhc-ccCcc----cc---------cccchhh Confidence 43 345677666644443121 111111000 0000000 11111 00 1121 2 Q ss_pred hhhcchhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcch----hhhhHHH----Hhhhhc Q lcl|NC_021072. 63 MVLQPECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFE----NRSYEIF----RRWYVD 134 (533) Q Consensus 63 m~~~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~----~~~~~~f----R~WYvD 134 (533) -+.+|-|..||+-|.+.+-. .|+.+- .+.. .. .-..++.+|+-. -.+.++. ..+.+. T Consensus 67 al~~~~V~~cv~~Ia~~iA~-----lp~~~~-~~~~--~~-------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~ 131 (441) T protein:vir:79 67 AIRHSDIFTAVMMIASDLAR-----MPIRVT-VNGQ--IN-------YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLT 131 (441) T ss_pred hhccHHHHHHHHHHHHhhcc-----Cceeee-cCcc--cc-------ccchHHHHHhcccCcCCCHHHHHHHHHHHHhhc Confidence 35678899999999887543 455442 1111 01 123456666533 2344444 446778 Q ss_pred CceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccc--cccccCCcceec Q lcl|NC_021072. 135 GRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKG--LKNSTNQGMKIA 212 (533) Q Consensus 135 Gri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~--~~~~~~~~~kI~ 212 (533) |.-|+.++-|. .+-+++|.+|+|..+..++. .++.-++... .+.. +..+ ..+.+...++|. T Consensus 132 Gnay~~i~r~~---~G~~~~L~~i~~~~v~v~~d-----~~g~~~~~~~-~~~~--------~~~~~~~~~~~~dvih~k 194 (441) T protein:vir:79 132 SHGYIEITRDK---TGEPMNLTFRKTSEIELKSD-----ARGRLYYFHQ-RIDS--------NGNNIERNVKFEDMLDIK 194 (441) T ss_pred CCeEEEEEECC---CCcEEEEEEEcCceeEEEEC-----CCccEEEEEE-Eecc--------CCceeEEEEccccEEEec Confidence 99999977753 34489999999999965332 1111111000 0000 0000 011222233332 Q ss_pred cchhhccccccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccE Q lcl|NC_021072. 213 TDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNK 292 (533) Q Consensus 213 ~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk 292 (533) . ...++-.++|-|+.|.+++.....+++...=+=---|--+-|..++ |.+...+|.+=+++-+++.- T Consensus 195 ~----------~~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~e~~e~~r~~~~~~~-- 261 (441) T protein:vir:79 195 F----------YSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFHKSF-- 261 (441) T ss_pred c----------CCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC-CCCCCHHHHHHHHHHHHHHh-- Confidence 1 1122234678899999998888777777654433445566777777 45544444333443222211 Q ss_pred EEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCC-ccccc Q lcl|NC_021072. 293 LVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETET-TFNIG 370 (533) Q Consensus 293 ~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~-~~~~g 370 (533) +| ..+..+.+ .++ .|.+++.|.=. ..+.-++-.++..+.+.++++||.+.|+..+ +++ T Consensus 262 ------~G-~~nag~~~-vl~----------~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s-- 321 (441) T protein:vir:79 262 ------SG-TKQAGKVV-VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMS-- 321 (441) T ss_pred ------cC-ccccCcce-ecC----------CCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCcc-- Confidence 12 11111122 222 14455555321 1122234456778889999999999996432 221 Q ss_pred chhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 371 RAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVN 450 (533) Q Consensus 371 ~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~ 450 (533) .++. .+-|..-+.-+-.++...|.. .|. + ++. ...+.|. +.++...+ ...|.+.+. T Consensus 322 -~~q~---~~~~~~tl~P~~~~ie~eln~----kl~-----~--~~~----~~~~~fd----~~~llr~D-~~~~~~~~~ 377 (441) T protein:vir:79 322 -ITDA---NLDYLSTLKPYITCVCAELNF----KFN-----D--EYV----NREFKFD----TTEIRVVD-EKTQAEIDK 377 (441) T ss_pred -HHHH---HHHHHHHHHHHHHHHHHHHhh----hcc-----c--ccc----CceEEee----chhhhccC-HHHHHHHHH Confidence 1222 233444444444444333332 221 1 121 2344554 33333222 334566555 Q ss_pred HhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccc--ccCC-CC---CCCCCCCCccccccccCC Q lcl|NC_021072. 451 TMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEM--DPAM-DP---GNAPPADDMSAQEGPAVD 524 (533) Q Consensus 451 ~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~--~~~~-~~---~~~~~~~d~~~~~~~~~~ 524 (533) .+-.- -++|.+-++.. +++.+ ++.++... -+.+ -+ .++.+.......+ .... T Consensus 378 ~~i~~--G~~T~NE~R~~-~gl~P------------------i~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~-~~~k 435 (441) T protein:vir:79 378 INIDS--GKMNIDEIRQR-DGLAP------------------IPGGNGSIHRVDLNHVNIELVDEYQMNKSRATD-KKLK 435 (441) T ss_pred HHHhC--CCcCHHHHHHH-hCCCC------------------CCCCCcceEeecccccccccccccccccccccc-cccC Confidence 54322 35666666532 44432 33222111 0000 00 0000000000000 0011 Q ss_pred ccccch Q lcl|NC_021072. 525 AGDAKR 530 (533) Q Consensus 525 ~~~~~~ 530 (533) -+|.-+ T Consensus 436 gGe~~e 441 (441) T protein:vir:79 436 GGEENE 441 (441) T ss_pred CCCCCC Confidence 111111 No 116 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=97.01 E-value=0.00021 Score=40.75 Aligned_cols=454 Identities=11% Similarity=0.069 Sum_probs=197.0 Q ss_pred CCccccceeeeccccccc------cCCCCCCCCCcccceeecccccccccchhhhhhHHH--HHHHHHHhhhhc-----c Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPK------GPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEY--ELITRYREMVLQ-----P 67 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~------~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~--~LI~~YR~m~~~-----p 67 (533) |--.||--++-....... ...+.-++..+-.... ...-..+-+.. ....+|+.+..+ | T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~ 71 (501) T protein:vir:96 1 MEQTLFTDSTGQERVLNLRFHRESRIRYRADNLEELMVNN---------WELLKNFINHHKLRQAPRIQELLDYARGENH 71 (501) T ss_pred CceeeeeecccceeccccccchhHHhhhcccccccccCCh---------HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC Confidence 777776555544322111 0111111111110000 00001110000 111122222222 2 Q ss_pred hh-----------------hhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHh Q lcl|NC_021072. 68 EC-----------------DSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRR 130 (533) Q Consensus 68 Ev-----------------d~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~ 130 (533) .+ -+-..-||+...-+ .=+.|+.+.+++.+.++.+.+ -+..+++.-+|+....+.++. T Consensus 72 ~i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~y-l~g~p~~~~~~~~~~~~~~~~----~l~~~~~~n~~~~~~~~~~~~ 146 (501) T protein:vir:96 72 DVLKSGRRKDNEMADKRAVHNYGRMISKFKTGY-LAGNPIRVEYDDNDDNSQNDD----AIKRIGRINDLDSLNRTLIRD 146 (501) T ss_pred cccCccccCccccccceeecchHHHHHHHHhhh-hcccCeeEeeCCccchhHHHH----HHHHHHHhcCHHHHHHHHHHH Confidence 11 01111123221110 126788888877666655544 455566667899999999999 Q ss_pred hhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCc---eeEEeccceeeccchhceecccccccc--cc Q lcl|NC_021072. 131 WYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQ---LRGEDINTQLTQKAAEYYLYNPKGLKN--ST 205 (533) Q Consensus 131 WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~---~~~~~~~~~~~~~~~e~~~y~p~~~~~--~~ 205 (533) .++-|+-|.+.-.|. .|-..+..+||+.+-+|..-... ... .+++.... ..+...-..+|.+..... .. T Consensus 147 ~~~~G~a~~~v~~de----dg~~~i~~~~p~~~~~v~d~~~~-~~~~~~v~~~~~~~-~~~~~~~~~vyt~~~i~~~~~~ 220 (501) T protein:vir:96 147 LSQTGRAYEVIYRSE----YDETRIKRLSPLETFVIYDNSLE-DNSIAAVRYYNRGT-LQSAKDVVEIYTDEHIYTLDAS 220 (501) T ss_pred HhhcCeEEEEEEEcC----CCceEEEEEccceeEEEEcCCCC-CceEEEEEEEEeec-CCCcEEEEEEEcCCcEEEEeeC Confidence 999999999877763 45677899999999776432111 111 11111100 001111122344433211 01 Q ss_pred CCcceeccchhhccc-ccc---cc-CCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHH Q lcl|NC_021072. 206 NQGMKIATDSVTYCH-SGI---QD-LNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAE 279 (533) Q Consensus 206 ~~~~kI~~dai~y~h-sGl---~d-~~~~~i~syL~~AiK~~NqLr-m~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAe 279 (533) +....+.. ..| .|. +. .|+....|=++..+.....+. ++=+....-+.++.|-+-+.-.+....+... T Consensus 221 ~~~~~~~~----~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~-- 294 (501) T protein:vir:96 221 DDFNEISV----TTHAFGTVPITEYLNNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQA-- 294 (501) T ss_pred CCceeccc----cccCCCccceEEecCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccch-- Confidence 11100000 011 111 10 123445566666555554443 4444445556666776665443322211100 Q ss_pred HHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCC----CCccceeecCCCCCcchH-HHHHHHHHHHHHhc Q lcl|NC_021072. 280 QYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREG----GRGTEISTLPGGQNLGEL-EDVKYFQKKLYKAL 354 (533) Q Consensus 280 qYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReg----grgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL 354 (533) .. |.. ++ -++++-..+ +.+..+..|-...+...+ .-++-+.+.+|... T Consensus 295 ---~~-~~~--~~---------------------~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s 347 (501) T protein:vir:96 295 ---SD-MKR--TR---------------------LMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFT 347 (501) T ss_pred ---hh-hhh--cC---------------------eeeecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHh Confidence 00 000 11 122222221 122345544433333222 22345567778888 Q ss_pred CCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCH-hHHhhhhhceeEEEeccchH Q lcl|NC_021072. 355 NVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSL-EEWDEMKEHIQFDFIADNYF 433 (533) Q Consensus 355 ~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~-eew~~~~~~i~~~f~~Dn~f 433 (533) ++|---++.-++ |. .+..|.--.......+.+.++.|..-+.+.++.=+-+-++..+ .+++ ...|.+.|...-.- T Consensus 348 ~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d--~~~i~i~f~~~~p~ 423 (501) T protein:vir:96 348 NTPDMSDTNFSG-NT-SGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFD--ESLLKITFTPNLPK 423 (501) T ss_pred CCcccCcccccc-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--cccceEEeCCCCCc Confidence 988433332221 11 2223433334455667777888888887777664433333211 1222 23477888644333 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCC Q lcl|NC_021072. 434 TELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPAD 513 (533) Q Consensus 434 ~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~ 513 (533) .+ .+.++++..+. | .+|.+++++. |...++ .+++.++|++|....-......+.++.. +. ..+ T Consensus 424 n~-------~e~ad~~~kl~---g-~iS~et~~~~-l~~v~D-~~~E~~ri~~E~~~~~~~~~~~~~~~~~---~~-~~~ 486 (501) T protein:vir:96 424 SL-------NEQVSILTGLG---G-QVSQETALSL-SGLVES-PNEELDKINKEMSEIDFKGYSNDFNEHV---GK-YTD 486 (501) T ss_pred CH-------HHHHHHHHHHh---c-cCchHHHHHh-CCCCCC-HHHHHHHHHHHHHHhhccccccchhhcc---cc-cCC Confidence 33 34455666664 4 4899999987 555442 3344555666654321111111111111 11 111 Q ss_pred CccccccccCCccccchh Q lcl|NC_021072. 514 DMSAQEGPAVDAGDAKRG 531 (533) Q Consensus 514 d~~~~~~~~~~~~~~~~~ 531 (533) + +.....|.++.+.+ T Consensus 487 ~---~~e~~~d~~e~~~~ 501 (501) T protein:vir:96 487 E---VKETHTDDFEREYE 501 (501) T ss_pred c---CCCCCCCccccccC Confidence 1 11122344444444 No 117 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=96.98 E-value=0.00023 Score=40.55 Aligned_cols=413 Identities=10% Similarity=0.093 Sum_probs=167.3 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhh-hHHHHHHHHHHhhhhcchhh--------- Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTV-RNEYELITRYREMVLQPECD--------- 70 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~-~~~~~LI~~YR~m~~~pEvd--------- 70 (533) |-..-.+ +.+.--.....++-+...... .....+ ..-..-+.+|+.+..+.+-. T Consensus 1 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~~--------------~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~ 64 (468) T protein:vir:96 1 MIDIFWP--NEKPYHERVVEQIKPQYETQE--------------EMILRLITKHKENVEDITVGERYYNHQPDVLFNAPK 64 (468) T ss_pred CccccCC--cCceeehheeecccccccCcH--------------HHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc Confidence 1100000 000000000000000000000 000000 00111122222222221111 Q ss_pred ------------------hHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhh Q lcl|NC_021072. 71 ------------------SAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWY 132 (533) Q Consensus 71 ------------------~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WY 132 (533) +-...||+...- =.-+.|+.+..++.+..+. +..++. =+|+....++.+.++ T Consensus 65 ~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~-~l~g~p~~~~~~d~~~~~~--------l~~~~~-n~~~~~~~~~~~~~~ 134 (468) T protein:vir:96 65 RNVKGEIDPFKPDWRMYTNYHQNLVDQKVA-YAVANPVTYGTEDEKSLKT--------IQEVLN-HKWDDKLVDILTAAS 134 (468) T ss_pred ccccccccccccccccccchHHHHHHHHHh-hhccCCceeccCChHHHHH--------HHHHHh-cCHHHHHHHHHHHHh Confidence 111222222111 1125677776655322222 222322 167888889999999 Q ss_pred hcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCc---eeEEeccceeeccchhceeccccccc------- Q lcl|NC_021072. 133 VDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQ---LRGEDINTQLTQKAAEYYLYNPKGLK------- 202 (533) Q Consensus 133 vDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~---~~~~~~~~~~~~~~~e~~~y~p~~~~------- 202 (533) +-|+.|.+.-+|. +|-..+..+||+.+-+|..-. ...+. .+++..... ....+|.+.... T Consensus 135 ~~G~~~~~v~~d~----~~~~~i~~~~p~~~~~v~~~~-~~~~~~~~ir~~~~~~~-----~~~~~~~~~~~~~~~~~~~ 204 (468) T protein:vir:96 135 NKGVEWIQPYVDE----QGEFKTFRVPAEQAIPIWTNK-ERDELKAFIRLYELDGG-----ERVEYWTANDVTFYELKDG 204 (468) T ss_pred hcCeEEEEEEEcC----CCceEEEEEcccceEEEEcCC-CCCceEEEEEEEEecCc-----eEEEEEeCCeEEEEEEcCC Confidence 9999999877753 456789999999987653211 11111 111111110 111122221110 Q ss_pred ----------cccCCcceeccchhhcccccccc----CCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEE Q lcl|NC_021072. 203 ----------NSTNQGMKIATDSVTYCHSGIQD----LNKNMTLSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFY 267 (533) Q Consensus 203 ----------~~~~~~~kI~~dai~y~hsGl~d----~~~~~i~syL~~AiK~~NqLrm-~EDalVIyRi~RAPeRrvfy 267 (533) .............. ..|.+. +|+....|-++..+.....|.+ +-+..-.-+-++.|-+-+.- T Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g 281 (468) T protein:vir:96 205 QLIPDYYQGEEHVQAHYYVGNKSM---SWNRVPFIPFKNNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKG 281 (468) T ss_pred ceeecccccccccccceeeccccc---cCCcccEEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec Confidence 00000000000000 001110 1234456777776666555543 33334445666666543332 Q ss_pred ccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchH-HHHHHH Q lcl|NC_021072. 268 IDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGEL-EDVKYF 346 (533) Q Consensus 268 IDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei-~DV~YF 346 (533) .+... .++.+.. |..++ . ++++= +++.+ +..|....+.... .-++-+ T Consensus 282 ~~~~~-----~~~~~~~-~~~~~--~---------------------i~~~~-d~~~~--~~~l~~~~~~~~~~~~~~~l 329 (468) T protein:vir:96 282 YEGED-----LEEFMYN-LKYYK--A---------------------INVDG-DGSGG--VDTIQIDVPVQSAKEYLDML 329 (468) T ss_pred CCccc-----cchhhhh-hhcCc--e---------------------EEecC-CCCCc--ceEEeecCChHHHHHHHHHH Confidence 11111 1111110 11111 1 12222 22222 4444333333333 347778 Q ss_pred HHHHHHhcCCCccccCCCCccc-ccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeE Q lcl|NC_021072. 347 QKKLYKALNVPSSRLETETTFN-IGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQF 425 (533) Q Consensus 347 ~~kLy~aL~VP~sRl~~~~~~~-~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~ 425 (533) .+.+|...++|- +..++ |. --.|..|..-......-+.+.+..|...+.++++.=+-+.|+ .-+| ..|.+ T Consensus 330 ~~~I~~~s~~p~--~~~~~-~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~--~~d~----~~i~i 400 (468) T protein:vir:96 330 RDYVIEFGQGVD--FQQDK-FGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKL--SIKV----QDVEI 400 (468) T ss_pred HHHHHHHhCccc--ccccc-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Cccc----ceeeE Confidence 888999999983 33222 21 112222333333334446777777888887777765555554 1233 34677 Q ss_pred EEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCC Q lcl|NC_021072. 426 DFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMD 505 (533) Q Consensus 426 ~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~ 505 (533) .|...---.+...++ ++.++ | .+|.+++++. |...++ .+++.++|++|.....-. ++.-++ T Consensus 401 ~f~~~~p~d~~e~a~-------~~~~~----g-~iS~et~i~~-l~~v~D-~~~E~~ri~~E~~~~~~~-----~~~~~~ 461 (468) T protein:vir:96 401 TFNFNVMVNELEQSQ-------IGVNS----Q-YLSKETVVTN-HPWVDD-PVAEMERIDQEELALPSI-----EEGLNG 461 (468) T ss_pred EecCCCCcCHHHHHH-------HHHhc----C-CCchHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHH-----hhccCC Confidence 776444333332222 33332 3 6899999977 555432 445566666665543211 111111 Q ss_pred CCCCCCC Q lcl|NC_021072. 506 PGNAPPA 512 (533) Q Consensus 506 ~~~~~~~ 512 (533) .++..|. T Consensus 462 ~~~~~~~ 468 (468) T protein:vir:96 462 KENNEPT 468 (468) T ss_pred CCCCCCC Confidence 1222222 No 118 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=96.95 E-value=0.00024 Score=40.42 Aligned_cols=417 Identities=11% Similarity=0.110 Sum_probs=170.5 Q ss_pred ceeeeccccc-cccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcch----------------- Q lcl|NC_021072. 7 GFSLERAKKV-PKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPE----------------- 68 (533) Q Consensus 7 g~~i~~~~~~-~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pE----------------- 68 (533) =|.+.+-+.- +...-.++.=...+. .... .........+.+ +.+|+.+..+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~----~i~~~i~~~~~~---~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~ 71 (474) T protein:vir:95 1 MFNIIRMPWDKPYGEEVVEQLKPQFE--TQEE----MIIRLIDDHRKQ---LDKITVGQRYYDKDNDIVKQMKKVDVYGN 71 (474) T ss_pred CcceeecCCCCchhhHHHHhhhhccC--ChHH----HHHHHHHHHHHH---HHHHHHHHHHhcccCchhccccccccccc Confidence 1111110000 000000000000000 0000 000000011111 122222222211 Q ss_pred ----------hhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCcee Q lcl|NC_021072. 69 ----------CDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLF 138 (533) Q Consensus 69 ----------vd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~ 138 (533) +-+-...||+-.. .=.-+.|+.+..++ +...+.+ +.++. =+|+....+.++.+++-|+-| T Consensus 72 ~~~~~~~~ki~~n~~~~Ivd~~~-~~l~g~p~~~~~~d----~~~~~~l----~~~~~-n~~~~~~~e~~~~~~~~G~~~ 141 (474) T protein:vir:95 72 IDYDKPDWRITTNFHQNLVDQKV-SYVASKPVTYSCED----ESVLKII----HDVLD-TRWDNKLIDILTATSNKGIDW 141 (474) T ss_pred cccccccceeccchHHHHHHHHH-hhhccCCceeccCc----hHHHHHH----HHHHh-ccHHHHHHHHHHHHhhcCcEE Confidence 1122222333211 11235677776654 3333333 33332 268889999999999999999 Q ss_pred eeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCcee---EEeccceeeccchhceeccccccc-c----------- Q lcl|NC_021072. 139 YHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLR---GEDINTQLTQKAAEYYLYNPKGLK-N----------- 203 (533) Q Consensus 139 ~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~---~~~~~~~~~~~~~e~~~y~p~~~~-~----------- 203 (533) .+.-+| .+|-..+..++|+.+-+|..-.. ..+... ++..... ...-+|.+.... + T Consensus 142 ~~v~~d----~~~~~~i~~~~p~~~~~v~d~~~-~~~~~~~i~~~~~~~~-----~~~~~y~~~~~~~~~~~~~~~~~~~ 211 (474) T protein:vir:95 142 LQVYIN----ENGEMKLFRVPAEQAIPIWVDKE-REELKSFIRYYKFNNE-----EKVEFWTDTTVTYYVLENGGLIPDY 211 (474) T ss_pred EEEEec----CCCceEEEEEcccceEEEEcCCC-CCceEEEEEEEEEcCe-----eEEEEEeCCeEEEEEEcCCcccccc Confidence 886665 34567899999999977643211 111111 1111111 111122221110 0 Q ss_pred -----------ccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCC Q lcl|NC_021072. 204 -----------STNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVG 271 (533) Q Consensus 204 -----------~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLr-m~EDalVIyRi~RAPeRrvfyIDvG 271 (533) .++..-+||.-.+ .++....|=++..+.....+. ++-+....-+.++.|-+-+.-.+.. T Consensus 212 ~~~~~~~~~~~~~~~~g~iPvv~~---------~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~ 282 (474) T protein:vir:95 212 YYGANHIQSHFSNGNWGRVPFIAF---------KNNPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQ 282 (474) T ss_pred ccCcccccccccccCCCccceEee---------cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcc Confidence 0111112221111 123345566777666666654 4555555667777776554433222 Q ss_pred CCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchH-HHHHHHHHHH Q lcl|NC_021072. 272 NLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGEL-EDVKYFQKKL 350 (533) Q Consensus 272 nlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei-~DV~YF~~kL 350 (533) ... . ...-+..+ +. +++ +++. +++.|-...+++.. .-+.-+.+.+ T Consensus 283 ~~~-----~-~~~~~~~~--~~---------------------i~~---~~~~--~~~~l~~~~~~~~~~~~~~~l~~~i 328 (474) T protein:vir:95 283 DLE-----E-FMRGLKYY--KA---------------------INV---DGDG--GVETIQVEVPVSSTKEYIDLMRAYI 328 (474) T ss_pred cch-----h-hhhhhhcc--ce---------------------eec---cCCC--ceeEEeecCCHHHHHHHHHHHHHHH Confidence 111 0 00001000 01 111 1222 24444433444433 3346677888 Q ss_pred HHhcCCCccccCCCCc-ccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEec Q lcl|NC_021072. 351 YKALNVPSSRLETETT-FNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIA 429 (533) Q Consensus 351 y~aL~VP~sRl~~~~~-~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~ 429 (533) |....+|- +..++. -+. .|..|..-+..-..-+.+.+..|...+.++++.=+-+-|+ ..+| ..|.+.|.. T Consensus 329 ~~~s~~p~--~~~~~~~~n~-Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~--~~d~----~~i~v~f~~ 399 (474) T protein:vir:95 329 MEFGQGVD--FQTDKFGSAP-SGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNL--KMDV----KDIEISFNF 399 (474) T ss_pred HHHhCCcc--cccccccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Cccc----ceeeEEecc Confidence 98999983 222221 111 2233444433344446777777777777777654444443 2233 446777754 Q ss_pred cchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCC Q lcl|NC_021072. 430 DNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNA 509 (533) Q Consensus 430 Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~ 509 (533) .---.+. +.++++.++ | .+|.+++++. |..+++ -+++.++|++|.....-..+... ....+. T Consensus 400 ~~p~d~~-------e~a~~~~~~----g-~iS~et~i~~-l~~v~d-~~~E~~ri~~E~~~~~~~~~~~~----~~~~d~ 461 (474) T protein:vir:95 400 NRMMNDA-------EQSQIIAQS----Q-YLSRETLVKS-SPLVDD-YKAELERIEQEQMEYNKQLPNLD----DGGADG 461 (474) T ss_pred CCCcCHH-------HHHHHHHhc----C-CCchHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHhcccccc----cccCCC Confidence 3332232 233344442 4 5899999986 565432 22344455555433211111100 000111 Q ss_pred CCCCCcccccccc Q lcl|NC_021072. 510 PPADDMSAQEGPA 522 (533) Q Consensus 510 ~~~~d~~~~~~~~ 522 (533) ...++....+.|. T Consensus 462 ~~~~~~~~~~~~~ 474 (474) T protein:vir:95 462 AQQQERSNDKESE 474 (474) T ss_pred CcCCCCCccCCCC Confidence 1111111111111 No 119 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=96.85 E-value=0.0003 Score=39.89 Aligned_cols=393 Identities=13% Similarity=0.092 Sum_probs=173.6 Q ss_pred CCc--cccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhc Q lcl|NC_021072. 1 MSN--QLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVN 78 (533) Q Consensus 1 ~~~--~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvn 78 (533) |.. .+++++ ..+. . .. .+.......++.+ ..... ..+.....+|.|.+||+-|.+ T Consensus 1 Mg~f~~~~~~~-----~~~~-~-~~----~~~~~~~~~~~~~---~~~~~---------~~~~~~~~~~~v~~~i~~ia~ 57 (406) T protein:vir:95 1 MGLFDRWRRTK-----RKSK-I-RA----DTGYVGLFMSGED---VSFLV---------PGYVRLSDNPEVRMAVHKIAD 57 (406) T ss_pred Ccchhhhcccc-----cccc-c-cc----cchhhhhhccCcc---cCccc---------cCHHHHhhcHHHHHHHHHHHH Confidence 432 232221 1111 1 00 1111111111111 00000 012344678999999999998 Q ss_pred ceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHh----cchhhhhHHHHh----hhhcCc--eeeeeeecCCCC Q lcl|NC_021072. 79 ETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLL----DFENRSYEIFRR----WYVDGR--LFYHKVIDPKNP 148 (533) Q Consensus 79 eaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL----~f~~~~~~~fR~----WYvDGr--i~~hkvid~~~~ 148 (533) .+-- .|+.+--..-+..+.+ ...+..+| +-.-.+.++++. ++..|. .|..++-+ . T Consensus 58 ~ia~-----~~~~~~~~~~~~~~~~-------~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~---~ 122 (406) T protein:vir:95 58 LISS-----MTIYLMQNTEDGDIRI-------RNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYT---A 122 (406) T ss_pred hhcc-----CceEEEEecCCcceee-------cchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEEC---C Confidence 8653 3444321111111111 12333333 223355555554 455654 45555544 3 Q ss_pred CCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCC Q lcl|NC_021072. 149 RGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNK 228 (533) Q Consensus 149 ~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~ 228 (533) ++-+++|.+++|.+++.+... ++.++.. .+..+.+.+.+++..... ..++ T Consensus 123 ~g~~~~l~~i~~~~v~~~~~~-----~~~~~~~-----------------~~~~~~~~evih~~~~~~--------~~~~ 172 (406) T protein:vir:95 123 DGLIDELVPLTPSKVNFLDTP-----DGYQVLY-----------------GGQTFNYDEVLHFIYNPD--------PERP 172 (406) T ss_pred CCcEEEEEEEcCceeEEEEcC-----CeEEEEe-----------------ccEEEchhHEEEeeccCC--------CCCC Confidence 556999999999999763332 1111111 111112223333332111 1122 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCcccccccc Q lcl|NC_021072. 229 NMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKF 308 (533) Q Consensus 229 ~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~ 308 (533) -.++|.+..|..++.....++....-+----+.-+-+..++. .+.+..+++..+.+..+|+.-. . .|. . T Consensus 173 ~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~-~l~~e~~~~~~~~~~~~~~g~~---n-~~~------~ 241 (406) T protein:vir:95 173 YIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDA-ATAELSSEEGRNAVFKKYLQAT---E-AGQ------P 241 (406) T ss_pred ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCCHHHHHHHHHHHHHHhcccc---c-cCC------c Confidence 345799999999999998888888777666677777777774 5777788777777666664211 0 010 0 Q ss_pred chhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHH Q lcl|NC_021072. 309 MSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIAR 388 (533) Q Consensus 309 msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~r 388 (533) + +++ -+|..-++++.+.. ..+.-++-.++....++++++||...|+..+ .. |-.+..|+.. T Consensus 242 ~-v~~------~~~~~~~~~~~~~~-~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~------~~-----~~~~~~~~~~ 302 (406) T protein:vir:95 242 W-IIP------AELLEVEQVKPLSL-KDIAINEAVELDKRTVAGMFGVPAFLLGIGE------FN-----RDEYNNFINS 302 (406) T ss_pred e-eec------CCCccccccccCCh-hHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC------ch-----HHHHHHHHHH Confidence 0 000 01111233333322 2233345567888999999999999885211 11 1122233222 Q ss_pred -HHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHH Q lcl|NC_021072. 389 -LRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRR 467 (533) Q Consensus 389 -Lr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k 467 (533) |+- +...+.+.|-..|+ ++.+ +.+.|. +.++.... ...|.+.+..+-.- -+++.+-++. T Consensus 303 ~l~P-~~~~ie~~l~~~l~-----~~~~-------~~~~fd----~~~l~~~d-~~~~~~~~~~l~~~--G~~t~NE~R~ 362 (406) T protein:vir:95 303 TILP-IAKGIEQELTRKLL-----ISPD-------LYFKFN----PRSLYAYD-LKELAEVGSNMYVR--GIMEGNEVRD 362 (406) T ss_pred HHHH-HHHHHHHHHHHhcC-----CCCC-------cEEEee----chhhhcCC-HHHHHHHHHHHHhC--CCcCHHHHHH Confidence 222 22333444444443 3333 245554 33333222 23466666555332 3667777764 Q ss_pred HHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccccccCCccccchhc Q lcl|NC_021072. 468 QVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDAKRGE 532 (533) Q Consensus 468 ~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 532 (533) .|++..- +.. +..+...+ +.+-+.. .+ .....++.++++.-+.| T Consensus 363 -~~gl~p~--~~g---------d~~~~~~n------~~~~~~~--~~-~~~~k~g~~~~~~~~~~ 406 (406) T protein:vir:95 363 -WLGLSPK--EGL---------SELVILEN------YIPLDKI--GD-QSKLKGGDNSGADGQTD 406 (406) T ss_pred -HhCCCCC--CCc---------ceeeeccC------ccchhhc--cc-ccccCCCCCCCCCCCCC Confidence 3566431 110 11111000 0000000 00 00000001111111111 No 120 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=96.82 E-value=0.00032 Score=39.78 Aligned_cols=440 Identities=10% Similarity=0.035 Sum_probs=171.2 Q ss_pred ceeeeccccccccCCCCCCCCCccccee-----ecccccccccchhh---hhhHHHHHHH--------HHHhhhhcchhh Q lcl|NC_021072. 7 GFSLERAKKVPKGPSFVQKDSMDGSQPI-----VGGGYYGYSVDFDG---TVRNEYELIT--------RYREMVLQPECD 70 (533) Q Consensus 7 g~~i~~~~~~~~~~s~~~~~~~dg~~~~-----~~~~~~~~~~~~~~---~~~~~~~LI~--------~YR~m~~~pEvd 70 (533) =+++++.. .....-|...- ....|. +...... ...-...+|+ +|+.+..+.+-. T Consensus 1 ~~~~~~~~---------~~~~~~~~~~~~~~~~~n~~~~-~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~ 70 (511) T protein:vir:93 1 MLKVNEFE---------TDTDLRGNINYLFNDEANVVYT-YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGK 70 (511) T ss_pred Cccccchh---------hhhhhhhhhhhhhhhhhCCccc-ccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhccc Confidence 11121110 00000111100 011111 1100000 0111112222 344443332222 Q ss_pred hH----------------------HHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHH Q lcl|NC_021072. 71 SA----------------------VDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIF 128 (533) Q Consensus 71 ~A----------------------vdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~f 128 (533) .. ..-||+... .=.-+.|+.+.+++ +. ..+.+..+++.-+|+....++. T Consensus 71 ~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~-~yl~g~p~~~~~~d----~~----~~~~l~~~~~~n~~~~~~~~~~ 141 (511) T protein:vir:93 71 TKNLVELTRRKEEYMADNRVAHDYASYISDFIN-GYFLGNPIQYQDDD----KD----VLEVIEAFNDLNDVESHNRSLG 141 (511) T ss_pred CccccccCcCcccccCcceeecchHHHHHHHHh-hhhcccCeeeccCC----hH----HHHHHHHHHhhcCHhHHHHHHH Confidence 11 122222211 11236777776554 22 2344566776778999999999 Q ss_pred HhhhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCcee---EEecc--ceee-ccchhceeccccccc Q lcl|NC_021072. 129 RRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLR---GEDIN--TQLT-QKAAEYYLYNPKGLK 202 (533) Q Consensus 129 R~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~---~~~~~--~~~~-~~~~e~~~y~p~~~~ 202 (533) +...+-|+.|.+..+|. .|-..+..+||+.+-+|...... ..... ++... .... ....-.-+|.+.... T Consensus 142 ~~~~~~G~ay~~vy~de----~~~~~i~~~~p~~~~~vydd~~~-~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~ 216 (511) T protein:vir:93 142 LDLSIYGKAYELMIRNQ----DDETRLYKSDAMSTFVIYDNTIE-RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVY 216 (511) T ss_pred HHHHhcCeeEEEEEeCC----CCceEEEEEccceeEEEEcCCCC-CceEEEEEEEEeeeccccccceEEEEEEEeCCcEE Confidence 99999999999877753 45678999999999776443211 11111 11110 0000 000111134443221 Q ss_pred c------------------ccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccc Q lcl|NC_021072. 203 N------------------STNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPER 263 (533) Q Consensus 203 ~------------------~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm-~EDalVIyRi~RAPeR 263 (533) . .+|.--+||.= .| +++....|-++..+.....+.. +=+....-+-++.|-+ T Consensus 217 ~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv--~~-------~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l 287 (511) T protein:vir:93 217 RYLTSRTNGLKLTPRENGFESHSFERMPIT--EF-------SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAML 287 (511) T ss_pred EEEecCCCccccccccccccccCCCccceE--Ee-------cCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcce Confidence 0 01111122211 11 1233455667766666555542 2233333344555544 Q ss_pred eEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccc-cccccchhHh-hhc--ccccCCCCccceeecCCCCCcch Q lcl|NC_021072. 264 RIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIK-DDKKFMSMLE-DFW--LPRREGGRGTEISTLPGGQNLGE 339 (533) Q Consensus 264 rvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~-~d~~~msmlE-Dyw--LpRReggrgTEIsTLpGg~nLge 339 (533) -+.=..... .++++ +....+-.+. ..| ...-....|..+..|-...+... T Consensus 288 v~~G~~~~~--------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 341 (511) T protein:vir:93 288 LIKGNLNLD--------------------------PVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQG 341 (511) T ss_pred eeecCcccC--------------------------chhhcccccccceecccccccccccccCCCCcceeEEeecCCHHH Confidence 333211000 01111 1111111111 111 11112223344555544333322 Q ss_pred -HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhh Q lcl|NC_021072. 340 -LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDE 418 (533) Q Consensus 340 -i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~ 418 (533) -.-+.-..+.+|+-.++|---.+.-+| |. .|..|..-...-..-+.+.++.|..-+.+.++.=+-+-++....++.. T Consensus 342 ~~~~~~~L~~~I~~~s~~P~~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~ 419 (511) T protein:vir:93 342 TEAYKDRLNSDIHMFTNTPNMKDDNFSG-TQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANK 419 (511) T ss_pred HHHHHHHHHHHHHHHhCCcccccccccc-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc Confidence 233344567777888888532222111 11 122233332233334555566666666555544222212222222322 Q ss_pred hhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC---CCC Q lcl|NC_021072. 419 MKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGL---IVD 495 (533) Q Consensus 419 ~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~---~~~ 495 (533) -...+.+.|...---.+ .+.++++..+. | .+|.++++.. |...++ .+++.++|++|..... ... T Consensus 420 d~~~i~~~f~~~~p~n~-------~e~~~~~~kl~---g-~iS~et~~~~-l~~v~d-~~~E~~ri~~E~~~~~~~~~~~ 486 (511) T protein:vir:93 420 DFNTVRYVYNRNLPKSL-------IEELKAYIDSG---G-KISQTTLMSL-FSFFQD-PELEVKKIEEDEKESIKKAQKG 486 (511) T ss_pred ccccceEEeCCCCCCCH-------HHHHHHHHHHh---c-cCchHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHHHhhh Confidence 23347788864322222 23455556653 4 4899999976 555441 2233344444443221 000 Q ss_pred CCcccccCCCCCCCCCCCCccccccccCCccccchh Q lcl|NC_021072. 496 PMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDAKRG 531 (533) Q Consensus 496 p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 531 (533) +.....+..+..+++..++.+. +++ T Consensus 487 ~~~~~~~~~~~~~~~~~~~~~~-----------~~~ 511 (511) T protein:vir:93 487 IYKDPRDINDDEQDDDTKDTVD-----------KKE 511 (511) T ss_pred cccCCCCCCCCCCCCccccccc-----------ccC Confidence 1111111111111111111111 111 No 121 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=96.75 E-value=0.00037 Score=39.42 Aligned_cols=403 Identities=16% Similarity=0.192 Sum_probs=171.3 Q ss_pred CC-cc---ccceeeeccccccccCCCCCC--CCCccccee-----ecccccccccchhhhhhHHHHHHHHHHhhhhcchh Q lcl|NC_021072. 1 MS-NQ---LFGFSLERAKKVPKGPSFVQK--DSMDGSQPI-----VGGGYYGYSVDFDGTVRNEYELITRYREMVLQPEC 69 (533) Q Consensus 1 ~~-~~---~fg~~i~~~~~~~~~~s~~~~--~~~dg~~~~-----~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEv 69 (533) |- .+ +||. + +..+.++ .+..|..+. ....+++...+....+. -...+++|-| T Consensus 1 ~~~~~~~~~~~~-~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~--------~~~al~~~~V 63 (432) T protein:vir:10 1 MPDEKKLGLLGQ-L--------KAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVN--------ADAIMRLDAV 63 (432) T ss_pred CCCCcccchhhh-h--------HhhcCCccccccccccccccCcchhhhhcccccccCcccc--------hhhhhcchHH Confidence 11 11 1221 0 0011111 011110000 00001111111111111 1234577999 Q ss_pred hhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcch----hhhhHHH----HhhhhcCceeeee Q lcl|NC_021072. 70 DSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFE----NRSYEIF----RRWYVDGRLFYHK 141 (533) Q Consensus 70 d~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~----~~~~~~f----R~WYvDGri~~hk 141 (533) ..||+-|.+.+-- .|+.|--...+..+ +.+ -.-++++|+.. ..+.++. ..+++.|.-|..+ T Consensus 64 ~~~i~~Ia~~ia~-----lp~~~y~~~~~g~~---~~~---~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~ 132 (432) T protein:vir:10 64 AACVKLVSQAIAA-----MPLTMYMRTPDGRK---EAV---NHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRK 132 (432) T ss_pred HHHHHHHHHhhhh-----CceeEEEecCCCcc---ccc---ccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEE Confidence 9999999887542 45554322211111 111 12345555432 3444444 4577889998886 Q ss_pred eecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhcccc Q lcl|NC_021072. 142 VIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHS 221 (533) Q Consensus 142 vid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hs 221 (533) +-+ .+.+++|.+|+|..++.++... +...+. ++......+.+ .....++|..- T Consensus 133 ~~~----~g~~~~L~~l~~~~v~v~~~~~-----g~~~y~----~~~~~g~~~~~-------~~~~iih~~~~------- 185 (432) T protein:vir:10 133 VVT----DGRIESLQYLANDRLTITTDTK-----GNTAYR----YRRTDGQMIDI-------PKQQIWKIMGY------- 185 (432) T ss_pred Eec----CCcEEEEEEEcCCceEEEEcCC-----CcEEEE----EEecCceEEEE-------cCccEEEecCC------- Confidence 653 3579999999999997644321 111110 00000111111 12222333211 Q ss_pred ccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCc Q lcl|NC_021072. 222 GIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGE 301 (533) Q Consensus 222 Gl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGe 301 (533) ..++-.++|.|..|.+++......++...=+=---+.-.-|..+| +.|-+...++..+. ++... ..|. T Consensus 186 ---~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~~~~~~-~~~~~-------nag~ 253 (432) T protein:vir:10 186 ---SLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQID-RFLTDDQYDSFAKK-VSGSV-------EAGR 253 (432) T ss_pred ---CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecC-CCCCHHHHHHHHHH-Hhhhh-------hCCC Confidence 122334679999999999887777775443222223345566655 45555544443322 11111 1121 Q ss_pred cccccccchhHhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCCC--cccccch-hhh Q lcl|NC_021072. 302 IKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGE---LEDVKYFQKKLYKALNVPSSRLETET--TFNIGRA-AEI 375 (533) Q Consensus 302 v~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~~--~~~~g~~-~eI 375 (533) .+ +++ + |++++.|. .+..+ ++-.+|....+.++++||...|+... .++.|.. ++. T Consensus 254 ------~~-vl~-------~---g~~~~~l~--~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~ 314 (432) T protein:vir:10 254 ------AP-LLE-------G---GMDVKSLG--LNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQ 314 (432) T ss_pred ------ce-ecC-------C---CceEEEcc--CChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHH Confidence 11 221 2 34555552 22333 33456888899999999999997543 3433332 222 Q ss_pred hHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_021072. 376 TRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPY 455 (533) Q Consensus 376 tRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~ 455 (533) .+. |..+ .|+--+. .+.+.|-..| +++.++. ...++|.. .++.... +..|.+.++.+-. T Consensus 315 ~~~---f~~~--tl~P~~~-~ie~~ln~kL-----~~~~~~~----~~~~~fd~----~~ll~~d-~~~r~~~~~~~~~- 373 (432) T protein:vir:10 315 QLG---FLSM--TLSPWLR-RIEQSIALNL-----LSPAERR----RYFADFDT----SALLRAD-SAARSSYYSQLVN- 373 (432) T ss_pred HHH---HHHH--HHHHHHH-HHHHHHHhhh-----cCccccC----ceEEEeec----hhhhccC-HHHHHHHHHHHHh- Confidence 333 5543 3333222 2333344443 3444432 35666763 3433222 3457777766632 Q ss_pred ccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCC----CCCCCCccccccccCCccccc Q lcl|NC_021072. 456 VGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGN----APPADDMSAQEGPAVDAGDAK 529 (533) Q Consensus 456 vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~----~~~~~d~~~~~~~~~~~~~~~ 529 (533) .-++|.+-++. .+++..-+ ..+ ..+. .+....|-...+. .+..++.+++ ++.+.+ T Consensus 374 -~G~~T~NE~R~-~~glppi~--g~~---------~~~~-~~~~~~pl~~~~~~~~~~~~~~~~~~~-----~~~~~~ 432 (432) T protein:vir:10 374 -NGLMTRDEARE-IEGLPKLG--GNA---------AVLT-VQSAMVPLDSIGLQASPEPASGLGNQQ-----QDKVSK 432 (432) T ss_pred -CCCCCHHHHHH-HhCCCCCC--CCc---------ceEe-ecCcccchhhhcccCCCCCCCCCCCcc-----cccccC Confidence 24677777774 36664311 000 0000 0000001000000 0001111111 000001 No 122 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=96.66 E-value=0.00043 Score=39.03 Aligned_cols=416 Identities=11% Similarity=0.066 Sum_probs=173.4 Q ss_pred CCCCCcccceeecccccccc-cchhhh-----hhHHHHHHHHHHhhhhcchhh--------------------------- Q lcl|NC_021072. 24 QKDSMDGSQPIVGGGYYGYS-VDFDGT-----VRNEYELITRYREMVLQPECD--------------------------- 70 (533) Q Consensus 24 ~~~~~dg~~~~~~~~~~~~~-~~~~~~-----~~~~~~LI~~YR~m~~~pEvd--------------------------- 70 (533) --+...+....-...+.... ..+... +..-...+.+|+.+..+.+-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~ 80 (472) T protein:vir:93 1 MYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMIT 80 (472) T ss_pred CCCCCCcchhhhhceeeecCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhcccccccccccccccc Confidence 11111111222222222111 001000 011122233444443332221 Q ss_pred hHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCC Q lcl|NC_021072. 71 SAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRG 150 (533) Q Consensus 71 ~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~ 150 (533) +-...||+-.. .=.-+.|+.+..++.+ .. +.++.++. -+|+....+.++.+++-|+-|.+.-+| ++ T Consensus 81 n~~~~ivd~~~-~~l~g~~~~~~~~d~~----~~----~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d----~d 146 (472) T protein:vir:93 81 NFHANLVDQKV-SYIVGKPIAFKHTDDE----VV----KRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLD----EE 146 (472) T ss_pred chHHHHHHHHh-hhhcccCeeeccCChH----HH----HHHHHHHh-ccHHHHHHHHHHHHhhcCeEEEEEEEC----CC Confidence 11112222211 1113566677655532 22 22333333 268899999999999999998886665 34 Q ss_pred CeEEEEEcChhhceehhhccCCCcCcee---EEeccc----ee-eccchhceeccccccc-------------cccCCcc Q lcl|NC_021072. 151 GLTELRYIDPRKIRKVTEYQQKRPEQLR---GEDINT----QL-TQKAAEYYLYNPKGLK-------------NSTNQGM 209 (533) Q Consensus 151 gI~elr~lDP~~i~~vr~~~~~~~~~~~---~~~~~~----~~-~~~~~e~~~y~p~~~~-------------~~~~~~~ 209 (533) |-..+..+||+.+-++..-.. ...... ++.... ++ +.....+|.+...... ..++.-- T Consensus 147 ~~~~i~~~~p~~~~~i~d~~~-~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (472) T protein:vir:93 147 GEFKLFRVPAEQGIPIWTDKE-HEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWG 225 (472) T ss_pred CceEEEEEcccceEEEEcCCC-CCceEEEEEEEEeecceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCC Confidence 567899999999977643211 111111 111100 00 0011111222211110 0111111 Q ss_pred eeccchhhccccccccCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHh Q lcl|NC_021072. 210 KIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGR 288 (533) Q Consensus 210 kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLr-m~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~ 288 (533) +||.-.+. ++....|=++..+....-+. ++=+....-+.+.-|-+-+.-.+.-.. .+..+. +. T Consensus 226 ~vPvv~~~---------nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~-----~~~~~~-~~- 289 (472) T protein:vir:93 226 KIPFIPFK---------NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQEL-----PEFKRL-LR- 289 (472) T ss_pred CcceEEec---------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccc-----hhhHHH-Hh- Confidence 22221111 12334455655444444333 445555556666666444432221111 111111 11 Q ss_pred cccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcc Q lcl|NC_021072. 289 YRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTF 367 (533) Q Consensus 289 ~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~ 367 (533) .++..--++ +..+.+|-...+.+. ..-+.-+.+.+|+..++|---++.-++ T Consensus 290 -------------------------~~~~~~~~~--~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~- 341 (472) T protein:vir:93 290 -------------------------YYGAIKVSD--NGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGS- 341 (472) T ss_pred -------------------------hccccccCC--CCcceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCcccccc- Confidence 111111111 123454433333332 234666777888888888432222111 Q ss_pred cccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHH Q lcl|NC_021072. 368 NIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMN 447 (533) Q Consensus 368 ~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~ 447 (533) |. .|..|.--+.....-+.+.++.|...+.++++.=+-+-|+ ..+|. .|.+.|....-=.+. +.++ T Consensus 342 n~-Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~--~~~~~----~i~v~f~~~~p~~~~-------~~~~ 407 (472) T protein:vir:93 342 AP-SGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI--KGEHK----DVDISFNYNKVANTE-------LQVQ 407 (472) T ss_pred Cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Ccccc----eeeEEeCCCCCCCHH-------HHHH Confidence 11 2223444445555667888888888888888764444453 34554 366777532221222 2344 Q ss_pred HHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccccccCCccc Q lcl|NC_021072. 448 QVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPAVDAGD 527 (533) Q Consensus 448 ~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~ 527 (533) ++..+ +| .+|.+++++.+-..+| -+++.++|++|..+..-..+.. ++.+..+.++.++.+.. T Consensus 408 ~~~k~---~g-iis~et~l~~l~~~~d--~~~E~~ri~~E~~~~~~~~~~~---~~~~~d~~~~~~~~~~~--------- 469 (472) T protein:vir:93 408 TAQQS---MG-IVSHETVLENHPFVED--LQAELERIEQEQMEYNKQLPNL---DDGGADGAQQQERSNNK--------- 469 (472) T ss_pred HHHHH---hc-cCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHhccCc---CcccCCCCCCCCCCCcc--------- Confidence 55555 35 4899999987433443 2333344444443322111110 11111111111111111 Q ss_pred cchhc Q lcl|NC_021072. 528 AKRGE 532 (533) Q Consensus 528 ~~~~~ 532 (533) +.| T Consensus 470 --~~e 472 (472) T protein:vir:93 470 --ESE 472 (472) T ss_pred --cCC Confidence 111 No 123 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=96.59 E-value=0.00049 Score=38.73 Aligned_cols=448 Identities=17% Similarity=0.217 Sum_probs=171.6 Q ss_pred CCccccceeeecc---ccccccCCCCCCCC--Ccccceeeccccc-------ccccchhhhhhHHHHH--------HHHH Q lcl|NC_021072. 1 MSNQLFGFSLERA---KKVPKGPSFVQKDS--MDGSQPIVGGGYY-------GYSVDFDGTVRNEYEL--------ITRY 60 (533) Q Consensus 1 ~~~~~fg~~i~~~---~~~~~~~s~~~~~~--~dg~~~~~~~~~~-------~~~~~~~~~~~~~~~L--------I~~Y 60 (533) -.++-.--.+-+- .-++-++.+-|.++ .--..++..++|. .|..++.+.....-.+ +..- T Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~l 116 (695) T protein:vir:36 37 AAAQPVPADFARRGALNALDAAPVVEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTL 116 (695) T ss_pred ccccccchhhhhcccccccccccccCCCcccccceeceecccccCccccchhhhhhcccccccccchhhhccCcchHHHH Confidence 0000000000000 00111111111110 0001111111111 1111111111111111 2334 Q ss_pred HhhhhcchhhhHHHHhhcceeee------cCCCc----eEEEEeccCCCcH-HHHHHHHHHHHHHHHHhcchhhhhHHHH Q lcl|NC_021072. 61 REMVLQPECDSAVDDIVNETICG------NFDDV----PVEVELSNLKQSD-KIKKLIREEFAEILRLLDFENRSYEIFR 129 (533) Q Consensus 61 R~m~~~pEvd~AvdeIvneaiv~------d~~~~----~v~v~l~~~~~S~-~ik~~I~eeF~~i~~lL~f~~~~~~~fR 129 (533) -.|+|+||+++++.=|+.||+-. ..... -+.+.-+..+.++ .-.++|..|++. |++..+..+.++ T Consensus 117 a~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~er----L~V~~~l~eaik 192 (695) T protein:vir:36 117 VLLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIER----LRIRDAVRTTVI 192 (695) T ss_pred HHHhhccchhhHHHHHHHHhhcccceecccchhhhhhccccccccccccCchHHHHHHHHHHHH----HHHHHHHHHHHH Confidence 56899999999999999999642 11111 0222222222222 233556777663 333333333333 Q ss_pred hhhhcCceeeeeeecCC--------------CCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhcee Q lcl|NC_021072. 130 RWYVDGRLFYHKVIDPK--------------NPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYL 195 (533) Q Consensus 130 ~WYvDGri~~hkvid~~--------------~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 195 (533) -=-+-|.-.....|+.. -.|+.++.|+.|||..+.+-- .. +...... .. T Consensus 193 ~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~-~n--------~~dP~sp--------df 255 (695) T protein:vir:36 193 HDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNN-YN--------SINPVAD--------DF 255 (695) T ss_pred hhccccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccch-hh--------hccchhh--------cc Confidence 22222222222223221 236678889999998885511 00 0011111 12 Q ss_pred ccccccccccCCcceeccchhhccc-----cccccCCCCccchhHHHHHHHHHH-HHHHHHHHHHHHHhcCccceEEEcc Q lcl|NC_021072. 196 YNPKGLKNSTNQGMKIATDSVTYCH-----SGIQDLNKNMTLSHLHKAIKAVNQ-LRMIEDSLVIYRLSRAPERRIFYID 269 (533) Q Consensus 196 y~p~~~~~~~~~~~kI~~dai~y~h-----sGl~d~~~~~i~syL~~AiK~~Nq-Lrm~EDalVIyRi~RAPeRrvfyID 269 (533) |.|....-. +.+|+.+-+.--. --|...-...++|.+..+..-..+ +++...+.=+ +.++.-+- +-.| T Consensus 256 gkP~~y~V~---G~kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~L--i~~~~v~~-lk~d 329 (695) T protein:vir:36 256 YKPSTWWMI---GTEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDI--VKQFSVSG-ILMD 329 (695) T ss_pred CCCceEEEe---ceEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHH--HHhhhHHH-HHHH Confidence 333221111 1122211110000 001111122245655555544333 3333332211 11111110 0112 Q ss_pred CC-CCchHHHHHHH--HHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHHH-H Q lcl|NC_021072. 270 VG-NLPKNKAEQYL--REVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVK-Y 345 (533) Q Consensus 270 vG-nlpk~KAeqYl--~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~-Y 345 (533) .. -|.....++.. -+++++||+-. |-+--|+ =.|+|- +. ..+|+-++||. = T Consensus 330 la~aL~~g~~~~l~~R~eli~~~Rsn~------G~~llDk----~~Eefe-------------q~--stslSGLddVi~q 384 (695) T protein:vir:36 330 LAQALMPGANVDLSMRAELINRYRDNR------NILFLDK----ATEEFF-------------QF--NTPLSGLDALQAQ 384 (695) T ss_pred HHHhhcChhHHHHHHHHHHHHHhcCcc------ceEEEec----CCcceE-------------EE--ecccCCHHHHHHH Confidence 10 00111112222 25556665211 1111000 013442 11 24788888875 4 Q ss_pred HHHHHHHhcCCCccccCC--CCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHH-----HHHhccCCCHhHHhh Q lcl|NC_021072. 346 FQKKLYKALNVPSSRLET--ETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKT-----QLILKGVMSLEEWDE 418 (533) Q Consensus 346 F~~kLy~aL~VP~sRl~~--~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~-----qLilkgi~t~eew~~ 418 (533) |..-+=-+.+||+.||=. -.|||--.-+ |.-.|...|..+|. ..+...|++ |+-+-| . T Consensus 385 f~q~VAgaa~IPltkLfGqSPkGlNATGE~----D~rnYYD~I~s~Qe---~~L~p~L~rl~~ii~rS~~G--------~ 449 (695) T protein:vir:36 385 AQEQMSAVSHIPLIKLLGITPTGLNASSEG----EIRVWYDYVRAYQR---NALQQLMNDVIVMIQLSLFG--------A 449 (695) T ss_pred HHHHHHhhhcCchhhhhccCcccccccchh----hHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHhcC--------C Confidence 888888899999999943 3688752222 33348888988885 334444443 333334 3 Q ss_pred hhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCC-CCC Q lcl|NC_021072. 419 MKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIV-DPM 497 (533) Q Consensus 419 ~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~-~p~ 497 (533) +...|.|.|+.=..-+|..-+||...+.+....+-.- | .++.+-|+ ..+..+...+ |- .-+ T Consensus 450 idpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~-g-vI~~~evr---------------~rL~~d~~s~-Y~~~~D 511 (695) T protein:vir:36 450 VDPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQE-Q-VIRPDQVA---------------ARLNTEPDGP-YAGKLD 511 (695) T ss_pred CCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHh-c-CCCHHHHH---------------HHHhcCCCcc-cccccc Confidence 5567999999877788888888888887765543222 1 12222222 2222221111 21 112 Q ss_pred cccccCCCCCC--------------CCCCCC--------------------ccccccccCCccccc-------hhcC Q lcl|NC_021072. 498 AEMDPAMDPGN--------------APPADD--------------------MSAQEGPAVDAGDAK-------RGEF 533 (533) Q Consensus 498 ~~~~~~~~~~~--------------~~~~~d--------------------~~~~~~~~~~~~~~~-------~~~~ 533 (533) ++.+|+.+..+ .+...+ ++.+++++.+..-.- .+++ T Consensus 512 ~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~~~~~~~~ag~~~~~~~aag~v~~~~g~v 588 (695) T protein:vir:36 512 ANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGATAPPTVANVNANVNPREAGAQDAAMRAAGAVYVVDGKV 588 (695) T ss_pred cccCCCcCccchhhhhHhhhcCcccccccCCCCcccccccCCCcccccccccCccccCCCCccceeeEEEEEeCCEE Confidence 22222221110 000000 011111111110000 0000 No 124 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=96.51 E-value=0.00056 Score=38.43 Aligned_cols=412 Identities=15% Similarity=0.144 Sum_probs=179.7 Q ss_pred CCccccceeeeccccccccCCCCCCCCC--ccccee--ecc----cccccccchhhhhhHHHHHHHHHHhhhhcchhhhH Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSM--DGSQPI--VGG----GYYGYSVDFDGTVRNEYELITRYREMVLQPECDSA 72 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~--dg~~~~--~~~----~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~A 72 (533) |+.-|+.. +.+ +.+ ++++.. -|.-.+ ..+ .|.|..+..... + ..+..+++|-|.+| T Consensus 1 ~~~~l~~~-~~~------~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~-------v-~~~~al~~~~V~~~ 64 (434) T protein:vir:43 1 MSKSLGKV-LSS------ATS-APRSSLFGWGGKTIRLTDGAFWSQFLGRESSSGKK-------V-TVDKAMKLSAVWAC 64 (434) T ss_pred Cccchhhh-hhh------ccc-ccchhhhcccccccccCchHHHHHHhcCCccCCce-------e-chhhhhccHHHHHH Confidence 87777542 111 111 111110 000000 011 111111110000 1 23455678999999 Q ss_pred HHHhhcceeeecCCCceEEEEe-ccCCCcHHHHHHHHHHHHHHHHHhcc----hhhhhHHHH----hhhhcCceeeeeee Q lcl|NC_021072. 73 VDDIVNETICGNFDDVPVEVEL-SNLKQSDKIKKLIREEFAEILRLLDF----ENRSYEIFR----RWYVDGRLFYHKVI 143 (533) Q Consensus 73 vdeIvneaiv~d~~~~~v~v~l-~~~~~S~~ik~~I~eeF~~i~~lL~f----~~~~~~~fR----~WYvDGri~~hkvi 143 (533) |+-|.+.+-. .|+.+-- +..+..... .=..++++|+. ...+.++.+ .+.+.|.-|..+.- T Consensus 65 i~~ia~~ia~-----lp~~~~~~~~~g~~~~~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~ 133 (434) T protein:vir:43 65 VRLISTSVAG-----LPLGVYERKADGSRVDA------RSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRR 133 (434) T ss_pred HHHHHHhhhh-----CceEEEEEcCCCccccc------cccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEe Confidence 9999887543 4555421 111111111 11234455533 345666644 46778998877432 Q ss_pred cCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhcccccc Q lcl|NC_021072. 144 DPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGI 223 (533) Q Consensus 144 d~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl 223 (533) + .+-+++|.+|+|..++.++.. ++.. .|+.+...+ ....++.+-|.+.+ ++ T Consensus 134 ~----~G~~~~L~~l~p~~v~~~~~~-----~g~~-------------~y~~~~~~g------~~~~~~~~eVih~~-~~ 184 (434) T protein:vir:43 134 A----AGRPAALDFLLPSRVDLECDE-----NGRL-------------KYFYTTKKG------ARREIERTNMLHIP-AF 184 (434) T ss_pred C----CCcEEEEEEEcCcceEEEEcC-----CCeE-------------EEEEEecCc------eEEEEccccEEEec-Cc Confidence 2 466899999999999754321 1110 111111111 12334444333332 11 Q ss_pred ccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccc Q lcl|NC_021072. 224 QDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIK 303 (533) Q Consensus 224 ~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~ 303 (533) ..++-.++|-+..|+..+.....+++...-+----+--.-|..++ +.|.+.+++ =+++.++++..- ...|.+- T Consensus 185 -~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~-~~r~~~~~~~g~----~nag~~~ 257 (434) T protein:vir:43 185 -TLDGRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVD-RILQPAQRE-EFREYVKSVSGA----MNSGRSP 257 (434) T ss_pred -CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecC-CCCCHHHHH-HHHHHHHHhcCc----cccCCcc Confidence 223345678899999988888877776543332223334455554 456554444 456666554211 1122211 Q ss_pred cccccchhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCC-cccccc-hhhhhHHhh Q lcl|NC_021072. 304 DDKKFMSMLEDFWLPRREGGRGTEISTLPG-GQNLGELEDVKYFQKKLYKALNVPSSRLETET-TFNIGR-AAEITRDEV 380 (533) Q Consensus 304 ~d~~~msmlEDywLpRReggrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~-~~~~g~-~~eItRDEl 380 (533) +++ .|.+++.|.- .+.+.-++-.++..+.+.++++||..-|+... +-+.+. .++..+. T Consensus 258 -------vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~~~-- 318 (434) T protein:vir:43 258 -------VLE----------QGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQMLA-- 318 (434) T ss_pred -------ccC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHHHH-- Confidence 221 2556666632 11222234456778889999999999986543 222222 2333333 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccc Q lcl|NC_021072. 381 KFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYF 460 (533) Q Consensus 381 kF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~ 460 (533) |.+++ |+--+.. +.+.|- ..+++.+++.. ..+.|. +..+.... +..|.+.+..+-.- -++ T Consensus 319 -f~~~~--L~P~~~~-ie~~ln-----~kL~~~~~~~~----~~~~fd----~~~llr~d-~~~r~~~~~~~~~~--G~~ 378 (434) T protein:vir:43 319 -FLTFS--ISSITNQ-IQQCVN-----KRLLTAPERIR----YYAEFS----LEGFLKAD-SAGRAAWYSTMAQN--GFM 378 (434) T ss_pred -HHHHH--HHHHHHH-HHHHHH-----hhcCChhhhcC----ceEEEe----chhhhccC-HHHHHHHHHHHHhC--CCc Confidence 54432 3332221 122222 23456666542 345555 23333222 24566666655322 466 Q ss_pred cHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccC--CCCCCCCCCCCccccccccCCccccchhc Q lcl|NC_021072. 461 SIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPA--MDPGNAPPADDMSAQEGPAVDAGDAKRGE 532 (533) Q Consensus 461 S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~--~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 532 (533) |.+-++. .+++.+- +. .+-.+. |. +.-|- .+....++......+ ..+++.+.+| T Consensus 379 T~NE~R~-~~gl~p~--~g---------gD~~~~-~~-n~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~ 434 (434) T protein:vir:43 379 TRNEGRR-KENLPEL--PG---------GDILTV-QS-NLVPIDQLGQSNKSQAVRAALM----NWFSQPEPQE 434 (434) T ss_pred CHHHHHH-HhCCCCC--CC---------CCeEee-cc-CccchhhhhccCCCcchhhhhh----ccCCCCCCCC Confidence 6666664 3555321 00 000000 00 00000 000111111111100 1111222222 No 125 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=96.48 E-value=0.00058 Score=38.32 Aligned_cols=419 Identities=12% Similarity=0.064 Sum_probs=172.5 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchh----------- Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPEC----------- 69 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEv----------- 69 (533) |-.-=.++...-.++. ..+.-+..+.+-. . +.--++.-..-+.+|+.+..+.+- T Consensus 1 ~~~~~~~~~~~~~~e~--~~~~~~~~~~~~~--~-----------i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~ 65 (478) T protein:vir:10 1 MISINWPWDKPYHEQV--VEQIKPKYETQEE--M-----------ILRLVREHKENIDNITMGERYYNHHPDILDAPPKR 65 (478) T ss_pred CccccCCCCchhHHHH--HHHHhhccCCcHH--H-----------HHHHHHHHHHHHHHHHHHHHHhcCCCchhcccccc Confidence 2110011100000000 0000000000000 0 000011111112223222222211 Q ss_pred ----------------hhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhh Q lcl|NC_021072. 70 ----------------DSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYV 133 (533) Q Consensus 70 ----------------d~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYv 133 (533) -+-...||+-..-+ .-+.||.+..++ +...+.|. .+++ -+|+....++++.+++ T Consensus 66 ~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~-l~g~~~~~~~~~----d~~~~~l~----~~~~-n~~~~~~~~~~~~~~~ 135 (478) T protein:vir:10 66 DVNGDYDETKPDWRMYTNYHQNLVDQKVAY-AVANPVTFGVDN----DKALKQIQ----HTLN-HKWDDKLVDILTAASN 135 (478) T ss_pred ccccccccccccceeccchHHHHHHHHHhh-hccCCeeeecCC----hHHHHHHH----HHHh-cCHHHHHHHHHHHHHh Confidence 12222233221111 236777776655 33333333 2333 2688899999999999 Q ss_pred cCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCce---eEEeccceeeccchhceeccccccc-------- Q lcl|NC_021072. 134 DGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQL---RGEDINTQLTQKAAEYYLYNPKGLK-------- 202 (533) Q Consensus 134 DGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~---~~~~~~~~~~~~~~e~~~y~p~~~~-------- 202 (533) -|+-|++.-+|. +|-..+..+||+.+-+|..-.. ..+.. +++... ......+|.+.... T Consensus 136 ~G~~~~~~~~d~----~g~~~~~~~~p~~~~~i~d~~~-~~~~~~~v~~~~~~-----~~~~~~~y~~~~i~~~~~~~~~ 205 (478) T protein:vir:10 136 KGIEWVQPYVDE----EGEFKTFRVPAEQAVPIWTNKE-RDELQAFIRVYELD-----GAERVEYWTKDDVTYYELKEGQ 205 (478) T ss_pred cCeEEEEEEecC----CCeeEEEEEcccceEEEEcCCC-CCceEEEEEEEEec-----CceEEEEEeCCeEEEEEEcCCe Confidence 999999977763 4667899999999977543211 11111 111111 01111122221110 Q ss_pred -------------------cccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCcc Q lcl|NC_021072. 203 -------------------NSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPE 262 (533) Q Consensus 203 -------------------~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLr-m~EDalVIyRi~RAPe 262 (533) ..++.--++|.=.+ +++....|=++..+....-+. ++=+....-+-++.|- T Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~---------~n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~ 276 (478) T protein:vir:10 206 LIPDFYRSDDHIQPHYYQGNKLMSWGRVPFIPF---------KNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELI 276 (478) T ss_pred eeccccccccccccceecccccccCCccceEEe---------ccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCce Confidence 00111112221111 123344555555444444443 4445555556667775 Q ss_pred ceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchH-H Q lcl|NC_021072. 263 RRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGEL-E 341 (533) Q Consensus 263 RrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei-~ 341 (533) +-+.-.+..+.. +....+....=++++.-+|| ++.+|-...+...+ . T Consensus 277 ~~~~g~~~~~~~-----------------------------~~~~~~~~~~~~~~~~~~~~---~~~~l~~~~~~~~~~~ 324 (478) T protein:vir:10 277 YILKGYEGEDMK-----------------------------DFMHNLKYYKAISVAGESGS---GVDTIKVEVPIDSVKE 324 (478) T ss_pred eeeecCCccccc-----------------------------hhhhhhhhcceEEecCCCCC---cceEEeecCChHHHHH Confidence 543322221110 00001111111234433333 35555444444433 4 Q ss_pred HHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhh Q lcl|NC_021072. 342 DVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKE 421 (533) Q Consensus 342 DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~ 421 (533) -+.-+.+.+|+-.++|---.+.-+| |. .|..|..-+..-..-+.+.+..|...+..+++.=+-+.|+ ..+|. T Consensus 325 ~~~~l~~~i~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~g~--~~~~~---- 396 (478) T protein:vir:10 325 YTKMLRDYIIEFGQGVDFQQDKFGN-SP-SGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRL--DVKVQ---- 396 (478) T ss_pred HHHHHHHHHHHHhCccccCcccccc-cc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Ccccc---- Confidence 5667788899999988422211111 11 1222333322333345666677777777766654444453 22333 Q ss_pred ceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccc Q lcl|NC_021072. 422 HIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMD 501 (533) Q Consensus 422 ~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~ 501 (533) .|.+.|..----.+. +.+++++.+. | .+|.+++++. |...++ .+++.++|++|.... .+.. . + T Consensus 397 ~i~i~f~~~~p~d~~-------e~a~~~~kl~---g-~iS~et~~~~-l~~v~D-~~~E~~ri~~E~~~~-~~~~--~-~ 459 (478) T protein:vir:10 397 DIEITFNFNVMVNEL-------ENSQIAMNST---G-LLSKETILSN-HAWVED-PVAEMERIEQENIEL-NQQL--P-D 459 (478) T ss_pred cceEEecCCCCCCHH-------HHHHHHHHHh---C-CCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHH-Hhhc--c-c Confidence 467777543332332 3344555553 3 4899999976 666532 334445555554321 1100 0 0 Q ss_pred cCCCCCCCCCCCCcccccc Q lcl|NC_021072. 502 PAMDPGNAPPADDMSAQEG 520 (533) Q Consensus 502 ~~~~~~~~~~~~d~~~~~~ 520 (533) ...+..+++..++.+++.. T Consensus 460 ~~~~~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 460 IEEGLNGEQQRQSENNQPE 478 (478) T ss_pred cccccCCCCCCCCCCCCCC Confidence 0112222222222222221 No 126 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=96.32 E-value=0.00075 Score=37.73 Aligned_cols=406 Identities=11% Similarity=0.071 Sum_probs=166.9 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHh-hhhcchhhhHHHHhhcc Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYRE-MVLQPECDSAVDDIVNE 79 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~-m~~~pEvd~AvdeIvne 79 (533) |.. |.+ + ...+...+-+++-...|......++... ... ... ...+|-|.+||+-|.+. T Consensus 1 Mg~--~~~-~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~--------~~~~~~~~~~v~~~i~~ia~~ 59 (423) T protein:vir:81 1 MGF--LQK-L---GLAPSVVATPEPIELVGPIFESLKLSTK-------NMT--------VEQIWEDQPHLRTVTTFIARN 59 (423) T ss_pred Cch--hHh-h---ccccccccCccccccccccccccccccc-------hhh--------HHHHHHhhhHHHHHHHHHHHh Confidence 432 111 1 1112222222222223322111111111 011 111 24789999999999987 Q ss_pred eeeecCCCceEEEE-eccCCCcHHHHHHHHHHHHHHHHHhc---chhhhhHHH----HhhhhcCceeeeeeecCCCCCCC Q lcl|NC_021072. 80 TICGNFDDVPVEVE-LSNLKQSDKIKKLIREEFAEILRLLD---FENRSYEIF----RRWYVDGRLFYHKVIDPKNPRGG 151 (533) Q Consensus 80 aiv~d~~~~~v~v~-l~~~~~S~~ik~~I~eeF~~i~~lL~---f~~~~~~~f----R~WYvDGri~~hkvid~~~~~~g 151 (533) +-- .|+.|- -..-+..+.+++ ..++++|. =...+.++. ..+.+.|.-|..++-|. .-... T Consensus 60 ia~-----lp~~~~~~~~dg~~~~~~~------~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~-~~~~~ 127 (423) T protein:vir:81 60 VAS-----LQLQAFERVEDGGRERVRE------GHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDL-GVDTP 127 (423) T ss_pred Hhh-----CceEEEEEecCCceeeecc------chHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcCcc Confidence 542 454442 111111222211 12334442 112444444 34678899888765553 22344 Q ss_pred eEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCC-c Q lcl|NC_021072. 152 LTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKN-M 230 (533) Q Consensus 152 I~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~-~ 230 (533) +..|++++|..++. +...+. .....+.. ++.++.. +..+.++.+.|.+.+ ....++. . T Consensus 128 ~~~l~p~~~~~v~~-~~~~~~-~~~~~Y~~-----------~~~~~~~------g~~~~~~~~evih~r--~~~~~~~~~ 186 (423) T protein:vir:81 128 TLDIRPIPVSWVQR-RAYKDG-WGSLDYII-----------IESGDND------GRSVKVPGERVIHRH--GYNPKTMKR 186 (423) T ss_pred eEEEeecccceeee-eeccCC-CcceEEEE-----------EEecCCC------ceEEEEcccceEEec--CCCCCCccc Confidence 56777777766633 111110 00010000 0001111 122334444332222 1112222 4 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEE-eeCCCCccccccccc Q lcl|NC_021072. 231 TLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLV-YDANTGEIKDDKKFM 309 (533) Q Consensus 231 i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~v-Yd~~TGev~~d~~~m 309 (533) ++|-+..|..++.....+++...=+=---+.-+-|+..|-...|.+-.++-.+.+..+++.... --..+|.+- T Consensus 187 G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~------ 260 (423) T protein:vir:81 187 GKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMANLRASFSPKSSDVGGTL------ 260 (423) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHHHHHHhccccccCCcce------ Confidence 5688999998888777777764433222345666777775443322122222233333332211 112234321 Q ss_pred hhHhhhcccccCCCCccceeecCCCCCcchHHHH---HHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHH Q lcl|NC_021072. 310 SMLEDFWLPRREGGRGTEISTLPGGQNLGELEDV---KYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFI 386 (533) Q Consensus 310 smlEDywLpRReggrgTEIsTLpGg~nLgei~DV---~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi 386 (533) .|+ .|.+++.|. .+.-++.-+ ++-.....++++||...++..++-+..+.++..+. |..+ T Consensus 261 -vl~----------~g~~~~~l~--~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn~e~~~~~---f~~~- 323 (423) T protein:vir:81 261 -LLE----------DGMKAENFH--TTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSNVREFRKA---LYGD- 323 (423) T ss_pred -ecC----------CCceEEecc--CChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHH---HHHH- Confidence 221 245566653 333344333 35567799999999998865433333333343433 6665 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHH Q lcl|NC_021072. 387 ARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMR 466 (533) Q Consensus 387 ~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~ 466 (533) .|+-.+ ..+.+.|-..|+ +..+|+.-. ..+.|. +..+....+ +.|.+.++.+-.-. -++|.+-++ T Consensus 324 -~L~P~~-~~ie~~l~~~L~-----~~~~~~~~~--~~~~fd----~~~llr~d~-~~r~~~~~~~l~~~-G~~T~NE~R 388 (423) T protein:vir:81 324 -NLGSWI-RIIQDVMNLFLL-----PRVGIDNEK--FYFEFN----LEEKLRASF-EEAAEIKRAAVGNV-AWMTINEVR 388 (423) T ss_pred -HHHHHH-HHHHHHHhhhhc-----CccccccCc--cEEEec----chhhhccCH-HHHHHHHHHHHhCC-CCcCHHHHH Confidence 233211 224444444443 333333322 334444 223332222 34555555432211 255555555 Q ss_pred HHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcc--cccCC-CCCCCCCCCCccccccccCCcccc Q lcl|NC_021072. 467 RQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAE--MDPAM-DPGNAPPADDMSAQEGPAVDAGDA 528 (533) Q Consensus 467 k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~--~~~~~-~~~~~~~~~d~~~~~~~~~~~~~~ 528 (533) . ++++.+ -|+.+ ..|.+ ..++. .+ +.-+..+| T Consensus 389 ~-~~gl~p--------------------~~gGD~~~~p~n~~~~~~--------~~-~~~~~~~t 423 (423) T protein:vir:81 389 A-MDNLPS--------------------IDGGDDLARPLNTEFGDS--------ED-APGEEVET 423 (423) T ss_pred H-HhCCCC--------------------CCCcceeecccccccCcc--------CC-CCCCCCCC Confidence 3 233322 12111 01111 00100 00 00111222 No 127 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=96.30 E-value=0.00077 Score=37.66 Aligned_cols=374 Identities=11% Similarity=0.083 Sum_probs=157.8 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |.. |.+- ++ ...+.+. +.++-.......+.+..... ..+. . +....+|-|.+||+-|++.+ T Consensus 1 M~~--f~~~-~~---~~~~~~~----~~~~~~~~~~~~~~~~~~~~-~~v~-------~-~~al~~~~v~~~i~~ia~~i 61 (386) T protein:vir:49 1 MPI--FNIT-NL---ATESPPI----NQESFFDIADSDFLASLNSS-EWVS-------A-ENALKNSDLFSIISQLSNDL 61 (386) T ss_pred Cch--hhhh-cc---CCCCccc----chhhhhhhhhccccccccCC-ceec-------h-hhhhccHHHHHHHHHHHHHh Confidence 542 3221 11 1111111 11111111111111111000 0011 1 11245889999999998875 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhh----hHHHHhhhhcCceeeeeeecCCCCCCCeEEEE Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRS----YEIFRRWYVDGRLFYHKVIDPKNPRGGLTELR 156 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~----~~~fR~WYvDGri~~hkvid~~~~~~gI~elr 156 (533) -- .|+.+. .. - .+.++..-+-...+ +.++..|+..|.-|..++-+. .+-+++|. T Consensus 62 a~-----~p~~~~--~~-----~-------~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~---~g~~~~l~ 119 (386) T protein:vir:49 62 AT-----AKITTS--RK-----Q-------LQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRND---NGRDMKWE 119 (386) T ss_pred hh-----Cceeec--cc-----h-------hhhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECC---CCcEEEEE Confidence 42 333332 11 1 12233223333333 345556888999999877764 34589999 Q ss_pred EcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccc---cccccCCcceeccchhhccccccccCCC-Cccc Q lcl|NC_021072. 157 YIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKG---LKNSTNQGMKIATDSVTYCHSGIQDLNK-NMTL 232 (533) Q Consensus 157 ~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~---~~~~~~~~~kI~~dai~y~hsGl~d~~~-~~i~ 232 (533) +++|..++.++.. ++....+.. ....+.+ ..+.....++|... ..++ -.++ T Consensus 120 ~i~~~~v~v~~~~-----~~~~~~y~~----------~~~~~~~~~~~~~~~~evih~~~~----------~~~~~~~G~ 174 (386) T protein:vir:49 120 YLRPSQVSFNRLD-----NQNGLYYNI----------TFDDPHIAPKQHVPQNDILHFRLL----------SVDGGLTSV 174 (386) T ss_pred EecCceeEEEEcC-----CCceEEEEE----------EEcCccccceeEEccccEEEecCC----------CCCCccccc Confidence 9999998654321 111111100 0001111 01122233333322 1122 1356 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhH Q lcl|NC_021072. 233 SHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSML 312 (533) Q Consensus 233 syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msml 312 (533) |.|..|++.+.....+++...-+--..+--+-+..++....+.. ++ .++..+... + ...|.+ + .+ T Consensus 175 s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~-~~-~~~~~~~~~-----~-~n~g~~------~-vl 239 (386) T protein:vir:49 175 SPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLDF-KT-KVSRSRQAM-----K-QMQGGP------L-VL 239 (386) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHH-HH-HHHHHHHHh-----c-cCCCCc------e-ec Confidence 99999999999999998888766666677788888886554433 33 333333221 1 122321 1 11 Q ss_pred hhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHH Q lcl|NC_021072. 313 EDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRK 391 (533) Q Consensus 313 EDywLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~ 391 (533) . .|.+++.|.-. +.+.-++-.++....+.++++||.+.|+.+++- -..++.+ + ..+..| |+. T Consensus 240 -------~---~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~-~~~~~~~-~--~~~~~~---i~~ 302 (386) T protein:vir:49 240 -------D---DLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQ-QSSLEMI-Y--NIYFKS---VSR 302 (386) T ss_pred -------C---CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc-cchHHHH-H--HHHHHH---HHH Confidence 1 24566666321 112223445788899999999999999753221 1111211 2 112222 333 Q ss_pred HHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhC Q lcl|NC_021072. 392 RFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLK 471 (533) Q Consensus 392 ~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~ 471 (533) .+..+.. .|...|. . .+.++.. ++.+.+. ..+...+..+- -+-. T Consensus 303 ~l~~i~~-~~~~~l~-~-------------~~~~~~~------~~~~~d~-~~~~~~~~~l~--~~g~------------ 346 (386) T protein:vir:49 303 YLRPFVS-EMSKKLS-C-------------EVDVDIS------PAVDPTG-SNYISLINSMV--KSGT------------ 346 (386) T ss_pred HHHHHHH-HHHHHhc-c-------------hhcccch------hhhccCH-HHHHHHHHHHH--hCCC------------ Confidence 3333222 1222221 1 1122111 1110000 11112222110 0122 Q ss_pred CCHHHHHHHHHHHHHhhhcCCCCCCCccc-ccCC--CCCCCCCCCCccccc Q lcl|NC_021072. 472 QTDQEIKEIDKQIDSEREAGLIVDPMAEM-DPAM--DPGNAPPADDMSAQE 519 (533) Q Consensus 472 ~tDeeI~e~~kqi~~E~~~~~~~~p~~~~-~~~~--~~~~~~~~~d~~~~~ 519 (533) +|..|+-++.. ..+..++|-... ++.. ..||+. +.++ T Consensus 347 ~t~nE~r~~l~------~~~~~~~~~~~~~~~~~~~~~gGd~-----~~~~ 386 (386) T protein:vir:49 347 LAQNQGLYILQ------QAEILPKELPDGKNPNRTSLKGGEI-----NEQD 386 (386) T ss_pred cCHHHHHHHHh------hCCCCCCcCcchhccCCCCCCCCCC-----CCCC Confidence 34444433321 233333221111 1111 111111 1111 No 128 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=96.24 E-value=0.00083 Score=37.47 Aligned_cols=455 Identities=11% Similarity=0.061 Sum_probs=176.4 Q ss_pred CCccccceeeec----ccccccc--CCCCCCCCCcccceeecccccccccchhhhh-h-HHHHHHHHHHhhhhcchhhh- Q lcl|NC_021072. 1 MSNQLFGFSLER----AKKVPKG--PSFVQKDSMDGSQPIVGGGYYGYSVDFDGTV-R-NEYELITRYREMVLQPECDS- 71 (533) Q Consensus 1 ~~~~~fg~~i~~----~~~~~~~--~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~-~-~~~~LI~~YR~m~~~pEvd~- 71 (533) |-.-||=-+=-. .....+. ..+..+...+...... ..-..+ . ..-....+|+.+..+.+-.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~ 71 (501) T protein:vir:27 1 MEQTLFTDSTGQDLVLNLRFHRESRIRYRADNLEELMVNNW---------ELLKNFINHHKLRQAPRIQELLDYARGENH 71 (501) T ss_pred CCceeEEeccchhhhhhcccChhHHHhhccccccccccccH---------HHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Confidence 555554222111 0011100 0111111111110000 000000 0 00011122222222222110 Q ss_pred ---------------------HHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHh Q lcl|NC_021072. 72 ---------------------AVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRR 130 (533) Q Consensus 72 ---------------------AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~ 130 (533) =..-||+...- =.=+.|+.+.+++.+..+.+.+ -+..++..-+|+....++++. T Consensus 72 ~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~-yl~g~p~~~~~~d~~~~~~~~~----~l~~~~~~n~~~~~~~~~~~~ 146 (501) T protein:vir:27 72 DVLQFGRRKDREMADKRAVHNYGRMISKFKTG-YLAGNPIRVEYDDNDNNSQNDD----TIKRIGRINDIDSHNRTLIRD 146 (501) T ss_pred cccccCccCccccccceeccchHHHHHHHHhh-hhcccCeeEecCCccchHHHHH----HHHHHHHhcChhHHHHHHHHH Confidence 01112221110 0125677888776555444444 455567777899999999999 Q ss_pred hhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCC--cCceeEEeccceeeccchhceecccccccc--ccC Q lcl|NC_021072. 131 WYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKR--PEQLRGEDINTQLTQKAAEYYLYNPKGLKN--STN 206 (533) Q Consensus 131 WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~--~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~--~~~ 206 (533) .++-|+-|.+.-.|. .|=..+..+||+.+-+|..-.... .-..+++.... ..+...-..+|.+..... ..+ T Consensus 147 ~~~~G~a~~~vy~de----d~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~-~~~~~~~~~vyt~~~v~~~~~~~ 221 (501) T protein:vir:27 147 LSQTGRAYEVIYRNE----YDETRIKRLNPLETFVIYDNSLEDNSIAAVRYYNRGT-LQNAKDVVEIYTNEHIYTLDASD 221 (501) T ss_pred HhhCCeEEEEEEeCC----CCceEEEEEccceeEEEecCCCCCceEEEEEEEEeee-cCCcEEEEEEEeCCeEEEEEeCC Confidence 999999999877653 344678899999997764332111 01111221100 001111122343332210 000 Q ss_pred Ccceeccchhhccc-ccc---c-cCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHH Q lcl|NC_021072. 207 QGMKIATDSVTYCH-SGI---Q-DLNKNMTLSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQ 280 (533) Q Consensus 207 ~~~kI~~dai~y~h-sGl---~-d~~~~~i~syL~~AiK~~NqLrm-~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeq 280 (533) ....+.. +.| .|. + -+++....|=++..+.....+-. +=+....-+.++.|-+-+.-.+..+.+... T Consensus 222 ~~~~~~~----~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~--- 294 (501) T protein:vir:27 222 DFNEISV----TTHAFGTVPITEFLNNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQA--- 294 (501) T ss_pred ceeeccc----cccCCCcccEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccch--- Confidence 1100000 111 111 1 02334455666665555554433 233333445556665554432222211000 Q ss_pred HHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccC----CCCccceeecCCCCCcchH-HHHHHHHHHHHHhcC Q lcl|NC_021072. 281 YLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRRE----GGRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALN 355 (533) Q Consensus 281 Yl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRRe----ggrgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~ 355 (533) .. |.++ + ..+++--+ ++.+..+..|-...+...+ .-+.-+.+.+|.-.+ T Consensus 295 --~~-~~~~--~---------------------~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~ 348 (501) T protein:vir:27 295 --SD-MKRT--R---------------------LMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTN 348 (501) T ss_pred --hh-hhhc--C---------------------ceeecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhC Confidence 00 0000 0 11221111 1112234444222222211 224555677888888 Q ss_pred CCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCH-hHHhhhhhceeEEEeccchHH Q lcl|NC_021072. 356 VPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSL-EEWDEMKEHIQFDFIADNYFT 434 (533) Q Consensus 356 VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~-eew~~~~~~i~~~f~~Dn~f~ 434 (533) +|---.+.-++ |. .+..|.--......-+.+.++.|..-+.++++.=+-+-++... .+++ ...|.+.|...-.-. T Consensus 349 ~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d--~~~i~v~f~~~~p~n 424 (501) T protein:vir:27 349 IPDMSDTNFSG-NT-SGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFD--ESLLKITFTPNLPKS 424 (501) T ss_pred CcccCcccccc-Cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--cccceEEeCCCCCcC Confidence 88422222111 11 1222322222334556677777777777766553332232221 1111 234788886443323 Q ss_pred HHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCC Q lcl|NC_021072. 435 ELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADD 514 (533) Q Consensus 435 E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d 514 (533) + .+.++++..+. | .+|.+++++. |...++ .+++.++|++|....-.. .....-++. .+...| T Consensus 425 ~-------~e~ad~~~kl~---g-~iS~et~l~~-l~~v~D-~~~E~eri~~E~~e~~~~---~~~~~~~~~--~~~~~d 486 (501) T protein:vir:27 425 L-------NEQVSILTGLG---G-QVSQETALSL-SGLVES-PNEELDKINKEVSEIDFK---GYSNDFNEH--VGKYTD 486 (501) T ss_pred H-------HHHHHHHHHHh---c-cCcHHHHHHh-CCCCCC-HHHHHHHHHHHHHhhhHh---hhcCccccc--cccccC Confidence 3 23344555553 4 3899999987 544431 223333444444321111 000000000 011111 Q ss_pred ccccccccCCccccch Q lcl|NC_021072. 515 MSAQEGPAVDAGDAKR 530 (533) Q Consensus 515 ~~~~~~~~~~~~~~~~ 530 (533) ...+.. +-+..+..| T Consensus 487 ~~~~~~-~d~~e~~~~ 501 (501) T protein:vir:27 487 EVKETH-TDDFERAYE 501 (501) T ss_pred CCCCCc-cccccccCC Confidence 111110 011111111 No 129 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=96.21 E-value=0.00087 Score=37.36 Aligned_cols=438 Identities=16% Similarity=0.206 Sum_probs=171.3 Q ss_pred CC--ccccceeeeccccccccC--CCCCCCCCcccceeeccccc-------ccccchhhhhhHHHHH--------HHHHH Q lcl|NC_021072. 1 MS--NQLFGFSLERAKKVPKGP--SFVQKDSMDGSQPIVGGGYY-------GYSVDFDGTVRNEYEL--------ITRYR 61 (533) Q Consensus 1 ~~--~~~fg~~i~~~~~~~~~~--s~~~~~~~dg~~~~~~~~~~-------~~~~~~~~~~~~~~~L--------I~~YR 61 (533) |. -.|- -++-++ -+.|.-..--..++..++|. .|..++.+.....-.+ +..-- T Consensus 46 ~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la 117 (695) T protein:vir:78 46 MGRRGALN--------ALDAAPVAEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLV 117 (695) T ss_pred hccccccc--------ccccccccCCCcccccceeceeccccCCccccchhhhhhcccccccccchhhhccCcchHHHHH Confidence 11 0000 000000 00000011111112111111 1111111111111111 23345 Q ss_pred hhhhcchhhhHHHHhhcceeee------cCCCc----eEEEEeccCC--CcHHHHHHHHHHHHHHHHHhcchhhhhHHHH Q lcl|NC_021072. 62 EMVLQPECDSAVDDIVNETICG------NFDDV----PVEVELSNLK--QSDKIKKLIREEFAEILRLLDFENRSYEIFR 129 (533) Q Consensus 62 ~m~~~pEvd~AvdeIvneaiv~------d~~~~----~v~v~l~~~~--~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR 129 (533) .|+|+||+++++.=|+.||+-. ..... -+.+.-+..+ .++.| ++|..||+. |++..+..+.++ T Consensus 118 ~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi-~~L~~e~er----L~V~~~l~eaik 192 (695) T protein:vir:78 118 LLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQL-KQINDEIER----LRIRDAVRTTVI 192 (695) T ss_pred HHhhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHH-HHHHHHHHH----HHHHHHHHHHHH Confidence 6899999999999999999532 11111 0222211222 22344 556666663 333333333332 Q ss_pred hhhhcCceeeeeeecCC--------------CCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhcee Q lcl|NC_021072. 130 RWYVDGRLFYHKVIDPK--------------NPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYL 195 (533) Q Consensus 130 ~WYvDGri~~hkvid~~--------------~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 195 (533) -=-+-|.-.....|+.. -.|+.++.|+.|||..+.+-- .. +...... .. T Consensus 193 ~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~-~n--------~~dP~sp--------df 255 (695) T protein:vir:78 193 HDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNN-YN--------SINPVAD--------DF 255 (695) T ss_pred hhccccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccch-hh--------hccchhh--------cc Confidence 21222222222223221 236678889999998885511 00 0011111 12 Q ss_pred ccccccccccCCcceeccchhhccc-----cccccCCCCccchhHHHHHHHHHH-HHHHHHHHHHHHHhcCccceEEEcc Q lcl|NC_021072. 196 YNPKGLKNSTNQGMKIATDSVTYCH-----SGIQDLNKNMTLSHLHKAIKAVNQ-LRMIEDSLVIYRLSRAPERRIFYID 269 (533) Q Consensus 196 y~p~~~~~~~~~~~kI~~dai~y~h-----sGl~d~~~~~i~syL~~AiK~~Nq-Lrm~EDalVIyRi~RAPeRrvfyID 269 (533) |.|....-. +.+|+.+-+.--. --|...-...++|.+..+..-..+ +++...+.=+ ++.+.-+ ++-.| T Consensus 256 gkP~~y~V~---G~kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~L--i~~~~v~-~lk~d 329 (695) T protein:vir:78 256 YKPSTWWMI---GTEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDI--VKQFSVS-GILMD 329 (695) T ss_pred CCCceEEEe---ceEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHH--HHhhhhH-HHHHH Confidence 333221111 1122211110000 001111122345655555544333 3333332222 1111111 11112 Q ss_pred CC-CCchHHHHHHH--HHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHHH-H Q lcl|NC_021072. 270 VG-NLPKNKAEQYL--REVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVK-Y 345 (533) Q Consensus 270 vG-nlpk~KAeqYl--~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~-Y 345 (533) .. -|.....++.. -+++++||+-. |-+--|+ =.|+|- +. ..+|+-++||. = T Consensus 330 la~~L~~g~~~~l~~R~eli~~~Rsn~------G~~llDk----~~Eefe-------------q~--stslSGLddVi~q 384 (695) T protein:vir:78 330 LAQALMPGANVDLSMRAELINRYRDNR------NILFLDK----ATEEFF-------------QF--NTPLSGLDALQAQ 384 (695) T ss_pred HHHhhcChhHHHHHHHHHHHHHhcCcc------ceEEEec----CCcceE-------------EE--ecccCCHHHHHHH Confidence 11 01111122222 25556665211 1111000 013442 11 24788888875 4 Q ss_pred HHHHHHHhcCCCccccCC--CCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHH-----HHHhccCCCHhHHhh Q lcl|NC_021072. 346 FQKKLYKALNVPSSRLET--ETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKT-----QLILKGVMSLEEWDE 418 (533) Q Consensus 346 F~~kLy~aL~VP~sRl~~--~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~-----qLilkgi~t~eew~~ 418 (533) |..-+=-+.+||+.||=. -.|||--.-+ |.-.|...|..+|. ..+...|++ |+-+-| . T Consensus 385 f~q~VAgaa~IPltkLfGqSPkGlNATGE~----D~rnYYD~I~s~Qe---~~L~p~L~rl~~ii~rS~~G--------~ 449 (695) T protein:vir:78 385 AQEQMSAVSHIPLIKLLGITPTGLNASSEG----EIRVWYDYVRAYQR---NALQQLMNDVIVMIQLSLFG--------A 449 (695) T ss_pred HHHHHHhhhcCchhhhhccCCccccccchh----hHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHhcC--------C Confidence 888888899999999943 3688752222 33348888988885 334444443 333334 3 Q ss_pred hhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCC-CCC Q lcl|NC_021072. 419 MKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIV-DPM 497 (533) Q Consensus 419 ~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~-~p~ 497 (533) +...|.|.|+.=..-+|..-+||...+.+....+-.- | .++.+-|+ ..+..+...+ |- .-+ T Consensus 450 idpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~-g-vI~~~evr---------------~rL~~d~~s~-Y~~~~D 511 (695) T protein:vir:78 450 VDPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQE-Q-VIRPDQVA---------------ARLNTEPDGP-YAGKLD 511 (695) T ss_pred CCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHh-c-CCCHHHHH---------------HHHhcCCCcc-cccccc Confidence 5567899999877788888888888887765543222 1 12222222 2222222111 21 112 Q ss_pred cccccCCCCCC--------------CCCCCCccccccccCCc-------------cccchhcC Q lcl|NC_021072. 498 AEMDPAMDPGN--------------APPADDMSAQEGPAVDA-------------GDAKRGEF 533 (533) Q Consensus 498 ~~~~~~~~~~~--------------~~~~~d~~~~~~~~~~~-------------~~~~~~~~ 533 (533) ++.+|+.+..+ .+...+.. +.+++..+ .+.....+ T Consensus 512 ~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~g~~~~~~~~~~~~~~~~~~ag~~~~ 573 (695) T protein:vir:78 512 ANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPG-GARAGATAPPTVANVNANVKPREAGAQDA 573 (695) T ss_pred cccCCCcCccchhhhhHhhhcCcccccccCCCC-CCCCCCCCCCceeeeeccccccccCCCCc Confidence 22222221110 00000000 00011110 01111111 No 130 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=96.19 E-value=0.0009 Score=37.28 Aligned_cols=435 Identities=11% Similarity=0.103 Sum_probs=173.2 Q ss_pred CCccc-cceeeec----cccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhh---- Q lcl|NC_021072. 1 MSNQL-FGFSLER----AKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDS---- 71 (533) Q Consensus 1 ~~~~~-fg~~i~~----~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~---- 71 (533) |...| -|=-|.. .-.......+..+.+.+-.... +..-+..-...+.+|+.+..+.+-.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~-----------i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~ 69 (483) T protein:vir:12 1 MAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEM-----------IVRYIKQHLEKLPEISIGQEYYEQRPDIVK 69 (483) T ss_pred CccchhcCCceeecCcchhhhhhhcccccCCchhhHHHH-----------HHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 22211 1111111 0000111111111111100000 00000011122344555444433321 Q ss_pred -----------------------HHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHH Q lcl|NC_021072. 72 -----------------------AVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIF 128 (533) Q Consensus 72 -----------------------AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~f 128 (533) -..-||+.. +.=.-+.|+.+..++.+ ..+ .++.+++ =+|+....+++ T Consensus 70 ~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~-~~~l~G~p~~~~~~d~~----~~~----~l~~~~~-n~~~~~~~~~~ 139 (483) T protein:vir:12 70 EPKPVDATGAVDPLKPDDRMITNFHANLVDQK-VSYIVGKPIAFKHTDDE----VVK----RIDEVLG-NRFDDKLHSVL 139 (483) T ss_pred ccccccccccccccccccccccchHHHHHHHH-hhhhcccCceeccCChH----HHH----HHHHHHh-ccHHHHHHHHH Confidence 111122221 11123667777665522 222 2333332 26888889999 Q ss_pred HhhhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCc---eeEEeccce----e-eccchhceeccccc Q lcl|NC_021072. 129 RRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQ---LRGEDINTQ----L-TQKAAEYYLYNPKG 200 (533) Q Consensus 129 R~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~---~~~~~~~~~----~-~~~~~e~~~y~p~~ 200 (533) +..++-|+-|.+.-+|. +|-..++.+||+.+-.+..-. ..... .+++...+. + +.....+|.+.-.. T Consensus 140 ~~~~~~G~~y~~v~~d~----d~~~~i~~~~p~~~~~v~d~~-~~~~~~~~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~ 214 (483) T protein:vir:12 140 TGASNKGIEWLHPYLDE----EGEFKLFRVPAEQGIPIWTDK-EHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGS 214 (483) T ss_pred HHHhhCCeEEEEEEEcC----CCceEEEEEcccceEEEEcCC-CCCceEEEEEEEEeecceEEEEEecCeEEEEEEeCCe Confidence 99999999999977763 455789999999987654311 11111 111111100 0 00111112211111 Q ss_pred cccccCCcceeccchhhcccc--cccc----CCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCC Q lcl|NC_021072. 201 LKNSTNQGMKIATDSVTYCHS--GIQD----LNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNL 273 (533) Q Consensus 201 ~~~~~~~~~kI~~dai~y~hs--Gl~d----~~~~~i~syL~~AiK~~NqLr-m~EDalVIyRi~RAPeRrvfyIDvGnl 273 (533) ........ .....+.+..- |.++ +++....|=++..+.....+. ++=+....-+.++.|-+-+.-.+.-++ T Consensus 215 ~~~~~~~~--~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~ 292 (483) T protein:vir:12 215 LIPDYSNN--LENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQEL 292 (483) T ss_pred eeeccccc--ccccccccccCCCCccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccc Confidence 00000000 00000001100 1111 112334455655444444443 345555555667777554332222221 Q ss_pred chHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchH-HHHHHHHHHHHH Q lcl|NC_021072. 274 PKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGEL-EDVKYFQKKLYK 352 (533) Q Consensus 274 pk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei-~DV~YF~~kLy~ 352 (533) + +..+ .+..++...-++ |.++.+|-...+.+.. .-+.-+.+.+|. T Consensus 293 ~-----~~~~---------------------------~~~~~~~~~~~~--~~~~~~l~~~~~~~~~~~~~~~l~~~I~~ 338 (483) T protein:vir:12 293 P-----EFKR---------------------------LLRYYGAIKVSD--NGGVDTIQVEVPVENSKKYLDELYQKIML 338 (483) T ss_pred h-----hHHH---------------------------hhhhccccccCC--CCcceEEeecCCHHHHHHHHHHHHHHHHH Confidence 1 1111 111111111112 1235555443343332 334566677888 Q ss_pred hcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccch Q lcl|NC_021072. 353 ALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNY 432 (533) Q Consensus 353 aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~ 432 (533) ..++|---.+.-++ +. .|..|.--+.....-+.+.++.|...+.++++.=+-+-|+ ..+|.. |.+.|....- T Consensus 339 ~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~--~~~~~~----i~v~f~~~~p 410 (483) T protein:vir:12 339 FGQAVDFSSDKFGS-AP-SGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI--KGEHKD----VDISFNYNKV 410 (483) T ss_pred HhCCCCCCcccccc-Cc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CCccce----eeEEeCCCCC Confidence 88888422222111 11 2223443444445557778888888888877754333343 345554 5666754333 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCC Q lcl|NC_021072. 433 FTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPA 512 (533) Q Consensus 433 f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~ 512 (533) -.+. +.+++++.+. | .+|.+++++. |...+ +.+++.++|++|..+..-..++. ++.+..+.+.+ T Consensus 411 ~~~~-------~~a~~~~kl~---G-iiS~et~~~~-~~~v~-d~~~E~~ri~~E~~~~~~~~~~~---~~~~~d~~~~~ 474 (483) T protein:vir:12 411 ANTE-------LQVQTAQQSM---G-IVSHETVLEN-HPFVE-DLQAELERIEQEQMEYNKQLPNL---DDGGADGAQQQ 474 (483) T ss_pred CCHH-------HHHHHHHHHh---c-cCchHHHHHh-CCCCC-CHHHHHHHHHHHHHHHHhhcccc---cccccCCcccC Confidence 2232 2244555553 4 4899999987 44432 12333444455443322111110 11111111111 Q ss_pred CCccccccccCCccccchhc Q lcl|NC_021072. 513 DDMSAQEGPAVDAGDAKRGE 532 (533) Q Consensus 513 ~d~~~~~~~~~~~~~~~~~~ 532 (533) ++.+ .+++| T Consensus 475 ~~~~-----------~~e~e 483 (483) T protein:vir:12 475 ERSN-----------NKESE 483 (483) T ss_pred CCCC-----------cccCC Confidence 1111 11111 No 131 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=96.17 E-value=0.00092 Score=37.24 Aligned_cols=398 Identities=14% Similarity=0.143 Sum_probs=168.6 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |.. |.. ..+ ..+..++....+......+ +.+.. ...+.. ..-..++-|..||+-|.+.+ T Consensus 1 Mg~--f~~-----~~~--r~~~~~~~~~~~~~~~~~~-~~~~~---~~~~~~--------~~al~~~~v~~cv~~Ia~~i 59 (416) T protein:vir:81 1 MGI--FYK-----NEK--RDLQYNEDDLQMMVQTLPG-FQGTK---LRQYKD--------IEAIRHSDIFTAVMMIASDL 59 (416) T ss_pred CCc--ccc-----ccc--ccccCCCcchhHHHHHhcc-ccccC---ccccch--------hhhhcchHHHHHHHHHHHhh Confidence 543 321 111 1111122221221111111 11100 000100 12245778999999988875 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcch----hhh----hHHHHhhhhcCceeeeeeecCCCCCCCe Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFE----NRS----YEIFRRWYVDGRLFYHKVIDPKNPRGGL 152 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~----~~~----~~~fR~WYvDGri~~hkvid~~~~~~gI 152 (533) -. .|+.+. ++.. ... -..++.+|+-. ..+ +.++..+...|.-|+.++-|. .+-+ T Consensus 60 A~-----~p~~~~-~~~~--~~~-------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~---~G~~ 121 (416) T protein:vir:81 60 AR-----MPIRVT-VNGQ--INY-------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDK---TGEP 121 (416) T ss_pred cc-----CceEEe-cCcc--ccc-------cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECC---CCcE Confidence 53 454443 1111 111 12455665432 223 344455788999999977764 3448 Q ss_pred EEEEEcChhhceehhhccCCCcCceeEEeccceeecc-chhceeccccccccccCCcceeccchhhccccccccCCCCcc Q lcl|NC_021072. 153 TELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQK-AAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMT 231 (533) Q Consensus 153 ~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~-~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i 231 (533) ++|.+|+|..++.++...+. ..+... .+... ......| .+...++|.. ...++-.+ T Consensus 122 ~~L~~i~~~~v~v~~~~~g~----~~~~~~--~~~~~~~~~~~~~-------~~~evihir~----------~~~d~~~G 178 (416) T protein:vir:81 122 MNLTFRKTSEIELKSDARGR----LYYFHQ--RIDSNGNNIERNV-------KFEDMLDIKF----------YSLDGING 178 (416) T ss_pred EEEEEEcCceeEEEECCCcc----EEEEEE--EecCCCceeEEEE-------ccccEEEecc----------CCCCCccc Confidence 99999999999654322111 000000 00000 0000011 1122233321 11122345 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHH-HHHhcccEEEeeCCCCccccccccch Q lcl|NC_021072. 232 LSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLRE-VMGRYRNKLVYDANTGEIKDDKKFMS 310 (533) Q Consensus 232 ~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~-im~~~rnk~vYd~~TGev~~d~~~ms 310 (533) +|-|+.|++++......++..-=+=-.-+--+-|..++ |.+...+|.+=+++ +...|. | .++..+.+ T Consensus 179 ~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~~~~~~~~~~~~~~~~~~~~---------g-~~nag~~~- 246 (416) T protein:vir:81 179 LSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFHKSFS---------G-TKQAGKVV- 246 (416) T ss_pred cCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC-CCCCCHHHHHHHHHHHHHHhc---------C-ccccCcee- Confidence 68999999999888888877664434445556667777 45543444333333 222221 1 11111111 Q ss_pred hHhhhcccccCCCCccceeecCCCCCc-chHHHHHHHHHHHHHhcCCCccccCCCC-cccccchhhhhHHhhhHHHHHHH Q lcl|NC_021072. 311 MLEDFWLPRREGGRGTEISTLPGGQNL-GELEDVKYFQKKLYKALNVPSSRLETET-TFNIGRAAEITRDEVKFQKFIAR 388 (533) Q Consensus 311 mlEDywLpRReggrgTEIsTLpGg~nL-gei~DV~YF~~kLy~aL~VP~sRl~~~~-~~~~g~~~eItRDElkF~Kfi~r 388 (533) .++ .|.+++.|.-...- .-++-.++.++.+.++++||.+.|+.++ +++ +.-..+-|..-|.- T Consensus 247 vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~------~~~~~~~~~~~l~P 310 (416) T protein:vir:81 247 VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMS------ITDANLDYLSTLKP 310 (416) T ss_pred ecC----------CCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcc------HHHHHHHHHHHHHH Confidence 121 14555555322211 2244456678899999999999996432 222 12222334444444 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHH Q lcl|NC_021072. 389 LRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQ 468 (533) Q Consensus 389 Lr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~ 468 (533) +-.++...|...|- + ++. ...+.|. +.++...+ ...|++.++.+-.- -++|.+-++.. T Consensus 311 ~~~~ie~~ln~~l~---------~--~~~----~~~~~f~----~~~l~~~D-~~~~~~~~~~~~~~--G~~T~NE~R~~ 368 (416) T protein:vir:81 311 YITCVCAELNFKFN---------D--EYV----NREFKFD----TTEIRVVD-EKTQAEIDKINIDS--GKMNIDEIRQR 368 (416) T ss_pred HHHHHHHHHhhhcc---------c--ccc----CceEEEe----chhhhccC-HHHHHHHHHHHHhC--CCcCHHHHHHH Confidence 44444444443321 1 121 1234444 33333222 23455555554332 35555555532 Q ss_pred HhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccc---CCCCC---CCCCCCCccccccccCCccccch Q lcl|NC_021072. 469 VLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDP---AMDPG---NAPPADDMSAQEGPAVDAGDAKR 530 (533) Q Consensus 469 IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~---~~~~~---~~~~~~d~~~~~~~~~~~~~~~~ 530 (533) |++ +.++.++....+ +.-+. ++.+........ ....-+|.-+ T Consensus 369 -~gl------------------~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~-~~~kgGe~n~ 416 (416) T protein:vir:81 369 -DGL------------------APIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATD-KKLKGGEENE 416 (416) T ss_pred -hCC------------------CCCCCCCcceEeecccccccccccccCcccccccc-cccCCCCCCC Confidence 333 334444322111 00000 000001000000 0011122111 No 132 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=96.17 E-value=0.00092 Score=37.24 Aligned_cols=398 Identities=14% Similarity=0.143 Sum_probs=168.6 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |.. |.. ..+ ..+..++....+......+ +.+.. ...+.. ..-..++-|..||+-|.+.+ T Consensus 1 Mg~--f~~-----~~~--r~~~~~~~~~~~~~~~~~~-~~~~~---~~~~~~--------~~al~~~~v~~cv~~Ia~~i 59 (416) T protein:vir:45 1 MGI--FYK-----NEK--RDLQYNEDDLQMMVQTLPG-FQGTK---LRQYKD--------IEAIRHSDIFTAVMMIASDL 59 (416) T ss_pred CCc--ccc-----ccc--ccccCCCcchhHHHHHhcc-ccccC---ccccch--------hhhhcchHHHHHHHHHHHhh Confidence 543 321 111 1111122221221111111 11100 000100 12245778999999988875 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcch----hhh----hHHHHhhhhcCceeeeeeecCCCCCCCe Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFE----NRS----YEIFRRWYVDGRLFYHKVIDPKNPRGGL 152 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~----~~~----~~~fR~WYvDGri~~hkvid~~~~~~gI 152 (533) -. .|+.+. ++.. ... -..++.+|+-. ..+ +.++..+...|.-|+.++-|. .+-+ T Consensus 60 A~-----~p~~~~-~~~~--~~~-------~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~---~G~~ 121 (416) T protein:vir:45 60 AR-----MPIRVT-VNGQ--INY-------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDK---TGEP 121 (416) T ss_pred cc-----CceEEe-cCcc--ccc-------cchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECC---CCcE Confidence 53 454443 1111 111 12455665432 223 344455788999999977764 3448 Q ss_pred EEEEEcChhhceehhhccCCCcCceeEEeccceeecc-chhceeccccccccccCCcceeccchhhccccccccCCCCcc Q lcl|NC_021072. 153 TELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQK-AAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMT 231 (533) Q Consensus 153 ~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~-~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i 231 (533) ++|.+|+|..++.++...+. ..+... .+... ......| .+...++|.. ...++-.+ T Consensus 122 ~~L~~i~~~~v~v~~~~~g~----~~~~~~--~~~~~~~~~~~~~-------~~~evihir~----------~~~d~~~G 178 (416) T protein:vir:45 122 MNLTFRKTSEIELKSDARGR----LYYFHQ--RIDSNGNNIERNV-------KFEDMLDIKF----------YSLDGING 178 (416) T ss_pred EEEEEEcCceeEEEECCCcc----EEEEEE--EecCCCceeEEEE-------ccccEEEecc----------CCCCCccc Confidence 99999999999654322111 000000 00000 0000011 1122233321 11122345 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHH-HHHhcccEEEeeCCCCccccccccch Q lcl|NC_021072. 232 LSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLRE-VMGRYRNKLVYDANTGEIKDDKKFMS 310 (533) Q Consensus 232 ~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~-im~~~rnk~vYd~~TGev~~d~~~ms 310 (533) +|-|+.|++++......++..-=+=-.-+--+-|..++ |.+...+|.+=+++ +...|. | .++..+.+ T Consensus 179 ~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~~~~~~~~~~~~~~~~~~~~---------g-~~nag~~~- 246 (416) T protein:vir:45 179 LSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFHKSFS---------G-TKQAGKVV- 246 (416) T ss_pred cCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC-CCCCCHHHHHHHHHHHHHHhc---------C-ccccCcee- Confidence 68999999999888888877664434445556667777 45543444333333 222221 1 11111111 Q ss_pred hHhhhcccccCCCCccceeecCCCCCc-chHHHHHHHHHHHHHhcCCCccccCCCC-cccccchhhhhHHhhhHHHHHHH Q lcl|NC_021072. 311 MLEDFWLPRREGGRGTEISTLPGGQNL-GELEDVKYFQKKLYKALNVPSSRLETET-TFNIGRAAEITRDEVKFQKFIAR 388 (533) Q Consensus 311 mlEDywLpRReggrgTEIsTLpGg~nL-gei~DV~YF~~kLy~aL~VP~sRl~~~~-~~~~g~~~eItRDElkF~Kfi~r 388 (533) .++ .|.+++.|.-...- .-++-.++.++.+.++++||.+.|+.++ +++ +.-..+-|..-|.- T Consensus 247 vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~------~~~~~~~~~~~l~P 310 (416) T protein:vir:45 247 VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMS------ITDANLDYLSTLKP 310 (416) T ss_pred ecC----------CCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcc------HHHHHHHHHHHHHH Confidence 121 14555555322211 2244456678899999999999996432 222 12222334444444 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHH Q lcl|NC_021072. 389 LRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQ 468 (533) Q Consensus 389 Lr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~ 468 (533) +-.++...|...|- + ++. ...+.|. +.++...+ ...|++.++.+-.- -++|.+-++.. T Consensus 311 ~~~~ie~~ln~~l~---------~--~~~----~~~~~f~----~~~l~~~D-~~~~~~~~~~~~~~--G~~T~NE~R~~ 368 (416) T protein:vir:45 311 YITCVCAELNFKFN---------D--EYV----NREFKFD----TTEIRVVD-EKTQAEIDKINIDS--GKMNIDEIRQR 368 (416) T ss_pred HHHHHHHHHhhhcc---------c--ccc----CceEEEe----chhhhccC-HHHHHHHHHHHHhC--CCcCHHHHHHH Confidence 44444444443321 1 121 1234444 33333222 23455555554332 35555555532 Q ss_pred HhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccc---CCCCC---CCCCCCCccccccccCCccccch Q lcl|NC_021072. 469 VLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDP---AMDPG---NAPPADDMSAQEGPAVDAGDAKR 530 (533) Q Consensus 469 IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~---~~~~~---~~~~~~d~~~~~~~~~~~~~~~~ 530 (533) |++ +.++.++....+ +.-+. ++.+........ ....-+|.-+ T Consensus 369 -~gl------------------~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~-~~~kgGe~n~ 416 (416) T protein:vir:45 369 -DGL------------------APIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATD-KKLKGGEENE 416 (416) T ss_pred -hCC------------------CCCCCCCcceEeecccccccccccccCcccccccc-cccCCCCCCC Confidence 333 334444322111 00000 000001000000 0011122111 No 133 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=96.16 E-value=0.00093 Score=37.21 Aligned_cols=406 Identities=13% Similarity=0.076 Sum_probs=171.1 Q ss_pred CCCCCCCcccceeecccccccccchhhhhh-HHHHHHHHHHhhhhcchhhhHH--------------------------- Q lcl|NC_021072. 22 FVQKDSMDGSQPIVGGGYYGYSVDFDGTVR-NEYELITRYREMVLQPECDSAV--------------------------- 73 (533) Q Consensus 22 ~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~-~~~~LI~~YR~m~~~pEvd~Av--------------------------- 73 (533) ..+...++ .-+.+- .-.....+|+.+..+.+-+..| T Consensus 1 ~~~~t~~~----------------~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~iv 64 (456) T protein:vir:79 1 MTASTPAE----------------WLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVR 64 (456) T ss_pred CCCCCHHH----------------HHHHHHHHHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHH Confidence 00000000 000000 0011122233333333322222 Q ss_pred HHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeE Q lcl|NC_021072. 74 DDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLT 153 (533) Q Consensus 74 deIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~ 153 (533) +..+.-+ -+.||.+...+ .+.+.+ ++..+...-+|+....++++.-++-|+-|.+.-.|. +|-. T Consensus 65 d~~~~~l-----~~~g~~~~~~~---d~~~~~----~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~e----dg~~ 128 (456) T protein:vir:79 65 DSVADRI-----IPNGITVGGSA---DSDLAL----RARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD----DGTA 128 (456) T ss_pred HHHHhhh-----ccCCeecCCCC---CccHHH----HHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCC----CCce Confidence 2222211 13455543211 112222 233444445788888999999999999888755542 3445 Q ss_pred EEEEcChhhceehhhccCCC--cCceeEEecccee------ec-cch-----hceecccccc-ccccCCcceeccchhhc Q lcl|NC_021072. 154 ELRYIDPRKIRKVTEYQQKR--PEQLRGEDINTQL------TQ-KAA-----EYYLYNPKGL-KNSTNQGMKIATDSVTY 218 (533) Q Consensus 154 elr~lDP~~i~~vr~~~~~~--~~~~~~~~~~~~~------~~-~~~-----e~~~y~p~~~-~~~~~~~~kI~~dai~y 218 (533) .++.++|+.+-.+..-.... .-..+++.-.+.. +. ... ..++|+.... ......+.-.+. ..+ T Consensus 129 ~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 206 (456) T protein:vir:79 129 TITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPV--GDA 206 (456) T ss_pred EEEEeccceeEEEEcCCCCCceEEEEEEEEecCCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeec--ccc Confidence 68899999885543211110 0011111111100 00 000 0011111100 000000000000 001 Q ss_pred cccc----cccCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEE Q lcl|NC_021072. 219 CHSG----IQDLNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKL 293 (533) Q Consensus 219 ~hsG----l~d~~~~~i~syL~~AiK~~NqLr-m~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~ 293 (533) .|-. ++..++...+|=++..+.....+. ++-+.++.-..+-.|.|-+.=.+- .+|.. T Consensus 207 ~~~~~~~pvv~~~N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~-~~~~~----------------- 268 (456) T protein:vir:79 207 VVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEH-RLPKV----------------- 268 (456) T ss_pred cCCCCceeEEEecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCc-ccccc----------------- Confidence 1100 011123334555665554443322 222333333333333333211110 01100 Q ss_pred EeeCCCCcc-ccccccchhHhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCcccccc Q lcl|NC_021072. 294 VYDANTGEI-KDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLG-ELEDVKYFQKKLYKALNVPSSRLETETTFNIGR 371 (533) Q Consensus 294 vYd~~TGev-~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLg-ei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~ 371 (533) | .+|+. .....+....--.|+ +..|.+|..++... ++ -.+-++-+-..++...++|..-|+..++ |. . T Consensus 269 --d-~~g~~i~~~~~~~~~~~~~~~----~~~~~~~~q~~~~~-~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~-N~-S 338 (456) T protein:vir:79 269 --D-ENGNAIDYASIFEAAPGALWE----LPPGVDIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQ-S 338 (456) T ss_pred --c-ccccccchhhhhhhhcccccc----CCCCcceeeecccC-hHHHHHHHHHHHHHHHhhcCCChhHhccccc-Cc-H Confidence 0 11110 000000000001122 11234454454432 22 2344677777888888999988875432 22 3 Q ss_pred hhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 372 AAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNT 451 (533) Q Consensus 372 ~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~ 451 (533) +..|.--+..+-.-+.+.|+.|..-+.+.++.-+.+.|.. ++ ..|++.|..-..=+. .+..+++.+ T Consensus 339 g~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~g~~--~~-----~~i~v~w~~~~~~s~-------~~~ada~~k 404 (456) T protein:vir:79 339 AEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES--VE-----DTVDVSFESPDRVTL-------GEKYSAASL 404 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--cc-----ccceEEeCCCCCcCH-------HHHHHHHHH Confidence 3455666666777789999999999999999888888852 22 247888864333222 344555555 Q ss_pred hhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCC Q lcl|NC_021072. 452 MDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDP 506 (533) Q Consensus 452 ~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~ 506 (533) +..- ...|.+.+ ..+|++|+++|++.+.+-..|..++...+|-+..+++--. T Consensus 405 l~~~--G~~~~~~~-~~~lg~~~~~i~~~e~~r~~~e~~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:79 405 AKAA--GESWASIR-RNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGSR 456 (456) T ss_pred HHhc--CCChHHHH-HhcCCCCHHHHHHHHHHHHHHHHHHHhhhHhhcCCCCCCC Confidence 5322 34666555 5789999999975444333333333333221111110000 No 134 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=96.12 E-value=0.00098 Score=37.08 Aligned_cols=435 Identities=11% Similarity=0.115 Sum_probs=176.0 Q ss_pred cceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhh--------------- Q lcl|NC_021072. 6 FGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECD--------------- 70 (533) Q Consensus 6 fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd--------------- 70 (533) .-+.+.+ +.++. ..+.....+.. .....+. -+.+|+.+..+.+-. T Consensus 1 ~~~~~~~-~~~~~--------~~~~~~~~i~~--------~i~~~~~---~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~ 60 (499) T protein:vir:10 1 MAVVIDK-DLLDD--------VNEPNIEAINY--------AIRELQN---RKKRLDKLSDYYNGKQEIEKHEFDNATVEA 60 (499) T ss_pred Cccchhh-hHHhh--------hhcCCHHHHHH--------HHHHHHH---HHHHHHHHHHHhccccchhcCCcCcCCCCc Confidence 1111111 11100 00000111100 0000111 111222222221111 Q ss_pred -----hHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecC Q lcl|NC_021072. 71 -----SAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDP 145 (533) Q Consensus 71 -----~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~ 145 (533) +-..-||+..+-+ .=+.||.+..++ +. ..++|..+++--+|+....++++.+.+.|+.|.+.-+|. T Consensus 61 ~ki~~n~~~~Iv~~~~~~-l~g~p~~~~~~~----~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~ 131 (499) T protein:vir:10 61 ANVMVNHAKYITDMNVGF-MTGNPVKYVAEK----GK----NIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKK 131 (499) T ss_pred ceeecchHHHHHHHHhhh-hcccCceeecCC----hh----HHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEecc Confidence 1112223322111 124566666543 22 233466677667899999999999999999999987775 Q ss_pred CCCC-------------CCeEEEEEcChhhceehhhccCCCcCc--eeEEecccee-eccchhceecccccccc------ Q lcl|NC_021072. 146 KNPR-------------GGLTELRYIDPRKIRKVTEYQQKRPEQ--LRGEDINTQL-TQKAAEYYLYNPKGLKN------ 203 (533) Q Consensus 146 ~~~~-------------~gI~elr~lDP~~i~~vr~~~~~~~~~--~~~~~~~~~~-~~~~~e~~~y~p~~~~~------ 203 (533) +... ..-..+..+||+..-.|.......... .+++...+.- ........+|.|..... T Consensus 132 ~g~~~~~~~~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~ 211 (499) T protein:vir:10 132 TDPISVRDELGNEKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTT 211 (499) T ss_pred cccccccccccccccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCC Confidence 4321 123558889999886654432211100 1111111100 00011112344432110 Q ss_pred ------------ccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccC Q lcl|NC_021072. 204 ------------STNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDV 270 (533) Q Consensus 204 ------------~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLr-m~EDalVIyRi~RAPeRrvfyIDv 270 (533) .+|.--+||.= .| +++....|=++..+.....+. ++=+....-+-+..|-+-+.-.+. T Consensus 212 ~~~~~~~~~~~~~~~~~g~vPvv--~~-------~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~ 282 (499) T protein:vir:10 212 MEVSANDPIVYDGENLFGAVPII--EF-------RNNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGL 282 (499) T ss_pred ccccCcceecccccCCCCccceE--Ee-------cCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcc Confidence 01111112210 11 123334566666655555554 334555555666777666553332 Q ss_pred CCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchH-HHHHHHHHH Q lcl|NC_021072. 271 GNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGEL-EDVKYFQKK 349 (533) Q Consensus 271 Gnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei-~DV~YF~~k 349 (533) +... .....+..+.+.--.+..|..+++|-...+.... .-+.-+.+. T Consensus 283 ~~~~--------------------------------~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~ 330 (499) T protein:vir:10 283 GDDK--------------------------------DDIQRLKRGAIEAPPREEGADIEWLTKSFDETQVNLLSQSIEND 330 (499) T ss_pred cccc--------------------------------chhhhhhhcceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHH Confidence 2211 1111111111111122223346666554444332 345666777 Q ss_pred HHHhcCCCccccCCCCcccccchh--hhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-CHhHHhhhhhceeEE Q lcl|NC_021072. 350 LYKALNVPSSRLETETTFNIGRAA--EITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVM-SLEEWDEMKEHIQFD 426 (533) Q Consensus 350 Ly~aL~VP~sRl~~~~~~~~g~~~--eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~-t~eew~~~~~~i~~~ 426 (533) +|+-..+|-- ..+. | .|..| .|..-......-+.+.++.|...+.++++.=+-+-++. ...+| ..+.+. T Consensus 331 I~~~s~~p~~--~~~~-~-~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~d~----~~i~i~ 402 (499) T protein:vir:10 331 IHKISYVPNM--NDEK-F-MGNVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIKGANDDA----SGCKIS 402 (499) T ss_pred HHHHhCcccC--Cchh-h-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccc----ccceEE Confidence 8888888831 1111 1 12222 23222222233455666667776666666544332221 22233 346777 Q ss_pred EeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcC--CCCCCCcccccCC Q lcl|NC_021072. 427 FIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAG--LIVDPMAEMDPAM 504 (533) Q Consensus 427 f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~--~~~~p~~~~~~~~ 504 (533) |...---.+ .+.+++++.+ .| .+|.+++++. |...++ .+++.++|++|.... ...+|...++|+. T Consensus 403 f~~~~p~n~-------~e~~~~~~kl---~g-~iS~et~~~~-l~~v~d-~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 469 (499) T protein:vir:10 403 LVANIPSNL-------SDVVNNVKNA---DG-IIPRKYTYSW-LPDVDN-PQDVIDEMNQQDAETIKKNQEALRGQDPDR 469 (499) T ss_pred eCCCCCCCH-------HHHHHHHHHH---hc-cCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHHHHhhhccCCCCC Confidence 765443333 3445555555 35 4999999977 555321 222233343333221 1111111122211 Q ss_pred CCCCCCCCCCcc-ccc--cccCCccccchh Q lcl|NC_021072. 505 DPGNAPPADDMS-AQE--GPAVDAGDAKRG 531 (533) Q Consensus 505 ~~~~~~~~~d~~-~~~--~~~~~~~~~~~~ 531 (533) ...+..++++.. ..+ ......+.+++- T Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (499) T protein:vir:10 470 LELEDKQDDSSENDKEAGSNHNQSHRTRAV 499 (499) T ss_pred CCCCCCCcccCCCCCCCccccccCCCCCCC Confidence 111111111000 000 001122334433 No 135 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=96.11 E-value=0.00099 Score=37.06 Aligned_cols=390 Identities=17% Similarity=0.165 Sum_probs=170.6 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |.. |.| ++ ++.. . ++ +...+ ......++.. . -...|..++.+|.|.+||+-|.+.+ T Consensus 1 Mg~--~~~--f~-~k~~-~-~~--~~~~~--~~~~~~~~~~-----~--------~~~~~~~~~~~~~V~~~I~~ia~~i 56 (403) T protein:vir:80 1 MGL--FNF--FR-RKTR-S-EP--TNAIS--WFLTQEAYDT-----L--------AIPGYTRLSDNPEVRMAVHKIAELI 56 (403) T ss_pred Ccc--ccc--cc-cccc-c-cc--cchhh--hhcccccccc-----c--------ccchhhhhhhhHHHHHHHHHHHHhh Confidence 543 432 22 1111 1 10 10000 0000000000 0 0222445777899999999998875 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcch----hhhhHHHHhh----hhc--CceeeeeeecCCCCCC Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFE----NRSYEIFRRW----YVD--GRLFYHKVIDPKNPRG 150 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~----~~~~~~fR~W----YvD--Gri~~hkvid~~~~~~ 150 (533) -. .|+.+--..-+..+. + ...+..+|+-. -.+.++++.+ +.+ |.-|+.++-| ..+ T Consensus 57 A~-----~p~~~~~~~~~g~~~----~---~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~---~~g 121 (403) T protein:vir:80 57 SS-----MTIHLMQNTDNGDIR----I---KNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYT---TSG 121 (403) T ss_pred hh-----CceEEEEecCCceee----c---CChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEc---CCC Confidence 42 344432111111111 1 22344444322 3566665543 444 5567665544 345 Q ss_pred CeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCc Q lcl|NC_021072. 151 GLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNM 230 (533) Q Consensus 151 gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~ 230 (533) -+++|.+|+|.+++.+... ++..+.... ..|.+ ...+++......+ ++-. T Consensus 122 ~~~~L~~l~p~~v~~~~~~-----~g~~~~y~~----------~~~~~-------~eiih~~~~~~~~--------~~~~ 171 (403) T protein:vir:80 122 LIDELIPLAPSKVSFVDTD-----TGYQIWYQG----------KAYNY-------DEVLHFIVNPDPE--------KPYM 171 (403) T ss_pred cEEEEEEEcCCeeEEEEcC-----CceEEEEee----------cccch-------hhEEEEeccCCCc--------Cccc Confidence 5999999999999753322 111111110 01221 1223333211111 1113 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccch Q lcl|NC_021072. 231 TLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMS 310 (533) Q Consensus 231 i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~ms 310 (533) ++|.+..+..+++.....+....-+--.-+--+-|..++. .+....+++..+.+..+|..- ..+|.+- T Consensus 172 G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~~~~~~~~~~~~~~~~~~~----~~~g~~~------- 239 (403) T protein:vir:80 172 GRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDA-ATAELSSEEGRNAVFKKYLEA----SEAGQPW------- 239 (403) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCChHHHHHHHHHHHHHHhhh----hhcCCee------- Confidence 5688999999999998888877666554455666777764 455555555555544444221 0122211 Q ss_pred hHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHH-HH Q lcl|NC_021072. 311 MLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIA-RL 389 (533) Q Consensus 311 mlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~-rL 389 (533) + +| -++..+++++.|. -..+.-++-.++-...+.++++||...|+- +..+ +. .|..|+. .| T Consensus 240 ~-----~~-~~~~~~~~~~~l~-~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~------~~~~---~~--~~~~f~~~~l 301 (403) T protein:vir:80 240 I-----IP-AELLDVEQVKPLS-LKDLAIHETVELDKRTVAGIFGVPAFLLGV------GKYD---KD--EYNNFINSTI 301 (403) T ss_pred e-----ec-ccccccceeccCC-HHHHHHHHHHHHhHHHHHHHhCCCHHHcCC------CCcc---HH--HHHHHHHHHH Confidence 0 11 0112334444442 233444555677788899999999998852 1111 11 1223332 23 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHH Q lcl|NC_021072. 390 RKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQV 469 (533) Q Consensus 390 r~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~I 469 (533) + -+...+.+.|-..|+ ++.++ .+.|..+ .+-..+ ..+|.+.+..+-.- -++|..-++. . T Consensus 302 ~-P~~~~ie~~l~~kll-----~~~~~-------~~~f~~~----~ll~~d-~~~~~~~~~~~~~~--Gi~t~NE~R~-~ 360 (403) T protein:vir:80 302 L-PIAKGIEQELTRKLL-----ISPDL-------YFKFNPR----SLYAYD-LKELAEVGSNMYVR--GLMEGNEVRD-W 360 (403) T ss_pred H-HHHHHHHHHHHHhcc-----CCCCc-------EEEeech----hhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH-H Confidence 3 223333444444443 44432 4555422 222222 34566666655332 4667777664 3 Q ss_pred hCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCC-cccc-ccccCCccccchhc Q lcl|NC_021072. 470 LKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADD-MSAQ-EGPAVDAGDAKRGE 532 (533) Q Consensus 470 L~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d-~~~~-~~~~~~~~~~~~~~ 532 (533) +++.+.+ . .+..+. ..+-.|.+. +..+ .....++++.-+.| T Consensus 361 ~gl~p~~--g---------gd~~~~-----------~~n~~pl~~~~~~~~~k~ge~~~~~~~~~ 403 (403) T protein:vir:80 361 LGLSPKE--G---------LSELVI-----------LENYIPLDKIGDQNKLKGGEKGGADGQTD 403 (403) T ss_pred hCCCCCC--C---------CCeEee-----------cccccchhhccchhhccCCCCCCCCCCCC Confidence 5654321 0 000000 000001100 0000 00001111111111 No 136 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=96.08 E-value=0.001 Score=36.96 Aligned_cols=424 Identities=12% Similarity=0.115 Sum_probs=181.3 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |-+.| . +.+.+..+... ...+.-....+..+.++.........+ ..+.+|.|-.||+-|.+.+ T Consensus 1 ~~~~~-----~--~~~~~~~~~~~-~~~~~~~~~~g~~~~~~~~~~~~~~~~---------~a~~~~~v~~~v~~ia~~i 63 (460) T protein:vir:10 1 MANRI-----I--RALRELTGLDN-KFNDAFIKYIGQTFTKYDNNGKTYLEQ---------GYNINPDVYSCISQMAAKT 63 (460) T ss_pred CchhH-----H--HHHhhhhccCC-CchHHHHHhhccccCCCccchhhhhHH---------HHhcchHHHHHHHHHHHhh Confidence 32221 1 11222221111 112222222232232221111111111 2456799999999998774 Q ss_pred eeecCCCceEEEEec-cCCC----------cHHHHHHHHHHHHHHH--------------HHhcchhhhhHHHH----hh Q lcl|NC_021072. 81 ICGNFDDVPVEVELS-NLKQ----------SDKIKKLIREEFAEIL--------------RLLDFENRSYEIFR----RW 131 (533) Q Consensus 81 iv~d~~~~~v~v~l~-~~~~----------S~~ik~~I~eeF~~i~--------------~lL~f~~~~~~~fR----~W 131 (533) -- .|+.|--. ..+- .+.+-..++...++.+ ..=|-...+.++.+ .+ T Consensus 64 A~-----lp~~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~l 138 (460) T protein:vir:10 64 VA-----VPYTIKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYM 138 (460) T ss_pred hh-----CceEEEeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHH Confidence 32 34444311 1110 1122223333333222 21233334444444 46 Q ss_pred hhcCceeeeeeecCCCCCCC-eEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccc--cccccCCc Q lcl|NC_021072. 132 YVDGRLFYHKVIDPKNPRGG-LTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKG--LKNSTNQG 208 (533) Q Consensus 132 YvDGri~~hkvid~~~~~~g-I~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~--~~~~~~~~ 208 (533) .+.|.-|..++-+......| +.+|.+|+|..++..... +.....+.. ...+|.+...+ ..+.+... T Consensus 139 ll~Gnay~~i~r~~~~~~~G~~~~L~~l~~~~v~v~~~~---~~~~~~~~~--------~~~~~~~~~~g~~~~~~~~ev 207 (460) T protein:vir:10 139 RLNGNCYFYLMSPDDGINAGVPSQMYVLPAHLIKIVLKD---DINLLSTDS--------PIKSYMLIQGDQFIEFNEDEV 207 (460) T ss_pred hhcCCeEEEEEecCCCccCceeEEEEEEcCceEEEEEcC---CCceeeeee--------eeeEEEEecCceeEEecccce Confidence 78899999888765543444 789999999999653221 111111111 11112222221 11222233 Q ss_pred ceeccchhhccccccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHh Q lcl|NC_021072. 209 MKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGR 288 (533) Q Consensus 209 ~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~ 288 (533) ++|....-.+. .+..+-.++|.+..|.+.+.....+++...-+--.-++-. ..+..-+.|.+..+++..+.+... T Consensus 208 ih~r~~~~~~~----~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~-~i~~~~~~l~~e~~~~~~~~~~~~ 282 (460) T protein:vir:10 208 IHTKYANPNFD----LQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFG-FIHGGSTGLTQPQADSLKQRLTEM 282 (460) T ss_pred EEEecCCCCcc----cccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-eeeecCCCCCHHHHHHHHHHHHHH Confidence 33332211110 0111224568899999988888888888776655556554 456666777777776666665555 Q ss_pred cccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCC--C Q lcl|NC_021072. 289 YRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETE--T 365 (533) Q Consensus 289 ~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~--~ 365 (533) |+..- ..|.+ + .++ .|.+++.|.-. ..+.-++-.+|..+.+.++++||...|+.. + T Consensus 283 ~~g~~----n~g~~------~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 341 (460) T protein:vir:10 283 DKSPD----RLSQI------A-GAS----------GEIAFTKISLNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGG 341 (460) T ss_pred hcCcc----ccCCc------e-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC Confidence 54210 11222 1 121 25566666432 222234556688899999999999999753 2 Q ss_pred cccccchhhhhHHhhhHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHH Q lcl|NC_021072. 366 TFNIGRAAEITRDEVKFQKF-IARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNE 444 (533) Q Consensus 366 ~~~~g~~~eItRDElkF~Kf-i~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~ 444 (533) +.+....++..+. |..+ |.-+-.++...|. ..|+. +.+. .....++|. |.++. .++. T Consensus 342 t~~~sn~e~~~~~---f~~~~l~P~~~~ie~~ln----~kl~~-----~~~~---~~~~~i~~d----~~~l~---~l~~ 399 (460) T protein:vir:10 342 GLNTGNLEEERKR---VVTDNIQPDLVILKQAFD----KKFIK-----RFKG---YENAVIEWD----ISELP---EMQT 399 (460) T ss_pred CCccccHHHHHHH---HHHHHHHHHHHHHHHHHH----HhhcC-----cccc---cCCceEEee----cchhh---hHHH Confidence 3333344444444 5554 3334444433333 33322 2111 112234443 22321 1222 Q ss_pred HHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccccccCC Q lcl|NC_021072. 445 RMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPAVD 524 (533) Q Consensus 445 R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~ 524 (533) +......+ +-.-+ ||..|+-+. ..-+..++|..+.- .-+.+..|.++.+.+....-+ T Consensus 400 d~~~~~~~--~~~g~------------~T~NE~R~~-------~g~~pi~~~~gD~~--~~~~n~~~~~~~~~~~~~~~~ 456 (460) T protein:vir:10 400 DMVAMASW--LNTIP------------VTPNEIRIA-------MKYETLNQDGMDIV--FMPSNKVRIDDVSNNLIDSAF 456 (460) T ss_pred HHHHHHHH--HhCCC------------CCHHHHHHH-------hCCCCCCCCCCCee--eecccccchhhcccccCCCcc Confidence 22222211 11122 454443321 12333333332210 011111111111111100000 Q ss_pred cccc Q lcl|NC_021072. 525 AGDA 528 (533) Q Consensus 525 ~~~~ 528 (533) +... T Consensus 457 nq~~ 460 (460) T protein:vir:10 457 NQNQ 460 (460) T ss_pred cCCC Confidence 1000 No 137 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=96.06 E-value=0.0011 Score=36.88 Aligned_cols=415 Identities=10% Similarity=0.042 Sum_probs=166.8 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) +..-.-|-. ..-... ... .....+..-+..+|.. -++++ .+.-. T Consensus 63 l~~Yy~g~~--~i~~~~-~~~----~~~~~~~~ki~~n~~k-------------~Iv~~----------------~~~yl 106 (511) T protein:vir:99 63 LSDYYEGKT--KNLVEL-TRR----KEEYMADNRVAHDYAS-------------YISDF----------------INGYF 106 (511) T ss_pred HHHHhcccC--cccccc-Ccc----cccccCcceeecchHH-------------HHHHH----------------HHhhh Confidence 222222210 000000 000 0000000001111111 11111 11111 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcCh Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDP 160 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP 160 (533) -+.|+.+.+++ +. ..+.+..+++.-+|+....++++...+-|+-|.+.-+|. .|-..+..+|| T Consensus 107 -----~g~p~~~~~~d----~~----~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~de----d~~~~i~~~~p 169 (511) T protein:vir:99 107 -----LGNPIQYQDDD----KD----VLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ----DDETRLYKSDA 169 (511) T ss_pred -----cccCceeecCc----hH----HHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCC----CCceEEEEEcc Confidence 14666666544 22 234566677677899999999999999999999877763 45678999999 Q ss_pred hhceehhhccCCCcCce---eEEecc--ceeecc-chhceecccccccc------------------ccCCcceeccchh Q lcl|NC_021072. 161 RKIRKVTEYQQKRPEQL---RGEDIN--TQLTQK-AAEYYLYNPKGLKN------------------STNQGMKIATDSV 216 (533) Q Consensus 161 ~~i~~vr~~~~~~~~~~---~~~~~~--~~~~~~-~~e~~~y~p~~~~~------------------~~~~~~kI~~dai 216 (533) +.+-+|..-... .... +++... ...... ..-.-+|.+..... .+|.--+||. + T Consensus 170 ~~~~~vyd~~~~-~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv--v 246 (511) T protein:vir:99 170 MSTFVIYDNTIE-RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPI--T 246 (511) T ss_pred ceeEEEEcCCCC-CceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCccce--E Confidence 999776543221 1111 111110 000000 00111344432210 0111112221 1 Q ss_pred hccccccccCCCCccchhHHHHHHHHHHHHHHH-HHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEe Q lcl|NC_021072. 217 TYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIE-DSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVY 295 (533) Q Consensus 217 ~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~E-DalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vY 295 (533) .| +++....|-++..+.....+..+- +....-+-++.|-+-+.-... T Consensus 247 ~~-------~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~------------------------- 294 (511) T protein:vir:99 247 EF-------SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN------------------------- 294 (511) T ss_pred Ee-------cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcc------------------------- Confidence 11 122334455665555544443322 222222334444433321000 Q ss_pred eCCCCccccccccchhHhhhccccc--------CCCCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCCccccCCCCc Q lcl|NC_021072. 296 DANTGEIKDDKKFMSMLEDFWLPRR--------EGGRGTEISTLPGGQNLGELE-DVKYFQKKLYKALNVPSSRLETETT 366 (533) Q Consensus 296 d~~TGev~~d~~~msmlEDywLpRR--------eggrgTEIsTLpGg~nLgei~-DV~YF~~kLy~aL~VP~sRl~~~~~ 366 (533) ..++.+..+++. --.|++.. ..+.|..+..|-...+...+. -+.-+.+.+|+...+|---.+.-+| T Consensus 295 -~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~g 369 (511) T protein:vir:99 295 -LDPVEVRKQKEA----NVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG 369 (511) T ss_pred -cCchhhcccccc----cceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccc Confidence 001111100000 01222221 112234455554444433333 3566677788888888532222111 Q ss_pred ccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHH Q lcl|NC_021072. 367 FNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERM 446 (533) Q Consensus 367 ~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~ 446 (533) |. .+..|..-+.....-+.+.++.|..-+.+.++.=+-+-++...-++..-...+.+.|....--.+ .+.+ T Consensus 370 -n~-Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~-------~e~~ 440 (511) T protein:vir:99 370 -TQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSL-------IEEL 440 (511) T ss_pred -cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCH-------HHHH Confidence 11 12233333333444466666777777766665432222222222222222346777764322222 2344 Q ss_pred HHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccccccCCcc Q lcl|NC_021072. 447 NQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPAVDAG 526 (533) Q Consensus 447 ~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~ 526 (533) +++..+. | .+|.+++++. |...+ +.+++.++|++|....+ +.. +.....+.+ +.++...++.+ -+.. T Consensus 441 ~~~~kl~---G-iiS~et~l~~-l~~v~-D~~~E~~ri~~E~~~~~-~~~--~~~~~~~~~---~~~~~~~~~~~-~~~~ 507 (511) T protein:vir:99 441 KAYIDSG---G-KISQTTLMSL-FSFFQ-DPELEVKKIEEDEKESI-KKA--QKNMYQDPR---NINDDEQDDST-KDSI 507 (511) T ss_pred HHHHHHh---c-cCCHHHHHHh-CCCCC-CHHHHHHHHHHHHHHHH-HHH--hhcccccCC---CCCCCCCCCCC-cCcc Confidence 4555553 4 4999999987 55543 23444555555554321 100 111111111 11111111111 1111 Q ss_pred ccch Q lcl|NC_021072. 527 DAKR 530 (533) Q Consensus 527 ~~~~ 530 (533) |.+| T Consensus 508 d~~e 511 (511) T protein:vir:99 508 DKKE 511 (511) T ss_pred cccC Confidence 2222 No 138 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=96.05 E-value=0.0011 Score=36.87 Aligned_cols=335 Identities=14% Similarity=0.183 Sum_probs=152.6 Q ss_pred CCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhc----chhhhhHHHHh----hhhcCceeeeeeecCCCCCCCeEEEE Q lcl|NC_021072. 85 FDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLD----FENRSYEIFRR----WYVDGRLFYHKVIDPKNPRGGLTELR 156 (533) Q Consensus 85 ~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~----f~~~~~~~fR~----WYvDGri~~hkvid~~~~~~gI~elr 156 (533) --..|+.|- .+.+ .+ ...++++|+ -...+.++.+. |.+.|.-|..++-+. .+-+++|. T Consensus 1 ia~lp~~~~-~~~~---~~-------~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~---~G~~~~L~ 66 (348) T protein:vir:93 1 MASLPLKMY-EDYK---VV-------NTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDI---YHQPSKLF 66 (348) T ss_pred CcccceEeE-ecCc---Cc-------ccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECC---CCcEEEEE Confidence 233444442 1211 11 123444443 23355555554 677899999877654 44589999 Q ss_pred EcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCccchhHH Q lcl|NC_021072. 157 YIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLH 236 (533) Q Consensus 157 ~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~ 236 (533) .|+|..++.+..- ......+ .++.+.+ +.+.++.+.|.+.. +....+.-.++|.|. T Consensus 67 ~l~~~~v~~~~~~---~~~~~~y--------------~~~~~~g------~~~~~~~~eiih~r-~~~~~~~~~G~s~~~ 122 (348) T protein:vir:93 67 LLNPDVVEMLIEN---QSRELYY--------------SIHAATG------NKLIVHNMDMLHFK-HIVASNMVQGISPID 122 (348) T ss_pred EEcCCceEEEEeC---CCcEEEE--------------EEEcCCC------eEEEEccccEEEec-CCCCCCceeeccHHH Confidence 9999999653221 1111110 0111111 12223333332221 111112223467788 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhc Q lcl|NC_021072. 237 KAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFW 316 (533) Q Consensus 237 ~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDyw 316 (533) .|..++.....++... +....+.| . +...-.+++-+.++++..+.+...|. +.|.+ + . T Consensus 123 ~~~~~i~~~~~~~~~~-~~~~~~~~-~-~i~~~~~~l~~e~~~~~~~~~~~~~~-------n~~~~------~-v----- 180 (348) T protein:vir:93 123 VLKNTTDFDNAVRTFN-LTEMQKPD-S-FMLKYGSNVSTEKRQQVLEDFKQYYE-------ENGGI------L-F----- 180 (348) T ss_pred HHHHHHHHHHHHHHHH-HHhcCCCc-e-eEEecCCCCCHHHHHHHHHHHHHHhh-------cCCCe------e-e----- Confidence 8888777666666553 33444333 2 33334456666666655555444332 12321 1 1 Q ss_pred ccccCCCCccceeecCCCCCcchHHH---HHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHH-HHHHHH Q lcl|NC_021072. 317 LPRREGGRGTEISTLPGGQNLGELED---VKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFI-ARLRKR 392 (533) Q Consensus 317 LpRReggrgTEIsTLpGg~nLgei~D---V~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi-~rLr~~ 392 (533) | ..|.+++.|. .+..+++= -+|..+.+.++++||...|+..++-+.....+..+. |.+++ .-+-.+ T Consensus 181 l-----~~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~~~~---~~~~~l~P~~~~ 250 (348) T protein:vir:93 181 Q-----EPGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRF---YLQHTLLPIVKQ 250 (348) T ss_pred c-----CCCceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHHHHHHHHH Confidence 1 1356677664 33444333 346788899999999999987666666666665554 54433 333232 Q ss_pred HHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCC Q lcl|NC_021072. 393 FSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQ 472 (533) Q Consensus 393 fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~ 472 (533) +...| -..++++.+|. ....|.|. +..+.... +..|.+++..+-. .-+++.+-++. .+++ T Consensus 251 ie~~l---------~~~l~~~~~~~---~g~~i~fd----~~~l~~~d-~~~~a~~~~~~~~--~G~~T~NE~R~-~~g~ 310 (348) T protein:vir:93 251 YEEEF---------NRKLLTKTDRE---KNRYFKFN----VKSYLRAD-SATQAEVYFKAVR--SGYYTINDIRE-WEDL 310 (348) T ss_pred HHHHH---------HHhhCCccccc---CcceEEee----chhhhccC-HHHHHHHHHHHHh--CCCCCHHHHHH-HhCC Confidence 22222 22344555554 12334454 22332222 4566666665533 24666666664 3555 Q ss_pred CHHHHHHHHHHHHHhhhcCCCCCCCc-ccccCCCCCCCCCCCCc Q lcl|NC_021072. 473 TDQEIKEIDKQIDSEREAGLIVDPMA-EMDPAMDPGNAPPADDM 515 (533) Q Consensus 473 tDeeI~e~~kqi~~E~~~~~~~~p~~-~~~~~~~~~~~~~~~d~ 515 (533) .+-+ --++-+ .....++ +++ .+.+....||+...+++ T Consensus 311 ~p~~--ggD~~~---~~~n~~~-~~~~~~~~~~~~gg~~n~~~~ 348 (348) T protein:vir:93 311 PPVE--GGDKPL---ISGDLYP-IDTPLELRKSLKGGDKNVNES 348 (348) T ss_pred CCCC--CcCeEe---ecccccc-cccchhhcccccCCCCCcCCC Confidence 3321 000000 0000001 000 00000011111111111 No 139 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=95.95 E-value=0.0012 Score=36.57 Aligned_cols=436 Identities=12% Similarity=0.123 Sum_probs=171.0 Q ss_pred cccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhh------------ Q lcl|NC_021072. 4 QLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDS------------ 71 (533) Q Consensus 4 ~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~------------ 71 (533) -||-|.-.+ ...+.-|..... .... .-..+-.... -....+|+.+..+.+-+. T Consensus 1 ~~~~~~~~~------~~~~~~~~~~~~-l~~~------~i~~li~~~~--~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~ 65 (506) T protein:vir:94 1 MDYDLTEHK------QANLIYQESLEN-LTPN------KIMKFITHHF--NYQRPRLEMLDDYYQGYNLKILDKQSRRHE 65 (506) T ss_pred CCcchhhhh------cceeecccchhc-CCHH------HHHHHHHHHH--HHHHHHHHHHHHHhcCCCcccccccccccc Confidence 233322222 122222221111 1000 0000000000 011122333333222211 Q ss_pred -----------HHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeee Q lcl|NC_021072. 72 -----------AVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYH 140 (533) Q Consensus 72 -----------AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~h 140 (533) -...||+-.. .=.-+.|+.+.+++ ++. .+.+..+++--+|+....++++.+.+-|+.|.+ T Consensus 66 ~~~~~~ki~~n~~~~Iv~~~~-~~l~G~p~~~~~~d----~~~----~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~ 136 (506) T protein:vir:94 66 DGKADHRATHSFAKYIADFQT-SYSVGNPINVKLPD----DGS----NSGFDTFNKANDVDAENYDLFLDMSRYGRAYEY 136 (506) T ss_pred ccCCcceeecchHHHHHHHhh-hhhcccCceeecCc----chH----HHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEE Confidence 1222222211 01135677766554 223 345666666678999999999999999999998 Q ss_pred eeecCCCCCCCeEEEEEcChhhceehhhccCCCcCc---eeEEeccc----eeeccchhceeccccccc----------- Q lcl|NC_021072. 141 KVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQ---LRGEDINT----QLTQKAAEYYLYNPKGLK----------- 202 (533) Q Consensus 141 kvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~---~~~~~~~~----~~~~~~~e~~~y~p~~~~----------- 202 (533) .-+| .+|-..+..+||+.+-+|..-.... .. ++++.... .......-..+|.+.... T Consensus 137 v~~d----ed~~~~i~~~~p~~~~~v~dd~~~~-~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~ 211 (506) T protein:vir:94 137 VYRG----EDNEEHLAKLDPLDTFVIYSTDVDP-KPIMAVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIMGKM 211 (506) T ss_pred EEec----CCCeeEEEEEcccceEEEecCCCCC-ceEEEEEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCccce Confidence 7775 3467889999999997766432211 11 11111110 000000011122222111 Q ss_pred --cccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHH--------HHHHHHHHHHHHhcCc-cceEEEccCC Q lcl|NC_021072. 203 --NSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQL--------RMIEDSLVIYRLSRAP-ERRIFYIDVG 271 (533) Q Consensus 203 --~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqL--------rm~EDalVIyRi~RAP-eRrvfyIDvG 271 (533) ...|.--+||.-. | +|+....|-++..+.....+ ..+++..-.+++...- .....-.+.. T Consensus 212 ~~~~~~~~g~vPvv~--~-------~n~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~ 282 (506) T protein:vir:94 212 QVDTTKPITTFPVVE--F-------KNSNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMM 282 (506) T ss_pred eccccccCCccceEE--e-------cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhcc Confidence 0011111222110 1 12233345555544443333 3445444444444331 1111111111 Q ss_pred CC-----------chHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCcc----ceeecCCCCC Q lcl|NC_021072. 272 NL-----------PKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGT----EISTLPGGQN 336 (533) Q Consensus 272 nl-----------pk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgT----EIsTLpGg~n 336 (533) .. ......+++.+ |. ++++ ++++-..+..|+ .+..|--..+ T Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~~~-~~--~~~~---------------------~~~~~~~~~~~~~~~~d~~~l~~~~~ 338 (506) T protein:vir:94 283 NTIDPNDEDAMAKLAKDKLELIKE-MK--DANM---------------------LLLKSGMTVNGTQTSVDAKYINKTYD 338 (506) T ss_pred ccccccccccccccccchhHHHhh-hh--hcCe---------------------eeecccccccCccccccceeeeecCC Confidence 10 00111111111 00 1111 222222221111 2222222122 Q ss_pred cch-HHHHHHHHHHHHHhcCCCccccCCCCcccccc--hhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCH Q lcl|NC_021072. 337 LGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGR--AAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSL 413 (533) Q Consensus 337 Lge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~--~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~ 413 (533) +.. -.-+.-+.+.+|...++|- +..+ .|. |. |..|.--+..-..-+.+.+..|..-+.++++.=+-+-++. . T Consensus 339 ~~~~~~~~~~l~~~I~~~s~~p~--~~~~-~~~-~n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~-~ 413 (506) T protein:vir:94 339 VVGSEAYKKRVAGDIHKFSHTPD--LTDE-NFA-SNSSGVAMQYKVLGTVELASTKRRMFERGLYARYQIISDIENSI-H 413 (506) T ss_pred HHHHHHHHHHHHHHHHHHhCccc--cccc-ccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-C Confidence 221 2335556778888899984 2222 111 22 2223333333335577777777777777776533222221 1 Q ss_pred hHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCC Q lcl|NC_021072. 414 EEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLI 493 (533) Q Consensus 414 eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~ 493 (533) ..++.-...+.+.|...---.+... ++++..+ +| .+|.++++.. |..+++. +++.++|++|.... - T Consensus 414 ~~~~~d~~~i~i~f~~~~p~d~~e~-------a~~~~kl---~g-~iS~et~~~~-lp~v~d~-~~E~~ri~~E~~~~-~ 479 (506) T protein:vir:94 414 GDWTFDPQELTFTFRDNLPADNISQ-------IKALVQA---GA-TLPQKYLYQQ-LPGVTNP-QDIVDMMKEQSANG-D 479 (506) T ss_pred CccccccccceEEeCCCCCcCHHHH-------HHHHHHH---hc-cCChHHHHHh-CCCCCCH-HHHHHHHHHHHHHH-h Confidence 1122122347788865444334333 3344444 34 5999999977 6666532 23444555555432 1 Q ss_pred CCCCcccccCCCCCCCCCCCCccccccccCC Q lcl|NC_021072. 494 VDPMAEMDPAMDPGNAPPADDMSAQEGPAVD 524 (533) Q Consensus 494 ~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~ 524 (533) + . +++....+..+..++...++...+. T Consensus 480 ~--~--~~~~~~~~~~~~~~~~~~~~~~e~~ 506 (506) T protein:vir:94 480 Y--S--FDQNGVISNDGQTNTTATQTDEEVR 506 (506) T ss_pred h--c--chhhcCCCcccCccccccccccCCC Confidence 1 1 1111111111111111111111122 No 140 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=95.93 E-value=0.0012 Score=36.52 Aligned_cols=441 Identities=11% Similarity=0.098 Sum_probs=173.8 Q ss_pred CC-ccccceeeeccccccccCCCCCCCCCcccceeecccccccc-cch-hhhh----hHHHHHHHHHHhhhhcchhhh-- Q lcl|NC_021072. 1 MS-NQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYS-VDF-DGTV----RNEYELITRYREMVLQPECDS-- 71 (533) Q Consensus 1 ~~-~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~-~~~-~~~~----~~~~~LI~~YR~m~~~pEvd~-- 71 (533) |. .+|.. +-++-+-++.-+..|-... ++.+....+.... ..+ .-.+ ..-...+.+|+.+..+.+-+. T Consensus 1 ~~~~~~~~---~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I 76 (492) T protein:vir:94 1 MQFIQLIS---QVAQALIKGGNILYPSQPT-QTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDI 76 (492) T ss_pred ChHHHHHH---HHHHHHhcCCceeecCccc-hhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 11 00000 0011111111111111111 0111111111100 000 0000 011122334444444433221 Q ss_pred -------------------------HHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhH Q lcl|NC_021072. 72 -------------------------AVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYE 126 (533) Q Consensus 72 -------------------------AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~ 126 (533) -...||+-.. .=.-+.|+.+..++.+..+ .++.+++ =+|+....+ T Consensus 77 ~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~-~yl~G~p~~~~~~d~~~~~--------~l~~~~~-n~~~~~~~~ 146 (492) T protein:vir:94 77 VKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKV-SYIVGKPIAFKHTDDEVVK--------RIDEVLG-NRFDDKLHS 146 (492) T ss_pred ccccccccccccccccccccccccchHHHHHHHHH-hhhcccCceeccCchHHHH--------HHHHHHh-ccHHHHHHH Confidence 1111222111 1123677777665532222 2333332 268888889 Q ss_pred HHHhhhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeE---Eeccc----e-eeccchhceeccc Q lcl|NC_021072. 127 IFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRG---EDINT----Q-LTQKAAEYYLYNP 198 (533) Q Consensus 127 ~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~---~~~~~----~-~~~~~~e~~~y~p 198 (533) .++..++-|+-|.+.-+| ++|-..++.+||+.+-.+..-. ...+...+ +.... + .+.....+|.+.. T Consensus 147 ~~~~a~~~G~a~~~v~~d----~dg~~~~~~~~p~~~~~v~d~~-~~~~~~a~ir~~~~~~~~~~~~y~~~~v~~~~~~~ 221 (492) T protein:vir:94 147 VLTGASNKGIEWLHPYLD----EEGEFKLFRVPAEQGIPIWTDK-EHEELEAFIRMYKLENETKVEYWDKVTVNYYVYEN 221 (492) T ss_pred HHHHHhhCCeEEEEEEec----CCCceEEEEEcccceEEEEcCC-CCCceEEEEEEEeeccceeEEEEecCeEEEEEEec Confidence 999999999999997665 3466789999999986654311 11111111 11100 0 0111112222222 Q ss_pred ccccccc---CCcceeccchhhcccccccc----CCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccC Q lcl|NC_021072. 199 KGLKNST---NQGMKIATDSVTYCHSGIQD----LNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDV 270 (533) Q Consensus 199 ~~~~~~~---~~~~kI~~dai~y~hsGl~d----~~~~~i~syL~~AiK~~NqLr-m~EDalVIyRi~RAPeRrvfyIDv 270 (533) .+..... .+...+....-. .|-++ +++....|=++..+....-+. ++=+....-+.+..|-+-+.-.+. T Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~---~g~vPvv~~~nn~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~ 298 (492) T protein:vir:94 222 GSLIPDYSNNLENSKTHFSTGS---WGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDD 298 (492) T ss_pred CeeeeccccccccccccccccC---CCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc Confidence 1110000 000000000000 01100 122334566665544444443 233444445666666544433222 Q ss_pred CCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchH-HHHHHHHHH Q lcl|NC_021072. 271 GNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGEL-EDVKYFQKK 349 (533) Q Consensus 271 Gnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei-~DV~YF~~k 349 (533) .+.+. ++..+..++..--+++ ..+.+|-...+.+.+ .-+.-+.+. T Consensus 299 ~~~~~--------------------------------~~~~~~~~~~~~~~~~--~~~~~l~~~~~~~~~~~~~~~l~~~ 344 (492) T protein:vir:94 299 QELPE--------------------------------FKRLLRYYGAIKVSDN--GGVDTIQVEVPVENSKKYLDELYQK 344 (492) T ss_pred ccchh--------------------------------hHHHHhhccceecCCC--CcceeEeccCCHHHHHHHHHHHHHH Confidence 22111 1111112222211221 234555443333332 344667778 Q ss_pred HHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEec Q lcl|NC_021072. 350 LYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIA 429 (533) Q Consensus 350 Ly~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~ 429 (533) +|+-.++|---.+.-++ +. .|..|..-+.....-+.+.++.|..-+.++++.=+-+-|+ ..+|. .|.+.|.. T Consensus 345 I~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~--~~~~~----~i~v~f~~ 416 (492) T protein:vir:94 345 IMLFGQAVDFSSDKFGS-AP-SGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI--KGEHK----DVDISFNY 416 (492) T ss_pred HHHHhCCcCCCcccccc-Cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--Ccccc----eeeEEecC Confidence 88888988422211111 11 2223444444455557777777777777777654333343 33444 46677754 Q ss_pred cchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCC Q lcl|NC_021072. 430 DNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNA 509 (533) Q Consensus 430 Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~ 509 (533) ..--.+ .+.++++..+ +| .+|.+++++. |..++ +.+++.++|++|..+..-..|+.. +.+ .+. T Consensus 417 ~~p~~~-------~e~~~~~~kl---~g-iiS~et~~~~-l~~v~-d~~~E~eri~~E~~~~~~~~~~~~---~~~-~~~ 479 (492) T protein:vir:94 417 NKVANT-------ELQVQTAQQS---MG-IVSHETVLEN-HPFVE-DLQAELERIEQEQMEYNKQLPNLD---DGG-ADS 479 (492) T ss_pred CCCCCH-------HHHHHHHHHH---hc-cCchHHHHHh-CCCCC-CHHHHHHHHHHHHHHHHhhccccc---ccc-CCC Confidence 333222 2334555555 35 4899999976 56554 233444445555432221111110 000 011 Q ss_pred CCCCCccccccccCCccccchhc Q lcl|NC_021072. 510 PPADDMSAQEGPAVDAGDAKRGE 532 (533) Q Consensus 510 ~~~~d~~~~~~~~~~~~~~~~~~ 532 (533) ++.++.+.+ ++.| T Consensus 480 ~~~~~~~~~----------~e~e 492 (492) T protein:vir:94 480 AQQQERSNN----------KESE 492 (492) T ss_pred CccccCCcc----------ccCC Confidence 111111111 1111 No 141 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=95.88 E-value=0.0013 Score=36.39 Aligned_cols=435 Identities=12% Similarity=0.100 Sum_probs=168.6 Q ss_pred CC-ccccceeeeccccccccCCCCCCCCCcccceeecccccccc-cchhh-h----hhHHHHHHHHHHhhhhcchhhh-- Q lcl|NC_021072. 1 MS-NQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYS-VDFDG-T----VRNEYELITRYREMVLQPECDS-- 71 (533) Q Consensus 1 ~~-~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~-~~~~~-~----~~~~~~LI~~YR~m~~~pEvd~-- 71 (533) |. .+|.. +-++-+-++.-+..|-... ++......+.... ..+.. - +..-...+.+|+.+..+.+-.. T Consensus 1 ~~~~~~~~---~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i 76 (492) T protein:vir:97 1 MQFIQLIS---QVAQALIKGGNILYPSQPT-QTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDI 76 (492) T ss_pred ChHHHHHH---HHHHHHhcCCceeeccchh-hhhHhhhcccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcc Confidence 11 00000 0001111111111111000 0001111111100 00000 0 0011122334444444332221 Q ss_pred -------------------------HHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhH Q lcl|NC_021072. 72 -------------------------AVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYE 126 (533) Q Consensus 72 -------------------------AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~ 126 (533) -..-||+... .=.-+.|+.+..++. ... +.++.+++ =+|+....+ T Consensus 77 ~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~-~yl~g~p~~~~~~d~----~~~----~~l~~~~~-n~~~~~~~~ 146 (492) T protein:vir:97 77 VKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKV-SYIVGKPIAFKHTDD----EVV----KRIDEVLG-NRFDDKLHS 146 (492) T ss_pred ccccccccccccccccccccccccchHHHHHHHHh-hhhcccCceeccCch----HHH----HHHHHHHh-ccHHHHHHH Confidence 1112222211 112356677766552 222 23333433 268888999 Q ss_pred HHHhhhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeE---Eeccce-----eeccchhceeccc Q lcl|NC_021072. 127 IFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRG---EDINTQ-----LTQKAAEYYLYNP 198 (533) Q Consensus 127 ~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~---~~~~~~-----~~~~~~e~~~y~p 198 (533) +.+.+++-|+-|.+.-+| .+|-..++.+||+.+-++..-.. ......+ +..-.. .+.....+|.+.. T Consensus 147 ~~~~~~~~G~a~~~v~~d----~dg~~~~~~~~p~~~~~i~d~~~-~~~~~~~vr~~~~~~~~~~~~y~~~~v~~~~~~~ 221 (492) T protein:vir:97 147 VLTGASNKGIEWLHPYLD----EEGEFKLFRVPAEQGIPIWTDKE-HEELEAFIRMYKLENETKVEYWDKVTVNYYVYEN 221 (492) T ss_pred HHHHHhhcCeEEEEEEec----CCCceEEEEEcccceEEEEcCCC-CCceEEEEEEEeeccceeEEEEecCeEEEEEEec Confidence 999999999999987765 34667899999999977543211 1111111 111000 0111111222211 Q ss_pred ccccc-------------ccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccce Q lcl|NC_021072. 199 KGLKN-------------STNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERR 264 (533) Q Consensus 199 ~~~~~-------------~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLr-m~EDalVIyRi~RAPeRr 264 (533) ..... .+|.--+||.=.+ +++....|=++..+....-+. ++=+....-+-++-|-+- T Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~---------~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~ 292 (492) T protein:vir:97 222 GSLIPDYSNNLENSKTHFSTGSWGKIPFIPF---------KNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYV 292 (492) T ss_pred CeeeecccccccccccccccCCCCCcceEEe---------cCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceee Confidence 11100 0111111111100 112334455555444443333 233334444555555444 Q ss_pred EEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcch-HHHH Q lcl|NC_021072. 265 IFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGE-LEDV 343 (533) Q Consensus 265 vfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLge-i~DV 343 (533) +.-.+.-+.+ + +...+..++...-++ |..+.+|-...+... ..-+ T Consensus 293 ~~g~~~~~~~-----~---------------------------~~~~~~~~~~~~~~~--~~~~~~l~~~~~~~~~~~~~ 338 (492) T protein:vir:97 293 LKNYDDQELP-----E---------------------------FKRLLRYYGAIKVSD--NGGVDTIQVEVPVENSKKYL 338 (492) T ss_pred eecCCcccch-----h---------------------------HHHHHhhccceecCC--CCcceeEeccCCHHHHHHHH Confidence 3322211111 1 111111111111111 223555544333332 2334 Q ss_pred HHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhce Q lcl|NC_021072. 344 KYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHI 423 (533) Q Consensus 344 ~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i 423 (533) .-+.+.+|+-..+|---++.-++ +. .|..|.--+.....-+.+.++.|..-+.++++.=+-+-|+ ..+|.. | T Consensus 339 ~~L~~~I~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~--~~~~~~----i 410 (492) T protein:vir:97 339 DELYQKIMLFGQAVDFSSDKFGS-AP-SGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI--KGEHKD----V 410 (492) T ss_pred HHHHHHHHHHhCCCCCCcccccc-Cc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--Ccccce----e Confidence 55667778888888421111111 11 1222433444445556777788888887777753333343 335554 5 Q ss_pred eEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccC Q lcl|NC_021072. 424 QFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPA 503 (533) Q Consensus 424 ~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~ 503 (533) .+.|....--.+ .+.++++.++. | .+|.+++++. |...++ .+++.++|++|..+..-..++.. + T Consensus 411 ~v~f~~~~p~~~-------~e~a~~~~kl~---G-~iS~et~l~~-l~~v~d-~~~Eleri~~E~~~~~~~~~~~~---~ 474 (492) T protein:vir:97 411 DISFNYNKVANT-------ELQVQTAQQSM---G-IVSHETVLEN-HPFVED-LQAELERIEQEQTEYNKQLPNLD---D 474 (492) T ss_pred eEEecCCCCCCH-------HHHHHHHHHHh---c-cCchHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHhhhccc---c Confidence 677753332222 23345555553 4 4999999987 554431 23344455555432211111110 0 Q ss_pred CCCCCCCCCCCccccccccCCccccchhc Q lcl|NC_021072. 504 MDPGNAPPADDMSAQEGPAVDAGDAKRGE 532 (533) Q Consensus 504 ~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 532 (533) .+.. ..+.++.. +.+++| T Consensus 475 ~~~~-~~~~~~~~----------~~~~~e 492 (492) T protein:vir:97 475 GGAD-SAQQQERS----------NNKESE 492 (492) T ss_pred CCCC-CCcccccc----------cccccC Confidence 0000 00001111 111111 No 142 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=95.84 E-value=0.0014 Score=36.28 Aligned_cols=405 Identities=12% Similarity=0.092 Sum_probs=172.4 Q ss_pred cceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcceeeecC Q lcl|NC_021072. 6 FGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNETICGNF 85 (533) Q Consensus 6 fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvneaiv~d~ 85 (533) ..|+ +..+...+.+ .|.. ++-. ...+++..+.....+. -...+++|-|..||+-|.+.+-- T Consensus 1 m~~~--~~~~~~~~~~--~~~~-~~~~---~~~~g~~~s~~~~~v~--------~~~al~~~~v~~cv~~ia~~ia~--- 61 (419) T protein:vir:80 1 MFFS--RQLLSNLGQT--QPGS-GGWV---SALLGSARSEAGQVVT--------PASALSLTVLQNCVTLLAESIAQ--- 61 (419) T ss_pred CCcc--cccccccCcC--CCCc-chhh---HHhhcccccccCcccC--------hHHhhccHHHHHHHHHHHHhhcc--- Confidence 1121 1011111111 1100 1111 1111111111001111 12345789999999999987543 Q ss_pred CCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcc----hhhhhHHHH----hhhhcCceeeeeeecCCCCCCCeEEEEE Q lcl|NC_021072. 86 DDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDF----ENRSYEIFR----RWYVDGRLFYHKVIDPKNPRGGLTELRY 157 (533) Q Consensus 86 ~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f----~~~~~~~fR----~WYvDGri~~hkvid~~~~~~gI~elr~ 157 (533) .|+.|--..-+..+.. .-..++++|+. ...+.++.+ .+.+.|.-|.-++-+. .+-+++|.+ T Consensus 62 --lp~~~~~~~~~~~~~~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~---~G~~~~L~~ 130 (419) T protein:vir:80 62 --LPVELYERSGDDRKPA------TDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQ---DGVIQGLYP 130 (419) T ss_pred --CceEEEEecCCCcccc------cccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECC---CCcEEEEEE Confidence 4555432111111111 11234455543 234545444 4677799998877654 445899999 Q ss_pred cChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCccchhHHH Q lcl|NC_021072. 158 IDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHK 237 (533) Q Consensus 158 lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~ 237 (533) |+|..++.++. .+....+. +.... .......+++.. ...++-.++|-++. T Consensus 131 i~~~~v~i~~~-----~~~~~~y~-------------~~~~~--~~~~~~i~h~~~----------~~~d~~~G~s~i~~ 180 (419) T protein:vir:80 131 LDNEAVTVMKG-----PDLKPMYR-------------VAGAD--PLPQRLVHHVRW----------MSINGYTGLSPVLL 180 (419) T ss_pred ecCceEEEEEC-----CCceEEEE-------------EcCcc--ccchhheEEecC----------CCCCCcccccHHHH Confidence 99999965322 11111111 00000 011112222221 12233456789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcc Q lcl|NC_021072. 238 AIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWL 317 (533) Q Consensus 238 AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywL 317 (533) |..++.....+++...=+---.+--+-++.++... +..+.++-+..+...+....-=....|.+ + +++ T Consensus 181 ~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~n~g~~------~-vl~---- 248 (419) T protein:vir:80 181 HANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDA-PALKDQASVDRITDGWNAKFGGSGNAKKV------A-LLQ---- 248 (419) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCC-CcccCHHHHHHHHHHHHHHhcCccccCCc------e-ecC---- Confidence 99999988888877665544556566677776422 21111222222232332221100111221 1 221 Q ss_pred cccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHH-HHHHHHHHH Q lcl|NC_021072. 318 PRREGGRGTEISTLPG-GQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFI-ARLRKRFSE 395 (533) Q Consensus 318 pRReggrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi-~rLr~~fs~ 395 (533) .|.+++-|.- .+.+.-++-.++..+.+.++++||...|+..++-+.....+..+. |..++ .-+..+ T Consensus 249 ------~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~~---f~~~~l~P~~~~--- 316 (419) T protein:vir:80 249 ------EGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQ---FVIYTLLPWVKR--- 316 (419) T ss_pred ------CCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHH---HHHHHHHHHHHH--- Confidence 2455665532 122223344567789999999999999975444444444444443 65552 222222 Q ss_pred HHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHH Q lcl|NC_021072. 396 LFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQ 475 (533) Q Consensus 396 if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDe 475 (533) +.+.|-.. ++++.++.. ..+.|.. .++... -+..|++.+..+-. .-+++.+-++. ++++.+ T Consensus 317 -ie~~l~~k-----ll~~~~~~~----~~i~fd~----~~l~~~-d~~~~~~~~~~~~~--~G~~T~NE~R~-~~g~~p- 377 (419) T protein:vir:80 317 -HEQAKTRD-----LLLPSERKQ----YFIEYNL----AGLLRG-DQSSRYAAYAVGRQ--WGWLSINDIRR-LENMPP- 377 (419) T ss_pred -HHHHHhhh-----ccCccccCC----eEEEEec----hhhhcc-CHHHHHHHHHHHHh--CCCcCHHHHHH-HhCCCC- Confidence 23333333 344444432 3445542 233222 23456666655422 23555555552 244422 Q ss_pred HHHHHHHHHHHhhhcCCCCCCCcccccCC-CCCC-CCCCCCccccccccCCccccchhcC Q lcl|NC_021072. 476 EIKEIDKQIDSEREAGLIVDPMAEMDPAM-DPGN-APPADDMSAQEGPAVDAGDAKRGEF 533 (533) Q Consensus 476 eI~e~~kqi~~E~~~~~~~~p~~~~~~~~-~~~~-~~~~~d~~~~~~~~~~~~~~~~~~~ 533 (533) ++.=+.-..|.+ -..+ ..+...+..++ ..+.-++...+ T Consensus 378 -----------------~~gGD~~~~~~n~~~~~~~~~~~~~~~~~---~~~~~~~~~~~ 417 (419) T protein:vir:80 378 -----------------VKGGDIYLSPMNMVDASKPQPIPMGKTEP---TKAALDEIGRI 417 (419) T ss_pred -----------------CCCcceeeeccccccccccccccCCCCCc---hhhhHHHHHhh Confidence 111000001111 0111 11111111110 01111222222 No 143 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=95.80 E-value=0.0014 Score=36.18 Aligned_cols=440 Identities=11% Similarity=0.054 Sum_probs=171.9 Q ss_pred ceeeeccccccccCCCCCCCCCccccee-----ecccccccccchhh--hhhHHHHHHH--------HHHhhhhcchhh- Q lcl|NC_021072. 7 GFSLERAKKVPKGPSFVQKDSMDGSQPI-----VGGGYYGYSVDFDG--TVRNEYELIT--------RYREMVLQPECD- 70 (533) Q Consensus 7 g~~i~~~~~~~~~~s~~~~~~~dg~~~~-----~~~~~~~~~~~~~~--~~~~~~~LI~--------~YR~m~~~pEvd- 70 (533) =+++++... ....-|...- ....|.-.....+. ...-...+|+ +|+.+..+.+=+ T Consensus 1 ~~~~~~~~~---------~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~ 71 (511) T protein:vir:96 1 MLKVNEFET---------DTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKT 71 (511) T ss_pred Cccccchhh---------hhhhhhhhhhhhhhhhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccC Confidence 111211100 0000111000 01111100000000 0111112222 344443322211 Q ss_pred ---------------------hHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHH Q lcl|NC_021072. 71 ---------------------SAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFR 129 (533) Q Consensus 71 ---------------------~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR 129 (533) +-..-||+... .=.-+.|+.+.+++ +.. .+.+..+++.-+|+....++++ T Consensus 72 ~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~-~yl~g~p~~~~~~~----~~~----~~~l~~~~~~n~~~~~~~~~~~ 142 (511) T protein:vir:96 72 KNLVELTRRKEEYMADNRVAHDYASYISDFIN-GYFLGNPIQYQDDD----KDV----LEAIEAFNDLNDVESHNRSLGL 142 (511) T ss_pred ccccccCcCcccccCcceeecchHHHHHHHHH-hhhccCCceeecCc----hHH----HHHHHHHHhhcCHHHHHHHHHH Confidence 11122222211 11236777777654 333 3456667777789999999999 Q ss_pred hhhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCcee---EEecc--ceeeccch-hceecccccccc Q lcl|NC_021072. 130 RWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLR---GEDIN--TQLTQKAA-EYYLYNPKGLKN 203 (533) Q Consensus 130 ~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~---~~~~~--~~~~~~~~-e~~~y~p~~~~~ 203 (533) ...+-|+.|.+.-+| +.|-..+..+||+.+-+|....... .... ++... +....... -..+|.+..... T Consensus 143 ~~~i~G~a~~~vy~d----ed~~~~i~~~~p~~~~~vydd~~~~-~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~ 217 (511) T protein:vir:96 143 DLSIYGKAYELMIRN----QDDETRLYKSDAMSTFVIYDNTIER-NSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYR 217 (511) T ss_pred HHHhcCeeEEEEEeC----CCCceEEEEEccceeEEEEcCCCCC-ceEEEEEEEEeeeccccccceEEEEEEEeCCcEEE Confidence 999999999887665 3466889999999997765432211 1111 11110 00000001 111344433110 Q ss_pred ------------------ccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHHHHH-HHHHHHHHhcCccce Q lcl|NC_021072. 204 ------------------STNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIE-DSLVIYRLSRAPERR 264 (533) Q Consensus 204 ------------------~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~E-DalVIyRi~RAPeRr 264 (533) .+|.--+||.-. | +++....|-++..+.....+..+- +....-+-++.|-+- T Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~--~-------~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv 288 (511) T protein:vir:96 218 YLTSRTNGLKLTPRENGFESHSFERMPITE--F-------SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLL 288 (511) T ss_pred EEecCCCcccccccccccccccCCceeeEE--e-------cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceee Confidence 011111222111 1 123345566766655555443322 112222334444333 Q ss_pred EEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccc-cccccchhHhhhcc---cccCCCCccceeecCCCCCcchH Q lcl|NC_021072. 265 IFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIK-DDKKFMSMLEDFWL---PRREGGRGTEISTLPGGQNLGEL 340 (533) Q Consensus 265 vfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~-~d~~~msmlEDywL---pRReggrgTEIsTLpGg~nLgei 340 (533) +.- .+. ..++++. +....+-.+..... .....+.|..+..|-...+...+ T Consensus 289 ~~g----~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 342 (511) T protein:vir:96 289 IKG----NLN----------------------LDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGT 342 (511) T ss_pred eec----Ccc----------------------CCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHH Confidence 221 100 0011111 11111111111110 01112224456666554444433 Q ss_pred -HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhh Q lcl|NC_021072. 341 -EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEM 419 (533) Q Consensus 341 -~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~ 419 (533) .-+.-+.+.+|.-.++|---.+.-+| +. .|..|..-...-..-+.+.+..|..-+...++.=+-+-++....+++.- T Consensus 343 e~~~~~L~~~I~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d 420 (511) T protein:vir:96 343 EAYKDRLNSDIHMFTNTPNMKDDNFSG-TQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKD 420 (511) T ss_pred HHHHHHHHHHHHHHhCCcccccccccc-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccc Confidence 33456677888888888532221111 11 1222333333333445555555666655555442222222222222222 Q ss_pred hhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCC----C Q lcl|NC_021072. 420 KEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIV----D 495 (533) Q Consensus 420 ~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~----~ 495 (533) ...+.+.|...---.+ .+.++++..+ .| .+|.+++++. |...++ .+++.++|++|.... .+ . T Consensus 421 ~~~i~~~f~~~~p~n~-------~e~~~~~~kl---~G-~iS~et~l~~-l~~v~D-~~~E~~ri~~E~~~~-~~~~~~~ 486 (511) T protein:vir:96 421 FNTVRYVYNRNLPKSL-------IEELKAYIDS---GG-KISQTTLMSL-FSFFQD-PELEVKKIEEDEKES-IKKAQKG 486 (511) T ss_pred cccceEEeCCCCCCCH-------HHHHHHHHHH---hc-cCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHH-HHHHhhc Confidence 2357788864322222 2344555555 34 4899999966 565442 334445555554332 11 0 Q ss_pred CCcccccCCCCCCCCCCCCccccccccCCccccchhc Q lcl|NC_021072. 496 PMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDAKRGE 532 (533) Q Consensus 496 p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 532 (533) +.....+..+. .++++.. ....++| T Consensus 487 ~~~~~~~~~~~---~~~~~~~---------~~~~~~~ 511 (511) T protein:vir:96 487 IYKDPRDINDD---EQDDDTK---------DTVDKKE 511 (511) T ss_pred cccCCCCCCCC---CCCCccc---------ccccccC Confidence 11111111110 0111000 0111111 No 144 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=95.79 E-value=0.0015 Score=36.15 Aligned_cols=376 Identities=11% Similarity=0.101 Sum_probs=153.4 Q ss_pred ccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcceee Q lcl|NC_021072. 3 NQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNETIC 82 (533) Q Consensus 3 ~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvneaiv 82 (533) =.||++.-.. + . .++....+........+.+...+... +. -+...++|.|.+||+-|.+.+-- T Consensus 1 M~~f~~~~~~-~-----~--~~~~~~~~~~~~~~~~~~~~~~~~~~-v~--------~~~~~~~~~v~~~i~~ia~~ia~ 63 (386) T protein:vir:48 1 MPIFNITNLA-T-----E--SPPISQGGFFDITDPDFLSTLNGSEW-VS--------AESALRNSDLFSIINQLSNDLAT 63 (386) T ss_pred Cccccccccc-c-----c--ccccccccccccccchhcccccCCce-ec--------hhhhhcchHHHHHHHHHHHhhcc Confidence 2356542111 1 1 11122222222222222222111110 11 12235789999999999998654 Q ss_pred ecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHH----hhhhcCceeeeeeecCCCCCCCeEEEEEc Q lcl|NC_021072. 83 GNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFR----RWYVDGRLFYHKVIDPKNPRGGLTELRYI 158 (533) Q Consensus 83 ~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR----~WYvDGri~~hkvid~~~~~~gI~elr~l 158 (533) .|+.+. .. -...++..-|-.-.+.++.+ .+.+.|.-|+-++-|. .+-+++|.++ T Consensus 64 -----~p~~~~--~~------------~~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~---~g~~~~L~~l 121 (386) T protein:vir:48 64 -----VKLTAS--RK------------QLQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNE---NGRDMKWEYL 121 (386) T ss_pred -----Cceeec--cc------------hhHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECC---CCcEEEEEEe Confidence 233332 10 12233444444445555544 5778899999877764 4559999999 Q ss_pred ChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCC-CccchhHHH Q lcl|NC_021072. 159 DPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNK-NMTLSHLHK 237 (533) Q Consensus 159 DP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~-~~i~syL~~ 237 (533) ||..++..+.. .++...+...... ....... .+.....++|+..+ .++ -.++|.|.. T Consensus 122 ~~~~v~v~~~~---~~~~~~y~~~~~~--~~~~~~~-------~~~~~evih~~~~~----------~~~~~~G~s~i~~ 179 (386) T protein:vir:48 122 RPSQVSFNRLD---NKDGIYYNITFDD--PRIPPKQ-------HVPQGDVLHFKLLS----------VDGGLTSVSPLMA 179 (386) T ss_pred cCceeEEEEcC---CCceEEEEEEecC--cccccee-------EecCccEEEecCCC----------CCCceeeccHHHH Confidence 99999653321 1111111100000 0000000 11122333343221 222 235689999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcc Q lcl|NC_021072. 238 AIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWL 317 (533) Q Consensus 238 AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywL 317 (533) |.+++.....+++...=+----+--+-+.-.+- .+.+...++ +++..... +. ..|.+ + .++ T Consensus 180 ~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~-~~~~e~~~~-~~~~~~~~-----~~-n~g~~------~-vl~---- 240 (386) T protein:vir:48 180 LSRELNIQKASDKLTLNSLKNALNANGILKIKG-GGLLDFKTK-LSRSRQAM-----KQ-MQGGP------L-VLD---- 240 (386) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-CCCHHHHHH-HHHHHHHh-----hc-CCCCc------e-ecC---- Confidence 999999988888875544333333444555443 333333332 33322221 11 12221 1 111 Q ss_pred cccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHH-HHHHHHHHH Q lcl|NC_021072. 318 PRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFI-ARLRKRFSE 395 (533) Q Consensus 318 pRReggrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi-~rLr~~fs~ 395 (533) .|.++..|.-...-.| ++=.++....+.++++||...|+..+..+ ...+-.++ |.+++ .-+-..+.. T Consensus 241 ------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~--~~e~~~~~---~~~~~l~P~~~~ie~ 309 (386) T protein:vir:48 241 ------DLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQGDQQ--SSLEMSLD---LYNKAVSRYLRPFLS 309 (386) T ss_pred ------CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc--cHHHHHHH---HHHHHHHHHHHHHHH Confidence 2456666642222222 33346667899999999999997544332 23333333 54443 333333333 Q ss_pred HHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhc-cccccHHHHHHHHhCCCH Q lcl|NC_021072. 396 LFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYV-GKYFSIDYMRRQVLKQTD 474 (533) Q Consensus 396 if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~v-Gky~S~~~i~k~IL~~tD 474 (533) .|..-|-+++ +.+ +...+..|.+. ....++..+ +-.+|..-+++ +|+ T Consensus 310 ~l~~~l~~~~---------~~~-----~~~~~~~d~~~--------------~~~~~~~l~~~g~~t~nE~r~-~lg--- 357 (386) T protein:vir:48 310 ELSQKLSCDV---------DAD-----ILPAVDPTGSN--------------SVSRINSMVKSGTLAQNQGLY-ILQ--- 357 (386) T ss_pred HHHHhhcchh---------hcc-----hhhhhccChHH--------------HHHHHHHHHhCCCcCHHHHHH-Hhh--- Confidence 3322221110 000 00011111110 011111111 22333333332 222 Q ss_pred HHHHHHHHHHHHhhhcCCCCCCCcc--cccCCCCCCCCCCCCccccc Q lcl|NC_021072. 475 QEIKEIDKQIDSEREAGLIVDPMAE--MDPAMDPGNAPPADDMSAQE 519 (533) Q Consensus 475 eeI~e~~kqi~~E~~~~~~~~p~~~--~~~~~~~~~~~~~~d~~~~~ 519 (533) ..|+.+ .+.. +.+++.+. ++- |.+.++ T Consensus 358 --------------~~~~~~-~~~~~~~~~~~~~~-~gG--d~~~~~ 386 (386) T protein:vir:48 358 --------------QAEILP-KELPEGENPNKTTL-KGG--EINGED 386 (386) T ss_pred --------------cCCCCC-ccchhhcCCCCCcc-CCC--CCCCCC Confidence 122211 1100 01111100 000 111111 No 145 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=95.67 E-value=0.0016 Score=35.85 Aligned_cols=408 Identities=13% Similarity=0.136 Sum_probs=173.7 Q ss_pred ccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcceeeec Q lcl|NC_021072. 5 LFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNETICGN 84 (533) Q Consensus 5 ~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvneaiv~d 84 (533) +|=+...+.+.+ ..| ..+......++..+..+...-.+.. +...++|-|..||+-|.+.+-- T Consensus 1 m~~~~~~~~~~~--~~s------~~~~w~~~~~~~~~~~~~~g~~vt~--------~~al~~~~v~~~i~~Ia~~iA~-- 62 (421) T protein:vir:10 1 MFIPQMFEGKKR--SVS------GGGFWEAMLGGVRSSHSKAGVMITP--------ETALALSAVRACVTLLAESVAQ-- 62 (421) T ss_pred CCCcchhccccc--ccC------cchhhHHHhhhhccCcccCCceech--------HHhhccHHHHHHHHHHHHhhcc-- Confidence 222222221111 111 1111111111111111111001111 1245788999999999888542 Q ss_pred CCCceEEEEeccCCCc-HHHHHHHHHHHHHHHHHh----cchhhhhHHHHh----hhhcCceeeeeeecCCCCCCCeEEE Q lcl|NC_021072. 85 FDDVPVEVELSNLKQS-DKIKKLIREEFAEILRLL----DFENRSYEIFRR----WYVDGRLFYHKVIDPKNPRGGLTEL 155 (533) Q Consensus 85 ~~~~~v~v~l~~~~~S-~~ik~~I~eeF~~i~~lL----~f~~~~~~~fR~----WYvDGri~~hkvid~~~~~~gI~el 155 (533) .|+.|--...+.+ +.+++ ..++.+| |-...+.++.+. +.+.|.-|+.++-|. .+-+++| T Consensus 63 ---lp~~~~~~~~~g~~~~~~~------~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~---~G~~~~L 130 (421) T protein:vir:10 63 ---LPVELYRRDKNGGRQRATD------HPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDG---KGYPKEL 130 (421) T ss_pred ---CceEEEEEcCCCceeeccc------chHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC---CCcEEEE Confidence 4554421111111 11111 0244444 333456665544 678999999987764 4569999 Q ss_pred EEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCccchhH Q lcl|NC_021072. 156 RYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHL 235 (533) Q Consensus 156 r~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL 235 (533) .+|+|..+...+. +++..++ .+...+........+++.. ...++-.++|.| T Consensus 131 ~~l~~~~v~v~~~-----~~g~~~y--------------~~~~~g~~~~~~eiih~~~----------~~~d~~~G~spi 181 (421) T protein:vir:10 131 IPINPKKVIVLKG-----PDGMPYY--------------EIPEIGETLPMRMMHHVKV----------FSLDGYIGSSPI 181 (421) T ss_pred EEecCceEEEEEC-----CCceEEE--------------EEcCCCcEEchhhEEEecC----------cCCCCcccccHH Confidence 9999999965332 1221111 1111111111122222221 112233467899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhh Q lcl|NC_021072. 236 HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDF 315 (533) Q Consensus 236 ~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDy 315 (533) +.|.+++.....+++...=+=---+--+-+...+- +++..+.++-...+..+++++.-=....|.+ + .++ T Consensus 182 ~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~------~-vl~-- 251 (421) T protein:vir:10 182 QTNADVLGLNLAVEEHASAVFRRGATMSGVIERPK-EAPAIKSQEKIDQLLAKWTDRYSGINNMFSV------A-LLQ-- 251 (421) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecC-ccCccCCHHHHHHHHHHHHHHhcCccccCcc------e-ecC-- Confidence 99999988877777665433333344455666664 2322222222333333333332100111211 1 111 Q ss_pred cccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHH Q lcl|NC_021072. 316 WLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFS 394 (533) Q Consensus 316 wLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs 394 (533) .|++++.|--. ..+.-++-.++..+.+.++.+||..-|+..+.-+....++..+. |.++ .|+-.+ T Consensus 252 --------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~---f~~~--tl~P~~- 317 (421) T protein:vir:10 252 --------EGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNNIEHQGLQ---FVMY--TLLAWL- 317 (421) T ss_pred --------CCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCccccHHHHHHH---HHHH--HHHHHH- Confidence 25666666422 22222344457888899999999999965443333334444433 5543 232211 Q ss_pred HHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCH Q lcl|NC_021072. 395 ELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTD 474 (533) Q Consensus 395 ~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tD 474 (533) ..+.+.|-.. ++++.++. ...+.|..+. +... -+..|.+.+..+-. .-++|.+-++. .+++.+ T Consensus 318 ~~ie~~ln~k-----L~~~~~~~----~~~v~fd~~~----l~~~-d~~~~~~~~~~~~~--~G~~T~NE~R~-~~gl~p 380 (421) T protein:vir:10 318 KRHEGALQRD-----LLLPSERR----DLYIEFNVSG----LLRG-DQKSRYESYALGRQ--WGWLSVNDIRR-MENLPP 380 (421) T ss_pred HHHHHHHhhh-----ccCccccC----CeEEEEechh----hhcc-CHHHHHHHHHHHHh--CCCcCHHHHHH-HhCCCC Confidence 1223333333 34555443 3456665332 2211 12345555555432 23666666663 345432 Q ss_pred HHHHHHHHHHHHhhhcCCCCCCCcccccCCC-C-CCC-CCCCCccccccccCCccccch Q lcl|NC_021072. 475 QEIKEIDKQIDSEREAGLIVDPMAEMDPAMD-P-GNA-PPADDMSAQEGPAVDAGDAKR 530 (533) Q Consensus 475 eeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~-~-~~~-~~~~d~~~~~~~~~~~~~~~~ 530 (533) - +- .+-.+ -|.+. . +.. +..+..++++.+..|+.-..- T Consensus 381 ~--~g---------gD~~~-------~~~n~~~~~~~~~~~~~~~~~~~~e~d~~~~~~ 421 (421) T protein:vir:10 381 I--AG---------GDKYL-------TPLNMVDSAQIIPGDKKPTAQQMAEIDTILSRT 421 (421) T ss_pred C--CC---------cceee-------eccccccccccccCCCCcccccCcccccccccC Confidence 1 00 00001 11110 0 000 011111122222222211111 No 146 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=95.63 E-value=0.0017 Score=35.74 Aligned_cols=403 Identities=13% Similarity=0.144 Sum_probs=168.0 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |.. +...+ ..+.. +........|.... .+...|-.+ ..+...++|.|..||+-|.+.+ T Consensus 1 m~~----~~~~~--~~~~~-~~~~~~~~~~~~~~-~~~~~g~~v--------------~~~~al~~~~v~~~i~~ia~~i 58 (419) T protein:vir:57 1 MFI----PQFWK--GRPSE-NRVNWQVVPGGMRS-SSSQAGVII--------------TPETALALSAVRACVTLLAESV 58 (419) T ss_pred Ccc----hhhhc--cCCcc-cccccccccccccc-ccccCCcee--------------chHHhhccHHHHHHHHHHHHhh Confidence 321 11111 01111 11111111111111 111111111 1122346788999999999874 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHh----cchhhhhHHH----HhhhhcCceeeeeeecCCCCCCCe Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLL----DFENRSYEIF----RRWYVDGRLFYHKVIDPKNPRGGL 152 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL----~f~~~~~~~f----R~WYvDGri~~hkvid~~~~~~gI 152 (533) -- .|+.+-=..-+.++.. +.+ ..+.++| |-...+.++. ..+++.|.-|+.++-+. .+-+ T Consensus 59 a~-----lp~~~~~~~~~g~~~~---~~~--~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~---~G~~ 125 (419) T protein:vir:57 59 AQ-----LPCVLYRRTENGGREI---AFD--HPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNG---RGDI 125 (419) T ss_pred cc-----CceEEEEEcCCCceec---ccc--chHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECC---CCcE Confidence 42 4554421111111111 100 1234444 3334555544 44778999998877654 4569 Q ss_pred EEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCccc Q lcl|NC_021072. 153 TELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTL 232 (533) Q Consensus 153 ~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~ 232 (533) ++|.+|+|..++... ..++.. +|.+.-.+ ..++.+-+.+.. ....++-.++ T Consensus 126 ~~L~pl~~~~v~v~~-----~~~g~~--------------~y~~~~~~--------~~~~~~~vih~r--~~~~d~~~G~ 176 (419) T protein:vir:57 126 TELIPINPHKVIVLK-----GPDGMP--------------YYDIPSIG--------EILPMRMVHHIK--SFSLDGYIGT 176 (419) T ss_pred EEEEEEcCcceEEEE-----CCCceE--------------EEEEcCCc--------eEEchhhEEEec--CcCCCCcccc Confidence 999999999996422 112111 12221111 123333332222 1122334567 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhH Q lcl|NC_021072. 233 SHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSML 312 (533) Q Consensus 233 syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msml 312 (533) |.+..|.+++....-+++...=+----+--+-+...+.. +.....++-...+..++.++.-=..+.|.+ + .+ T Consensus 177 s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-~~~~~~~e~~~~~~~~~~~~~~g~~nag~~------~-vl 248 (419) T protein:vir:57 177 SPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFE-AKAIASQAAVDAILAKWTERYGGVRNAFSV------G-ML 248 (419) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCc-CCcccCHHHHHHHHHHHHHHhccccccccc------e-ec Confidence 899999998888877777665443333444455555421 111112222333333333322100011211 1 11 Q ss_pred hhhcccccCCCCccceeecCCCCCcchHH---HHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HHH Q lcl|NC_021072. 313 EDFWLPRREGGRGTEISTLPGGQNLGELE---DVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IAR 388 (533) Q Consensus 313 EDywLpRReggrgTEIsTLpGg~nLgei~---DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kf-i~r 388 (533) + .|+++..|- .+.-++. =.++..+.++++++||...|+..+.-+....++..+. |.++ +.- T Consensus 249 ~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~---f~~~~l~P 313 (419) T protein:vir:57 249 Q----------EGMTYKQLS--QDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNIEHQGLQ---YVIYTMLA 313 (419) T ss_pred C----------CCceEEEcC--CChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccccHHHHHHH---HHHHHHHH Confidence 1 245565553 2333333 3356668899999999999975443333333332222 5544 233 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHH Q lcl|NC_021072. 389 LRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQ 468 (533) Q Consensus 389 Lr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~ 468 (533) +...+... +-++++++.++. ...+.|..+. +... -+..|+++++.+-.- -+++.+-++. T Consensus 314 ~~~~ie~~---------l~~~ll~~~~~~----~~~i~fd~~~----ll~~-d~~~~~~~~~~~~~~--G~~T~NE~R~- 372 (419) T protein:vir:57 314 ILKRHESA---------MMRDLLLPSERR----DFYIEFNVSS----LLRG-DQKSRYESYALGRQW--GWLSVNDIRR- 372 (419) T ss_pred HHHHHHHH---------HHhhccCccccC----CeEEEEechh----hhcc-CHHHHHHHHHHHHhC--CCcCHHHHHH- Confidence 32322222 233344454443 3455665332 2211 234556666554322 3566666663 Q ss_pred HhCCCHHHHHHHHHHHHHhhhcCCCCCCCccc--ccCCCC--CCCCCCCCccccccccCCccccchh Q lcl|NC_021072. 469 VLKQTDQEIKEIDKQIDSEREAGLIVDPMAEM--DPAMDP--GNAPPADDMSAQEGPAVDAGDAKRG 531 (533) Q Consensus 469 IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~--~~~~~~--~~~~~~~d~~~~~~~~~~~~~~~~~ 531 (533) ++++.+ + |+.+. .|.+.. +.....+....++.+..++....+- T Consensus 373 ~~gl~p------------------~--~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 419 (419) T protein:vir:57 373 MENLTP------------------I--PGGDKYLTPLNMVDSKALTGIGKATPQQLKDIEAILCTRN 419 (419) T ss_pred HhCCCC------------------C--CCcCeeeeccccccccccccccCCCcccCcchhhhhhccC Confidence 345432 1 11110 111100 0000000011111111111111111 No 147 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=95.46 E-value=0.002 Score=35.34 Aligned_cols=397 Identities=13% Similarity=0.099 Sum_probs=180.5 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) +-..||. + +. ..++..|-..+ .....+..+. +.. +. -+..+++|-|.+||+-|.+.+ T Consensus 18 ~~~~lf~----~--~~--~~~~~~~~~~~---~~~~~~~~~~--~~~---------vs-~~~al~~~~v~~cv~~Ia~~i 74 (424) T protein:vir:45 18 LLDALFR----S--KS--LENPSTPITGD---AVDTDGLFRA--DVY---------VS-PETAMKLAAVYSCIYVLSSSL 74 (424) T ss_pred HHHhhcc----c--cC--CCCCccccchh---hhhhhccccC--Cce---------ec-hHHhhccHHHHHHHHHHHHHH Confidence 2223332 1 11 01111111111 1111111100 000 11 133456888999999999885 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcc----hhhhhHHHH----hhhhcCceeeeeeecCCCCCCCe Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDF----ENRSYEIFR----RWYVDGRLFYHKVIDPKNPRGGL 152 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f----~~~~~~~fR----~WYvDGri~~hkvid~~~~~~gI 152 (533) -. .|+.|--...+..+.+++ ..++++|+. .-.+.++.+ .+.+.|.-|..++-|. .+-+ T Consensus 75 A~-----lp~~v~~~~~~~~~~~~~------~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~---~G~~ 140 (424) T protein:vir:45 75 AQ-----MPLHVMRRHKGKVEPARD------HPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNR---RGEV 140 (424) T ss_pred hh-----CceEEEEecCCceeeccc------chHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcC---CCcE Confidence 53 455553222222222211 134444432 234555444 4677899999866553 5669 Q ss_pred EEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCccc Q lcl|NC_021072. 153 TELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTL 232 (533) Q Consensus 153 ~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~ 232 (533) ++|.+++|..+...+. .+...+ .+++..+ ...++.+-|.+.. ....++-+++ T Consensus 141 ~~L~~l~~~~v~i~~~-----~~~~~y--------------~~~~~~~-------~~~~~~~eVih~r--~~~~d~~~G~ 192 (424) T protein:vir:45 141 ISLDCCMPWETTLMNT-----GGRYTY--------------GLYNEYG-------AFAISPDDMIHIR--ALGNNQKMGL 192 (424) T ss_pred EEEEEecCceEEEEEc-----CCeEEE--------------EEEecCc-------eEEECcccEEEec--CcCCCCcccc Confidence 9999999998854221 111111 1111111 1123333222221 1233455678 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhH Q lcl|NC_021072. 233 SHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSML 312 (533) Q Consensus 233 syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msml 312 (533) |-++.|...+.....+++...=+----|--+-|+.++. .|.+.++++.-+.+-..|+- ..+ +.|.+ + .+ T Consensus 193 spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g--~~~-n~g~~------~-vl 261 (424) T protein:vir:45 193 SPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKS-GLNKESWGWLKDQWQKASQA--LRR-QENKT------M-LL 261 (424) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC-CCCHHHHHHHHHHHHHHhcc--ccc-cCCce------e-Ec Confidence 99999999999888888876644444455567777774 46665554444433333321 111 11211 1 11 Q ss_pred hhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HHHHH Q lcl|NC_021072. 313 EDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IARLR 390 (533) Q Consensus 313 EDywLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kf-i~rLr 390 (533) ..|.++..|.-. ..+.-++-.++-.+.+.++++||...|+..+.-+.....+..+. |.++ +.-+- T Consensus 262 ----------~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~---f~~~tL~P~~ 328 (424) T protein:vir:45 262 ----------PADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISAQAIQ---FVRYTMMPWV 328 (424) T ss_pred ----------CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHHHHHHH Confidence 124455555321 11222455568888999999999999976544344444554444 5554 33333 Q ss_pred HHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHh Q lcl|NC_021072. 391 KRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVL 470 (533) Q Consensus 391 ~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL 470 (533) .++.. .|-..| .+..++. ....+.|..+ ++... -+..|.+.+..+-.- -++|.+-++. ++ T Consensus 329 ~~ie~----~ln~kL-----l~~~e~~---~g~~i~fd~~----~llr~-d~~~r~~~~~~~~~~--g~~T~NE~R~-~~ 388 (424) T protein:vir:45 329 TNWEQ----ELNRRL-----FTRAELA---AGYYVRFNLT----GLLRG-TPQERAQFYHFAITD--GWMSRNEARA-FE 388 (424) T ss_pred HHHHH----HHHHhc-----CChhhhc---CCcEEEeech----hhhcc-CHHHHHHHHHHHHhC--CCcCHHHHHH-Hh Confidence 33222 233332 3433332 2235566533 22222 234566666655433 4666666663 34 Q ss_pred CCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccc Q lcl|NC_021072. 471 KQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQE 519 (533) Q Consensus 471 ~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~ 519 (533) ++..- |-.+..+...+. .+..+....++.+++..++ T Consensus 389 gl~pi-----------~ggD~~~~~~n~--~~~~~~~~~~~~~~~~~~~ 424 (424) T protein:vir:45 389 DMNPV-----------EGLDEMLVSVNA--ANPAGDFKPPKNDEGKTNE 424 (424) T ss_pred CCCCC-----------CCcceeeecccc--cccccccCCCCCCCCCCCC Confidence 54321 001111110010 0000111122222222222 No 148 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=95.30 E-value=0.0023 Score=35.01 Aligned_cols=416 Identities=12% Similarity=0.136 Sum_probs=168.3 Q ss_pred CCccccceeeeccccccccC-CCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchh---------- Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGP-SFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPEC---------- 69 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~-s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEv---------- 69 (533) |- ..|-..+-+ +.-.+.. +..+....+.. .+.- .....+. -+.+|+.+..+.+- T Consensus 1 ~~-~~~~~~~~~-~~~~~~~~~~~~~~~~~~~--~i~~--------~i~~~~~---~~~~~~~~~~YY~g~~~i~~~~~~ 65 (474) T protein:vir:97 1 MF-NIIRMPWDK-PYGEEVVEQLKPQFETQEE--MIVR--------LIDDHRK---QLDKITVGQRYYDKDNDIVKQMKK 65 (474) T ss_pred Cc-ccccccCCC-chhhHHHHhhhhcccCHHH--HHHH--------HHHHHHH---HHHHHHHHHHHhccccchhcccch Confidence 10 000000000 0000000 00000000000 0000 0000011 11222222222111 Q ss_pred -----------------hhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhh Q lcl|NC_021072. 70 -----------------DSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWY 132 (533) Q Consensus 70 -----------------d~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WY 132 (533) .+-..-||+-.. .=.=+.|+.+.+++ +...+. ++.+++ -+|+....++++.+. T Consensus 66 ~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~-~~l~g~p~~~~~~d----~~~~~~----l~~~~~-n~~~~~~~e~~~~~~ 135 (474) T protein:vir:97 66 VDVHGNIDYDKPDWRITTNFHQNLVDQKV-SYVASKPVTYSCED----ENVLKV----IHDVLD-TRWDNKLIDILTATS 135 (474) T ss_pred hccccccccccCcceeecchHHHHHHHHH-hhhhcCCceeccCc----HHHHHH----HHHHHh-ccHHHHHHHHHHHHh Confidence 111222332211 11225667776655 223222 222222 368889999999999 Q ss_pred hcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCc---eeEEeccceeeccchhceeccccccc------- Q lcl|NC_021072. 133 VDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQ---LRGEDINTQLTQKAAEYYLYNPKGLK------- 202 (533) Q Consensus 133 vDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~---~~~~~~~~~~~~~~~e~~~y~p~~~~------- 202 (533) +-|+.|.+.-+| .+|-..+..+||+.+-++..-.. .... .+++.... .....+|.+.... T Consensus 136 ~~G~~~~~~~~d----~~~~~~i~~~~p~~~~~v~d~~~-~~~~~~~ir~~~~~~-----~~~~~~yt~~~~~~y~~~~~ 205 (474) T protein:vir:97 136 NKGIDWLQVYIN----ENGEMKLFRVPAEQAIPIWVDKE-REELKSFIRYYKFNN-----EEKVEFWTDTTVTYYVLENG 205 (474) T ss_pred hcCceEEEEEec----CCCeeEEEEEcccceEEEEcCCC-CCceEEEEEEEEecC-----eEEEEEEeCCeEEEEEEcCC Confidence 999999887665 34668899999999977643211 1111 11111111 1111223332110 Q ss_pred ----------------cccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceE Q lcl|NC_021072. 203 ----------------NSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRI 265 (533) Q Consensus 203 ----------------~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLr-m~EDalVIyRi~RAPeRrv 265 (533) ..+|..-+||.-.+ +++....|=++..+.....+. ++-+....-+.++.|-+-+ T Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~---------~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~ 276 (474) T protein:vir:97 206 GLIPDYYYGANHVQSHFSNGNWGRVPFIAF---------KNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYIL 276 (474) T ss_pred ccccccccCcCcccccccccCCCccceEEe---------cCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeee Confidence 01111122221111 123344566676666665554 3344444445556664433 Q ss_pred EEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHH-HHH Q lcl|NC_021072. 266 FYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELE-DVK 344 (533) Q Consensus 266 fyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~-DV~ 344 (533) .-.+.- +...++..+..+++-.=+++.+ ++.|-.-.+.+... -+. T Consensus 277 ~g~~~~--------------------------------~~~~~~~~~~~~~~i~~~~~~~--~~~l~~~~~~~~~~~~~~ 322 (474) T protein:vir:97 277 KGYEGE--------------------------------DLEEFMRGLKYYKAINVDGDGG--VETIQVEVPVSSTKEYID 322 (474) T ss_pred ecCCcc--------------------------------cchhhhhhhhccceeeccCCCc--eeEEeecCCHHHHHHHHH Confidence 221110 1111122222222222233333 44444333444333 456 Q ss_pred HHHHHHHHhcCCCccccCCCCccccc--chhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhc Q lcl|NC_021072. 345 YFQKKLYKALNVPSSRLETETTFNIG--RAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEH 422 (533) Q Consensus 345 YF~~kLy~aL~VP~sRl~~~~~~~~g--~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~ 422 (533) -+.+.+|+...+|- +..++ |. | .|..|..-......-+.+.+..|...+..+++.=+-+-|+ ..+|. . T Consensus 323 ~l~~~I~~~s~~p~--~~~~~-~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~--~~d~~----~ 392 (474) T protein:vir:97 323 LMRVYIMEFGQGVD--FQTDK-FG-SAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNL--KTDVK----D 392 (474) T ss_pred HHHHHHHHHhCccc--cCccc-cc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Ccccc----e Confidence 66777888899884 22211 11 1 2223333333344446677777777777776654444454 23444 4 Q ss_pred eeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccc Q lcl|NC_021072. 423 IQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDP 502 (533) Q Consensus 423 i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~ 502 (533) |.+.|...---.+. +.++++.++ | .+|.+++++. |...++ .+++.++|++|..+..=. .+ T Consensus 393 i~v~f~~~~p~~~~-------e~a~~~~~~----g-~iS~et~l~~-l~~v~D-~~~E~eri~~E~~~~~~~------~~ 452 (474) T protein:vir:97 393 IEISFNFNRMMNDA-------EQSQIIAQS----Q-YLSRETLVKS-SPLVDD-YKAELERIEQEQMEYNKQ------LP 452 (474) T ss_pred eeEEeccCcccCHH-------HHHHHHHHc----C-CCCHHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHhh------cc Confidence 67778543322232 222334442 2 5899999977 554332 334455555555432101 11 Q ss_pred CCCCCCCCCCCCccccccccCCccccchhc Q lcl|NC_021072. 503 AMDPGNAPPADDMSAQEGPAVDAGDAKRGE 532 (533) Q Consensus 503 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 532 (533) ....+++ ..++.....+.+++| T Consensus 453 ~~~~~~~--------~~~~~~~~~~~~~~e 474 (474) T protein:vir:97 453 NLDDGGA--------DGAQQQEGSNNKESE 474 (474) T ss_pred ccCCCCC--------CCcccCCCCcccccC Confidence 1111110 011111112222222 No 149 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=95.30 E-value=0.0023 Score=35.01 Aligned_cols=416 Identities=12% Similarity=0.136 Sum_probs=168.3 Q ss_pred CCccccceeeeccccccccC-CCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchh---------- Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGP-SFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPEC---------- 69 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~-s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEv---------- 69 (533) |- ..|-..+-+ +.-.+.. +..+....+.. .+.- .....+. -+.+|+.+..+.+- T Consensus 1 ~~-~~~~~~~~~-~~~~~~~~~~~~~~~~~~~--~i~~--------~i~~~~~---~~~~~~~~~~YY~g~~~i~~~~~~ 65 (474) T protein:vir:94 1 MF-NIIRMPWDK-PYGEEVVEQLKPQFETQEE--MIVR--------LIDDHRK---QLDKITVGQRYYDKDNDIVKQMKK 65 (474) T ss_pred Cc-ccccccCCC-chhhHHHHhhhhcccCHHH--HHHH--------HHHHHHH---HHHHHHHHHHHhccccchhcccch Confidence 10 000000000 0000000 00000000000 0000 0000011 11222222222111 Q ss_pred -----------------hhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhh Q lcl|NC_021072. 70 -----------------DSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWY 132 (533) Q Consensus 70 -----------------d~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WY 132 (533) .+-..-||+-.. .=.=+.|+.+.+++ +...+. ++.+++ -+|+....++++.+. T Consensus 66 ~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~-~~l~g~p~~~~~~d----~~~~~~----l~~~~~-n~~~~~~~e~~~~~~ 135 (474) T protein:vir:94 66 VDVHGNIDYDKPDWRITTNFHQNLVDQKV-SYVASKPVTYSCED----ENVLKV----IHDVLD-TRWDNKLIDILTATS 135 (474) T ss_pred hccccccccccCcceeecchHHHHHHHHH-hhhhcCCceeccCc----HHHHHH----HHHHHh-ccHHHHHHHHHHHHh Confidence 111222332211 11225667776655 223222 222222 368889999999999 Q ss_pred hcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCc---eeEEeccceeeccchhceeccccccc------- Q lcl|NC_021072. 133 VDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQ---LRGEDINTQLTQKAAEYYLYNPKGLK------- 202 (533) Q Consensus 133 vDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~---~~~~~~~~~~~~~~~e~~~y~p~~~~------- 202 (533) +-|+.|.+.-+| .+|-..+..+||+.+-++..-.. .... .+++.... .....+|.+.... T Consensus 136 ~~G~~~~~~~~d----~~~~~~i~~~~p~~~~~v~d~~~-~~~~~~~ir~~~~~~-----~~~~~~yt~~~~~~y~~~~~ 205 (474) T protein:vir:94 136 NKGIDWLQVYIN----ENGEMKLFRVPAEQAIPIWVDKE-REELKSFIRYYKFNN-----EEKVEFWTDTTVTYYVLENG 205 (474) T ss_pred hcCceEEEEEec----CCCeeEEEEEcccceEEEEcCCC-CCceEEEEEEEEecC-----eEEEEEEeCCeEEEEEEcCC Confidence 999999887665 34668899999999977643211 1111 11111111 1111223332110 Q ss_pred ----------------cccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceE Q lcl|NC_021072. 203 ----------------NSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRI 265 (533) Q Consensus 203 ----------------~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLr-m~EDalVIyRi~RAPeRrv 265 (533) ..+|..-+||.-.+ +++....|=++..+.....+. ++-+....-+.++.|-+-+ T Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~---------~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~ 276 (474) T protein:vir:94 206 GLIPDYYYGANHVQSHFSNGNWGRVPFIAF---------KNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYIL 276 (474) T ss_pred ccccccccCcCcccccccccCCCccceEEe---------cCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeee Confidence 01111122221111 123344566676666665554 3344444445556664433 Q ss_pred EEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHH-HHH Q lcl|NC_021072. 266 FYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELE-DVK 344 (533) Q Consensus 266 fyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~-DV~ 344 (533) .-.+.- +...++..+..+++-.=+++.+ ++.|-.-.+.+... -+. T Consensus 277 ~g~~~~--------------------------------~~~~~~~~~~~~~~i~~~~~~~--~~~l~~~~~~~~~~~~~~ 322 (474) T protein:vir:94 277 KGYEGE--------------------------------DLEEFMRGLKYYKAINVDGDGG--VETIQVEVPVSSTKEYID 322 (474) T ss_pred ecCCcc--------------------------------cchhhhhhhhccceeeccCCCc--eeEEeecCCHHHHHHHHH Confidence 221110 1111122222222222233333 44444333444333 456 Q ss_pred HHHHHHHHhcCCCccccCCCCccccc--chhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhc Q lcl|NC_021072. 345 YFQKKLYKALNVPSSRLETETTFNIG--RAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEH 422 (533) Q Consensus 345 YF~~kLy~aL~VP~sRl~~~~~~~~g--~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~ 422 (533) -+.+.+|+...+|- +..++ |. | .|..|..-......-+.+.+..|...+..+++.=+-+-|+ ..+|. . T Consensus 323 ~l~~~I~~~s~~p~--~~~~~-~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~--~~d~~----~ 392 (474) T protein:vir:94 323 LMRVYIMEFGQGVD--FQTDK-FG-SAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNL--KTDVK----D 392 (474) T ss_pred HHHHHHHHHhCccc--cCccc-cc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Ccccc----e Confidence 66777888899884 22211 11 1 2223333333344446677777777777776654444454 23444 4 Q ss_pred eeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccc Q lcl|NC_021072. 423 IQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDP 502 (533) Q Consensus 423 i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~ 502 (533) |.+.|...---.+. +.++++.++ | .+|.+++++. |...++ .+++.++|++|..+..=. .+ T Consensus 393 i~v~f~~~~p~~~~-------e~a~~~~~~----g-~iS~et~l~~-l~~v~D-~~~E~eri~~E~~~~~~~------~~ 452 (474) T protein:vir:94 393 IEISFNFNRMMNDA-------EQSQIIAQS----Q-YLSRETLVKS-SPLVDD-YKAELERIEQEQMEYNKQ------LP 452 (474) T ss_pred eeEEeccCcccCHH-------HHHHHHHHc----C-CCCHHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHhh------cc Confidence 67778543322232 222334442 2 5899999977 554332 334455555555432101 11 Q ss_pred CCCCCCCCCCCCccccccccCCccccchhc Q lcl|NC_021072. 503 AMDPGNAPPADDMSAQEGPAVDAGDAKRGE 532 (533) Q Consensus 503 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 532 (533) ....+++ ..++.....+.+++| T Consensus 453 ~~~~~~~--------~~~~~~~~~~~~~~e 474 (474) T protein:vir:94 453 NLDDGGA--------DGAQQQEGSNNKESE 474 (474) T ss_pred ccCCCCC--------CCcccCCCCcccccC Confidence 1111110 011111112222222 No 150 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=95.29 E-value=0.0024 Score=34.99 Aligned_cols=374 Identities=14% Similarity=0.084 Sum_probs=158.8 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |.. |...-.+ +.+..+...+.. ++......++..+.++.- +.-..+|-|.+||+-|++.+ T Consensus 1 Mg~--~~~~~~~---k~~~~~~~~~~~-~~~~~~~~~~~~~~~v~~--------------~~~l~~~~v~~~i~~ia~~i 60 (383) T protein:vir:10 1 MGL--LTPKNFS---KRNAKNMVYPSN-PAFFTTTVGGMQLSYVSA--------------LSALQNTNVYSVINRIASDV 60 (383) T ss_pred CCc--ccccccc---cccccccccccc-hhhhhhhccCccccccch--------------hHhhcchHHHHHHHHHHHhh Confidence 543 2211111 111112222222 111211111111211110 12245788999999999876 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhH----HHHhhhhcCceeeeeeecCCCCCCCeEEEE Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYE----IFRRWYVDGRLFYHKVIDPKNPRGGLTELR 156 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~----~fR~WYvDGri~~hkvid~~~~~~gI~elr 156 (533) -. .|+.+. +. .. ..+++.-|=...+.+ ++..+...|.-|+.++=+ ..++. T Consensus 61 a~-----~~~~~~--~~----~~--------~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~-------~~~~~ 114 (383) T protein:vir:10 61 SS-----AHFKTE--NT----AT--------LNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ-------NLEHI 114 (383) T ss_pred cc-----Cceeec--cc----ch--------hhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC-------ceeEe Confidence 54 344442 21 11 123332222223333 344466789999875422 46788 Q ss_pred EcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhcccccccc-CCCCccchhH Q lcl|NC_021072. 157 YIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQD-LNKNMTLSHL 235 (533) Q Consensus 157 ~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d-~~~~~i~syL 235 (533) +++|-+++.++.- ++..+. +.+.+.+ ..+.++.+-|.+...--.+ .+...++|.| T Consensus 115 p~~~~~v~~~~~~-----~~~~~~-------------~~~~~~~------~~~~~~~~evih~r~~~~~~~~~~~G~s~l 170 (383) T protein:vir:10 115 PNSDVQINYLPGN-----MGIVYT-------------VLESNDR------PKMVLRQDQMLHFRLMPDPQYRYLIGRSPL 170 (383) T ss_pred ecCcceEEEEEcC-----CceEEE-------------EEEcCCc------eEEEEcccceEEeccCCCCcccccccccHH Confidence 8888887553221 111000 1111111 1222333333222100011 1223467999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhh Q lcl|NC_021072. 236 HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDF 315 (533) Q Consensus 236 ~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDy 315 (533) ..|.+++.....++....=+----+--+-+..++.+..+ .++.+=+++.++++..- .+.|.+ + .+ T Consensus 171 ~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~-~e~~~~~~~~~~~~~~~----~n~~~~------~-vl--- 235 (383) T protein:vir:10 171 ESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSD-GKDLESAREEFEKANTG----DNSGRL------M-VL--- 235 (383) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCC-HHHHHHHHHHHHHHhCc----cccCCc------c-cc--- Confidence 999999999988888765443333555566666644333 44444445445544321 122321 1 22 Q ss_pred cccccCCCCccceeecCCCCCcch-HHHH-HHHHHHHHHhcCCCccccCCCC-c-ccccchhhhhHHhhhHHHHHHHHHH Q lcl|NC_021072. 316 WLPRREGGRGTEISTLPGGQNLGE-LEDV-KYFQKKLYKALNVPSSRLETET-T-FNIGRAAEITRDEVKFQKFIARLRK 391 (533) Q Consensus 316 wLpRReggrgTEIsTLpGg~nLge-i~DV-~YF~~kLy~aL~VP~sRl~~~~-~-~~~g~~~eItRDElkF~Kfi~rLr~ 391 (533) ..|.+++.|.-...-.+ +.+. .+-.+.+.++++||.+.|+... + -+....+++ ..-|.+-+.-+-+ T Consensus 236 -------~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~---~~~~~~~l~P~~~ 305 (383) T protein:vir:10 236 -------PDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQI---KATYLANLNSYVN 305 (383) T ss_pred -------CCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHHH---HHHHHHHHHHHHH Confidence 13677777754322222 2333 3446889999999999997432 1 111111221 1224443333333 Q ss_pred HHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhC Q lcl|NC_021072. 392 RFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLK 471 (533) Q Consensus 392 ~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~ 471 (533) .+ .+.|...|. + ..|+|++. .+... -+..|.+.+..+-.- -++|.+-++. +|+ T Consensus 306 ~i----e~~l~~~l~-----~--------~~~~f~~~------~l~~~-d~~~~~~~~~~~~~~--G~~t~nE~R~-~lg 358 (383) T protein:vir:10 306 PI----VDELRLKMN-----A--------PDLELDIK------DMLDV-DDSILINQVSNLAKS--GVLGAEQAQF-ILT 358 (383) T ss_pred HH----HHHHHHhhC-----C--------ceEEeech------hhhcc-CHHHHHHHHHHHHhC--CCcCHHHHHH-HhC Confidence 33 333333332 1 22344332 22211 123445544443322 3555555553 244 Q ss_pred CCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccccccCCccccc Q lcl|NC_021072. 472 QTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDAK 529 (533) Q Consensus 472 ~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~ 529 (533) +.. ++.++... ...+. +.+.-+|.| T Consensus 359 ~~p------------------~~~~d~~~------------~~~~~---~~~~gGd~e 383 (383) T protein:vir:10 359 RSG------------------FLPDNLPE------------FKPLT---NETKGGDDK 383 (383) T ss_pred CCc------------------ccCCcccc------------cCCCc---ccCCCCCCC Confidence 332 22111000 00000 011122222 No 151 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=95.21 E-value=0.0025 Score=34.82 Aligned_cols=388 Identities=11% Similarity=0.065 Sum_probs=163.7 Q ss_pred cchhhhhhHHHH-------HHHHHHhhhhcchhh-----------------------------------hHHHHhhccee Q lcl|NC_021072. 44 VDFDGTVRNEYE-------LITRYREMVLQPECD-----------------------------------SAVDDIVNETI 81 (533) Q Consensus 44 ~~~~~~~~~~~~-------LI~~YR~m~~~pEvd-----------------------------------~AvdeIvneai 81 (533) .+++.-.+-... .+.+|..+..+.+-. +-...||+-.+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 222222111111 122233332222221 11222333211 Q ss_pred eecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcChh Q lcl|NC_021072. 82 CGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPR 161 (533) Q Consensus 82 v~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP~ 161 (533) -+ .=+.|+.+..++. .. .+.++.+++ =+|+....++.+.+++-|+-|.+.-+|. .+|-..+..+||+ T Consensus 81 ~y-l~G~p~~~~~~~~----~~----~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~---~~g~~~~~~~~p~ 147 (471) T protein:vir:10 81 AY-ALTYPPTFDVDDK----KV----NDMIVDVLG-DDYERISKQLCVNAGNAGIAWLHVWKDA---SDNSFRYACVDSK 147 (471) T ss_pred hh-hcccCceeccCCh----HH----HHHHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEEeeC---CCCeeEEEEEccc Confidence 11 1257777776652 22 223343443 3688899999999999999999988875 4566789999999 Q ss_pred hceehhhccCCCcCce---eEEecccee-eccchhceecccccccc----------c----------------------- Q lcl|NC_021072. 162 KIRKVTEYQQKRPEQL---RGEDINTQL-TQKAAEYYLYNPKGLKN----------S----------------------- 204 (533) Q Consensus 162 ~i~~vr~~~~~~~~~~---~~~~~~~~~-~~~~~e~~~y~p~~~~~----------~----------------------- 204 (533) .+-+|..-... .... +++...... .....-..+|.+.+... . T Consensus 148 ~~~~i~d~~~~-~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (471) T protein:vir:10 148 EVIPIYSKSLD-KKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSF 226 (471) T ss_pred ceEEEEcCCCC-CceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccc Confidence 98664432211 1111 122111000 00000111222211100 0 Q ss_pred cCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHH Q lcl|NC_021072. 205 TNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLR 283 (533) Q Consensus 205 ~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm-~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~ 283 (533) +|.--+||.-.+ .|+....|-|+..+.....+.+ +=+..-.-+-+..|-+-+.-.+.-.++ +.+. T Consensus 227 ~~~~g~iPvv~~---------~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~-----~~~~ 292 (471) T protein:vir:10 227 KHDFGLVPFIPF---------KNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQ-----EFLE 292 (471) T ss_pred cCCCCceeEEEe---------ccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccc-----hhHH Confidence 000011221111 1122334556554444443332 222222334445553333222211111 1111 Q ss_pred HHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccC Q lcl|NC_021072. 284 EVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLG-ELEDVKYFQKKLYKALNVPSSRLE 362 (533) Q Consensus 284 ~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLg-ei~DV~YF~~kLy~aL~VP~sRl~ 362 (533) . +.+ .+.+.-..+| .+.|-.++.|--..+.. .-.-+.-..+.+|...++|- +. T Consensus 293 ~-~~~--~~~i~~~~~~---------------------~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~--~~ 346 (471) T protein:vir:10 293 D-LKR--YKMIKMDNDG---------------------MGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVN--PE 346 (471) T ss_pred H-hhc--CCeEEecCCC---------------------CccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcC--CC Confidence 1 111 1222111111 12223344444333333 22445667788888888883 22 Q ss_pred CCCcccccchhhhhH--HhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHH Q lcl|NC_021072. 363 TETTFNIGRAAEITR--DEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIE 440 (533) Q Consensus 363 ~~~~~~~g~~~eItR--DElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~E 440 (533) .+ ++|.+|.... -..--..-+.+.|+.|...|.++++.=+-+-|+ .+|. .+.+.|...---.+...+ T Consensus 347 ~~---~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~---~d~~----~i~i~f~~~~p~n~~e~~- 415 (471) T protein:vir:10 347 TD---KLGNSSGVALKFLYSLLELKAGNMETQFRSGYATLVKMILKHLGL---SDKL----KIKQTWTRNSINNDTEMA- 415 (471) T ss_pred cc---cccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---CCCc----eeEEEeCCCCCCCHHHHH- Confidence 22 2244443332 111122336777777777777766543322232 3453 467777755444443222 Q ss_pred HHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCC Q lcl|NC_021072. 441 IRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADD 514 (533) Q Consensus 441 i~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d 514 (533) ++++.+ .| .+|.+++++.+-..+|. +++.++|++|.....=..| ..+..++.++-+ T Consensus 416 ------~~~~kl---~g-~iS~et~~~~~p~v~D~--~~E~eri~~E~~~~~~~~~------~~~~~~~~~e~~ 471 (471) T protein:vir:10 416 ------QVVSTL---AT-ITSRENVAKSNPIVEDW--QDELRLQKAEQEGRSEKLY------DMEEVEHESEVE 471 (471) T ss_pred ------HHHHHH---hc-cCchHHHHHhCCCCCCH--HHHHHHHHHHHHHHHhccc------ccCCCCCccccC Confidence 334444 35 49999999885555543 2344555555433211111 111111111101 No 152 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=95.04 E-value=0.0029 Score=34.50 Aligned_cols=420 Identities=12% Similarity=0.084 Sum_probs=167.0 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhh---------- Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECD---------- 70 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd---------- 70 (533) |-.-=-++.+.-.+.. ..+.-+..+.... . +.--++.-..-+.+|+.+..+.+-. T Consensus 1 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~--~-----------i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~ 65 (478) T protein:vir:10 1 MISINWPWDKPYHEQV--VEQIKPKYETQEE--M-----------ILRLVREHKENIDNITMGERYYNHHPDILDAPFKR 65 (478) T ss_pred CccccccCCchhhhHH--HHHhhhccCChHH--H-----------HHHHHHHHHHHHHHHHHHHHHhcccccccccchhh Confidence 2110000000000000 0000000000000 0 0000001111122333333222211 Q ss_pred -----------------hHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhh Q lcl|NC_021072. 71 -----------------SAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYV 133 (533) Q Consensus 71 -----------------~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYv 133 (533) +-...||+-..-+ .-+.|+.+..++ +...+.|.+-|+ =+|+....++.+.+.+ T Consensus 66 ~~~~~~~~~~~~~ki~~n~~k~ivd~~~~y-l~g~p~~~~~~~----~~~~~~l~~~~~-----n~~~~~~~~~~~~~~~ 135 (478) T protein:vir:10 66 DVNGDYDETKPDWRMYTNYHQNLVDQKVAY-AVANPVTFGVDN----DKALKQIQHTLN-----HKWDDKLVDILTAASN 135 (478) T ss_pred hcccccccccccceeccchHHHHHHHHhhh-hcccCceeecCC----hHHHHHHHHHHh-----ccHHHHHHHHHHHHhh Confidence 1112223221111 125667776555 223333332222 1588888899999999 Q ss_pred cCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccc-c--------- Q lcl|NC_021072. 134 DGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLK-N--------- 203 (533) Q Consensus 134 DGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~-~--------- 203 (533) -|+.|.+.-+|. +|-..+..+||+.+-+|..-.. ......+..... ........+|.+.... + T Consensus 136 ~G~~~~~v~~d~----~~~~~~~~~~p~~~~~v~d~~~-~~~~~~~ir~~~--~~~~~~~~~y~~~~i~~~~~~~~~~~~ 208 (478) T protein:vir:10 136 KGIEWVQPYVDE----EGEFKTFRVPAEQAVPIWTNKE-RDELQAFIRVYE--LDGAERVEYWTKDDVTFYELKEGQLIP 208 (478) T ss_pred CCeEEEEEEecC----CCceEEEEEcccceEEEEcCCC-CCceEEEEEEEe--eeCceEEEEEeCCcEEEEEecCCeeec Confidence 999999877764 3567899999999877643211 111111111000 0001111122221110 0 Q ss_pred -----------------ccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceE Q lcl|NC_021072. 204 -----------------STNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRI 265 (533) Q Consensus 204 -----------------~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLr-m~EDalVIyRi~RAPeRrv 265 (533) .++.--+||.=.+ +|+....|-|+..+....-+. ++-+....-+-++.|-+-+ T Consensus 209 ~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~---------~n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~ 279 (478) T protein:vir:10 209 DFYRSEDHIQPHYYQGNKLMSWGRVPFIPF---------KNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYIL 279 (478) T ss_pred cccccccccccceecccccccCCcceEEEe---------ccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceee Confidence 0011111111111 123445577776655555554 3344444456666665444 Q ss_pred EEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchH-HHHH Q lcl|NC_021072. 266 FYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGEL-EDVK 344 (533) Q Consensus 266 fyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei-~DV~ 344 (533) .-.+..+.. + .... |..++ =+|++.-+| .+++.|-...+...+ .-+. T Consensus 280 ~g~~~~~~~--~---~~~~-~~~~~-----------------------~~~~~~~~~---~~~~~l~~~~~~~~~~~~~~ 327 (478) T protein:vir:10 280 KGYEGEDMK--D---FMHN-LKYYK-----------------------AISVAGESG---SGVDTIKVEVPIDSVKEYTK 327 (478) T ss_pred ecCCccccc--c---hhhh-hhhCc-----------------------eeEecCCCC---CcceEEeecCCHHHHHHHHH Confidence 322221111 0 0000 11111 112322222 335555544444443 3366 Q ss_pred HHHHHHHHhcCCCccccCCCCcccccch--hhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhc Q lcl|NC_021072. 345 YFQKKLYKALNVPSSRLETETTFNIGRA--AEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEH 422 (533) Q Consensus 345 YF~~kLy~aL~VP~sRl~~~~~~~~g~~--~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~ 422 (533) -+.+.+|...++|- +.. +.|. |.. ..|..-......-+.+.+..|...+.++|+.=+-+.|+ ..+|. . T Consensus 328 ~l~~~I~~~s~~p~--~~~-~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~--~~d~~----~ 397 (478) T protein:vir:10 328 MLRDYIIEFGQGVD--FQQ-DKFG-NSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRL--DVRVQ----D 397 (478) T ss_pred HHHHHHHHHhCCcC--cCc-cccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Ccccc----c Confidence 77888999999994 211 1121 222 22333322233345666666666666666654444453 23333 4 Q ss_pred eeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccccc Q lcl|NC_021072. 423 IQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDP 502 (533) Q Consensus 423 i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~ 502 (533) |.+.|...---.+.. -+++++.+. | .+|.++++.. |...++ -+++.++|++|.....-..|+.. .+ T Consensus 398 i~i~f~~~~p~~~~e-------~~~~~~~~~---g-~iS~et~i~~-~~~v~d-~~~E~~ri~~E~~~~~~~~~~~~-~~ 463 (478) T protein:vir:10 398 IEITFNFNVMVNELE-------NSQIAMNST---G-LLSKETILGN-HSWVQD-PVAEMERIEQENIELNQQLPDIE-EG 463 (478) T ss_pred ceEEeCCCCCCCHHH-------HHHHHHHHh---C-CCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHhccccC-CC Confidence 677775433322322 234444443 4 4899999976 555432 33444455555443211111000 00 Q ss_pred CCCCCCCCCCCCccc Q lcl|NC_021072. 503 AMDPGNAPPADDMSA 517 (533) Q Consensus 503 ~~~~~~~~~~~d~~~ 517 (533) ..++...+..|+... T Consensus 464 ~~d~~~~~~~d~~~e 478 (478) T protein:vir:10 464 LNDEQQRQSEDNQSE 478 (478) T ss_pred CcccccccCcCCCCC Confidence 011111111111111 No 153 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=94.88 E-value=0.0033 Score=34.22 Aligned_cols=409 Identities=11% Similarity=0.109 Sum_probs=166.8 Q ss_pred ccceeeeccccccccCCCCCCCCCcccceeecccccccc-cchhhhhhH--HHHHHHHHHhhhhcchhhh---------- Q lcl|NC_021072. 5 LFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYS-VDFDGTVRN--EYELITRYREMVLQPECDS---------- 71 (533) Q Consensus 5 ~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~-~~~~~~~~~--~~~LI~~YR~m~~~pEvd~---------- 71 (533) .+..-|-+.... ... +-+-. ......+.. ..+.+.+|+.+..+.+=.. T Consensus 1 ~~~~~~~~~~~~------------------~~~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~ 61 (479) T protein:vir:79 1 MLNIYISETDLI------------------KVQ-LKKESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYL 61 (479) T ss_pred CCCceecccceE------------------eec-cccCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccc Confidence 111111110000 000 00000 001111111 1223334444433332111 Q ss_pred -------------------HHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhh Q lcl|NC_021072. 72 -------------------AVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWY 132 (533) Q Consensus 72 -------------------AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WY 132 (533) -...||+...-+ .-+.|+.+..++ +.+++. ++.+.+ =+|+....+..+... T Consensus 62 ~~~~~~~~~~~~~~ki~~~~~~~Ivd~~~~~-l~g~p~~~~~~~----~~~~~~----~~~~~~-n~~~~~~~~~~~~~~ 131 (479) T protein:vir:79 62 LDGAKVDDFTKVNNKAINNYHKLLVDQKVGY-SVGNPIVFNADD----DNLTKL----LNDLLG-EEFDDTITELYLNAS 131 (479) T ss_pred cccccccccccCcceeecchHHHHHHHHHhh-hhcCCceeccCC----HHHHHH----HHHHHh-cCHHHHHHHHHHHHH Confidence 122233321111 124566666544 333332 222222 278999999999999 Q ss_pred hcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccC--CCcCceeEEeccceeeccchhceecccccccc------- Q lcl|NC_021072. 133 VDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQ--KRPEQLRGEDINTQLTQKAAEYYLYNPKGLKN------- 203 (533) Q Consensus 133 vDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~--~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~------- 203 (533) +-|+.|.+.-+| ++|-..+..+||+.+-+|..-.. +..-..+++.....-.....-..+|.+..... T Consensus 132 ~~G~~~~~v~~d----~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~ 207 (479) T protein:vir:79 132 NKGVEWLHPYIN----RKGEFKYVIIPAEEAIPIWDSKRQRELVAFIRFYYIEDIDGNKIKRVEYYTENDITYFIERGNS 207 (479) T ss_pred hcCeEEEEEEeC----CCCceEEEEEccceeEEEEeCCCCCceEEEEEEEEEeecCCceEEEEEEEeCCcEEEEEecCCc Confidence 999999987765 34667899999999977643211 11111111111100000001111222211100 Q ss_pred -------------------------ccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHH Q lcl|NC_021072. 204 -------------------------STNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRM-IEDSLVIYRL 257 (533) Q Consensus 204 -------------------------~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm-~EDalVIyRi 257 (533) .+|.--+||.-.+ +|+....|-++..+....-+.. +-+....-+. T Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~---------~nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~ 278 (479) T protein:vir:79 208 FIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPF---------KNNEKCVSDLTFYKSLIDIYDNNISTLADNLDE 278 (479) T ss_pred ccccccccccccccccccccccccccccCCCcccEEEe---------cCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHH Confidence 0000001111100 1234455677766665555543 3344445555 Q ss_pred hcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCc Q lcl|NC_021072. 258 SRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNL 337 (533) Q Consensus 258 ~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nL 337 (533) ++-|-.-+--.+ |. ...+.. .+.+.. .|+ .+ +++-+.+.-|.+. +. T Consensus 279 ~~~~~~v~~g~~-~~----~~~~~~--------------------~~~~~~-~~i---~~---~~~~~~~~l~~~~--~~ 324 (479) T protein:vir:79 279 IQEVIYVLKEYP-GT----SLQEFI--------------------DNIRYY-KSI---KV---DGGGGVDKLEINI--PV 324 (479) T ss_pred hhCceeeeecCC-cc----ccccch--------------------hhhhhc-cce---ec---CCCCcceEEeccC--CH Confidence 555543321111 11 000000 000000 011 01 1222233333333 33 Q ss_pred c-hHHHHHHHHHHHHHhcCCCccccCCCCcccccchhh--hhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHh Q lcl|NC_021072. 338 G-ELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAE--ITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLE 414 (533) Q Consensus 338 g-ei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~e--ItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~e 414 (533) . .-.-++-+++.+|+...+|-. ..++ +|.+|. |..-...-..-+.+.+..|...+.++++.=+-+-++.... T Consensus 325 ~~~~~~~~~l~~~i~~~s~~p~~--~~~~---~gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~ 399 (479) T protein:vir:79 325 EAKKELLDRLEKNIIIFGQGVNP--ESQN---TGDKSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISGNK 399 (479) T ss_pred HHHHHHHHHHHHHHHHHhCcccc--cccc---ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC Confidence 2 223456667778888888843 2222 243333 3232223333467777777777777776544333444333 Q ss_pred HHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCC Q lcl|NC_021072. 415 EWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIV 494 (533) Q Consensus 415 ew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~ 494 (533) +++. ..+.+.|...---.+.. -+++++.+ +| .+|++++++. |...++ .+++-++|++|.....-. T Consensus 400 ~~~~--~~i~i~f~~~~p~~~~~-------~a~~~~kl---~g-~iS~et~l~~-l~~v~d-~~~E~~ri~~E~~~~~~~ 464 (479) T protein:vir:79 400 SYDY--KTVQITFNHSMIINEAE-------KIDMAAKS---TG-IVSDETIVSN-HPWVED-VNDELERLKKQEDTQKEY 464 (479) T ss_pred cccc--ccceEEeCCCCCcCHHH-------HHHHHHHH---hc-cCcHHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHH Confidence 3332 35677776443333322 23444444 35 4899999976 565442 334445555554432110 Q ss_pred CCCcccccCCCCCCCCCCCCc Q lcl|NC_021072. 495 DPMAEMDPAMDPGNAPPADDM 515 (533) Q Consensus 495 ~p~~~~~~~~~~~~~~~~~d~ 515 (533) . +. ....+.+-.+|. T Consensus 465 ~---~~---~~~~~~~~~~e~ 479 (479) T protein:vir:79 465 D---DL---IPNNQDGVIDET 479 (479) T ss_pred H---hc---cCcccCCCcCcC Confidence 0 00 000000000011 No 154 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=94.81 E-value=0.0034 Score=34.09 Aligned_cols=438 Identities=16% Similarity=0.223 Sum_probs=170.9 Q ss_pred CC--ccccceeeeccccccccC--CCCCCCCCcccceeeccccc-------ccccchhhhhhHHHHH--------HHHHH Q lcl|NC_021072. 1 MS--NQLFGFSLERAKKVPKGP--SFVQKDSMDGSQPIVGGGYY-------GYSVDFDGTVRNEYEL--------ITRYR 61 (533) Q Consensus 1 ~~--~~~fg~~i~~~~~~~~~~--s~~~~~~~dg~~~~~~~~~~-------~~~~~~~~~~~~~~~L--------I~~YR 61 (533) |. -.|- -++-++ -+.|.-..--..++..++|. .|..++.+.....-.+ +..-- T Consensus 46 ~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la 117 (698) T protein:vir:10 46 MGRRGALN--------ALDAAPVAEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLV 117 (698) T ss_pred hccccccc--------ccccccccCCCccccccccceeccccCCccccchhhhhhcccccccccchhhhccCcchHHHHH Confidence 11 0000 000000 00000011111112111111 1111111111111111 23345 Q ss_pred hhhhcchhhhHHHHhhcceeee------cCCCc----eEEEEeccCC--CcHHHHHHHHHHHHHHHHHhcchhhhhHHHH Q lcl|NC_021072. 62 EMVLQPECDSAVDDIVNETICG------NFDDV----PVEVELSNLK--QSDKIKKLIREEFAEILRLLDFENRSYEIFR 129 (533) Q Consensus 62 ~m~~~pEvd~AvdeIvneaiv~------d~~~~----~v~v~l~~~~--~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR 129 (533) .|+|+||+++++.=|+.||+-. ..... -+.+.-+..+ .++.| ++|..||+. |++..+..+.++ T Consensus 118 ~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi-~~L~~e~er----l~V~~~l~eai~ 192 (698) T protein:vir:10 118 LLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQL-KQINDEIER----LRIRDAVRTTVI 192 (698) T ss_pred HHhhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccccccHHHH-HHHHHHHHH----HHHHHHHHHHHH Confidence 6899999999999999999532 11111 0222211222 22344 556666653 333333333333 Q ss_pred hhhhcCceeeeeeecCC--------------CCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhcee Q lcl|NC_021072. 130 RWYVDGRLFYHKVIDPK--------------NPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYL 195 (533) Q Consensus 130 ~WYvDGri~~hkvid~~--------------~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~ 195 (533) -=-+-|.-.....|+.. -.|+.++.|+.|||..+.+-- . ........ .. T Consensus 193 ~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGL~ViDp~~vtP~~-~--------n~~dP~sp--------df 255 (698) T protein:vir:10 193 HDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNN-Y--------NSINPVAD--------DF 255 (698) T ss_pred hcccccceEEEEEeecCccccccccccccccccCccceeeeeecccccccch-h--------hhccchhh--------cc Confidence 21122222111122221 235678889999998885510 0 00011111 22 Q ss_pred ccccccccccCCcceeccchhhccc-----cccccCCCCccchhHHHHHHHHHH-HHHHHHHH-HHHHHhcCccceEEEc Q lcl|NC_021072. 196 YNPKGLKNSTNQGMKIATDSVTYCH-----SGIQDLNKNMTLSHLHKAIKAVNQ-LRMIEDSL-VIYRLSRAPERRIFYI 268 (533) Q Consensus 196 y~p~~~~~~~~~~~kI~~dai~y~h-----sGl~d~~~~~i~syL~~AiK~~Nq-Lrm~EDal-VIyRi~RAPeRrvfyI 268 (533) |.|....-. + .+|+.+-+.-.. -.|...-.-.++|.+..+...+.+ ++....+. ++...+..- +.. T Consensus 256 gkP~~y~V~-G--~~IH~SRL~~~vg~pvpd~LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~~~~----l~~ 328 (698) T protein:vir:10 256 YKPSTWWMI-G--SEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSG----ILM 328 (698) T ss_pred CCCceEEEe-c--ceecceeEEEecCCCchhhhcchhccCCccHHHHHHHHHHHHHHHhhhHHHHHHHhhHHH----HHH Confidence 333221111 0 112221111000 001111122245666665554333 33333222 222211111 011 Q ss_pred cCCC-CchHHHHHHHH--HHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHHH- Q lcl|NC_021072. 269 DVGN-LPKNKAEQYLR--EVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVK- 344 (533) Q Consensus 269 DvGn-lpk~KAeqYl~--~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~- 344 (533) |... |.....++... +++++||+-. |-+--|+ -.|+|- +. ..+|+-++||. T Consensus 329 dla~aL~~g~~~~l~~R~eli~~~Rsn~------G~~llDk----~~Eefe-------------q~--st~lSGLddVi~ 383 (698) T protein:vir:10 329 DLAQALTPGANVDLSMRAELINRYRDNR------NILFLDK----ATEEFF-------------QF--NTPLSGLDALQA 383 (698) T ss_pred HHHHhcCChhhHHHHHHHHHHHHhcCcc------ceEEEec----CCcceE-------------EE--ecCcCCHHHHHH Confidence 1110 00011111111 3445554211 1110000 013332 11 24688888874 Q ss_pred HHHHHHHHhcCCCccccCC--CCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHH-----HHHhccCCCHhHHh Q lcl|NC_021072. 345 YFQKKLYKALNVPSSRLET--ETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKT-----QLILKGVMSLEEWD 417 (533) Q Consensus 345 YF~~kLy~aL~VP~sRl~~--~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~-----qLilkgi~t~eew~ 417 (533) =|..-+=-+.+||+.||=. -.|||--.-+ |.-.|+..|..+|. ..+...|++ |+-+ |- T Consensus 384 qf~q~VAgaa~IPltkLfGqSPkGlNATGE~----D~rnYYD~I~s~Qe---~~L~p~L~rl~~ii~rS~--------~G 448 (698) T protein:vir:10 384 QAQEQMSAVSHIPLIKLLGITPTGLNASSEG----EIRVWYDYVRAYQR---NALQQLMNDVIVMIQLSL--------FG 448 (698) T ss_pred HHHHHHHhhhcCchhhhhccCCcccCccchh----hHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHh--------cC Confidence 5888888899999999943 3688752222 23348999988875 334444443 3333 33 Q ss_pred hhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCC-C Q lcl|NC_021072. 418 EMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVD-P 496 (533) Q Consensus 418 ~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~-p 496 (533) .+...|.|.|+.=..-+|..-+||...+.+.-..+-.- | -++.+-|+. .+..+.. +.|.. - T Consensus 449 ~idp~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~-g-vI~~~evr~---------------rL~~d~~-s~Y~~~~ 510 (698) T protein:vir:10 449 AVDPSIKWQWNALRELDDLEVAEARYKQAQSDVLYVQE-Q-VIRPDQVAA---------------RLNTEPD-GPYAGKL 510 (698) T ss_pred CCCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHh-c-CCCHHHHHH---------------HHhccCC-Ccccccc Confidence 45567999999888888888899988888765543222 1 122222222 2222211 12211 1 Q ss_pred CcccccCCCC-----------------CCCCC-CCC-----------ccccccccCCccccchhcC Q lcl|NC_021072. 497 MAEMDPAMDP-----------------GNAPP-ADD-----------MSAQEGPAVDAGDAKRGEF 533 (533) Q Consensus 497 ~~~~~~~~~~-----------------~~~~~-~~d-----------~~~~~~~~~~~~~~~~~~~ 533 (533) +++++|+.++ ++.+. .+. .+.+..+.+++.++-..+. T Consensus 511 d~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 576 (698) T protein:vir:10 511 DANDDPGAPADDDIDGVLTYVQRMAEGGDTGAPTAPGGARAGATAPPAAANVNANANPREAGAQDA 576 (698) T ss_pred CCcccCCCCCCCcchHHHhhhcCCcCCCCcccccccccccCCCCCCcccccccCCCCccccCcccc Confidence 1222221111 00000 000 0111111111111111111 No 155 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=94.79 E-value=0.0035 Score=34.05 Aligned_cols=409 Identities=11% Similarity=0.067 Sum_probs=161.7 Q ss_pred CCccccce-ee-ecc-ccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhh Q lcl|NC_021072. 1 MSNQLFGF-SL-ERA-KKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIV 77 (533) Q Consensus 1 ~~~~~fg~-~i-~~~-~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIv 77 (533) +..-.-|. .| .+. ....+. .+..-+..+|..+ .|+..+ T Consensus 63 l~~Yy~g~~~i~~~~~~~~~~~----------~~~~ki~~n~~k~-----------------------------Iv~~~~ 103 (511) T protein:vir:10 63 LSDYYEGKTKNLVELTRRKEEY----------MADNRVAHDYASY-----------------------------ISDFIN 103 (511) T ss_pred HHHHhcccCccccccCcccccc----------cCcceeecchHHH-----------------------------HHHHHh Confidence 11111110 11 110 000000 0000011112211 111111 Q ss_pred cceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEE Q lcl|NC_021072. 78 NETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRY 157 (533) Q Consensus 78 neaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~ 157 (533) .-. -+.|+.+.+++ +.+ .+.+..+.+.-+|+....++.+.+.+-|+-|.+.-+| +.|-..+.. T Consensus 104 ~yl-----~g~p~~~~~~d----~~~----~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~d----edg~~~i~~ 166 (511) T protein:vir:10 104 GYF-----LGNPIQYQDDD----KDV----LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRN----QDDETRLYK 166 (511) T ss_pred hhh-----cccCceeecCc----hHH----HHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeC----CCCceEEEE Confidence 111 25677776544 333 3345666666789999999999999999999887665 346678999 Q ss_pred cChhhceehhhccCCCcCce---eEEecc--ceeeccchh-ceecccccccc------------------ccCCcceecc Q lcl|NC_021072. 158 IDPRKIRKVTEYQQKRPEQL---RGEDIN--TQLTQKAAE-YYLYNPKGLKN------------------STNQGMKIAT 213 (533) Q Consensus 158 lDP~~i~~vr~~~~~~~~~~---~~~~~~--~~~~~~~~e-~~~y~p~~~~~------------------~~~~~~kI~~ 213 (533) +||+.+-+|....... ... +++... +........ ..+|.+..... .+|.--+||. T Consensus 167 ~~p~~~~~vydd~~~~-~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPv 245 (511) T protein:vir:10 167 SDAMSTFVIYDNTIER-NSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPI 245 (511) T ss_pred EccceeEEEEcCCCCC-ceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcceeE Confidence 9999997765432221 111 111110 000000011 11344332110 0111112222 Q ss_pred chhhccccccccCCCCccchhHHHHHHHHHHHHHH-HHHHHHHHHhcCccceEEE---ccCCCCchHHHHHHHHHHHHhc Q lcl|NC_021072. 214 DSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMI-EDSLVIYRLSRAPERRIFY---IDVGNLPKNKAEQYLREVMGRY 289 (533) Q Consensus 214 dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~-EDalVIyRi~RAPeRrvfy---IDvGnlpk~KAeqYl~~im~~~ 289 (533) =. | +++....|-++..+....-+..+ =+....-+-++.|-+-+.- .|.+.+++.+ T Consensus 246 v~--f-------~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~------------ 304 (511) T protein:vir:10 246 TE--F-------SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQK------------ 304 (511) T ss_pred EE--e-------cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchhhccch------------ Confidence 11 1 12334456666665555444322 2222222334444333221 1111111100 Q ss_pred ccEEEeeCCCCccccccccchhHhhh-cccc--cCCCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCC Q lcl|NC_021072. 290 RNKLVYDANTGEIKDDKKFMSMLEDF-WLPR--REGGRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETET 365 (533) Q Consensus 290 rnk~vYd~~TGev~~d~~~msmlEDy-wLpR--ReggrgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~ 365 (533) ...+-.+..- +.+. ...+.|..+..|-...+...+ .-+.-+.+.+|.-.++|---.+.-+ T Consensus 305 ----------------~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~ 368 (511) T protein:vir:10 305 ----------------EANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS 368 (511) T ss_pred ----------------hccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc Confidence 0111111100 0000 011112334444333333322 3455667778888888852221111 Q ss_pred cccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhc----cCCCHhHHhhhhhceeEEEeccchHHHHHHHHH Q lcl|NC_021072. 366 TFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILK----GVMSLEEWDEMKEHIQFDFIADNYFTELKEIEI 441 (533) Q Consensus 366 ~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilk----gi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei 441 (533) | |. .|..|..-......-+.+.+..|..-+.+.++.=+-+- ++-...+|. .|.+.|...--=.+ T Consensus 369 ~-n~-Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~----~i~i~f~~~~p~d~------ 436 (511) T protein:vir:10 369 G-TQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFN----TVRYVYNRNLPKSL------ 436 (511) T ss_pred c-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccccc----eeeEEeCCCCCcCH------ Confidence 1 11 12223333333334456666666666666655422221 222234443 47788865322222 Q ss_pred HHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCC-CCCCCCCCcccccc Q lcl|NC_021072. 442 RNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDP-GNAPPADDMSAQEG 520 (533) Q Consensus 442 ~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~-~~~~~~~d~~~~~~ 520 (533) .+.++++..+ +| .+|.+++++. |...++ .+++.++|++|.... .+............ ++..++++.. T Consensus 437 -~~~~~~~~kl---~G-~iS~et~~~~-l~~v~d-~~~E~~ri~~E~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~---- 504 (511) T protein:vir:10 437 -IEELKAYIDS---GG-KISQTTLMSL-FSFFQD-PELEVKKIEEDEKES-IKKAQKGIYKDPRDINDDEQDDDTK---- 504 (511) T ss_pred -HHHHHHHHHH---hc-cCcHHHHHHh-CCCCCC-HHHHHHHHHHHHHHH-HHHHhhhcccCCCCCCCCCCCCccc---- Confidence 2345556665 35 4899999977 565442 234445555554432 11010000000000 0111111111 Q ss_pred ccCCccccchhc Q lcl|NC_021072. 521 PAVDAGDAKRGE 532 (533) Q Consensus 521 ~~~~~~~~~~~~ 532 (533) ..+ .++| T Consensus 505 --~~~---~~~~ 511 (511) T protein:vir:10 505 --DTV---DKKE 511 (511) T ss_pred --Ccc---cccC Confidence 011 1111 No 156 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=94.69 E-value=0.0037 Score=33.89 Aligned_cols=411 Identities=16% Similarity=0.200 Sum_probs=167.6 Q ss_pred CCcc-ccceeeeccccccccCCCCCCCCCccccee-----ecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHH Q lcl|NC_021072. 1 MSNQ-LFGFSLERAKKVPKGPSFVQKDSMDGSQPI-----VGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVD 74 (533) Q Consensus 1 ~~~~-~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~-----~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~Avd 74 (533) |+-+ ..||=-.. +..-...+++ ...+.... ....++....+....+ ......++|.|.+||+ T Consensus 1 ~~~~~~mg~f~r~-~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~g~~v--------~~~~al~~~~V~~~i~ 68 (432) T protein:vir:81 1 MPDEKKLGLFGQL-KAMFVPPDPV---DIGGGQTFTPVNATARDLGIIISDTGAAV--------NADAIMRLDAVAACVK 68 (432) T ss_pred CCchhhcchhhhh-hhhccccccc---ccccccccccCccchhhhcccccccCccc--------chHhhhccHHHHHHHH Confidence 4321 12211110 0000001110 00000000 0111111111111111 1133456799999999 Q ss_pred HhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcc----hhhhhHHH----HhhhhcCceeeeeeecCC Q lcl|NC_021072. 75 DIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDF----ENRSYEIF----RRWYVDGRLFYHKVIDPK 146 (533) Q Consensus 75 eIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f----~~~~~~~f----R~WYvDGri~~hkvid~~ 146 (533) -|.+.+-- .|+.|--.. +. ..++.+. .-++++|+. ...+.++. ..+..+|.-|..++-+ T Consensus 69 ~Ia~~ia~-----lp~~~y~~~-~~--g~~~~~~---~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-- 135 (432) T protein:vir:81 69 LVSQAIAA-----MPLTMYMRT-PD--GRKEAVN---HPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-- 135 (432) T ss_pred HHHHhhhh-----CceeeEEec-CC--cceeccc---chHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-- Confidence 99887543 344432111 11 1111111 234455543 23444443 4477889999886553 Q ss_pred CCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccC Q lcl|NC_021072. 147 NPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDL 226 (533) Q Consensus 147 ~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~ 226 (533) .+.+++|.+|+|..+...+. +++..++. ++......+.+ .....++|..- .. T Consensus 136 --~g~~~~L~~l~~~~v~v~~~-----~~g~~~y~----~~~~~g~~~~~-------~~~~iih~r~~----------~~ 187 (432) T protein:vir:81 136 --DGRIESLQYLANDRLTITTD-----PKGNTAYR----YRRTDGQMIDI-------PKQQIWKIMGY----------SL 187 (432) T ss_pred --CCcEEEEEEEcCCceEEEEC-----CCCcEEEE----EEecCceEEEE-------ccccEEEecCC----------CC Confidence 25699999999999865432 12111111 00000011111 11222333211 11 Q ss_pred CCCccchhHHHHHHHHHHHHHHHHHHHHHHHhc--CccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCcccc Q lcl|NC_021072. 227 NKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSR--APERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKD 304 (533) Q Consensus 227 ~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~R--APeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~ 304 (533) ++-.++|-|+.|.+.+..-..+++... +.++ +--.-|..+| +.|-+..++...+.+ +... +.|. T Consensus 188 dg~~G~spi~~~~~~i~~~~~~~~~~~--~~f~ng~~~~gil~~~-~~l~~e~~~~~~~~~-~~~~-------nag~--- 253 (432) T protein:vir:81 188 DGENGLSAIRYGAQIFGTAIAAEAQAA--RAFRNGQLQSVYYQID-RFLTDDQYDSFAKKV-SGSV-------EAGR--- 253 (432) T ss_pred CCcccccHHHHHHHHHHHHHHHHHHHH--HHHhcCCCcceEEecC-CCCCHHHHHHHHHHH-hhhh-------cCCC--- Confidence 222456899999888887766666543 2232 2223455554 555554444433321 1111 1121 Q ss_pred ccccchhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHH Q lcl|NC_021072. 305 DKKFMSMLEDFWLPRREGGRGTEISTLPG-GQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQ 383 (533) Q Consensus 305 d~~~msmlEDywLpRReggrgTEIsTLpG-g~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~ 383 (533) .+ .++ .|++++.|.= .+.+.-++-.+|....+.++++||...|+....-+.+.++.+.-.-+-|. T Consensus 254 ---~~-vl~----------~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~f~ 319 (432) T protein:vir:81 254 ---AP-LLE----------GGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFL 319 (432) T ss_pred ---ce-ecC----------CCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHHHHH Confidence 11 221 2455555532 12222334456788899999999999997654322222222222212255 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHH Q lcl|NC_021072. 384 KFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSID 463 (533) Q Consensus 384 Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~ 463 (533) ++ .|+--+. .+.+.|-..| +++.++. ...+.|. ..++.... +.+|.+.+..+-. .-++|.+ T Consensus 320 ~~--tl~P~~~-~ie~~l~~kL-----l~~~~~~----~~~~~fd----~~~llr~d-~~~r~~~~~~~~~--~G~~t~N 380 (432) T protein:vir:81 320 TM--TLSPWLR-RIEQSIALNL-----LSPAERR----RYFADFD----TSALLRAD-SAARSSYYSQLVN--NGLMTRD 380 (432) T ss_pred HH--HHHHHHH-HHHHHHHhhc-----cCccccC----ceEEEee----chhhhccC-HHHHHHHHHHHHh--CCCCCHH Confidence 43 3333222 2333344433 4445443 3456665 33433332 3567777766533 2467777 Q ss_pred HHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCC----CCCCCCccccccccCCccccchhc Q lcl|NC_021072. 464 YMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGN----APPADDMSAQEGPAVDAGDAKRGE 532 (533) Q Consensus 464 ~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~----~~~~~d~~~~~~~~~~~~~~~~~~ 532 (533) -++. .+++..-+ .- ...+. .+....|-...+. .+..++++++ +.+-.+ T Consensus 381 E~R~-~~glpp~~--g~---------~~~~~-~~~~~~pl~~~~~~~~~~~~~~~~n~~--------~~~~~~ 432 (432) T protein:vir:81 381 EARE-IEGLPKLG--GN---------AAVLT-VQSAMVPLDSIGLQASPEPASGLGNQQ--------QDKVSK 432 (432) T ss_pred HHHH-HhCCCCCC--CC---------cceEe-ecCcccchhhhccCCCCCCCCCCCCcc--------cccccC Confidence 7774 35553311 00 00000 0000001000000 0001111110 111111 No 157 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=94.61 E-value=0.004 Score=33.75 Aligned_cols=438 Identities=11% Similarity=0.045 Sum_probs=171.2 Q ss_pred ceeeeccccccccCCCCCCCCCccccee-----eccccccccc-chhh--hhhHHHHHHH--------HHHhhhhcchhh Q lcl|NC_021072. 7 GFSLERAKKVPKGPSFVQKDSMDGSQPI-----VGGGYYGYSV-DFDG--TVRNEYELIT--------RYREMVLQPECD 70 (533) Q Consensus 7 g~~i~~~~~~~~~~s~~~~~~~dg~~~~-----~~~~~~~~~~-~~~~--~~~~~~~LI~--------~YR~m~~~pEvd 70 (533) =+++++-... ...-|...- .+..|. +.. ..+. ...-...+|+ +|+.+..+.+=. T Consensus 1 ~~~~~~~~~~---------~~~~~~~~~~~~~~~n~~~~-~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~ 70 (511) T protein:vir:78 1 MLKVNEFETD---------TDLRGNINYLFNDEANVVYT-YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGK 70 (511) T ss_pred Cccccchhhh---------hhhhhhhhhhhhhhhCCccc-ccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhcc Confidence 1111111000 000011000 011111 100 0000 0111112222 233333322211 Q ss_pred hH----------------------HHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHH Q lcl|NC_021072. 71 SA----------------------VDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIF 128 (533) Q Consensus 71 ~A----------------------vdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~f 128 (533) .. ..-||+... .=.-+.|+.+.+++ +. ..+.+..+++.-+|+....++. T Consensus 71 ~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~-~yl~g~p~~~~~~d----~~----~~~~l~~~~~~n~~~~~~~~~~ 141 (511) T protein:vir:78 71 TKNLVELTRRKEEYMADNRVAHDYASYISDFIN-GYFLGNPIQYQDDD----KD----VLEAIEAFNDLNDVESHNRSLG 141 (511) T ss_pred CccccccCcccccccCcceeecchHHHHHHHHh-hhhcccCceeecCc----hH----HHHHHHHHHhhcChhHHHHHHH Confidence 11 112222211 11236777777654 32 2345667777778999999999 Q ss_pred HhhhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCcee---EEeccc-e--eeccchhceeccccccc Q lcl|NC_021072. 129 RRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLR---GEDINT-Q--LTQKAAEYYLYNPKGLK 202 (533) Q Consensus 129 R~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~---~~~~~~-~--~~~~~~e~~~y~p~~~~ 202 (533) +.+.+-|+-|.+.-+| +.|-..+..+||+.+-+|..-... ..... ++.... + ......-..+|.+.... T Consensus 142 ~~~~~~G~a~~~vy~d----~dg~~~i~~~~p~~~~~v~dd~~~-~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~ 216 (511) T protein:vir:78 142 LDLSIYGKAYELMIRN----QDDETRLYKSDAMSTFIIYDNTVE-RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVY 216 (511) T ss_pred HHHHhcCeeEEEEEeC----CCCceEEEEEcccceEEEEcCCCC-CceEEEEEEEEeeeccccccceEEEEEEEeCCcEE Confidence 9999999999987665 346688999999999776543221 11111 111100 0 00001111244443321 Q ss_pred c-c-----------------cCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHHHH-HHHHHHHHHhcCccc Q lcl|NC_021072. 203 N-S-----------------TNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMI-EDSLVIYRLSRAPER 263 (533) Q Consensus 203 ~-~-----------------~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~-EDalVIyRi~RAPeR 263 (533) . . +|.--++|.- .|+ ++....|=++..+.....+..+ =+....-+.++.|-+ T Consensus 217 ~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv--~~~-------n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~l 287 (511) T protein:vir:78 217 RYLTNRTNGLKLTPRENSFESHSFERMPIT--EFS-------NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAML 287 (511) T ss_pred EEEecCCCcccccccccccccCcCcccceE--Eec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchh Confidence 1 0 1111112211 111 1223346566555544443321 122222233344444 Q ss_pred eEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccc-cccccchhHhh---hcccccCCCCccceeecCCCCCcch Q lcl|NC_021072. 264 RIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIK-DDKKFMSMLED---FWLPRREGGRGTEISTLPGGQNLGE 339 (533) Q Consensus 264 rvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~-~d~~~msmlED---ywLpRReggrgTEIsTLpGg~nLge 339 (533) -+.=....+ .++++ +....+-.+.. +.......+.|..+..|-...+... T Consensus 288 v~~G~~~~~--------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 341 (511) T protein:vir:78 288 LIKGNLNLD--------------------------PVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQG 341 (511) T ss_pred heecCccCC--------------------------chhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHH Confidence 332211110 00111 00011100000 0011112233444555554444433 Q ss_pred H-HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCC----CHh Q lcl|NC_021072. 340 L-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVM----SLE 414 (533) Q Consensus 340 i-~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~----t~e 414 (533) + .-+.-+.+.+|.-.++|---.+.-+| |. .+..|..-......-+.+.+..|..-+.+.++.=+-+-++. ... T Consensus 342 ~e~~~~~L~~~I~~~s~~P~~~~~~~~~-n~-Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~ 419 (511) T protein:vir:78 342 TEAYKDRLNSDIHMFTNTPNMKDDNFSG-TQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANK 419 (511) T ss_pred HHHHHHHHHHHHHHHhCCcccccccccc-cc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc Confidence 2 34455677788888888532222111 11 22223333333344455666666666666555422222222 233 Q ss_pred HHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCC Q lcl|NC_021072. 415 EWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIV 494 (533) Q Consensus 415 ew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~ 494 (533) +|. .+.+.|...---.+ .+.++++..+ +| .+|.++++.. |...++ .+++.++|++|.....-. T Consensus 420 ~~~----~i~~~f~~~~p~n~-------~e~~d~~~kl---~G-~iS~et~l~~-l~~v~d-~~~El~ri~~E~~~~~~~ 482 (511) T protein:vir:78 420 DFN----TVRYVYNRNLPKSL-------IEELKAYIDS---GG-KISQTTLMSL-FSFFQD-PELEVKKIEEDEKESIKK 482 (511) T ss_pred ccc----cceEEeCCCCCcCH-------HHHHHHHHHH---hc-cCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHH Confidence 343 46788865322222 2344555555 35 4899999976 666542 344445555554432111 Q ss_pred -CCCcccccCCCCCCCCCCCCccccccccCCccccchh Q lcl|NC_021072. 495 -DPMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDAKRG 531 (533) Q Consensus 495 -~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 531 (533) ......+++....+++ +++. ....++++ T Consensus 483 ~~~~~~~~~~~~~~~~~-~~~~--------~~~~~e~~ 511 (511) T protein:vir:78 483 AQKGIYKDPRDINDDEQ-DDDT--------KDTVDKKE 511 (511) T ss_pred HhhccccCCCCCCCCCC-CCCc--------cCcccccC Confidence 0000101110001111 1100 00111111 No 158 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=94.61 E-value=0.004 Score=33.75 Aligned_cols=438 Identities=11% Similarity=0.045 Sum_probs=171.2 Q ss_pred ceeeeccccccccCCCCCCCCCccccee-----eccccccccc-chhh--hhhHHHHHHH--------HHHhhhhcchhh Q lcl|NC_021072. 7 GFSLERAKKVPKGPSFVQKDSMDGSQPI-----VGGGYYGYSV-DFDG--TVRNEYELIT--------RYREMVLQPECD 70 (533) Q Consensus 7 g~~i~~~~~~~~~~s~~~~~~~dg~~~~-----~~~~~~~~~~-~~~~--~~~~~~~LI~--------~YR~m~~~pEvd 70 (533) =+++++-... ...-|...- .+..|. +.. ..+. ...-...+|+ +|+.+..+.+=. T Consensus 1 ~~~~~~~~~~---------~~~~~~~~~~~~~~~n~~~~-~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~ 70 (511) T protein:vir:96 1 MLKVNEFETD---------TDLRGNINYLFNDEANVVYT-YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGK 70 (511) T ss_pred Cccccchhhh---------hhhhhhhhhhhhhhhCCccc-ccchhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhcc Confidence 1111111000 000011000 011111 100 0000 0111112222 233333322211 Q ss_pred hH----------------------HHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHH Q lcl|NC_021072. 71 SA----------------------VDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIF 128 (533) Q Consensus 71 ~A----------------------vdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~f 128 (533) .. ..-||+... .=.-+.|+.+.+++ +. ..+.+..+++.-+|+....++. T Consensus 71 ~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~-~yl~g~p~~~~~~d----~~----~~~~l~~~~~~n~~~~~~~~~~ 141 (511) T protein:vir:96 71 TKNLVELTRRKEEYMADNRVAHDYASYISDFIN-GYFLGNPIQYQDDD----KD----VLEAIEAFNDLNDVESHNRSLG 141 (511) T ss_pred CccccccCcccccccCcceeecchHHHHHHHHh-hhhcccCceeecCc----hH----HHHHHHHHHhhcChhHHHHHHH Confidence 11 112222211 11236777777654 32 2345667777778999999999 Q ss_pred HhhhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCcee---EEeccc-e--eeccchhceeccccccc Q lcl|NC_021072. 129 RRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLR---GEDINT-Q--LTQKAAEYYLYNPKGLK 202 (533) Q Consensus 129 R~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~---~~~~~~-~--~~~~~~e~~~y~p~~~~ 202 (533) +.+.+-|+-|.+.-+| +.|-..+..+||+.+-+|..-... ..... ++.... + ......-..+|.+.... T Consensus 142 ~~~~~~G~a~~~vy~d----~dg~~~i~~~~p~~~~~v~dd~~~-~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~ 216 (511) T protein:vir:96 142 LDLSIYGKAYELMIRN----QDDETRLYKSDAMSTFIIYDNTVE-RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVY 216 (511) T ss_pred HHHHhcCeeEEEEEeC----CCCceEEEEEcccceEEEEcCCCC-CceEEEEEEEEeeeccccccceEEEEEEEeCCcEE Confidence 9999999999987665 346688999999999776543221 11111 111100 0 00001111244443321 Q ss_pred c-c-----------------cCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHHHH-HHHHHHHHHhcCccc Q lcl|NC_021072. 203 N-S-----------------TNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMI-EDSLVIYRLSRAPER 263 (533) Q Consensus 203 ~-~-----------------~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~-EDalVIyRi~RAPeR 263 (533) . . +|.--++|.- .|+ ++....|=++..+.....+..+ =+....-+.++.|-+ T Consensus 217 ~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv--~~~-------n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~l 287 (511) T protein:vir:96 217 RYLTNRTNGLKLTPRENSFESHSFERMPIT--EFS-------NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAML 287 (511) T ss_pred EEEecCCCcccccccccccccCcCcccceE--Eec-------CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchh Confidence 1 0 1111112211 111 1223346566555544443321 122222233344444 Q ss_pred eEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccc-cccccchhHhh---hcccccCCCCccceeecCCCCCcch Q lcl|NC_021072. 264 RIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIK-DDKKFMSMLED---FWLPRREGGRGTEISTLPGGQNLGE 339 (533) Q Consensus 264 rvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~-~d~~~msmlED---ywLpRReggrgTEIsTLpGg~nLge 339 (533) -+.=....+ .++++ +....+-.+.. +.......+.|..+..|-...+... T Consensus 288 v~~G~~~~~--------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 341 (511) T protein:vir:96 288 LIKGNLNLD--------------------------PVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQG 341 (511) T ss_pred heecCccCC--------------------------chhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHH Confidence 332211110 00111 00011100000 0011112233444555554444433 Q ss_pred H-HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCC----CHh Q lcl|NC_021072. 340 L-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVM----SLE 414 (533) Q Consensus 340 i-~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~----t~e 414 (533) + .-+.-+.+.+|.-.++|---.+.-+| |. .+..|..-......-+.+.+..|..-+.+.++.=+-+-++. ... T Consensus 342 ~e~~~~~L~~~I~~~s~~P~~~~~~~~~-n~-Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~ 419 (511) T protein:vir:96 342 TEAYKDRLNSDIHMFTNTPNMKDDNFSG-TQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANK 419 (511) T ss_pred HHHHHHHHHHHHHHHhCCcccccccccc-cc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc Confidence 2 34455677788888888532222111 11 22223333333344455666666666666555422222222 233 Q ss_pred HHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCC Q lcl|NC_021072. 415 EWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIV 494 (533) Q Consensus 415 ew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~ 494 (533) +|. .+.+.|...---.+ .+.++++..+ +| .+|.++++.. |...++ .+++.++|++|.....-. T Consensus 420 ~~~----~i~~~f~~~~p~n~-------~e~~d~~~kl---~G-~iS~et~l~~-l~~v~d-~~~El~ri~~E~~~~~~~ 482 (511) T protein:vir:96 420 DFN----TVRYVYNRNLPKSL-------IEELKAYIDS---GG-KISQTTLMSL-FSFFQD-PELEVKKIEEDEKESIKK 482 (511) T ss_pred ccc----cceEEeCCCCCcCH-------HHHHHHHHHH---hc-cCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHH Confidence 343 46788865322222 2344555555 35 4899999976 666542 344445555554432111 Q ss_pred -CCCcccccCCCCCCCCCCCCccccccccCCccccchh Q lcl|NC_021072. 495 -DPMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDAKRG 531 (533) Q Consensus 495 -~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 531 (533) ......+++....+++ +++. ....++++ T Consensus 483 ~~~~~~~~~~~~~~~~~-~~~~--------~~~~~e~~ 511 (511) T protein:vir:96 483 AQKGIYKDPRDINDDEQ-DDDT--------KDTVDKKE 511 (511) T ss_pred HhhccccCCCCCCCCCC-CCCc--------cCcccccC Confidence 0000101110001111 1100 00111111 No 159 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=94.46 E-value=0.0044 Score=33.53 Aligned_cols=418 Identities=10% Similarity=0.048 Sum_probs=168.8 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccc-hhhhhhHHHHHHHHHHhhhhcchhhhH------- Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVD-FDGTVRNEYELITRYREMVLQPECDSA------- 72 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~-~~~~~~~~~~LI~~YR~m~~~pEvd~A------- 72 (533) |- .++|-+-..+..+.+.-...--...... +..-++.-...+.+|..+..+.+-+.. T Consensus 1 ~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~ 65 (474) T protein:vir:96 1 MI---------------NIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYK 65 (474) T ss_pred Cc---------------ccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccch Confidence 11 1122222222221110000000000000 000111112233444444443332221 Q ss_pred --------------------HHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhh Q lcl|NC_021072. 73 --------------------VDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWY 132 (533) Q Consensus 73 --------------------vdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WY 132 (533) ..-||+-.. .=.-+.|+.+..++. ...+ .+..+++ =+|+....++.+.++ T Consensus 66 ~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~-~yl~g~p~~~~~~~~----~~~~----~l~~~~~-n~~~~~~~~l~~~~~ 135 (474) T protein:vir:96 66 QDLHGNIDYTKPDWRITTNFHQNLVDQKV-SYVAGKPVTYAHDDD----KVLD----VIHQVLD-TRWDNKLIDILTAAS 135 (474) T ss_pred hhhcccccccccccccccchHHHHHHhhh-hhhcccCceeccCCh----HHHH----HHHHHHh-ccHHHHHHHHHHHHh Confidence 122222211 112367777766552 2222 2233332 268889999999999 Q ss_pred hcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeE---Eeccceeeccchhceecccccccc------ Q lcl|NC_021072. 133 VDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRG---EDINTQLTQKAAEYYLYNPKGLKN------ 203 (533) Q Consensus 133 vDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~---~~~~~~~~~~~~e~~~y~p~~~~~------ 203 (533) +-|+-|.+.-+|. +|-..+..+||+.+-+|..-... .....+ +.... ...+.+|.+..... T Consensus 136 ~~G~~~~~~~~d~----~~~~~i~~~~p~~~~~v~d~~~~-~~~~a~ir~~~~~~-----~~~~~vy~~~~i~~~~~~~~ 205 (474) T protein:vir:96 136 NKGIDWLQVYINE----DGELKLFRVPAEQAIPIWTDKER-EQLNAFIRIFTFNG-----ETKVEYWTAETVTYYVYENG 205 (474) T ss_pred hCCeEEEEeeeCC----CCceEEEEEcccceEEEEcCCCC-CceEEEEEEEeecC-----eeEEEEEeCCeEEEEEEcCC Confidence 9999999877753 46678999999999775432111 111111 11110 11122333322110 Q ss_pred -----------------ccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceE Q lcl|NC_021072. 204 -----------------STNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRI 265 (533) Q Consensus 204 -----------------~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLr-m~EDalVIyRi~RAPeRrv 265 (533) .+|.--+||.-.+.. +....|=|+..+.....+. ++=+....-+-++.|-+-+ T Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n---------n~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~ 276 (474) T protein:vir:96 206 GLIPDFYYGDEHIQTHFSTGSWERVPFIAFKN---------NPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYIL 276 (474) T ss_pred ceeeccccccccccCcccccCCCccceEEecC---------CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhh Confidence 111112222222222 2233455666555555444 3333343445555664433 Q ss_pred EEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcc-hHHHHH Q lcl|NC_021072. 266 FYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLG-ELEDVK 344 (533) Q Consensus 266 fyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLg-ei~DV~ 344 (533) .-.+.-++. +.+.. |..+ +++ |++ ++. .+..|-...+.+ .-.-+. T Consensus 277 ~g~~~~~~~-----~~~~~-~~~~--~~i---------------------~~~---~~~--~~~~l~~~~~~~~~~~~~~ 322 (474) T protein:vir:96 277 RGYEGEDLS-----EFMEG-LKYY--KAI---------------------NVS---SDG--GVETIQVEVPVASTKEYLD 322 (474) T ss_pred cCCCccccc-----chhhh-hhcc--cee---------------------ecc---CCC--ceeEEeccCCHHHHHHHHH Confidence 322111111 11110 1111 111 121 111 244333322222 123455 Q ss_pred HHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhcee Q lcl|NC_021072. 345 YFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQ 424 (533) Q Consensus 345 YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~ 424 (533) -+.+.+|...++|---.+.-++ +. .|..|..-......-+.+.+..|...+.++|+.=+-+-|+ ..+| ..|. T Consensus 323 ~l~~~I~~~s~~p~~~~~~~~~-n~-Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~--~~d~----~~i~ 394 (474) T protein:vir:96 323 MMRAYIVEFGQGVDFQTDKFGS-AT-SGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKI--KLDA----KEIE 394 (474) T ss_pred HHHHHHHHHhCCcCcccccccc-cc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Cccc----ceee Confidence 5677789999998432221111 11 1112322222233446667777777777776654444453 1233 4577 Q ss_pred EEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCC Q lcl|NC_021072. 425 FDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAM 504 (533) Q Consensus 425 ~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~ 504 (533) +.|...---.+.. .++++.++ | .+|.++++.. |..+++ -+++.++|++|..... + ..+.-.+. T Consensus 395 i~f~~~~p~~~~e-------~a~~~~~~----g-iiS~et~~~~-lp~v~D-~~~E~eri~~E~~~~~-~--~~~~~~~~ 457 (474) T protein:vir:96 395 ITFNFNVMVNDLE-------QSQIGAQS----Q-YLSKETLVRH-HPWVDD-PKAELERLDEEQLELN-K--QLPNLDDG 457 (474) T ss_pred EEecCCCccCHHH-------HHHHHHHc----C-CCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHHH-h--hccccccc Confidence 8886543333322 22334332 3 5899999966 666543 2333455655544321 1 00111111 Q ss_pred CCCCCCCCCCccccccccCCccccchhc Q lcl|NC_021072. 505 DPGNAPPADDMSAQEGPAVDAGDAKRGE 532 (533) Q Consensus 505 ~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 532 (533) +..+..+++ .++.+++| T Consensus 458 ~~~~~~~~~-----------~~~~~e~~ 474 (474) T protein:vir:96 458 GADGAQQQQ-----------QSENNQSK 474 (474) T ss_pred cCCCCCCcC-----------CCCccccC Confidence 111111111 11112222 No 160 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=94.46 E-value=0.0044 Score=33.53 Aligned_cols=418 Identities=10% Similarity=0.048 Sum_probs=168.8 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccc-hhhhhhHHHHHHHHHHhhhhcchhhhH------- Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVD-FDGTVRNEYELITRYREMVLQPECDSA------- 72 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~-~~~~~~~~~~LI~~YR~m~~~pEvd~A------- 72 (533) |- .++|-+-..+..+.+.-...--...... +..-++.-...+.+|..+..+.+-+.. T Consensus 1 ~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~ 65 (474) T protein:vir:95 1 MI---------------NIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYK 65 (474) T ss_pred Cc---------------ccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccch Confidence 11 1122222222221110000000000000 000111112233444444443332221 Q ss_pred --------------------HHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhh Q lcl|NC_021072. 73 --------------------VDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWY 132 (533) Q Consensus 73 --------------------vdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WY 132 (533) ..-||+-.. .=.-+.|+.+..++. ...+ .+..+++ =+|+....++.+.++ T Consensus 66 ~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~-~yl~g~p~~~~~~~~----~~~~----~l~~~~~-n~~~~~~~~l~~~~~ 135 (474) T protein:vir:95 66 QDLHGNIDYTKPDWRITTNFHQNLVDQKV-SYVAGKPVTYAHDDD----KVLD----VIHQVLD-TRWDNKLIDILTAAS 135 (474) T ss_pred hhhcccccccccccccccchHHHHHHhhh-hhhcccCceeccCCh----HHHH----HHHHHHh-ccHHHHHHHHHHHHh Confidence 122222211 112367777766552 2222 2233332 268889999999999 Q ss_pred hcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeE---Eeccceeeccchhceecccccccc------ Q lcl|NC_021072. 133 VDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRG---EDINTQLTQKAAEYYLYNPKGLKN------ 203 (533) Q Consensus 133 vDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~---~~~~~~~~~~~~e~~~y~p~~~~~------ 203 (533) +-|+-|.+.-+|. +|-..+..+||+.+-+|..-... .....+ +.... ...+.+|.+..... T Consensus 136 ~~G~~~~~~~~d~----~~~~~i~~~~p~~~~~v~d~~~~-~~~~a~ir~~~~~~-----~~~~~vy~~~~i~~~~~~~~ 205 (474) T protein:vir:95 136 NKGIDWLQVYINE----DGELKLFRVPAEQAIPIWTDKER-EQLNAFIRIFTFNG-----ETKVEYWTAETVTYYVYENG 205 (474) T ss_pred hCCeEEEEeeeCC----CCceEEEEEcccceEEEEcCCCC-CceEEEEEEEeecC-----eeEEEEEeCCeEEEEEEcCC Confidence 9999999877753 46678999999999775432111 111111 11110 11122333322110 Q ss_pred -----------------ccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceE Q lcl|NC_021072. 204 -----------------STNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRI 265 (533) Q Consensus 204 -----------------~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLr-m~EDalVIyRi~RAPeRrv 265 (533) .+|.--+||.-.+.. +....|=|+..+.....+. ++=+....-+-++.|-+-+ T Consensus 206 ~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n---------n~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~ 276 (474) T protein:vir:95 206 GLIPDFYYGDEHIQTHFSTGSWERVPFIAFKN---------NPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYIL 276 (474) T ss_pred ceeeccccccccccCcccccCCCccceEEecC---------CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhh Confidence 111112222222222 2233455666555555444 3333343445555664433 Q ss_pred EEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcc-hHHHHH Q lcl|NC_021072. 266 FYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLG-ELEDVK 344 (533) Q Consensus 266 fyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLg-ei~DV~ 344 (533) .-.+.-++. +.+.. |..+ +++ |++ ++. .+..|-...+.+ .-.-+. T Consensus 277 ~g~~~~~~~-----~~~~~-~~~~--~~i---------------------~~~---~~~--~~~~l~~~~~~~~~~~~~~ 322 (474) T protein:vir:95 277 RGYEGEDLS-----EFMEG-LKYY--KAI---------------------NVS---SDG--GVETIQVEVPVASTKEYLD 322 (474) T ss_pred cCCCccccc-----chhhh-hhcc--cee---------------------ecc---CCC--ceeEEeccCCHHHHHHHHH Confidence 322111111 11110 1111 111 121 111 244333322222 123455 Q ss_pred HHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhcee Q lcl|NC_021072. 345 YFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQ 424 (533) Q Consensus 345 YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~ 424 (533) -+.+.+|...++|---.+.-++ +. .|..|..-......-+.+.+..|...+.++|+.=+-+-|+ ..+| ..|. T Consensus 323 ~l~~~I~~~s~~p~~~~~~~~~-n~-Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~--~~d~----~~i~ 394 (474) T protein:vir:95 323 MMRAYIVEFGQGVDFQTDKFGS-AT-SGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKI--KLDA----KEIE 394 (474) T ss_pred HHHHHHHHHhCCcCcccccccc-cc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Cccc----ceee Confidence 5677789999998432221111 11 1112322222233446667777777777776654444453 1233 4577 Q ss_pred EEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCC Q lcl|NC_021072. 425 FDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAM 504 (533) Q Consensus 425 ~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~ 504 (533) +.|...---.+.. .++++.++ | .+|.++++.. |..+++ -+++.++|++|..... + ..+.-.+. T Consensus 395 i~f~~~~p~~~~e-------~a~~~~~~----g-iiS~et~~~~-lp~v~D-~~~E~eri~~E~~~~~-~--~~~~~~~~ 457 (474) T protein:vir:95 395 ITFNFNVMVNDLE-------QSQIGAQS----Q-YLSKETLVRH-HPWVDD-PKAELERLDEEQLELN-K--QLPNLDDG 457 (474) T ss_pred EEecCCCccCHHH-------HHHHHHHc----C-CCChHHHHHh-CCCCCC-HHHHHHHHHHHHHHHH-h--hccccccc Confidence 8886543333322 22334332 3 5899999966 666543 2333455655544321 1 00111111 Q ss_pred CCCCCCCCCCccccccccCCccccchhc Q lcl|NC_021072. 505 DPGNAPPADDMSAQEGPAVDAGDAKRGE 532 (533) Q Consensus 505 ~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 532 (533) +..+..+++ .++.+++| T Consensus 458 ~~~~~~~~~-----------~~~~~e~~ 474 (474) T protein:vir:95 458 GADGAQQQQ-----------QSENNQSK 474 (474) T ss_pred cCCCCCCcC-----------CCCccccC Confidence 111111111 11112222 No 161 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=94.35 E-value=0.0047 Score=33.37 Aligned_cols=188 Identities=13% Similarity=0.159 Sum_probs=87.6 Q ss_pred EEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccC--CC----Ccc-ceeecCCCCCc Q lcl|NC_021072. 265 IFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRRE--GG----RGT-EISTLPGGQNL 337 (533) Q Consensus 265 vfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRRe--gg----rgT-EIsTLpGg~nL 337 (533) ||.++. +.+-+ +...+++ ++.|.+...+ |+ ++ +.+ +..++ ..+| T Consensus 1 V~k~~~------------------l~~~~--~~~~~~~---~~r~~~~~~~----~~~~~~~~ld~~~e~~e~~--~~~l 51 (201) T protein:vir:10 1 MWKAKG------------------LADLC--DDSDGAA---RLRLAQVDNN----SGVGQAIGIDADSEEYNVL--NSDI 51 (201) T ss_pred CccchH------------------HHHHh--cCChHHH---HHHHHHHHHh----hhhhhhheeecCCcceeee--ecCc Confidence 333211 00000 0000111 1112211111 10 00 000 11111 2356 Q ss_pred chHHH-HHHHHHHHHHhcCCCccccCCC--Ccccc-cchhhhhHHhhhHHHHHHHHHHH-HHHHHHHHHHHHHHhccCCC Q lcl|NC_021072. 338 GELED-VKYFQKKLYKALNVPSSRLETE--TTFNI-GRAAEITRDEVKFQKFIARLRKR-FSELFMDLLKTQLILKGVMS 412 (533) Q Consensus 338 gei~D-V~YF~~kLy~aL~VP~sRl~~~--~~~~~-g~~~eItRDElkF~Kfi~rLr~~-fs~if~d~Lk~qLilkgi~t 412 (533) +-++| +..|...+=.+.++|+.||-.+ +|+|- |.+ |.-.|+.+|..+|.+ +..+...+++ -+.. T Consensus 52 sGl~d~l~~~~~~iaa~s~iP~t~LfG~sp~Glnatge~-----d~~nyyd~i~~~Qe~~l~p~le~l~~-----~~~~- 120 (201) T protein:vir:10 52 GGIDTFLSQKFDRIVALSGIHEIILKGKNVGGVSASQNT-----ALETFYGYVDRKRKAELLPLLEFLLP-----FIVT- 120 (201) T ss_pred CChHHHHHHHHHHHHhHhcCchhhhcCCCCccccccchh-----HHHHHHHHHHHHHHHHHHHHHHHHHH-----hhcC- Confidence 66777 4578888999999999999443 67764 333 223499999999953 3444433332 2332 Q ss_pred HhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCC Q lcl|NC_021072. 413 LEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGL 492 (533) Q Consensus 413 ~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~ 492 (533) .+.++|.|..=..=++...+||.....++++.+-.- -.+|.+-+++.+ ......+. T Consensus 121 -------~~~~~~~f~pL~~~s~kekAei~~~~a~a~~~~~~~--g~i~~~e~r~~L---------------~~~~~~~~ 176 (201) T protein:vir:10 121 -------EQEWSVEFNPLSQVSDKDKSEILEKNVNSVAALIAA--GIIDADEARDTL---------------RAISTEVK 176 (201) T ss_pred -------CCCceEeeCCCCCCCHHHHHHHHHHHHHHHHHHHHc--CCCCHHHHHHHH---------------HhcCCcCC Confidence 245789999888889999999999999988876432 133444444332 11222222 Q ss_pred CCCCCcccccCCCCC-CCCCCCCccccccccCCccccchhc Q lcl|NC_021072. 493 IVDPMAEMDPAMDPG-NAPPADDMSAQEGPAVDAGDAKRGE 532 (533) Q Consensus 493 ~~~p~~~~~~~~~~~-~~~~~~d~~~~~~~~~~~~~~~~~~ 532 (533) +++ ...++..+.. ++.|.+ .++.+ T Consensus 177 ~~~--~~~~~~~~~~e~~dp~~--------------~~~~~ 201 (201) T protein:vir:10 177 IGE--GSIQTEVVINESEDPLD--------------VSANN 201 (201) T ss_pred CCC--CCCCccccccccCCCCC--------------CCCCC Confidence 221 1111111100 011111 11111 No 162 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=94.34 E-value=0.0047 Score=33.36 Aligned_cols=412 Identities=9% Similarity=0.014 Sum_probs=160.8 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) ...-.-|-. .+-... .. .++ ...+..-+..+|. .-.|+..+.-. T Consensus 63 l~~YY~g~~--~i~~~~-~~--~~~--~~~~~~ki~~n~~-----------------------------k~Ivd~~~~yl 106 (512) T protein:vir:97 63 LSDYYEGKT--KNLVEL-TR--RKE--EYMADNRVAHDYA-----------------------------SYISDFINGYF 106 (512) T ss_pred HHHHhcccC--cccccc-Cc--ccc--cccCcceeecchH-----------------------------HHHHHHHhhhh Confidence 111112210 000000 00 000 0000000001111 11122222111 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcCh Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDP 160 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP 160 (533) -+.|+.+.+++ +.. .+.+..+++--+|+....++.+...+-|+-|.+.-+|. +|-..+..+|| T Consensus 107 -----~g~p~~~~~~d----~~~----~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~de----d~~~~i~~~~p 169 (512) T protein:vir:97 107 -----LGNPIQCQDDD----KDV----LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQ----DDETRLYKSDA 169 (512) T ss_pred -----cccCceeccCC----hHH----HHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCC----CCceEEEEEcc Confidence 24666666544 222 23456666666899999999999999999999977763 45678999999 Q ss_pred hhceehhhccCCCcCcee---EEecc--ce-eeccchhceecccccccc------------------ccCCcceeccchh Q lcl|NC_021072. 161 RKIRKVTEYQQKRPEQLR---GEDIN--TQ-LTQKAAEYYLYNPKGLKN------------------STNQGMKIATDSV 216 (533) Q Consensus 161 ~~i~~vr~~~~~~~~~~~---~~~~~--~~-~~~~~~e~~~y~p~~~~~------------------~~~~~~kI~~dai 216 (533) +.+-.|..-... ..... ++... +. ......-..+|.+..... .+|.--+||.-. T Consensus 170 ~~~~~iyd~~~~-~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~- 247 (512) T protein:vir:97 170 MSTFVIYDNTIE-RNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITE- 247 (512) T ss_pred cceEEEEcCCCC-CceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcccceEe- Confidence 999775432211 11111 11100 00 000001111343332110 011111111111 Q ss_pred hccccccccCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEe Q lcl|NC_021072. 217 TYCHSGIQDLNKNMTLSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVY 295 (533) Q Consensus 217 ~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm-~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vY 295 (533) | +++....|-++.++.....+.. +=+....-+-++.|-+-+.-....+ T Consensus 248 -~-------~nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~----------------------- 296 (512) T protein:vir:97 248 -F-------SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD----------------------- 296 (512) T ss_pred -e-------cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCC----------------------- Confidence 1 1233455777766665555443 2222223344444444332211111 Q ss_pred eCCCCccc-cccccchhHhh----hcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccc Q lcl|NC_021072. 296 DANTGEIK-DDKKFMSMLED----FWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNI 369 (533) Q Consensus 296 d~~TGev~-~d~~~msmlED----ywLpRReggrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~ 369 (533) ..++. +....+..+.+ +..+.-+++.|..+..|-...+... -.-+.-+.+.+|.-.++|---.+.-+| |. T Consensus 297 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~g-n~ 372 (512) T protein:vir:97 297 ---PVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG-TQ 372 (512) T ss_pred ---chhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccc-cc Confidence 00000 00001111111 1111111222333444433333322 234555667778888888643322111 11 Q ss_pred cchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHh---ccCC-CHhHHhhhhhceeEEEeccchHHHHHHHHHHHHH Q lcl|NC_021072. 370 GRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLIL---KGVM-SLEEWDEMKEHIQFDFIADNYFTELKEIEIRNER 445 (533) Q Consensus 370 g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLil---kgi~-t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R 445 (533) .|..|..-...-..-+.+.++.|..-+.+.++.=+-+ ++.+ .+.+|. .|.+.|...---.+ .+. T Consensus 373 -Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~----~i~~~f~~~~p~~~-------~e~ 440 (512) T protein:vir:97 373 -SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFN----TVRYVYNRNLPKSL-------IEE 440 (512) T ss_pred -hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccc----cceEEeCCCCCcCH-------HHH Confidence 1222333222233335555666666655555442222 2221 233343 47788864322222 234 Q ss_pred HHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCC---CCCcccccCCCCCCCCCCCCcccccccc Q lcl|NC_021072. 446 MNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIV---DPMAEMDPAMDPGNAPPADDMSAQEGPA 522 (533) Q Consensus 446 ~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~---~p~~~~~~~~~~~~~~~~~d~~~~~~~~ 522 (533) ++++..+ +| .+|.+++++. |...++ .+++.++|++|....+-. .+.....+..+...++..++.+. T Consensus 441 ~~~~~kl---~g-iiS~et~~~~-l~~v~d-~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 509 (512) T protein:vir:97 441 LKAYIDS---GG-KISQTTLMSL-FSFFQD-PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVD----- 509 (512) T ss_pred HHHHHHH---hc-cCchHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcccccc----- Confidence 4555555 35 4899999977 565431 223334444444332100 11111111111111111111111 Q ss_pred CCccccchh Q lcl|NC_021072. 523 VDAGDAKRG 531 (533) Q Consensus 523 ~~~~~~~~~ 531 (533) +++ T Consensus 510 ------~~~ 512 (512) T protein:vir:97 510 ------KKE 512 (512) T ss_pred ------ccC Confidence 111 No 163 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=94.11 E-value=0.0054 Score=33.03 Aligned_cols=418 Identities=12% Similarity=0.082 Sum_probs=167.5 Q ss_pred ceeeeccccccccCCCC---CCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhH----------- Q lcl|NC_021072. 7 GFSLERAKKVPKGPSFV---QKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSA----------- 72 (533) Q Consensus 7 g~~i~~~~~~~~~~s~~---~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~A----------- 72 (533) =..|..--.+..+.-++ .++..+ ...+. .--+..-..-+.+|+.+..+.+-+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~i-----------~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~ 68 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYET-QEEMI-----------IRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDNK 68 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCC-hHHHH-----------HHHHHHHHHHHHHHHHHHHHhccCCcchhccchhccc Confidence 01111100000000000 000000 00000 00011111123344444333222211 Q ss_pred ----------------HHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCc Q lcl|NC_021072. 73 ----------------VDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGR 136 (533) Q Consensus 73 ----------------vdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGr 136 (533) ...||+-..- =.=+.|+.+.+++. ...+ .++.+++ =+++....++.+...+-|+ T Consensus 69 ~~~~~~~~~~ki~~n~~~~Ivd~~~~-~l~g~p~~~~~~d~----~~~~----~l~~~~~-n~~~~~~~~~~~~~~~~G~ 138 (474) T protein:vir:96 69 GEIDPLKPDWRMFTNYHQNLVDQKVA-YAVANPVTFSSDDD----KSLK----TIQEVLN-HKWDDKLVDILTAASNKGI 138 (474) T ss_pred ccccccccchhcccchHHHHHHhhhh-hhcccCceeecCch----HHHH----HHHHHHh-cCHHHHHHHHHHHHHhcCe Confidence 1112221110 02257777776553 2222 2333332 1577788889999999999 Q ss_pred eeeeeeecCCCCCCCeEEEEEcChhhceehhhcc--CCCcCceeEEeccce-----eeccchhceeccccccc------- Q lcl|NC_021072. 137 LFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQ--QKRPEQLRGEDINTQ-----LTQKAAEYYLYNPKGLK------- 202 (533) Q Consensus 137 i~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~--~~~~~~~~~~~~~~~-----~~~~~~e~~~y~p~~~~------- 202 (533) -|.+.-+|. +|-..+..+||+.+-++..-. ++..-..+++..... .+.....+|.+...... T Consensus 139 ~~~~~y~d~----~~~~~i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~ 214 (474) T protein:vir:96 139 EWLQPYIDE----NGEFKTFRVPAEQAIPIWTNKERDTLKAFIRYYRLDGAERVEYWTDSDVTYYEYQDGILIPDYYHGE 214 (474) T ss_pred eEEEEEecC----CCceEEEEEcccceEEEEcCCCCCceEEEEEEEeecCceEEEEEeCCeEEEEEecCCceeecccccc Confidence 998876653 455779999999987654321 111111111111100 00001111111110000 Q ss_pred ----------cccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCC Q lcl|NC_021072. 203 ----------NSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFYIDVG 271 (533) Q Consensus 203 ----------~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm-~EDalVIyRi~RAPeRrvfyIDvG 271 (533) ..++.--+||.-.+. ++....|=++..+...+.+.. +-+..-.-+.++.|-+-+.-.+.. T Consensus 215 ~~~~~~~~~~~~~~~~g~iPvv~~~---------nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~ 285 (474) T protein:vir:96 215 EHIQSHYYVGNKRVSWGRVPFIPFK---------NNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQ 285 (474) T ss_pred ccccccccccccccCCCceeEEEec---------cCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcc Confidence 011111222221111 233445667666666665553 334444456666665433322211 Q ss_pred CCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchH-HHHHHHHHHH Q lcl|NC_021072. 272 NLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGEL-EDVKYFQKKL 350 (533) Q Consensus 272 nlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei-~DV~YF~~kL 350 (533) .. .+.+. -|.. ++++ ++| +.|..++.|-...++... .-+.-..+.+ T Consensus 286 ~~-----~~~~~-~~~~--~~~i---------------------~~~----~~~~~~~~l~~~~~~~~~~~~~~~l~~~i 332 (474) T protein:vir:96 286 DL-----DEFMR-NLKY--YKAI---------------------NVD----GDGSGVDTIQIEVPVQSSKEYLDMLRDYV 332 (474) T ss_pred cc-----cchhh-hhhc--CceE---------------------Eec----CCCCceeEEeecCChHHHHHHHHHHHHHH Confidence 10 00000 0111 1111 111 123345555444444332 3345667789 Q ss_pred HHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEecc Q lcl|NC_021072. 351 YKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIAD 430 (533) Q Consensus 351 y~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~D 430 (533) |+..++|---.+..++ |. .|..|..-...--.-+.+.+..|..-+.++|+.=|-+.|+ ..+| ..|.+.|... T Consensus 333 ~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~--~~~~----~~i~i~f~~~ 404 (474) T protein:vir:96 333 IEFGQGVDFQQDKFGN-SP-SGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKL--NIKV----QDVEITFNFN 404 (474) T ss_pred HHHhCCcccccccccc-cc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--Cccc----ceeeEEeccC Confidence 9999999432221111 11 1112222211122335666677777777766654444453 2223 3466777544 Q ss_pred chHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCC Q lcl|NC_021072. 431 NYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAP 510 (533) Q Consensus 431 n~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~ 510 (533) --..+.. .++++.+ .| .+|.+++++. |...++ -+++.++|++|..+..=..|. ..+....+ T Consensus 405 ~p~~~~e-------~~~~~~~----ag-~iS~et~~~~-~~~v~d-~~~E~~ri~~E~~e~~~~~~~-----~~~~~~~~ 465 (474) T protein:vir:96 405 VMVNELE-------QSQIGVQ----SQ-YLSKETVVTN-HPWVDD-PVAELERIEQDNIDFNKQLPP-----LEGDANGR 465 (474) T ss_pred CCcCHHH-------HHHHHHh----cC-CCchHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHhcccc-----cccccccc Confidence 4333322 2223333 23 5899999976 555432 345666666665432211111 11111112 Q ss_pred CCCCccccc Q lcl|NC_021072. 511 PADDMSAQE 519 (533) Q Consensus 511 ~~~d~~~~~ 519 (533) ..||...++ T Consensus 466 ~~d~~~e~~ 474 (474) T protein:vir:96 466 AQDNESETN 474 (474) T ss_pred cCCCcccCC Confidence 222222222 No 164 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=93.99 E-value=0.0057 Score=32.88 Aligned_cols=412 Identities=12% Similarity=0.067 Sum_probs=174.8 Q ss_pred CCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhh-----cchhhhH----HHHhh-cc-eeeec-----CC Q lcl|NC_021072. 23 VQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVL-----QPECDSA----VDDIV-NE-TICGN-----FD 86 (533) Q Consensus 23 ~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~-----~pEvd~A----vdeIv-ne-aiv~d-----~~ 86 (533) -.+...+- .+. .....-..-..+.+.++.|=+..+ .+.+... -.-++ |- ..+.| .- T Consensus 1 ~~~~t~~~---~~~-----~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~ 72 (456) T protein:vir:10 1 MTASTPAE---WLP-----VLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII 72 (456) T ss_pred CCCCCHHH---HHH-----HHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhc Confidence 00000000 000 000000000112222222222110 0011000 00011 11 01111 12 Q ss_pred CceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcChhhceeh Q lcl|NC_021072. 87 DVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKV 166 (533) Q Consensus 87 ~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~v 166 (533) +.||.+...+ . .+.. +++..+...-+|+....++++.-++.|+-|.+.-.| ..|-..++.+||+.+=.+ T Consensus 73 ~~~~~~~~~~--d-~~~~----~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d----~~g~~~i~~~~p~~~~~i 141 (456) T protein:vir:10 73 PNGITVGGSA--D-SDLA----LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR----DDGTATITADSPETMVVS 141 (456) T ss_pred cCCeecCCCC--C-cchH----HHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeC----CCCceEEEEEccceeEEE Confidence 4566654322 1 1122 234445555578888999999999999988764433 456667888999987544 Q ss_pred hhccCCCc--CceeEEeccce-------eeccc-hh----ceecccccc-c--cccCCcceeccchhhcccccc----cc Q lcl|NC_021072. 167 TEYQQKRP--EQLRGEDINTQ-------LTQKA-AE----YYLYNPKGL-K--NSTNQGMKIATDSVTYCHSGI----QD 225 (533) Q Consensus 167 r~~~~~~~--~~~~~~~~~~~-------~~~~~-~e----~~~y~p~~~-~--~~~~~~~kI~~dai~y~hsGl----~d 225 (533) ..-..... -..+++.-.+. ++... .. .++|..... . ......+.+.. ..|.+- +. T Consensus 142 ~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~pvv~ 217 (456) T protein:vir:10 142 VDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGD----AVVTGSPPPVVV 217 (456) T ss_pred EcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccc----cCCCCCceeEEE Confidence 32111100 00111111000 00000 00 001110000 0 00000000000 011110 11 Q ss_pred CCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCcccc Q lcl|NC_021072. 226 LNKNMTLSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKD 304 (533) Q Consensus 226 ~~~~~i~syL~~AiK~~NqLrm-~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~ 304 (533) .++...+|=++..+-....+.. +=|.++.-..+--|.|-+.-.+.+. +.. | .+|..-+ T Consensus 218 ~~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~-~~~-------------------d-~~g~~~~ 276 (456) T protein:vir:10 218 YQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGL-PNV-------------------D-ENGNAID 276 (456) T ss_pred ecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccc-ccc-------------------c-ccccccc Confidence 1234445666665554444332 2244444444444555443222211 100 0 0111000 Q ss_pred ccccchhHhh-hcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhH Q lcl|NC_021072. 305 DKKFMSMLED-FWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKF 382 (533) Q Consensus 305 d~~~msmlED-ywLpRReggrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF 382 (533) ..+....-.+ .|. +..|+.|..++... ++. ++-++-.-..+....++|..-|+...+ |. .+..|.--+..+ T Consensus 277 ~~~~~~~~~~~~~~----~~~~~~~~q~~~~~-~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~-N~-Sg~Ai~~~~~~l 349 (456) T protein:vir:10 277 YASIFEAAPGALWE----LPPGVDIWESQAND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQ-SAEGAHNIEKGF 349 (456) T ss_pred hhhhhhhhcccccc----CCCCcceEEecccC-hhHHHHHHHHHHHHHHhccCCChHHhccccc-Ch-HHHHHHHHHHHH Confidence 0000000000 132 12345566776543 443 344677777788888999888865432 22 344566666667 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccH Q lcl|NC_021072. 383 QKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSI 462 (533) Q Consensus 383 ~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~ 462 (533) -.-+.+.|+.|..-+.+.++.-+.+.|.. ++ ..+++.|..-..=+. .+.++++..+..- -..|. T Consensus 350 ~~k~~~~~~~f~~~l~~~~rl~~~~~g~~---~~----~~~~v~w~~~~~~~~-------~~~ada~~kl~~~--gi~~~ 413 (456) T protein:vir:10 350 LFKCEDRLSIAKIGLEAILVKALQIEGES---VE----DTVDVSFESPDRVTL-------GEKYSAASLAKAA--GESWA 413 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCC---cc----cceeEEecCCCCcCH-------HHHHHHHHHHHHc--CCChH Confidence 77888999999999999999888888852 12 357788854332222 2345555555432 24555 Q ss_pred HHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCC----CCCccccc Q lcl|NC_021072. 463 DYMRRQVLKQTDQEIKEIDKQIDSEREAGLIV----DPMAEMDP 502 (533) Q Consensus 463 ~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~----~p~~~~~~ 502 (533) ..+ ..+|++++++|++.+.+-.+|..+.... -|..+-+. T Consensus 414 ~~~-~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:10 414 SIR-RNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGSR 456 (456) T ss_pred HHH-HhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCCCCCCC Confidence 544 5789999999875433333332222222 12111111 No 165 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=93.99 E-value=0.0057 Score=32.88 Aligned_cols=412 Identities=12% Similarity=0.067 Sum_probs=174.8 Q ss_pred CCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhh-----cchhhhH----HHHhh-cc-eeeec-----CC Q lcl|NC_021072. 23 VQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVL-----QPECDSA----VDDIV-NE-TICGN-----FD 86 (533) Q Consensus 23 ~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~-----~pEvd~A----vdeIv-ne-aiv~d-----~~ 86 (533) -.+...+- .+. .....-..-..+.+.++.|=+..+ .+.+... -.-++ |- ..+.| .- T Consensus 1 ~~~~t~~~---~~~-----~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~ 72 (456) T protein:vir:10 1 MTASTPAE---WLP-----VLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII 72 (456) T ss_pred CCCCCHHH---HHH-----HHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhc Confidence 00000000 000 000000000112222222222110 0011000 00011 11 01111 12 Q ss_pred CceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcChhhceeh Q lcl|NC_021072. 87 DVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKV 166 (533) Q Consensus 87 ~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~v 166 (533) +.||.+...+ . .+.. +++..+...-+|+....++++.-++.|+-|.+.-.| ..|-..++.+||+.+=.+ T Consensus 73 ~~~~~~~~~~--d-~~~~----~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d----~~g~~~i~~~~p~~~~~i 141 (456) T protein:vir:10 73 PNGITVGGSA--D-SDLA----LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR----DDGTATITADSPETMVVS 141 (456) T ss_pred cCCeecCCCC--C-cchH----HHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeC----CCCceEEEEEccceeEEE Confidence 4566654322 1 1122 234445555578888999999999999988764433 456667888999987544 Q ss_pred hhccCCCc--CceeEEeccce-------eeccc-hh----ceecccccc-c--cccCCcceeccchhhcccccc----cc Q lcl|NC_021072. 167 TEYQQKRP--EQLRGEDINTQ-------LTQKA-AE----YYLYNPKGL-K--NSTNQGMKIATDSVTYCHSGI----QD 225 (533) Q Consensus 167 r~~~~~~~--~~~~~~~~~~~-------~~~~~-~e----~~~y~p~~~-~--~~~~~~~kI~~dai~y~hsGl----~d 225 (533) ..-..... -..+++.-.+. ++... .. .++|..... . ......+.+.. ..|.+- +. T Consensus 142 ~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~pvv~ 217 (456) T protein:vir:10 142 VDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGD----AVVTGSPPPVVV 217 (456) T ss_pred EcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccc----cCCCCCceeEEE Confidence 32111100 00111111000 00000 00 001110000 0 00000000000 011110 11 Q ss_pred CCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCcccc Q lcl|NC_021072. 226 LNKNMTLSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKD 304 (533) Q Consensus 226 ~~~~~i~syL~~AiK~~NqLrm-~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~ 304 (533) .++...+|=++..+-....+.. +=|.++.-..+--|.|-+.-.+.+. +.. | .+|..-+ T Consensus 218 ~~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~-~~~-------------------d-~~g~~~~ 276 (456) T protein:vir:10 218 YQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGL-PNV-------------------D-ENGNAID 276 (456) T ss_pred ecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccc-ccc-------------------c-ccccccc Confidence 1234445666665554444332 2244444444444555443222211 100 0 0111000 Q ss_pred ccccchhHhh-hcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhH Q lcl|NC_021072. 305 DKKFMSMLED-FWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKF 382 (533) Q Consensus 305 d~~~msmlED-ywLpRReggrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF 382 (533) ..+....-.+ .|. +..|+.|..++... ++. ++-++-.-..+....++|..-|+...+ |. .+..|.--+..+ T Consensus 277 ~~~~~~~~~~~~~~----~~~~~~~~q~~~~~-~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~-N~-Sg~Ai~~~~~~l 349 (456) T protein:vir:10 277 YASIFEAAPGALWE----LPPGVDIWESQAND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQ-SAEGAHNIEKGF 349 (456) T ss_pred hhhhhhhhcccccc----CCCCcceEEecccC-hhHHHHHHHHHHHHHHhccCCChHHhccccc-Ch-HHHHHHHHHHHH Confidence 0000000000 132 12345566776543 443 344677777788888999888865432 22 344566666667 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccH Q lcl|NC_021072. 383 QKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSI 462 (533) Q Consensus 383 ~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~ 462 (533) -.-+.+.|+.|..-+.+.++.-+.+.|.. ++ ..+++.|..-..=+. .+.++++..+..- -..|. T Consensus 350 ~~k~~~~~~~f~~~l~~~~rl~~~~~g~~---~~----~~~~v~w~~~~~~~~-------~~~ada~~kl~~~--gi~~~ 413 (456) T protein:vir:10 350 LFKCEDRLSIAKIGLEAILVKALQIEGES---VE----DTVDVSFESPDRVTL-------GEKYSAASLAKAA--GESWA 413 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCC---cc----cceeEEecCCCCcCH-------HHHHHHHHHHHHc--CCChH Confidence 77888999999999999999888888852 12 357788854332222 2345555555432 24555 Q ss_pred HHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCC----CCCccccc Q lcl|NC_021072. 463 DYMRRQVLKQTDQEIKEIDKQIDSEREAGLIV----DPMAEMDP 502 (533) Q Consensus 463 ~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~----~p~~~~~~ 502 (533) ..+ ..+|++++++|++.+.+-.+|..+.... -|..+-+. T Consensus 414 ~~~-~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:10 414 SIR-RNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGSR 456 (456) T ss_pred HHH-HhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCCCCCCC Confidence 544 5789999999875433333332222222 12111111 No 166 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=93.84 E-value=0.0062 Score=32.68 Aligned_cols=387 Identities=10% Similarity=0.066 Sum_probs=188.3 Q ss_pred CcccceeecccccccccchhhhhhH-----HHHHHHHHHhhhhcchhhhHHHHhhcceeeecCCCceEEEEeccCCCcHH Q lcl|NC_021072. 28 MDGSQPIVGGGYYGYSVDFDGTVRN-----EYELITRYREMVLQPECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDK 102 (533) Q Consensus 28 ~dg~~~~~~~~~~~~~~~~~~~~~~-----~~~LI~~YR~m~~~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ 102 (533) .+. +.--......||.+ ...++. ..++=..+|... +=+.-+|+-.++=+.+ ++ + ...+ +. T Consensus 1 l~~-~~~r~~~~~~yY~g-~~~~~~~~~~~p~~~~~~~~~v~--nw~~~~Vds~a~rl~~---~G--f--~~~d----~~ 65 (410) T protein:vir:95 1 MNL-YQSRVNLRYKHYAM-QHYEAPTGITIPAHIRAKYQAVL--GWAAKGVDSLADRLIF---RA--F--ANDD----FN 65 (410) T ss_pred CCc-chhhHHHHHHHhcC-CCCccccchhccHHHHhHHHhhc--chhHHHHHHhHhhhcc---cc--c--cCCC----ch Confidence 000 00000001111111 111110 111111232211 1222233333321121 11 0 0111 11 Q ss_pred HHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeE-Ee Q lcl|NC_021072. 103 IKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRG-ED 181 (533) Q Consensus 103 ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~-~~ 181 (533) ...+...=+|+....+.++.=++.|+-|.- |.- + ..|-..++.++|+.+--+.+ .......+ .. T Consensus 66 --------l~~i~~~N~ld~~~~~~~~~al~~G~sf~~-v~~--~-~d~~~~i~~~sP~~~~~i~D---p~~~~~~~al~ 130 (410) T protein:vir:95 66 --------VTEIFDRNNPDIFFDSAILSALIGSCSFVY-ISK--G-EDDEVRLQVIESSNATGVID---PITGLLVEGYA 130 (410) T ss_pred --------HHHHHhhcChHHHHHHHHHHHHHhCceeEE-Eec--C-CCCceEEEEEcccceEEEEe---CCCCceEEEEE Confidence 344555567888999999999999996654 442 2 23445788999988854332 22111111 11 Q ss_pred cc-ceeeccchhceeccccccc---------cccCCcceeccchhhccccccccCCCCccchhH-HHHHHHHHH-HHHHH Q lcl|NC_021072. 182 IN-TQLTQKAAEYYLYNPKGLK---------NSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHL-HKAIKAVNQ-LRMIE 249 (533) Q Consensus 182 ~~-~~~~~~~~e~~~y~p~~~~---------~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL-~~AiK~~Nq-Lrm~E 249 (533) .. ..-.+......+|.|.... ..+|..-++|. +.|++.. +.....+.|-+ +..+...+- -|.|. T Consensus 131 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv--V~f~n~~--~l~~~~G~s~I~~~v~~l~da~~r~~~ 206 (410) T protein:vir:95 131 VLARDDYNRPTLEAYFEPNATHFIPKDGEPYSVTNETGIPLL--VPVIHRP--DAVRPFGRSRITRAGMYYQKYAKRTLE 206 (410) T ss_pred EEEecCCCeEEEEEEEeCCcEEEEeeCCccccccCCCCCcce--EEecccc--cCCccCCccccchhHHHHHHHHHHHHH Confidence 00 0011111112233332111 01111111221 1222211 11112223323 222222222 26778 Q ss_pred HHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCcccee Q lcl|NC_021072. 250 DSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEIS 329 (533) Q Consensus 250 DalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIs 329 (533) ++++.=..+=.|.|-++=+|-..-|..+-..+ +--=..+|.-++|-+.+|. T Consensus 207 ~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~-----------------------------~~~i~~~~~~~~~~~~~v~ 257 (410) T protein:vir:95 207 RADITAEFYSWPQKYILGLDPDAEPMEKWKAT-----------------------------VSSLLTISSSDKGVKPSVG 257 (410) T ss_pred HHHHHHHHhcchhheeeccCCCCCcCchhhhh-----------------------------hhhheeccCCCCCCcceEE Confidence 88899899999999887655422122111111 1112445776777778898 Q ss_pred ecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021072. 330 TLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILK 408 (533) Q Consensus 330 TLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilk 408 (533) .++++. |+. ++-++=.-..+....++|..-|+..+. |-..+..|.-.|....+-+.+.|+.|..-+.++++.-+.+. T Consensus 258 q~~~~~-l~~~~~~l~~l~~~~a~~s~lP~~~lg~~~~-NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~ 335 (410) T protein:vir:95 258 QFTTAS-MSPFTEQLRTAAAGFAGEMGLTLDDLGFVSD-NPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLR 335 (410) T ss_pred ecCCCC-hHHHHHHHHHHHHHHhhhcCCCHHHhccccC-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 898866 443 344555556666777999888876543 32234457777888889999999999999999999988776 Q ss_pred cCCC--HhHHhhhhhceeEEEec--cchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHH Q lcl|NC_021072. 409 GVMS--LEEWDEMKEHIQFDFIA--DNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQI 484 (533) Q Consensus 409 gi~t--~eew~~~~~~i~~~f~~--Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi 484 (533) +-.. +.+|.. +.+.|.. | . ++--+.++.+.+..+..-+=.+.+.++++ +.|++|+++|..... T Consensus 336 ~~~~~~~~~~~~----~~v~W~p~~d---~---~~~s~a~~aDa~~Kl~~a~~g~~~~~~~~-~~lg~~~~~~~~~~~-- 402 (410) T protein:vir:95 336 DEFRYTRSQFVR----TAVKWEPLFE---A---DANTMTMIGDGVVKLNQALPGYINAETIR-DLTGIAGDMSAKPVV-- 402 (410) T ss_pred cCCCCcccccce----eeEEeeecCC---c---chhhHHHHHHHHHHHHHhccCCccHHHHH-HhcCCChHHHHHHHH-- Confidence 6442 344442 4555541 2 1 12234566666655544322345557766 569999998775443 Q ss_pred HHhhhcCC Q lcl|NC_021072. 485 DSEREAGL 492 (533) Q Consensus 485 ~~E~~~~~ 492 (533) ++....|- T Consensus 403 ~e~~~~g~ 410 (410) T protein:vir:95 403 SEGGSNGE 410 (410) T ss_pred HHHHhCCC Confidence 22233332 No 167 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=93.77 E-value=0.0064 Score=32.59 Aligned_cols=416 Identities=14% Similarity=0.114 Sum_probs=162.9 Q ss_pred CCccc--cceeeeccccccccCCC-CCC--CCCcccceeecccccccccchhhhhhHHHHHHHHH--HhhhhcchhhhHH Q lcl|NC_021072. 1 MSNQL--FGFSLERAKKVPKGPSF-VQK--DSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRY--REMVLQPECDSAV 73 (533) Q Consensus 1 ~~~~~--fg~~i~~~~~~~~~~s~-~~~--~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~Y--R~m~~~pEvd~Av 73 (533) |+--- .=|.=.+..+..++... -.+ ...--+........... .+.-++.... -+..| ....++|-|..|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~-~~~~~~~~~~--~~~~~~~~~al~~~~V~acv 77 (441) T protein:vir:98 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMM-VQTLPGFQGT--KLRQYKDIEAIRHSDIFTAV 77 (441) T ss_pred CceecCccceeccccccchhhhhhccccccccccccccCCCcchHHH-HHHhhccccc--CccccchhhhhccHHHHHHH Confidence 43211 11111222222222111 000 00000000000000000 0000000000 00011 1235678899999 Q ss_pred HHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcch----hhhhHH----HHhhhhcCceeeeeeecC Q lcl|NC_021072. 74 DDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFE----NRSYEI----FRRWYVDGRLFYHKVIDP 145 (533) Q Consensus 74 deIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~----~~~~~~----fR~WYvDGri~~hkvid~ 145 (533) +-|.+.+-. .|+.+- ++.. .. .-..++++|+-. -.+.++ +..+...|.-|+.++-|. T Consensus 78 ~~Ia~~iA~-----lpl~~~-~~~~--~~-------~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~ 142 (441) T protein:vir:98 78 MMIASDLAR-----MPIRVT-VNGQ--IN-------YSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDK 142 (441) T ss_pred HHHHHhhcc-----CceEEe-cCCc--cc-------ccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcC Confidence 999887553 444442 1111 11 112355555432 234444 444577899999977764 Q ss_pred CCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeecc-chhceeccccccccccCCcceeccchhhccccccc Q lcl|NC_021072. 146 KNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQK-AAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQ 224 (533) Q Consensus 146 ~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~-~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~ 224 (533) .+-+++|.+|+|..+...+.- ++.-++... .+... ...... +.+...++|.. + T Consensus 143 ---~G~~~~L~~i~~~~v~v~~~~-----~g~~~~~~~-~~~~~~~~~~~~-------~~~~dviHir~----~------ 196 (441) T protein:vir:98 143 ---TGEPMNLTFRKTSEIELKLDA-----RGRLYYFHQ-RIDSNGNNIERN-------VKFEDMLDIKF----Y------ 196 (441) T ss_pred ---CCcEEEEEEEcCceeEEEECC-----CCcEEEEEE-EeccCcceeeEE-------EccccEEEecc----C------ Confidence 445899999999999754322 111111000 00000 000011 12223333321 1 Q ss_pred cCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHH-HHHHhcccEEEeeCCCCccc Q lcl|NC_021072. 225 DLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLR-EVMGRYRNKLVYDANTGEIK 303 (533) Q Consensus 225 d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~-~im~~~rnk~vYd~~TGev~ 303 (533) ..++-.++|-|+.|.+++.....+++..-=+=..-+--+-|..++ |.+...+|.+=++ .....|. | .. T Consensus 197 ~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~-~~~~~~e~~~~~~~~~~~~~~---------G-~~ 265 (441) T protein:vir:98 197 SLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFHKSFS---------G-TK 265 (441) T ss_pred CCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC-CCCCCHHHHHHHHHHHHHHhc---------C-cc Confidence 112224568888888888877777776542222223345566666 4443334433233 3333332 2 11 Q ss_pred cccccchhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCc-ccccchhhhhHHhhh Q lcl|NC_021072. 304 DDKKFMSMLEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETT-FNIGRAAEITRDEVK 381 (533) Q Consensus 304 ~d~~~msmlEDywLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~-~~~g~~~eItRDElk 381 (533) +..+.+ .++ .|.+++.|.-. +.+.-++--+|..+.+.++++||...|+.+.. ++ .++. .+- T Consensus 266 nag~~~-vl~----------~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~s---~~q~---~~~ 328 (441) T protein:vir:98 266 QAGKVV-VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMS---ITDA---NLD 328 (441) T ss_pred ccCcce-ecC----------CCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCcc---HHHH---HHH Confidence 111122 222 24556655321 11222444566778899999999999964322 21 1222 222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhcccccc Q lcl|NC_021072. 382 FQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFS 461 (533) Q Consensus 382 F~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S 461 (533) |. ..|+--+. .+.+.|-..|. ++. . ...+.|. ..++...+ +..|.+.+..+-.- -++| T Consensus 329 y~---~tl~P~~~-~ie~~ln~~L~-----~~~--~----~~~~~fd----~~~llr~d-~~~~~~~~~~~~~~--G~~T 386 (441) T protein:vir:98 329 YL---STLKPYIT-CVCAELNFKFN-----DEY--V----NREFKFD----TTEIRVVD-EKTQAEIDKINIDS--GKMN 386 (441) T ss_pred HH---HHHHHHHH-HHHHHHHhhcc-----ccc--c----CceEEEe----chhhhccC-HHHHHHHHHHHHhC--CCcC Confidence 43 44443222 22333333332 211 1 1234454 23333222 23466666554332 3566 Q ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccc--cCC-CCC---CCCCCCCccccccccCCccccch Q lcl|NC_021072. 462 IDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMD--PAM-DPG---NAPPADDMSAQEGPAVDAGDAKR 530 (533) Q Consensus 462 ~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~--~~~-~~~---~~~~~~d~~~~~~~~~~~~~~~~ 530 (533) .+-++. .+++. .++.++...- +.+ -+. ++.........+ ....-+|.-+ T Consensus 387 ~NE~R~-~~gl~------------------pi~gGd~~~~~~~~n~~~~~~~~~~q~~~~~~~~-~~~kgGe~ne 441 (441) T protein:vir:98 387 IDEIRQ-RDGLA------------------PIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATD-KKLKGGEENE 441 (441) T ss_pred HHHHHH-HhCCC------------------CCCCCCcceEeecccccccccccccccccccccc-cccCCCCCCC Confidence 665553 24442 2222221110 000 000 000000000000 0011111111 No 168 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=93.24 E-value=0.0083 Score=32.00 Aligned_cols=374 Identities=14% Similarity=0.103 Sum_probs=162.6 Q ss_pred CCcc-ccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcc Q lcl|NC_021072. 1 MSNQ-LFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNE 79 (533) Q Consensus 1 ~~~~-~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvne 79 (533) |..- -|+|.-. +..+.+.+.... ......++..+..+ . . +...++|-|.+||+-|++. T Consensus 1 Mg~~~~~~~~~~------~~~~~~~~~~~~-~~~~~~~~~~~~~v------~-~-------~~al~~~~v~~~i~~ia~~ 59 (385) T protein:vir:10 1 MGLLTPRNFNKR------KAKNMVYPSNPA-FFTTTVGGMQLSYV------S-A-------LSALQNTNVYSVINRIASD 59 (385) T ss_pred Cccccchhcccc------cccccccccchh-hhhhhccccCcccc------C-H-------HHhhccHHHHHHHHHHHHH Confidence 5521 0121111 111111111111 01111111111111 1 1 1234578899999999988 Q ss_pred eeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHH----hhhhcCceeeeeeecCCCCCCCeEEE Q lcl|NC_021072. 80 TICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFR----RWYVDGRLFYHKVIDPKNPRGGLTEL 155 (533) Q Consensus 80 aiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR----~WYvDGri~~hkvid~~~~~~gI~el 155 (533) +-- .|+.+. + .. ...+++.=|-...+.++.+ .+.+.|.-|+.++=| ..++ T Consensus 60 ia~-----~p~~v~--~----~~--------~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~-------~~~~ 113 (385) T protein:vir:10 60 VAS-----AHFKTE--N----TA--------TLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ-------NLEH 113 (385) T ss_pred Hhh-----Cceeee--c----cc--------hhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC-------ceeE Confidence 653 355443 1 11 1223333333334444444 566889999886422 4678 Q ss_pred EEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccC-CCCccchh Q lcl|NC_021072. 156 RYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDL-NKNMTLSH 234 (533) Q Consensus 156 r~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~-~~~~i~sy 234 (533) .+++|-++++++.- ++..+ +++.+.+ ++...++.+-|.+...--.+. +.-.++|. T Consensus 114 ~p~~~~~v~~~~~~-----~~~~~--------------~~~~~~~-----~~~~~~~~~eiihik~~~~~~~~~~~G~s~ 169 (385) T protein:vir:10 114 IPNSDVQINYLPGN-----MGIVY--------------TVLESND-----RPQMVLRQDQMLHFRLMPDPQYRYLIGRSP 169 (385) T ss_pred eecCCceEEEEEcC-----CceEE--------------EEEEcCC-----ceEEEEccccEEEeccCCCCcccccccccH Confidence 88999888654321 11111 1111111 111223333332221000011 12245699 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhh Q lcl|NC_021072. 235 LHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLED 314 (533) Q Consensus 235 L~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlED 314 (533) |..|.++++....+++...=+----+--+-+..++.+-..+..+++ +++-+.+.... ...|.+ + .++ T Consensus 170 i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~-~~~~~~~~~~~----~n~~~~------~-vl~- 236 (385) T protein:vir:10 170 LESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLES-AREEFEKANTG----DNSGRL------M-VLP- 236 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHH-HHHHHHHHhCc----cccCCc------c-ccC- Confidence 9999999999999988766555555566777777754444444333 34434443211 122221 1 221 Q ss_pred hcccccCCCCccceeecCCCCCcch-HHHH-HHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHH Q lcl|NC_021072. 315 FWLPRREGGRGTEISTLPGGQNLGE-LEDV-KYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKR 392 (533) Q Consensus 315 ywLpRReggrgTEIsTLpGg~nLge-i~DV-~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~ 392 (533) .|.+++.|.-...-.+ +.+. +|-.+.+.++++||...|+...+-+. ..+.+.-...-|. ..|+-- T Consensus 237 ---------~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~-~~sn~eq~~~~~~---~~l~P~ 303 (385) T protein:vir:10 237 ---------DGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTES-QHSNIDQIKATYL---ANLNSY 303 (385) T ss_pred ---------CCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCc-ccccHHHHHHHHH---HHHHHH Confidence 2566776643222222 2233 45578899999999999975321111 1122222222233 345432 Q ss_pred HHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCC Q lcl|NC_021072. 393 FSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQ 472 (533) Q Consensus 393 fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~ 472 (533) +. .+.+.|-..|+ ++ .| .|. +.++...+ +..|.+.+..+-.- -++|.+-++. ++++ T Consensus 304 ~~-~ie~~l~~~l~-----~~--------~~--~f~----~~~ll~~d-~~~~~~~~~~~~~~--G~~T~NE~R~-~~g~ 359 (385) T protein:vir:10 304 VN-PIVDELRLKMN-----AP--------DL--ELD----IKDMLDVD-DSALINQVSNLAKS--GVLGAEQAQF-ILTR 359 (385) T ss_pred HH-HHHHHHHHhhC-----Cc--------eE--Eee----chhhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH-HhCC Confidence 22 33333444332 21 23 443 33443333 24556555544322 3555555553 3333 Q ss_pred CHHHHHHHHHHHHHhhhcCCCCCCC---cccccCCCCCCCCCCC Q lcl|NC_021072. 473 TDQEIKEIDKQIDSEREAGLIVDPM---AEMDPAMDPGNAPPAD 513 (533) Q Consensus 473 tDeeI~e~~kqi~~E~~~~~~~~p~---~~~~~~~~~~~~~~~~ 513 (533) +.+|+.+ .....+...+|+.-++ T Consensus 360 ------------------~p~p~~~~~~~~~~~~~~~~g~~~dn 385 (385) T protein:vir:10 360 ------------------SGFLPDNLPEFKPLTTQVKGGDEGDN 385 (385) T ss_pred ------------------CccCCCCCccccCcccccCCCCCCCC Confidence 1132111 1111111111111111 No 169 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=92.87 E-value=0.0097 Score=31.63 Aligned_cols=386 Identities=13% Similarity=0.129 Sum_probs=191.2 Q ss_pred CcccceeecccccccccchhhhhhHHHHHHHHHHhhhh-----c---------------chhhhHHHHhhcceeeecCCC Q lcl|NC_021072. 28 MDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVL-----Q---------------PECDSAVDDIVNETICGNFDD 87 (533) Q Consensus 28 ~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~-----~---------------pEvd~AvdeIvneaiv~d~~~ 87 (533) +|-+ .+ .+..+.-..-+.+..++..|=+... . +=+.-+|+-.+.-..+ +| T Consensus 1 m~~~-~i------~~L~~~~~~~~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~~---~G 70 (422) T protein:vir:97 1 MNYM-GM------GYLRRKLALFKTGVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRIIF---RE 70 (422) T ss_pred CChH-HH------HHHHHHHHHHHHHHHHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhcccc---ce Confidence 1110 00 0000000001112222222222210 0 1112222222211111 11 Q ss_pred ceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehh Q lcl|NC_021072. 88 VPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVT 167 (533) Q Consensus 88 ~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr 167 (533) +.+.+ + +...+...=+|+....+.++.=++.|+-|.-.--+ +..|-..++.++|+.+--+. T Consensus 71 ----f~~~d----~--------~l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~---~~~~~p~i~~~sp~~~~~i~ 131 (422) T protein:vir:97 71 ----FTNDD----F--------NAWEIFKANNPDIFFDTAIQSALIASCCFVYIMPG---AEDGLPKMQVIEASKATGIL 131 (422) T ss_pred ----eeCCc----h--------hHHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeC---CCCCeeEEEEechhhEEEEE Confidence 11111 1 23445555678888999999999999977764333 33466678999999885433 Q ss_pred hccCCCcCceeE-Eeccc-eeeccch-hceecc-------cccc-ccccCCcceeccchhhccccccccCCCCccchhH- Q lcl|NC_021072. 168 EYQQKRPEQLRG-EDINT-QLTQKAA-EYYLYN-------PKGL-KNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHL- 235 (533) Q Consensus 168 ~~~~~~~~~~~~-~~~~~-~~~~~~~-e~~~y~-------p~~~-~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL- 235 (533) +.......+ ..... ...+... ..|.++ ..+. ...+|..-++|. +.|++..-.. ...+.|-+ T Consensus 132 ---D~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv--v~~~n~~~~~--~~~G~s~I~ 204 (422) T protein:vir:97 132 ---DPTTFLLTEGYAILESDSNGNPTLEAYFTDKDIWYYPKKGKPYNIKNPTGHPLL--VPIIHRPDAV--RPFGRSRIT 204 (422) T ss_pred ---eCCCCcceeeEEEEEecCCCcEEEEEEEcCceEEEEcCCCccccccCCCCCcce--EEecccCCCc--cccCccccc Confidence 322222211 10000 0000100 111111 1110 001122122222 2222221111 11222322 Q ss_pred HHHHHHHHH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhh Q lcl|NC_021072. 236 HKAIKAVNQ-LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLED 314 (533) Q Consensus 236 ~~AiK~~Nq-Lrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlED 314 (533) +..+...+- -+.|.++++.=..+=.|.|-++=+|--.-|..+-. ..|-.= T Consensus 205 e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~-----------------------------~~~~~i 255 (422) T protein:vir:97 205 KAGMYHQKAAKRTLERAEVTAEFYSFPQKYVLGMDPDAKPMEKWR-----------------------------ATVSTL 255 (422) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcccCcccccCchhh-----------------------------hhhhhh Confidence 222222221 25678888999999999998875543211211111 112233 Q ss_pred hcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHH Q lcl|NC_021072. 315 FWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRF 393 (533) Q Consensus 315 ywLpRReggrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~f 393 (533) ..+|.-+.|-+.+|..++++. |+- ++-++-.-..+....++|.+-|+..+. |-=.+..|.-.|....+-+.+-|+.| T Consensus 256 ~~~~~de~~~~~~v~q~~~~~-l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~-NpsSa~Ai~a~~~~L~~ka~~k~~~f 333 (422) T protein:vir:97 256 LEISKDEDGDKPTVGQFTTAS-MAPFMEHLKMYASLFAGGSGLTLDDLGFPSD-NPSSVESIKAAHENLRAAGRKAQRSF 333 (422) T ss_pred hccCCCCCCCcceeeecCCCC-hhHHHHHHHHHHHHHhcccCCCHHHhccccC-chhHHHHHHHHHHHHHHHHHHHHHHH Confidence 456777777778898898866 442 333444444555556999888876553 21123457777888899999999999 Q ss_pred HHHHHHHHHHHHHhccCCC--HhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhC Q lcl|NC_021072. 394 SELFMDLLKTQLILKGVMS--LEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLK 471 (533) Q Consensus 394 s~if~d~Lk~qLilkgi~t--~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~ 471 (533) ..-+..+++.-+.+.|-.. +++|.. +.+.|. .++- .++..+.+..+.+..+..-+=.+.+.+++++. |+ T Consensus 334 g~~l~~~~rla~~~~~~~~~~~~~~~~----~~~~w~-p~~~---~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~-lg 404 (422) T protein:vir:97 334 SSGFLNVAYIAVCLRDEFPYLRNQFMD----TVIKWE-PLFE---ADANMLTLVGDGAIKLNQAIPGFMDADVIRDL-TG 404 (422) T ss_pred HHHHHHHHHHHHHHhcCCcccchhhcc----ceEEEc-cCCC---CChHHHHHHHHHHHHHHhhccccccHHHHHHH-cC Confidence 9999999999887766432 344543 566665 2221 12444566777766665543356667877755 89 Q ss_pred CCHHHHHHHHHHHHHhhhcC Q lcl|NC_021072. 472 QTDQEIKEIDKQIDSEREAG 491 (533) Q Consensus 472 ~tDeeI~e~~kqi~~E~~~~ 491 (533) +++.+++. ..+++++.+| T Consensus 405 ~~~~~~~~--~~~~~~~~d~ 422 (422) T protein:vir:97 405 VKGADKPI--PAITEVTTDG 422 (422) T ss_pred CCchhHHH--HHHHhhhccC Confidence 98876653 3446666666 No 170 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=91.81 E-value=0.014 Score=30.73 Aligned_cols=405 Identities=15% Similarity=0.161 Sum_probs=172.3 Q ss_pred CCccccceeeecc-----ccc----cccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhh Q lcl|NC_021072. 1 MSNQLFGFSLERA-----KKV----PKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDS 71 (533) Q Consensus 1 ~~~~~fg~~i~~~-----~~~----~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~ 71 (533) |-.+-+-..++-- +.+ .+..+.+.+....|.+. ..+...|.. +.. +...++|-|.. T Consensus 1 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~------v~~--------~~al~~~~v~~ 65 (424) T protein:vir:18 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVS-AHGHLGDSS------IND--------ERILQISTVWR 65 (424) T ss_pred CCCCcceEeecCCCchHHHHHhhhcccccccccccccccccc-ccccccccc------ccH--------HHhhccHHHHH Confidence 3333332222210 000 00111111112222211 111111111 111 33466788999 Q ss_pred HHHHhhcceeeecCCCceEEEE-eccCCCcHHHHHHHHHHHHHHHHHhc----chhhhhHHHHh----hhhcCceeeeee Q lcl|NC_021072. 72 AVDDIVNETICGNFDDVPVEVE-LSNLKQSDKIKKLIREEFAEILRLLD----FENRSYEIFRR----WYVDGRLFYHKV 142 (533) Q Consensus 72 AvdeIvneaiv~d~~~~~v~v~-l~~~~~S~~ik~~I~eeF~~i~~lL~----f~~~~~~~fR~----WYvDGri~~hkv 142 (533) ||+-|.+.+-. .|+.+- .+..+..+.+ .. -..++++|+ -...+.++.+. +...|.-|.-++ T Consensus 66 cv~~Ia~~iA~-----lp~~~~~~~~~~~~~~~----~~-~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~ 135 (424) T protein:vir:18 66 CVSLISTLTAC-----LPLDVFETDQNDNRKKV----DL-SNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD 135 (424) T ss_pred HHHHHHHhhcc-----CceEEEEeecCCceeee----cc-ccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEE Confidence 99998887543 444442 1111111111 00 012444443 33455555444 566799998876 Q ss_pred ecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccc Q lcl|NC_021072. 143 IDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSG 222 (533) Q Consensus 143 id~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsG 222 (533) -+. .+-+++|.+|+|..+...+ ..+... |.|...+ ....++.+-|.+.. + T Consensus 136 r~~---~G~~~~L~pl~~~~V~v~~-----~~~~~~---------------y~~~~~g------~~~~~~~~eIih~r-~ 185 (424) T protein:vir:18 136 RNS---AGDVISLLPLQSANMDVKL-----VGKKVV---------------YRYQRDS------EYADFSQKEIFHLK-G 185 (424) T ss_pred ECC---CCcEEEEEEecCcceEEEE-----cCCeEE---------------EEEEeCC------eEEEeccccEEEec-C Confidence 553 4449999999999985311 112111 1111111 11222222222111 0 Q ss_pred cccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCcc Q lcl|NC_021072. 223 IQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEI 302 (533) Q Consensus 223 l~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev 302 (533) ...++-.++|-++.|.+++.....+++..-=+----+--+-+...+-+.+.+..+++. ++.++++..- ...|. T Consensus 186 -~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~-~~~~~~~~~g----~nag~- 258 (424) T protein:vir:18 186 -FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV-EENFKEIAGG----PVKKR- 258 (424) T ss_pred -cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHHHHH-HHHHHHHhCC----cccCC- Confidence 1123335678999999998887777766543322223334566666666665554443 3333332110 01121 Q ss_pred ccccccchhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhh Q lcl|NC_021072. 303 KDDKKFMSMLEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVK 381 (533) Q Consensus 303 ~~d~~~msmlEDywLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElk 381 (533) .+ .++ .|++++.|.=. +.+.-++-.+|..+.+.++++||...|+...+-+.. ++.+.-..+. T Consensus 259 -----~~-vl~----------~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~-~sn~eq~~~~ 321 (424) T protein:vir:18 259 -----LW-ILE----------AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSW-GSGIEQQNLG 321 (424) T ss_pred -----ce-ecc----------CCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccc-cccHHHHHHH Confidence 11 121 25566666322 222224445677889999999999999654322211 1112222233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhcccccc Q lcl|NC_021072. 382 FQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFS 461 (533) Q Consensus 382 F~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S 461 (533) |.+++ |+- +...+.+.|-.. ++++.++. ...+.|..+ .+... -+..|.+.+..+-.- -++| T Consensus 322 f~~~t--l~P-~~~~ie~~l~~~-----L~~~~~~~----~~~~~fd~~----~llr~-d~~~r~~~~~~~~~~--G~~T 382 (424) T protein:vir:18 322 FLQYT--LQP-YISRWENSIQRW-----LIPAKDVG----RIHAEHNLD----GLLRG-DSASRAAFMKAMGEA--GLRT 382 (424) T ss_pred HHHHH--HHH-HHHHHHHHHHhh-----cCCccccC----CeEEEEech----hhhcc-CHHHHHHHHHHHHhC--CCcC Confidence 65542 322 122233334333 34445443 245556533 33221 234566666655322 3555 Q ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccccccCCcc Q lcl|NC_021072. 462 IDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPAVDAG 526 (533) Q Consensus 462 ~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~ 526 (533) .+-++.. +++.. + |+.+ .-.-..+..|.++...+..|+-+.+ T Consensus 383 ~NE~R~~-~gl~p------------------i--~gGD--~~~~~~n~~~l~~~~~~~~p~~~ga 424 (424) T protein:vir:18 383 INEMRRT-DNLPP------------------L--PGGD--VAMRQSQYVPITDLGTNKEPRNNGA 424 (424) T ss_pred HHHHHHH-hCCCC------------------C--CCcC--eeeeccCccchHhhhccCCCccCCC Confidence 5555532 33321 1 1111 0000111111111111111111111 No 171 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=91.46 E-value=0.016 Score=30.47 Aligned_cols=435 Identities=11% Similarity=0.052 Sum_probs=171.4 Q ss_pred CCccccceeeeccccccccCC--CCCCCCCcccceeecccccccccchh-------hhhhHH-HHHHHHHHhhhhcchhh Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPS--FVQKDSMDGSQPIVGGGYYGYSVDFD-------GTVRNE-YELITRYREMVLQPECD 70 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s--~~~~~~~dg~~~~~~~~~~~~~~~~~-------~~~~~~-~~LI~~YR~m~~~pEvd 70 (533) |+ +|.+-....+...+... .+...+.. .....+.++.+....+. ....+. ..-+ .=+..+++|.|. T Consensus 1 M~--~~~~l~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~g~~v-~~~~a~~~~~v~ 76 (466) T protein:vir:81 1 MR--LIDRLLSTRGAAPRMSIDDYAQMLNEF-AFNGIGYGFGGGVPRIQQTLAGPSTELAPDTFVGL-ATQAYQANGPVF 76 (466) T ss_pred Cc--hhHHHhhccCcccccchhhhhhhhhhh-hccccccccccccHHHHHhhccccccccCcccccc-chhhhhccHHHH Confidence 33 22221111111111100 00000000 00000000000000000 000000 0000 112245689999 Q ss_pred hHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHH-HHh---cchhhhhHHHHh----hhhcCceeeeee Q lcl|NC_021072. 71 SAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEIL-RLL---DFENRSYEIFRR----WYVDGRLFYHKV 142 (533) Q Consensus 71 ~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~-~lL---~f~~~~~~~fR~----WYvDGri~~hkv 142 (533) .||+-|.+.+-. .|+.+.=..-+..+.+ .++.+ .|+ |-...+.++.+. +.+.|.-|..++ T Consensus 77 ~~i~~Ia~~ia~-----lp~~~~~~~~~~~~~~-------~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~ 144 (466) T protein:vir:81 77 ACMLVRQLVFSS-----VRFRWQRLRDGKPSDT-------FGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIV 144 (466) T ss_pred HHHHHHHHhhcc-----CceEEEEecCCceeec-------cccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEE Confidence 999999988654 4555432111111112 12222 222 333455565444 556799999977 Q ss_pred ecCC-----CCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhh Q lcl|NC_021072. 143 IDPK-----NPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVT 217 (533) Q Consensus 143 id~~-----~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~ 217 (533) -+.. ...+-+++|.+|+|..+...... +....- .|.|...+.. .......++.+-+. T Consensus 145 r~~~g~l~~~~~g~~~~l~~l~~~~v~~~~~~---~~~~~~--------------~y~~~~~~~~-~~~~~~~~~~~dvi 206 (466) T protein:vir:81 145 DGEFVRMRPDWVDVVVEERMVRGGRGELGGGQ---LGWRKV--------------GYLYTEGGRQ-SGNESVGFLAEDVV 206 (466) T ss_pred ecCccccccccCcceeEEEEecCcceEEEEcC---CCceEE--------------EEEEEecCcc-cccceeeeccccEE Confidence 6532 12344789999999998653321 111100 0111111100 00111122222221 Q ss_pred cccccccc-CCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEee Q lcl|NC_021072. 218 YCHSGIQD-LNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYD 296 (533) Q Consensus 218 y~hsGl~d-~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd 296 (533) +.. ++-+ .++-.++|.+..|++++.....+++...=+=---+--.-|+..+ +.|.+..+++..+.+...|+-- T Consensus 207 Hir-~~~~~~d~~~G~s~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g~---- 280 (466) T protein:vir:81 207 HFA-PIPDPLASYRGMSWLTPILREIRADQAMSKHQAKFFDNGATVNLVIKHN-PMADPAAVKKWADEVNSKHAGV---- 280 (466) T ss_pred EEc-CCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecC-CCCCHHHHHHHHHHHHHHhcCc---- Confidence 111 1111 12224569999999998888777766543333334445566665 4577766666555555555320 Q ss_pred CCCCccccccccchhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCc---ccccch Q lcl|NC_021072. 297 ANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETT---FNIGRA 372 (533) Q Consensus 297 ~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~---~~~g~~ 372 (533) ...|.+ + .++ -|.+++.|.-. ..+.-++-.++..+.+.++.+||...|+-..+ -...+. T Consensus 281 ~n~g~~------~-vl~----------~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~~~st~sn~ 343 (466) T protein:vir:81 281 DNAWKN------L-NLY----------PGADADVVGSNLQEIDFKNVRGGGETRIAAAAGVPPVIVGLSEGLAAATYSNY 343 (466) T ss_pred cccccc------e-EcC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccCCCccccccH Confidence 111221 1 121 24566666432 12222445568899999999999999964321 222334 Q ss_pred hhhhHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 373 AEITRDEVKFQKFI-ARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNT 451 (533) Q Consensus 373 ~eItRDElkF~Kfi-~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~ 451 (533) ++..+. |.+++ .-+-.++.. .|-..|+. ..+. ..+.++|..+ ++-... ++.|.++... T Consensus 344 eq~~~~---f~~~tl~P~~~~ie~----~l~~~L~~-----~~~~----~~~~~~f~~~----~llr~d-~~~r~~~~~~ 402 (466) T protein:vir:81 344 GQARRR---LADGTAHPLWQNLSG----CIGHVMPD-----MGPD----VRLWYDADDV----PFLRED-EKDAADIQKV 402 (466) T ss_pred HHHHHH---HHHHHHHHHHHHHHH----HHHhhcCC-----cccC----cceEEEecch----hhhccC-HHHHHHHHHH Confidence 454444 66553 444444433 33333332 2221 2345666633 222221 2344443221 Q ss_pred hhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccC--CCCCCCCCCCCccccccccCCccc Q lcl|NC_021072. 452 MDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPA--MDPGNAPPADDMSAQEGPAVDAGD 527 (533) Q Consensus 452 ~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~--~~~~~~~~~~d~~~~~~~~~~~~~ 527 (533) ...+ ...++... +|..|+-.. .. -.+..+..|. ...+. -..+...+......+...+-++++ T Consensus 403 ~~~~------~~~~~~~g--~t~nE~r~~----~~-~gd~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~Gg~~ngn 466 (466) T protein:vir:81 403 RAET------INTLITAG--YEPESVVAA----VN-SGDLRLLKHT-GLTSVQLLPPGVSASASSDTPTSGGADDNGN 466 (466) T ss_pred HHHH------HHHHHHcC--CChhhcccc----cc-CCccccccCC-CcchhhhcccccccccCCCCcccCCCCcCCC Confidence 1111 12222222 244333321 11 0111111110 00000 000110000101111112222222 No 172 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=91.18 E-value=0.017 Score=30.28 Aligned_cols=402 Identities=10% Similarity=0.063 Sum_probs=172.1 Q ss_pred ccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcceeeec Q lcl|NC_021072. 5 LFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNETICGN 84 (533) Q Consensus 5 ~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvneaiv~d 84 (533) .|.-.....+......| .++-.....++.. +.....+.. ...+++|-|.+||+-|.+.+-- T Consensus 1 ~~~~r~~~~~~~~~~~~------~~~~~~~~~g~~~---s~~~~~vt~--------~~al~~~~v~~~v~~ia~~iA~-- 61 (419) T protein:vir:14 1 MFFSRQLLSNLGQTQMS------AGGWVSALLGSSR---SDSGQVVTP--------ASALALTVLQNCVTLLAESIAQ-- 61 (419) T ss_pred CcccccccccccccccC------cchhhHHhhcCCC---ccCCcccch--------HHhhccHHHHHHHHHHHHhhcc-- Confidence 33322222111111111 1111111111111 000000110 1235788899999999987542 Q ss_pred CCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcc----hhhhhHHHHh----hhhcCceeeeeeecCCCCCCCeEEEE Q lcl|NC_021072. 85 FDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDF----ENRSYEIFRR----WYVDGRLFYHKVIDPKNPRGGLTELR 156 (533) Q Consensus 85 ~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f----~~~~~~~fR~----WYvDGri~~hkvid~~~~~~gI~elr 156 (533) .|+.|--..-+..+.+ ....+.++|+. ...+.++.+. +.+.|.-|..++-|. .+-+++|. T Consensus 62 ---lp~~~~~~~~~~~~~~------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~---~G~~~~l~ 129 (419) T protein:vir:14 62 ---LPIELYERSGEDRKPA------TDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDS---DGVIQGLY 129 (419) T ss_pred ---CceEEEEecCCccccc------cccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECC---CCcEEEEE Confidence 4554431111111111 12345555543 2355565554 667799888876654 45589999 Q ss_pred EcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCccchhHH Q lcl|NC_021072. 157 YIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLH 236 (533) Q Consensus 157 ~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~ 236 (533) +|+|..+...+.. ++..++ .+..... ......+++.. ...++-.++|-+. T Consensus 130 pl~~~~v~v~~~~-----~~~~~y--------------~~~~~~~-~~~~~i~h~~~----------~~~dg~~G~s~i~ 179 (419) T protein:vir:14 130 PLDNEAVTVMRGS-----DLKPVY--------------RVRGSDP-MPQRLVHHVRW----------MSINGYTGLSPVL 179 (419) T ss_pred EecCceEEEEECC-----CceEEE--------------EEccCcc-cchhheeEecC----------cCCCCcccccHHH Confidence 9999999653221 111111 1110000 01111122211 1222334678899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCc--hHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhh Q lcl|NC_021072. 237 KAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLP--KNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLED 314 (533) Q Consensus 237 ~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlp--k~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlED 314 (533) .|..++.....+++...=+----+--+-++..+...-+ ..++.+-+++.+++ .. +| ..+..+.+ .++ T Consensus 180 ~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~---~~-----~g-~~nag~~~-vl~- 248 (419) T protein:vir:14 180 LHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGWNA---KF-----GG-SGNAKKVA-LLQ- 248 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHH---Hh-----cC-ccccCCce-ecC- Confidence 99998888887777655544444666777777642211 22332223332222 11 11 11111111 221 Q ss_pred hcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHH Q lcl|NC_021072. 315 FWLPRREGGRGTEISTLPGGQNLGE---LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRK 391 (533) Q Consensus 315 ywLpRReggrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~ 391 (533) .|.+++.|. .+..+ ++--++..+.+.++++||...++...+-+.....+..+. |..++ |+- T Consensus 249 ---------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~E~~~~~---f~~~~--L~P 312 (419) T protein:vir:14 249 ---------EGMTFRPLS--MTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQ---FVIYT--LLP 312 (419) T ss_pred ---------CCceEEEcc--CChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHH---HHHHH--HHH Confidence 245666663 22333 333457789999999999999975433333333343333 44432 222 Q ss_pred HHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhC Q lcl|NC_021072. 392 RFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLK 471 (533) Q Consensus 392 ~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~ 471 (533) -+. .+. +-+-++++++.++.. ..+.|.. ..+... -+..|.+.++.+-.- -++|.+-++. .++ T Consensus 313 ~~~-~ie-----~~l~~kll~~~~~~~----~~i~fd~----~~l~r~-d~~~~~~~~~~~~~~--G~~T~NE~R~-~~g 374 (419) T protein:vir:14 313 WVK-RHE-----QAKTRDLLLPSERKQ----YFIEYNL----AGLLRG-DQSSRYAAYAVGRQW--GWLSINDIRR-LEN 374 (419) T ss_pred HHH-HHH-----HHHhhhccCccccCC----eEEEEec----hhhhcc-CHHHHHHHHHHHHhC--CCcCHHHHHH-HhC Confidence 111 111 222234456665542 3455542 222221 234566666654222 3455555553 233 Q ss_pred CCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCC-C-CCCCCCCCccccccccCCccccchhcC Q lcl|NC_021072. 472 QTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMD-P-GNAPPADDMSAQEGPAVDAGDAKRGEF 533 (533) Q Consensus 472 ~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~-~-~~~~~~~d~~~~~~~~~~~~~~~~~~~ 533 (533) +.+ ++.-+.-..|.+- . +...+.+++. ....+++.++.+.+ T Consensus 375 l~p------------------~~gGD~~~~~~n~~~~~~~~~~~~~~---~~~~~~~~~e~~~~ 417 (419) T protein:vir:14 375 MPP------------------VKGGDIYLSPMNMVDASKPQQLPVGK---SEPTKAAIDEIGRI 417 (419) T ss_pred CCC------------------CCCcCeeeeccccccccccccccCCC---CCCccccccchhcc Confidence 321 1111111111111 0 1111111111 11123344444444 No 173 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=91.06 E-value=0.018 Score=30.19 Aligned_cols=387 Identities=16% Similarity=0.162 Sum_probs=162.0 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |+ +|.. +......++ +.-..+.++...+.++.. .-+++|.|..||+-|.+.+ T Consensus 1 m~--~f~~---------~~~~~~~~~--~~~~~~~~~~~~~~~~~~---------------~Al~~~~V~~~i~~Ia~~i 52 (406) T protein:vir:97 1 MS--FFQP---------LGTSKVSYD--DYISSVLAGDVSQKYLGV---------------SALKNSDILTATSIIAGDI 52 (406) T ss_pred Cc--cccc---------cCCCCCCcc--hHHHHHhcCCCCcccccc---------------hhhccHHHHHHHHHHHHhh Confidence 33 3321 111111111 110111111111111110 0245789999999999886 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcch----hhhhHHH----HhhhhcCceeeeeeecCCCCCCCe Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFE----NRSYEIF----RRWYVDGRLFYHKVIDPKNPRGGL 152 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~----~~~~~~f----R~WYvDGri~~hkvid~~~~~~gI 152 (533) -. .|+.+.-.+ . + ++.+ ..+..+|+.. ..+.++. ..+.+.|.-|.-++-|. ..+.+ T Consensus 53 A~-----lp~~~~~~~--g-~----~~~~--~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~--~~g~~ 116 (406) T protein:vir:97 53 AR-----FPLVKKDVN--G-D----IIHD--EDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDP--KTNQA 116 (406) T ss_pred hh-----CeeEEEecC--c-c----cccc--chHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC--CCCeE Confidence 54 344332111 1 1 1111 2355566432 2444444 44677899998766543 23458 Q ss_pred EEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceecc-ccc-c--ccccCCcceeccchhhccccccccCCC Q lcl|NC_021072. 153 TELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYN-PKG-L--KNSTNQGMKIATDSVTYCHSGIQDLNK 228 (533) Q Consensus 153 ~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~-p~~-~--~~~~~~~~kI~~dai~y~hsGl~d~~~ 228 (533) .+|.+++|..++..+.- +.... |.|. +.+ . .....+.++|+- ...++ T Consensus 117 ~~L~~i~p~~v~v~~~~-----~~~~~--------------y~~~~~~~~~~~~~~~~evih~r~----------~~~dg 167 (406) T protein:vir:97 117 LQFQFYRPSETTVEETD-----NHEIV--------------YTFTDMLTAKQVKCFAHDVIHWKF----------FSHDT 167 (406) T ss_pred EEEEEECCCeeEEEEcC-----CceEE--------------EEEEecCCceEEEEccccEEEecC----------CCCCC Confidence 89999999999653221 11101 1111 111 0 112223333431 11222 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCcccccccc Q lcl|NC_021072. 229 NMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKF 308 (533) Q Consensus 229 ~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~ 308 (533) -.++|-|..|.+++.....+++...=+----++- .++...-+.|-+..+++. ++-+.++.. ...+|.+ T Consensus 168 ~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~-~~i~~~~~~l~~e~~~~~-~~~~~~~~~----g~n~g~~------ 235 (406) T protein:vir:97 168 ILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSS-GILTMKGAQLSGDARQRA-RQEFEKMRE----GSVGGSP------ 235 (406) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCC-ceEEecCCCCCHHHHHHH-HHHHHHHhc----ccccCce------ Confidence 2367889999998888777777655443444553 455555566655544443 333333211 0122322 Q ss_pred chhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHH Q lcl|NC_021072. 309 MSMLEDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIA 387 (533) Q Consensus 309 msmlEDywLpRReggrgTEIsTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~ 387 (533) + .++ .|.+++.|.-..+-.| ++--+|-.+.+-++.+||...|+..+..+ ..++..+. |.+++ T Consensus 236 ~-vl~----------~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~~--~~e~~~~~---f~~~~- 298 (406) T protein:vir:97 236 L-VFD----------STMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNSPNQ--SVAQLMED---YVTND- 298 (406) T ss_pred e-ecC----------CCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCCCcc--hHHHHHHH---HHHHH- Confidence 1 111 2456666642221111 22223336778889999999997543321 23343433 65542 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHH Q lcl|NC_021072. 388 RLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRR 467 (533) Q Consensus 388 rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k 467 (533) |+-.+. .+.+.|-..| .++.+|.. -.|+|++.. +++.|++.+..+-. +-.+|.+-++. T Consensus 299 -l~P~~~-~ie~~l~~kl-----l~~~~~~~--~~i~fd~~~-----------~~~~~~~~~~~~~~--~g~~T~NE~R~ 356 (406) T protein:vir:97 299 -LPFYFD-AITSELGLKT-----LNDKDRRL--YHIEFDTRS-----------VTGRNVDEIVKLVN--NQILTPNQGLV 356 (406) T ss_pred -HHHHHH-HHHHHHhhhh-----cChhhccc--eeEEEecCc-----------cchhhHHHHHHHHh--CCCcCHHHHHH Confidence 333222 2233333333 45555542 223443321 24455554443311 12344443332 Q ss_pred HHhCCCHHHHHHHHHHHHHhhhcCCCCCCCccc--ccCC-CCCCC--CCCCCcc-ccccccCCccccch Q lcl|NC_021072. 468 QVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEM--DPAM-DPGNA--PPADDMS-AQEGPAVDAGDAKR 530 (533) Q Consensus 468 ~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~--~~~~-~~~~~--~~~~d~~-~~~~~~~~~~~~~~ 530 (533) . |+ .+.++.|+.+. -|.+ -+.+. .+.+..+ ...+...++.+.+- T Consensus 357 ~-~g------------------~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~gg~~~~~~~~~ 406 (406) T protein:vir:97 357 E-LG------------------KQKSTDPNMDRYQSSLNYVFLDKKEEYQDKVGIKGKGGEVNAEEDKS 406 (406) T ss_pred H-hC------------------CCCCCCCCCCeEeeccCccchhcccccccccccccCCCCCCCCCCCC Confidence 1 22 23333333211 1111 00000 0000000 11111111111111 No 174 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=90.40 E-value=0.021 Score=29.77 Aligned_cols=379 Identities=13% Similarity=0.118 Sum_probs=161.8 Q ss_pred CCccccce-eeeccc----cc------cccCCCCCCC---CCcccceeeccccccc----ccchhhhhhHHHHHHHHHHh Q lcl|NC_021072. 1 MSNQLFGF-SLERAK----KV------PKGPSFVQKD---SMDGSQPIVGGGYYGY----SVDFDGTVRNEYELITRYRE 62 (533) Q Consensus 1 ~~~~~fg~-~i~~~~----~~------~~~~s~~~~~---~~dg~~~~~~~~~~~~----~~~~~~~~~~~~~LI~~YR~ 62 (533) .-+-|||+ +|-.-+ .. +.-.++..|. +.........+++.|+ ..+....+. . +. T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t-~-------~~ 74 (409) T protein:vir:83 3 FWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQ-D-------KL 74 (409) T ss_pred hhhhhcccccCCCcccccccccccCCCCceeeccCCCcchhhhhcccccccccccccccccccCccccc-h-------hh Confidence 55778886 111100 00 0011111111 1111111111112111 111111111 1 44 Q ss_pred hhhcchhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHh-cchhhhhHHH----HhhhhcCce Q lcl|NC_021072. 63 MVLQPECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLL-DFENRSYEIF----RRWYVDGRL 137 (533) Q Consensus 63 m~~~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL-~f~~~~~~~f----R~WYvDGri 137 (533) +++.|-|..||+-|.+.+-- .|+.+- .+.... ++..++++.- |-...+.++. ..+.+ |.- T Consensus 75 ~~~~~~v~acV~~Ia~~iA~-----lpl~~~-~~~~~~--------~~~~~ll~~~PN~~~t~~~f~~~l~~~lll-Gna 139 (409) T protein:vir:83 75 RTLIDVAWACIDLNASVLSS-----MPIYRM-RNGRII--------DSVAWMSNPDPEVYTSWQEFAKQLFWDFQL-GEA 139 (409) T ss_pred HhhhHHHHHHHHHHHHhhcc-----CceEEe-eCCccc--------cchhhhcccCCCCCCCHHHHHHHHHHHHhh-CCc Confidence 66789999999999887432 344432 111111 1222222211 1113344433 44555 888 Q ss_pred eeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhh Q lcl|NC_021072. 138 FYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVT 217 (533) Q Consensus 138 ~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~ 217 (533) |+.+|.. +..+-+++|..|+|..+..... .++...+ .+.. .+.+...++|+.-... T Consensus 140 y~~~i~r--~~~G~~~~L~pl~p~~v~v~~~-----~~g~~~y--------------~~~~---~~~~~eiiHir~~~~~ 195 (409) T protein:vir:83 140 FVLPMAH--GSDGYPIRFRVVPPWLVNVELK-----KGARREY--------------RIGG---LNVTDEILHIRYQGNT 195 (409) T ss_pred EEEEEEE--CCCCcEEEEEEECCcceEEEEc-----CCceEEE--------------EEcc---ccCccceEEeCCCCCC Confidence 8887653 2345589999999998864221 1111111 1110 0112233444322111 Q ss_pred ccccccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhc--CccceEEEccCCCCchHHHHHHHHHHHHhcccEEEe Q lcl|NC_021072. 218 YCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSR--APERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVY 295 (533) Q Consensus 218 y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~R--APeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vY 295 (533) ++..++|-|+.|...+..-...++... +.+. |--.-|...| +.|.+.++++..+.....|.. T Consensus 196 ---------~~~~G~spi~~~~~~i~~~~a~~~~~~--~~f~nga~p~gil~~~-~~ls~e~~~~~~~~~~~~~~~---- 259 (409) T protein:vir:83 196 ---------ADAHGHGPLESAAPRQVVIGLLQKYVQ--NLAETGGVPLYWLGVE-RRLSETEAVDLMDRWIESRSK---- 259 (409) T ss_pred ---------CCcccccHHHHHHHHHHHHHHHHHHHH--HHHhcCCCcceEeecC-CCCCHHHHHHHHHHHHHhhCC---- Confidence 122456888888888887777777543 3333 2223344444 456666676665555443321 Q ss_pred eCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCC---cccccch Q lcl|NC_021072. 296 DANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETET---TFNIGRA 372 (533) Q Consensus 296 d~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~---~~~~g~~ 372 (533) ..|.+ + +++ +|+.-++..++.. ..+.-++-.+|-.+..-++.+||...|+-.+ .....+. T Consensus 260 --nag~~------~-il~-------~g~~~~~~~~~s~-~d~q~le~r~~~~~eIa~~fgVPp~llg~~~~~~~~tysn~ 322 (409) T protein:vir:83 260 --YAGHP------A-LVT-------GGATLNQAKSMSA-QDLSLMELTQFNEARIAILLGVPPFLVGLPGATGSLTYSNI 322 (409) T ss_pred --ccCcc------c-eec-------CCcccccccCCCH-HHHHHHHHHHhhHHHHHHHhCCCHHHccCCCCccccccccH Confidence 22321 1 222 1222122222211 1111122234455668899999999996432 2223333 Q ss_pred hhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021072. 373 AEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTM 452 (533) Q Consensus 373 ~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~ 452 (533) ++..+. |.++ .|+--+. .+.+.|-..|+..| +. +.|. +.++...++ ..|.+.++.+ T Consensus 323 eq~~~~---f~~~--tL~P~~~-~ie~~l~~~Ll~~~-----------~~--~~f~----~~~llr~d~-~~r~~~~~~~ 378 (409) T protein:vir:83 323 EQLFSF---HDRS--SLRPKAT-AVMAALDRWALPSP-----------QH--LELN----RDDYTRPSL-VERATAYKIM 378 (409) T ss_pred HHHHHH---HHHH--HHHHHHH-HHHHHHHHhhCCCC-----------cE--EEee----hhhhhccCH-HHHHHHHHHH Confidence 444444 5442 3333222 23444444554333 12 4454 334444443 4677776655 Q ss_pred hhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCc Q lcl|NC_021072. 453 DPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDM 515 (533) Q Consensus 453 ~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~ 515 (533) -.- -++|.+-+++. ++| + |. ++++.-...|+ T Consensus 379 ~~~--G~lT~NE~R~~-~gl---------------------p-p~-------~ggd~l~~~gv 409 (409) T protein:vir:83 379 IEA--GVMEPNEARAM-ERL---------------------H-SE-------AAAVRLSGGGV 409 (409) T ss_pred HhC--CCcCHHHHHHH-hCC---------------------C-CC-------CCCcccCCCCC Confidence 432 34454444421 222 1 11 11111122222 No 175 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=89.60 E-value=0.025 Score=29.33 Aligned_cols=377 Identities=10% Similarity=0.049 Sum_probs=186.8 Q ss_pred Cccccee--e---------cccccccccch----hhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcceeeecCCCceEEE Q lcl|NC_021072. 28 MDGSQPI--V---------GGGYYGYSVDF----DGTVRNEYELITRYREMVLQPECDSAVDDIVNETICGNFDDVPVEV 92 (533) Q Consensus 28 ~dg~~~~--~---------~~~~~~~~~~~----~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvneaiv~d~~~~~v~v 92 (533) +|-.... . ......||.+- +-++.-..++-..||... +=+.-+|+-++.-+.+ ++ . T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~--nw~~~iVds~a~rl~~---~G----f 71 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHKRRAEMRYDQYAMKYVDRFKGITIPQALSQQYRSIL--GWCAKGVDSLADRLVF---RE----F 71 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHhcccCchhhcChhhhHHHHHHHhhhc--chhHHHHHHhHhhccc---Cc----c Confidence 1100000 0 00000111000 000000111112222111 1112222222221111 11 0 Q ss_pred EeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCC Q lcl|NC_021072. 93 ELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQK 172 (533) Q Consensus 93 ~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~ 172 (533) ..+ .. +...+...=+|+....+.++.=++.|+-|.- |.-. ..|-..++.++|+.+--+.+ . T Consensus 72 ~~~----d~--------~l~~i~~~N~ld~~~~~~~~~aliyG~sf~~-v~~~---~dg~~~i~~~sp~~~~~i~D---~ 132 (409) T protein:vir:94 72 END----DF--------TVNEIFEENNPDIFFDSAVLSSLIASCSFTY-ISKG---ENDAVRLQVIEAVNATGIID---P 132 (409) T ss_pred cCC----ch--------HHHHHHHhcChhHHHHHHHHHHHHhcceeEE-EecC---CCCceEEEEeccceEEEEEe---c Confidence 000 11 2345666667888999999999999996654 3421 24456788889987743322 2 Q ss_pred CcCceeE-Eecc-ceeeccchhceeccccccc----------cccCCcceeccchhhccccccccCCCCccchhHHHHHH Q lcl|NC_021072. 173 RPEQLRG-EDIN-TQLTQKAAEYYLYNPKGLK----------NSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIK 240 (533) Q Consensus 173 ~~~~~~~-~~~~-~~~~~~~~e~~~y~p~~~~----------~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK 240 (533) ......+ .... +.-.+......+|-|.... ..+|..-++| .+.|++..-.+ ...+.|-+-..+. T Consensus 133 ~~~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~g~vP--vV~f~n~~~~~--~~~G~s~I~e~v~ 208 (409) T protein:vir:94 133 ITGLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGHPL--LVPIIHRPDAV--RPFGRSRITRSGM 208 (409) T ss_pred CCCceeeeEEEEEecCCCceEEEEEEecCcEEEEEecCceeEeeeCCCCCcc--eEEeccccccc--cccCccccchhHH Confidence 2111111 1000 0001111111123232111 0111111222 12232211111 1122333322222 Q ss_pred H-HHH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhccc Q lcl|NC_021072. 241 A-VNQ-LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLP 318 (533) Q Consensus 241 ~-~Nq-Lrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLp 318 (533) + .+- -+.+.+.++.=...=.|.|-++=+|...-|..+-..++ -.=..+| T Consensus 209 ~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~-----------------------------~~i~~~~ 259 (409) T protein:vir:94 209 YWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPMETWKATV-----------------------------SSMLQFT 259 (409) T ss_pred HHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCcccchhhhhH-----------------------------HHhhcCC Confidence 1 121 25678889999999999998886654222222111111 1113356 Q ss_pred ccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHH Q lcl|NC_021072. 319 RREGGRGTEISTLPGGQNLG-ELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELF 397 (533) Q Consensus 319 RReggrgTEIsTLpGg~nLg-ei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if 397 (533) .-+.|-+.+|..+|++. |+ =++-++=.-..+....++|.+-|+..+. |-..+..|.-.|....+-+.|-|+.|..-+ T Consensus 260 ~d~dg~~~~v~q~~~~~-l~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~-NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~ 337 (409) T protein:vir:94 260 KDEDGDKPTLGQFTQPS-MSPFTEQLRTAAAGFAGETGLTLDDLGFVSD-NPSSVEAIKASHENLRLAGRKAQRSLGAGL 337 (409) T ss_pred CCCCCCCceEEecCCCC-hhHHHHHHHHHHHHHhhhcCCCHHHhccccC-chhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66667778898898866 43 3455566666777788999888876543 322344577777778888999999999999 Q ss_pred HHHHHHHHHhccCCC--HhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHH Q lcl|NC_021072. 398 MDLLKTQLILKGVMS--LEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQ 475 (533) Q Consensus 398 ~d~Lk~qLilkgi~t--~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDe 475 (533) .+++|.-+.+.|-.. +++|. .+.+.|. +++-++ +-.+.++.+.+..+..-+=.+.+.+.++.. |++|+. T Consensus 338 ~~~~rla~~i~~~~~~~~~~~~----~~~v~W~-p~~~~~---~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~-lG~~~~ 408 (409) T protein:vir:94 338 LNVAYLAACLRDDAPYLREQFR----KTKPKWE-PLFEAD---ASMLSLIGDGAIKLNQAIPEFINKDTIRDL-TGIEGG 408 (409) T ss_pred HHHHHHHHHHhCCCCccccccc----cceEEec-cCCCcc---hHHHHHHHHHHHHHHHhcccccchhHHHHH-cCCCCC Confidence 999998776655432 34443 3677776 333333 334567777787777653355666777654 999999 Q ss_pred H Q lcl|NC_021072. 476 E 476 (533) Q Consensus 476 e 476 (533) | T Consensus 409 d 409 (409) T protein:vir:94 409 E 409 (409) T ss_pred C Confidence 9 No 176 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=89.42 E-value=0.026 Score=29.23 Aligned_cols=414 Identities=11% Similarity=0.120 Sum_probs=173.5 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhh--------------hc Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMV--------------LQ 66 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~--------------~~ 66 (533) ...++|=++..++ .. +. .+.. +.. ...... .+.+.+..|-.-. .+ T Consensus 4 ~~~~~~~~~~~~~-~~-------------~~--~i~~-~i~---~~~~~~-~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ 62 (453) T protein:vir:73 4 KPIKLMTYSRDEE-IT-------------DK--VVND-FMK---KHQEEV-ERYEYLGNMYKGIMEISSQKAKDSWKPDN 62 (453) T ss_pred ccceeeecccccc-CC-------------HH--HHHH-HHH---HHHHHH-HHHHHHHHHhccccchhcCCCCCccCccc Confidence 2233333332210 00 00 0000 000 000000 1111112221110 00 Q ss_pred chhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCC Q lcl|NC_021072. 67 PECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPK 146 (533) Q Consensus 67 pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~ 146 (533) --+.+-..-||+... .=.-+.|+.+..++ +.. .+.+..+++--+|.....+..+..++-|+-|.+.-.|. T Consensus 63 ki~~n~~~~ivd~~~-~~l~g~~~~~~~~d----~~~----~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~- 132 (453) T protein:vir:73 63 RLTNNFAKYIVDTFV-GYFNGIPIKKTHDD----KSV----LEAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNE- 132 (453) T ss_pred eeecchHHHHHHHhh-hhhcccCceeecCC----hHH----HHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCC- Confidence 001111222222211 11234566665443 222 33455566666899999999999999999999877763 Q ss_pred CCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccce-eeccchhceecccccccc-------------ccCCcceec Q lcl|NC_021072. 147 NPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQ-LTQKAAEYYLYNPKGLKN-------------STNQGMKIA 212 (533) Q Consensus 147 ~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~-~~~~~~e~~~y~p~~~~~-------------~~~~~~kI~ 212 (533) .|-..+..++|+.+-++..-.. ........... .........+|.+..... .+|..-+|| T Consensus 133 ---~~~~~i~~~~p~~~~~v~dd~~---~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~vP 206 (453) T protein:vir:73 133 ---STESEVIYCSPLNVFMVYDDSI---KQKPLFAVYYGFDEEGNLSGTVYTLLETISITGKAGEVKFGESTYNVYSDLP 206 (453) T ss_pred ---CCceEEEEEcccceEEEEeCCC---CceeEEEEEEEEecCceEEEEEEeCCeEEEEEecCCceEEccceeccCCcee Confidence 4566788899999976553321 11111100000 000111122333322110 111111222 Q ss_pred cchhhccccccccCCCCccchhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcc- Q lcl|NC_021072. 213 TDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYR- 290 (533) Q Consensus 213 ~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLr-m~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~r- 290 (533) .-. | +++....|-++..+.....+. ++=+....-+.++.|-+-+.-. +++. + -++..+ T Consensus 207 vv~--~-------~n~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~---~~~~----~----~~~~~~~ 266 (453) T protein:vir:73 207 IVE--Y-------NFNEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGA---EVDE----E----DAKNIKD 266 (453) T ss_pred EEE--e-------cCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecC---CCCc----h----hhhcccc Confidence 211 1 123334455555555444433 2344444456667777655422 1111 1 111111 Q ss_pred cEEEeeCCCCccccccccchhHhhhcccccCC--CCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcc Q lcl|NC_021072. 291 NKLVYDANTGEIKDDKKFMSMLEDFWLPRREG--GRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTF 367 (533) Q Consensus 291 nk~vYd~~TGev~~d~~~msmlEDywLpRReg--grgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~~~~~ 367 (533) ++.. -+.+.. +...| +.+.++.+|-...+.+.+ .-+.-+.+.+|...++|- +..++ T Consensus 267 ~~~~----------------~~~~~~-~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~-- 325 (453) T protein:vir:73 267 NRLI----------------NFFDKN-SNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAAN--ISDEN-- 325 (453) T ss_pred cccc----------------cccccc-cccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcc--cCccc-- Confidence 1110 000100 11111 122346655444344333 335667788889999984 33222 Q ss_pred cccchh--hhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-CHhHHhhhhhceeEEEeccchHHHHHHHHHHHH Q lcl|NC_021072. 368 NIGRAA--EITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVM-SLEEWDEMKEHIQFDFIADNYFTELKEIEIRNE 444 (533) Q Consensus 368 ~~g~~~--eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~-t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~ 444 (533) +|++| .|.--+.....-+.+.++.|..-+.+.++.=+-+.+.. ...+| ..|.+.|...---.+ .+ T Consensus 326 -~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~----~~i~v~f~~~~p~~~-------~~ 393 (453) T protein:vir:73 326 -FGNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNASNKDAW----KDIEYTFTRNEPKDI-------KE 393 (453) T ss_pred -ccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccc----ccceEEeCCCCCCCH-------HH Confidence 24333 34333444445577777777777777766533332221 22233 357788864433233 23 Q ss_pred HHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccc Q lcl|NC_021072. 445 RMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSA 517 (533) Q Consensus 445 R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~ 517 (533) .+++++.+. | .+|.++++.. |...++ .+++.++|++|+.+.+-. ++. ++. -.+..+.+++ T Consensus 394 ~a~~~~k~~---g-iis~et~~~~-~~~~~d-~~~E~~ri~~E~~~~~~~----~~~-~~~--~~~~~~~~~~ 453 (453) T protein:vir:73 394 QAETANILK---G-ITSEETALSV-ISVIPD-VQAEMEKIKKKKLLQLSL----TRT-SNL--VRMKQMRGNL 453 (453) T ss_pred HHHHHHHHh---c-cCcHHHHHHh-CCCCCC-HHHHHHHHHHHHHHHHHH----HHh-ccC--CcchhhhcCC Confidence 444555553 4 5999999976 555432 233334444444433211 000 000 0111111111 No 177 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=89.06 E-value=0.029 Score=29.05 Aligned_cols=404 Identities=16% Similarity=0.222 Sum_probs=170.8 Q ss_pred CCccccceeeec-c----cccc--ccCCCCC--CCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhh Q lcl|NC_021072. 1 MSNQLFGFSLER-A----KKVP--KGPSFVQ--KDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDS 71 (533) Q Consensus 1 ~~~~~fg~~i~~-~----~~~~--~~~s~~~--~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~ 71 (533) |-.+=.-..+.- . +.+. ++..... .....|. +..+++.+..+ +. . +..+++|-|.. T Consensus 1 ~~~~~~~~~~~~~~g~~~~~~~~f~~~~~~~~~~~~~~~~--~~~~~~~~~~~-----v~-~-------~~al~~~~v~~ 65 (424) T protein:vir:18 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGP--VSAHGYLGDSS-----IN-D-------ERILQISTVWR 65 (424) T ss_pred CCCCccccccCCCCchHHHHHhhccccccccccchhhccc--ccccccccccc-----cc-H-------HHhhccHHHHH Confidence 211111111100 0 0000 1111111 1111111 11222222110 11 1 34567788999 Q ss_pred HHHHhhcceeeecCCCceEEEE-eccCCCcHHHHHHHHHHHHHHHHHhcc----hhhhhHHHH----hhhhcCceeeeee Q lcl|NC_021072. 72 AVDDIVNETICGNFDDVPVEVE-LSNLKQSDKIKKLIREEFAEILRLLDF----ENRSYEIFR----RWYVDGRLFYHKV 142 (533) Q Consensus 72 AvdeIvneaiv~d~~~~~v~v~-l~~~~~S~~ik~~I~eeF~~i~~lL~f----~~~~~~~fR----~WYvDGri~~hkv 142 (533) ||+-|.+.+-. .|+.|- ...-+..+.+. . -..++++|+. ...+.++.+ .++..|.-|.-++ T Consensus 66 cv~~Ia~~iA~-----lp~~vy~~~~~~~~~~~~--~---~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~ 135 (424) T protein:vir:18 66 CVSLISTLTAC-----LPLDVFETDQNDNRKKVD--L---SNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD 135 (424) T ss_pred HHHHHHHhhcc-----CceEEEEeccCCceeeec--c---ccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEE Confidence 99999988654 344442 11111111110 0 1124455543 235555444 4567799988765 Q ss_pred ecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccc Q lcl|NC_021072. 143 IDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSG 222 (533) Q Consensus 143 id~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsG 222 (533) =+ ..+-+++|.+|+|..+...+ ..+... |.|...| +...++.+-|.+.. + T Consensus 136 r~---~~G~~~~L~~l~~~~v~v~~-----~~~~~~---------------y~~~~~g------~~~~~~~~eVihir-~ 185 (424) T protein:vir:18 136 RN---SAGDVISLLPLQSANMDVKL-----VGKKVV---------------YRYQRDS------EYADFSQKEIFHLK-G 185 (424) T ss_pred EC---CCCcEEEEEEecCcceEEEE-----cCCeEE---------------EEEEeCC------eEEEeccccEEEec-C Confidence 44 34559999999999996421 111111 1111111 11122222222111 0 Q ss_pred cccCCCCccchhHHHHHHHHHHHHHHHHHHHH-HHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCc Q lcl|NC_021072. 223 IQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVI-YRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGE 301 (533) Q Consensus 223 l~d~~~~~i~syL~~AiK~~NqLrm~EDalVI-yRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGe 301 (533) ...++-.++|-|..|..++.....+++...= |.-.-.| +-+...+-+.+.+..+++ +++.+.++..- ...|. T Consensus 186 -~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~-~gil~~~~~~l~~e~~~~-~~~~~~~~~~~----~nag~ 258 (424) T protein:vir:18 186 -FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKS-PQILSTGEKVLTEQQRSQ-VEENFKEIAGG----PVKKR 258 (424) T ss_pred -cCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCc-ceEEEeCCcCCCHHHHHH-HHHHHHHHhCC----cccCC Confidence 0122335678899999998887777775543 3322233 456666666566555443 44444443210 01121 Q ss_pred cccccccchhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhh Q lcl|NC_021072. 302 IKDDKKFMSMLEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEV 380 (533) Q Consensus 302 v~~d~~~msmlEDywLpRReggrgTEIsTLpGg-~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDEl 380 (533) .+ .++ .|.+++.|.=. +.+.-++--+|....+.++++||.+.|+..++-+.. ++.+.-.-+ T Consensus 259 ------~~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~-~sn~eq~~~ 320 (424) T protein:vir:18 259 ------LW-ILE----------AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSW-GSGIEQQNL 320 (424) T ss_pred ------ce-ecc----------CCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccc-cccHHHHHH Confidence 11 221 14556555321 222223444577788999999999999654332221 112222323 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccc Q lcl|NC_021072. 381 KFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYF 460 (533) Q Consensus 381 kF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~ 460 (533) .|.+++ |+- +...+.+.|-.. +++++++. ...+.|. +..+...+ ...|.+.+..+-.- -++ T Consensus 321 ~f~~~t--l~P-~~~~ie~~ln~~-----L~~~~~~~----~~~~~fd----~~~llr~d-~~~r~~~~~~~~~~--G~~ 381 (424) T protein:vir:18 321 GFLQYT--LQP-YISRWENSIQRW-----LIPSKDVG----RLHAEHN----LDGLLRGD-SASRAAFMKAMGES--GLR 381 (424) T ss_pred HHHHHH--HHH-HHHHHHHHHHhh-----cCCccccC----CeEEEEe----chhhhccC-HHHHHHHHHHHHhC--CCc Confidence 355442 322 122234444443 34555553 3455565 33333222 24566666655321 355 Q ss_pred cHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccccccCCccc Q lcl|NC_021072. 461 SIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPAVDAGD 527 (533) Q Consensus 461 S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~ 527 (533) +.+-++.. ++|.+ + |+.++ -.-..+..|.++...+..|+ +++. T Consensus 382 T~NE~R~~-~gl~p------------------i--~ggD~--~~~~~n~~~l~~~~~~~~~~-~n~a 424 (424) T protein:vir:18 382 TINEMRRT-DNMPP------------------L--PGGDV--AMRQAQYVPITDLGTNKEPR-NNGA 424 (424) T ss_pred CHHHHHHH-hCCCC------------------C--CCcCe--eeeccCccchhhhhccCCcc-ccCC Confidence 55555532 33321 1 21110 00011111111111111111 1111 No 178 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=88.69 E-value=0.031 Score=28.87 Aligned_cols=442 Identities=12% Similarity=0.049 Sum_probs=155.9 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhh---------- Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECD---------- 70 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd---------- 70 (533) |..+|+.-.|...+.. +.-...... .-..+.+|+.+..+++=. T Consensus 1 ~~~~~~~~~~~~~~~~----------------------~~~~i~~~~-----~~~~~~~~~~~~~YY~g~h~Il~r~~~~ 53 (537) T protein:vir:78 1 MTSPLLNKPIDQLGGL----------------------LNTEITTYM-----ASNHIKWAHIGENYYNQENDIEKSRIFY 53 (537) T ss_pred CCcccccccHHHHHHH----------------------HHHHHHHHH-----HHHHHHHHHHHHHHhcccchhhhccccc Confidence 5555555444332111 100000000 001122233222222222 Q ss_pred --------------------hHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHh Q lcl|NC_021072. 71 --------------------SAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRR 130 (533) Q Consensus 71 --------------------~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~ 130 (533) +=...||+-.+-+ .=+.||.+.+.+.+.++ +.+.+..+++ -+|++...++.+. T Consensus 54 ~~~~~~~~~d~~~~nnki~~nf~k~Ivd~~~~y-l~G~Pv~~~~~d~~~~e-----~~~~l~~~~~-~~~~~~~~el~~~ 126 (537) T protein:vir:78 54 MNDKGQLREDNYASNVKISHGFFTELVDQLAQY-LLSNGVEVKVKDEDNTQ-----LDEILQEYFD-EDFQATIDTLVTN 126 (537) T ss_pred ccccccccccccccccccccchHHHHHHHHhhh-hcccCceeecCcchhHH-----HHHHHHHHhh-ccHHHHHHHHHHH Confidence 1122233322211 23678888765532222 2233443333 3588888999999 Q ss_pred hhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeec----c-chhceecccccccc-- Q lcl|NC_021072. 131 WYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQ----K-AAEYYLYNPKGLKN-- 203 (533) Q Consensus 131 WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~----~-~~e~~~y~p~~~~~-- 203 (533) +.+-|+-|.|.=+|. .|-..+..+||+.+=+|..-..+..--.+++........ . ..-..+|.+..... T Consensus 127 ~s~~G~ay~~~y~de----~~~~~~~~i~p~~~~pv~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~ 202 (537) T protein:vir:78 127 ASKKGFEGIFARTTS----EGKLKFQTVDGLTLIPVFDDYGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYI 202 (537) T ss_pred HhhcCeeEEEeeecC----CCceEEEEEccceeEEEEcCCCCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEE Confidence 999999999877763 466789999999986654322211111111111000000 0 00011222221110 Q ss_pred --------ccCCccee---------------ccchhhccc-------ccccc----CCCCccchhHHHHHHHHHHHHHHH Q lcl|NC_021072. 204 --------STNQGMKI---------------ATDSVTYCH-------SGIQD----LNKNMTLSHLHKAIKAVNQLRMIE 249 (533) Q Consensus 204 --------~~~~~~kI---------------~~dai~y~h-------sGl~d----~~~~~i~syL~~AiK~~NqLrm~E 249 (533) .......+ ..+...+.. -|.++ +++....|=|+..+.....+.++= T Consensus 203 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~ 282 (537) T protein:vir:78 203 QDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMN 282 (537) T ss_pred ecCCcccccccccccccccccceeeeccccccccccccccccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHH Confidence 00000000 000000000 01111 122334455666555544443211 Q ss_pred -HHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccce Q lcl|NC_021072. 250 -DSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEI 328 (533) Q Consensus 250 -DalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEI 328 (533) +..-...-+.-|---+. + .+|+ +...++.-+..+.+..-.| .|..+ T Consensus 283 S~~an~~~~~~~~ilvi~-----------------------------g-~~~~--~~~~~~~~l~~~~~i~v~~-d~~~v 329 (537) T protein:vir:78 283 CFLSNNLQDFSEAIYVVK-----------------------------G-FSGD--STDKLRQNIKAKKMIGVNG-DNAGM 329 (537) T ss_pred HhhhhHHHHhcCceeeee-----------------------------c-CCCc--cchhHHHHHhhcCceeecC-CCCce Confidence 11111112222211111 1 1111 1111222222332211111 11223 Q ss_pred eecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHH---HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 329 STLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQ---KFIARLRKRFSELFMDLLKTQ 404 (533) Q Consensus 329 sTLpGg~nLge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~---Kfi~rLr~~fs~if~d~Lk~q 404 (533) ..|--..+..- -.-+.-..+.+|+-..+|-. ... .+|++|..... .+|. .-+.+.++.|...|...|+.= T Consensus 330 ~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~--~~~---~~gn~SGvAlk-~~~~~l~~ka~~ke~~f~~~l~~~~~~i 403 (537) T protein:vir:78 330 EIQTVSIPYEARKAKMDIDVENIYRSGMGFNS--TAV---GDGNVTNVVIK-SRYTLLAMKARKMETSLRKVLRWCADMV 403 (537) T ss_pred eEEEecCCHHHHHHHHHHHHHHHHHhcCCCCC--ccc---cccCCcHHHHH-HHHhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 33332222211 12244455566666556532 221 12433332211 1111 123444444444444443332 Q ss_pred HHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHH Q lcl|NC_021072. 405 LILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQI 484 (533) Q Consensus 405 Lilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi 484 (533) +-+-++....+|+ ...|.+.|.+.---.+...++++ ..+. .+-.+|.+++++. +.+.|+. |..+++ T Consensus 404 ~~~~~~~~~~~~d--~~~i~i~f~~~~P~n~~e~a~~~-------~~l~--~~giiS~eT~l~~-~p~vdd~--e~ek~~ 469 (537) T protein:vir:78 404 VSDIALRGLGEYD--SNDICFEIEPHVLANELDIATTR-------KTEA--ETEALKIGNIMTV-APRIGDD--ETLKLI 469 (537) T ss_pred HHHHhhcCCcccc--cceeeEEeccCCCCCHHHHHHHH-------HHHH--hcCcchHHHHHHh-CCCCCCH--HHHHHH Confidence 2221222222333 23578888865444443323222 2221 2347899999976 5654432 111222 Q ss_pred HHhhhc-----------------CCCCCCCcccccCCCCCCCCCCCCcccccccc-CCccccchhcC Q lcl|NC_021072. 485 DSEREA-----------------GLIVDPMAEMDPAMDPGNAPPADDMSAQEGPA-VDAGDAKRGEF 533 (533) Q Consensus 485 ~~E~~~-----------------~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~-~~~~~~~~~~~ 533 (533) ++|... ...++++...++.....+.+|.|+.+....|. ++++|..++-. T Consensus 470 ~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 536 (537) T protein:vir:78 470 AEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQPVADPNVVPPTDPNAVPQ 536 (537) T ss_pred HHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccCCCCCCCCCCCCCCccCCC Confidence 222111 01111111111111111222333222221111 22222222222 No 179 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=88.30 E-value=0.033 Score=28.69 Aligned_cols=342 Identities=14% Similarity=0.169 Sum_probs=141.2 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |+. |+ .+++ ..+..+. +--......++...+.++.- +...++|-|.++|+-|.+.+ T Consensus 1 M~~--~~-~f~~------r~~~~~~-~~~~~~~~~~~~~~~~~v~~--------------~~al~~~av~~cv~~ia~~i 56 (359) T protein:vir:10 1 MSI--LN-PFER------RSSITPN-NYYPFMVQNGSIVPNSLVDA--------------TEALKNSDLYAVTSLISSDI 56 (359) T ss_pred Ccc--cc-hhhc------cccCCCC-cchhhhhccccccCCcccCH--------------HHhhcchHHHHHHHHHHHhh Confidence 653 33 0111 1111111 11100111111222222211 11246788999999998764 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHh----hhhcCceeeeeeecCCCCCCCeEEEE Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRR----WYVDGRLFYHKVIDPKNPRGGLTELR 156 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~----WYvDGri~~hkvid~~~~~~gI~elr 156 (533) -- .|+. . ..+ ...++..=|-...+.++.+. ++..|.-|..++-|. .+-+++|. T Consensus 57 a~-----~p~~----~----~~~-------~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~---~g~~~~l~ 113 (359) T protein:vir:10 57 AG-----TRFI----G----NQV-------FTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGD---NSLMKELR 113 (359) T ss_pred hc-----Cccc----c----chH-------HHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECC---CCeEEEEE Confidence 31 2220 0 111 22223333333455555444 456799998866553 44589999 Q ss_pred EcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCccchhHH Q lcl|NC_021072. 157 YIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLH 236 (533) Q Consensus 157 ~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~ 236 (533) +++|..+.... ..+...+... ....+. ...+.....++|..-+. +.-..++-.++|.|+ T Consensus 114 ~l~~~~v~i~~-----~~~~~~y~~~--~~~~~~---------~~~~~~~evih~~~~~~-----~~~~~dg~~G~spi~ 172 (359) T protein:vir:10 114 LIPSNAITIDL-----TDDTLTYEVN--QFDDYP---------SAKYNASEMIHVKIMAY-----GVDTLHNLVGHSPLE 172 (359) T ss_pred EeCCceEEEEE-----cCCeEEEEEE--ecCCce---------EEEEcccceEEeccCCC-----CCCccCccccccHHH Confidence 99999886421 1111111100 000000 01111222233322111 111123335679999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhc Q lcl|NC_021072. 237 KAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFW 316 (533) Q Consensus 237 ~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDyw 316 (533) .|.+++.....+++...-+=-.=+--+-+..++-|++.+..+++. ++...++.. - ...|++- .++ T Consensus 173 ~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~-~~~~~~~~~--~--~n~g~~~-------vl~--- 237 (359) T protein:vir:10 173 SLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSI-RKEFEKANG--G--NNSGRVM-------VLD--- 237 (359) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHH-HHHHHHHhC--c--cccCCce-------ecC--- Confidence 999999988888876543322223345677777777766554443 333333321 0 1122221 221 Q ss_pred ccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHH Q lcl|NC_021072. 317 LPRREGGRGTEISTLPGGQNLGE---LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRF 393 (533) Q Consensus 317 LpRReggrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~f 393 (533) .|.+++.|. .+.-+ ++-.+|-...+.++++||...|+..+.-+ ...+.+ |-.+..|+...-..+ T Consensus 238 -------~g~~~~~l~--~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~-~~~~~~---e~~~~~~l~~~l~p~ 304 (359) T protein:vir:10 238 -------QSADFSTVS--INADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQ-SSLDQI---KDLYVNALNRFIEPL 304 (359) T ss_pred -------CCcceeeec--CCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCccc-ccHHHH---HHHHHHHHHHHHHHH Confidence 245666553 33333 23455667889999999999996432211 112222 112333322111111 Q ss_pred HHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHH----HHH------Hhhhhc Q lcl|NC_021072. 394 SELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMN----QVN------TMDPYV 456 (533) Q Consensus 394 s~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~----~~~------~~~~~v 456 (533) ..-+...|-+++-+ + ....+. |. .++...++. ..++ +.. .+.|+. T Consensus 305 ~~~l~~~l~~~~~~---------~-~~~~~~--~d-----~~~~~~~~~-~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 305 ISELRIKCDSSIGV---------D-MSPITD--YS-----NSVFKADIL-NWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred HHHHHHHhhhhhcc---------c-chhhhh--cC-----HHHHHHHHH-HHHhCCCcCHHHHHHHhCCCCCC Confidence 11111112111110 0 000011 11 111111111 1111 111 223333 No 180 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=88.25 E-value=0.033 Score=28.67 Aligned_cols=423 Identities=12% Similarity=0.048 Sum_probs=198.2 Q ss_pred cccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhc-------------------chhhhHHHH Q lcl|NC_021072. 15 KVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQ-------------------PECDSAVDD 75 (533) Q Consensus 15 ~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~-------------------pEvd~Avde 75 (533) ..+-.+..+|--.++-...+ ......-..-+.+..++..|-+.... +=+.-+|+- T Consensus 1 ~~~~~~~~~~gl~~~~~~~~------~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw~~~~Vd~ 74 (474) T protein:vir:81 1 MIQQQTVRIPSLSNDENALI------NGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVLGWTGKAVDA 74 (474) T ss_pred CcCCCcCcCCCCChhHHHHH------HHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHHHHHHhhcChHHHHHHH Confidence 11101100110000000000 00000000001122222222221110 111122222 Q ss_pred hhcceeeecCCCceEEEEe-ccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEE Q lcl|NC_021072. 76 IVNETICGNFDDVPVEVEL-SNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTE 154 (533) Q Consensus 76 Ivneaiv~d~~~~~v~v~l-~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~e 154 (533) .++=+.+ ++ ..+ +..+.+.. ...+...=+|+....+.++.=++.||-|.- |-.....++... T Consensus 75 ~a~rl~~---~G----f~~~d~~~~~~~--------l~~iw~~N~ld~~~~~~~~~al~~G~sf~~--V~~~~d~~~~~~ 137 (474) T protein:vir:81 75 LARRCNL---EG----FVWPDGDLDSLG--------GTEVVDDNHLLSEIDSAIVAAMQHGPAFLI--NTVGEDDEPEAL 137 (474) T ss_pred HHhhhcc---cc----eECCCCCccchH--------HHHHHHhcChhHHHHHHHHHHHhhCceeEE--EecCCCCCceeE Confidence 2111110 11 111 11111222 234555557888889999999999999854 333345566777 Q ss_pred EEEcChhhceehhhccCCCcCceeE-Eecc-ceeeccchhceeccccccc--------------cccCCcceeccchhhc Q lcl|NC_021072. 155 LRYIDPRKIRKVTEYQQKRPEQLRG-EDIN-TQLTQKAAEYYLYNPKGLK--------------NSTNQGMKIATDSVTY 218 (533) Q Consensus 155 lr~lDP~~i~~vr~~~~~~~~~~~~-~~~~-~~~~~~~~e~~~y~p~~~~--------------~~~~~~~kI~~dai~y 218 (533) ++.++|+.+--+. +.......+ .... ....+......+|-|.... ..+|.. -+| .+.| T Consensus 138 i~~~sp~~~~~~~---D~~~~~~~~al~~~~~~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~~~-gvP--vV~~ 211 (474) T protein:vir:81 138 IHVKDASEATGEW---NRRRRGLNNLLSIIDKDKEGKVLSLALYLDNETVTAQRDKATLKWQVDRDEHVY-GVP--AQVL 211 (474) T ss_pred EEEeccceEEEEE---eCCCCcceeeeEEEEEcCCCcEEEEEEEeCCcEEEEEEcCccceeeeccCCCCC-Ccc--eEEe Confidence 8889998774322 222222111 1000 0001111111223222111 011111 123 3455 Q ss_pred cccccccCCCCccchhH-HHHHHHHHH-HHHHHHHHHHHHHhcCccceEEEccCCCC------chHHHHHHHHHHHHhcc Q lcl|NC_021072. 219 CHSGIQDLNKNMTLSHL-HKAIKAVNQ-LRMIEDSLVIYRLSRAPERRIFYIDVGNL------PKNKAEQYLREVMGRYR 290 (533) Q Consensus 219 ~hsGl~d~~~~~i~syL-~~AiK~~Nq-Lrm~EDalVIyRi~RAPeRrvfyIDvGnl------pk~KAeqYl~~im~~~r 290 (533) ++.--.+. ..+.|-+ +..+...+- -|.+.+.++.=...=.|.|-|+=.+-... |..+-.+|+- T Consensus 212 ~n~~~~~~--~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~------- 282 (474) T protein:vir:81 212 PYKPAPKR--PFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWEARLG------- 282 (474) T ss_pred cccccccC--cCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCChhhcccccccccchhhhhHH------- Confidence 55422211 1222222 122221121 25678888998999999988864332111 1111112221 Q ss_pred cEEEeeCCCCccccccccchhHhhhc-ccccCC-C----CccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCC Q lcl|NC_021072. 291 NKLVYDANTGEIKDDKKFMSMLEDFW-LPRREG-G----RGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLET 363 (533) Q Consensus 291 nk~vYd~~TGev~~d~~~msmlEDyw-LpRReg-g----rgTEIsTLpGg~nLgei-~DV~YF~~kLy~aL~VP~sRl~~ 363 (533) ..| +|.-+. . .+++|..+|+.. |.-. +-++=.-..+-...++|..-|+- T Consensus 283 -----------------------~i~~~~~d~d~~~~~~~~~~~~q~~~a~-l~~~~~~l~~~~~~~a~~t~iP~~~lG~ 338 (474) T protein:vir:81 283 -----------------------RIKGLPDDADADIPQLARADVKQFPAAS-PDAHWSDINGLAKLFAREASLPDTAVAI 338 (474) T ss_pred -----------------------HHhcCCCcccccccccccccccccCCCC-hhHHHHHHHHHHHHHHhhhCCCHHHhcc Confidence 222 333222 1 235677777654 4432 22333344555567999888763 Q ss_pred CCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHH Q lcl|NC_021072. 364 ETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRN 443 (533) Q Consensus 364 ~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~ 443 (533) .+.-|-..+..|.-.|....+-+.+.|+.|..=+.+++|.-+.+.|-...++|..--..+.+.|..-.+ --+. T Consensus 339 ~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~-------~s~a 411 (474) T protein:vir:81 339 SGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDEIPDEWKSIDAKWRDPRY-------LSKS 411 (474) T ss_pred cccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccchhhccceeEecCCCc-------cCHH Confidence 333333345567777888888899999999999999999999998876666655544557777743222 1234 Q ss_pred HHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCC Q lcl|NC_021072. 444 ERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAP 510 (533) Q Consensus 444 ~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~ 510 (533) ++.+.+..+..- |.-+....+..++|++|+.||+.....++++...+.+..- ...+.+.+..+ T Consensus 412 ~~aDa~~Kl~~a-~~~~~~~~~~~~~lg~t~~~i~~~~~~~~~~~~~~~~~~l---~~~~~~~~~aq 474 (474) T protein:vir:81 412 AQADAGMKQLAA-VPWLAETEVGLELIGLTPQQARRAMADKRRVQGRGTLQAL---IDRSNNGATAQ 474 (474) T ss_pred HHHHHHHHHHhc-ccCCCcHHHHHhhcCCCHHHHHHHHHHHHHHhHHHHHHHH---HhcCCCCCCCC Confidence 566666666543 4444445666688999999999877666666555444311 11222222222 No 181 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=85.56 E-value=0.052 Score=27.63 Aligned_cols=408 Identities=13% Similarity=0.130 Sum_probs=172.8 Q ss_pred CCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcceeeecCCCceEEEEeccCCC Q lcl|NC_021072. 20 PSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNETICGNFDDVPVEVELSNLKQ 99 (533) Q Consensus 20 ~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~~ 99 (533) .|+.|..... ......+...++... ..+++|-|..+|+-|.+.+-- .|+.+. ..+ T Consensus 1 ~~~~~~~~g~---------~~~~~~~~~~~~~~~--------~~~~~~~V~acV~~Ia~~iA~-----lpl~l~--~~~- 55 (723) T protein:vir:94 1 MTTFPSGAGG---------WNAWSADSVFGNGAK--------GWSNSAVAYRCISMLANNAAS-----VDLVVR--GPD- 55 (723) T ss_pred CcccccCCCc---------cccccccccccccHH--------HHhhhHHHHHHHHHHHHhhcc-----ceeEEE--cCC- Confidence 2222222211 111111111111111 135779999999988876542 455553 111 Q ss_pred cHHHHHHHHHHHHHHHHHhcc----hhhhhHHHHh----hhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccC Q lcl|NC_021072. 100 SDKIKKLIREEFAEILRLLDF----ENRSYEIFRR----WYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQ 171 (533) Q Consensus 100 S~~ik~~I~eeF~~i~~lL~f----~~~~~~~fR~----WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~ 171 (533) .+ +. +...++.+|+. ...+.++.+. +...|.-|..++.+..+..+-+.+|..++|+....+... T Consensus 56 ~~-~~-----~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~-- 127 (723) T protein:vir:94 56 GE-LD-----ELHPLSQLWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATR-- 127 (723) T ss_pred Cc-cc-----hhhHHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecC-- Confidence 11 11 12345666643 2355555444 557899999988876666677899999999877442221 Q ss_pred CCcCceeEEeccceeeccchhceecc-ccccc--cccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHHHH Q lcl|NC_021072. 172 KRPEQLRGEDINTQLTQKAAEYYLYN-PKGLK--NSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMI 248 (533) Q Consensus 172 ~~~~~~~~~~~~~~~~~~~~e~~~y~-p~~~~--~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~ 248 (533) ..+.+ +....-.|.|. ..|.. ......++|+..+ ..++-.++|-|..|.+++.....+ T Consensus 128 -~~~~~---------~~~~~~~y~~~~~~G~~~~~~~~dIiHir~~~---------~~dg~~G~Spi~~a~~~i~~~~aa 188 (723) T protein:vir:94 128 -AADAV---------PQAQIIGYVIERTDGVRVPVLADEMLWLRFSD---------PYDPLAVMAPWKAARAAVDADFYA 188 (723) T ss_pred -CCccc---------eeeeeeEEEEEecCceeEEecccceEEecCCC---------CCCCcccccHHHHHHHHHHHHHHH Confidence 11111 00111112221 12211 1122333333221 122335679999999999888888 Q ss_pred HHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHh---hhcccccCCCCc Q lcl|NC_021072. 249 EDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLE---DFWLPRREGGRG 325 (533) Q Consensus 249 EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlE---DywLpRReggrg 325 (533) ++.. .++++-=-+--..|-+++|.+..+++..+.+...|.-- .+.|+ .+ +++ .-+...- .| T Consensus 189 ~~~~--~~~f~NG~~p~giL~~~~l~~e~~~~~~~~~~~~~~G~----~Nagk------~~-vL~g~~~~~~vl~---~G 252 (723) T protein:vir:94 189 ATWQ--RQSFKNGARPGGVVNLGDMDEQTFTKTVAAFRSQVEGV----QNAGR------HL-LIAGQGSDGGAAG---KG 252 (723) T ss_pred HHHH--HHHHhcCCCcceEEEcCCCCHHHHHHHHHHHHHHhhch----hhcCc------ce-eeccccccccccc---CC Confidence 7754 33443322322333345677666655555544443210 01121 11 111 1111112 34 Q ss_pred cceeecCCCCCcchHHHH---HHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 326 TEISTLPGGQNLGELEDV---KYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDLLK 402 (533) Q Consensus 326 TEIsTLpGg~nLgei~DV---~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~Lk 402 (533) ++++.|. .+.-++.-+ +|-.+..-++.+||...|...+.++ +..+-. +.|..+ .|+-. ...+.+.|- T Consensus 253 ~~~~~l~--~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~st~s--N~e~~~---~~f~~~--tL~P~-~~~ie~~ln 322 (723) T protein:vir:94 253 ATFTSLS--MSPAEMDYINSRMHSAEEVMLAFGIRKDALLGGSTYE--NQAEAK---AAVWTE--TLIPQ-MEVMASITD 322 (723) T ss_pred ceEEEcc--CCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCCCCcc--cHHHHH---HHHHHH--HHHHH-HHHHHHHHh Confidence 5555553 333333333 3445668899999998886443221 222222 235433 23322 233444555 Q ss_pred HHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHH Q lcl|NC_021072. 403 TQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDK 482 (533) Q Consensus 403 ~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~k 482 (533) ..|+- ... ..+.++|...-.... =+..|.+.+..+-. .-++|.+-++. ++++.+-+ .=+ T Consensus 323 ~~Ll~-----~~g-----~~~~~~f~~~~lLr~-----D~~~r~~~~~~~v~--~G~~T~NE~R~-~lglpPi~--gGd- 381 (723) T protein:vir:94 323 LQLLP-----DIG-----WTVEWDFNSVPALQE-----DLEAQAGRNQGYLV--NDVLMVDEVRA-TIGLDPLP--GGI- 381 (723) T ss_pred Hhhcc-----ccc-----CceEEeecchhhhhc-----CHHHHHHHHHHHHh--CCCcCHHHHHH-HhCCCCCC--CCc- Confidence 55432 112 235677763322211 12244555544332 24777777764 46664321 000 Q ss_pred HHHHhhhcCCCCCC-Ccc------cccCCCCCCCC-------CCCCccccc----ccc---CCccccchhcC Q lcl|NC_021072. 483 QIDSEREAGLIVDP-MAE------MDPAMDPGNAP-------PADDMSAQE----GPA---VDAGDAKRGEF 533 (533) Q Consensus 483 qi~~E~~~~~~~~p-~~~------~~~~~~~~~~~-------~~~d~~~~~----~~~---~~~~~~~~~~~ 533 (533) ..++..| +.+ ..|+.+++... ...|-...+ ++. -+.+..+..+. T Consensus 382 -------~~~~~~p~~~~~a~~~~~~p~~~e~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~ 446 (723) T protein:vir:94 382 -------GQMTLTPYRAQFAPAPAPAPAVEEGAARMLALLERVAADRPLPELPVRATTVLHHDPGPDPQQTL 446 (723) T ss_pred -------ccceeccccccccCCCCCCccchhhhHhhhhhccccccccCcCCCCCCCCCCCCCCcccCCchhH Confidence 0000111 011 11111111100 000000000 000 00111111111 No 182 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=81.34 E-value=0.087 Score=26.41 Aligned_cols=359 Identities=15% Similarity=0.138 Sum_probs=147.4 Q ss_pred CCccccceeeeccccccccCCCCCCCC-CcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcc Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDS-MDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNE 79 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~-~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvne 79 (533) |. +|| +..|+..... .++.... .|.+. . + .| +.+-|..||+-|++. T Consensus 1 Mg--~f~----------~~~~~~~~~~~~~~~~~~---~~~~~-----------~--~-~~----~~~~v~~~i~~Ia~~ 47 (378) T protein:vir:16 1 MN--LFG----------KVVSFSRGKLNNDTQRVT---AWQNE-----------A--V-EY----TSAFVTNIHNKIANE 47 (378) T ss_pred Cc--cch----------hhhhhhcccccCCcceee---ecccc-----------h--h-hH----HHHHHHHHHHHHHhh Confidence 21 111 1111110000 0000000 01110 0 0 12 344588999999988 Q ss_pred eeeecCCCceEEEEec--cCCCcHHHHHHHHHHHHHHHHHhcch----hhhhHHHH----hhhhcCceeeeeeecCCCCC Q lcl|NC_021072. 80 TICGNFDDVPVEVELS--NLKQSDKIKKLIREEFAEILRLLDFE----NRSYEIFR----RWYVDGRLFYHKVIDPKNPR 149 (533) Q Consensus 80 aiv~d~~~~~v~v~l~--~~~~S~~ik~~I~eeF~~i~~lL~f~----~~~~~~fR----~WYvDGri~~hkvid~~~~~ 149 (533) +-. .|+.+--. +-...+...+.+. ..++++|+.. -.+.++.+ .+...|.-|..++.|. .. T Consensus 48 iA~-----l~~~~~~~~~~~~~~~~~~~~~~---~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~--~~ 117 (378) T protein:vir:16 48 ITK-----VEFNHVKYKKSDVGSDTLISMAG---SDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDD--NT 117 (378) T ss_pred hhh-----CceeEEEEccccccccccccccc---chHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeec--CC Confidence 553 44443211 1111222222222 2445555432 35555544 5777999999888763 11 Q ss_pred CCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCC Q lcl|NC_021072. 150 GGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKN 229 (533) Q Consensus 150 ~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~ 229 (533) .++.++-|.... ..| .+ ...++|+- +..+. T Consensus 118 ---g~~~~l~~~~~~---------------------------~~~--~~-------~diih~r~-----------~~~~~ 147 (378) T protein:vir:16 118 ---GELLDLLFADDK---------------------------KEY--KP-------EELVRLTS-----------PFYIN 147 (378) T ss_pred ---ceEEEEEecCCe---------------------------eEe--cc-------cceEEecC-----------ccCcc Confidence 123333222110 001 11 12233321 01123 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccc Q lcl|NC_021072. 230 MTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFM 309 (533) Q Consensus 230 ~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~m 309 (533) .+.|.|+.|.+.++. + +.-+--|-+..++ +.|....+.+....+...|++..--+.+ | +.+ T Consensus 148 ~~~s~l~~~~~~i~~------~-----~~~~~~~g~l~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~-g------~~~ 208 (378) T protein:vir:16 148 EDTSILDNALASIQT------K-----LEQGKLRGLLKIN-AFLDIDNTQEYREKALTTIKNMQEGSSY-N------GLT 208 (378) T ss_pred chhHHHHHHHHHHHH------H-----HhcCccceeeEeC-CcCCHHHHHHHHHHHHHHHHHhhccccc-c------cce Confidence 356788887766432 1 1122223344443 4455555555444444444432221111 1 111 Q ss_pred hhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHH Q lcl|NC_021072. 310 SMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARL 389 (533) Q Consensus 310 smlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rL 389 (533) . + ..|.+++.|.-.....++...+|-++.+.++++||.+.|... . +++-.+. |..+ .| T Consensus 209 v-l----------~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~g~--~----~e~~~~~---f~~~--tl 266 (378) T protein:vir:16 209 P-V----------DNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGT--A----SQEQQIY---FYNS--TI 266 (378) T ss_pred E-c----------CCCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCC--c----hHHHHHH---HHHH--HH Confidence 1 1 124556655544445567888999999999999999998421 1 1222221 4333 23 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCHhHHhhhh---hceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHH Q lcl|NC_021072. 390 RKRFSELFMDLLKTQLILKGVMSLEEWDEMK---EHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMR 466 (533) Q Consensus 390 r~~fs~if~d~Lk~qLilkgi~t~eew~~~~---~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~ 466 (533) +-.+ ..+.+.|... +++++|+.... ....++|. +..+.... +..|++.+..+-.- -++|.+-++ T Consensus 267 ~P~~-~~ie~~l~~k-----Ll~~~e~~~~~~~~~~~~~~f~----~~~l~~~d-~~~~~~~~~~~~~~--G~~T~NE~R 333 (378) T protein:vir:16 267 IPLL-IQLEKELTYK-----LISTNRRRVVKGNLYYERIIVD----NQLFKFAT-LKELIDLYHENING--PIFTQNQLL 333 (378) T ss_pred HHHH-HHHHHHHHhh-----cCChhhhhhhhhcccccceeec----cchhhhcC-HHHHHHHHHHHHhC--CCcCHHHHH Confidence 2211 2223333333 45666665422 22345554 33333332 23666666555443 366777665 Q ss_pred HHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCc----cc-ccCCCCCCCCCCCCcccc Q lcl|NC_021072. 467 RQVLKQTDQEIKEIDKQIDSEREAGLIVDPMA----EM-DPAMDPGNAPPADDMSAQ 518 (533) Q Consensus 467 k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~----~~-~~~~~~~~~~~~~d~~~~ 518 (533) . ++++.+-+ . .+-.+...+. +. +......+..+.++.+++ T Consensus 334 ~-~~g~~p~~--g---------gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~ne 378 (378) T protein:vir:16 334 V-KMGEQPIE--G---------GDVYIANLNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred H-HhCCCCCC--C---------CCeEeeccccccccchhhhcCccCCCCCCCCCCCC Confidence 4 35554311 0 0101110000 00 000001112222333332 No 183 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=78.81 E-value=0.11 Score=25.83 Aligned_cols=449 Identities=12% Similarity=0.087 Sum_probs=165.9 Q ss_pred CCccccceeeecc----ccccccCCCCCCC-CCcc-cceeecccccccccchhhhhhHHHHHHHHHHhh-hhcchhhhHH Q lcl|NC_021072. 1 MSNQLFGFSLERA----KKVPKGPSFVQKD-SMDG-SQPIVGGGYYGYSVDFDGTVRNEYELITRYREM-VLQPECDSAV 73 (533) Q Consensus 1 ~~~~~fg~~i~~~----~~~~~~~s~~~~~-~~dg-~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m-~~~pEvd~Av 73 (533) |+.+-.-..|... ....+++-+..-. ++.+ .+.+..+.-.|+. .-...++...+ ..++.+ ...|-|..+| T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~--~~~~~~~~~~~-~~l~~~~~~~~~~~~~i 89 (535) T protein:vir:10 13 LSNKKSTSYIELGDYDKDIVNKAIRPGRASARDTVDGIDIADGNVAGQY--SVASISDVLST-KKLLKAYADNDIVQAII 89 (535) T ss_pred hhhhhhhhhHHHhhhhHHHHHhhhhhhhhhhhccccccccccCCccccc--ccCccccccCH-HHHHHHhccChhHHHHH Confidence 2222111111110 0001111000000 1110 0111111111111 01111111111 223333 4578888888 Q ss_pred HHhhcceee--e-c---CCCceEEEEecc---CCCcHHHHHHHHHHHHHHHHHhcch----hhhhH----HHHhh----h Q lcl|NC_021072. 74 DDIVNETIC--G-N---FDDVPVEVELSN---LKQSDKIKKLIREEFAEILRLLDFE----NRSYE----IFRRW----Y 132 (533) Q Consensus 74 deIvneaiv--~-d---~~~~~v~v~l~~---~~~S~~ik~~I~eeF~~i~~lL~f~----~~~~~----~fR~W----Y 132 (533) +-+++.+.+ | . .....+.|.|.. .+..+.++ +...+.++|... ....+ +++.. + T Consensus 90 ~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~-----~~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv~d~l 164 (535) T protein:vir:10 90 RTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIK-----RAHEIEDFIYNTGSEYYEWRDTFPRLLTKIINDMY 164 (535) T ss_pred HHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhh-----hhhHHHHHHHhCCCCCCChhHHHHHHHHHHHHHHH Confidence 777765332 1 1 223334444322 22222222 234455555321 12232 22221 2 Q ss_pred h-cCceeeeeeecCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccc--cccccCCcc Q lcl|NC_021072. 133 V-DGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKG--LKNSTNQGM 209 (533) Q Consensus 133 v-DGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~--~~~~~~~~~ 209 (533) + +|.-|..++-+ ..+.+++|.+|||.+|+............. +|.|...+ ..+.+...+ T Consensus 165 ~~~g~ay~~i~r~---~~G~~~~L~~l~p~~V~v~~d~~~~~~~~~---------------~~~~~~~~~~~~~~~~eii 226 (535) T protein:vir:10 165 VQDQINIERIFKN---DSNELDHFNAVDASKVVISYSPRSKDQPRK---------------FEQFVSETKSVKFSERNLT 226 (535) T ss_pred hhCCceEEEEEEC---CCCcEEEEEEeCCceeEEEEcCccccCceE---------------EEEEecCceeEEECcccEE Confidence 2 34445554433 456699999999999976444332211111 11111111 111222333 Q ss_pred eeccchhhccccccccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCC---CchHHHHHHHHHHH Q lcl|NC_021072. 210 KIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGN---LPKNKAEQYLREVM 286 (533) Q Consensus 210 kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGn---lpk~KAeqYl~~im 286 (533) +|......-. ..+.+++|-|+.|.+.+.....++....=+----+--+-|..++... +.+..++..-+.+- T Consensus 227 h~~~~~~~~~------~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~lk~~~~ 300 (535) T protein:vir:10 227 FINYWNLSDT------DRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQANQMMLAGIRRQWT 300 (535) T ss_pred EEeccCCCCc------ccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHH Confidence 3332211101 12345678999999999999888876654433334445666766432 33322222222111 Q ss_pred HhcccEEEeeCCCCccccccccchhHhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCC Q lcl|NC_021072. 287 GRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGE---LEDVKYFQKKLYKALNVPSSRLET 363 (533) Q Consensus 287 ~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLge---i~DV~YF~~kLy~aL~VP~sRl~~ 363 (533) ..|. | ..+ +.-..+++. .|.++.-|-- +..+ ++=..+..+..-++.+||...|+- T Consensus 301 ~~~~---------G-~~n-ag~~~vl~~---------~g~~~~~l~~--~~~D~qfle~~~~~~~eIa~afgVPp~~lG~ 358 (535) T protein:vir:10 301 SQGS---------G-LGG-AWKIPILAA---------KDAKFVNMTQ--NSRDMEFDKFLNFMIYDTAAIFQMQPEEINF 358 (535) T ss_pred HHhc---------C-ccc-ccccccccC---------CCceEEecCC--ChhHHHHHHHHHHHHHHHHHHhCCCHHHhcc Confidence 1221 1 011 111123322 1333333322 2223 333456788899999999999875 Q ss_pred CCcccccchh------hhhHHhhhHHHH----HHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchH Q lcl|NC_021072. 364 ETTFNIGRAA------EITRDEVKFQKF----IARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYF 433 (533) Q Consensus 364 ~~~~~~g~~~------eItRDElkF~Kf----i~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f 433 (533) ...-+..+++ .-+--|-.+..| +.-+..++...|.. .| ++..+ ..+.|+|.. T Consensus 359 ~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ln~----~L-----l~~~~-----~~~~f~f~~---- 420 (535) T protein:vir:10 359 PNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQVIND----KI-----MRYVD-----TDYRFSFTL---- 420 (535) T ss_pred ccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhh----hc-----ccccC-----CeEEEEecc---- Confidence 4322221111 111112222223 33333333333332 22 22211 235555542 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHH---HHHHHHHHHH---hhhcCCCCCCCcccccCCC-- Q lcl|NC_021072. 434 TELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQE---IKEIDKQIDS---EREAGLIVDPMAEMDPAMD-- 505 (533) Q Consensus 434 ~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDee---I~e~~kqi~~---E~~~~~~~~p~~~~~~~~~-- 505 (533) +.... ...|.++...+. .-.+|..-++.. ++|..-+ +--+.-+... ....+.-+.|...+..+.+ T Consensus 421 --l~~~d-~~~r~~~~~~~~---~g~lT~NE~R~~-~gl~piegGD~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~ 493 (535) T protein:vir:10 421 --GDAQD-KLQEEQVWKLKL---ANGYFINEYRKD-HGLKTVDGLDVPGFIGSAENFINATGFGQPNVPDSSDDSGSTLG 493 (535) T ss_pred --ccccC-HHHHHHHHHHHH---cCCCCHHHHHHH-hCCCCCCCccccccccchhhcccccccccccCCCCCCCccccCC Confidence 22222 234555554332 224788888854 7876531 1000000000 0000001111111111000 Q ss_pred ------CCC-----CCCCCCccc--------cccccCCcccc Q lcl|NC_021072. 506 ------PGN-----APPADDMSA--------QEGPAVDAGDA 528 (533) Q Consensus 506 ------~~~-----~~~~~d~~~--------~~~~~~~~~~~ 528 (533) +.+ +...||+.. .+.+.-+.+++ T Consensus 494 ~~~~q~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 535 (535) T protein:vir:10 494 ERERQERIQHSKDYEKGKDDPKSPLPKPSESDDVSNNEDADT 535 (535) T ss_pred ccccCcccccccccccCCCCCCCCCCcCCCCCccccccccCC Confidence 000 000111111 11111112222 No 184 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=78.47 E-value=0.11 Score=25.75 Aligned_cols=419 Identities=13% Similarity=0.079 Sum_probs=186.1 Q ss_pred eeccccccccCCCCCCCCCcccceeeccccc-ccccchhhhhhHHH-HHHHHHHhhhhcchhhhHHHHhhcceeeecCCC Q lcl|NC_021072. 10 LERAKKVPKGPSFVQKDSMDGSQPIVGGGYY-GYSVDFDGTVRNEY-ELITRYREMVLQPECDSAVDDIVNETICGNFDD 87 (533) Q Consensus 10 i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~-~~~~~~~~~~~~~~-~LI~~YR~m~~~pEvd~AvdeIvneaiv~d~~~ 87 (533) |++.-+...... ...|. .+. +.|. +-.+....-++... --++.|++|..++.|.++++-+...+.. T Consensus 1 v~~~~l~~e~at-----~~~~~-d~~-~~~~~~l~~~~~~il~~a~~g~~~~y~~l~~D~~i~s~l~~rk~av~~----- 68 (488) T protein:vir:99 1 MEKPALGREIAT-----SGDGR-DIT-RPFISGLQVPNDSILQRRGGNDLRVYEEILSDAQVKTVWGQRQLAVVS----- 68 (488) T ss_pred CCccchhHHHHH-----HHhhh-hhh-ccccCCCCCCChHHHHhhccCCHHHHHHHhhChHHHHHHHHHHHHHhc----- Confidence 333111110000 00000 000 1111 11111121111111 1168999999999999999999877664 Q ss_pred ceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcChhhceehh Q lcl|NC_021072. 88 VPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDPRKIRKVT 167 (533) Q Consensus 88 ~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP~~i~~vr 167 (533) .+..|.- . +.|. -.+++.++....++-++|+.-..+++-- ..-|--++++|....+..-.+..+...+|+.+++-+ T Consensus 69 ~~w~i~p-~-~~~~-~~~~~ae~v~~~l~~~~~~~~l~~~lda-~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d~ 144 (488) T protein:vir:99 69 REWKVEA-G-GDRP-IDQAAAEHLEQQLQRVGWDRVTSKMLFG-VFYGYAVSELIYGRDDRYITLEAIKVRNRRRFRYDQ 144 (488) T ss_pred CCceEEc-C-CCCh-HHHHHHHHHHHHHhCCCHHHHHHHHHhh-hhhcceeEEEEEeecCCeeeEeeeeeecccceeecC Confidence 3444432 1 1122 2334455556666667888777777754 447889999999765555566788888888774311 Q ss_pred hccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccc-hhhccccccccCCCCccchhHHHHHHHHHHHH Q lcl|NC_021072. 168 EYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATD-SVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLR 246 (533) Q Consensus 168 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~d-ai~y~hsGl~d~~~~~i~syL~~AiK~~NqLr 246 (533) . .+.+.... . ....+.-+|+. -+.++. +--...++...|-|.+|..+|--.+ T Consensus 145 -----~-~~l~~~~~------~--------------~~~~g~~lp~~~~~i~~~-~~~~~g~p~g~gLl~~~~w~~~fK~ 197 (488) T protein:vir:99 145 -----D-GGLRLLTP------N--------------NMFEGEPCPAPYFWHFST-GADNDDEPYGLGLAHWLYWPVFFKR 197 (488) T ss_pred -----C-CceEEecc------C--------------CCCCccccccCceEEEEe-ecCCCCCcccchHHHHHHHHHHHHH Confidence 1 11111000 0 00111222221 111111 1111224556789999999988888 Q ss_pred HHHHHHHHHHHh-cCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccccCCCCc Q lcl|NC_021072. 247 MIEDSLVIYRLS-RAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRG 325 (533) Q Consensus 247 m~EDalVIyRi~-RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpRReggrg 325 (533) ......+.+--. =.|-|-.-|=+.|.=+..| .+.++. +....+.-+ | .+ | .| T Consensus 198 ~~~~~w~~f~E~yG~P~~igky~~~~a~~~ek-~~l~~a-v~~~~~~~~-----~-------vi--------P-----~~ 250 (488) T protein:vir:99 198 NGIKFWLIFLDKFGMPTAVGRYDDKTATPEDK-AKLLAA-LHAIQTDSA-----I-------IM--------P-----AG 250 (488) T ss_pred hhHHHHHHHHHHcCCceeeeecCCCCCCHHHH-HHHHHH-HHHHhcCcE-----E-------Ee--------c-----CC Confidence 877777776432 3565533332233323322 233333 222222111 1 11 1 36 Q ss_pred cceeecCCCCCcch--HHHHHHHHHHHHHhcCCCccccCCCC---cccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 326 TEISTLPGGQNLGE--LEDVKYFQKKLYKALNVPSSRLETET---TFNIGRAAEITRDEVKFQKFIARLRKRFSELFMDL 400 (533) Q Consensus 326 TEIsTLpGg~nLge--i~DV~YF~~kLy~aL~VP~sRl~~~~---~~~~g~~~eItRDElkF~Kfi~rLr~~fs~if~d~ 400 (533) ++|..+..+..-+. ..=++|..++.-+++-=- =|..++ +.++|. +-+|+ +...+..-.+.++..|..- T Consensus 251 ~~ie~~ea~~~~~~~~~~li~~~d~~Isk~iLGq--tlts~~~~Gs~a~~~----vh~~v-~~d~~~aDa~~i~~tln~~ 323 (488) T protein:vir:99 251 MQAELLEAGRSGTADYKTLHDTMDATIAKVGLGQ--VASTQGTPGRLGNDD----LQADV-RLDLVKADADLICESFNLG 323 (488) T ss_pred ceeEEeecCCCChHHHHHHHHHHHHHHHHHHhhh--hhcccccccchhhHH----HHHHH-HHHHHHHHHHHHHHHHHHH Confidence 78888864332222 234778888877764211 122222 222222 33333 3444555556666666542 Q ss_pred HHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHH Q lcl|NC_021072. 401 LKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEI 480 (533) Q Consensus 401 Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~ 480 (533) |-..|+.=|. .. .-...+.|++.. .|-++.+.+.+..+-+..|-=++.+|++++ +++...+-.+. T Consensus 324 li~~l~~~N~-~~----~~~p~~~~~~~e---------~edl~~~a~~~~~l~~~~G~~i~~~~i~e~-~Gip~~~~~~~ 388 (488) T protein:vir:99 324 PARWLTEWNF-PG----AQPPRVYRVIEE---------PEDITAKAERDEKVFRMSGFRPTRGYVQET-YGVEVESTQAE 388 (488) T ss_pred HHHHHHHhCc-CC----cCCceeEecCCC---------cccHHHHHHHHHHHHhhcCCCCCHHHHHHH-cCCCCcccccc Confidence 2223333231 11 112334444432 233345555555555555656889999966 78865432110 Q ss_pred HHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccc----ccc-----------CCccccchhcC Q lcl|NC_021072. 481 DKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQE----GPA-----------VDAGDAKRGEF 533 (533) Q Consensus 481 ~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~----~~~-----------~~~~~~~~~~~ 533 (533) ...+.|... ++.......+.+....+- .++ +.++++.+ |+ T Consensus 389 ----------~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~s~e-e~ 443 (488) T protein:vir:99 389 ----------ATAPTPSTE--FAEGDQPSDPAAAMAPQLAEAMQPVVGNWTTQLRTLIEQASSLE-DL 443 (488) T ss_pred ----------cccCCCccc--CCCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHH-HH Confidence 111222211 111111111111111110 000 00111110 00 No 185 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=70.58 E-value=0.21 Score=24.33 Aligned_cols=326 Identities=15% Similarity=0.121 Sum_probs=129.7 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecc----cccccccchhhhh----hHHHHHHHHHHhhhhcchhhhH Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGG----GYYGYSVDFDGTV----RNEYELITRYREMVLQPECDSA 72 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~----~~~~~~~~~~~~~----~~~~~LI~~YR~m~~~pEvd~A 72 (533) |+-+.=--.-+.... ++.+.+.--......++..+ .|...+.++ +.. -+...|-+.+|....| .++ T Consensus 1 m~~~~~~~~~~~~~~--~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~-~~~~~pp~~~~~la~l~~~~~~h---~~~ 74 (346) T protein:vir:10 1 MKKQLRKNLTQNDRL--QPQAQTEIFSFGDPIPVLDRADILNYLECSAMY-EKWYNPPMSFDGLAKSLRSSTHH---ESA 74 (346) T ss_pred CCcccCCCCCccccc--ccccCeEEEecCCcceecCchhHHHHHHHhhcC-CceEecCCCHHHHHHHHHhhhhc---chh Confidence 443211000000000 00000000000000001000 011110000 000 0133344444444433 112 Q ss_pred HHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcc------hhhhhHHHHhhhhcCceeeeeeecCC Q lcl|NC_021072. 73 VDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDF------ENRSYEIFRRWYVDGRLFYHKVIDPK 146 (533) Q Consensus 73 vdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f------~~~~~~~fR~WYvDGri~~hkvid~~ 146 (533) +.-=.| .+..++.- ...-..++..|.+-|.-|++++-+. T Consensus 75 i~~k~n----------------------------------~l~~l~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~i~r~~- 119 (346) T protein:vir:10 75 IITKAN----------------------------------ILLSTCEVDSRYLSRRDLSSFVKDYLVFGNAYFEVVRNR- 119 (346) T ss_pred hhhhhh----------------------------------hHHHHHhCCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcC- Confidence 111011 11222210 0011223445777899999988754 Q ss_pred CCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceecccccc--ccccCCcceeccchhhccccccc Q lcl|NC_021072. 147 NPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGL--KNSTNQGMKIATDSVTYCHSGIQ 224 (533) Q Consensus 147 ~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~--~~~~~~~~kI~~dai~y~hsGl~ 224 (533) .+.+++|..++|..++.. ...+...+. .+.+.+. .+.+...+++..-... T Consensus 120 --~G~~~~L~pl~~~~v~~~-----~~~~~~~~~--------------~~~~~g~~~~~~~~dIih~r~~~~~------- 171 (346) T protein:vir:10 120 --LGQVQRIESPLAKYVRKG-----LEAGQFYYV--------------PQRFDHQEHEFAKGSIYHLLEPDIN------- 171 (346) T ss_pred --CCcEEEEEEecCCceEEE-----EcCCeEEEE--------------EEccCCeEEEEecccEEEecCCCCC------- Confidence 566999999999999642 122222111 1111111 1112222333222111 Q ss_pred cCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCcccc Q lcl|NC_021072. 225 DLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKD 304 (533) Q Consensus 225 d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~ 304 (533) +.-.++|-+..|+..+..-...+....=|=--=|--.-|+|+.-++|.+..+++ +++-+.+.+.. .+.|. T Consensus 172 --~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~d~~l~~e~~~~-i~~~~~~~~g~----~n~~~--- 241 (346) T protein:vir:10 172 --QDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFYMSDASQKQEDVEN-IRQQLKQSKGV----GNFKN--- 241 (346) T ss_pred --CCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHH-HHHHHHHhcCc----cccCc--- Confidence 122345778888777666555554432211112335567777556776555444 44444443310 11111 Q ss_pred ccccchhHhhhcccccCCCCccceeecCCCCCcchHHHH-HHHHHHHHHhcCCCccccCC--CCcccccchhhhhHHhhh Q lcl|NC_021072. 305 DKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDV-KYFQKKLYKALNVPSSRLET--ETTFNIGRAAEITRDEVK 381 (533) Q Consensus 305 d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV-~YF~~kLy~aL~VP~sRl~~--~~~~~~g~~~eItRDElk 381 (533) +-+++ --....|.+++.|.--..=.|+-.+ .+-.+...++.+||...++- .++-+++..++..+. T Consensus 242 ----~~vl~-----~~~~~~gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~~~~~--- 309 (346) T protein:vir:10 242 ----LFVHA-----PNGKKDGIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVADAAEV--- 309 (346) T ss_pred ----eeEec-----CCCCccceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHH--- Confidence 10111 0011135555555322211122222 24466789999999999863 233445555555555 Q ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHH Q lcl|NC_021072. 382 FQKF-IARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTE 435 (533) Q Consensus 382 F~Kf-i~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E 435 (533) |.++ |.-|+.+|.+++..+.. + .|+|+-..=--++| T Consensus 310 f~~~~l~P~~~~iee~n~~L~~-e-----------------~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 310 FFITEIEPLQERLKEFNQWLGQ-E-----------------VIKFKPSKLLQRTQ 346 (346) T ss_pred HHHHHHHHHHHHHHHHHhhccc-c-----------------eeeechhhhcccCC Confidence 6666 78888888775543211 1 12221110001111 No 186 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=61.56 E-value=0.35 Score=23.08 Aligned_cols=357 Identities=15% Similarity=0.123 Sum_probs=145.2 Q ss_pred CCccccceeeeccccccccCCCCC--CCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhc Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQ--KDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVN 78 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~--~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvn 78 (533) |. +||-. .|+.. -+.+.+ ... .|.+.. + .| +.+-|..||+-|++ T Consensus 1 Mg--~f~~~----------~~f~~~~~~~~~~-~~~---~~~~~~-------------~-~~----~~~~v~~~i~~Ia~ 46 (378) T protein:vir:93 1 MN--LFGKV----------VSFSRGKLNNDTQ-RVT---AWQNEA-------------V-EY----TSAFVTNIHNKIAN 46 (378) T ss_pred Cc--cchhh----------hhhhccccCCCcc-eee---ecccch-------------h-HH----HHHHHHHHHHHHHh Confidence 32 23221 11110 001111 000 111100 0 11 33458889999988 Q ss_pred ceeeecCCCceEEEEec--cCCCcHHHHHHHHHHHHHHHHHhcch----hhhhHHHH----hhhhcCceeeeeeecCCCC Q lcl|NC_021072. 79 ETICGNFDDVPVEVELS--NLKQSDKIKKLIREEFAEILRLLDFE----NRSYEIFR----RWYVDGRLFYHKVIDPKNP 148 (533) Q Consensus 79 eaiv~d~~~~~v~v~l~--~~~~S~~ik~~I~eeF~~i~~lL~f~----~~~~~~fR----~WYvDGri~~hkvid~~~~ 148 (533) .+-. .|+.+--. +-...+...+.+ -..++++|+.+ -.+.++.+ .+...|.-|.+++.+. . T Consensus 47 ~iA~-----lp~~~~~~~~~~~~~~~~~~~~---~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~--~ 116 (378) T protein:vir:93 47 EITK-----VEFNHVKYKKSDVGSDTLISMA---GSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDD--N 116 (378) T ss_pred hhhh-----CceeeEEEcccccccccccccc---cchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeec--C Confidence 8543 55544321 111122222222 22455555433 35555544 5778999999888763 1 Q ss_pred CCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCC Q lcl|NC_021072. 149 RGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNK 228 (533) Q Consensus 149 ~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~ 228 (533) .+ ++.++-|.... . .|.+ ...++|+- . ..+ T Consensus 117 ~g---~~~~l~~~~~~--------------------------~---~~~~-------~diih~r~--~---------~~~ 146 (378) T protein:vir:93 117 TG---ELLDLLFADDK--------------------------K---EYKT-------EELVRLTS--P---------FYI 146 (378) T ss_pred Cc---eEEEEEecCCe--------------------------e---Eecc-------ceeEEecC--c---------ccc Confidence 11 23333221110 0 0111 12233321 0 112 Q ss_pred CccchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCcccccccc Q lcl|NC_021072. 229 NMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKF 308 (533) Q Consensus 229 ~~i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~ 308 (533) ..++|-|+.|...+.. ....+--+-+..++ |.|.+..+.+....+...|++..--+. .| .. T Consensus 147 ~~~~s~l~~~~~~i~~-----------~~~~~~~~g~l~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~------~~ 207 (378) T protein:vir:93 147 NEDTSILDNALASIQT-----------KLEQGKLRGLLKIN-AFLDIDNTQEYREKALTTIKNMQEGSS-YN------GL 207 (378) T ss_pred chhhHHHHHHHHHHHH-----------HHhcCcccceeeeC-CcCCHHHHHHHHHHHHHHHHHhhcccc-cc------cc Confidence 2345777777655432 11222223444443 344443333333334344433221111 11 11 Q ss_pred chhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHH- Q lcl|NC_021072. 309 MSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIA- 387 (533) Q Consensus 309 msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~- 387 (533) + .+ ..|.+++.|.-.....+++..+|-.+.+.++++||.+.|.. ..+| -.+..|+. T Consensus 208 ~-~l----------~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~g-------~~~e-----~~~~~f~~~ 264 (378) T protein:vir:93 208 T-PV----------DNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLG-------TATQ-----EQQIYFYNS 264 (378) T ss_pred e-Ec----------CCCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcC-------CcHH-----HHHHHHHHH Confidence 1 11 12455555544344456788899999999999999998842 2222 11222322 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhh---hceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHH Q lcl|NC_021072. 388 RLRKRFSELFMDLLKTQLILKGVMSLEEWDEMK---EHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDY 464 (533) Q Consensus 388 rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~---~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~ 464 (533) .|+-. ...+.+.|- ..++++.|+.... ..+.+.|. +.++.... +..|++.+..+-.- -+++.+- T Consensus 265 tl~P~-~~~ie~~l~-----~kLl~~~er~~~~~~~~~~~~~fd----~~~l~~~d-~~~~~~~~~~~~~~--G~~t~NE 331 (378) T protein:vir:93 265 TIIPL-LIQLEKELT-----YKLISTNRRRVVKGNLYYERIIVD----NQLFKFAT-LKELIDLYHENING--PIFTQNQ 331 (378) T ss_pred HHHHH-HHHHHHHHH-----hhcCChhHhhhhhhcccccceeec----cchhhhcC-HHHHHHHHHHHHhC--CCcCHHH Confidence 22221 122233333 3445666665432 23345554 33333222 24666666655443 3666666 Q ss_pred HHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCc----c-cccCCCCCCCCCCCCcccc Q lcl|NC_021072. 465 MRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMA----E-MDPAMDPGNAPPADDMSAQ 518 (533) Q Consensus 465 i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~----~-~~~~~~~~~~~~~~d~~~~ 518 (533) ++. .++|.+-+ . .+-.+...+. . .+.....++..+.++.+++ T Consensus 332 ~R~-~~gl~p~~--g---------gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:93 332 LLV-KMGEQPIE--G---------GDVYIANLNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred HHH-HhCCCCCC--C---------CCeeeeccccccccchhhhcCccCCCCCCCCCCCC Confidence 654 35553211 0 0000000000 0 0000011222233333333 No 187 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=59.17 E-value=0.4 Score=22.78 Aligned_cols=367 Identities=10% Similarity=0.040 Sum_probs=187.3 Q ss_pred Ccccceeecccccccccchhhhhh-------HHHHHHHHHHhhhh---------c-----------chhhhHHHHhhcce Q lcl|NC_021072. 28 MDGSQPIVGGGYYGYSVDFDGTVR-------NEYELITRYREMVL---------Q-----------PECDSAVDDIVNET 80 (533) Q Consensus 28 ~dg~~~~~~~~~~~~~~~~~~~~~-------~~~~LI~~YR~m~~---------~-----------pEvd~AvdeIvnea 80 (533) ++ .+.-+.+. .+..++..|=+... . +=+.-+|+-.+.-+ T Consensus 1 ~~--------------~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl 66 (409) T protein:vir:16 1 MT--------------EKGIGYLRFKLSVHKRRAEMRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRL 66 (409) T ss_pred CC--------------HHHHHHHHHHHHHHhHHHHHHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhc Confidence 11 11111111 12222222211100 0 11111222221111 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcCh Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYIDP 160 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lDP 160 (533) .+ + +|+.. -++...+...=+|+.+..+.++.=++.|+-|.- |.-. ..|-..++.++| T Consensus 67 ~~---~-----------Gf~~~-----d~~l~~i~~~N~ld~~~~~~~~~al~yG~sf~~-v~~~---~dg~~~i~~~sP 123 (409) T protein:vir:16 67 VF---R-----------EFEND-----DFTVNEIFEENNPDIFFDSTVLSALIASCSFTY-ISKG---ENDAVRLQVIEA 123 (409) T ss_pred cc---c-----------cccCc-----chHHHHHHHhcChhHHHHHHHHHHHHhCceeEE-EecC---CCCceEEEEEcc Confidence 11 1 11100 012445666667888999999999999997663 4421 245578889999 Q ss_pred hhceehhhccCCCcCceeE-Eecc-ceeeccchhceeccccccc----------cccCCcceeccchhhccccccccCCC Q lcl|NC_021072. 161 RKIRKVTEYQQKRPEQLRG-EDIN-TQLTQKAAEYYLYNPKGLK----------NSTNQGMKIATDSVTYCHSGIQDLNK 228 (533) Q Consensus 161 ~~i~~vr~~~~~~~~~~~~-~~~~-~~~~~~~~e~~~y~p~~~~----------~~~~~~~kI~~dai~y~hsGl~d~~~ 228 (533) +.+--+.+ .......+ .... ....++.....+|-|.... ..+|..=++| .+.|++.- +... T Consensus 124 ~~~~~i~D---~~~~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP--vV~f~n~~--~~~~ 196 (409) T protein:vir:16 124 TNATGIID---PITGLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGNPL--LVPIIHRP--DAVR 196 (409) T ss_pred cceEEEee---cccccceeeeEEEEecCCCceEEEEEEecCcEEEEEecCccccceecCCCCcc--eEEecccc--cccc Confidence 88754332 22111111 1100 0011111111223222110 0112211222 12222211 1111 Q ss_pred CccchhHHHHHH-HHHH-HHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCcccccc Q lcl|NC_021072. 229 NMTLSHLHKAIK-AVNQ-LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDK 306 (533) Q Consensus 229 ~~i~syL~~AiK-~~Nq-Lrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~ 306 (533) ..+.|-+-..+. ..+- -|.+.+.++.=..+=.|.|-++=+|...-|..+=. T Consensus 197 ~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~--------------------------- 249 (409) T protein:vir:16 197 PFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPMETWK--------------------------- 249 (409) T ss_pred cCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCCccchhh--------------------------- Confidence 222333322222 1122 25678888998999999998886654322221110 Q ss_pred ccchhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHH Q lcl|NC_021072. 307 KFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFI 386 (533) Q Consensus 307 ~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi 386 (533) ..+-.=..+|.-+.|-+.+|..++++.=-+=++-++-.-..+....++|.+-|+..+. |-..+..|.-.|....+-+ T Consensus 250 --~~~~~i~~~~~d~~g~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~-NpsSa~Ai~a~~~~L~~ka 326 (409) T protein:vir:16 250 --ATVSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSD-NPSSVEAIKASHENLRLAG 326 (409) T ss_pred --hhhhHhhccCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhhcCCCHHHcccccC-chhHHHHHHHHHHHHHHHH Confidence 0111122366667777788989988652223566666777888888999888876543 3334456777888899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhcccccc-HHHH Q lcl|NC_021072. 387 ARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFS-IDYM 465 (533) Q Consensus 387 ~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S-~~~i 465 (533) .+.|+.|..-+..++|.-+.+.|-.. ++......+.+.|. ++.. .++.-+.++.+.+..+..- |+.+. .+.+ T Consensus 327 ~~k~~~fg~~l~~~~rla~~~~~~~~--~~~~~~~~~~v~W~-~~~~---~~~~s~a~~aDa~~Kl~~a-~~~~~~~~v~ 399 (409) T protein:vir:16 327 RKAQRSLGAGLLNVAYLAACLRDDVP--YLREQFSKTKPKWE-PLFE---ADASMLSLIGDGAIKLNQA-IPEFINKDTI 399 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCC--ccchhhccceEEec-CCCC---cchhhHHHHHHHHHHHHhh-cccccchhHH Confidence 99999999999999999888766432 22222234677775 2211 2234456777888777664 54444 4555 Q ss_pred HHHHhCCCHHH Q lcl|NC_021072. 466 RRQVLKQTDQE 476 (533) Q Consensus 466 ~k~IL~~tDee 476 (533) .+-|++|+.| T Consensus 400 -~~~~g~~~~d 409 (409) T protein:vir:16 400 -RDLTGIKGAE 409 (409) T ss_pred -HHhccCCCCC Confidence 4668999999 No 188 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=56.64 E-value=0.45 Score=22.47 Aligned_cols=240 Identities=15% Similarity=0.097 Sum_probs=110.7 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |. || ++.++ .+..+ ++......... ..++.+.. ...+. ...-.++|-|.+||+-|.+.. T Consensus 1 Mg--lF-~~~~~----r~~~~--~~~~~~~~~~~-~~~~~~~~---~~~v~--------~~~al~~~~v~~~i~~ia~~i 59 (251) T protein:vir:46 1 MG--IF-YKNEK----RDLQY--NEDDLQMMVQT-LPSFQGTK---LRQYK--------DIEAIRHSDIFTAVMMIASDL 59 (251) T ss_pred CC--cc-ccccc----cccCC--Cccchhhhhhh-hccccCcC---cceec--------hhhhhccHHHHHHHHHHHHhH Confidence 43 23 22111 11111 11111111111 11121111 00111 122356788999999988774 Q ss_pred eeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhc----chhhhhHHHHh----hhhcCceeeeeeecCCCCCCCe Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLD----FENRSYEIFRR----WYVDGRLFYHKVIDPKNPRGGL 152 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~----f~~~~~~~fR~----WYvDGri~~hkvid~~~~~~gI 152 (533) -- .|+.+.=.. +..+ -..++++|+ -.-.+.++.+. ++..|.-|.-++-|. .+-+ T Consensus 60 A~-----lp~~~~~~~----~~~~------~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~---~G~~ 121 (251) T protein:vir:46 60 AR-----MPIRVTVNG----QINY------SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDK---TGEP 121 (251) T ss_pred hh-----CceEEeeCc----cccc------cchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECC---CCcE Confidence 32 454443111 1111 123444443 33455555444 567799999987764 4559 Q ss_pred EEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccc--cccccCCcceeccchhhccccccccCCCCc Q lcl|NC_021072. 153 TELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKG--LKNSTNQGMKIATDSVTYCHSGIQDLNKNM 230 (533) Q Consensus 153 ~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~--~~~~~~~~~kI~~dai~y~hsGl~d~~~~~ 230 (533) ++|.+|+|..++.++.. +...+..+... .++..+ ..+...+.++|..- ..++-. T Consensus 122 ~~L~~i~~~~v~v~~~~---~g~~~~~~~~~-----------~~~~~g~~~~~~~~diiH~r~~----------~~dg~~ 177 (251) T protein:vir:46 122 MNLTFRKTSEIELKSDA---RGRLYYFHQRI-----------DSNGNNIERNVKFEDMLDIKFY----------SLDGIN 177 (251) T ss_pred EEEEEECCceEEEEECC---CCcEEEEEEEe-----------ccCCcceeEEECCccEEEecCc----------CCCCee Confidence 99999999999754322 11111000000 011111 11222334444321 122335 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHH-HHHHHHHHHhcccEEEeeCCCCccccccccc Q lcl|NC_021072. 231 TLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKA-EQYLREVMGRYRNKLVYDANTGEIKDDKKFM 309 (533) Q Consensus 231 i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KA-eqYl~~im~~~rnk~vYd~~TGev~~d~~~m 309 (533) ++|-|+.|..++......++...-+=---+--+-+..++ |.|...+| ++..+.....|.-- ...|.|-. T Consensus 178 G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~~e~~~~~~~~~~~~~~g~----~n~g~~~~----- 247 (251) T protein:vir:46 178 GLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFPKVLVEL----NKLGKLSY----- 247 (251) T ss_pred ecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeC-CCCCCHHHHHHHHHHHHHHhcCc----cccccccc----- Confidence 678899999999888888876554433334456677777 56644444 44444454444311 22344331 Q ss_pred hhHh Q lcl|NC_021072. 310 SMLE 313 (533) Q Consensus 310 smlE 313 (533) .|-| T Consensus 248 gm~~ 251 (251) T protein:vir:46 248 SMNQ 251 (251) T ss_pred ccCC Confidence 1222 No 189 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=54.79 E-value=0.49 Score=22.26 Aligned_cols=388 Identities=14% Similarity=0.128 Sum_probs=169.8 Q ss_pred cchh--hhh-hHHHHHHHHHHhhhhcchhhh---------------------------HHHHhhcceeeecCCCceEEEE Q lcl|NC_021072. 44 VDFD--GTV-RNEYELITRYREMVLQPECDS---------------------------AVDDIVNETICGNFDDVPVEVE 93 (533) Q Consensus 44 ~~~~--~~~-~~~~~LI~~YR~m~~~pEvd~---------------------------AvdeIvneaiv~d~~~~~v~v~ 93 (533) ...+ ..+ +.-..-+.+|+.+..+.+-+. -...||+..+-+ .-+.|+.+. T Consensus 1 l~~~~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~y-l~G~p~~~~ 79 (451) T protein:vir:10 1 MELEKIRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASY-MFTYPVLFD 79 (451) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhh-eecccceee Confidence 0000 000 011112233444433333321 112223221111 125666665 Q ss_pred eccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCC----CCCCeEEEEEcChhhceehhhc Q lcl|NC_021072. 94 LSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKN----PRGGLTELRYIDPRKIRKVTEY 169 (533) Q Consensus 94 l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~----~~~gI~elr~lDP~~i~~vr~~ 169 (533) .++ ++...+. ++..++ =+|+....++.+.+.+.|+-|.+.-+|.+. +.+|-..+..++|+.+=+|..- T Consensus 80 ~~~---~~~~~~~----~~~~~~-n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~vydd 151 (451) T protein:vir:10 80 IDN---NKELNEK----VTDVLG-NEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPIYRN 151 (451) T ss_pred cCC---cHHHHHH----HHHHhc-cCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEEEcC Confidence 433 2333222 222222 268899999999999999999998887432 2346777899999998665432 Q ss_pred cCCCcCce---eEEeccceee----ccc-hhceeccccccc--------c---------ccCCcceeccchhhccccccc Q lcl|NC_021072. 170 QQKRPEQL---RGEDINTQLT----QKA-AEYYLYNPKGLK--------N---------STNQGMKIATDSVTYCHSGIQ 224 (533) Q Consensus 170 ~~~~~~~~---~~~~~~~~~~----~~~-~e~~~y~p~~~~--------~---------~~~~~~kI~~dai~y~hsGl~ 224 (533) ... .... +++....... ... .-..+|.+.... . .+|.--+||.-. | T Consensus 152 ~~~-~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~--~------ 222 (451) T protein:vir:10 152 GIE-RELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVPFVE--F------ 222 (451) T ss_pred CCC-CceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeeeEEE--e------ Confidence 111 1111 1111100000 000 001122222110 0 011111222111 1 Q ss_pred cCCCCccchhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccc Q lcl|NC_021072. 225 DLNKNMTLSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIK 303 (533) Q Consensus 225 d~~~~~i~syL~~AiK~~NqLrm-~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~ 303 (533) .++....|-++..+.....+.+ +=+..-.-+-+.-|-+-+.-.+... .+ +.+.. |..++--.+... T Consensus 223 -~nn~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~-~~----~~~~~-~~~~~~i~~~~~------ 289 (451) T protein:vir:10 223 -SNNIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGED-TS----EFLKE-LKRYKTIKTETD------ 289 (451) T ss_pred -ccCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCccc-ch----hhHHH-HhhCCeEEecCc------ Confidence 1133345777766666555543 3333334455666655443322212 11 11111 222221111111 Q ss_pred cccccchhHhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCCCcccccchhh--hhHHhh Q lcl|NC_021072. 304 DDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELED-VKYFQKKLYKALNVPSSRLETETTFNIGRAAE--ITRDEV 380 (533) Q Consensus 304 ~d~~~msmlEDywLpRReggrgTEIsTLpGg~nLgei~D-V~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~e--ItRDEl 380 (533) +.+.|-.+..|....+...... +.-+.+.+|+..++|- +..+ ++|++|. |.--.. T Consensus 290 -----------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~---~~gn~Sg~Alk~~~~ 347 (451) T protein:vir:10 290 -----------------SEGDSGGLKTMQIEIPTEARKIILEILKKQIYESGQGLQ--QDTE---NFGNASGVALKFFYR 347 (451) T ss_pred -----------------CCccCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCccc--cccc---ccccccHHHHHHHHH Confidence 1122233555555445555544 7788889999999994 2222 2344443 322222 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccc Q lcl|NC_021072. 381 KFQKFIARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYF 460 (533) Q Consensus 381 kF~Kfi~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~ 460 (533) ..-.-+.+.+..|...+.++|+.=+-.-|+ .+|. .|.+.|...---.+ .+.++++..+ +| .+ T Consensus 348 ~l~~k~~~k~~~f~~~l~~~~~li~~~~~~---~d~~----~i~i~f~~~~p~n~-------~e~~~~~~kl---~g-~i 409 (451) T protein:vir:10 348 KLELKSGLLETEFRTSFDKLIKAILYFLGV---TDYK----KIQQTYTRNMMSND-------LEDADIATKS---VG-II 409 (451) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCC---CCcc----ceeEEecCCCCCCH-------HHHHHHHHHH---hc-cC Confidence 233345666666666666666544433343 3554 46677754432222 2344555555 35 49 Q ss_pred cHHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCC Q lcl|NC_021072. 461 SIDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADD 514 (533) Q Consensus 461 S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d 514 (533) |.+++++. |...+. .+++.++|++|.... .. +...+++. -+| T Consensus 410 S~et~~~~-~p~v~d-~~~e~~~~~ee~~~~-~~----~~~~~~~~-----~~~ 451 (451) T protein:vir:10 410 PTKIILRH-HPWVDD-VEEAEKLYLEEKKIQ-AS----KVSDDYNN-----FTE 451 (451) T ss_pred chHHHHHh-CCCCCC-HHHHHHHHHHHHHHH-HH----HHHhhcCC-----CCC Confidence 99999977 555442 334444444443322 11 11111111 111 No 190 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=50.79 E-value=0.6 Score=21.80 Aligned_cols=439 Identities=12% Similarity=0.067 Sum_probs=182.5 Q ss_pred cccccCCCCCCCCCcccceeecccc-cccccchh--hhhhHHHHHHHHHHhhh-hcchhhhHHHHhhcceeeecCCCceE Q lcl|NC_021072. 15 KVPKGPSFVQKDSMDGSQPIVGGGY-YGYSVDFD--GTVRNEYELITRYREMV-LQPECDSAVDDIVNETICGNFDDVPV 90 (533) Q Consensus 15 ~~~~~~s~~~~~~~dg~~~~~~~~~-~~~~~~~~--~~~~~~~~LI~~YR~m~-~~pEvd~AvdeIvneaiv~d~~~~~v 90 (533) .-+...-+.|. ..-|.+...+-.. +..+-..+ ..++ ..+.++.|++|. .++.|.++++-+...+.- .+. T Consensus 1 ~~~~~~~~~p~-~~~g~~~~~~~~~~~~~~~~~e~~~~lr-~~~~~~ly~~m~e~D~~i~s~l~~rk~av~~-----~~w 73 (469) T protein:vir:10 1 MTERVKTAAPV-SEAGYVFGSGVVDGWTVWDPFEQTPELQ-WPQSVAVYSRMDNEDSRVTSLLEAISLPIRS-----TPW 73 (469) T ss_pred CCCcccCCCCc-cchhhhhhcccccchhhccccccccccc-cccchHHHHHHHhhChHHHHHHHHHHHHHhc-----CCc Confidence 22222222222 1222211110011 11111111 1222 236688999995 699999999999876553 223 Q ss_pred EEEeccCCCcHHHHHHHHHHHHHHHHHh-------------cchhhhhHHHHhhhhcCceeeeeeecCCC----CCCCeE Q lcl|NC_021072. 91 EVELSNLKQSDKIKKLIREEFAEILRLL-------------DFENRSYEIFRRWYVDGRLFYHKVIDPKN----PRGGLT 153 (533) Q Consensus 91 ~v~l~~~~~S~~ik~~I~eeF~~i~~lL-------------~f~~~~~~~fR~WYvDGri~~hkvid~~~----~~~gI~ 153 (533) .|. --+.++.+.+.+.+......... .+.....+++-..+.-|--++++|..... ..-.+. T Consensus 74 ~v~--p~~~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG~~~~~ 151 (469) T protein:vir:10 74 RIR--ANGASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDGRFWLR 151 (469) T ss_pred eEe--cCCCCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCCceeee Confidence 332 22335666666666544332111 23344445555566678899999986432 122355 Q ss_pred EEEEcChhhceehhhccCCCcC-ceeEEeccceeeccchhceeccccccccccCCcceeccc-hhhccccccccCCCCcc Q lcl|NC_021072. 154 ELRYIDPRKIRKVTEYQQKRPE-QLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATD-SVTYCHSGIQDLNKNMT 231 (533) Q Consensus 154 elr~lDP~~i~~vr~~~~~~~~-~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~d-ai~y~hsGl~d~~~~~i 231 (533) .|...+|++|.+ + .-+++ +.....+.... .......| .....+.-||.. .+++.|.. ...+... T Consensus 152 ~l~~rp~~~i~~---~-~~~~~~~l~~~~~~~~~--~~~~~~~~------~~~~~~~~lp~~k~i~~~~~~--~~g~p~g 217 (469) T protein:vir:10 152 KLAPRPQWTISK---F-NVAPDGGLESIEQIAPP--ARTRGSLY------VANIAPPEIPVNRLVVYTRNK--RPGQWQG 217 (469) T ss_pred eeeecCccccee---e-eeccCCceeeeeecCcc--cccccccc------cCCCCccccccCcEEEEEecC--CCCCccc Confidence 666666666632 1 11111 11111110000 00000000 011123334433 34444432 1223455 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHh-cCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccch Q lcl|NC_021072. 232 LSHLHKAIKAVNQLRMIEDSLVIYRLS-RAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMS 310 (533) Q Consensus 232 ~syL~~AiK~~NqLrm~EDalVIyRi~-RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~ms 310 (533) .|-|.+|..+|-=.+......+.+=-. =.|-| |...+.|.-...| +-|.+++..+++-- ..|=| + T Consensus 218 ~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~-vgky~~~a~~~ek--~~l~~a~~~~~~g~----~a~~i------i- 283 (469) T protein:vir:10 218 KSILRSAYKHWLLKDKLLRIEAATAERNGMGIP-VGTASSATDEDEV--RKMAALARSVRGGI----NAGVG------L- 283 (469) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHcCCcce-EEecCCCCCHHHH--HHHHHHHHHHhcCC----ceEEE------c- Confidence 688999988887666666555554332 34554 5556666544433 33333444443210 01100 1 Q ss_pred hHhhhcccccCCCCccceeecCCCCCc-chHHHHHHHHHHHHHhcCCCccccCCC-CcccccchhhhhHHhhhHHHHHHH Q lcl|NC_021072. 311 MLEDFWLPRREGGRGTEISTLPGGQNL-GELEDVKYFQKKLYKALNVPSSRLETE-TTFNIGRAAEITRDEVKFQKFIAR 388 (533) Q Consensus 311 mlEDywLpRReggrgTEIsTLpGg~nL-gei~DV~YF~~kLy~aL~VP~sRl~~~-~~~~~g~~~eItRDElkF~Kfi~r 388 (533) ..|++|..+..+.+. .-..=++|..+++-+++--..=-.+.+ |+.++|....=. |...++. T Consensus 284 ------------p~~~~ie~~ea~g~~~~~~~li~~~d~~Isk~iLG~tlTs~~~gGS~a~~~vh~ev-----~~d~~~s 346 (469) T protein:vir:10 284 ------------AQGQILELLGVSGNLPDIRRAIEGHDRSIALSGLAHFLNLDGKGGSYALASVLEDP-----FTQAVHA 346 (469) T ss_pred ------------cCCceEEEeecCCCchHHHHHHHHHHHHHHHHHhcccccccCccchhhHHHHHHHH-----HHHHHHH Confidence 146778877653222 222336778777766664332111212 223333221112 3445555 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccc-cccHHHHHH Q lcl|NC_021072. 389 LRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGK-YFSIDYMRR 467 (533) Q Consensus 389 Lr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGk-y~S~~~i~k 467 (533) ..+.++..+..-|-..|+.=| |......-+|.|..- | .+.+...+++..+..+ ++... -++.+|+++ T Consensus 347 Da~~i~~tln~~li~~l~~lN------~g~~~~~P~~~~~~~----e-~~~~~~a~~i~~l~~~-G~~~~~~~~~~~~~e 414 (469) T protein:vir:10 347 YATSICRIANQHIIEDLVDIN------FGVDTPAPVLTFDPI----G-SRQDLTAAAVKLLYDA-GVFDDDPAVKRAIRQ 414 (469) T ss_pred HHHHHHHHHHHHHHHHHHHhc------CCCCCCccEEEecCC----C-CcHHHHHHHHHHHHhc-CCccCccccHHHHHH Confidence 556666666543333333332 222223345555321 1 1233344444444432 11111 367788886 Q ss_pred HHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccccccCCccccchhcC Q lcl|NC_021072. 468 QVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDAKRGEF 533 (533) Q Consensus 468 ~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 533 (533) . +++...+-.+.. .+.+.++. .|.....|.. .. .+.+..++...+.+..+++ T Consensus 415 ~-~gip~~~~~~~~--~~~~~~~~---~~~~~~~~~~---~~-----~~~~~~~~~~~~~~~~~~l 466 (469) T protein:vir:10 415 R-FNLPSELNDTPS--AEPEEPAA---VPNQSAAPAR---TR-----SSGNADARARAPKADQGVL 466 (469) T ss_pred H-hCCCCCCCCccc--ccchhccc---CCCCCccccc---cC-----CCCCcccccccCCChHHhh Confidence 5 777533222111 11111111 0110000100 00 0001111111112222222 No 191 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=45.32 E-value=0.77 Score=21.19 Aligned_cols=470 Identities=13% Similarity=0.107 Sum_probs=183.7 Q ss_pred CCccccceeeecccccc--ccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhh-cchhhhHHHHhh Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVP--KGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVL-QPECDSAVDDIV 77 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~--~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~-~pEvd~AvdeIv 77 (533) |+-.-= +++....+. ++..-..-.+..+......+ ++..-..+..-. +-..| +.|+. .|-+..||+.+. T Consensus 1 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~p~~-~~~~L----~~~~e~~~~~~~~i~~~~ 72 (651) T protein:vir:99 1 MTDTTG--ETQETKVHVEGLGGEADLAKSPNSTQIPDHR-IQSHNVGVNPPY-NPDRL----AAFLELNETLATGIRKKS 72 (651) T ss_pred CCCccc--eeeeeEEEeecccccccccccccccccchhh-hcccCCCCCCCC-CHHHH----HHHHhcChHHHHHHHHHh Confidence 332100 000000000 00000000111221111112 211111222211 23333 45544 788888998888 Q ss_pred cceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHH-------HhcchhhhhHHHHhh----hhcCceeeeeeecCC Q lcl|NC_021072. 78 NETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILR-------LLDFENRSYEIFRRW----YVDGRLFYHKVIDPK 146 (533) Q Consensus 78 neaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~-------lL~f~~~~~~~fR~W----YvDGri~~hkvid~~ 146 (533) +.....-.+=.| ..+.+....++..+++..+.+..... -++++....+++++- -+-|.-|+..+-+. T Consensus 73 ~~iag~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~- 150 (651) T protein:vir:99 73 RYEVGFGFDLVP-AQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDI- 150 (651) T ss_pred hhhhccCceeee-cccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcC- Confidence 775542211111 11222333455555555444433222 234555555666543 33477777765543 Q ss_pred CCCCCeEEEEEcChhhceehhhcc----------CCCcCceeEEe-----------ccceeeccchhc------------ Q lcl|NC_021072. 147 NPRGGLTELRYIDPRKIRKVTEYQ----------QKRPEQLRGED-----------INTQLTQKAAEY------------ 193 (533) Q Consensus 147 ~~~~gI~elr~lDP~~i~~vr~~~----------~~~~~~~~~~~-----------~~~~~~~~~~e~------------ 193 (533) .+-+..|.++||..+|.-+... ...+|.+.... .+..+......| T Consensus 151 --~g~pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 228 (651) T protein:vir:99 151 --EGRPVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGD 228 (651) T ss_pred --ccchhhhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCc Confidence 2337788889988876422110 01111110000 000000000000 Q ss_pred ---eec-cccccc---------------cccCCcceeccchhhccccccccCCCCccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 194 ---YLY-NPKGLK---------------NSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVI 254 (533) Q Consensus 194 ---~~y-~p~~~~---------------~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~AiK~~NqLrm~EDalVI 254 (533) ..+ ...+.. ...+....++.+-+-+... .-..++-.++|-|..|+..+.....+++...- T Consensus 229 ~v~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~eViHir~-~~~~~g~~G~spl~~a~~~i~~a~~a~~~~~~ 307 (651) T protein:vir:99 229 EPTIRYREDEESEREPIFVDRETGDVTTGDANGLENRPANELIFIPN-PSILEDDYGVPDWVSAIRTISADEAAKDYNRD 307 (651) T ss_pred ceeEEeccCcceeeeeecccceeeeEEEcCCCceeEecccceEEecC-CCCCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 000 000000 0001111222222222110 00123346789999999999998888887664 Q ss_pred HHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhh---h-cccccCCCCccceee Q lcl|NC_021072. 255 YRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLED---F-WLPRREGGRGTEIST 330 (533) Q Consensus 255 yRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlED---y-wLpRReggrgTEIsT 330 (533) |=---+--+-|+.+..+.|-+..++. +++.+++.. . ..|+ .| .|+- . +++. +.|.++.. T Consensus 308 ~f~NG~~p~gil~~~~~~ls~e~~~~-lr~~~~~~~-----~-nagk------~~-vL~~~~~~~~~~~---~~g~~~~p 370 (651) T protein:vir:99 308 FFDNDTIPRMVIKVTGGELSEESKRD-LRQMLNGLR-----E-ESHR------AV-VLEVEKFQSQLDE---DVEIELEP 370 (651) T ss_pred HHhccCCCceEEEecCCCCCHHHHHH-HHHHHHHHh-----c-cCCc------eE-Eeecccccccccc---cCCceEEE Confidence 44444666778877656665555444 344444321 1 1121 11 1100 0 0000 12333433 Q ss_pred cCCCCCc---ch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHH-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 331 LPGGQNL---GE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQK-FIARLRKRFSELFMDLLKTQL 405 (533) Q Consensus 331 LpGg~nL---ge-i~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~K-fi~rLr~~fs~if~d~Lk~qL 405 (533) |- ... .| ++=-+|....+-++.+||.+.|+..+..+...+.+..+. |.. -+.-+..++...+.. .| T Consensus 371 ls--~~~~~D~qfle~r~~~~~eIa~afgVPp~~lG~~~~~~~sn~E~~~~~---f~~~tL~P~~~~ie~eln~----kL 441 (651) T protein:vir:99 371 MG--QGISEEMDFRQFREKNEHEIAKVLEVPPVKIGVTDSANRSNSDQQDKD---FALEVIQPEQHTFAEWLYQ----II 441 (651) T ss_pred cC--cCchhhHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCcccHHHHHHH---HHHHHHHHHHHHHHHHHHH----hh Confidence 31 111 11 222256677799999999999976555444444444433 543 344555544444433 33 Q ss_pred HhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHHHHHHHHHHH Q lcl|NC_021072. 406 ILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQEIKEIDKQID 485 (533) Q Consensus 406 ilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDeeI~e~~kqi~ 485 (533) +.. .+.. ....|.++|..+ ++-..+ ...|.+.++.+-. .-++|.+-+++ .++|..-+ T Consensus 442 l~~-----~e~~-~~~~i~~ef~~~----~llr~D-~~~~~e~~~~~i~--~G~~T~NE~R~-~lglppi~--------- 498 (651) T protein:vir:99 442 HQQ-----ALGV-TDWTIEYELRGA----DQPKQE-AQLAEQRVRAMRL--AGVGLVDEARE-ELGLDPLG--------- 498 (651) T ss_pred cCc-----cccc-cCceEEEEeccc----hhhhcc-HHHHHHHHHHHHh--CCCcCHHHHHH-HhCCCCCC--------- Confidence 222 2111 112466666633 222222 1344554444332 24778888774 46764311 Q ss_pred HhhhcCCCCC---C--CcccccCC-CCCCCCCC-CCccccccccCCccccchhcC Q lcl|NC_021072. 486 SEREAGLIVD---P--MAEMDPAM-DPGNAPPA-DDMSAQEGPAVDAGDAKRGEF 533 (533) Q Consensus 486 ~E~~~~~~~~---p--~~~~~~~~-~~~~~~~~-~d~~~~~~~~~~~~~~~~~~~ 533 (533) ++.....+.. . .....++. +...++|. .+...++..++.+.-. ..|. T Consensus 499 ~~~gd~~l~~~~~~~~g~~~~gge~~~~~~~~~~~~~~~~e~~~~~~~~~-~~e~ 552 (651) T protein:vir:99 499 EPYGEMTLSEFEAEVAGDVAGGGETEAVHEPPEENKIGEREWDTVKSELT-TKDP 552 (651) T ss_pred CccccccccccccccccccccCCCCcccccCccccccccchhhhhhhhhc-ccch Confidence 0000000000 0 00000111 11111111 1122222222222111 1111 No 192 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=38.74 E-value=1.1 Score=20.46 Aligned_cols=455 Identities=13% Similarity=0.080 Sum_probs=175.2 Q ss_pred cccccccCCCCCCCCCcccceeecccccc-cccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcceeeecCCCceEE Q lcl|NC_021072. 13 AKKVPKGPSFVQKDSMDGSQPIVGGGYYG-YSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNETICGNFDDVPVE 91 (533) Q Consensus 13 ~~~~~~~~s~~~~~~~dg~~~~~~~~~~~-~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvneaiv~d~~~~~v~ 91 (533) --......|-.+|...-.--..+..++.+ .+...+..++ .-..++.|+.|..++.|.++++-+...+..++ -. T Consensus 1 ~~~~~~~~~gl~p~rl~~i~~~~~~~~~~~~~~~~~~~Lr-~~~~~~ly~~m~~D~hi~s~l~~Rk~av~~~~-----w~ 74 (488) T protein:vir:95 1 MADITETQESLPPFRMGEVGSLGLKVKNGRIYEEPRQALR-FPESIKTFQLMMRDPAVAASVNIIKMFVRKVN-----WR 74 (488) T ss_pred CCCccccCCCCCHHHHHHHHHHhhccccchhhccchhhhc-ccchHHHHHHHhhChHHHHHHHHHHHHHhcCC-----ce Confidence 11122233333333222111112222222 2223334444 35678899999999999999999988776432 22 Q ss_pred EEeccCCCcHHHHHH-HHHHHHHHHHHhc--chhhhhHHHHhhhhcCceeeeeeecCCCCCCC------------eEEEE Q lcl|NC_021072. 92 VELSNLKQSDKIKKL-IREEFAEILRLLD--FENRSYEIFRRWYVDGRLFYHKVIDPKNPRGG------------LTELR 156 (533) Q Consensus 92 v~l~~~~~S~~ik~~-I~eeF~~i~~lL~--f~~~~~~~fR~WYvDGri~~hkvid~~~~~~g------------I~elr 156 (533) |.- ..+.++..|.. +.+.+++.++-+. |..-..+++ ..+.-|--++++|........+ +.++. T Consensus 75 v~p-~~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~~~~~i~ 152 (488) T protein:vir:95 75 FVP-PKGKEQDPKMLERADFFNSLMDDMEHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLIGWAKLP 152 (488) T ss_pred Eec-CCCCchhHHHHHHHHHHHHHHhccCccHHHHHHHHH-HhhcccceeeeeeeeccccccccccccccCCeeeeeeee Confidence 221 12223444432 3556666666553 334334433 2445577777777754322222 33333 Q ss_pred EcChhhceehhhccCCCcCceeEEeccceeeccchhcee-ccccccccccCCcceeccch-hhccccccccCCCCccchh Q lcl|NC_021072. 157 YIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYL-YNPKGLKNSTNQGMKIATDS-VTYCHSGIQDLNKNMTLSH 234 (533) Q Consensus 157 ~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~-y~p~~~~~~~~~~~kI~~da-i~y~hsGl~d~~~~~i~sy 234 (533) ..+|.+++. +.-+++..... .+........ ..+.........++.||..- ++|.|.. ...+....|- T Consensus 153 ~Rpq~~~~~----f~~d~d~~l~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~kfi~~~~~~--~~g~p~g~gL 221 (488) T protein:vir:95 153 IRNQSTLDK----WYFDEDFRRVT-----GVRQNLRNVSHIAGAINLGERPLTRKLPRAKFMLFKYDD--EYGNPEGRSP 221 (488) T ss_pred ecCcccccc----eeeccCCCcee-----ecccccccccccccccccccccccccccccceEEEeecC--CCCccchhhH Confidence 334433321 11111110000 0000000000 01111111223345566553 3444422 2233555688 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-hcCccceEEE----ccCC-CCchHHHHHHHHHHHHhcccEEEeeCCCCcccccccc Q lcl|NC_021072. 235 LHKAIKAVNQLRMIEDSLVIYRL-SRAPERRIFY----IDVG-NLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKF 308 (533) Q Consensus 235 L~~AiK~~NqLrm~EDalVIyRi-~RAPeRrvfy----IDvG-nlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~ 308 (533) |.+|..+|-=.+...-..+.+=- .=.|-.-..| .+.- .--+.+..+.++.+....+.. ...|-|-...-- T Consensus 222 lr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~----~~ag~iiP~g~~ 297 (488) T protein:vir:95 222 LLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIAN----DRAGLIWPRYID 297 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhcc----chhheeeccccc Confidence 99998888655555444443311 0022222222 1111 111222344455544444331 122211100000 Q ss_pred chhHhhhcccccCCCCccceeec--CCCCCcchHHHHHHHHHHHHHhcCCCccccCCC--CcccccchhhhhHHhhhHHH Q lcl|NC_021072. 309 MSMLEDFWLPRREGGRGTEISTL--PGGQNLGELEDVKYFQKKLYKALNVPSSRLETE--TTFNIGRAAEITRDEVKFQK 384 (533) Q Consensus 309 msmlEDywLpRReggrgTEIsTL--pGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~--~~~~~g~~~eItRDElkF~K 384 (533) ...-+++ .|+..+ .|++...-..=++|..+..-+++--..--++.. |+.++|.. -.|+ |.. T Consensus 298 ~~~k~~~----------~e~~l~~~~~~~~~~~~~li~~~d~~Isk~iLGqtLT~~~~~~Gs~Al~~v----h~ev-~~~ 362 (488) T protein:vir:95 298 PDTKEDI----------FEFSLVSRQGAKAYDTGSIIDRYSKQIMMAFMSDVLAMGQSKYGSFSLADS----KTSL-LAM 362 (488) T ss_pred cccchhh----------hhhhccccccCCchhHHHHHHHHHHHHHHHHhccccccccCcchhhhHHHH----HHHH-HHH Confidence 0000111 122222 233333344458999999888874432222222 22333222 2222 334 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhcccccc-- Q lcl|NC_021072. 385 FIARLRKRFSELFMD-LLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFS-- 461 (533) Q Consensus 385 fi~rLr~~fs~if~d-~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S-- 461 (533) .++...+.++..|.. +++.-+.++. ......-++.|- ..|-.+.+.+.+++..|. +- |-.++ T Consensus 363 i~~aDa~~i~~tln~~li~~l~~~Nf-------g~~~~~P~~~~~----~~e~~Dl~~~ae~~~~L~---~~-G~~i~~~ 427 (488) T protein:vir:95 363 SVDILLKQIKNVINRDLVAQTYALNM-------WDDEEHVQITYD----DIETPDLEAIGSYIQKTV---AV-GALEVDK 427 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC-------CCCCCccEEEec----CcChhhHHHHHHHHHHHH---hC-CCccccH Confidence 455555666666653 3333333332 112222334443 223334444444554444 33 44454 Q ss_pred --HHHHHHHHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcccccCCCCCCCCCCCCccccccccCCccccchhc Q lcl|NC_021072. 462 --IDYMRRQVLKQTDQEIKEIDKQIDSEREAGLIVDPMAEMDPAMDPGNAPPADDMSAQEGPAVDAGDAKRGE 532 (533) Q Consensus 462 --~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 532 (533) .+|++++ +++...+-.+ +...+. .|.++ +..+....++....+....+...+....+.+ T Consensus 428 ~~~~~i~e~-~gip~~~~~e-------~~~~~~--~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~ 488 (488) T protein:vir:95 428 ELSNKLREH-IGLPPADESQ-------PVSEKL--SPNSQ--SRSGDGYKTAGEGTAKTPSAKDPSTANKANK 488 (488) T ss_pred HHHHHHHHH-hCCCCCCCCc-------cccccC--CCCCC--CCCCcccCCCcccCCcccccccchhhhhccC Confidence 5777755 6775332111 000110 11111 1111111111110011111111111111111 No 193 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=35.51 E-value=1.2 Score=20.10 Aligned_cols=316 Identities=13% Similarity=0.110 Sum_probs=126.6 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) ++..-|+|- .|-..++..+. .-.+....--.+.-+ +...|-+.+|.-..| .+|| T Consensus 18 ~~~~~f~~~--------------~~~~~~~~~y~-~~~~~~~~~~~epp~-~~~~la~l~~~~~~h---~~~i------- 71 (345) T protein:vir:37 18 INDRTFSLN--------------EISASPALDYV-GIGFDENYNCYLPPV-NRHALAKLPHQNAQH---GGIL------- 71 (345) T ss_pred ceeEEeecC--------------Ccccccchhhh-hhhhcCCccccCCCC-CHHHHHHHhhccccc---ccce------- Confidence 122222221 00000110000 000000000001111 133344444444333 1111 Q ss_pred eeecCCCceEEEEec-cCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEEEcC Q lcl|NC_021072. 81 ICGNFDDVPVEVELS-NLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELRYID 159 (533) Q Consensus 81 iv~d~~~~~v~v~l~-~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr~lD 159 (533) ....| .+...+. +-..| +.+|. +++..|++-|.-|+.++-+. .+.++.|..++ T Consensus 72 -~~k~n--~l~~~~~Pn~~lt-------~~~f~-------------~~~~d~ll~Gnay~~~~rn~---~G~~~~L~pl~ 125 (345) T protein:vir:37 72 -HSRAN--MVSSLYEGGKALS-------RMDMR-------------ALCLNLIQFGDVGLLKVRNG---FGQVVRLVPLS 125 (345) T ss_pred -eeech--HHHhhccCCCCCC-------HHHHH-------------HHHHHHHhcCCeEEEEEEcC---CCcEEEEEEEc Confidence 10000 0000000 00011 11122 24456778899999988753 56799999999 Q ss_pred hhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCccchhHHHHH Q lcl|NC_021072. 160 PRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAI 239 (533) Q Consensus 160 P~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~Ai 239 (533) |..++... +.+. .+.............+. +.+...++|+.-...- .-.++|-+..|+ T Consensus 126 ~~~vr~~~-----d~~~--~~~~~~~~~~~~g~~~~-------~~~~dVihir~~~~~~---------~~~Gls~~~~a~ 182 (345) T protein:vir:37 126 SLYLRVRK-----DGGY--SYLMKKSLYDTAQEIYR-------YDAKDIIFIKLYDPMQ---------QVYGSPDYVGGI 182 (345) T ss_pred CceeEEEE-----eCCe--eEEEEEeEecCCceEEE-------EccccEEEecCCCCCC---------CcccccHHHHHH Confidence 99886421 1111 11110000000001111 1222333333221111 113457777777 Q ss_pred HHHHHHHHHHHHHHHHH--Hhc--CccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhh Q lcl|NC_021072. 240 KAVNQLRMIEDSLVIYR--LSR--APERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDF 315 (533) Q Consensus 240 K~~NqLrm~EDalVIyR--i~R--APeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDy 315 (533) +.+.--. +.-.|| ..+ |--.-|+|+.-++|.+. +.+=+++-+..++-. .+.++ |-++ T Consensus 183 ~si~l~~----~a~~~~~~~f~NG~~p~~Il~~~d~~l~~e-~~~~lk~~~~~~~g~----------~n~~~-~~i~--- 243 (345) T protein:vir:37 183 QSALLNS----DATVFRRRYFSNGAHMGFILYSTDPDLTEE-MEEEIARKISESKGV----------GNFRS-MFVN--- 243 (345) T ss_pred HHHHHHH----HHHHHHHHHHhccCCcceEEEecCCCCCHH-HHHHHHHHHHHhcCc----------ccccc-eEEE--- Confidence 7654322 222332 222 34567777765566544 333344433333211 11111 1111 Q ss_pred cccccCC-CCccceeecCCCCCcchHHHHH---HHHHHHHHhcCCCccccCC--CCcccccchhhhhHHhhhHH-HHHHH Q lcl|NC_021072. 316 WLPRREG-GRGTEISTLPGGQNLGELEDVK---YFQKKLYKALNVPSSRLET--ETTFNIGRAAEITRDEVKFQ-KFIAR 388 (533) Q Consensus 316 wLpRReg-grgTEIsTLpGg~nLgei~DV~---YF~~kLy~aL~VP~sRl~~--~~~~~~g~~~eItRDElkF~-Kfi~r 388 (533) --+| ..|.++..| +.+--+++=++ +-.+...++.+||...++- .++..++..++..+. |. .-|.- T Consensus 244 ---~p~g~~~G~~~~pl--s~~~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~~~~~~~~e~~~~~---f~~~~l~P 315 (345) T protein:vir:37 244 ---IANGHPDGLKVIPI--GDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREV---YHYDEVMP 315 (345) T ss_pred ---cCCCcccceEEEEc--cCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCccCCCCCCcccHHHHHHH---HHHHHHHH Confidence 1111 124444443 44333333233 5677799999999999863 334445555554443 43 34677 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHH Q lcl|NC_021072. 389 LRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTE 435 (533) Q Consensus 389 Lr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E 435 (533) ++.+|...+...+. +.....+.|..++ .++ T Consensus 316 ~~~~ie~~ln~~~~----------------~~~~~~i~F~~~~-L~~ 345 (345) T protein:vir:37 316 LQEIIAETINQDPE----------------IKNLLKIKFREQN-FAK 345 (345) T ss_pred HHHHHHHHhhhhcc----------------CCCcceEEecchh-hcC Confidence 88888887765331 1222344444221 222 No 194 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=26.83 E-value=1.9 Score=19.05 Aligned_cols=336 Identities=13% Similarity=0.094 Sum_probs=138.0 Q ss_pred CCccccceeeeccccccccCCCC--CCCCCcccce--eecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHh Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFV--QKDSMDGSQP--IVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDI 76 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~--~~~~~dg~~~--~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeI 76 (533) |..++---.-..+.-...+-||- |-+-.++... ...-.+.+...-.+.-+ +...|-+.+|.-..| .+||.-- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~epp~-~~~~La~l~~~n~~h---~~~i~~k 76 (348) T protein:vir:26 1 MTEQLIHSHTTDGTESKSVYSFDPNPEPVDTNSWMTRYCELFYNDFDDYWEPPI-SLKGLAEIANANGYH---GSLLKAR 76 (348) T ss_pred CCccccchhhccccCCceEEEecCCCeeecCcchHHHHHHHHhcCCCccccCCC-CHHHHHHHHhhhhhh---hhhHhhh Confidence 55433211100000000111221 1111111100 00000001000001111 245555566655444 4455433 Q ss_pred hcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeeecCCCCCCCeEEEE Q lcl|NC_021072. 77 VNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPKNPRGGLTELR 156 (533) Q Consensus 77 vneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvid~~~~~~gI~elr 156 (533) .|-. ..+.- | +-..| +.+|. +++..|.+-|.-|+.++-+. .+.+++|. T Consensus 77 ~N~l-~~~~~--P------n~~~t-------~~~f~-------------~~~~d~ll~Gnay~~~~rn~---~G~~~~L~ 124 (348) T protein:vir:26 77 ANYV-AGRFM--N------GGGLP-------MYKMN-------------SACWDYFGLGMSAFVKIRSY---LKNVIALE 124 (348) T ss_pred hhHH-hhccc--C------CCCCC-------HHHHH-------------HHHHHHHhcCCeEEEEEEcC---CCcEEEEE Confidence 3331 11000 0 10111 11232 33445566699999988764 55699999 Q ss_pred EcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCccchhHH Q lcl|NC_021072. 157 YIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLH 236 (533) Q Consensus 157 ~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~ 236 (533) .|+|..++... ++..+... .. .. ...+.+.+.++|+.-...- .-.++|-+. T Consensus 125 ~l~~~~v~~~~-------d~~~~~~~-----~~-g~-------~~~f~~~dIiHir~~~~~~---------~~~Gls~~~ 175 (348) T protein:vir:26 125 PLPMVHMRKRK-------NGDFVQLL-----RN-NE-------QKVFKAKDVIFIPQYDPQQ---------QIYGLPDYL 175 (348) T ss_pred EecCceeEeee-------cCcEEEEE-----ec-Ce-------EEEEcCccEEEEcCCCCCC---------CcccccHHH Confidence 99999886521 11111100 00 00 0112333444554211111 123457777 Q ss_pred HHHHHHHHHHHHHHHHHHHHH--hc--CccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhH Q lcl|NC_021072. 237 KAIKAVNQLRMIEDSLVIYRL--SR--APERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSML 312 (533) Q Consensus 237 ~AiK~~NqLrm~EDalVIyRi--~R--APeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msml 312 (533) .|++.+.. -.+.-.||. .+ |--.-|+|+.-++|-+..+++ +++-+...+ -+ | +.++ +-+ T Consensus 176 ~a~~si~l----~~~a~~~~~~~f~NGa~pg~Il~~~~~~ls~e~~~~-lk~~~~~~~--G~-----~---n~~~-~~v- 238 (348) T protein:vir:26 176 GSIQSSLL----NRDATLFRRRYYLNGAHMGFIFYATDPNLSEADEKA-LKEKIASSK--GI-----G---NFRS-MFV- 238 (348) T ss_pred HHHHHHHH----HHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHH-HHHHHHHhc--Cc-----c---cccc-eeE- Confidence 77776643 333444432 22 334467777555665444433 443333321 11 1 1111 111 Q ss_pred hhhcccccCCCCccceeecCCCCCcchHHHHHHH-HHHHHHhcCCCccccCC--CCcccccchhhhhHHhhhHHH-HHHH Q lcl|NC_021072. 313 EDFWLPRREGGRGTEISTLPGGQNLGELEDVKYF-QKKLYKALNVPSSRLET--ETTFNIGRAAEITRDEVKFQK-FIAR 388 (533) Q Consensus 313 EDywLpRReggrgTEIsTLpGg~nLgei~DV~YF-~~kLy~aL~VP~sRl~~--~~~~~~g~~~eItRDElkF~K-fi~r 388 (533) +.--+...|.+++-|.-...=.|+-.++=| ..-+.++.+||...++- .++-+++..++..+. |.+ -+.- T Consensus 239 ----l~~~g~~~Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~~~~~sn~e~~~~~---f~~~~l~P 311 (348) T protein:vir:26 239 ----NIPNGKEKGIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQGANVPDPLKVSQV---YDFYEVIP 311 (348) T ss_pred ----EcCCCCccceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHHccccCCCCCccccHHHHHHH---HHHHHHHH Confidence 111111235566655433323344444444 44599999999998853 334455666665655 553 3566 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021072. 389 LRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNT 451 (533) Q Consensus 389 Lr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~ 451 (533) ++.+|...+.+-| ... ++ -.++|+|..+ .+|.+..+- T Consensus 312 ~~~~ie~~ln~~l----~~~-----~~-----~~~~fdl~~~------------~e~~~~~a~ 348 (348) T protein:vir:26 312 VCKRFMDAVNNDP----EIP-----DN-----LKLKFNLNPG------------VESANGSAV 348 (348) T ss_pred HHHHHHHHHhhhh----CCC-----Cc-----cEEEEecCcc------------cccchhhcC Confidence 6666665554432 111 11 1233443321 122222222 No 195 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=24.47 E-value=2.2 Score=18.74 Aligned_cols=360 Identities=14% Similarity=0.132 Sum_probs=138.7 Q ss_pred CCccccceeeeccccccccCCCCCCCCCcccceeecccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcce Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPKGPSFVQKDSMDGSQPIVGGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNET 80 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvnea 80 (533) |. |||-... ..++.. + .+.+... .+.+. .+ .| +.+-|.+||+-|++.+ T Consensus 1 Mg--~f~~~~~----~~~~~~--~--~~~~~~~----~~~~~------~~--------~~----~~~~v~~~v~~IA~~i 48 (378) T protein:vir:94 1 MN--LFGKVVS----FSRGKL--N--NDTQRVT----AWQNE------AV--------EY----TSAFVTNIHNKIANEI 48 (378) T ss_pred CC--ccccchh----cccccc--c--CCcceee----eeccc------hh--------HH----HHHHHHHHHHHHHhhh Confidence 32 3332111 011111 1 1111010 01100 00 11 2244888999998875 Q ss_pred eeecCCCceEEEEeccC-CC-cHHHHHHHHHHHHHHHHHhcch----hhhhHHH----HhhhhcCceeeeeeecCCCCCC Q lcl|NC_021072. 81 ICGNFDDVPVEVELSNL-KQ-SDKIKKLIREEFAEILRLLDFE----NRSYEIF----RRWYVDGRLFYHKVIDPKNPRG 150 (533) Q Consensus 81 iv~d~~~~~v~v~l~~~-~~-S~~ik~~I~eeF~~i~~lL~f~----~~~~~~f----R~WYvDGri~~hkvid~~~~~~ 150 (533) -- .|+.+--... +. .+.....+. ..++++|+.. -.+.++. ..+..+|.-|.+++.+. ..+ T Consensus 49 A~-----lp~~~~~~~~~~~~~~~~~~~~~---~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~--~~g 118 (378) T protein:vir:94 49 TK-----VEFNHVKYKKSDVGSDTLISMAG---SDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDD--NTG 118 (378) T ss_pred hh-----CceeeEEEcccCccccccccccc---chHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeC--CCc Confidence 42 4544322111 11 111111111 1344555432 2444444 44677899999988864 223 Q ss_pred CeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCc Q lcl|NC_021072. 151 GLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNM 230 (533) Q Consensus 151 gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~ 230 (533) .+..+ .|...+ . . |.+ ...++|+- . .++.. T Consensus 119 ~~~~l---~p~~~~------------~---------------~--~~~-------~diiH~~~--------~---~~~~~ 148 (378) T protein:vir:94 119 ELLDL---LFADDK------------K---------------E--YKP-------EELVRLTS--------P---FYINE 148 (378) T ss_pred eEEEE---EecCCe------------e---------------E--eee-------eeeEEecC--------c---CCccc Confidence 33333 221110 0 0 111 12223321 0 11223 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccch Q lcl|NC_021072. 231 TLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMS 310 (533) Q Consensus 231 i~syL~~AiK~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~ms 310 (533) ++|-|+.|.+.+.. + +..+--+-+..++ |.|....+.+....+...|+++.-.+.+ |. .+ T Consensus 149 g~s~l~~~~~~i~~------~-----~~~~~~~gil~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~-g~------~~- 208 (378) T protein:vir:94 149 DTSILDNALASIQT------K-----LEQGKLRGLLKIN-AFLDIDNTQEYREKALTTIKNMQEGSSY-NG------LT- 208 (378) T ss_pred hhHHHHHHHHHHHH------H-----HhcccccceeeeC-CcCCHHHHHHHHHHHHHHHHHhhccccc-cc------ce- Confidence 45777777765432 1 1112223444443 4444444444444444444433211111 11 11 Q ss_pred hHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHH Q lcl|NC_021072. 311 MLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLR 390 (533) Q Consensus 311 mlEDywLpRReggrgTEIsTLpGg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr 390 (533) .++ .|.+++-|.=.....++...+|-.+.+.++++||.+.|.. .+ +++-.+. |..+ .|+ T Consensus 209 vl~----------~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVP~~~l~~--~~----se~~~~~---f~~~--tL~ 267 (378) T protein:vir:94 209 PVD----------NKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLG--TA----SQEQQIY---FYNS--TII 267 (378) T ss_pred ecC----------CCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcC--Ch----HHHHHHH---HHHH--HHH Confidence 111 1233333332233445677888899999999999998842 11 1121111 3322 232 Q ss_pred HHHHHHHHHHHHHHHHhccCCCHhHHhhhh---hceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHH Q lcl|NC_021072. 391 KRFSELFMDLLKTQLILKGVMSLEEWDEMK---EHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRR 467 (533) Q Consensus 391 ~~fs~if~d~Lk~qLilkgi~t~eew~~~~---~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k 467 (533) -. ...+.+.|. ..+++++|+..-. ..+.+.|. +..+... -+..|++.+..+-.- -++|.+-++. T Consensus 268 P~-~~~ie~~l~-----~~Ll~~~er~~g~~~~~~~~~~f~----~~~l~~~-d~~~~~~~~~~~~~~--G~~T~NE~R~ 334 (378) T protein:vir:94 268 PL-LIQLEKELT-----YKLISTNRRRVVKGNLYYERIIVD----NQLFKFA-TLKELIDLYHENING--PIFTQNQLLV 334 (378) T ss_pred HH-HHHHHHHHH-----hhcCChhHhhhhhhcccccceeec----chhhhhc-CHHHHHHHHHHHHhC--CCcCHHHHHH Confidence 21 122233333 3445666655421 22345554 3333322 223566666655443 3666666664 Q ss_pred HHhCCCHHHHHHHHHHHHHhhhcCCCCCCCcc--cccC---CCCCCCCCCCCcccc Q lcl|NC_021072. 468 QVLKQTDQEIKEIDKQIDSEREAGLIVDPMAE--MDPA---MDPGNAPPADDMSAQ 518 (533) Q Consensus 468 ~IL~~tDeeI~e~~kqi~~E~~~~~~~~p~~~--~~~~---~~~~~~~~~~d~~~~ 518 (533) +++|.+-+ . .+-.+-.-+.. .+++ ...++..+.++.+++ T Consensus 335 -~~gl~p~~--g---------GD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 335 -KMGEQPIE--G---------GDVYIANLNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred -HhCCCCCC--C---------CCeeeecccccccccchhhcCCcCCCCCCCCCCCC Confidence 34543211 0 00000000000 0000 001111222222222 No 196 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=23.51 E-value=2.3 Score=18.60 Aligned_cols=317 Identities=12% Similarity=0.091 Sum_probs=133.7 Q ss_pred CCccccceeeeccccccc----------cC--CCCCCCCCcccceee-----cccccccccchhhhhhHHHHHHHHHHhh Q lcl|NC_021072. 1 MSNQLFGFSLERAKKVPK----------GP--SFVQKDSMDGSQPIV-----GGGYYGYSVDFDGTVRNEYELITRYREM 63 (533) Q Consensus 1 ~~~~~fg~~i~~~~~~~~----------~~--s~~~~~~~dg~~~~~-----~~~~~~~~~~~~~~~~~~~~LI~~YR~m 63 (533) |+ +.++-+. .. |+..|-..++..+.. .+-|| +.-+ +...|-+.+|.- T Consensus 1 ~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~------epp~-~~~~la~~~~~~ 64 (345) T protein:vir:37 1 MK---------TNVKTDNKKGIVIAPINDRTFSLSEITASPALDYVGIGFDENYNCY------LPPV-NRHALAKLPHQN 64 (345) T ss_pred CC---------ccccccchhhhcCCCceEEEeecCCcccchhhcccceeeecCCccc------cCCC-CHHHHHHHhhcc Confidence 21 1111000 00 111111011100000 11111 1111 245566666665 Q ss_pred hhcchhhhHHHHhhcceeeecCCCceEEEEeccCCCcHHHHHHHHHHHHHHHHHhcchhhhhHHHHhhhhcCceeeeeee Q lcl|NC_021072. 64 VLQPECDSAVDDIVNETICGNFDDVPVEVELSNLKQSDKIKKLIREEFAEILRLLDFENRSYEIFRRWYVDGRLFYHKVI 143 (533) Q Consensus 64 ~~~pEvd~AvdeIvneaiv~d~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~lL~f~~~~~~~fR~WYvDGri~~hkvi 143 (533) ..| .+||.--.|... ... .| +-..| +.+|. +++..+.+-|.-|+.++- T Consensus 65 ~~h---~~~i~~k~n~l~-~~~--~P------n~~~t-------~~~f~-------------~~v~d~ll~Gnay~~i~r 112 (345) T protein:vir:37 65 AQH---GGILHSRANMVS-ATY--EG------GKALS-------KMEMR-------------ALCLNLIQFGDVGLLKVR 112 (345) T ss_pred hhh---cchhhhhhhHHh-hcc--CC------CCCCC-------HHHHH-------------HHHHHHHhcCCeEEEEEE Confidence 555 333322222110 000 00 11111 11222 234456777999999987 Q ss_pred cCCCCCCCeEEEEEcChhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhcccccc Q lcl|NC_021072. 144 DPKNPRGGLTELRYIDPRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGI 223 (533) Q Consensus 144 d~~~~~~gI~elr~lDP~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl 223 (533) +. .+.+++|..++|..++.. . ++...+......+.....-+. +.+...++|..-...-.. T Consensus 113 n~---~G~~~~L~pl~~~~vr~~-----~--d~~~~~~~~~~~~~~~g~~~~-------~~~~eViHir~~~~~~~~--- 172 (345) T protein:vir:37 113 NG---FGQVVRLVPLSSLYLRVH-----K--DGGYSYLMKKSLYDTAQEIYR-------YDAKDIIFIKLYDPMQQV--- 172 (345) T ss_pred CC---CCCEEEEEEecCceeEEe-----e--cCCeeEEEeeeeeccCceEEE-------EccccEEEEcCCCCCCCc--- Confidence 64 567999999999988641 1 111111111000000001111 122334444422111112 Q ss_pred ccCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHh----cCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCC Q lcl|NC_021072. 224 QDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLS----RAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANT 299 (533) Q Consensus 224 ~d~~~~~i~syL~~AiK~~NqLrm~EDalVIyRi~----RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~T 299 (533) .++|-+..|++.+. +-.+.-.||.. -|--.-|+|+.-++|.+..++. +++-+.+.+. T Consensus 173 ------~Gl~~~~~a~~si~----l~~~a~~~~~~~f~NGa~~~~Il~~t~~~l~~e~~~~-lk~~~~~~~g-------- 233 (345) T protein:vir:37 173 ------YGSPDYVGGIQSAL----LNSDATVFRRRYFSNGAHMGFILYSTDPDLTEEMEEE-IARKISESKG-------- 233 (345) T ss_pred ------ccchHHHHHHHHHH----HHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHH-HHHHHHHhcC-------- Confidence 23354555555432 22334444432 2555778888767877665544 3333333221 Q ss_pred CccccccccchhHhhhcccccCC-CCccceeecCCCCCcchH--HHH-HHHHHHHHHhcCCCccccCC--CCcccccchh Q lcl|NC_021072. 300 GEIKDDKKFMSMLEDFWLPRREG-GRGTEISTLPGGQNLGEL--EDV-KYFQKKLYKALNVPSSRLET--ETTFNIGRAA 373 (533) Q Consensus 300 Gev~~d~~~msmlEDywLpRReg-grgTEIsTLpGg~nLgei--~DV-~YF~~kLy~aL~VP~sRl~~--~~~~~~g~~~ 373 (533) | .+. +. -++.--+| ..|.+++.|.- +.-++ -++ .+-++-+.++.+||...++- .++-+++..+ T Consensus 234 ~--~n~-~~------~~i~~~~g~~~G~~~~pl~~--~~~d~qf~e~k~~~~~dI~~a~~VPp~liGi~~~~t~~~s~~e 302 (345) T protein:vir:37 234 V--GNF-RS------MFVNIAGGHPDGLKVIPIGD--TGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPL 302 (345) T ss_pred c--ccc-Cc------eeEecCCCCccceeEEEccC--ChhHHHHHHHHHHhHHHHHHHhCCCHHHhccccCCCCCcccHH Confidence 1 010 11 11221222 23455554433 22222 221 33455689999999888753 3344455555 Q ss_pred hhhHHhhhHHHH-HHHHHHHHHHHHHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHH Q lcl|NC_021072. 374 EITRDEVKFQKF-IARLRKRFSELFMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRN 443 (533) Q Consensus 374 eItRDElkF~Kf-i~rLr~~fs~if~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~ 443 (533) +..+. |.++ |.-++.+|...+...+. +.....+.|..+ ++++ T Consensus 303 ~~~~~---f~~~~l~P~~~~ie~~ln~~~e----------------~~~~~~i~F~~~---------~l~k 345 (345) T protein:vir:37 303 KYREV---YHYDEVMPLQEIIAETINQDPE----------------IKNLLKIKFREQ---------NFAK 345 (345) T ss_pred HHHHH---HHHHHHHHHHHHHHHHhhhhhc----------------cCCcceEEECch---------hhcC Confidence 55554 7666 77788888887765321 222345555432 2222 No 197 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=21.64 E-value=2.6 Score=18.34 Aligned_cols=393 Identities=12% Similarity=0.123 Sum_probs=151.3 Q ss_pred cceeeeccccccccCCCCCCCCCcccceee-cccccccccchhhhhhHHHHHHHHHHhhhhcchhhhHHHHhhcceeeec Q lcl|NC_021072. 6 FGFSLERAKKVPKGPSFVQKDSMDGSQPIV-GGGYYGYSVDFDGTVRNEYELITRYREMVLQPECDSAVDDIVNETICGN 84 (533) Q Consensus 6 fg~~i~~~~~~~~~~s~~~~~~~dg~~~~~-~~~~~~~~~~~~~~~~~~~~LI~~YR~m~~~pEvd~AvdeIvneaiv~d 84 (533) .||.=--.++...+.....+.. .+. ...+.++ . +. +....+|.|..+|.-|++.+-. T Consensus 1 mg~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~-------~--t~------~~~~~~~~v~~cv~~Ia~~ia~-- 58 (403) T protein:vir:10 1 MGFKSWITEKLNPGQRIIRDME-----PVSHRTNRKPF-------T--TG------QAYSKIEILNRTANMVIDSAAE-- 58 (403) T ss_pred Ccchhhhhhccchhhhhhhccc-----ccccccCCccc-------c--cH------HHHHHHHHHHHHHHHHHHHHhh-- Confidence 3432111111111111111110 000 0001111 0 00 2234678899999999876543 Q ss_pred CCCceEEEEeccCCCcHHHHHHHHHHHHHHHHH-hcchhhhhHHHHh----hhhcCceeeeeeecCCCCCCCeEEEEEcC Q lcl|NC_021072. 85 FDDVPVEVELSNLKQSDKIKKLIREEFAEILRL-LDFENRSYEIFRR----WYVDGRLFYHKVIDPKNPRGGLTELRYID 159 (533) Q Consensus 85 ~~~~~v~v~l~~~~~S~~ik~~I~eeF~~i~~l-L~f~~~~~~~fR~----WYvDGri~~hkvid~~~~~~gI~elr~lD 159 (533) .|+.|- +....-..-+..+...+.++++. =|-.-.+.++.+. +...|.-|.++ +. ..|..++ T Consensus 59 ---~p~~v~-~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~--~~-------~~l~~l~ 125 (403) T protein:vir:10 59 ---CSYTVG-DKYNIVTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYW--DG-------TSLYHVP 125 (403) T ss_pred ---CceeEe-ecccccccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEE--eC-------ceeEeec Confidence 444442 11111000001111223333332 1222344444433 56789988763 21 2577888 Q ss_pred hhhceehhhccCCCcCceeEEeccceeeccchhceeccccccccccCCcceeccchhhccccccccCCCCccchhHHHHH Q lcl|NC_021072. 160 PRKIRKVTEYQQKRPEQLRGEDINTQLTQKAAEYYLYNPKGLKNSTNQGMKIATDSVTYCHSGIQDLNKNMTLSHLHKAI 239 (533) Q Consensus 160 P~~i~~vr~~~~~~~~~~~~~~~~~~~~~~~~e~~~y~p~~~~~~~~~~~kI~~dai~y~hsGl~d~~~~~i~syL~~Ai 239 (533) |..+.-.. ..++..+ +|.|.. +..+.....++|.-....+ |. .++-.++|-+..|+ T Consensus 126 ~~~~~v~~-----~~~~~~~-------------~~~~~~-~~~~~~~eiih~~~~~~~~-~~----~~~~~G~s~i~~~~ 181 (403) T protein:vir:10 126 AALMQVEA-----DANKFIK-------------KFIFNN-QINYRVDEIIFIKDNSYVC-GT----NSQISGQSRVATVI 181 (403) T ss_pred CcceEEEE-----cCCceEE-------------EEEecC-ceeecccceEEeccccccc-CC----CCCcccccHHHHHH Confidence 77664211 1111110 011110 0111223444554332211 10 12223457777777 Q ss_pred HHHHHHHHHHHHHHHHHHhcCccceEEEccCCCCchHHHHHHHHHHHHhcccEEEeeCCCCccccccccchhHhhhcccc Q lcl|NC_021072. 240 KAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPR 319 (533) Q Consensus 240 K~~NqLrm~EDalVIyRi~RAPeRrvfyIDvGnlpk~KAeqYl~~im~~~rnk~vYd~~TGev~~d~~~msmlEDywLpR 319 (533) +.++.-..++....=+--.-+.-.-|...+ +.|-+..+++.-+.+-..|... ...|.+ + .++ T Consensus 182 ~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~~~~~~~~~~~~g~----~n~g~~------~-vl~------ 243 (403) T protein:vir:10 182 DSLEKRSKMLNFKEKFLDNGTVIGLILETD-EILNKKLRERKQEELQLDYNPS----TGQSSV------L-ILD------ 243 (403) T ss_pred HHHHHHHHHHHHHHHHHhccCCcceEEEeC-CCCCHHHHHHHHHHHHHHhCCc----ccCcce------e-ecC------ Confidence 777776666664432212223334455544 4565555544443333333210 111221 1 111 Q ss_pred cCCCCccceeecC---CCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHH Q lcl|NC_021072. 320 REGGRGTEISTLP---GGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSEL 396 (533) Q Consensus 320 ReggrgTEIsTLp---Gg~nLgei~DV~YF~~kLy~aL~VP~sRl~~~~~~~~g~~~eItRDElkF~Kfi~rLr~~fs~i 396 (533) | |.+++.+. .-..+..++..+|..+.+.++++||..-|+... .+.+.-.-..|.+++ |+-.+.. T Consensus 244 --~--g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~------~sn~e~~~~~f~~~t--l~P~~~~- 310 (403) T protein:vir:10 244 --G--GMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGGN------NANIRPNIELFYYMT--IIPMLNK- 310 (403) T ss_pred --C--CceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC------CcCHHHHHHHHHHHH--HHHHHHH- Confidence 1 33344332 122344577788899999999999999985321 112222223365543 3322222 Q ss_pred HHHHHHHHHHhccCCCHhHHhhhhhceeEEEeccchHHHHHHHHHHHHHHHHHHHhhhhccccccHHHHHHHHhCCCHHH Q lcl|NC_021072. 397 FMDLLKTQLILKGVMSLEEWDEMKEHIQFDFIADNYFTELKEIEIRNERMNQVNTMDPYVGKYFSIDYMRRQVLKQTDQE 476 (533) Q Consensus 397 f~d~Lk~qLilkgi~t~eew~~~~~~i~~~f~~Dn~f~E~ke~Ei~~~R~~~~~~~~~~vGky~S~~~i~k~IL~~tDee 476 (533) +.+.|-..| ...+.+++........ + ...|.+.++.+-. .-+++.+-++.. +++..-+ T Consensus 311 ie~~l~~~L--------------~~~~~~d~~~~~~l~~--D---~~~~~~~~~~~~~--~G~lT~NE~R~~-~gl~pi~ 368 (403) T protein:vir:10 311 LTSSLTFFF--------------GYKITPNTKEVAALTP--D---KEAEAKHLTSLVN--NGIITGNEARSE-LNLEPLD 368 (403) T ss_pred HHHHHHHhc--------------Cceeeeccchhhhccc--C---HHHHHHHHHHHHh--CCCcCHHHHHHH-hCCCCCC Confidence 222232222 1234455432211110 0 1345555544322 245666666633 5665421 Q ss_pred HHHHHHHHHHhhhcCCCCCCC-cccccCCCCCCCCCCCCccccc Q lcl|NC_021072. 477 IKEIDKQIDSEREAGLIVDPM-AEMDPAMDPGNAPPADDMSAQE 519 (533) Q Consensus 477 I~e~~kqi~~E~~~~~~~~p~-~~~~~~~~~~~~~~~~d~~~~~ 519 (533) +|.-+..+..-+ +-...+..+++.+.++.+...+ T Consensus 369 ---------~~~~d~~~~p~n~~~~~~~~~~~e~~~~~~~~~g~ 403 (403) T protein:vir:10 369 ---------DEQMNKIRIPANVAGSATGVSGQEGGRPKGSTEGD 403 (403) T ss_pred ---------cccccccccccccccccccCCCCcCCCCCCCcCCC Confidence 111111221001 0000111111111111111111 Done!