Query lcl|NC_016762.1_cdsid_YP_005098070.1 [gene=phi297_00043] [protein=hypothetical protein] [protein_id=YP_005098070.1] [location=25608..26978] Match_columns 456 No_of_seqs 144 out of 198 Neff 7.5 Searched_HMMs 1612 Date Thu Nov 7 13:26:05 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_43 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_43_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:105782 Length: 449 100.0 2E-144 1E-147 808.1 45.5 447 1-453 1-449 (449) 2 protein:vir:5249 Length: 437 # 100.0 1E-116 8E-120 656.0 39.9 414 18-455 1-437 (437) 3 protein:vir:96068 Length: 765 100.0 9E-114 5E-117 640.4 36.8 426 1-456 60-519 (765) 4 protein:vir:79647 Length: 435 100.0 3E-113 2E-116 637.5 39.4 410 1-456 5-435 (435) 5 protein:vir:99563 Length: 862 100.0 2E-112 1E-115 633.5 38.4 429 1-456 72-550 (862) 6 protein:vir:94049 Length: 532 100.0 2E-112 2E-115 632.5 37.5 432 1-456 23-511 (532) 7 protein:vir:104338 Length: 422 100.0 4E-112 3E-115 631.2 37.8 401 18-454 1-422 (422) 8 protein:vir:80040 Length: 461 100.0 3E-110 2E-113 620.9 39.2 429 1-456 1-457 (461) 9 protein:vir:107742 Length: 537 100.0 8E-111 5E-114 624.3 35.8 428 1-456 28-521 (537) 10 protein:vir:107662 Length: 427 100.0 3E-110 2E-113 621.2 37.7 404 16-456 1-426 (427) 11 protein:vir:106716 Length: 698 100.0 2E-110 2E-113 621.5 36.3 433 1-456 63-544 (698) 12 protein:vir:101541 Length: 694 100.0 7E-110 4E-113 619.1 36.9 433 1-456 62-539 (694) 13 protein:vir:78589 Length: 695 100.0 4E-110 2E-113 620.4 35.4 433 1-456 63-540 (695) 14 protein:vir:3648 Length: 695 # 100.0 4E-110 3E-113 620.3 35.4 433 1-456 63-540 (695) 15 protein:vir:103219 Length: 201 100.0 1.2E-55 7.4E-59 321.7 20.0 196 252-456 1-199 (201) 16 protein:vir:79772 Length: 648 99.9 2.1E-21 1.3E-24 133.9 29.9 412 1-456 17-490 (648) 17 protein:vir:102118 Length: 409 99.8 4E-19 2.5E-22 121.5 25.3 386 1-455 1-409 (409) 18 protein:vir:1380 Length: 422 # 99.8 1.4E-18 8.8E-22 118.4 27.1 381 22-454 1-422 (422) 19 protein:vir:100691 Length: 535 99.8 9.1E-18 5.6E-21 114.0 30.5 411 1-456 13-513 (535) 20 protein:vir:102855 Length: 432 99.7 5E-18 3.1E-21 115.4 28.2 382 22-456 1-427 (432) 21 protein:vir:105002 Length: 432 99.7 5E-18 3.1E-21 115.4 28.2 382 22-456 1-427 (432) 22 protein:vir:107605 Length: 432 99.7 5E-18 3.1E-21 115.4 28.2 382 22-456 1-427 (432) 23 protein:vir:4454 Length: 414 # 99.7 1.1E-17 6.7E-21 113.6 28.2 377 22-456 1-411 (414) 24 protein:vir:1326 Length: 457 # 99.7 2.2E-17 1.4E-20 111.9 28.6 380 22-456 1-440 (457) 25 protein:vir:1266 Length: 416 # 99.7 1.3E-17 8.3E-21 113.1 27.3 385 1-456 6-413 (416) 26 protein:vir:100882 Length: 383 99.7 4.8E-18 3E-21 115.5 24.4 357 22-456 1-383 (383) 27 protein:vir:483 Length: 413 # 99.7 3E-17 1.9E-20 111.1 27.0 386 1-456 2-409 (413) 28 protein:vir:6240 Length: 457 # 99.7 3.1E-17 1.9E-20 111.1 27.0 379 22-456 1-452 (457) 29 protein:vir:102080 Length: 429 99.7 4E-17 2.5E-20 110.5 27.3 382 22-456 1-429 (429) 30 protein:vir:7853 Length: 518 # 99.7 4.5E-17 2.8E-20 110.2 27.3 388 1-456 3-432 (518) 31 protein:vir:63755 Length: 547 99.7 2.9E-16 1.8E-19 105.7 31.1 405 1-456 23-515 (547) 32 protein:vir:80644 Length: 551 99.7 1.2E-16 7.6E-20 107.8 28.6 402 1-456 39-517 (551) 33 protein:vir:80796 Length: 574 99.7 3.2E-16 2E-19 105.5 30.4 405 1-456 41-520 (574) 34 protein:vir:8418 Length: 409 # 99.7 1E-16 6.2E-20 108.3 27.6 375 22-456 1-406 (409) 35 protein:vir:5737 Length: 419 # 99.7 2.6E-17 1.6E-20 111.5 24.1 379 22-456 1-412 (419) 36 protein:vir:4337 Length: 434 # 99.7 1.5E-16 9.1E-20 107.4 26.2 393 1-456 1-431 (434) 37 protein:vir:81152 Length: 411 99.7 2.8E-16 1.7E-19 105.8 26.6 380 22-455 1-411 (411) 38 protein:vir:100187 Length: 385 99.6 2.8E-16 1.7E-19 105.9 25.4 359 22-455 1-385 (385) 39 protein:vir:960 Length: 413 # 99.6 1E-16 6.3E-20 108.3 22.9 383 1-452 4-413 (413) 40 protein:vir:81095 Length: 416 99.6 3.8E-16 2.4E-19 105.1 25.9 377 22-456 1-416 (416) 41 protein:vir:4598 Length: 416 # 99.6 3.8E-16 2.4E-19 105.1 25.9 377 22-456 1-416 (416) 42 protein:vir:95378 Length: 406 99.6 2.1E-16 1.3E-19 106.6 23.6 373 22-456 1-406 (406) 43 protein:vir:81072 Length: 432 99.6 7.4E-16 4.6E-19 103.5 26.5 390 1-456 1-425 (432) 44 protein:vir:8100 Length: 466 # 99.6 1.2E-15 7.6E-19 102.3 27.6 418 1-456 3-463 (466) 45 protein:vir:95599 Length: 563 99.6 3.8E-15 2.3E-18 99.7 30.0 408 1-456 42-526 (563) 46 protein:vir:99312 Length: 563 99.6 3.8E-15 2.3E-18 99.7 30.0 408 1-456 42-526 (563) 47 protein:vir:94666 Length: 723 99.6 1.4E-15 8.4E-19 102.1 27.0 373 24-456 1-405 (723) 48 protein:vir:101648 Length: 518 99.6 1.7E-15 1E-18 101.6 27.3 381 22-456 1-432 (518) 49 protein:vir:94426 Length: 409 99.6 1E-15 6.4E-19 102.8 26.0 384 1-456 1-408 (409) 50 protein:vir:2683 Length: 412 # 99.6 2E-15 1.2E-18 101.2 27.5 386 1-456 1-411 (412) 51 protein:vir:3153 Length: 467 # 99.6 9.3E-16 5.8E-19 103.0 25.5 374 53-456 1-441 (467) 52 protein:vir:93943 Length: 409 99.6 2.1E-15 1.3E-18 101.1 27.0 385 1-456 1-408 (409) 53 protein:vir:3843 Length: 397 # 99.6 1.9E-15 1.2E-18 101.3 26.4 366 22-456 1-397 (397) 54 protein:vir:97060 Length: 432 99.6 1.9E-15 1.2E-18 101.3 26.3 390 1-456 1-428 (432) 55 protein:vir:96980 Length: 409 99.6 3.7E-15 2.3E-18 99.7 26.7 384 1-456 1-408 (409) 56 protein:vir:4509 Length: 424 # 99.6 1E-14 6.4E-18 97.3 28.8 382 1-456 11-424 (424) 57 protein:vir:1431 Length: 419 # 99.6 6.1E-15 3.8E-18 98.5 26.3 391 1-456 1-413 (419) 58 protein:vir:10362 Length: 432 99.6 5.3E-15 3.3E-18 98.9 25.7 391 1-456 1-428 (432) 59 protein:vir:102727 Length: 945 99.6 3.1E-15 1.9E-18 100.2 24.2 404 1-456 64-528 (945) 60 protein:vir:100150 Length: 437 99.6 1.4E-14 8.8E-18 96.5 27.5 390 1-456 1-436 (437) 61 protein:vir:93610 Length: 454 99.6 1.2E-14 7.4E-18 96.9 27.1 392 1-456 1-440 (454) 62 protein:vir:6210 Length: 394 # 99.6 1.5E-14 9.5E-18 96.3 27.0 362 22-456 1-393 (394) 63 protein:vir:81218 Length: 423 99.6 2.7E-15 1.7E-18 100.5 22.7 386 22-456 1-421 (423) 64 protein:vir:80333 Length: 419 99.6 1.6E-14 9.8E-18 96.2 26.8 388 1-456 1-405 (419) 65 protein:vir:1884 Length: 424 # 99.6 8.8E-15 5.4E-18 97.7 25.3 387 1-453 1-424 (424) 66 protein:vir:189 Length: 424 # 99.6 8.8E-15 5.5E-18 97.6 24.8 387 1-453 1-424 (424) 67 protein:vir:9507 Length: 395 # 99.6 3.9E-15 2.4E-18 99.6 22.7 360 22-456 1-391 (395) 68 protein:vir:100650 Length: 395 99.6 3.9E-15 2.4E-18 99.6 22.7 360 22-456 1-391 (395) 69 protein:vir:101289 Length: 395 99.6 3.9E-15 2.4E-18 99.6 22.7 360 22-456 1-391 (395) 70 protein:vir:4952 Length: 386 # 99.6 1.2E-14 7.5E-18 96.9 25.3 375 1-455 3-386 (386) 71 protein:vir:100249 Length: 431 99.5 1.2E-14 7.6E-18 96.9 25.1 397 1-453 3-431 (431) 72 protein:vir:4156 Length: 542 # 99.5 1.9E-14 1.2E-17 95.8 25.1 403 1-456 1-439 (542) 73 protein:vir:9702 Length: 406 # 99.5 2E-14 1.3E-17 95.6 24.4 378 22-456 1-404 (406) 74 protein:vir:96579 Length: 576 99.5 5E-14 3.1E-17 93.5 26.3 408 1-456 1-522 (576) 75 protein:vir:105064 Length: 421 99.5 1.5E-14 9.5E-18 96.3 23.4 387 1-456 1-415 (421) 76 protein:vir:79984 Length: 441 99.5 2.3E-14 1.4E-17 95.4 23.6 389 1-456 11-440 (441) 77 protein:vir:9408 Length: 441 # 99.5 2.3E-14 1.4E-17 95.4 23.6 389 1-456 11-440 (441) 78 protein:vir:98396 Length: 441 99.5 2.8E-14 1.7E-17 94.9 23.4 392 1-456 17-440 (441) 79 protein:vir:9359 Length: 348 # 99.5 5.7E-14 3.5E-17 93.2 24.2 326 66-456 1-347 (348) 80 protein:vir:3868 Length: 417 # 99.5 7.3E-14 4.6E-17 92.6 24.6 376 22-456 1-413 (417) 81 protein:vir:7407 Length: 392 # 99.5 1.2E-13 7.6E-17 91.4 25.8 365 20-456 1-389 (392) 82 protein:vir:99452 Length: 651 99.5 3.8E-14 2.4E-17 94.1 22.7 420 1-456 1-539 (651) 83 protein:vir:78310 Length: 376 99.5 4.2E-14 2.6E-17 93.9 22.6 360 22-453 1-376 (376) 84 protein:vir:4854 Length: 386 # 99.5 1.7E-13 1.1E-16 90.6 25.1 368 22-455 1-386 (386) 85 protein:vir:80134 Length: 403 99.5 1E-13 6.2E-17 91.9 23.5 373 22-456 1-403 (403) 86 protein:vir:1023 Length: 392 # 99.5 2.1E-13 1.3E-16 90.1 25.3 365 20-456 1-389 (392) 87 protein:vir:3989 Length: 392 # 99.5 2.1E-13 1.3E-16 90.1 25.3 365 20-456 1-389 (392) 88 protein:vir:4089 Length: 395 # 99.4 1.2E-13 7.6E-17 91.4 22.4 367 22-456 1-389 (395) 89 protein:vir:95965 Length: 385 99.4 9.9E-14 6.1E-17 91.9 21.9 361 22-456 1-383 (385) 90 protein:vir:94002 Length: 378 99.4 6.4E-14 3.9E-17 92.9 20.7 343 22-456 1-378 (378) 91 protein:vir:101647 Length: 460 99.4 1.3E-12 8.2E-16 85.7 27.8 400 1-456 1-460 (460) 92 protein:vir:104259 Length: 403 99.4 3.9E-13 2.4E-16 88.6 24.8 363 22-456 1-403 (403) 93 protein:vir:93867 Length: 378 99.4 6.2E-14 3.9E-17 93.0 19.8 343 22-456 1-374 (378) 94 protein:vir:4995 Length: 384 # 99.4 5.4E-14 3.3E-17 93.3 19.3 356 22-439 1-384 (384) 95 protein:vir:4828 Length: 382 # 99.4 3.1E-13 1.9E-16 89.2 23.4 364 22-456 1-379 (382) 96 protein:vir:4194 Length: 540 # 99.4 6.5E-13 4E-16 87.4 25.2 396 1-456 1-450 (540) 97 protein:vir:9641 Length: 395 # 99.4 3.5E-13 2.2E-16 88.9 23.1 369 22-456 1-394 (395) 98 protein:vir:105819 Length: 456 99.4 6.7E-13 4.2E-16 87.3 24.4 415 1-455 1-456 (456) 99 protein:vir:102602 Length: 456 99.4 6.7E-13 4.2E-16 87.3 24.4 415 1-455 1-456 (456) 100 protein:vir:1661 Length: 378 # 99.4 2E-13 1.3E-16 90.2 21.2 343 22-456 1-378 (378) 101 protein:vir:8317 Length: 409 # 99.4 4.3E-13 2.7E-16 88.4 22.7 355 22-441 1-409 (409) 102 protein:vir:98444 Length: 434 99.4 1.4E-12 8.5E-16 85.6 25.0 374 43-453 1-434 (434) 103 protein:vir:79538 Length: 502 99.4 1.7E-11 1E-14 79.7 30.4 417 1-455 11-502 (502) 104 protein:vir:2427 Length: 485 # 99.4 4E-11 2.5E-14 77.6 32.2 410 1-453 13-485 (485) 105 protein:vir:7987 Length: 456 # 99.3 1.8E-12 1.1E-15 85.0 23.5 410 1-455 1-456 (456) 106 protein:vir:94869 Length: 378 99.3 1.8E-12 1.1E-15 85.0 23.4 340 22-456 1-378 (378) 107 protein:vir:98643 Length: 395 99.3 4.4E-12 2.7E-15 82.8 22.7 371 22-456 1-394 (395) 108 protein:vir:99072 Length: 479 99.3 5.2E-11 3.2E-14 77.0 27.8 412 1-456 9-472 (479) 109 protein:vir:99916 Length: 504 99.3 2.6E-10 1.6E-13 73.1 33.1 420 1-456 18-503 (504) 110 protein:vir:858 Length: 378 # 99.2 9.3E-12 5.7E-15 81.1 22.2 344 22-456 1-378 (378) 111 protein:vir:78227 Length: 480 99.2 4.3E-10 2.7E-13 71.9 30.7 409 1-456 1-476 (480) 112 protein:vir:94101 Length: 474 99.2 2.8E-10 1.7E-13 73.0 28.9 423 1-456 15-474 (474) 113 protein:vir:105889 Length: 474 99.2 2.8E-10 1.7E-13 73.0 28.9 423 1-456 15-474 (474) 114 protein:vir:4223 Length: 486 # 99.2 1.6E-10 1E-13 74.2 27.3 413 1-456 1-482 (486) 115 protein:vir:78537 Length: 480 99.2 5.4E-10 3.4E-13 71.4 29.5 412 1-456 1-476 (480) 116 protein:vir:104082 Length: 485 99.2 5.6E-10 3.5E-13 71.3 29.2 411 1-453 1-485 (485) 117 protein:vir:7768 Length: 484 # 99.2 3.6E-10 2.3E-13 72.3 27.7 401 1-456 1-481 (484) 118 protein:vir:2500 Length: 501 # 99.1 1.2E-09 7.5E-13 69.5 29.9 403 1-456 23-496 (501) 119 protein:vir:389 Length: 530 # 99.1 6.8E-10 4.2E-13 70.8 27.5 432 1-453 1-530 (530) 120 protein:vir:95542 Length: 548 99.1 1.2E-09 7.5E-13 69.5 28.2 422 1-456 1-515 (548) 121 protein:vir:96738 Length: 505 99.1 1E-09 6.4E-13 69.9 27.1 422 1-455 1-505 (505) 122 protein:vir:97171 Length: 512 99.1 1E-09 6.5E-13 69.8 25.3 417 1-456 31-511 (512) 123 protein:vir:9751 Length: 422 # 99.1 8.6E-10 5.3E-13 70.3 24.8 384 1-444 1-422 (422) 124 protein:vir:8184 Length: 474 # 99.0 2E-09 1.2E-12 68.3 26.4 419 1-456 12-474 (474) 125 protein:vir:1082 Length: 359 # 99.0 3.7E-10 2.3E-13 72.3 22.4 338 22-431 1-359 (359) 126 protein:vir:99781 Length: 511 99.0 3E-09 1.9E-12 67.3 27.3 420 1-456 31-510 (511) 127 protein:vir:6382 Length: 553 # 99.0 1.3E-09 8.2E-13 69.3 25.0 439 1-456 1-553 (553) 128 protein:vir:96240 Length: 511 99.0 2.2E-09 1.4E-12 68.1 26.0 421 1-456 31-510 (511) 129 protein:vir:93747 Length: 472 99.0 9.5E-10 5.9E-13 70.0 24.0 406 1-456 18-472 (472) 130 protein:vir:9306 Length: 511 # 99.0 2.4E-09 1.5E-12 67.9 25.5 423 1-456 31-510 (511) 131 protein:vir:96494 Length: 501 99.0 5.7E-09 3.5E-12 65.8 27.9 420 1-456 37-499 (501) 132 protein:vir:94498 Length: 474 99.0 3.1E-09 1.9E-12 67.2 25.7 415 1-456 1-474 (474) 133 protein:vir:97447 Length: 474 99.0 3.1E-09 1.9E-12 67.2 25.7 415 1-456 1-474 (474) 134 protein:vir:99522 Length: 470 99.0 9.6E-10 5.9E-13 70.0 22.8 409 1-456 1-470 (470) 135 protein:vir:97336 Length: 492 99.0 1.6E-09 9.8E-13 68.8 24.0 407 1-456 38-492 (492) 136 protein:vir:78805 Length: 511 99.0 6.7E-09 4.2E-12 65.4 28.2 420 1-456 40-510 (511) 137 protein:vir:96366 Length: 511 99.0 6.7E-09 4.2E-12 65.4 28.2 420 1-456 40-510 (511) 138 protein:vir:103951 Length: 511 99.0 4.2E-09 2.6E-12 66.5 25.9 423 1-456 31-510 (511) 139 protein:vir:3420 Length: 533 # 99.0 3.5E-09 2.2E-12 66.9 24.9 437 1-456 3-531 (533) 140 protein:vir:106639 Length: 481 99.0 3.4E-09 2.1E-12 67.0 24.7 407 1-453 30-481 (481) 141 protein:vir:95113 Length: 474 99.0 4.9E-09 3E-12 66.1 25.4 414 1-450 1-474 (474) 142 protein:vir:2341 Length: 488 # 99.0 9.5E-09 5.9E-12 64.6 29.4 420 1-453 7-488 (488) 143 protein:vir:9568 Length: 410 # 99.0 7.5E-10 4.7E-13 70.6 20.8 374 7-450 1-410 (410) 144 protein:vir:94805 Length: 492 99.0 3.4E-09 2.1E-12 67.0 24.2 407 1-456 38-492 (492) 145 protein:vir:98883 Length: 517 98.9 6.7E-10 4.2E-13 70.9 19.4 421 1-456 3-517 (517) 146 protein:vir:10321 Length: 495 98.9 3E-09 1.8E-12 67.3 22.7 423 1-456 3-495 (495) 147 protein:vir:95806 Length: 440 98.9 3.5E-09 2.1E-12 67.0 22.9 408 7-456 1-440 (440) 148 protein:vir:38 Length: 496 # N 98.9 1.2E-09 7.3E-13 69.5 19.9 410 1-456 1-496 (496) 149 protein:vir:3964 Length: 453 # 98.9 1.9E-08 1.2E-11 62.9 26.1 403 1-456 17-453 (453) 150 protein:vir:5961 Length: 503 # 98.9 2.1E-08 1.3E-11 62.6 28.8 420 1-456 26-501 (503) 151 protein:vir:80959 Length: 499 98.9 1.9E-09 1.2E-12 68.4 19.9 420 1-456 1-499 (499) 152 protein:vir:9871 Length: 429 # 98.9 3.3E-09 2E-12 67.1 21.2 395 1-452 1-429 (429) 153 protein:vir:1236 Length: 483 # 98.9 1.3E-08 8E-12 63.8 24.1 413 1-456 29-483 (483) 154 protein:vir:3609 Length: 452 # 98.8 2.4E-08 1.5E-11 62.4 25.1 399 1-456 17-452 (452) 155 protein:vir:1634 Length: 409 # 98.8 3.3E-08 2.1E-11 61.6 28.4 369 1-431 1-409 (409) 156 protein:vir:4898 Length: 502 # 98.8 2.1E-08 1.3E-11 62.7 24.1 417 1-456 38-498 (502) 157 protein:vir:95899 Length: 474 98.8 3.4E-08 2.1E-11 61.5 25.2 412 1-456 1-473 (474) 158 protein:vir:96266 Length: 474 98.8 3.4E-08 2.1E-11 61.5 25.2 412 1-456 1-473 (474) 159 protein:vir:99853 Length: 488 98.8 3.8E-08 2.3E-11 61.3 29.5 390 8-456 1-408 (488) 160 protein:vir:79043 Length: 479 98.8 2.4E-08 1.5E-11 62.4 23.1 406 1-452 20-479 (479) 161 protein:vir:1587 Length: 508 # 98.8 1.6E-08 9.6E-12 63.4 22.1 409 1-456 3-508 (508) 162 protein:vir:102950 Length: 471 98.8 5.7E-08 3.5E-11 60.3 25.1 410 1-456 1-471 (471) 163 protein:vir:94546 Length: 506 98.8 6.1E-08 3.8E-11 60.1 27.0 426 1-456 22-504 (506) 164 protein:vir:107112 Length: 478 98.8 6.3E-08 3.9E-11 60.0 26.7 411 1-454 20-478 (478) 165 protein:vir:94742 Length: 409 98.8 6.5E-08 4E-11 60.0 29.0 371 1-431 1-409 (409) 166 protein:vir:79703 Length: 505 98.7 1.7E-08 1E-11 63.2 21.1 403 1-454 3-505 (505) 167 protein:vir:107880 Length: 491 98.7 7.5E-08 4.6E-11 59.7 34.1 387 1-456 1-419 (491) 168 protein:vir:9815 Length: 500 # 98.7 9.1E-09 5.6E-12 64.7 19.6 397 1-454 3-500 (500) 169 protein:vir:3028 Length: 500 # 98.7 9.1E-09 5.6E-12 64.7 19.6 397 1-454 3-500 (500) 170 protein:vir:79063 Length: 491 98.7 1E-07 6.3E-11 58.9 30.6 389 1-456 1-419 (491) 171 protein:vir:106571 Length: 499 98.6 1.5E-07 9.4E-11 58.0 25.1 413 1-456 5-491 (499) 172 protein:vir:2732 Length: 501 # 98.6 1.6E-07 9.9E-11 57.8 25.9 419 1-456 37-497 (501) 173 protein:vir:105292 Length: 478 98.6 1.7E-07 1.1E-10 57.7 27.7 409 1-456 26-478 (478) 174 protein:vir:78907 Length: 518 98.6 2.1E-07 1.3E-10 57.2 24.9 419 1-456 3-514 (518) 175 protein:vir:4782 Length: 522 # 98.5 3E-07 1.8E-10 56.4 22.5 417 1-453 3-522 (522) 176 protein:vir:102330 Length: 451 98.5 5.2E-07 3.2E-10 55.0 23.5 404 1-454 1-451 (451) 177 protein:vir:96839 Length: 474 98.5 6E-07 3.7E-10 54.7 27.4 402 1-450 20-474 (474) 178 protein:vir:9922 Length: 489 # 98.4 6.4E-07 3.9E-10 54.6 25.4 417 1-451 9-489 (489) 179 protein:vir:733 Length: 453 # 98.4 2E-07 1.2E-10 57.4 18.5 401 1-456 17-448 (453) 180 protein:vir:96179 Length: 468 98.4 9.6E-07 6E-10 53.6 25.9 395 1-452 25-468 (468) 181 protein:vir:80680 Length: 441 98.4 9.8E-07 6.1E-10 53.5 26.8 394 1-456 1-440 (441) 182 protein:vir:78641 Length: 278 98.3 7E-07 4.3E-10 54.3 19.8 264 66-386 1-278 (278) 183 protein:vir:108215 Length: 469 98.3 2E-06 1.2E-09 51.9 29.2 409 1-456 1-464 (469) 184 protein:vir:5839 Length: 533 # 98.2 2.5E-06 1.6E-09 51.3 23.5 413 1-456 1-493 (533) 185 protein:vir:267 Length: 348 # 98.2 1E-06 6.4E-10 53.4 18.1 321 1-397 1-348 (348) 186 protein:vir:105461 Length: 470 98.1 4.5E-06 2.8E-09 49.9 25.1 403 1-456 2-470 (470) 187 protein:vir:95254 Length: 488 98.1 5.4E-06 3.4E-09 49.5 27.1 410 1-456 1-480 (488) 188 protein:vir:103458 Length: 524 97.9 9.6E-06 6E-09 48.1 18.6 431 1-456 1-522 (524) 189 protein:vir:7208 Length: 524 # 97.9 9.7E-06 6E-09 48.1 18.5 431 1-456 1-522 (524) 190 protein:vir:106282 Length: 521 97.7 2.5E-05 1.5E-08 45.9 21.3 429 1-456 1-519 (521) 191 protein:vir:104500 Length: 537 97.7 2.7E-05 1.6E-08 45.7 23.5 426 1-456 1-533 (537) 192 protein:vir:98816 Length: 446 97.6 4.1E-05 2.6E-08 44.6 25.5 406 1-435 1-446 (446) 193 protein:vir:106999 Length: 564 97.6 4.6E-05 2.8E-08 44.4 24.0 422 1-456 1-548 (564) 194 protein:vir:108049 Length: 524 97.5 4.7E-05 2.9E-08 44.3 18.8 431 1-456 1-522 (524) 195 protein:vir:98265 Length: 524 97.5 5.3E-05 3.3E-08 44.0 18.3 431 1-456 6-522 (524) 196 protein:vir:79233 Length: 526 97.5 5.6E-05 3.5E-08 43.9 30.1 397 1-456 1-446 (526) 197 protein:vir:81017 Length: 521 97.4 7.1E-05 4.4E-08 43.3 18.9 430 1-456 1-519 (521) 198 protein:vir:103860 Length: 528 97.4 8.7E-05 5.4E-08 42.8 32.8 400 1-456 1-449 (528) 199 protein:vir:98567 Length: 340 97.3 5.1E-05 3.2E-08 44.1 15.1 315 1-392 1-340 (340) 200 protein:vir:99232 Length: 526 97.2 0.00012 7.3E-08 42.1 33.1 398 1-456 1-447 (526) 201 protein:vir:104892 Length: 558 97.2 0.00013 8.2E-08 41.9 21.2 422 1-456 1-543 (558) 202 protein:vir:6058 Length: 344 # 97.2 0.00015 9.2E-08 41.6 16.6 316 1-391 1-344 (344) 203 protein:vir:78191 Length: 351 97.1 0.00018 1.1E-07 41.2 16.9 319 1-395 1-351 (351) 204 protein:vir:6896 Length: 523 # 97.0 0.0002 1.3E-07 40.8 18.1 429 1-456 1-521 (523) 205 protein:vir:79150 Length: 368 97.0 0.00012 7.5E-08 42.0 14.2 324 1-402 1-368 (368) 206 protein:vir:3780 Length: 345 # 97.0 0.00024 1.5E-07 40.5 16.7 321 1-388 1-345 (345) 207 protein:vir:6596 Length: 521 # 96.9 0.00029 1.8E-07 40.0 22.7 432 1-456 1-519 (521) 208 protein:vir:103971 Length: 376 96.9 0.00029 1.8E-07 40.0 16.3 318 1-395 26-376 (376) 209 protein:vir:103177 Length: 533 96.8 0.00033 2E-07 39.7 23.7 427 1-456 1-532 (533) 210 protein:vir:2013 Length: 344 # 96.8 0.00035 2.2E-07 39.5 17.2 316 1-391 1-344 (344) 211 protein:vir:78083 Length: 537 96.5 0.0006 3.7E-07 38.2 29.2 424 1-456 1-536 (537) 212 protein:vir:79207 Length: 351 96.3 0.00073 4.5E-07 37.8 17.6 319 1-395 1-351 (351) 213 protein:vir:78161 Length: 355 96.2 0.00088 5.4E-07 37.3 23.5 302 125-456 1-333 (355) 214 protein:vir:3743 Length: 345 # 96.2 0.0009 5.6E-07 37.3 17.6 315 1-388 1-345 (345) 215 protein:vir:98853 Length: 219 96.0 0.0011 7E-07 36.7 15.2 201 142-393 1-219 (219) 216 protein:vir:79511 Length: 448 95.9 0.0012 7.5E-07 36.6 29.8 402 1-456 1-440 (448) 217 protein:vir:5691 Length: 344 # 95.9 0.0012 7.6E-07 36.6 16.3 316 1-395 1-344 (344) 218 protein:vir:78749 Length: 337 95.5 0.0019 1.2E-06 35.6 18.1 310 1-389 1-337 (337) 219 protein:vir:101806 Length: 516 95.4 0.0021 1.3E-06 35.3 18.0 426 1-456 3-515 (516) 220 protein:vir:101189 Length: 516 95.4 0.0021 1.3E-06 35.3 18.0 426 1-456 3-515 (516) 221 protein:vir:5665 Length: 511 # 95.1 0.0028 1.7E-06 34.6 20.5 417 1-456 3-509 (511) 222 protein:vir:1986 Length: 512 # 95.0 0.0031 1.9E-06 34.4 30.2 398 1-456 1-439 (512) 223 protein:vir:4698 Length: 251 # 94.6 0.0036 2.2E-06 34.0 12.0 234 22-293 1-251 (251) 224 protein:vir:100598 Length: 516 94.4 0.0044 2.7E-06 33.5 18.9 426 1-456 3-515 (516) 225 protein:vir:100328 Length: 346 94.4 0.0045 2.8E-06 33.4 15.0 318 1-393 1-346 (346) 226 protein:vir:100039 Length: 522 94.1 0.0053 3.3E-06 33.0 19.9 409 1-456 1-511 (522) 227 protein:vir:107822 Length: 555 93.1 0.0089 5.5E-06 31.8 20.5 415 1-456 1-517 (555) 228 protein:vir:98506 Length: 555 93.1 0.0089 5.5E-06 31.8 20.5 415 1-456 1-517 (555) 229 protein:vir:107404 Length: 555 93.1 0.0089 5.5E-06 31.8 20.5 415 1-456 1-517 (555) 230 protein:vir:6322 Length: 510 # 92.4 0.012 7.2E-06 31.2 21.4 410 1-456 1-490 (510) 231 protein:vir:3361 Length: 535 # 91.4 0.016 9.9E-06 30.4 18.1 410 1-456 1-521 (535) 232 protein:vir:99672 Length: 532 90.7 0.019 1.2E-05 30.0 17.4 415 1-456 1-506 (532) 233 protein:vir:101418 Length: 569 89.9 0.024 1.5E-05 29.5 21.8 421 1-456 45-569 (569) 234 protein:vir:7321 Length: 556 # 88.0 0.035 2.2E-05 28.5 20.4 420 1-456 1-556 (556) 235 protein:vir:1538 Length: 535 # 87.9 0.036 2.2E-05 28.5 16.4 413 1-456 1-521 (535) 236 protein:vir:8883 Length: 543 # 87.8 0.036 2.3E-05 28.5 17.0 418 1-456 1-543 (543) 237 protein:vir:1150 Length: 350 # 87.7 0.037 2.3E-05 28.4 21.5 314 1-386 7-350 (350) 238 protein:vir:102668 Length: 547 87.4 0.039 2.4E-05 28.3 22.1 413 1-456 1-547 (547) 239 protein:vir:97265 Length: 513 86.7 0.044 2.7E-05 28.0 21.7 396 1-456 1-500 (513) 240 protein:vir:95315 Length: 559 86.6 0.045 2.8E-05 28.0 19.9 412 1-456 1-558 (559) 241 protein:vir:78942 Length: 510 85.8 0.05 3.1E-05 27.7 21.7 408 1-456 1-489 (510) 242 protein:vir:77981 Length: 448 85.2 0.054 3.4E-05 27.5 31.1 397 1-456 1-435 (448) 243 protein:vir:1785 Length: 555 # 83.8 0.065 4E-05 27.1 21.9 409 1-456 1-539 (555) 244 protein:vir:78696 Length: 542 80.0 0.099 6.1E-05 26.1 20.7 408 1-456 1-541 (542) 245 protein:vir:95149 Length: 501 79.1 0.11 6.7E-05 25.9 24.5 400 1-456 1-501 (501) 246 protein:vir:80211 Length: 514 76.1 0.14 8.7E-05 25.3 22.2 417 1-456 1-495 (514) 247 protein:vir:10447 Length: 536 64.7 0.3 0.00018 23.5 24.8 416 1-456 1-534 (536) 248 protein:vir:94709 Length: 522 62.4 0.34 0.00021 23.2 23.8 406 1-456 1-494 (522) 249 protein:vir:96988 Length: 516 56.3 0.46 0.00029 22.4 14.7 416 1-456 1-512 (516) 250 protein:vir:2198 Length: 536 # 54.2 0.51 0.00032 22.2 24.6 417 1-456 1-534 (536) 251 protein:vir:80453 Length: 535 35.4 1.2 0.00077 20.1 27.8 396 1-456 32-532 (535) 252 protein:vir:101494 Length: 527 26.7 1.9 0.0012 19.0 26.3 408 1-452 1-527 (527) 253 protein:vir:102239 Length: 527 26.0 2 0.0012 18.9 26.3 408 1-452 1-527 (527) 254 protein:vir:7430 Length: 563 # 22.8 2.4 0.0015 18.5 26.2 419 1-456 1-559 (563) No 1 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=100.00 E-value=2.3e-144 Score=808.08 Aligned_cols=447 Identities=60% Similarity=1.012 Sum_probs=416.1 Q ss_pred CCchhHHHHhHHHHHH-HHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCC Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSA-IARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNP 79 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~-~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~ 79 (456) ||+||.+||||++++. ++++||+|.|+++|+||+||++|++||||+.+++++|+++|++|||+|+|||+|+|+|||+|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~rd~l~~~~~glg~~r~~~~~~~g~~~~~~~~~l~~~Yr~~~ia~~iVd~~~d~~~~~~~ 80 (449) T protein:vir:10 1 MTDKLTLAVNHALNDARMARARMGLMVPTMGLDNKRHSAWCEYGFPELVTYENLYSLYRRGGIAHGAVEKLVGKCWQTNP 80 (449) T ss_pred CchhhHHHHhhhcchhHHHHHHHHHHHHHhcCCcccchhhhhcCCcccCCHHHHHHHHhcCchhHHHHHhhhhhhhhcCc Confidence 9999999999998875 788999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCceeEEEEecccc Q lcl|NC_016762. 80 QVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGC 159 (456) Q Consensus 80 ~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~ 159 (456) +|+++++.++.++.+.|++++++++++ ++|++|+||++|+|+||||+|++.++|+++|++||..+ ++|++|+|+|+++ T Consensus 81 ~i~~g~~~~~~~~~~~~e~~~~~l~~~-~~~~~l~ea~~~~rl~Gga~i~i~v~d~~~l~~Pl~~~-~~i~~i~v~~~~~ 158 (449) T protein:vir:10 81 EIIEGDDADDSEDETSWEKKSKQVFTN-RLWRSFAEADRRRLVGRYAGILLHIRDEKDWNLPATKG-RGLQKVSVSWAGS 158 (449) T ss_pred ccccCccccchhhhHHHHHHHHHHHHH-HHHHHHHHHHHhhhccCcEEEEEEecCCCCCCcccccC-cceeeEEeecccc Confidence 999999999999999999999987654 89999999999999999999999999999999999854 6899999999999 Q ss_pred CChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 160 LKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLK 239 (456) Q Consensus 160 ~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~ 239 (456) ++|.++++||+||+||+|++|+|++..+++ +.++++||||||++|+++..+|+|+|+++||++++++++++++++++|+ T Consensus 159 i~~~~~~~dp~sp~yg~P~~y~v~~~~~g~-~~~~~~iH~SRl~~~~~~~~~g~~~L~~~yn~l~~~~~~~~~~a~~~l~ 237 (449) T protein:vir:10 159 LKVAEWDTGINSKTYGQPKLWKYTERLPNG-SSRRVDIHPDRVFILGDYSEDAIGFLEPAYNAFVSLEKVEGGSGESFLK 237 (449) T ss_pred CChhhhhcCCCCCCCCCceEEEEeeeccCC-CccceeeccceeEeecCCCCCChhHHHHHHHHhhhHHHhhhhHHHHHHH Confidence 999999999999999999999999776654 4567899999999999999999999999999999999999999999999 Q ss_pred HhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEecccCCHHHHHHHHHHHHHhh Q lcl|NC_016762. 240 NAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSAVSDPGPTYNVNLQTAAAG 319 (456) Q Consensus 240 ~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~~sgl~~~~~~~~~~~aaa 319 (456) ++++++.+.+.++++|.++++.++.+.+++.++++++++.++++++.+++|++++|++++++|||+++++++++|++||+ T Consensus 238 ~~~rq~~~~~~~~~~~~~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~i~~~~d~~~~~~~~sgl~d~l~~~~q~iaaa 317 (449) T protein:vir:10 238 NAARQLNVNFEKEIDFTNLASLYGVSIDELQDKFNEVAGEINRGNDVLMTTQGATVTPLVTSVADPTATYNVNLQTAAAG 317 (449) T ss_pred HHHHHHhhhhhhhhhhhhhhHHhhCCchHHHHHHHHHHHHHhccchheeecCCcceEEEecccCChhHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcCCeEEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHH Q lcl|NC_016762. 320 VDIPTKILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSK 399 (456) Q Consensus 320 s~IP~t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~ 399 (456) ++||+||||||||||||||+|++|||++|+++|+ .|+|.|++|+++|+++++|.++++|+|+|+|||+||+||||||++ T Consensus 318 ~~IP~t~L~Gqsp~glnst~D~~nyyd~i~~~Q~-~l~p~le~l~~~l~~s~~g~~~~d~~i~f~pL~~~t~kEkAei~k 396 (449) T protein:vir:10 318 VDIPTRILIGNQQAERSSTEDQKYFNARCQSRRV-DLSFEIEDFCDKLIELKIIDAVAKKAVIWDDLNEQTGTEKLTNAK 396 (449) T ss_pred hCCCeeeeeccCccccccchhHHHHHHHHHHHHH-hhhHHHHHHHHHHHHhhcCCCCCceeEEeCCCCCCCHHHHHHHHH Confidence 9999999999999999999999999999999998 699999999999999999988889999999999999999999999 Q ss_pred HHHHHHHHHHHcC-CcCcCHHHHHHHhcccCCCCCCCCcccCCCCCCCCCcCCCC Q lcl|NC_016762. 400 TMSEINSAAIGTG-EPVFTAEEIREEAGYDPLQGGDPLPDTEPEDEDAARTDPTG 453 (456) Q Consensus 400 ~~A~a~~~~~~~g-~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~ 453 (456) ++|+|+++++++| +++++++|+|++++++|......++ ++.+++.+..|+.| T Consensus 397 ~~A~a~~~~~~ag~~~~~~~~EiR~~~~~~~~~~~~~~~--e~~de~~~~~d~~a 449 (449) T protein:vir:10 397 TMGEINQTMLGSGDNPAFSREEIRTAAGYDNDDEEPLGE--EDGDEEDKATDSAA 449 (449) T ss_pred HHHHHHHHHHHccccCCcCHHHHHHHhcccCCCCCCCCC--CCCccccccCCcCC Confidence 9999999999887 5699999999999998865433222 22334444455555 No 2 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=100.00 E-value=1.2e-116 Score=656.02 Aligned_cols=414 Identities=14% Similarity=0.114 Sum_probs=351.5 Q ss_pred HHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHH Q lcl|NC_016762. 18 ARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWE 97 (456) Q Consensus 18 ~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e 97 (456) .+++|+|.|+++|+||+|++++..++|+..+++++|+++|++||++|+|||+||+||||+|++|.+.+.++ +.. T Consensus 1 ~~~~D~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~d~~~------~~~ 74 (437) T protein:vir:52 1 MKFFDGIKSLALKLGSKQEQTYYSPSLSLTDDLVQLEALWRDNWIANKVCIKRPEDMVRNWREIYSNDLNS------KQL 74 (437) T ss_pred CchhhhhHhHHhcCCCccccceeecCccccccHHHHHHHHHhCchhhHHhhcchHHhhcCCceEecCCCCH------HHH Confidence 77899999999999999999999999999999999999999999999999999999999999997643222 123 Q ss_pred HHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhh-hhccccccccCC Q lcl|NC_016762. 98 RKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKS-FDEKPDSETYGQ 176 (456) Q Consensus 98 ~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~-~~~Dp~s~~yg~ 176 (456) +++++++++|++|++|++|++|+|+||+|+|++.+ |++++++||+.+ ++++.|+|+++++++|.. .++||++|+||+ T Consensus 75 ~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~-d~~~~~~pl~~~-~~~~~~~v~~~~~v~~~~~~~~dp~s~~fg~ 152 (437) T protein:vir:52 75 DLFTKFERSLKLRETLTKALQWSSLYGSVGLLVVT-DSQNTSAPLKPT-ERLKRLIILPKWKISPTGTKDDDVLSPNFGR 152 (437) T ss_pred HHHHHHHHhhcHHHHHHHHHHhcccccceEEEEEe-cCCCcccccccC-CceeEEEEechhhccccccccccccccccCc Confidence 46889999999999999999999999999998866 788999999863 456778888888888654 457999999999 Q ss_pred ceeEEEeecccCCccccceeeehhhhheecC-------CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhh Q lcl|NC_016762. 177 PTMWEYTEASQAGRPGLVRDIHPDRVFILGD-------WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNF 249 (456) Q Consensus 177 P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~-------~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~ 249 (456) |++|+|+. + ..+++|||||||||.+ +.+||+|++|++|++|.++++++.+++++++++....+ T Consensus 153 p~~y~v~~----~--~~~~~iH~SRii~~~~~~~~~~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~v~---- 222 (437) T protein:vir:52 153 YSEYSILG----G--SQSITVHHSRLIILNANDAPLSDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKIDIF---- 222 (437) T ss_pred ceEEEEec----C--CcceeEccceeEEecCccCCCccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCCce---- Confidence 99999972 1 2357899999999964 56799999999999999999999999999999765544 Q ss_pred hhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeec Q lcl|NC_016762. 250 DKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVG 329 (456) Q Consensus 250 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G 329 (456) ++.++++.++.+.++...++.+.+..++++.+++++|++++|++++++||||++++++++++||++++||+|+||| T Consensus 223 ----k~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~e~~~~~~sgl~~~l~~~~~~iaaa~~iP~t~L~G 298 (437) T protein:vir:52 223 ----KIAGLSDKIAAGMENEVASVISAVQEIKSATNSLLLDAENEYDRKELTFTGLKDLLTEFRNAVAGAADMPVTILFG 298 (437) T ss_pred ----ecchHHHHhcCCcHHHHHHHHHHHHHhcCCCceEEEcCCcceEEEecCcCCHHHHHHHHHHHHHHHhcCchhhhcC Confidence 3345666666666666777777888899999999999999999999999999999999999999999999999999 Q ss_pred cCCCcccch-HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 330 MQTGERASS-EDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAA 408 (456) Q Consensus 330 ~sp~Glnst-~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~ 408 (456) +||+||+++ +|++|||++|+++||+.|+|.|++|+++|+++.+|+.+++|+|+|||||+||+||+||+++++|++++++ T Consensus 299 ~s~~Glasge~D~~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~~~~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~ 378 (437) T protein:vir:52 299 QSVSGLASGDEDIQNYHEAIRRLQETRLRPIFEIIDPLICNELFGGLPADWWFEFVPLTTVKQEQQINMLNTFATAANTL 378 (437) T ss_pred cCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCcCCcCHHHHHHHHHHHHHHHHHH Confidence 999999643 5999999999999999999999999999999999998899999999999999999999999999999999 Q ss_pred HHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCC--------CCC------CcCCCCCC Q lcl|NC_016762. 409 IGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDE--------DAA------RTDPTGEQ 455 (456) Q Consensus 409 ~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~--------~~~------~~d~~~~~ 455 (456) +++| +++++|+|+.+...+.....+.++.++... +++ ++.+++++ T Consensus 379 ~~~g--~i~~~e~r~~L~~~g~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (437) T protein:vir:52 379 IQNG--VLNEYQIANELRESGLFANISAEHIEELKNADEFAGNFEEPEKMEGAQVQNSEDQ 437 (437) T ss_pred HhcC--CCCHHHHHHHHHhcCCCCCCCccccccccCCCCCCCccCCCCCCCCCCCCCCCCC Confidence 9998 999999999876555544443332222111 111 11111111 No 3 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=100.00 E-value=8.7e-114 Score=640.41 Aligned_cols=426 Identities=14% Similarity=0.152 Sum_probs=339.8 Q ss_pred CCchhHHHHhHHHHHHH-HHHHHHHhhhhhccCcccc-hhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCC Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAI-ARARMSLLNQGIGHDAKRP-QAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTN 78 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~-~~~~d~~~n~~~~~gt~~~-~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~ 78 (456) -..+++.+ ..|...+. ....|+|.|+..++|+.+. .+...|+++..++.+||+++|++|||+|+|||+||+||||+| T Consensus 60 ~~~~~~~~-~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~gyql~alY~~~~l~rkiVd~pAeDa~R~g 138 (765) T protein:vir:96 60 KDFLEPGL-SVAMDSAYGDGPTPAAKAAAGGQNPYVVPTMLQDWYNSQGFIGYQACAIISQHWLVDKACSMSGEDAARNG 138 (765) T ss_pred CcccCccc-ceeccccccccccchHHHhhhccCccchhhHHHhhhcccCCccHHHHHHHHhCchhhhhhhcchHHhhcCC Confidence 11111111 01221111 2245778888888887664 456678888888889999999999999999999999999999 Q ss_pred CEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec--CCCCcccccc------CCcCcee Q lcl|NC_016762. 79 PQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR--DSQPWDRPAR------GKLNGLA 150 (456) Q Consensus 79 ~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~--D~~~~~~Pl~------~~~~~l~ 150 (456) ++|++.++... ....+.|++++++|++|++|++|++|+|+|||+++++.+. |++.|++||+ ++++||. T Consensus 139 ~~I~~~~~e~~----~~~~~~l~~~~~rl~v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l~~PL~~~~I~kg~~kgl~ 214 (765) T protein:vir:96 139 WELKSDGRKLS----DEQSALIARRDMEFRVKDNLVELNRFKNVFGVRIALFVVESDDPDYYEKPFNPDGIAPGSYKGIS 214 (765) T ss_pred ceeecCccccC----HHHHHHHHHHHHHhhHHHHHHHHHHHhhhceeeEEEEEecccCcchhhccccccccccceeeEEE Confidence 99975433222 2334569999999999999999999999999999988884 7888999995 3566777 Q ss_pred EEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC----------cCCCcchHHHHH Q lcl|NC_016762. 151 KVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----------TGDAIGFLEPAY 220 (456) Q Consensus 151 ~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----------~~~G~S~le~~~ 220 (456) .|.|+|....++.++++||++|+||+|++|+|+ +++||+||||+|.+. .+||+|++|++| T Consensus 215 vldp~~~~~~~v~e~~~Dp~sp~fg~P~~y~i~----------g~~IH~SRli~~~g~~lpd~lk~~~~~~G~Svlq~~y 284 (765) T protein:vir:96 215 QIDPYWAMPQLTAESTADPSAEHFYEPDFWIIS----------GKKYHRSHLVVVRGPQPPDILKPTYIFGGIPLTQRIY 284 (765) T ss_pred EechhhcccccchhccccccccccCcceeeeec----------CceeccceEEEecCCCchhhhccccCccCccHHHHHH Confidence 777888887777888899999999999999996 468999999999653 479999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEec Q lcl|NC_016762. 221 NSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVS 300 (456) Q Consensus 221 ~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~ 300 (456) ++|++++++++++++++++++++++.+++.+.+ + +.+++.+++ +.+..+++|.+++++|++|+|+++++ T Consensus 285 d~I~~~~~t~~~~a~Ll~k~~~~v~k~~~~~~l---------~-~~~~l~~r~-~~~~~~r~n~g~~~id~ee~~e~~s~ 353 (765) T protein:vir:96 285 ERVYAAERTANEAPLLAMSKRTSTIHVDVEKAI---------A-NEDAFNARL-AFWIANRDNHGVKVIGIDETMEQFDT 353 (765) T ss_pred HHHHHHHHHHHHHHHHHHHhccceeeechHhhh---------c-cHHHHHHHH-HHHHHhcCCceeEEecCCcceeEEec Confidence 999999999999999999999888866543321 1 234555555 45667788999999999999999999 Q ss_pred ccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccch--HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCc Q lcl|NC_016762. 301 AVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASS--EDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAE 378 (456) Q Consensus 301 ~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst--~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d 378 (456) +||||++++++++++|||+++||+|||||+||+|||+| +|++|||++|+++||+.|+|+|++|+++|++++.+ +++ T Consensus 354 ~lsgl~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~D~~nYyD~I~s~Qe~~l~p~le~L~~li~~s~~i--~~d 431 (765) T protein:vir:96 354 NLSDFDSVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEHETISYHEELESIQEHIFDPLLERHYLLLAKSESI--DVQ 431 (765) T ss_pred ccCCHHHHHHHHHHHHHhhhCCCeeeeccCCcccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--CCc Confidence 99999999999999999999999999999999999976 48999999999999999999999999999999655 458 Q ss_pred eEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCCCCCCcCCCC----- Q lcl|NC_016762. 379 FTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDEDAARTDPTG----- 453 (456) Q Consensus 379 ~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~----- 453 (456) |+|+|||||+||+|||||+++++|+++++|+++| +++++|+|+.+..++......+++++.+.+....+...+ T Consensus 432 ~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~G--vis~dEvR~~L~~~~~~g~~~l~d~~~e~~~~~~pe~~~~~~~~ 509 (765) T protein:vir:96 432 LEIVWNPVDSTTSQQQAELNNKKAATDEIYINSG--VVSPDEVRERLRDDPRSGYNRLTDDQAETEPGMSPENLAELEKA 509 (765) T ss_pred ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcC--CCCHHHHHHHHhccccCCCCCCCccccccccCCCccccccccCC Confidence 9999999999999999999999999999999999 999999999887665443332222211111000000000 Q ss_pred -------CCC Q lcl|NC_016762. 454 -------EQQ 456 (456) Q Consensus 454 -------~~e 456 (456) ..+ T Consensus 510 ~~~~~~~~~e 519 (765) T protein:vir:96 510 GAQSAKAKGE 519 (765) T ss_pred CcccccccCc Confidence 000 No 4 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=100.00 E-value=3e-113 Score=637.45 Aligned_cols=410 Identities=15% Similarity=0.130 Sum_probs=343.8 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhh-hhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCC Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLN-QGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNP 79 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n-~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~ 79 (456) |+.|.+ ..+++|+|.| ++++.|+.+++. ++...+++++|+++|++||++|+|||+||+||||+|+ T Consensus 5 m~~~~~----------~~~~~D~~~~~~~~~~g~~~~~~----~~~~~~~~~~l~~~Y~~~~l~~~~Vd~~aed~~r~g~ 70 (435) T protein:vir:79 5 MSDKVK----------AITKEDGYNEIFGSKDGTFRPNA----FYMQRAAFKALSQFYEEDGMARRIVDVIPEEMVTPGF 70 (435) T ss_pred cccccc----------cchhhcchhhhhcccccccccCc----ccCCcCCHHHHHHHHhcCchhhhhhccchHHhhcCCc Confidence 888832 3346799999 577788888753 4566789999999999999999999999999999999 Q ss_pred EEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCceeEEEEecccc Q lcl|NC_016762. 80 QVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGC 159 (456) Q Consensus 80 ~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~ 159 (456) +|.+ ++++ +++++++++|++|++|++|++|+|+||||+|++.++|++++++||+.. +.|++|+|+|+++ T Consensus 71 ~i~g-~~~~---------~~~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~-g~i~~i~v~d~~~ 139 (435) T protein:vir:79 71 KVDG-VKNE---------KSFKSRWDELRLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPG-AQLEDIRVYDRYQ 139 (435) T ss_pred eecC-CChH---------HHHHHHHHHhhHHHHHHHHHHhhhccccEEEEEEecCCCCcccccccC-CceeeEEeechhh Confidence 9853 2211 247788999999999999999999999999999999999999999864 4688899999999 Q ss_pred CChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecC----------CcCCCcchH-HHHHHHHHHHHH Q lcl|NC_016762. 160 LKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD----------WTGDAIGFL-EPAYNSFISLEK 228 (456) Q Consensus 160 ~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~----------~~~~G~S~l-e~~~~~l~~~~~ 228 (456) ++|..+++||++|+||+|++|+|++. ....+++||||||++|.+ +.+||.|+| |++|++|+++++ T Consensus 140 i~~~~~~~dp~sp~fg~P~~y~v~~~----~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~ 215 (435) T protein:vir:79 140 ITIHERETNARSVRYGEPKLYKISPG----GDIPEFFVHYSRICIIDGERVSNEKRRQNDGWGASILNKRLIEAIVDYNY 215 (435) T ss_pred ccchhhccCCcccccCcceEEEEecC----CCCCceEEcceeEEEecCCcchhhhccccCcccchHHHHHHHHHHHHHHH Confidence 99999999999999999999999732 223468999999999964 468999976 899999999999 Q ss_pred HHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCC--HHHHHHHHHHHHHHHhcCCCeEEecCC-CceeEEecccCCH Q lcl|NC_016762. 229 VEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVT--LDALNERFNEAARQLNRGNDVLLPTQG-ATVTQMVSAVSDP 305 (456) Q Consensus 229 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~lid~~-d~~~~~~~~~sgl 305 (456) ++.++++++++++++++.+ .++++.++.+ ..++.+++ ..+..++++.+.++++.+ ++|++++++|||| T Consensus 216 ~~~~~~~l~~~~~~~v~~~--------~~l~~~~~~~~~~~~~~~r~-~~~~~~~~~~~~~~i~~~~e~~e~~~~~lsgl 286 (435) T protein:vir:79 216 CQELATQLLRRKQQAVWKA--------RDLALMCDDEEGRYAARLRL-AQVDDESGVGKAIGIDATDEEYEVLNSDVSGV 286 (435) T ss_pred HHHHHHHHHHHhcCccccc--------hhHHHhhcCccchHHHHHHH-HHHHHhcCCCCceeEecCCcceEEEecccCCH Confidence 9999999999987776643 3444544433 23444444 344455666676666664 6799999999999 Q ss_pred HHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH--HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEe Q lcl|NC_016762. 306 GPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE--DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIW 383 (456) Q Consensus 306 ~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~--D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f 383 (456) ++++++++++|||+++||+|||||+||+|||||| |++|||++|+++||..++|+|++|+++++++ ++|+|+| T Consensus 287 ~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~li~~s------~d~~~~f 360 (435) T protein:vir:79 287 PEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNTALETFYKLIDRKRVEDYKPILEFLLPFMISE------TEWSIEF 360 (435) T ss_pred HHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC------CCCeEEe Confidence 9999999999999999999999999999999874 8999999999999999999999999999876 5899999 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHh----cccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 384 DDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEA----GYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 384 ~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~----~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) +|||+||+||+||+++++|+++++++++| +++++|+|+.+ ...+..+.+.++-++..|.++++..|++++| T Consensus 361 ~pL~~~sekEkAei~~~~a~a~~~~~~~g--~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~d~~~~~~~e~g~~~ 435 (435) T protein:vir:79 361 EPLSVPSDKDKAEIMAKNVESVVKLKAEQ--AINLKETRDTLRSICPDLKIMDNDNIELPEPEDLDPEPGQEGGLNK 435 (435) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHhcC--CCCHHHHHHHHHHhccccCCCCcccccCCccccCCCCCCCCCCCCC Confidence 99999999999999999999999999998 99999998865 3445555444443344566777788889888 No 5 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=100.00 E-value=1.6e-112 Score=633.55 Aligned_cols=429 Identities=14% Similarity=0.142 Sum_probs=337.6 Q ss_pred CCc----hhHHHHhHHHHH----HHHHHHHHHhhhhhccCcccchh-h--h----hccCcccCCHHHHHHHHhcCchhhh Q lcl|NC_016762. 1 MTD----KLDLAVNHAMSS----AIARARMSLLNQGIGHDAKRPQA-W--C----EYGFPQEITFNDLYTMYRRGGIAHG 65 (456) Q Consensus 1 ~~~----~~~~~~~~a~~~----~~~~~~d~~~n~~~~~gt~~~~~-~--~----~~~~~~~~~~~~l~~~Y~~~~l~r~ 65 (456) |.. .+.+|.++|.+. ....++|++.|.++++|+.+++. + + .++.+..+..++|+++|++|||+|+ T Consensus 72 ~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~~f~gyql~alY~~~~lark 151 (862) T protein:vir:99 72 VNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQGFIGHQACALIAQHWLVDK 151 (862) T ss_pred ccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccccccccccchhccccccccCcccHHHHHHHHhCchhhh Confidence 222 233455555433 33556799999999999987742 1 1 1122344566689999999999999 Q ss_pred hhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEe--cCCCCcccccc Q lcl|NC_016762. 66 AVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHI--RDSQPWDRPAR 143 (456) Q Consensus 66 iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i--~D~~~~~~Pl~ 143 (456) |||+||+||||+|++|.+..+.++. ..+..+.|++++++|++|++|++|++|+|+|||+++++.+ .|++.|++||+ T Consensus 152 iVd~pAeDatR~g~~I~~~~d~~e~--~~e~~~~ie~~~~rL~v~~~l~eair~~RLyGga~ililv~~~D~~~LsqPLn 229 (862) T protein:vir:99 152 ACSLAGEDAIRNGWHLKSLGEGEEI--DEESLEKFKAIDVEFKVKENLIEFNRFKNVFGIRVAIFVVDSEDPDYYEKPFN 229 (862) T ss_pred hhhhhhHHHhhCCceEeecCccccc--CHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEEecCcCchhhhcCcC Confidence 9999999999999999865543332 2345678999999999999999999999999999888776 47888999995 Q ss_pred ------CCcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecC---------- Q lcl|NC_016762. 144 ------GKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD---------- 207 (456) Q Consensus 144 ------~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~---------- 207 (456) ++++||..|.|+|..++++..+++||++|+||+|++|+|+ +++||+||||+|.+ T Consensus 230 ~e~I~kG~lkgl~vlDp~w~~p~~v~~~~~Dp~sp~yGkP~~y~I~----------g~~IH~SRliif~g~~vpd~lk~a 299 (862) T protein:vir:99 230 PDGITPGSYRGISQIDPYWMMPMLTAESTADPSSQFFYEPEFWIIS----------GQKYHRSHLIIARGPQPADILKPT 299 (862) T ss_pred cccccccceeEEEEechhhhcccccccccccccccccCCceeeeec----------CeeeccceeEEecCCCchhhhhcc Confidence 3567888888889888777888899999999999999986 46899999999964 Q ss_pred CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeE Q lcl|NC_016762. 208 WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVL 287 (456) Q Consensus 208 ~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (456) +.+||+|++|++|++|+++++++.++++++++++++++++++.. .+. +.+.+.+++ +++..+++|.+++ T Consensus 300 y~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~l~v~ktd~l~---------~l~-~ed~l~~r~-~~~~~~rdN~Gi~ 368 (862) T protein:vir:99 300 YIFGGIPLVQRIYERVYAAERTANEAPLLAMNKRTTAIHTDTAK---------AIA-NEDKFIQRL-MFWVRYRDNHAVK 368 (862) T ss_pred CCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeechhHh---------hhc-cHHHHHHHH-HHHHhccCcceeE Confidence 45799999999999999999999999999999888777544322 122 223445444 5677788889999 Q ss_pred EecCCCceeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccch--HHHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_016762. 288 LPTQGATVTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASS--EDQKYHNARCQARRVQELTFEINDLFA 365 (456) Q Consensus 288 lid~~d~~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst--~D~~nyyd~I~~~Qe~~lrp~L~~l~~ 365 (456) ++|++|+|++++++||||++++++++++|||+++||+|||||+||+|||+| +|++|||++|+++||+.|+|.|++|+. T Consensus 369 liD~eEe~e~ls~slSGL~dll~~~~q~IAaas~IP~tiLfGqspaGlnATGE~D~~nYyD~I~s~QE~~L~P~LerL~~ 448 (862) T protein:vir:99 369 VLGTDETMEQFDTSLADFDAVIMGQYQLVASIAKTPATKLLGTAPKGFNSTGEFETISYHEELESIQEHVYMPFLQRHYL 448 (862) T ss_pred EecCCCceeEEecccCChHHHHHHHHHHHHhhhCCCceeecccCcccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999999999999999999976 489999999999999999999999998 Q ss_pred HHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCC--- Q lcl|NC_016762. 366 HLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPE--- 442 (456) Q Consensus 366 ~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~--- 442 (456) +++.+ ++ .+++|+|+|+|||+||++|+||+++++|+++++++++| +++++|+|+.+...+......+++++.+ T Consensus 449 li~~~-lg-~~~d~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~sG--vispdEvR~~L~~~~~~g~~~l~ded~E~d~ 524 (862) T protein:vir:99 449 ISRLS-LG-IQHEIDVVMEPVASMTAQQQADLNKTKAEGGKVLIDGG--VISPDEERNRIRDDKRSGYNRLTKEDAEETP 524 (862) T ss_pred HHHHh-cC-CCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcC--CCCHHHHHHHHHhcCCcCCCCCCcccccccC Confidence 77654 44 35689999999999999999999999999999999998 9999999997644333222211111110 Q ss_pred ---CCCCCCc---------CCCCCCC Q lcl|NC_016762. 443 ---DEDAART---------DPTGEQQ 456 (456) Q Consensus 443 ---d~~~~~~---------d~~~~~e 456 (456) +++..+. .|..+.+ T Consensus 525 ~~~~e~~~~~e~~g~a~~~ap~de~~ 550 (862) T protein:vir:99 525 GASPENLAAYQKAGAAQETASAKETQ 550 (862) T ss_pred CCCcccccccccCCcccccccccccc Confidence 0000000 0000000 No 6 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=100.00 E-value=2.4e-112 Score=632.49 Aligned_cols=432 Identities=13% Similarity=0.130 Sum_probs=345.8 Q ss_pred CCc------hhHHHHhHHHHHH--HHHHHHHHhhhhh---ccCcccchhhh-hccCcccCCHHHHHHHHhcCchhhhhhc Q lcl|NC_016762. 1 MTD------KLDLAVNHAMSSA--IARARMSLLNQGI---GHDAKRPQAWC-EYGFPQEITFNDLYTMYRRGGIAHGAVE 68 (456) Q Consensus 1 ~~~------~~~~~~~~a~~~~--~~~~~d~~~n~~~---~~gt~~~~~~~-~~~~~~~~~~~~l~~~Y~~~~l~r~iVd 68 (456) |-. ++.+|++|++..+ ..+.+|++.|.++ |+||+|++.+. .|+++..++..+++++|++||++|+||| T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~r~~Vd 102 (532) T protein:vir:94 23 VDAKRATHTSLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEATSWPGFPTLALLAQLPEYRTMHE 102 (532) T ss_pred hhhhhhhhhhhhhhhhhhhcccccccccccccccccccccccCcccccccccccccccccchHHHHHHHHcCchhhhhhc Confidence 333 3346666666543 2335688888765 99999998765 5566788999999999999999999999 Q ss_pred cchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec---CCCCcccccc-- Q lcl|NC_016762. 69 KIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR---DSQPWDRPAR-- 143 (456) Q Consensus 69 ~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~---D~~~~~~Pl~-- 143 (456) +||+||||+|++|++.++++..+ ...++|++++++|++|++|+++++|+|+||||+++|.++ +..+++.|+. T Consensus 103 ~~aed~~r~~~~i~~~~~~~~~~---~~~~~i~~~~~~l~v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~ 179 (532) T protein:vir:94 103 TPADECVRAWGKITCSSKDELAA---DKATRITQKLEQYNVRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLLLS 179 (532) T ss_pred cchHHHhhCCceEeeCCccccch---HHHHHHHHHHHhhhHHHHHHHHHHhhhcccceEEEEEeccCCcccccccccccc Confidence 99999999999998765544322 233568888999999999999999999999999999985 2234666653 Q ss_pred ---CCcCceeEEEEeccccCChhhhh-ccccccccCCceeEEEeecccCCccccceeeehhhhheecC----------Cc Q lcl|NC_016762. 144 ---GKLNGLAKVTPAWAGCLKPKSFD-EKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD----------WT 209 (456) Q Consensus 144 ---~~~~~l~~i~~~~~~~~~~~~~~-~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~----------~~ 209 (456) ++.+++++|.|+|+++++|..++ .||+||+||+|++|+|. .+++||||||+||.+ +. T Consensus 180 ~~~I~~g~~~~l~vld~~~v~p~~~~~~dp~sp~fg~P~~y~v~---------~g~~iH~SRli~f~g~~~p~~~~~~~~ 250 (532) T protein:vir:94 180 PSFVQRGCLIGFATIEPMWLSPNAYNATDPTLPSFYKPDSWIAT---------SGKKIHSSRIHTVVGRPVGDMLKAAYS 250 (532) T ss_pred ccccccceeeEEEeechheecccccccccccccccCCceeEEEc---------cCeeeccceEEEecCCCchhhhccccc Confidence 45667888999999999999887 59999999999999985 257899999999964 35 Q ss_pred CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCC-HHHHHHHHHHHHHHHhcCCCeEE Q lcl|NC_016762. 210 GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVT-LDALNERFNEAARQLNRGNDVLL 288 (456) Q Consensus 210 ~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l 288 (456) +||+|++|++|++|+++++++.++++++++++...++ +++ ++.+..+ .+.+.+++ +.+..++++.++++ T Consensus 251 ~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k------~~~---a~~ls~~~~~~~~~r~-~~~~~~~~n~g~~~ 320 (532) T protein:vir:94 251 FRGVSISQLAMPYVDNWLRTRQSVSDTVKQFSMTNLA------TDM---AQLLAPGGAQSLDARL-QLFNLYRDNRNIGA 320 (532) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceee------ech---HHhhcchhHHHHHHHH-HHHHhhcCCccceE Confidence 7999999999999999999999999999987655432 233 3333333 33444444 56677788899999 Q ss_pred ecCC-CceeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccch--HHHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_016762. 289 PTQG-ATVTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASS--EDQKYHNARCQARRVQELTFEINDLFA 365 (456) Q Consensus 289 id~~-d~~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst--~D~~nyyd~I~~~Qe~~lrp~L~~l~~ 365 (456) ++++ ++|++++++||||++++++++++|||+++||+|||||+||+||||| +|++|||++|+++||+.|+|+|++|++ T Consensus 321 id~~~e~~e~~~~~lsgl~~~l~~~~~~iAaa~~IP~t~LfG~sp~GlnstGe~D~~~yyd~I~s~Qe~~l~p~le~l~~ 400 (532) T protein:vir:94 321 LDKGTEEIQQTNTPLSGLDSLQAQSQEQMAAVSHIPLVKLLGITPNGLNASSDGEIRVWYDFIAGYQATNLTPLMEWIID 400 (532) T ss_pred EcCCCceeEEEecccCCHHHHHHHHHHHHHhHhCCCeeeeecCCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9974 7899999999999999999999999999999999999999999986 489999999999999999999999999 Q ss_pred HHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCC----cccCC Q lcl|NC_016762. 366 HLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPL----PDTEP 441 (456) Q Consensus 366 ~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~----~~~~~ 441 (456) +|+++.+|.++++|+|+|+|||+||+||+||+++++|+++++++++| +++++|+|+.++.++....... ++.++ T Consensus 401 ~l~~s~~g~~~~d~~~~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G--vi~~~Evr~~l~~~~~~~~~~~~~~~~~~~~ 478 (532) T protein:vir:94 401 LIQLSEYGQIDPGLAWEWSPLMELDDKELAEVRQLNASTDSTLMELG--VIDAKMVQQRLAADPTSGYAGALGERDELDD 478 (532) T ss_pred HHHHHhcCCCCCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcC--CCCHHHHHHHHhcCCcccccccccccccccc Confidence 99999999888899999999999999999999999999999999988 9999999998876665322111 00000 Q ss_pred CC----------CCCCCcCCC--------CCCC Q lcl|NC_016762. 442 ED----------EDAARTDPT--------GEQQ 456 (456) Q Consensus 442 ~d----------~~~~~~d~~--------~~~e 456 (456) ++ ++++.++|. .+++ T Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 511 (532) T protein:vir:94 479 VEEIAKQLMAAALNPPATAPQTPNPQPDSEDDQ 511 (532) T ss_pred ccchhhhhcccccCCCCCCCCCCCCCCCCCCCC Confidence 00 011111111 1111 No 7 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=100.00 E-value=4.1e-112 Score=631.23 Aligned_cols=401 Identities=14% Similarity=0.133 Sum_probs=331.6 Q ss_pred HHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHH Q lcl|NC_016762. 18 ARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWE 97 (456) Q Consensus 18 ~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e 97 (456) .--.|+|.|.++|++.+. ..|+++..+++++|+++|++|||+|||||+||+||||+|++|.+. ++ . T Consensus 1 ~~~~D~~~n~~~gg~~~~----~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~~~-~~-~-------- 66 (422) T protein:vir:10 1 MVKTDSYANIFLGGSDGS----EIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGI-DD-E-------- 66 (422) T ss_pred CccchhhHHHHcCCCCCc----cccCcccccCHHHHHHHHHhChhhHHHHhhhhHHHhcCCccccCC-CH-H-------- Confidence 333799999998865543 237888999999999999999999999999999999999998532 11 1 Q ss_pred HHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccccccCCc Q lcl|NC_016762. 98 RKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQP 177 (456) Q Consensus 98 ~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P 177 (456) .+++..+++|++|++|++|++|+|+||||+|++.++|++.+++||+.. +.|+.|+|+|+++++|..+++||++|+||+| T Consensus 67 ~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~-g~~~~l~v~d~~~i~~~~~~~dp~s~~fg~P 145 (422) T protein:vir:10 67 PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREG-AELETVRVYDRTQVKVQTREENPRNARFGEP 145 (422) T ss_pred HHHHHHHHHhhHHHHHHHHHHhhccccceEEEEEecCCCCcccccccc-CceeeEEeeccccccchhcccCccccccCcc Confidence 236777889999999999999999999999999999999999999853 4688899999999999999999999999999 Q ss_pred eeEEEeecccCCccccceeeehhhhheecC----------CcCCCcchHHH-HHHHHHHHHHHHHHHHHHHHHHhhhhhh Q lcl|NC_016762. 178 TMWEYTEASQAGRPGLVRDIHPDRVFILGD----------WTGDAIGFLEP-AYNSFISLEKVEGGSGESFLKNAARQLL 246 (456) Q Consensus 178 ~~y~i~~~~~~g~~~~~~~IH~SRli~~~~----------~~~~G~S~le~-~~~~l~~~~~~~~~~~~~~~~~~~~~l~ 246 (456) ++|+|... ..+.+++|||||||+|.+ +.+||.|++++ +|++|.+++++++++++++++++++++. T Consensus 146 ~~y~v~~~----~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~~~~~v~~ 221 (422) T protein:vir:10 146 LTYRITTN----ESDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKRKQQAVWK 221 (422) T ss_pred eEEEEecC----CCCcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 99999732 233468999999999954 35799999975 8999999999999999999999877664 Q ss_pred hhhhhhccHhhHHhhhcCC--HHHHHHHHHHHHHHHhcCCCeEEecC-CCceeEEecccCCHHHHHHHHHHHHHhhhcCC Q lcl|NC_016762. 247 LNFDKEINLGEIASTYGVT--LDALNERFNEAARQLNRGNDVLLPTQ-GATVTQMVSAVSDPGPTYNVNLQTAAAGVDIP 323 (456) Q Consensus 247 ~~~~~~~~~~~l~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~lid~-~d~~~~~~~~~sgl~~~~~~~~~~~aaas~IP 323 (456) ++ .++++++.+ ..++++++. .+...+++.+.+++++ +++|++++++||||++++++++++|||+++|| T Consensus 222 ~~--------~l~~~~~~~~~~~~~~~r~~-~~~~~~~~~~~~~l~~~~e~~e~~~~~lsgl~~~~~~~~~~iaaa~~IP 292 (422) T protein:vir:10 222 AK--------GLAELCDDSEGFGAARLRLA-QVDNNSGVGQAIGIDAESEEYSVLNSDIGGIDAFLDKKFDRIVALSGIH 292 (422) T ss_pred ch--------hHHHhcCCccchHHHHHHHH-HHHHhcCCccceeEecCCcceEEEecccCChHHHHHHHHHHHHhhhCCC Confidence 43 344444332 334555543 3444555666666666 58899999999999999999999999999999 Q ss_pred eEEeeccCCCcccch--HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_016762. 324 TKILVGMQTGERASS--EDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTM 401 (456) Q Consensus 324 ~t~L~G~sp~Glnst--~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~ 401 (456) +|||||+||+||||| +|++|||++|+++||+.|+|+|++|+++|+++ .+|+|+|||||+||+|||||+++++ T Consensus 293 ~t~L~G~s~~Glnatgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~~i~~s------~~~~~~f~pL~~~sekekaei~~~~ 366 (422) T protein:vir:10 293 EIILKNKNVGGVSSSQNTALETFHKLVDRKRNAELLPILEFLIPFIVNA------EEWSVEFNPLAQESSKDKAEILEKN 366 (422) T ss_pred eeeeccCCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc------CCcEEEeCCCCCCCHHHHHHHHHHH Confidence 999999999999987 48999999999999999999999999999875 4899999999999999999999999 Q ss_pred HHHHHHHHHcCCcCcCHHHHHHHh----cccCCCCCCCCcccCCCC-CCCCCcCCCCC Q lcl|NC_016762. 402 SEINSAAIGTGEPVFTAEEIREEA----GYDPLQGGDPLPDTEPED-EDAARTDPTGE 454 (456) Q Consensus 402 A~a~~~~~~~g~~~i~~~E~R~~~----~~~~~~~~~~~~~~~~~d-~~~~~~d~~~~ 454 (456) |+++++++++| +++++|+|+.+ .+.++.+++.+++++.++ .+.++.+|..+ T Consensus 367 a~a~~~~~~~g--~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 422 (422) T protein:vir:10 367 VNSIAALIAAG--AMDIDEARDTLRTIAPEVKINDGSVETEVTISETSNDPLEVPTDD 422 (422) T ss_pred HHHHHHHHhcC--CCCHHHHHHHhhhhcccccCCCCCCccccchhhcCCCCCCCCCCC Confidence 99999999998 99999999866 445555555444443333 23333344444 No 8 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=100.00 E-value=3.1e-110 Score=620.91 Aligned_cols=429 Identities=14% Similarity=0.132 Sum_probs=341.1 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccC--cccch-hhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhC Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHD--AKRPQ-AWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKT 77 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~g--t~~~~-~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~ 77 (456) |.. +|.|-.....+.++++.|+++|+| +.++. .++.|+++..+++++|+++|++||++|+|||+||+||||+ T Consensus 1 ~~~-----~~~a~~~~~~~~a~~~~~~~~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~~~l~r~iVd~~a~d~~r~ 75 (461) T protein:vir:80 1 MYS-----IDKAKQAKIDSKIVNRNDFMVGHGKANSRDKLTRQTPGNGQKLDLKACENLYASNSIAMNIVDIISEDMVRA 75 (461) T ss_pred Ccc-----chhhhhhhhhhhhhhhhHHHhhcCCcchhhhhhccccCcccccCHHHHHHHHHhCCccchhhccchHHhhcC Confidence 543 233444445567889999988887 44665 5788999999999999999999999999999999999999 Q ss_pred CCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCC----cccccc-CCcCceeEE Q lcl|NC_016762. 78 NPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQP----WDRPAR-GKLNGLAKV 152 (456) Q Consensus 78 ~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~----~~~Pl~-~~~~~l~~i 152 (456) |++|.+.+ ++. .+.|++.+++|++|++|+++++|+|+||+|+|+|.++|+++ +.+||+ .+.+||++| T Consensus 76 g~~i~~~~-~~~-------~~~~~~~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl~~~~~~~~~~l 147 (461) T protein:vir:80 76 GWSLKTDN-KEM-------KKNIESKWRKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLSTAIDPKTIKSIPYI 147 (461) T ss_pred CeeeecCC-HHH-------HHHHHHHHHHhhHHHHHHHHHHhhcccccEEEEEEeecCCccccCccCCcccccccceeEE Confidence 99996532 221 23578889999999999999999999999999999987654 567885 577899999 Q ss_pred EEeccccCChhhhhccccccccCCceeEEEeecccCC-------ccccceeeehhhhheecCC----cCCCcchHHHHHH Q lcl|NC_016762. 153 TPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAG-------RPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYN 221 (456) Q Consensus 153 ~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g-------~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~ 221 (456) +|+|+.++++..+++||++|+||+|++|+|......+ ....+++||+||||+|.+. ..||+|++|++|+ T Consensus 148 ~~~~~~~i~~~~~~~dp~sp~fg~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SRii~~~~~~~~~~~~G~S~le~~~~ 227 (461) T protein:vir:80 148 NTFNTQKVTQLYLNQDMFSEHFGEVEFFEVNRVSQLGEEILSGTTASTSEQIHRSRIIHEQGLRFEGETKGRSIFESLYD 227 (461) T ss_pred EeccccccchhhhcccCcCcccccceEEEEeccccccccccccccCccceEEccccEEEecCCCCCccccCcchHHHHHH Confidence 9999999999999999999999999999997543321 1234688999999999653 4689999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEecc Q lcl|NC_016762. 222 SFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSA 301 (456) Q Consensus 222 ~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~ 301 (456) +|+++++++.+++++++++....++++ .+....+....++ ..++..+++|.++++++++++|++++++ T Consensus 228 ~l~~~~~~~~~~~~l~~~~~~~v~k~~--------~l~~~~~~~~~~~----~~~~~~~~~~~g~~~~d~~e~~e~~~~~ 295 (461) T protein:vir:80 228 IITVMDTSLWSVGQILYDFAFKVYKTD--------DIDALNKDDKANL----TAMLDFMFRTEALAIIKGDEQLTKESTN 295 (461) T ss_pred HHHHHHHHHHHHHHHHHHhCCCceecc--------hHHhhhchHHHHH----HHHHHHhcCCceEEEEcCCcceEEEecC Confidence 999999999999999999876555332 2222222222222 3344567789999999999999999999 Q ss_pred cCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccch-HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCc------C Q lcl|NC_016762. 302 VSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASS-EDQKYHNARCQARRVQELTFEINDLFAHLMRIGVV------P 374 (456) Q Consensus 302 ~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst-~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~------~ 374 (456) ||||+++++++++.|||+++||+|+|||+||||+++. +|++|||++|+++||+.++|.|++|+++|+++.++ | T Consensus 296 lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asge~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p 375 (461) T protein:vir:80 296 VSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGAQYDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDP 375 (461) T ss_pred cCCHHHHHHHHHHHHhhhhcCCeeeeecccCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCc Confidence 9999999999999999999999999999999777643 59999999999999999999999999999998766 2 Q ss_pred CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhc-ccCCCCCCCCcccCCCCCCCCC-cCCC Q lcl|NC_016762. 375 LKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAG-YDPLQGGDPLPDTEPEDEDAAR-TDPT 452 (456) Q Consensus 375 ~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~-~~~~~~~~~~~~~~~~d~~~~~-~d~~ 452 (456) .+.+|+|+|+|||+||+||+||+++++|+++++++++| +|+++|+|+.+. ..++......+.+.++.++..+ .++. T Consensus 376 ~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g--~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (461) T protein:vir:80 376 DSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVNG--VLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLVYDA 453 (461) T ss_pred cccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcC--CCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhcccc Confidence 23589999999999999999999999999999999998 999999998653 2222211111111112121111 1112 Q ss_pred CCCC Q lcl|NC_016762. 453 GEQQ 456 (456) Q Consensus 453 ~~~e 456 (456) .++| T Consensus 454 ~~~e 457 (461) T protein:vir:80 454 YAKK 457 (461) T ss_pred cccc Confidence 2222 No 9 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=100.00 E-value=7.7e-111 Score=624.25 Aligned_cols=428 Identities=11% Similarity=0.122 Sum_probs=337.3 Q ss_pred CCc--hhHHHHh----------------------HHHHH--HHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHH Q lcl|NC_016762. 1 MTD--KLDLAVN----------------------HAMSS--AIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLY 54 (456) Q Consensus 1 ~~~--~~~~~~~----------------------~a~~~--~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~ 54 (456) ++. +...|.. ..+.. |+..+.+++.|++++.|+.+++++..++|+..+++++|+ T Consensus 28 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 107 (537) T protein:vir:10 28 FGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAFIGHQMC 107 (537) T ss_pred CcccchhhHHHHHHhhhhccCCCCCccCcccccccccccchhccccccchhhhhhhccccccchhhhhccccCCccHHHH Confidence 211 1111100 00011 223344556778888899999999999999999999999 Q ss_pred HHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec- Q lcl|NC_016762. 55 TMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR- 133 (456) Q Consensus 55 ~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~- 133 (456) ++|++||++|+|||+||+||||+|++|++.+..+. ..+..++|++++++|++|++|+++++|+|+|||++++|.++ T Consensus 108 a~Y~~~~l~r~iVd~~A~d~~r~~~~i~~~~~~~~---~~~~~~~l~~~~~~l~~~~~l~~a~~~~rlyG~~~i~i~v~~ 184 (537) T protein:vir:10 108 ALIATHWLVNKACSQMPRDAMRKGYKIISDDGNEL---DPKDAKFIDRYDRAFNIKKHAIQFVRKGRIFGIRIALFKVDS 184 (537) T ss_pred HHHHhCchhhhhhhhhhHHhhcCCceeecCCcccc---cHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEeecC Confidence 99999999999999999999999999976543322 22334678999999999999999999999999999999885 Q ss_pred -CCCCccccccC------CcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheec Q lcl|NC_016762. 134 -DSQPWDRPARG------KLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILG 206 (456) Q Consensus 134 -D~~~~~~Pl~~------~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~ 206 (456) |++.+++||+. ++++|..|.|.|..+..+..+.+||+||+||+|++|+|+ +++||||||++|. T Consensus 185 ~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P~~y~v~----------g~~iH~SRli~f~ 254 (537) T protein:vir:10 185 PDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEPTYWLIN----------GKKYHRSHLAIYI 254 (537) T ss_pred cCCcccccccccccccccceeEEEEechhhcccccchhhhccCCccccCCceeeeec----------CeEecceeEEEec Confidence 88889999952 345555555566555555567789999999999999986 5789999999996 Q ss_pred C----------CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHH Q lcl|NC_016762. 207 D----------WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEA 276 (456) Q Consensus 207 ~----------~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 276 (456) + +.+||+|++|++|++|+++++++.++++++++++.+++.+++.. .++ +.+.+.++ .++ T Consensus 255 g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~---------~l~-~~~~~~~r-~~~ 323 (537) T protein:vir:10 255 NDEVVDFLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQ---------VLA-NKQQFDET-MSW 323 (537) T ss_pred CCCCchhhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHH---------hhc-CHHHHHHH-HHH Confidence 4 34799999999999999999999999999999998877654322 122 22334433 456 Q ss_pred HHHHhcCCCeEEecCC-CceeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccch--HHHHHHHHHHHHHHH Q lcl|NC_016762. 277 ARQLNRGNDVLLPTQG-ATVTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASS--EDQKYHNARCQARRV 353 (456) Q Consensus 277 ~~~~~~~~~~~lid~~-d~~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst--~D~~nyyd~I~~~Qe 353 (456) +..++++.++++++++ ++|++++++|||+++++++++++|||+++||+|||||+||+||||| +|.+|||++|+++|+ T Consensus 324 ~~~~r~n~g~~~id~e~e~~e~~~~~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe~D~~~yyd~I~~~Qe 403 (537) T protein:vir:10 324 WTATRDNYQVRVVDKDNEDVVQIDTTLNDLDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTGDYEEASYHEECESTQD 403 (537) T ss_pred HHhhcCCcceeEecCCCceeEEEeccCCCHHHHHHHHHHHHHhhhCCCceeeccCCccccccchhHHHHHHHHHHHHHHH Confidence 6778889999999996 8899999999999999999999999999999999999999999986 489999999999999 Q ss_pred hhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccC---- Q lcl|NC_016762. 354 QELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDP---- 429 (456) Q Consensus 354 ~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~---- 429 (456) .|+|.|++|+++|+++.+++ +.+|+|+|+|||+||+|||||+++++|+++++++++| +++++|+|+.+..++ T Consensus 404 -~l~p~l~~l~~ll~~~~~~~-~~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G--~i~~~Evr~~L~~~~~~g~ 479 (537) T protein:vir:10 404 -DMRPLIDRHHQLVCRSHLRK-RIRVKVEFPPMDAPKESERADTFLKKMQAAKLAFEMG--AVDGVDVNEYLRMDPTLGF 479 (537) T ss_pred -HHHHHHHHHHHHHHHhcCCC-CcceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcC--CCCHHHHHHHHhccCcccc Confidence 59999999999999999997 5689999999999999999999999999999999998 999999999876644 Q ss_pred --CCCCCCCcccCCCC--C-------CCCCcCCCC----CCC Q lcl|NC_016762. 430 --LQGGDPLPDTEPED--E-------DAARTDPTG----EQQ 456 (456) Q Consensus 430 --~~~~~~~~~~~~~d--~-------~~~~~d~~~----~~e 456 (456) +...+++++.++.. + .+.+++|+. ..+ T Consensus 480 ~~l~~~~~~ed~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 521 (537) T protein:vir:10 480 TSITPAMRPTDAEDIDVDDEGKPVRIIEDQPAPSEMFGATSS 521 (537) T ss_pred ccccCCCChhhhhcccCCccCCcCCCCCCCCCccccCCCCcc Confidence 32222322222211 0 111111110 000 No 10 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=100.00 E-value=2.8e-110 Score=621.19 Aligned_cols=404 Identities=15% Similarity=0.093 Sum_probs=323.5 Q ss_pred HHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHH Q lcl|NC_016762. 16 AIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETE 95 (456) Q Consensus 16 ~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~ 95 (456) -....+|+|.|++.+ |+... . ++........+|+++|++|||+|||||+||+||||+|++|.+. ++ . T Consensus 1 ~~~~~~d~~~~~~~~-~~~~~--~--~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~-~~-~------ 67 (427) T protein:vir:10 1 MKIVKHDGYNDIFNG-GADGS--P--KPFFMSDASYHVGSFYNDNATAKRIVDVIPEEMVTAGFKMSGV-KD-E------ 67 (427) T ss_pred CCccccchHHHHhhc-CCCCc--c--cCccccCchHHHHHHHHcCchhhhhhccchHHhhcCCccccCc-cH-H------ Confidence 223468999997533 32221 1 2222344556899999999999999999999999999999542 21 1 Q ss_pred HHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccccccC Q lcl|NC_016762. 96 WERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYG 175 (456) Q Consensus 96 ~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg 175 (456) +++++++++|++|++|++|++|+|+||||+|++.++|++++++|+.. .++|++|+|+|+++++|+.+++||++|+|| T Consensus 68 --~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~-~g~l~~l~v~d~~~~~~~~~~~dp~s~~fg 144 (427) T protein:vir:10 68 --KEFKSLWDSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKP-GAKLEGVRVYDRFAITVEKRVTNARSPRYG 144 (427) T ss_pred --HHHHHHHHHhhHHHHHHHHHHhccccceeEEEEEecCCCccccccCC-CcceeEEEEechhcccccccccCccccccC Confidence 24788889999999999999999999999999999999999999974 357889999999999999999999999999 Q ss_pred CceeEEEeecccCCccccceeeehhhhheecC----------CcCCCcchH-HHHHHHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_016762. 176 QPTMWEYTEASQAGRPGLVRDIHPDRVFILGD----------WTGDAIGFL-EPAYNSFISLEKVEGGSGESFLKNAARQ 244 (456) Q Consensus 176 ~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~----------~~~~G~S~l-e~~~~~l~~~~~~~~~~~~~~~~~~~~~ 244 (456) +|++|+|+. +...++++||||||++|.+ +.+||.|+| |++|++|+++++++++++++++++++++ T Consensus 145 ~P~~y~v~~----~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~v 220 (427) T protein:vir:10 145 EPEIYKVSP----GDNMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSLIDAICDYDYCESLATQILRRKQQAV 220 (427) T ss_pred cceEEEEec----CCCCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 999999973 2233468999999999964 357999987 5799999999999999999999998776 Q ss_pred hhhhhhhhccHhhHHhhhcCC--HHHHHHHHHHHHHHHhcCCCeEEecC-CCceeEEecccCCHHHHHHHHHHHHHhhhc Q lcl|NC_016762. 245 LLLNFDKEINLGEIASTYGVT--LDALNERFNEAARQLNRGNDVLLPTQ-GATVTQMVSAVSDPGPTYNVNLQTAAAGVD 321 (456) Q Consensus 245 l~~~~~~~~~~~~l~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~lid~-~d~~~~~~~~~sgl~~~~~~~~~~~aaas~ 321 (456) +.++ +++++++.+ ..++++++. .+..++++.+.+++++ +++|++++++||||++++++++++|||+++ T Consensus 221 ~k~~--------~l~~~~~~~~~~~~~~~r~~-~~~~~~~~~~~~~l~~~~e~~e~~~~~lsgl~~~~~~~~~~iaaa~~ 291 (427) T protein:vir:10 221 WKVK--------GLAEMCDDDDAQYAARLRLA-QVDDNSGVGRAIGIDAETEEYDVLNSDISGVPEFLSSKMDRIVSLSG 291 (427) T ss_pred ccch--------hHHHHhcCccchHHHHHHHH-HHHHhcCcccceeeecCCCceeEEecccCChHHHHHHHHHHHHhhhC Confidence 6432 344444332 234455543 3445566666666665 588999999999999999999999999999 Q ss_pred CCeEEeeccCCCcccch--HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHH Q lcl|NC_016762. 322 IPTKILVGMQTGERASS--EDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSK 399 (456) Q Consensus 322 IP~t~L~G~sp~Glnst--~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~ 399 (456) ||+|||||+||+||||| +|++|||++|+++||+.|+|+|++|+++|+++ ++|+|+|||||+||++||||+++ T Consensus 292 IP~t~L~G~sp~Glnstgd~D~~nyyd~i~~~Qe~~l~p~l~~l~~~i~~s------~~~~~~f~pL~~~s~kEkaei~~ 365 (427) T protein:vir:10 292 IHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVDE------EEWSIEFEPLSVPSKKEESEITK 365 (427) T ss_pred CCeeeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC------CCcEEEeCCCCCCCHHHHHHHHH Confidence 99999999999999987 48999999999999999999999999999876 48999999999999999999999 Q ss_pred HHHHHHHHHHHcCCcCcCHHHHHHHh----cccCCCCCCCCcccCCC--CCCCCCcCCCCCCC Q lcl|NC_016762. 400 TMSEINSAAIGTGEPVFTAEEIREEA----GYDPLQGGDPLPDTEPE--DEDAARTDPTGEQQ 456 (456) Q Consensus 400 ~~A~a~~~~~~~g~~~i~~~E~R~~~----~~~~~~~~~~~~~~~~~--d~~~~~~d~~~~~e 456 (456) ++|+++++++++| +++++|+|+.+ +++++.++....+++.+ .+.++.++++-++| T Consensus 366 ~~a~a~~~~~~~g--vi~~~e~r~~L~~~~~~~~~~~~~~~~~e~~~~~~e~~p~~~e~~~d~ 426 (427) T protein:vir:10 366 NNVESVTKAITEQ--IIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKLEDE 426 (427) T ss_pred HHHHHHHHHHhcC--CCCHHHHHHHHHhhhccccCCCCccccccccchhcCCCCCCCCCCCCC Confidence 9999999999999 99999999865 45666544433322222 22222223333344 No 11 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=100.00 E-value=2.4e-110 Score=621.51 Aligned_cols=433 Identities=12% Similarity=0.109 Sum_probs=337.2 Q ss_pred CCchhHHHHhHHHHHHHHH--HHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCC Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIAR--ARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTN 78 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~--~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~ 78 (456) =++.++|++++-+.-..-. .+..++-..-.-|+..|.+ .|.+...+--++.++.-.++|..|++|++++++|||+| T Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w 140 (698) T protein:vir:10 63 PSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDAL--SFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTW 140 (698) T ss_pred CCccccccccceeccccCCccccchhhhhhcccccccccc--hhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhccc Confidence 5678888888754332110 0111111111123444443 23334444445666677889999999999999999999 Q ss_pred CEEecCCCcchh------------hhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEe-cCCCCccccc--c Q lcl|NC_016762. 79 PQVIEGDDQDRS------------KDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHI-RDSQPWDRPA--R 143 (456) Q Consensus 79 ~~i~~~~~~d~~------------~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i-~D~~~~~~Pl--~ 143 (456) ++++++.+.+-. ....+..++|+++++||+||++|+++++|+|+|||++++|+| +|.+.+++|| + T Consensus 141 ~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eai~~aRlfGGa~~~i~I~gdd~~l~~PL~~~ 220 (698) T protein:vir:10 141 GEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPR 220 (698) T ss_pred ceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEEeecCccccccccccc Confidence 998766444311 122244567999999999999999999999999999999988 3556688888 3 Q ss_pred ---CCcCceeEEEEeccccCChhhhhc-cccccccCCceeEEEeecccCCccccceeeehhhhheecC----------Cc Q lcl|NC_016762. 144 ---GKLNGLAKVTPAWAGCLKPKSFDE-KPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD----------WT 209 (456) Q Consensus 144 ---~~~~~l~~i~~~~~~~~~~~~~~~-Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~----------~~ 209 (456) ++.++++.|.|++++.++|..++. ||++|+||+|++|+|. +.+||+|||++|.+ +. T Consensus 221 ~~~I~kGslKGL~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~----------G~~IH~SRL~~~vg~pvpd~LKp~y~ 290 (698) T protein:vir:10 221 PYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI----------GSEVHATRLHTIVSRPVGDMLKPTYS 290 (698) T ss_pred cccccCccceeeeeecccccccchhhhccchhhccCCCceEEEe----------cceecceeEEEecCCCchhhhcchhc Confidence 455666667777777788998885 9999999999999997 45799999999965 46 Q ss_pred CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEe Q lcl|NC_016762. 210 GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLP 289 (456) Q Consensus 210 ~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~li 289 (456) +||+|++|++|++|.++++++.++++++++.+++.+. ++|++.++.+.......+.+.++.+++|++++++ T Consensus 291 f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~~~~l~---------~dla~aL~~g~~~~l~~R~eli~~~Rsn~G~~ll 361 (698) T protein:vir:10 291 FAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGIL---------MDLAQALTPGANVDLSMRAELINRYRDNRNILFL 361 (698) T ss_pred cCCccHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHH---------HHHHHhcCChhhHHHHHHHHHHHHhcCccceEEE Confidence 8999999999999999999999999999877665552 3466666655554444445788899999999999 Q ss_pred cC-CCceeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH--HHHHHHHHHHHHHHhhhhHHHHHHHHH Q lcl|NC_016762. 290 TQ-GATVTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE--DQKYHNARCQARRVQELTFEINDLFAH 366 (456) Q Consensus 290 d~-~d~~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~--D~~nyyd~I~~~Qe~~lrp~L~~l~~~ 366 (456) |+ +|+|++++++||||++|+++|+|+|||+++||+||||||||+|||+|| |++||||+|+++||+.|+|+|++|+++ T Consensus 362 Dk~~Eefeq~st~lSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~i 441 (698) T protein:vir:10 362 DKATEEFFQFNTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVM 441 (698) T ss_pred ecCCcceEEEecCcCCHHHHHHHHHHHHHhhhcCchhhhhccCCcccCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 96 688999999999999999999999999999999999999999999885 899999999999999999999999999 Q ss_pred HHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCC--CcccCC--- Q lcl|NC_016762. 367 LMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDP--LPDTEP--- 441 (456) Q Consensus 367 l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~--~~~~~~--- 441 (456) |++|.+|..+++|+|+||||||||++|+|||++|+|+++++|++.| +|+++|+|+.+..++...+.. +.+++| T Consensus 442 i~rS~~G~idp~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~g--vI~~~evr~rL~~d~~s~Y~~~~d~~d~p~~~ 519 (698) T protein:vir:10 442 IQLSLFGAVDPSIKWQWNALRELDDLEVAEARYKQAQSDVLYVQEQ--VIRPDQVAARLNTEPDGPYAGKLDANDDPGAP 519 (698) T ss_pred HHHHhcCCCCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhc--CCCHHHHHHHHhccCCCccccccCCcccCCCC Confidence 9999999988899999999999999999999999999999999999 999999999987776555422 211111 Q ss_pred CCCCC----------CCcCCCCCCC Q lcl|NC_016762. 442 EDEDA----------ARTDPTGEQQ 456 (456) Q Consensus 442 ~d~~~----------~~~d~~~~~e 456 (456) +|++. .++.+.++.+ T Consensus 520 ~~~~~~~~~~~~~~~~~~~~~~~~~ 544 (698) T protein:vir:10 520 ADDDIDGVLTYVQRMAEGGDTGAPT 544 (698) T ss_pred CCCcchHHHhhhcCCcCCCCccccc Confidence 11111 1111111111 No 12 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=100.00 E-value=6.9e-110 Score=619.06 Aligned_cols=433 Identities=12% Similarity=0.098 Sum_probs=339.1 Q ss_pred CCchhHHHHhHHHHHHHHH--HHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCC Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIAR--ARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTN 78 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~--~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~ 78 (456) =++.++|+.++.+....-. -+..++-..-.-|+..|.+ .|.+...+--++.++.-.++|..|++|++++++|||+| T Consensus 62 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w 139 (694) T protein:vir:10 62 PSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDAL--SFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTW 139 (694) T ss_pred CCcchhhhhhccccccCCCccccchhhhhhccCcccccch--hhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhccc Confidence 5678899988876543210 0111111111124444443 23344444445666677889999999999999999999 Q ss_pred CEEecCCCcchh------------hhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEe-cCCCCccccc--c Q lcl|NC_016762. 79 PQVIEGDDQDRS------------KDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHI-RDSQPWDRPA--R 143 (456) Q Consensus 79 ~~i~~~~~~d~~------------~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i-~D~~~~~~Pl--~ 143 (456) ++++++.+.+-. ....+..++|+++++||+||++|+++++|+|+|||++++|++ +|++.+++|| + T Consensus 140 ~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eaik~aRlfGGa~~~i~I~gdd~~l~~PL~~~ 219 (694) T protein:vir:10 140 GEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPR 219 (694) T ss_pred ceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeecCccccccccccc Confidence 998766444311 122244567999999999999999999999999999999988 4556689999 3 Q ss_pred ---CCcCceeEEEEeccccCChhhhhc-cccccccCCceeEEEeecccCCccccceeeehhhhheecC----------Cc Q lcl|NC_016762. 144 ---GKLNGLAKVTPAWAGCLKPKSFDE-KPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD----------WT 209 (456) Q Consensus 144 ---~~~~~l~~i~~~~~~~~~~~~~~~-Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~----------~~ 209 (456) ++.++++.|.|++++.++|..++. ||++|+||+|++|+|. +++||+|||++|.+ ++ T Consensus 220 ~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~----------G~~IH~SRL~~f~g~plPd~LKp~y~ 289 (694) T protein:vir:10 220 PYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI----------GTEVHATRLHTIVSRPVGDMLKPTYS 289 (694) T ss_pred cccccCcceeeeEeecccccccchhhhccchhhccCCCceEEEe----------ceEEeeeeEEEecCCCchhhhhcccc Confidence 456667777777777789998886 9999999999999996 46899999999964 56 Q ss_pred CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEe Q lcl|NC_016762. 210 GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLP 289 (456) Q Consensus 210 ~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~li 289 (456) +||+|++|.+|+++.++++++.++++++++.+++.+. ++|++.+..+.+.....+.+.++++++|++++++ T Consensus 290 ~~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk---------~dla~~L~~g~~~~l~~R~eli~~~Rsn~G~~ll 360 (694) T protein:vir:10 290 FAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGIL---------MDLAQALMPGANVDLSMRAELINRYRDNRNILFL 360 (694) T ss_pred cCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhhHHHH---------HHHHHhhcChhHHHHHHHHHHHHHhcCccceEEE Confidence 8999999999999999999999999999876655542 3466655555454444445788899999999999 Q ss_pred cC-CCceeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH--HHHHHHHHHHHHHHhhhhHHHHHHHHH Q lcl|NC_016762. 290 TQ-GATVTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE--DQKYHNARCQARRVQELTFEINDLFAH 366 (456) Q Consensus 290 d~-~d~~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~--D~~nyyd~I~~~Qe~~lrp~L~~l~~~ 366 (456) |+ +|+|++++++||||++|+++|+|+|||+++||+||||||||+|||+|| |++||||+|+++||+.|+|+|++|+++ T Consensus 361 Dk~~Eefeq~stslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~i 440 (694) T protein:vir:10 361 DKATEEFFQFNTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVM 440 (694) T ss_pred ecCCcceEEEecccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 96 688999999999999999999999999999999999999999999884 899999999999999999999999999 Q ss_pred HHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCC--CCcccCC--- Q lcl|NC_016762. 367 LMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGD--PLPDTEP--- 441 (456) Q Consensus 367 l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~--~~~~~~~--- 441 (456) |++|.+|..+++|+|+|||||+||++|+|||++|+|+++++|++.| +|+++|+|+.+..++...+. .+.+++| T Consensus 441 i~rS~~G~idp~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~g--vI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~ 518 (694) T protein:vir:10 441 IQLSLFGAVDPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQ--VIRPDQVAARLNTEPDGPYAGKLDANDDPGVP 518 (694) T ss_pred HHHHhcCCCCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhc--CCCHHHHHHHHhcCCCcccccccccccCCCcC Confidence 9999999888899999999999999999999999999999999999 99999999997766544432 1111111 Q ss_pred CCC------CCCCcCCCCCCC Q lcl|NC_016762. 442 EDE------DAARTDPTGEQQ 456 (456) Q Consensus 442 ~d~------~~~~~d~~~~~e 456 (456) +|+ ...++.+.+++. T Consensus 519 ~~~~~~~~~~~~~~~~~~~~~ 539 (694) T protein:vir:10 519 ADDDIDGVLTYVQRLAEGGDT 539 (694) T ss_pred ccchhhhhHhhhcCccccccc Confidence 111 111222222222 No 13 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=100.00 E-value=4e-110 Score=620.35 Aligned_cols=433 Identities=12% Similarity=0.098 Sum_probs=337.7 Q ss_pred CCchhHHHHhHHHHHHHHH--HHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCC Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIAR--ARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTN 78 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~--~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~ 78 (456) =++.++|+.++-+.-..-. .+..++-..-.-|+..|.+ .|.+...+--++.++.-.++|..|++|++++++|||+| T Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w 140 (695) T protein:vir:78 63 PSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDAL--SFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTW 140 (695) T ss_pred CCcccccceeceeccccCCccccchhhhhhcccccccccc--hhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhccc Confidence 5678888888754332110 0111111111123444443 23344444445666777889999999999999999999 Q ss_pred CEEecCCCcchh------------hhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEe-cCCCCccccc--c Q lcl|NC_016762. 79 PQVIEGDDQDRS------------KDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHI-RDSQPWDRPA--R 143 (456) Q Consensus 79 ~~i~~~~~~d~~------------~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i-~D~~~~~~Pl--~ 143 (456) ++++++.+.+-. ....+..++|+++++||+||++|+++++|+|+|||++++|++ +|++.+++|| + T Consensus 141 ~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~ 220 (695) T protein:vir:78 141 GEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPR 220 (695) T ss_pred ceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCccccccccccc Confidence 998766444311 122244567999999999999999999999999999999988 4556689999 3 Q ss_pred ---CCcCceeEEEEeccccCChhhhhc-cccccccCCceeEEEeecccCCccccceeeehhhhheecC----------Cc Q lcl|NC_016762. 144 ---GKLNGLAKVTPAWAGCLKPKSFDE-KPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD----------WT 209 (456) Q Consensus 144 ---~~~~~l~~i~~~~~~~~~~~~~~~-Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~----------~~ 209 (456) ++.++++.|.|++++.++|..++. ||++|+||+|++|+|. +++||+|||++|.+ ++ T Consensus 221 ~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~----------G~kIH~SRL~~f~g~plPd~LKp~y~ 290 (695) T protein:vir:78 221 PYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI----------GTEVHATRLHTIVSRPVGDMLKPTYS 290 (695) T ss_pred cccccCcceeeeEeecccccccchhhhccchhhccCCCceEEEe----------ceEEeeeeEEEecCCCchhhhhcccc Confidence 456667777777777789998886 9999999999999996 46899999999964 56 Q ss_pred CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEe Q lcl|NC_016762. 210 GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLP 289 (456) Q Consensus 210 ~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~li 289 (456) +||+|++|.+|++|.++++++.++++++++.+++.+. ++|++.+..+.+.....+.+.++++++|++++++ T Consensus 291 ~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk---------~dla~~L~~g~~~~l~~R~eli~~~Rsn~G~~ll 361 (695) T protein:vir:78 291 FAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGIL---------MDLAQALMPGANVDLSMRAELINRYRDNRNILFL 361 (695) T ss_pred cCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhhHHHH---------HHHHHhhcChhHHHHHHHHHHHHHhcCccceEEE Confidence 8999999999999999999999999999876655442 3466655555454444445788899999999999 Q ss_pred cC-CCceeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH--HHHHHHHHHHHHHHhhhhHHHHHHHHH Q lcl|NC_016762. 290 TQ-GATVTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE--DQKYHNARCQARRVQELTFEINDLFAH 366 (456) Q Consensus 290 d~-~d~~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~--D~~nyyd~I~~~Qe~~lrp~L~~l~~~ 366 (456) |+ +|+|++++++||||++|+++|+|+|||+++||+||||||||+|||+|| |++||||+|+++||+.|+|+|++|+++ T Consensus 362 Dk~~Eefeq~stslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~i 441 (695) T protein:vir:78 362 DKATEEFFQFNTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVM 441 (695) T ss_pred ecCCcceEEEecccCCHHHHHHHHHHHHHhhhcCchhhhhccCCccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 96 688999999999999999999999999999999999999999999874 899999999999999999999999999 Q ss_pred HHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCC--CCcccCC--- Q lcl|NC_016762. 367 LMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGD--PLPDTEP--- 441 (456) Q Consensus 367 l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~--~~~~~~~--- 441 (456) |++|.+|..+++|+|+|||||+||++|+|||++|+|+++++|++.| +|+++|+|+.+..++...+. .+.+++| T Consensus 442 i~rS~~G~idpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~g--vI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~ 519 (695) T protein:vir:78 442 IQLSLFGAVDPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQ--VIRPDQVAARLNTEPDGPYAGKLDANDDPGVP 519 (695) T ss_pred HHHHhcCCCCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhc--CCCHHHHHHHHhcCCCcccccccccccCCCcC Confidence 9999999888899999999999999999999999999999999999 99999999997766544332 1111111 Q ss_pred CCC------CCCCcCCCCCCC Q lcl|NC_016762. 442 EDE------DAARTDPTGEQQ 456 (456) Q Consensus 442 ~d~------~~~~~d~~~~~e 456 (456) +|+ ...++.+.+++. T Consensus 520 ~~~~~~~~~~~~~~~~~~~~~ 540 (695) T protein:vir:78 520 ADDDIDGVLTYVQRLAEGGDT 540 (695) T ss_pred ccchhhhhHhhhcCccccccc Confidence 111 111122222222 No 14 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=100.00 E-value=4.1e-110 Score=620.28 Aligned_cols=433 Identities=12% Similarity=0.099 Sum_probs=338.2 Q ss_pred CCchhHHHHhHHHHHHHHH--HHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCC Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIAR--ARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTN 78 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~--~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~ 78 (456) =++.++|+.++-+.-..-. -+..++-..-.-|+..|.+ .|.+...+--++.++.-.++|..|++|++++++|||+| T Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w 140 (695) T protein:vir:36 63 PSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDAL--SFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTW 140 (695) T ss_pred CCcccccceeceecccccCccccchhhhhhcccccccccc--hhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhccc Confidence 5778888888754332110 0111111111123444443 23344444445666777889999999999999999999 Q ss_pred CEEecCCCcchh------------hhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEe-cCCCCccccc--c Q lcl|NC_016762. 79 PQVIEGDDQDRS------------KDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHI-RDSQPWDRPA--R 143 (456) Q Consensus 79 ~~i~~~~~~d~~------------~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i-~D~~~~~~Pl--~ 143 (456) ++++++.+.+-. +...+..++|+++++||+||++|+++++|+|+|||++++|++ +|++.+++|| + T Consensus 141 ~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~ 220 (695) T protein:vir:36 141 GEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPR 220 (695) T ss_pred ceecccchhhhhhccccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCccccccccccc Confidence 998766444311 122244567999999999999999999999999999999988 4556689999 3 Q ss_pred ---CCcCceeEEEEeccccCChhhhhc-cccccccCCceeEEEeecccCCccccceeeehhhhheecC----------Cc Q lcl|NC_016762. 144 ---GKLNGLAKVTPAWAGCLKPKSFDE-KPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD----------WT 209 (456) Q Consensus 144 ---~~~~~l~~i~~~~~~~~~~~~~~~-Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~----------~~ 209 (456) ++.++++.|.|++++.++|..++. ||++|+||+|++|+|. +++||+|||++|.+ ++ T Consensus 221 ~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~----------G~kIH~SRL~~f~g~plPd~LKp~y~ 290 (695) T protein:vir:36 221 PYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI----------GTEVHATRLHTIVSRPVGDMLKPTYS 290 (695) T ss_pred cccccCcceeeeEeecccccccchhhhccchhhccCCCceEEEe----------ceEEeeeeEEEecCCCchhhhhcccc Confidence 456667777777777789998886 9999999999999996 46899999999964 56 Q ss_pred CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEe Q lcl|NC_016762. 210 GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLP 289 (456) Q Consensus 210 ~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~li 289 (456) +||+|++|.+|++|.++++++.++++++++.+++.+. ++|++.+..+.+.....+.+.++++++|++++++ T Consensus 291 ~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk---------~dla~aL~~g~~~~l~~R~eli~~~Rsn~G~~ll 361 (695) T protein:vir:36 291 FAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGIL---------MDLAQALMPGANVDLSMRAELINRYRDNRNILFL 361 (695) T ss_pred cCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhHHHHH---------HHHHHhhcChhHHHHHHHHHHHHHhcCccceEEE Confidence 8999999999999999999999999999876665552 3466655555554444445788899999999999 Q ss_pred cC-CCceeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH--HHHHHHHHHHHHHHhhhhHHHHHHHHH Q lcl|NC_016762. 290 TQ-GATVTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE--DQKYHNARCQARRVQELTFEINDLFAH 366 (456) Q Consensus 290 d~-~d~~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~--D~~nyyd~I~~~Qe~~lrp~L~~l~~~ 366 (456) |+ +|+|++++++||||++|+++|+|+|||+++||+||||||||+|||+|| |++||||+|+++||+.|+|+|++|+++ T Consensus 362 Dk~~Eefeq~stslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~i 441 (695) T protein:vir:36 362 DKATEEFFQFNTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVM 441 (695) T ss_pred ecCCcceEEEecccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 96 688999999999999999999999999999999999999999999874 899999999999999999999999999 Q ss_pred HHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCC--CcccCC--- Q lcl|NC_016762. 367 LMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDP--LPDTEP--- 441 (456) Q Consensus 367 l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~--~~~~~~--- 441 (456) |++|.+|..+++|+|+|||||+||++|+|||++|+|+++++|++.| +|+++|+|+.+..++...+.. +.+++| T Consensus 442 i~rS~~G~idpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~g--vI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~ 519 (695) T protein:vir:36 442 IQLSLFGAVDPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQ--VIRPDQVAARLNTEPDGPYAGKLDANDDPGVP 519 (695) T ss_pred HHHHhcCCCCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhc--CCCHHHHHHHHhcCCCcccccccccccCCCcC Confidence 9999999888899999999999999999999999999999999999 999999999977665444321 111111 Q ss_pred CCC------CCCCcCCCCCCC Q lcl|NC_016762. 442 EDE------DAARTDPTGEQQ 456 (456) Q Consensus 442 ~d~------~~~~~d~~~~~e 456 (456) +|+ ...++.+.+++. T Consensus 520 ~~~~~~~~~~~~~~~~~~~~~ 540 (695) T protein:vir:36 520 ADDDIDGVLTYVQRLAEGGDT 540 (695) T ss_pred ccchhhhhHhhhcCccccccc Confidence 111 111222223222 No 15 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=100.00 E-value=1.2e-55 Score=321.69 Aligned_cols=196 Identities=13% Similarity=0.081 Sum_probs=158.3 Q ss_pred hccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCC-CceeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeecc Q lcl|NC_016762. 252 EINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQG-ATVTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGM 330 (456) Q Consensus 252 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~-d~~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~ 330 (456) ++++.+|++.++.+..++.+++ ..+..+++++++++++++ |+|++++++||||++++++++++|||+++||+|||||| T Consensus 1 V~k~~~l~~~~~~~~~~~~~r~-~~~~~~~~~~~~~~ld~~~e~~e~~~~~lsGl~d~l~~~~~~iaa~s~iP~t~LfG~ 79 (201) T protein:vir:10 1 MWKAKGLADLCDDSDGAARLRL-AQVDNNSGVGQAIGIDADSEEYNVLNSDIGGIDTFLSQKFDRIVALSGIHEIILKGK 79 (201) T ss_pred CccchHHHHHhcCChHHHHHHH-HHHHHhhhhhhhheeecCCcceeeeecCcCChHHHHHHHHHHHHhHhcCchhhhcCC Confidence 3456778887777766666554 455666666677777765 78999999999999999999999999999999999999 Q ss_pred CCCcccchH--HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 331 QTGERASSE--DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAA 408 (456) Q Consensus 331 sp~Glnst~--D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~ 408 (456) ||||||||| |++|||++|+++||+.|+|+|++|++++++ +++|+|+|||||+||+|||||+++++|+|+++| T Consensus 80 sp~Glnatge~d~~nyyd~i~~~Qe~~l~p~le~l~~~~~~------~~~~~~~f~pL~~~s~kekAei~~~~a~a~~~~ 153 (201) T protein:vir:10 80 NVGGVSASQNTALETFYGYVDRKRKAELLPLLEFLLPFIVT------EQEWSVEFNPLSQVSDKDKSEILEKNVNSVAAL 153 (201) T ss_pred CCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC------CCCceEeeCCCCCCCHHHHHHHHHHHHHHHHHH Confidence 999999875 899999999999999999999999997653 569999999999999999999999999999999 Q ss_pred HHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 409 IGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 409 ~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) +++| +++++|+|+.+...+...+.+....+++.+..+..||....| T Consensus 154 ~~~g--~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~e~~dp~~~~~ 199 (201) T protein:vir:10 154 IAAG--IIDADEARDTLRAISTEVKIGEGSIQTEVVINESEDPLDVSA 199 (201) T ss_pred HHcC--CCCHHHHHHHHHhcCCcCCCCCCCCCccccccccCCCCCCCC Confidence 9999 999999999887766655544333222222222223332222 No 16 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=99.86 E-value=2.1e-21 Score=133.95 Aligned_cols=412 Identities=14% Similarity=0.120 Sum_probs=205.2 Q ss_pred CCc-----hhHHHHhH--HHHHH--------------------HHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHH Q lcl|NC_016762. 1 MTD-----KLDLAVNH--AMSSA--------------------IARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDL 53 (456) Q Consensus 1 ~~~-----~~~~~~~~--a~~~~--------------------~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l 53 (456) |-- |..++.|. -+..+ ....|++.. ...+.+.+++ | .....++.+| T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~-~~~~~~g~~~-----~-~epp~d~~~l 89 (648) T protein:vir:79 17 MWRDEDDDKEPLVLEESMQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLA-IMDGGGGGRD-----F-EEPEFDFNEI 89 (648) T ss_pred hccCccccccccccccccccCCCccccCCCCcccccccccchhHHHHHhHHH-HHhhcCCccc-----c-ccCCcCHHHH Confidence 111 22222221 11100 111123321 1222222232 1 2245688999 Q ss_pred HHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec Q lcl|NC_016762. 54 YTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR 133 (456) Q Consensus 54 ~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~ 133 (456) ..+|..++.++++|+++++++.+-.+.+..............+ .+.+-.......+-+......-.++|-+++.+.-+ T Consensus 90 ~~l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~~~~~~~~~~--ll~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd 167 (648) T protein:vir:79 90 TSAYNTEGYVRQAVDKYIEMMFKADWDFVSKNPNAVEYIRMRF--TLMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRA 167 (648) T ss_pred HHHHhcChHHHHHHHHHHHHHhhCcceEEecCCccchhhHHHH--HhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEec Confidence 9999999999999999999999999888544322111110000 01111111112222222333455778777765432 Q ss_pred -CCCCccc--ccc-CCcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecC-- Q lcl|NC_016762. 134 -DSQPWDR--PAR-GKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD-- 207 (456) Q Consensus 134 -D~~~~~~--Pl~-~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~-- 207 (456) +|.++-. ++. .....+..+.|+....++ +...+||.+..|.+... ++ ...+.++++.||||.. T Consensus 168 ~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~-------v~~d~~g~~~~Y~y~~~--g~--~~~~~~~~~dIIHik~~~ 236 (648) T protein:vir:79 168 KDALPFQGMNVMGVGDSMPVAGYFPLNLASMK-------VKRDKFGMIKGWQQEQE--GQ--DKPQKFKPEDIVHIYYKR 236 (648) T ss_pred CCCccchhhhhhhhccccceeeeEeecCceeE-------EEEcCCCceeeeEEEec--CC--ceeEEecCccEEEEccCC Confidence 2222111 111 111112222232221111 22236889998887521 22 1235678899998852 Q ss_pred --CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCC Q lcl|NC_016762. 208 --WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGND 285 (456) Q Consensus 208 --~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 285 (456) ....|.|.++.+.+.+.....+. ..+..+|++..+...+.. +.. +....+..+++ ++.++++.+ T Consensus 237 ~~d~~~GlSpi~~a~~aI~l~~aa~-~~~~~fF~NGa~P~gil~---~~~-------~~~~~e~~k~~---~e~~~~~~~ 302 (648) T protein:vir:79 237 EKGRAFGTPWLLPALDDIRALRQVE-ENVLRLVYRNLHPLWHVK---VGL-------EQEGFGAEEGE---VDLVRGEVE 302 (648) T ss_pred CCCCceeccHHHHHHHHHHHHHHHH-HHHHHHHhccCCccEEEE---eCC-------CccchHHHHHH---HHHHHHhcc Confidence 23459999999999886554443 344456666544322211 100 00011112222 222333333 Q ss_pred eEEecCC-CceeEEecccCC------HHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH--HHHHHHHHHHHHHHhhh Q lcl|NC_016762. 286 VLLPTQG-ATVTQMVSAVSD------PGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE--DQKYHNARCQARRVQEL 356 (456) Q Consensus 286 ~~lid~~-d~~~~~~~~~sg------l~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~--D~~nyyd~I~~~Qe~~l 356 (456) .+.+..+ -+++.+..+..+ +.+......++||++.|||-. ++|...++-.++. ...+|+++|...|.... T Consensus 303 ~~~i~gg~v~~~~~~i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~-lLG~~~~ss~stae~~~~~~~~~i~~l~~~i~ 381 (648) T protein:vir:79 303 NMDVEGGMVTTERVNISSIASNQIIDAKEYLKHFEQRAFTVLGVSEL-MMGRGGTASRSTGDNLSSDFKDRIKALQKVMA 381 (648) T ss_pred cccccccccccceeeccccCCHHHHHHHHHHHHHHHHHHHHhCCCHh-HcccCCCccchHHHHHHHHHHHHHHHHHHHHH Confidence 3333332 345555544422 233345567899999999986 5687554443443 45789999998887655 Q ss_pred hHHHHHHHHHH-HHhcCc---CCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCC Q lcl|NC_016762. 357 TFEINDLFAHL-MRIGVV---PLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQG 432 (456) Q Consensus 357 rp~L~~l~~~l-~~s~~~---~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~ 432 (456) +..-..+...+ ...++. ..+..+.|.|++|...+++.+++.. ..++.+| ++|+||+|+..+++|+++ T Consensus 382 ~~le~~~~~~ll~e~~l~~~l~~d~~ieF~~~~Llr~D~~~~a~~~-------~~l~~~G--ilT~NEaR~~lGlpPi~~ 452 (648) T protein:vir:79 382 TFINEFMVKEILMEGGFDPVLNPDDKVEFRFNEIDMDSKIKLENQA-------VFLYEHN--AISEDEMRELIGRDPVDD 452 (648) T ss_pred HHHHHHHHHHHhhhhhccccccccceEEEeecccchhhHHHHHHHH-------HHHHhCC--CcCHHHHHHHhCCCCCCC Confidence 55444444333 223222 2344678999999988877765543 4567778 999999999999999865 Q ss_pred CCCCc---cc-----CCCCCCCCCcCCCCCC------C Q lcl|NC_016762. 433 GDPLP---DT-----EPEDEDAARTDPTGEQ------Q 456 (456) Q Consensus 433 ~~~~~---~~-----~~~d~~~~~~d~~~~~------e 456 (456) +.... .. ....++...++|.++. + T Consensus 453 g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~e 490 (648) T protein:vir:79 453 GEGRAKMHLQMVTIAQATALAALAPTPAGGSSASASGD 490 (648) T ss_pred CCCccccccccccchhccccccCCCCCCCCCCCCcccc Confidence 43210 00 0011111111111100 0 No 17 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=99.77 E-value=4e-19 Score=121.47 Aligned_cols=386 Identities=13% Similarity=0.090 Sum_probs=199.3 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCc---ccCCHHHHHHHHhcCchhhhhhccchhHHhhC Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFP---QEITFNDLYTMYRRGGIAHGAVEKIVTTCWKT 77 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~---~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~ 77 (456) |-=+ . .|.+......+.......-+|.. ..++.. .++ .+..++++|+++|.+.-+- T Consensus 1 m~f~----------~-------~~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~---~al-~~~~v~~~i~~ia~~ia~l 59 (409) T protein:vir:10 1 MLFR----------K-------GFKNQSQEISIDDKKILEWLGINPSETYVNGK---SCL-KQATVFGCIRILSDNISKL 59 (409) T ss_pred Cccc----------c-------cccCcCCCCCCChHHHHHHhcCCcCcceechh---hhh-ccHHHHHHHHHHHHhhhhC Confidence 1110 0 00000000100000000011111 122332 223 5788999999999999877 Q ss_pred CCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEec Q lcl|NC_016762. 78 NPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAW 156 (456) Q Consensus 78 ~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~ 156 (456) -+.+....+.........+...+...=....-+..|.+.+-+ -.++|-+++++.- ++. +.+..+.|+. T Consensus 60 p~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r-~~~----------G~~~~L~~i~ 128 (409) T protein:vir:10 60 PIKIYQKKDGIKRVPDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDF-KKN----------GEIKGLYPLK 128 (409) T ss_pred ceEEEEecCCeeeccCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEE-cCC----------CcEEEEEEEc Confidence 777643221111111111111121111112233445554444 5667877777643 211 1234455555 Q ss_pred cccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc---CCCcchHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 157 AGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT---GDAIGFLEPAYNSFISLEKVEGGS 233 (456) Q Consensus 157 ~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~---~~G~S~le~~~~~l~~~~~~~~~~ 233 (456) ...+++.. +.+... .+.....|.++. ..+..+.++++.||||...+ ..|.|.++.+.+.+.....+ ... T Consensus 129 ~~~V~v~~-~~~~~~-~~~~~~~y~~~~-----~~g~~~~~~~~evih~r~~~~d~~~G~s~i~~~~~~i~~~~~~-~~~ 200 (409) T protein:vir:10 129 SDGMKIFV-DDTGLL-NSENNVWYLYTD-----DLGQRHKFMSDEILHFKGLTADGLAGLSVIELLNHLIENGKSS-ETY 200 (409) T ss_pred CCceEEEE-cCCccc-cccceEEEEEEe-----CCceeEEeccccEEEecCcCCCCcccccHHHHHHHHHHHHHHH-HHH Confidence 44444321 122211 222223454432 11234678999999885432 35899999999877554443 344 Q ss_pred HHHHHHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhc----CCCeEEecCCCceeEEecccCCHH-- Q lcl|NC_016762. 234 GESFLKNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNR----GNDVLLPTQGATVTQMVSAVSDPG-- 306 (456) Q Consensus 234 ~~~~~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~lid~~d~~~~~~~~~sgl~-- 306 (456) +..++++..+.-. ++.... + ..+..+++.+.+....+ ..+.++++.+.+|++++.+..+.+ T Consensus 201 ~~~~f~ng~~~~gil~~~~~-----l-------~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~ 268 (409) T protein:vir:10 201 LNNFFKNGLQVKGLVQYAGD-----L-------NPEAEEVFKENFERMSSGLKNAHRIAMLPIGYKFEPISQKLVDAQFL 268 (409) T ss_pred HHHHHhccCCCcEEEEcCCC-----C-------CHHHHHHHHHHHHHHhccccccCCceecCCCceEEEccCChhhHHHH Confidence 4456666543221 111111 1 12333444444444333 234566777788999988775543 Q ss_pred HHHHHHHHHHHhhhcCCeEEeeccCCCc-ccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC--CCC--ceE Q lcl|NC_016762. 307 PTYNVNLQTAAAGVDIPTKILVGMQTGE-RASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP--LKA--EFT 380 (456) Q Consensus 307 ~~~~~~~~~~aaas~IP~t~L~G~sp~G-lnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~--~~~--d~~ 380 (456) +......++||++.|||...| |...++ .++.+ ..+.||.. -|.|.++.+-..|-+.-+.+ ... .+. T Consensus 269 e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~~e~~~~~f~~~-------~l~P~~~~ie~~ln~kL~~~~~~~~~~~~~ 340 (409) T protein:vir:10 269 ENSQLTIRQIASVFGVKMHQL-NDLDRATHSNITEQNREFYID-------TLQSILNMYELEINYKLFLISEIKNGFYSK 340 (409) T ss_pred HHHHHHHHHHHHHhCCCHHHc-CCCCCCccccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCchhccCCcEEE Confidence 556678889999999999866 543333 33333 45677754 37788877766664432221 122 356 Q ss_pred EEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc---ccCCCCCCCCCcCCCCCC Q lcl|NC_016762. 381 AIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP---DTEPEDEDAARTDPTGEQ 455 (456) Q Consensus 381 ~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~---~~~~~d~~~~~~d~~~~~ 455 (456) |.++.|...+.+++++ +...++..| ++++||+|+..+++|+++++..- .-.+.+....+...+|++ T Consensus 341 fd~~~ll~~d~~~~~~-------~~~~~~~~G--~~T~NE~R~~lgl~p~~ggD~~~~~~n~~~~~~~~~~~~kgGe~ 409 (409) T protein:vir:10 341 FNVDTILRADIKTRYE-------SYKEAIQNG--FKTPNEIRELEEDEPLEGGDVLLINGNMIPVKMAGEQYSKGGEK 409 (409) T ss_pred EechhhhccCHHHHHH-------HHHHHHhCC--CcCHHHHHHHhCCCCCCCcCeeeeccCccchhhccccccccCCC Confidence 6677888888887654 445677778 99999999999999987664421 111222222333455666 No 18 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=99.76 E-value=1.4e-18 Score=118.43 Aligned_cols=381 Identities=12% Similarity=0.084 Sum_probs=201.4 Q ss_pred HHHhhh-hhccCc----------------ccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecC Q lcl|NC_016762. 22 MSLLNQ-GIGHDA----------------KRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEG 84 (456) Q Consensus 22 d~~~n~-~~~~gt----------------~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~ 84 (456) |+|.+- +..... .-+..|..+|.+...+. .-..++ ++..+.++|+++++++-+--+.+... T Consensus 1 MG~f~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v-~~~~al-~~~~v~~ci~~ia~~iA~lp~~~~~~ 78 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRSNYDEDIGIDISDSNFWEKFGIKLNFSV-RGKRAL-KENTVYVCTKIRAESIGKLSLKIYKD 78 (422) T ss_pred CchhhhhhhccCCccchhhhhhhccccccCcchhhhhccccCCccc-chhhhh-ccHHHHHHHHHHHHhhhhCceEEEec Confidence 333321 111110 01112233333221111 111223 46778999999999999888887643 Q ss_pred CCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChh Q lcl|NC_016762. 85 DDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPK 163 (456) Q Consensus 85 ~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~ 163 (456) .+.... ..+...|...=....-+..|.+.+-+ -.++|.+++++. ++.. +.+..+.|+-...+.+. T Consensus 79 ~~~~~~---~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~-r~~~----------G~~~~L~~i~~~~v~~~ 144 (422) T protein:vir:13 79 KEEYKE---HELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIE-RDRK----------GKIIGLYPINSDNVTKI 144 (422) T ss_pred Cccccc---chHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEE-ECCC----------CcEEEEEEECCcceEEE Confidence 322111 11112222111122233455555555 455677766653 2211 12344445444434332 Q ss_pred hhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 164 SFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISLEKVEGGSGESFLK 239 (456) Q Consensus 164 ~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~ 239 (456) . +.|..-..++. .+|.+.. .+ +....++++.+||+... ...|.|.++.+.+.+..... .......+++ T Consensus 145 ~-~~~~~~~~~~~-~~y~~~~--~~---g~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~-~~~~~~~~f~ 216 (422) T protein:vir:13 145 I-DDDNFLSSLSK-VWYVVTD--KN---GKEHKLLPDEMLHFIGDITLDGLIGIKPLDYLRCTIENGRA-TQEFINKFFK 216 (422) T ss_pred E-cCCcceeccce-EEEEEEe--CC---CeEEEEcccceEEEcCCCCCCCcccccHHHHHHHHHHHHHH-HHHHHHHHHh Confidence 1 22333333433 3455542 12 23467999999998643 24589999999987654433 4455556666 Q ss_pred Hhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEEecccCCHH--HHHHHH Q lcl|NC_016762. 240 NAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQMVSAVSDPG--PTYNVN 312 (456) Q Consensus 240 ~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~~~~~sgl~--~~~~~~ 312 (456) +..+.-. ++.... + .++..+++.+.+..+.++ .+.++++.+-+|+.++.+..+.. +..... T Consensus 217 ng~~p~gil~~~~~-----l-------~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~~~ 284 (422) T protein:vir:13 217 NGLSIKGIVQYVGD-----L-------DEKAKKIFKKEFESMSNGLENAHSISLLPFGYQFQPISLSMADAQFLENSKLT 284 (422) T ss_pred ccCCccEEEEeCCC-----C-------CHHHHHHHHHHHHHHhcCccccCCceecCCCceeeeccCChhHHHHHHHHHHH Confidence 6433221 111111 1 123344444455444332 24566777788999887776543 455567 Q ss_pred HHHHHhhhcCCeEEeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC--CCCc--eEEEeCCCC Q lcl|NC_016762. 313 LQTAAAGVDIPTKILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP--LKAE--FTAIWDDLT 387 (456) Q Consensus 313 ~~~~aaas~IP~t~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~--~~~d--~~~~f~pL~ 387 (456) ...||.+.|||...|.+...+..++.+ ...+||.. -|.|.++++-+.|-+.-+-+ ...+ |.|.++.|. T Consensus 285 ~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~~~~f~~~-------~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~l~ 357 (422) T protein:vir:13 285 KRELAATFGMKSYHLNDLERATFNNLTEQQKDFYVT-------TLQSSLTVYEQEIQDKLFSQYETLQDVKAEFNVDTIL 357 (422) T ss_pred HHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhCChhhhcCCceEEeechhhh Confidence 788999999999766655555555444 45667654 37788777666554432221 1223 445556888 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc---cc---CCCCC-CCCCcCCCCC Q lcl|NC_016762. 388 VPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP---DT---EPEDE-DAARTDPTGE 454 (456) Q Consensus 388 ~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~---~~---~~~d~-~~~~~d~~~~ 454 (456) ..|.+++++ +.+.+++.| ++|+||+|+..+++|+++++..- .- +..++ ....++++++ T Consensus 358 r~d~~~~~~-------~~~~~~~~G--~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~l~~~~~~~~~~g~~~g~ 422 (422) T protein:vir:13 358 RSDIKTRYE-------AYRIGIQGG--FIEANEARRRENLPPVEGGDRLLVNGNMIPIEMAGEQYKKGGEKGGK 422 (422) T ss_pred cCCHHHHHH-------HHHHHHhCC--CcCHHHHHHHhCCCCCCCcCeeeeccCccchhhcccccccCCCcCCC Confidence 888877655 445677777 99999999999999987665421 11 11111 2222233333 No 19 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=99.76 E-value=9.1e-18 Score=114.01 Aligned_cols=411 Identities=11% Similarity=0.087 Sum_probs=202.4 Q ss_pred CCchh-----------HHHHhHHHHHHHHHHHHHHhhhhhccCcc--cchhh-hhccCcccCCHHHHHHHHhcCchhhhh Q lcl|NC_016762. 1 MTDKL-----------DLAVNHAMSSAIARARMSLLNQGIGHDAK--RPQAW-CEYGFPQEITFNDLYTMYRRGGIAHGA 66 (456) Q Consensus 1 ~~~~~-----------~~~~~~a~~~~~~~~~d~~~n~~~~~gt~--~~~~~-~~~~~~~~~~~~~l~~~Y~~~~l~r~i 66 (456) |+.|. +=.+|+|+.......++.+. |+.-. .+..+ .........++++|.++|..+.++++| T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~g~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 88 (535) T protein:vir:10 13 LSNKKSTSYIELGDYDKDIVNKAIRPGRASARDTVD----GIDIADGNVAGQYSVASISDVLSTKKLLKAYADNDIVQAI 88 (535) T ss_pred hhhhhhhhhHHHhhhhHHHHHhhhhhhhhhhhcccc----ccccccCCcccccccCccccccCHHHHHHHhccChhHHHH Confidence 44332 22333444443333333332 21111 11011 111222457899999999999999999 Q ss_pred hccchhHHhh-----------CCCEE--ecCCCcchhhhhHHHHHHHHHHHH--------HhhHHHHHHHHHH-hhcccC Q lcl|NC_016762. 67 VEKIVTTCWK-----------TNPQV--IEGDDQDRSKDETEWERKNKPLIA--------GGRFWRAVSEADR-RRLVGR 124 (456) Q Consensus 67 Vd~~aed~tR-----------~~~~i--~~~~~~d~~~~~~~~e~~i~~~~~--------~l~~~~~~~ea~~-~~r~~G 124 (456) |++.++..+. .++.| ...+.....+.... ...+..++. ....|..|...+- ..+++| T Consensus 89 i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~-~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv~d~l~~~ 167 (535) T protein:vir:10 89 IRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKR-AHEIEDFIYNTGSEYYEWRDTFPRLLTKIINDMYVQD 167 (535) T ss_pred HHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhh-hhHHHHHHHhCCCCCCChhHHHHHHHHHHHHHHHhhC Confidence 9998887653 12232 22221111111111 011222222 1124444555433 345666 Q ss_pred ceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhhe Q lcl|NC_016762. 125 YSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFI 204 (456) Q Consensus 125 gs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~ 204 (456) |.+++..+++.. +.+..|.|+....+++ ..|+...+ +-+.+|++. ++ .....+.++.||| T Consensus 168 g~ay~~i~r~~~----------G~~~~L~~l~p~~V~v---~~d~~~~~-~~~~~~~~~----~~--~~~~~~~~~eiih 227 (535) T protein:vir:10 168 QINIERIFKNDS----------NELDHFNAVDASKVVI---SYSPRSKD-QPRKFEQFV----SE--TKSVKFSERNLTF 227 (535) T ss_pred CceEEEEEECCC----------CcEEEEEEeCCceeEE---EEcCcccc-CceEEEEEe----cC--ceeEEECcccEEE Confidence 544443343321 1133444444333332 12332211 123444443 11 2245688888988 Q ss_pred ecCC-------cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh-hhhhhccHhhHHhhhcCCHHHHHHHHHHH Q lcl|NC_016762. 205 LGDW-------TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLL-NFDKEINLGEIASTYGVTLDALNERFNEA 276 (456) Q Consensus 205 ~~~~-------~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~-~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 276 (456) |... ...|.|.++.+.+.+.....+ ......+|++..+.-.+ +.... +. .....+..+++.+. T Consensus 228 ~~~~~~~~~~~~~~G~Spi~~~~~~i~~~~aa-~~~~~~~f~ng~~p~giL~~~~~-----~~---~~ls~e~~e~lk~~ 298 (535) T protein:vir:10 228 INYWNLSDTDRRGYGYSPVEASIPLIRAIYDT-EQFNARFFSQGGTTRGILVIDQD-----GD---AQANQMMLAGIRRQ 298 (535) T ss_pred EeccCCCCcccccccccHHHHHHHHHHHHHHH-HHHHHHHHhccCCccEEEEecCC-----CC---cccCHHHHHHHHHH Confidence 8532 234899999998877655443 44555566664432211 11100 00 00112344455444 Q ss_pred HHHHhcC---CCe-EEec-CCCceeEEecccCCH--HHHHHHHHHHHHhhhcCCeEEeecc-CCCcccc-hH-HHHHHHH Q lcl|NC_016762. 277 ARQLNRG---NDV-LLPT-QGATVTQMVSAVSDP--GPTYNVNLQTAAAGVDIPTKILVGM-QTGERAS-SE-DQKYHNA 346 (456) Q Consensus 277 ~~~~~~~---~~~-~lid-~~d~~~~~~~~~sgl--~~~~~~~~~~~aaas~IP~t~L~G~-sp~Glns-t~-D~~nyyd 346 (456) +....++ .+. .++. .+-+|+.++.+..+. -+........||.+.|||-..| |. ..+..+. .+ ....|.+ T Consensus 299 ~~~~~~G~~nag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~eIa~afgVPp~~l-G~~~~at~sn~~~~~~~~~~s 377 (535) T protein:vir:10 299 WTSQGSGLGGAWKIPILAAKDAKFVNMTQNSRDMEFDKFLNFMIYDTAAIFQMQPEEI-NFPNNGGSTGKSGTKSVNEGS 377 (535) T ss_pred HHHHhcCcccccccccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-ccccCcccccchhhhhhhhhh Confidence 4443333 232 3443 455777777666543 3444456779999999999755 65 3344432 22 3455666 Q ss_pred HHHHHHHhh----hhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHH Q lcl|NC_016762. 347 RCQARRVQE----LTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIR 422 (456) Q Consensus 347 ~I~~~Qe~~----lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R 422 (456) .++..+... |.|.+..+-..|-+.-+-....++.|+|+-|...+.++++++.+ ... .| .++++|+| T Consensus 378 ~~E~~~~~~~~~~L~P~l~~ie~~ln~~Ll~~~~~~~~f~f~~l~~~d~~~r~~~~~-------~~~-~g--~lT~NE~R 447 (535) T protein:vir:10 378 TAKAKLESSKDKGLTPLLSFIEQVINDKIMRYVDTDYRFSFTLGDAQDKLQEEQVWK-------LKL-AN--GYFINEYR 447 (535) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccCCeEEEEeccccccCHHHHHHHHH-------HHH-cC--CCCHHHHH Confidence 666665544 66776666555533322233457999999999999888776532 222 34 68999999 Q ss_pred HHhcccCCCCCCCCccc-------------CC--------CCCCCCCcC-----------CCCCCC Q lcl|NC_016762. 423 EEAGYDPLQGGDPLPDT-------------EP--------EDEDAARTD-----------PTGEQQ 456 (456) Q Consensus 423 ~~~~~~~~~~~~~~~~~-------------~~--------~d~~~~~~d-----------~~~~~e 456 (456) +..+++|+++++.+-.. ++ ...+.++.+ +++.++ T Consensus 448 ~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~~g~~~ 513 (535) T protein:vir:10 448 KDHGLKTVDGLDVPGFIGSAENFINATGFGQPNVPDSSDDSGSTLGERERQERIQHSKDYEKGKDD 513 (535) T ss_pred HHhCCCCCCCccccccccchhhcccccccccccCCCCCCCccccCCccccCcccccccccccCCCC Confidence 99999998765421000 00 000000000 000000 No 20 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=99.75 E-value=5e-18 Score=115.43 Aligned_cols=382 Identities=12% Similarity=0.082 Sum_probs=193.8 Q ss_pred HHHhhhhhcc-Cc-ccc-----------hhhhh-ccC-c--ccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecC Q lcl|NC_016762. 22 MSLLNQGIGH-DA-KRP-----------QAWCE-YGF-P--QEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEG 84 (456) Q Consensus 22 d~~~n~~~~~-gt-~~~-----------~~~~~-~~~-~--~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~ 84 (456) |+|.+.+.+. |. +|. ..+.. +|. + ...+. ..++ +++.+.+||+++|+++-+--+.+... T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~---~~al-~~~~v~~~i~~ia~~ia~lp~~~~~~ 76 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKG---KNAL-KVATVFACIKILSESVSKLPLKIYQE 76 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccch---hhhh-ccHHHHHHHHHHHHhhccCceEEEEe Confidence 3333322111 10 110 00001 111 1 11222 2333 47889999999999998877777533 Q ss_pred CCcch-hhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCCh Q lcl|NC_016762. 85 DDQDR-SKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKP 162 (456) Q Consensus 85 ~~~d~-~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~ 162 (456) ++... ......+...+...=....-+..|.+.+.+ -.++|-+++++. .+.. +.+..+.|+....+++ T Consensus 77 ~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~-r~~~----------G~~~~L~~i~~~~v~v 145 (432) T protein:vir:10 77 DEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIE-FDRK----------GKVQALWPIDASKVTV 145 (432) T ss_pred cCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEE-ECCC----------CcEEEEEEEcCceeEE Confidence 32211 111111111121111112223445555444 456677777654 2211 1234444544433333 Q ss_pred hhhhccccccccCCce-eEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 163 KSFDEKPDSETYGQPT-MWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISLEKVEGGSGESF 237 (456) Q Consensus 163 ~~~~~Dp~s~~yg~P~-~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~~~~~~~~~~~~ 237 (456) . .|....-++... +|.+. .+ +..+.++++.||||... ...|.|.++.+...+.....+ ..+...+ T Consensus 146 ~---~d~~~~~~~~~~~~y~~~---~~---g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~-~~~~~~~ 215 (432) T protein:vir:10 146 Y---IDDVGLLNSKTKMWYVVN---TG---GQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASA-DKFINNF 215 (432) T ss_pred E---EcCcccccccceEEEEEe---cC---CeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHH-HHHHHHH Confidence 2 121111122223 34443 12 22467999999998532 245999999998876555444 3344445 Q ss_pred HHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEEecccCCHH--HHHH Q lcl|NC_016762. 238 LKNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQMVSAVSDPG--PTYN 310 (456) Q Consensus 238 ~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~~~~~sgl~--~~~~ 310 (456) +++..+.-. ++.... + .++..+++.+.+....++ .+.++++.+-+|+.++.+..+.. +... T Consensus 216 ~~ng~~p~gil~~~~~-----l-------~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~ 283 (432) T protein:vir:10 216 YKQGLQVKGLVQYVGD-----L-------NEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTE 283 (432) T ss_pred HhccCCccEEEEcCCC-----C-------CHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHH Confidence 565433221 221111 1 122334444455444332 24566777778999887766553 5566 Q ss_pred HHHHHHHhhhcCCeEEeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC--CCC--ceEEEeCC Q lcl|NC_016762. 311 VNLQTAAAGVDIPTKILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP--LKA--EFTAIWDD 385 (456) Q Consensus 311 ~~~~~~aaas~IP~t~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~--~~~--d~~~~f~p 385 (456) ...++||.+.|||...|-....+..++.+ -...||.. -|+|.++.+-+.|-+.-+.+ ... .|.|.++. T Consensus 284 ~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~-------~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~ 356 (432) T protein:vir:10 284 LTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTD-------TLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDA 356 (432) T ss_pred HHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechh Confidence 67889999999999866443444444444 34566643 57888877766654332221 112 35566668 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc---ccCC---CCC-CCCCcC--CCCCCC Q lcl|NC_016762. 386 LTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP---DTEP---EDE-DAARTD--PTGEQQ 456 (456) Q Consensus 386 L~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~---~~~~---~d~-~~~~~d--~~~~~e 456 (456) |...|.+++++ +...++..| +++++|+|+..+++|+++++..- .-.+ .++ ....++ ..+.++ T Consensus 357 l~~~d~~~~~~-------~~~~~~~~G--~~t~NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~~ 427 (432) T protein:vir:10 357 ILRADIKTRYE-------AYRTGIQGG--FLKPNEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKE 427 (432) T ss_pred hhcCCHHHHHH-------HHHHHHhCC--CcCHHHHHHHhCCCCCCCCCeEeecccccchhhccccccCCCCCCCCCCCC Confidence 98888888765 456677777 99999999999999987654321 0000 000 001111 111111 No 21 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=99.75 E-value=5e-18 Score=115.43 Aligned_cols=382 Identities=12% Similarity=0.082 Sum_probs=193.8 Q ss_pred HHHhhhhhcc-Cc-ccc-----------hhhhh-ccC-c--ccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecC Q lcl|NC_016762. 22 MSLLNQGIGH-DA-KRP-----------QAWCE-YGF-P--QEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEG 84 (456) Q Consensus 22 d~~~n~~~~~-gt-~~~-----------~~~~~-~~~-~--~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~ 84 (456) |+|.+.+.+. |. +|. ..+.. +|. + ...+. ..++ +++.+.+||+++|+++-+--+.+... T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~---~~al-~~~~v~~~i~~ia~~ia~lp~~~~~~ 76 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKG---KNAL-KVATVFACIKILSESVSKLPLKIYQE 76 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccch---hhhh-ccHHHHHHHHHHHHhhccCceEEEEe Confidence 3333322111 10 110 00001 111 1 11222 2333 47889999999999998877777533 Q ss_pred CCcch-hhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCCh Q lcl|NC_016762. 85 DDQDR-SKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKP 162 (456) Q Consensus 85 ~~~d~-~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~ 162 (456) ++... ......+...+...=....-+..|.+.+.+ -.++|-+++++. .+.. +.+..+.|+....+++ T Consensus 77 ~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~-r~~~----------G~~~~L~~i~~~~v~v 145 (432) T protein:vir:10 77 DEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIE-FDRK----------GKVQALWPIDASKVTV 145 (432) T ss_pred cCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEE-ECCC----------CcEEEEEEEcCceeEE Confidence 32211 111111111121111112223445555444 456677777654 2211 1234444544433333 Q ss_pred hhhhccccccccCCce-eEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 163 KSFDEKPDSETYGQPT-MWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISLEKVEGGSGESF 237 (456) Q Consensus 163 ~~~~~Dp~s~~yg~P~-~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~~~~~~~~~~~~ 237 (456) . .|....-++... +|.+. .+ +..+.++++.||||... ...|.|.++.+...+.....+ ..+...+ T Consensus 146 ~---~d~~~~~~~~~~~~y~~~---~~---g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~-~~~~~~~ 215 (432) T protein:vir:10 146 Y---IDDVGLLNSKTKMWYVVN---TG---GQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASA-DKFINNF 215 (432) T ss_pred E---EcCcccccccceEEEEEe---cC---CeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHH-HHHHHHH Confidence 2 121111122223 34443 12 22467999999998532 245999999998876555444 3344445 Q ss_pred HHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEEecccCCHH--HHHH Q lcl|NC_016762. 238 LKNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQMVSAVSDPG--PTYN 310 (456) Q Consensus 238 ~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~~~~~sgl~--~~~~ 310 (456) +++..+.-. ++.... + .++..+++.+.+....++ .+.++++.+-+|+.++.+..+.. +... T Consensus 216 ~~ng~~p~gil~~~~~-----l-------~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~ 283 (432) T protein:vir:10 216 YKQGLQVKGLVQYVGD-----L-------NEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTE 283 (432) T ss_pred HhccCCccEEEEcCCC-----C-------CHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHH Confidence 565433221 221111 1 122334444455444332 24566777778999887766553 5566 Q ss_pred HHHHHHHhhhcCCeEEeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC--CCC--ceEEEeCC Q lcl|NC_016762. 311 VNLQTAAAGVDIPTKILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP--LKA--EFTAIWDD 385 (456) Q Consensus 311 ~~~~~~aaas~IP~t~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~--~~~--d~~~~f~p 385 (456) ...++||.+.|||...|-....+..++.+ -...||.. -|+|.++.+-+.|-+.-+.+ ... .|.|.++. T Consensus 284 ~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~-------~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~ 356 (432) T protein:vir:10 284 LTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTD-------TLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDA 356 (432) T ss_pred HHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechh Confidence 67889999999999866443444444444 34566643 57888877766654332221 112 35566668 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc---ccCC---CCC-CCCCcC--CCCCCC Q lcl|NC_016762. 386 LTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP---DTEP---EDE-DAARTD--PTGEQQ 456 (456) Q Consensus 386 L~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~---~~~~---~d~-~~~~~d--~~~~~e 456 (456) |...|.+++++ +...++..| +++++|+|+..+++|+++++..- .-.+ .++ ....++ ..+.++ T Consensus 357 l~~~d~~~~~~-------~~~~~~~~G--~~t~NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~~ 427 (432) T protein:vir:10 357 ILRADIKTRYE-------AYRTGIQGG--FLKPNEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKE 427 (432) T ss_pred hhcCCHHHHHH-------HHHHHHhCC--CcCHHHHHHHhCCCCCCCCCeEeecccccchhhccccccCCCCCCCCCCCC Confidence 98888888765 456677777 99999999999999987654321 0000 000 001111 111111 No 22 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=99.75 E-value=5e-18 Score=115.43 Aligned_cols=382 Identities=12% Similarity=0.082 Sum_probs=193.8 Q ss_pred HHHhhhhhcc-Cc-ccc-----------hhhhh-ccC-c--ccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecC Q lcl|NC_016762. 22 MSLLNQGIGH-DA-KRP-----------QAWCE-YGF-P--QEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEG 84 (456) Q Consensus 22 d~~~n~~~~~-gt-~~~-----------~~~~~-~~~-~--~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~ 84 (456) |+|.+.+.+. |. +|. ..+.. +|. + ...+. ..++ +++.+.+||+++|+++-+--+.+... T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~---~~al-~~~~v~~~i~~ia~~ia~lp~~~~~~ 76 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKG---KNAL-KVATVFACIKILSESVSKLPLKIYQE 76 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccch---hhhh-ccHHHHHHHHHHHHhhccCceEEEEe Confidence 3333322111 10 110 00001 111 1 11222 2333 47889999999999998877777533 Q ss_pred CCcch-hhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCCh Q lcl|NC_016762. 85 DDQDR-SKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKP 162 (456) Q Consensus 85 ~~~d~-~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~ 162 (456) ++... ......+...+...=....-+..|.+.+.+ -.++|-+++++. .+.. +.+..+.|+....+++ T Consensus 77 ~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~-r~~~----------G~~~~L~~i~~~~v~v 145 (432) T protein:vir:10 77 DEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIE-FDRK----------GKVQALWPIDASKVTV 145 (432) T ss_pred cCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEE-ECCC----------CcEEEEEEEcCceeEE Confidence 32211 111111111121111112223445555444 456677777654 2211 1234444544433333 Q ss_pred hhhhccccccccCCce-eEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 163 KSFDEKPDSETYGQPT-MWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISLEKVEGGSGESF 237 (456) Q Consensus 163 ~~~~~Dp~s~~yg~P~-~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~~~~~~~~~~~~ 237 (456) . .|....-++... +|.+. .+ +..+.++++.||||... ...|.|.++.+...+.....+ ..+...+ T Consensus 146 ~---~d~~~~~~~~~~~~y~~~---~~---g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~-~~~~~~~ 215 (432) T protein:vir:10 146 Y---IDDVGLLNSKTKMWYVVN---TG---GQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASA-DKFINNF 215 (432) T ss_pred E---EcCcccccccceEEEEEe---cC---CeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHH-HHHHHHH Confidence 2 121111122223 34443 12 22467999999998532 245999999998876555444 3344445 Q ss_pred HHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEEecccCCHH--HHHH Q lcl|NC_016762. 238 LKNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQMVSAVSDPG--PTYN 310 (456) Q Consensus 238 ~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~~~~~sgl~--~~~~ 310 (456) +++..+.-. ++.... + .++..+++.+.+....++ .+.++++.+-+|+.++.+..+.. +... T Consensus 216 ~~ng~~p~gil~~~~~-----l-------~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~ 283 (432) T protein:vir:10 216 YKQGLQVKGLVQYVGD-----L-------NEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTE 283 (432) T ss_pred HhccCCccEEEEcCCC-----C-------CHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHH Confidence 565433221 221111 1 122334444455444332 24566777778999887766553 5566 Q ss_pred HHHHHHHhhhcCCeEEeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC--CCC--ceEEEeCC Q lcl|NC_016762. 311 VNLQTAAAGVDIPTKILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP--LKA--EFTAIWDD 385 (456) Q Consensus 311 ~~~~~~aaas~IP~t~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~--~~~--d~~~~f~p 385 (456) ...++||.+.|||...|-....+..++.+ -...||.. -|+|.++.+-+.|-+.-+.+ ... .|.|.++. T Consensus 284 ~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~-------~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~ 356 (432) T protein:vir:10 284 LTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTD-------TLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDA 356 (432) T ss_pred HHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechh Confidence 67889999999999866443444444444 34566643 57888877766654332221 112 35566668 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc---ccCC---CCC-CCCCcC--CCCCCC Q lcl|NC_016762. 386 LTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP---DTEP---EDE-DAARTD--PTGEQQ 456 (456) Q Consensus 386 L~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~---~~~~---~d~-~~~~~d--~~~~~e 456 (456) |...|.+++++ +...++..| +++++|+|+..+++|+++++..- .-.+ .++ ....++ ..+.++ T Consensus 357 l~~~d~~~~~~-------~~~~~~~~G--~~t~NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~~ 427 (432) T protein:vir:10 357 ILRADIKTRYE-------AYRTGIQGG--FLKPNEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKE 427 (432) T ss_pred hhcCCHHHHHH-------HHHHHHhCC--CcCHHHHHHHhCCCCCCCCCeEeecccccchhhccccccCCCCCCCCCCCC Confidence 98888888765 456677777 99999999999999987654321 0000 000 001111 111111 No 23 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=99.74 E-value=1.1e-17 Score=113.59 Aligned_cols=377 Identities=13% Similarity=0.087 Sum_probs=197.3 Q ss_pred HHHhhhhhcc--Ccc---cchhhhhccCc------ccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcch- Q lcl|NC_016762. 22 MSLLNQGIGH--DAK---RPQAWCEYGFP------QEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDR- 89 (456) Q Consensus 22 d~~~n~~~~~--gt~---~~~~~~~~~~~------~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~- 89 (456) |+|..-..+. .+. .......++.+ ..++. ..+..++.+.++|+++|+++-.--+.+...++... T Consensus 1 Mg~f~~lf~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~----~~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~~ 76 (414) T protein:vir:44 1 MVFFSGLFQRKSDAPVTTPAELADAIGLSYDTYTGKQISS----QRAMRLTAVFSCVRVLAESVGMLPCNLYHLNGSLKQ 76 (414) T ss_pred CchhhhhhccCccCcccchhhHhHhhccCccccCCceech----hhhhccHHHHHHHHHHHHHhccCceEEEEecCCcee Confidence 3443311111 000 00011111111 12232 24567899999999999999877777653322111 Q ss_pred hhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhcc Q lcl|NC_016762. 90 SKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEK 168 (456) Q Consensus 90 ~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~D 168 (456) ......+...|...-....-+..|.+.+-+ -.++|.+++++. +++. .+..+.|+....+++. .+ T Consensus 77 ~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~-~~~g-----------~~~~L~~l~~~~v~~~---~~ 141 (414) T protein:vir:44 77 RATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKV-KAFG-----------EVAELLPVDPGCVVPK---LN 141 (414) T ss_pred ecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEE-eCCC-----------cEEEEEEEcCceEEEE---EC Confidence 111111111222222222334455555555 455677776653 3321 1233444433323221 11 Q ss_pred ccccccCCceeEEEeecccCCccccceeeehhhhheecCCc---CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhh- Q lcl|NC_016762. 169 PDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT---GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQ- 244 (456) Q Consensus 169 p~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~---~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~- 244 (456) ..|.+ .|.+.. .+ +....++++.|+||...+ ..|.|.++.+.+.+..... .......++++..+. T Consensus 142 ----~~~~~-~y~~~~--~~---g~~~~~~~~evih~~~~~~d~~~G~s~i~~~~~~i~~~~~-~~~~~~~~f~ng~~p~ 210 (414) T protein:vir:44 142 ----SSWEP-VYQVTF--PD---GSTDVLSQEDIWHVRTLTLDGLVGLNPIAYAREAISLAAA-TEEHGARLFSNGAVTS 210 (414) T ss_pred ----CCCcE-EEEEEe--cC---ceEEEEccccEEEecCCCCCCcccccHHHHHHHHHHHHHH-HHHHHHHHHhccCCCc Confidence 12333 344542 12 234679999999986432 3599999998876654433 344444556654332 Q ss_pred hhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEEecccCCH--HHHHHHHHHHHHh Q lcl|NC_016762. 245 LLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQMVSAVSDP--GPTYNVNLQTAAA 318 (456) Q Consensus 245 l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~~~~~sgl--~~~~~~~~~~~aa 318 (456) ..++.... -.++..+++.+.+....++ ...++++.+-+|+.++.+..+. -+......+.||. T Consensus 211 gil~~~~~------------l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~ 278 (414) T protein:vir:44 211 GVLRTEQT------------LSDQAYERLKKDFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICR 278 (414) T ss_pred eEEEeCCC------------CCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChHHHHHHHHHHHHHHHHHH Confidence 11211111 1123344444444443332 2356677777899988776554 3555567788999 Q ss_pred hhcCCeEEeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCC-Cc--eEEEeCCCCCCCHHHH Q lcl|NC_016762. 319 GVDIPTKILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLK-AE--FTAIWDDLTVPTKAER 394 (456) Q Consensus 319 as~IP~t~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~-~d--~~~~f~pL~~~seke~ 394 (456) +.|||..+|-+..-+..+..+ -.++||.. -|.|.++.+-+.|-+.-+.+.. .. |.|.+..|...+.+++ T Consensus 279 ~fgVpp~~l~~~~~~t~~n~e~~~~~~~~~-------~l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~ll~~d~~~~ 351 (414) T protein:vir:44 279 LFRVPLHMVQNTDRATFNNIEELGLGFINY-------SLVPYLTRIEQRINTGLVRKSKQGVFYAKFNAGALLRGDMKSR 351 (414) T ss_pred HhCCCHHHhCCCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCCccccCceEEEEechhhhccCHHHH Confidence 999999876544434444434 45667754 4678887776666443332211 23 5566668888888887 Q ss_pred HHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCccc-----C--CCCCCCCCcCCCCCCC Q lcl|NC_016762. 395 LANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDT-----E--PEDEDAARTDPTGEQQ 456 (456) Q Consensus 395 Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~-----~--~~d~~~~~~d~~~~~e 456 (456) ++ +.++++..| ++++||+|+..+++|+++++..--. . +..+...++++..++| T Consensus 352 ~~-------~~~~~~~~G--~~t~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~d~ 411 (414) T protein:vir:44 352 FE-------AYATGINWG--IYSPNDCRDLEDMNPRPGGDVYLTPMNMTTKPSDGSKAGKQKDNANADE 411 (414) T ss_pred HH-------HHHHHHhCC--CcCHHHHHHHhCCCCCCCcceecccccccccCCccccCCCCCCCCCCCC Confidence 55 455677777 9999999999999998766542100 0 1122222333433344 No 24 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=99.73 E-value=2.2e-17 Score=111.86 Aligned_cols=380 Identities=14% Similarity=0.069 Sum_probs=191.4 Q ss_pred HHHhhhhhccCccc-------------chhhhhccCc----ccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecC Q lcl|NC_016762. 22 MSLLNQGIGHDAKR-------------PQAWCEYGFP----QEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEG 84 (456) Q Consensus 22 d~~~n~~~~~gt~~-------------~~~~~~~~~~----~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~ 84 (456) |+|.+...+.+.++ +..+..++.+ ..++.+ .+.+++-+.+||+++|+++-.--+.+... T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~V~~~----~al~~~~V~~~v~~Ia~~iA~lp~~~~~~ 76 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPSIYNLGAVAASGETVTPH----DALQVSAVFASVRLLSETIATLPLSTYSK 76 (457) T ss_pred CchhhhhhcccccccccccccccccccchHHHhhcccccCCceechH----HhhccHHHHHHHHHHHHhhccCceEEEEe Confidence 55544433333322 1111122111 122322 23457778899999999987766666543 Q ss_pred CCcchhhhhHHHHHHHHHHHHH----hhHHHHHHHH-HHhhcccCceEEEEEecCCCCccccccCCcCceeEEEEecccc Q lcl|NC_016762. 85 DDQDRSKDETEWERKNKPLIAG----GRFWRAVSEA-DRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGC 159 (456) Q Consensus 85 ~~~d~~~~~~~~e~~i~~~~~~----l~~~~~~~ea-~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~ 159 (456) ......+.. ...+..++.. .. +..|.+. +....++|-++++|. .++.. +..|.|+-... T Consensus 77 ~~~~~~~~~---~~~l~~~ln~~~n~~t-~~~f~~~~~~~lll~Gna~~~i~-~~~g~-----------~~~l~~l~p~~ 140 (457) T protein:vir:13 77 RGGSRKEIV---TPEWLDYPNAEPGGMG-RIDILSQTVLSLLLQGNAFLAVR-WQGPN-----------IVGLDVLDPTK 140 (457) T ss_pred cCCcccccc---cchHHHhccccCCCCC-HHHHHHHHHHHHhhcCCeEEEEE-ecCCc-----------EEEEEEEccCc Confidence 221111110 1112222221 11 2233443 334566787877663 33221 23344444333 Q ss_pred CChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 160 LKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNSFISLEKVEGGSGE 235 (456) Q Consensus 160 ~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~l~~~~~~~~~~~~ 235 (456) +++.....+.. .+.....|.+. .+|.......++++.|||+.... ..|.|.++.+.+.+..... +..... T Consensus 141 v~v~~~~~~~~--~~~~~~~y~~~---~~~~~~~~~~~~~~diih~~~~~~~~~~~G~s~i~~~~~~i~~~~~-~~~~~~ 214 (457) T protein:vir:13 141 IHVHMVMVDGL--RRKVFEAYDID---ADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALA-AQKYGS 214 (457) T ss_pred eEEEEecCCCc--cceeEEEEEEe---cCCceeeEEeeCccceEEecCCCCCCccccccHHHHHHHHHHHHHH-HHHHHH Confidence 33322111111 11111234443 12222223457788998885432 4699999988876654443 444555 Q ss_pred HHHHHhhhhh-hhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEEecccCCH--HHH Q lcl|NC_016762. 236 SFLKNAARQL-LLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQMVSAVSDP--GPT 308 (456) Q Consensus 236 ~~~~~~~~~l-~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~~~~~sgl--~~~ 308 (456) .++++..+.- .++.... -..+..+++.+.++...++ .+.++++.+-+|++++.+..+. -+. T Consensus 215 ~~f~ng~~p~gil~~~~~------------ls~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~ 282 (457) T protein:vir:13 215 KFFANGAMPGAVVEVPGT------------MSEEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQT 282 (457) T ss_pred HHHhcCCCcceEEEcCCC------------CCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHH Confidence 5666644322 1221111 1123445555555555443 2346677777899988776543 355 Q ss_pred HHHHHHHHHhhhcCCeEEeeccCCCccc-ch--H-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCC-C--ceEE Q lcl|NC_016762. 309 YNVNLQTAAAGVDIPTKILVGMQTGERA-SS--E-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLK-A--EFTA 381 (456) Q Consensus 309 ~~~~~~~~aaas~IP~t~L~G~sp~Gln-st--~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~-~--d~~~ 381 (456) ......+||.+.+||-.. +|...++-. ++ + -...||.. -|.|.++++-..|-+.-+.+.. . .|.| T Consensus 283 ~~~~~~~Ia~~fgVPp~~-lg~~~~~~~~~sn~eq~~~~f~~~-------tl~P~~~~ie~~ln~~L~~~~~~~~~~i~f 354 (457) T protein:vir:13 283 RQFQVPEIARIFGVPPHL-ISDATNSTSWGSGLAEQNIAFTMF-------SLRPWLERIEAGFNRLLFAETADRFRFVKF 354 (457) T ss_pred HHHHHHHHHHHhCCCHHH-cCCCCCcccccchHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCccccCceeEEe Confidence 556788999999999874 576555432 22 2 23445543 4678777776655443332211 1 2566 Q ss_pred EeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcc--------------cCCCCC--- Q lcl|NC_016762. 382 IWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPD--------------TEPEDE--- 444 (456) Q Consensus 382 ~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~--------------~~~~d~--- 444 (456) .++.|...+-+++++. ..+++..| ++++||+|+..+++|++++..+.- ..+... T Consensus 355 d~~~l~~~D~~~r~~~-------~~~~~~~G--~~T~NE~R~~~gl~Pi~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~ 425 (457) T protein:vir:13 355 NLDEIKRGAPKERMEL-------WSLGLQNG--IYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEVGEEPEPEPAPAPPA 425 (457) T ss_pred echhhhccCHHHHHHH-------HHHHHhCC--CcCHHHHHHHhCCCCCCCCcccceeeccccccccccccccccCCCCC Confidence 6778888888887655 44566777 999999999999998866411100 000000 Q ss_pred ---CCCCcCCCCCCC Q lcl|NC_016762. 445 ---DAARTDPTGEQQ 456 (456) Q Consensus 445 ---~~~~~d~~~~~e 456 (456) ...++++..+++ T Consensus 426 ~~~~~~~~~~~~~~~ 440 (457) T protein:vir:13 426 IEPPAEEPDEEPEPE 440 (457) T ss_pred CCCCccccCCCCCCC Confidence 000000011111 No 25 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=99.73 E-value=1.3e-17 Score=113.09 Aligned_cols=385 Identities=12% Similarity=0.010 Sum_probs=202.2 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCE Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQ 80 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~ 80 (456) +..|..-. +......-+.+...+.+..+ .....++.+ -+..++-+.++|+++|++.-+--++ T Consensus 6 ~f~~~~~~-----~~~~~~~~~~~~~~~~~~~~---------~~~~~v~~~----~al~~~~v~~~i~~Ia~~ia~l~~~ 67 (416) T protein:vir:12 6 MFEKRSGS-----SDHEDGFNNILLNMFGGRKT---------ASGERVSES----NSLVQPDIFACVNVLSDDIAKLPIH 67 (416) T ss_pred hcccccCc-----cccCccchhHHHHhhcCccc---------ccCceechh----hhhccHHHHHHHHHHHHhhhhCceE Confidence 11111100 00000000000000000000 001122222 1335677889999999999887777 Q ss_pred EecCCCcchhh-hhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccc Q lcl|NC_016762. 81 VIEGDDQDRSK-DETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAG 158 (456) Q Consensus 81 i~~~~~~d~~~-~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~ 158 (456) +....+....+ ....+-..+...=..+.-+..|.+.+.+ -.++|-+++++.-++. +.+..+.|+... T Consensus 68 ~~~~~~~~~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~-----------G~~~~L~~l~~~ 136 (416) T protein:vir:12 68 TYKRTDGGIERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSH-----------GYPEALFPLRPD 136 (416) T ss_pred EEEecCCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-----------CcEEEEEEECCc Confidence 65433221111 1111111222221222333445555554 4556777776653211 113344444433 Q ss_pred cCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc---CCCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 159 CLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT---GDAIGFLEPAYNSFISLEKVEGGSGE 235 (456) Q Consensus 159 ~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~---~~G~S~le~~~~~l~~~~~~~~~~~~ 235 (456) .+++.. ++ .+.+.+|++. .+| ..+.++++.++||...+ ..|.|.++.+.+.+..... ...... T Consensus 137 ~v~v~~---~~----~~~~~~~~~~---~~g---~~~~~~~~eiih~~~~~~~~~~G~s~i~~~~~~i~~~~~-~~~~~~ 202 (416) T protein:vir:12 137 YTNAYV---HP----TTGMLWYQTV---LNG---KAIELYDYEVLHFKGLSTDGIHGKSPIGVVREHIGAQAA-ATKYNA 202 (416) T ss_pred ceEEEE---eC----CCcEEEEEEe---cCC---eEEEecCccEEEecCcCCCCcccccHHHHHHHHHHHHHH-HHHHHH Confidence 333211 11 1234456654 222 24678899998885433 3589999999987655443 344444 Q ss_pred HHHHHhhhh-hhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEecccCCHH--HHHHHH Q lcl|NC_016762. 236 SFLKNAARQ-LLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSAVSDPG--PTYNVN 312 (456) Q Consensus 236 ~~~~~~~~~-l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~~sgl~--~~~~~~ 312 (456) .++++.... ..++... .-.++..+++.+.+..+.++.+.++++.+-+|++++.+..+.+ +..... T Consensus 203 ~~~~ng~~p~~il~~~~------------~~~~e~~~~~~~~~~~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~ 270 (416) T protein:vir:12 203 KLYKNEATPRGILKVPA------------FLDEKPKENVRKEWKRVNKVENIAIIDYGLEYQSISMPLQEAQFVESMKFN 270 (416) T ss_pred HHHhcCCCCceEEecCC------------CCCHHHHHHHHHHHHHHhcCCCeeecCCCceEEEccCChhhHHHHHHHHHH Confidence 566654321 1121111 1113445555555666667777888888889999988776544 667778 Q ss_pred HHHHHhhhcCCeEEeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC--CCc--eEEEeCCCC Q lcl|NC_016762. 313 LQTAAAGVDIPTKILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL--KAE--FTAIWDDLT 387 (456) Q Consensus 313 ~~~~aaas~IP~t~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~--~~d--~~~~f~pL~ 387 (456) ..+||.+.+||...|-+...+..+..+ ..+.||.. -|.|.++.+-+.|-+.-+-+. ..+ +.|.++.|. T Consensus 271 ~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~-------~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~ 343 (416) T protein:vir:12 271 KAQISMIYKVPLHKLNELDKATFSNIEHQSIEYVRN-------TLQPWIVNFEQELNVKLFLDHDQKSGHYVKFNIDSEL 343 (416) T ss_pred HHHHHHHhCCCHHHhCCccCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCchhhcCCceEEeechhhh Confidence 899999999999877555555554443 45677754 578888877776644433221 122 555566777 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc---ccCC---CCCCCC----CcCCCCCCC Q lcl|NC_016762. 388 VPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP---DTEP---EDEDAA----RTDPTGEQQ 456 (456) Q Consensus 388 ~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~---~~~~---~d~~~~----~~d~~~~~e 456 (456) ..+.+++| ++...++..| ++++||+|+..+++|+++++..- +-.+ .++... ....+||+. T Consensus 344 ~~d~~~~~-------~~~~~~~~~G--~~T~NE~R~~~gl~Pi~ggd~~~~~~n~~~~~~~~~~~~~~~~~~~~gge~~ 413 (416) T protein:vir:12 344 RGDSKTQA-------EYLKTLHETG--VLNKDEIRELLERNPIENGDKYISSLNYVFLDFLEEYQRLKAGGAMKGGDNK 413 (416) T ss_pred ccCHHHHH-------HHHHHHHhCC--CcCHHHHHHHhCCCCCCCcceeeeccccccccccchhhccccccccCCCCCc Confidence 77877765 4456677878 99999999999999997664321 0000 111100 112344444 No 26 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=99.72 E-value=4.8e-18 Score=115.55 Aligned_cols=357 Identities=11% Similarity=-0.012 Sum_probs=194.9 Q ss_pred HHHhhhhh--cc-----CcccchhhhhccCc----ccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchh Q lcl|NC_016762. 22 MSLLNQGI--GH-----DAKRPQAWCEYGFP----QEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRS 90 (456) Q Consensus 22 d~~~n~~~--~~-----gt~~~~~~~~~~~~----~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~ 90 (456) |++.+... .. .+..+..+-...+. ..++. .-|.+++.+.++|+++|+++-.--+++...... T Consensus 1 Mg~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~~----~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~~~--- 73 (383) T protein:vir:10 1 MGLLTPKNFSKRNAKNMVYPSNPAFFTTTVGGMQLSYVSA----LSALQNTNVYSVINRIASDVSSAHFKTENTATL--- 73 (383) T ss_pred CCcccccccccccccccccccchhhhhhhccCccccccch----hHhhcchHHHHHHHHHHHhhccCceeecccchh--- Confidence 34433210 00 00001111111111 11222 224568889999999999998877776321110 Q ss_pred hhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccc Q lcl|NC_016762. 91 KDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKP 169 (456) Q Consensus 91 ~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp 169 (456) ..+++ -..+.-+..|.+.+.+ -.++|-+++++. ++ . ..+.|....++.+. .|. T Consensus 74 -------~ll~~-PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~-~~--~------------~~~~p~~~~~v~~~---~~~ 127 (383) T protein:vir:10 74 -------NRLES-PSSLIGRFSFWQGALMQLCLSGNDYIPLV-GQ--N------------LEHIPNSDVQINYL---PGN 127 (383) T ss_pred -------hhhhC-CCCCCCHHHHHHHHHHHhhhcCCeEEEEE-cC--c------------eeEeecCcceEEEE---EcC Confidence 01111 1112233344444444 456777777652 22 1 11223332222211 111 Q ss_pred cccccCCceeEEEeecccCCccccceeeehhhhheecCC------cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_016762. 170 DSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW------TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAAR 243 (456) Q Consensus 170 ~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~------~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~ 243 (456) ...+|.+... .+ +..+.+.++.|+||... ...|.|.++.+...+.....+... ...++++... T Consensus 128 ------~~~~~~~~~~-~~---~~~~~~~~~evih~r~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~-~~~~f~ng~~ 196 (383) T protein:vir:10 128 ------MGIVYTVLES-ND---RPKMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKS-NMSAMENQIN 196 (383) T ss_pred ------CceEEEEEEc-CC---ceEEEEcccceEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHH-HHHHHhccCC Confidence 1233444321 11 22466888999888432 134999999998877665554433 3345555433 Q ss_pred hhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC---CeEEecCCCceeEEecccCCHHH---HHHHHHHHHH Q lcl|NC_016762. 244 QLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGN---DVLLPTQGATVTQMVSAVSDPGP---TYNVNLQTAA 317 (456) Q Consensus 244 ~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~lid~~d~~~~~~~~~sgl~~---~~~~~~~~~a 317 (456) .-.+...+ . +...++..+++.+.++.+.++. +.++++.+.+|+.++.+....+. +.....++|| T Consensus 197 ~~~il~~~--------~--~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia 266 (383) T protein:vir:10 197 PAGKLTIS--------N--YLSDGKDLESAREEFEKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQIS 266 (383) T ss_pred cceEEEeC--------C--CCCCHHHHHHHHHHHHHHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHH Confidence 22111110 0 0111344445555555554432 35677778899999988876654 4455678999 Q ss_pred hhhcCCeEEeeccCCCcccch--HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHH Q lcl|NC_016762. 318 AGVDIPTKILVGMQTGERASS--EDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERL 395 (456) Q Consensus 318 aas~IP~t~L~G~sp~Glnst--~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~A 395 (456) .+.|||-.+|-+...+..+.+ +..+.+|.. -|+|.++.+-+.|-+.-++ .++.|.+++|...+.+++| T Consensus 267 ~afgVPp~~lg~~~~~~~~~sn~eq~~~~~~~-------~l~P~~~~ie~~l~~~l~~---~~~~f~~~~l~~~d~~~~~ 336 (383) T protein:vir:10 267 KAFGVPSDILGGGTSTESQHSNIDQIKATYLA-------NLNSYVNPIVDELRLKMNA---PDLELDIKDMLDVDDSILI 336 (383) T ss_pred HHhCCCHHHcCCccCCCCccccHHHHHHHHHH-------HHHHHHHHHHHHHHHhhCC---ceEEeechhhhccCHHHHH Confidence 999999887755555554432 334444422 2788888776666443333 2689999999999998875 Q ss_pred HHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 396 ANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 396 ei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) + +...+++.| ++++||+|+..+++|+..++.+.. .....+-+++|+| T Consensus 337 ~-------~~~~~~~~G--~~t~nE~R~~lg~~p~~~~d~~~~-----~~~~~~~~gGd~e 383 (383) T protein:vir:10 337 N-------QVSNLAKSG--VLGAEQAQFILTRSGFLPDNLPEF-----KPLTNETKGGDDK 383 (383) T ss_pred H-------HHHHHHhCC--CcCHHHHHHHhCCCcccCCccccc-----CCCcccCCCCCCC Confidence 5 456677778 999999999999999876543321 2233445678888 No 27 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=99.71 E-value=3e-17 Score=111.13 Aligned_cols=386 Identities=13% Similarity=0.073 Sum_probs=194.7 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCE Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQ 80 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~ 80 (456) +..++-.-.+.+.... ...+.+.. ++ +.+. .....++. ..|.+++.+.++|+++|+++-.--+. T Consensus 2 ~f~~~f~r~~~~~~~~----~~~~~~~~-~~--~~~~-----~~g~~v~~----~~~l~~~~v~~~i~~Ia~~iA~~p~~ 65 (413) T protein:vir:48 2 FFSGLFQRKSDAPVTT----PAELAEAI-GL--SYDT-----YTGKRISS----QRAMRLTAVYSCVRVLAESVGMLPCS 65 (413) T ss_pred ccchhhccCccCCccc----hHHHHHhh-hc--Cccc-----ccCceech----hhhhccHHHHHHHHHHHHhhhhCceE Confidence 2232221111110000 01111110 11 1100 00112232 23556888999999999999877777 Q ss_pred EecCCCcchhh-hhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccc Q lcl|NC_016762. 81 VIEGDDQDRSK-DETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAG 158 (456) Q Consensus 81 i~~~~~~d~~~-~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~ 158 (456) +...++..... ....+...+...=....-+..|.+.+-+ -.++|.+++++. +++ +.+..+.|+... T Consensus 66 ~~~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~-~~~-----------g~~~~L~~l~~~ 133 (413) T protein:vir:48 66 LYKISGTLKTRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKV-KAL-----------GEVVELLPIDPG 133 (413) T ss_pred EEEecCCcceeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEE-eCC-----------CcEEEEEEEcCc Confidence 65433221111 1111111121111112233345555444 455677766653 221 112344444333 Q ss_pred cCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc---CCCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 159 CLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT---GDAIGFLEPAYNSFISLEKVEGGSGE 235 (456) Q Consensus 159 ~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~---~~G~S~le~~~~~l~~~~~~~~~~~~ 235 (456) .+++. .|+ .+.| .|.+.. .+ +....++++.||||...+ ..|.|.++.+.+.+.....+ ..... T Consensus 134 ~v~~~---~~~----~~~~-~y~~~~--~~---g~~~~~~~~evih~~~~~~d~~~G~s~i~~~~~~i~~~~~~-~~~~~ 199 (413) T protein:vir:48 134 CVEPK---LNS----QWQP-VYQVTF--PD---GSVDVLTQDEIWHVRTLTLDGLVGLNPIAYAREAISLAAAT-EEHGA 199 (413) T ss_pred eEEEE---EcC----CceE-EEEEEe--cC---ceEEEEccccEEEecCcCCCCcccccHHHHHHHHHHHHHHH-HHHHH Confidence 33321 111 1223 344431 12 223568899999885432 45999999999877554443 44445 Q ss_pred HHHHHhhhh-hhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEEecccCCH--HHH Q lcl|NC_016762. 236 SFLKNAARQ-LLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQMVSAVSDP--GPT 308 (456) Q Consensus 236 ~~~~~~~~~-l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~~~~~sgl--~~~ 308 (456) .++++..+. ..++... .-..+..+++.+.+....++ ...++++.+.+|+.++.+..+. -+. T Consensus 200 ~~~~ng~~p~gil~~~~------------~~~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~ 267 (413) T protein:vir:48 200 RLFGNGAVTSGVLRTEQ------------KLTPDAYERLKKDFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLET 267 (413) T ss_pred HHHhccCCcceEEEeCC------------CCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEeccCChhHHHHHHH Confidence 566654322 1111111 11123344444444443332 2346677778899988777655 366 Q ss_pred HHHHHHHHHhhhcCCeEEeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC---CCceEEEeC Q lcl|NC_016762. 309 YNVNLQTAAAGVDIPTKILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL---KAEFTAIWD 384 (456) Q Consensus 309 ~~~~~~~~aaas~IP~t~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~---~~d~~~~f~ 384 (456) .......||.+.|||...|-+..-+..+..+ ...+||.. -|.|.++.+-+.|-+.-+-+. .-.|.|.+. T Consensus 268 ~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~-------~i~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~ 340 (413) T protein:vir:48 268 RKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINY-------SLVPYLTRIEQRINTGLVRESKQGKFYAKFNAG 340 (413) T ss_pred HHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhccCccccCCeEEEEech Confidence 6677889999999999866544334444333 45667754 467888877666544322211 123566667 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc------ccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 385 DLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP------DTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 385 pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~------~~~~~d~~~~~~d~~~~~e 456 (456) .|...|.+++++ +.+++++.| ++++||+|+..+++|+++++..- ......++..++.++++.+ T Consensus 341 ~l~~~d~~~~~~-------~~~~~~~~g--~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~ 409 (413) T protein:vir:48 341 ALLRGDMKSRFE-------AYATGINWG--IYSPNDCRDLEDMNPRPGGDVYLTPMNMTTSPSAGDDNGKKKESGDAD 409 (413) T ss_pred hhhccCHHHHHH-------HHHHHHhCC--CcCHHHHHHHhCCCCCCCcceeeccccccccccccccCCCCCCCCCcc Confidence 888777777655 455677878 99999999999999987654421 1111112222222222222 No 28 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=99.71 E-value=3.1e-17 Score=111.12 Aligned_cols=379 Identities=16% Similarity=0.124 Sum_probs=191.4 Q ss_pred HHHhhhhhccCcccc------hhhh-------hccCc----ccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecC Q lcl|NC_016762. 22 MSLLNQGIGHDAKRP------QAWC-------EYGFP----QEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEG 84 (456) Q Consensus 22 d~~~n~~~~~gt~~~------~~~~-------~~~~~----~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~ 84 (456) |+|.+...+.+.+.. +.|. .++.+ ..++.+ .+.++..+.++|+++++++-.--+++... T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~----~al~~~~v~~~i~~ia~~iA~lp~~~~~~ 76 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAWEPYDPSIYNLGATASSGERVTPH----DALQVSAVFASVRLLSETIATLPLSTYSK 76 (457) T ss_pred CchhhhhhccccccccccccccccccchhhhhhccccccCCceechH----HhhccHHHHHHHHHHHHhHhhCceEEEEe Confidence 555554333332211 0111 01111 122222 24467889999999999998777777543 Q ss_pred CCcchhhhhHHHHHHHHHHHHH---hhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccC Q lcl|NC_016762. 85 DDQDRSKDETEWERKNKPLIAG---GRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCL 160 (456) Q Consensus 85 ~~~d~~~~~~~~e~~i~~~~~~---l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~ 160 (456) ........... .+..++.+ ..-+..|.+.+.+ -.++|-++++|. +++ +++..+.|+-...+ T Consensus 77 ~~~~~~~~~~~---~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~-~~~-----------g~~~~l~~l~p~~v 141 (457) T protein:vir:62 77 RGGTRKEIDTP---EWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVR-WAG-----------PNIAGLDVLDPTKI 141 (457) T ss_pred cCCccccccch---HHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEE-eCC-----------CcEEEEEEEcCcce Confidence 32211111110 11222211 1123334444444 566677777663 221 11333444443333 Q ss_pred ChhhhhccccccccCCcee--EEEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 161 KPKSFDEKPDSETYGQPTM--WEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNSFISLEKVEGGSG 234 (456) Q Consensus 161 ~~~~~~~Dp~s~~yg~P~~--y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~l~~~~~~~~~~~ 234 (456) ++.....+ .+....+ |.+. .+|.......++++.||||.... ..|.|.++.+.+.+... ..+.... T Consensus 142 ~v~~~~~~----~~~~~~~~~y~~~---~~g~~~~~~~~~~~eiih~r~~~~~~~~~G~sp~~~~~~~i~~~-~~~~~~~ 213 (457) T protein:vir:62 142 HVHMVMVD----GLRRKVFEAYDID---ADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLA-LAAQKYG 213 (457) T ss_pred EEEEeccC----CccceeEEEEEEc---cCCceeEEEeeCccceEEecCCCCCCceecccHHHHHHHHHHHH-HHHHHHH Confidence 33221111 1122222 3332 22222223457889999885432 45899999888766544 3344555 Q ss_pred HHHHHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEEecccCC--HHH Q lcl|NC_016762. 235 ESFLKNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQMVSAVSD--PGP 307 (456) Q Consensus 235 ~~~~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~~~~~sg--l~~ 307 (456) ..+|++..+.-. ++.... + ..+..+++.+.+....++ .+.++++.+-+|+.++.+..+ +-+ T Consensus 214 ~~~f~ng~~p~gil~~~~~-----l-------s~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e 281 (457) T protein:vir:62 214 AHFFRNGAMPGAVVEVPGT-----M-------SEEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQ 281 (457) T ss_pred HHHHhccCCcceEEEcCCC-----C-------CHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHH Confidence 566776443221 221111 1 123345555555554443 235677777889998877654 345 Q ss_pred HHHHHHHHHHhhhcCCeEEeeccCCCccc-c--hH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC---CCCceE Q lcl|NC_016762. 308 TYNVNLQTAAAGVDIPTKILVGMQTGERA-S--SE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP---LKAEFT 380 (456) Q Consensus 308 ~~~~~~~~~aaas~IP~t~L~G~sp~Gln-s--t~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~---~~~d~~ 380 (456) ........||.+.|||-. ++|...++-. + .+ -...||..+ |.|.++.+-..|-+.-+.+ ....+. T Consensus 282 ~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~~~sn~eq~~~~f~~~~-------l~P~~~~ie~~ln~~L~~~~~~~~~~i~ 353 (457) T protein:vir:62 282 TRQFQVPEIARIFGVPPH-LISDATNSTSWGSGLAEQNIAFTMFS-------LRPWLERIEAGFNRLLFAETADRFRFVK 353 (457) T ss_pred HHHHHHHHHHHHhCCCHH-HcCCCCCcccccchHHHHHHHHHHHH-------HHHHHHHHHHHHHhhhcCccccCceEEE Confidence 556678899999999986 5576554432 2 12 234566553 6787777655554332221 111356 Q ss_pred EEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCC-----------cc---cCCC---- Q lcl|NC_016762. 381 AIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPL-----------PD---TEPE---- 442 (456) Q Consensus 381 ~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~-----------~~---~~~~---- 442 (456) |.++.|...+.+++++.. .++++.| ++++||+|+..+++|++++..+ .. ..+. T Consensus 354 fd~~~l~~~d~~~r~~~~-------~~~~~~G--~~T~NE~R~~~gl~pi~~g~~D~~~~~~n~~~~~~~~~~~~~~~~~ 424 (457) T protein:vir:62 354 FNLDEIKRGAPKERMELW-------SLGLQNG--IYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEIGEEPEPEPAPAPP 424 (457) T ss_pred eechhhhccCHHHHHHHH-------HHHHhCC--CcCHHHHHHHhCCCCCCCCCcceeeeccccccccccccccccCCCc Confidence 667789888888876654 4566777 9999999999999988764110 00 0000 Q ss_pred --CC---CCCCc----C-C----CCCCC Q lcl|NC_016762. 443 --DE---DAART----D-P----TGEQQ 456 (456) Q Consensus 443 --d~---~~~~~----d-~----~~~~e 456 (456) ++ ++.++ + + ..+.| T Consensus 425 ~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 452 (457) T protein:vir:62 425 AIDPPAEEPADDEEPDNAEGDPDEGETE 452 (457) T ss_pred cCCCCccCCCCCCCCCCCCCCCcccccc Confidence 00 00000 0 0 00000 No 29 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=99.71 E-value=4e-17 Score=110.46 Aligned_cols=382 Identities=11% Similarity=0.072 Sum_probs=192.2 Q ss_pred HHHhhhhhccC---ccc-------ch-hhhhccCcc---cCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCc Q lcl|NC_016762. 22 MSLLNQGIGHD---AKR-------PQ-AWCEYGFPQ---EITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQ 87 (456) Q Consensus 22 d~~~n~~~~~g---t~~-------~~-~~~~~~~~~---~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~ 87 (456) |++.+-..+.. +.. +. ...-+|... ..+. ..+| .++.+++||+++|+++-+--+.+....+. T Consensus 1 M~~~~~~f~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~---~~al-~~~~v~~~i~~ia~~ia~l~~~~~~~~~~ 76 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKG---KNAL-KVATVFACIKILSESVSKLPLKIYQEDEY 76 (429) T ss_pred CchhhhhhcccccCcccccccCCChHHHHHHhcCCCCcceech---hhhh-ccHHHHHHHHHHHHhhccCceEEEEecCC Confidence 34433221110 000 00 011111111 1222 2344 47899999999999998777776533222 Q ss_pred ch-hhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhh Q lcl|NC_016762. 88 DR-SKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSF 165 (456) Q Consensus 88 d~-~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~ 165 (456) .. ......+...+...=....-+..|.+.+.+ -.++|-+++++.- |+. +.+..+.|+....+++. T Consensus 77 ~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r-~~~----------G~~~~L~~i~~~~v~v~-- 143 (429) T protein:vir:10 77 GIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEF-DRK----------GKVQALWPIDASKVTVY-- 143 (429) T ss_pred ceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEE-CCC----------CcEEEEEEEcCceeEEE-- Confidence 11 111111111111110011122334444444 4566777776542 211 12344555544434331 Q ss_pred hccccc-cccCCceeEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 166 DEKPDS-ETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISLEKVEGGSGESFLKN 240 (456) Q Consensus 166 ~~Dp~s-~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~ 240 (456) .|... ..++...+|.+. .+ +..+.++++.||||... ...|.|.++.+...+-....+ ..+...++++ T Consensus 144 -~~~~~~~~~~~~~~~~~~---~~---g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~-~~~~~~~~~n 215 (429) T protein:vir:10 144 -IDDVGLLNSKTKMWYVVN---TG---GQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASA-DKFINNFYKQ 215 (429) T ss_pred -EcCcccccccceEEEEEc---cC---CeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHH-HHHHHHHHhc Confidence 12111 122222344443 12 22467999999998532 245999999998866554443 3444455665 Q ss_pred hhhhh-hhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhc---C-CCeEEecCCCceeEEecccCCHH--HHHHHHH Q lcl|NC_016762. 241 AARQL-LLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNR---G-NDVLLPTQGATVTQMVSAVSDPG--PTYNVNL 313 (456) Q Consensus 241 ~~~~l-~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~---~-~~~~lid~~d~~~~~~~~~sgl~--~~~~~~~ 313 (456) ..+.- .++.... + .++..+++.+.+....+ | .+.++++.+-+|+.++.+..+.+ +...... T Consensus 216 g~~~~~il~~~~~-----l-------~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~ 283 (429) T protein:vir:10 216 GLQVKGLVQYVGD-----L-------NEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTI 283 (429) T ss_pred cCCccEEEEcCCC-----C-------CHHHHHHHHHHHHHHhccccccCceeecCCCceEEEccCChhHHHHHHHHHHHH Confidence 43321 1221111 1 12233444444444332 2 24566666778999887765443 4456778 Q ss_pred HHHHhhhcCCeEEeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC--CCC--ceEEEeCCCCC Q lcl|NC_016762. 314 QTAAAGVDIPTKILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP--LKA--EFTAIWDDLTV 388 (456) Q Consensus 314 ~~~aaas~IP~t~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~--~~~--d~~~~f~pL~~ 388 (456) ++||.+.|||...|-+...+..++.+ ....||. ..|.|.++.+-+.|-+.-+.+ ... .|.|.+..|.. T Consensus 284 ~~Ia~~fgVP~~~lg~~~~~~~sn~e~~~~~f~~-------~~l~P~~~~ie~~ln~kl~~~~~~~~g~~~~fd~~~ll~ 356 (429) T protein:vir:10 284 RQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYT-------DTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILR 356 (429) T ss_pred HHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHH-------HHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhc Confidence 89999999999766444444444433 4556663 347888777766654432221 122 34555668988 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcc---cCC-----------CCCCCCCcCCCCC Q lcl|NC_016762. 389 PTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPD---TEP-----------EDEDAARTDPTGE 454 (456) Q Consensus 389 ~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~---~~~-----------~d~~~~~~d~~~~ 454 (456) .|.+++++ +...++..| ++++||+|+..+++|+++++..-- -.+ .+++.+.++++.| T Consensus 357 ~d~~~~~~-------~~~~~~~~G--~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~d~~~~~~~k~g~~~~~~~~~~~e 427 (429) T protein:vir:10 357 ADIKTRYE-------AYRTGIQGG--FLKPNEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNE 427 (429) T ss_pred CCHHHHHH-------HHHHHHhCC--CcCHHHHHHHhCCCCCCCcCeeeecccccchhhccccccCCCCCCCCCCCCCCC Confidence 88888765 445677777 999999999999999876543210 000 0111111111111 Q ss_pred CC Q lcl|NC_016762. 455 QQ 456 (456) Q Consensus 455 ~e 456 (456) -- T Consensus 428 ~~ 429 (429) T protein:vir:10 428 GN 429 (429) T ss_pred CC Confidence 11 No 30 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=99.70 E-value=4.5e-17 Score=110.23 Aligned_cols=388 Identities=14% Similarity=0.133 Sum_probs=191.7 Q ss_pred CCchhHHHHhHHH-HHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCC Q lcl|NC_016762. 1 MTDKLDLAVNHAM-SSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNP 79 (456) Q Consensus 1 ~~~~~~~~~~~a~-~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~ 79 (456) ......++ ... ........++|... ..+|.+....+...-..|..++.+.+||++++++.-.--+ T Consensus 3 ~~~~~~~~--~p~~~~~~~~~~~~~~~~------------~~~g~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~ 68 (518) T protein:vir:78 3 LANGQTLS--APAMAELSPQMQDSYYYA------------PAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPV 68 (518) T ss_pred ccCceeec--cchhhhhhhhhhhccccc------------ceeceecccccchhhHHhhhhHHHHHHHHHHHHhhccCce Confidence 00000000 000 00000111111100 0012221112223335688899999999999999987766 Q ss_pred EEecCCCcchhhhhHHHHHHHHHHHHHh---hHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEe Q lcl|NC_016762. 80 QVIEGDDQDRSKDETEWERKNKPLIAGG---RFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPA 155 (456) Q Consensus 80 ~i~~~~~~d~~~~~~~~e~~i~~~~~~l---~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~ 155 (456) .+...+.+...+.... .+..++.+= .-+..|.+.+-. -.++|.+++++.- +.. +.++.+.|+ T Consensus 69 ~l~~~~~~~~~~~~~~---~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r-~~~----------G~~~~L~~l 134 (518) T protein:vir:78 69 KCMFTSGDTETEEHDT---GYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQK-NKS----------GTPEKLMPM 134 (518) T ss_pred EEEEEcCCccccccch---HHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEE-cCC----------CcEEEEEEE Confidence 7654332221111111 122222211 123345554444 4456777776532 221 123445555 Q ss_pred ccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHHHHHHHHHHH Q lcl|NC_016762. 156 WAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNSFISLEKVEG 231 (456) Q Consensus 156 ~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~l~~~~~~~~ 231 (456) ....+++.. |. .+-+..|++.... +..+..+.++++.||||...+ ..|.|.++.+.+.+..... +. T Consensus 135 ~p~~Vtv~~---~~----~~~~~~y~~~~~~--~~~~~~~~~~~~eIiHir~~~~dg~~~G~Spi~~~~~~i~~~~a-a~ 204 (518) T protein:vir:78 135 HPSRVAIKR---NS----RTGRYEYYFQAGA--GVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDS-SR 204 (518) T ss_pred CCCceEEEE---cC----CCCEEEEEEEecC--CccceeEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHH-HH Confidence 444343321 11 1223445554221 112234567888898885432 2489999988887655444 34 Q ss_pred HHHHHHHHHhhhhh-hhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC---C-CeEEecCCCceeEEecccCC-- Q lcl|NC_016762. 232 GSGESFLKNAARQL-LLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG---N-DVLLPTQGATVTQMVSAVSD-- 304 (456) Q Consensus 232 ~~~~~~~~~~~~~l-~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---~-~~~lid~~d~~~~~~~~~sg-- 304 (456) .....++++..+.- .++.... + .++..+++.+.+....++ . ..++++.+-+|+.++.+..+ T Consensus 205 ~~~~~~f~Ng~~p~gvl~~~~~-----l-------s~e~~~~~k~~~~~~~~G~~nag~~~vL~~G~~~~~l~~~~~d~q 272 (518) T protein:vir:78 205 NATAAMWKNAGRPNLVLRHEKR-----L-------SPEAQQRLREQFDRAHAGSSNTGKTMVVEEGMEPIPLQLTAVEMQ 272 (518) T ss_pred HHHHHHHhcCCCccEEEecCCC-----C-------CHHHHHHHHHHHHHHhcCcccCCceeEcCCCceEEeccCChhHHH Confidence 44556666644321 1221111 1 123344454445444332 2 35667777889998876654 Q ss_pred HHHHHHHHHHHHHhhhcCCeEEeeccCCC-cccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC---CCCce Q lcl|NC_016762. 305 PGPTYNVNLQTAAAGVDIPTKILVGMQTG-ERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP---LKAEF 379 (456) Q Consensus 305 l~~~~~~~~~~~aaas~IP~t~L~G~sp~-Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~---~~~d~ 379 (456) +-+......+.||.+.|||-.+| |...+ ..+..+ -...||..+ |.|.+.++-..|-+. +.+ ....+ T Consensus 273 ~le~r~~~~~eIa~afgVPp~~l-g~~~~st~sn~e~~~~~f~~~t-------L~P~~~~ie~eln~~-L~~~~~~~~~~ 343 (518) T protein:vir:78 273 FIEARQLNREEVCGVYDIAPPIV-HILDRATFSNISAQMRAFYRDT-------MAIPIARIQSAMDKY-VGQYWVRKNRM 343 (518) T ss_pred HHHHHHHHHHHHHHHhCCCHHHh-ccCCCCCchhHHHHHHHHHHHH-------HHHHHHHHHHHHHHh-hcccccCcceE Confidence 34555567789999999998755 65432 222223 345666543 778777776655432 221 12245 Q ss_pred EEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCC--CCCC---------ccc---C-CCCC Q lcl|NC_016762. 380 TAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQG--GDPL---------PDT---E-PEDE 444 (456) Q Consensus 380 ~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~--~~~~---------~~~---~-~~d~ 444 (456) .|....|..++.++++ ++...++..| ++++||+|+..+++|+++ ++.. ... . ...+ T Consensus 344 ~fd~~~Llr~D~~~r~-------~~~~~~~~~G--~lT~NE~R~~~gl~pie~~~gD~~~v~~n~~pl~~~~~~~~~g~~ 414 (518) T protein:vir:78 344 KFDIDDVIQPDWEAKS-------ESTQKMVNSG--VATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEE 414 (518) T ss_pred EeechhhhccCHHHHH-------HHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCceeeecccceecccccccccCCCC Confidence 5555688888887764 4456667777 999999999999988753 2211 000 0 0111 Q ss_pred CCCCcCCCCC------CC Q lcl|NC_016762. 445 DAARTDPTGE------QQ 456 (456) Q Consensus 445 ~~~~~d~~~~------~e 456 (456) .+.+++|... +. T Consensus 415 ~~~~~~~~~~~~~~~~~~ 432 (518) T protein:vir:78 415 APAPKRPASTPVASLDQS 432 (518) T ss_pred CCCCCCCCcccccccccC Confidence 1111112111 11 No 31 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=99.70 E-value=2.9e-16 Score=105.73 Aligned_cols=405 Identities=13% Similarity=0.086 Sum_probs=182.2 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccC---cccCCHHHH---HHHHhcCchhhhhhccchhHH Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGF---PQEITFNDL---YTMYRRGGIAHGAVEKIVTTC 74 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~---~~~~~~~~l---~~~Y~~~~l~r~iVd~~aed~ 74 (456) -+.++....++-....+...+++=....+.-..+-+.. ..|| |...++.+| ...|..|+++++||++.++.. T Consensus 23 ~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~--~~g~~~~~~~~~~~~l~~l~~~~~~npiv~~~I~~~a~~i 100 (547) T protein:vir:63 23 VDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSA--NPGFKTKPSIRNNQDLHGVLKKFGGNIILNAIINTRSNQV 100 (547) T ss_pred cccccchhhhhhhHHHHHHhhcccchhhhchhhheeec--ccccccCCccCChhHHHHHHHHhhcCHHHHHHHHHHHHHH Confidence 22222222222222222222221100000000000000 0112 123455554 457999999999999999876 Q ss_pred hhC---------C--CEE--ecCCCcchhhhhHHHHHHHHHHHHHhhH--------HHHHHHHHHh-hcccCceEEEEEe Q lcl|NC_016762. 75 WKT---------N--PQV--IEGDDQDRSKDETEWERKNKPLIAGGRF--------WRAVSEADRR-RLVGRYSGLLLHI 132 (456) Q Consensus 75 tR~---------~--~~i--~~~~~~d~~~~~~~~e~~i~~~~~~l~~--------~~~~~ea~~~-~r~~Ggs~i~i~i 132 (456) .+- + +++ +..+.....+....+ ..++..+.+.+. +..|.+++-. -.++|.+++.+. T Consensus 101 a~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~-~~l~~~l~~pn~~~~p~~~s~~~f~~~lv~d~ll~Gn~~~~i~- 178 (547) T protein:vir:63 101 SMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATI-KRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKV- 178 (547) T ss_pred hhhhhhhhhhccCCCceeEecccccccChhhHHHH-HHHHHHHHhhCCCCCCccchHHHHHHHHHHHHHhhCCEEEEEE- Confidence 531 1 233 222222222222222 135555555442 2345555444 456676665543 Q ss_pred cCCCCccccccCCcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC---- Q lcl|NC_016762. 133 RDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW---- 208 (456) Q Consensus 133 ~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~---- 208 (456) .|.. +.+..|.|+....+.+. .++..-....+..|... .++. ....++++.||||... T Consensus 179 rd~~----------G~~~~L~~l~p~~V~~~---~~~~g~~~~~~~~y~~~---~~~~--~~~~~~~~eiih~r~n~~~~ 240 (547) T protein:vir:63 179 FNRN----------QSMVRFVAKDPTTIFFA---TTADGKIPDNGNRFVQV---IDQK--IVATFNAREMAFAVRNPRSD 240 (547) T ss_pred ECCC----------CcEEEEEEecCceeEEE---ECCccccccCceEEEEE---cCCc--EEEEeccccEEEecccCCCC Confidence 3321 11233344333323221 11111111112223221 1111 1245677778877432 Q ss_pred ---cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC- Q lcl|NC_016762. 209 ---TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG- 283 (456) Q Consensus 209 ---~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~- 283 (456) ...|.|.++.+.+.+..... +......+|++....-. +.+.... .+ .++..+++.+.+....++ T Consensus 241 ~~~~~~G~Spi~~~~~~i~~~~~-a~~~~~~~f~Ng~~p~giL~~~~~~---~l-------s~e~~~~lk~~~~~~~~G~ 309 (547) T protein:vir:63 241 IYATGYGYPELEIALKQFIAHEN-TEAFNDRFFSHGGTTRGILQIKAAQ---QQ-------SQHALEIFKREWKNSLSGI 309 (547) T ss_pred cccccccccHHHHHHHHHHHHHH-HHHHHHHHHHcCCCcceEEEecCCC---CC-------CHHHHHHHHHHHHHHhcCc Confidence 23499999999887765544 34455566776543211 1110000 01 123334444444443332 Q ss_pred --CCe-EEec-CCCceeEEecccCCH--HHHHHHHHHHHHhhhcCCeEEeeccCC-Cc--------ccc--hH-HHHHHH Q lcl|NC_016762. 284 --NDV-LLPT-QGATVTQMVSAVSDP--GPTYNVNLQTAAAGVDIPTKILVGMQT-GE--------RAS--SE-DQKYHN 345 (456) Q Consensus 284 --~~~-~lid-~~d~~~~~~~~~sgl--~~~~~~~~~~~aaas~IP~t~L~G~sp-~G--------lns--t~-D~~nyy 345 (456) -+. .++. .+-+|+.++.+..+. -+......+.||.+.|||-.. +|... +. ++. .+ -.+.|| T Consensus 310 ~nagk~~vl~~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~-lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~ 388 (547) T protein:vir:63 310 NGSWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAE-INIPNNGGATGSKGGSLNEGNSAEKNQASK 388 (547) T ss_pred ccccccccccCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHH-cCcccccccccccccccchhhHHHHHHHHH Confidence 232 3443 344777777655443 344455677899999999974 44321 11 111 11 123343 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHh Q lcl|NC_016762. 346 ARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEA 425 (456) Q Consensus 346 d~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~ 425 (456) . .-|.|.+.++-..|-+.-+......+.|+|+-+...++.+++++. ..+..| +++++|+|+.. T Consensus 389 ~-------~tL~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~~~~~~~~~~~~~--------~~~~~g--~lT~NE~R~~~ 451 (547) T protein:vir:63 389 N-------KGLQPLLGFIEDFINKHIVAEFGDKYTFQFVGGDIKSELESVKIL--------AEKAKV--AMTVNEVRKEL 451 (547) T ss_pred H-------HHHHHHHHHHHHHHHhhcccccCCceEEEeeccccccHHHHHHHH--------HHHhCC--CcCHHHHHHHh Confidence 3 346777777655553332222234689999998888877765431 234456 89999999999 Q ss_pred cccCC-CCCCCCc------------------ccCC------------CCCCCCCcCCCC--CCC Q lcl|NC_016762. 426 GYDPL-QGGDPLP------------------DTEP------------EDEDAARTDPTG--EQQ 456 (456) Q Consensus 426 ~~~~~-~~~~~~~------------------~~~~------------~d~~~~~~d~~~--~~e 456 (456) +++|. ++++..- +... .++++++++... +.+ T Consensus 452 gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (547) T protein:vir:63 452 NLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTT 515 (547) T ss_pred CCCCCCCCCceeecccccccccccccccCCccccchhhccccccccCCCCCCCCCCCCCCcccC Confidence 98873 4333210 0000 000011111000 000 No 32 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=99.70 E-value=1.2e-16 Score=107.81 Aligned_cols=402 Identities=13% Similarity=0.095 Sum_probs=182.4 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccC-cccCCHHHH---HHHHhcCchhhhhhccchhHHhh Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGF-PQEITFNDL---YTMYRRGGIAHGAVEKIVTTCWK 76 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~-~~~~~~~~l---~~~Y~~~~l~r~iVd~~aed~tR 76 (456) |.+-++.|.+. -+.+ +.....+ -......|+. |..+++..| ...|..|.++++||+++++..-. T Consensus 39 ~~~~~~k~~~~-~~~a-------~~~~~~~----~~~~~~~~~~r~~~~~~~~l~~~~~~~~~npiv~~~I~~ia~~IA~ 106 (551) T protein:vir:80 39 EQEQISKAMNN-KEVA-------YSQPVIG----SMSANPGFKTKPSIRNNQDLHGVLKKFGGNIILNAIINTRSNQVSM 106 (551) T ss_pred cHHHHHHhhcc-Ccce-------eeccccc----ceecCcccccCccccChhHHHHHHHHhhcCHHHHHHHHHHHHHHhh Confidence 33333333221 0000 0111100 0000001110 123345444 45699999999999999987653 Q ss_pred ---------C--CCEEecC--CCcchhhhhHHHHHHHHHHHHHhhH--------HHHHHHHHH-hhcccCceEEEEEecC Q lcl|NC_016762. 77 ---------T--NPQVIEG--DDQDRSKDETEWERKNKPLIAGGRF--------WRAVSEADR-RRLVGRYSGLLLHIRD 134 (456) Q Consensus 77 ---------~--~~~i~~~--~~~d~~~~~~~~e~~i~~~~~~l~~--------~~~~~ea~~-~~r~~Ggs~i~i~i~D 134 (456) + ++.|... +.....+....+ +.++..+.+.+. +..|.+.+- .-.++|.+++.+.- | T Consensus 107 ~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~-~~i~~~l~~pn~~~~p~~~s~~~f~~~lv~dlll~Gnay~~i~r-d 184 (551) T protein:vir:80 107 YCKPARHSEKGVGFEVRLKDLDKKPTSHDEATI-KRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVF-N 184 (551) T ss_pred hhhhhhhhcCCCCceEEecccCcccChhHHHHH-HHHHHHHHhcCCCCCCccchHHHHHHHHHHHHHhcCCEEEEEEE-C Confidence 2 2344222 111112222222 124555555442 233444443 45667777665433 2 Q ss_pred CCCccccccCCcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC------ Q lcl|NC_016762. 135 SQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW------ 208 (456) Q Consensus 135 ~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~------ 208 (456) .. +.+..|.|+....+.+. .++..-..-.+..|... .+| +....+.++.||||... T Consensus 185 ~~----------G~~~~L~~l~p~~V~v~---~~~~g~~~~~~~~y~~~---~~g--~~~~~~~~~eiiH~~~n~~~~~~ 246 (551) T protein:vir:80 185 RN----------QSMVRFVAKDPTTIFFA---TTADGKIPDNGNRFVQV---IDQ--KIVATFNAREMAFAVRNPRSDIY 246 (551) T ss_pred CC----------CcEEEEEEeCCceeEEE---ECCccccccCceEEEEE---eCC--cEEEEEcccceEEecccCCCCcc Confidence 11 11333444433333321 11111111112233221 111 11345777788887432 Q ss_pred -cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC--- Q lcl|NC_016762. 209 -TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG--- 283 (456) Q Consensus 209 -~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~--- 283 (456) ...|.|-++.+.+.+..... +..+...+|++..+.-. +.+..... + . .+..+++.+.+....++ T Consensus 247 ~~~~G~spi~~a~~~i~~~~a-~~~~~~~~f~Ng~~p~giL~~~~~~~---l------t-~e~~~~lk~~~~~~~~G~~n 315 (551) T protein:vir:80 247 ATGYGYPELEIALKQFIAHEN-TEAFNDRFFSHGGTTRGILQIKAAQQ---Q------S-QHALEIFKREWKNSLSGING 315 (551) T ss_pred cccccccHHHHHHHHHHHHHH-HHHHHHHHHHcCCCcceEEEEcCCCC---C------C-HHHHHHHHHHHHHHhcCccc Confidence 23499999999887765544 34455567776543221 11110000 1 1 23334444444443332 Q ss_pred CCe-EEec-CCCceeEEecccCCH--HHHHHHHHHHHHhhhcCCeEEeecc-CCCcccch--H--HHHHHHHHHHHHHHh Q lcl|NC_016762. 284 NDV-LLPT-QGATVTQMVSAVSDP--GPTYNVNLQTAAAGVDIPTKILVGM-QTGERASS--E--DQKYHNARCQARRVQ 354 (456) Q Consensus 284 ~~~-~lid-~~d~~~~~~~~~sgl--~~~~~~~~~~~aaas~IP~t~L~G~-sp~Glnst--~--D~~nyyd~I~~~Qe~ 354 (456) -+. .++. .+-+|+.++.+..+. -+......+.||.+.|||-. ++|. .-++..++ + -..|+-....+.-+. T Consensus 316 ag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~-~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~ 394 (551) T protein:vir:80 316 SWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPA-EINIPNNGGATGSKGGSLNEGNSAEKNQASKNK 394 (551) T ss_pred cCccccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHH-HcCcccccccccccccccchhhHHHHHHHHHHH Confidence 233 3443 344777777665543 34455577889999999986 4453 22211111 1 111222222222233 Q ss_pred hhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccC-CCCC Q lcl|NC_016762. 355 ELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDP-LQGG 433 (456) Q Consensus 355 ~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~-~~~~ 433 (456) -|.|.+.++-..|-+.-+......+.|+|+-+...+.++++++. ..+..| ++|++|+|+..+++| .+++ T Consensus 395 tL~P~~~~ie~~ln~~L~~~~~~~~~f~f~~~~~~~~~~~~~~~--------~~~~~g--~lT~NE~R~~~gl~P~~egG 464 (551) T protein:vir:80 395 GLQPLLGFIEDFINKHIVAEFGDKYTFQFVGGDIKSELESVKIL--------AEKAKV--AMTVNEVRKELNLPGDVIGG 464 (551) T ss_pred HHHHHHHHHHHHHHhhhccccCCceEEEeeccChhhHHHHHHHH--------HHHhcC--CcCHHHHHHHhCCCCCCCCC Confidence 57787776655554332222234689999988877766665432 233446 899999999999987 3443 Q ss_pred CCCc---------------ccC---------------CCCCCCCCcCCCCCCC Q lcl|NC_016762. 434 DPLP---------------DTE---------------PEDEDAARTDPTGEQQ 456 (456) Q Consensus 434 ~~~~---------------~~~---------------~~d~~~~~~d~~~~~e 456 (456) +..- ..+ ..+.+++++++..+.+ T Consensus 465 D~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~ 517 (551) T protein:vir:80 465 DIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKD 517 (551) T ss_pred ceeecccccccccccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCccc Confidence 3210 000 0001111111111111 No 33 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=99.69 E-value=3.2e-16 Score=105.50 Aligned_cols=405 Identities=12% Similarity=0.111 Sum_probs=187.3 Q ss_pred CCch--hHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhh-- Q lcl|NC_016762. 1 MTDK--LDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWK-- 76 (456) Q Consensus 1 ~~~~--~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR-- 76 (456) |..+ ++.+++. ...+...-..+..++..|.++.. .+...-+...+...|..+.++++||++.++.+.+ T Consensus 41 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~ 112 (574) T protein:vir:80 41 PYSMESIEKGMNG-KTTAYMQPIIGEMSVNPGYKTKP-------SIRNSQDLHKTLKKFGNNIILNAIINTRSNQVSMYC 112 (574) T ss_pred CCCHHHHHHhHhh-hcccccchhhhhccccccccCcC-------ccCCcccHHHHHHhhccChhHHHHHHHHHHHHHHHH Confidence 3333 2222221 11111111222223223332210 0111223456677889999999999998876643 Q ss_pred ---------CCCEEecCCC--cchhhhhHHHHHHHHHHHHHhh--------HHHHHHHHHHh-hcccCceEEEEEecCCC Q lcl|NC_016762. 77 ---------TNPQVIEGDD--QDRSKDETEWERKNKPLIAGGR--------FWRAVSEADRR-RLVGRYSGLLLHIRDSQ 136 (456) Q Consensus 77 ---------~~~~i~~~~~--~d~~~~~~~~e~~i~~~~~~l~--------~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~ 136 (456) =.++|...+. ....+.... ...+.+++.+.. -+..|.+.+-+ -.++|.+++.+. .++. T Consensus 113 ~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~-~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~-r~~~ 190 (574) T protein:vir:80 113 KPARNSETGVGYEIRLKDIEAEPTSHDIAN-IKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYDQVNFEKV-FDKD 190 (574) T ss_pred HHHHhhhccCceEEEEeccCCCccchhhhh-hhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcCCeEEEEE-ECCC Confidence 1234432221 111111111 112444443211 12234554444 456777766543 3221 Q ss_pred CccccccCCcCceeEEEEeccccCChhhhhcccccc-ccCCceeEEEeecccCCccccceeeehhhhheecCC------- Q lcl|NC_016762. 137 PWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSE-TYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW------- 208 (456) Q Consensus 137 ~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~-~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~------- 208 (456) +.+..|.|+....+.+.. |.... ..+.+.+|++. +|. ....+.++.||||... T Consensus 191 ----------G~~~~L~pl~p~~V~v~~---d~~~~~~~~~~~y~~~~----~g~--~~~~~~~~eiih~~~~~~~~~~~ 251 (574) T protein:vir:80 191 ----------GNFIKFDTVDPTTIFLAT---NGEGKLIKNGERFVQVI----DNR--IVAKFNERELAFAVRNPRADIEV 251 (574) T ss_pred ----------CcEEEEEEEcCceeEEEE---cCccccccCceEEEEEe----CCc--eEEEEccccEEEEeccCCCCccc Confidence 113344444433333321 11111 11123455543 111 1244667777776321 Q ss_pred cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhc---CC Q lcl|NC_016762. 209 TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNR---GN 284 (456) Q Consensus 209 ~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~---~~ 284 (456) ...|.|.++.+.+.+.....+ ...+..+|++....-. +++.. + ..-.++..+++.+.+....+ |. T Consensus 252 ~~~G~spi~~a~~~i~~~~~a-~~~~~~~f~ng~~p~gil~~~~--~--------~~ls~e~~~~lk~~~~~~~~G~~n~ 320 (574) T protein:vir:80 252 GQYGYPELEIALKQFIAHENT-EVFNDRFFSHGGTTRGILHVKT--G--------QQQSQQALDIFRREWRSSLAGINGS 320 (574) T ss_pred ccccccHHHHHHHHHHHHHHH-HHHHHHHHhccCCCceEEEeCC--C--------CCCCHHHHHHHHHHHHHHhcccccc Confidence 235899999998877655444 3444456665432111 11100 0 00112334444444444333 22 Q ss_pred Ce-EEe-cCCCceeEEecccCCH--HHHHHHHHHHHHhhhcCCeEEeecc-CCCcccchHH----HHHHHHHHHHHHHhh Q lcl|NC_016762. 285 DV-LLP-TQGATVTQMVSAVSDP--GPTYNVNLQTAAAGVDIPTKILVGM-QTGERASSED----QKYHNARCQARRVQE 355 (456) Q Consensus 285 ~~-~li-d~~d~~~~~~~~~sgl--~~~~~~~~~~~aaas~IP~t~L~G~-sp~Glnst~D----~~nyyd~I~~~Qe~~ 355 (456) +. .++ +.+-+|+.++.+..+. -+........||.+.+||-. ++|. +.+.+.+++. ..|.-..-.+..+.- T Consensus 321 g~~~vl~~~G~~~~~l~~s~~D~qfle~~~~~~~~Ia~afgVPp~-~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~t 399 (574) T protein:vir:80 321 WQIPVVSAEDVKFVNMTPSANDMQFEKWLNYLINVISALYGIDPA-EINFPNNGGATGSKGGSLNEGNSKEKMQASQNKG 399 (574) T ss_pred ccceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHH-HhcccccccccccccccccchhHHHHHHHHHHHH Confidence 22 344 4566888887666543 45555677899999999997 5553 3333333321 222222233334445 Q ss_pred hhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCC Q lcl|NC_016762. 356 LTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDP 435 (456) Q Consensus 356 lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~ 435 (456) |.|.+.++-..|-+.-+-.....+.|+|+...-.+..+++.+ ..++..| +++++|+|+..+++|+++++. T Consensus 400 L~P~~~~ie~~ln~~Ll~~~~~~~~~~f~~~d~~~~~~~~~~--------~~~~~~G--~lT~NE~R~~lgl~Pi~gGD~ 469 (574) T protein:vir:80 400 LQPLLRFIEDTVNTYIVAEFGEKYQFQFRGGDLSAQLDKLKI--------IEQEGKV--FRTVNEIRHDKGLEPIKGGDV 469 (574) T ss_pred HHHHHHHHHHHHHhhhhhhcCCceEEEecccchhhHHHHHHH--------HHHHhCC--ccCHHHHHHHhCCCCCCCCCE Confidence 778777766555433222223467888887665444333322 2345566 999999999999999876543 Q ss_pred Ccc---------c------C---------------CCCCCCCCcCCCCCCC Q lcl|NC_016762. 436 LPD---------T------E---------------PEDEDAARTDPTGEQQ 456 (456) Q Consensus 436 ~~~---------~------~---------------~~d~~~~~~d~~~~~e 456 (456) .-. . + ..+.+.++++...+.+ T Consensus 470 ~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~ 520 (574) T protein:vir:80 470 ILNGVHIQAIGQALQEEQLEYQRSQDRLNRLLELSGGDVEQPEPEEPKDSQ 520 (574) T ss_pred eeeccceeecccccccccCCccchhccccccccccCCCCCCCCCCCCCCcc Confidence 200 0 0 0000111111111111 No 34 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=99.69 E-value=1e-16 Score=108.31 Aligned_cols=375 Identities=14% Similarity=0.081 Sum_probs=187.4 Q ss_pred HHHhh-hhhccCcccchh----h----hhccCc-ccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhh Q lcl|NC_016762. 22 MSLLN-QGIGHDAKRPQA----W----CEYGFP-QEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSK 91 (456) Q Consensus 22 d~~~n-~~~~~gt~~~~~----~----~~~~~~-~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~ 91 (456) |+|.. ++.+-..++... + ..+.++ ...+.+ -+.++..+.++|+++|+++-.--+.+...++....+ T Consensus 1 Mgl~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~----~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~ 76 (409) T protein:vir:84 1 MSLFTRIFSGPSEERTLTKISGIPSPAEDWAMHGDRPGAN----SAMTLGAFYACVTLLADTVASLSIDAYRKKDNVRIP 76 (409) T ss_pred CchhhhhhcCCCcccccccccccccccchhhccCcccchh----hhhccHHHHHHHHHHHHhhhhCceEEEEecCCcccc Confidence 44443 222111122110 0 001111 122221 234577899999999999977666665433221111 Q ss_pred hhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhcccc Q lcl|NC_016762. 92 DETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPD 170 (456) Q Consensus 92 ~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~ 170 (456) ...+...|...-....-+..|.+++.+ -.++|-++++|..++.. +.+..+.|+....+.+.. ..|.. T Consensus 77 -~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~----------g~~~~L~~l~p~~v~v~~-~~~~~ 144 (409) T protein:vir:84 77 -VSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEA----------NRPTAIMPIHPDCIHVTD-AKDED 144 (409) T ss_pred -cchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCC----------CceEEEEEEcCceeEEEE-cCCCc Confidence 111111121111122234455555554 55667777777654311 112233333322222211 11211 Q ss_pred ccccCCceeEEEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh Q lcl|NC_016762. 171 SETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLL 246 (456) Q Consensus 171 s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~ 246 (456) ... +|.+. . ..++.++++.||||.... ..|.|-++.+.+.+.....+ ......+|++..+.-. T Consensus 145 ~~~-----~~~~~--~-----~~g~~~~~~dvih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~-~~~~~~~f~ng~~p~g 211 (409) T protein:vir:84 145 GDW-----IEPVY--R-----IDGKVVPNHRIMHIKRYPVAGCALGMSPIEKAASAIGLGLAA-ERYGLRWFRDSANPSG 211 (409) T ss_pred ceE-----EEEEe--c-----CCceEEchhhEEEecCCCCCcccccccHHHHHHHHHHHHHHH-HHHHHHHHhcCCCccE Confidence 111 11111 1 124678999999986433 35889999888766554443 3344455555432211 Q ss_pred -hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC-CeEEecCCCceeEEecccCC--HHHHHHHHHHHHHhhhcC Q lcl|NC_016762. 247 -LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGN-DVLLPTQGATVTQMVSAVSD--PGPTYNVNLQTAAAGVDI 322 (456) Q Consensus 247 -~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~lid~~d~~~~~~~~~sg--l~~~~~~~~~~~aaas~I 322 (456) ++.... + . ++..+++.+......+|. ..++++.+.+|++++.+..+ +-+......++||.+.|| T Consensus 212 il~~~~~-----l------~-~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgV 279 (409) T protein:vir:84 212 ILSSDAD-----L------T-PDQVKQTQKQWIQSHHNRRLPAVMSAGIKWQSVSITPNESQFLETRSFQRSEIAMWFRI 279 (409) T ss_pred EEecCCC-----C------C-HHHHHHHHHHHHHHhccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCC Confidence 111111 1 1 122333333333333444 35666667789998877654 344455677899999999 Q ss_pred CeEEeeccC-CCcccch--H-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHH Q lcl|NC_016762. 323 PTKILVGMQ-TGERASS--E-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANS 398 (456) Q Consensus 323 P~t~L~G~s-p~Glnst--~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~ 398 (456) |..+ +|.. .+..+++ + -..+||..+ |.|.++.+-..|-+. + +....|.|.++.|...|.++++ T Consensus 280 Pp~~-lg~~~~~~~~~sn~e~~~~~f~~~~-------l~P~~~~ie~~l~~~-L-~~g~~i~fd~~~l~~~d~~~~~--- 346 (409) T protein:vir:84 280 PPHM-IGDVEKSTSWGTGIEEQGINFVRHT-------LLPWLRCIEQALDTF-L-PRGQFVKFNVDGLMRGDVTARF--- 346 (409) T ss_pred CHHH-hCCCCCcccccchHHHHHHHHHHHH-------HHHHHHHHHHHHHHh-c-cCCCeEEEechhhhccCHHHHH--- Confidence 9875 4543 3333222 3 345666443 677776665554322 2 2233467778899888888765 Q ss_pred HHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc--------ccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 399 KTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP--------DTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 399 ~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~--------~~~~~d~~~~~~d~~~~~e 456 (456) ++...++..| ++++||+|+..+++|+++++..- +..+..+...++.|.+..+ T Consensus 347 ----~~~~~~~~~G--~~t~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~ 406 (409) T protein:vir:84 347 ----TAYQMGLQNG--IWSVNEVRAWEDAPPIPEGDIHLQPMNFVPLGYVPPEEPAQEPQPNSATE 406 (409) T ss_pred ----HHHHHHHhCC--CcCHHHHHHHhCCCCCCCcceeeecccccccccCCccccCcCCCCCCccC Confidence 4556677778 99999999999999987754311 1111111111222222222 No 35 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=99.69 E-value=2.6e-17 Score=111.52 Aligned_cols=379 Identities=9% Similarity=0.021 Sum_probs=192.3 Q ss_pred HHHhhhhhccCcccchhhhh----c----cC-cccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhh- Q lcl|NC_016762. 22 MSLLNQGIGHDAKRPQAWCE----Y----GF-PQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSK- 91 (456) Q Consensus 22 d~~~n~~~~~gt~~~~~~~~----~----~~-~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~- 91 (456) |.|.++.-+--+.+-..|.. + .. ....+.+. ..++..+++||+++|++.-.--+.+....++...+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~----al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~~~~ 76 (419) T protein:vir:57 1 MFIPQFWKGRPSENRVNWQVVPGGMRSSSSQAGVIITPET----ALALSAVRACVTLLAESVAQLPCVLYRRTENGGREI 76 (419) T ss_pred CcchhhhccCCccccccccccccccccccccCCceechHH----hhccHHHHHHHHHHHHhhccCceEEEEEcCCCceec Confidence 44444332221111001110 0 00 01223221 23467889999999999876655654333221111 Q ss_pred -hhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccc Q lcl|NC_016762. 92 -DETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKP 169 (456) Q Consensus 92 -~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp 169 (456) ....+...+...-....-+..|.+.+.. -.++|-+++++. +++. +.+..+.|+....+++.. + T Consensus 77 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~-r~~~----------G~~~~L~pl~~~~v~v~~---~- 141 (419) T protein:vir:57 77 AFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLID-RNGR----------GDITELIPINPHKVIVLK---G- 141 (419) T ss_pred cccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEE-ECCC----------CcEEEEEEEcCcceEEEE---C- Confidence 1111111122111122233445554444 445676666553 2321 123444454443333311 1 Q ss_pred cccccCCceeEEEeecccCCccccceeeehhhhheecCC---cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh Q lcl|NC_016762. 170 DSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW---TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLL 246 (456) Q Consensus 170 ~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~---~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~ 246 (456) ..|.+ +|.+.. .+..++.+.|+|+... ...|.|.++.+...+..... +......++++..+.-. T Consensus 142 ---~~g~~-~y~~~~--------~~~~~~~~~vih~r~~~~d~~~G~s~i~~~~~~i~~~~~-~~~~~~~~f~ng~~p~g 208 (419) T protein:vir:57 142 ---PDGMP-YYDIPS--------IGEILPMRMVHHIKSFSLDGYIGTSPIQTNPDVLGLGIA-VEQHAAQVFARGTTMSG 208 (419) T ss_pred ---CCceE-EEEEcC--------CceEEchhhEEEecCcCCCCcccccHHHHHHHHHHHHHH-HHHHHHHHHHccCCccE Confidence 12333 455531 2345777888877532 34589999998876654433 34444455665433211 Q ss_pred -hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhc---C-CCeEEecCCCceeEEecccCCH--HHHHHHHHHHHHhh Q lcl|NC_016762. 247 -LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNR---G-NDVLLPTQGATVTQMVSAVSDP--GPTYNVNLQTAAAG 319 (456) Q Consensus 247 -~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~---~-~~~~lid~~d~~~~~~~~~sgl--~~~~~~~~~~~aaa 319 (456) ++.....+ +...++..+++.+.+....+ + ...++++.+-+|+.++.+.... -+......++||++ T Consensus 209 il~~~~~~~--------~~~~~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~ 280 (419) T protein:vir:57 209 VIERPFEAK--------AIASQAAVDAILAKWTERYGGVRNAFSVGMLQEGMTYKQLSQDNEKAQLLQSRQYTVNEVCRL 280 (419) T ss_pred EEEecCcCC--------cccCHHHHHHHHHHHHHHhccccccccceecCCCceEEEcCCChhhHHHHHHHHHHHHHHHHH Confidence 11110000 01112333344333332222 2 2456667777898888766544 45566677899999 Q ss_pred hcCCeEEeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCC-Cc--eEEEeCCCCCCCHHHHH Q lcl|NC_016762. 320 VDIPTKILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLK-AE--FTAIWDDLTVPTKAERL 395 (456) Q Consensus 320 s~IP~t~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~-~d--~~~~f~pL~~~seke~A 395 (456) .|||...|-+...+..++.+ -...||..+ |.|.++.+-+.|-+.-+.+.. .+ |.|.+..|...|.++++ T Consensus 281 fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~-------l~P~~~~ie~~l~~~ll~~~~~~~~~i~fd~~~ll~~d~~~~~ 353 (419) T protein:vir:57 281 YKVPPHMIQDLQKSTNNNIEHQGLQYVIYT-------MLAILKRHESAMMRDLLLPSERRDFYIEFNVSSLLRGDQKSRY 353 (419) T ss_pred hCCCHHHhCCCCCCccccHHHHHHHHHHHH-------HHHHHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHH Confidence 99998766444434344333 456677543 788888776666554333211 23 45556688888888876 Q ss_pred HHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcc--c-C----CCCCCCCCcCCCCCCC Q lcl|NC_016762. 396 ANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPD--T-E----PEDEDAARTDPTGEQQ 456 (456) Q Consensus 396 ei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~--~-~----~~d~~~~~~d~~~~~e 456 (456) +. ...+++.| ++|+||+|+..+++|+++++..-- . . ..+...+.++...+.| T Consensus 354 ~~-------~~~~~~~G--~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~ 412 (419) T protein:vir:57 354 ES-------YALGRQWG--WLSVNDIRRMENLTPIPGGDKYLTPLNMVDSKALTGIGKATPQQLKDIE 412 (419) T ss_pred HH-------HHHHHhCC--CcCHHHHHHHhCCCCCCCcCeeeeccccccccccccccCCCcccCcchh Confidence 64 44567777 999999999999999877654311 0 0 1111223333333333 No 36 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=99.67 E-value=1.5e-16 Score=107.40 Aligned_cols=393 Identities=14% Similarity=0.086 Sum_probs=200.0 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCc-ccchhhhhc-cCc----ccCCHHHHHHHHhcCchhhhhhccchhHH Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDA-KRPQAWCEY-GFP----QEITFNDLYTMYRRGGIAHGAVEKIVTTC 74 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt-~~~~~~~~~-~~~----~~~~~~~l~~~Y~~~~l~r~iVd~~aed~ 74 (456) |++.|..++..|.+.. +-++.+.+-..-+ ..+..|..| |-+ ..++.+ .+.+++.+.+||+++|+++ T Consensus 1 ~~~~l~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~~----~al~~~~V~~~i~~ia~~i 72 (434) T protein:vir:43 1 MSKSLGKVLSSATSAP----RSSLFGWGGKTIRLTDGAFWSQFLGRESSSGKKVTVD----KAMKLSAVWACVRLISTSV 72 (434) T ss_pred Cccchhhhhhhccccc----chhhhcccccccccCchHHHHHHhcCCccCCceechh----hhhccHHHHHHHHHHHHhh Confidence 9999988766433321 1122221110000 111223333 111 122322 2346788899999999999 Q ss_pred hhCCCEEecCCCcchhh--hhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeE Q lcl|NC_016762. 75 WKTNPQVIEGDDQDRSK--DETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAK 151 (456) Q Consensus 75 tR~~~~i~~~~~~d~~~--~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~ 151 (456) -.--+.+...+.+.... ....+-+.+...=....-+..|.+.+-+ -.++|-++++|.-.+|+ +.. T Consensus 73 a~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~~G~------------~~~ 140 (434) T protein:vir:43 73 AGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRAAGR------------PAA 140 (434) T ss_pred hhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeCCCc------------EEE Confidence 77666664332211111 1111111121111112223344444444 45667777665322221 233 Q ss_pred EEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc---CCCcchHHHHHHHHHHHHH Q lcl|NC_016762. 152 VTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT---GDAIGFLEPAYNSFISLEK 228 (456) Q Consensus 152 i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~---~~G~S~le~~~~~l~~~~~ 228 (456) +.|+-...+++. .|. .|.+.|+... .+ +..+.++++.|+||...+ ..|.|.++.+.+.+..... T Consensus 141 L~~l~p~~v~~~---~~~----~g~~~y~~~~---~~---g~~~~~~~~eVih~~~~~~dg~~G~spi~~~~~~i~~~~~ 207 (434) T protein:vir:43 141 LDFLLPSRVDLE---CDE----NGRLKYFYTT---KK---GARREIERTNMLHIPAFTLDGRIGLSAIRYGVDVFGSVMS 207 (434) T ss_pred EEEEcCcceEEE---EcC----CCeEEEEEEe---cC---ceEEEEccccEEEecCcCCCCccccCHHHHHHHHHHHHHH Confidence 444433333321 122 2444444332 12 234678999999885433 3589999999887655443 Q ss_pred HHHHHHHHHHHHhhhhh-hhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC---CCeEEecCCCceeEEecccC- Q lcl|NC_016762. 229 VEGGSGESFLKNAARQL-LLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG---NDVLLPTQGATVTQMVSAVS- 303 (456) Q Consensus 229 ~~~~~~~~~~~~~~~~l-~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~lid~~d~~~~~~~~~s- 303 (456) + ..++..++++..+.- .++....+ ..+..+++.+.++.+... .+.++++.+.+|+.++.+.. T Consensus 208 ~-~~~~~~~f~ng~~~~gil~~~~~l------------~~e~~~~~r~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d 274 (434) T protein:vir:43 208 A-EDAANGTFKNGLLPTVAFKVDRIL------------QPAQREEFREYVKSVSGAMNSGRSPVLEQGITPETIGINPVD 274 (434) T ss_pred H-HHHHHHHHhccCCcceEEecCCCC------------CHHHHHHHHHHHHHhcCccccCCccccCCCceEEEccCChhH Confidence 3 344445666643322 12211111 122334444444444322 23567777788999987765 Q ss_pred -CHHHHHHHHHHHHHhhhcCCeEEeeccCCCccc-ch--H-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCC-C Q lcl|NC_016762. 304 -DPGPTYNVNLQTAAAGVDIPTKILVGMQTGERA-SS--E-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLK-A 377 (456) Q Consensus 304 -gl~~~~~~~~~~~aaas~IP~t~L~G~sp~Gln-st--~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~-~ 377 (456) .+-+......++||.+.|||-. ++|...++=+ ++ + -...||. .-|.|.+..+-..|-+.-+.+.. . T Consensus 275 ~q~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~~~s~~e~~~~~f~~-------~~L~P~~~~ie~~ln~kL~~~~~~~ 346 (434) T protein:vir:43 275 AQLLETREHGVIEICRWFGVPPW-MIGQTDKGSNWGTGLEQQMLAFLT-------FSISSITNQIQQCVNKRLLTAPERI 346 (434) T ss_pred HHHHHHHHHHHHHHHHHhCCCHH-HhCCCcCCccccchHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCChhhhc Confidence 4456677788999999999976 5576544322 22 2 2344553 34788887776555443222211 1 Q ss_pred c--eEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCC---------cccCCCC--- Q lcl|NC_016762. 378 E--FTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPL---------PDTEPED--- 443 (456) Q Consensus 378 d--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~---------~~~~~~d--- 443 (456) . +.|.+..|...|.++++ ++..+++..| ++++||+|+..+++|+++++.. +...+.+ T Consensus 347 ~~~~~fd~~~llr~d~~~r~-------~~~~~~~~~G--~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~ 417 (434) T protein:vir:43 347 RYYAEFSLEGFLKADSAGRA-------AWYSTMAQNG--FMTRNEGRRKENLPELPGGDILTVQSNLVPIDQLGQSNKSQ 417 (434) T ss_pred CceEEEechhhhccCHHHHH-------HHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCeEeeccCccchhhhhccCCCc Confidence 3 45555588888887764 4555677777 9999999999999998765432 1111100 Q ss_pred -CCCCCcCCCCCCC Q lcl|NC_016762. 444 -EDAARTDPTGEQQ 456 (456) Q Consensus 444 -~~~~~~d~~~~~e 456 (456) .........+++| T Consensus 418 ~~~~~~~~~~~~~~ 431 (434) T protein:vir:43 418 AVRAALMNWFSQPE 431 (434) T ss_pred chhhhhhccCCCCC Confidence 0111111111222 No 37 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=99.66 E-value=2.8e-16 Score=105.85 Aligned_cols=380 Identities=11% Similarity=0.018 Sum_probs=194.0 Q ss_pred HHHhhhhhccCcccch-------hhhh-ccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhh-h Q lcl|NC_016762. 22 MSLLNQGIGHDAKRPQ-------AWCE-YGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSK-D 92 (456) Q Consensus 22 d~~~n~~~~~gt~~~~-------~~~~-~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~-~ 92 (456) |++.+...++..++.. .+.+ +|-+ ..+.. ...++.-+.+||+++|+++-.--+.+...+++...+ . T Consensus 1 MG~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~-~~~~~----~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~ 75 (411) T protein:vir:81 1 MGWWSRLTRFFRPRNETVDMTNPLLLQWLGVD-PDTPR----NQLSEATYFACLKILSESLGKLPLKMYQKTERGIVKSD 75 (411) T ss_pred CchHHHHHhhccCcccccccchHHHHHHhcCc-ccChh----hhhccHHHHHHHHHHHHhHhhCceeEEEecCCceeeec Confidence 4444433333322211 1111 1111 12221 123567789999999999987777775333221111 1 Q ss_pred hHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccc Q lcl|NC_016762. 93 ETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDS 171 (456) Q Consensus 93 ~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s 171 (456) ...+...++..=....-+..|.+.+.+ -.++|.+++++.- ++.. +..+.|+-...+++. .|... T Consensus 76 ~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r-~~g~-----------~~~l~~l~~~~v~~~---~~~~~ 140 (411) T protein:vir:81 76 REELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQY-SGPQ-----------LQALWILPSQYVTIV---VDDRG 140 (411) T ss_pred ccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEe-cCCc-----------eEEEEEECCceEEEE---EcCcc Confidence 112212222111122234455555554 4566877777654 3221 222333333323321 11111 Q ss_pred cccC-CceeEEEeecccCCccccceeeehhhhheecC----CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-h Q lcl|NC_016762. 172 ETYG-QPTMWEYTEASQAGRPGLVRDIHPDRVFILGD----WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQ-L 245 (456) Q Consensus 172 ~~yg-~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~----~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~-l 245 (456) ..+. ..-+|.+.. ..+ +....++++.||||.. ....|.|.++.+.+.+.....+. .....++++..+. . T Consensus 141 ~~~~~~~~~~~~~~-~~~---g~~~~~~~~eiih~k~~~~~~~~~G~s~~~~~~~~i~~~~~~~-~~~~~~f~ng~~p~g 215 (411) T protein:vir:81 141 LLGEKNAIWYRYND-PYD---GKMYVFRNDEILHFKTSVTFDGITGLSVRDVLKHTVDGALESQ-KFMNNLYKTGLTGKA 215 (411) T ss_pred cccccceEEEEEEe-cCC---ceEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHH-HHHHHHHhccCCCce Confidence 1111 122344431 112 2345689999998842 23469999999988766554443 3344455553321 1 Q ss_pred hhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEEecccCC--HHHHHHHHHHHHHhh Q lcl|NC_016762. 246 LLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQMVSAVSD--PGPTYNVNLQTAAAG 319 (456) Q Consensus 246 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~~~~~sg--l~~~~~~~~~~~aaa 319 (456) .++.... + .++..+++.+++..+.++ ...++++.+-+|++++.+..+ +-+......++||++ T Consensus 216 il~~~~~-----l-------~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~ 283 (411) T protein:vir:81 216 VLEYTGD-----L-------NQEARDRLVKGFEQFANGSKNAGKIIPVPLGMKLVPLDIKLTDSQFFELKKYTALQIAAA 283 (411) T ss_pred EEEeCCC-----C-------CHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHH Confidence 1111111 1 122334444444443332 234666677789998876643 345566778899999 Q ss_pred hcCCeEEeeccCCCc-ccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCc----CCCCceEEEeCCCCCCCHHH Q lcl|NC_016762. 320 VDIPTKILVGMQTGE-RASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVV----PLKAEFTAIWDDLTVPTKAE 393 (456) Q Consensus 320 s~IP~t~L~G~sp~G-lnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~----~~~~d~~~~f~pL~~~seke 393 (456) .|||...| |...++ .+..+ -..+||.. -|.|.++.+-+.|-+.-+. ...-.|.|.+..|...|.++ T Consensus 284 fgVPp~~l-g~~~~~t~~n~e~~~~~f~~~-------~l~P~~~~ie~~l~~~ll~~~~~~~~~~~~fd~~~ll~~d~~~ 355 (411) T protein:vir:81 284 FGIKPNQI-NDYEKSSYASAEAQNLAFYVD-------TLLYVLKQYEEEITYKILSNDLISQGHYFKFNVNVILRADIKT 355 (411) T ss_pred hCCCHHHh-CCCCCCCchhHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCChhhcCCCcEEEeechhhhccCHHH Confidence 99998755 654433 22222 24456543 4788887777666443222 22234677778888888887 Q ss_pred HHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc---ccCCCCCCCCCcCCCCCC Q lcl|NC_016762. 394 RLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP---DTEPEDEDAARTDPTGEQ 455 (456) Q Consensus 394 ~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~---~~~~~d~~~~~~d~~~~~ 455 (456) ++ ++...++..| +++++|+|+..+++|+++++..- .-.+-+.-.+....++|. T Consensus 356 ~~-------~~~~~~~~~g--~~t~NE~R~~~gl~p~~ggD~~~~~~n~~pl~~~~~~~~kgGd~ 411 (411) T protein:vir:81 356 QM-------DSLSTAVQNG--IMTPNEARDYLDMPADDYGNNLMANGNYIPLSMLGANYGKGGDS 411 (411) T ss_pred HH-------HHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCeeeeccCccchhhhhhhhccCCCC Confidence 64 4456677777 99999999999999887654321 111111111111234444 No 38 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=99.65 E-value=2.8e-16 Score=105.88 Aligned_cols=359 Identities=11% Similarity=-0.000 Sum_probs=192.0 Q ss_pred HHHhhhhh-ccC------cccchhhhhcc---C-cccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchh Q lcl|NC_016762. 22 MSLLNQGI-GHD------AKRPQAWCEYG---F-PQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRS 90 (456) Q Consensus 22 d~~~n~~~-~~g------t~~~~~~~~~~---~-~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~ 90 (456) |++.|... ... +..+..+.... . ....+.. .|.++..+++||+++|+++-+--+++..... T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~----~al~~~~v~~~i~~ia~~ia~~p~~v~~~~~---- 72 (385) T protein:vir:10 1 MGLLTPRNFNKRKAKNMVYPSNPAFFTTTVGGMQLSYVSAL----SALQNTNVYSVINRIASDVASAHFKTENTAT---- 72 (385) T ss_pred CccccchhcccccccccccccchhhhhhhccccCccccCHH----HhhccHHHHHHHHHHHHHHhhCceeeeccch---- Confidence 44443211 000 01111111110 1 1122322 2456788999999999999887777632110 Q ss_pred hhhHHHHHHHHHHHHHhhHHHHHHHHHHhhc-ccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccc Q lcl|NC_016762. 91 KDETEWERKNKPLIAGGRFWRAVSEADRRRL-VGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKP 169 (456) Q Consensus 91 ~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r-~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp 169 (456) .. +...-..+.-+..|.+.+.+.+ ++|-+++++ ++|. ..+.|.....+.+ ..|. T Consensus 73 ------~~-ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i-~r~~--------------~~~~p~~~~~v~~---~~~~ 127 (385) T protein:vir:10 73 ------LN-RLESPSSLIGRFSFWQGALMQLCLSGNDYIPL-VGQN--------------LEHIPNSDVQINY---LPGN 127 (385) T ss_pred ------hh-hhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEE-EcCc--------------eeEeecCCceEEE---EEcC Confidence 01 1111113334556666666655 567777665 3321 1123333322221 1121 Q ss_pred cccccCCceeEEEeecccCCccccceeeehhhhheecCCc------CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_016762. 170 DSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT------GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAAR 243 (456) Q Consensus 170 ~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~------~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~ 243 (456) . ..+|.+... + .+..+.++++.+|||...+ ..|.|.++.+...+.....+ ......++++..+ T Consensus 128 ~------~~~~~~~~~--~--~~~~~~~~~~eiihik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~-~~~~~~~~~ng~~ 196 (385) T protein:vir:10 128 M------GIVYTVLES--N--DRPQMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKA-SKSNMSAMENQIN 196 (385) T ss_pred C------ceEEEEEEc--C--CceEEEEccccEEEeccCCCCcccccccccHHHHHHHHHHHHHHH-HHHHHHHHhccCC Confidence 1 223444321 1 1224678999999885322 24899999998866544433 3444455565433 Q ss_pred hhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC---CeEEecCCCceeEEecccCCHHH---HHHHHHHHHH Q lcl|NC_016762. 244 QLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGN---DVLLPTQGATVTQMVSAVSDPGP---TYNVNLQTAA 317 (456) Q Consensus 244 ~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~lid~~d~~~~~~~~~sgl~~---~~~~~~~~~a 317 (456) .-.+..... +...++..+++.+.++.+.++. ..++++.+.+|+.++.+.....- ......++|| T Consensus 197 ~~gil~~~~----------~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia 266 (385) T protein:vir:10 197 PAGKLTISN----------YLSDGKDLESAREEFEKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQIS 266 (385) T ss_pred cceEEEeCC----------CCCCHHHHHHHHHHHHHHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHH Confidence 221111000 0011233445555555554432 35677778899999888777653 3455578899 Q ss_pred hhhcCCeEEeeccCCCcccch--HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHH Q lcl|NC_016762. 318 AGVDIPTKILVGMQTGERASS--EDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERL 395 (456) Q Consensus 318 aas~IP~t~L~G~sp~Glnst--~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~A 395 (456) .+.|||...|-+...+..+.+ +..+.||.. -|.|.++.+-+.|-+.-+. .++.|.+.+|..+|.++++ T Consensus 267 ~~fgVp~~~lg~~~~~~~~~sn~eq~~~~~~~-------~l~P~~~~ie~~l~~~l~~---~~~~f~~~~ll~~d~~~~~ 336 (385) T protein:vir:10 267 KAFGVPSDILGGGTSTESQHSNIDQIKATYLA-------NLNSYVNPIVDELRLKMNA---PDLELDIKDMLDVDDSALI 336 (385) T ss_pred HHhCCCHHHcCCccCCCcccccHHHHHHHHHH-------HHHHHHHHHHHHHHHhhCC---ceEEeechhhhccCHHHHH Confidence 999999876555444444322 334445522 3788888887777554333 3588899999999988864 Q ss_pred HHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCCCCCCcCCCCCC Q lcl|NC_016762. 396 ANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQ 455 (456) Q Consensus 396 ei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~ 455 (456) ++.+++++.| ++++||+|+..+++|++++..+.-..+. ...+..+..++ T Consensus 337 -------~~~~~~~~~G--~~T~NE~R~~~g~~p~p~~~~~~~~~~~--~~~~~g~~~dn 385 (385) T protein:vir:10 337 -------NQVSNLAKSG--VLGAEQAQFILTRSGFLPDNLPEFKPLT--TQVKGGDEGDN 385 (385) T ss_pred -------HHHHHHHhCC--CcCHHHHHHHhCCCccCCCCCccccCcc--cccCCCCCCCC Confidence 5556777778 9999999999999887653322211111 11122222222 No 39 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=99.65 E-value=1e-16 Score=108.27 Aligned_cols=383 Identities=10% Similarity=0.037 Sum_probs=186.9 Q ss_pred CCchhHHHHhHHHHHHHHHHHH-HHhhhh--h-ccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhh Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARM-SLLNQG--I-GHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWK 76 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d-~~~n~~--~-~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR 76 (456) |++|.+..-.+-.+.....+.. ...+.. . ......+...... +. ..+.. +..+..+.+||+++|.+.-+ T Consensus 4 ~~~~~~~~~m~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-----~~~~~-~~~~~~v~~cI~~ia~~ia~ 76 (413) T protein:vir:96 4 VSEIRKDKNLKFFNNKRSPTEESKAKDEIPKAPQVVMTLPNFFKEL-IS-----DGYTK-LSDSPEVRMAVDCIADLVSN 76 (413) T ss_pred cchhhhhhcCCccccCCCcchhhhhhccccccccccccchhhHhhh-cc-----chhHH-HhhchHHHHHHHHHHHhhcc Confidence 8888755221222111100000 000000 0 0000000000111 00 01112 34588999999999999988 Q ss_pred CCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEe Q lcl|NC_016762. 77 TNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPA 155 (456) Q Consensus 77 ~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~ 155 (456) --+.+...+++........+...+...-....-+..|.+.+-+ -.++|.+++++.- |... ..+..+.|+ T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r-~~~g---------~~~~~L~~l 146 (413) T protein:vir:96 77 MTIQLMQNGETGDKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKP-QVSG---------DKIIGLTPI 146 (413) T ss_pred CceEEEEecCCCccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEE-cCCC---------CceEEEEEe Confidence 7788754333222111122211121111122234445555554 4566777766543 3211 012233333 Q ss_pred ccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC-----cCCCcchHHHHHHHHHHHHHHH Q lcl|NC_016762. 156 WAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW-----TGDAIGFLEPAYNSFISLEKVE 230 (456) Q Consensus 156 ~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~-----~~~G~S~le~~~~~l~~~~~~~ 230 (456) ....+++. .++ + -..|.+.. .+..+.++.||||... ...|.|.++.+.+.+.....+ T Consensus 147 ~~~~v~~~---~~~-----~-~~~y~~~~--------~~~~~~~~evih~k~~~~~~~~~~G~s~~~~~~~~i~~~~~~- 208 (413) T protein:vir:96 147 SPYKVTFN---VSD-----D-DLDYSITF--------DNKEYDPSTLLHFVLNPSIERPFIGTGYKVALKDIVGNLKQA- 208 (413) T ss_pred cCceeEEE---EcC-----C-eEEEEEee--------cCcEEchhhEEEEeccCCCCCccccccHHHHHHHHHHHHHHH- Confidence 33223221 111 1 12455531 1345778888888521 234999999998877555444 Q ss_pred HHHHHHHHHHhhhh-hhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC---CC-eEEecCC-CceeEE-ecccC Q lcl|NC_016762. 231 GGSGESFLKNAARQ-LLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG---ND-VLLPTQG-ATVTQM-VSAVS 303 (456) Q Consensus 231 ~~~~~~~~~~~~~~-l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---~~-~~lid~~-d~~~~~-~~~~s 303 (456) ......++++.... ..++... .+ ..+..+++.+.+....++ .+ .+++..+ .++..+ ..+.. T Consensus 209 ~~~~~~~~~ng~~p~gil~~~~-----~l-------~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~~~~~~~~ 276 (413) T protein:vir:96 209 SVTKKGFMASEYMPNLIVSVDS-----DS-------DELSDEEGRENFEEMYLKRKEAGKPWIIPEGMVNVQQIKPLTLN 276 (413) T ss_pred HHHHHHHHhccCCccEEEEeCC-----CC-------CHHHHHHHHHHHHHHhcCccccCceeeecCCcccccccccCChh Confidence 33444555553321 1111111 01 122334444444443322 23 3445444 334433 23332 Q ss_pred --CHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceE Q lcl|NC_016762. 304 --DPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFT 380 (456) Q Consensus 304 --gl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~ 380 (456) .+-+........||.+.|||..+| |.. .+++ ...+||..+ |.|.++.+-+.|-+.- .+....|. T Consensus 277 d~q~~e~~~~~~~~Ia~~fgVP~~~l-g~~----~~~~~~~~~~~~~~-------l~P~~~~ie~~ln~~l-l~~~~~~~ 343 (413) T protein:vir:96 277 DLAINDAVTLDKKTVAGIFGVPAFLL-GVG----TYNKDEFNNFINTK-------IMSIAQVIQQTYNKLI-VEEDMYFS 343 (413) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHc-CCC----cchHHHHHHHHHHH-------HHHHHHHHHHHHHHhh-CCCCcEEE Confidence 333455566788999999999755 532 1222 345666543 8888888877775543 33344567 Q ss_pred EEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc---ccCCC----CCCCCCcCCC Q lcl|NC_016762. 381 AIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP---DTEPE----DEDAARTDPT 452 (456) Q Consensus 381 ~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~---~~~~~----d~~~~~~d~~ 452 (456) |.++.|...+.+++++ +...++..| +++++|+|+..+++|+++++..- .-.+. +.+..+.+++ T Consensus 344 fd~~~ll~~d~~~~~~-------~~~~~~~~G--~~t~NE~R~~~g~~p~~~gd~~~~~~n~~~~~~~~~~~~~~~~dt 413 (413) T protein:vir:96 344 LNPRSLYNYSLTEMVS-------AGAQMTQLN--ALRRNEFRNWVGMPPDAEMDDLLVLENYLQQKDLVNQKKLIQDET 413 (413) T ss_pred EechhhhccCHHHHHH-------HHHHHHhCC--CcCHHHHHHHhCCCCCCCcceeeecccccchhhcccccCCCCCCC Confidence 7777888888887765 445677778 99999999999999987654431 11111 1111122222 No 40 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=99.65 E-value=3.8e-16 Score=105.12 Aligned_cols=377 Identities=15% Similarity=0.104 Sum_probs=190.4 Q ss_pred HHHhhhhhccCc--ccchhh---hh-ccCc----ccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhh Q lcl|NC_016762. 22 MSLLNQGIGHDA--KRPQAW---CE-YGFP----QEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSK 91 (456) Q Consensus 22 d~~~n~~~~~gt--~~~~~~---~~-~~~~----~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~ 91 (456) |++.+..-.-.+ ..+... .. +++. ...+.. -+.+++-+.+||+++|.++-+--+++..+.+..... T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~~~~~ 76 (416) T protein:vir:81 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDI----EAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSD 76 (416) T ss_pred CCcccccccccccCCCcchhHHHHHhccccccCccccchh----hhhcchHHHHHHHHHHHhhccCceEEecCccccccc Confidence 333322111111 111100 00 0111 111111 123455567799999999987766765433221111 Q ss_pred hhHHHHHHHHHHHHHhhHHHHHHHHHHhh-cccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhcccc Q lcl|NC_016762. 92 DETEWERKNKPLIAGGRFWRAVSEADRRR-LVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPD 170 (456) Q Consensus 92 ~~~~~e~~i~~~~~~l~~~~~~~ea~~~~-r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~ 170 (456) .+-..|...=..+.-+..|.+++.+. .++|-+++++. .++. +.+..+.|+-...+++ . T Consensus 77 ---~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~-r~~~----------G~~~~L~~i~~~~v~v-------~ 135 (416) T protein:vir:81 77 ---RIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEIT-RDKT----------GEPMNLTFRKTSEIEL-------K 135 (416) T ss_pred ---hHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEE-ECCC----------CcEEEEEEEcCceeEE-------E Confidence 11111111111122234555665554 46677776653 3321 1133444444333332 2 Q ss_pred ccccCCceeEEEeecccCCccccceeeehhhhheecCCc---CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh- Q lcl|NC_016762. 171 SETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT---GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLL- 246 (456) Q Consensus 171 s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~---~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~- 246 (456) ....|.+.+|...... + .....+.++++.||||...+ ..|.|.++.+.+.+... .....++..++++..+.-. T Consensus 136 ~~~~g~~~~~~~~~~~-~-~~~~~~~~~~~evihir~~~~d~~~G~s~i~~~~~~i~~~-~~~~~~~~~~f~ng~~~~gi 212 (416) T protein:vir:81 136 SDARGRLYYFHQRIDS-N-GNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESD-NNGKDFLNNFLRNGTHAGGI 212 (416) T ss_pred ECCCccEEEEEEEecC-C-CceeEEEEccccEEEeccCCCCCccccCHHHHHHHHHHHH-HHHHHHHHHHHhccCCCcEE Confidence 2234666555443111 1 12223568888898885332 35889999988766543 3344455556666433221 Q ss_pred hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEEecccCC--HHHHHHHHHHHHHhhh Q lcl|NC_016762. 247 LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQMVSAVSD--PGPTYNVNLQTAAAGV 320 (456) Q Consensus 247 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~~~~~sg--l~~~~~~~~~~~aaas 320 (456) ++.... + ..++..+++.+.+....++ .+.++++.+.+|+.++.+... +-+........||++. T Consensus 213 l~~~~~-----~------~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~f 281 (416) T protein:vir:81 213 LKMKGV-----L------DNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVF 281 (416) T ss_pred EEeCCC-----C------CCHHHHHHHHHHHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHh Confidence 111100 0 1123333443344333322 245677777899998876543 3455566678999999 Q ss_pred cCCeEEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC--CCceEEEeCCCCCCCHHHHHHHH Q lcl|NC_016762. 321 DIPTKILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL--KAEFTAIWDDLTVPTKAERLANS 398 (456) Q Consensus 321 ~IP~t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~--~~d~~~~f~pL~~~seke~Aei~ 398 (456) |||.. |+|...++.+. +....+|. .-|.|.++.+-..|-+. +.+. .-.|.|.++.|...|.+++++ T Consensus 282 gVPp~-~lg~~~~~~~~-~~~~~~~~-------~~l~P~~~~ie~~ln~~-l~~~~~~~~~~f~~~~l~~~D~~~~~~-- 349 (416) T protein:vir:81 282 GIPLH-KFGIETANMSI-TDANLDYL-------STLKPYITCVCAELNFK-FNDEYVNREFKFDTTEIRVVDEKTQAE-- 349 (416) T ss_pred CCCHH-HcCCCCCCccH-HHHHHHHH-------HHHHHHHHHHHHHHhhh-ccccccCceEEEechhhhccCHHHHHH-- Confidence 99987 56766555432 23333332 13788887776655443 2222 235777778888888888754 Q ss_pred HHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCccc--------CCCCCC-------CCCcCCCCCC-C Q lcl|NC_016762. 399 KTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDT--------EPEDED-------AARTDPTGEQ-Q 456 (456) Q Consensus 399 ~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~--------~~~d~~-------~~~~d~~~~~-e 456 (456) +..+++..| +++++|+|+..+++|+++++.+.-. +..++. ...+..+||+ | T Consensus 350 -----~~~~~~~~G--~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:81 350 -----IDKINIDSG--KMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 416 (416) T ss_pred -----HHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceEeecccccccccccccCcccccccccccCCCCCCC Confidence 456677778 9999999999999998776532110 111110 0111123332 2 No 41 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=99.65 E-value=3.8e-16 Score=105.12 Aligned_cols=377 Identities=15% Similarity=0.104 Sum_probs=190.4 Q ss_pred HHHhhhhhccCc--ccchhh---hh-ccCc----ccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhh Q lcl|NC_016762. 22 MSLLNQGIGHDA--KRPQAW---CE-YGFP----QEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSK 91 (456) Q Consensus 22 d~~~n~~~~~gt--~~~~~~---~~-~~~~----~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~ 91 (456) |++.+..-.-.+ ..+... .. +++. ...+.. -+.+++-+.+||+++|.++-+--+++..+.+..... T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~~~~~ 76 (416) T protein:vir:45 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDI----EAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSD 76 (416) T ss_pred CCcccccccccccCCCcchhHHHHHhccccccCccccchh----hhhcchHHHHHHHHHHHhhccCceEEecCccccccc Confidence 333322111111 111100 00 0111 111111 123455567799999999987766765433221111 Q ss_pred hhHHHHHHHHHHHHHhhHHHHHHHHHHhh-cccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhcccc Q lcl|NC_016762. 92 DETEWERKNKPLIAGGRFWRAVSEADRRR-LVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPD 170 (456) Q Consensus 92 ~~~~~e~~i~~~~~~l~~~~~~~ea~~~~-r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~ 170 (456) .+-..|...=..+.-+..|.+++.+. .++|-+++++. .++. +.+..+.|+-...+++ . T Consensus 77 ---~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~-r~~~----------G~~~~L~~i~~~~v~v-------~ 135 (416) T protein:vir:45 77 ---RIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEIT-RDKT----------GEPMNLTFRKTSEIEL-------K 135 (416) T ss_pred ---hHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEE-ECCC----------CcEEEEEEEcCceeEE-------E Confidence 11111111111122234555665554 46677776653 3321 1133444444333332 2 Q ss_pred ccccCCceeEEEeecccCCccccceeeehhhhheecCCc---CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh- Q lcl|NC_016762. 171 SETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT---GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLL- 246 (456) Q Consensus 171 s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~---~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~- 246 (456) ....|.+.+|...... + .....+.++++.||||...+ ..|.|.++.+.+.+... .....++..++++..+.-. T Consensus 136 ~~~~g~~~~~~~~~~~-~-~~~~~~~~~~~evihir~~~~d~~~G~s~i~~~~~~i~~~-~~~~~~~~~~f~ng~~~~gi 212 (416) T protein:vir:45 136 SDARGRLYYFHQRIDS-N-GNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESD-NNGKDFLNNFLRNGTHAGGI 212 (416) T ss_pred ECCCccEEEEEEEecC-C-CceeEEEEccccEEEeccCCCCCccccCHHHHHHHHHHHH-HHHHHHHHHHHhccCCCcEE Confidence 2234666555443111 1 12223568888898885332 35889999988766543 3344455556666433221 Q ss_pred hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEEecccCC--HHHHHHHHHHHHHhhh Q lcl|NC_016762. 247 LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQMVSAVSD--PGPTYNVNLQTAAAGV 320 (456) Q Consensus 247 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~~~~~sg--l~~~~~~~~~~~aaas 320 (456) ++.... + ..++..+++.+.+....++ .+.++++.+.+|+.++.+... +-+........||++. T Consensus 213 l~~~~~-----~------~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~f 281 (416) T protein:vir:45 213 LKMKGV-----L------DNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVF 281 (416) T ss_pred EEeCCC-----C------CCHHHHHHHHHHHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHh Confidence 111100 0 1123333443344333322 245677777899998876543 3455566678999999 Q ss_pred cCCeEEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC--CCceEEEeCCCCCCCHHHHHHHH Q lcl|NC_016762. 321 DIPTKILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL--KAEFTAIWDDLTVPTKAERLANS 398 (456) Q Consensus 321 ~IP~t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~--~~d~~~~f~pL~~~seke~Aei~ 398 (456) |||.. |+|...++.+. +....+|. .-|.|.++.+-..|-+. +.+. .-.|.|.++.|...|.+++++ T Consensus 282 gVPp~-~lg~~~~~~~~-~~~~~~~~-------~~l~P~~~~ie~~ln~~-l~~~~~~~~~~f~~~~l~~~D~~~~~~-- 349 (416) T protein:vir:45 282 GIPLH-KFGIETANMSI-TDANLDYL-------STLKPYITCVCAELNFK-FNDEYVNREFKFDTTEIRVVDEKTQAE-- 349 (416) T ss_pred CCCHH-HcCCCCCCccH-HHHHHHHH-------HHHHHHHHHHHHHHhhh-ccccccCceEEEechhhhccCHHHHHH-- Confidence 99987 56766555432 23333332 13788887776655443 2222 235777778888888888754 Q ss_pred HHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCccc--------CCCCCC-------CCCcCCCCCC-C Q lcl|NC_016762. 399 KTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDT--------EPEDED-------AARTDPTGEQ-Q 456 (456) Q Consensus 399 ~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~--------~~~d~~-------~~~~d~~~~~-e 456 (456) +..+++..| +++++|+|+..+++|+++++.+.-. +..++. ...+..+||+ | T Consensus 350 -----~~~~~~~~G--~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:45 350 -----IDKINIDSG--KMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 416 (416) T ss_pred -----HHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceEeecccccccccccccCcccccccccccCCCCCCC Confidence 456677778 9999999999999998776532110 111110 0111123332 2 No 42 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=99.64 E-value=2.1e-16 Score=106.57 Aligned_cols=373 Identities=12% Similarity=0.051 Sum_probs=183.7 Q ss_pred HHHhhhhhccCcc----cchhh-hhccC-----cccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhh Q lcl|NC_016762. 22 MSLLNQGIGHDAK----RPQAW-CEYGF-----PQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSK 91 (456) Q Consensus 22 d~~~n~~~~~gt~----~~~~~-~~~~~-----~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~ 91 (456) |+|.+.+-+.-.+ .+..+ ..++. ....+. ..+ ..++.++.||+++|+++-+--+.+....++.... T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~-~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~ 76 (406) T protein:vir:95 1 MGLFDRWRRTKRKSKIRADTGYVGLFMSGEDVSFLVPGY---VRL-SDNPEVRMAVHKIADLISSMTIYLMQNTEDGDIR 76 (406) T ss_pred CcchhhhccccccccccccchhhhhhccCcccCccccCH---HHH-hhcHHHHHHHHHHHHhhccCceEEEEecCCccee Confidence 4554433211110 11111 11110 111222 233 4589999999999999988877775333221111 Q ss_pred hhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEe-cCCCCccccccCCcCceeEEEEeccccCChhhhhccc Q lcl|NC_016762. 92 DETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHI-RDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKP 169 (456) Q Consensus 92 ~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i-~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp 169 (456) ....+...+...=..+.-+..|.+.+-+ -.++|.+..++.+ +++. +.+..+.|+....+++ ..+. T Consensus 77 ~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~----------g~~~~l~~i~~~~v~~---~~~~ 143 (406) T protein:vir:95 77 IRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTAD----------GLIDELVPLTPSKVNF---LDTP 143 (406) T ss_pred ecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCC----------CcEEEEEEEcCceeEE---EEcC Confidence 1122222222211122234445555444 4555544444332 2221 1123344443332322 1111 Q ss_pred cccccCCceeEEEeecccCCccccceeeehhhhheecC--C---cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_016762. 170 DSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD--W---TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQ 244 (456) Q Consensus 170 ~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~--~---~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~ 244 (456) . .|++.. .++.+.++.||||.- . ...|.|.++.+.+.+.....+..... .++++..+. T Consensus 144 ~--------~~~~~~--------~~~~~~~~evih~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~-~~~~ng~~~ 206 (406) T protein:vir:95 144 D--------GYQVLY--------GGQTFNYDEVLHFIYNPDPERPYIGRGYRVVLKDIADNLKQATATKK-SFMSGKYMP 206 (406) T ss_pred C--------eEEEEe--------ccEEEchhHEEEeeccCCCCCCccccCHHHHHHHHHHHHHHHHHHHH-HHHhccCCc Confidence 1 134431 134577788888742 2 23599999999887766655544333 444543332 Q ss_pred hh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecC-CCceeEEe-ccc--CCHHHHHHHHHHHHHhh Q lcl|NC_016762. 245 LL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQ-GATVTQMV-SAV--SDPGPTYNVNLQTAAAG 319 (456) Q Consensus 245 l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~-~d~~~~~~-~~~--sgl~~~~~~~~~~~aaa 319 (456) -. ++.... +. ....++..+++.+.+.......+.+++.. +++++++. .+. +.+.+......+.||.+ T Consensus 207 ~~il~~~~~-----l~---~e~~~~~~~~~~~~~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~ 278 (406) T protein:vir:95 207 SLIVKVDAA-----TA---ELSSEEGRNAVFKKYLQATEAGQPWIIPAELLEVEQVKPLSLKDIAINEAVELDKRTVAGM 278 (406) T ss_pred ceEEEeCCC-----CC---HHHHHHHHHHHHHHhccccccCCceeecCCCccccccccCChhHHHHHHHHHHHHHHHHHH Confidence 11 111111 10 00112223333222221112223444543 45565432 333 34456667788999999 Q ss_pred hcCCeEEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHH Q lcl|NC_016762. 320 VDIPTKILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSK 399 (456) Q Consensus 320 s~IP~t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~ 399 (456) .|||..+| |... +..+...+||.. -|.|.++++-+.|-+.-+.+..-.+.|.++.|...|.+++++. T Consensus 279 fgVp~~~l-g~~~---~~~~~~~~~~~~-------~l~P~~~~ie~~l~~~l~~~~~~~~~fd~~~l~~~d~~~~~~~-- 345 (406) T protein:vir:95 279 FGVPAFLL-GIGE---FNRDEYNNFINS-------TILPIAKGIEQELTRKLLISPDLYFKFNPRSLYAYDLKELAEV-- 345 (406) T ss_pred hCCCHHHc-CCCC---chHHHHHHHHHH-------HHHHHHHHHHHHHHHhcCCCCCcEEEeechhhhcCCHHHHHHH-- Confidence 99998755 5321 112345666654 4889988888777655454444457777888888888876544 Q ss_pred HHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc---c-----cCCCCCC---CCCcCCCCCCC Q lcl|NC_016762. 400 TMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP---D-----TEPEDED---AARTDPTGEQQ 456 (456) Q Consensus 400 ~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~---~-----~~~~d~~---~~~~d~~~~~e 456 (456) ...++..| +++++|+|+..+++|+++++..- . ....... .+..++.++.| T Consensus 346 -----~~~l~~~G--~~t~NE~R~~~gl~p~~~gd~~~~~~n~~~~~~~~~~~~~k~g~~~~~~~~~~ 406 (406) T protein:vir:95 346 -----GSNMYVRG--IMEGNEVRDWLGLSPKEGLSELVILENYIPLDKIGDQSKLKGGDNSGADGQTD 406 (406) T ss_pred -----HHHHHhCC--CcCHHHHHHHhCCCCCCCcceeeeccCccchhhcccccccCCCCCCCCCCCCC Confidence 45667777 99999999999999887653321 0 0001011 11112222233 No 43 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=99.64 E-value=7.4e-16 Score=103.55 Aligned_cols=390 Identities=13% Similarity=0.097 Sum_probs=192.6 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhh--h-hhccCcc---cchhhhhccCc-----ccCCHHHHHHHHhcCchhhhhhcc Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLN--Q-GIGHDAK---RPQAWCEYGFP-----QEITFNDLYTMYRRGGIAHGAVEK 69 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n--~-~~~~gt~---~~~~~~~~~~~-----~~~~~~~l~~~Y~~~~l~r~iVd~ 69 (456) |++..+. .-.+. ....|.- . .++.+.. .+-....++.. ..++. ..+.++..+.+||++ T Consensus 1 ~~~~~~m---g~f~r----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~----~~al~~~~V~~~i~~ 69 (432) T protein:vir:81 1 MPDEKKL---GLFGQ----LKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNA----DAIMRLDAVAACVKL 69 (432) T ss_pred CCchhhc---chhhh----hhhhcccccccccccccccccCccchhhhcccccccCcccch----HhhhccHHHHHHHHH Confidence 5543221 01000 0001100 0 0010000 00011112211 12222 235678889999999 Q ss_pred chhHHhhCCCEEecCCCcchhh-hhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcC Q lcl|NC_016762. 70 IVTTCWKTNPQVIEGDDQDRSK-DETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLN 147 (456) Q Consensus 70 ~aed~tR~~~~i~~~~~~d~~~-~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~ 147 (456) +|++.-+--+.+....++...+ ....+-..|...=....-+..|.+++-+ -.++|-|++++.-.+|+ T Consensus 70 Ia~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~~g~----------- 138 (432) T protein:vir:81 70 VSQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTDGR----------- 138 (432) T ss_pred HHHhhhhCceeeEEecCCcceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecCCc----------- Confidence 9999987766664322211111 1111111121111112223345555444 45667776665443332 Q ss_pred ceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc---CCCcchHHHHHHHHH Q lcl|NC_016762. 148 GLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT---GDAIGFLEPAYNSFI 224 (456) Q Consensus 148 ~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~---~~G~S~le~~~~~l~ 224 (456) +..+.|+-...+++. .|+ .|. .+|.+.. .+ +..+.++++.|+||...+ ..|.|.++.+.+.+. T Consensus 139 -~~~L~~l~~~~v~v~---~~~----~g~-~~y~~~~--~~---g~~~~~~~~~iih~r~~~~dg~~G~spi~~~~~~i~ 204 (432) T protein:vir:81 139 -IESLQYLANDRLTIT---TDP----KGN-TAYRYRR--TD---GQMIDIPKQQIWKIMGYSLDGENGLSAIRYGAQIFG 204 (432) T ss_pred -EEEEEEEcCCceEEE---ECC----CCc-EEEEEEe--cC---ceEEEEccccEEEecCCCCCCcccccHHHHHHHHHH Confidence 223333333323221 122 233 3455542 12 234678888888885432 358999998887665 Q ss_pred HHHHHHHHHHHHHHHHhhhhhhh-hhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEecccC Q lcl|NC_016762. 225 SLEKVEGGSGESFLKNAARQLLL-NFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSAVS 303 (456) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~l~~-~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~~s 303 (456) .... ....+..++++..+.-.+ +.... + .++..+++.+.+.......+.++++.+.+|++++.+.. T Consensus 205 ~~~~-~~~~~~~~f~ng~~~~gil~~~~~-----l-------~~e~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~ 271 (432) T protein:vir:81 205 TAIA-AEAQAARAFRNGQLQSVYYQIDRF-----L-------TDDQYDSFAKKVSGSVEAGRAPLLEGGMDVKSLGLNPV 271 (432) T ss_pred HHHH-HHHHHHHHHhcCCCcceEEecCCC-----C-------CHHHHHHHHHHHhhhhcCCCceecCCCceEEEccCCHH Confidence 4433 344444566664433222 11111 1 12334455554444434446777777889999987765 Q ss_pred CH--HHHHHHHHHHHHhhhcCCeEEeeccCCCcccch----H-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC- Q lcl|NC_016762. 304 DP--GPTYNVNLQTAAAGVDIPTKILVGMQTGERASS----E-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL- 375 (456) Q Consensus 304 gl--~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst----~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~- 375 (456) +. -+......++||.+.+||-.. +|...+|=+++ + -.+.||.. -|.|.++.+-.-|-+.-+.+. T Consensus 272 d~q~le~~~~~~~~Ia~~fgVPp~~-lg~~~~~~~~~~sn~eq~~~~f~~~-------tl~P~~~~ie~~l~~kLl~~~~ 343 (432) T protein:vir:81 272 DAQLLQSRQYSVESICRFFGVPPSM-IGHSSAGTTSWGSGIESQQLGFLTM-------TLSPWLRRIEQSIALNLLSPAE 343 (432) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHH-cCCcCCccccccchHHHHHHHHHHH-------HHHHHHHHHHHHHHhhccCccc Confidence 44 355667788999999999864 57654443221 2 23456643 467777766555544322221 Q ss_pred --CCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc----ccCCCC--CCCC Q lcl|NC_016762. 376 --KAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP----DTEPED--EDAA 447 (456) Q Consensus 376 --~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~----~~~~~d--~~~~ 447 (456) .-.+.|.++.|...+.+++++ +..+++..| +++++|+|+..+++|+++++..- .-.+.+ .+.. T Consensus 344 ~~~~~~~fd~~~llr~d~~~r~~-------~~~~~~~~G--~~t~NE~R~~~glpp~~g~~~~~~~~~~~~pl~~~~~~~ 414 (432) T protein:vir:81 344 RRRYFADFDTSALLRADSAARSS-------YYSQLVNNG--LMTRDEAREIEGLPKLGGNAAVLTVQSAMVPLDSIGLQA 414 (432) T ss_pred cCceEEEeechhhhccCHHHHHH-------HHHHHHhCC--CCCHHHHHHHhCCCCCCCCcceEeecCcccchhhhccCC Confidence 113556666888888887654 445667777 99999999999999987543210 000100 0111 Q ss_pred CcC--CCCCCC Q lcl|NC_016762. 448 RTD--PTGEQQ 456 (456) Q Consensus 448 ~~d--~~~~~e 456 (456) .++ .+.+++ T Consensus 415 ~~~~~~~~~n~ 425 (432) T protein:vir:81 415 SPEPASGLGNQ 425 (432) T ss_pred CCCCCCCCCCc Confidence 111 111111 No 44 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=99.64 E-value=1.2e-15 Score=102.34 Aligned_cols=418 Identities=13% Similarity=0.090 Sum_probs=202.5 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhh------hhccCcccch--hhhhccCcc--cCCHHHH-HHHHhcCchhhhhhcc Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQ------GIGHDAKRPQ--AWCEYGFPQ--EITFNDL-YTMYRRGGIAHGAVEK 69 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~------~~~~gt~~~~--~~~~~~~~~--~~~~~~l-~~~Y~~~~l~r~iVd~ 69 (456) |-.++.-+...+.+.+.... .+..|. +.+.+...+. .+-.-+.+. ..++.-+ ...|.++..+.+||++ T Consensus 3 ~~~~l~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~g~~v~~~~a~~~~~v~~~i~~ 81 (466) T protein:vir:81 3 LIDRLLSTRGAAPRMSIDDY-AQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTELAPDTFVGLATQAYQANGPVFACMLV 81 (466) T ss_pred hhHHHhhccCcccccchhhh-hhhhhhhhccccccccccccHHHHHhhccccccccCccccccchhhhhccHHHHHHHHH Confidence 34444433222211111110 111110 0111111111 110000000 0011111 3346779999999999 Q ss_pred chhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHH---hhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCC Q lcl|NC_016762. 70 IVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAG---GRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGK 145 (456) Q Consensus 70 ~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~---l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~ 145 (456) +++++-+--+.+....+....+..... +..++.+ ..-+..|.+.+.+ -.++|-+++++.- ++...-.|.. T Consensus 82 Ia~~ia~lp~~~~~~~~~~~~~~~~~~---~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r-~~~g~l~~~~-- 155 (466) T protein:vir:81 82 RQLVFSSVRFRWQRLRDGKPSDTFGSR---DLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVD-GEFVRMRPDW-- 155 (466) T ss_pred HHHhhccCceEEEEecCCceeeccccH---HHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEe-cCcccccccc-- Confidence 999998888887644322211111111 2222221 2234445555554 4556777776543 3221111111 Q ss_pred cCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC-----cCCCcchHHHHH Q lcl|NC_016762. 146 LNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW-----TGDAIGFLEPAY 220 (456) Q Consensus 146 ~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~-----~~~G~S~le~~~ 220 (456) .+....+.|+-...+++.. +.|. + ....|.++..... .....+.++++.||||... ...|.|.++.+. T Consensus 156 ~g~~~~l~~l~~~~v~~~~-~~~~----~-~~~~y~~~~~~~~-~~~~~~~~~~~dviHir~~~~~~d~~~G~s~i~~~~ 228 (466) T protein:vir:81 156 VDVVVEERMVRGGRGELGG-GQLG----W-RKVGYLYTEGGRQ-SGNESVGFLAEDVVHFAPIPDPLASYRGMSWLTPIL 228 (466) T ss_pred CcceeEEEEecCcceEEEE-cCCC----c-eEEEEEEEecCcc-cccceeeeccccEEEEcCCCCcccccccccHHHHHH Confidence 1223445554443333211 1111 1 1122333211100 1112356888999988542 235899999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC---C-CeEEecCCCce Q lcl|NC_016762. 221 NSFISLEKVEGGSGESFLKNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG---N-DVLLPTQGATV 295 (456) Q Consensus 221 ~~l~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---~-~~~lid~~d~~ 295 (456) +.+... .+.......+|++....-. ++.... + . ++..+++.+.+....++ . ..++++.+-+| T Consensus 229 ~~i~~~-~a~~~~~~~~f~ng~~p~gil~~~~~-----l------~-~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~ 295 (466) T protein:vir:81 229 REIRAD-QAMSKHQAKFFDNGATVNLVIKHNPM-----A------D-PAAVKKWADEVNSKHAGVDNAWKNLNLYPGADA 295 (466) T ss_pred HHHHHH-HHHHHHHHHHHhcCCCcceEEecCCC-----C------C-HHHHHHHHHHHHHHhcCccccccceEcCCCceE Confidence 766443 3444555566666443221 111111 1 1 23334444444443332 2 34566677889 Q ss_pred eEEecccCCH--HHHHHHHHHHHHhhhcCCeEEeeccCCCcccch----H-HHHHHHHHHHHHHHhhhhHHHHHHHHHHH Q lcl|NC_016762. 296 TQMVSAVSDP--GPTYNVNLQTAAAGVDIPTKILVGMQTGERASS----E-DQKYHNARCQARRVQELTFEINDLFAHLM 368 (456) Q Consensus 296 ~~~~~~~sgl--~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst----~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~ 368 (456) +.++.+..+. -+......++||.+.+||-. ++|.+.++-.++ + -.++||.. -|.|.++++-+.|- T Consensus 296 ~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~-~lG~~~~~~~st~sn~eq~~~~f~~~-------tl~P~~~~ie~~l~ 367 (466) T protein:vir:81 296 DVVGSNLQEIDFKNVRGGGETRIAAAAGVPPV-IVGLSEGLAAATYSNYGQARRRLADG-------TAHPLWQNLSGCIG 367 (466) T ss_pred EEccCChhHHHHHHHHHHHHHHHHHHhCCCHH-HcccccCCCccccccHHHHHHHHHHH-------HHHHHHHHHHHHHH Confidence 9998776544 35556788899999999975 777665443333 3 34567643 46787777766554 Q ss_pred HhcCcC---CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccC---CC-CCC-CCcccC Q lcl|NC_016762. 369 RIGVVP---LKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDP---LQ-GGD-PLPDTE 440 (456) Q Consensus 369 ~s~~~~---~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~---~~-~~~-~~~~~~ 440 (456) +.-+.+ ....|.|...+|...+.++++++.+++++.....+..| ++++|+|...+... +. .+. +.+... T Consensus 368 ~~L~~~~~~~~~~~~f~~~~llr~d~~~r~~~~~~~~~~~~~~~~~g---~t~nE~r~~~~~gd~~~~~~~~~~~~~~~~ 444 (466) T protein:vir:81 368 HVMPDMGPDVRLWYDADDVPFLREDEKDAADIQKVRAETINTLITAG---YEPESVVAAVNSGDLRLLKHTGLTSVQLLP 444 (466) T ss_pred hhcCCcccCcceEEEecchhhhccCHHHHHHHHHHHHHHHHHHHHcC---CChhhccccccCCccccccCCCcchhhhcc Confidence 332211 11134455568999999999999999999989999888 59999997543210 10 111 111111 Q ss_pred CC---CCCCCCcCCCCCCC Q lcl|NC_016762. 441 PE---DEDAARTDPTGEQQ 456 (456) Q Consensus 441 ~~---d~~~~~~d~~~~~e 456 (456) +. ..+.+++...+.++ T Consensus 445 ~~~~~~~~~~~~~~~Gg~~ 463 (466) T protein:vir:81 445 PGVSASASSDTPTSGGADD 463 (466) T ss_pred cccccccCCCCcccCCCCc Confidence 11 11222222222223 No 45 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=99.63 E-value=3.8e-15 Score=99.65 Aligned_cols=408 Identities=12% Similarity=0.097 Sum_probs=180.0 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhh-hccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhC-- Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQG-IGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKT-- 77 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~-~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~-- 77 (456) |.+-++.++|..-+.. ...|.... ++.|-..+... -....+...+...|..|.++++||++.++++.+- T Consensus 42 ~~~~~~~~~~~~~~a~----~~~~~~~~~~~~~~~~~~~~----~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~ 113 (563) T protein:vir:95 42 EYQDLTKSLYGQQQAY----AEPFIEMMDTNPEFRDKRSY----MKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQ 113 (563) T ss_pred hHHHHHhhhccCCCcc----hhhhHhhhcccccccccccC----CCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhh Confidence 3333333333211110 01111100 00111111000 1112244678888999999999999999887641 Q ss_pred ---------CCEEecCCCcc-hhhhhHHHHHHHHHHHHHhh--------HHHHHHHH-HHhhcccCceEEEEEe-cCCCC Q lcl|NC_016762. 78 ---------NPQVIEGDDQD-RSKDETEWERKNKPLIAGGR--------FWRAVSEA-DRRRLVGRYSGLLLHI-RDSQP 137 (456) Q Consensus 78 ---------~~~i~~~~~~d-~~~~~~~~e~~i~~~~~~l~--------~~~~~~ea-~~~~r~~Ggs~i~i~i-~D~~~ 137 (456) ++.|.-...+. ..+....-...+...+..++ -+..|.+. +..-.++|.+++++.+ +|+. T Consensus 114 ~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~- 192 (563) T protein:vir:95 114 PARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNK- 192 (563) T ss_pred hhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCC- Confidence 22332111111 11111100112333332111 12234444 4446777877776544 3421 Q ss_pred ccccccCCcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhhee------cCC-cC Q lcl|NC_016762. 138 WDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFIL------GDW-TG 210 (456) Q Consensus 138 ~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~------~~~-~~ 210 (456) +.+..+.|+....+++. .|....-|.....|... .+|.. ...+-++.+|++ ... .. T Consensus 193 ---------G~~~~L~pl~p~~V~v~---~~~~g~~~~~~~~y~~~---~~g~~--~~~~~~~evI~~~~~~~~d~~~~~ 255 (563) T protein:vir:95 193 ---------TKLEKFIAVDPSTIFYA---TDKKGKIIKGGKRFVQV---VDKRV--VASFTSRELAMGIRNPRTELSSSG 255 (563) T ss_pred ---------CceEEEEEeCCceeEEE---ECCCCceeccceeEEEE---eCCce--eEEecCcceEEEeccCCCCcccCc Confidence 12344555544444432 12222223223333322 11211 112223333322 111 34 Q ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh-hhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC---CC- Q lcl|NC_016762. 211 DAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLL-NFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG---ND- 285 (456) Q Consensus 211 ~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~-~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---~~- 285 (456) .|.|.++.+.+.+.....+ ......+|++....-.+ ++.... ...++..+++.+.+....++ .+ T Consensus 256 ~G~Spi~~a~~~i~~~~~~-~~~~~~~f~ng~~p~giL~~~~~~----------~ls~e~~~~~~~~~~~~~~G~~nagk 324 (563) T protein:vir:95 256 YGLSEVEIAMKEFIAYNNT-ESFNDRFFSHGGTTRGILQIRSDQ----------QQSQHALENFKREWKSSLSGINGSWQ 324 (563) T ss_pred ccchHHHHHHHHHHHHHHH-HHHHHHHHHccCCCceEEEeCCCC----------CCCHHHHHHHHHHHHHHhcccccccc Confidence 5999999999877655444 33444556654332211 110000 01133444555555544333 22 Q ss_pred -eEEecCCCceeEEecccCC--HHHHHHHHHHHHHhhhcCCeEEeeccCC-CcccchH-----HHHHHHHHHHHHHHhhh Q lcl|NC_016762. 286 -VLLPTQGATVTQMVSAVSD--PGPTYNVNLQTAAAGVDIPTKILVGMQT-GERASSE-----DQKYHNARCQARRVQEL 356 (456) Q Consensus 286 -~~lid~~d~~~~~~~~~sg--l~~~~~~~~~~~aaas~IP~t~L~G~sp-~Glnst~-----D~~nyyd~I~~~Qe~~l 356 (456) .++++.+-+|+.++.+... +-+......+.||.+.|||-..| |..- ++..+++ ...|.-..-.+.-+.-| T Consensus 325 ~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~l-G~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL 403 (563) T protein:vir:95 325 IPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEI-GFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGL 403 (563) T ss_pred ceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-cccccccccccccccchhhccHHHHHHHHHHHHH Confidence 2455667789998877654 34666678889999999998754 6533 2222111 11111111112222346 Q ss_pred hHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCC Q lcl|NC_016762. 357 TFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPL 436 (456) Q Consensus 357 rp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~ 436 (456) .|.++.+-..|-+.-+-.....+.|+|. .++.+.+++... ...++..| ++|++|+|+..+++|+++++.. T Consensus 404 ~P~l~~ie~~ln~~L~~~~~~~~~~~f~---r~D~~~~~e~~~-----~~~~~~~G--~lT~NE~R~~~gl~Pi~gGD~~ 473 (563) T protein:vir:95 404 QPLLRFIEDLVNRHIISEYGDKYTFQFV---GGDTKSATDKLN-----ILKLETQI--FKTVNEAREEQGKKPIEGGDII 473 (563) T ss_pred HHHHHHHHHHHHhhhchhcccccEEEec---cCCHHHHHHHHH-----HHHHhcCC--ccCHHHHHHHhCCCCCCCccee Confidence 7777666555433212122234566653 445555544322 12345666 9999999999999998765421 Q ss_pred cc---------------c--C---------------CCCC-CCCCcCCCCCCC Q lcl|NC_016762. 437 PD---------------T--E---------------PEDE-DAARTDPTGEQQ 456 (456) Q Consensus 437 ~~---------------~--~---------------~~d~-~~~~~d~~~~~e 456 (456) -. . . +.++ +.++++++.+++ T Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (563) T protein:vir:95 474 LDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDD 526 (563) T ss_pred ecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCc Confidence 00 0 0 0000 000000111111 No 46 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=99.63 E-value=3.8e-15 Score=99.65 Aligned_cols=408 Identities=12% Similarity=0.097 Sum_probs=180.0 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhh-hccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhC-- Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQG-IGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKT-- 77 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~-~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~-- 77 (456) |.+-++.++|..-+.. ...|.... ++.|-..+... -....+...+...|..|.++++||++.++++.+- T Consensus 42 ~~~~~~~~~~~~~~a~----~~~~~~~~~~~~~~~~~~~~----~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~ 113 (563) T protein:vir:99 42 EYQDLTKSLYGQQQAY----AEPFIEMMDTNPEFRDKRSY----MKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQ 113 (563) T ss_pred hHHHHHhhhccCCCcc----hhhhHhhhcccccccccccC----CCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhh Confidence 3333333333211110 01111100 00111111000 1112244678888999999999999999887641 Q ss_pred ---------CCEEecCCCcc-hhhhhHHHHHHHHHHHHHhh--------HHHHHHHH-HHhhcccCceEEEEEe-cCCCC Q lcl|NC_016762. 78 ---------NPQVIEGDDQD-RSKDETEWERKNKPLIAGGR--------FWRAVSEA-DRRRLVGRYSGLLLHI-RDSQP 137 (456) Q Consensus 78 ---------~~~i~~~~~~d-~~~~~~~~e~~i~~~~~~l~--------~~~~~~ea-~~~~r~~Ggs~i~i~i-~D~~~ 137 (456) ++.|.-...+. ..+....-...+...+..++ -+..|.+. +..-.++|.+++++.+ +|+. T Consensus 114 ~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~- 192 (563) T protein:vir:99 114 PARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNK- 192 (563) T ss_pred hhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCC- Confidence 22332111111 11111100112333332111 12234444 4446777877776544 3421 Q ss_pred ccccccCCcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhhee------cCC-cC Q lcl|NC_016762. 138 WDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFIL------GDW-TG 210 (456) Q Consensus 138 ~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~------~~~-~~ 210 (456) +.+..+.|+....+++. .|....-|.....|... .+|.. ...+-++.+|++ ... .. T Consensus 193 ---------G~~~~L~pl~p~~V~v~---~~~~g~~~~~~~~y~~~---~~g~~--~~~~~~~evI~~~~~~~~d~~~~~ 255 (563) T protein:vir:99 193 ---------TKLEKFIAVDPSTIFYA---TDKKGKIIKGGKRFVQV---VDKRV--VASFTSRELAMGIRNPRTELSSSG 255 (563) T ss_pred ---------CceEEEEEeCCceeEEE---ECCCCceeccceeEEEE---eCCce--eEEecCcceEEEeccCCCCcccCc Confidence 12344555544444432 12222223223333322 11211 112223333322 111 34 Q ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh-hhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC---CC- Q lcl|NC_016762. 211 DAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLL-NFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG---ND- 285 (456) Q Consensus 211 ~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~-~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---~~- 285 (456) .|.|.++.+.+.+.....+ ......+|++....-.+ ++.... ...++..+++.+.+....++ .+ T Consensus 256 ~G~Spi~~a~~~i~~~~~~-~~~~~~~f~ng~~p~giL~~~~~~----------~ls~e~~~~~~~~~~~~~~G~~nagk 324 (563) T protein:vir:99 256 YGLSEVEIAMKEFIAYNNT-ESFNDRFFSHGGTTRGILQIRSDQ----------QQSQHALENFKREWKSSLSGINGSWQ 324 (563) T ss_pred ccchHHHHHHHHHHHHHHH-HHHHHHHHHccCCCceEEEeCCCC----------CCCHHHHHHHHHHHHHHhcccccccc Confidence 5999999999877655444 33444556654332211 110000 01133444555555544333 22 Q ss_pred -eEEecCCCceeEEecccCC--HHHHHHHHHHHHHhhhcCCeEEeeccCC-CcccchH-----HHHHHHHHHHHHHHhhh Q lcl|NC_016762. 286 -VLLPTQGATVTQMVSAVSD--PGPTYNVNLQTAAAGVDIPTKILVGMQT-GERASSE-----DQKYHNARCQARRVQEL 356 (456) Q Consensus 286 -~~lid~~d~~~~~~~~~sg--l~~~~~~~~~~~aaas~IP~t~L~G~sp-~Glnst~-----D~~nyyd~I~~~Qe~~l 356 (456) .++++.+-+|+.++.+... +-+......+.||.+.|||-..| |..- ++..+++ ...|.-..-.+.-+.-| T Consensus 325 ~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~l-G~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL 403 (563) T protein:vir:99 325 IPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEI-GFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGL 403 (563) T ss_pred ceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-cccccccccccccccchhhccHHHHHHHHHHHHH Confidence 2455667789998877654 34666678889999999998754 6533 2222111 11111111112222346 Q ss_pred hHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCC Q lcl|NC_016762. 357 TFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPL 436 (456) Q Consensus 357 rp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~ 436 (456) .|.++.+-..|-+.-+-.....+.|+|. .++.+.+++... ...++..| ++|++|+|+..+++|+++++.. T Consensus 404 ~P~l~~ie~~ln~~L~~~~~~~~~~~f~---r~D~~~~~e~~~-----~~~~~~~G--~lT~NE~R~~~gl~Pi~gGD~~ 473 (563) T protein:vir:99 404 QPLLRFIEDLVNRHIISEYGDKYTFQFV---GGDTKSATDKLN-----ILKLETQI--FKTVNEAREEQGKKPIEGGDII 473 (563) T ss_pred HHHHHHHHHHHHhhhchhcccccEEEec---cCCHHHHHHHHH-----HHHHhcCC--ccCHHHHHHHhCCCCCCCccee Confidence 7777666555433212122234566653 445555544322 12345666 9999999999999998765421 Q ss_pred cc---------------c--C---------------CCCC-CCCCcCCCCCCC Q lcl|NC_016762. 437 PD---------------T--E---------------PEDE-DAARTDPTGEQQ 456 (456) Q Consensus 437 ~~---------------~--~---------------~~d~-~~~~~d~~~~~e 456 (456) -. . . +.++ +.++++++.+++ T Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (563) T protein:vir:99 474 LDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDD 526 (563) T ss_pred ecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCc Confidence 00 0 0 0000 000000111111 No 47 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=99.63 E-value=1.4e-15 Score=102.10 Aligned_cols=373 Identities=13% Similarity=0.057 Sum_probs=182.1 Q ss_pred HhhhhhccCcccchhhhh-ccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHH Q lcl|NC_016762. 24 LLNQGIGHDAKRPQAWCE-YGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKP 102 (456) Q Consensus 24 ~~n~~~~~gt~~~~~~~~-~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~ 102 (456) ...+-.+.|+ + ..|.. .++ .++ ...|..+..+.++|++++++.-.--+.+..++.. ..+ ... +.. T Consensus 1 ~~~~~~~~g~-~-~~~~~~~~~--~~~----~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~~-~~~-~~~----l~~ 66 (723) T protein:vir:94 1 MTTFPSGAGG-W-NAWSADSVF--GNG----AKGWSNSAVAYRCISMLANNAASVDLVVRGPDGE-LDE-LHP----LSQ 66 (723) T ss_pred CcccccCCCc-c-ccccccccc--ccc----HHHHhhhHHHHHHHHHHHHhhccceeEEEcCCCc-cch-hhH----HHH Confidence 1111111111 1 11110 011 111 2457789999999999999987666666533221 111 111 222 Q ss_pred HHH----HhhHHHHHHHHHHhhc-ccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhcccccccc-CC Q lcl|NC_016762. 103 LIA----GGRFWRAVSEADRRRL-VGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETY-GQ 176 (456) Q Consensus 103 ~~~----~l~~~~~~~ea~~~~r-~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~y-g~ 176 (456) ++. ...-...|.+.+.+.. ++|.+++++.- +++... +....+.|+....+.+... .+. ...| +. T Consensus 67 lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r-~~r~~~-------g~p~~l~~l~~~~~~v~~~-~~~-~~~~~~~ 136 (723) T protein:vir:94 67 LWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNY-NGRTPA-------GVPDEIWYVYDRVTTIVAT-RAA-DAVPQAQ 136 (723) T ss_pred HHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEe-cCCccc-------cceeEEEEecCcceEEeec-CCC-ccceeee Confidence 222 1112344666655544 55767666544 333211 1122333332221111111 111 1111 11 Q ss_pred ceeEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh-hhhhh Q lcl|NC_016762. 177 PTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLL-LNFDK 251 (456) Q Consensus 177 P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~-~~~~~ 251 (456) .-.|.+.. .+ +..+.+..+.||||-.. ...|.|.++.+...+-... ........+|+|..+.-. ++. . T Consensus 137 ~~~y~~~~--~~---G~~~~~~~~dIiHir~~~~~dg~~G~Spi~~a~~~i~~~~-aa~~~~~~~f~NG~~p~giL~~-~ 209 (723) T protein:vir:94 137 IIGYVIER--TD---GVRVPVLADEMLWLRFSDPYDPLAVMAPWKAARAAVDADF-YAATWQRQSFKNGARPGGVVNL-G 209 (723) T ss_pred eeEEEEEe--cC---ceeEEecccceEEecCCCCCCCcccccHHHHHHHHHHHHH-HHHHHHHHHHhcCCCcceEEEc-C Confidence 22344432 12 22356788889888533 2359999998887665443 344455567776543211 111 1 Q ss_pred hccHhhHHhhhcCCHHHHHHHHHHHHHHHhc---CCCeEEe-----------cCCCceeEEecccCCH--HHHHHHHHHH Q lcl|NC_016762. 252 EINLGEIASTYGVTLDALNERFNEAARQLNR---GNDVLLP-----------TQGATVTQMVSAVSDP--GPTYNVNLQT 315 (456) Q Consensus 252 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~li-----------d~~d~~~~~~~~~sgl--~~~~~~~~~~ 315 (456) + + + ++..+++.+.+....+ |-+..++ +.+-+|+.++.+..+. -+......+. T Consensus 210 ~-----l------~-~e~~~~~~~~~~~~~~G~~Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~e 277 (723) T protein:vir:94 210 D-----M------D-EQTFTKTVAAFRSQVEGVQNAGRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEE 277 (723) T ss_pred C-----C------C-HHHHHHHHHHHHHHhhchhhcCcceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHH Confidence 1 1 1 2223333333332221 2222222 2344677776554432 3444556778 Q ss_pred HHhhhcCCeEEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCC--CCCCCHHH Q lcl|NC_016762. 316 AAAGVDIPTKILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDD--LTVPTKAE 393 (456) Q Consensus 316 ~aaas~IP~t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~p--L~~~seke 393 (456) ||.+.|||-..|.|.+. +-|..+-...||. .-|.|.++.+-+.|-+.-+-....++.|+|+. |...|.++ T Consensus 278 Ia~afgVPp~~i~~~st-~sN~e~~~~~f~~-------~tL~P~~~~ie~~ln~~Ll~~~g~~~~~~f~~~~lLr~D~~~ 349 (723) T protein:vir:94 278 VMLAFGIRKDALLGGST-YENQAEAKAAVWT-------ETLIPQMEVMASITDLQLLPDIGWTVEWDFNSVPALQEDLEA 349 (723) T ss_pred HHHHhCCChhHcCCCCC-cccHHHHHHHHHH-------HHHHHHHHHHHHHHhHhhcccccCceEEeecchhhhhcCHHH Confidence 99999999887766432 1122233455654 34788887777666443232223467888886 56666666 Q ss_pred HHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCCCCC-CcCC-CCCCC Q lcl|NC_016762. 394 RLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDEDAA-RTDP-TGEQQ 456 (456) Q Consensus 394 ~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~~~~-~~d~-~~~~e 456 (456) ++ ++...+++.| ++++||+|+..+++|+++++...-..+...+.. .+.| .+.+| T Consensus 350 r~-------~~~~~~v~~G--~~T~NE~R~~lglpPi~gGd~~~~~~p~~~~~a~~~~~~p~~~e 405 (723) T protein:vir:94 350 QA-------GRNQGYLVND--VLMVDEVRATIGLDPLPGGIGQMTLTPYRAQFAPAPAPAPAVEE 405 (723) T ss_pred HH-------HHHHHHHhCC--CcCHHHHHHHhCCCCCCCCcccceeccccccccCCCCCCccchh Confidence 54 4566777778 999999999999999987653211111111100 0011 11111 No 48 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=99.63 E-value=1.7e-15 Score=101.58 Aligned_cols=381 Identities=16% Similarity=0.147 Sum_probs=184.8 Q ss_pred HHHhhhhhccCcc-----cchhhhhccCc------ccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchh Q lcl|NC_016762. 22 MSLLNQGIGHDAK-----RPQAWCEYGFP------QEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRS 90 (456) Q Consensus 22 d~~~n~~~~~gt~-----~~~~~~~~~~~------~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~ 90 (456) +-|+| +-.+-|. ++.....|++. ........-..|..++.+.+||++++++.-.--+.+.....+... T Consensus 1 ~~~~~-~~~~~~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~~~~~~~~~ 79 (518) T protein:vir:10 1 MLLAN-GQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) T ss_pred CcccC-ceeecCchhhhhhhhhhcccccccccceecccccchhhHHHhhhHHHHHHHHHHHHhhccCceEEEEEcCCCce Confidence 11111 0001111 00011122221 111112233457788999999999999996655555332222111 Q ss_pred hhhHHHHHHHHHHHHH---hhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhh Q lcl|NC_016762. 91 KDETEWERKNKPLIAG---GRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFD 166 (456) Q Consensus 91 ~~~~~~e~~i~~~~~~---l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~ 166 (456) +... ..+..++.+ ..-+..|.+.+-. -.++|-+++++.- ++. +.+..+.|+-...+++.. T Consensus 80 ~~~~---~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r-~~~----------G~~~~L~~l~p~~v~v~~-- 143 (518) T protein:vir:10 80 EESD---TGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQK-NKS----------GTPEKLMPMHPSRVAIKR-- 143 (518) T ss_pred eccc---hHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEE-CCC----------CcEEEEEEECCCceEEEE-- Confidence 1111 112222221 1122334444444 4566777766543 221 123344444433333311 Q ss_pred ccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_016762. 167 EKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAA 242 (456) Q Consensus 167 ~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~ 242 (456) |. .+-+..|++.... +.....+.+.++.||||...+ ..|.|.++.+.+.+.....+ ......++++.. T Consensus 144 -~~----~~~~~~y~~~~~~--~~~~~~~~~~~~eViHir~~s~dg~~~G~spi~~a~~~i~~~~a~-~~~~~~~f~ng~ 215 (518) T protein:vir:10 144 -NS----RTGRYEYYFQAGA--GVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSS-RNATAAMWKNAG 215 (518) T ss_pred -cC----CCCEEEEEEEecC--CccceEEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHH-HHHHHHHHhcCC Confidence 11 1122345554211 111223456678888875432 24889999888866555443 444555666643 Q ss_pred hhh-hhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEEecccCC--HHHHHHHHHHH Q lcl|NC_016762. 243 RQL-LLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQMVSAVSD--PGPTYNVNLQT 315 (456) Q Consensus 243 ~~l-~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~~~~~sg--l~~~~~~~~~~ 315 (456) +.- .++.... + + ++..+++.+.+....++ ...++++.+.+|+.++.+..+ +-+......+. T Consensus 216 ~p~gil~~~~~-----l------s-~e~~~~~k~~~~~~~~G~~nag~v~vL~~G~~~~~l~~s~~D~q~le~r~~~~~e 283 (518) T protein:vir:10 216 RPNLVLRHEKR-----L------S-EAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREE 283 (518) T ss_pred CccEEEecCCC-----C------C-HHHHHHHHHHHHHHhcCccccCcceEcCCCceEEEccCChhHHHHHHHHHHHHHH Confidence 321 1211111 1 1 23334444444444333 235667777889998876654 34445566789 Q ss_pred HHhhhcCCeEEeeccCCC-cccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC--CCCceEEEeCCCCCCCH Q lcl|NC_016762. 316 AAAGVDIPTKILVGMQTG-ERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP--LKAEFTAIWDDLTVPTK 391 (456) Q Consensus 316 ~aaas~IP~t~L~G~sp~-Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~--~~~d~~~~f~pL~~~se 391 (456) ||.+.|||-.+| |..-+ ..+..+ -..+||..+ |.|.+..+-..|-+.-+-+ ....+.|....|...+. T Consensus 284 Ia~afgVPp~~l-g~~~~~t~sn~eq~~~~f~~~t-------L~P~l~~ie~~ln~~L~~~~~~~~~~~fd~~~llr~D~ 355 (518) T protein:vir:10 284 VCGVYDIAPPIV-HILDRATFSNISAQMRAFYRDT-------MAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDW 355 (518) T ss_pred HHHHhCCCHHHh-ccCCCCCchhHHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcccccCCceEEEechhhhccCH Confidence 999999998655 64332 232223 345666543 6787777765554331111 12235555568888887 Q ss_pred HHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCC--CCCC---------ccc---CC-CCCCCCCcCCC---- Q lcl|NC_016762. 392 AERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQG--GDPL---------PDT---EP-EDEDAARTDPT---- 452 (456) Q Consensus 392 ke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~--~~~~---------~~~---~~-~d~~~~~~d~~---- 452 (456) +++ +++...++..| ++++||+|+..+++|+.+ ++.. ... .. ..+.+.+++|. T Consensus 356 ~~r-------~~~~~~~~~~G--~lT~NE~R~~~Gl~pie~~~gD~~~~~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~ 426 (518) T protein:vir:10 356 EAK-------SESTQKMVNSG--VATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPV 426 (518) T ss_pred HHH-------HHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCCeeeecccceecccccccccCCCCCCCCCCCCcccc Confidence 776 44456677777 999999999999988752 2211 000 00 01111111111 Q ss_pred --CCCC Q lcl|NC_016762. 453 --GEQQ 456 (456) Q Consensus 453 --~~~e 456 (456) ++++ T Consensus 427 ~~~~~~ 432 (518) T protein:vir:10 427 ASLDQS 432 (518) T ss_pred cccccc Confidence 1111 No 49 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=99.62 E-value=1e-15 Score=102.76 Aligned_cols=384 Identities=13% Similarity=0.086 Sum_probs=189.9 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhc---cCcccCCHHHHHHHHhcCchhhhhhccchhHHhhC Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEY---GFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKT 77 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~---~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~ 77 (456) |..+ .+ ..+...++.+-..+-...+-..+..+ .|. .++. ..|..+..+.++|+++|+++-.- T Consensus 1 ~~~~-~~---------~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~~----~~a~~~~~v~~~i~~Ia~~ia~l 65 (409) T protein:vir:94 1 MAKE-NI---------VTRIKKKLIDNWIDQSASKLYDFSPWKNKSFW-GVIN----NTLETNETIFSAITKLSNSMASL 65 (409) T ss_pred Cccc-cc---------chhhhhHHhhhhhcCCcccccccccccCcccc-ccch----hhhhccHHHHHHHHHHHHhhhhC Confidence 4321 11 11111122211111111111111111 111 1122 23667888999999999999877 Q ss_pred CCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEec Q lcl|NC_016762. 78 NPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAW 156 (456) Q Consensus 78 ~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~ 156 (456) -+.+....+..+. .+...+...=..+.-+..|.+.+.+ -.++|-+++++.- +.. +.+..+.|+. T Consensus 66 p~~~~~~~~~~~~----~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r-~~~----------G~~~~L~~l~ 130 (409) T protein:vir:94 66 PLKMYEDYKVVNT----EVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIER-DIY----------HQPSKLFLLN 130 (409) T ss_pred ceeEeecccccch----hHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEE-CCC----------CcEEEEEEEc Confidence 7777544332221 1111122111222334445555444 4667777766542 211 1133444444 Q ss_pred cccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 157 AGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISLEKVEGG 232 (456) Q Consensus 157 ~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~~~~~~~ 232 (456) ...+++. .| .-+.+-+|.+... + +..+.+.++.||||... ...|.|.+..+.+.+.....+ .. T Consensus 131 ~~~v~v~---~~----~~~~~~~y~~~~~--~---g~~~~~~~~dvih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~-~~ 197 (409) T protein:vir:94 131 PDVVEML---IE----NQSRELYYSIHAA--T---GNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAV-RT 197 (409) T ss_pred CceeEEE---Ee----CCCcEEEEEEEcC--C---ceEEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHH-HH Confidence 3333321 11 1234556777522 1 23466888889988532 234889888776644332222 11 Q ss_pred HHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC-CeEEecCCCceeEEecccC--CHHHHH Q lcl|NC_016762. 233 SGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGN-DVLLPTQGATVTQMVSAVS--DPGPTY 309 (456) Q Consensus 233 ~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~lid~~d~~~~~~~~~s--gl~~~~ 309 (456) . .+..+.... .+. ....+.-..+..+++.+.+..+.++. +.++++.+-+|+.++.+.. .+-+.. T Consensus 198 ~--~~~~~~~~~-~~i----------~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~ 264 (409) T protein:vir:94 198 F--NLTEMQKPD-SFM----------LKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASE 264 (409) T ss_pred H--HHHhcCCCC-eeE----------EecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHH Confidence 1 111111110 000 00001111233344444444444444 4556666778988876654 334455 Q ss_pred HHHHHHHHhhhcCCeEEeeccCCCcccchHH-HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC--CCCc--eEEEeC Q lcl|NC_016762. 310 NVNLQTAAAGVDIPTKILVGMQTGERASSED-QKYHNARCQARRVQELTFEINDLFAHLMRIGVVP--LKAE--FTAIWD 384 (456) Q Consensus 310 ~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~--~~~d--~~~~f~ 384 (456) .....+||.+.+||-.+|-+...+..+..+. .+.||..+ |.|.++.+-+.|-+.-+-+ .... |.|... T Consensus 265 ~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~-------l~P~~~~ie~~ln~~Ll~~~~~~~~~~i~fd~~ 337 (409) T protein:vir:94 265 NLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHT-------LLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVK 337 (409) T ss_pred HHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHH-------HHHHHHHHHHHHHHhhCCcccccCcceEEeech Confidence 5567889999999998775544444333333 45677654 7888888776665443321 1223 444455 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc--------ccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 385 DLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP--------DTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 385 pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~--------~~~~~d~~~~~~d~~~~~e 456 (456) .|...|.+++++ +..+++..| +++++|+|+..+++|+++++..- +.........++.+...+| T Consensus 338 ~ll~~d~~~~~~-------~~~~~~~~G--~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~kGG~~n~~e 408 (409) T protein:vir:94 338 SYLRADSATQAE-------VYFKAVRSG--YYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPLELRKSLKGGDKNVNE 408 (409) T ss_pred hhhccCHHHHHH-------HHHHHHhCC--CcCHHHHHHHhCCCCCCCcCeEeecccccccccchhhcccccCCCCCcCC Confidence 777777777654 456677778 99999999999999987654421 1111111122222222333 No 50 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=99.62 E-value=2e-15 Score=101.22 Aligned_cols=386 Identities=13% Similarity=0.073 Sum_probs=189.7 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccch---hhhh-ccCcccCCHHHHHHHHhcCchhhhhhccchhHHhh Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQ---AWCE-YGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWK 76 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~---~~~~-~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR 76 (456) |+==-+ +....+...++.+...+-.+.... .|.. -+++ ++ ...|.++..+.+||+++|+++-. T Consensus 1 m~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--v~----~~~a~~~~~v~~~i~~ia~~iA~ 67 (412) T protein:vir:26 1 MNVIAK-------ENIVTRIKKKLIDNWIDQSTSKLYDFSPWKNRSFWG--VI----NNTLETNETIFSAITKLSNSMAS 67 (412) T ss_pred Cccchh-------hhhhhhhhhhHhhhhhcccccccccccccCCccccc--cc----hhhhhccHHHHHHHHHHHHhHhh Confidence 321100 001112223333322222111111 1111 1111 12 23456788899999999999987 Q ss_pred CCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEe Q lcl|NC_016762. 77 TNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPA 155 (456) Q Consensus 77 ~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~ 155 (456) --+.+....+..+. .+...+...=..+.-+..|.+.+.. -.++|-+++++. ++. .+.+..+.|+ T Consensus 68 lp~~~~~~~~~~~~----~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~-r~~----------~G~~~~L~~l 132 (412) T protein:vir:26 68 LPLKMYEDYKVVNT----EVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIE-RDI----------YHQPSKLFLL 132 (412) T ss_pred CceeEeeccccccc----hHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEE-ECC----------CCcEEEEEEE Confidence 66666543322211 1111122221222234445554444 456677766653 221 1223445554 Q ss_pred ccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHHHHHHH Q lcl|NC_016762. 156 WAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISLEKVEG 231 (456) Q Consensus 156 ~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~~~~~~ 231 (456) -...+++.. |. -+.+.+|.+... .+..+.+.++-|+||... ...|.|.++.+...+.-...+ . T Consensus 133 ~~~~v~v~~---~~----~~~~~~y~~~~~-----~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~a~-~ 199 (412) T protein:vir:26 133 NPDVVEMLI---EN----QSRELYYSIHAA-----TGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAV-R 199 (412) T ss_pred cCceeEEEE---eC----CCcEEEEEEEcC-----CceEEEEccccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHH-H Confidence 443333321 11 123556777521 123456888888888542 235889888776544322222 1 Q ss_pred HHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC-CeEEecCCCceeEEecccC--CHHHH Q lcl|NC_016762. 232 GSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGN-DVLLPTQGATVTQMVSAVS--DPGPT 308 (456) Q Consensus 232 ~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~lid~~d~~~~~~~~~s--gl~~~ 308 (456) .. .+..+. ..-++. ....+.-.++..+++.+.+....++. +.++++.+.+|+.++.+.. .+-+. T Consensus 200 ~~--~~~~~~-~~~~~i----------~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~ 266 (412) T protein:vir:26 200 TF--NLTEMQ-KPDSFM----------LKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVAS 266 (412) T ss_pred HH--HHHhcC-CCCceE----------EecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHH Confidence 11 111111 111100 00001111233344444444444444 4555666778998876654 33444 Q ss_pred HHHHHHHHHhhhcCCeEEeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC--C--CceEEEe Q lcl|NC_016762. 309 YNVNLQTAAAGVDIPTKILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL--K--AEFTAIW 383 (456) Q Consensus 309 ~~~~~~~~aaas~IP~t~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~--~--~d~~~~f 383 (456) .....++||.+.|||-..|-+.+.+..+..+ -.+.||..+ |.|.++.+-+.|-+.-+.+. . ..|.|.+ T Consensus 267 ~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~~~f~~~~-------l~P~~~~ie~~ln~kLl~~~~~~~~~~~~fd~ 339 (412) T protein:vir:26 267 ENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHT-------LLPIVKQYEEEFNRKLLTKTDREKNRYFKFNV 339 (412) T ss_pred HHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHH-------HHHHHHHHHHHHHhhcCCcccccCcceEEeec Confidence 4456788999999999877554444433333 345677654 78888887666644333221 1 2356666 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc---ccCCCC-----CCCCCcCCCCCC Q lcl|NC_016762. 384 DDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP---DTEPED-----EDAARTDPTGEQ 455 (456) Q Consensus 384 ~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~---~~~~~d-----~~~~~~d~~~~~ 455 (456) .+|...|.+++++. ...++..| +++++|+|+..+++|+++++..- +..+.+ ....++.+...+ T Consensus 340 ~~l~~~d~~~~~~~-------~~~~~~~G--~~t~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~gG~~n~~ 410 (412) T protein:vir:26 340 KSYLRADSATQAEV-------YFKAVRSG--YYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPLELRKSLKGGDKNVN 410 (412) T ss_pred hhhhccCHHHHHHH-------HHHHHhCC--CcCHHHHHHHhCCCCCCCcCeeeecccccccccchhhcccccCCCCCcC Confidence 68888888877554 45667777 99999999999999987654421 101111 111122222222 Q ss_pred C Q lcl|NC_016762. 456 Q 456 (456) Q Consensus 456 e 456 (456) | T Consensus 411 e 411 (412) T protein:vir:26 411 E 411 (412) T ss_pred C Confidence 2 No 51 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=99.62 E-value=9.3e-16 Score=102.98 Aligned_cols=374 Identities=11% Similarity=0.064 Sum_probs=177.8 Q ss_pred HHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHh-------------hHHHHHHH-HHH Q lcl|NC_016762. 53 LYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGG-------------RFWRAVSE-ADR 118 (456) Q Consensus 53 l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l-------------~~~~~~~e-a~~ 118 (456) |..+-+.|..+++||++++++...-++.|......+.........+.+...+... ..+..|.+ .+. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 7777778999999999999999988888853222111111111111222222211 12233333 344 Q ss_pred hhcccCceEEEEEec-CCCCcc-ccccCCc-----CceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCcc Q lcl|NC_016762. 119 RRLVGRYSGLLLHIR-DSQPWD-RPARGKL-----NGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRP 191 (456) Q Consensus 119 ~~r~~Ggs~i~i~i~-D~~~~~-~Pl~~~~-----~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~ 191 (456) +-.++|.+++++.-+ .|+... .|++... .+.. +...+........+..++...++.....+.+.. ...+.. T Consensus 81 ~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 158 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERG-FVQLLEEKEKYFGVAGDRYQTNGNGDLDPVFVD-ADDGST 158 (467) T ss_pred HHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecce-eEeecCCceeeEEeccccceeecccceeeeeee-eccccc Confidence 566678777765432 233221 2332110 0000 000000000000001111111111111111111 112223 Q ss_pred ccceeeehhhhheecCC----cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHH Q lcl|NC_016762. 192 GLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLD 267 (456) Q Consensus 192 ~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~ 267 (456) +..+.+.++.||||... ...|.|-+..+...+.... .+..+...+|++....-.+.....- .+ .. T Consensus 159 ~~~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~-~~~~~~~~~f~ng~~p~gil~~~~~---~l-------~~ 227 (467) T protein:vir:31 159 GTSVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDS-AAQDYNIDFFENDGVPRIAIIVKGA---EL-------TE 227 (467) T ss_pred cceeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHH-HHHHHHHHHHhccCCCceEEEecCc---CC-------CH Confidence 34567888999988533 3469999999988775433 3445555566665432221110000 00 11 Q ss_pred HHHHHHHHHHHHHhc---------------CCCeEEecCCCceeEEeccc---C-------CHHHHHHHHHHHHHhhhcC Q lcl|NC_016762. 268 ALNERFNEAARQLNR---------------GNDVLLPTQGATVTQMVSAV---S-------DPGPTYNVNLQTAAAGVDI 322 (456) Q Consensus 268 ~~~~~~~~~~~~~~~---------------~~~~~lid~~d~~~~~~~~~---s-------gl~~~~~~~~~~~aaas~I 322 (456) +..+++.+.+....+ +...+++..+.++..+.+.+ + .+.+.......+||++.|| T Consensus 228 e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgV 307 (467) T protein:vir:31 228 KGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVHDV 307 (467) T ss_pred HHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHHHHHHHHhCC Confidence 222333332222111 11223344444444433322 2 2334555567789999999 Q ss_pred CeEEeeccCC-Ccccch--HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcC----cCCCCceEEEeCCCCCCCHHHHH Q lcl|NC_016762. 323 PTKILVGMQT-GERASS--EDQKYHNARCQARRVQELTFEINDLFAHLMRIGV----VPLKAEFTAIWDDLTVPTKAERL 395 (456) Q Consensus 323 P~t~L~G~sp-~Glnst--~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~----~~~~~d~~~~f~pL~~~seke~A 395 (456) |-. ++|..- +..++. +-..+||..+ |+|.++.+-+.|-+.-+ ......+.|.+..|...+.++++ T Consensus 308 pp~-~lG~~~~~~~~s~~e~~~~~f~~~~-------l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~l~~~d~~~~~ 379 (467) T protein:vir:31 308 PPV-IAGVVESGAFSTDAEEQRKEFAEET-------IQPKQHDFGELLYELVHKQGLDAPDWTIEFELAKPDTKLQDVEI 379 (467) T ss_pred CHH-HcccCCCCCcccCHHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcchhhccCCceEEEecchhhccCHHHHH Confidence 986 557643 333332 2345666433 68877776555433211 11122367777899988888887 Q ss_pred HHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccC----------CCCCCCCCcCCCCCCC Q lcl|NC_016762. 396 ANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTE----------PEDEDAARTDPTGEQQ 456 (456) Q Consensus 396 ei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~----------~~d~~~~~~d~~~~~e 456 (456) ++. ..++..| ++|++|+|+..+++|+.+..-.+... +.+...+++.++.+++ T Consensus 380 ~~~-------~~~~~~G--~~T~NE~R~~~Gl~pi~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (467) T protein:vir:31 380 ASQ-------RVQAMQG--LLTVNELRDEFGFEPFPEEHVYGGETLVAEVTGGSGPGGGIGDQIEQLVEDR 441 (467) T ss_pred HHH-------HHHHhCC--CcCHHHHHHHhCCCCCCcccccCCcccccccccccCCCCcccCcCCCCCCCc Confidence 664 4456777 99999999999999875432211100 0011111111111111 No 52 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=99.62 E-value=2.1e-15 Score=101.10 Aligned_cols=385 Identities=11% Similarity=0.039 Sum_probs=187.1 Q ss_pred CCchh-HHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhh-ccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCC Q lcl|NC_016762. 1 MTDKL-DLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCE-YGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTN 78 (456) Q Consensus 1 ~~~~~-~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~-~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~ 78 (456) |...- ---+..++.. ..+.....++ + ....|.. -++. ++. ..|..+..+.++|+.+|+++-.-- T Consensus 1 ~~~~~~~~~~~~~~~~------~~~~~~~~~~-~-~~~~~~~~~~~~--v~~----~~~~~~~~V~~ci~~Ia~~ia~lp 66 (409) T protein:vir:93 1 MAKENIVTRIKKKLID------NWIDQSTSKL-Y-DFSPWKNRSFWG--VIN----NTLETNETIFSAITKLSNSMASLP 66 (409) T ss_pred CCccchhhhhhhhhhh------hhhccccccc-c-ccccccCccccc--cch----hhhhccHHHHHHHHHHHHhhhhCc Confidence 53221 1111111111 0011111111 1 1111110 1111 122 236678889999999999998766 Q ss_pred CEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEecc Q lcl|NC_016762. 79 PQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWA 157 (456) Q Consensus 79 ~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~ 157 (456) +.+....+..+. .+...+...=...--+..|.+.+.+ -.++|-+++++.- |+. +.+..+.|+-. T Consensus 67 ~~~~~~~~~~~~----~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r-~~~----------G~~~~L~~l~~ 131 (409) T protein:vir:93 67 LKMYEDYKVVNT----EVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIER-DIY----------HQPSKLFLLNP 131 (409) T ss_pred eeEeeccccccc----hHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEE-CCC----------CcEEEEEEEcC Confidence 666543322111 1111121111112233445454444 4566777776543 211 12334444443 Q ss_pred ccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 158 GCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNSFISLEKVEGGS 233 (456) Q Consensus 158 ~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~l~~~~~~~~~~ 233 (456) ..+++.. + +-+.+.+|.+... .+..+.+.++.||||.... ..|.|.++.+.+.+.....+... T Consensus 132 ~~v~~~~---~----~~~~~~~y~~~~~-----~g~~~~~~~~eVih~r~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~- 198 (409) T protein:vir:93 132 DVVEMLI---E----NQSRELYYSIHAA-----TGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTF- 198 (409) T ss_pred ceeEEEE---e----CCCcEEEEEEEcC-----CceEEEEccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHH- Confidence 3332211 1 1133556777522 1234678899999985432 35889888776644332222111 Q ss_pred HHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC-CCeEEecCCCceeEEecccCC--HHHHHH Q lcl|NC_016762. 234 GESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG-NDVLLPTQGATVTQMVSAVSD--PGPTYN 310 (456) Q Consensus 234 ~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~lid~~d~~~~~~~~~sg--l~~~~~ 310 (456) .+..+... -++. ....+.-.++..+++.+.+....++ .+.++++.+.+|++++.+... +-+... T Consensus 199 --~~~~~~~~-~~~i----------~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~r~ 265 (409) T protein:vir:93 199 --NLTEMQKP-DSFM----------LKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASEN 265 (409) T ss_pred --HHHhcCCC-CceE----------EecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHH Confidence 11111111 0000 0000111123334444444443334 345566677789988866543 344444 Q ss_pred HHHHHHHhhhcCCeEEeeccCCCcccchHH-HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC--CCCc--eEEEeCC Q lcl|NC_016762. 311 VNLQTAAAGVDIPTKILVGMQTGERASSED-QKYHNARCQARRVQELTFEINDLFAHLMRIGVVP--LKAE--FTAIWDD 385 (456) Q Consensus 311 ~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~--~~~d--~~~~f~p 385 (456) ...+.||.+.+||-.+|-+...+..+..+. .+.||..+ |.|.++.+-+.|-+.-+.+ .... |.|.+.. T Consensus 266 ~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~-------l~P~~~~ie~~l~~~Ll~~~~~~~~~~~~fd~~~ 338 (409) T protein:vir:93 266 LTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHT-------LLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKS 338 (409) T ss_pred HHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHH-------HHHHHHHHHHHHHhhcCCcccccCcceEEeechh Confidence 567889999999988775544444443333 45677664 7888877766554432221 1123 4555557 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc---ccCCCC-----CCCCCcCCCCCCC Q lcl|NC_016762. 386 LTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP---DTEPED-----EDAARTDPTGEQQ 456 (456) Q Consensus 386 L~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~---~~~~~d-----~~~~~~d~~~~~e 456 (456) |...+.+++++ +..++++.| +++++|+|+..+++|.++++..- ...+.+ .....+.+...+| T Consensus 339 ll~~d~~~~~~-------~~~~~~~~G--~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~gG~~n~~e 408 (409) T protein:vir:93 339 YLRADSATQAE-------VYFKAVRSG--YYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPLELRKSLKGGDKNVNE 408 (409) T ss_pred hhccCHHHHHH-------HHHHHHhCC--CcCHHHHHHHhCCCCCCCcCeeeecccccccccchhhcccccCCCCCcCC Confidence 77777777654 456667778 99999999999999987654421 101111 1111222222222 No 53 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=99.61 E-value=1.9e-15 Score=101.31 Aligned_cols=366 Identities=10% Similarity=0.022 Sum_probs=182.9 Q ss_pred HHHhhhhhcc--Ccc-cchhhhhccC----cccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhH Q lcl|NC_016762. 22 MSLLNQGIGH--DAK-RPQAWCEYGF----PQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDET 94 (456) Q Consensus 22 d~~~n~~~~~--gt~-~~~~~~~~~~----~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~ 94 (456) |+|......- ++. .+..|..+.. ...++.+ .+.+++.+.+||+++|.++-.--+++ .+ .. T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~----~al~~~~V~~~v~~ia~~ia~~p~~~--~~--~~----- 67 (397) T protein:vir:38 1 MPLLKLNKSHSQGFSLNDPDWVNFLTGGEAQKYVSAD----TALKNSDIFSLIMQLSGDLAMVRYTS--ES--DR----- 67 (397) T ss_pred CcchhhhhcccCcccCCchhhhhhhcCCcCCceechH----HhhccHHHHHHHHHHHHHHhhCcccc--cc--cH----- Confidence 4444321111 111 1222332211 1123332 23468899999999999885433222 11 10 Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHhh-cccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccccc Q lcl|NC_016762. 95 EWERKNKPLIAGGRFWRAVSEADRRR-LVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSET 173 (456) Q Consensus 95 ~~e~~i~~~~~~l~~~~~~~ea~~~~-r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~ 173 (456) -..|...-....-+..|.+.+.+. .++|-|++++. +|+. +.+..+.|+....+++. -.. T Consensus 68 --~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~-r~~~----------g~~~~l~~l~~~~v~i~-------~~~ 127 (397) T protein:vir:38 68 --SQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRH-KNTN----------GVDLSWEYLRPSQVQPM-------LLQ 127 (397) T ss_pred --HHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEE-ECCC----------CcEEEEEEEcCceeEEE-------EcC Confidence 011222112223344555555554 45676766553 3321 12334444443333221 111 Q ss_pred cCCceeEEEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh-hhh Q lcl|NC_016762. 174 YGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQL-LLN 248 (456) Q Consensus 174 yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l-~~~ 248 (456) .|...+|+++.... ..+..+.+.++.||||.... ..|.|.++.+...+.....+. .....++++....- .++ T Consensus 128 ~~~~~~y~~~~~~~--~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~-~~~~~~f~ng~~~~~il~ 204 (397) T protein:vir:38 128 DGSGLIYNINFDEP--AIGYMENVPAADVIHIRLLSKNGGKTGISPLSALINEQQIKDASN-ELTLKALKQSVTASAVLT 204 (397) T ss_pred CCceEEEEEEeccc--cccceeEecCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHH-HHHHHHHhccCCccEEEE Confidence 23345566653221 12224568888898885432 358999999988775544433 33344555533211 111 Q ss_pred hhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhc---CCCeEEecCCCceeEEecccCC--HHHHHHHHHHHHHhhhcCC Q lcl|NC_016762. 249 FDKEINLGEIASTYGVTLDALNERFNEAARQLNR---GNDVLLPTQGATVTQMVSAVSD--PGPTYNVNLQTAAAGVDIP 323 (456) Q Consensus 249 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~lid~~d~~~~~~~~~sg--l~~~~~~~~~~~aaas~IP 323 (456) .... ...+..+++......... +.+.++++.+-+|+.++.+... +-+......++||++.||| T Consensus 205 ~~~~------------~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp 272 (397) T protein:vir:38 205 IQKG------------GLLDAETRIARSKEISKQIHNSDGPVVIDALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVP 272 (397) T ss_pred eCCC------------CCHHHHHHHHHHHHHHhcccccCCceecCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCC Confidence 1111 111222233233333222 2345667777889988876554 3466777889999999999 Q ss_pred eEEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHH Q lcl|NC_016762. 324 TKILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSE 403 (456) Q Consensus 324 ~t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~ 403 (456) ...|-|...+. ++.+..+.||. .-|.|.++.+-+.|-+. +.+ ..++ .+.-+...+.+++ ++ T Consensus 273 ~~~lg~~~~~~-~~~e~~~~~~~-------~~l~P~~~~ie~~ln~~-l~~-~~~~--~~~~~~~~d~~~~-------~~ 333 (397) T protein:vir:38 273 DSYLNGQGDQQ-SSITQISGQYA-------KSLNRYVQAIVGELNDK-LHA-NISA--NIRFAIDAMGDQY-------AS 333 (397) T ss_pred HHHhCCCCCcc-cHHHHHHHHHH-------HHHHHHHHHHHHHHHHh-ccC-hhcc--cccccccCCHHHH-------HH Confidence 98765543222 22344556663 24778777776555332 222 1233 3333455555444 55 Q ss_pred HHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCccc-C------------CCCCCCCCcCCCCCCC Q lcl|NC_016762. 404 INSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDT-E------------PEDEDAARTDPTGEQQ 456 (456) Q Consensus 404 a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~-~------------~~d~~~~~~d~~~~~e 456 (456) +.+.+++.| +++++|+|+..+++|..+++..... . .+++.....++++++| T Consensus 334 ~~~~~~~~G--~~t~nE~R~~lg~~p~~~~d~~~~~~~~~~~~~~~~~~~g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 334 TISSSVKGG--TIAGNQARFILQNSGYLAKDLPDPEKEPQQAIQLIQQEGGENDGNNSDERGSDPE 397 (397) T ss_pred HHHHHHhCC--CcCHHHHHHHhCCCCCCCCccccccccccccccccccccCCCCCCCCCCCCCCCC Confidence 566678878 9999999999999887654422100 0 0111222223444444 No 54 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=99.61 E-value=1.9e-15 Score=101.33 Aligned_cols=390 Identities=13% Similarity=0.090 Sum_probs=189.8 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhh---hhccCc---ccchhhhhccCc-----ccCCHHHHHHHHhcCchhhhhhcc Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQ---GIGHDA---KRPQAWCEYGFP-----QEITFNDLYTMYRRGGIAHGAVEK 69 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~---~~~~gt---~~~~~~~~~~~~-----~~~~~~~l~~~Y~~~~l~r~iVd~ 69 (456) |.+.--.-.=..++ ..|... .++.+. ..+..+..++.. ..++.+ .+.++..+.++|++ T Consensus 1 ~~~~~~~g~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~----~a~~~~aV~~~v~~ 69 (432) T protein:vir:97 1 MPDEKKLGLLGQLK-------AMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNAD----AIMRLDAVAACVKL 69 (432) T ss_pred CCCcccCchhhhhH-------hhcCCccccccccccccccCchhhhhhcccccccCcccchH----hhhcchHHHHHHHH Confidence 33221110000000 001000 000000 001111111111 122322 35578899999999 Q ss_pred chhHHhhCCCEEecCCCcchhh-hhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcC Q lcl|NC_016762. 70 IVTTCWKTNPQVIEGDDQDRSK-DETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLN 147 (456) Q Consensus 70 ~aed~tR~~~~i~~~~~~d~~~-~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~ 147 (456) +++++-+--+.+....++...+ ....+-..|...=....-+..|.+.+-+ -.++|.+++++.-.+|+ T Consensus 70 Ia~~ia~lp~~~y~~~~~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~g~----------- 138 (432) T protein:vir:97 70 VSQAVAAMPLMMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTDGR----------- 138 (432) T ss_pred HHHhhccCceEEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecCCc----------- Confidence 9999977666664332211111 1111111121111112233345554443 45667776665433322 Q ss_pred ceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc---CCCcchHHHHHHHHH Q lcl|NC_016762. 148 GLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT---GDAIGFLEPAYNSFI 224 (456) Q Consensus 148 ~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~---~~G~S~le~~~~~l~ 224 (456) +..+.|+-...+++. .| ..|.+ .|++.. .+| ..+.++++.|+|+...+ ..|.|.++.+.+.+. T Consensus 139 -~~~L~~l~p~~v~v~---~~----~~g~~-~y~~~~--~~g---~~~~~~~~~iih~r~~~~dg~~G~spi~~~~~~i~ 204 (432) T protein:vir:97 139 -IESLQYLANDRLTIT---TD----TKGNT-AYRYRR--TDG---QMIDIPRQQIWKIMGYSLDGENGLSAIRYGAQIFG 204 (432) T ss_pred -EEEEEEEcCcceEEE---Ec----CCCcE-EEEEEe--cCc---eEEEEccccEEEecCcCCCCcccccHHHHHHHHHH Confidence 223333332223221 12 12333 455542 122 34678888898885432 358899999887664 Q ss_pred HHHHHHHHHHHHHHHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEecccC Q lcl|NC_016762. 225 SLEKVEGGSGESFLKNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSAVS 303 (456) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~~s 303 (456) ....+ ...+..+|++..+.-. ++.... -.++..+++.+.+.......+.++++.+-+|+.++.+.. T Consensus 205 ~~~a~-~~~~~~~f~ng~~~~gil~~~~~------------l~~e~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~ 271 (432) T protein:vir:97 205 TAIAA-EAQAARAFRNGQLQSVYYQIDRF------------LTDDQYDSFSKKVSGSVEAGRAPLLEGGMDVKSLGLNPV 271 (432) T ss_pred HHHHH-HHHHHHHHhccCCcceeEecCCC------------CCHHHHHHHHHHHhhhhcCCCceecCCCceEEEccCChh Confidence 44333 3444455565433221 111111 113344555555544444456677777889999988766 Q ss_pred CH--HHHHHHHHHHHHhhhcCCeEEeeccCCCcccc----hHH-HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCC Q lcl|NC_016762. 304 DP--GPTYNVNLQTAAAGVDIPTKILVGMQTGERAS----SED-QKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLK 376 (456) Q Consensus 304 gl--~~~~~~~~~~~aaas~IP~t~L~G~sp~Glns----t~D-~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~ 376 (456) +. -+......++||.+.+||-..| |....|=.+ .++ ...||.. -|.|.++.+-..|-+.-+.+.. T Consensus 272 d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~~s~~e~~~~~f~~~-------tl~P~~~~ie~~ln~kLl~~~e 343 (432) T protein:vir:97 272 DAQLLQSRQYSVESICRFFGVPPSMI-GHSSAGTTSWGSGIESQQLGFLTM-------TLSPWLRRIEQSIALNLLTPAE 343 (432) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHc-CCcCCcccccchhHHHHHHHHHHH-------HHHHHHHHHHHHHhhhccCccc Confidence 54 3556677889999999998654 654333211 122 2345433 3567766665555443333211 Q ss_pred -C--ceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc----ccCC---CCC-- Q lcl|NC_016762. 377 -A--EFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP----DTEP---EDE-- 444 (456) Q Consensus 377 -~--d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~----~~~~---~d~-- 444 (456) . .+.|.++.|...|.+++++ +..+++..| ++|+||+|+..+++|+.+++..- .-.+ ... T Consensus 344 ~~~~~~~fd~~~llr~d~~~r~~-------~~~~~~~~G--~~T~NE~R~~~glpp~~g~~~~~~~~~~~~pl~~~~~~~ 414 (432) T protein:vir:97 344 RRRYFADFDTSALLRADSAARSS-------YYSQLVNNG--LMTRDEAREIEGLPKLGGNAAVLTVQSAMVPLDSIGLQA 414 (432) T ss_pred cCceEEEeechhhhccCHHHHHH-------HHHHHHhCC--CCCHHHHHHHhCCCCCCCCcceEeecccccchhhhcccC Confidence 1 3566667888888888755 445678778 99999999999999887543210 0001 111 Q ss_pred --CCCCcCCCCCCC Q lcl|NC_016762. 445 --DAARTDPTGEQQ 456 (456) Q Consensus 445 --~~~~~d~~~~~e 456 (456) ++.++++..++. T Consensus 415 ~~~~~~~~~~~~~~ 428 (432) T protein:vir:97 415 SPEPASGLGNQQQD 428 (432) T ss_pred CCCCCCCCCCcccc Confidence 111222222222 No 55 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=99.60 E-value=3.7e-15 Score=99.71 Aligned_cols=384 Identities=13% Similarity=0.071 Sum_probs=187.6 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccc---hhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhC Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRP---QAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKT 77 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~---~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~ 77 (456) |. +++.++. ...++.+-..+-...+- ..|..-.|. .++. ..|.++..+.++|+++|+++-.- T Consensus 1 ~~------~~~~~~~----~k~~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~~----~~a~~~~~V~~ci~~ia~~ia~l 65 (409) T protein:vir:96 1 MA------KENIVTR----IKKKLIDNWIDQSASKLYDFSPWKNKSFW-GVIN----NTLETNETIFSAITKLSNSMASL 65 (409) T ss_pred Cc------cccchhh----hhhHHhhhhhccccccccccccccCcccc-ccch----hhHhhhHHHHHHHHHHHHhhhhC Confidence 21 1111111 11122211111111111 111110111 1122 23567888899999999999776 Q ss_pred CCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEec Q lcl|NC_016762. 78 NPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAW 156 (456) Q Consensus 78 ~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~ 156 (456) -+.+....+..+ ..+...+...=....-+..|.+.+.+ -.++|-+++++. ++.. +.+..+.|+. T Consensus 66 p~~~~~~~~~~~----~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~-r~~~----------G~~~~L~~l~ 130 (409) T protein:vir:96 66 PLKMYEDYKVVN----TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIE-RDIY----------HQPSKLFLLN 130 (409) T ss_pred ceEEeecccccc----hhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEE-ECCC----------CcEEEEEEEc Confidence 666654332211 11111122111112233445454444 456677766653 2211 1123444444 Q ss_pred cccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 157 AGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISLEKVEGG 232 (456) Q Consensus 157 ~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~~~~~~~ 232 (456) ...+++.. | +.+.+.+|.+... .+..+.+.++.||||... ...|.|.++.+.+.+.....+ .. T Consensus 131 ~~~v~v~~---~----~~~~~~~y~~~~~-----~g~~~~~~~~evih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~-~~ 197 (409) T protein:vir:96 131 PDVVEMLI---E----NQSRELYYSIHAA-----TGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAV-RT 197 (409) T ss_pred CceeEEEE---e----CCCcEEEEEEEcC-----CceEEEEccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHH-HH Confidence 33333211 1 1233556766521 122456778888888532 234889888876654322222 11 Q ss_pred HHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC-CeEEecCCCceeEEecccCC--HHHHH Q lcl|NC_016762. 233 SGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGN-DVLLPTQGATVTQMVSAVSD--PGPTY 309 (456) Q Consensus 233 ~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~lid~~d~~~~~~~~~sg--l~~~~ 309 (456) . .+.+..+...+.. ...+.-.++..+++.+.+....++. +.++++.+-+|+.++.+... +-+.. T Consensus 198 ~---~~~~~~~~~~~i~----------~~~~~l~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~ 264 (409) T protein:vir:96 198 F---NLTEMQKPDSFML----------KYGSNVSTEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASE 264 (409) T ss_pred H---HHHhcCCCceeEE----------ecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHH Confidence 1 1221111111000 0001111233344444444433444 45566667889998876653 34445 Q ss_pred HHHHHHHHhhhcCCeEEeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC--CCCceEEE--eC Q lcl|NC_016762. 310 NVNLQTAAAGVDIPTKILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP--LKAEFTAI--WD 384 (456) Q Consensus 310 ~~~~~~~aaas~IP~t~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~--~~~d~~~~--f~ 384 (456) ....++||.+.+||-..|-+...+..+..+ -.+.||..+ |.|.++.+-+.|-+.-+.+ ......|+ .. T Consensus 265 ~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~~~f~~~~-------l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~ 337 (409) T protein:vir:96 265 NLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHT-------LLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVK 337 (409) T ss_pred HHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHH-------HHHHHHHHHHHHHhhcCCcccccCcceEEeech Confidence 556788999999998866444444443333 355677654 7888888877665543332 12234444 45 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc--------ccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 385 DLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP--------DTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 385 pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~--------~~~~~d~~~~~~d~~~~~e 456 (456) .|...|.+++++ +..++++.| ++|++|+|+..+++|+++++..- +.........++.+...+| T Consensus 338 ~ll~~d~~~~~e-------~~~~~~~~G--~~T~NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~~~~~~~gG~~n~~e 408 (409) T protein:vir:96 338 SYLRADSATQAE-------VYFKAVRSG--YYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPLELRKSLKGGDKNVNE 408 (409) T ss_pred hhhccCHHHHHH-------HHHHHHhCC--CCCHHHHHHHhCCCCCCCcceeeecccccccccchhhcccccCCCCCcCC Confidence 777777776654 446677777 99999999999999987654421 1111111122233333333 No 56 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=99.60 E-value=1e-14 Score=97.27 Aligned_cols=382 Identities=12% Similarity=0.012 Sum_probs=191.6 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccch------hhhhccC---cccCCHHHHHHHHhcCchhhhhhccch Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQ------AWCEYGF---PQEITFNDLYTMYRRGGIAHGAVEKIV 71 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~------~~~~~~~---~~~~~~~~l~~~Y~~~~l~r~iVd~~a 71 (456) -.++.+- -|.+++-+.+...+. .+...+. ...++. ..+.++..+.++|+++| T Consensus 11 ~~~~~~~---------------~~~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~----~~al~~~~v~~cv~~Ia 71 (424) T protein:vir:45 11 WPEGGRV---------------LLDALFRSKSLENPSTPITGDAVDTDGLFRADVYVSP----ETAMKLAAVYSCIYVLS 71 (424) T ss_pred cCcchhH---------------HHHhhccccCCCCCccccchhhhhhhccccCCceech----HHhhccHHHHHHHHHHH Confidence 2222222 222222222211111 0011111 112222 22456788899999999 Q ss_pred hHHhhCCCEEecCCCcchhhh-hHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCce Q lcl|NC_016762. 72 TTCWKTNPQVIEGDDQDRSKD-ETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGL 149 (456) Q Consensus 72 ed~tR~~~~i~~~~~~d~~~~-~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l 149 (456) ++.-.--+++....+....+. ...+-..+...=....-+..|.+++.. -.++|-+++++. +|.. +.+ T Consensus 72 ~~iA~lp~~v~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~-r~~~----------G~~ 140 (424) T protein:vir:45 72 SSLAQMPLHVMRRHKGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVK-RNRR----------GEV 140 (424) T ss_pred HHHhhCceEEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEE-EcCC----------CcE Confidence 998776666653322111111 111111121111112233345555444 455676766553 2221 112 Q ss_pred eEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc---CCCcchHHHHHHHHHHH Q lcl|NC_016762. 150 AKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT---GDAIGFLEPAYNSFISL 226 (456) Q Consensus 150 ~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~---~~G~S~le~~~~~l~~~ 226 (456) ..+.|+....+++.. +-|. -.|.+... .....++++.||||.... ..|.|.++.+.+.+-.. T Consensus 141 ~~L~~l~~~~v~i~~--------~~~~-~~y~~~~~------~~~~~~~~~eVih~r~~~~d~~~G~spi~~~~~~i~~~ 205 (424) T protein:vir:45 141 ISLDCCMPWETTLMN--------TGGR-YTYGLYNE------YGAFAISPDDMIHIRALGNNQKMGLSPIMQHAETIGMG 205 (424) T ss_pred EEEEEecCceEEEEE--------cCCe-EEEEEEec------CceEEECcccEEEecCcCCCCcccccHHHHHHHHHHHH Confidence 344444433333211 1122 24555421 124568889999885433 34899999988876544 Q ss_pred HHHHHHHHHHHHHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHH----hcCC-CeEEecCCCceeEEec Q lcl|NC_016762. 227 EKVEGGSGESFLKNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQL----NRGN-DVLLPTQGATVTQMVS 300 (456) Q Consensus 227 ~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~----~~~~-~~~lid~~d~~~~~~~ 300 (456) ..+ ...+..++++..+.-. ++.... + . ++..+++.+.++.. .++. ..++++.+-+|+.++. T Consensus 206 ~~~-~~~~~~~f~ng~~p~gil~~~~~-----l----~---~e~~~~~~~~~~~~~~g~~~n~g~~~vl~~g~~~~~l~~ 272 (424) T protein:vir:45 206 MSG-QKYTESFFSGNARPAGIVSVKSG-----L----N---KESWGWLKDQWQKASQALRRQENKTMLLPADLDYKALTV 272 (424) T ss_pred HHH-HHHHHHHHhccCCccEEEEeCCC-----C----C---HHHHHHHHHHHHHHhccccccCCceeEcCCCceEEEccC Confidence 443 3444456666544322 221111 1 1 22233333333322 2233 3566777778998887 Q ss_pred ccCCH--HHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC--C Q lcl|NC_016762. 301 AVSDP--GPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP--L 375 (456) Q Consensus 301 ~~sgl--~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~--~ 375 (456) +..+. -+......++||.+.|||-..|-+...+..++.+ -.+.||.. -|.|.++.+-+.|-+.-+-+ . T Consensus 273 ~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~-------tL~P~~~~ie~~ln~kLl~~~e~ 345 (424) T protein:vir:45 273 SPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISAQAIQFVRY-------TMMPWVTNWEQELNRRLFTRAEL 345 (424) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhcCChhhh Confidence 66543 4566677889999999999766443333333322 23445443 46777777665554332221 1 Q ss_pred CCc--eEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcc-----cCCCCCCCCC Q lcl|NC_016762. 376 KAE--FTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPD-----TEPEDEDAAR 448 (456) Q Consensus 376 ~~d--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~-----~~~~d~~~~~ 448 (456) ..+ |.|..+.|...|.+++++. ..++++.| ++++||+|+..+++|+++++..-- ....+..+++ T Consensus 346 ~~g~~i~fd~~~llr~d~~~r~~~-------~~~~~~~g--~~T~NE~R~~~gl~pi~ggD~~~~~~n~~~~~~~~~~~~ 416 (424) T protein:vir:45 346 AAGYYVRFNLTGLLRGTPQERAQF-------YHFAITDG--WMSRNEARAFEDMNPVEGLDEMLVSVNAANPAGDFKPPK 416 (424) T ss_pred cCCcEEEeechhhhccCHHHHHHH-------HHHHHhCC--CcCHHHHHHHhCCCCCCCcceeeecccccccccccCCCC Confidence 123 5566668877787776554 45567777 999999999999999877654211 1122333444 Q ss_pred cCCCCCCC Q lcl|NC_016762. 449 TDPTGEQQ 456 (456) Q Consensus 449 ~d~~~~~e 456 (456) .+++.++| T Consensus 417 ~~~~~~~~ 424 (424) T protein:vir:45 417 NDEGKTNE 424 (424) T ss_pred CCCCCCCC Confidence 44444444 No 57 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=99.58 E-value=6.1e-15 Score=98.53 Aligned_cols=391 Identities=13% Similarity=0.060 Sum_probs=184.7 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCE Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQ 80 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~ 80 (456) |.=+-+...+.- . ..-..+++.+...|.+... ....+|.+. +..+.-+.+||+++++++-.--+. T Consensus 1 ~~~~r~~~~~~~-~--~~~~~~~~~~~~~g~~~s~--------~~~~vt~~~----al~~~~v~~~v~~ia~~iA~lp~~ 65 (419) T protein:vir:14 1 MFFSRQLLSNLG-Q--TQMSAGGWVSALLGSSRSD--------SGQVVTPAS----ALALTVLQNCVTLLAESIAQLPIE 65 (419) T ss_pred Cccccccccccc-c--cccCcchhhHHhhcCCCcc--------CCcccchHH----hhccHHHHHHHHHHHHhhccCceE Confidence 221111110000 0 0000111222122211110 012233322 235778899999999998766666 Q ss_pred EecCCCcchhh-hhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccc Q lcl|NC_016762. 81 VIEGDDQDRSK-DETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAG 158 (456) Q Consensus 81 i~~~~~~d~~~-~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~ 158 (456) +.....+...+ ....+-+.+...=....-+..|.+.+.+ -.++|-+++++. +++. +.+..+.|+-.. T Consensus 66 ~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~-r~~~----------G~~~~l~pl~~~ 134 (419) T protein:vir:14 66 LYERSGEDRKPATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFID-RDSD----------GVIQGLYPLDNE 134 (419) T ss_pred EEEecCCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEE-ECCC----------CcEEEEEEecCc Confidence 64433222111 1111111111110112233345555444 455676766653 2321 112334444333 Q ss_pred cCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 159 CLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWTGDAIGFLEPAYNSFISLEKVEGGSGESFL 238 (456) Q Consensus 159 ~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~ 238 (456) .+++.. +.| |.+ +|++... .+ .....|||-|...+ ....|.|.++.+.+.+.....+ ......++ T Consensus 135 ~v~v~~-~~~------~~~-~y~~~~~--~~--~~~~~i~h~~~~~~--dg~~G~s~i~~~~~~i~~~~~~-~~~~~~~f 199 (419) T protein:vir:14 135 AVTVMR-GSD------LKP-VYRVRGS--DP--MPQRLVHHVRWMSI--NGYTGLSPVLLHANAIGHAQAI-QQYAGKSF 199 (419) T ss_pred eEEEEE-CCC------ceE-EEEEccC--cc--cchhheeEecCcCC--CCcccccHHHHHHHHHHHHHHH-HHHHHHHH Confidence 333211 111 222 3555421 11 11123443333222 2356999999998876554444 34444555 Q ss_pred HHhhhh-hhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEEecccCC--HHHHHHH Q lcl|NC_016762. 239 KNAARQ-LLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQMVSAVSD--PGPTYNV 311 (456) Q Consensus 239 ~~~~~~-l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~~~~~sg--l~~~~~~ 311 (456) ++..+. ..++..... .+...++..+++.+.++...++ ...++++.+-+|++++.+..+ +-+.... T Consensus 200 ~ng~~p~gil~~~~~~--------~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~ 271 (419) T protein:vir:14 200 MNGTALSGVIERPKDA--------PALKDQASVDRITDGWNAKFGGSGNAKKVALLQEGMTFRPLSMTNVDAALIDALRL 271 (419) T ss_pred hccCCccEEEEecCCC--------CcccCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhhHHHHHHHHH Confidence 654331 112211110 0111234445555555444433 235667777789988876644 3455566 Q ss_pred HHHHHHhhhcCCeEEeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC-CCc--eEEEeCCCC Q lcl|NC_016762. 312 NLQTAAAGVDIPTKILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL-KAE--FTAIWDDLT 387 (456) Q Consensus 312 ~~~~~aaas~IP~t~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~-~~d--~~~~f~pL~ 387 (456) ..+.||.+.|||-.+|-....+..++.+ -.+.||..+ |.|.+.++-+.|-+.-+-+. ... +.|.++.|. T Consensus 272 ~~~~Ia~~fgVpp~~lg~~~~~t~s~~E~~~~~f~~~~-------L~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~ 344 (419) T protein:vir:14 272 SALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYT-------LLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLL 344 (419) T ss_pred HHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHH-------HHHHHHHHHHHHhhhccCccccCCeEEEEechhhh Confidence 7889999999998866443333333333 345666654 78888777666654323221 123 444555777 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc------c-cCCCCCCCCCcC--CCCCCC Q lcl|NC_016762. 388 VPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP------D-TEPEDEDAARTD--PTGEQQ 456 (456) Q Consensus 388 ~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~------~-~~~~d~~~~~~d--~~~~~e 456 (456) ..+.+++++ +..++++.| ++++||+|+..+++|+++++..- . ..+...+..+++ +.+..| T Consensus 345 r~d~~~~~~-------~~~~~~~~G--~~T~NE~R~~~gl~p~~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~e 413 (419) T protein:vir:14 345 RGDQSSRYA-------AYAVGRQWG--WLSINDIRRLENMPPVKGGDIYLSPMNMVDASKPQQLPVGKSEPTKAAIDE 413 (419) T ss_pred ccCHHHHHH-------HHHHHHhCC--CcCHHHHHHHhCCCCCCCcCeeeeccccccccccccccCCCCCCccccccc Confidence 777777654 445577777 99999999999999987765321 0 001111111112 222222 No 58 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=99.58 E-value=5.3e-15 Score=98.86 Aligned_cols=391 Identities=13% Similarity=0.090 Sum_probs=190.9 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhh---hhccCcc---cchhhhhccCc-----ccCCHHHHHHHHhcCchhhhhhcc Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQ---GIGHDAK---RPQAWCEYGFP-----QEITFNDLYTMYRRGGIAHGAVEK 69 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~---~~~~gt~---~~~~~~~~~~~-----~~~~~~~l~~~Y~~~~l~r~iVd~ 69 (456) |.+.--.-.- .+.+..|... .++.+.. .+..+..++.. ..++. ..+.++..+.++|++ T Consensus 1 ~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~----~~al~~~~V~~~i~~ 69 (432) T protein:vir:10 1 MPDEKKLGLL-------GQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNA----DAIMRLDAVAACVKL 69 (432) T ss_pred CCCCcccchh-------hhhHhhcCCccccccccccccccCcchhhhhcccccccCcccch----hhhhcchHHHHHHHH Confidence 4332211100 0001111110 0000000 01111111111 12222 235578999999999 Q ss_pred chhHHhhCCCEEecCCCcchhh-hhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcC Q lcl|NC_016762. 70 IVTTCWKTNPQVIEGDDQDRSK-DETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLN 147 (456) Q Consensus 70 ~aed~tR~~~~i~~~~~~d~~~-~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~ 147 (456) ++++.-+--+.+...+.+...+ ....+-..|...=....-+..|.+.+-. -.++|.|++++.-.+|+ T Consensus 70 Ia~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~g~----------- 138 (432) T protein:vir:10 70 VSQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTDGR----------- 138 (432) T ss_pred HHHhhhhCceeEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecCCc----------- Confidence 9999977666664332221111 1111111121111112223344444443 46678777665433322 Q ss_pred ceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc---CCCcchHHHHHHHHH Q lcl|NC_016762. 148 GLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT---GDAIGFLEPAYNSFI 224 (456) Q Consensus 148 ~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~---~~G~S~le~~~~~l~ 224 (456) +..+.|+-...+++. .|. .|. ..|++.. .+| ..+.++++.|+|+...+ ..|.|.++.+.+.+. T Consensus 139 -~~~L~~l~~~~v~v~---~~~----~g~-~~y~~~~--~~g---~~~~~~~~~iih~~~~~~dg~~G~spi~~~~~~i~ 204 (432) T protein:vir:10 139 -IESLQYLANDRLTIT---TDT----KGN-TAYRYRR--TDG---QMIDIPKQQIWKIMGYSLDGENGLSAIRYGAQIFG 204 (432) T ss_pred -EEEEEEEcCCceEEE---EcC----CCc-EEEEEEe--cCc---eEEEEcCccEEEecCCCCCCcccccHHHHHHHHHH Confidence 223333333223221 121 233 3455542 122 34678888888875432 348899999987665 Q ss_pred HHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEecccCC Q lcl|NC_016762. 225 SLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSAVSD 304 (456) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~~sg 304 (456) .... ....+..+|++..+.-.+.. ++ +.-.++..+++.+.+.......+.++++.+-+|+.++.+..+ T Consensus 205 ~~~~-~~~~~~~~f~ng~~~~gil~---~~--------~~l~~e~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d 272 (432) T protein:vir:10 205 TAIA-AEAQAARAFRNGQLQSVYYQ---ID--------RFLTDDQYDSFAKKVSGSVEAGRAPLLEGGMDVKSLGLNPVD 272 (432) T ss_pred HHHH-HHHHHHHHHhcCCCcceEEe---cC--------CCCCHHHHHHHHHHHhhhhhCCCceecCCCceEEEccCChHH Confidence 4433 34445556676443322211 11 001123344555444444444566777778899999877654 Q ss_pred H--HHHHHHHHHHHHhhhcCCeEEeeccCCCcccc----hH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCC- Q lcl|NC_016762. 305 P--GPTYNVNLQTAAAGVDIPTKILVGMQTGERAS----SE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLK- 376 (456) Q Consensus 305 l--~~~~~~~~~~~aaas~IP~t~L~G~sp~Glns----t~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~- 376 (456) . -+.......+||.+.|||-.. +|....|=++ .+ -...||.. -|.|.++.+-..|-+.-+.+.. T Consensus 273 ~q~le~~~~~~~~Ia~afgVPp~~-lg~~~~~t~~~~sn~e~~~~~f~~~-------tl~P~~~~ie~~ln~kL~~~~~~ 344 (432) T protein:vir:10 273 AQLLQSRQYSVESICRFFGVPPSM-IGHSSAGTTSWGSGIESQQLGFLSM-------TLSPWLRRIEQSIALNLLSPAER 344 (432) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHH-cCCccCCcccccchHHHHHHHHHHH-------HHHHHHHHHHHHHHhhhcCcccc Confidence 4 355567788999999999965 5654443222 12 23456533 4677777766555443332211 Q ss_pred Cc--eEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc----ccCC---CCC--- Q lcl|NC_016762. 377 AE--FTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP----DTEP---EDE--- 444 (456) Q Consensus 377 ~d--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~----~~~~---~d~--- 444 (456) .. +.|..+.|...+.+++++ +..++++.| +++++|+|+..+++|+.+++..- .-.+ ... T Consensus 345 ~~~~~~fd~~~ll~~d~~~r~~-------~~~~~~~~G--~~T~NE~R~~~glppi~g~~~~~~~~~~~~pl~~~~~~~~ 415 (432) T protein:vir:10 345 RRYFADFDTSALLRADSAARSS-------YYSQLVNNG--LMTRDEAREIEGLPKLGGNAAVLTVQSAMVPLDSIGLQAS 415 (432) T ss_pred CceEEEeechhhhccCHHHHHH-------HHHHHHhCC--CCCHHHHHHHhCCCCCCCCcceEeecCcccchhhhcccCC Confidence 23 455556888888888755 445667777 99999999999999987543210 0001 011 Q ss_pred -CCCCcCCCCCCC Q lcl|NC_016762. 445 -DAARTDPTGEQQ 456 (456) Q Consensus 445 -~~~~~d~~~~~e 456 (456) ++.++++..++. T Consensus 416 ~~~~~~~~~~~~~ 428 (432) T protein:vir:10 416 PEPASGLGNQQQD 428 (432) T ss_pred CCCCCCCCCcccc Confidence 111111111111 No 59 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=99.58 E-value=3.1e-15 Score=100.16 Aligned_cols=404 Identities=14% Similarity=0.113 Sum_probs=185.0 Q ss_pred CCchhHHHHhHHHHHHHH----------HHHHHHhhhhhccCcccchhhhhccCcccCC-HHHHHHHHhcCchhhhhhcc Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIA----------RARMSLLNQGIGHDAKRPQAWCEYGFPQEIT-FNDLYTMYRRGGIAHGAVEK 69 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~----------~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~-~~~l~~~Y~~~~l~r~iVd~ 69 (456) ...+ |++++.+.. +..+.|.. .+.+.. ......-|...+ -.-+.+....+..+..+|++ T Consensus 64 ~~~~-----~~~~kk~~i~~pfkkk~~~~~~d~f~~----s~es~s-~vtsls~pdaf~~vnVs~~~AlknsaV~scI~~ 133 (945) T protein:vir:10 64 IFRK-----NQVLKKEKIIVPYNHQEPPFKFNLFEY----SPESLM-YLPSISDPDAFFLINLFRKYRFNNDSKLIKVSE 133 (945) T ss_pred eehh-----hhHHHhhcccccccccccchhhhhhhc----cCccce-ecccccCccceeeehhhhhhhhccHHHHHHHHH Confidence 1111 222222100 01112210 010000 000000111100 01233444567889999999 Q ss_pred chhHHhhCCCEEecCCCcchh----h---hhHHHHHHH---HHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCc Q lcl|NC_016762. 70 IVTTCWKTNPQVIEGDDQDRS----K---DETEWERKN---KPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPW 138 (456) Q Consensus 70 ~aed~tR~~~~i~~~~~~d~~----~---~~~~~e~~i---~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~ 138 (456) ++++.-.--+.+....++-.. + ....+.+-+ +..+....+|+.|.+.+-. -.++|-+++++.- +. T Consensus 134 IA~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiR-d~--- 209 (945) T protein:vir:10 134 IPKKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIR-DE--- 209 (945) T ss_pred HHhhhccCceEEEEecccCcccccccccccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEE-CC--- Confidence 999987666665322211100 0 001111111 2233344567767766544 5566777666532 21 Q ss_pred cccccCCcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhhe------ecCC-cCC Q lcl|NC_016762. 139 DRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFI------LGDW-TGD 211 (456) Q Consensus 139 ~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~------~~~~-~~~ 211 (456) .+.+..+.|+....+++. .|+ +-+.+..|... .+|.. ...++++.+|+ ..+. .+- T Consensus 210 -------~G~ii~L~pLdPs~Vti~---~dd---DG~~~y~Yv~~---idG~~--~~~v~a~DvIlhirn~s~DG~~~Gy 271 (945) T protein:vir:10 210 -------QGNLVAITPVDGTTIKPI---LSE---DTGIVVGYVQE---VDGAI--VAHFDKRDVVLFRQNLTPDVYMYGY 271 (945) T ss_pred -------CCcEEEEEEECCcceEEE---EcC---CCcEEEEEEEe---cCCce--EEEecCCceEEEeccCCCCcccccC Confidence 122345555555444432 121 22222233322 12221 12334444332 2222 234 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC--C-eE Q lcl|NC_016762. 212 AIGFLEPAYNSFISLEKVEGGSGESFLKNAARQ-LLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGN--D-VL 287 (456) Q Consensus 212 G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~-l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~--~-~~ 287 (456) |.|-++.+.+.+.....+....+..+.++..+. ..+....... ......+.-..+..+++.+.++...++. + .+ T Consensus 272 GlSPIeaa~~aI~~alAaek~aar~FskNGa~PsGILsvkg~~~--~d~k~~~~LseEq~erlKe~wee~~sG~NnG~pi 349 (945) T protein:vir:10 272 SLPPIEILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSY--KEGDIYPQLSREQLESIQRQLQAIMMGDYTQVPI 349 (945) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHhCCCccceEEEecCccc--cccccccccCHHHHHHHHHHHHHHhCCcccccce Confidence 889999998877655444433333333333221 1111111000 0000011112333444544455544432 2 34 Q ss_pred EecCCCceeEEecccCCH--HHHHHHHHHHHHhhhcCCeEEeeccCCC-cccchH-HHHHHHHHHHHHHHhhhhHHHHHH Q lcl|NC_016762. 288 LPTQGATVTQMVSAVSDP--GPTYNVNLQTAAAGVDIPTKILVGMQTG-ERASSE-DQKYHNARCQARRVQELTFEINDL 363 (456) Q Consensus 288 lid~~d~~~~~~~~~sgl--~~~~~~~~~~~aaas~IP~t~L~G~sp~-Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l 363 (456) +++.+.+|++++.+..+. -+.......+||++.|||...| |...+ ..+..+ -..+||.. -|.|.+..+ T Consensus 350 VLdeGmef~pLs~s~~DaQfLEsrkfs~eeIArAFGVPP~lL-G~~e~st~SNiEqq~~~Fv~~-------tL~Pil~~I 421 (945) T protein:vir:10 350 LSGGKFTWIDFKGKRRDMQFKELAEFVARKICAVYQVSPQDV-GILEGSNKATAEVMASLTKAK-------GLEPLMATI 421 (945) T ss_pred ecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-ccCCCCCcchHHHHHHHHHHH-------HHHHHHHHH Confidence 566777899888776544 4566667789999999998766 54332 222222 35566643 244555554 Q ss_pred HHHHHHhcCcC--CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc---- Q lcl|NC_016762. 364 FAHLMRIGVVP--LKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP---- 437 (456) Q Consensus 364 ~~~l~~s~~~~--~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~---- 437 (456) -+.|-+. +.+ ...++.|+|+.+.-++.+++ +++...+++.| ++++||+|+..+++|+++++..- T Consensus 422 EqeLNrk-Ll~~~eg~~i~fdFd~ldl~D~ksr-------aEal~kli~sG--iLTiNEvRe~lGLpPIeGGD~lli~~n 491 (945) T protein:vir:10 422 SKGFDEV-VSEFRNEKDIKLWFKEDDLEKERDW-------WNIIQGQLNTG--FRSINEARMEKGLEPVPWGDVPFSGLR 491 (945) T ss_pred HHHHHHh-ccccccCceeEEEecchhccCHHHH-------HHHHHHHHhCC--CcCHHHHHHHhCCCCCCCcceeeeccc Confidence 4433221 211 13468999999988876554 45556677778 99999999999999987654421 Q ss_pred ccCCCCC--------C--------CCCcC--CCCCCC Q lcl|NC_016762. 438 DTEPEDE--------D--------AARTD--PTGEQQ 456 (456) Q Consensus 438 ~~~~~d~--------~--------~~~~d--~~~~~e 456 (456) ...+.++ . .+++. +++.+| T Consensus 492 n~~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGGe~dE 528 (945) T protein:vir:10 492 NWKPEDEQAKAQQGAMPPQLAQAMADQPSQQGGGVDE 528 (945) T ss_pred cccccccccccccCCCCcccccCCCCCCCCCCCCCCC Confidence 0001000 0 00000 001111 No 60 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=99.57 E-value=1.4e-14 Score=96.51 Aligned_cols=390 Identities=13% Similarity=0.070 Sum_probs=195.4 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCc-----ccCCHHHHHHHHhcCchhhhhhccchhHHh Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFP-----QEITFNDLYTMYRRGGIAHGAVEKIVTTCW 75 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~-----~~~~~~~l~~~Y~~~~l~r~iVd~~aed~t 75 (456) |-.....+++.... +. + ++. +....+.....|..++-. ..++.+ -+.++..+.+||++++++.- T Consensus 1 ~~~~~~~~~~~~~~-~~---~-~~~--g~~~s~~~~~~~~~~~~~~~~~g~~v~~~----~al~~~~v~~ci~~Ia~~ia 69 (437) T protein:vir:10 1 MKQGKQRALGRIKS-SF---L-KWL--GVPISLTDGSFWSAWGGMGSSSGETVTAD----SALQLSAVWSCVRLIAETIA 69 (437) T ss_pred CCcchhhhhhhhHH-hh---h-hhc--CCcccCCchhHHHhhcccccCCCceechH----hhhccHHHHHHHHHHHHHHh Confidence 66555555552211 11 1 111 111222222333333211 223322 24578889999999999987 Q ss_pred hCCCEEecCCCcchhh--hhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEE Q lcl|NC_016762. 76 KTNPQVIEGDDQDRSK--DETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKV 152 (456) Q Consensus 76 R~~~~i~~~~~~d~~~--~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i 152 (456) +--+.+...+++.... ....+...+...=..+.-+..|.+.+-+ -.++|.+++++.-++|+ ++.+ T Consensus 70 ~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~g~------------~~~L 137 (437) T protein:vir:10 70 TLPLNLYQTKPDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSAGV------------LIGL 137 (437) T ss_pred hCceeEEEEcCCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecCCc------------EEEE Confidence 7655654332211100 1111111111111111233345555554 45677777765432221 2333 Q ss_pred EEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC---cCCCcchHHHHHHHHHHHHHH Q lcl|NC_016762. 153 TPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW---TGDAIGFLEPAYNSFISLEKV 229 (456) Q Consensus 153 ~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~---~~~G~S~le~~~~~l~~~~~~ 229 (456) .|+-...+++.. +. -|.+ +|++.. .+| ....+.++.||||... ...|.|.++.+.+.+.... + T Consensus 138 ~~l~p~~v~i~~---~~----~g~~-~y~~~~--~~g---~~~~~~~~dIih~r~~~~d~~~G~spi~~~~~~i~~~~-~ 203 (437) T protein:vir:10 138 ELMLPQRTTVKR---LT----SGAL-QYTYRN--VDG---TVSTLAEDDVFHVRGFSLDGLMGLTPIQYAREVLGNST-A 203 (437) T ss_pred EEEcCcceEEEE---CC----CCeE-EEEEEe--cCc---eEEEEccccEEEecCcCCCCcccccHHHHHHHHHHHHH-H Confidence 343333233211 11 1222 344431 222 2356778888887432 2469999999987765443 3 Q ss_pred HHHHHHHHHHHhhhhh-hhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEEecccCC Q lcl|NC_016762. 230 EGGSGESFLKNAARQL-LLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQMVSAVSD 304 (456) Q Consensus 230 ~~~~~~~~~~~~~~~l-~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~~~~~sg 304 (456) .......++++..+.- .++.... + . .+..+++.+.+....++ .+.++++.+-+|++++.+..+ T Consensus 204 ~~~~~~~~f~ng~~p~gil~~~~~-----l------~-~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d 271 (437) T protein:vir:10 204 ANKTSASVFRNGLRPSGVLSTDQI-----L------Q-KEKRAEIRTDLAEQFGGAMQAGKTMVLEAGMKYQAITMNPGD 271 (437) T ss_pred HHHHHHHHHhccCCccEEEEcCCC-----C------C-HHHHHHHHHHHHHHhcCccccCcceeccCCceEEeccCChhh Confidence 4455556666654321 1221111 1 1 22333444444433222 245677777889998876544 Q ss_pred --HHHHHHHHHHHHHhhhcCCeEEeeccCCCc-ccch--H-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC--- Q lcl|NC_016762. 305 --PGPTYNVNLQTAAAGVDIPTKILVGMQTGE-RASS--E-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL--- 375 (456) Q Consensus 305 --l~~~~~~~~~~~aaas~IP~t~L~G~sp~G-lnst--~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~--- 375 (456) +-+........||.+.+||-..| |...++ .+++ + -.+.||.. -|.|.+..+-..|-+.-+.+. T Consensus 272 ~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~sn~e~~~~~f~~~-------tl~P~~~~ie~~l~~kll~~~e~~ 343 (437) T protein:vir:10 272 VQLLETRAFNIEEICRWYRVPPFMV-GHSEKSTSWGTGIEQQTLGFLTF-------TLRPWLTRIEQAARRSLLRPGERD 343 (437) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCcccccchHHHHHHHHHHH-------HHHHHHHHHHHHHHhhccCccccC Confidence 35555567789999999998655 754433 2221 3 34556643 468888777666654333221 Q ss_pred CCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCC-------cccCCCCC---- Q lcl|NC_016762. 376 KAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPL-------PDTEPEDE---- 444 (456) Q Consensus 376 ~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~-------~~~~~~d~---- 444 (456) ...|.|.+..|...|.+++++. ...++..| +++++|+|+..+++|+.+++.. ...+..++ T Consensus 344 ~~~~~fd~~~ll~~d~~~r~~~-------~~~~~~~G--~~T~NE~R~~~gl~pi~gg~~~~~~~~~~~~~~~~~~~~~~ 414 (437) T protein:vir:10 344 QFYAEFSVEGLLRADSAGRAAF-------YSTMTQNG--LMTRDECRAKENLPPMGGNAAVLTVQSALLPIDKLGEHTTA 414 (437) T ss_pred ceEEEEechhhhccCHHHHHHH-------HHHHHhCC--CcCHHHHHHHhCCCCCCCCcceEeecCcccchhhccCcCCC Confidence 1135666678888888777654 45567777 9999999999999988654321 00001111 Q ss_pred ----------CCCCcCCCCCCC Q lcl|NC_016762. 445 ----------DAARTDPTGEQQ 456 (456) Q Consensus 445 ----------~~~~~d~~~~~e 456 (456) +..+.+..+++| T Consensus 415 ~~~~~~~~~~~~~~~~~~~~~e 436 (437) T protein:vir:10 415 TAAQDALKAWLYQEEKTRATQE 436 (437) T ss_pred cchhccccccCCCCCCCCcccc Confidence 111122223333 No 61 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=99.57 E-value=1.2e-14 Score=96.92 Aligned_cols=392 Identities=13% Similarity=0.043 Sum_probs=190.9 Q ss_pred CCchhHHHHhHHHHHHHH---HHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhC Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIA---RARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKT 77 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~---~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~ 77 (456) |.+=++......-...-. .+...|...+...+... ..| ..+|... ..++..+.+||++++++.-.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~-----~~g--~~v~~~~----al~~~~V~~~v~~Ia~~iA~l 69 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAW-----QQG--VKADPEA----VLSFHAVFACISLISQDIAKM 69 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcchh-----hcC--cccChHH----hhccHHHHHHHHHHHHhhccC Confidence 555544422211110000 00111111111111100 111 2233322 234667888999999999877 Q ss_pred CCEEecCCCcch-hhhhHHHHHHHHHHHHH---hhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEE Q lcl|NC_016762. 78 NPQVIEGDDQDR-SKDETEWERKNKPLIAG---GRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKV 152 (456) Q Consensus 78 ~~~i~~~~~~d~-~~~~~~~e~~i~~~~~~---l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i 152 (456) -+.+...+.+.. .+.... -+..++.+ +.-+..|.+.+.+ -.++|.+++++.- ++. +.+..+ T Consensus 70 p~~~~~~~~~g~~~~~~~~---~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r-~~~----------G~~~~L 135 (454) T protein:vir:93 70 RLRLMQTDAQGIRRETRRG---DIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIR-NAR----------GQIKEL 135 (454) T ss_pred ceEEEEeccCCccchhhhH---HHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEE-CCC----------CcEEEE Confidence 777654332211 111111 12222222 2223345555554 4566777666543 221 123444 Q ss_pred EEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecC----CcCCCcchHHHHHHHHHHHHH Q lcl|NC_016762. 153 TPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD----WTGDAIGFLEPAYNSFISLEK 228 (456) Q Consensus 153 ~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~----~~~~G~S~le~~~~~l~~~~~ 228 (456) .|+....+++. .+.. |. -+|++...... ..+....+.++.||||.. ....|.|.++.+.+.+.... T Consensus 136 ~~i~~~~v~v~---~~~~----g~-~~y~~~~~~~~-~~~~~~~~~~~eViH~k~~~~~~~~~G~sp~~~~~~~i~~~~- 205 (454) T protein:vir:93 136 RILDWNRVEPL---VADD----GE-VFYRITPDRNC-GITEAVTVPAREVIHDRFNCFFHPLIGLPPVYAAGLAATQGH- 205 (454) T ss_pred EEEcCcceEEE---EcCC----Cc-EEEEEEecccc-ccceeEEecCcceEEeccCCCCCCceeccHHHHHHHHHHHHH- Confidence 45444333331 1111 22 23555432211 112245677778887742 23469999998888765443 Q ss_pred HHHHHHHHHHHHhhhhhhh-hhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC--CC-eEEecCCCceeEEecccCC Q lcl|NC_016762. 229 VEGGSGESFLKNAARQLLL-NFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG--ND-VLLPTQGATVTQMVSAVSD 304 (456) Q Consensus 229 ~~~~~~~~~~~~~~~~l~~-~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~--~~-~~lid~~d~~~~~~~~~sg 304 (456) .+......++++..+.-.+ +.... + .++..+++.+.++.+.++ .+ .++++.+-+|++++.+..+ T Consensus 206 ~~~~~~~~~f~ng~~p~gil~~~~~-----l-------~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d 273 (454) T protein:vir:93 206 HIQENSTSFFRNGGRPSGVIEIPGS-----I-------TEENAKKLKSNWDSGYTGENAGKTAILSNGAKYNPTTFSPVD 273 (454) T ss_pred HHHHHHHHHHhccCCccEEEecCCC-----C-------CHHHHHHHHHHHHHHhcccccCCceeccCCceEEEcccChhH Confidence 3444555677775442221 11110 1 123344444455554443 22 4666777789998876654 Q ss_pred H--HHHHHHHHHHHHhhhcCCeEEeeccCCCcccch-H-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceE Q lcl|NC_016762. 305 P--GPTYNVNLQTAAAGVDIPTKILVGMQTGERASS-E-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFT 380 (456) Q Consensus 305 l--~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst-~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~ 380 (456) . -+........||.+.|||... +|..-++-.++ + -.+.||.. -|.|.++++-..|-+.-+-+....+. T Consensus 274 ~q~le~~~~~~~~Ia~~fgVPp~~-lg~~~~~t~sn~e~~~~~f~~~-------~l~P~~~~ie~~ln~~L~~~~~~~~~ 345 (454) T protein:vir:93 274 SQTVEQLKMTAEIVCSVFRVPAYK-IGVGQPPSSDNVEALEQQYYSQ-------CLQTLIESIELLLDEALETGENESTE 345 (454) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHH-cCCCCCCcchhHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCCCCcEEE Confidence 4 344445677899999999864 56443332222 2 23445543 46787777755554432222223466 Q ss_pred EEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCC---------cccCCCC--CCC--- Q lcl|NC_016762. 381 AIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPL---------PDTEPED--EDA--- 446 (456) Q Consensus 381 ~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~---------~~~~~~d--~~~--- 446 (456) |.++.|...+.+++++ +...+++.| +++++|+|+..+++|+++++.. +.....+ +++ T Consensus 346 f~~~~ll~~D~~~r~~-------~~~~~~~~G--~~T~NE~R~~~gl~pi~ggD~~~~~~~~~~~~~~~~~~~~~~~~~~ 416 (454) T protein:vir:93 346 FDVTTLLRMDSERRMK-------TLGDAVKNT--LLTPNEARKRENLPPLAGGDALYLQQQNYSLEALSRRDAREDPFAS 416 (454) T ss_pred eechhhhccCHHHHHH-------HHHHHHhCC--CcCHHHHHHHhCCCCCCCCCeeeeccCccchHhhhccCcccCCCCC Confidence 6667787777777654 455677777 9999999999999998765431 0000000 000 Q ss_pred -----CCcCC---------CCCCC Q lcl|NC_016762. 447 -----ARTDP---------TGEQQ 456 (456) Q Consensus 447 -----~~~d~---------~~~~e 456 (456) .++++ .++.| T Consensus 417 ~~~~~~~~~~~~~~d~~~~~~e~~ 440 (454) T protein:vir:93 417 SGKTASVPQAVAASDGNKAITETE 440 (454) T ss_pred CccCCCCCCCCCCCCCCCCccCCc Confidence 00000 01111 No 62 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=99.56 E-value=1.5e-14 Score=96.32 Aligned_cols=362 Identities=10% Similarity=0.009 Sum_probs=178.7 Q ss_pred HHHhhhhhccCccc----chhhhhccCcc-----cCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhh Q lcl|NC_016762. 22 MSLLNQGIGHDAKR----PQAWCEYGFPQ-----EITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKD 92 (456) Q Consensus 22 d~~~n~~~~~gt~~----~~~~~~~~~~~-----~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~ 92 (456) |+|.+.+.+.-.+. .-....++... .++.. .+| ++..++++|+++++++-+--+.+...+.. +. T Consensus 1 MGl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vt~~---~al-~~~~v~~~i~~Ia~~iA~lp~~v~~~~g~---~~ 73 (394) T protein:vir:62 1 MGLRDRFSNYLFKKAEKRGYLDNVLGKSIRYSGVYVTDS---NIL-QSSDVYELLQDISNQMVLADIVVEDEFGN---EI 73 (394) T ss_pred CchhhhhhhhccCCCCchhhhhhhhhcccccCccccChh---hhh-ccHHHHHHHHHHHHhhcccceEEEcCCCc---cc Confidence 45544432221111 11112222221 12322 334 46889999999999998877777643321 11 Q ss_pred hHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccc Q lcl|NC_016762. 93 ETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDS 171 (456) Q Consensus 93 ~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s 171 (456) .....-.|...=....-+..|.+.+.+ -.++|.+++++. ++....| ..+.|. .|. T Consensus 74 ~~~~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~---~~~~~~~--------~~~~~~-----------~~~-- 129 (394) T protein:vir:62 74 KDDIALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPILN---GAQIHLA--------SNVFTE-----------LDD-- 129 (394) T ss_pred chhhHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEe---cceeecc--------ccceEE-----------ECC-- Confidence 111111121111112233345555444 456677777652 2221110 011110 010 Q ss_pred cccCCceeEEEeecccCCccccceeeehhhhheecCC---cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhh-hhhh Q lcl|NC_016762. 172 ETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW---TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAAR-QLLL 247 (456) Q Consensus 172 ~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~---~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~-~l~~ 247 (456) +.+ .+|.+ .++.+.++.|+||... ...|.|.++.+...+.....+. ..+..++++..+ ...+ T Consensus 130 -~~~--~~~~~----------~~~~~~~~eiih~r~~~~d~~~G~s~~~~~~~~i~~~~~~~-~~~~~~~~ng~~~~~il 195 (394) T protein:vir:62 130 -NLV--EHFNI----------GGHEIPPCMIRHVKNIGADHLRGKGILDLGRDTLEGVMSAE-KTLTDKYKKGGLLTFLL 195 (394) T ss_pred -ceE--EEEee----------CCEEechhheEEecCcCCCCccccChHHHHHHHHHHHHHHH-HHHHHHHHccCCcceEE Confidence 111 11222 1356778888877433 2458899999888765544433 344445555322 1112 Q ss_pred hhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEEec--ccCC--HHHHHHHHHHHHHhh Q lcl|NC_016762. 248 NFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQMVS--AVSD--PGPTYNVNLQTAAAG 319 (456) Q Consensus 248 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~~~--~~sg--l~~~~~~~~~~~aaa 319 (456) +.... .. ...+..+++.+.+....++ ...+++..+.+++.... +... +-+.......+||.+ T Consensus 196 ~~~~~---------~~-~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~ 265 (394) T protein:vir:62 196 NLDAH---------IN-PQNGAQSKLINAILDQLESIDEARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKF 265 (394) T ss_pred EeCCC---------CC-cCHHHHHHHHHHHHHHhccccccCceeEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHH Confidence 11111 11 1122334444444433332 23456666676665444 3333 334445667889999 Q ss_pred hcCCeEEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC-CCceEEEeCCCCCCCHHHHHHHH Q lcl|NC_016762. 320 VDIPTKILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL-KAEFTAIWDDLTVPTKAERLANS 398 (456) Q Consensus 320 s~IP~t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~-~~d~~~~f~pL~~~seke~Aei~ 398 (456) .+||-..| |.... =|..+-.++||.. -|.|.+..+-..|-+.-+.+. ...+.|+|+.+.-++..+++ T Consensus 266 fgVPp~~l-g~~~~-sn~e~~~~~~~~~-------~l~P~~~~ie~~l~~kll~~~~~~~~~~~fd~~~~~~~~~~~--- 333 (394) T protein:vir:62 266 LGINVDTY-TELIK-EDIEKAMMYIHNK-------AVRPIMKNFEDHLSLLFYAQNSGKRIKFKINILDFVTYSNKT--- 333 (394) T ss_pred hCCCHHHc-CCCCC-cCHHHHHHHHHHH-------HHHHHHHHHHHHHhhhhcCccccCceEEEechhhhcCHHHHH--- Confidence 99999866 42211 0112234556654 378888877666654333321 23688999998888776554 Q ss_pred HHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCccc-----CCCCC--CCCCcCCCCC-CC Q lcl|NC_016762. 399 KTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDT-----EPEDE--DAARTDPTGE-QQ 456 (456) Q Consensus 399 ~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~-----~~~d~--~~~~~d~~~~-~e 456 (456) ++...++..| ++++||+|+..+++|+.+...+.-- .+.+. ....+..+++ +| T Consensus 334 ----~~~~~~~~~g--~~T~NE~R~~~gl~p~~~~~gd~~~~~~n~~~~~~~~~~~~~~kgge~~e 393 (394) T protein:vir:62 334 ----NIGYNLVRTA--ITSPDNVADMLGFPKQNTKESQAIYISNDVTEIGKKEATDGSLGGGEENE 393 (394) T ss_pred ----HHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCCeeecccccccccccccccccCCCCCCCC Confidence 4456778888 9999999999999988432221110 11111 1112223333 23 No 63 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=99.56 E-value=2.7e-15 Score=100.46 Aligned_cols=386 Identities=14% Similarity=0.068 Sum_probs=191.0 Q ss_pred HHHhhhhhccCc---ccchhh--hhccC-cccCCH-HHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCc---chhh Q lcl|NC_016762. 22 MSLLNQGIGHDA---KRPQAW--CEYGF-PQEITF-NDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQ---DRSK 91 (456) Q Consensus 22 d~~~n~~~~~gt---~~~~~~--~~~~~-~~~~~~-~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~---d~~~ 91 (456) |+|.+.+...-. ..+..+ ...+- ....+. ..+.+.|..++.++.||+++++++-+--+.+.....+ +..+ T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~~~dg~~~~~~ 80 (423) T protein:vir:81 1 MGFLQKLGLAPSVVATPEPIELVGPIFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQLQAFERVEDGGRERVR 80 (423) T ss_pred CchhHhhccccccccCccccccccccccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEEecCCceeeec Confidence 555554311110 000000 11100 111121 2467788899999999999999998776666432211 1111 Q ss_pred hhHHHHHHHHHHHH---HhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhc Q lcl|NC_016762. 92 DETEWERKNKPLIA---GGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDE 167 (456) Q Consensus 92 ~~~~~e~~i~~~~~---~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~ 167 (456) . . .+..++. .+.-+..|.+++-+ -.++|-+++++. +|... ...+..+.|+....+.+..+ . T Consensus 81 ~-~----~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~-rd~~~--------~~~~~~l~p~~~~~v~~~~~-~ 145 (423) T protein:vir:81 81 E-G----HLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLP-GDLGV--------DTPTLDIRPIPVSWVQRRAY-K 145 (423) T ss_pred c-c----hHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEE-ecCCc--------CcceEEEeecccceeeeeec-c Confidence 1 1 1222222 11234445555444 445676666553 33211 11233444443332222211 1 Q ss_pred cccccccCCceeEEEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_016762. 168 KPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAAR 243 (456) Q Consensus 168 Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~ 243 (456) | ..|.+. |++.... ...+....++++.|||+.... .-|.|.++.+.+.+-....+ ..++..++++..+ T Consensus 146 ~----~~~~~~-Y~~~~~~--~~~g~~~~~~~~evih~r~~~~~~~~~G~spi~~~~~~i~~~~~~-~~~~~~~f~ng~~ 217 (423) T protein:vir:81 146 D----GWGSLD-YIIIESG--DNDGRSVKVPGERVIHRHGYNPKTMKRGKSPVQSLRDILGEQIEA-AIFRAQMWRNGPR 217 (423) T ss_pred C----CCcceE-EEEEEec--CCCceEEEEcccceEEecCCCCCCccccccHHHHHHHHHHHHHHH-HHHHHHHHhccCC Confidence 2 223332 3333111 112234668889998875432 24999999998877554443 3444456666443 Q ss_pred hhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHh----cCC-CeEEecCCCceeEEecccCCH--HHHHHHHHHH Q lcl|NC_016762. 244 QLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLN----RGN-DVLLPTQGATVTQMVSAVSDP--GPTYNVNLQT 315 (456) Q Consensus 244 ~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~----~~~-~~~lid~~d~~~~~~~~~sgl--~~~~~~~~~~ 315 (456) .-. ++...... .+.-.++..+++.+.++... ++. +.++++.+-+|+.++.+..+. -+.......+ T Consensus 218 p~gvi~~~~~~~-------~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~e 290 (423) T protein:vir:81 218 PGMVIMRDPESK-------AGKWDAESRTRFMANLRASFSPKSSDVGGTLLLEDGMKAENFHTTSKDEQTVETTKLSLQT 290 (423) T ss_pred CceEEEecCccc-------CccCCHHHHHHHHHHHHHHhccccccCCcceecCCCceEEeccCChhhHHHHHHHHhhHHH Confidence 221 11111000 00111334444444444332 222 345666677899888765443 2334456678 Q ss_pred HHhhhcCCeEEeeccCCCcc-cchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC---CCCc--eEEEeCCCCC Q lcl|NC_016762. 316 AAAGVDIPTKILVGMQTGER-ASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP---LKAE--FTAIWDDLTV 388 (456) Q Consensus 316 ~aaas~IP~t~L~G~sp~Gl-nst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~---~~~d--~~~~f~pL~~ 388 (456) ||.+-|||.. ++|..-++- +..+ -.+.||.. -|.|.++.+-+.|-+.-+.+ ...+ |.|.++.|.. T Consensus 291 Ia~~fgVPp~-~lg~~~~~t~sn~e~~~~~f~~~-------~L~P~~~~ie~~l~~~L~~~~~~~~~~~~~~fd~~~llr 362 (423) T protein:vir:81 291 VAQVYGINPT-MVGQLDNANYSNVREFRKALYGD-------NLGSWIRIIQDVMNLFLLPRVGIDNEKFYFEFNLEEKLR 362 (423) T ss_pred HHHHhCCCHH-HhcCCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHhhhhcCccccccCccEEEecchhhhc Confidence 9999999965 567543332 2223 24556654 36676666555443322221 1223 4555567877 Q ss_pred CCHHHHHHHHHHHHHHHHHHH-HcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCC-CCCCcCCCCCCC Q lcl|NC_016762. 389 PTKAERLANSKTMSEINSAAI-GTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDE-DAARTDPTGEQQ 456 (456) Q Consensus 389 ~seke~Aei~~~~A~a~~~~~-~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~-~~~~~d~~~~~e 456 (456) .|-+++++. ..+++ ..| ++++||+|+..+++|+++++..-- +.+- ..+.+++.++++ T Consensus 363 ~d~~~r~~~-------~~~~l~~~G--~~T~NE~R~~~gl~p~~gGD~~~~--p~n~~~~~~~~~~~~~~ 421 (423) T protein:vir:81 363 ASFEEAAEI-------KRAAVGNVA--WMTINEVRAMDNLPSIDGGDDLAR--PLNTEFGDSEDAPGEEV 421 (423) T ss_pred cCHHHHHHH-------HHHHHhCCC--CcCHHHHHHHhCCCCCCCcceeec--ccccccCccCCCCCCCC Confidence 777777654 33344 346 899999999999999877654321 1111 122223333333 No 64 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=99.56 E-value=1.6e-14 Score=96.24 Aligned_cols=388 Identities=13% Similarity=0.042 Sum_probs=184.3 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCE Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQ 80 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~ 80 (456) |.=+-....+- ........+.+. ...|+.. . ......|.+. +. ++..+.+||+++|+++-.--+. T Consensus 1 m~~~~~~~~~~--~~~~~~~~~~~~-~~~g~~~---s-----~~~~~v~~~~---al-~~~~v~~cv~~ia~~ia~lp~~ 65 (419) T protein:vir:80 1 MFFSRQLLSNL--GQTQPGSGGWVS-ALLGSAR---S-----EAGQVVTPAS---AL-SLTVLQNCVTLLAESIAQLPVE 65 (419) T ss_pred CCccccccccc--CcCCCCcchhhH-Hhhcccc---c-----ccCcccChHH---hh-ccHHHHHHHHHHHHhhccCceE Confidence 33211110000 000000000111 0111100 0 0012233322 22 4778899999999999877777 Q ss_pred EecCCCcchhh-hhHHHHHHHHHHHHHhhHHHHHHHHHHhh-cccCceEEEEEecCCCCccccccCCcCceeEEEEeccc Q lcl|NC_016762. 81 VIEGDDQDRSK-DETEWERKNKPLIAGGRFWRAVSEADRRR-LVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAG 158 (456) Q Consensus 81 i~~~~~~d~~~-~~~~~e~~i~~~~~~l~~~~~~~ea~~~~-r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~ 158 (456) +...+++...+ ....+...+...=....-+..|.+++.+. .++|-+++++. +++. +.+..+.|+... T Consensus 66 ~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~-r~~~----------G~~~~L~~i~~~ 134 (419) T protein:vir:80 66 LYERSGDDRKPATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFID-RDQD----------GVIQGLYPLDNE 134 (419) T ss_pred EEEecCCCcccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEE-ECCC----------CcEEEEEEecCc Confidence 65433322111 11111111221111123344555555554 45566666553 3321 113344444433 Q ss_pred cCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecC---CcCCCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 159 CLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD---WTGDAIGFLEPAYNSFISLEKVEGGSGE 235 (456) Q Consensus 159 ~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~---~~~~G~S~le~~~~~l~~~~~~~~~~~~ 235 (456) .+++. .+. .|.+ .|++.. . ..++++-|+|+.. ....|.|.++.+.+.+.....+ ..... T Consensus 135 ~v~i~---~~~----~~~~-~y~~~~----~-----~~~~~~~i~h~~~~~~d~~~G~s~i~~~~~~i~~~~~~-~~~~~ 196 (419) T protein:vir:80 135 AVTVM---KGP----DLKP-MYRVAG----A-----DPLPQRLVHHVRWMSINGYTGLSPVLLHANAIGHAQAI-QQYAG 196 (419) T ss_pred eEEEE---ECC----CceE-EEEEcC----c-----cccchhheEEecCCCCCCcccccHHHHHHHHHHHHHHH-HHHHH Confidence 33331 121 1233 344531 1 1234444444421 2357999999998876544443 34444 Q ss_pred HHHHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEEecccCC--HHHH Q lcl|NC_016762. 236 SFLKNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQMVSAVSD--PGPT 308 (456) Q Consensus 236 ~~~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~~~~~sg--l~~~ 308 (456) .++++..+.-. ++... ++ .+...++..+++.+.++...++ ...++++.+-+|+.++.+..+ +-+. T Consensus 197 ~~f~ng~~~~gil~~~~--~~------~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~ 268 (419) T protein:vir:80 197 KSFMNGTALSGVIERPT--DA------PALKDQASVDRITDGWNAKFGGSGNAKKVALLQEGMKFKPLSMTNVDAALIDA 268 (419) T ss_pred HHHhcCCCccEEEEecC--CC------CcccCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEeccCChhhHHHHHH Confidence 55665433211 11100 00 0111133344454445444333 235667777889988876654 3566 Q ss_pred HHHHHHHHHhhhcCCeEEeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC-CCc--eEEEeC Q lcl|NC_016762. 309 YNVNLQTAAAGVDIPTKILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL-KAE--FTAIWD 384 (456) Q Consensus 309 ~~~~~~~~aaas~IP~t~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~-~~d--~~~~f~ 384 (456) .....++||.+.|||...|-....+..++.+ -.+.||..+ |.|.++.+-+.|-+.-+-+. ... |.|.++ T Consensus 269 ~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~~f~~~~-------l~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~ 341 (419) T protein:vir:80 269 LRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYT-------LLPWVKRHEQAKTRDLLLPSERKQYFIEYNLA 341 (419) T ss_pred HHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHH-------HHHHHHHHHHHHhhhccCccccCCeEEEEech Confidence 6677889999999998755333333333333 345666654 67877777665544322221 123 455556 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCC-CCCCCCCcCCCCCCC Q lcl|NC_016762. 385 DLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEP-EDEDAARTDPTGEQQ 456 (456) Q Consensus 385 pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~-~d~~~~~~d~~~~~e 456 (456) .|...|.+++++ +..++++.| ++|+||+|+..+++|+++++..--.-. ...+.+.+.+.++.+ T Consensus 342 ~l~~~d~~~~~~-------~~~~~~~~G--~~T~NE~R~~~g~~p~~gGD~~~~~~n~~~~~~~~~~~~~~~~ 405 (419) T protein:vir:80 342 GLLRGDQSSRYA-------AYAVGRQWG--WLSINDIRRLENMPPVKGGDIYLSPMNMVDASKPQPIPMGKTE 405 (419) T ss_pred hhhccCHHHHHH-------HHHHHHhCC--CcCHHHHHHHhCCCCCCCcceeeeccccccccccccccCCCCC Confidence 777777777765 445567777 999999999999999876644311000 011111112222222 No 65 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=99.56 E-value=8.8e-15 Score=97.65 Aligned_cols=387 Identities=15% Similarity=0.072 Sum_probs=197.4 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhcc--Cccc-----chh-----hhhccCcccCCHHHHHHHHhcCchhhhhhc Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGH--DAKR-----PQA-----WCEYGFPQEITFNDLYTMYRRGGIAHGAVE 68 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~--gt~~-----~~~-----~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd 68 (456) |-+-.- ..+ . +.+-++-|.+.+. |..+ ... +..+.....++. ..+.++..+.+||+ T Consensus 1 ~~~~~~-~~~-~------~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~----~~al~~~~v~~cv~ 68 (424) T protein:vir:18 1 MEEPKY-TID-L------RTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSIND----ERILQISTVWRCVS 68 (424) T ss_pred CCCCcc-eEe-e------cCCCchHHHHHhhhcccccccccccccccccccccccccccccH----HHhhccHHHHHHHH Confidence 322110 000 0 0111222211110 0000 000 000001112333 23456788899999 Q ss_pred cchhHHhhCCCEEecCCCcchhhh---hHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccC Q lcl|NC_016762. 69 KIVTTCWKTNPQVIEGDDQDRSKD---ETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARG 144 (456) Q Consensus 69 ~~aed~tR~~~~i~~~~~~d~~~~---~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~ 144 (456) +++++.-.--+.+...+++...+. ...+-+.|...=....-+..|.+.+.+ -.++|-+++++. ++. T Consensus 69 ~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~-r~~--------- 138 (424) T protein:vir:18 69 LISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNS--------- 138 (424) T ss_pred HHHHhhccCceEEEEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEE-ECC--------- Confidence 999999776666654443322111 111112122111111223334444443 455677777653 221 Q ss_pred CcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc---CCCcchHHHHHH Q lcl|NC_016762. 145 KLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT---GDAIGFLEPAYN 221 (456) Q Consensus 145 ~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~---~~G~S~le~~~~ 221 (456) .+.+..+.|+....+++.. + +.+-.|++.. +| ..+.+.++.||||.... ..|.|.++.+.+ T Consensus 139 -~G~~~~L~pl~~~~V~v~~-~--------~~~~~y~~~~---~g---~~~~~~~~eIih~r~~~~dg~~G~spi~~~~~ 202 (424) T protein:vir:18 139 -AGDVISLLPLQSANMDVKL-V--------GKKVVYRYQR---DS---EYADFSQKEIFHLKGFGFTGLVGLSPIAFACK 202 (424) T ss_pred -CCcEEEEEEecCcceEEEE-c--------CCeEEEEEEe---CC---eEEEeccccEEEecCcCCCCcccccHHHHHHH Confidence 1123445555444443311 1 1234566642 22 23568888888885332 358999999988 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC---CeEEecCCCceeEE Q lcl|NC_016762. 222 SFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGN---DVLLPTQGATVTQM 298 (456) Q Consensus 222 ~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~lid~~d~~~~~ 298 (456) .+... ......+..++++..+.-.+.. +. + +.-.++..+++.+.+..+.++. +.++++.+-+|+++ T Consensus 203 ~i~~~-~a~~~~~~~~f~ng~~p~gil~---~~--~-----~~l~~e~~~~~~~~~~~~~~g~nag~~~vl~~g~~~~~l 271 (424) T protein:vir:18 203 SAGVA-VAMEDQQRDFFANGAKSPQILS---TG--E-----KVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAI 271 (424) T ss_pred HHHHH-HHHHHHHHHHHHccCCcceEEE---eC--C-----cCCCHHHHHHHHHHHHHHhCCcccCCceeccCCceEEec Confidence 76543 3444555566666544322211 10 0 0011233444444555544332 35677777789888 Q ss_pred ecccCC--HHHHHHHHHHHHHhhhcCCeEEeeccCCCc-ccch--H-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcC Q lcl|NC_016762. 299 VSAVSD--PGPTYNVNLQTAAAGVDIPTKILVGMQTGE-RASS--E-DQKYHNARCQARRVQELTFEINDLFAHLMRIGV 372 (456) Q Consensus 299 ~~~~sg--l~~~~~~~~~~~aaas~IP~t~L~G~sp~G-lnst--~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~ 372 (456) +.+..+ +-+......++||.+.|||-..| |...++ ..++ + -...||.. -|.|.++++-+.|-+.-+ T Consensus 272 ~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~sn~eq~~~~f~~~-------tl~P~~~~ie~~l~~~L~ 343 (424) T protein:vir:18 272 GVTPQDAEMMASRKFQVSELARFFGVPPHLV-GDVEKSTSWGSGIEQQNLGFLQY-------TLQPYISRWENSIQRWLI 343 (424) T ss_pred CCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCcccccccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcC Confidence 876543 34455567788999999998654 654443 3222 2 34566643 578888887666644333 Q ss_pred cCCC-Cc--eEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc------ccCCCC Q lcl|NC_016762. 373 VPLK-AE--FTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP------DTEPED 443 (456) Q Consensus 373 ~~~~-~d--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~------~~~~~d 443 (456) .+.. .. +.|.++.|...+.+++++.. .+++..| ++++||+|+..+++|+++++..- ...+.. T Consensus 344 ~~~~~~~~~~~fd~~~llr~d~~~r~~~~-------~~~~~~G--~~T~NE~R~~~gl~pi~gGD~~~~~~n~~~l~~~~ 414 (424) T protein:vir:18 344 PAKDVGRIHAEHNLDGLLRGDSASRAAFM-------KAMGEAG--LRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLG 414 (424) T ss_pred CccccCCeEEEEechhhhccCHHHHHHHH-------HHHHhCC--CcCHHHHHHHhCCCCCCCcCeeeeccCccchHhhh Confidence 2211 12 56667788888888875554 4567777 99999999999999987664421 001122 Q ss_pred CCCCCcCCCC Q lcl|NC_016762. 444 EDAARTDPTG 453 (456) Q Consensus 444 ~~~~~~d~~~ 453 (456) .+.++.+.+| T Consensus 415 ~~~~p~~~ga 424 (424) T protein:vir:18 415 TNKEPRNNGA 424 (424) T ss_pred ccCCCccCCC Confidence 3344555555 No 66 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=99.55 E-value=8.8e-15 Score=97.64 Aligned_cols=387 Identities=16% Similarity=0.092 Sum_probs=197.2 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhcc--C----cccch------hhhhccCcccCCHHHHHHHHhcCchhhhhhc Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGH--D----AKRPQ------AWCEYGFPQEITFNDLYTMYRRGGIAHGAVE 68 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~--g----t~~~~------~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd 68 (456) |-+- +.-.+.+- +-++-+.+.+. | +..+. .+..+......+.+ .+.++..+.+||+ T Consensus 1 ~~~~-~~~~~~~~-------~~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~----~al~~~~v~~cv~ 68 (424) T protein:vir:18 1 MEEP-KYTIDLRT-------NNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDE----RILQISTVWRCVS 68 (424) T ss_pred CCCC-ccccccCC-------CCchHHHHHhhccccccccccchhhccccccccccccccccHH----HhhccHHHHHHHH Confidence 3221 11111111 11222221111 0 00000 01111111233432 3456788899999 Q ss_pred cchhHHhhCCCEEecCCCcchhhh---hHHHHHHHHHHHHHhhHHHHHHHHHH-hhcccCceEEEEEecCCCCccccccC Q lcl|NC_016762. 69 KIVTTCWKTNPQVIEGDDQDRSKD---ETEWERKNKPLIAGGRFWRAVSEADR-RRLVGRYSGLLLHIRDSQPWDRPARG 144 (456) Q Consensus 69 ~~aed~tR~~~~i~~~~~~d~~~~---~~~~e~~i~~~~~~l~~~~~~~ea~~-~~r~~Ggs~i~i~i~D~~~~~~Pl~~ 144 (456) ++++++-.=-+.+...+++...+. ...+-..|...=....-+..|.+.+- .-.++|-+++++. ++. T Consensus 69 ~Ia~~iA~lp~~vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~-r~~--------- 138 (424) T protein:vir:18 69 LISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNS--------- 138 (424) T ss_pred HHHHhhccCceEEEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEE-ECC--------- Confidence 999999766666654333221111 11111112111111112233444444 4556677777653 221 Q ss_pred CcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc---CCCcchHHHHHH Q lcl|NC_016762. 145 KLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT---GDAIGFLEPAYN 221 (456) Q Consensus 145 ~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~---~~G~S~le~~~~ 221 (456) .+.+..+.|+....+++.. + +.+-+|++.. +| ..+.++++.||||.... ..|.|.++.+.+ T Consensus 139 -~G~~~~L~~l~~~~v~v~~-~--------~~~~~y~~~~---~g---~~~~~~~~eVihir~~~~dg~~G~spi~~~~~ 202 (424) T protein:vir:18 139 -AGDVISLLPLQSANMDVKL-V--------GKKVVYRYQR---DS---EYADFSQKEIFHLKGFGFTGLVGLSPIAFACK 202 (424) T ss_pred -CCcEEEEEEecCcceEEEE-c--------CCeEEEEEEe---CC---eEEEeccccEEEecCcCCCCcccccHHHHHHH Confidence 1123345554443343311 1 2244566652 22 24578899998885432 358999999887 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC---CeEEecCCCceeEE Q lcl|NC_016762. 222 SFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGN---DVLLPTQGATVTQM 298 (456) Q Consensus 222 ~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~lid~~d~~~~~ 298 (456) .+.. .......+..+|++..+.-.+.. +. + +.-.++..+++.+.+..+..+. +.++++.+-+|+.+ T Consensus 203 ~i~~-~~~~~~~~~~~f~ng~~~~gil~---~~--~-----~~l~~e~~~~~~~~~~~~~~~~nag~~~vl~~g~~~~~l 271 (424) T protein:vir:18 203 SAGV-AVAMEDQQRDFFANGAKSPQILS---TG--E-----KVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAI 271 (424) T ss_pred HHHH-HHHHHHHHHHHHhccCCcceEEE---eC--C-----cCCCHHHHHHHHHHHHHHhCCcccCCceeccCCceEEec Confidence 7644 33444555566666543322211 10 0 0011333444444555544332 35667777789888 Q ss_pred ecccCC--HHHHHHHHHHHHHhhhcCCeEEeeccCCCccc-ch--H-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcC Q lcl|NC_016762. 299 VSAVSD--PGPTYNVNLQTAAAGVDIPTKILVGMQTGERA-SS--E-DQKYHNARCQARRVQELTFEINDLFAHLMRIGV 372 (456) Q Consensus 299 ~~~~sg--l~~~~~~~~~~~aaas~IP~t~L~G~sp~Gln-st--~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~ 372 (456) +.+..+ +-+......+.||.+.|||-. ++|...++-. ++ + -...||.. -|.|.++++-+.|-+.-+ T Consensus 272 ~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~t~~~sn~eq~~~~f~~~-------tl~P~~~~ie~~ln~~L~ 343 (424) T protein:vir:18 272 GVTPQDAEMMASRKFQVSELARFFGVPPH-LVGDVEKSTSWGSGIEQQNLGFLQY-------TLQPYISRWENSIQRWLI 343 (424) T ss_pred CCChhHHHHHHHHHHhHHHHHHHhCCCHH-HhCCCCCcccccccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcC Confidence 876553 345556677889999999976 4576554432 22 2 24456543 578888887666644333 Q ss_pred cCC---CCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc---cc---CCCC Q lcl|NC_016762. 373 VPL---KAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP---DT---EPED 443 (456) Q Consensus 373 ~~~---~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~---~~---~~~d 443 (456) .+. .--+.|.+..|...|.+++++.. ..++..| ++++||+|+..+++|+++++..- .- .... T Consensus 344 ~~~~~~~~~~~fd~~~llr~d~~~r~~~~-------~~~~~~G--~~T~NE~R~~~gl~pi~ggD~~~~~~n~~~l~~~~ 414 (424) T protein:vir:18 344 PSKDVGRLHAEHNLDGLLRGDSASRAAFM-------KAMGESG--LRTINEMRRTDNMPPLPGGDVAMRQAQYVPITDLG 414 (424) T ss_pred CccccCCeEEEEechhhhccCHHHHHHHH-------HHHHhCC--CcCHHHHHHHhCCCCCCCcCeeeeccCccchhhhh Confidence 221 11356666788888888876554 4567777 99999999999999987654421 00 1112 Q ss_pred CCCCCcCCCC Q lcl|NC_016762. 444 EDAARTDPTG 453 (456) Q Consensus 444 ~~~~~~d~~~ 453 (456) .+.++.+.+| T Consensus 415 ~~~~~~~n~a 424 (424) T protein:vir:18 415 TNKEPRNNGA 424 (424) T ss_pred ccCCccccCC Confidence 2334444455 No 67 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=99.55 E-value=3.9e-15 Score=99.59 Aligned_cols=360 Identities=13% Similarity=0.074 Sum_probs=174.3 Q ss_pred HHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHH Q lcl|NC_016762. 22 MSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNK 101 (456) Q Consensus 22 d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~ 101 (456) |+|.+...+....... +..+.....++ ...|..+..+++||+.+++++-+--+.+..+..... ..+...+. T Consensus 1 Mg~f~~lf~~~~~~~~-~~~~~~~~~v~----~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~----~~~~~ll~ 71 (395) T protein:vir:95 1 MSILEKIFKTRKDITY-MLDLDMIEDLS----QQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQK----NDVYYKLN 71 (395) T ss_pred CchhhhhhccCccccc-cccchhccccc----hhhhhhhHHHHHHHHHHHHhhccceeEeccCCcccc----chHHHHHH Confidence 5554433332221111 11111111112 234667899999999999999887766654433221 12222222 Q ss_pred HHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccccccCCceeEE Q lcl|NC_016762. 102 PLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWE 181 (456) Q Consensus 102 ~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~ 181 (456) ..=..+.-+..|.+++...++.||.++++...++.-+ |+. .|. +.+. ...++. ...+. T Consensus 72 ~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~~~--~~~-----------~~~--~~~~-----~~~~~~--~~~~~ 129 (395) T protein:vir:95 72 IKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKELL--IAD-----------SFY--REEY-----ALYDDI--FKDVT 129 (395) T ss_pred hccCcCCCHHHHHHHHHHHHhhCCceEEEEecCCCeE--ecC-----------Ccc--ceeE-----eecCcc--eeEEE Confidence 2212233444555555555566665555443322111 111 010 1110 011110 01111 Q ss_pred EeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhh Q lcl|NC_016762. 182 YTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGE 257 (456) Q Consensus 182 i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 257 (456) +. ..+-...+.++.||||... ...|.|.++.+...+.... ..+..+......+. +. T Consensus 130 ~~------~~~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~------~~- 189 (395) T protein:vir:95 130 VK------DYTYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILK------SA- 189 (395) T ss_pred Ec------CceeeeeeccccEEEEccCCCCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEE------eC- Confidence 11 0111245778888887432 2347787776654432211 11222211111111 00 Q ss_pred HHhhhcCCHHHHHHHHHHHHHHHhcC---CCe-EE-ecCCCceeEEecccCCH-------HHHHHHHHHHHHhhhcCCeE Q lcl|NC_016762. 258 IASTYGVTLDALNERFNEAARQLNRG---NDV-LL-PTQGATVTQMVSAVSDP-------GPTYNVNLQTAAAGVDIPTK 325 (456) Q Consensus 258 l~~~~~~~~~~~~~~~~~~~~~~~~~---~~~-~l-id~~d~~~~~~~~~sgl-------~~~~~~~~~~~aaas~IP~t 325 (456) + +...++..+++.+.+..+.++ .+. ++ ++.+.+|+.++.+..+. -+......++||.+.+||-. T Consensus 190 --~--~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~ 265 (395) T protein:vir:95 190 --S--SAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPG 265 (395) T ss_pred --C--CCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHH Confidence 0 111234444444444444332 222 33 46778899888766543 33344566789999999988 Q ss_pred EeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC--CCceEEEeCCCCCCCHHHHHHHHHHHH Q lcl|NC_016762. 326 ILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL--KAEFTAIWDDLTVPTKAERLANSKTMS 402 (456) Q Consensus 326 ~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~--~~d~~~~f~pL~~~seke~Aei~~~~A 402 (456) +| | |..++.+ ..++||.. -|.|.++.+-..|-+.-+.+. ...+.|.+++|...+.+++++. T Consensus 266 ~l-~---~~~sn~e~~~~~~~~~-------~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~----- 329 (395) T protein:vir:95 266 LI-Y---GETADLEKNTLVFEKF-------CLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEA----- 329 (395) T ss_pred Hh-c---CcccCHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHHH----- Confidence 66 3 2222234 45667753 378888777666654433321 2346788889988888876554 Q ss_pred HHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc-----------ccCCCCCCCCCcCC-CCCCC Q lcl|NC_016762. 403 EINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP-----------DTEPEDEDAARTDP-TGEQQ 456 (456) Q Consensus 403 ~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~-----------~~~~~d~~~~~~d~-~~~~e 456 (456) ...++..| +++++|+|+..+++|++++..+. ..+..+....+..+ ++++. T Consensus 330 --~~~~~~~G--~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~kgg~~~ 391 (395) T protein:vir:95 330 --IDKLVSSG--SFTRNEVRIMLGEEPSDNPELDEYLITKNYEKANSGENDEKEKDENTLKGGDED 391 (395) T ss_pred --HHHHHhCC--CcCHHHHHHHhCCCCCCCCCCceeeeccccccccccccccCcccccccCCCCCC Confidence 45567777 99999999999999987642111 10111111111111 11111 No 68 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=99.55 E-value=3.9e-15 Score=99.59 Aligned_cols=360 Identities=13% Similarity=0.074 Sum_probs=174.3 Q ss_pred HHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHH Q lcl|NC_016762. 22 MSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNK 101 (456) Q Consensus 22 d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~ 101 (456) |+|.+...+....... +..+.....++ ...|..+..+++||+.+++++-+--+.+..+..... ..+...+. T Consensus 1 Mg~f~~lf~~~~~~~~-~~~~~~~~~v~----~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~----~~~~~ll~ 71 (395) T protein:vir:10 1 MSILEKIFKTRKDITY-MLDLDMIEDLS----QQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQK----NDVYYKLN 71 (395) T ss_pred CchhhhhhccCccccc-cccchhccccc----hhhhhhhHHHHHHHHHHHHhhccceeEeccCCcccc----chHHHHHH Confidence 5554433332221111 11111111112 234667899999999999999887766654433221 12222222 Q ss_pred HHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccccccCCceeEE Q lcl|NC_016762. 102 PLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWE 181 (456) Q Consensus 102 ~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~ 181 (456) ..=..+.-+..|.+++...++.||.++++...++.-+ |+. .|. +.+. ...++. ...+. T Consensus 72 ~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~~~--~~~-----------~~~--~~~~-----~~~~~~--~~~~~ 129 (395) T protein:vir:10 72 IKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKELL--IAD-----------SFY--REEY-----ALYDDI--FKDVT 129 (395) T ss_pred hccCcCCCHHHHHHHHHHHHhhCCceEEEEecCCCeE--ecC-----------Ccc--ceeE-----eecCcc--eeEEE Confidence 2212233444555555555566665555443322111 111 010 1110 011110 01111 Q ss_pred EeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhh Q lcl|NC_016762. 182 YTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGE 257 (456) Q Consensus 182 i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 257 (456) +. ..+-...+.++.||||... ...|.|.++.+...+.... ..+..+......+. +. T Consensus 130 ~~------~~~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~------~~- 189 (395) T protein:vir:10 130 VK------DYTYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILK------SA- 189 (395) T ss_pred Ec------CceeeeeeccccEEEEccCCCCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEE------eC- Confidence 11 0111245778888887432 2347787776654432211 11222211111111 00 Q ss_pred HHhhhcCCHHHHHHHHHHHHHHHhcC---CCe-EE-ecCCCceeEEecccCCH-------HHHHHHHHHHHHhhhcCCeE Q lcl|NC_016762. 258 IASTYGVTLDALNERFNEAARQLNRG---NDV-LL-PTQGATVTQMVSAVSDP-------GPTYNVNLQTAAAGVDIPTK 325 (456) Q Consensus 258 l~~~~~~~~~~~~~~~~~~~~~~~~~---~~~-~l-id~~d~~~~~~~~~sgl-------~~~~~~~~~~~aaas~IP~t 325 (456) + +...++..+++.+.+..+.++ .+. ++ ++.+.+|+.++.+..+. -+......++||.+.+||-. T Consensus 190 --~--~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~ 265 (395) T protein:vir:10 190 --S--SAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPG 265 (395) T ss_pred --C--CCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHH Confidence 0 111234444444444444332 222 33 46778899888766543 33344566789999999988 Q ss_pred EeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC--CCceEEEeCCCCCCCHHHHHHHHHHHH Q lcl|NC_016762. 326 ILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL--KAEFTAIWDDLTVPTKAERLANSKTMS 402 (456) Q Consensus 326 ~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~--~~d~~~~f~pL~~~seke~Aei~~~~A 402 (456) +| | |..++.+ ..++||.. -|.|.++.+-..|-+.-+.+. ...+.|.+++|...+.+++++. T Consensus 266 ~l-~---~~~sn~e~~~~~~~~~-------~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~----- 329 (395) T protein:vir:10 266 LI-Y---GETADLEKNTLVFEKF-------CLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEA----- 329 (395) T ss_pred Hh-c---CcccCHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHHH----- Confidence 66 3 2222234 45667753 378888777666654433321 2346788889988888876554 Q ss_pred HHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc-----------ccCCCCCCCCCcCC-CCCCC Q lcl|NC_016762. 403 EINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP-----------DTEPEDEDAARTDP-TGEQQ 456 (456) Q Consensus 403 ~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~-----------~~~~~d~~~~~~d~-~~~~e 456 (456) ...++..| +++++|+|+..+++|++++..+. ..+..+....+..+ ++++. T Consensus 330 --~~~~~~~G--~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~kgg~~~ 391 (395) T protein:vir:10 330 --IDKLVSSG--SFTRNEVRIMLGEEPSDNPELDEYLITKNYEKANSGENDEKEKDENTLKGGDED 391 (395) T ss_pred --HHHHHhCC--CcCHHHHHHHhCCCCCCCCCCceeeeccccccccccccccCcccccccCCCCCC Confidence 45567777 99999999999999987642111 10111111111111 11111 No 69 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=99.55 E-value=3.9e-15 Score=99.59 Aligned_cols=360 Identities=13% Similarity=0.074 Sum_probs=174.3 Q ss_pred HHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHH Q lcl|NC_016762. 22 MSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNK 101 (456) Q Consensus 22 d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~ 101 (456) |+|.+...+....... +..+.....++ ...|..+..+++||+.+++++-+--+.+..+..... ..+...+. T Consensus 1 Mg~f~~lf~~~~~~~~-~~~~~~~~~v~----~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~----~~~~~ll~ 71 (395) T protein:vir:10 1 MSILEKIFKTRKDITY-MLDLDMIEDLS----QQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQK----NDVYYKLN 71 (395) T ss_pred CchhhhhhccCccccc-cccchhccccc----hhhhhhhHHHHHHHHHHHHhhccceeEeccCCcccc----chHHHHHH Confidence 5554433332221111 11111111112 234667899999999999999887766654433221 12222222 Q ss_pred HHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccccccCCceeEE Q lcl|NC_016762. 102 PLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWE 181 (456) Q Consensus 102 ~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~ 181 (456) ..=..+.-+..|.+++...++.||.++++...++.-+ |+. .|. +.+. ...++. ...+. T Consensus 72 ~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~~~--~~~-----------~~~--~~~~-----~~~~~~--~~~~~ 129 (395) T protein:vir:10 72 IKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKELL--IAD-----------SFY--REEY-----ALYDDI--FKDVT 129 (395) T ss_pred hccCcCCCHHHHHHHHHHHHhhCCceEEEEecCCCeE--ecC-----------Ccc--ceeE-----eecCcc--eeEEE Confidence 2212233444555555555566665555443322111 111 010 1110 011110 01111 Q ss_pred EeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhh Q lcl|NC_016762. 182 YTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGE 257 (456) Q Consensus 182 i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 257 (456) +. ..+-...+.++.||||... ...|.|.++.+...+.... ..+..+......+. +. T Consensus 130 ~~------~~~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~------~~- 189 (395) T protein:vir:10 130 VK------DYTYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILK------SA- 189 (395) T ss_pred Ec------CceeeeeeccccEEEEccCCCCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEE------eC- Confidence 11 0111245778888887432 2347787776654432211 11222211111111 00 Q ss_pred HHhhhcCCHHHHHHHHHHHHHHHhcC---CCe-EE-ecCCCceeEEecccCCH-------HHHHHHHHHHHHhhhcCCeE Q lcl|NC_016762. 258 IASTYGVTLDALNERFNEAARQLNRG---NDV-LL-PTQGATVTQMVSAVSDP-------GPTYNVNLQTAAAGVDIPTK 325 (456) Q Consensus 258 l~~~~~~~~~~~~~~~~~~~~~~~~~---~~~-~l-id~~d~~~~~~~~~sgl-------~~~~~~~~~~~aaas~IP~t 325 (456) + +...++..+++.+.+..+.++ .+. ++ ++.+.+|+.++.+..+. -+......++||.+.+||-. T Consensus 190 --~--~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~ 265 (395) T protein:vir:10 190 --S--SAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPG 265 (395) T ss_pred --C--CCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHH Confidence 0 111234444444444444332 222 33 46778899888766543 33344566789999999988 Q ss_pred EeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC--CCceEEEeCCCCCCCHHHHHHHHHHHH Q lcl|NC_016762. 326 ILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL--KAEFTAIWDDLTVPTKAERLANSKTMS 402 (456) Q Consensus 326 ~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~--~~d~~~~f~pL~~~seke~Aei~~~~A 402 (456) +| | |..++.+ ..++||.. -|.|.++.+-..|-+.-+.+. ...+.|.+++|...+.+++++. T Consensus 266 ~l-~---~~~sn~e~~~~~~~~~-------~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~----- 329 (395) T protein:vir:10 266 LI-Y---GETADLEKNTLVFEKF-------CLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEA----- 329 (395) T ss_pred Hh-c---CcccCHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHHH----- Confidence 66 3 2222234 45667753 378888777666654433321 2346788889988888876554 Q ss_pred HHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc-----------ccCCCCCCCCCcCC-CCCCC Q lcl|NC_016762. 403 EINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP-----------DTEPEDEDAARTDP-TGEQQ 456 (456) Q Consensus 403 ~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~-----------~~~~~d~~~~~~d~-~~~~e 456 (456) ...++..| +++++|+|+..+++|++++..+. ..+..+....+..+ ++++. T Consensus 330 --~~~~~~~G--~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~kgg~~~ 391 (395) T protein:vir:10 330 --IDKLVSSG--SFTRNEVRIMLGEEPSDNPELDEYLITKNYEKANSGENDEKEKDENTLKGGDED 391 (395) T ss_pred --HHHHHhCC--CcCHHHHHHHhCCCCCCCCCCceeeeccccccccccccccCcccccccCCCCCC Confidence 45567777 99999999999999987642111 10111111111111 11111 No 70 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=99.55 E-value=1.2e-14 Score=96.90 Aligned_cols=375 Identities=8% Similarity=-0.020 Sum_probs=179.4 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCE Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQ 80 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~ 80 (456) +..++..... ......+++.+..-........ -....+.+ .+.+++.+.++|+++|+++-+--++ T Consensus 3 ~f~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~------~~~~v~~~----~al~~~~v~~~i~~ia~~ia~~p~~ 67 (386) T protein:vir:49 3 IFNITNLATE-----SPPINQESFFDIADSDFLASLN------SSEWVSAE----NALKNSDLFSIISQLSNDLATAKIT 67 (386) T ss_pred hhhhhccCCC-----Ccccchhhhhhhhhcccccccc------CCceechh----hhhccHHHHHHHHHHHHHhhhCcee Confidence 1111111100 0001112221111000000000 00112221 2345788899999999999877777 Q ss_pred EecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhc-ccCceEEEEEecCCCCccccccCCcCceeEEEEecccc Q lcl|NC_016762. 81 VIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRL-VGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGC 159 (456) Q Consensus 81 i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r-~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~ 159 (456) +...... .+...-..+.-+..|.+.+.+.+ ++|-+++++.- ++. +.+..+.|+-... T Consensus 68 ~~~~~~~-----------~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r-~~~----------g~~~~l~~i~~~~ 125 (386) T protein:vir:49 68 TSRKQLQ-----------GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWR-NDN----------GRDMKWEYLRPSQ 125 (386) T ss_pred eccchhh-----------hhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEE-CCC----------CcEEEEEEecCce Confidence 6432211 12212222233455666666655 45766666543 221 1122333333333 Q ss_pred CChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 160 LKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNSFISLEKVEGGSGE 235 (456) Q Consensus 160 ~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~l~~~~~~~~~~~~ 235 (456) +++.. ++ + +.+-+|.+...... .+..+.++++.||||.... ..|.|.++.+.+.+.....+.. ... T Consensus 126 v~v~~---~~---~-~~~~~y~~~~~~~~--~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~-~~~ 195 (386) T protein:vir:49 126 VSFNR---LD---N-QNGLYYNITFDDPH--IAPKQHVPQNDILHFRLLSVDGGLTSVSPLMALGREFNIQKASDK-LTI 195 (386) T ss_pred eEEEE---cC---C-CceEEEEEEEcCcc--ccceeEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHH-HHH Confidence 33221 11 1 22345666432211 1234678899999986432 3599999999987765554433 333 Q ss_pred HHHHHhhhhh-hhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC-CeEEecCCCceeEEecccCCH--HHHHHH Q lcl|NC_016762. 236 SFLKNAARQL-LLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGN-DVLLPTQGATVTQMVSAVSDP--GPTYNV 311 (456) Q Consensus 236 ~~~~~~~~~l-~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~lid~~d~~~~~~~~~sgl--~~~~~~ 311 (456) .++++....- .++... ...++..+++.+....+.++. +.++++.+.+|+.++.+.... -+.... T Consensus 196 ~~~~ng~~~~~il~~~~------------~~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~ 263 (386) T protein:vir:49 196 SALKNALNANGILKIKG------------GGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADW 263 (386) T ss_pred HHHHccCCccEEEEeCC------------CCChHHHHHHHHHHHHhccCCCCceecCCCceEEEccCChhHHHHHHHHHH Confidence 4555533221 121111 111122222222223344444 456666777899988766543 456677 Q ss_pred HHHHHHhhhcCCeEEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCH Q lcl|NC_016762. 312 NLQTAAAGVDIPTKILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTK 391 (456) Q Consensus 312 ~~~~~aaas~IP~t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~se 391 (456) ..++||++.|||-.+| |.+.++-+..+..+.|| ...++|.++.+...|-+. ++ ..+.|....+..++. T Consensus 264 ~~~~Ia~~fgVPp~~l-g~~~~~~~~~~~~~~~~-------~~~i~~~l~~i~~~~~~~-l~---~~~~~~~~~~~~~d~ 331 (386) T protein:vir:49 264 TTGQFAKVYGIPESIV-GGDGDQQSSLEMIYNIY-------FKSVSRYLRPFVSEMSKK-LS---CEVDVDISPAVDPTG 331 (386) T ss_pred HHHHHHHHhCCCHHHh-CCCCCccchHHHHHHHH-------HHHHHHHHHHHHHHHHHH-hc---chhcccchhhhccCH Confidence 8889999999998765 53333322223344444 234556665555544322 22 235566667777777 Q ss_pred HHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCCCCCCcCCCCCC Q lcl|NC_016762. 392 AERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQ 455 (456) Q Consensus 392 ke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~ 455 (456) +++++. ...++.+| +++++|+|+.+...+...++-+............+|+.+++ T Consensus 332 ~~~~~~-------~~~l~~~g--~~t~nE~r~~l~~~~~~~~~~~~~~~~~~~~~~gGd~~~~~ 386 (386) T protein:vir:49 332 SNYISL-------INSMVKSG--TLAQNQGLYILQQAEILPKELPDGKNPNRTSLKGGEINEQD 386 (386) T ss_pred HHHHHH-------HHHHHhCC--CcCHHHHHHHHhhCCCCCCcCcchhccCCCCCCCCCCCCCC Confidence 666543 34567777 99999999988765543221111110001111112222111 No 71 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=99.55 E-value=1.2e-14 Score=96.85 Aligned_cols=397 Identities=16% Similarity=0.066 Sum_probs=186.3 Q ss_pred CCchhHHHHhH--HHHHHH---HHHHHHHhhhhhccCcc-cchhhhhcc-----CcccCCHHHHHHHHhcCchhhhhhcc Q lcl|NC_016762. 1 MTDKLDLAVNH--AMSSAI---ARARMSLLNQGIGHDAK-RPQAWCEYG-----FPQEITFNDLYTMYRRGGIAHGAVEK 69 (456) Q Consensus 1 ~~~~~~~~~~~--a~~~~~---~~~~d~~~n~~~~~gt~-~~~~~~~~~-----~~~~~~~~~l~~~Y~~~~l~r~iVd~ 69 (456) |.+.++-..+. +.+-+. ....-...+......++ .+..+..|. .....+. .-+.++..+.+||++ T Consensus 3 l~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~~----~~al~~~~V~~ci~~ 78 (431) T protein:vir:10 3 LFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTGRE----TRALRNMAVLRCVTL 78 (431) T ss_pred chhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCccCcceech----hhhhccHHHHHHHHH Confidence 33333221110 000000 00000000000000000 011111110 0111121 122357889999999 Q ss_pred chhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCc Q lcl|NC_016762. 70 IVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNG 148 (456) Q Consensus 70 ~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~ 148 (456) +++++-.--+.+...++.........+...|...=...--+..|.+.+-+ -.++|-++++|.- |+ +. T Consensus 79 Ia~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r-~~-----------g~ 146 (431) T protein:vir:10 79 ISGTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVW-SG-----------NR 146 (431) T ss_pred HHHhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEE-cC-----------Cc Confidence 99999766666654322211111111111121111112223345554444 4556777776643 22 12 Q ss_pred eeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC---cCCCcchHHHHHHHHHH Q lcl|NC_016762. 149 LAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW---TGDAIGFLEPAYNSFIS 225 (456) Q Consensus 149 l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~---~~~G~S~le~~~~~l~~ 225 (456) +..+.|+....+++.. +. -|.+ +|.+.. .+ +..+.+.++.|+||... ...|.|.++.+.+.+-. T Consensus 147 ~~~L~pl~~~~v~~~~---~~----~~~~-~y~~~~--~~---g~~~~~~~~dViHir~~~~dg~~G~spi~~~~~~i~~ 213 (431) T protein:vir:10 147 PIRLIPMDRGSAKGRL---TS----TWQI-VYDYTT--PT---GDKIELPAREVFHLRDLSIDGVSGVSRVKLSGNALEL 213 (431) T ss_pred eEEEEEEcCceeEEEE---cC----CCeE-EEEEEe--CC---ceEEEEchhhEEEecCcCCCCcccccHHHHHHHHHHH Confidence 3345555544333311 11 1233 344432 12 22456888888888543 24589999988876644 Q ss_pred HHHHHHHHHHHHHHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhc---CC-CeEEecCCCceeEEec Q lcl|NC_016762. 226 LEKVEGGSGESFLKNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNR---GN-DVLLPTQGATVTQMVS 300 (456) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~---~~-~~~lid~~d~~~~~~~ 300 (456) . .....+...++++..+.-. ++.... + .++..+++.+.+....+ |. +.++++.+-+|++++. T Consensus 214 ~-~~~~~~~~~~f~ng~~p~gil~~~~~-----l-------s~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~ 280 (431) T protein:vir:10 214 A-EQAERAASRTFRTGVMAGGAIEVPKE-----L-------SDNAYGRMKASVQENHTGSENAGSWMLLEEGATAKQFSN 280 (431) T ss_pred H-HHHHHHHHHHHhccCCccEEEecCCC-----C-------CHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccC Confidence 3 4444555566666443221 111111 1 12233444444433322 22 4566777778988877 Q ss_pred ccCCH--HHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC--- Q lcl|NC_016762. 301 AVSDP--GPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP--- 374 (456) Q Consensus 301 ~~sgl--~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~--- 374 (456) +.... -+.......+||.+.|||..+|-+...+..++.+ -...|+.. -|.|.+.++-+.|-+.-+.+ T Consensus 281 ~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~-------tL~P~~~~ie~~ln~~Ll~~~~~ 353 (431) T protein:vir:10 281 TAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTSWGSGIEQLAIFFIQY-------GLSHWFVSWEQAAARAFLPEKML 353 (431) T ss_pred ChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCCccccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhccChhhc Confidence 65433 3444455778999999999866444433333333 23456544 37787777665554332221 Q ss_pred CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcC--CcCcCHHHHHHHhcccCCCC--CCCCcc--cCCCCCCCCC Q lcl|NC_016762. 375 LKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTG--EPVFTAEEIREEAGYDPLQG--GDPLPD--TEPEDEDAAR 448 (456) Q Consensus 375 ~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g--~~~i~~~E~R~~~~~~~~~~--~~~~~~--~~~~d~~~~~ 448 (456) ....+.|.+..|...+.+++++. ..+++..| .+.+|+||+|+..+++|+.+ ++..-. ......+..+ T Consensus 354 ~~~~~~fd~~~llr~d~~~r~~~-------~~~~~~~G~~~g~lT~NE~R~~~gl~p~~~~~gD~~~~p~n~~~~~~~~~ 426 (431) T protein:vir:10 354 GQRQFKFNEGALLRGTLNDQAAF-------FSKALGAGGQSPWMKQNEVREMLDLPRADDPVADQLRNPMTQKQKGSGDE 426 (431) T ss_pred CCceEEEechhhhccCHHHHHHH-------HHHHHhcccccCccCHHHHHHHhCCCCCCCccccceecccccccCCCCCC Confidence 11235666667777777776554 44555555 23699999999999999865 432211 1111111112 Q ss_pred cCCCC Q lcl|NC_016762. 449 TDPTG 453 (456) Q Consensus 449 ~d~~~ 453 (456) +..+. T Consensus 427 ~p~~~ 431 (431) T protein:vir:10 427 PPATT 431 (431) T ss_pred CCCCC Confidence 22222 No 72 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=99.53 E-value=1.9e-14 Score=95.80 Aligned_cols=403 Identities=12% Similarity=0.004 Sum_probs=172.5 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCE Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQ 80 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~ 80 (456) |..++=.++.-.-..++.. .+..+ .....+ .+.++ |...+++..|.++|+.|+.++.||++++++..+-.+. T Consensus 1 ~~~~~~~i~s~~~~~~i~~--~~~~s----~~~~~~-~~~~~-~~pp~~~~~la~l~~~n~~v~scI~~ia~~IA~l~~~ 72 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKR--EEVES----QALGET-RFEEY-VEPKVNPLVLLSLLQVNPYHASACSIKANDIIRTGYI 72 (542) T ss_pred Cccccccccccccchhhhh--ccccc----cccccc-cCCcc-ccCCCCHHHHHHHHhhcHHHHHHHHHHHHHHhhCcee Confidence 6665433322111111110 00000 000000 11112 3345788999999999999999999999999999888 Q ss_pred EecCCCcchhhhhHHHHHHHHHHH--HHhhHHHHHHHHHHhhcccCceEEEEEecC-CCCccccccCCcCceeEEEEecc Q lcl|NC_016762. 81 VIEGDDQDRSKDETEWERKNKPLI--AGGRFWRAVSEADRRRLVGRYSGLLLHIRD-SQPWDRPARGKLNGLAKVTPAWA 157 (456) Q Consensus 81 i~~~~~~d~~~~~~~~e~~i~~~~--~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D-~~~~~~Pl~~~~~~l~~i~~~~~ 157 (456) +....... +...+ ......+-+...+..-.++|-|++.+.-+. |+ +..+.|+.. T Consensus 73 ~~~~~~~~-----------l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~------------~~~L~~l~~ 129 (542) T protein:vir:41 73 LEGDDEGV-----------VDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGD------------PIRFEYIPS 129 (542) T ss_pred eecccchh-----------hhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCc------------EEEEEEEcC Confidence 74322111 11111 011122222333445677788877654322 22 122222222 Q ss_pred ccCChhhhhcccc-ccccCCceeEEEeec---ccCCcc-ccceeeehhhhheecC----CcCCCcchHHHHHHHHHHHHH Q lcl|NC_016762. 158 GCLKPKSFDEKPD-SETYGQPTMWEYTEA---SQAGRP-GLVRDIHPDRVFILGD----WTGDAIGFLEPAYNSFISLEK 228 (456) Q Consensus 158 ~~~~~~~~~~Dp~-s~~yg~P~~y~i~~~---~~~g~~-~~~~~IH~SRli~~~~----~~~~G~S~le~~~~~l~~~~~ 228 (456) ..+++.. +.+.. .-..+....|..... ...... ..+..+-++-||||.. ....|.|.+..+...+... . T Consensus 130 ~~v~v~~-d~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g~~~~~~~~~eIiHir~~~~~~~~~Glspi~~~~~~i~~~-~ 207 (542) T protein:vir:41 130 HTIRVHK-DGSRYRQTWDGVNITHFKDYRYEGEINPETGEDQDSVGANELVFIHIPSPVCSYYGVPRYVSAAPAILAM-Q 207 (542) T ss_pred cceEEEE-cCCeeEeeecCCcceeEEeecccccccccccccccccCcccEEEecCCCCCCCcccccHHHHHHHHHHHH-H Confidence 1121110 00000 001111122211100 000000 0112334445555532 3356999999888766443 3 Q ss_pred HHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHH----HHhcCCCeEEec-------CCCceeE Q lcl|NC_016762. 229 VEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAAR----QLNRGNDVLLPT-------QGATVTQ 297 (456) Q Consensus 229 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~lid-------~~d~~~~ 297 (456) .+......+|++....-.+............ ....-..+..+++.+.++ ....+.+..++. .+-+|+. T Consensus 208 ~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~-~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~p 286 (542) T protein:vir:41 208 KIDEYNYAFFDNYTIPSYVITVTGEFEDELE-EDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTFTP 286 (542) T ss_pred HHHHHHHHHHhccCCccEEEEeCCccccccc-cccccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCcccceeEEE Confidence 3444555566664332111110000000000 000011222333333332 222333434432 2234555 Q ss_pred EecccCC--HHHHHHHHHHHHHhhhcCCeEEeeccCC-Ccccch---HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhc Q lcl|NC_016762. 298 MVSAVSD--PGPTYNVNLQTAAAGVDIPTKILVGMQT-GERASS---EDQKYHNARCQARRVQELTFEINDLFAHLMRIG 371 (456) Q Consensus 298 ~~~~~sg--l~~~~~~~~~~~aaas~IP~t~L~G~sp-~Glnst---~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~ 371 (456) ++.+..+ +-+......++||++.+||... +|... +.+|++ +-..+||.. .|.|.++.+-..|-+. T Consensus 287 l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~-lG~~~~~t~n~sn~Eq~~~~f~~~-------tL~P~~~~ie~~ln~~- 357 (542) T protein:vir:41 287 LNTSQKELSFREYAAEKKYDIAAAHMIDPYR-LGIADTGPLGGNFAEVTRRTYYES-------VVRPQQNIISSILTDF- 357 (542) T ss_pred cCCChhHHHHHHHHHHHHHHHHHHhCCCHHH-hCcCCCcccccccHHHHHHHHHHH-------HHHHHHHHHHHHHHhh- Confidence 5544332 2344455678899999999875 57654 445532 234555544 3567776665555432 Q ss_pred CcC-CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHh-cccCCCCCCCCcccC-----CCCC Q lcl|NC_016762. 372 VVP-LKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEA-GYDPLQGGDPLPDTE-----PEDE 444 (456) Q Consensus 372 ~~~-~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~-~~~~~~~~~~~~~~~-----~~d~ 444 (456) +.+ ...++.|+|+...-+... + +.....++..| +++++|+|+.+ +.+|.++-.-.+... ...+ T Consensus 358 L~~~~~~~~~~~f~~~~ll~~d-~-------~~~~~~~v~~G--ilT~NE~Re~L~g~~pgdd~~l~p~~~~~~~~~~~~ 427 (542) T protein:vir:41 358 FQVKFNPKTRFKFNDETLLESD-S-------VRNCALLVQSG--VLTPAEARERLFGLDGGPDIFMVPSKGAAKSVKRQE 427 (542) T ss_pred cccccCCceEEEecchhhcchH-H-------HHHHHHHHhCC--CCCHHHHHHhhCCCCCCCccccccccccccccccCC Confidence 222 223577777754433321 1 11234567777 99999999854 555533210000000 0000 Q ss_pred CCCCcCCCCCCC Q lcl|NC_016762. 445 DAARTDPTGEQQ 456 (456) Q Consensus 445 ~~~~~d~~~~~e 456 (456) ...+.++..+.+ T Consensus 428 ~n~~~~~~~~~~ 439 (542) T protein:vir:41 428 RNYEKNQIREIR 439 (542) T ss_pred cCCCCCchhhhh Confidence 000000000000 No 73 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=99.52 E-value=2e-14 Score=95.63 Aligned_cols=378 Identities=10% Similarity=-0.023 Sum_probs=178.7 Q ss_pred HHHhhhhhccCcccchhhhhccCc-ccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHH Q lcl|NC_016762. 22 MSLLNQGIGHDAKRPQAWCEYGFP-QEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKN 100 (456) Q Consensus 22 d~~~n~~~~~gt~~~~~~~~~~~~-~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i 100 (456) ++|.+.........+..+..+... ....+....++ .+.-+.++|+++|+++-+--+.+...+.. . .....+...+ T Consensus 1 m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Al--~~~~V~~~i~~Ia~~iA~lp~~~~~~~g~-~-~~~~~~~~lL 76 (406) T protein:vir:97 1 MSFFQPLGTSKVSYDDYISSVLAGDVSQKYLGVSAL--KNSDILTATSIIAGDIARFPLVKKDVNGD-I-IHDEDINYLL 76 (406) T ss_pred CccccccCCCCCCcchHHHHHhcCCCCcccccchhh--ccHHHHHHHHHHHHhhhhCeeEEEecCcc-c-cccchHHHHh Confidence 555543222222222233333211 11111111221 35566779999999887654444322211 1 1111111212 Q ss_pred HHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccccccCCcee Q lcl|NC_016762. 101 KPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTM 179 (456) Q Consensus 101 ~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~ 179 (456) +..=..+.-+..|.+.+-+ -.++|-+++++. +|++. +.+..+.|+-...+++. .+ +.|.+ . T Consensus 77 ~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~-r~~~~---------g~~~~L~~i~p~~v~v~---~~----~~~~~-~ 138 (406) T protein:vir:97 77 NVKSTSNASARTWKFAMAVNAILTGNSFSRIL-RDPKT---------NQALQFQFYRPSETTVE---ET----DNHEI-V 138 (406) T ss_pred hccCCCCCCHHHHHHHHHHHHhhcCCeEEEEE-ecCCC---------CeEEEEEEECCCeeEEE---Ec----CCceE-E Confidence 1111122233344444444 455677776653 23211 11234445444434331 11 12333 3 Q ss_pred EEEeecccCCccccceeeehhhhheecCCc---CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHh Q lcl|NC_016762. 180 WEYTEASQAGRPGLVRDIHPDRVFILGDWT---GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLG 256 (456) Q Consensus 180 y~i~~~~~~g~~~~~~~IH~SRli~~~~~~---~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 256 (456) |.+... .+ +..+.+.++.||||...+ .-|.|.++.+.+.+-... .+......++++..+.-.+. .... T Consensus 139 y~~~~~-~~---~~~~~~~~~evih~r~~~~dg~~G~spi~~~~~~i~~~~-a~~~~~~~~f~ng~~~~~i~-~~~~--- 209 (406) T protein:vir:97 139 YTFTDM-LT---AKQVKCFAHDVIHWKFFSHDTILGRSPLLSLGDEIDLQT-GGINTLIKFFKDGFSSGILT-MKGA--- 209 (406) T ss_pred EEEEec-CC---ceEEEEccccEEEecCCCCCCcccccHHHHHHHHHHHHH-HHHHHHHHHHhccCCCceEE-ecCC--- Confidence 455421 11 223567788898885433 238999998887664433 33444445666654322111 1100 Q ss_pred hHHhhhcCCHHHHHHHHHHHHHHHhcCC---CeEEecCCCceeEEecccCCHH--HHHHHHHHHHHhhhcCCeEEeeccC Q lcl|NC_016762. 257 EIASTYGVTLDALNERFNEAARQLNRGN---DVLLPTQGATVTQMVSAVSDPG--PTYNVNLQTAAAGVDIPTKILVGMQ 331 (456) Q Consensus 257 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~lid~~d~~~~~~~~~sgl~--~~~~~~~~~~aaas~IP~t~L~G~s 331 (456) .-.++..+++.+.++.+.++. ..++++.+.+|++++.+..... +........||.+.|||-..|-+++ T Consensus 210 -------~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~ 282 (406) T protein:vir:97 210 -------QLSGDARQRARQEFEKMREGSVGGSPLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNS 282 (406) T ss_pred -------CCCHHHHHHHHHHHHHHhcccccCceeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCC Confidence 001233344444455554432 3456677888999886654433 3344457789999999998774322 Q ss_pred CCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC-CCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 332 TGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL-KAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIG 410 (456) Q Consensus 332 p~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~-~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~ 410 (456) .+-|..+-.++||. .-|.|.++.|-+.|-+.-+.+. ...+.|+|+ +.. ..+..+++..+.+. T Consensus 283 -~~~~~e~~~~~f~~-------~~l~P~~~~ie~~l~~kll~~~~~~~~~i~fd-~~~--------~~~~~~~~~~~~~~ 345 (406) T protein:vir:97 283 -PNQSVAQLMEDYVT-------NDLPFYFDAITSELGLKTLNDKDRRLYHIEFD-TRS--------VTGRNVDEIVKLVN 345 (406) T ss_pred -CcchHHHHHHHHHH-------HHHHHHHHHHHHHHhhhhcChhhccceeEEEe-cCc--------cchhhHHHHHHHHh Confidence 22122233455654 3477877777666544322221 134567775 111 12234555666777 Q ss_pred cCCcCcCHHHHHHHhcccCCCC--CCCCc---ccCCCCCC---------CCCcCC-CCCCC Q lcl|NC_016762. 411 TGEPVFTAEEIREEAGYDPLQG--GDPLP---DTEPEDED---------AARTDP-TGEQQ 456 (456) Q Consensus 411 ~g~~~i~~~E~R~~~~~~~~~~--~~~~~---~~~~~d~~---------~~~~d~-~~~~e 456 (456) .| +++++|+|+..+++|+.+ ++..- .-.+.+.. ..++.+ .+++. T Consensus 346 ~g--~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~gg~~~~~~~ 404 (406) T protein:vir:97 346 NQ--ILTPNQGLVELGKQKSTDPNMDRYQSSLNYVFLDKKEEYQDKVGIKGKGGEVNAEED 404 (406) T ss_pred CC--CcCHHHHHHHhCCCCCCCCCCCeEeeccCccchhcccccccccccccCCCCCCCCCC Confidence 77 999999999999998765 22210 00111110 111111 11111 No 74 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=99.52 E-value=5e-14 Score=93.49 Aligned_cols=408 Identities=12% Similarity=0.090 Sum_probs=179.3 Q ss_pred CCchh-------HHHHhHHHHHHHHHHHHHHhhhhhccCc----------ccchhhhh------c---cCcc---c---- Q lcl|NC_016762. 1 MTDKL-------DLAVNHAMSSAIARARMSLLNQGIGHDA----------KRPQAWCE------Y---GFPQ---E---- 47 (456) Q Consensus 1 ~~~~~-------~~~~~~a~~~~~~~~~d~~~n~~~~~gt----------~~~~~~~~------~---~~~~---~---- 47 (456) |-.+| +|-++..-.....-..|.+.+.+..+.. +.++++.+ . +|.. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~~~p~~~~~~ 80 (576) T protein:vir:96 1 MVTRLADIFKRLRLGRDYEDIIDTVPIDDGLQANIRNIEEKSKELNKSLYGKQQAYAEPFLEVMDTNPEFRTKRSYMKNS 80 (576) T ss_pred ChhhHHHHHHHHhccCccccchhhhhcccChhHHHHHhhhhhhhhccccCCccchhhcceeeeeecCCCccccCcchhhh Confidence 32222 1111100000011112233222222210 01111110 1 1111 1 Q ss_pred CCHHHHHHHHhcCchhhhhhccchhHHhh-----------CCCEEecCCCc--chhhhhHHHHHHHHHHHHHhh------ Q lcl|NC_016762. 48 ITFNDLYTMYRRGGIAHGAVEKIVTTCWK-----------TNPQVIEGDDQ--DRSKDETEWERKNKPLIAGGR------ 108 (456) Q Consensus 48 ~~~~~l~~~Y~~~~l~r~iVd~~aed~tR-----------~~~~i~~~~~~--d~~~~~~~~e~~i~~~~~~l~------ 108 (456) .+...+...|..|.++++||+++++++.+ -++.|.-.+.+ ...+....+ ..+...+..+. T Consensus 81 ~~~~~~l~~~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~-~~l~~~l~~~~~~~~p~ 159 (576) T protein:vir:96 81 DNLHDVLKQFGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEI-KRIENFILNTGRDKDID 159 (576) T ss_pred hhhHHHHHHhhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhh-hhHHhhHhhccCCCCCc Confidence 12345667788999999999999988754 23333221111 111111111 11222222211 Q ss_pred --HHHHHHHHHHh-hcccCceEEEEEe-cCCCCccccccCCcCceeEEEEeccccCChhhhhccccccccCCceeEEEee Q lcl|NC_016762. 109 --FWRAVSEADRR-RLVGRYSGLLLHI-RDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTE 184 (456) Q Consensus 109 --~~~~~~ea~~~-~r~~Ggs~i~i~i-~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~ 184 (456) -+..|.+.+.+ -.++|-+++++.. +++. +.+..+.|+....+++. .+.....|..+..|... T Consensus 160 ~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~~----------g~~~~L~pl~p~~V~v~---~~~dg~~~~~~~~~~~~- 225 (576) T protein:vir:96 160 RDSFQSFCRKIVRDTYTYDQVNFEKVFNKKNA----------TTMDKFIAVDPSTIFYA---TDKNGKIIKGGKRFVQV- 225 (576) T ss_pred cccHHHHHHHHHHHHHhcCCeEEEEEEecCCC----------CceEEEEEeCCceeEEE---ECCCCceeeeeeEEEEe- Confidence 12335555444 5677877776544 2321 11334444443333331 22223333334433221 Q ss_pred cccCCccccceeeehhhhheecC-------CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh-hhhhhccHh Q lcl|NC_016762. 185 ASQAGRPGLVRDIHPDRVFILGD-------WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLL-NFDKEINLG 256 (456) Q Consensus 185 ~~~~g~~~~~~~IH~SRli~~~~-------~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~-~~~~~~~~~ 256 (456) .++. ....+.++.+|++.. ....|.|.++.+...+.....+ ..+...+|++....-.+ ++.... T Consensus 226 --~~~~--~~~~~~~~dii~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~-~~~~~~~f~Ng~~p~giL~~~~~~--- 297 (576) T protein:vir:96 226 --INKK--VVASFTSREMAMGIRNPRTELSSSGYGLSEVEIAMKQFIAYNNT-ETFNDRFFSHGGTTRGILQIKSEQ--- 297 (576) T ss_pred --cCCc--eEEEecccceEEEeecCCCCcccCcccccHHHHHHHHHHHHHHH-HHHHHHHHhccCCCceEEEeCCCC--- Confidence 1111 112334444443321 1345999999998877554443 44445566664432211 110000 Q ss_pred hHHhhhcCCHHHHHHHHHHHHHHHhcC---CC--eEEecCCCceeEEecccC--CHHHHHHHHHHHHHhhhcCCeEEeec Q lcl|NC_016762. 257 EIASTYGVTLDALNERFNEAARQLNRG---ND--VLLPTQGATVTQMVSAVS--DPGPTYNVNLQTAAAGVDIPTKILVG 329 (456) Q Consensus 257 ~l~~~~~~~~~~~~~~~~~~~~~~~~~---~~--~~lid~~d~~~~~~~~~s--gl~~~~~~~~~~~aaas~IP~t~L~G 329 (456) ...++..+++.+.+....++ .+ .+++..+-+|+.++.+.. .+-+......+.||.+.|||...| | T Consensus 298 -------~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~l-G 369 (576) T protein:vir:96 298 -------QQSQRALENFKREWKSSFSGINGSWQVPVVMADDIKFVNMTPTANDMQFEKWLTYLINIISALYGIDPAEI-G 369 (576) T ss_pred -------CCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEeccCChhhHHHHHHHHHhHHHHHHHhCCCHHHc-c Confidence 01123344444444443332 22 245566778888876654 445666677889999999998755 7 Q ss_pred cCCCcc----------c-c-hH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHH Q lcl|NC_016762. 330 MQTGER----------A-S-SE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLA 396 (456) Q Consensus 330 ~sp~Gl----------n-s-t~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Ae 396 (456) ..-++- + + .+ -.+.||.. -|.|.+..+-..|-+.-+-....++.|+|. ..+.+.+++ T Consensus 370 ~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~-------tL~P~~~~ie~~ln~~Ll~~~~~~~~~~f~---r~d~~~~~e 439 (576) T protein:vir:96 370 FPNRGGATGGKGGNTLNEADPGKKQQQSQNK-------GLQPLLRFIEDLINTHIISEYSDKYVFQFV---GGDTKSELD 439 (576) T ss_pred ccccccccccccccccccccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhhchhccCceEEEec---cCCHHHHHH Confidence 543221 1 1 11 23344443 367777666555433212122235666664 334444443 Q ss_pred HHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc--------c---------cCCCCC------------CCC Q lcl|NC_016762. 397 NSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP--------D---------TEPEDE------------DAA 447 (456) Q Consensus 397 i~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~--------~---------~~~~d~------------~~~ 447 (456) .. ++ ...+..| +++++|+|+..+++|+++++.+- . .+...+ ++. T Consensus 440 ~~----~~-~~~~~~G--~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~ 512 (576) T protein:vir:96 440 KI----KI-LQEEVKT--YKTVNEARKEKGLKPIEGGDVLLDGSFIQSMSLNTQKEQYEDTKQKERFDMIQQFLNSPDDE 512 (576) T ss_pred HH----HH-HHHHhcC--ccCHHHHHHHhCCCCCCCcceeccccccccccccccCCCCCCccccccccccccccCCCCCC Confidence 22 11 1233456 99999999999999887654311 0 000000 000 Q ss_pred CcCC-CCCCC Q lcl|NC_016762. 448 RTDP-TGEQQ 456 (456) Q Consensus 448 ~~d~-~~~~e 456 (456) ++.+ +.++. T Consensus 513 ~~~~~s~~~~ 522 (576) T protein:vir:96 513 EPQQESTEDK 522 (576) T ss_pred CCCCCCCCCc Confidence 0111 00100 No 75 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=99.52 E-value=1.5e-14 Score=96.31 Aligned_cols=387 Identities=12% Similarity=0.030 Sum_probs=187.5 Q ss_pred CCchhHHHH-hHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCC Q lcl|NC_016762. 1 MTDKLDLAV-NHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNP 79 (456) Q Consensus 1 ~~~~~~~~~-~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~ 79 (456) |.-.-.... ..+++.. +.+..+..+.+..... .| ..++.+ .+.++..+.+||+++|++.-.--+ T Consensus 1 m~~~~~~~~~~~~~s~~-----~~w~~~~~~~~~~~~~----~g--~~vt~~----~al~~~~v~~~i~~Ia~~iA~lp~ 65 (421) T protein:vir:10 1 MFIPQMFEGKKRSVSGG-----GFWEAMLGGVRSSHSK----AG--VMITPE----TALALSAVRACVTLLAESVAQLPV 65 (421) T ss_pred CCCcchhcccccccCcc-----hhhHHHhhhhccCccc----CC--ceechH----HhhccHHHHHHHHHHHHhhccCce Confidence 321111100 0111110 1111111111111100 01 123332 234678899999999999877666 Q ss_pred EEecCCCcchhhh--hHHHHHHHHHHHHHhhHHHHHHHHHH-hhcccCceEEEEEecCCCCccccccCCcCceeEEEEec Q lcl|NC_016762. 80 QVIEGDDQDRSKD--ETEWERKNKPLIAGGRFWRAVSEADR-RRLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAW 156 (456) Q Consensus 80 ~i~~~~~~d~~~~--~~~~e~~i~~~~~~l~~~~~~~ea~~-~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~ 156 (456) .+...+++...+. ...+-..+...=....-+..|.+.+. +-.++|-+++++. +++. +.+..+.|+- T Consensus 66 ~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~-r~~~----------G~~~~L~~l~ 134 (421) T protein:vir:10 66 ELYRRDKNGGRQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIID-RDGK----------GYPKELIPIN 134 (421) T ss_pred EEEEEcCCCceeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEE-EcCC----------CcEEEEEEec Confidence 6654332221111 11111112111111222344545444 4556676766553 3321 1123444444 Q ss_pred cccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC---cCCCcchHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 157 AGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW---TGDAIGFLEPAYNSFISLEKVEGGS 233 (456) Q Consensus 157 ~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~---~~~G~S~le~~~~~l~~~~~~~~~~ 233 (456) ...+++. .| ..|.+ +|++.. .+..+.++-+||+... ...|.|.++.+.+.+... ...... T Consensus 135 ~~~v~v~---~~----~~g~~-~y~~~~--------~g~~~~~~eiih~~~~~~d~~~G~spi~~~~~~i~~~-~~~~~~ 197 (421) T protein:vir:10 135 PKKVIVL---KG----PDGMP-YYEIPE--------IGETLPMRMMHHVKVFSLDGYIGSSPIQTNADVLGLN-LAVEEH 197 (421) T ss_pred CceEEEE---EC----CCceE-EEEEcC--------CCcEEchhhEEEecCcCCCCcccccHHHHHHHHHHHH-HHHHHH Confidence 3333331 12 12444 344531 1235666666666432 235899999888766443 334444 Q ss_pred HHHHHHHhhhhh-hhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEEecccCCH--H Q lcl|NC_016762. 234 GESFLKNAARQL-LLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQMVSAVSDP--G 306 (456) Q Consensus 234 ~~~~~~~~~~~l-~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~~~~~sgl--~ 306 (456) ...++++..+.- .++.... + .+....+..+++.+.+....++ .+.++++.+-+|++++.+.... - T Consensus 198 ~~~~f~ng~~~~gil~~~~~-----~---~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~ 269 (421) T protein:vir:10 198 ASAVFRRGATMSGVIERPKE-----A---PAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQEGMSYKQMSQDNEKAQLL 269 (421) T ss_pred HHHHHhcCCCccEEEEecCc-----c---CccCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEecCCChhHHHHH Confidence 455666543321 1221110 1 0111133344444444443322 3457777788999998766543 4 Q ss_pred HHHHHHHHHHHhhhcCCeEEeeccCC-CcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC---CCCceEE Q lcl|NC_016762. 307 PTYNVNLQTAAAGVDIPTKILVGMQT-GERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP---LKAEFTA 381 (456) Q Consensus 307 ~~~~~~~~~~aaas~IP~t~L~G~sp-~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~---~~~d~~~ 381 (456) +......+.||.+.|||-..| |... +..++.+ -...||.. -|.|.+..+-..|-+.-+.+ ....|.| T Consensus 270 e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~sn~e~~~~~f~~~-------tl~P~~~~ie~~ln~kL~~~~~~~~~~v~f 341 (421) T protein:vir:10 270 QSRQWGVEEVCRLYKIPPHMV-QMLAKATNNNIEHQGLQFVMY-------TLLAWLKRHEGALQRDLLLPSERRDLYIEF 341 (421) T ss_pred HHHHHhHHHHHHHhCCCHHHc-CCCcCCccccHHHHHHHHHHH-------HHHHHHHHHHHHHhhhccCccccCCeEEEE Confidence 455567888999999997654 5433 3333323 34556554 46777776655554332221 1123667 Q ss_pred EeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc---------ccCCCCCCCCCcCCC Q lcl|NC_016762. 382 IWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP---------DTEPEDEDAARTDPT 452 (456) Q Consensus 382 ~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~---------~~~~~d~~~~~~d~~ 452 (456) ....|...|.+++++ +..+++..| +++++|+|+..+++|+++++..- ...+.+ ..+...++ T Consensus 342 d~~~l~~~d~~~~~~-------~~~~~~~~G--~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~-~~~~~~~~ 411 (421) T protein:vir:10 342 NVSGLLRGDQKSRYE-------SYALGRQWG--WLSVNDIRRMENLPPIAGGDKYLTPLNMVDSAQIIPGD-KKPTAQQM 411 (421) T ss_pred echhhhccCHHHHHH-------HHHHHHhCC--CcCHHHHHHHhCCCCCCCcceeeeccccccccccccCC-CCcccccC Confidence 777888888888765 445567777 99999999999999987665421 111111 11111122 Q ss_pred CCCC Q lcl|NC_016762. 453 GEQQ 456 (456) Q Consensus 453 ~~~e 456 (456) ++.. T Consensus 412 ~e~d 415 (421) T protein:vir:10 412 AEID 415 (421) T ss_pred cccc Confidence 2211 No 76 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=99.51 E-value=2.3e-14 Score=95.37 Aligned_cols=389 Identities=15% Similarity=0.096 Sum_probs=186.4 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhh---hccCcc--cch--hhh--hccCc----ccCCHHHHHHHHhcCchhhhhh Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQG---IGHDAK--RPQ--AWC--EYGFP----QEITFNDLYTMYRRGGIAHGAV 67 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~---~~~gt~--~~~--~~~--~~~~~----~~~~~~~l~~~Y~~~~l~r~iV 67 (456) |.-|-+.. -+. -+.+.+++ -..++. .+. .+- -.+++ ...+.+ . +.++.-+.++| T Consensus 11 ~~~~~~~~----~~~-----~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~-al~~~~V~~cv 77 (441) T protein:vir:79 11 VDFKSRKQ----SRK-----ELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDI---E-AIRHSDIFTAV 77 (441) T ss_pred cccccccc----chh-----hhhccccccccccccccCCCcchHHHHHHhcccCcccccccchh---h-hhccHHHHHHH Confidence 33332221 000 01111111 000000 000 000 00111 111111 1 23455667789 Q ss_pred ccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCc Q lcl|NC_016762. 68 EKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKL 146 (456) Q Consensus 68 d~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~ 146 (456) +++|++.-.--+++..+.+..... .+-..|...=..+--+..|.+++.+ -.++|-|++++. +++. T Consensus 78 ~~Ia~~iA~lp~~~~~~~~~~~~~---~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~-r~~~---------- 143 (441) T protein:vir:79 78 MMIASDLARMPIRVTVNGQINYSD---RIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEIT-RDKT---------- 143 (441) T ss_pred HHHHHhhccCceeeecCccccccc---hHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEE-ECCC---------- Confidence 999988876656665433221111 1111111110111112334444444 566788777653 3321 Q ss_pred CceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc---CCCcchHHHHHHHH Q lcl|NC_016762. 147 NGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT---GDAIGFLEPAYNSF 223 (456) Q Consensus 147 ~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~---~~G~S~le~~~~~l 223 (456) +.+..+.|+-...+++ ..| +.|.+.++... .... .....+.++++.||||...+ ..|.|.++.+.+.+ T Consensus 144 G~~~~L~~i~~~~v~v---~~d----~~g~~~~~~~~-~~~~-~~~~~~~~~~~dvih~k~~~~dg~~G~spl~~~~~~i 214 (441) T protein:vir:79 144 GEPMNLTFRKTSEIEL---KSD----ARGRLYYFHQR-IDSN-GNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTI 214 (441) T ss_pred CcEEEEEEEcCceeEE---EEC----CCccEEEEEEE-eccC-CceeEEEEccccEEEeccCCCCCccccCHHHHHHHHH Confidence 1133344444333332 122 23455443322 1111 11223568888888885332 35899999988766 Q ss_pred HHHHHHHHHHHHHHHHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEE Q lcl|NC_016762. 224 ISLEKVEGGSGESFLKNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQM 298 (456) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~ 298 (456) -. ..........++++..+.-. ++.... + ..++..+++.+.+....++ ...++++.+.+|+.+ T Consensus 215 ~~-~~~~~~~~~~~f~ng~~p~gil~~~~~-----~------~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l 282 (441) T protein:vir:79 215 ES-DNNGKDFLNNFLRNGTHAGGILKMKGV-----L------DNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQL 282 (441) T ss_pred HH-HHHHHHHHHHHHhccCCCcEEEEcCCC-----C------CCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEc Confidence 43 34444555556666443221 111110 0 1123344444444443332 235677777889998 Q ss_pred ecccCC--HHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCC Q lcl|NC_016762. 299 VSAVSD--PGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLK 376 (456) Q Consensus 299 ~~~~sg--l~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~ 376 (456) +.+..+ +-+........||.+.|||-.. +|...++.+.. +...+|. .-|.|.+..+-..|-+. +.+.. T Consensus 283 ~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~~s~~-q~~~~~~-------~tl~P~~~~ie~eln~k-l~~~~ 352 (441) T protein:vir:79 283 EVDTEVLKLIRENKSSTREIAGVFGIPLHK-FGIETANMSIT-DANLDYL-------STLKPYITCVCAELNFK-FNDEY 352 (441) T ss_pred cCChhHHHHHHHHHHhHHHHHHHhCCCHHH-cCCCCCCccHH-HHHHHHH-------HHHHHHHHHHHHHHhhh-ccccc Confidence 876544 3455566778899999999974 58765554322 3233332 13778877766555433 22222 Q ss_pred --CceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcc--------cCCCCC-- Q lcl|NC_016762. 377 --AEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPD--------TEPEDE-- 444 (456) Q Consensus 377 --~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~--------~~~~d~-- 444 (456) -.|+|.++.|...|.++++ ++...++..| +++++|+|+..+++|+++++.+.- .+..++ T Consensus 353 ~~~~~~fd~~~llr~D~~~~~-------~~~~~~i~~G--~~T~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~ 423 (441) T protein:vir:79 353 VNREFKFDTTEIRVVDEKTQA-------EIDKINIDSG--KMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQ 423 (441) T ss_pred cCceEEeechhhhccCHHHHH-------HHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceEeecccccccccccccc Confidence 2466666778877777764 4456677778 999999999999999877653211 011111 Q ss_pred -----CCCCcCCCCCCC Q lcl|NC_016762. 445 -----DAARTDPTGEQQ 456 (456) Q Consensus 445 -----~~~~~d~~~~~e 456 (456) +.+....+||+. T Consensus 424 ~~~~~~~~~~~kgGe~~ 440 (441) T protein:vir:79 424 MNKSRATDKKLKGGEEN 440 (441) T ss_pred cccccccccccCCCCCC Confidence 011112233322 No 77 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=99.51 E-value=2.3e-14 Score=95.37 Aligned_cols=389 Identities=15% Similarity=0.096 Sum_probs=186.4 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhh---hccCcc--cch--hhh--hccCc----ccCCHHHHHHHHhcCchhhhhh Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQG---IGHDAK--RPQ--AWC--EYGFP----QEITFNDLYTMYRRGGIAHGAV 67 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~---~~~gt~--~~~--~~~--~~~~~----~~~~~~~l~~~Y~~~~l~r~iV 67 (456) |.-|-+.. -+. -+.+.+++ -..++. .+. .+- -.+++ ...+.+ . +.++.-+.++| T Consensus 11 ~~~~~~~~----~~~-----~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~-al~~~~V~~cv 77 (441) T protein:vir:94 11 VDFKSRKQ----SRK-----ELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDI---E-AIRHSDIFTAV 77 (441) T ss_pred cccccccc----chh-----hhhccccccccccccccCCCcchHHHHHHhcccCcccccccchh---h-hhccHHHHHHH Confidence 33332221 000 01111111 000000 000 000 00111 111111 1 23455667789 Q ss_pred ccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCc Q lcl|NC_016762. 68 EKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKL 146 (456) Q Consensus 68 d~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~ 146 (456) +++|++.-.--+++..+.+..... .+-..|...=..+--+..|.+++.+ -.++|-|++++. +++. T Consensus 78 ~~Ia~~iA~lp~~~~~~~~~~~~~---~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~-r~~~---------- 143 (441) T protein:vir:94 78 MMIASDLARMPIRVTVNGQINYSD---RIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEIT-RDKT---------- 143 (441) T ss_pred HHHHHhhccCceeeecCccccccc---hHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEE-ECCC---------- Confidence 999988876656665433221111 1111111110111112334444444 566788777653 3321 Q ss_pred CceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc---CCCcchHHHHHHHH Q lcl|NC_016762. 147 NGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT---GDAIGFLEPAYNSF 223 (456) Q Consensus 147 ~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~---~~G~S~le~~~~~l 223 (456) +.+..+.|+-...+++ ..| +.|.+.++... .... .....+.++++.||||...+ ..|.|.++.+.+.+ T Consensus 144 G~~~~L~~i~~~~v~v---~~d----~~g~~~~~~~~-~~~~-~~~~~~~~~~~dvih~k~~~~dg~~G~spl~~~~~~i 214 (441) T protein:vir:94 144 GEPMNLTFRKTSEIEL---KSD----ARGRLYYFHQR-IDSN-GNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTI 214 (441) T ss_pred CcEEEEEEEcCceeEE---EEC----CCccEEEEEEE-eccC-CceeEEEEccccEEEeccCCCCCccccCHHHHHHHHH Confidence 1133344444333332 122 23455443322 1111 11223568888888885332 35899999988766 Q ss_pred HHHHHHHHHHHHHHHHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEE Q lcl|NC_016762. 224 ISLEKVEGGSGESFLKNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQM 298 (456) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~ 298 (456) -. ..........++++..+.-. ++.... + ..++..+++.+.+....++ ...++++.+.+|+.+ T Consensus 215 ~~-~~~~~~~~~~~f~ng~~p~gil~~~~~-----~------~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l 282 (441) T protein:vir:94 215 ES-DNNGKDFLNNFLRNGTHAGGILKMKGV-----L------DNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQL 282 (441) T ss_pred HH-HHHHHHHHHHHHhccCCCcEEEEcCCC-----C------CCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEc Confidence 43 34444555556666443221 111110 0 1123344444444443332 235677777889998 Q ss_pred ecccCC--HHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCC Q lcl|NC_016762. 299 VSAVSD--PGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLK 376 (456) Q Consensus 299 ~~~~sg--l~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~ 376 (456) +.+..+ +-+........||.+.|||-.. +|...++.+.. +...+|. .-|.|.+..+-..|-+. +.+.. T Consensus 283 ~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~~s~~-q~~~~~~-------~tl~P~~~~ie~eln~k-l~~~~ 352 (441) T protein:vir:94 283 EVDTEVLKLIRENKSSTREIAGVFGIPLHK-FGIETANMSIT-DANLDYL-------STLKPYITCVCAELNFK-FNDEY 352 (441) T ss_pred cCChhHHHHHHHHHHhHHHHHHHhCCCHHH-cCCCCCCccHH-HHHHHHH-------HHHHHHHHHHHHHHhhh-ccccc Confidence 876544 3455566778899999999974 58765554322 3233332 13778877766555433 22222 Q ss_pred --CceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcc--------cCCCCC-- Q lcl|NC_016762. 377 --AEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPD--------TEPEDE-- 444 (456) Q Consensus 377 --~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~--------~~~~d~-- 444 (456) -.|+|.++.|...|.++++ ++...++..| +++++|+|+..+++|+++++.+.- .+..++ T Consensus 353 ~~~~~~fd~~~llr~D~~~~~-------~~~~~~i~~G--~~T~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~ 423 (441) T protein:vir:94 353 VNREFKFDTTEIRVVDEKTQA-------EIDKINIDSG--KMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQ 423 (441) T ss_pred cCceEEeechhhhccCHHHHH-------HHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceEeecccccccccccccc Confidence 2466666778877777764 4456677778 999999999999999877653211 011111 Q ss_pred -----CCCCcCCCCCCC Q lcl|NC_016762. 445 -----DAARTDPTGEQQ 456 (456) Q Consensus 445 -----~~~~~d~~~~~e 456 (456) +.+....+||+. T Consensus 424 ~~~~~~~~~~~kgGe~~ 440 (441) T protein:vir:94 424 MNKSRATDKKLKGGEEN 440 (441) T ss_pred cccccccccccCCCCCC Confidence 011112233322 No 78 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=99.50 E-value=2.8e-14 Score=94.92 Aligned_cols=392 Identities=14% Similarity=0.075 Sum_probs=184.0 Q ss_pred CCchhHHHHhHHHHHH----HHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhh Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSA----IARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWK 76 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~----~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR 76 (456) =+++.+.++....+.. .....+.+......+++.... .....+.. .+.++.-+.+||+++|+++-. T Consensus 17 ~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~----~al~~~~V~acv~~Ia~~iA~ 86 (441) T protein:vir:98 17 KQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGT------KLRQYKDI----EAIRHSDIFTAVMMIASDLAR 86 (441) T ss_pred cchhhhhhccccccccccccccCCCcchHHHHHHhhccccc------Cccccchh----hhhccHHHHHHHHHHHHhhcc Confidence 1111111111111000 000001111111111000000 00111111 123456677799999998877 Q ss_pred CCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEe Q lcl|NC_016762. 77 TNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPA 155 (456) Q Consensus 77 ~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~ 155 (456) --+++....+..... .+-..|...=....-+..|.+++.+ -.++|-+++++. +++. +....+.|+ T Consensus 87 lpl~~~~~~~~~~~~---~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~-r~~~----------G~~~~L~~i 152 (441) T protein:vir:98 87 MPIRVTVNGQINYSD---RIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEIT-RDKT----------GEPMNLTFR 152 (441) T ss_pred CceEEecCCcccccc---hHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEE-EcCC----------CcEEEEEEE Confidence 666665433221111 1111111111111122234444444 456677766653 3321 112334444 Q ss_pred ccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc---CCCcchHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 156 WAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT---GDAIGFLEPAYNSFISLEKVEGG 232 (456) Q Consensus 156 ~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~---~~G~S~le~~~~~l~~~~~~~~~ 232 (456) -...+++. .| .-|.+.+|.... ... ..+..+.+.++.||||...+ ..|.|.++.+.+.+... ..... T Consensus 153 ~~~~v~v~---~~----~~g~~~~~~~~~-~~~-~~~~~~~~~~~dviHir~~~~dg~~G~spi~~~~~~i~~~-~a~~~ 222 (441) T protein:vir:98 153 KTSEIELK---LD----ARGRLYYFHQRI-DSN-GNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESD-NNGKD 222 (441) T ss_pred cCceeEEE---EC----CCCcEEEEEEEe-ccC-cceeeEEEccccEEEeccCCCCCccccCHHHHHHHHHHHH-HHHHH Confidence 43333331 12 124554443321 111 12223567788888875322 35899999988766443 33444 Q ss_pred HHHHHHHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCCceeEEecccCC--H Q lcl|NC_016762. 233 SGESFLKNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGATVTQMVSAVSD--P 305 (456) Q Consensus 233 ~~~~~~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d~~~~~~~~~sg--l 305 (456) ....++++..+.-. ++.... + ..++..+++.+.+....++ .+.++++.+.+|+.++.+... + T Consensus 223 ~~~~~f~ng~~~~gil~~~~~-----~------~~~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~ 291 (441) T protein:vir:98 223 FLNNFLRNGTHAGGILKMKGV-----L------DNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKL 291 (441) T ss_pred HHHHHHhccCCCcEEEEeCCC-----C------CCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHH Confidence 45556666433221 111110 1 1123344444444443333 235677777889988766543 3 Q ss_pred HHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCC--ceEEEe Q lcl|NC_016762. 306 GPTYNVNLQTAAAGVDIPTKILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKA--EFTAIW 383 (456) Q Consensus 306 ~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~--d~~~~f 383 (456) -+......++||.+.|||-..| |...++.+..+ ...+|- .-|.|.+..+-..|-+. +.+... .|.|.. T Consensus 292 ~e~r~~~~~~Ia~~fgVPp~~l-g~~~~~~s~~q-~~~~y~-------~tl~P~~~~ie~~ln~~-L~~~~~~~~~~fd~ 361 (441) T protein:vir:98 292 IRENKSSTREIAGVFGIPLHKF-GIETANMSITD-ANLDYL-------STLKPYITCVCAELNFK-FNDEYVNREFKFDT 361 (441) T ss_pred HHHHHHhHHHHHHHhCCCHHHc-CCCCCCccHHH-HHHHHH-------HHHHHHHHHHHHHHHhh-ccccccCceEEEec Confidence 4555566778999999999855 76555443222 222221 13778887776655433 333222 455666 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc--------ccCCCCC-------CCCC Q lcl|NC_016762. 384 DDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP--------DTEPEDE-------DAAR 448 (456) Q Consensus 384 ~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~--------~~~~~d~-------~~~~ 448 (456) +.|...|.++++ ++.+.++..| +++++|+|+..+++|+++++.+. ..+..++ ..+. T Consensus 362 ~~llr~d~~~~~-------~~~~~~~~~G--~~T~NE~R~~~gl~pi~gGd~~~~~~~~n~~~~~~~~~~q~~~~~~~~~ 432 (441) T protein:vir:98 362 TEIRVVDEKTQA-------EIDKINIDSG--KMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDK 432 (441) T ss_pred hhhhccCHHHHH-------HHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceEeeccccccccccccccccccccccc Confidence 677777777764 4456677777 99999999999999987765321 1111111 0111 Q ss_pred cCCCCCCC Q lcl|NC_016762. 449 TDPTGEQQ 456 (456) Q Consensus 449 ~d~~~~~e 456 (456) ...+||+. T Consensus 433 ~~kgGe~n 440 (441) T protein:vir:98 433 KLKGGEEN 440 (441) T ss_pred ccCCCCCC Confidence 12233322 No 79 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=99.49 E-value=5.7e-14 Score=93.18 Aligned_cols=326 Identities=12% Similarity=0.038 Sum_probs=161.3 Q ss_pred hhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccC Q lcl|NC_016762. 66 AVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARG 144 (456) Q Consensus 66 iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~ 144 (456) |-..|.. ..|+ .+..+ ..+-..+...=....-+..|.+.+.+ -.++|.+++++.- +.. T Consensus 1 ia~lp~~-~~~~-------~~~~~----~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r-~~~-------- 59 (348) T protein:vir:93 1 MASLPLK-MYED-------YKVVN----TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIER-DIY-------- 59 (348) T ss_pred CcccceE-eEec-------CcCcc----cHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEE-CCC-------- Confidence 2222221 1111 11111 11111111111111123334444443 4566777776532 211 Q ss_pred CcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHH Q lcl|NC_016762. 145 KLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAY 220 (456) Q Consensus 145 ~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~ 220 (456) +.+..+.|+-...+++. .+ .-+.+-+|.+... .+..+.++++.|+||... ...|.|.++.+. T Consensus 60 --G~~~~L~~l~~~~v~~~---~~----~~~~~~~y~~~~~-----~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~ 125 (348) T protein:vir:93 60 --HQPSKLFLLNPDVVEML---IE----NQSRELYYSIHAA-----TGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLK 125 (348) T ss_pred --CcEEEEEEEcCCceEEE---Ee----CCCcEEEEEEEcC-----CCeEEEEccccEEEecCCCCCCceeeccHHHHHH Confidence 11334444443333221 11 1233555666521 123466889999988532 235899888887 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCC-eEEecCCCceeEEe Q lcl|NC_016762. 221 NSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGND-VLLPTQGATVTQMV 299 (456) Q Consensus 221 ~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~lid~~d~~~~~~ 299 (456) ..+.....+. ..+ +..+.... .+.. ...+.-..+..+++.+.+....++.+ .++++.+.+|+.++ T Consensus 126 ~~i~~~~~~~-~~~--~~~~~~~~-~~i~----------~~~~~l~~e~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~ 191 (348) T protein:vir:93 126 NTTDFDNAVR-TFN--LTEMQKPD-SFML----------KYGSNVSTEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLP 191 (348) T ss_pred HHHHHHHHHH-HHH--HHhcCCCc-eeEE----------ecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcC Confidence 6543322221 111 11111110 0000 00011112333444444444444444 55666777899988 Q ss_pred cccCC--HHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC- Q lcl|NC_016762. 300 SAVSD--PGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL- 375 (456) Q Consensus 300 ~~~sg--l~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~- 375 (456) .+... +.+........||++.|||-.+|-+...+..++.+ -.++||..+ |.|.++++-+.|-+.-+-+. T Consensus 192 ~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~~~~~~~~~-------l~P~~~~ie~~l~~~l~~~~~ 264 (348) T protein:vir:93 192 KKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHT-------LLPIVKQYEEEFNRKLLTKTD 264 (348) T ss_pred CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHH-------HHHHHHHHHHHHHHhhCCccc Confidence 77654 34555567889999999998866443333333333 355677665 78888887766654322211 Q ss_pred ---CCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc---ccCC--CCC--- Q lcl|NC_016762. 376 ---KAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP---DTEP--EDE--- 444 (456) Q Consensus 376 ---~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~---~~~~--~d~--- 444 (456) ...|.|.++.|...|.+++|++ ..++++.| ++++||+|+..+++|+++++..- .-.+ ..+ T Consensus 265 ~~~g~~i~fd~~~l~~~d~~~~a~~-------~~~~~~~G--~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~ 335 (348) T protein:vir:93 265 REKNRYFKFNVKSYLRADSATQAEV-------YFKAVRSG--YYTINDIREWEDLPPVEGGDKPLISGDLYPIDTPLELR 335 (348) T ss_pred ccCcceEEeechhhhccCHHHHHHH-------HHHHHhCC--CCCHHHHHHHhCCCCCCCcCeEeecccccccccchhhc Confidence 1236666778888888887664 45667777 99999999999999987654311 0001 111 Q ss_pred CCCCcCCCCCCC Q lcl|NC_016762. 445 DAARTDPTGEQQ 456 (456) Q Consensus 445 ~~~~~d~~~~~e 456 (456) ....+.+...+| T Consensus 336 ~~~~gg~~n~~~ 347 (348) T protein:vir:93 336 KSLKGGDKNVNE 347 (348) T ss_pred ccccCCCCCcCC Confidence 111222222223 No 80 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=99.48 E-value=7.3e-14 Score=92.59 Aligned_cols=376 Identities=11% Similarity=0.079 Sum_probs=174.8 Q ss_pred HHHhhhhhccCcccchhhhhc----cCcccCC--HHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHH Q lcl|NC_016762. 22 MSLLNQGIGHDAKRPQAWCEY----GFPQEIT--FNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETE 95 (456) Q Consensus 22 d~~~n~~~~~gt~~~~~~~~~----~~~~~~~--~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~ 95 (456) |.|. -++.+..+..|..+ |+..... +-...+ .++.-+.+||++++++.-+--+.+.....+...+ ... T Consensus 1 m~~~---~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~A--l~~~~V~~cv~~ia~~iA~lp~~~~~~~~~~~~~-~~~ 74 (417) T protein:vir:38 1 MKLF---RGLATEVDPHWADHLLDSGVIPSFRGGYLGISA--LRNSDVLTAVSIVSGDVSRFPLVITDSSTDEVID-LAN 74 (417) T ss_pred Cccc---cccccCCCccchhhhcccccccccCCceechhh--cccHHHHHHHHHHHHhhccCeeEEEEcCCcceec-cch Confidence 2221 12222222323221 2221111 111222 2466677899999999877666665443322211 112 Q ss_pred HHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhcccccccc Q lcl|NC_016762. 96 WERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETY 174 (456) Q Consensus 96 ~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~y 174 (456) +...+...=...--+..|.+.+-. -.++|-++++|. +|+.. +....+.|+-...+.+.. .+. T Consensus 75 ~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~-r~~~g---------~~~~~l~~l~p~~v~v~~--~~~----- 137 (417) T protein:vir:38 75 IEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIV-RDPIT---------NEPAMFEFYAPSQTQVDT--SDP----- 137 (417) T ss_pred HHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEE-EcCCC---------CEEEEEEEeCCceEEEEE--cCC----- Confidence 212121111112233345555444 456677776654 33211 001222232222222211 111 Q ss_pred CCceeEEEeecccCCccccceeeehhhhheecCCc---CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhh Q lcl|NC_016762. 175 GQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT---GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDK 251 (456) Q Consensus 175 g~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~---~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 251 (456) |. .+|++... +|. ....++++.||||...+ ..|.|.++.+.+.+..... +......+|++..+.-.+.. T Consensus 138 ~~-~~y~~~~~--~~~--~~~~~~~~dviH~r~~~~d~~~G~s~l~~~~~~i~~~~~-~~~~~~~~f~ng~~p~~il~-- 209 (417) T protein:vir:38 138 DN-IIYRFTPY--NSS--MQKVCGFEDVIHWKFFSYDTIMGRSPLLSLGDEIGLQES-GVSTLQKFFKSGLKGSIIKA-- 209 (417) T ss_pred Ce-EEEEEEEc--CCc--EEEEecCcceEEecCCCCCCccccCHHHHHHHHHHHHHH-HHHHHHHHHhccCCCcEEEE-- Confidence 22 23555422 111 13446677788875432 3499999999876654433 34455556676544222210 Q ss_pred hccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC---CCeEEecCCCceeEEecccCCHH--HHHHHHHHHHHhhhcCCeEE Q lcl|NC_016762. 252 EINLGEIASTYGVTLDALNERFNEAARQLNRG---NDVLLPTQGATVTQMVSAVSDPG--PTYNVNLQTAAAGVDIPTKI 326 (456) Q Consensus 252 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~lid~~d~~~~~~~~~sgl~--~~~~~~~~~~aaas~IP~t~ 326 (456) +. ..+ .++..+++.+.+....++ .+.++++.+.+|+.++.+..+.+ +........||.+.|||..+ T Consensus 210 -~~-----~~l---~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~ 280 (417) T protein:vir:38 210 -KE-----SRL---SAEARQKIREDFERAQAGADAGSPIIVDATMDYQPLEVDTNVLNLINSNNYSTAQIAKALRVPAYR 280 (417) T ss_pred -eC-----CCC---CHHHHHHHHHHHHHHhcccccCCceeccCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHH Confidence 00 001 122334444444444332 23566777788999876654432 33444567899999999876 Q ss_pred eeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC-CCceEEEeCCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_016762. 327 LVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL-KAEFTAIWDDLTVPTKAERLANSKTMSEIN 405 (456) Q Consensus 327 L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~-~~d~~~~f~pL~~~seke~Aei~~~~A~a~ 405 (456) | |.+..+-|..+-...||. .-|.|.++.+-+.|-+.-+.+. ..++.|+|+. ..+..... .+. T Consensus 281 l-g~~~~~s~~e~~~~~~~~-------~tl~P~~~~ie~~l~~~Ll~~~~~~~~~~~fd~-~~l~~~~~--------~~~ 343 (417) T protein:vir:38 281 L-AQNSPNQSVKQLADDYIR-------NDLPFYFEPITSEFELKLLDDAQRHQYCIGFDT-KSVNGLPI--------ADV 343 (417) T ss_pred h-CCCCcchhHHHHHHHHHH-------HHHHHHHHHHHHHHHhhhcChhhcccceEEech-hhhhHHHH--------HHH Confidence 5 643222112223445553 3577877777665543323221 1356777752 11222222 223 Q ss_pred HHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc----------ccCCC-----------CCCCCCcCCCCCCC Q lcl|NC_016762. 406 SAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP----------DTEPE-----------DEDAARTDPTGEQQ 456 (456) Q Consensus 406 ~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~----------~~~~~-----------d~~~~~~d~~~~~e 456 (456) ++.+.+| +++++|+|+..+++|+++++.+. +...+ .++..+.+..++.+ T Consensus 344 ~~~~~~G--~~T~NE~R~~~gl~pi~~g~~d~~~~~~n~~~~d~~~~~~~~~~~~~kgg~~~~~~~~~~~~~ 413 (417) T protein:vir:38 344 NTAVNGG--LWTGNEGRAELGKKPLKDPNMDRIQSTLNTVFLDQKEAYQAEHAAELKGGDTNAKGNQNGSGT 413 (417) T ss_pred HHHHhCC--CcCHHHHHHHhCCCCCCCCCCCeeeecccccccccccccccccccccCCCCCCCCCCCcCCCC Confidence 4567777 99999999999999987653311 00000 00111111111111 No 81 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=99.48 E-value=1.2e-13 Score=91.38 Aligned_cols=365 Identities=10% Similarity=-0.038 Sum_probs=180.0 Q ss_pred HHHHHhhhhhccCcccc-------------h-hhhhcc-C-cccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEec Q lcl|NC_016762. 20 ARMSLLNQGIGHDAKRP-------------Q-AWCEYG-F-PQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIE 83 (456) Q Consensus 20 ~~d~~~n~~~~~gt~~~-------------~-~~~~~~-~-~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~ 83 (456) .+++|.|+..+...... . .+..+. . ....+. ..+.++..+.+||+++|++.-.=-+++.. T Consensus 1 m~m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~----~~al~~~~v~~~v~~ia~~ia~lp~~~~~ 76 (392) T protein:vir:74 1 MILPILNFINQTNDPPEAGSVQSYFPDGNDAQIMESLLGDNNEWVSA----RAALRNSDLFSIILQLSSDLAIVKINAEK 76 (392) T ss_pred CcchhhhhhhcccCcccccccccccccCchhhhhhhccCCCCcccch----hhhhcchHHHHHHHHHHHhhccCceeecc Confidence 34555554433221110 0 000000 0 001111 12345788999999999998654455432 Q ss_pred CCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCCh Q lcl|NC_016762. 84 GDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKP 162 (456) Q Consensus 84 ~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~ 162 (456) .... .+...=....-+..|.+++-+ -.++|.+++++. +|.. +.+..+.|+....+++ T Consensus 77 ~~~~-----------~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~-r~~~----------G~~~~L~~i~~~~v~v 134 (392) T protein:vir:74 77 KKNQ-----------GIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRW-RNAN----------GADMKWEYLRPSQVNT 134 (392) T ss_pred chhh-----------hhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEE-ECCC----------CcEEEEEEEcCceeEE Confidence 1110 111111122233445555444 456676766653 3321 1234455544333332 Q ss_pred hhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 163 KSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNSFISLEKVEGGSGESFL 238 (456) Q Consensus 163 ~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~l~~~~~~~~~~~~~~~ 238 (456) . .|+ +|...+|+++... +..+..+.++++.||||.... ..|.|.++.+.+.+..... .......++ T Consensus 135 ~---~~~----~~~~~~y~~~~~~--~~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~-~~~~~~~~f 204 (392) T protein:vir:74 135 Y---YFE----YENGMYYNITFDD--PKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRA-SDRLTISSL 204 (392) T ss_pred E---EcC----CCceEEEEEEecC--CccceeEEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHH-HHHHHHHHH Confidence 2 111 2334456665321 112224568889998885432 3589999999887754444 344455566 Q ss_pred HHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEecccC--CHHHHHHHHHHHH Q lcl|NC_016762. 239 KNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSAVS--DPGPTYNVNLQTA 316 (456) Q Consensus 239 ~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~~s--gl~~~~~~~~~~~ 316 (456) ++....-.+-..+. + ....++.++++.+.........+.++++.+.+|++++.+.. .+-+.......+| T Consensus 205 ~ng~~p~~il~~~~-~--------~~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~I 275 (392) T protein:vir:74 205 NSSLNVPGVLTVKG-G--------GLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQY 275 (392) T ss_pred hccCCCceEEEeCC-C--------CCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHH Confidence 66543221110000 0 01112223333332222222234566777888999886643 4455566678899 Q ss_pred HhhhcCCeEEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHH Q lcl|NC_016762. 317 AAGVDIPTKILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLA 396 (456) Q Consensus 317 aaas~IP~t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Ae 396 (456) |.+.|||-..| |.....-+..+..+.||. ..|.|.++.+-+.|-+. +. .++.+.+.++...+.+++++ T Consensus 276 a~~fgVPp~~l-g~~~~~~~~~e~~~~~~~-------~~l~p~~~~ie~~l~~~-l~---~~~~~~~~~~~~~d~~~~~~ 343 (392) T protein:vir:74 276 AKVYGLPDSYI-GGQGDQQSSIQQISGMYA-------SALNRYLRPAISELEYK-LS---DHISVNMRPAIDPLGDNYLS 343 (392) T ss_pred HHHhCCCHHHh-CCCCCcccHHHHHHHHHH-------HHHHHHHHHHHHHHHHh-cc---chhcccchhhhcCCHHHHHH Confidence 99999998654 532211122233444443 34778777776655433 22 24566677777777666543 Q ss_pred HHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCCCCCCcCCCC-CCC Q lcl|NC_016762. 397 NSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDEDAARTDPTG-EQQ 456 (456) Q Consensus 397 i~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~-~~e 456 (456) ....++..| +++++|+|+.+...++.. ++.++.++.++ -|+| ++| T Consensus 344 -------~~~~l~~~g--~~t~near~~~~~~g~~p-----ne~r~~enl~~-~~~Gd~~~ 389 (392) T protein:vir:74 344 -------TISTATRWG--ALAENQATFVLQEAGYIP-----KDLPAPENTNK-KTTGQSNE 389 (392) T ss_pred -------HHHHHHhCC--CcCHHHHHHHHHhCCCCc-----cccchhcCCCC-CCCCCCCC Confidence 455667777 999999999875544322 11112121111 1222 122 No 82 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=99.48 E-value=3.8e-14 Score=94.14 Aligned_cols=420 Identities=12% Similarity=0.059 Sum_probs=190.2 Q ss_pred CCchhHHHHhHHHHHHHHH-HHHHHhhhh-hccCcccchhh-hhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhC Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIAR-ARMSLLNQG-IGHDAKRPQAW-CEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKT 77 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~-~~d~~~n~~-~~~gt~~~~~~-~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~ 77 (456) ||++....++-.++..--. -++--.+.. .-++. +... ..-..+...++.+|..+-+.|.++++||++.+++..-- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~p~~~~~~L~~~~e~~~~~~~~i~~~~~~iag~ 78 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADLAKSPNSTQIPD--HRIQSHNVGVNPPYNPDRLAAFLELNETLATGIRKKSRYEVGF 78 (651) T ss_pred CCCccceeeeeEEEeecccccccccccccccccch--hhhcccCCCCCCCCCHHHHHHHHhcChHHHHHHHHHhhhhhcc Confidence 9998877655332211000 000000000 00000 0000 01123456689999999999999999999999999988 Q ss_pred CCEEecCCCcc-hhhhhHHHHHHHHHHHH----Hh-------h----HHHHHHHHHHhhcccCceEEEEEecC--CCCc- Q lcl|NC_016762. 78 NPQVIEGDDQD-RSKDETEWERKNKPLIA----GG-------R----FWRAVSEADRRRLVGRYSGLLLHIRD--SQPW- 138 (456) Q Consensus 78 ~~~i~~~~~~d-~~~~~~~~e~~i~~~~~----~l-------~----~~~~~~ea~~~~r~~Ggs~i~i~i~D--~~~~- 138 (456) ||.|.--.+-+ .......+++ .+..+. +| + ....+..++......|.+++=+. .+ ++.. T Consensus 79 g~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiI-rn~~g~pv~ 156 (651) T protein:vir:99 79 GFDLVPAQGVDGDDASDAQREV-ARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEML-TDIEGRPVG 156 (651) T ss_pred CceeeecccCCCCccchHHHHH-HHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhh-hcCccchhh Confidence 98885211111 1111111111 111111 00 1 11111112222223333333221 11 1110 Q ss_pred --ccccc---CCcC----------------------------------ceeEEE----------EeccccCCh--hhhhc Q lcl|NC_016762. 139 --DRPAR---GKLN----------------------------------GLAKVT----------PAWAGCLKP--KSFDE 167 (456) Q Consensus 139 --~~Pl~---~~~~----------------------------------~l~~i~----------~~~~~~~~~--~~~~~ 167 (456) ..|.. ...+ +-.++. ..+...... ..+.. T Consensus 157 L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~~~~~~ 236 (651) T protein:vir:99 157 LAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEPTIRYRE 236 (651) T ss_pred hhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcceeEEecc Confidence 01110 0000 000000 000000000 00001 Q ss_pred c---ccccccCCceeEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 168 K---PDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISLEKVEGGSGESFLKN 240 (456) Q Consensus 168 D---p~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~ 240 (456) | ...+.+..+..+.+..... .....+.++.||||... ...|.|.++.+...+..... +......+|++ T Consensus 237 d~~~~~~~~~~~~~~g~~~~~~~----~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i~~a~~-a~~~~~~~f~N 311 (651) T protein:vir:99 237 DEESEREPIFVDRETGDVTTGDA----NGLENRPANELIFIPNPSILEDDYGVPDWVSAIRTISADEA-AKDYNRDFFDN 311 (651) T ss_pred CcceeeeeecccceeeeEEEcCC----CceeEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHH-HHHHHHHHHhc Confidence 1 1123344455555432211 12345777889888532 34599999999887754443 44445556666 Q ss_pred hhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecC------------CCceeEEecccC---CH Q lcl|NC_016762. 241 AARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQ------------GATVTQMVSAVS---DP 305 (456) Q Consensus 241 ~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~------------~d~~~~~~~~~s---gl 305 (456) ....-.+. .+.+ +.-.++..+++.+.++...+|.+..++.. +-+|+.++...+ .+ T Consensus 312 G~~p~gil-----~~~~-----~~ls~e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qf 381 (651) T protein:vir:99 312 DTIPRMVI-----KVTG-----GELSEESKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDF 381 (651) T ss_pred cCCCceEE-----EecC-----CCCCHHHHHHHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCcCchhhHHH Confidence 43221111 1100 01113334455555555555554444332 335555554332 23 Q ss_pred HHHHHHHHHHHHhhhcCCeEEeeccCCCc-ccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCc----CCCCce Q lcl|NC_016762. 306 GPTYNVNLQTAAAGVDIPTKILVGMQTGE-RASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVV----PLKAEF 379 (456) Q Consensus 306 ~~~~~~~~~~~aaas~IP~t~L~G~sp~G-lnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~----~~~~d~ 379 (456) -+........||++.|||-. ++|...++ .+..+ -.+.|+..+ |.|.+..+-+.|-+.-+- .....+ T Consensus 382 le~r~~~~~eIa~afgVPp~-~lG~~~~~~~sn~E~~~~~f~~~t-------L~P~~~~ie~eln~kLl~~~e~~~~~~i 453 (651) T protein:vir:99 382 RQFREKNEHEIAKVLEVPPV-KIGVTDSANRSNSDQQDKDFALEV-------IQPEQHTFAEWLYQIIHQQALGVTDWTI 453 (651) T ss_pred HHHHHHHHHHHHHHhCCCHH-HhccCCCCCcccHHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCccccccCceE Confidence 44445567789999999976 55765433 33223 456666543 677776666555332111 112235 Q ss_pred EEEeCC--CCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCC---CCCccc-------------CC Q lcl|NC_016762. 380 TAIWDD--LTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGG---DPLPDT-------------EP 441 (456) Q Consensus 380 ~~~f~p--L~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~---~~~~~~-------------~~ 441 (456) .|+|+. |...+.+.+ +++...++..| ++++||+|+..+++|+.+. ...... .+ T Consensus 454 ~~ef~~~~llr~D~~~~-------~e~~~~~i~~G--~~T~NE~R~~lglppi~~~~gd~~l~~~~~~~~g~~~~gge~~ 524 (651) T protein:vir:99 454 EYELRGADQPKQEAQLA-------EQRVRAMRLAG--VGLVDEAREELGLDPLGEPYGEMTLSEFEAEVAGDVAGGGETE 524 (651) T ss_pred EEEeccchhhhccHHHH-------HHHHHHHHhCC--CcCHHHHHHHhCCCCCCCccccccccccccccccccccCCCCc Confidence 566654 655555444 55556777888 9999999999999887531 111000 00 Q ss_pred CCCCCCCcCCCCCCC Q lcl|NC_016762. 442 EDEDAARTDPTGEQQ 456 (456) Q Consensus 442 ~d~~~~~~d~~~~~e 456 (456) ...++++..+.+++| T Consensus 525 ~~~~~~~~~~~~~~e 539 (651) T protein:vir:99 525 AVHEPPEENKIGERE 539 (651) T ss_pred ccccCccccccccch Confidence 111111122222222 No 83 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=99.48 E-value=4.2e-14 Score=93.91 Aligned_cols=360 Identities=13% Similarity=0.072 Sum_probs=172.3 Q ss_pred HHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHH Q lcl|NC_016762. 22 MSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNK 101 (456) Q Consensus 22 d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~ 101 (456) |+|.+...+-.+.....+. +......+. ..|..+..+.+||+.++.++-+--+.+.......+ ..+...+. T Consensus 1 Mg~f~~l~~~~~~~~~~~~-~~~~~~~~~----~~~l~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~----~~l~~ll~ 71 (376) T protein:vir:78 1 MGFFSELFKRNKEIEWMWD-LDFLEDKTT----KVYLKKMALNTCVKHIARTIAKSDFRLKNGETSVR----DKLYYKLN 71 (376) T ss_pred CchhhhhhccCCccccccc-hhhccccch----hhhhhhHHHHHHHHHHHHhhcccceeecccccccc----chHHHHHh Confidence 6666554443332221111 111122222 23456788999999999999887777754332211 11112222 Q ss_pred HHHHHhhHHHHHHHHHHhhcc-cCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccccccCCceeE Q lcl|NC_016762. 102 PLIAGGRFWRAVSEADRRRLV-GRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMW 180 (456) Q Consensus 102 ~~~~~l~~~~~~~ea~~~~r~-~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y 180 (456) ..=....-+..|.+.+-+.++ +|.+++++ .+++... ...+..+.+.. +.+ ..++ T Consensus 72 ~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~-~r~~~~~-------~~~~~~~~~~~---~~~--------------~~~~ 126 (376) T protein:vir:78 72 IRPNTDMSSSSFWEKVIYKLIYDNECLIVL-SDTDDFL-------IADSYVRKEFA---FFP--------------DVFE 126 (376) T ss_pred hccccCCCHHHHHHHHHHHHhHcCcEEEEE-EeCCCee-------eccceeecccc---eee--------------eeee Confidence 221223345556666555555 46555554 3343211 00111111110 000 0111 Q ss_pred EEeecccCCccccceeeehhhhheecCCcCCCcchHHHHHHHHHHHHHHHHHHHHH-HHHHhhhhhhhhhhhhccHhhHH Q lcl|NC_016762. 181 EYTEASQAGRPGLVRDIHPDRVFILGDWTGDAIGFLEPAYNSFISLEKVEGGSGES-FLKNAARQLLLNFDKEINLGEIA 259 (456) Q Consensus 181 ~i~~~~~~g~~~~~~~IH~SRli~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~-~~~~~~~~l~~~~~~~~~~~~l~ 259 (456) .+... + ......+.++.|+||.-....|.+.+..+.+....... ..... .+.+..+... .+... T Consensus 127 ~~~~~---~-~~~~~~~~~~evih~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~-----~~~~~--- 191 (376) T protein:vir:78 127 GVTVK---D-YRYNRNFSMDDVIFLEYGNERLSAFTDGMFEDYGELFG---KMIRAQMRNFQIRGAV-----NFKMA--- 191 (376) T ss_pred eeeee---c-ceeeeeeccccEEEeccCCCCchhhhhHHHHHHHHHHH---HHHHHHHhcCCCceeE-----EEccC--- Confidence 22100 0 00113456677777754444555555555443222111 11111 1111111110 01100 Q ss_pred hhhcCCHHHHHHHHHHHHHHHhc----CCC-eEEecCCCceeEEecccCCH-------HHHHHHHHHHHHhhhcCCeEEe Q lcl|NC_016762. 260 STYGVTLDALNERFNEAARQLNR----GND-VLLPTQGATVTQMVSAVSDP-------GPTYNVNLQTAAAGVDIPTKIL 327 (456) Q Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~----~~~-~~lid~~d~~~~~~~~~sgl-------~~~~~~~~~~~aaas~IP~t~L 327 (456) +...++..+++.+.++...+ +.+ ++.++.+-+|+.++.+...+ -+.......+||.+.|||..+| T Consensus 192 ---~~~~~e~~~~~~~~~~~~~~g~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l 268 (376) T protein:vir:78 192 ---GVADKDKQTKLQEYIDKVYASFNNNEIAIVPQLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLL 268 (376) T ss_pred ---CCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHh Confidence 11112333444444443322 222 34467778999998877644 3344445678999999999866 Q ss_pred eccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH Q lcl|NC_016762. 328 VGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINS 406 (456) Q Consensus 328 ~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~ 406 (456) | +..++.+ -...||.. -|.|.+..+-+.|-+.-+.+..-.+.|.+..|...+.+++ +++.. T Consensus 269 -~---~~~s~~e~~~~~f~~~-------~l~P~~~~ie~~l~~kll~~~~~~~~~~~~~ll~~d~~~~-------~~~~~ 330 (376) T protein:vir:78 269 -H---GDMADLSNNMKAYMEY-------CIDPLTKKLEDELNAKLFTFSEFLAGEHIKIIHKKDIIEN-------AEAVD 330 (376) T ss_pred -C---CCCCCHHHHHHHHHHH-------HHHHHHHHHHHHHHhhhCCcccceecccchhhcccCHHHH-------HHHHH Confidence 3 2222223 23455554 3788887777666554454433334556667777776665 55666 Q ss_pred HHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCCCC-CCcCCCC Q lcl|NC_016762. 407 AAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDEDA-ARTDPTG 453 (456) Q Consensus 407 ~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~~~-~~~d~~~ 453 (456) +++..| ++++||+|+..+++|++++..+.-..+.+-.+ .+..+.| T Consensus 331 ~~~~~G--~~t~NE~R~~lg~~p~~~g~~d~~~~~~n~~~~~~~~e~g 376 (376) T protein:vir:78 331 KLVASG--SFNRNEVRELLGAERVDNPELDKYLITKNYQSADEGGEDG 376 (376) T ss_pred HHHhCC--CcCHHHHHHHhCCCCCCCCCCceeeeccCceehhccccCC Confidence 778888 99999999999999987653322111221111 0111111 No 84 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=99.46 E-value=1.7e-13 Score=90.57 Aligned_cols=368 Identities=10% Similarity=-0.026 Sum_probs=179.1 Q ss_pred HHHhhhhhccCcccchh---hhhc----cCcccCCHH-HHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhh Q lcl|NC_016762. 22 MSLLNQGIGHDAKRPQA---WCEY----GFPQEITFN-DLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDE 93 (456) Q Consensus 22 d~~~n~~~~~gt~~~~~---~~~~----~~~~~~~~~-~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~ 93 (456) |+|.+...+-..+.... +..+ ......... --...+.+++.+.+||++++.++-.--+++.... . T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~p~~~~~~~--~----- 73 (386) T protein:vir:48 1 MPIFNITNLATESPPISQGGFFDITDPDFLSTLNGSEWVSAESALRNSDLFSIINQLSNDLATVKLTASRKQ--L----- 73 (386) T ss_pred CcccccccccccccccccccccccccchhcccccCCceechhhhhcchHHHHHHHHHHHhhccCceeeccch--h----- Confidence 33333211111000000 0000 000000000 0122356789999999999999866555543211 0 Q ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHhh-cccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhcccccc Q lcl|NC_016762. 94 TEWERKNKPLIAGGRFWRAVSEADRRR-LVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSE 172 (456) Q Consensus 94 ~~~e~~i~~~~~~l~~~~~~~ea~~~~-r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~ 172 (456) ..+...-....-+..|.+++-+. .++|-+++++. +|.. +.+..+.|+....+++.. + T Consensus 74 ----~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~-r~~~----------g~~~~L~~l~~~~v~v~~---~---- 131 (386) T protein:vir:48 74 ----QGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRW-RNEN----------GRDMKWEYLRPSQVSFNR---L---- 131 (386) T ss_pred ----HHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEE-ECCC----------CcEEEEEEecCceeEEEE---c---- Confidence 01222222222344455555544 55566666553 3321 123344444443333321 1 Q ss_pred ccCCceeEEEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh-h Q lcl|NC_016762. 173 TYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLL-L 247 (456) Q Consensus 173 ~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~-~ 247 (456) ..|.+.+|+|.... ...+..+.+-++.||||.... ..|.|.++.+...+.....+.. ....++++....-. + T Consensus 132 ~~~~~~~y~~~~~~--~~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~-~~~~~~~ng~~~~~ii 208 (386) T protein:vir:48 132 DNKDGIYYNITFDD--PRIPPKQHVPQGDVLHFKLLSVDGGLTSVSPLMALSRELNIQKASDK-LTLNSLKNALNANGIL 208 (386) T ss_pred CCCceEEEEEEecC--ccccceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHH-HHHHHHhccCCcceEE Confidence 12456677775321 111222345566788775433 3489999988876655444433 33345555332211 1 Q ss_pred hhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC-CeEEecCCCceeEEecccCC--HHHHHHHHHHHHHhhhcCCe Q lcl|NC_016762. 248 NFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGN-DVLLPTQGATVTQMVSAVSD--PGPTYNVNLQTAAAGVDIPT 324 (456) Q Consensus 248 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~lid~~d~~~~~~~~~sg--l~~~~~~~~~~~aaas~IP~ 324 (456) +... ....+..+++.+....+.++. +.++++.+-+|+.++.+... +-+......++||++.|||- T Consensus 209 ~~~~------------~~~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp 276 (386) T protein:vir:48 209 KIKG------------GGLLDFKTKLSRSRQAMKQMQGGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPE 276 (386) T ss_pred EeCC------------CCCHHHHHHHHHHHHHhhcCCCCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCH Confidence 1111 111222233333333344444 44666777889998877654 45666777889999999998 Q ss_pred EEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHH Q lcl|NC_016762. 325 KILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEI 404 (456) Q Consensus 325 t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a 404 (456) ..| |.+..+-|..+...+||..+ |.|.++.+-+.|-+. +++ ++.+.+.++..++...+ +.. T Consensus 277 ~~l-g~~~~~~~~e~~~~~~~~~~-------l~P~~~~ie~~l~~~-l~~---~~~~~~~~~~~~d~~~~-------~~~ 337 (386) T protein:vir:48 277 NVV-GGQGDQQSSLEMSLDLYNKA-------VSRYLRPFLSELSQK-LSC---DVDADILPAVDPTGSNS-------VSR 337 (386) T ss_pred HHh-CCCCCcccHHHHHHHHHHHH-------HHHHHHHHHHHHHHh-hcc---hhhcchhhhhccChHHH-------HHH Confidence 754 54322222334456666443 778777766655433 222 34444455555555443 233 Q ss_pred HHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCC-CCCCcCCCCCC Q lcl|NC_016762. 405 NSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDE-DAARTDPTGEQ 455 (456) Q Consensus 405 ~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~-~~~~~d~~~~~ 455 (456) ...++..| +++++|+|+.++..|+..++-..- +..+. ....+|+-+++ T Consensus 338 ~~~l~~~g--~~t~nE~r~~lg~~~~~~~~~~~~-~~~~~~~~~gGd~~~~~ 386 (386) T protein:vir:48 338 INSMVKSG--TLAQNQGLYILQQAEILPKELPEG-ENPNKTTLKGGEINGED 386 (386) T ss_pred HHHHHhCC--CcCHHHHHHHhhcCCCCCccchhh-cCCCCCccCCCCCCCCC Confidence 44667777 999999999998888654432211 11111 11122222222 No 85 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=99.46 E-value=1e-13 Score=91.85 Aligned_cols=373 Identities=13% Similarity=0.093 Sum_probs=172.4 Q ss_pred HHHhhhhhccCcccch-hhhhccC---cccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHH Q lcl|NC_016762. 22 MSLLNQGIGHDAKRPQ-AWCEYGF---PQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWE 97 (456) Q Consensus 22 d~~~n~~~~~gt~~~~-~~~~~~~---~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e 97 (456) |+|.|++-+-...... ....+.. ...++.....++ ..++.+..+|+++|+++-.--+.+....+.........+. T Consensus 1 Mg~~~~f~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~V~~~I~~ia~~iA~~p~~~~~~~~~g~~~~~~~~~ 79 (403) T protein:vir:80 1 MGLFNFFRRKTRSEPTNAISWFLTQEAYDTLAIPGYTRL-SDNPEVRMAVHKIAELISSMTIHLMQNTDNGDIRIKNELS 79 (403) T ss_pred Ccccccccccccccccchhhhhcccccccccccchhhhh-hhhHHHHHHHHHHHHhhhhCceEEEEecCCceeecCChHH Confidence 6766654332111111 1111111 112222222333 3467789999999999976666664322211111112222 Q ss_pred HHHHHHHHHhhHHHHHHHHHHhhc-cc--CceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhcccccccc Q lcl|NC_016762. 98 RKNKPLIAGGRFWRAVSEADRRRL-VG--RYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETY 174 (456) Q Consensus 98 ~~i~~~~~~l~~~~~~~ea~~~~r-~~--Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~y 174 (456) ..+...=..+.-+..|.+.+-+.. +. |.|.+++. .|+. +.+..+.|+....+++. .+.. T Consensus 80 ~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~-~~~~----------g~~~~L~~l~p~~v~~~---~~~~---- 141 (403) T protein:vir:80 80 RKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPK-YTTS----------GLIDELIPLAPSKVSFV---DTDT---- 141 (403) T ss_pred HHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEE-EcCC----------CcEEEEEEEcCCeeEEE---EcCC---- Confidence 222211112223334555554443 33 44555443 2321 11233434333333221 1111 Q ss_pred CCceeEEEeecccCCccccceeeehhhhheecC-----CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhhh Q lcl|NC_016762. 175 GQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD-----WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQ-LLLN 248 (456) Q Consensus 175 g~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~-----~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~-l~~~ 248 (456) |..-+|. +..+-++-||||.. ....|.|.++.+.+.+-... ........++++..+. ..++ T Consensus 142 g~~~~y~------------~~~~~~~eiih~~~~~~~~~~~~G~s~~~~~~~~i~~~~-~~~~~~~~~~~ng~~p~~il~ 208 (403) T protein:vir:80 142 GYQIWYQ------------GKAYNYDEVLHFIVNPDPEKPYMGRGYRVVLKDIVNNLK-QATTTKKSFMSGKYMPSLIVK 208 (403) T ss_pred ceEEEEe------------ecccchhhEEEEeccCCCcCccccccHHHHHHHHHHHHH-HHHHHHHHHHhccCCcceEEE Confidence 1111221 12233555666541 12348898888877665443 3444445566654321 1122 Q ss_pred hhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecC-CCceeEEe-cccC--CHHHHHHHHHHHHHhhhcCCe Q lcl|NC_016762. 249 FDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQ-GATVTQMV-SAVS--DPGPTYNVNLQTAAAGVDIPT 324 (456) Q Consensus 249 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~-~d~~~~~~-~~~s--gl~~~~~~~~~~~aaas~IP~ 324 (456) ....++ ....++..+++.+......++...+++.. ..+++++. .+.. .+-+.......+||.+.+||. T Consensus 209 ~~~~~~--------~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp 280 (403) T protein:vir:80 209 VDAATA--------ELSSEEGRNAVFKKYLEASEAGQPWIIPAELLDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPA 280 (403) T ss_pred eCCCCC--------hHHHHHHHHHHHHHHhhhhhcCCeeeecccccccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCH Confidence 111110 01112233333222222222223344433 34444432 3332 334555666778999999997 Q ss_pred EEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEe--CCCCCCCHHHHHHHHHHHH Q lcl|NC_016762. 325 KILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIW--DDLTVPTKAERLANSKTMS 402 (456) Q Consensus 325 t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f--~pL~~~seke~Aei~~~~A 402 (456) .+| |... ++.+...+||.. -|.|.++.+-+.|-+.-+-+ .++.|+| +.|...+.+++++ T Consensus 281 ~~l-g~~~---~~~~~~~~f~~~-------~l~P~~~~ie~~l~~kll~~--~~~~~~f~~~~ll~~d~~~~~~------ 341 (403) T protein:vir:80 281 FLL-GVGK---YDKDEYNNFINS-------TILPIAKGIEQELTRKLLIS--PDLYFKFNPRSLYAYDLKELAE------ 341 (403) T ss_pred HHc-CCCC---ccHHHHHHHHHH-------HHHHHHHHHHHHHHHhccCC--CCcEEEeechhhhccCHHHHHH------ Confidence 654 5321 112234667754 48899888887775543433 3455555 5777777777765 Q ss_pred HHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc---ccCC----CCCCC----CCcCCCCCCC Q lcl|NC_016762. 403 EINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP---DTEP----EDEDA----ARTDPTGEQQ 456 (456) Q Consensus 403 ~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~---~~~~----~d~~~----~~~d~~~~~e 456 (456) +...++..| +++++|+|+..+++|+++++..- .-.+ .+.+. +..+..++.| T Consensus 342 -~~~~~~~~G--i~t~NE~R~~~gl~p~~ggd~~~~~~n~~pl~~~~~~~~~k~ge~~~~~~~~~ 403 (403) T protein:vir:80 342 -VGSNMYVRG--LMEGNEVRDWLGLSPKEGLSELVILENYIPLDKIGDQNKLKGGEKGGADGQTD 403 (403) T ss_pred -HHHHHHhCC--CcCHHHHHHHhCCCCCCCCCeEeecccccchhhccchhhccCCCCCCCCCCCC Confidence 445667777 99999999999999987654311 1111 11111 1111222222 No 86 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=99.46 E-value=2.1e-13 Score=90.08 Aligned_cols=365 Identities=10% Similarity=-0.035 Sum_probs=179.8 Q ss_pred HHHHHhhhhhccCcccchh-hh------------hc--cCc-ccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEec Q lcl|NC_016762. 20 ARMSLLNQGIGHDAKRPQA-WC------------EY--GFP-QEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIE 83 (456) Q Consensus 20 ~~d~~~n~~~~~gt~~~~~-~~------------~~--~~~-~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~ 83 (456) .+++|.++..+........ .. .. +.. ...+. ..+.+++.+.++|+++|++.-.--+++.. T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~----~~al~~~~v~~~i~~ia~~ia~lp~~~~~ 76 (392) T protein:vir:10 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSA----RAALRNSDLFSIILQLSSDLAIVKINAEK 76 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceech----HHhhccHHHHHHHHHHHHhhccCceeecc Confidence 4455555443322111100 00 00 000 01111 12235788999999999998665555532 Q ss_pred CCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCCh Q lcl|NC_016762. 84 GDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKP 162 (456) Q Consensus 84 ~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~ 162 (456) ... ..+...=....-+..|.+.+-+ -.++|.+++++. +|.. +.+..+.|+....+++ T Consensus 77 ~~~-----------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~-r~~~----------g~~~~L~~l~~~~v~~ 134 (392) T protein:vir:10 77 KKN-----------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRW-RNAN----------GADMKWEYLRPSQVNT 134 (392) T ss_pred chh-----------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEE-ECCC----------CcEEEEEEEcCceeEE Confidence 111 0111111112223445554444 456677666653 3321 2234455554333332 Q ss_pred hhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 163 KSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNSFISLEKVEGGSGESFL 238 (456) Q Consensus 163 ~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~l~~~~~~~~~~~~~~~ 238 (456) . .| ..|...+|+++.... ..+..+.++++.||||...+ ..|.|.++.+...+.....+ ......++ T Consensus 135 ~---~~----~~~~~~~y~~~~~~~--~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~-~~~~~~~f 204 (392) T protein:vir:10 135 Y---YF----EYENGMYYNITFDDP--KIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRAS-DRLTISSL 204 (392) T ss_pred E---Ec----CCCceEEEEEEecCc--ccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHH-HHHHHHHH Confidence 1 11 123455677753321 11223568889999886433 35899999988876544444 33444555 Q ss_pred HHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEecccC--CHHHHHHHHHHH Q lcl|NC_016762. 239 KNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSAVS--DPGPTYNVNLQT 315 (456) Q Consensus 239 ~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~~s--gl~~~~~~~~~~ 315 (456) ++....-. ++.... ....++.++++.+.........+.++++.+-+|+.++.+.. .+-+......++ T Consensus 205 ~ng~~p~gil~~~~~----------~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~ 274 (392) T protein:vir:10 205 NSSLNVPGVLTVKGG----------GLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQ 274 (392) T ss_pred hccCCCceEEEeCCC----------CCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHH Confidence 55433211 111100 01112233333332222223335567777888999887654 445666667789 Q ss_pred HHhhhcCCeEEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHH Q lcl|NC_016762. 316 AAAGVDIPTKILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERL 395 (456) Q Consensus 316 ~aaas~IP~t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~A 395 (456) ||.+.|||-..| |.+...-+..+..+.||.. -|.|.++.+-+-|-+. +. .++.+...++...+.++++ T Consensus 275 Ia~~fgVpp~~l-g~~~~~~~~~~~~~~f~~~-------~l~P~~~~ie~~l~~~-L~---~~~~~d~~~~~~~d~~~~~ 342 (392) T protein:vir:10 275 YAKVYGLPDSYI-GGQGDQQSSIQQISGMYAS-------ALNRYLRPAISELEYK-LS---DHISVNMRPAIDPLGDNYL 342 (392) T ss_pred HHHHhCCCHHHh-CCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHHh-cc---ccccccchhhhccCHHHHH Confidence 999999997655 5322211222334455543 3677777665555332 22 2355666677777766554 Q ss_pred HHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 396 ANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 396 ei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) +....++..| +++++|+|+.+...+...+ +.++.++.++.+.+.+.| T Consensus 343 -------~~~~~l~~~g--~~t~nE~r~~l~~~g~~p~-----e~r~~e~l~~~~~Gd~~~ 389 (392) T protein:vir:10 343 -------STISTATRWG--ALAENQATFVLQEAGYIPK-----DLPAPENTNKKTTGQSNE 389 (392) T ss_pred -------HHHHHHHhCC--CcCHHHHHHHHHhcCCCcc-----ccchhcCCCCCCCCCCCC Confidence 3455667777 9999999998744443211 111111111111111222 No 87 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=99.46 E-value=2.1e-13 Score=90.08 Aligned_cols=365 Identities=10% Similarity=-0.035 Sum_probs=179.8 Q ss_pred HHHHHhhhhhccCcccchh-hh------------hc--cCc-ccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEec Q lcl|NC_016762. 20 ARMSLLNQGIGHDAKRPQA-WC------------EY--GFP-QEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIE 83 (456) Q Consensus 20 ~~d~~~n~~~~~gt~~~~~-~~------------~~--~~~-~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~ 83 (456) .+++|.++..+........ .. .. +.. ...+. ..+.+++.+.++|+++|++.-.--+++.. T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~----~~al~~~~v~~~i~~ia~~ia~lp~~~~~ 76 (392) T protein:vir:39 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSA----RAALRNSDLFSIILQLSSDLAIVKINAEK 76 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceech----HHhhccHHHHHHHHHHHHhhccCceeecc Confidence 4455555443322111100 00 00 000 01111 12235788999999999998665555532 Q ss_pred CCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCCh Q lcl|NC_016762. 84 GDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKP 162 (456) Q Consensus 84 ~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~ 162 (456) ... ..+...=....-+..|.+.+-+ -.++|.+++++. +|.. +.+..+.|+....+++ T Consensus 77 ~~~-----------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~-r~~~----------g~~~~L~~l~~~~v~~ 134 (392) T protein:vir:39 77 KKN-----------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRW-RNAN----------GADMKWEYLRPSQVNT 134 (392) T ss_pred chh-----------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEE-ECCC----------CcEEEEEEEcCceeEE Confidence 111 0111111112223445554444 456677666653 3321 2234455554333332 Q ss_pred hhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 163 KSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNSFISLEKVEGGSGESFL 238 (456) Q Consensus 163 ~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~l~~~~~~~~~~~~~~~ 238 (456) . .| ..|...+|+++.... ..+..+.++++.||||...+ ..|.|.++.+...+.....+ ......++ T Consensus 135 ~---~~----~~~~~~~y~~~~~~~--~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~-~~~~~~~f 204 (392) T protein:vir:39 135 Y---YF----EYENGMYYNITFDDP--KIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRAS-DRLTISSL 204 (392) T ss_pred E---Ec----CCCceEEEEEEecCc--ccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHH-HHHHHHHH Confidence 1 11 123455677753321 11223568889999886433 35899999988876544444 33444555 Q ss_pred HHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEecccC--CHHHHHHHHHHH Q lcl|NC_016762. 239 KNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSAVS--DPGPTYNVNLQT 315 (456) Q Consensus 239 ~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~~s--gl~~~~~~~~~~ 315 (456) ++....-. ++.... ....++.++++.+.........+.++++.+-+|+.++.+.. .+-+......++ T Consensus 205 ~ng~~p~gil~~~~~----------~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~ 274 (392) T protein:vir:39 205 NSSLNVPGVLTVKGG----------GLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQ 274 (392) T ss_pred hccCCCceEEEeCCC----------CCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHH Confidence 55433211 111100 01112233333332222223335567777888999887654 445666667789 Q ss_pred HHhhhcCCeEEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHH Q lcl|NC_016762. 316 AAAGVDIPTKILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERL 395 (456) Q Consensus 316 ~aaas~IP~t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~A 395 (456) ||.+.|||-..| |.+...-+..+..+.||.. -|.|.++.+-+-|-+. +. .++.+...++...+.++++ T Consensus 275 Ia~~fgVpp~~l-g~~~~~~~~~~~~~~f~~~-------~l~P~~~~ie~~l~~~-L~---~~~~~d~~~~~~~d~~~~~ 342 (392) T protein:vir:39 275 YAKVYGLPDSYI-GGQGDQQSSIQQISGMYAS-------ALNRYLRPAISELEYK-LS---DHISVNMRPAIDPLGDNYL 342 (392) T ss_pred HHHHhCCCHHHh-CCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHHHh-cc---ccccccchhhhccCHHHHH Confidence 999999997655 5322211222334455543 3677777665555332 22 2355666677777766554 Q ss_pred HHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 396 ANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 396 ei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) +....++..| +++++|+|+.+...+...+ +.++.++.++.+.+.+.| T Consensus 343 -------~~~~~l~~~g--~~t~nE~r~~l~~~g~~p~-----e~r~~e~l~~~~~Gd~~~ 389 (392) T protein:vir:39 343 -------STISTATRWG--ALAENQATFVLQEAGYIPK-----DLPAPENTNKKTTGQSNE 389 (392) T ss_pred -------HHHHHHHhCC--CcCHHHHHHHHHhcCCCcc-----ccchhcCCCCCCCCCCCC Confidence 3455667777 9999999998744443211 111111111111111222 No 88 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=99.43 E-value=1.2e-13 Score=91.37 Aligned_cols=367 Identities=11% Similarity=0.021 Sum_probs=162.7 Q ss_pred HHHhhhhhcc-C----cccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHH Q lcl|NC_016762. 22 MSLLNQGIGH-D----AKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEW 96 (456) Q Consensus 22 d~~~n~~~~~-g----t~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~ 96 (456) |+|.+...+. + +.....+..+.+.. ......|..+..+..+|++++.+.-.--+.+...++. ....+ T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~~~~~----~~~~~ 72 (395) T protein:vir:40 1 MGFKSWVSGFFNEEQRTLNLTDTVWCSIPS----EKLKELSIKKWAIDSCANKIANTLSCAEVLTYEKGEE----VRKKN 72 (395) T ss_pred CchHHHHHhhhcccccccccccchhhcccc----ccchhhhhhhHHHHHHHHHHHHHHhhCceeeccCCcc----ccchH Confidence 4443332222 1 11111111111111 1223446678899999999999998777776543221 11111 Q ss_pred HHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccccccC Q lcl|NC_016762. 97 ERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYG 175 (456) Q Consensus 97 e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg 175 (456) ...++..=..+.-+..|.+++.+ -.++|.+++++ .++. . .+..-|.. . .....++ T Consensus 73 ~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~-~~~~-~-------------~~~~~~~~----~---~~~~~~~-- 128 (395) T protein:vir:40 73 WYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFM-QDEY-I-------------YVADSFTK----N---DKSLYEN-- 128 (395) T ss_pred HHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEE-ecCc-e-------------eecCCccc----c---ccccccc-- Confidence 11111110011122445554444 44567666554 3221 1 01111110 0 0001111 Q ss_pred CceeEEEeecccCCccccceeeehhhhheecCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccH Q lcl|NC_016762. 176 QPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINL 255 (456) Q Consensus 176 ~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 255 (456) .++.|+. ++ ..-.+.+.++.|+||.-....+.+.+..++....... ..........+..+.. +. ++. T Consensus 129 --~~~~v~~---~~-~~~~~~~~~~evih~r~~~~~~~~~~~~l~~~~~~~~--~~~~~~~~~~~~~~~~-l~----~~~ 195 (395) T protein:vir:40 129 --TYTEVTL---KD-LTLKKEFKESEVLHLTLNNESIKSIIDGFYLLYGDLL--TAAVNKYKKLNSRKII-VK----LKA 195 (395) T ss_pred --eeeeeee---cC-ceeeeeeccccEEEeecCCCCccccchhHHHHHHHHH--HHHHHHHHhcCCCCce-EE----Eec Confidence 1111211 11 0112345677788774222233333433333222111 1111111111111111 11 100 Q ss_pred hhHHhhhcCCHHHHHHHHHHHHHHHhcC-CCeEEecCCCceeEEecccCCHHHHH-----HHHHHHHHhhhcCCeEEeec Q lcl|NC_016762. 256 GEIASTYGVTLDALNERFNEAARQLNRG-NDVLLPTQGATVTQMVSAVSDPGPTY-----NVNLQTAAAGVDIPTKILVG 329 (456) Q Consensus 256 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~lid~~d~~~~~~~~~sgl~~~~-----~~~~~~~aaas~IP~t~L~G 329 (456) .. .......+..++.+.+++....++ ...++++.+-+|+.++.+.....-+- +.+..+||.+-|||..+| | T Consensus 196 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l-~ 272 (395) T protein:vir:40 196 MF--GQTPEAEEKLRLMLSERMKKFLAEGDSALPVEDGMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLA-K 272 (395) T ss_pred cc--CCCHHHHHHHHHHHHHHHHHhhccCCceeecCCCceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHh-c Confidence 00 000011123333443333333333 34566777889999988877654332 223468999999999876 3 Q ss_pred cCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC----CCceEEEeCCCCCCCHHHHHHHHHHHHHH Q lcl|NC_016762. 330 MQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL----KAEFTAIWDDLTVPTKAERLANSKTMSEI 404 (456) Q Consensus 330 ~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~----~~d~~~~f~pL~~~seke~Aei~~~~A~a 404 (456) |..+..+ -...||. ..|.|.++++-+-|-+.-+.+. .-.|.|.+.+|...|.+++++. T Consensus 273 ---~~~sn~e~~~~~f~~-------~~L~P~~~~ie~~l~~kLl~~~~~~~g~~i~fd~~~ll~~d~~~~~~~------- 335 (395) T protein:vir:40 273 ---GDTVGLSEQVNSFLM-------FSINPIAEMFTDEGNRKFYGRDSVLERTYMKLDTTRIKVQDIQEIASS------- 335 (395) T ss_pred ---CCCcCHHHHHHHHHH-------HHHHHHHHHHHHHHHHhcCChhhhcCCceEEEechhhhccCHHHHHHH------- Confidence 1222223 2344543 3567877777666544433321 1236666678988888888764 Q ss_pred HHHHHHcCCcCcCHHHHHHHhcccCCCC--CCCC---cccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 405 NSAAIGTGEPVFTAEEIREEAGYDPLQG--GDPL---PDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 405 ~~~~~~~g~~~i~~~E~R~~~~~~~~~~--~~~~---~~~~~~d~~~~~~d~~~~~e 456 (456) ...++..| ++++||+|+..+++|+.+ ++.. -+-.+.+. ......++++. T Consensus 336 ~~~~~~~G--~~t~NE~R~~~g~~pi~~~~gD~~~~~~n~~~~~~-~~~~~kgge~~ 389 (395) T protein:vir:40 336 MDVLFHIG--VNTIDDNLRMIGREPVMSPETQERFVTKNYAPLGE-NEEDLKGGDIN 389 (395) T ss_pred HHHHHhCC--CCCHHHHHHHhCCCCCCCCCCceeeeccccccccc-cccccCCCCCC Confidence 45567777 999999999999999854 3221 11111111 11111222222 No 89 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=99.43 E-value=9.9e-14 Score=91.88 Aligned_cols=361 Identities=12% Similarity=0.062 Sum_probs=169.4 Q ss_pred HHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHH Q lcl|NC_016762. 22 MSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNK 101 (456) Q Consensus 22 d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~ 101 (456) |+|.+.+.+...+....+.-. +-... ....|.++..+.+||++++.++.+--+++...+.... ..+.+.+. T Consensus 1 Mg~f~~~f~~~~~~~~~~~~~-~~~~~----~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~~----~~l~~lL~ 71 (385) T protein:vir:95 1 MGLFDSVFKRHSELSWMYDLE-FLQDK----SKKAYLKQIALNTVVEMVARTISQSEFRVMKNNTKEK----GTLYYLLN 71 (385) T ss_pred CchhhhhhccCcccccccchh-hhhcc----chhhhhhhHHHHHHHHHHHHHHcccceeeeecCcccc----chHHHHHh Confidence 566544443322222111100 00111 1234567888999999999999988777754332221 11112221 Q ss_pred HHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccccccCCceeE Q lcl|NC_016762. 102 PLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMW 180 (456) Q Consensus 102 ~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y 180 (456) ..=....-+..|.+.+-+ -.++|.+++++ .+++..+ . +.+... +... .... +.+| T Consensus 72 ~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~-~~~~~~~---~-----------~~~~~~--~~~~--~~~~-----~~~~ 127 (385) T protein:vir:95 72 VRPNRNQNAVDFWQKFIFKLIMDNEVLVVK-NDEGHFF---V-----------ADDFEK--EDEL--GLYS-----HRFT 127 (385) T ss_pred cccCcCCCHHHHHHHHHHHHhhcCceEEEE-ecCCCee---e-----------cccccc--cccc--cccc-----ccce Confidence 111112233445444444 44567676654 3343211 0 000000 0000 0001 1122 Q ss_pred EEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHh Q lcl|NC_016762. 181 EYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLG 256 (456) Q Consensus 181 ~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 256 (456) .+... + .+..+.+-++.||||... ...|.|.++.+.+.+..... ...+.+..+.+ ++ +... T Consensus 128 ~~~~~---~-~~~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~------~~~~~~~~~g~-l~----~~~~ 192 (385) T protein:vir:95 128 NVLVN---D-FEFKRVFTMDDVIYLKYNNQKLDAFSLGLFEDYGEIFGRMID------LQMLNNQIRGI-LK----VDAT 192 (385) T ss_pred eeeec---c-cceeeeeccccEEEecCCCCCcccccchHHHHHHHHHHHHHH------HHHhcCCCceE-EE----eCCc Confidence 22110 0 001122334556666432 23488888877664422111 11222222211 11 1100 Q ss_pred hHHhhhcCCHHHHHHHHHHHHHHH----hcCC-CeEEecCCCceeEEecccC--------CHHHHHHHHHHHHHhhhcCC Q lcl|NC_016762. 257 EIASTYGVTLDALNERFNEAARQL----NRGN-DVLLPTQGATVTQMVSAVS--------DPGPTYNVNLQTAAAGVDIP 323 (456) Q Consensus 257 ~l~~~~~~~~~~~~~~~~~~~~~~----~~~~-~~~lid~~d~~~~~~~~~s--------gl~~~~~~~~~~~aaas~IP 323 (456) ....++..+++.+.++.+ .++. ..++++.+-+|+.++.... .+-+.......+||.+.||| T Consensus 193 ------~~~~~e~~~~~~~~~~~~~~g~~~~~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVp 266 (385) T protein:vir:95 193 ------KFYNKEKQKELQAYIDTLFDAFQNNTIAVVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVP 266 (385) T ss_pred ------cCCCHHHHHHHHHHHHHHhhhhhhcCCceEEcCCCceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCC Confidence 011123333333333332 2223 3455777888998875432 23445555677899999999 Q ss_pred eEEeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC---CCCceEEEeCCCCCCCHHHHHHHHH Q lcl|NC_016762. 324 TKILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP---LKAEFTAIWDDLTVPTKAERLANSK 399 (456) Q Consensus 324 ~t~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~---~~~d~~~~f~pL~~~seke~Aei~~ 399 (456) ..+|- |..+..+ -...||..+ |.|.+..+-..|-+.-+.+ ....+.|.+++|...+.+++++. T Consensus 267 p~~l~----~~~sn~e~~~~~~~~~~-------l~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~D~~~~~~~-- 333 (385) T protein:vir:95 267 PSLVL----GEMADLEKTIESYLQFC-------INPLLRKIEAELNSKFFYQDEYLNDDMHIKVVGIDKRDPLKLSEA-- 333 (385) T ss_pred HHHhc----CCCcCHHHHHHHHHHHH-------HHHHHHHHHHHHHhhcCChhhcccceEEEechhhhccCHHHHHHH-- Confidence 98773 2233223 345566543 7888877766664433322 12246777778988888876544 Q ss_pred HHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 400 TMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 400 ~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) ...+++.| ++++||+|+..+++|+.+...+.--.+.+-.+.....++|+. T Consensus 334 -----~~~~~~~g--~lt~NE~R~~~g~~p~~~~~gd~~~~~~n~~~~~~~kgge~~ 383 (385) T protein:vir:95 334 -----IDKLVASG--TFTRNQVRIMTGEEPADDPELDKFIITKNLQSADAFKGGESN 383 (385) T ss_pred -----HHHHHhCC--CcCHHHHHHHhCCCCCCCCCCceeeecccceecccccCCCCC Confidence 45677777 999999999999998743221111111111111111222222 No 90 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=99.43 E-value=6.4e-14 Score=92.93 Aligned_cols=343 Identities=15% Similarity=0.134 Sum_probs=169.1 Q ss_pred HHHhhhhhccCcccchhhhhccCccc-CCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhh--HHHHH Q lcl|NC_016762. 22 MSLLNQGIGHDAKRPQAWCEYGFPQE-ITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDE--TEWER 98 (456) Q Consensus 22 d~~~n~~~~~gt~~~~~~~~~~~~~~-~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~--~~~e~ 98 (456) |+|.|...+.+.+.... .... .++..-...| .+.++++||+.+|++.-.--+.+....+.+..... ..... T Consensus 1 Mg~f~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~-~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~~~~~~~~~~ 74 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNN-----DTQRVTAWQNEAVEY-TSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGS 74 (378) T ss_pred CCccccchhcccccccC-----CcceeeeeccchhHH-HHHHHHHHHHHHHhhhhhCceeeEEEcccCcccccccccccc Confidence 56655433221111000 0111 1111111223 45678999999999998766665432222211110 00011 Q ss_pred HHHHHHHH----hhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccccc Q lcl|NC_016762. 99 KNKPLIAG----GRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSET 173 (456) Q Consensus 99 ~i~~~~~~----l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~ 173 (456) .+..+++. .--+..|.+.+.+ -.++|-+++++..++ . .+.+.++.|.+ T Consensus 75 ~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~-~---------~g~~~~l~p~~----------------- 127 (378) T protein:vir:94 75 DLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDD-N---------TGELLDLLFAD----------------- 127 (378) T ss_pred hHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeC-C---------CceEEEEEecC----------------- Confidence 23333321 1123445555555 455676776654332 1 12233332211 Q ss_pred cCCceeEEEeecccCCccccceeeehhhhheecC--CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhh Q lcl|NC_016762. 174 YGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD--WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDK 251 (456) Q Consensus 174 yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~--~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 251 (456) .++.++++.+|||.. +...|.|.++.+...+...- +++.-...++... T Consensus 128 -------------------~~~~~~~~diiH~~~~~~~~~g~s~l~~~~~~i~~~~-----------~~~~~~gil~~~~ 177 (378) T protein:vir:94 128 -------------------DKKEYKPEELVRLTSPFYINEDTSILDNALASIQTKL-----------EQGKLRGLLKINA 177 (378) T ss_pred -------------------CeeEeeeeeeEEecCcCCccchhHHHHHHHHHHHHHH-----------hcccccceeeeCC Confidence 123455666666642 22347777776665543221 1110000111110 Q ss_pred hccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC---CeEEecCCCceeEEecccCCHH-HHHHHHHHHHHhhhcCCeEEe Q lcl|NC_016762. 252 EINLGEIASTYGVTLDALNERFNEAARQLNRGN---DVLLPTQGATVTQMVSAVSDPG-PTYNVNLQTAAAGVDIPTKIL 327 (456) Q Consensus 252 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~lid~~d~~~~~~~~~sgl~-~~~~~~~~~~aaas~IP~t~L 327 (456) . +. .....+..+++.+.++....+. +.++++.+.+|++++.+....+ ........+||.+.|||..+| T Consensus 178 ~-----l~---~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVP~~~l 249 (378) T protein:vir:94 178 F-----LD---IDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENIL 249 (378) T ss_pred c-----CC---HHHHHHHHHHHHHHHHHhhcccccccceecCCCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHh Confidence 0 00 0112334455555554443322 3577777889999887766554 233445678999999999877 Q ss_pred eccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC----------CCCceEEEeCCCCCCCHHHHHHH Q lcl|NC_016762. 328 VGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP----------LKAEFTAIWDDLTVPTKAERLAN 397 (456) Q Consensus 328 ~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~----------~~~d~~~~f~pL~~~seke~Aei 397 (456) -|. ++.+...+||.. -|.|.+..+-.-|-+.-+-+ ...++.|++..|...+.+++++. T Consensus 250 ~~~-----~se~~~~~f~~~-------tL~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d~~~~~~~ 317 (378) T protein:vir:94 250 LGT-----ASQEQQIYFYNS-------TIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDL 317 (378) T ss_pred cCC-----hHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeecchhhhhcCHHHHHHH Confidence 331 222334556533 57888877666553332211 11247788889998888877554 Q ss_pred HHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc-------ccCCCCCCCCCcC----CCCCCC Q lcl|NC_016762. 398 SKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP-------DTEPEDEDAARTD----PTGEQQ 456 (456) Q Consensus 398 ~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~-------~~~~~d~~~~~~d----~~~~~e 456 (456) ..++++.| +++++|+|+..+++|+++++..- .....+.+..+.+ ++..+| T Consensus 318 -------~~~~~~~G--~~T~NE~R~~~gl~p~~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 318 -------YHENINGP--IFTQNQLLVKMGEQPIEGGDVYIANLNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred -------HHHHHhCC--CcCHHHHHHHhCCCCCCCCCeeeecccccccccchhhcCCcCCCCCCCCCCCC Confidence 45667777 99999999999999998765421 0111111211212 222222 No 91 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=99.43 E-value=1.3e-12 Score=85.70 Aligned_cols=400 Identities=10% Similarity=0.021 Sum_probs=184.3 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCE Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQ 80 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~ 80 (456) |.+.+.-|...+...+.. ..+.+.+ .+|.. + .+++.. ...-....|..++.+.++|++++++.-.--+. T Consensus 1 ~~~~~~~~~~~~~~~~~~-~~~~~~~---~~g~~----~--~~~~~~-~~~~~~~~a~~~~~v~~~v~~ia~~iA~lp~~ 69 (460) T protein:vir:10 1 MANRIIRALRELTGLDNK-FNDAFIK---YIGQT----F--TKYDNN-GKTYLEQGYNINPDVYSCISQMAAKTVAVPYT 69 (460) T ss_pred CchhHHHHHhhhhccCCC-chHHHHH---hhccc----c--CCCccc-hhhhhHHHHhcchHHHHHHHHHHHhhhhCceE Confidence 777666654322111100 0111111 01100 0 011111 01123445778899999999999998665556 Q ss_pred EecCCCcchhhhhH-------------------------HHHHHHHHHHHH---hhHHHHHHHHHHh-hcccCceEEEEE Q lcl|NC_016762. 81 VIEGDDQDRSKDET-------------------------EWERKNKPLIAG---GRFWRAVSEADRR-RLVGRYSGLLLH 131 (456) Q Consensus 81 i~~~~~~d~~~~~~-------------------------~~e~~i~~~~~~---l~~~~~~~ea~~~-~r~~Ggs~i~i~ 131 (456) +.....+...+... ..+..+..++.+ +.-+..|.+.+-+ -.++|-+++++. T Consensus 70 v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~ 149 (460) T protein:vir:10 70 IKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLM 149 (460) T ss_pred EEeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEE Confidence 54333221110000 000001111111 1123344444444 456677766554 Q ss_pred ecCCCCccccccCCcCceeEEEEeccccCChhhhh-ccccccccCCceeEEEeecccCCccccceeeehhhhheecC--- Q lcl|NC_016762. 132 IRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFD-EKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD--- 207 (456) Q Consensus 132 i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~-~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~--- 207 (456) - ++..- ..+-+..+.|+-...+++.... ..+....|+ -..|.+. .+ +....+.++.||||.. T Consensus 150 r-~~~~~------~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~-~~~~~~~---~~---g~~~~~~~~evih~r~~~~ 215 (460) T protein:vir:10 150 S-PDDGI------NAGVPSQMYVLPAHLIKIVLKDDINLLSTDSP-IKSYMLI---QG---DQFIEFNEDEVIHTKYANP 215 (460) T ss_pred e-cCCCc------cCceeEEEEEEcCceEEEEEcCCCceeeeeee-eeEEEEe---cC---ceeEEecccceEEEecCCC Confidence 3 21110 1111334444444333332111 111111111 1122222 11 2246788889988842 Q ss_pred ------CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHh Q lcl|NC_016762. 208 ------WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLN 281 (456) Q Consensus 208 ------~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 281 (456) ....|.|.++.+.+.+..... ....+..++++....-.+. +.+ +.-.++..+++.+.+.... T Consensus 216 ~~~~~~~~~~G~sp~~~~~~~i~~~~~-~~~~~~~~f~ng~~~~~i~---~~~--------~~l~~e~~~~~~~~~~~~~ 283 (460) T protein:vir:10 216 NFDLQGSHLYGMSPIRAILRNINSQNS-TIDNNVKTMQNGGVFGFIH---GGS--------TGLTQPQADSLKQRLTEMD 283 (460) T ss_pred CcccccCccccccHHHHHHHHHHHHHH-HHHHHHHHHhcCCCcceee---ecC--------CCCCHHHHHHHHHHHHHHh Confidence 123589999998876654433 3445555666643221111 000 1111233445555555544 Q ss_pred cC---C-CeEEecCCCceeEEecccCC--HHHHHHHHHHHHHhhhcCCeEEeeccCCC-ccc--chH-HHHHHHHHHHHH Q lcl|NC_016762. 282 RG---N-DVLLPTQGATVTQMVSAVSD--PGPTYNVNLQTAAAGVDIPTKILVGMQTG-ERA--SSE-DQKYHNARCQAR 351 (456) Q Consensus 282 ~~---~-~~~lid~~d~~~~~~~~~sg--l~~~~~~~~~~~aaas~IP~t~L~G~sp~-Gln--st~-D~~nyyd~I~~~ 351 (456) ++ . ..++++.+-+|+.++.+... +-+......++||.+.|||-. ++|...+ ..+ ..+ -...||.. T Consensus 284 ~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~t~~~sn~e~~~~~f~~~---- 358 (460) T protein:vir:10 284 KSPDRLSQIAGASGEIAFTKISLNTDELKPFDYLKYDQKAICNALGWSDK-LLNNNEGGGLNTGNLEEERKRVVTD---- 358 (460) T ss_pred cCccccCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHH-HhCCCCCCCCccccHHHHHHHHHHH---- Confidence 33 2 34566667789888876544 345566677899999999987 6665443 333 223 34566654 Q ss_pred HHhhhhHHHHHHHHHHHHhcCc--CCCCceEEEeC--CCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcc Q lcl|NC_016762. 352 RVQELTFEINDLFAHLMRIGVV--PLKAEFTAIWD--DLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGY 427 (456) Q Consensus 352 Qe~~lrp~L~~l~~~l~~s~~~--~~~~d~~~~f~--pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~ 427 (456) -|.|.+..+-+.|-+.-+- ....++.|+|+ .|..+.+ + .++.+ .++..| ++|++|+|+..++ T Consensus 359 ---~l~P~~~~ie~~ln~kl~~~~~~~~~~~i~~d~~~l~~l~~----d-~~~~~----~~~~~g--~~T~NE~R~~~g~ 424 (460) T protein:vir:10 359 ---NIQPDLVILKQAFDKKFIKRFKGYENAVIEWDISELPEMQT----D-MVAMA----SWLNTI--PVTPNEIRIAMKY 424 (460) T ss_pred ---HHHHHHHHHHHHHHHhhcCcccccCCceEEeecchhhhHHH----H-HHHHH----HHHhCC--CCCHHHHHHHhCC Confidence 3567776665554332221 12334555553 3321111 1 12222 355667 9999999999999 Q ss_pred cCCCC--CCCCc---ccCCCC--CCCCCcCCCCCCC Q lcl|NC_016762. 428 DPLQG--GDPLP---DTEPED--EDAARTDPTGEQQ 456 (456) Q Consensus 428 ~~~~~--~~~~~---~~~~~d--~~~~~~d~~~~~e 456 (456) +|+.+ ++..- .-.+.+ .+...+++.-++| T Consensus 425 ~pi~~~~gD~~~~~~n~~~~~~~~~~~~~~~~nq~~ 460 (460) T protein:vir:10 425 ETLNQDGMDIVFMPSNKVRIDDVSNNLIDSAFNQNQ 460 (460) T ss_pred CCCCCCCCCeeeecccccchhhcccccCCCcccCCC Confidence 98743 22110 111111 1222222222333 No 92 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=99.43 E-value=3.9e-13 Score=88.62 Aligned_cols=363 Identities=13% Similarity=0.049 Sum_probs=175.7 Q ss_pred HHHhhhhh-ccCcc----cch--hhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchh---h Q lcl|NC_016762. 22 MSLLNQGI-GHDAK----RPQ--AWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRS---K 91 (456) Q Consensus 22 d~~~n~~~-~~gt~----~~~--~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~---~ 91 (456) ++|.|... .++-+ ++. .++.+.+.... -...|..++.+.++|+.+|+.+-.--+++......... . T Consensus 1 mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----t~~~~~~~~~v~~cv~~Ia~~ia~~p~~v~~~~~~~~~~~~~ 76 (403) T protein:vir:10 1 MGFKSWITEKLNPGQRIIRDMEPVSHRTNRKPFT----TGQAYSKIEILNRTANMVIDSAAECSYTVGDKYNIVTYANGV 76 (403) T ss_pred CcchhhhhhccchhhhhhhcccccccccCCcccc----cHHHHHHHHHHHHHHHHHHHHHhhCceeEeeccccccccccc Confidence 55555332 22211 111 11111111111 22556788999999999999887766666432211110 0 Q ss_pred hhHHHHHHHHHHHHHhhHHHHHHHHHHhhc-ccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhcccc Q lcl|NC_016762. 92 DETEWERKNKPLIAGGRFWRAVSEADRRRL-VGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPD 170 (456) Q Consensus 92 ~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r-~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~ 170 (456) ....+...++..=....-+..|.+.+-+.+ ++|-+++++ ++.. |..+ |.....+.+ | T Consensus 77 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~---~~~~-----------l~~l-~~~~~~v~~-----~-- 134 (403) T protein:vir:10 77 KTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYW---DGTS-----------LYHV-PAALMQVEA-----D-- 134 (403) T ss_pred ccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEE---eCce-----------eEee-cCcceEEEE-----c-- Confidence 111121222211111233455666655544 556666554 2222 1111 111111111 1 Q ss_pred ccccCCceeEEEeecccCCccccceeeehhhhheecC--------CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_016762. 171 SETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD--------WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAA 242 (456) Q Consensus 171 s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~--------~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~ 242 (456) -+...++.+.. .+..+.++++++|.. ....|.|.++.+...+.....+.. .+..++++.. T Consensus 135 ---~~~~~~~~~~~--------~~~~~~~~eiih~~~~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~-~~~~~f~ng~ 202 (403) T protein:vir:10 135 ---ANKFIKKFIFN--------NQINYRVDEIIFIKDNSYVCGTNSQISGQSRVATVIDSLEKRSKMLN-FKEKFLDNGT 202 (403) T ss_pred ---CCceEEEEEec--------CceeecccceEEecccccccCCCCCcccccHHHHHHHHHHHHHHHHH-HHHHHHhccC Confidence 01111122211 112233455655532 234589999999887766544433 3345566654 Q ss_pred hhhhh-hhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC---CC-eEEecCCCceeEEecccC--C--HHHHHHHHH Q lcl|NC_016762. 243 RQLLL-NFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG---ND-VLLPTQGATVTQMVSAVS--D--PGPTYNVNL 313 (456) Q Consensus 243 ~~l~~-~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---~~-~~lid~~d~~~~~~~~~s--g--l~~~~~~~~ 313 (456) +.-.+ +.... + .++..+++.+.+....++ .+ .++++.+-+|+.++.+.+ + +-+...... T Consensus 203 ~~~gil~~~~~-----l-------~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~~~~~~~d~q~~e~~~~~~ 270 (403) T protein:vir:10 203 VIGLILETDEI-----L-------NKKLRERKQEELQLDYNPSTGQSSVLILDGGMKAKPYSQISSFKDLDFKEDIEGFN 270 (403) T ss_pred CcceEEEeCCC-----C-------CHHHHHHHHHHHHHHhCCcccCcceeecCCCceeEEecccCCHHHHHHHHHHHHHH Confidence 33222 11111 1 123334444444443333 22 456666778888875433 3 345555667 Q ss_pred HHHHhhhcCCeEEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCC--CCCCH Q lcl|NC_016762. 314 QTAAAGVDIPTKILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDL--TVPTK 391 (456) Q Consensus 314 ~~~aaas~IP~t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL--~~~se 391 (456) ..||.+.|||.. ++|.+.. =|..+....||.. -|.|.+.++-+.|-+. ++ ..+.|+++.+ ..++. T Consensus 271 ~~Ia~~fgVPp~-~lg~~~~-sn~e~~~~~f~~~-------tl~P~~~~ie~~l~~~-L~---~~~~~d~~~~~~l~~D~ 337 (403) T protein:vir:10 271 KSICLAFGVPQV-LLDGGNN-ANIRPNIELFYYM-------TIIPMLNKLTSSLTFF-FG---YKITPNTKEVAALTPDK 337 (403) T ss_pred HHHHHHhCCCHH-HcCCCCC-cCHHHHHHHHHHH-------HHHHHHHHHHHHHHHh-cC---ceeeeccchhhhcccCH Confidence 889999999996 4564221 1112334556643 3778887776666442 33 2566777755 44444 Q ss_pred HHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCC--------CCCCCcCC--CCCCC Q lcl|NC_016762. 392 AERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPED--------EDAARTDP--TGEQQ 456 (456) Q Consensus 392 ke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d--------~~~~~~d~--~~~~e 456 (456) + +++++..+++..| ++++||+|+..+++|+++..-+.--.+.+ ...+.++| +.+-| T Consensus 338 ~-------~~~~~~~~~~~~G--~lT~NE~R~~~gl~pi~~~~~d~~~~p~n~~~~~~~~~~~e~~~~~~~~~g~ 403 (403) T protein:vir:10 338 E-------AEAKHLTSLVNNG--IITGNEARSELNLEPLDDEQMNKIRIPANVAGSATGVSGQEGGRPKGSTEGD 403 (403) T ss_pred H-------HHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCcccccccccccccccccccCCCCcCCCCCCCcCCC Confidence 4 3466667788888 99999999999999985422111111111 11111122 11112 No 93 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=99.42 E-value=6.2e-14 Score=92.98 Aligned_cols=343 Identities=15% Similarity=0.125 Sum_probs=167.4 Q ss_pred HHHhhhhhccCcc-cchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhH--HHHH Q lcl|NC_016762. 22 MSLLNQGIGHDAK-RPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDET--EWER 98 (456) Q Consensus 22 d~~~n~~~~~gt~-~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~--~~e~ 98 (456) |+|.|-..+.+.. +...- ....+++. +..+..+..+.+||+++|++.-.--+.+....+.+...... .... T Consensus 1 Mg~f~~~~~f~~~~~~~~~-----~~~~~~~~-~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~ 74 (378) T protein:vir:93 1 MNLFGKVVSFSRGKLNNDT-----QRVTAWQN-EAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGS 74 (378) T ss_pred CccchhhhhhhccccCCCc-----ceeeeccc-chhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccccccccccccccc Confidence 5665543322111 11000 00111111 11223456788999999999987777664332221111100 0001 Q ss_pred HHHHHHH----HhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccccc Q lcl|NC_016762. 99 KNKPLIA----GGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSET 173 (456) Q Consensus 99 ~i~~~~~----~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~ 173 (456) .+..++. ..--...|.+.+-+ -.++|.+++++.. ++.. +.+.++.|.+ T Consensus 75 ~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~-~~~~---------g~~~~l~~~~----------------- 127 (378) T protein:vir:93 75 DLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVF-DDNT---------GELLDLLFAD----------------- 127 (378) T ss_pred hHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEe-ecCC---------ceEEEEEecC----------------- Confidence 1223322 11123345554444 4456777765543 3221 2233332211 Q ss_pred cCCceeEEEeecccCCccccceeeehhhhheecCC--cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhh Q lcl|NC_016762. 174 YGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW--TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDK 251 (456) Q Consensus 174 yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~--~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 251 (456) .++.+.++.+|||... ...|.|.++.+...+..+ ++++.....++... T Consensus 128 -------------------~~~~~~~~diih~r~~~~~~~~~s~l~~~~~~i~~~-----------~~~~~~~g~l~~~~ 177 (378) T protein:vir:93 128 -------------------DKKEYKTEELVRLTSPFYINEDTSILDNALASIQTK-----------LEQGKLRGLLKINA 177 (378) T ss_pred -------------------CeeEeccceeEEecCccccchhhHHHHHHHHHHHHH-----------HhcCcccceeeeCC Confidence 0234556666666421 234667666655444221 11111000111100 Q ss_pred hccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC---CCeEEecCCCceeEEecccCCHH-HHHHHHHHHHHhhhcCCeEEe Q lcl|NC_016762. 252 EINLGEIASTYGVTLDALNERFNEAARQLNRG---NDVLLPTQGATVTQMVSAVSDPG-PTYNVNLQTAAAGVDIPTKIL 327 (456) Q Consensus 252 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~lid~~d~~~~~~~~~sgl~-~~~~~~~~~~aaas~IP~t~L 327 (456) . +. .....+..+++.+.++.+..+ .+.++++.+.+|+.++.+....+ ........+||.+.|||..+| T Consensus 178 ~-----l~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l 249 (378) T protein:vir:93 178 F-----LD---IDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENIL 249 (378) T ss_pred c-----CC---HHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHh Confidence 0 00 011234455555555544432 24577777889999887766543 334556778999999998877 Q ss_pred eccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC----------CCCceEEEeCCCCCCCHHHHHHH Q lcl|NC_016762. 328 VGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP----------LKAEFTAIWDDLTVPTKAERLAN 397 (456) Q Consensus 328 ~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~----------~~~d~~~~f~pL~~~seke~Aei 397 (456) -|. ++.+-..+||. .-|.|.+..+-+.|-+.-+-+ -..++.|.++.|...+.+++++. T Consensus 250 ~g~-----~~e~~~~~f~~-------~tl~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~~~~ 317 (378) T protein:vir:93 250 LGT-----ATQEQQIYFYN-------STIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDL 317 (378) T ss_pred cCC-----cHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeeccchhhhcCHHHHHHH Confidence 331 12223444543 357777776666553322211 12347788889999988877554 Q ss_pred HHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc-------ccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 398 SKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP-------DTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 398 ~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~-------~~~~~d~~~~~~d~~~~~e 456 (456) ...++..| +++++|+|+..+++|+++++..- .+...+.+..+.+....+| T Consensus 318 -------~~~~~~~G--~~t~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e 374 (378) T protein:vir:93 318 -------YHENINGP--IFTQNQLLVKMGEQPIEGGDVYIANLNAVAVKNLSDLQGSRKDVTSTDE 374 (378) T ss_pred -------HHHHHhCC--CcCHHHHHHHhCCCCCCCCCeeeeccccccccchhhhcCccCCCCCCCC Confidence 56677777 99999999999999997755311 1111111222222222222 No 94 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=99.42 E-value=5.4e-14 Score=93.33 Aligned_cols=356 Identities=10% Similarity=-0.020 Sum_probs=170.9 Q ss_pred HHHhhhhhccCcccchhhh--hccC-----------cccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcc Q lcl|NC_016762. 22 MSLLNQGIGHDAKRPQAWC--EYGF-----------PQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQD 88 (456) Q Consensus 22 d~~~n~~~~~gt~~~~~~~--~~~~-----------~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d 88 (456) |+|.+... .+........ -+++ ....+. .-+.++..+.+||+++|+++-.--+++...... T Consensus 1 Mglf~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~----~~al~~~~V~~~i~~Ia~~ia~l~~~~~~~~~~- 74 (384) T protein:vir:49 1 MPIFNITN-LATESPPSNQDSFFDITDPEFLDALNGSEWVSA----ETALKNSDLFSIISQLSNDLATAKITTSRKQLQ- 74 (384) T ss_pred Cccccccc-cCcccccccchhhccccchhhcccccCCceech----hhhhccHHHHHHHHHHHHHHhhCceeeecchhh- Confidence 44433210 0000000000 0000 011121 124568889999999999998777766422110 Q ss_pred hhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhh-cccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhc Q lcl|NC_016762. 89 RSKDETEWERKNKPLIAGGRFWRAVSEADRRR-LVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDE 167 (456) Q Consensus 89 ~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~-r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~ 167 (456) .|...=..+.-+..|.+.+-+. .++|-+++++.- |.. +.+..+.|+....+++.. T Consensus 75 ----------~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r-~~~----------g~~~~L~~l~~~~v~v~~--- 130 (384) T protein:vir:49 75 ----------GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWR-NEN----------GRDMKWEYLRPSQVSFNR--- 130 (384) T ss_pred ----------hhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEE-CCC----------CcEEEEEEEcCceeEEEE--- Confidence 1211111222344455555554 456766666543 321 113344454443343321 Q ss_pred cccccccCCceeEEEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_016762. 168 KPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAAR 243 (456) Q Consensus 168 Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~ 243 (456) + ++. ...+|++.... +..+..+.++++.||||.... ..|.|.++.+.+.+.....+.. ....++++... T Consensus 131 ~---~~~-~~~~y~~~~~~--~~~~~~~~~~~~eVih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~-~~~~~~~ng~~ 203 (384) T protein:vir:49 131 L---DNQ-NGLYYNITFDD--PRIPPKQHVPQGDILHFRLLSVDGGLTSVSPLMALGRELNIQKASDK-LTLNALKNALN 203 (384) T ss_pred c---CCC-ceEEEEEEecC--ccccceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHH-HHHHHHhccCC Confidence 1 122 23456675322 122334678899999986533 3589999999887765554443 33345555432 Q ss_pred hhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEecccCCH--HHHHHHHHHHHHhhh Q lcl|NC_016762. 244 QLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSAVSDP--GPTYNVNLQTAAAGV 320 (456) Q Consensus 244 ~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~~sgl--~~~~~~~~~~~aaas 320 (456) .-. ++.... ...++..+...+.........+.++++.+.+|++++.+.... -+......++||.+. T Consensus 204 ~~~il~~~~~-----------~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~f 272 (384) T protein:vir:49 204 ANGILKIKGG-----------GLLDFKTKQSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVY 272 (384) T ss_pred CceEEEeCCC-----------CChHHHHHHHHHHHhcccCCccceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHh Confidence 211 111100 111122222222211112223456777788999988776544 466677889999999 Q ss_pred cCCeEEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhc-------CcCCCCceEEEeCCCCCCCHHH Q lcl|NC_016762. 321 DIPTKILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIG-------VVPLKAEFTAIWDDLTVPTKAE 393 (456) Q Consensus 321 ~IP~t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~-------~~~~~~d~~~~f~pL~~~seke 393 (456) |||..+| |...++-+..+..+.+|..+ . ...|+|.++.+-..|.+.- ..+.+..+.|.++.|...+-++ T Consensus 273 gVp~~~l-g~~~~~~~~~~~~~~~~~~~--i-~~~l~pi~~~i~~~l~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t 348 (384) T protein:vir:49 273 GIPESVV-GGEGDKQSSLEMIYNIYFKA--V-SRFLRPFVSELSKKLSCEVDADILPAVDPTGSNYIGLINSMVKTGTLA 348 (384) T ss_pred CCCHHHh-CCCCCccccHHHHHHHHHHH--H-HHHHHHHHHHHHHHhchhhhhhhhhhhhccchHHHHHHHHHhhcCccc Confidence 9998755 54333322222333332211 1 1234454444433332210 0111123444445555555444 Q ss_pred HHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCccc Q lcl|NC_016762. 394 RLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDT 439 (456) Q Consensus 394 ~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~ 439 (456) ++ ++...+...| .. ++|+|+..++.|+++++..+.= T Consensus 349 ~~-------e~~~~l~~~g--~~-~ne~r~~~~~~p~~gGd~~~~~ 384 (384) T protein:vir:49 349 QN-------QGLYVLQQAE--IL-PKDLPEGETDSTLKGGETNEQY 384 (384) T ss_pred HH-------HHHHHHhhCC--CC-ChhHHHHcCCCCCCCCCCCCCC Confidence 43 3444455556 44 4889998888887665433221 No 95 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=99.42 E-value=3.1e-13 Score=89.16 Aligned_cols=364 Identities=11% Similarity=-0.002 Sum_probs=175.7 Q ss_pred HHHhhhhhccCccc-----chhhhhccCcccCCHH-HHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHH Q lcl|NC_016762. 22 MSLLNQGIGHDAKR-----PQAWCEYGFPQEITFN-DLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETE 95 (456) Q Consensus 22 d~~~n~~~~~gt~~-----~~~~~~~~~~~~~~~~-~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~ 95 (456) |+|.+...+--.+. +-....+. ....... --...|.+++.+.++|+++|+++-.--+++...... T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~~~-------- 71 (382) T protein:vir:48 1 MPIFNLATESPPDNQGGFFDVVDSDFL-ASLKGNEWVSAETALRNSDLFSIINQLSNDLATVKLITSRKKLQ-------- 71 (382) T ss_pred CccccccccCCcccccccccchhhhcc-ccccCCcccchHhhhccHHHHHHHHHHHHhhccCceeeecchhh-------- Confidence 44433221110000 00001111 0111111 112235678899999999999997666666432211 Q ss_pred HHHHHHHHHHHhhHHHHHHHHHHhh-cccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhcccccccc Q lcl|NC_016762. 96 WERKNKPLIAGGRFWRAVSEADRRR-LVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETY 174 (456) Q Consensus 96 ~e~~i~~~~~~l~~~~~~~ea~~~~-r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~y 174 (456) .|...=..+.-+..|.+.+.+. .++|-+++++. +|.. +.+..+.|+-...+++.. + .. T Consensus 72 ---~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~-rd~~----------G~~~~l~~i~~~~v~v~~---~----~~ 130 (382) T protein:vir:48 72 ---GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRW-RNEN----------GRDMKWEYLRPSQVSFNR---L----DN 130 (382) T ss_pred ---hhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEE-ECCC----------CcEEEEEEEcCceeEEEE---c----CC Confidence 1222222233445566666654 44566666553 3321 123344444433333311 1 23 Q ss_pred CCceeEEEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhhhh Q lcl|NC_016762. 175 GQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQ-LLLNF 249 (456) Q Consensus 175 g~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~-l~~~~ 249 (456) |...+|+|.... ...+..+.++++.||||.... ..|.|.++.+.+.+..... .......++++.... ..++. T Consensus 131 ~~~~~y~~~~~~--~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~-~~~~~~~~~~ng~~p~~il~~ 207 (382) T protein:vir:48 131 KDGIYYNITFDD--PRIPPKQHVPQNDVLHFRLLSVDGGMTSVSPLMALSRELDIQKA-SGNLTINSLKNALNANGILKI 207 (382) T ss_pred CCeEEEEEEecC--ccccceeEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHH-HHHHHHHHHhccCCCceEEEe Confidence 445567775321 112234668888999885432 4689999999887754433 344444555654322 22222 Q ss_pred hhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCC-eEEecCCCceeEEecccCCH--HHHHHHHHHHHHhhhcCCeEE Q lcl|NC_016762. 250 DKEINLGEIASTYGVTLDALNERFNEAARQLNRGND-VLLPTQGATVTQMVSAVSDP--GPTYNVNLQTAAAGVDIPTKI 326 (456) Q Consensus 250 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~lid~~d~~~~~~~~~sgl--~~~~~~~~~~~aaas~IP~t~ 326 (456) ... ...+..+++.+....+.++.+ .++++.+.+|+.++.+...+ -+......++||.+.|||-.. T Consensus 208 ~~~------------~~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~ 275 (382) T protein:vir:48 208 KGG------------GLLDFKTKLSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNV 275 (382) T ss_pred CCC------------CChHHHHHHHHHHHhhccCCCCeeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHH Confidence 111 111222233223333344444 46666778899988766543 366667788999999999765 Q ss_pred eeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH Q lcl|NC_016762. 327 LVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINS 406 (456) Q Consensus 327 L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~ 406 (456) | |.+..+-+..+-.+.||. ..|.|.+..+-+.|-+.-+.+ +.+...+...++....+ .... T Consensus 276 l-g~~~~~~~~~~~~~~~~~-------~~l~p~~~~i~~~l~~~l~~~----~~~~~~~~~~~~~~~~~-------~~~~ 336 (382) T protein:vir:48 276 V-GGQGDQQSSLEMSSDLYS-------KAVSRYLRPFLSELSQKLSCD----VDADIFPAVDPTGSNYI-------SRIN 336 (382) T ss_pred h-CCCCCcccHHHHHHHHHH-------HHHHHHHHHHHHHHHHHhcCh----hhhhhhhhhccchhHHH-------HHHH Confidence 5 543222122233445553 446777776666554332211 11111111222222211 1123 Q ss_pred HHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 407 AAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 407 ~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) .++..| +++++|+|+.+...+... ++.+..++..++.++||+. T Consensus 337 ~l~~~g--~~t~~e~r~~l~~~g~~~-----~~~~~~~~~~~~~~GGd~~ 379 (382) T protein:vir:48 337 SLVKTG--TLAQNQGLYILQQAEILP-----KELPNGENPNSTLKGGEED 379 (382) T ss_pred HHhhcC--ccCHHHHHHHHhhCCCCC-----cchhhhhcCCCCCCCCCCC Confidence 556667 999999999875544321 1122223332333444433 No 96 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=99.42 E-value=6.5e-13 Score=87.40 Aligned_cols=396 Identities=11% Similarity=-0.009 Sum_probs=177.8 Q ss_pred CCchhHHHHhHHHHH-HHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCC Q lcl|NC_016762. 1 MTDKLDLAVNHAMSS-AIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNP 79 (456) Q Consensus 1 ~~~~~~~~~~~a~~~-~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~ 79 (456) |..-+-..... .+. ++.. ......+ .-++.+ . +|...+++..|.++|+.|.+++.||+++++++-.-.+ T Consensus 1 ~~~~~~~~~~~-~~~~~~~~---~~~~~~~----~~~~~~-~-~~~pp~~~~~La~~~~~n~~v~scI~~ia~~ia~~~~ 70 (540) T protein:vir:41 1 MFNYHLSIKSL-EKYRAIKG---DTDSQAL----KEDRFE-E-YVEPKVHPLVLLSLLQVNPYHASACSIKANDILRTGY 70 (540) T ss_pred CCCcccChhhc-cchhhhhc---ccccccc----ccCCCC-c-cccCCCCHHHHHHHHHhcHHHHHHHHHHHHHHhcCCc Confidence 44433222110 000 0000 0000000 111111 1 2445678999999999999999999999999999888 Q ss_pred EEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHH-hhcccCceEEEEEecC-CCCccccccCCcCceeEEEEecc Q lcl|NC_016762. 80 QVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADR-RRLVGRYSGLLLHIRD-SQPWDRPARGKLNGLAKVTPAWA 157 (456) Q Consensus 80 ~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~-~~r~~Ggs~i~i~i~D-~~~~~~Pl~~~~~~l~~i~~~~~ 157 (456) .+...+.. ..+. .-+..+. +..|.+.+. .-.++|-+++++.-++ |+ +..+.|+.. T Consensus 71 ~i~~~~~~-~~~~------lpN~~~t----~~~f~~~~v~dlll~Gnayv~i~r~~~G~------------~~~L~~i~~ 127 (540) T protein:vir:41 71 LIDGDDGG-VEEL------LRACRPS----FEFILLQALEDLQVFNYCTLEVVRDDQGE------------PVRLDYIPA 127 (540) T ss_pred eEecCccc-hhhh------ccCCCCC----HHHHHHHHHHHHHhcCCeEEEEEECCCCc------------EEEEEEeCC Confidence 88543321 1000 0011111 222333333 3566787777664322 22 222222222 Q ss_pred ccCChhh----hh--cc----ccccccCCceeEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHH Q lcl|NC_016762. 158 GCLKPKS----FD--EK----PDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSF 223 (456) Q Consensus 158 ~~~~~~~----~~--~D----p~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l 223 (456) ..+.+.. +. .| .....|+.+..+.. .+| .....+.++-||||... ...|.|.+..+...+ T Consensus 128 ~~V~v~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~----~~g--~~~~~~~~~eViHir~~~~~~~~~G~Spi~~~~~~i 201 (540) T protein:vir:41 128 HTVRVHRDGSRYMQTWDGIHVTYFKDYRYEGEVNP----DNG--EDQDGVGANEIIFIHLPSPICSYYGVPRYLSAAPSI 201 (540) T ss_pred cceEEeEcCceeEeeecCceeeeeecccccceeec----ccc--ccceeecccceEEecCCCCCCCcccccHHHHHHHHH Confidence 2222110 00 01 11112222211111 011 11345667778887533 346999999888766 Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhh-hhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHH----hcCCCeEEec-------C Q lcl|NC_016762. 224 ISLEKVEGGSGESFLKNAARQLLL-NFDKEINLGEIASTYGVTLDALNERFNEAARQL----NRGNDVLLPT-------Q 291 (456) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~l~~-~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~lid-------~ 291 (456) .... .+......+|++....-.+ +.....+ +-............+++...+... ..+-+..++. . T Consensus 202 ~~~~-~~~~~~~~~f~Ng~~p~giL~~~g~l~--~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~ 278 (540) T protein:vir:41 202 LAMQ-KIDEYNYAFFDNYTIPSYVITVTGEFE--DEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTV 278 (540) T ss_pred HHHH-HHHHHHHHHHhccCCCceEEEeCcccC--chhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCccc Confidence 5443 3445555667765432211 1110000 000000001112223333322221 1233334442 2 Q ss_pred CCceeEEecccCCH--HHHHHHHHHHHHhhhcCCeEEeeccCCC-ccc--chH-HHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_016762. 292 GATVTQMVSAVSDP--GPTYNVNLQTAAAGVDIPTKILVGMQTG-ERA--SSE-DQKYHNARCQARRVQELTFEINDLFA 365 (456) Q Consensus 292 ~d~~~~~~~~~sgl--~~~~~~~~~~~aaas~IP~t~L~G~sp~-Gln--st~-D~~nyyd~I~~~Qe~~lrp~L~~l~~ 365 (456) +-+|+.++.+..+. -+......+.||++.+||-. ++|...+ ..| ..+ -...||..+ |.|.++++-. T Consensus 279 g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~-~lG~~~~~~~n~sn~eq~~~~f~~~t-------L~P~~~~ie~ 350 (540) T protein:vir:41 279 EVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPY-RLGITDVGPLGGNFAEVARRTYYESV-------VRPQQEIVSS 350 (540) T ss_pred ceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHH-HcCcccCCCCCcccHHHHHHHHHHHH-------HHHHHHHHHH Confidence 33566666554433 45666678889999999987 5586543 344 223 356677543 6777777755 Q ss_pred HHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHh-cccCCCCC-C-----CCcc Q lcl|NC_016762. 366 HLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEA-GYDPLQGG-D-----PLPD 438 (456) Q Consensus 366 ~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~-~~~~~~~~-~-----~~~~ 438 (456) .|-+.-+-....++.|+|+.-.-+.. +.. .....++..| ++|++|+|+.+ +.+|.++. . ...+ T Consensus 351 ~ln~~L~~~~~~~~~i~f~~~~ll~~----D~~----~~~~~lv~~G--~lT~NE~Re~L~g~e~gdd~~l~p~n~~~~~ 420 (540) T protein:vir:41 351 VLTDFIQLKLDPGARFVFNEEILMES----EFV----HNYALLVQCG--VLTPSEVREKLFGLDGGPDMFMVPSSIGKSA 420 (540) T ss_pred HHHHhhhhccCCceEEEecchhhcch----HHH----HHHHHHHhCC--CCCHHHHHHHhCcCcCCCccccccccccccc Confidence 55332111223467788876432221 211 1133567777 99999999854 55543321 0 0000 Q ss_pred cCCC--CCCC----------CCcCCCCCCC Q lcl|NC_016762. 439 TEPE--DEDA----------ARTDPTGEQQ 456 (456) Q Consensus 439 ~~~~--d~~~----------~~~d~~~~~e 456 (456) .... +.+. ++.+|..+.+ T Consensus 421 ~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~ 450 (540) T protein:vir:41 421 MKRQKRNYEKNQINEIKRTYAKYKPRIQEI 450 (540) T ss_pred ccccccccCCCCccccccccchhcccccCc Confidence 0000 0000 0001100000 No 97 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=99.41 E-value=3.5e-13 Score=88.88 Aligned_cols=369 Identities=13% Similarity=0.050 Sum_probs=165.0 Q ss_pred HHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHH Q lcl|NC_016762. 22 MSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNK 101 (456) Q Consensus 22 d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~ 101 (456) |||.+.+... +.+.....++...+. ......|..+..+.++|+++|+++-.--+.+...++... ....+.+.++ T Consensus 1 Mgl~d~~~~~---~~~~~~~~~~~~~~~-~~~~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~~~--~~~~~~~lL~ 74 (395) T protein:vir:96 1 MGILDFFSFK---KSGTLSDDDSGSTTS-EKLTNVVLKEDALYKCVNYLARIISKSTFRIKAPEKLTE--NQKDWLYWIN 74 (395) T ss_pred CcchhhhcCC---CCccccccccccchh-hhcchhhhhhHHHHHHHHHHHHhhccceeEEEeCCcccc--ccchHHHHHh Confidence 6777654222 111111111112111 123445667788899999999998777677754432211 1111211121 Q ss_pred HHHHHhhHHHHHHHHHHhhc-ccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccccccCCceeE Q lcl|NC_016762. 102 PLIAGGRFWRAVSEADRRRL-VGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMW 180 (456) Q Consensus 102 ~~~~~l~~~~~~~ea~~~~r-~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y 180 (456) ..=...--+..|.+.+-+.+ ++|.+.+++ ++|... ++.+.|.. . ....|+ .++ T Consensus 75 ~~PN~~~t~~~f~~~l~~~lll~Gna~~~~-~~~~~~-------------~~~~~~~~-----~---~~~~~~----~~~ 128 (395) T protein:vir:96 75 TKANPNQSASQFWVEVVQKLLVDGETLIFV-IPGKGI-------------YVADAFTQ-----D---KKLSGN----KFK 128 (395) T ss_pred hcCCCCCCHHHHHHHHHHHHhhcCceEEEE-EcCCce-------------ecCCcccc-----c---cccccc----eee Confidence 11001112334444444444 456665554 333211 11111110 0 000111 111 Q ss_pred EEeecccCCccccceeeehhhhheecCCcCC----CcchHHH---HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhc Q lcl|NC_016762. 181 EYTEASQAGRPGLVRDIHPDRVFILGDWTGD----AIGFLEP---AYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEI 253 (456) Q Consensus 181 ~i~~~~~~g~~~~~~~IH~SRli~~~~~~~~----G~S~le~---~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 253 (456) .+. .++ ..-...+.++.|+||...... +.++++. +....+..... ..+..+. .++.+.. T Consensus 129 ~v~---~~~-~~~~~~~~~~dvih~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~--~~~~~~~--------~~~~~~~ 194 (395) T protein:vir:96 129 VSR---VQG-QTYEKIFTFDQVIYLKNDNSDLMLKVESLWEEYGELLGHVINNQKI--ANQIRFT--------MTPPKDK 194 (395) T ss_pred eee---ecc-ceeeeEeccCceEEecccCCccccccccccchHHHHHHHHHHHHHH--HHHHHHH--------hhhcccc Confidence 121 011 111244667778777432221 2222221 11111111100 0000000 1110000 Q ss_pred -cHhhHHhhhcCCHHHHHHHHHHH-HHHHhcCCCeE-EecCCCceeEEecccCCHHH--------HHHHHHHHHHhhhcC Q lcl|NC_016762. 254 -NLGEIASTYGVTLDALNERFNEA-ARQLNRGNDVL-LPTQGATVTQMVSAVSDPGP--------TYNVNLQTAAAGVDI 322 (456) Q Consensus 254 -~~~~l~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~-lid~~d~~~~~~~~~sgl~~--------~~~~~~~~~aaas~I 322 (456) .........+....+..+++.+. ....+.+...+ ++..+-+|+.++.+..+..- +.....++||.+-|| T Consensus 195 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgV 274 (395) T protein:vir:96 195 VRERAQENSDGGRQPKSDKDFFKRTIEKIRTESVVGIPVTANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGI 274 (395) T ss_pred cccceeeccCchhhHHHHHHHHHHHHHHhhcCCcceEEccCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCC Confidence 00000011111112222222222 23344444433 35556788888876654422 223345789999999 Q ss_pred CeEEeeccCCCcccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC--CCCceEEEeCCCCCCCHHHHHHHHH Q lcl|NC_016762. 323 PTKILVGMQTGERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP--LKAEFTAIWDDLTVPTKAERLANSK 399 (456) Q Consensus 323 P~t~L~G~sp~Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~--~~~d~~~~f~pL~~~seke~Aei~~ 399 (456) |..+| | |..++.+ -...||.. -|.|.++.+-+.|-+.-+.+ ...++.|.|+.|...+.+++++.. T Consensus 275 Pp~~l-~---~~~sn~e~~~~~f~~~-------~L~P~~~~ie~~l~~~Ll~~~e~~~~~~f~~~~l~~~d~~~~~~~~- 342 (395) T protein:vir:96 275 PISLL-H---GDIADNQKNYELLLEG-------PIESLITNIVDGLEYAIFDKSETLEGSFIKVTGLKNYDLFSISSQA- 342 (395) T ss_pred CHHHh-c---CCCccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCChhhhcCceeEeecchhccCHHHHHHHH- Confidence 99876 3 2222223 34566663 37888877766665443322 234577899999988888776654 Q ss_pred HHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCC---CCCCcCCCCCCC Q lcl|NC_016762. 400 TMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDE---DAARTDPTGEQQ 456 (456) Q Consensus 400 ~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~---~~~~~d~~~~~e 456 (456) +.+++.| +++++|+|+..+++|+++...+.-..+.+- +...+++..++| T Consensus 343 ------~~~~~~G--~~T~NE~R~~~gl~pi~~~~gD~~~~~~N~~~~~~~gge~~~~~~ 394 (395) T protein:vir:96 343 ------DKLISSG--FVFIDEVREEIGLPELPDGLGKVLYMTKNYESVLERGGEVDEEVE 394 (395) T ss_pred ------HHHHhCC--CcCHHHHHHHhCCCCCCCCCCceeeecccceechhccCCCCCCCC Confidence 4566777 999999999999999866222211111111 111222223333 No 98 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=99.40 E-value=6.7e-13 Score=87.32 Aligned_cols=415 Identities=12% Similarity=0.053 Sum_probs=186.9 Q ss_pred CCchhH------HHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHH--hcCchhhhhhccchh Q lcl|NC_016762. 1 MTDKLD------LAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMY--RRGGIAHGAVEKIVT 72 (456) Q Consensus 1 ~~~~~~------~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y--~~~~l~r~iVd~~ae 72 (456) ||+--. |...|..+..+-+..+.|.+.--.+ . ..++.. ..++...+ ..+..+++|||..++ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i----~------~~~~~~-~~~~~~~~~k~~~n~~~~ivd~~~~ 69 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPL----P------ELTRNT-SAAWRSFQREARTNWGLMVRDSVAD 69 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc----h------hcCccc-ChhhhhhhhhhhcchHHHHHHHHHh Confidence 665432 2222222222223334443311100 0 011111 12232222 235677999999999 Q ss_pred HHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcc---cccc----- Q lcl|NC_016762. 73 TCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWD---RPAR----- 143 (456) Q Consensus 73 d~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~---~Pl~----- 143 (456) =++-++|++...++.+.. ..+.+.+++.++-....++.+....||.|++++..+ +|.+.- .|.. T Consensus 70 ~l~~~~~~~~~~~d~~~~-------~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~ 142 (456) T protein:vir:10 70 RIIPNGITVGGSADSDLA-------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSV 142 (456) T ss_pred hhccCCeecCCCCCcchH-------HHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEEE Confidence 999999987433322211 235566777777788888889999999998877664 333321 1221 Q ss_pred -CCc-CceeEEEEeccccC-Ch--hhhhccccccccCCceeEEEeec-----ccCCccc-cceeeehhhhheecC-CcCC Q lcl|NC_016762. 144 -GKL-NGLAKVTPAWAGCL-KP--KSFDEKPDSETYGQPTMWEYTEA-----SQAGRPG-LVRDIHPDRVFILGD-WTGD 211 (456) Q Consensus 144 -~~~-~~l~~i~~~~~~~~-~~--~~~~~Dp~s~~yg~P~~y~i~~~-----~~~g~~~-~~~~IH~SRli~~~~-~~~~ 211 (456) ... .-+....-+|...- .+ .....+.....|+.+........ ..++... ....-|...++.+.. .+.+ T Consensus 143 d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~ 222 (456) T protein:vir:10 143 DPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPD 222 (456) T ss_pred cCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCCC Confidence 000 01111111121000 00 00000001111111111000000 0001000 001122223322221 2457 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhh-HHhhhcCCHHHHHHHHHHHHHHHhcCCC-eEEe Q lcl|NC_016762. 212 AIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGE-IASTYGVTLDALNERFNEAARQLNRGND-VLLP 289 (456) Q Consensus 212 G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~li 289 (456) |.|.++++.+-+.+++++.-......--.+..++.+... +... ..+..+.. .. ....+....+ +..+ T Consensus 223 g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~---~~~~~~~d~~g~~----~~----~~~~~~~~~~~~~~~ 291 (456) T protein:vir:10 223 GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKST---EHGLPNVDENGNA----ID----YASIFEAAPGALWEL 291 (456) T ss_pred CCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhcc---Ccccccccccccc----cc----hhhhhhhhccccccC Confidence 899999988776666655432211111112222222211 0000 00111110 00 1111222333 2334 Q ss_pred cCCCceeEE-ecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH---HHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_016762. 290 TQGATVTQM-VSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE---DQKYHNARCQARRVQELTFEINDLFA 365 (456) Q Consensus 290 d~~d~~~~~-~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~---D~~nyyd~I~~~Qe~~lrp~L~~l~~ 365 (456) +.+.++.++ .+++.+..+.++...+++|+.+++|...|-|.+ +..+|.. -....-..+..+|+ .+++.|++++. T Consensus 292 ~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~-~N~Sg~Ai~~~~~~l~~k~~~~~~-~f~~~l~~~~r 369 (456) T protein:vir:10 292 PPGVDIWESQANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS-ANQSAEGAHNIEKGFLFKCEDRLS-IAKIGLEAILV 369 (456) T ss_pred CCCcceEEecccChhHHHHHHHHHHHHHHhccCCChHHhcccc-cChHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Confidence 555566554 446678888899999999999999988776654 2222221 13345556665554 68999999999 Q ss_pred HHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCC----CCCCcccCC Q lcl|NC_016762. 366 HLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQG----GDPLPDTEP 441 (456) Q Consensus 366 ~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~----~~~~~~~~~ 441 (456) +++.....+....+++.|.|...+|..|.|+...|.. ++| +++..-++++++.++..- -+...++.. T Consensus 370 l~~~~~g~~~~~~~~v~w~~~~~~~~~~~ada~~kl~-------~~g--i~~~~~~~~~lg~~~~~i~~~e~er~~~e~~ 440 (456) T protein:vir:10 370 KALQIEGESVEDTVDVSFESPDRVTLGEKYSAASLAK-------AAG--ESWASIRRNILNYNADQIKQDDLDRAREQIT 440 (456) T ss_pred HHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHH-------HcC--CChHHHHHhhCCCCHHHHHHHHHHHHHHHHH Confidence 8876654444458999999999999888877655533 334 333322233322211000 000000000 Q ss_pred --CCCCCCCcCCCCCC Q lcl|NC_016762. 442 --EDEDAARTDPTGEQ 455 (456) Q Consensus 442 --~d~~~~~~d~~~~~ 455 (456) +..-...++|.|.. T Consensus 441 ~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:10 441 LFAGNPVQRPQEDGSR 456 (456) T ss_pred HHhhhhhhcCCCCCCC Confidence 00111112222222 No 99 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=99.40 E-value=6.7e-13 Score=87.32 Aligned_cols=415 Identities=12% Similarity=0.053 Sum_probs=186.9 Q ss_pred CCchhH------HHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHH--hcCchhhhhhccchh Q lcl|NC_016762. 1 MTDKLD------LAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMY--RRGGIAHGAVEKIVT 72 (456) Q Consensus 1 ~~~~~~------~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y--~~~~l~r~iVd~~ae 72 (456) ||+--. |...|..+..+-+..+.|.+.--.+ . ..++.. ..++...+ ..+..+++|||..++ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i----~------~~~~~~-~~~~~~~~~k~~~n~~~~ivd~~~~ 69 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPL----P------ELTRNT-SAAWRSFQREARTNWGLMVRDSVAD 69 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc----h------hcCccc-ChhhhhhhhhhhcchHHHHHHHHHh Confidence 665432 2222222222223334443311100 0 011111 12232222 235677999999999 Q ss_pred HHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcc---cccc----- Q lcl|NC_016762. 73 TCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWD---RPAR----- 143 (456) Q Consensus 73 d~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~---~Pl~----- 143 (456) =++-++|++...++.+.. ..+.+.+++.++-....++.+....||.|++++..+ +|.+.- .|.. T Consensus 70 ~l~~~~~~~~~~~d~~~~-------~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~ 142 (456) T protein:vir:10 70 RIIPNGITVGGSADSDLA-------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSV 142 (456) T ss_pred hhccCCeecCCCCCcchH-------HHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEEE Confidence 999999987433322211 235566777777788888889999999998877664 333321 1221 Q ss_pred -CCc-CceeEEEEeccccC-Ch--hhhhccccccccCCceeEEEeec-----ccCCccc-cceeeehhhhheecC-CcCC Q lcl|NC_016762. 144 -GKL-NGLAKVTPAWAGCL-KP--KSFDEKPDSETYGQPTMWEYTEA-----SQAGRPG-LVRDIHPDRVFILGD-WTGD 211 (456) Q Consensus 144 -~~~-~~l~~i~~~~~~~~-~~--~~~~~Dp~s~~yg~P~~y~i~~~-----~~~g~~~-~~~~IH~SRli~~~~-~~~~ 211 (456) ... .-+....-+|...- .+ .....+.....|+.+........ ..++... ....-|...++.+.. .+.+ T Consensus 143 d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~ 222 (456) T protein:vir:10 143 DPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPD 222 (456) T ss_pred cCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCCC Confidence 000 01111111121000 00 00000001111111111000000 0001000 001122223322221 2457 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhh-HHhhhcCCHHHHHHHHHHHHHHHhcCCC-eEEe Q lcl|NC_016762. 212 AIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGE-IASTYGVTLDALNERFNEAARQLNRGND-VLLP 289 (456) Q Consensus 212 G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~li 289 (456) |.|.++++.+-+.+++++.-......--.+..++.+... +... ..+..+.. .. ....+....+ +..+ T Consensus 223 g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~---~~~~~~~d~~g~~----~~----~~~~~~~~~~~~~~~ 291 (456) T protein:vir:10 223 GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKST---EHGLPNVDENGNA----ID----YASIFEAAPGALWEL 291 (456) T ss_pred CCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhcc---Ccccccccccccc----cc----hhhhhhhhccccccC Confidence 899999988776666655432211111112222222211 0000 00111110 00 1111222333 2334 Q ss_pred cCCCceeEE-ecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH---HHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_016762. 290 TQGATVTQM-VSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE---DQKYHNARCQARRVQELTFEINDLFA 365 (456) Q Consensus 290 d~~d~~~~~-~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~---D~~nyyd~I~~~Qe~~lrp~L~~l~~ 365 (456) +.+.++.++ .+++.+..+.++...+++|+.+++|...|-|.+ +..+|.. -....-..+..+|+ .+++.|++++. T Consensus 292 ~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~-~N~Sg~Ai~~~~~~l~~k~~~~~~-~f~~~l~~~~r 369 (456) T protein:vir:10 292 PPGVDIWESQANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS-ANQSAEGAHNIEKGFLFKCEDRLS-IAKIGLEAILV 369 (456) T ss_pred CCCcceEEecccChhHHHHHHHHHHHHHHhccCCChHHhcccc-cChHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Confidence 555566554 446678888899999999999999988776654 2222221 13345556665554 68999999999 Q ss_pred HHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCC----CCCCcccCC Q lcl|NC_016762. 366 HLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQG----GDPLPDTEP 441 (456) Q Consensus 366 ~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~----~~~~~~~~~ 441 (456) +++.....+....+++.|.|...+|..|.|+...|.. ++| +++..-++++++.++..- -+...++.. T Consensus 370 l~~~~~g~~~~~~~~v~w~~~~~~~~~~~ada~~kl~-------~~g--i~~~~~~~~~lg~~~~~i~~~e~er~~~e~~ 440 (456) T protein:vir:10 370 KALQIEGESVEDTVDVSFESPDRVTLGEKYSAASLAK-------AAG--ESWASIRRNILNYNADQIKQDDLDRAREQIT 440 (456) T ss_pred HHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHH-------HcC--CChHHHHHhhCCCCHHHHHHHHHHHHHHHHH Confidence 8876654444458999999999999888877655533 334 333322233322211000 000000000 Q ss_pred --CCCCCCCcCCCCCC Q lcl|NC_016762. 442 --EDEDAARTDPTGEQ 455 (456) Q Consensus 442 --~d~~~~~~d~~~~~ 455 (456) +..-...++|.|.. T Consensus 441 ~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:10 441 LFAGNPVQRPQEDGSR 456 (456) T ss_pred HHhhhhhhcCCCCCCC Confidence 00111112222222 No 100 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=99.40 E-value=2e-13 Score=90.19 Aligned_cols=343 Identities=15% Similarity=0.150 Sum_probs=166.6 Q ss_pred HHHhhhhhccCcc-cchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhH--HHHH Q lcl|NC_016762. 22 MSLLNQGIGHDAK-RPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDET--EWER 98 (456) Q Consensus 22 d~~~n~~~~~gt~-~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~--~~e~ 98 (456) |+|.+-..+...+ ++.. -....++..-...| .+..+.+||+++|++.-.--+.+....+.+...... .... T Consensus 1 Mg~f~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~-~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~~~~~~~~ 74 (378) T protein:vir:16 1 MNLFGKVVSFSRGKLNND-----TQRVTAWQNEAVEY-TSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGS 74 (378) T ss_pred CccchhhhhhhcccccCC-----cceeeecccchhhH-HHHHHHHHHHHHHhhhhhCceeEEEEcccccccccccccccc Confidence 5555433222111 1100 00111111111233 356789999999999977666654322221111100 0001 Q ss_pred HHHHHHH----HhhHHHHHHHHHHhhc-ccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccccc Q lcl|NC_016762. 99 KNKPLIA----GGRFWRAVSEADRRRL-VGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSET 173 (456) Q Consensus 99 ~i~~~~~----~l~~~~~~~ea~~~~r-~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~ 173 (456) .+..+++ ..--...|.+.+-+.+ ++|.+++++.- |+. .+.+.++.|.+ T Consensus 75 ~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~-d~~---------~g~~~~l~~~~----------------- 127 (378) T protein:vir:16 75 DLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVF-DDN---------TGELLDLLFAD----------------- 127 (378) T ss_pred hHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEe-ecC---------CceEEEEEecC----------------- Confidence 1333322 1112334555545444 46767665533 332 12233333221 Q ss_pred cCCceeEEEeecccCCccccceeeehhhhheecC--CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhh Q lcl|NC_016762. 174 YGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD--WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDK 251 (456) Q Consensus 174 yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~--~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 251 (456) .++.+.++.+|||.. +...|.|.++.+...+... ++++.....++... T Consensus 128 -------------------~~~~~~~~diih~r~~~~~~~~~s~l~~~~~~i~~~-----------~~~~~~~g~l~~~~ 177 (378) T protein:vir:16 128 -------------------DKKEYKPEELVRLTSPFYINEDTSILDNALASIQTK-----------LEQGKLRGLLKINA 177 (378) T ss_pred -------------------CeeEecccceEEecCccCccchhHHHHHHHHHHHHH-----------HhcCccceeeEeCC Confidence 023355566666642 1123556565555433211 11111000011000 Q ss_pred hccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC---CeEEecCCCceeEEecccCCHH-HHHHHHHHHHHhhhcCCeEEe Q lcl|NC_016762. 252 EINLGEIASTYGVTLDALNERFNEAARQLNRGN---DVLLPTQGATVTQMVSAVSDPG-PTYNVNLQTAAAGVDIPTKIL 327 (456) Q Consensus 252 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~lid~~d~~~~~~~~~sgl~-~~~~~~~~~~aaas~IP~t~L 327 (456) . +. .....+..+++.+.++.++.+. +.++++.+.+|++++.+....+ ........+||.+.|||..+| T Consensus 178 ~-----l~---~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l 249 (378) T protein:vir:16 178 F-----LD---IDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENIL 249 (378) T ss_pred c-----CC---HHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHh Confidence 0 00 0112344555555555544332 4577777889999887765433 223455678999999999877 Q ss_pred eccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC----------CCCceEEEeCCCCCCCHHHHHHH Q lcl|NC_016762. 328 VGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP----------LKAEFTAIWDDLTVPTKAERLAN 397 (456) Q Consensus 328 ~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~----------~~~d~~~~f~pL~~~seke~Aei 397 (456) -|. ++.+...+||.. -|.|.++.+-+.|-+.-+-+ ...++.|+++.|...|.+++++. T Consensus 250 ~g~-----~~e~~~~~f~~~-------tl~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~ 317 (378) T protein:vir:16 250 LGT-----ASQEQQIYFYNS-------TIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDL 317 (378) T ss_pred cCC-----chHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeeccchhhhcCHHHHHHH Confidence 431 222334556543 47888777766554332211 12357788899999998887554 Q ss_pred HHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc---ccC----CCCCCCCCcC----CCCCCC Q lcl|NC_016762. 398 SKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP---DTE----PEDEDAARTD----PTGEQQ 456 (456) Q Consensus 398 ~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~---~~~----~~d~~~~~~d----~~~~~e 456 (456) ...++..| +++++|+|+..+++|+++++..- .-. ..+.+....+ ....+| T Consensus 318 -------~~~~~~~G--~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~ne 378 (378) T protein:vir:16 318 -------YHENINGP--IFTQNQLLVKMGEQPIEGGDVYIANLNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred -------HHHHHhCC--CcCHHHHHHHhCCCCCCCCCeEeeccccccccchhhhcCccCCCCCCCCCCCC Confidence 45677777 99999999999999997755321 111 1111111111 112222 No 101 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=99.39 E-value=4.3e-13 Score=88.36 Aligned_cols=355 Identities=14% Similarity=0.112 Sum_probs=176.7 Q ss_pred HHHhhhhhcc---------------Ccc--------cch--------hh----hhccCcccC---CH-HHHHHHHhcCch Q lcl|NC_016762. 22 MSLLNQGIGH---------------DAK--------RPQ--------AW----CEYGFPQEI---TF-NDLYTMYRRGGI 62 (456) Q Consensus 22 d~~~n~~~~~---------------gt~--------~~~--------~~----~~~~~~~~~---~~-~~l~~~Y~~~~l 62 (456) ++|-+...|+ +++ ++. .+ .-.||+... +. .-=...|..+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t~~~~~~~~~ 80 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQDKLRTLIDV 80 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceeeccCCCcchhhhhcccccccccccccccccccCccccchhhHhhhHH Confidence 3443333332 000 000 00 000111100 00 001233556788 Q ss_pred hhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccc Q lcl|NC_016762. 63 AHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPA 142 (456) Q Consensus 63 ~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl 142 (456) +.+||+++++++-.-=+.+..+.+..+ .....+...=..+.-+..|.+++-+..+.|++++++...+.. T Consensus 81 v~acV~~Ia~~iA~lpl~~~~~~~~~~-----~~~~ll~~~PN~~~t~~~f~~~l~~~lllGnay~~~i~r~~~------ 149 (409) T protein:vir:83 81 AWACIDLNASVLSSMPIYRMRNGRIID-----SVAWMSNPDPEVYTSWQEFAKQLFWDFQLGEAFVLPMAHGSD------ 149 (409) T ss_pred HHHHHHHHHHhhccCceEEeeCCcccc-----chhhhcccCCCCCCCHHHHHHHHHHHHhhCCcEEEEEEECCC------ Confidence 999999999998765555543322111 111112111112223445556655666667777665443321 Q ss_pred cCCcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecC----CcCCCcchHHH Q lcl|NC_016762. 143 RGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD----WTGDAIGFLEP 218 (456) Q Consensus 143 ~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~----~~~~G~S~le~ 218 (456) +.+..+.|+....+++. .+.| |. .+|+|... .. ++.||||.. ....|.|-++. T Consensus 150 ----G~~~~L~pl~p~~v~v~-~~~~------g~-~~y~~~~~-----~~------~~eiiHir~~~~~~~~~G~spi~~ 206 (409) T protein:vir:83 150 ----GYPIRFRVVPPWLVNVE-LKKG------AR-REYRIGGL-----NV------TDEILHIRYQGNTADAHGHGPLES 206 (409) T ss_pred ----CcEEEEEEECCcceEEE-EcCC------ce-EEEEEccc-----cC------ccceEEeCCCCCCCCcccccHHHH Confidence 11233444443333321 1111 22 24566421 11 234444421 23458999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhhh-hhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhc-CCC-eEEecCCCce Q lcl|NC_016762. 219 AYNSFISLEKVEGGSGESFLKNAARQLLL-NFDKEINLGEIASTYGVTLDALNERFNEAARQLNR-GND-VLLPTQGATV 295 (456) Q Consensus 219 ~~~~l~~~~~~~~~~~~~~~~~~~~~l~~-~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~lid~~d~~ 295 (456) +...+... .....++..++++..+.-.+ +.... + . .+..+++.+.+..... +.+ .+++..+.++ T Consensus 207 ~~~~i~~~-~a~~~~~~~~f~nga~p~gil~~~~~-----l------s-~e~~~~~~~~~~~~~~~nag~~~il~~g~~~ 273 (409) T protein:vir:83 207 AAPRQVVI-GLLQKYVQNLAETGGVPLYWLGVERR-----L------S-ETEAVDLMDRWIESRSKYAGHPALVTGGATL 273 (409) T ss_pred HHHHHHHH-HHHHHHHHHHHhcCCCcceEeecCCC-----C------C-HHHHHHHHHHHHHhhCCccCccceecCCccc Confidence 88776543 34455566677765433222 11111 1 1 2222333333322222 222 3555555554 Q ss_pred -eEEecccCCH--HHHHHHHHHHHHhhhcCCeEEeeccCCC----cccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHH Q lcl|NC_016762. 296 -TQMVSAVSDP--GPTYNVNLQTAAAGVDIPTKILVGMQTG----ERASSE-DQKYHNARCQARRVQELTFEINDLFAHL 367 (456) Q Consensus 296 -~~~~~~~sgl--~~~~~~~~~~~aaas~IP~t~L~G~sp~----Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l 367 (456) +.++.+..++ -+........||.+.+||- .|+|.... +.+..+ -...||. .-|.|.++++-+.| T Consensus 274 ~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp-~llg~~~~~~~~tysn~eq~~~~f~~-------~tL~P~~~~ie~~l 345 (409) T protein:vir:83 274 NQAKSMSAQDLSLMELTQFNEARIAILLGVPP-FLVGLPGATGSLTYSNIEQLFSFHDR-------SSLRPKATAVMAAL 345 (409) T ss_pred ccccCCCHHHHHHHHHHHhhHHHHHHHhCCCH-HHccCCCCccccccccHHHHHHHHHH-------HHHHHHHHHHHHHH Confidence 4456554433 3334455778999999996 57785432 222223 3455654 34678887776666 Q ss_pred HHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCC Q lcl|NC_016762. 368 MRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEP 441 (456) Q Consensus 368 ~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~ 441 (456) -+. +.+....++|.+..|...+.+++ +++.+.+++.| ++++||+|+..+++|..+++..+...- T Consensus 346 ~~~-Ll~~~~~~~f~~~~llr~d~~~r-------~~~~~~~~~~G--~lT~NE~R~~~glpp~~ggd~l~~~gv 409 (409) T protein:vir:83 346 DRW-ALPSPQHLELNRDDYTRPSLVER-------ATAYKIMIEAG--VMEPNEARAMERLHSEAAAVRLSGGGV 409 (409) T ss_pred HHh-hCCCCcEEEeehhhhhccCHHHH-------HHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCcccCCCCC Confidence 543 33344456777777877777765 44567777888 999999999999988766655432211 No 102 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=99.38 E-value=1.4e-12 Score=85.61 Aligned_cols=374 Identities=10% Similarity=0.016 Sum_probs=175.8 Q ss_pred cCcccCCHHHHHHHHh--cCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhh Q lcl|NC_016762. 43 GFPQEITFNDLYTMYR--RGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRR 120 (456) Q Consensus 43 ~~~~~~~~~~l~~~Y~--~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~ 120 (456) +-|+. ..+++....+ ...++++|||..++=+.-+||+. .+.++. ..+.+.+++-++-....++.+.. T Consensus 1 ~l~~~-~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~~--~d~~~~--------~~~~~i~~~N~~d~~~~~~~~~a 69 (434) T protein:vir:98 1 MLPKN-AEQAFLDFQRKARTNFCGLIANASVHRLLALGVTG--PDGEPD--------TRASRWWQANRLDSRQKLVWRMA 69 (434) T ss_pred CCCCC-ccHHHHHhhhhhhccchHHHHHHHHhhhccCceec--CCCchH--------HHHHHHHHhcChhHHHHHHHHHH Confidence 22333 3345555543 34689999999999887788763 222211 23556677778888889999999 Q ss_pred cccCceEEEEEecCCCC-c----------ccccc------CCcCceeEEEEeccccCChhhhhcccccccc-CCceeEEE Q lcl|NC_016762. 121 LVGRYSGLLLHIRDSQP-W----------DRPAR------GKLNGLAKVTPAWAGCLKPKSFDEKPDSETY-GQPTMWEY 182 (456) Q Consensus 121 r~~Ggs~i~i~i~D~~~-~----------~~Pl~------~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~y-g~P~~y~i 182 (456) .+||.|++++..+.... . -.|.. ...+.+.....+|.....- ....--| +.-.++.. T Consensus 70 ~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~~-----~~~~~~~~~~~~~~~~ 144 (434) T protein:vir:98 70 MAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDIDG-----FGYARVFFDDTSFPYR 144 (434) T ss_pred hhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEeccCC-----ceEEEEEEeCcEEEEE Confidence 99999998887642111 1 01111 1111122222222211000 0000001 11111111 Q ss_pred eecccCC--cccc----------ceeeehh-hh--heecCC---cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_016762. 183 TEASQAG--RPGL----------VRDIHPD-RV--FILGDW---TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQ 244 (456) Q Consensus 183 ~~~~~~g--~~~~----------~~~IH~S-Rl--i~~~~~---~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~ 244 (456) ......+ .... ...-|+= +| +.|... ..+|.|.++++.+-+.+++++.-......--.+..+ T Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~ 224 (434) T protein:vir:98 145 TRERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGEDPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQ 224 (434) T ss_pred EeeccccccccccccceecccccccccCCCCccceEEeccCCCcCcCCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchh Confidence 0000000 0000 0011211 11 223221 146899999988877777776544332221122223 Q ss_pred hhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEec---ccCCHHHHHHHHHHHHHhhhc Q lcl|NC_016762. 245 LLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVS---AVSDPGPTYNVNLQTAAAGVD 321 (456) Q Consensus 245 l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~---~~sgl~~~~~~~~~~~aaas~ 321 (456) +.+.. .++....+. .+.. . ... +.+....+.+....+++.+..+. ++.+..+.+....+++|+.++ T Consensus 225 ~~i~G---~~~~~~~~~--~~~~--~-~~~---~~~~~~~~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~ 293 (434) T protein:vir:98 225 KWIKG---HKFAKRTDP--ATGM--T-VVD---QPFVPSPSAVWASEGENTQFGQLDATDLSGFLKEHASDVRDMLTISQ 293 (434) T ss_pred hhhcC---CCccccccc--cccc--c-hhh---hhhhccccccccCCCCCceEEEecCcchHHHHHHHHHHHHHHhcccC Confidence 22221 111111110 0000 0 000 11122233344444444444443 344555667777999999999 Q ss_pred CCeEEeeccCCCcccchH---HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC-CCceEEEeCCCCCCCHHHHHHH Q lcl|NC_016762. 322 IPTKILVGMQTGERASSE---DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL-KAEFTAIWDDLTVPTKAERLAN 397 (456) Q Consensus 322 IP~t~L~G~sp~Glnst~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~-~~d~~~~f~pL~~~seke~Aei 397 (456) +|...|-|. -+..+|.. -....-..+..+|+ .+++.|++++.+++.....+. ..++.+.|.|-..+|..+.|++ T Consensus 294 ~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~k~~-~f~~~l~~~~rl~~~~~g~~~~~~~~~v~w~~~~~~s~~~~ada 371 (434) T protein:vir:98 294 TPTYLYATD-LVNISADTIGALDILHVAKVREHIA-SFSEGLESVLALAAAQAGVPEDYTEAEVRWANPAHVTMAVKADA 371 (434) T ss_pred CCHHHhccc-cCChHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCChhheeeeEEecCCCCCCHHHHHHH Confidence 998755442 22222221 23455666666664 679999999988776654432 2468899999999999998887 Q ss_pred HHHHHHHH-H--H-HHHcCCcCcCHHHHHHHhcc----------cCCCCCCCCcccCCCCCCCCCcC-CCC Q lcl|NC_016762. 398 SKTMSEIN-S--A-AIGTGEPVFTAEEIREEAGY----------DPLQGGDPLPDTEPEDEDAARTD-PTG 453 (456) Q Consensus 398 ~~~~A~a~-~--~-~~~~g~~~i~~~E~R~~~~~----------~~~~~~~~~~~~~~~d~~~~~~d-~~~ 453 (456) ..|...+. . . +-..| ++++|++..... .....+.+ ++++++++++ ..| T Consensus 372 ~~kl~~~g~~~e~~~~~lg---~~~~e~~r~~~e~~~~~~~~~~~~~~~~~~-----~~g~~~~~~~~~dg 434 (434) T protein:vir:98 372 ATKLKSIGYPLDVIAEELD---ESPARVRRIVAGAASQALLAASLLPAPGAP-----SAGNVPDSGGAVDG 434 (434) T ss_pred HHHHHhcCCcHHHHHHhCC---CCHHHHHHHHHHHHHHHHHHHhhhccCCCC-----CCCCCCcccCCCCC Confidence 76655431 1 1 11122 345555322100 01111111 1111111111 111 No 103 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.37 E-value=1.7e-11 Score=79.67 Aligned_cols=417 Identities=10% Similarity=0.025 Sum_probs=198.6 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHH------------HHHHHHhcCchhhhhhc Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFN------------DLYTMYRRGGIAHGAVE 68 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~------------~l~~~Y~~~~l~r~iVd 68 (456) ++++... + +.+.+..+.+.-+. ++.|-..| ++...+.+ -...+|+.|++++++|+ T Consensus 11 ~sP~~~~------~--R~~ar~~~~~y~aa-~~~r~~~~----~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~ 77 (502) T protein:vir:79 11 FSPGWKA------A--RLRSRAVIQAYEAV-KTTRTHKA----RRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFD 77 (502) T ss_pred cChHHHH------H--HHhhHHHHhhcccc-CcccccCC----CCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Confidence 4443222 1 11222233222211 23332222 22222221 23668999999999999 Q ss_pred cchhHHhhC-CCEEecCCCcchhhhhHHHHHHHHHHHHH----------hhHHHHHHHHHHhhcccCceEEEEEecCCCC Q lcl|NC_016762. 69 KIVTTCWKT-NPQVIEGDDQDRSKDETEWERKNKPLIAG----------GRFWRAVSEADRRRLVGRYSGLLLHIRDSQP 137 (456) Q Consensus 69 ~~aed~tR~-~~~i~~~~~~d~~~~~~~~e~~i~~~~~~----------l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~ 137 (456) ....-.+=. |+.+...-..+......++.++|++++++ +.+-....-+++.-...|-+++.+....... T Consensus 78 ~~~~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~ 157 (502) T protein:vir:79 78 KLEERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINS 157 (502) T ss_pred HHHHhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCc Confidence 999999864 66664332223333334566777777762 2343333334444445566666554422111 Q ss_pred ccccccCCcCceeEEEEeccccCChhhhhcc------ccccccCCceeEEEeecccCCc-cccceeeehhhhheecC--- Q lcl|NC_016762. 138 WDRPARGKLNGLAKVTPAWAGCLKPKSFDEK------PDSETYGQPTMWEYTEASQAGR-PGLVRDIHPDRVFILGD--- 207 (456) Q Consensus 138 ~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~D------p~s~~yg~P~~y~i~~~~~~g~-~~~~~~IH~SRli~~~~--- 207 (456) ++.-+... + .|..+....|.. .++.. ..=-.+|+|..|+|....++.. ......|..++|+|+-. T Consensus 158 ~~~g~~~~---l-~lq~iepd~l~~-~~~~~~~i~~GVe~d~~Gr~~aY~i~~~hPgd~~~~~~~rvpA~~vlH~f~~~r 232 (502) T protein:vir:79 158 LTPSAGVH---F-WLEALEPDFIPM-TSDESNRLNQGVFVDDWGRPEKYLVYKSRPVSGRQMETKEVDAERMLHLKFVRR 232 (502) T ss_pred cCCCcccc---e-EEEEecchhcCC-CCCCCCeeEeeeEECCCCceEEEEEeecCCCCCcccceeEechhheEEeecccC Confidence 11111111 1 122222222211 11110 1123689999999975544432 22346799999988743 Q ss_pred -CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHH--HhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC Q lcl|NC_016762. 208 -WTGDAIGFLEPAYNSFISLEKVEGGSGESFLK--NAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGN 284 (456) Q Consensus 208 -~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~--~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 284 (456) ....|+|.+-++...+.+++.-.... ++=+ ++.-...++.. ..-.......+.+... ....+. - T Consensus 233 ~gQ~RGis~lapvl~~l~~l~~~~dae--l~~a~i~A~~~~fi~~~--~~~~~~~~~~~~~~~~-------~~~~l~--p 299 (502) T protein:vir:79 233 LHQMRGTSLLSGVLIRLSALKEYEDSE--LTAARIAAALGMYIRKG--DGQSYEPDGNGSKENE-------RELTIQ--P 299 (502) T ss_pred CccccCCchHHHHHHHHHHHhHHHHHH--HHHHHHhhhheeeeecC--CCcccccccCCCCCcc-------cccccc--C Confidence 23469999999999988877655332 2211 11111111100 0000000000110000 000111 1 Q ss_pred CeE--EecCCCceeEEec--ccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccch-HHHHHHHHHHHHHHHhhh--- Q lcl|NC_016762. 285 DVL--LPTQGATVTQMVS--AVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASS-EDQKYHNARCQARRVQEL--- 356 (456) Q Consensus 285 ~~~--lid~~d~~~~~~~--~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst-~D~~nyyd~I~~~Qe~~l--- 356 (456) ++. .+..+++++.++. +-++..+....++..||+..|||--.|.|--.+..+|. .-+..+...++..|+... T Consensus 300 G~i~~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~nySs~R~~~~e~~r~~~~~q~~~~~~~ 379 (502) T protein:vir:79 300 GIIYDDLKPGEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYNGTYSAQRQELVESTDGYLILQDWFIGAV 379 (502) T ss_pred CccccccCCCceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 332 2455788887765 45689999999999999999999999999854322232 356677888888887543 Q ss_pred -hHHHHHHHHHHHHhcCcCCCC------ceEEEe--CCCCCCCHHHHHHHHHHHHHHHHHHHHcCCc---------CcCH Q lcl|NC_016762. 357 -TFEINDLFAHLMRIGVVPLKA------EFTAIW--DDLTVPTKAERLANSKTMSEINSAAIGTGEP---------VFTA 418 (456) Q Consensus 357 -rp~L~~l~~~l~~s~~~~~~~------d~~~~f--~pL~~~seke~Aei~~~~A~a~~~~~~~g~~---------~i~~ 418 (456) +|+-+.+++..+..+.++.|. -....| +..-..+.. |.++|....+.+|.. =.++ T Consensus 380 ~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~-------Ke~~a~~~~i~~Gl~t~~~~~a~~G~D~ 452 (502) T protein:vir:79 380 TRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPV-------KEAEAWKIQIRGGAATESDWVRAGGRNP 452 (502) T ss_pred HHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChH-------HHHHHHHHHHHcCCCCHHHHHHHcCCCH Confidence 444444555555666665442 123344 222222332 344555555655510 1334 Q ss_pred HHHHHH-------hcccCCC-C----CCCCcccCC-CCCCCCCcCCCCCC Q lcl|NC_016762. 419 EEIREE-------AGYDPLQ-G----GDPLPDTEP-EDEDAARTDPTGEQ 455 (456) Q Consensus 419 ~E~R~~-------~~~~~~~-~----~~~~~~~~~-~d~~~~~~d~~~~~ 455 (456) +|+.+. ...-++. + ..+.....+ ..++++++++..|+ T Consensus 453 ~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 453 DDVKRRRKAEIDENRKLDLVFDTDPASDKGGSSAATKRQEPQHTDDQSEE 502 (502) T ss_pred HHHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 433221 1111111 0 111111111 12233333333333 No 104 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=99.37 E-value=4e-11 Score=77.60 Aligned_cols=410 Identities=11% Similarity=0.017 Sum_probs=180.5 Q ss_pred CCchhH--HHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCC Q lcl|NC_016762. 1 MTDKLD--LAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTN 78 (456) Q Consensus 1 ~~~~~~--~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~ 78 (456) |+.... |+..|.-+..+-+..+.|...--.+ + . .+..+ ..++......+.++++|||..++=+.-+| T Consensus 13 ~~~~~~~~L~~~~~~~~~r~~~~~~YY~G~~~i-----~---~--~~~~~-~~~~~~~~~~~n~~~~ivd~~~~~l~~~g 81 (485) T protein:vir:24 13 DPAIARDEMVSAFEDQNQNLRSNTSYYEAERRP-----E---A--IGVTV-PVQMQSLLAHVGYPRLYVDSIAERQAVEG 81 (485) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHhccCch-----h---h--cCccc-chhhhhhhhccchHHHHHHHHhhhhccCc Confidence 666643 6666655544444444443211000 0 0 11111 12333444556789999999998887788 Q ss_pred CEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCceeE-----EE Q lcl|NC_016762. 79 PQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLAK-----VT 153 (456) Q Consensus 79 ~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~-----i~ 153 (456) |++- +.++. .+.+.+.+++-++-....++.+...+||.|++++..+.......+-++.. .|.. +. T Consensus 82 ~~~~--~~~~~-------~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~-~i~~~~p~~~~ 151 (485) T protein:vir:24 82 FRLG--DADEA-------DEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVP-LIRVEPPTRMY 151 (485) T ss_pred eecC--CCchh-------HHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcc-eEEEeccceeE Confidence 8752 21111 12356666666777778889999999999998887643211100000000 0111 11 Q ss_pred EeccccCC-hh---h-h-hcc---ccccccCCce-eEEEeecccCCccc-cceeeehhhhheecC-------CcCCCcch Q lcl|NC_016762. 154 PAWAGCLK-PK---S-F-DEK---PDSETYGQPT-MWEYTEASQAGRPG-LVRDIHPDRVFILGD-------WTGDAIGF 215 (456) Q Consensus 154 ~~~~~~~~-~~---~-~-~~D---p~s~~yg~P~-~y~i~~~~~~g~~~-~~~~IH~SRli~~~~-------~~~~G~S~ 215 (456) |+|-.... +. . + ..+ ...-.++.+. .|++.. .+|... ....-|+--.+.+.+ ...+|.|. T Consensus 152 ~i~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~--~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~ 229 (485) T protein:vir:24 152 AEIDPRIGRPAKAIRVAYDAEGNEIQAATLYTPNETFGWFR--AEGEWVEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSE 229 (485) T ss_pred EEeeCCcCceeEEEEEEEeecCCeEEEEEEEcCCcEEEEEe--cCCceEeecccccCCCcccEEEeccCcccCCcCCccc Confidence 12210000 00 0 0 000 0111111111 222211 111110 011124333222221 12368888 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCc Q lcl|NC_016762. 216 LEP-AYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGAT 294 (456) Q Consensus 216 le~-~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~ 294 (456) ++. +..-+.+++++.-......-..+..++.+.. .+........+.+ ...+....+.+....+++ T Consensus 230 i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G---~~~~~~~~~~~~~-----------~~~~~~~~~~i~~~~~~~ 295 (485) T protein:vir:24 230 ITPELRSMTDAAARILMLMQATAELMGVPQRLIFG---IKPEEIGVDPETG-----------QTLFDAYLARILAFEDAE 295 (485) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhcc---CCccccccccccc-----------cchhhhcccceeccCCCC Confidence 764 4443444454433222211111222222211 0100000000000 011122223333333444 Q ss_pred eeEEecccCC---HHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----HHHHHHHHHHHhhhhHHHHHHHHH Q lcl|NC_016762. 295 VTQMVSAVSD---PGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----YHNARCQARRVQELTFEINDLFAH 366 (456) Q Consensus 295 ~~~~~~~~sg---l~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----nyyd~I~~~Qe~~lrp~L~~l~~~ 366 (456) .+..+.+-++ ..+.+.....++|+.+++|..-| |.++.. +++++ ++ ..-..++ .++..+++.|++++.+ T Consensus 296 ~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~f-g~~~~n-~~Sg~Al~~~~~~l~~ka~-~~~~~f~~~l~~~~~l 372 (485) T protein:vir:24 296 GKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYL-STAADN-PASAEAIRAAESRLIKKVE-RKNAIFGGAWEEAMRL 372 (485) T ss_pred ceEEeecccchHHHHHHHHHHHHHHhcccCCCHHHh-ccccCc-chHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH Confidence 4444444444 45556666888999999998755 544332 13333 33 2233334 4445689999999998 Q ss_pred HHHhc-CcC---CCCceEEEeCCCCCCCHHHHHHHHHHHHHHH------HHHH-HcCCcCcCHHHH---HHHhcc----- Q lcl|NC_016762. 367 LMRIG-VVP---LKAEFTAIWDDLTVPTKAERLANSKTMSEIN------SAAI-GTGEPVFTAEEI---REEAGY----- 427 (456) Q Consensus 367 l~~s~-~~~---~~~d~~~~f~pL~~~seke~Aei~~~~A~a~------~~~~-~~g~~~i~~~E~---R~~~~~----- 427 (456) ++... ... ...++++.|.|-..+|.++.|+...+.+++. .++. ..| ++++++ ++..+. T Consensus 373 ~~~~~~~~~~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~---~~~d~~~e~~~~~ee~~~~~ 449 (485) T protein:vir:24 373 AYRLMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMG---YSIAEREEMRRWDEEEAAMG 449 (485) T ss_pred HHHHhcCCCCccccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCC---CCHhHHHHHHHHHHHHhhhh Confidence 76542 221 2247899999999999999999877766532 1111 222 334332 211100 Q ss_pred ----cCCC------CCCCCcccCCCCCCCCCcCCCC Q lcl|NC_016762. 428 ----DPLQ------GGDPLPDTEPEDEDAARTDPTG 453 (456) Q Consensus 428 ----~~~~------~~~~~~~~~~~d~~~~~~d~~~ 453 (456) +.+. .+.+.+.+.+.++..+.++++| T Consensus 450 ~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~a 485 (485) T protein:vir:24 450 LGLLGTMVDADPTVPGSPNPTPAPKPQPAIEGGDSA 485 (485) T ss_pred hhHHHhhcccCCCCCCCCCCCCCCCCccCCCCCCCC Confidence 1111 1111111111222223334444 No 105 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=99.35 E-value=1.8e-12 Score=84.95 Aligned_cols=410 Identities=12% Similarity=0.061 Sum_probs=184.4 Q ss_pred CCchhHH-HHh-----HHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHH--HhcCchhhhhhccchh Q lcl|NC_016762. 1 MTDKLDL-AVN-----HAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTM--YRRGGIAHGAVEKIVT 72 (456) Q Consensus 1 ~~~~~~~-~~~-----~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~--Y~~~~l~r~iVd~~ae 72 (456) |++.-.. .++ |.-+...-+....|... +.+-. . .+... ..++... ...+..+++|||..++ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g------~~~i~--~--~~~~~-~~~~~~~~~~~~~n~~~~ivd~~~~ 69 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNG------DAPLP--E--LTRNT-SAAWRSFQREARTNWGLMVRDSVAD 69 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhc------cCChh--h--cCccc-ChhhchhhhhhhcchHHHHHHHHHh Confidence 4443221 111 22111111222333221 11000 0 01111 1122211 1234577999999999 Q ss_pred HHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcc---cccc----- Q lcl|NC_016762. 73 TCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWD---RPAR----- 143 (456) Q Consensus 73 d~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~---~Pl~----- 143 (456) -++-++|++...++.+.. ..+.+.+++.++-....++.+....||.|++++..+ ||...- .|.. T Consensus 70 ~l~~~g~~~~~~~d~~~~-------~~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~~i~~~~p~~~~~i~ 142 (456) T protein:vir:79 70 RIIPNGITVGGSADSDLA-------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSV 142 (456) T ss_pred hhccCCeecCCCCCccHH-------HHHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEeccceeEEEE Confidence 999999987543322221 235666777677778888999999999998887764 343321 2221 Q ss_pred -C-CcCceeEEEEeccccCChhhhhcccc-ccccCCceeEEE---eec----------ccCCccccceeeeh-hhhheec Q lcl|NC_016762. 144 -G-KLNGLAKVTPAWAGCLKPKSFDEKPD-SETYGQPTMWEY---TEA----------SQAGRPGLVRDIHP-DRVFILG 206 (456) Q Consensus 144 -~-~~~~l~~i~~~~~~~~~~~~~~~Dp~-s~~yg~P~~y~i---~~~----------~~~g~~~~~~~IH~-SRli~~~ 206 (456) . ..+.+....-+|.. . +..+. ...|..+..|.+ ... ..++.......+|| --++.+. T Consensus 143 d~~~~~~~~~~~~~~~~---~---d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv 216 (456) T protein:vir:79 143 DPLQPWRIRSAMRWWRD---L---DAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVV 216 (456) T ss_pred cCCCCCceEEEEEEEEe---c---CCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCceeEE Confidence 0 11112112112210 0 00000 001111111111 000 00000000011222 1112221 Q ss_pred C-CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCC Q lcl|NC_016762. 207 D-WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGND 285 (456) Q Consensus 207 ~-~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 285 (456) . .+..|.|.++++.+-+..++++.-......-..+..++.+.... ..+ ...+..|.. . .....+....+ T Consensus 217 ~~~N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~-~~~-~~~d~~g~~----i----~~~~~~~~~~~ 286 (456) T protein:vir:79 217 VYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSE-HRL-PKVDENGNA----I----DYASIFEAAPG 286 (456) T ss_pred EecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCC-ccc-ccccccccc----c----chhhhhhhhcc Confidence 1 24568899998877666666553221111111112222221110 000 000111110 0 01112222333 Q ss_pred e-EEecCCCceeEE-ecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH---HHHHHHHHHHHHHHhhhhHHH Q lcl|NC_016762. 286 V-LLPTQGATVTQM-VSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE---DQKYHNARCQARRVQELTFEI 360 (456) Q Consensus 286 ~-~lid~~d~~~~~-~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~---D~~nyyd~I~~~Qe~~lrp~L 360 (456) . ..++.+.++.++ ++++.+..+.++....++|+.+++|...|-|.+. ..+|.. -....-..++.+| ..+++.| T Consensus 287 ~~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~-N~Sg~Al~~~~~~l~~k~~~~~-~~f~~~l 364 (456) T protein:vir:79 287 ALWELPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQSAEGAHNIEKGFLFKCEDRL-SIAKIGL 364 (456) T ss_pred ccccCCCCcceeeecccChHHHHHHHHHHHHHHHhhcCCChhHhccccc-CcHHHHHHHHHHHHHHHHHHHH-HHHHHHH Confidence 3 334445555443 5677889999999999999999999987776542 223322 2345556666666 4789999 Q ss_pred HHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCC----CCCC Q lcl|NC_016762. 361 NDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQG----GDPL 436 (456) Q Consensus 361 ~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~----~~~~ 436 (456) ++++.+++.....+....+++.|.|...+|.++.|+...+...+ | +++..-+++.+++++..- -+.. T Consensus 365 ~~~~~l~~~~~g~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~-------G--~~~~~~~~~~lg~~~~~i~~~e~~r~ 435 (456) T protein:vir:79 365 EAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYSAASLAKAA-------G--ESWASIRRNILNYNADQIKQDDLDRA 435 (456) T ss_pred HHHHHHHHHhcCCCccccceEEeCCCCCcCHHHHHHHHHHHHhc-------C--CChHHHHHhcCCCCHHHHHHHHHHHH Confidence 99999887776555556899999999999998887776664432 3 222222222222211000 0000 Q ss_pred cccCC--CCCCCCCcCCCCCC Q lcl|NC_016762. 437 PDTEP--EDEDAARTDPTGEQ 455 (456) Q Consensus 437 ~~~~~--~d~~~~~~d~~~~~ 455 (456) .++.+ +..-...++|.+-- T Consensus 436 ~~e~~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:79 436 REQITLFAGNPVQRPQEDGSR 456 (456) T ss_pred HHHHHHHhhhHhhcCCCCCCC Confidence 00000 00111112221111 No 106 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=99.34 E-value=1.8e-12 Score=85.03 Aligned_cols=340 Identities=15% Similarity=0.137 Sum_probs=162.4 Q ss_pred HHHhhhhhcc---CcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhh--HHH Q lcl|NC_016762. 22 MSLLNQGIGH---DAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDE--TEW 96 (456) Q Consensus 22 d~~~n~~~~~---gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~--~~~ 96 (456) |++.|-..+. .+..........+... ..|. +..+.+||+.+|+++-.--+.+....+.+..... ... T Consensus 1 M~if~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~-~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~ 72 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDTQRVTAWQNEA-------VEYT-SAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMA 72 (378) T ss_pred CchhHHhHhhhhcccccCcceeeeeecch-------hhhh-hHHHHHHHHHHHHhHhhCceeeeeecccccccccccccc Confidence 4444422111 1111111111111111 1233 3467889999999997765554332222111110 000 Q ss_pred HHHHHHHHH----HhhHHHHHHHHHHhhc-ccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccc Q lcl|NC_016762. 97 ERKNKPLIA----GGRFWRAVSEADRRRL-VGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDS 171 (456) Q Consensus 97 e~~i~~~~~----~l~~~~~~~ea~~~~r-~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s 171 (456) ...+..++. ..--+..|.+.+-+.+ +.|.++++....| ..+.+.+..++. T Consensus 73 ~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~----------~~g~~~~~~~~~--------------- 127 (378) T protein:vir:94 73 GSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDS----------ETGELLDLLFAN--------------- 127 (378) T ss_pred cchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeC----------CCCcEEEEEEec--------------- Confidence 011222222 1113334555544444 5566766533221 111222221110 Q ss_pred cccCCceeEEEeecccCCccccceeeehhhhheecCC--cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHh-hhhhhhh Q lcl|NC_016762. 172 ETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW--TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNA-ARQLLLN 248 (456) Q Consensus 172 ~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~--~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~-~~~l~~~ 248 (456) .++.+.++.|+|+... ...+.+.++.....+... . +++ .+. .++ T Consensus 128 ---------------------~~~~~~~~dvih~~~~~~~~~~~~~~~~~~~~~~~~---~--------~~~~~~g-~l~ 174 (378) T protein:vir:94 128 ---------------------DKKEYKPEELVRLTSPFYINEDTSILDNALASIQTK---L--------EQGKLRG-LLK 174 (378) T ss_pred ---------------------CcEEechhceeeecCcCCcccchhHHHHHHHHHHHH---H--------hhCCccc-cee Confidence 1244667777777422 223455555554433211 1 111 111 111 Q ss_pred hhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC---CeEEecCCCceeEEecccCCHH-HHHHHHHHHHHhhhcCCe Q lcl|NC_016762. 249 FDKEINLGEIASTYGVTLDALNERFNEAARQLNRGN---DVLLPTQGATVTQMVSAVSDPG-PTYNVNLQTAAAGVDIPT 324 (456) Q Consensus 249 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~lid~~d~~~~~~~~~sgl~-~~~~~~~~~~aaas~IP~ 324 (456) ....++ .....+..+++.+.++....+. +.++++.+.+|+.++.+...++ +-+.....+||.+.|||. T Consensus 175 ~~~~l~--------~~~~~~~~e~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgvPp 246 (378) T protein:vir:94 175 INAFLD--------IDNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNE 246 (378) T ss_pred eCCcCC--------HHHHHHHHHHHHHHHHHhhcccccccceeccCCceEEEccCChHHhhHHHHHHHHHHHHHHhCCCH Confidence 111110 0112344555555555444322 3577777889999988776553 223445678999999998 Q ss_pred EEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCc----------CCCCceEEEeCCCCCCCHHHH Q lcl|NC_016762. 325 KILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVV----------PLKAEFTAIWDDLTVPTKAER 394 (456) Q Consensus 325 t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~----------~~~~d~~~~f~pL~~~seke~ 394 (456) .+|.|.. +.+...+||.. -|.|.+.++-+-|-+.-+- ....++.|.+++|...|.+++ T Consensus 247 ~~l~g~~-----~e~~~~~f~~~-------tl~P~~~~ie~~l~~~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~ 314 (378) T protein:vir:94 247 NILLGTA-----TQEQQIYFYNS-------TIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKEL 314 (378) T ss_pred HHhcCCc-----hHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCChhHhhhhhhhcccceeEeecchhhhcCHHHH Confidence 7774321 11234456654 4788887766655432221 112357788899999998887 Q ss_pred HHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCc-------ccCCCCCCCCCcC----CCCCCC Q lcl|NC_016762. 395 LANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLP-------DTEPEDEDAARTD----PTGEQQ 456 (456) Q Consensus 395 Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~-------~~~~~d~~~~~~d----~~~~~e 456 (456) ++. ...+++.| ++++||+|+..+++|+++++..- .+...+.+....+ ++...| T Consensus 315 ~e~-------~~~~~~~G--~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 315 IDL-------YHENINGP--IFTQNQLLVKMGEQPIEGGDVYIANLNAVAVKNLSDLQGNRKDVTSTDETNNQ 378 (378) T ss_pred HHH-------HHHHHhCC--CcCHHHHHHHhCCCCCCCCCeeeecccccchhcchhcccccCCCCCCCCCCCC Confidence 665 45577777 99999999999999997754311 0111111111111 111112 No 107 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=99.29 E-value=4.4e-12 Score=82.83 Aligned_cols=371 Identities=11% Similarity=0.006 Sum_probs=160.2 Q ss_pred HHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHH Q lcl|NC_016762. 22 MSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNK 101 (456) Q Consensus 22 d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~ 101 (456) |||.+.+.+. +.......+....+ ..-....|..+..+.++|+++|+++-.--+.+...++.. .....+-..|. T Consensus 1 MGlf~~~~~~---~~~~~~~~~~~~~~-~~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~~--~~~~~~~~lL~ 74 (395) T protein:vir:98 1 MGILDFFSFK---KSGTLSDDDSGSTT-SEKLTNVVLKEDALYKCVNYLARIISKSTFRLKTPEKLT--ENQKDWLYWIN 74 (395) T ss_pred CcchhhhcCC---Ccccccccccchhh-hhhcchhhhhhHHHHHHHHHHHHHHhhCceeEEecCCcc--cccchHHHHHh Confidence 6776655322 22111111111111 122334455678899999999999988777775443221 11112222222 Q ss_pred HHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccccccCCceeE Q lcl|NC_016762. 102 PLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMW 180 (456) Q Consensus 102 ~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y 180 (456) ..=..+.-...|.+.+-+ -.++|-+++++. +++.-+ +-+-|.. .. ...++. ++ T Consensus 75 ~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~-~~~~~~-------------~~~~~~~---~~-----~~~~~~----~~ 128 (395) T protein:vir:98 75 TKANPNQSASQFWVEVIQKLLVDGETLIFVI-PGKGIY-------------VADSFTQ---DK-----KISGSQ----FK 128 (395) T ss_pred hcCCCCCCHHHHHHHHHHHHhhcCceEEEEE-eCCcee-------------cCCcccc---cc-----cccCcc----cc Confidence 110111223344444444 445676766543 332111 0000100 00 001111 11 Q ss_pred EEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHHHHHHHHH-HHHHHHHHHHH-hhhhhhhhhhhhcc Q lcl|NC_016762. 181 EYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNSFISLEKV-EGGSGESFLKN-AARQLLLNFDKEIN 254 (456) Q Consensus 181 ~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~l~~~~~~-~~~~~~~~~~~-~~~~l~~~~~~~~~ 254 (456) .+.. .+. .-.+.+-++.|+||.... ..+.++++..-..+...... ....+..+..+ ....+.+.. T Consensus 129 ~~~~---~~~-~~~~~~~~~evih~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 199 (395) T protein:vir:98 129 VSRV---QGQ-TYEKTFTFDQVIYLKNDNSDLMSKVESLWEEYGELLGHVINNQKIANQIRFTMIPPKDKVRERA----- 199 (395) T ss_pred eeee---cCc-eeeeEecCccEEEecCCCCCccccccchhhhHHHHHHHHHHHHHHHHHHHHhhccccccccccc----- Confidence 1110 000 001122234455553222 12333333222211110000 00000001100 000000000 Q ss_pred HhhHHhhhcCCH-HHHHHHHHHHHHHHhcCCC-eEEecCCCceeEEecccCC--------HHHHHHHHHHHHHhhhcCCe Q lcl|NC_016762. 255 LGEIASTYGVTL-DALNERFNEAARQLNRGND-VLLPTQGATVTQMVSAVSD--------PGPTYNVNLQTAAAGVDIPT 324 (456) Q Consensus 255 ~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~-~~lid~~d~~~~~~~~~sg--------l~~~~~~~~~~~aaas~IP~ 324 (456) .....+... +...+.+.......+.+-. .+.+..+-+|+.++.+... +-++......+||.+-|||. T Consensus 200 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~ 276 (395) T protein:vir:98 200 ---QENSDGGRQSKSDKDFFKRTVEKIRTESVVGIPVTANTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPI 276 (395) T ss_pred ---cccCCcHHHHHHHHHHHHHHHhhhhcCCcceeecCCCceeEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCH Confidence 000000001 1122222222333333333 3334556688888755432 23344445678999999999 Q ss_pred EEeeccCCCcccchHH-HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC--CCCceEEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_016762. 325 KILVGMQTGERASSED-QKYHNARCQARRVQELTFEINDLFAHLMRIGVVP--LKAEFTAIWDDLTVPTKAERLANSKTM 401 (456) Q Consensus 325 t~L~G~sp~Glnst~D-~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~--~~~d~~~~f~pL~~~seke~Aei~~~~ 401 (456) .+| | +..++.++ ...||. ..|.|.+.++-+.|-+.-+.+ ....+.|+|+.|...+.+++++ T Consensus 277 ~~l-~---~~~sn~e~~~~~f~~-------~tl~P~~~~ie~~l~~kll~~~~~~~g~~f~~~~l~~~d~~~~~~----- 340 (395) T protein:vir:98 277 SLL-H---GDIADNQKNYELLLE-------GPIESLITNIVDGLEYAIFDKSETLQGSFIKVTGLKNYDLFSISN----- 340 (395) T ss_pred HHh-c---CCcccHHHHHHHHHH-------HHHHHHHHHHHHHHHHhcCChhhhcCcceeeehhhhccCHHHHHH----- Confidence 866 3 23333333 455654 357888877776665443332 2345779999999999877654 Q ss_pred HHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCC---CCCCcCCCCCCC Q lcl|NC_016762. 402 SEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDE---DAARTDPTGEQQ 456 (456) Q Consensus 402 A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~---~~~~~d~~~~~e 456 (456) +...+++.| ++++||+|+..+++|+.+...+.--...+- +...+++..++| T Consensus 341 --~~~~~~~~G--~~T~NE~R~~~g~~Pi~~~~gD~~~~~~n~~~~~~~gge~~~~~~ 394 (395) T protein:vir:98 341 --QADKLISSG--FVFIDEVREEIGLPELPDGLGKVLYMTKNYESVLERGGEVDEEVE 394 (395) T ss_pred --HHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCceeeecccceecccccCCCCCCCC Confidence 445667777 999999999999999866221111111110 011112222222 No 108 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=99.28 E-value=5.2e-11 Score=76.95 Aligned_cols=412 Identities=12% Similarity=0.043 Sum_probs=172.2 Q ss_pred CCchh-H------HHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhH Q lcl|NC_016762. 1 MTDKL-D------LAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTT 73 (456) Q Consensus 1 ~~~~~-~------~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed 73 (456) |+.+. + |...|.-+..+-+....|...- ++-.......+ .-..+-+....+. ..+++|||..++- T Consensus 9 l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~------~~i~~~~~~~~-~~~~~~~~~~~~~-n~~~~iVd~~~~~ 80 (479) T protein:vir:99 9 LSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNG------QEVPDLATRHK-NKEREVLQQLSRK-PWMGLMVNSFAQQ 80 (479) T ss_pred CChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcC------CcccccccccC-ChhHHHHHHHhhc-CcHHHHHHHHHhh Confidence 55442 2 2212222222222233332211 11000000000 0112234444454 4599999999998 Q ss_pred HhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccccc-cCCcCceeE Q lcl|NC_016762. 74 CWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDRPA-RGKLNGLAK 151 (456) Q Consensus 74 ~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~Pl-~~~~~~l~~ 151 (456) +.=++|++. +.++. + .+.+.+++-++-....++.+....||.|++++.-+ +..+ .+|. +.....=.. T Consensus 81 l~~~gf~~~--d~~~~-~-------~~~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d-~~g~~~i~~~~p~~ 149 (479) T protein:vir:99 81 LIVDGYRKT--GTNEN-A-------KGWDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGISPLD-GTTVARIKCIDPRD 149 (479) T ss_pred cccccccCC--Cchhh-H-------HHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCcC-CCCceEEEEechhh Confidence 877776642 22211 1 24445555566677788888899999888876531 1000 0000 000000011 Q ss_pred EEEeccccCCh----hhhhcc-ccccccCCceeEEEeecccCCcc-ccceeeeh-hhh--heecCC---cCCCcchHHHH Q lcl|NC_016762. 152 VTPAWAGCLKP----KSFDEK-PDSETYGQPTMWEYTEASQAGRP-GLVRDIHP-DRV--FILGDW---TGDAIGFLEPA 219 (456) Q Consensus 152 i~~~~~~~~~~----~~~~~D-p~s~~yg~P~~y~i~~~~~~g~~-~~~~~IH~-SRl--i~~~~~---~~~G~S~le~~ 219 (456) +.++|.....- ..+..+ ...-.|+-+..|.+-... +|.. .....=|. -+| +.|... ..+|.|.++.+ T Consensus 150 ~~~iydd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~g~sd~e~v 228 (479) T protein:vir:99 150 AFAIWEDPYWDEWPKYLLERQPNGQYWWWTEEDYSIFEFK-QGKFIYRETVSHDYGHIPFVRYVNVMDLRGVCYGDVEPL 228 (479) T ss_pred eEEEecCCcccceeeEEEeecCceeEEEEecceEEEEEec-CCceeeccccccCCCCcceEEeecCCCcCcCCcchhHHH Confidence 22222111000 000000 001112222222121111 1110 00111222 222 223222 14699999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEe Q lcl|NC_016762. 220 YNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMV 299 (456) Q Consensus 220 ~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~ 299 (456) .+-+.+++++.-.....+-..+..++.+.. .. +.+.... .. . .+.-..+.++...+++.+..+ T Consensus 229 ~~liDa~~~~~s~~~~~~~~~a~p~~~i~G---~~---~~~~~~~--~~--~-------~~~~~~~~i~~~~~~~~~~~q 291 (479) T protein:vir:99 229 VTVAKAIDKTGLDILLVQHHQSFQIRWATG---LM---LPEGANA--DQ--E-------KMRFAQESMLISQNEKASFGA 291 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHhhchhhhhcC---CC---ccccccc--ch--h-------ccccccccceeecCCCceEEE Confidence 877777766644332222112222222211 00 0000000 00 0 011112233333444444333 Q ss_pred ---cccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----HHHHHHHHHHHhhhhHHHHHHHHHHHHhc Q lcl|NC_016762. 300 ---SAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----YHNARCQARRVQELTFEINDLFAHLMRIG 371 (456) Q Consensus 300 ---~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~ 371 (456) .++....+.+.....++|+.++||.. .||.+. |++++ ++ ..-..+..+| ..+++.|++++.+++... T Consensus 292 ~~~~~~~~~~~~l~~~i~~i~~~t~~p~~-~~g~~~---n~Sg~Al~~~~~~l~~ka~~~~-~~f~~al~~~~~l~~~~~ 366 (479) T protein:vir:99 292 IPAAPLDGLLNAYKESLLEFLALAQLPPH-IAGQIV---NVAADALAAGTRQTMQKLFEKQ-ATWKASHNQTMRLVNKIE 366 (479) T ss_pred ecccchHHHHHHHHHHHHHHhccCCCCHH-Hccccc---chHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHc Confidence 34455666677788899999999985 677531 33433 22 2233444444 468999999999887655 Q ss_pred CcCCC---CceEEEeCCCCCCCHHHHHHHHHHHHHHH----HHHHHcCCcCcCHHHHHHH---hc--------ccCCCCC Q lcl|NC_016762. 372 VVPLK---AEFTAIWDDLTVPTKAERLANSKTMSEIN----SAAIGTGEPVFTAEEIREE---AG--------YDPLQGG 433 (456) Q Consensus 372 ~~~~~---~d~~~~f~pL~~~seke~Aei~~~~A~a~----~~~~~~g~~~i~~~E~R~~---~~--------~~~~~~~ 433 (456) .+..+ -++++.|.+...+|..+.|+...|.+++. .+.+.. .+-++..++..+ .+ ...+..+ T Consensus 367 ~~~~~~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~is~et~l~~-l~gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~ 445 (479) T protein:vir:99 367 GRTEEATDLDFTITWQDVTIQSLAQFADAWAKMVESLKIPAEGVWDM-IPNLDQSTVNGWKEIYDREGDFGKYMRKLQNG 445 (479) T ss_pred CCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHh-cCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 44322 36888999999999999988877765531 122211 112444443211 00 0111100 Q ss_pred -CCC---cccCCCC-CCCCCcCCCCCCC Q lcl|NC_016762. 434 -DPL---PDTEPED-EDAARTDPTGEQQ 456 (456) Q Consensus 434 -~~~---~~~~~~d-~~~~~~d~~~~~e 456 (456) .+. .....++ .+++.++| ++.. T Consensus 446 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 472 (479) T protein:vir:99 446 PDPAEQRGGPNGATNMQQANNKT-GEPA 472 (479) T ss_pred cCcccccCCCCCCCCCCCCCCCC-cchh Confidence 000 0000011 11111111 1111 No 109 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=99.26 E-value=2.6e-10 Score=73.11 Aligned_cols=420 Identities=11% Similarity=-0.000 Sum_probs=179.9 Q ss_pred CCchhHHHHhHHHHHH-----HHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHh Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSA-----IARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCW 75 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~-----~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~t 75 (456) |++-....++..++.- .-+..+.|.. |..+- .. .+.. -..++...+....++++|||..++-+. T Consensus 18 l~~~e~~~i~~L~~~~~~~~~r~~~l~~YY~-----G~~~i---~~--~~~~-~p~~~~~~~~v~n~~~~iVd~~a~rl~ 86 (504) T protein:vir:99 18 LNDDVVDKVNGLYQQLVDRTPRNLLRASFYD-----GKYAI---RQ--IGNL-IPPEYLRTATVLGWSAKAVDTLARRCN 86 (504) T ss_pred CCHHHHHHHHHHHHHHHHHhHHHHHHHHHHh-----ccccc---hh--cccc-ccHHHHHHhhccCcHHHHHHHHHhhhc Confidence 5555443333222211 1111122211 00000 00 1111 123455555556669999999999888 Q ss_pred hCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCC--c---ccccc------ Q lcl|NC_016762. 76 KTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQP--W---DRPAR------ 143 (456) Q Consensus 76 R~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~--~---~~Pl~------ 143 (456) =+||++- +.++.. ..+.+.+.+-++-....++.+...+||.|++++.-+ |+++ . ..|.. T Consensus 87 ~~Gf~~~--d~~~~~-------~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~sP~~~~~iyD 157 (504) T protein:vir:99 87 LESFVWP--DGDYGS-------IGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVKSAMQATGEWN 157 (504) T ss_pred cceeeCC--CCChhh-------HHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceeEEEEe Confidence 8898762 211111 235555666667677888888889999998877653 2332 1 12221 Q ss_pred CCcCceeEEEEeccccCChhhhhccccccccCCcee-EEEeecccCCccccceeeehhh--hheecCC----cCCCcchH Q lcl|NC_016762. 144 GKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTM-WEYTEASQAGRPGLVRDIHPDR--VFILGDW----TGDAIGFL 216 (456) Q Consensus 144 ~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~-y~i~~~~~~g~~~~~~~IH~SR--li~~~~~----~~~G~S~l 216 (456) ...+.+.....+|... .+..+..-.++.|.. |.+... .++.......-|+=- ++.|... ...|.|.+ T Consensus 158 ~~~~~~~~a~~~~~~d-----~~g~~~~~~~y~~~~~~~~~~~-~~~~~~~~~~~~~~gvPvV~~~n~~~~~~~~G~sei 231 (504) T protein:vir:99 158 SRRNAMDSLLSITSRD-----AEGHPTGIALYEDGVTVTADMD-DDGDWHADVRTHKLGVPVEVLPYKPREDRPLGSSRI 231 (504) T ss_pred CCCCceeEEEEEEEec-----CCCeEEEEEEEcCCcEEEEEEc-CCceeeeccccCCCCcceEEecccccCccccCcccc Confidence 0111111111111100 000122222333432 333211 011111111112211 2333221 23577765 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEec-CCCc Q lcl|NC_016762. 217 -EPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPT-QGAT 294 (456) Q Consensus 217 -e~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid-~~d~ 294 (456) +++..-+.++.++.-.......-.+..++.+.. .+..+.....+.+... .+....++-.+-.+.+..+.. .+-+ T Consensus 232 ~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~G---~~~~~~~~~d~~~~~~-~~~~~~~i~~~~~~~~~~~~~~~~~~ 307 (504) T protein:vir:99 232 TRPVMSLQQRALKGCIRMDGHADVYSFPQLILLG---ADAKNFRNKDGSMKPA-WQIALARVFALPDDEDEPDAARARAD 307 (504) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcc---CCccccccccccccch-hhhhhhhhhcCCCccccccccCccce Confidence 344444444444432221111111222222211 1111111111111111 111111221121222211111 1123 Q ss_pred eeEE-ecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-----HHHHHHHHHHHHHhhhhHHHHHHHHHHH Q lcl|NC_016762. 295 VTQM-VSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-----QKYHNARCQARRVQELTFEINDLFAHLM 368 (456) Q Consensus 295 ~~~~-~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-----~~nyyd~I~~~Qe~~lrp~L~~l~~~l~ 368 (456) +.++ ++++.++.+.+.....+||+.++||.. -||...-.-|++++ ....-..+..+|+ .+.+.|++++.+.+ T Consensus 308 ~~q~~~~~l~~~~~~l~~~i~~~a~~t~~P~~-~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~-~f~~~l~~~~rla~ 385 (504) T protein:vir:99 308 VKQFPASSPQPHIEMLEQIAMMFSGETSIPVE-SLGFSNRANPTSADAYIASREDLIAEAEGATD-DWSPAFRRSMIRAL 385 (504) T ss_pred eeecCCCChHHHHHHHHHHHHHHHhhhCCCHH-HhcccccccccHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH Confidence 3333 234556677788889999999999965 67754322233332 3345566666665 57999999998765 Q ss_pred HhcC--cCCC---CceEEEeCCCCCCCHHHHHHHHHHHHHHHHH-------H-HHcCCcCcCHHHHHHHh---------- Q lcl|NC_016762. 369 RIGV--VPLK---AEFTAIWDDLTVPTKAERLANSKTMSEINSA-------A-IGTGEPVFTAEEIREEA---------- 425 (456) Q Consensus 369 ~s~~--~~~~---~d~~~~f~pL~~~seke~Aei~~~~A~a~~~-------~-~~~g~~~i~~~E~R~~~---------- 425 (456) .... ...+ ..+++.|.|...+|..+.|+...|.+++... + -..| ++++|+..+. T Consensus 386 ~~~~~~~~~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg---~~~~ei~r~~~e~~~~~~~~ 462 (504) T protein:vir:99 386 AIKNGLDRIPPEWKTIDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLG---LTPQQAKRALAERRRASSVS 462 (504) T ss_pred HHhcCCCccccccccceeEecCCCccCHHHHHHHHHHHHhhccccccchHHHHhhcC---CCHHHHHHHHHHHHHHhhHH Confidence 4432 2222 3578889999999999999988877765321 1 1223 4666653211 Q ss_pred ------cccCCCCCCCCcccCCCCCCCCCcCCCC-----CCC Q lcl|NC_016762. 426 ------GYDPLQGGDPLPDTEPEDEDAARTDPTG-----EQQ 456 (456) Q Consensus 426 ------~~~~~~~~~~~~~~~~~d~~~~~~d~~~-----~~e 456 (456) ...+.........++++ .+++.+.|++ .++ T Consensus 463 ~~~~l~~~~~~~~~~~~~~~~~~-~e~a~~~~~~~~~~p~~~ 503 (504) T protein:vir:99 463 IIEALNRRQQEAATAGEDQDQGA-GEPPANEPPAALGRPTLV 503 (504) T ss_pred HHHHHhcccCCCCCCCCCCCcCC-CCCCCCCCCccCCCcccC Confidence 00111111111111111 1111111111 111 No 110 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=99.25 E-value=9.3e-12 Score=81.07 Aligned_cols=344 Identities=14% Similarity=0.135 Sum_probs=155.4 Q ss_pred HHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhh--HHHHHH Q lcl|NC_016762. 22 MSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDE--TEWERK 99 (456) Q Consensus 22 d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~--~~~e~~ 99 (456) |++.+-..+. .+.+.. .++....+... ...+..+..+.+||+++|++.-.--+.+......+..... ...... T Consensus 1 M~~f~k~~~~--~~~~~~--~~~~~~~~~~~-~~~~~~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~ 75 (378) T protein:vir:85 1 MNLFGKVVSF--SRGKLN--NDTQRVTAWQN-EAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSD 75 (378) T ss_pred Cchhhhhhhh--hhcccc--cCCcceeeeec-cchhhhhHHHHHHHHHHHHhHhhCceeEEEEeccccccccccccccch Confidence 4443321111 111110 01111111000 0111234667889999999987766666433322111100 000011 Q ss_pred HHHHHH----HhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhcccccccc Q lcl|NC_016762. 100 NKPLIA----GGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETY 174 (456) Q Consensus 100 i~~~~~----~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~y 174 (456) +..++. ..--+..|.+.+.+ -.+.|-+++++...| ..+.+.+..+.. T Consensus 76 l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~----------~~g~~~~~~~~~------------------ 127 (378) T protein:vir:85 76 LDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDS----------ETGELLDLLFAN------------------ 127 (378) T ss_pred HHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecC----------CCceEEEEEecC------------------ Confidence 222221 11122334454443 456677777654322 111222221110 Q ss_pred CCceeEEEeecccCCccccceeeehhhhheec-CCcCC-CcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhh Q lcl|NC_016762. 175 GQPTMWEYTEASQAGRPGLVRDIHPDRVFILG-DWTGD-AIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKE 252 (456) Q Consensus 175 g~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~-~~~~~-G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 252 (456) .++.+-++.+||+. ..... +.+.++.....+.. .+++......++.... T Consensus 128 ------------------~~~~~~~~dvih~~~~~~~~~~~~~~~~a~~~~~~-----------~~~~~~~~g~l~~~~~ 178 (378) T protein:vir:85 128 ------------------DKKEYKPEELVRLVSPFYINEDTSILDNALASIQT-----------KLEQGKLRGLLKINAF 178 (378) T ss_pred ------------------CCEEEcccceEEEecCcCccchhhHHHHHHHHHHH-----------HHhcCCcceEEEeCCc Confidence 01223333444442 22222 23333333322211 1111110001111111 Q ss_pred ccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC---CCeEEecCCCceeEEecccCCHH-HHHHHHHHHHHhhhcCCeEEee Q lcl|NC_016762. 253 INLGEIASTYGVTLDALNERFNEAARQLNRG---NDVLLPTQGATVTQMVSAVSDPG-PTYNVNLQTAAAGVDIPTKILV 328 (456) Q Consensus 253 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~lid~~d~~~~~~~~~sgl~-~~~~~~~~~~aaas~IP~t~L~ 328 (456) ++ . ...++..+++.+.+.....+ .+.++++.+.+|++++.+...++ ........+||.+.|||..+|. T Consensus 179 l~-----~---~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~ 250 (378) T protein:vir:85 179 LD-----I---DNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIELIKSELLTGYFMNENILL 250 (378) T ss_pred CC-----H---HHHHHHHHHHHHHHHHhhcccccccceecCCCceEEeccCChhhhhHHHHHHHHHHHHHHhCCCHHHhc Confidence 10 0 01133444554444444332 24567777889999887665544 2234445689999999988774 Q ss_pred ccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC----------CCCceEEEeCCCCCCCHHHHHHHH Q lcl|NC_016762. 329 GMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVP----------LKAEFTAIWDDLTVPTKAERLANS 398 (456) Q Consensus 329 G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~----------~~~d~~~~f~pL~~~seke~Aei~ 398 (456) |. ++.+-..+||.. -|.|.+.++-.-|-+.-+-+ ...++.|+++.|...|.+++++ T Consensus 251 ~s-----~~e~~~~~f~~~-------tL~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~f~~~~l~~~d~~~~~~-- 316 (378) T protein:vir:85 251 GT-----ATQEQQIYFYNS-------TIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELID-- 316 (378) T ss_pred CC-----chHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCChhhhhhhhhccccceeeecchhhhhcCHHHHHH-- Confidence 31 122233455543 48888877766554332221 1124667788899888887755 Q ss_pred HHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCC-------cccCCCCCCCC----CcCCCCCCC Q lcl|NC_016762. 399 KTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPL-------PDTEPEDEDAA----RTDPTGEQQ 456 (456) Q Consensus 399 ~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~-------~~~~~~d~~~~----~~d~~~~~e 456 (456) +...++..| +++++|+|+..+++|+++++.. +.+...+.+.. ++.++..+| T Consensus 317 -----~~~~~~~~G--~~T~NE~R~~lgl~p~~gGD~~~~~~N~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:85 317 -----LYHENINGP--IFTQNQLLVKMGEQPIEGGDIYIANLNAVAVKNLSDLQGSRKDVASTDETNNQ 378 (378) T ss_pred -----HHHHHHhCC--CcCHHHHHHHhCCCCCCCCCeEeecccccccccchhhcCccCCCCCCCCCCCC Confidence 445667777 9999999999999999876532 11111111111 111112222 No 111 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=99.23 E-value=4.3e-10 Score=71.91 Aligned_cols=409 Identities=12% Similarity=0.039 Sum_probs=176.0 Q ss_pred CCchhHHHHh----HHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhh Q lcl|NC_016762. 1 MTDKLDLAVN----HAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWK 76 (456) Q Consensus 1 ~~~~~~~~~~----~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR 76 (456) |+.-.+.+.- |.-+...-+..+.|.. | +++ . ...+ .. .+.++...-....++++|||..++-+.= T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~~~l~~Yy~-----G-~~~-i-~~~~--~~-~~~~~~~~~~~~n~~~~ivd~~~~~l~~ 69 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRN-----G-TRR-L-KTIG--IG-APPELAYLDVQPGWVATYLRTLSDRLDI 69 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHh-----c-ccc-c-cccc--cc-cchhHhhhhhhcchHHHHHHHHHhhhcc Confidence 9888876542 2222222223333322 1 111 0 0111 11 1223333333456689999999998888 Q ss_pred CCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccc---cccCCcCceeEEE Q lcl|NC_016762. 77 TNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDR---PARGKLNGLAKVT 153 (456) Q Consensus 77 ~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~---Pl~~~~~~l~~i~ 153 (456) +||.+.. ++. . .+.+.+.+++-++-....++.+...+||.|++++..++....+. |. ...-.-..+. T Consensus 70 ~g~~~~~-d~~-~-------~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~-i~~~~p~~~~ 139 (480) T protein:vir:78 70 EGFRISE-DSE-G-------LEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPL-IRVESPLYMY 139 (480) T ss_pred CceecCC-Cch-h-------HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeE-EEEEcccceE Confidence 8876522 211 1 12366677777788889999999999999988886532211110 10 0000001122 Q ss_pred Eecccc----CChhh-hhccccccccCCceeE---------EEeecccCCccccc---ee-e-eh-hhh--heecC---- Q lcl|NC_016762. 154 PAWAGC----LKPKS-FDEKPDSETYGQPTMW---------EYTEASQAGRPGLV---RD-I-HP-DRV--FILGD---- 207 (456) Q Consensus 154 ~~~~~~----~~~~~-~~~Dp~s~~yg~P~~y---------~i~~~~~~g~~~~~---~~-I-H~-SRl--i~~~~---- 207 (456) |+|-.. +...- +..+ ..+.+.+.++ .+... ++..... .. + |. .+| +.|.. T Consensus 140 ~~~D~~~~~~~~~~i~~~~~--~~~~~~~~~~~~y~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~ 215 (480) T protein:vir:78 140 AELDPRNTRRVTRAVRLYTT--RDDVAVPDRATLYLPDETVPLRRN--GGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRL 215 (480) T ss_pred EEEcCCCccceEEEEEEEEe--ecCCCceEEEEEEeCCeEEEEEec--CCCccccccccccccCCCCCcceEEeeccccc Confidence 233110 00000 0000 0112222211 11110 1111000 01 1 11 122 22321 Q ss_pred CcCCCcchHHH-HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCe Q lcl|NC_016762. 208 WTGDAIGFLEP-AYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDV 286 (456) Q Consensus 208 ~~~~G~S~le~-~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 286 (456) ...+|.|.++. +.+-+.+++++....+..+-..+..++.+.. .+.....+ +.-...+. ...+. T Consensus 216 ~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G---~~~~~~~~----------~~~~~~~~---~~~~~ 279 (480) T protein:vir:78 216 GNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISG---VTTDELTN----------DGENTTLD---IYYGR 279 (480) T ss_pred CCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhhc---CCcccccc----------ccccchhh---hhhhh Confidence 12468888765 5444444555443332222112222222211 11000000 00000111 11122 Q ss_pred EEecCCCceeEEecc---cCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HHH-H---HHHHHHHHHhhhhH Q lcl|NC_016762. 287 LLPTQGATVTQMVSA---VSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QKY-H---NARCQARRVQELTF 358 (456) Q Consensus 287 ~lid~~d~~~~~~~~---~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~n-y---yd~I~~~Qe~~lrp 358 (456) ++...+++.+..+.+ +....+.+.....++|+.++||...| |..... +++++ ++. | -..+ ..++..+.+ T Consensus 280 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~-g~~~~n-~~Sg~Alk~~~~~l~~ka-~~~~~~f~~ 356 (480) T protein:vir:78 280 ILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYL-SSSSEN-PASAEAIIATDSRIVKMA-ERKGRIFGG 356 (480) T ss_pred hccCCCCCceEEecCccCHHHHHHHHHHHHHHHhcccCCChHHh-ccccCc-chHHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 222233343334333 44455667777888999999999866 443332 23332 222 2 2333 344456789 Q ss_pred HHHHHHHHHHHhcCcCC---CCceEEEeCCCCCCCHHHHHHHHHHHHHHHH------H-HHHcCCcCcCHHHHHHHh--- Q lcl|NC_016762. 359 EINDLFAHLMRIGVVPL---KAEFTAIWDDLTVPTKAERLANSKTMSEINS------A-AIGTGEPVFTAEEIREEA--- 425 (456) Q Consensus 359 ~L~~l~~~l~~s~~~~~---~~d~~~~f~pL~~~seke~Aei~~~~A~a~~------~-~~~~g~~~i~~~E~R~~~--- 425 (456) .|++++.+++....... ...+++.|.+-..+|..+.|+...+.+++.. + +...| ++++++.++. T Consensus 357 ~l~~~~~l~~~~~g~~~~~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg---~~~d~~~~~~~~~ 433 (480) T protein:vir:78 357 AWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLG---YTATQREQMRDWD 433 (480) T ss_pred HHHHHHHHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCC---CCHhHHHHHHHHH Confidence 99999998776543322 2368889999999999999887766555321 1 11222 3444332211 Q ss_pred ---cc---cCCC---CCCCCccc-CCCCCC--CCCcCCCCCCC Q lcl|NC_016762. 426 ---GY---DPLQ---GGDPLPDT-EPEDED--AARTDPTGEQQ 456 (456) Q Consensus 426 ---~~---~~~~---~~~~~~~~-~~~d~~--~~~~d~~~~~e 456 (456) .. +.+. ........ +...+. +++..|+|... T Consensus 434 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 476 (480) T protein:vir:78 434 KQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNR 476 (480) T ss_pred HHHHHHHHHHhhccccccCCCCCCCCCCCCCCccccccCCCCc Confidence 00 1111 01111100 001111 11111222222 No 112 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=99.22 E-value=2.8e-10 Score=72.97 Aligned_cols=423 Identities=10% Similarity=-0.011 Sum_probs=189.3 Q ss_pred CCchh--HHHHhHHHHHH-HHHHHHHHhhhhhccCc-ccchhh--hhccC---cccCCHHHHHHHHhcCchhhhhhccch Q lcl|NC_016762. 1 MTDKL--DLAVNHAMSSA-IARARMSLLNQGIGHDA-KRPQAW--CEYGF---PQEITFNDLYTMYRRGGIAHGAVEKIV 71 (456) Q Consensus 1 ~~~~~--~~~~~~a~~~~-~~~~~d~~~n~~~~~gt-~~~~~~--~~~~~---~~~~~~~~l~~~Y~~~~l~r~iVd~~a 71 (456) ++.++ ++...|..... +.+...-|-+.-..+.. .+.... ..|.. .......--.. -.+.+++.||+..+ T Consensus 15 ~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k--i~~n~~~~ivd~~~ 92 (474) T protein:vir:94 15 ILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNK--LNNSFDSEIVDTRV 92 (474) T ss_pred CCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccc--cccchHHHHHHhHh Confidence 23221 11212221111 11111111111000000 000000 00000 00000000000 13789999999999 Q ss_pred hHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCc---cccccC--- Q lcl|NC_016762. 72 TTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPW---DRPARG--- 144 (456) Q Consensus 72 ed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~---~~Pl~~--- 144 (456) .=++.+.+++...++.+. ...+.+.|...+++-++.....++.+....||.|++++.++ +|+.- -.|... T Consensus 93 ~yl~g~pv~~~~~~~~~~---~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~~~~i~p~~~~~v 169 (474) T protein:vir:94 93 GYLHGVPVTYDLDENAEK---NEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDIRIKNIDPYNVIFV 169 (474) T ss_pred hheeccceeEeeCCCCcc---hHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCeeEEEEEcccceEEE Confidence 999999888865432222 12233456677777788999999999999999999988774 34321 123221 Q ss_pred ---CcCceeEEEEeccccCChhhhhcc-ccccccCCceeEEEeecccCCccccceeeehhhhheecC--CcCCCcchHHH Q lcl|NC_016762. 145 ---KLNGLAKVTPAWAGCLKPKSFDEK-PDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD--WTGDAIGFLEP 218 (456) Q Consensus 145 ---~~~~l~~i~~~~~~~~~~~~~~~D-p~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~--~~~~G~S~le~ 218 (456) ..+.+. ..-+|....... ... -.-..|..-+.|.+.....++.......-|..-.+.+.. ...+|.|.++. T Consensus 170 ~d~~~~~~~-~i~~~~~~~~~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~ 246 (474) T protein:vir:94 170 GDNILEPTY-SLRYFYEKDDDN--GTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFGVPNNKEMIGDAEK 246 (474) T ss_pred EcCCCceEE-EEEEEEEeeCCC--ceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEEecCCCCCCCchHH Confidence 111111 111221100000 000 011123333444443222221111122345554444433 34568999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEE Q lcl|NC_016762. 219 AYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQM 298 (456) Q Consensus 219 ~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~ 298 (456) +..-+.+++.+.-..+..+-..+...+.++. .+.. ++.. ..+....-..+.+.+.+++.+ T Consensus 247 v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g------------~~~~-~~~~-------~~~~~~~~i~~~~~~~~~~~l 306 (474) T protein:vir:94 247 VIHLIDAYDLTMSDASSEISQTRLAYLVLRG------------MGMS-EEMI-------QETQKSGAFELFDKDMDVKYL 306 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcchhhhcc------------CCCC-chhh-------hhhhhcceeEecCCCCceeEE Confidence 8877777777655555444322222222211 0101 1111 111222222233445555554 Q ss_pred --ecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HHH-HHH--HHHHHHHhhhhHHHHHHHHHHHHh-c Q lcl|NC_016762. 299 --VSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QKY-HNA--RCQARRVQELTFEINDLFAHLMRI-G 371 (456) Q Consensus 299 --~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~n-yyd--~I~~~Qe~~lrp~L~~l~~~l~~s-~ 371 (456) +.+..+....++.+.+.|...+++|-.-. +...| |.++. ++. |.. .-...++..++..|++++++++.. + T Consensus 307 ~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~~~~--n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~ 383 (474) T protein:vir:94 307 TKDVNDTMIENHLDRIEKNIMRFAKSVNFNS-DEFNG--NVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALK 383 (474) T ss_pred eccCCHHHHHHHHHHHHHHHHHHhCCccccc-ccccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 45667889999999999999999996422 22212 33333 222 222 223455667899999998887643 1 Q ss_pred ---Cc--C-CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCC Q lcl|NC_016762. 372 ---VV--P-LKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPED 443 (456) Q Consensus 372 ---~~--~-~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d 443 (456) .+ + ...++++.|+|-...+++|.|++..+.+-..+ +++..--.+-++++..+....+........++....+ T Consensus 384 ~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~~~~ 463 (474) T protein:vir:94 384 RKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLKGQVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDIDEGD 463 (474) T ss_pred hccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccCCC Confidence 11 1 12478999999999999999999877653221 1121110122333221111111100000111111111 Q ss_pred CCCCCcCCCCCCC Q lcl|NC_016762. 444 EDAARTDPTGEQQ 456 (456) Q Consensus 444 ~~~~~~d~~~~~e 456 (456) .+..+ +..++| T Consensus 464 ~~~~~--~~~~s~ 474 (474) T protein:vir:94 464 ANDKS--QNNQSE 474 (474) T ss_pred cCCCC--ccccCC Confidence 11111 111111 No 113 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=99.22 E-value=2.8e-10 Score=72.97 Aligned_cols=423 Identities=10% Similarity=-0.011 Sum_probs=189.3 Q ss_pred CCchh--HHHHhHHHHHH-HHHHHHHHhhhhhccCc-ccchhh--hhccC---cccCCHHHHHHHHhcCchhhhhhccch Q lcl|NC_016762. 1 MTDKL--DLAVNHAMSSA-IARARMSLLNQGIGHDA-KRPQAW--CEYGF---PQEITFNDLYTMYRRGGIAHGAVEKIV 71 (456) Q Consensus 1 ~~~~~--~~~~~~a~~~~-~~~~~d~~~n~~~~~gt-~~~~~~--~~~~~---~~~~~~~~l~~~Y~~~~l~r~iVd~~a 71 (456) ++.++ ++...|..... +.+...-|-+.-..+.. .+.... ..|.. .......--.. -.+.+++.||+..+ T Consensus 15 ~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k--i~~n~~~~ivd~~~ 92 (474) T protein:vir:10 15 ILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNK--LNNSFDSEIVDTRV 92 (474) T ss_pred CCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccc--cccchHHHHHHhHh Confidence 23221 11212221111 11111111111000000 000000 00000 00000000000 13789999999999 Q ss_pred hHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCc---cccccC--- Q lcl|NC_016762. 72 TTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPW---DRPARG--- 144 (456) Q Consensus 72 ed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~---~~Pl~~--- 144 (456) .=++.+.+++...++.+. ...+.+.|...+++-++.....++.+....||.|++++.++ +|+.- -.|... T Consensus 93 ~yl~g~pv~~~~~~~~~~---~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~~~~i~p~~~~~v 169 (474) T protein:vir:10 93 GYLHGVPVTYDLDENAEK---NEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDIRIKNIDPYNVIFV 169 (474) T ss_pred hheeccceeEeeCCCCcc---hHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCeeEEEEEcccceEEE Confidence 999999888865432222 12233456677777788999999999999999999988774 34321 123221 Q ss_pred ---CcCceeEEEEeccccCChhhhhcc-ccccccCCceeEEEeecccCCccccceeeehhhhheecC--CcCCCcchHHH Q lcl|NC_016762. 145 ---KLNGLAKVTPAWAGCLKPKSFDEK-PDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD--WTGDAIGFLEP 218 (456) Q Consensus 145 ---~~~~l~~i~~~~~~~~~~~~~~~D-p~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~--~~~~G~S~le~ 218 (456) ..+.+. ..-+|....... ... -.-..|..-+.|.+.....++.......-|..-.+.+.. ...+|.|.++. T Consensus 170 ~d~~~~~~~-~i~~~~~~~~~~--~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~ 246 (474) T protein:vir:10 170 GDNILEPTY-SLRYFYEKDDDN--GTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFGVPNNKEMIGDAEK 246 (474) T ss_pred EcCCCceEE-EEEEEEEeeCCC--ceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEEecCCCCCCCchHH Confidence 111111 111221100000 000 011123333444443222221111122345554444433 34568999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEE Q lcl|NC_016762. 219 AYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQM 298 (456) Q Consensus 219 ~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~ 298 (456) +..-+.+++.+.-..+..+-..+...+.++. .+.. ++.. ..+....-..+.+.+.+++.+ T Consensus 247 v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g------------~~~~-~~~~-------~~~~~~~~i~~~~~~~~~~~l 306 (474) T protein:vir:10 247 VIHLIDAYDLTMSDASSEISQTRLAYLVLRG------------MGMS-EEMI-------QETQKSGAFELFDKDMDVKYL 306 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcchhhhcc------------CCCC-chhh-------hhhhhcceeEecCCCCceeEE Confidence 8877777777655555444322222222211 0101 1111 111222222233445555554 Q ss_pred --ecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HHH-HHH--HHHHHHHhhhhHHHHHHHHHHHHh-c Q lcl|NC_016762. 299 --VSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QKY-HNA--RCQARRVQELTFEINDLFAHLMRI-G 371 (456) Q Consensus 299 --~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~n-yyd--~I~~~Qe~~lrp~L~~l~~~l~~s-~ 371 (456) +.+..+....++.+.+.|...+++|-.-. +...| |.++. ++. |.. .-...++..++..|++++++++.. + T Consensus 307 ~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~~~~--n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~ 383 (474) T protein:vir:10 307 TKDVNDTMIENHLDRIEKNIMRFAKSVNFNS-DEFNG--NVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALK 383 (474) T ss_pred eccCCHHHHHHHHHHHHHHHHHHhCCccccc-ccccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 45667889999999999999999996422 22212 33333 222 222 223455667899999998887643 1 Q ss_pred ---Cc--C-CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCC Q lcl|NC_016762. 372 ---VV--P-LKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPED 443 (456) Q Consensus 372 ---~~--~-~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d 443 (456) .+ + ...++++.|+|-...+++|.|++..+.+-..+ +++..--.+-++++..+....+........++....+ T Consensus 384 ~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~~~~ 463 (474) T protein:vir:10 384 RKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLKGQVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDIDEGD 463 (474) T ss_pred hccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccCCC Confidence 11 1 12478999999999999999999877653221 1121110122333221111111100000111111111 Q ss_pred CCCCCcCCCCCCC Q lcl|NC_016762. 444 EDAARTDPTGEQQ 456 (456) Q Consensus 444 ~~~~~~d~~~~~e 456 (456) .+..+ +..++| T Consensus 464 ~~~~~--~~~~s~ 474 (474) T protein:vir:10 464 ANDKS--QNNQSE 474 (474) T ss_pred cCCCC--ccccCC Confidence 11111 111111 No 114 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=99.21 E-value=1.6e-10 Score=74.24 Aligned_cols=413 Identities=12% Similarity=0.024 Sum_probs=173.4 Q ss_pred CCchhHH--------------HHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhh Q lcl|NC_016762. 1 MTDKLDL--------------AVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGA 66 (456) Q Consensus 1 ~~~~~~~--------------~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~i 66 (456) ||-..++ +..|-.+...-+..+.|... +++ . .. .+.. .+.++........++++| T Consensus 1 ~~~~~~~~~e~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~G------~~~-i-~~--~~~~-~~~~~~~~~~v~n~~~~i 69 (486) T protein:vir:42 1 MTAPLPGMEEIEDPAVVREEMISAFEDASKDLASNTSYYDA------ERR-P-EA--IGVT-VPREMQQLLAHVGYPRLY 69 (486) T ss_pred CCCCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhcc------cCc-c-hh--cccc-cchhHhhhhhccchHHHH Confidence 3322222 22222222222223333321 111 0 00 0111 123344444455669999 Q ss_pred hccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCc Q lcl|NC_016762. 67 VEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKL 146 (456) Q Consensus 67 Vd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~ 146 (456) ||..++-+.=.||++- +.++.. ..+.+.+++.++-....++.+....||.|++++..+.+.....+.++.. T Consensus 70 Vd~~~~~l~~~g~~~~--~~~~~~-------~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~ 140 (486) T protein:vir:42 70 VDSVAERQAVEGFRLG--DADEAD-------EELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVP 140 (486) T ss_pred HHHHHhhhcccceecC--CCchhH-------HHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCee Confidence 9999988877887752 211111 2355566666777788888899999999988886643221111111000 Q ss_pred Ccee-----EEEEecccc---CChh--hhh-cc---ccccccCCce-eEEEeecccCCcccc-ceeeehhhhheecCC-- Q lcl|NC_016762. 147 NGLA-----KVTPAWAGC---LKPK--SFD-EK---PDSETYGQPT-MWEYTEASQAGRPGL-VRDIHPDRVFILGDW-- 208 (456) Q Consensus 147 ~~l~-----~i~~~~~~~---~~~~--~~~-~D---p~s~~yg~P~-~y~i~~~~~~g~~~~-~~~IH~SRli~~~~~-- 208 (456) .|. .+.++|-.. +... -+. .+ +..-.++.|. .|++.. .+|...- ...-|+--.+.+.++ T Consensus 141 -~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~--~~~~~~~~~~~~h~~g~vPvv~~~n 217 (486) T protein:vir:42 141 -IIRVEPPTRMHAEIDPRINRVSKAIRVAYDKEGNEIQAATLYTPMETIGWFR--ADGEWAEWFNVPHGLGVVPVVPLPN 217 (486) T ss_pred -EEEEecccceEEEEeCCCCCeEEEEEEEEecCCCeEEEEEEEcCCcEEEEEe--cCCcEEeecceecCCCCceEEEecc Confidence 011 111222100 0000 000 00 1111122222 122211 1111110 111244322222211 Q ss_pred -----cCCCcchHHH-HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhc Q lcl|NC_016762. 209 -----TGDAIGFLEP-AYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNR 282 (456) Q Consensus 209 -----~~~G~S~le~-~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 282 (456) ..+|.|.+++ +..-+.+++++.-..+...--.+..++.+.. .+........+.+ ...+.. T Consensus 218 ~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G---~~~~~~~~~~~~~-----------~~~~~~ 283 (486) T protein:vir:42 218 RTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFG---IKPEEIGVDSETG-----------QTLFDA 283 (486) T ss_pred ccccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhc---CCccccccccccc-----------cchhhh Confidence 2368888875 4333344444432222111111122222211 0111110000000 001112 Q ss_pred CCCeEEecCCCceeEEecccCC---HHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HHH----HHHHHHHHHHh Q lcl|NC_016762. 283 GNDVLLPTQGATVTQMVSAVSD---PGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QKY----HNARCQARRVQ 354 (456) Q Consensus 283 ~~~~~lid~~d~~~~~~~~~sg---l~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~n----yyd~I~~~Qe~ 354 (456) ..+.++...+++.+..+.+-++ ..+.+.....++|+.+++|...| |.++.. +++++ ++. ....++ .++. T Consensus 284 ~~~~~~~~~~~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~f-g~~~~n-~~Sg~Al~~~~~~l~~ka~-~~~~ 360 (486) T protein:vir:42 284 YLARILAFEDAEGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYL-STAADN-PASAEAIRAAESRLIKKVE-RKNL 360 (486) T ss_pred hhchhcccCCCCceEEeecccCHHHHHHHHHHHHHHHhcccCCCHHHh-ccccCc-hhHHHHHHHHHHHHHHHHH-HHHH Confidence 2233334344444444444444 45666666888999999998766 443332 23333 332 233344 3445 Q ss_pred hhhHHHHHHHHHHHHhcCcC----CCCceEEEeCCCCCCCHHHHHHHHHHHHHHH-------HHHHHcCCcCcCHH---H Q lcl|NC_016762. 355 ELTFEINDLFAHLMRIGVVP----LKAEFTAIWDDLTVPTKAERLANSKTMSEIN-------SAAIGTGEPVFTAE---E 420 (456) Q Consensus 355 ~lrp~L~~l~~~l~~s~~~~----~~~d~~~~f~pL~~~seke~Aei~~~~A~a~-------~~~~~~g~~~i~~~---E 420 (456) .+++.|++++.+++....+. ...++++.|+|-..+|..+.|+...|.+++. ......| ++++ | T Consensus 361 ~f~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg---~~~d~~~e 437 (486) T protein:vir:42 361 MFGGAWEEAMRIAYRIMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDMG---YSVKEREE 437 (486) T ss_pred HHHHHHHHHHHHHHHHhcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCC---CChhHHHH Confidence 68999999999876654331 1236889999999999999999887776542 1112222 2333 3 Q ss_pred HHHHhc---------ccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 421 IREEAG---------YDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 421 ~R~~~~---------~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) +++... ...+.+........++..+.+.++|.+.+. T Consensus 438 ~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 482 (486) T protein:vir:42 438 MRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPTAPPKPQPAIESS 482 (486) T ss_pred HHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCCCCCCCcccCCC Confidence 322100 111111111111111111222222222222 No 115 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=99.20 E-value=5.4e-10 Score=71.39 Aligned_cols=412 Identities=12% Similarity=0.053 Sum_probs=175.8 Q ss_pred CCchhHHHHhHHH----HHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhh Q lcl|NC_016762. 1 MTDKLDLAVNHAM----SSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWK 76 (456) Q Consensus 1 ~~~~~~~~~~~a~----~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR 76 (456) |+.-++++.-..- +...-...+.|.. |...-+ ..+ .. -+.++........++++|||..++=+.= T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~~~~~~Yy~-----G~~~i~---~~~--~~-~~~~~~~~~~~~n~~~~ivd~~~~~l~~ 69 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRN-----GTRRLK---TIG--IG-APPELAYLDVQPGWVATYLRTLSDRLDI 69 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHh-----ccccch---hcc--cc-cchhhhhhhhhcchHHHHHHHHHhhhcc Confidence 9999887654221 1111112233322 111100 000 11 1223333334567899999999998887 Q ss_pred CCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccc---cccCCcCceeEEE Q lcl|NC_016762. 77 TNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDR---PARGKLNGLAKVT 153 (456) Q Consensus 77 ~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~---Pl~~~~~~l~~i~ 153 (456) +||.+. +++ +. ...+.+.+++-++-....++.+....||.|++++.-+.....+. |. ...-.=..+. T Consensus 70 ~g~~~~-~d~-~~-------~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~-i~~~~p~~~~ 139 (480) T protein:vir:78 70 EGFRIS-EDS-EG-------LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPL-IRVESPLYMY 139 (480) T ss_pred CceecC-CCc-hh-------HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeE-EEEEcccceE Confidence 787652 221 11 12366777777888889999999999999988876432111110 00 0000000111 Q ss_pred Eeccc----cCChhh-h-h-cc----ccccccCCce-eEEEeecccCCccccc---ee-e-eh-hhh--heecC----Cc Q lcl|NC_016762. 154 PAWAG----CLKPKS-F-D-EK----PDSETYGQPT-MWEYTEASQAGRPGLV---RD-I-HP-DRV--FILGD----WT 209 (456) Q Consensus 154 ~~~~~----~~~~~~-~-~-~D----p~s~~yg~P~-~y~i~~~~~~g~~~~~---~~-I-H~-SRl--i~~~~----~~ 209 (456) |+|-. .+...- + . .| +....++.|. .|.+.. .++..... .. + |. -+| +.|.. .. T Consensus 140 ~i~D~~~~~~~~~~i~~~~~~d~~~~~~~~~~y~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~ 217 (480) T protein:vir:78 140 AELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRR--NGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGN 217 (480) T ss_pred EEEcCCCccceEEEEEEEEeecCCcceEEEEEEeCCeEEEEEe--cCCCcccccccccccccCCCCcceEEeecccccCC Confidence 22210 000000 0 0 00 1111122221 111111 11111100 01 1 21 111 22321 12 Q ss_pred CCCcchHHH-HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEE Q lcl|NC_016762. 210 GDAIGFLEP-AYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLL 288 (456) Q Consensus 210 ~~G~S~le~-~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 288 (456) .+|.|.+++ +..-+.+++++.......+-..+..++.+.. .++....+ . .-...+ ....+.++ T Consensus 218 ~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G---~~~~~~~~---~-------~~~~~~---~~~~~~~~ 281 (480) T protein:vir:78 218 RYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISG---VTTDELTN---D-------GENTTL---DIYYGRIL 281 (480) T ss_pred ccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhC---CCcccccc---c-------cccchh---hhhhhhhc Confidence 468888764 5544445555543332222112222222211 11000000 0 000011 11112223 Q ss_pred ecCCCceeEEeccc---CCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HHH---HHHHHHHHHHhhhhHHHH Q lcl|NC_016762. 289 PTQGATVTQMVSAV---SDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QKY---HNARCQARRVQELTFEIN 361 (456) Q Consensus 289 id~~d~~~~~~~~~---sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~n---yyd~I~~~Qe~~lrp~L~ 361 (456) ...+++.+..+.+- ....+.+.....++++.+++|...| |.++.. +++++ ++. --......++..+++.|+ T Consensus 282 ~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~f-g~~~~n-~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~ 359 (480) T protein:vir:78 282 TLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYL-SSSSEN-PASAEAIIATDSRIVKMAERKGRIFGGAWE 359 (480) T ss_pred cCCCCCceEEecCccCHHHHHHHHHHHHHHHhcccCCCHHHh-ccccCc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33334444444333 4455666777888999999998755 543332 23333 221 123333455556899999 Q ss_pred HHHHHHHHhcCcCC---CCceEEEeCCCCCCCHHHHHHHHHHHHHHH-----H-H-HHHcCCcCcCHHHHHHH------h Q lcl|NC_016762. 362 DLFAHLMRIGVVPL---KAEFTAIWDDLTVPTKAERLANSKTMSEIN-----S-A-AIGTGEPVFTAEEIREE------A 425 (456) Q Consensus 362 ~l~~~l~~s~~~~~---~~d~~~~f~pL~~~seke~Aei~~~~A~a~-----~-~-~~~~g~~~i~~~E~R~~------~ 425 (456) +++.+++....... ..++++.|+|-..+|..+.|+...+.+++. . + +...| ++++++.+. - T Consensus 360 ~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg---~~~d~~~e~~~~~~~~ 436 (480) T protein:vir:78 360 RAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLG---YTATQREQMRDWDKQE 436 (480) T ss_pred HHHHHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCC---CCHhHHHHHHHHHHHH Confidence 99998876644322 236899999999999999988776665532 1 1 11223 334333221 1 Q ss_pred cccC---C---CCCCCCcccCCCCCC---CCCcCCCCCCC Q lcl|NC_016762. 426 GYDP---L---QGGDPLPDTEPEDED---AARTDPTGEQQ 456 (456) Q Consensus 426 ~~~~---~---~~~~~~~~~~~~d~~---~~~~d~~~~~e 456 (456) +..+ + ..+.+.....+...+ .++..|.+.-. T Consensus 437 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 476 (480) T protein:vir:78 437 TEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNR 476 (480) T ss_pred HHHHHHHhhccccCCCccccCCCCCCCCCccCCCcccCCC Confidence 1101 0 011111000000000 01111111111 No 116 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.19 E-value=5.6e-10 Score=71.30 Aligned_cols=411 Identities=10% Similarity=-0.008 Sum_probs=168.0 Q ss_pred CCchhHHH---------Hh-----HHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhh Q lcl|NC_016762. 1 MTDKLDLA---------VN-----HAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGA 66 (456) Q Consensus 1 ~~~~~~~~---------~~-----~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~i 66 (456) ||-..++. ++ |..+..+-+....|... +++-.. .+... ..++........++++| T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G------~~~i~~----~~~~~-~~~~~~~~~~~n~~~~i 69 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFEDSTQNLKTNTSYYEA------ERRPEA----IGVTV-PIQMQSLLAHVGYPRLY 69 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhc------CCcchh----cCCCC-ChhhhhhhhhcCcHHHH Confidence 33332222 11 11111111122222211 111000 11111 22333444456788999 Q ss_pred hccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCc Q lcl|NC_016762. 67 VEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKL 146 (456) Q Consensus 67 Vd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~ 146 (456) ||..++=+.=+||++ ++.++. .+.+.+.+.+-++-....++.+...+||.|++++..+.......+-++.. T Consensus 70 vd~~~~~l~~~g~~~--~~~~~~-------~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~ 140 (485) T protein:vir:10 70 VDSIAERQAVEGFRF--GDADEA-------DEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTP 140 (485) T ss_pred HHHHHhhhcccceec--CCCchh-------HHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCee Confidence 999999887777764 222211 12366667777788888889999999999998887653221111000000 Q ss_pred CceeE-----EEEeccccC---Chh-hhhcc-----ccccccCCceeEEEeecccCCccc-cceeeehhhhh---eecC- Q lcl|NC_016762. 147 NGLAK-----VTPAWAGCL---KPK-SFDEK-----PDSETYGQPTMWEYTEASQAGRPG-LVRDIHPDRVF---ILGD- 207 (456) Q Consensus 147 ~~l~~-----i~~~~~~~~---~~~-~~~~D-----p~s~~yg~P~~y~i~~~~~~g~~~-~~~~IH~SRli---~~~~- 207 (456) -|.. +.++|-... ... .+..+ .....++.|..+..-.. .++... ....=|.--.+ .|.. T Consensus 141 -~i~~~~p~~~~~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~-~~~~~~~~~~~~~~~g~vPvv~~~n~ 218 (485) T protein:vir:10 141 -IIRVEPPTRMYAEIDPRIGRVSKAIRVAYDAEGNEIQAATLYTPNDIFGWYR-VENEWQEWFNNPHGLGVVPVVPIPNR 218 (485) T ss_pred -EEEEEccceeEEEEcCCCCceeEEEEEEEeeCCCeEEEEEEEeCCeEEEEEE-cCCceEEeccccCCCCcccEEEeccc Confidence 0111 111111000 000 00000 11112222222111100 011100 00111332222 2221 Q ss_pred ---CcCCCcchHHH-HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC Q lcl|NC_016762. 208 ---WTGDAIGFLEP-AYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG 283 (456) Q Consensus 208 ---~~~~G~S~le~-~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 283 (456) ...+|.|.+++ +..-+.+++++.-......-..+..++.+.. ....+.....+.+ ...+... T Consensus 219 ~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G---~~~~~~~~~~~~~-----------~~~~~~~ 284 (485) T protein:vir:10 219 TRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFG---IKPEEIGVDPETG-----------QTLFDAY 284 (485) T ss_pred cccCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhc---CCccccccccccc-----------chhhhhc Confidence 11368887764 3333334444332211111111222222211 0111110000000 0111222 Q ss_pred CCeEEecCCCceeEEeccc---CCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-----HHHHHHHHHHHHHhh Q lcl|NC_016762. 284 NDVLLPTQGATVTQMVSAV---SDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-----QKYHNARCQARRVQE 355 (456) Q Consensus 284 ~~~~lid~~d~~~~~~~~~---sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-----~~nyyd~I~~~Qe~~ 355 (456) .+.++...+++.+..+.+- .+..+.+.....++|+.+++|...| |.+... +++++ .......++.+| .. T Consensus 285 ~~~i~~~~~~d~k~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~f-g~~~~n-~~Sg~Al~~~~~~l~~k~~~k~-~~ 361 (485) T protein:vir:10 285 LARILAFEDAEGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYL-STAADN-PASAEAIRAAESRLIKKVERKN-SI 361 (485) T ss_pred ccceeccCCCCceEEeecccchHHHHHHHHHHHHHHhcccCCCHHHh-ccccCc-hhHHHHHHHHHHHHHHHHHHHH-HH Confidence 3334444444444334334 4455666667889999999999876 543322 12332 234455555544 46 Q ss_pred hhHHHHHHHHHHHHhcCc-C---CCCceEEEeCCCCCCCHHHHHHHHHHHHHHH------HH-HHHcCCcCcCHHHH--- Q lcl|NC_016762. 356 LTFEINDLFAHLMRIGVV-P---LKAEFTAIWDDLTVPTKAERLANSKTMSEIN------SA-AIGTGEPVFTAEEI--- 421 (456) Q Consensus 356 lrp~L~~l~~~l~~s~~~-~---~~~d~~~~f~pL~~~seke~Aei~~~~A~a~------~~-~~~~g~~~i~~~E~--- 421 (456) +.+.|++++.+++....+ . ...++++.|.|-..+|.+|.|++..+..++. .+ .-..| ++++++ T Consensus 362 f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~~lg---~~~~~~~~~ 438 (485) T protein:vir:10 362 FGGAWEEAMRLAYRMMKGGDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRERARKDMG---YSIAEREEM 438 (485) T ss_pred HHHHHHHHHHHHHHHhCCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCC---CCHhHHHHH Confidence 799999999887654322 1 1236889999999999999988776665432 11 11223 233322 Q ss_pred HHHhc---------ccCCCC-CCCCcccCCCCC--CC---CCcCCCC Q lcl|NC_016762. 422 REEAG---------YDPLQG-GDPLPDTEPEDE--DA---ARTDPTG 453 (456) Q Consensus 422 R~~~~---------~~~~~~-~~~~~~~~~~d~--~~---~~~d~~~ 453 (456) +...+ ++.+.. ....++..++++ ++ ..+..+| T Consensus 439 ~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 439 RRWDEEEAAMGLGLIGTMVDPNPTVPGSPSPAPAPKPAALESGGDAA 485 (485) T ss_pred HHHHHHHHHHHHHHHHHhhccCCCCCCCCCccccccCcCCCCCCCCC Confidence 11100 000100 000011000000 00 1111122 No 117 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.18 E-value=3.6e-10 Score=72.33 Aligned_cols=401 Identities=11% Similarity=0.057 Sum_probs=171.4 Q ss_pred CCchhHHHHh-------------HHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhh Q lcl|NC_016762. 1 MTDKLDLAVN-------------HAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAV 67 (456) Q Consensus 1 ~~~~~~~~~~-------------~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iV 67 (456) ||.-+...-+ +......-+....|.. |..+-+ .. +.. ...++........++++|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~-----G~~~i~---~~--~~~-~~~~~~~~~~~~n~~~~iv 69 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTERTQDLGDNTAYYE-----SERRPD---AV--GVT-VPQQMQKLLAHVGYPRLYI 69 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHh-----ccccch---hc--ccc-cchhHHhhhhhcCcHHHHH Confidence 4332222211 1111011111122211 110000 01 111 1244555556678899999 Q ss_pred ccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecC-CCCc-c------ Q lcl|NC_016762. 68 EKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRD-SQPW-D------ 139 (456) Q Consensus 68 d~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D-~~~~-~------ 139 (456) |..++-+.=+||++- +.++. .+.+.+.+++-++-....++.+....||.|++++..+. +... . T Consensus 70 d~~~~~l~~~g~~~~--~~~~~-------~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~ 140 (484) T protein:vir:77 70 DAIAARQELEGFRLG--GADKA-------DEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPI 140 (484) T ss_pred HHHHhhhccCceecC--Ccchh-------HHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccce Confidence 999998887888752 21111 12466677777888888999999999999988877642 2211 0 Q ss_pred ----cccc------CCcCceeEEEEeccccCChhhhhccccccccCCce---------eEEEeecccCCccc-cceeeeh Q lcl|NC_016762. 140 ----RPAR------GKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPT---------MWEYTEASQAGRPG-LVRDIHP 199 (456) Q Consensus 140 ----~Pl~------~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~---------~y~i~~~~~~g~~~-~~~~IH~ 199 (456) .|.. ...+.+.....+|.. .+.+++. .|++.. .+|... ....=|+ T Consensus 141 i~~~~p~~~~~~~D~~~~~~~~a~~~~~~-------------~~~~~~~~~~~y~~~~~~~~~~--~~~~~~~~~~~~~~ 205 (484) T protein:vir:77 141 IRVEPPTNLYAQIDPRTRQVMRAIRAIED-------------EEGNEVIGATLYLPNNTVIWNR--EDGQWVQVANVAHN 205 (484) T ss_pred EEEeccceeEEEecCCCCceEEEEEEEEe-------------ecCCcEEEEEEEecCeEEEEEe--cCCceEeeccccCC Confidence 0110 011111111111110 0111111 111110 011100 0011133 Q ss_pred hhh---heecC----CcCCCcchHHH-HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHH Q lcl|NC_016762. 200 DRV---FILGD----WTGDAIGFLEP-AYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNE 271 (456) Q Consensus 200 SRl---i~~~~----~~~~G~S~le~-~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~ 271 (456) --. +.|.. ...+|.|.+++ +..-+.+++.+.-.......-.+..++.+.. .+..........+ T Consensus 206 ~g~vPvv~f~N~~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G---~~~~~~~~~~~~~------ 276 (484) T protein:vir:77 206 LEMVPVIPIPNRTRLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFG---VKGEELGVDPETG------ 276 (484) T ss_pred CCCcceEEeccccccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhC---CCcchhccccccc------ Confidence 222 22321 12368888764 4433334444432222111111222222211 1111111000010 Q ss_pred HHHHHHHHHhcCCCeEEecCCCceeEEecc---cCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----H Q lcl|NC_016762. 272 RFNEAARQLNRGNDVLLPTQGATVTQMVSA---VSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----Y 343 (456) Q Consensus 272 ~~~~~~~~~~~~~~~~lid~~d~~~~~~~~---~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----n 343 (456) ...+....+.++...+++.+..+.+ +.+..+.+.....++|+++++|..-| |.+...- ++++ ++ . T Consensus 277 -----~~~~~~~~~~~~~~~~~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~f-g~~~~n~-~Sg~Al~~~~~~ 349 (484) T protein:vir:77 277 -----QTLFDAYLARILAFEDHESKAQQFSAAELRNFVDALDALDRKAAAYTGLPPYYL-SFSSENP-ASAEAIRSSESR 349 (484) T ss_pred -----chhhhhhhhhhcccCCCCceeEeecCCChHHHHHHHHHHHHHHhcccCCCHHHh-ccccCcc-hHHHHHHHHHHH Confidence 0111112233333333343333433 34456666777889999999999876 4333321 2332 22 2 Q ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC-CC---CceEEEeCCCCCCCHHHHHHHHHHHHHHH------HHHH-HcC Q lcl|NC_016762. 344 HNARCQARRVQELTFEINDLFAHLMRIGVVP-LK---AEFTAIWDDLTVPTKAERLANSKTMSEIN------SAAI-GTG 412 (456) Q Consensus 344 yyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~-~~---~d~~~~f~pL~~~seke~Aei~~~~A~a~------~~~~-~~g 412 (456) .-..++.+| ..+++.|++++.+++....+. .+ .++++.|.|...+|.++.|+...|.+++. .+.. ..| T Consensus 350 l~~ka~~k~-~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l~ 428 (484) T protein:vir:77 350 LVKTVERKN-KIFGGAWEQAMRVAYKVMNGGDIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARIDMG 428 (484) T ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHHhCCCCcccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhcCC Confidence 333445444 457899999998876654331 11 35889999999999999998877766542 1111 222 Q ss_pred CcCcCHH---HHHHHh---------cccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 413 EPVFTAE---EIREEA---------GYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 413 ~~~i~~~---E~R~~~---------~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) ++.+ |++..- ...++.+............+++++.|.++++ T Consensus 429 ---~~~~~~~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (484) T protein:vir:77 429 ---YSITEREEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNPDNPETPEPQPNPAEE 481 (484) T ss_pred ---CChhHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCCCCCcccccCCCccc Confidence 2333 332111 0111111111100011111222222222222 No 118 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=99.15 E-value=1.2e-09 Score=69.48 Aligned_cols=403 Identities=13% Similarity=0.080 Sum_probs=174.5 Q ss_pred CCchhHHH-Hh-----HHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHH-Hh-cCchhhhhhccchh Q lcl|NC_016762. 1 MTDKLDLA-VN-----HAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTM-YR-RGGIAHGAVEKIVT 72 (456) Q Consensus 1 ~~~~~~~~-~~-----~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~-Y~-~~~l~r~iVd~~ae 72 (456) |+...-.+ ++ |..+...-+....|.+. .++.. ..+.... .++... .+ .+..+++|||..++ T Consensus 23 ~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G------~~~~~----~~~~~~~-~~~~~~~~~~v~n~~~~ivd~~a~ 91 (501) T protein:vir:25 23 MSREQLGALVADMWRLHISERQWLDRIYEYTKG------LRGRP----EVPEGAS-DEVKELAKLSVKNVLSLVRDSFAQ 91 (501) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc------CCCch----hccccCC-hhhhhhHhhhhcChHHHHHHHHHh Confidence 43332111 11 11111111112222110 01000 0111111 122221 22 34689999999999 Q ss_pred HHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCceeEE Q lcl|NC_016762. 73 TCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKV 152 (456) Q Consensus 73 d~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i 152 (456) =+.-+||++- +..+.. .+...+++-++-....++.+...+||.|++++..++..+.-..+.. ..+ T Consensus 92 ~l~~~gf~~~--d~~~~~--------~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~~~i~~~sp-----~~~ 156 (501) T protein:vir:25 92 NLSVVGYRNA--LAKEND--------PAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTDEGPVFRTRSP-----RQI 156 (501) T ss_pred hhcccceecC--CccchH--------HHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCCeEEEecc-----ccE Confidence 8887887752 222221 2445566666777778888888899999887765432211000000 011 Q ss_pred EEeccccCC------hhhh---hcc---ccccccCCce-eEEEeeccc------CCcc--------ccceee------eh Q lcl|NC_016762. 153 TPAWAGCLK------PKSF---DEK---PDSETYGQPT-MWEYTEASQ------AGRP--------GLVRDI------HP 199 (456) Q Consensus 153 ~~~~~~~~~------~~~~---~~D---p~s~~yg~P~-~y~i~~~~~------~g~~--------~~~~~I------H~ 199 (456) .++|....+ .-.+ ..+ ..-..++.|. .|.+..... .+.. ...-.. |+ T Consensus 157 ~~iy~D~~~~~~~~~ai~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (501) T protein:vir:25 157 LAVYADPSVDAWPQYALETWVAQKDAKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEG 236 (501) T ss_pred EEEEecCCCCcceeEEEEEEeeccccCcceeEEEecCeeEEEEecCceeeeeccccccccccccccccccccccccccCC Confidence 112210000 0000 000 0111122222 122111000 0000 000001 11 Q ss_pred hhh---heecC---CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHH Q lcl|NC_016762. 200 DRV---FILGD---WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERF 273 (456) Q Consensus 200 SRl---i~~~~---~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 273 (456) =-+ ++|.. ...+|.|.++++.+-+.+++++.........-.+..++.+. |.+.+.. ..+ T Consensus 237 ~~~vPiv~f~N~~~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~--------------G~~~~~~-~~~ 301 (501) T protein:vir:25 237 KPVCPVVRFVNGRDADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVIS--------------GWTGSKA-EVL 301 (501) T ss_pred ccceeeEeccCccccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHh--------------CCCCCcc-chh Confidence 111 22322 13568898888776665666654332222211222222221 1111111 011 Q ss_pred HHHHHHHhcCCCeEEecCCCc--eeEEe-cccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-----HHHHH Q lcl|NC_016762. 274 NEAARQLNRGNDVLLPTQGAT--VTQMV-SAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-----QKYHN 345 (456) Q Consensus 274 ~~~~~~~~~~~~~~lid~~d~--~~~~~-~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-----~~nyy 345 (456) ....+.++...+++ +-+++ +++.+..+.+.....+||+.+++|..-|.|.+. |.+++ ....- T Consensus 302 -------~~~~~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~---N~Sg~Al~~~~~~l~ 371 (501) T protein:vir:25 302 -------KASALRVWTFEDPEVKAQAFPPASVEPYNLILEEMLQHVAMVAQISPAQVTGKMI---NVSAEALAAAEANQQ 371 (501) T ss_pred -------hhcccceeccCCCCceEEEecccChHHHHHHHHHHHHHHHhhcCCChhhhccccC---ChHHHHHHHHHHHHH Confidence 11122233333333 43333 345566777888999999999999886665432 22332 33445 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHhcCcCC---CCceEEEeCCCCCCCHHHHHHHHHHHHHH---HHHHHHcCCcCcCHH Q lcl|NC_016762. 346 ARCQARRVQELTFEINDLFAHLMRIGVVPL---KAEFTAIWDDLTVPTKAERLANSKTMSEI---NSAAIGTGEPVFTAE 419 (456) Q Consensus 346 d~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~---~~d~~~~f~pL~~~seke~Aei~~~~A~a---~~~~~~~g~~~i~~~ 419 (456) ..+..+|+ .+++.|++++.+++....+.. ..++++.|.|...+|.++.|++..|.+.+ ....... .+-+++. T Consensus 372 ~ka~~k~~-~f~~~l~~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gis~et~~~~-~~g~~~~ 449 (501) T protein:vir:25 372 RKLAAKRE-SFGESWEQLLRLAAEMDDDPDTAADSGAEVLWRDTEARSFGAVVDGITKLASAGIPIEHLLSM-VPGMTQQ 449 (501) T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHhCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCHHHHHHH-cCCCCHH Confidence 55555554 679999999988765554432 24789999999999999999988877664 1222211 1235655 Q ss_pred HHHHHh------cc----cCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 420 EIREEA------GY----DPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 420 E~R~~~------~~----~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) ++.++. .. +.+....+.+...+..++...+++.++.. T Consensus 450 ~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 496 (501) T protein:vir:25 450 TIQAIKDSLRGGEVKSLVDKLLSNEPAPVPPPPPQAAAQALNEGGVN 496 (501) T ss_pred HHHHHHHHHHHHhHHHHHHHhhccCcCCCCCCCCCCCccccccccCC Confidence 543211 11 11111122111112222222222111111 No 119 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.14 E-value=6.8e-10 Score=70.84 Aligned_cols=432 Identities=11% Similarity=0.013 Sum_probs=192.6 Q ss_pred CCchhHHHHhH-HHHHHHHHHHHHHhhhhhccCcccchhhhhcc--------CcccCCHHHHHHHHhcCchhhhhhccch Q lcl|NC_016762. 1 MTDKLDLAVNH-AMSSAIARARMSLLNQGIGHDAKRPQAWCEYG--------FPQEITFNDLYTMYRRGGIAHGAVEKIV 71 (456) Q Consensus 1 ~~~~~~~~~~~-a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~--------~~~~~~~~~l~~~Y~~~~l~r~iVd~~a 71 (456) |.----+.+.. ....+.. ++....++.+ .+...|.... .....-..-...+|+.|++++++|+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~----~~~~~a~~~~-~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 75 (530) T protein:vir:38 1 MKIPSLVGPDGKTSLREYA----GYHGGGGGFG-GQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQ 75 (530) T ss_pred CccceeecCccccchHHHh----hhhcccCCCC-CcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 32222222111 1111111 1222222111 1111111100 0001112335678999999999999999 Q ss_pred hHHhhCCCEEecCCC----cchhhhhHHHHHHHHHHHHH--------------hhHHHHHHHHHHhhcccCceEEEEEec Q lcl|NC_016762. 72 TTCWKTNPQVIEGDD----QDRSKDETEWERKNKPLIAG--------------GRFWRAVSEADRRRLVGRYSGLLLHIR 133 (456) Q Consensus 72 ed~tR~~~~i~~~~~----~d~~~~~~~~e~~i~~~~~~--------------l~~~~~~~ea~~~~r~~Ggs~i~i~i~ 133 (456) ...+=.|+.+...-+ ....+...+|.++|++++.+ +.+.+...-+++-....|-+++.+... T Consensus 76 ~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~ 155 (530) T protein:vir:38 76 DHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWD 155 (530) T ss_pred HHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEeeec Confidence 999999998754211 00112234555667776653 123333333444444556666554432 Q ss_pred CCCCccccccCCcCceeEEEEeccccC--Ch-hhhhcc-ccccccCCceeEEEeecccCCccc-------cceeeehhhh Q lcl|NC_016762. 134 DSQPWDRPARGKLNGLAKVTPAWAGCL--KP-KSFDEK-PDSETYGQPTMWEYTEASQAGRPG-------LVRDIHPDRV 202 (456) Q Consensus 134 D~~~~~~Pl~~~~~~l~~i~~~~~~~~--~~-~~~~~D-p~s~~yg~P~~y~i~~~~~~g~~~-------~~~~IH~SRl 202 (456) ......-|++ |.-|.|-+.... .+ +.+..+ ..--.+|+|..|+|......|... ....|+..+| T Consensus 156 ~~~g~~~~~~-----lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~v 230 (530) T protein:vir:38 156 SDSTRLFRTQ-----FKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTYIPRELPGGRPSF 230 (530) T ss_pred cCCCCccceE-----EEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEeeccCCCccccccceeeeeeccChhHe Confidence 1111111221 222222111100 00 000001 111358999999997543333211 1244677788 Q ss_pred heecC----CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhh-hcCCHHHHHHH---H- Q lcl|NC_016762. 203 FILGD----WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIAST-YGVTLDALNER---F- 273 (456) Q Consensus 203 i~~~~----~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~-~~~~~~~~~~~---~- 273 (456) +|+-+ ....|+|.+-+++..+.+++.-....-..---+++-...++.. ..-....+. .+.+....... . T Consensus 231 lH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (530) T protein:vir:38 231 IHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESE--LDTQSAMDFILGADNKEQQSKLTGWL 308 (530) T ss_pred EeeccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecc--CCccccccccccCCcccccccccccc Confidence 87643 2346999999999988887665443211111011111111110 000011111 01110000000 0 Q ss_pred HHHHH-------HHhcCCCeEEecCCCceeEEecc--cCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCccc--ch-HHH Q lcl|NC_016762. 274 NEAAR-------QLNRGNDVLLPTQGATVTQMVSA--VSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERA--SS-EDQ 341 (456) Q Consensus 274 ~~~~~-------~~~~~~~~~lid~~d~~~~~~~~--~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Gln--st-~D~ 341 (456) ..... .+..+ .+.-+..+++++.++.+ -++..+....+...||+..|||.-.|.|- .++-| |. ..+ T Consensus 309 ~~~~~~~~~~~~~l~pG-~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D-~s~~nYSS~R~~~ 386 (530) T protein:vir:38 309 GEMAAYYSAAPVRLGGA-RVPHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRN-YSQMSYSTARASA 386 (530) T ss_pred hhhhhcccccceeccCc-eeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcc-cccccHHHHHHHH Confidence 00000 11111 12234457888877655 46899999999999999999999988883 33334 33 467 Q ss_pred HHHHHHHHHHHHhhhhHHHHHHHH----HHHHhcCcCCCCce------------EEEe--CCCCCCCHHHHHHHHHHHHH Q lcl|NC_016762. 342 KYHNARCQARRVQELTFEINDLFA----HLMRIGVVPLKAEF------------TAIW--DDLTVPTKAERLANSKTMSE 403 (456) Q Consensus 342 ~nyyd~I~~~Qe~~lrp~L~~l~~----~l~~s~~~~~~~d~------------~~~f--~pL~~~seke~Aei~~~~A~ 403 (456) ..+...++++|.....+.+..+++ ..+..+.++.|... ..+| +..-..+.. |.++ T Consensus 387 ~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~-------Ke~~ 459 (530) T protein:vir:38 387 NESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGL-------KEVQ 459 (530) T ss_pred HHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChH-------HHHH Confidence 778888888888776655555444 44556666554321 1223 222223332 4455 Q ss_pred HHHHHHHcCCc---------CcCHHHHHHH-------hcccCCC-CCCC----CcccCCCCCCCCCcCCCC Q lcl|NC_016762. 404 INSAAIGTGEP---------VFTAEEIREE-------AGYDPLQ-GGDP----LPDTEPEDEDAARTDPTG 453 (456) Q Consensus 404 a~~~~~~~g~~---------~i~~~E~R~~-------~~~~~~~-~~~~----~~~~~~~d~~~~~~d~~~ 453 (456) |....+.+|.. =.+++|+-+. ...-++. +..+ .....+.+++++.++.++ T Consensus 460 a~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~d~~~~a 530 (530) T protein:vir:38 460 EAVMLIEAGLSTYEKECAKRGDDYQEIFAQQVRESMERRAAGLNPPAWAAAAFEAGVKKSNEEEQDGARAA 530 (530) T ss_pred HHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCcccccCCCCCCCCCCCCCCCCCC Confidence 55566666610 1333333211 1111111 0011 001111111122222222 No 120 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.12 E-value=1.2e-09 Score=69.48 Aligned_cols=422 Identities=10% Similarity=0.015 Sum_probs=191.4 Q ss_pred CC--chhHHHHh--HHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCH------------HHHHHHHhcCchhh Q lcl|NC_016762. 1 MT--DKLDLAVN--HAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITF------------NDLYTMYRRGGIAH 64 (456) Q Consensus 1 ~~--~~~~~~~~--~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~------------~~l~~~Y~~~~l~r 64 (456) |+ +|+=-.+. .+++ +.+.+..+...- +.+++|-. ..+ +...+. .-...+|+.|++++ T Consensus 1 Mn~iDr~i~~~sP~~a~~--R~~ar~~~~~y~-aa~~~r~~--~~~--~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~ 73 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVAR--RLAAREAIQAYE-AARPGRTH--KAK--RQPLGADTSLQKSAVSMREQCRKLDEDHDLVT 73 (548) T ss_pred CchHHhHhhhcchHHHHH--HHHhHHHhcccc-ccCccccc--ccc--CCCCChHHHHHHHHHHHHHHHHHHHhcChHHH Confidence 32 22222211 1111 122222222211 11222211 111 111222 23467899999999 Q ss_pred hhhccchhHHhh-CCCEEecCCCcchhhhhHHHHHHHHHHHHHh----------hHHHHHHHHHHhhcccCceEEEEEec Q lcl|NC_016762. 65 GAVEKIVTTCWK-TNPQVIEGDDQDRSKDETEWERKNKPLIAGG----------RFWRAVSEADRRRLVGRYSGLLLHIR 133 (456) Q Consensus 65 ~iVd~~aed~tR-~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l----------~~~~~~~ea~~~~r~~Ggs~i~i~i~ 133 (456) .+|+......+= .|+.|...--..+.+...+|.+.|++++++. .+-+....+++.-...|-+++.+... T Consensus 74 ~av~~~~~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~ 153 (548) T protein:vir:95 74 GLLDRLEERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMG 153 (548) T ss_pred HHHHHHHHhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeec Confidence 999999988884 4555542211222233345556666666532 23343444555555556665554432 Q ss_pred CCCC----ccccccCCcCceeEEEEeccccCC-h----hhhh-ccccccccCCceeEEEeecccCCc-----cccceeee Q lcl|NC_016762. 134 DSQP----WDRPARGKLNGLAKVTPAWAGCLK-P----KSFD-EKPDSETYGQPTMWEYTEASQAGR-----PGLVRDIH 198 (456) Q Consensus 134 D~~~----~~~Pl~~~~~~l~~i~~~~~~~~~-~----~~~~-~Dp~s~~yg~P~~y~i~~~~~~g~-----~~~~~~IH 198 (456) ...+ ...|+. |..+....|. + +... .-..--.+|+|..|+|...-++.. ......|= T Consensus 154 ~~~~~~~g~~~~~~--------lqliepd~l~~~~~~~~~~i~~GIE~D~~Grp~aY~i~~~hPgd~~~~~~~~~~~rvp 225 (548) T protein:vir:95 154 RVPNYTFATSVPFA--------LELLEPDYLPFSYNNLSKGIVQGIERDTWRRKRAYHLLKDHPGNLQTLGGSLAVKRVE 225 (548) T ss_pred ccccccCCcccceE--------EEEechhhcCCCCCCCCCceeeeeEECCCCceEEEEEeecCCCcccccccccceeeec Confidence 1111 111221 1222221121 0 0000 111123579999999986544421 12245688 Q ss_pred hhhhheecC----CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHH Q lcl|NC_016762. 199 PDRVFILGD----WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFN 274 (456) Q Consensus 199 ~SRli~~~~----~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~ 274 (456) .++|+|+-+ ....|+|.+-+++..+.+++.-....-..---+++-...++ . . ..+........ ..- T Consensus 226 A~~VlHif~~~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~---~-~---~~~~~~~~~~~--~~~- 295 (548) T protein:vir:95 226 AERIIHIAYRKRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIK---K-G---NPDSYTVEPGK--DRK- 295 (548) T ss_pred hhHheecccccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeee---c-C---CCccccCCCCc--ccc- Confidence 888877643 23469999999999888876654432111110111111111 1 1 11110000000 000 Q ss_pred HHHHHHhcCCCeE--EecCCCceeEEecc--cCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccch-HHHHHHHHHHH Q lcl|NC_016762. 275 EAARQLNRGNDVL--LPTQGATVTQMVSA--VSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASS-EDQKYHNARCQ 349 (456) Q Consensus 275 ~~~~~~~~~~~~~--lid~~d~~~~~~~~--~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst-~D~~nyyd~I~ 349 (456) .....+.. ++. .+..+++++.++.+ -++..+....+...||+..|||.-.|.|-..+..+|. ..+..+...+. T Consensus 296 ~~~~~~~p--G~iv~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s~nYSS~R~~l~e~~r~~~ 373 (548) T protein:vir:95 296 NRTIPIAP--GMVFDDLEPGEDVGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYDGTYSAQRQELVEGWLGYD 373 (548) T ss_pred cccccccC--CccccccCCCceeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccchhHHHHHHHHHHHHHHHH Confidence 00001112 322 24457888887754 5699999999999999999999999999854333333 35666777777 Q ss_pred HHHHhhh----hHHHHHHHHHHHHhcCcCCCC------ceEEEeCC-CC-CCCHHHHHHHHHHHHHHHHHHHHcCCc--- Q lcl|NC_016762. 350 ARRVQEL----TFEINDLFAHLMRIGVVPLKA------EFTAIWDD-LT-VPTKAERLANSKTMSEINSAAIGTGEP--- 414 (456) Q Consensus 350 ~~Qe~~l----rp~L~~l~~~l~~s~~~~~~~------d~~~~f~p-L~-~~seke~Aei~~~~A~a~~~~~~~g~~--- 414 (456) .+|.... +|+-+.+++..+..+.++.|. -+..+|-+ =| ..+. .|.++|....+.+|.. T Consensus 374 ~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP-------~Kea~A~~~~i~~Gl~T~~ 446 (548) T protein:vir:95 374 LLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINP-------MHEANAWELLVKAGFADEA 446 (548) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccCh-------HHHHHHHHHHHHcCCCCHH Confidence 7877544 344444444455566655443 24455522 11 1222 2455555666666610 Q ss_pred ------CcCHHHHHHH-------hcccCCC-CCCC---------CcccCCCCC----CCCCcCCCCCCC Q lcl|NC_016762. 415 ------VFTAEEIREE-------AGYDPLQ-GGDP---------LPDTEPEDE----DAARTDPTGEQQ 456 (456) Q Consensus 415 ------~i~~~E~R~~-------~~~~~~~-~~~~---------~~~~~~~d~----~~~~~d~~~~~e 456 (456) =.+++|+.+. ...-++. +.++ .+.+++..+ ..+.+.+.+++| T Consensus 447 ~~~a~~G~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (548) T protein:vir:95 447 EVARARGRDPRELKKSRETEIKANRAAGLVFSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEAREL 515 (548) T ss_pred HHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCcccccccccccCCCCchhhhccccccccccchhHHh Confidence 1333443211 0111110 0010 000000000 000111112222 No 121 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.10 E-value=1e-09 Score=69.86 Aligned_cols=422 Identities=9% Similarity=-0.009 Sum_probs=192.9 Q ss_pred CCch--hHH----HHhHHHHHHHHHHHHHHhhhhhccCcccc-hhhhhccCcccCCH------------HHHHHHHhcCc Q lcl|NC_016762. 1 MTDK--LDL----AVNHAMSSAIARARMSLLNQGIGHDAKRP-QAWCEYGFPQEITF------------NDLYTMYRRGG 61 (456) Q Consensus 1 ~~~~--~~~----~~~~a~~~~~~~~~d~~~n~~~~~gt~~~-~~~~~~~~~~~~~~------------~~l~~~Y~~~~ 61 (456) |..- ..- ++.-+........+....+.- +.+++|. ..|. +.|...+. .-...+|+.|+ T Consensus 1 ~~r~~~~~~~~dr~i~~~~~~~~~~~~~~~~~y~-aa~~~r~~~~w~--~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~ 77 (505) T protein:vir:96 1 MKRAEKKPSLAQRMVNWAWYRYVEPQKNAARAFE-AARRDRLGKAWL--RRASRLSADEEIYADLASLVQRAREQSINNP 77 (505) T ss_pred CCCCccccchhhcccchhhhhhHHHHHHhhhhcc-cccCCCcccccc--CCCCCCChHHHHHHHHHHHHHHHHHHHhcCh Confidence 3211 111 111111111222222222211 1112222 1221 11222222 22466899999 Q ss_pred hhhhhhccchhHHhh-CCCEEecCCCcchhhhhHHHHHHHHHHHHHh------------hHHHHHHHHHHhhcccCceEE Q lcl|NC_016762. 62 IAHGAVEKIVTTCWK-TNPQVIEGDDQDRSKDETEWERKNKPLIAGG------------RFWRAVSEADRRRLVGRYSGL 128 (456) Q Consensus 62 l~r~iVd~~aed~tR-~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l------------~~~~~~~ea~~~~r~~Ggs~i 128 (456) +++.+|+......+= .|+.+.............++.++|++++++. .+-+....+.+--...|-+++ T Consensus 78 ~a~~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~ 157 (505) T protein:vir:96 78 YAKRFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVLV 157 (505) T ss_pred HHHHHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceEE Confidence 999999999999995 6888864433322222334556666666542 122222334444444566655 Q ss_pred EEEecCCCCccccccCCcCceeEEEEeccccCChh--------hhh-ccccccccCCceeEEEeecccCC-------ccc Q lcl|NC_016762. 129 LLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPK--------SFD-EKPDSETYGQPTMWEYTEASQAG-------RPG 192 (456) Q Consensus 129 ~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~--------~~~-~Dp~s~~yg~P~~y~i~~~~~~g-------~~~ 192 (456) .+...++.. .|++ |.++....|... .+. .-..--.+|+|..|+|...-++. ... T Consensus 158 ~~~~~~~~~--~~~~--------lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~hPgd~~~~~~~~~~ 227 (505) T protein:vir:96 158 REHRGYPNK--WGYA--------LQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHLLVNHPGDNSYCYHYAGQ 227 (505) T ss_pred EEeecCCCC--cceE--------EEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEEeecCCCccccccccccc Confidence 554433221 1221 222222222110 000 01111358999999997543321 111 Q ss_pred cceeeehhhhheecC----CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHH Q lcl|NC_016762. 193 LVRDIHPDRVFILGD----WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDA 268 (456) Q Consensus 193 ~~~~IH~SRli~~~~----~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~ 268 (456) ...+|=.++|+|+-. ....|+|.|-+++..+.+++.-....-...--++.-...++. +...+......... T Consensus 228 ~~~rvpa~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~----~~~~~~~~~~~~~~- 302 (505) T protein:vir:96 228 TYERVPADEIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQ----DPEAYDQPPEDDQG- 302 (505) T ss_pred cccccCHhHhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeec----CCccCCCccccccC- Confidence 234577778877643 234699999999998888766554322111111111111111 11111111000000 Q ss_pred HHHHHHHHHHHHhcCCCeEEecCCCceeEEecc--cCCHHHHHHHHHHHHHhhhcCCeEEeeccCC-Ccccch-HHHHHH Q lcl|NC_016762. 269 LNERFNEAARQLNRGNDVLLPTQGATVTQMVSA--VSDPGPTYNVNLQTAAAGVDIPTKILVGMQT-GERASS-EDQKYH 344 (456) Q Consensus 269 ~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~--~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp-~Glnst-~D~~ny 344 (456) .....+..+. +.-+..+++++.++.+ -++..+....+...||+..|||.-.|.|--. .-.+|. ..+..+ T Consensus 303 ------~~~~~l~pG~-i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nYSS~R~~~~e~ 375 (505) T protein:vir:96 303 ------EIVEEVEAGT-YQLLPYGIRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNFSSLRSGELDE 375 (505) T ss_pred ------ccccccCCce-eeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHH Confidence 0111222222 2334557888887765 4689999999999999999999988888422 223333 466777 Q ss_pred HHHHHHHHHhhhhHHH----HHHHHHHHHhcCcCCCCc-----eEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcC Q lcl|NC_016762. 345 NARCQARRVQELTFEI----NDLFAHLMRIGVVPLKAE-----FTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPV 415 (456) Q Consensus 345 yd~I~~~Qe~~lrp~L----~~l~~~l~~s~~~~~~~d-----~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~ 415 (456) ...++.+|.....+.+ +.+++..+..+.++.|.. ....| .+....-.|= .|.++|....+.+| . T Consensus 376 ~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w----~~p~~~~iDP-~Ke~~a~~~~i~~G--~ 448 (505) T protein:vir:96 376 RDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAF----QPRGWDWVDP-AKDSKAHSESIKNR--T 448 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeee----ccCCccccCh-HHHHHHHHHHHHcC--C Confidence 7888888876544444 444444455666554421 12333 2222222221 24555556666666 3 Q ss_pred cC-----------HHHHHHH-------hcccCCCCCCCCcccCCCCCCCCCcCCCCCC Q lcl|NC_016762. 416 FT-----------AEEIREE-------AGYDPLQGGDPLPDTEPEDEDAARTDPTGEQ 455 (456) Q Consensus 416 i~-----------~~E~R~~-------~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~ 455 (456) .| ++|+.+. ...-++.... +.........++++++.+++ T Consensus 449 ~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~-~~~~~~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 449 RSRSSIIRAAGDDPEDVFDEIAWEEQLMRDKGVNPTP-PEQESKDATTDEEDDSASDD 505 (505) T ss_pred CCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCC-CCCCCCCCCCCCCCCCCCCC Confidence 33 3333211 1112221110 00000111111222222222 No 122 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=99.06 E-value=1e-09 Score=69.83 Aligned_cols=417 Identities=10% Similarity=-0.056 Sum_probs=186.7 Q ss_pred CCchhHHHHh------HHHHH-------HHHHHHHHHhhhhhccCc--ccch-hhhhccCcccCCHHHHHHHHhcCchhh Q lcl|NC_016762. 1 MTDKLDLAVN------HAMSS-------AIARARMSLLNQGIGHDA--KRPQ-AWCEYGFPQEITFNDLYTMYRRGGIAH 64 (456) Q Consensus 1 ~~~~~~~~~~------~a~~~-------~~~~~~d~~~n~~~~~gt--~~~~-~~~~~~~~~~~~~~~l~~~Y~~~~l~r 64 (456) |+.-..+..+ ..++. .+.+..+-|. |-.. .+.+ ....+ .+. .--...+++ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~----g~~~i~~~~~~~~~~~-~~~---------~ki~~n~~k 96 (512) T protein:vir:97 31 YDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYE----GKTKNLVELTRRKEEY-MAD---------NRVAHDYAS 96 (512) T ss_pred cCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhc----ccCccccccCcccccc-cCc---------ceeecchHH Confidence 3322211111 11111 1111122221 1100 0000 00000 000 012468899 Q ss_pred hhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcc---c Q lcl|NC_016762. 65 GAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWD---R 140 (456) Q Consensus 65 ~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~---~ 140 (456) .||+..+.=++-+.+++...++.. .+.|...+++.++...+.++.+....||.|++++..+ ||+.-. . T Consensus 97 ~Ivd~~~~yl~g~p~~~~~~d~~~--------~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~~i~~~~ 168 (512) T protein:vir:97 97 YISDFINGYFLGNPIQCQDDDKDV--------LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSD 168 (512) T ss_pred HHHHHHhhhhcccCceeccCChHH--------HHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEc Confidence 999999998999998886543321 1347777777788999999999999999999988874 333211 1 Q ss_pred ccc------CC-cCceeEEEEeccccCChhhhhcc-ccccccCCce-eEEEeecccCCccccc----eeeehhhhheecC Q lcl|NC_016762. 141 PAR------GK-LNGLAKVTPAWAGCLKPKSFDEK-PDSETYGQPT-MWEYTEASQAGRPGLV----RDIHPDRVFILGD 207 (456) Q Consensus 141 Pl~------~~-~~~l~~i~~~~~~~~~~~~~~~D-p~s~~yg~P~-~y~i~~~~~~g~~~~~----~~IH~SRli~~~~ 207 (456) |.. .. .+.+....=+|.....-. ...+ .....++.+. .|++.....++..... ..-|+--.+.+.+ T Consensus 169 p~~~~~iyd~~~~~~~~~~vr~~~~~~~~~-~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 247 (512) T protein:vir:97 169 AMSTFVIYDNTIERNSIAGVRYLRTKPIDK-TDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITE 247 (512) T ss_pred ccceEEEEcCCCCCceEEEEEEEEeeeccc-cccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcccceEe Confidence 221 00 011111111111100000 0000 0011111111 1222211111000001 1123333333333 Q ss_pred C--cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHh--cC Q lcl|NC_016762. 208 W--TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLN--RG 283 (456) Q Consensus 208 ~--~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~--~~ 283 (456) + ...|.|.++.+.+-+.+++.+.-..+..+-..+...+.+.... +.............+..+. .. T Consensus 248 ~~nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~-----------~~~~~~~~~~~~~~~~~~~~~~~ 316 (512) T protein:vir:97 248 FSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNL-----------NLDPVEVRKQKEANVLFLEPTVY 316 (512) T ss_pred ecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCc-----------cCCchhhhhhhhcccccccccch Confidence 2 3468899998888777777766555544322222222221110 0111111111100000000 00 Q ss_pred CCe-EEe--cCCCc--eeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----HHHHHHHHHHH Q lcl|NC_016762. 284 NDV-LLP--TQGAT--VTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----YHNARCQARRV 353 (456) Q Consensus 284 ~~~-~li--d~~d~--~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----nyyd~I~~~Qe 353 (456) .+. ..+ +.+.+ |-..+.+.++....++.+.+.|...+++|-.-. +...| |.+++ ++ .-...+ ..++ T Consensus 317 ~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~-~~~~g--n~Sg~Al~~~~~~l~~ka-~~k~ 392 (512) T protein:vir:97 317 ENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKD-DNFSG--TQSGEAMKYKLFGLEQRT-KTKE 392 (512) T ss_pred hhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCc-ccccc--cchHHHHHHHHHHHHHHH-HHHH Confidence 011 111 12233 445567788999999999999999999998532 22222 22332 22 223333 4555 Q ss_pred hhhhHHHHHHHHHHHHhc--Cc--C---CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHHcCCcCc-CH-HHHH Q lcl|NC_016762. 354 QELTFEINDLFAHLMRIG--VV--P---LKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAIGTGEPVF-TA-EEIR 422 (456) Q Consensus 354 ~~lrp~L~~l~~~l~~s~--~~--~---~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~~g~~~i-~~-~E~R 422 (456) ..++..|++++.+++... .+ . ...++++.|+|-...+.++.|++..+.+-+.. +++..- +-+ ++ .|+. T Consensus 393 ~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl~giiS~et~~~~l-~~v~d~~~E~e 471 (512) T protein:vir:97 393 GLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLF-SFFQDPELEVK 471 (512) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHhccCchHHHHHhC-CCCCCHHHHHH Confidence 678999999998875431 11 1 12368999999999999999998777654332 222111 122 23 2332 Q ss_pred HHhcc-----c-CCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 423 EEAGY-----D-PLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 423 ~~~~~-----~-~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) ..... . ........++..+.+++.++.++.+++| T Consensus 472 ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (512) T protein:vir:97 472 KIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 511 (512) T ss_pred HHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCcccccccc Confidence 11100 0 0000011122222333344444555555 No 123 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=99.06 E-value=8.6e-10 Score=70.30 Aligned_cols=384 Identities=11% Similarity=0.002 Sum_probs=180.9 Q ss_pred CCchhHHHHhHHHHH--HHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhc-CchhhhhhccchhHHhhC Q lcl|NC_016762. 1 MTDKLDLAVNHAMSS--AIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRR-GGIAHGAVEKIVTTCWKT 77 (456) Q Consensus 1 ~~~~~~~~~~~a~~~--~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~-~~l~r~iVd~~aed~tR~ 77 (456) |..+.-.-.-.-+.. ..-+.++.|... ..+.+ ..+.. -..++..+|+. .+.+++|||..++=+.=+ T Consensus 1 m~~~~i~~L~~~~~~~~~r~~~~~~yy~g-----~~~~~-----~~~~~-~p~~~~~~~~~v~nw~~~~Vd~~a~rl~~~ 69 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFKTGVDKRYRYYAM-----DDRDD-----TRSIV-MPNNVREMYRSVLEWTAKGVDSLADRIIFR 69 (422) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHhc-----CCChh-----hcCcc-ccHHHHHHHHhhcchhHHHHHHHHhccccc Confidence 776543321111111 111123333221 10100 01122 23455556653 356799999999977667 Q ss_pred CCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec--CCCCcc---cccc------CCc Q lcl|NC_016762. 78 NPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR--DSQPWD---RPAR------GKL 146 (456) Q Consensus 78 ~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~--D~~~~~---~Pl~------~~~ 146 (456) ||+. .+ . .+.+.+.+.++-....++.+-+..||.|++++.-+ ||.+.- .|.. ... T Consensus 70 Gf~~--~d----~--------~l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p~i~~~sp~~~~~i~D~~~ 135 (422) T protein:vir:97 70 EFTN--DD----F--------NAWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAEDGLPKMQVIEASKATGILDPTT 135 (422) T ss_pred eeeC--Cc----h--------hHHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCCCCeeEEEEechhhEEEEEeCCC Confidence 8763 11 1 13344555666667777888889999999988653 233211 1111 011 Q ss_pred CceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhh---heecCC----cCCCcchH-HH Q lcl|NC_016762. 147 NGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRV---FILGDW----TGDAIGFL-EP 218 (456) Q Consensus 147 ~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRl---i~~~~~----~~~G~S~l-e~ 218 (456) +.+.....+|...- +..+..-.|+.+..++... .+|... ..-|+-.. +.|... ..+|.|.+ ++ T Consensus 136 ~~~~~a~~~~~~~~-----~~~~~~~~~~~~~~~~~~~--~~~~~~--~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~ 206 (422) T protein:vir:97 136 FLLTEGYAILESDS-----NGNPTLEAYFTDKDIWYYP--KKGKPY--NIKNPTGHPLLVPIIHRPDAVRPFGRSRITKA 206 (422) T ss_pred CcceeeEEEEEecC-----CCcEEEEEEEcCceEEEEc--CCCccc--cccCCCCCcceEEecccCCCccccCccccchh Confidence 11111111111000 0011112233332222211 111111 11232222 233221 24688876 66 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCC----- Q lcl|NC_016762. 219 AYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGA----- 293 (456) Q Consensus 219 ~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d----- 293 (456) +..-..++.++.-...-...-.+..+..+. .+ +..+.+.+ .+...+. .+..+.+++ T Consensus 207 v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-----G~----d~d~~~~~----~~~~~~~------~i~~~~~de~~~~~ 267 (422) T protein:vir:97 207 GMYHQKAAKRTLERAEVTAEFYSFPQKYVL-----GM----DPDAKPME----KWRATVS------TLLEISKDEDGDKP 267 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhc-----cc----CcccccCc----hhhhhhh------hhhccCCCCCCCcc Confidence 666555555553321111111122222221 11 11111111 1111111 112233222 Q ss_pred ceeEE-ecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-----HHHHHHHHHHHHHhhhhHHHHHHHHHH Q lcl|NC_016762. 294 TVTQM-VSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-----QKYHNARCQARRVQELTFEINDLFAHL 367 (456) Q Consensus 294 ~~~~~-~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-----~~nyyd~I~~~Qe~~lrp~L~~l~~~l 367 (456) ++.++ ++++.+.-+.+.....++|++++||..-|-|.+ .. +++++ ....-..++.+|+ .+.+.+++++.+. T Consensus 268 ~v~q~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~-~N-psSa~Ai~a~~~~L~~ka~~k~~-~fg~~l~~~~rla 344 (422) T protein:vir:97 268 TVGQFTTASMAPFMEHLKMYASLFAGGSGLTLDDLGFPS-DN-PSSVESIKAAHENLRAAGRKAQR-SFSSGFLNVAYIA 344 (422) T ss_pred eeeecCCCChhHHHHHHHHHHHHHhcccCCCHHHhcccc-Cc-hhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH Confidence 23333 345667778888889999999999976554444 32 13322 3455666776665 5799999999876 Q ss_pred HHhcCc--CCCC---ceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCC Q lcl|NC_016762. 368 MRIGVV--PLKA---EFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPE 442 (456) Q Consensus 368 ~~s~~~--~~~~---d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~ 442 (456) +....+ ..++ ++.+.|.|...++..+.|+ .|++..+++++|.+..+.+-+++.++++......+--+...+ T Consensus 345 ~~~~~~~~~~~~~~~~~~~~w~p~~~~~~~s~a~----~aDa~~Kl~~a~~~~~~~~~~~~~lg~~~~~~~~~~~~~~~~ 420 (422) T protein:vir:97 345 VCLRDEFPYLRNQFMDTVIKWEPLFEADANMLTL----VGDGAIKLNQAIPGFMDADVIRDLTGVKGADKPIPAITEVTT 420 (422) T ss_pred HHHhcCCcccchhhccceEEEccCCCCChHHHHH----HHHHHHHHHhhccccccHHHHHHHcCCCchhHHHHHHHhhhc Confidence 543322 2222 4689999888777655554 457777888887777888888888877543221111111112 Q ss_pred CC Q lcl|NC_016762. 443 DE 444 (456) Q Consensus 443 d~ 444 (456) |. T Consensus 421 d~ 422 (422) T protein:vir:97 421 DG 422 (422) T ss_pred cC Confidence 22 No 124 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=99.05 E-value=2e-09 Score=68.29 Aligned_cols=419 Identities=13% Similarity=0.052 Sum_probs=177.2 Q ss_pred CCchhHHHHhHHHHH-----HHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHh Q lcl|NC_016762. 1 MTDKLDLAVNHAMSS-----AIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCW 75 (456) Q Consensus 1 ~~~~~~~~~~~a~~~-----~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~t 75 (456) |++--...+...++. ..-+.++.|.. |..+.+ . .+... ..++..+-..-+++++|||..++-+. T Consensus 12 l~~~~~~~~~~L~~~~~~~~~~~~~~~~Yy~-----G~~~~~---~--~~~~~-p~~~r~~~~v~nw~~~~Vd~~a~rl~ 80 (474) T protein:vir:81 12 LSNDENALINGLLAQIENLRWKNLLRTSYYE-----NKRTIQ---Y--VGTLI-PPQYFNLGLVLGWTGKAVDALARRCN 80 (474) T ss_pred CChhHHHHHHHHHHHHHHHhhHHHHHHHHhc-----cCCChh---h--ccccc-cHHHHHHHhhcChHHHHHHHHHhhhc Confidence 555544333322221 11112223321 111100 0 11111 23343333456778999999999888 Q ss_pred hCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecC-CCC--cc---cccc------ Q lcl|NC_016762. 76 KTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRD-SQP--WD---RPAR------ 143 (456) Q Consensus 76 R~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D-~~~--~~---~Pl~------ 143 (456) =+||++ .+++.++. .+.+.+.+-++-....++.+-+..||.|++++.-++ +++ .- .|.. T Consensus 81 ~~Gf~~-~d~~~~~~--------~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~~~~~~D 151 (474) T protein:vir:81 81 LEGFVW-PDGDLDSL--------GGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASEATGEWN 151 (474) T ss_pred ccceEC-CCCCccch--------HHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceEEEEEe Confidence 899885 22222221 244555666666677777778889999988876543 321 11 1211 Q ss_pred CCcCceeEEEEeccccCChhhhhccccccccCCcee-EEEeecccCCccccceeeehhh--hheecCC----cCCCcchH Q lcl|NC_016762. 144 GKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTM-WEYTEASQAGRPGLVRDIHPDR--VFILGDW----TGDAIGFL 216 (456) Q Consensus 144 ~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~-y~i~~~~~~g~~~~~~~IH~SR--li~~~~~----~~~G~S~l 216 (456) ...+.+..-..+|...- +..+..-.++.|.. |++.....++.......-|+=- ++.|... ...|.|.+ T Consensus 152 ~~~~~~~~al~~~~~~~-----~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~~~gvPvV~~~n~~~~~~~~G~s~i 226 (474) T protein:vir:81 152 RRRRGLNNLLSIIDKDK-----EGKVLSLALYLDNETVTAQRDKATLKWQVDRDEHVYGVPAQVLPYKPAPKRPFGQSRI 226 (474) T ss_pred CCCCcceeeeEEEEEcC-----CCcEEEEEEEeCCcEEEEEEcCccceeeeccCCCCCCcceEEecccccccCcCCcccc Confidence 01111111111111000 00111122222222 2222111111000011123221 2334322 12477755 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEec-CCCc Q lcl|NC_016762. 217 -EPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPT-QGAT 294 (456) Q Consensus 217 -e~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid-~~d~ 294 (456) +++..-..++.++.-...-...-.+..+..+.. .+.....+..+....... .....+-.+..+.+..... .+-+ T Consensus 227 ~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G---~~~~~~~d~d~~~~~~~~-~~~~~i~~~~~d~d~~~~~~~~~~ 302 (474) T protein:vir:81 227 TKPMMGLQDAGVRELARREGHMDVFSYPEFWLLG---ADESALKNADGTIKSVWE-ARLGRIKGLPDDADADIPQLARAD 302 (474) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHhcchhheeec---CChhhcccccccccchhh-hhHHHHhcCCCccccccccccccc Confidence 566554444444432211111111222222211 111111111111111111 1111121122222211111 1123 Q ss_pred eeEE-ecccCCHHHHHHHHHHHHHhhhcCCeEEeecc-CCCcccchH----HHHHHHHHHHHHHHhhhhHHHHHHHHHHH Q lcl|NC_016762. 295 VTQM-VSAVSDPGPTYNVNLQTAAAGVDIPTKILVGM-QTGERASSE----DQKYHNARCQARRVQELTFEINDLFAHLM 368 (456) Q Consensus 295 ~~~~-~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~-sp~Glnst~----D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~ 368 (456) +.++ ++++.++-+.+.....++|+.++||.. -||. +..+-+|.+ -....-..++.+|+ .+.+.+++++.+.+ T Consensus 303 ~~q~~~a~l~~~~~~l~~~~~~~a~~t~iP~~-~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~-~fg~~l~~~~rla~ 380 (474) T protein:vir:81 303 VKQFPAASPDAHWSDINGLAKLFAREASLPDT-AVAISGLSNPTSAESYDASQYELIAEAEGAVD-DFTPALRKAFIRAL 380 (474) T ss_pred ccccCCCChhHHHHHHHHHHHHHHhhhCCCHH-HhcccccccccHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH Confidence 4444 456778888899999999999999987 4453 223322221 23455666776666 57999999998765 Q ss_pred HhcCc----CCC---CceEEEeCCCCCCCHHHHHHHHHHHHHHHHHH----HHcCCcCcCHHHHHHHhcc-cCCCCCCCC Q lcl|NC_016762. 369 RIGVV----PLK---AEFTAIWDDLTVPTKAERLANSKTMSEINSAA----IGTGEPVFTAEEIREEAGY-DPLQGGDPL 436 (456) Q Consensus 369 ~s~~~----~~~---~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~----~~~g~~~i~~~E~R~~~~~-~~~~~~~~~ 436 (456) ....+ ..+ -.+++.|.|...+|..++|+...|.+++.... +.....=++++|+...... ..-...... T Consensus 381 ~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t~~~i~~~~~~~~~~~~~~~~ 460 (474) T protein:vir:81 381 AMKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAAVPWLAETEVGLELIGLTPQQARRAMADKRRVQGRGTL 460 (474) T ss_pred HHhCCCCccccchhhccceeEecCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcCCCHHHHHHHHHHHHHHhHHHHH Confidence 54332 111 25688999999999999988777766643111 0000112455555321110 000000000 Q ss_pred cccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 437 PDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 437 ~~~~~~d~~~~~~d~~~~~e 456 (456) +.-.....+++.-| T Consensus 461 ------~~l~~~~~~~~~aq 474 (474) T protein:vir:81 461 ------QALIDRSNNGATAQ 474 (474) T ss_pred ------HHHHhcCCCCCCCC Confidence 00001111222222 No 125 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=99.05 E-value=3.7e-10 Score=72.30 Aligned_cols=338 Identities=11% Similarity=0.002 Sum_probs=157.1 Q ss_pred HHHhhhhhccCcccchh-hhh------ccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhH Q lcl|NC_016762. 22 MSLLNQGIGHDAKRPQA-WCE------YGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDET 94 (456) Q Consensus 22 d~~~n~~~~~gt~~~~~-~~~------~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~ 94 (456) |++.|.+-...+..+.. +.. ..+...++... + .++.-+..+|++++++.-.- .++. .... T Consensus 1 M~~~~~f~~r~~~~~~~~~~~~~~~~~~~~~~~v~~~~---a-l~~~av~~cv~~ia~~ia~~--p~~~------~~~~- 67 (359) T protein:vir:10 1 MSILNPFERRSSITPNNYYPFMVQNGSIVPNSLVDATE---A-LKNSDLYAVTSLISSDIAGT--RFIG------NQVF- 67 (359) T ss_pred CcccchhhccccCCCCcchhhhhccccccCCcccCHHH---h-hcchHHHHHHHHHHHhhhcC--cccc------chHH- Confidence 55555333222222211 111 11122333322 1 23555678999999877432 2211 1111 Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccccc Q lcl|NC_016762. 95 EWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSET 173 (456) Q Consensus 95 ~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~ 173 (456) ..+...=..+.-...|.+.+.+ -.++|-+.+++. +|+.. -+..+.|+-...+++. ...+ T Consensus 68 ---~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~-r~~~g----------~~~~l~~l~~~~v~i~-~~~~----- 127 (359) T protein:vir:10 68 ---TSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAIL-KGDNS----------LMKELRLIPSNAITID-LTDD----- 127 (359) T ss_pred ---HHHhhcccccCCHHHHHHHHHHhccccCceEEEEE-ECCCC----------eEEEEEEeCCceEEEE-EcCC----- Confidence 1111111112233344444444 455677766553 22211 1222333322222221 1111 Q ss_pred cCCceeEEEeecccCCccccceeeehhhhheecCC--------cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_016762. 174 YGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW--------TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQL 245 (456) Q Consensus 174 yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~--------~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l 245 (456) .-.|++... + .+....++++.|+||... ...|.|.++.+...+..... +......++++..+.- T Consensus 128 ---~~~y~~~~~--~--~~~~~~~~~~evih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~-~~~~~~~~f~ng~~~~ 199 (359) T protein:vir:10 128 ---TLTYEVNQF--D--DYPSAKYNASEMIHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKE-ANRLSLSTLKGALNPT 199 (359) T ss_pred ---eEEEEEEec--C--CceEEEEcccceEEeccCCCCCCccCccccccHHHHHHHHHHHHHH-HHHHHHHHHhccCCcc Confidence 123555421 1 123567888899888432 23488989888876654443 3344445566644322 Q ss_pred hhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC---CeEEecCCCceeEEecccCCHH--HHHHHHHHHHHhhh Q lcl|NC_016762. 246 LLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGN---DVLLPTQGATVTQMVSAVSDPG--PTYNVNLQTAAAGV 320 (456) Q Consensus 246 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~lid~~d~~~~~~~~~sgl~--~~~~~~~~~~aaas 320 (456) .+.. +. .+.-.++..+++.+.++.+.++. ..++++.+.+|+.++.+..... +......+.||.+. T Consensus 200 gil~---~~-------~~~l~~e~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~f 269 (359) T protein:vir:10 200 SVVK---VP-------QGTLSSEAKDSIRKEFEKANGGNNSGRVMVLDQSADFSTVSINADVANYLNSMNWGRTQIAKAF 269 (359) T ss_pred eEEE---eC-------CCCCCHHHHHHHHHHHHHHhCccccCCceecCCCcceeeecCCHHHHHHHHHHHHHHHHHHHHh Confidence 1110 10 00111233444555555554432 2567777788998886654433 45556677899999 Q ss_pred cCCeEEeeccCCCcccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHH Q lcl|NC_016762. 321 DIPTKILVGMQTGERASSEDQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKT 400 (456) Q Consensus 321 ~IP~t~L~G~sp~Glnst~D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~ 400 (456) +||-..| |-. +.-+++ |+.+++.-...++|.|..+.+.|-.. +.. .+.+....+...+.... T Consensus 270 gVPp~~l-g~~-~~~~~~------~~~~e~~~~~~l~~~l~p~~~~l~~~-l~~---~~~~~~~~~~~~d~~~~------ 331 (359) T protein:vir:10 270 GVSDSYL-NGT-GDQQSS------LDQIKDLYVNALNRFIEPLISELRIK-CDS---SIGVDMSPITDYSNSVF------ 331 (359) T ss_pred CCCHHHh-CCC-Cccccc------HHHHHHHHHHHHHHHHHHHHHHHHHH-hhh---hhcccchhhhhcCHHHH------ Confidence 9998755 421 111122 22222222334444444444433211 110 11222223333333222 Q ss_pred HHHHHHHHHHcCCcCcCHHHHHHHhcccCCC Q lcl|NC_016762. 401 MSEINSAAIGTGEPVFTAEEIREEAGYDPLQ 431 (456) Q Consensus 401 ~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~ 431 (456) .+ ....++..| +++++|+|+..+++|.. T Consensus 332 ~~-~~~~~~~~G--~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 332 KA-DILNWVKEG--IIEPTEAKTLLESKGII 359 (359) T ss_pred HH-HHHHHHhCC--CcCHHHHHHHhCCCCCC Confidence 11 233467777 99999999999999977 No 126 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=99.04 E-value=3e-09 Score=67.28 Aligned_cols=420 Identities=9% Similarity=-0.056 Sum_probs=184.7 Q ss_pred CCc--h--------hHHHHh-HHHH--HHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhh Q lcl|NC_016762. 1 MTD--K--------LDLAVN-HAMS--SAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAV 67 (456) Q Consensus 1 ~~~--~--------~~~~~~-~a~~--~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iV 67 (456) |+. + +.-.++ |-.. ..+.+..+-|.+ --.+-..+....... .+. .--.+.+++.|| T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g-~~~i~~~~~~~~~~~-~~~---------~ki~~n~~k~Iv 99 (511) T protein:vir:99 31 YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEG-KTKNLVELTRRKEEY-MAD---------NRVAHDYASYIS 99 (511) T ss_pred cchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcc-cCccccccCcccccc-cCc---------ceeecchHHHHH Confidence 221 1 111222 1110 011111221111 000000000000000 010 012468899999 Q ss_pred ccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcc---cccc Q lcl|NC_016762. 68 EKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWD---RPAR 143 (456) Q Consensus 68 d~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~---~Pl~ 143 (456) +..+.=++-+.+++..+++.. .+.|.+.+++.++...+.++.+...+||.+++++..+ |++... .|.. T Consensus 100 ~~~~~yl~g~p~~~~~~d~~~--------~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~ 171 (511) T protein:vir:99 100 DFINGYFLGNPIQYQDDDKDV--------LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMS 171 (511) T ss_pred HHHHhhhcccCceeecCchHH--------HHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccce Confidence 999988888888886543221 2347777777789999999999999999999988874 343221 1221 Q ss_pred ------CC-cCceeEEEEeccccCChhhhhcc-ccccccCCce-eEEEeecccCCc----cccceeeehhhhheecCC-- Q lcl|NC_016762. 144 ------GK-LNGLAKVTPAWAGCLKPKSFDEK-PDSETYGQPT-MWEYTEASQAGR----PGLVRDIHPDRVFILGDW-- 208 (456) Q Consensus 144 ------~~-~~~l~~i~~~~~~~~~~~~~~~D-p~s~~yg~P~-~y~i~~~~~~g~----~~~~~~IH~SRli~~~~~-- 208 (456) .. .+.+....-+|.....- ....+ ..-...+.|. .|+......++. .......|+=-.+.+..+ T Consensus 172 ~~~vyd~~~~~~~~~~vr~~~~~~~~-~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n 250 (511) T protein:vir:99 172 TFVIYDNTIERNSIAGVRYLRTKPID-KTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSN 250 (511) T ss_pred eEEEEcCCCCCceEEEEEEEEeeecc-cCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCccceEEecC Confidence 00 01111111111110000 00000 0001111121 112111100000 001122333323333332 Q ss_pred cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHH---HHH-hcCC Q lcl|NC_016762. 209 TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAA---RQL-NRGN 284 (456) Q Consensus 209 ~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~---~~~-~~~~ 284 (456) +.+|.|.++.+..-+.+++.+.-..+..+...+...+.+......+ ..+........+ ..+ .... T Consensus 251 n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~ 319 (511) T protein:vir:99 251 NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD-----------PVEVRKQKEANVLFLEPTVYADS 319 (511) T ss_pred CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccC-----------chhhcccccccceeccccccccc Confidence 3568999998888777788776666655543333333222111111 011110000000 000 0011 Q ss_pred CeEEecCCCcee--EEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----HHHHHHHHHHHhhhh Q lcl|NC_016762. 285 DVLLPTQGATVT--QMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----YHNARCQARRVQELT 357 (456) Q Consensus 285 ~~~lid~~d~~~--~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----nyyd~I~~~Qe~~lr 357 (456) +..-.+.+.+++ ..+.+.+++...++...+.|...+++|-.-+-+.+ | |.++. ++ .-...+. .++..++ T Consensus 320 ~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-g--n~Sg~Alk~~~~~l~~ka~-~k~~~~~ 395 (511) T protein:vir:99 320 EGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-G--TQSGEAMKYKLFGLEQRTK-TKEGLFT 395 (511) T ss_pred ccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-c--cchHHHHHHHHHHHHHHHH-HHHHHHH Confidence 111122233444 44556679999999999999999999986332222 2 22332 22 2233444 4556789 Q ss_pred HHHHHHHHHHHHhc--Cc--C---CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHHcCCcCc-CH-HHHHHHhc Q lcl|NC_016762. 358 FEINDLFAHLMRIG--VV--P---LKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAIGTGEPVF-TA-EEIREEAG 426 (456) Q Consensus 358 p~L~~l~~~l~~s~--~~--~---~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~~g~~~i-~~-~E~R~~~~ 426 (456) ..|++++++++... .+ . ...+++|.|+|-...+.+|.|++..+.+-+.. +++..- +-+ ++ .|+..... T Consensus 396 ~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl~GiiS~et~l~~l-~~v~D~~~E~~ri~~ 474 (511) T protein:vir:99 396 KGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLF-SFFQDPELEVKKIEE 474 (511) T ss_pred HHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhC-CCCCCHHHHHHHHHH Confidence 99999988875431 11 1 12368999999999999999998777654322 222211 122 23 34432111 Q ss_pred c------cCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 427 Y------DPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 427 ~------~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) . .........++..+.+++.+++..+.++| T Consensus 475 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 510 (511) T protein:vir:99 475 DEKESIKKAQKNMYQDPRNINDDEQDDSTKDSIDKK 510 (511) T ss_pred HHHHHHHHHhhcccccCCCCCCCCCCCCCcCccccc Confidence 0 00000011111111122222222222222 No 127 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.04 E-value=1.3e-09 Score=69.27 Aligned_cols=439 Identities=10% Similarity=-0.029 Sum_probs=197.0 Q ss_pred CCchhHHHHhH-HHHHHHHHHHHHHhhhhhccCcccc-hhhhhccCcccCC------------HHHHHHHHhcCchhhhh Q lcl|NC_016762. 1 MTDKLDLAVNH-AMSSAIARARMSLLNQGIGHDAKRP-QAWCEYGFPQEIT------------FNDLYTMYRRGGIAHGA 66 (456) Q Consensus 1 ~~~~~~~~~~~-a~~~~~~~~~d~~~n~~~~~gt~~~-~~~~~~~~~~~~~------------~~~l~~~Y~~~~l~r~i 66 (456) |......++-. +...+..+.+-... .--|.++. +....+ .|...+ ..-...+|+.|++++.+ T Consensus 1 m~~~~~r~~~~~a~~~~~~~~~~~~~---~y~gA~~~~r~~~~w-~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~a 76 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPEQSASLGGG---GLEGASRLSRETVSW-NPSLRSPDALINPLKRIADARGRDMADNDGFTNGA 76 (553) T ss_pred Ccchhhhhhcccccccchhhhhhhcc---cccccccCCCccccc-ccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHH Confidence 66555554432 22222212111111 10111111 111111 121222 22356789999999999 Q ss_pred hccchhHHhhCCCEEecCCC-----cchhhhhHHHHHHHHHHHHHh--------------hHHHHHHHHHHhhcccCceE Q lcl|NC_016762. 67 VEKIVTTCWKTNPQVIEGDD-----QDRSKDETEWERKNKPLIAGG--------------RFWRAVSEADRRRLVGRYSG 127 (456) Q Consensus 67 Vd~~aed~tR~~~~i~~~~~-----~d~~~~~~~~e~~i~~~~~~l--------------~~~~~~~ea~~~~r~~Ggs~ 127 (456) |+......+=.|+++...-+ ....+...+|.++|++++++. .+......+++.-...|-++ T Consensus 77 v~~~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~ 156 (553) T protein:vir:63 77 VGYQRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVL 156 (553) T ss_pred HHHHHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceE Confidence 99999999999998753211 112334455666777766542 12222233444444556665 Q ss_pred EEEEecCCCCccccccCCcCceeEEEEeccccCCh--hhhh-----ccccccccCCceeEEEeecccCCcc--------- Q lcl|NC_016762. 128 LLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKP--KSFD-----EKPDSETYGQPTMWEYTEASQAGRP--------- 191 (456) Q Consensus 128 i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~--~~~~-----~Dp~s~~yg~P~~y~i~~~~~~g~~--------- 191 (456) +.+.........-|++ |.++....|.. ...+ .-..=-.+|+|..|+|...-++... T Consensus 157 ~~~~~~~~~~~~~~~~--------lq~ie~drl~~~~~~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~~~~~~ 228 (553) T protein:vir:63 157 ATAEWDRAANRPYATC--------FQMVSTDRLSNPYQQLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMAPDMYKW 228 (553) T ss_pred EEeeeccCCCCcccce--------EEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEeeccCCCccccccccccce Confidence 5544321111111221 22222222211 0000 0111125799999999754333210 Q ss_pred ---ccceeeehhhhheecCC----cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcC Q lcl|NC_016762. 192 ---GLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGV 264 (456) Q Consensus 192 ---~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~ 264 (456) .....|+.+.|||+-.. ...|+|.|-+++..+.+++.-....-..---+++-...++ ............+. T Consensus 229 ~r~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~--~~~~~~~~~~~~~~ 306 (553) T protein:vir:63 229 KFVQQSKPWGRRQVIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIE--SELPPEFIHSQMSG 306 (553) T ss_pred eeeccccccChhHheecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeee--cCCChhhhhhhccc Confidence 01245888998876432 3569999999999888776654432111000111111111 01111111111100 Q ss_pred C--HH-----------HHHHHHH-HHHHHHhcCCCeEE-ecCCCceeEEecc--cCCHHHHHHHHHHHHHhhhcCCeEEe Q lcl|NC_016762. 265 T--LD-----------ALNERFN-EAARQLNRGNDVLL-PTQGATVTQMVSA--VSDPGPTYNVNLQTAAAGVDIPTKIL 327 (456) Q Consensus 265 ~--~~-----------~~~~~~~-~~~~~~~~~~~~~l-id~~d~~~~~~~~--~sgl~~~~~~~~~~~aaas~IP~t~L 327 (456) + .. ....... .....+. -+++. +..+++++..+.+ -++..+....++..||+..|||.-.| T Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~l 384 (553) T protein:vir:63 307 GSPNADMVGIFGKYMDALKAYVGGANNIQID--GAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEF 384 (553) T ss_pred ccccccccccccccccccccccccccceeec--CceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHH Confidence 0 00 0000000 0000111 12233 3447778777665 56899999999999999999999999 Q ss_pred eccC-CCcccch-HHHHHHHHHHHHHHHhhh----hHHHHHHHHHHHHhcCcCCCCceEEE--eCC---------CCCCC Q lcl|NC_016762. 328 VGMQ-TGERASS-EDQKYHNARCQARRVQEL----TFEINDLFAHLMRIGVVPLKAEFTAI--WDD---------LTVPT 390 (456) Q Consensus 328 ~G~s-p~Glnst-~D~~nyyd~I~~~Qe~~l----rp~L~~l~~~l~~s~~~~~~~d~~~~--f~p---------L~~~s 390 (456) .|-- -.-.+|. ..+..+...++.+|.... +|+-+..++.-+.++.++.|..+.-. ++| -|.+- T Consensus 385 t~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p 464 (553) T protein:vir:63 385 TRDFSKANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGA 464 (553) T ss_pred hhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecC Confidence 8852 2233333 467777788888887543 34444444444556666555422110 011 12222 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCCc---------CcCHHHHHHH-------hcccCCC-CCCC------CcccCCC-CCCC Q lcl|NC_016762. 391 KAERLANSKTMSEINSAAIGTGEP---------VFTAEEIREE-------AGYDPLQ-GGDP------LPDTEPE-DEDA 446 (456) Q Consensus 391 eke~Aei~~~~A~a~~~~~~~g~~---------~i~~~E~R~~-------~~~~~~~-~~~~------~~~~~~~-d~~~ 446 (456) ..+-.|= .|.++|....+.+|.. =.+++|+.+. ...-++. +..+ ..+.+.. .+++ T Consensus 465 ~~~~iDP-~Ke~~A~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~ 543 (553) T protein:vir:63 465 SQGQIDQ-LKETQAAVMRIDAGLSTYEREIARLGGDFRKSFAQRAREDALLKKYGLTFNLSAKRSLGDGRDAATGIAEDP 543 (553) T ss_pred CccccCh-HHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCCCccccCCCcccCCCCCCCC Confidence 2222221 2455566666666610 1333333211 1111111 0000 0111111 1122 Q ss_pred CCcCCCCCCC Q lcl|NC_016762. 447 ARTDPTGEQQ 456 (456) Q Consensus 447 ~~~d~~~~~e 456 (456) ..++++.+.| T Consensus 544 ~~~~~~~~~e 553 (553) T protein:vir:63 544 AAAQTSQQGE 553 (553) T ss_pred CCCCcccccC Confidence 2222222222 No 128 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=99.03 E-value=2.2e-09 Score=68.06 Aligned_cols=421 Identities=9% Similarity=-0.069 Sum_probs=188.4 Q ss_pred CCc--h--------hHHHHh-HHHH--HHHHHHHHHHhhhhhccCcccchhhhhccCc-ccCCHHHHHHHHhcCchhhhh Q lcl|NC_016762. 1 MTD--K--------LDLAVN-HAMS--SAIARARMSLLNQGIGHDAKRPQAWCEYGFP-QEITFNDLYTMYRRGGIAHGA 66 (456) Q Consensus 1 ~~~--~--------~~~~~~-~a~~--~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~-~~~~~~~l~~~Y~~~~l~r~i 66 (456) |+. + +.-.++ |-.. ..+.+..+-|. |- .+-....-..+ ..... .--.+.+++.| T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~----g~---~~i~~~~~~~~~~~~~~-----~ki~~n~~k~I 98 (511) T protein:vir:96 31 YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYE----GK---TKNLVELTRRKEEYMAD-----NRVAHDYASYI 98 (511) T ss_pred cchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhc----cc---CccccccCcCcccccCc-----ceeecchHHHH Confidence 221 1 111222 1110 01111222221 11 11000000000 00000 01246889999 Q ss_pred hccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccc---cc Q lcl|NC_016762. 67 VEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDR---PA 142 (456) Q Consensus 67 Vd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~---Pl 142 (456) |+..+.=++.+.+++...++.. ...|.+.+++.++...+.++.+....||.+++++..+ ||+.-.. |. T Consensus 99 v~~~~~yl~g~p~~~~~~~~~~--------~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~ 170 (511) T protein:vir:96 99 SDFINGYFLGNPIQYQDDDKDV--------LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAM 170 (511) T ss_pred HHHHHhhhccCCceeecCchHH--------HHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccc Confidence 9999998999999986543321 1347777888889999999999999999999988874 3432211 22 Q ss_pred c------CC-cCceeEEEEeccccCChhhh-hcc-ccccccCCce-eEEEeecccCCc----cccceeeehhhhheecCC Q lcl|NC_016762. 143 R------GK-LNGLAKVTPAWAGCLKPKSF-DEK-PDSETYGQPT-MWEYTEASQAGR----PGLVRDIHPDRVFILGDW 208 (456) Q Consensus 143 ~------~~-~~~l~~i~~~~~~~~~~~~~-~~D-p~s~~yg~P~-~y~i~~~~~~g~----~~~~~~IH~SRli~~~~~ 208 (456) . .. ...+....-+|... ..+. ..+ ..-.+++.|. .|+......++. ......-|+-..+.+..+ T Consensus 171 ~~~~vydd~~~~~~~~~vr~~~~~--~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~ 248 (511) T protein:vir:96 171 STFVIYDNTIERNSIAGVRYLRTK--PIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEF 248 (511) T ss_pred eeEEEEcCCCCCceEEEEEEEEee--eccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCCceeeEEe Confidence 1 00 01111111111100 0000 000 0001111111 111111100000 001122344334444433 Q ss_pred --cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHh----c Q lcl|NC_016762. 209 --TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLN----R 282 (456) Q Consensus 209 --~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~----~ 282 (456) +..|.|.++.+.+-+.+++.+.-..+..+...+...+.+......+ ..+........+..+. . T Consensus 249 ~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~-----------~~~~~~~~~~~~~~~~~~~~~ 317 (511) T protein:vir:96 249 SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD-----------PVEVRKQKEANVLFLEPTVYA 317 (511) T ss_pred cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCC-----------chhhcccccccceeccccccc Confidence 3468899998888777888776666655543333333222111111 1111100000000000 0 Q ss_pred CCCeEEecCCCcee--EEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH---HHHHHHHHHHHHHHhhhh Q lcl|NC_016762. 283 GNDVLLPTQGATVT--QMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE---DQKYHNARCQARRVQELT 357 (456) Q Consensus 283 ~~~~~lid~~d~~~--~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~---D~~nyyd~I~~~Qe~~lr 357 (456) ..+..-.+.+.+++ ..+.+.+++...++.+.+.|...+++|-.-.-+.+ |..+|.. -...-...+. .++..++ T Consensus 318 ~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~-~k~~~~~ 395 (511) T protein:vir:96 318 DSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTK-TKEGLFT 395 (511) T ss_pred ccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHH-HHHHHHH Confidence 11111122233444 44667789999999999999999999986332222 3332222 1122334444 4456789 Q ss_pred HHHHHHHHHHHHh-cC-c--C---CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHHcCCcCcCH-HHHHHHhcc Q lcl|NC_016762. 358 FEINDLFAHLMRI-GV-V--P---LKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAIGTGEPVFTA-EEIREEAGY 427 (456) Q Consensus 358 p~L~~l~~~l~~s-~~-~--~---~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~~g~~~i~~-~E~R~~~~~ 427 (456) ..|++++++++.. +. + . ...+++|.|+|-...+.++.+++..+.+-... +++..--.+-++ .|+...... T Consensus 396 ~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~G~iS~et~l~~l~~v~D~~~E~~ri~~E 475 (511) T protein:vir:96 396 KGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEED 475 (511) T ss_pred HHHHHHHHHHHHHHHhhcCcccccccccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHH Confidence 9999998887643 11 1 1 12368999999999999999998776544322 222111012222 344321111 Q ss_pred c----C--CCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 428 D----P--LQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 428 ~----~--~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) . . .......++..+.++..++.++.+++| T Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (511) T protein:vir:96 476 EKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 510 (511) T ss_pred HHHHHHHHhhccccCCCCCCCCCCCCccccccccc Confidence 0 0 000111122222233333344444444 No 129 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=99.03 E-value=9.5e-10 Score=70.04 Aligned_cols=406 Identities=10% Similarity=-0.008 Sum_probs=176.6 Q ss_pred CCchhHHHHh---HHHH---HHHHHHHHHHhhhhhcc---CcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccch Q lcl|NC_016762. 1 MTDKLDLAVN---HAMS---SAIARARMSLLNQGIGH---DAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIV 71 (456) Q Consensus 1 ~~~~~~~~~~---~a~~---~~~~~~~d~~~n~~~~~---gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~a 71 (456) |+.+-++.-. .... ..+. ..+-+....-|- -....+.+............ -....+++.|||..+ T Consensus 18 ~~~~~~~~~~~i~~~i~~~~~~~~-~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~-----ri~~n~~~~ivd~~~ 91 (472) T protein:vir:93 18 TNNKPETLEEMIVRYIKQHLEKLP-EISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDD-----RMITNFHANLVDQKV 91 (472) T ss_pred ecCchhhHHHHHHHHHHHHHHHHH-HHHHHHHHhccccccccccchhhcccccccccccc-----ccccchHHHHHHHHh Confidence 2221111110 0000 0000 111111111110 00001111111000000000 013689999999999 Q ss_pred hHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcc---cccc---- Q lcl|NC_016762. 72 TTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWD---RPAR---- 143 (456) Q Consensus 72 ed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~---~Pl~---- 143 (456) .=++-+++++...++. .. +.|+..+.. ++...+.++.+....||.|++++..+ |++..- .|.. T Consensus 92 ~~l~g~~~~~~~~d~~-~~-------~~l~~~~~n-~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~~i~~~~p~~~~~i 162 (472) T protein:vir:93 92 SYIVGKPIAFKHTDDE-VV-------KRIDEVLGN-RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPI 162 (472) T ss_pred hhhcccCeeeccCChH-HH-------HHHHHHHhc-cHHHHHHHHHHHHhhcCeEEEEEEECCCCceEEEEEcccceEEE Confidence 9999999888543322 11 234444443 67788888888889999998888774 333211 1211 Q ss_pred --CC-cCceeEEEEeccccCChhhhhccccccccCCce---eEEEeeccc---CCccccceeeehh----hhheecC--C Q lcl|NC_016762. 144 --GK-LNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPT---MWEYTEASQ---AGRPGLVRDIHPD----RVFILGD--W 208 (456) Q Consensus 144 --~~-~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~---~y~i~~~~~---~g~~~~~~~IH~S----Rli~~~~--~ 208 (456) .. .+.+....=+|.. .+-..-.++.|. +|....... .........+|.. -.+.+.. . T Consensus 163 ~d~~~~~~~~~~ir~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n 234 (472) T protein:vir:93 163 WTDKEHEELEAFIRMYKL--------ENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN 234 (472) T ss_pred EcCCCCCceEEEEEEEEe--------ecceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecC Confidence 00 1111111111110 010111111221 222111000 0000011112211 1111211 2 Q ss_pred cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEE Q lcl|NC_016762. 209 TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLL 288 (456) Q Consensus 209 ~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 288 (456) +.+|.|.++.+.+-+.+++.+.-..+..+--.+...+.+. ++ ......+ ....+ +..+... T Consensus 235 n~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~--------g~------~~~~~~~-~~~~~----~~~~~~~ 295 (472) T protein:vir:93 235 NDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLT--------NY------DDQELPE-FKRLL----RYYGAIK 295 (472) T ss_pred CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEee--------cC------Ccccchh-hHHHH----hhccccc Confidence 3578999998877777777665444433221121112111 11 1011111 11111 2223344 Q ss_pred ecCCCcee--EEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----HHHHHHHHHHHhhhhHHHH Q lcl|NC_016762. 289 PTQGATVT--QMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----YHNARCQARRVQELTFEIN 361 (456) Q Consensus 289 id~~d~~~--~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----nyyd~I~~~Qe~~lrp~L~ 361 (456) ++.+.+.+ ..+.+.+++...++.+.+.+...+++|-.-+ +.. || |.+++ ++ .-...+. .++..+...|+ T Consensus 296 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~-~~-n~Sg~Al~~~~~~l~~ka~-~~~~~~~~~l~ 371 (472) T protein:vir:93 296 VSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSS-DKF-GS-APSGVALEFLYTNLNLKAD-KLARKAKVAIQ 371 (472) T ss_pred cCCCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCc-ccc-cc-CchHHHHHHHHHHHHHHHH-HHHHHHHHHHH Confidence 55555544 4567778999999999999999999996422 211 11 23333 32 2233343 44557899999 Q ss_pred HHHHHHHHhcCc-CCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHH--HHH-cCCcCcCHH-HHHHHhc-----ccCCC Q lcl|NC_016762. 362 DLFAHLMRIGVV-PLKAEFTAIWDDLTVPTKAERLANSKTMSEINSA--AIG-TGEPVFTAE-EIREEAG-----YDPLQ 431 (456) Q Consensus 362 ~l~~~l~~s~~~-~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~--~~~-~g~~~i~~~-E~R~~~~-----~~~~~ 431 (456) +++++++..... ....++++.|+|-...+.++.|++..+.+.+... ++. .+ .+-+++ |+..... ...+. T Consensus 372 ~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~~~~~~~~k~~giis~et~l~~l~-~~~d~~~E~~ri~~E~~~~~~~~~ 450 (472) T protein:vir:93 372 ELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHP-FVEDLQAELERIEQEQMEYNKQLP 450 (472) T ss_pred HHHHHHHHHhCCCcccceeeEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCC-CCCCHHHHHHHHHHHHHHHHHhcc Confidence 999887654322 2345899999999999999999998877654321 121 11 122233 3322110 01111 Q ss_pred CCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 432 GGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 432 ~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) + ..+....+..+.+..+..++| T Consensus 451 ~---~~~~~~d~~~~~~~~~~~~~e 472 (472) T protein:vir:93 451 N---LDDGGADGAQQQERSNNKESE 472 (472) T ss_pred C---cCcccCCCCCCCCCCCcccCC Confidence 1 111111111111222222223 No 130 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=99.01 E-value=2.4e-09 Score=67.87 Aligned_cols=423 Identities=10% Similarity=-0.055 Sum_probs=186.3 Q ss_pred CCchhHHH------HhHHHHHHHHHHHHH---HhhhhhccCc--ccch-hhhhccCcccCCHHHHHHHHhcCchhhhhhc Q lcl|NC_016762. 1 MTDKLDLA------VNHAMSSAIARARMS---LLNQGIGHDA--KRPQ-AWCEYGFPQEITFNDLYTMYRRGGIAHGAVE 68 (456) Q Consensus 1 ~~~~~~~~------~~~a~~~~~~~~~d~---~~n~~~~~gt--~~~~-~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd 68 (456) |+...++. +...++.-....+.. +....-|-.. .+.+ ....+ .+. .--.+.+++.||+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~-~~~---------~ki~~n~~k~Iv~ 100 (511) T protein:vir:93 31 YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEY-MAD---------NRVAHDYASYISD 100 (511) T ss_pred ccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccc-cCc---------ceeecchHHHHHH Confidence 33211111 111222111111111 1111111100 0000 00000 010 0124688999999 Q ss_pred cchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcc---ccccC Q lcl|NC_016762. 69 KIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWD---RPARG 144 (456) Q Consensus 69 ~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~---~Pl~~ 144 (456) ..+.=++-+.+++..+++.. .+.|.+.+++.++...+.++.+....||.|++++..+ +++.-. .|... T Consensus 101 ~~~~yl~g~p~~~~~~d~~~--------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~~~~~i~~~~p~~~ 172 (511) T protein:vir:93 101 FINGYFLGNPIQYQDDDKDV--------LEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMST 172 (511) T ss_pred HHhhhhcccCeeeccCChHH--------HHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccee Confidence 99988888998886544321 1347777777789999999999999999999988874 343221 22210 Q ss_pred ------C-cCceeEEEEeccccCChhhhhccccccccCCce-eEEEeecccCCcc----ccceeeehhhhheecCC--cC Q lcl|NC_016762. 145 ------K-LNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPT-MWEYTEASQAGRP----GLVRDIHPDRVFILGDW--TG 210 (456) Q Consensus 145 ------~-~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~-~y~i~~~~~~g~~----~~~~~IH~SRli~~~~~--~~ 210 (456) . .+.+....-+|.....-..-...+.-.+++.|. .|++.....++.. .....-|+=-.+.+.++ .. T Consensus 173 ~~vydd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~ 252 (511) T protein:vir:93 173 FVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE 252 (511) T ss_pred EEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccCCCccceEEecCCC Confidence 0 011111111111100000000000111111121 1122111100000 00112233222333322 34 Q ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCe Q lcl|NC_016762. 211 DAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDV 286 (456) Q Consensus 211 ~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~ 286 (456) +|.|.++.+..-+.+++.+.-..+..+...+...+.+..... .+..+..+.....+-.+..+ ... T Consensus 253 ~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (511) T protein:vir:93 253 RRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN-----------LDPVEVRKQKEANVLFLEPTVYADSEG 321 (511) T ss_pred CCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcc-----------cCchhhcccccccceeccccccccccc Confidence 688999988887777777766655544333322232221111 11111111000000000010 011 Q ss_pred EEecCCCc--eeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----HHHHHHHHHHHhhhhHH Q lcl|NC_016762. 287 LLPTQGAT--VTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----YHNARCQARRVQELTFE 359 (456) Q Consensus 287 ~lid~~d~--~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----nyyd~I~~~Qe~~lrp~ 359 (456) .-.+.+.+ |-..+.+.+++...++...+.|...+++|-.-. +...|.. +++ ++ .-...+. .++..++.. T Consensus 322 ~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~-~~~~~n~--Sg~Al~~~~~~l~~k~~-~k~~~f~~~ 397 (511) T protein:vir:93 322 RETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKD-DNFSGTQ--SGEAMKYKLFGLEQRTK-TKEGLFTKG 397 (511) T ss_pred ccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccc-ccccccc--hHHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 11223334 444566778999999999999999999997533 2221322 332 22 2233444 455678999 Q ss_pred HHHHHHHHHHh-cC-c--CC---CCceEEEeCCCCCCCHHHHHHHHHHHHHHHHH--HHHcCCcCcCH-HHHHHHhcc-- Q lcl|NC_016762. 360 INDLFAHLMRI-GV-V--PL---KAEFTAIWDDLTVPTKAERLANSKTMSEINSA--AIGTGEPVFTA-EEIREEAGY-- 427 (456) Q Consensus 360 L~~l~~~l~~s-~~-~--~~---~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~--~~~~g~~~i~~-~E~R~~~~~-- 427 (456) |++++++++.. +. + .. ..++++.|+|-...+.+|.|++..+.+-+... ++..--.+-++ .|+...... T Consensus 398 l~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~ 477 (511) T protein:vir:93 398 LRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEK 477 (511) T ss_pred HHHHHHHHHHHHHhccCcccccccccceEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHH Confidence 99999887643 11 1 11 23679999999999999999987776543321 22111012223 343321110 Q ss_pred ---cCC-CCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 428 ---DPL-QGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 428 ---~~~-~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) ... ......++..+.+++.++.++.+++| T Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (511) T protein:vir:93 478 ESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 510 (511) T ss_pred HHHHHHhhhcccCCCCCCCCCCCCccccccccc Confidence 000 00001111112222333333344444 No 131 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=99.01 E-value=5.7e-09 Score=65.78 Aligned_cols=420 Identities=10% Similarity=-0.041 Sum_probs=184.9 Q ss_pred CCchhHHH---Hh-HHHHH--HHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHH Q lcl|NC_016762. 1 MTDKLDLA---VN-HAMSS--AIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTC 74 (456) Q Consensus 1 ~~~~~~~~---~~-~a~~~--~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~ 74 (456) +..--+++ ++ |..+. .+.+..+-|.+ . ..........+ .......-..+.+++.||+..+.=+ T Consensus 37 ~~~~~~~i~~~i~~~~~~~~~r~~~~~~yY~g------~-~~~i~~~~~~~----~~~~~~~ri~~n~~k~Ivd~~~~yl 105 (501) T protein:vir:96 37 MVNNWELLKNFINHHKLRQAPRIQELLDYARG------E-NHDVLKSGRRK----DNEMADKRAVHNYGRMISKFKTGYL 105 (501) T ss_pred cCChHHHHHHHHHHHHHHHHHHHHHHHHHhcC------C-CCcccCccccC----ccccccceeecchHHHHHHHHhhhh Confidence 33322222 21 22221 12222222211 1 00000000000 0000011235788999999999999 Q ss_pred hhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccccccCCcCceeEEE Q lcl|NC_016762. 75 WKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDRPARGKLNGLAKVT 153 (456) Q Consensus 75 tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~Pl~~~~~~l~~i~ 153 (456) +.+.+++...++.+... ....|.+.+++-++...+.++.+....||.|++++..+ ||+..-..+.. ..+. T Consensus 106 ~g~p~~~~~~~~~~~~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p-----~~~~ 176 (501) T protein:vir:96 106 AGNPIRVEYDDNDDNSQ----NDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEYDETRIKRLSP-----LETF 176 (501) T ss_pred cccCeeEeeCCccchhH----HHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEcc-----ceeE Confidence 99999997655443322 22346677777788899999999999999999888774 33321111100 0111 Q ss_pred EeccccCC--h--h--hhhccccc-----cccCCc-eeEEEeecccCCccccceeeehhhhheecC--CcCCCcchHHHH Q lcl|NC_016762. 154 PAWAGCLK--P--K--SFDEKPDS-----ETYGQP-TMWEYTEASQAGRPGLVRDIHPDRVFILGD--WTGDAIGFLEPA 219 (456) Q Consensus 154 ~~~~~~~~--~--~--~~~~Dp~s-----~~yg~P-~~y~i~~~~~~g~~~~~~~IH~SRli~~~~--~~~~G~S~le~~ 219 (456) |+|..... + . -|..+... -.++.| ..|.+.. .++.......-|+--.+.+.. ...+|.|.++.+ T Consensus 177 ~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~vyt~~~i~~~~~--~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v 254 (501) T protein:vir:96 177 VIYDNSLEDNSIAAVRYYNRGTLQSAKDVVEIYTDEHIYTLDA--SDDFNEISVTTHAFGTVPITEYLNNIDGIGDYETE 254 (501) T ss_pred EEEcCCCCCceEEEEEEEEeecCCCcEEEEEEEcCCcEEEEee--CCCceeccccccCCCccceEEecCCccCCCchhhh Confidence 22211000 0 0 00000000 000111 1111110 000000011223222222222 234689999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHH-HHHHHhcCCCeEE-ecCCCc--e Q lcl|NC_016762. 220 YNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNE-AARQLNRGNDVLL-PTQGAT--V 295 (456) Q Consensus 220 ~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~l-id~~d~--~ 295 (456) .+-+.+++.+.-..+..+-..+...+.+..... ....+....+.. .+-.+ ...+... ...+-+ | T Consensus 255 ~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~-----------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 322 (501) T protein:vir:96 255 LYLIDLYDSAESDTANHMSDMADAILAIYGDLA-----------LPKGMQASDMKRTRLMQL-KPPKSADGKEGTVKAEY 322 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeecccc-----------cCcccchhhhhhcCeeee-cccccccccccCcceee Confidence 877777777765555444333333332221100 000011111100 00000 0111000 111123 3 Q ss_pred eEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH---HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhc- Q lcl|NC_016762. 296 TQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE---DQKYHNARCQARRVQELTFEINDLFAHLMRIG- 371 (456) Q Consensus 296 ~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~- 371 (456) -..+.+.+++...++.+.+.|...+++|-.-+ |...|..+|.. -.......+ ..++..++..|++++++++... T Consensus 323 l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~~~~n~Sg~Al~~~~~~l~~ka-~~~~~~~~~~l~~~~~li~~~~~ 400 (501) T protein:vir:96 323 LTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSD-TNFSGNTSGEALKYKLFGLDQDR-VDTQSQFTKGLKRRYRLAARIGS 400 (501) T ss_pred EeccCCHHHHHHHHHHHHHHHHHHhCCcccCc-ccccccchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHH Confidence 34566778999999999999999999996533 22212222221 112223344 4555678999999988865431 Q ss_pred ---CcC--CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHHcCCcCcCHHHHHHHhc-----c--cCCCCCCCCc Q lcl|NC_016762. 372 ---VVP--LKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAIGTGEPVFTAEEIREEAG-----Y--DPLQGGDPLP 437 (456) Q Consensus 372 ---~~~--~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~~g~~~i~~~E~R~~~~-----~--~~~~~~~~~~ 437 (456) .+. ...+++|.|+|-...++++.|++..+.+-+.. +++..--.+-++++-.+... . .+........ T Consensus 401 ~~~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~ 480 (501) T protein:vir:96 401 LVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLGGQVSQETALSLSGLVESPNEELDKINKEMSEIDFKGYSNDFNEH 480 (501) T ss_pred hcccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHhhccccccchhhc Confidence 121 12368899999999999999998877765332 22221101223333222111 1 1111110011 Q ss_pred ccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 438 DTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 438 ~~~~~d~~~~~~d~~~~~e 456 (456) ..+..++..+...+.+|++ T Consensus 481 ~~~~~~~~~e~~~d~~e~~ 499 (501) T protein:vir:96 481 VGKYTDEVKETHTDDFERE 499 (501) T ss_pred ccccCCcCCCCCCCccccc Confidence 1111111111111222222 No 132 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=99.00 E-value=3.1e-09 Score=67.21 Aligned_cols=415 Identities=9% Similarity=-0.025 Sum_probs=177.2 Q ss_pred CCchhHHHHhHHHHH--------HHHHHHHHHhhhhhccCcccch--hhhhccCcc--cC----------CHHH-HHHHH Q lcl|NC_016762. 1 MTDKLDLAVNHAMSS--------AIARARMSLLNQGIGHDAKRPQ--AWCEYGFPQ--EI----------TFND-LYTMY 57 (456) Q Consensus 1 ~~~~~~~~~~~a~~~--------~~~~~~d~~~n~~~~~gt~~~~--~~~~~~~~~--~~----------~~~~-l~~~Y 57 (456) |..++++-+....+. ...-+.+-+..+...+....++ ....|+-.. .+ .... ....- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~k 80 (474) T protein:vir:94 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWR 80 (474) T ss_pred CcccccccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcce Confidence 444333332211110 0001112222222111111111 011111000 00 0000 00001 Q ss_pred hcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecC-CC Q lcl|NC_016762. 58 RRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRD-SQ 136 (456) Q Consensus 58 ~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D-~~ 136 (456) -.+.+++.||+..+.=++-+.+++...++. . .+.++..++ -++...+.++.+....+|.+++++..+. ++ T Consensus 81 i~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~-~-------~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~ 151 (474) T protein:vir:94 81 ITTNFHQNLVDQKVSYVASKPVTYSCEDEN-V-------LKVIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYINENGE 151 (474) T ss_pred eecchHHHHHHHHHhhhhcCCceeccCcHH-H-------HHHHHHHHh-ccHHHHHHHHHHHHhhcCceEEEEEecCCCe Confidence 136789999999999999999988543321 1 122444433 4778888999999999999999887743 32 Q ss_pred Cc---ccccc------CC-cCceeEEEEeccccCChhhhhccccccccCCcee---EEEeecccCCc---ccc----cee Q lcl|NC_016762. 137 PW---DRPAR------GK-LNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTM---WEYTEASQAGR---PGL----VRD 196 (456) Q Consensus 137 ~~---~~Pl~------~~-~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~---y~i~~~~~~g~---~~~----~~~ 196 (456) .. -.|.. .. .+.+....=+|.. .+...-.++.|.. |.......... ... ... T Consensus 152 ~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~--------~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~ 223 (474) T protein:vir:94 152 MKLFRVPAEQAIPIWVDKEREELKSFIRYYKF--------NNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFS 223 (474) T ss_pred eEEEEEcccceEEEEcCCCCCceEEEEEEEEe--------cCeEEEEEEeCCeEEEEEEcCCccccccccCcCccccccc Confidence 21 11221 00 1111111111110 0100011112221 11110000000 000 011 Q ss_pred eehhhhheecC--CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHH Q lcl|NC_016762. 197 IHPDRVFILGD--WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFN 274 (456) Q Consensus 197 IH~SRli~~~~--~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~ 274 (456) -|.--.+.+.. ....|.|.++.+.+-+.+++.+.-..+..+-..+...+.++ +. .+....+... T Consensus 224 ~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~--------g~---~~~~~~~~~~--- 289 (474) T protein:vir:94 224 NGNWGRVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILK--------GY---EGEDLEEFMR--- 289 (474) T ss_pred ccCCCccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--------cC---Ccccchhhhh--- Confidence 23322222322 23468899998887777777765555433211111122111 00 0111111111 Q ss_pred HHHHHHhcCCCeEEecCCCce--eEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-H----HHHHHH Q lcl|NC_016762. 275 EAARQLNRGNDVLLPTQGATV--TQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-Q----KYHNAR 347 (456) Q Consensus 275 ~~~~~~~~~~~~~lid~~d~~--~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~----~nyyd~ 347 (456) .+ ....++.++.+.+. -..+.+.+++...++.+...|...+++|-.-. .+.+| |.++. + ..-... T Consensus 290 ----~~-~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~--~~~~~-n~Sg~Al~~~~~~l~~k 361 (474) T protein:vir:94 290 ----GL-KYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQT--DKFGS-APSGIALKFLYGNLDLK 361 (474) T ss_pred ----hh-hccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCc--ccccc-ccHHHHHHHHHHHHHHH Confidence 11 12234445555554 44567778999999999999999999996321 11122 23333 2 223344 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHhcCc-CCCCceEEEeCCCCCCCHHHHHHHHHHHHHHH--HHHHHcCCcCcCHH-HHHH Q lcl|NC_016762. 348 CQARRVQELTFEINDLFAHLMRIGVV-PLKAEFTAIWDDLTVPTKAERLANSKTMSEIN--SAAIGTGEPVFTAE-EIRE 423 (456) Q Consensus 348 I~~~Qe~~lrp~L~~l~~~l~~s~~~-~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~--~~~~~~g~~~i~~~-E~R~ 423 (456) +. .++..+++.|++++.+++..... ....++++.|+|-...+++|.|++..+ |-.. .+++..--.+-+++ |+.. T Consensus 362 ~~-~k~~~~~~~l~~~~~li~~~~~~~~d~~~i~v~f~~~~p~~~~e~a~~~~~-~g~iS~et~l~~l~~v~D~~~E~er 439 (474) T protein:vir:94 362 AN-KLKNKATVAIQELISFIIDFNNLKTDVKDIEISFNFNRMMNDAEQSQIIAQ-SQYLSRETLVKSSPLVDDYKAELER 439 (474) T ss_pred HH-HHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCcccCHHHHHHHHHH-cCCCCHHHHHHhCCCCCCHHHHHHH Confidence 44 44457899999999987764333 334578999999998899998887544 2111 11121110122222 3322 Q ss_pred Hhcc-cCCCCCC-CCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 424 EAGY-DPLQGGD-PLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 424 ~~~~-~~~~~~~-~~~~~~~~d~~~~~~d~~~~~e 456 (456) .... ....... ........++...+.++..+.| T Consensus 440 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:94 440 IEQEQMEYNKQLPNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred HHHHHHHHHhhccccCCCCCCCcccCCCCcccccC Confidence 1100 0000000 0000000111111222222222 No 133 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=99.00 E-value=3.1e-09 Score=67.21 Aligned_cols=415 Identities=9% Similarity=-0.025 Sum_probs=177.2 Q ss_pred CCchhHHHHhHHHHH--------HHHHHHHHHhhhhhccCcccch--hhhhccCcc--cC----------CHHH-HHHHH Q lcl|NC_016762. 1 MTDKLDLAVNHAMSS--------AIARARMSLLNQGIGHDAKRPQ--AWCEYGFPQ--EI----------TFND-LYTMY 57 (456) Q Consensus 1 ~~~~~~~~~~~a~~~--------~~~~~~d~~~n~~~~~gt~~~~--~~~~~~~~~--~~----------~~~~-l~~~Y 57 (456) |..++++-+....+. ...-+.+-+..+...+....++ ....|+-.. .+ .... ....- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~k 80 (474) T protein:vir:97 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWR 80 (474) T ss_pred CcccccccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcce Confidence 444333332211110 0001112222222111111111 011111000 00 0000 00001 Q ss_pred hcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecC-CC Q lcl|NC_016762. 58 RRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRD-SQ 136 (456) Q Consensus 58 ~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D-~~ 136 (456) -.+.+++.||+..+.=++-+.+++...++. . .+.++..++ -++...+.++.+....+|.+++++..+. ++ T Consensus 81 i~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~-~-------~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~ 151 (474) T protein:vir:97 81 ITTNFHQNLVDQKVSYVASKPVTYSCEDEN-V-------LKVIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYINENGE 151 (474) T ss_pred eecchHHHHHHHHHhhhhcCCceeccCcHH-H-------HHHHHHHHh-ccHHHHHHHHHHHHhhcCceEEEEEecCCCe Confidence 136789999999999999999988543321 1 122444433 4778888999999999999999887743 32 Q ss_pred Cc---ccccc------CC-cCceeEEEEeccccCChhhhhccccccccCCcee---EEEeecccCCc---ccc----cee Q lcl|NC_016762. 137 PW---DRPAR------GK-LNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTM---WEYTEASQAGR---PGL----VRD 196 (456) Q Consensus 137 ~~---~~Pl~------~~-~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~---y~i~~~~~~g~---~~~----~~~ 196 (456) .. -.|.. .. .+.+....=+|.. .+...-.++.|.. |.......... ... ... T Consensus 152 ~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~--------~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~ 223 (474) T protein:vir:97 152 MKLFRVPAEQAIPIWVDKEREELKSFIRYYKF--------NNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFS 223 (474) T ss_pred eEEEEEcccceEEEEcCCCCCceEEEEEEEEe--------cCeEEEEEEeCCeEEEEEEcCCccccccccCcCccccccc Confidence 21 11221 00 1111111111110 0100011112221 11110000000 000 011 Q ss_pred eehhhhheecC--CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHH Q lcl|NC_016762. 197 IHPDRVFILGD--WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFN 274 (456) Q Consensus 197 IH~SRli~~~~--~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~ 274 (456) -|.--.+.+.. ....|.|.++.+.+-+.+++.+.-..+..+-..+...+.++ +. .+....+... T Consensus 224 ~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~--------g~---~~~~~~~~~~--- 289 (474) T protein:vir:97 224 NGNWGRVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILK--------GY---EGEDLEEFMR--- 289 (474) T ss_pred ccCCCccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--------cC---Ccccchhhhh--- Confidence 23322222322 23468899998887777777765555433211111122111 00 0111111111 Q ss_pred HHHHHHhcCCCeEEecCCCce--eEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-H----HHHHHH Q lcl|NC_016762. 275 EAARQLNRGNDVLLPTQGATV--TQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-Q----KYHNAR 347 (456) Q Consensus 275 ~~~~~~~~~~~~~lid~~d~~--~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~----~nyyd~ 347 (456) .+ ....++.++.+.+. -..+.+.+++...++.+...|...+++|-.-. .+.+| |.++. + ..-... T Consensus 290 ----~~-~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~--~~~~~-n~Sg~Al~~~~~~l~~k 361 (474) T protein:vir:97 290 ----GL-KYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQT--DKFGS-APSGIALKFLYGNLDLK 361 (474) T ss_pred ----hh-hccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCc--ccccc-ccHHHHHHHHHHHHHHH Confidence 11 12234445555554 44567778999999999999999999996321 11122 23333 2 223344 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHhcCc-CCCCceEEEeCCCCCCCHHHHHHHHHHHHHHH--HHHHHcCCcCcCHH-HHHH Q lcl|NC_016762. 348 CQARRVQELTFEINDLFAHLMRIGVV-PLKAEFTAIWDDLTVPTKAERLANSKTMSEIN--SAAIGTGEPVFTAE-EIRE 423 (456) Q Consensus 348 I~~~Qe~~lrp~L~~l~~~l~~s~~~-~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~--~~~~~~g~~~i~~~-E~R~ 423 (456) +. .++..+++.|++++.+++..... ....++++.|+|-...+++|.|++..+ |-.. .+++..--.+-+++ |+.. T Consensus 362 ~~-~k~~~~~~~l~~~~~li~~~~~~~~d~~~i~v~f~~~~p~~~~e~a~~~~~-~g~iS~et~l~~l~~v~D~~~E~er 439 (474) T protein:vir:97 362 AN-KLKNKATVAIQELISFIIDFNNLKTDVKDIEISFNFNRMMNDAEQSQIIAQ-SQYLSRETLVKSSPLVDDYKAELER 439 (474) T ss_pred HH-HHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCcccCHHHHHHHHHH-cCCCCHHHHHHhCCCCCCHHHHHHH Confidence 44 44457899999999987764333 334578999999998899998887544 2111 11121110122222 3322 Q ss_pred Hhcc-cCCCCCC-CCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 424 EAGY-DPLQGGD-PLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 424 ~~~~-~~~~~~~-~~~~~~~~d~~~~~~d~~~~~e 456 (456) .... ....... ........++...+.++..+.| T Consensus 440 i~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:97 440 IEQEQMEYNKQLPNLDDGGADGAQQQEGSNNKESE 474 (474) T ss_pred HHHHHHHHHhhccccCCCCCCCcccCCCCcccccC Confidence 1100 0000000 0000000111111222222222 No 134 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=99.00 E-value=9.6e-10 Score=70.03 Aligned_cols=409 Identities=9% Similarity=0.021 Sum_probs=181.5 Q ss_pred CCchhHHHHhHHHHHHHHH---------HHHHHhhhhhccCcccchhhhhccCcccCC-HHHHHHHHhc----------- Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIAR---------ARMSLLNQGIGHDAKRPQAWCEYGFPQEIT-FNDLYTMYRR----------- 59 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~---------~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~-~~~l~~~Y~~----------- 59 (456) |.. |.-+.+-..++.. +-+-+..+....-+. ..+ ++.|...|.. T Consensus 1 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~------------~~~~~~~l~~Yy~g~~~i~~~~~~~ 65 (470) T protein:vir:99 1 MKD---INYGRDKVTGNSSFIFPKGEKLTSNELLGFIAYNETV------------LKPRYRENMKLYLGKHKILTAPEKE 65 (470) T ss_pred Ccc---ccCCcccccCCceEEeCCCCCcCHHHHHHHHHHHHHh------------hHHHHHHHHHHhccccccccCcccc Confidence 210 1000000000000 000000010000000 000 1222333332 Q ss_pred --------CchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEE Q lcl|NC_016762. 60 --------GGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLH 131 (456) Q Consensus 60 --------~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~ 131 (456) +.++++||+..+.=.+.+.+++...++... ...+.+.+++-++...+.++.+....+|.+++++. T Consensus 66 ~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~~d~~~-------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~ 138 (470) T protein:vir:99 66 TGADNRIVVNSAKYVVDVYNGYFCGIEPKLALLNDSSK-------IDEIARWNRQENFFDTINEISKQCDIFGRSIASIY 138 (470) T ss_pred cCCcceeecchHHHHHHHHhhhhccCCeeEeeCCchhH-------HHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEE Confidence 358999999999999999988865432211 12366777788889999999999999999999888 Q ss_pred ec-CCCCcc---ccccC------C-cCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCC-ccccceeeeh Q lcl|NC_016762. 132 IR-DSQPWD---RPARG------K-LNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAG-RPGLVRDIHP 199 (456) Q Consensus 132 i~-D~~~~~---~Pl~~------~-~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g-~~~~~~~IH~ 199 (456) ++ ||+... .|... . ..-+....=+|...- ... .-..-.-|..=..|.+.....+. ........|+ T Consensus 139 ~d~dg~~~i~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~--~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 215 (470) T protein:vir:99 139 QGEDARPHLMYSSPNHAFIIYDDTVQRQPLAFVHYQIDNS--NNW-TDAYGVIQYADKFYKFKGYDIEEDTNAAGYAINP 215 (470) T ss_pred eCCCCeEEEEEEccceeEEEEcCCCCcceEEEEEEEEEec--CCe-eEEEEEEEecCeEEEEEecccccccccccccccC Confidence 74 343321 22210 0 000111111111000 000 00000011111222222111000 0011233455 Q ss_pred hhhheecC--CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHH Q lcl|NC_016762. 200 DRVFILGD--WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAA 277 (456) Q Consensus 200 SRli~~~~--~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 277 (456) --.+.+.. ...+|.|.++.+..-+.+++.+.-..+..+-..+..++.+... .+.+ ++..+. + T Consensus 216 ~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~------~~~~------~~~g~~----~ 279 (470) T protein:vir:99 216 YGLVPAVEFFENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGF------KLPE------DDEGNP----K 279 (470) T ss_pred CCccceEeecCCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC------Cccc------ccccch----h Confidence 33333332 2456889999888777777766555443332222222222110 0000 000011 1 Q ss_pred HHHhcCCCeEEe-----cCCCceeEE--ecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH---HHHHHHHH Q lcl|NC_016762. 278 RQLNRGNDVLLP-----TQGATVTQM--VSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE---DQKYHNAR 347 (456) Q Consensus 278 ~~~~~~~~~~li-----d~~d~~~~~--~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~---D~~nyyd~ 347 (456) ..+..+ ....+ +.+.+++.+ +.+..++...++.+.+.++..+++|-. .++...|..+|.. -...-... T Consensus 280 ~~~~~~-~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~-~~~~~~~n~Sg~Ai~~~~~~l~~k 357 (470) T protein:vir:99 280 FDFKNN-RVLYVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNI-QDKNFAGNSSGVALQYKLFAMKNK 357 (470) T ss_pred hhhhhc-ceeeecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCccc-cccccccCchHHHHHHHHHHHHHH Confidence 111111 11111 223344444 456678899999999999999999953 2333222222221 12233344 Q ss_pred HHHHHHhhhhHHHHHHHHHHHHhc----CcC-CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHH-HcCCcCcCHH Q lcl|NC_016762. 348 CQARRVQELTFEINDLFAHLMRIG----VVP-LKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAI-GTGEPVFTAE 419 (456) Q Consensus 348 I~~~Qe~~lrp~L~~l~~~l~~s~----~~~-~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~-~~g~~~i~~~ 419 (456) +. .++..++..|++++.+++... .++ ...++++.|+|-...+++|.|++..+.+-+.. +++ ..+ -++++ T Consensus 358 ~~-~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~giis~et~l~~l~--~vd~~ 434 (470) T protein:vir:99 358 AD-SKERKFDKSLMQLYRIVLATLFNNKQDQELWSELDFKFTRNLPEDMASAIDNAKNAEGIVSKKTQLGMIP--DIEPD 434 (470) T ss_pred HH-HHHHHHHHHHHHHHHHHHHHHhccCCcccccccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhCC--CCCHH Confidence 44 444578999999998875431 111 23478999999999999999998877654321 111 122 33544 Q ss_pred HHHHHhcccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 420 EIREEAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 420 E~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) +-.+.+..+...... .....-...+..+.+|.+++| T Consensus 435 ~E~eri~~E~~~~~~-~~~~~~~~~d~~~~d~~~ee~ 470 (470) T protein:vir:99 435 AEMKQIAKEKADAIK-QTQQLSMPIDILKRDNNAEEE 470 (470) T ss_pred HHHHHHHHHHHHHHH-HHHhhcCCCCcCCCCCCccCC Confidence 221111110000000 000000011222333444444 No 135 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=99.00 E-value=1.6e-09 Score=68.84 Aligned_cols=407 Identities=10% Similarity=-0.001 Sum_probs=178.9 Q ss_pred CCchhHHH--------HhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchh Q lcl|NC_016762. 1 MTDKLDLA--------VNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVT 72 (456) Q Consensus 1 ~~~~~~~~--------~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~ae 72 (456) ++.+.++. -.|..+.........|...--.+-....+.+............ -..+.+++.||+..+. T Consensus 38 ~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~-----ri~~n~~k~Ivd~~~~ 112 (492) T protein:vir:97 38 TNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDD-----RMITNFHANLVDQKVS 112 (492) T ss_pred CCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccccccccccccccccccc-----ccccchHHHHHHHHhh Confidence 12111111 1121111111111112111000000011111111000000000 0136889999999999 Q ss_pred HHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcc---cccc----- Q lcl|NC_016762. 73 TCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWD---RPAR----- 143 (456) Q Consensus 73 d~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~---~Pl~----- 143 (456) =++.+.+++...++. .. ..|+..++ -++...+.++.+....||.|++++..+ ||+.-- .|.. T Consensus 113 yl~g~p~~~~~~d~~-~~-------~~l~~~~~-n~~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~~~~~~~p~~~~~i~ 183 (492) T protein:vir:97 113 YIVGKPIAFKHTDDE-VV-------KRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIW 183 (492) T ss_pred hhcccCceeccCchH-HH-------HHHHHHHh-ccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEE Confidence 999999888544322 11 22444443 366778888888888999998888774 333211 1211 Q ss_pred -CC-cCceeEEEEeccccCChhhhhccccccccCCc---eeEEEeecccC---Cccccceeeehh----hhheecC--Cc Q lcl|NC_016762. 144 -GK-LNGLAKVTPAWAGCLKPKSFDEKPDSETYGQP---TMWEYTEASQA---GRPGLVRDIHPD----RVFILGD--WT 209 (456) Q Consensus 144 -~~-~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P---~~y~i~~~~~~---g~~~~~~~IH~S----Rli~~~~--~~ 209 (456) .. .+.+....=+|.. .+-..-.++.| .+|.+...... ........+|.. -.+.+.. .+ T Consensus 184 d~~~~~~~~~~vr~~~~--------~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn 255 (492) T protein:vir:97 184 TDKEHEELEAFIRMYKL--------ENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNN 255 (492) T ss_pred cCCCCCceEEEEEEEee--------ccceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCC Confidence 00 1111111111110 01111112222 22322211000 000001112211 0111222 23 Q ss_pred CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEe Q lcl|NC_016762. 210 GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLP 289 (456) Q Consensus 210 ~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~li 289 (456) .+|.|.++.+.+-+.+++.+.-..+..+...+...+.++ |...+... .+... .+....+.+ T Consensus 256 ~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~--------------g~~~~~~~-~~~~~----~~~~~~~~~ 316 (492) T protein:vir:97 256 DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLK--------------NYDDQELP-EFKRL----LRYYGAIKV 316 (492) T ss_pred CCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeee--------------cCCcccch-hHHHH----Hhhccceec Confidence 468899988877777777665544443322222222211 10111111 11111 122233445 Q ss_pred cCCCcee--EEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HHH-H---HHHHHHHHHhhhhHHHHH Q lcl|NC_016762. 290 TQGATVT--QMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QKY-H---NARCQARRVQELTFEIND 362 (456) Q Consensus 290 d~~d~~~--~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~n-y---yd~I~~~Qe~~lrp~L~~ 362 (456) +++.+.+ ..+.+.+++...++...+.+...+++|-.-+ ..-+| |.++. ++. | ...+ ...+..++..|++ T Consensus 317 ~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~--~~~~~-n~Sg~Al~~~~~~l~~ka-~~~~~~f~~~l~~ 392 (492) T protein:vir:97 317 SDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSS--DKFGS-APSGVALEFLYTNLNLKA-DKLARKAKVAIQE 392 (492) T ss_pred CCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCCCCCc--ccccc-CcHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH Confidence 5555544 4566778999999999999999999996422 11112 23333 322 2 2233 4555678999999 Q ss_pred HHHHHHHhcCc-CCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHH--HH-HcCCcCcCH-HHHHHHhcc-----cCCCC Q lcl|NC_016762. 363 LFAHLMRIGVV-PLKAEFTAIWDDLTVPTKAERLANSKTMSEINSA--AI-GTGEPVFTA-EEIREEAGY-----DPLQG 432 (456) Q Consensus 363 l~~~l~~s~~~-~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~--~~-~~g~~~i~~-~E~R~~~~~-----~~~~~ 432 (456) ++++++..... ....++++.|+|-...+++|.|++..+.+-+... ++ ..+ .+-++ .|+...... ..... T Consensus 393 ~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~G~iS~et~l~~l~-~v~d~~~Eleri~~E~~~~~~~~~~ 471 (492) T protein:vir:97 393 LLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHP-FVEDLQAELERIEQEQTEYNKQLPN 471 (492) T ss_pred HHHHHHHHhcCCcccceeeEEecCCCCCCHHHHHHHHHHHhccCchHHHHHhCC-CCCCHHHHHHHHHHHHHHHHHhhhc Confidence 99987765332 2346899999999999999999988877543321 11 111 12223 244221110 00110 Q ss_pred CCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 433 GDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 433 ~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) ..+....+.+..+..+..++| T Consensus 472 ---~~~~~~~~~~~~~~~~~~~~e 492 (492) T protein:vir:97 472 ---LDDGGADSAQQQERSNNKESE 492 (492) T ss_pred ---cccCCCCCCcccccccccccC Confidence 001111122222223333333 No 136 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=99.00 E-value=6.7e-09 Score=65.39 Aligned_cols=420 Identities=10% Similarity=-0.055 Sum_probs=185.3 Q ss_pred CCch-hHHHHh-HHHHH--HHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhh Q lcl|NC_016762. 1 MTDK-LDLAVN-HAMSS--AIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWK 76 (456) Q Consensus 1 ~~~~-~~~~~~-~a~~~--~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR 76 (456) +..+ +...++ |-... .+.+..+-|.+- -.+=....+..... .+. .--.+.+++.||+..+.=++. T Consensus 40 ~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~-~~il~~~~~~~~~~-~~~---------~ki~~n~~k~Iv~~~~~yl~g 108 (511) T protein:vir:78 40 QNVNEVSKYIEHHMDYQRPRLKVLSDYYEGK-TKNLVELTRRKEEY-MAD---------NRVAHDYASYISDFINGYFLG 108 (511) T ss_pred cCHHHHHHHHHHHHHhhhHHHHHHHHHhhcc-CccccccCcccccc-cCc---------ceeecchHHHHHHHHhhhhcc Confidence 1211 111111 11111 111222222110 00000000000000 010 012468899999999999999 Q ss_pred CCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccc---ccc------C-C Q lcl|NC_016762. 77 TNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDR---PAR------G-K 145 (456) Q Consensus 77 ~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~---Pl~------~-~ 145 (456) +.+++...++.. ...|.+.+++.++.....++.+....||.|++++..+ ||+.-.. |.. . . T Consensus 109 ~p~~~~~~d~~~--------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd~~ 180 (511) T protein:vir:78 109 NPIQYQDDDKDV--------LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFIIYDNTV 180 (511) T ss_pred cCceeecCchHH--------HHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEEEEcCCC Confidence 988886543321 1347777777788888999999999999999988774 4443221 221 0 0 Q ss_pred cCceeEEEEeccccCChhhhhc-cccccccCCcee-EEEeecccCCccc----cceeeehhhhheecCC--cCCCcchHH Q lcl|NC_016762. 146 LNGLAKVTPAWAGCLKPKSFDE-KPDSETYGQPTM-WEYTEASQAGRPG----LVRDIHPDRVFILGDW--TGDAIGFLE 217 (456) Q Consensus 146 ~~~l~~i~~~~~~~~~~~~~~~-Dp~s~~yg~P~~-y~i~~~~~~g~~~----~~~~IH~SRli~~~~~--~~~G~S~le 217 (456) .+.+....-+|.....-. ... .+.--+++.|.. |++.....++... ....-|+--.+.+..+ ..+|.|.++ T Consensus 181 ~~~~~~~vr~~~~~~~~~-~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e 259 (511) T protein:vir:78 181 ERNSIAGVRYLRTKPIDK-TDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYE 259 (511) T ss_pred CCceEEEEEEEEeeeccc-cccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchh Confidence 011111111121100000 000 011111222221 2221111010000 0112233333333332 346889998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCC Q lcl|NC_016762. 218 PAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGA 293 (456) Q Consensus 218 ~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d 293 (456) .+..-+.+++.+.-..+..+...+...+.+......+ ..+........+-....+ .+..-.+.+. T Consensus 260 ~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 328 (511) T protein:vir:78 260 KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD-----------PVEVRKQKEANVLFLEPTVYVDAEGRETEGSV 328 (511) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCC-----------chhhcccccccceeccccceeccccccCCCCc Confidence 8877777777765555544433332223222111111 111110000000000000 0000111222 Q ss_pred c--eeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----HHHHHHHHHHHhhhhHHHHHHHHH Q lcl|NC_016762. 294 T--VTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----YHNARCQARRVQELTFEINDLFAH 366 (456) Q Consensus 294 ~--~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----nyyd~I~~~Qe~~lrp~L~~l~~~ 366 (456) + |-..+.+.+++...++.+.+.|...+++|-.- ++...|.. ++. ++ .-...+. .++..++..|++++.+ T Consensus 329 ~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~-~~~~~~n~--Sg~Al~~~~~~l~~ka~-~~~~~f~~~l~~~~~l 404 (511) T protein:vir:78 329 DGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMK-DDNFSGTQ--SGEAMKYKLFGLEQRTK-TKEGLFTKGLRRRAKL 404 (511) T ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccc-cccccccc--HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH Confidence 3 44456677899999999999999999999752 22222322 332 22 2233444 4456789999998888 Q ss_pred HHHhc--Cc--C---CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHH--HHHcCCcCc-CH-HHHHHHhcc-----c-C Q lcl|NC_016762. 367 LMRIG--VV--P---LKAEFTAIWDDLTVPTKAERLANSKTMSEINSA--AIGTGEPVF-TA-EEIREEAGY-----D-P 429 (456) Q Consensus 367 l~~s~--~~--~---~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~--~~~~g~~~i-~~-~E~R~~~~~-----~-~ 429 (456) ++... .+ . ...++++.|+|-...+.+|.|++..+.+-+... ++.. .+-+ ++ .|+...... . . T Consensus 405 i~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~G~iS~et~l~~-l~~v~d~~~El~ri~~E~~~~~~~~ 483 (511) T protein:vir:78 405 LETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSL-FSFFQDPELEVKKIEEDEKESIKKA 483 (511) T ss_pred HHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHh-CCCCCCHHHHHHHHHHHHHHHHHHH Confidence 75431 11 1 123689999999999999999987777543321 2221 1122 22 344321110 0 0 Q ss_pred CCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 430 LQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 430 ~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) .......++..+.++..++..+.+++| T Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~e~ 510 (511) T protein:vir:78 484 QKGIYKDPRDINDDEQDDDTKDTVDKK 510 (511) T ss_pred hhccccCCCCCCCCCCCCCccCccccc Confidence 000011112222233333334444444 No 137 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=99.00 E-value=6.7e-09 Score=65.39 Aligned_cols=420 Identities=10% Similarity=-0.055 Sum_probs=185.3 Q ss_pred CCch-hHHHHh-HHHHH--HHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhh Q lcl|NC_016762. 1 MTDK-LDLAVN-HAMSS--AIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWK 76 (456) Q Consensus 1 ~~~~-~~~~~~-~a~~~--~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR 76 (456) +..+ +...++ |-... .+.+..+-|.+- -.+=....+..... .+. .--.+.+++.||+..+.=++. T Consensus 40 ~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~-~~il~~~~~~~~~~-~~~---------~ki~~n~~k~Iv~~~~~yl~g 108 (511) T protein:vir:96 40 QNVNEVSKYIEHHMDYQRPRLKVLSDYYEGK-TKNLVELTRRKEEY-MAD---------NRVAHDYASYISDFINGYFLG 108 (511) T ss_pred cCHHHHHHHHHHHHHhhhHHHHHHHHHhhcc-CccccccCcccccc-cCc---------ceeecchHHHHHHHHhhhhcc Confidence 1211 111111 11111 111222222110 00000000000000 010 012468899999999999999 Q ss_pred CCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccc---ccc------C-C Q lcl|NC_016762. 77 TNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDR---PAR------G-K 145 (456) Q Consensus 77 ~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~---Pl~------~-~ 145 (456) +.+++...++.. ...|.+.+++.++.....++.+....||.|++++..+ ||+.-.. |.. . . T Consensus 109 ~p~~~~~~d~~~--------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd~~ 180 (511) T protein:vir:96 109 NPIQYQDDDKDV--------LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFIIYDNTV 180 (511) T ss_pred cCceeecCchHH--------HHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEEEEcCCC Confidence 988886543321 1347777777788888999999999999999988774 4443221 221 0 0 Q ss_pred cCceeEEEEeccccCChhhhhc-cccccccCCcee-EEEeecccCCccc----cceeeehhhhheecCC--cCCCcchHH Q lcl|NC_016762. 146 LNGLAKVTPAWAGCLKPKSFDE-KPDSETYGQPTM-WEYTEASQAGRPG----LVRDIHPDRVFILGDW--TGDAIGFLE 217 (456) Q Consensus 146 ~~~l~~i~~~~~~~~~~~~~~~-Dp~s~~yg~P~~-y~i~~~~~~g~~~----~~~~IH~SRli~~~~~--~~~G~S~le 217 (456) .+.+....-+|.....-. ... .+.--+++.|.. |++.....++... ....-|+--.+.+..+ ..+|.|.++ T Consensus 181 ~~~~~~~vr~~~~~~~~~-~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e 259 (511) T protein:vir:96 181 ERNSIAGVRYLRTKPIDK-TDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYE 259 (511) T ss_pred CCceEEEEEEEEeeeccc-cccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchh Confidence 011111111121100000 000 011111222221 2221111010000 0112233333333332 346889998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC----CCeEEecCCC Q lcl|NC_016762. 218 PAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG----NDVLLPTQGA 293 (456) Q Consensus 218 ~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~lid~~d 293 (456) .+..-+.+++.+.-..+..+...+...+.+......+ ..+........+-....+ .+..-.+.+. T Consensus 260 ~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 328 (511) T protein:vir:96 260 KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD-----------PVEVRKQKEANVLFLEPTVYVDAEGRETEGSV 328 (511) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCC-----------chhhcccccccceeccccceeccccccCCCCc Confidence 8877777777765555544433332223222111111 111110000000000000 0000111222 Q ss_pred c--eeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----HHHHHHHHHHHhhhhHHHHHHHHH Q lcl|NC_016762. 294 T--VTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----YHNARCQARRVQELTFEINDLFAH 366 (456) Q Consensus 294 ~--~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----nyyd~I~~~Qe~~lrp~L~~l~~~ 366 (456) + |-..+.+.+++...++.+.+.|...+++|-.- ++...|.. ++. ++ .-...+. .++..++..|++++.+ T Consensus 329 ~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~-~~~~~~n~--Sg~Al~~~~~~l~~ka~-~~~~~f~~~l~~~~~l 404 (511) T protein:vir:96 329 DGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMK-DDNFSGTQ--SGEAMKYKLFGLEQRTK-TKEGLFTKGLRRRAKL 404 (511) T ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccc-cccccccc--HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH Confidence 3 44456677899999999999999999999752 22222322 332 22 2233444 4456789999998888 Q ss_pred HHHhc--Cc--C---CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHH--HHHcCCcCc-CH-HHHHHHhcc-----c-C Q lcl|NC_016762. 367 LMRIG--VV--P---LKAEFTAIWDDLTVPTKAERLANSKTMSEINSA--AIGTGEPVF-TA-EEIREEAGY-----D-P 429 (456) Q Consensus 367 l~~s~--~~--~---~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~--~~~~g~~~i-~~-~E~R~~~~~-----~-~ 429 (456) ++... .+ . ...++++.|+|-...+.+|.|++..+.+-+... ++.. .+-+ ++ .|+...... . . T Consensus 405 i~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~G~iS~et~l~~-l~~v~d~~~El~ri~~E~~~~~~~~ 483 (511) T protein:vir:96 405 LETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSL-FSFFQDPELEVKKIEEDEKESIKKA 483 (511) T ss_pred HHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHh-CCCCCCHHHHHHHHHHHHHHHHHHH Confidence 75431 11 1 123689999999999999999987777543321 2221 1122 22 344321110 0 0 Q ss_pred CCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 430 LQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 430 ~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) .......++..+.++..++..+.+++| T Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~e~ 510 (511) T protein:vir:96 484 QKGIYKDPRDINDDEQDDDTKDTVDKK 510 (511) T ss_pred hhccccCCCCCCCCCCCCCccCccccc Confidence 000011112222233333334444444 No 138 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=98.99 E-value=4.2e-09 Score=66.49 Aligned_cols=423 Identities=8% Similarity=-0.074 Sum_probs=188.2 Q ss_pred CC----------chhHHHHh-HHHH--HHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhh Q lcl|NC_016762. 1 MT----------DKLDLAVN-HAMS--SAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAV 67 (456) Q Consensus 1 ~~----------~~~~~~~~-~a~~--~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iV 67 (456) |+ +.+.-+++ |-.. ..+.+..+-|.+- -.+ -.+.+....-+.+. .--.+.+++.|| T Consensus 31 ~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~-~~i-~~~~~~~~~~~~~~---------~ki~~n~~k~Iv 99 (511) T protein:vir:10 31 YDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGK-TKN-LVELTRRKEEYMAD---------NRVAHDYASYIS 99 (511) T ss_pred CchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhccc-Ccc-ccccCcccccccCc---------ceeecchHHHHH Confidence 22 11222222 1111 1111222222110 000 00000000000010 012468899999 Q ss_pred ccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcc---cccc Q lcl|NC_016762. 68 EKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWD---RPAR 143 (456) Q Consensus 68 d~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~---~Pl~ 143 (456) +..+.=++.+.+++..+++.. ...|...+++.++...+.++.+....||.|++++..+ ||+.-. .|.. T Consensus 100 ~~~~~yl~g~p~~~~~~d~~~--------~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~dedg~~~i~~~~p~~ 171 (511) T protein:vir:10 100 DFINGYFLGNPIQYQDDDKDV--------LEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRNQDDETRLYKSDAMS 171 (511) T ss_pred HHHhhhhcccCceeecCchHH--------HHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccce Confidence 999988899998886543321 1347777777788888999999999999999888874 344322 2221 Q ss_pred ------CC-cCceeEEEEeccccCChhhhhcc-ccccccCCce-eEEEeecccCCc----cccceeeehhhhheecCC-- Q lcl|NC_016762. 144 ------GK-LNGLAKVTPAWAGCLKPKSFDEK-PDSETYGQPT-MWEYTEASQAGR----PGLVRDIHPDRVFILGDW-- 208 (456) Q Consensus 144 ------~~-~~~l~~i~~~~~~~~~~~~~~~D-p~s~~yg~P~-~y~i~~~~~~g~----~~~~~~IH~SRli~~~~~-- 208 (456) .. ...+....-+|.....- ....+ ..-.+++.+. .|++.....++. ......-|+=..+.+..+ T Consensus 172 ~~~vydd~~~~~~~~~vr~~~~~~~d-~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~n 250 (511) T protein:vir:10 172 TFVIYDNTIERNSIAGVRYLRTKPID-KTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSN 250 (511) T ss_pred eEEEEcCCCCCceEEEEEEEEeeecc-cCccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcceeEEEecC Confidence 00 01111111111110000 00000 0001111121 122211110000 000112243333333332 Q ss_pred cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhc----CC Q lcl|NC_016762. 209 TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNR----GN 284 (456) Q Consensus 209 ~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~----~~ 284 (456) ..+|.|.++.+..-+.+++.+.-..+..+...+...+.+..... .+..+..+.....+..+.. .. T Consensus 251 n~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (511) T protein:vir:10 251 NERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN-----------LDPVEVRKQKEANVLFLEPTVYADS 319 (511) T ss_pred CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecccc-----------CCchhhccchhccceeccccccccc Confidence 34688999988887777777765555544333333332221111 1111111100000000111 11 Q ss_pred CeEEecCCCcee--EEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH---HHHHHHHHHHHHHHhhhhHH Q lcl|NC_016762. 285 DVLLPTQGATVT--QMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE---DQKYHNARCQARRVQELTFE 359 (456) Q Consensus 285 ~~~lid~~d~~~--~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~---D~~nyyd~I~~~Qe~~lrp~ 359 (456) +..-.+.+.+++ ..+.+.+++...++.+.+.|...+++|-.-. +...|..+|.. -...-...+. .++..++.. T Consensus 320 ~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~-~~~~~n~Sg~Al~~~~~~l~~k~~-~k~~~f~~~ 397 (511) T protein:vir:10 320 EGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKD-DNFSGTQSGEAMKYKLFGLEQRTK-TKEGLFTKG 397 (511) T ss_pred ccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccc-ccccccchHHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 111122233443 4566778999999999999999999998533 22223332222 1122334444 445678999 Q ss_pred HHHHHHHHHHh-c--Cc-C---CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHHcCCcCcCH-HHHHHHhcc-- Q lcl|NC_016762. 360 INDLFAHLMRI-G--VV-P---LKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAIGTGEPVFTA-EEIREEAGY-- 427 (456) Q Consensus 360 L~~l~~~l~~s-~--~~-~---~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~~g~~~i~~-~E~R~~~~~-- 427 (456) |++++.+++.. + .+ . ...+++|.|+|-...+.++.+++..+.+-+.. +++..--.+-++ .|+...... T Consensus 398 l~~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~G~iS~et~~~~l~~v~d~~~E~~ri~~E~~ 477 (511) T protein:vir:10 398 LRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEK 477 (511) T ss_pred HHHHHHHHHHHHHhhCCcccccccceeeEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHH Confidence 99998887543 1 11 1 12368999999999999999998777653322 222211012223 344322111 Q ss_pred ---cC-CCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 428 ---DP-LQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 428 ---~~-~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) .. .......++..+.++..++.++.+++| T Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (511) T protein:vir:10 478 ESIKKAQKGIYKDPRDINDDEQDDDTKDTVDKK 510 (511) T ss_pred HHHHHHhhhcccCCCCCCCCCCCCcccCccccc Confidence 00 000111122222233334444444444 No 139 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=98.97 E-value=3.5e-09 Score=66.95 Aligned_cols=437 Identities=11% Similarity=-0.003 Sum_probs=193.1 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhh--------hccCcccCCHHHHHHHHhcCchhhhhhccchh Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWC--------EYGFPQEITFNDLYTMYRRGGIAHGAVEKIVT 72 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~--------~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~ae 72 (456) |+.=-.++.=..++.. +...++....++.+. +-..|. ........-..-...+|+.|++++++|+.... T Consensus 3 ~p~~~~~~~~~~~~~~--~~~~~y~~~a~~~~~-~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~ 79 (533) T protein:vir:34 3 TPTIPTLLGPDGMTSL--REYAGYHGGGSGFGG-QLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQD 79 (533) T ss_pred CchhhhhhcccccchH--HHHHhhhhccCCCCC-cccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHH Confidence 4411111110111111 111123332222211 111110 00000111123456789999999999999999 Q ss_pred HHhhCCCEEecCCCc----chhhhhHHHHHHHHHHHHHh--------------hHHHHHHHHHHhhcccCceEEEEEecC Q lcl|NC_016762. 73 TCWKTNPQVIEGDDQ----DRSKDETEWERKNKPLIAGG--------------RFWRAVSEADRRRLVGRYSGLLLHIRD 134 (456) Q Consensus 73 d~tR~~~~i~~~~~~----d~~~~~~~~e~~i~~~~~~l--------------~~~~~~~ea~~~~r~~Ggs~i~i~i~D 134 (456) ..+=.|+++...-+. -..+...+|.++|++++++. .+.+....+++--...|-+++.+...- T Consensus 80 nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~ 159 (533) T protein:vir:34 80 HIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDT 159 (533) T ss_pred HhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeeecc Confidence 999899987532110 01122234555666665431 233333334444445566666554421 Q ss_pred CCCccccccCCcCceeEEEEeccccCCh------hhhh-ccccccccCCceeEEEeecccCCccc-------cceeeehh Q lcl|NC_016762. 135 SQPWDRPARGKLNGLAKVTPAWAGCLKP------KSFD-EKPDSETYGQPTMWEYTEASQAGRPG-------LVRDIHPD 200 (456) Q Consensus 135 ~~~~~~Pl~~~~~~l~~i~~~~~~~~~~------~~~~-~Dp~s~~yg~P~~y~i~~~~~~g~~~-------~~~~IH~S 200 (456) .....-|++ |..+....|.. +.+. .-..--.+|+|..|+|.....+|... ....|+.+ T Consensus 160 ~~g~~~~~~--------lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~ 231 (533) T protein:vir:34 160 SSSRLFRTQ--------FRMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRA 231 (533) T ss_pred CCCCccceE--------EEEechhhcCCCCCCCCCCceEeeeEECCCCCeEEEEEeecCCCCccccccceeeeeeccChh Confidence 111111221 22222222211 0000 11222358999999997443333211 12447788 Q ss_pred hhheecCC----cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhh-cCCHHHHHHHHHH Q lcl|NC_016762. 201 RVFILGDW----TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTY-GVTLDALNERFNE 275 (456) Q Consensus 201 Rli~~~~~----~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~-~~~~~~~~~~~~~ 275 (456) +|+|+-.. ...|+|.+-+++..+.+++.-....-..-.-+++-...++. ...-....+.+ +.........+.. T Consensus 232 ~VlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (533) T protein:vir:34 232 SFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIES--ELDTQSAMDFILGANSQEQRERLTG 309 (533) T ss_pred HeeeeccccCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeec--CCCcccccccccCCCcccccccccc Confidence 88876432 35699999999998887766544321111111111111110 00000000111 1100111111100 Q ss_pred ----HHH-------HHhcCCCeEE-ecCCCceeEEecc--cCCHHHHHHHHHHHHHhhhcCCeEEeecc-CCCcccch-H Q lcl|NC_016762. 276 ----AAR-------QLNRGNDVLL-PTQGATVTQMVSA--VSDPGPTYNVNLQTAAAGVDIPTKILVGM-QTGERASS-E 339 (456) Q Consensus 276 ----~~~-------~~~~~~~~~l-id~~d~~~~~~~~--~sgl~~~~~~~~~~~aaas~IP~t~L~G~-sp~Glnst-~ 339 (456) ... .+.. +++. +..+++++.++.+ -++..+....+...||+..|||.-.|.|- |-...+|. . T Consensus 310 ~~~~~~~~~~~~~~~l~p--G~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~ 387 (533) T protein:vir:34 310 WIGEIAAYYAAAPVRLGG--AKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARA 387 (533) T ss_pred cchhhhhccCcceeeccC--ceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHH Confidence 000 0111 2222 3457788777654 57899999999999999999999989884 22333333 3 Q ss_pred HHHHHHHHHHHHHHhhhhHH----HHHHHHHHHHhcCcCCCCceEEEe--------CCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 340 DQKYHNARCQARRVQELTFE----INDLFAHLMRIGVVPLKAEFTAIW--------DDLTVPTKAERLANSKTMSEINSA 407 (456) Q Consensus 340 D~~nyyd~I~~~Qe~~lrp~----L~~l~~~l~~s~~~~~~~d~~~~f--------~pL~~~seke~Aei~~~~A~a~~~ 407 (456) .+..+...++.+|.....+. -+.+++..+.++.++.|....+.| +--|.+....-.| -.|.+++... T Consensus 388 ~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iD-P~Ke~~a~~~ 466 (533) T protein:vir:34 388 SANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAID-GLKEVQEAVM 466 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccC-hHHHHHHHHH Confidence 67778888888887665544 444455455666666543211110 1122222222222 1245666666 Q ss_pred HHHcCCcCcCH-----------HHHHHH-------hcccCCCCCCCCcccCCCCC-CCCCcCCCCCCC Q lcl|NC_016762. 408 AIGTGEPVFTA-----------EEIREE-------AGYDPLQGGDPLPDTEPEDE-DAARTDPTGEQQ 456 (456) Q Consensus 408 ~~~~g~~~i~~-----------~E~R~~-------~~~~~~~~~~~~~~~~~~d~-~~~~~d~~~~~e 456 (456) .+.+| ..|. +|+.+. ...-++... .++-...... .+++++|..+.. T Consensus 467 ~i~~G--~~s~~~~~a~~G~D~~ev~~q~a~e~~~~~~~gl~~~-~~~~~~~~s~~~~~~~~~~~~~~ 531 (533) T protein:vir:34 467 LIEAG--LSTYEKECAKRGDDYQEIFAQQVRETMERRAAGLKPP-AWAAAAFESGLRQSTEEEKSDSR 531 (533) T ss_pred HHHcC--CCCHHHHHHHcCCCHHHHHHHHHHHHHHHHhcCCCCC-CCCCcCccCCCCCCCCCCcccCC Confidence 67666 3333 333211 111112110 0000001111 111122222222 No 140 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=98.97 E-value=3.4e-09 Score=67.00 Aligned_cols=407 Identities=10% Similarity=-0.030 Sum_probs=177.7 Q ss_pred CCch-hHHHHh-H-HHHHH-HHHHHHHHhhhhhccCcccchh-hhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHh Q lcl|NC_016762. 1 MTDK-LDLAVN-H-AMSSA-IARARMSLLNQGIGHDAKRPQA-WCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCW 75 (456) Q Consensus 1 ~~~~-~~~~~~-~-a~~~~-~~~~~d~~~n~~~~~gt~~~~~-~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~t 75 (456) |++. ++-.++ | ..+.. +.+..+-|.+--.-+ ..+.+. ....+.+ ...-.+++++.||+..+.=++ T Consensus 30 ~~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i-~~~~~~~~~~~~~~---------~~ki~~n~~~~ivd~~~~~l~ 99 (481) T protein:vir:10 30 LKEENLRNFISRHQTEQVPRLEMLESYYLNRNTDI-LAGERRLQKYGDKA---------DHRAVHNYAKYVSRFIVGYLT 99 (481) T ss_pred cCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc-ccCccccccccccc---------cceeecchHHHHHHHHHhhhc Confidence 2222 111122 1 11101 111111111100000 000000 0000111 011257899999999999899 Q ss_pred hCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcc---ccccC------- Q lcl|NC_016762. 76 KTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWD---RPARG------- 144 (456) Q Consensus 76 R~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~---~Pl~~------- 144 (456) .+.++++..++.. ...+.+.+++.++...+.++.+...++|.|++++.++ ||+..- .|... T Consensus 100 g~~~~~~~~d~~~--------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~~i~~~~p~~~~~v~d~~ 171 (481) T protein:vir:10 100 GNPITITHQDNQT--------NDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDFEDRDTFKVLDPKSTFVVYDQT 171 (481) T ss_pred cCCceEecCChhH--------HHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCeEEEEEEcccceEEEEcCC Confidence 8988886543322 1247777888889999999999999999999988774 333211 12110 Q ss_pred CcCceeEEEEeccccCChhhhhcccccc------ccCCceeEEEeecccCCcc-ccceeeeh-hhhheecC--CcCCCcc Q lcl|NC_016762. 145 KLNGLAKVTPAWAGCLKPKSFDEKPDSE------TYGQPTMWEYTEASQAGRP-GLVRDIHP-DRVFILGD--WTGDAIG 214 (456) Q Consensus 145 ~~~~l~~i~~~~~~~~~~~~~~~Dp~s~------~yg~P~~y~i~~~~~~g~~-~~~~~IH~-SRli~~~~--~~~~G~S 214 (456) ..+.+....-+|.. .+.... -|..-..|++... ++.. .-...-|. .+| .+.. ...+|.| T Consensus 172 ~~~~~~~~i~~~~~--------~~~~~~~~~~~~~y~~~~i~~~~~~--~~~~~~~~~~~~~~g~v-Pvv~~~n~~~g~~ 240 (481) T protein:vir:10 172 LDKKVVAGVRYFEK--------QDKDKVPVQHVEVYTTDKIYYIEIK--GGTYHRVEEVEHYYNDV-PIIEYLNDQFKQG 240 (481) T ss_pred CCCceEEEEEEEEE--------eeCCCceEEEEEEEecCeEEEEEec--CCceeecccccccCCce-eEEEeecCCCCCC Confidence 00111111111110 011010 1111111222110 1100 00011222 222 1211 2356888 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCc Q lcl|NC_016762. 215 FLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGAT 294 (456) Q Consensus 215 ~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~ 294 (456) .++.+..-+.+++.+....+..+-..+...+.+.. .. ..+.+.........+............+.+.+ T Consensus 241 ~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g--------~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (481) T protein:vir:10 241 DFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIG--------NV---DLDSEDAKAFRDANMIHLEPGTNANGSEGKAE 309 (481) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeec--------Cc---CCCccchhhhhhccceeccccccccCCCCCcc Confidence 88887776667777655544433322222222211 10 01111111100000000001111111112223 Q ss_pred ee--EEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-H----HHHHHHHHHHHHhhhhHHHHHHHHHH Q lcl|NC_016762. 295 VT--QMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-Q----KYHNARCQARRVQELTFEINDLFAHL 367 (456) Q Consensus 295 ~~--~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~----~nyyd~I~~~Qe~~lrp~L~~l~~~l 367 (456) ++ ..+.+.+++...++...+.+...+++|-. -+|...| |.+++ + ..-...++.+ +..++..|++++.++ T Consensus 310 ~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~-~~~~~~~--n~Sg~Al~~~~~~l~~k~~~~-~~~~~~~l~~~~~li 385 (481) T protein:vir:10 310 VKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDL-NDEQFSG--VQSGESMKYKLFGLEQVRAIK-ERLFKKGLMKRYKLL 385 (481) T ss_pred eeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc-ccccccc--ccHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH Confidence 33 45556688999999999999999999963 4443222 22332 2 2334445544 457899999999887 Q ss_pred HHh-cC--cC--CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHH-HcCCcCc-CH-HHHHHHhccc-CCCCCCCC Q lcl|NC_016762. 368 MRI-GV--VP--LKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAI-GTGEPVF-TA-EEIREEAGYD-PLQGGDPL 436 (456) Q Consensus 368 ~~s-~~--~~--~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~-~~g~~~i-~~-~E~R~~~~~~-~~~~~~~~ 436 (456) ++. +. +. ...++++.|+|-...++++.|++..+.+-... +++ ..+ -+ ++ +|+....... ........ T Consensus 386 ~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl~g~is~et~~~~l~--~i~d~~~E~~ri~~E~~~~~~~~~~ 463 (481) T protein:vir:10 386 LNNVNLTGLKQHNYAELTITFTPNLPKSMMESINAFNALSGGVSESTRLSLLD--FIDNPKEELEKMQEEEAQREKQADK 463 (481) T ss_pred HHHHhccCCCccccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCC--CCCCHHHHHHHHHHHHHHHHhhhhh Confidence 654 22 11 22478999999999999999998776543221 111 111 22 22 2332211100 00000000 Q ss_pred cccCCCCCCCCCcC-CCC Q lcl|NC_016762. 437 PDTEPEDEDAARTD-PTG 453 (456) Q Consensus 437 ~~~~~~d~~~~~~d-~~~ 453 (456) ....++.++..++| ..| T Consensus 464 ~~~~~~~~~~~~~dd~~g 481 (481) T protein:vir:10 464 RGYGEAFENHLNVDDSNG 481 (481) T ss_pred ccCCccCCCCCCCCCCCC Confidence 01111111111111 122 No 141 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=98.96 E-value=4.9e-09 Score=66.14 Aligned_cols=414 Identities=8% Similarity=-0.019 Sum_probs=175.4 Q ss_pred CCchhHHHHhH--------HHHHHHHHHHHHHhhhhhccCcccch--hhhhccCcc------------cC-CHHHHHHHH Q lcl|NC_016762. 1 MTDKLDLAVNH--------AMSSAIARARMSLLNQGIGHDAKRPQ--AWCEYGFPQ------------EI-TFNDLYTMY 57 (456) Q Consensus 1 ~~~~~~~~~~~--------a~~~~~~~~~d~~~n~~~~~gt~~~~--~~~~~~~~~------------~~-~~~~l~~~Y 57 (456) |+++++.=-.. ++.....-+..-+..+...+.....+ .+..|+-.. .. ........- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~k 80 (474) T protein:vir:95 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWR 80 (474) T ss_pred CcceeecCCCCchhhHHHHhhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhccccccccccccccccccce Confidence 44443321110 00000000011111111111111000 011110000 00 000000001 Q ss_pred hcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCC Q lcl|NC_016762. 58 RRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQ 136 (456) Q Consensus 58 ~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~ 136 (456) -.+.+++.||+..+.=++.+.+++...++. . .+.++..++ -++...+.++.+....+|.+++++.++ +|+ T Consensus 81 i~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~-~-------~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~ 151 (474) T protein:vir:95 81 ITTNFHQNLVDQKVSYVASKPVTYSCEDES-V-------LKIIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYINENGE 151 (474) T ss_pred eccchHHHHHHHHHhhhccCCceeccCchH-H-------HHHHHHHHh-ccHHHHHHHHHHHHhhcCcEEEEEEecCCCc Confidence 146889999999999999999998543321 1 122444443 367788889999999999999988874 333 Q ss_pred CccccccCCcCceeEEEEeccccCC--h----hhhhc-cccccccCCce---eEEEeecccCCc-------cccceeeeh Q lcl|NC_016762. 137 PWDRPARGKLNGLAKVTPAWAGCLK--P----KSFDE-KPDSETYGQPT---MWEYTEASQAGR-------PGLVRDIHP 199 (456) Q Consensus 137 ~~~~Pl~~~~~~l~~i~~~~~~~~~--~----~~~~~-Dp~s~~yg~P~---~y~i~~~~~~g~-------~~~~~~IH~ 199 (456) ..-..+.. ..+.|+|..... + ..+.. +...-.++.|. +|.......... ......-|. T Consensus 152 ~~i~~~~p-----~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (474) T protein:vir:95 152 MKLFRVPA-----EQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGN 226 (474) T ss_pred eEEEEEcc-----cceEEEEcCCCCCceEEEEEEEEEcCeeEEEEEeCCeEEEEEEcCCccccccccCcccccccccccC Confidence 21111110 011122211000 0 00000 00011111121 122111000000 000111233 Q ss_pred hhhheecCC--cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHH Q lcl|NC_016762. 200 DRVFILGDW--TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAA 277 (456) Q Consensus 200 SRli~~~~~--~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 277 (456) --.+.+..+ +..|.|.++.+.+-+.+++.+.-..+..+-..+...+.+. + + .+.........+ T Consensus 227 ~g~iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~-----g---~---~~~~~~~~~~~~---- 291 (474) T protein:vir:95 227 WGRVPFIAFKNNPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILK-----G---Y---EGQDLEEFMRGL---- 291 (474) T ss_pred CCccceEeecCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee-----c---C---Ccccchhhhhhh---- Confidence 222333322 3458899988887777777765555443322222222111 1 0 111111111111 Q ss_pred HHHhcCCCeEEecCCCce--eEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH---HHHHHHHHHHHHH Q lcl|NC_016762. 278 RQLNRGNDVLLPTQGATV--TQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE---DQKYHNARCQARR 352 (456) Q Consensus 278 ~~~~~~~~~~lid~~d~~--~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~---D~~nyyd~I~~~Q 352 (456) ....++.++++.+. -..+.+.+++...++.+.+.|...+++|-. .++.-.|..+|.. -...-...+. .. T Consensus 292 ----~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~-~~~~~~~n~Sg~Alk~~~~~l~~k~~-~k 365 (474) T protein:vir:95 292 ----KYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRAYIMEFGQGVDF-QTDKFGSAPSGIALKFLYGNLDLKAN-KL 365 (474) T ss_pred ----hccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccc-ccccccccchHHHHHHHHHHHHHHHH-HH Confidence 22334445555554 455678889999999999999999999952 2222222222221 1122333444 34 Q ss_pred HhhhhHHHHHHHHHHHHhcCc-CCCCceEEEeCCCCCCCHHHHHHHHHHHHHHH--HHHHHcCCcCcCHH-HHHHHhcc- Q lcl|NC_016762. 353 VQELTFEINDLFAHLMRIGVV-PLKAEFTAIWDDLTVPTKAERLANSKTMSEIN--SAAIGTGEPVFTAE-EIREEAGY- 427 (456) Q Consensus 353 e~~lrp~L~~l~~~l~~s~~~-~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~--~~~~~~g~~~i~~~-E~R~~~~~- 427 (456) +..++..|++++.++...... ....+++|.|+|-...+++|.|++..+ |-+. .+++..--.+-+++ |+...... T Consensus 366 ~~~~~~~l~~~~~li~~~~g~~~d~~~i~v~f~~~~p~d~~e~a~~~~~-~g~iS~et~i~~l~~v~d~~~E~~ri~~E~ 444 (474) T protein:vir:95 366 KNKATVAIQELIGFIIDFNNLKMDVKDIEISFNFNRMMNDAEQSQIIAQ-SQYLSRETLVKSSPLVDDYKAELERIEQEQ 444 (474) T ss_pred HHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCCCcCHHHHHHHHHh-cCCCchHHHHHhCCCCCCHHHHHHHHHHHH Confidence 456899999999887665333 334688999999999999999887543 2211 12222110122332 33221110 Q ss_pred -------cCCCCCCCCcccCCCCCCCCCcC Q lcl|NC_016762. 428 -------DPLQGGDPLPDTEPEDEDAARTD 450 (456) Q Consensus 428 -------~~~~~~~~~~~~~~~d~~~~~~d 450 (456) ........+...+++.++..+++ T Consensus 445 ~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 474 (474) T protein:vir:95 445 MEYNKQLPNLDDGGADGAQQQERSNDKESE 474 (474) T ss_pred HHHHhcccccccccCCCCcCCCCCccCCCC Confidence 01111111110111110111111 No 142 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=98.96 E-value=9.5e-09 Score=64.57 Aligned_cols=420 Identities=10% Similarity=-0.016 Sum_probs=172.6 Q ss_pred CCch---hHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhC Q lcl|NC_016762. 1 MTDK---LDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKT 77 (456) Q Consensus 1 ~~~~---~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~ 77 (456) +.+. .+|...|..+..+-+....|... +.+ ... .+... ++++........++++|||..++=+.=+ T Consensus 7 ~d~~~~i~~L~~~~~~~~~r~~~~~~Yy~g------~~~--i~~--~~~~~-~~~~~~~~~~~n~~~~ivd~~a~~l~~~ 75 (488) T protein:vir:23 7 IDPEKLRDQLLDAFENKQNELKSSKAYYDA------ERR--PDA--IGLAV-PLDMRKYLAHVGYPRTYVDAIAERQELE 75 (488) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHhc------ccc--hhh--cCccc-chhhhhhhhhcchHHHHHHHHHHhhhcc Confidence 3333 23333333333333333444321 111 000 11111 2233333345777899999999877777 Q ss_pred CCEEecCCCcch-hhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCcee-----E Q lcl|NC_016762. 78 NPQVIEGDDQDR-SKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLA-----K 151 (456) Q Consensus 78 ~~~i~~~~~~d~-~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~-----~ 151 (456) ||.+......+. .....+....+.+.+++-++-....++.+...+||.|++++..+.+.....+-.+.. .|. . T Consensus 76 Gf~~~~~~~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~-~i~~~~p~~ 154 (488) T protein:vir:23 76 GFRIPSANGEEPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVP-LIRVEPPTA 154 (488) T ss_pred ceeccCCcccccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcc-eEEEeccce Confidence 887632221110 000111123477778888888889999999999999988876543221110100000 011 1 Q ss_pred EEEecccc---CChhh-hhcc-----ccccccCCce-eEEEeecccCCccc-cceeeehhhh---heecC----CcCCCc Q lcl|NC_016762. 152 VTPAWAGC---LKPKS-FDEK-----PDSETYGQPT-MWEYTEASQAGRPG-LVRDIHPDRV---FILGD----WTGDAI 213 (456) Q Consensus 152 i~~~~~~~---~~~~~-~~~D-----p~s~~yg~P~-~y~i~~~~~~g~~~-~~~~IH~SRl---i~~~~----~~~~G~ 213 (456) +.|+|-.. +...- +..+ ...-.++.|. .|++.. .+|... ....-|.=-. +.|.. ...+|. T Consensus 155 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~--~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~ 232 (488) T protein:vir:23 155 LYAEVDPRTRKVLYAIRAIYGADGNEIVSATLYLPDTTMTWLR--AEGEWEAPTSTPHGLEMVPVIPISNRTRLSDLYGT 232 (488) T ss_pred eEEEEecCCCceEEEEEEEEecCCCcEEEEEEEecCcEEEEEe--cCCceEeccccccCCCCcceEEeccccccCCcCCc Confidence 12222100 00000 0000 0000111111 111110 011100 0011132221 22321 123688 Q ss_pred chHHH-HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCe-EEecC Q lcl|NC_016762. 214 GFLEP-AYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDV-LLPTQ 291 (456) Q Consensus 214 S~le~-~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~lid~ 291 (456) |.+++ +..-+.+++++.-..+..+--.+..++.+... ...+.......+ ... ++...+. .++.+ T Consensus 233 s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~---~~~~~~~~~~~~----~~~-------~~~~~~~v~~~~~ 298 (488) T protein:vir:23 233 SEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGA---KPEELGINAETG----QRM-------FDAYMARILAFEG 298 (488) T ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCC---Cccccccccccc----chh-------hhhhhhhhccCCC Confidence 87764 33333344444333222221112222322211 111111000000 000 1111122 22233 Q ss_pred CCceeEEec---ccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----HHHHHHHHHHHhhhhHHHHHH Q lcl|NC_016762. 292 GATVTQMVS---AVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----YHNARCQARRVQELTFEINDL 363 (456) Q Consensus 292 ~d~~~~~~~---~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----nyyd~I~~~Qe~~lrp~L~~l 363 (456) +++.+..+. ++....+.+.....++|+.+++|..-| |.+...- ++++ ++ .....++ .++..+.+.|+++ T Consensus 299 g~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~-g~~~~n~-~Sg~Al~~~~~~l~~k~~-~~~~~f~~~l~~~ 375 (488) T protein:vir:23 299 GEGAHAEQFSAAELRNFVDALDALDRKAASYSGLPPQYL-SSSSDNP-ASAEAIKAAESRLVKKVE-RKNKIFGGAWEQA 375 (488) T ss_pred CCCceeEecCCCChHHHHHHHHHHHHHHhcccCCCHHHh-ccccCcc-hHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH Confidence 444444443 334556666667889999999998655 5433321 2332 22 3334444 4445689999999 Q ss_pred HHHHHHhcCcC----CCCceEEEeCCCCCCCHHHHHHHHHHHHHHH------HHH-HHcCCcCcCH--HHHHHH------ Q lcl|NC_016762. 364 FAHLMRIGVVP----LKAEFTAIWDDLTVPTKAERLANSKTMSEIN------SAA-IGTGEPVFTA--EEIREE------ 424 (456) Q Consensus 364 ~~~l~~s~~~~----~~~d~~~~f~pL~~~seke~Aei~~~~A~a~------~~~-~~~g~~~i~~--~E~R~~------ 424 (456) +.+++....+. ...++++.|.+-..+|..+.|+...|.+++. .++ ...| -++. +|+++. T Consensus 376 ~~l~~~~~~~~~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~--~~~d~~~~~~~~~~~~~~ 453 (488) T protein:vir:23 376 MRLAYKMVKGGDIPTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMG--YTIVEREQMRQWLEQDQK 453 (488) T ss_pred HHHHHHHhcCCCcchhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCC--CCchHHHHHHHHHHHHHH Confidence 99887653331 1247889999999999999998877665532 111 1122 1111 122111 Q ss_pred ---hcccCCCCCCCCcccCC---CCCCCCCcCCCC Q lcl|NC_016762. 425 ---AGYDPLQGGDPLPDTEP---EDEDAARTDPTG 453 (456) Q Consensus 425 ---~~~~~~~~~~~~~~~~~---~d~~~~~~d~~~ 453 (456) ..++.+......++... +.+.+++-++-| T Consensus 454 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~a 488 (488) T protein:vir:23 454 QGLGLIGSLYGASTPEGKPGEAPVGEPPAPEPDAA 488 (488) T ss_pred HHHHHHHHHhccCCCcccCCCCCCCCCCCCCCCCC Confidence 01111111111111000 001111111111 No 143 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=98.96 E-value=7.5e-10 Score=70.60 Aligned_cols=374 Identities=10% Similarity=0.010 Sum_probs=168.5 Q ss_pred HHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHh-cCchhhhhhccchhHHhhCCCEEecCC Q lcl|NC_016762. 7 LAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYR-RGGIAHGAVEKIVTTCWKTNPQVIEGD 85 (456) Q Consensus 7 ~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~-~~~l~r~iVd~~aed~tR~~~~i~~~~ 85 (456) |+. | ...-+.++.|.. |..+-+ ..+.. -++++...|+ ..+.+++|||..++=+.=+||+. .+ T Consensus 1 l~~-~---~~r~~~~~~yY~-----g~~~~~-----~~~~~-~p~~~~~~~~~v~nw~~~~Vds~a~rl~~~Gf~~--~d 63 (410) T protein:vir:95 1 MNL-Y---QSRVNLRYKHYA-----MQHYEA-----PTGIT-IPAHIRAKYQAVLGWAAKGVDSLADRLIFRAFAN--DD 63 (410) T ss_pred CCc-c---hhhHHHHHHHhc-----CCCCcc-----ccchh-ccHHHHhHHHhhcchhHHHHHHhHhhhccccccC--CC Confidence 211 1 112222333322 110000 01111 1234544444 34778999999999887788762 11 Q ss_pred CcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccc---ccc------CCcCceeEEEEe Q lcl|NC_016762. 86 DQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDR---PAR------GKLNGLAKVTPA 155 (456) Q Consensus 86 ~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~---Pl~------~~~~~l~~i~~~ 155 (456) . .+.+.+.+-++-....++.+-+..||.|++++.-+ |+++.-. |.. ...+.+..-..+ T Consensus 64 ---~---------~l~~i~~~N~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~~i~~~sP~~~~~i~Dp~~~~~~~al~~ 131 (410) T protein:vir:95 64 ---F---------NVTEIFDRNNPDIFFDSAILSALIGSCSFVYISKGEDDEVRLQVIESSNATGVIDPITGLLVEGYAV 131 (410) T ss_pred ---c---------hHHHHHhhcChHHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEEeCCCCceEEEEEE Confidence 1 13444566677777888888899999998887543 3332111 111 000111111111 Q ss_pred ccccCChhhhhccccccccCCce-eEEEeecccCCccccceeeehhhh---heecC----CcCCCcchH-HHHHHHHHHH Q lcl|NC_016762. 156 WAGCLKPKSFDEKPDSETYGQPT-MWEYTEASQAGRPGLVRDIHPDRV---FILGD----WTGDAIGFL-EPAYNSFISL 226 (456) Q Consensus 156 ~~~~~~~~~~~~Dp~s~~yg~P~-~y~i~~~~~~g~~~~~~~IH~SRl---i~~~~----~~~~G~S~l-e~~~~~l~~~ 226 (456) |... -...+....++.|. .|++.. +|.. ...-|+--+ ++|.. ...+|.|.+ +++..-..++ T Consensus 132 ~~~~-----~~~~~~~~~~~~~~~~~~~~~---~~~~--~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~ 201 (410) T protein:vir:95 132 LARD-----DYNRPTLEAYFEPNATHFIPK---DGEP--YSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYA 201 (410) T ss_pred EEec-----CCCeEEEEEEEeCCcEEEEee---CCcc--ccccCCCCCcceEEecccccCCccCCccccchhHHHHHHHH Confidence 1000 00012222222222 222221 1111 011132222 33321 134688855 6676655555 Q ss_pred HHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCC-----ceeEE-ec Q lcl|NC_016762. 227 EKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGA-----TVTQM-VS 300 (456) Q Consensus 227 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d-----~~~~~-~~ 300 (456) .++.-...-...-.+..+..+. . + +..+.+.+ .+...+. .+..+.+++ ++.++ +. T Consensus 202 ~r~~~~~~~~~e~~a~pqr~i~-----G---~-d~d~~~~~----~~~~~~~------~i~~~~~~~~~~~~~v~q~~~~ 262 (410) T protein:vir:95 202 KRTLERADITAEFYSWPQKYIL-----G---L-DPDAEPME----KWKATVS------SLLTISSSDKGVKPSVGQFTTA 262 (410) T ss_pred HHHHHHHHHHHHHhcchhheee-----c---c-CCCCCcCc----hhhhhhh------hheeccCCCCCCcceEEecCCC Confidence 5553321111111122222221 1 1 11111111 1111110 122333322 23333 23 Q ss_pred ccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-----HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhc--Cc Q lcl|NC_016762. 301 AVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-----QKYHNARCQARRVQELTFEINDLFAHLMRIG--VV 373 (456) Q Consensus 301 ~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-----~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~--~~ 373 (456) ++.+.-+.+.....++|+.++||..-|-|.+- - +++++ ....-..++.+|+ .+.+.+++++.+.+... .. T Consensus 263 ~l~~~~~~l~~l~~~~a~~s~lP~~~lg~~~~-N-psSa~Al~a~~~~L~~ka~~k~~-~fg~~l~~~~rla~~i~~~~~ 339 (410) T protein:vir:95 263 SMSPFTEQLRTAAAGFAGEMGLTLDDLGFVSD-N-PSSVEAIKASHENLRLAGRKAQR-SLGAGLLNVAYVAACLRDEFR 339 (410) T ss_pred ChHHHHHHHHHHHHHHhhhcCCCHHHhccccC-c-hhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCC Confidence 56667788888899999999999886655542 1 13332 3445667776665 56899999988754432 22 Q ss_pred CCCC---ceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCCCCCCcC Q lcl|NC_016762. 374 PLKA---EFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDEDAARTD 450 (456) Q Consensus 374 ~~~~---d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d 450 (456) ..+. +..+.|.|+..++-... ...|++..++.++|.++.+.+-+++.+++.+......- .++....+. T Consensus 340 ~~~~~~~~~~v~W~p~~d~~~~s~----a~~aDa~~Kl~~a~~g~~~~~~~~~~lg~~~~~~~~~~-----~~e~~~~g~ 410 (410) T protein:vir:95 340 YTRSQFVRTAVKWEPLFEADANTM----TMIGDGVVKLNQALPGYINAETIRDLTGIAGDMSAKPV-----VSEGGSNGE 410 (410) T ss_pred CcccccceeeEEeeecCCcchhhH----HHHHHHHHHHHHhccCCccHHHHHHhcCCChHHHHHHH-----HHHHHhCCC Confidence 2222 46788997766543332 23456666666666556666666776666432110000 001111111 No 144 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=98.95 E-value=3.4e-09 Score=66.99 Aligned_cols=407 Identities=10% Similarity=-0.015 Sum_probs=177.9 Q ss_pred CCchhHHHH--------hHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchh Q lcl|NC_016762. 1 MTDKLDLAV--------NHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVT 72 (456) Q Consensus 1 ~~~~~~~~~--------~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~ae 72 (456) ...+.++.. .|..+....+....|...--.+-....+.+...-......... -.+.+++.|||..+. T Consensus 38 ~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~r-----i~~n~~k~Ivd~~~~ 112 (492) T protein:vir:94 38 TNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDR-----MITNFHANLVDQKVS 112 (492) T ss_pred cCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccc-----cccchHHHHHHHHHh Confidence 111111111 1111111111111121110000011111111110000000000 146899999999999 Q ss_pred HHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcc---cccc----- Q lcl|NC_016762. 73 TCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWD---RPAR----- 143 (456) Q Consensus 73 d~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~---~Pl~----- 143 (456) =++.+.+++...++. .. +.|+..++ -++...+.++.+....||.|++++..+ ||+.-- .|.. T Consensus 113 yl~G~p~~~~~~d~~-~~-------~~l~~~~~-n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~~~~~~p~~~~~v~ 183 (492) T protein:vir:94 113 YIVGKPIAFKHTDDE-VV-------KRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIW 183 (492) T ss_pred hhcccCceeccCchH-HH-------HHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEEEcccceEEEE Confidence 999999888544322 11 22444443 367788888888899999999888774 343211 1211 Q ss_pred -CC-cCceeEEEEeccccCChhhhhccccccccCCc---eeEEEeeccc--CCc-cccceeeehh----hhheecC--Cc Q lcl|NC_016762. 144 -GK-LNGLAKVTPAWAGCLKPKSFDEKPDSETYGQP---TMWEYTEASQ--AGR-PGLVRDIHPD----RVFILGD--WT 209 (456) Q Consensus 144 -~~-~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P---~~y~i~~~~~--~g~-~~~~~~IH~S----Rli~~~~--~~ 209 (456) .. .+.+....-+|... +-..-.++.| .+|.+..... ... ......+|.. -.+.+.. .+ T Consensus 184 d~~~~~~~~a~ir~~~~~--------~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn 255 (492) T protein:vir:94 184 TDKEHEELEAFIRMYKLE--------NETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNN 255 (492) T ss_pred cCCCCCceEEEEEEEeec--------cceeEEEEecCeEEEEEEecCeeeeccccccccccccccccCCCccceEEecCC Confidence 00 01111111111110 0001111111 1222211000 000 0011122211 1111222 23 Q ss_pred CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEe Q lcl|NC_016762. 210 GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLP 289 (456) Q Consensus 210 ~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~li 289 (456) .+|.|.++.+.+-+.+++.+.-..+..+-..+...+.++ ++. +....+.... .+......+ T Consensus 256 ~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~--------g~~---~~~~~~~~~~--------~~~~~~~~~ 316 (492) T protein:vir:94 256 DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLK--------NYD---DQELPEFKRL--------LRYYGAIKV 316 (492) T ss_pred CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--------cCC---cccchhhHHH--------Hhhccceec Confidence 568899988877777777665444433222121112111 110 0111111111 112233334 Q ss_pred cCCCc--eeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----HHHHHHHHHHHhhhhHHHHH Q lcl|NC_016762. 290 TQGAT--VTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----YHNARCQARRVQELTFEIND 362 (456) Q Consensus 290 d~~d~--~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----nyyd~I~~~Qe~~lrp~L~~ 362 (456) +.+.+ |-..+.+.+++...++...+.|...+++|-.-+ + .-|| |.++. ++ .-...+ ..++..++..|++ T Consensus 317 ~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~-~~~~-n~Sg~Al~~~~~~l~~k~-~~k~~~f~~~l~~ 392 (492) T protein:vir:94 317 SDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSS-D-KFGS-APSGVALEFLYTNLNLKA-DKLARKAKVAIQE 392 (492) T ss_pred CCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCcCCCc-c-cccc-CchHHHHHHHHHHHHHHH-HHHHHHHHHHHHH Confidence 55554 445677788999999999999999999986322 1 1122 23333 22 223344 4555678999999 Q ss_pred HHHHHHHhcCc-CCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHH-cCCcCcCHH-HHHHHhc-----ccCCCC Q lcl|NC_016762. 363 LFAHLMRIGVV-PLKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAIG-TGEPVFTAE-EIREEAG-----YDPLQG 432 (456) Q Consensus 363 l~~~l~~s~~~-~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~-~g~~~i~~~-E~R~~~~-----~~~~~~ 432 (456) ++++++..... ....++.|.|+|-...++++.|++..+.+-+.. +++. .+ .+-+++ |+..... ...... T Consensus 393 ~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~e~~~~~~kl~giiS~et~~~~l~-~v~d~~~E~eri~~E~~~~~~~~~~ 471 (492) T protein:vir:94 393 LLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHP-FVEDLQAELERIEQEQMEYNKQLPN 471 (492) T ss_pred HHHHHHHHhcCCcccceeeEEecCCCCCCHHHHHHHHHHHhccCchHHHHHhCC-CCCCHHHHHHHHHHHHHHHHhhccc Confidence 99887654322 234579999999999999999998877654322 1221 11 122333 3322110 011111 Q ss_pred CCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 433 GDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 433 ~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) ..+..+.++..++..+..+.| T Consensus 472 ---~~~~~~~~~~~~~~~~~~e~e 492 (492) T protein:vir:94 472 ---LDDGGADSAQQQERSNNKESE 492 (492) T ss_pred ---cccccCCCCccccCCccccCC Confidence 111111112222222333344 No 145 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=98.93 E-value=6.7e-10 Score=70.87 Aligned_cols=421 Identities=10% Similarity=0.052 Sum_probs=187.7 Q ss_pred CCch-----------------------hHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHH Q lcl|NC_016762. 1 MTDK-----------------------LDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMY 57 (456) Q Consensus 1 ~~~~-----------------------~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y 57 (456) |.+| .++++.-....++.+++..|.+-.--+ . ....+.+.-..-+ T Consensus 3 ~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~-------~-----~~~~~~~~~~~~~ 70 (517) T protein:vir:98 3 VIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQV-------E-----YINSQGKIQERDY 70 (517) T ss_pred hHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCccc-------c-----cccccccccccce Confidence 2222 233333223333333333332211100 0 0000000000012 Q ss_pred hcCchhhhhhccchhHHhhCCCEEecCCCcc-hhhhhHH--HHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecC Q lcl|NC_016762. 58 RRGGIAHGAVEKIVTTCWKTNPQVIEGDDQD-RSKDETE--WERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRD 134 (456) Q Consensus 58 ~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d-~~~~~~~--~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D 134 (456) .+=.++++||...|+=++-+-.+|+-++... ..+..+. -...|++.++.-++...+.+++......||+++-+.++. T Consensus 71 ~sl~~~~~i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~ 150 (517) T protein:vir:98 71 MTLNLRKLSADVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVDN 150 (517) T ss_pred eecCcHHHHHHHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEeC Confidence 2347899999999998888888886544321 1111111 112477778888899999999999887787777666643 Q ss_pred CCC-c-----c--ccccCCcCceeEEE-EeccccCChhh---hhc-----cccccccCCceeEEEeeccc-------CCc Q lcl|NC_016762. 135 SQP-W-----D--RPARGKLNGLAKVT-PAWAGCLKPKS---FDE-----KPDSETYGQPTMWEYTEASQ-------AGR 190 (456) Q Consensus 135 ~~~-~-----~--~Pl~~~~~~l~~i~-~~~~~~~~~~~---~~~-----Dp~s~~yg~P~~y~i~~~~~-------~g~ 190 (456) ++. + + -|+.-..+++..+. +++.... ... +.+ .+..-.|+.- .|+|..... -|. T Consensus 151 ~~~~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~-~~~~~~~Yt~lE~H~~~~~~~~~~-~y~I~n~ly~s~~~~~lG~ 228 (517) T protein:vir:98 151 GEIEFSWALANAFYPLRSNSNGISEGVMKSVTTKV-IGNKTVYYTLLEFHEWEKTEEGES-LYVITNELYKSDNEGEIGK 228 (517) T ss_pred CeeEEEEEcCCeeEEEEecCCCeEEEEEEEEEEEe-ecCCceEEEEEEEEecCceeccCC-cEEEEEEEEecCCCccccc Confidence 321 1 1 14443334443221 1111110 000 000 1111111111 233321110 010 Q ss_pred cc----------cceeee-hhhh--heec--------CCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhh Q lcl|NC_016762. 191 PG----------LVRDIH-PDRV--FILG--------DWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNF 249 (456) Q Consensus 191 ~~----------~~~~IH-~SRl--i~~~--------~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~ 249 (456) +. ....+. -.|. .+|. ...+.|+|++..+.+.+..++.+-..+. +...+....+.. T Consensus 229 ~v~L~~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~---~e~~~g~~~i~v 305 (517) T protein:vir:98 229 RIPLEELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFW---WEIKMGQRTVFV 305 (517) T ss_pred cccccccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHH---HHHHhCCcceec Confidence 00 001111 1221 1110 1245699999999988888887654433 221111111110 Q ss_pred hhhccHhhHH----hhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEeccc--CCHHHHHHHHHHHHHhhhcCC Q lcl|NC_016762. 250 DKEINLGEIA----STYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSAV--SDPGPTYNVNLQTAAAGVDIP 323 (456) Q Consensus 250 ~~~~~~~~l~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~~--sgl~~~~~~~~~~~aaas~IP 323 (456) -.++. +..+.......+.-......+... +.+.-++.++..+ ......++.+++.++..+|++ T Consensus 306 -----p~~~l~~~~~~~g~~~~~~~d~~~~~y~~~~~~------~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls 374 (517) T protein:vir:98 306 -----SDVMLRTVPDESGMPPPQVFDPDVNVYKSIRMG------TDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLS 374 (517) T ss_pred -----ChhhhccccCCCCcccCCCCCcccceeeeccCC------CCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCC Confidence 01111 000000000000000000001000 1122355555444 456777888899999999999 Q ss_pred eEEeeccCCCccc-chH---HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHh--------cCcCCCCceEEEeCCCCCCCH Q lcl|NC_016762. 324 TKILVGMQTGERA-SSE---DQKYHNARCQARRVQELTFEINDLFAHLMRI--------GVVPLKAEFTAIWDDLTVPTK 391 (456) Q Consensus 324 ~t~L~G~sp~Gln-st~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s--------~~~~~~~d~~~~f~pL~~~se 391 (456) -. -||....|.. ||+ ..+.-|.++.++|. .++..|++|+..++.. +..+...+++|.|.+--..+. T Consensus 375 ~~-t~~~~~~~~kTATEi~s~~~~~~~t~~~~~~-~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~ 452 (517) T protein:vir:98 375 VG-TFSFDGRSMKTATEIVSENDLTYRTRNDHVY-EVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDR 452 (517) T ss_pred cc-cccccccccccHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCH Confidence 86 6777777774 553 45567788888876 5799999998876432 222344579999999999998 Q ss_pred HHHHHHHHHHHHH----HHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCCCCCCcCC-CCCCC Q lcl|NC_016762. 392 AERLANSKTMSEI----NSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDEDAARTDP-TGEQQ 456 (456) Q Consensus 392 ke~Aei~~~~A~a----~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~-~~~~e 456 (456) ++.++...+...+ ...++.--- -++.+|+++.+..-....... .+........++ .+|.| T Consensus 453 ~~~~~~~~~~v~aG~ms~~~~i~~~~-g~~eeeA~~e~~~i~~E~~~~----~~~~~~~~~~~~~~gd~e 517 (517) T protein:vir:98 453 SALLRFYGQAKTFGFIPTVEAIQRIF-KVPKKTAEQWLEEIRKDQIEL----DPVTISQRAQKRMFGDEE 517 (517) T ss_pred HHHHHHHHHHHhcCCCCHHHHHHHhC-CCChHHHHHHHHHHHHhcccc----CCCCccccccCCCCCCCC Confidence 8877765432111 011111000 134454444321110000000 011111112222 22222 No 146 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=98.92 E-value=3e-09 Score=67.34 Aligned_cols=423 Identities=14% Similarity=0.076 Sum_probs=190.5 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccC------cccCCHHHHHHHHhcCchhhhhhccchhHH Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGF------PQEITFNDLYTMYRRGGIAHGAVEKIVTTC 74 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~------~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~ 74 (456) |+++--.+.....+ ....+-+|.....+.++. .|..++- ....-..-...+|+.|++++.+|+...... T Consensus 3 ~~~~~~~a~~~~~~--~~~~~~~y~aa~~~~~~~---~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~~v 77 (495) T protein:vir:10 3 MTPSGYQSLASGLL--VPVGASAYEGASGGHRWQ---DIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVAAA 77 (495) T ss_pred cccccccccchhhh--hHHHhhhhhccccCcccC---CCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhh Confidence 56653222111111 111122333222221111 1110000 001112235678999999999999999999 Q ss_pred hhCCCEEecCCCcchhhhhHHHHHHHHHHHHH----------hhHHHHHHHHHHhhcccCceEEEEEecC-CCCcccccc Q lcl|NC_016762. 75 WKTNPQVIEGDDQDRSKDETEWERKNKPLIAG----------GRFWRAVSEADRRRLVGRYSGLLLHIRD-SQPWDRPAR 143 (456) Q Consensus 75 tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~----------l~~~~~~~ea~~~~r~~Ggs~i~i~i~D-~~~~~~Pl~ 143 (456) +=.||+.....+ .. ++.++|++++++ +.+.+....+.+.-...|-+++.+.... +....-|++ T Consensus 78 VG~Gi~p~~~~~--~~----~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~ 151 (495) T protein:vir:10 78 VGNGLTPRWRMK--EQ----ELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQ 151 (495) T ss_pred cCCCcccccCCc--hH----HHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccceE Confidence 888988754322 22 234455555543 2233333334444445566655554321 111111222 Q ss_pred CCcCceeEEEEeccccCC-hhh--------hh-ccccccccCCceeEEEeecccCCc-----cccceeeehhhhheecC- Q lcl|NC_016762. 144 GKLNGLAKVTPAWAGCLK-PKS--------FD-EKPDSETYGQPTMWEYTEASQAGR-----PGLVRDIHPDRVFILGD- 207 (456) Q Consensus 144 ~~~~~l~~i~~~~~~~~~-~~~--------~~-~Dp~s~~yg~P~~y~i~~~~~~g~-----~~~~~~IH~SRli~~~~- 207 (456) |.-|.|-+ |+ |.. +. .-..--.+|+|..|+|...-++.. ......|-.++|+|+-. T Consensus 152 -----lqliepd~---l~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~~rvpA~~vlH~f~~ 223 (495) T protein:vir:10 152 -----LQIIEPDM---LASDIPDETLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIGDPVDTVWIKAEHVLHVTVL 223 (495) T ss_pred -----EEEechhh---cCCCCCCCCCCCCCEEEeceEECCCCceEEEEEeecCCCcccccccccceeeechhheEecccc Confidence 22222222 21 110 00 011112589999999975444321 11235577777776532 Q ss_pred --CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCC Q lcl|NC_016762. 208 --WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGND 285 (456) Q Consensus 208 --~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 285 (456) ....|+|.+-++. .+.+++.-....-..-.-+++-...++. ...-.......+....... ......+..+ . T Consensus 224 r~gQ~RGis~la~i~-~l~~l~~y~dael~~a~i~A~~~~fi~~--~~~~~~~~~~~~~~~~~~~---~~~~~~l~pG-~ 296 (495) T protein:vir:10 224 TVRSDAGAPWFQLLL-RLNELDQYEDAELVRKKTAALFAAFIQE--ATADSTGGPTIGQPKRSKG---GKRITGLNPG-T 296 (495) T ss_pred CCCcccCcchhHHHH-HHHHhhHHHHHHHHHHHHhhhheeeeec--CCCccccccccCccccccC---cccceecCCc-e Confidence 2235999987765 4666655443211111111111111110 0000000000010000000 0001111111 1 Q ss_pred eEEecCCCceeEEecc--cCCHHHHHHHHHHHHHhhhcCCeEEeeccCCC-cccch-HHHHHHHHHHHHHHHhh-----h Q lcl|NC_016762. 286 VLLPTQGATVTQMVSA--VSDPGPTYNVNLQTAAAGVDIPTKILVGMQTG-ERASS-EDQKYHNARCQARRVQE-----L 356 (456) Q Consensus 286 ~~lid~~d~~~~~~~~--~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~-Glnst-~D~~nyyd~I~~~Qe~~-----l 356 (456) +.-+..+++++.++.+ -++..+....++..||+..|||--.|.|--.+ ..+|. ..+..+...+++.|.+. + T Consensus 297 i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~ 376 (495) T protein:vir:10 297 LQYLQPGQEVKFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRRLCQQVQHHMIIHQFC 376 (495) T ss_pred eeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2234457888888764 56999999999999999999999999884332 23333 35667777787777653 4 Q ss_pred hHHHHHHHHHHHHhcCcCCCCce-------EEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCc-----------CH Q lcl|NC_016762. 357 TFEINDLFAHLMRIGVVPLKAEF-------TAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVF-----------TA 418 (456) Q Consensus 357 rp~L~~l~~~l~~s~~~~~~~d~-------~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i-----------~~ 418 (456) +|+-+.+++..+..+.++.|+.+ ..+| .+...+-.|= .|.++|....+.+| .. ++ T Consensus 377 ~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w----~~p~~~~vDP-~Ke~~A~~~~i~~G--~~s~~~~~a~~G~D~ 449 (495) T protein:vir:10 377 RPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSW----RTPRWEEVDP-LKKHLADLGDVRAG--FAPISDKQAERGYDM 449 (495) T ss_pred HHHHHHHHHHHHHcCCCCCCCchhhhHhhhcccc----ccCCccccCh-HHHHHHHHHHHHcC--CCCHHHHHHHcCCCH Confidence 56667777777777777655432 2233 2222222221 24555666666666 33 33 Q ss_pred HHHHHH-------hcccCCC-CCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 419 EEIREE-------AGYDPLQ-GGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 419 ~E~R~~-------~~~~~~~-~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) +|+.+. ...-++. +.++-..........+..++..++| T Consensus 450 ~~v~~q~a~e~~~~~~~Gl~~~~~p~~~~~~~~~~~~~~~~~~~~e 495 (495) T protein:vir:10 450 EELFDMISDANQLIDEYDLRLDSDPRYVNGSGAEQKSVMEAALNNE 495 (495) T ss_pred HHHHHHHHHHHHHHHHcCCCCCCCCCcCCCccCCCCCCCCCCCCCC Confidence 333221 1111221 1111111111111111222222222 No 147 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=98.91 E-value=3.5e-09 Score=66.97 Aligned_cols=408 Identities=7% Similarity=-0.139 Sum_probs=183.9 Q ss_pred HHHhHHHH--HHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecC Q lcl|NC_016762. 7 LAVNHAMS--SAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEG 84 (456) Q Consensus 7 ~~~~~a~~--~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~ 84 (456) +..++-.. ..+.+-.+-|.+----+ .+.......+.+. ..-.+.+++.||+..+.=++.+.+++... T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g~~~~~--~~~~~~~~~~~~~---------~ki~~n~~~~ivd~~~~~l~g~~~~~~~~ 69 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQGDNFSI--LSGHRRLDDEKAD---------YRVRHKWGGYISSFATGYVIGNPVSIGVM 69 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhccCCccc--ccccccccccCCc---------ceeecchHHHHHHhhhhheeccCceEeeC Confidence 33222111 11222222221100000 0000000000010 01257889999999999999999888654 Q ss_pred CCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcc---ccccC-------CcCceeEEE Q lcl|NC_016762. 85 DDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWD---RPARG-------KLNGLAKVT 153 (456) Q Consensus 85 ~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~---~Pl~~-------~~~~l~~i~ 153 (456) +..+... + ..|...+.+-++...+.++.+....||.|++++.++ +|...- .|... ..+.+.... T Consensus 70 ~~~~~~~----~-~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~i 144 (440) T protein:vir:95 70 EGGSADQ----L-STIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDKVDRVVLISPLEMFVIRDLTVEQNIIAAV 144 (440) T ss_pred CCccHHH----H-HHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEE Confidence 4332211 1 236666777788999999999999999999988874 343321 23320 111122211 Q ss_pred EeccccCChhhhhccccccccCCce-eEEEeecccC--CccccceeeehhhhheecC--CcCCCcchHHHHHHHHHHHHH Q lcl|NC_016762. 154 PAWAGCLKPKSFDEKPDSETYGQPT-MWEYTEASQA--GRPGLVRDIHPDRVFILGD--WTGDAIGFLEPAYNSFISLEK 228 (456) Q Consensus 154 ~~~~~~~~~~~~~~Dp~s~~yg~P~-~y~i~~~~~~--g~~~~~~~IH~SRli~~~~--~~~~G~S~le~~~~~l~~~~~ 228 (456) -+|.. .+...-.++-|. .|++.....+ +.......=|+--.+.+.. ...+|.|.++.+.+-+.+++. T Consensus 145 ~~~~~--------~~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~lida~~~ 216 (440) T protein:vir:95 145 HLPIY--------ADKVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVEWWNNRFRMGDYESEISLIDAYDA 216 (440) T ss_pred EEEEe--------cCceEEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEEEeeCCCCCCCchhhhHHHHHHHHH Confidence 12211 011011111122 1221110000 0000011124333222222 235688999988887777777 Q ss_pred HHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCc--eeEEecccCCHH Q lcl|NC_016762. 229 VEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGAT--VTQMVSAVSDPG 306 (456) Q Consensus 229 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~--~~~~~~~~sgl~ 306 (456) +.-..+..+-..+...+.++. .........+....-....+............+.+.+ |-..+.+.+++. T Consensus 217 ~~s~~~~~~~~~~~~~~v~~g--------~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~ 288 (440) T protein:vir:95 217 GQSDTANYMSDLNDAMLLVKG--------DLDGIKLSPEDAAKMKDANMLFLKTGISTTGQQTTADASYIYKQYDVNGTE 288 (440) T ss_pred HHHHHHHHHHHhhcceeeeec--------ccccCCCCccchhhhhhccceecccccccccCCCCcceeEEeecCCHHHHH Confidence 655554433222222222221 1111111111111111111100111111111223333 444566778999 Q ss_pred HHHHHHHHHHHhhhcCCeEEeeccCCCcccchHHHH----HHHHHHHHHHHhhhhHHHHHHHHHHHHhc---Cc-C-CCC Q lcl|NC_016762. 307 PTYNVNLQTAAAGVDIPTKILVGMQTGERASSEDQK----YHNARCQARRVQELTFEINDLFAHLMRIG---VV-P-LKA 377 (456) Q Consensus 307 ~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D~~----nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~---~~-~-~~~ 377 (456) ..++.+.+.|...+++|-.-+ +.-.|..+|.. ++ .-...++.+ +..++..|++++++++... .+ . ... T Consensus 289 ~~~~~l~~~i~~~s~~p~~~~-~~~~~n~Sg~A-l~~~~~~l~~k~~~k-~~~~~~~l~~~~~li~~~~~~~~~~~~~~~ 365 (440) T protein:vir:95 289 AYKNRLANDIHRFSRIPNLDD-DRFNSTSSGIA-LLYKMIGLEQVRKDK-ETYFTKALRRRYELISNIHKAINGPVIEAN 365 (440) T ss_pred HHHHHHHHHHHHHhCCccccc-ccccccchHHH-HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcCCcccccc Confidence 999999999999999997433 22223232222 22 233344444 4568999999998876431 11 1 235 Q ss_pred ceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHHcCCcCcC-HHHHHHHhcccCCCCCCCCcccCCCCCCCCCcCCCCC Q lcl|NC_016762. 378 EFTAIWDDLTVPTKAERLANSKTMSEINS--AAIGTGEPVFT-AEEIREEAGYDPLQGGDPLPDTEPEDEDAARTDPTGE 454 (456) Q Consensus 378 d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~~g~~~i~-~~E~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~ 454 (456) ++++.|+|-...++++.|++..+.+.... +++.. .+-++ +.|+-........ ..... .+..+...+.+.+ T Consensus 366 ~v~i~f~~~~p~~~~~~ad~~~kl~g~iS~et~~~~-l~~~d~~~E~~ri~~E~~~-~~~~~-----~~~~~~~~~~~~~ 438 (440) T protein:vir:95 366 KLTFTFHPNIPQDVWTEIKAYIEAGGEISQETLMEN-ASFTDYKTEHSRILKQGGS-SDLEI-----GQIVGDADVGQAD 438 (440) T ss_pred cceEEeCCCCCCCHHHHHHHHHHHhccCcHHHHHHh-CCCCCcHHHHHHHHHHHHH-hhhhH-----HhhccCCCCCCcC Confidence 78999999999999999998877654322 11111 11223 3333222111110 00000 0001111122222 Q ss_pred CC Q lcl|NC_016762. 455 QQ 456 (456) Q Consensus 455 ~e 456 (456) .| T Consensus 439 ~e 440 (440) T protein:vir:95 439 TE 440 (440) T ss_pred CC Confidence 22 No 148 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=98.90 E-value=1.2e-09 Score=69.53 Aligned_cols=410 Identities=11% Similarity=0.073 Sum_probs=189.2 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCc-------ccchhhhhccCc--ccCC-HH------HHHHHHhcCchhh Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDA-------KRPQAWCEYGFP--QEIT-FN------DLYTMYRRGGIAH 64 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt-------~~~~~~~~~~~~--~~~~-~~------~l~~~Y~~~~l~r 64 (456) |.+|+...+..=++.-. ..-++.+..-..+. .+-..|..|+.. ..+. +. ....-..+..+++ T Consensus 1 m~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k 78 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMG--LLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPK 78 (496) T ss_pred ChhHHHHHHHHHHHHhc--cchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecchHH Confidence 99998876664444210 00111111111111 111123333111 1000 00 0001122458899 Q ss_pred hhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecC-CCCccc--- Q lcl|NC_016762. 65 GAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRD-SQPWDR--- 140 (456) Q Consensus 65 ~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D-~~~~~~--- 140 (456) .||+..|.=++-+-+.|..+++. ..+.|.+.++..+++..+.+++.....+|++++.+.++. ++..-. T Consensus 79 ~i~~~~a~~l~~~p~~i~~~d~~--------~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~ 150 (496) T protein:vir:38 79 VTAKYMSKLLFNEKVKINIDDKA--------AEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFAT 150 (496) T ss_pred HHHHHHhhhhhCCcceEeeCChH--------HHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcEEEEEEc Confidence 99999999888888888654321 112467777777899999999999999999999888753 432211 Q ss_pred -----cccCCcCceeEEEEeccccCChhhhhccccccccCCce-------eEEEeec-------ccCCccc--------- Q lcl|NC_016762. 141 -----PARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPT-------MWEYTEA-------SQAGRPG--------- 192 (456) Q Consensus 141 -----Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~-------~y~i~~~-------~~~g~~~--------- 192 (456) |+.-..+.+..+ +|+.. .. . ....|..-+ .|.|.-. ..-|.+. T Consensus 151 ~~~~~P~~~~~~~~~~~-~f~~~-~~-----~--~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~ 221 (496) T protein:vir:38 151 ADCMYPLSNDSENVDEC-VIANS-FH-----K--NNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDI 221 (496) T ss_pred ccceEEEEecCCcEEEE-EEEEE-EE-----e--CCeEEEEEEEEEEeCceEEEEEEEEecCCccccCcccccccccccc Confidence 432112222211 12110 00 0 000111111 1111100 0000000 Q ss_pred -cceee-ehhhhh--eec--------CCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHh Q lcl|NC_016762. 193 -LVRDI-HPDRVF--ILG--------DWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIAS 260 (456) Q Consensus 193 -~~~~I-H~SRli--~~~--------~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~ 260 (456) ....+ +-+|+. .|. .....|.|+++.+.+-+..++.+....+.-+-. +-..+.+ -..+.. T Consensus 222 ~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~-~~~~i~v-------~~~~l~ 293 (496) T protein:vir:38 222 EPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL-GKKKVLV-------PSSFVK 293 (496) T ss_pred ccceeecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhh-cccceec-------chHHhh Confidence 00001 113331 121 123469999999998888888776555433221 1100100 011111 Q ss_pred hhcCCHHHHHHHHHHHHHHHhcCCCe--EE-ecC---CCceeEEeccc--CCHHHHHHHHHHHHHhhhcCCeEEeeccCC Q lcl|NC_016762. 261 TYGVTLDALNERFNEAARQLNRGNDV--LL-PTQ---GATVTQMVSAV--SDPGPTYNVNLQTAAAGVDIPTKILVGMQT 332 (456) Q Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~l-id~---~d~~~~~~~~~--sgl~~~~~~~~~~~aaas~IP~t~L~G~sp 332 (456) ....+..+....+ ...... ++ .+. ...++.++..+ ......++...++++..+|+|-. -||... T Consensus 294 ~~~~~~g~~~~~~-------~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~-~f~~~~ 365 (496) T protein:vir:38 294 TAVNLDGSTTQYF-------DSTDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAG-TFTFDE 365 (496) T ss_pred ccCCCCCccccCC-------CCccceEEEeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChh-hcCCCc Confidence 0010000000011 111111 11 111 12355555544 34567788888999999999876 477666 Q ss_pred Cccc-chH---HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHh--------cCcCCCCceEEEeCCCCCCCHHHHHHHHHH Q lcl|NC_016762. 333 GERA-SSE---DQKYHNARCQARRVQELTFEINDLFAHLMRI--------GVVPLKAEFTAIWDDLTVPTKAERLANSKT 400 (456) Q Consensus 333 ~Gln-st~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s--------~~~~~~~d~~~~f~pL~~~seke~Aei~~~ 400 (456) +|.. +++ ....-+.++..+|. .++..|++++..++.. +....+.+++|.|++-...++.+.++...+ T Consensus 366 ~g~~tAtei~~~~~~l~~~~~~~~~-~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~ 444 (496) T protein:vir:38 366 NGLKTATEVVSEKSETYQTKNSHSQ-LIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTN 444 (496) T ss_pred cccchHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHH Confidence 6764 443 33445666666554 6788888888776532 122345679999999988888887665443 Q ss_pred HHHHHHHHHHcCCcCcCHHHHHHHh-cccCCCCCC-CCc---ccCC-CCCCCCCcCCCCCCC Q lcl|NC_016762. 401 MSEINSAAIGTGEPVFTAEEIREEA-GYDPLQGGD-PLP---DTEP-EDEDAARTDPTGEQQ 456 (456) Q Consensus 401 ~A~a~~~~~~~g~~~i~~~E~R~~~-~~~~~~~~~-~~~---~~~~-~d~~~~~~d~~~~~e 456 (456) ++.+| +++.+.+.... +.+. .+.. ..+ ++.. ...+.+...+.+++| T Consensus 445 -------~~~~G--iiS~et~l~~~~~~~d-~ea~~el~ri~~E~~~~~~~~d~~~~~~~~e 496 (496) T protein:vir:38 445 -------AKNQG--MIPLKIALQRAWNITE-AEADEWAEMLAKEKQAEMPNNDMNGIFGEEE 496 (496) T ss_pred -------HHhcC--CCCHHHHHHhcCCCCh-HHHHHHHHHHHHhhhccCccccccCCCCCCC Confidence 23345 55544443221 0000 0000 000 0000 001122233455555 No 149 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=98.89 E-value=1.9e-08 Score=62.90 Aligned_cols=403 Identities=9% Similarity=0.021 Sum_probs=179.0 Q ss_pred CCch-h-HHHHhHHHHHHHHHHHHHHhhhhhc-cCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhC Q lcl|NC_016762. 1 MTDK-L-DLAVNHAMSSAIARARMSLLNQGIG-HDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKT 77 (456) Q Consensus 1 ~~~~-~-~~~~~~a~~~~~~~~~d~~~n~~~~-~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~ 77 (456) |+.+ + ++.-.|..+...-+....|.+.--. +...+ + ..+-+ ...+ ...+++.||+..+.=++-+ T Consensus 17 ~~~~~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~-~---~~~~~-------~~ki--~~n~~~~ivd~~~~~l~g~ 83 (453) T protein:vir:39 17 ITNEVVTKFMEKHRLEVARYEYLKNMYRGIMAIDAEPT-K---DLWKP-------DNRL--TVNFTKYIVDTFTGYFNGI 83 (453) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhhccCchhcCCC-c---cccCc-------ccee--ecchHHHHHHHHhhhhccc Confidence 3333 1 2221222221111112223221000 00000 0 00000 0111 3578999999999999999 Q ss_pred CCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcc---ccccCCcCceeEEE Q lcl|NC_016762. 78 NPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWD---RPARGKLNGLAKVT 153 (456) Q Consensus 78 ~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~---~Pl~~~~~~l~~i~ 153 (456) .+++...++.. .+.|.+.+++-++...+.++.+....+|.|++++..+ +|+..- .|.. +. T Consensus 84 ~~~~~~~d~~~--------~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~--------~~ 147 (453) T protein:vir:39 84 PVKKSHSDKET--------LSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEETQTNVIYNTPEN--------MF 147 (453) T ss_pred CceeccCChHH--------HHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccc--------eE Confidence 98886544322 1347777888888899999999999999999888773 333211 1221 12 Q ss_pred EeccccCC--h---hhhh--cc-ccccccCCcee-EEEeecccCCccccceeeehhhhheecCC--cCCCcchHHHHHHH Q lcl|NC_016762. 154 PAWAGCLK--P---KSFD--EK-PDSETYGQPTM-WEYTEASQAGRPGLVRDIHPDRVFILGDW--TGDAIGFLEPAYNS 222 (456) Q Consensus 154 ~~~~~~~~--~---~~~~--~D-p~s~~yg~P~~-y~i~~~~~~g~~~~~~~IH~SRli~~~~~--~~~G~S~le~~~~~ 222 (456) |+|..... + ..+. .+ ..--.++.|.. |++... .++-......=|+-..+.+..+ ..+|.|.++.+..- T Consensus 148 ~v~d~~~~~~~~~~ir~~~~~~~~~~~~~yt~~~i~~~~~~-~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~l 226 (453) T protein:vir:39 148 MVYDDTIKQEPLFAVRYGYDDDYKLYGEVYTKETTYALNGT-MGFYNMTEQAPNPFDDLPVVEFYFNEERMSIFESVISL 226 (453) T ss_pred EEecCCCCCeEEEEEEEEEeCCeEEEEEEEeCCeEEEEEec-CCceeeecccccCCCceeEEEecCCCCCCcchhhhHHH Confidence 22211000 0 0000 00 00011122222 222211 0100000111233333333332 35688999888777 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCc--eeEEec Q lcl|NC_016762. 223 FISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGAT--VTQMVS 300 (456) Q Consensus 223 l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~--~~~~~~ 300 (456) +.+++.+.-..+..+-..+...+.+.. . +...+. ...+... ..+. -.+......+.+ +-+.+. T Consensus 227 iDa~~~~~s~~~~~~~~~~~p~~~~~g---~---------~~~~~~-~~~~~~~-~~~~-~~~~~~~~~~~~~~~lt~~~ 291 (453) T protein:vir:39 227 VNAFNKAISEKANDVDYFSDQYLTFLG---A---------AVEEED-LKNIRSN-RVIN-YYGESSEAKNVDVKFLEKPD 291 (453) T ss_pred HHHHHHHHHHHHHHHHHhhCceeeeec---C---------CCCchh-hhhhhhc-ceee-ecCCCCCCCCCceeEEeecC Confidence 767777655544333211222221110 0 001111 1111100 0000 000000112233 444566 Q ss_pred ccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----HHHHHHHHHHHhhhhHHHHHHHHHHHHhc--Cc Q lcl|NC_016762. 301 AVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----YHNARCQARRVQELTFEINDLFAHLMRIG--VV 373 (456) Q Consensus 301 ~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~--~~ 373 (456) +.+++...++.+...|...+++|-.-. +.. | |++++ ++ .-...+..+| ..+...|++++.+++... .+ T Consensus 292 ~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~~-g--n~Sg~Al~~~~~~l~~ka~~~~-~~~~~~l~~~~~li~~~~~~~~ 366 (453) T protein:vir:39 292 SDSQTENLLDRLTKLIFQTTMVANISD-ESF-G--SSSGVSLAYKLQAMSNLALSFQ-RKFQSSLNSRYKLYCELSTNVS 366 (453) T ss_pred CHHHHHHHHHHHHHHHHHHhCCccccc-ccc-c--CChHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhccC Confidence 778999999999999999999995322 111 2 33443 22 2344555444 467888998888765432 12 Q ss_pred --CCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHH--HH-HcCCcCc-CH-HHHHHHhcccCCCCCCCCcccCCCCCCC Q lcl|NC_016762. 374 --PLKAEFTAIWDDLTVPTKAERLANSKTMSEINSA--AI-GTGEPVF-TA-EEIREEAGYDPLQGGDPLPDTEPEDEDA 446 (456) Q Consensus 374 --~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~--~~-~~g~~~i-~~-~E~R~~~~~~~~~~~~~~~~~~~~d~~~ 446 (456) ....+++|.|+|-...+.++.|++..+.+.+... ++ ..+ -+ ++ .|+...- .+.-.........+...+.. T Consensus 367 ~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl~g~is~et~l~~l~--~v~D~~~E~~ri~-~E~~~~~~~~~~~~~~~~~~ 443 (453) T protein:vir:39 367 NKEAWKDIEYTFTRNEPKDIKEQAETANILMGITSQETALSVIS--VIPDVQAEMEKIK-KEEASTAIFDKDKQPSEKGT 443 (453) T ss_pred CccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCC--CCCCHHHHHHHHH-HHHHHHHHHHHhccCCCCCC Confidence 2234789999999999999999988777643321 11 111 22 22 2332211 11000000000001111111 Q ss_pred CCcCCCCCCC Q lcl|NC_016762. 447 ARTDPTGEQQ 456 (456) Q Consensus 447 ~~~d~~~~~e 456 (456) ....+.-++| T Consensus 444 ~~~~~~~~~e 453 (453) T protein:vir:39 444 DTVVPETNEE 453 (453) T ss_pred CCCCCCcCCC Confidence 1111111222 No 150 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=98.88 E-value=2.1e-08 Score=62.63 Aligned_cols=420 Identities=9% Similarity=-0.038 Sum_probs=172.2 Q ss_pred CCchhHHHHhHHHHHH---HHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhC Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSA---IARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKT 77 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~---~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~ 77 (456) -++.....+++..+.- ..+....|...--.+-...++.....+.... ....-..=..+.++++||+..+.=++-+ T Consensus 26 ~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~--~~~~~~~ri~~n~~~~ivd~~~~yl~g~ 103 (503) T protein:vir:59 26 IAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLV--DDTKTNNRTSHAWHKLFVDQKTQYLVGE 103 (503) T ss_pred ccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccchhccccccccc--ccccccceeecchHHHHHHHHHhhhhcC Confidence 1111111222221111 0011111111000000000111111111000 0000000013678999999999999999 Q ss_pred CCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCc---ccccc------CC-c Q lcl|NC_016762. 78 NPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPW---DRPAR------GK-L 146 (456) Q Consensus 78 ~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~---~~Pl~------~~-~ 146 (456) ++++...++ +.. +.++..+ +-++...+.++.+....+|.+++++.++ ||+.- -.|.. .. . T Consensus 104 ~~~~~~~d~-~~~-------~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~~~~~i~d~~~~ 174 (503) T protein:vir:59 104 PVTFTSDNK-TLL-------EYVNELA-DDDFDDILNETVKNMSNKGIEYWHPFVDEEGEFDYVIFPAEEMIVVYKDNTR 174 (503) T ss_pred CeeeccCcH-HHH-------HHHHHHH-hcCHHHHHHHHHHHHhhCCeEEEEEeecCCCceEEEEEccceeEEEEeCCCC Confidence 999854322 111 1233333 3477888899999999999999998874 33321 11211 00 0 Q ss_pred CceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCcc--------------ccceeeehhhhheecC--CcC Q lcl|NC_016762. 147 NGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRP--------------GLVRDIHPDRVFILGD--WTG 210 (456) Q Consensus 147 ~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~--------------~~~~~IH~SRli~~~~--~~~ 210 (456) +.+....=+|...-.-+ ....--.++.|..+..-....++.. .....-|.--.+.+.. .+. T Consensus 175 ~~~~~~ir~~~~~~~~~---~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~~nn~ 251 (503) T protein:vir:59 175 RDILFALRYYSYKGIMG---EETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRVPIIPFKNNE 251 (503) T ss_pred CceEEEEEEEEEecCCC---ceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceeccCCccceEEecCCC Confidence 11111111111000000 0000001112211110000000000 0001122222122222 245 Q ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEec Q lcl|NC_016762. 211 DAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPT 290 (456) Q Consensus 211 ~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid 290 (456) +|.|.++.+.+-+.+++.+.-..+..+-..+...+.++. . .+....+.... + ....++.+. T Consensus 252 ~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g---~--------~~~~~~~~~~~-------~-~~~~~~~~~ 312 (503) T protein:vir:59 252 EMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKN---Y--------DGENPKEFTAN-------L-RYHSVIKVS 312 (503) T ss_pred CCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeec---C--------Cccccchhhhh-------h-hcccceecc Confidence 689999888877777777655544433222222222111 0 01111111111 1 112333445 Q ss_pred CCCc--eeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH---HHHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_016762. 291 QGAT--VTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE---DQKYHNARCQARRVQELTFEINDLFA 365 (456) Q Consensus 291 ~~d~--~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~---D~~nyyd~I~~~Qe~~lrp~L~~l~~ 365 (456) ++.+ +-..+.+.+++...++.+.+.+...+.+|-.-. +.-.|..+|.. -...-...++.+ +..++..|++++. T Consensus 313 ~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~~~~~Sg~Ai~~~~~~l~~k~~~~-~~~~~~~l~~~~~ 390 (503) T protein:vir:59 313 GDGGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSP-ETIGGGATGPALENLYALLDLKANMA-ERKIRAGLRLFFW 390 (503) T ss_pred CCCcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCc-ccccccccHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH Confidence 5444 445566778999999999888888888875321 11112222221 223344445544 4567999999888 Q ss_pred HHHHh----cCcC--CCCceEEEeCCCCCCCHHHHHHHHHHHHHHH----HHHHH-cCCcCcCH-HHHHHHhc------- Q lcl|NC_016762. 366 HLMRI----GVVP--LKAEFTAIWDDLTVPTKAERLANSKTMSEIN----SAAIG-TGEPVFTA-EEIREEAG------- 426 (456) Q Consensus 366 ~l~~s----~~~~--~~~d~~~~f~pL~~~seke~Aei~~~~A~a~----~~~~~-~g~~~i~~-~E~R~~~~------- 426 (456) +++.. ..+. ...++++.|++-...++++.|++..+..++. .+++. .+ .+-++ .|+..... T Consensus 391 ~i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~et~l~~l~-~v~d~~~E~~ri~~E~~~~~~ 469 (503) T protein:vir:59 391 FFAEYLRNTGKGDFNPDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMSKETAVARNP-FVQDPEEELARIEEEMNQYAE 469 (503) T ss_pred HHHHHHHhccCcccccccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCchHHHHHhCC-CCCCHHHHHHHHHHHHHHHHh Confidence 76532 1111 2246999999999999999988876654431 12222 11 12233 34332211 Q ss_pred -ccCCCCCCCCcccCCCC-CCCCCcCCCCCCC Q lcl|NC_016762. 427 -YDPLQGGDPLPDTEPED-EDAARTDPTGEQQ 456 (456) Q Consensus 427 -~~~~~~~~~~~~~~~~d-~~~~~~d~~~~~e 456 (456) ..+..+.....+.+.++ +.+....-.+.-+ T Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 501 (503) T protein:vir:59 470 MQGNLLDDEGGDDDLEEDDPNAGAAESGGAGQ 501 (503) T ss_pred hhccccCccCCCCCCCcCCCCCCcccCCCCCC Confidence 01111111111111110 0000001111111 No 151 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=98.87 E-value=1.9e-09 Score=68.43 Aligned_cols=420 Identities=10% Similarity=0.060 Sum_probs=188.6 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhc----cCc---ccchhhhhccCccc---CCH------HHHHHHHhcCchhh Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIG----HDA---KRPQAWCEYGFPQE---ITF------NDLYTMYRRGGIAH 64 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~----~gt---~~~~~~~~~~~~~~---~~~------~~l~~~Y~~~~l~r 64 (456) |.+|+...+..-++.-. ...++....-. +.+ .+-..|..++.-.. ..+ .....-..+..+++ T Consensus 1 m~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~~~ 78 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMG--LLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQLSMNLPK 78 (499) T ss_pred ChhHHHHHHHHHHHHhc--cccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccceeecchHH Confidence 99988877665444200 00111111101 111 01111222211000 000 00001123568999 Q ss_pred hhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecC-CCCc----- Q lcl|NC_016762. 65 GAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRD-SQPW----- 138 (456) Q Consensus 65 ~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D-~~~~----- 138 (456) .||+..|+=++-+-++|...++. ..+.|++.++.-+++..+.+++..+..+|++++.+.++. |+.. T Consensus 79 ~iv~~~a~~l~~ep~~i~~~d~~--------~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~ 150 (499) T protein:vir:80 79 VTAKYMSKLLFNEKVKINIDDET--------AEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFAT 150 (499) T ss_pred HHHHHHHHhhhCCcceEeeCCHH--------HHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcEEEEEEc Confidence 99999999998888888654321 123477777777899999999999999999999888853 3321 Q ss_pred -cc--cccCCcCceeEEEEeccccCChhhhhccccccccC--CceeEEEeeccc-------CCcc----------cccee Q lcl|NC_016762. 139 -DR--PARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYG--QPTMWEYTEASQ-------AGRP----------GLVRD 196 (456) Q Consensus 139 -~~--Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg--~P~~y~i~~~~~-------~g~~----------~~~~~ 196 (456) ++ |+.-..+.+..+ +|+.....-..+.+--..-.++ +-..|.|..... -|.+ ..... T Consensus 151 a~~~~Pi~~d~~~~~~~-~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~ 229 (499) T protein:vir:80 151 ADCMYPLSNDSENVDEC-LIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVP 229 (499) T ss_pred CCceEEEEecCCCeEEE-EEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCcee Confidence 11 331111222221 1111000000000000000000 001223321100 0000 00111 Q ss_pred e-ehhhh--heec--------CCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCC Q lcl|NC_016762. 197 I-HPDRV--FILG--------DWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVT 265 (456) Q Consensus 197 I-H~SRl--i~~~--------~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~ 265 (456) + +.+|. +.|. .....|+|++..+.+-+..++.+...++.-+ +..-+.+.+ -.++......+ T Consensus 230 ~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~-~~~~~~i~v-------~~~~l~~~~~~ 301 (499) T protein:vir:80 230 LPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEF-KLGKKKVLV-------PSSFVKTAVNL 301 (499) T ss_pred ecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHH-Hhcccceec-------chhhhhccCCC Confidence 1 12332 1221 1234599999999988888888765554322 111111100 01111100000 Q ss_pred HHHHHHHHHHHHHHHhcCCCeEEecC---CCceeEEeccc--CCHHHHHHHHHHHHHhhhcCCeEEeeccCCCccc-chH Q lcl|NC_016762. 266 LDALNERFNEAARQLNRGNDVLLPTQ---GATVTQMVSAV--SDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERA-SSE 339 (456) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~lid~---~d~~~~~~~~~--sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Gln-st~ 339 (456) ..+....+..- .+.+...-.+. +..++.++..+ ......++.++.++...+|+|-. -||...+|.. |++ T Consensus 302 ~g~~~~~~~~~----~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~-~fg~~~~g~~TAte 376 (499) T protein:vir:80 302 DGSTTQYFDST----DEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAG-TFTFDENGLKTATE 376 (499) T ss_pred CCCcccCCCcc----cceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChh-hcCCCcccchhHHH Confidence 00000011000 00000111111 12356555444 44567788888899999999976 4776666653 443 Q ss_pred ---HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHh--------cCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 340 ---DQKYHNARCQARRVQELTFEINDLFAHLMRI--------GVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAA 408 (456) Q Consensus 340 ---D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s--------~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~ 408 (456) ....-|.++..+|. .++..|++|+..+++. +....+.+++|.|+.--..++++.++... .+ T Consensus 377 i~s~~~~l~~~~~~~~~-~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~-------~~ 448 (499) T protein:vir:80 377 VVSEKSETYQTKNSHSQ-LIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYT-------TA 448 (499) T ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHH-------HH Confidence 44456667777665 5799999988876542 11233468999999988888877655543 33 Q ss_pred HHcCCcCcCHHHHHHHhcccCCCCCC---CCcccC-CCCCCCCCcC---CCCCCC Q lcl|NC_016762. 409 IGTGEPVFTAEEIREEAGYDPLQGGD---PLPDTE-PEDEDAARTD---PTGEQQ 456 (456) Q Consensus 409 ~~~g~~~i~~~E~R~~~~~~~~~~~~---~~~~~~-~~d~~~~~~d---~~~~~e 456 (456) +.+| +++...++... .+..+.+ ..+... +...+.+.+| -.+++| T Consensus 449 ~~~G--i~S~et~l~~~--~~~~d~ea~~el~~i~~E~~~~~~~~d~~g~~ge~e 499 (499) T protein:vir:80 449 KNQG--MIPLKIALQRA--WNITEAEADEWAEMLAKEKQAEIPNNDMTGIFGEEE 499 (499) T ss_pred HHcC--CCCHHHHHhhc--CCCChHHHHHHHHHHHHHhhcCCCCCCccccCCCCC Confidence 4444 44544433211 0000000 000000 0000111122 233444 No 152 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=98.87 E-value=3.3e-09 Score=67.11 Aligned_cols=395 Identities=10% Similarity=-0.035 Sum_probs=177.6 Q ss_pred CCchh-H-HHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCC Q lcl|NC_016762. 1 MTDKL-D-LAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTN 78 (456) Q Consensus 1 ~~~~~-~-~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~ 78 (456) |+.+. . +...|-.+...-+....|.. |- . ......--..... ...-.+.+++.||+..+.=++.+. T Consensus 1 l~~~~l~~~i~~~~~~~~r~~~l~~yy~---g~---~-~il~~~~~~~~~~-----~~ki~~n~~~~ivd~~~~~l~g~~ 68 (429) T protein:vir:98 1 MTKDLLSELIQKHRSFNLSYSAYKQLYE---GD---H-AILQQKQKEQYKP-----DNRLVVNFAKYIVDTFNGYFIGVP 68 (429) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhc---cc---c-ccccccccccCCC-----cceeecchHHHHHHHHhhhhcccC Confidence 77654 1 11112111111112222222 10 1 0111110000000 112357899999999999999999 Q ss_pred CEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccccccCCcCceeEEEEecc Q lcl|NC_016762. 79 PQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDRPARGKLNGLAKVTPAWA 157 (456) Q Consensus 79 ~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~Pl~~~~~~l~~i~~~~~ 157 (456) ++++..++.. ...+...+++.++...+.++.+....||.|++++..+ ||+.....+.+ ..+.|+|- T Consensus 69 ~~~~~~~~~~--------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~~~~~~p-----~~~~~v~d 135 (429) T protein:vir:98 69 VQTSHENKQV--------SNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDENAEAGITYLTP-----LEAFIVYD 135 (429) T ss_pred ceeecCChHH--------HHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCcEEEEEEcc-----cceEEEEe Confidence 9986543221 1236666777778888999999999999999888774 44432221110 01111111 Q ss_pred ccCC--h---hhhhccc---cccccCCceeEEEeecccCCccccceeeehhhhheecC--CcCCCcchHHHHHHHHHHHH Q lcl|NC_016762. 158 GCLK--P---KSFDEKP---DSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD--WTGDAIGFLEPAYNSFISLE 227 (456) Q Consensus 158 ~~~~--~---~~~~~Dp---~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~--~~~~G~S~le~~~~~l~~~~ 227 (456) .... + ..+..++ ..-.|+.+..+++-....++.......=|+--.+.+.. ...+|.|.++.+.+-+.+++ T Consensus 136 d~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liD~~d 215 (429) T protein:vir:98 136 DSIRQKPLFAVRYFYNKGGVLEGSYSDASNITYFKDGEKGIEIGESEPHPFDGVPMIEYVENEERQSLLASVVTLINAFN 215 (429) T ss_pred CCCCCceEEEEEEEEecCceEEEEEEeCceEEEEEecCCceEecccccccCCccceEEecCCCCCCCcHHHHHHHHHHHH Confidence 1000 0 0000000 11112222222111000000000000112111122221 23568999998888777777 Q ss_pred HHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCC----C--ceeEEecc Q lcl|NC_016762. 228 KVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQG----A--TVTQMVSA 301 (456) Q Consensus 228 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~----d--~~~~~~~~ 301 (456) .+.-..+..+-..+..++.++ + .... ++... .+..+ .+..+..+ . +|-..+.+ T Consensus 216 ~~~s~~~~~~~~~~~p~~~i~--------g----~~~~-~~~~~-------~~~~~-~~~~~~~~~~~~~~~~~l~~~~~ 274 (429) T protein:vir:98 216 KAISEKANDVEYFADAYLKIL--------G----AELD-DETLK-------SLRDT-RIINLKDTDAQQLTVEFLQKPDA 274 (429) T ss_pred HHHHHHHHHHHHhcCceeeee--------c----CCCC-cchhh-------hHhhC-ceeeccCCCCCCcceeEEeecCC Confidence 765554433222222222211 0 0011 11111 11111 22223221 1 33445667 Q ss_pred cCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcc-cchHH-H----HHHHHHHHHHHHhhhhHHHHHHHHHHHHhc---C Q lcl|NC_016762. 302 VSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGER-ASSED-Q----KYHNARCQARRVQELTFEINDLFAHLMRIG---V 372 (456) Q Consensus 302 ~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Gl-nst~D-~----~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~---~ 372 (456) ..++...++.+.+.+...+++|-. +++++ |.+++ + ..-...+..+ +..++..|++++.+++... . T Consensus 275 ~~~~~~~~~~l~~~i~~~s~~p~~-----~~~~~gn~Sg~Al~~~~~~l~~k~~~~-~~~~~~~l~~~~~li~~~~~~~~ 348 (429) T protein:vir:98 275 DATQEHLLDRLENLIFRTAMVANI-----SDESFGTASGIALRYRLQAMDNLAKTK-ERKFMSGMNRRYKLIASYPTSKI 348 (429) T ss_pred HHHHHHHHHHHHHHHHHHhCcccc-----CccccccchHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhccCC Confidence 788899999999999999999963 22222 33333 2 2334455544 4568899999988876542 1 Q ss_pred cC-CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHH--HH-HcCCcCcCHHHHHHHhccc--CCCCCCCCcccCCCCCCC Q lcl|NC_016762. 373 VP-LKAEFTAIWDDLTVPTKAERLANSKTMSEINSA--AI-GTGEPVFTAEEIREEAGYD--PLQGGDPLPDTEPEDEDA 446 (456) Q Consensus 373 ~~-~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~--~~-~~g~~~i~~~E~R~~~~~~--~~~~~~~~~~~~~~d~~~ 446 (456) ++ ...++++.|+|-...++++.|++..+.+..... ++ ..+ .+-++++-.+....+ ........ ...++. T Consensus 349 ~~~d~~~i~v~f~~~~p~~~~~~a~~~~kl~g~is~et~~~~l~-~v~d~~~E~~ri~~E~~~~~~~~~~----~~~~~~ 423 (429) T protein:vir:98 349 GPKDWIGIKYKFTRNLPANLLEESQIAGNLAGIVSEETQVGVLS-IVENPQKEIERKNSDKSTLISRQAG----GLNGQN 423 (429) T ss_pred CccccccceEEeCCCCCcCHHHHHHHHHHHhccCchHHHHHhCC-CCCCHHHHHHHHHHHHHHHHHHHHh----hhcCCC Confidence 22 224689999999999999999987776533221 11 111 011232221111100 00000000 000000 Q ss_pred CCcCCC Q lcl|NC_016762. 447 ARTDPT 452 (456) Q Consensus 447 ~~~d~~ 452 (456) .+.|.. T Consensus 424 ~~~~~~ 429 (429) T protein:vir:98 424 TTTILE 429 (429) T ss_pred CCCCCC Confidence 000000 No 153 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=98.86 E-value=1.3e-08 Score=63.84 Aligned_cols=413 Identities=10% Similarity=-0.013 Sum_probs=176.4 Q ss_pred CCchhHHHHh--------HHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchh Q lcl|NC_016762. 1 MTDKLDLAVN--------HAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVT 72 (456) Q Consensus 1 ~~~~~~~~~~--------~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~ae 72 (456) +..+-++... |..+.........|...--.+-....+.+............ --...+++.||+..+. T Consensus 29 ~~~~~e~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~-----ki~~n~~k~Ivd~~~~ 103 (483) T protein:vir:12 29 TNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDD-----RMITNFHANLVDQKVS 103 (483) T ss_pred cCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccccccccccccccc-----ccccchHHHHHHHHhh Confidence 2222212111 11111111111111111000000111111111000000000 0136889999999999 Q ss_pred HHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccccccCCcCceeE Q lcl|NC_016762. 73 TCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDRPARGKLNGLAK 151 (456) Q Consensus 73 d~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~Pl~~~~~~l~~ 151 (456) =++-+.+++...++. . .+.|+..++. ++...+.++.+....+|.+++++..+ ||+..-..+.+ .. T Consensus 104 ~l~G~p~~~~~~d~~-~-------~~~l~~~~~n-~~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~~i~~~~p-----~~ 169 (483) T protein:vir:12 104 YIVGKPIAFKHTDDE-V-------VKRIDEVLGN-RFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPA-----EQ 169 (483) T ss_pred hhcccCceeccCChH-H-------HHHHHHHHhc-cHHHHHHHHHHHHhhCCeEEEEEEEcCCCceEEEEEcc-----cc Confidence 999999888543321 1 1234444433 67778888888888999998888774 33321111110 01 Q ss_pred EEEeccccCC----h--hhhhc-cccccccCCce---eEEEeeccc---CCccccceeeehh----hhheecC--CcCCC Q lcl|NC_016762. 152 VTPAWAGCLK----P--KSFDE-KPDSETYGQPT---MWEYTEASQ---AGRPGLVRDIHPD----RVFILGD--WTGDA 212 (456) Q Consensus 152 i~~~~~~~~~----~--~~~~~-Dp~s~~yg~P~---~y~i~~~~~---~g~~~~~~~IH~S----Rli~~~~--~~~~G 212 (456) +.|+|..... . ..|.. +-..-.++.|. +|.+..... .........+|.. -.+.+.. .+.+| T Consensus 170 ~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g 249 (483) T protein:vir:12 170 GIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLE 249 (483) T ss_pred eEEEEcCCCCCceEEEEEEEEeecceEEEEEecCeEEEEEEeCCeeeecccccccccccccccCCCCccceEEecCCCCC Confidence 1222211000 0 00000 10111122221 222111000 0000001112211 1111222 23468 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCC Q lcl|NC_016762. 213 IGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQG 292 (456) Q Consensus 213 ~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~ 292 (456) .|.++.+.+-+.+++.+.-..+..+--.+...+.++ |...+...+ ....+ +..+.+.++++ T Consensus 250 ~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~--------------g~~~~~~~~-~~~~~----~~~~~~~~~~~ 310 (483) T protein:vir:12 250 ISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLT--------------NYDDQELPE-FKRLL----RYYGAIKVSDN 310 (483) T ss_pred CCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeee--------------cCCcccchh-HHHhh----hhccccccCCC Confidence 899988777776777654444432221111112111 111111111 11111 12223334555 Q ss_pred Cc--eeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----HHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_016762. 293 AT--VTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----YHNARCQARRVQELTFEINDLFA 365 (456) Q Consensus 293 d~--~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----nyyd~I~~~Qe~~lrp~L~~l~~ 365 (456) .+ |-..+.+.+++...++...+.+...+++|-.-+ +.. +| |.++. ++ .-...+ ..++..++..|++++. T Consensus 311 ~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~~-~~-n~Sg~Al~~~~~~l~~k~-~~~~~~f~~~l~~~~~ 386 (483) T protein:vir:12 311 GGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSS-DKF-GS-APSGVALEFLYTNLNLKA-DKLARKAKVAIQELLW 386 (483) T ss_pred CcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCc-ccc-cc-CcHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH Confidence 44 445677788999999999999999999996422 221 12 33333 22 223333 4555678999999998 Q ss_pred HHHHhcCc-CCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHH-cCCcCcCHH-HHHHHhccc--CCCCCCCCcc Q lcl|NC_016762. 366 HLMRIGVV-PLKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAIG-TGEPVFTAE-EIREEAGYD--PLQGGDPLPD 438 (456) Q Consensus 366 ~l~~s~~~-~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~-~g~~~i~~~-E~R~~~~~~--~~~~~~~~~~ 438 (456) +++..... ....++++.|+|-...+.++.|++..+.+-+.. +++. .+ .+-+++ |+....... .........+ T Consensus 387 li~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl~GiiS~et~~~~~~-~v~d~~~E~~ri~~E~~~~~~~~~~~~~ 465 (483) T protein:vir:12 387 FVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHP-FVEDLQAELERIEQEQMEYNKQLPNLDD 465 (483) T ss_pred HHHHHhcCCCccceeeEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCC-CCCCHHHHHHHHHHHHHHHHhhcccccc Confidence 87654322 234589999999999999999998877654332 1221 11 122232 332211100 0000000111 Q ss_pred cCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 439 TEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 439 ~~~~d~~~~~~d~~~~~e 456 (456) ....+...++..+.+++| T Consensus 466 ~~~d~~~~~~~~~~~e~e 483 (483) T protein:vir:12 466 GGADGAQQQERSNNKESE 483 (483) T ss_pred cccCCcccCCCCCcccCC Confidence 111111122222333444 No 154 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=98.84 E-value=2.4e-08 Score=62.36 Aligned_cols=399 Identities=9% Similarity=-0.029 Sum_probs=179.8 Q ss_pred CCchh-HHHHh-HHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCC Q lcl|NC_016762. 1 MTDKL-DLAVN-HAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTN 78 (456) Q Consensus 1 ~~~~~-~~~~~-~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~ 78 (456) |+.+. .-.++ |..+...-+....|.+.--.+-.... ...+.+. .. -...+++.||+..+.=++-+. T Consensus 17 ~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~---~~~~~~~-------~k--i~~n~~~~ivd~~~~~l~g~~ 84 (452) T protein:vir:36 17 ITVEVVTKFMEKHKLEVARYEYLKNMYLGIMAIDDEPA---KDSWKPD-------NR--LAVNFTKYIVDTFTGYFNGIP 84 (452) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCcc---ccccCcc-------ce--eecchHHHHHHHHhhhhcccC Confidence 54321 11111 22211111122222221111100000 0011110 11 235789999999999999999 Q ss_pred CEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccc---ccc------C-CcC Q lcl|NC_016762. 79 PQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDR---PAR------G-KLN 147 (456) Q Consensus 79 ~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~---Pl~------~-~~~ 147 (456) +++...++.. .+.|.+.++.-++...+.++.+....+|.+++++..+ +|+..-. |.. . ..+ T Consensus 85 ~~~~~~d~~~--------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~ 156 (452) T protein:vir:36 85 VKKSHSDKEI--------LTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDEDTQTNVVYNSPENMFMVYDDTVKQ 156 (452) T ss_pred ceeecCChhH--------HHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEEcCCCCC Confidence 9986543221 1346777777788899999999999999999888774 3433211 211 0 001 Q ss_pred ceeEEEEeccccCChhhhhccccccccCCce-eEEEeecccCCccccceeeehhhhheecCC--cCCCcchHHHHHHHHH Q lcl|NC_016762. 148 GLAKVTPAWAGCLKPKSFDEKPDSETYGQPT-MWEYTEASQAGRPGLVRDIHPDRVFILGDW--TGDAIGFLEPAYNSFI 224 (456) Q Consensus 148 ~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~-~y~i~~~~~~g~~~~~~~IH~SRli~~~~~--~~~G~S~le~~~~~l~ 224 (456) .+.....+|...- ....-..+.|. .|++.... ++.......-|.--.+.+..+ ...|.|.++.+.+-+. T Consensus 157 ~~~~~i~~~~~~~-------~~~~~~vyt~~~i~~~~~~~-~~~~~~~~~~~~~g~iPvv~~~n~~~g~sd~e~v~~liD 228 (452) T protein:vir:36 157 EPLFAVRYGVDED-------KKLQGEVYTLLETIKISGEN-DEISFGEGTYNPYPDLPVVEFYFNEERMSIFESVISLVN 228 (452) T ss_pred ceEEEEEEEEecC-------ceEEEEEEecCeEEEEEEcC-CceEEecceeccCCcccEEEecCCCCCCcchHHHHHHHH Confidence 1112222221100 00000111221 12221100 000000111233222222222 3458888988887777 Q ss_pred HHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCC-----Ccee--E Q lcl|NC_016762. 225 SLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQG-----ATVT--Q 297 (456) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~-----d~~~--~ 297 (456) +++.+.-..+..+-..+...+.+. +. ... .+.... +..+ .+..+..+ .+++ . T Consensus 229 a~d~~~s~~~~~~~~~~~p~~~~~-----g~-------~~~-~~~~~~-------~~~~-~~~~~~~~~~~~~~~~~~l~ 287 (452) T protein:vir:36 229 AFNKAISEKANDVDYFSDQYLTFL-----GA-------AVE-EEDLKN-------IRSN-RVINYYADGEGKNVDVKFLE 287 (452) T ss_pred HHHHHHHHHHHHHHHhcCceeEee-----cC-------CcC-chhhhh-------hhhc-ceEEecCCCCccCCcceeEe Confidence 777766555544433222222221 00 000 011111 1111 22222221 1334 3 Q ss_pred EecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-H----HHHHHHHHHHHHhhhhHHHHHHHHHHHHhc- Q lcl|NC_016762. 298 MVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-Q----KYHNARCQARRVQELTFEINDLFAHLMRIG- 371 (456) Q Consensus 298 ~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~----~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~- 371 (456) .+.+.+++...++...+.|...+++|-. -++.. | |++++ + ..-...+..+| ..++..|++++.+++... T Consensus 288 ~~~~~~~~~~~~~~l~~~I~~~s~~p~~-~~~~~-g--n~Sg~Al~~~~~~l~~k~~~~~-~~~~~~l~~~~~li~~~~~ 362 (452) T protein:vir:36 288 KPDSDSQTENLLDRLTKLIFQTTMVANI-SDESF-G--SSSGVSLAYKLQAMSNLALSFQ-RKFQSSLNSRYKLFCELST 362 (452) T ss_pred ecCCHHHHHHHHHHHHHHHHHHhCcccc-Ccccc-c--CCcHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHh Confidence 4566788999999999999999999963 22222 3 23333 2 23344455444 467899999988776432 Q ss_pred -Cc-C-CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHH-HcCCcCc-CH-HHHHHHhcccCCCCCCCCcccCCCC Q lcl|NC_016762. 372 -VV-P-LKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAI-GTGEPVF-TA-EEIREEAGYDPLQGGDPLPDTEPED 443 (456) Q Consensus 372 -~~-~-~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~-~~g~~~i-~~-~E~R~~~~~~~~~~~~~~~~~~~~d 443 (456) .+ . ...+++|.|+|-...++++.|++..+.+.+.. +++ ..+ -+ ++ .|+........ .......+.+..+ T Consensus 363 ~~~~~~~~~~i~i~f~~~~p~d~~~~a~~~~k~~g~iS~et~~~~~~--~~~d~~~E~~ri~~E~~-~~~~~~~~~~~~~ 439 (452) T protein:vir:36 363 NVSNKDSWKDIEYTFTRNEPKDIKEQAETANILMGITSQETALSVIS--VIPDVQAEMEKIKKEEA-STAIFDKDKQPSE 439 (452) T ss_pred ccCCccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCC--CCCCHHHHHHHHHHHHH-HHHHHHhhccCCC Confidence 22 2 23478999999999999999998877654322 111 111 22 22 23322111000 0000000000011 Q ss_pred CCCCCcCCCCCCC Q lcl|NC_016762. 444 EDAARTDPTGEQQ 456 (456) Q Consensus 444 ~~~~~~d~~~~~e 456 (456) .........-++| T Consensus 440 ~~~~~~~~~~~~e 452 (452) T protein:vir:36 440 KGTDTVVSETNEE 452 (452) T ss_pred CcccccCccccCC Confidence 1111111111111 No 155 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=98.83 E-value=3.3e-08 Score=61.57 Aligned_cols=369 Identities=10% Similarity=-0.026 Sum_probs=174.4 Q ss_pred CCchhHH--HHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHh-cCchhhhhhccchhHHhhC Q lcl|NC_016762. 1 MTDKLDL--AVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYR-RGGIAHGAVEKIVTTCWKT 77 (456) Q Consensus 1 ~~~~~~~--~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~-~~~l~r~iVd~~aed~tR~ 77 (456) |..++=. ..-..-....-+.++.|.. |..+.+ ..+.. -..++...|+ .-..+++|||..++=+.=+ T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~-----g~~~~~-----~~~~~-~p~~~~~~~~~v~nw~~~iVds~a~rl~~~ 69 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHKRRAEMRYEQYA-----MKHVDR-----FKGIT-IPQALSQQYRSILGWCAKGVDSLADRLVFR 69 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHHh-----ccCchh-----hcchh-hhHHHHHHHhhhcChhHHHHHHhHhhcccc Confidence 6655321 1111111111222333322 111110 01111 1234433343 2367899999999977777 Q ss_pred CCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccc---ccc------CCcC Q lcl|NC_016762. 78 NPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDR---PAR------GKLN 147 (456) Q Consensus 78 ~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~---Pl~------~~~~ 147 (456) ||+. +| . .+.+.+.+-++-....++.+-+..||.|++++.-+ ||++.-. |.. ...+ T Consensus 70 Gf~~-----~d-~--------~l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~~i~~~sP~~~~~i~D~~~~ 135 (409) T protein:vir:16 70 EFEN-----DD-F--------TVNEIFEENNPDIFFDSTVLSALIASCSFTYISKGENDAVRLQVIEATNATGIIDPITG 135 (409) T ss_pred cccC-----cc-h--------HHHHHHHhcChhHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEeecccc Confidence 8752 11 1 24455666677777888888899999998887653 3332111 111 0000 Q ss_pred ceeEEEEeccccCChhhhhccccccccCCce--eEEEeecccCCccccceeeehhhh---heecC----CcCCCcchH-H Q lcl|NC_016762. 148 GLAKVTPAWAGCLKPKSFDEKPDSETYGQPT--MWEYTEASQAGRPGLVRDIHPDRV---FILGD----WTGDAIGFL-E 217 (456) Q Consensus 148 ~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~--~y~i~~~~~~g~~~~~~~IH~SRl---i~~~~----~~~~G~S~l-e 217 (456) .+.....+|.... ...+..-.++.|. +|.+. .++. ....-|+--. ++|.. ...+|.|.+ + T Consensus 136 ~~~~a~~~~~~d~-----~~~~~~~~~~~~~~~~~~~~---~~~~--~~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~ 205 (409) T protein:vir:16 136 LLTEGYAVLERDE-----NNNVVLEAHFLPDRTDYYYR---DSRN--NISIANPTGNPLLVPIIHRPDAVRPFGRSRITR 205 (409) T ss_pred cceeeeEEEEecC-----CCceEEEEEEecCcEEEEEe---cCcc--ccceecCCCCcceEEecccccccccCCccccch Confidence 1111111111000 0001111122221 11111 0110 0112244332 33432 134688866 5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecC---CC- Q lcl|NC_016762. 218 PAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQ---GA- 293 (456) Q Consensus 218 ~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~---~d- 293 (456) ++..-..++.++.-...-...-.+..+..+. . + +..+.+. +.+...+.+ +..+.+ ++ T Consensus 206 ~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-----G---~-d~d~~~~----~~~~~~~~~------i~~~~~d~~g~~ 266 (409) T protein:vir:16 206 SGMYWQSNAKRTLERADVTAEFYSFPQKYVT-----G---L-SDDAEPM----ETWKATVSS------MLQFTKDEDGDK 266 (409) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhcChhheeE-----e---c-CCCCCcc----chhhhhhhH------hhccCCCCCCCC Confidence 6666555555553321111101122222221 1 1 0111111 111111111 111211 11 Q ss_pred -ceeEE-ecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCccc-chHH-----HHHHHHHHHHHHHhhhhHHHHHHHH Q lcl|NC_016762. 294 -TVTQM-VSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERA-SSED-----QKYHNARCQARRVQELTFEINDLFA 365 (456) Q Consensus 294 -~~~~~-~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Gln-st~D-----~~nyyd~I~~~Qe~~lrp~L~~l~~ 365 (456) ++.++ ++++.++-+.+.....++|+.++||..-|-|.+- | ++++ ....-..++.+|+ .+.+.+++++. T Consensus 267 ~~v~q~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~---NpsSa~Ai~a~~~~L~~ka~~k~~-~fg~~l~~~~r 342 (409) T protein:vir:16 267 PTLGQFTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSD---NPSSVEAIKASHENLRLAGRKAQR-SLGAGLLNVAY 342 (409) T ss_pred ceEEecCCCChhHHHHHHHHHHHHHhhhcCCCHHHcccccC---chhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Confidence 24333 3456677788888999999999999886655542 3 3332 3345556666665 57999999988 Q ss_pred HHHHhcC--cCCCC---ceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCC Q lcl|NC_016762. 366 HLMRIGV--VPLKA---EFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQ 431 (456) Q Consensus 366 ~l~~s~~--~~~~~---d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~ 431 (456) +.+.... +..++ .+.+.|.|+..++....| ..|++..+++++|.+....+-+++.++++..+ T Consensus 343 la~~~~~~~~~~~~~~~~~~v~W~~~~~~~~~s~a----~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 343 LAACLRDDVPYLREQFSKTKPKWEPLFEADASMLS----LIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred HHHHHhcCCCccchhhccceEEecCCCCcchhhHH----HHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 7655433 22232 468899998877744443 36788888888886555556678877765533 No 156 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=98.82 E-value=2.1e-08 Score=62.73 Aligned_cols=417 Identities=10% Similarity=0.003 Sum_probs=184.6 Q ss_pred CCchhHHHHh----HHHHH--HHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHH Q lcl|NC_016762. 1 MTDKLDLAVN----HAMSS--AIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTC 74 (456) Q Consensus 1 ~~~~~~~~~~----~a~~~--~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~ 74 (456) +....+++.+ |..+. .+.+..+-|.+----+ .........+.+ .--..+.+++.||+..+.=+ T Consensus 38 ~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i--~~~~~~~~~~~~---------~~ki~~n~~k~Ivd~~~~yl 106 (502) T protein:vir:48 38 MVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDV--LKSGRRKDNEMA---------DKRAVHNYGRMISKFKTGYL 106 (502) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc--cccccccccccc---------cceeecchHHHHHHHHhhhh Confidence 2222222211 22211 1222222221100000 000000000111 01135788999999999999 Q ss_pred hhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccc---ccc------C Q lcl|NC_016762. 75 WKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDR---PAR------G 144 (456) Q Consensus 75 tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~---Pl~------~ 144 (456) +-+.++++..++.+... +...|.+.+++-++...+.++.+....||.|++++..+ ||+.... |.. . T Consensus 107 ~g~p~~~~~~d~~~~~~----~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~~vydd 182 (502) T protein:vir:48 107 AGNPIRVEYDDNEDNSQ----NDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLETFVIYDN 182 (502) T ss_pred cccCeeEecCCccchhH----HHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCceEEEEEcccceEEEEcC Confidence 99999997665443322 22346667777788899999999999999999888774 3332111 111 0 Q ss_pred C-cCceeEEEEeccccCChhhhhccc-cccccCCc-eeEEEeecccCCccccceeeehhhhheecCC--cCCCcchHHHH Q lcl|NC_016762. 145 K-LNGLAKVTPAWAGCLKPKSFDEKP-DSETYGQP-TMWEYTEASQAGRPGLVRDIHPDRVFILGDW--TGDAIGFLEPA 219 (456) Q Consensus 145 ~-~~~l~~i~~~~~~~~~~~~~~~Dp-~s~~yg~P-~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~--~~~G~S~le~~ 219 (456) . .+.+....-+|.... ..+. .--.++.+ ..|++.. .++.......-|.--.+.+..+ +..|.|.++.+ T Consensus 183 ~~~~~~~~~ir~~~~~~-----~~~~~~~~~iyt~~~i~~~~~--~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v 255 (502) T protein:vir:48 183 SLEDNSIAAVRYYNRGT-----LQNAKDVVEIYTNQHIYTLDA--SDSFNEISVTPHAFGTVPITEFLNNADGIGDYETE 255 (502) T ss_pred CCCCceEEEEEEEEEee-----cCCcEEEEEEEeCCeEEEEEe--CCceeeccceecCCCccceEEecCCCCCCCchhhh Confidence 0 011111111111000 0000 00001111 1111110 0000000111232222222221 34688999888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHH--hcCCCeEEecCCCc--e Q lcl|NC_016762. 220 YNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQL--NRGNDVLLPTQGAT--V 295 (456) Q Consensus 220 ~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~lid~~d~--~ 295 (456) ..-+.+++.+.-..+..+-..+...+.+..... ....+....+. ....+ ...........+-+ | T Consensus 256 ~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~-----------~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~d~~~ 323 (502) T protein:vir:48 256 LYLIDLYDSAESDTANHMSDMADAILAIYGDLA-----------LPQGMQASDMK-RTRLMQLKPPKSADGKEGTVKAEY 323 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeecCcc-----------cccccchhhhh-hcceeeccccccccccccCcceeE Confidence 877777777765555443322222222211100 00001111110 00000 00000000111223 4 Q ss_pred eEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----HHHHHHHHHHHhhhhHHHHHHHHHHHHh Q lcl|NC_016762. 296 TQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----YHNARCQARRVQELTFEINDLFAHLMRI 370 (456) Q Consensus 296 ~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----nyyd~I~~~Qe~~lrp~L~~l~~~l~~s 370 (456) -..+.+..+....++...+.|...+++|-.- ++...|. .++. ++ .-...+. .++..++..|++++.+++.. T Consensus 324 l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~-~~~~~~n--~Sg~Alk~~~~~l~~k~~-~~~~~~~~~l~~~~~li~~~ 399 (502) T protein:vir:48 324 LTKSYDVSGAEAYKTRLNKDIHVFTNTPDMS-DNHFSGN--ASGEALKYKLFGLDQDRV-DTQSQFTQGLKRRYRLAARI 399 (502) T ss_pred eeecCCHHHHHHHHHHHHHHHHHHhCCCCcC-ccccccC--chHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH Confidence 4456677899999999999999999999643 3322222 2332 22 2233444 44467899999999887654 Q ss_pred cC--c---C-CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHHcCCcCc-CH-HHHHHHhcc-cCC--CCCCCC- Q lcl|NC_016762. 371 GV--V---P-LKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAIGTGEPVF-TA-EEIREEAGY-DPL--QGGDPL- 436 (456) Q Consensus 371 ~~--~---~-~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~~g~~~i-~~-~E~R~~~~~-~~~--~~~~~~- 436 (456) .. + + ...++++.|+|-...+.+|.|++..+.+-... +++.. .+-+ ++ .|+...... +.. ...... T Consensus 400 ~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~l~~-l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~ 478 (502) T protein:vir:48 400 GSLVNEFKDFDESRLKITFTPNLPKSLYEQVSILNDLGGQVSQETALSL-SGLVENPTEELDKINEESSKIDFKGYPSYF 478 (502) T ss_pred HhhcccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHh-CCCCCCHHHHHHHHHHHHHhhhhhcccccc Confidence 21 1 1 12368999999999999999999877654322 22221 1122 33 344322111 110 000000 Q ss_pred cccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 437 PDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 437 ~~~~~~d~~~~~~d~~~~~e 456 (456) .+......+..+.+++.+.| T Consensus 479 ~~~~~~~~d~~~e~~~~~~~ 498 (502) T protein:vir:48 479 YDNVGKYTDEVKETHTDDFE 498 (502) T ss_pred cccccccCCCccCCCCcCcC Confidence 01111111222223333333 No 157 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=98.82 E-value=3.4e-08 Score=61.52 Aligned_cols=412 Identities=9% Similarity=-0.008 Sum_probs=172.9 Q ss_pred CCchhHH-----------------------HHhHHHHHHHHHH--HHHHhhhhhccCcccchhhhhccCcc-cCCHHHHH Q lcl|NC_016762. 1 MTDKLDL-----------------------AVNHAMSSAIARA--RMSLLNQGIGHDAKRPQAWCEYGFPQ-EITFNDLY 54 (456) Q Consensus 1 ~~~~~~~-----------------------~~~~a~~~~~~~~--~d~~~n~~~~~gt~~~~~~~~~~~~~-~~~~~~l~ 54 (456) |++|.+- .++...+.-..+. .+-+.... .|. .+-..... ++. ......-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy--~g~-~~i~~~~~-~~~~~~~~~~~~ 76 (474) T protein:vir:95 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYY--DKD-NDINYQAY-KQDLHGNIDYTK 76 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHh--ccc-Cccccccc-hhhhcccccccc Confidence 3333221 1111111111000 00000000 000 00000000 000 00000000 Q ss_pred HHH-hcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec Q lcl|NC_016762. 55 TMY-RRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR 133 (456) Q Consensus 55 ~~Y-~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~ 133 (456) .-. -.+.+++.||+..+.=++-+.+++...++.. . +.+...++ -++...+.++.+....||.|++++.++ T Consensus 77 ~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~-~-------~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~d 147 (474) T protein:vir:95 77 PDWRITTNFHQNLVDQKVSYVAGKPVTYAHDDDKV-L-------DVIHQVLD-TRWDNKLIDILTAASNKGIDWLQVYIN 147 (474) T ss_pred cccccccchHHHHHHhhhhhhcccCceeccCChHH-H-------HHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeeeC Confidence 000 0368899999999999999999986543221 1 22444433 367888889988899999999988874 Q ss_pred -CCCCccccccCCcCceeEEEEeccccCC----hh--hhhcc-ccccccCCce-eEEEeecccCC----------ccccc Q lcl|NC_016762. 134 -DSQPWDRPARGKLNGLAKVTPAWAGCLK----PK--SFDEK-PDSETYGQPT-MWEYTEASQAG----------RPGLV 194 (456) Q Consensus 134 -D~~~~~~Pl~~~~~~l~~i~~~~~~~~~----~~--~~~~D-p~s~~yg~P~-~y~i~~~~~~g----------~~~~~ 194 (456) +|+.-...+.. ..+.|+|..... .. .|..+ -.--.++.|. .++... ..++ ..... T Consensus 148 ~~~~~~i~~~~p-----~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~-~~~~~~~~~~~~~~~~~~~ 221 (474) T protein:vir:95 148 EDGELKLFRVPA-----EQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVY-ENGGLIPDFYYGDEHIQTH 221 (474) T ss_pred CCCceEEEEEcc-----cceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEE-cCCceeeccccccccccCc Confidence 33321111110 011122211000 00 00000 0000111111 111110 0000 00001 Q ss_pred eeeehhhhheecC--CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHH Q lcl|NC_016762. 195 RDIHPDRVFILGD--WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNER 272 (456) Q Consensus 195 ~~IH~SRli~~~~--~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~ 272 (456) ..-|.--.+.+.. ....|.|.++.+.+-+.+++.+.-..+..+-..+...+.+. ++ .+....+.... T Consensus 222 ~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~--------g~---~~~~~~~~~~~ 290 (474) T protein:vir:95 222 FSTGSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILR--------GY---EGEDLSEFMEG 290 (474) T ss_pred ccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc--------CC---Ccccccchhhh Confidence 1123222222222 22358888888877777777765555544332222222211 10 00111111111 Q ss_pred HHHHHHHHhcCCCeEEecCCCc--eeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HHH-HH--H Q lcl|NC_016762. 273 FNEAARQLNRGNDVLLPTQGAT--VTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QKY-HN--A 346 (456) Q Consensus 273 ~~~~~~~~~~~~~~~lid~~d~--~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~n-yy--d 346 (456) +. ...++.++.+.+ |-+.+.+.+++...++...+.|...+++|-.-.-+. || |.++. ++. |. + T Consensus 291 -------~~-~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~--~~-n~Sg~Alk~~~~~l~ 359 (474) T protein:vir:95 291 -------LK-YYKAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKF--GS-ATSGIALKFLYTNLN 359 (474) T ss_pred -------hh-ccceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccccc--cc-ccHHHHHHHHHHHHH Confidence 11 122333455544 445577778999999999999999999996533221 22 33333 332 22 2 Q ss_pred HHHHHHHhhhhHHHHHHHHHHHHhcCc-CCCCceEEEeCCCCCCCHHHHHHHHHHHHHHH--HHHHH-cCCcCcCHH-HH Q lcl|NC_016762. 347 RCQARRVQELTFEINDLFAHLMRIGVV-PLKAEFTAIWDDLTVPTKAERLANSKTMSEIN--SAAIG-TGEPVFTAE-EI 421 (456) Q Consensus 347 ~I~~~Qe~~lrp~L~~l~~~l~~s~~~-~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~--~~~~~-~g~~~i~~~-E~ 421 (456) .-...++..++..|++++.++...... ....++++.|+|-...+++|.|++.++ +-.. .+++. .+ .+-+++ |+ T Consensus 360 ~k~~~~~~~~~~~l~~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a~~~~~-~giiS~et~~~~lp-~v~D~~~E~ 437 (474) T protein:vir:95 360 LKANKLKNKANVALQELMQFILDFNKIKLDAKEIEITFNFNVMVNDLEQSQIGAQ-SQYLSKETLVRHHP-WVDDPKAEL 437 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCccCHHHHHHHHHH-cCCCChHHHHHhCC-CCCCHHHHH Confidence 222455567899999999988765333 334678999999999999999987543 2111 11111 11 022332 33 Q ss_pred HHHhcc-cC-CCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 422 REEAGY-DP-LQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 422 R~~~~~-~~-~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) ...... .. ........+. .++...++..+.+++. T Consensus 438 eri~~E~~~~~~~~~~~~~~-~~~~~~~~~~~~~~e~ 473 (474) T protein:vir:95 438 ERLDEEQLELNKQLPNLDDG-GADGAQQQQQSENNQS 473 (474) T ss_pred HHHHHHHHHHHhhccccccc-cCCCCCCcCCCCcccc Confidence 211000 00 0000000000 0111111111111111 No 158 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=98.82 E-value=3.4e-08 Score=61.52 Aligned_cols=412 Identities=9% Similarity=-0.008 Sum_probs=172.9 Q ss_pred CCchhHH-----------------------HHhHHHHHHHHHH--HHHHhhhhhccCcccchhhhhccCcc-cCCHHHHH Q lcl|NC_016762. 1 MTDKLDL-----------------------AVNHAMSSAIARA--RMSLLNQGIGHDAKRPQAWCEYGFPQ-EITFNDLY 54 (456) Q Consensus 1 ~~~~~~~-----------------------~~~~a~~~~~~~~--~d~~~n~~~~~gt~~~~~~~~~~~~~-~~~~~~l~ 54 (456) |++|.+- .++...+.-..+. .+-+.... .|. .+-..... ++. ......-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy--~g~-~~i~~~~~-~~~~~~~~~~~~ 76 (474) T protein:vir:96 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYY--DKD-NDINYQAY-KQDLHGNIDYTK 76 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHh--ccc-Cccccccc-hhhhcccccccc Confidence 3333221 1111111111000 00000000 000 00000000 000 00000000 Q ss_pred HHH-hcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec Q lcl|NC_016762. 55 TMY-RRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR 133 (456) Q Consensus 55 ~~Y-~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~ 133 (456) .-. -.+.+++.||+..+.=++-+.+++...++.. . +.+...++ -++...+.++.+....||.|++++.++ T Consensus 77 ~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~-~-------~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~d 147 (474) T protein:vir:96 77 PDWRITTNFHQNLVDQKVSYVAGKPVTYAHDDDKV-L-------DVIHQVLD-TRWDNKLIDILTAASNKGIDWLQVYIN 147 (474) T ss_pred cccccccchHHHHHHhhhhhhcccCceeccCChHH-H-------HHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeeeC Confidence 000 0368899999999999999999986543221 1 22444433 367888889988899999999988874 Q ss_pred -CCCCccccccCCcCceeEEEEeccccCC----hh--hhhcc-ccccccCCce-eEEEeecccCC----------ccccc Q lcl|NC_016762. 134 -DSQPWDRPARGKLNGLAKVTPAWAGCLK----PK--SFDEK-PDSETYGQPT-MWEYTEASQAG----------RPGLV 194 (456) Q Consensus 134 -D~~~~~~Pl~~~~~~l~~i~~~~~~~~~----~~--~~~~D-p~s~~yg~P~-~y~i~~~~~~g----------~~~~~ 194 (456) +|+.-...+.. ..+.|+|..... .. .|..+ -.--.++.|. .++... ..++ ..... T Consensus 148 ~~~~~~i~~~~p-----~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~-~~~~~~~~~~~~~~~~~~~ 221 (474) T protein:vir:96 148 EDGELKLFRVPA-----EQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVY-ENGGLIPDFYYGDEHIQTH 221 (474) T ss_pred CCCceEEEEEcc-----cceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEE-cCCceeeccccccccccCc Confidence 33321111110 011122211000 00 00000 0000111111 111110 0000 00001 Q ss_pred eeeehhhhheecC--CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHH Q lcl|NC_016762. 195 RDIHPDRVFILGD--WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNER 272 (456) Q Consensus 195 ~~IH~SRli~~~~--~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~ 272 (456) ..-|.--.+.+.. ....|.|.++.+.+-+.+++.+.-..+..+-..+...+.+. ++ .+....+.... T Consensus 222 ~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~--------g~---~~~~~~~~~~~ 290 (474) T protein:vir:96 222 FSTGSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILR--------GY---EGEDLSEFMEG 290 (474) T ss_pred ccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhc--------CC---Ccccccchhhh Confidence 1123222222222 22358888888877777777765555544332222222211 10 00111111111 Q ss_pred HHHHHHHHhcCCCeEEecCCCc--eeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HHH-HH--H Q lcl|NC_016762. 273 FNEAARQLNRGNDVLLPTQGAT--VTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QKY-HN--A 346 (456) Q Consensus 273 ~~~~~~~~~~~~~~~lid~~d~--~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~n-yy--d 346 (456) +. ...++.++.+.+ |-+.+.+.+++...++...+.|...+++|-.-.-+. || |.++. ++. |. + T Consensus 291 -------~~-~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~--~~-n~Sg~Alk~~~~~l~ 359 (474) T protein:vir:96 291 -------LK-YYKAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKF--GS-ATSGIALKFLYTNLN 359 (474) T ss_pred -------hh-ccceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccccc--cc-ccHHHHHHHHHHHHH Confidence 11 122333455544 445577778999999999999999999996533221 22 33333 332 22 2 Q ss_pred HHHHHHHhhhhHHHHHHHHHHHHhcCc-CCCCceEEEeCCCCCCCHHHHHHHHHHHHHHH--HHHHH-cCCcCcCHH-HH Q lcl|NC_016762. 347 RCQARRVQELTFEINDLFAHLMRIGVV-PLKAEFTAIWDDLTVPTKAERLANSKTMSEIN--SAAIG-TGEPVFTAE-EI 421 (456) Q Consensus 347 ~I~~~Qe~~lrp~L~~l~~~l~~s~~~-~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~--~~~~~-~g~~~i~~~-E~ 421 (456) .-...++..++..|++++.++...... ....++++.|+|-...+++|.|++.++ +-.. .+++. .+ .+-+++ |+ T Consensus 360 ~k~~~~~~~~~~~l~~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a~~~~~-~giiS~et~~~~lp-~v~D~~~E~ 437 (474) T protein:vir:96 360 LKANKLKNKANVALQELMQFILDFNKIKLDAKEIEITFNFNVMVNDLEQSQIGAQ-SQYLSKETLVRHHP-WVDDPKAEL 437 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCccCHHHHHHHHHH-cCCCChHHHHHhCC-CCCCHHHHH Confidence 222455567899999999988765333 334678999999999999999987543 2111 11111 11 022332 33 Q ss_pred HHHhcc-cC-CCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 422 REEAGY-DP-LQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 422 R~~~~~-~~-~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) ...... .. ........+. .++...++..+.+++. T Consensus 438 eri~~E~~~~~~~~~~~~~~-~~~~~~~~~~~~~~e~ 473 (474) T protein:vir:96 438 ERLDEEQLELNKQLPNLDDG-GADGAQQQQQSENNQS 473 (474) T ss_pred HHHHHHHHHHHhhccccccc-cCCCCCCcCCCCcccc Confidence 211000 00 0000000000 0111111111111111 No 159 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=98.81 E-value=3.8e-08 Score=61.28 Aligned_cols=390 Identities=11% Similarity=-0.037 Sum_probs=178.6 Q ss_pred HHhHHHHHHHH---HHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecC Q lcl|NC_016762. 8 AVNHAMSSAIA---RARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEG 84 (456) Q Consensus 8 ~~~~a~~~~~~---~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~ 84 (456) +.+.++...++ ..+|-+.+...++=..++.....-+.. +...++.|.+ +.-+..++++.-.-.+..-|+|..+ T Consensus 1 v~~~~l~~e~at~~~~~d~~~~~~~~l~~~~~~il~~a~~g---~~~~y~~l~~-D~~i~s~l~~rk~av~~~~w~i~p~ 76 (488) T protein:vir:99 1 MEKPALGREIATSGDGRDITRPFISGLQVPNDSILQRRGGN---DLRVYEEILS-DAQVKTVWGQRQLAVVSREWKVEAG 76 (488) T ss_pred CCccchhHHHHHHHhhhhhhccccCCCCCCChHHHHhhccC---CHHHHHHHhh-ChHHHHHHHHHHHHHhcCCceEEcC Confidence 22233343333 335555555444422233222222111 1233444544 6777788888887777777788654 Q ss_pred CCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccccccCCcCceeEEEEeccccCChh Q lcl|NC_016762. 85 DDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDRPARGKLNGLAKVTPAWAGCLKPK 163 (456) Q Consensus 85 ~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~ 163 (456) +++...+. ....+++.++++.+...+.+.+ .+.+||.|++=+.-. ++..+ .+..|.++.+..+. T Consensus 77 ~~~~~~~~---~ae~v~~~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~---------~~~~l~~r~~~~f~-- 141 (488) T protein:vir:99 77 GDRPIDQA---AAEHLEQQLQRVGWDRVTSKML-FGVFYGYAVSELIYGRDDRYI---------TLEAIKVRNRRRFR-- 141 (488) T ss_pred CCChHHHH---HHHHHHHHHhCCCHHHHHHHHH-hhhhhcceeEEEEEeecCCee---------eEeeeeeeccccee-- Confidence 43322222 2234777788887766666655 588999998865432 22111 12222222211111 Q ss_pred hhhccccccccCCceeEEEeecccCCccc--c-ceee--ehhhhheecCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 164 SFDEKPDSETYGQPTMWEYTEASQAGRPG--L-VRDI--HPDRVFILGDWTGDAIGFLEPAYNSFISLEKVEGGSGESFL 238 (456) Q Consensus 164 ~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~--~-~~~I--H~SRli~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~ 238 (456) -|+.. ... +........|.+. . .+.+ |.+ ...+.+|.|++..||...+--.....-++.-+- T Consensus 142 ---~d~~~----~l~-~~~~~~~~~g~~lp~~~~~i~~~~~~-----~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E 208 (488) T protein:vir:99 142 ---YDQDG----GLR-LLTPNNMFEGEPCPAPYFWHFSTGAD-----NDDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLD 208 (488) T ss_pred ---ecCCC----ceE-EeccCCCCCccccccCceEEEEeecC-----CCCCcccchHHHHHHHHHHHHHhhHHHHHHHHH Confidence 11111 110 0000000011100 0 1112 222 235678999999998754322222222221111 Q ss_pred HHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEecccCCH---HHHHHHHHHH Q lcl|NC_016762. 239 KNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSAVSDP---GPTYNVNLQT 315 (456) Q Consensus 239 ~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~~sgl---~~~~~~~~~~ 315 (456) +........++ + . ....++-.+.+.+.+..+.+ ....+|..+.+++-++.+=++. ..+++..-.+ T Consensus 209 ~yG~P~~igky----~------~-~~a~~~ek~~l~~av~~~~~-~~~~viP~~~~ie~~ea~~~~~~~~~~li~~~d~~ 276 (488) T protein:vir:99 209 KFGMPTAVGRY----D------D-KTATPEDKAKLLAALHAIQT-DSAIIMPAGMQAELLEAGRSGTADYKTLHDTMDAT 276 (488) T ss_pred HcCCceeeeec----C------C-CCCCHHHHHHHHHHHHHHhc-CcEEEecCCceeEEeecCCCChHHHHHHHHHHHHH Confidence 11111000000 0 0 01123344555556665544 3567778888888887654443 3355555556 Q ss_pred HHhhh-cCCeEEeeccCCCcccchHH--HHHHHHHHHHHHHhhhhHHHH-HHHHHHHHhcCcCC-CCceEEEeCCCCCCC Q lcl|NC_016762. 316 AAAGV-DIPTKILVGMQTGERASSED--QKYHNARCQARRVQELTFEIN-DLFAHLMRIGVVPL-KAEFTAIWDDLTVPT 390 (456) Q Consensus 316 ~aaas-~IP~t~L~G~sp~Glnst~D--~~nyyd~I~~~Qe~~lrp~L~-~l~~~l~~s~~~~~-~~d~~~~f~pL~~~s 390 (456) ||-+. |= | |.++.-+|-.|.++ .....+.+.+... .+...|. .|+..|+...++.. ++-+. |.-.-..+ T Consensus 277 Isk~iLGq--t-lts~~~~Gs~a~~~vh~~v~~d~~~aDa~-~i~~tln~~li~~l~~~N~~~~~~p~~~--~~~~e~ed 350 (488) T protein:vir:99 277 IAKVGLGQ--V-ASTQGTPGRLGNDDLQADVRLDLVKADAD-LICESFNLGPARWLTEWNFPGAQPPRVY--RVIEEPED 350 (488) T ss_pred HHHHHhhh--h-hcccccccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhCcCCcCCceeE--ecCCCccc Confidence 66542 21 1 22332233223333 3456666766665 4566675 48887877766532 22233 32222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHc-CCcCcCHHHHHHHhcccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 391 KAERLANSKTMSEINSAAIGT-GEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 391 eke~Aei~~~~A~a~~~~~~~-g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) - ++.|+++..+++. |. .++.+++|+..+.+.....++..... ......+++...+.. T Consensus 351 l-------~~~a~~~~~l~~~~G~-~i~~~~i~e~~Gip~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 408 (488) T protein:vir:99 351 I-------TAKAERDEKVFRMSGF-RPTRGYVQETYGVEVESTQAEATAPT-PSTEFAEGDQPSDPA 408 (488) T ss_pred H-------HHHHHHHHHHHhhcCC-CCCHHHHHHHcCCCCcccccccccCC-CcccCCCCCCCCCch Confidence 2 3456667777775 63 47889999998886543322211100 111111111101111 No 160 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=98.78 E-value=2.4e-08 Score=62.37 Aligned_cols=406 Identities=10% Similarity=-0.029 Sum_probs=178.5 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccC---cccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhC Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHD---AKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKT 77 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~g---t~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~ 77 (456) ++..+...++......+....+-+....-|-- ..+...+...+.....+ -...--.+++++.||+..+.=++.+ T Consensus 20 ~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~---~~~~ki~~~~~~~Ivd~~~~~l~g~ 96 (479) T protein:vir:79 20 STINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFT---KVNNKAINNYHKLLVDQKVGYSVGN 96 (479) T ss_pred ChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccccc---cCcceeecchHHHHHHHHHhhhhcC Confidence 66666666665544432221222222111110 00000111111100000 0001124788999999999999999 Q ss_pred CCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecC-CCCc---ccccc------C-Cc Q lcl|NC_016762. 78 NPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRD-SQPW---DRPAR------G-KL 146 (456) Q Consensus 78 ~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D-~~~~---~~Pl~------~-~~ 146 (456) .+++...++. .. ..++. +.+-++...+.++.+....+|.+++++.++. |+.- -.|.. . .. T Consensus 97 p~~~~~~~~~-~~-------~~~~~-~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~d~~~~ 167 (479) T protein:vir:79 97 PIVFNADDDN-LT-------KLLND-LLGEEFDDTITELYLNASNKGVEWLHPYINRKGEFKYVIIPAEEAIPIWDSKRQ 167 (479) T ss_pred CceeccCCHH-HH-------HHHHH-HHhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEccceeEEEEeCCCC Confidence 9998543321 11 11222 3334788889999999999999999888743 3221 11221 0 00 Q ss_pred CceeEEEEeccccCChhhhhccc--cccccCCceeEEEeecccCCccc----------------c----ceeeehhhhhe Q lcl|NC_016762. 147 NGLAKVTPAWAGCLKPKSFDEKP--DSETYGQPTMWEYTEASQAGRPG----------------L----VRDIHPDRVFI 204 (456) Q Consensus 147 ~~l~~i~~~~~~~~~~~~~~~Dp--~s~~yg~P~~y~i~~~~~~g~~~----------------~----~~~IH~SRli~ 204 (456) +-+....-+|...- .+... ...-|..-..|++... ++... . ...=|.--.+. T Consensus 168 ~~~~~~ir~y~~~~----~~~~~~~~~e~y~~~~i~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vP 241 (479) T protein:vir:79 168 RELVAFIRFYYIED----IDGNKIKRVEYYTENDITYFIER--GNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVP 241 (479) T ss_pred CceEEEEEEEEEee----cCCceEEEEEEEeCCcEEEEEec--CCcccccccccccccccccccccccccccccCCCccc Confidence 11111111111000 00000 0001111111111100 00000 0 00112222222 Q ss_pred ecC--CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhc Q lcl|NC_016762. 205 LGD--WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNR 282 (456) Q Consensus 205 ~~~--~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 282 (456) +.. .+.+|.|.++.+..-+.+++.+.-..+..+-..+...+.++ +.. +....+.... +. T Consensus 242 vv~~~nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~--------g~~---~~~~~~~~~~-------~~- 302 (479) T protein:vir:79 242 FIPFKNNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLK--------EYP---GTSLQEFIDN-------IR- 302 (479) T ss_pred EEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeee--------cCC---ccccccchhh-------hh- Confidence 222 23568899988877777777765555443322222222211 100 0010111111 11 Q ss_pred CCCeEEecCCCc--eeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HHH----HHHHHHHHHHhh Q lcl|NC_016762. 283 GNDVLLPTQGAT--VTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QKY----HNARCQARRVQE 355 (456) Q Consensus 283 ~~~~~lid~~d~--~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~n----yyd~I~~~Qe~~ 355 (456) ..++..++.+.+ |-+.+.+.+++...++...+.+...+++|-.-.-+. | |.+++ ++. -...+ ...+.. T Consensus 303 ~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~---g-n~Sg~Ai~~~~~~l~~k~-~~~~~~ 377 (479) T protein:vir:79 303 YYKSIKVDGGGGVDKLEINIPVEAKKELLDRLEKNIIIFGQGVNPESQNT---G-DKSGVALKFLYSLLDLKC-SKTEKK 377 (479) T ss_pred hccceecCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCccccccccc---c-chhHHHHHHHHHHHHHHH-HHHHHH Confidence 112333455555 444566777889999999999999999996533221 1 33333 222 23334 344556 Q ss_pred hhHHHHHHHHHHHHhc---Cc--CCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHHcCCcCc-CH-HHHHHHhc Q lcl|NC_016762. 356 LTFEINDLFAHLMRIG---VV--PLKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAIGTGEPVF-TA-EEIREEAG 426 (456) Q Consensus 356 lrp~L~~l~~~l~~s~---~~--~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~~g~~~i-~~-~E~R~~~~ 426 (456) ++..|++++++++... .+ ....+++|.|++-...++++.|++..+.+-+.. +++.. .+-+ ++ .|+..... T Consensus 378 ~~~~l~~~~~li~~~~~~~~~~~~~~~~i~i~f~~~~p~~~~~~a~~~~kl~g~iS~et~l~~-l~~v~d~~~E~~ri~~ 456 (479) T protein:vir:79 378 FKKAIRELLWFVCEYLKISGNKSYDYKTVQITFNHSMIINEAEKIDMAAKSTGIVSDETIVSN-HPWVEDVNDELERLKK 456 (479) T ss_pred HHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHh-CCCCCCHHHHHHHHHH Confidence 8999999998876431 11 223578999999999999999988766543221 11111 0111 11 22221110 Q ss_pred ccCCCCCCCCcccCCCCCCCCCcCCC Q lcl|NC_016762. 427 YDPLQGGDPLPDTEPEDEDAARTDPT 452 (456) Q Consensus 427 ~~~~~~~~~~~~~~~~d~~~~~~d~~ 452 (456) . .-....... ...+.+....+++ T Consensus 457 E-~~~~~~~~~--~~~~~~~~~~~e~ 479 (479) T protein:vir:79 457 Q-EDTQKEYDD--LIPNNQDGVIDET 479 (479) T ss_pred H-HHHHHHHHh--ccCcccCCCcCcC Confidence 0 000000000 0001111111111 No 161 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=98.78 E-value=1.6e-08 Score=63.40 Aligned_cols=409 Identities=11% Similarity=0.086 Sum_probs=181.9 Q ss_pred CCchhHHHHhHHHHHHHHHH--HHHHhhhh----hccCcc---cchhhhhccC--c-----ccCCHHHHHHHHhcCchhh Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARA--RMSLLNQG----IGHDAK---RPQAWCEYGF--P-----QEITFNDLYTMYRRGGIAH 64 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~--~d~~~n~~----~~~gt~---~~~~~~~~~~--~-----~~~~~~~l~~~Y~~~~l~r 64 (456) |.+|++-. ++..+.+. ..++.+.. ..+... |=..|..++- + +......-...+.+-.+++ T Consensus 3 ~~~~~k~~----~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sln~~~ 78 (508) T protein:vir:15 3 LIQRIKDL----FWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKNTINMAK 78 (508) T ss_pred hHHHHHHH----HHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccceeecchHH Confidence 44444422 11111110 00111100 000000 0000111100 0 0000001111223448999 Q ss_pred hhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCC------c Q lcl|NC_016762. 65 GAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQP------W 138 (456) Q Consensus 65 ~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~------~ 138 (456) .||+..|+=++-+-++|..++++.. ...|.+.+++-+++..+.+++......|++++-+.++.++. . T Consensus 79 ~i~~~~A~lv~~e~~~i~v~~~~~~-------~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~i~~v~a 151 (508) T protein:vir:15 79 TAARRIASVVFNEKAEIHVKDNNEA-------DKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYIDGNHIKIAWVRA 151 (508) T ss_pred HHHHHHHhhhhCCCceEEeCCchHH-------HHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEeCCeeEEEEEcC Confidence 9999999999999888865432211 12477788888899999999999888888888777654331 1 Q ss_pred c--ccccCCcCceeEEEEeccccCChhhhhccccccccC-Cc--------eeEEEeecccC-------Cccc-------- Q lcl|NC_016762. 139 D--RPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYG-QP--------TMWEYTEASQA-------GRPG-------- 192 (456) Q Consensus 139 ~--~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg-~P--------~~y~i~~~~~~-------g~~~-------- 192 (456) + -|+.-..+++..+ ++|...... +-....|+ .= ..|.|.-...- |.+. T Consensus 152 d~~~P~~~d~~~~~~~-af~~~~~~~-----~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~ 225 (508) T protein:vir:15 152 DQFYPLQSNTNDISEA-AIASRTQRT-----ESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVY 225 (508) T ss_pred CeeEEEEEcCCCeEEE-EEEEEEEee-----cCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccc Confidence 1 1443223333222 122110000 00000000 00 01222110000 0000 Q ss_pred ----cceee-ehhhh--heec--------CCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhh Q lcl|NC_016762. 193 ----LVRDI-HPDRV--FILG--------DWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGE 257 (456) Q Consensus 193 ----~~~~I-H~SRl--i~~~--------~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 257 (456) ....+ +-.|. .+|. ..++.|+|++..+.+.+..++.+...+..-+ +.+-+.+.+ ..+ T Consensus 226 ~~l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~-~~~~~~i~v-------~~~ 297 (508) T protein:vir:15 226 KELAPQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEI-RLGQKHIAV-------QPG 297 (508) T ss_pred cCCCcceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHH-Hhcccceee-------chH Confidence 00111 11221 1121 1245699999999998888887766555433 221111111 011 Q ss_pred HHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEeccc--CCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcc Q lcl|NC_016762. 258 IASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSAV--SDPGPTYNVNLQTAAAGVDIPTKILVGMQTGER 335 (456) Q Consensus 258 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~~--sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Gl 335 (456) +....+.+.. .+..--+.+ ...+.- .+.+..++.++..+ .-....++.+.+.+...+|++-. -||-..+|. T Consensus 298 ~l~~d~~~~~----~~~~~~~~~-~~~~~~-~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~-~f~~~~~~~ 370 (508) T protein:vir:15 298 MLRFDDEHKP----TFDTEQNVY-VGVLSD-DNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTG-TFSYSNDGV 370 (508) T ss_pred HhcCCCCCcc----ccCCCCeeE-EeccCC-CCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCch-hcccccCcc Confidence 1111111100 010000000 000000 01123366666554 34677788888888888888854 667666666 Q ss_pred c-chH---HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcC----------------cCCCCceEEEeCCCCCCCHHHHH Q lcl|NC_016762. 336 A-SSE---DQKYHNARCQARRVQELTFEINDLFAHLMRIGV----------------VPLKAEFTAIWDDLTVPTKAERL 395 (456) Q Consensus 336 n-st~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~----------------~~~~~d~~~~f~pL~~~seke~A 395 (456) . |++ ..+.-|.++..+|. .++..|+.|+..++.... ...+.+++|.|++--.+++.+++ T Consensus 371 ~TAtei~s~~~~~~~t~~~~~~-~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~ 449 (508) T protein:vir:15 371 KTATEVVSNNSMTYQTRSSYLT-MVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQL 449 (508) T ss_pred ccHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHH Confidence 3 443 45667888888876 579999998887654311 12345799999999888877765 Q ss_pred HHHHHHHHHHHHHHHcCCcCcCH------------HHHHHHhcccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 396 ANSKTMSEINSAAIGTGEPVFTA------------EEIREEAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 396 ei~~~~A~a~~~~~~~g~~~i~~------------~E~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) +...+ ++.+| +++. +|+++.+..-.-.... .+ +.+ ..-.+..+++.| T Consensus 450 ~~~~~-------~v~aG--i~s~e~~i~~~~g~~deea~~el~ri~~E~~~--~~--~~~-~~~~~~~g~~ge 508 (508) T protein:vir:15 450 EEDAK-------VLAIG--ALSKQTFLQRNYGMTDEQAAEELAKIQSEAPT--DT--FEG-GRSAILNGGDGE 508 (508) T ss_pred HHHHH-------HHhcC--CCCHHHHHHhcCCCChHHHHHHHHHHHHhccc--cC--ccc-cccccCCCCCCC Confidence 44332 33344 4444 3333322110000000 00 000 001111122222 No 162 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=98.77 E-value=5.7e-08 Score=60.30 Aligned_cols=410 Identities=10% Similarity=-0.027 Sum_probs=169.9 Q ss_pred CCch-hHHHHhHHHHHHHHH--HHHHHhhhhhccCcccchhhhhccCcc-----cC----CHHHHHHHHhcCchhhhhhc Q lcl|NC_016762. 1 MTDK-LDLAVNHAMSSAIAR--ARMSLLNQGIGHDAKRPQAWCEYGFPQ-----EI----TFNDLYTMYRRGGIAHGAVE 68 (456) Q Consensus 1 ~~~~-~~~~~~~a~~~~~~~--~~d~~~n~~~~~gt~~~~~~~~~~~~~-----~~----~~~~l~~~Y~~~~l~r~iVd 68 (456) |.-. +...+...+..-..+ ..+-+....-| ..+-......+.. .. ...-.....-.+.+++.||+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g---~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd 77 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRN---ENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLD 77 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---ccccccccchhhhhcccccccccccccccccceeccchhHHHHH Confidence 3221 111122121111101 01111111111 1111110000000 00 00000011234679999999 Q ss_pred cchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec--CCCCcc---cccc Q lcl|NC_016762. 69 KIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR--DSQPWD---RPAR 143 (456) Q Consensus 69 ~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~--D~~~~~---~Pl~ 143 (456) ..+.=++.+.+++...++. . .+.+...++ -++.....++.+....+|.|++++.++ ||+... .|.. T Consensus 78 ~~~~yl~G~p~~~~~~~~~-~-------~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~~~~~~~~p~~ 148 (471) T protein:vir:10 78 QKKAYALTYPPTFDVDDKK-V-------NDMIVDVLG-DDYERISKQLCVNAGNAGIAWLHVWKDASDNSFRYACVDSKE 148 (471) T ss_pred hhhhhhcccCceeccCChH-H-------HHHHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEEeeCCCCeeEEEEEcccc Confidence 9999899888887543321 1 122333333 367778888889999999998888774 343221 1221 Q ss_pred -------CCcCceeEEEEeccccCChhhhhcccccc-ccCCceeEEEeecccC------------------Ccc-cccee Q lcl|NC_016762. 144 -------GKLNGLAKVTPAWAGCLKPKSFDEKPDSE-TYGQPTMWEYTEASQA------------------GRP-GLVRD 196 (456) Q Consensus 144 -------~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~-~yg~P~~y~i~~~~~~------------------g~~-~~~~~ 196 (456) ...+.+....=+|..... .-.....-. -|..=..|.+.....+ |.. ..... T Consensus 149 ~~~i~d~~~~~~~~~~ir~~~~~~~--~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (471) T protein:vir:10 149 VIPIYSKSLDKKSIGVLRVYSSIDE--TDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSF 226 (471) T ss_pred eEEEEcCCCCCceEEEEEEEEeecc--CCCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccc Confidence 000111111111110000 000000000 0111111111100000 000 00000 Q ss_pred eehhhhheecC--CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHH Q lcl|NC_016762. 197 IHPDRVFILGD--WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFN 274 (456) Q Consensus 197 IH~SRli~~~~--~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~ 274 (456) -|.=-.+.+.. .+..|.|.++.+.+-+.+++.+.-..+..+-..+-..+.++. . .+....+.... T Consensus 227 ~~~~g~iPvv~~~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g---~--------~~~~~~~~~~~-- 293 (471) T protein:vir:10 227 KHDFGLVPFIPFKNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTN---Y--------GGQDKQEFLED-- 293 (471) T ss_pred cCCCCceeEEEeccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeec---C--------CccccchhHHH-- Confidence 11111111221 234578888887776667776654444333222211221111 0 01111111111 Q ss_pred HHHHHHhcCCCeEEec-----C--CCceeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HHH-H- Q lcl|NC_016762. 275 EAARQLNRGNDVLLPT-----Q--GATVTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QKY-H- 344 (456) Q Consensus 275 ~~~~~~~~~~~~~lid-----~--~d~~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~n-y- 344 (456) +.. .+...+. . +-+|-..+.+..++...++...+.|...++.|-.-..+. | |+++. ++. | T Consensus 294 -----~~~-~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~--g--n~Sg~Alk~~~~ 363 (471) T protein:vir:10 294 -----LKR-YKMIKMDNDGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDKL--G--NSSGVALKFLYS 363 (471) T ss_pred -----hhc-CCeEEecCCCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCcccc--c--CccHHHHHHHHH Confidence 111 1122221 1 224555667788999999999999999999996433221 2 33443 332 2 Q ss_pred --HHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHHcCCcCcCHH- Q lcl|NC_016762. 345 --NARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAIGTGEPVFTAE- 419 (456) Q Consensus 345 --yd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~~g~~~i~~~- 419 (456) ...+. ..+..++..|++++++++...-.....+++|.|+|....+++|.|++..+.+...+ +++..--.+-+++ T Consensus 364 ~l~~k~~-~~~~~~~~~l~~~~~li~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~~p~v~D~~~ 442 (471) T protein:vir:10 364 LLELKAG-NMETQFRSGYATLVKMILKHLGLSDKLKIKQTWTRNSINNDTEMAQVVSTLATITSRENVAKSNPIVEDWQD 442 (471) T ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHhccCCCceeEEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHH Confidence 22344 44567899999999888654333445689999999999999999998766543221 1111100111222 Q ss_pred HHHHHhcccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 420 EIREEAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 420 E~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) |+.... .+........ .+....++..|.| T Consensus 443 E~eri~-~E~~~~~~~~-------~~~~~~~~~~e~~ 471 (471) T protein:vir:10 443 ELRLQK-AEQEGRSEKL-------YDMEEVEHESEVE 471 (471) T ss_pred HHHHHH-HHHHHHHhcc-------cccCCCCCccccC Confidence 221111 0000000000 0111111111122 No 163 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=98.76 E-value=6.1e-08 Score=60.13 Aligned_cols=426 Identities=9% Similarity=-0.041 Sum_probs=181.4 Q ss_pred CCch-hHHHHhH-HHHH--HHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhh Q lcl|NC_016762. 1 MTDK-LDLAVNH-AMSS--AIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWK 76 (456) Q Consensus 1 ~~~~-~~~~~~~-a~~~--~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR 76 (456) |+.. +.-.+++ -.+. .+.+..+-|.+--.-+ ..+......-+.+. ..-..++++.||+..+.=++- T Consensus 22 l~~~~i~~li~~~~~~~~~r~~~l~~YY~g~~~~i-~~~~~~~~~~~~~~---------~ki~~n~~~~Iv~~~~~~l~G 91 (506) T protein:vir:94 22 LTPNKIMKFITHHFNYQRPRLEMLDDYYQGYNLKI-LDKQSRRHEDGKAD---------HRATHSFAKYIADFQTSYSVG 91 (506) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc-cccccccccccCCc---------ceeecchHHHHHHHhhhhhcc Confidence 4433 3333332 2111 1122122121100000 00100000011110 112578999999999999998 Q ss_pred CCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcc---cccc------CC- Q lcl|NC_016762. 77 TNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWD---RPAR------GK- 145 (456) Q Consensus 77 ~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~---~Pl~------~~- 145 (456) +.+++...++... ..|...++.-++...+.++.+....+|.+++++.++ ||+..- .|.. .. T Consensus 92 ~p~~~~~~d~~~~--------~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~~~~i~~~~p~~~~~v~dd~~ 163 (506) T protein:vir:94 92 NPINVKLPDDGSN--------SGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGEDNEEHLAKLDPLDTFVIYSTDV 163 (506) T ss_pred cCceeecCcchHH--------HHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEecCCC Confidence 9988865543221 236677777788888999999999999999988884 333221 1211 00 Q ss_pred cCceeEEEEeccccCChhhhhcccc-----ccccCCceeEEEeecccCCccccceeeehhhhheecCC--cCCCcchHHH Q lcl|NC_016762. 146 LNGLAKVTPAWAGCLKPKSFDEKPD-----SETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW--TGDAIGFLEP 218 (456) Q Consensus 146 ~~~l~~i~~~~~~~~~~~~~~~Dp~-----s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~--~~~G~S~le~ 218 (456) .+.+....-+|..... ..+.. ...++.+..+.+-.....+.......-|+--.+.+..+ +..|.|.++. T Consensus 164 ~~~~~~~v~~~~~~~~----~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~ 239 (506) T protein:vir:94 164 DPKPIMAVRYHQIELV----DDNQVSTINYVPETWTADTYTLYNPTPIMGKMQVDTTKPITTFPVVEFKNSNFRLGDFEN 239 (506) T ss_pred CCceEEEEEEEeeeec----cCCceeEEEEEEEEEeCceEEEeccccCccceeccccccCCccceEEecCCCCCCCchhh Confidence 0111111111111000 00000 00111222222211110000000011243333333222 2347777877 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHh--hHHhhh----cCCHHHHHHHHHHHHHHHhcCCCeEEec-- Q lcl|NC_016762. 219 AYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLG--EIASTY----GVTLDALNERFNEAARQLNRGNDVLLPT-- 290 (456) Q Consensus 219 ~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~--~l~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~lid-- 290 (456) +.+-+.+++.+.-..+..+-..+...+.+......... .+.... ..+.........+.+..+..+ +.+.+. T Consensus 240 ~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 318 (506) T protein:vir:94 240 VLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDA-NMLLLKSG 318 (506) T ss_pred hHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHHHhhhhhc-Ceeeeccc Confidence 77666666666555544332222222222221111111 000000 000000000011112222221 111111 Q ss_pred -------CCCc--eeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HHH-H---HHHHHHHHHhhh Q lcl|NC_016762. 291 -------QGAT--VTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QKY-H---NARCQARRVQEL 356 (456) Q Consensus 291 -------~~d~--~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~n-y---yd~I~~~Qe~~l 356 (456) .+.+ |-..+.+..+....++...+.|...+++|-.- ++...| |.+++ ++. | ...+. ..+..+ T Consensus 319 ~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~-~~~~~~--n~Sg~Aik~~~~~l~~k~~-~k~~~~ 394 (506) T protein:vir:94 319 MTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLT-DENFAS--NSSGVAMQYKVLGTVELAS-TKRRMF 394 (506) T ss_pred ccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccc-cccccc--cchHHHHHHHHHHHHHHHH-HHHHHH Confidence 1223 34455677899999999999999999999632 111112 33343 221 2 23444 445568 Q ss_pred hHHHHHHHHHHHHh-c--Cc--C-CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHHcCCcCcCHH-HHHHHh-- Q lcl|NC_016762. 357 TFEINDLFAHLMRI-G--VV--P-LKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAIGTGEPVFTAE-EIREEA-- 425 (456) Q Consensus 357 rp~L~~l~~~l~~s-~--~~--~-~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~~g~~~i~~~-E~R~~~-- 425 (456) +..|++++.+++.. + .+ + ...+++|.|+|-...+++|.|++..+.+-... +++..--.+-+++ |+.... T Consensus 395 ~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~lp~v~d~~~E~~ri~~E 474 (506) T protein:vir:94 395 ERGLYARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKALVQAGATLPQKYLYQQLPGVTNPQDIVDMMKEQ 474 (506) T ss_pred HHHHHHHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHH Confidence 99999998876543 1 11 1 12367899999999999999998777643222 1221110122232 221111 Q ss_pred --cccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 426 --GYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 426 --~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) ...+..+ .......+++..+.+...++| T Consensus 475 ~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~e 504 (506) T protein:vir:94 475 SANGDYSFD---QNGVISNDGQTNTTATQTDEE 504 (506) T ss_pred HHHHhhcch---hhcCCCcccCccccccccccC Confidence 0011100 000001112222222233333 No 164 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=98.75 E-value=6.3e-08 Score=60.04 Aligned_cols=411 Identities=10% Similarity=0.016 Sum_probs=172.8 Q ss_pred CCch--------hHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCc-ccCCHHHHHHHHhcCchhhhhhccch Q lcl|NC_016762. 1 MTDK--------LDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFP-QEITFNDLYTMYRRGGIAHGAVEKIV 71 (456) Q Consensus 1 ~~~~--------~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~-~~~~~~~l~~~Y~~~~l~r~iVd~~a 71 (456) +-+| .++.-.|-.+-.. .+-+....-|- .+-.....-+. ....+.......-.+.+++.||+..+ T Consensus 20 ~~~~~~~~~~~i~~~i~~~~~~~~r---~~~~~~Yy~g~---~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~ivd~~~ 93 (478) T protein:vir:10 20 IKPKYETQEEMILRLVREHKENIDN---ITMGERYYNHH---PDILDAPFKRDVNGDYDETKPDWRMYTNYHQNLVDQKV 93 (478) T ss_pred hhhccCChHHHHHHHHHHHHHHHHH---HHHHHHHhccc---ccccccchhhhcccccccccccceeccchHHHHHHHHh Confidence 1111 0111111111111 11111111111 11000000000 00000001111124689999999999 Q ss_pred hHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccccccCCcCcee Q lcl|NC_016762. 72 TTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDRPARGKLNGLA 150 (456) Q Consensus 72 ed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~Pl~~~~~~l~ 150 (456) .=++.+.+++...++... +.|...++ -++...+.++.+....+|.+++++.++ |++.-...+.. . T Consensus 94 ~yl~g~p~~~~~~~~~~~--------~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~~~~~~p-----~ 159 (478) T protein:vir:10 94 AYAVANPVTFGVDNDKAL--------KQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFRVPA-----E 159 (478) T ss_pred hhhcccCceeecCChHHH--------HHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEEEcc-----c Confidence 999999999865432211 22444444 367788888889999999999988774 33321111100 0 Q ss_pred EEEEeccccCC----hh--hhhcc-ccccccCCc---eeEEEeecc--------cCCccc---cceeeehhhhheecC-- Q lcl|NC_016762. 151 KVTPAWAGCLK----PK--SFDEK-PDSETYGQP---TMWEYTEAS--------QAGRPG---LVRDIHPDRVFILGD-- 207 (456) Q Consensus 151 ~i~~~~~~~~~----~~--~~~~D-p~s~~yg~P---~~y~i~~~~--------~~g~~~---~~~~IH~SRli~~~~-- 207 (456) .+.|+|..... .. .|..+ -..-.++.| .+|...... ..+... ....-|.=-.+.+.. T Consensus 160 ~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~ 239 (478) T protein:vir:10 160 QAVPIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFK 239 (478) T ss_pred ceEEEEcCCCCCceEEEEEEEeeeCceEEEEEeCCcEEEEEecCCeeeccccccccccccceecccccccCCcceEEEec Confidence 11122211000 00 00000 000011111 112111000 000000 000112211122211 Q ss_pred CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeE Q lcl|NC_016762. 208 WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVL 287 (456) Q Consensus 208 ~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (456) ....|.|.++.+..-+.+++.+.-..+..+-..+...+.++ |.+.+...+ + ...+.. ...+ T Consensus 240 n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~--------------g~~~~~~~~-~---~~~~~~-~~~~ 300 (478) T protein:vir:10 240 NNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILK--------------GYEGEDMKD-F---MHNLKY-YKAI 300 (478) T ss_pred cCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeee--------------cCCcccccc-h---hhhhhh-Ccee Confidence 23458888888777777777665555443322222222111 111111111 1 111111 2333 Q ss_pred EecC--CCc--eeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----HHHHHHHHHHHhhhhH Q lcl|NC_016762. 288 LPTQ--GAT--VTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----YHNARCQARRVQELTF 358 (456) Q Consensus 288 lid~--~d~--~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----nyyd~I~~~Qe~~lrp 358 (456) .++. +.+ +-..+.+.+++...++.+.+.|...+++|-.-. +.. +| |.++. ++ .-...+. ..+..+++ T Consensus 301 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~~-~~-n~Sg~Ai~~~~~~l~~k~~-~~~~~~~~ 376 (478) T protein:vir:10 301 SVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQ-DKF-GN-SPSGIALKFMYSNLDLKAN-KLKNKTLT 376 (478) T ss_pred EecCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCc-ccc-cc-chHHHHHHHHHHHHHHHHH-HHHHHHHH Confidence 3332 234 444566778999999999999999999996422 211 11 33343 22 2333344 44456899 Q ss_pred HHHHHHHHHHHhcCcC-CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHHcCCcCcCHHHHHHHhcccCCCCCCC Q lcl|NC_016762. 359 EINDLFAHLMRIGVVP-LKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAIGTGEPVFTAEEIREEAGYDPLQGGDP 435 (456) Q Consensus 359 ~L~~l~~~l~~s~~~~-~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~~g~~~i~~~E~R~~~~~~~~~~~~~ 435 (456) .|++++.+++....+. ...+++|.|+|-..-+++|.|++..+.+.... +++..--.+-++++..+....+....... T Consensus 377 ~l~~~~~li~~~~~~~~d~~~i~i~f~~~~p~~~~e~~~~~~~~~g~iS~et~i~~~~~v~d~~~E~~ri~~E~~~~~~~ 456 (478) T protein:vir:10 377 ALQELLQYIIDFYRLDVRVQDIEITFNFNVMVNELENSQIAMNSTGLLSKETILGNHSWVQDPVAEMERIEQENIELNQQ 456 (478) T ss_pred HHHHHHHHHHHHhCCCcccccceEEeCCCCCCCHHHHHHHHHHHhCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHh Confidence 9999998887654333 23579999999999999999988766544322 12221101223433221111000000000 Q ss_pred CcccCC--CCCCCCCc-CCCCC Q lcl|NC_016762. 436 LPDTEP--EDEDAART-DPTGE 454 (456) Q Consensus 436 ~~~~~~--~d~~~~~~-d~~~~ 454 (456) .++..+ .+++.+++ |.++| T Consensus 457 ~~~~~~~~~d~~~~~~~d~~~e 478 (478) T protein:vir:10 457 LPDIEEGLNDEQQRQSEDNQSE 478 (478) T ss_pred ccccCCCCcccccccCcCCCCC Confidence 000000 11111111 11222 No 165 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=98.75 E-value=6.5e-08 Score=60.00 Aligned_cols=371 Identities=10% Similarity=-0.038 Sum_probs=175.9 Q ss_pred CCchhHH--HHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHh-cCchhhhhhccchhHHhhC Q lcl|NC_016762. 1 MTDKLDL--AVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYR-RGGIAHGAVEKIVTTCWKT 77 (456) Q Consensus 1 ~~~~~~~--~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~-~~~l~r~iVd~~aed~tR~ 77 (456) |..++=. ..-..-....-+.++.|.+ |..+.+ . .+.. -..++...|+ .-.++++|||.+++=+.=+ T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~-----g~~~~~---~--~~~~-~p~~~~~~~~~v~nw~~~iVds~a~rl~~~ 69 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHKRRAEMRYDQYA-----MKYVDR---F--KGIT-IPQALSQQYRSILGWCAKGVDSLADRLVFR 69 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHhc-----ccCchh---h--cChh-hhHHHHHHHhhhcchhHHHHHHhHhhcccC Confidence 6655421 1111111111122233322 110000 0 1111 1234544554 3367899999999877777 Q ss_pred CCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccc---ccc------CCcC Q lcl|NC_016762. 78 NPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDR---PAR------GKLN 147 (456) Q Consensus 78 ~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~---Pl~------~~~~ 147 (456) ||+. +| . .+.+.+.+-++-....++.+-+..||.|++++.-+ ||++.-. |.. ...+ T Consensus 70 Gf~~-----~d-~--------~l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~~i~~~sp~~~~~i~D~~~~ 135 (409) T protein:vir:94 70 EFEN-----DD-F--------TVNEIFEENNPDIFFDSAVLSSLIASCSFTYISKGENDAVRLQVIEAVNATGIIDPITG 135 (409) T ss_pred cccC-----Cc-h--------HHHHHHHhcChhHHHHHHHHHHHHhcceeEEEecCCCCceEEEEeccceEEEEEecCCC Confidence 7652 11 1 24455566666677778888889999998877653 3432211 111 0001 Q ss_pred ceeEEEEeccccCChhhhhccccccccCCce-eEEEeecccCCccccceeeehhhh---heecC----CcCCCcchH-HH Q lcl|NC_016762. 148 GLAKVTPAWAGCLKPKSFDEKPDSETYGQPT-MWEYTEASQAGRPGLVRDIHPDRV---FILGD----WTGDAIGFL-EP 218 (456) Q Consensus 148 ~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~-~y~i~~~~~~g~~~~~~~IH~SRl---i~~~~----~~~~G~S~l-e~ 218 (456) .+.....+|... ....+..-.++.|. .|++.. .++. ....-|+--. ++|.. ...+|.|.+ ++ T Consensus 136 ~~~~a~~~~~~d-----~~~~~~~~~~~~~~~~~~~~~--~~~~--~~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~ 206 (409) T protein:vir:94 136 LLTEGYAVLERD-----ENNNVVLEAHFLPDRTDYYYR--DSRN--NISIANPTGHPLLVPIIHRPDAVRPFGRSRITRS 206 (409) T ss_pred ceeeeEEEEEec-----CCCceEEEEEEecCcEEEEEe--cCce--eEeeeCCCCCcceEEeccccccccccCccccchh Confidence 111111111100 00001111222221 111110 0110 0111233332 23322 124688876 66 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCC---C-- Q lcl|NC_016762. 219 AYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQG---A-- 293 (456) Q Consensus 219 ~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~---d-- 293 (456) +..-..++.++.-...-...-.+..+..+. . + +..+.+. +.+...+.++ ..+.++ + T Consensus 207 v~~l~da~~r~~~~~~~~~e~~a~pqr~i~-----G---~-d~d~~~~----~~~~~~~~~i------~~~~~d~dg~~~ 267 (409) T protein:vir:94 207 GMYWQSNAKRTLERADVTAEFYSFPQKYVT-----G---L-SDDAEPM----ETWKATVSSM------LQFTKDEDGDKP 267 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcChhheeE-----e---c-CCCCccc----chhhhhHHHh------hcCCCCCCCCCc Confidence 665555555553221111111122222211 1 1 1111111 1221111111 112211 1 Q ss_pred ceeEE-ecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-----HHHHHHHHHHHHHhhhhHHHHHHHHHH Q lcl|NC_016762. 294 TVTQM-VSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-----QKYHNARCQARRVQELTFEINDLFAHL 367 (456) Q Consensus 294 ~~~~~-~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-----~~nyyd~I~~~Qe~~lrp~L~~l~~~l 367 (456) ++.++ ++++.++-+.+.....++|+.++||..-|-|.+ .. +++++ ...--..++.+|+ .+.+.+++++.+. T Consensus 268 ~v~q~~~~~l~~~~~~l~~~~~~~a~~t~lP~~~lg~~~-~N-psSa~Al~a~~~~L~~~a~~k~~-~fg~~~~~~~rla 344 (409) T protein:vir:94 268 TLGQFTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVS-DN-PSSVEAIKASHENLRLAGRKAQR-SLGAGLLNVAYLA 344 (409) T ss_pred eEEecCCCChhHHHHHHHHHHHHHhhhcCCCHHHhcccc-Cc-hhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH Confidence 23333 335666778888889999999999987655544 21 13333 2334455665554 5789999998875 Q ss_pred HHhcC--cCCCC---ceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCC Q lcl|NC_016762. 368 MRIGV--VPLKA---EFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQ 431 (456) Q Consensus 368 ~~s~~--~~~~~---d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~ 431 (456) +.... ...++ ++++.|.|+..++..+.|+ .|++..+++++|.+..+.+-+++.++++..+ T Consensus 345 ~~i~~~~~~~~~~~~~~~v~W~p~~~~~~~~~a~----~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 345 ACLRDDAPYLREQFRKTKPKWEPLFEADASMLSL----IGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred HHHhCCCCccccccccceEEeccCCCcchHHHHH----HHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 54322 22233 5688899988777665544 4688889999987666677888888876543 No 166 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=98.74 E-value=1.7e-08 Score=63.24 Aligned_cols=403 Identities=12% Similarity=0.063 Sum_probs=182.0 Q ss_pred CCchhHHHHhH------------------------HHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHH Q lcl|NC_016762. 1 MTDKLDLAVNH------------------------AMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTM 56 (456) Q Consensus 1 ~~~~~~~~~~~------------------------a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~ 56 (456) |.++++-.+.. -.-.++...+..|.+-.-.+.. ....+-+. ..- T Consensus 3 ~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~-----~~~~~~~~-------~~~ 70 (505) T protein:vir:79 3 FWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTH-----KNSYGDTQ-------KHE 70 (505) T ss_pred hHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccc-----cccCCCcc-------ccc Confidence 33333321110 0011111122222110000000 00000000 011 Q ss_pred HhcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCC Q lcl|NC_016762. 57 YRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQ 136 (456) Q Consensus 57 Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~ 136 (456) +.+-.+++.||+..|+=++-+-++|..+++.. .+.|.+.+++-+++..+.+++......||+++-+.+..++ T Consensus 71 ~~slnl~~~i~~~~A~ll~~e~~~i~~~d~~~--------~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D~~~ 142 (505) T protein:vir:79 71 LQSVNVTKLASAKLASLIFNEQCQVTVSDETA--------NDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVDSGK 142 (505) T ss_pred eeecchHHHHHHHHHhhhcCCCceeecCChHH--------HHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEeCCc Confidence 34457999999999999998888886543211 2347788888889999999999988888888877775443 Q ss_pred C-c-----c--ccccCCcCceeEEEEeccccCChhhhhccccccccC-Cce-------eEEEeeccc---C----Cccc- Q lcl|NC_016762. 137 P-W-----D--RPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYG-QPT-------MWEYTEASQ---A----GRPG- 192 (456) Q Consensus 137 ~-~-----~--~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg-~P~-------~y~i~~~~~---~----g~~~- 192 (456) . + + -|+.-..+++..+..+-.+.. .+.....|+ .-+ .|+|.-... + |.+. T Consensus 143 ~~i~~v~ad~~~P~~~d~~~~~~~a~~~~~~~------~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~ 216 (505) T protein:vir:79 143 IKLAWATADQVYPLQADTNQVNELAIASRTTE------VENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVP 216 (505) T ss_pred eEEEEEcCCeeEEEEEcCCCeEEEEEEEEEEE------ecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccc Confidence 2 1 1 133211122222211110000 000000111 111 122221100 0 0000 Q ss_pred -----------cceeee-hhhh--hee-----c---CCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhh Q lcl|NC_016762. 193 -----------LVRDIH-PDRV--FIL-----G---DWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFD 250 (456) Q Consensus 193 -----------~~~~IH-~SRl--i~~-----~---~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~ 250 (456) ....++ -.|. .+| + ..++.|+|++..+.+.+..++.+...+..-+ +. .+..+.. T Consensus 217 l~~~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~-~~--g~~~i~v- 292 (505) T protein:vir:79 217 LNSLEQYEGLEPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEV-KK--GQRRLIV- 292 (505) T ss_pred hhhcccccccCcceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHH-Hh--cccceee- Confidence 001111 1121 111 0 1235699999999988888887765544322 11 1111110 Q ss_pred hhccHhhHHhhhc--CCHH-HH-HHHHHHHHHHHhcCCCeEEecC-CCceeEEeccc--CCHHHHHHHHHHHHHhhhcCC Q lcl|NC_016762. 251 KEINLGEIASTYG--VTLD-AL-NERFNEAARQLNRGNDVLLPTQ-GATVTQMVSAV--SDPGPTYNVNLQTAAAGVDIP 323 (456) Q Consensus 251 ~~~~~~~l~~~~~--~~~~-~~-~~~~~~~~~~~~~~~~~~lid~-~d~~~~~~~~~--sgl~~~~~~~~~~~aaas~IP 323 (456) -.++..... .+.. .. ...+...-+. +..+..+. +..++.++..+ ....+.++.++++++..+|++ T Consensus 293 ----~~~~l~~~~~~~~~~~~~~~~~fd~~~~~----y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s 364 (505) T protein:vir:79 293 ----PAEWLKTGSSYGGQASETHPPMFDPDETV----YQAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLS 364 (505) T ss_pred ----chHHhcccCCCCcccccccccCCCcccee----eeeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCC Confidence 011111100 0000 00 0001000000 00111122 23477776655 456777888889999999998 Q ss_pred eEEeeccCCCccc-chH---HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhc-------------Cc-CCCCceEEEeCC Q lcl|NC_016762. 324 TKILVGMQTGERA-SSE---DQKYHNARCQARRVQELTFEINDLFAHLMRIG-------------VV-PLKAEFTAIWDD 385 (456) Q Consensus 324 ~t~L~G~sp~Gln-st~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~-------------~~-~~~~d~~~~f~p 385 (456) -. -||....|.. |++ ..+.-|.++..+|. .++..|+.|+..++... .+ .++.+++|.|++ T Consensus 365 ~~-~~~~~~~~~~TAtei~s~~~~l~~t~~~~~~-~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d 442 (505) T protein:vir:79 365 QG-TFTTSPSGIQTATEVVTNNSQTYQTRSSYIT-QVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFND 442 (505) T ss_pred hh-hcCCCccccchHHHHHHHHhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCC Confidence 75 5666666653 443 45567888888776 57888998888775421 11 223489999999 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCC---Cccc--CCCCCCCCCcCCCCC Q lcl|NC_016762. 386 LTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDP---LPDT--EPEDEDAARTDPTGE 454 (456) Q Consensus 386 L~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~---~~~~--~~~d~~~~~~d~~~~ 454 (456) --..++.+.++... .++.+| +++..+++... .+..+.+- .+.+ |.+...++..+-++| T Consensus 443 ~i~~d~~~~~~~~~-------~~v~~G--i~s~e~~l~~~--~~~~eeea~~el~ri~~E~~~~~p~~~~~gg~ 505 (505) T protein:vir:79 443 GVFVDQESKRAADL-------QAVQAQ--VMPKKQFLMRN--YGLDEEEADEWLAQIDAENSTAEPEFNQFGGD 505 (505) T ss_pred CCCCCHHHHHHHHH-------HHHHcC--CCCHHHHHHhc--CCCChHHHHHHHHHHHHhccccCCCchhccCC Confidence 99888777655433 333445 45554443321 11111000 0000 001112222334455 No 167 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=98.73 E-value=7.5e-08 Score=59.65 Aligned_cols=387 Identities=10% Similarity=-0.003 Sum_probs=188.2 Q ss_pred CCchh--------HH-HHhHHHHHHHHHHHHHHhhhhhccCcccc--hhhhhccCcccCCHHHHHHHHhcCchhhhhhcc Q lcl|NC_016762. 1 MTDKL--------DL-AVNHAMSSAIARARMSLLNQGIGHDAKRP--QAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEK 69 (456) Q Consensus 1 ~~~~~--------~~-~~~~a~~~~~~~~~d~~~n~~~~~gt~~~--~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~ 69 (456) |+.++ +. +...++...++.++ ...+..+++.|-.+ ......+ .+.+.++.|. ++.-+..++++ T Consensus 1 m~~~i~~~~g~p~~~~~~~~~~~~~ia~~~-~~~~~~~~~~~~~~~~~iLr~~~----~~~~~y~~m~-~D~~i~s~l~~ 74 (491) T protein:vir:10 1 MSKGLWVSPTEFVTFGEPDKSLSSQIATRA-RSIDFFALGMYLPNPDPVLKALG----KDIRVYRELR-ADAHVGGCVRR 74 (491) T ss_pred CCCceeCCCCCccCcccCChHHHHHHHhhh-cccccccccCCccchHHHHHhcC----CCHHHHHHHh-hChHHHHHHHH Confidence 33211 00 00112223333322 33344333333221 1222222 2445556665 57777888888 Q ss_pred chhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccccccCCcCc Q lcl|NC_016762. 70 IVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDRPARGKLNG 148 (456) Q Consensus 70 ~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~Pl~~~~~~ 148 (456) .-.-.+..-|+|..+.+++ ...+.+.+.++++.+-..+.+.+ .+.+||+|++=+.-. ++ +.-. T Consensus 75 Rk~av~~~~w~i~~~~~~~------~~~e~v~e~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~---------g~~~ 138 (491) T protein:vir:10 75 RKAAVKALEWGLDRGKAKS------RVAKSIADVFADLDLSRIVTEML-DAVLYGYQPMEITWGKVG---------NYIV 138 (491) T ss_pred HHHHHhCCCcEEecCCCCH------HHHHHHHHHHhcCCHHHHHHHHH-HhhhhcceeEEEEEeecC---------CeeE Confidence 8777776667886543322 12245777788887777776665 688999998755432 11 1111 Q ss_pred eeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheec----CCcCCCcchHHHHHHHHH Q lcl|NC_016762. 149 LAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILG----DWTGDAIGFLEPAYNSFI 224 (456) Q Consensus 149 l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~----~~~~~G~S~le~~~~~l~ 224 (456) +..+.++.+..+. .|+.. .. .|.-. ++ ...+..+.+-+.+.+. ....+|.+++..||...+ T Consensus 139 ~~~l~~r~~~~f~-----~d~~~----~l-~~~~~----~~-~~~g~~l~~~k~i~~~~~~~~~~p~g~gLl~~~~w~~~ 203 (491) T protein:vir:10 139 PIDVVGKPADWFV-----YDPEN----QL-RFRSK----DH-WMQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTT 203 (491) T ss_pred EEEeeeeccccee-----eccCC----ce-EEecC----CC-CCCcceecCCCEEEEEecCCCCCcccchhHHHHHHHHH Confidence 2333332221111 12211 11 11111 11 1123445555554442 345789999999987554 Q ss_pred HHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEecccC- Q lcl|NC_016762. 225 SLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSAVS- 303 (456) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~~s- 303 (456) --.....-++.-.-+........++. ....++-.+++.+++..|.+ ....++..+.+++.++.+-+ T Consensus 204 fK~~~~~~w~~f~E~yG~P~~igky~------------~~a~~~ek~~l~~al~~~~~-~a~~viP~~~~ie~~ea~~~~ 270 (491) T protein:vir:10 204 FKKGGLKFWVQFTEKYGSPMLVGKHP------------RSASDGEKNLLLDCLEDMVQ-DAVAVVPDDSSIEIKEAAGKT 270 (491) T ss_pred HHHHHHHHHHHHHHHcCCCeEEEecC------------CCCCHHHHHHHHHHHHHHhc-CcEEEecCCceeEEEecCCCC Confidence 32222222222111111111111110 01123344556666666654 36677888889998877542 Q ss_pred C----HHHHHHHHHHHHHhhhcCCeEEeeccC-----CCcccchHH--HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcC Q lcl|NC_016762. 304 D----PGPTYNVNLQTAAAGVDIPTKILVGMQ-----TGERASSED--QKYHNARCQARRVQELTFEINDLFAHLMRIGV 372 (456) Q Consensus 304 g----l~~~~~~~~~~~aaas~IP~t~L~G~s-----p~Glnst~D--~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~ 372 (456) | ...+++..-.+||-+ ++||+ .|+. +.++ .....+.+++.. ..+...|++|+.-|+...+ T Consensus 271 g~~~~y~~li~~~d~~Isk~-------iLGqtlTt~~~gs~-a~~~vh~~v~~di~~~D~-~~i~~tln~li~~l~~~N~ 341 (491) T protein:vir:10 271 GSADVYERLLHFCRGEVSIA-------LLGQNQTTEATSTR-ASAQAGLEVTDDIRDGDK-AVVSEAMNMLIRWICDLNF 341 (491) T ss_pred CChhHHHHHHHHHHHHHHHH-------HhhhhcccCcccch-hHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcC Confidence 2 345566666666654 33432 2222 2222 234555555554 3567778888888888777 Q ss_pred cCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCC-cccCCCC---CCCCC Q lcl|NC_016762. 373 VPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPL-PDTEPED---EDAAR 448 (456) Q Consensus 373 ~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~-~~~~~~d---~~~~~ 448 (456) ++.+ ...|.|...- +..++.|+++.++++.|. .++.+++|+..+.+........ +...+.. ....+ T Consensus 342 ~~~~-~p~f~~~~~~--------e~~~~~a~~~~~L~~~G~-~i~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~ 411 (491) T protein:vir:10 342 DGAD-RPVFDMWEQE--------QVDEIQAGRDQKLTQAGA-RFTPAYFKRAYNLQDGDLDERPLPVSAVDTVGAASFAE 411 (491) T ss_pred CCCC-cceEEecCcC--------chhHHHHHHHHHHHhCCC-cCCHHHHHHHhCCCCCCcCccccccCCCCCcccccccc Confidence 7533 3456654321 334567888889999885 4788999998887543322211 1110000 00011 Q ss_pred cCCCCCCC Q lcl|NC_016762. 449 TDPTGEQQ 456 (456) Q Consensus 449 ~d~~~~~e 456 (456) .....+++ T Consensus 412 ~~~~~~~~ 419 (491) T protein:vir:10 412 FEAPDQDA 419 (491) T ss_pred cCCCCCCc Confidence 11111111 No 168 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=98.73 E-value=9.1e-09 Score=64.67 Aligned_cols=397 Identities=11% Similarity=0.047 Sum_probs=183.8 Q ss_pred CCchhHH-----------------------HHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHH Q lcl|NC_016762. 1 MTDKLDL-----------------------AVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMY 57 (456) Q Consensus 1 ~~~~~~~-----------------------~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y 57 (456) |.+|++- +++--.-.++.+.+..|.+-.-.+ +.+.. .+.. -..-+ T Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~-----~~~~~--~~~~-----~~~~~ 70 (500) T protein:vir:98 3 VIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSV-----LYLNT--DGET-----KKRDL 70 (500) T ss_pred hHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCc-----ccccC--CCCc-----ccCce Confidence 4444433 222111122222223332110000 00000 0000 01113 Q ss_pred hcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCC Q lcl|NC_016762. 58 RRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQP 137 (456) Q Consensus 58 ~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~ 137 (456) .+-.+++.||+..|+=++-+-++|..+++. ..+.|++.++..+++..+.+++......|++++-+.+..++. T Consensus 71 ~slnl~~~i~~~~A~lv~~e~~~i~~~d~~--------~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~ 142 (500) T protein:vir:98 71 NHLPIARTAAKKIASLVFNEQAEIKVDDDA--------ANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDGDKV 142 (500) T ss_pred eecchHHHHHHHHhhhhcCCcceEecCChH--------HHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCce Confidence 344899999999999999988888654321 123578888888899999999999888888888777654332 Q ss_pred c------c--ccccCCcCceeEEEEeccccCChhhhhccccccccC-Cce--------eEEEeeccc-------CCcccc Q lcl|NC_016762. 138 W------D--RPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYG-QPT--------MWEYTEASQ-------AGRPGL 193 (456) Q Consensus 138 ~------~--~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg-~P~--------~y~i~~~~~-------~g~~~~ 193 (456) . + -|+.-...++..+..++...-+ .+ ....|+ .-+ .|.|.-... -|.... T Consensus 143 ~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~---~~---~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~ 216 (500) T protein:vir:98 143 RVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKT---IN---GKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVP 216 (500) T ss_pred EEEEEcCCeeEEEEEcCCCeEEEEEEEEEeee---ec---CCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccc Confidence 1 1 1332222233222222111000 00 000110 000 122211000 011000 Q ss_pred ceeeeh-----------hhh--heec--------CCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhh Q lcl|NC_016762. 194 VRDIHP-----------DRV--FILG--------DWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKE 252 (456) Q Consensus 194 ~~~IH~-----------SRl--i~~~--------~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 252 (456) -..+|. .|. .+|. ...+.|+|++..+.+-+..++.+...+..-+ +..-+.+ T Consensus 217 l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~-~~g~~~i------- 288 (500) T protein:vir:98 217 LSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEV-KMGQRRV------- 288 (500) T ss_pred cccccCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHH-HhCccee------- Confidence 000111 111 1110 1235699999999998888888765554322 1111111 Q ss_pred ccHhhHHhhh--cCCHHHHHHHHHHHHHHHhcCCCeE-Eec----CCCceeEEeccc--CCHHHHHHHHHHHHHhhhcCC Q lcl|NC_016762. 253 INLGEIASTY--GVTLDALNERFNEAARQLNRGNDVL-LPT----QGATVTQMVSAV--SDPGPTYNVNLQTAAAGVDIP 323 (456) Q Consensus 253 ~~~~~l~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~-lid----~~d~~~~~~~~~--sgl~~~~~~~~~~~aaas~IP 323 (456) +-..++.... +.+.+.+.... +....... .++ .+..++.++..+ ......++.+++.++..+|++ T Consensus 289 ~v~~~~l~~~~~~~~g~~~~~~~------~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls 362 (500) T protein:vir:98 289 AVPESLTALTVRTTDGDVVPRPR------FESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVS 362 (500) T ss_pred eechHHhcccCCCCCccccCCcc------cCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCC Confidence 0001111100 00000000000 00000110 111 122366665554 456778888889999999988 Q ss_pred eEEeeccCCCcc-cchH---HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhc--------CcCCCCceEEEeCCCCCCCH Q lcl|NC_016762. 324 TKILVGMQTGER-ASSE---DQKYHNARCQARRVQELTFEINDLFAHLMRIG--------VVPLKAEFTAIWDDLTVPTK 391 (456) Q Consensus 324 ~t~L~G~sp~Gl-nst~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~--------~~~~~~d~~~~f~pL~~~se 391 (456) -.. ||-..+|. +||+ ..+.-|.++..+|. .++..|+.|+..++... ..+...+++|.|++--..++ T Consensus 363 ~~~-~~~~~~g~~TAtei~s~~~~~~~t~~~~~~-~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~ 440 (500) T protein:vir:98 363 AGL-FSFDGKSMKTATEIVSENSDTYQMRNSIVA-LVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDR 440 (500) T ss_pred ccc-cccCcCccccHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCH Confidence 763 45444454 4553 45567888888886 57999999998775431 22344579999998877777 Q ss_pred HHHHHHHHHHHHHHHHHHHcCCcCcCHHH------------HHHHhcccCCCCCCCCcccCCCCCCCCCcCCCCC Q lcl|NC_016762. 392 AERLANSKTMSEINSAAIGTGEPVFTAEE------------IREEAGYDPLQGGDPLPDTEPEDEDAARTDPTGE 454 (456) Q Consensus 392 ke~Aei~~~~A~a~~~~~~~g~~~i~~~E------------~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~ 454 (456) .+.++... .++.+| +++..+ +++.+..... +..+..+...+..|+.|| T Consensus 441 ~~~~~~~~-------~~v~aG--i~s~~~~i~~~~g~~eeea~~~l~~i~~------E~~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:98 441 DAELDYWI-------KVVNAG--FGTREMAIQKVLNVTEEKAQEIAAEINT------GIVDEINQQRTDTHLYGE 500 (500) T ss_pred HHHHHHHH-------HHHHcC--CCCHHHHHHhcCCCCHHHHHHHHHHHHH------hccccCCCCCccccccCC Confidence 66555433 334444 444444 3332211000 000111122233455555 No 169 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=98.73 E-value=9.1e-09 Score=64.67 Aligned_cols=397 Identities=11% Similarity=0.047 Sum_probs=183.8 Q ss_pred CCchhHH-----------------------HHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHH Q lcl|NC_016762. 1 MTDKLDL-----------------------AVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMY 57 (456) Q Consensus 1 ~~~~~~~-----------------------~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y 57 (456) |.+|++- +++--.-.++.+.+..|.+-.-.+ +.+.. .+.. -..-+ T Consensus 3 ~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~-----~~~~~--~~~~-----~~~~~ 70 (500) T protein:vir:30 3 VIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSV-----LYLNT--DGET-----KKRDL 70 (500) T ss_pred hHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCc-----ccccC--CCCc-----ccCce Confidence 4444433 222111122222223332110000 00000 0000 01113 Q ss_pred hcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCC Q lcl|NC_016762. 58 RRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQP 137 (456) Q Consensus 58 ~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~ 137 (456) .+-.+++.||+..|+=++-+-++|..+++. ..+.|++.++..+++..+.+++......|++++-+.+..++. T Consensus 71 ~slnl~~~i~~~~A~lv~~e~~~i~~~d~~--------~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~ 142 (500) T protein:vir:30 71 NHLPIARTAAKKIASLVFNEQAEIKVDDDA--------ANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDGDKV 142 (500) T ss_pred eecchHHHHHHHHhhhhcCCcceEecCChH--------HHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCce Confidence 344899999999999999988888654321 123578888888899999999999888888888777654332 Q ss_pred c------c--ccccCCcCceeEEEEeccccCChhhhhccccccccC-Cce--------eEEEeeccc-------CCcccc Q lcl|NC_016762. 138 W------D--RPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYG-QPT--------MWEYTEASQ-------AGRPGL 193 (456) Q Consensus 138 ~------~--~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg-~P~--------~y~i~~~~~-------~g~~~~ 193 (456) . + -|+.-...++..+..++...-+ .+ ....|+ .-+ .|.|.-... -|.... T Consensus 143 ~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~---~~---~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~ 216 (500) T protein:vir:30 143 RVAFVQAPVFLPLQSNTQDVSSAAVVIKSVKT---IN---GKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVP 216 (500) T ss_pred EEEEEcCCeeEEEEEcCCCeEEEEEEEEEeee---ec---CCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccc Confidence 1 1 1332222233222222111000 00 000110 000 122211000 011000 Q ss_pred ceeeeh-----------hhh--heec--------CCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhh Q lcl|NC_016762. 194 VRDIHP-----------DRV--FILG--------DWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKE 252 (456) Q Consensus 194 ~~~IH~-----------SRl--i~~~--------~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 252 (456) -..+|. .|. .+|. ...+.|+|++..+.+-+..++.+...+..-+ +..-+.+ T Consensus 217 l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~-~~g~~~i------- 288 (500) T protein:vir:30 217 LSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEV-KMGQRRV------- 288 (500) T ss_pred cccccCCcCcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHH-HhCccee------- Confidence 000111 111 1110 1235699999999998888888765554322 1111111 Q ss_pred ccHhhHHhhh--cCCHHHHHHHHHHHHHHHhcCCCeE-Eec----CCCceeEEeccc--CCHHHHHHHHHHHHHhhhcCC Q lcl|NC_016762. 253 INLGEIASTY--GVTLDALNERFNEAARQLNRGNDVL-LPT----QGATVTQMVSAV--SDPGPTYNVNLQTAAAGVDIP 323 (456) Q Consensus 253 ~~~~~l~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~-lid----~~d~~~~~~~~~--sgl~~~~~~~~~~~aaas~IP 323 (456) +-..++.... +.+.+.+.... +....... .++ .+..++.++..+ ......++.+++.++..+|++ T Consensus 289 ~v~~~~l~~~~~~~~g~~~~~~~------~d~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls 362 (500) T protein:vir:30 289 AVPESLTALTVRTTDGDVVPRPR------FESDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVS 362 (500) T ss_pred eechHHhcccCCCCCccccCCcc------cCCCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCC Confidence 0001111100 00000000000 00000110 111 122366665554 456778888889999999988 Q ss_pred eEEeeccCCCcc-cchH---HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhc--------CcCCCCceEEEeCCCCCCCH Q lcl|NC_016762. 324 TKILVGMQTGER-ASSE---DQKYHNARCQARRVQELTFEINDLFAHLMRIG--------VVPLKAEFTAIWDDLTVPTK 391 (456) Q Consensus 324 ~t~L~G~sp~Gl-nst~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~--------~~~~~~d~~~~f~pL~~~se 391 (456) -.. ||-..+|. +||+ ..+.-|.++..+|. .++..|+.|+..++... ..+...+++|.|++--..++ T Consensus 363 ~~~-~~~~~~g~~TAtei~s~~~~~~~t~~~~~~-~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~ 440 (500) T protein:vir:30 363 AGL-FSFDGKSMKTATEIVSENSDTYQMRNSIVA-LVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDR 440 (500) T ss_pred ccc-cccCcCccccHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCH Confidence 763 45444454 4553 45567888888886 57999999998775431 22344579999998877777 Q ss_pred HHHHHHHHHHHHHHHHHHHcCCcCcCHHH------------HHHHhcccCCCCCCCCcccCCCCCCCCCcCCCCC Q lcl|NC_016762. 392 AERLANSKTMSEINSAAIGTGEPVFTAEE------------IREEAGYDPLQGGDPLPDTEPEDEDAARTDPTGE 454 (456) Q Consensus 392 ke~Aei~~~~A~a~~~~~~~g~~~i~~~E------------~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~ 454 (456) .+.++... .++.+| +++..+ +++.+..... +..+..+...+..|+.|| T Consensus 441 ~~~~~~~~-------~~v~aG--i~s~~~~i~~~~g~~eeea~~~l~~i~~------E~~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:30 441 DAELDYWI-------KVVNAG--FGTREMAIQKVLNVTEEKAQEIAAEINT------GIVDEINQQRTDTHLYGE 500 (500) T ss_pred HHHHHHHH-------HHHHcC--CCCHHHHHHhcCCCCHHHHHHHHHHHHH------hccccCCCCCccccccCC Confidence 66555433 334444 444444 3332211000 000111122233455555 No 170 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=98.70 E-value=1e-07 Score=58.93 Aligned_cols=389 Identities=10% Similarity=-0.006 Sum_probs=187.3 Q ss_pred CCchh------H---HHHhHHHHHHHHHHHHHHhhhhh-ccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccc Q lcl|NC_016762. 1 MTDKL------D---LAVNHAMSSAIARARMSLLNQGI-GHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKI 70 (456) Q Consensus 1 ~~~~~------~---~~~~~a~~~~~~~~~d~~~n~~~-~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~ 70 (456) |+.++ . -+...++...++..+..+.+... ++-..++......+ .+.+-++.|. ++.-+..++++. T Consensus 1 ~~~~i~~~~g~~~~~~~~~~~~~~~ia~~~~~~~~~~~~~~~p~~~~il~~~~----~~~~~y~~m~-~D~~i~s~l~~R 75 (491) T protein:vir:79 1 MSKGLWVSPTEFVKFGEPDKSLSSQIATRARSIDFFALGMYLPNPDPVLKALG----KDIRVYRELR-ADAHVGGCVRRR 75 (491) T ss_pred CCCeeeCCCCCcccccccchhHHHHHhhhccccccccccccCcchhHHHhhcc----CCHHHHHHHh-hChHHHHHHHHH Confidence 43321 0 01123444556666666655433 22222333222221 2344555555 577777777877 Q ss_pred hhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccccccCCcCce Q lcl|NC_016762. 71 VTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDRPARGKLNGL 149 (456) Q Consensus 71 aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~Pl~~~~~~l 149 (456) -.-.+..-|+|..+..++ ...+.+++.++++.+...+.+.+ .+.+||.|++=|.-. ++.. -.+ T Consensus 76 k~av~~~~w~i~~~~~~~------~~a~~i~e~l~~~~~~~~i~~~l-da~~~G~s~~Ei~w~~~~g~---------~~~ 139 (491) T protein:vir:79 76 KAAVKALEWGLDRGKAKS------RVAKSIADVFADLDLSRIATEML-DAVLYGYQPMEITWGKVGNY---------IVP 139 (491) T ss_pred HHHHhCCCcEEecCCCCH------HHHHHHHHHHhcCCHHHHHHHHH-HhhhhcceeEEEEEeecCCe---------eeE Confidence 777776666775443322 12345777888887666665554 588999998765331 2111 112 Q ss_pred eEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheec----CCcCCCcchHHHHHHHHHH Q lcl|NC_016762. 150 AKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILG----DWTGDAIGFLEPAYNSFIS 225 (456) Q Consensus 150 ~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~----~~~~~G~S~le~~~~~l~~ 225 (456) ..|.++.+..+. .|+. +... |.... ....+..+.+-+.+.+. ....+|.+++..||...+- T Consensus 140 ~~l~~r~~~~f~-----~d~~----~~l~-l~~~~-----~~~~g~~lp~~k~i~~~~~~~~g~p~g~gLl~~~~w~~~f 204 (491) T protein:vir:79 140 IDVVGKPADWFV-----YDPE----NQLR-FRSKE-----HWVQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTF 204 (491) T ss_pred Eeeeeeccccee-----eccC----CceE-EeecC-----CCCCceeecCCCeEEEEecCCCCCcccchhHHHHHHHHHH Confidence 223322221111 1221 1111 11110 01113344444444432 3457799999999875432 Q ss_pred HHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEecc-cCC Q lcl|NC_016762. 226 LEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSA-VSD 304 (456) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~-~sg 304 (456) -.....-++.-+-+........++. ....++-.+.+.+++..|.+ ....+|..+.+++-++.+ -+| T Consensus 205 K~~~~~~w~~f~E~~G~P~~igky~------------~~a~~~ek~~l~~al~~~~~-~a~~viP~~~~ie~~ea~~~~g 271 (491) T protein:vir:79 205 KKGGLKFWVQFTEKYGSPMLVGKHP------------RSASDAETNLLLDRLEDMVQ-DAVAVIPDDSSIEIKEAAGKSG 271 (491) T ss_pred HHhhHHHHHHHHHHcCCCeEEEecC------------CCCCHHHHHHHHHHHHHHhc-CeEEEecCCceeEEEeccCCCC Confidence 2222222221111111110101100 01123344555566666644 356778888889988775 344 Q ss_pred ----HHHHHHHHHHHHHhhhcCCeEEeeccC----CCcccchHH--HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC Q lcl|NC_016762. 305 ----PGPTYNVNLQTAAAGVDIPTKILVGMQ----TGERASSED--QKYHNARCQARRVQELTFEINDLFAHLMRIGVVP 374 (456) Q Consensus 305 ----l~~~~~~~~~~~aaas~IP~t~L~G~s----p~Glnst~D--~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~ 374 (456) ...+++..-.+||-+. +||+ .+|-.+.++ .....+.+.+... .+...|++|+.-|+...+++ T Consensus 272 ~~~~y~~li~~~d~~Isk~i-------LGqtlTt~~~gs~a~~~vh~~v~~~i~~~D~~-~i~~tln~li~~l~~~N~~~ 343 (491) T protein:vir:79 272 SADVYERLLHFCRGEVSIAL-------LGQNQTTEATSTRASAQAGLEVTDDIRDGDKA-IVVEAMNMLIRWICDLNFDG 343 (491) T ss_pred ChhHHHHHHHHHHHHHHHHH-------hhhhhccCcccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCC Confidence 3445555556666543 4442 122222232 3345566665544 56677788888887777764 Q ss_pred CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCC-CcccCCCC---CCCCCcC Q lcl|NC_016762. 375 LKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDP-LPDTEPED---EDAARTD 450 (456) Q Consensus 375 ~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~-~~~~~~~d---~~~~~~d 450 (456) .+ .+.|.|... | ++.+..|++++.+++.|. .++.+++|+..+.+.....+. .+...+.. ....+.. T Consensus 344 ~~-~p~f~~~e~------e--e~~~~~a~~~~~L~~~G~-~i~~~~~~e~~Gip~~~~~e~~~~~~~~~~~~~~~~~~~~ 413 (491) T protein:vir:79 344 AA-RPVFDMWEQ------E--QVDEIQAGRDEKLTRAGA-RFTPAYFKRAYNLQDGDLDERPLPVSAVDAVGAASFAEFE 413 (491) T ss_pred CC-cceEeecCc------C--chhHHHHHHHHHHHhCCC-ccCHHHHHHHhCCCCCCCCccccCcCcccccccccccccC Confidence 33 344554331 2 344567888899999985 489999999887754332211 11000000 0000111 Q ss_pred CCCCCC Q lcl|NC_016762. 451 PTGEQQ 456 (456) Q Consensus 451 ~~~~~e 456 (456) ...++. T Consensus 414 ~~~~~~ 419 (491) T protein:vir:79 414 APDQDA 419 (491) T ss_pred CCCCcc Confidence 111111 No 171 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=98.64 E-value=1.5e-07 Score=57.97 Aligned_cols=413 Identities=12% Similarity=0.010 Sum_probs=176.4 Q ss_pred CCch------------hHHHHh-HHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhh Q lcl|NC_016762. 1 MTDK------------LDLAVN-HAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAV 67 (456) Q Consensus 1 ~~~~------------~~~~~~-~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iV 67 (456) |-++ +.-..+ |-.+...-+....|.. |- .+-.. +...........-.+.+++.|| T Consensus 5 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~---g~---~~i~~------~~~~~~~~~~~ki~~n~~~~Iv 72 (499) T protein:vir:10 5 IDKDLLDDVNEPNIEAINYAIRELQNRKKRLDKLSDYYN---GK---QEIEK------HEFDNATVEAANVMVNHAKYIT 72 (499) T ss_pred hhhhHHhhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhc---cc---cchhc------CCcCcCCCCcceeecchHHHHH Confidence 1111 111111 1111111111112211 10 00000 0000000111122467899999 Q ss_pred ccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcccccc--- Q lcl|NC_016762. 68 EKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDRPAR--- 143 (456) Q Consensus 68 d~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~Pl~--- 143 (456) +..+.=++.+.+.+...++.+. ..+...+++.++...+.++.+....||.+++++.++ +|..+..+.. T Consensus 73 ~~~~~~l~g~p~~~~~~~~~~~--------~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~ 144 (499) T protein:vir:10 73 DMNVGFMTGNPVKYVAEKGKNI--------DDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNE 144 (499) T ss_pred HHHhhhhcccCceeecCChhHH--------HHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEeccccccccccccccc Confidence 9999989989888865433221 236667777788889999999999999999988775 3333221110 Q ss_pred --C--CcCceeEEEE-----eccccCCh-----hhhhccc--------cccccCCcee-EEEeeccc----CCcccccee Q lcl|NC_016762. 144 --G--KLNGLAKVTP-----AWAGCLKP-----KSFDEKP--------DSETYGQPTM-WEYTEASQ----AGRPGLVRD 196 (456) Q Consensus 144 --~--~~~~l~~i~~-----~~~~~~~~-----~~~~~Dp--------~s~~yg~P~~-y~i~~~~~----~g~~~~~~~ 196 (456) . ..-.+..+.| +|...... ..+.... .....+.|.. |.+..... ++....... T Consensus 145 ~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~ 224 (499) T protein:vir:10 145 KLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDPIVYDG 224 (499) T ss_pred ccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCcceecccc Confidence 0 0011222222 22111100 0000000 0001111111 01100000 000000111 Q ss_pred eehhhhheecC--CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHH Q lcl|NC_016762. 197 IHPDRVFILGD--WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFN 274 (456) Q Consensus 197 IH~SRli~~~~--~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~ 274 (456) -|.--.+.+.. .+.+|.|.++.+..-+..++.+.-..+..+-..+...+.+. |.......... T Consensus 225 ~~~~g~vPvv~~~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~--------------G~~~~~~~~~~- 289 (499) T protein:vir:10 225 ENLFGAVPIIEFRNNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTF--------------GFGLGDDKDDI- 289 (499) T ss_pred cCCCCccceEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeee--------------cCccccccchh- Confidence 23322222222 23457888887766565666654444433322222222111 00000000000 Q ss_pred HHHHHHhcCCCeEEec--CCCceeEE--ecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----HHH Q lcl|NC_016762. 275 EAARQLNRGNDVLLPT--QGATVTQM--VSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----YHN 345 (456) Q Consensus 275 ~~~~~~~~~~~~~lid--~~d~~~~~--~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----nyy 345 (456) .. ++. ....+++ .+.+++.+ +.+.+++...++.+.+.|...+++|-.- ++.-.| |.++. ++ .-. T Consensus 290 ~~---~~~-~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~-~~~~~g--n~Sg~Al~~~~~~l~ 362 (499) T protein:vir:10 290 QR---LKR-GAIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMN-DEKFMG--NVSGEAMKFKLFGLE 362 (499) T ss_pred hh---hhh-cceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCC-chhhcc--cchHHHHHHHHHHHH Confidence 00 111 1122222 23345544 4566799999999999999999999631 222112 22332 22 223 Q ss_pred HHHHHHHHhhhhHHHHHHHHHHHHhc-C-c--CCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHHcCCcCcCHH Q lcl|NC_016762. 346 ARCQARRVQELTFEINDLFAHLMRIG-V-V--PLKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAIGTGEPVFTAE 419 (456) Q Consensus 346 d~I~~~Qe~~lrp~L~~l~~~l~~s~-~-~--~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~~g~~~i~~~ 419 (456) ..+. ..+..+++.+++++.+++... . + ....++++.|+|=..-+++|.|++..+.+.+.. +++..--.+-+++ T Consensus 363 ~k~~-~k~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~ 441 (499) T protein:vir:10 363 NLLS-IKQRYFFDGLRRRLKLIQTIVNIKGANDDASGCKISLVANIPSNLSDVVNNVKNADGIIPRKYTYSWLPDVDNPQ 441 (499) T ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHH Confidence 3344 445678999999999887542 1 2 123478999999999999999999887654322 2222111122333 Q ss_pred -HHHHHhc---------ccCCCCCCCCccc--CC-CCCCCCCcCCCCCCC Q lcl|NC_016762. 420 -EIREEAG---------YDPLQGGDPLPDT--EP-EDEDAARTDPTGEQQ 456 (456) Q Consensus 420 -E~R~~~~---------~~~~~~~~~~~~~--~~-~d~~~~~~d~~~~~e 456 (456) |+..... ..+.....+.... +. .+.+++.+.++++.. T Consensus 442 ~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (499) T protein:vir:10 442 DVIDEMNQQDAETIKKNQEALRGQDPDRLELEDKQDDSSENDKEAGSNHN 491 (499) T ss_pred HHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCCCcccCCCCCCCccccc Confidence 3322110 1111111111111 11 111111111222111 No 172 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=98.64 E-value=1.6e-07 Score=57.85 Aligned_cols=419 Identities=10% Similarity=-0.017 Sum_probs=184.0 Q ss_pred CCchhH---HHHh-HHHHH--HHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHH Q lcl|NC_016762. 1 MTDKLD---LAVN-HAMSS--AIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTC 74 (456) Q Consensus 1 ~~~~~~---~~~~-~a~~~--~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~ 74 (456) +...-+ ..++ |-.+. .+.+-.+-|.+---.+-..+ .............+.+++.||+..+.=+ T Consensus 37 ~~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~-----------~~~~~~~~~~ki~~n~~k~Ivd~~~~yl 105 (501) T protein:vir:27 37 MVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLQFG-----------RRKDREMADKRAVHNYGRMISKFKTGYL 105 (501) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccC-----------ccCccccccceeccchHHHHHHHHhhhh Confidence 333222 2211 22111 12222222211000000000 0000000111235789999999999999 Q ss_pred hhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccc---ccc------C Q lcl|NC_016762. 75 WKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDR---PAR------G 144 (456) Q Consensus 75 tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~---Pl~------~ 144 (456) +-+.+++...++.+... ....|.+.+++.++...+.++.+....||.|++++..+ ||+..-. |.. . T Consensus 106 ~g~p~~~~~~d~~~~~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~~i~~~~p~~~~~v~d~ 181 (501) T protein:vir:27 106 AGNPIRVEYDDNDNNSQ----NDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEYDETRIKRLNPLETFVIYDN 181 (501) T ss_pred cccCeeEecCCccchHH----HHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCCCceEEEEEccceeEEEecC Confidence 99999987655443322 22346667777889999999999999999999988774 3332111 111 0 Q ss_pred Cc-CceeEEEEeccccCChhhhhccccccccCCc-eeEEEeecccCCccccceeeehhhhheecC--CcCCCcchHHHHH Q lcl|NC_016762. 145 KL-NGLAKVTPAWAGCLKPKSFDEKPDSETYGQP-TMWEYTEASQAGRPGLVRDIHPDRVFILGD--WTGDAIGFLEPAY 220 (456) Q Consensus 145 ~~-~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P-~~y~i~~~~~~g~~~~~~~IH~SRli~~~~--~~~~G~S~le~~~ 220 (456) .. +.+....-+|........ ..--.++.+ ..|.+.. .++.......-|.=-.+.+.. .+..|.|.++.+. T Consensus 182 ~~~~~~~~~ir~~~~~~~~~~----~~~~~vyt~~~v~~~~~--~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~ 255 (501) T protein:vir:27 182 SLEDNSIAAVRYYNRGTLQNA----KDVVEIYTNEHIYTLDA--SDDFNEISVTTHAFGTVPITEFLNNVDGIGDYETEL 255 (501) T ss_pred CCCCceEEEEEEEEeeecCCc----EEEEEEEeCCeEEEEEe--CCceeeccccccCCCcccEEEecCCCCCCCchhhhH Confidence 00 001111111110000000 000000011 1111110 000000011223211111221 2346889999888 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHH-HHHHHHhcCCCeEEecCCCc--eeE Q lcl|NC_016762. 221 NSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFN-EAARQLNRGNDVLLPTQGAT--VTQ 297 (456) Q Consensus 221 ~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~lid~~d~--~~~ 297 (456) +-+.+++.+.-..+..+...+...+.+..... ....+....+. ..+-.+...........+.+ |-. T Consensus 256 ~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 324 (501) T protein:vir:27 256 YLIDLYDSAESDTANHMSDMADAILAIYGDLA-----------LPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLT 324 (501) T ss_pred HHHHHHHHHHHHHHHHHHHhcCceeeeecCcc-----------CCcccchhhhhhcCceeecccccccCCCCCcceeeee Confidence 87777877766655544333333332221100 00001111110 00000001101111112223 444 Q ss_pred EecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----HHHHHHHHHHHhhhhHHHHHHHHHHHHhc- Q lcl|NC_016762. 298 MVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----YHNARCQARRVQELTFEINDLFAHLMRIG- 371 (456) Q Consensus 298 ~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~- 371 (456) .+.+.+++...++...+.+...+++|-.-+ +...| |.++. ++ .-... .+.++..++..|++++.+++... T Consensus 325 ~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~~~~--n~Sg~Al~~~~~~l~~k-a~~~~~~~~~~l~~~~~li~~~~~ 400 (501) T protein:vir:27 325 KSYDVSGAEAYKTRLNRDIHIFTNIPDMSD-TNFSG--NTSGEALKYKLFGLDQD-RVDTQSQFTQGLKRRYRLAARIGS 400 (501) T ss_pred ccCCHHHHHHHHHHHHHHHHHHhCCcccCc-ccccc--CchHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHh Confidence 556668999999999999999999996422 22212 23333 22 22233 34555678999999998876531 Q ss_pred C---cC--CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHHcCCcCc-CHH-HHHHHhcc----cCCCCCCCCcc Q lcl|NC_016762. 372 V---VP--LKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAIGTGEPVF-TAE-EIREEAGY----DPLQGGDPLPD 438 (456) Q Consensus 372 ~---~~--~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~~g~~~i-~~~-E~R~~~~~----~~~~~~~~~~~ 438 (456) . +. ...+++|.|+|-...+.+|.|++..+.+-... +++.. .+-+ +++ |+...-.. +.........+ T Consensus 401 ~~~~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl~g~iS~et~l~~-l~~v~D~~~E~eri~~E~~e~~~~~~~~~~~~ 479 (501) T protein:vir:27 401 LVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLGGQVSQETALSL-SGLVESPNEELDKINKEVSEIDFKGYSNDFNE 479 (501) T ss_pred hcccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHh-CCCCCCHHHHHHHHHHHHHhhhHhhhcCcccc Confidence 1 11 12368899999999999999999887765332 22221 1123 333 43221111 10000000111 Q ss_pred cCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 439 TEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 439 ~~~~d~~~~~~d~~~~~e 456 (456) ......+..+.+.+.++| T Consensus 480 ~~~~~~d~~~~~~~d~~e 497 (501) T protein:vir:27 480 HVGKYTDEVKETHTDDFE 497 (501) T ss_pred ccccccCCCCCCcccccc Confidence 111111111111222222 No 173 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=98.63 E-value=1.7e-07 Score=57.65 Aligned_cols=409 Identities=8% Similarity=-0.029 Sum_probs=176.1 Q ss_pred CCchh--HHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCC Q lcl|NC_016762. 1 MTDKL--DLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTN 78 (456) Q Consensus 1 ~~~~~--~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~ 78 (456) -.+++ ++...|-.+-........|.+.---+=..+++..... . .+.......-.+.+++.||+..+.=++.+. T Consensus 26 ~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~--~---~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~ 100 (478) T protein:vir:10 26 TQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPPKRDVNG--D---YDETKPDWRMYTNYHQNLVDQKVAYAVANP 100 (478) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcccccccccc--c---cccccccceeccchHHHHHHHHHhhhccCC Confidence 01111 1111111111111111112110000000000000000 0 000001111246889999999999999999 Q ss_pred CEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcc---cccc------CC-cC Q lcl|NC_016762. 79 PQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWD---RPAR------GK-LN 147 (456) Q Consensus 79 ~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~---~Pl~------~~-~~ 147 (456) +++...++.. .+.|...++ -++...+.++.+....+|.+++++..+ +|+..- .|.. .. .+ T Consensus 101 ~~~~~~~d~~--------~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~~~d~~g~~~~~~~~p~~~~~i~d~~~~~ 171 (478) T protein:vir:10 101 VTFGVDNDKA--------LKQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFRVPAEQAVPIWTNKERD 171 (478) T ss_pred eeeecCChHH--------HHHHHHHHh-cCHHHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEEcCCCCC Confidence 9885433221 123444444 478888899999999999999888774 333211 1221 01 11 Q ss_pred ceeEEEEeccccCChhhhhccccccccCCc---eeEEEeeccc--------CCcc---ccceeeehhhhheecC--CcCC Q lcl|NC_016762. 148 GLAKVTPAWAGCLKPKSFDEKPDSETYGQP---TMWEYTEASQ--------AGRP---GLVRDIHPDRVFILGD--WTGD 211 (456) Q Consensus 148 ~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P---~~y~i~~~~~--------~g~~---~~~~~IH~SRli~~~~--~~~~ 211 (456) .+....-+|.. . +...-.++.| .+|....... .+.. ......|.--.+.+.. .... T Consensus 172 ~~~~~v~~~~~--~------~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~ 243 (478) T protein:vir:10 172 ELQAFIRVYEL--D------GAERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNKLMSWGRVPFIPFKNNPQ 243 (478) T ss_pred ceEEEEEEEEe--c------CceEEEEEeCCeEEEEEEcCCeeeccccccccccccceecccccccCCccceEEeccCCC Confidence 11111111110 0 0000011111 1222210000 0000 0011122222222221 2346 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEec- Q lcl|NC_016762. 212 AIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPT- 290 (456) Q Consensus 212 G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid- 290 (456) |.|.++.+.+-+.+++.+.-..+..+-..+...+.+. + .+.+...+.. ..+.. ...+.+. T Consensus 244 g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~--------g------~~~~~~~~~~----~~~~~-~~~~~~~~ 304 (478) T protein:vir:10 244 EVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILK--------G------YEGEDMKDFM----HNLKY-YKAISVAG 304 (478) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeee--------c------CCccccchhh----hhhhh-cceEEecC Confidence 8899988877777777765555543322222222111 1 1101111100 11111 1223332 Q ss_pred -CCCc--eeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH-HH---HHHHHHHHHhhhhHHHHH Q lcl|NC_016762. 291 -QGAT--VTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK-YH---NARCQARRVQELTFEIND 362 (456) Q Consensus 291 -~~d~--~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~-ny---yd~I~~~Qe~~lrp~L~~ 362 (456) .+.+ +-..+.+.+++...++.+.+.+...+++|-.-+-+. +| |.++. ++ -| ...+ ...+..+...|++ T Consensus 305 ~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--~~-n~Sg~Al~~~~~~l~~k~-~~~~~~~~~~l~~ 380 (478) T protein:vir:10 305 ESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKF--GN-SPSGIALKFMYSNLDLKA-NKLKNKTLTALQE 380 (478) T ss_pred CCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCcccc--cc-ccHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH Confidence 2233 445567788999999999999999999995322111 12 33443 22 22 2233 4445678999999 Q ss_pred HHHHHHHhcCc-CCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHH-cCCcCcCHHHHHHHhcccC--CCCCCCC Q lcl|NC_016762. 363 LFAHLMRIGVV-PLKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAIG-TGEPVFTAEEIREEAGYDP--LQGGDPL 436 (456) Q Consensus 363 l~~~l~~s~~~-~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~-~g~~~i~~~E~R~~~~~~~--~~~~~~~ 436 (456) ++.+++....+ ....++++.|+|-...++++.|++..+.+.... +++. .+ .+-++++..+....+. ....... T Consensus 381 ~~~li~~~~g~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~-~v~D~~~E~~ri~~E~~~~~~~~~~ 459 (478) T protein:vir:10 381 LLQYIIDFYRLDVKVQDIEITFNFNVMVNELENSQIAMNSTGLLSKETILSNHA-WVEDPVAEMERIEQENIELNQQLPD 459 (478) T ss_pred HHHHHHHHhCCCcccccceEEecCCCCCCHHHHHHHHHHHhCCCChHHHHHhCC-CCCCHHHHHHHHHHHHHHHHhhccc Confidence 99887665433 334578999999999999999998776643321 1221 11 1223332222211110 0000000 Q ss_pred cccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 437 PDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 437 ~~~~~~d~~~~~~d~~~~~e 456 (456) . .....++....++..++| T Consensus 460 ~-~~~~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 460 I-EEGLNGEQQRQSENNQPE 478 (478) T ss_pred c-ccccCCCCCCCCCCCCCC Confidence 0 001111122222222223 No 174 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=98.60 E-value=2.1e-07 Score=57.18 Aligned_cols=419 Identities=9% Similarity=-0.020 Sum_probs=178.5 Q ss_pred CCchhHHHHhHHHH-----HHHHHHHHHHhhhhhcc--CcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhH Q lcl|NC_016762. 1 MTDKLDLAVNHAMS-----SAIARARMSLLNQGIGH--DAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTT 73 (456) Q Consensus 1 ~~~~~~~~~~~a~~-----~~~~~~~d~~~n~~~~~--gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed 73 (456) .-...+.++..-.+ ....+ -+.+.+...+- ...+......|+... ...-+.....+.+|++.||+..|+= T Consensus 3 ~~~~~~~~i~~w~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~w~~~--~~~~~~~~~~~~~l~~~i~~~~A~l 79 (518) T protein:vir:78 3 VWSVMTRFIKGWLNGKPNGSEPEL-IPKYLPLVPDNQKEWSKDSYLTSLWAQG--YVPTVHDKLMNSGTGNEIVVVAAEY 79 (518) T ss_pred chhhHHHHHHHhhcCCCCccchhc-cHHHhhhcccchhhhhhhhhhhhhcccC--CCCccccccccCChHHHHHHHHHHh Confidence 11111121111100 00000 00000000000 000010001110000 0112233344678999999999999 Q ss_pred HhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCcc--------ccccCC Q lcl|NC_016762. 74 CWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWD--------RPARGK 145 (456) Q Consensus 74 ~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~--------~Pl~~~ 145 (456) ++-+.++|+-.+.+ ..+. ....+.|.+.++..+++..+.+++.....-||.++=+.+.+++..- -|+. . T Consensus 80 l~~e~~~i~v~~~~-~~d~-e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~i~~v~ad~~~P~~-~ 156 (518) T protein:vir:78 80 ISGKPLSIDVTGVN-GSKD-ENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINILNGRPSISVHSSSQFWIDF-K 156 (518) T ss_pred hcCCCceEEecCcc-ccCc-HHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEECCeeEEEEEcCCeeEEEe-e Confidence 99998877522211 1111 1223458888888899999999999877777777655554444211 1221 1 Q ss_pred cCceeEEEEeccccCChhh-----------hhc-cccccccCCceeEEEeeccc---CCccc------------------ Q lcl|NC_016762. 146 LNGLAKVTPAWAGCLKPKS-----------FDE-KPDSETYGQPTMWEYTEASQ---AGRPG------------------ 192 (456) Q Consensus 146 ~~~l~~i~~~~~~~~~~~~-----------~~~-Dp~s~~yg~P~~y~i~~~~~---~g~~~------------------ 192 (456) .+.+..+ +||........ +.. +-..-.|+ .|.|+-... .+... T Consensus 157 ~g~~~~~-~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~---~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~ 232 (518) T protein:vir:78 157 NNEPFRF-NFFEEIPTSNKADIYYLVESREIKQWDKEGKKLS---GGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTN 232 (518) T ss_pred cCcEEEE-EEEEEeecCCcceeEEEEEeeccccccceeeccc---ceeEEEEEeeecCcccccccccccccccccccccc Confidence 1112211 22221111000 000 00000111 011210000 00000 Q ss_pred -c--ceeee-hhhhh--ee--c-------CCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhh Q lcl|NC_016762. 193 -L--VRDIH-PDRVF--IL--G-------DWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGE 257 (456) Q Consensus 193 -~--~~~IH-~SRli--~~--~-------~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 257 (456) . ...|+ -++.. .| + ..+++|+|++..+.+.+..++.+...+..-| +.+-+.+.+ ..+ T Consensus 233 ~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~-~~g~~~i~v-------~~~ 304 (518) T protein:vir:78 233 DIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREG-EKTKTKIAA-------SER 304 (518) T ss_pred cCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHH-HhCCceeee-------chh Confidence 0 00010 11111 11 1 1245699999999998888888765544322 211111111 111 Q ss_pred HHhhh--cCCHHHHHHHHHHHHHHHhcCCCeEE-----ecCC----CceeEEecccC--CHHHHHHHHHHHHHhhhcCCe Q lcl|NC_016762. 258 IASTY--GVTLDALNERFNEAARQLNRGNDVLL-----PTQG----ATVTQMVSAVS--DPGPTYNVNLQTAAAGVDIPT 324 (456) Q Consensus 258 l~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~l-----id~~----d~~~~~~~~~s--gl~~~~~~~~~~~aaas~IP~ 324 (456) +.... +.+... .-.+ ....+... .+.+ +.++.++..+- -....++.++..+...+|++- T Consensus 305 ~l~~~~~~~~~~~-~~~f-------d~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~ 376 (518) T protein:vir:78 305 MFRKKVNKSTDKE-EWSM-------NVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNP 376 (518) T ss_pred HhccCCCCCCCcc-cccc-------CCCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCCh Confidence 11100 111000 0001 11111110 1111 23666665543 456667778888888888887 Q ss_pred EEeeccCCCcccchH---HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhc---Cc-------CCCCceEEEeCCCCCCCH Q lcl|NC_016762. 325 KILVGMQTGERASSE---DQKYHNARCQARRVQELTFEINDLFAHLMRIG---VV-------PLKAEFTAIWDDLTVPTK 391 (456) Q Consensus 325 t~L~G~sp~Glnst~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~---~~-------~~~~d~~~~f~pL~~~se 391 (456) .- ||.+.+..++|+ ...--|.+|..+|. .++..|++|+..++... .+ ..+.+++|.|++--..++ T Consensus 377 ~t-fg~~~~~~TATei~s~~~~~~~t~~~~~~-~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~ 454 (518) T protein:vir:78 377 AT-FNLGNREVKATEIWSLQDATVRKIEKKKR-LIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNL 454 (518) T ss_pred hh-cCcccccccHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCH Confidence 64 587666656664 44457888888875 67999998887765321 11 123479999999999999 Q ss_pred HHHHHHHHHHHHH----HHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 392 AERLANSKTMSEI----NSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 392 ke~Aei~~~~A~a----~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) ++++++..+...+ ...++..-.+-.+.+|+++.+..-. .|.+..+++++++-+-.+ T Consensus 455 ~~~~~~~~~~v~aGimS~e~~i~~~~~~~~deea~~e~~ri~---------~E~~~~~~~~p~~~~g~~ 514 (518) T protein:vir:78 455 NELSSTLNNMNSALAMSVEEKVKLIHPKWEDEEIQAEVKRIY---------LENAIGEVPDPEAIGGME 514 (518) T ss_pred HHHHHHHHHHHhcCCCCHHHHHHHhCCCCCHHHHHHHHHHHH---------HHhcccCCCCCccccCCC Confidence 9888876543221 1111110000123333332211000 000001111111111111 No 175 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=98.53 E-value=3e-07 Score=56.38 Aligned_cols=417 Identities=11% Similarity=0.067 Sum_probs=181.3 Q ss_pred CCchhHHHHhHHH-----------------------HHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHH Q lcl|NC_016762. 1 MTDKLDLAVNHAM-----------------------SSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMY 57 (456) Q Consensus 1 ~~~~~~~~~~~a~-----------------------~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y 57 (456) |.++++-....-. -..+.+.+..|.+- -++ ..|. .........-+ T Consensus 3 ~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~------~~~-----~~~~-~~~~~~~~~~~ 70 (522) T protein:vir:47 3 LFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSK------WDD-----VQYK-NTDGDIKSRPM 70 (522) T ss_pred hHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCC------ccc-----cccc-ccCcchhcccc Confidence 3333322211100 01111111111110 000 0000 00001111123 Q ss_pred hcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCC Q lcl|NC_016762. 58 RRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQP 137 (456) Q Consensus 58 ~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~ 137 (456) .+-.+++.||+..|+=++-+-+.|..+++. ..+.|.+.++..++...+.+++..+...||.++-+.++.++. T Consensus 71 ~slnl~~~i~~~~A~lv~~e~~~i~v~d~~--------~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~~~ 142 (522) T protein:vir:47 71 NHLPIARTASKKIASLVYNEQATITTKNEI--------LQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYIDGDKV 142 (522) T ss_pred eecchHHHHHHHHhhhhcCCcceeecCChH--------HHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEcCCce Confidence 444899999999999999998888654321 223578888888999999999998777777777777654432 Q ss_pred c------c--ccccCCcCceeEEEEecccc------------CChhhhhc-----cccccccCCceeEEEeeccc----- Q lcl|NC_016762. 138 W------D--RPARGKLNGLAKVTPAWAGC------------LKPKSFDE-----KPDSETYGQPTMWEYTEASQ----- 187 (456) Q Consensus 138 ~------~--~Pl~~~~~~l~~i~~~~~~~------------~~~~~~~~-----Dp~s~~yg~P~~y~i~~~~~----- 187 (456) . + -|+.-..+++.....++... +--.+|.. +... ..|. .|.|..... T Consensus 143 ~i~~v~ad~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~-~~~~--~~~I~n~ly~~~~~ 219 (522) T protein:vir:47 143 RVAFIQAPVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGST-NDKK--YYRITNELYRSDVN 219 (522) T ss_pred EEEEEcCCceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeeccccccccccc-ccCC--ceEEEEEEeecCCC Confidence 1 1 24422222222221222110 00001100 0000 0011 122211100 Q ss_pred --CCccc------------cceee-ehhhhh--eec--------CCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_016762. 188 --AGRPG------------LVRDI-HPDRVF--ILG--------DWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAA 242 (456) Q Consensus 188 --~g~~~------------~~~~I-H~SRli--~~~--------~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~ 242 (456) -|.+. ....+ +-+|.+ +|. ..++.|+|++..+.+.+..++.+-..+. ++..+ T Consensus 220 ~~lG~~v~l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~---~e~~~ 296 (522) T protein:vir:47 220 DVLGQRVNLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFM---WEVRM 296 (522) T ss_pred cccCccccccccccccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHH---HHHHh Confidence 00000 00111 112321 221 1245699999999988888887654433 22111 Q ss_pred hhhhhhhhhhccHhhHHhhh--c-CCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEeccc--CCHHHHHHHHHHHHH Q lcl|NC_016762. 243 RQLLLNFDKEINLGEIASTY--G-VTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSAV--SDPGPTYNVNLQTAA 317 (456) Q Consensus 243 ~~l~~~~~~~~~~~~l~~~~--~-~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~~--sgl~~~~~~~~~~~a 317 (456) .+..+-.. ..+.... + .+.......++ .-..+....+.. .+.+..++.++..+ .-+...+..+...++ T Consensus 297 g~~~i~v~-----~~~l~~~~~~~~g~~~~~~~fd-~~~~~f~~~~~~-~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~ 369 (522) T protein:vir:47 297 GQRRVIVP-----EHLTQRQYQRPDGTIDFRPRFD-VEQNVYMQIGGS-SMDAGGITDLTSPIRANDYILAISEGLKLFE 369 (522) T ss_pred ccceeecc-----hHHhccCCCCCCcccccccccC-cccceEeecCCC-CCCCCcceeeccccChHHHHHHHHHHHHHHH Confidence 11111111 1111110 0 11111111111 000011111111 11222355555443 346667777788888 Q ss_pred hhhcCCeEEeeccCCCcc-cchH---HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhc--------CcCCCCceEEEeCC Q lcl|NC_016762. 318 AGVDIPTKILVGMQTGER-ASSE---DQKYHNARCQARRVQELTFEINDLFAHLMRIG--------VVPLKAEFTAIWDD 385 (456) Q Consensus 318 aas~IP~t~L~G~sp~Gl-nst~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~--------~~~~~~d~~~~f~p 385 (456) -.+|++-. -||-..+|. ++|+ ..+.-|.+++.+|. .++..|++|+..|+... ..+.+.+++|.|++ T Consensus 370 ~~~gls~~-tf~~~~~~~kTAtEi~s~~~~~~~t~~~~~~-~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D 447 (522) T protein:vir:47 370 MQIGVSSG-MFTFDGQGMKTATEIVSENSDTYQMRSSIVA-LVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDD 447 (522) T ss_pred HHhCCCcc-ccCccccccccHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCC Confidence 88888764 556555555 3553 45567888888886 57999999988775332 12345679999999 Q ss_pred CCCCCHHHHHHHHHHHHHH----HHHHHHcCCcCcCHHHHHHHhcc---cCCCCCCCCcccCC-CCCCCCCcCCCC Q lcl|NC_016762. 386 LTVPTKAERLANSKTMSEI----NSAAIGTGEPVFTAEEIREEAGY---DPLQGGDPLPDTEP-EDEDAARTDPTG 453 (456) Q Consensus 386 L~~~seke~Aei~~~~A~a----~~~~~~~g~~~i~~~E~R~~~~~---~~~~~~~~~~~~~~-~d~~~~~~d~~~ 453 (456) --.+++.++++...+...+ ...++.. ..-++.+|+++.+.. +......+..+..+ .+.+..++|+.| T Consensus 448 ~i~~D~~~~~~~~~~~v~aG~~s~e~~i~~-~~g~~eeea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~d~~~ 522 (522) T protein:vir:47 448 GVFTDRHAELDYWAKMVAAGFSTKKRAIGK-TLNISGVEAEKELNAINSELLPMNDAELAIYGMHDQNEEKADDKG 522 (522) T ss_pred CCCCCHHHHHHHHHHHHhcCCCCHHHHHHh-cCCCChHHHHHHHHHHHHhhccCCCCCCCCCCCCCcccccCCCCC Confidence 8888877665554432221 1111111 111444444433211 11111001111111 112222233333 No 176 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=98.47 E-value=5.2e-07 Score=55.04 Aligned_cols=404 Identities=10% Similarity=-0.013 Sum_probs=170.2 Q ss_pred CCchh--HHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCC Q lcl|NC_016762. 1 MTDKL--DLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTN 78 (456) Q Consensus 1 ~~~~~--~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~ 78 (456) |+... ++.-.|..+.........|...---+-....+.......... -....-.+.+++.||+..+.=++-+. T Consensus 1 l~~~~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~-----~~~~ki~~n~~~~Ivd~~~~yl~G~p 75 (451) T protein:vir:10 1 MELEKIRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLR-----NADNRISHNFHEILVDEKASYMFTYP 75 (451) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccccccccccccccc-----ccccccccchHHHHHHhhhhheeccc Confidence 44321 111112222111111222221100000000011111111000 00112247899999999999888888 Q ss_pred CEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCceeEEE----- Q lcl|NC_016762. 79 PQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVT----- 153 (456) Q Consensus 79 ~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~----- 153 (456) +++...++.+.. +.++..+ +-++.....++.+....+|.|++++.++.......|.++.++ +..+. T Consensus 76 ~~~~~~~~~~~~-------~~~~~~~-~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~-~~~i~p~~~~ 146 (451) T protein:vir:10 76 VLFDIDNNKELN-------EKVTDVL-GNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFK-YGVVNTEEII 146 (451) T ss_pred ceeecCCcHHHH-------HHHHHHh-ccCHHHHHHHHHHHHhhcCeEEEEEeecCCccccccccccee-EEEEcccceE Confidence 887544332221 1233333 236777788888889999999998887543222223332222 22222 Q ss_pred EeccccCC--h--h-h-hh-c-cccccc---------cCCce-eEEEeecccCCccccc----eeeehhhhheecCC--c Q lcl|NC_016762. 154 PAWAGCLK--P--K-S-FD-E-KPDSET---------YGQPT-MWEYTEASQAGRPGLV----RDIHPDRVFILGDW--T 209 (456) Q Consensus 154 ~~~~~~~~--~--~-~-~~-~-Dp~s~~---------yg~P~-~y~i~~~~~~g~~~~~----~~IH~SRli~~~~~--~ 209 (456) |+|..... + . . |. . +..... ++.+. .|.+.....+ ..+.. ..-|.=-.+.+..+ + T Consensus 147 ~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~-~~~~~~~~~~~~~~~g~vPvv~~~nn 225 (451) T protein:vir:10 147 PIYRNGIERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVS-CCGSQIEHITVQHRFNSVPFVEFSNN 225 (451) T ss_pred EEEcCCCCCceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccC-ccccccccccccCCCCeeeEEEeccC Confidence 23321110 0 0 0 00 0 000000 01110 0111100000 00000 00111111222221 2 Q ss_pred CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEe Q lcl|NC_016762. 210 GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLP 289 (456) Q Consensus 210 ~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~li 289 (456) ..|.|.++.+-.-+.+++.+.-..+..+-..+-..+.+ .++ .+....+.... +..+ +...+ T Consensus 226 ~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~--------~g~---~~~~~~~~~~~-------~~~~-~~i~~ 286 (451) T protein:vir:10 226 IKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYIL--------ENF---GGEDTSEFLKE-------LKRY-KTIKT 286 (451) T ss_pred CCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeee--------ecC---CcccchhhHHH-------HhhC-CeEEe Confidence 34778888777666666665444443321111111111 111 01111111111 1111 22222 Q ss_pred c-----C--CCceeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----HHHHHHHHHHHhhhh Q lcl|NC_016762. 290 T-----Q--GATVTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----YHNARCQARRVQELT 357 (456) Q Consensus 290 d-----~--~d~~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----nyyd~I~~~Qe~~lr 357 (456) . . +-+|-..+.+..+....++...+.|...+++|-.-. + .-| |+++. ++ .-...+ ..++..++ T Consensus 287 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~-~~g--n~Sg~Alk~~~~~l~~k~-~~k~~~f~ 361 (451) T protein:vir:10 287 ETDSEGDSGGLKTMQIEIPTEARKIILEILKKQIYESGQGLQQDT-E-NFG--NASGVALKFFYRKLELKS-GLLETEFR 361 (451) T ss_pred cCcCCccCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCcccccc-c-ccc--cccHHHHHHHHHHHHHHH-HHHHHHHH Confidence 2 1 124445667788999999999999999999995321 1 112 33443 22 223334 34555789 Q ss_pred HHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHH--HHH-cCCcCcCHHHHHH-HhcccCCCCC Q lcl|NC_016762. 358 FEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSA--AIG-TGEPVFTAEEIRE-EAGYDPLQGG 433 (456) Q Consensus 358 p~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~--~~~-~g~~~i~~~E~R~-~~~~~~~~~~ 433 (456) +.|++++++++...-.....++.+.|+|-..-+++|.|++..+.+...+. ++. .+ .+-+++|..+ .......... T Consensus 362 ~~l~~~~~li~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~~p-~v~d~~~e~~~~~ee~~~~~~ 440 (451) T protein:vir:10 362 TSFDKLIKAILYFLGVTDYKKIQQTYTRNMMSNDLEDADIATKSVGIIPTKIILRHHP-WVDDVEEAEKLYLEEKKIQAS 440 (451) T ss_pred HHHHHHHHHHHHHhCCCCccceeEEecCCCCCCHHHHHHHHHHHhccCchHHHHHhCC-CCCCHHHHHHHHHHHHHHHHH Confidence 99999999887653334456899999999999999999887776432221 111 11 0112222111 1000000000 Q ss_pred CCCcccCCCCCCCCCcCCCCC Q lcl|NC_016762. 434 DPLPDTEPEDEDAARTDPTGE 454 (456) Q Consensus 434 ~~~~~~~~~d~~~~~~d~~~~ 454 (456) ....+. .+-.+ T Consensus 441 ~~~~~~----------~~~~~ 451 (451) T protein:vir:10 441 KVSDDY----------NNFTE 451 (451) T ss_pred HHHhhc----------CCCCC Confidence 000000 00000 No 177 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=98.45 E-value=6e-07 Score=54.71 Aligned_cols=402 Identities=10% Similarity=-0.010 Sum_probs=172.5 Q ss_pred CCchh-------HHHH-hHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchh Q lcl|NC_016762. 1 MTDKL-------DLAV-NHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVT 72 (456) Q Consensus 1 ~~~~~-------~~~~-~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~ae 72 (456) |.++. +-.+ .|-.+....+....|.+.--.+-....+.............. --.+++++.||+..+. T Consensus 20 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~-----ki~~n~~~~Ivd~~~~ 94 (474) T protein:vir:96 20 IKPKYETQEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKPDW-----RMFTNYHQNLVDQKVA 94 (474) T ss_pred hhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhcccccccccccch-----hcccchHHHHHHhhhh Confidence 11111 1111 222222222222222221000000011111100000000000 0146899999999999 Q ss_pred HHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcc---cccc----- Q lcl|NC_016762. 73 TCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWD---RPAR----- 143 (456) Q Consensus 73 d~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~---~Pl~----- 143 (456) =.+.+.+++...++.. .+.+...++. ++.....++.+....+|.+++++.++ +|+..- .|.. T Consensus 95 ~l~g~p~~~~~~d~~~--------~~~l~~~~~n-~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~~i~~~~p~~~~~v~ 165 (474) T protein:vir:96 95 YAVANPVTFSSDDDKS--------LKTIQEVLNH-KWDDKLVDILTAASNKGIEWLQPYIDENGEFKTFRVPAEQAIPIW 165 (474) T ss_pred hhcccCceeecCchHH--------HHHHHHHHhc-CHHHHHHHHHHHHHhcCeeEEEEEecCCCceEEEEEcccceEEEE Confidence 9999999986543221 1234444443 67778888888888999999888774 333211 1211 Q ss_pred -C-CcCceeEEEEeccccCChhhhhccccc-cccCCc---eeEEEeeccc-----------CCccccceeeehhhhheec Q lcl|NC_016762. 144 -G-KLNGLAKVTPAWAGCLKPKSFDEKPDS-ETYGQP---TMWEYTEASQ-----------AGRPGLVRDIHPDRVFILG 206 (456) Q Consensus 144 -~-~~~~l~~i~~~~~~~~~~~~~~~Dp~s-~~yg~P---~~y~i~~~~~-----------~g~~~~~~~IH~SRli~~~ 206 (456) . ..+.+....=+|. .+... -.++.+ .+|....... .........-|.--.+.+. T Consensus 166 d~~~~~~~~~~vr~~~---------~~~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 236 (474) T protein:vir:96 166 TNKERDTLKAFIRYYR---------LDGAERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKRVSWGRVPFI 236 (474) T ss_pred cCCCCCceEEEEEEEe---------ecCceEEEEEeCCeEEEEEecCCceeeccccccccccccccccccccCCCceeEE Confidence 0 0111111111111 00000 011111 1111110000 0000001112333323332 Q ss_pred C--CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC Q lcl|NC_016762. 207 D--WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGN 284 (456) Q Consensus 207 ~--~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 284 (456) . ....|.|.++.+.+-+.+++.+.-..+..+-..+...+.+. ++ .+.+..+.. ..+.. . T Consensus 237 ~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~--------g~---~~~~~~~~~-------~~~~~-~ 297 (474) T protein:vir:96 237 PFKNNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILK--------GY---EGQDLDEFM-------RNLKY-Y 297 (474) T ss_pred EeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeee--------cC---Ccccccchh-------hhhhc-C Confidence 2 23458888888777777777665555443322222222111 10 011111111 11222 2 Q ss_pred CeEEec-CCCceeE--EecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HHH-H---HHHHHHHHHhhh Q lcl|NC_016762. 285 DVLLPT-QGATVTQ--MVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QKY-H---NARCQARRVQEL 356 (456) Q Consensus 285 ~~~lid-~~d~~~~--~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~n-y---yd~I~~~Qe~~l 356 (456) ..+.++ ++.+++. .+.+.++....++...+.|...+++|-.- ++.. || |.++. ++. | ...+ ...+..+ T Consensus 298 ~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~-~~~~-~~-n~Sg~Al~~~~~~l~~k~-~~k~~~~ 373 (474) T protein:vir:96 298 KAINVDGDGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQ-QDKF-GN-SPSGIALKFMYSNLDLKA-NKLKNKT 373 (474) T ss_pred ceEEecCCCCceeEEeecCChHHHHHHHHHHHHHHHHHhCCcccc-cccc-cc-ccHHHHHHHHHHHHHHHH-HHHHHHH Confidence 333333 3445554 45677899999999999999999999642 2222 11 33333 222 2 2333 4455578 Q ss_pred hHHHHHHHHHHHHhcCc-CCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHHcCCcCcCHHHHHHHhccc----- Q lcl|NC_016762. 357 TFEINDLFAHLMRIGVV-PLKAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAIGTGEPVFTAEEIREEAGYD----- 428 (456) Q Consensus 357 rp~L~~l~~~l~~s~~~-~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~~g~~~i~~~E~R~~~~~~----- 428 (456) +..|++++.+++..... ....++.|.|+|-...+++|.|++.++ |-+.+ +++..--.+-++++-.+....+ T Consensus 374 ~~~l~~~~~~i~~~~~~~~~~~~i~i~f~~~~p~~~~e~~~~~~~-ag~iS~et~~~~~~~v~d~~~E~~ri~~E~~e~~ 452 (474) T protein:vir:96 374 LTALQELLQYIIDFYKLNIKVQDVEITFNFNVMVNELEQSQIGVQ-SQYLSKETVVTNHPWVDDPVAELERIEQDNIDFN 452 (474) T ss_pred HHHHHHHHHHHHHHhCCCcccceeeEEeccCCCcCHHHHHHHHHh-cCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHH Confidence 99999999887655333 234578999999999999998887532 22111 1111100122232222211110 Q ss_pred -CCCCCCCCcccCCCCCCCCCcC Q lcl|NC_016762. 429 -PLQGGDPLPDTEPEDEDAARTD 450 (456) Q Consensus 429 -~~~~~~~~~~~~~~d~~~~~~d 450 (456) ........++....|++. +.+ T Consensus 453 ~~~~~~~~~~~~~~~d~~~-e~~ 474 (474) T protein:vir:96 453 KQLPPLEGDANGRAQDNES-ETN 474 (474) T ss_pred hcccccccccccccCCCcc-cCC Confidence 000000000000111111 111 No 178 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=98.44 E-value=6.4e-07 Score=54.55 Aligned_cols=417 Identities=7% Similarity=-0.054 Sum_probs=173.4 Q ss_pred CCc-------hhHHHHh-H-HHHHH-HHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccc Q lcl|NC_016762. 1 MTD-------KLDLAVN-H-AMSSA-IARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKI 70 (456) Q Consensus 1 ~~~-------~~~~~~~-~-a~~~~-~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~ 70 (456) ++. .+.-.++ | .-+.. +.+..+ |.+.--.+- .+...... +.+. . . -...+++.||+.. T Consensus 9 ~~~~~~~~~~~~~~~i~~~~~~~~~r~~~~~~-yy~g~~~i~-~~~~~~~~-~~~~----~---k--i~~n~~~~iv~~~ 76 (489) T protein:vir:99 9 IDYESKLWIDQLKNYISRFKAEQLERLKELKR-YYLGDNNIK-YRPAKTDK-YAAD----N---R--IASDFAKYITVFE 76 (489) T ss_pred eCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH-HhcccCccc-cccccccc-cCCc----c---e--eecchHHHHHHHH Confidence 222 2222222 2 11111 122222 222110000 00000000 0110 0 1 1478999999999 Q ss_pred hhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCC-----cc---ccc Q lcl|NC_016762. 71 VTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQP-----WD---RPA 142 (456) Q Consensus 71 aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~-----~~---~Pl 142 (456) +.=++-+.+++...++.. ...+...+++-++...+.++.+....+|.|++++.+..+.+ .- .|. T Consensus 77 ~~~l~g~~~~~~~~d~~~--------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~~~i~~~~p~ 148 (489) T protein:vir:99 77 QGYMLGVPVEYKNENKDL--------QAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTEVKLYQLPAE 148 (489) T ss_pred hhhhccCCceeecCChhH--------HHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcceEEEEEccc Confidence 999999998886543221 22366666667788888899999999999988877631111 10 111 Q ss_pred c------CC-cCceeEEEEeccccCChhhhhccccccccCCce-eEEEeecccC--CccccceeeehhhhheecC--CcC Q lcl|NC_016762. 143 R------GK-LNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPT-MWEYTEASQA--GRPGLVRDIHPDRVFILGD--WTG 210 (456) Q Consensus 143 ~------~~-~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~-~y~i~~~~~~--g~~~~~~~IH~SRli~~~~--~~~ 210 (456) . .. .+.+....-+|......+ ....--.++.|. .|++.....+ +.......-|.=-.+.+.. ... T Consensus 149 ~~~~v~dd~~~~~~~~~i~~~~~~~~~~---~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 225 (489) T protein:vir:99 149 QTFVIYDDTYQRNSLMAVHFYDIDYGSG---KRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGVPVNEYANNE 225 (489) T ss_pred ceEEEEcCCCCCceEEEEEEEEEecCCC---ceEEEEEEEeCCcEEEEEecCCCcccceecccccccCCceeEEEeecCC Confidence 1 00 011111111111000000 000011122221 1222111100 0000001112111111111 234 Q ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhh--ccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEE Q lcl|NC_016762. 211 DAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKE--INLGEIASTYGVTLDALNERFNEAARQLNRGNDVLL 288 (456) Q Consensus 211 ~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~--~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 288 (456) .|.|.++.+.+-+.+++.+.-..+..+--.+...+.++.... .+.....+.......... . +..-........ T Consensus 226 ~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~~~~~~~~-~----~~~~~~~~~~~~ 300 (489) T protein:vir:99 226 ERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTGADENDYLDDGRLNPNGRL-A----ISIGFKKAQVLI 300 (489) T ss_pred CCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCcccccchhhhhhccccccccc-c----cccccccceeee Confidence 578888887776666666655444433222222332221100 000000000000000000 0 000000011111 Q ss_pred ecC---------CCceeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-H----HHHHHHHHHHHHh Q lcl|NC_016762. 289 PTQ---------GATVTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-Q----KYHNARCQARRVQ 354 (456) Q Consensus 289 id~---------~d~~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~----~nyyd~I~~~Qe~ 354 (456) ++. +-.|-....+.+++...++.+.+.+...+++|-.-.- ..+| |.++. + ..-...+..+| . T Consensus 301 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~--~~~~-n~Sg~Al~~~~~~l~~k~~~k~-~ 376 (489) T protein:vir:99 301 LDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDM--KFSG-VQSGESMKYKLMASDNYREKQE-R 376 (489) T ss_pred eccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcccccc--cccc-cchHHHHHHHHHHHHHHHHHHH-H Confidence 111 2234455667789999999999999999999964321 1122 33443 2 23344455444 4 Q ss_pred hhhHHHHHHHHHHHHhc---CcCC-----CCceEEEeCCCCCCCHHHHHHHHHHHHHHHH--HHHHcCCcCcC---H-HH Q lcl|NC_016762. 355 ELTFEINDLFAHLMRIG---VVPL-----KAEFTAIWDDLTVPTKAERLANSKTMSEINS--AAIGTGEPVFT---A-EE 420 (456) Q Consensus 355 ~lrp~L~~l~~~l~~s~---~~~~-----~~d~~~~f~pL~~~seke~Aei~~~~A~a~~--~~~~~g~~~i~---~-~E 420 (456) .++..|++++.+++... .++. ..+++|.|+|=...+..+.|++..+.+-+.. +++.. .+-++ + +| T Consensus 377 ~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~giis~et~~~~-l~~v~~~d~~~E 455 (489) T protein:vir:99 377 LFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLYGIVSDQTIFEI-LNTVTGVDAEAE 455 (489) T ss_pred HHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHh-cCCCCchhHHHH Confidence 68999999998775431 1211 1368999999999999999998776543221 12211 11232 2 23 Q ss_pred HHHHhcc----cCCCCCCCCcccCCCCCCCCCcCC Q lcl|NC_016762. 421 IREEAGY----DPLQGGDPLPDTEPEDEDAARTDP 451 (456) Q Consensus 421 ~R~~~~~----~~~~~~~~~~~~~~~d~~~~~~d~ 451 (456) +...... ..+.+.. .......++++.+..| T Consensus 456 ~~ri~~E~~~~~~~~~~~-~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 456 LKRLKEEADKKQSLPEPR-LVGDASGQEEPTAEKP 489 (489) T ss_pred HHHHHHHHHHHhcccccc-ccCCCCCCcCCCCCCC Confidence 3221111 1111100 0001111112222222 No 179 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=98.40 E-value=2e-07 Score=57.36 Aligned_cols=401 Identities=8% Similarity=-0.063 Sum_probs=172.9 Q ss_pred CCchhH-HHHh-HHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCC Q lcl|NC_016762. 1 MTDKLD-LAVN-HAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTN 78 (456) Q Consensus 1 ~~~~~~-~~~~-~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~ 78 (456) |+.+.= -.++ |..+...-.....|.+.- . ............. ..--...+++.|||..+.=++-++ T Consensus 17 ~~~~~i~~~i~~~~~~~~r~~~~~~yy~g~------~-~i~~~~~~~~~~~-----~~ki~~n~~~~ivd~~~~~l~g~~ 84 (453) T protein:vir:73 17 ITDKVVNDFMKKHQEEVERYEYLGNMYKGI------M-EISSQKAKDSWKP-----DNRLTNNFAKYIVDTFVGYFNGIP 84 (453) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhccc------c-chhcCCCCCccCc-----cceeecchHHHHHHHhhhhhcccC Confidence 433321 1111 221111111122222211 0 0100000000000 011246789999999999999999 Q ss_pred CEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcc---cccc------CCcCc Q lcl|NC_016762. 79 PQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWD---RPAR------GKLNG 148 (456) Q Consensus 79 ~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~---~Pl~------~~~~~ 148 (456) +++...++.+. ..++..++.-++...+.++.+....||.+++++..+ +|...- .|.. ...+. T Consensus 85 ~~~~~~d~~~~--------~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~dd~~~~ 156 (453) T protein:vir:73 85 IKKTHDDKSVL--------EAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNESTESEVIYCSPLNVFMVYDDSIKQ 156 (453) T ss_pred ceeecCChHHH--------HHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccceEEEEeCCCCc Confidence 98864432211 236666777788899999999999999999888774 333211 2221 01111 Q ss_pred eeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC--cCCCcchHHHHHHHHHHH Q lcl|NC_016762. 149 LAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW--TGDAIGFLEPAYNSFISL 226 (456) Q Consensus 149 l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~--~~~G~S~le~~~~~l~~~ 226 (456) ..-...+|....... -...-|..=..|++....... ......=|.--.+.+..+ ...|.|.++.+..-+.++ T Consensus 157 ~~~~~i~~~~~~~~~-----~~~~vyt~~~i~~~~~~~~~~-~~~~~~~~~~g~vPvv~~~n~~~g~s~~~~v~~liDa~ 230 (453) T protein:vir:73 157 KPLFAVYYGFDEEGN-----LSGTVYTLLETISITGKAGEV-KFGESTYNVYSDLPIVEYNFNEERQSIFEPVHSLINSY 230 (453) T ss_pred eeEEEEEEEEecCce-----EEEEEEeCCeEEEEEecCCce-EEccceeccCCceeEEEecCCCCCCcchhhHHHHHHHH Confidence 000111111100000 000011111222322111000 000111233333333322 346888888877766677 Q ss_pred HHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHH-H-HHHHHhcCCC-eEEecCCCc--eeEEecc Q lcl|NC_016762. 227 EKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFN-E-AARQLNRGND-VLLPTQGAT--VTQMVSA 301 (456) Q Consensus 227 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~-~-~~~~~~~~~~-~~lid~~d~--~~~~~~~ 301 (456) +.+.-..+..+-..+...+.+.. . .+. .+....+. . .+.......+ ......+.+ |-..+.+ T Consensus 231 ~~~~S~~~~~~~~~~~~~l~~~g---~---~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~ 297 (453) T protein:vir:73 231 NKVTSEKANDVEYFSDQYLVFLG---A---EVD-------EEDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDS 297 (453) T ss_pred HHHHHHHHHHHHHhccceeeeec---C---CCC-------chhhhcccccccccccccccccccccccCceeEEeeecCC Confidence 77655554433212211121110 0 000 01111110 0 0100011111 111112233 4445667 Q ss_pred cCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----HHHHHHHHHHHhhhhHHHHHHHHHHHHhc--Cc- Q lcl|NC_016762. 302 VSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----YHNARCQARRVQELTFEINDLFAHLMRIG--VV- 373 (456) Q Consensus 302 ~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~--~~- 373 (456) .+++...++.+.+.|...+++|-.-.-+ . | |++++ ++ .-...++.+ +..++..|++++.+++... .+ T Consensus 298 ~~~~~~~~~~l~~~I~~~s~~p~~~~~~-~-g--n~Sg~Al~~~~~~l~~ka~~~-~~~~~~~l~~~~~li~~~~~~~~~ 372 (453) T protein:vir:73 298 DVQTENLLNRLERSIFQFTMAANISDEN-F-G--NSSGVALAYKLQAMSNLALSF-QRKFQSALNRRYSLWSSLSTNASN 372 (453) T ss_pred HHHHHHHHHHHHHHHHHHhCCcccCccc-c-c--CccHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhccCC Confidence 7899999999999999999999632211 1 2 23333 22 223344443 4467888999888775431 12 Q ss_pred C-CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCC--CCCcccCC-CCCCCCCc Q lcl|NC_016762. 374 P-LKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGG--DPLPDTEP-EDEDAART 449 (456) Q Consensus 374 ~-~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~--~~~~~~~~-~d~~~~~~ 449 (456) + ...+++|.|+|-...++++.|++..+.+- +++.+-+.+.++....... .....+.. ........ T Consensus 373 ~~~~~~i~v~f~~~~p~~~~~~a~~~~k~~g-----------iis~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~ 441 (453) T protein:vir:73 373 KDAWKDIEYTFTRNEPKDIKEQAETANILKG-----------ITSEETALSVISVIPDVQAEMEKIKKKKLLQLSLTRTS 441 (453) T ss_pred ccccccceEEeCCCCCCCHHHHHHHHHHHhc-----------cCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhc Confidence 1 23478999999999999999988655542 3333322222211000000 00000000 00000000 Q ss_pred CCCCCCC Q lcl|NC_016762. 450 DPTGEQQ 456 (456) Q Consensus 450 d~~~~~e 456 (456) .+.+.++ T Consensus 442 ~~~~~~~ 448 (453) T protein:vir:73 442 NLVRMKQ 448 (453) T ss_pred cCCcchh Confidence 0111111 No 180 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=98.38 E-value=9.6e-07 Score=53.58 Aligned_cols=395 Identities=8% Similarity=-0.017 Sum_probs=168.9 Q ss_pred CCchhHH---HHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhC Q lcl|NC_016762. 1 MTDKLDL---AVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKT 77 (456) Q Consensus 1 ~~~~~~~---~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~ 77 (456) ...+..+ .-.|..+.........|...--.+-..+.+..............-+ .+.+++.||+..+.=++.+ T Consensus 25 ~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki-----~~n~~~~Iv~~~~~~l~g~ 99 (468) T protein:vir:96 25 ETQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWRM-----YTNYHQNLVDQKVAYAVAN 99 (468) T ss_pred cCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccccccccccccccccccc-----ccchHHHHHHHHHhhhccC Confidence 1111111 1112111111111111111000000000000000000000000111 3689999999999999999 Q ss_pred CCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcc---ccccC-------Cc Q lcl|NC_016762. 78 NPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWD---RPARG-------KL 146 (456) Q Consensus 78 ~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~---~Pl~~-------~~ 146 (456) .+++...++.. .+.|...++. ++...+.++.+....+|.+++++.++ +++..- .|... .. T Consensus 100 p~~~~~~d~~~--------~~~l~~~~~n-~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~~~~~~ 170 (468) T protein:vir:96 100 PVTYGTEDEKS--------LKTIQEVLNH-KWDDKLVDILTAASNKGVEWIQPYVDEQGEFKTFRVPAEQAIPIWTNKER 170 (468) T ss_pred CceeccCChHH--------HHHHHHHHhc-CHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEcccceEEEEcCCCC Confidence 99986543221 1234444443 67778888889999999999888774 333211 22210 01 Q ss_pred CceeEEEEeccccCChhhhhccccccccCCce---eEEEeecc--------cCCccc---cceeeehhhhheecC--CcC Q lcl|NC_016762. 147 NGLAKVTPAWAGCLKPKSFDEKPDSETYGQPT---MWEYTEAS--------QAGRPG---LVRDIHPDRVFILGD--WTG 210 (456) Q Consensus 147 ~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~---~y~i~~~~--------~~g~~~---~~~~IH~SRli~~~~--~~~ 210 (456) +.+....-+|... +-..-.++.|. +|...... ..+... ....-|.--.+.+.. ... T Consensus 171 ~~~~~~ir~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~ 242 (468) T protein:vir:96 171 DELKAFIRLYELD--------GGERVEYWTANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIPFKNNP 242 (468) T ss_pred CceEEEEEEEEec--------CceEEEEEeCCeEEEEEEcCCceeecccccccccccceeeccccccCCcccEEEecCCC Confidence 1111111111110 00000111111 11111000 000000 000112222222222 134 Q ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEec Q lcl|NC_016762. 211 DAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPT 290 (456) Q Consensus 211 ~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid 290 (456) .|.|.++.+.+-+.+++.+.-..+..+-..+...+.++ ++ .+........ .+ .....+.++ T Consensus 243 ~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~--------g~---~~~~~~~~~~-------~~-~~~~~i~~~ 303 (468) T protein:vir:96 243 QEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLK--------GY---EGEDLEEFMY-------NL-KYYKAINVD 303 (468) T ss_pred CCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeee--------cC---Cccccchhhh-------hh-hcCceEEec Confidence 58888888777777777665554443322222222111 10 0111111111 11 222334444 Q ss_pred CC--CceeE--EecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HH----HHHHHHHHHHHhhhhHHHH Q lcl|NC_016762. 291 QG--ATVTQ--MVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QK----YHNARCQARRVQELTFEIN 361 (456) Q Consensus 291 ~~--d~~~~--~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~----nyyd~I~~~Qe~~lrp~L~ 361 (456) .+ .+.+. .+.+..+....++.+.+.+...+++|-.- ++ .-+| |.++. ++ .-...+..+ +..++..|+ T Consensus 304 ~d~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~-~~-~~~~-n~Sg~Alk~~~~~l~~k~~~k-~~~~~~~l~ 379 (468) T protein:vir:96 304 GDGSGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQ-QD-KFGN-SPSGIALKFMYSNLDLKANKL-KNKTLTALQ 379 (468) T ss_pred CCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccc-cc-cccc-chHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Confidence 32 33444 45566789999999999999999999632 22 1122 33443 32 233444444 456899999 Q ss_pred HHHHHHHHhcCcC-CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCC--CCC-- Q lcl|NC_016762. 362 DLFAHLMRIGVVP-LKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGG--DPL-- 436 (456) Q Consensus 362 ~l~~~l~~s~~~~-~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~--~~~-- 436 (456) +++++++...... ...++.|.|++-...+++|.|++.++ + | ++|.+.+.+.+........ ... T Consensus 380 ~~~~li~~~~g~~~d~~~i~i~f~~~~p~d~~e~a~~~~~-~---------g--~iS~et~i~~l~~v~D~~~E~~ri~~ 447 (468) T protein:vir:96 380 ELLQYIIDFYKLSIKVQDVEITFNFNVMVNELEQSQIGVN-S---------Q--YLSKETVVTNHPWVDDPVAEMERIDQ 447 (468) T ss_pred HHHHHHHHHhCCCcccceeeEEecCCCCcCHHHHHHHHHh-c---------C--CCchHHHHHhCCCCCCHHHHHHHHHH Confidence 9999887654333 34578999999999999998876532 2 3 3333333322211000000 000 Q ss_pred -----cccCCCCCCCCCcCCC Q lcl|NC_016762. 437 -----PDTEPEDEDAARTDPT 452 (456) Q Consensus 437 -----~~~~~~d~~~~~~d~~ 452 (456) ...++.-......+|+ T Consensus 448 E~~~~~~~~~~~~~~~~~~~~ 468 (468) T protein:vir:96 448 EELALPSIEEGLNGKENNEPT 468 (468) T ss_pred HHHHHHHHhhccCCCCCCCCC Confidence 0000000001111122 No 181 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=98.37 E-value=9.8e-07 Score=53.52 Aligned_cols=394 Identities=12% Similarity=-0.011 Sum_probs=165.1 Q ss_pred CCchhHHHHhHHHHH-----HHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHh Q lcl|NC_016762. 1 MTDKLDLAVNHAMSS-----AIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCW 75 (456) Q Consensus 1 ~~~~~~~~~~~a~~~-----~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~t 75 (456) |++--.-.++..++. ..-+..+.|.. | +++ . .. .+.. ...++......+.++++|||..++=+. T Consensus 1 ~~~~~~~~i~~l~~~~~~~~~r~~~l~~Yy~-----G-~~~-i-~~--~~~~-~~~~~~~~k~~~n~~~~ivd~~~~~l~ 69 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQRLSSWHCCIEGYYE-----G-SNR-V-RD--LGVA-IPPELQRVQTVVSWPGIAVDALEERLD 69 (441) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHh-----c-CCc-c-hh--cCcc-cchhhhhhhhhcchHHHHHHHHHhhhc Confidence 555433222222221 11112222221 1 111 0 00 1111 122333444567889999999998886 Q ss_pred hCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccccccC---------C Q lcl|NC_016762. 76 KTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDRPARG---------K 145 (456) Q Consensus 76 R~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~Pl~~---------~ 145 (456) -.||+. ++. ..+.+.+++.++-..+.++.+...+||.|++++..+ ||.+.-.++.. . T Consensus 70 ~~g~~~--~d~-----------~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~ 136 (441) T protein:vir:80 70 WLGWTN--GDG-----------YGLDGVYAANRLATASCDVHLDALIFGLSFVAIIPHGDGTVSVRPQSPKNCTGKFSAD 136 (441) T ss_pred cccccC--CCh-----------HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCCCceEEEEEccceEEEEEeCC Confidence 666652 221 125556667788888999999999999998888764 34332222210 1 Q ss_pred cCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhh---heecC----CcCCCcchHHH Q lcl|NC_016762. 146 LNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRV---FILGD----WTGDAIGFLEP 218 (456) Q Consensus 146 ~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRl---i~~~~----~~~~G~S~le~ 218 (456) .+.+.....+|...... . -...-|.....|++.....++-......-|.--. +.|.. ...+|.|.+.. T Consensus 137 ~~~~~~~~~~~~~~~~~-~----~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~ 211 (441) T protein:vir:80 137 GSRLDAGLVVQQTCDPE-V----VEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITR 211 (441) T ss_pred CCceeEEEEEEEEecCc-e----EEEEEEecCeEEEEEEcCCcceeeccccccCCCceeEEEeeccccCCccCCcccchh Confidence 11111111111110000 0 0001122222222211110000000011122111 22211 12368886643 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC-CeEEecCC---C Q lcl|NC_016762. 219 -AYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGN-DVLLPTQG---A 293 (456) Q Consensus 219 -~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~lid~~---d 293 (456) +.+-+.+++.+.-..+...-..+..++.+. |...++..... ++... ....++.+ + T Consensus 212 ~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~--------------G~~~~~~~~~~------~~~~~~~i~~~~~~~~~~ 271 (441) T protein:vir:80 212 SIRAYTDEAVRTLLGQSVNRDFYAYPQRWVT--------------GVSADEFSQPG------WVLSMASVWAVDKDDDGD 271 (441) T ss_pred hHHHHHHHHHHHHHHHHHHHHhhcCceeeee--------------cCCccccccch------hhhcccccccCCCCCCCC Confidence 433333444443222211111112222211 11111110000 00011 11222222 1 Q ss_pred ceeEEecccCCHHHH---HHHHHHHHHhhhcCCeEEeeccCCCcccchHHHH----HHHHHHHHHHHhhhhHHHHHHHHH Q lcl|NC_016762. 294 TVTQMVSAVSDPGPT---YNVNLQTAAAGVDIPTKILVGMQTGERASSEDQK----YHNARCQARRVQELTFEINDLFAH 366 (456) Q Consensus 294 ~~~~~~~~~sgl~~~---~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D~~----nyyd~I~~~Qe~~lrp~L~~l~~~ 366 (456) ..+..+.+-+++... +......+|+.++||... ||.++...+|..=++ ..-..+ .+++..+++.|++++.+ T Consensus 272 ~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~-~g~~~~~~~Sg~Al~~~~~~l~~k~-~~~~~~f~~~l~~~~~l 349 (441) T protein:vir:80 272 TPNVGSFPVNSPTPYSDQMRLLAQLTAGEAAVPERY-FGFITSNPPSGEALAAEESRLVKRA-ERRQTSFGQGWLSVGFL 349 (441) T ss_pred cceeEecCccchHHHHHHHHHHHHHHhcccCCCHHH-hccCCCcchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH Confidence 233444444555554 555588999999999754 466654333322222 223333 44556789999999988 Q ss_pred HHHhcC--cCCC---CceEEEeCCCCCCCHHHHHHHHHHHHHHH-------HHHHHcCCcCcCHHHHHHHhcccCCCCCC Q lcl|NC_016762. 367 LMRIGV--VPLK---AEFTAIWDDLTVPTKAERLANSKTMSEIN-------SAAIGTGEPVFTAEEIREEAGYDPLQGGD 434 (456) Q Consensus 367 l~~s~~--~~~~---~d~~~~f~pL~~~seke~Aei~~~~A~a~-------~~~~~~g~~~i~~~E~R~~~~~~~~~~~~ 434 (456) ++.... +..+ .++++.|+|-...|.+|.|+...+.+++. ...-..| .+++|+.++.....- . T Consensus 350 ~~~~~~~~~~~~~~~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~---~~~~e~~~~~~e~~e-~-- 423 (441) T protein:vir:80 350 AAKALDSRVDEADFFGDVGLRWRDASTPTRAATADAVTKLVGAGILPADSRTVLEMLG---LDDVQVEAVMRHRAE-S-- 423 (441) T ss_pred HHHHhcCCCcccccceeeeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHHHhCC---CCHHHHHHHHHHHHH-H-- Confidence 766432 2111 36789999999999999988776665531 1111112 334444432110000 0 Q ss_pred CCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 435 PLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 435 ~~~~~~~~d~~~~~~d~~~~~e 456 (456) .+.-.......+..-+| T Consensus 424 -----~~~~~~~~~~~~~~~~~ 440 (441) T protein:vir:80 424 -----SDPLAVLAGAISRQTNE 440 (441) T ss_pred -----HHHHHHHhhhhhccccc Confidence 00000000001111111 No 182 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=98.32 E-value=7e-07 Score=54.34 Aligned_cols=264 Identities=9% Similarity=0.016 Sum_probs=128.3 Q ss_pred hhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHh-hcccCceEEEEEecCCCCccccccC Q lcl|NC_016762. 66 AVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRR-RLVGRYSGLLLHIRDSQPWDRPARG 144 (456) Q Consensus 66 iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~-~r~~Ggs~i~i~i~D~~~~~~Pl~~ 144 (456) |--.|. .+....+..+ ..+...+...=....-+..|.+.+.+ -.++|-|++++.- +.. T Consensus 1 ia~l~~--------~~~~~~~~~~----~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r-~~~-------- 59 (278) T protein:vir:78 1 MASLPL--------KMYEDYKVVN----TEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIER-DIY-------- 59 (278) T ss_pred Ccccee--------EEEecCcccc----cHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEE-CCC-------- Confidence 111111 1111111111 11111111110011223345555454 4455666665543 211 Q ss_pred CcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHH Q lcl|NC_016762. 145 KLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAY 220 (456) Q Consensus 145 ~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~ 220 (456) +.+..+.|+....+++.. + +.|.|.+|++... + +..+.+.++.||||... ...|.|.+..+. T Consensus 60 --G~~~~l~~l~~~~v~v~~---~----~~~~~~~y~~~~~--~---g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~ 125 (278) T protein:vir:78 60 --HQPSKLFLLNPDVVEMLI---E----NQSRELYYSIHAA--T---GNKLIVHNMDMLHFKHIVASNMVQGISPIDVLK 125 (278) T ss_pred --CcEEEEEEECCceeEEEE---c----CCCceEEEEEEcC--C---ceEEEEccccEEEECCCCCCCCeeeccHHHHHH Confidence 123344444433333311 1 2345667777521 2 23467889999988533 235999998888 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC-CeEEecCCCceeEEe Q lcl|NC_016762. 221 NSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGN-DVLLPTQGATVTQMV 299 (456) Q Consensus 221 ~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~lid~~d~~~~~~ 299 (456) ..+.....+... .+..+......+. +. .+.-.++..+++.+.++...++. ..++++.+.++++++ T Consensus 126 ~~i~~~~~~~~~---~~~~~~~~~~~i~---~~--------~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~ 191 (278) T protein:vir:78 126 NTTDFDNAVRTF---NLTEMQKPDSFML---KY--------GSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLP 191 (278) T ss_pred HHHHHHHHHHHH---HHHHhcCCCcEEE---Ee--------CCCCCHHHHHHHHHHHHHHhccCCCceecCCCceEEEcc Confidence 766543333221 1122211110000 00 01111233444444444444443 456666777899988 Q ss_pred cccCCHH--HHHHHHHHHHHhhhcCCeEEeeccCCCcc-cchHH-HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC Q lcl|NC_016762. 300 SAVSDPG--PTYNVNLQTAAAGVDIPTKILVGMQTGER-ASSED-QKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL 375 (456) Q Consensus 300 ~~~sgl~--~~~~~~~~~~aaas~IP~t~L~G~sp~Gl-nst~D-~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~ 375 (456) .+..+.+ +......+.||.+.|||- .|+|...++= +..++ .+.||..+ |.|.++.+-+.|-+.-+.+. T Consensus 192 ~~~~d~~~~e~~~~~~~~Ia~~fgVpp-~~lg~~~~~~~sn~~~~~~~~~~~~-------l~P~~~~i~~~ln~~L~~~~ 263 (278) T protein:vir:78 192 KKYVSEDIVASENLTRERVANVFQLPS-VFLNARSNTNFAKNEELNRFYLQHT-------LLPIVKQYEEEFNRKLLTKT 263 (278) T ss_pred CChhHHHHHHHHHHHHHHHHHHhCCCH-HHhCCCCCCCcccHHHHHHHHHHHH-------HHHHHHHHHHHHHhhcCChh Confidence 7776554 556678889999999995 4667655432 22233 55677654 78888888777655434321 Q ss_pred --CC--ceEEEeCCC Q lcl|NC_016762. 376 --KA--EFTAIWDDL 386 (456) Q Consensus 376 --~~--d~~~~f~pL 386 (456) .. .|.|..+-| T Consensus 264 e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 264 DREKIGILNLTLNLI 278 (278) T ss_pred HhcCCceEEEecccC Confidence 11 355555555 No 183 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=98.26 E-value=2e-06 Score=51.88 Aligned_cols=409 Identities=13% Similarity=0.029 Sum_probs=178.8 Q ss_pred CCchhHHHHhHHHHHHHHH-HHHHHhhhhhccCcccchhhhhccCcccCCH----HHHHHHHhcCchhhhhhccchhHHh Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIAR-ARMSLLNQGIGHDAKRPQAWCEYGFPQEITF----NDLYTMYRRGGIAHGAVEKIVTTCW 75 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~-~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~----~~l~~~Y~~~~l~r~iVd~~aed~t 75 (456) ||++..-- +-+-++.. ...+++.+ + ..+.....-..+-+ +-++.|=++..-+..++++.-.-.+ T Consensus 1 ~~~~~~~~---~p~~~~g~~~~~~~~~~----~----~~~~~~e~~~~lr~~~~~~ly~~m~e~D~~i~s~l~~rk~av~ 69 (469) T protein:vir:10 1 MTERVKTA---APVSEAGYVFGSGVVDG----W----TVWDPFEQTPELQWPQSVAVYSRMDNEDSRVTSLLEAISLPIR 69 (469) T ss_pred CCCcccCC---CCccchhhhhhcccccc----h----hhccccccccccccccchHHHHHHHhhChHHHHHHHHHHHHHh Confidence 77664321 11101111 00011110 0 11111111111111 1223333467888888888887777 Q ss_pred hCCCEEecCCCcchhhhhHHHHHHHHHHHH-------------HhhHHHHHHHHHHhhcccCceEEEEEec-CCCCcccc Q lcl|NC_016762. 76 KTNPQVIEGDDQDRSKDETEWERKNKPLIA-------------GGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDRP 141 (456) Q Consensus 76 R~~~~i~~~~~~d~~~~~~~~e~~i~~~~~-------------~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~P 141 (456) +.-|+|.-++++++ ....+...+...+. +......|.+.+-....||.|++=+.-+ +++.. T Consensus 70 ~~~w~v~p~~~~~e--~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~--- 144 (469) T protein:vir:10 70 STPWRIRANGASDE--VTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSP--- 144 (469) T ss_pred cCCceEecCCCCHH--HHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccC--- Confidence 77778864443322 11112222222211 1123455666666677899998855432 21110 Q ss_pred ccCCcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecc------cCCccccceeeehhhhheec----CCcCC Q lcl|NC_016762. 142 ARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEAS------QAGRPGLVRDIHPDRVFILG----DWTGD 211 (456) Q Consensus 142 l~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~------~~g~~~~~~~IH~SRli~~~----~~~~~ 211 (456) .+.-.+..|.++.. -+...|..++ +.+ ...++-.... .......+..|.+.+++.+. ..+.+ T Consensus 145 --dG~~~~~~l~~rp~--~~i~~~~~~~---~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~lp~~k~i~~~~~~~~g~p~ 216 (469) T protein:vir:10 145 --DGRFWLRKLAPRPQ--WTISKFNVAP---DGG-LESIEQIAPPARTRGSLYVANIAPPEIPVNRLVVYTRNKRPGQWQ 216 (469) T ss_pred --CCceeeeeeeecCc--ccceeeeecc---CCc-eeeeeecCcccccccccccCCCCccccccCcEEEEEecCCCCCcc Confidence 01111233332221 1111122111 111 1111100000 00001124556677766553 34567 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCC-eEEec Q lcl|NC_016762. 212 AIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGND-VLLPT 290 (456) Q Consensus 212 G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~lid 290 (456) |.|++..||...+--.....-++.-.-+........++. .++ .++-.+.+.++++.+..+.+ .+++. T Consensus 217 g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~-----------~~a-~~~ek~~l~~a~~~~~~g~~a~~iip 284 (469) T protein:vir:10 217 GKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTAS-----------SAT-DEDEVRKMAALARSVRGGINAGVGLA 284 (469) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecC-----------CCC-CHHHHHHHHHHHHHHhcCCceEEEcc Confidence 999999998754321112211221111111100000000 011 23445566666776765544 45677 Q ss_pred CCCceeEEecccCC--HHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH--HHHHHHHHHHHHHhhhhHHHH-HHHH Q lcl|NC_016762. 291 QGATVTQMVSAVSD--PGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED--QKYHNARCQARRVQELTFEIN-DLFA 365 (456) Q Consensus 291 ~~d~~~~~~~~~sg--l~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D--~~nyyd~I~~~Qe~~lrp~L~-~l~~ 365 (456) .+.+++-++.+-++ ...+++..-.+||-+.--.. |..++.||-.|.++ .....+.+.+... .+...|. .|+. T Consensus 285 ~~~~ie~~ea~g~~~~~~~li~~~d~~Isk~iLG~t--lTs~~~gGS~a~~~vh~ev~~d~~~sDa~-~i~~tln~~li~ 361 (469) T protein:vir:10 285 QGQILELLGVSGNLPDIRRAIEGHDRSIALSGLAHF--LNLDGKGGSYALASVLEDPFTQAVHAYAT-SICRIANQHIIE 361 (469) T ss_pred CCceEEEeecCCCchHHHHHHHHHHHHHHHHHhccc--ccccCccchhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHH Confidence 78888887765332 24444444455655441111 22232233223333 3456666666654 4667775 5888 Q ss_pred HHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCc---CcCHHHHHHHhcccCCCCCCCCcccCCC Q lcl|NC_016762. 366 HLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEP---VFTAEEIREEAGYDPLQGGDPLPDTEPE 442 (456) Q Consensus 366 ~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~---~i~~~E~R~~~~~~~~~~~~~~~~~~~~ 442 (456) -|+...+++...-..|+|...-. ..+..|++++.++++|.. .++.+.+|+..+.....+..+....... T Consensus 362 ~l~~lN~g~~~~~P~~~~~~~e~--------~~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gip~~~~~~~~~~~~~~ 433 (469) T protein:vir:10 362 DLVDINFGVDTPAPVLTFDPIGS--------RQDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNLPSELNDTPSAEPEEP 433 (469) T ss_pred HHHHhcCCCCCCccEEEecCCCC--------cHHHHHHHHHHHHhcCCccCccccHHHHHHHhCCCCCCCCcccccchhc Confidence 88888887654445677754321 123468888889999842 2566789998887655443332111000 Q ss_pred CCCCC-----------------CcCCCCCCC Q lcl|NC_016762. 443 DEDAA-----------------RTDPTGEQQ 456 (456) Q Consensus 443 d~~~~-----------------~~d~~~~~e 456 (456) .-.+. .++|..+.. T Consensus 434 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 464 (469) T protein:vir:10 434 AAVPNQSAAPARTRSSGNADARARAPKADQG 464 (469) T ss_pred ccCCCCCccccccCCCCCcccccccCCChHH Confidence 00000 000100000 No 184 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=98.21 E-value=2.5e-06 Score=51.26 Aligned_cols=413 Identities=13% Similarity=0.097 Sum_probs=184.9 Q ss_pred CCc-----hhHHHHhHHHHHH-HHHHHHHHhhhhhccC-cccc---------hhh-hhccCcccCCHHHHHHHHhc---- Q lcl|NC_016762. 1 MTD-----KLDLAVNHAMSSA-IARARMSLLNQGIGHD-AKRP---------QAW-CEYGFPQEITFNDLYTMYRR---- 59 (456) Q Consensus 1 ~~~-----~~~~~~~~a~~~~-~~~~~d~~~n~~~~~g-t~~~---------~~~-~~~~~~~~~~~~~l~~~Y~~---- 59 (456) |+. |++-+++ .. +..-+.+|.-.-..-| +..+ ... ++|.-....+-.+|...||+ T Consensus 1 ~~~~~~w~~~de~~~----~~~~~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~~~n~~eLI~~YR~ma~~ 76 (533) T protein:vir:58 1 MPSLEKYKKLNEAVN----FTNFLSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYGGIEFNRFFLYDMYDRMDYT 76 (533) T ss_pred CCCcchhhhhhHHHH----HHHhhchhhcccCccCCCCCccccCCCCcchhhhhhhhhhhccccccHHHHHHHHHHhhcc Confidence 543 2333222 11 1111223321111111 1111 111 22211244566677777764 Q ss_pred CchhhhhhccchhHHh--hCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCC Q lcl|NC_016762. 60 GGIAHGAVEKIVTTCW--KTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQP 137 (456) Q Consensus 60 ~~l~r~iVd~~aed~t--R~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~ 137 (456) |+.+..+|+.++.+|+ .+.-.+++.+ -+..+...++..+|..+ +++..+..+..|.--+.|.. +..... ++ T Consensus 77 ~pEVd~AideIvneaiv~d~~~~pV~v~-l~~~e~s~~iK~kI~~l---ldf~~~~~~~fR~WYVDGri--y~Hkii-k~ 149 (533) T protein:vir:58 77 DPLISTVLDIIADECTIPNENGNIVDVV-TKDIELAKAILSYLDYV---INIEKNAYPIIRNMIKYGDM--FLHILE-KG 149 (533) T ss_pred CcchhhHHHhhhceeeEecCCCceeEee-cccccccHHHHHHHHHH---hcchhhhhHHHHhhhhccee--EEEecc-CC Confidence 6788999999999984 2222222222 12222333333344433 44566666655544444433 333311 11 Q ss_pred ccccccCCcCceeEEEEeccccCCh-hhhhccccccccCCceeEEEeeccc-CCccccceeeehhhhheecCC-----cC Q lcl|NC_016762. 138 WDRPARGKLNGLAKVTPAWAGCLKP-KSFDEKPDSETYGQPTMWEYTEASQ-AGRPGLVRDIHPDRVFILGDW-----TG 210 (456) Q Consensus 138 ~~~Pl~~~~~~l~~i~~~~~~~~~~-~~~~~Dp~s~~yg~P~~y~i~~~~~-~g~~~~~~~IH~SRli~~~~~-----~~ 210 (456) | +.++..|.+|.|+- +.. .+..++ -+||-+++... .+++..+.+|+++.++.+..+ .. T Consensus 150 ---~-k~GI~elr~lDPr~---i~~vr~~~t~--------~eyyvy~~~~~~~~s~~~~~kI~~daI~y~~SGl~d~~~~ 214 (533) T protein:vir:58 150 ---S-DGTIEKFQVVSPYI---FSKRYNPETD--------TWYYVITDVYRNVVSGYFNEDIPEEDVIHFSHKIDTNFFP 214 (533) T ss_pred ---c-ccchhhheecCCee---eEEEEeeccc--------eEEEeecccccccccCccccccchhheeeeeeccccCCCC Confidence 1 33444445555432 211 111122 13444432221 123444688999998877533 23 Q ss_pred CCcchHHHH---HHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHH---HhcCC Q lcl|NC_016762. 211 DAIGFLEPA---YNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQ---LNRGN 284 (456) Q Consensus 211 ~G~S~le~~---~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~---~~~~~ 284 (456) .++|.|.++ +|-|.-++-+. .+|+.+...=...| -+|+-+|....+ ++-.+.++.+.+. +..++ T Consensus 215 ~iisyLhkAiKp~NQLkmiEDAl-----VIYRisRAPeRRvF--YIDVGNlpk~KA---eqYl~~im~k~kNklvYDa~T 284 (533) T protein:vir:58 215 YGRSYLESARAIWNQLRLMEDAL-----MLYRVVRSVDRRVF--YVDVGNVPPDKI---NEYLTNIAMQYKRDYWVRNNQ 284 (533) T ss_pred ceehhhhHHHHHHHHHHHHHHHH-----HHHhhcCChhheEE--EEeecCCCccCH---HHHHHHHHHhcccceEEeccC Confidence 467999887 66665544332 44543211000000 123333332111 2222222211110 11112 Q ss_pred CeEEec---------------------CCCceeEEec-ccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccch--HH Q lcl|NC_016762. 285 DVLLPT---------------------QGATVTQMVS-AVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASS--ED 340 (456) Q Consensus 285 ~~~lid---------------------~~d~~~~~~~-~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst--~D 340 (456) |-+.-| ++-+++++.- +++-+ +=+..|...+=-|.++|++||=..+.-|-++. -| T Consensus 285 Gev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~TLpGg~lgem-eDV~YF~kkLy~ALnVP~sRl~~e~~fgr~~eItRD 363 (533) T protein:vir:58 285 NQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDILQGSKVDLA-EDVEYMLNRLISALKVPKAFIGYEGDVNAKNTLATQ 363 (533) T ss_pred CeEeeccchhhhhhhHhhhcccccCCCccceeeecCCCCCCcH-HHHHHHHHHHHHHhCCCeeecCCCCCCccchhhhHH Confidence 211000 1224444432 34444 44677888999999999999955543332221 25 Q ss_pred HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHH--------HcC Q lcl|NC_016762. 341 QKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAI--------GTG 412 (456) Q Consensus 341 ~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~--------~~g 412 (456) .-.|...|.++|.. +.+.+++ .|+.-+++ .+++|.|.|+-=..-+|-..+|+...+..+.+..- .-. T Consensus 364 EiKF~KFI~rLR~r-F~~ll~~---qLilk~ii-t~eew~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dpyvgk~yi~k~ 438 (533) T protein:vir:58 364 DIKFNNTIKRIQGF-FVEELER---MVRMNKEF-ADQDFRLVMNRSNSIVEGERFAVIEQRIGIAERLKGWVREDWIYSN 438 (533) T ss_pred HHHHHHHHHHHHHH-HHHHHhc---ccccccCc-chhheeeeeeccchHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHH Confidence 55699999999875 3444443 23333333 45789999999999999999999888877655421 111 Q ss_pred CcCcCHHHHHHH------hcccCCCCC-CCCcccCCCCCCCCCcCCCCC-----CC Q lcl|NC_016762. 413 EPVFTAEEIREE------AGYDPLQGG-DPLPDTEPEDEDAARTDPTGE-----QQ 456 (456) Q Consensus 413 ~~~i~~~E~R~~------~~~~~~~~~-~~~~~~~~~d~~~~~~d~~~~-----~e 456 (456) .--.+ +|+.+. -...++... +..++..+.+..+...+|... .. T Consensus 439 ILr~t-dei~~q~e~ie~E~~~~~~~~~~~~~e~~~~~~~~~~~~p~~~~~~~~~~ 493 (533) T protein:vir:58 439 ILQIP-YDLKPQEEVAEAAGGGGLFDTGGFGEETTPADFLGERGSPIESPRGRTEF 493 (533) T ss_pred HhcCC-hhhhHHHHHHHHhhcCCCCCCCCcccccCCcccCccccCcccCCCChhhH Confidence 11223 233221 111222211 111122222222222221111 00 No 185 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=98.19 E-value=1e-06 Score=53.42 Aligned_cols=321 Identities=8% Similarity=-0.007 Sum_probs=150.4 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHh-hhhhccCcccch-----hh-hh--ccCcccCCHHHHHHHHhcCchhhhhhccch Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLL-NQGIGHDAKRPQ-----AW-CE--YGFPQEITFNDLYTMYRRGGIAHGAVEKIV 71 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~-n~~~~~gt~~~~-----~~-~~--~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~a 71 (456) ||+++.......... ...+-+|- .+..=+ +.++- .| +. -+|-..+++..|..+++.|.....+|.... T Consensus 1 ~~~~~~~~~~~~~~~--~~~~~~~~~~p~~~~-~~~~~~~~~~~~~~~~~~~~epp~~~~~La~l~~~n~~h~~~i~~k~ 77 (348) T protein:vir:26 1 MTEQLIHSHTTDGTE--SKSVYSFDPNPEPVD-TNSWMTRYCELFYNDFDDYWEPPISLKGLAEIANANGYHGSLLKARA 77 (348) T ss_pred CCccccchhhccccC--CceEEEecCCCeeec-CcchHHHHHHHHhcCCCccccCCCCHHHHHHHHhhhhhhhhhHhhhh Confidence 998776543321111 00111110 000011 11111 11 11 145567889999999999988888876665 Q ss_pred hHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCceeE Q lcl|NC_016762. 72 TTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLAK 151 (456) Q Consensus 72 ed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~ 151 (456) ..-.+ +++ +.. .. .+..+++ ++..-.++|-+++.+.- ++ .+.+.. T Consensus 78 N~l~~-~~~------Pn~--~~-----------t~~~f~~----~~~d~ll~Gnay~~~~r-n~----------~G~~~~ 122 (348) T protein:vir:26 78 NYVAG-RFM------NGG--GL-----------PMYKMNS----ACWDYFGLGMSAFVKIR-SY----------LKNVIA 122 (348) T ss_pred hHHhh-ccc------CCC--CC-----------CHHHHHH----HHHHHHhcCCeEEEEEE-cC----------CCcEEE Confidence 44332 221 100 00 1111122 22223466877776543 21 111233 Q ss_pred EEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHHH Q lcl|NC_016762. 152 VTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISLE 227 (456) Q Consensus 152 i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~~ 227 (456) +.|+-...+.+. .| ..+|++.. +| ..+.+.+..|+||... ...|+|-+..+...+.- . T Consensus 123 L~~l~~~~v~~~---~d--------~~~~~~~~---~g---~~~~f~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l-~ 184 (348) T protein:vir:26 123 LEPLPMVHMRKR---KN--------GDFVQLLR---NN---EQKVFKAKDVIFIPQYDPQQQIYGLPDYLGSIQSSLL-N 184 (348) T ss_pred EEEecCceeEee---ec--------CcEEEEEe---cC---eEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHH-H Confidence 333322222221 12 12454431 22 2356778888888643 33589988887776543 2 Q ss_pred HHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHh--cCCCeEEec----CCCceeEEecc Q lcl|NC_016762. 228 KVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLN--RGNDVLLPT----QGATVTQMVSA 301 (456) Q Consensus 228 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~lid----~~d~~~~~~~~ 301 (456) ..+......+|+|....-.+.. +++ +.-.++..+++.+.++... .|.+.+++. +++.++...++ T Consensus 185 ~~a~~~~~~~f~NGa~pg~Il~-----~~~-----~~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~Gi~~~pis 254 (348) T protein:vir:26 185 RDATLFRRRYYLNGAHMGFIFY-----ATD-----PNLSEADEKALKEKIASSKGIGNFRSMFVNIPNGKEKGIQLIPVG 254 (348) T ss_pred HHHHHHHHHHHhccCCCceEEE-----ecC-----CCCCHHHHHHHHHHHHHhcCcccccceeEEcCCCCccceeEEEcc Confidence 3333444455666433221110 110 0112344455555554432 222333432 12234444444 Q ss_pred cCCH----HHHHHHHHHHHHhhhcCCeEEeeccCCC---cccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCc Q lcl|NC_016762. 302 VSDP----GPTYNVNLQTAAAGVDIPTKILVGMQTG---ERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVV 373 (456) Q Consensus 302 ~sgl----~~~~~~~~~~~aaas~IP~t~L~G~sp~---Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~ 373 (456) .+.- -++-....+.||++-+||-. |+|..+. +++.-+ -.+.||. +.|.|.++++-+.|-+.-.. T Consensus 255 ~~~~d~qf~e~k~~t~~dIa~af~VPp~-llGi~~~~~~~~sn~e~~~~~f~~-------~~l~P~~~~ie~~ln~~l~~ 326 (348) T protein:vir:26 255 DIATKDEFERIKNITAQDIFVGHRFPAG-MGGMLPQQGANVPDPLKVSQVYDF-------YEVIPVCKRFMDAVNNDPEI 326 (348) T ss_pred CChhHHHHHHHHHhhHHHHHHHhCCCHH-HccccCCCCCccccHHHHHHHHHH-------HHHHHHHHHHHHHHhhhhCC Confidence 4433 33334456679999999985 7786543 343333 3456663 34788888877766544333 Q ss_pred CCCCceEEEeCCCCCCCHHHHHHH Q lcl|NC_016762. 374 PLKAEFTAIWDDLTVPTKAERLAN 397 (456) Q Consensus 374 ~~~~d~~~~f~pL~~~seke~Aei 397 (456) +....|.|+|+|...-+++.- + T Consensus 327 ~~~~~~~fdl~~~~e~~~~~a--~ 348 (348) T protein:vir:26 327 PDNLKLKFNLNPGVESANGSA--V 348 (348) T ss_pred CCccEEEEecCcccccchhhc--C Confidence 444456777777543332222 2 No 186 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=98.10 E-value=4.5e-06 Score=49.89 Aligned_cols=403 Identities=13% Similarity=0.047 Sum_probs=174.3 Q ss_pred CCchhHHHHhHHHHH--HHHHHHHHHhhhhhcc-Cc-ccchh-hhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHh Q lcl|NC_016762. 1 MTDKLDLAVNHAMSS--AIARARMSLLNQGIGH-DA-KRPQA-WCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCW 75 (456) Q Consensus 1 ~~~~~~~~~~~a~~~--~~~~~~d~~~n~~~~~-gt-~~~~~-~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~t 75 (456) =.+.++..+..-... .+.....-+.....|- +- .|... ...-+......... -..--.+.+++.||+..+.=++ T Consensus 2 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~-~~~ki~~n~~k~Iv~~~~~yl~ 80 (470) T protein:vir:10 2 ELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRS-ADNRIPSNFYQLLVDQEAGYVA 80 (470) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhccccccccccccc-CCcccccchHHHHHHhhhhhee Confidence 122333332211111 1111111111111111 00 01100 00000000000000 0001257889999999999888 Q ss_pred hCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecC-CCCcc---cccc------CC Q lcl|NC_016762. 76 KTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRD-SQPWD---RPAR------GK 145 (456) Q Consensus 76 R~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D-~~~~~---~Pl~------~~ 145 (456) -+.+++..+++... .++...+++ +....+.++.+....+|.+++++.++. ++.-. .|.. .. T Consensus 81 G~p~~~~~~d~~~~--------~~l~~~~~~-~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~~~~~~~p~~~~~v~d~~ 151 (470) T protein:vir:10 81 SVFPDIDVGKDADN--------KKIIDVLGD-DRALTLNGLLVDSSNAGRAWLHYWIDEDGNFRYGIIQPDQITPIYATT 151 (470) T ss_pred ccceeeecCchHHH--------HHHHHHHhh-hHHHHHHHHHHHHhhcCeeEEEEEecCCCceEEEEEcccceEEEEcCC Confidence 88888865543221 234444443 566777788888888999999888743 32111 1221 11 Q ss_pred -cCceeEEEEeccccCChhhhhccccc-------cccCCceeEEEeecccCCc-----------------cc--cceeee Q lcl|NC_016762. 146 -LNGLAKVTPAWAGCLKPKSFDEKPDS-------ETYGQPTMWEYTEASQAGR-----------------PG--LVRDIH 198 (456) Q Consensus 146 -~~~l~~i~~~~~~~~~~~~~~~Dp~s-------~~yg~P~~y~i~~~~~~g~-----------------~~--~~~~IH 198 (456) .+.+....=+|.. .|..+ .-|..=..|.......+.. .. ....-| T Consensus 152 ~~~~~~a~ir~y~~--------~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (470) T protein:vir:10 152 LDNKLLGILRSYKQ--------LDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKH 223 (470) T ss_pred CCCceEEEEEEEEe--------eecCCceEEEEEEEEcCCcEEEEEeecCcceecccccccccccccccccccccccccc Confidence 1112211111110 00000 0011001111110000000 00 000011 Q ss_pred h-hhh--heecCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHH Q lcl|NC_016762. 199 P-DRV--FILGDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNE 275 (456) Q Consensus 199 ~-SRl--i~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 275 (456) . .+| +.|. .+..|.|.++.+..-+.+++.+.-..+..+-..+-..+.++. . .+....+.... T Consensus 224 ~~g~vPvv~~~-nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g---~--------~~~~~~~~~~~--- 288 (470) T protein:vir:10 224 NFGRVPFIEFS-KNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTN---Y--------GGADLHQFMND--- 288 (470) T ss_pred CCCeeeEEEee-cCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeec---C--------Cccccchhhhh--- Confidence 1 111 1111 134588889888877777777766555544332222222211 0 01111111111 Q ss_pred HHHHHhcCCCeEEecC-----C--CceeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchHH-HHH-HHH Q lcl|NC_016762. 276 AARQLNRGNDVLLPTQ-----G--ATVTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSED-QKY-HNA 346 (456) Q Consensus 276 ~~~~~~~~~~~~lid~-----~--d~~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~D-~~n-yyd 346 (456) +... +...+.. + -+|-+...+..+....++.+.+.|.-.+++|-.-..+ .| |.++. ++. |.. T Consensus 289 ----~~~~-~~i~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~---~g-n~Sg~Alk~~~~~ 359 (470) T protein:vir:10 289 ----LRKY-KSIKINNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDPANFE---SS-NASGVAIKMLYSH 359 (470) T ss_pred ----hhhc-CeEeccCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCCCccc---cc-cchHHHHHHHHHH Confidence 1111 2222221 1 2355566777899999999999999999999643322 22 44443 332 322 Q ss_pred --HHHHHHHhhhhHHHHHHHHHHHHh-cCc-CCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHH--HHH-cCCcCcCHH Q lcl|NC_016762. 347 --RCQARRVQELTFEINDLFAHLMRI-GVV-PLKAEFTAIWDDLTVPTKAERLANSKTMSEINSA--AIG-TGEPVFTAE 419 (456) Q Consensus 347 --~I~~~Qe~~lrp~L~~l~~~l~~s-~~~-~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~--~~~-~g~~~i~~~ 419 (456) .-.+..+..+++.|++++++++.. +.. ....+++|.|++-...+++|.|++..+.+.+.+. ++. .+ .+-+++ T Consensus 360 l~~k~~~~~~~~~~~l~~~~~~i~~~l~~~~~d~~~i~i~f~~~~p~d~~e~~~~~~~~~g~iS~et~l~~~p-~v~D~~ 438 (470) T protein:vir:10 360 LELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANP-IVDDWQ 438 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccceeeEEeccCCCCCHHHHHHHHHHHhccCcHHHHHHhCC-CCCCHH Confidence 224456667899999999987653 322 2335899999999999999999998776554321 221 11 122333 Q ss_pred -HHHHHhcc----cCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 420 -EIREEAGY----DPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 420 -E~R~~~~~----~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) |+...... .+... ...+.. +....++| T Consensus 439 ~E~eri~~E~~e~~~~~~--~~~~~~--------~~~~dde~ 470 (470) T protein:vir:10 439 QELKDLAKDKEENDPYSN--QADELN--------GKGVNDEQ 470 (470) T ss_pred HHHHHHHHHHHHHHHhhc--cccccC--------CCCCCCCC Confidence 33221110 01000 000000 00000111 No 187 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=98.07 E-value=5.4e-06 Score=49.46 Aligned_cols=410 Identities=11% Similarity=0.078 Sum_probs=165.2 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcc-------cCCHHHHHHHHhcCchhhhhhccchhH Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQ-------EITFNDLYTMYRRGGIAHGAVEKIVTT 73 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~-------~~~~~~l~~~Y~~~~l~r~iVd~~aed 73 (456) |-++.+.+-.-.. ..+...+ ..++.++. ...|.. ....+-++.|.+ ..-+..+++..-.- T Consensus 1 ~~~~~~~~~gl~p--------~rl~~i~-~~~~~~~~---~~~~~~~~~~Lr~~~~~~ly~~m~~-D~hi~s~l~~Rk~a 67 (488) T protein:vir:95 1 MADITETQESLPP--------FRMGEVG-SLGLKVKN---GRIYEEPRQALRFPESIKTFQLMMR-DPAVAASVNIIKMF 67 (488) T ss_pred CCCccccCCCCCH--------HHHHHHH-HHhhcccc---chhhccchhhhcccchHHHHHHHhh-ChHHHHHHHHHHHH Confidence 4444443322111 0111110 01111111 111110 012334455554 67777777777777 Q ss_pred HhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhH-H-HHHHHHHHhhcccCceEEEEEecCCCCccccc----cCCcC Q lcl|NC_016762. 74 CWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRF-W-RAVSEADRRRLVGRYSGLLLHIRDSQPWDRPA----RGKLN 147 (456) Q Consensus 74 ~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~-~-~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl----~~~~~ 147 (456) .+..-|+|.-+...++.....+....++..+..+.. | ..+.+++ .+.+||.|++=+.-+-+.....++ +.+.- T Consensus 68 v~~~~w~v~p~~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~ 146 (488) T protein:vir:95 68 VRKVNWRFVPPKGKEQDPKMLERADFFNSLMDDMEHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLI 146 (488) T ss_pred HhcCCceEecCCCCchhHHHHHHHHHHHHHHhccCccHHHHHHHHH-HhhcccceeeeeeeeccccccccccccccCCee Confidence 776666776443332222211222345556665543 3 4444444 588999998755443221111121 12222 Q ss_pred ceeEEEEeccccCChhhhhcc------------ccccccCCceeEEEeecccCCccccceeeehhhhheec----CCcCC Q lcl|NC_016762. 148 GLAKVTPAWAGCLKPKSFDEK------------PDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILG----DWTGD 211 (456) Q Consensus 148 ~l~~i~~~~~~~~~~~~~~~D------------p~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~----~~~~~ 211 (456) .++.|.++... +...+.-| +....+..|..|.. ....+..|.+.|++.+. ..+.+ T Consensus 147 ~~~~i~~Rpq~--~~~~f~~d~d~~l~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~lP~~kfi~~~~~~~~g~p~ 217 (488) T protein:vir:95 147 GWAKLPIRNQS--TLDKWYFDEDFRRVTGVRQNLRNVSHIAGAINLG-------ERPLTRKLPRAKFMLFKYDDEYGNPE 217 (488) T ss_pred eeeeeeecCcc--cccceeeccCCCceeecccccccccccccccccc-------cccccccccccceEEEeecCCCCccc Confidence 24444444322 11111111 11111222222211 11123445555554442 34567 Q ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhc----CC-Ce Q lcl|NC_016762. 212 AIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNR----GN-DV 286 (456) Q Consensus 212 G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~----~~-~~ 286 (456) |.+++..||...+--.....-++.-+-+........+.-. ...+...+...+.+.+.+..+-. +. .. T Consensus 218 g~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~--------~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag 289 (488) T protein:vir:95 218 GRSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPP--------DYLDENAEPEKKAFVQYCKTVVNDMIANDRAG 289 (488) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeecc--------CCCCCcccHHHHHHHHHHHHHHHHhhccchhh Confidence 9999999986443211111111110001001111111000 00011112222233333333322 11 22 Q ss_pred EEecCCCc---------eeEEecccC---CHHHHHHHHHHHHHhhhcCCeEEeecc------CCCcccchHH--HHHHHH Q lcl|NC_016762. 287 LLPTQGAT---------VTQMVSAVS---DPGPTYNVNLQTAAAGVDIPTKILVGM------QTGERASSED--QKYHNA 346 (456) Q Consensus 287 ~lid~~d~---------~~~~~~~~s---gl~~~~~~~~~~~aaas~IP~t~L~G~------sp~Glnst~D--~~nyyd 346 (456) +++-.+-+ ++..+..=+ ....+++..-.+||-+. +|| .-+|-+|.++ .....+ T Consensus 290 ~iiP~g~~~~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~i-------LGqtLT~~~~~~Gs~Al~~vh~ev~~~ 362 (488) T protein:vir:95 290 LIWPRYIDPDTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAF-------MSDVLAMGQSKYGSFSLADSKTSLLAM 362 (488) T ss_pred eeeccccccccchhhhhhhccccccCCchhHHHHHHHHHHHHHHHH-------hccccccccCcchhhhHHHHHHHHHHH Confidence 33332211 112222212 24556777767777654 444 1123333333 345666 Q ss_pred HHHHHHHhhhhHHHH-HHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcC---HHHHH Q lcl|NC_016762. 347 RCQARRVQELTFEIN-DLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFT---AEEIR 422 (456) Q Consensus 347 ~I~~~Qe~~lrp~L~-~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~---~~E~R 422 (456) .+.+-.. .|...|. .|+.-|+...+++...--.|.|...- ..++ ++.|++++.+++.|..+-+ .+.+| T Consensus 363 i~~aDa~-~i~~tln~~li~~l~~~Nfg~~~~~P~~~~~~~e------~~Dl-~~~ae~~~~L~~~G~~i~~~~~~~~i~ 434 (488) T protein:vir:95 363 SVDILLK-QIKNVINRDLVAQTYALNMWDDEEHVQITYDDIE------TPDL-EAIGSYIQKTVAVGALEVDKELSNKLR 434 (488) T ss_pred HHHHHHH-HHHHHHHHHHHHHHHHhcCCCCCCccEEEecCcC------hhhH-HHHHHHHHHHHhCCCccccHHHHHHHH Confidence 6666654 3555564 57777877777754332345553322 2222 3578888899999943332 24578 Q ss_pred HHhcccCCCCCCCCcccCCCCCCCCCcC------------CCCCCC Q lcl|NC_016762. 423 EEAGYDPLQGGDPLPDTEPEDEDAARTD------------PTGEQQ 456 (456) Q Consensus 423 ~~~~~~~~~~~~~~~~~~~~d~~~~~~d------------~~~~~e 456 (456) +..+.....+..+.........++..++ +.+++. T Consensus 435 e~~gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (488) T protein:vir:95 435 EHIGLPPADESQPVSEKLSPNSQSRSGDGYKTAGEGTAKTPSAKDP 480 (488) T ss_pred HHhCCCCCCCCccccccCCCCCCCCCCcccCCCcccCCcccccccc Confidence 8877654433222111111000000000 000000 No 188 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=97.92 E-value=9.6e-06 Score=48.09 Aligned_cols=431 Identities=14% Similarity=0.112 Sum_probs=177.9 Q ss_pred CCc-hhHHH---------------HhHHHHHHHHHHHHHHhhhhhccCccc--ch-hhh-hcc-Cc-ccCCHHHHHHHHh Q lcl|NC_016762. 1 MTD-KLDLA---------------VNHAMSSAIARARMSLLNQGIGHDAKR--PQ-AWC-EYG-FP-QEITFNDLYTMYR 58 (456) Q Consensus 1 ~~~-~~~~~---------------~~~a~~~~~~~~~d~~~n~~~~~gt~~--~~-~~~-~~~-~~-~~~~~~~l~~~Y~ 58 (456) |.- -|.+. .+++.+-+--..-|+-.-.-...++.. -. .++ .+| +- ...+-.+|-..|| T Consensus 1 m~~~~L~~~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~YR 80 (524) T protein:vir:10 1 MKFNVLSLFAPWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTYR 80 (524) T ss_pred CCCchhhHhhccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHHH Confidence 111 11110 111111110001111110000000000 00 000 111 11 1235667777777 Q ss_pred c---CchhhhhhccchhHHh--hCCCEEecCCCcchhhhhHHHHHHHHHHH----HHhhHHHHHHHHHHhhcccCceEEE Q lcl|NC_016762. 59 R---GGIAHGAVEKIVTTCW--KTNPQVIEGDDQDRSKDETEWERKNKPLI----AGGRFWRAVSEADRRRLVGRYSGLL 129 (456) Q Consensus 59 ~---~~l~r~iVd~~aed~t--R~~~~i~~~~~~d~~~~~~~~e~~i~~~~----~~l~~~~~~~ea~~~~r~~Ggs~i~ 129 (456) + ++.+..+|+.++.||+ .+.-.+++-+= +..+....+..+|..++ +-|++..+-.+..|.--+.|. ..+ T Consensus 81 ~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L-~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgR-i~f 158 (524) T protein:vir:10 81 NLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNL-DKSKFSPKIKNMMLDEFNDVLNHLSFQRKGSDHFRRWYVDSR-IFF 158 (524) T ss_pred HHhhccchhhHHHHhhcceeEecCCCceEEEEe-cCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeE-EEE Confidence 4 7889999999999984 22222222111 22222233333444444 344555555555443333332 233 Q ss_pred EEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccc--ccc-CCceeEEEeecc----cCC---ccccceeeeh Q lcl|NC_016762. 130 LHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDS--ETY-GQPTMWEYTEAS----QAG---RPGLVRDIHP 199 (456) Q Consensus 130 i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s--~~y-g~P~~y~i~~~~----~~g---~~~~~~~IH~ 199 (456) -.+-|.+.+.+ |...|.+|.|+-..-+ ....+++.. ..+ |--++|..++.. .+| .+...++||. T Consensus 159 hKiid~k~pk~----GI~Elr~lDPr~i~~v--r~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~ 232 (524) T protein:vir:10 159 HKIIDPKRPKE----GIKELRRLDPRQVQYV--REIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPK 232 (524) T ss_pred EEEeeCCCccc----cceeeeeeCCccceee--eeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecch Confidence 33335444322 2233334443221110 011111100 001 112222222111 111 2234577777 Q ss_pred hhhheecCCc---CC---CcchHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHH Q lcl|NC_016762. 200 DRVFILGDWT---GD---AIGFLEPAY---NSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALN 270 (456) Q Consensus 200 SRli~~~~~~---~~---G~S~le~~~---~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~ 270 (456) +- |.|+... .. =+|.|.++- |.|.-++-+ ..+|+-+ |....- -.=+|+-+|....+ ++-+ T Consensus 233 dA-I~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDA-----lVIYRit-RAPeRR-vFYIDvGnlPk~KA---eqYl 301 (524) T protein:vir:10 233 AA-IVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDA-----VVIYRIT-RAPDRR-VWYVDTGNMPARKA---AEHM 301 (524) T ss_pred hh-eeeeeccceeCCCCceeccchhhhHHHHhhhHHHhh-----HHHHhhh-ccccce-EEEEecCCCCchhH---HHHH Confidence 74 3343211 11 135565543 222222221 1233321 111000 00123333332111 1222 Q ss_pred HHHHHHHHHHh------cCCCe-------EEe-----------cCCCceeEEe--cccCCHHHHHHHHHHHHHhhhcCCe Q lcl|NC_016762. 271 ERFNEAARQLN------RGNDV-------LLP-----------TQGATVTQMV--SAVSDPGPTYNVNLQTAAAGVDIPT 324 (456) Q Consensus 271 ~~~~~~~~~~~------~~~~~-------~li-----------d~~d~~~~~~--~~~sgl~~~~~~~~~~~aaas~IP~ 324 (456) + +.|.+++ .++|- +-| .++-+++++. -+++-++| +..|..-+=-|.++|+ T Consensus 302 ~---~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~ 377 (524) T protein:vir:10 302 Q---HVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMED-VRWFRQALYMALRVPL 377 (524) T ss_pred H---HHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHH-HHHHHHHHHHHhCCch Confidence 1 2222221 11110 000 0223455543 25667777 5678888889999999 Q ss_pred EEeeccCCCccc---chH---HHHHHHHHHHHHHHhhhhHHHHHHHHH-HHHhcCcC------CCCceEEEeCCCCCCCH Q lcl|NC_016762. 325 KILVGMQTGERA---SSE---DQKYHNARCQARRVQELTFEINDLFAH-LMRIGVVP------LKAEFTAIWDDLTVPTK 391 (456) Q Consensus 325 t~L~G~sp~Gln---st~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~-l~~s~~~~------~~~d~~~~f~pL~~~se 391 (456) +||-+.+++|+| |++ |.-.|...|.+.|.. +...+..+++. |++-+++. ....+.|+|+-=..-+| T Consensus 378 sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~r-Fs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~E 456 (524) T protein:vir:10 378 SRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHK-FEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTE 456 (524) T ss_pred hhcCCCCCccccccccchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHH Confidence 999888888887 233 677899999998875 45656666653 44444442 33578999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHH-cCCcCcCHHHHHH-HhcccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 392 AERLANSKTMSEINSAAIG-TGEPVFTAEEIRE-EAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 392 ke~Aei~~~~A~a~~~~~~-~g~~~i~~~E~R~-~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) -..+|+...+..+.+..-. .|. .++-+=+++ .+.+.--.-.....-++.+-.++--++|..++| T Consensus 457 lKe~Eil~~R~~~l~~~dpyvGk-y~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~ 522 (524) T protein:vir:10 457 LKEAEILERRINMLTMAEPFIGK-YISHRTAMKDILQMTDEEIEQEAKQIEEESKEARFQDPDQEQE 522 (524) T ss_pred HHHHHHHHHHHHHHHHhhhhhcc-cchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhh Confidence 9999999888887665432 121 234333322 111110000000000000000111122222222 No 189 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=97.91 E-value=9.7e-06 Score=48.07 Aligned_cols=431 Identities=14% Similarity=0.112 Sum_probs=177.9 Q ss_pred CCc-hhHHH---------------HhHHHHHHHHHHHHHHhhhhhccCccc--ch-hhh-hcc-Cc-ccCCHHHHHHHHh Q lcl|NC_016762. 1 MTD-KLDLA---------------VNHAMSSAIARARMSLLNQGIGHDAKR--PQ-AWC-EYG-FP-QEITFNDLYTMYR 58 (456) Q Consensus 1 ~~~-~~~~~---------------~~~a~~~~~~~~~d~~~n~~~~~gt~~--~~-~~~-~~~-~~-~~~~~~~l~~~Y~ 58 (456) |.- -|.+. .+++.+-+--..-|+-.-.-...++.. -. .++ .+| +- ...+-.+|-..|| T Consensus 1 m~~~~L~~~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~YR 80 (524) T protein:vir:72 1 MKFNVLSLFAPWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTYR 80 (524) T ss_pred CCCchhhHhhccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHHH Confidence 111 11110 111111110001111110000000000 00 000 111 11 1235667777777 Q ss_pred c---CchhhhhhccchhHHh--hCCCEEecCCCcchhhhhHHHHHHHHHHHH----HhhHHHHHHHHHHhhcccCceEEE Q lcl|NC_016762. 59 R---GGIAHGAVEKIVTTCW--KTNPQVIEGDDQDRSKDETEWERKNKPLIA----GGRFWRAVSEADRRRLVGRYSGLL 129 (456) Q Consensus 59 ~---~~l~r~iVd~~aed~t--R~~~~i~~~~~~d~~~~~~~~e~~i~~~~~----~l~~~~~~~ea~~~~r~~Ggs~i~ 129 (456) + ++.+..+|+.++.||+ .+.-.+++-+= +..+....+..+|..+++ -|++..+-.+..|.--+.|. ..+ T Consensus 81 ~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L-~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgR-i~f 158 (524) T protein:vir:72 81 NLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNL-DKSKFSPKIKNMMLDEFSDVLNHLSFQRKGSDHFRRWYVDSR-IFF 158 (524) T ss_pred HHhhccchhhHHHHhhcceeEecCCCceEEEEe-cCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeE-EEE Confidence 4 7889999999999984 22222222111 222222333334444443 44555555555443333332 233 Q ss_pred EEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccc--ccc-CCceeEEEeecc----cCC---ccccceeeeh Q lcl|NC_016762. 130 LHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDS--ETY-GQPTMWEYTEAS----QAG---RPGLVRDIHP 199 (456) Q Consensus 130 i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s--~~y-g~P~~y~i~~~~----~~g---~~~~~~~IH~ 199 (456) -.+-|.+.+.+ |...|.+|.|+-..-+ ....+++.. ..+ |--++|..++.. .+| .+...++||. T Consensus 159 hKiid~k~pk~----GI~Elr~lDPr~i~~v--r~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~ 232 (524) T protein:vir:72 159 HKIIDPKRPKE----GIKELRRLDPRQVQYV--REIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPK 232 (524) T ss_pred EEEEeCCCccc----cceeeeeeCCccceee--eeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecch Confidence 33345444332 2233334443221110 011111100 001 112222222111 111 2234577777 Q ss_pred hhhheecCCc---CC---CcchHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHH Q lcl|NC_016762. 200 DRVFILGDWT---GD---AIGFLEPAY---NSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALN 270 (456) Q Consensus 200 SRli~~~~~~---~~---G~S~le~~~---~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~ 270 (456) +- |.|+... .. =+|.|.++- |.|.-++-+ ..+|+-+ |....- -.=+|+-+|....+ ++-+ T Consensus 233 dA-I~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDA-----lVIYRit-RAPeRR-vFYIDvGnlPk~KA---eqYl 301 (524) T protein:vir:72 233 AA-VVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDA-----VVIYRIT-RAPDRR-VWYVDTGNMPARKA---AEHM 301 (524) T ss_pred hh-eeeeeccceeCCCCceeccchhhhHhHHhhhHHHhh-----HHHHhhh-ccccce-EEEEecCCCCchhH---HHHH Confidence 74 3343211 11 135565543 222222221 1233321 111000 00123333332111 1222 Q ss_pred HHHHHHHHHHh------cCCCe-------EEe-----------cCCCceeEEe--cccCCHHHHHHHHHHHHHhhhcCCe Q lcl|NC_016762. 271 ERFNEAARQLN------RGNDV-------LLP-----------TQGATVTQMV--SAVSDPGPTYNVNLQTAAAGVDIPT 324 (456) Q Consensus 271 ~~~~~~~~~~~------~~~~~-------~li-----------d~~d~~~~~~--~~~sgl~~~~~~~~~~~aaas~IP~ 324 (456) + +.|.+++ .++|- +-| .++-+++++. -+++-++| +..|..-+=-|.++|+ T Consensus 302 ~---~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~ 377 (524) T protein:vir:72 302 Q---HVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMED-IRWFRQALYMALRVPL 377 (524) T ss_pred H---HHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHH-HHHHHHHHHHHhCCch Confidence 1 2222221 11110 000 0223455543 25666777 5678888889999999 Q ss_pred EEeeccCCCccc---chH---HHHHHHHHHHHHHHhhhhHHHHHHHHH-HHHhcCcC------CCCceEEEeCCCCCCCH Q lcl|NC_016762. 325 KILVGMQTGERA---SSE---DQKYHNARCQARRVQELTFEINDLFAH-LMRIGVVP------LKAEFTAIWDDLTVPTK 391 (456) Q Consensus 325 t~L~G~sp~Gln---st~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~-l~~s~~~~------~~~d~~~~f~pL~~~se 391 (456) +||-+.+++|+| |++ |.-.|...|.+.|.. +...+..+++. |++-+++. ....+.|+|+-=..-+| T Consensus 378 sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~r-Fs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~E 456 (524) T protein:vir:72 378 SRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHK-FEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAE 456 (524) T ss_pred hhcCCCCCccccccccchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHH Confidence 999888888887 233 677899999998875 45656666653 44444442 33578999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHH-cCCcCcCHHHHHH-HhcccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 392 AERLANSKTMSEINSAAIG-TGEPVFTAEEIRE-EAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 392 ke~Aei~~~~A~a~~~~~~-~g~~~i~~~E~R~-~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) -..+|+...+..+.+..-. .|. .++-+=+++ .+.+.--.-.....-++.+-.++--++|..+.| T Consensus 457 lKe~Eil~~R~~~l~~~dpyvGk-y~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~ 522 (524) T protein:vir:72 457 LKEAEILERRINMLTMAEPFIGK-YISHRTAMKDILQMTDEEIEQEAKQIEEESKEARFQDPDQEQE 522 (524) T ss_pred HHHHHHHHHHHHHHHHhhhhhcc-cchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhh Confidence 9999999888887665432 121 234333322 111110000000000000000111112222222 No 190 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=97.73 E-value=2.5e-05 Score=45.86 Aligned_cols=429 Identities=13% Similarity=0.106 Sum_probs=179.6 Q ss_pred CCc-hhHHHHhHH-----HHHH-HHHHHHHHhhhhhccCc-----ccch--hh----hhc-cCc-ccCCHHHHHHHHhc- Q lcl|NC_016762. 1 MTD-KLDLAVNHA-----MSSA-IARARMSLLNQGIGHDA-----KRPQ--AW----CEY-GFP-QEITFNDLYTMYRR- 59 (456) Q Consensus 1 ~~~-~~~~~~~~a-----~~~~-~~~~~d~~~n~~~~~gt-----~~~~--~~----~~~-~~~-~~~~~~~l~~~Y~~- 59 (456) |.- =+++-.--. .... ....+.+++-.-..-|+ +-.. ++ ++| +.. ...+-.+|-..||+ T Consensus 1 m~~~~l~lf~f~~k~~e~~~~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR~m 80 (521) T protein:vir:10 1 MNPIFLKLLQPWMKDDEKRVQSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQNTKDLINQYRSL 80 (521) T ss_pred CCcchhHHhhhhhhhhhhHHhhhhccCccccccccCCCCceeeccCCCccccccchhhhhhccccccchHHHHHHHHHHH Confidence 211 011100000 0000 00011222222112222 1011 11 111 111 12355677777774 Q ss_pred --CchhhhhhccchhHHh--hCCCEEecCCCcchhhhhHHHHHHHHHHH----HHhhHHHHHHHHHHhhcccCceEEEEE Q lcl|NC_016762. 60 --GGIAHGAVEKIVTTCW--KTNPQVIEGDDQDRSKDETEWERKNKPLI----AGGRFWRAVSEADRRRLVGRYSGLLLH 131 (456) Q Consensus 60 --~~l~r~iVd~~aed~t--R~~~~i~~~~~~d~~~~~~~~e~~i~~~~----~~l~~~~~~~ea~~~~r~~Ggs~i~i~ 131 (456) ++.+..+|+.++.||+ .+.-.+++-+= +..+....+..+|..++ +-|++..+..+..|.--+.| -+++. T Consensus 81 a~~pEvd~Av~eIvneaiv~d~~~~pV~i~L-d~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDg--Ri~fH 157 (521) T protein:vir:10 81 SKYHEVDNAIDEIINDAIVQEDNRDTVYLDL-DKTDWNESVKEMVREEFRTILKLLKFEREGKRHFRRWYVDS--RIYFH 157 (521) T ss_pred hhccchhhHHHhhhcceEEecCCCceEEEEe-cCcccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeee--eEEEE Confidence 7889999999999984 22333322111 11111223333444444 34455555555544333333 23333 Q ss_pred -ecCCCCccccccCCcCceeEEEEeccccC---ChhhhhccccccccCCceeEEEee-----cccCCccccceeeehhhh Q lcl|NC_016762. 132 -IRDSQPWDRPARGKLNGLAKVTPAWAGCL---KPKSFDEKPDSETYGQPTMWEYTE-----ASQAGRPGLVRDIHPDRV 202 (456) Q Consensus 132 -i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~---~~~~~~~Dp~s~~yg~P~~y~i~~-----~~~~g~~~~~~~IH~SRl 202 (456) +-|.+.+.+ |...|.+|.|+-..-+ .....+...-.. |--++|..++ .+.+|++..+++||.+- T Consensus 158 kiid~~~pk~----GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~--~~~e~f~Y~~~~~~~~~~~g~~~~~vkI~~da- 230 (521) T protein:vir:10 158 KMIDPARPKD----GIKELRLLDPRNVEYYRVNLKSNENGNDVYK--GVKEFFTYGATEDNRYNISGNSNNLVQIPIDA- 230 (521) T ss_pred EEeeCCCccc----cceeeeeeCCcceeeeeeecCCCCCcchhhc--cceeeeeeccCCCceecCCCCCCcceeechhh- Confidence 224333322 2223334433221100 000000000000 1113333321 12234556678888864 Q ss_pred heecC------CcCCCcchHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHH Q lcl|NC_016762. 203 FILGD------WTGDAIGFLEPAY---NSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERF 273 (456) Q Consensus 203 i~~~~------~~~~G~S~le~~~---~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 273 (456) |.|+. .....+|.|.++- |.|.-++-+ ..+|+-+ |....- -.-+|+-+|....+ ++-++ T Consensus 231 I~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDA-----lVIYRit-RAPeRR-vFYIDvGnlpk~KA---eqYl~-- 298 (521) T protein:vir:10 231 IVYSHSGKVDIDGKTIVGYLHNVIKPANQLKMLEDA-----MVIYRIT-RAPERR-VFYIDVGTMPNKKA---TQHLN-- 298 (521) T ss_pred eeeecccceeCCCCceeccchhhhHhHHhhHHHHhh-----HHHHhhh-ccccce-EEEEecCCCCchhH---HHHHH-- Confidence 44432 2234567776653 333322222 1233321 111000 00123333332211 12221 Q ss_pred HHHHHHHh------cCCCe-------EEe-----------cCCCceeEEe--cccCCHHHHHHHHHHHHHhhhcCCeEEe Q lcl|NC_016762. 274 NEAARQLN------RGNDV-------LLP-----------TQGATVTQMV--SAVSDPGPTYNVNLQTAAAGVDIPTKIL 327 (456) Q Consensus 274 ~~~~~~~~------~~~~~-------~li-----------d~~d~~~~~~--~~~sgl~~~~~~~~~~~aaas~IP~t~L 327 (456) +.|..++ .++|- +-+ .++-+++++. -+++-++| +..|..-+=-|.++|++|| T Consensus 299 -~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEI~TLpggqnlgem~D-V~YF~kkLy~aLnVP~sRl 376 (521) T protein:vir:10 299 -NVMQGLKNRVVYDSSTGKVKNSSNNLAMTEDYWLMRRDGKATTEVSTLPGAQSMGEMDD-VRWFNRKLYESMKIPLSRL 376 (521) T ss_pred -HHHHhcCceEEEeccCceeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHH-HHHHHHHHHHHhCCCcccc Confidence 1222221 11110 000 0223455543 35667777 5678888889999999997 Q ss_pred eccCCCccc---chH---HHHHHHHHHHHHHHhhhhHHHHHHHHH-HHHhcCcC------CCCceEEEeCCCCCCCHHHH Q lcl|NC_016762. 328 VGMQTGERA---SSE---DQKYHNARCQARRVQELTFEINDLFAH-LMRIGVVP------LKAEFTAIWDDLTVPTKAER 394 (456) Q Consensus 328 ~G~sp~Gln---st~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~-l~~s~~~~------~~~d~~~~f~pL~~~seke~ 394 (456) =. ..+|+| |++ |.-.|...|.+.|.. +...+..+++. |++-+++. ....+.|.|+-=..-+|-.. T Consensus 377 ~~-e~~~f~~Gr~~EItRDEikF~KFI~rLR~r-Fs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe 454 (521) T protein:vir:10 377 PQ-EGAGVTFGAGNDITRDELQFTKYIRGLQQQ-FEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKD 454 (521) T ss_pred CC-CCCceecccccchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHH Confidence 44 455565 232 667899999998875 46666666653 44444442 33578999999999999999 Q ss_pred HHHHHHHHHHHHHHHH---cCCcCcCHHHHHH-HhcccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 395 LANSKTMSEINSAAIG---TGEPVFTAEEIRE-EAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 395 Aei~~~~A~a~~~~~~---~g~~~i~~~E~R~-~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) +|+...+..+.+..-- .|. .++-+=++. .+.+.-..--....-++.+-.++-=++|.++.| T Consensus 455 ~eil~~R~~~l~~~dp~~yvGk-y~s~dyi~k~ILr~tDeeik~~~k~I~~E~~~~~~~~p~~e~~ 519 (521) T protein:vir:10 455 VEILERRVNLVQTLASAEVTGK-YLSHEYVMKNILRMSDEDIKTEREKIDGELKDSVYKNPEDPME 519 (521) T ss_pred HHHHHHHHHHHHhhcCcccccc-ccchHHHHHHHhcCCHhHHHHHHHHHHHhhhCCCCCCCcchhh Confidence 9999998887765422 231 344444432 222110000000000000000111112222222 No 191 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=97.71 E-value=2.7e-05 Score=45.67 Aligned_cols=426 Identities=15% Similarity=0.113 Sum_probs=182.0 Q ss_pred CCchhH-------HHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccC-cccCCHHHHHHHHhc---Cchhhhhhcc Q lcl|NC_016762. 1 MTDKLD-------LAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGF-PQEITFNDLYTMYRR---GGIAHGAVEK 69 (456) Q Consensus 1 ~~~~~~-------~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~-~~~~~~~~l~~~Y~~---~~l~r~iVd~ 69 (456) |..+|= .-.+.+.+-+.....|+-.+..++...+.. .+. +...+-.+|-.-||+ ++.+..+|+. T Consensus 1 ~~~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~g~~-----~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~e 75 (537) T protein:vir:10 1 MAQQLFGFSLQRAKKVPKGPSFVQKDSLDGSQPIVGGGYFGYS-----VDFDGTIRNDHELITRYREMVLNPECDSAVDD 75 (537) T ss_pred CccccccceeecccccccCCcccCCCcccccceeecccccccc-----cccccccchHHHHHHHHHHHhhccchhhHHHH Confidence 222110 000011111111112222222222111100 001 123345678777874 7889999999 Q ss_pred chhHHh--hCCCEEecCCCcchhhhhHHHHHHHHHH----HHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCcccccc Q lcl|NC_016762. 70 IVTTCW--KTNPQVIEGDDQDRSKDETEWERKNKPL----IAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPAR 143 (456) Q Consensus 70 ~aed~t--R~~~~i~~~~~~d~~~~~~~~e~~i~~~----~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~ 143 (456) ++.||+ .+.-.+++-+= +..+....+.++|..+ ++-|++..+..+..|.--+.|. ..+-.+-|.+.+.+ T Consensus 76 IVneaiv~d~~~~pV~i~L-d~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgR-i~fhKiid~k~pk~--- 150 (537) T protein:vir:10 76 VVNETICGNFDDVPISIDL-HNLKQSEKIKKLIRSEFDEILRLLDFDNRAYEIFRRWYVDGR-LFFHKVIDPKKPRQ--- 150 (537) T ss_pred hhcceeEecCCCceEEEEe-cccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeE-EEEEEEEeCCCccc--- Confidence 999984 22222211110 1111122223334443 3444556655555543333332 23333445444332 Q ss_pred CCcCceeEEEEeccccC------Ch-hhhhcccccccc-CCceeEEEeecccCCccccceeeehhhhheecC-----C-c Q lcl|NC_016762. 144 GKLNGLAKVTPAWAGCL------KP-KSFDEKPDSETY-GQPTMWEYTEASQAGRPGLVRDIHPDRVFILGD-----W-T 209 (456) Q Consensus 144 ~~~~~l~~i~~~~~~~~------~~-~~~~~Dp~s~~y-g~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~-----~-~ 209 (456) +...|.+|.|+-..-+ ++ .....+-...-+ |.-+||..++....+++..+++|+.+- |.|+. . . T Consensus 151 -GI~ELr~lDPr~i~~vR~i~~~~~~~~~~~~~~~~v~~~~~eyf~ynp~g~~~~~~~~vkI~~dA-I~y~hSGl~d~n~ 228 (537) T protein:vir:10 151 -GLVELRYVDPRKIRKVTEYEAKRPEALRTQDLNQQLTQQSASYFLYNPKGLKNSTNQGMKIAPDS-IAYCHSGIQDLNK 228 (537) T ss_pred -cceeeeeeCCccceeeEeecccCCccceEEecceeeeecccceeeeccccccccCCCceeccHhh-eeeecccceeCCC Confidence 2222333333221100 00 000011111111 123445555544445566678888864 44432 1 1 Q ss_pred CCCcchHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHh---- Q lcl|NC_016762. 210 GDAIGFLEPAY---NSFISLEKVEGGSGESFLKNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLN---- 281 (456) Q Consensus 210 ~~G~S~le~~~---~~l~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~---- 281 (456) ...+|.|.++- |.|.-++-+ ..+|+-+ |... ..| -+|+-+|....+ ++-++ +.|.+++ T Consensus 229 ~~i~syLhkAiKp~NQLkm~EDA-----lVIYRit-RAPeRRvF--YIDVGnLPk~KA---eqYlr---~iM~k~KNklV 294 (537) T protein:vir:10 229 NMVLSHLHKAIKAVNQLRMIEDS-----LVIYRLS-RAPERRIF--YIDVGNLPKNKA---EQYLR---EVMGRYRNKLV 294 (537) T ss_pred CeeeeeehhhhHHHHhhHHHHhh-----HHHHhhh-ccccceEE--EEecCCCCchhH---HHHHH---HHHHhccceEE Confidence 23567776653 333222222 1233321 1110 000 123333332211 12222 2222221 Q ss_pred --cCCCe---------EEec---------CCCceeEEe--cccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCccc--- Q lcl|NC_016762. 282 --RGNDV---------LLPT---------QGATVTQMV--SAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERA--- 336 (456) Q Consensus 282 --~~~~~---------~lid---------~~d~~~~~~--~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Gln--- 336 (456) .+++- ++=| ++-+++++. -+++-++| +..|..-+=-|.++|++||= +.+|+| T Consensus 295 YDa~TGev~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnlgem~D-V~YF~kKLy~aLnVP~SRl~--~e~~f~~Gr 371 (537) T protein:vir:10 295 YDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELED-VKYFQKKLYKALNVPSSRLE--TETTFNIGR 371 (537) T ss_pred EeccCceecccchhhhhhhhhcccccCCCcccceeeccccCCcChHHH-HHHHHHHHHHHhCCCccccC--CCCcccccc Confidence 11110 0000 123455543 35667777 56788888899999999993 346765 Q ss_pred chH---HHHHHHHHHHHHHHhhhhHHHHHHHHH-HHHhcCcC------CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHH Q lcl|NC_016762. 337 SSE---DQKYHNARCQARRVQELTFEINDLFAH-LMRIGVVP------LKAEFTAIWDDLTVPTKAERLANSKTMSEINS 406 (456) Q Consensus 337 st~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~-l~~s~~~~------~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~ 406 (456) |++ |.-.|...|.+.|.. +...+..+++. |++-+++. ....+.|.|+-=..-+|-..+|+...+..+.+ T Consensus 372 ~~EItRDEiKF~KFI~RLR~r-Fs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~ 450 (537) T protein:vir:10 372 AAEITRDEVKFQKFIARLRKR-FSELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVA 450 (537) T ss_pred cchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHH Confidence 333 677899999998875 45656666653 44444432 33578999999999999999999988888765 Q ss_pred HHHH-cCC-----------cCcCHHHHHHH---hcc---cCCCCC---CCCcc-----c--CCCCCCCCCcCCC-----C Q lcl|NC_016762. 407 AAIG-TGE-----------PVFTAEEIREE---AGY---DPLQGG---DPLPD-----T--EPEDEDAARTDPT-----G 453 (456) Q Consensus 407 ~~~~-~g~-----------~~i~~~E~R~~---~~~---~~~~~~---~~~~~-----~--~~~d~~~~~~d~~-----~ 453 (456) ..-- .|. --.+.+|+.+. ... +|..-. +...+ . .+.....++.||. . T Consensus 451 ~~dpyvGky~s~dyi~k~ILr~tDeeI~~~~k~I~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 530 (537) T protein:vir:10 451 QMDPYVGKYFSANYIRTKVLKQTESEIKEIDKEIKQEIADGVIMDPQAMQAMEMGIGDEEPVPEGGEEPQTDPNSAVSPA 530 (537) T ss_pred HhhhhhhcccchHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCcccccccccCCCCcccCCCCCCCcccCCccCCCCC Confidence 5321 121 12344454332 111 111100 00000 0 0001111111111 1 Q ss_pred CCC Q lcl|NC_016762. 454 EQQ 456 (456) Q Consensus 454 ~~e 456 (456) .++ T Consensus 531 ~~~ 533 (537) T protein:vir:10 531 DQK 533 (537) T ss_pred Ccc Confidence 111 No 192 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=97.58 E-value=4.1e-05 Score=44.61 Aligned_cols=406 Identities=11% Similarity=0.017 Sum_probs=179.5 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCc-ccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHHhhCCC Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDA-KRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNP 79 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt-~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~ 79 (456) |.=..+.|-+-++...++.+++++-- ..|. .-|....+.|=...-..+-++.|-++..-+..++++.-.-.++--| T Consensus 1 ~~~~~~~~p~~~~~~~~~~~~~~~~~---~~g~~~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~av~~~~w 77 (446) T protein:vir:98 1 MNMEVRNAPTPAIRRRTIYAMEHLGL---ATSYLSEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDSIALSVLNKVG 77 (446) T ss_pred CcccccCCCchhhhhhhhhccccchh---hcccCCcchHhhhcCCChHHHHHHHHHHHhcchHHHHHHHHHHHHhhcCCc Confidence 66555555555555555554444311 1111 1122222222100000122345556678788888887777777667 Q ss_pred EEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccC--CcCceeEEEEecc Q lcl|NC_016762. 80 QVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARG--KLNGLAKVTPAWA 157 (456) Q Consensus 80 ~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~--~~~~l~~i~~~~~ 157 (456) +|..++ +.. .+.++..+.++.+...+.+ +..+..||.|+.=+.-.-+.....|.+. +.-++..+.+.|. T Consensus 78 ~V~p~~-----~~~---a~~v~~~l~~~~~~~~~~~-~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~~~~~~~~~r~~ 148 (446) T protein:vir:98 78 PYQHGD-----KRI---KKFIDDQLRNRAKTWISHC-VKSIMTYGFSLSEQIYAHGARDNMPATVLDDIVNYHPLQVMLI 148 (446) T ss_pred eecCcc-----HHH---HHHHHHHHhhcCchhHHHH-HHHHHhhCceeeeEEEeecccccccchhhccccccccccceee Confidence 774322 111 2236666666655444444 3345668998875543211112234321 2222333333443 Q ss_pred ccC----------Chhhhhc-c--cccc-ccCCceeEEEeecccCCccccceeeehhhhheec----CCcCCCcchHHHH Q lcl|NC_016762. 158 GCL----------KPKSFDE-K--PDSE-TYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILG----DWTGDAIGFLEPA 219 (456) Q Consensus 158 ~~~----------~~~~~~~-D--p~s~-~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~----~~~~~G~S~le~~ 219 (456) ..- +...+.. + |..| .++.|..+. +..+....|...|++.+. ..+..|.|++..+ T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~g~~~~iP~~kfi~~~~~~~~~~p~G~gLlr~~ 221 (446) T protein:vir:98 149 ANDNGRIVDGDTVTASQYKSGYWVPLPPYRIGDPPKKV-------DVVGSHVRLPSHKRLFINYNTKGNNPWGTSCLTSV 221 (446) T ss_pred eccCCccccccccchhhcccccccCcccchhhhhhhhc-------ccCcccccccccceEEEEecCCCCCccccchHHHH Confidence 211 1111110 0 0000 112222111 111223445666665542 3456799999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHH----HHHHHHHHHHHHhcCCCeEEe-----c Q lcl|NC_016762. 220 YNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDA----LNERFNEAARQLNRGNDVLLP-----T 290 (456) Q Consensus 220 ~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~li-----d 290 (456) |...+=-.....-++.-+-+........++-....-. +.......+ ..+.+...++.+.+. +..++ - T Consensus 222 ~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~---~~~~~~~~~~~~~~~~~L~~av~~~~~d-a~~ii~~~~~P 297 (446) T protein:vir:98 222 LDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGV---VEEAPDGTEITTTIAEQAEDALRRLSTD-SGLVLTQLSKE 297 (446) T ss_pred HHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcc---cccchhHHHHHHHHHHHHHHHHHhcccc-ceeeeecccCC Confidence 8744321112222221111111111111111100000 000000011 112233333332222 22333 3 Q ss_pred CCCceeEEecccCC---HHHHHHHHHHHHHhhhcCCeEEeeccCCCcc--cchHH--HHHHHHHHHHHHHhhhhHHHH-H Q lcl|NC_016762. 291 QGATVTQMVSAVSD---PGPTYNVNLQTAAAGVDIPTKILVGMQTGER--ASSED--QKYHNARCQARRVQELTFEIN-D 362 (456) Q Consensus 291 ~~d~~~~~~~~~sg---l~~~~~~~~~~~aaas~IP~t~L~G~sp~Gl--nst~D--~~nyyd~I~~~Qe~~lrp~L~-~ 362 (456) .+-+++-++..-++ ...+++.+-.+||-+.--+.. .+|++.++- ++-++ ...+.+.+++-... +...|. . T Consensus 298 ~g~eie~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~L-tl~~~~~~~GS~ala~vh~~V~~d~~~aDa~~-i~~tln~~ 375 (446) T protein:vir:98 298 QPVQVGALTTGNNFSDSFERAISLCDNNMLMGMGIPNL-LVQNRETTFGTGRASEIQLELFDGKINSIFDT-VIHAFTEQ 375 (446) T ss_pred CCceEEeeccccCChhhHHHHHHHHHHHHHHHHhcccc-cccccccccchhhhHHHHHHHHHHHHHHHHHH-HHHHHHHH Confidence 34567766665444 466777777788887766543 236554432 22232 34567777777763 555564 6 Q ss_pred HHHHHHHhcCcCCCCceEE-EeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcC-cCHHHHHHHhcccCCCCCCC Q lcl|NC_016762. 363 LFAHLMRIGVVPLKAEFTA-IWDDLTVPTKAERLANSKTMSEINSAAIGTGEPV-FTAEEIREEAGYDPLQGGDP 435 (456) Q Consensus 363 l~~~l~~s~~~~~~~d~~~-~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~-i~~~E~R~~~~~~~~~~~~~ 435 (456) |+.-|+...+++...-..+ ...|-..+. |..++ ++.|+++..+++.|..+ .+.+.+|+..+.....+. + T Consensus 376 Li~~l~~lNf~~~~~~~~~~~~~~~~~~~--e~eDl-~~~a~~~~~L~~~G~~~p~~~~~ire~~giP~~~~~-~ 446 (446) T protein:vir:98 376 VIGNLIRLNFDPALYPLASNTGYITRLPG--RATDL-AALVEAIKQMHDMGFLVDGDKDHIRSITGLPDAISS-T 446 (446) T ss_pred HHHHHHHhCCCccccccccccccceeccC--ChhhH-HHHHHHHHHHHhCCccccccHHHHHHHhCcCCCCCC-C Confidence 8888887777643211111 111222233 33333 35799999999999321 123458998876332111 1 No 193 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=97.55 E-value=4.6e-05 Score=44.37 Aligned_cols=422 Identities=14% Similarity=0.094 Sum_probs=176.4 Q ss_pred CCc-------hhHHHHhHHHHHHHHHHHHHHhhhhhcc--CcccchhhhhccCcccCCHHHHHHHHhc---Cchhhhhhc Q lcl|NC_016762. 1 MTD-------KLDLAVNHAMSSAIARARMSLLNQGIGH--DAKRPQAWCEYGFPQEITFNDLYTMYRR---GGIAHGAVE 68 (456) Q Consensus 1 ~~~-------~~~~~~~~a~~~~~~~~~d~~~n~~~~~--gt~~~~~~~~~~~~~~~~~~~l~~~Y~~---~~l~r~iVd 68 (456) |++ +.+.. .+.+=.---.-|+... .++. |+..+. +|-....+-.+|-..||+ ++.+..+|+ T Consensus 1 m~~lfgf~i~~~~~~--~~~S~vpp~~~~~~~~-i~~g~~g~~v~~----~g~~~~~n~~eLI~~YR~ma~~pEVd~Av~ 73 (564) T protein:vir:10 1 MSQLFGFLINEKEGQ--KGQSPVPPNDEASVST-VAGGYFGTYVDT----SGGQNSRNEYELIRRYRDMSLHPEVDSAID 73 (564) T ss_pred CcchhcceeeeeccC--CCCCcccCCcCCChhh-hhccccceeeec----ccccchhhHHHHHHHHHHHhhccchhhHHH Confidence 221 00000 0000000000011111 1111 111111 111122345577777774 788999999 Q ss_pred cchhHHh--hCCCEEecCCCcchhhhhHHHHHHHHHHHH----HhhHHHHHHHHHHhhcccCceEEEEE-ecCCCCcccc Q lcl|NC_016762. 69 KIVTTCW--KTNPQVIEGDDQDRSKDETEWERKNKPLIA----GGRFWRAVSEADRRRLVGRYSGLLLH-IRDSQPWDRP 141 (456) Q Consensus 69 ~~aed~t--R~~~~i~~~~~~d~~~~~~~~e~~i~~~~~----~l~~~~~~~ea~~~~r~~Ggs~i~i~-i~D~~~~~~P 141 (456) .++.||+ .+.-.+++.+- ++.+....+.++|..+++ -|++..+..+..|.--+.| -+++. +-|.+.+.+ T Consensus 74 eIVneaIv~d~~~~pV~vdL-~~~~~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDg--Ri~fHkiid~~~pk~- 149 (564) T protein:vir:10 74 EIVNEFVVNDGDDKPVEVDL-QNLEIGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDG--RSHYHKVIDLDNPKK- 149 (564) T ss_pred HhhcceeEecCCCceEEEEe-cccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcc--eEEEEEEeeCCChhh- Confidence 9999984 22223322221 223333344444544444 3444555555444322223 22222 223333222 Q ss_pred ccCCcCceeEEEEe-----ccccCChh----hhhcc-ccccccCC-ceeEEEeecccCC-----------ccccceeeeh Q lcl|NC_016762. 142 ARGKLNGLAKVTPA-----WAGCLKPK----SFDEK-PDSETYGQ-PTMWEYTEASQAG-----------RPGLVRDIHP 199 (456) Q Consensus 142 l~~~~~~l~~i~~~-----~~~~~~~~----~~~~D-p~s~~yg~-P~~y~i~~~~~~g-----------~~~~~~~IH~ 199 (456) +...|.+|.|+ |.....+. ..+.+ ...-+|+. |++|.+++....| ++..+.+||. T Consensus 150 ---GI~eLr~lDPr~i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~~g~~~~~~~~~~~~~~~~ikI~~ 226 (564) T protein:vir:10 150 ---GILELRYIDSLKIRKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGFAGNIPMVTGSMDWSNQEGIKIAS 226 (564) T ss_pred ---hhhhhhhhcccceeeeeeeccccccccceeeeeeeeeccccccccceeeccccccCcccccccccccccccceeech Confidence 23333444433 21111010 11111 11113332 6777777543222 2334578887 Q ss_pred hhhheecCC-----cCCCcchHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHH Q lcl|NC_016762. 200 DRVFILGDW-----TGDAIGFLEPAY---NSFISLEKVEGGSGESFLKNAARQLL-LNFDKEINLGEIASTYGVTLDALN 270 (456) Q Consensus 200 SRli~~~~~-----~~~G~S~le~~~---~~l~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~ 270 (456) +-+..-..+ ..-=+|.|.++- |-|.-++-+ ..+|+-+ |... ..| =+|+-+|....+ ++-+ T Consensus 227 daI~y~hSGL~d~~~~~i~gyLhkAIKp~NQLkmlEDA-----lVIYRit-RAPeRRvF--YIDVGnLPk~KA---eqYl 295 (564) T protein:vir:10 227 DAIAQSTSGLMDLNKKMTLSFLHKAIKSLNQLRMIEDS-----LVIYRLS-RAPERRIF--YIDVGNLPKVKA---EQYL 295 (564) T ss_pred hhcceecccceeCCCCceeccchhhhHhHHhhHHHHhh-----HHHHhhh-ccccceEE--EEecCCCCchhH---HHHH Confidence 754322111 111235565443 222222221 1233321 1110 000 123333332211 1222 Q ss_pred HHHHHHHHHHh------cCCCe---------EEe---------cCCCceeEEe--cccCCHHHHHHHHHHHHHhhhcCCe Q lcl|NC_016762. 271 ERFNEAARQLN------RGNDV---------LLP---------TQGATVTQMV--SAVSDPGPTYNVNLQTAAAGVDIPT 324 (456) Q Consensus 271 ~~~~~~~~~~~------~~~~~---------~li---------d~~d~~~~~~--~~~sgl~~~~~~~~~~~aaas~IP~ 324 (456) + +.|.+++ ..+|- ++= .++-+++++. -+|+.++| +..|..-+=-+.++|+ T Consensus 296 r---~iM~k~KNklVYDa~TGevrddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~D-V~YF~kKLY~aLnVP~ 371 (564) T protein:vir:10 296 R---DVMSRYRNKLVYDGQTGEIRDDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELKD-VEYFKKKLYNSLNLPP 371 (564) T ss_pred H---HHHHhcCceEEEeccCceecccchhhhhHhhhcccccCCCcccceeeccccCCcchHHH-HHHHHHHHHHHhCCCc Confidence 2 2222221 11110 000 0223455542 36777777 5678888888999999 Q ss_pred EEeeccCCCccc---chH---HHHHHHHHHHHHHHhhhhHHHHHHHHH-HHHhcCcC------CCCceEEEeCCCCCCCH Q lcl|NC_016762. 325 KILVGMQTGERA---SSE---DQKYHNARCQARRVQELTFEINDLFAH-LMRIGVVP------LKAEFTAIWDDLTVPTK 391 (456) Q Consensus 325 t~L~G~sp~Gln---st~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~-l~~s~~~~------~~~d~~~~f~pL~~~se 391 (456) +||-.+ .+|+| |++ |.-.|...|.++|.. +...+..+++. |++-+++. ....+.|.|+-=..-+| T Consensus 372 SRl~~e-~~~f~~Gr~~EItRDEiKF~KFI~RLR~r-Fs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~E 449 (564) T protein:vir:10 372 SRLTDD-NKAFNLGKSTEILRDELKFTKFIGRLRKR-FAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNE 449 (564) T ss_pred ccccCC-CceeecccccchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHH Confidence 999554 44555 332 677899999998875 45656666653 44444442 33578999999889999 Q ss_pred HHHHHHHHHHHHHHHHHHH-cCC-----------cCcCHHHHHHH---hcc---cCCCCCCCCcccCC------------ Q lcl|NC_016762. 392 AERLANSKTMSEINSAAIG-TGE-----------PVFTAEEIREE---AGY---DPLQGGDPLPDTEP------------ 441 (456) Q Consensus 392 ke~Aei~~~~A~a~~~~~~-~g~-----------~~i~~~E~R~~---~~~---~~~~~~~~~~~~~~------------ 441 (456) -..+|+...+..+.+..-. .|. --.+.+|+.+. ... +++. ..|++++ T Consensus 450 lKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~---~~P~e~~~~~~~~~~~~~~ 526 (564) T protein:vir:10 450 LKEQEMQLQRVNLATQMDPFVGKYFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLA---IDPIQVNMLDDMEKQNQAF 526 (564) T ss_pred HHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCC---CCchhhhcCCCccCCCCcC Confidence 9999998888887654321 111 12344444332 110 1110 0010000 Q ss_pred -------CCCCCCCcCCCCCCC Q lcl|NC_016762. 442 -------EDEDAARTDPTGEQQ 456 (456) Q Consensus 442 -------~d~~~~~~d~~~~~e 456 (456) .++.++.+.+.+... T Consensus 527 ~p~~~~~~~~~~~~~~~~~~~~ 548 (564) T protein:vir:10 527 APELQAAQDDLAAEREIKKLNS 548 (564) T ss_pred CcchhhhccccccccChhhhcc Confidence 011111111000000 No 194 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=97.55 E-value=4.7e-05 Score=44.29 Aligned_cols=431 Identities=13% Similarity=0.116 Sum_probs=175.6 Q ss_pred CCc---hhHHHH---hHHHHH---HHHHHHHHHhhhhhccCc-cc---ch--hhh----hcc--C-cccCCHHHHHHHHh Q lcl|NC_016762. 1 MTD---KLDLAV---NHAMSS---AIARARMSLLNQGIGHDA-KR---PQ--AWC----EYG--F-PQEITFNDLYTMYR 58 (456) Q Consensus 1 ~~~---~~~~~~---~~a~~~---~~~~~~d~~~n~~~~~gt-~~---~~--~~~----~~~--~-~~~~~~~~l~~~Y~ 58 (456) |.+ =+++-. +...+. ...-...+++-.-..-|+ .. .+ +++ +|+ + +...+-.+|-..|| T Consensus 1 ~~~~~~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR 80 (524) T protein:vir:10 1 MANFNTILSFLKPWANEDEKEYKQQINNNLESVTAPKLDDGAREIETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTYR 80 (524) T ss_pred CCchhhHHHHhhhhhcchhhhhhhhhccCCCccccCCCCCCceeeccCcccccchhhhhhhhhcccchhhhHHHHHHHHH Confidence 110 000000 000000 000011122221111111 00 11 111 111 1 12334567777777 Q ss_pred c---CchhhhhhccchhHHh--hCCCEEecCCCcchhhhhHHHHHHHHHHHH----HhhHHHHHHHHHHhhcccCceEEE Q lcl|NC_016762. 59 R---GGIAHGAVEKIVTTCW--KTNPQVIEGDDQDRSKDETEWERKNKPLIA----GGRFWRAVSEADRRRLVGRYSGLL 129 (456) Q Consensus 59 ~---~~l~r~iVd~~aed~t--R~~~~i~~~~~~d~~~~~~~~e~~i~~~~~----~l~~~~~~~ea~~~~r~~Ggs~i~ 129 (456) + ++.+..+|+.++.||+ .++-.+++-+= +..+....+..+|..+++ -|++..+-.+..|.--+.| -++ T Consensus 81 ~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L-d~~~~s~siK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDg--Ri~ 157 (524) T protein:vir:10 81 NLMNNYEVDNAVQEIVSDAIVYEDDKEVVALNL-DGTDFSQSIKDKILAEFSEVLNLLNFQRKGTDHFQRWYVDS--RIF 157 (524) T ss_pred HHhhccchhhHHHHhhcceeEecCCCceEEEEe-cccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeec--eEE Confidence 4 8889999999999984 22222222111 223333333344544444 3445555555544333333 233 Q ss_pred EE-ecCCCCccccccCCcCceeEEEEeccccCChhhhhccccc--cccCC-ceeEEEeecc----cCC---ccccceeee Q lcl|NC_016762. 130 LH-IRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDS--ETYGQ-PTMWEYTEAS----QAG---RPGLVRDIH 198 (456) Q Consensus 130 i~-i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s--~~yg~-P~~y~i~~~~----~~g---~~~~~~~IH 198 (456) +. +-|.+.+.+ |...|.+|.|+-..-+ ....+++.. ..+.. -++|..++.. .+| .+...++|+ T Consensus 158 fHkiid~~~pk~----GI~Elr~lDPr~i~~v--r~i~~~~~~~~~vi~~~~e~f~Y~~~~~~~~~~~~~~~~~~~ikI~ 231 (524) T protein:vir:10 158 FHKIINPKKMKD----GVQELRRLDPRQVQYI--REIVTRMEDGVKIVDGYREFFVYDTGHESYCADGRIYSAGTKVKIP 231 (524) T ss_pred EEEEeeCCCccc----cceeeeeeCCccceee--eeecccCcccchhhcchhhheeecCCCcccccCcceecCCcceecc Confidence 32 224333322 2222333333221100 001111100 01111 1222222111 111 223457888 Q ss_pred hhhhheecCCc--CC---CcchHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHH Q lcl|NC_016762. 199 PDRVFILGDWT--GD---AIGFLEPAY---NSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALN 270 (456) Q Consensus 199 ~SRli~~~~~~--~~---G~S~le~~~---~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~ 270 (456) .+-+..-..+- .. =+|.|.++- |.|.-++-+ ..+|+-+ |....- -.=+|+-+|....+ ++-+ T Consensus 232 ~dAIvy~~SGL~d~~~~~i~syLhkAiKp~NQLkm~EDA-----lVIYRit-RAPeRR-vFYIDVGnlPk~KA---eqYl 301 (524) T protein:vir:10 232 RAAVVYAHSGLLDCCGKNIIGYLQRAIKPANQLKLMEDA-----MVIYRIT-RAPDRR-VFYIDTGNMPSRKA---AAQM 301 (524) T ss_pred hhheeeeccCcccCCCCceeccchHhhHHHHhhHHHHhh-----HHHHhhh-ccccce-EEEEecCCCCchhH---HHHH Confidence 88754432111 11 135665543 223222221 1233321 111000 00123333332111 1222 Q ss_pred HHHHHHHHHHh------cCCCe-------EEe-----------cCCCceeEEe--cccCCHHHHHHHHHHHHHhhhcCCe Q lcl|NC_016762. 271 ERFNEAARQLN------RGNDV-------LLP-----------TQGATVTQMV--SAVSDPGPTYNVNLQTAAAGVDIPT 324 (456) Q Consensus 271 ~~~~~~~~~~~------~~~~~-------~li-----------d~~d~~~~~~--~~~sgl~~~~~~~~~~~aaas~IP~ 324 (456) + +.|.+++ .++|- +-+ .++-+++++. -+++-++| +..|..-+=-|.++|+ T Consensus 302 ~---~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~ 377 (524) T protein:vir:10 302 Q---HIMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTMPGATGMSDMDD-VLYFRTALYRALRIPE 377 (524) T ss_pred H---HHHHhcCceeEEeccCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHH-HHHHHHHHHHHhCCCc Confidence 1 1222221 11110 000 0223455553 35677777 5678888889999999 Q ss_pred EEeeccCCCccc---chH---HHHHHHHHHHHHHHhhhhHHHHHHHHH-HHHhcCcC------CCCceEEEeCCCCCCCH Q lcl|NC_016762. 325 KILVGMQTGERA---SSE---DQKYHNARCQARRVQELTFEINDLFAH-LMRIGVVP------LKAEFTAIWDDLTVPTK 391 (456) Q Consensus 325 t~L~G~sp~Gln---st~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~-l~~s~~~~------~~~d~~~~f~pL~~~se 391 (456) +||=..+++|+| |++ |.-.|...|.+.|.. +.+.+..+++. |++-+++. ....+.|.|+-=..-+| T Consensus 378 sRl~~e~~~~f~~gr~~EItRDEiKF~KFI~rLR~r-Fs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~E 456 (524) T protein:vir:10 378 SRIPSESNSGVMFDAGTAITRDELKFAKWIRQLQNK-FEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSE 456 (524) T ss_pred hhccCCCCccccccccchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHH Confidence 999767777776 333 677899999998875 46666666653 44444442 33578999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHH-cCCcCcCHHHHHH-HhcccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 392 AERLANSKTMSEINSAAIG-TGEPVFTAEEIRE-EAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 392 ke~Aei~~~~A~a~~~~~~-~g~~~i~~~E~R~-~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) -..+|+...+..+.+..-. .|. .++-+=++. .+.+.-..-.....-++.+-.++--++|..+.| T Consensus 457 lKe~Eil~~R~~~l~~~dpyvGk-y~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~ 522 (524) T protein:vir:10 457 MKDAEIMERRINMLTMAEPFIGK-YISHQTAMKDFLQMTDEEINQEAKQIEEESKEARFQNPDEEEE 522 (524) T ss_pred HHHHHHHHHHHHHHHHhhhhhcc-cchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCChhhh Confidence 9999999888887665432 121 233333322 111100000000000000000111111111111 No 195 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=97.51 E-value=5.3e-05 Score=44.00 Aligned_cols=431 Identities=13% Similarity=0.120 Sum_probs=179.6 Q ss_pred CCchhHHHHhHHHHHHHHH------HHHHHhhhh---------hccCcccchhh-hhccCc----ccCCHHHHHHHHhc- Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIAR------ARMSLLNQG---------IGHDAKRPQAW-CEYGFP----QEITFNDLYTMYRR- 59 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~------~~d~~~n~~---------~~~gt~~~~~~-~~~~~~----~~~~~~~l~~~Y~~- 59 (456) -..-|++...-+-.+.... -..+++-+- .+.++...... +++ |. ...+-.+|-..||+ T Consensus 6 ~~~~l~~~~~~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~-y~~~e~~~~~~~eLI~~YR~m 84 (524) T protein:vir:98 6 FGNVLSFFKNFAREDEIELEQQLKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQF-YSGQDPAIQNKEQLINTYRGI 84 (524) T ss_pred hhhHHHHhhhhhhhhhhhHhhhhcCCcccccCCCCCCCceeecCCCCcceecceeeee-ccccccccchHHHHHHHHHHH Confidence 1122222211111111100 111222111 11222111111 111 11 12245677777774 Q ss_pred --CchhhhhhccchhHHh--hCCCEEecCCCcchhhhhHHHHHHHHHHHH----HhhHHHHHHHHHHhhcccCceEEEEE Q lcl|NC_016762. 60 --GGIAHGAVEKIVTTCW--KTNPQVIEGDDQDRSKDETEWERKNKPLIA----GGRFWRAVSEADRRRLVGRYSGLLLH 131 (456) Q Consensus 60 --~~l~r~iVd~~aed~t--R~~~~i~~~~~~d~~~~~~~~e~~i~~~~~----~l~~~~~~~ea~~~~r~~Ggs~i~i~ 131 (456) ++.+..+|+.++.||+ .+.-.+++-+= +..+....+..+|..+++ -|++..+..+..|.--+.|.-++-.. T Consensus 85 a~~pEvd~Av~eIVneaIv~~~~~~pV~l~L-~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhki 163 (524) T protein:vir:98 85 MSYPEVENAVSEIIDDAIVNEQGKDIITMDL-AKTNFSKAIQDKIVEEFDNVLNIYDFDNMGARLFRDWYVDSRIYFHKI 163 (524) T ss_pred hhccchhhHHHhhhcceeEecCCCceEEEEe-cccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEEE Confidence 7889999999999983 22222221111 222222333334444443 44555555555443333343333333 Q ss_pred ecCCCCccccccCCcCceeEEEEeccccCChhh-hhcccccccc-CCceeEEEeecc----cCC---ccccceeeehhhh Q lcl|NC_016762. 132 IRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKS-FDEKPDSETY-GQPTMWEYTEAS----QAG---RPGLVRDIHPDRV 202 (456) Q Consensus 132 i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~-~~~Dp~s~~y-g~P~~y~i~~~~----~~g---~~~~~~~IH~SRl 202 (456) + |+++ | + |...|.+|.|+-..-+.... -+.|..+..| |.-++|..++.. .+| .+...++||.+-+ T Consensus 164 i-d~~~---~-k-GI~ELr~lDPr~i~~vr~~~~~~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI 237 (524) T protein:vir:98 164 M-HKDE---S-K-GIRELRQLDPRCMELIRESITETLDGGVKVFRGYREFFVYSAPKAGYTYNGQIYQANQKIKIPRSAI 237 (524) T ss_pred E-cCCC---C-c-ceeeeeeeCCccceeeeeccccccccchhhccceeeeeeeccCCCccccccceecCCCceeechhhe Confidence 3 3222 1 2 33444455543322111000 0011111111 122333333211 112 2334577888865 Q ss_pred heecC----CcCCCcchHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHH Q lcl|NC_016762. 203 FILGD----WTGDAIGFLEPAY---NSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNE 275 (456) Q Consensus 203 i~~~~----~~~~G~S~le~~~---~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~ 275 (456) ..-.. ....=+|.|.++- |.|.-++-+ ..+|+-+ |....- -.-+|+-+|....+ ++-+ .+ T Consensus 238 vy~hSGL~d~~~~iisyLhkAiKp~NQLkm~EDA-----lVIYRit-RAPeRR-vFYIDvGnlPk~KA---eqYl---~~ 304 (524) T protein:vir:98 238 VYAHSGLEDCSNNIIGYLHRAVKPANQLRLLEDA-----MVIYRIT-RAPERR-VFYIDVGQMGGNKA---TQYV---NN 304 (524) T ss_pred eeeccCcccCCCCeeeehhHhhHhHHhhHHHHhh-----HHHHhhh-ccccce-EEEEecCCCCchhH---HHHH---HH Confidence 43211 1111136665543 222222221 1233321 111000 00123333332211 2222 22 Q ss_pred HHHHHh------cCCC-------eEEec-----------CCCceeEEe--cccCCHHHHHHHHHHHHHhhhcCCeEEeec Q lcl|NC_016762. 276 AARQLN------RGND-------VLLPT-----------QGATVTQMV--SAVSDPGPTYNVNLQTAAAGVDIPTKILVG 329 (456) Q Consensus 276 ~~~~~~------~~~~-------~~lid-----------~~d~~~~~~--~~~sgl~~~~~~~~~~~aaas~IP~t~L~G 329 (456) .|.+++ .++| -+-+. ++-+++.+. -+++-++| +..|..-+=-+.++|++||- T Consensus 305 im~k~kNklvYDa~TGevrddrk~msMlEDyWLpRReGgrgTEItTLpggqnlgem~D-V~YF~kkLy~aLnVP~sRl~- 382 (524) T protein:vir:98 305 IAQGLKNRVVYDARTGTVKNQQNNLSMTEDYWLMRRDGKAITEVSTLPGGQNFSDMDD-IKWFNRKLYEALRVPLSRMP- 382 (524) T ss_pred HHHhcCceeEeeccCceeeccccccchhhhhcccccCCCCccceeeccccCCcChHHH-HHHHHHHHHHHhCCCceecc- Confidence 222222 1111 11111 223455543 35667777 56788888899999999983 Q ss_pred cCCCccc---chH---HHHHHHHHHHHHHHhhhhHHHHHHHHH-HHHhcCcC------CCCceEEEeCCCCCCCHHHHHH Q lcl|NC_016762. 330 MQTGERA---SSE---DQKYHNARCQARRVQELTFEINDLFAH-LMRIGVVP------LKAEFTAIWDDLTVPTKAERLA 396 (456) Q Consensus 330 ~sp~Gln---st~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~-l~~s~~~~------~~~d~~~~f~pL~~~seke~Ae 396 (456) ++.+|+| |++ |.-.|...|.+.|.. +.+.+..+++. |++-+++. ....+.|.|.-=..-+|-..+| T Consensus 383 ~~~~~f~~Gr~~EItRDEiKF~KFI~rLR~r-Fs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~E 461 (524) T protein:vir:98 383 RDDGGMQIGGGGEITRDELKFSKFIRTLQIQ-FSPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIE 461 (524) T ss_pred CCCCccccccccchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHH Confidence 2244554 332 777899999998875 45656666653 44444443 2346899999999999999999 Q ss_pred HHHHHHHHHHHHHH-cCCcCcCHHHHHH-HhcccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 397 NSKTMSEINSAAIG-TGEPVFTAEEIRE-EAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 397 i~~~~A~a~~~~~~-~g~~~i~~~E~R~-~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) +...+..+.+..-. .|. .++-+=++. .+.+.-..-.....-++.+-.++--++|.++.| T Consensus 462 il~~R~~~l~~~dpyvGk-y~s~dyi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~p~~e~~ 522 (524) T protein:vir:98 462 ILERRLNLMSQVEGVVGK-YVSHKYIMKEILRMSDEDIDEQAKLIEEESKEERFKNPEAEEE 522 (524) T ss_pred HHHHHHHHHHHhcccccc-ccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCcCCccccc Confidence 99988887765433 221 233333322 111100000000000000001111122333333 No 196 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=97.49 E-value=5.6e-05 Score=43.87 Aligned_cols=397 Identities=11% Similarity=0.061 Sum_probs=169.5 Q ss_pred CCc---------hhHHHHhHHHHHHHHHHHHHHhhh-hhccCcccc--hhhhhccCcccCCHHH-HHHHHhcCchhhhhh Q lcl|NC_016762. 1 MTD---------KLDLAVNHAMSSAIARARMSLLNQ-GIGHDAKRP--QAWCEYGFPQEITFND-LYTMYRRGGIAHGAV 67 (456) Q Consensus 1 ~~~---------~~~~~~~~a~~~~~~~~~d~~~n~-~~~~gt~~~--~~~~~~~~~~~~~~~~-l~~~Y~~~~l~r~iV 67 (456) |+. +.+.....+ ...+...++-+... .-|+ |-.+ ....+........+.+ ++.|-++..-+..++ T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~-~~~~~~~~~~~~~~~~~gl-tp~~l~~il~~a~~gd~~~~~~L~edm~e~D~~i~s~l 78 (526) T protein:vir:79 1 MAQIVDVYGNPIRPQQLREPQ-TSRLAGLAKEFAQHPAKGL-TPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEM 78 (526) T ss_pred CCeeeCCCCCccCccccchhh-hhhhhhhhhhcccCCCCCc-CHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHH Confidence 211 111100001 01122233344322 2233 2111 2233332222222222 223334566666677 Q ss_pred ccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhh-HHHHHHHHHHhhcccCceEEEEEec-CCCCccccccCC Q lcl|NC_016762. 68 EKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGR-FWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDRPARGK 145 (456) Q Consensus 68 d~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~-~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~Pl~~~ 145 (456) .+.-.-.+..-|+|.-+.+++..+. .....++..+.++. +...+.+ +-.+.+||+|++=|.-+ ++..+ . T Consensus 79 ~~Rk~av~~~~w~I~p~~~~~~~~~--~~a~~v~~~l~~~~~~~~~i~~-~ldA~~~G~s~~Ei~w~~~~g~~------~ 149 (526) T protein:vir:79 79 SKRKRAILGLDWAVEPPRNASAAEK--ADADYLHELLLDLEGLEDLLLD-ALDGIGHGYSCIELEWALQGREW------M 149 (526) T ss_pred HHHHHHHhCCCceEecCCCCChHHH--HHHHHHHHHHhcccCHHHHHHH-HHhhhhhcceeEEEEEeecCCce------e Confidence 7666666655557754322211111 11223566665553 4444444 44588999998866432 21111 1 Q ss_pred cCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhhee----cCCcCCCcchHHHHHH Q lcl|NC_016762. 146 LNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFIL----GDWTGDAIGFLEPAYN 221 (456) Q Consensus 146 ~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~----~~~~~~G~S~le~~~~ 221 (456) ...|....+.|.. .|+..- . .+.+.... ..+..+.+-+.+.+ .....+|.++++.||. T Consensus 150 ~~~l~~r~~~~F~--------~~~~~~----~-~l~~~~~~-----~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w 211 (526) T protein:vir:79 150 PLAFHHRPQSWFQ--------LNPEDQ----N-ELRLRDNS-----PAGEALQPFGWIIHRPRARSGYVARSGLFRVLAW 211 (526) T ss_pred EEEeeeecccceE--------eccCCC----c-EEEecCCC-----CCceeecCCceEEEeecCCcCCccccchHHHHHH Confidence 1122222222211 111110 0 01111100 11223333333322 2345679999999986 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhh-HHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEec Q lcl|NC_016762. 222 SFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGE-IASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVS 300 (456) Q Consensus 222 ~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~ 300 (456) ..+--.....-++. |..++.+-- +..--....++-.+.+.+++..+.+ ....+|..+.+++-++. T Consensus 212 ~~~fK~~~~~~w~~-------------F~E~yG~P~~igky~~~a~~~ek~~L~~av~~i~~-da~~iiP~~~~ie~~ea 277 (526) T protein:vir:79 212 PYLFRHYATSDLAE-------------MLEIYGLPIRLGKYPPGTADEEKATLLRAVTGLGH-AAAGIIPETMAIDFQQA 277 (526) T ss_pred HHHHHHhhHHHHHH-------------HHHHcCCceEEEecCCCCCHHHHHHHHHHHHHHhc-CcEEEecCCceeEEeec Confidence 44321111111111 111111100 0000001123344566666666644 45677888888998887 Q ss_pred ccCCH---HHHHHHHHHHHHhhhcCCeEEeeccC------CCcc--cchHH--HHHHHHHHHHHHHhhhhHHHH-HHHHH Q lcl|NC_016762. 301 AVSDP---GPTYNVNLQTAAAGVDIPTKILVGMQ------TGER--ASSED--QKYHNARCQARRVQELTFEIN-DLFAH 366 (456) Q Consensus 301 ~~sgl---~~~~~~~~~~~aaas~IP~t~L~G~s------p~Gl--nst~D--~~nyyd~I~~~Qe~~lrp~L~-~l~~~ 366 (456) +=+|. ..+++..-.+||-+ ++|+. .||. +|-++ .....+.+.+-.. .+...|. .|+.. T Consensus 278 ~~~~~~~f~~li~~~d~~Isk~-------iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~-~i~~tln~~Li~~ 349 (526) T protein:vir:79 278 AQGSSEPFLAMMRQSEDAISKA-------VLGGTLTSTTSQSGGGAFALGQVHNEVRHDILASDAR-QLAATLSRDLLWP 349 (526) T ss_pred CCCCHHHHHHHHHHHHHHHHHH-------HhhhhhccccccCcchhhhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH Confidence 64443 33344444455543 34542 1222 22233 3446666665554 4566675 58888 Q ss_pred HHHhcCcCCCC-c--eEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCC- Q lcl|NC_016762. 367 LMRIGVVPLKA-E--FTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPE- 442 (456) Q Consensus 367 l~~s~~~~~~~-d--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~- 442 (456) |+...++...+ . -.|.|..- |..++ ++.|+++..++..|. .++.+.+|+..+.....+.++....... T Consensus 350 l~~~N~~~~~~~~~~p~~~~~~~------e~eDl-~~~a~~~~~L~~~G~-~i~~~~i~e~~gip~~~~~e~~l~~~~~~ 421 (526) T protein:vir:79 350 LLVLNRPGSPDVRRAPRLVFDLR------EQADI-TSMAQSIPALVNVGL-EIPSAWVYDKLGIPQPAKNEPVLRPAAQP 421 (526) T ss_pred HHHhCCCCcCCccccceEEeCCC------CcccH-HHHHHHHHHHHhCCC-cCCHHHHHHHhCCCCCCCchhhccccCCc Confidence 88887764321 1 13344332 22222 457888888898884 4899999999887544333221100000 Q ss_pred ----CCCCCCc-------CCCCCCC Q lcl|NC_016762. 443 ----DEDAART-------DPTGEQQ 456 (456) Q Consensus 443 ----d~~~~~~-------d~~~~~e 456 (456) ....... .+...++ T Consensus 422 ~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (526) T protein:vir:79 422 AILSRQHGQRVAALATIVGPRYGDQ 446 (526) T ss_pred cccccccccccccccccccccCchh Confidence 0000000 0000000 No 197 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=97.42 E-value=7.1e-05 Score=43.32 Aligned_cols=430 Identities=16% Similarity=0.133 Sum_probs=184.6 Q ss_pred CCchhHHHHhHHHHHH------HHHHHHHHhhhhhccCc-------ccchh----hhhccCc---ccCCHHHHHHHHhc- Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSA------IARARMSLLNQGIGHDA-------KRPQA----WCEYGFP---QEITFNDLYTMYRR- 59 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~------~~~~~d~~~n~~~~~gt-------~~~~~----~~~~~~~---~~~~~~~l~~~Y~~- 59 (456) |..-|++-..-+--+. +.....+++-+-.--|+ ..+.+ +.++.+. ...+-.+|-..||+ T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~YR~m 80 (521) T protein:vir:81 1 MFSRLKMLARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLVNTYRGL 80 (521) T ss_pred CcchhhhhHhhcCchhhhHHhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHHHHHHHH Confidence 7766665433110000 00011222222111121 11111 1122121 23345677777774 Q ss_pred --CchhhhhhccchhHHh--hCCCEEecCCCcchhhhhHHHHHHHHHHHH----HhhHHHHHHHHHHhhcccCceEEEEE Q lcl|NC_016762. 60 --GGIAHGAVEKIVTTCW--KTNPQVIEGDDQDRSKDETEWERKNKPLIA----GGRFWRAVSEADRRRLVGRYSGLLLH 131 (456) Q Consensus 60 --~~l~r~iVd~~aed~t--R~~~~i~~~~~~d~~~~~~~~e~~i~~~~~----~l~~~~~~~ea~~~~r~~Ggs~i~i~ 131 (456) ++.+..+|+.++.||+ .+.-.+++-+= +..+....+..+|..+++ -|++..+-.+..|.--+.|.-++-.. T Consensus 81 a~~pEvd~Av~eIVneaiv~d~~~~pV~l~L-~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhki 159 (521) T protein:vir:81 81 MNNHEVENAVQNIVNDAIVFEEGHEVVSLNL-EATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKI 159 (521) T ss_pred hhccchhhHHHHhhcceeEecCCCceEEEEe-cccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEE Confidence 7889999999999984 22222222111 222223333334444443 34555555555443333343233232 Q ss_pred ecCCCCccccccCCcCceeEEEEeccccC--ChhhhhccccccccCC-ceeEEEeecc----cCC---ccccceeeehhh Q lcl|NC_016762. 132 IRDSQPWDRPARGKLNGLAKVTPAWAGCL--KPKSFDEKPDSETYGQ-PTMWEYTEAS----QAG---RPGLVRDIHPDR 201 (456) Q Consensus 132 i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~--~~~~~~~Dp~s~~yg~-P~~y~i~~~~----~~g---~~~~~~~IH~SR 201 (456) + | +. | +.|...|.+|.|+-..-+ .+.+ + .+.-..++. -++|..++.. .+| .+..+++||.+- T Consensus 160 i-d-~~---p-k~GI~Elr~lDPr~i~~vr~i~k~-~-~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dA 231 (521) T protein:vir:81 160 I-G-KN---P-KDGIVELRQLDPRNLEYVREIITE-D-TPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSA 231 (521) T ss_pred E-c-CC---c-cccceeeeeeCCcceeeeeeeccc-c-cCccceecceeeeeeeecCCccccccceeecCCcceeechhh Confidence 3 3 22 2 223333444444321111 1100 0 011111111 1222222211 011 233456777764 Q ss_pred hheecCCc---C-C--CcchHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHH Q lcl|NC_016762. 202 VFILGDWT---G-D--AIGFLEPAY---NSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNER 272 (456) Q Consensus 202 li~~~~~~---~-~--G~S~le~~~---~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~ 272 (456) + .|+... . . =+|.|.++- |.|.-++-+ ..+|+-+ |....- -.-+|+-+|....+ ++-++ T Consensus 232 I-~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDA-----lVIYRit-RAPeRR-vFYIDvGnlpk~KA---eqYl~- 299 (521) T protein:vir:81 232 I-TYAHSGLMDCDDKYIIGYLHRAVKPANQLKLLEDA-----MVVYRIT-RAPERR-VFFIDTGNMNNRKA---AQHMN- 299 (521) T ss_pred e-eeeeccceeCCCCeeeecchhhhHhHHhhHHHHhh-----HHHHhhh-ccccce-EEEEecCCCCchhH---HHHHH- Confidence 3 333211 0 1 135665543 223222221 1233321 111000 00123333332211 12222 Q ss_pred HHHHHHHHhc------CCC-------eEEec-----------CCCceeEEe--cccCCHHHHHHHHHHHHHhhhcCCeEE Q lcl|NC_016762. 273 FNEAARQLNR------GND-------VLLPT-----------QGATVTQMV--SAVSDPGPTYNVNLQTAAAGVDIPTKI 326 (456) Q Consensus 273 ~~~~~~~~~~------~~~-------~~lid-----------~~d~~~~~~--~~~sgl~~~~~~~~~~~aaas~IP~t~ 326 (456) +.|.+++. .+| .+-+. ++-+++++. -+++.++| +..|..-+=-|.++|++| T Consensus 300 --~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~sR 376 (521) T protein:vir:81 300 --SVAQSFKNRVVYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDD-IRYFNRKLYEALRVPLSR 376 (521) T ss_pred --HHHHhcCceeEeecccccccccccccchhhhhcccccCCCcccceeecccCCCCChHHH-HHHHHHHHHHHhCCcccc Confidence 22222211 111 11111 233566553 36677777 567888888999999999 Q ss_pred eeccCCCccc---chH---HHHHHHHHHHHHHHhhhhHHHHHHHHH-HHHhcCcC------CCCceEEEeCCCCCCCHHH Q lcl|NC_016762. 327 LVGMQTGERA---SSE---DQKYHNARCQARRVQELTFEINDLFAH-LMRIGVVP------LKAEFTAIWDDLTVPTKAE 393 (456) Q Consensus 327 L~G~sp~Gln---st~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~-l~~s~~~~------~~~d~~~~f~pL~~~seke 393 (456) |-.++.+|+| |++ |.-.|...|.+.|.. +.+.+..+++. |++-+++. ....+.|.|+-=..-+|-. T Consensus 377 l~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~r-Fs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElK 455 (521) T protein:vir:81 377 SNLSDANMVIGGDGSEITRDELEFSKFIRTRQSQ-FSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVK 455 (521) T ss_pred ccCCCCcceeccccchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHH Confidence 9767777776 232 677899999999875 46666666653 44444443 2346899999999999999 Q ss_pred HHHHHHHHHHHHHHHHH-cCCcCcCHHHHHH-HhcccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 394 RLANSKTMSEINSAAIG-TGEPVFTAEEIRE-EAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 394 ~Aei~~~~A~a~~~~~~-~g~~~i~~~E~R~-~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) .+|+...+..+.+..-. .|. .++-+=+++ .+.+.-..-.....-++.+-.++--++|.++.| T Consensus 456 e~Eil~~R~~~l~~~dpyvGk-y~s~dyi~k~ILr~tDeei~~~~k~I~~E~~~~~~~~p~~~~~ 519 (521) T protein:vir:81 456 DAEILERRIGLIERITPYIGK-YFSNQTVMRDILKYTDDQMDTEKKQIEEEANDPRFKQTPDEIE 519 (521) T ss_pred HHHHHHHHHHHHHHhhhhhcc-ccchHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCCccccc Confidence 99999888887765432 121 233333322 111110000000000000001111112222222 No 198 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=97.35 E-value=8.7e-05 Score=42.85 Aligned_cols=400 Identities=11% Similarity=0.034 Sum_probs=176.9 Q ss_pred CCchhHH-------H-HhHHHHHHHHHHHHHHhh-hhhccCcccc-hhhhhccCcccCCHHHH-HHHHhcCchhhhhhcc Q lcl|NC_016762. 1 MTDKLDL-------A-VNHAMSSAIARARMSLLN-QGIGHDAKRP-QAWCEYGFPQEITFNDL-YTMYRRGGIAHGAVEK 69 (456) Q Consensus 1 ~~~~~~~-------~-~~~a~~~~~~~~~d~~~n-~~~~~gt~~~-~~~~~~~~~~~~~~~~l-~~~Y~~~~l~r~iVd~ 69 (456) |+.=+.. . .+.-....+...++-+.. +..|+.-.|= ....+........+.+| +.|.+++.-+..++++ T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~~~m~e~D~~i~s~l~~ 80 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQQLRKQQTAHLAGLAKEFANHPAKGLTPAKLAHILIEAEQGHLQAQAELFMDMEERDAHLFAEMSK 80 (528) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHHH Confidence 3221100 0 000000112223344433 2333321121 22333322222222233 2333468888888888 Q ss_pred chhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccccccCCcCc Q lcl|NC_016762. 70 IVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDRPARGKLNG 148 (456) Q Consensus 70 ~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~Pl~~~~~~ 148 (456) .-.-.+..-|+|..+..+...+ ......+++.+.++.-+..+..-+-.+.+||+|++=|.-. ++..+ . T Consensus 81 Rk~av~~~~w~I~p~~~~~~~~--~~~a~~v~~~l~~~~~f~~~i~~~lda~~~G~s~~Ei~w~~~~g~~---------~ 149 (528) T protein:vir:10 81 RKRAVLGLDWTIEPPRNASAAE--KADAEYLHELLLDLEGIEDLMLDCMDGVGHGYSAIELDWSLQGREW---------L 149 (528) T ss_pred HHHHHhcCCceEecCCCCCHHH--HHHHHHHHHHHhCCccHHHHHHHHHhhhhhcceeEEEEEeecCCce---------e Confidence 8888877666776443222111 1222345666666543444555555688999998865431 22111 1 Q ss_pred eeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhhee----cCCcCCCcchHHHHHHHHH Q lcl|NC_016762. 149 LAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFIL----GDWTGDAIGFLEPAYNSFI 224 (456) Q Consensus 149 l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~----~~~~~~G~S~le~~~~~l~ 224 (456) +..+.++.+..+ ..|+. +... +.+.... ..+..+.+-+.+.+ ......|.+++..||...+ T Consensus 150 ~~~~~~r~~~~f-----~~~~~----~~~~-l~~~~~~-----~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~ 214 (528) T protein:vir:10 150 PQAFDHRPQSWF-----QLNPD----DQDE-LRLRDNS-----IAGEVLQPFGWIMHKPRSRSGYVARSGLFRVLAWPYL 214 (528) T ss_pred EEEeeeecccce-----eeccC----CCcE-EeccCCC-----CCceeecCCCeEEEeecCCCCCccccchHHHHHHHHH Confidence 222322221111 11111 1111 1121111 11233444443333 2345568999999986543 Q ss_pred HHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhh-HHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEecccC Q lcl|NC_016762. 225 SLEKVEGGSGESFLKNAARQLLLNFDKEINLGE-IASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSAVS 303 (456) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~~s 303 (456) --.....-++. |..++.+-- +..--....++-.+.+.+.+..+.+ ....+|..+.+++-++.+-+ T Consensus 215 fK~~~~~~w~~-------------f~E~yG~P~~igky~~~a~~~ek~~L~~al~~i~~-~~~~iiP~~~~ie~~ea~~~ 280 (528) T protein:vir:10 215 FKHYSTADLAE-------------MLEIYGLPIRLGKYPPGTPDEEKVTLLRAVTGLGH-AAAGIIPESMSIDFQEASKG 280 (528) T ss_pred HHHhhHHHHHH-------------HHHHcCCCeEEEecCCCCCHHHHHHHHHHHHHHhh-CcEEEecCCceeEEeecCCC Confidence 21111111211 111111100 0000001123344566666666654 35677888889988887544 Q ss_pred CH---HHHHHHHHHHHHhhhcCCeEEeeccC------CCcc--cchHH--HHHHHHHHHHHHHhhhhHHHH-HHHHHHHH Q lcl|NC_016762. 304 DP---GPTYNVNLQTAAAGVDIPTKILVGMQ------TGER--ASSED--QKYHNARCQARRVQELTFEIN-DLFAHLMR 369 (456) Q Consensus 304 gl---~~~~~~~~~~~aaas~IP~t~L~G~s------p~Gl--nst~D--~~nyyd~I~~~Qe~~lrp~L~-~l~~~l~~ 369 (456) +. ..+++..-.+||-+ ++|+. .||. ++-++ .....+.+.+-.. .+...|. .|+..|+. T Consensus 281 ~~~~f~~li~~~d~~Isk~-------iLGqtlTs~~~~g~~gS~Alg~vh~~v~~di~~aDa~-~i~~tln~~li~~l~~ 352 (528) T protein:vir:10 281 SAEPFMAMMRWCDDSMSKA-------ILGGTLTSQTSESGGGAYALGQVHNEVRHDLLAADAR-QLAATLSRDLLWPLLV 352 (528) T ss_pred ChhHHHHHHHHHHHHHHHH-------HhhhhhhccccccccchhhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH Confidence 43 33444444555553 34431 1222 22233 2445666665554 4566675 58888888 Q ss_pred hcCcCCCC---ceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccC-CCCCC Q lcl|NC_016762. 370 IGVVPLKA---EFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTE-PEDED 445 (456) Q Consensus 370 s~~~~~~~---d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~-~~d~~ 445 (456) ..+++..+ --.|.|..- |..++ ++.|+++..++..|. .++.+++|+..+.....+++...... ..... T Consensus 353 ~N~~~~~~~~~~p~~~~~~~------e~eDl-~~~a~~~~~L~~~G~-~i~~~~i~e~~gip~p~~~e~~~~~~~~~~~~ 424 (528) T protein:vir:10 353 LNRSGNLDARRAPRLVFDLK------DRADL-AAMATSLPPLVKLGV-QVPVNWVQEQLGIPLPANGEAVLGDQAGAGIA 424 (528) T ss_pred hCCCCCCCccccceEEecCC------CcccH-HHHHHHHHHHHhCCC-CCCHHHHHHHhCCCCCCCCcccccCCCccccc Confidence 88774211 123444333 22223 357888888888884 48999999998875543332211000 00000 Q ss_pred C--C-----------CcCCCC-CCC Q lcl|NC_016762. 446 A--A-----------RTDPTG-EQQ 456 (456) Q Consensus 446 ~--~-----------~~d~~~-~~e 456 (456) + . ...+.. +++ T Consensus 425 ~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (528) T protein:vir:10 425 QLSRRPGPRIAALAQVIGPRYRDQE 449 (528) T ss_pred ccCcccccccccccccccccccccc Confidence 0 0 000000 000 No 199 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=97.33 E-value=5.1e-05 Score=44.10 Aligned_cols=315 Identities=12% Similarity=0.007 Sum_probs=142.3 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHh--hhhhccCccc--chhh---hhccCcccCCHHHHHHHHhcCchhhhhhccchhH Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLL--NQGIGHDAKR--PQAW---CEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTT 73 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~--n~~~~~gt~~--~~~~---~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed 73 (456) |+.|... ...+.+.+-....-.|. ....-+++.. +-.. +.-+|...+++.-|..+++.|.....+|..-+.+ T Consensus 1 m~~~~~~-~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~a~~~h~s~i~~k~n~ 79 (340) T protein:vir:98 1 MSKRKPR-KAVAMTASAPQKMEAFTFGEPVPVLDKRDILDYVECISNGKWYEPPVSFSGLAKSLRSAVHHSSPIYVKRNV 79 (340) T ss_pred CCCCCCC-ccccccccCccceeEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHHhccccchhhhhhhhH Confidence 9965432 11121111111111110 0000011111 1111 1114566888889999999998777777665544 Q ss_pred HhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCceeEEE Q lcl|NC_016762. 74 CWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVT 153 (456) Q Consensus 74 ~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~ 153 (456) ..+ +++- . + ..+ +..+++ ++-.-.++|-+++.+. +++ .+.+..+. T Consensus 80 l~~-~~~P--n--~----~lt-----------~~~f~~----~~~d~ll~Gnay~~~~-rn~----------~G~~~~L~ 124 (340) T protein:vir:98 80 LAS-TYIP--H--P----LLS-----------RQDFSR----FALDYLVFGNAFLEQR-HSV----------TGQLIKLL 124 (340) T ss_pred Hhh-ccCC--C--C----CCC-----------HHHHHH----HHHHHHhcCCeEEEEE-ECC----------CCcEEEEE Confidence 332 2211 0 0 001 111121 1112346787777654 221 12234444 Q ss_pred EeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHHHHH Q lcl|NC_016762. 154 PAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISLEKV 229 (456) Q Consensus 154 ~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~~~~ 229 (456) |.....+... .|. + .+|++.. +| ....+++.-|+||... ...|.|.+......+.- ... T Consensus 125 pl~~~~vr~~---~~~---~----~~~~~~~---~~---~~~~~~~~eViHir~~~~~~~~~Gls~~~~a~~si~l-~~a 187 (340) T protein:vir:98 125 TSPAKYTRRG---VDD---S----VFWFVEN---FT---QPHEFAPDTVFHLLEPDINQEIYGLPEYLSALNSAWL-NES 187 (340) T ss_pred EeCCceEEEc---ccC---c----EEEEEec---CC---eEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHH-HHH Confidence 4433222221 111 1 2566642 11 2356778888888543 23588888877765543 233 Q ss_pred HHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHh--cCCCeEEec-C---CCceeEEecccC Q lcl|NC_016762. 230 EGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLN--RGNDVLLPT-Q---GATVTQMVSAVS 303 (456) Q Consensus 230 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~lid-~---~d~~~~~~~~~s 303 (456) +......+|+|....-.+-+ +. + ..-.++..+++.++++..+ .|.+.+++. . ++.++...++.+ T Consensus 188 a~~~~~~~f~NGa~pg~il~-----~~---~--~~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~g~~~~pls~~ 257 (340) T protein:vir:98 188 ATLFRRKYYQNGAHAGYIMY-----VT---D--PAQSATDVESLRDAMRNSKGLGNFKNLFFYSPNGKPDGIKIVPLSEV 257 (340) T ss_pred HHHHHHHHHhccCCCceEEE-----ec---C--CCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCC Confidence 44444556666432221111 00 0 0011233445545554432 122333433 2 233444444343 Q ss_pred C----HHHHHHHHHHHHHhhhcCCeEEeeccCCC---cccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC Q lcl|NC_016762. 304 D----PGPTYNVNLQTAAAGVDIPTKILVGMQTG---ERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL 375 (456) Q Consensus 304 g----l~~~~~~~~~~~aaas~IP~t~L~G~sp~---Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~ 375 (456) . +-++-....+.||++-+||-. |+|..+. |++..+ -.+.|+.. .|.|.++++-+ +..+ +++ T Consensus 258 ~~d~qf~e~k~~~~~eIa~a~~VPp~-llGi~~~~t~~~sn~e~~~~~f~~~-------~l~Pl~~~iee-~n~~-L~~- 326 (340) T protein:vir:98 258 ATKDDFFNIKKASAADLMDAHRVPFQ-LMGGKPENIGSLGDVEKVAKVFVRN-------ELSPLQDRFRE-VNDW-LGM- 326 (340) T ss_pred hhHHHHHHHHHhhHHHHHHHhCCCHH-HhcccCCCCCccccHHHHHHHHHHH-------HHHHHHHHHHH-HHhc-ccc- Confidence 2 345556677889999999985 7787553 333333 34455543 47777766654 2221 221 Q ss_pred CCceEEEeCCCCCCCHH Q lcl|NC_016762. 376 KAEFTAIWDDLTVPTKA 392 (456) Q Consensus 376 ~~d~~~~f~pL~~~sek 392 (456) ++ |+|++-.-++.. T Consensus 327 --e~-~rF~~~~l~~~d 340 (340) T protein:vir:98 327 --EV-IRFKEYTLDNPE 340 (340) T ss_pred --cc-cccCccccccCC Confidence 21 455554333333 No 200 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=97.25 E-value=0.00012 Score=42.12 Aligned_cols=398 Identities=12% Similarity=0.088 Sum_probs=172.9 Q ss_pred CCchhHH---------HHhHHHHHHHHHHHHHHhh-hhhccCccc-chhhhhccCcccCCHHHH-HHHHhcCchhhhhhc Q lcl|NC_016762. 1 MTDKLDL---------AVNHAMSSAIARARMSLLN-QGIGHDAKR-PQAWCEYGFPQEITFNDL-YTMYRRGGIAHGAVE 68 (456) Q Consensus 1 ~~~~~~~---------~~~~a~~~~~~~~~d~~~n-~~~~~gt~~-~~~~~~~~~~~~~~~~~l-~~~Y~~~~l~r~iVd 68 (456) |+.=+.+ ....+ ...+...++-+.. +..|+.-.| +....+........+.+| +.|.+++.-+..++. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~-~~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e~m~e~D~~i~s~l~ 79 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQLREPQ-TSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMS 79 (526) T ss_pred CCeeECCCCCccccccccchh-hhhhhhhhhhhcccCcCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHH Confidence 2211100 00000 0112222333332 233332112 122333322222222233 344446777777888 Q ss_pred cchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhh-HHHHHHHHHHhhcccCceEEEEEec-CCCCccccccCCc Q lcl|NC_016762. 69 KIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGR-FWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDRPARGKL 146 (456) Q Consensus 69 ~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~-~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~Pl~~~~ 146 (456) +.-.-.+..-|+|..+..++..+. .....++..+.++. +...+.++ -.+.+||+|++=|.-. ++..+ .. T Consensus 80 ~Rk~av~~~~w~I~p~~~~~~~~~--~~a~~v~~~l~~~~~~~~~i~~~-lda~~~G~s~~Eivw~~~~g~~------~~ 150 (526) T protein:vir:99 80 KRKRAILGLDWAVEPPRNASAAEK--ADADYLHELLLDLEGLEDLLLDA-LDGIGHGYSCIELEWALQGREW------MP 150 (526) T ss_pred HHHHHHhCCCceEecCCCCCHHHH--HHHHHHHHHHhcccCHHHHHHHH-HHhhhhcceeEEEEEeecCCce------eE Confidence 777777766667754332221111 11234566666553 44444444 4688999998866432 21111 11 Q ss_pred CceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhhee----cCCcCCCcchHHHHHHH Q lcl|NC_016762. 147 NGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFIL----GDWTGDAIGFLEPAYNS 222 (456) Q Consensus 147 ~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~----~~~~~~G~S~le~~~~~ 222 (456) ..|..+.+.|-. -|+..- . .+.+.... ..+..+.+-+.+.+ .....+|.++++.||.. T Consensus 151 ~~l~~r~~~~f~--------~~~~~~----~-~l~~~~~~-----~~g~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~ 212 (526) T protein:vir:99 151 LAFHHRPQSWFQ--------LNPEDQ----N-ELRLRDNS-----PAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWP 212 (526) T ss_pred EEeeeeccccee--------eccCCC----c-EEEecCCC-----CCceeecCCCeEEEeecCCcCCccccchHHHHHHH Confidence 122222222211 111110 0 11111100 11233444443333 23456799999999864 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhh-HHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEecc Q lcl|NC_016762. 223 FISLEKVEGGSGESFLKNAARQLLLNFDKEINLGE-IASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSA 301 (456) Q Consensus 223 l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~ 301 (456) .+--.....-++. |..++.+-- +..--....++-.+.+.+++..+.+ ....+|..+.+++-++.+ T Consensus 213 ~~fK~~~~~~w~~-------------f~E~yG~P~~igky~~~a~~~ek~~L~~av~~i~~-d~~~iiP~~~~ie~~ea~ 278 (526) T protein:vir:99 213 YLFRHYATSDLAE-------------MLEIYGLPIRLGKYPPGTADEEKATLLRAVTGLGH-AAAGIIPETMAIDFQQAA 278 (526) T ss_pred HHHHHhhHHHHHH-------------HHHHcCCceEEEecCCCCCHHHHHHHHHHHHHHhh-CcEEEecCCceeEEeecC Confidence 4321111111211 111111100 0000001123344556566666644 456778888889888876 Q ss_pred cCCH---HHHHHHHHHHHHhhhcCCeEEeeccC------CCccc--chHH--HHHHHHHHHHHHHhhhhHHHH-HHHHHH Q lcl|NC_016762. 302 VSDP---GPTYNVNLQTAAAGVDIPTKILVGMQ------TGERA--SSED--QKYHNARCQARRVQELTFEIN-DLFAHL 367 (456) Q Consensus 302 ~sgl---~~~~~~~~~~~aaas~IP~t~L~G~s------p~Gln--st~D--~~nyyd~I~~~Qe~~lrp~L~-~l~~~l 367 (456) -+|. ..+++..-.+||-+ ++|+. .||.+ +-++ .....+.+.+-.. .+...|. .|+..| T Consensus 279 ~~~~~~f~~li~~~d~~Isk~-------iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~-~i~~tln~~Li~~l 350 (526) T protein:vir:99 279 QGSSEPFLAMMRQSEDAISKA-------VLGGTLTSTTSQSGGGAFALGQVHNEVRHDLLASDAR-QLAATLSRDLLWPL 350 (526) T ss_pred CCCHHHHHHHHHHHHHHHHHH-------HhhhhhccccccCcchhhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH Confidence 4443 33444444455543 35542 12222 2233 2345555555554 4566675 488888 Q ss_pred HHhcCcCCCC---ceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccC--C- Q lcl|NC_016762. 368 MRIGVVPLKA---EFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTE--P- 441 (456) Q Consensus 368 ~~s~~~~~~~---d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~--~- 441 (456) +...++...+ --.|+|..- |..++ +..|+++..++..|. .++.+++++..+.....+.+..-... + T Consensus 351 ~~~N~~~~~~~~~~p~~~~~~~------e~eDl-~~~a~~~~~L~~~G~-~i~~~~i~e~~Gip~~~~~e~~l~~~~~~~ 422 (526) T protein:vir:99 351 LVLNRPGSPDVRRAPRLVFDLR------EQADI-TSMAQSIPALVNVGL-EIPSAWVYDKLGIPQPAKNEPVLRSAAQPA 422 (526) T ss_pred HHhCCCCcCCccccceEEeCCC------CcccH-HHHHHHHHHHHhCCC-ccCHHHHHHHhCCCCCCCcccccCCCCCCc Confidence 8887763211 123444332 22222 357888888898884 48999999998875433322210000 0 Q ss_pred ---CCCCC------CCcCCC-CCCC Q lcl|NC_016762. 442 ---EDEDA------ARTDPT-GEQQ 456 (456) Q Consensus 442 ---~d~~~------~~~d~~-~~~e 456 (456) ..... ....+. .+++ T Consensus 423 ~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (526) T protein:vir:99 423 ILSRQHGQRVAALATIVGPRYGDQQ 447 (526) T ss_pred ccccccccccccccccccccCcchh Confidence 00000 000000 0000 No 201 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=97.20 E-value=0.00013 Score=41.86 Aligned_cols=422 Identities=14% Similarity=0.129 Sum_probs=175.9 Q ss_pred CCc--------hhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhc--cC-cccCCHHHHHHHHhc---Cchhhhh Q lcl|NC_016762. 1 MTD--------KLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEY--GF-PQEITFNDLYTMYRR---GGIAHGA 66 (456) Q Consensus 1 ~~~--------~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~--~~-~~~~~~~~l~~~Y~~---~~l~r~i 66 (456) |++ +.++..+ +.+=+---.-|+..++.+| ++... +. +...+-.+|-..||+ ++.+..+ T Consensus 1 m~~lfgf~~~~~~~~~~~-~~s~~~p~~ddg~~~~~~~-------g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~A 72 (558) T protein:vir:10 1 MAKLFGFSIEETQKKSTS-IISPVPKNNEDGVDNFISS-------GFYGQYVDIEGAYRSEYDLIRRYREMALHPEADGA 72 (558) T ss_pred CcchhcchhhhhhhhccC-CccccCCCccccccceecc-------ceeeeeecccchhhhHHHHHHHHHHHhhccchhhH Confidence 332 1111111 1000000011111111111 11111 11 123455677777774 7889999 Q ss_pred hccchhHHh--hCCCEEecCCCcchhhhhHHHHHHHHHHHH----HhhHHHHHHHHHHhhcccCceEEEEEecCCCCccc Q lcl|NC_016762. 67 VEKIVTTCW--KTNPQVIEGDDQDRSKDETEWERKNKPLIA----GGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDR 140 (456) Q Consensus 67 Vd~~aed~t--R~~~~i~~~~~~d~~~~~~~~e~~i~~~~~----~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~ 140 (456) |+.++.||+ .+.-.+++-+ -+..+....+.++|..+++ -|++..+..+..|.--+.|. ..+-.+-|.+.+.+ T Consensus 73 v~eIVneaiv~d~~~~pV~i~-Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgR-iyfHKiid~k~pk~ 150 (558) T protein:vir:10 73 IEDVVNEAIVSDLYDSPVEVE-LSNLNASNTLKKKIREEFRYIKEMMDFDKKSHEIFRNWYVDGR-VFYLKVIDTKNPQE 150 (558) T ss_pred HHHhhcceeEecCCCceEEEE-ecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeE-EEEEEEEeCCCccc Confidence 999999984 2222222111 0122222233344544444 44555655555543333332 23333335444332 Q ss_pred cccCCcCceeEEEEeccccCChh--h--------hhcccccccc---CCceeEEEeecc-----cCC--ccccceeeehh Q lcl|NC_016762. 141 PARGKLNGLAKVTPAWAGCLKPK--S--------FDEKPDSETY---GQPTMWEYTEAS-----QAG--RPGLVRDIHPD 200 (456) Q Consensus 141 Pl~~~~~~l~~i~~~~~~~~~~~--~--------~~~Dp~s~~y---g~P~~y~i~~~~-----~~g--~~~~~~~IH~S 200 (456) +...|.+|.|+-..-+... + ..++... -. +--+||..++.. .+| ......+|+.+ T Consensus 151 ----GI~ELr~lDPr~i~~Vr~i~~~~~~~~~~~~~~~~~~-~~~~~~~~eyy~Y~~~~~~~~~~~~~~~~~~~vkI~~d 225 (558) T protein:vir:10 151 ----GIQDLRYIDPLKIKFIRQEKRKPGNQDPAIRVRSEQD-VVPNPEFEEFYIYTPKVQHPTGMVGQMGGKNSIKIAKD 225 (558) T ss_pred ----cceeeeeeCcccceeeeeeccccccccceeeeecccc-eeeccceeEeeeecCCcccccccceeecCCCceeechh Confidence 2333444444322111110 0 0111000 01 112333333211 111 12234667666 Q ss_pred hhheecCC------cCCCcchHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHH Q lcl|NC_016762. 201 RVFILGDW------TGDAIGFLEPAY---NSFISLEKVEGGSGESFLKNAARQLL-LNFDKEINLGEIASTYGVTLDALN 270 (456) Q Consensus 201 Rli~~~~~------~~~G~S~le~~~---~~l~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~ 270 (456) -+ .|+.. ...=.|.|.++- |.|.-++-+ ..+|+-+ |... ..| -+|+-+|....+ ++-+ T Consensus 226 AI-~y~hSGL~d~~~~~i~syLhkAIKp~NQLkmlEDA-----lVIYRit-RAPERRvF--YIDVGnLPk~KA---eqYl 293 (558) T protein:vir:10 226 SI-TMCTSGLVDRNKNRVLSYLHKAIKALNQLRMIEDS-----LVIYRLS-RAPERRIF--YIDVGNLPKVKA---EQYL 293 (558) T ss_pred he-eeecccceecCCCeeeecchHhhHhHHhhHHHHhh-----HHHHhhh-ccccceEE--EEecCCCCchhH---HHHH Confidence 33 33211 111135565443 222222221 1233321 1110 000 123333332211 1222 Q ss_pred HHHHHHHHHHh------cCCCe---------EEe---------cCCCceeEEe--cccCCHHHHHHHHHHHHHhhhcCCe Q lcl|NC_016762. 271 ERFNEAARQLN------RGNDV---------LLP---------TQGATVTQMV--SAVSDPGPTYNVNLQTAAAGVDIPT 324 (456) Q Consensus 271 ~~~~~~~~~~~------~~~~~---------~li---------d~~d~~~~~~--~~~sgl~~~~~~~~~~~aaas~IP~ 324 (456) + +.|.+++ .++|- ++= .++-+++.+. -+++.++| +..|..-+=-+.++|+ T Consensus 294 r---~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnLgem~D-V~YF~kKLy~aLnVP~ 369 (558) T protein:vir:10 294 K---EVMSRYRNKLVYDANTGEVRDDRKFMSMMEDFWLPRREGGRGTEITTLPGGQNLGELSD-VDYFQKKLYRALGVPE 369 (558) T ss_pred H---HHHHhccceEEEeccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcchHHH-HHHHHHHHHHHhCCCc Confidence 2 2222221 11110 000 0223455542 36777777 5678888889999999 Q ss_pred EEeeccCCCccc---chH---HHHHHHHHHHHHHHhhhhHHHHHHHHH-HHHhcCcC------CCCceEEEeCCCCCCCH Q lcl|NC_016762. 325 KILVGMQTGERA---SSE---DQKYHNARCQARRVQELTFEINDLFAH-LMRIGVVP------LKAEFTAIWDDLTVPTK 391 (456) Q Consensus 325 t~L~G~sp~Gln---st~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~-l~~s~~~~------~~~d~~~~f~pL~~~se 391 (456) +||=.+ +|+| |++ |.-.|...|.+.|.. +...+..+++. |++-+++. ....+.|.|+-=..-+| T Consensus 370 SRl~~e--~~f~~Gr~~EItRDEiKF~KFI~RLR~r-Fs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~E 446 (558) T protein:vir:10 370 SRIAAE--GGFNLGRSSEILRDELKFAKFVGRLRKR-FAAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAE 446 (558) T ss_pred cccCCC--CcccccccchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHH Confidence 998443 5676 333 667899999998875 45656666653 44444432 33578999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHH-cCC-----------cCcCHHHHHHH---hcc---cCCCCC---------CCCccc----- Q lcl|NC_016762. 392 AERLANSKTMSEINSAAIG-TGE-----------PVFTAEEIREE---AGY---DPLQGG---------DPLPDT----- 439 (456) Q Consensus 392 ke~Aei~~~~A~a~~~~~~-~g~-----------~~i~~~E~R~~---~~~---~~~~~~---------~~~~~~----- 439 (456) -..+|+...+..+.+..-. .|. --.+.+|+.+. ... +|+... .+.+.. T Consensus 447 lKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~ 526 (558) T protein:vir:10 447 LKESELMEGRLGMLATIEPYIGKYYSTEYVRKRVLRQTDMEIEEIDTQIEDEIQKGIIPDPSQIDPITGEPLPQEGDPAM 526 (558) T ss_pred HHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCccccChhhccccCccCCchh Confidence 9999999888887665432 121 12344455332 111 111110 000000 Q ss_pred CCCCCCCCCcCCCCCCC Q lcl|NC_016762. 440 EPEDEDAARTDPTGEQQ 456 (456) Q Consensus 440 ~~~d~~~~~~d~~~~~e 456 (456) ++....++.++..+.++ T Consensus 527 ~~~~~~~~~~~~~~~~~ 543 (558) T protein:vir:10 527 EGMGEQPVDPDLEAQAQ 543 (558) T ss_pred ccCCCCCcccccccchh Confidence 00011111112111111 No 202 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=97.16 E-value=0.00015 Score=41.59 Aligned_cols=316 Identities=12% Similarity=-0.020 Sum_probs=141.1 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhc-----cCc--ccchhh---hhccCcccCCHHHHHHHHhcCchhhhhhccc Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIG-----HDA--KRPQAW---CEYGFPQEITFNDLYTMYRRGGIAHGAVEKI 70 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~-----~gt--~~~~~~---~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~ 70 (456) |+.|++--...+...+.. +.-....+..| +++ -.|-.. +.-+|-..+++..|..+++.|.....+|..- T Consensus 1 m~~~~~~~~~~~~~~~~~-~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~~~i~~k 79 (344) T protein:vir:60 1 MSKKKGKTLQPAAKKMTA-SAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPISFTGLAKSLRAAVHHSSPIYVK 79 (344) T ss_pred CCcccCCCCCchHHhhcC-CcCcEEEEEcCCceeecCCcchhHHHHhhhcCccccCCCCHHHHHHHHHhhhhhccchhhh Confidence 998874211101111000 00011111111 111 112221 1224666788888988888888777766654 Q ss_pred hhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCcee Q lcl|NC_016762. 71 VTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLA 150 (456) Q Consensus 71 aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~ 150 (456) .....+ +++- .. ..+ + ..|..++..-.++|-|++.+.- ++ .+.++ T Consensus 80 ~n~l~~-~~~P--n~------~~t-----------~----~~f~~~~~d~ll~Gnay~~i~r-n~----------~G~~~ 124 (344) T protein:vir:60 80 RNILAS-TFIP--HP------WLS-----------Q----QDFSRFVLDFLVFGNAFLEKRY-ST----------TGKVI 124 (344) T ss_pred hhHHHh-hccC--CC------CCC-----------H----HHHHHHHHHHHhcCCeEEEEEE-CC----------CCcEE Confidence 443322 2210 00 001 0 0111111123467877776543 21 12233 Q ss_pred EEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHH Q lcl|NC_016762. 151 KVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISL 226 (456) Q Consensus 151 ~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~ 226 (456) .+.|+....+... .|. + .+|++.. + +..+.+.+..|||+... ...|+|-++.+...+.- T Consensus 125 ~L~~l~~~~vr~~---~~~---~----~~~~v~~---~---~~~~~~~~~eIiHir~~~~~~~~yGlsp~~~a~~si~l- 187 (344) T protein:vir:60 125 RLETSPAKYTRRG---VEE---D----VYWWVPS---F---NEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWL- 187 (344) T ss_pred EEEEcCcceEEEe---ecC---C----eEEEEcc---C---CeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHH- Confidence 4444433222221 111 1 2455531 1 12345666677777543 24699988888776543 Q ss_pred HHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHh-c-CCCeEEecC----CCceeEEec Q lcl|NC_016762. 227 EKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLN-R-GNDVLLPTQ----GATVTQMVS 300 (456) Q Consensus 227 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~lid~----~d~~~~~~~ 300 (456) ...+......+|+|....-.+.. +.+ ..-.++..+++.+.++... . +...+++.. ++.++...+ T Consensus 188 ~~~a~~~~~~~f~NG~~pg~il~-----~~~-----~~ls~e~~~~ik~~~~~~~g~~~~r~~~l~~p~g~~~g~~~~pi 257 (344) T protein:vir:60 188 NESATLFRRKYYENGAHAGYIMY-----VTD-----AVQDRNDIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPL 257 (344) T ss_pred HHHHHHHHHHHHhccCCCceEEE-----ecC-----cCCCHHHHHHHHHHHHHhcCCCCCcceEEecCCCCccceeEEEc Confidence 33344455566766543222211 100 0011233444444444322 1 222344431 233444444 Q ss_pred ccCCH----HHHHHHHHHHHHhhhcCCeEEeeccCCC---cccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcC Q lcl|NC_016762. 301 AVSDP----GPTYNVNLQTAAAGVDIPTKILVGMQTG---ERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGV 372 (456) Q Consensus 301 ~~sgl----~~~~~~~~~~~aaas~IP~t~L~G~sp~---Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~ 372 (456) +.+.- -++-....+.||++-+||-. |+|..+. |++..+ -.+.|+.. .|.|.++++-+ |..+ + T Consensus 258 s~~~~d~qf~e~k~~~~~eIa~af~VPp~-llGi~~~~t~~~~n~e~~~~~f~~~-------~L~Pl~~~~e~-ln~~-l 327 (344) T protein:vir:60 258 SEVATKDDFFNIKKASAADLLDAHRIPFQ-LMGGKPENVGSLGDIEKVAKVFVRN-------ELIPLQDRIRE-INGW-L 327 (344) T ss_pred CCChhHHHHHHHHHhhHHHHHHHhCCCHH-HhcccCCCCCccccHHHHHHHHHHH-------HHHHHHHHHHH-HHHh-c Confidence 44433 34444577889999999986 7786543 344333 34455443 46776665544 2222 1 Q ss_pred cCCCCceEEEeCCCCCCCH Q lcl|NC_016762. 373 VPLKAEFTAIWDDLTVPTK 391 (456) Q Consensus 373 ~~~~~d~~~~f~pL~~~se 391 (456) | ...|+|.+..|..-+. T Consensus 328 g--~~~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 328 G--QEVIRFKNYSLDTDNG 344 (344) T ss_pred C--CcccccCccccCCCCC Confidence 2 1345666655554444 No 203 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=97.09 E-value=0.00018 Score=41.18 Aligned_cols=319 Identities=11% Similarity=0.032 Sum_probs=142.2 Q ss_pred CCchhH-HHHhHHHHHHHHHHHHHHhh-hhhccCc------ccch------hhhhccCcccCCHHHHHHHHhcCchhhhh Q lcl|NC_016762. 1 MTDKLD-LAVNHAMSSAIARARMSLLN-QGIGHDA------KRPQ------AWCEYGFPQEITFNDLYTMYRRGGIAHGA 66 (456) Q Consensus 1 ~~~~~~-~~~~~a~~~~~~~~~d~~~n-~~~~~gt------~~~~------~~~~~~~~~~~~~~~l~~~Y~~~~l~r~i 66 (456) |+.|.. ....++..........+=.. ...+.|. .++- .++.-+|...+++..|..+++.|.....+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~~~ 80 (351) T protein:vir:78 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSSA 80 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhccCceecCCCCHHHHHHHHhhhHhhhhh Confidence 885432 22222111111000000000 0111121 1111 11122455678889999999999888888 Q ss_pred hccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCc Q lcl|NC_016762. 67 VEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKL 146 (456) Q Consensus 67 Vd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~ 146 (456) |-.-+..-++ +++- .. . +.+..++ .++..-.++|.+++.+.-+ +. T Consensus 81 l~~k~n~l~~-~~~P--n~------~-----------~t~~~f~----~~~~d~ll~Gnay~~~~rn-~~---------- 125 (351) T protein:vir:78 81 LFFKANVLAS-TFRP--HR------W-----------LSRHAFE----RWALDFLTFGNGYLERRRN-MV---------- 125 (351) T ss_pred hhhhhhHHhh-cccC--CC------C-----------CCHHHHH----HHHHHHHhcCCeEEEEEEC-CC---------- Confidence 7665543333 2221 00 0 0111122 2222244678887765432 11 Q ss_pred CceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHH Q lcl|NC_016762. 147 NGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNS 222 (456) Q Consensus 147 ~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~ 222 (456) +.+..+.|+-...+.+. .+. + .+|++.. + +..+.+-+..||+|... ...|+|.+..+... T Consensus 126 G~~~~L~pl~~~~v~~~---~~~---~----~~~~~~~---~---~~~~~~~~~eVihir~~~~~~~~yGl~~~~~a~~s 189 (351) T protein:vir:78 126 GGTLRLEPALAKYVRRK---ADF---S----GFVYVNG---W---QERHEFAPDSVFQLVRPDINQEVYGLPEYLSSLHS 189 (351) T ss_pred CCEEEEEEecCcceEEe---eeC---C----eEEEEec---C---CeEEEEccccEEEEcCCCCCCCcccccHHHHHHHH Confidence 12333444332222221 111 0 1233321 1 12344566677777543 34588988888876 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHh--cCCCeEE-ecC---CCcee Q lcl|NC_016762. 223 FISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLN--RGNDVLL-PTQ---GATVT 296 (456) Q Consensus 223 l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~l-id~---~d~~~ 296 (456) +.--. .+......+|+|....-.+.+ +.+ +.-.++..+++.+.++... .|.+.++ +.. ++.++ T Consensus 190 i~l~~-~a~~~~~~~f~NGa~pggIl~-----~~~-----~~ls~e~~~~lr~~~~~~~G~~N~~~~~v~~~~g~~~g~k 258 (351) T protein:vir:78 190 AWLNE-SSTLFRRKYYENGSHAGFILY-----MTD-----AAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQ 258 (351) T ss_pred HHHHH-HHHHHHHHHHhccCCCceEEE-----ecC-----CCCCHHHHHHHHHHHHHhcCcccccceeeecCCCCcccee Confidence 65332 233344556666543222111 110 0111333445545554432 2223333 322 23344 Q ss_pred EEecccCC----HHHHHHHHHHHHHhhhcCCeEEeeccCCCc---ccchHH-HHHHHHHHHHHHHhhhhHHHHHHHHHHH Q lcl|NC_016762. 297 QMVSAVSD----PGPTYNVNLQTAAAGVDIPTKILVGMQTGE---RASSED-QKYHNARCQARRVQELTFEINDLFAHLM 368 (456) Q Consensus 297 ~~~~~~sg----l~~~~~~~~~~~aaas~IP~t~L~G~sp~G---lnst~D-~~nyyd~I~~~Qe~~lrp~L~~l~~~l~ 368 (456) ....+.+. +-++-....+.||++.+||-. |+|..+.+ ++..+. .+.||. +.|.|.++++-++. T Consensus 259 ~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~-llGi~~~~t~~~sn~e~~~~~f~~-------~~l~P~~~~iee~n- 329 (351) T protein:vir:78 259 LIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQ-LLGIVPSNSGGFGTPDTAARVFGR-------NEIRPLQARFAELN- 329 (351) T ss_pred EEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHH-HhcccCCCCCCcccHHHHHHHHHH-------HHHHHHHHHHHHHH- Confidence 44444443 334445567789999999985 67876543 333332 344442 34667666654422 Q ss_pred HhcCcCCCCceEEEeCCCCCCCHHHHH Q lcl|NC_016762. 369 RIGVVPLKAEFTAIWDDLTVPTKAERL 395 (456) Q Consensus 369 ~s~~~~~~~d~~~~f~pL~~~seke~A 395 (456) .+ ++ .++ |+|++---+.-.++| T Consensus 330 ~~-l~---~~~-~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 330 DW-LG---DEV-VRFDDYEIPPAPVAA 351 (351) T ss_pred hh-cC---ccc-eecChhhhccccccC Confidence 21 22 222 667765555555555 No 204 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=97.03 E-value=0.0002 Score=40.83 Aligned_cols=429 Identities=14% Similarity=0.099 Sum_probs=176.4 Q ss_pred CCch-hHHH---------------HhHHHHHHHHHHHHHHhhhhhccCccc--chh-hhh-ccC--cccCCHHHHHHHHh Q lcl|NC_016762. 1 MTDK-LDLA---------------VNHAMSSAIARARMSLLNQGIGHDAKR--PQA-WCE-YGF--PQEITFNDLYTMYR 58 (456) Q Consensus 1 ~~~~-~~~~---------------~~~a~~~~~~~~~d~~~n~~~~~gt~~--~~~-~~~-~~~--~~~~~~~~l~~~Y~ 58 (456) |.-. +++- .+++.+-+--..-||-...-+++..+. -.+ .++ |+- +...+-.+|-..|| T Consensus 1 m~f~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR 80 (523) T protein:vir:68 1 MKFNILSLFAPWAKMDERDYKDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTRELIDTYR 80 (523) T ss_pred CCCchhhhhhhhhhhhhhhhhhhhhccCCCccccCCCCcceeeeccccccccccchhhhhhhhccccccchHHHHHHHHH Confidence 2210 1110 111111000001121111111111110 011 111 211 12345667877787 Q ss_pred c---CchhhhhhccchhHHh--hCCCEEecCCCcchhhhhHHHHHHHHHHH----HHhhHHHHHHHHHHhhcccCceEEE Q lcl|NC_016762. 59 R---GGIAHGAVEKIVTTCW--KTNPQVIEGDDQDRSKDETEWERKNKPLI----AGGRFWRAVSEADRRRLVGRYSGLL 129 (456) Q Consensus 59 ~---~~l~r~iVd~~aed~t--R~~~~i~~~~~~d~~~~~~~~e~~i~~~~----~~l~~~~~~~ea~~~~r~~Ggs~i~ 129 (456) + ++.+..+|+.++.||+ .+.-.+++-+= +..+....+..+|..++ +-|++..+-.+..|.--+.|. ..+ T Consensus 81 ~ma~~pEvd~Av~eIVneaiv~d~~~~pV~i~L-d~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgR-i~f 158 (523) T protein:vir:68 81 NLMTNYEVDNAVSEIVSDAIVYEDDTEVVSINL-DNTKFSPNIKSMMLDEFNEVLNHLSFQRKGSDHFRRWYVDSR-IFF 158 (523) T ss_pred HHhhccchhhHHHHhhcceeeecCCCceEEEEe-cccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhheeeeE-EEE Confidence 4 7889999999999984 22333322111 12222223333444443 344556655555543333332 233 Q ss_pred EEecCCCCccccccCCcCceeEEEEeccccCChh-hhhc-ccc-cccc-CCceeEEEeecc----cCC---ccccceeee Q lcl|NC_016762. 130 LHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPK-SFDE-KPD-SETY-GQPTMWEYTEAS----QAG---RPGLVRDIH 198 (456) Q Consensus 130 i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~-~~~~-Dp~-s~~y-g~P~~y~i~~~~----~~g---~~~~~~~IH 198 (456) -.+-|.+.+.+ |...|.+|.|+- +... +..+ ++. ..-+ |--++|..++.. .+| .+...++|| T Consensus 159 hKiid~k~pk~----GI~Elr~lDPr~---i~~vr~i~~~~~~g~~vi~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~ 231 (523) T protein:vir:68 159 HKIIDPKRPKE----GIKELRRLDPRQ---VQYVREVITTTEAGVKIVKGYKEYFIYDTSHESYACDGRIYEAGTKIKIP 231 (523) T ss_pred EEEeeCCCccc----cceeeeeeCCcc---eeEEEeecCCCCcchhhhhhhhhheeeccccccccccccccCCCcceecc Confidence 33335444322 233344444432 2110 0000 000 0000 111222222111 112 123457777 Q ss_pred hhhhheecCCc---CC---CcchHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHH Q lcl|NC_016762. 199 PDRVFILGDWT---GD---AIGFLEPAY---NSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDAL 269 (456) Q Consensus 199 ~SRli~~~~~~---~~---G~S~le~~~---~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~ 269 (456) .+- |.|+... .. =+|.|.++- |.|.-++-+ ..+|+-+ |....- -.=+|+-+|....+ ++- T Consensus 232 ~dA-I~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDA-----lVIYRit-RAPeRR-vFYIDvGnlPk~KA---eqY 300 (523) T protein:vir:68 232 KAA-IVYAHSGLVDCCGKNIIGYLHRAIKPANQLKLLEDA-----VVIYRIT-RAPDRR-VWYVDTGNMPSRKA---AEH 300 (523) T ss_pred hhh-eeeeeccceeCCCCceeccchhhhHHHHhhHHHHhh-----HHHHhhh-ccccce-EEEEecCCCCchhH---HHH Confidence 764 3343211 11 135565543 222222221 1233321 111000 00123333332211 122 Q ss_pred HHHHHHHHHHHh------cCCCe-------EEe-----------cCCCceeEEe--cccCCHHHHHHHHHHHHHhhhcCC Q lcl|NC_016762. 270 NERFNEAARQLN------RGNDV-------LLP-----------TQGATVTQMV--SAVSDPGPTYNVNLQTAAAGVDIP 323 (456) Q Consensus 270 ~~~~~~~~~~~~------~~~~~-------~li-----------d~~d~~~~~~--~~~sgl~~~~~~~~~~~aaas~IP 323 (456) ++ +.|.+++ .++|- +-+ .++-+++++. -+++-++| +..|..-+=-|.++| T Consensus 301 l~---~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP 376 (523) T protein:vir:68 301 MQ---HVMNTMKNRIAYDATTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMED-VRWFRNALYMALRIP 376 (523) T ss_pred HH---HHHHhhcceeEEeccCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHH-HHHHHHHHHHHhCCc Confidence 22 2222221 11110 000 0223455553 25667777 567888888999999 Q ss_pred eEEeeccCCCccc---chH---HHHHHHHHHHHHHHhhhhHHHHHHHHH-HHHhcCcC------CCCceEEEeCCCCCCC Q lcl|NC_016762. 324 TKILVGMQTGERA---SSE---DQKYHNARCQARRVQELTFEINDLFAH-LMRIGVVP------LKAEFTAIWDDLTVPT 390 (456) Q Consensus 324 ~t~L~G~sp~Gln---st~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~-l~~s~~~~------~~~d~~~~f~pL~~~s 390 (456) ++||-++ .||+| |++ |.-.|...|.+.|.. +...+..+++. |++-+++. ....+.|.|+-=..-+ T Consensus 377 ~sRl~~~-~~~f~~Gr~~EItRDEikF~KFI~rLR~r-Fs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ 454 (523) T protein:vir:68 377 ITRIPSD-QGGIQFDAGTSITRDELSFGKFIRELQHK-FEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFS 454 (523) T ss_pred ceeecCC-CcceecccccchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHH Confidence 9999765 36666 333 677899999998875 45656666553 44444442 2357899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHH-cCCcCcCHHHHHH-HhcccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 391 KAERLANSKTMSEINSAAIG-TGEPVFTAEEIRE-EAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 391 eke~Aei~~~~A~a~~~~~~-~g~~~i~~~E~R~-~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) |-..+|+...+..+.+..-. .|. .++-+=+++ .+.+.-..-.....-++.+-.++--++|..++| T Consensus 455 ElKe~Eil~~R~~~l~~~dpyvGk-y~s~~yi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~e~~ 521 (523) T protein:vir:68 455 ELKDAEILERRINMLQMAEPFIGK-YISHRTAMKDILQMSDEEIEQEAKQIEEESKEARFQDPDQEQE 521 (523) T ss_pred HHHHHHHHHHHHHHHHHhhhhhcc-cchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhh Confidence 99999999888887665432 121 234333322 111110000000000000000111122222222 No 205 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=97.01 E-value=0.00012 Score=42.05 Aligned_cols=324 Identities=12% Similarity=0.017 Sum_probs=138.7 Q ss_pred CCchhHHHHhHHHH----H-------HHHHHHHHHhhhhhccCc------cc---chh--h-hhccCcccCCHHHHHHHH Q lcl|NC_016762. 1 MTDKLDLAVNHAMS----S-------AIARARMSLLNQGIGHDA------KR---PQA--W-CEYGFPQEITFNDLYTMY 57 (456) Q Consensus 1 ~~~~~~~~~~~a~~----~-------~~~~~~d~~~n~~~~~gt------~~---~~~--~-~~~~~~~~~~~~~l~~~Y 57 (456) |+-|.+--..+..+ . .......+-... .+.|. .+ |-. + +.-+|...+++.-|..+| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~fg~p~~~~~~~~~~~~~~~~~~~~~~~~pi~~~~la~~~ 79 (368) T protein:vir:79 1 MSRNKTRRAARAASAHVRTANTDAPTEHHTDRAAQAEV-FSFGDPVEVLDRRELLDYVECMRMGQWYEPPMPWDGLARSF 79 (368) T ss_pred CCccccccchhccCcccccccccCcchhhccccCceEE-EEcCCceeecchhhHHHHHHHHhccchhccCcCHHHHHHHH Confidence 87665321100000 0 000000000000 11111 01 000 0 111344455666666666 Q ss_pred hcCchhhhhhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCC Q lcl|NC_016762. 58 RRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQ 136 (456) Q Consensus 58 ~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~ 136 (456) +.+.-...++-.-. .+.+-++. +. ..+.+..++. ++..-.++|-+++.+.-+ +|+ T Consensus 80 ~~~~~h~~~~~~~~-n~l~l~~~------Pn-------------~~~t~~~f~~----l~~d~ll~Gnay~~~~r~~~G~ 135 (368) T protein:vir:79 80 RAAAHHSSAVYVKR-NILVSTFI------PH-------------PLLSRATFER----LVLDWQVFGNAYLERRENVLGG 135 (368) T ss_pred hhccccchhhhhhc-chhhhhcC------CC-------------cCCCHHHHHH----HHHHHhhcCCeEEEEEEcCCCC Confidence 66654433322111 01111110 00 0011111222 222235678887766432 222 Q ss_pred CccccccCCcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc----CCC Q lcl|NC_016762. 137 PWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDA 212 (456) Q Consensus 137 ~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G 212 (456) +..+.|+-...+... .|. + .+|++.. + +..+.+.+.-|+++..+. ..| T Consensus 136 ------------~~~L~~l~~~~v~~~---~~~---~----~~~~~~~---~---~~~~~~~~~dIihir~~~~~~~~yG 187 (368) T protein:vir:79 136 ------------TIRLDTPLAKYVRRG---LDL---N----TYFFVQN---W---QQPYTFAAGSVFHLQEPDINQEVYG 187 (368) T ss_pred ------------EEEEEEeCcccceee---ccC---C----EEEEEec---C---CeEEEEccccEEEecCCCCCCCccc Confidence 223333332222211 111 1 2333331 1 123567777788775433 359 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhc--CCCeEEec Q lcl|NC_016762. 213 IGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNR--GNDVLLPT 290 (456) Q Consensus 213 ~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~lid 290 (456) .|-++.+...+.-- ..+..+...+|+|....-.+-+. .+ ..-.++..+++.+.++.... |.+.+++. T Consensus 188 lsp~~~a~~si~l~-~aa~~~~~~~~~NGa~~~gil~~-----~~-----~~l~~e~~~~lk~~~~~~~G~~N~g~~~vl 256 (368) T protein:vir:79 188 LPEYLSALNATWLN-ESATLFRRRYYKNGSHAGFILYM-----TD-----AAQKQEDVDTLREAMKSAKGPGNFRNLFMY 256 (368) T ss_pred ccHHHHHHHHHHHH-HHHHHHHHHHHhccCCCceEEEe-----CC-----CCCCHHHHHHHHHHHHHhcCCcccCceeEe Confidence 99988888766533 33333445566665433222111 10 01113344555555554332 23333333 Q ss_pred -C---CCceeEEecccCC----HHHHHHHHHHHHHhhhcCCeEEeeccCCCc---ccchH-HHHHHHHHHHHHHHhhhhH Q lcl|NC_016762. 291 -Q---GATVTQMVSAVSD----PGPTYNVNLQTAAAGVDIPTKILVGMQTGE---RASSE-DQKYHNARCQARRVQELTF 358 (456) Q Consensus 291 -~---~d~~~~~~~~~sg----l~~~~~~~~~~~aaas~IP~t~L~G~sp~G---lnst~-D~~nyyd~I~~~Qe~~lrp 358 (456) . ++.++....+.+- +-++.....++||++-+||- .|+|..+++ ++..+ -.+.||.. .|.| T Consensus 257 ~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp-~llGi~~~~t~~~sn~e~~~~~f~~~-------~l~P 328 (368) T protein:vir:79 257 APNGKKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPP-QLMGIIPNNTGGFGDVEKAAMVFARN-------EVKP 328 (368) T ss_pred cCCCCccceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCH-HHccccCCCCCccccHHHHHHHHHHH-------HHHH Confidence 1 2334444444432 23444556778999999997 577876543 33323 34555543 4677 Q ss_pred HHHHHHHHHHHhcCcCCCCceEEEeCC--CCCCCHHHHHHHHHHHH Q lcl|NC_016762. 359 EINDLFAHLMRIGVVPLKAEFTAIWDD--LTVPTKAERLANSKTMS 402 (456) Q Consensus 359 ~L~~l~~~l~~s~~~~~~~d~~~~f~p--L~~~seke~Aei~~~~A 402 (456) .++++-++. . .+++ ..|.|++ |...+.+.+|+-..+-| T Consensus 329 l~~~ie~ln-~-~l~~----e~~rF~~~~l~~~D~~a~a~~~~rsa 368 (368) T protein:vir:79 329 LQDRLLAIN-D-WIGD----EVVRFAPYALGGHDQPAAAPGGQRSA 368 (368) T ss_pred HHHHHHHHH-h-ccCc----ceeeechhHhhcccccccCCcccccC Confidence 776654322 1 1221 2356666 77777777777555555 No 206 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=96.96 E-value=0.00024 Score=40.48 Aligned_cols=321 Identities=8% Similarity=0.002 Sum_probs=144.5 Q ss_pred CCchhHHHHhHHHHHHHHHH-HHHHhhhhhccCcccchhh-hhc--cCcccCCHHHHHHHHhcCchhhhhhccchhHHhh Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARA-RMSLLNQGIGHDAKRPQAW-CEY--GFPQEITFNDLYTMYRRGGIAHGAVEKIVTTCWK 76 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~-~d~~~n~~~~~gt~~~~~~-~~~--~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR 76 (456) |..+......++...+.++. .-+|-.+.....+.=-..| ..+ +|-..+++.-|..+++.|.-...++..-...-.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~y~~~~~~~~~~~~epp~~~~~la~l~~~~~~h~~~i~~k~n~l~~ 80 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLNEISASPALDYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGILHSRANMVSS 80 (345) T ss_pred CCCCccccchhhcccCcceeEEeecCCcccccchhhhhhhhcCCccccCCCCCHHHHHHHhhcccccccceeeechHHHh Confidence 76665544333322221110 1111111111111000111 111 4667888899999999988888777654433222 Q ss_pred CCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCceeEEEEec Q lcl|NC_016762. 77 TNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAW 156 (456) Q Consensus 77 ~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~ 156 (456) +++ ... . +.+..+++.+ ..-.++|.+++.+.- ++ .+.+..+.|+- T Consensus 81 -~~~---Pn~-----~-----------lt~~~f~~~~----~d~ll~Gnay~~~~r-n~----------~G~~~~L~pl~ 125 (345) T protein:vir:37 81 -LYE---GGK-----A-----------LSRMDMRALC----LNLIQFGDVGLLKVR-NG----------FGQVVRLVPLS 125 (345) T ss_pred -hcc---CCC-----C-----------CCHHHHHHHH----HHHHhcCCeEEEEEE-cC----------CCcEEEEEEEc Confidence 221 110 0 1111122222 123467888776543 21 11233333333 Q ss_pred cccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 157 AGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISLEKVEGG 232 (456) Q Consensus 157 ~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~~~~~~~ 232 (456) ...+.+ ..| ...|+...++.+. .+ +....+.+..||++... ...|+|-+..+...+.--+. +.. T Consensus 126 ~~~vr~---~~d--~~~~~~~~~~~~~---~~---g~~~~~~~~dVihir~~~~~~~~~Gls~~~~a~~si~l~~~-a~~ 193 (345) T protein:vir:37 126 SLYLRV---RKD--GGYSYLMKKSLYD---TA---QEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSD-ATV 193 (345) T ss_pred CceeEE---EEe--CCeeEEEEEeEec---CC---ceEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHH-HHH Confidence 222211 112 2223333333322 11 12345666677777533 33689988888776543332 233 Q ss_pred HHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHh--cCCCeEEec-C---CCceeEEecccCCH- Q lcl|NC_016762. 233 SGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLN--RGNDVLLPT-Q---GATVTQMVSAVSDP- 305 (456) Q Consensus 233 ~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~lid-~---~d~~~~~~~~~sgl- 305 (456) ....+|+|....-.+- .+.+ ..-.++..+++.++++... .|.+.+++. . ++.++...++.+.- T Consensus 194 ~~~~~f~NG~~p~~Il-----~~~d-----~~l~~e~~~~lk~~~~~~~g~~n~~~~~i~~p~g~~~G~~~~pls~~~~d 263 (345) T protein:vir:37 194 FRRRYFSNGAHMGFIL-----YSTD-----PDLTEEMEEEIARKISESKGVGNFRSMFVNIANGHPDGLKVIPIGDTGTK 263 (345) T ss_pred HHHHHHhccCCcceEE-----EecC-----CCCCHHHHHHHHHHHHHhcCcccccceEEEcCCCcccceEEEEccCChhH Confidence 3344566543221111 0110 0111233444544554432 222333432 1 23344444333322 Q ss_pred ---HHHHHHHHHHHHhhhcCCeEEeeccCCCc---ccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCc Q lcl|NC_016762. 306 ---GPTYNVNLQTAAAGVDIPTKILVGMQTGE---RASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAE 378 (456) Q Consensus 306 ---~~~~~~~~~~~aaas~IP~t~L~G~sp~G---lnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d 378 (456) -++-....+.||++-+||-. |+|..+.+ ++..+ -.+.|+. +.|.|.++++-+.|-+. ...+.+ T Consensus 264 ~qf~e~k~~~~~dIa~a~~VPp~-llGi~~~~~~~~~~~e~~~~~f~~-------~~l~P~~~~ie~~ln~~--~~~~~~ 333 (345) T protein:vir:37 264 DEFANIKNISAQDVLTAHRFPAG-LSGIIPTNTGGLGDPLKYREVYHY-------DEVMPLQEIIAETINQD--PEIKNL 333 (345) T ss_pred HHHHHHHHHhHHHHHHHhCCCHH-HhCccCCCCCCcccHHHHHHHHHH-------HHHHHHHHHHHHHhhhh--ccCCCc Confidence 33334567789999999986 77876543 33223 3445553 34788777777666432 223445 Q ss_pred eEEEeCC--CCC Q lcl|NC_016762. 379 FTAIWDD--LTV 388 (456) Q Consensus 379 ~~~~f~p--L~~ 388 (456) ..|.|++ |.+ T Consensus 334 ~~i~F~~~~L~~ 345 (345) T protein:vir:37 334 LKIKFREQNFAK 345 (345) T ss_pred ceEEecchhhcC Confidence 6677764 333 No 207 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=96.87 E-value=0.00029 Score=40.02 Aligned_cols=432 Identities=16% Similarity=0.121 Sum_probs=186.2 Q ss_pred CCchhHHHHhHHHHHH------HHHHHHHHhhhhhccCcc-c----------chhhhhccCc---ccCCHHHHHHHHhc- Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSA------IARARMSLLNQGIGHDAK-R----------PQAWCEYGFP---QEITFNDLYTMYRR- 59 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~------~~~~~d~~~n~~~~~gt~-~----------~~~~~~~~~~---~~~~~~~l~~~Y~~- 59 (456) |...|++-..-+--+. +.--..+++.+-..-|+. . ...+.++++. ...+-.+|-..||+ T Consensus 1 ~~~~l~~~~~~~~~d~~~~~e~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~m 80 (521) T protein:vir:65 1 MFSRLKMLARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGL 80 (521) T ss_pred CccchhhhhhccCchhhHHHhhhccCCCcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHHHHHHHH Confidence 7777776544221111 111122332221112221 0 1111112222 13345677777774 Q ss_pred --CchhhhhhccchhHHh--hCCCEEecCCCcchhhhhHHHHHHHHHHHH----HhhHHHHHHHHHHhhcccCceEEEEE Q lcl|NC_016762. 60 --GGIAHGAVEKIVTTCW--KTNPQVIEGDDQDRSKDETEWERKNKPLIA----GGRFWRAVSEADRRRLVGRYSGLLLH 131 (456) Q Consensus 60 --~~l~r~iVd~~aed~t--R~~~~i~~~~~~d~~~~~~~~e~~i~~~~~----~l~~~~~~~ea~~~~r~~Ggs~i~i~ 131 (456) ++.+..+|+.++.||+ .+.-.+++-+= +..+....+..+|..+++ -|++..+..+..|.--+.|.-++-.. T Consensus 81 a~~pEvd~Av~eIVneaiv~d~~~~pV~l~L-~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhki 159 (521) T protein:vir:65 81 MNNHEVENAVQNIVNDAIVFEEGHEVVSLNL-EATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKI 159 (521) T ss_pred hhccchhhHHHHhhcceeEecCCCceEEEEe-cccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEEE Confidence 7889999999999984 22222222111 222223333334444443 44555555555443333343233232 Q ss_pred ecCCCCccccccCCcCceeEEEEeccccCChhhhhccccccccCCce-eEEEeecc----cCC---ccccceeeehhhhh Q lcl|NC_016762. 132 IRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPT-MWEYTEAS----QAG---RPGLVRDIHPDRVF 203 (456) Q Consensus 132 i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~-~y~i~~~~----~~g---~~~~~~~IH~SRli 203 (456) + | +. | +.|...|.+|.|+-..-+-...-...+.-..++.-+ +|..++.. .+| .+..+++||.+-+ T Consensus 160 i-d-~~---p-k~GI~ELr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI- 232 (521) T protein:vir:65 160 I-G-KN---P-KDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAI- 232 (521) T ss_pred E-c-CC---c-cccceeeeeeCCcceeeeeeecccccCCcceecceeeeeeeecCCcceeccceeecCCcceeechhhe- Confidence 3 3 22 2 123333444444221111000000001111122222 22221100 111 2334567777643 Q ss_pred eecCCc----CC--CcchHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHH Q lcl|NC_016762. 204 ILGDWT----GD--AIGFLEPAY---NSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFN 274 (456) Q Consensus 204 ~~~~~~----~~--G~S~le~~~---~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~ 274 (456) .|+... .. =+|.|.++- |.|.-++-+ ..+|+-+ |....- -.-+|+-+|....+ ++-++ T Consensus 233 ~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDA-----lVIYRit-RAPeRR-vFYIDvGnlPk~KA---eqYl~--- 299 (521) T protein:vir:65 233 TYAHSGLMDCDDKYIIGYLHRAVKPANQLKLLEDA-----MVVYRIT-RAPERR-VFFIDTGNMNNRKA---AQHMN--- 299 (521) T ss_pred eeeeccceeCCCCeeeecchhhhHhHHhhHHHHhh-----HHHHhhh-ccccce-EEEEecCCCCchhH---HHHHH--- Confidence 333211 01 135665543 223222221 1233321 111000 00123333332211 12222 Q ss_pred HHHHHHhc------CCC-------eEEec-----------CCCceeEEe--cccCCHHHHHHHHHHHHHhhhcCCeEEee Q lcl|NC_016762. 275 EAARQLNR------GND-------VLLPT-----------QGATVTQMV--SAVSDPGPTYNVNLQTAAAGVDIPTKILV 328 (456) Q Consensus 275 ~~~~~~~~------~~~-------~~lid-----------~~d~~~~~~--~~~sgl~~~~~~~~~~~aaas~IP~t~L~ 328 (456) +.|.+++. .+| .+-+. ++-+++++. -+++-++| +..|..-+=-|.++|++||- T Consensus 300 ~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~sRl~ 378 (521) T protein:vir:65 300 SVAQSFKNRVVYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDD-IRYFNRKLYEALRVPLSRSN 378 (521) T ss_pred HHHHhcCceeEeecccccccccccccchhhhhcccccCCCCccceeecccCCCcChHHH-HHHHHHHHHHHhCCCceecc Confidence 22222211 111 11111 233566553 36677777 56788888899999999996 Q ss_pred ccCCCccc---chH---HHHHHHHHHHHHHHhhhhHHHHHHHHH-HHHhcCcC------CCCceEEEeCCCCCCCHHHHH Q lcl|NC_016762. 329 GMQTGERA---SSE---DQKYHNARCQARRVQELTFEINDLFAH-LMRIGVVP------LKAEFTAIWDDLTVPTKAERL 395 (456) Q Consensus 329 G~sp~Gln---st~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~-l~~s~~~~------~~~d~~~~f~pL~~~seke~A 395 (456) .++.+|+| |++ |.-.|...|.+.|.. +.+.+..+++. |++-+++. ....+.|.|+-=..-+|-..+ T Consensus 379 ~e~~~~~~~gr~~EItRDEiKF~KFI~rLR~r-Fs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~ 457 (521) T protein:vir:65 379 LSDANMVIGGDGSEITRDELEFSKFIRTLQSQ-FSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDA 457 (521) T ss_pred CCCCcceeccccchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHH Confidence 67777776 333 677899999999875 46666666653 44444443 234689999999999999999 Q ss_pred HHHHHHHHHHHHHHH-cCCcCcCHHHHHH-HhcccCCCCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 396 ANSKTMSEINSAAIG-TGEPVFTAEEIRE-EAGYDPLQGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 396 ei~~~~A~a~~~~~~-~g~~~i~~~E~R~-~~~~~~~~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) |+...+..+.+..-. .|. .++-+=+++ .+.+.-..-.....-++.+-.++--++|.++.| T Consensus 458 Eil~~R~~~l~~~dpyvGk-y~S~dyi~k~ILr~tDeei~~~~k~I~~E~~~~~~~~p~~~~~ 519 (521) T protein:vir:65 458 EILERRIGLIERITPYIGK-YFSNQTVMRDILKYTDDQMDTEKKQIEEEANDPRFKQTPDEIE 519 (521) T ss_pred HHHHHHHHHHHHhhhhhcc-ccchHHHHHHHhccCHHHHHHHHHHHHHhhhCCCCCCCccccc Confidence 999888887765432 121 234443332 111110000000000000001111112222222 No 208 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=96.87 E-value=0.00029 Score=39.99 Aligned_cols=318 Identities=12% Similarity=0.062 Sum_probs=139.8 Q ss_pred CCchhHH-HHh--HHHHHHHHHHHHHHhhhhhccCc------ccch-hh-----hhccCcccCCHHHHHHHHhcCchhhh Q lcl|NC_016762. 1 MTDKLDL-AVN--HAMSSAIARARMSLLNQGIGHDA------KRPQ-AW-----CEYGFPQEITFNDLYTMYRRGGIAHG 65 (456) Q Consensus 1 ~~~~~~~-~~~--~a~~~~~~~~~d~~~n~~~~~gt------~~~~-~~-----~~~~~~~~~~~~~l~~~Y~~~~l~r~ 65 (456) |+-+..- ... ++...+......+ .-...+.|. .++- .+ +.-+|...+++..|..+++.|..... T Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~f~fg~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~La~~~~~~~~h~s 104 (376) T protein:vir:10 26 MSKRRSRAPRTFAAAPNPSAGSAAPA-RAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSS 104 (376) T ss_pred chhccCCCcccchhhhhHhhhccCcc-eeEEEEcCCceeccCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHHhhh Confidence 5544321 111 1111111110000 000111222 1211 11 11145557788888899999988888 Q ss_pred hhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCC Q lcl|NC_016762. 66 AVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGK 145 (456) Q Consensus 66 iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~ 145 (456) +|..-+.+..+ +++- + ... ++..+++ ++..-.++|.+++.+.- ++ T Consensus 105 ~l~~k~n~l~~-~~~P--n------p~l-----------T~~~f~~----~v~d~ll~Gnay~~~~r-n~---------- 149 (376) T protein:vir:10 105 ALFFKANVLAS-TFRP--H------RWL-----------SRHAFER----WALDFLTFGNGYLERRR-NM---------- 149 (376) T ss_pred hHHHHhHHHHh-ccCC--C------CCC-----------CHHHHHH----HHHHHHhcCCeEEEEEE-CC---------- Confidence 88766655433 2210 0 000 1111222 22223467877766543 21 Q ss_pred cCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHH Q lcl|NC_016762. 146 LNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYN 221 (456) Q Consensus 146 ~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~ 221 (456) .+.+..+.|+-...+.+. .|+ + .+|++.. + +..+.+-++-||+|... ...|+|-+..+.. T Consensus 150 ~G~~~~L~pl~~~~vr~~---~d~---~----~~~~~~~---~---~~~~~~~~~eViHir~~~~~~~~yGls~~~~a~~ 213 (376) T protein:vir:10 150 VGGTLRLEPALAKYVRRK---ADF---N----GFVYVNG---W---QERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLH 213 (376) T ss_pred CCCEEEEEEeCCcceEEE---eeC---C----eEEEEEc---C---CeEEEEccccEEEecCCCCCCCcccccHHHHHHH Confidence 122334444433322221 121 1 1333321 1 11234556667777543 3468998888877 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHh--cCCCeEEe-cC---CC-- Q lcl|NC_016762. 222 SFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLN--RGNDVLLP-TQ---GA-- 293 (456) Q Consensus 222 ~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~li-d~---~d-- 293 (456) .+.- ...+......+|+|....-.+.+ +.+ ..-.++..+++.+.++... .|.+.+++ .. ++ T Consensus 214 si~l-~~aa~~f~~~~f~NGa~pggIl~-----~~d-----~~l~~e~~~~lr~~~~~~~G~~N~~~~~vl~~~g~~~Gi 282 (376) T protein:vir:10 214 SAWL-NESSTLFRRKYYENGSHAGFILY-----MTD-----AAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGI 282 (376) T ss_pred HHHH-HHHHHHHHHHHHhccCCCceEEE-----ecC-----CCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccce Confidence 6543 23344445556666443222111 110 0011233444544554432 22233333 32 23 Q ss_pred ceeEEecccCC--HHHHHHHHHHHHHhhhcCCeEEeeccCCCc---ccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHH Q lcl|NC_016762. 294 TVTQMVSAVSD--PGPTYNVNLQTAAAGVDIPTKILVGMQTGE---RASSE-DQKYHNARCQARRVQELTFEINDLFAHL 367 (456) Q Consensus 294 ~~~~~~~~~sg--l~~~~~~~~~~~aaas~IP~t~L~G~sp~G---lnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l 367 (456) ++..++.+... +-++-....+.||++-+||- .|+|..+.+ ++..+ -.+.||.. .|.|.++++-+ + T Consensus 283 ~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp-~llGi~~~~t~~~sn~eq~~~~f~~~-------~L~Pl~~~iee-l 353 (376) T protein:vir:10 283 QLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPP-QLLGIVPSNSGGFGTPDTAARVFGRN-------EIRPLQARFAE-L 353 (376) T ss_pred EEEEccCCHHHHHHHHHHHHhHHHHHHHhCCCH-HHhcccCCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHH-H Confidence 34444433332 23444456778999999997 588986643 33333 34455532 36676666544 2 Q ss_pred HHhcCcCCCCceEEEeCCCCCCCHHHHH Q lcl|NC_016762. 368 MRIGVVPLKAEFTAIWDDLTVPTKAERL 395 (456) Q Consensus 368 ~~s~~~~~~~d~~~~f~pL~~~seke~A 395 (456) ..+ ++ .++ |+|++---+.-.++| T Consensus 354 n~~-L~---~~~-~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 354 NDW-LG---EEV-VRFDDYEIPPAPVAA 376 (376) T ss_pred Hhh-cc---ccc-cccChhHhhcccccC Confidence 221 12 122 666664444444444 No 209 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=96.80 E-value=0.00033 Score=39.68 Aligned_cols=427 Identities=15% Similarity=0.144 Sum_probs=179.5 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccc--hhhhhc--cC-cccCCHHHHHHHHhc---Cchhhhhhccchh Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRP--QAWCEY--GF-PQEITFNDLYTMYRR---GGIAHGAVEKIVT 72 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~--~~~~~~--~~-~~~~~~~~l~~~Y~~---~~l~r~iVd~~ae 72 (456) |++=.-.-.+.... ....++|+-...-=|+..- .++... +. +...+-.||-.-||+ ++.+..+|+.++. T Consensus 1 m~~lfg~~i~~~~~---~~~~~s~~~~~~~dg~~~i~~~~~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVn 77 (533) T protein:vir:10 1 MSQLFGFSLERAKK---APKGPSFVQKDNLDGSQPVSGGGYYGYTVDFDGQVRNEYQLISRYREMVLQPECDSAVDDIVN 77 (533) T ss_pred Cccccccccccccc---cccCCCCCCCCcccccceeecccccceeeecccccchHHHHHHHHHHHhhccchhhHHHHhhc Confidence 44322221111100 0111222111110000000 001000 11 123345678777874 7889999999999 Q ss_pred HHh--hCCCEEecCCCcchhhhhHHHHHHHHHHHH----HhhHHHHHHHHHHhhcccCceEEEEE-ecCCCCccccccCC Q lcl|NC_016762. 73 TCW--KTNPQVIEGDDQDRSKDETEWERKNKPLIA----GGRFWRAVSEADRRRLVGRYSGLLLH-IRDSQPWDRPARGK 145 (456) Q Consensus 73 d~t--R~~~~i~~~~~~d~~~~~~~~e~~i~~~~~----~l~~~~~~~ea~~~~r~~Ggs~i~i~-i~D~~~~~~Pl~~~ 145 (456) ||+ .+.-.+++-+= +..+....+.++|..+++ -|++..+..+..|.--+.| -+++. +-|.+.+.+ + T Consensus 78 eaiv~d~~~~pV~i~L-d~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDg--Ri~fHkiid~~~pk~----G 150 (533) T protein:vir:10 78 ETICGNFDDVPVSVEL-SNLKVSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDG--RLFYHKVIDPDNPQG----G 150 (533) T ss_pred ceeeecCCCceEEEEe-cccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcc--eEEEEEEecCCCccc----c Confidence 984 22222221110 111122233334444433 3444555555444322222 22222 224333221 3 Q ss_pred cCceeEEEEeccccCChhh-hhccc------cccccCC-ceeEEEeecccCCccccceeeehhhhheecCCcC----C-- Q lcl|NC_016762. 146 LNGLAKVTPAWAGCLKPKS-FDEKP------DSETYGQ-PTMWEYTEASQAGRPGLVRDIHPDRVFILGDWTG----D-- 211 (456) Q Consensus 146 ~~~l~~i~~~~~~~~~~~~-~~~Dp------~s~~yg~-P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~~----~-- 211 (456) ...|.+|.|+-...+--.. ...|. ....|+. -+||..++......+.++++|+.+ .|.|+.... . T Consensus 151 I~ELr~lDPr~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~g~~~~~~~~vkI~~d-AI~y~hSGl~d~~~~~ 229 (533) T protein:vir:10 151 LIELRYIDPRKIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLYDPKGLKNSTTQGLKIAPD-SICYVHSGIMDLNKNM 229 (533) T ss_pred ceeeeeccccceeeeeeeeccCCCccceeecchhhhccceeeeeeccccccccCCCceecchh-heeeeeccceeCCCCc Confidence 3334444443221111000 01111 1112222 335555544433445557888886 444432110 1 Q ss_pred CcchHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHh------ Q lcl|NC_016762. 212 AIGFLEPAY---NSFISLEKVEGGSGESFLKNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEAARQLN------ 281 (456) Q Consensus 212 G~S~le~~~---~~l~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~------ 281 (456) =+|.|.++- |.|.-++-+ ..+|+-+ |... ..| -+|+-+|....+ ++-++ +.|.+++ T Consensus 230 i~syLhkAiKp~NQLkm~EDA-----lVIYRit-RAPeRRvF--YIDVGnLPk~KA---eqYlr---~iM~k~KNklVYD 295 (533) T protein:vir:10 230 TLSHLHKAIKAVNQLRMIEDS-----LVIYRLS-RAPERRIF--YIDVGNLPKNKA---EQYLR---EVMGRYRNKLVYD 295 (533) T ss_pred eeccchHhHHHHHhhHHHHhh-----HHHHhhh-ccccceEE--EEecCCCCchhH---HHHHH---HHHHhccceEEEe Confidence 135665543 222222221 1233321 1110 000 123333332211 12222 2222221 Q ss_pred cCCCe---------EEec---------CCCceeEEe--cccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCccc---ch Q lcl|NC_016762. 282 RGNDV---------LLPT---------QGATVTQMV--SAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERA---SS 338 (456) Q Consensus 282 ~~~~~---------~lid---------~~d~~~~~~--~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Gln---st 338 (456) .++|- ++=| ++-+++++. -+++-++| +..|..-+=-+.++|++||= +.+|+| |+ T Consensus 296 a~TGev~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~D-V~YF~kKLY~aLnVP~SRl~--~e~~f~~Gr~~ 372 (533) T protein:vir:10 296 ANTGEIKDDKKFMSMLEDFWLPRREGGRGTEITTLPGGQNLGELED-VKYFQKKLYKSLNVPGSRLE--TETTFNVGRAA 372 (533) T ss_pred ccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcChHHH-HHHHHHHHHHHhCCCccccC--CCCcccccccc Confidence 11110 0000 223455543 35677777 56788888899999999993 346776 33 Q ss_pred H---HHHHHHHHHHHHHHhhhhHHHHHHHHH-HHHhcCcC------CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 339 E---DQKYHNARCQARRVQELTFEINDLFAH-LMRIGVVP------LKAEFTAIWDDLTVPTKAERLANSKTMSEINSAA 408 (456) Q Consensus 339 ~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~-l~~s~~~~------~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~ 408 (456) + |.-.|...|.+.|.. +...+..+++. |++-+++. ....+.|.|+-=..-+|-..+|+...+..+.+.. T Consensus 373 EItRDEiKF~KFI~RLR~r-Fs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~ 451 (533) T protein:vir:10 373 EITRDEVKFQKFVARLRKR-FSELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATM 451 (533) T ss_pred hhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHh Confidence 3 677899999998875 45666666653 44444442 3357899999999999999999998888876543 Q ss_pred HH-cCC-----------cCcCHHHHHHH---hcc---cCCCC----------C--CCCcccCCCCC-CCCCcCCCCCC-- Q lcl|NC_016762. 409 IG-TGE-----------PVFTAEEIREE---AGY---DPLQG----------G--DPLPDTEPEDE-DAARTDPTGEQ-- 455 (456) Q Consensus 409 ~~-~g~-----------~~i~~~E~R~~---~~~---~~~~~----------~--~~~~~~~~~d~-~~~~~d~~~~~-- 455 (456) -. .|. --.+.+|+.+. ... +|... + .|..+..+.++ .+..++|.-+- T Consensus 452 dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 531 (533) T protein:vir:10 452 DPFVGKYFSVEYMRRQVLKQTDVEMKEIDKQIESEMESGIIADPAAEMDPAMAAGDPDAGGAPAEEVAPEGPDPSDERKA 531 (533) T ss_pred hhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCCcchhhHHhcCCCCCcCCcccccCCCCCCCcchhhcc Confidence 21 111 12344454332 111 11111 0 00111111111 11122222222 Q ss_pred C Q lcl|NC_016762. 456 Q 456 (456) Q Consensus 456 e 456 (456) | T Consensus 532 ~ 532 (533) T protein:vir:10 532 E 532 (533) T ss_pred C Confidence 2 No 210 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=96.77 E-value=0.00035 Score=39.53 Aligned_cols=316 Identities=12% Similarity=-0.028 Sum_probs=144.3 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhcc----Cccc---chhh---hhccCcccCCHHHHHHHHhcCchhhhhhccc Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGH----DAKR---PQAW---CEYGFPQEITFNDLYTMYRRGGIAHGAVEKI 70 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~----gt~~---~~~~---~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~ 70 (456) |+-|...--..+.+.+.+. .-....+..|- -+.+ +-.+ +.-+|-..+++..|..+++.|.....+|..- T Consensus 1 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~~~i~~k 79 (344) T protein:vir:20 1 MSKKKGKTPQPAAKTMTAS-GPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSLRAAVHHSSPIYVK 79 (344) T ss_pred CCcccCCCCcchhhhhhcc-CCceEEEEcCCceEecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhhhhCccceeh Confidence 8877532111111111111 01111111111 0112 1111 2225667888999999999998877777665 Q ss_pred hhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCcee Q lcl|NC_016762. 71 VTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLA 150 (456) Q Consensus 71 aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~ 150 (456) .....+ +++ -.. ..+ +.. |..++..-.++|-|++.+.- + ..+.++ T Consensus 80 ~n~l~~-~~~--Pn~------~lt-----------~~~----f~~~~~d~ll~Gnay~~i~r-n----------~~G~~~ 124 (344) T protein:vir:20 80 RNILAS-TFI--PHP------WLS-----------QQD----FSRFVLDFLVFGNAFLEKRY-S----------TTGKVI 124 (344) T ss_pred hhhHHH-hcc--CCC------CCC-----------HHH----HHHHHHHHHhcCCeEEEEEE-C----------CCCcEE Confidence 543322 221 010 001 000 11111123467888776532 2 112344 Q ss_pred EEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHH Q lcl|NC_016762. 151 KVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISL 226 (456) Q Consensus 151 ~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~ 226 (456) .+.|+....+... .|. + .+|++.. +| ..+.+.+..|||+... ...|+|-+......+.- T Consensus 125 ~L~pl~~~~vr~~---~~~---~----~~~~~~~---~~---~~~~~~~~eIiHir~~~~~~~~yGls~~~~a~~si~l- 187 (344) T protein:vir:20 125 RLETSPAKYTRRG---VEE---D----VYWWVPS---FN---EPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWL- 187 (344) T ss_pred EEEEcCCceeEee---ecC---C----EEEEEcc---CC---eEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHH- Confidence 4555443333221 111 1 1344431 11 2345666777777543 34589988877776543 Q ss_pred HHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHh-c-CCCeEEecC----CCceeEEec Q lcl|NC_016762. 227 EKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLN-R-GNDVLLPTQ----GATVTQMVS 300 (456) Q Consensus 227 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~lid~----~d~~~~~~~ 300 (456) ...+......+|+|....-.+.. +.+ +.-.++..+++.+.++... . +...+++.. ++.++...+ T Consensus 188 ~~~a~~~~~~~f~NGa~p~~Il~-----~~d-----~~l~~e~~~~ik~~~~~~~g~~n~r~l~l~~p~g~~~gi~~~pi 257 (344) T protein:vir:20 188 NESATLFRRKYYENGAHAGYIMY-----VTD-----AVQDRNDIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPL 257 (344) T ss_pred HHHHHHHHHHHHhccCCCceEEE-----ecC-----cCCCHHHHHHHHHHHHHhcCCCCccceEEecCCCCccceeEEEc Confidence 23334445556666443222111 110 0011233444444444322 1 222344432 233444444 Q ss_pred ccCCH----HHHHHHHHHHHHhhhcCCeEEeeccCCC---cccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcC Q lcl|NC_016762. 301 AVSDP----GPTYNVNLQTAAAGVDIPTKILVGMQTG---ERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGV 372 (456) Q Consensus 301 ~~sgl----~~~~~~~~~~~aaas~IP~t~L~G~sp~---Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~ 372 (456) +.+.- -++-....+.||++-+||-. |+|..+. |++..+ -.+.|+. +.|.|.++++-++. .+ + T Consensus 258 s~~~~d~qf~e~k~~s~~eIa~af~VPp~-llGi~~~~t~~~~n~e~~~~~f~~-------~~l~P~~~~~e~in-~~-l 327 (344) T protein:vir:20 258 SEVATKDDFFNIKKASAADLLDAHRIPFQ-LMGGKPENVGSLGDIEKVAKVFVR-------NELIPLQDRIREIN-GW-L 327 (344) T ss_pred CCChhHHHHHHHHHhhHHHHHHHhCCCHH-HhccCCCCCCccccHHHHHHHHHH-------HHHHHHHHHHHHHH-Hh-c Confidence 44433 33445567789999999996 6786553 333333 3445543 23667666554422 11 2 Q ss_pred cCCCCceEEEeCCCCCCCH Q lcl|NC_016762. 373 VPLKAEFTAIWDDLTVPTK 391 (456) Q Consensus 373 ~~~~~d~~~~f~pL~~~se 391 (456) |. ..|+|.++.|..-+| T Consensus 328 g~--~~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 328 GQ--EVIRFKNYSLDTDND 344 (344) T ss_pred CC--cccccCccccccCCC Confidence 32 357777778765555 No 211 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=96.46 E-value=0.0006 Score=38.25 Aligned_cols=424 Identities=10% Similarity=0.032 Sum_probs=171.8 Q ss_pred CCchhH-------------HHHhHHHHH--HHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhh Q lcl|NC_016762. 1 MTDKLD-------------LAVNHAMSS--AIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHG 65 (456) Q Consensus 1 ~~~~~~-------------~~~~~a~~~--~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~ 65 (456) ||+.|= ..++|-... ...+....|...--.+=..+++.+..-+..+..... -..--.+.+++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~--~nnki~~nf~k~ 78 (537) T protein:vir:78 1 MTSPLLNKPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYA--SNVKISHGFFTE 78 (537) T ss_pred CCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccc--cccccccchHHH Confidence 665431 111111111 111111111110000000001111111111110000 000124678999 Q ss_pred hhccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecC-CCCcc---cc Q lcl|NC_016762. 66 AVEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRD-SQPWD---RP 141 (456) Q Consensus 66 iVd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D-~~~~~---~P 141 (456) ||+..+.=++-+.++++..++. ..+. ...++..++ -.+...+.+..+....+|.++.++.++. ++.-. .| T Consensus 79 Ivd~~~~yl~G~Pv~~~~~d~~-~~e~----~~~l~~~~~-~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~~~~~i~p 152 (537) T protein:vir:78 79 LVDQLAQYLLSNGVEVKVKDED-NTQL----DEILQEYFD-EDFQATIDTLVTNASKKGFEGIFARTTSEGKLKFQTVDG 152 (537) T ss_pred HHHHHhhhhcccCceeecCcch-hHHH----HHHHHHHhh-ccHHHHHHHHHHHHhhcCeeEEEeeecCCCceEEEEEcc Confidence 9999999999888888644322 2221 223444332 2455667777777888898888887753 32111 12 Q ss_pred cc-----CCcCceeEEEEeccccC-Chhhhhcc-ccccccCCce--eEEEeeccc------------------------- Q lcl|NC_016762. 142 AR-----GKLNGLAKVTPAWAGCL-KPKSFDEK-PDSETYGQPT--MWEYTEASQ------------------------- 187 (456) Q Consensus 142 l~-----~~~~~l~~i~~~~~~~~-~~~~~~~D-p~s~~yg~P~--~y~i~~~~~------------------------- 187 (456) .. .....+..+.-+|.... ........ ..--.++.+. +|.+..... T Consensus 153 ~~~~pv~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~ 232 (537) T protein:vir:78 153 LTLIPVFDDYGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEES 232 (537) T ss_pred ceeEEEEcCCCCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeecccc Confidence 21 00001111111110000 00000000 0000111111 000000000 Q ss_pred -CCcc----ccceeeehhhhheecCC--cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHh Q lcl|NC_016762. 188 -AGRP----GLVRDIHPDRVFILGDW--TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIAS 260 (456) Q Consensus 188 -~g~~----~~~~~IH~SRli~~~~~--~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~ 260 (456) .... .....-|.=-.|.+.++ +..|.|.++.+..-+.+++.+.-..+..+-..+-..+.+. +. T Consensus 233 ~~~~~~~~~~~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~--------g~-- 302 (537) T protein:vir:78 233 TDADFEDTDGYQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVK--------GF-- 302 (537) T ss_pred ccccccccccccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeee--------cC-- Confidence 0000 00001121111222221 2347888888777777777766555544433222222111 10 Q ss_pred hhcCCHHHHHHHHHHHHHHHhcCCCeEEecC-CC--ceeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccc Q lcl|NC_016762. 261 TYGVTLDALNERFNEAARQLNRGNDVLLPTQ-GA--TVTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERAS 337 (456) Q Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~-~d--~~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glns 337 (456) .+....+... .++ ..++..++. +. +|-+.+.+..+....++...+.|-..+..|-+ ...-+| |+ T Consensus 303 -~~~~~~~~~~-------~l~-~~~~i~v~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~---~~~~~g-n~ 369 (537) T protein:vir:78 303 -SGDSTDKLRQ-------NIK-AKKMIGVNGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNS---TAVGDG-NV 369 (537) T ss_pred -CCccchhHHH-------HHh-hcCceeecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCC---cccccc-CC Confidence 0111111111 122 223333442 33 45566777788999999988887777666653 222222 44 Q ss_pred hHH-HHH-HHH--HHHHHHHhhhhHHHHHHHHHHHHh----cCcC-CCCceEEEeCCCCCCCHHHHHHHHHHHHHH---- Q lcl|NC_016762. 338 SED-QKY-HNA--RCQARRVQELTFEINDLFAHLMRI----GVVP-LKAEFTAIWDDLTVPTKAERLANSKTMSEI---- 404 (456) Q Consensus 338 t~D-~~n-yyd--~I~~~Qe~~lrp~L~~l~~~l~~s----~~~~-~~~d~~~~f~pL~~~seke~Aei~~~~A~a---- 404 (456) ++. ++. |.. .-....+..+++.|++++.+++.. +... ...++.|.|+|=..-+++|.|++.++..++ T Consensus 370 SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~l~~~giiS 449 (537) T protein:vir:78 370 TNVVIKSRYTLLAMKARKMETSLRKVLRWCADMVVSDIALRGLGEYDSNDICFEIEPHVLANELDIATTRKTEAETEALK 449 (537) T ss_pred cHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccceeeEEeccCCCCCHHHHHHHHHHHHhcCcch Confidence 443 332 221 123455567899999998887543 1111 235789999999999999999998776432 Q ss_pred HHHHHHcCCcCcCHHH-HHH---Hh-----------ccc------------CCCCCCCCc-ccCCCCC--------CCCC Q lcl|NC_016762. 405 NSAAIGTGEPVFTAEE-IRE---EA-----------GYD------------PLQGGDPLP-DTEPEDE--------DAAR 448 (456) Q Consensus 405 ~~~~~~~g~~~i~~~E-~R~---~~-----------~~~------------~~~~~~~~~-~~~~~d~--------~~~~ 448 (456) ..+++.. .+.++..| .+. .. ... +.-++.+.. +.++.|+ ..++ T Consensus 450 ~eT~l~~-~p~vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~ 528 (537) T protein:vir:78 450 IGNIMTV-APRIGDDETLKLIAEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQPVADPNVVPP 528 (537) T ss_pred HHHHHHh-CCCCCCHHHHHHHHHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccCCCCCCCCCCC Confidence 1223321 12333222 110 00 000 000000000 0000111 1122 Q ss_pred cCCCCCCC Q lcl|NC_016762. 449 TDPTGEQQ 456 (456) Q Consensus 449 ~d~~~~~e 456 (456) +||.+-.+ T Consensus 529 ~~~~~~~~ 536 (537) T protein:vir:78 529 TDPNAVPQ 536 (537) T ss_pred CCCccCCC Confidence 22322222 No 212 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=96.33 E-value=0.00073 Score=37.78 Aligned_cols=319 Identities=11% Similarity=0.034 Sum_probs=142.4 Q ss_pred CCchhH-HHHhHHHHHHHHHHHHHHhh-hhhccCc------ccch------hhhhccCcccCCHHHHHHHHhcCchhhhh Q lcl|NC_016762. 1 MTDKLD-LAVNHAMSSAIARARMSLLN-QGIGHDA------KRPQ------AWCEYGFPQEITFNDLYTMYRRGGIAHGA 66 (456) Q Consensus 1 ~~~~~~-~~~~~a~~~~~~~~~d~~~n-~~~~~gt------~~~~------~~~~~~~~~~~~~~~l~~~Y~~~~l~r~i 66 (456) |+.|.. ....++..........+=.. ...+.|. .++- .++.-+|...+++..|..+++.|.-...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~~~~~h~~~ 80 (351) T protein:vir:79 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSFAGLAKSFRASTHHSSA 80 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHhhhHhhhhh Confidence 885432 22222111111000000000 0111121 1211 11122455678899999999999988888 Q ss_pred hccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCc Q lcl|NC_016762. 67 VEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKL 146 (456) Q Consensus 67 Vd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~ 146 (456) +-.-+.+-++ +++- . + ..+ +..++. ++..-.++|.+++.+.-+ + . T Consensus 81 l~~k~n~l~~-~~~P---n-p----~~t-----------~~~f~~----~v~d~ll~Gnay~~~~r~-~----------~ 125 (351) T protein:vir:79 81 LFFKANVLAS-TFRP---H-R----WLS-----------RHAFER----WALDFLTFGNGYLERRRN-M----------V 125 (351) T ss_pred hhhhhhHHhh-cccC---C-C----CCC-----------HHHHHH----HHHHHHhcCCeEEEEEEC-C----------C Confidence 8765543333 2221 0 0 001 111222 111234678887766432 1 1 Q ss_pred CceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHH Q lcl|NC_016762. 147 NGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNS 222 (456) Q Consensus 147 ~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~ 222 (456) +.+..+.|+-...+.+. .|. + .+|++.. + +..+.+-+..|||+..+. ..|.|-+..+... T Consensus 126 G~~~~L~~l~~~~v~~~---~~~---~----~~~~~~~---~---g~~~~~~~~eIihir~~~~~~~~yGl~~~~~a~~s 189 (351) T protein:vir:79 126 GGTLRLEPALAKYVRRK---ADF---S----GFVYVNG---W---QERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHS 189 (351) T ss_pred CCEEEEEEeCCcceeee---ecC---C----eEEEEec---C---ceEEEEcCccEEEeCCCCCCCCcccccHHHHHHHH Confidence 12334444433222221 111 1 1333321 1 122445567777775433 3588988888876 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHh--cCCCeEEec-C---CCcee Q lcl|NC_016762. 223 FISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLN--RGNDVLLPT-Q---GATVT 296 (456) Q Consensus 223 l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~lid-~---~d~~~ 296 (456) +.-- ..+......+|+|....-.+.+ +.+ +.-.++..+++.+.++... .|.+.+++. . ++.++ T Consensus 190 i~l~-~~a~~~~~~~f~NGa~pg~il~-----~~~-----~~ls~e~~~~lk~~~~~~~G~~N~~~~~v~~~~g~~~gi~ 258 (351) T protein:vir:79 190 AWLN-ESSTLFRRKYYENGSHAGFILY-----MTD-----AAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQ 258 (351) T ss_pred HHHH-HHHHHHHHHHHhccCCCceEEE-----ecC-----CCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceE Confidence 6543 2333445556666543222211 110 0011233444444444432 223333332 2 23344 Q ss_pred EEecccCC----HHHHHHHHHHHHHhhhcCCeEEeeccCCC---cccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHH Q lcl|NC_016762. 297 QMVSAVSD----PGPTYNVNLQTAAAGVDIPTKILVGMQTG---ERASSE-DQKYHNARCQARRVQELTFEINDLFAHLM 368 (456) Q Consensus 297 ~~~~~~sg----l~~~~~~~~~~~aaas~IP~t~L~G~sp~---Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~ 368 (456) ....+.+. +-++-....+.||++.+||-. |+|..+. |++..+ -.+.||.. .|.|.++++-++. T Consensus 259 ~~pl~~~~~d~ef~e~k~~s~~eI~~a~~VPp~-llGi~~~~t~~~~n~e~~~~~f~~~-------~l~Pl~~~ie~ln- 329 (351) T protein:vir:79 259 LIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQ-LLGIVPSNSGGFGTPDTAARVFGRN-------EIRPLQARFAELN- 329 (351) T ss_pred EEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHH-HhcccCCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHH- Confidence 44444443 334455567889999999975 6687654 333333 34455532 3566665554321 Q ss_pred HhcCcCCCCceEEEeCCCCCCCHHHHH Q lcl|NC_016762. 369 RIGVVPLKAEFTAIWDDLTVPTKAERL 395 (456) Q Consensus 369 ~s~~~~~~~d~~~~f~pL~~~seke~A 395 (456) .+ ++ .++ ++|++---+....+| T Consensus 330 ~~-lg---~~~-~~F~~~~llr~d~~a 351 (351) T protein:vir:79 330 DW-LG---DEV-VTFDDYEIPPAPVAA 351 (351) T ss_pred hh-cC---cce-eeeChhhhccccccC Confidence 11 12 232 677775544444444 No 213 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=96.20 E-value=0.00088 Score=37.34 Aligned_cols=302 Identities=13% Similarity=0.067 Sum_probs=120.0 Q ss_pred ceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhhe Q lcl|NC_016762. 125 YSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFI 204 (456) Q Consensus 125 gs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~ 204 (456) -.=++-... ++.-.+..|. |+.+-+...+..++ +-+. ..+... ...|. .+..|.+.+++. T Consensus 1 v~Eivw~~~----------~g~~~~~~l~--~r~~~~~~~f~~~~---~~~l-~~~~~~--~~~g~--~~~~lp~~kfi~ 60 (355) T protein:vir:78 1 MFEQVYRIE----------NGRARLGKLA--WRPPRTISRFDVAP---DGGL-VAIEQW--GVFGK--ATVRIPVDRLVV 60 (355) T ss_pred CeEEEEEee----------CCeEEEeeee--ecCccceeeeeecc---CCce-eEEEec--CCCCC--CcceeccCCEEE Confidence 000111111 1111122222 22221111111111 1111 111111 11111 124455555554 Q ss_pred ec----CCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHH-hhhcCCHHHHHHHHHHHHHH Q lcl|NC_016762. 205 LG----DWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIA-STYGVTLDALNERFNEAARQ 279 (456) Q Consensus 205 ~~----~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~ 279 (456) +. ..+.+|.+++..||...+--.....-++.-+-+........++-......+.. ...........+.+..+++. T Consensus 61 ~~~~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~ 140 (355) T protein:vir:78 61 FVNEREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKE 140 (355) T ss_pred EEeCCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHH Confidence 42 34567999999999755422222222222111111111111110000000000 00000112223334444444 Q ss_pred HhcCC-CeEEecCCCceeEEeccc--CCHHHHHHHHHHHHHhhhcCCeEEeeccCCCccc-chHH--HHHHHHHHHHHHH Q lcl|NC_016762. 280 LNRGN-DVLLPTQGATVTQMVSAV--SDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERA-SSED--QKYHNARCQARRV 353 (456) Q Consensus 280 ~~~~~-~~~lid~~d~~~~~~~~~--sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Gln-st~D--~~nyyd~I~~~Qe 353 (456) +..+. ...++.++.+++-+++.- ++...+++..-.+||-+.--. |--.+.+.+|-+ |-++ .....+.+.+... T Consensus 141 i~~g~~a~~iip~g~~ie~~ea~g~~~~~~~~i~~~d~~Isk~iLGq-tlTs~~~~~gGS~Alg~vh~~v~~~~~~aD~~ 219 (355) T protein:vir:78 141 FRAGEAAGGYIPHGANFTLTGVQGKLPEMDGPIRYHDEQIARAVLAH-FLTLGGDKSTGSYALGDTFASFFTGSLNAVMK 219 (355) T ss_pred hhCCcceeEeecCCceEEEeecCCCcccHHHHHHHHHHHHHHHHhhh-hhccccCCccchhhHHHHHHHHHHHHHHHHHH Confidence 54554 356788888888776543 355667777666776643221 111111222221 2233 3556677776665 Q ss_pred hhhhHHHH-HHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCH---HHHHHHhcccC Q lcl|NC_016762. 354 QELTFEIN-DLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTA---EEIREEAGYDP 429 (456) Q Consensus 354 ~~lrp~L~-~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~---~E~R~~~~~~~ 429 (456) .|...|. .|+.-|+...+++...--.|+|...- + ..++.|++++.++..|..+-.+ +.+|+..+... T Consensus 220 -~i~~~ln~~li~~l~~lN~~~~~~~P~~~~~~~~---~-----~~~~~a~~~~~l~~~G~~~~~~~~~~~~~e~~gip~ 290 (355) T protein:vir:78 220 -HIADVTQQHVVEDLVDQNWGPEEPAPRLVPAQLG---K-----EQPVTAEAIRALVECGAFTADPELEKDLRARYGLPA 290 (355) T ss_pred -HHHHHHHHHHHHHHHHhcCCCCCCCCEEEecCcC---h-----hHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCC Confidence 4566664 58888888877764433356664432 1 1234688888999998422222 34788777643 Q ss_pred CCCCCCCc-c-cCCC--C------------CCCCCcCCCCCCC Q lcl|NC_016762. 430 LQGGDPLP-D-TEPE--D------------EDAARTDPTGEQQ 456 (456) Q Consensus 430 ~~~~~~~~-~-~~~~--d------------~~~~~~d~~~~~e 456 (456) ..+.+... . .+.. . .+.....|.+.+. T Consensus 291 p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~a~~~ 333 (355) T protein:vir:78 291 PAERDDGADAAAAKAAGRRRAKRLPGQRQGAALPSRSPRADPP 333 (355) T ss_pred CCCCCcccCCccccccccccccccCCccccccccccCCCCCCh Confidence 32221100 0 0000 0 0000001111111 No 214 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=96.18 E-value=0.0009 Score=37.28 Aligned_cols=315 Identities=8% Similarity=0.005 Sum_probs=136.4 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccch---hhh-----hc--cCcccCCHHHHHHHHhcCchhhhhhccc Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQ---AWC-----EY--GFPQEITFNDLYTMYRRGGIAHGAVEKI 70 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~---~~~-----~~--~~~~~~~~~~l~~~Y~~~~l~r~iVd~~ 70 (456) |.-. +.+-+++...+ ..-...+.+.|..... -+. .. +|-..+++..|..+++.|.....++..- T Consensus 1 ~~~~-~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~epp~~~~~la~~~~~~~~h~~~i~~k 74 (345) T protein:vir:37 1 MKTN-VKTDNKKGIVI-----APINDRTFSLSEITASPALDYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGILHSR 74 (345) T ss_pred CCcc-ccccchhhhcC-----CCceEEEeecCCcccchhhcccceeeecCCccccCCCCHHHHHHHhhcchhhcchhhhh Confidence 4322 11111111110 0001111122211111 011 11 4567888899999999998888777655 Q ss_pred hhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCcee Q lcl|NC_016762. 71 VTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLA 150 (456) Q Consensus 71 aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~ 150 (456) ...-.+ +++- +. . +.+..+++ ++..-.++|-+++.+.- ++ .+.+. T Consensus 75 ~n~l~~-~~~P--n~------~-----------~t~~~f~~----~v~d~ll~Gnay~~i~r-n~----------~G~~~ 119 (345) T protein:vir:37 75 ANMVSA-TYEG--GK------A-----------LSKMEMRA----LCLNLIQFGDVGLLKVR-NG----------FGQVV 119 (345) T ss_pred hhHHhh-ccCC--CC------C-----------CCHHHHHH----HHHHHHhcCCeEEEEEE-CC----------CCCEE Confidence 433222 2211 00 0 01111222 22223467877776643 21 12233 Q ss_pred EEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHH Q lcl|NC_016762. 151 KVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISL 226 (456) Q Consensus 151 ~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~ 226 (456) .+.|+-...+.. ..|. -.|+.-..|.+. .. +....+.++.|+||... ...|.|-+..+...+.-- T Consensus 120 ~L~pl~~~~vr~---~~d~--~~~~~~~~~~~~---~~---g~~~~~~~~eViHir~~~~~~~~~Gl~~~~~a~~si~l~ 188 (345) T protein:vir:37 120 RLVPLSSLYLRV---HKDG--GYSYLMKKSLYD---TA---QEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLN 188 (345) T ss_pred EEEEecCceeEE---eecC--CeeEEEeeeeec---cC---ceEEEEccccEEEEcCCCCCCCcccchHHHHHHHHHHHH Confidence 333332221111 1111 111111111111 01 12244566777777543 235888777776655432 Q ss_pred HHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhc--CCCeEEec-C---CCceeEEec Q lcl|NC_016762. 227 EKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNR--GNDVLLPT-Q---GATVTQMVS 300 (456) Q Consensus 227 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~lid-~---~d~~~~~~~ 300 (456) ..+......+|+|....-.+.+ +++ +.-.++..+++.+.++.... +...+++. . ++.++...+ T Consensus 189 -~~a~~~~~~~f~NGa~~~~Il~-----~t~-----~~l~~e~~~~lk~~~~~~~g~~n~~~~~i~~~~g~~~G~~~~pl 257 (345) T protein:vir:37 189 -SDATVFRRRYFSNGAHMGFILY-----STD-----PDLTEEMEEEIARKISESKGVGNFRSMFVNIAGGHPDGLKVIPI 257 (345) T ss_pred -HHHHHHHHHHHhccCCcceEEE-----eCC-----CCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceeEEEc Confidence 2333344455666432211110 110 01112334445444544332 22333332 2 122444444 Q ss_pred ccCCH----HHHHHHHHHHHHhhhcCCeEEeeccCCC---cccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcC Q lcl|NC_016762. 301 AVSDP----GPTYNVNLQTAAAGVDIPTKILVGMQTG---ERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGV 372 (456) Q Consensus 301 ~~sgl----~~~~~~~~~~~aaas~IP~t~L~G~sp~---Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~ 372 (456) +.+.- -++-....+.||++-+||-. |+|..+. |++..+ -.+.|+. ..|.|.++++-+.|-+. T Consensus 258 ~~~~~d~qf~e~k~~~~~dI~~a~~VPp~-liGi~~~~t~~~s~~e~~~~~f~~-------~~l~P~~~~ie~~ln~~-- 327 (345) T protein:vir:37 258 GDTGTKDEFANIKNISAQDVLTAHRFPAG-LSGIIPTNTGGLGDPLKYREVYHY-------DEVMPLQEIIAETINQD-- 327 (345) T ss_pred cCChhHHHHHHHHHHhHHHHHHHhCCCHH-HhccccCCCCCcccHHHHHHHHHH-------HHHHHHHHHHHHHhhhh-- Confidence 44432 33445567789999999974 6687553 444333 3455553 35778777776666432 Q ss_pred cCCCCceEEEeCC--CCC Q lcl|NC_016762. 373 VPLKAEFTAIWDD--LTV 388 (456) Q Consensus 373 ~~~~~d~~~~f~p--L~~ 388 (456) ...+.+..|.|+| |.. T Consensus 328 ~e~~~~~~i~F~~~~l~k 345 (345) T protein:vir:37 328 PEIKNLLKIKFREQNFAK 345 (345) T ss_pred hccCCcceEEECchhhcC Confidence 2223456666765 443 No 215 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=96.01 E-value=0.0011 Score=36.74 Aligned_cols=201 Identities=9% Similarity=0.061 Sum_probs=98.1 Q ss_pred ccCCcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc----CCCcchHH Q lcl|NC_016762. 142 ARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLE 217 (456) Q Consensus 142 l~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le 217 (456) ++.+..|-. .|.+.....+ ..+....++++.|+||.... ..|.|-++ T Consensus 1 ~r~~~dg~~----------------------------~y~~~~~~~~-~~g~~~~~~~~eilH~r~~~~~~~~~Glspi~ 51 (219) T protein:vir:98 1 MRVCKDGNY----------------------------KYLMKKSLYD-TKSEIYEYNKNDVIFIKLYDPMQQVYGSPDYV 51 (219) T ss_pred CceeecCeE----------------------------EEEEecceec-CCceeEEeccccEEEecCCCCCCCcceecHHH Confidence 222211110 1111111000 11223567788888886543 35999888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhc--CCCeEEec----- Q lcl|NC_016762. 218 PAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNR--GNDVLLPT----- 290 (456) Q Consensus 218 ~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~lid----- 290 (456) .+...+.. ...+..+...+|+|....-.+..... ..| .++..+++.+.++.... |...+++. T Consensus 52 ~a~~~i~~-~~aa~~~~~~~f~Ng~~p~gil~~~~---~~l-------~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~ 120 (219) T protein:vir:98 52 GGITSALL-NSDATIFRRRYYSNGAHMGFILYSTD---PDM-------TEEMEDEIAERIRDSKGVGNFRSMFVNIAGGH 120 (219) T ss_pred HHHHHHHH-HHHHHHHHHHHHhcCCCCceEEEeCC---CCC-------CHHHHHHHHHHHHHhcCcccccceeEecCCCC Confidence 87766543 34455566677777554332211110 001 12334444444444321 11233442 Q ss_pred -CCCceeEEecccCCH--HHHHHHHHHHHHhhhcCCeEEeeccC---CCcccchHH-HHHHHHHHHHHHHhhhhHHHHHH Q lcl|NC_016762. 291 -QGATVTQMVSAVSDP--GPTYNVNLQTAAAGVDIPTKILVGMQ---TGERASSED-QKYHNARCQARRVQELTFEINDL 363 (456) Q Consensus 291 -~~d~~~~~~~~~sgl--~~~~~~~~~~~aaas~IP~t~L~G~s---p~Glnst~D-~~nyyd~I~~~Qe~~lrp~L~~l 363 (456) ++-+|+.++.+..+. -+.-....+.||.+-+||-. ++|.. .++.++.+. ...||. ..|.|.++++ T Consensus 121 ~~G~~~~~~~~~~~d~qfle~rk~~~~eIa~~fgVPp~-~lG~~~~~~~~~sn~eq~~~~f~~-------~tL~P~~~~i 192 (219) T protein:vir:98 121 PDGLKVIPIGDTGQKDEFANIKNISAQDVLTSHRFPPG-LSGIIPVNTAGLGDPLKIREAYQA-------DEVLPLQEII 192 (219) T ss_pred ccceeEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHH-HcccccCCCCCccCHHHHHHHHHH-------HHHHHHHHHH Confidence 233566666555433 33334457789999999997 55754 333333332 345553 4578988888 Q ss_pred HHHHHHhcCcCCCCceEEEeCCCCCCCHHH Q lcl|NC_016762. 364 FAHLMRIGVVPLKAEFTAIWDDLTVPTKAE 393 (456) Q Consensus 364 ~~~l~~s~~~~~~~d~~~~f~pL~~~seke 393 (456) -..|-..-+. +.+..+.|+ =..++++. T Consensus 193 e~~ln~~~~~--~~~~~~~F~-~~~~~d~~ 219 (219) T protein:vir:98 193 AESINSDYEI--KSALKVNFK-QPEKRDKN 219 (219) T ss_pred HHHhhhhhcC--CCccEEeec-CcccccCC Confidence 7777543233 334556664 22223332 No 216 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=95.95 E-value=0.0012 Score=36.57 Aligned_cols=402 Identities=9% Similarity=-0.053 Sum_probs=152.7 Q ss_pred CCchhHHHHh----HHHH-----HHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccch Q lcl|NC_016762. 1 MTDKLDLAVN----HAMS-----SAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIV 71 (456) Q Consensus 1 ~~~~~~~~~~----~a~~-----~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~a 71 (456) |..|.+.=+. ++.. ......+..+.+..+++-+.... ..-.-.+. +..-++.|.+ +.-+..++++.- T Consensus 1 m~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~-~~iLr~~~--~~~ly~~m~~-D~hi~s~l~~Rk 76 (448) T protein:vir:79 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREF-DELLQGKD--GLLVYHKMLS-DGTVKNALNYIF 76 (448) T ss_pred CCCCCCCCccccCcccccccccchhhhhhhhhhcccccccccccch-hHhhcccc--chHHHHHHhh-ChHHHHHHHHHH Confidence 6555443111 1111 01222333344433322222111 11110111 1233444544 777777888888 Q ss_pred hHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHH-------hhHHHHHHHHHHhhcccCceEEEEEe---cCCCCcccc Q lcl|NC_016762. 72 TTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAG-------GRFWRAVSEADRRRLVGRYSGLLLHI---RDSQPWDRP 141 (456) Q Consensus 72 ed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~-------l~~~~~~~ea~~~~r~~Ggs~i~i~i---~D~~~~~~P 141 (456) .-.++.-|+|..+++++......+ .+...++. .. |..+..-+-.+.+||+|++=+.- .||.- T Consensus 77 ~av~~~~w~v~p~~~~~~~~~~ae---~v~~~l~~~~~~~~~~~-f~~~~~~~lda~~~G~s~~Eivw~~~~~g~~---- 148 (448) T protein:vir:79 77 GRIRSAKWYVEPASTDPEDIAIAA---FIHAQLGIDDASVGKYP-FGRLFAIYENAYIYGMAAGEIVLTLGADGKL---- 148 (448) T ss_pred HHHhcCCceEecCCCCHHHHHHHH---HHHHHhhhhhhhhccCC-HHHHHHHHHHhhhhcceeEEEEeeecCCCce---- Confidence 777777778854333322222222 12222221 12 22333334458899999875542 13321 Q ss_pred ccCCcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCcc-c------cceeeehhhhheecCCcCCCcc Q lcl|NC_016762. 142 ARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRP-G------LVRDIHPDRVFILGDWTGDAIG 214 (456) Q Consensus 142 l~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~-~------~~~~IH~SRli~~~~~~~~G~S 214 (456) .+..+.++... +...+.-|+. +.+..........++.. . ...-||+.+ . ...+.+|.+ T Consensus 149 ------~~~~l~~r~~~--~~~~f~~~~d----~~l~~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~-~--~~g~p~g~g 213 (448) T protein:vir:79 149 ------ILDKIVPIHPF--NIDEVLYDEE----GGPKALKLSGEVKGGSQFVSGLEIPIWKTVVFLH-N--DDGSFTGQS 213 (448) T ss_pred ------ecccccccCCc--cccceeeecC----CceEEeecCCcccccccCCCccccccceEEEEec-C--ccCCcccch Confidence 11111111100 0000000110 00111100000000000 0 012234322 1 224677999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCC-CeEEecCCC Q lcl|NC_016762. 215 FLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGN-DVLLPTQGA 293 (456) Q Consensus 215 ~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~lid~~d 293 (456) ++..||...+--.....-++.-+-+........++.. .....++..+.+.++++.+..+. ...+|..+. T Consensus 214 Llr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~----------ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~~~ 283 (448) T protein:vir:79 214 ALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPK----------SVRQGTKQWEAAKEIVKNFVQKPRHGIILPDDW 283 (448) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCC----------CCCcCHHHHHHHHHHHHHHhcCCceEEEecCCc Confidence 9999987443221222222211111000000000000 00111334455555666665444 345677788 Q ss_pred ceeEEecccC--CHHHHHHHHHHHHHhhhcCCeEEeecc-----CCCcccc--hHHH-HHHHHHHHHHHHhhhhHHHH-H Q lcl|NC_016762. 294 TVTQMVSAVS--DPGPTYNVNLQTAAAGVDIPTKILVGM-----QTGERAS--SEDQ-KYHNARCQARRVQELTFEIN-D 362 (456) Q Consensus 294 ~~~~~~~~~s--gl~~~~~~~~~~~aaas~IP~t~L~G~-----sp~Glns--t~D~-~nyyd~I~~~Qe~~lrp~L~-~ 362 (456) +++-++..-+ ....+++..-.+||-+ ++|| +-||-.+ .++. ....+.+.+-.. .+...|. . T Consensus 284 ~ie~~ea~~~~~~~~~~i~~~d~~Isk~-------iLGqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~-~i~~tln~~ 355 (448) T protein:vir:79 284 KFDTVDLKSAMPDAIPYLTYHDAGIARA-------LGIDFNTVQLNMGVQAINIGEFVSLTQQTIISLQR-EFASAVNLY 355 (448) T ss_pred eEEEEecCCCcccHHHHHHHHHHHHHHH-------HhhhhhccccccchhhhhhhhHHHHHHHHHHHHHH-HHHHHHHHH Confidence 8877766532 3445555555556553 3343 1122211 1222 223344444333 3455564 4 Q ss_pred HHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCC Q lcl|NC_016762. 363 LFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPE 442 (456) Q Consensus 363 l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~ 442 (456) |+.-|+...+++...-=.|.|... |..|+ ++.|++..+++..+ ....+-+|+..+......+.+....... T Consensus 356 li~~l~~lNfg~~~~~P~~~f~~~------e~~Dl-~~~a~~~~~l~~~~--~~~~~~~~~~~~~p~~~~~~~~~a~~~~ 426 (448) T protein:vir:79 356 LIPKLVLPNWPSATRFPRLTFEME------ERNDF-SAAANLMGMLINAV--KDSEDIPTELKALIDALPSKMRRALGVV 426 (448) T ss_pred HHHHHHHhcCCCcCCCcEEEecCC------ChHHH-HHHHHHhhhhhccc--hhhHHHHHHhhcCCCCCCCccccccCCC Confidence 777777777775332113444321 22232 34677777777654 2222334554443221111110000000 Q ss_pred CCCCCCcCCCCCCC Q lcl|NC_016762. 443 DEDAARTDPTGEQQ 456 (456) Q Consensus 443 d~~~~~~d~~~~~e 456 (456) +...+.....+++. T Consensus 427 ~~~~~~~~~~~~~~ 440 (448) T protein:vir:79 427 DEVREAVRQPADSR 440 (448) T ss_pred CcccccccCCcccc Confidence 11111111111111 No 217 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=95.94 E-value=0.0012 Score=36.56 Aligned_cols=316 Identities=11% Similarity=-0.027 Sum_probs=139.2 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhc-----cCcc--cchhh---hhccCcccCCHHHHHHHHhcCchhhhhhccc Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIG-----HDAK--RPQAW---CEYGFPQEITFNDLYTMYRRGGIAHGAVEKI 70 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~-----~gt~--~~~~~---~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~ 70 (456) |+-|.+.--..+.....+. .-....+..| +++. -+-.+ +.-+|...+++.-|..+++.|.....+|... T Consensus 1 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~s~i~~k 79 (344) T protein:vir:56 1 MSKKKGKTPQPAAKTMTAS-APKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPVSFTGLAKSLRAAVHHSSPIYVK 79 (344) T ss_pred CCCCCCCCCchhhHHhhcC-CCceEEEEcCCceeecCcchhhhHHHhhhcCccccCCCCHHHHHHHHhhhhhhCccceeh Confidence 9877653111111110000 0111111111 1111 12221 2225667889999999999998877777765 Q ss_pred hhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCcee Q lcl|NC_016762. 71 VTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLA 150 (456) Q Consensus 71 aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~ 150 (456) +....+ +++= . ...+.. .+ .+ ++..-.++|-+++.+. +++ .+.+. T Consensus 80 ~n~l~~-~~~P--n------p~~t~~--~f----~~---------~~~d~ll~Gnay~~~~-rn~----------~G~~~ 124 (344) T protein:vir:56 80 RNILAS-TFIP--H------PWLSQQ--DF----SR---------FVLDFLVFGNAFLEKR-YST----------TGKVI 124 (344) T ss_pred hhhHHh-hcCC--C------CCCCHH--HH----HH---------HHHHHHhcCCeEEEEE-ECC----------CCcEE Confidence 544332 2210 0 000100 11 11 1112346787777653 221 12234 Q ss_pred EEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCC----cCCCcchHHHHHHHHHHH Q lcl|NC_016762. 151 KVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDW----TGDAIGFLEPAYNSFISL 226 (456) Q Consensus 151 ~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~----~~~G~S~le~~~~~l~~~ 226 (456) .+.|+....+... .|. + .+|++.. + +....+.+..|||+..+ ...|+|-+..+.+.+.- T Consensus 125 ~L~pl~~~~v~~~---~~~---~----~~~~~~~---~---g~~~~~~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l- 187 (344) T protein:vir:56 125 RLETSPAKYTRRG---VEE---D----VYWWVPS---F---NEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWL- 187 (344) T ss_pred EEEEeCCceeEEe---ecC---C----EEEEEec---C---CeEEEEcCccEEEECCCCCCCCcccccHHHHHHHHHHH- Confidence 4544443322221 111 1 1355531 1 12345677778887543 24589988888776543 Q ss_pred HHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHh-c-CCCeEEecC----CCceeEEec Q lcl|NC_016762. 227 EKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLN-R-GNDVLLPTQ----GATVTQMVS 300 (456) Q Consensus 227 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~lid~----~d~~~~~~~ 300 (456) ...+......+|+|....-.+.... + ..+ .++..+++.+.++... . +...+++.. ++.++.... T Consensus 188 ~~~a~~~~~~~f~NGa~pg~Il~~~--d-~~l-------s~e~~~~lk~~~~~~~g~~~~r~l~l~~p~g~~~G~~~~pi 257 (344) T protein:vir:56 188 NESATLFRRKYYENGAHAGYIMYVT--D-AVQ-------DRNDIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPL 257 (344) T ss_pred HHHHHHHHHHHHhccCCCceEEEec--C-CCC-------CHHHHHHHHHHHHHhcCCCCccceEEecCCCCccceeEEEc Confidence 3334445556666643322221100 0 001 1233444444444322 1 223344431 233444444 Q ss_pred ccCCH----HHHHHHHHHHHHhhhcCCeEEeeccCCC---cccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcC Q lcl|NC_016762. 301 AVSDP----GPTYNVNLQTAAAGVDIPTKILVGMQTG---ERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGV 372 (456) Q Consensus 301 ~~sgl----~~~~~~~~~~~aaas~IP~t~L~G~sp~---Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~ 372 (456) +.+.- -++-....+.||++-+||-. |+|..+. |++.-+ -.+.|+. +.|.|.++++-++. .+ + T Consensus 258 s~~~~d~qf~e~k~~s~~eIa~afrVPp~-llGi~~~~t~~~~n~eq~~~~f~~-------~tL~Pl~~~ie~~n-~~-l 327 (344) T protein:vir:56 258 SEVATKDDFFNIKKASAADLLDAHRIPFQ-LMGGKPENVGSLGDIEKVAKVFVR-------NELIPLQDRIREIN-GW-I 327 (344) T ss_pred CCChHHHHHHHHHHhhHHHHHHHhCCCHH-HhccCCCCCCccccHHHHHHHHHH-------HHHHHHHHHHHHHH-hh-h Confidence 44333 34455567789999999996 7786554 333333 3444543 24566665554322 11 1 Q ss_pred cCCCCceEEEeCCCCCCCHHHHH Q lcl|NC_016762. 373 VPLKAEFTAIWDDLTVPTKAERL 395 (456) Q Consensus 373 ~~~~~d~~~~f~pL~~~seke~A 395 (456) +. + .+.|+|-.-. .+.| T Consensus 328 ~~---~-~~~F~~y~l~--~~~~ 344 (344) T protein:vir:56 328 GQ---E-VIRFKNYSLD--TDNG 344 (344) T ss_pred cc---c-cccCCCcccc--ccCC Confidence 11 1 1334333211 1111 No 218 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=95.55 E-value=0.0019 Score=35.55 Aligned_cols=310 Identities=12% Similarity=0.003 Sum_probs=139.7 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccc---h---hhhhc--cCcccCCHHHHHHHHhcCchhhhhhccchh Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRP---Q---AWCEY--GFPQEITFNDLYTMYRRGGIAHGAVEKIVT 72 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~---~---~~~~~--~~~~~~~~~~l~~~Y~~~~l~r~iVd~~ae 72 (456) |+.+... ...+...+.+. +-+|-....-+. .++ - .++.. +|-..+++..|..+++.|+-...++..-+. T Consensus 1 m~~~~~~-~~~~~~~~~~~-~~~~~~p~~~~~-~~~~~~~~~~~~~~~~~~~~pP~~~~~La~l~~~~~~h~~~L~~k~N 77 (337) T protein:vir:78 1 MTKRQQQ-PAQAAASSPRP-SVVFSMPEAIDP-TAWMTDYTGVFYNPYGEYYQPPIDRKGLAKVARANAHHGAILMARRN 77 (337) T ss_pred CCCcccC-cccccccCcee-EEEecCcccccC-cchhHhhhhhhhccCcceecCCCCHHHHHHHhhcchhhhhHHHhhhc Confidence 8865543 11111110000 111111111111 111 1 01111 355678888999999998887777765543 Q ss_pred HHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCceeEE Q lcl|NC_016762. 73 TCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLAKV 152 (456) Q Consensus 73 d~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i 152 (456) . +..++.- + + ..+.+ ++..-.++|-+++.+. +++ .+.++.+ T Consensus 78 ~-~~~~f~~-~--------------~---~~~~~---------~~~d~ll~GNay~~~~-rn~----------~G~~~~L 118 (337) T protein:vir:78 78 M-VAGRFTN-Q--------------R---ATITA---------FVHNYLQFGDGGLLKL-RNS----------FGQVVGL 118 (337) T ss_pred c-ccccCcC-c--------------H---HHHHH---------HHHHHHhhCCeEEEEE-ECC----------CCcEEEE Confidence 2 2223211 0 0 01111 1112346787777653 331 1112333 Q ss_pred EEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHHHHHHHH Q lcl|NC_016762. 153 TPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNSFISLEK 228 (456) Q Consensus 153 ~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~l~~~~~ 228 (456) .|+-...+.. ..| ..+|.+.. ++ ..+.+-+..|+|+.... ..|.|.++.+...+.--. T Consensus 119 ~pl~~~~v~~---~~d--------~~~~~~~~---~~---~~~~~~~~eIiHik~~~~~~~~~Gls~~~~a~~si~l~~- 180 (337) T protein:vir:78 119 HPLSSVYLRR---RED--------GCFVYLQQ---GK---PNLIYRPDDVIWLAQYDPEQQVYGMPDYLGGLQSALLNQ- 180 (337) T ss_pred EEeCCceeEe---eeC--------CeEEEEEc---CC---ceEEECCccEEEECCCCCCCCcccccHHHHHHHHHHHHH- Confidence 3333221211 111 12233321 11 12345556677765432 358898888777664332 Q ss_pred HHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHh--cCCCeEEec-C---CCceeEEeccc Q lcl|NC_016762. 229 VEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLN--RGNDVLLPT-Q---GATVTQMVSAV 302 (456) Q Consensus 229 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~lid-~---~d~~~~~~~~~ 302 (456) .+......+|+|....-.+.. +++ +.-.++..+++.+.++... .|.+.+++. . ++.++...++. T Consensus 181 aa~~~~~~~f~NGa~p~~il~-----~~~-----~~l~~e~~~~lk~~~~~~~G~~n~~~~~v~~~~g~~~Gi~~~pis~ 250 (337) T protein:vir:78 181 DATLFRRRYFLNGAHMGFIFY-----ATD-----PNMDDDTEEEMKEMIANSKGVGNFRSMFVNIPDGKPDGIKLIPVGD 250 (337) T ss_pred HHHHHHHHHHhccCCCceeEE-----cCC-----CCCCHHHHHHHHHHHHHhcCcccccceEEEcCCCCccceeEEEcCC Confidence 333344456666443222211 000 0111233445555555443 233333332 1 23344444444 Q ss_pred CCHH----HHHHHHHHHHHhhhcCCeEEeeccCCCc----ccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCc Q lcl|NC_016762. 303 SDPG----PTYNVNLQTAAAGVDIPTKILVGMQTGE----RASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGVV 373 (456) Q Consensus 303 sgl~----~~~~~~~~~~aaas~IP~t~L~G~sp~G----lnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~ 373 (456) +.-+ ++-....+.||++-+||-. |+|..+.+ ++.-+ -.+.|+. ..|.|.++++-+.+-+. +. T Consensus 251 ~~~d~qfle~k~~s~~eIa~a~~VPp~-llGi~~~~~~~~~~n~e~~~~~f~~-------~~L~P~~~~ie~~~n~~-ll 321 (337) T protein:vir:78 251 IATKDEFAAIKGITAQDVLTAHRYPPA-LAGIIPTNGGGGLGDPEKYDATYAR-------NEVLPLCELVQDAINSA-GL 321 (337) T ss_pred ChhHHHHHHHHHHhHHHHHHHhCCCHH-HcccccCCCcCccccHHHHHHHHHH-------HHHHHHHHHHHHHHhhh-cC Confidence 4332 3344567789999999985 67876544 32222 2345543 35788888777766443 33 Q ss_pred CCCCceEEEeCCCCCC Q lcl|NC_016762. 374 PLKAEFTAIWDDLTVP 389 (456) Q Consensus 374 ~~~~d~~~~f~pL~~~ 389 (456) +...-+.|+|++=--+ T Consensus 322 ~~~~~~~f~~~~~~~~ 337 (337) T protein:vir:78 322 PRALWVTFRETIGAAV 337 (337) T ss_pred ChhhceeccccccccC Confidence 4333355666554444 No 219 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=95.42 E-value=0.0021 Score=35.26 Aligned_cols=426 Identities=14% Similarity=0.104 Sum_probs=175.7 Q ss_pred CCc------hhH------HHHhHHHHHHHHHHHHHHhhhhhccCcccchhh-hhc-cCc-ccCCHHHHHHHHhc---Cch Q lcl|NC_016762. 1 MTD------KLD------LAVNHAMSSAIARARMSLLNQGIGHDAKRPQAW-CEY-GFP-QEITFNDLYTMYRR---GGI 62 (456) Q Consensus 1 ~~~------~~~------~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~-~~~-~~~-~~~~~~~l~~~Y~~---~~l 62 (456) |.. |.+ ...+++.+-+--...||-.-.-.+.++....++ ++| +.. ...+-.+|-..||+ ++. T Consensus 3 ~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pE 82 (516) T protein:vir:10 3 FLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINNPE 82 (516) T ss_pred chHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhccc Confidence 110 000 000011110000001111111111111111111 111 111 23355677777774 888 Q ss_pred hhhhhccchhHHh--hCCCEEecCCCcchhhhhHHHHHHHHHHHH----HhhHHHHHHHHHHhhcccCceEEEEEecCCC Q lcl|NC_016762. 63 AHGAVEKIVTTCW--KTNPQVIEGDDQDRSKDETEWERKNKPLIA----GGRFWRAVSEADRRRLVGRYSGLLLHIRDSQ 136 (456) Q Consensus 63 ~r~iVd~~aed~t--R~~~~i~~~~~~d~~~~~~~~e~~i~~~~~----~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~ 136 (456) +..+|+.++.||+ .++-.+++-+= +..+....+..+|..+++ -|++..+-.+..|.--+.|. ..+-.+-| T Consensus 83 vd~Av~eIVneaiv~d~~~~pV~l~L-~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgR-i~fhKiid-- 158 (516) T protein:vir:10 83 VERAVANIVNEAIVYERGHKVVSLDL-DDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSR-IFFHKIMP-- 158 (516) T ss_pred hhhHHHHhhcceeEecCCCceEEEEe-cccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcce-EEEEEEec-- Confidence 9999999999984 22222222111 222223333344544444 34455555555443333332 22222333 Q ss_pred CccccccCCcCceeEEEEeccccCChhhhh--cccccc--ccCCceeEEEeec----ccCC---ccccceeeehhhhhee Q lcl|NC_016762. 137 PWDRPARGKLNGLAKVTPAWAGCLKPKSFD--EKPDSE--TYGQPTMWEYTEA----SQAG---RPGLVRDIHPDRVFIL 205 (456) Q Consensus 137 ~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~--~Dp~s~--~yg~P~~y~i~~~----~~~g---~~~~~~~IH~SRli~~ 205 (456) .|- .|...|.+|.|+- +.....+ .|..+. --|--++|..++. ..+| .+...++|+.+- |.| T Consensus 159 ---~~k-~GI~Elr~lDPr~---i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dA-I~y 230 (516) T protein:vir:10 159 ---NPK-KGIAELRRLDPRF---MEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSA-VVY 230 (516) T ss_pred ---Ccc-ccceeeeeeCCcc---eeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhh-eee Confidence 111 1222333333322 1111000 111111 0111122322211 1111 223356777764 333 Q ss_pred cCC------cCCCcchHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHH Q lcl|NC_016762. 206 GDW------TGDAIGFLEPAY---NSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEA 276 (456) Q Consensus 206 ~~~------~~~G~S~le~~~---~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 276 (456) +.. ...=+|.|.++- |.|.-++-+ ..+|+-+ |....- -.=+|+-+|....+ ++-++ +. T Consensus 231 ~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDA-----lVIYRit-RAPeRR-vFYIDvGnlPk~KA---eqYl~---~i 297 (516) T protein:vir:10 231 ASSGLMDCSDRGIIGYLHNAVKPANQLKLLEDA-----MVIYRIT-RAPERR-VFYIDVGNMNNRKA---TEYVN---GI 297 (516) T ss_pred ecccceeCCCCceeeeehhhhHhHHhhHHHHhh-----HHHHhhh-ccccce-EEEEecCCCCchhH---HHHHH---HH Confidence 221 111145665543 223222221 1233321 111000 00123333332211 12222 22 Q ss_pred HHHHh------cCCCe-------EEe-----------cCCCceeEEe--cccCCHHHHHHHHHHHHHhhhcCCeEEeecc Q lcl|NC_016762. 277 ARQLN------RGNDV-------LLP-----------TQGATVTQMV--SAVSDPGPTYNVNLQTAAAGVDIPTKILVGM 330 (456) Q Consensus 277 ~~~~~------~~~~~-------~li-----------d~~d~~~~~~--~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~ 330 (456) |.+++ .++|- +-+ .++-+++++. -+++-++| +..|..-+=-|.++|++||=.. T Consensus 298 m~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~sRl~~e 376 (516) T protein:vir:10 298 MQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDD-VRWFNKKLYEALRIPLSRIPRD 376 (516) T ss_pred HHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHH-HHHHHHHHHHHhCCCcccccCC Confidence 22221 11110 000 0223455543 35677777 5678888889999999999888 Q ss_pred CCCccc---ch---HHHHHHHHHHHHHHHhhhhHHHHHHHHH-HHHhcCcC------CCCceEEEeCCCCCCCHHHHHHH Q lcl|NC_016762. 331 QTGERA---SS---EDQKYHNARCQARRVQELTFEINDLFAH-LMRIGVVP------LKAEFTAIWDDLTVPTKAERLAN 397 (456) Q Consensus 331 sp~Gln---st---~D~~nyyd~I~~~Qe~~lrp~L~~l~~~-l~~s~~~~------~~~d~~~~f~pL~~~seke~Aei 397 (456) +++.++ |+ -|.-.|...|.+.|.. +.+.+..+++. |++-+++. ....+.|.|+-=..-+|-..+|+ T Consensus 377 ~~~~~~~Gr~~EItRDEiKF~KFI~rLR~r-Fs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Ei 455 (516) T protein:vir:10 377 DGGMVIGGQDTAITRDELDFRKFVVQLQHD-FEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIET 455 (516) T ss_pred CCceeeccccchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHH Confidence 776652 33 3777899999999875 46666666653 44444442 33578999999999999999999 Q ss_pred HHHHHHHHHHHHH-cCCcCcCHHHHHH-HhcccCCCCCCCCcccCCCCCCCCC--cCCCCCCC Q lcl|NC_016762. 398 SKTMSEINSAAIG-TGEPVFTAEEIRE-EAGYDPLQGGDPLPDTEPEDEDAAR--TDPTGEQQ 456 (456) Q Consensus 398 ~~~~A~a~~~~~~-~g~~~i~~~E~R~-~~~~~~~~~~~~~~~~~~~d~~~~~--~d~~~~~e 456 (456) ...+..+.+..-- .| -.++-+=++. .+.+.- +.+..++..-++|..++ .+|..++. T Consensus 456 l~~R~~~l~~~dpyvG-ky~s~~yi~k~ILr~tD--eei~~e~k~I~~E~~~~~~~~p~~~~~ 515 (516) T protein:vir:10 456 LRLRVDALSQIEPYVG-KYVSHDYVMKNILQMTE--EQIAQEEKQIEQEAGIKRFQNPENEDD 515 (516) T ss_pred HHHHHHHHHHhhhhhc-cccchHHHHHHHhcCCH--hhHHHHHHHHHHhhhCCCCCCCCcccc Confidence 9988887765432 22 1344444433 222110 00000000000111111 12222222 No 220 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=95.42 E-value=0.0021 Score=35.26 Aligned_cols=426 Identities=14% Similarity=0.104 Sum_probs=175.7 Q ss_pred CCc------hhH------HHHhHHHHHHHHHHHHHHhhhhhccCcccchhh-hhc-cCc-ccCCHHHHHHHHhc---Cch Q lcl|NC_016762. 1 MTD------KLD------LAVNHAMSSAIARARMSLLNQGIGHDAKRPQAW-CEY-GFP-QEITFNDLYTMYRR---GGI 62 (456) Q Consensus 1 ~~~------~~~------~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~-~~~-~~~-~~~~~~~l~~~Y~~---~~l 62 (456) |.. |.+ ...+++.+-+--...||-.-.-.+.++....++ ++| +.. ...+-.+|-..||+ ++. T Consensus 3 ~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pE 82 (516) T protein:vir:10 3 FLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINNPE 82 (516) T ss_pred chHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhccc Confidence 110 000 000011110000001111111111111111111 111 111 23355677777774 888 Q ss_pred hhhhhccchhHHh--hCCCEEecCCCcchhhhhHHHHHHHHHHHH----HhhHHHHHHHHHHhhcccCceEEEEEecCCC Q lcl|NC_016762. 63 AHGAVEKIVTTCW--KTNPQVIEGDDQDRSKDETEWERKNKPLIA----GGRFWRAVSEADRRRLVGRYSGLLLHIRDSQ 136 (456) Q Consensus 63 ~r~iVd~~aed~t--R~~~~i~~~~~~d~~~~~~~~e~~i~~~~~----~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~ 136 (456) +..+|+.++.||+ .++-.+++-+= +..+....+..+|..+++ -|++..+-.+..|.--+.|. ..+-.+-| T Consensus 83 vd~Av~eIVneaiv~d~~~~pV~l~L-~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgR-i~fhKiid-- 158 (516) T protein:vir:10 83 VERAVANIVNEAIVYERGHKVVSLDL-DDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSR-IFFHKIMP-- 158 (516) T ss_pred hhhHHHHhhcceeEecCCCceEEEEe-cccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcce-EEEEEEec-- Confidence 9999999999984 22222222111 222223333344544444 34455555555443333332 22222333 Q ss_pred CccccccCCcCceeEEEEeccccCChhhhh--cccccc--ccCCceeEEEeec----ccCC---ccccceeeehhhhhee Q lcl|NC_016762. 137 PWDRPARGKLNGLAKVTPAWAGCLKPKSFD--EKPDSE--TYGQPTMWEYTEA----SQAG---RPGLVRDIHPDRVFIL 205 (456) Q Consensus 137 ~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~--~Dp~s~--~yg~P~~y~i~~~----~~~g---~~~~~~~IH~SRli~~ 205 (456) .|- .|...|.+|.|+- +.....+ .|..+. --|--++|..++. ..+| .+...++|+.+- |.| T Consensus 159 ---~~k-~GI~Elr~lDPr~---i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dA-I~y 230 (516) T protein:vir:10 159 ---NPK-KGIAELRRLDPRF---MEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSA-VVY 230 (516) T ss_pred ---Ccc-ccceeeeeeCCcc---eeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhh-eee Confidence 111 1222333333322 1111000 111111 0111122322211 1111 223356777764 333 Q ss_pred cCC------cCCCcchHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHH Q lcl|NC_016762. 206 GDW------TGDAIGFLEPAY---NSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEA 276 (456) Q Consensus 206 ~~~------~~~G~S~le~~~---~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 276 (456) +.. ...=+|.|.++- |.|.-++-+ ..+|+-+ |....- -.=+|+-+|....+ ++-++ +. T Consensus 231 ~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDA-----lVIYRit-RAPeRR-vFYIDvGnlPk~KA---eqYl~---~i 297 (516) T protein:vir:10 231 ASSGLMDCSDRGIIGYLHNAVKPANQLKLLEDA-----MVIYRIT-RAPERR-VFYIDVGNMNNRKA---TEYVN---GI 297 (516) T ss_pred ecccceeCCCCceeeeehhhhHhHHhhHHHHhh-----HHHHhhh-ccccce-EEEEecCCCCchhH---HHHHH---HH Confidence 221 111145665543 223222221 1233321 111000 00123333332211 12222 22 Q ss_pred HHHHh------cCCCe-------EEe-----------cCCCceeEEe--cccCCHHHHHHHHHHHHHhhhcCCeEEeecc Q lcl|NC_016762. 277 ARQLN------RGNDV-------LLP-----------TQGATVTQMV--SAVSDPGPTYNVNLQTAAAGVDIPTKILVGM 330 (456) Q Consensus 277 ~~~~~------~~~~~-------~li-----------d~~d~~~~~~--~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~ 330 (456) |.+++ .++|- +-+ .++-+++++. -+++-++| +..|..-+=-|.++|++||=.. T Consensus 298 m~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~sRl~~e 376 (516) T protein:vir:10 298 MQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDD-VRWFNKKLYEALRIPLSRIPRD 376 (516) T ss_pred HHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHH-HHHHHHHHHHHhCCCcccccCC Confidence 22221 11110 000 0223455543 35677777 5678888889999999999888 Q ss_pred CCCccc---ch---HHHHHHHHHHHHHHHhhhhHHHHHHHHH-HHHhcCcC------CCCceEEEeCCCCCCCHHHHHHH Q lcl|NC_016762. 331 QTGERA---SS---EDQKYHNARCQARRVQELTFEINDLFAH-LMRIGVVP------LKAEFTAIWDDLTVPTKAERLAN 397 (456) Q Consensus 331 sp~Gln---st---~D~~nyyd~I~~~Qe~~lrp~L~~l~~~-l~~s~~~~------~~~d~~~~f~pL~~~seke~Aei 397 (456) +++.++ |+ -|.-.|...|.+.|.. +.+.+..+++. |++-+++. ....+.|.|+-=..-+|-..+|+ T Consensus 377 ~~~~~~~Gr~~EItRDEiKF~KFI~rLR~r-Fs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Ei 455 (516) T protein:vir:10 377 DGGMVIGGQDTAITRDELDFRKFVVQLQHD-FEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIET 455 (516) T ss_pred CCceeeccccchhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHH Confidence 776652 33 3777899999999875 46666666653 44444442 33578999999999999999999 Q ss_pred HHHHHHHHHHHHH-cCCcCcCHHHHHH-HhcccCCCCCCCCcccCCCCCCCCC--cCCCCCCC Q lcl|NC_016762. 398 SKTMSEINSAAIG-TGEPVFTAEEIRE-EAGYDPLQGGDPLPDTEPEDEDAAR--TDPTGEQQ 456 (456) Q Consensus 398 ~~~~A~a~~~~~~-~g~~~i~~~E~R~-~~~~~~~~~~~~~~~~~~~d~~~~~--~d~~~~~e 456 (456) ...+..+.+..-- .| -.++-+=++. .+.+.- +.+..++..-++|..++ .+|..++. T Consensus 456 l~~R~~~l~~~dpyvG-ky~s~~yi~k~ILr~tD--eei~~e~k~I~~E~~~~~~~~p~~~~~ 515 (516) T protein:vir:10 456 LRLRVDALSQIEPYVG-KYVSHDYVMKNILQMTE--EQIAQEEKQIEQEAGIKRFQNPENEDD 515 (516) T ss_pred HHHHHHHHHHhhhhhc-cccchHHHHHHHhcCCH--hhHHHHHHHHHHhhhCCCCCCCCcccc Confidence 9988887765432 22 1344444433 222110 00000000000111111 12222222 No 221 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=95.09 E-value=0.0028 Score=34.60 Aligned_cols=417 Identities=15% Similarity=0.149 Sum_probs=177.5 Q ss_pred CCchhHHH------HhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcc------cCCHHHHHHHHhc---Cchhhh Q lcl|NC_016762. 1 MTDKLDLA------VNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQ------EITFNDLYTMYRR---GGIAHG 65 (456) Q Consensus 1 ~~~~~~~~------~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~------~~~~~~l~~~Y~~---~~l~r~ 65 (456) -..|.+-. .+++.+-+--..-||-.-.-++.++. ......|+. .+.-++|-.-||+ ++.+.. T Consensus 3 ~w~~~de~~~~~~~~~~~~S~~~p~~~DGa~~i~~~~~~~---~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~ 79 (511) T protein:vir:56 3 FWTKEEEQDIQKIEKNPVRSFSAPDNVDGAKEIHTNLLAP---QLGHAIIPSDAQSEGTIPVKELIKSYRALAEYHEVDD 79 (511) T ss_pred CccchhhhhhhhhccCCcccccCCCCCCCceEEecccccc---eecceeccccccccCccchHHHHHHHHHHhhccchhh Confidence 11111111 11111100000111111100111111 011111221 2222688888874 788999 Q ss_pred hhccchhHHh--hCCCEEecCCCcchhhhhHHHHHHHHHHHH----HhhHHHHHHHHHHhhcccCceEEEEE-ecCCCCc Q lcl|NC_016762. 66 AVEKIVTTCW--KTNPQVIEGDDQDRSKDETEWERKNKPLIA----GGRFWRAVSEADRRRLVGRYSGLLLH-IRDSQPW 138 (456) Q Consensus 66 iVd~~aed~t--R~~~~i~~~~~~d~~~~~~~~e~~i~~~~~----~l~~~~~~~ea~~~~r~~Ggs~i~i~-i~D~~~~ 138 (456) +|+.++.||+ .+.-.+++-+= +..+....+..+|..+++ -|++..+..+..|.--+.|. +++. +-|++. T Consensus 80 Av~eIvne~iv~d~~~~pV~l~l-d~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgR--i~fHkiid~k~- 155 (511) T protein:vir:56 80 AIQEIVDEAIVYENDKEVVWLNL-DNTDFSENIKAKINEEFDRVVSLLQMRKHGYKWFRKWYVDSR--IYFHKILDKDN- 155 (511) T ss_pred HHHHhhcceeEecCCCceEEEEe-cccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcce--EEEEEEecccc- Confidence 9999999984 22222222111 222333333344444443 34455555555443333332 3332 224321 Q ss_pred cccccCCcCceeEEEEeccccCChhhhhccc-cc-cc-cCCceeEEEeeccc--------CCccccceeeehhhhh--ee Q lcl|NC_016762. 139 DRPARGKLNGLAKVTPAWAGCLKPKSFDEKP-DS-ET-YGQPTMWEYTEASQ--------AGRPGLVRDIHPDRVF--IL 205 (456) Q Consensus 139 ~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp-~s-~~-yg~P~~y~i~~~~~--------~g~~~~~~~IH~SRli--~~ 205 (456) |...|.+|.|+-..-+.. ..+++ .+ .. -+--++|..++.+. .|....+.+|+.+-+. ++ T Consensus 156 ------GI~eLr~lDPr~i~~vr~--i~~~~~~~~~v~~~~~ey~~Y~~~~~~~~~~~~~~~~~~~~vkI~~daI~y~hS 227 (511) T protein:vir:56 156 ------NIIELRPLNPMKMELVRE--IQKETIDGVEVVKGTLEYYVYKQSDYKMPSWMSATNRAQTSFRIPKDAIVFAHS 227 (511) T ss_pred ------ceeehhhcCcccchhhhh--hhcccccccccccceeeeeEecCCCcccCcccccccccccceeechhheeeecc Confidence 333344444433221111 11111 00 00 11133444433211 1112345778888763 32 Q ss_pred c-----CCcCCCcchHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhhh-hhhhhhccHhhHHhhhcCCHHHHHHHHHHH Q lcl|NC_016762. 206 G-----DWTGDAIGFLEPAY---NSFISLEKVEGGSGESFLKNAARQLL-LNFDKEINLGEIASTYGVTLDALNERFNEA 276 (456) Q Consensus 206 ~-----~~~~~G~S~le~~~---~~l~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 276 (456) + ...+-.+|.|.++- |.|.-++-+ ..+|+-+ |... ..| =+|+-+|....+ ++-++ +. T Consensus 228 GL~d~~~~~g~i~syLhkAiKp~NQLkm~EDA-----lVIYRit-RAPeRRvF--YIDVGnLPk~KA---eqYl~---~i 293 (511) T protein:vir:56 228 GLMRGCADDPYIIGYLDRAIKPANQLKMLEDA-----LVIYRLA-RAPERRVF--YVDVGNLPTQKA---QQYVN---GI 293 (511) T ss_pred cceeccCCCCeeeccchhhhHHHHhhHHHHhh-----HHHHhhh-ccccceEE--EEecCCCCchhH---HHHHH---HH Confidence 2 12223567777654 333322222 1333321 1110 000 123333332211 12221 22 Q ss_pred HHHHh------cCCCe-------EEe-----------cCCCceeEEe--cccCCHHHHHHHHHHHHHhhhcCCeEEeecc Q lcl|NC_016762. 277 ARQLN------RGNDV-------LLP-----------TQGATVTQMV--SAVSDPGPTYNVNLQTAAAGVDIPTKILVGM 330 (456) Q Consensus 277 ~~~~~------~~~~~-------~li-----------d~~d~~~~~~--~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~ 330 (456) |..++ .++|- +-| .++-+++.+. -+++-++| +..|..-+=.|.++|++||=.. T Consensus 294 M~k~kNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kKLy~aLnVP~SRl~~e 372 (511) T protein:vir:56 294 MQNVKNRVVYDTQTGQVKNTTNAMSMLEDYYLPRREGSKGTEVSTLPGGQSLGDIED-VLYFNRKLYKAMRIPTSRAASE 372 (511) T ss_pred HHhcCceEEEeccCceeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHH-HHHHHHHHHHHhCCCcccccCC Confidence 22221 11110 000 0223455543 35667777 5678888889999999999865 Q ss_pred -CCCccc---chH---HHHHHHHHHHHHHHhhhhHHHHHHHHH-HHHhcCcC------CCCceEEEeCCCCCCCHHHHHH Q lcl|NC_016762. 331 -QTGERA---SSE---DQKYHNARCQARRVQELTFEINDLFAH-LMRIGVVP------LKAEFTAIWDDLTVPTKAERLA 396 (456) Q Consensus 331 -sp~Gln---st~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~-l~~s~~~~------~~~d~~~~f~pL~~~seke~Ae 396 (456) +++|+| |++ |.-.|...|.+.|.. +.+.+..+++. |++-+++. ....+.|.|+-=..-+|-..+| T Consensus 373 ~q~~~f~~Gr~~EItRDEiKF~KFI~RLR~r-Fs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~E 451 (511) T protein:vir:56 373 DQTGGINFGQGAEITRDELKFTKFVKRLQTK-FETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELE 451 (511) T ss_pred CCccccccccchhhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHH Confidence 557776 333 677899999998875 46666666653 44444442 3357899999999999999999 Q ss_pred HHHHHHHHHHHHHHc-CCcCcCHHHHHH-HhcccCCCCCCCCcccCCCCC--CCCCcCCCC--CCC Q lcl|NC_016762. 397 NSKTMSEINSAAIGT-GEPVFTAEEIRE-EAGYDPLQGGDPLPDTEPEDE--DAARTDPTG--EQQ 456 (456) Q Consensus 397 i~~~~A~a~~~~~~~-g~~~i~~~E~R~-~~~~~~~~~~~~~~~~~~~d~--~~~~~d~~~--~~e 456 (456) +...+..+.+..-.. |. .++-+=++. .+.+.- +++...+. +.+..+|.- .+| T Consensus 452 il~~Rl~~l~~~dpyvGk-y~S~~yi~k~ILr~tD-------eei~~~~k~I~~E~k~~~~~~~e~ 509 (511) T protein:vir:56 452 ILNSRMNAMRDIQDYAGK-YYSHKYIQKNILRLSD-------DQITAMQSEIDEEETNPRFQQDDQ 509 (511) T ss_pred HHHHHHHHHHHhcchhcc-ccchHHHHHHHhccCH-------HHHHHHHHHHHHhhcCCCCCCccc Confidence 998888877654322 21 234433322 111100 00000000 000000000 000 No 222 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=94.97 E-value=0.0031 Score=34.36 Aligned_cols=398 Identities=13% Similarity=0.070 Sum_probs=165.8 Q ss_pred CCchhHH-------HHh-HHHHHHHHHHHHHHhh-hhhccCccc-chhhhhccCcccCCHHH-HHHHHhcCchhhhhhcc Q lcl|NC_016762. 1 MTDKLDL-------AVN-HAMSSAIARARMSLLN-QGIGHDAKR-PQAWCEYGFPQEITFND-LYTMYRRGGIAHGAVEK 69 (456) Q Consensus 1 ~~~~~~~-------~~~-~a~~~~~~~~~d~~~n-~~~~~gt~~-~~~~~~~~~~~~~~~~~-l~~~Y~~~~l~r~iVd~ 69 (456) |+.=+.. ... .....+....++-+.+ ++.|+.-.| ........-.....+.+ ++.+-.+..-+..++.+ T Consensus 1 m~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~~~~~L~~dm~~~D~hi~s~l~~ 80 (512) T protein:vir:19 1 MGRILDISGQPFDFDDEMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDLTAQADLAFDMEEKDTHLFSELSK 80 (512) T ss_pred CcceeCCCCCccccccccccccchhcccchhhccccccCCCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHHHH Confidence 2211100 000 0000111222333332 233442222 11222222221111222 23444456666666666 Q ss_pred chhHHhhCCCEEecCCCcc-hhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEec-CCCCccccccCCcC Q lcl|NC_016762. 70 IVTTCWKTNPQVIEGDDQD-RSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIR-DSQPWDRPARGKLN 147 (456) Q Consensus 70 ~aed~tR~~~~i~~~~~~d-~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~-D~~~~~~Pl~~~~~ 147 (456) .-.-.+..-|+|.-+.+.+ ..+. ....++..+.++.-+..+..-+-.+.+||+|++=|.-. ++..+ ... T Consensus 81 Rk~av~~~~w~I~p~~~~~~~~~~---~a~~v~~~l~~~~~f~~~~~~lldA~~~G~s~~Ei~w~~~~g~~------~~~ 151 (512) T protein:vir:19 81 RRLAIQALEWRIAPARDASAQEKK---DADMLNEYLHDAAWFEDALFDAGDAILKGYSMQEIEWGWLGKMR------VPV 151 (512) T ss_pred HHHHHhCCCceEecCCCCCHHHHH---HHHHHHHHHhcCCCHHHHHHHHHhhhhhcceeeeeEeeeeCCce------eee Confidence 6666665555775433221 1111 12235555655542344444444689999998755321 22111 112 Q ss_pred ceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhhee----cCCcCCCcchHHHHHHHH Q lcl|NC_016762. 148 GLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFIL----GDWTGDAIGFLEPAYNSF 223 (456) Q Consensus 148 ~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~----~~~~~~G~S~le~~~~~l 223 (456) .|..+.+.|-. .|+.... + ..+.... ..+..+++-+.+.+ .....+|.++++.||... T Consensus 152 ~~~~r~~~~f~--------~~~~~~~----~-lr~~~~~-----~~G~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~ 213 (512) T protein:vir:19 152 ALHHRDPALFC--------ANPDNLN----E-LRLRDAS-----YHGLELQPFGWFMHRAKSRTGYVGTNGLVRTLIWPF 213 (512) T ss_pred eeeeeccccce--------eccCCCc----E-EEecCCC-----CCceeecCCceEEEeccCCCCCcccccHHHHHHHHH Confidence 23333333321 1111100 0 0010000 01222333332222 235667999999988644 Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhc-CCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEeccc Q lcl|NC_016762. 224 ISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYG-VTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSAV 302 (456) Q Consensus 224 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~~ 302 (456) +--.....-++. |..++.+--...... ...++-.+.+.+.+..+.+ ....+|..+.+++-+++.= T Consensus 214 ~fK~~~~~~w~~-------------f~E~yG~P~~igky~~~a~~~ek~~L~~al~~~~~-~a~~iiP~~~~ie~~ea~~ 279 (512) T protein:vir:19 214 IFKNYSVRDFAE-------------FLEIYGLPMRVGKYPTGSTNREKATLMQAVMDIGR-RAGGIIPMGMTLDFQSAAD 279 (512) T ss_pred HHHHHHHHHHHH-------------HHHHcCCCeeEEecCCCCCHHHHHHHHHHHHHHhh-CcEEEecCCceEEEeecCC Confidence 322222222221 111111100000001 1123445566666666644 4567788888888887654 Q ss_pred CCH---HHHHHHHHHHHHhhhcCCeEEeeccC------CCcccchHH--HHHHHHHHHHHHHhhhhHHHH-HHHHHHHHh Q lcl|NC_016762. 303 SDP---GPTYNVNLQTAAAGVDIPTKILVGMQ------TGERASSED--QKYHNARCQARRVQELTFEIN-DLFAHLMRI 370 (456) Q Consensus 303 sgl---~~~~~~~~~~~aaas~IP~t~L~G~s------p~Glnst~D--~~nyyd~I~~~Qe~~lrp~L~-~l~~~l~~s 370 (456) ++. ..+++..-.+||-+ ++|+. .+|-+|.++ .....+.+.+... .+...|. .|+.-|+.. T Consensus 280 ~~~~~y~~li~~~d~~Isk~-------iLGqtlTs~~g~~Gs~a~~~vh~ev~~di~~aDa~-~i~~tln~~li~~l~~~ 351 (512) T protein:vir:19 280 GQSDPFMAMIGWAEKAISKA-------ILGGTLTTEAGDKGARSLGEVHDEVRREIRNADVG-QLARSINRDLIYPLLAL 351 (512) T ss_pred CCHHHHHHHHHHHHHHHHHH-------HhhhhhcccccccchhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHh Confidence 333 33344444556644 34442 122233333 3456666666665 4566674 588888888 Q ss_pred cCcCCCC---ceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCcccCCCC---- Q lcl|NC_016762. 371 GVVPLKA---EFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDTEPED---- 443 (456) Q Consensus 371 ~~~~~~~---d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~~~~d---- 443 (456) .+++..+ --.|.|..-- .+-.++.|++...+. .|. .++.+.+|+..+.+...+.+......+.. T Consensus 352 N~~~~~~~~~~p~~~f~~~e-------~eDl~~~a~~~~~l~-~G~-~i~~~~i~e~~Gip~~~~~e~~~~~~~~~~~~~ 422 (512) T protein:vir:19 352 NSDSTIDINRLPGIVFDTSE-------AGDITALSDAIPKLA-AGM-RIPVSWIQEKLHIPQPVGDEAVFTIQPVVPDNG 422 (512) T ss_pred CCCCCCCccccceEEecCCC-------hhhHHHHHHHHHHHh-cCC-CCCHHHHHHHhCCCCCCCccccccCCCcccccc Confidence 8874321 1233443322 112244556666554 564 57999999998875433322211111100 Q ss_pred CCCC---CcCCCCCC-C Q lcl|NC_016762. 444 EDAA---RTDPTGEQ-Q 456 (456) Q Consensus 444 ~~~~---~~d~~~~~-e 456 (456) .... ..+....+ + T Consensus 423 ~~~~~~~~~~~~~~~~~ 439 (512) T protein:vir:19 423 SQKEAALSAEDIPQEDD 439 (512) T ss_pred ccccccccccCCCchhh Confidence 0000 00000011 1 No 223 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=94.58 E-value=0.0036 Score=33.99 Aligned_cols=234 Identities=10% Similarity=0.002 Sum_probs=103.9 Q ss_pred HHHhhhhhccCc--ccchhhhhcc----Cc----ccCCHHHHHHHHhcCchhhhhhccchhHHhhCCCEEecCCCcchhh Q lcl|NC_016762. 22 MSLLNQGIGHDA--KRPQAWCEYG----FP----QEITFNDLYTMYRRGGIAHGAVEKIVTTCWKTNPQVIEGDDQDRSK 91 (456) Q Consensus 22 d~~~n~~~~~gt--~~~~~~~~~~----~~----~~~~~~~l~~~Y~~~~l~r~iVd~~aed~tR~~~~i~~~~~~d~~~ 91 (456) |+|.+......+ ..+......+ +. ...+. .-+..+.-+..+|++++++.-.--+++....+... T Consensus 1 MglF~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~v~~----~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~~-- 74 (251) T protein:vir:46 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPSFQGTKLRQYKD----IEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINY-- 74 (251) T ss_pred CCccccccccccCCCccchhhhhhhhccccCcCcceech----hhhhccHHHHHHHHHHHHhHhhCceEEeeCccccc-- Confidence 343321111110 0000000000 00 11222 22346777889999999999887777764332221 Q ss_pred hhHHHHHHHHHHHHHhhHHHHHHHHHHhh-cccCceEEEEEecCCCCccccccCCcCceeEEEEeccccCChhhhhcccc Q lcl|NC_016762. 92 DETEWERKNKPLIAGGRFWRAVSEADRRR-LVGRYSGLLLHIRDSQPWDRPARGKLNGLAKVTPAWAGCLKPKSFDEKPD 170 (456) Q Consensus 92 ~~~~~e~~i~~~~~~l~~~~~~~ea~~~~-r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~~Dp~ 170 (456) ...+.+.+...-...--+..|.+++.+. .++|-+++++. +|+. +.+..+.|+-...+++. .| T Consensus 75 -~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~-r~~~----------G~~~~L~~i~~~~v~v~---~~-- 137 (251) T protein:vir:46 75 -SDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEIT-RDKT----------GEPMNLTFRKTSEIELK---SD-- 137 (251) T ss_pred -cchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEE-ECCC----------CcEEEEEEECCceEEEE---EC-- Confidence 1122222322222223344566665554 55677766653 3322 11334444444434332 11 Q ss_pred ccccCCceeEEEeecccCCccccceeeehhhhheecCCc---CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh Q lcl|NC_016762. 171 SETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT---GDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLL 247 (456) Q Consensus 171 s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~---~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~ 247 (456) +.|.+.++... .... ..+....+.++.||||...+ ..|.|.++.+.+.+.... .+...+..++++..+.-.+ T Consensus 138 --~~g~~~~~~~~-~~~~-~~g~~~~~~~~diiH~r~~~~dg~~G~spi~~~~~~i~~~~-~~~~~~~~~f~ng~~p~gi 212 (251) T protein:vir:46 138 --ARGRLYYFHQR-IDSN-GNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDN-NGKDFLNNFLRNGTHAGGI 212 (251) T ss_pred --CCCcEEEEEEE-eccC-CcceeEEECCccEEEecCcCCCCeeecCHHHHHHHHHHHHH-HHHHHHHHHHHccCCCcEE Confidence 23455444332 1111 11223567788888886443 358999999988765443 3445555666664432222 Q ss_pred hhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC---CCeEEecCCC Q lcl|NC_016762. 248 NFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG---NDVLLPTQGA 293 (456) Q Consensus 248 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~lid~~d 293 (456) - ++. ..+ ..++..+++.+.++.+.++ -+.+.++.++ T Consensus 213 l---~~~-~~l------~~~e~~~~~~~~~~~~~~g~~n~g~~~~gm~~ 251 (251) T protein:vir:46 213 L---KMK-GVL------DNKKARDRAREEFPKVLVELNKLGKLSYSMNQ 251 (251) T ss_pred E---EeC-CCC------CCHHHHHHHHHHHHHHhcCcccccccccccCC Confidence 1 111 011 1123344444444444443 2333343333 No 224 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=94.45 E-value=0.0044 Score=33.51 Aligned_cols=426 Identities=13% Similarity=0.091 Sum_probs=173.1 Q ss_pred CCc------hh------HHHHhHHHHHHHHHHHHHHhhhhhccCcccchhh-hhc-c-CcccCCHHHHHHHHhc---Cch Q lcl|NC_016762. 1 MTD------KL------DLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAW-CEY-G-FPQEITFNDLYTMYRR---GGI 62 (456) Q Consensus 1 ~~~------~~------~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~-~~~-~-~~~~~~~~~l~~~Y~~---~~l 62 (456) |.. |. +...+++.+-+--..-||-.-.-.+.++..-..+ ++| + .+...+-.+|-.-||+ ++- T Consensus 3 ~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~~LI~~YR~ma~~pE 82 (516) T protein:vir:10 3 FLDLFKFWDRVDQNEYDERLKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDNNISGTKDLINTYRQLTNNPE 82 (516) T ss_pred chHhcccccchhhHHHHhhhcCCCCcccCCCCccCceeeecCcccccccceeeeeecccCccccHHHHHHHHHHhhhccc Confidence 110 00 0000011110000011111111111222111111 111 1 1233344677777774 778 Q ss_pred hhhhhccchhHHh--hCCCEEecCCCcchhhhhHHHHHHHHHHHH----HhhHHHHHHHHHHhhcccCceEEEEEecCCC Q lcl|NC_016762. 63 AHGAVEKIVTTCW--KTNPQVIEGDDQDRSKDETEWERKNKPLIA----GGRFWRAVSEADRRRLVGRYSGLLLHIRDSQ 136 (456) Q Consensus 63 ~r~iVd~~aed~t--R~~~~i~~~~~~d~~~~~~~~e~~i~~~~~----~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~ 136 (456) +..+|+.++.||+ .+.-.+++-+ -+..+....+..+|..+++ -|++..+..+..|.--+.|. ..+-.+-| T Consensus 83 vd~Av~eIvneaiv~d~~~~pV~l~-l~~~e~s~sik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgR-i~fhKiid-- 158 (516) T protein:vir:10 83 VERAVANIVNEAVVYEKGHKVVSLD-LDDTEFSSSIKDKILEEFDEICRLLDASRKLDTLFRRWYIDSR-IFFHKIMP-- 158 (516) T ss_pred hhHHHHHhhcceeEecCCCceEEEE-ecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhhhhcce-EEEEEEec-- Confidence 8899999999984 3333333222 1222233333344544444 34455555555443333332 22222333 Q ss_pred CccccccCCcCceeEEEEeccccCChhhhh-c-cccc-cc-cCCceeEEEeecc----cCC---ccccceeeehhhhhee Q lcl|NC_016762. 137 PWDRPARGKLNGLAKVTPAWAGCLKPKSFD-E-KPDS-ET-YGQPTMWEYTEAS----QAG---RPGLVRDIHPDRVFIL 205 (456) Q Consensus 137 ~~~~Pl~~~~~~l~~i~~~~~~~~~~~~~~-~-Dp~s-~~-yg~P~~y~i~~~~----~~g---~~~~~~~IH~SRli~~ 205 (456) .|- .|...|.+|.|+- +.....+ + |..+ .- -|--++|...+.. .+| .+...++|+.+-+ .| T Consensus 159 ---~~k-~GI~elr~lDPr~---i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~daI-~y 230 (516) T protein:vir:10 159 ---NPK-EGIVELRRLDPRH---VEYYREIVTSDVGGTSVVKGYREFFVYTTGNEGYAYNGRLFEPNTRIKIPRSAI-VY 230 (516) T ss_pred ---Ccc-cceeeeeeeCCcc---eeeEEeeecccCcchhhhhceeeeeeeecCccceeccccccCCCCceecchhhe-ee Confidence 111 1222233333322 2111111 0 1000 00 1111222221110 111 1233466666533 33 Q ss_pred cCC-----cCCC-cchHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHH Q lcl|NC_016762. 206 GDW-----TGDA-IGFLEPAY---NSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEA 276 (456) Q Consensus 206 ~~~-----~~~G-~S~le~~~---~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 276 (456) +.. ...+ +|.|.++- |.|.-++-+ -.+|+-+ |....- -.=+|+-+|....+ ++-++ +. T Consensus 231 ~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDA-----lVIYRit-RAPeRR-vFYIDVGnLPk~KA---eqYl~---~i 297 (516) T protein:vir:10 231 AHSGLQDCSDRGIVGYLHNAVKPANQLKLLEDA-----LVIYRIT-RAPERR-VFYIDVGNMPNRKA---TEYVN---GI 297 (516) T ss_pred eecCcccCCCCceeceehhhhHhHHhhHHHHhh-----HHHHhhh-ccccce-EEEEecCCCCchhH---HHHHH---HH Confidence 211 1111 35665543 222222221 1233321 111000 00123333332211 12222 22 Q ss_pred HHHHh------cCCCe-------EEe-----------cCCCceeEEe--cccCCHHHHHHHHHHHHHhhhcCCeEEeecc Q lcl|NC_016762. 277 ARQLN------RGNDV-------LLP-----------TQGATVTQMV--SAVSDPGPTYNVNLQTAAAGVDIPTKILVGM 330 (456) Q Consensus 277 ~~~~~------~~~~~-------~li-----------d~~d~~~~~~--~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~ 330 (456) |.+++ .++|- +-| .++-+++++. -+++-++| +..|..-+=-|.++|++||=.. T Consensus 298 M~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~D-V~YF~kkLy~aLnVP~SRl~~e 376 (516) T protein:vir:10 298 MQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVTSLPGAQTMGEMDD-VRWFNKKLYEALRIPLSRMPRD 376 (516) T ss_pred HHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHH-HHHHHHHHHHHhCCCcccccCC Confidence 22221 11110 000 0223455543 35667777 5678888889999999999887 Q ss_pred CCCccc---ch---HHHHHHHHHHHHHHHhhhhHHHHHHHHH-HHHhcCc------CCCCceEEEeCCCCCCCHHHHHHH Q lcl|NC_016762. 331 QTGERA---SS---EDQKYHNARCQARRVQELTFEINDLFAH-LMRIGVV------PLKAEFTAIWDDLTVPTKAERLAN 397 (456) Q Consensus 331 sp~Gln---st---~D~~nyyd~I~~~Qe~~lrp~L~~l~~~-l~~s~~~------~~~~d~~~~f~pL~~~seke~Aei 397 (456) +++.++ |+ -|.-.|...|.+.|..+ .+....+++. |++-+++ .....+.|.|+-=..-+|-..+|+ T Consensus 377 ~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rF-s~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Ei 455 (516) T protein:vir:10 377 DGGMVIGGQDMAITRDELDFRKFIVQLQHNF-EEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIET 455 (516) T ss_pred CCceeeccccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHH Confidence 776652 33 27778999999888753 4544444443 4443433 233578999999999999999999 Q ss_pred HHHHHHHHHHHHH-cCCcCcCHHHHHH-HhcccCCCCCCCCcccCCCCCCCC--CcCCCCCCC Q lcl|NC_016762. 398 SKTMSEINSAAIG-TGEPVFTAEEIRE-EAGYDPLQGGDPLPDTEPEDEDAA--RTDPTGEQQ 456 (456) Q Consensus 398 ~~~~A~a~~~~~~-~g~~~i~~~E~R~-~~~~~~~~~~~~~~~~~~~d~~~~--~~d~~~~~e 456 (456) ...+..+.+..-- .| -.++-+=++. .+.+.- +.+..++..-++|..+ =.+|..+.+ T Consensus 456 l~~Rl~~l~~~dpyvG-ky~s~~yi~k~ILr~tD--eei~~~~k~I~~E~~~~~~~~p~~e~~ 515 (516) T protein:vir:10 456 LRQRVDALSQIEPYVG-KYVSHDYVMKNILQMTD--EQIAQEEKQIEKEANVKRFQNPENEDD 515 (516) T ss_pred HHHHHHHHHHhhhhhc-cccchHHHHHHHhcCCH--hHHHHHHHHHHHhhhCCCCCCCCcccc Confidence 9988887765432 22 1345444433 222210 0000000000001111 012222222 No 225 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=94.41 E-value=0.0045 Score=33.45 Aligned_cols=318 Identities=10% Similarity=0.048 Sum_probs=137.0 Q ss_pred CCchhHHHHhHHHHHHHHHH-H---HHHhhhhhccCccc--chh--hhhc--cCcccCCHHHHHHHHhcCchhhhhhccc Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARA-R---MSLLNQGIGHDAKR--PQA--WCEY--GFPQEITFNDLYTMYRRGGIAHGAVEKI 70 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~-~---d~~~n~~~~~gt~~--~~~--~~~~--~~~~~~~~~~l~~~Y~~~~l~r~iVd~~ 70 (456) |+.|.+-. .+..++...+ . -+|-....=+++.. +-. |... +|...+++..|..+++.|..-..++..- T Consensus 1 m~~~~~~~--~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~~~~~h~~~i~~k 78 (346) T protein:vir:10 1 MKKQLRKN--LTQNDRLQPQAQTEIFSFGDPIPVLDRADILNYLECSAMYEKWYNPPMSFDGLAKSLRSSTHHESAIITK 78 (346) T ss_pred CCcccCCC--CCcccccccccCeEEEecCCcceecCchhHHHHHHHhhcCCceEecCCCHHHHHHHHHhhhhcchhhhhh Confidence 99886431 1111110000 0 00101101111111 111 1112 3456778888888888887655555443 Q ss_pred hhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCcee Q lcl|NC_016762. 71 VTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGLA 150 (456) Q Consensus 71 aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l~ 150 (456) .....+ .+.. +. ..+.+..+++ ++..-.++|.|++.+.- ++ .+.+. T Consensus 79 ~n~l~~-l~~~-----Pn-------------~~~t~~~f~~----~~~d~ll~Gnay~~i~r-~~----------~G~~~ 124 (346) T protein:vir:10 79 ANILLS-TCEV-----DS-------------RYLSRRDLSS----FVKDYLVFGNAYFEVVR-NR----------LGQVQ 124 (346) T ss_pred hhhHHH-HHhC-----CC-------------CCCCHHHHHH----HHHHHHhcCCeEEEEEE-cC----------CCcEE Confidence 322211 1110 00 0111222222 22223467888776543 21 11233 Q ss_pred EEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHHHHHH Q lcl|NC_016762. 151 KVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNSFISL 226 (456) Q Consensus 151 ~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~l~~~ 226 (456) .+.|+-...+.+.. +.| .| .|.+. ..+| ..+.+-++.||+|.... ..|.|.+..+...+.-. T Consensus 125 ~L~pl~~~~v~~~~-~~~----~~----~~~~~--~~~g---~~~~~~~~dIih~r~~~~~~~~~G~~~~~~a~~si~l~ 190 (346) T protein:vir:10 125 RIESPLAKYVRKGL-EAG----QF----YYVPQ--RFDH---QEHEFAKGSIYHLLEPDINQDIYGLPQYLSALQSAWLN 190 (346) T ss_pred EEEEecCCceEEEE-cCC----eE----EEEEE--ccCC---eEEEEecccEEEecCCCCCCCeeeccHHHHHHHHHHHH Confidence 34444333332211 111 11 12221 1122 12445566777775433 35899888877765543 Q ss_pred HHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHh--cCCCe-EEecCC---CceeEEec Q lcl|NC_016762. 227 EKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLN--RGNDV-LLPTQG---ATVTQMVS 300 (456) Q Consensus 227 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~-~lid~~---d~~~~~~~ 300 (456) .. +......+|++....-.+.. +++ +.-.++..+++.+.++... .|.+. +++..+ +.++.... T Consensus 191 ~~-a~~~~~~~~~NG~~~~~il~-----~~d-----~~l~~e~~~~i~~~~~~~~g~~n~~~~~vl~~~~~~~gi~~~pi 259 (346) T protein:vir:10 191 ES-ATLFRRKYFLNGAHAGFVFY-----MSD-----ASQKQEDVENIRQQLKQSKGVGNFKNLFVHAPNGKKDGIQIIPI 259 (346) T ss_pred HH-HHHHHHHHHhccCCCceEEE-----eCC-----CCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceeEEec Confidence 33 33334445665433222111 110 0111333444444444432 22233 333322 23444444 Q ss_pred ccCCHH----HHHHHHHHHHHhhhcCCeEEeeccCCC---cccchH-HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcC Q lcl|NC_016762. 301 AVSDPG----PTYNVNLQTAAAGVDIPTKILVGMQTG---ERASSE-DQKYHNARCQARRVQELTFEINDLFAHLMRIGV 372 (456) Q Consensus 301 ~~sgl~----~~~~~~~~~~aaas~IP~t~L~G~sp~---Glnst~-D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~ 372 (456) +.+.-+ ++-....++||++-+||-. |+|..++ +++..+ -.+.||. +.|.|.++++-++. .. + T Consensus 260 s~~~~d~qf~e~k~~~~~~I~~af~VPp~-llG~~~~~~~~~s~~e~~~~~f~~-------~~l~P~~~~iee~n-~~-L 329 (346) T protein:vir:10 260 ADVSAKDEFFNIKNVSRDDVLAAHRVPPQ-LMGIIPNNTGGFGNVADAAEVFFI-------TEIEPLQERLKEFN-QW-L 329 (346) T ss_pred CCChhHHHHHHHHHHhHHHHHHHhCCCHH-HhcccCCCCCCcccHHHHHHHHHH-------HHHHHHHHHHHHHH-hh-c Confidence 444332 3344567789999999996 6787554 444434 3455653 35777777765432 11 1 Q ss_pred cCCCCceEEEeCCCCCCCHHH Q lcl|NC_016762. 373 VPLKAEFTAIWDDLTVPTKAE 393 (456) Q Consensus 373 ~~~~~d~~~~f~pL~~~seke 393 (456) + .+ .|+|+|=--+.-+| T Consensus 330 ~---~e-~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 330 G---QE-VIKFKPSKLLQRTQ 346 (346) T ss_pred c---cc-eeeechhhhcccCC Confidence 2 12 25566544444444 No 226 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=94.12 E-value=0.0053 Score=33.04 Aligned_cols=409 Identities=11% Similarity=0.052 Sum_probs=169.0 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhh-hccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHH----- Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQG-IGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTC----- 74 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~-~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~----- 74 (456) |+-|.+...-..-+.........+.... -.++...... .-+- ..+...| +..+..+|+..|..+ T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~~~~--~~~~------~~~~~~~--dstg~~a~~~LAa~l~~~lt 70 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVECSELTLPYLIDDDISS--RPNH------KSLTVPW--QSVGAKCCVTLAAKLMLAVL 70 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCCCCC--Cccc------ccccccc--cchHHHHHHHHHHHHHHhhc Confidence 8877665322222222122222222221 1122111000 0000 0001111 122233333333333 Q ss_pred --hhCCCEEecCCCcch-----------hhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCcccc Q lcl|NC_016762. 75 --WKTNPQVIEGDDQDR-----------SKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRP 141 (456) Q Consensus 75 --tR~~~~i~~~~~~d~-----------~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~P 141 (456) .|.||++.-.+.... .+--...++.+.+.+.+-++...+.++.+.-..+|-+++++.- |+-. .-| T Consensus 71 pp~~~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~-~~~~-~~p 148 (522) T protein:vir:10 71 PPQTSFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIFMGK-DGLK-TFP 148 (522) T ss_pred CCCCccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEEEcC-CCce-EEE Confidence 478999853332111 1112334556777788899999999999988888888776643 2211 235 Q ss_pred cc------CCcCceeEEEEeccccCChhhhhc----ccccc---ccCCc----eeEEEeecccCCccccceeeehh---- Q lcl|NC_016762. 142 AR------GKLNGLAKVTPAWAGCLKPKSFDE----KPDSE---TYGQP----TMWEYTEASQAGRPGLVRDIHPD---- 200 (456) Q Consensus 142 l~------~~~~~l~~i~~~~~~~~~~~~~~~----Dp~s~---~yg~P----~~y~i~~~~~~g~~~~~~~IH~S---- 200 (456) +. ...+.+. .++.+..+++..+.. +-.+. +=++| +.|+.-.-..+ . ....+|++ T Consensus 149 l~~y~v~~d~~G~vd--~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~p~~~--~-~~~~~~~~~~~~ 223 (522) T protein:vir:10 149 LTRYVINRDGDGNVL--EIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVKLDKS--S-GRWVWHQEAFDK 223 (522) T ss_pred cceEEEeeCCCCCee--EEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEEeecc--C-CceEEEEccCCc Confidence 52 2333332 234444444333221 10000 00011 11211100000 0 01122221 Q ss_pred --------------hh--heec--CCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhh Q lcl|NC_016762. 201 --------------RV--FILG--DWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTY 262 (456) Q Consensus 201 --------------Rl--i~~~--~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~ 262 (456) =+ .||. .+..+|.|-.+..+..+..++...........++....+ +.. T Consensus 224 ~~~~~~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~------------lv~-- 289 (522) T protein:vir:10 224 IIPDSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVF------------LVS-- 289 (522) T ss_pred cccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCce------------eec-- Confidence 11 1221 334579998888888888777665554443333211111 000 Q ss_pred cCCHHHHHHHHHHHHHHH-hcCCCeEEecCCCceeEEec----ccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccc Q lcl|NC_016762. 263 GVTLDALNERFNEAARQL-NRGNDVLLPTQGATVTQMVS----AVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERAS 337 (456) Q Consensus 263 ~~~~~~~~~~~~~~~~~~-~~~~~~~lid~~d~~~~~~~----~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glns 337 (456) .+...+. ..+ ..+.+..+.+..++...+.. .|......++...+.|.-+.= .+..+.+..+.+ T Consensus 290 ---~~~~~~~-----~~l~~~~~~~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl----~~~~~d~~rvTA 357 (522) T protein:vir:10 290 ---PSSTTKP-----ATIAKAGNGAIVQGRPEDVAVIQVGKTADFSTAANMATAIEKRLLEAFL----VMNVRNAERVTA 357 (522) T ss_pred ---ccccccc-----ccccCCCCcceecCCCccceeecccccccchHHHHHHHHHHHHHHHHHh----hccCCCCCCCCH Confidence 0011111 112 23334344444455555443 345556666777777776541 233345555677 Q ss_pred hH------HH-HHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCC-ce----EEEeCCCCCCCHHHHHHHHHHHHHHH Q lcl|NC_016762. 338 SE------DQ-KYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKA-EF----TAIWDDLTVPTKAERLANSKTMSEIN 405 (456) Q Consensus 338 t~------D~-~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~-d~----~~~f~pL~~~seke~Aei~~~~A~a~ 405 (456) ++ +. ...--.++..|...|.|.|++.+.+|.+.+..|+++ +. .++ .+..+.-.++++.-..-++.. T Consensus 358 tEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~~v~--~is~Laraq~~~~l~~~~~~i 435 (522) T protein:vir:10 358 EEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTLLVLQRSNQIPKLPKDIVRPTIVA--GVNALGRGQDRESLTAFVGTI 435 (522) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCcccccccccc--chhHHHHHHHHHHHHHHHHHH Confidence 64 22 335555677788899999999999999998875443 22 122 222222333322211112211 Q ss_pred HHHHHcCC----cCcCHHHH-HHHhcccCCC-CCCCCcccC-CCCC--------------------CCCCcCCCCCCC Q lcl|NC_016762. 406 SAAIGTGE----PVFTAEEI-REEAGYDPLQ-GGDPLPDTE-PEDE--------------------DAARTDPTGEQQ 456 (456) Q Consensus 406 ~~~~~~g~----~~i~~~E~-R~~~~~~~~~-~~~~~~~~~-~~d~--------------------~~~~~d~~~~~e 456 (456) .. -.|- ..|+.+++ +..+..-+.+ ..+.-.+++ .+.. ..+-.+|+...+ T Consensus 436 ~~--~~~p~~~~~~id~d~~~~~~a~~~Gvp~~~ivrt~eev~~~~q~~q~~~~~~~~~~~a~~~~~~~~~~~~~~~~ 511 (522) T protein:vir:10 436 AQ--TLGPEALMQYLNPLEAIKRLAAAQGIDVLNLVKTEQQLAEEQQAAQQQAAQQSLVDQAGQMTGSPLMDPTKNPQ 511 (522) T ss_pred HH--hhCchhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCccccHH Confidence 11 1121 13455543 3333333322 111101000 0000 000001110000 No 227 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=93.07 E-value=0.0089 Score=31.83 Aligned_cols=415 Identities=9% Similarity=-0.025 Sum_probs=167.1 Q ss_pred CCchhHH--HHhH--HHHHHHHH---HHHHHhhhh---hccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccc Q lcl|NC_016762. 1 MTDKLDL--AVNH--AMSSAIAR---ARMSLLNQG---IGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKI 70 (456) Q Consensus 1 ~~~~~~~--~~~~--a~~~~~~~---~~d~~~n~~---~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~ 70 (456) |-++... ++.+ .++..+.. ....+.... -+.-...+.....-... ..| .+.+...|+.. T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~---------~~~--dst~~~a~~~L 69 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHN---------NIL--DNTGTRALRVL 69 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhccc---------ccc--cccHHHHHHHH Confidence 7665542 3221 22222111 111111111 00001111100000011 111 12233333333 Q ss_pred hhHH-------hhCCCEEecCCCcc-h----hhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCc Q lcl|NC_016762. 71 VTTC-------WKTNPQVIEGDDQD-R----SKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPW 138 (456) Q Consensus 71 aed~-------tR~~~~i~~~~~~d-~----~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~ 138 (456) |..+ .|.||++.-.+.+. + .+--...++.+.+.+.+-++...+.++.+.-..+|-+.+++.-+.++.+ T Consensus 70 Aa~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~ 149 (555) T protein:vir:10 70 AAGMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVV 149 (555) T ss_pred HHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceE Confidence 3333 47899986543321 1 1112234556777788889999999999888888988887654322222 Q ss_pred ---ccccc-----CCcCceeEEEEeccccCChhh-----------------hhccccccccCCceeEEEeecccCCcc-- Q lcl|NC_016762. 139 ---DRPAR-----GKLNGLAKVTPAWAGCLKPKS-----------------FDEKPDSETYGQPTMWEYTEASQAGRP-- 191 (456) Q Consensus 139 ---~~Pl~-----~~~~~l~~i~~~~~~~~~~~~-----------------~~~Dp~s~~yg~P~~y~i~~~~~~g~~-- 191 (456) .-|+. ....|.+ ..++.+..+++.+ ++.+|...++ +.|+.-....+..+ T Consensus 150 rf~~~pl~~~~v~~d~~G~v-d~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v---~v~~~V~pr~~~~~~~ 225 (555) T protein:vir:10 150 YHHSLTAGEYAIAADNQGRV-NTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWV---TVIHAIEPRADRDPSK 225 (555) T ss_pred EEEEeecceeEEeeCCCCCE-EEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceE---EEEEEEeeccCcCcCC Confidence 23552 1222322 2333333333322 1222211111 11111000000000 Q ss_pred -------ccceeee----hhhhh-------------ee--cCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_016762. 192 -------GLVRDIH----PDRVF-------------IL--GDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQL 245 (456) Q Consensus 192 -------~~~~~IH----~SRli-------------~~--~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l 245 (456) -.++.+| ..+++ +| ..+..+|.|-.+.++..+..++.......+...+.....+ T Consensus 226 ~~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~ 305 (555) T protein:vir:10 226 RDDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPL 305 (555) T ss_pred CCccccceEEEEEEeccCCccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 0001111 11221 22 1334579999999998888877665544333322111100 Q ss_pred hhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCc-ee---EEecccCCHHHHHHHHHHHHHhhhc Q lcl|NC_016762. 246 LLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGAT-VT---QMVSAVSDPGPTYNVNLQTAAAGVD 321 (456) Q Consensus 246 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~-~~---~~~~~~sgl~~~~~~~~~~~aaas~ 321 (456) . +.+ +.....+ +....+.+-.......+ .. +....|+-+...++...+.|..+.= T Consensus 306 ~-----------v~~------~~~~~~~----~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~ 364 (555) T protein:vir:10 306 Q-----------LPV------SAKNQDI----STVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFY 364 (555) T ss_pred e-----------ecc------ccccccc----eeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhh Confidence 0 000 0000001 11111122111122222 21 1223455555666667777766552 Q ss_pred CCeEEeec-cCCCcccchH------HH-HHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCC------ceEEEeC-CC Q lcl|NC_016762. 322 IPTKILVG-MQTGERASSE------DQ-KYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKA------EFTAIWD-DL 386 (456) Q Consensus 322 IP~t~L~G-~sp~Glnst~------D~-~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~------d~~~~f~-pL 386 (456) -.+-..++ ....-++++| +. ...--.+...|...|.|.|++.+.+|.+.+..|+++ +++++|- || T Consensus 365 ~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~L 444 (555) T protein:vir:10 365 ADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSML 444 (555) T ss_pred cchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHH Confidence 11101122 2223356664 22 334555666777889999999999999998876543 3555543 33 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHcCC---cCcCHHHH-HHHhcccCCCCCCCCcccCCCCCCCCCc----CCCCCCC Q lcl|NC_016762. 387 TVPTKAERLANSKTMSEINSAAIGTGE---PVFTAEEI-REEAGYDPLQGGDPLPDTEPEDEDAART----DPTGEQQ 456 (456) Q Consensus 387 ~~~seke~Aei~~~~A~a~~~~~~~g~---~~i~~~E~-R~~~~~~~~~~~~~~~~~~~~d~~~~~~----d~~~~~e 456 (456) -+.-..+.+.......+......+.+- ..|+.+++ +.....-+.+...-- .+++.+.- ...++.+ T Consensus 445 a~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~ir-----s~eev~~~r~qr~~~~q~~ 517 (555) T protein:vir:10 445 AQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIV-----PGNQVALIRKQRADQQQAA 517 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccC-----CHHHHHHHHHHHHHHHHHH Confidence 332222222222223333333333331 13566655 333333333322111 11111100 0000000 No 228 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=93.07 E-value=0.0089 Score=31.83 Aligned_cols=415 Identities=9% Similarity=-0.025 Sum_probs=167.1 Q ss_pred CCchhHH--HHhH--HHHHHHHH---HHHHHhhhh---hccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccc Q lcl|NC_016762. 1 MTDKLDL--AVNH--AMSSAIAR---ARMSLLNQG---IGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKI 70 (456) Q Consensus 1 ~~~~~~~--~~~~--a~~~~~~~---~~d~~~n~~---~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~ 70 (456) |-++... ++.+ .++..+.. ....+.... -+.-...+.....-... ..| .+.+...|+.. T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~---------~~~--dst~~~a~~~L 69 (555) T protein:vir:98 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHN---------NIL--DNTGTRALRVL 69 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhccc---------ccc--cccHHHHHHHH Confidence 7665542 3221 22222111 111111111 00001111100000011 111 12233333333 Q ss_pred hhHH-------hhCCCEEecCCCcc-h----hhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCc Q lcl|NC_016762. 71 VTTC-------WKTNPQVIEGDDQD-R----SKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPW 138 (456) Q Consensus 71 aed~-------tR~~~~i~~~~~~d-~----~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~ 138 (456) |..+ .|.||++.-.+.+. + .+--...++.+.+.+.+-++...+.++.+.-..+|-+.+++.-+.++.+ T Consensus 70 Aa~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~ 149 (555) T protein:vir:98 70 AAGMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVV 149 (555) T ss_pred HHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceE Confidence 3333 47899986543321 1 1112234556777788889999999999888888988887654322222 Q ss_pred ---ccccc-----CCcCceeEEEEeccccCChhh-----------------hhccccccccCCceeEEEeecccCCcc-- Q lcl|NC_016762. 139 ---DRPAR-----GKLNGLAKVTPAWAGCLKPKS-----------------FDEKPDSETYGQPTMWEYTEASQAGRP-- 191 (456) Q Consensus 139 ---~~Pl~-----~~~~~l~~i~~~~~~~~~~~~-----------------~~~Dp~s~~yg~P~~y~i~~~~~~g~~-- 191 (456) .-|+. ....|.+ ..++.+..+++.+ ++.+|...++ +.|+.-....+..+ T Consensus 150 rf~~~pl~~~~v~~d~~G~v-d~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v---~v~~~V~pr~~~~~~~ 225 (555) T protein:vir:98 150 YHHSLTAGEYAIAADNQGRV-NTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWV---TVIHAIEPRADRDPSK 225 (555) T ss_pred EEEEeecceeEEeeCCCCCE-EEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceE---EEEEEEeeccCcCcCC Confidence 23552 1222322 2333333333322 1222211111 11111000000000 Q ss_pred -------ccceeee----hhhhh-------------ee--cCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_016762. 192 -------GLVRDIH----PDRVF-------------IL--GDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQL 245 (456) Q Consensus 192 -------~~~~~IH----~SRli-------------~~--~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l 245 (456) -.++.+| ..+++ +| ..+..+|.|-.+.++..+..++.......+...+.....+ T Consensus 226 ~~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~ 305 (555) T protein:vir:98 226 RDDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPL 305 (555) T ss_pred CCccccceEEEEEEeccCCccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 0001111 11221 22 1334579999999998888877665544333322111100 Q ss_pred hhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCc-ee---EEecccCCHHHHHHHHHHHHHhhhc Q lcl|NC_016762. 246 LLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGAT-VT---QMVSAVSDPGPTYNVNLQTAAAGVD 321 (456) Q Consensus 246 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~-~~---~~~~~~sgl~~~~~~~~~~~aaas~ 321 (456) . +.+ +.....+ +....+.+-.......+ .. +....|+-+...++...+.|..+.= T Consensus 306 ~-----------v~~------~~~~~~~----~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~ 364 (555) T protein:vir:98 306 Q-----------LPV------SAKNQDI----STVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFY 364 (555) T ss_pred e-----------ecc------ccccccc----eeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhh Confidence 0 000 0000001 11111122111122222 21 1223455555666667777766552 Q ss_pred CCeEEeec-cCCCcccchH------HH-HHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCC------ceEEEeC-CC Q lcl|NC_016762. 322 IPTKILVG-MQTGERASSE------DQ-KYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKA------EFTAIWD-DL 386 (456) Q Consensus 322 IP~t~L~G-~sp~Glnst~------D~-~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~------d~~~~f~-pL 386 (456) -.+-..++ ....-++++| +. ...--.+...|...|.|.|++.+.+|.+.+..|+++ +++++|- || T Consensus 365 ~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~L 444 (555) T protein:vir:98 365 ADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSML 444 (555) T ss_pred cchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHH Confidence 11101122 2223356664 22 334555666777889999999999999998876543 3555543 33 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHcCC---cCcCHHHH-HHHhcccCCCCCCCCcccCCCCCCCCCc----CCCCCCC Q lcl|NC_016762. 387 TVPTKAERLANSKTMSEINSAAIGTGE---PVFTAEEI-REEAGYDPLQGGDPLPDTEPEDEDAART----DPTGEQQ 456 (456) Q Consensus 387 ~~~seke~Aei~~~~A~a~~~~~~~g~---~~i~~~E~-R~~~~~~~~~~~~~~~~~~~~d~~~~~~----d~~~~~e 456 (456) -+.-..+.+.......+......+.+- ..|+.+++ +.....-+.+...-- .+++.+.- ...++.+ T Consensus 445 a~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~ir-----s~eev~~~r~qr~~~~q~~ 517 (555) T protein:vir:98 445 AQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIV-----PGNQVALIRKQRADQQQAA 517 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccC-----CHHHHHHHHHHHHHHHHHH Confidence 332222222222223333333333331 13566655 333333333322111 11111100 0000000 No 229 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=93.07 E-value=0.0089 Score=31.83 Aligned_cols=415 Identities=9% Similarity=-0.025 Sum_probs=167.1 Q ss_pred CCchhHH--HHhH--HHHHHHHH---HHHHHhhhh---hccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccc Q lcl|NC_016762. 1 MTDKLDL--AVNH--AMSSAIAR---ARMSLLNQG---IGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKI 70 (456) Q Consensus 1 ~~~~~~~--~~~~--a~~~~~~~---~~d~~~n~~---~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~ 70 (456) |-++... ++.+ .++..+.. ....+.... -+.-...+.....-... ..| .+.+...|+.. T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~---------~~~--dst~~~a~~~L 69 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHN---------NIL--DNTGTRALRVL 69 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhccc---------ccc--cccHHHHHHHH Confidence 7665542 3221 22222111 111111111 00001111100000011 111 12233333333 Q ss_pred hhHH-------hhCCCEEecCCCcc-h----hhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCc Q lcl|NC_016762. 71 VTTC-------WKTNPQVIEGDDQD-R----SKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPW 138 (456) Q Consensus 71 aed~-------tR~~~~i~~~~~~d-~----~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~ 138 (456) |..+ .|.||++.-.+.+. + .+--...++.+.+.+.+-++...+.++.+.-..+|-+.+++.-+.++.+ T Consensus 70 Aa~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~ 149 (555) T protein:vir:10 70 AAGMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVV 149 (555) T ss_pred HHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceE Confidence 3333 47899986543321 1 1112234556777788889999999999888888988887654322222 Q ss_pred ---ccccc-----CCcCceeEEEEeccccCChhh-----------------hhccccccccCCceeEEEeecccCCcc-- Q lcl|NC_016762. 139 ---DRPAR-----GKLNGLAKVTPAWAGCLKPKS-----------------FDEKPDSETYGQPTMWEYTEASQAGRP-- 191 (456) Q Consensus 139 ---~~Pl~-----~~~~~l~~i~~~~~~~~~~~~-----------------~~~Dp~s~~yg~P~~y~i~~~~~~g~~-- 191 (456) .-|+. ....|.+ ..++.+..+++.+ ++.+|...++ +.|+.-....+..+ T Consensus 150 rf~~~pl~~~~v~~d~~G~v-d~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v---~v~~~V~pr~~~~~~~ 225 (555) T protein:vir:10 150 YHHSLTAGEYAIAADNQGRV-NTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWV---TVIHAIEPRADRDPSK 225 (555) T ss_pred EEEEeecceeEEeeCCCCCE-EEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceE---EEEEEEeeccCcCcCC Confidence 23552 1222322 2333333333322 1222211111 11111000000000 Q ss_pred -------ccceeee----hhhhh-------------ee--cCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_016762. 192 -------GLVRDIH----PDRVF-------------IL--GDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQL 245 (456) Q Consensus 192 -------~~~~~IH----~SRli-------------~~--~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l 245 (456) -.++.+| ..+++ +| ..+..+|.|-.+.++..+..++.......+...+.....+ T Consensus 226 ~~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~ 305 (555) T protein:vir:10 226 RDDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPL 305 (555) T ss_pred CCccccceEEEEEEeccCCccccccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 0001111 11221 22 1334579999999998888877665544333322111100 Q ss_pred hhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCc-ee---EEecccCCHHHHHHHHHHHHHhhhc Q lcl|NC_016762. 246 LLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGAT-VT---QMVSAVSDPGPTYNVNLQTAAAGVD 321 (456) Q Consensus 246 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~-~~---~~~~~~sgl~~~~~~~~~~~aaas~ 321 (456) . +.+ +.....+ +....+.+-.......+ .. +....|+-+...++...+.|..+.= T Consensus 306 ~-----------v~~------~~~~~~~----~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~ 364 (555) T protein:vir:10 306 Q-----------LPV------SAKNQDI----STVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFY 364 (555) T ss_pred e-----------ecc------ccccccc----eeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhh Confidence 0 000 0000001 11111122111122222 21 1223455555666667777766552 Q ss_pred CCeEEeec-cCCCcccchH------HH-HHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCC------ceEEEeC-CC Q lcl|NC_016762. 322 IPTKILVG-MQTGERASSE------DQ-KYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKA------EFTAIWD-DL 386 (456) Q Consensus 322 IP~t~L~G-~sp~Glnst~------D~-~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~------d~~~~f~-pL 386 (456) -.+-..++ ....-++++| +. ...--.+...|...|.|.|++.+.+|.+.+..|+++ +++++|- || T Consensus 365 ~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~L 444 (555) T protein:vir:10 365 ADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSML 444 (555) T ss_pred cchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHH Confidence 11101122 2223356664 22 334555666777889999999999999998876543 3555543 33 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHcCC---cCcCHHHH-HHHhcccCCCCCCCCcccCCCCCCCCCc----CCCCCCC Q lcl|NC_016762. 387 TVPTKAERLANSKTMSEINSAAIGTGE---PVFTAEEI-REEAGYDPLQGGDPLPDTEPEDEDAART----DPTGEQQ 456 (456) Q Consensus 387 ~~~seke~Aei~~~~A~a~~~~~~~g~---~~i~~~E~-R~~~~~~~~~~~~~~~~~~~~d~~~~~~----d~~~~~e 456 (456) -+.-..+.+.......+......+.+- ..|+.+++ +.....-+.+...-- .+++.+.- ...++.+ T Consensus 445 a~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~ir-----s~eev~~~r~qr~~~~q~~ 517 (555) T protein:vir:10 445 AQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIV-----PGNQVALIRKQRADQQQAA 517 (555) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccC-----CHHHHHHHHHHHHHHHHHH Confidence 332222222222223333333333331 13566655 333333333322111 11111100 0000000 No 230 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=92.39 E-value=0.012 Score=31.19 Aligned_cols=410 Identities=9% Similarity=0.014 Sum_probs=172.4 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhh-ccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHH----- Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGI-GHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTC----- 74 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~-~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~----- 74 (456) |-.|++..-+..-++.....+..+..... .+.+..... +. ..+...| +..+...|+..|.-+ T Consensus 1 mk~~~~~~~~~lkR~~~e~~w~e~a~~tlP~~~~~~~~~-~~---------~~~~~~~--dstg~~a~~~LAa~l~~~lt 68 (510) T protein:vir:63 1 MKTTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPMSG-SR---------GVVEHDF--QSAGALLVNNLAAKLARSLF 68 (510) T ss_pred ChhHHHHHHHHHhccchHHHHHHHHHhhccccCCCCCCc-cc---------cccCCCc--cchHHHHHHHHHHHHHhhhc Confidence 77777666553322222222222222111 111100000 00 0011111 122333334333333 Q ss_pred --hhCCCEEecCCCcc------------hhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCc-c Q lcl|NC_016762. 75 --WKTNPQVIEGDDQD------------RSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPW-D 139 (456) Q Consensus 75 --tR~~~~i~~~~~~d------------~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~-~ 139 (456) .|.||++.-+++.. -.+--...|+.+.+.+.+-++...+.++.+.-..+|-+.+++. .|+..+ . T Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~~~-~~~~~~~~ 147 (510) T protein:vir:63 69 PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRD-SDAATVVA 147 (510) T ss_pred CCCCcccccCCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEc-CCCcEEEE Confidence 46899885333211 1112223455677788888999999999998778887766654 344322 2 Q ss_pred cccc------CCcCceeEEEEeccccCChhhhhc--------ccccc-ccCCceeEEEeecccCCccccceeeehh---- Q lcl|NC_016762. 140 RPAR------GKLNGLAKVTPAWAGCLKPKSFDE--------KPDSE-TYGQPTMWEYTEASQAGRPGLVRDIHPD---- 200 (456) Q Consensus 140 ~Pl~------~~~~~l~~i~~~~~~~~~~~~~~~--------Dp~s~-~yg~P~~y~i~~~~~~g~~~~~~~IH~S---- 200 (456) -|+. ...+.+.. .+.+..+++.++.. ++... .+.+.+.|+.-.-. ++.......||.+ T Consensus 148 ~pl~~y~v~~d~~G~vd~--i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~V~~~-~~~~~~~~sv~~e~dg~ 224 (510) T protein:vir:63 148 WSLRSYAVRRDATGRWMD--IVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTHVQRK-KGTAMEYAELYHEIDGV 224 (510) T ss_pred EEcceeEEeeCCCcCeeE--EEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEEEEee-cCCCceEEEEEEEecCc Confidence 3552 23333322 33333344322111 11111 12223333222111 1110011112211 Q ss_pred hhh---------------ee--cCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhc Q lcl|NC_016762. 201 RVF---------------IL--GDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYG 263 (456) Q Consensus 201 Rli---------------~~--~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~ 263 (456) ++. || ..+..+|.|-.+.++..+..++..........-++ ...-+ +....+ T Consensus 225 ~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a----~~~~~--------lv~p~g 292 (510) T protein:vir:63 225 RVGKEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELES----LEVLN--------LVDEAK 292 (510) T ss_pred eeccccccccccCceeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHh----ccCCc--------ccCccc Confidence 111 11 12345799988888888877766654433322211 11100 111001 Q ss_pred CCHHHHHHHHHHHHHHHhcCC-CeEEecCCCceeEEecc----cCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCc-ccc Q lcl|NC_016762. 264 VTLDALNERFNEAARQLNRGN-DVLLPTQGATVTQMVSA----VSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGE-RAS 337 (456) Q Consensus 264 ~~~~~~~~~~~~~~~~~~~~~-~~~lid~~d~~~~~~~~----~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~G-lns 337 (456) .. ....+..+- +..+-+..+++..++.. |.-....++...+.|..+. .+. |. +..++ +++ T Consensus 293 -----~~-----~~~~~~~~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af--~~~-l~-~~~~~rvTA 358 (510) T protein:vir:63 293 -----GA-----VVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF--MYG-AN-QRDAERVTA 358 (510) T ss_pred -----cc-----chhhhccCCCceeecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHH--Hhh-cc-cCCCCCcCH Confidence 01 111223333 33333334555555433 4445577777777777774 233 33 44444 677 Q ss_pred hH------H-HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCc-eEEEe-CCCCCCCHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 338 SE------D-QKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAE-FTAIW-DDLTVPTKAERLANSKTMSEINSAA 408 (456) Q Consensus 338 t~------D-~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d-~~~~f-~pL~~~seke~Aei~~~~A~a~~~~ 408 (456) +| + ....--.++..|...|.|.+++.+.+|.+.++.|+|++ +.... ..+..+...++++- ...+.+.. T Consensus 359 tEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p~p~~~~~~~~v~~is~Laraq~~~~---l~~~~q~l 435 (510) T protein:vir:63 359 EEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQS---MLNASQVI 435 (510) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCchhcccceecchhHHHHHHHHHH---HHHHHHHH Confidence 74 2 23456667778888999999999999998876655543 21110 01222233333222 22222222 Q ss_pred HHcC-----CcCcCHHHHHH-HhcccCC-CCCCCCcccCCCCCCCCCcCC-CCCCC Q lcl|NC_016762. 409 IGTG-----EPVFTAEEIRE-EAGYDPL-QGGDPLPDTEPEDEDAARTDP-TGEQQ 456 (456) Q Consensus 409 ~~~g-----~~~i~~~E~R~-~~~~~~~-~~~~~~~~~~~~d~~~~~~d~-~~~~e 456 (456) ...+ .+.|+.+++-+ .+..-+. .....-.+++-+.+ ..+... .++++ T Consensus 436 ~~~~~~aq~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~-~~~~~qq~~~~~ 490 (510) T protein:vir:63 436 AGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAE-AEQQRQQAAQAQ 490 (510) T ss_pred HHhcCchhhhccCCHHHHHHHHHHHhCCChhHhcCCHHHHHHH-HHHHHHHHHHHH Confidence 2222 23466665532 3322232 11111111100000 000000 00000 No 231 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=91.42 E-value=0.016 Score=30.44 Aligned_cols=410 Identities=8% Similarity=-0.029 Sum_probs=175.3 Q ss_pred CCchhH------HHHh--HHHHHHH---HHHHHHHhhhh-hccCc--------ccchhhhhccCcccCCHHHHHHHHhcC Q lcl|NC_016762. 1 MTDKLD------LAVN--HAMSSAI---ARARMSLLNQG-IGHDA--------KRPQAWCEYGFPQEITFNDLYTMYRRG 60 (456) Q Consensus 1 ~~~~~~------~~~~--~a~~~~~---~~~~d~~~n~~-~~~gt--------~~~~~~~~~~~~~~~~~~~l~~~Y~~~ 60 (456) |.+-++ .|+. ..++..+ ......+.... -.+.+ .+.+.|..+ -.++..-. .+ T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst-------~~~a~~~L-aa 72 (535) T protein:vir:33 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAV-------GARGLNNL-AS 72 (535) T ss_pred CChhhhhccChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccccccccccccc-------HHHHHHHH-HH Confidence 665442 1211 1111111 11112222211 01100 001111111 11222222 24 Q ss_pred chhhhhhccchhHHhhCCCEEecCCCcc------------hhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEE Q lcl|NC_016762. 61 GIAHGAVEKIVTTCWKTNPQVIEGDDQD------------RSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGL 128 (456) Q Consensus 61 ~l~r~iVd~~aed~tR~~~~i~~~~~~d------------~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i 128 (456) +|...+. |+ |.||++.-.+..- ..+--...++.+...+.+-++...+.++.+.-..+|-+++ T Consensus 73 ~l~~~lt--P~----~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l 146 (535) T protein:vir:33 73 KLMLALF--PM----QSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALL 146 (535) T ss_pred HHHHhhc--CC----CcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeE Confidence 4554443 54 5799985333110 1112234466778888899999999999988888898877 Q ss_pred EEEecCCCC---cccccc------CCcCceeEEEEeccccCChhhhh----ccc-----cccccCCceeEEEeecccCCc Q lcl|NC_016762. 129 LLHIRDSQP---WDRPAR------GKLNGLAKVTPAWAGCLKPKSFD----EKP-----DSETYGQPTMWEYTEASQAGR 190 (456) Q Consensus 129 ~i~i~D~~~---~~~Pl~------~~~~~l~~i~~~~~~~~~~~~~~----~Dp-----~s~~yg~P~~y~i~~~~~~g~ 190 (456) ++.-..++. -.-|+. ...+.+. .++.+..+++.+.. .+- ....+-.++.|..-- .... T Consensus 147 ~~~~~~~~~~~f~~~pl~~~~v~~d~~G~vd--~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~--~~~~ 222 (535) T protein:vir:33 147 YLPEPEGSYNPMKLYRLSSYVVQRDAYGNVL--QIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVY--LDEE 222 (535) T ss_pred EeecCCCCceeeEEEEcCeeEEeeCCCCCee--EEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEEEE--eeCC Confidence 765432221 123552 2233232 23344444433221 110 011122333443210 1111 Q ss_pred cccceeeehhh------------------h--hee--cCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh Q lcl|NC_016762. 191 PGLVRDIHPDR------------------V--FIL--GDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLN 248 (456) Q Consensus 191 ~~~~~~IH~SR------------------l--i~~--~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~ 248 (456) .+ .+.+|++. + .+| ..+..+|.|-.+..+..+..++...........++....+ T Consensus 223 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~--- 298 (535) T protein:vir:33 223 SG-DYLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIG--- 298 (535) T ss_pred CC-cEEEEEEEeCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce--- Confidence 00 11222211 1 111 1234579998888888888877776555444433211110 Q ss_pred hhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC-CCeEEecCCCceeEEec----ccCCHHHHHHHHHHHHHhhhcCC Q lcl|NC_016762. 249 FDKEINLGEIASTYGVTLDALNERFNEAARQLNRG-NDVLLPTQGATVTQMVS----AVSDPGPTYNVNLQTAAAGVDIP 323 (456) Q Consensus 249 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~lid~~d~~~~~~~----~~sgl~~~~~~~~~~~aaas~IP 323 (456) +... +...+. ..+..+ .+..+-+..+++..+.. .|.-....++...+.|.-+.=+ T Consensus 299 ---------lv~~-----~g~~~~-----~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~- 358 (535) T protein:vir:33 299 ---------LVNP-----AGITQP-----RRLTKAQTGDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFML- 358 (535) T ss_pred ---------eecc-----ccccch-----hhcccCCceeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhh- Confidence 1000 011111 122232 33333344455555532 4566667777777777666411 Q ss_pred eEEeeccCCCcccchH------H-HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCC--CceEEEeC-CCCCCCHHH Q lcl|NC_016762. 324 TKILVGMQTGERASSE------D-QKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLK--AEFTAIWD-DLTVPTKAE 393 (456) Q Consensus 324 ~t~L~G~sp~Glnst~------D-~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~--~d~~~~f~-pL~~~seke 393 (456) .-|.......+.+++ + ....--.++..|...|.|.+++++.+|.+.+..|++ ..+.++|- ||.++ . T Consensus 359 -~~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~a---q 434 (535) T protein:vir:33 359 -NSAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAI---G 434 (535) T ss_pred -hhcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHH---H Confidence 112224445567774 2 245667777888999999999999999999887643 34666653 33333 2 Q ss_pred HHHHHHHHHHHHHHHHHcCC----cCcCHHHH-HHHhcccCCCCC-CC-CcccCCCCCCCC-----------------Cc Q lcl|NC_016762. 394 RLANSKTMSEINSAAIGTGE----PVFTAEEI-REEAGYDPLQGG-DP-LPDTEPEDEDAA-----------------RT 449 (456) Q Consensus 394 ~Aei~~~~A~a~~~~~~~g~----~~i~~~E~-R~~~~~~~~~~~-~~-~~~~~~~d~~~~-----------------~~ 449 (456) |..-..+.-+......+.+- ..|+.+++ +.....-+.+.. +. .+++..+.-... .+ T Consensus 435 r~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~ee~~~~~~q~~~~~~~~~~~~~~g~~~~~ 514 (535) T protein:vir:33 435 RGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGVENAAAAGGAGVGA 514 (535) T ss_pred HHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHhHhcCCHHHHHHHHHHHHHHHHHHHHHHhhhhhhcc Confidence 22221122222222223321 12555554 333333333211 11 111111000000 00 Q ss_pred CCCCCCC Q lcl|NC_016762. 450 DPTGEQQ 456 (456) Q Consensus 450 d~~~~~e 456 (456) .+....| T Consensus 515 ~~~~~~~ 521 (535) T protein:vir:33 515 LATSSPE 521 (535) T ss_pred hhhcCCh Confidence 0000000 No 232 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=90.71 E-value=0.019 Score=29.97 Aligned_cols=415 Identities=10% Similarity=0.045 Sum_probs=166.0 Q ss_pred CCchhHH--HHhHHHH--HHHHHHHHHHhhhhhccCcccchhhhhccCcccC------CHHHHHHHHhcCchhhhhhccc Q lcl|NC_016762. 1 MTDKLDL--AVNHAMS--SAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEI------TFNDLYTMYRRGGIAHGAVEKI 70 (456) Q Consensus 1 ~~~~~~~--~~~~a~~--~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~------~~~~l~~~Y~~~~l~r~iVd~~ 70 (456) |.+-++. +..-+.+ +.....|..+.+.- +-...|-.|..+ ....+...| .+.+...|+.. T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w--------~e~~~~~lP~~~~~~~~~~~~~~~~~~--dst~~~a~~~L 70 (532) T protein:vir:99 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRA--------EDCATYTIPSVFPSATADGSTSYTTPW--QSIGARGLNNL 70 (532) T ss_pred CcchhhccccHHHHHHHHHHHHHHhhHHHHHH--------HHHHHHhhhcccCCCCCcchhhccccc--cchHHHHHHHH Confidence 7774332 2211100 00000111111100 000111111111 011122222 23344445554 Q ss_pred hhHH-------hhCCCEEecCCCcc------------hhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEE Q lcl|NC_016762. 71 VTTC-------WKTNPQVIEGDDQD------------RSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLH 131 (456) Q Consensus 71 aed~-------tR~~~~i~~~~~~d------------~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~ 131 (456) |..+ .|.||++.-.+..- ..+-....|+.+.+.+.+-++...+.++.+.-..+|-+.+++. T Consensus 71 Aa~L~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~ 150 (532) T protein:vir:99 71 ASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIP 150 (532) T ss_pred HHHHHHhhcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEec Confidence 4444 47899885433211 1222234456677888889999999999998778888888775 Q ss_pred ecCCCC------cccccc------CCcCceeEEEEeccccCChhhh----hcc-cccc----ccCCceeEEEeecccCCc Q lcl|NC_016762. 132 IRDSQP------WDRPAR------GKLNGLAKVTPAWAGCLKPKSF----DEK-PDSE----TYGQPTMWEYTEASQAGR 190 (456) Q Consensus 132 i~D~~~------~~~Pl~------~~~~~l~~i~~~~~~~~~~~~~----~~D-p~s~----~yg~P~~y~i~~~~~~g~ 190 (456) ..+... -.-|+. ...+.+. .++.+..++...+ -.+ ..+. .+-.-+.|+.-.-..++. T Consensus 151 ~~~~~~~~~~~f~~~pl~~y~v~~d~~G~v~--~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~ 228 (532) T protein:vir:99 151 STEQVEGQSNAPKLYKLHNFVVERDAYDNVL--QIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTHVYRDPEAM 228 (532) T ss_pred ccccccCcccceEEEEcCeEEEeeCCCCCee--eEeeeeeecHHhcChHHHHHhhccccccCCCcceEEEEEEEecCCCC Confidence 532211 123442 2233232 2222222222111 000 0000 011112221110000111 Q ss_pred c---------------ccceeeehhhh--hee--cCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhh Q lcl|NC_016762. 191 P---------------GLVRDIHPDRV--FIL--GDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDK 251 (456) Q Consensus 191 ~---------------~~~~~IH~SRl--i~~--~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 251 (456) + ..+...|..=+ .|| ..+..+|.|-.+..+..+..++...........++ ....+ T Consensus 229 ~~~~~~~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a----~~~~~-- 302 (532) T protein:vir:99 229 VFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMIS----SKVLF-- 302 (532) T ss_pred eeEEEEeecCceecccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHH----cCCCc-- Confidence 0 00111222111 122 13345799988888888877776654433322221 11110 Q ss_pred hccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC-CCeEEecCCCceeEEec----ccCCHHHHHHHHHHHHHhhhcCCeEE Q lcl|NC_016762. 252 EINLGEIASTYGVTLDALNERFNEAARQLNRG-NDVLLPTQGATVTQMVS----AVSDPGPTYNVNLQTAAAGVDIPTKI 326 (456) Q Consensus 252 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~lid~~d~~~~~~~----~~sgl~~~~~~~~~~~aaas~IP~t~ 326 (456) +... +.... ...+..+ .+..+-+..+++..+.. .|.-....+....+.|.-+.=+ .- T Consensus 303 ------lv~p-----~g~~~-----~~~~~~~~~g~~v~g~~~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~--~~ 364 (532) T protein:vir:99 303 ------FVNP-----NGVTQ-----IRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFML--NS 364 (532) T ss_pred ------eecc-----ccccc-----hhhhccCCCcceecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhh--hh Confidence 1100 01111 1112222 23333333345544432 3555566677777777665511 11 Q ss_pred eeccCCCcccchH------H-HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCC-ce---E-EEeCCCCCCCHHHH Q lcl|NC_016762. 327 LVGMQTGERASSE------D-QKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKA-EF---T-AIWDDLTVPTKAER 394 (456) Q Consensus 327 L~G~sp~Glnst~------D-~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~-d~---~-~~f~pL~~~seke~ 394 (456) |.-.....+.+++ + ....--.++..|...|.|.|++.+.+|.+.+..|+++ +. . +++ +..+...++ T Consensus 365 ~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~iv~~--is~Laraq~ 442 (532) T protein:vir:99 365 AVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATG--LEALGRGHD 442 (532) T ss_pred cccCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhcccceeec--chHHHHHHH Confidence 2223444467764 2 2345666777888899999999999999998876432 11 1 222 333334444 Q ss_pred HHHHHHHHHHHHHHHHc-C--CcCcCHHHHH-HHhcccCC-CCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 395 LANSKTMSEINSAAIGT-G--EPVFTAEEIR-EEAGYDPL-QGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 395 Aei~~~~A~a~~~~~~~-g--~~~i~~~E~R-~~~~~~~~-~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) .+. .+.......+. + ...|+.+++- .....-+. .....-.+++....-.......+++. T Consensus 443 ~~~---l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~GV~~~~i~r~~ee~~~~~~q~~~~~~~~~ 506 (532) T protein:vir:99 443 LNK---LNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVT 506 (532) T ss_pred HHH---HHHHHHHHHhhcchhhhhCCHHHHHHHHHHHhCCChhhccCCHHHHHHHHHHHHHHHHHHH Confidence 332 22222222221 1 1245555542 22222222 11111111110000000000000000 No 233 >protein:vir:101418 Length: 569 # NCBI annotation: Prt # Family: family:all:9458 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006480;genbank:gi:46401636;genbank:GeneID:2777482 Probab=89.94 E-value=0.024 Score=29.51 Aligned_cols=421 Identities=9% Similarity=0.046 Sum_probs=163.8 Q ss_pred CCchhHHHHh--HHHH-HHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhc---Cchhhhhhccch--- Q lcl|NC_016762. 1 MTDKLDLAVN--HAMS-SAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRR---GGIAHGAVEKIV--- 71 (456) Q Consensus 1 ~~~~~~~~~~--~a~~-~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~---~~l~r~iVd~~a--- 71 (456) |+-. .+|.. +-.+ ..+..+.|+|+-...-+ .+...-| ++++|++.+|.. .+++..+..+-. T Consensus 45 ~~g~-p~~~~~~~~~~~~~~t~~~D~~~~g~~~~-------~~~~~~p--r~R~qiY~~~eeM~~~p~Ia~AlniHVtaA 114 (569) T protein:vir:10 45 RAGA-PVQLSGFLGGKPGDSGMAGDGLVDGSRFI-------FDEVQLP--EDRLQRYPLLEEMAVYSTIATALNIHITHA 114 (569) T ss_pred ecCc-chhhhhhhccCccccchhhhhHHHHHHHH-------hhhccCc--hhHHHHHHHHHHHhcCchhhhhhhhhhhee Confidence 3321 11111 0000 01223455554322111 1122223 456666655542 233222222222 Q ss_pred ---hHHhhCCCEEe--cCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCC--------c Q lcl|NC_016762. 72 ---TTCWKTNPQVI--EGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQP--------W 138 (456) Q Consensus 72 ---ed~tR~~~~i~--~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~--------~ 138 (456) ++-|-.-+.|+ +..++.+.+...++.+++..-+-++ +.+.+-...+|.-.||.|..=|..+.++. + T Consensus 115 Lggde~TGd~vfI~p~~~~~~a~~daakai~~el~~dl~~~-iNr~~~~lA~~~~aFGdsYaRiY~~~~~GV~dl~~s~y 193 (569) T protein:vir:10 115 LSFDKKTGQTFSIVPVHNGNDSDYDAAQALCGELMNDIGRT-INKEVAGWAFIMSVFGVAYVRPYAKEGIGITSFECSYY 193 (569) T ss_pred ecccccccceEEEEeecCCCCCcchHHHHHHHHHHHHHHHH-HHHHhhHHHHHHHhhhhhheeeeccCCceeEEEEeccc Confidence 12222223332 2222222233334445554433332 44555555667777887776665543332 1 Q ss_pred c-----ccccC-----CcCceeEEEEeccccCChhhhhccccc-cccCCceeEEEeecccCCccccceeeehhhhhe-ec Q lcl|NC_016762. 139 D-----RPARG-----KLNGLAKVTPAWAGCLKPKSFDEKPDS-ETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFI-LG 206 (456) Q Consensus 139 ~-----~Pl~~-----~~~~l~~i~~~~~~~~~~~~~~~Dp~s-~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~-~~ 206 (456) . +|++. +|.++. .+-|...++- .+|-. -.+-.|.+=.+. .-..||.=..-. +. T Consensus 194 t~PsfIqpFE~g~~tvGF~~~~--~~~~~~ti~~----l~p~qm~rmKmPrm~~i~---------q~~~v~~g~~~~~L~ 258 (569) T protein:vir:10 194 TLPSFIKEFEVSGNLAGFSGDY--LKDASGKMVF----ADPWAIIPMKIPYWRPKS---------NLMPVHTGHKAYSLL 258 (569) T ss_pred ccccccchhhhcCceEEeeccc--CCccccceee----echhhhhhhcccceeecc---------ccchhhhhhhheeec Confidence 1 33321 111111 0111111100 01100 012233211110 001122211101 11 Q ss_pred CC---------cCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhc----CCHHHHHHHH Q lcl|NC_016762. 207 DW---------TGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYG----VTLDALNERF 273 (456) Q Consensus 207 ~~---------~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~----~~~~~~~~~~ 273 (456) +. .--|-|+|+.+|+...++..+..+.-..=+-.++.+--+ .++|.+|..... ....+..++. T Consensus 259 ~d~~~~~Pi~psn~GgSFL~~ae~pf~~l~~Al~sL~~qri~dSv~~~~I----tlnm~gM~p~qr~~y~r~lt~~LKr~ 334 (569) T protein:vir:10 259 DNPEERTPIETQNYGTSLLEYAYEPYMNLRSAIRSLKATRFNASKIDRII----GLAMNSLDPVKAADYSRTITQTLKRA 334 (569) T ss_pred ccccccccccchhhhhHHHHHHHhHHHHHHHHHHhccchhhHHHHHhHHh----hccccCCCHHHHhHHHHHHHHHHHHH Confidence 10 113779999999988777666554211111122221111 123333322111 2234445555 Q ss_pred HHHHHHHhc-CCCe------E--EecCCC-----ceeEEecccCCHHHHHHHHHHHHHhhhcCCeEEeec---cCCCccc Q lcl|NC_016762. 274 NEAARQLNR-GNDV------L--LPTQGA-----TVTQMVSAVSDPGPTYNVNLQTAAAGVDIPTKILVG---MQTGERA 336 (456) Q Consensus 274 ~~~~~~~~~-~~~~------~--lid~~d-----~~~~~~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G---~sp~Gln 336 (456) .+++++-.+ ++.+ + +.+.+- |..+...+.-|+.|+ +..+.++||+.||-.+ |+| +=.|||. T Consensus 335 ~d~ie~a~~gg~~~~~~~~H~LPv~gekq~~~tvDt~~~~A~~~gIEdv-M~~~R~LagaLGlD~S-MlGwAD~LsGGLG 412 (569) T protein:vir:10 335 ADLMERRARGANNMPTVTNTLLPIMGDGKGQMTIDTQTIQADINGIEDI-LTYMRQLAAALGLDYT-LLGWADQMSGGLG 412 (569) T ss_pred HHHHHHHhccCccccccceeeeeeecCccccccccccccccCcccHHHH-HHHHHHHHhhhccchh-HhhHHHHhccccc Confidence 555544222 2211 1 122221 333444555577664 5667889999999998 778 4567776 Q ss_pred chHHHHHHHHHHHHHHHh-hhhHHHHHHH----HHHHHhcCcC----CCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 337 SSEDQKYHNARCQARRVQ-ELTFEINDLF----AHLMRIGVVP----LKAEFTAIWDDLTVPTKAERLANSKTMSEINSA 407 (456) Q Consensus 337 st~D~~nyyd~I~~~Qe~-~lrp~L~~l~----~~l~~s~~~~----~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~ 407 (456) -.+=+++ .+++.++. .||-.+...+ ++=+-.+.+. .+--|.|+|++-..-=+.|..+.+..++.+... T Consensus 413 eGG~frt---SaQaa~RS~~iRqa~~e~in~iidiH~~fKYgevf~~~drP~~V~F~s~~tAl~~E~~~n~~~raN~a~i 489 (569) T protein:vir:10 413 EGGFLRT---AIQAAMRASWIQQGVEEFIQRAIDIHLAFKYGKVYPEGDRPYKIEFHSVNTALQQEHNDNRDSQANYATI 489 (569) T ss_pred ccHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCcccCCCCcceEEEeccchHHHHHHHHhHHHHHHHHHHH Confidence 4443333 34433333 3555554444 4434455542 223599999999976666666665555555443 Q ss_pred HHH--cC--C---cCcCHHHHHHHhccc-CCCCC---------CCCcccCCC-CCCCCCcCC-------------CCCCC Q lcl|NC_016762. 408 AIG--TG--E---PVFTAEEIREEAGYD-PLQGG---------DPLPDTEPE-DEDAARTDP-------------TGEQQ 456 (456) Q Consensus 408 ~~~--~g--~---~~i~~~E~R~~~~~~-~~~~~---------~~~~~~~~~-d~~~~~~d~-------------~~~~e 456 (456) .++ ++ + -.++.+-.|..+... ++++. ...+.+|+- -+..-...| +-+.. T Consensus 490 ~~Q~la~l~e~n~Lg~de~~m~y~l~d~~~~De~~~e~l~ae~~akp~DEe~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 569 (569) T protein:vir:10 490 VTQILDAVSNNSVLANSDAFKRYLFSDVLEIDEKISEALVNELKAKSEDDDHLMDSIIKTPPQELAQILESVFKEGNDND 569 (569) T ss_pred HHHHHHHhhhcccccccHHHHHHHHHHHhhcchhHHHHHHhhcCCCcchhHHHHHHHhcCChHHHHHHHHHHhhccCCCC Confidence 222 11 1 122333233322111 11100 000000000 000000000 00000 No 234 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=87.97 E-value=0.035 Score=28.55 Aligned_cols=420 Identities=8% Similarity=-0.018 Sum_probs=165.8 Q ss_pred CCchhHHHHh---HHHHHHHHH---HHHHHhhhhh-ccC-c-ccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccch Q lcl|NC_016762. 1 MTDKLDLAVN---HAMSSAIAR---ARMSLLNQGI-GHD-A-KRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIV 71 (456) Q Consensus 1 ~~~~~~~~~~---~a~~~~~~~---~~d~~~n~~~-~~g-t-~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~a 71 (456) |.+....... ..++..+.. ....+..... .++ . ..+.....-..++ . -+..+.+.|+..| T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~---------~--~dst~~~a~~~La 69 (556) T protein:vir:73 1 MAETEKERLLKQLAQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNTK---------I--VDPTGSMAQRILS 69 (556) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcchhhcCc---------c--ccchHHHHHHHHH Confidence 8775433221 122221111 1122222110 000 0 0000000000000 0 1122223333333 Q ss_pred hH-------HhhCCCEEecCCCcc-h-h---hhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCc- Q lcl|NC_016762. 72 TT-------CWKTNPQVIEGDDQD-R-S---KDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPW- 138 (456) Q Consensus 72 ed-------~tR~~~~i~~~~~~d-~-~---~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~- 138 (456) .. ..|.||++.-.+.+. + . +--...++.+...+.+-++...+.++++.-..+|-+.+++.-..++-+ T Consensus 70 s~l~~~ltpp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~r 149 (556) T protein:vir:73 70 SGMMSGITSPARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVMEDDQDVIR 149 (556) T ss_pred HHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeeecCCceEE Confidence 22 258899986444321 1 1 122234556777888888999999998888888988887654322211 Q ss_pred --ccccc-----CCcCceeEEEEeccccCChhh-----------------hhccccccccCCceeEEEeecccCCc---- Q lcl|NC_016762. 139 --DRPAR-----GKLNGLAKVTPAWAGCLKPKS-----------------FDEKPDSETYGQPTMWEYTEASQAGR---- 190 (456) Q Consensus 139 --~~Pl~-----~~~~~l~~i~~~~~~~~~~~~-----------------~~~Dp~s~~yg~P~~y~i~~~~~~g~---- 190 (456) .-|+. ....|.+ ..++.+..+++.. +..+|....| +.++.-....... T Consensus 150 ~~~~~l~~~~~~~d~~G~v-d~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~---~v~~~V~pr~~~~~~~~ 225 (556) T protein:vir:73 150 TMPFPIGSYYLANSPRGSV-DTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWV---EVNHCITPNVNRDSGKM 225 (556) T ss_pred EEEeecceeEEeeCCCCCe-EEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceE---EEEEEEecccccccccc Confidence 23442 1222222 2233333333211 2222221111 1111100000000 Q ss_pred -----cccceee----------ehhhh-------hee--cCCcCCCcch-HHHHHHHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_016762. 191 -----PGLVRDI----------HPDRV-------FIL--GDWTGDAIGF-LEPAYNSFISLEKVEGGSGESFLKNAARQL 245 (456) Q Consensus 191 -----~~~~~~I----------H~SRl-------i~~--~~~~~~G~S~-le~~~~~l~~~~~~~~~~~~~~~~~~~~~l 245 (456) +-.++.+ +.|.+ .+| ..+..+|.|. .+.++..+..++.......++.-+.+...+ T Consensus 226 ~~~~~p~~s~~~~~~~~~~~vl~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~ 305 (556) T protein:vir:73 226 DSKNKPYRSVYFESGGDSDKLLRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPPM 305 (556) T ss_pred CcccceEEEEEEEecCCCceecccCCcccCCceeeeeeecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 0001112 22211 122 2344579884 888888887777666554443332211111 Q ss_pred hhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeE-EecCCCceeEEe---cccCCHHHHHHHHHHHHHhhhc Q lcl|NC_016762. 246 LLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVL-LPTQGATVTQMV---SAVSDPGPTYNVNLQTAAAGVD 321 (456) Q Consensus 246 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-lid~~d~~~~~~---~~~sgl~~~~~~~~~~~aaas~ 321 (456) .+ .+ .+.-+ .++....+.+.. .....+.++.+. ..+..+...+....+.|..+.= T Consensus 306 ~v-----------~~-------~~~~~---~~~~~pgg~~~~~~~~~~~~i~p~~~~~~d~~~~~~~i~~~~~rI~~af~ 364 (556) T protein:vir:73 306 VA-----------PT-------SLKNQ---RVSLLPGDVTYLDVISGQDGFKPAYLVNPNTADLLADIQDTRQTINSAYF 364 (556) T ss_pred ec-----------cc-------ccccc---ceeeccCccccccCCCCccceeeeccccccHHHHHHHHHHHHHHHHHHhh Confidence 00 00 00000 011111111111 111223455442 2344444445555566655543 Q ss_pred CCeEEeec-cCCCcccchH------HH-HHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCC------ceEEEeCC-C Q lcl|NC_016762. 322 IPTKILVG-MQTGERASSE------DQ-KYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKA------EFTAIWDD-L 386 (456) Q Consensus 322 IP~t~L~G-~sp~Glnst~------D~-~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~------d~~~~f~p-L 386 (456) -..-..++ +...-++++| +. ...--.+...|...|.|.|++.+.+|.+.+..|+++ +++++|-+ | T Consensus 365 ~d~~~~l~~~~~~r~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~L 444 (556) T protein:vir:73 365 VDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARKNMLPEPPDVLQGMPLRIEYISVM 444 (556) T ss_pred cchhhhhccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeecHH Confidence 22111133 3334456664 22 334455666677789999999999999998876543 36566533 3 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHcCC---cCcCHHHH-HHHhcccCCCCCCCCcccCCCCC------------------ Q lcl|NC_016762. 387 TVPTKAERLANSKTMSEINSAAIGTGE---PVFTAEEI-REEAGYDPLQGGDPLPDTEPEDE------------------ 444 (456) Q Consensus 387 ~~~seke~Aei~~~~A~a~~~~~~~g~---~~i~~~E~-R~~~~~~~~~~~~~~~~~~~~d~------------------ 444 (456) -+.-..+........++......+++- ..|+.+++ +..+..-+.+....-.+++-+.. T Consensus 445 a~aqk~~~~~~i~~~~~~~~~laq~~Pe~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~r~~~qq~~~~~~~~~ 524 (556) T protein:vir:73 445 AQAQKSIGLTSLSQTVGFIGQLAQFKPEALDKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIREERAKQAQAAQAMAMGQ 524 (556) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222222222222222333333333331 13566655 33343334332211111110000 Q ss_pred ----------CCCCcCCC----------CCCC Q lcl|NC_016762. 445 ----------DAARTDPT----------GEQQ 456 (456) Q Consensus 445 ----------~~~~~d~~----------~~~e 456 (456) +...++|. +-+| T Consensus 525 ~a~~~~~~~~~~~~~~~~~l~~~~~~~g~~~~ 556 (556) T protein:vir:73 525 AAAQGAKTLSETQTSDPSALTAIANAAGAPQQ 556 (556) T ss_pred HHHHHHHHhhhccCCCHHHHHHHHHhhcCCCC Confidence 00000110 0000 No 235 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=87.89 E-value=0.036 Score=28.51 Aligned_cols=413 Identities=9% Similarity=0.002 Sum_probs=173.1 Q ss_pred CCchhH--HHHh------HHHHHHH---HHHHHHHhhhh-hccCc--------ccchhhhhccCcccCCHHHHHHHHhcC Q lcl|NC_016762. 1 MTDKLD--LAVN------HAMSSAI---ARARMSLLNQG-IGHDA--------KRPQAWCEYGFPQEITFNDLYTMYRRG 60 (456) Q Consensus 1 ~~~~~~--~~~~------~a~~~~~---~~~~d~~~n~~-~~~gt--------~~~~~~~~~~~~~~~~~~~l~~~Y~~~ 60 (456) |.+-.+ |+.. ..++..+ ......+.... -.+.+ ...+.|..+ -.++..-. .+ T Consensus 1 m~~~~~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst-------~~~a~~~L-aa 72 (535) T protein:vir:15 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAV-------GARGLNNL-AS 72 (535) T ss_pred CCccchhccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccccccccccccc-------HHHHHHHH-HH Confidence 654331 1111 1111111 11112222211 01100 001111111 11222222 24 Q ss_pred chhhhhhccchhHHhhCCCEEecCCCc---------c---hhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEE Q lcl|NC_016762. 61 GIAHGAVEKIVTTCWKTNPQVIEGDDQ---------D---RSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGL 128 (456) Q Consensus 61 ~l~r~iVd~~aed~tR~~~~i~~~~~~---------d---~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i 128 (456) +|...+. |+ |.||++.-.+.. + ...--...++.+...+.+-++...+.++.+.-..+|-+.+ T Consensus 73 ~l~~~lt--P~----~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l 146 (535) T protein:vir:15 73 KLMLALF--PM----QSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALL 146 (535) T ss_pred HHHHhhc--CC----CcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeE Confidence 5555443 54 579998533211 1 1122234466778888889999999999888888898877 Q ss_pred EEEecCCCC--c-ccccc------CCcCceeEEEEeccccCChhhhhcc----cc-----ccccCCceeEEEee-cccCC Q lcl|NC_016762. 129 LLHIRDSQP--W-DRPAR------GKLNGLAKVTPAWAGCLKPKSFDEK----PD-----SETYGQPTMWEYTE-ASQAG 189 (456) Q Consensus 129 ~i~i~D~~~--~-~~Pl~------~~~~~l~~i~~~~~~~~~~~~~~~D----p~-----s~~yg~P~~y~i~~-~~~~g 189 (456) ++.-..+.. + .-|+. ...+.+ ..++.+..+++.+...+ -. ...+-.-+.|..-- ..-++ T Consensus 147 ~~~~~~~~~~~f~~~pl~~~~v~~d~~G~v--d~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~ 224 (535) T protein:vir:15 147 YLPEPEGSYNPMKLYRLSSYVVQRDAYGNV--LQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEESG 224 (535) T ss_pred EeecCCCCceeeEEEEcCeeEEeeCCCCCe--eEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEecCCC Confidence 764332221 1 23442 222222 22333444443332111 00 00111112222110 00000 Q ss_pred ccc-----cceeee--hhh-------h--hee--cCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhh Q lcl|NC_016762. 190 RPG-----LVRDIH--PDR-------V--FIL--GDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDK 251 (456) Q Consensus 190 ~~~-----~~~~IH--~SR-------l--i~~--~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 251 (456) ... .+..|| .|+ + .+| ..+..+|.|-.+..+..+..++...........++....+ T Consensus 225 ~~~~~~e~~g~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~------ 298 (535) T protein:vir:15 225 DYLKYEEVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIG------ 298 (535) T ss_pred cEEEEEEeeCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce------ Confidence 000 011111 111 1 111 1334579998888888888877776555444433211110 Q ss_pred hccHhhHHhhhcCCHHHHHHHHHHHHHHHhc-CCCeEEecCCCceeEEec----ccCCHHHHHHHHHHHHHhhhcCCeEE Q lcl|NC_016762. 252 EINLGEIASTYGVTLDALNERFNEAARQLNR-GNDVLLPTQGATVTQMVS----AVSDPGPTYNVNLQTAAAGVDIPTKI 326 (456) Q Consensus 252 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~lid~~d~~~~~~~----~~sgl~~~~~~~~~~~aaas~IP~t~ 326 (456) +.. .+...+. ..+.. +.+..+-+..+++..+.. .|.-....++...+.|.-+.=+ .- T Consensus 299 ------lv~-----~~g~~~~-----~~l~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~--~~ 360 (535) T protein:vir:15 299 ------LVN-----PAGITQP-----RRLTKAQTGDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFML--NS 360 (535) T ss_pred ------eec-----ccccccc-----hhcccCCceeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhh--hh Confidence 000 0011111 11222 233333344455555532 4556667777777777666411 11 Q ss_pred eeccCCCcccchH------H-HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCC--CceEEEeC-CCCCCCHHHHHH Q lcl|NC_016762. 327 LVGMQTGERASSE------D-QKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLK--AEFTAIWD-DLTVPTKAERLA 396 (456) Q Consensus 327 L~G~sp~Glnst~------D-~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~--~d~~~~f~-pL~~~seke~Ae 396 (456) |.......+.+++ + ....--.++..|...|.|.+++++.+|.+.+..|+. ..+.++|- ||.++ .|.. T Consensus 361 ~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~a---qr~~ 437 (535) T protein:vir:15 361 AVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAI---GRGQ 437 (535) T ss_pred cccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHH---HHHH Confidence 2224445567774 2 245667777888999999999999999999887643 34556653 33333 2222 Q ss_pred HHHHHHHHHHHHHHcCC----cCcCHHHH-HHHhcccCCCC-CCCCcccC-CCCCCC-----------------CCcCCC Q lcl|NC_016762. 397 NSKTMSEINSAAIGTGE----PVFTAEEI-REEAGYDPLQG-GDPLPDTE-PEDEDA-----------------ARTDPT 452 (456) Q Consensus 397 i~~~~A~a~~~~~~~g~----~~i~~~E~-R~~~~~~~~~~-~~~~~~~~-~~d~~~-----------------~~~d~~ 452 (456) -..+.-+......+.+- ..|+.+++ +.....-+.+. .+.-.+++ .+.-.. ....+. T Consensus 438 ~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~eev~~~~~q~~~~~~~~~~a~~~g~~~~~~~~ 517 (535) T protein:vir:15 438 DLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGIENAAATGGAGVGALAT 517 (535) T ss_pred HHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccchhc Confidence 22122222222223331 12555554 33333333321 11111111 000000 000000 Q ss_pred CCCC Q lcl|NC_016762. 453 GEQQ 456 (456) Q Consensus 453 ~~~e 456 (456) +-.| T Consensus 518 ~~p~ 521 (535) T protein:vir:15 518 SSPE 521 (535) T ss_pred cChH Confidence 0000 No 236 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=87.81 E-value=0.036 Score=28.48 Aligned_cols=418 Identities=8% Similarity=0.006 Sum_probs=175.6 Q ss_pred CCc--hhHHHHhH------HHHHHHH---HHHHHHhhhhh-ccCcccchhhhhccCcccC--CHHHHHHHHhcCchhhhh Q lcl|NC_016762. 1 MTD--KLDLAVNH------AMSSAIA---RARMSLLNQGI-GHDAKRPQAWCEYGFPQEI--TFNDLYTMYRRGGIAHGA 66 (456) Q Consensus 1 ~~~--~~~~~~~~------a~~~~~~---~~~d~~~n~~~-~~gt~~~~~~~~~~~~~~~--~~~~l~~~Y~~~~l~r~i 66 (456) |.+ +-.++... .++..+. .....+..... .+.+. +...+.-...+.+ +-.++..-. .++|...+ T Consensus 1 ~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~-~~~~~~~~~~~~~dst~~~a~~~L-aa~l~~~l 78 (543) T protein:vir:88 1 MAETKREGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPK-DSDNSSTDYTTPWQAVGARGLNNL-SAKVMLAL 78 (543) T ss_pred CcccccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCC-CCCcccccccccccchHHHHHHHH-HHHHHHhh Confidence 776 32222211 1121111 11222222211 11110 1000000000001 111222222 24455544 Q ss_pred hccchhHHhhCCCEEecCCCc---------c---hhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecC Q lcl|NC_016762. 67 VEKIVTTCWKTNPQVIEGDDQ---------D---RSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRD 134 (456) Q Consensus 67 Vd~~aed~tR~~~~i~~~~~~---------d---~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D 134 (456) . |+ |.||++.-.+.. + ...--...++.+.+.+.+-++...+.++.+.-..+|-+++++.-.. T Consensus 79 t--P~----~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~~~ 152 (543) T protein:vir:88 79 F--PL----QSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPPPD 152 (543) T ss_pred c--CC----CcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeccCc Confidence 3 54 579998543311 1 1112233356777888889999999999888777888887765422 Q ss_pred CCC-----c-ccccc------CCcCceeEEEEeccccCChhhh------------hccccccccCCceeEEEeecccCCc Q lcl|NC_016762. 135 SQP-----W-DRPAR------GKLNGLAKVTPAWAGCLKPKSF------------DEKPDSETYGQPTMWEYTEASQAGR 190 (456) Q Consensus 135 ~~~-----~-~~Pl~------~~~~~l~~i~~~~~~~~~~~~~------------~~Dp~s~~yg~P~~y~i~~~~~~g~ 190 (456) ++. + .-|+. +..+.+ ..++++..++..++ +.|| |..-+.|+.-.-..++. T Consensus 153 ~~~~~~~~~~~~pl~~y~v~~d~~G~v--~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p----~~~~~v~~~V~pr~~~~ 226 (543) T protein:vir:88 153 ASSNSYNPMKLYTLHNHVVQRDAFGNV--LQIVTLDKVAYAALPEDVRNSLSGGQEYKP----EQELEVYTHIYIDDESG 226 (543) T ss_pred cccceecceEEeEcceEEEeeCCCCCe--eeeeeeeeccHHHHhHHhhHHHHHHhhcCC----ccceEEEEEEEeecCCC Confidence 211 1 13552 222222 33455555555443 1222 22334443221111111 Q ss_pred c------ccceee--ehhhh---------hee--cCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhh Q lcl|NC_016762. 191 P------GLVRDI--HPDRV---------FIL--GDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDK 251 (456) Q Consensus 191 ~------~~~~~I--H~SRl---------i~~--~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 251 (456) . ..+..| +.|+. .|| ..+..||.|-.+..+..+..++...........++....+ T Consensus 227 ~~~~~~~~~~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~------ 300 (543) T protein:vir:88 227 DFLSYQEIEGVEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVG------ 300 (543) T ss_pred cccccccccCeeeecCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce------ Confidence 0 001111 11111 111 1234579998888888888877766555443333211111 Q ss_pred hccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC-CCeEEecCCCceeEEec----ccCCHHHHHHHHHHHHHhhhcCCeEE Q lcl|NC_016762. 252 EINLGEIASTYGVTLDALNERFNEAARQLNRG-NDVLLPTQGATVTQMVS----AVSDPGPTYNVNLQTAAAGVDIPTKI 326 (456) Q Consensus 252 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~lid~~d~~~~~~~----~~sgl~~~~~~~~~~~aaas~IP~t~ 326 (456) +... +... + ...+..+ .+..+.+..++...+.. +|.-....+....+.|.-+.=+ .- T Consensus 301 ------~v~~-----~g~~-~----~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~--~~ 362 (543) T protein:vir:88 301 ------LVNP-----NGIT-Q----VRRLVKAQTGDFVAGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVFML--NS 362 (543) T ss_pred ------eecc-----cccc-c----hhhcccCCCceeecCCCCcceeeecccccchhHHHHHHHHHHHHHHHHHhh--hh Confidence 0000 0001 1 1122232 33333344455544322 4666667777777777665522 12 Q ss_pred eeccCCCcccchH------H-HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCC--CceEEEeC-CCCCCCHHHHHH Q lcl|NC_016762. 327 LVGMQTGERASSE------D-QKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLK--AEFTAIWD-DLTVPTKAERLA 396 (456) Q Consensus 327 L~G~sp~Glnst~------D-~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~--~d~~~~f~-pL~~~seke~Ae 396 (456) |.......+.+++ + ....--.++..|...|.|.|++.+.+|.+.+..|++ +++.++|- +|..+....+++ T Consensus 363 ~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~ 442 (543) T protein:vir:88 363 AVQRSGERVTAEEIRYVASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPNLPQEAVEPTVTTGAEALGRGQDLD 442 (543) T ss_pred hccCCCCcccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhceeeeEEecHHHHHHHHHHH Confidence 2224445577774 2 234666777788999999999999999999887543 34555543 344444333333 Q ss_pred HHHHHHHHHHHHHHcC-CcCcCHHHHHH-HhcccCCC-CCCC-CcccCCC------------------------------ Q lcl|NC_016762. 397 NSKTMSEINSAAIGTG-EPVFTAEEIRE-EAGYDPLQ-GGDP-LPDTEPE------------------------------ 442 (456) Q Consensus 397 i~~~~A~a~~~~~~~g-~~~i~~~E~R~-~~~~~~~~-~~~~-~~~~~~~------------------------------ 442 (456) --....+.......-+ ...|+.+++-+ ....-+.+ .... .+++... T Consensus 443 ~l~~~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~~~~i~r~~~e~~~~~~q~~~q~~~~~~~~~~~~~~~~~~~~~~ 522 (543) T protein:vir:88 443 KLTQFLNAVATVSQLNGDPDLNVNNIKLRLANAIGIDTAGLLLTEAEKAQAQSQEMLKQGGLNAAAGIGSGVAAQATASP 522 (543) T ss_pred HHHHHHHHHHhccchhhhccCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhccCh Confidence 2222222221111100 11244444422 22222221 1110 0000000 Q ss_pred -------CCCCCCcCCCCCCC Q lcl|NC_016762. 443 -------DEDAARTDPTGEQQ 456 (456) Q Consensus 443 -------d~~~~~~d~~~~~e 456 (456) +....++-|-+-+- T Consensus 523 ~~~~~~~~~~~~~~~p~~~~~ 543 (543) T protein:vir:88 523 EAMESAMDTAGVQPGPIATQV 543 (543) T ss_pred HHHHHHhhhcCCCCCCCCCCC Confidence 00000111111100 No 237 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=87.73 E-value=0.037 Score=28.45 Aligned_cols=314 Identities=11% Similarity=0.029 Sum_probs=130.2 Q ss_pred CCchhHHHHh-HHHHHHHHHHHHHHhhhhhc-----cCcc--cchh--h-hhccCcccCCHHHHHHHHhcCchhhhhhcc Q lcl|NC_016762. 1 MTDKLDLAVN-HAMSSAIARARMSLLNQGIG-----HDAK--RPQA--W-CEYGFPQEITFNDLYTMYRRGGIAHGAVEK 69 (456) Q Consensus 1 ~~~~~~~~~~-~a~~~~~~~~~d~~~n~~~~-----~gt~--~~~~--~-~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~ 69 (456) =..+..-++. ++..+....+.-.+..+..| +++. .|-. | +.-+|...+++.-|..+++.|.-...++.. T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~y~~~~~~~~~~~pp~~~~~la~~~~~~~~h~~~l~~ 86 (350) T protein:vir:11 7 HRRQQPVTVQSAQEGEFIPRQGGRAEAFTFGDPMPVLDGRGILDYLECWPNGRWYEPPLSMEGLAKSVGSSVYLQSGLKF 86 (350) T ss_pred CCCcCccccCCcchhhhccccccceEEEEeCCceeecCcchhhHHHHHhhcCccccCCCCHHHHHHHHhhhhhhccchhh Confidence 0000000111 01000000000001011111 1111 0111 1 122455678888888888888877777765 Q ss_pred chhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccccccCCcCce Q lcl|NC_016762. 70 IVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDRPARGKLNGL 149 (456) Q Consensus 70 ~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~Pl~~~~~~l 149 (456) -.....+ +++ +.. . +.+..+++ ++..-.++|-+++.+.- ++ .+.+ T Consensus 87 k~n~l~~-~~~------Pn~--~-----------~t~~~f~~----~v~d~ll~Gnay~~~~r-n~----------~G~~ 131 (350) T protein:vir:11 87 KRNMLAK-TFI------PHR--L-----------LSRATFEQ----FSLDWLTFGSAYLEQPR-SR----------LGTR 131 (350) T ss_pred hhhhhhh-ccc------CCC--C-----------CCHHHHHH----HHHHHHhcCCeEEEEEE-cC----------CCCE Confidence 4433222 111 000 0 11111222 22223467877776643 21 1112 Q ss_pred eEEEEeccccCChhhhhccccccccCCceeEEEeecccCCccccceeeehhhhheecCCc----CCCcchHHHHHHHHHH Q lcl|NC_016762. 150 AKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAGRPGLVRDIHPDRVFILGDWT----GDAIGFLEPAYNSFIS 225 (456) Q Consensus 150 ~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g~~~~~~~IH~SRli~~~~~~----~~G~S~le~~~~~l~~ 225 (456) ..+.|+-...+... .|. + .+|++.. + +..+.+.++.||+|.... ..|+|-+..+...+.- T Consensus 132 ~~L~~l~~~~vr~~---~~~---~----~~~~~~~---~---~~~~~~~~~eVihir~~~~~~~~yGls~~~~a~~si~l 195 (350) T protein:vir:11 132 MPLQAPLAKYMRRG---TDL---E----TFYQVRS---W---KDEHEFEKGSVIQLREADINQEIYGVPEWFCALQSALL 195 (350) T ss_pred EEEEEeCCceeEee---ecC---C----eEEEEee---C---CeEEEECcccEEEeCCCCCCCCcccccHHHHHHHHHHH Confidence 33333332222221 111 1 1355542 1 223567777888876432 3589988888776654 Q ss_pred HHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHh--cCCCeEEec-C---CCceeEEe Q lcl|NC_016762. 226 LEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLN--RGNDVLLPT-Q---GATVTQMV 299 (456) Q Consensus 226 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~lid-~---~d~~~~~~ 299 (456) -.. +......+|+|....-.+-. +.+ ..-.++..+++.+.++... .|.+.+++. . ++.++... T Consensus 196 ~~~-a~~~~~~~f~NGa~~~gil~-----~~~-----~~ls~e~~~~l~~~~~~~~G~~N~~~~~v~~~~g~~~g~~~~p 264 (350) T protein:vir:11 196 NES-ATLFRRKYYNNGSHAGFILY-----MTD-----AAQNEEDIDALRTALKTAKGPGNFRNLFVYAPNGKKEGIQLIP 264 (350) T ss_pred HHH-HHHHHHHHHhccCCCceEEE-----ecC-----CCCCHHHHHHHHHHHHHhcCccccCceeeecCCCCccceEEEE Confidence 332 33344455666433221111 110 0111334445555554432 222333333 2 23244443 Q ss_pred cccCC----HHHHHHHHHHHHHhhhcCCeEEeeccCCC---cccchHH-HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhc Q lcl|NC_016762. 300 SAVSD----PGPTYNVNLQTAAAGVDIPTKILVGMQTG---ERASSED-QKYHNARCQARRVQELTFEINDLFAHLMRIG 371 (456) Q Consensus 300 ~~~sg----l~~~~~~~~~~~aaas~IP~t~L~G~sp~---Glnst~D-~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~ 371 (456) ++.+. +-++-....++||++-+||-. |+|..+. |++.-+. .+.||.. .|.|.++++-+ +..+ T Consensus 265 l~~~~~d~qf~e~k~~~~~eIa~a~~VPp~-llGi~~~~t~~~sn~e~~~~~f~~~-------~L~P~~~~ie~-ln~~- 334 (350) T protein:vir:11 265 VSEVAAKDEFGSIKNISRDDQLAGLRVYPQ-LMGVVPQNAGGFGSISDAAAVWASL-------ELAPMQTRLQQ-VNEM- 334 (350) T ss_pred cCCChhHHHHHHHHHHhHHHHHHHhCCCHH-HhcccCCCCCCcCCHHHHHHHHHHH-------HHHHHHHHHHH-HHhh- Confidence 33332 344455667789999999986 8886543 3443333 3455432 35666655544 2222 Q ss_pred CcCCCCce-EEEeCCC Q lcl|NC_016762. 372 VVPLKAEF-TAIWDDL 386 (456) Q Consensus 372 ~~~~~~d~-~~~f~pL 386 (456) +++....| .|.-+.| T Consensus 335 l~~~~~~F~~~~~~~l 350 (350) T protein:vir:11 335 IGEEVVRFAQFDAPGL 350 (350) T ss_pred cCccccccCcccccCC Confidence 22211110 1222233 No 238 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=87.40 E-value=0.039 Score=28.31 Aligned_cols=413 Identities=9% Similarity=-0.044 Sum_probs=167.0 Q ss_pred CCchh-HHHHhHHHHHHHHH---HHHHHhhhhh-ccC--cccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhH Q lcl|NC_016762. 1 MTDKL-DLAVNHAMSSAIAR---ARMSLLNQGI-GHD--AKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTT 73 (456) Q Consensus 1 ~~~~~-~~~~~~a~~~~~~~---~~d~~~n~~~-~~g--t~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed 73 (456) |..|. ...- ..++..+.. .+..+..... .++ +..+.. .++. +......+| .+.+.+.|+..|.. T Consensus 1 ~~~~~l~~r~-~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~---~~~~---~~~~~~~i~--dst~~~a~~~Las~ 71 (547) T protein:vir:10 1 MENSKIVKRL-DFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRS---EGSI---NWNQNREVF--DSTAGDGLETLSSS 71 (547) T ss_pred CCHHHHHHHH-HHHHHHhhHHHHHHHHHHHHhcccccccccCCCC---Cccc---ccccccccc--cchHHHHHHHHHHH Confidence 65432 1111 122222211 1122222111 000 000000 0000 000000011 12233333333333 Q ss_pred H-------hhCCCEEecCCCc-----chhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCC---- Q lcl|NC_016762. 74 C-------WKTNPQVIEGDDQ-----DRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQP---- 137 (456) Q Consensus 74 ~-------tR~~~~i~~~~~~-----d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~---- 137 (456) + .|.||++.-.+.+ ...+--...++.+.+.+.+-++...+.++.+.-..+|.+.+++.- |+.. T Consensus 72 L~~~ltPp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~-d~~~~~~~ 150 (547) T protein:vir:10 72 LHGSLTSPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEE-DEDEEGSV 150 (547) T ss_pred HHHhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEecc-CCCCCCce Confidence 2 4789988543332 112222334556777888889999999999888888888777643 3221 Q ss_pred --cccccc------CCcCceeEEEEeccccCChhhhhc----cccc----------cccCCceeEEEeecc--cCCc--- Q lcl|NC_016762. 138 --WDRPAR------GKLNGLAKVTPAWAGCLKPKSFDE----KPDS----------ETYGQPTMWEYTEAS--QAGR--- 190 (456) Q Consensus 138 --~~~Pl~------~~~~~l~~i~~~~~~~~~~~~~~~----Dp~s----------~~yg~P~~y~i~~~~--~~g~--- 190 (456) -.-|+. ...+.+ ..++++..++..+... +.++ |+-..++.+.+...- ..+. T Consensus 151 r~~~~pl~~~~v~~d~~G~v--~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~ 228 (547) T protein:vir:10 151 VFQSSPIQDSYFEEDSRGQV--VNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNR 228 (547) T ss_pred eEEEeecceEEEeeCCCcCe--eeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCc Confidence 123442 222222 3344444444432211 1111 111111211111000 0000 Q ss_pred -----------ccc---------ceeeehhh-----h--hee--cCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_016762. 191 -----------PGL---------VRDIHPDR-----V--FIL--GDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNA 241 (456) Q Consensus 191 -----------~~~---------~~~IH~SR-----l--i~~--~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~ 241 (456) +-. ....+.|. + .+| ..+..+|.|..+.++..+..++.......+..-+.. T Consensus 229 ~~~~~~~~~~~p~~s~~~e~~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~ 308 (547) T protein:vir:10 229 NAGTVLAPTERPFGKKWILKEGAVQLGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVI 308 (547) T ss_pred cccceeeccccceeEEEEEecCceeeeecCCcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 000 01111111 1 111 133457999888888888777766655444333221 Q ss_pred hhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEecc--cCCHHHHHHHHHHHHHhh Q lcl|NC_016762. 242 ARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSA--VSDPGPTYNVNLQTAAAG 319 (456) Q Consensus 242 ~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~--~sgl~~~~~~~~~~~aaa 319 (456) ...+.+ ........+ + . ...+....+..+.++.++.. |.-....++...+.|..+ T Consensus 309 ~pp~~v-----------------~~~g~~~~~----~-~-~pgg~~~~~~~~~v~pl~~~~~~~~~~~~i~~~~~rI~~a 365 (547) T protein:vir:10 309 DPAIMV-----------------TERGLISDI----D-L-GASGLTVVRDMESMKPFESRARFDVSSIQLTDLRSAVRRI 365 (547) T ss_pred cCceec-----------------ccccccccc----e-e-cCCeeeecCCcccceeeecccchHHHHHHHHHHHHHHHHH Confidence 111100 000111111 1 0 11222333444566655433 344445555566655554 Q ss_pred hcCCeEEeec-cCCCcccchH------H-HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCC-C--------ceEEE Q lcl|NC_016762. 320 VDIPTKILVG-MQTGERASSE------D-QKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLK-A--------EFTAI 382 (456) Q Consensus 320 s~IP~t~L~G-~sp~Glnst~------D-~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~-~--------d~~~~ 382 (456) .= .=+|+ .....+++++ + .+..--.++..|...|.|.|++.+.+|.+.+..|++ + ++.|+ T Consensus 366 f~---~d~~~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~ 442 (547) T protein:vir:10 366 YY---VDQLQMKDSPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIV 442 (547) T ss_pred hh---hhhhhcCCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEE Confidence 32 11223 2344566664 2 234566667788889999999999999999887643 2 23344 Q ss_pred eC-CCCCCCHHHHHHHHHHHHHHHHHHHHcCCc---CcCHHHH-HHHhcccCCCCCCCCcccCCCCCCCCCcC------- Q lcl|NC_016762. 383 WD-DLTVPTKAERLANSKTMSEINSAAIGTGEP---VFTAEEI-REEAGYDPLQGGDPLPDTEPEDEDAARTD------- 450 (456) Q Consensus 383 f~-pL~~~seke~Aei~~~~A~a~~~~~~~g~~---~i~~~E~-R~~~~~~~~~~~~~~~~~~~~d~~~~~~d------- 450 (456) |- ||-+.-..+.........+......+++-. .|+.+++ +..+..-+.+...-- .+++.+.-- T Consensus 443 ~is~Laraq~~~~~~~i~~~~~~v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~ir-----s~eev~~~r~qr~~~~ 517 (547) T protein:vir:10 443 YTGPLSRAQKIDQAASIERWAGSTAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQTLMR-----PKAKVTSIRKNRSQTQ 517 (547) T ss_pred eccHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhhcCCHHHHHHHHHHHhCCChhccC-----CHHHHHHHHHHHHHHH Confidence 32 333332222222222223333333333311 3666655 333333333221111 111111000 Q ss_pred ------------------------CCCCCC Q lcl|NC_016762. 451 ------------------------PTGEQQ 456 (456) Q Consensus 451 ------------------------~~~~~e 456 (456) ...+++ T Consensus 518 q~~~qaa~~~~~g~~m~~~~~~~a~~~~~~ 547 (547) T protein:vir:10 518 QKAEQAAIAEAEGNAMEAQGKGQAALKENQ 547 (547) T ss_pred HHHHHHHHHHHHHHHHHhhcCcccchhccC Confidence 000111 No 239 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=86.73 E-value=0.044 Score=28.05 Aligned_cols=396 Identities=10% Similarity=0.001 Sum_probs=154.9 Q ss_pred CCch--------hHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcc--cCCHHHHHHHHh----cCchhhhh Q lcl|NC_016762. 1 MTDK--------LDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQ--EITFNDLYTMYR----RGGIAHGA 66 (456) Q Consensus 1 ~~~~--------~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~--~~~~~~l~~~Y~----~~~l~r~i 66 (456) |+++ |+.-..++..+. ..+|.+ .|+..-+....-+-|+ .-+.+. +..|. ...+.+++ T Consensus 1 m~~~~~~~v~~~h~~y~a~~~~W~--~ird~~------~G~~~~r~~g~~YLPk~~~E~~~~-Y~~rl~rA~~~n~~~~t 71 (513) T protein:vir:97 1 MADKDPKSPATTSGAYDQMLPRWH--VIETLL------GGTEAMREAGETYLPRHQEETDKG-YQERLASAVLLNMVEQT 71 (513) T ss_pred CCCCCCCCCCcCCHHHHHHHHHHH--HHHHHh------cChHHHHhhcccCCCCCCCCCHHH-HHHHHhcccCCChHHHH Confidence 6666 444333333332 245655 3333223222222222 112222 22222 24567778 Q ss_pred hccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHH-HHH-----HhhHHHHHHHHHHhhcccCceEEEEEecCCCCc-- Q lcl|NC_016762. 67 VEKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKP-LIA-----GGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPW-- 138 (456) Q Consensus 67 Vd~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~-~~~-----~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~-- 138 (456) |+..+--.+|+-+++.....+ .+.. +++ -.++...++.+.+....+|.++++++....... T Consensus 72 l~~l~G~vf~k~p~~~~~~p~-----------~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~ 140 (513) T protein:vir:97 72 LDTLSGKPFSEPIKLNEDVPK-----------AIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPRED 140 (513) T ss_pred HHHHhhhhhhcCcccCcCchH-----------HHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccc Confidence 888888888887776321111 1222 111 124666677777788889989998876321100 Q ss_pred c--------------------cccc------CCcCceeEE-EEeccccCChhhhhcccccccc------CCceeEEEeec Q lcl|NC_016762. 139 D--------------------RPAR------GKLNGLAKV-TPAWAGCLKPKSFDEKPDSETY------GQPTMWEYTEA 185 (456) Q Consensus 139 ~--------------------~Pl~------~~~~~l~~i-~~~~~~~~~~~~~~~Dp~s~~y------g~P~~y~i~~~ 185 (456) . .|.. ...+|...+ .++++-.+. ..|.++... ..|-.|+|-.. T Consensus 141 ~~~~T~Ade~~~~~rPy~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~----~~Dgf~~~~~~q~rvL~~g~~~v~r~ 216 (513) T protein:vir:97 141 GQPRTLADDRREGLRPYWVMIKPECLLFARSEVINGVEVLQHVRIIEHYM----EQDGFAEVCKRRIRVLEPGLVQLWEP 216 (513) T ss_pred hhHHhHHHHHhhccCceEEEecHhhhcCcceeccCcceeeeeEEEEEEEe----ecCCCcceEEEEEEEEeCceEEEEEe Confidence 0 1111 011121111 122221111 123222111 02333333221 Q ss_pred ccCCcc-ccceeeehhh-----hheec----CCcCC--Cc-chHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhh Q lcl|NC_016762. 186 SQAGRP-GLVRDIHPDR-----VFILG----DWTGD--AI-GFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKE 252 (456) Q Consensus 186 ~~~g~~-~~~~~IH~SR-----li~~~----~~~~~--G~-S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 252 (456) ..++.. .....+|.+. .|.|. ..+.+ |. |++..++=.+..|... .-.-+.+|.....++.+.. T Consensus 217 ~~~~~~~~~e~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLl~LA~ln~~hy~~~-Sd~~~il~~~~~P~l~~~G--- 292 (513) T protein:vir:97 217 VKKSNAQKEEWALADEWATGLNYVPLVTFYADRQGFMMGKPPLLDLAHLNVAHWQSA-SDQRHILTVSRFPILACSG--- 292 (513) T ss_pred ecCCCccccceEEecCCCCcCCceeEEEEecCCCCCCCCccchHHHHHHHHHHHhhh-hhHHHHHHhcccceeeeec--- Confidence 111111 1123343333 22221 11222 22 3444443233333221 1222233333333333221 Q ss_pred ccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEE-ec-CCCceeEEecccCCHHHH---HHHHHHHHHhhhcCCeEEe Q lcl|NC_016762. 253 INLGEIASTYGVTLDALNERFNEAARQLNRGNDVLL-PT-QGATVTQMVSAVSDPGPT---YNVNLQTAAAGVDIPTKIL 327 (456) Q Consensus 253 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-id-~~d~~~~~~~~~sgl~~~---~~~~~~~~aaas~IP~t~L 327 (456) +.+..+ + . +.-+-+.++ +. ++.++..++.+-+++... +....+++..+.- +| T Consensus 293 -----~~~~~~---~----~-------i~iG~~~~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~qm~~~Ga----~l 349 (513) T protein:vir:97 293 -----ASGEDS---D----P-------VVVGPNKVLYNPDPAGRFYYVEHTGQAIAAGRTDLKDLEEQMAGYGA----EF 349 (513) T ss_pred -----CCcCCC---C----c-------eEeeccccccCCCCCCcceeeccCchhHHHHHHHHHHHHHHHHHHHH----Hh Confidence 100000 0 0 111222222 22 356777777777777543 3333444433322 34 Q ss_pred eccCCCcccchH---HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHH Q lcl|NC_016762. 328 VGMQTGERASSE---DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEI 404 (456) Q Consensus 328 ~G~sp~Glnst~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a 404 (456) +..+++..++++ |...=+..++++.. .+...|+++++++.++... .+++.+|+.|+=+.....+..++ ++ T Consensus 350 l~~~~~~~Ta~a~~~~~~~~~S~L~~~a~-~le~al~~~l~~~a~wlg~-~~~~~~v~in~dF~~~~~~~~~~-----~a 422 (513) T protein:vir:97 350 LKRKTGGQTATARALDSAEATSDLSAMTG-LFEDALAQALDITADWLRL-GPNGGTVELVKDYDLEEMDAPGL-----QA 422 (513) T ss_pred hccCCccccHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhCC-CCCccEEEeccccCcccCCHHHH-----HH Confidence 445566566553 55556667777665 4788899999888777433 33456677666554333322222 11 Q ss_pred HHHHHHcCCcCcCHHHHHHHhcccCC--------------------CCCCCCcccCCCC--------CCCCCcCCCCCCC Q lcl|NC_016762. 405 NSAAIGTGEPVFTAEEIREEAGYDPL--------------------QGGDPLPDTEPED--------EDAARTDPTGEQQ 456 (456) Q Consensus 405 ~~~~~~~g~~~i~~~E~R~~~~~~~~--------------------~~~~~~~~~~~~d--------~~~~~~d~~~~~e 456 (456) ...+...| .|+...+++.+..-+. ..++...+..+.+ ...+.+.++++.. T Consensus 423 l~~a~~~G--~is~~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (513) T protein:vir:97 423 LQVAREKR--DISRKTYLNGLRLRGVLPEDFDEDEDWEELMEEISEAMGRAGLDLDPAQKNPPEGGEGEGEGEGEGGEGG 500 (513) T ss_pred HHHHHhCC--CCCHHHHHHHHHhccCCCccCCHHHHHHHHHHhhhhccCCCCccccccCCCCCCCCCCCCCCCCCCCCCC Confidence 12223333 3333222221110000 0000000000000 0111112222222 No 240 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=86.55 E-value=0.045 Score=27.98 Aligned_cols=412 Identities=8% Similarity=-0.010 Sum_probs=162.5 Q ss_pred CCchhHHHHhH---HHHHHHHH---HHHHHhhhhh--ccC-cccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccch Q lcl|NC_016762. 1 MTDKLDLAVNH---AMSSAIAR---ARMSLLNQGI--GHD-AKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIV 71 (456) Q Consensus 1 ~~~~~~~~~~~---a~~~~~~~---~~d~~~n~~~--~~g-t~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~a 71 (456) |.+++..-... .++..+.. ....+..... ... ...+.....-..++. -+..+.+.|+..| T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~-----------~dst~~~a~~~La 69 (559) T protein:vir:95 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRI-----------IDSTGTMAARTLA 69 (559) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCccccccccc-----------ccchHHHHHHHHH Confidence 88865442221 12211111 1222222210 110 001100000001110 1122223333333 Q ss_pred hHH-------hhCCCEEecCCCcc-h-hh---hhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCc- Q lcl|NC_016762. 72 TTC-------WKTNPQVIEGDDQD-R-SK---DETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPW- 138 (456) Q Consensus 72 ed~-------tR~~~~i~~~~~~d-~-~~---~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~- 138 (456) ..+ .|.||++.-.+... + .+ --...++.+.+.+.+-++...+.++.+.-..+|-+++++.-+.++.+ T Consensus 70 s~l~~~ltpp~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~d~~~~~r 149 (559) T protein:vir:95 70 SGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLDDDEDIIR 149 (559) T ss_pred HHHHHhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeecCCCceeE Confidence 222 57899985444321 1 11 22233556777888889999999998887788988887754322221 Q ss_pred --ccccc------CCcCceeEEEEeccccCChhh-----------------hhccccccccCCceeEEEeecccCCccc- Q lcl|NC_016762. 139 --DRPAR------GKLNGLAKVTPAWAGCLKPKS-----------------FDEKPDSETYGQPTMWEYTEASQAGRPG- 192 (456) Q Consensus 139 --~~Pl~------~~~~~l~~i~~~~~~~~~~~~-----------------~~~Dp~s~~yg~P~~y~i~~~~~~g~~~- 192 (456) .-|+. ...+.+ ..++++..+++.+ ++.+|....| +.|+.-.....+.+. T Consensus 150 ~~~~~l~~~~v~~d~~G~v--d~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v---~v~~~V~pr~~~~~~~ 224 (559) T protein:vir:95 150 TMPFPIGSYYLANSPRGSV--DTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWI---EVMHSVYPNIDRDTSK 224 (559) T ss_pred EEEeecCeEEEeeCCCCCe--EEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeE---EEEEEEeccccccccc Confidence 23442 222222 2344444444422 2222222211 111110000000000 Q ss_pred --------cc----------eeeehhhh-------hee--cCCcCCCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_016762. 193 --------LV----------RDIHPDRV-------FIL--GDWTGDAIG-FLEPAYNSFISLEKVEGGSGESFLKNAARQ 244 (456) Q Consensus 193 --------~~----------~~IH~SRl-------i~~--~~~~~~G~S-~le~~~~~l~~~~~~~~~~~~~~~~~~~~~ 244 (456) .+ ..++.|.+ .+| .....+|.| ..+.++..+..++........+.-+..... T Consensus 225 ~~~~~~pf~s~~~e~~~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp 304 (559) T protein:vir:95 225 LDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPP 304 (559) T ss_pred cccccceEEEEEEEecCCCceeeecCCcccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCc Confidence 00 11122211 122 123456888 477787877777666554443333221111 Q ss_pred hhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCC---CceeEEecccCCHHHH---HHHHHHHHHh Q lcl|NC_016762. 245 LLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQG---ATVTQMVSAVSDPGPT---YNVNLQTAAA 318 (456) Q Consensus 245 l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~---d~~~~~~~~~sgl~~~---~~~~~~~~aa 318 (456) +.+ .+ +...+.+ .+.-+ +...++.. +.++.....=+++..+ +....+.|.. T Consensus 305 ~~v-----------~~------~~~~~~~-----~l~pg-g~~~~~~~~~~~~i~p~~~~~~~~~~~~~~i~~~~~rI~~ 361 (559) T protein:vir:95 305 MVA-----------PT------SLKNQRA-----SLLPG-DITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINS 361 (559) T ss_pred eec-----------cc------cccccce-----eeecc-ceeeeCCCCCcccceeecccccchHHHHHHHHHHHHHHHH Confidence 100 00 0000011 01111 11222221 2333322111334433 3344444544 Q ss_pred hhcC-CeEEeeccCCCcccchH------HH-HHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCC------ceEEEeC Q lcl|NC_016762. 319 GVDI-PTKILVGMQTGERASSE------DQ-KYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKA------EFTAIWD 384 (456) Q Consensus 319 as~I-P~t~L~G~sp~Glnst~------D~-~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~------d~~~~f~ 384 (456) +.=- ++-.|-.+....+++++ +. ...--.+...|...|.|.|++.+.+|.+.+..|+++ +++++|- T Consensus 362 af~~d~~~~l~~r~~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~i~v~~i 441 (559) T protein:vir:95 362 AYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEGMPLKVEYI 441 (559) T ss_pred HhhhhhHHHhhcCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccCcceEEEee Confidence 4321 12223334555667774 22 334555666677789999999999999998876443 3445543 Q ss_pred -CCCCCCHHHHHHHHHHHHHHHHHHHHcCC---cCcCHHHH-HHHhcccCCCCCCCCcccCCCCCCCCCcC--------- Q lcl|NC_016762. 385 -DLTVPTKAERLANSKTMSEINSAAIGTGE---PVFTAEEI-REEAGYDPLQGGDPLPDTEPEDEDAARTD--------- 450 (456) Q Consensus 385 -pL~~~seke~Aei~~~~A~a~~~~~~~g~---~~i~~~E~-R~~~~~~~~~~~~~~~~~~~~d~~~~~~d--------- 450 (456) ||-+.-..+..+.-...++......+++- ..|+.+++ +..+..-+.+....-. +++.+.-- T Consensus 442 s~La~aqk~~~~~~i~~~~~~~~~laq~~Pevld~id~d~~~~~~a~~~Gvp~~~irs-----~~ev~~~rqqr~~~qq~ 516 (559) T protein:vir:95 442 SVMAQAQKSIGLSSLASTVNFIGQLAQVKPEALDKLNVDQAIDAFADMSGVSPTVIVP-----QEQVEQARQQRAQQQQQ 516 (559) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhccChhhhhcCCHHHHHHHHHHHhCCchhhcCC-----HHHHHHHHHHHHHHHHH Confidence 33222222222222222333333333331 13555554 3333333332221111 11111000 Q ss_pred ------------------------------------CCCCCC Q lcl|NC_016762. 451 ------------------------------------PTGEQQ 456 (456) Q Consensus 451 ------------------------------------~~~~~e 456 (456) .++.++ T Consensus 517 ~q~~~~~~~aa~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 558 (559) T protein:vir:95 517 QQMMAMGMAAAQGVKTLSEAKTSDPSVLSAMANAVSGQGGQS 558 (559) T ss_pred HHHHHHHHHHHHhhhccccccCCChhHHHHHHHhhcCccccC Confidence 000000 No 241 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=85.76 E-value=0.05 Score=27.69 Aligned_cols=408 Identities=9% Similarity=-0.021 Sum_probs=168.1 Q ss_pred CCchhHHHHhHHHHHHH-HHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHH----- Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAI-ARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTC----- 74 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~-~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~----- 74 (456) |-.+.+..-+..-++.. .++++...=..-.+.+. +..-.. ..+...| +..+.+.|+..|.-+ T Consensus 1 mk~~~~~~~~~lkr~~~e~~w~e~a~~tlP~~~~~-~~~~~~---------~~~~~~~--dstg~~a~~~LAa~l~~~lt 68 (510) T protein:vir:78 1 MKSTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVD-PMSGSR---------GVVEHDF--QSAGALLVNNLAAKLARSLF 68 (510) T ss_pred ChhHHHHHHHHHhccchHHHHHHHHHhhccccccC-CCCccc---------ccccCcc--cchHHHHHHHHHHHHHHhhc Confidence 77776665553322221 22222221111111110 000000 0011111 122233444443333 Q ss_pred --hhCCCEEecCCCcc------------hhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCc-c Q lcl|NC_016762. 75 --WKTNPQVIEGDDQD------------RSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPW-D 139 (456) Q Consensus 75 --tR~~~~i~~~~~~d------------~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~-~ 139 (456) .|.||++.-++... -.+--...|+.+.+.+.+-++...+.++.+.--.+|.+.+++. .|+..+ . T Consensus 69 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~-~~~~~~~~ 147 (510) T protein:vir:78 69 PTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN-SDEATVVA 147 (510) T ss_pred CCCCcccccCCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEe-CCCCeEEE Confidence 46798885333211 1112223455667778888999999999998777787766654 333322 2 Q ss_pred cccc------CCcCceeEEEEeccccCChhhhhc----cccc-----cccCCceeEEEeecccCCccccceeeeh----- Q lcl|NC_016762. 140 RPAR------GKLNGLAKVTPAWAGCLKPKSFDE----KPDS-----ETYGQPTMWEYTEASQAGRPGLVRDIHP----- 199 (456) Q Consensus 140 ~Pl~------~~~~~l~~i~~~~~~~~~~~~~~~----Dp~s-----~~yg~P~~y~i~~~~~~g~~~~~~~IH~----- 199 (456) -|+. ...+.+. .++.+..+++.++.. +..+ ..+..-+.|+.-... ++......-||. T Consensus 148 ~pl~~y~v~~d~~G~vd--~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~V~~~-~~~~~~~~sv~~e~dg~ 224 (510) T protein:vir:78 148 WSLRSYAVRRDATGRWM--DIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRR-KGTAMDYAEMYHEIDGV 224 (510) T ss_pred EEcceeEEeeCCCcCee--EEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEEEEEee-cCCCCcEEEEEEEecCe Confidence 3552 2333332 233344444322111 1000 011111222111100 000000111221 Q ss_pred ------------hhh--hee--cCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhc Q lcl|NC_016762. 200 ------------DRV--FIL--GDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYG 263 (456) Q Consensus 200 ------------SRl--i~~--~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~ 263 (456) .=. .|| ..+..+|.|-.+.++..+..++........ .+.......+ +....+ T Consensus 225 ~i~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~----~a~~a~~~~~--------lv~p~g 292 (510) T protein:vir:78 225 RVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGL----YELESLEVLN--------LVDEAK 292 (510) T ss_pred eeccccccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHH----HHHHhhcCCc--------ccCCcc Confidence 111 112 123457999888888888777666544332 2211111111 111001 Q ss_pred CCHHHHHHHHHHHHHHHhcCC-CeEEecCCCceeEEec----ccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCc-ccc Q lcl|NC_016762. 264 VTLDALNERFNEAARQLNRGN-DVLLPTQGATVTQMVS----AVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGE-RAS 337 (456) Q Consensus 264 ~~~~~~~~~~~~~~~~~~~~~-~~~lid~~d~~~~~~~----~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~G-lns 337 (456) .. ....+..+- +..+-+..+++..++. .|.-....+....+.|.-+. .+. |. +..++ +++ T Consensus 293 -----~~-----~~~~l~~~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF--~~~-l~-~~~~~rvTA 358 (510) T protein:vir:78 293 -----GA-----VVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF--MYG-AN-QRDAERVTA 358 (510) T ss_pred -----cc-----chhhhccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHH--hhc-cc-cCCCCCcCH Confidence 01 111223333 3333333455555443 34444566777777777664 233 33 33443 677 Q ss_pred hH------HH-HHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCc----eEEEeCCCCCCCHHHHHHHHHHHHHHHH Q lcl|NC_016762. 338 SE------DQ-KYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAE----FTAIWDDLTVPTKAERLANSKTMSEINS 406 (456) Q Consensus 338 t~------D~-~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d----~~~~f~pL~~~seke~Aei~~~~A~a~~ 406 (456) +| +. ...--.++..|...|.|.+++.+.+|.+.++.|++++ ..+++ +..+.-.++++ ....+.+ T Consensus 359 tEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p~p~~~~~~~~v~~--is~Laraq~~~---~l~~~~q 433 (510) T protein:vir:78 359 EEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETG--LPALSRSAAVQ---SMLNASQ 433 (510) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccceeeec--ccHHHHHHHHH---HHHHHHH Confidence 64 22 3455667778888999999999999988876655443 12222 22222223222 2222222 Q ss_pred HHHHcC-----CcCcCHHHHHH-HhcccCC-CCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 407 AAIGTG-----EPVFTAEEIRE-EAGYDPL-QGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 407 ~~~~~g-----~~~i~~~E~R~-~~~~~~~-~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) .....+ .+.|+.+++-+ .+..-+. ....--.+++-+.. ..+....+.++ T Consensus 434 ~l~~~~~~~q~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~-~~~~~~q~~~~ 489 (510) T protein:vir:78 434 VIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAE-AEEQRRQAAQA 489 (510) T ss_pred HHHHhcChhhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHH-HHHHHHHHHHH Confidence 222222 22466665532 2322222 11111111110000 00000000000 No 242 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=85.22 E-value=0.054 Score=27.51 Aligned_cols=397 Identities=8% Similarity=-0.052 Sum_probs=146.9 Q ss_pred CCchhHHHHh---HHH------HHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccch Q lcl|NC_016762. 1 MTDKLDLAVN---HAM------SSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIV 71 (456) Q Consensus 1 ~~~~~~~~~~---~a~------~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~a 71 (456) |.-|.+.=++ -+. .......+..+.+...++-+... ...-.-.+. ...-++.|-+ +.-+..++++.- T Consensus 1 m~kk~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~-~~~iLr~~~--~~~ly~~m~~-D~hi~s~l~~Rk 76 (448) T protein:vir:77 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDRE-FDELLQGKD--GLLVYHKMLS-DGTVKNALNYIF 76 (448) T ss_pred CCCCCCCCcccCCcccccchhhhhhhccchhhhcccccccccccc-hhHhhcccc--chHHHHHHhh-ChHHHHHHHHHH Confidence 7666544211 000 00111123333332221111111 100010111 1223444444 666667777777 Q ss_pred hHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHH-------HhhHHHHHHHHHHhhcccCceEEEEEe---cCCCCcccc Q lcl|NC_016762. 72 TTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIA-------GGRFWRAVSEADRRRLVGRYSGLLLHI---RDSQPWDRP 141 (456) Q Consensus 72 ed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~-------~l~~~~~~~ea~~~~r~~Ggs~i~i~i---~D~~~~~~P 141 (456) .-.+..-|+|..++++.......+ .++..+. +..+.+.+.++ -.+.+||+|++=+.- .||.-. T Consensus 77 ~av~~~~w~v~p~~~~~~d~~~ae---~v~~~l~~~~~~~~~~~f~~~i~~~-lda~~~G~s~~Eivw~~~~dg~~~--- 149 (448) T protein:vir:77 77 GRIRSAKWYVEPASTDPEDIAIAA---FIHAQLGIDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTLGADGKLI--- 149 (448) T ss_pred HHHhcCCceEecCCCCHHHHHHHH---HHHHHhhchhhhhccCCHHHHHHHH-HHhhhhcceeEEEEEeecCCCcee--- Confidence 666655567753333222222222 2333222 22334444444 468899999875432 123210 Q ss_pred ccCCcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccC------Ccc-ccceeeehhhhheecCCcCCCcc Q lcl|NC_016762. 142 ARGKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQA------GRP-GLVRDIHPDRVFILGDWTGDAIG 214 (456) Q Consensus 142 l~~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~------g~~-~~~~~IH~SRli~~~~~~~~G~S 214 (456) +..|.++. +-++..+.-|+.. .+..........+ +.. -..+-||+.+ - ...+.+|.+ T Consensus 150 -------~~~l~~r~--~~~~~~f~~~~~~----~l~~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~-~--~~g~p~g~g 213 (448) T protein:vir:77 150 -------LDKIVPIH--PFNIDEVLYDEEG----GPKALKLSGEVKGGSQFVNGLEIPIWKTVVFLH-N--DDGSFTGQS 213 (448) T ss_pred -------eccccccC--CCccceeeeecCC----ceEEEecCCcccccccCCCccccccceEEEEec-C--CcCCcccch Confidence 11111111 0000000001100 0000000000000 000 0012344432 1 234678999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhh-HHhh-hc-CCHHHHHHHHHHHHHHHhcCC-CeEEec Q lcl|NC_016762. 215 FLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGE-IAST-YG-VTLDALNERFNEAARQLNRGN-DVLLPT 290 (456) Q Consensus 215 ~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-l~~~-~~-~~~~~~~~~~~~~~~~~~~~~-~~~lid 290 (456) ++..||...+--.....-+++ |...+.+-- +..- .+ ...++..+.+.+++..++.+. ..++|. T Consensus 214 Llr~~~w~~~fK~~~~~~w~~-------------f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP 280 (448) T protein:vir:77 214 ALRAAVPHWLAKRALILLINH-------------GLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILP 280 (448) T ss_pred HHHHHHHHHHHHHhhHHHHHH-------------HHHHcCCceeEEecCCCCCCCHHHHHHHHHHHHHHhcCCceEEEec Confidence 999998744321111111111 111111100 0000 01 112344555556666665443 456777 Q ss_pred CCCceeEEeccc--CCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccc-h-HHH-HHHHHHHHHHHHhhhhHHHH-HHH Q lcl|NC_016762. 291 QGATVTQMVSAV--SDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERAS-S-EDQ-KYHNARCQARRVQELTFEIN-DLF 364 (456) Q Consensus 291 ~~d~~~~~~~~~--sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glns-t-~D~-~nyyd~I~~~Qe~~lrp~L~-~l~ 364 (456) .+.+++-++..- +....+++..-.+||-+.--.. |.-++-||..+ . ++. ....+.+.+.-. .+...|. .|+ T Consensus 281 ~g~~ie~~ea~~~~~~~~~~i~~~d~~Isk~iLGqt--lTs~~~~g~~~~~~~~~~~v~~~~~~aDa~-~i~~tln~~Li 357 (448) T protein:vir:77 281 DDWKFDTVDLKSAMPDAIPYLTYHDAGIARALGIDF--NTVQLNMGVQAVNIGEFVSLTQQTIISLQR-EFASAVNLYLI 357 (448) T ss_pred CCceEEEEecCCCccCHHHHHHHHHHHHHHHHhccc--cccccccchhhhhhhhHHHHHHHHHHHHHH-HHHHHHHHHHH Confidence 888888777653 2345555555556665442211 11112222221 1 232 233444444332 2444454 477 Q ss_pred HHHHHhcCcCCCCceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCCccc--CCC Q lcl|NC_016762. 365 AHLMRIGVVPLKAEFTAIWDDLTVPTKAERLANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPLPDT--EPE 442 (456) Q Consensus 365 ~~l~~s~~~~~~~d~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~~~~--~~~ 442 (456) .-|+...+++...-=.|.|.-.-. .|+ ++.|++...+++ .+|+..+.....+..++... ..+ T Consensus 358 ~~l~~lNfg~~~~~P~~~f~~~e~------eDl-~~~a~~~~~l~~---------~~~~~~~ip~~~~~~~~~~~~~~~~ 421 (448) T protein:vir:77 358 PKLVLPNWPGATRFPRLTFEMEER------NDF-SAAANLMGMLIN---------AVKDSEDIPTELKALIDALPSKMRR 421 (448) T ss_pred HHHHHhcCCCCCCCCEEEecCCCh------hhH-HHHHHHhHHHHH---------HHHHHhcCCccCCcCCCCCchhccc Confidence 777777777543211345532211 222 234555555542 34554444321111111100 000 Q ss_pred CCCCCCcCCCCCCC Q lcl|NC_016762. 443 DEDAARTDPTGEQQ 456 (456) Q Consensus 443 d~~~~~~d~~~~~e 456 (456) -...+++++.+... T Consensus 422 ~~~~~~~~~~~~~~ 435 (448) T protein:vir:77 422 ALGVVDEVREAVRQ 435 (448) T ss_pred ccCCCCCCCchhhc Confidence 00111111111111 No 243 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=83.84 E-value=0.065 Score=27.09 Aligned_cols=409 Identities=12% Similarity=0.007 Sum_probs=166.0 Q ss_pred CCchhHHHHhHHHHHH--HHHHHHHHhhhh-hccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHH--- Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSA--IARARMSLLNQG-IGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTC--- 74 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~--~~~~~d~~~n~~-~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~--- 74 (456) |-+.++..-...-++. .......+.... -.++....+.. . ......| .+.+..+|+..|..+ T Consensus 1 m~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~-~---------~~~~~~~--dst~~~a~~~Laa~l~~~ 68 (555) T protein:vir:17 1 MKHSAQAKYMMLRADREDYLDSGRQSARLTLPYILTDEGHVQ-G---------GYLPTPW--QSVGSKGVNVLASKLMLS 68 (555) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcc-c---------ccccccc--cccHHHHHHHHHHHHHHh Confidence 8777766544332221 111122222221 11111000000 0 0001111 222333444444333 Q ss_pred ----hhCCCEEecCCCcc---------hhhhh---HHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCc Q lcl|NC_016762. 75 ----WKTNPQVIEGDDQD---------RSKDE---TEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPW 138 (456) Q Consensus 75 ----tR~~~~i~~~~~~d---------~~~~~---~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~ 138 (456) .|.||++.-.+... ..... ...++.+...+.+-++...+.++.+.-..+|-+++++.- |+-. T Consensus 69 ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~-~~~~- 146 (555) T protein:vir:17 69 LFPVNTSFFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALLYQGK-KNLK- 146 (555) T ss_pred hcCCCCcccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEecC-Ccee- Confidence 47899986443221 11111 235566777788899999999998887778877766542 3221 Q ss_pred ccccc------CCcCceeEEEEeccccCChhhhh-----------------cccccc----------ccCCcee------ Q lcl|NC_016762. 139 DRPAR------GKLNGLAKVTPAWAGCLKPKSFD-----------------EKPDSE----------TYGQPTM------ 179 (456) Q Consensus 139 ~~Pl~------~~~~~l~~i~~~~~~~~~~~~~~-----------------~Dp~s~----------~yg~P~~------ 179 (456) .-||. ...+.+. .++.+..+++.... .+|..+ +.+.+.. T Consensus 147 ~~pl~~y~v~~d~~G~vd--~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t~ 224 (555) T protein:vir:17 147 LYPLDRFVVSRDGEGNVM--EIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTY 224 (555) T ss_pred EEEcCeEEEeeCCCcCee--EEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcceeEeec Confidence 13552 2233232 23333333333211 011111 0011111 Q ss_pred -------EEEeecccCCccc----cceeeehhhh--hee--cCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_016762. 180 -------WEYTEASQAGRPG----LVRDIHPDRV--FIL--GDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQ 244 (456) Q Consensus 180 -------y~i~~~~~~g~~~----~~~~IH~SRl--i~~--~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~ 244 (456) |.+. ...+|... +...+|..=+ .+| .....+|.|-.+..+..+..++...........+..... T Consensus 225 ~~~~~~~~~~~-~e~~~~~v~~~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp 303 (555) T protein:vir:17 225 VCRKDGQVKWH-QECDGKVIPGSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVV 303 (555) T ss_pred ccccCCeeEEE-EecCceeccccccccCcccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCc Confidence 1110 01111100 0011121111 122 234567999888888888887776655544433321111 Q ss_pred hhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC-CCeEEecCCCceeEEecc----cCCHHHHHHHHHHHHHhh Q lcl|NC_016762. 245 LLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRG-NDVLLPTQGATVTQMVSA----VSDPGPTYNVNLQTAAAG 319 (456) Q Consensus 245 l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~lid~~d~~~~~~~~----~sgl~~~~~~~~~~~aaa 319 (456) +.+ . .+..... ..+..+ .+..+-+..+++..+... |.-....+....+.|.-+ T Consensus 304 ~lv------------~-----~~g~~~~-----~~l~~~~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~a 361 (555) T protein:vir:17 304 FMV------------S-----PSATTKP-----QNLALAANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDA 361 (555) T ss_pred eee------------c-----cccccCc-----ceeecCCCceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHH Confidence 100 0 0000111 111222 233333334556555543 334445556556666544 Q ss_pred hcCCeEEeeccCCCcccchH------H-HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCC-c---eEEEeCCCCC Q lcl|NC_016762. 320 VDIPTKILVGMQTGERASSE------D-QKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKA-E---FTAIWDDLTV 388 (456) Q Consensus 320 s~IP~t~L~G~sp~Glnst~------D-~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~-d---~~~~f~pL~~ 388 (456) .-+ +--+....+.+++ + ....--.++..|...|.|.|++.+.+|.+.+..|+++ + .++. .+|.. T Consensus 362 Fm~----~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~v~~~i~-~~l~~ 436 (555) T protein:vir:17 362 FLM----LQVRQSERTTATEVQATVQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRKLPQLPKDLVQPTVV-AGLWG 436 (555) T ss_pred Hhh----cCCCCcccchHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCHhhhcccee-ehHHH Confidence 322 1123344455664 2 2346667777888899999999999999998875432 2 2222 23332 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHcC-C----cCcCHHHHHH-HhcccCCC-CCCCC-cccCCCCC---------------- Q lcl|NC_016762. 389 PTKAERLANSKTMSEINSAAIGTG-E----PVFTAEEIRE-EAGYDPLQ-GGDPL-PDTEPEDE---------------- 444 (456) Q Consensus 389 ~seke~Aei~~~~A~a~~~~~~~g-~----~~i~~~E~R~-~~~~~~~~-~~~~~-~~~~~~d~---------------- 444 (456) +.... ++.+.. +......+.+ . ..|+.+++-+ ....-+.. ....- +++....- T Consensus 437 l~r~~--~~~~l~-~~~~~laq~~~~p~~~d~id~d~~~~~~a~~~Gv~p~~ivrs~eev~~~rq~~~~~~~q~~~~~qa 513 (555) T protein:vir:17 437 VGRGQ--DKQQLM-EFITTLAQTMGPEIAMKYINPTEFIKRLAAAQGIDTLQLINSPETMKQLGDQQKQDMVQASLINQA 513 (555) T ss_pred HHHHH--HHHHHH-HHHHHHHhhcCchhHhhcCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222 222221 1122222221 1 1355555432 22222221 11111 11100000 Q ss_pred CCCCcCC--------------CCCCC Q lcl|NC_016762. 445 DAARTDP--------------TGEQQ 456 (456) Q Consensus 445 ~~~~~d~--------------~~~~e 456 (456) ....+.+ ++++. T Consensus 514 ~~~~~~~~~~~~~~~~~~~~~~a~~~ 539 (555) T protein:vir:17 514 GQLAKTPMAEQAMQLIQQQQEGAQDA 539 (555) T ss_pred HHHHhhhhhhhHHhccccchhhhhHH Confidence 0000000 00000 No 244 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=79.99 E-value=0.099 Score=26.09 Aligned_cols=408 Identities=12% Similarity=0.048 Sum_probs=167.1 Q ss_pred CCchhHHHHhHHHHHH---HHHHHHHHhhhhh-ccC--------cccchhhhhccCcccCCHHHHHHHHhcCchhhhhhc Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSA---IARARMSLLNQGI-GHD--------AKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVE 68 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~---~~~~~d~~~n~~~-~~g--------t~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd 68 (456) |-.-.+..-. .++.. .......+..... .++ ....+.|..++ .++..-. .++|...+ T Consensus 1 mk~~a~~r~~-~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dstg-------~~a~~~L-aa~l~~~l-- 69 (542) T protein:vir:78 1 MKGLAQARYS-AMRADREDFLDMARRCAALTLPYLLTEDGHASGGRLQQPYQSLG-------SKGVNAL-SSKLMLSL-- 69 (542) T ss_pred ChhHHHHHHH-HHHHHhhHHHHHHHHHHHHhccccCCCCCCcccccccccccchH-------HHHHHHH-HHHHHHhh-- Confidence 6532222222 22222 2222233333211 111 11112222222 1111111 12222222 Q ss_pred cchhHHhhCCCEEecCCCcc-------------hhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCC Q lcl|NC_016762. 69 KIVTTCWKTNPQVIEGDDQD-------------RSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDS 135 (456) Q Consensus 69 ~~aed~tR~~~~i~~~~~~d-------------~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~ 135 (456) +|+ .|.||++.-.+... ...-....|+.+.+.+.+.++...+.++.+.-..+|.+++++.- |+ T Consensus 70 tpp---~~~WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~-~~ 145 (542) T protein:vir:78 70 FPI---QTSFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFAGK-KT 145 (542) T ss_pred cCC---CCccccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEecC-CC Confidence 122 47798885332110 11112334567778888899999999999997888887776542 21 Q ss_pred CCcccccc------CCcCceeEEEEeccccCChhhhhc----cccc----------c--cc--------------C---- Q lcl|NC_016762. 136 QPWDRPAR------GKLNGLAKVTPAWAGCLKPKSFDE----KPDS----------E--TY--------------G---- 175 (456) Q Consensus 136 ~~~~~Pl~------~~~~~l~~i~~~~~~~~~~~~~~~----Dp~s----------~--~y--------------g---- 175 (456) --.-|+. ...+.+ .+++.+..+++.+... +-.+ | .| + T Consensus 146 -~~~~pl~~y~v~~d~~G~v--d~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~~ 222 (542) T protein:vir:78 146 -LKVYPLDRYVIERDGDGNV--IEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCK 222 (542) T ss_pred -ceEEecceeEEeeCCCCCe--EEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccccccc Confidence 1123442 222222 2244444444332211 1000 0 00 0 Q ss_pred --CceeEEEeecccCCcc----ccceeeehhhhh--ee--cCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_016762. 176 --QPTMWEYTEASQAGRP----GLVRDIHPDRVF--IL--GDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQL 245 (456) Q Consensus 176 --~P~~y~i~~~~~~g~~----~~~~~IH~SRli--~~--~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l 245 (456) .| +|.+- ..++|.. .....+|..=++ || ..+..+|.|-.+..+..+..++...........++....+ T Consensus 223 ~~~~-~~s~~-~e~~g~~v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~ 300 (542) T protein:vir:78 223 LVDG-QHRWH-QECDGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVF 300 (542) T ss_pred cCCC-eEEEE-EEeccccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 01 11111 0111110 001112222221 22 2344679998888888888877766555444333211111 Q ss_pred hhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhc-CCCeEEecCCCceeEEec----ccCCHHHHHHHHHHHHHhhh Q lcl|NC_016762. 246 LLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNR-GNDVLLPTQGATVTQMVS----AVSDPGPTYNVNLQTAAAGV 320 (456) Q Consensus 246 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~lid~~d~~~~~~~----~~sgl~~~~~~~~~~~aaas 320 (456) +.. .+...+. ..+.. +.+..+-+..+++..+.. .|.-....+....+.|.-+. T Consensus 301 ------------lv~-----~~g~~~~-----~~~~~~~~g~iv~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aF 358 (542) T protein:vir:78 301 ------------MVS-----PSATTKP-----QSLARAGTGAIIQGRAEDVSVVQANKGADFRTVQEMIRDLSQRISDAF 358 (542) T ss_pred ------------eec-----cccccch-----hhcccCCCceeecCCccceeeeecccccchhHHHHHHHHHHHHHHHHh Confidence 100 0011111 11222 233333344456654432 45556677777777777664 Q ss_pred cCCeEEeec--cCCCcccchH------H-HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCC--ceEEEe-CCCCC Q lcl|NC_016762. 321 DIPTKILVG--MQTGERASSE------D-QKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKA--EFTAIW-DDLTV 388 (456) Q Consensus 321 ~IP~t~L~G--~sp~Glnst~------D-~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~--d~~~~f-~pL~~ 388 (456) |+. +....+.+++ + ....--.++..|...|.|.|++.+.+|.+.+..|+++ -++++| .||.+ T Consensus 359 ------l~~~~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~lv~~~~~s~La~ 432 (542) T protein:vir:78 359 ------LILNVRQSERTTATEVREVQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSLPKGLVMPTVVAGLGG 432 (542) T ss_pred ------cccccCCcccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhceeeeeechHHH Confidence 222 2333345664 2 2345566677788899999999999999999886543 244443 34433 Q ss_pred CCHHHHHHHHHHHHHHHHHHH--------------------HcCCc----CcCHHHHHHH-------h------cccCC- Q lcl|NC_016762. 389 PTKAERLANSKTMSEINSAAI--------------------GTGEP----VFTAEEIREE-------A------GYDPL- 430 (456) Q Consensus 389 ~seke~Aei~~~~A~a~~~~~--------------------~~g~~----~i~~~E~R~~-------~------~~~~~- 430 (456) +...+.++.-..-+++....+ ..|.+ +-+++|+... . ..-+. T Consensus 433 ~~r~~~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~~q~q~~~~~~al~~~a~~~ 512 (542) T protein:vir:78 433 VGRGEDRAALIEFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQQQMTASLMGQAGQL 512 (542) T ss_pred HHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHhhhhc Confidence 333222222111112111110 11211 1122222110 0 00000 Q ss_pred ---CCCCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 431 ---QGGDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 431 ---~~~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) .-+........++.+..+..|..-++ T Consensus 513 a~~~~~~~~~~~~~a~~~~~~~~~~~~~~ 541 (542) T protein:vir:78 513 AKSPIGEKMMQQINAPGQEAPAGPQTGED 541 (542) T ss_pred cccccccchhhhcCCCCcCCCCCCccccc Confidence 01111112222222222323322222 No 245 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=79.07 E-value=0.11 Score=25.88 Aligned_cols=400 Identities=12% Similarity=0.037 Sum_probs=166.4 Q ss_pred CC---chhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCccc----CC--HHHHHHHHhcC----chhhhhh Q lcl|NC_016762. 1 MT---DKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQE----IT--FNDLYTMYRRG----GIAHGAV 67 (456) Q Consensus 1 ~~---~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~----~~--~~~l~~~Y~~~----~l~r~iV 67 (456) |+ .+|..-..++..+.+ .+|.+ .|+..-+...+-+-|+. -+ -.+.++.|... .+.+++| T Consensus 1 m~~V~~~hp~y~~~~~~W~~--ird~~------~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~ 72 (501) T protein:vir:95 1 MPNVSFIRPELGKLLPLYYL--IRDAI------AGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTL 72 (501) T ss_pred CCCCCCCCHHHHHHHHHHHH--HHHHh------cChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHH Confidence 66 677776555555544 45665 24432222222222211 00 01234444433 5566777 Q ss_pred ccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHh-----hHHHHHHHHHHhhcccCceEEEEEecC--CCC-cc Q lcl|NC_016762. 68 EKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGG-----RFWRAVSEADRRRLVGRYSGLLLHIRD--SQP-WD 139 (456) Q Consensus 68 d~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l-----~~~~~~~ea~~~~r~~Ggs~i~i~i~D--~~~-~~ 139 (456) +...--.+|+.+++. -+ ..++.+++.. ++.+.++.+.+....+|.++|+++... ++. .+ T Consensus 73 ~~l~G~vf~k~p~~~---~p----------~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t 139 (501) T protein:vir:95 73 FGLVGQVFMRDPVVK---VP----------ALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGAS 139 (501) T ss_pred HHHhhhhhcCCccee---Cc----------HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCccccc Confidence 777777788888772 11 1244444433 577778888888888999999987631 110 00 Q ss_pred ---------ccc-------c---------CCcCceeEEEEeccccCChhhhhccccccc--------------cCCceeE Q lcl|NC_016762. 140 ---------RPA-------R---------GKLNGLAKVTPAWAGCLKPKSFDEKPDSET--------------YGQPTMW 180 (456) Q Consensus 140 ---------~Pl-------~---------~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~--------------yg~P~~y 180 (456) -|- . ++.+.|.. ++++-.... ..|+++.. ++.-+.| T Consensus 140 ~a~~~~~~~rPy~~~~~~~~IinW~~~~v~g~~~l~~--v~l~E~~~~---~d~~f~~~~~~q~RvL~~~~~g~~~~~v~ 214 (501) T protein:vir:95 140 IADLEAGRIRPTLYVYSPTEIINWRTTDRGAEEVLSL--VVLFETWCA---ADDGFEMKTSGQFRVLRLDEEGYYVHEIW 214 (501) T ss_pred HHHHHhccCCcEEEEecHhhhcCcceeccCCceeeeE--EEEEEEEee---cCCCcccceeEEEEEEeeCCCceEEEEEE Confidence 021 0 11112222 222111100 01122221 1111222 Q ss_pred EEeeccc-C------Cccc--cceee-----ehhhhheec----CCcCC--C-cchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 181 EYTEASQ-A------GRPG--LVRDI-----HPDRVFILG----DWTGD--A-IGFLEPAYNSFISLEKVEGGSGESFLK 239 (456) Q Consensus 181 ~i~~~~~-~------g~~~--~~~~I-----H~SRli~~~----~~~~~--G-~S~le~~~~~l~~~~~~~~~~~~~~~~ 239 (456) +-..... . |... ..... |.=..|.|. ..+.+ | -+++..++=.+..|...+. .-++++. T Consensus 215 r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~lA~lni~hy~~ssd-~~~~l~~ 293 (501) T protein:vir:95 215 REPQPTKADGSKIPKGNYQQYVVYKPTDAQGKRLTEIPFMFIGSENNDSNPDNPNFYDLASLNMAHYRNSAD-YEESCYI 293 (501) T ss_pred EecCCcccCcceecCCcccccceeeeeccCCCcCCeeeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhH-HHHHHHH Confidence 2211000 0 0000 00111 111123332 12222 2 2444444323333322221 2233333 Q ss_pred HhhhhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCCceeEEecccCCH-HHHHHHHHHHHHh Q lcl|NC_016762. 240 NAARQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGATVTQMVSAVSDP-GPTYNVNLQTAAA 318 (456) Q Consensus 240 ~~~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d~~~~~~~~~sgl-~~~~~~~~~~~aa 318 (456) .+..++.++....-. ...+.. . -+ .+ .......+.++.++..++.+-+++ ...++...+++.. T Consensus 294 ~~~P~l~i~G~~~~~-----~~~~~~------~---~i-~~-G~~~~~~lP~~~~~~~ie~~~~~i~~~~l~~l~~~m~~ 357 (501) T protein:vir:95 294 VGQPTPVLIGLTEEW-----VTNVLK------G---SV-NF-GSRGGIPLPVGADAKLLQASENTMLKEAMDTKERQMVA 357 (501) T ss_pred cccceeeeeCCcccc-----cccCCC------C---ce-ee-cccccccCCCCCceeEEecChhhHHHHHHHHHHHHHHH Confidence 333333322110000 000000 0 00 00 112233445666777777665565 3344444444433 Q ss_pred hhcCCeEEeeccCCCcccchH---HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCCceEEEeCCCCCCCHHHHH Q lcl|NC_016762. 319 GVDIPTKILVGMQTGERASSE---DQKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKAEFTAIWDDLTVPTKAERL 395 (456) Q Consensus 319 as~IP~t~L~G~sp~Glnst~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~d~~~~f~pL~~~seke~A 395 (456) +-- +|+-+.++..++++ |...=+..++++-. .+...|++++++..++... .+.+.+|+.|+-......+.+ T Consensus 358 ~Ga----~ll~~~~~~~Ta~~~~~~~~~~~S~L~~~a~-~le~al~~~l~~~a~w~g~-~~~~~~v~i~~df~~~~~~~~ 431 (501) T protein:vir:95 358 LGA----KLVEQKEVQRTATEAELEAASEGSTLSSATK-NVSAAFEWALKWAARWVGQ-ADSGVKFELNTDFDIARMTPD 431 (501) T ss_pred HHH----hhccCCccchhHHHHHHHHHHHhHHHHHHHH-HHHHHHHHHHHHHHHHcCC-CCCceEEEEecccccccCCHH Confidence 321 23334445555543 33333444554443 4688899999988877443 345677777776644433332 Q ss_pred HHHHHHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCCCCCC---cccCC-------------CCCCCCCcCCCCCCC Q lcl|NC_016762. 396 ANSKTMSEINSAAIGTGEPVFTAEEIREEAGYDPLQGGDPL---PDTEP-------------EDEDAARTDPTGEQQ 456 (456) Q Consensus 396 ei~~~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~~~~~---~~~~~-------------~d~~~~~~d~~~~~e 456 (456) + +++...+...| .|+.+.+++.+..-++.+.+.. +.+++ .....+.++.-+..| T Consensus 432 ~-----~~al~~~~~~G--~is~~t~~~~L~~~~v~~~~~~~e~e~i~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~ 501 (501) T protein:vir:95 432 E-----RRSLVEEWQKG--AITFEEMRTGLRKAGVATEDDSKAKEKIAKDTAEAMALATPANVPGDGSGGDNVGNSE 501 (501) T ss_pred H-----HHHHHHHHhCC--CCcHHHHHHHHHhCCCCChhHHHHHHHHHhhhcCcccccccCCCCCCCcccccccCCC Confidence 2 34445556666 7777777665433222211000 00000 001112222333334 No 246 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=76.06 E-value=0.14 Score=25.27 Aligned_cols=417 Identities=9% Similarity=-0.019 Sum_probs=173.8 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhh-hccCcccchhhhhccCcccCCHHHHHHHHhcCchhhhhhccchhHH----- Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQG-IGHDAKRPQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVEKIVTTC----- 74 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~-~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd~~aed~----- 74 (456) |.+|-.......-+......+..+.... -.++..-......- ..+...| ...+.+.|+..|..+ T Consensus 1 m~~~~~~l~~k~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~--------~~~~~~~--dstg~~a~~~LAa~l~~~lt 70 (514) T protein:vir:80 1 MRQQASAMWAEYRDSTAIRKAEDFAKFTIASLMVDPLDKTHQA--------EVVEYDF--QSAGAFLVNNLTAKLALTLF 70 (514) T ss_pred CccchHHHHHHhhcchHHHHHHHHHHHhcccccCCCCCCcccc--------ccccccc--chhHHHHHHHHHHHHHhhhc Confidence 8888776644333333222223333321 11111000000000 0011112 223333444444333 Q ss_pred --hhCCCEEecCCCcc------------hhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccc Q lcl|NC_016762. 75 --WKTNPQVIEGDDQD------------RSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDR 140 (456) Q Consensus 75 --tR~~~~i~~~~~~d------------~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~ 140 (456) .|.||++.-+++.. -.+--...|+.+.+.+.+-++...+.++.+.--.+|.+.+++.-+.+.--.- T Consensus 71 pp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~ 150 (514) T protein:vir:80 71 PPGRPSFQIELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYREPGTGKMLVW 150 (514) T ss_pred CCCCcccccccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEecCCCcEEEE Confidence 47899985433211 1122223455677777888999999999998888887777764321111123 Q ss_pred ccc------CCcCceeEEEEeccccCChhhhhcccc--------cc-ccCCceeEEEeecccCCccccceeeehh----h Q lcl|NC_016762. 141 PAR------GKLNGLAKVTPAWAGCLKPKSFDEKPD--------SE-TYGQPTMWEYTEASQAGRPGLVRDIHPD----R 201 (456) Q Consensus 141 Pl~------~~~~~l~~i~~~~~~~~~~~~~~~Dp~--------s~-~yg~P~~y~i~~~~~~g~~~~~~~IH~S----R 201 (456) |+. ...+.+. ..+.+..+++.++..+-. .+ .+.+-+.|..-.-..+ ......-||.+ + T Consensus 151 pl~~y~v~~d~~G~v~--~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~-~~~~~~sv~~e~~g~~ 227 (514) T protein:vir:80 151 TMQSYTVRRTSHGDPA--VVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTVIEWQPT-PNGKRCAVWHELEGKR 227 (514) T ss_pred EcCeEEEeeCCCcCeE--EEEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEEEEeecC-CCCeEEEEEEecccee Confidence 552 2333332 233333344433211110 00 1111122211100000 00001112221 1 Q ss_pred h---------------hee--cCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHhhhcC Q lcl|NC_016762. 202 V---------------FIL--GDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIASTYGV 264 (456) Q Consensus 202 l---------------i~~--~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~ 264 (456) + .|| ..+..+|.|-.+.++..+..++...... ++.+.......+ +... T Consensus 228 i~~es~y~~~e~P~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~~~~----l~~~~~a~~~~~--------~v~~--- 292 (514) T protein:vir:80 228 VGPESSYPAHLCPYVPVAWNVPDGEHYGRGYVEEYSGDFARLSILSERL----GLYEFEALSLLN--------LVDE--- 292 (514) T ss_pred ecccCccccccCCeeeeeeEecCCCCcccchHHHHHHHHHHHHHHHHHH----HHHHHHhcCCCc--------eeCc--- Confidence 1 111 1233569998888888887776554433 332211111111 1100 Q ss_pred CHHHHHHHHHHHHHHHhcCC-CeEEecCCCceeEEec----ccCCHHHHHHHHHHHHHhhhcCCeEEeeccCCCcccchH Q lcl|NC_016762. 265 TLDALNERFNEAARQLNRGN-DVLLPTQGATVTQMVS----AVSDPGPTYNVNLQTAAAGVDIPTKILVGMQTGERASSE 339 (456) Q Consensus 265 ~~~~~~~~~~~~~~~~~~~~-~~~lid~~d~~~~~~~----~~sgl~~~~~~~~~~~aaas~IP~t~L~G~sp~Glnst~ 339 (456) +... ....++.+- +..+-+..+++..++. .|.-....++...+.|.-+. -++.. .+....+.+++ T Consensus 293 --~g~~-----~~~~l~~~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF--ml~~~-~rd~~rvTAtE 362 (514) T protein:vir:80 293 --AKGG-----AVDDYRDAETGDFVPGQVGSVASYERGDYNKIAQASASVESIVMRLNRAF--MYTGQ-VRDAERVTVEE 362 (514) T ss_pred --cccc-----chhhhcccCCceeecCCCccceeeecCcccchHHHHHHHHHHHHHHHHHH--hhhcc-CCCCCCCCHHH Confidence 0001 111222322 3333334455655543 34444566777777776553 11211 12333367764 Q ss_pred ------H-HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhc--CcC-CCC-ceEEEe-CCCCCCCHHHHHHHHHHHHHHHHH Q lcl|NC_016762. 340 ------D-QKYHNARCQARRVQELTFEINDLFAHLMRIG--VVP-LKA-EFTAIW-DDLTVPTKAERLANSKTMSEINSA 407 (456) Q Consensus 340 ------D-~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~--~~~-~~~-d~~~~f-~pL~~~seke~Aei~~~~A~a~~~ 407 (456) + ....--.++..|...|.|.+++.+.++.+.. ..| +|. -+.+++ .+|.++.-...++.-...++.... T Consensus 363 V~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~~~g~lP~~p~~l~~~~~vs~la~l~r~~~~~~l~~~~~~i~~ 442 (514) T protein:vir:80 363 IRTVAEEAENLLGGVYSLLAETLQAPLAYLTMYEASRGNGGMLLGIAQGVYRPSIITGIPALTRNIETANILRATQEASA 442 (514) T ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCchhhcceeeecHHHHHHHHHHHHHHHHHHHHHH Confidence 2 2234455677888899999999999987753 333 222 233333 345555555544443334444433 Q ss_pred HHHcCC---cCcCHHHH-HHHhcccCCCC-CCCCcccCCCCCCCCCcCCCCCCC Q lcl|NC_016762. 408 AIGTGE---PVFTAEEI-REEAGYDPLQG-GDPLPDTEPEDEDAARTDPTGEQQ 456 (456) Q Consensus 408 ~~~~g~---~~i~~~E~-R~~~~~~~~~~-~~~~~~~~~~d~~~~~~d~~~~~e 456 (456) ...... ..|+.+++ +..+..-+.+. .....+++..- ..++..-.++++ T Consensus 443 l~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~i~~~~e~~~~-~~~~~~~~~~~~ 495 (514) T protein:vir:80 443 IVPALVQLSKRFDPEKLVERIFANNSVDLSTLSKDPDVVAA-EAEQEAALAQQQ 495 (514) T ss_pred HhccchhhhhcCCHHHHHHHHHHHhCCCHhhccCCHHHHHH-HHHHHHHHHHHH Confidence 333221 24666665 33344444432 12211111000 000000000000 No 247 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=64.68 E-value=0.3 Score=23.48 Aligned_cols=416 Identities=10% Similarity=0.037 Sum_probs=172.2 Q ss_pred CCchh-HHHHh------HHHHHHH---HHHHHHHhhhh-hccCccc-chhhhhccCcccCCHHHHHHHHhcCchhhhhhc Q lcl|NC_016762. 1 MTDKL-DLAVN------HAMSSAI---ARARMSLLNQG-IGHDAKR-PQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVE 68 (456) Q Consensus 1 ~~~~~-~~~~~------~a~~~~~---~~~~d~~~n~~-~~~gt~~-~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd 68 (456) |.++. -++.. ..++..+ ......+.... -.+.... +.......-+-.-+-.++..-. .++|...+. T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~L-aa~l~~~lt- 78 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNL-ASKLMLALF- 78 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHH-HHHHHhhhc- Confidence 99833 11111 1111111 11111222211 0111000 0000000000000112222222 245555443 Q ss_pred cchhHHhhCCCEEecCCCcch------------hhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCC Q lcl|NC_016762. 69 KIVTTCWKTNPQVIEGDDQDR------------SKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQ 136 (456) Q Consensus 69 ~~aed~tR~~~~i~~~~~~d~------------~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~ 136 (456) |+ |.||++.-.+..-. .+-....++.+.+.+.+-++...+.++.+.-..+|-+++++.-+.++ T Consensus 79 -P~----~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~ 153 (536) T protein:vir:10 79 -PM----QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGS 153 (536) T ss_pred -CC----CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCC Confidence 64 56999853332210 11133346678888889999999999988877888887777543332 Q ss_pred C---c-ccccc------CCcCceeEEEEeccccCChhhhh----ccccc-----cccCCceeEEEe-ecccCCcccccee Q lcl|NC_016762. 137 P---W-DRPAR------GKLNGLAKVTPAWAGCLKPKSFD----EKPDS-----ETYGQPTMWEYT-EASQAGRPGLVRD 196 (456) Q Consensus 137 ~---~-~~Pl~------~~~~~l~~i~~~~~~~~~~~~~~----~Dp~s-----~~yg~P~~y~i~-~~~~~g~~~~~~~ 196 (456) . + .-||. ...+.+ ..++.+..+++.+.. .+-.+ ..+-..+.|+.- ...-++ .+. T Consensus 154 ~~~~~~~~pl~~~~v~~d~~G~v--d~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~~~~----~~~ 227 (536) T protein:vir:10 154 NYNPMKLYRLSSYVVQRDAFGNV--LQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEASG----EYL 227 (536) T ss_pred ceeeEEEEEcCeEEEeeCCCCCe--eEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEEecCCC----cEE Confidence 1 1 24552 233333 233444445533221 11110 011122233211 100011 111 Q ss_pred eehh----hh----------------hee--cCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhcc Q lcl|NC_016762. 197 IHPD----RV----------------FIL--GDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEIN 254 (456) Q Consensus 197 IH~S----Rl----------------i~~--~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 254 (456) +|+. ++ .+| ..+..+|.|-.+..+..+..++...........+. .... T Consensus 228 ~~~e~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a----~~~~------ 297 (536) T protein:vir:10 228 RYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMIS----SKVI------ 297 (536) T ss_pred EEEeecCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHH----hcCC------ Confidence 1111 11 111 12345799988888888887776654443322211 1100 Q ss_pred HhhHHhhhcCCHHHHHHHHHHHHHHHhcCC-CeEEecCCCceeEE----ecccCCHHHHHHHHHHHHHhhhcCCeEEeec Q lcl|NC_016762. 255 LGEIASTYGVTLDALNERFNEAARQLNRGN-DVLLPTQGATVTQM----VSAVSDPGPTYNVNLQTAAAGVDIPTKILVG 329 (456) Q Consensus 255 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~lid~~d~~~~~----~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G 329 (456) -+... ..... ...+..+. +..+-+..+++..+ ...|.-....+....+.|.-+.=+ .-|.- T Consensus 298 --~lv~p-----~g~~~-----~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~--~~l~~ 363 (536) T protein:vir:10 298 --GLVNP-----AGITQ-----PRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML--NSAVQ 363 (536) T ss_pred --cccCc-----ccccc-----hhhhccCCCcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhh--hhccc Confidence 01100 01111 11222322 32333334555433 334555667777777777666521 12222 Q ss_pred cCCCcccchH------HH-HHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCC--ceEEEe-CCCCCCCHHHHHHHHH Q lcl|NC_016762. 330 MQTGERASSE------DQ-KYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKA--EFTAIW-DDLTVPTKAERLANSK 399 (456) Q Consensus 330 ~sp~Glnst~------D~-~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~--d~~~~f-~pL~~~seke~Aei~~ 399 (456) .....+.+++ +. ...--.++..|...|.|.|++++.+|.+.+..|+++ .+.+++ .||.++.-.+.+ . T Consensus 364 ~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~---~ 440 (536) T protein:vir:10 364 RTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDL---D 440 (536) T ss_pred CCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHH---H Confidence 3445567774 22 335556677888899999999999999998876432 223333 355444333332 2 Q ss_pred HHHHHHHHHHHcCC----cCcCHHHHHH-HhcccCC-CCCCC-CcccCCCCCCC-------------------------- Q lcl|NC_016762. 400 TMSEINSAAIGTGE----PVFTAEEIRE-EAGYDPL-QGGDP-LPDTEPEDEDA-------------------------- 446 (456) Q Consensus 400 ~~A~a~~~~~~~g~----~~i~~~E~R~-~~~~~~~-~~~~~-~~~~~~~d~~~-------------------------- 446 (456) +.........+.+- +.|+.+++-+ .+..-+. ..... .+++...--+. T Consensus 441 ~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~ 520 (536) T protein:vir:10 441 KLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASP 520 (536) T ss_pred HHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Confidence 23333333333331 1356665533 2222232 11111 11100000000 Q ss_pred ----CCcCCCCCCC Q lcl|NC_016762. 447 ----ARTDPTGEQQ 456 (456) Q Consensus 447 ----~~~d~~~~~e 456 (456) .-.+..+.++ T Consensus 521 ~~~~~~~~~~g~~~ 534 (536) T protein:vir:10 521 EAMAAAADSVGLQP 534 (536) T ss_pred hhHHhhhhccccCC Confidence 0000000000 No 248 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=62.39 E-value=0.34 Score=23.18 Aligned_cols=406 Identities=8% Similarity=0.009 Sum_probs=173.4 Q ss_pred CCchhHHHHhHHH------HHHH---HHHHHHHhhhh-hccCc--------ccchhhhhccCcccCCHHHHHHHHhcCch Q lcl|NC_016762. 1 MTDKLDLAVNHAM------SSAI---ARARMSLLNQG-IGHDA--------KRPQAWCEYGFPQEITFNDLYTMYRRGGI 62 (456) Q Consensus 1 ~~~~~~~~~~~a~------~~~~---~~~~d~~~n~~-~~~gt--------~~~~~~~~~~~~~~~~~~~l~~~Y~~~~l 62 (456) |.++.-.|...+. +..+ ......+.... -.+.. .+.+.|..+ -.++..-. .++| T Consensus 1 ~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst-------~~~a~~~L-as~l 72 (522) T protein:vir:94 1 MAEREGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPWQAV-------GARCLNNL-AAKL 72 (522) T ss_pred CcccchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccccccccccccc-------HHHHHHHH-HHHH Confidence 8887776544321 1111 11111122211 01110 001111111 11222222 2444 Q ss_pred hhhhhccchhHHhhCCCEEecCCC---------cc---hhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEE Q lcl|NC_016762. 63 AHGAVEKIVTTCWKTNPQVIEGDD---------QD---RSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLL 130 (456) Q Consensus 63 ~r~iVd~~aed~tR~~~~i~~~~~---------~d---~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i 130 (456) ...+ .|+ |.||++.-.+. .+ ..+.-...++.+.+.+.+-++...+.++.+.-..+|-+++++ T Consensus 73 ~~~l--tP~----~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~ 146 (522) T protein:vir:94 73 MLAL--FPQ----SPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYI 146 (522) T ss_pred Hhhc--CCC----CcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEee Confidence 4444 253 57999853321 11 122234456677788888999999999988877888877776 Q ss_pred EecC-CCC--c-ccccc------CCcCceeEEEEeccccCChhh--------hhccccccccCCceeEEEeecc------ Q lcl|NC_016762. 131 HIRD-SQP--W-DRPAR------GKLNGLAKVTPAWAGCLKPKS--------FDEKPDSETYGQPTMWEYTEAS------ 186 (456) Q Consensus 131 ~i~D-~~~--~-~~Pl~------~~~~~l~~i~~~~~~~~~~~~--------~~~Dp~s~~yg~P~~y~i~~~~------ 186 (456) .-.. +.. + .-|+. ...+.+. .++.+..++... ++.|...|+ -.-+.|+.-... T Consensus 147 ~~~~~~~~~~~~~~pl~~y~v~~d~~G~vd--~i~r~~~~~~~~l~~~~~~~~~~~~~~p~-~~v~v~~~v~~~~~~~~~ 223 (522) T protein:vir:94 147 PEPEQGTYSPMRMYRLVSYVVQRDAFGNIL--QIVTIDKVAFSALPEDVKSQLNADDYEPD-TELEVYTHIYRQDDEYLR 223 (522) T ss_pred eccCCCceeeEEEEEcceEEEeeCCCcCeE--EEeeeeeccHHhcchHHHHHHhcccCCcc-ceEEEEEEEEeeCCceeE Confidence 4321 111 1 24652 2223232 233333333221 122222221 111222211000 Q ss_pred ---cCCcc----ccceeeehhhh--hee--cCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccH Q lcl|NC_016762. 187 ---QAGRP----GLVRDIHPDRV--FIL--GDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINL 255 (456) Q Consensus 187 ---~~g~~----~~~~~IH~SRl--i~~--~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 255 (456) ..|.. ......|..=+ .+| .....+|.|-.+.++..+..++...........++....+ T Consensus 224 ~~~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~---------- 293 (522) T protein:vir:94 224 YEEVEGIEVTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVG---------- 293 (522) T ss_pred EeeccCceecccCCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce---------- Confidence 00000 00001121111 111 1234579999999988888877776655544433221111 Q ss_pred hhHHhhhcCCHHHHHHHHHHHHHHHhc-CCCeEEecCCCceeEEe----cccCCHHHHHHHHHHHHHhhhcCCeEEeec- Q lcl|NC_016762. 256 GEIASTYGVTLDALNERFNEAARQLNR-GNDVLLPTQGATVTQMV----SAVSDPGPTYNVNLQTAAAGVDIPTKILVG- 329 (456) Q Consensus 256 ~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~lid~~d~~~~~~----~~~sgl~~~~~~~~~~~aaas~IP~t~L~G- 329 (456) +... +...+. ..+.. +.+..+-+..+++..+. ..|.-....+....+.|..+.=+ . .++ T Consensus 294 --~v~~-----~g~~~~-----~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~--~-~~~~ 358 (522) T protein:vir:94 294 --LVNP-----NGITQP-----RRLNKAATGEFVAGRVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLL--N-SAVQ 358 (522) T ss_pred --eecc-----cccccc-----hheeccCCceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhh--h-hhcc Confidence 0000 000111 11122 22333334445555443 24555566777777777766522 1 233 Q ss_pred cCCCcccchH------H-HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCC--CceEEEeC-CCCCCCHHHHHHHHH Q lcl|NC_016762. 330 MQTGERASSE------D-QKYHNARCQARRVQELTFEINDLFAHLMRIGVVPLK--AEFTAIWD-DLTVPTKAERLANSK 399 (456) Q Consensus 330 ~sp~Glnst~------D-~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~--~d~~~~f~-pL~~~seke~Aei~~ 399 (456) .....+.+++ + ....--.++..|...|.|.|++.+.+|.+.+..|++ +.+.++|- ||.++ .|+.-.. T Consensus 359 ~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~v~~~s~La~~---qr~~~~~ 435 (522) T protein:vir:94 359 RNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIPDLPKEAVEPTVSTGLEAL---GRGQDLE 435 (522) T ss_pred CCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCcccEEeeEecHHHHH---HHHHHHH Confidence 3444567774 2 245667777788999999999999999999887543 34555553 33333 2222222 Q ss_pred HHHHHHHHHHHcCC----cCcCHHHHH-HHhcccCCC-CCCCCcccCCCCCCCCCcC-CCCCCC Q lcl|NC_016762. 400 TMSEINSAAIGTGE----PVFTAEEIR-EEAGYDPLQ-GGDPLPDTEPEDEDAARTD-PTGEQQ 456 (456) Q Consensus 400 ~~A~a~~~~~~~g~----~~i~~~E~R-~~~~~~~~~-~~~~~~~~~~~d~~~~~~d-~~~~~e 456 (456) +.-+......+.+- ..|+.+++- .....-+.+ ..+.-. +++...-- +...+| T Consensus 436 ~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~~~ivr~-----~ee~~~~~~q~~~~~ 494 (522) T protein:vir:94 436 KLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNALGIDTAGLLLT-----QDEKIQRMAEQSSQQ 494 (522) T ss_pred HHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHcCCChhhccCC-----HHHHHHHHHHHHHHH Confidence 22222222222221 124555442 222222221 111111 11000000 000000 No 249 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=56.28 E-value=0.46 Score=22.43 Aligned_cols=416 Identities=9% Similarity=-0.013 Sum_probs=161.6 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhh---hhccCcccCCH----HHHHHHHhcCchhhhhhccchhH Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAW---CEYGFPQEITF----NDLYTMYRRGGIAHGAVEKIVTT 73 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~---~~~~~~~~~~~----~~l~~~Y~~~~l~r~iVd~~aed 73 (456) |.+...+.-+.+- ..+.+..+.+.+--... ...| ..|..|..++. ......| ...+...|+..|.. T Consensus 1 ~~~~~~~~~~~~~-~~l~~r~~~L~~~R~~~----e~~w~e~a~~~lP~~~~~~~~~~~~~~~~--dstg~~a~~~LAa~ 73 (516) T protein:vir:96 1 MKQSIDLEYGGKR-SKIPKLWEKFSNKRSSF----LDRAKHYSKLTLPYLMNDKGDNETSQNGW--QGVGAQATNHLANK 73 (516) T ss_pred CcchhhhhhhhhH-HHHHHHHHHHHHHhhHH----HHHHHHHHHhhcccccCCCCCccccCCcc--cchHHHHHHHHHHH Confidence 4443333222111 11111111111100000 0011 11111111110 0111122 23334444444444 Q ss_pred H-------hhCCCEEecCCCcc------------hhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecC Q lcl|NC_016762. 74 C-------WKTNPQVIEGDDQD------------RSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRD 134 (456) Q Consensus 74 ~-------tR~~~~i~~~~~~d------------~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D 134 (456) + .|.||++.-++... ..+--...++.+...+.+-++...+.++.+.-..+|-+++++.-+ T Consensus 74 l~~~ltpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~~- 152 (516) T protein:vir:96 74 LAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKPSK- 152 (516) T ss_pred HHhhhcCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEecCC- Confidence 3 47899985333211 111122335567778888899999999999877888877766422 Q ss_pred CCCcccccc------CCcCceeEEEEeccccCChhhhhcc--cc---------ccccCCceeEEEeecccCCcc------ Q lcl|NC_016762. 135 SQPWDRPAR------GKLNGLAKVTPAWAGCLKPKSFDEK--PD---------SETYGQPTMWEYTEASQAGRP------ 191 (456) Q Consensus 135 ~~~~~~Pl~------~~~~~l~~i~~~~~~~~~~~~~~~D--p~---------s~~yg~P~~y~i~~~~~~g~~------ 191 (456) +.--.-|+. ...+.+. ..+.+..+++.++..+ +. ...+..-+.|....-..++.. T Consensus 153 ~~~~~~pl~~y~v~~d~~G~v~--~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~~ 230 (516) T protein:vir:96 153 GAISAIPMHHYVVNRDTNGDLL--DIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYLGDGFWELKQSA 230 (516) T ss_pred CCEEEEEcCeEEEeeCCCCCee--eehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeeeCCceeEEEEEe Confidence 211123552 2333332 2333333343332111 00 011222223322111111100 Q ss_pred -------ccceeeehhhh--hee--cCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccHhhHHh Q lcl|NC_016762. 192 -------GLVRDIHPDRV--FIL--GDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINLGEIAS 260 (456) Q Consensus 192 -------~~~~~IH~SRl--i~~--~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~l~~ 260 (456) .++...|..=. .|| ..+..+|.|-.+.++..+..++...... ++.+....... -+.. T Consensus 231 d~~~~~~es~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~----l~~~~~a~~~~--------~lv~ 298 (516) T protein:vir:96 231 DDIPVGKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAV----ARGAALMADIK--------YLIR 298 (516) T ss_pred CceeeccccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHH----HHHHHHhcCCc--------cccC Confidence 00111121111 111 1233579998888888777766554433 32221111110 0111 Q ss_pred hhcCCHHHHHHHHHHHHHHHhcC-CCeEEecCCCceeEEecc----cCCHHHHHHHHHHHHHhhhcCCeEEeec-cCCCc Q lcl|NC_016762. 261 TYGVTLDALNERFNEAARQLNRG-NDVLLPTQGATVTQMVSA----VSDPGPTYNVNLQTAAAGVDIPTKILVG-MQTGE 334 (456) Q Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~lid~~d~~~~~~~~----~sgl~~~~~~~~~~~aaas~IP~t~L~G-~sp~G 334 (456) . +...+ ...+..+ .+..+-+..+++..++.. |.-....++...+.|.-+.=+- ++. ..... T Consensus 299 p-----~g~~~-----~~~l~~~~~g~i~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~---~l~~r~~~r 365 (516) T protein:vir:96 299 P-----GAQTD-----VDHFVNSGTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMME---TMTRRDAER 365 (516) T ss_pred c-----ccccc-----hhhhccCCCceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhh---hhccCCCcc Confidence 0 11111 1122233 333333344566665443 4445566777777776654211 122 23444 Q ss_pred ccchH------H-HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCC-C-CceEEEe-CCCCCCCHHHHHHHHHHHHHH Q lcl|NC_016762. 335 RASSE------D-QKYHNARCQARRVQELTFEINDLFAHLMRIGVVPL-K-AEFTAIW-DDLTVPTKAERLANSKTMSEI 404 (456) Q Consensus 335 lnst~------D-~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~-~-~d~~~~f-~pL~~~seke~Aei~~~~A~a 404 (456) +.+++ + ....--.++..|...|.|.+++++..+ +|. | ..+.+++ .+|.++....+++--...++. T Consensus 366 vTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~-----~p~lp~~~v~~~~vs~l~~l~r~~~~~~i~~~~~~ 440 (516) T protein:vir:96 366 VTAVEIQRDALEIEQNMGGVYSLFATTMQSPVAMWGLLEA-----GESFTSDLVDPVIITGIEALGRMAELDKLANFAQY 440 (516) T ss_pred ccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhc-----CCCCccccccceeechHHHHHHHHHHHHHHHHHHH Confidence 67774 2 233555566678888999988865433 222 2 2223222 244444444444433333333 Q ss_pred HHHHHHcCC---cCcCHHHH-HHHhcccCCCCCC-CCcccCCCCC--------------CCCCcCCC-CCCC Q lcl|NC_016762. 405 NSAAIGTGE---PVFTAEEI-REEAGYDPLQGGD-PLPDTEPEDE--------------DAARTDPT-GEQQ 456 (456) Q Consensus 405 ~~~~~~~g~---~~i~~~E~-R~~~~~~~~~~~~-~~~~~~~~d~--------------~~~~~d~~-~~~e 456 (456) ....++... ..|+.+++ +..+..-+.+... ..+++....- ..-+..++ +.+| T Consensus 441 i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~~~~~~~~q~~~~~a~~~~~~~~~~~~~~ 512 (516) T protein:vir:96 441 MSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMAQEQEAQMQAQQAQMLEEGVAKAVPGVIQQE 512 (516) T ss_pred HHHHhcCChhHHhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHhhcc Confidence 332222221 24555554 3333333322211 1111110000 00000000 0000 No 250 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=54.19 E-value=0.51 Score=22.19 Aligned_cols=417 Identities=10% Similarity=0.032 Sum_probs=173.2 Q ss_pred CCchh-HHHHh------HHHHHHH---HHHHHHHhhhh-hccCccc-chhhhhccCcccCCHHHHHHHHhcCchhhhhhc Q lcl|NC_016762. 1 MTDKL-DLAVN------HAMSSAI---ARARMSLLNQG-IGHDAKR-PQAWCEYGFPQEITFNDLYTMYRRGGIAHGAVE 68 (456) Q Consensus 1 ~~~~~-~~~~~------~a~~~~~---~~~~d~~~n~~-~~~gt~~-~~~~~~~~~~~~~~~~~l~~~Y~~~~l~r~iVd 68 (456) |.++. -++.. ..++..+ ......+.... -.+.... +.......-+-.-+-.++..-. .++|...+. T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~L-aa~l~~~lt- 78 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGLNNL-ASKLMLALF- 78 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHH-HHHHHHhhc- Confidence 99833 11111 1111111 11111122211 0111000 0000000000000112222222 245555452 Q ss_pred cchhHHhhCCCEEecCCCcch------------hhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCC Q lcl|NC_016762. 69 KIVTTCWKTNPQVIEGDDQDR------------SKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQ 136 (456) Q Consensus 69 ~~aed~tR~~~~i~~~~~~d~------------~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~ 136 (456) |+ |.||++.-.+..-. .+-....++.+.+.+.+-++...+.++.+.-..+|-+++++.-+.++ T Consensus 79 -P~----~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~ 153 (536) T protein:vir:21 79 -PM----QTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGS 153 (536) T ss_pred -CC----CcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCC Confidence 64 56999853332210 11133346678888889999999999988877888887777543332 Q ss_pred C---c-ccccc------CCcCceeEEEEeccccCChhhhh----ccccc-----cccCCceeEEEeecccCCccccceee Q lcl|NC_016762. 137 P---W-DRPAR------GKLNGLAKVTPAWAGCLKPKSFD----EKPDS-----ETYGQPTMWEYTEASQAGRPGLVRDI 197 (456) Q Consensus 137 ~---~-~~Pl~------~~~~~l~~i~~~~~~~~~~~~~~----~Dp~s-----~~yg~P~~y~i~~~~~~g~~~~~~~I 197 (456) . + .-||. ...+.+ .+++.+..+++.+.. .+-.+ ..+..-+.|+.-....++. .+.+ T Consensus 154 ~~~~f~~~pl~~~~v~~d~~G~v--d~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~~~---~~~~ 228 (536) T protein:vir:21 154 NYNPMKLYRLSSYVVQRDAFGNV--LQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSG---EYLR 228 (536) T ss_pred ceeeEEEEEcCeEEEeeCCCCCe--eEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEecCCC---cEEE Confidence 1 1 24552 223323 233444444443221 11111 1122223332221111111 1122 Q ss_pred ehhh----h----------------hee--cCCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhccH Q lcl|NC_016762. 198 HPDR----V----------------FIL--GDWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLNFDKEINL 255 (456) Q Consensus 198 H~SR----l----------------i~~--~~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 255 (456) |+.. + .+| ..+..+|.|-.+..+..+..++...........+. .... T Consensus 229 ~~e~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a----~~~~------- 297 (536) T protein:vir:21 229 YEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMIS----SKVI------- 297 (536) T ss_pred EeccCCeeeccccCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHH----hcCC------- Confidence 2211 1 111 12335799988888888887776654443322211 1100 Q ss_pred hhHHhhhcCCHHHHHHHHHHHHHHHhcCC-CeEEecCCCceeEE----ecccCCHHHHHHHHHHHHHhhhcCCeEEeecc Q lcl|NC_016762. 256 GEIASTYGVTLDALNERFNEAARQLNRGN-DVLLPTQGATVTQM----VSAVSDPGPTYNVNLQTAAAGVDIPTKILVGM 330 (456) Q Consensus 256 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~lid~~d~~~~~----~~~~sgl~~~~~~~~~~~aaas~IP~t~L~G~ 330 (456) -+... ..... ...+..+. +..+-+..+++..+ ...|.-....+....+.|.-+.=+ .-|.-. T Consensus 298 -~lv~p-----~g~~~-----~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~--~~l~~~ 364 (536) T protein:vir:21 298 -GLVNP-----AGITQ-----PRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFML--NSAVQR 364 (536) T ss_pred -cccCc-----ccccc-----hhhhccCCCcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhh--hhcccC Confidence 01100 01111 11222332 32333334555433 334555667777777777666521 122223 Q ss_pred CCCcccchH------HH-HHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCcCCCC--ceEEEe-CCCCCCCHHHHHHHHHH Q lcl|NC_016762. 331 QTGERASSE------DQ-KYHNARCQARRVQELTFEINDLFAHLMRIGVVPLKA--EFTAIW-DDLTVPTKAERLANSKT 400 (456) Q Consensus 331 sp~Glnst~------D~-~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~~~~--d~~~~f-~pL~~~seke~Aei~~~ 400 (456) ....+.+++ +. ...--.++..|...|.|.|++++.+|.+.+..|+++ .+.+++ .||.++.-.+.+ .+ T Consensus 365 ~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~---~~ 441 (536) T protein:vir:21 365 TGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDL---DK 441 (536) T ss_pred CCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHH---HH Confidence 445567774 22 335556677888899999999999999998876432 223333 345444333322 23 Q ss_pred HHHHHHHHHHcCC----cCcCHHHHHH-HhcccCC-CCCCC-CcccCCCCCC---------------------------- Q lcl|NC_016762. 401 MSEINSAAIGTGE----PVFTAEEIRE-EAGYDPL-QGGDP-LPDTEPEDED---------------------------- 445 (456) Q Consensus 401 ~A~a~~~~~~~g~----~~i~~~E~R~-~~~~~~~-~~~~~-~~~~~~~d~~---------------------------- 445 (456) .........+.+- +.|+.+++-+ .+...+. ..... .+++...--+ T Consensus 442 l~~~~~~la~~~Pe~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~ 521 (536) T protein:vir:21 442 LERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPE 521 (536) T ss_pred HHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChh Confidence 3333333333331 2356665532 3322232 11111 1110000000 Q ss_pred --CCCcCCCCCCC Q lcl|NC_016762. 446 --AARTDPTGEQQ 456 (456) Q Consensus 446 --~~~~d~~~~~e 456 (456) ..-.+..+.++ T Consensus 522 ~~~~~~~~~g~~~ 534 (536) T protein:vir:21 522 AMAAAADSVGLQP 534 (536) T ss_pred hHHhhhhccccCC Confidence 00000000000 No 251 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=35.37 E-value=1.2 Score=20.08 Aligned_cols=396 Identities=11% Similarity=0.022 Sum_probs=155.6 Q ss_pred CC---chhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccC----C------HHHHHHHHhcCchhhhhh Q lcl|NC_016762. 1 MT---DKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEI----T------FNDLYTMYRRGGIAHGAV 67 (456) Q Consensus 1 ~~---~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~----~------~~~l~~~Y~~~~l~r~iV 67 (456) |+ .+|..-..++..+.+ .+|.+. |+..-+...+-+-|+.- + |+...+.-....+.+++| T Consensus 32 m~dV~~~hp~y~a~~~~W~~--ird~~~------G~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl 103 (535) T protein:vir:80 32 LPNVGYQRVEFGEMLPKWRK--IMDCLS------GQEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTL 103 (535) T ss_pred CCCCCcCCHHHHHHHHHHHH--HHHHhc------ChHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHH Confidence 55 588876555555544 445552 44332332222223210 0 222222222345677788 Q ss_pred ccchhHHhhCCCEEecCCCcchhhhhHHHHHHHHHHHHHh-----hHHHHHHHHHHhhcccCceEEEEEecC-CCCcc-- Q lcl|NC_016762. 68 EKIVTTCWKTNPQVIEGDDQDRSKDETEWERKNKPLIAGG-----RFWRAVSEADRRRLVGRYSGLLLHIRD-SQPWD-- 139 (456) Q Consensus 68 d~~aed~tR~~~~i~~~~~~d~~~~~~~~e~~i~~~~~~l-----~~~~~~~ea~~~~r~~Ggs~i~i~i~D-~~~~~-- 139 (456) +..+--.+|+.+++.. + ..++.+++.. ++.+.++.+.+....||.++|+++... +...+ T Consensus 104 ~~l~G~vfrk~p~~~~---p----------~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~a 170 (535) T protein:vir:80 104 DGMMGQVFSRDPIRQL---P----------PALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVL 170 (535) T ss_pred HHHhchhhcCCcceec---c----------HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHH Confidence 8888788888877621 1 1244444433 577778888888888999999997632 21100 Q ss_pred -------ccc-------c---------CCcCceeEEEEeccccCChhhhhccccccccCCceeEEEeecccCC------- Q lcl|NC_016762. 140 -------RPA-------R---------GKLNGLAKVTPAWAGCLKPKSFDEKPDSETYGQPTMWEYTEASQAG------- 189 (456) Q Consensus 140 -------~Pl-------~---------~~~~~l~~i~~~~~~~~~~~~~~~Dp~s~~yg~P~~y~i~~~~~~g------- 189 (456) -|. . ++.+.|.. ++++-.+.. ..|.++... -+.|++.+-..+| T Consensus 171 de~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~--v~lrE~~~~---~dd~f~~~~--~~q~RvL~~~~~G~y~v~~~ 243 (535) T protein:vir:80 171 EQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISL--VVIQENVLA---QDDGFETTY--VQQWRVLQLNAEGNYQVERW 243 (535) T ss_pred HHHhcCCCcEEEEechhhccCccccccCCccceeE--EEEEEEEEe---cCCCcccce--eEEEEEEEecCCceEEEEEE Confidence 011 0 11222322 223221111 012222221 0112221110000 Q ss_pred ------ccccce--------eeehhhhheec--C--CcCC--Cc-chHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh Q lcl|NC_016762. 190 ------RPGLVR--------DIHPDRVFILG--D--WTGD--AI-GFLEPAYNSFISLEKVEGGSGESFLKNAARQLLLN 248 (456) Q Consensus 190 ------~~~~~~--------~IH~SRli~~~--~--~~~~--G~-S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~ 248 (456) ...... ..|.=-.|.|. + .+.+ |. +++..+.=.+..|...+. .-+.+|-....++.+. T Consensus 244 ~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd-~~~il~~~~~P~l~i~ 322 (535) T protein:vir:80 244 RRETQEEMYYSYSKHVPTDGNGNPFKEIPFQFIGPLDNNADIDHPPLLDLCEVNIGHYRNSAD-YEEMAFVAGQPTAFFT 322 (535) T ss_pred EeecCCccccccceeecccCCCcccCeeEEEEeecCCCCCCCCccchHHHHHHHHHHhhchhH-HHHHHHHhcCceeeee Confidence 000000 11322234332 1 1222 22 333333323333322211 1222222222222221 Q ss_pred hhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcC-CCeEEecCCCceeEEecccCCHH-HHHHHHHHHHHhhhcCCeEE Q lcl|NC_016762. 249 FDKEINLGEIASTYGVTLDALNERFNEAARQLNRG-NDVLLPTQGATVTQMVSAVSDPG-PTYNVNLQTAAAGVDIPTKI 326 (456) Q Consensus 249 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~lid~~d~~~~~~~~~sgl~-~~~~~~~~~~aaas~IP~t~ 326 (456) .+.+.....+ .+. ..+.-+ .....+.++.++..+..+-+++. ..++...++++..-. + T Consensus 323 -----G~~~~~~~~~------~~~-----~~i~iG~~~~~~lP~~~~~~~~e~~~~~~a~~~l~~~e~qM~~lGa----~ 382 (535) T protein:vir:80 323 -----GLTKDWVEDV------FKD-----FKVHLGSRAIIPLPQGATAGILQITPNSVPFEAMTHKESQMIAMGA----N 382 (535) T ss_pred -----cCchhhhhcC------CCC-----cceEecCcccccCCCCCCcceeeeccchhHHHHHHHHHHHHHHHHH----H Confidence 1100000000 000 000111 12223444555555555555553 333334444444332 2 Q ss_pred eeccCCCcccchH---HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCc-CCCCceEEEeCC---CCCCCHHHHHHHHH Q lcl|NC_016762. 327 LVGMQTGERASSE---DQKYHNARCQARRVQELTFEINDLFAHLMRIGVV-PLKAEFTAIWDD---LTVPTKAERLANSK 399 (456) Q Consensus 327 L~G~sp~Glnst~---D~~nyyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~-~~~~d~~~~f~p---L~~~seke~Aei~~ 399 (456) |+.+++++..+++ |...=+..++++- ..+...++++++++.++... ..+.+..|..|. .-.++..+.+ T Consensus 383 ll~~~~~~~Ta~~a~~~~~~~~S~L~~~a-~~le~al~~aL~~~A~w~G~~~~~~~~~i~~n~dF~~~~ld~~~~~---- 457 (535) T protein:vir:80 383 LLVKSGGNRTFGEAQQEEASEQSILSACT-KNVSMAFRKALRWANQFQTGIVNDETVEYNLNTDFPAARLTPNERA---- 457 (535) T ss_pred hhccCcccccHHHHHHHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHcCCccCCCceEEEeccccccccCCHHHHH---- Confidence 3445555565553 3333344455444 35788899999988877543 334456666553 3333443332 Q ss_pred HHHHHHHHHHHcCCcCcCHHHHHHHhcccCCCC-CCCCcccCC--CCC--------CCCCcC-CCCC------------C Q lcl|NC_016762. 400 TMSEINSAAIGTGEPVFTAEEIREEAGYDPLQG-GDPLPDTEP--EDE--------DAARTD-PTGE------------Q 455 (456) Q Consensus 400 ~~A~a~~~~~~~g~~~i~~~E~R~~~~~~~~~~-~~~~~~~~~--~d~--------~~~~~d-~~~~------------~ 455 (456) +...+.++| .|+.+.+++.+..-++-+ ....+++.. ++| ....++ -+++ + T Consensus 458 ----all~~~~~G--~Is~et~~~~L~r~gvl~~~~~~eee~~ri~~E~~~~~~~~g~~~d~~~~g~~~~~~~~~~~~~~ 531 (535) T protein:vir:80 458 ----ELILEWQQG--AITFKEMRAGLRRAGVASEDDAKAETEGKATVEFIAKTAAAGKVGDAASGGTNKAKLNNGNGGGN 531 (535) T ss_pred ----HHHHHHhcC--CCCHHHHHHHHHhCCCCCcccchHHHHHHHHhhhhhccccCCCCCCCCCCCCCcCcccCCccccc Confidence 223344455 566655554432222111 001111000 000 000000 0111 1 Q ss_pred C Q lcl|NC_016762. 456 Q 456 (456) Q Consensus 456 e 456 (456) + T Consensus 532 ~ 532 (535) T protein:vir:80 532 Q 532 (535) T ss_pred c Confidence 1 No 252 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=26.73 E-value=1.9 Score=19.04 Aligned_cols=408 Identities=13% Similarity=0.017 Sum_probs=154.6 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcc---------cCCHHHHHHHHhcC----------c Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQ---------EITFNDLYTMYRRG----------G 61 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~---------~~~~~~l~~~Y~~~----------~ 61 (456) |.-.+++=- . ...+ -...-++|. ...|..|+++|..+ + T Consensus 1 ~~~~~~~~~--~------------~~~~---------~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~ 57 (527) T protein:vir:10 1 MGQDKRQYG--S------------TQQL---------RAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGG 57 (527) T ss_pred CCccccccC--C------------CcCc---------CCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCc Confidence 222222100 0 0000 000111211 11233444555442 1 Q ss_pred h---hhhhhccchhHHhhCCCEEe-cCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCC Q lcl|NC_016762. 62 I---AHGAVEKIVTTCWKTNPQVI-EGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQP 137 (456) Q Consensus 62 l---~r~iVd~~aed~tR~~~~i~-~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~ 137 (456) - .|.+++--....+-.--+|+ .+.+-.........+..++...++-++..++.++.+|..+-|.+++.+.-+.+++ T Consensus 58 ~~~~~r~~~~ps~~~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~ 137 (527) T protein:vir:10 58 DEGDQRPIYVPNGEKLIEAKMRFLGQGLKWEFSKKDAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKD 137 (527) T ss_pred cccccceeeehhhHHhhCCcceeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCC Confidence 1 12222222222222222332 1111111222233455677777888888999999999988888777665433332 Q ss_pred ----cc----c-----ccc--CCcCceeEEEEe--ccccCChhh-----------hhc-cccccc-cCCc----eeEEE- Q lcl|NC_016762. 138 ----WD----R-----PAR--GKLNGLAKVTPA--WAGCLKPKS-----------FDE-KPDSET-YGQP----TMWEY- 182 (456) Q Consensus 138 ----~~----~-----Pl~--~~~~~l~~i~~~--~~~~~~~~~-----------~~~-Dp~s~~-yg~P----~~y~i- 182 (456) ++ . |+. .+.+.+..++-. |+..-.+.. +.- |...|- .|.- ..|.+ T Consensus 138 ~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg 217 (527) T protein:vir:10 138 EGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPG 217 (527) T ss_pred cCCCceEeecCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeecc Confidence 11 1 111 122334444433 443222221 111 111111 1211 12222 Q ss_pred --ee---cccCC----ccccceeeehhhh-h------eecC----CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_016762. 183 --TE---ASQAG----RPGLVRDIHPDRV-F------ILGD----WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAA 242 (456) Q Consensus 183 --~~---~~~~g----~~~~~~~IH~SRl-i------~~~~----~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~ 242 (456) +. ....- ....+..+|+.=. | +|.. ...||.|-|+.+..-+..+.++. +-. . T Consensus 218 ~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~-Td~-------s 289 (527) T protein:vir:10 218 KWDDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTM-TDE-------D 289 (527) T ss_pred ccccccccccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhh-hHH-------H Confidence 00 00000 0011233443322 2 2321 34699999987765554443332 111 1 Q ss_pred hhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEE-ecCCCceeEEec--ccCCHHHHHHHHHHHHHhh Q lcl|NC_016762. 243 RQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLL-PTQGATVTQMVS--AVSDPGPTYNVNLQTAAAG 319 (456) Q Consensus 243 ~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-id~~d~~~~~~~--~~sgl~~~~~~~~~~~aaa 319 (456) +.+.+.....+..+.+. ..+. ...+..++=+-+.+. ++.+.++..++. .++++.+.++..+..++.. T Consensus 290 ~is~~sG~Pi~~~tg~~---------~vd~-~G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~v 359 (527) T protein:vir:10 290 LIMVFGGLGFYATDSAP---------PRDS-RGNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQT 359 (527) T ss_pred HHHHHhCCceeeecccc---------cccc-cCCcCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHh Confidence 11111111111111111 0000 000111111223333 455567777766 4567888899999999999 Q ss_pred hcCCeEEeec-cCCCcccchHH-HH-HHHHHHHHHHHhh--hhHHHHHHHH-HH-----HHhcCcCC----CCceEEEeC Q lcl|NC_016762. 320 VDIPTKILVG-MQTGERASSED-QK-YHNARCQARRVQE--LTFEINDLFA-HL-----MRIGVVPL----KAEFTAIWD 384 (456) Q Consensus 320 s~IP~t~L~G-~sp~Glnst~D-~~-nyyd~I~~~Qe~~--lrp~L~~l~~-~l-----~~s~~~~~----~~d~~~~f~ 384 (456) +++|.+ =|| .-+++ +-++. ++ ..--.+++-|+.. ++-.+.++.. .+ .+-+++.. .-.+++.|. T Consensus 360 A~~Pav-A~G~vD~s~-~~SG~ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~ 437 (527) T protein:vir:10 360 KGIPDI-AVGVVDAAV-AESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFR 437 (527) T ss_pred hcCCee-eeccccCCc-CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEec Confidence 999997 455 22222 22221 11 1222233333332 2332322111 11 11122221 235789999 Q ss_pred CCCCCCHHHHHHHHHHHHHH--------HHHHHHcCCcCcCHH-HHHHHh--------------cccCCCCCCCCcccCC Q lcl|NC_016762. 385 DLTVPTKAERLANSKTMSEI--------NSAAIGTGEPVFTAE-EIREEA--------------GYDPLQGGDPLPDTEP 441 (456) Q Consensus 385 pL~~~seke~Aei~~~~A~a--------~~~~~~~g~~~i~~~-E~R~~~--------------~~~~~~~~~~~~~~~~ 441 (456) |....++++..+..++.-++ ...+..+| ..-+++ |+++.. +..+.-.+....-.++ T Consensus 438 p~lP~D~~avie~v~tL~~aGi~S~~tAv~~L~~~~-g~eD~E~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~g~~~~ 516 (527) T protein:vir:10 438 DPKPVNSEKRFNQLLQLWEAGLIPAKKLTEELSKIM-GFELTEEDFKQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDE 516 (527) T ss_pred ccCCCCHHHHHHHHHHHHHcCchhHHHHHHHHHhcc-CCCChHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCC Confidence 99777777776655444332 11122221 011221 222110 0011100011000111 Q ss_pred CCCCCCCcCCC Q lcl|NC_016762. 442 EDEDAARTDPT 452 (456) Q Consensus 442 ~d~~~~~~d~~ 452 (456) +.++.-.+-|. T Consensus 517 ~~d~~~~~~~~ 527 (527) T protein:vir:10 517 EDDQALNGQPL 527 (527) T ss_pred CcccccCCCCC Confidence 11222222233 No 253 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=25.99 E-value=2 Score=18.94 Aligned_cols=408 Identities=13% Similarity=0.024 Sum_probs=154.8 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcc---------cCCHHHHHHHHhcC----------c Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQ---------EITFNDLYTMYRRG----------G 61 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~---------~~~~~~l~~~Y~~~----------~ 61 (456) |.-.+++=- . ...+ -...-++|. ...|..|+++|..+ + T Consensus 1 ~~~~~~~~~--~------------~~~~---------~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~ 57 (527) T protein:vir:10 1 MGQDKRQYG--S------------TQQL---------RAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGG 57 (527) T ss_pred CCccccccC--C------------CcCc---------CCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCc Confidence 222222100 0 0000 000111211 11233444555442 1 Q ss_pred h---hhhhhccchhHHhhCCCEEe-cCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCC Q lcl|NC_016762. 62 I---AHGAVEKIVTTCWKTNPQVI-EGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQP 137 (456) Q Consensus 62 l---~r~iVd~~aed~tR~~~~i~-~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~ 137 (456) - .|.+++--....+-+--+|+ .+.+-.........+..++...++-++..++.++.+|..+-|.+++.+.-+.+++ T Consensus 58 ~~~~~r~~~~ps~~~~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~ 137 (527) T protein:vir:10 58 DEGDQRPIYVPNGEKLIEAKMRFLGQGLKWEFSKKDAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKD 137 (527) T ss_pred cccccceeeehhhHHhhCCcceeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCC Confidence 1 12222222222222222332 1111112222233455677777888888999999999988888777665433332 Q ss_pred ----cc----c-----ccc--CCcCceeEEEEe--ccccCChhh-----------hhc-cccccc-cCCc----eeEEE- Q lcl|NC_016762. 138 ----WD----R-----PAR--GKLNGLAKVTPA--WAGCLKPKS-----------FDE-KPDSET-YGQP----TMWEY- 182 (456) Q Consensus 138 ----~~----~-----Pl~--~~~~~l~~i~~~--~~~~~~~~~-----------~~~-Dp~s~~-yg~P----~~y~i- 182 (456) ++ . |+. .+.+.+..++-. |+..-.+.. +.- |...|- .|.- ..|.+ T Consensus 138 ~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg 217 (527) T protein:vir:10 138 EGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPG 217 (527) T ss_pred cCCCceEeecCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeecc Confidence 11 1 111 122334444433 443222221 111 111111 1211 12222 Q ss_pred --ee---cccCC----ccccceeeehhhh-h------eecC----CcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_016762. 183 --TE---ASQAG----RPGLVRDIHPDRV-F------ILGD----WTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAA 242 (456) Q Consensus 183 --~~---~~~~g----~~~~~~~IH~SRl-i------~~~~----~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~ 242 (456) +. ....- ....+..+|+.=. | +|.. ...||.|-|+.+..-+..+.++. +-. . T Consensus 218 ~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~-Td~-------s 289 (527) T protein:vir:10 218 KWDDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTM-TDE-------D 289 (527) T ss_pred ccccccccccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhh-hHH-------H Confidence 00 00000 0011233443322 2 2321 34699999987765554443332 111 1 Q ss_pred hhhhhhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEE-ecCCCceeEEec--ccCCHHHHHHHHHHHHHhh Q lcl|NC_016762. 243 RQLLLNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLL-PTQGATVTQMVS--AVSDPGPTYNVNLQTAAAG 319 (456) Q Consensus 243 ~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-id~~d~~~~~~~--~~sgl~~~~~~~~~~~aaa 319 (456) +.+.+.....+..+.+. ..+. ...+..++=+-+.+. ++.+.++..++. .++++.+.++..+..++.. T Consensus 290 ~is~~sG~Pi~~~tg~~---------~vd~-~G~~~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~v 359 (527) T protein:vir:10 290 LIMVFGGLGFYATDSAP---------PRDS-RGNMVPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQT 359 (527) T ss_pred HHHHHhCCceeeecccc---------cccc-cCCcCccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHh Confidence 11111111111111111 0000 000111111223333 455567777766 4567888899999999999 Q ss_pred hcCCeEEeec-cCCCcccchHH-HH-HHHHHHHHHHHhh--hhHHHHHHHH-HH-----HHhcCcCC----CCceEEEeC Q lcl|NC_016762. 320 VDIPTKILVG-MQTGERASSED-QK-YHNARCQARRVQE--LTFEINDLFA-HL-----MRIGVVPL----KAEFTAIWD 384 (456) Q Consensus 320 s~IP~t~L~G-~sp~Glnst~D-~~-nyyd~I~~~Qe~~--lrp~L~~l~~-~l-----~~s~~~~~----~~d~~~~f~ 384 (456) +++|.+ =|| .-+++ +-++. ++ ..--.+++-|+.. ++-.+.++.. .+ .+-+++.. .-.+++.|. T Consensus 360 A~~Pav-A~G~vD~s~-~~SG~ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~ 437 (527) T protein:vir:10 360 KGIPDI-AVGVVDAAV-AESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFR 437 (527) T ss_pred hcCCee-eeccccCCc-CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEec Confidence 999997 455 22222 22221 11 1222233333332 2332322111 11 11122221 235789999 Q ss_pred CCCCCCHHHHHHHHHHHHHH--------HHHHHHcCCcCcCH-HHHHHHh--------------cccCCCCCCCCcccCC Q lcl|NC_016762. 385 DLTVPTKAERLANSKTMSEI--------NSAAIGTGEPVFTA-EEIREEA--------------GYDPLQGGDPLPDTEP 441 (456) Q Consensus 385 pL~~~seke~Aei~~~~A~a--------~~~~~~~g~~~i~~-~E~R~~~--------------~~~~~~~~~~~~~~~~ 441 (456) |....++++..+..++.-++ ...+..+| ..-++ .|+++.. +..+.-.+....-.++ T Consensus 438 p~lP~D~~avie~v~tL~~aGiiS~etAv~~L~~~~-g~eD~E~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~ 516 (527) T protein:vir:10 438 DPKPVNNEKRFAQLLELWEAGLIPAKKLTEELSKIM-GFELTEEDFRQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDE 516 (527) T ss_pred ccCCCCHHHHHHHHHHHHHcCchhHHHHHHHHHhcc-CCCchHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCC Confidence 99777777766655444332 11122221 01122 1222211 0001100011000111 Q ss_pred CCCCCCCcCCC Q lcl|NC_016762. 442 EDEDAARTDPT 452 (456) Q Consensus 442 ~d~~~~~~d~~ 452 (456) +.++.-.+-|. T Consensus 517 ~~d~~~~~~~~ 527 (527) T protein:vir:10 517 EDDQALNGQPL 527 (527) T ss_pred CcccccCCCCC Confidence 11222222233 No 254 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=22.83 E-value=2.4 Score=18.51 Aligned_cols=419 Identities=10% Similarity=0.020 Sum_probs=148.1 Q ss_pred CCchhHHHHhHHHHHHHHHHHHHHhhhhhccCcccchhhhhccCcccCCHHHHHHHHhc------------------Cch Q lcl|NC_016762. 1 MTDKLDLAVNHAMSSAIARARMSLLNQGIGHDAKRPQAWCEYGFPQEITFNDLYTMYRR------------------GGI 62 (456) Q Consensus 1 ~~~~~~~~~~~a~~~~~~~~~d~~~n~~~~~gt~~~~~~~~~~~~~~~~~~~l~~~Y~~------------------~~l 62 (456) |.-.|++= +.-+.+. .-+..|.+.-.+-. +...|..|.++|.. .+- T Consensus 1 m~~~~~q~--~p~~~~f---p~~~a~wV~~~D~~-----------RlaaY~ly~d~y~n~~~el~~il~G~dr~~~~~ps 64 (563) T protein:vir:74 1 MPYNHKQY--DPAKPFL---RGGDDNIVDENDKN-----------RVRAYDLYENIYLNSAETLKLVLRGDDSVPILMPS 64 (563) T ss_pred CCcccccc--CCCcccc---cccccccCCHHHHH-----------HHHHHHHHHHhhcCchhhhhhhcCCCceeeeccch Confidence 44444332 1111000 00000100000000 11122333333332 223 Q ss_pred hhhhhccchhHHhhCCCEEe-cCCCcchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhhcccCceEEEEEecCCCCccc- Q lcl|NC_016762. 63 AHGAVEKIVTTCWKTNPQVI-EGDDQDRSKDETEWERKNKPLIAGGRFWRAVSEADRRRLVGRYSGLLLHIRDSQPWDR- 140 (456) Q Consensus 63 ~r~iVd~~aed~tR~~~~i~-~~~~~d~~~~~~~~e~~i~~~~~~l~~~~~~~ea~~~~r~~Ggs~i~i~i~D~~~~~~- 140 (456) ++++|++.+ --+-.+.+++ ...+. +......+++-++...++-++..++.++.+|+.+-|.+++.+.-+..+..-+ T Consensus 65 ~r~~V~~~~-~~Lg~~~~~~Ve~~~~-de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~~g~R 142 (563) T protein:vir:74 65 GRKIVEAVH-RFLGVGFDYLVEPDMG-DEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKKAGER 142 (563) T ss_pred HHHHHHHHH-HhcCCCcEEecCcccc-CcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccccCCC Confidence 678888854 4456677774 33322 2333345677788888899999999999999888887766554321111000 Q ss_pred ----ccc----------CCcCceeEEEE--eccccCChh-----------hhhccccccc-c-CCceeEEE-----ee-c Q lcl|NC_016762. 141 ----PAR----------GKLNGLAKVTP--AWAGCLKPK-----------SFDEKPDSET-Y-GQPTMWEY-----TE-A 185 (456) Q Consensus 141 ----Pl~----------~~~~~l~~i~~--~~~~~~~~~-----------~~~~Dp~s~~-y-g~P~~y~i-----~~-~ 185 (456) +++ +...|+.-+.+ -|+..-.+. ++|..|.-.. | ..-+.|.. .+ . T Consensus 143 ~rv~~vDP~~~fp~~dpd~v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae~w~lg~wd~r~~~ 222 (563) T protein:vir:74 143 ISVDEVDPRQIFLIEDGSTVVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELTHWTLGNWDDRGAI 222 (563) T ss_pred ceEeecCCceeeeccCCCCcccceeeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccchhccccccccCcc Confidence 000 11222211110 111100000 0111110000 0 00011211 00 0 Q ss_pred ccCCccccceeeehhhh---------------heec----CCcCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh Q lcl|NC_016762. 186 SQAGRPGLVRDIHPDRV---------------FILG----DWTGDAIGFLEPAYNSFISLEKVEGGSGESFLKNAARQLL 246 (456) Q Consensus 186 ~~~g~~~~~~~IH~SRl---------------i~~~----~~~~~G~S~le~~~~~l~~~~~~~~~~~~~~~~~~~~~l~ 246 (456) ...-.....-.+|-+|. ++|. ....||.|-|..+..-+..+.++. ... .+.+. T Consensus 223 ~~~~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~-Td~-------s~i~~ 294 (563) T protein:vir:74 223 SDEQARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSL-TDE-------DATIV 294 (563) T ss_pred chhhhcccchhhhhhhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhh-hHH-------HHHHH Confidence 00000000011222221 1232 234699999987776554443321 111 11222 Q ss_pred hhhhhhccHhhHHhhhcCCHHHHHHHHHHHHHHHhcCCCeEEecCCC----ceeEEec--ccCCHHHHHHHHHH-HHHhh Q lcl|NC_016762. 247 LNFDKEINLGEIASTYGVTLDALNERFNEAARQLNRGNDVLLPTQGA----TVTQMVS--AVSDPGPTYNVNLQ-TAAAG 319 (456) Q Consensus 247 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lid~~d----~~~~~~~--~~sgl~~~~~~~~~-~~aaa 319 (456) +.....+-+.+ +...+... ..+..++=+-+.++=..++ -++.++. +++++..=++-... .++-. T Consensus 295 ~tG~pi~vl~~-----~~p~d~~~----g~~~~w~vgpG~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~ 365 (563) T protein:vir:74 295 FQGLGMYVTNA-----SAPVDPNT----GELTDWNIGPMQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEG 365 (563) T ss_pred hcCCCeEEecc-----cccccccc----ccccccccCCceeEeccCCccccceeeecchhhhHHHHHHHHHHHHHHHHhh Confidence 22111111110 00000001 1112233333333322211 2445444 33455444666665 56778 Q ss_pred hcCCeEEeec-----cCCCcccch---H--HHHH------HHHHHHHHHHhhhhHHHHHHHHHHHHhcCcC-------CC Q lcl|NC_016762. 320 VDIPTKILVG-----MQTGERASS---E--DQKY------HNARCQARRVQELTFEINDLFAHLMRIGVVP-------LK 376 (456) Q Consensus 320 s~IP~t~L~G-----~sp~Glnst---~--D~~n------yyd~I~~~Qe~~lrp~L~~l~~~l~~s~~~~-------~~ 376 (456) +++|.+ =|| ..++|.+=. + +-++ ++......+.+.++-.| +.++.|..++-++ .+ T Consensus 366 s~tPav-A~G~vD~~~~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL-~~~erl~~~g~~~~~~g~~~~~ 443 (563) T protein:vir:74 366 SGTPEV-AIGRVDVTSAESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWL-PAYESDFQEQDGSRPFASADLL 443 (563) T ss_pred ccCcce-eecccccccccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHH-HHHHhHhhhhcccccccccccC Confidence 899997 556 344443311 1 1112 22223333222222222 1223333333332 12 Q ss_pred C--ceEEEeCCCCCCCHHHHHHHHHHHHH--------HHHHHHHcCCc---------CcCHHHHHHHh-----cccCCC- Q lcl|NC_016762. 377 A--EFTAIWDDLTVPTKAERLANSKTMSE--------INSAAIGTGEP---------VFTAEEIREEA-----GYDPLQ- 431 (456) Q Consensus 377 ~--d~~~~f~pL~~~seke~Aei~~~~A~--------a~~~~~~~g~~---------~i~~~E~R~~~-----~~~~~~- 431 (456) . -+++.|.|+.-.+..+..+.-.+.-+ |...+..+|.. .+..+.|-+++ ...+.. T Consensus 444 ~~~~v~ivf~p~~P~d~~~vv~~~~tl~~aGiiSretAv~~L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~~ 523 (563) T protein:vir:74 444 NECSVVCIFADPMPVNKTQVTQDTLLLQQAHLILRKMAVAKLRSIGWEYPEVDDQGNALTDDDIADMLLAEAEADASLGL 523 (563) T ss_pred CceEEEEEeCCCCCccHHHHHHHHHHHHHcCchhHHHHHHHHHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccCcccc Confidence 2 36788999986665554333222111 11223333411 23333333321 011100 Q ss_pred ---CCCCCcccCCCC--------CCCCCcCCCCCCC Q lcl|NC_016762. 432 ---GGDPLPDTEPED--------EDAARTDPTGEQQ 456 (456) Q Consensus 432 ---~~~~~~~~~~~d--------~~~~~~d~~~~~e 456 (456) +....++.+..| .++.+-.|...+- T Consensus 524 ~a~~~~g~~~~~~dd~g~p~~~~~~~~~~~~~~~~~ 559 (563) T protein:vir:74 524 SAMDNGGAGEQQFDDQGNPIDQFGNPVEIPPDVTQV 559 (563) T ss_pred eecccCCCCcccccccCCchhHcCCcccCCcccccc Confidence 000000000000 0111111111111 Done!