Query lcl|NC_021302.1_cdsid_YP_008051228.1 [gene=3] [protein=portal protein] [protein_id=YP_008051228.1] [location=2106..3560] Match_columns 484 No_of_seqs 140 out of 356 Neff 8.4 Searched_HMMs 1612 Date Thu Nov 7 17:41:28 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_3 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_3_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:108215 Length: 469 100.0 8E-121 5E-124 678.9 47.6 460 1-476 5-469 (469) 2 protein:vir:95254 Length: 488 100.0 5E-111 3E-114 625.2 40.5 444 1-477 1-488 (488) 3 protein:vir:79233 Length: 526 100.0 3E-109 2E-112 615.3 38.6 444 1-484 12-484 (526) 4 protein:vir:99232 Length: 526 100.0 5E-108 3E-111 608.8 39.5 444 1-484 13-484 (526) 5 protein:vir:103860 Length: 528 100.0 3E-107 2E-110 604.9 39.4 441 1-484 18-496 (528) 6 protein:vir:79063 Length: 491 100.0 5E-107 3E-110 603.5 39.1 434 1-484 7-455 (491) 7 protein:vir:107880 Length: 491 100.0 4E-106 2E-109 598.5 38.7 432 1-484 1-455 (491) 8 protein:vir:1986 Length: 512 # 100.0 1E-105 7E-109 596.1 40.2 441 1-484 14-472 (512) 9 protein:vir:79511 Length: 448 100.0 9E-104 6E-107 585.4 42.7 428 1-464 1-448 (448) 10 protein:vir:99853 Length: 488 100.0 9E-104 5E-107 585.6 38.4 432 1-484 1-442 (488) 11 protein:vir:77981 Length: 448 100.0 7E-103 5E-106 580.5 41.3 426 1-464 1-448 (448) 12 protein:vir:98816 Length: 446 100.0 6E-100 4E-103 564.5 36.0 393 1-424 7-446 (446) 13 protein:vir:78161 Length: 355 100.0 3.8E-90 2.4E-93 510.8 36.2 334 131-478 1-355 (355) 14 protein:vir:101648 Length: 518 99.9 1.3E-20 8.1E-24 129.6 37.9 429 1-484 3-467 (518) 15 protein:vir:7853 Length: 518 # 99.9 2.4E-20 1.5E-23 128.1 38.0 430 1-484 3-466 (518) 16 protein:vir:1380 Length: 422 # 99.9 2.1E-20 1.3E-23 128.5 35.7 402 1-451 11-422 (422) 17 protein:vir:93610 Length: 454 99.8 1.4E-19 8.6E-23 124.0 38.1 425 1-480 7-454 (454) 18 protein:vir:102727 Length: 945 99.8 8.6E-21 5.3E-24 130.6 29.7 445 1-484 52-569 (945) 19 protein:vir:1266 Length: 416 # 99.8 1.1E-19 7E-23 124.5 35.4 398 1-459 7-416 (416) 20 protein:vir:3843 Length: 397 # 99.8 2.6E-19 1.6E-22 122.4 36.2 392 1-465 1-397 (397) 21 protein:vir:4337 Length: 434 # 99.8 9.4E-20 5.8E-23 124.9 33.2 402 1-456 9-434 (434) 22 protein:vir:79772 Length: 648 99.8 5.5E-19 3.4E-22 120.7 37.2 443 1-484 38-531 (648) 23 protein:vir:102080 Length: 429 99.8 6.8E-19 4.2E-22 120.2 36.2 405 1-467 12-429 (429) 24 protein:vir:8418 Length: 409 # 99.8 6.4E-19 3.9E-22 120.3 36.0 389 1-459 8-409 (409) 25 protein:vir:10362 Length: 432 99.8 8.9E-19 5.5E-22 119.5 36.7 401 1-463 17-432 (432) 26 protein:vir:483 Length: 413 # 99.8 9.6E-19 6E-22 119.4 36.4 394 1-457 7-413 (413) 27 protein:vir:4454 Length: 414 # 99.8 3.8E-19 2.4E-22 121.6 34.0 394 1-462 8-414 (414) 28 protein:vir:102855 Length: 432 99.8 1.8E-18 1.1E-21 117.9 36.4 407 1-461 13-432 (432) 29 protein:vir:107605 Length: 432 99.8 1.8E-18 1.1E-21 117.9 36.4 407 1-461 13-432 (432) 30 protein:vir:105002 Length: 432 99.8 1.8E-18 1.1E-21 117.9 36.4 407 1-461 13-432 (432) 31 protein:vir:81152 Length: 411 99.8 1.6E-18 9.9E-22 118.2 35.7 391 1-452 1-411 (411) 32 protein:vir:3868 Length: 417 # 99.8 1.9E-18 1.2E-21 117.7 35.3 408 1-469 1-417 (417) 33 protein:vir:105064 Length: 421 99.8 9.5E-19 5.9E-22 119.4 33.6 400 1-464 3-421 (421) 34 protein:vir:100249 Length: 431 99.8 3.1E-18 1.9E-21 116.6 36.2 396 1-449 21-431 (431) 35 protein:vir:97060 Length: 432 99.8 2.3E-18 1.4E-21 117.3 35.3 401 1-463 17-432 (432) 36 protein:vir:80796 Length: 574 99.8 2.3E-18 1.4E-21 117.3 35.1 440 1-484 52-552 (574) 37 protein:vir:81072 Length: 432 99.8 5.8E-18 3.6E-21 115.1 36.5 400 1-462 1-432 (432) 38 protein:vir:98396 Length: 441 99.8 4.7E-18 2.9E-21 115.6 35.7 404 1-459 23-441 (441) 39 protein:vir:1431 Length: 419 # 99.8 4.2E-18 2.6E-21 115.9 34.6 396 1-465 8-419 (419) 40 protein:vir:6240 Length: 457 # 99.8 8.2E-18 5.1E-21 114.3 35.7 423 1-479 9-457 (457) 41 protein:vir:102118 Length: 409 99.8 6.9E-18 4.3E-21 114.7 35.3 391 1-454 7-409 (409) 42 protein:vir:1326 Length: 457 # 99.8 6.1E-18 3.8E-21 115.0 34.2 422 12-479 1-457 (457) 43 protein:vir:79984 Length: 441 99.8 1.4E-17 8.8E-21 112.9 35.7 403 1-459 14-441 (441) 44 protein:vir:9408 Length: 441 # 99.8 1.4E-17 8.8E-21 112.9 35.7 403 1-459 14-441 (441) 45 protein:vir:5737 Length: 419 # 99.8 3.9E-18 2.4E-21 116.1 32.4 395 1-473 6-419 (419) 46 protein:vir:96980 Length: 409 99.8 9.9E-18 6.1E-21 113.8 34.6 403 1-460 1-409 (409) 47 protein:vir:4509 Length: 424 # 99.8 1.2E-17 7.1E-21 113.5 34.8 387 1-454 23-424 (424) 48 protein:vir:93943 Length: 409 99.8 1.3E-17 7.8E-21 113.2 34.9 403 1-460 1-409 (409) 49 protein:vir:96579 Length: 576 99.8 1.7E-16 1E-19 107.1 40.8 443 1-484 44-546 (576) 50 protein:vir:80333 Length: 419 99.8 9.2E-18 5.7E-21 114.0 33.8 399 1-466 3-419 (419) 51 protein:vir:1884 Length: 424 # 99.8 7E-18 4.4E-21 114.6 33.0 397 5-451 1-424 (424) 52 protein:vir:2683 Length: 412 # 99.8 1.7E-17 1.1E-20 112.5 34.8 398 1-460 4-412 (412) 53 protein:vir:9702 Length: 406 # 99.8 2.4E-17 1.5E-20 111.7 35.4 394 1-462 4-406 (406) 54 protein:vir:81095 Length: 416 99.8 1.3E-17 8.1E-21 113.2 34.0 399 1-459 1-416 (416) 55 protein:vir:4598 Length: 416 # 99.8 1.3E-17 8.1E-21 113.2 34.0 399 1-459 1-416 (416) 56 protein:vir:101647 Length: 460 99.8 5.4E-17 3.4E-20 109.7 36.8 422 1-459 1-460 (460) 57 protein:vir:63755 Length: 547 99.8 1E-16 6.4E-20 108.2 38.0 431 1-484 43-544 (547) 58 protein:vir:3153 Length: 467 # 99.8 2.8E-17 1.7E-20 111.4 34.5 415 47-481 1-467 (467) 59 protein:vir:4952 Length: 386 # 99.8 2.2E-17 1.4E-20 111.9 34.0 373 1-459 7-386 (386) 60 protein:vir:94666 Length: 723 99.8 5.1E-17 3.2E-20 109.9 35.8 420 9-484 1-448 (723) 61 protein:vir:80644 Length: 551 99.8 1E-16 6.5E-20 108.2 37.5 436 1-484 38-548 (551) 62 protein:vir:99452 Length: 651 99.8 1.6E-16 9.9E-20 107.2 37.7 461 1-484 1-569 (651) 63 protein:vir:9359 Length: 348 # 99.8 2.4E-17 1.5E-20 111.7 33.1 342 68-460 1-348 (348) 64 protein:vir:189 Length: 424 # 99.8 2.6E-17 1.6E-20 111.5 33.1 397 5-451 1-424 (424) 65 protein:vir:95599 Length: 563 99.8 1.9E-16 1.2E-19 106.8 37.5 437 1-484 50-546 (563) 66 protein:vir:99312 Length: 563 99.8 1.9E-16 1.2E-19 106.8 37.5 437 1-484 50-546 (563) 67 protein:vir:94426 Length: 409 99.8 7E-17 4.3E-20 109.2 35.1 403 1-460 1-409 (409) 68 protein:vir:100150 Length: 437 99.8 3.2E-17 2E-20 111.0 33.1 406 1-461 1-437 (437) 69 protein:vir:7407 Length: 392 # 99.7 1.3E-16 7.8E-20 107.8 34.1 377 1-454 9-392 (392) 70 protein:vir:1023 Length: 392 # 99.7 2.1E-16 1.3E-19 106.6 34.3 376 1-454 9-392 (392) 71 protein:vir:3989 Length: 392 # 99.7 2.1E-16 1.3E-19 106.6 34.3 376 1-454 9-392 (392) 72 protein:vir:4854 Length: 386 # 99.7 4.3E-16 2.7E-19 104.8 35.1 377 1-459 7-386 (386) 73 protein:vir:81218 Length: 423 99.7 6.4E-16 4E-19 103.9 35.5 394 1-455 9-423 (423) 74 protein:vir:960 Length: 413 # 99.7 3.6E-16 2.2E-19 105.2 33.1 381 1-455 4-413 (413) 75 protein:vir:4995 Length: 384 # 99.7 1.4E-16 8.8E-20 107.5 30.3 372 1-429 4-384 (384) 76 protein:vir:8317 Length: 409 # 99.7 4.5E-16 2.8E-19 104.7 32.1 364 1-435 36-409 (409) 77 protein:vir:100187 Length: 385 99.7 1.4E-15 8.5E-19 102.1 34.6 367 1-452 4-385 (385) 78 protein:vir:4828 Length: 382 # 99.7 1.5E-15 9.3E-19 101.8 34.4 371 1-441 7-382 (382) 79 protein:vir:4194 Length: 540 # 99.7 5.9E-15 3.7E-18 98.6 34.5 428 1-484 2-470 (540) 80 protein:vir:100882 Length: 383 99.7 6.4E-15 4E-18 98.4 34.5 367 1-453 4-383 (383) 81 protein:vir:100650 Length: 395 99.7 5.2E-15 3.2E-18 98.9 33.4 378 21-458 1-395 (395) 82 protein:vir:9507 Length: 395 # 99.7 5.2E-15 3.2E-18 98.9 33.4 378 21-458 1-395 (395) 83 protein:vir:101289 Length: 395 99.7 5.2E-15 3.2E-18 98.9 33.4 378 21-458 1-395 (395) 84 protein:vir:95378 Length: 406 99.7 1.4E-14 8.5E-18 96.6 35.0 384 1-463 7-406 (406) 85 protein:vir:5249 Length: 437 # 99.6 4.7E-14 2.9E-17 93.7 35.4 405 8-459 1-437 (437) 86 protein:vir:8100 Length: 466 # 99.6 2.3E-14 1.4E-17 95.4 32.2 417 1-458 8-466 (466) 87 protein:vir:104259 Length: 403 99.6 4.5E-14 2.8E-17 93.8 33.7 372 21-460 1-403 (403) 88 protein:vir:6210 Length: 394 # 99.6 3.4E-14 2.1E-17 94.4 33.1 378 1-456 7-394 (394) 89 protein:vir:4156 Length: 542 # 99.6 1E-13 6.3E-17 91.8 35.3 430 1-484 6-498 (542) 90 protein:vir:4089 Length: 395 # 99.6 4.9E-14 3E-17 93.6 32.2 375 21-456 1-395 (395) 91 protein:vir:95965 Length: 385 99.6 7.2E-14 4.5E-17 92.6 32.4 366 21-451 1-385 (385) 92 protein:vir:100691 Length: 535 99.6 1.3E-13 8.2E-17 91.2 33.2 440 1-481 36-535 (535) 93 protein:vir:78310 Length: 376 99.5 5.4E-13 3.3E-16 87.8 33.0 360 21-450 1-376 (376) 94 protein:vir:80134 Length: 403 99.5 8.8E-13 5.5E-16 86.7 33.6 375 1-453 7-403 (403) 95 protein:vir:9641 Length: 395 # 99.5 7.6E-13 4.7E-16 87.0 30.9 376 21-460 1-395 (395) 96 protein:vir:1082 Length: 359 # 99.5 9.5E-13 5.9E-16 86.5 31.2 348 1-421 1-359 (359) 97 protein:vir:78641 Length: 278 99.5 4E-13 2.5E-16 88.5 27.9 275 68-383 1-278 (278) 98 protein:vir:6382 Length: 553 # 99.5 3.4E-12 2.1E-15 83.4 32.0 441 1-451 12-553 (553) 99 protein:vir:94002 Length: 378 99.4 1.2E-12 7.4E-16 86.0 27.5 354 21-459 1-378 (378) 100 protein:vir:98643 Length: 395 99.4 4E-12 2.5E-15 83.1 29.8 376 21-460 1-395 (395) 101 protein:vir:3420 Length: 533 # 99.4 2.4E-12 1.5E-15 84.3 28.3 436 1-456 10-533 (533) 102 protein:vir:1661 Length: 378 # 99.4 1.4E-12 8.8E-16 85.5 24.0 355 21-459 1-378 (378) 103 protein:vir:389 Length: 530 # 99.4 4E-11 2.5E-14 77.6 31.9 437 1-456 7-530 (530) 104 protein:vir:107742 Length: 537 99.3 8.7E-11 5.4E-14 75.7 36.7 428 1-475 48-537 (537) 105 protein:vir:93867 Length: 378 99.3 6.9E-12 4.3E-15 81.8 24.0 359 21-457 1-378 (378) 106 protein:vir:79538 Length: 502 99.3 1.6E-10 1E-13 74.3 29.2 426 1-459 11-502 (502) 107 protein:vir:94049 Length: 532 99.2 3.6E-10 2.2E-13 72.3 36.5 440 1-480 1-532 (532) 108 protein:vir:80040 Length: 461 99.2 3.9E-10 2.4E-13 72.2 33.9 413 1-452 1-461 (461) 109 protein:vir:95542 Length: 548 99.2 7.7E-10 4.8E-13 70.5 29.0 460 1-476 11-548 (548) 110 protein:vir:79647 Length: 435 99.2 1E-09 6.2E-13 69.9 31.9 395 1-453 5-435 (435) 111 protein:vir:858 Length: 378 # 99.2 1.1E-10 6.9E-14 75.2 24.2 359 12-453 1-378 (378) 112 protein:vir:96738 Length: 505 99.1 1.2E-09 7.5E-13 69.5 30.7 440 1-459 15-505 (505) 113 protein:vir:94869 Length: 378 99.1 2.7E-10 1.7E-13 73.0 25.7 364 12-459 1-378 (378) 114 protein:vir:267 Length: 348 # 99.1 2.2E-09 1.3E-12 68.1 30.6 322 1-392 1-348 (348) 115 protein:vir:103971 Length: 376 99.1 3.6E-09 2.2E-12 66.9 27.9 316 1-395 26-376 (376) 116 protein:vir:78749 Length: 337 99.0 4.1E-09 2.5E-12 66.6 27.5 310 1-385 1-337 (337) 117 protein:vir:10321 Length: 495 99.0 5.3E-09 3.3E-12 65.9 30.9 425 1-459 9-495 (495) 118 protein:vir:1150 Length: 350 # 99.0 2.9E-09 1.8E-12 67.4 26.0 314 1-385 1-350 (350) 119 protein:vir:98567 Length: 340 99.0 4.7E-09 2.9E-12 66.2 26.4 314 1-385 1-340 (340) 120 protein:vir:99563 Length: 862 99.0 8.1E-09 5E-12 64.9 35.2 438 1-484 53-603 (862) 121 protein:vir:104338 Length: 422 99.0 8.9E-09 5.5E-12 64.7 32.6 384 8-445 1-422 (422) 122 protein:vir:6058 Length: 344 # 98.9 5.8E-09 3.6E-12 65.7 24.5 314 1-386 1-344 (344) 123 protein:vir:79207 Length: 351 98.9 1.6E-08 1E-11 63.3 26.3 317 1-395 1-351 (351) 124 protein:vir:96068 Length: 765 98.9 2.2E-08 1.3E-11 62.6 35.7 439 1-484 48-592 (765) 125 protein:vir:4698 Length: 251 # 98.9 2.3E-09 1.4E-12 67.9 20.6 243 1-288 1-251 (251) 126 protein:vir:100328 Length: 346 98.9 2.4E-08 1.5E-11 62.4 27.6 315 1-393 1-346 (346) 127 protein:vir:3743 Length: 345 # 98.9 2.7E-08 1.7E-11 62.0 28.9 323 1-387 1-345 (345) 128 protein:vir:2013 Length: 344 # 98.8 9.4E-09 5.8E-12 64.6 23.1 314 1-386 1-344 (344) 129 protein:vir:3780 Length: 345 # 98.8 3.2E-08 2E-11 61.7 26.1 317 1-387 1-345 (345) 130 protein:vir:5691 Length: 344 # 98.8 1.3E-08 8.3E-12 63.7 23.3 313 1-386 1-344 (344) 131 protein:vir:78191 Length: 351 98.8 4.3E-08 2.6E-11 61.0 27.3 316 1-389 1-351 (351) 132 protein:vir:107662 Length: 427 98.7 1.2E-07 7.3E-11 58.6 31.1 392 7-454 1-427 (427) 133 protein:vir:98444 Length: 434 98.6 1.5E-07 9.4E-11 58.0 27.2 387 37-466 1-434 (434) 134 protein:vir:94742 Length: 409 98.6 2.3E-07 1.4E-10 57.0 30.3 365 21-421 1-409 (409) 135 protein:vir:1634 Length: 409 # 98.6 2.4E-07 1.5E-10 56.9 30.7 366 21-421 1-409 (409) 136 protein:vir:94956 Length: 452 98.6 2.4E-07 1.5E-10 56.9 26.8 403 1-450 3-452 (452) 137 protein:vir:79150 Length: 368 98.6 2.8E-07 1.8E-10 56.5 25.0 319 1-403 1-368 (368) 138 protein:vir:105782 Length: 449 98.5 4.2E-07 2.6E-10 55.5 28.6 397 1-458 1-449 (449) 139 protein:vir:99916 Length: 504 98.5 5.9E-07 3.7E-10 54.7 25.7 446 1-478 1-504 (504) 140 protein:vir:105819 Length: 456 98.4 9.4E-07 5.8E-10 53.6 24.4 413 1-463 1-456 (456) 141 protein:vir:102602 Length: 456 98.4 9.4E-07 5.8E-10 53.6 24.4 413 1-463 1-456 (456) 142 protein:vir:9751 Length: 422 # 98.3 1.8E-06 1.1E-09 52.1 27.7 378 21-433 1-422 (422) 143 protein:vir:7987 Length: 456 # 98.3 1.8E-06 1.1E-09 52.1 22.6 414 1-463 1-456 (456) 144 protein:vir:9568 Length: 410 # 98.2 2.8E-06 1.7E-09 51.1 27.8 370 33-434 1-410 (410) 145 protein:vir:98853 Length: 219 98.1 1.1E-06 7E-10 53.2 16.9 206 159-386 1-219 (219) 146 protein:vir:7768 Length: 484 # 98.0 6.1E-06 3.8E-09 49.2 23.7 429 1-472 1-484 (484) 147 protein:vir:93747 Length: 472 98.0 6.6E-06 4.1E-09 49.0 30.5 409 1-462 1-472 (472) 148 protein:vir:95806 Length: 440 98.0 8.3E-06 5.2E-09 48.4 29.0 400 14-459 1-440 (440) 149 protein:vir:80453 Length: 535 97.8 1.6E-05 9.8E-09 46.9 29.6 435 1-477 17-535 (535) 150 protein:vir:2427 Length: 485 # 97.8 1.8E-05 1.1E-08 46.6 30.2 419 22-470 1-485 (485) 151 protein:vir:78537 Length: 480 97.8 2.2E-05 1.4E-08 46.1 28.9 420 8-471 1-480 (480) 152 protein:vir:78227 Length: 480 97.7 2.3E-05 1.4E-08 46.0 26.8 425 8-474 1-480 (480) 153 protein:vir:4223 Length: 486 # 97.7 2.7E-05 1.7E-08 45.6 24.3 423 22-469 1-486 (486) 154 protein:vir:733 Length: 453 # 97.7 2.7E-05 1.7E-08 45.6 27.2 401 1-453 3-453 (453) 155 protein:vir:80680 Length: 441 97.7 2.8E-05 1.7E-08 45.6 27.6 391 29-453 1-441 (441) 156 protein:vir:95113 Length: 474 97.6 3.5E-05 2.1E-08 45.0 29.0 409 1-462 6-474 (474) 157 protein:vir:5961 Length: 503 # 97.6 4E-05 2.5E-08 44.7 34.6 417 1-469 23-503 (503) 158 protein:vir:78083 Length: 537 97.6 4.4E-05 2.8E-08 44.4 32.6 421 1-481 1-537 (537) 159 protein:vir:99072 Length: 479 97.5 5.5E-05 3.4E-08 43.9 30.2 416 27-477 1-479 (479) 160 protein:vir:1236 Length: 483 # 97.5 5.7E-05 3.5E-08 43.9 29.4 412 1-469 12-483 (483) 161 protein:vir:105292 Length: 478 97.5 5.8E-05 3.6E-08 43.8 30.6 408 1-459 1-478 (478) 162 protein:vir:99522 Length: 470 97.5 5.9E-05 3.7E-08 43.8 29.0 410 1-459 1-470 (470) 163 protein:vir:8184 Length: 474 # 97.5 6.4E-05 4E-08 43.6 26.3 422 1-459 1-474 (474) 164 protein:vir:97336 Length: 492 97.4 7E-05 4.3E-08 43.4 28.3 410 1-459 24-492 (492) 165 protein:vir:95149 Length: 501 97.4 8.7E-05 5.4E-08 42.9 25.8 420 1-459 1-501 (501) 166 protein:vir:106571 Length: 499 97.2 0.00012 7.7E-08 42.0 36.7 411 1-469 1-499 (499) 167 protein:vir:101494 Length: 527 97.2 0.00013 7.9E-08 41.9 24.6 431 1-482 1-527 (527) 168 protein:vir:102239 Length: 527 97.2 0.00013 8.3E-08 41.8 24.6 431 1-482 1-527 (527) 169 protein:vir:97265 Length: 513 97.2 0.00014 9E-08 41.6 35.7 426 1-468 1-513 (513) 170 protein:vir:105154 Length: 525 97.2 0.00015 9.1E-08 41.6 18.6 435 1-479 27-525 (525) 171 protein:vir:94805 Length: 492 97.2 0.00015 9.3E-08 41.5 29.6 407 1-464 24-492 (492) 172 protein:vir:2341 Length: 488 # 97.1 0.00016 1E-07 41.4 30.5 407 1-457 1-488 (488) 173 protein:vir:104082 Length: 485 97.1 0.00017 1E-07 41.3 31.6 428 22-470 1-485 (485) 174 protein:vir:3964 Length: 453 # 97.1 0.00018 1.1E-07 41.1 27.0 410 1-459 11-453 (453) 175 protein:vir:95014 Length: 491 97.0 0.00021 1.3E-07 40.8 26.0 417 2-462 1-491 (491) 176 protein:vir:105461 Length: 470 96.9 0.00025 1.6E-07 40.3 30.1 379 21-459 1-470 (470) 177 protein:vir:105889 Length: 474 96.9 0.00028 1.8E-07 40.0 34.1 387 21-464 1-474 (474) 178 protein:vir:94101 Length: 474 96.9 0.00028 1.8E-07 40.0 34.1 387 21-464 1-474 (474) 179 protein:vir:80959 Length: 499 96.9 0.0003 1.9E-07 39.9 28.7 412 9-453 1-499 (499) 180 protein:vir:94498 Length: 474 96.8 0.00032 2E-07 39.7 30.5 409 1-469 6-474 (474) 181 protein:vir:97447 Length: 474 96.8 0.00032 2E-07 39.7 30.5 409 1-469 6-474 (474) 182 protein:vir:38 Length: 496 # N 96.8 0.00033 2E-07 39.7 24.8 397 1-453 27-496 (496) 183 protein:vir:79043 Length: 479 96.8 0.00036 2.2E-07 39.5 31.6 394 1-460 6-479 (479) 184 protein:vir:102330 Length: 451 96.7 0.00039 2.4E-07 39.3 27.0 379 21-447 1-451 (451) 185 protein:vir:5839 Length: 533 # 96.7 0.00043 2.7E-07 39.0 25.9 438 1-482 27-533 (533) 186 protein:vir:99781 Length: 511 96.7 0.00043 2.7E-07 39.0 29.4 407 1-469 40-511 (511) 187 protein:vir:9871 Length: 429 # 96.6 0.00047 2.9E-07 38.8 28.7 394 1-459 1-429 (429) 188 protein:vir:95899 Length: 474 96.5 0.00053 3.3E-07 38.5 29.5 412 1-461 1-474 (474) 189 protein:vir:96266 Length: 474 96.5 0.00053 3.3E-07 38.5 29.5 412 1-461 1-474 (474) 190 protein:vir:3609 Length: 452 # 96.5 0.00056 3.5E-07 38.4 29.9 401 7-459 1-452 (452) 191 protein:vir:104500 Length: 537 96.5 0.0006 3.7E-07 38.2 29.2 438 1-469 15-537 (537) 192 protein:vir:2500 Length: 501 # 96.4 0.00062 3.9E-07 38.2 31.2 426 5-468 1-501 (501) 193 protein:vir:97171 Length: 512 96.4 0.00066 4.1E-07 38.0 29.2 435 1-469 1-512 (512) 194 protein:vir:106639 Length: 481 96.3 0.00072 4.5E-07 37.8 30.2 424 4-457 1-481 (481) 195 protein:vir:107112 Length: 478 96.2 0.00094 5.8E-07 37.2 30.7 408 1-459 1-478 (478) 196 protein:vir:78805 Length: 511 96.0 0.0011 6.9E-07 36.8 29.5 415 1-469 40-511 (511) 197 protein:vir:96366 Length: 511 96.0 0.0011 6.9E-07 36.8 29.5 415 1-469 40-511 (511) 198 protein:vir:102950 Length: 471 95.8 0.0015 9.2E-07 36.1 28.5 383 21-459 1-471 (471) 199 protein:vir:96240 Length: 511 95.7 0.0015 9.6E-07 36.0 29.3 415 1-469 40-511 (511) 200 protein:vir:78393 Length: 489 95.7 0.0016 9.8E-07 35.9 26.4 415 2-462 1-489 (489) 201 protein:vir:106999 Length: 564 95.6 0.0019 1.1E-06 35.6 24.2 451 1-478 21-564 (564) 202 protein:vir:103177 Length: 533 95.4 0.0021 1.3E-06 35.3 28.9 449 1-477 14-533 (533) 203 protein:vir:2732 Length: 501 # 95.3 0.0023 1.4E-06 35.0 32.6 424 1-472 37-501 (501) 204 protein:vir:106282 Length: 521 95.0 0.003 1.9E-06 34.4 25.8 415 1-445 32-521 (521) 205 protein:vir:96179 Length: 468 94.9 0.0032 2E-06 34.3 30.5 395 1-455 1-468 (468) 206 protein:vir:9306 Length: 511 # 94.8 0.0036 2.2E-06 34.0 29.4 432 12-469 1-511 (511) 207 protein:vir:103951 Length: 511 94.5 0.0043 2.7E-06 33.6 31.6 432 12-469 1-511 (511) 208 protein:vir:96839 Length: 474 94.1 0.0053 3.3E-06 33.0 31.8 398 1-461 20-474 (474) 209 protein:vir:4898 Length: 502 # 93.9 0.006 3.7E-06 32.8 31.3 412 1-459 38-502 (502) 210 protein:vir:96494 Length: 501 93.2 0.0086 5.3E-06 31.9 33.8 434 1-459 1-501 (501) 211 protein:vir:5665 Length: 511 # 92.7 0.01 6.4E-06 31.5 23.3 409 1-442 24-511 (511) 212 protein:vir:6896 Length: 523 # 92.0 0.013 8.3E-06 30.8 25.0 419 1-445 32-523 (523) 213 protein:vir:103219 Length: 201 91.2 0.017 1.1E-05 30.3 14.2 189 226-449 1-201 (201) 214 protein:vir:81017 Length: 521 90.8 0.019 1.2E-05 30.0 27.3 425 1-445 31-521 (521) 215 protein:vir:98883 Length: 517 90.7 0.019 1.2E-05 30.0 29.2 408 9-453 1-517 (517) 216 protein:vir:101806 Length: 516 90.7 0.02 1.2E-05 30.0 20.6 417 1-442 30-516 (516) 217 protein:vir:101189 Length: 516 90.7 0.02 1.2E-05 30.0 20.6 417 1-442 30-516 (516) 218 protein:vir:104892 Length: 558 90.3 0.021 1.3E-05 29.7 28.2 448 1-474 15-558 (558) 219 protein:vir:6596 Length: 521 # 90.2 0.022 1.4E-05 29.7 27.6 428 1-445 31-521 (521) 220 protein:vir:94599 Length: 641 90.2 0.022 1.4E-05 29.6 21.8 457 1-482 4-641 (641) 221 protein:vir:94546 Length: 506 90.1 0.023 1.4E-05 29.6 29.0 411 1-467 22-506 (506) 222 protein:vir:98265 Length: 524 90.0 0.023 1.4E-05 29.5 26.1 423 1-445 36-524 (524) 223 protein:vir:103458 Length: 524 89.7 0.025 1.5E-05 29.4 23.1 418 1-445 32-524 (524) 224 protein:vir:7208 Length: 524 # 89.4 0.027 1.7E-05 29.2 23.1 418 1-445 32-524 (524) 225 protein:vir:78907 Length: 518 88.4 0.033 2E-05 28.7 32.4 393 19-451 1-518 (518) 226 protein:vir:101541 Length: 694 88.2 0.034 2.1E-05 28.6 29.8 432 1-484 56-598 (694) 227 protein:vir:3028 Length: 500 # 88.0 0.035 2.2E-05 28.6 21.3 398 1-459 28-500 (500) 228 protein:vir:9815 Length: 500 # 88.0 0.035 2.2E-05 28.6 21.3 398 1-459 28-500 (500) 229 protein:vir:108049 Length: 524 87.9 0.036 2.2E-05 28.5 23.9 415 1-445 34-524 (524) 230 protein:vir:80165 Length: 651 87.6 0.037 2.3E-05 28.4 25.7 453 1-482 3-651 (651) 231 protein:vir:4782 Length: 522 # 87.1 0.041 2.5E-05 28.2 30.6 421 9-466 1-522 (522) 232 protein:vir:1587 Length: 508 # 86.8 0.043 2.7E-05 28.1 30.9 408 9-451 1-508 (508) 233 protein:vir:9922 Length: 489 # 84.9 0.057 3.5E-05 27.4 29.3 414 7-464 1-489 (489) 234 protein:vir:3648 Length: 695 # 84.6 0.059 3.7E-05 27.3 30.1 435 1-484 36-599 (695) 235 protein:vir:106716 Length: 698 83.2 0.07 4.4E-05 26.9 32.0 438 1-484 57-602 (698) 236 protein:vir:79703 Length: 505 81.7 0.083 5.1E-05 26.5 29.9 393 1-451 14-505 (505) 237 protein:vir:100598 Length: 516 81.0 0.09 5.6E-05 26.3 21.6 422 1-442 30-516 (516) 238 protein:vir:78589 Length: 695 75.6 0.14 9E-05 25.2 31.0 437 1-484 57-599 (695) 239 protein:vir:7430 Length: 563 # 37.5 1.1 0.00069 20.3 28.3 427 1-464 9-563 (563) 240 protein:vir:2198 Length: 536 # 35.1 1.3 0.00078 20.0 19.8 437 1-480 1-536 (536) 241 protein:vir:102668 Length: 547 34.6 1.3 0.00079 20.0 21.5 413 1-474 31-547 (547) 242 protein:vir:10447 Length: 536 34.0 1.3 0.00082 19.9 19.3 438 1-471 1-536 (536) 243 protein:vir:78696 Length: 542 30.3 1.6 0.00099 19.5 22.6 425 1-470 1-542 (542) 244 protein:vir:103765 Length: 549 25.8 2 0.0012 18.9 19.1 408 1-482 37-549 (549) 245 protein:vir:100039 Length: 522 21.9 2.5 0.0016 18.4 21.5 439 1-480 1-522 (522) No 1 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=100.00 E-value=8.4e-121 Score=678.88 Aligned_cols=460 Identities=43% Similarity=0.694 Sum_probs=384.0 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) .+|.+|++ ++|++...+- ......+...++.++||+++++++|++|+++|+||+++|++|+++|++++|+|+|+++ T Consensus 5 ~~~~~p~~--~~g~~~~~~~--~~~~~~~~~~e~~~~lr~~~~~~ly~~m~e~D~~i~s~l~~rk~av~~~~w~v~p~~~ 80 (469) T protein:vir:10 5 VKTAAPVS--EAGYVFGSGV--VDGWTVWDPFEQTPELQWPQSVAVYSRMDNEDSRVTSLLEAISLPIRSTPWRIRANGA 80 (469) T ss_pred ccCCCCcc--chhhhhhccc--ccchhhccccccccccccccchHHHHHHHhhChHHHHHHHHHHHHHhcCCceEecCCC Confidence 44444332 3333333211 1112345567788899999999999999988999999999999999999999999999 Q ss_pred CHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHH-HHhhcceeeeEEEeec----CCeeeeeeeeeeCccc Q lcl|NC_021302. 81 RPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALK-SLQFGHAVFEQTYFYE----GGRFWLKRLAPRPQSS 155 (484) Q Consensus 81 ~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~-a~~~G~s~~Eivw~~~----~g~~~~~~l~~r~~~~ 155 (484) +++++++++++|+..+..........+...+.+|.++|.++|+ |++|||||+|+||+.. +|+|.|.+|.+|||++ T Consensus 81 ~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG~~~~~~l~~rp~~~ 160 (469) T protein:vir:10 81 SDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDGRFWLRKLAPRPQWT 160 (469) T ss_pred CHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCCceeeeeeeecCccc Confidence 9999999999999888777666655666667788888888776 8999999999999864 6899999999999999 Q ss_pred eeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 156 IAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIE 235 (484) Q Consensus 156 ~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w 235 (484) |.+|.++.+++++.++|....... ....+..+..+++||++|||+|+|+++++||||.|||+.|||+|+||++++++| T Consensus 161 i~~~~~~~~~~l~~~~~~~~~~~~--~~~~~~~~~~~~~lp~~k~i~~~~~~~~g~p~g~gLlr~~~~~~~fK~~~~~~w 238 (469) T protein:vir:10 161 ISKFNVAPDGGLESIEQIAPPART--RGSLYVANIAPPEIPVNRLVVYTRNKRPGQWQGKSILRSAYKHWLLKDKLLRIE 238 (469) T ss_pred ceeeeeccCCceeeeeecCccccc--ccccccCCCCccccccCcEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHH Confidence 999999999999988886543322 233344566789999999999999999999999999999999999999999999 Q ss_pred HHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHH Q lcl|NC_021302. 236 AAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVA 315 (484) Q Consensus 236 ~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~i 315 (484) +.|+||| |+|+++|||++++++++++.|++++++|++|+++++|||.|++|||++++|++.+|.+|++|||++|||+| T Consensus 239 ~~f~Ery--G~P~~vgky~~~a~~~ek~~l~~a~~~~~~g~~a~~iip~~~~ie~~ea~g~~~~~~~li~~~d~~Isk~i 316 (469) T protein:vir:10 239 AATAERN--GMGIPVGTASSATDEDEVRKMAALARSVRGGINAGVGLAQGQILELLGVSGNLPDIRRAIEGHDRSIALSG 316 (469) T ss_pred HHHHHHc--CCcceEEecCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEeecCCCchHHHHHHHHHHHHHHHHH Confidence 9999997 88999999999999999999999999999999999999999999999999998999999999999999999 Q ss_pred hhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCCCcHHHHHHHHHHH Q lcl|NC_021302. 316 LAHFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIGSRQDATAAALQML 395 (484) Q Consensus 316 lGqtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~~~~~~~ae~~~~L 395 (484) ||||||++++|||+|+|+||++++.+++++|+++|+++||+|||++|+.+||+++.++|+|+|++.+++++..++++++| T Consensus 317 LG~tlTs~~~gGS~a~~~vh~ev~~d~~~sDa~~i~~tln~~li~~l~~lN~g~~~~~P~~~~~~~e~~~~~~a~~i~~l 396 (469) T protein:vir:10 317 LAHFLNLDGKGGSYALASVLEDPFTQAVHAYATSICRIANQHIIEDLVDINFGVDTPAPVLTFDPIGSRQDLTAAAVKLL 396 (469) T ss_pred hcccccccCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEecCCCCcHHHHHHHHHHH Confidence 99999999889999999999999999999999999999999999999999999999999999998888888999999999 Q ss_pred HhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccccccccccccccccccchHHHhcC Q lcl|NC_021302. 396 VNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSPRDRRKT 475 (484) Q Consensus 396 ~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 475 (484) +++|+++.+++.++|++|+||||+|.++++++.+......+ .+.. .+.....+....++.+..++...+..+ T Consensus 397 ~~~G~~~~~~~~~~~~~e~~gip~~~~~~~~~~~~~~~~~~-~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~l~d 468 (469) T protein:vir:10 397 YDAGVFDDDPAVKRAIRQRFNLPSELNDTPSAEPEEPAAVP-NQSA-------APARTRSSGNADARARAPKADQGVLFD 468 (469) T ss_pred HhcCCccCccccHHHHHHHhCCCCCCCCcccccchhcccCC-CCCc-------cccccCCCCCcccccccCCChHHhhcc Confidence 99999998888999999999999999888776543222111 1111 111111122223333334444444444 Q ss_pred c Q lcl|NC_021302. 476 P 476 (484) Q Consensus 476 ~ 476 (484) + T Consensus 469 a 469 (469) T protein:vir:10 469 A 469 (469) T ss_pred C Confidence 4 No 2 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=100.00 E-value=5.3e-111 Score=625.16 Aligned_cols=444 Identities=22% Similarity=0.343 Sum_probs=353.2 Q ss_pred CCCCCCCcc-------ceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCc Q lcl|NC_021302. 1 MAPKTVAPR-------TERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDW 73 (484) Q Consensus 1 ~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~ 73 (484) ||.+.-... .+|++....+..+. + -++..++||+++++++|++|++ |+||+++|++|+++|++++| T Consensus 1 ~~~~~~~~~gl~p~rl~~i~~~~~~~~~~~-----~-~~~~~~~Lr~~~~~~ly~~m~~-D~hi~s~l~~Rk~av~~~~w 73 (488) T protein:vir:95 1 MADITETQESLPPFRMGEVGSLGLKVKNGR-----I-YEEPRQALRFPESIKTFQLMMR-DPAVAASVNIIKMFVRKVNW 73 (488) T ss_pred CCCccccCCCCCHHHHHHHHHHhhccccch-----h-hccchhhhcccchHHHHHHHhh-ChHHHHHHHHHHHHHhcCCc Confidence 766543222 23332222222221 1 1455578999999999999985 99999999999999999999 Q ss_pred EEecCCCCHH------HHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEee--------- Q lcl|NC_021302. 74 RIRPNGARPE------VVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFY--------- 138 (484) Q Consensus 74 ~v~p~~~~~e------~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~--------- 138 (484) +|+|++.+++ +++++.+++.. ...+|+++|++||+|++|||||+|++|++ T Consensus 74 ~v~p~~~~~~d~~~~~~a~~v~~~l~~---------------~~~~~~~~i~~~lda~~~G~s~~Eivw~~~~~~~~~~~ 138 (488) T protein:vir:95 74 RFVPPKGKEQDPKMLERADFFNSLMDD---------------MEHDWADFINSVMSFCTYGFCVNEKVYKKRQGKKGKYQ 138 (488) T ss_pred eEecCCCCchhHHHHHHHHHHHHHHhc---------------cCccHHHHHHHHHHhhcccceeeeeeeecccccccccc Confidence 9999875543 23444444421 13469999999999999999999999975 Q ss_pred ---cCCeeeeeeeeeeCccceeeeeecCCCcee-eeecccccccccccceeccCCCCcccccccceEEEeecCccCcccc Q lcl|NC_021302. 139 ---EGGRFWLKRLAPRPQSSIAYWNVDRDGGLI-SIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTG 214 (484) Q Consensus 139 ---~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~-~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G 214 (484) ++|+|.|++|.+|||.++.+|.|+.+++++ +.+|...+...............++.||++|||+|+|+++++|||| T Consensus 139 ~~~~dg~~~~~~i~~Rpq~~~~~f~~d~d~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~kfi~~~~~~~~g~p~g 218 (488) T protein:vir:95 139 SKFDDGLIGWAKLPIRNQSTLDKWYFDEDFRRVTGVRQNLRNVSHIAGAINLGERPLTRKLPRAKFMLFKYDDEYGNPEG 218 (488) T ss_pred ccccCCeeeeeeeeecCcccccceeeccCCCceeecccccccccccccccccccccccccccccceEEEeecCCCCccch Confidence 479999999999999999999999999875 5556555544444444445567888999999999999999999999 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecC--CCCCCHHHHHHHHHHHHHH----hcCCceEEEccCCceE Q lcl|NC_021302. 215 NSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNE--ADSEDDDRMDELLEIASNY----SGGESAGLALTAGEEA 288 (484) Q Consensus 215 ~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~--~~~~~~~~~~~l~~~l~~~----~~g~~a~~vip~~~~i 288 (484) .|||+.|||+|+||++++++|+.|+||||+|+|+++|++ ..++++++++.+++++.+| ++++++++|||.++++ T Consensus 219 ~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag~iiP~g~~~ 298 (488) T protein:vir:95 219 RSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRAGLIWPRYIDP 298 (488) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchhheeecccccc Confidence 999999999999999999999999999999999999964 4566677777777777665 4566788999999987 Q ss_pred EEe---------cccC-CchhHHHHHHHHHHHHHHHHhhhhhccc-ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 289 GIL---------SPNG-TPLDPRRAIEYHDHQMALVALAHFLNLD-GKGGSYALASVQADTFVQSVQTVADEIRDVAQAH 357 (484) Q Consensus 289 e~~---------~~~~-~~~~~~~li~~~d~~Isk~ilGqtlt~~-~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~q 357 (484) ++. ++++ +..+|.+||+|||++|||+|||||||++ ++|||+|+|+||++|+++++++|+++|+++||+| T Consensus 299 ~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~iLGqtLT~~~~~~Gs~Al~~vh~ev~~~i~~aDa~~i~~tln~~ 378 (488) T protein:vir:95 299 DTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAFMSDVLAMGQSKYGSFSLADSKTSLLAMSVDILLKQIKNVINRD 378 (488) T ss_pred ccchhhhhhhccccccCCchhHHHHHHHHHHHHHHHHhccccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 763 3443 3457999999999999999999999985 4579999999999999999999999999999999 Q ss_pred HHHHHHHhCCCCccccceEEecCCC-CcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCC Q lcl|NC_021302. 358 VVEDIVDVNWGEDEPAPLLVFDEIG-SRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQD 436 (484) Q Consensus 358 li~~l~~~Nf~~~~~~P~~~~~~~~-~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~ 436 (484) ||++|+.+||++..++|+|+|+..+ +|++++++++++|+++|++++++..++|++|+||||.|.++++++.+.+....+ T Consensus 379 li~~l~~~Nfg~~~~~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gip~~~~~e~~~~~~~~~~~~ 458 (488) T protein:vir:95 379 LVAQTYALNMWDDEEHVQITYDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIGLPPADESQPVSEKLSPNSQS 458 (488) T ss_pred HHHHHHHhcCCCCCCccEEEecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCccccccCCCCCCC Confidence 9999999999999999999998655 677999999999999999998877789999999999998887766543222111 Q ss_pred CccccCCCCccccccccccccccccccccccchHHHhcCcc Q lcl|NC_021302. 437 EPETDEPALPNTSGTTSTTNAPQARKRPRGRSPRDRRKTPD 477 (484) Q Consensus 437 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (484) . .+.....+...+++.+.+.+++.+|++.. T Consensus 459 ~-----------~~~~~~~~~~~~~~~~~~~~~~~a~~~~~ 488 (488) T protein:vir:95 459 R-----------SGDGYKTAGEGTAKTPSAKDPSTANKANK 488 (488) T ss_pred C-----------CCcccCCCcccCCcccccccchhhhhccC Confidence 1 11111222334455666777777887766 No 3 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=100.00 E-value=3.3e-109 Score=615.34 Aligned_cols=444 Identities=16% Similarity=0.178 Sum_probs=326.4 Q ss_pred CCCCCCC-ccc-ee-eee-----cccccch-hhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCC Q lcl|NC_021302. 1 MAPKTVA-PRT-ER-GYV-----NPLAGFG-TFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRT 71 (484) Q Consensus 1 ~~~~~~~-~~~-~~-~~~-----~~~~~~~-~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~ 71 (484) +.++... +.+ ++ ++. ++.++.. ..+...|+..+.. ++ ...++||++|+++|+||+++|++|+++|+++ T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~~g-d~--~~~~~L~edm~e~D~~i~s~l~~Rk~av~~~ 88 (526) T protein:vir:79 12 IRPQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQG-NL--QAQAELFMDMEERDAHLFAEMSKRKRAILGL 88 (526) T ss_pred cCccccchhhhhhhhhhhhhcccCCCCCcCHHHHHHHHHHhhCC-CH--HHHHHHHHHHHhhChHHHHHHHHHHHHHhCC Confidence 2222111 000 11 111 1222222 1344556666553 22 2358899999989999999999999999999 Q ss_pred CcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCeeeeeeeeee Q lcl|NC_021302. 72 DWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGRFWLKRLAPR 151 (484) Q Consensus 72 ~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~~~~~~l~~r 151 (484) +|.|+|++++.+..+.+++.++..+ .+..+|+++|++||+|++|||||+||+|+.++|.|.|++|.+| T Consensus 89 ~w~I~p~~~~~~~~~~~a~~v~~~l------------~~~~~~~~~i~~~ldA~~~G~s~~Ei~w~~~~g~~~~~~l~~r 156 (526) T protein:vir:79 89 DWAVEPPRNASAAEKADADYLHELL------------LDLEGLEDLLLDALDGIGHGYSCIELEWALQGREWMPLAFHHR 156 (526) T ss_pred CceEecCCCCChHHHHHHHHHHHHH------------hcccCHHHHHHHHHhhhhhcceeEEEEEeecCCceeEEEeeee Confidence 9999998765444444555544332 1334799999999999999999999999999999999999999 Q ss_pred CccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHH Q lcl|NC_021302. 152 PQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDEL 231 (484) Q Consensus 152 ~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~ 231 (484) ||+||. |++++++.. .......+++++|++|||+|+|+++++||||.||+|.|||+|+||+++ T Consensus 157 ~~~~F~---~~~~~~~~l--------------~~~~~~~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~ 219 (526) T protein:vir:79 157 PQSWFQ---LNPEDQNEL--------------RLRDNSPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYA 219 (526) T ss_pred cccceE---eccCCCcEE--------------EecCCCCCceeecCCceEEEeecCCcCCccccchHHHHHHHHHHHHhh Confidence 999875 455554321 112235678899999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccC-CchhHHHHHHHHHHH Q lcl|NC_021302. 232 IRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNG-TPLDPRRAIEYHDHQ 310 (484) Q Consensus 232 ~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~-~~~~~~~li~~~d~~ 310 (484) +++|+.|+||| |+|+++|||+++++++++++|+++|++| ++++++|||.|++|||+++++ ++..|.+|++|||++ T Consensus 220 ~~~w~~F~E~y--G~P~~igky~~~a~~~ek~~L~~av~~i--~~da~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~ 295 (526) T protein:vir:79 220 TSDLAEMLEIY--GLPIRLGKYPPGTADEEKATLLRAVTGL--GHAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDA 295 (526) T ss_pred HHHHHHHHHHc--CCceEEEecCCCCCHHHHHHHHHHHHHH--hcCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHH Confidence 99999999997 8999999999999999999999999999 446899999999999999764 446799999999999 Q ss_pred HHHHHhhhhhccc---ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc---cccceEEecC-CCC Q lcl|NC_021302. 311 MALVALAHFLNLD---GKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGED---EPAPLLVFDE-IGS 383 (484) Q Consensus 311 Isk~ilGqtlt~~---~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~---~~~P~~~~~~-~~~ 383 (484) |||+||||||||+ ++|||+|+|+||++|+++++++|+++|+++||+|||++|+.+||+.. ..+|+|+|+. .++ T Consensus 296 Isk~iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~e 375 (526) T protein:vir:79 296 ISKAVLGGTLTSTTSQSGGGAFALGQVHNEVRHDILASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQA 375 (526) T ss_pred HHHHHhhhhhccccccCcchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcc Confidence 9999999999995 34689999999999999999999999999999999999999999854 4589999985 457 Q ss_pred cHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccccccccc----c Q lcl|NC_021302. 384 RQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAP----Q 459 (484) Q Consensus 384 ~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~ 459 (484) |++.+++++++|+++|+.+ +++|++++||||.|+++++++.+.+.+..+...................... + T Consensus 376 Dl~~~a~~~~~L~~~G~~i----~~~~i~e~~gip~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 451 (526) T protein:vir:79 376 DITSMAQSIPALVNVGLEI----PSAWVYDKLGIPQPAKNEPVLRPAAQPAILSRQHGQRVAALATIVGPRYGDQQALDK 451 (526) T ss_pred cHHHHHHHHHHHHhCCCcC----CHHHHHHHhCCCCCCCchhhccccCCccccccccccccccccccccccCchhhHHHH Confidence 8899999999999999965 4789999999999999888776544333222221111111100000000000 0 Q ss_pred ccccccccchHH---HhcCcc-----cCcccCC Q lcl|NC_021302. 460 ARKRPRGRSPRD---RRKTPD-----GAMPLWD 484 (484) Q Consensus 460 ~~~~~~~~~~~~---~~~~~~-----~~~~~~~ 484 (484) +......++... ....+- -+.++=+ T Consensus 452 ~l~~~~~~~~~~~~~~~~~~i~~~~~~~~s~ee 484 (526) T protein:vir:79 452 ALADLPAKDMQNQANDLLAPLLDAVNRGDSETE 484 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHH Confidence 000000000000 000000 0000000 No 4 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=100.00 E-value=5.1e-108 Score=608.79 Aligned_cols=444 Identities=16% Similarity=0.158 Sum_probs=325.9 Q ss_pred CCCCCCCccc-ee-eeecc-----cccch-hhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCC Q lcl|NC_021302. 1 MAPKTVAPRT-ER-GYVNP-----LAGFG-TFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTD 72 (484) Q Consensus 1 ~~~~~~~~~~-~~-~~~~~-----~~~~~-~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~ 72 (484) -.+....+.+ ++ ++.++ .++.. ..+...|+..+.. ++ ...++||++|+++|+||+++|++|+++|++++ T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~~g-d~--~~~~~L~e~m~e~D~~i~s~l~~Rk~av~~~~ 89 (526) T protein:vir:99 13 RTQQLREPQTSRLAGLAKEFAQHPAKGLTPAKLARILVEAEQG-NL--QAQAELFMDMEERDAHLFAEMSKRKRAILGLD 89 (526) T ss_pred ccccccchhhhhhhhhhhhhcccCcCCCCHHHHHHHHHhhhCC-CH--HHHHHHHHHHHhhChHHHHHHHHHHHHHhCCC Confidence 1111111111 11 11111 12221 1344455555553 22 24678999999899999999999999999999 Q ss_pred cEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCeeeeeeeeeeC Q lcl|NC_021302. 73 WRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGRFWLKRLAPRP 152 (484) Q Consensus 73 ~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~ 152 (484) |.|+|++++.+..+.+++.++..+ .+..+|+++|++||+|++|||||+|++|+.++|.|.|.++.+|| T Consensus 90 w~I~p~~~~~~~~~~~a~~v~~~l------------~~~~~~~~~i~~~lda~~~G~s~~Eivw~~~~g~~~~~~l~~r~ 157 (526) T protein:vir:99 90 WAVEPPRNASAAEKADADYLHELL------------LDLEGLEDLLLDALDGIGHGYSCIELEWALQGREWMPLAFHHRP 157 (526) T ss_pred ceEecCCCCCHHHHHHHHHHHHHH------------hcccCHHHHHHHHHHhhhhcceeEEEEEeecCCceeEEEeeeec Confidence 999998765444445555554332 13347999999999999999999999999999999999999999 Q ss_pred ccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHH Q lcl|NC_021302. 153 QSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELI 232 (484) Q Consensus 153 ~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~ 232 (484) |+||. |++++++.. .......+++++|++|||+|+|+++++||||.||++.|||+|+||++++ T Consensus 158 ~~~f~---~~~~~~~~l--------------~~~~~~~~g~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~ 220 (526) T protein:vir:99 158 QSWFQ---LNPEDQNEL--------------RLRDNSPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYAT 220 (526) T ss_pred cccee---eccCCCcEE--------------EecCCCCCceeecCCCeEEEeecCCcCCccccchHHHHHHHHHHHHhhH Confidence 99774 455554321 1122356788999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccC-CchhHHHHHHHHHHHH Q lcl|NC_021302. 233 RIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNG-TPLDPRRAIEYHDHQM 311 (484) Q Consensus 233 ~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~-~~~~~~~li~~~d~~I 311 (484) ++|+.|+||| |+|+++|||+.+++++++++|+++|++|. +++++|||.|++|||+++++ ++..|.+|++|||++| T Consensus 221 ~~w~~f~E~y--G~P~~igky~~~a~~~ek~~L~~av~~i~--~d~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~I 296 (526) T protein:vir:99 221 SDLAEMLEIY--GLPIRLGKYPPGTADEEKATLLRAVTGLG--HAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAI 296 (526) T ss_pred HHHHHHHHHc--CCceEEEecCCCCCHHHHHHHHHHHHHHh--hCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHH Confidence 9999999997 89999999999999999999999999994 45899999999999999764 4467999999999999 Q ss_pred HHHHhhhhhccc---ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc---cccceEEecCC-CCc Q lcl|NC_021302. 312 ALVALAHFLNLD---GKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGED---EPAPLLVFDEI-GSR 384 (484) Q Consensus 312 sk~ilGqtlt~~---~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~---~~~P~~~~~~~-~~~ 384 (484) ||+||||||||+ +++||+|+|+||++|+++++++|+++|+++||+|||++|+.+||+.. ..+|+|+|+.. ++| T Consensus 297 sk~iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eD 376 (526) T protein:vir:99 297 SKAVLGGTLTSTTSQSGGGAFALGQVHNEVRHDLLASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQAD 376 (526) T ss_pred HHHHhhhhhccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCccc Confidence 999999999985 34689999999999999999999999999999999999999999854 45799999854 578 Q ss_pred HHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccccccccccccc--- Q lcl|NC_021302. 385 QDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQAR--- 461 (484) Q Consensus 385 ~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 461 (484) ++++++++++|+++|+.+ +++|++++||||.|.++++++.+++.+..+......................++. T Consensus 377 l~~~a~~~~~L~~~G~~i----~~~~i~e~~Gip~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 452 (526) T protein:vir:99 377 ITSMAQSIPALVNVGLEI----PSAWVYDKLGIPQPAKNEPVLRSAAQPAILSRQHGQRVAALATIVGPRYGDQQALDKA 452 (526) T ss_pred HHHHHHHHHHHHhCCCcc----CHHHHHHHhCCCCCCCcccccCCCCCCcccccccccccccccccccccCcchhhHHHH Confidence 899999999999999965 4789999999999999988876554433322221111110000000000000000 Q ss_pred -ccccccchH---HHhcCcc-----cCcccCC Q lcl|NC_021302. 462 -KRPRGRSPR---DRRKTPD-----GAMPLWD 484 (484) Q Consensus 462 -~~~~~~~~~---~~~~~~~-----~~~~~~~ 484 (484) .....++.. .....+- -+.++=+ T Consensus 453 l~~~~~~~~~~~~~~~l~~i~~~l~~~~s~ee 484 (526) T protein:vir:99 453 LADLPAKDMQNQANDLLAPLLEAVNRGDSETE 484 (526) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHH Confidence 000000000 0000000 0000000 No 5 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=100.00 E-value=2.6e-107 Score=604.90 Aligned_cols=441 Identities=16% Similarity=0.163 Sum_probs=322.6 Q ss_pred CCCCCCCccceeeee-----cccccch-hhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcE Q lcl|NC_021302. 1 MAPKTVAPRTERGYV-----NPLAGFG-TFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWR 74 (484) Q Consensus 1 ~~~~~~~~~~~~~~~-----~~~~~~~-~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~ 74 (484) +.|.+ ...+ ++. ++.++.. ..+...|+..+.. .+ ...++||++|+++|+||+++|++|+++|++++|+ T Consensus 18 ~~~~~-~~~~--~~~~~~~~~~~~gltp~~l~~il~~a~~g-d~--~~~~~L~~~m~e~D~~i~s~l~~Rk~av~~~~w~ 91 (528) T protein:vir:10 18 RKQQT-AHLA--GLAKEFANHPAKGLTPAKLAHILIEAEQG-HL--QAQAELFMDMEERDAHLFAEMSKRKRAVLGLDWT 91 (528) T ss_pred cchhh-hhhh--hhhhhhcccCCCCCCHHHHHHHHHhhhCC-CH--HHHHHHHHHHHhhChHHHHHHHHHHHHHhcCCce Confidence 22211 1111 111 1222222 1344555555543 22 3468899999989999999999999999999999 Q ss_pred EecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCeeeeeeeeeeCcc Q lcl|NC_021302. 75 IRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQS 154 (484) Q Consensus 75 v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~ 154 (484) |+|++++....+.+++.++..+ .+..+|+++|++||+|++|||||+|++|..++|.|.|+++.+|||+ T Consensus 92 I~p~~~~~~~~~~~a~~v~~~l------------~~~~~f~~~i~~~lda~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~~ 159 (528) T protein:vir:10 92 IEPPRNASAAEKADAEYLHELL------------LDLEGIEDLMLDCMDGVGHGYSAIELDWSLQGREWLPQAFDHRPQS 159 (528) T ss_pred EecCCCCCHHHHHHHHHHHHHH------------hCCccHHHHHHHHHhhhhhcceeEEEEEeecCCceeEEEeeeeccc Confidence 9998655433444444443332 1234799999999999999999999999999999999999999998 Q ss_pred ceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 155 SIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRI 234 (484) Q Consensus 155 ~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~ 234 (484) || .|++++++.... .....+++++|++||++|+|+.+++||||.|||+.|||+|+||++++++ T Consensus 160 ~f---~~~~~~~~~l~~--------------~~~~~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~ 222 (528) T protein:vir:10 160 WF---QLNPDDQDELRL--------------RDNSIAGEVLQPFGWIMHKPRSRSGYVARSGLFRVLAWPYLFKHYSTAD 222 (528) T ss_pred ce---eeccCCCcEEec--------------cCCCCCceeecCCCeEEEeecCCCCCccccchHHHHHHHHHHHHhhHHH Confidence 76 455555543221 1234578899999999999999999999999999999999999999999 Q ss_pred HHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccC-CchhHHHHHHHHHHHHHH Q lcl|NC_021302. 235 EAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNG-TPLDPRRAIEYHDHQMAL 313 (484) Q Consensus 235 w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~-~~~~~~~li~~~d~~Isk 313 (484) |+.|+||| |+|+++|||+.+++++++++|+++|++|. +++++|||.|++|||+++++ ++..|.+|++|||++||| T Consensus 223 w~~f~E~y--G~P~~igky~~~a~~~ek~~L~~al~~i~--~~~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk 298 (528) T protein:vir:10 223 LAEMLEIY--GLPIRLGKYPPGTPDEEKVTLLRAVTGLG--HAAAGIIPESMSIDFQEASKGSAEPFMAMMRWCDDSMSK 298 (528) T ss_pred HHHHHHHc--CCCeEEEecCCCCCHHHHHHHHHHHHHHh--hCcEEEecCCceeEEeecCCCChhHHHHHHHHHHHHHHH Confidence 99999997 89999999999999999999999999994 45899999999999999764 456799999999999999 Q ss_pred HHhhhhhcccc---cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc---cccceEEecCC-CCcHH Q lcl|NC_021302. 314 VALAHFLNLDG---KGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGED---EPAPLLVFDEI-GSRQD 386 (484) Q Consensus 314 ~ilGqtlt~~~---~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~---~~~P~~~~~~~-~~~~~ 386 (484) +||||||||++ +|||+|+|+||++|+++++++|+++|+++||+|||++|+.+||++. ..+|+|+|+.. ++|++ T Consensus 299 ~iLGqtlTs~~~~g~~gS~Alg~vh~~v~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~ 378 (528) T protein:vir:10 299 AILGGTLTSQTSESGGGAYALGQVHNEVRHDLLAADARQLAATLSRDLLWPLLVLNRSGNLDARRAPRLVFDLKDRADLA 378 (528) T ss_pred HHhhhhhhccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCCcccHH Confidence 99999999863 3589999999999999999999999999999999999999999864 45799999854 47789 Q ss_pred HHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCC-CCccccccc-----cccccccc Q lcl|NC_021302. 387 ATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEP-ALPNTSGTT-----STTNAPQA 460 (484) Q Consensus 387 ~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~-~~~~~~~~~-----~~~~~~~~ 460 (484) ++++++++|+++|+.+ +++|++++||||.|+++++++.+........+..... .....+... ...+..+. T Consensus 379 ~~a~~~~~L~~~G~~i----~~~~i~e~~gip~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 454 (528) T protein:vir:10 379 AMATSLPPLVKLGVQV----PVNWVQEQLGIPLPANGEAVLGDQAGAGIAQLSRRPGPRIAALAQVIGPRYRDQEALDQV 454 (528) T ss_pred HHHHHHHHHHhCCCCC----CHHHHHHHhCCCCCCCCcccccCCCcccccccCcccccccccccccccccccccchHHHH Confidence 9999999999999965 5799999999999999888765432221111111000 000000000 00000000 Q ss_pred cccccccchHHH---hcCcc-----cCcc----------cCC Q lcl|NC_021302. 461 RKRPRGRSPRDR---RKTPD-----GAMP----------LWD 484 (484) Q Consensus 461 ~~~~~~~~~~~~---~~~~~-----~~~~----------~~~ 484 (484) ......++.... ...|- -+.+ |+. T Consensus 455 ~~~~~~~~~~~~~~~~l~~i~~~l~~~~s~ee~~~~L~~l~~ 496 (528) T protein:vir:10 455 LASLPAQDMQNQADSLVAPLLDVISRGGSEAELLGALAEAFP 496 (528) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHhh Confidence 000000000000 00000 0000 000 No 6 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=100.00 E-value=4.8e-107 Score=603.46 Aligned_cols=434 Identities=14% Similarity=0.127 Sum_probs=329.5 Q ss_pred CCCCCCCcc------ceeeeecccccchhhhhhhccccccccccc-ccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCc Q lcl|NC_021302. 1 MAPKTVAPR------TERGYVNPLAGFGTFLAQGLDQFEQVDELR-WPNSVYTYTRMCREEARIASVLRAIGLPIRRTDW 73 (484) Q Consensus 1 ~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr-~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~ 73 (484) +.+..+.+. ....+....+.+......++.+...+ .|| .+..+++|++|+ +|+||+++|++|+++|++++| T Consensus 7 ~~~g~~~~~~~~~~~~~~~ia~~~~~~~~~~~~~~~p~~~~-il~~~~~~~~~y~~m~-~D~~i~s~l~~Rk~av~~~~w 84 (491) T protein:vir:79 7 VSPTEFVKFGEPDKSLSSQIATRARSIDFFALGMYLPNPDP-VLKALGKDIRVYRELR-ADAHVGGCVRRRKAAVKALEW 84 (491) T ss_pred CCCCCcccccccchhHHHHHhhhccccccccccccCcchhH-HHhhccCCHHHHHHHh-hChHHHHHHHHHHHHHhCCCc Confidence 222222211 11112222333444444566665543 333 466799999998 699999999999999999999 Q ss_pred EEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCeeeeeeeeeeCc Q lcl|NC_021302. 74 RIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQ 153 (484) Q Consensus 74 ~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~ 153 (484) +|+|+++++++++++++++. +.+|+++|++||+|++|||||+|++|..++|.|.|++|.+||| T Consensus 85 ~i~~~~~~~~~a~~i~e~l~-----------------~~~~~~~i~~~lda~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~ 147 (491) T protein:vir:79 85 GLDRGKAKSRVAKSIADVFA-----------------DLDLSRIATEMLDAVLYGYQPMEITWGKVGNYIVPIDVVGKPA 147 (491) T ss_pred EEecCCCCHHHHHHHHHHHh-----------------cCCHHHHHHHHHHhhhhcceeEEEEEeecCCeeeEEeeeeecc Confidence 99999999888999988874 3589999999999999999999999999999999999999999 Q ss_pred cceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHH Q lcl|NC_021302. 154 SSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIR 233 (484) Q Consensus 154 ~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~ 233 (484) +||. |+++++++... .....+++++|++|||+|+|+++++||||.||++.|||+|+||+++++ T Consensus 148 ~~f~---~d~~~~l~l~~--------------~~~~~~g~~lp~~k~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~ 210 (491) T protein:vir:79 148 DWFV---YDPENQLRFRS--------------KEHWVQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLK 210 (491) T ss_pred ccee---eccCCceEEee--------------cCCCCCceeecCCCeEEEEecCCCCCcccchhHHHHHHHHHHHHhhHH Confidence 9764 56677654322 223567899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccC---CchhHHHHHHHHHHH Q lcl|NC_021302. 234 IEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNG---TPLDPRRAIEYHDHQ 310 (484) Q Consensus 234 ~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~---~~~~~~~li~~~d~~ 310 (484) +|+.|+||| |+|+++|||+.+++++++++|++++++| ++++++|||.|++|||+++++ +...|++|++|||++ T Consensus 211 ~w~~f~E~~--G~P~~igky~~~a~~~ek~~l~~al~~~--~~~a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~ 286 (491) T protein:vir:79 211 FWVQFTEKY--GSPMLVGKHPRSASDAETNLLLDRLEDM--VQDAVAVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGE 286 (491) T ss_pred HHHHHHHHc--CCCeEEEecCCCCCHHHHHHHHHHHHHH--hcCeEEEecCCceeEEEeccCCCCChhHHHHHHHHHHHH Confidence 999999997 8899999999999999999999999999 456899999999999998753 335699999999999 Q ss_pred HHHHHhhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCCCcHHHHHH Q lcl|NC_021302. 311 MALVALAHFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIGSRQDATAA 390 (484) Q Consensus 311 Isk~ilGqtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~~~~~~~ae 390 (484) |||+||||||||++ |||+|+|+||++|+++++++|+++|+++|| +||++++.+||++. +.|+|.|.+.+++++.+++ T Consensus 287 Isk~iLGqtlTt~~-~gs~a~~~vh~~v~~~i~~~D~~~i~~tln-~li~~l~~~N~~~~-~~p~f~~~e~ee~~~~~a~ 363 (491) T protein:vir:79 287 VSIALLGQNQTTEA-TSTRASAQAGLEVTDDIRDGDKAIVVEAMN-MLIRWICDLNFDGA-ARPVFDMWEQEQVDEIQAG 363 (491) T ss_pred HHHHHhhhhhccCc-ccchhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCCCC-CcceEeecCcCchhHHHHH Confidence 99999999999985 799999999999999999999999999999 59999999999865 4579999887777788999 Q ss_pred HHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccccccccccccccccccchH Q lcl|NC_021302. 391 ALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSPR 470 (484) Q Consensus 391 ~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (484) ++++|+++|+.+ +++|++|+||||+|+++++..+.......+.........+ ......+.............. T Consensus 364 ~~~~L~~~G~~i----~~~~~~e~~Gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~---~~~~~d~~~~~~~~~~~~~~~ 436 (491) T protein:vir:79 364 RDEKLTRAGARF----TPAYFKRAYNLQDGDLDERPLPVSAVDAVGAASFAEFEAP---DQDALDAALNALSARDLNADA 436 (491) T ss_pred HHHHHHhCCCcc----CHHHHHHHhCCCCCCCCccccCcCcccccccccccccCCC---CCcchHHHHHHHHHHHHHHHH Confidence 999999999955 5799999999999988776654332222111111111111 110000100000000000000 Q ss_pred HHhcCcc-----cCcccCC Q lcl|NC_021302. 471 DRRKTPD-----GAMPLWD 484 (484) Q Consensus 471 ~~~~~~~-----~~~~~~~ 484 (484) .+-..+- -+.++=+ T Consensus 437 ~~~~~~i~~~l~~~~s~~e 455 (491) T protein:vir:79 437 QALVAPLLKRIANGASADE 455 (491) T ss_pred HHHHHHHHHHHHhcCCHHH Confidence 0000000 0000000 No 7 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=100.00 E-value=3.9e-106 Score=598.49 Aligned_cols=432 Identities=14% Similarity=0.136 Sum_probs=324.8 Q ss_pred CCC------------CCCCcc--ceeeeecccccchhhhhhhccccccccccc-ccchHHHHHHHHhcchHHHHHHHHHH Q lcl|NC_021302. 1 MAP------------KTVAPR--TERGYVNPLAGFGTFLAQGLDQFEQVDELR-WPNSVYTYTRMCREEARIASVLRAIG 65 (484) Q Consensus 1 ~~~------------~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr-~~~~~~~y~~m~~~D~~v~s~l~~r~ 65 (484) |+| .+..+. .++++. .++.+ ....++.+...-..|| .+.++++|++|+ +|+||+++|++|+ T Consensus 1 m~~~i~~~~g~p~~~~~~~~~~~~~ia~~--~~~~~-~~~~~~~~~~~~~iLr~~~~~~~~y~~m~-~D~~i~s~l~~Rk 76 (491) T protein:vir:10 1 MSKGLWVSPTEFVTFGEPDKSLSSQIATR--ARSID-FFALGMYLPNPDPVLKALGKDIRVYRELR-ADAHVGGCVRRRK 76 (491) T ss_pred CCCceeCCCCCccCcccCChHHHHHHHhh--hcccc-cccccCCccchHHHHHhcCCCHHHHHHHh-hChHHHHHHHHHH Confidence 222 121111 232221 12221 1112333222222343 345799999998 6999999999999 Q ss_pred HHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCeeee Q lcl|NC_021302. 66 LPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGRFWL 145 (484) Q Consensus 66 ~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~~~~ 145 (484) ++|++++|+|+|+++++++++++++++. +.+|+++|++||+|++|||||+|++|..++|.+.| T Consensus 77 ~av~~~~w~i~~~~~~~~~~e~v~e~l~-----------------~~~~~~~l~~~lda~~~G~s~~Ei~w~~~~g~~~~ 139 (491) T protein:vir:10 77 AAVKALEWGLDRGKAKSRVAKSIADVFA-----------------DLDLSRIVTEMLDAVLYGYQPMEITWGKVGNYIVP 139 (491) T ss_pred HHHhCCCcEEecCCCCHHHHHHHHHHHh-----------------cCCHHHHHHHHHHhhhhcceeEEEEEeecCCeeEE Confidence 9999999999999988888999988874 45899999999999999999999999999999999 Q ss_pred eeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHH Q lcl|NC_021302. 146 KRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNW 225 (484) Q Consensus 146 ~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~ 225 (484) ++|.+|||+|| .|+.++++....+ ....+++++|++|||+|+|+++++||||.||++.|||+| T Consensus 140 ~~l~~r~~~~f---~~d~~~~l~~~~~--------------~~~~~g~~l~~~k~i~~~~~~~~~~p~g~gLl~~~~w~~ 202 (491) T protein:vir:10 140 IDVVGKPADWF---VYDPENQLRFRSK--------------DHWMQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPT 202 (491) T ss_pred EEeeeecccce---eeccCCceEEecC--------------CCCCCcceecCCCEEEEEecCCCCCcccchhHHHHHHHH Confidence 99999999876 4566777653322 234578899999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCC---chhHHH Q lcl|NC_021302. 226 KLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGT---PLDPRR 302 (484) Q Consensus 226 ~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~---~~~~~~ 302 (484) +||++++++|+.|+||| |+|+++|||+.+++++++++|++++++| ++++++|||.|++|||++++++ ...|++ T Consensus 203 ~fK~~~~~~w~~f~E~y--G~P~~igky~~~a~~~ek~~l~~al~~~--~~~a~~viP~~~~ie~~ea~~~~g~~~~y~~ 278 (491) T protein:vir:10 203 TFKKGGLKFWVQFTEKY--GSPMLVGKHPRSASDGEKNLLLDCLEDM--VQDAVAVVPDDSSIEIKEAAGKTGSADVYER 278 (491) T ss_pred HHHHHHHHHHHHHHHHc--CCCeEEEecCCCCCHHHHHHHHHHHHHH--hcCcEEEecCCceeEEEecCCCCCChhHHHH Confidence 99999999999999997 8899999999999999999999999999 4468999999999999997642 346999 Q ss_pred HHHHHHHHHHHHHhhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCC Q lcl|NC_021302. 303 AIEYHDHQMALVALAHFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIG 382 (484) Q Consensus 303 li~~~d~~Isk~ilGqtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~ 382 (484) |++|||++|||+||||||||++ |||+|+|+||++|+++++++|+++|+++|| +||++++.+||++.. +|+|+|++.+ T Consensus 279 li~~~d~~Isk~iLGqtlTt~~-~gs~a~~~vh~~v~~di~~~D~~~i~~tln-~li~~l~~~N~~~~~-~p~f~~~~~~ 355 (491) T protein:vir:10 279 LLHFCRGEVSIALLGQNQTTEA-TSTRASAQAGLEVTDDIRDGDKAVVSEAMN-MLIRWICDLNFDGAD-RPVFDMWEQE 355 (491) T ss_pred HHHHHHHHHHHHHhhhhcccCc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCCCCC-cceEEecCcC Confidence 9999999999999999999985 799999999999999999999999999999 599999999998654 6899999877 Q ss_pred CcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccccccccccccc Q lcl|NC_021302. 383 SRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARK 462 (484) Q Consensus 383 ~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 462 (484) ++++.+++++++|+++|+.+ +++|++++||||.|..+++.....+.............. .............. T Consensus 356 e~~~~~a~~~~~L~~~G~~i----~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~d~~~~~~~ 428 (491) T protein:vir:10 356 QVDEIQAGRDQKLTQAGARF----TPAYFKRAYNLQDGDLDERPLPVSAVDTVGAASFAEFEA---PDQDALDAALNTLS 428 (491) T ss_pred chhHHHHHHHHHHHhCCCcC----CHHHHHHHhCCCCCCcCccccccCCCCCcccccccccCC---CCCCchHHHHHHHH Confidence 88889999999999999955 578999999999998877654332222111111111111 11100000000000 Q ss_pred cccccchHHHhcCcc-----cCcccCC Q lcl|NC_021302. 463 RPRGRSPRDRRKTPD-----GAMPLWD 484 (484) Q Consensus 463 ~~~~~~~~~~~~~~~-----~~~~~~~ 484 (484) ........++-.++- -+.++=+ T Consensus 429 ~~~~~~~~~~~~~~i~~~l~~~~s~~e 455 (491) T protein:vir:10 429 ARDLNADAQALVAPLLKRIANGASADE 455 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCHHH Confidence 000000000000000 0000000 No 8 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=100.00 E-value=1e-105 Score=596.12 Aligned_cols=441 Identities=15% Similarity=0.123 Sum_probs=321.4 Q ss_pred CCCCCCCccceee-ee-----cccccch-hhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCc Q lcl|NC_021302. 1 MAPKTVAPRTERG-YV-----NPLAGFG-TFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDW 73 (484) Q Consensus 1 ~~~~~~~~~~~~~-~~-----~~~~~~~-~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~ 73 (484) +++..-....+++ +. ++.++.. ..+...|+..+..+. ....+||++|+++|+||+++|++|+++|++++| T Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~---~~~~~L~~dm~~~D~hi~s~l~~Rk~av~~~~w 90 (512) T protein:vir:19 14 FDDEMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDL---TAQADLAFDMEEKDTHLFSELSKRRLAIQALEW 90 (512) T ss_pred cccccccccchhcccchhhccccccCCCHHHHHHHHHHhhCCCH---HHHHHHHHHHHhhChHHHHHHHHHHHHHhCCCc Confidence 2222111111111 11 1222222 234455666655322 234688999999999999999999999999999 Q ss_pred EEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCeeeeeeeeeeCc Q lcl|NC_021302. 74 RIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQ 153 (484) Q Consensus 74 ~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~ 153 (484) +|+|++++....+.+++.+...+ .+..||+++|++||+|++|||||+|++|.+++|.|.|++|.+||| T Consensus 91 ~I~p~~~~~~~~~~~a~~v~~~l------------~~~~~f~~~~~~lldA~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~ 158 (512) T protein:vir:19 91 RIAPARDASAQEKKDADMLNEYL------------HDAAWFEDALFDAGDAILKGYSMQEIEWGWLGKMRVPVALHHRDP 158 (512) T ss_pred eEecCCCCCHHHHHHHHHHHHHH------------hcCCCHHHHHHHHHhhhhhcceeeeeEeeeeCCceeeeeeeeecc Confidence 99998764333333333333221 134579999999999999999999999999999999999999999 Q ss_pred cceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHH Q lcl|NC_021302. 154 SSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIR 233 (484) Q Consensus 154 ~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~ 233 (484) +||. +++++++.. .....+.+++++|++||++|+|+++++||||.||++.|||+|+||+++++ T Consensus 159 ~~f~---~~~~~~~~l--------------r~~~~~~~G~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~ 221 (512) T protein:vir:19 159 ALFC---ANPDNLNEL--------------RLRDASYHGLELQPFGWFMHRAKSRTGYVGTNGLVRTLIWPFIFKNYSVR 221 (512) T ss_pred ccce---eccCCCcEE--------------EecCCCCCceeecCCceEEEeccCCCCCcccccHHHHHHHHHHHHHHHHH Confidence 9874 344443221 11223457889999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCC-chhHHHHHHHHHHHHH Q lcl|NC_021302. 234 IEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGT-PLDPRRAIEYHDHQMA 312 (484) Q Consensus 234 ~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~-~~~~~~li~~~d~~Is 312 (484) +|+.|+||| |+|+++|||+.+++++++++|++++.+| ++++++|||.|++|||++++++ ...|+.|++|||++|| T Consensus 222 ~w~~f~E~y--G~P~~igky~~~a~~~ek~~L~~al~~~--~~~a~~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Is 297 (512) T protein:vir:19 222 DFAEFLEIY--GLPMRVGKYPTGSTNREKATLMQAVMDI--GRRAGGIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAIS 297 (512) T ss_pred HHHHHHHHc--CCCeeEEecCCCCCHHHHHHHHHHHHHH--hhCcEEEecCCceEEEeecCCCCHHHHHHHHHHHHHHHH Confidence 999999997 8999999999999999999999999999 4468999999999999997654 4569999999999999 Q ss_pred HHHhhhhhcccc-cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc---ccceEEecCCC-CcHHH Q lcl|NC_021302. 313 LVALAHFLNLDG-KGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDE---PAPLLVFDEIG-SRQDA 387 (484) Q Consensus 313 k~ilGqtlt~~~-~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~---~~P~~~~~~~~-~~~~~ 387 (484) |+||||||||++ .+||+|+|+||++|+++++++|+++|+++||+|||++|+.+||++.. .+|+|+|+..+ +|++. T Consensus 298 k~iLGqtlTs~~g~~Gs~a~~~vh~ev~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~f~~~e~eDl~~ 377 (512) T protein:vir:19 298 KAILGGTLTTEAGDKGARSLGEVHDEVRREIRNADVGQLARSINRDLIYPLLALNSDSTIDINRLPGIVFDTSEAGDITA 377 (512) T ss_pred HHHhhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCChhhHHH Confidence 999999999984 57999999999999999999999999999999999999999998753 47999998544 77899 Q ss_pred HHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccccccccccccccccccc Q lcl|NC_021302. 388 TAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGR 467 (484) Q Consensus 388 ~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 467 (484) .++++++|+ +|+.+ +++|++++||||+|.+++.....++......+... ........+............... T Consensus 378 ~a~~~~~l~-~G~~i----~~~~i~e~~Gip~~~~~e~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~d~~~~~~~~~ 450 (512) T protein:vir:19 378 LSDAIPKLA-AGMRI----PVSWIQEKLHIPQPVGDEAVFTIQPVVPDNGSQKE--AALSAEDIPQEDDIDRMGVSPEDW 450 (512) T ss_pred HHHHHHHHh-cCCCC----CHHHHHHHhCCCCCCCccccccCCCcccccccccc--ccccccCCCchhhHhHHhhhHHHH Confidence 999999996 89865 57999999999999988876654332221111111 111111111111111100000000 Q ss_pred ch-HHHhcCcc----cCcccCC Q lcl|NC_021302. 468 SP-RDRRKTPD----GAMPLWD 484 (484) Q Consensus 468 ~~-~~~~~~~~----~~~~~~~ 484 (484) .. ......+. .+.++=+ T Consensus 451 ~~~~~~~~~~i~~~~~~~s~ee 472 (512) T protein:vir:19 451 QRSVDPLLKPVIFSVLKDGPEA 472 (512) T ss_pred HHHHHHHHHHHHHHHHhCCHHH Confidence 00 00000000 0011100 No 9 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=100.00 E-value=9.4e-104 Score=585.42 Aligned_cols=428 Identities=19% Similarity=0.229 Sum_probs=333.8 Q ss_pred CCCCCCCccceee---eec---------ccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHh Q lcl|NC_021302. 1 MAPKTVAPRTERG---YVN---------PLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPI 68 (484) Q Consensus 1 ~~~~~~~~~~~~~---~~~---------~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v 68 (484) ||.+..-|+..+. -+. ....+......++.+.+....||++.++++|++|++ |+||+++|++|+++| T Consensus 1 m~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly~~m~~-D~hi~s~l~~Rk~av 79 (448) T protein:vir:79 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDGLLVYHKMLS-DGTVKNALNYIFGRI 79 (448) T ss_pred CCCCCCCCccccCcccccccccchhhhhhhhhhcccccccccccchhHhhccccchHHHHHHhh-ChHHHHHHHHHHHHH Confidence 7766666643111 111 111122233456666776678999899999999986 999999999999999 Q ss_pred hCCCcEEecCCCCHH---HHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEee-cCCeee Q lcl|NC_021302. 69 RRTDWRIRPNGARPE---VVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFY-EGGRFW 144 (484) Q Consensus 69 ~~~~~~v~p~~~~~e---~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~-~~g~~~ 144 (484) ++++|+|+|++++++ +++++.++|. ..+. ..++.+|+++|++||+|++|||||+|++|.. .+|++. T Consensus 80 ~~~~w~v~p~~~~~~~~~~ae~v~~~l~----~~~~------~~~~~~f~~~~~~~lda~~~G~s~~Eivw~~~~~g~~~ 149 (448) T protein:vir:79 80 RSAKWYVEPASTDPEDIAIAAFIHAQLG----IDDA------SVGKYPFGRLFAIYENAYIYGMAAGEIVLTLGADGKLI 149 (448) T ss_pred hcCCceEecCCCCHHHHHHHHHHHHHhh----hhhh------hhccCCHHHHHHHHHHhhhhcceeEEEEeeecCCCcee Confidence 999999999887764 4444555543 2221 1246689999999999999999999999986 689999 Q ss_pred eeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHH Q lcl|NC_021302. 145 LKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKN 224 (484) Q Consensus 145 ~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~ 224 (484) +.+|.+|||+++.+|.|+.+++++...+..... .......++++|..+|++|. ++++|||||.|||+.|||+ T Consensus 150 ~~~l~~r~~~~~~~f~~~~d~~l~~~~~~~~~~-------~~~~~~~~~~lP~~~~i~~~-~~~~g~p~g~gLlr~~~w~ 221 (448) T protein:vir:79 150 LDKIVPIHPFNIDEVLYDEEGGPKALKLSGEVK-------GGSQFVSGLEIPIWKTVVFL-HNDDGSFTGQSALRAAVPH 221 (448) T ss_pred cccccccCCccccceeeecCCceEEeecCCccc-------ccccCCCccccccceEEEEe-cCccCCcccchhHHHHHHH Confidence 999999999999999999999987766532111 11223467789999998875 4789999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCC--HHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHH Q lcl|NC_021302. 225 WKLKDELIRIEAAAIRRHGIGVPYLKGNEADSED--DDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRR 302 (484) Q Consensus 225 ~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~--~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~ 302 (484) |+||++++++|+.|+||| |+|+++|||+.+++ ++++++|+++++++++|+++++|||.|++|||+++++++.+|.+ T Consensus 222 ~~fK~~~~~~w~~f~E~y--G~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~~~~ie~~ea~~~~~~~~~ 299 (448) T protein:vir:79 222 WLAKRALILLINHGLERF--MIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAMPDAIP 299 (448) T ss_pred HHHHHHHHHHHHHHHHHc--CCceEEEecCCCCCcCHHHHHHHHHHHHHHhcCCceEEEecCCceEEEEecCCCcccHHH Confidence 999999999999999997 78999999987665 67889999999999999999999999999999999988888999 Q ss_pred HHHHHHHHHHHHHhhhhhcccccccchhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC Q lcl|NC_021302. 303 AIEYHDHQMALVALAHFLNLDGKGGSYALA-SVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI 381 (484) Q Consensus 303 li~~~d~~Isk~ilGqtlt~~~~gGs~A~~-evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~ 381 (484) +++|||++|||+|||||||++++|||++.+ .+|.+++.+++++|+++|+++||+|||++|+++||+++.++|+|+|+.. T Consensus 300 ~i~~~d~~Isk~iLGqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~li~~l~~lNfg~~~~~P~~~f~~~ 379 (448) T protein:vir:79 300 YLTYHDAGIARALGIDFNTVQLNMGVQAINIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPSATRFPRLTFEME 379 (448) T ss_pred HHHHHHHHHHHHHhhhhhccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCCCcEEEecCC Confidence 999999999999999999998776655444 4789999999999999999999999999999999999999999999865 Q ss_pred C-CcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccccccccccc Q lcl|NC_021302. 382 G-SRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQA 460 (484) Q Consensus 382 ~-~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (484) + +|++++++++++|++++ ...++|+++++|+|+|.++++..... ..+.+..+.+.++. +.-.=. T Consensus 380 e~~Dl~~~a~~~~~l~~~~-----~~~~~~~~~~~~~p~~~~~~~~~a~~---~~~~~~~~~~~~~~-------~~~~~~ 444 (448) T protein:vir:79 380 ERNDFSAAANLMGMLINAV-----KDSEDIPTELKALIDALPSKMRRALG---VVDEVREAVRQPAD-------SRYLYT 444 (448) T ss_pred ChHHHHHHHHHhhhhhccc-----hhhHHHHHHhhcCCCCCCCccccccC---CCCcccccccCCcc-------ccchhh Confidence 4 67789999999999775 34578999999999887765432211 11111111000000 000000 Q ss_pred cccc Q lcl|NC_021302. 461 RKRP 464 (484) Q Consensus 461 ~~~~ 464 (484) ..+. T Consensus 445 ~~~~ 448 (448) T protein:vir:79 445 RRRR 448 (448) T ss_pred cccC Confidence 0111 No 10 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=100.00 E-value=8.8e-104 Score=585.58 Aligned_cols=432 Identities=19% Similarity=0.211 Sum_probs=319.9 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhccccc-ccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFE-QVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNG 79 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~ 79 (484) |. -+....++.++..-++.-..+..++.... .+...+.+..+++|++|++ |+||+++|++|+++|++++|+|+|++ T Consensus 1 v~--~~~l~~e~at~~~~~d~~~~~~~~l~~~~~~il~~a~~g~~~~y~~l~~-D~~i~s~l~~rk~av~~~~w~i~p~~ 77 (488) T protein:vir:99 1 ME--KPALGREIATSGDGRDITRPFISGLQVPNDSILQRRGGNDLRVYEEILS-DAQVKTVWGQRQLAVVSREWKVEAGG 77 (488) T ss_pred CC--ccchhHHHHHHHhhhhhhccccCCCCCCChHHHHhhccCCHHHHHHHhh-ChHHHHHHHHHHHHHhcCCceEEcCC Confidence 11 11122232221111222122333444333 2222345677999999985 99999999999999999999999988 Q ss_pred CCHH---HHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCeeeeeeeeeeCccce Q lcl|NC_021302. 80 ARPE---VVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSI 156 (484) Q Consensus 80 ~~~e---~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~ 156 (484) ++++ .++++.+++ .+.+|+++|++||+|++|||||+|++|..++|.|.|.+|.+|||+|| T Consensus 78 ~~~~~~~~ae~v~~~l-----------------~~~~~~~~l~~~lda~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f 140 (488) T protein:vir:99 78 DRPIDQAAAEHLEQQL-----------------QRVGWDRVTSKMLFGVFYGYAVSELIYGRDDRYITLEAIKVRNRRRF 140 (488) T ss_pred CChHHHHHHHHHHHHH-----------------hCCCHHHHHHHHHhhhhhcceeEEEEEeecCCeeeEeeeeeecccce Confidence 7654 344444444 34589999999999999999999999999999999999999999975 Q ss_pred eeeeecCCCceeeeecccccccccccceeccCCCCccccc-ccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 157 AYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIP-VEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIE 235 (484) Q Consensus 157 ~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp-~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w 235 (484) .|+++++++...+ ....+++++| +.+|++|+|+++++||||.|||+.|||+|+||++++++| T Consensus 141 ---~~d~~~~l~~~~~--------------~~~~~g~~lp~~~~~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w 203 (488) T protein:vir:99 141 ---RYDQDGGLRLLTP--------------NNMFEGEPCPAPYFWHFSTGADNDDEPYGLGLAHWLYWPVFFKRNGIKFW 203 (488) T ss_pred ---eecCCCceEEecc--------------CCCCCccccccCceEEEEeecCCCCCcccchHHHHHHHHHHHHHhhHHHH Confidence 4677777653322 2344678886 569999999999999999999999999999999999999 Q ss_pred HHHHHHhcCCcceEEecCCC-CCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCc-hhHHHHHHHHHHHHHH Q lcl|NC_021302. 236 AAAIRRHGIGVPYLKGNEAD-SEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTP-LDPRRAIEYHDHQMAL 313 (484) Q Consensus 236 ~~f~Er~~~G~P~~~gk~~~-~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~-~~~~~li~~~d~~Isk 313 (484) +.|+||| |+|+++|||+. +++++++++|++++.+|. +++++|||.|++|||+++++++ ..|.+|++|||++||| T Consensus 204 ~~f~E~y--G~P~~igky~~~~a~~~ek~~l~~av~~~~--~~~~~viP~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk 279 (488) T protein:vir:99 204 LIFLDKF--GMPTAVGRYDDKTATPEDKAKLLAALHAIQ--TDSAIIMPAGMQAELLEAGRSGTADYKTLHDTMDATIAK 279 (488) T ss_pred HHHHHHc--CCceeeeecCCCCCCHHHHHHHHHHHHHHh--cCcEEEecCCceeEEeecCCCChHHHHHHHHHHHHHHHH Confidence 9999997 88999999985 788999999999999994 4589999999999999976544 5699999999999999 Q ss_pred HHhhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC-CCCcHHHHHHHH Q lcl|NC_021302. 314 VALAHFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE-IGSRQDATAAAL 392 (484) Q Consensus 314 ~ilGqtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~-~~~~~~~~ae~~ 392 (484) +|||||||++++|||+|+|+||++|+.+++++|+++|+++||+|||++|+.+||+.. .+|+|+|+. .++|++++++++ T Consensus 280 ~iLGqtlts~~~~Gs~a~~~vh~~v~~d~~~aDa~~i~~tln~~li~~l~~~N~~~~-~~p~~~~~~~e~edl~~~a~~~ 358 (488) T protein:vir:99 280 VGLGQVASTQGTPGRLGNDDLQADVRLDLVKADADLICESFNLGPARWLTEWNFPGA-QPPRVYRVIEEPEDITAKAERD 358 (488) T ss_pred HHhhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcCCc-CCceeEecCCCcccHHHHHHHH Confidence 999999999988899999999999999999999999999999999999999999754 569999974 457889999999 Q ss_pred HHHHhc-CcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccccccccccccccccccchHH Q lcl|NC_021302. 393 QMLVNA-GLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSPRD 471 (484) Q Consensus 393 ~~L~~~-G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (484) ++|+++ |+.+ +++|++++||||.+.++++...+.+.....+... +.........+..+....... .... T Consensus 359 ~~l~~~~G~~i----~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~-~~~~ 428 (488) T protein:vir:99 359 EKVFRMSGFRP----TRGYVQETYGVEVESTQAEATAPTPSTEFAEGDQ-----PSDPAAAMAPQLAEAMQPVVG-NWTT 428 (488) T ss_pred HHHHhhcCCCC----CHHHHHHHcCCCCcccccccccCCCcccCCCCCC-----CCCchHHHHHHHHHHHHHHHH-HHHH Confidence 999997 8855 5799999999999988776654332222111111 111111111111000000000 0000 Q ss_pred HhcC-cccCcccCC Q lcl|NC_021302. 472 RRKT-PDGAMPLWD 484 (484) Q Consensus 472 ~~~~-~~~~~~~~~ 484 (484) .-++ ..-+.++=+ T Consensus 429 ~i~~~l~~a~s~ee 442 (488) T protein:vir:99 429 QLRTLIEQASSLED 442 (488) T ss_pred HHHHHHHhcCCHHH Confidence 0000 000000000 No 11 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=100.00 E-value=7.4e-103 Score=580.50 Aligned_cols=426 Identities=19% Similarity=0.235 Sum_probs=328.8 Q ss_pred CCCCCCCcccee---eee--------cc-cccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHh Q lcl|NC_021302. 1 MAPKTVAPRTER---GYV--------NP-LAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPI 68 (484) Q Consensus 1 ~~~~~~~~~~~~---~~~--------~~-~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v 68 (484) ||.|.+.|+..+ +.+ .. +..+.+....++.+......||++.++++|++|++ |+||+++|++|+++| T Consensus 1 m~kk~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly~~m~~-D~hi~s~l~~Rk~av 79 (448) T protein:vir:77 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDGLLVYHKMLS-DGTVKNALNYIFGRI 79 (448) T ss_pred CCCCCCCCcccCCcccccchhhhhhhccchhhhcccccccccccchhHhhccccchHHHHHHhh-ChHHHHHHHHHHHHH Confidence 998888886321 111 11 11122333456666666678899999999999986 999999999999999 Q ss_pred hCCCcEEecCCCCHH---HHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEee-cCCeee Q lcl|NC_021302. 69 RRTDWRIRPNGARPE---VVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFY-EGGRFW 144 (484) Q Consensus 69 ~~~~~~v~p~~~~~e---~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~-~~g~~~ 144 (484) ++++|+|+|++++++ +++++.++|. ..+. ...+.+|+++|++||+|++|||||+|++|.. .+|.|. T Consensus 80 ~~~~w~v~p~~~~~~d~~~ae~v~~~l~----~~~~------~~~~~~f~~~i~~~lda~~~G~s~~Eivw~~~~dg~~~ 149 (448) T protein:vir:77 80 RSAKWYVEPASTDPEDIAIAAFIHAQLG----IDDA------SVGKYPFGRLFAIYENAYIYGMAAGEIVLTLGADGKLI 149 (448) T ss_pred hcCCceEecCCCCHHHHHHHHHHHHHhh----chhh------hhccCCHHHHHHHHHHhhhhcceeEEEEEeecCCCcee Confidence 999999999887654 4445555543 2221 1246689999999999999999999999986 689999 Q ss_pred eeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHH Q lcl|NC_021302. 145 LKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKN 224 (484) Q Consensus 145 ~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~ 224 (484) +.+|.+|||+++.+|.|+.+++++...+...- ........+++||..||++|. ++++|||||.|||+.|||+ T Consensus 150 ~~~l~~r~~~~~~~f~~~~~~~l~~~~~~~~~-------~~~~~~~~~~~lP~~~~i~~~-~~~~g~p~g~gLlr~~~w~ 221 (448) T protein:vir:77 150 LDKIVPIHPFNIDEVLYDEEGGPKALKLSGEV-------KGGSQFVNGLEIPIWKTVVFL-HNDDGSFTGQSALRAAVPH 221 (448) T ss_pred eccccccCCCccceeeeecCCceEEEecCCcc-------cccccCCCccccccceEEEEe-cCCcCCcccchHHHHHHHH Confidence 99999999999999999999998766553211 111234457789999998775 5788999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCC--HHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHH Q lcl|NC_021302. 225 WKLKDELIRIEAAAIRRHGIGVPYLKGNEADSED--DDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRR 302 (484) Q Consensus 225 ~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~--~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~ 302 (484) |+||++++++|+.|+||| |+|+++|||+.+++ ++++++|++++.+|++|+++++|||.|++|||+++++++.+|.+ T Consensus 222 ~~fK~~~~~~w~~f~E~y--G~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~g~~ie~~ea~~~~~~~~~ 299 (448) T protein:vir:77 222 WLAKRALILLINHGLERF--MIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILPDDWKFDTVDLKSAMPDAIP 299 (448) T ss_pred HHHHHhhHHHHHHHHHHc--CCceeEEecCCCCCCCHHHHHHHHHHHHHHhcCCceEEEecCCceEEEEecCCCccCHHH Confidence 999999999999999997 88999999987654 57889999999999999999999999999999999888888999 Q ss_pred HHHHHHHHHHHHHhhhhhcccccccchhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC Q lcl|NC_021302. 303 AIEYHDHQMALVALAHFLNLDGKGGSYALA-SVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI 381 (484) Q Consensus 303 li~~~d~~Isk~ilGqtlt~~~~gGs~A~~-evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~ 381 (484) +++|||++|||+|||||||++++||+++.+ ..|.+++.+++++|+++|+++||+|||++|+.+|||++.++|+|+|+.. T Consensus 300 ~i~~~d~~Isk~iLGqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~lNfg~~~~~P~~~f~~~ 379 (448) T protein:vir:77 300 YLTYHDAGIARALGIDFNTVQLNMGVQAVNIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPGATRFPRLTFEME 379 (448) T ss_pred HHHHHHHHHHHHHhccccccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCEEEecCC Confidence 999999999999999999998776655444 3566899999999999999999999999999999999999999999854 Q ss_pred C-CcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCcccc--CCCCccccccccccccc Q lcl|NC_021302. 382 G-SRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETD--EPALPNTSGTTSTTNAP 458 (484) Q Consensus 382 ~-~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~ 458 (484) + +|++++++++++|+ +++++++|||++.++... +.+.++.+... .+..+ ....+..+... T Consensus 380 e~eDl~~~a~~~~~l~------------~~~~~~~~ip~~~~~~~~----~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 442 (448) T protein:vir:77 380 ERNDFSAAANLMGMLI------------NAVKDSEDIPTELKALID----ALPSKMRRALGVVDEVRE-AVRQPADSRYL 442 (448) T ss_pred ChhhHHHHHHHhHHHH------------HHHHHHhcCCccCCcCCC----CCchhcccccCCCCCCCc-hhhcchhhHHH Confidence 4 67889999999886 368999999987643221 11111111111 11111 11111111222 Q ss_pred cccccc Q lcl|NC_021302. 459 QARKRP 464 (484) Q Consensus 459 ~~~~~~ 464 (484) .+..+. T Consensus 443 ~~r~~~ 448 (448) T protein:vir:77 443 YTRRRR 448 (448) T ss_pred HhhhcC Confidence 222221 No 12 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=100.00 E-value=6.2e-100 Score=564.46 Aligned_cols=393 Identities=13% Similarity=0.125 Sum_probs=306.1 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhccccccccccccc---chHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEec Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWP---NSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRP 77 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~---~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p 77 (484) -||+ |...+.+.++..+.|.. .++...+....-+++ +.+++|++|+++|+||+++|++||++|++++|+|+| T Consensus 7 ~~p~---~~~~~~~~~~~~~~~~~--~g~~~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~av~~~~w~V~p 81 (446) T protein:vir:98 7 NAPT---PAIRRRTIYAMEHLGLA--TSYLSEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDSIALSVLNKVGPYQH 81 (446) T ss_pred CCCc---hhhhhhhhhccccchhh--cccCCcchHhhhcCCChHHHHHHHHHHHhcchHHHHHHHHHHHHhhcCCceecC Confidence 2332 22222233444444432 233333332222232 357999999999999999999999999999999998 Q ss_pred CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCeeeeeee----eeeCc Q lcl|NC_021302. 78 NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGRFWLKRL----APRPQ 153 (484) Q Consensus 78 ~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~~~~~~l----~~r~~ 153 (484) + +++++++++++|+. ..|+.++.+|++|++|||||+|++|++.+|.+.+.++ ..+.| T Consensus 82 ~--~~~~a~~v~~~l~~-----------------~~~~~~~~~~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~~~~~~ 142 (446) T protein:vir:98 82 G--DKRIKKFIDDQLRN-----------------RAKTWISHCVKSIMTYGFSLSEQIYAHGARDNMPATVLDDIVNYHP 142 (446) T ss_pred c--cHHHHHHHHHHHhh-----------------cCchhHHHHHHHHHhhCceeeeEEEeecccccccchhhcccccccc Confidence 5 56789999998853 2578899999999999999999999999888887654 33444 Q ss_pred cceeeeeecCCCcee-----eeeccccccccc-------ccceeccCCCCcccccccceEEEeecCccCccccchhHHHH Q lcl|NC_021302. 154 SSIAYWNVDRDGGLI-----SIQQWPAGTFGG-------PGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPA 221 (484) Q Consensus 154 ~~~~~~~~~~dg~l~-----~~~q~~~~~~~~-------~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~ 221 (484) ..+ +|.++.+++++ +..|+....... ...+.....+.++.||+.||++|+|+++++||||.||+|.| T Consensus 143 ~~~-r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~iP~~kfi~~~~~~~~~~p~G~gLlr~~ 221 (446) T protein:vir:98 143 LQV-MLIANDNGRIVDGDTVTASQYKSGYWVPLPPYRIGDPPKKVDVVGSHVRLPSHKRLFINYNTKGNNPWGTSCLTSV 221 (446) T ss_pred ccc-eeeeccCCccccccccchhhcccccccCcccchhhhhhhhcccCcccccccccceEEEEecCCCCCccccchHHHH Confidence 444 35666666543 222322211111 12233345567788999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHH-------------HHHHHHHHHHhcCCceEEEc-----c Q lcl|NC_021302. 222 YKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRM-------------DELLEIASNYSGGESAGLAL-----T 283 (484) Q Consensus 222 ~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~-------------~~l~~~l~~~~~g~~a~~vi-----p 283 (484) ||+|+||++++++|+.|+||| |+|+++|||+++++++++ +.|+++++++ ++++++|+ | T Consensus 222 ~w~~~fK~~~~~~w~~f~E~y--G~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~--~~da~~ii~~~~~P 297 (446) T protein:vir:98 222 LDYSIFKRAFRDMMLIALDRY--GTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRL--STDSGLVLTQLSKE 297 (446) T ss_pred HHHHHHHHhhHHHHHHHHhHc--CCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhc--cccceeeeecccCC Confidence 999999999999999999997 889999999988775544 3588889888 44577777 9 Q ss_pred CCceEEEecccCCc-hhHHHHHHHHHHHHHHHHhhhhhccc---ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 284 AGEEAGILSPNGTP-LDPRRAIEYHDHQMALVALAHFLNLD---GKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVV 359 (484) Q Consensus 284 ~~~~ie~~~~~~~~-~~~~~li~~~d~~Isk~ilGqtlt~~---~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli 359 (484) +|++|||+++++++ .+|+++|+|||++|||+|||||||++ +++||+|+|+||++|+.+++++|+++||++||+||| T Consensus 298 ~g~eie~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala~vh~~V~~d~~~aDa~~i~~tln~~Li 377 (446) T protein:vir:98 298 QPVQVGALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRASEIQLELFDGKINSIFDTVIHAFTEQVI 377 (446) T ss_pred CCceEEeeccccCChhhHHHHHHHHHHHHHHHHhcccccccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999987665 47999999999999999999988853 456999999999999999999999999999999999 Q ss_pred HHHHHhCCCCccccc-----eEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCc Q lcl|NC_021302. 360 EDIVDVNWGEDEPAP-----LLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDA 424 (484) Q Consensus 360 ~~l~~~Nf~~~~~~P-----~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e 424 (484) +||+.+||++...+| .++|+ ..++|++.+++++++|+++|++++. +++|++++||||++.+.- T Consensus 378 ~~l~~lNf~~~~~~~~~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~--~~~~ire~~giP~~~~~~ 446 (446) T protein:vir:98 378 GNLIRLNFDPALYPLASNTGYITRLPGRATDLAALVEAIKQMHDMGFLVDG--DKDHIRSITGLPDAISST 446 (446) T ss_pred HHHHHhCCCccccccccccccceeccCChhhHHHHHHHHHHHHhCCccccc--cHHHHHHHhCcCCCCCCC Confidence 999999999765443 23444 3357889999999999999998753 589999999998776543 No 13 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=100.00 E-value=3.8e-90 Score=510.82 Aligned_cols=334 Identities=40% Similarity=0.656 Sum_probs=271.9 Q ss_pred eeeEEEeecCCeeeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccC Q lcl|NC_021302. 131 VFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPG 210 (484) Q Consensus 131 ~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~ 210 (484) |+||+|++++|.|.|++|.+|||++|.+|.++++++++.++|... .+.++++||+.|||+|+|+++++ T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~~~~~~~f~~~~~~~l~~~~~~~~------------~g~~~~~lp~~kfi~~~~~~~~g 68 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRPPRTISRFDVAPDGGLVAIEQWGV------------FGKATVRIPVDRLVVFVNEREGA 68 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecCccceeeeeeccCCceeEEEecCC------------CCCCcceeccCCEEEEEeCCCCC Confidence 999999999999999999999999999999999999988877432 24467899999999999999999 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCC-----------HHHHHHHHHHHHHHhcCCceE Q lcl|NC_021302. 211 VWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSED-----------DDRMDELLEIASNYSGGESAG 279 (484) Q Consensus 211 ~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~-----------~~~~~~l~~~l~~~~~g~~a~ 279 (484) ||||.|||+.|||+|+||++++++|+.|+||||.|||+++|+|+.+.. .++++.+.++++++++|++++ T Consensus 69 ~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a~ 148 (355) T protein:vir:78 69 NWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEAAG 148 (355) T ss_pred CccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCccee Confidence 999999999999999999999999999999999999999999976653 345678999999999999999 Q ss_pred EEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccc--cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 280 LALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDG--KGGSYALASVQADTFVQSVQTVADEIRDVAQAH 357 (484) Q Consensus 280 ~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~--~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~q 357 (484) +|||+|++|||+++++++.+|.++++|||++|||+|||||||+++ +|||+|+|++|++|+++++++|+++|+++||+| T Consensus 149 ~iip~g~~ie~~ea~g~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~~gGS~Alg~vh~~v~~~~~~aD~~~i~~~ln~~ 228 (355) T protein:vir:78 149 GYIPHGANFTLTGVQGKLPEMDGPIRYHDEQIARAVLAHFLTLGGDKSTGSYALGDTFASFFTGSLNAVMKHIADVTQQH 228 (355) T ss_pred EeecCCceEEEeecCCCcccHHHHHHHHHHHHHHHHhhhhhccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 999999999999998888899999999999999999999999964 469999999999999999999999999999999 Q ss_pred HHHHHHHhCCCCccccceEEecCCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCC Q lcl|NC_021302. 358 VVEDIVDVNWGEDEPAPLLVFDEIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDE 437 (484) Q Consensus 358 li~~l~~~Nf~~~~~~P~~~~~~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~ 437 (484) ||++|+.+||++..++|+|+|++++++++++++++++|+++|++++++..++|++++||||+|.++++...+.+....+. T Consensus 229 li~~l~~lN~~~~~~~P~~~~~~~~~~~~~~a~~~~~l~~~G~~~~~~~~~~~~~e~~gip~p~~~~~~~~~~~~~~~~~ 308 (355) T protein:vir:78 229 VVEDLVDQNWGPEEPAPRLVPAQLGKEQPVTAEAIRALVECGAFTADPELEKDLRARYGLPAPAERDDGADAAAAKAAGR 308 (355) T ss_pred HHHHHHHhcCCCCCCCCEEEecCcChhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCCcccCCcccccccc Confidence 99999999999999999999998788888899999999999999988878899999999999987776544322211111 Q ss_pred ccccCCCCccccccccccccccccccccccch--------HHHhcCccc Q lcl|NC_021302. 438 PETDEPALPNTSGTTSTTNAPQARKRPRGRSP--------RDRRKTPDG 478 (484) Q Consensus 438 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~ 478 (484) ..... ......+...++. ...+..+..+.+ +.-.-.++| T Consensus 309 ~~~~~-~~~~~~~~~~~a~-~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 355 (355) T protein:vir:78 309 RRAKR-LPGQRQGAALPSR-SPRADPPRRRGPLRRRPRHPAHRRCAPDG 355 (355) T ss_pred ccccc-cCCcccccccccc-CCCCCChhhhHHHHHHhhccccCCCCCCC Confidence 11100 0000000000100 001111111111 111122334 No 14 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=99.87 E-value=1.3e-20 Score=129.60 Aligned_cols=429 Identities=16% Similarity=0.125 Sum_probs=244.3 Q ss_pred CC-------CCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCc Q lcl|NC_021302. 1 MA-------PKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDW 73 (484) Q Consensus 1 ~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~ 73 (484) +| |+..-..+. .....++. +..+ ... .....+|-.+.++-+.|.+|+..+-..|.++++ T Consensus 3 ~~~~~~~~~p~~~e~~~~---~~~~~~~~--~~~~----~~~-----~~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl 68 (518) T protein:vir:10 3 LANGQTLSAPAMAELSPQ---MQDSYYYA--PAVG----MQL-----ERQFSLYGGIYKNQPWVRTVIAKRAQALARLPV 68 (518) T ss_pred ccCceeecCchhhhhhhh---hhcccccc--cccc----eec-----ccccchhhHHHhhhHHHHHHHHHHHHhhccCce Confidence 22 222111111 11111111 1111 010 112333444445678999999999999999999 Q ss_pred EEecCCCCH--HHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeee Q lcl|NC_021302. 74 RIRPNGARP--EVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAP 150 (484) Q Consensus 74 ~v~p~~~~~--e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~ 150 (484) .+.....+. +..+.....| ..+-....+..++++.++ +.+.+|.+++++++..+ | .+..|.+ T Consensus 69 ~l~~~~~~~~~~~~~~~~~~L------------l~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~-G--~~~~L~~ 133 (518) T protein:vir:10 69 KCMFTSGDTETEESDTGYAKL------------LADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS-G--TPEKLMP 133 (518) T ss_pred EEEEEcCCCceeccchHHHHH------------HcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEEEEE Confidence 984322221 1111110111 001112234667787776 57789999999986433 3 4678999 Q ss_pred eCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHH Q lcl|NC_021302. 151 RPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDE 230 (484) Q Consensus 151 r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~ 230 (484) ++|.++........+.+.+..+...+ .......+|...+|++++....+..+|.|.+..+......-.. T Consensus 134 l~p~~v~v~~~~~~~~~~y~~~~~~~-----------~~~~~~~~~~~eViHir~~s~dg~~~G~spi~~a~~~i~~~~a 202 (518) T protein:vir:10 134 MHPSRVAIKRNSRTGRYEYYFQAGAG-----------VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDS 202 (518) T ss_pred ECCCceEEEEcCCCCEEEEEEEecCC-----------ccceEEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHH Confidence 99988764333334444333222111 1122345778888888776666667899999999988888888 Q ss_pred HHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCC-c--eEEEccCCceEEEecccCCchhHHHHHHHH Q lcl|NC_021302. 231 LIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGE-S--AGLALTAGEEAGILSPNGTPLDPRRAIEYH 307 (484) Q Consensus 231 ~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~-~--a~~vip~~~~ie~~~~~~~~~~~~~li~~~ 307 (484) ..++...|... .+.|-.+.+++...++++++++.+.+.+...|. + ..++++.|++++-++.+.....|.+..++. T Consensus 203 ~~~~~~~~f~n--g~~p~gil~~~~~ls~e~~~~~k~~~~~~~~G~~nag~v~vL~~G~~~~~l~~s~~D~q~le~r~~~ 280 (518) T protein:vir:10 203 SRNATAAMWKN--AGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLN 280 (518) T ss_pred HHHHHHHHHhc--CCCccEEEecCCCCCHHHHHHHHHHHHHHhcCccccCcceEcCCCceEEEccCChhHHHHHHHHHHH Confidence 88888888875 367867777877888999999999988876652 2 358899999998888766666788888999 Q ss_pred HHHHHHHHhhhh-hcccccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC--C-C Q lcl|NC_021302. 308 DHQMALVALAHF-LNLDGKGGSYALASVQAD-TFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE--I-G 382 (484) Q Consensus 308 d~~Isk~ilGqt-lt~~~~gGs~A~~evh~~-v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~--~-~ 382 (484) ..+|++++.-.. +....++++++-.+.+.. ....-+.-.++.|+..||+.|++.. . .. .+|+|+. . . T Consensus 281 ~~eIa~afgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~l~~ie~~ln~~L~~~~-----~-~~--~~~~fd~~~llr 352 (518) T protein:vir:10 281 REEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYW-----V-RK--NRMKFDIDDVIQ 352 (518) T ss_pred HHHHHHHhCCCHHHhccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----c-CC--ceEEEechhhhc Confidence 999999976543 222334456665444443 3445577889999999998776531 1 12 2566653 2 3 Q ss_pred CcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCC--CCccccc-----c---cCCCc---CCCccc-cCCCCccc Q lcl|NC_021302. 383 SRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPD--PDADDDE-----S---TADTG---QDEPET-DEPALPNT 448 (484) Q Consensus 383 ~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~--~~e~~~~-----~---~~~~~---~~~~~~-~~~~~~~~ 448 (484) .|.+..++++.++++.|+.. .+++|+.+|+|.-+ .++.... + ...+. ++.+.. .....+.. T Consensus 353 ~D~~~r~~~~~~~~~~G~lT-----~NE~R~~~Gl~pie~~~gD~~~~~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~ 427 (518) T protein:vir:10 353 PDWEAKSESTQKMVNSGVAT-----PNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVA 427 (518) T ss_pred cCHHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCCCCCeeeecccceecccccccccCCCCCCCCCCCCccccc Confidence 57788999999999999764 47899999998643 2222110 0 00000 000000 00000010 Q ss_pred cccc-cccccccccccccccchHHHhcCcccCc---ccCC Q lcl|NC_021302. 449 SGTT-STTNAPQARKRPRGRSPRDRRKTPDGAM---PLWD 484 (484) Q Consensus 449 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 484 (484) +.+. .+.......+...++.......+|++.+ +.=| T Consensus 428 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 467 (518) T protein:vir:10 428 SLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKE 467 (518) T ss_pred cccccccccCCCCCcccccccccccccchhccccCCCccc Confidence 1111 1111111222222222222222233322 1111 No 15 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=99.87 E-value=2.4e-20 Score=128.14 Aligned_cols=430 Identities=16% Similarity=0.126 Sum_probs=243.5 Q ss_pred CC----CCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEe Q lcl|NC_021302. 1 MA----PKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIR 76 (484) Q Consensus 1 ~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~ 76 (484) +| +++|........+....++ .+..+.... ....++-.+.++-+.|.+|+..+-..|.+++|.+. T Consensus 3 ~~~~~~~~~p~~~~~~~~~~~~~~~--~~~~g~~~~---------~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~ 71 (518) T protein:vir:78 3 LANGQTLSAPAMAELSPQMQDSYYY--APAVGMQLE---------RQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCM 71 (518) T ss_pred ccCceeeccchhhhhhhhhhhcccc--cceeceecc---------cccchhhHHhhhhHHHHHHHHHHHHhhccCceEEE Confidence 22 2222111111111111111 111111111 11223333344679999999999999999999985 Q ss_pred cCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhc----CCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeee Q lcl|NC_021302. 77 PNGARPEVVEHVAACLGLPVEGDESDKPTPRTRG----RFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPR 151 (484) Q Consensus 77 p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~----~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r 151 (484) ..+.+....+ .++....++. ..+..++++.++ +.+.+|.+++++++... | .+..|.++ T Consensus 72 ~~~~~~~~~~--------------~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~-G--~~~~L~~l 134 (518) T protein:vir:78 72 FTSGDTETEE--------------HDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS-G--TPEKLMPM 134 (518) T ss_pred EEcCCccccc--------------cchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC-C--cEEEEEEE Confidence 4332221100 0011111222 234667777776 56679999999986433 2 46789999 Q ss_pred CccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHH Q lcl|NC_021302. 152 PQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDEL 231 (484) Q Consensus 152 ~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~ 231 (484) +|.++........+.+.+..+...+ .+.....+|....|++++....+..+|.|.+..+....-.-... T Consensus 135 ~p~~Vtv~~~~~~~~~~y~~~~~~~-----------~~~~~~~~~~~eIiHir~~~~dg~~~G~Spi~~~~~~i~~~~aa 203 (518) T protein:vir:78 135 HPSRVAIKRNSRTGRYEYYFQAGAG-----------VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSS 203 (518) T ss_pred CCCceEEEEcCCCCEEEEEEEecCC-----------ccceeEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHH Confidence 9988764333233333332221111 12233457788888887666566678999999998887777788 Q ss_pred HHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCC-c--eEEEccCCceEEEecccCCchhHHHHHHHHH Q lcl|NC_021302. 232 IRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGE-S--AGLALTAGEEAGILSPNGTPLDPRRAIEYHD 308 (484) Q Consensus 232 ~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~-~--a~~vip~~~~ie~~~~~~~~~~~~~li~~~d 308 (484) .++...|... .++|-.+.+++...++++++++.+.+.+...|. + ..++++.|++++-++.+.....|.+..++.. T Consensus 204 ~~~~~~~f~N--g~~p~gvl~~~~~ls~e~~~~~k~~~~~~~~G~~nag~~~vL~~G~~~~~l~~~~~d~q~le~r~~~~ 281 (518) T protein:vir:78 204 RNATAAMWKN--AGRPNLVLRHEKRLSPEAQQRLREQFDRAHAGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNR 281 (518) T ss_pred HHHHHHHHhc--CCCccEEEecCCCCCHHHHHHHHHHHHHHhcCcccCCceeEcCCCceEEeccCChhHHHHHHHHHHHH Confidence 8888888875 377867777777788999999999998876553 2 3578899999988877666667888889999 Q ss_pred HHHHHHHhhhh-hcccccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC--C-CC Q lcl|NC_021302. 309 HQMALVALAHF-LNLDGKGGSYALASVQAD-TFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE--I-GS 383 (484) Q Consensus 309 ~~Isk~ilGqt-lt~~~~gGs~A~~evh~~-v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~--~-~~ 383 (484) .+|++++.-.. +....++++++-.+.+.. ....-+.-.++.|+..||+.|++. +. .. .+|+|+. . .. T Consensus 282 ~eIa~afgVPp~~lg~~~~st~sn~e~~~~~f~~~tL~P~~~~ie~eln~~L~~~-----~~-~~--~~~~fd~~~Llr~ 353 (518) T protein:vir:78 282 EEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQY-----WV-RK--NRMKFDIDDVIQP 353 (518) T ss_pred HHHHHHhCCCHHHhccCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-----cc-Cc--ceEEeechhhhcc Confidence 99999876543 222233456665444443 335567888999999999877653 11 11 2566653 2 35 Q ss_pred cHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCC--CCccccc-----c---cCCC---cCCCcc-ccCCCCcccc Q lcl|NC_021302. 384 RQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPD--PDADDDE-----S---TADT---GQDEPE-TDEPALPNTS 449 (484) Q Consensus 384 ~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~--~~e~~~~-----~---~~~~---~~~~~~-~~~~~~~~~~ 449 (484) |.+..++++.++++.|+.. .+++|+.+|+|.-+ .++.... + ...+ +++.+. +.....+... T Consensus 354 D~~~r~~~~~~~~~~G~lT-----~NE~R~~~gl~pie~~~gD~~~v~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~ 428 (518) T protein:vir:78 354 DWEAKSESTQKMVNSGVAT-----PNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVAS 428 (518) T ss_pred CHHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCCCCceeeecccceecccccccccCCCCCCCCCCCCcccccc Confidence 7788999999999999764 47899999998543 2222111 0 0000 000000 0000111111 Q ss_pred ccc-cccccccccccccccchHHHhcCcccCc--ccCC Q lcl|NC_021302. 450 GTT-STTNAPQARKRPRGRSPRDRRKTPDGAM--PLWD 484 (484) Q Consensus 450 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~ 484 (484) ++. .+.....+.+...++.......+++..+ +.=+ T Consensus 429 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 466 (518) T protein:vir:78 429 LDQSPPASVPGLSPTNSDRSTDSGKTEPRRLMQKPPPK 466 (518) T ss_pred cccCccccCCCCCcccccccccccccchhcccCCCCcc Confidence 111 1112222222222233332333333322 1111 No 16 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=99.86 E-value=2.1e-20 Score=128.51 Aligned_cols=402 Identities=11% Similarity=-0.013 Sum_probs=243.2 Q ss_pred CCCCCCCccceeeeecccccchhhhh-hhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLA-QGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNG 79 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~ 79 (484) .+++.........-.......+..+. .+... ...+..+...+-+.|.+|+..+-..|.++++.+.... T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-----------~~~v~~~~al~~~~v~~ci~~ia~~iA~lp~~~~~~~ 79 (422) T protein:vir:13 11 KNNNDEKRSNYDEDIGIDISDSNFWEKFGIKL-----------NFSVRGKRALKENTVYVCTKIRAESIGKLSLKIYKDK 79 (422) T ss_pred cCCccchhhhhhhccccccCcchhhhhccccC-----------CcccchhhhhccHHHHHHHHHHHHhhhhCceEEEecC Confidence 11111111100000000000000000 01100 1112222222457899999999999999999996543 Q ss_pred CCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceee Q lcl|NC_021302. 80 ARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAY 158 (484) Q Consensus 80 ~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~ 158 (484) +..+. ..+...|.. +-.....+.++++.++ +.+.+|-+.+++++... | .+..|.+++|.++. T Consensus 80 ~~~~~-~~~~~lL~~------------~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~-G--~~~~L~~i~~~~v~- 142 (422) T protein:vir:13 80 EEYKE-HELYYLLRY------------KPNPLMSSINFWKCLETQRTLKGNAYAYIERDRK-G--KIIGLYPINSDNVT- 142 (422) T ss_pred ccccc-chHHHHHhh------------hcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEEEEEECCcceE- Confidence 32111 012222211 1112335677888876 57889999999987543 3 37789999999886 Q ss_pred eeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 159 WNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAA 238 (484) Q Consensus 159 ~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f 238 (484) ...+.+|.+... +.........++....++++..|++++....+.++|.|.+..+....-.-....++...| T Consensus 143 ~~~~~~~~~~~~--------~~~~y~~~~~~g~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~ 214 (422) T protein:vir:13 143 KIIDDDNFLSSL--------SKVWYVVTDKNGKEHKLLPDEMLHFIGDITLDGLIGIKPLDYLRCTIENGRATQEFINKF 214 (422) T ss_pred EEEcCCcceecc--------ceEEEEEEeCCCeEEEEcccceEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHH Confidence 445555543211 111112222344556788999999987767777899999999998887777777788888 Q ss_pred HHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCC---ceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHH Q lcl|NC_021302. 239 IRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGE---SAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVA 315 (484) Q Consensus 239 ~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~---~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~i 315 (484) ... .+.|-.+.+++...++++++++.+.+.++.+|. ...++++.|++++-++.+.....|.+..++...+|++++ T Consensus 215 f~n--g~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~f 292 (422) T protein:vir:13 215 FKN--GLSIKGIVQYVGDLDEKAKKIFKKEFESMSNGLENAHSISLLPFGYQFQPISLSMADAQFLENSKLTKRELAATF 292 (422) T ss_pred Hhc--cCCccEEEEeCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHh Confidence 885 267877777878888999999999999887653 236889999999888776666678888999999999997 Q ss_pred hhhh-hcccccccchhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC---CCcHHHHHH Q lcl|NC_021302. 316 LAHF-LNLDGKGGSYALASVQA-DTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI---GSRQDATAA 390 (484) Q Consensus 316 lGqt-lt~~~~gGs~A~~evh~-~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~---~~~~~~~ae 390 (484) .-.. +....++++++..+-+. .....-+.-.++.|+..||+.|++..-.. .+ .+|+|+.. ..|.+..++ T Consensus 293 gVpp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~----~g--~~i~fd~~~l~r~d~~~~~~ 366 (422) T protein:vir:13 293 GMKSYHLNDLERATFNNLTEQQKDFYVTTLQSSLTVYEQEIQDKLFSQYETL----QD--VKAEFNVDTILRSDIKTRYE 366 (422) T ss_pred CCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCChhhhc----CC--ceEEeechhhhcCCHHHHHH Confidence 6654 33333445555444333 34456677888999999998887753221 11 24566432 247788999 Q ss_pred HHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccc Q lcl|NC_021302. 391 ALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGT 451 (484) Q Consensus 391 ~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (484) +++++.+.|+.. .+++|+.+|+|.-++++....+.--.................|+ T Consensus 367 ~~~~~~~~G~~T-----~NE~R~~~gl~p~~ggD~~~~~~n~~~l~~~~~~~~~~g~~~g~ 422 (422) T protein:vir:13 367 AYRIGIQGGFIE-----ANEARRRENLPPVEGGDRLLVNGNMIPIEMAGEQYKKGGEKGGK 422 (422) T ss_pred HHHHHHhCCCcC-----HHHHHHHhCCCCCCCcCeeeeccCccchhhcccccccCCCcCCC Confidence 999999999764 47899999998766555433221000000000111111111111 No 17 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=99.85 E-value=1.4e-19 Score=123.97 Aligned_cols=425 Identities=14% Similarity=0.059 Sum_probs=244.2 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHH-HHHHhcchHHHHHHHHHHHHhhCCCcEEecCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTY-TRMCREEARIASVLRAIGLPIRRTDWRIRPNG 79 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y-~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~ 79 (484) -+.+...+..+. ... ++.. .+.++ ...... .+..+..+- +..+ +=+.|.+|+..+...|.+++|.|.... T Consensus 7 ~~~~~~~~~~~~---~~~-~~~~-~~~~~-~~~~~g--~~~~g~~v~~~~al-~~~~V~~~v~~Ia~~iA~lp~~~~~~~ 77 (454) T protein:vir:93 7 RTRKNQKSGRDV---REA-GWTS-LFQAV-AEPFAG--AWQQGVKADPEAVL-SFHAVFACISLISQDIAKMRLRLMQTD 77 (454) T ss_pred cCcccccccccc---cch-hhhh-hhhhh-hhhhcc--hhhcCcccChHHhh-ccHHHHHHHHHHHHhhccCceEEEEec Confidence 111111110000 000 0000 00000 000000 001111122 2333 467899999999999999999985432 Q ss_pred CC---HHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccc Q lcl|NC_021302. 80 AR---PEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSS 155 (484) Q Consensus 80 ~~---~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~ 155 (484) .+ .+........|. .+-....+..++++.++ +.+.+|-+++++++... | .+..|.+++|.+ T Consensus 78 ~~g~~~~~~~~~~~~L~------------~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~-G--~~~~L~~i~~~~ 142 (454) T protein:vir:93 78 AQGIRRETRRGDIARLC------------RRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNAR-G--QIKELRILDWNR 142 (454) T ss_pred cCCccchhhhHHHHHHH------------hcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCC-C--cEEEEEEEcCcc Confidence 22 111111111110 11112334667788776 67889999999987543 3 367899999998 Q ss_pred eeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 156 IAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIE 235 (484) Q Consensus 156 ~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w 235 (484) +. ...+.+|.+.+....... ........++....|++++....+..+|.|.+..+....-.-....++. T Consensus 143 v~-v~~~~~g~~~y~~~~~~~----------~~~~~~~~~~~~eViH~k~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~ 211 (454) T protein:vir:93 143 VE-PLVADDGEVFYRITPDRN----------CGITEAVTVPAREVIHDRFNCFFHPLIGLPPVYAAGLAATQGHHIQENS 211 (454) T ss_pred eE-EEEcCCCcEEEEEEeccc----------cccceeEEecCcceEEeccCCCCCCceeccHHHHHHHHHHHHHHHHHHH Confidence 86 345666765543221111 0112345677888888887777778899999999988888888888888 Q ss_pred HHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCce--EEEccCCceEEEecccCCchhHHHHHHHHHHHHHH Q lcl|NC_021302. 236 AAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESA--GLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMAL 313 (484) Q Consensus 236 ~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a--~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk 313 (484) ..|... .+.|-.+.+++...++++++++.+.+.++.+|.++ .++++.|++++-++.+.....|.+..++...+|++ T Consensus 212 ~~~f~n--g~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~ 289 (454) T protein:vir:93 212 TSFFRN--GGRPSGVIEIPGSITEENAKKLKSNWDSGYTGENAGKTAILSNGAKYNPTTFSPVDSQTVEQLKMTAEIVCS 289 (454) T ss_pred HHHHhc--cCCccEEEecCCCCCHHHHHHHHHHHHHHhcccccCCceeccCCceEEEcccChhHHHHHHHHHHHHHHHHH Confidence 888774 36787777887788899999999999999877544 57889999998888766666788888899999999 Q ss_pred HHhhhhhc-ccccccchhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC--C-CCcHHHH Q lcl|NC_021302. 314 VALAHFLN-LDGKGGSYALASVQA-DTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE--I-GSRQDAT 388 (484) Q Consensus 314 ~ilGqtlt-~~~~gGs~A~~evh~-~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~--~-~~~~~~~ 388 (484) ++.-...- ...++++++-.+-+. .....-+.-.++.|+..||+.|+.. ... .++|+. . ..|.+.. T Consensus 290 ~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~L~~~--------~~~--~~~f~~~~ll~~D~~~r 359 (454) T protein:vir:93 290 VFRVPAYKIGVGQPPSSDNVEALEQQYYSQCLQTLIESIELLLDEALETG--------ENE--STEFDVTTLLRMDSERR 359 (454) T ss_pred HhCCCHHHcCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC--------CCc--EEEeechhhhccCHHHH Confidence 96655422 233345565444443 3455667788888999998866531 122 455543 2 3577889 Q ss_pred HHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccC-----CCcCCCccccCCCCcccccccccc----cc-- Q lcl|NC_021302. 389 AAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTA-----DTGQDEPETDEPALPNTSGTTSTT----NA-- 457 (484) Q Consensus 389 ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~----~~-- 457 (484) ++++.++.+.|+.. .+++|+.+|+|.-+++++...... ..++...... +....+++... .. T Consensus 360 ~~~~~~~~~~G~~T-----~NE~R~~~gl~pi~ggD~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~d 431 (454) T protein:vir:93 360 MKTLGDAVKNTLLT-----PNEARKRENLPPLAGGDALYLQQQNYSLEALSRRDARED---PFASSGKTASVPQAVAASD 431 (454) T ss_pred HHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCCCeeeeccCccchHhhhccCcccC---CCCCCccCCCCCCCCCCCC Confidence 99999999999754 488999999987766554321110 0011110100 00001111000 00 Q ss_pred ccccccccccchHHHhcCcccCc Q lcl|NC_021302. 458 PQARKRPRGRSPRDRRKTPDGAM 480 (484) Q Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~ 480 (484) ..........+..++........ T Consensus 432 ~~~~~~e~~~d~~~~~~~~~~~~ 454 (454) T protein:vir:93 432 GNKAITETEHDAVKAMFRGILKK 454 (454) T ss_pred CCCCccCCccchhhhhhhhhhcC Confidence 00111111222222222222222 No 18 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=99.84 E-value=8.6e-21 Score=130.59 Aligned_cols=445 Identities=12% Similarity=0.049 Sum_probs=236.3 Q ss_pred CCCCCCCccceee-----------eeccccc------chhhhhhhccccccccccccc---chHHHHHHHHhcchHHHHH Q lcl|NC_021302. 1 MAPKTVAPRTERG-----------YVNPLAG------FGTFLAQGLDQFEQVDELRWP---NSVYTYTRMCREEARIASV 60 (484) Q Consensus 1 ~~~~~~~~~~~~~-----------~~~~~~~------~~~~~~~~~~~~~~~~~lr~~---~~~~~y~~m~~~D~~v~s~ 60 (484) +|=...+-.+-+. ++.|-.. +.+.-..+- +......+..+ -.+.++.++..+.+.|.+| T Consensus 52 ~~~~~~~~~~~~~~~~~~~~kk~~i~~pfkkk~~~~~~d~f~~s~e-s~s~vtsls~pdaf~~vnVs~~~AlknsaV~sc 130 (945) T protein:vir:10 52 LAWNSTVVYSIIIFRKNQVLKKEKIIVPYNHQEPPFKFNLFEYSPE-SLMYLPSISDPDAFFLINLFRKYRFNNDSKLIK 130 (945) T ss_pred hhccceeeeeeeeehhhhHHHhhcccccccccccchhhhhhhccCc-cceecccccCccceeeehhhhhhhhccHHHHHH Confidence 1000000000000 0111000 000000000 00000011111 1234566666678999999 Q ss_pred HHHHHHHhhCCCcEEecCCCCH---HHH------HHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcce Q lcl|NC_021302. 61 LRAIGLPIRRTDWRIRPNGARP---EVV------EHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHA 130 (484) Q Consensus 61 l~~r~~~v~~~~~~v~p~~~~~---e~~------~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s 130 (484) +..+...|.++++++-...++. +.. ..+...+..+ + ........|..+++.++ +.+.+|-+ T Consensus 131 I~~IA~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~rP--N-------p~mT~~eFwqsFl~~Lv~dLLL~GNA 201 (945) T protein:vir:10 131 VSEIPKKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLERP--D-------PYFSEVNSWEYLLGMVLDDILTIDRG 201 (945) T ss_pred HHHHHhhhccCceEEEEecccCcccccccccccchHHHHHHhCC--C-------cccChhHHHHHHHHHHHHHHhhcCCe Confidence 9999999999999984322111 100 1111222111 0 00011123556777764 78999999 Q ss_pred eeeEEEeecCCeeeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccC Q lcl|NC_021302. 131 VFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPG 210 (484) Q Consensus 131 ~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~ 210 (484) ..++++... | .+..|.+.+|.++. ...+.+|++....+.. ..+.....+++...|+|++....+ T Consensus 202 YieIiRd~~-G--~ii~L~pLdPs~Vt-i~~ddDG~~~y~Yv~~------------idG~~~~~v~a~DvIlhirn~s~D 265 (945) T protein:vir:10 202 AIVKIRDEQ-G--NLVAITPVDGTTIK-PILSEDTGIVVGYVQE------------VDGAIVAHFDKRDVVLFRQNLTPD 265 (945) T ss_pred EEEEEECCC-C--cEEEEEEECCcceE-EEEcCCCcEEEEEEEe------------cCCceEEEecCCceEEEeccCCCC Confidence 999986533 4 35688999999885 4566667654322111 122233456777888888665433 Q ss_pred ---ccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEe----------cCCCCCCHHHHHHHHHHHHHHhcCCc Q lcl|NC_021302. 211 ---VWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKG----------NEADSEDDDRMDELLEIASNYSGGES 277 (484) Q Consensus 211 ---~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~g----------k~~~~~~~~~~~~l~~~l~~~~~g~~ 277 (484) ..||.|.+..+....-.-....++-+.+..+. ...|-.+. +++...++++++++.+.+.+..+|.+ T Consensus 266 G~~~GyGlSPIeaa~~aI~~alAaek~aar~FskN-Ga~PsGILsvkg~~~~d~k~~~~LseEq~erlKe~wee~~sG~N 344 (945) T protein:vir:10 266 VYMYGYSLPPIEILYKVILSDIFIDKGNLDYYRKG-GSIPEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQAIMMGDY 344 (945) T ss_pred cccccCCchHHHHHHHHHHHHHHHHHHHHHHHHhC-CCccceEEEecCccccccccccccCHHHHHHHHHHHHHHhCCcc Confidence 34688889988877777666777666666543 23563332 23345578888999999998877755 Q ss_pred eE--EEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhh-cccccccchhhHHHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_021302. 278 AG--LALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFL-NLDGKGGSYALASVQADTF-VQSVQTVADEIRDV 353 (484) Q Consensus 278 a~--~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtl-t~~~~gGs~A~~evh~~v~-~~~~~aD~~~i~~~ 353 (484) ++ ++++.|++++-++.+.....|.+..++..++|+++..-..- ....++++++..+.+...+ ..-+.-.+..|+.. T Consensus 345 nG~piVLdeGmef~pLs~s~~DaQfLEsrkfs~eeIArAFGVPP~lLG~~e~st~SNiEqq~~~Fv~~tL~Pil~~IEqe 424 (945) T protein:vir:10 345 TQVPILSGGKFTWIDFKGKRRDMQFKELAEFVARKICAVYQVSPQDVGILEGSNKATAEVMASLTKAKGLEPLMATISKG 424 (945) T ss_pred cccceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcchHHHHHHHHHHHHHHHHHHHHHHH Confidence 44 46789988887776655567888899999999999766532 2223345555555555555 46688999999999 Q ss_pred HHHHHHHHHHHhCCCCccccceEEecCCC-CcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCccccccc-- Q lcl|NC_021302. 354 AQAHVVEDIVDVNWGEDEPAPLLVFDEIG-SRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDEST-- 430 (484) Q Consensus 354 ln~qli~~l~~~Nf~~~~~~P~~~~~~~~-~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~-- 430 (484) ||+.|++. . .+..-+|.|+... .+.+.++++++++.+.|+.. .+++|+.+|+|+-++++....+. T Consensus 425 LNrkLl~~------~-eg~~i~fdFd~ldl~D~ksraEal~kli~sGiLT-----iNEvRe~lGLpPIeGGD~lli~~nn 492 (945) T protein:vir:10 425 FDEVVSEF------R-NEKDIKLWFKEDDLEKERDWWNIIQGQLNTGFRS-----INEARMEKGLEPVPWGDVPFSGLRN 492 (945) T ss_pred HHHhcccc------c-cCceeEEEecchhccCHHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCcceeeecccc Confidence 99865431 1 1122366675433 46678999999999999864 47899999998776665543211 Q ss_pred ---CCCc------CCCcc--ccCCCCccccccccc--cccc--------cccccccccchHHHhcC--cccCcccCC Q lcl|NC_021302. 431 ---ADTG------QDEPE--TDEPALPNTSGTTST--TNAP--------QARKRPRGRSPRDRRKT--PDGAMPLWD 484 (484) Q Consensus 431 ---~~~~------~~~~~--~~~~~~~~~~~~~~~--~~~~--------~~~~~~~~~~~~~~~~~--~~~~~~~~~ 484 (484) .+.. ..+++ +...+.+...+.... +..+ +..+.-..+....+... .-+..+=-| T Consensus 493 ~~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGGe~dEns~~psE~kda~~e~~~~l~~~~~~~a~e~i~~~~e~~~~~ 569 (945) T protein:vir:10 493 WKPEDEQAKAQQGAMPPQLAQAMADQPSQQGGGVDENSSVPSEQKNAGLEVLRNLFKSLDANASENLKQVIELTNDD 569 (945) T ss_pred ccccccccccccCCCCcccccCCCCCCCCCCCCCCCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Confidence 0000 00000 000011111111100 0000 00000000111111111 111122222 No 19 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=99.84 E-value=1.1e-19 Score=124.46 Aligned_cols=398 Identities=11% Similarity=0.026 Sum_probs=241.4 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) .-++......... .......++.+.... .+..+ ..+..+ +=+.|.+|+..+...|.+++|++-...+ T Consensus 7 f~~~~~~~~~~~~----~~~~~~~~~~~~~~~-------~~~~v-~~~~al-~~~~v~~~i~~Ia~~ia~l~~~~~~~~~ 73 (416) T protein:vir:12 7 FEKRSGSSDHEDG----FNNILLNMFGGRKTA-------SGERV-SESNSL-VQPDIFACVNVLSDDIAKLPIHTYKRTD 73 (416) T ss_pred cccccCccccCcc----chhHHHHhhcCcccc-------cCcee-chhhhh-ccHHHHHHHHHHHHhhhhCceEEEEecC Confidence 2222211111100 000000111110000 00001 122343 4688999999999999999999843222 Q ss_pred CH--HHHH-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccce Q lcl|NC_021302. 81 RP--EVVE-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSI 156 (484) Q Consensus 81 ~~--e~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~ 156 (484) +. ++.+ -+...|. .+.....++.++++.++ +.+.+|-+.+++++... | .+..|.+.+|.++ T Consensus 74 ~~~~~~~~~~l~~~l~------------~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~-G--~~~~L~~l~~~~v 138 (416) T protein:vir:12 74 GGIERKPEHKSAHAVY------------ARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSH-G--YPEALFPLRPDYT 138 (416) T ss_pred CccccccccHHHHHHH------------hhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEEEEEECCcce Confidence 11 1111 1111111 11112345677888876 56789999999886433 3 4788999999888 Q ss_pred eeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 157 AYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEA 236 (484) Q Consensus 157 ~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~ 236 (484) .. ..+.+++.+... ...++....+|...++++++.. .+.++|.|.+..++...-.-....++.. T Consensus 139 ~v-~~~~~~~~~~~~--------------~~~~g~~~~~~~~eiih~~~~~-~~~~~G~s~i~~~~~~i~~~~~~~~~~~ 202 (416) T protein:vir:12 139 NA-YVHPTTGMLWYQ--------------TVLNGKAIELYDYEVLHFKGLS-TDGIHGKSPIGVVREHIGAQAAATKYNA 202 (416) T ss_pred EE-EEeCCCcEEEEE--------------EecCCeEEEecCccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 64 344454443222 1223345678888888887654 4558999999999988888888888888 Q ss_pred HHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHh Q lcl|NC_021302. 237 AAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVAL 316 (484) Q Consensus 237 ~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~il 316 (484) .+.+. -+.|-.+.+++...++++++++.+.+..+.++. ..++++.|++++-++.+.....|.+..++..++|++++. T Consensus 203 ~~~~n--g~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~-~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fg 279 (416) T protein:vir:12 203 KLYKN--EATPRGILKVPAFLDEKPKENVRKEWKRVNKVE-NIAIIDYGLEYQSISMPLQEAQFVESMKFNKAQISMIYK 279 (416) T ss_pred HHHhc--CCCCceEEecCCCCCHHHHHHHHHHHHHHhcCC-CeeecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhC Confidence 88885 367877778888889999999999998876553 578899999999888776666788999999999999976 Q ss_pred hhh-hcccccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC---CCCcHHHHHHH Q lcl|NC_021302. 317 AHF-LNLDGKGGSYALASVQAD-TFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE---IGSRQDATAAA 391 (484) Q Consensus 317 Gqt-lt~~~~gGs~A~~evh~~-v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~---~~~~~~~~ae~ 391 (484) -+. +.....+++++-.+-... ....-+.-.++.|+..||+.|+...-.. .. ..|+|+. ...|.++++++ T Consensus 280 VPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~l~~~~~~~----~g--~~i~fd~~~l~~~d~~~~~~~ 353 (416) T protein:vir:12 280 VPLHKLNELDKATFSNIEHQSIEYVRNTLQPWIVNFEQELNVKLFLDHDQK----SG--HYVKFNIDSELRGDSKTQAEY 353 (416) T ss_pred CCHHHhCCccCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCchhhc----CC--ceEEeechhhhccCHHHHHHH Confidence 654 333334566765554443 3466778889999999998777532211 11 2466643 23478889999 Q ss_pred HHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccC---CCcCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 392 LQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTA---DTGQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 392 ~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) ++++.+.|+.. .+++|+.+|+|+-+++++...+.- .....+.+. +.. +...++......+ T Consensus 354 ~~~~~~~G~~T-----~NE~R~~~gl~Pi~ggd~~~~~~n~~~~~~~~~~~~--~~~-~~~~~gge~~~~g 416 (416) T protein:vir:12 354 LKTLHETGVLN-----KDEIRELLERNPIENGDKYISSLNYVFLDFLEEYQR--LKA-GGAMKGGDNKNEG 416 (416) T ss_pred HHHHHhCCCcC-----HHHHHHHhCCCCCCCcceeeeccccccccccchhhc--ccc-ccccCCCCCcCCC Confidence 99999999864 478999999987766654432210 000000000 000 0000111111111 No 20 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=99.83 E-value=2.6e-19 Score=122.44 Aligned_cols=392 Identities=12% Similarity=0.061 Sum_probs=227.0 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHH-HHHHhcchHHHHHHHHHHHHhhCCCcEEecCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTY-TRMCREEARIASVLRAIGLPIRRTDWRIRPNG 79 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y-~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~ 79 (484) |.-=.-..++..++ ...+..+...+..... +..+. +..+ +-+.|.+|+..+-..|.+++|+++.. T Consensus 1 M~~f~~~~~~~~~~----~~~~~~~~~~~~~~~~--------~~~v~~~~al-~~~~V~~~v~~ia~~ia~~p~~~~~~- 66 (397) T protein:vir:38 1 MPLLKLNKSHSQGF----SLNDPDWVNFLTGGEA--------QKYVSADTAL-KNSDIFSLIMQLSGDLAMVRYTSESD- 66 (397) T ss_pred CcchhhhhcccCcc----cCCchhhhhhhcCCcC--------CceechHHhh-ccHHHHHHHHHHHHHHhhCccccccc- Confidence 21110000000000 0000111111111110 11112 2344 57899999999999999999987421 Q ss_pred CCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceee Q lcl|NC_021302. 80 ARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAY 158 (484) Q Consensus 80 ~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~ 158 (484) .. ...+. +......+.++++.+. +.+.+|.+++++++...+ .+..|.+++|.++.. T Consensus 67 ---~~----~~l~~-------------~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g---~~~~l~~l~~~~v~i 123 (397) T protein:vir:38 67 ---RS----QSIIS-------------NPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNG---VDLSWEYLRPSQVQP 123 (397) T ss_pred ---HH----HHHHh-------------cCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCC---cEEEEEEEcCceeEE Confidence 11 11111 1112335778888887 567799999999875432 467899999988753 Q ss_pred eeecCCCceeeee-cccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 159 WNVDRDGGLISIQ-QWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAA 237 (484) Q Consensus 159 ~~~~~dg~l~~~~-q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~ 237 (484) ..+.+|+.+... +.. ....+....+|....|++++....+..||.|.+..+....-......++... T Consensus 124 -~~~~~~~~~~y~~~~~-----------~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~ 191 (397) T protein:vir:38 124 -MLLQDGSGLIYNINFD-----------EPAIGYMENVPAADVIHIRLLSKNGGKTGISPLSALINEQQIKDASNELTLK 191 (397) T ss_pred -EEcCCCceEEEEEEec-----------cccccceeEecCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHH Confidence 445555433221 111 1112333568888888888887777789999999999988888888888888 Q ss_pred HHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCce--EEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHH Q lcl|NC_021302. 238 AIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESA--GLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVA 315 (484) Q Consensus 238 f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a--~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~i 315 (484) +... .++|-.+.+.+...++++.+.+.+.+....++.++ .++++.|++++-++.+.....|.+..++...+|++++ T Consensus 192 ~f~n--g~~~~~il~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~af 269 (397) T protein:vir:38 192 ALKQ--SVTASAVLTIQKGGLLDAETRIARSKEISKQIHNSDGPVVIDALEDYKPLEVKGNIASLLNQVDWTRDQIAKVY 269 (397) T ss_pred HHhc--cCCccEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceecCCCceEEecCCChhHHHHHHHHHHHHHHHHHHh Confidence 8885 36787777777778888888888888777665443 4778899988888776666678999999999999996 Q ss_pred hhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCCCcHHHHHHHHHHH Q lcl|NC_021302. 316 LAHFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIGSRQDATAAALQML 395 (484) Q Consensus 316 lGqtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~~~~~~~ae~~~~L 395 (484) .-..--.+...++++..+-.......-+.-.+..|+..||+.|++.. .+ .+.+ ....+.+..++++++| T Consensus 270 gVp~~~lg~~~~~~~~~e~~~~~~~~~l~P~~~~ie~~ln~~l~~~~-~~---------~~~~-~~~~d~~~~~~~~~~~ 338 (397) T protein:vir:38 270 GVPDSYLNGQGDQQSSITQISGQYAKSLNRYVQAIVGELNDKLHANI-SA---------NIRF-AIDAMGDQYASTISSS 338 (397) T ss_pred CCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhccChh-cc---------cccc-cccCCHHHHHHHHHHH Confidence 55432222112222222212233345667788888888888776641 11 1222 1234678889999999 Q ss_pred HhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccccccccccccccccc Q lcl|NC_021302. 396 VNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPR 465 (484) Q Consensus 396 ~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 465 (484) .+.|+.. .+++|+.+|+|.-+.++......... ...+..+...+........+....|. T Consensus 339 ~~~G~~t-----~nE~R~~lg~~p~~~~d~~~~~~~~~------~~~~~~~~~~g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 339 VKGGTIA-----GNQARFILQNSGYLAKDLPDPEKEPQ------QAIQLIQQEGGENDGNNSDERGSDPE 397 (397) T ss_pred HhCCCcC-----HHHHHHHhCCCCCCCCcccccccccc------ccccccccccCCCCCCCCCCCCCCCC Confidence 9999754 47899999997644433211110000 00000000000000000001111111 No 21 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=99.83 E-value=9.4e-20 Score=124.90 Aligned_cols=402 Identities=13% Similarity=0.066 Sum_probs=241.2 Q ss_pred CCCCCCCccceee-eecccccc--hhhhhhhcccccccccccccchHHH-HHHHHhcchHHHHHHHHHHHHhhCCCcEEe Q lcl|NC_021302. 1 MAPKTVAPRTERG-YVNPLAGF--GTFLAQGLDQFEQVDELRWPNSVYT-YTRMCREEARIASVLRAIGLPIRRTDWRIR 76 (484) Q Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~--~~~~~~~~~~~~~~~~lr~~~~~~~-y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~ 76 (484) +|+..+++.+... ..++.... +..+...+..... .+..+ .+..+ +=+.|.+|+..+-..|.+++|.+- T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~-------~g~~v~~~~al-~~~~V~~~i~~ia~~ia~lp~~~~ 80 (434) T protein:vir:43 9 LSSATSAPRSSLFGWGGKTIRLTDGAFWSQFLGRESS-------SGKKVTVDKAM-KLSAVWACVRLISTSVAGLPLGVY 80 (434) T ss_pred hhhcccccchhhhcccccccccCchHHHHHHhcCCcc-------CCceechhhhh-ccHHHHHHHHHHHHhhhhCceEEE Confidence 6776666665421 11111110 1111111111000 11112 22344 468999999999999999999984 Q ss_pred cCCCCH---HHHH-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeee Q lcl|NC_021302. 77 PNGARP---EVVE-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPR 151 (484) Q Consensus 77 p~~~~~---e~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r 151 (484) -...+. ++.+ .+...|.. +-....+..++++.++ +.+.+|-+..++.+ ++|. +..|.+. T Consensus 81 ~~~~~g~~~~~~~~~l~~lL~~------------~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~--~~G~--~~~L~~l 144 (434) T protein:vir:43 81 ERKADGSRVDARSFPLYDVVHN------------SPNDDMTAFQFWQAMVASMLLWGNAYAEIRR--AAGR--PAALDFL 144 (434) T ss_pred EEcCCCccccccccHHHHHHhc------------cCCCCCCHHHHHHHHHHHHhhcCCeEEEEEe--CCCc--EEEEEEE Confidence 322211 1111 11122210 1112234566777775 67889999888754 3454 6789999 Q ss_pred CccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHH Q lcl|NC_021302. 152 PQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDEL 231 (484) Q Consensus 152 ~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~ 231 (484) +|.++. +..+.+|.+.+..+. ..+....++...+++.++.+ .+..+|.|.+..+....-.-... T Consensus 145 ~p~~v~-~~~~~~g~~~y~~~~--------------~~g~~~~~~~~eVih~~~~~-~dg~~G~spi~~~~~~i~~~~~~ 208 (434) T protein:vir:43 145 LPSRVD-LECDENGRLKYFYTT--------------KKGARREIERTNMLHIPAFT-LDGRIGLSAIRYGVDVFGSVMSA 208 (434) T ss_pred cCcceE-EEEcCCCeEEEEEEe--------------cCceEEEEccccEEEecCcC-CCCccccCHHHHHHHHHHHHHHH Confidence 999886 556677776544321 23345678888888887664 45588999999999888887888 Q ss_pred HHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCce--EEEccCCceEEEecccCCchhHHHHHHHHHH Q lcl|NC_021302. 232 IRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESA--GLALTAGEEAGILSPNGTPLDPRRAIEYHDH 309 (484) Q Consensus 232 ~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a--~~vip~~~~ie~~~~~~~~~~~~~li~~~d~ 309 (484) .++-..|... .+.|-.+.+++...++++++++.+.++++..+.++ .++++.|++++-++.+.....|.+..++..+ T Consensus 209 ~~~~~~~f~n--g~~~~gil~~~~~l~~e~~~~~r~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~ 286 (434) T protein:vir:43 209 EDAANGTFKN--GLLPTVAFKVDRILQPAQREEFREYVKSVSGAMNSGRSPVLEQGITPETIGINPVDAQLLETREHGVI 286 (434) T ss_pred HHHHHHHHhc--cCCcceEEecCCCCCHHHHHHHHHHHHHhcCccccCCccccCCCceEEEccCChhHHHHHHHHHHHHH Confidence 8888888874 36787777888888888889999988887554443 4679999998888766666678899999999 Q ss_pred HHHHHHhhhh-hcccccccch--h-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC---C Q lcl|NC_021302. 310 QMALVALAHF-LNLDGKGGSY--A-LASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI---G 382 (484) Q Consensus 310 ~Isk~ilGqt-lt~~~~gGs~--A-~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~---~ 382 (484) +|++++.-.. +....++++. + ..+........-+.-.++.|+..||+.|+..--..+ + .|+|+.. . T Consensus 287 ~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~~~f~~~~L~P~~~~ie~~ln~kL~~~~~~~~------~-~~~fd~~~llr 359 (434) T protein:vir:43 287 EICRWFGVPPWMIGQTDKGSNWGTGLEQQMLAFLTFSISSITNQIQQCVNKRLLTAPERIR------Y-YAEFSLEGFLK 359 (434) T ss_pred HHHHHhCCCHHHhCCCcCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhcC------c-eEEEechhhhc Confidence 9999976653 3223333332 2 223333445556788888899999887665321111 1 4566532 3 Q ss_pred CcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCccccccc----CCCc--CCCccccCCCCccccccccccc Q lcl|NC_021302. 383 SRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDEST----ADTG--QDEPETDEPALPNTSGTTSTTN 456 (484) Q Consensus 383 ~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~----~~~~--~~~~~~~~~~~~~~~~~~~~~~ 456 (484) .|.+..++++.++.+.|+.. .+++|+.+|+|.-++++....+. .+.. ...++......++.++.+.+.. T Consensus 360 ~d~~~r~~~~~~~~~~G~~T-----~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 360 ADSAGRAAWYSTMAQNGFMT-----RNEGRRKENLPELPGGDILTVQSNLVPIDQLGQSNKSQAVRAALMNWFSQPEPQE 434 (434) T ss_pred cCHHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCCCeEeeccCccchhhhhccCCCcchhhhhhccCCCCCCCC Confidence 47888999999999999864 47899999998765554432211 0000 0000000000011111110000 No 22 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=99.83 E-value=5.5e-19 Score=120.68 Aligned_cols=443 Identities=11% Similarity=0.074 Sum_probs=222.4 Q ss_pred CCCCCCCccc--eeeee-ccc----ccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCc Q lcl|NC_021302. 1 MAPKTVAPRT--ERGYV-NPL----AGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDW 73 (484) Q Consensus 1 ~~~~~~~~~~--~~~~~-~~~----~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~ 73 (484) =||.+-.+-. ..... .+. ...|.....+-...... ..-+-.+..+.++....++|..|+..+...|.+++| T Consensus 38 ~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~g~~~~--~epp~d~~~l~~l~~~np~V~~aI~iia~~ia~l~~ 115 (648) T protein:vir:79 38 EAPGAMPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGGGGRDF--EEPEFDFNEITSAYNTEGYVRQAVDKYIEMMFKADW 115 (648) T ss_pred CCccccCCCCcccccccccchhHHHHHhHHHHHhhcCCcccc--ccCCcCHHHHHHHHhcChHHHHHHHHHHHHHhhCcc Confidence 2222211100 00000 000 00010000000000000 001223444444555789999999999999999999 Q ss_pred EEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCee--------- Q lcl|NC_021302. 74 RIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRF--------- 143 (484) Q Consensus 74 ~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~--------- 143 (484) .|.+.++...........+ .+.....+..++++.++ +.+.||.+.+|++...+++.. T Consensus 116 ~i~~~~~~~~~~~~~~~ll-------------~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~ 182 (648) T protein:vir:79 116 DFVSKNPNAVEYIRMRFTL-------------MAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVG 182 (648) T ss_pred eEEecCCccchhhHHHHHh-------------hccCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCCCccchhhhhhhhc Confidence 9987654321111111111 11112335567777665 577899999999876554321 Q ss_pred ---eeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHH Q lcl|NC_021302. 144 ---WLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRP 220 (484) Q Consensus 144 ---~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~ 220 (484) ....+.+.+|.++. ...+++|.++..... ...+.....+++...|++++....+.+||.|.+.. T Consensus 183 ~~~~v~~l~pl~p~~v~-v~~d~~g~~~~Y~y~------------~~g~~~~~~~~~~dIIHik~~~~~d~~~GlSpi~~ 249 (648) T protein:vir:79 183 DSMPVAGYFPLNLASMK-VKRDKFGMIKGWQQE------------QEGQDKPQKFKPEDIVHIYYKREKGRAFGTPWLLP 249 (648) T ss_pred cccceeeeEeecCceeE-EEEcCCCceeeeEEE------------ecCCceeEEecCccEEEEccCCCCCCceeccHHHH Confidence 13456677777764 455666654432211 11123345677888888887777888999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCC-CHHHHHHHHHHHHHHhcCCceEEEccCCceEEEec--ccCCc Q lcl|NC_021302. 221 AYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSE-DDDRMDELLEIASNYSGGESAGLALTAGEEAGILS--PNGTP 297 (484) Q Consensus 221 ~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~-~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~--~~~~~ 297 (484) |....-.-.....+...|.... +.|-.+.+++.+. ..+..+++.+.+... .....+.+.+...+.+. ..++. T Consensus 250 a~~aI~l~~aa~~~~~~fF~NG--a~P~gil~~~~~~~~~e~~k~~~e~~~~~---~~~~~i~gg~v~~~~~~i~~~~s~ 324 (648) T protein:vir:79 250 ALDDIRALRQVEENVLRLVYRN--LHPLWHVKVGLEQEGFGAEEGEVDLVRGE---VENMDVEGGMVTTERVNISSIASN 324 (648) T ss_pred HHHHHHHHHHHHHHHHHHHhcc--CCccEEEEeCCCccchHHHHHHHHHHHHh---cccccccccccccceeeccccCCH Confidence 9998888888888989998863 5676666553322 222233333333322 12222222222222221 12222 Q ss_pred --hhHHHHHHHHHHHHHHHHhhhhhcc-cccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhCCC---Cc Q lcl|NC_021302. 298 --LDPRRAIEYHDHQMALVALAHFLNL-DGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIV-DVNWG---ED 370 (484) Q Consensus 298 --~~~~~li~~~d~~Isk~ilGqtlt~-~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~-~~Nf~---~~ 370 (484) ..|.+..++..++|+.++.-...-. ..++++++.++.....+...+..-+..++..++..+.+.+. ...|. .. T Consensus 325 ~dlqfle~rk~~~~eIa~aFgVPP~lLG~~~~ss~stae~~~~~~~~~i~~l~~~i~~~le~~~~~~ll~e~~l~~~l~~ 404 (648) T protein:vir:79 325 QIIDAKEYLKHFEQRAFTVLGVSELMMGRGGTASRSTGDNLSSDFKDRIKALQKVMATFINEFMVKEILMEGGFDPVLNP 404 (648) T ss_pred HHHHHHHHHHHHHHHHHHHhCCCHhHcccCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Confidence 2477778888899999976654322 33456677777766677777777777777777666555432 22221 11 Q ss_pred cccceEEecCCC-CcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCccccc--c--cCCCcCCCccccCCCC Q lcl|NC_021302. 371 EPAPLLVFDEIG-SRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDE--S--TADTGQDEPETDEPAL 445 (484) Q Consensus 371 ~~~P~~~~~~~~-~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~--~--~~~~~~~~~~~~~~~~ 445 (484) ....+|.|.... .+.+..++.+.++.+.|+.. .+++|+.+|+|.-+++..... . ........+....+.+ T Consensus 405 d~~ieF~~~~Llr~D~~~~a~~~~~l~~~GilT-----~NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~~~~~~~~~~~ 479 (648) T protein:vir:79 405 DDKVEFRFNEIDMDSKIKLENQAVFLYEHNAIS-----EDEMRELIGRDPVDDGEGRAKMHLQMVTIAQATALAALAPTP 479 (648) T ss_pred cceEEEeecccchhhHHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCCCCccccccccccchhccccccCCCCC Confidence 223467775543 35567788888999999754 578999999975444332110 0 0000000000000000 Q ss_pred ccccccccccccccccccccccchHHHhcCcccCc----------------ccCC Q lcl|NC_021302. 446 PNTSGTTSTTNAPQARKRPRGRSPRDRRKTPDGAM----------------PLWD 484 (484) Q Consensus 446 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~ 484 (484) + +.....+..+..+.....+....|...--.. -||. T Consensus 480 ~---~~~~~~a~~eg~~~e~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 531 (648) T protein:vir:79 480 A---GGSSASASGDKKKKATDNKTKPTNQHGTKTSPKKQTNGRHVRYMQEMLLEY 531 (648) T ss_pred C---CCCCCCccccccccccCCCCCCCCCCCcCCCCccccchhhhhhhhhhhhcc Confidence 0 0000001111111111111111221110011 1111 No 23 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=99.82 E-value=6.8e-19 Score=120.17 Aligned_cols=405 Identities=12% Similarity=0.015 Sum_probs=237.4 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) =+++++.. ++ .. .+.....++... ...+.+..+...+-+.|.+|+..+-..|.+++|.+.-..+ T Consensus 12 ~r~~~~~~--~~---~~---~~~~~~~~~g~~--------~~~~~v~~~~al~~~~v~~~i~~ia~~ia~l~~~~~~~~~ 75 (429) T protein:vir:10 12 KRQTSQVI--EL---NK---DDEKLLEWLGIS--------PSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDE 75 (429) T ss_pred ccCccccc--cc---CC---ChHHHHHHhcCC--------CCcceechhhhhccHHHHHHHHHHHHhhccCceEEEEecC Confidence 01111100 00 00 000011110000 0111222222234789999999999999999999843222 Q ss_pred CH--HHHH-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccce Q lcl|NC_021302. 81 RP--EVVE-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSI 156 (484) Q Consensus 81 ~~--e~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~ 156 (484) +. ++.+ -+...|.. +-.....+.++++.++ +.+.+|-+++++++... | .+..|.++++.++ T Consensus 76 ~~~~~~~~~~l~~lL~~------------~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~-G--~~~~L~~i~~~~v 140 (429) T protein:vir:10 76 YGIQRGTKHYLNNLLRL------------RPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRK-G--KVQALWPIDASKV 140 (429) T ss_pred CceeeccccHHHHHHHh------------hccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEEEEEEcCcee Confidence 21 1111 11122210 1112234667777776 46889999999986433 3 3678999999887 Q ss_pred eeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 157 AYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEA 236 (484) Q Consensus 157 ~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~ 236 (484) . +..++++.+.. .....+....++....+|+...|++++....+..+|.|.+..+....-.-....++.. T Consensus 141 ~-v~~~~~~~~~~---------~~~~~~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~ 210 (429) T protein:vir:10 141 T-VYIDDVGLLNS---------KTKMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFIN 210 (429) T ss_pred E-EEEcCcccccc---------cceEEEEEccCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 5 34444333211 0111223334455667899998888877777778999999999988888888888888 Q ss_pred HHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCC---ceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHH Q lcl|NC_021302. 237 AAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGE---SAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMAL 313 (484) Q Consensus 237 ~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~---~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk 313 (484) .+.+. -+.|-.+.+.+...++++++++.+.+..+..|. ...++++.|++++-++.+.....|.+..++..++|++ T Consensus 211 ~~~~n--g~~~~~il~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~ 288 (429) T protein:vir:10 211 NFYKQ--GLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIAT 288 (429) T ss_pred HHHhc--cCCccEEEEcCCCCCHHHHHHHHHHHHHHhccccccCceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHH Confidence 88885 356766677777788888899999888876542 2468899999998887665556688888899999999 Q ss_pred HHhhhhh-cccccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC--C-CCcHHHH Q lcl|NC_021302. 314 VALAHFL-NLDGKGGSYALASVQAD-TFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE--I-GSRQDAT 388 (484) Q Consensus 314 ~ilGqtl-t~~~~gGs~A~~evh~~-v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~--~-~~~~~~~ 388 (484) ++.-... .....+|+++-.+-+.. ....-+.-.++.|+..||+.|+..--. . .+. +|+|+. . ..|.++. T Consensus 289 ~fgVP~~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~---~-~g~--~~~fd~~~ll~~d~~~~ 362 (429) T protein:vir:10 289 AFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSEL---D-KGF--YSKFNVDAILRADIKTR 362 (429) T ss_pred HhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhc---C-CCc--EEEeechhhhcCCHHHH Confidence 9766543 22333456654444433 456667888899999999877653211 1 112 455543 2 3477889 Q ss_pred HHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCC-CccccCCCCcccccccccccccccccccccc Q lcl|NC_021302. 389 AAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQD-EPETDEPALPNTSGTTSTTNAPQARKRPRGR 467 (484) Q Consensus 389 ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 467 (484) ++++++|.+.|+.. .+++|+.+|+|.-+.++....+. +-... ...+.........++ ....+.+- . T Consensus 363 ~~~~~~~~~~G~~T-----~NE~R~~~gl~p~~ggD~~~~~~-n~~~~d~~~~~~~k~g~~~~~-~~~~~~e~------~ 429 (429) T protein:vir:10 363 YEAYRTGIQGGFLK-----PNEARSKEDLPPEAGGDRLLVNG-NMLPIDMAGQAYLKGGDTNGE-VSKEGNEG------N 429 (429) T ss_pred HHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCcCeeeecc-cccchhhccccccCCCCCCCC-CCCCCCCC------C Confidence 99999999999764 47899999998655544433221 10000 000000000000000 00000010 0 No 24 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=99.82 E-value=6.4e-19 Score=120.34 Aligned_cols=389 Identities=10% Similarity=0.020 Sum_probs=228.9 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) .......+..+. . ......+..+. . .+..+ ..+..+ +=+.|.+|+..+-..|.+++|++...++ T Consensus 8 f~~~~~~~~~~~-~-~~~~~~~~~~~-----------~-~g~~v-~~~~al-~~~~v~~~v~~ia~~iA~lp~~~~~~~~ 71 (409) T protein:vir:84 8 FSGPSEERTLTK-I-SGIPSPAEDWA-----------M-HGDRP-GANSAM-TLGAFYACVTLLADTVASLSIDAYRKKD 71 (409) T ss_pred hcCCCccccccc-c-cccccccchhh-----------c-cCccc-chhhhh-ccHHHHHHHHHHHHhhhhCceEEEEecC Confidence 111110010000 0 00000000000 0 01111 133444 3578999999999999999999865433 Q ss_pred CHHHHH-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceee Q lcl|NC_021302. 81 RPEVVE-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAY 158 (484) Q Consensus 81 ~~e~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~ 158 (484) ..++.+ -+.+.|. .+......+.++++.++ +.+.+|-+++++.+.-.+| .+..|.+.+|.++.. T Consensus 72 ~~~~~~~~l~~lL~------------~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g--~~~~L~~l~p~~v~v 137 (409) T protein:vir:84 72 NVRIPVSPAPKLLE------------STPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEAN--RPTAIMPIHPDCIHV 137 (409) T ss_pred CcccccchHHHHhh------------ccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCC--ceEEEEEEcCceeEE Confidence 322111 1122221 11223446788888887 6778999998887654444 467888999988753 Q ss_pred eeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 159 WNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAA 238 (484) Q Consensus 159 ~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f 238 (484) ....+..+... .......+..+|.+.++++++....+..+|.|.+..+....-.-....++...| T Consensus 138 ~~~~~~~~~~~---------------~~~~~~~g~~~~~~dvih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~ 202 (409) T protein:vir:84 138 TDAKDEDGDWI---------------EPVYRIDGKVVPNHRIMHIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLRW 202 (409) T ss_pred EEcCCCcceEE---------------EEEecCCceEEchhhEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 32222211111 111123345678888888888777777899999999988887777788888888 Q ss_pred HHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhh Q lcl|NC_021302. 239 IRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAH 318 (484) Q Consensus 239 ~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGq 318 (484) ... .+.|-.+.+++...++++++++.+.......+....++++.|++++-++.+.....|.+..++..++|++++.-. T Consensus 203 f~n--g~~p~gil~~~~~l~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP 280 (409) T protein:vir:84 203 FRD--SANPSGILSSDADLTPDQVKQTQKQWIQSHHNRRLPAVMSAGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIP 280 (409) T ss_pred Hhc--CCCccEEEecCCCCCHHHHHHHHHHHHHHhccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCC Confidence 875 367867777777788888888888877766554457889999999888766555678888889999999986554 Q ss_pred h-hcccccccch--h-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEec--CC-CCcHHHHHHH Q lcl|NC_021302. 319 F-LNLDGKGGSY--A-LASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFD--EI-GSRQDATAAA 391 (484) Q Consensus 319 t-lt~~~~gGs~--A-~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~--~~-~~~~~~~ae~ 391 (484) . +.....+++. + ..+........-+.--++.|+..||+.|. .+. .++|+ .. ..|.++.+++ T Consensus 281 p~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~L~----------~g~--~i~fd~~~l~~~d~~~~~~~ 348 (409) T protein:vir:84 281 PHMIGDVEKSTSWGTGIEEQGINFVRHTLLPWLRCIEQALDTFLP----------RGQ--FVKFNVDGLMRGDVTARFTA 348 (409) T ss_pred HHHhCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----------CCC--eEEEechhhhccCHHHHHHH Confidence 3 2222222332 1 12222333455567778888888887541 122 34554 32 3578899999 Q ss_pred HHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccC--CCcCCCcccc--CCCCcccccccccccccc Q lcl|NC_021302. 392 LQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTA--DTGQDEPETD--EPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 392 ~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~--~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 459 (484) +.++++.|+.. .+++|+.+|+|+-++++....+.- ..+...+.+. .+.+.+.+ .+.+ T Consensus 349 ~~~~~~~G~~t-----~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~------~gn~ 409 (409) T protein:vir:84 349 YQMGLQNGIWS-----VNEVRAWEDAPPIPEGDIHLQPMNFVPLGYVPPEEPAQEPQPNSAT------EGNK 409 (409) T ss_pred HHHHHhCCCcC-----HHHHHHHhCCCCCCCcceeeecccccccccCCccccCcCCCCCCcc------CCCC Confidence 99999999754 478999999987655554332210 0000000000 00000000 0001 No 25 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=99.82 E-value=8.9e-19 Score=119.54 Aligned_cols=401 Identities=14% Similarity=0.066 Sum_probs=230.7 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) .+|..++.-...-...+.. +.....+.... ..+..+..+-.++-+.|.+|+..+-..|.+++|.|.-.+. T Consensus 17 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~s--------~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~ 86 (432) T protein:vir:10 17 FVPPDPVDIGGGQTFTPVN--ATARDLGIIIS--------DTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLTMYMRTP 86 (432) T ss_pred cCCccccccccccccccCc--chhhhhccccc--------ccCcccchhhhhcchHHHHHHHHHHHhhhhCceeEEEecC Confidence 2332221111000001100 00000000000 0112223332335789999999999999999999843222 Q ss_pred C--HHHHH-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccce Q lcl|NC_021302. 81 R--PEVVE-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSI 156 (484) Q Consensus 81 ~--~e~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~ 156 (484) + .++.+ -+...|. .+-.....+.++++.++ +.+.+|.+.+++++. +| .+..|.+++|.++ T Consensus 87 ~g~~~~~~~~l~~lL~------------~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~--~g--~~~~L~~l~~~~v 150 (432) T protein:vir:10 87 DGRKEAVNHPLYTLLL------------DGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT--DG--RIESLQYLANDRL 150 (432) T ss_pred CCcccccccHHHHHHH------------hcccccCCHHHHHHHHHHHHhhcCCeEEEEEec--CC--cEEEEEEEcCCce Confidence 2 11111 0111111 01112235667777665 678899999999873 45 3678889999887 Q ss_pred eeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 157 AYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEA 236 (484) Q Consensus 157 ~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~ 236 (484) . ...+.+|.+.+..+ ...+....++.+.++++++.+. +..+|.|.+..+....-.-....++-. T Consensus 151 ~-v~~~~~g~~~y~~~--------------~~~g~~~~~~~~~iih~~~~~~-dg~~G~spi~~~~~~i~~~~~~~~~~~ 214 (432) T protein:vir:10 151 T-ITTDTKGNTAYRYR--------------RTDGQMIDIPKQQIWKIMGYSL-DGENGLSAIRYGAQIFGTAIAAEAQAA 214 (432) T ss_pred E-EEEcCCCcEEEEEE--------------ecCceEEEEcCccEEEecCCCC-CCcccccHHHHHHHHHHHHHHHHHHHH Confidence 5 45566776554322 1233445688888888776544 347899999999987777777777777 Q ss_pred HHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHh Q lcl|NC_021302. 237 AAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVAL 316 (484) Q Consensus 237 ~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~il 316 (484) .|... .+.|-.+.+.+...++++++++.+.+....+. ...++++.|++++-++.+.....|.+..++...+|++++. T Consensus 215 ~~f~n--g~~~~gil~~~~~l~~e~~~~~~~~~~~~~na-g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afg 291 (432) T protein:vir:10 215 RAFRN--GQLQSVYYQIDRFLTDDQYDSFAKKVSGSVEA-GRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFG 291 (432) T ss_pred HHHhc--CCCcceEEecCCCCCHHHHHHHHHHHhhhhhC-CCceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhC Confidence 77764 36787777777788889999998888765432 2468899999998887766666788889999999999975 Q ss_pred hhh-hcccccccchhhHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC---CCcHHHH Q lcl|NC_021302. 317 AHF-LNLDGKGGSYALAS----VQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI---GSRQDAT 388 (484) Q Consensus 317 Gqt-lt~~~~gGs~A~~e----vh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~---~~~~~~~ 388 (484) -.. |....+.|+++.+. ........-+.-.++.|+..||+.|+.+-- . .. -+|+|+.. ..|.+++ T Consensus 292 VPp~~lg~~~~~t~~~~sn~e~~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~----~--~~-~~~~fd~~~ll~~d~~~r 364 (432) T protein:vir:10 292 VPPSMIGHSSAGTTSWGSGIESQQLGFLSMTLSPWLRRIEQSIALNLLSPAE----R--RR-YFADFDTSALLRADSAAR 364 (432) T ss_pred CCHHHcCCccCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCccc----c--Cc-eEEEeechhhhccCHHHH Confidence 543 33233334443222 222333346677778888888876665421 1 11 25666532 3578889 Q ss_pred HHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCC---ccccCCCCcccccccccccccccccc Q lcl|NC_021302. 389 AAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDE---PETDEPALPNTSGTTSTTNAPQARKR 463 (484) Q Consensus 389 ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~ 463 (484) +++++++.+.|+.. .+++|+.+|+|.-+++....... ..-.+. .++..+.+. .+++-.... .. +. T Consensus 365 ~~~~~~~~~~G~~T-----~NE~R~~~glppi~g~~~~~~~~-~~~~pl~~~~~~~~~~~~--~~~~~~~~~-~~-~~ 432 (432) T protein:vir:10 365 SSYYSQLVNNGLMT-----RDEAREIEGLPKLGGNAAVLTVQ-SAMVPLDSIGLQASPEPA--SGLGNQQQD-KV-SK 432 (432) T ss_pred HHHHHHHHhCCCCC-----HHHHHHHhCCCCCCCCcceEeec-CcccchhhhcccCCCCCC--CCCCCcccc-cc-cC Confidence 99999999999754 48999999998776543322110 000000 000000000 000000000 00 00 No 26 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=99.82 E-value=9.6e-19 Score=119.36 Aligned_cols=394 Identities=11% Similarity=0.042 Sum_probs=235.6 Q ss_pred CCCCCCCccceee-eecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCC Q lcl|NC_021302. 1 MAPKTVAPRTERG-YVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNG 79 (484) Q Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~ 79 (484) -..+...+.+... ..+.. +.+.....+ ..+-.+...+-+.|.+|+..+-..|.++++++...+ T Consensus 7 f~r~~~~~~~~~~~~~~~~-~~~~~~~~g---------------~~v~~~~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~ 70 (413) T protein:vir:48 7 FQRKSDAPVTTPAELAEAI-GLSYDTYTG---------------KRISSQRAMRLTAVYSCVRVLAESVGMLPCSLYKIS 70 (413) T ss_pred hccCccCCccchHHHHHhh-hcCcccccC---------------ceechhhhhccHHHHHHHHHHHHhhhhCceEEEEec Confidence 1111111111100 00000 000000000 001112222468899999999999999999986433 Q ss_pred CCHH--HHH-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccc Q lcl|NC_021302. 80 ARPE--VVE-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSS 155 (484) Q Consensus 80 ~~~e--~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~ 155 (484) ++.. +.+ -+...|. .+.....++.++++.++ +.+.+|-+.+++++. .| .+..|.++++.+ T Consensus 71 ~~~~~~~~~~~~~~lL~------------~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~--~g--~~~~L~~l~~~~ 134 (413) T protein:vir:48 71 GTLKTRVVDERLHKLVS------------AKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA--LG--EVVELLPIDPGC 134 (413) T ss_pred CCcceeecccHHHHHHH------------hhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC--CC--cEEEEEEEcCce Confidence 2211 111 1112221 01112335667777776 678899999988753 44 367898999988 Q ss_pred eeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 156 IAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIE 235 (484) Q Consensus 156 ~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w 235 (484) +. ...+.++..++..+. ..+....++...+++.++.. .+.++|.|.+..+....-.-....++. T Consensus 135 v~-~~~~~~~~~~y~~~~--------------~~g~~~~~~~~evih~~~~~-~d~~~G~s~i~~~~~~i~~~~~~~~~~ 198 (413) T protein:vir:48 135 VE-PKLNSQWQPVYQVTF--------------PDGSVDVLTQDEIWHVRTLT-LDGLVGLNPIAYAREAISLAAATEEHG 198 (413) T ss_pred EE-EEEcCCceEEEEEEe--------------cCceEEEEccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHH Confidence 75 456666665543322 12333457788888777654 456899999999998877777777777 Q ss_pred HHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCC-ce--EEEccCCceEEEecccCCchhHHHHHHHHHHHHH Q lcl|NC_021302. 236 AAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGE-SA--GLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMA 312 (484) Q Consensus 236 ~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~-~a--~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Is 312 (484) ..+... .+.|-.+.+.+...++++.+++.+.+.+...|. ++ .++++.|++++-++.+.....|.+..++...+|+ T Consensus 199 ~~~~~n--g~~p~gil~~~~~~~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia 276 (413) T protein:vir:48 199 ARLFGN--GAVTSGVLRTEQKLTPDAYERLKKDFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEIC 276 (413) T ss_pred HHHHhc--cCCcceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEeccCChhHHHHHHHHHHHHHHHH Confidence 777775 367877777777788899999999988876552 22 4788999999888766666678889999999999 Q ss_pred HHHhhhh-hcccccccchhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC---CCcHHH Q lcl|NC_021302. 313 LVALAHF-LNLDGKGGSYALASVQA-DTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI---GSRQDA 387 (484) Q Consensus 313 k~ilGqt-lt~~~~gGs~A~~evh~-~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~---~~~~~~ 387 (484) .++.-.. +....++++++-.+-+. .....-+.-.++.|++.||+.|+++--. ... +|+|+.. ..|.++ T Consensus 277 ~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~i~P~~~~ie~~l~~~L~~~~~~-----~~~--~~~fd~~~l~~~d~~~ 349 (413) T protein:vir:48 277 RLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLTRIEQRINTGLVRESKQ-----GKF--YAKFNAGALLRGDMKS 349 (413) T ss_pred HHhCCCHHHhCCCcCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcccc-----CCe--EEEEechhhhccCHHH Confidence 9976654 33333345666544333 3445567788889999999877764221 112 4566432 247788 Q ss_pred HHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccccccccc Q lcl|NC_021302. 388 TAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNA 457 (484) Q Consensus 388 ~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (484) .+++++++++.|+.. .+++|+.+|+|.-+.++....+. +........+....+..+++...+++ T Consensus 350 ~~~~~~~~~~~g~~T-----~NE~R~~~g~~p~~ggD~~~~~~-n~~~~~~~~~~~~~~~~~~~~~~~~~ 413 (413) T protein:vir:48 350 RFEAYATGINWGIYS-----PNDCRDLEDMNPRPGGDVYLTPM-NMTTSPSAGDDNGKKKESGDADKTAS 413 (413) T ss_pred HHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCcceeeccc-cccccccccccCCCCCCCCCccccCC Confidence 999999999999865 47899999998766555433221 11110001111111122222211111 No 27 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=99.82 E-value=3.8e-19 Score=121.55 Aligned_cols=394 Identities=11% Similarity=0.047 Sum_probs=233.6 Q ss_pred CCCCCCCccceee-eecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCC Q lcl|NC_021302. 1 MAPKTVAPRTERG-YVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNG 79 (484) Q Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~ 79 (484) ...+.+.+.+... .... + ++.. ... .+..+..+..++-+.|.+|+..+-..|.++++.|...+ T Consensus 8 f~r~~~~~~~~~~~~~~~---~------~~~~----~~~---~g~~v~~~~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~ 71 (414) T protein:vir:44 8 FQRKSDAPVTTPAELADA---I------GLSY----DTY---TGKQISSQRAMRLTAVFSCVRVLAESVGMLPCNLYHLN 71 (414) T ss_pred hccCccCcccchhhHhHh---h------ccCc----ccc---CCceechhhhhccHHHHHHHHHHHHHhccCceEEEEec Confidence 1111111111000 0000 0 0000 000 11112222223578999999999999999999985433 Q ss_pred CCHH-HH--HHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccc Q lcl|NC_021302. 80 ARPE-VV--EHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSS 155 (484) Q Consensus 80 ~~~e-~~--~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~ 155 (484) ++.+ .+ .-+...|. .+......+.++++.++ +.+.+|-+++.++. ++| .+..|.+++|.+ T Consensus 72 ~~~~~~~~~~~~~~lL~------------~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~--~~g--~~~~L~~l~~~~ 135 (414) T protein:vir:44 72 GSLKQRATGERLHKLIS------------THPNGYMTPQEFWELVVTCLCLRGNFYAYKVK--AFG--EVAELLPVDPGC 135 (414) T ss_pred CCceeecccchHHHHHH------------hhcccCCCHHHHHHHHHHHHhhcCCeEEEEEe--CCC--cEEEEEEEcCce Confidence 2211 10 11111111 11122335677777776 57789999988753 334 367899999988 Q ss_pred eeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 156 IAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIE 235 (484) Q Consensus 156 ~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w 235 (484) +. ...+.++.+++..+.. .+....+++...+++++. ..+.++|.|.+..+....-.-....++. T Consensus 136 v~-~~~~~~~~~~y~~~~~--------------~g~~~~~~~~evih~~~~-~~d~~~G~s~i~~~~~~i~~~~~~~~~~ 199 (414) T protein:vir:44 136 VV-PKLNSSWEPVYQVTFP--------------DGSTDVLSQEDIWHVRTL-TLDGLVGLNPIAYAREAISLAAATEEHG 199 (414) T ss_pred EE-EEECCCCcEEEEEEec--------------CceEEEEccccEEEecCC-CCCCcccccHHHHHHHHHHHHHHHHHHH Confidence 74 4566667665443322 233456788888888765 4456899999999987776767777777 Q ss_pred HHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCC-ce--EEEccCCceEEEecccCCchhHHHHHHHHHHHHH Q lcl|NC_021302. 236 AAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGE-SA--GLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMA 312 (484) Q Consensus 236 ~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~-~a--~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Is 312 (484) ..|... .+.|-.+.+.+...++++.+++.+.+.+...|. ++ .++++.|++++-++.+.....|.+..++...+|+ T Consensus 200 ~~~f~n--g~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia 277 (414) T protein:vir:44 200 ARLFSN--GAVTSGVLRTEQTLSDQAYERLKKDFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEIC 277 (414) T ss_pred HHHHhc--cCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChHHHHHHHHHHHHHHHHH Confidence 777775 367877777777888898999998887776542 22 5788999998888766555678888999999999 Q ss_pred HHHhhhhh-cccccccchhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC--C-CCcHHH Q lcl|NC_021302. 313 LVALAHFL-NLDGKGGSYALASVQADT-FVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE--I-GSRQDA 387 (484) Q Consensus 313 k~ilGqtl-t~~~~gGs~A~~evh~~v-~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~--~-~~~~~~ 387 (484) +++.-..- ....++++++-.+-+... ...-+.-.++.|+..||+.|++.-- .... .++|+. . ..|.++ T Consensus 278 ~~fgVpp~~l~~~~~~t~~n~e~~~~~~~~~~l~P~~~~ie~~ln~~L~~~~~-----~~~~--~i~fd~~~ll~~d~~~ 350 (414) T protein:vir:44 278 RLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLTRIEQRINTGLVRKSK-----QGVF--YAKFNAGALLRGDMKS 350 (414) T ss_pred HHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccc-----cCce--EEEEechhhhccCHHH Confidence 99766542 223334566655544443 4557778888899999887765311 1111 455643 2 247788 Q ss_pred HHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccccccccccccc Q lcl|NC_021302. 388 TAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARK 462 (484) Q Consensus 388 ~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 462 (484) ++++++++.+.|+.. .+++|+.+|+|.-+.++....+. +... .+.... ...+.+......+..+ T Consensus 351 ~~~~~~~~~~~G~~t-----~NE~R~~~gl~p~~ggD~~~~~~-n~~~-~~~~~~----~~~~~~~~~~~d~~~~ 414 (414) T protein:vir:44 351 RFEAYATGINWGIYS-----PNDCRDLEDMNPRPGGDVYLTPM-NMTT-KPSDGS----KAGKQKDNANADETTS 414 (414) T ss_pred HHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCcceecccc-cccc-cCCccc----cCCCCCCCCCCCCCCC Confidence 999999999999864 47899999998765555433221 1100 000000 0000001111111111 No 28 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=99.81 E-value=1.8e-18 Score=117.85 Aligned_cols=407 Identities=12% Similarity=0.030 Sum_probs=238.6 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) ..+++..+..+. +. .+.....++.... ..+.+..+...+.+.|.+|+..+-..|.++++.|.-.++ T Consensus 13 ~~~r~~~~~~~~---~~---~~~~~~~~~g~~~--------~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~ 78 (432) T protein:vir:10 13 FEKRQTSQVIEL---NK---DDEKLLEWLGISP--------STISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDE 78 (432) T ss_pred ccccCccccccc---CC---chHHHHHHhCCCc--------CccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecC Confidence 112221111110 10 0111111111010 111222222235789999999999999999999843322 Q ss_pred CH--HHHH-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccce Q lcl|NC_021302. 81 RP--EVVE-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSI 156 (484) Q Consensus 81 ~~--e~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~ 156 (484) +. ++.+ .+...|.. +-....++.++++.++ +.+.+|-+.+++++... | .+..|.+++|.++ T Consensus 79 ~~~~~~~~~~l~~lL~~------------~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~-G--~~~~L~~i~~~~v 143 (432) T protein:vir:10 79 YGIQRGTKHYLNNLLRL------------RPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRK-G--KVQALWPIDASKV 143 (432) T ss_pred CceeeccccHHHHHHHh------------hccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEEEEEEcCcee Confidence 21 1111 11122210 1112345777888876 56889999999987543 3 3678999999887 Q ss_pred eeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 157 AYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEA 236 (484) Q Consensus 157 ~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~ 236 (484) . ...++++.+.. .....+....++....+|+...+++++....+..+|.|.+..+....-.-....++-. T Consensus 144 ~-v~~d~~~~~~~---------~~~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 213 (432) T protein:vir:10 144 T-VYIDDVGLLNS---------KTKMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFIN 213 (432) T ss_pred E-EEEcCcccccc---------cceEEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 5 33444333211 0011222334445567889998888876666778899999999888877788888888 Q ss_pred HHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCC---ceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHH Q lcl|NC_021302. 237 AAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGE---SAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMAL 313 (484) Q Consensus 237 ~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~---~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk 313 (484) .+... .+.|-.+.+.+...++++.+++.+.+.++..|. ...++++.|++++-++.+.....|.+..++..++|++ T Consensus 214 ~~~~n--g~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~ 291 (432) T protein:vir:10 214 NFYKQ--GLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIAT 291 (432) T ss_pred HHHhc--cCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHH Confidence 88875 366766667777788888899999988876552 2467899999998888766666788888999999999 Q ss_pred HHhhhhhc-ccccccchhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC--C-CCcHHHH Q lcl|NC_021302. 314 VALAHFLN-LDGKGGSYALASVQA-DTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE--I-GSRQDAT 388 (484) Q Consensus 314 ~ilGqtlt-~~~~gGs~A~~evh~-~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~--~-~~~~~~~ 388 (484) ++.-..-. ...+.|+++-.+-+. .....-+.-.++.|+..||+.|+..-- +. .+. +|+|+. . ..|.++. T Consensus 292 ~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~-~~---~g~--~~~fd~~~l~~~d~~~~ 365 (432) T protein:vir:10 292 AFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSE-LD---KGF--YSKFNVDAILRADIKTR 365 (432) T ss_pred HhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhh-cC---CCc--EEEeechhhhcCCHHHH Confidence 97665433 233345665444443 344566788889999999987775311 11 112 455543 2 2477889 Q ss_pred HHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcC-CCccccCCCCcccccccccccccccc Q lcl|NC_021302. 389 AAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQ-DEPETDEPALPNTSGTTSTTNAPQAR 461 (484) Q Consensus 389 ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (484) +++++++.+.|+.. .+++|+.+|+|.-+.++....+. +-.. ..............++ ....+.+-- T Consensus 366 ~~~~~~~~~~G~~t-----~NE~R~~~g~~pi~ggD~~~~~~-n~~~~~~~~~~~~k~~~~~~~-~~~~~~~~~ 432 (432) T protein:vir:10 366 YEAYRTGIQGGFLK-----PNEARSKEDLPPEAGGDRLLVNG-NMLPIDMAGQAYLKGGDTNGE-VSKEGNEGN 432 (432) T ss_pred HHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCCCeEeecc-cccchhhccccccCCCCCCCC-CCCCCCCCC Confidence 99999999999864 47899999998655554433221 1000 0000000000000000 000000000 No 29 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=99.81 E-value=1.8e-18 Score=117.85 Aligned_cols=407 Identities=12% Similarity=0.030 Sum_probs=238.6 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) ..+++..+..+. +. .+.....++.... ..+.+..+...+.+.|.+|+..+-..|.++++.|.-.++ T Consensus 13 ~~~r~~~~~~~~---~~---~~~~~~~~~g~~~--------~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~ 78 (432) T protein:vir:10 13 FEKRQTSQVIEL---NK---DDEKLLEWLGISP--------STISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDE 78 (432) T ss_pred ccccCccccccc---CC---chHHHHHHhCCCc--------CccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecC Confidence 112221111110 10 0111111111010 111222222235789999999999999999999843322 Q ss_pred CH--HHHH-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccce Q lcl|NC_021302. 81 RP--EVVE-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSI 156 (484) Q Consensus 81 ~~--e~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~ 156 (484) +. ++.+ .+...|.. +-....++.++++.++ +.+.+|-+.+++++... | .+..|.+++|.++ T Consensus 79 ~~~~~~~~~~l~~lL~~------------~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~-G--~~~~L~~i~~~~v 143 (432) T protein:vir:10 79 YGIQRGTKHYLNNLLRL------------RPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRK-G--KVQALWPIDASKV 143 (432) T ss_pred CceeeccccHHHHHHHh------------hccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEEEEEEcCcee Confidence 21 1111 11122210 1112345777888876 56889999999987543 3 3678999999887 Q ss_pred eeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 157 AYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEA 236 (484) Q Consensus 157 ~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~ 236 (484) . ...++++.+.. .....+....++....+|+...+++++....+..+|.|.+..+....-.-....++-. T Consensus 144 ~-v~~d~~~~~~~---------~~~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 213 (432) T protein:vir:10 144 T-VYIDDVGLLNS---------KTKMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFIN 213 (432) T ss_pred E-EEEcCcccccc---------cceEEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 5 33444333211 0011222334445567889998888876666778899999999888877788888888 Q ss_pred HHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCC---ceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHH Q lcl|NC_021302. 237 AAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGE---SAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMAL 313 (484) Q Consensus 237 ~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~---~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk 313 (484) .+... .+.|-.+.+.+...++++.+++.+.+.++..|. ...++++.|++++-++.+.....|.+..++..++|++ T Consensus 214 ~~~~n--g~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~ 291 (432) T protein:vir:10 214 NFYKQ--GLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIAT 291 (432) T ss_pred HHHhc--cCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHH Confidence 88875 366766667777788888899999988876552 2467899999998888766666788888999999999 Q ss_pred HHhhhhhc-ccccccchhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC--C-CCcHHHH Q lcl|NC_021302. 314 VALAHFLN-LDGKGGSYALASVQA-DTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE--I-GSRQDAT 388 (484) Q Consensus 314 ~ilGqtlt-~~~~gGs~A~~evh~-~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~--~-~~~~~~~ 388 (484) ++.-..-. ...+.|+++-.+-+. .....-+.-.++.|+..||+.|+..-- +. .+. +|+|+. . ..|.++. T Consensus 292 ~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~-~~---~g~--~~~fd~~~l~~~d~~~~ 365 (432) T protein:vir:10 292 AFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSE-LD---KGF--YSKFNVDAILRADIKTR 365 (432) T ss_pred HhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhh-cC---CCc--EEEeechhhhcCCHHHH Confidence 97665433 233345665444443 344566788889999999987775311 11 112 455543 2 2477889 Q ss_pred HHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcC-CCccccCCCCcccccccccccccccc Q lcl|NC_021302. 389 AAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQ-DEPETDEPALPNTSGTTSTTNAPQAR 461 (484) Q Consensus 389 ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (484) +++++++.+.|+.. .+++|+.+|+|.-+.++....+. +-.. ..............++ ....+.+-- T Consensus 366 ~~~~~~~~~~G~~t-----~NE~R~~~g~~pi~ggD~~~~~~-n~~~~~~~~~~~~k~~~~~~~-~~~~~~~~~ 432 (432) T protein:vir:10 366 YEAYRTGIQGGFLK-----PNEARSKEDLPPEAGGDRLLVNG-NMLPIDMAGQAYLKGGDTNGE-VSKEGNEGN 432 (432) T ss_pred HHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCCCeEeecc-cccchhhccccccCCCCCCCC-CCCCCCCCC Confidence 99999999999864 47899999998655554433221 1000 0000000000000000 000000000 No 30 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=99.81 E-value=1.8e-18 Score=117.85 Aligned_cols=407 Identities=12% Similarity=0.030 Sum_probs=238.6 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) ..+++..+..+. +. .+.....++.... ..+.+..+...+.+.|.+|+..+-..|.++++.|.-.++ T Consensus 13 ~~~r~~~~~~~~---~~---~~~~~~~~~g~~~--------~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~ 78 (432) T protein:vir:10 13 FEKRQTSQVIEL---NK---DDEKLLEWLGISP--------STISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDE 78 (432) T ss_pred ccccCccccccc---CC---chHHHHHHhCCCc--------CccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecC Confidence 112221111110 10 0111111111010 111222222235789999999999999999999843322 Q ss_pred CH--HHHH-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccce Q lcl|NC_021302. 81 RP--EVVE-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSI 156 (484) Q Consensus 81 ~~--e~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~ 156 (484) +. ++.+ .+...|.. +-....++.++++.++ +.+.+|-+.+++++... | .+..|.+++|.++ T Consensus 79 ~~~~~~~~~~l~~lL~~------------~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~-G--~~~~L~~i~~~~v 143 (432) T protein:vir:10 79 YGIQRGTKHYLNNLLRL------------RPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRK-G--KVQALWPIDASKV 143 (432) T ss_pred CceeeccccHHHHHHHh------------hccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEEEEEEcCcee Confidence 21 1111 11122210 1112345777888876 56889999999987543 3 3678999999887 Q ss_pred eeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 157 AYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEA 236 (484) Q Consensus 157 ~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~ 236 (484) . ...++++.+.. .....+....++....+|+...+++++....+..+|.|.+..+....-.-....++-. T Consensus 144 ~-v~~d~~~~~~~---------~~~~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~ 213 (432) T protein:vir:10 144 T-VYIDDVGLLNS---------KTKMWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFIN 213 (432) T ss_pred E-EEEcCcccccc---------cceEEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 5 33444333211 0011222334445567889998888876666778899999999888877788888888 Q ss_pred HHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCC---ceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHH Q lcl|NC_021302. 237 AAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGE---SAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMAL 313 (484) Q Consensus 237 ~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~---~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk 313 (484) .+... .+.|-.+.+.+...++++.+++.+.+.++..|. ...++++.|++++-++.+.....|.+..++..++|++ T Consensus 214 ~~~~n--g~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~ 291 (432) T protein:vir:10 214 NFYKQ--GLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIAT 291 (432) T ss_pred HHHhc--cCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHH Confidence 88875 366766667777788888899999988876552 2467899999998888766666788888999999999 Q ss_pred HHhhhhhc-ccccccchhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC--C-CCcHHHH Q lcl|NC_021302. 314 VALAHFLN-LDGKGGSYALASVQA-DTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE--I-GSRQDAT 388 (484) Q Consensus 314 ~ilGqtlt-~~~~gGs~A~~evh~-~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~--~-~~~~~~~ 388 (484) ++.-..-. ...+.|+++-.+-+. .....-+.-.++.|+..||+.|+..-- +. .+. +|+|+. . ..|.++. T Consensus 292 ~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~-~~---~g~--~~~fd~~~l~~~d~~~~ 365 (432) T protein:vir:10 292 AFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQATLTMYEQEMTYKLFLDSE-LD---KGF--YSKFNVDAILRADIKTR 365 (432) T ss_pred HhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhh-cC---CCc--EEEeechhhhcCCHHHH Confidence 97665433 233345665444443 344566788889999999987775311 11 112 455543 2 2477889 Q ss_pred HHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcC-CCccccCCCCcccccccccccccccc Q lcl|NC_021302. 389 AAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQ-DEPETDEPALPNTSGTTSTTNAPQAR 461 (484) Q Consensus 389 ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (484) +++++++.+.|+.. .+++|+.+|+|.-+.++....+. +-.. ..............++ ....+.+-- T Consensus 366 ~~~~~~~~~~G~~t-----~NE~R~~~g~~pi~ggD~~~~~~-n~~~~~~~~~~~~k~~~~~~~-~~~~~~~~~ 432 (432) T protein:vir:10 366 YEAYRTGIQGGFLK-----PNEARSKEDLPPEAGGDRLLVNG-NMLPIDMAGQAYLKGGDTNGE-VSKEGNEGN 432 (432) T ss_pred HHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCCCeEeecc-cccchhhccccccCCCCCCCC-CCCCCCCCC Confidence 99999999999864 47899999998655554433221 1000 0000000000000000 000000000 No 31 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=99.81 E-value=1.6e-18 Score=118.16 Aligned_cols=391 Identities=15% Similarity=0.066 Sum_probs=236.6 Q ss_pred CC--------CCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCC Q lcl|NC_021302. 1 MA--------PKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTD 72 (484) Q Consensus 1 ~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~ 72 (484) |. ++...+. .....+. +..++ +.. ...-+..+ +-+.|.+|+..+-..|.+++ T Consensus 1 MG~~~~~~~~~~~~~~~--~~~~~~~------~~~~~----------g~~-~~~~~~al-~~~~V~~~v~~Ia~~iA~lp 60 (411) T protein:vir:81 1 MGWWSRLTRFFRPRNET--VDMTNPL------LLQWL----------GVD-PDTPRNQL-SEATYFACLKILSESLGKLP 60 (411) T ss_pred CchHHHHHhhccCcccc--cccchHH------HHHHh----------cCc-ccChhhhh-ccHHHHHHHHHHHHhHhhCc Confidence 00 0000000 0000000 00000 000 00112344 46889999999999999999 Q ss_pred cEEecCCCCH--HHH-HHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeee Q lcl|NC_021302. 73 WRIRPNGARP--EVV-EHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRL 148 (484) Q Consensus 73 ~~v~p~~~~~--e~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l 148 (484) |++...+++. ++. ..+...|.. +-.....+.++++.++ +.+.+|-+.+++++. +|. +..| T Consensus 61 ~~~~~~~~~~~~~~~~~~l~~lL~~------------~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~--~g~--~~~l 124 (411) T protein:vir:81 61 LKMYQKTERGIVKSDREELYNLLKL------------RPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS--GPQ--LQAL 124 (411) T ss_pred eeEEEecCCceeeecccHHHHHHhh------------ccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec--CCc--eEEE Confidence 9995433221 111 111122210 1112335778888886 578899999998764 443 5778 Q ss_pred eeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHH Q lcl|NC_021302. 149 APRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLK 228 (484) Q Consensus 149 ~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K 228 (484) .+.+|.++. +..++++.+.... .....+.....+....++....|++++....+..+|.|.+..+....-.- T Consensus 125 ~~l~~~~v~-~~~~~~~~~~~~~-------~~~~~~~~~~~g~~~~~~~~eiih~k~~~~~~~~~G~s~~~~~~~~i~~~ 196 (411) T protein:vir:81 125 WILPSQYVT-IVVDDRGLLGEKN-------AIWYRYNDPYDGKMYVFRNDEILHFKTSVTFDGITGLSVRDVLKHTVDGA 196 (411) T ss_pred EEECCceEE-EEEcCcccccccc-------eEEEEEEecCCceEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHH Confidence 899998875 3444444321000 00011112224455568889988888776677789999999999888888 Q ss_pred HHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCc---eEEEccCCceEEEecccCCchhHHHHHH Q lcl|NC_021302. 229 DELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGES---AGLALTAGEEAGILSPNGTPLDPRRAIE 305 (484) Q Consensus 229 ~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~---a~~vip~~~~ie~~~~~~~~~~~~~li~ 305 (484) ....++...+... .+.|-.+.+.+...++++++++.+.+.++.+|.. ..++++.|++++-++.+.....|.+..+ T Consensus 197 ~~~~~~~~~~f~n--g~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~ 274 (411) T protein:vir:81 197 LESQKFMNNLYKT--GLTGKAVLEYTGDLNQEARDRLVKGFEQFANGSKNAGKIIPVPLGMKLVPLDIKLTDSQFFELKK 274 (411) T ss_pred HHHHHHHHHHHhc--cCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCCHHHHHHHHHHH Confidence 8888888888875 3678776677777888999999999998876632 3578899999888876655566888889 Q ss_pred HHHHHHHHHHhhhh-hcccccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC-- Q lcl|NC_021302. 306 YHDHQMALVALAHF-LNLDGKGGSYALASVQAD-TFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI-- 381 (484) Q Consensus 306 ~~d~~Isk~ilGqt-lt~~~~gGs~A~~evh~~-v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~-- 381 (484) +..++|++++.-.. +....++++++..+-+.. ....-+.-.++.|++.||+.|+..-. +. .+ .+|+|+.. T Consensus 275 ~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~l~P~~~~ie~~l~~~ll~~~~-~~---~~--~~~~fd~~~l 348 (411) T protein:vir:81 275 YTALQIAAAFGIKPNQINDYEKSSYASAEAQNLAFYVDTLLYVLKQYEEEITYKILSNDL-IS---QG--HYFKFNVNVI 348 (411) T ss_pred HHHHHHHHHhCCCHHHhCCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhh-cC---CC--cEEEeechhh Confidence 99999999976664 333444567776655443 34455677888888888887765411 11 11 25666532 Q ss_pred -CCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccc Q lcl|NC_021302. 382 -GSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTT 452 (484) Q Consensus 382 -~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (484) ..|.++++++++++.+.|+.. .+++|+.+|+|.-+.++....+. +- . |-..........|+. T Consensus 349 l~~d~~~~~~~~~~~~~~g~~t-----~NE~R~~~gl~p~~ggD~~~~~~-n~-~--pl~~~~~~~~kgGd~ 411 (411) T protein:vir:81 349 LRADIKTQMDSLSTAVQNGIMT-----PNEARDYLDMPADDYGNNLMANG-NY-I--PLSMLGANYGKGGDS 411 (411) T ss_pred hccCHHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCCCeeeecc-Cc-c--chhhhhhhhccCCCC Confidence 357788999999999999764 47899999998654443322111 00 0 001000100111111 No 32 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=99.81 E-value=1.9e-18 Score=117.73 Aligned_cols=408 Identities=12% Similarity=0.033 Sum_probs=225.8 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) |.= +........+.|...+...-..+..++. -+. ...+ +=+.|.+|+..+...|.++++++.-.+. T Consensus 1 m~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~g~-~~~--~~Al-~~~~V~~cv~~ia~~iA~lp~~~~~~~~ 66 (417) T protein:vir:38 1 MKL----------FRGLATEVDPHWADHLLDSGVIPSFRGG-YLG--ISAL-RNSDVLTAVSIVSGDVSRFPLVITDSST 66 (417) T ss_pred Ccc----------ccccccCCCccchhhhcccccccccCCc-eec--hhhc-ccHHHHHHHHHHHHhhccCeeEEEEcCC Confidence 100 0000000111111110000000111111 011 1233 4688999999999999999999965444 Q ss_pred CHHHHH-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceee Q lcl|NC_021302. 81 RPEVVE-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAY 158 (484) Q Consensus 81 ~~e~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~ 158 (484) +....+ -+...|. .+-.....+.++++.++ +.+.+|.+..+++....+| .+..|.+.+|.++.. T Consensus 67 ~~~~~~~~~~~lL~------------~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~--~~~~l~~l~p~~v~v 132 (417) T protein:vir:38 67 DEVIDLANIEYLMN------------TKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITN--EPAMFEFYAPSQTQV 132 (417) T ss_pred cceeccchHHHHHh------------cccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCC--EEEEEEEeCCceEEE Confidence 322111 1111111 11122335667777775 5788999999998654443 357788898888753 Q ss_pred eeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 159 WNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAA 238 (484) Q Consensus 159 ~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f 238 (484) ..++++++.+.-. ...+.....+|...+|++++.+ .+..+|.|.+..+....-.-....++...| T Consensus 133 -~~~~~~~~~y~~~-------------~~~~~~~~~~~~~dviH~r~~~-~d~~~G~s~l~~~~~~i~~~~~~~~~~~~~ 197 (417) T protein:vir:38 133 -DTSDPDNIIYRFT-------------PYNSSMQKVCGFEDVIHWKFFS-YDTIMGRSPLLSLGDEIGLQESGVSTLQKF 197 (417) T ss_pred -EEcCCCeEEEEEE-------------EcCCcEEEEecCcceEEecCCC-CCCccccCHHHHHHHHHHHHHHHHHHHHHH Confidence 4455565543221 1122233456777888888764 344789999999988777778888888888 Q ss_pred HHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCce--EEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHh Q lcl|NC_021302. 239 IRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESA--GLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVAL 316 (484) Q Consensus 239 ~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a--~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~il 316 (484) ... .+.|-.+.+++...++++.+++.+.+.+..+|.++ .++++.|++++-++.+.....|.+..++..++|++++. T Consensus 198 f~n--g~~p~~il~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fg 275 (417) T protein:vir:38 198 FKS--GLKGSIIKAKESRLSAEARQKIREDFERAQAGADAGSPIIVDATMDYQPLEVDTNVLNLINSNNYSTAQIAKALR 275 (417) T ss_pred Hhc--cCCCcEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceeccCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhC Confidence 874 36786777777888889999999999888766443 56789999988887665555688888889999999865 Q ss_pred hhhhcccccccchhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCCCcHHHHHHHHHHH Q lcl|NC_021302. 317 AHFLNLDGKGGSYALA-SVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIGSRQDATAAALQML 395 (484) Q Consensus 317 Gqtlt~~~~gGs~A~~-evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~~~~~~~ae~~~~L 395 (484) -..-..+ ..++++.. +........-+.-.++.|++.||+.|+.+.... . -.|+|+... ........++++ T Consensus 276 VPp~~lg-~~~~~s~~e~~~~~~~~~tl~P~~~~ie~~l~~~Ll~~~~~~-----~--~~~~fd~~~-l~~~~~~~~~~~ 346 (417) T protein:vir:38 276 VPAYRLA-QNSPNQSVKQLADDYIRNDLPFYFEPITSEFELKLLDDAQRH-----Q--YCIGFDTKS-VNGLPIADVNTA 346 (417) T ss_pred CCHHHhC-CCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhcChhhcc-----c--ceEEechhh-hhHHHHHHHHHH Confidence 4432222 22333322 223344455778888889999998777643221 1 257776432 233334557788 Q ss_pred HhcCcccCCcccHHHHHHHhCCCCCCCCcc-cccccCC---CcCCCccccCCCCccccccccccccccccccccccch Q lcl|NC_021302. 396 VNAGLLTPDPRLEAFLRDAAGLPGPDPDAD-DDESTAD---TGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSP 469 (484) Q Consensus 396 ~~~G~~~~~~~~~~~i~e~~glp~p~~~e~-~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (484) ++.|+.. .+++|+.+|+|.-+++.. ....+.+ ........ .+...+.+|............. .+..+ T Consensus 347 ~~~G~~T-----~NE~R~~~gl~pi~~g~~d~~~~~~n~~~~d~~~~~~-~~~~~~~kgg~~~~~~~~~~~~-~~~~~ 417 (417) T protein:vir:38 347 VNGGLWT-----GNEGRAELGKKPLKDPNMDRIQSTLNTVFLDQKEAYQ-AEHAAELKGGDTNAKGNQNGSG-TNANS 417 (417) T ss_pred HhCCCcC-----HHHHHHHhCCCCCCCCCCCeeeecccccccccccccc-cccccccCCCCCCCCCCCcCCC-CcCCC Confidence 8999754 478999999985444322 1110000 00000000 0000000110000000000000 00000 No 33 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=99.81 E-value=9.5e-19 Score=119.40 Aligned_cols=400 Identities=10% Similarity=0.021 Sum_probs=230.1 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) +++..-..+.++ ... +....+..++..... . .+..+=.+...+-+.|.+|+..+...|.+++|+|.-.+. T Consensus 3 ~~~~~~~~~~~~---s~~-~~w~~~~~~~~~~~~----~--~g~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~~~~~~~ 72 (421) T protein:vir:10 3 IPQMFEGKKRSV---SGG-GFWEAMLGGVRSSHS----K--AGVMITPETALALSAVRACVTLLAESVAQLPVELYRRDK 72 (421) T ss_pred Ccchhccccccc---Ccc-hhhHHHhhhhccCcc----c--CCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcC Confidence 222222222111 000 110111111111100 0 111122223345789999999999999999999843222 Q ss_pred CHH---HHH-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccc Q lcl|NC_021302. 81 RPE---VVE-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSS 155 (484) Q Consensus 81 ~~e---~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~ 155 (484) +.. +.+ -+...|. .+.....+..++++.++ +.+.+|-+.+++++... ..+..|.+++|.+ T Consensus 73 ~g~~~~~~~~~l~~lL~------------~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~---G~~~~L~~l~~~~ 137 (421) T protein:vir:10 73 NGGRQRATDHPIYDLIH------------SQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGK---GYPKELIPINPKK 137 (421) T ss_pred CCceeecccchHHHHHh------------hcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC---CcEEEEEEecCce Confidence 211 111 0111111 01112334677887765 78889999999886433 3477899999988 Q ss_pred eeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 156 IAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIE 235 (484) Q Consensus 156 ~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w 235 (484) +.. ..+.+|.+.+.. ...+..+|.+..++.++.. .+..+|.|.+..+....-.-....++. T Consensus 138 v~v-~~~~~g~~~y~~-----------------~~~g~~~~~~eiih~~~~~-~d~~~G~spi~~~~~~i~~~~~~~~~~ 198 (421) T protein:vir:10 138 VIV-LKGPDGMPYYEI-----------------PEIGETLPMRMMHHVKVFS-LDGYIGSSPIQTNADVLGLNLAVEEHA 198 (421) T ss_pred EEE-EECCCceEEEEE-----------------cCCCcEEchhhEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHH Confidence 753 345555543221 1123356777777776654 455899999999988777777777888 Q ss_pred HHHHHHhcCCcceEEecCCCC----CCHHHHHHHHHHHHHHhcCC---ceEEEccCCceEEEecccCCchhHHHHHHHHH Q lcl|NC_021302. 236 AAAIRRHGIGVPYLKGNEADS----EDDDRMDELLEIASNYSGGE---SAGLALTAGEEAGILSPNGTPLDPRRAIEYHD 308 (484) Q Consensus 236 ~~f~Er~~~G~P~~~gk~~~~----~~~~~~~~l~~~l~~~~~g~---~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d 308 (484) ..+...- +.|-.+.+++.. .++++++++.+.+.+..+|. ...++++.|++++-++.+.....|.+..++.. T Consensus 199 ~~~f~ng--~~~~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~ 276 (421) T protein:vir:10 199 SAVFRRG--ATMSGVIERPKEAPAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQEGMSYKQMSQDNEKAQLLQSRQWGV 276 (421) T ss_pred HHHHhcC--CCccEEEEecCccCccCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEecCCChhHHHHHHHHHHhH Confidence 8888752 556444444332 26788888888888876552 24688999999988887766667888899999 Q ss_pred HHHHHHHhhhh-hcccccccchhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC---CC Q lcl|NC_021302. 309 HQMALVALAHF-LNLDGKGGSYALASVQA-DTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI---GS 383 (484) Q Consensus 309 ~~Isk~ilGqt-lt~~~~gGs~A~~evh~-~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~---~~ 383 (484) ++|++++.-.. +....++++++-.+-+. .....-+.-.++.|+..||+.|+.+-- . .. -.|+|+.. .. T Consensus 277 ~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~----~-~~--~~v~fd~~~l~~~ 349 (421) T protein:vir:10 277 EEVCRLYKIPPHMVQMLAKATNNNIEHQGLQFVMYTLLAWLKRHEGALQRDLLLPSE----R-RD--LYIEFNVSGLLRG 349 (421) T ss_pred HHHHHHhCCCHHHcCCCcCCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCccc----c-CC--eEEEEechhhhcc Confidence 99999976654 33333445555444333 444556778888888888886665311 1 11 24566532 25 Q ss_pred cHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCccccccc--CCCcCCCccccCCCCcccccccccccccccc Q lcl|NC_021302. 384 RQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDEST--ADTGQDEPETDEPALPNTSGTTSTTNAPQAR 461 (484) Q Consensus 384 ~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (484) |.+.++++++++++.|+.. .+++|+.+|+|..+.++....+. ...+...+....+... . ........ T Consensus 350 d~~~~~~~~~~~~~~G~~T-----~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~--~----~~e~d~~~ 418 (421) T protein:vir:10 350 DQKSRYESYALGRQWGWLS-----VNDIRRMENLPPIAGGDKYLTPLNMVDSAQIIPGDKKPTAQ--Q----MAEIDTIL 418 (421) T ss_pred CHHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCcceeeeccccccccccccCCCCcccc--c----Cccccccc Confidence 7888999999999999754 48899999998766665543211 0011111110000000 0 00000111 Q ss_pred ccc Q lcl|NC_021302. 462 KRP 464 (484) Q Consensus 462 ~~~ 464 (484) .+. T Consensus 419 ~~~ 421 (421) T protein:vir:10 419 SRT 421 (421) T ss_pred ccC Confidence 110 No 34 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=99.81 E-value=3.1e-18 Score=116.57 Aligned_cols=396 Identities=13% Similarity=0.061 Sum_probs=232.6 Q ss_pred CCCCCCCccceeeeeccccc-ch-hhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAG-FG-TFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPN 78 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~ 78 (484) +.|+.++........+.... .+ ..+...+... ...+ ..+. .+..+ +-+.|.+|+..+-..|.++++.|... T Consensus 21 ~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~----~~~g-~~v~-~~~al-~~~~V~~ci~~Ia~~iA~lp~~v~~~ 93 (431) T protein:vir:10 21 VEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRG----ELNG-GTGR-ETRAL-RNMAVLRCVTLISGTIGMLPMNLISS 93 (431) T ss_pred cccccccccccccccccccccccchHHHHhhccC----ccCc-ceec-hhhhh-ccHHHHHHHHHHHHhhccCceEEEEe Confidence 22222222111111111000 00 0011111110 0111 1111 22344 46889999999999999999998443 Q ss_pred CCCHHH-H-HHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCeeeeeeeeeeCccc Q lcl|NC_021302. 79 GARPEV-V-EHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSS 155 (484) Q Consensus 79 ~~~~e~-~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~ 155 (484) .+..+. . ..+...|.. +-.......++++.+ .+.+.+|-+++++++. +|. +..|.+.+|.+ T Consensus 94 ~~~~~~~~~~~~~~lL~~------------~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~--~g~--~~~L~pl~~~~ 157 (431) T protein:vir:10 94 DDSKQVLTDDPAHRLLKY------------KPNDWQTPMEFKSLMQLRALLDGESMARIVWS--GNR--PIRLIPMDRGS 157 (431) T ss_pred cCceeeeccchHHHHHhh------------ccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEc--CCc--eEEEEEEcCce Confidence 222111 1 111111110 111223456677665 4678899999999874 343 56788999888 Q ss_pred eeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 156 IAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIE 235 (484) Q Consensus 156 ~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w 235 (484) +. ...+.++.+.+... ...+....++.+..+++++.. .+.++|.|.+..+....-.-....++. T Consensus 158 v~-~~~~~~~~~~y~~~--------------~~~g~~~~~~~~dViHir~~~-~dg~~G~spi~~~~~~i~~~~~~~~~~ 221 (431) T protein:vir:10 158 AK-GRLTSTWQIVYDYT--------------TPTGDKIELPAREVFHLRDLS-IDGVSGVSRVKLSGNALELAEQAERAA 221 (431) T ss_pred eE-EEEcCCCeEEEEEE--------------eCCceEEEEchhhEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHH Confidence 75 35556666543322 123344568888888877654 455899999999988887777788888 Q ss_pred HHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCC-c--eEEEccCCceEEEecccCCchhHHHHHHHHHHHHH Q lcl|NC_021302. 236 AAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGE-S--AGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMA 312 (484) Q Consensus 236 ~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~-~--a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Is 312 (484) ..|... .+.|-.+.+++...++++++++.+.+.+..+|. + ..++++.|++++-++.+.....|.+..++..++|+ T Consensus 222 ~~~f~n--g~~p~gil~~~~~ls~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~r~~~~~~Ia 299 (431) T protein:vir:10 222 SRTFRT--GVMAGGAIEVPKELSDNAYGRMKASVQENHTGSENAGSWMLLEEGATAKQFSNTAASAQQIENRNHQIEEVA 299 (431) T ss_pred HHHHhc--cCCccEEEecCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHhHHHHH Confidence 888874 367877778878889999999999998876552 2 34789999998888766555668888888899999 Q ss_pred HHHhhhhh-cccccccchhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC---CCCcHHH Q lcl|NC_021302. 313 LVALAHFL-NLDGKGGSYALASVQA-DTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE---IGSRQDA 387 (484) Q Consensus 313 k~ilGqtl-t~~~~gGs~A~~evh~-~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~---~~~~~~~ 387 (484) +++.-..- ....++++++-.+-.. .....-+.-.++.|+..||+.|++.-.. ... .|+|+. ...|.++ T Consensus 300 ~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~-----~~~--~~~fd~~~llr~d~~~ 372 (431) T protein:vir:10 300 RMYGVPRPLLMMDDTSWGSGIEQLAIFFIQYGLSHWFVSWEQAAARAFLPEKML-----GQR--QFKFNEGALLRGTLND 372 (431) T ss_pred HHhCCCHHHhCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhc-----CCc--eEEEechhhhccCHHH Confidence 99765442 2223344555444333 3334457778888999999877654111 112 455543 2357899 Q ss_pred HHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCC--CcccccccCCCcCCCccccCCCCcccc Q lcl|NC_021302. 388 TAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDP--DADDDESTADTGQDEPETDEPALPNTS 449 (484) Q Consensus 388 ~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~--~e~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (484) .++.+++++..|+..+ -...+++|+.+|+|.-++ ++....+. ..... ...+. +|... T Consensus 373 r~~~~~~~~~~G~~~g-~lT~NE~R~~~gl~p~~~~~gD~~~~p~-n~~~~-~~~~~--~p~~~ 431 (431) T protein:vir:10 373 QAAFFSKALGAGGQSP-WMKQNEVREMLDLPRADDPVADQLRNPM-TQKQK-GSGDE--PPATT 431 (431) T ss_pred HHHHHHHHHhcccccC-ccCHHHHHHHhCCCCCCCccccceeccc-ccccC-CCCCC--CCCCC Confidence 9999999999987422 135689999999986543 32222111 11000 00000 00000 No 35 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=99.80 E-value=2.3e-18 Score=117.31 Aligned_cols=401 Identities=13% Similarity=0.062 Sum_probs=229.7 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) .+|..++.....-...+.. +.....+... ...+..+-.+..++-+.|.+|+..+-..|.+++|.|--.+. T Consensus 17 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~--------~~~g~~v~~~~a~~~~aV~~~v~~Ia~~ia~lp~~~y~~~~ 86 (432) T protein:vir:97 17 FVPPDPVDIGGGQTFTPVN--ATARDLGIII--------SDTGAAVNADAIMRLDAVAACVKLVSQAVAAMPLMMYMRTP 86 (432) T ss_pred cCCccccccccccccccCc--hhhhhhcccc--------cccCcccchHhhhcchHHHHHHHHHHHhhccCceEEEEecC Confidence 2222211100000000000 0000000000 01122233333345789999999999999999999843222 Q ss_pred CH--HHHH-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccce Q lcl|NC_021302. 81 RP--EVVE-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSI 156 (484) Q Consensus 81 ~~--e~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~ 156 (484) +. ++.+ -+...|. .+-....+..++++.++ +.+.+|.+.+++++. +| .+..|.+++|.++ T Consensus 87 ~g~~~~~~~pl~~lL~------------~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~--~g--~~~~L~~l~p~~v 150 (432) T protein:vir:97 87 DGRKEAVNHPLYTLLL------------DGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT--DG--RIESLQYLANDRL 150 (432) T ss_pred CCcccccccHHHHHHH------------hcccccCCHHHHHHHHHHHHhhcCCeEEEEEec--CC--cEEEEEEEcCcce Confidence 21 1111 0111111 01112235667777765 678899999999874 45 3678889999887 Q ss_pred eeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 157 AYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEA 236 (484) Q Consensus 157 ~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~ 236 (484) . ...+.+|++.+..+ ..++....+|.+.+++.++.+.. ..+|.|.+..+....-.-....++-. T Consensus 151 ~-v~~~~~g~~~y~~~--------------~~~g~~~~~~~~~iih~r~~~~d-g~~G~spi~~~~~~i~~~~a~~~~~~ 214 (432) T protein:vir:97 151 T-ITTDTKGNTAYRYR--------------RTDGQMIDIPRQQIWKIMGYSLD-GENGLSAIRYGAQIFGTAIAAEAQAA 214 (432) T ss_pred E-EEEcCCCcEEEEEE--------------ecCceEEEEccccEEEecCcCCC-CcccccHHHHHHHHHHHHHHHHHHHH Confidence 5 45566776554332 12334456888888887765444 47999999999887767667777777 Q ss_pred HHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHh Q lcl|NC_021302. 237 AAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVAL 316 (484) Q Consensus 237 ~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~il 316 (484) .|... .+.|-.+.+.+...++++++++.+.+....+. ...++++.|++++-++.+.....|.+..++...+|++++. T Consensus 215 ~~f~n--g~~~~gil~~~~~l~~e~~~~~~~~~~~~~na-g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fg 291 (432) T protein:vir:97 215 RAFRN--GQLQSVYYQIDRFLTDDQYDSFSKKVSGSVEA-GRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFG 291 (432) T ss_pred HHHhc--cCCcceeEecCCCCCHHHHHHHHHHHhhhhcC-CCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhC Confidence 77774 36787777777788888888888887755332 2468899999998887766666788889999999999865 Q ss_pred hhh-hcccccccchhhH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC---CCcHHHH Q lcl|NC_021302. 317 AHF-LNLDGKGGSYALA----SVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI---GSRQDAT 388 (484) Q Consensus 317 Gqt-lt~~~~gGs~A~~----evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~---~~~~~~~ 388 (484) -.. +....+.|+++.+ +........-+.-.++.|+..||+.|+.+-- ... -.|+|+.. ..|.+++ T Consensus 292 VPp~~lg~~~~~t~~~~s~~e~~~~~f~~~tl~P~~~~ie~~ln~kLl~~~e------~~~-~~~~fd~~~llr~d~~~r 364 (432) T protein:vir:97 292 VPPSMIGHSSAGTTSWGSGIESQQLGFLTMTLSPWLRRIEQSIALNLLTPAE------RRR-YFADFDTSALLRADSAAR 364 (432) T ss_pred CCHHHcCCcCCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhccCccc------cCc-eEEEeechhhhccCHHHH Confidence 543 3333333444322 2222233446667778888888876665311 111 25666532 3577889 Q ss_pred HHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCC---ccccCCCCcccccccccccccccccc Q lcl|NC_021302. 389 AAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDE---PETDEPALPNTSGTTSTTNAPQARKR 463 (484) Q Consensus 389 ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~ 463 (484) ++++.++++.|+.. .+++|+.+|+|..+++....... ..-.+. .++..+.+. .+++..... .. +. T Consensus 365 ~~~~~~~~~~G~~T-----~NE~R~~~glpp~~g~~~~~~~~-~~~~pl~~~~~~~~~~~~--~~~~~~~~~-~~-~~ 432 (432) T protein:vir:97 365 SSYYSQLVNNGLMT-----RDEAREIEGLPKLGGNAAVLTVQ-SAMVPLDSIGLQASPEPA--SGLGNQQQD-KV-SK 432 (432) T ss_pred HHHHHHHHhCCCCC-----HHHHHHHhCCCCCCCCcceEeec-ccccchhhhcccCCCCCC--CCCCCcccc-cc-cC Confidence 99999999999764 47899999998765443322110 000000 000000000 000000000 00 00 No 36 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=99.80 E-value=2.3e-18 Score=117.31 Aligned_cols=440 Identities=11% Similarity=0.077 Sum_probs=216.2 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHH-------HHHHhcchHHHHHHHHHHHHhhCCCc Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTY-------TRMCREEARIASVLRAIGLPIRRTDW 73 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y-------~~m~~~D~~v~s~l~~r~~~v~~~~~ 73 (484) -+-..+..+...+.+.-..++++... +++.+ .+ +..+..| .-|..+=.+|++|+......+.+++| T Consensus 52 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~---~~--~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~ 124 (574) T protein:vir:80 52 NGKTTAYMQPIIGEMSVNPGYKTKPS--IRNSQ---DL--HKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGY 124 (574) T ss_pred hhhcccccchhhhhccccccccCcCc--cCCcc---cH--HHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCce Confidence 11112222222222222222222110 11111 11 1112222 11211223444444444445557899 Q ss_pred EEecCCCCH-----HHHH--HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeee Q lcl|NC_021302. 74 RIRPNGARP-----EVVE--HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWL 145 (484) Q Consensus 74 ~v~p~~~~~-----e~~~--~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~ 145 (484) .|..-..+. +.++ .+...|......- .-....|.++++.++ +.+.+|.+.+|+++... | .+ T Consensus 125 ~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~--------nP~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~-G--~~ 193 (574) T protein:vir:80 125 EIRLKDIEAEPTSHDIANIKRIESFLENTAQFR--------DPNRDNFTTFCKKLVRATYMYDQVNFEKVFDKD-G--NF 193 (574) T ss_pred EEEEeccCCCccchhhhhhhHHHHHHhccCCCC--------CCccccHHHHHHHHHHHHHhcCCeEEEEEECCC-C--cE Confidence 996432221 1111 1111111100000 001125777888776 57789999999998644 3 36 Q ss_pred eeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccC---ccccchhHHHHH Q lcl|NC_021302. 146 KRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPG---VWTGNSLLRPAY 222 (484) Q Consensus 146 ~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~---~p~G~gll~~~~ 222 (484) ..|.+++|.++.. ..+.++.+. ......+....+.....++...+|++++....+ ..||.|.+..+. T Consensus 194 ~~L~pl~p~~V~v-~~d~~~~~~---------~~~~~y~~~~~g~~~~~~~~~eiih~~~~~~~~~~~~~~G~spi~~a~ 263 (574) T protein:vir:80 194 IKFDTVDPTTIFL-ATNGEGKLI---------KNGERFVQVIDNRIVAKFNERELAFAVRNPRADIEVGQYGYPELEIAL 263 (574) T ss_pred EEEEEEcCceeEE-EEcCccccc---------cCceEEEEEeCCceEEEEccccEEEEeccCCCCcccccccccHHHHHH Confidence 7899999988753 333333211 000111222334445567888888888765543 568999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCcceEEe--cCCCCCCHHHHHHHHHHHHHHhcCC-ceE---EEccCCceEEEecccCC Q lcl|NC_021302. 223 KNWKLKDELIRIEAAAIRRHGIGVPYLKG--NEADSEDDDRMDELLEIASNYSGGE-SAG---LALTAGEEAGILSPNGT 296 (484) Q Consensus 223 ~~~~~K~~~~~~w~~f~Er~~~G~P~~~g--k~~~~~~~~~~~~l~~~l~~~~~g~-~a~---~vip~~~~ie~~~~~~~ 296 (484) ...-.-.....+-..|...- +.|-.+. +.+...++++++++.+.+.+...|. +++ ++.+.|++++-++.+.. T Consensus 264 ~~i~~~~~a~~~~~~~f~ng--~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~~s~~ 341 (574) T protein:vir:80 264 KQFIAHENTEVFNDRFFSHG--GTTRGILHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMTPSAN 341 (574) T ss_pred HHHHHHHHHHHHHHHHHhcc--CCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEEccCChh Confidence 88888888888888888853 5564333 4444568888999999988875552 232 44577888777766655 Q ss_pred chhHHHHHHHHHHHHHHHHhhhh-hcc---c----c---cccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 297 PLDPRRAIEYHDHQMALVALAHF-LNL---D----G---KGGSYALASVQAD-TFVQSVQTVADEIRDVAQAHVVEDIVD 364 (484) Q Consensus 297 ~~~~~~li~~~d~~Isk~ilGqt-lt~---~----~---~gGs~A~~evh~~-v~~~~~~aD~~~i~~~ln~qli~~l~~ 364 (484) ...|.+..++..++|+.++.-.. +.. . + +..++|-.+.... .....+.-.++.|+..||+.|++. T Consensus 342 D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~--- 418 (574) T protein:vir:80 342 DMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTYIVAE--- 418 (574) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh--- Confidence 56788889999999999965432 111 0 0 1123454454443 444567888999999999988863 Q ss_pred hCCCCccccceEEecCCCC-cHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCccccccc--------CCCcC Q lcl|NC_021302. 365 VNWGEDEPAPLLVFDEIGS-RQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDEST--------ADTGQ 435 (484) Q Consensus 365 ~Nf~~~~~~P~~~~~~~~~-~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~--------~~~~~ 435 (484) |+. . + +|+|..... ......+ +.+++..|+.. .+++|+.+|+|.-+.++....+. ....+ T Consensus 419 --~~~-~-~-~~~f~~~d~~~~~~~~~-~~~~~~~G~lT-----~NE~R~~lgl~Pi~gGD~~~~~~n~~~~~~~~~~~~ 487 (574) T protein:vir:80 419 --FGE-K-Y-QFQFRGGDLSAQLDKLK-IIEQEGKVFRT-----VNEIRHDKGLEPIKGGDVILNGVHIQAIGQALQEEQ 487 (574) T ss_pred --cCC-c-e-EEEecccchhhHHHHHH-HHHHHhCCccC-----HHHHHHHhCCCCCCCCCEeeeccceeeccccccccc Confidence 221 1 2 566754322 1222222 23456677643 58999999998766554432210 00000 Q ss_pred CCc--cccCCCCccccccc--ccccccccccccc--ccchHHHhcCcccCc----------ccCC Q lcl|NC_021302. 436 DEP--ETDEPALPNTSGTT--STTNAPQARKRPR--GRSPRDRRKTPDGAM----------PLWD 484 (484) Q Consensus 436 ~~~--~~~~~~~~~~~~~~--~~~~~~~~~~~~~--~~~~~~~~~~~~~~~----------~~~~ 484 (484) .+. +......+..+..+ ......+...... +-...++.....|.. ...| T Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 552 (574) T protein:vir:80 488 LEYQRSQDRLNRLLELSGGDVEQPEPEEPKDSQNDTDVSFQDEQQGLNGKSKKVNGKVDDNVGKD 552 (574) T ss_pred CCccchhccccccccccCCCCCCCCCCCCCCccccccchhhhhhhhhccchhhhcCCcccccccc Confidence 000 00000000000000 0000000000000 000011111111111 1111 No 37 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=99.80 E-value=5.8e-18 Score=115.07 Aligned_cols=400 Identities=14% Similarity=0.075 Sum_probs=229.4 Q ss_pred CCC----------------CCCCccceeeeecccccchhhhhhhcccccccccccccchHHHH-HHHHhcchHHHHHHHH Q lcl|NC_021302. 1 MAP----------------KTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTY-TRMCREEARIASVLRA 63 (484) Q Consensus 1 ~~~----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y-~~m~~~D~~v~s~l~~ 63 (484) |+| ..++.........+.. ++....++.... .+..+- +..+ +-+.|.+|+.. T Consensus 1 ~~~~~~mg~f~r~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~--------~g~~v~~~~al-~~~~V~~~i~~ 69 (432) T protein:vir:81 1 MPDEKKLGLFGQLKAMFVPPDPVDIGGGQTFTPVN--ATARDLGIIISD--------TGAAVNADAIM-RLDAVAACVKL 69 (432) T ss_pred CCchhhcchhhhhhhhcccccccccccccccccCc--cchhhhcccccc--------cCcccchHhhh-ccHHHHHHHHH Confidence 222 1111000000000000 000000111000 111111 2333 56899999999 Q ss_pred HHHHhhCCCcEEecCCCCH--HHHH-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeec Q lcl|NC_021302. 64 IGLPIRRTDWRIRPNGARP--EVVE-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYE 139 (484) Q Consensus 64 r~~~v~~~~~~v~p~~~~~--e~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~ 139 (484) +-..|.++++.|--...+. ++.+ -+...|.. +-.......++++.++ +.+.+|-+..++++. T Consensus 70 Ia~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~------------~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-- 135 (432) T protein:vir:81 70 VSQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLD------------GPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-- 135 (432) T ss_pred HHHhhhhCceeeEEecCCcceecccchHHHHHHh------------cccccCCHHHHHHHHHHHHhhcCCeEEEEEec-- Confidence 9999999999984322211 1111 11111110 0111234566777775 678899999998763 Q ss_pred CCeeeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHH Q lcl|NC_021302. 140 GGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLR 219 (484) Q Consensus 140 ~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~ 219 (484) +|. +..|.+++|.++. ...+.+|.+.+..+ ...+....+|.+.++++++.+..+ .+|.|.+. T Consensus 136 ~g~--~~~L~~l~~~~v~-v~~~~~g~~~y~~~--------------~~~g~~~~~~~~~iih~r~~~~dg-~~G~spi~ 197 (432) T protein:vir:81 136 DGR--IESLQYLANDRLT-ITTDPKGNTAYRYR--------------RTDGQMIDIPKQQIWKIMGYSLDG-ENGLSAIR 197 (432) T ss_pred CCc--EEEEEEEcCCceE-EEECCCCcEEEEEE--------------ecCceEEEEccccEEEecCCCCCC-cccccHHH Confidence 453 6788899998875 45666776554322 123344578888888887765544 78999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchh Q lcl|NC_021302. 220 PAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLD 299 (484) Q Consensus 220 ~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~ 299 (484) .+....-.-....++-..|... .+.|-.+.+.+...++++++++.+.+....+. ...++++.|++++-++.+..... T Consensus 198 ~~~~~i~~~~~~~~~~~~~f~n--g~~~~gil~~~~~l~~e~~~~~~~~~~~~~na-g~~~vl~~g~~~~~l~~~~~d~q 274 (432) T protein:vir:81 198 YGAQIFGTAIAAEAQAARAFRN--GQLQSVYYQIDRFLTDDQYDSFAKKVSGSVEA-GRAPLLEGGMDVKSLGLNPVDAQ 274 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHhc--CCCcceEEecCCCCCHHHHHHHHHHHhhhhcC-CCceecCCCceEEEccCCHHHHH Confidence 9888777777777777777764 36786777777888888889888888765432 24688999999988877666667 Q ss_pred HHHHHHHHHHHHHHHHhhhh-hcccccccchhhHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccc Q lcl|NC_021302. 300 PRRAIEYHDHQMALVALAHF-LNLDGKGGSYALAS----VQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAP 374 (484) Q Consensus 300 ~~~li~~~d~~Isk~ilGqt-lt~~~~gGs~A~~e----vh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P 374 (484) |.+..++..++|++++.-.. +....++|+++.+. ........-+.-.++.|+..||+.|+.+-- . .. - T Consensus 275 ~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~f~~~tl~P~~~~ie~~l~~kLl~~~~----~-~~--~ 347 (432) T protein:vir:81 275 LLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMTLSPWLRRIEQSIALNLLSPAE----R-RR--Y 347 (432) T ss_pred HHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccc----c-Cc--e Confidence 88888999999999976654 33333344443322 222333445667778888888887765411 1 11 2 Q ss_pred eEEecCC---CCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCC---ccccCCCCccc Q lcl|NC_021302. 375 LLVFDEI---GSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDE---PETDEPALPNT 448 (484) Q Consensus 375 ~~~~~~~---~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~---~~~~~~~~~~~ 448 (484) .|+|+.. ..|.+++++++.++.+.|+.. .+++|+.+|+|.-+++.+....... ..+. .+...+.+... T Consensus 348 ~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t-----~NE~R~~~glpp~~g~~~~~~~~~~-~~pl~~~~~~~~~~~~~~ 421 (432) T protein:vir:81 348 FADFDTSALLRADSAARSSYYSQLVNNGLMT-----RDEAREIEGLPKLGGNAAVLTVQSA-MVPLDSIGLQASPEPASG 421 (432) T ss_pred EEEeechhhhccCHHHHHHHHHHHHhCCCCC-----HHHHHHHhCCCCCCCCcceEeecCc-ccchhhhccCCCCCCCCC Confidence 5666532 357889999999999999754 4889999999876554332211100 0000 00000000000 Q ss_pred cccccccccccccc Q lcl|NC_021302. 449 SGTTSTTNAPQARK 462 (484) Q Consensus 449 ~~~~~~~~~~~~~~ 462 (484) .+...... ..+ T Consensus 422 ~~n~~~~~---~~~ 432 (432) T protein:vir:81 422 LGNQQQDK---VSK 432 (432) T ss_pred CCCccccc---ccC Confidence 00000000 000 No 38 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=99.80 E-value=4.7e-18 Score=115.60 Aligned_cols=404 Identities=13% Similarity=0.055 Sum_probs=228.9 Q ss_pred CCCCCCCccce-eeeecccccchhhhhhhcccccccccccccchHH-HHHHHHhcchHHHHHHHHHHHHhhCCCcEEecC Q lcl|NC_021302. 1 MAPKTVAPRTE-RGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVY-TYTRMCREEARIASVLRAIGLPIRRTDWRIRPN 78 (484) Q Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~-~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~ 78 (484) |+....-...+ ++...+..+. +.+...+.. .++ ..+. +-.+...+-+.|.+|+..+-..|.++++++... T Consensus 23 ~~~~~~f~~~e~r~~~~~~~~~-~~~~~~~~~------~~~-~~~~~~~~~~al~~~~V~acv~~Ia~~iA~lpl~~~~~ 94 (441) T protein:vir:98 23 LVVVGIFYKNEKRDLQYNEDDL-QMMVQTLPG------FQG-TKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVN 94 (441) T ss_pred hhccccccccccccccCCCcch-HHHHHHhhc------ccc-cCccccchhhhhccHHHHHHHHHHHHhhccCceEEecC Confidence 22222211111 1111111111 111111100 011 1111 112222357899999999999999999999754 Q ss_pred CCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCcccee Q lcl|NC_021302. 79 GARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIA 157 (484) Q Consensus 79 ~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~ 157 (484) +.... ...+...|.. +-....+..++++.++ +.+.+|.+.+++++... | .+..|.+++|.++. T Consensus 95 ~~~~~-~~~~~~lL~~------------~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~-G--~~~~L~~i~~~~v~ 158 (441) T protein:vir:98 95 GQINY-SDRIVNLLNT------------RPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT-G--EPMNLTFRKTSEIE 158 (441) T ss_pred Ccccc-cchHHHHHhc------------ccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC-C--cEEEEEEEcCceeE Confidence 43211 1111122210 0111224556666664 57889999999987533 3 47789999998885 Q ss_pred eeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 158 YWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAA 237 (484) Q Consensus 158 ~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~ 237 (484) +..+.+|.+.+..+.... ........+++..+|++++.+. +..+|.|.+..+....-.-....++... T Consensus 159 -v~~~~~g~~~~~~~~~~~----------~~~~~~~~~~~~dviHir~~~~-dg~~G~spi~~~~~~i~~~~a~~~~~~~ 226 (441) T protein:vir:98 159 -LKLDARGRLYYFHQRIDS----------NGNNIERNVKFEDMLDIKFYSL-DGINGLSLLDTLSRTIESDNNGKDFLNN 226 (441) T ss_pred -EEECCCCcEEEEEEEecc----------CcceeeEEEccccEEEeccCCC-CCccccCHHHHHHHHHHHHHHHHHHHHH Confidence 456778877654432111 1122234578888888887654 4478999999998877777778888888 Q ss_pred HHHHhcCCcceEEecCCCCC-CHHHHHHHHHHHHHHhcCC-c--eEEEccCCceEEEecccCCchhHHHHHHHHHHHHHH Q lcl|NC_021302. 238 AIRRHGIGVPYLKGNEADSE-DDDRMDELLEIASNYSGGE-S--AGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMAL 313 (484) Q Consensus 238 f~Er~~~G~P~~~gk~~~~~-~~~~~~~l~~~l~~~~~g~-~--a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk 313 (484) |.+. .+.|-.+.+++... ++++++++.+.+.+...|. + ..++++.|++++-++.+.....|.+..++..++|++ T Consensus 227 ~f~n--g~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~ 304 (441) T protein:vir:98 227 FLRN--GTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAG 304 (441) T ss_pred HHhc--cCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHH Confidence 8885 36676666766554 4666778888887776652 2 258899999998887665556688888999999999 Q ss_pred HHhhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC---CCcHHHHHH Q lcl|NC_021302. 314 VALAHFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI---GSRQDATAA 390 (484) Q Consensus 314 ~ilGqtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~---~~~~~~~ae 390 (484) ++.-...-.+..+++++..+. +..+..-+.-.++.|+..||+.|++. ... -+|+|+.. ..|.+..++ T Consensus 305 ~fgVPp~~lg~~~~~~s~~q~-~~~y~~tl~P~~~~ie~~ln~~L~~~--------~~~-~~~~fd~~~llr~d~~~~~~ 374 (441) T protein:vir:98 305 VFGIPLHKFGIETANMSITDA-NLDYLSTLKPYITCVCAELNFKFNDE--------YVN-REFKFDTTEIRVVDEKTQAE 374 (441) T ss_pred HhCCCHHHcCCCCCCccHHHH-HHHHHHHHHHHHHHHHHHHHhhcccc--------ccC-ceEEEechhhhccCHHHHHH Confidence 976543322222222222221 12233456677788888888765432 111 25666542 347788999 Q ss_pred HHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCccccc-ccCCC----cCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 391 ALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDE-STADT----GQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 391 ~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~-~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) +++++.+.|+.. .+++|+.+|+|.-++++...- ...+. ..++.+...+.......++.... + T Consensus 375 ~~~~~~~~G~~T-----~NE~R~~~gl~pi~gGd~~~~~~~~n~~~~~~~~~~q~~~~~~~~~~~kgGe~n--e 441 (441) T protein:vir:98 375 IDKINIDSGKMN-----IDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEEN--E 441 (441) T ss_pred HHHHHHhCCCcC-----HHHHHHHhCCCCCCCCCcceEeecccccccccccccccccccccccccCCCCCC--C Confidence 999999999864 489999999986555543221 11000 01111111111000111111111 1 No 39 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=99.79 E-value=4.2e-18 Score=115.87 Aligned_cols=396 Identities=10% Similarity=0.006 Sum_probs=227.4 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) .+.......+..+.+....+.+ .... +..+ ..+..+ +=+.|.+|+..+-..|.+++|.|...+. T Consensus 8 ~~~~~~~~~~~~~~~~~~~g~~--------~s~~------~~~v-t~~~al-~~~~v~~~v~~ia~~iA~lp~~~~~~~~ 71 (419) T protein:vir:14 8 LSNLGQTQMSAGGWVSALLGSS--------RSDS------GQVV-TPASAL-ALTVLQNCVTLLAESIAQLPIELYERSG 71 (419) T ss_pred cccccccccCcchhhHHhhcCC--------CccC------Cccc-chHHhh-ccHHHHHHHHHHHHhhccCceEEEEecC Confidence 2222222222222222211111 1000 0001 112344 4678999999999999999999854332 Q ss_pred CH--HHHH-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccce Q lcl|NC_021302. 81 RP--EVVE-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSI 156 (484) Q Consensus 81 ~~--e~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~ 156 (484) +. ++.+ -+...|. .+-....+..++++.++ +.+.+|-+++++++... | .+..|.+.+|.++ T Consensus 72 ~~~~~~~~~~l~~lL~------------~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~-G--~~~~l~pl~~~~v 136 (419) T protein:vir:14 72 EDRKPATDHPLYSILK------------YEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSD-G--VIQGLYPLDNEAV 136 (419) T ss_pred CccccccccHHHHHHH------------hhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEEEEEecCceE Confidence 21 1111 1111111 01112335677777754 67889999999976543 3 3678999999888 Q ss_pred eeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 157 AYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEA 236 (484) Q Consensus 157 ~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~ 236 (484) . ...+.++.+++..+. ...+|.+.+++.++.+ .+..+|.|.+..+....-.-....++.. T Consensus 137 ~-v~~~~~~~~~y~~~~------------------~~~~~~~~i~h~~~~~-~dg~~G~s~i~~~~~~i~~~~~~~~~~~ 196 (419) T protein:vir:14 137 T-VMRGSDLKPVYRVRG------------------SDPMPQRLVHHVRWMS-INGYTGLSPVLLHANAIGHAQAIQQYAG 196 (419) T ss_pred E-EEECCCceEEEEEcc------------------CcccchhheeEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 6 445666666543221 1124445555555444 4558999999999988777777788888 Q ss_pred HHHHHhcCCcceEEecCCC----CCCHHHHHHHHHHHHHHhcCCc---eEEEccCCceEEEecccCCchhHHHHHHHHHH Q lcl|NC_021302. 237 AAIRRHGIGVPYLKGNEAD----SEDDDRMDELLEIASNYSGGES---AGLALTAGEEAGILSPNGTPLDPRRAIEYHDH 309 (484) Q Consensus 237 ~f~Er~~~G~P~~~gk~~~----~~~~~~~~~l~~~l~~~~~g~~---a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~ 309 (484) .+...- +.|-.+.+++. ..++++++++.+.+++...|.. ..++++.|++++-++.+.....|.+..++..+ T Consensus 197 ~~f~ng--~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~ 274 (419) T protein:vir:14 197 KSFMNG--TALSGVIERPKDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQEGMTFRPLSMTNVDAALIDALRLSAL 274 (419) T ss_pred HHHhcc--CCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhhHHHHHHHHHHHH Confidence 888752 56744444432 2357778888888888766532 25788999988877766555568888899999 Q ss_pred HHHHHHhhhhh-cccccccchhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC---CCc Q lcl|NC_021302. 310 QMALVALAHFL-NLDGKGGSYALASVQADT-FVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI---GSR 384 (484) Q Consensus 310 ~Isk~ilGqtl-t~~~~gGs~A~~evh~~v-~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~---~~~ 384 (484) +|++++.-..- ....++++++-.|-+... ...-+.-.++.|+..||+.|+.+--. .. -+++|+.. ..| T Consensus 275 ~Ia~~fgVpp~~lg~~~~~t~s~~E~~~~~f~~~~L~P~~~~ie~~l~~kll~~~~~-----~~--~~i~fd~~~l~r~d 347 (419) T protein:vir:14 275 DIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPWVKRHEQAKTRDLLLPSER-----KQ--YFIEYNLAGLLRGD 347 (419) T ss_pred HHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcccc-----CC--eEEEEechhhhccC Confidence 99999766542 223345666554544433 34566778888888888866543111 11 14566532 347 Q ss_pred HHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccccccccccccccc Q lcl|NC_021302. 385 QDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRP 464 (484) Q Consensus 385 ~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 464 (484) .+..+++++++++.|+.. .+++|+.+|+|.-+.++....+.--.....+.. .+....++......+..+.. T Consensus 348 ~~~~~~~~~~~~~~G~~T-----~NE~R~~~gl~p~~gGD~~~~~~n~~~~~~~~~----~~~~~~~~~~~~~~e~~~~l 418 (419) T protein:vir:14 348 QSSRYAAYAVGRQWGWLS-----INDIRRLENMPPVKGGDIYLSPMNMVDASKPQQ----LPVGKSEPTKAAIDEIGRIL 418 (419) T ss_pred HHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCcCeeeecccccccccccc----ccCCCCCCccccccchhccc Confidence 888999999999999864 478999999987666554332210000000000 00001111111111221111 Q ss_pred c Q lcl|NC_021302. 465 R 465 (484) Q Consensus 465 ~ 465 (484) + T Consensus 419 ~ 419 (419) T protein:vir:14 419 S 419 (419) T ss_pred C Confidence 1 No 40 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=99.79 E-value=8.2e-18 Score=114.27 Aligned_cols=423 Identities=12% Similarity=0.073 Sum_probs=230.6 Q ss_pred CCCCCCCccceee-eecccccchhhhhhhcccccccccccccchHHHH-HHHHhcchHHHHHHHHHHHHhhCCCcEEecC Q lcl|NC_021302. 1 MAPKTVAPRTERG-YVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTY-TRMCREEARIASVLRAIGLPIRRTDWRIRPN 78 (484) Q Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y-~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~ 78 (484) =.++.++.....+ ...+. ....... + .... .+..+- +..+ +=+.|.+|+..+-..|.+++++|... T Consensus 9 ~~~~~~~~~~~~~~~~~~~-~~~~~~~-~--------~~~~-~g~~v~~~~al-~~~~v~~~i~~ia~~iA~lp~~~~~~ 76 (457) T protein:vir:62 9 GRGHSPALDAAEGRAWEPY-DPSIYNL-G--------ATAS-SGERVTPHDAL-QVSAVFASVRLLSETIATLPLSTYSK 76 (457) T ss_pred ccccccccccccccccccc-hhhhhhc-c--------cccc-CCceechHHhh-ccHHHHHHHHHHHHhHhhCceEEEEe Confidence 0011111000000 00000 0000000 0 0000 011111 2344 46899999999999999999998543 Q ss_pred CCCH-HHHHH--HHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCcc Q lcl|NC_021302. 79 GARP-EVVEH--VAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQS 154 (484) Q Consensus 79 ~~~~-e~~~~--~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~ 154 (484) .+.. +..+. ....+. .......+.++++.++ +.+.+|.+++++.+. +|. +..|.+.+|. T Consensus 77 ~~~~~~~~~~~~~~~ll~-------------~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~--~g~--~~~l~~l~p~ 139 (457) T protein:vir:62 77 RGGTRKEIDTPEWLDFPN-------------AEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA--GPN--IAGLDVLDPT 139 (457) T ss_pred cCCccccccchHHHHhcc-------------ccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC--CCc--EEEEEEEcCc Confidence 3221 11110 011110 0011235777888776 578899999998654 443 5678888888 Q ss_pred ceeeeeecCCCceeeeecccccccccccceeccCCC---CcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHH Q lcl|NC_021302. 155 SIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNS---MGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDEL 231 (484) Q Consensus 155 ~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~---~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~ 231 (484) ++.......++...... ..+.....+ ....++++.+|++++....+..+|.|.+..+....-.-... T Consensus 140 ~v~v~~~~~~~~~~~~~----------~~y~~~~~g~~~~~~~~~~~eiih~r~~~~~~~~~G~sp~~~~~~~i~~~~~~ 209 (457) T protein:vir:62 140 KIHVHMVMVDGLRRKVF----------EAYDIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAA 209 (457) T ss_pred ceEEEEeccCCccceeE----------EEEEEccCCceeEEEeeCccceEEecCCCCCCceecccHHHHHHHHHHHHHHH Confidence 77543322222110000 001111111 11245777888887776667789999999999888888888 Q ss_pred HHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCC-ce--EEEccCCceEEEecccCCchhHHHHHHHHH Q lcl|NC_021302. 232 IRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGE-SA--GLALTAGEEAGILSPNGTPLDPRRAIEYHD 308 (484) Q Consensus 232 ~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~-~a--~~vip~~~~ie~~~~~~~~~~~~~li~~~d 308 (484) .++...|... .++|-.+.+++...++++++++.+.+.++.+|. ++ .++++.|++++-++.+.....|.+..++.. T Consensus 210 ~~~~~~~f~n--g~~p~gil~~~~~ls~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~ 287 (457) T protein:vir:62 210 QKYGAHFFRN--GAMPGAVVEVPGTMSEEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQV 287 (457) T ss_pred HHHHHHHHhc--cCCcceEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHH Confidence 8888888885 377877778888889999999999999887653 22 578999999988877655567888888999 Q ss_pred HHHHHHHhhhh-hcccccccchhh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC--C- Q lcl|NC_021302. 309 HQMALVALAHF-LNLDGKGGSYAL---ASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE--I- 381 (484) Q Consensus 309 ~~Isk~ilGqt-lt~~~~gGs~A~---~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~--~- 381 (484) .+|++++.-.. +.....++++.. .+........-+.--++.|+..||+.|+... ..... .++|+. . T Consensus 288 ~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~~l~P~~~~ie~~ln~~L~~~~-----~~~~~--~i~fd~~~l~ 360 (457) T protein:vir:62 288 PEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFSLRPWLERIEAGFNRLLFAET-----ADRFR--FVKFNLDEIK 360 (457) T ss_pred HHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCcc-----ccCce--EEEeechhhh Confidence 99999976543 333333343322 2333334455667778888888888776542 11111 455543 2 Q ss_pred CCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCC--cccccc-----cCCCcCCCccccCCCCccccccccc Q lcl|NC_021302. 382 GSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPD--ADDDES-----TADTGQDEPETDEPALPNTSGTTST 454 (484) Q Consensus 382 ~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~--e~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (484) ..|.+..++++.++++.|+.. .+++|+.+|+|.-+++ +....+ ........+.+..++.......+.. T Consensus 361 ~~d~~~r~~~~~~~~~~G~~T-----~NE~R~~~gl~pi~~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (457) T protein:vir:62 361 RGAPKERMELWSLGLQNGIYS-----IDEVRAAEDMTPLPDGLGEKYRVPLNLGEIGEEPEPEPAPAPPAIDPPAEEPAD 435 (457) T ss_pred ccCHHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCCCcceeeeccccccccccccccccCCCccCCCCccCCCC Confidence 347788999999999999754 4899999999866554 222211 0000000111100000000000000 Q ss_pred cccccccccccccchHHHhcCcccC Q lcl|NC_021302. 455 TNAPQARKRPRGRSPRDRRKTPDGA 479 (484) Q Consensus 455 ~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (484) . .+. ....+....+...+.|.+ T Consensus 436 ~--~~~-~~~~~~~d~~~~~~~~~~ 457 (457) T protein:vir:62 436 D--EEP-DNAEGDPDEGETEDDDDA 457 (457) T ss_pred C--CCC-CCCCCCCccccccccccC Confidence 0 000 000111111111112222 No 41 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=99.79 E-value=6.9e-18 Score=114.68 Aligned_cols=391 Identities=11% Similarity=-0.016 Sum_probs=232.7 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHH-HHHHHhcchHHHHHHHHHHHHhhCCCcEEecCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYT-YTRMCREEARIASVLRAIGLPIRRTDWRIRPNG 79 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~-y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~ 79 (484) ..+++ .++..-.+ ....++..... +..+ .+.++ +-+.|.+|+..+-..|.+++|.|--.. T Consensus 7 ~~~~~----~~~~~~~~------~~~~~~g~~~~--------~~~v~~~~al-~~~~v~~~i~~ia~~ia~lp~~~~~~~ 67 (409) T protein:vir:10 7 FKNQS----QEISIDDK------KILEWLGINPS--------ETYVNGKSCL-KQATVFGCIRILSDNISKLPIKIYQKK 67 (409) T ss_pred ccCcC----CCCCCChH------HHHHHhcCCcC--------cceechhhhh-ccHHHHHHHHHHHHhhhhCceEEEEec Confidence 11111 11100000 00011100000 0111 12344 467899999999999999999984322 Q ss_pred CCH-HHHH-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccce Q lcl|NC_021302. 80 ARP-EVVE-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSI 156 (484) Q Consensus 80 ~~~-e~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~ 156 (484) +.. ++.+ -+...|. .+-....++.++++.++ +.+.+|-+.+++++... |. +..|.+++|.++ T Consensus 68 ~~~~~~~~~~l~~lL~------------~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~-G~--~~~L~~i~~~~V 132 (409) T protein:vir:10 68 DGIKRVPDHYLEYLLK------------LRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKN-GE--IKGLYPLKSDGM 132 (409) T ss_pred CCeeeccCchHHHHHh------------hccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC-Cc--EEEEEEEcCCce Confidence 111 1111 0111111 01122345677887776 57889999999987644 32 678999999887 Q ss_pred eeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 157 AYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEA 236 (484) Q Consensus 157 ~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~ 236 (484) . ...+++|.+.... ..........+....+|....|+.++.. .+.++|.|.+..+....-.-....++.. T Consensus 133 ~-v~~~~~~~~~~~~--------~~~y~~~~~~g~~~~~~~~evih~r~~~-~d~~~G~s~i~~~~~~i~~~~~~~~~~~ 202 (409) T protein:vir:10 133 K-IFVDDTGLLNSEN--------NVWYLYTDDLGQRHKFMSDEILHFKGLT-ADGLAGLSVIELLNHLIENGKSSETYLN 202 (409) T ss_pred E-EEEcCCccccccc--------eEEEEEEeCCceeEEeccccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHHH Confidence 5 3445444332110 0011112233445678888888877654 4568999999999988878778888888 Q ss_pred HHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCc---eEEEccCCceEEEecccCCchhHHHHHHHHHHHHHH Q lcl|NC_021302. 237 AAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGES---AGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMAL 313 (484) Q Consensus 237 ~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~---a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk 313 (484) .+... .+.|-.+.+.+...++++.+++.+.+.++..|.. ..++++.|++++-++.+.....|.+..++..++|++ T Consensus 203 ~~f~n--g~~~~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~ 280 (409) T protein:vir:10 203 NFFKN--GLQVKGLVQYAGDLNPEAEEVFKENFERMSSGLKNAHRIAMLPIGYKFEPISQKLVDAQFLENSQLTIRQIAS 280 (409) T ss_pred HHHhc--cCCCcEEEEcCCCCCHHHHHHHHHHHHHHhccccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHH Confidence 88875 3667666677777888889999999988876532 368899999998887776666788899999999999 Q ss_pred HHhhhh-hcccccccchhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC---CCcHHHH Q lcl|NC_021302. 314 VALAHF-LNLDGKGGSYALASVQ-ADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI---GSRQDAT 388 (484) Q Consensus 314 ~ilGqt-lt~~~~gGs~A~~evh-~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~---~~~~~~~ 388 (484) ++.-.. +....++++++..+.. ......-+.-.++.|+..||+.|+..-- ++ .. -+|+|+.. ..|.++. T Consensus 281 ~fgVPp~~lg~~~~~~~~~~e~~~~~f~~~~l~P~~~~ie~~ln~kL~~~~~---~~-~~--~~~~fd~~~ll~~d~~~~ 354 (409) T protein:vir:10 281 VFGVKMHQLNDLDRATHSNITEQNREFYIDTLQSILNMYELEINYKLFLISE---IK-NG--FYSKFNVDTILRADIKTR 354 (409) T ss_pred HhCCCHHHcCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCchh---cc-CC--cEEEEechhhhccCHHHH Confidence 976653 3333334556554433 3444555677888888888876543211 11 11 24666532 2477889 Q ss_pred HHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccccc Q lcl|NC_021302. 389 AAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTST 454 (484) Q Consensus 389 ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (484) ++++.++++.|+.. .+++|+.+|+|.-+.++....+. + -.+ ........ ..| +.. T Consensus 355 ~~~~~~~~~~G~~T-----~NE~R~~lgl~p~~ggD~~~~~~-n-~~~--~~~~~~~~-~kg-Ge~ 409 (409) T protein:vir:10 355 YESYKEAIQNGFKT-----PNEIRELEEDEPLEGGDVLLING-N-MIP--VKMAGEQY-SKG-GEK 409 (409) T ss_pred HHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCcCeeeecc-C-ccc--hhhccccc-ccc-CCC Confidence 99999999999865 47899999998765554332211 0 001 11000000 011 111 No 42 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=99.79 E-value=6.1e-18 Score=114.97 Aligned_cols=422 Identities=12% Similarity=0.079 Sum_probs=227.9 Q ss_pred eeeecccccchhhhh------hhcccccc-cccc--cccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCCCH Q lcl|NC_021302. 12 RGYVNPLAGFGTFLA------QGLDQFEQ-VDEL--RWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGARP 82 (484) Q Consensus 12 ~~~~~~~~~~~~~~~------~~~~~~~~-~~~l--r~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~ 82 (484) =|..+...+.+.... ..+.+... ...+ ....+..+..+...+=+.|.+|+..+-..|.+++++|....... T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~ 80 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPSIYNLGAVAASGETVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGS 80 (457) T ss_pred CchhhhhhcccccccccccccccccccchHHHhhcccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 111111100000000 00000000 0000 00011222332223468899999999999999999985432221 Q ss_pred -HHHHHHHHHHHhhhccchhhhhHHHhhc----CCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccce Q lcl|NC_021302. 83 -EVVEHVAACLGLPVEGDESDKPTPRTRG----RFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSI 156 (484) Q Consensus 83 -e~~~~~~~~l~~~~~~~~~~~~~~~~~~----~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~ 156 (484) +.+. . +....++. ...+.++++.++ +.+.+|.++++|++. +| .+..|.+.+|..+ T Consensus 81 ~~~~~--~-------------~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~--~g--~~~~l~~l~p~~v 141 (457) T protein:vir:13 81 RKEIV--T-------------PEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQ--GP--NIVGLDVLDPTKI 141 (457) T ss_pred ccccc--c-------------chHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec--CC--cEEEEEEEccCce Confidence 1110 0 01111111 234667777776 578899999998764 44 3567888888877 Q ss_pred eeeeecCCCceeeeecccccccccccceeccCCC---CcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHH Q lcl|NC_021302. 157 AYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNS---MGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIR 233 (484) Q Consensus 157 ~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~---~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~ 233 (484) .......++...... ..+.....+ ....+++..+|++++....+..+|.|.+..+....-.-....+ T Consensus 142 ~v~~~~~~~~~~~~~----------~~y~~~~~~~~~~~~~~~~~diih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~ 211 (457) T protein:vir:13 142 HVHMVMVDGLRRKVF----------EAYDIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQK 211 (457) T ss_pred EEEEecCCCccceeE----------EEEEEecCCceeeEEeeCccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHH Confidence 543333332211100 000011111 1224667788877777666668999999999988888888888 Q ss_pred HHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCc---eEEEccCCceEEEecccCCchhHHHHHHHHHHH Q lcl|NC_021302. 234 IEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGES---AGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQ 310 (484) Q Consensus 234 ~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~---a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~ 310 (484) +...|... .++|-.+.+.+...++++++++.+.+.+...|.. ..++++.|++++-++.+.....|.+..++...+ T Consensus 212 ~~~~~f~n--g~~p~gil~~~~~ls~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~ 289 (457) T protein:vir:13 212 YGSKFFAN--GAMPGAVVEVPGTMSEEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPE 289 (457) T ss_pred HHHHHHhc--CCCcceEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHH Confidence 88888875 3778777778788899999999999998876632 357899999998887765555688888899999 Q ss_pred HHHHHhhhh-hcccccccchh---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC--C-CC Q lcl|NC_021302. 311 MALVALAHF-LNLDGKGGSYA---LASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE--I-GS 383 (484) Q Consensus 311 Isk~ilGqt-lt~~~~gGs~A---~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~--~-~~ 383 (484) |++++.-.. |....+++++. ..+........-+.-.++.|+..||+.|+...- .... .++|+. . .. T Consensus 290 Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~tl~P~~~~ie~~ln~~L~~~~~-----~~~~--~i~fd~~~l~~~ 362 (457) T protein:vir:13 290 IARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFSLRPWLERIEAGFNRLLFAETA-----DRFR--FVKFNLDEIKRG 362 (457) T ss_pred HHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccc-----cCce--eEEeechhhhcc Confidence 999976543 33333333331 223333444556677888888888887766421 1112 355543 2 24 Q ss_pred cHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCC--ccccccc---CCCcCCCcc--ccCCCCccccccccccc Q lcl|NC_021302. 384 RQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPD--ADDDEST---ADTGQDEPE--TDEPALPNTSGTTSTTN 456 (484) Q Consensus 384 ~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~--e~~~~~~---~~~~~~~~~--~~~~~~~~~~~~~~~~~ 456 (484) |.++.++++.++.+.|+.. .+++|+.+|+|.-+++ +....+. ..+..+..+ ...++.......+.... T Consensus 363 D~~~r~~~~~~~~~~G~~T-----~NE~R~~~gl~Pi~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (457) T protein:vir:13 363 APKERMELWSLGLQNGIYS-----IDEVRAAEDMTPLPDGLGEKYRVPLNLGEVGEEPEPEPAPAPPAIEPPAEEPDEEP 437 (457) T ss_pred CHHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCCcccceeeccccccccccccccccCCCCCCCCCccccCCCC Confidence 7789999999999999764 4789999999765443 2221110 000000000 00000000000000000 Q ss_pred cccccccccccchHHHhcCcccC Q lcl|NC_021302. 457 APQARKRPRGRSPRDRRKTPDGA 479 (484) Q Consensus 457 ~~~~~~~~~~~~~~~~~~~~~~~ 479 (484) ..+-.....+. ..+...+-+ T Consensus 438 ~~~g~~d~~~~---~~~~~~~~~ 457 (457) T protein:vir:13 438 EPEGKPDDEGA---TEEDDEDDA 457 (457) T ss_pred CCCCCCccccC---CCCcccccC Confidence 00000000000 000001111 No 43 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=99.78 E-value=1.4e-17 Score=112.94 Aligned_cols=403 Identities=13% Similarity=0.071 Sum_probs=228.7 Q ss_pred CCCCCCCc---------cc-eeeeecccccchhhhhhhcccccccccccccchHHHH--HHHHhcchHHHHHHHHHHHHh Q lcl|NC_021302. 1 MAPKTVAP---------RT-ERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTY--TRMCREEARIASVLRAIGLPI 68 (484) Q Consensus 1 ~~~~~~~~---------~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y--~~m~~~D~~v~s~l~~r~~~v 68 (484) -+-|++.. .. .++...+..+. ..++..+.. .. +..+..| +..+ +-+.|.+|+..+-..| T Consensus 14 ~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~-~~~~~~~~~------~~-~~~~~~~~~~~al-~~~~V~~cv~~Ia~~i 84 (441) T protein:vir:79 14 KSRKQSRKELVVVGIFYKNEKRDLQYNEDDL-QMMVQTLPG------FQ-GTKLRQYKDIEAI-RHSDIFTAVMMIASDL 84 (441) T ss_pred cccccchhhhhccccccccccccccCCCcch-HHHHHHhcc------cC-cccccccchhhhh-ccHHHHHHHHHHHHhh Confidence 12222211 11 11111111111 011110000 00 0111112 1233 4788999999999999 Q ss_pred hCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeee Q lcl|NC_021302. 69 RRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKR 147 (484) Q Consensus 69 ~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~ 147 (484) .++++++...+.... ...+...|.. +-....+..++++.+. +.+.+|.+.+++++... | .+.. T Consensus 85 A~lp~~~~~~~~~~~-~~~~~~lL~~------------~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~-G--~~~~ 148 (441) T protein:vir:79 85 ARMPIRVTVNGQINY-SDRIVNLLNT------------RPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT-G--EPMN 148 (441) T ss_pred ccCceeeecCccccc-cchHHHHHhc------------ccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEE Confidence 999999975432111 1111111110 0111223556676665 57889999999987533 3 4778 Q ss_pred eeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHH Q lcl|NC_021302. 148 LAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKL 227 (484) Q Consensus 148 l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~ 227 (484) |.+++|.++. ...+.+|.+.+..+...+. .......+++..+|++++.+. +..+|.|.+..+....-. T Consensus 149 L~~i~~~~v~-v~~d~~g~~~~~~~~~~~~----------~~~~~~~~~~~dvih~k~~~~-dg~~G~spl~~~~~~i~~ 216 (441) T protein:vir:79 149 LTFRKTSEIE-LKSDARGRLYYFHQRIDSN----------GNNIERNVKFEDMLDIKFYSL-DGINGLSLLDTLSRTIES 216 (441) T ss_pred EEEEcCceeE-EEECCCccEEEEEEEeccC----------CceeEEEEccccEEEeccCCC-CCccccCHHHHHHHHHHH Confidence 9999998885 4667777766544322111 112234578888888887544 447999999999887777 Q ss_pred HHHHHHHHHHHHHHhcCCcceEEecCCCCC-CHHHHHHHHHHHHHHhcCC-ce--EEEccCCceEEEecccCCchhHHHH Q lcl|NC_021302. 228 KDELIRIEAAAIRRHGIGVPYLKGNEADSE-DDDRMDELLEIASNYSGGE-SA--GLALTAGEEAGILSPNGTPLDPRRA 303 (484) Q Consensus 228 K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~-~~~~~~~l~~~l~~~~~g~-~a--~~vip~~~~ie~~~~~~~~~~~~~l 303 (484) -....++...|... .+.|-.+.+++... ++++++++.+.+.+...|. ++ .++++.|++++-++.+.....|.+. T Consensus 217 ~~~~~~~~~~~f~n--g~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~ 294 (441) T protein:vir:79 217 DNNGKDFLNNFLRN--GTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRE 294 (441) T ss_pred HHHHHHHHHHHHhc--cCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHH Confidence 77778888888885 36676666665554 4666778888887776552 22 4789999998888766655668889 Q ss_pred HHHHHHHHHHHHhhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC-- Q lcl|NC_021302. 304 IEYHDHQMALVALAHFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI-- 381 (484) Q Consensus 304 i~~~d~~Isk~ilGqtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~-- 381 (484) .++..++|++++.-...-.+..+++++..+. +..+..-+.-.++.|+..||+.|++.. .. -+|+|+.. T Consensus 295 ~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~-~~~~~~tl~P~~~~ie~eln~kl~~~~-------~~--~~~~fd~~~l 364 (441) T protein:vir:79 295 NKSSTREIAGVFGIPLHKFGIETANMSITDA-NLDYLSTLKPYITCVCAELNFKFNDEY-------VN--REFKFDTTEI 364 (441) T ss_pred HHHhHHHHHHHhCCCHHHcCCCCCCccHHHH-HHHHHHHHHHHHHHHHHHHhhhccccc-------cC--ceEEeechhh Confidence 9999999999976543222212222222221 222334567788888888887664321 11 25666532 Q ss_pred -CCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccc-c----CCCcCCCccccCCCCcccccccccc Q lcl|NC_021302. 382 -GSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDES-T----ADTGQDEPETDEPALPNTSGTTSTT 455 (484) Q Consensus 382 -~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~-~----~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (484) ..|.+..+++++++.+.|+.. .+++|+.+|+|.-++++...-. . +....++.+...+.......++... T Consensus 365 lr~D~~~~~~~~~~~i~~G~~T-----~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~ 439 (441) T protein:vir:79 365 RVVDEKTQAEIDKINIDSGKMN-----IDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEE 439 (441) T ss_pred hccCHHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCCCcceEeecccccccccccccccccccccccccCCCCC Confidence 247788999999999999864 4789999999866555432211 1 0000111111111111111111111 Q ss_pred cccc Q lcl|NC_021302. 456 NAPQ 459 (484) Q Consensus 456 ~~~~ 459 (484) . + T Consensus 440 ~--e 441 (441) T protein:vir:79 440 N--E 441 (441) T ss_pred C--C Confidence 1 0 No 44 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=99.78 E-value=1.4e-17 Score=112.94 Aligned_cols=403 Identities=13% Similarity=0.071 Sum_probs=228.7 Q ss_pred CCCCCCCc---------cc-eeeeecccccchhhhhhhcccccccccccccchHHHH--HHHHhcchHHHHHHHHHHHHh Q lcl|NC_021302. 1 MAPKTVAP---------RT-ERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTY--TRMCREEARIASVLRAIGLPI 68 (484) Q Consensus 1 ~~~~~~~~---------~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y--~~m~~~D~~v~s~l~~r~~~v 68 (484) -+-|++.. .. .++...+..+. ..++..+.. .. +..+..| +..+ +-+.|.+|+..+-..| T Consensus 14 ~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~-~~~~~~~~~------~~-~~~~~~~~~~~al-~~~~V~~cv~~Ia~~i 84 (441) T protein:vir:94 14 KSRKQSRKELVVVGIFYKNEKRDLQYNEDDL-QMMVQTLPG------FQ-GTKLRQYKDIEAI-RHSDIFTAVMMIASDL 84 (441) T ss_pred cccccchhhhhccccccccccccccCCCcch-HHHHHHhcc------cC-cccccccchhhhh-ccHHHHHHHHHHHHhh Confidence 12222211 11 11111111111 011110000 00 0111112 1233 4788999999999999 Q ss_pred hCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeee Q lcl|NC_021302. 69 RRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKR 147 (484) Q Consensus 69 ~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~ 147 (484) .++++++...+.... ...+...|.. +-....+..++++.+. +.+.+|.+.+++++... | .+.. T Consensus 85 A~lp~~~~~~~~~~~-~~~~~~lL~~------------~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~-G--~~~~ 148 (441) T protein:vir:94 85 ARMPIRVTVNGQINY-SDRIVNLLNT------------RPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT-G--EPMN 148 (441) T ss_pred ccCceeeecCccccc-cchHHHHHhc------------ccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEE Confidence 999999975432111 1111111110 0111223556676665 57889999999987533 3 4778 Q ss_pred eeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHH Q lcl|NC_021302. 148 LAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKL 227 (484) Q Consensus 148 l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~ 227 (484) |.+++|.++. ...+.+|.+.+..+...+. .......+++..+|++++.+. +..+|.|.+..+....-. T Consensus 149 L~~i~~~~v~-v~~d~~g~~~~~~~~~~~~----------~~~~~~~~~~~dvih~k~~~~-dg~~G~spl~~~~~~i~~ 216 (441) T protein:vir:94 149 LTFRKTSEIE-LKSDARGRLYYFHQRIDSN----------GNNIERNVKFEDMLDIKFYSL-DGINGLSLLDTLSRTIES 216 (441) T ss_pred EEEEcCceeE-EEECCCccEEEEEEEeccC----------CceeEEEEccccEEEeccCCC-CCccccCHHHHHHHHHHH Confidence 9999998885 4667777766544322111 112234578888888887544 447999999999887777 Q ss_pred HHHHHHHHHHHHHHhcCCcceEEecCCCCC-CHHHHHHHHHHHHHHhcCC-ce--EEEccCCceEEEecccCCchhHHHH Q lcl|NC_021302. 228 KDELIRIEAAAIRRHGIGVPYLKGNEADSE-DDDRMDELLEIASNYSGGE-SA--GLALTAGEEAGILSPNGTPLDPRRA 303 (484) Q Consensus 228 K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~-~~~~~~~l~~~l~~~~~g~-~a--~~vip~~~~ie~~~~~~~~~~~~~l 303 (484) -....++...|... .+.|-.+.+++... ++++++++.+.+.+...|. ++ .++++.|++++-++.+.....|.+. T Consensus 217 ~~~~~~~~~~~f~n--g~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~ 294 (441) T protein:vir:94 217 DNNGKDFLNNFLRN--GTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRE 294 (441) T ss_pred HHHHHHHHHHHHhc--cCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHH Confidence 77778888888885 36676666665554 4666778888887776552 22 4789999998888766655668889 Q ss_pred HHHHHHHHHHHHhhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC-- Q lcl|NC_021302. 304 IEYHDHQMALVALAHFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI-- 381 (484) Q Consensus 304 i~~~d~~Isk~ilGqtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~-- 381 (484) .++..++|++++.-...-.+..+++++..+. +..+..-+.-.++.|+..||+.|++.. .. -+|+|+.. T Consensus 295 ~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~-~~~~~~tl~P~~~~ie~eln~kl~~~~-------~~--~~~~fd~~~l 364 (441) T protein:vir:94 295 NKSSTREIAGVFGIPLHKFGIETANMSITDA-NLDYLSTLKPYITCVCAELNFKFNDEY-------VN--REFKFDTTEI 364 (441) T ss_pred HHHhHHHHHHHhCCCHHHcCCCCCCccHHHH-HHHHHHHHHHHHHHHHHHHhhhccccc-------cC--ceEEeechhh Confidence 9999999999976543222212222222221 222334567788888888887664321 11 25666532 Q ss_pred -CCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccc-c----CCCcCCCccccCCCCcccccccccc Q lcl|NC_021302. 382 -GSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDES-T----ADTGQDEPETDEPALPNTSGTTSTT 455 (484) Q Consensus 382 -~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~-~----~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (484) ..|.+..+++++++.+.|+.. .+++|+.+|+|.-++++...-. . +....++.+...+.......++... T Consensus 365 lr~D~~~~~~~~~~~i~~G~~T-----~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~ 439 (441) T protein:vir:94 365 RVVDEKTQAEIDKINIDSGKMN-----IDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEE 439 (441) T ss_pred hccCHHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCCCcceEeecccccccccccccccccccccccccCCCCC Confidence 247788999999999999864 4789999999866555432211 1 0000111111111111111111111 Q ss_pred cccc Q lcl|NC_021302. 456 NAPQ 459 (484) Q Consensus 456 ~~~~ 459 (484) . + T Consensus 440 ~--e 441 (441) T protein:vir:94 440 N--E 441 (441) T ss_pred C--C Confidence 1 0 No 45 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=99.78 E-value=3.9e-18 Score=116.05 Aligned_cols=395 Identities=10% Similarity=-0.017 Sum_probs=223.8 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) .-.+.+. ..+. +.. .....+..+ . ...+..+ ..+..+ +-+.|.+|+..+...|.+++|.+....+ T Consensus 6 ~~~~~~~--~~~~--~~~-~~~~~~~~~----~----~~~g~~v-~~~~al-~~~~v~~~i~~ia~~ia~lp~~~~~~~~ 70 (419) T protein:vir:57 6 FWKGRPS--ENRV--NWQ-VVPGGMRSS----S----SQAGVII-TPETAL-ALSAVRACVTLLAESVAQLPCVLYRRTE 70 (419) T ss_pred hhccCCc--cccc--ccc-ccccccccc----c----ccCCcee-chHHhh-ccHHHHHHHHHHHHhhccCceEEEEEcC Confidence 1111100 0000 000 000000000 0 0001111 122333 4688999999999999999999843222 Q ss_pred CH--HH-HH-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccc Q lcl|NC_021302. 81 RP--EV-VE-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSS 155 (484) Q Consensus 81 ~~--e~-~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~ 155 (484) +. +. .+ .+...|.. +.....++.++++.+. +.+.+|-+.++|++... | .+..|.+++|.+ T Consensus 71 ~g~~~~~~~~~l~~lL~~------------~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~-G--~~~~L~pl~~~~ 135 (419) T protein:vir:57 71 NGGREIAFDHPLHDLIRY------------QPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGR-G--DITELIPINPHK 135 (419) T ss_pred CCceeccccchHHHHHhh------------ccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEEEEEEcCcc Confidence 21 11 11 12222211 1112335677777776 67789999999987543 3 367899999988 Q ss_pred eeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 156 IAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIE 235 (484) Q Consensus 156 ~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w 235 (484) +. ...+.+|.+.+.. ...+..+|....++.++.. .+.++|.|.+..+....-.-....++. T Consensus 136 v~-v~~~~~g~~~y~~-----------------~~~~~~~~~~~vih~r~~~-~d~~~G~s~i~~~~~~i~~~~~~~~~~ 196 (419) T protein:vir:57 136 VI-VLKGPDGMPYYDI-----------------PSIGEILPMRMVHHIKSFS-LDGYIGTSPIQTNPDVLGLGIAVEQHA 196 (419) T ss_pred eE-EEECCCceEEEEE-----------------cCCceEEchhhEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHHH Confidence 75 3344444432211 2233456777777766554 455899999999988877777777888 Q ss_pred HHHHHHhcCCcceEEecCC----CCCCHHHHHHHHHHHHHHhcCC---ceEEEccCCceEEEecccCCchhHHHHHHHHH Q lcl|NC_021302. 236 AAAIRRHGIGVPYLKGNEA----DSEDDDRMDELLEIASNYSGGE---SAGLALTAGEEAGILSPNGTPLDPRRAIEYHD 308 (484) Q Consensus 236 ~~f~Er~~~G~P~~~gk~~----~~~~~~~~~~l~~~l~~~~~g~---~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d 308 (484) ..+...- +.|-.+.+++ ...++++++++.+.+.+...|. ...++++.|++++-++.+.....|.+..++.. T Consensus 197 ~~~f~ng--~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~ 274 (419) T protein:vir:57 197 AQVFARG--TTMSGVIERPFEAKAIASQAAVDAILAKWTERYGGVRNAFSVGMLQEGMTYKQLSQDNEKAQLLQSRQYTV 274 (419) T ss_pred HHHHHcc--CCccEEEEecCcCCcccCHHHHHHHHHHHHHHhccccccccceecCCCceEEEcCCChhhHHHHHHHHHHH Confidence 8887753 6674444442 3446788888888887765542 23578899999888776655667889999999 Q ss_pred HHHHHHHhhhh-hcccccccchhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC--C-CC Q lcl|NC_021302. 309 HQMALVALAHF-LNLDGKGGSYALASVQADTF-VQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE--I-GS 383 (484) Q Consensus 309 ~~Isk~ilGqt-lt~~~~gGs~A~~evh~~v~-~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~--~-~~ 383 (484) ++|++++.-.. +....++++++-.+-+...+ ..-+.-.++.|+..||+.|+.+-- . ... +++|+. . .. T Consensus 275 ~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~ll~~~~----~-~~~--~i~fd~~~ll~~ 347 (419) T protein:vir:57 275 NEVCRLYKVPPHMIQDLQKSTNNNIEHQGLQYVIYTMLAILKRHESAMMRDLLLPSE----R-RDF--YIEFNVSSLLRG 347 (419) T ss_pred HHHHHHhCCCHHHhCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccc----c-CCe--EEEEechhhhcc Confidence 99999976654 22223345565544444433 555677888888888876665311 1 112 455543 2 35 Q ss_pred cHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCccccccc--CCCcCCCccccCCCCcccccccccccccccc Q lcl|NC_021302. 384 RQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDEST--ADTGQDEPETDEPALPNTSGTTSTTNAPQAR 461 (484) Q Consensus 384 ~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (484) |.+.++++++++++.|+.. .+++|+.+|+|.-++++....+. ........... +.|... ++.. T Consensus 348 d~~~~~~~~~~~~~~G~~T-----~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~--~~~~~~--------~~~~ 412 (419) T protein:vir:57 348 DQKSRYESYALGRQWGWLS-----VNDIRRMENLTPIPGGDKYLTPLNMVDSKALTGIGK--ATPQQL--------KDIE 412 (419) T ss_pred CHHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCcCeeeeccccccccccccccC--CCcccC--------cchh Confidence 7888999999999999764 47899999998665555443221 00000000000 000000 0000 Q ss_pred ccccccchHHHh Q lcl|NC_021302. 462 KRPRGRSPRDRR 473 (484) Q Consensus 462 ~~~~~~~~~~~~ 473 (484) + .....| T Consensus 413 ~-----~~~~~~ 419 (419) T protein:vir:57 413 A-----ILCTRN 419 (419) T ss_pred h-----hhhccC Confidence 0 000111 No 46 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=99.78 E-value=9.9e-18 Score=113.82 Aligned_cols=403 Identities=10% Similarity=0.021 Sum_probs=227.7 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) |+...-...--..+++...+........+.. ...+....+ ..+..+ +-+.|.+|+..+-..|.+++|.+...++ T Consensus 1 ~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~----~~~~~~~~v-~~~~a~-~~~~V~~ci~~ia~~ia~lp~~~~~~~~ 74 (409) T protein:vir:96 1 MAKENIVTRIKKKLIDNWIDQSASKLYDFSP----WKNKSFWGV-INNTLE-TNETIFSAITKLSNSMASLPLKMYEDYK 74 (409) T ss_pred CccccchhhhhhHHhhhhhcccccccccccc----ccCcccccc-chhhHh-hhHHHHHHHHHHHHhhhhCceEEeeccc Confidence 4443332221111111111100000000000 000000011 011233 4678999999999999999999854332 Q ss_pred CHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeee Q lcl|NC_021302. 81 RPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYW 159 (484) Q Consensus 81 ~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~ 159 (484) ..+ ..+.+.|.. +.....+..++++.++ +.+.+|-+..++++... | .+..|.+.+|.++.. T Consensus 75 ~~~--~~l~~lL~~------------~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~-G--~~~~L~~l~~~~v~v- 136 (409) T protein:vir:96 75 VVN--TEVSDLLTV------------SPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY-H--QPSKLFLLNPDVVEM- 136 (409) T ss_pred ccc--hhHHHHHhh------------hcccCCCHHHHHHHHHHHHhhcCceEEEEEECCC-C--cEEEEEEEcCceeEE- Confidence 111 112222211 1112234566766665 67889999999987543 3 367899999988763 Q ss_pred eecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 160 NVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAI 239 (484) Q Consensus 160 ~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~ 239 (484) ..+.+++.+.... ....+....+|....+++++....+..+|.|.+..+....-.-.....+ . | T Consensus 137 ~~~~~~~~~~y~~-------------~~~~g~~~~~~~~evih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~-~-~- 200 (409) T protein:vir:96 137 LIENQSRELYYSI-------------HAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTF-N-L- 200 (409) T ss_pred EEeCCCcEEEEEE-------------EcCCceEEEEccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHH-H-H- Confidence 4455554332221 1223344567888888887765666788999988876544333333333 2 2 Q ss_pred HHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhh Q lcl|NC_021302. 240 RRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHF 319 (484) Q Consensus 240 Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqt 319 (484) -.++. .|-.+.+.+...++++++++.+.+.+...+....++++.|++++-++.+.....|.+..++..++|++++.-.. T Consensus 201 ~~~~~-~~~~i~~~~~~l~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp 279 (409) T protein:vir:96 201 TEMQK-PDSFMLKYGSNVSTEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPS 279 (409) T ss_pred HhcCC-CceeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCH Confidence 22222 24455667778889999999999888776655678899999998887766666788888999999999976654 Q ss_pred hcc-cccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC---CCcHHHHHHHHHH Q lcl|NC_021302. 320 LNL-DGKGGSYALASVQAD-TFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI---GSRQDATAAALQM 394 (484) Q Consensus 320 lt~-~~~gGs~A~~evh~~-v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~---~~~~~~~ae~~~~ 394 (484) --. ...+++++..+-+.. ....-+.-.++.|++.||+.|++..- .. .. -+|+|+.. ..|.+..++++++ T Consensus 280 ~~lg~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~---~~-~g--~~i~fd~~~ll~~d~~~~~e~~~~ 353 (409) T protein:vir:96 280 IFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTD---RE-KN--RYFKFNVKSYLRADSATQAEVYFK 353 (409) T ss_pred HHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccc---cc-Cc--ceEEeechhhhccCHHHHHHHHHH Confidence 332 233456665554443 34555778888888888887766311 11 11 25666532 2478899999999 Q ss_pred HHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccccccccccc Q lcl|NC_021302. 395 LVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQA 460 (484) Q Consensus 395 L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (484) +++.|+.. .+++|+.+|+|.-+.++....+. +- .+. ... .....+.++...+..+. T Consensus 354 ~~~~G~~T-----~NE~R~~~g~~pi~ggD~~~~~~-n~-~~~-~~~--~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:96 354 AVRSGYYT-----INDIREWEDLPPVEGGDKPLISG-DL-YPI-DTP--LELRKSLKGGDKNVNES 409 (409) T ss_pred HHhCCCCC-----HHHHHHHhCCCCCCCcceeeecc-cc-ccc-ccc--hhhcccccCCCCCcCCC Confidence 99999764 48899999998665554433211 00 000 000 00000111111111111 No 47 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=99.78 E-value=1.2e-17 Score=113.45 Aligned_cols=387 Identities=10% Similarity=-0.026 Sum_probs=227.2 Q ss_pred CCCCC-CCccceeeeecccccchhhhhhhcccccccccccccchHHHH-HHHHhcchHHHHHHHHHHHHhhCCCcEEecC Q lcl|NC_021302. 1 MAPKT-VAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTY-TRMCREEARIASVLRAIGLPIRRTDWRIRPN 78 (484) Q Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y-~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~ 78 (484) ...+. ..|.++. ... .+. ..++.. .+..+- +..+ +-+.|.+|+..+-..|.+++++|... T Consensus 23 f~~~~~~~~~~~~---~~~-~~~---~~~~~~----------~~~~vs~~~al-~~~~v~~cv~~Ia~~iA~lp~~v~~~ 84 (424) T protein:vir:45 23 FRSKSLENPSTPI---TGD-AVD---TDGLFR----------ADVYVSPETAM-KLAAVYSCIYVLSSSLAQMPLHVMRR 84 (424) T ss_pred ccccCCCCCcccc---chh-hhh---hhcccc----------CCceechHHhh-ccHHHHHHHHHHHHHHhhCceEEEEe Confidence 11111 1111110 000 000 000000 011111 2333 46889999999999999999998432 Q ss_pred C--CCHHHHH-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCcc Q lcl|NC_021302. 79 G--ARPEVVE-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQS 154 (484) Q Consensus 79 ~--~~~e~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~ 154 (484) + ...++.+ -+.+.|.. +-....+..++.+.++ +.+.+|-++.++++... -.+..|.+.+|. T Consensus 85 ~~~~~~~~~~~~l~~lL~~------------~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~---G~~~~L~~l~~~ 149 (424) T protein:vir:45 85 HKGKVEPARDHPAFYLVHD------------EPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRR---GEVISLDCCMPW 149 (424) T ss_pred cCCceeecccchHHHHHHh------------hcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCC---CcEEEEEEecCc Confidence 2 1111111 11122210 1112234566777665 67889999999986543 235678888887 Q ss_pred ceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 155 SIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRI 234 (484) Q Consensus 155 ~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~ 234 (484) .+... ..++++.+.. ........++++..+++++.. .+..+|.|.+..+....-.-....++ T Consensus 150 ~v~i~--~~~~~~~y~~---------------~~~~~~~~~~~~eVih~r~~~-~d~~~G~spi~~~~~~i~~~~~~~~~ 211 (424) T protein:vir:45 150 ETTLM--NTGGRYTYGL---------------YNEYGAFAISPDDMIHIRALG-NNQKMGLSPIMQHAETIGMGMSGQKY 211 (424) T ss_pred eEEEE--EcCCeEEEEE---------------EecCceEEECcccEEEecCcC-CCCcccccHHHHHHHHHHHHHHHHHH Confidence 76432 2333333221 112234467888887777654 45689999999998887777777777 Q ss_pred HHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCC--c--eEEEccCCceEEEecccCCchhHHHHHHHHHHH Q lcl|NC_021302. 235 EAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGE--S--AGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQ 310 (484) Q Consensus 235 w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~--~--a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~ 310 (484) ...|... .+.|-.+.+.+...++++++++.+.+.+..+|. + ..++++.|++++-++.+.....|.+..++...+ T Consensus 212 ~~~~f~n--g~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~ 289 (424) T protein:vir:45 212 TESFFSG--NARPAGIVSVKSGLNKESWGWLKDQWQKASQALRRQENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSM 289 (424) T ss_pred HHHHHhc--cCCccEEEEeCCCCCHHHHHHHHHHHHHHhccccccCCceeEcCCCceEEEccCChhHHHHHHHHHHHHHH Confidence 7787774 367877777777888998999988887765542 2 467899999998887665555688888899999 Q ss_pred HHHHHhhhhhcc-cccccchhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC---CCcH Q lcl|NC_021302. 311 MALVALAHFLNL-DGKGGSYALASVQA-DTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI---GSRQ 385 (484) Q Consensus 311 Isk~ilGqtlt~-~~~gGs~A~~evh~-~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~---~~~~ 385 (484) |++++.-..--. ..++++++-.+-.. .....-+.-.++.|++.||+.|+..--.. .+. +|+|+.. ..|. T Consensus 290 Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~~~~ie~~ln~kLl~~~e~~----~g~--~i~fd~~~llr~d~ 363 (424) T protein:vir:45 290 IAGIFNIPAHMINDLEKATFSNISAQAIQFVRYTMMPWVTNWEQELNRRLFTRAELA----AGY--YVRFNLTGLLRGTP 363 (424) T ss_pred HHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhc----CCc--EEEeechhhhccCH Confidence 999976554322 23345555444333 34455677888889999988777642111 111 4666532 2477 Q ss_pred HHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccccc Q lcl|NC_021302. 386 DATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTST 454 (484) Q Consensus 386 ~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (484) ++.++.++++++.|+.. .+++|+.+|+|.-+++++...+. +...+ .......+...++... T Consensus 364 ~~r~~~~~~~~~~g~~T-----~NE~R~~~gl~pi~ggD~~~~~~-n~~~~--~~~~~~~~~~~~~~~~ 424 (424) T protein:vir:45 364 QERAQFYHFAITDGWMS-----RNEARAFEDMNPVEGLDEMLVSV-NAANP--AGDFKPPKNDEGKTNE 424 (424) T ss_pred HHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCcceeeecc-ccccc--ccccCCCCCCCCCCCC Confidence 88999999999999764 47899999998766655543321 11110 0000000000111000 No 48 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=99.78 E-value=1.3e-17 Score=113.24 Aligned_cols=403 Identities=11% Similarity=0.031 Sum_probs=231.1 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) |+|+--...-.-++.+......+.....+... ..+..-.+ ..+..+ +-+.|.+|+..+-..|.+++|.+...++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~v-~~~~~~-~~~~V~~ci~~Ia~~ia~lp~~~~~~~~ 74 (409) T protein:vir:93 1 MAKENIVTRIKKKLIDNWIDQSTSKLYDFSPW----KNRSFWGV-INNTLE-TNETIFSAITKLSNSMASLPLKMYEDYK 74 (409) T ss_pred CCccchhhhhhhhhhhhhhccccccccccccc----cCcccccc-chhhhh-ccHHHHHHHHHHHHhhhhCceeEeeccc Confidence 77765555443333332211111100000000 00100111 112343 4688999999999999999999864332 Q ss_pred CHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeee Q lcl|NC_021302. 81 RPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYW 159 (484) Q Consensus 81 ~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~ 159 (484) .. ...+...|.. +.....+..++++.++ +.+.+|-+..++++... | .+..|.+.+|.++.. T Consensus 75 ~~--~~~~~~lL~~------------~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~-G--~~~~L~~l~~~~v~~- 136 (409) T protein:vir:93 75 VV--NTEVSDLLTV------------SPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY-H--QPSKLFLLNPDVVEM- 136 (409) T ss_pred cc--cchHHHHHhh------------hcccCCCHHHHHHHHHHHHhhcCceEEEEEECCC-C--cEEEEEEEcCceeEE- Confidence 11 1112222211 1112334667777665 57889999999876432 2 367899999988763 Q ss_pred eecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 160 NVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAI 239 (484) Q Consensus 160 ~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~ 239 (484) ..+.+++.+.... ....+....+|.+..+++++....+..||.|.+..+....-......+ +. + T Consensus 137 ~~~~~~~~~~y~~-------------~~~~g~~~~~~~~eVih~r~~~~~~~~~G~s~i~~~~~~i~~~~~~~~-~~-~- 200 (409) T protein:vir:93 137 LIENQSRELYYSI-------------HAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRT-FN-L- 200 (409) T ss_pred EEeCCCcEEEEEE-------------EcCCceEEEEccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHH-HH-H- Confidence 4444444332211 112233456888888888776566678899988877655544433333 32 2 Q ss_pred HHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhh Q lcl|NC_021302. 240 RRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHF 319 (484) Q Consensus 240 Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqt 319 (484) ..++.+ |-.+.+.+...++++++++.+.+.+...+....++++.|++++-++.+.....|.+..++...+|++++.-.. T Consensus 201 ~~~~~~-~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp 279 (409) T protein:vir:93 201 TEMQKP-DSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPS 279 (409) T ss_pred HhcCCC-CceEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCH Confidence 222222 3445566777888889999988887666555677899999998887665556788888899999999976654 Q ss_pred hcc-cccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC---CCcHHHHHHHHHH Q lcl|NC_021302. 320 LNL-DGKGGSYALASVQAD-TFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI---GSRQDATAAALQM 394 (484) Q Consensus 320 lt~-~~~gGs~A~~evh~~-v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~---~~~~~~~ae~~~~ 394 (484) --. ...+++++-.+-... ....-+.-.++.|+..||+.|++..- .. .. -.|+|+.. ..|.++.++++++ T Consensus 280 ~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~---~~-~~--~~~~fd~~~ll~~d~~~~~~~~~~ 353 (409) T protein:vir:93 280 VFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTD---RE-KN--RYFKFNVKSYLRADSATQAEVYFK 353 (409) T ss_pred HHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccc---cc-Cc--ceEEeechhhhccCHHHHHHHHHH Confidence 333 233456665554443 44556788888899999887776311 11 11 25666532 2478889999999 Q ss_pred HHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccccccccccc Q lcl|NC_021302. 395 LVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQA 460 (484) Q Consensus 395 L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (484) +++.|+.. .+++|+.+|+|.-+.++....+. +- .+. ... ... ..+.+++..+..+. T Consensus 354 ~~~~G~~T-----~NE~R~~~g~~p~~ggD~~~~~~-n~-~~~-~~~-~~~-~~~~~gG~~n~~e~ 409 (409) T protein:vir:93 354 AVRSGYYT-----INDIREWEDLPPVEGGDKPLISG-DL-YPI-DTP-LEL-RKSLKGGDKNVNES 409 (409) T ss_pred HHhCCCcC-----HHHHHHHhCCCCCCCcCeeeecc-cc-ccc-ccc-hhh-cccccCCCCCcCCC Confidence 99999764 48899999998665554433211 00 000 000 000 00011111111111 No 49 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=99.78 E-value=1.7e-16 Score=107.08 Aligned_cols=443 Identities=11% Similarity=0.123 Sum_probs=227.7 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchH-HHHHHHHhcchHHHHHHHHHHHHhhC--------- Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSV-YTYTRMCREEARIASVLRAIGLPIRR--------- 70 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~-~~y~~m~~~D~~v~s~l~~r~~~v~~--------- 70 (484) -.+++... .+..+..|....-.. ..+. -..+..+.....+ .+.+.... -+.|.+|+..+...|.. T Consensus 44 ~~~~~~~~-~~~a~~~p~~~~~~~-~~~~--~~~p~~~~~~~~~~~~l~~~~~-npiv~~~I~~ia~~vA~~~~~~~~~~ 118 (576) T protein:vir:96 44 ELNKSLYG-KQQAYAEPFLEVMDT-NPEF--RTKRSYMKNSDNLHDVLKQFGN-NPILNAIILTRSNQVAMYCQPSRYNE 118 (576) T ss_pred hhccccCC-ccchhhcceeeeeec-CCCc--cccCcchhhhhhhHHHHHHhhc-CHHHHHHHHHHHHHHHhhhhhhhhcc Confidence 11111111 112222221100000 0000 0111111111111 11122222 36789999999988875 Q ss_pred --CCcEEecCCCCH-----HHHH--HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecC Q lcl|NC_021302. 71 --TDWRIRPNGARP-----EVVE--HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEG 140 (484) Q Consensus 71 --~~~~v~p~~~~~-----e~~~--~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~ 140 (484) ..|.|..-..+. +.++ .+...|...... ..-...+|.++++.++ +.+.+|.+.+|++|.+++ T Consensus 119 ~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~--------~~p~~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~ 190 (576) T protein:vir:96 119 RGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRD--------KDIDRDSFQSFCRKIVRDTYTYDQVNFEKVFNKKN 190 (576) T ss_pred ccccceeEEecCcCccchhhhHhhhhHHhhHhhccCC--------CCCccccHHHHHHHHHHHHHhcCCeEEEEEEecCC Confidence 466665432221 1111 111111110000 0011235778888876 578999999999987653 Q ss_pred CeeeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccC---ccccchh Q lcl|NC_021302. 141 GRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPG---VWTGNSL 217 (484) Q Consensus 141 g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~---~p~G~gl 217 (484) ...+..|.+++|.++. +..+.+|.+.... ........+.....++....|+|++....+ ..||.|. T Consensus 191 -~g~~~~L~pl~p~~V~-v~~~~dg~~~~~~---------~~~~~~~~~~~~~~~~~~dii~~~~~~~~d~~~~~~G~Sp 259 (576) T protein:vir:96 191 -ATTMDKFIAVDPSTIF-YATDKNGKIIKGG---------KRFVQVINKKVVASFTSREMAMGIRNPRTELSSSGYGLSE 259 (576) T ss_pred -CCceEEEEEeCCceeE-EEECCCCceeeee---------eEEEEecCCceEEEecccceEEEeecCCCCcccCcccccH Confidence 3446789999999885 4556666543211 011112223344567788888888776544 6789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEec--CCCCCCHHHHHHHHHHHHHHhcCC-ce---EEEccCCceEEEe Q lcl|NC_021302. 218 LRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGN--EADSEDDDRMDELLEIASNYSGGE-SA---GLALTAGEEAGIL 291 (484) Q Consensus 218 l~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk--~~~~~~~~~~~~l~~~l~~~~~g~-~a---~~vip~~~~ie~~ 291 (484) +..+....-.-....++-..|.... +.|-.+.+ .+...++++++++.+.+.+...|. ++ .++++.|++++-+ T Consensus 260 i~~a~~~i~~~~~~~~~~~~~f~Ng--~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~~~~l 337 (576) T protein:vir:96 260 VEIAMKQFIAYNNTETFNDRFFSHG--GTTRGILQIKSEQQQSQRALENFKREWKSSFSGINGSWQVPVVMADDIKFVNM 337 (576) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcc--CCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEec Confidence 9999988888888888888888853 56644433 333457888899999998876553 22 3678999998888 Q ss_pred cccCCchhHHHHHHHHHHHHHHHHhhhhhcc----------ccccc--chhhH-HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 292 SPNGTPLDPRRAIEYHDHQMALVALAHFLNL----------DGKGG--SYALA-SVQADTFVQSVQTVADEIRDVAQAHV 358 (484) Q Consensus 292 ~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~----------~~~gG--s~A~~-evh~~v~~~~~~aD~~~i~~~ln~ql 358 (484) +.+.....|.+..++..++|++++.-...-. ...+| +++-. +........-+.-.++.|+..||+.| T Consensus 338 s~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~L 417 (576) T protein:vir:96 338 TPTANDMQFEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDLINTHI 417 (576) T ss_pred cCChhhHHHHHHHHHhHHHHHHHhCCCHHHccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 7766667899999999999999964432111 11122 33322 33334555677888899999999988 Q ss_pred HHHHHHhCCCCccccceEEecCCCCcHHHHHHHHHH--HHhcCcccCCcccHHHHHHHhCCCCCCCCccccccc------ Q lcl|NC_021302. 359 VEDIVDVNWGEDEPAPLLVFDEIGSRQDATAAALQM--LVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDEST------ 430 (484) Q Consensus 359 i~~l~~~Nf~~~~~~P~~~~~~~~~~~~~~ae~~~~--L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~------ 430 (484) ++. |+. . -.|.|.. .+.+..++.... +...|+.. .+++|+.+|+|.-+.++....+. T Consensus 418 l~~-----~~~--~-~~~~f~r--~d~~~~~e~~~~~~~~~~G~lT-----~NE~R~~~gl~piegGD~~~~~~~~~~~~ 482 (576) T protein:vir:96 418 ISE-----YSD--K-YVFQFVG--GDTKSELDKIKILQEEVKTYKT-----VNEARKEKGLKPIEGGDVLLDGSFIQSMS 482 (576) T ss_pred chh-----ccC--c-eEEEecc--CCHHHHHHHHHHHHHHhcCccC-----HHHHHHHhCCCCCCCcceecccccccccc Confidence 763 221 1 2555643 344445554443 34457643 48899999998665554432110 Q ss_pred --CCCcCCCc--cccCCCCccccccccccccc--ccc-ccccccchHHH--hcCcccCc-ccCC Q lcl|NC_021302. 431 --ADTGQDEP--ETDEPALPNTSGTTSTTNAP--QAR-KRPRGRSPRDR--RKTPDGAM-PLWD 484 (484) Q Consensus 431 --~~~~~~~~--~~~~~~~~~~~~~~~~~~~~--~~~-~~~~~~~~~~~--~~~~~~~~-~~~~ 484 (484) ....+.+. ++..-..+.....+.....+ .+. ....+....+. ...+-+.. -||+ T Consensus 483 ~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~g~~~~~~~~~~~~~~~~~~~~~ 546 (576) T protein:vir:96 483 LNTQKEQYEDTKQKERFDMIQQFLNSPDDEEPQQESTEDKVDGRESNDPTKIDSPVGTDGQLKD 546 (576) T ss_pred ccccCCCCCCccccccccccccccCCCCCCCCCCCCCCCcccccccccCCCCCCccccccccCC Confidence 00000000 00000000000000000000 000 00011111110 00000111 3555 No 50 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=99.78 E-value=9.2e-18 Score=113.97 Aligned_cols=399 Identities=9% Similarity=-0.021 Sum_probs=229.0 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) .++...+...+ ..+..+.......+..... .+..+. -+..+ +-+.|.+|+..+-..|.+++++|.-.+. T Consensus 3 ~~~~~~~~~~~---~~~~~~~~~~~~~g~~~s~------~~~~v~-~~~al-~~~~v~~cv~~ia~~ia~lp~~~~~~~~ 71 (419) T protein:vir:80 3 FSRQLLSNLGQ---TQPGSGGWVSALLGSARSE------AGQVVT-PASAL-SLTVLQNCVTLLAESIAQLPVELYERSG 71 (419) T ss_pred cccccccccCc---CCCCcchhhHHhhcccccc------cCcccC-hHHhh-ccHHHHHHHHHHHHhhccCceEEEEecC Confidence 23322111111 1111110001111111100 011111 12344 4689999999999999999999854332 Q ss_pred CH-H-HHH-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccce Q lcl|NC_021302. 81 RP-E-VVE-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSI 156 (484) Q Consensus 81 ~~-e-~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~ 156 (484) +. + +.+ .+...|. .+-....+..++++.++ +.+.+|-+++++++... | .+..|.+++|.++ T Consensus 72 ~~~~~~~~~~l~~lL~------------~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~-G--~~~~L~~i~~~~v 136 (419) T protein:vir:80 72 DDRKPATDHPLYSILK------------YEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQD-G--VIQGLYPLDNEAV 136 (419) T ss_pred CCcccccccHHHHHHH------------hhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEEEEEecCceE Confidence 21 1 111 0111111 01112335677777776 67889999999986543 3 3788999999887 Q ss_pred eeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 157 AYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEA 236 (484) Q Consensus 157 ~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~ 236 (484) . ...+.++.+++... ....+|.+..++.++.+. +.++|.|.+..+....-.-....++.. T Consensus 137 ~-i~~~~~~~~~y~~~------------------~~~~~~~~~i~h~~~~~~-d~~~G~s~i~~~~~~i~~~~~~~~~~~ 196 (419) T protein:vir:80 137 T-VMKGPDLKPMYRVA------------------GADPLPQRLVHHVRWMSI-NGYTGLSPVLLHANAIGHAQAIQQYAG 196 (419) T ss_pred E-EEECCCceEEEEEc------------------CccccchhheEEecCCCC-CCcccccHHHHHHHHHHHHHHHHHHHH Confidence 5 34555555443221 112356666566555544 558999999999988777777788888 Q ss_pred HHHHHhcCCcceEEecCCC----CCCHHHHHHHHHHHHHHhcCCc---eEEEccCCceEEEecccCCchhHHHHHHHHHH Q lcl|NC_021302. 237 AAIRRHGIGVPYLKGNEAD----SEDDDRMDELLEIASNYSGGES---AGLALTAGEEAGILSPNGTPLDPRRAIEYHDH 309 (484) Q Consensus 237 ~f~Er~~~G~P~~~gk~~~----~~~~~~~~~l~~~l~~~~~g~~---a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~ 309 (484) .|.+.- +.|-.+.+++. ..++++.+++.+.+.+...|.. ..++++.|++++-++.+.....|.+..++..+ T Consensus 197 ~~f~ng--~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~ 274 (419) T protein:vir:80 197 KSFMNG--TALSGVIERPTDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSAL 274 (419) T ss_pred HHHhcC--CCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEeccCChhhHHHHHHHHHHHH Confidence 888853 66755545432 2356777888888887766532 35789999998877766555668888899999 Q ss_pred HHHHHHhhhh-hcccccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC---CCc Q lcl|NC_021302. 310 QMALVALAHF-LNLDGKGGSYALASVQAD-TFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI---GSR 384 (484) Q Consensus 310 ~Isk~ilGqt-lt~~~~gGs~A~~evh~~-v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~---~~~ 384 (484) +|++++.-.. +....++++++-.+-+.. ....-+.-.++.|+..||+.|+.+--.. . + .++|+.. ..| T Consensus 275 ~Ia~~fgVPp~llg~~~~~t~~n~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~~~~~-----~-~-~i~fd~~~l~~~d 347 (419) T protein:vir:80 275 DIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPWVKRHEQAKTRDLLLPSERK-----Q-Y-FIEYNLAGLLRGD 347 (419) T ss_pred HHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCccccC-----C-e-EEEEechhhhccC Confidence 9999976653 222334456655444443 3455578888999999998776541111 1 1 4566532 247 Q ss_pred HHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCccccccc--CCCcCCCccccCCCCccccccccccccccccc Q lcl|NC_021302. 385 QDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDEST--ADTGQDEPETDEPALPNTSGTTSTTNAPQARK 462 (484) Q Consensus 385 ~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 462 (484) .++.++++.++.+.|+.. .+++|+.+|+|.-+.++....+. .....+.+.+ .....+..++..+. + T Consensus 348 ~~~~~~~~~~~~~~G~~T-----~NE~R~~~g~~p~~gGD~~~~~~n~~~~~~~~~~~------~~~~~~~~~~~~~~-~ 415 (419) T protein:vir:80 348 QSSRYAAYAVGRQWGWLS-----INDIRRLENMPPVKGGDIYLSPMNMVDASKPQPIP------MGKTEPTKAALDEI-G 415 (419) T ss_pred HHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCcceeeecccccccccccccc------CCCCCchhhhHHHH-H Confidence 889999999999999865 47899999998766665443221 0111111100 00111111111111 1 Q ss_pred cccc Q lcl|NC_021302. 463 RPRG 466 (484) Q Consensus 463 ~~~~ 466 (484) +-.+ T Consensus 416 ~~l~ 419 (419) T protein:vir:80 416 RILS 419 (419) T ss_pred hhcC Confidence 1111 No 51 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=99.78 E-value=7e-18 Score=114.62 Aligned_cols=397 Identities=10% Similarity=-0.013 Sum_probs=230.9 Q ss_pred CCCccceeeeecccccchhhhhhhccc----cccccccc-------ccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCc Q lcl|NC_021302. 5 TVAPRTERGYVNPLAGFGTFLAQGLDQ----FEQVDELR-------WPNSVYTYTRMCREEARIASVLRAIGLPIRRTDW 73 (484) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~lr-------~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~ 73 (484) ---|+-+....+.. ++......++.. ......+. ...+..+-.+...+-+.|.+|+..+-..|.+++| T Consensus 1 ~~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~ 79 (424) T protein:vir:18 1 MEEPKYTIDLRTNN-GWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPL 79 (424) T ss_pred CCCCcceEeecCCC-chHHHHHhhhcccccccccccccccccccccccccccccHHHhhccHHHHHHHHHHHHhhccCce Confidence 11122222222221 111111111100 00000000 0011122223333578999999999999999999 Q ss_pred EEecCCCCH---HHH-H-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeee Q lcl|NC_021302. 74 RIRPNGARP---EVV-E-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKR 147 (484) Q Consensus 74 ~v~p~~~~~---e~~-~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~ 147 (484) .+--.+.+. ++. + -+...|.. +-....+..++++.++ +.+.+|-+.+++++... | .+.. T Consensus 80 ~~~~~~~~~~~~~~~~~~~l~~lL~~------------~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~-G--~~~~ 144 (424) T protein:vir:18 80 DVFETDQNDNRKKVDLSNPLARLLRY------------SPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-G--DVIS 144 (424) T ss_pred EEEEeecCCceeeeccccHHHHHHhh------------ccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEE Confidence 984322211 110 0 11122210 0111234566777765 67889999999987543 3 3678 Q ss_pred eeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHH Q lcl|NC_021302. 148 LAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKL 227 (484) Q Consensus 148 l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~ 227 (484) |.+.+|.++.... .++.+.+. +...+....++++..|++++.. .+..+|.|.+..+....-. T Consensus 145 L~pl~~~~V~v~~--~~~~~~y~---------------~~~~g~~~~~~~~eIih~r~~~-~dg~~G~spi~~~~~~i~~ 206 (424) T protein:vir:18 145 LLPLQSANMDVKL--VGKKVVYR---------------YQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGV 206 (424) T ss_pred EEEecCcceEEEE--cCCeEEEE---------------EEeCCeEEEeccccEEEecCcC-CCCcccccHHHHHHHHHHH Confidence 8899988775322 22333221 2223445678888888877654 4558999999999877777 Q ss_pred HHHHHHHHHHHHHHhcCCcceEEecCCCC-CCHHHHHHHHHHHHHHhcCCce--EEEccCCceEEEecccCCchhHHHHH Q lcl|NC_021302. 228 KDELIRIEAAAIRRHGIGVPYLKGNEADS-EDDDRMDELLEIASNYSGGESA--GLALTAGEEAGILSPNGTPLDPRRAI 304 (484) Q Consensus 228 K~~~~~~w~~f~Er~~~G~P~~~gk~~~~-~~~~~~~~l~~~l~~~~~g~~a--~~vip~~~~ie~~~~~~~~~~~~~li 304 (484) -....++-..|... .+.|-.+.+.+.. .++++++++.+.+.++.++.++ .++++.|++++-++.+.....|.+.. T Consensus 207 ~~a~~~~~~~~f~n--g~~p~gil~~~~~~l~~e~~~~~~~~~~~~~~g~nag~~~vl~~g~~~~~l~~~~~d~q~le~~ 284 (424) T protein:vir:18 207 AVAMEDQQRDFFAN--GAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASR 284 (424) T ss_pred HHHHHHHHHHHHHc--cCCcceEEEeCCcCCCHHHHHHHHHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHH Confidence 67777777788775 3567555566544 5788889999999888766544 57899999998888776666788889 Q ss_pred HHHHHHHHHHHhhhh-hcccccccch--h-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC Q lcl|NC_021302. 305 EYHDHQMALVALAHF-LNLDGKGGSY--A-LASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE 380 (484) Q Consensus 305 ~~~d~~Isk~ilGqt-lt~~~~gGs~--A-~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~ 380 (484) ++..++|++++.-.. +..+.+++++ + ..+........-+.-.++.|+..||+.|++.. .... .+|+|+. T Consensus 285 ~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~tl~P~~~~ie~~l~~~L~~~~-----~~~~--~~~~fd~ 357 (424) T protein:vir:18 285 KFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK-----DVGR--IHAEHNL 357 (424) T ss_pred HHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc-----ccCC--eEEEEec Confidence 999999999976654 3333333333 2 12223344566778888999999998777641 1112 2456643 Q ss_pred ---CCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccc Q lcl|NC_021302. 381 ---IGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGT 451 (484) Q Consensus 381 ---~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (484) ...|.++.++++.++.+.|+.. .+++|+.+|+|.-++++...... +- .+.........|...|. T Consensus 358 ~~llr~d~~~r~~~~~~~~~~G~~T-----~NE~R~~~gl~pi~gGD~~~~~~-n~-~~l~~~~~~~~p~~~ga 424 (424) T protein:vir:18 358 DGLLRGDSASRAAFMKAMGEAGLRT-----INEMRRTDNLPPLPGGDVAMRQS-QY-VPITDLGTNKEPRNNGA 424 (424) T ss_pred hhhhccCHHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCcCeeeecc-Cc-cchHhhhccCCCccCCC Confidence 2357888999999999999864 47899999998765554432211 00 00000000000111111 No 52 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=99.77 E-value=1.7e-17 Score=112.47 Aligned_cols=398 Identities=10% Similarity=0.038 Sum_probs=227.1 Q ss_pred CCCCCCCccceeeee----cccccchhhhhhhcccccccccccccchHHHH-HHHHhcchHHHHHHHHHHHHhhCCCcEE Q lcl|NC_021302. 1 MAPKTVAPRTERGYV----NPLAGFGTFLAQGLDQFEQVDELRWPNSVYTY-TRMCREEARIASVLRAIGLPIRRTDWRI 75 (484) Q Consensus 1 ~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y-~~m~~~D~~v~s~l~~r~~~v~~~~~~v 75 (484) ++-............ ....+ +...+..+... ....+. +..+ +-+.|.+|+..+-..|.+++|.+ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~---------~~~~v~~~~a~-~~~~v~~~i~~ia~~iA~lp~~~ 72 (412) T protein:vir:26 4 IAKENIVTRIKKKLIDNWIDQSTS-KLYDFSPWKNR---------SFWGVINNTLE-TNETIFSAITKLSNSMASLPLKM 72 (412) T ss_pred chhhhhhhhhhhhHhhhhhccccc-ccccccccCCc---------cccccchhhhh-ccHHHHHHHHHHHHhHhhCceeE Confidence 111000000000001 11111 00000010000 111112 2333 56899999999999999999998 Q ss_pred ecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCcc Q lcl|NC_021302. 76 RPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQS 154 (484) Q Consensus 76 ~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~ 154 (484) ....+.. ...+...|.. +.....+..++++.++ +.+.+|-+..+++.... | .+..|.+.+|. T Consensus 73 ~~~~~~~--~~~~~~lL~~------------~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~-G--~~~~L~~l~~~ 135 (412) T protein:vir:26 73 YEDYKVV--NTEVSDLLTV------------SPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY-H--QPSKLFLLNPD 135 (412) T ss_pred eeccccc--cchHHHHHHh------------hcccCCCHHHHHHHHHHHHhhcCceEEEEEECCC-C--cEEEEEEEcCc Confidence 5432211 1111222211 1112335677777665 67889999998875433 3 36788899998 Q ss_pred ceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 155 SIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRI 234 (484) Q Consensus 155 ~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~ 234 (484) ++.. ..+.+++.+..... ...+....++....+++++....+..||.|.+..+....-..... .. T Consensus 136 ~v~v-~~~~~~~~~~y~~~-------------~~~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~a~-~~ 200 (412) T protein:vir:26 136 VVEM-LIENQSRELYYSIH-------------AATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAV-RT 200 (412) T ss_pred eeEE-EEeCCCcEEEEEEE-------------cCCceEEEEccccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHH-HH Confidence 8753 44455443322211 112334567888888888766667789999988876554444433 33 Q ss_pred HHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHH Q lcl|NC_021302. 235 EAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALV 314 (484) Q Consensus 235 w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ 314 (484) |. + -..+. .|-.+.+.+...++++++++.+.+.+...+....++++.|++++-++.+.....|.+..++...+|+++ T Consensus 201 ~~-~-~~~~~-~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~a 277 (412) T protein:vir:26 201 FN-L-TEMQK-PDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANV 277 (412) T ss_pred HH-H-HhcCC-CCceEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHH Confidence 33 2 22222 244556677788889999999998887666556788999999988876655567888888899999999 Q ss_pred Hhhhhhccc-ccccchhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC---CCcHHHHH Q lcl|NC_021302. 315 ALAHFLNLD-GKGGSYALASVQADTF-VQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI---GSRQDATA 389 (484) Q Consensus 315 ilGqtlt~~-~~gGs~A~~evh~~v~-~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~---~~~~~~~a 389 (484) +.-...-.+ ..+++++..+-+...+ ..-+.-.++.|++.||+.|+... ... .. -.|+|+.. ..|.++.+ T Consensus 278 fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kLl~~~---~~~-~~--~~~~fd~~~l~~~d~~~~~ 351 (412) T protein:vir:26 278 FQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKT---DRE-KN--RYFKFNVKSYLRADSATQA 351 (412) T ss_pred hCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc---ccc-Cc--ceEEeechhhhccCHHHHH Confidence 766543332 2345666666555444 45578888999999998776632 111 11 25677532 24788999 Q ss_pred HHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccccccccccc Q lcl|NC_021302. 390 AALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQA 460 (484) Q Consensus 390 e~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (484) ++++++.+.|+.. .+++|+.+|+|.-+.++....+. +- .+. .. +.....+.++...+..+. T Consensus 352 ~~~~~~~~~G~~t-----~NE~R~~~gl~p~~ggD~~~~~~-n~-~~~-~~--~~~~~~~~~gG~~n~~e~ 412 (412) T protein:vir:26 352 EVYFKAVRSGYYT-----INDIREWEDLPPVEGGDKPLISG-DL-YPI-DT--PLELRKSLKGGDKNVNES 412 (412) T ss_pred HHHHHHHhCCCcC-----HHHHHHHhCCCCCCCcCeeeecc-cc-ccc-cc--chhhcccccCCCCCcCCC Confidence 9999999999864 48899999998766555433211 00 000 00 000000111111111111 No 53 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=99.77 E-value=2.4e-17 Score=111.74 Aligned_cols=394 Identities=11% Similarity=0.013 Sum_probs=225.5 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) ..+.+..+.+...... ++....... ..... ..+ +=+.|.+|+..+...|.++++.+.-.+. T Consensus 4 f~~~~~~~~~~~~~~~-----------~~~~~~~~~-----~~~~~--~Al-~~~~V~~~i~~Ia~~iA~lp~~~~~~~g 64 (406) T protein:vir:97 4 FQPLGTSKVSYDDYIS-----------SVLAGDVSQ-----KYLGV--SAL-KNSDILTATSIIAGDIARFPLVKKDVNG 64 (406) T ss_pred ccccCCCCCCcchHHH-----------HHhcCCCCc-----ccccc--hhh-ccHHHHHHHHHHHHhhhhCeeEEEecCc Confidence 3233222222110000 000000000 01111 123 3578999999999999999998764332 Q ss_pred CHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeee Q lcl|NC_021302. 81 RPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYW 159 (484) Q Consensus 81 ~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~ 159 (484) +......+...|.. +-....++.++++.++ +.+.+|-+.++++....+| .+..|.+++|..+. + T Consensus 65 ~~~~~~~~~~lL~~------------~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g--~~~~L~~i~p~~v~-v 129 (406) T protein:vir:97 65 DIIHDEDINYLLNV------------KSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTN--QALQFQFYRPSETT-V 129 (406) T ss_pred cccccchHHHHhhc------------cCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCC--eEEEEEEECCCeeE-E Confidence 21111112222210 1112345677777775 5788999999997654444 36789999998875 3 Q ss_pred eecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 160 NVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAI 239 (484) Q Consensus 160 ~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~ 239 (484) ..+.++++.+.-. ....+....++...+|++++.+. +..+|.|.+..+....-.-....++...|. T Consensus 130 ~~~~~~~~~y~~~-------------~~~~~~~~~~~~~evih~r~~~~-dg~~G~spi~~~~~~i~~~~a~~~~~~~~f 195 (406) T protein:vir:97 130 EETDNHEIVYTFT-------------DMLTAKQVKCFAHDVIHWKFFSH-DTILGRSPLLSLGDEIDLQTGGINTLIKFF 195 (406) T ss_pred EEcCCceEEEEEE-------------ecCCceEEEEccccEEEecCCCC-CCcccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 4455565543221 11233445678888888876543 347799999988877766777777777887 Q ss_pred HHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCce--EEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhh Q lcl|NC_021302. 240 RRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESA--GLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALA 317 (484) Q Consensus 240 Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a--~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilG 317 (484) +. | +.|-.+...+...++++++++.+.+.++..|.++ .++++.|++++-++.+.....|-+..++..++|++++.- T Consensus 196 ~n-g-~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgV 273 (406) T protein:vir:97 196 KD-G-FSSGILTMKGAQLSGDARQRARQEFEKMREGSVGGSPLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRV 273 (406) T ss_pred hc-c-CCCceEEecCCCCCHHHHHHHHHHHHHHhcccccCceeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCC Confidence 64 4 3465555666778899999999999998876554 467899999988876644445777778888999998654 Q ss_pred hhhccccc-ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCCCcHHHHHHHHHHHH Q lcl|NC_021302. 318 HFLNLDGK-GGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIGSRQDATAAALQMLV 396 (484) Q Consensus 318 qtlt~~~~-gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~~~~~~~ae~~~~L~ 396 (484) ..-..+.. .++ +..+........-+.-.++.|++.||+.|+..-.. .. -+++|+. ..+++..++.+.+++ T Consensus 274 Pp~~lg~~~~~~-~~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~~~~------~~-~~i~fd~-~~~~~~~~~~~~~~~ 344 (406) T protein:vir:97 274 PSYKLGVNSPNQ-SVAQLMEDYVTNDLPFYFDAITSELGLKTLNDKDR------RL-YHIEFDT-RSVTGRNVDEIVKLV 344 (406) T ss_pred CHHHcCCCCCcc-hHHHHHHHHHHHHHHHHHHHHHHHHhhhhcChhhc------cc-eeEEEec-CccchhhHHHHHHHH Confidence 43322211 122 22333334445567777888888888877543111 11 2466653 345666778888999 Q ss_pred hcCcccCCcccHHHHHHHhCCCCCCC--Ccccccc---cCCCcCCCccccCCCCccccccccccccccccc Q lcl|NC_021302. 397 NAGLLTPDPRLEAFLRDAAGLPGPDP--DADDDES---TADTGQDEPETDEPALPNTSGTTSTTNAPQARK 462 (484) Q Consensus 397 ~~G~~~~~~~~~~~i~e~~glp~p~~--~e~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 462 (484) +.|+.. .+++|+.+|+|.-.+ ++....+ .+....++++.. .+..+++....+.+..+ T Consensus 345 ~~g~~T-----~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~----~~~~~~gg~~~~~~~~~ 406 (406) T protein:vir:97 345 NNQILT-----PNQGLVELGKQKSTDPNMDRYQSSLNYVFLDKKEEYQDK----VGIKGKGGEVNAEEDKS 406 (406) T ss_pred hCCCcC-----HHHHHHHhCCCCCCCCCCCeEeeccCccchhcccccccc----cccccCCCCCCCCCCCC Confidence 999754 478999999986433 2222111 111111111111 11111222212111111 No 54 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=99.77 E-value=1.3e-17 Score=113.15 Aligned_cols=399 Identities=13% Similarity=0.079 Sum_probs=224.0 Q ss_pred CCC--CCCCccceeeeecccccchhhhhhhcccccccccccccchHHHH--HHHHhcchHHHHHHHHHHHHhhCCCcEEe Q lcl|NC_021302. 1 MAP--KTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTY--TRMCREEARIASVLRAIGLPIRRTDWRIR 76 (484) Q Consensus 1 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y--~~m~~~D~~v~s~l~~r~~~v~~~~~~v~ 76 (484) |.= +.-.+.++ .+..+. ..+...+ .-.++ .....| +..+ +-+.|.+|+..+-..|.+++|++. T Consensus 1 Mg~f~~~~~r~~~----~~~~~~-~~~~~~~------~~~~~-~~~~~~~~~~al-~~~~v~~cv~~Ia~~iA~~p~~~~ 67 (416) T protein:vir:81 1 MGIFYKNEKRDLQ----YNEDDL-QMMVQTL------PGFQG-TKLRQYKDIEAI-RHSDIFTAVMMIASDLARMPIRVT 67 (416) T ss_pred CCccccccccccc----CCCcch-hHHHHHh------ccccc-cCccccchhhhh-cchHHHHHHHHHHHhhccCceEEe Confidence 000 00000000 000000 0000000 00000 001111 1233 468899999999999999999997 Q ss_pred cCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccc Q lcl|NC_021302. 77 PNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSS 155 (484) Q Consensus 77 p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~ 155 (484) .++.... ...+...|.. +-....+..++++.+. +.+.+|.+.+++++... | .+..|.+++|.+ T Consensus 68 ~~~~~~~-~~~~~~lL~~------------~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~-G--~~~~L~~i~~~~ 131 (416) T protein:vir:81 68 VNGQINY-SDRIVNLLNT------------RPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT-G--EPMNLTFRKTSE 131 (416) T ss_pred cCccccc-cchHHHHHhc------------ccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEEEEEEcCce Confidence 6443211 1112222210 0111224566777765 46789999999886533 3 377899999988 Q ss_pred eeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 156 IAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIE 235 (484) Q Consensus 156 ~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w 235 (484) +. ...+.+|.+.+..+..... .......+|+..+|++++.+ .+..+|.|.+..+....-.-....++. T Consensus 132 v~-v~~~~~g~~~~~~~~~~~~----------~~~~~~~~~~~evihir~~~-~d~~~G~s~i~~~~~~i~~~~~~~~~~ 199 (416) T protein:vir:81 132 IE-LKSDARGRLYYFHQRIDSN----------GNNIERNVKFEDMLDIKFYS-LDGINGLSLLDTLSRTIESDNNGKDFL 199 (416) T ss_pred eE-EEECCCccEEEEEEEecCC----------CceeEEEEccccEEEeccCC-CCCccccCHHHHHHHHHHHHHHHHHHH Confidence 85 4556677765443321111 11122457888888887654 455899999999998887777888888 Q ss_pred HHHHHHhcCCcceEEecCCCCC-CHHHHHHHHHHHHHHhcCC-c--eEEEccCCceEEEecccCCchhHHHHHHHHHHHH Q lcl|NC_021302. 236 AAAIRRHGIGVPYLKGNEADSE-DDDRMDELLEIASNYSGGE-S--AGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQM 311 (484) Q Consensus 236 ~~f~Er~~~G~P~~~gk~~~~~-~~~~~~~l~~~l~~~~~g~-~--a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~I 311 (484) ..|.... +.|-.+.+++... ++++++++.+.+..+..|. + ..++++.|++++-++.+.....|.+..++..++| T Consensus 200 ~~~f~ng--~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~I 277 (416) T protein:vir:81 200 NNFLRNG--THAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREI 277 (416) T ss_pred HHHHhcc--CCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHH Confidence 8888853 6676666776554 4566778888787776552 2 2578999999888876655566888889999999 Q ss_pred HHHHhhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC--C-CCcHHHH Q lcl|NC_021302. 312 ALVALAHFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE--I-GSRQDAT 388 (484) Q Consensus 312 sk~ilGqtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~--~-~~~~~~~ 388 (484) ++++.-..--.+..+++.+..+. +..+..-+.-.++.|+..||+.|.+. + .. -+|+|+. . ..|.+.. T Consensus 278 a~~fgVPp~~lg~~~~~~~~~~~-~~~~~~~l~P~~~~ie~~ln~~l~~~-----~--~~--~~~~f~~~~l~~~D~~~~ 347 (416) T protein:vir:81 278 AGVFGIPLHKFGIETANMSITDA-NLDYLSTLKPYITCVCAELNFKFNDE-----Y--VN--REFKFDTTEIRVVDEKTQ 347 (416) T ss_pred HHHhCCCHHHcCCCCCCccHHHH-HHHHHHHHHHHHHHHHHHHhhhcccc-----c--cC--ceEEEechhhhccCHHHH Confidence 99976543222212222222221 12233456777888888888765432 1 11 1556643 2 2478889 Q ss_pred HHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccc-cccCC----CcCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 389 AAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDD-ESTAD----TGQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 389 ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~-~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) +++++++++.|+.. .+++|+++|+|.-++++... ....+ ...++.+.........+.++...+ + T Consensus 348 ~~~~~~~~~~G~~T-----~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~n--~ 416 (416) T protein:vir:81 348 AEIDKINIDSGKMN-----IDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEEN--E 416 (416) T ss_pred HHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCCCcceEeecccccccccccccCcccccccccccCCCCCC--C Confidence 99999999999864 47899999998655554321 11100 000111111111111111111111 1 No 55 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=99.77 E-value=1.3e-17 Score=113.15 Aligned_cols=399 Identities=13% Similarity=0.079 Sum_probs=224.0 Q ss_pred CCC--CCCCccceeeeecccccchhhhhhhcccccccccccccchHHHH--HHHHhcchHHHHHHHHHHHHhhCCCcEEe Q lcl|NC_021302. 1 MAP--KTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTY--TRMCREEARIASVLRAIGLPIRRTDWRIR 76 (484) Q Consensus 1 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y--~~m~~~D~~v~s~l~~r~~~v~~~~~~v~ 76 (484) |.= +.-.+.++ .+..+. ..+...+ .-.++ .....| +..+ +-+.|.+|+..+-..|.+++|++. T Consensus 1 Mg~f~~~~~r~~~----~~~~~~-~~~~~~~------~~~~~-~~~~~~~~~~al-~~~~v~~cv~~Ia~~iA~~p~~~~ 67 (416) T protein:vir:45 1 MGIFYKNEKRDLQ----YNEDDL-QMMVQTL------PGFQG-TKLRQYKDIEAI-RHSDIFTAVMMIASDLARMPIRVT 67 (416) T ss_pred CCccccccccccc----CCCcch-hHHHHHh------ccccc-cCccccchhhhh-cchHHHHHHHHHHHhhccCceEEe Confidence 000 00000000 000000 0000000 00000 001111 1233 468899999999999999999997 Q ss_pred cCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccc Q lcl|NC_021302. 77 PNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSS 155 (484) Q Consensus 77 p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~ 155 (484) .++.... ...+...|.. +-....+..++++.+. +.+.+|.+.+++++... | .+..|.+++|.+ T Consensus 68 ~~~~~~~-~~~~~~lL~~------------~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~-G--~~~~L~~i~~~~ 131 (416) T protein:vir:45 68 VNGQINY-SDRIVNLLNT------------RPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT-G--EPMNLTFRKTSE 131 (416) T ss_pred cCccccc-cchHHHHHhc------------ccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEEEEEEcCce Confidence 6443211 1112222210 0111224566777765 46789999999886533 3 377899999988 Q ss_pred eeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 156 IAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIE 235 (484) Q Consensus 156 ~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w 235 (484) +. ...+.+|.+.+..+..... .......+|+..+|++++.+ .+..+|.|.+..+....-.-....++. T Consensus 132 v~-v~~~~~g~~~~~~~~~~~~----------~~~~~~~~~~~evihir~~~-~d~~~G~s~i~~~~~~i~~~~~~~~~~ 199 (416) T protein:vir:45 132 IE-LKSDARGRLYYFHQRIDSN----------GNNIERNVKFEDMLDIKFYS-LDGINGLSLLDTLSRTIESDNNGKDFL 199 (416) T ss_pred eE-EEECCCccEEEEEEEecCC----------CceeEEEEccccEEEeccCC-CCCccccCHHHHHHHHHHHHHHHHHHH Confidence 85 4556677765443321111 11122457888888887654 455899999999998887777888888 Q ss_pred HHHHHHhcCCcceEEecCCCCC-CHHHHHHHHHHHHHHhcCC-c--eEEEccCCceEEEecccCCchhHHHHHHHHHHHH Q lcl|NC_021302. 236 AAAIRRHGIGVPYLKGNEADSE-DDDRMDELLEIASNYSGGE-S--AGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQM 311 (484) Q Consensus 236 ~~f~Er~~~G~P~~~gk~~~~~-~~~~~~~l~~~l~~~~~g~-~--a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~I 311 (484) ..|.... +.|-.+.+++... ++++++++.+.+..+..|. + ..++++.|++++-++.+.....|.+..++..++| T Consensus 200 ~~~f~ng--~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~I 277 (416) T protein:vir:45 200 NNFLRNG--THAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREI 277 (416) T ss_pred HHHHhcc--CCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHH Confidence 8888853 6676666776554 4566778888787776552 2 2578999999888876655566888889999999 Q ss_pred HHHHhhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC--C-CCcHHHH Q lcl|NC_021302. 312 ALVALAHFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE--I-GSRQDAT 388 (484) Q Consensus 312 sk~ilGqtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~--~-~~~~~~~ 388 (484) ++++.-..--.+..+++.+..+. +..+..-+.-.++.|+..||+.|.+. + .. -+|+|+. . ..|.+.. T Consensus 278 a~~fgVPp~~lg~~~~~~~~~~~-~~~~~~~l~P~~~~ie~~ln~~l~~~-----~--~~--~~~~f~~~~l~~~D~~~~ 347 (416) T protein:vir:45 278 AGVFGIPLHKFGIETANMSITDA-NLDYLSTLKPYITCVCAELNFKFNDE-----Y--VN--REFKFDTTEIRVVDEKTQ 347 (416) T ss_pred HHHhCCCHHHcCCCCCCccHHHH-HHHHHHHHHHHHHHHHHHHhhhcccc-----c--cC--ceEEEechhhhccCHHHH Confidence 99976543222212222222221 12233456777888888888765432 1 11 1556643 2 2478889 Q ss_pred HHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccc-cccCC----CcCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 389 AAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDD-ESTAD----TGQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 389 ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~-~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) +++++++++.|+.. .+++|+++|+|.-++++... ....+ ...++.+.........+.++...+ + T Consensus 348 ~~~~~~~~~~G~~T-----~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~n--~ 416 (416) T protein:vir:45 348 AEIDKINIDSGKMN-----IDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEEN--E 416 (416) T ss_pred HHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCCCcceEeecccccccccccccCcccccccccccCCCCCC--C Confidence 99999999999864 47899999998655554321 11100 000111111111111111111111 1 No 56 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=99.77 E-value=5.4e-17 Score=109.75 Aligned_cols=422 Identities=11% Similarity=0.032 Sum_probs=233.9 Q ss_pred CCCC---CCCccceeeeecccccchhhhhhhcccccccccccccch-HHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEe Q lcl|NC_021302. 1 MAPK---TVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNS-VYTYTRMCREEARIASVLRAIGLPIRRTDWRIR 76 (484) Q Consensus 1 ~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~-~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~ 76 (484) ||-- .-...++ ...+....|..++.+.-.. .+.. -.+=.+..++-+.|.+|+......|.+++|.|. T Consensus 1 ~~~~~~~~~~~~~~-----~~~~~~~~~~~~~g~~~~~----~~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~lp~~v~ 71 (460) T protein:vir:10 1 MANRIIRALRELTG-----LDNKFNDAFIKYIGQTFTK----YDNNGKTYLEQGYNINPDVYSCISQMAAKTVAVPYTIK 71 (460) T ss_pred CchhHHHHHhhhhc-----cCCCchHHHHHhhccccCC----CccchhhhhHHHHhcchHHHHHHHHHHHhhhhCceEEE Confidence 1100 0000000 0000011121111111000 1111 112223334579999999999999999999996 Q ss_pred cCCCCHHHHH-----HHHHHHHh--------hhc-cchhhhhHHHhh----cCCCHHHHHHHHH-HHHhhcceeeeEEEe Q lcl|NC_021302. 77 PNGARPEVVE-----HVAACLGL--------PVE-GDESDKPTPRTR----GRFSWDQHLRLAL-KSLQFGHAVFEQTYF 137 (484) Q Consensus 77 p~~~~~e~~~-----~~~~~l~~--------~~~-~~~~~~~~~~~~----~~~~~~~~i~~~l-~a~~~G~s~~Eivw~ 137 (484) ....+....+ ....++.. .+. ..........++ ...+..++++.++ +.+.+|-+..++++. T Consensus 72 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~ 151 (460) T protein:vir:10 72 VVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSP 151 (460) T ss_pred eccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEec Confidence 4333321100 00000000 000 000111111222 2335778888887 788999999999876 Q ss_pred ecC-CeeeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCcc-----Cc Q lcl|NC_021302. 138 YEG-GRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDP-----GV 211 (484) Q Consensus 138 ~~~-g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~-----~~ 211 (484) ..+ +.-.+..|.+++|.++. ...+.++..+.... ....+....++....++++..|++++.... +. T Consensus 152 ~~~~~~G~~~~L~~l~~~~v~-v~~~~~~~~~~~~~-------~~~~~~~~~~g~~~~~~~~evih~r~~~~~~~~~~~~ 223 (460) T protein:vir:10 152 DDGINAGVPSQMYVLPAHLIK-IVLKDDINLLSTDS-------PIKSYMLIQGDQFIEFNEDEVIHTKYANPNFDLQGSH 223 (460) T ss_pred CCCccCceeEEEEEEcCceEE-EEEcCCCceeeeee-------eeeEEEEecCceeEEecccceEEEecCCCCcccccCc Confidence 542 33457889999999886 44555555443221 112223334556678999999888865443 45 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCC-c--eEEEccCCceE Q lcl|NC_021302. 212 WTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGE-S--AGLALTAGEEA 288 (484) Q Consensus 212 p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~-~--a~~vip~~~~i 288 (484) .+|.|.+..+......-....++-..|... | +.|-.+.+.+...++++++++.+.+.++..|. + ..++++.|+++ T Consensus 224 ~~G~sp~~~~~~~i~~~~~~~~~~~~~f~n-g-~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~ 301 (460) T protein:vir:10 224 LYGMSPIRAILRNINSQNSTIDNNVKTMQN-G-GVFGFIHGGSTGLTQPQADSLKQRLTEMDKSPDRLSQIAGASGEIAF 301 (460) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhc-C-CCcceeeecCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceE Confidence 689999999988888888888888888875 3 56766777778889999999999999886553 2 35788999988 Q ss_pred EEecccCCchhHHHHHHHHHHHHHHHHhhhh-hccccccc--chhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 289 GILSPNGTPLDPRRAIEYHDHQMALVALAHF-LNLDGKGG--SYALASVQA-DTFVQSVQTVADEIRDVAQAHVVEDIVD 364 (484) Q Consensus 289 e~~~~~~~~~~~~~li~~~d~~Isk~ilGqt-lt~~~~gG--s~A~~evh~-~v~~~~~~aD~~~i~~~ln~qli~~l~~ 364 (484) +-++.+.....|.+..++..++|++++.-.. +....++| +++-.+-+. .....-+.-.++.|++.||+.|++..- T Consensus 302 ~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~- 380 (460) T protein:vir:10 302 TKISLNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGGGLNTGNLEEERKRVVTDNIQPDLVILKQAFDKKFIKRFK- 380 (460) T ss_pred EEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccc- Confidence 8887765566688888999999999975543 22222232 344444333 444556788899999999998876421 Q ss_pred hCCCCccccceEEecCCCC-cHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCC-CCCcccccccCCCcCCCccccC Q lcl|NC_021302. 365 VNWGEDEPAPLLVFDEIGS-RQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGP-DPDADDDESTADTGQDEPETDE 442 (484) Q Consensus 365 ~Nf~~~~~~P~~~~~~~~~-~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p-~~~e~~~~~~~~~~~~~~~~~~ 442 (484) .... ..++|+.... .+..-.+....+.+.|+.. .+++|+.+|+|.- +++-+..-.+.+-......... T Consensus 381 ----~~~~-~~i~~d~~~l~~l~~d~~~~~~~~~~g~~T-----~NE~R~~~g~~pi~~~~gD~~~~~~n~~~~~~~~~~ 450 (460) T protein:vir:10 381 ----GYEN-AVIEWDISELPEMQTDMVAMASWLNTIPVT-----PNEIRIAMKYETLNQDGMDIVFMPSNKVRIDDVSNN 450 (460) T ss_pred ----ccCC-ceEEeecchhhhHHHHHHHHHHHHhCCCCC-----HHHHHHHhCCCCCCCCCCCeeeecccccchhhcccc Confidence 1111 2455543221 1222223334566788754 5889999999864 2322222111111100000000 Q ss_pred CCCcccccccccccccc Q lcl|NC_021302. 443 PALPNTSGTTSTTNAPQ 459 (484) Q Consensus 443 ~~~~~~~~~~~~~~~~~ 459 (484) ......... + T Consensus 451 ~~~~~~nq~-------~ 460 (460) T protein:vir:10 451 LIDSAFNQN-------Q 460 (460) T ss_pred cCCCcccCC-------C Confidence 000000000 0 No 57 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=99.77 E-value=1e-16 Score=108.24 Aligned_cols=431 Identities=13% Similarity=0.147 Sum_probs=218.0 Q ss_pred CCCCCCCccc-eeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhC--------- Q lcl|NC_021302. 1 MAPKTVAPRT-ERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRR--------- 70 (484) Q Consensus 1 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~--------- 70 (484) +.-+..+=++ .++...-..+++.. +..+.+..++-.-+....-+.|.+|+..+...|.+ T Consensus 43 ~~~~~~~~~~~~~~~~~~~~g~~~~-----------~~~~~~~~l~~l~~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~ 111 (547) T protein:vir:63 43 MNNKEVAYSQPVIGSMSANPGFKTK-----------PSIRNNQDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSE 111 (547) T ss_pred hcccchhhhchhhheeecccccccC-----------CccCChhHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhc Confidence 1111111000 11111111111110 00111111111111212247899999999988874 Q ss_pred --CCcEEecCCC-------CHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecC Q lcl|NC_021302. 71 --TDWRIRPNGA-------RPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEG 140 (484) Q Consensus 71 --~~~~v~p~~~-------~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~ 140 (484) ..|.|.+... +.+..+.+.+.|..+...-. -.+..|.++++.++ +.+.+|.+++|+++... T Consensus 112 ~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~--------p~~~s~~~f~~~lv~d~ll~Gn~~~~i~rd~~- 182 (547) T protein:vir:63 112 KGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDND--------INRDSFSSFVKKIVRDTYMYDQVNFEKVFNRN- 182 (547) T ss_pred cCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCC--------CccchHHHHHHHHHHHHHhhCCEEEEEEECCC- Confidence 3455543221 11222333344332211000 01125777888876 67899999999998644 Q ss_pred CeeeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccC---ccccchh Q lcl|NC_021302. 141 GRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPG---VWTGNSL 217 (484) Q Consensus 141 g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~---~p~G~gl 217 (484) | .+..|.+++|.++. +..+.+|.+.. ..........+.....++....|++++....+ .+||.|. T Consensus 183 G--~~~~L~~l~p~~V~-~~~~~~g~~~~---------~~~~y~~~~~~~~~~~~~~~eiih~r~n~~~~~~~~~~G~Sp 250 (547) T protein:vir:63 183 Q--SMVRFVAKDPTTIF-FATTADGKIPD---------NGNRFVQVIDQKIVATFNAREMAFAVRNPRSDIYATGYGYPE 250 (547) T ss_pred C--cEEEEEEecCceeE-EEECCcccccc---------CceEEEEEcCCcEEEEeccccEEEecccCCCCcccccccccH Confidence 3 37789999999885 44555554310 00011122233344567888888887765433 5789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCcce--EEecCCCCCCHHHHHHHHHHHHHHhcCC-ceE--EEc-cCCceEEEe Q lcl|NC_021302. 218 LRPAYKNWKLKDELIRIEAAAIRRHGIGVPY--LKGNEADSEDDDRMDELLEIASNYSGGE-SAG--LAL-TAGEEAGIL 291 (484) Q Consensus 218 l~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~--~~gk~~~~~~~~~~~~l~~~l~~~~~g~-~a~--~vi-p~~~~ie~~ 291 (484) +..+......-....++-..|.+.. +.|- +..+.+...++++++.+.+.+.+...|. +++ .++ ..|++++-+ T Consensus 251 i~~~~~~i~~~~~a~~~~~~~f~Ng--~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~G~~nagk~~vl~~~g~~~~~l 328 (547) T protein:vir:63 251 LEIALKQFIAHENTEAFNDRFFSHG--GTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVSAEDVKFVNM 328 (547) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcC--CCcceEEEecCCCCCCHHHHHHHHHHHHHHhcCcccccccccccCCCceEEEc Confidence 9999988888888888888898853 4563 3334445578888999999888765553 333 233 456676666 Q ss_pred cccCCchhHHHHHHHHHHHHHHHHhhhhhcc---------cccccc--hhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 292 SPNGTPLDPRRAIEYHDHQMALVALAHFLNL---------DGKGGS--YALASVQ-ADTFVQSVQTVADEIRDVAQAHVV 359 (484) Q Consensus 292 ~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~---------~~~gGs--~A~~evh-~~v~~~~~~aD~~~i~~~ln~qli 359 (484) +.+.....|.+..++..++|++++.-...-. .+.++| ++-.+.. ......-+.-.++.|+..||+.|+ T Consensus 329 ~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~~~ie~~ln~~L~ 408 (547) T protein:vir:63 329 TPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIV 408 (547) T ss_pred CCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 6554555688889999999999965432111 111222 3322322 234566778889999999999877 Q ss_pred HHHHHhCCCCccccceEEecCCC-CcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCC-CCCccccccc------- Q lcl|NC_021302. 360 EDIVDVNWGEDEPAPLLVFDEIG-SRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGP-DPDADDDEST------- 430 (484) Q Consensus 360 ~~l~~~Nf~~~~~~P~~~~~~~~-~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p-~~~e~~~~~~------- 430 (484) +. |+. . -+|.|+... .+....++ +.++...|+. ..+++|+.+|+|.. +.++....+. T Consensus 409 ~~-----~~~--~-~~~~f~~~~~~~~~~~~~-~~~~~~~g~l-----T~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~ 474 (547) T protein:vir:63 409 AE-----FGD--K-YTFQFVGGDIKSELESVK-ILAEKAKVAM-----TVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQ 474 (547) T ss_pred cc-----cCC--c-eEEEeeccccccHHHHHH-HHHHHhCCCc-----CHHHHHHHhCCCCCCCCCceeecccccccccc Confidence 53 221 2 267776443 33444444 3456667764 35899999999753 3333222110 Q ss_pred -CCCcCCCcc---ccCCCCccccccc---------------cccccccccccccccchHHHhcCcccCcccC--C Q lcl|NC_021302. 431 -ADTGQDEPE---TDEPALPNTSGTT---------------STTNAPQARKRPRGRSPRDRRKTPDGAMPLW--D 484 (484) Q Consensus 431 -~~~~~~~~~---~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~ 484 (484) .+...++.+ .......+..++. ......+..+.... .|...++-.-.. | T Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~-----~~~~~~~~~~~~~~~ 544 (547) T protein:vir:63 475 LMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQRKDKDN-----ANAGKQGMKGDKPND 544 (547) T ss_pred cccccCCccccchhhccccccccCCCCCCCCCCCCCCcccCCCcCccccccCccc-----cchhhhhcCCCCccc Confidence 000000000 0000000000000 00000011111111 111111111100 0 No 58 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=99.76 E-value=2.8e-17 Score=111.36 Aligned_cols=415 Identities=14% Similarity=0.046 Sum_probs=215.0 Q ss_pred HHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC--CHHHHHHHHHHHHhhhccchhhh-hHHHhhcCCCHHHHHHHHH- Q lcl|NC_021302. 47 YTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA--RPEVVEHVAACLGLPVEGDESDK-PTPRTRGRFSWDQHLRLAL- 122 (484) Q Consensus 47 y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~--~~e~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~i~~~l- 122 (484) -++|.+..+.|.+|+..+...|.+++|.|.+... .........+.+...+....... ..........|.++++.++ T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 3467777899999999999999999999975321 11111111111111111111100 0000001123566776654 Q ss_pred HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecCC-------Cceeeeeccccc----ccc----cccceecc Q lcl|NC_021302. 123 KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRD-------GGLISIQQWPAG----TFG----GPGMVVMA 187 (484) Q Consensus 123 ~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~d-------g~l~~~~q~~~~----~~~----~~~~~~~~ 187 (484) +.+.+|.+.+|+++...+ .+..|.++++.++.. ..+.. +........... ..+ ........ T Consensus 81 ~l~l~Gn~~i~~~r~~~G---~~~~l~~l~~~~v~~-~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTDG---TPTGLAYVPGHTIRK-RMDERGFVQLLEEKEKYFGVAGDRYQTNGNGDLDPVFVDADDG 156 (467) T ss_pred HHHhcCCeEEEEEECCCC---cEEEEEEeCCceeEe-eeecceeEeecCCceeeEEeccccceeecccceeeeeeeeccc Confidence 688899999999975443 366788888877642 11111 111111110000 000 00001112 Q ss_pred CCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEec-CCCCCCHHHHHHHH Q lcl|NC_021302. 188 PNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGN-EADSEDDDRMDELL 266 (484) Q Consensus 188 ~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk-~~~~~~~~~~~~l~ 266 (484) ..+....+|....|+++.....+..||.+.+..+......-.....+-..|... .+.|-.+.. .+...++++++++. T Consensus 157 ~~~~~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~n--g~~p~gil~~~~~~l~~e~~~~~~ 234 (467) T protein:vir:31 157 STGTSVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFEN--DGVPRIAIIVKGAELTEKGREEMR 234 (467) T ss_pred cccceeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhc--cCCCceEEEecCcCCCHHHHHHHH Confidence 344556788889888887666777899999998887766655666666666663 366754444 34567888888888 Q ss_pred HHHHHHhcCC--------------ceEEEccCCce-----EEEeccc---CCchhHHHHHHHHHHHHHHHHhhhh-hccc Q lcl|NC_021302. 267 EIASNYSGGE--------------SAGLALTAGEE-----AGILSPN---GTPLDPRRAIEYHDHQMALVALAHF-LNLD 323 (484) Q Consensus 267 ~~l~~~~~g~--------------~a~~vip~~~~-----ie~~~~~---~~~~~~~~li~~~d~~Isk~ilGqt-lt~~ 323 (484) +.+.+...+. ...++++.|.+ +++...+ .....|.+..++..++|+++..-.. +... T Consensus 235 ~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~ 314 (467) T protein:vir:31 235 NLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVHDVPPVIAGV 314 (467) T ss_pred HHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHHHHHHHHhCCCHHHccc Confidence 8887654321 12345565554 3443322 1234588899999999999865443 3322 Q ss_pred ccccchh-hH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCC-CcHHHHHHHHHHHHhcCc Q lcl|NC_021302. 324 GKGGSYA-LA-SVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIG-SRQDATAAALQMLVNAGL 400 (484) Q Consensus 324 ~~gGs~A-~~-evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~-~~~~~~ae~~~~L~~~G~ 400 (484) .++++++ -. +.........+.-.++.|++.||+.|++..... ...+.+|.+.... .+.+..+++++.+++.|+ T Consensus 315 ~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~----~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~ 390 (467) T protein:vir:31 315 VESGAFSTDAEEQRKEFAEETIQPKQHDFGELLYELVHKQGLDA----PDWTIEFELAKPDTKLQDVEIASQRVQAMQGL 390 (467) T ss_pred CCCCCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhcc----CCceEEEecchhhccCHHHHHHHHHHHHhCCC Confidence 3334442 12 222333456678888999999998777643222 1122234333332 577889999999999998 Q ss_pred ccCCcccHHHHHHHhCCCCCCCCccccc-cc---CCCcCCCccccCCCCccccccccccccccccccccccchHHHhcCc Q lcl|NC_021302. 401 LTPDPRLEAFLRDAAGLPGPDPDADDDE-ST---ADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSPRDRRKTP 476 (484) Q Consensus 401 ~~~~~~~~~~i~e~~glp~p~~~e~~~~-~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 476 (484) .. .+++|+.+|+|.-.++ .... .+ ...++..+.......+...++..........++.. ....... T Consensus 391 ~T-----~NE~R~~~Gl~pi~d~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~ 460 (467) T protein:vir:31 391 LT-----VNELRDEFGFEPFPEE-HVYGGETLVAEVTGGSGPGGGIGDQIEQLVEDRADEIIDSYQADL----ETEQLIE 460 (467) T ss_pred cC-----HHHHHHHhCCCCCCcc-cccCCcccccccccccCCCCcccCcCCCCCCCcccchHhhhhhcc----ccchhhh Confidence 64 4789999999754322 2111 00 00001111111000000000000000000000000 0000001 Q ss_pred --ccCcc Q lcl|NC_021302. 477 --DGAMP 481 (484) Q Consensus 477 --~~~~~ 481 (484) +-+.| T Consensus 461 ~~~~~~~ 467 (467) T protein:vir:31 461 IGANADS 467 (467) T ss_pred hccccCC Confidence 11112 No 59 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=99.76 E-value=2.2e-17 Score=111.87 Aligned_cols=373 Identities=10% Similarity=0.026 Sum_probs=219.9 Q ss_pred CCCCCCCccceee-eecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCC Q lcl|NC_021302. 1 MAPKTVAPRTERG-YVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNG 79 (484) Q Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~ 79 (484) +..+...+..... ..+..... ..... ..+..++.+...+-+.|.+|+..+...|.++++.+.... T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~---~~~~~-----------~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~~~~~~ 72 (386) T protein:vir:49 7 TNLATESPPINQESFFDIADSD---FLASL-----------NSSEWVSAENALKNSDLFSIISQLSNDLATAKITTSRKQ 72 (386) T ss_pred hccCCCCcccchhhhhhhhhcc---ccccc-----------cCCceechhhhhccHHHHHHHHHHHHHhhhCceeeccch Confidence 3332222221111 11111000 00000 001112222233578999999999999999999986321 Q ss_pred CCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceee Q lcl|NC_021302. 80 ARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAY 158 (484) Q Consensus 80 ~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~ 158 (484) .+ .| ..+.....+..++++.++ +.+.+|-+++++++... | .+..|.+++|.++.. T Consensus 73 ~~---------~l------------~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~-g--~~~~l~~i~~~~v~v 128 (386) T protein:vir:49 73 LQ---------GI------------VDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDN-G--RDMKWEYLRPSQVSF 128 (386) T ss_pred hh---------hh------------hhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCC-C--cEEEEEEecCceeEE Confidence 11 11 111122335677888887 56779999999987543 2 467899999988753 Q ss_pred eeecCCCcee-eeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 159 WNVDRDGGLI-SIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAA 237 (484) Q Consensus 159 ~~~~~dg~l~-~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~ 237 (484) ..+.+++.+ +.... .....+....+|...++++++....+..+|.|.+..+....-.-....++... T Consensus 129 -~~~~~~~~~~y~~~~-----------~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~ 196 (386) T protein:vir:49 129 -NRLDNQNGLYYNITF-----------DDPHIAPKQHVPQNDILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTIS 196 (386) T ss_pred -EEcCCCceEEEEEEE-----------cCccccceeEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHH Confidence 344444432 21111 01123344568888888888766667789999999999888888888888888 Q ss_pred HHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhh Q lcl|NC_021302. 238 AIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALA 317 (484) Q Consensus 238 f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilG 317 (484) +... .++|-.+.+.+...++++...+.+...++..+....++++.|++++-++.+.....|.+..++...+|+++..- T Consensus 197 ~~~n--g~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgV 274 (386) T protein:vir:49 197 ALKN--ALNANGILKIKGGGLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGI 274 (386) T ss_pred HHHc--cCCccEEEEeCCCCChHHHHHHHHHHHHhccCCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCC Confidence 8875 36787777887778888888888888777666556788999999988876666667888899999999999655 Q ss_pred hhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC---CCCcHHHHHHHHHH Q lcl|NC_021302. 318 HFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE---IGSRQDATAAALQM 394 (484) Q Consensus 318 qtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~---~~~~~~~~ae~~~~ 394 (484) ..--.+..+.+++.++.........+.--++.++..||+.|.. +++|+. ...+...++..+.+ T Consensus 275 Pp~~lg~~~~~~~~~~~~~~~~~~~i~~~l~~i~~~~~~~l~~--------------~~~~~~~~~~~~d~~~~~~~~~~ 340 (386) T protein:vir:49 275 PESIVGGDGDQQSSLEMIYNIYFKSVSRYLRPFVSEMSKKLSC--------------EVDVDISPAVDPTGSNYISLINS 340 (386) T ss_pred CHHHhCCCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--------------hhcccchhhhccCHHHHHHHHHH Confidence 4332222333444444334444455555666666666665432 233332 22456778889999 Q ss_pred HHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccc-cccccccccc Q lcl|NC_021302. 395 LVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTS-GTTSTTNAPQ 459 (484) Q Consensus 395 L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 459 (484) |...|+.. .+++|+.++-..-...+ + +....+..+..+ |+. .... T Consensus 341 l~~~g~~t-----~nE~r~~l~~~~~~~~~-~-----------~~~~~~~~~~~~gGd~---~~~~ 386 (386) T protein:vir:49 341 MVKSGTLA-----QNQGLYILQQAEILPKE-L-----------PDGKNPNRTSLKGGEI---NEQD 386 (386) T ss_pred HHhCCCcC-----HHHHHHHHhhCCCCCCc-C-----------cchhccCCCCCCCCCC---CCCC Confidence 99999864 47788887532111110 0 000000000000 000 0000 No 60 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=99.76 E-value=5.1e-17 Score=109.90 Aligned_cols=420 Identities=12% Similarity=0.059 Sum_probs=224.7 Q ss_pred cceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHH Q lcl|NC_021302. 9 RTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHV 88 (484) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~ 88 (484) +|+ -|.+++| +.. +..... .+-..+.| + +-+.|.+|+...-..|.+++|.+...+........+ T Consensus 1 ~~~----~~~~~g~--~~~-~~~~~~-----~~~~~~~~---~-~~~~V~acV~~Ia~~iA~lpl~l~~~~~~~~~~~~l 64 (723) T protein:vir:94 1 MTT----FPSGAGG--WNA-WSADSV-----FGNGAKGW---S-NSAVAYRCISMLANNAASVDLVVRGPDGELDELHPL 64 (723) T ss_pred Ccc----cccCCCc--ccc-cccccc-----ccccHHHH---h-hhHHHHHHHHHHHHhhccceeEEEcCCCccchhhHH Confidence 221 1121111 111 111100 01111222 2 468999999999999999999996433221111111 Q ss_pred HHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecCCCce Q lcl|NC_021302. 89 AACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGL 167 (484) Q Consensus 89 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l 167 (484) ...|. .+-....+..++.+.++ +.+.+|-+.+++++.-.+-...+..|.+.+++.... ....++.. T Consensus 65 ~~lL~------------~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v-~~~~~~~~ 131 (723) T protein:vir:94 65 SQLWN------------VMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTI-VATRAADA 131 (723) T ss_pred HHHHh------------hCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEE-eecCCCcc Confidence 11111 01122335677777776 688899999999875333234577888888765432 22222221 Q ss_pred eeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc Q lcl|NC_021302. 168 ISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVP 247 (484) Q Consensus 168 ~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P 247 (484) ..... ..........+....++....|++++....+..||.|.+..+....-.-.....+...|... .+.| T Consensus 132 ~~~~~-------~~~y~~~~~~G~~~~~~~~dIiHir~~~~~dg~~G~Spi~~a~~~i~~~~aa~~~~~~~f~N--G~~p 202 (723) T protein:vir:94 132 VPQAQ-------IIGYVIERTDGVRVPVLADEMLWLRFSDPYDPLAVMAPWKAARAAVDADFYAATWQRQSFKN--GARP 202 (723) T ss_pred ceeee-------eeEEEEEecCceeEEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhc--CCCc Confidence 11000 00111122334456788888888877665677899999999988887777778888888774 3667 Q ss_pred eEEecCCCCCCHHHHHHHHHHHHHHhcCC-ce--EEEc----------cCCceEEEecccCCchhHHHHHHHHHHHHHHH Q lcl|NC_021302. 248 YLKGNEADSEDDDRMDELLEIASNYSGGE-SA--GLAL----------TAGEEAGILSPNGTPLDPRRAIEYHDHQMALV 314 (484) Q Consensus 248 ~~~gk~~~~~~~~~~~~l~~~l~~~~~g~-~a--~~vi----------p~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ 314 (484) -.+.+++ ..++++.+++.+.+.+..+|. ++ .+++ ++|++++-++.+.....|.+..++..++|+++ T Consensus 203 ~giL~~~-~l~~e~~~~~~~~~~~~~~G~~Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~a 281 (723) T protein:vir:94 203 GGVVNLG-DMDEQTFTKTVAAFRSQVEGVQNAGRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLA 281 (723) T ss_pred ceEEEcC-CCCHHHHHHHHHHHHHHhhchhhcCcceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHH Confidence 5555664 578888888888887765542 22 2333 46777777766545556888889999999999 Q ss_pred Hhhhh-hcccccccchhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC---CCcHHHHH Q lcl|NC_021302. 315 ALAHF-LNLDGKGGSYALA-SVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI---GSRQDATA 389 (484) Q Consensus 315 ilGqt-lt~~~~gGs~A~~-evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~---~~~~~~~a 389 (484) +.-.. +..+ +++++-. +........-+.-.++.|+..||+.|++. . +.. -+|.|+.. ..|.+..+ T Consensus 282 fgVPp~~i~~--~st~sN~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~---~--g~~---~~~~f~~~~lLr~D~~~r~ 351 (723) T protein:vir:94 282 FGIRKDALLG--GSTYENQAEAKAAVWTETLIPQMEVMASITDLQLLPD---I--GWT---VEWDFNSVPALQEDLEAQA 351 (723) T ss_pred hCCChhHcCC--CCCcccHHHHHHHHHHHHHHHHHHHHHHHHhHhhccc---c--cCc---eEEeecchhhhhcCHHHHH Confidence 76663 3332 2223222 22223346677888899999999988763 1 211 25677653 35778899 Q ss_pred HHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCccccc-ccCCCcCCCccccCCCCccccccc-ccccccc-ccccccc Q lcl|NC_021302. 390 AALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDE-STADTGQDEPETDEPALPNTSGTT-STTNAPQ-ARKRPRG 466 (484) Q Consensus 390 e~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~-~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~~ 466 (484) ++++++++.|+.. .+++|+.+|+|.-+++....- .+.... ....+.+.+....+.. ....... ++.+|.. T Consensus 352 ~~~~~~v~~G~~T-----~NE~R~~lglpPi~gGd~~~~~~p~~~~--~a~~~~~~p~~~e~~~~~~~~~~~~~~~~p~~ 424 (723) T protein:vir:94 352 GRNQGYLVNDVLM-----VDEVRATIGLDPLPGGIGQMTLTPYRAQ--FAPAPAPAPAVEEGAARMLALLERVAADRPLP 424 (723) T ss_pred HHHHHHHhCCCcC-----HHHHHHHhCCCCCCCCcccceecccccc--ccCCCCCCccchhhhHhhhhhccccccccCcC Confidence 9999999999864 478999999975554432211 110000 0000001100000000 0000000 0000000 Q ss_pred cchHH------HhcCcccCcccCC Q lcl|NC_021302. 467 RSPRD------RRKTPDGAMPLWD 484 (484) Q Consensus 467 ~~~~~------~~~~~~~~~~~~~ 484 (484) ..+.. ....|+-..++|- T Consensus 425 ~~~~~~~~~~~~~~~~~~~~~~~~ 448 (723) T protein:vir:94 425 ELPVRATTVLHHDPGPDPQQTLYE 448 (723) T ss_pred CCCCCCCCCCCCCcccCCchhHHH Confidence 00000 0000111111221 No 61 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=99.76 E-value=1e-16 Score=108.19 Aligned_cols=436 Identities=12% Similarity=0.146 Sum_probs=219.9 Q ss_pred CCCC---------CCCccc-eeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhC Q lcl|NC_021302. 1 MAPK---------TVAPRT-ERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRR 70 (484) Q Consensus 1 ~~~~---------~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~ 70 (484) ++++ +.+=.+ .++......+++... . ..++..++ . +.+-.. .-+.|.+|+..+...|.. T Consensus 38 ~~~~~~~k~~~~~~~a~~~~~~~~~~~~~~~~~r~-~----~~~~~~l~--~---~~~~~~-~npiv~~~I~~ia~~IA~ 106 (551) T protein:vir:80 38 REQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKP-S----IRNNQDLH--G---VLKKFG-GNIILNAIINTRSNQVSM 106 (551) T ss_pred ccHHHHHHhhccCcceeecccccceecCcccccCc-c----ccChhHHH--H---HHHHhh-cCHHHHHHHHHHHHHHhh Confidence 1111 111001 111111111111110 0 00111111 1 122222 347899999999999985 Q ss_pred -----------CCcEEecCCCC-------HHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhccee Q lcl|NC_021302. 71 -----------TDWRIRPNGAR-------PEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAV 131 (484) Q Consensus 71 -----------~~~~v~p~~~~-------~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~ 131 (484) ..|.|.+...+ .+..+.+.+.|..+...-. -.+..|.++++.++ +.+.+|.+. T Consensus 107 ~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~--------p~~~s~~~f~~~lv~dlll~Gnay 178 (551) T protein:vir:80 107 YCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDND--------INRDSFSSFVKKIVRDTYMYDQVN 178 (551) T ss_pred hhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCC--------CccchHHHHHHHHHHHHHhcCCEE Confidence 45666543211 1222233344332210000 01125777888876 578899999 Q ss_pred eeEEEeecCCeeeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCcc-- Q lcl|NC_021302. 132 FEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDP-- 209 (484) Q Consensus 132 ~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~-- 209 (484) +|+++... | .+..|.+++|.++. ...+.+|.+... .........+.....++.+..|++++.... T Consensus 179 ~~i~rd~~-G--~~~~L~~l~p~~V~-v~~~~~g~~~~~---------~~~y~~~~~g~~~~~~~~~eiiH~~~n~~~~~ 245 (551) T protein:vir:80 179 FEKVFNRN-Q--SMVRFVAKDPTTIF-FATTADGKIPDN---------GNRFVQVIDQKIVATFNAREMAFAVRNPRSDI 245 (551) T ss_pred EEEEECCC-C--cEEEEEEeCCceeE-EEECCccccccC---------ceEEEEEeCCcEEEEEcccceEEecccCCCCc Confidence 99998653 3 37889999999885 445556543100 001112223334456788888888766543 Q ss_pred -CccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcce--EEecCCCCCCHHHHHHHHHHHHHHhcCC-ceE--EEc- Q lcl|NC_021302. 210 -GVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPY--LKGNEADSEDDDRMDELLEIASNYSGGE-SAG--LAL- 282 (484) Q Consensus 210 -~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~--~~gk~~~~~~~~~~~~l~~~l~~~~~g~-~a~--~vi- 282 (484) ..+||.|.+..+......-....++-..|...- +.|- +..+.+...++++.+++.+.+.+..+|. +++ .++ T Consensus 246 ~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng--~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~~vl~ 323 (551) T protein:vir:80 246 YATGYGYPELEIALKQFIAHENTEAFNDRFFSHG--GTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVS 323 (551) T ss_pred ccccccccHHHHHHHHHHHHHHHHHHHHHHHHcC--CCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcccccc Confidence 357899999999888888888888888888853 4563 3334445578888899999888875553 332 233 Q ss_pred cCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcc---------cccccc--hhhHHHHH-HHHHHHHHHHHHHH Q lcl|NC_021302. 283 TAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNL---------DGKGGS--YALASVQA-DTFVQSVQTVADEI 350 (484) Q Consensus 283 p~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~---------~~~gGs--~A~~evh~-~v~~~~~~aD~~~i 350 (484) +.|++++-++.+.....|.+..++..++|++++.-...-. .+.++| ++-.+... .....-+.-.++.| T Consensus 324 ~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~i 403 (551) T protein:vir:80 324 AEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFI 403 (551) T ss_pred CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHH Confidence 4677777666555555688999999999999964432111 111222 33333333 34556688889999 Q ss_pred HHHHHHHHHHHHHHhCCCCccccceEEecCCC-CcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCC-CCCccccc Q lcl|NC_021302. 351 RDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIG-SRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGP-DPDADDDE 428 (484) Q Consensus 351 ~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~-~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p-~~~e~~~~ 428 (484) +..||+.|++. |+. .+ +|.|+... .+....+++. ++...|+. ..+++|+.+|+|.. +.++.... T Consensus 404 e~~ln~~L~~~-----~~~--~~-~f~f~~~~~~~~~~~~~~~-~~~~~g~l-----T~NE~R~~~gl~P~~egGD~~~~ 469 (551) T protein:vir:80 404 EDFINKHIVAE-----FGD--KY-TFQFVGGDIKSELESVKIL-AEKAKVAM-----TVNEVRKELNLPGDVIGGDIPLN 469 (551) T ss_pred HHHHHhhhccc-----cCC--ce-EEEeeccChhhHHHHHHHH-HHHhcCCc-----CHHHHHHHhCCCCCCCCCceeec Confidence 99999977753 221 22 67776443 2344444443 45566764 35899999999753 33333221 Q ss_pred cc--C------CCcCCCcc--ccC-CCCccccccc-------ccccc---ccccccccccchHHHhcCcccCcccC--C Q lcl|NC_021302. 429 ST--A------DTGQDEPE--TDE-PALPNTSGTT-------STTNA---PQARKRPRGRSPRDRRKTPDGAMPLW--D 484 (484) Q Consensus 429 ~~--~------~~~~~~~~--~~~-~~~~~~~~~~-------~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~--~ 484 (484) +. . +....+.+ ... ....+..+.. .+... ....+.-...+..-.|...++-.-+. | T Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 548 (551) T protein:vir:80 470 GVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQRKDKDNANAGKQGMKGDKPND 548 (551) T ss_pred ccccccccccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCccccCCCccccccccCccccchhhhhcCCCCccc Confidence 10 0 00000000 000 0000000000 00000 00000000000001111111111000 0 No 62 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=99.76 E-value=1.6e-16 Score=107.18 Aligned_cols=461 Identities=11% Similarity=0.010 Sum_probs=231.1 Q ss_pred CCCCCCCccceeeeecccc------------cchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHh Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLA------------GFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPI 68 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v 68 (484) |+-|.-..+.++--+...+ .+....++.-+ ..+..|-.+.--+.|.+.-+.+.+|++..+..| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~p~~~~~~L~~~~e~~~~~~~~i~~~~~~i 75 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADLAKSPNSTQIPDHRIQSHN-----VGVNPPYNPDRLAAFLELNETLATGIRKKSRYE 75 (651) T ss_pred CCCccceeeeeEEEeecccccccccccccccccchhhhcccC-----CCCCCCCCHHHHHHHHhcChHHHHHHHHHhhhh Confidence 3333211111111111110 00111111000 111112223334457676899999999999999 Q ss_pred hCCCcEEecCC------CCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCC Q lcl|NC_021302. 69 RRTDWRIRPNG------ARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGG 141 (484) Q Consensus 69 ~~~~~~v~p~~------~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g 141 (484) .++.|.|+|.. .+++..+.+..++...... -......+.....+..+++.++ |-..+||+++|++=+ ..| T Consensus 76 ag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn-~~g 152 (651) T protein:vir:99 76 VGFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSR--WQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTD-IEG 152 (651) T ss_pred hccCceeeecccCCCCccchHHHHHHHHHhhccchh--hcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhc-Ccc Confidence 99999999832 2333334444433221000 0000000011124566666554 677889999998522 111 Q ss_pred eeeeeeeeeeCcccee------------------------------------------eeeec-CCCceeeeec------ Q lcl|NC_021302. 142 RFWLKRLAPRPQSSIA------------------------------------------YWNVD-RDGGLISIQQ------ 172 (484) Q Consensus 142 ~~~~~~l~~r~~~~~~------------------------------------------~~~~~-~dg~l~~~~q------ 172 (484) . +..|...|+..+. +..+. ..+....... T Consensus 153 ~--pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v 230 (651) T protein:vir:99 153 R--PVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEP 230 (651) T ss_pred c--hhhhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcce Confidence 1 2222222221110 00000 0000000000 Q ss_pred ----ccccc--------cccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 173 ----WPAGT--------FGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIR 240 (484) Q Consensus 173 ----~~~~~--------~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E 240 (484) ..... ......+..........+|....|++++....+.+||.|.+..+......-.....+...|.. T Consensus 231 ~~~~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i~~a~~a~~~~~~~f~ 310 (651) T protein:vir:99 231 TIRYREDEESEREPIFVDRETGDVTTGDANGLENRPANELIFIPNPSILEDDYGVPDWVSAIRTISADEAAKDYNRDFFD 310 (651) T ss_pred eEEeccCcceeeeeecccceeeeEEEcCCCceeEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHh Confidence 00000 000001122233344567788888887776667789999999999888888888888888887 Q ss_pred HhcCCcceEEecCC-CCCCHHHHHHHHHHHHHHhcCCceEEEccC-----------CceEEEecccCC-chhHHHHHHHH Q lcl|NC_021302. 241 RHGIGVPYLKGNEA-DSEDDDRMDELLEIASNYSGGESAGLALTA-----------GEEAGILSPNGT-PLDPRRAIEYH 307 (484) Q Consensus 241 r~~~G~P~~~gk~~-~~~~~~~~~~l~~~l~~~~~g~~a~~vip~-----------~~~ie~~~~~~~-~~~~~~li~~~ 307 (484) .. +.|-.+.+.+ ...++++++++.+.++++..+..-.++++. |++++-++.+.. ...|.+..++. T Consensus 311 NG--~~p~gil~~~~~~ls~e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfle~r~~~ 388 (651) T protein:vir:99 311 ND--TIPRMVIKVTGGELSEESKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFRQFREKN 388 (651) T ss_pred cc--CCCceEEEecCCCCCHHHHHHHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCcCchhhHHHHHHHHHH Confidence 53 5676665643 457899999999999988766444555543 555555554433 45688999999 Q ss_pred HHHHHHHHhhhhh-cccccccchhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC---C Q lcl|NC_021302. 308 DHQMALVALAHFL-NLDGKGGSYALASVQADT-FVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI---G 382 (484) Q Consensus 308 d~~Isk~ilGqtl-t~~~~gGs~A~~evh~~v-~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~---~ 382 (484) ..+|++++.-... ....+++++|..+.+... ...-+.-.++.|+..||+.|++...... +..-+|+|+.. . T Consensus 389 ~~eIa~afgVPp~~lG~~~~~~~sn~E~~~~~f~~~tL~P~~~~ie~eln~kLl~~~e~~~----~~~i~~ef~~~~llr 464 (651) T protein:vir:99 389 EHEIAKVLEVPPVKIGVTDSANRSNSDQQDKDFALEVIQPEQHTFAEWLYQIIHQQALGVT----DWTIEYELRGADQPK 464 (651) T ss_pred HHHHHHHhCCCHHHhccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccc----CceEEEEeccchhhh Confidence 9999999766543 333345667766666554 4567788999999999998887633321 11125566532 2 Q ss_pred CcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCc--cccccc---CCCcCCCc-c-ccCCCCcccccccccc Q lcl|NC_021302. 383 SRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDA--DDDEST---ADTGQDEP-E-TDEPALPNTSGTTSTT 455 (484) Q Consensus 383 ~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e--~~~~~~---~~~~~~~~-~-~~~~~~~~~~~~~~~~ 455 (484) .|.+..+++++.+.+.|+.. .+++|+.+|+|.-.++. ...... ..+....+ + .....++..... .. T Consensus 465 ~D~~~~~e~~~~~i~~G~~T-----~NE~R~~lglppi~~~~gd~~l~~~~~~~~g~~~~gge~~~~~~~~~~~~~--~~ 537 (651) T protein:vir:99 465 QEAQLAEQRVRAMRLAGVGL-----VDEAREELGLDPLGEPYGEMTLSEFEAEVAGDVAGGGETEAVHEPPEENKI--GE 537 (651) T ss_pred ccHHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCccccccccccccccccccccCCCCcccccCcccccc--cc Confidence 47788999999999999864 47899999998543221 111110 00000000 0 000000000000 00 Q ss_pred ccccccccccccchHHHhcC---cccCcccCC Q lcl|NC_021302. 456 NAPQARKRPRGRSPRDRRKT---PDGAMPLWD 484 (484) Q Consensus 456 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~ 484 (484) ...++.++...+.....+.. -....--+| T Consensus 538 ~e~~~~~~~~~~~e~~~~~~v~ss~~~~~gyd 569 (651) T protein:vir:99 538 REWDTVKSELTTKDPIEQMQFSSSNLDEGLYD 569 (651) T ss_pred chhhhhhhhhcccchhhhhhHHHHHHHhhcCC Confidence 00000011000111111100 000011222 No 63 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=99.76 E-value=2.4e-17 Score=111.69 Aligned_cols=342 Identities=11% Similarity=0.019 Sum_probs=210.1 Q ss_pred hhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeee Q lcl|NC_021302. 68 IRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLK 146 (484) Q Consensus 68 v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~ 146 (484) |.+++++|...++.. -.-+...|.. +-....++.++++.++ +.+.+|.+++.+++... | .+. T Consensus 1 ia~lp~~~~~~~~~~--~~~l~~lL~~------------~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~-G--~~~ 63 (348) T protein:vir:93 1 MASLPLKMYEDYKVV--NTEVSDLLTV------------SPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY-H--QPS 63 (348) T ss_pred CcccceEeEecCcCc--ccHHHHHHHh------------CCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEE Confidence 999999986433211 0112222211 1112235677777776 67889999999876432 3 367 Q ss_pred eeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHH Q lcl|NC_021302. 147 RLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWK 226 (484) Q Consensus 147 ~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~ 226 (484) .|.+++|.++.. ..+.+++.+.... ....+....+|+...+++++....+..+|.|.+..+....- T Consensus 64 ~L~~l~~~~v~~-~~~~~~~~~~y~~-------------~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~ 129 (348) T protein:vir:93 64 KLFLLNPDVVEM-LIENQSRELYYSI-------------HAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTD 129 (348) T ss_pred EEEEEcCCceEE-EEeCCCcEEEEEE-------------EcCCCeEEEEccccEEEecCCCCCCceeeccHHHHHHHHHH Confidence 899999988764 3455554433221 11223445678888888877656677889998888765443 Q ss_pred HHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHH Q lcl|NC_021302. 227 LKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEY 306 (484) Q Consensus 227 ~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~ 306 (484) .- .....|. +. .++ ..|..+.+.+...++++++++.+.+.+...+....++++.|++++-++.+.....|.+..++ T Consensus 130 ~~-~~~~~~~-~~-~~~-~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~ 205 (348) T protein:vir:93 130 FD-NAVRTFN-LT-EMQ-KPDSFMLKYGSNVSTEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENL 205 (348) T ss_pred HH-HHHHHHH-HH-hcC-CCceeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHH Confidence 33 2333343 22 222 23466677788889999999999998887766567889999999888777666679999999 Q ss_pred HHHHHHHHHhhhhhcc-cccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC--- Q lcl|NC_021302. 307 HDHQMALVALAHFLNL-DGKGGSYALASVQAD-TFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI--- 381 (484) Q Consensus 307 ~d~~Isk~ilGqtlt~-~~~gGs~A~~evh~~-v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~--- 381 (484) ...+|++++.-...-. ..++++++..+-+.. ....-+.-.++.|++.||+.|++.. +... ..+|+|+.. T Consensus 206 ~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~-~~~~-----g~~i~fd~~~l~ 279 (348) T protein:vir:93 206 TRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKT-DREK-----NRYFKFNVKSYL 279 (348) T ss_pred HHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcc-cccC-----cceEEeechhhh Confidence 9999999976654333 233456766655544 3455677888889999998777631 1111 125666532 Q ss_pred CCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccccccccccc Q lcl|NC_021302. 382 GSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQA 460 (484) Q Consensus 382 ~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (484) ..|.++.++++.+|++.|+.. .+++|+.+|+|.-++++....+.-- .+ ... +.....+.++...+..+. T Consensus 280 ~~d~~~~a~~~~~~~~~G~~T-----~NE~R~~~g~~p~~ggD~~~~~~n~--~~-~~~--~~~~~~~~~gg~~n~~~~ 348 (348) T protein:vir:93 280 RADSATQAEVYFKAVRSGYYT-----INDIREWEDLPPVEGGDKPLISGDL--YP-IDT--PLELRKSLKGGDKNVNES 348 (348) T ss_pred ccCHHHHHHHHHHHHhCCCCC-----HHHHHHHhCCCCCCCcCeEeecccc--cc-ccc--chhhcccccCCCCCcCCC Confidence 247788999999999999764 4789999999866555443321000 00 000 000000001111111111 No 64 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=99.76 E-value=2.6e-17 Score=111.50 Aligned_cols=397 Identities=9% Similarity=-0.013 Sum_probs=228.5 Q ss_pred CCCccceeeeecccccch--hhhhhhccccccc-----cccc---ccchHHHH-HHHHhcchHHHHHHHHHHHHhhCCCc Q lcl|NC_021302. 5 TVAPRTERGYVNPLAGFG--TFLAQGLDQFEQV-----DELR---WPNSVYTY-TRMCREEARIASVLRAIGLPIRRTDW 73 (484) Q Consensus 5 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~-----~~lr---~~~~~~~y-~~m~~~D~~v~s~l~~r~~~v~~~~~ 73 (484) ---|.-+...++..+.+. ..++.+-+..... ..+. +..+..+= +..+ +-+.|.+|+..+-..|.+++| T Consensus 1 ~~~~~~~~~~~~~~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al-~~~~v~~cv~~Ia~~iA~lp~ 79 (424) T protein:vir:18 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERIL-QISTVWRCVSLISTLTACLPL 79 (424) T ss_pred CCCCccccccCCCCchHHHHHhhccccccccccchhhccccccccccccccccHHHhh-ccHHHHHHHHHHHHhhccCce Confidence 111111111111111110 0011010000000 0000 00011111 2333 468899999999999999999 Q ss_pred EEecCCCCH---HHH--HHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeee Q lcl|NC_021302. 74 RIRPNGARP---EVV--EHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKR 147 (484) Q Consensus 74 ~v~p~~~~~---e~~--~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~ 147 (484) .|--...+. +.. .-+...|.. +.....+..++++.++ +.+.+|-+.+++++... | .+.. T Consensus 80 ~vy~~~~~~~~~~~~~~~~l~~lL~~------------~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~-G--~~~~ 144 (424) T protein:vir:18 80 DVFETDQNDNRKKVDLSNPLARLLRY------------SPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-G--DVIS 144 (424) T ss_pred EEEEeccCCceeeeccccHHHHHHhh------------ccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEE Confidence 984322211 110 011122210 1112234566777665 67889999999986433 3 3678 Q ss_pred eeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHH Q lcl|NC_021302. 148 LAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKL 227 (484) Q Consensus 148 l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~ 227 (484) |.+.+|.++.... .++.+.+. +...+....++++..++.++.. .+..+|.|.+..+....-. T Consensus 145 L~~l~~~~v~v~~--~~~~~~y~---------------~~~~g~~~~~~~~eVihir~~~-~dg~~G~spi~~~~~~i~~ 206 (424) T protein:vir:18 145 LLPLQSANMDVKL--VGKKVVYR---------------YQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGV 206 (424) T ss_pred EEEecCcceEEEE--cCCeEEEE---------------EEeCCeEEEeccccEEEecCcC-CCCcccccHHHHHHHHHHH Confidence 8899988875322 22333221 1223445578888888887654 4558999999998877777 Q ss_pred HHHHHHHHHHHHHHhcCCcceEEecCCCC-CCHHHHHHHHHHHHHHhcCCce--EEEccCCceEEEecccCCchhHHHHH Q lcl|NC_021302. 228 KDELIRIEAAAIRRHGIGVPYLKGNEADS-EDDDRMDELLEIASNYSGGESA--GLALTAGEEAGILSPNGTPLDPRRAI 304 (484) Q Consensus 228 K~~~~~~w~~f~Er~~~G~P~~~gk~~~~-~~~~~~~~l~~~l~~~~~g~~a--~~vip~~~~ie~~~~~~~~~~~~~li 304 (484) -....++...|... .+.|-.+.+.+.. .++++++.+.+.++++.++.++ .++++.|++++-++.+.....|.+.. T Consensus 207 ~~~~~~~~~~~f~n--g~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~ 284 (424) T protein:vir:18 207 AVAMEDQQRDFFAN--GAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASR 284 (424) T ss_pred HHHHHHHHHHHHhc--cCCcceEEEeCCcCCCHHHHHHHHHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHH Confidence 77777777788875 3567566666554 5788889999999888766544 48899999998887776666788999 Q ss_pred HHHHHHHHHHHhhhhhc-ccccccch--h-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC Q lcl|NC_021302. 305 EYHDHQMALVALAHFLN-LDGKGGSY--A-LASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE 380 (484) Q Consensus 305 ~~~d~~Isk~ilGqtlt-~~~~gGs~--A-~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~ 380 (484) ++...+|++++.-...- .+..++++ + ..+........-+.-.++.|+..||+.|++.- ..... .|+|+. T Consensus 285 ~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~tl~P~~~~ie~~ln~~L~~~~-----~~~~~--~~~fd~ 357 (424) T protein:vir:18 285 KFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPSK-----DVGRL--HAEHNL 357 (424) T ss_pred HHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcc-----ccCCe--EEEEec Confidence 99999999997665432 23333333 1 22333455567778888899999998776641 11122 455543 Q ss_pred --C-CCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccc Q lcl|NC_021302. 381 --I-GSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGT 451 (484) Q Consensus 381 --~-~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (484) . ..|.++.++++.++.+.|+.. .+++|+.+|+|.-++++...... +-.... .......+...+. T Consensus 358 ~~llr~d~~~r~~~~~~~~~~G~~T-----~NE~R~~~gl~pi~ggD~~~~~~-n~~~l~-~~~~~~~~~~n~a 424 (424) T protein:vir:18 358 DGLLRGDSASRAAFMKAMGESGLRT-----INEMRRTDNMPPLPGGDVAMRQA-QYVPIT-DLGTNKEPRNNGA 424 (424) T ss_pred hhhhccCHHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCcCeeeecc-Cccchh-hhhccCCccccCC Confidence 2 347788999999999999865 47899999998765544332211 000000 0000000000000 No 65 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=99.75 E-value=1.9e-16 Score=106.81 Aligned_cols=437 Identities=10% Similarity=0.080 Sum_probs=223.1 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchH-HHHHHHHhcchHHHHHHHHHHHHhhC--------- Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSV-YTYTRMCREEARIASVLRAIGLPIRR--------- 70 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~-~~y~~m~~~D~~v~s~l~~r~~~v~~--------- 70 (484) +-.++++-+++.. . .+++. .++. ..+..++.+..+ .+.+.+.. -+.|.+|+.++...|.. T Consensus 50 ~~~~~~a~~~~~~--~---~~~~~--~~~~--~~~~~~~~~~~l~~~l~~~~~-n~i~~~~I~t~~~~vA~~~~~~~~~~ 119 (563) T protein:vir:95 50 LYGQQQAYAEPFI--E---MMDTN--PEFR--DKRSYMKNEHNLHDVLKKFGN-NPILNAIILTRSNQVAMYCQPARYSE 119 (563) T ss_pred hccCCCcchhhhH--h---hhccc--cccc--ccccCCCCcccHHHHHHHhhc-chHHHHHHHHHHHHHHHHhhhhhhhc Confidence 2233322111110 0 01100 0000 000111112112 12233333 36788889888887774 Q ss_pred --CCcEEecCCCC-----HHHH--HHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecC Q lcl|NC_021302. 71 --TDWRIRPNGAR-----PEVV--EHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEG 140 (484) Q Consensus 71 --~~~~v~p~~~~-----~e~~--~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~ 140 (484) +.|.|..-..+ .+.. ..+...|...... ..-....|.++++.++ +.+.+|.+.+|+++.+++ T Consensus 120 ~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~--------~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~ 191 (563) T protein:vir:95 120 KGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKD--------KDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNN 191 (563) T ss_pred ccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCC--------CCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecC Confidence 34555332111 1111 1111112111000 0001225778888776 579999999999887653 Q ss_pred CeeeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccC---ccccchh Q lcl|NC_021302. 141 GRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPG---VWTGNSL 217 (484) Q Consensus 141 g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~---~p~G~gl 217 (484) ...+..|.+++|.++. ...+.+|.+.... ........+.....++....|+|+.....+ .+||.|. T Consensus 192 -~G~~~~L~pl~p~~V~-v~~~~~g~~~~~~---------~~y~~~~~g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Sp 260 (563) T protein:vir:95 192 -KTKLEKFIAVDPSTIF-YATDKKGKIIKGG---------KRFVQVVDKRVVASFTSRELAMGIRNPRTELSSSGYGLSE 260 (563) T ss_pred -CCceEEEEEeCCceeE-EEECCCCceeccc---------eeEEEEeCCceeEEecCcceEEEeccCCCCcccCcccchH Confidence 3457889999999885 4555565543110 011112223334467788888888776544 6789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEec--CCCCCCHHHHHHHHHHHHHHhcCC-ce---EEEccCCceEEEe Q lcl|NC_021302. 218 LRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGN--EADSEDDDRMDELLEIASNYSGGE-SA---GLALTAGEEAGIL 291 (484) Q Consensus 218 l~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk--~~~~~~~~~~~~l~~~l~~~~~g~-~a---~~vip~~~~ie~~ 291 (484) +..+......-....++-..|.... +.|-.+.+ .+...++++++++.+.+.+...|. ++ .++++.|++++-+ T Consensus 261 i~~a~~~i~~~~~~~~~~~~~f~ng--~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l 338 (563) T protein:vir:95 261 VEIAMKEFIAYNNTESFNDRFFSHG--GTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNM 338 (563) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcc--CCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEec Confidence 9999999888888888889998863 56644443 334468888999999999876653 33 3678999998888 Q ss_pred cccCCchhHHHHHHHHHHHHHHHHhhhhh-c---------ccccccch--hhHH-HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 292 SPNGTPLDPRRAIEYHDHQMALVALAHFL-N---------LDGKGGSY--ALAS-VQADTFVQSVQTVADEIRDVAQAHV 358 (484) Q Consensus 292 ~~~~~~~~~~~li~~~d~~Isk~ilGqtl-t---------~~~~gGs~--A~~e-vh~~v~~~~~~aD~~~i~~~ln~ql 358 (484) +.+.....|.+..++..++|++++.-..- . +++.|+|. +-.+ ........-+.--++.|+..||+.| T Consensus 339 ~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L 418 (563) T protein:vir:95 339 TPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHI 418 (563) T ss_pred cCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 77666667899999999999999554321 1 11222222 2222 2233455667788889999999988 Q ss_pred HHHHHHhCCCCccccceEEecCCCCcHHHHHHHH--HHHHhcCcccCCcccHHHHHHHhCCCCCCCCccccccc----CC Q lcl|NC_021302. 359 VEDIVDVNWGEDEPAPLLVFDEIGSRQDATAAAL--QMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDEST----AD 432 (484) Q Consensus 359 i~~l~~~Nf~~~~~~P~~~~~~~~~~~~~~ae~~--~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~----~~ 432 (484) ++. |+. . -+|.|.. .+.+..++.. .++.+.|+.. .+++|+.+|+|.-+.++....+. .. T Consensus 419 ~~~-----~~~--~-~~~~f~r--~D~~~~~e~~~~~~~~~~G~lT-----~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~ 483 (563) T protein:vir:95 419 ISE-----YGD--K-YTFQFVG--GDTKSATDKLNILKLETQIFKT-----VNEAREEQGKKPIEGGDIILDASFLQGTA 483 (563) T ss_pred chh-----ccc--c-cEEEecc--CCHHHHHHHHHHHHHhcCCccC-----HHHHHHHhCCCCCCCcceeeccccccccc Confidence 864 221 1 2555643 2444444433 3566777653 48999999998766554432210 00 Q ss_pred C----cCCCccc-cCCCCccccccccccccccccccccccchHHHhcCcccCc-------ccCC Q lcl|NC_021302. 433 T----GQDEPET-DEPALPNTSGTTSTTNAPQARKRPRGRSPRDRRKTPDGAM-------PLWD 484 (484) Q Consensus 433 ~----~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~ 484 (484) . ...+.+. ........++...+...++. ..+......++....++.. ++-+ T Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 546 (563) T protein:vir:95 484 QLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEE-GQSTDSSNDDKEIGTDAQIKGDDNVYRTQT 546 (563) T ss_pred ccccccCCCccccchhhhhcccccCCCCCCCCC-CCCCCCCCCccccccccccccccccccccC Confidence 0 0000000 00000000000000000000 0000000000000001111 1101 No 66 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=99.75 E-value=1.9e-16 Score=106.81 Aligned_cols=437 Identities=10% Similarity=0.080 Sum_probs=223.1 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchH-HHHHHHHhcchHHHHHHHHHHHHhhC--------- Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSV-YTYTRMCREEARIASVLRAIGLPIRR--------- 70 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~-~~y~~m~~~D~~v~s~l~~r~~~v~~--------- 70 (484) +-.++++-+++.. . .+++. .++. ..+..++.+..+ .+.+.+.. -+.|.+|+.++...|.. T Consensus 50 ~~~~~~a~~~~~~--~---~~~~~--~~~~--~~~~~~~~~~~l~~~l~~~~~-n~i~~~~I~t~~~~vA~~~~~~~~~~ 119 (563) T protein:vir:99 50 LYGQQQAYAEPFI--E---MMDTN--PEFR--DKRSYMKNEHNLHDVLKKFGN-NPILNAIILTRSNQVAMYCQPARYSE 119 (563) T ss_pred hccCCCcchhhhH--h---hhccc--cccc--ccccCCCCcccHHHHHHHhhc-chHHHHHHHHHHHHHHHHhhhhhhhc Confidence 2233322111110 0 01100 0000 000111112112 12233333 36788889888887774 Q ss_pred --CCcEEecCCCC-----HHHH--HHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecC Q lcl|NC_021302. 71 --TDWRIRPNGAR-----PEVV--EHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEG 140 (484) Q Consensus 71 --~~~~v~p~~~~-----~e~~--~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~ 140 (484) +.|.|..-..+ .+.. ..+...|...... ..-....|.++++.++ +.+.+|.+.+|+++.+++ T Consensus 120 ~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~--------~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~ 191 (563) T protein:vir:99 120 KGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKD--------KDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNN 191 (563) T ss_pred ccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCC--------CCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecC Confidence 34555332111 1111 1111112111000 0001225778888776 579999999999887653 Q ss_pred CeeeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccC---ccccchh Q lcl|NC_021302. 141 GRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPG---VWTGNSL 217 (484) Q Consensus 141 g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~---~p~G~gl 217 (484) ...+..|.+++|.++. ...+.+|.+.... ........+.....++....|+|+.....+ .+||.|. T Consensus 192 -~G~~~~L~pl~p~~V~-v~~~~~g~~~~~~---------~~y~~~~~g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Sp 260 (563) T protein:vir:99 192 -KTKLEKFIAVDPSTIF-YATDKKGKIIKGG---------KRFVQVVDKRVVASFTSRELAMGIRNPRTELSSSGYGLSE 260 (563) T ss_pred -CCceEEEEEeCCceeE-EEECCCCceeccc---------eeEEEEeCCceeEEecCcceEEEeccCCCCcccCcccchH Confidence 3457889999999885 4555565543110 011112223334467788888888776544 6789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEec--CCCCCCHHHHHHHHHHHHHHhcCC-ce---EEEccCCceEEEe Q lcl|NC_021302. 218 LRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGN--EADSEDDDRMDELLEIASNYSGGE-SA---GLALTAGEEAGIL 291 (484) Q Consensus 218 l~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk--~~~~~~~~~~~~l~~~l~~~~~g~-~a---~~vip~~~~ie~~ 291 (484) +..+......-....++-..|.... +.|-.+.+ .+...++++++++.+.+.+...|. ++ .++++.|++++-+ T Consensus 261 i~~a~~~i~~~~~~~~~~~~~f~ng--~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l 338 (563) T protein:vir:99 261 VEIAMKEFIAYNNTESFNDRFFSHG--GTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNM 338 (563) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcc--CCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEec Confidence 9999999888888888889998863 56644443 334468888999999999876653 33 3678999998888 Q ss_pred cccCCchhHHHHHHHHHHHHHHHHhhhhh-c---------ccccccch--hhHH-HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 292 SPNGTPLDPRRAIEYHDHQMALVALAHFL-N---------LDGKGGSY--ALAS-VQADTFVQSVQTVADEIRDVAQAHV 358 (484) Q Consensus 292 ~~~~~~~~~~~li~~~d~~Isk~ilGqtl-t---------~~~~gGs~--A~~e-vh~~v~~~~~~aD~~~i~~~ln~ql 358 (484) +.+.....|.+..++..++|++++.-..- . +++.|+|. +-.+ ........-+.--++.|+..||+.| T Consensus 339 ~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L 418 (563) T protein:vir:99 339 TPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHI 418 (563) T ss_pred cCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 77666667899999999999999554321 1 11222222 2222 2233455667788889999999988 Q ss_pred HHHHHHhCCCCccccceEEecCCCCcHHHHHHHH--HHHHhcCcccCCcccHHHHHHHhCCCCCCCCccccccc----CC Q lcl|NC_021302. 359 VEDIVDVNWGEDEPAPLLVFDEIGSRQDATAAAL--QMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDEST----AD 432 (484) Q Consensus 359 i~~l~~~Nf~~~~~~P~~~~~~~~~~~~~~ae~~--~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~----~~ 432 (484) ++. |+. . -+|.|.. .+.+..++.. .++.+.|+.. .+++|+.+|+|.-+.++....+. .. T Consensus 419 ~~~-----~~~--~-~~~~f~r--~D~~~~~e~~~~~~~~~~G~lT-----~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~ 483 (563) T protein:vir:99 419 ISE-----YGD--K-YTFQFVG--GDTKSATDKLNILKLETQIFKT-----VNEAREEQGKKPIEGGDIILDASFLQGTA 483 (563) T ss_pred chh-----ccc--c-cEEEecc--CCHHHHHHHHHHHHHhcCCccC-----HHHHHHHhCCCCCCCcceeeccccccccc Confidence 864 221 1 2555643 2444444433 3566777653 48999999998766554432210 00 Q ss_pred C----cCCCccc-cCCCCccccccccccccccccccccccchHHHhcCcccCc-------ccCC Q lcl|NC_021302. 433 T----GQDEPET-DEPALPNTSGTTSTTNAPQARKRPRGRSPRDRRKTPDGAM-------PLWD 484 (484) Q Consensus 433 ~----~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~ 484 (484) . ...+.+. ........++...+...++. ..+......++....++.. ++-+ T Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 546 (563) T protein:vir:99 484 QLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEE-GQSTDSSNDDKEIGTDAQIKGDDNVYRTQT 546 (563) T ss_pred ccccccCCCccccchhhhhcccccCCCCCCCCC-CCCCCCCCCccccccccccccccccccccC Confidence 0 0000000 00000000000000000000 0000000000000001111 1101 No 67 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=99.75 E-value=7e-17 Score=109.16 Aligned_cols=403 Identities=11% Similarity=0.041 Sum_probs=229.1 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) |+++.-...-....++...+........+..- ..+....+ ..+..+ +-+.|.+|+..+-..|.+++|++...++ T Consensus 1 ~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~----~~~~~~~v-~~~~a~-~~~~v~~~i~~Ia~~ia~lp~~~~~~~~ 74 (409) T protein:vir:94 1 MAKENIVTRIKKKLIDNWIDQSASKLYDFSPW----KNKSFWGV-INNTLE-TNETIFSAITKLSNSMASLPLKMYEDYK 74 (409) T ss_pred CcccccchhhhhHHhhhhhcCCcccccccccc----cCcccccc-chhhhh-ccHHHHHHHHHHHHhhhhCceeEeeccc Confidence 66655444322222222111110000000000 00000001 111233 4678999999999999999999854332 Q ss_pred CHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeee Q lcl|NC_021302. 81 RPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYW 159 (484) Q Consensus 81 ~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~ 159 (484) ... ..+.+.|.. +.....+..++++.++ +.+.+|-+..++++... | .+..|.+.+|.++.. T Consensus 75 ~~~--~~~~~lL~~------------~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~-G--~~~~L~~l~~~~v~v- 136 (409) T protein:vir:94 75 VVN--TEVSDLLTV------------SPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY-H--QPSKLFLLNPDVVEM- 136 (409) T ss_pred ccc--hhHHHHHhh------------hcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEEEEEEcCceeEE- Confidence 111 112222211 1122335677777765 67889999998876432 3 367899999988753 Q ss_pred eecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 160 NVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAI 239 (484) Q Consensus 160 ~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~ 239 (484) ..+.+++.+.... ....+....+|....+++++....+..+|.|.+..+....-..... ..|.. T Consensus 137 ~~~~~~~~~~y~~-------------~~~~g~~~~~~~~dvih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~-~~~~~-- 200 (409) T protein:vir:94 137 LIENQSRELYYSI-------------HAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAV-RTFNL-- 200 (409) T ss_pred EEeCCCcEEEEEE-------------EcCCceEEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHH-HHHHH-- Confidence 4445554332211 1112334567888888887765566788999988876655444433 33332 Q ss_pred HHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhh Q lcl|NC_021302. 240 RRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHF 319 (484) Q Consensus 240 Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqt 319 (484) ..++.+ |-.+.+.+...++++++++.+.+++...+....++++.|++++-++.+.....|.+..++..++|++++.-.. T Consensus 201 ~~~~~~-~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp 279 (409) T protein:vir:94 201 TEMQKP-DSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPS 279 (409) T ss_pred HhcCCC-CeeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCH Confidence 222222 4455567778889999999999988776655678899999998887665566788888999999999976654 Q ss_pred hcc-cccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC---CCcHHHHHHHHHH Q lcl|NC_021302. 320 LNL-DGKGGSYALASVQAD-TFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI---GSRQDATAAALQM 394 (484) Q Consensus 320 lt~-~~~gGs~A~~evh~~-v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~---~~~~~~~ae~~~~ 394 (484) --. ...+++++-.+-+.. ....-+.-.++.|++.||+.|++.. ... .. ..|+|+.. ..|.++.++++++ T Consensus 280 ~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~Ll~~~---~~~-~~--~~i~fd~~~ll~~d~~~~~~~~~~ 353 (409) T protein:vir:94 280 VFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKT---DRE-KN--RYFKFNVKSYLRADSATQAEVYFK 353 (409) T ss_pred HHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcc---ccc-Cc--ceEEeechhhhccCHHHHHHHHHH Confidence 333 223455555444443 3345577788888888888776631 111 11 25666532 2478889999999 Q ss_pred HHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccccccccccc Q lcl|NC_021302. 395 LVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQA 460 (484) Q Consensus 395 L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (484) +++.|+.. .+++|+.+|+|.-+.++....+. +- .+. .. +.....+.++...+..+. T Consensus 354 ~~~~G~~T-----~NE~R~~~g~~p~~ggD~~~~~~-n~-~~~-~~--~~~~~~~~kGG~~n~~e~ 409 (409) T protein:vir:94 354 AVRSGYYT-----INDIREWEDLPPVEGGDKPLISG-DL-YPI-DT--PLELRKSLKGGDKNVNES 409 (409) T ss_pred HHhCCCcC-----HHHHHHHhCCCCCCCcCeEeecc-cc-ccc-cc--chhhcccccCCCCCcCCC Confidence 99999764 48899999998665554433211 00 000 00 000000111111111111 No 68 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=99.75 E-value=3.2e-17 Score=111.04 Aligned_cols=406 Identities=11% Similarity=0.032 Sum_probs=230.4 Q ss_pred CCC-CCCCcc----ceeeeecccccc--hhhhhhhcccccccccccccchHHH-HHHHHhcchHHHHHHHHHHHHhhCCC Q lcl|NC_021302. 1 MAP-KTVAPR----TERGYVNPLAGF--GTFLAQGLDQFEQVDELRWPNSVYT-YTRMCREEARIASVLRAIGLPIRRTD 72 (484) Q Consensus 1 ~~~-~~~~~~----~~~~~~~~~~~~--~~~~~~~~~~~~~~~~lr~~~~~~~-y~~m~~~D~~v~s~l~~r~~~v~~~~ 72 (484) |=. ++-.-. +-...++...+. +..+ .++.... ...+..+ .+..+ +=+.|.+|+..+-..|.+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~~~-~~~~~~~------~~~g~~v~~~~al-~~~~v~~ci~~Ia~~ia~lp 72 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGVPISLTDGSFW-SAWGGMG------SSSGETVTADSAL-QLSAVWSCVRLIAETIATLP 72 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCCcccCCchhHH-Hhhcccc------cCCCceechHhhh-ccHHHHHHHHHHHHHHhhCc Confidence 110 000000 000011111010 0000 0000000 0001111 22344 46899999999999999999 Q ss_pred cEEecCCCCH---HHHHH-HHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeee Q lcl|NC_021302. 73 WRIRPNGARP---EVVEH-VAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKR 147 (484) Q Consensus 73 ~~v~p~~~~~---e~~~~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~ 147 (484) |.+.....+. +..+. +...|. .+.....++.++++.++ +.+.+|-+.+++++. +| .+.. T Consensus 73 ~~~~~~~~~g~~~~~~~~~l~~lL~------------~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~--~g--~~~~ 136 (437) T protein:vir:10 73 LNLYQTKPDGTRVLAKQHRLYTVIH------------SQPNAENTAAEFWEVIVASMLLWGNGYARKLRS--AG--VLIG 136 (437) T ss_pred eeEEEEcCCCceeeccccHHHHHhh------------ccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEec--CC--cEEE Confidence 9984322221 11110 111111 01112335677777776 568899999998764 34 4678 Q ss_pred eeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHH Q lcl|NC_021302. 148 LAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKL 227 (484) Q Consensus 148 l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~ 227 (484) |.+++|.++. ...+.+|.+.+..+. .++....++++..+++++.. .+.++|.|.+..+....-. T Consensus 137 L~~l~p~~v~-i~~~~~g~~~y~~~~--------------~~g~~~~~~~~dIih~r~~~-~d~~~G~spi~~~~~~i~~ 200 (437) T protein:vir:10 137 LELMLPQRTT-VKRLTSGALQYTYRN--------------VDGTVSTLAEDDVFHVRGFS-LDGLMGLTPIQYAREVLGN 200 (437) T ss_pred EEEEcCcceE-EEECCCCeEEEEEEe--------------cCceEEEEccccEEEecCcC-CCCcccccHHHHHHHHHHH Confidence 8899998875 445556665543321 12334567888887777654 4568999999999988877 Q ss_pred HHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCC-c--eEEEccCCceEEEecccCCchhHHHHH Q lcl|NC_021302. 228 KDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGE-S--AGLALTAGEEAGILSPNGTPLDPRRAI 304 (484) Q Consensus 228 K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~-~--a~~vip~~~~ie~~~~~~~~~~~~~li 304 (484) -.....+-..|.+. .+.|-.+.+++...++++.+++.+.+.+...|. + ..++++.|++++-++.+.....|.+.. T Consensus 201 ~~~~~~~~~~~f~n--g~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~ 278 (437) T protein:vir:10 201 STAANKTSASVFRN--GLRPSGVLSTDQILQKEKRAEIRTDLAEQFGGAMQAGKTMVLEAGMKYQAITMNPGDVQLLETR 278 (437) T ss_pred HHHHHHHHHHHHhc--cCCccEEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceeccCCceEEeccCChhhHHHHHHH Confidence 78888888888885 366866667777788888899999888765542 2 357899999988887766666788888 Q ss_pred HHHHHHHHHHHhhhhhc-ccccccch--h-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC Q lcl|NC_021302. 305 EYHDHQMALVALAHFLN-LDGKGGSY--A-LASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE 380 (484) Q Consensus 305 ~~~d~~Isk~ilGqtlt-~~~~gGs~--A-~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~ 380 (484) ++..++|++++.-..-- ....++++ + ..+........-+.-.+..|+..||+.|++.-- . .. -.|+|+. T Consensus 279 ~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~kll~~~e----~--~~-~~~~fd~ 351 (437) T protein:vir:10 279 AFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTLGFLTFTLRPWLTRIEQAARRSLLRPGE----R--DQ-FYAEFSV 351 (437) T ss_pred HHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhhccCccc----c--Cc-eEEEEec Confidence 99999999997655432 22333333 2 223333445666777888888888887765311 1 11 1456653 Q ss_pred C---CCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCcccc--CCCC------cccc Q lcl|NC_021302. 381 I---GSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETD--EPAL------PNTS 449 (484) Q Consensus 381 ~---~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~--~~~~------~~~~ 449 (484) . ..|.+.++++++++.+.|+.. .+++|+.+|+|.-+++.+.......- .+..... .+.. .+.. T Consensus 352 ~~ll~~d~~~r~~~~~~~~~~G~~T-----~NE~R~~~gl~pi~gg~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 425 (437) T protein:vir:10 352 EGLLRADSAGRAAFYSTMTQNGLMT-----RDECRAKENLPPMGGNAAVLTVQSAL-LPIDKLGEHTTATAAQDALKAWL 425 (437) T ss_pred hhhhccCHHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCCcceEeecCcc-cchhhccCcCCCcchhccccccC Confidence 2 347788999999999999764 47899999998655443322111000 0000000 0000 0000 Q ss_pred cccccccccccc Q lcl|NC_021302. 450 GTTSTTNAPQAR 461 (484) Q Consensus 450 ~~~~~~~~~~~~ 461 (484) +....+.+.+.. T Consensus 426 ~~~~~~~~~~e~ 437 (437) T protein:vir:10 426 YQEEKTRATQER 437 (437) T ss_pred CCCCCCCccccC Confidence 111111111111 No 69 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=99.74 E-value=1.3e-16 Score=107.77 Aligned_cols=377 Identities=12% Similarity=-0.001 Sum_probs=211.1 Q ss_pred CCCCCCCccceeeeeccccc-chhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAG-FGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNG 79 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~ 79 (484) ..-....+.+. .+...... .+......+ .+..+..+..+...+-+.|.+|+..+-..|.++++++.... T Consensus 9 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~---------~~~~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~~~~~~ 78 (392) T protein:vir:74 9 INQTNDPPEAG-SVQSYFPDGNDAQIMESL---------LGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKK 78 (392) T ss_pred hhcccCccccc-ccccccccCchhhhhhhc---------cCCCCcccchhhhhcchHHHHHHHHHHHhhccCceeeccch Confidence 11111111110 00000000 001111110 11112223333334678999999999999999999985321 Q ss_pred CCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceee Q lcl|NC_021302. 80 ARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAY 158 (484) Q Consensus 80 ~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~ 158 (484) .. ..+ .+-....+..++++.++ +.+.+|.+.+++++... | .+..|.+++|.++. T Consensus 79 ~~--------~l~-------------~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~-G--~~~~L~~i~~~~v~- 133 (392) T protein:vir:74 79 NQ--------GII-------------DNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNAN-G--ADMKWEYLRPSQVN- 133 (392) T ss_pred hh--------hhh-------------hhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCC-C--cEEEEEEEcCceeE- Confidence 10 011 11122335677788776 78899999999987543 3 36789999999885 Q ss_pred eeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 159 WNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAA 238 (484) Q Consensus 159 ~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f 238 (484) ...+.+++.+..+....+ ........++.+.++++++....+..+|.|.+..+....-.-....++...+ T Consensus 134 v~~~~~~~~~~y~~~~~~----------~~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~ 203 (392) T protein:vir:74 134 TYYFEYENGMYYNITFDD----------PKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISS 203 (392) T ss_pred EEEcCCCceEEEEEEecC----------CccceeEEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 445555554332211100 0112234577888888777666677899999999999888888888888889 Q ss_pred HHHhcCCcceEEecCCC--CCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHh Q lcl|NC_021302. 239 IRRHGIGVPYLKGNEAD--SEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVAL 316 (484) Q Consensus 239 ~Er~~~G~P~~~gk~~~--~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~il 316 (484) .... +.|-.+.+++. ..++++++.+.+...... +....++++.|++++-++.+.....|.+..++..++|++++. T Consensus 204 f~ng--~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~-n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fg 280 (392) T protein:vir:74 204 LNSS--LNVPGVLTVKGGGLLSDKDKASRSRSFMKRS-RSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYG 280 (392) T ss_pred Hhcc--CCCceEEEeCCCCCchHHHHHHHHHHHhccc-cCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhC Confidence 8863 56755555543 334555666655554332 112347899999999888776666788999999999999975 Q ss_pred hhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCCCcHHHHHHHHHHHH Q lcl|NC_021302. 317 AHFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIGSRQDATAAALQMLV 396 (484) Q Consensus 317 Gqtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~~~~~~~ae~~~~L~ 396 (484) -..-..+..+.+++..+.-......-+.-.++.|++.||+.|++.+ ++|+. . ....+...+++.+.+|+ T Consensus 281 VPp~~lg~~~~~~~~~e~~~~~~~~~l~p~~~~ie~~l~~~l~~~~-~~~~~-------~---~~~~d~~~~~~~~~~l~ 349 (392) T protein:vir:74 281 LPDSYIGGQGDQQSSIQQISGMYASALNRYLRPAISELEYKLSDHI-SVNMR-------P---AIDPLGDNYLSTISTAT 349 (392) T ss_pred CCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhccchh-cccch-------h---hhcCCHHHHHHHHHHHH Confidence 5432222112222222222334455566677888888888765431 11110 0 11235677888999999 Q ss_pred hcCcccCCcccHHHHHHHh---CCCCCCCCcccccccCCCcCCCccccCCCCccccccccc Q lcl|NC_021302. 397 NAGLLTPDPRLEAFLRDAA---GLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTST 454 (484) Q Consensus 397 ~~G~~~~~~~~~~~i~e~~---glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (484) ..|+.. .+++|+.+ |+...+--+.+..++..++ +...+.+ T Consensus 350 ~~g~~t-----~near~~~~~~g~~pne~r~~enl~~~~~G-------------d~~~p~p 392 (392) T protein:vir:74 350 RWGALA-----ENQATFVLQEAGYIPKDLPAPENTNKKTTG-------------QSNEPVP 392 (392) T ss_pred hCCCcC-----HHHHHHHHHhCCCCccccchhcCCCCCCCC-------------CCCCCCC Confidence 999764 35566554 6542111111111110000 0111111 No 70 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=99.73 E-value=2.1e-16 Score=106.57 Aligned_cols=376 Identities=12% Similarity=0.002 Sum_probs=211.6 Q ss_pred CCCCCCCcc--ceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecC Q lcl|NC_021302. 1 MAPKTVAPR--TERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPN 78 (484) Q Consensus 1 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~ 78 (484) +.-+...+. +....... +........+. +..+..+..+...+-+.|.+|+..+-..|.++++++.-. T Consensus 9 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~---------~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~ 77 (392) T protein:vir:10 9 INQTNDPPEVGSVQSYFPD--GNDAQIMESLL---------GDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKK 77 (392) T ss_pred hhccccccccccccccccc--Cchhhhhhhhc---------CCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccc Confidence 111111111 11111000 01111111111 111111222333357899999999999999999998532 Q ss_pred CCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCcccee Q lcl|NC_021302. 79 GARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIA 157 (484) Q Consensus 79 ~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~ 157 (484) .. ..| ..+-....+..++++.++ +.+.+|.+++++++... | .+..|.++++.++. T Consensus 78 ~~---------~~l------------~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~-g--~~~~L~~l~~~~v~ 133 (392) T protein:vir:10 78 KN---------QGI------------IDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNAN-G--ADMKWEYLRPSQVN 133 (392) T ss_pred hh---------hhH------------hhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCC-C--cEEEEEEEcCceeE Confidence 11 011 111122345677888776 67889999999986533 3 36789999999885 Q ss_pred eeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 158 YWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAA 237 (484) Q Consensus 158 ~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~ 237 (484) ...+.+++.+..+....+ ........++.+..|+.++....+..+|.|.+..+....-.-....++... T Consensus 134 -~~~~~~~~~~~y~~~~~~----------~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~ 202 (392) T protein:vir:10 134 -TYYFEYENGMYYNITFDD----------PKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTIS 202 (392) T ss_pred -EEEcCCCceEEEEEEecC----------cccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHH Confidence 345555554433221111 011123457788888887777677789999999999988888888888888 Q ss_pred HHHHhcCCcceEEecCCC--CCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHH Q lcl|NC_021302. 238 AIRRHGIGVPYLKGNEAD--SEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVA 315 (484) Q Consensus 238 f~Er~~~G~P~~~gk~~~--~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~i 315 (484) +.... +.|-.+.+++. ..++++++.+.+...... +....+++|.|++++-++.+.....|.+..++..++|++++ T Consensus 203 ~f~ng--~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~f 279 (392) T protein:vir:10 203 SLNSS--LNVPGVLTVKGGGLLSDKDKASRSRSFMKRS-RSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVY 279 (392) T ss_pred HHhcc--CCCceEEEeCCCCCchHHHHHHHHHHHhccc-cCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHh Confidence 88863 56755555543 344555555555544332 22245789999999888877666678899999999999997 Q ss_pred hhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCCCcHHHHHHHHHHH Q lcl|NC_021302. 316 LAHFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIGSRQDATAAALQML 395 (484) Q Consensus 316 lGqtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~~~~~~~ae~~~~L 395 (484) .-..-..+..+.+.+..+-.......-+.-.++.|++.||+.|++.+ .++.. . ....+...+++.+.+| T Consensus 280 gVpp~~lg~~~~~~~~~~~~~~f~~~~l~P~~~~ie~~l~~~L~~~~-~~d~~-------~---~~~~d~~~~~~~~~~l 348 (392) T protein:vir:10 280 GLPDSYIGGQGDQQSSIQQISGMYASALNRYLRPAISELEYKLSDHI-SVNMR-------P---AIDPLGDNYLSTISTA 348 (392) T ss_pred CCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-cccch-------h---hhccCHHHHHHHHHHH Confidence 65433322112222222323344556667778888888888765431 11110 0 1123567778889999 Q ss_pred HhcCcccCCcccHHHHHHHh---CCCCCCCCcccccccCCCcCCCccccCCCCccccccccc Q lcl|NC_021302. 396 VNAGLLTPDPRLEAFLRDAA---GLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTST 454 (484) Q Consensus 396 ~~~G~~~~~~~~~~~i~e~~---glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (484) ...|+.. .+++|+.+ |+...+--+.+..++.. .++..++.+ T Consensus 349 ~~~g~~t-----~nE~r~~l~~~g~~p~e~r~~e~l~~~~-------------~Gd~~~p~p 392 (392) T protein:vir:10 349 TRWGALA-----ENQATFVLQEAGYIPKDLPAPENTNKKT-------------TGQSNEPVP 392 (392) T ss_pred HhCCCcC-----HHHHHHHHHhcCCCccccchhcCCCCCC-------------CCCCCCCCC Confidence 9999764 35566655 66421110000101100 001111111 No 71 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=99.73 E-value=2.1e-16 Score=106.57 Aligned_cols=376 Identities=12% Similarity=0.002 Sum_probs=211.6 Q ss_pred CCCCCCCcc--ceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecC Q lcl|NC_021302. 1 MAPKTVAPR--TERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPN 78 (484) Q Consensus 1 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~ 78 (484) +.-+...+. +....... +........+. +..+..+..+...+-+.|.+|+..+-..|.++++++.-. T Consensus 9 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~---------~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~ 77 (392) T protein:vir:39 9 INQTNDPPEVGSVQSYFPD--GNDAQIMESLL---------GDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKK 77 (392) T ss_pred hhccccccccccccccccc--Cchhhhhhhhc---------CCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccc Confidence 111111111 11111000 01111111111 111111222333357899999999999999999998532 Q ss_pred CCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCcccee Q lcl|NC_021302. 79 GARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIA 157 (484) Q Consensus 79 ~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~ 157 (484) .. ..| ..+-....+..++++.++ +.+.+|.+++++++... | .+..|.++++.++. T Consensus 78 ~~---------~~l------------~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~-g--~~~~L~~l~~~~v~ 133 (392) T protein:vir:39 78 KN---------QGI------------IDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNAN-G--ADMKWEYLRPSQVN 133 (392) T ss_pred hh---------hhH------------hhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCC-C--cEEEEEEEcCceeE Confidence 11 011 111122345677888776 67889999999986533 3 36789999999885 Q ss_pred eeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 158 YWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAA 237 (484) Q Consensus 158 ~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~ 237 (484) ...+.+++.+..+....+ ........++.+..|+.++....+..+|.|.+..+....-.-....++... T Consensus 134 -~~~~~~~~~~~y~~~~~~----------~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~ 202 (392) T protein:vir:39 134 -TYYFEYENGMYYNITFDD----------PKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTIS 202 (392) T ss_pred -EEEcCCCceEEEEEEecC----------cccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHH Confidence 345555554433221111 011123457788888887777677789999999999988888888888888 Q ss_pred HHHHhcCCcceEEecCCC--CCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHH Q lcl|NC_021302. 238 AIRRHGIGVPYLKGNEAD--SEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVA 315 (484) Q Consensus 238 f~Er~~~G~P~~~gk~~~--~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~i 315 (484) +.... +.|-.+.+++. ..++++++.+.+...... +....+++|.|++++-++.+.....|.+..++..++|++++ T Consensus 203 ~f~ng--~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~f 279 (392) T protein:vir:39 203 SLNSS--LNVPGVLTVKGGGLLSDKDKASRSRSFMKRS-RSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVY 279 (392) T ss_pred HHhcc--CCCceEEEeCCCCCchHHHHHHHHHHHhccc-cCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHh Confidence 88863 56755555543 344555555555544332 22245789999999888877666678899999999999997 Q ss_pred hhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCCCcHHHHHHHHHHH Q lcl|NC_021302. 316 LAHFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIGSRQDATAAALQML 395 (484) Q Consensus 316 lGqtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~~~~~~~ae~~~~L 395 (484) .-..-..+..+.+.+..+-.......-+.-.++.|++.||+.|++.+ .++.. . ....+...+++.+.+| T Consensus 280 gVpp~~lg~~~~~~~~~~~~~~f~~~~l~P~~~~ie~~l~~~L~~~~-~~d~~-------~---~~~~d~~~~~~~~~~l 348 (392) T protein:vir:39 280 GLPDSYIGGQGDQQSSIQQISGMYASALNRYLRPAISELEYKLSDHI-SVNMR-------P---AIDPLGDNYLSTISTA 348 (392) T ss_pred CCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-cccch-------h---hhccCHHHHHHHHHHH Confidence 65433322112222222323344556667778888888888765431 11110 0 1123567778889999 Q ss_pred HhcCcccCCcccHHHHHHHh---CCCCCCCCcccccccCCCcCCCccccCCCCccccccccc Q lcl|NC_021302. 396 VNAGLLTPDPRLEAFLRDAA---GLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTST 454 (484) Q Consensus 396 ~~~G~~~~~~~~~~~i~e~~---glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (484) ...|+.. .+++|+.+ |+...+--+.+..++.. .++..++.+ T Consensus 349 ~~~g~~t-----~nE~r~~l~~~g~~p~e~r~~e~l~~~~-------------~Gd~~~p~p 392 (392) T protein:vir:39 349 TRWGALA-----ENQATFVLQEAGYIPKDLPAPENTNKKT-------------TGQSNEPVP 392 (392) T ss_pred HhCCCcC-----HHHHHHHHHhcCCCccccchhcCCCCCC-------------CCCCCCCCC Confidence 9999764 35566655 66421110000101100 001111111 No 72 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=99.72 E-value=4.3e-16 Score=104.81 Aligned_cols=377 Identities=11% Similarity=0.028 Sum_probs=223.1 Q ss_pred CCCCCCCcccee-eeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCC Q lcl|NC_021302. 1 MAPKTVAPRTER-GYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNG 79 (484) Q Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~ 79 (484) +......+.... +....... .+... .+.+. .+..+...+-+.|.+|+..+...|.++++++.... T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~---~~~~~---------~~~~~--~v~~~~~~~~~~v~~~i~~ia~~ia~~p~~~~~~~ 72 (386) T protein:vir:48 7 TNLATESPPISQGGFFDITDP---DFLST---------LNGSE--WVSAESALRNSDLFSIINQLSNDLATVKLTASRKQ 72 (386) T ss_pred ccccccccccccccccccccc---hhccc---------ccCCc--eechhhhhcchHHHHHHHHHHHhhccCceeeccch Confidence 222222222221 11111100 00000 11111 12222223579999999999999999999985321 Q ss_pred CCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceee Q lcl|NC_021302. 80 ARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAY 158 (484) Q Consensus 80 ~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~ 158 (484) . ..| ..+.....++.++++.++ +.+.+|-+++++++... | .+..|.+.++.++.. T Consensus 73 ~---------~~l------------~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~-g--~~~~L~~l~~~~v~v 128 (386) T protein:vir:48 73 L---------QGI------------IDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNEN-G--RDMKWEYLRPSQVSF 128 (386) T ss_pred h---------HHH------------hhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCC-C--cEEEEEEecCceeEE Confidence 1 111 111222345777888876 67889999999987543 3 467899999988853 Q ss_pred eeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 159 WNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAA 238 (484) Q Consensus 159 ~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f 238 (484) ..+.+|+.+...-...+ ...+....+|...++++++....+.++|.|.+..+....-.-....++...+ T Consensus 129 -~~~~~~~~~~y~~~~~~----------~~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~ 197 (386) T protein:vir:48 129 -NRLDNKDGIYYNITFDD----------PRIPPKQHVPQGDVLHFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNS 197 (386) T ss_pred -EEcCCCceEEEEEEecC----------ccccceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHH Confidence 44445543322111000 0112334578888888887766777899999999988777777788888888 Q ss_pred HHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhh Q lcl|NC_021302. 239 IRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAH 318 (484) Q Consensus 239 ~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGq 318 (484) ... .++|-.+.+++...+++++.++.+....+..+....++++.|++++-++.+.....|.+..++..++|++++.-. T Consensus 198 ~~n--g~~~~~ii~~~~~~~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP 275 (386) T protein:vir:48 198 LKN--ALNANGILKIKGGGLLDFKTKLSRSRQAMKQMQGGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIP 275 (386) T ss_pred Hhc--cCCcceEEEeCCCCCHHHHHHHHHHHHHhhcCCCCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCC Confidence 885 367877778888888888888888887776665556889999998888766555678888899999999996654 Q ss_pred hhcccccccchhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCCCcHHHHHHHHHHHHh Q lcl|NC_021302. 319 FLNLDGKGGSYAL-ASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIGSRQDATAAALQMLVN 397 (484) Q Consensus 319 tlt~~~~gGs~A~-~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~~~~~~~ae~~~~L~~ 397 (484) ..-.+. .++++. .+........-+.--++.|+..||+.|++.+ .++.. + ....+...++..+.+|+. T Consensus 276 p~~lg~-~~~~~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~-~~~~~-----~-----~~~~d~~~~~~~~~~l~~ 343 (386) T protein:vir:48 276 ENVVGG-QGDQQSSLEMSLDLYNKAVSRYLRPFLSELSQKLSCDV-DADIL-----P-----AVDPTGSNSVSRINSMVK 343 (386) T ss_pred HHHhCC-CCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchh-hcchh-----h-----hhccChHHHHHHHHHHHh Confidence 433221 222222 2223344555566778888888888776532 11110 0 112345566778889999 Q ss_pred cCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 398 AGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 398 ~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) .|+.. .+++|+.+|.+.-.+++...... ...+..+. |+ ....+ T Consensus 344 ~g~~t-----~nE~r~~lg~~~~~~~~~~~~~~--~~~~~~~g---------Gd---~~~~~ 386 (386) T protein:vir:48 344 SGTLA-----QNQGLYILQQAEILPKELPEGEN--PNKTTLKG---------GE---INGED 386 (386) T ss_pred CCCcC-----HHHHHHHhhcCCCCCccchhhcC--CCCCccCC---------CC---CCCCC Confidence 99754 47899999875433222110000 00000000 00 00001 No 73 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=99.72 E-value=6.4e-16 Score=103.88 Aligned_cols=394 Identities=15% Similarity=0.082 Sum_probs=219.7 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEec--- Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRP--- 77 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p--- 77 (484) .+|...+.....-..++...... .. ... ...+++.++.+.|.+|+..+-..|.+++++|-. T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~--------------~~-~~~-~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~~~ 72 (423) T protein:vir:81 9 LAPSVVATPEPIELVGPIFESLK--------------LS-TKN-MTVEQIWEDQPHLRTVTTFIARNVASLQLQAFERVE 72 (423) T ss_pred cccccccCccccccccccccccc--------------cc-cch-hhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEEec Confidence 22221111110000011000000 00 011 134566567899999999999999999999832 Q ss_pred CCCCHHHHHH-HHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccc Q lcl|NC_021302. 78 NGARPEVVEH-VAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSS 155 (484) Q Consensus 78 ~~~~~e~~~~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~ 155 (484) ++...++.+. +...+.. -.....+.++++.++ +.+.+|-+...+.-. .++...+..|.+.++.. T Consensus 73 dg~~~~~~~~~~~~ll~~-------------PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd-~~~~~~~~~l~p~~~~~ 138 (423) T protein:vir:81 73 DGGRERVREGHLARVCKL-------------ANSDMTMYDLLERTMFDLCLYDEFFWLLPGD-LGVDTPTLDIRPIPVSW 138 (423) T ss_pred CCceeeeccchHHHHhhc-------------CCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcCcceEEEeecccce Confidence 2322221111 1111111 011234677777765 667899877766432 23333344566666665 Q ss_pred eeeeeecC-CCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 156 IAYWNVDR-DGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRI 234 (484) Q Consensus 156 ~~~~~~~~-dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~ 234 (484) +......+ .+.+.+... ......+....+|....|+.+.....+..+|.|.+..+....-.-....++ T Consensus 139 v~~~~~~~~~~~~~Y~~~-----------~~~~~~g~~~~~~~~evih~r~~~~~~~~~G~spi~~~~~~i~~~~~~~~~ 207 (423) T protein:vir:81 139 VQRRAYKDGWGSLDYIII-----------ESGDNDGRSVKVPGERVIHRHGYNPKTMKRGKSPVQSLRDILGEQIEAAIF 207 (423) T ss_pred eeeeeccCCCcceEEEEE-----------EecCCCceEEEEcccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHH Confidence 54322211 222222111 011123344668888877776555555568999999999888777778888 Q ss_pred HHHHHHHhcCCcceEEecCC-----CCCCHHHHHHHHHHHHHHhc-C-C--ceEEEccCCceEEEecccCCchhHHHHHH Q lcl|NC_021302. 235 EAAAIRRHGIGVPYLKGNEA-----DSEDDDRMDELLEIASNYSG-G-E--SAGLALTAGEEAGILSPNGTPLDPRRAIE 305 (484) Q Consensus 235 w~~f~Er~~~G~P~~~gk~~-----~~~~~~~~~~l~~~l~~~~~-g-~--~a~~vip~~~~ie~~~~~~~~~~~~~li~ 305 (484) -..|...- +.|-.+.+.+ ...++++++++.+.++.... + . ...++++.|++++-++.+.....|.+..+ T Consensus 208 ~~~~f~ng--~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~ 285 (423) T protein:vir:81 208 RAQMWRNG--PRPGMVIMRDPESKAGKWDAESRTRFMANLRASFSPKSSDVGGTLLLEDGMKAENFHTTSKDEQTVETTK 285 (423) T ss_pred HHHHHhcc--CCCceEEEecCcccCccCCHHHHHHHHHHHHHHhccccccCCcceecCCCceEEeccCChhhHHHHHHHH Confidence 88888742 5564444332 23467788888887776542 2 1 13568999999888776655556888888 Q ss_pred HHHHHHHHHHhhhh-hcccccccchhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC-- Q lcl|NC_021302. 306 YHDHQMALVALAHF-LNLDGKGGSYALASVQA-DTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI-- 381 (484) Q Consensus 306 ~~d~~Isk~ilGqt-lt~~~~gGs~A~~evh~-~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~-- 381 (484) +...+|+++..-.. |..+.++++++-.+-.. .....-+.-.++.|++.||+.|+++.-. ..... .|+|+.. T Consensus 286 ~~~~eIa~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~L~P~~~~ie~~l~~~L~~~~~~---~~~~~--~~~fd~~~l 360 (423) T protein:vir:81 286 LSLQTVAQVYGINPTMVGQLDNANYSNVREFRKALYGDNLGSWIRIIQDVMNLFLLPRVGI---DNEKF--YFEFNLEEK 360 (423) T ss_pred hhHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCcccc---ccCcc--EEEecchhh Confidence 99999999876543 33333345665444333 3444467788889999999888775221 11112 4566432 Q ss_pred -CCcHHHHHHHHHHHH-hcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccccccc Q lcl|NC_021302. 382 -GSRQDATAAALQMLV-NAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTT 455 (484) Q Consensus 382 -~~~~~~~ae~~~~L~-~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (484) ..|.+..++++++++ +.|+.. .+++|+.+|+|.-+.+++...+. +-. + .+....+ ++...+ T Consensus 361 lr~d~~~r~~~~~~~l~~~G~~T-----~NE~R~~~gl~p~~gGD~~~~p~-n~~-~---~~~~~~~---~~~~~t 423 (423) T protein:vir:81 361 LRASFEEAAEIKRAAVGNVAWMT-----INEVRAMDNLPSIDGGDDLARPL-NTE-F---GDSEDAP---GEEVET 423 (423) T ss_pred hccCHHHHHHHHHHHHhCCCCcC-----HHHHHHHhCCCCCCCcceeeccc-ccc-c---CccCCCC---CCCCCC Confidence 247788888888765 678654 47899999998776666544321 110 0 0000111 111111 No 74 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=99.71 E-value=3.6e-16 Score=105.24 Aligned_cols=381 Identities=11% Similarity=0.034 Sum_probs=217.3 Q ss_pred CCCCC------------CCc-----cceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHH Q lcl|NC_021302. 1 MAPKT------------VAP-----RTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRA 63 (484) Q Consensus 1 ~~~~~------------~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~ 63 (484) ||++. +.. ..+.............+. ..+ ....|..++ +-+.|.+|+.. T Consensus 4 ~~~~~~~~~m~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~----~~~~~~~~~-~~~~v~~cI~~ 69 (413) T protein:vir:96 4 VSEIRKDKNLKFFNNKRSPTEESKAKDEIPKAPQVVMTLPNFF---------KEL----ISDGYTKLS-DSPEVRMAVDC 69 (413) T ss_pred cchhhhhhcCCccccCCCcchhhhhhccccccccccccchhhH---------hhh----ccchhHHHh-hchHHHHHHHH Confidence 22221 000 000000000000000000 000 112244454 47899999999 Q ss_pred HHHHhhCCCcEEecCCCCHH-HHH-HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecC Q lcl|NC_021302. 64 IGLPIRRTDWRIRPNGARPE-VVE-HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEG 140 (484) Q Consensus 64 r~~~v~~~~~~v~p~~~~~e-~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~ 140 (484) +...|.+++|.+...+.+.+ ..+ .....|. .+.....++.++++.++ +.+.+|.++++++....+ T Consensus 70 ia~~ia~~~~~~~~~~~~~~~~~~~~~~~ll~------------~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g 137 (413) T protein:vir:96 70 IADLVSNMTIQLMQNGETGDKRIKNDLSRVVD------------IEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSG 137 (413) T ss_pred HHHhhccCceEEEEecCCCccccccHHHHHHH------------hccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCC Confidence 99999999999854333221 111 1111111 11122345777887776 567899999999876554 Q ss_pred CeeeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccC-ccccchhHH Q lcl|NC_021302. 141 GRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPG-VWTGNSLLR 219 (484) Q Consensus 141 g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~-~p~G~gll~ 219 (484) + .+..|.+.+|.++... .+ ++.+.+.. ...+..+++...|++++..... ..+|.|.+. T Consensus 138 ~--~~~~L~~l~~~~v~~~-~~-~~~~~y~~-----------------~~~~~~~~~~evih~k~~~~~~~~~~G~s~~~ 196 (413) T protein:vir:96 138 D--KIIGLTPISPYKVTFN-VS-DDDLDYSI-----------------TFDNKEYDPSTLLHFVLNPSIERPFIGTGYKV 196 (413) T ss_pred C--ceEEEEEecCceeEEE-Ec-CCeEEEEE-----------------eecCcEEchhhEEEEeccCCCCCccccccHHH Confidence 3 3467888888877532 22 22222111 1122346778888888765443 456999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCC-ce--EEEccCCce-EEEe-ccc Q lcl|NC_021302. 220 PAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGE-SA--GLALTAGEE-AGIL-SPN 294 (484) Q Consensus 220 ~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~-~a--~~vip~~~~-ie~~-~~~ 294 (484) .+....-.-....++...+.... +.|-.+.+.+...++++++++.+.+.+...|. ++ .++++.|.. +.-+ ..+ T Consensus 197 ~~~~~i~~~~~~~~~~~~~~~ng--~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~~~~~~ 274 (413) T protein:vir:96 197 ALKDIVGNLKQASVTKKGFMASE--YMPNLIVSVDSDSDELSDEEGRENFEEMYLKRKEAGKPWIIPEGMVNVQQIKPLT 274 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHhcc--CCccEEEEeCCCCCHHHHHHHHHHHHHHhcCccccCceeeecCCcccccccccCC Confidence 99988888888888888888863 67866667777788888999999998876552 22 256666653 2222 223 Q ss_pred CCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccc Q lcl|NC_021302. 295 GTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAP 374 (484) Q Consensus 295 ~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P 374 (484) .....|.+..++..++|++++.-..--.+...++.+ ........-+.-.++.|++.||+.|++. +. T Consensus 275 ~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~---~~~~~~~~~l~P~~~~ie~~ln~~ll~~---------~~-- 340 (413) T protein:vir:96 275 LNDLAINDAVTLDKKTVAGIFGVPAFLLGVGTYNKD---EFNNFINTKIMSIAQVIQQTYNKLIVEE---------DM-- 340 (413) T ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCcchHH---HHHHHHHHHHHHHHHHHHHHHHHhhCCC---------Cc-- Confidence 334457778888899999997655422221112332 2334455567788888999998876541 22 Q ss_pred eEEecC--C-CCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccc Q lcl|NC_021302. 375 LLVFDE--I-GSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGT 451 (484) Q Consensus 375 ~~~~~~--~-~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (484) +|+|+. . ..|.+..++++.++.+.|+.. .+++|+.+|+|.-++++....+. +- .+ ...... ....+ T Consensus 341 ~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t-----~NE~R~~~g~~p~~~gd~~~~~~-n~-~~-~~~~~~---~~~~~ 409 (413) T protein:vir:96 341 YFSLNPRSLYNYSLTEMVSAGAQMTQLNALR-----RNEFRNWVGMPPDAEMDDLLVLE-NY-LQ-QKDLVN---QKKLI 409 (413) T ss_pred EEEEechhhhccCHHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCcceeeecc-cc-cc-hhhccc---ccCCC Confidence 455643 2 347788999999999999864 47899999998654444332111 00 00 000000 00001 Q ss_pred cccc Q lcl|NC_021302. 452 TSTT 455 (484) Q Consensus 452 ~~~~ 455 (484) +..+ T Consensus 410 ~~dt 413 (413) T protein:vir:96 410 QDET 413 (413) T ss_pred CCCC Confidence 1111 No 75 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=99.71 E-value=1.4e-16 Score=107.48 Aligned_cols=372 Identities=9% Similarity=-0.006 Sum_probs=211.7 Q ss_pred CC---CCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEec Q lcl|NC_021302. 1 MA---PKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRP 77 (484) Q Consensus 1 ~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p 77 (484) .. .....+..+....... .+......+.. +..+. -+..+ +-+.|.+|+..+...|.++++++.- T Consensus 4 f~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~---------~~~v~-~~~al-~~~~V~~~i~~Ia~~ia~l~~~~~~ 70 (384) T protein:vir:49 4 FNITNLATESPPSNQDSFFDI--TDPEFLDALNG---------SEWVS-AETAL-KNSDLFSIISQLSNDLATAKITTSR 70 (384) T ss_pred ccccccCcccccccchhhccc--cchhhcccccC---------Cceec-hhhhh-ccHHHHHHHHHHHHHHhhCceeeec Confidence 11 1111111111000000 00000000100 11010 11233 4688999999999999999999852 Q ss_pred CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccce Q lcl|NC_021302. 78 NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSI 156 (484) Q Consensus 78 ~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~ 156 (484) ...+ .| ..+.....++.++++.++ +.+.+|-+.+++++... | .+..|.+.+|.++ T Consensus 71 ~~~~---------~l------------~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~-g--~~~~L~~l~~~~v 126 (384) T protein:vir:49 71 KQLQ---------GI------------VDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNEN-G--RDMKWEYLRPSQV 126 (384) T ss_pred chhh---------hh------------hhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCC-C--cEEEEEEEcCcee Confidence 1110 11 111122345777888776 57889999999987543 2 4678999999888 Q ss_pred eeeeecCCCcee-eeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 157 AYWNVDRDGGLI-SIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIE 235 (484) Q Consensus 157 ~~~~~~~dg~l~-~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w 235 (484) ... .+.+++.+ +.-... ....+....++...+|++++....+..+|.|.+..+....-.-....++. T Consensus 127 ~v~-~~~~~~~~~y~~~~~-----------~~~~~~~~~~~~~eVih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~ 194 (384) T protein:vir:49 127 SFN-RLDNQNGLYYNITFD-----------DPRIPPKQHVPQGDILHFRLLSVDGGLTSVSPLMALGRELNIQKASDKLT 194 (384) T ss_pred EEE-EcCCCceEEEEEEec-----------CccccceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHH Confidence 643 33444332 211110 01123345688888888887766777899999999998887777788888 Q ss_pred HHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHH Q lcl|NC_021302. 236 AAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVA 315 (484) Q Consensus 236 ~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~i 315 (484) ..+... .+.|-.+.+.+...++++..+...+-.....+....++++.|++++-++.+.....|.+..++..++|++++ T Consensus 195 ~~~~~n--g~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~f 272 (384) T protein:vir:49 195 LNALKN--ALNANGILKIKGGGLLDFKTKQSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVY 272 (384) T ss_pred HHHHhc--cCCCceEEEeCCCCChHHHHHHHHHHHhcccCCccceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHh Confidence 888875 367766666665555554444333333333333456789999998877766666678888999999999997 Q ss_pred hhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hCCCCccccceEEecC-CCCcHHHHHHH Q lcl|NC_021302. 316 LAHFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVD---VNWGEDEPAPLLVFDE-IGSRQDATAAA 391 (484) Q Consensus 316 lGqtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~---~Nf~~~~~~P~~~~~~-~~~~~~~~ae~ 391 (484) .-..--.+..+++.+..+.-.+.....++.-++-+...+++.|.+.+.. ..........+|.++. ...+.....++ T Consensus 273 gVp~~~lg~~~~~~~~~~~~~~~~~~~i~~~l~pi~~~i~~~l~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~ 352 (384) T protein:vir:49 273 GIPESVVGGEGDKQSSLEMIYNIYFKAVSRFLRPFVSELSKKLSCEVDADILPAVDPTGSNYIGLINSMVKTGTLAQNQG 352 (384) T ss_pred CCCHHHhCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhhhhhhhhhhccchHHHHHHHHHhhcCcccHHHH Confidence 6543222212222222233344455556666666666666665554321 1111112211222221 12345667788 Q ss_pred HHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccc Q lcl|NC_021302. 392 LQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDES 429 (484) Q Consensus 392 ~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~ 429 (484) ..+|...|+.. +++|+..|+|.-+.++..+.= T Consensus 353 ~~~l~~~g~~~------ne~r~~~~~~p~~gGd~~~~~ 384 (384) T protein:vir:49 353 LYVLQQAEILP------KDLPEGETDSTLKGGETNEQY 384 (384) T ss_pred HHHHhhCCCCC------hhHHHHcCCCCCCCCCCCCCC Confidence 88888888742 468999998765544333222 No 76 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=99.70 E-value=4.5e-16 Score=104.71 Aligned_cols=364 Identities=11% Similarity=0.037 Sum_probs=217.2 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) -.|+.+ +.+..+..-...+.+. +...+.... +. .+-++-..+-+.|.+|+..+...|.++++.+.-++. T Consensus 36 ~~~~~~-~~~~~~~~~~~~~~~g-~~~~~~~~~-------~~--~~t~~~~~~~~~v~acV~~Ia~~iA~lpl~~~~~~~ 104 (409) T protein:vir:83 36 RGPEEE-PEARALPWIRPTAWSG-YPESWATPS-------WG--SAQDKLRTLIDVAWACIDLNASVLSSMPIYRMRNGR 104 (409) T ss_pred cCCCcc-hhhhhccccccccccc-ccccccccC-------cc--ccchhhHhhhHHHHHHHHHHHHhhccCceEEeeCCc Confidence 111111 1111111111111111 111110000 00 111122234689999999999999999998865432 Q ss_pred CHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeee Q lcl|NC_021302. 81 RPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWN 160 (484) Q Consensus 81 ~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~ 160 (484) .. +.....++. +-.....+.++++.++..+..|-+..+++-...+| .+..|.+++|..+. +. T Consensus 105 ~~---~~~~~ll~~------------~PN~~~t~~~f~~~l~~~lllGnay~~~i~r~~~G--~~~~L~pl~p~~v~-v~ 166 (409) T protein:vir:83 105 II---DSVAWMSNP------------DPEVYTSWQEFAKQLFWDFQLGEAFVLPMAHGSDG--YPIRFRVVPPWLVN-VE 166 (409) T ss_pred cc---cchhhhccc------------CCCCCCCHHHHHHHHHHHHhhCCcEEEEEEECCCC--cEEEEEEECCcceE-EE Confidence 21 111111110 01112356777777766566688887765333334 46789999998876 55 Q ss_pred ecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 161 VDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIR 240 (484) Q Consensus 161 ~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~E 240 (484) .+.+|.+.+.... ... ....+++++....+..||.|.+..+....-......++-..|.. T Consensus 167 ~~~~g~~~y~~~~-------------------~~~-~~eiiHir~~~~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~ 226 (409) T protein:vir:83 167 LKKGARREYRIGG-------------------LNV-TDEILHIRYQGNTADAHGHGPLESAAPRQVVIGLLQKYVQNLAE 226 (409) T ss_pred EcCCceEEEEEcc-------------------ccC-ccceEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHh Confidence 6666665432110 011 13455555655667789999999999888888888888788877 Q ss_pred HhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCc-eEEEccCCceE-EEecccCCchhHHHHHHHHHHHHHHHHhhh Q lcl|NC_021302. 241 RHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGES-AGLALTAGEEA-GILSPNGTPLDPRRAIEYHDHQMALVALAH 318 (484) Q Consensus 241 r~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~-a~~vip~~~~i-e~~~~~~~~~~~~~li~~~d~~Isk~ilGq 318 (484) . -+.|-.+.+++...++++++++.+.+.....+.+ ..+++..|+++ +.++.+.....|.+..++..++|++++.-. T Consensus 227 n--ga~p~gil~~~~~ls~e~~~~~~~~~~~~~~~nag~~~il~~g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVP 304 (409) T protein:vir:83 227 T--GGVPLYWLGVERRLSETEAVDLMDRWIESRSKYAGHPALVTGGATLNQAKSMSAQDLSLMELTQFNEARIAILLGVP 304 (409) T ss_pred c--CCCcceEeecCCCCCHHHHHHHHHHHHHhhCCccCccceecCCcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCC Confidence 4 3678777788888899999999988877765422 12667777776 345555444457788888999999997765 Q ss_pred hhccc--ccc--cchhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC--C-CCcHHHHHH Q lcl|NC_021302. 319 FLNLD--GKG--GSYALASVQ-ADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE--I-GSRQDATAA 390 (484) Q Consensus 319 tlt~~--~~g--Gs~A~~evh-~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~--~-~~~~~~~ae 390 (484) ....+ +++ .+|+-.+-. ......-+.-.++.|+..||+.|++. .. +|+|+. . ..|.+++++ T Consensus 305 p~llg~~~~~~~~tysn~eq~~~~f~~~tL~P~~~~ie~~l~~~Ll~~---------~~--~~~f~~~~llr~d~~~r~~ 373 (409) T protein:vir:83 305 PFLVGLPGATGSLTYSNIEQLFSFHDRSSLRPKATAVMAALDRWALPS---------PQ--HLELNRDDYTRPSLVERAT 373 (409) T ss_pred HHHccCCCCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCC---------Cc--EEEeehhhhhccCHHHHHH Confidence 43332 122 234433333 33444567788889999999877642 12 455543 2 257788999 Q ss_pred HHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcC Q lcl|NC_021302. 391 ALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQ 435 (484) Q Consensus 391 ~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~ 435 (484) +++++.+.|+.. .+++|+..|+|..+.+.+.... +. T Consensus 374 ~~~~~~~~G~lT-----~NE~R~~~glpp~~ggd~l~~~----gv 409 (409) T protein:vir:83 374 AYKIMIEAGVME-----PNEARAMERLHSEAAAVRLSGG----GV 409 (409) T ss_pred HHHHHHhCCCcC-----HHHHHHHhCCCCCCCCcccCCC----CC Confidence 999999999864 4889999999865554433211 11 No 77 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=99.70 E-value=1.4e-15 Score=102.06 Aligned_cols=367 Identities=14% Similarity=0.114 Sum_probs=210.0 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) +....-.+.......... +..+...+...... ..+..+...+-+.|.+|+...-..|.+++++|..... T Consensus 4 ~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~--------~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~v~~~~~ 72 (385) T protein:vir:10 4 LTPRNFNKRKAKNMVYPS---NPAFFTTTVGGMQL--------SYVSALSALQNTNVYSVINRIASDVASAHFKTENTAT 72 (385) T ss_pred ccchhccccccccccccc---chhhhhhhccccCc--------cccCHHHhhccHHHHHHHHHHHHHHhhCceeeeccch Confidence 221111011110000010 00000000000000 0112222235789999999999999999999953211 Q ss_pred CHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeee Q lcl|NC_021302. 81 RPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYW 159 (484) Q Consensus 81 ~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~ 159 (484) ...+. +........++++.++ +.+.+|-++++++... ..+.+.++.++.. T Consensus 73 --------~~ll~-------------~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~~-------~~~~p~~~~~v~~- 123 (385) T protein:vir:10 73 --------LNRLE-------------SPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN-------LEHIPNSDVQINY- 123 (385) T ss_pred --------hhhhh-------------cCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc-------eeEeecCCceEEE- Confidence 11111 1112335677788776 5667999999987431 2234444444432 Q ss_pred eecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCc--cCccccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 160 NVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMD--PGVWTGNSLLRPAYKNWKLKDELIRIEAA 237 (484) Q Consensus 160 ~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~--~~~p~G~gll~~~~~~~~~K~~~~~~w~~ 237 (484) ....+++.+. +....++....+|++..|++++... .+..+|.|.+..+....-......++... T Consensus 124 -~~~~~~~~~~-------------~~~~~~~~~~~~~~~eiihik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~ 189 (385) T protein:vir:10 124 -LPGNMGIVYT-------------VLESNDRPQMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMS 189 (385) T ss_pred -EEcCCceEEE-------------EEEcCCceEEEEccccEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHH Confidence 2222222211 1122233456688888888876432 34568999999999988888888888888 Q ss_pred HHHHhcCCcceEEecCCCCC-CHHHHHHHHHHHHHHhcCCce--EEEccCCceEEEecccCCchhH-HHHHHHHHHHHHH Q lcl|NC_021302. 238 AIRRHGIGVPYLKGNEADSE-DDDRMDELLEIASNYSGGESA--GLALTAGEEAGILSPNGTPLDP-RRAIEYHDHQMAL 313 (484) Q Consensus 238 f~Er~~~G~P~~~gk~~~~~-~~~~~~~l~~~l~~~~~g~~a--~~vip~~~~ie~~~~~~~~~~~-~~li~~~d~~Isk 313 (484) +... .+.|-.+.+.+... ++++++++.+.++++.++.++ .++++.|++++-++.+.....+ .+..++..++|++ T Consensus 190 ~~~n--g~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~ 267 (385) T protein:vir:10 190 AMEN--QINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISK 267 (385) T ss_pred HHhc--cCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHH Confidence 8875 25676666665433 567889999999998776544 4789999998888766544454 4777888899999 Q ss_pred HHhhhhhcc-ccc--ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC---CCCcHHH Q lcl|NC_021302. 314 VALAHFLNL-DGK--GGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE---IGSRQDA 387 (484) Q Consensus 314 ~ilGqtlt~-~~~--gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~---~~~~~~~ 387 (484) ++.-..--. ..+ +.+++-.+-....+..-+.-.++.|++.||+.|+ + +.|+|+. ...|.+. T Consensus 268 ~fgVp~~~lg~~~~~~~~~sn~eq~~~~~~~~l~P~~~~ie~~l~~~l~--------~-----~~~~f~~~~ll~~d~~~ 334 (385) T protein:vir:10 268 AFGVPSDILGGGTSTESQHSNIDQIKATYLANLNSYVNPIVDELRLKMN--------A-----PDLELDIKDMLDVDDSA 334 (385) T ss_pred HhCCCHHHcCCccCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHhhC--------C-----ceEEeechhhhccCHHH Confidence 976543222 112 2334444444445555677888888888887653 1 2455543 2357888 Q ss_pred HHHHHHHHHhcCcccCCcccHHHHHHHhCCCCC-CCCcc-cccccCCCcCCCccccCCCCccccccc Q lcl|NC_021302. 388 TAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGP-DPDAD-DDESTADTGQDEPETDEPALPNTSGTT 452 (484) Q Consensus 388 ~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p-~~~e~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (484) ++++++++++.|+.. .+++|+.+|++.- .++.+ ...+.... .....++. T Consensus 335 ~~~~~~~~~~~G~~T-----~NE~R~~~g~~p~p~~~~~~~~~~~~~~-----------~~g~~~dn 385 (385) T protein:vir:10 335 LINQVSNLAKSGVLG-----AEQAQFILTRSGFLPDNLPEFKPLTTQV-----------KGGDEGDN 385 (385) T ss_pred HHHHHHHHHhCCCcC-----HHHHHHHhCCCccCCCCCccccCccccc-----------CCCCCCCC Confidence 999999999999864 4789999997532 22211 11111000 00001111 No 78 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=99.69 E-value=1.5e-15 Score=101.85 Aligned_cols=371 Identities=11% Similarity=0.052 Sum_probs=217.9 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) +..+-+.+++ ++....... +...+ . .+..++.+...+-+.|.+|+..+...|.+++|++.-... T Consensus 7 ~~~~~~~~~~--~~~~~~~~~---~~~~~---------~--~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 70 (382) T protein:vir:48 7 ATESPPDNQG--GFFDVVDSD---FLASL---------K--GNEWVSAETALRNSDLFSIINQLSNDLATVKLITSRKKL 70 (382) T ss_pred cccCCccccc--ccccchhhh---ccccc---------c--CCcccchHhhhccHHHHHHHHHHHHhhccCceeeecchh Confidence 2211111111 111111000 00000 0 111223222235789999999999999999999863221 Q ss_pred CHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeee Q lcl|NC_021302. 81 RPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYW 159 (484) Q Consensus 81 ~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~ 159 (484) + .| ..+.....++.++++.++ +.+.+|-++++++... +| .+..|.+.++.++.. T Consensus 71 ~---------~L------------~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~-~G--~~~~l~~i~~~~v~v- 125 (382) T protein:vir:48 71 Q---------GI------------VDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNE-NG--RDMKWEYLRPSQVSF- 125 (382) T ss_pred h---------hh------------hhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECC-CC--cEEEEEEEcCceeEE- Confidence 1 11 111122346788888887 6788999999987532 23 367899999998864 Q ss_pred eecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 160 NVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAI 239 (484) Q Consensus 160 ~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~ 239 (484) ..+.+++.+...-...+ ...+....++...++++++....+.++|.|.+..+....-.-....++...+. T Consensus 126 ~~~~~~~~~~y~~~~~~----------~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~ 195 (382) T protein:vir:48 126 NRLDNKDGIYYNITFDD----------PRIPPKQHVPQNDVLHFRLLSVDGGMTSVSPLMALSRELDIQKASGNLTINSL 195 (382) T ss_pred EEcCCCCeEEEEEEecC----------ccccceeEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 34445543322110000 11223456788888888877777789999999999988887788888888888 Q ss_pred HHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhh Q lcl|NC_021302. 240 RRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHF 319 (484) Q Consensus 240 Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqt 319 (484) ..- +.|-.+.+++...++++..++.+...+...+....++++.|++++-++.+.....|.+..++..++|++++.-.. T Consensus 196 ~ng--~~p~~il~~~~~~~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~ 273 (382) T protein:vir:48 196 KNA--LNANGILKIKGGGLLDFKTKLSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPD 273 (382) T ss_pred hcc--CCCceEEEeCCCCChHHHHHHHHHHHhhccCCCCeeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCH Confidence 853 678777788777788888888888777765555568899999988887665666788889999999999976554 Q ss_pred hcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCCCcHHHHHHHHHHHHhcC Q lcl|NC_021302. 320 LNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIGSRQDATAAALQMLVNAG 399 (484) Q Consensus 320 lt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~~~~~~~ae~~~~L~~~G 399 (484) ...+..+.+....+.........+.--++.|++.||+.|.+++- .+ ..+.+ ..+.......+.+|...| T Consensus 274 ~~lg~~~~~~~~~~~~~~~~~~~l~p~~~~i~~~l~~~l~~~~~-~~-----~~~~~-----~~~~~~~~~~~~~l~~~g 342 (382) T protein:vir:48 274 NVVGGQGDQQSSLEMSSDLYSKAVSRYLRPFLSELSQKLSCDVD-AD-----IFPAV-----DPTGSNYISRINSLVKTG 342 (382) T ss_pred HHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhh-hh-----hhhhh-----ccchhHHHHHHHHHhhcC Confidence 33322121222333344555666677788888888887755421 11 11111 122344556677888888 Q ss_pred cccCCcccHHHHHHHhC---C-CCCCCCcccccccCCCcCCCcccc Q lcl|NC_021302. 400 LLTPDPRLEAFLRDAAG---L-PGPDPDADDDESTADTGQDEPETD 441 (484) Q Consensus 400 ~~~~~~~~~~~i~e~~g---l-p~p~~~e~~~~~~~~~~~~~~~~~ 441 (484) ++. .+++|+.++ + |.+....+...++.. +.++-+.+ T Consensus 343 ~~t-----~~e~r~~l~~~g~~~~~~~~~~~~~~~~~-GGd~~~~~ 382 (382) T protein:vir:48 343 TLA-----QNQGLYILQQAEILPKELPNGENPNSTLK-GGEEDGQD 382 (382) T ss_pred ccC-----HHHHHHHHhhCCCCCcchhhhhcCCCCCC-CCCCCCCC Confidence 754 467888764 2 111100000001000 11100000 No 79 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=99.67 E-value=5.9e-15 Score=98.57 Aligned_cols=428 Identities=14% Similarity=0.027 Sum_probs=213.0 Q ss_pred CCCCCCCccceee----eecccccchhhh-hhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEE Q lcl|NC_021302. 1 MAPKTVAPRTERG----YVNPLAGFGTFL-AQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRI 75 (484) Q Consensus 1 ~~~~~~~~~~~~~----~~~~~~~~~~~~-~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v 75 (484) ..+..+.++..+- -......+.... .-++ ++ +-......++.+.-++|.+|+..+...|.+++|.+ T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~p------p~~~~~La~~~~~n~~v~scI~~ia~~ia~~~~~i 72 (540) T protein:vir:41 2 FNYHLSIKSLEKYRAIKGDTDSQALKEDRFEEYV---EP------KVHPLVLLSLLQVNPYHASACSIKANDILRTGYLI 72 (540) T ss_pred CCcccChhhccchhhhhccccccccccCCCCccc---cC------CCCHHHHHHHHHhcHHHHHHHHHHHHHHhcCCceE Confidence 2222222221110 000000000000 0001 11 11233344666678899999999999999999999 Q ss_pred ecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCcc Q lcl|NC_021302. 76 RPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQS 154 (484) Q Consensus 76 ~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~ 154 (484) +.....- . +.+ - ....++.++++.++ +.+.+|.+.+++++... | .+..|.++++. T Consensus 73 ~~~~~~~--~----~~l----p-----------N~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~-G--~~~~L~~i~~~ 128 (540) T protein:vir:41 73 DGDDGGV--E----ELL----R-----------ACRPSFEFILLQALEDLQVFNYCTLEVVRDDQ-G--EPVRLDYIPAH 128 (540) T ss_pred ecCccch--h----hhc----c-----------CCCCCHHHHHHHHHHHHHhcCCeEEEEEECCC-C--cEEEEEEeCCc Confidence 7643321 1 111 1 12335778888876 68889999999987543 3 46788888888 Q ss_pred ceeeeeecCCCceeeeecccc----cccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHH Q lcl|NC_021302. 155 SIAYWNVDRDGGLISIQQWPA----GTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDE 230 (484) Q Consensus 155 ~~~~~~~~~dg~l~~~~q~~~----~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~ 230 (484) ++.. ..+.++ .+....... ...+.........+.....+|....|+++.....+.+||.|.+..+......-.. T Consensus 129 ~V~v-~~~~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~eViHir~~~~~~~~~G~Spi~~~~~~i~~~~~ 206 (540) T protein:vir:41 129 TVRV-HRDGSR-YMQTWDGIHVTYFKDYRYEGEVNPDNGEDQDGVGANEIIFIHLPSPICSYYGVPRYLSAAPSILAMQK 206 (540) T ss_pred ceEE-eEcCce-eEeeecCceeeeeecccccceeeccccccceeecccceEEecCCCCCCCcccccHHHHHHHHHHHHHH Confidence 7742 211111 111100000 0001111122223334456788888877766667788999999999998888888 Q ss_pred HHHHHHHHHHHhcCCcceEEecCCCCCC------HHHH----HHHHHHHHHHhcC----CceEEEc------cCCceEEE Q lcl|NC_021302. 231 LIRIEAAAIRRHGIGVPYLKGNEADSED------DDRM----DELLEIASNYSGG----ESAGLAL------TAGEEAGI 290 (484) Q Consensus 231 ~~~~w~~f~Er~~~G~P~~~gk~~~~~~------~~~~----~~l~~~l~~~~~g----~~a~~vi------p~~~~ie~ 290 (484) ...+-..|... .+.|-.+.+.+.... ++.. +.+.+...+..+| ....+++ +.|++++- T Consensus 207 ~~~~~~~~f~N--g~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~g~~~~p 284 (540) T protein:vir:41 207 IDEYNYAFFDN--YTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTVEVTFTP 284 (540) T ss_pred HHHHHHHHHhc--cCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCcccceeEEe Confidence 88888888874 366755544432222 1222 2333333322111 1122344 34666666 Q ss_pred ecccCCchhHHHHHHHHHHHHHHHHhhhh-hcccccc--cchhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_021302. 291 LSPNGTPLDPRRAIEYHDHQMALVALAHF-LNLDGKG--GSYALASVQ-ADTFVQSVQTVADEIRDVAQAHVVEDIVDVN 366 (484) Q Consensus 291 ~~~~~~~~~~~~li~~~d~~Isk~ilGqt-lt~~~~g--Gs~A~~evh-~~v~~~~~~aD~~~i~~~ln~qli~~l~~~N 366 (484) ++.+.....|.+..++...+|++++.-.. +....++ +++|-.+.. .......+.-.++.|+..||+.|++. T Consensus 285 l~~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~----- 359 (540) T protein:vir:41 285 LNTSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRTYYESVVRPQQEIVSSVLTDFIQLK----- 359 (540) T ss_pred cccchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc----- Confidence 66555556799999999999999976543 2222222 223333333 34456667889999999999876552 Q ss_pred CCCccccceEEecCCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHh-CCCCCCCCcccccc----cCCC--cCCCcc Q lcl|NC_021302. 367 WGEDEPAPLLVFDEIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAA-GLPGPDPDADDDES----TADT--GQDEPE 439 (484) Q Consensus 367 f~~~~~~P~~~~~~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~-glp~p~~~e~~~~~----~~~~--~~~~~~ 439 (484) ++. + -+|+|+...-.....++.+.++++.|+.. .+++|+.+ |++.- ++....+ .... .....+ T Consensus 360 ~~~-~--~~i~f~~~~ll~~D~~~~~~~lv~~G~lT-----~NE~Re~L~g~e~g--dd~~l~p~n~~~~~~~~~~~~~~ 429 (540) T protein:vir:41 360 LDP-G--ARFVFNEEILMESEFVHNYALLVQCGVLT-----PSEVREKLFGLDGG--PDMFMVPSSIGKSAMKRQKRNYE 429 (540) T ss_pred cCC-c--eEEEecchhhcchHHHHHHHHHHhCCCCC-----HHHHHHHhCcCcCC--CcccccccccccccccccccccC Confidence 221 1 26777654332234566778899999764 46788864 66431 1111111 0000 000000 Q ss_pred ccCCCCccccccccccccccccccccccchHHHhcCcccCcccCC Q lcl|NC_021302. 440 TDEPALPNTSGTTSTTNAPQARKRPRGRSPRDRRKTPDGAMPLWD 484 (484) Q Consensus 440 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (484) .+.+.......+....+..+......-. .+.+...+ -.+-| T Consensus 430 ~~~~~~~~k~~~~~~~~~~~~~~~~~~~--~~~~~~~~--~~~~~ 470 (540) T protein:vir:41 430 KNQINEIKRTYAKYKPRIQEIISSESPL--EDKKKKID--EVLSD 470 (540) T ss_pred CCCccccccccchhcccccCcccccccc--cccccccc--ccccc Confidence 1111000000000000000000000000 00010000 11122 No 80 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=99.66 E-value=6.4e-15 Score=98.39 Aligned_cols=367 Identities=13% Similarity=0.094 Sum_probs=208.4 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) +.+.--.+.+......+.. ..+............+. -+..+ +-+.|.+|+..+-..+.+++|+|.-... T Consensus 4 ~~~~~~~k~~~~~~~~~~~---~~~~~~~~~~~~~~~v~-------~~~~l-~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 72 (383) T protein:vir:10 4 LTPKNFSKRNAKNMVYPSN---PAFFTTTVGGMQLSYVS-------ALSAL-QNTNVYSVINRIASDVSSAHFKTENTAT 72 (383) T ss_pred ccccccccccccccccccc---hhhhhhhccCccccccc-------hhHhh-cchHHHHHHHHHHHhhccCceeecccch Confidence 3322111111111111110 00100000000000000 12233 3578999999999999999999852111 Q ss_pred CHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeee Q lcl|NC_021302. 81 RPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYW 159 (484) Q Consensus 81 ~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~ 159 (484) ...|. +.....++.++++.++ +.+.+|-++++++-. ...+.+.++.++.. T Consensus 73 --------~~ll~-------------~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~-------~~~~~p~~~~~v~~- 123 (383) T protein:vir:10 73 --------LNRLE-------------SPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ-------NLEHIPNSDVQINY- 123 (383) T ss_pred --------hhhhh-------------CCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC-------ceeEeecCcceEEE- Confidence 01111 1112335677777775 566799999987521 11233344433321 Q ss_pred eecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCc--cCccccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 160 NVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMD--PGVWTGNSLLRPAYKNWKLKDELIRIEAA 237 (484) Q Consensus 160 ~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~--~~~p~G~gll~~~~~~~~~K~~~~~~w~~ 237 (484) ....+++++.. .....+....++....+++++... .+..+|.|.+..|......-....++... T Consensus 124 -~~~~~~~~~~~-------------~~~~~~~~~~~~~~evih~r~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~ 189 (383) T protein:vir:10 124 -LPGNMGIVYTV-------------LESNDRPKMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMS 189 (383) T ss_pred -EEcCCceEEEE-------------EEcCCceEEEEcccceEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHH Confidence 12222222211 112233455678888888775432 33468999999999888888888888888 Q ss_pred HHHHhcCCcceEEecCCCCC-CHHHHHHHHHHHHHHhcCCce--EEEccCCceEEEecccCCchhH-HHHHHHHHHHHHH Q lcl|NC_021302. 238 AIRRHGIGVPYLKGNEADSE-DDDRMDELLEIASNYSGGESA--GLALTAGEEAGILSPNGTPLDP-RRAIEYHDHQMAL 313 (484) Q Consensus 238 f~Er~~~G~P~~~gk~~~~~-~~~~~~~l~~~l~~~~~g~~a--~~vip~~~~ie~~~~~~~~~~~-~~li~~~d~~Isk 313 (484) |.... ++|-.+.+++... ++++.+++.+.++++.++.++ .++++.|++++-++.+.....+ .++.++..++|+. T Consensus 190 ~f~ng--~~~~~il~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~ 267 (383) T protein:vir:10 190 AMENQ--INPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISK 267 (383) T ss_pred HHhcc--CCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHH Confidence 88853 6675555555444 577888899999988766543 4788999999888766555555 5788888999999 Q ss_pred HHhhhhh-cc--cccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC--C-CCcHHH Q lcl|NC_021302. 314 VALAHFL-NL--DGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE--I-GSRQDA 387 (484) Q Consensus 314 ~ilGqtl-t~--~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~--~-~~~~~~ 387 (484) ++.-..- .. ++.+.+++-.+-....+..-+.-.++.|+..||+.|+ + +.++|+. . ..|.+. T Consensus 268 afgVPp~~lg~~~~~~~~~sn~eq~~~~~~~~l~P~~~~ie~~l~~~l~--------~-----~~~~f~~~~l~~~d~~~ 334 (383) T protein:vir:10 268 AFGVPSDILGGGTSTESQHSNIDQIKATYLANLNSYVNPIVDELRLKMN--------A-----PDLELDIKDMLDVDDSI 334 (383) T ss_pred HhCCCHHHcCCccCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhC--------C-----ceEEeechhhhccCHHH Confidence 9765432 21 1112233333444445556677788888888887552 1 2455542 2 357889 Q ss_pred HHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccccc Q lcl|NC_021302. 388 TAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTS 453 (484) Q Consensus 388 ~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (484) .++++.++.+.|+.. .+++|+.+|+|.-.+++.... ... ..+..-|+.. T Consensus 335 ~~~~~~~~~~~G~~t-----~nE~R~~lg~~p~~~~d~~~~----------~~~--~~~~~gGd~e 383 (383) T protein:vir:10 335 LINQVSNLAKSGVLG-----AEQAQFILTRSGFLPDNLPEF----------KPL--TNETKGGDDK 383 (383) T ss_pred HHHHHHHHHhCCCcC-----HHHHHHHhCCCcccCCccccc----------CCC--cccCCCCCCC Confidence 999999999999754 478999999965443322111 110 0011111111 No 81 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=99.66 E-value=5.2e-15 Score=98.90 Aligned_cols=378 Identities=11% Similarity=0.055 Sum_probs=193.4 Q ss_pred chhhhhhhccccccccc-ccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccc Q lcl|NC_021302. 21 FGTFLAQGLDQFEQVDE-LRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGD 99 (484) Q Consensus 21 ~~~~~~~~~~~~~~~~~-lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~ 99 (484) ||..... +.....+.. ..+.-...++.+-..+-+.|.+|+......|.+++|.+.-.+... ...+...|.. T Consensus 1 Mg~f~~l-f~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~--~~~~~~ll~~----- 72 (395) T protein:vir:10 1 MSILEKI-FKTRKDITYMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQ--KNDVYYKLNI----- 72 (395) T ss_pred Cchhhhh-hccCccccccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccc--cchHHHHHHh----- Confidence 3321000 000111110 111112223332223468999999999999999999986543211 1111122210 Q ss_pred hhhhhHHHhhcCCCHHHHHHHHHH-HHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecCCCceeeeeccccccc Q lcl|NC_021302. 100 ESDKPTPRTRGRFSWDQHLRLALK-SLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTF 178 (484) Q Consensus 100 ~~~~~~~~~~~~~~~~~~i~~~l~-a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~ 178 (484) +-.......++++.++. .+..|.++..+ .+++.+.+....++.+... .+ .... T Consensus 73 -------~PN~~~t~~~f~~~~~~~lll~g~~~~~~---~~~~~~~~~~~~~~~~~~~-----~~-~~~~---------- 126 (395) T protein:vir:10 73 -------KPNTDLSSDSFWQQVIYKLIYDNEVLIVV---SDSKELLIADSFYREEYAL-----YD-DIFK---------- 126 (395) T ss_pred -------ccCcCCCHHHHHHHHHHHHhhCCceEEEE---ecCCCeEecCCccceeEee-----cC-ccee---------- Confidence 11122345666666653 44556544322 2233222222112111111 00 0000 Q ss_pred ccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCC Q lcl|NC_021302. 179 GGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSED 258 (484) Q Consensus 179 ~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~ 258 (484) ............+|+..+|++++....+..||.|.+..+.... .....+..+ +...+.++.......+ T Consensus 127 ----~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~-------~~~~~~~~~-~~~~~gii~~~~~~~~ 194 (395) T protein:vir:10 127 ----DVTVKDYTYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIF-------GRMIGAQLK-NYQIRGILKSASSAYD 194 (395) T ss_pred ----EEEEcCceeeeeeccccEEEEccCCCCcccccchHHHHHHHHH-------HHHHHHHHh-cCCCceEEEeCCCCCC Confidence 0111112223468889999888888888899999988776332 222333343 4445555544344467 Q ss_pred HHHHHHHHHHHHHHhcC----CceEEEccCCceEEEecccCCch-----hHHHHHHHHHHHHHHHHhhhhhcccccccch Q lcl|NC_021302. 259 DDRMDELLEIASNYSGG----ESAGLALTAGEEAGILSPNGTPL-----DPRRAIEYHDHQMALVALAHFLNLDGKGGSY 329 (484) Q Consensus 259 ~~~~~~l~~~l~~~~~g----~~a~~vip~~~~ie~~~~~~~~~-----~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~ 329 (484) +++++++.+.++++.++ ..++++++.|++++-++.+.... .|.+..++..++|++++.-..--.. |++ T Consensus 195 ~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~---~~~ 271 (395) T protein:vir:10 195 EKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY---GET 271 (395) T ss_pred HHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc---Ccc Confidence 88888888888776544 22345578899998877553322 4666777888999999765432222 333 Q ss_pred h-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC-CCcHHHHHHHHHHHHhcCcccCCccc Q lcl|NC_021302. 330 A-LASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI-GSRQDATAAALQMLVNAGLLTPDPRL 407 (484) Q Consensus 330 A-~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~-~~~~~~~ae~~~~L~~~G~~~~~~~~ 407 (484) + ..+........-+.--+..|+..||+.|+.+--... .-+|.++.. ..|.+..+++++++++.|+.. T Consensus 272 sn~e~~~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~~~------~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt----- 340 (395) T protein:vir:10 272 ADLEKNTLVFEKFCLTPLLKKIQNELNAKLITQSMYLK------DTRIEIVGVNKKDPLQYAEAIDKLVSSGSFT----- 340 (395) T ss_pred cCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhhcc------cceecchhhhccCHHHHHHHHHHHHhCCCcC----- Confidence 3 334445555667788888999999987765421111 114554433 357788999999999999864 Q ss_pred HHHHHHHhCCCCCCCCc--cccccc--CCCcCCCccccCCCCccccccccccccc Q lcl|NC_021302. 408 EAFLRDAAGLPGPDPDA--DDDEST--ADTGQDEPETDEPALPNTSGTTSTTNAP 458 (484) Q Consensus 408 ~~~i~e~~glp~p~~~e--~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (484) .+++|+.+|+|.-+++. ....+. ............+.....+|.....++. T Consensus 341 ~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 341 RNEVRIMLGEEPSDNPELDEYLITKNYEKANSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred HHHHHHHhCCCCCCCCCCceeeeccccccccccccccCcccccccCCCCCCCCCC Confidence 47899999998655432 211110 0000000001001110111111111111 No 82 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=99.66 E-value=5.2e-15 Score=98.90 Aligned_cols=378 Identities=11% Similarity=0.055 Sum_probs=193.4 Q ss_pred chhhhhhhccccccccc-ccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccc Q lcl|NC_021302. 21 FGTFLAQGLDQFEQVDE-LRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGD 99 (484) Q Consensus 21 ~~~~~~~~~~~~~~~~~-lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~ 99 (484) ||..... +.....+.. ..+.-...++.+-..+-+.|.+|+......|.+++|.+.-.+... ...+...|.. T Consensus 1 Mg~f~~l-f~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~--~~~~~~ll~~----- 72 (395) T protein:vir:95 1 MSILEKI-FKTRKDITYMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQ--KNDVYYKLNI----- 72 (395) T ss_pred Cchhhhh-hccCccccccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccc--cchHHHHHHh----- Confidence 3321000 000111110 111112223332223468999999999999999999986543211 1111122210 Q ss_pred hhhhhHHHhhcCCCHHHHHHHHHH-HHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecCCCceeeeeccccccc Q lcl|NC_021302. 100 ESDKPTPRTRGRFSWDQHLRLALK-SLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTF 178 (484) Q Consensus 100 ~~~~~~~~~~~~~~~~~~i~~~l~-a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~ 178 (484) +-.......++++.++. .+..|.++..+ .+++.+.+....++.+... .+ .... T Consensus 73 -------~PN~~~t~~~f~~~~~~~lll~g~~~~~~---~~~~~~~~~~~~~~~~~~~-----~~-~~~~---------- 126 (395) T protein:vir:95 73 -------KPNTDLSSDSFWQQVIYKLIYDNEVLIVV---SDSKELLIADSFYREEYAL-----YD-DIFK---------- 126 (395) T ss_pred -------ccCcCCCHHHHHHHHHHHHhhCCceEEEE---ecCCCeEecCCccceeEee-----cC-ccee---------- Confidence 11122345666666653 44556544322 2233222222112111111 00 0000 Q ss_pred ccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCC Q lcl|NC_021302. 179 GGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSED 258 (484) Q Consensus 179 ~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~ 258 (484) ............+|+..+|++++....+..||.|.+..+.... .....+..+ +...+.++.......+ T Consensus 127 ----~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~-------~~~~~~~~~-~~~~~gii~~~~~~~~ 194 (395) T protein:vir:95 127 ----DVTVKDYTYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIF-------GRMIGAQLK-NYQIRGILKSASSAYD 194 (395) T ss_pred ----EEEEcCceeeeeeccccEEEEccCCCCcccccchHHHHHHHHH-------HHHHHHHHh-cCCCceEEEeCCCCCC Confidence 0111112223468889999888888888899999988776332 222333343 4445555544344467 Q ss_pred HHHHHHHHHHHHHHhcC----CceEEEccCCceEEEecccCCch-----hHHHHHHHHHHHHHHHHhhhhhcccccccch Q lcl|NC_021302. 259 DDRMDELLEIASNYSGG----ESAGLALTAGEEAGILSPNGTPL-----DPRRAIEYHDHQMALVALAHFLNLDGKGGSY 329 (484) Q Consensus 259 ~~~~~~l~~~l~~~~~g----~~a~~vip~~~~ie~~~~~~~~~-----~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~ 329 (484) +++++++.+.++++.++ ..++++++.|++++-++.+.... .|.+..++..++|++++.-..--.. |++ T Consensus 195 ~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~---~~~ 271 (395) T protein:vir:95 195 EKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY---GET 271 (395) T ss_pred HHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc---Ccc Confidence 88888888888776544 22345578899998877553322 4666777888999999765432222 333 Q ss_pred h-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC-CCcHHHHHHHHHHHHhcCcccCCccc Q lcl|NC_021302. 330 A-LASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI-GSRQDATAAALQMLVNAGLLTPDPRL 407 (484) Q Consensus 330 A-~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~-~~~~~~~ae~~~~L~~~G~~~~~~~~ 407 (484) + ..+........-+.--+..|+..||+.|+.+--... .-+|.++.. ..|.+..+++++++++.|+.. T Consensus 272 sn~e~~~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~~~------~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt----- 340 (395) T protein:vir:95 272 ADLEKNTLVFEKFCLTPLLKKIQNELNAKLITQSMYLK------DTRIEIVGVNKKDPLQYAEAIDKLVSSGSFT----- 340 (395) T ss_pred cCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhhcc------cceecchhhhccCHHHHHHHHHHHHhCCCcC----- Confidence 3 334445555667788888999999987765421111 114554433 357788999999999999864 Q ss_pred HHHHHHHhCCCCCCCCc--cccccc--CCCcCCCccccCCCCccccccccccccc Q lcl|NC_021302. 408 EAFLRDAAGLPGPDPDA--DDDEST--ADTGQDEPETDEPALPNTSGTTSTTNAP 458 (484) Q Consensus 408 ~~~i~e~~glp~p~~~e--~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (484) .+++|+.+|+|.-+++. ....+. ............+.....+|.....++. T Consensus 341 ~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:95 341 RNEVRIMLGEEPSDNPELDEYLITKNYEKANSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred HHHHHHHhCCCCCCCCCCceeeeccccccccccccccCcccccccCCCCCCCCCC Confidence 47899999998655432 211110 0000000001001110111111111111 No 83 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=99.66 E-value=5.2e-15 Score=98.90 Aligned_cols=378 Identities=11% Similarity=0.055 Sum_probs=193.4 Q ss_pred chhhhhhhccccccccc-ccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccc Q lcl|NC_021302. 21 FGTFLAQGLDQFEQVDE-LRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGD 99 (484) Q Consensus 21 ~~~~~~~~~~~~~~~~~-lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~ 99 (484) ||..... +.....+.. ..+.-...++.+-..+-+.|.+|+......|.+++|.+.-.+... ...+...|.. T Consensus 1 Mg~f~~l-f~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~--~~~~~~ll~~----- 72 (395) T protein:vir:10 1 MSILEKI-FKTRKDITYMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQ--KNDVYYKLNI----- 72 (395) T ss_pred Cchhhhh-hccCccccccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCccc--cchHHHHHHh----- Confidence 3321000 000111110 111112223332223468999999999999999999986543211 1111122210 Q ss_pred hhhhhHHHhhcCCCHHHHHHHHHH-HHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecCCCceeeeeccccccc Q lcl|NC_021302. 100 ESDKPTPRTRGRFSWDQHLRLALK-SLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTF 178 (484) Q Consensus 100 ~~~~~~~~~~~~~~~~~~i~~~l~-a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~ 178 (484) +-.......++++.++. .+..|.++..+ .+++.+.+....++.+... .+ .... T Consensus 73 -------~PN~~~t~~~f~~~~~~~lll~g~~~~~~---~~~~~~~~~~~~~~~~~~~-----~~-~~~~---------- 126 (395) T protein:vir:10 73 -------KPNTDLSSDSFWQQVIYKLIYDNEVLIVV---SDSKELLIADSFYREEYAL-----YD-DIFK---------- 126 (395) T ss_pred -------ccCcCCCHHHHHHHHHHHHhhCCceEEEE---ecCCCeEecCCccceeEee-----cC-ccee---------- Confidence 11122345666666653 44556544322 2233222222112111111 00 0000 Q ss_pred ccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCC Q lcl|NC_021302. 179 GGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSED 258 (484) Q Consensus 179 ~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~ 258 (484) ............+|+..+|++++....+..||.|.+..+.... .....+..+ +...+.++.......+ T Consensus 127 ----~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~-------~~~~~~~~~-~~~~~gii~~~~~~~~ 194 (395) T protein:vir:10 127 ----DVTVKDYTYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIF-------GRMIGAQLK-NYQIRGILKSASSAYD 194 (395) T ss_pred ----EEEEcCceeeeeeccccEEEEccCCCCcccccchHHHHHHHHH-------HHHHHHHHh-cCCCceEEEeCCCCCC Confidence 0111112223468889999888888888899999988776332 222333343 4445555544344467 Q ss_pred HHHHHHHHHHHHHHhcC----CceEEEccCCceEEEecccCCch-----hHHHHHHHHHHHHHHHHhhhhhcccccccch Q lcl|NC_021302. 259 DDRMDELLEIASNYSGG----ESAGLALTAGEEAGILSPNGTPL-----DPRRAIEYHDHQMALVALAHFLNLDGKGGSY 329 (484) Q Consensus 259 ~~~~~~l~~~l~~~~~g----~~a~~vip~~~~ie~~~~~~~~~-----~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~ 329 (484) +++++++.+.++++.++ ..++++++.|++++-++.+.... .|.+..++..++|++++.-..--.. |++ T Consensus 195 ~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~---~~~ 271 (395) T protein:vir:10 195 EKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY---GET 271 (395) T ss_pred HHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc---Ccc Confidence 88888888888776544 22345578899998877553322 4666777888999999765432222 333 Q ss_pred h-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC-CCcHHHHHHHHHHHHhcCcccCCccc Q lcl|NC_021302. 330 A-LASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI-GSRQDATAAALQMLVNAGLLTPDPRL 407 (484) Q Consensus 330 A-~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~-~~~~~~~ae~~~~L~~~G~~~~~~~~ 407 (484) + ..+........-+.--+..|+..||+.|+.+--... .-+|.++.. ..|.+..+++++++++.|+.. T Consensus 272 sn~e~~~~~~~~~~l~P~~~~ie~~l~~kL~~~~~~~~------~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt----- 340 (395) T protein:vir:10 272 ADLEKNTLVFEKFCLTPLLKKIQNELNAKLITQSMYLK------DTRIEIVGVNKKDPLQYAEAIDKLVSSGSFT----- 340 (395) T ss_pred cCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcChhhhcc------cceecchhhhccCHHHHHHHHHHHHhCCCcC----- Confidence 3 334445555667788888999999987765421111 114554433 357788999999999999864 Q ss_pred HHHHHHHhCCCCCCCCc--cccccc--CCCcCCCccccCCCCccccccccccccc Q lcl|NC_021302. 408 EAFLRDAAGLPGPDPDA--DDDEST--ADTGQDEPETDEPALPNTSGTTSTTNAP 458 (484) Q Consensus 408 ~~~i~e~~glp~p~~~e--~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (484) .+++|+.+|+|.-+++. ....+. ............+.....+|.....++. T Consensus 341 ~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 341 RNEVRIMLGEEPSDNPELDEYLITKNYEKANSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred HHHHHHHhCCCCCCCCCCceeeeccccccccccccccCcccccccCCCCCCCCCC Confidence 47899999998655432 211110 0000000001001110111111111111 No 84 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=99.65 E-value=1.4e-14 Score=96.58 Aligned_cols=384 Identities=11% Similarity=0.075 Sum_probs=212.4 Q ss_pred CC--CCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecC Q lcl|NC_021302. 1 MA--PKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPN 78 (484) Q Consensus 1 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~ 78 (484) +- ++.+.....-.+. ..+..+ ...... + . -+..++ +-+.|.+|+..+...|.+++|.|-.. T Consensus 7 ~~~~~~~~~~~~~~~~~-------~~~~~~--~~~~~~--~-~----~~~~~~-~~~~v~~~i~~ia~~ia~~~~~~~~~ 69 (406) T protein:vir:95 7 WRRTKRKSKIRADTGYV-------GLFMSG--EDVSFL--V-P----GYVRLS-DNPEVRMAVHKIADLISSMTIYLMQN 69 (406) T ss_pred hccccccccccccchhh-------hhhccC--cccCcc--c-c----CHHHHh-hcHHHHHHHHHHHHhhccCceEEEEe Confidence 11 1111111100000 000000 000000 0 0 133454 47899999999999999999998543 Q ss_pred CCCHH-HH-HHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHH-HHhh--cceeeeEEEeecCCeeeeeeeeeeCc Q lcl|NC_021302. 79 GARPE-VV-EHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALK-SLQF--GHAVFEQTYFYEGGRFWLKRLAPRPQ 153 (484) Q Consensus 79 ~~~~e-~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~-a~~~--G~s~~Eivw~~~~g~~~~~~l~~r~~ 153 (484) .++.. .+ ......| ..+-.....+.++++.++. .+.+ ||+..++++.. ...+..|.+++| T Consensus 70 ~~~~~~~~~~~~~~~l------------~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~---~g~~~~l~~i~~ 134 (406) T protein:vir:95 70 TEDGDIRIRNELSRKI------------DITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTA---DGLIDELVPLTP 134 (406) T ss_pred cCCcceeecchHHHHH------------hhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECC---CCcEEEEEEEcC Confidence 32211 11 1111111 1111223457788888875 4444 67777776643 234788999999 Q ss_pred cceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCc-cCccccchhHHHHHHHHHHHHHHH Q lcl|NC_021302. 154 SSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMD-PGVWTGNSLLRPAYKNWKLKDELI 232 (484) Q Consensus 154 ~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~-~~~p~G~gll~~~~~~~~~K~~~~ 232 (484) .++.. ..+.+|..+. ..+..++++.++++++... .+..+|.|.+..+....-.-.... T Consensus 135 ~~v~~-~~~~~~~~~~--------------------~~~~~~~~~evih~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~ 193 (406) T protein:vir:95 135 SKVNF-LDTPDGYQVL--------------------YGGQTFNYDEVLHFIYNPDPERPYIGRGYRVVLKDIADNLKQAT 193 (406) T ss_pred ceeEE-EEcCCeEEEE--------------------eccEEEchhHEEEeeccCCCCCCccccCHHHHHHHHHHHHHHHH Confidence 88753 3333332111 1234578888888886544 344679999999988888888888 Q ss_pred HHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCc-e--EEEccCC-ceEEEe-cccCCchhHHHHHHHH Q lcl|NC_021302. 233 RIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGES-A--GLALTAG-EEAGIL-SPNGTPLDPRRAIEYH 307 (484) Q Consensus 233 ~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~-a--~~vip~~-~~ie~~-~~~~~~~~~~~li~~~ 307 (484) ++...+... .+.|-.+.+.+...++++.+++.+.+.+...|.. + .++++.+ .+++-+ ..+.....|.+..++. T Consensus 194 ~~~~~~~~n--g~~~~~il~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d~q~~e~~~~~ 271 (406) T protein:vir:95 194 ATKKSFMSG--KYMPSLIVKVDAATAELSSEEGRNAVFKKYLQATEAGQPWIIPAELLEVEQVKPLSLKDIAINEAVELD 271 (406) T ss_pred HHHHHHHhc--cCCcceEEEeCCCCCHHHHHHHHHHHHHHhccccccCCceeecCCCccccccccCChhHHHHHHHHHHH Confidence 888888875 3678677777777888888888888877655532 2 3456554 444332 2343445677888999 Q ss_pred HHHHHHHHhhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC--C-CCc Q lcl|NC_021302. 308 DHQMALVALAHFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE--I-GSR 384 (484) Q Consensus 308 d~~Isk~ilGqtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~--~-~~~ 384 (484) .++|++++.-..-..+. ++.. .+........-+.-.++.|++.||+.|+. +.+. .|+|+. . ..| T Consensus 272 ~~~Ia~~fgVp~~~lg~--~~~~-~~~~~~~~~~~l~P~~~~ie~~l~~~l~~--------~~~~--~~~fd~~~l~~~d 338 (406) T protein:vir:95 272 KRTVAGMFGVPAFLLGI--GEFN-RDEYNNFINSTILPIAKGIEQELTRKLLI--------SPDL--YFKFNPRSLYAYD 338 (406) T ss_pred HHHHHHHhCCCHHHcCC--CCch-HHHHHHHHHHHHHHHHHHHHHHHHHhcCC--------CCCc--EEEeechhhhcCC Confidence 99999997655422221 1221 22334455566677777888877775543 2222 455543 2 247 Q ss_pred HHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccccccccccccccc Q lcl|NC_021302. 385 QDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKR 463 (484) Q Consensus 385 ~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 463 (484) .++.++.+.+|.+.|+.. .+++|+.+|+|.-++++....+.-. .+..... .. ...++......+.... T Consensus 339 ~~~~~~~~~~l~~~G~~t-----~NE~R~~~gl~p~~~gd~~~~~~n~--~~~~~~~--~~--~~~k~g~~~~~~~~~~ 406 (406) T protein:vir:95 339 LKELAEVGSNMYVRGIME-----GNEVRDWLGLSPKEGLSELVILENY--IPLDKIG--DQ--SKLKGGDNSGADGQTD 406 (406) T ss_pred HHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCcceeeeccCc--cchhhcc--cc--cccCCCCCCCCCCCCC Confidence 788999999999999864 4789999999865554443322100 0000000 00 0000000000000000 No 85 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.63 E-value=4.7e-14 Score=93.67 Aligned_cols=405 Identities=13% Similarity=0.058 Sum_probs=213.3 Q ss_pred ccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHH Q lcl|NC_021302. 8 PRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGARPEVVEH 87 (484) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~ 87 (484) =...-++.+..-+.|+..-...-.... ...+ ....++ .+-++.+-+..++++--.-.++..|.|.-.+.+++..+. T Consensus 1 ~~~~D~~~~~~~~~g~~~~~~~~~~~~-~~~~--~~~~l~-a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~d~~~~~~~~ 76 (437) T protein:vir:52 1 MKFFDGIKSLALKLGSKQEQTYYSPSL-SLTD--DLVQLE-ALWRDNWIANKVCIKRPEDMVRNWREIYSNDLNSKQLDL 76 (437) T ss_pred CchhhhhHhHHhcCCCccccceeecCc-cccc--cHHHHH-HHHHhCchhhHHhhcchHHhhcCCceEecCCCCHHHHHH Confidence 011122333222222211000000000 1111 122333 334678999999999988889999999765555555555 Q ss_pred HHHHHHhhhccchhhhhHHHhhcCCC-HHHHHHHHHHHHhhcceeeeEEEeecCC--------eeeeeeeeeeCccceee Q lcl|NC_021302. 88 VAACLGLPVEGDESDKPTPRTRGRFS-WDQHLRLALKSLQFGHAVFEQTYFYEGG--------RFWLKRLAPRPQSSIAY 158 (484) Q Consensus 88 ~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~i~~~l~a~~~G~s~~Eivw~~~~g--------~~~~~~l~~r~~~~~~~ 158 (484) +.+.+.. .. |..+...+..+.+||=+++=++ .++. .-.++.|...++.++.. T Consensus 77 ~~~~~~~-----------------l~~~~~l~~a~~~~rl~G~a~i~i~--~d~~~~~~pl~~~~~~~~~~v~~~~~v~~ 137 (437) T protein:vir:52 77 FTKFERS-----------------LKLRETLTKALQWSSLYGSVGLLVV--TDSQNTSAPLKPTERLKRLIILPKWKISP 137 (437) T ss_pred HHHHHHh-----------------hcHHHHHHHHHHhcccccceEEEEE--ecCCCcccccccCCceeEEEEechhhccc Confidence 5544422 12 4445555556999997765443 2221 12345566666544421 Q ss_pred eeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeec---CccCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 159 WNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHD---MDPGVWTGNSLLRPAYKNWKLKDELIRIE 235 (484) Q Consensus 159 ~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~---~~~~~p~G~gll~~~~~~~~~K~~~~~~w 235 (484) ... .+ .......++.+..+..........+.+.+++++... ....+.+|.|++..+|....--......= T Consensus 138 ~~~-~~------~dp~s~~fg~p~~y~v~~~~~~~~iH~SRii~~~~~~~~~~~~~~~G~s~le~~~~~i~~~~~~~~~~ 210 (437) T protein:vir:52 138 TGT-KD------DDVLSPNFGRYSEYSILGGSQSITVHHSRLIILNANDAPLSDNDIWGVSDLEKIIDVLKRFDSASVNV 210 (437) T ss_pred ccc-cc------ccccccccCcceEEEEecCCcceeEccceeEEecCccCCCccccccCCchHHHHHHHHHHHHHHHHHH Confidence 111 00 111233455666666665656677889998888643 24567889999999997776555555555 Q ss_pred HHHHHHhcCCcceEEecC----CCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHH Q lcl|NC_021302. 236 AAAIRRHGIGVPYLKGNE----ADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQM 311 (484) Q Consensus 236 ~~f~Er~~~G~P~~~gk~----~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~I 311 (484) ...+.++ .++++.-+- -....++.+.+..+.+..++++ ...+++..+.+++.++.+-+ .-..+++..-.+| T Consensus 211 ~~l~~~~--~~~v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~~~d~~~~~e~~~~~~s--gl~~~l~~~~~~i 285 (437) T protein:vir:52 211 GDLIFES--KIDIFKIAGLSDKIAAGMENEVASVISAVQEIKSA-TNSLLLDAENEYDRKELTFT--GLKDLLTEFRNAV 285 (437) T ss_pred HHHHHHc--CCCceecchHHHHhcCCcHHHHHHHHHHHHHhcCC-CceEEEcCCcceEEEecCcC--CHHHHHHHHHHHH Confidence 5666664 455543321 0111244555666666666554 46778888989998876533 3456666777788 Q ss_pred HHHHhhhh-hcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCC-Cc----- Q lcl|NC_021302. 312 ALVALAHF-LNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIG-SR----- 384 (484) Q Consensus 312 sk~ilGqt-lt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~-~~----- 384 (484) |.+.--+. ...+...|+.|.|+-....+-+.+++.......-+.+.|++.|+.--|+...+--.|+|...- .+ T Consensus 286 aaa~~iP~t~L~G~s~~Glasge~D~~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~~~~~~~~~f~pL~~~s~keka 365 (437) T protein:vir:52 286 AGAADMPVTILFGQSVSGLASGDEDIQNYHEAIRRLQETRLRPIFEIIDPLICNELFGGLPADWWFEFVPLTTVKQEQQI 365 (437) T ss_pred HHHhcCchhhhcCcCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCcCCcCHHHHH Confidence 87744432 112222344577777788889999998876555555668887777777754332366675422 22 Q ss_pred --HHHHHHHHHHHHhcCcccCCcccHHHHHHHh---C----CCCCCCCcccccccCCCcCCCccccCCCCcccccccccc Q lcl|NC_021302. 385 --QDATAAALQMLVNAGLLTPDPRLEAFLRDAA---G----LPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTT 455 (484) Q Consensus 385 --~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~---g----lp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (484) .+..+++++++++.|+..+ +.+++++ | ||.....+..+..........++.. .+.+... T Consensus 366 e~~~~~a~a~~~~~~~g~i~~-----~e~r~~L~~~g~~~~i~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~ 433 (437) T protein:vir:52 366 NMLNTFATAANTLIQNGVLNE-----YQIANELRESGLFANISAEHIEELKNADEFAGNFEEPEKM-------EGAQVQN 433 (437) T ss_pred HHHHHHHHHHHHHHhcCCCCH-----HHHHHHHHhcCCCCCCCccccccccCCCCCCCccCCCCCC-------CCCCCCC Confidence 2456677888999997654 3455544 1 2211100000000000001111111 1111111 Q ss_pred cccc Q lcl|NC_021302. 456 NAPQ 459 (484) Q Consensus 456 ~~~~ 459 (484) .+++ T Consensus 434 ~~~~ 437 (437) T protein:vir:52 434 SEDQ 437 (437) T ss_pred CCCC Confidence 1111 No 86 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=99.61 E-value=2.3e-14 Score=95.39 Aligned_cols=417 Identities=11% Similarity=0.021 Sum_probs=219.0 Q ss_pred CCCCCCCccc--eee--eec-----ccc---cchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHh Q lcl|NC_021302. 1 MAPKTVAPRT--ERG--YVN-----PLA---GFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPI 68 (484) Q Consensus 1 ~~~~~~~~~~--~~~--~~~-----~~~---~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v 68 (484) .+-...+++. +.. ..+ ..+ +++......+..-... .........+-.+...+-+.|.+|+..+-..| T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-~~~~~~g~~v~~~~a~~~~~v~~~i~~Ia~~i 86 (466) T protein:vir:81 8 LSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPST-ELAPDTFVGLATQAYQANGPVFACMLVRQLVF 86 (466) T ss_pred hhccCcccccchhhhhhhhhhhhccccccccccccHHHHHhhccccc-cccCccccccchhhhhccHHHHHHHHHHHHhh Confidence 1111111111 100 000 000 0110011100000000 00000112222222235789999999999999 Q ss_pred hCCCcEEecCCCCH--HHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecC----- Q lcl|NC_021302. 69 RRTDWRIRPNGARP--EVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEG----- 140 (484) Q Consensus 69 ~~~~~~v~p~~~~~--e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~----- 140 (484) .+++|.|....+.. ++.......|. .+-.....+.++++.++ +.+.+|.+.+++++...+ T Consensus 87 a~lp~~~~~~~~~~~~~~~~~~~~~L~------------~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~ 154 (466) T protein:vir:81 87 SSVRFRWQRLRDGKPSDTFGSRDLQIL------------ETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPD 154 (466) T ss_pred ccCceEEEEecCCceeeccccHHHHHh------------hCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCccccccc Confidence 99999986543221 11111111110 01112234667777776 688899999999874321 Q ss_pred CeeeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecC-ccCccccchhHH Q lcl|NC_021302. 141 GRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDM-DPGVWTGNSLLR 219 (484) Q Consensus 141 g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~-~~~~p~G~gll~ 219 (484) ..-.+..+.+.+|..+. ...+.++..........+. .........++....|++++.. ..+..+|.|.+. T Consensus 155 ~~g~~~~l~~l~~~~v~-~~~~~~~~~~~~y~~~~~~--------~~~~~~~~~~~~~dviHir~~~~~~d~~~G~s~i~ 225 (466) T protein:vir:81 155 WVDVVVEERMVRGGRGE-LGGGQLGWRKVGYLYTEGG--------RQSGNESVGFLAEDVVHFAPIPDPLASYRGMSWLT 225 (466) T ss_pred cCcceeEEEEecCcceE-EEEcCCCceEEEEEEEecC--------cccccceeeeccccEEEEcCCCCcccccccccHHH Confidence 11336788888888774 3445555432211110000 0112244568888888887653 345578999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCC-c--eEEEccCCceEEEecccCC Q lcl|NC_021302. 220 PAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGE-S--AGLALTAGEEAGILSPNGT 296 (484) Q Consensus 220 ~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~-~--a~~vip~~~~ie~~~~~~~ 296 (484) .+....-.-....++-..+.... +.|-.+.+++...++++++++.+.+.+...|. + ..++++.|++++-++.+.. T Consensus 226 ~~~~~i~~~~a~~~~~~~~f~ng--~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~ 303 (466) T protein:vir:81 226 PILREIRADQAMSKHQAKFFDNG--ATVNLVIKHNPMADPAAVKKWADEVNSKHAGVDNAWKNLNLYPGADADVVGSNLQ 303 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHhcC--CCcceEEecCCCCCHHHHHHHHHHHHHHhcCccccccceEcCCCceEEEccCChh Confidence 99988877777888888888853 67867778888888999999999998876552 2 2578999999988887666 Q ss_pred chhHHHHHHHHHHHHHHHHhhhhhcc----cccccchhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc Q lcl|NC_021302. 297 PLDPRRAIEYHDHQMALVALAHFLNL----DGKGGSYALASVQ-ADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDE 371 (484) Q Consensus 297 ~~~~~~li~~~d~~Isk~ilGqtlt~----~~~gGs~A~~evh-~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~ 371 (484) ...|.+..++..++|++++.-...-. +...++++-.+-. ......-+.-.++.|+..||+.|++. .... T Consensus 304 d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~~~st~sn~eq~~~~f~~~tl~P~~~~ie~~l~~~L~~~------~~~~ 377 (466) T protein:vir:81 304 EIDFKNVRGGGETRIAAAAGVPPVIVGLSEGLAAATYSNYGQARRRLADGTAHPLWQNLSGCIGHVMPDM------GPDV 377 (466) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHcccccCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCc------ccCc Confidence 66788999999999999975543221 1112344433333 33446667788889999998866542 1211 Q ss_pred ccceEEecCCC---CcHHHHHH-------HHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccc---cCCCcCCCc Q lcl|NC_021302. 372 PAPLLVFDEIG---SRQDATAA-------ALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDES---TADTGQDEP 438 (484) Q Consensus 372 ~~P~~~~~~~~---~~~~~~ae-------~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~---~~~~~~~~~ 438 (484) . -+|+|+..+ .|.+..++ .++.+++.|+ . .+++|+...- .+...... .+....+.. T Consensus 378 ~-~~~~f~~~~llr~d~~~r~~~~~~~~~~~~~~~~~g~-t-----~nE~r~~~~~----gd~~~~~~~~~~~~~~~~~~ 446 (466) T protein:vir:81 378 R-LWYDADDVPFLREDEKDAADIQKVRAETINTLITAGY-E-----PESVVAAVNS----GDLRLLKHTGLTSVQLLPPG 446 (466) T ss_pred c-eEEEecchhhhccCHHHHHHHHHHHHHHHHHHHHcCC-C-----hhhccccccC----CccccccCCCcchhhhcccc Confidence 2 256665432 35554443 3677888886 2 2455532211 00000000 000000001 Q ss_pred cccCCCCccccccccccccc Q lcl|NC_021302. 439 ETDEPALPNTSGTTSTTNAP 458 (484) Q Consensus 439 ~~~~~~~~~~~~~~~~~~~~ 458 (484) +......++.+.++...++. T Consensus 447 ~~~~~~~~~~~~~Gg~~ngn 466 (466) T protein:vir:81 447 VSASASSDTPTSGGADDNGN 466 (466) T ss_pred cccccCCCCcccCCCCcCCC Confidence 11111111111111111111 No 87 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=99.61 E-value=4.5e-14 Score=93.77 Aligned_cols=372 Identities=13% Similarity=0.084 Sum_probs=203.3 Q ss_pred chh-hhhhh-cccc----ccccccc--ccchHH-HHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC---CH-HHH-H Q lcl|NC_021302. 21 FGT-FLAQG-LDQF----EQVDELR--WPNSVY-TYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA---RP-EVV-E 86 (484) Q Consensus 21 ~~~-~~~~~-~~~~----~~~~~lr--~~~~~~-~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~---~~-e~~-~ 86 (484) ||. .|... +.+. .+...+. ...... .-+..+ +-+.|.+|+......|.+++|+|..... +. ... . T Consensus 1 mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~-~~~~v~~cv~~Ia~~ia~~p~~v~~~~~~~~~~~~~~~~ 79 (403) T protein:vir:10 1 MGFKSWITEKLNPGQRIIRDMEPVSHRTNRKPFTTGQAYS-KIEILNRTANMVIDSAAECSYTVGDKYNIVTYANGVKTK 79 (403) T ss_pred CcchhhhhhccchhhhhhhcccccccccCCcccccHHHHH-HHHHHHHHHHHHHHHHhhCceeEeecccccccccccccc Confidence 331 11110 0000 0000010 000000 112233 4688999999999999999999853221 11 111 1 Q ss_pred HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecCCC Q lcl|NC_021302. 87 HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDG 165 (484) Q Consensus 87 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg 165 (484) -....|. .+-.......++.+.++ +.+.+|-+.+++. + ..+.++|+..+. ...+.++ T Consensus 80 ~l~~lL~------------~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~----~-----~~l~~l~~~~~~-v~~~~~~ 137 (403) T protein:vir:10 80 TLDTLLN------------VRPNPFMDISTFRRLVVTDLLFEGCAYIYWD----G-----TSLYHVPAALMQ-VEADANK 137 (403) T ss_pred hHHHHHh------------hCCCCCCCHHHHHHHHHHHHhhcCCeEEEEe----C-----ceeEeecCcceE-EEEcCCc Confidence 1111111 01112334667777765 6778998876541 1 234455665543 2333332 Q ss_pred ceeeeecccccccccccceeccCCCCcccccccceEEEeecC----ccCccccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 166 GLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDM----DPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRR 241 (484) Q Consensus 166 ~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~----~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er 241 (484) ...... ...+..+++...++++... ..+.++|.+.+..+....-.-....++-..|... T Consensus 138 ~~~~~~-----------------~~~~~~~~~~eiih~~~~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~n 200 (403) T protein:vir:10 138 FIKKFI-----------------FNNQINYRVDEIIFIKDNSYVCGTNSQISGQSRVATVIDSLEKRSKMLNFKEKFLDN 200 (403) T ss_pred eEEEEE-----------------ecCceeecccceEEecccccccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 221111 1122345566666655332 2366889999999888887777777777777763 Q ss_pred hcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCc-e--EEEccCCceEEEecccCC--chhHHHHHHHHHHHHHHHHh Q lcl|NC_021302. 242 HGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGES-A--GLALTAGEEAGILSPNGT--PLDPRRAIEYHDHQMALVAL 316 (484) Q Consensus 242 ~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~-a--~~vip~~~~ie~~~~~~~--~~~~~~li~~~d~~Isk~il 316 (484) .+.|-.+.+++...++++++++.+.+.+..+|.. + .++++.|++++.++.+.+ ...|.+..++..++|++++. T Consensus 201 --g~~~~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fg 278 (403) T protein:vir:10 201 --GTVIGLILETDEILNKKLRERKQEELQLDYNPSTGQSSVLILDGGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFG 278 (403) T ss_pred --cCCcceEEEeCCCCCHHHHHHHHHHHHHHhCCcccCcceeecCCCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhC Confidence 3678777788888899999999999988765532 2 578999999988774333 34588888999999999866 Q ss_pred hhhhccc-ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC-----CCCcHHHHHH Q lcl|NC_021302. 317 AHFLNLD-GKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE-----IGSRQDATAA 390 (484) Q Consensus 317 Gqtlt~~-~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~-----~~~~~~~~ae 390 (484) -.....+ +.+++. .+........-+.-.++.|++.||+.|. ++|.|+. ...|.+..++ T Consensus 279 VPp~~lg~~~~sn~--e~~~~~f~~~tl~P~~~~ie~~l~~~L~--------------~~~~~d~~~~~~l~~D~~~~~~ 342 (403) T protein:vir:10 279 VPQVLLDGGNNANI--RPNIELFYYMTIIPMLNKLTSSLTFFFG--------------YKITPNTKEVAALTPDKEAEAK 342 (403) T ss_pred CCHHHcCCCCCcCH--HHHHHHHHHHHHHHHHHHHHHHHHHhcC--------------ceeeeccchhhhcccCHHHHHH Confidence 5543332 222222 2223344555666778888888887541 2344442 2346788899 Q ss_pred HHHHHHhcCcccCCcccHHHHHHHhCCCCCCC-CcccccccCCCcCCCccccCCCCccccccccccccccc Q lcl|NC_021302. 391 ALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDP-DADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQA 460 (484) Q Consensus 391 ~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~-~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (484) +++++++.|+.. .+++|+.+|+|.-++ +-+..-.+.+.... ..+......+++ .++..+. T Consensus 343 ~~~~~~~~G~lT-----~NE~R~~~gl~pi~~~~~d~~~~p~n~~~~----~~~~~~~e~~~~-~~~~~g~ 403 (403) T protein:vir:10 343 HLTSLVNNGIIT-----GNEARSELNLEPLDDEQMNKIRIPANVAGS----ATGVSGQEGGRP-KGSTEGD 403 (403) T ss_pred HHHHHHhCCCcC-----HHHHHHHhCCCCCCcccccccccccccccc----cccCCCCcCCCC-CCCcCCC Confidence 999999999864 489999999985432 11111111111100 000111111111 0000000 No 88 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=99.61 E-value=3.4e-14 Score=94.44 Aligned_cols=378 Identities=10% Similarity=0.043 Sum_probs=199.1 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) +.++-....++++......+.. ....+..+-.+...+-+.|.+|+..+...|.+++|+|...+. T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~v~~~~g 70 (394) T protein:vir:62 7 FSNYLFKKAEKRGYLDNVLGKS----------------IRYSGVYVTDSNILQSSDVYELLQDISNQMVLADIVVEDEFG 70 (394) T ss_pred hhhhccCCCCchhhhhhhhhcc----------------cccCccccChhhhhccHHHHHHHHHHHHhhcccceEEEcCCC Confidence 1111111111111111100000 000111122222224688999999999999999999975433 Q ss_pred CHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeee Q lcl|NC_021302. 81 RPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYW 159 (484) Q Consensus 81 ~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~ 159 (484) . ++.+.....| ..+-....++.++++.++ +.+.+|-+.+.+. . +....+..+ .. T Consensus 71 ~-~~~~~~~~~L------------l~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~--~-~~~~~~~~~---------~~ 125 (394) T protein:vir:62 71 N-EIKDDIALQI------------LRNPNNYLTQSEFIKLMTNTYLLEGETFPILN--G-AQIHLASNV---------FT 125 (394) T ss_pred c-ccchhhHHHH------------hccCCCCCCHHHHHHHHHHHHHhcCCeEEEEe--c-ceeeccccc---------eE Confidence 2 2211111111 001112234566666554 6788999988763 1 111111111 11 Q ss_pred eecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 160 NVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAI 239 (484) Q Consensus 160 ~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~ 239 (484) ..++++... ....+..+|.+.+++.++.. .+..+|.|.+..+....-.-....++...+. T Consensus 126 ~~~~~~~~~-------------------~~~~~~~~~~~eiih~r~~~-~d~~~G~s~~~~~~~~i~~~~~~~~~~~~~~ 185 (394) T protein:vir:62 126 ELDDNLVEH-------------------FNIGGHEIPPCMIRHVKNIG-ADHLRGKGILDLGRDTLEGVMSAEKTLTDKY 185 (394) T ss_pred EECCceEEE-------------------EeeCCEEechhheEEecCcC-CCCccccChHHHHHHHHHHHHHHHHHHHHHH Confidence 222222211 01124567888888877654 4457899999999877777777777888888 Q ss_pred HHhcCCcceEEecCCC--CCCHHHHHHHHHHHHHHhcCC-c--eEEEccCCceEEEecccCC--chhHHHHHHHHHHHHH Q lcl|NC_021302. 240 RRHGIGVPYLKGNEAD--SEDDDRMDELLEIASNYSGGE-S--AGLALTAGEEAGILSPNGT--PLDPRRAIEYHDHQMA 312 (484) Q Consensus 240 Er~~~G~P~~~gk~~~--~~~~~~~~~l~~~l~~~~~g~-~--a~~vip~~~~ie~~~~~~~--~~~~~~li~~~d~~Is 312 (484) ... +.|-.+.+.+. ..+++.++++.+.+.+...|. + ..+++|.|.+++++..+.+ ...|.+..++..++|+ T Consensus 186 ~ng--~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia 263 (394) T protein:vir:62 186 KKG--GLLTFLLNLDAHINPQNGAQSKLINAILDQLESIDEARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLG 263 (394) T ss_pred Hcc--CCcceEEEeCCCCCcCHHHHHHHHHHHHHHhccccccCceeEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHH Confidence 753 67755555543 334556677777777765542 2 2367888888877654433 3457788888999999 Q ss_pred HHHhhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCC-CcHHHHHHH Q lcl|NC_021302. 313 LVALAHFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIG-SRQDATAAA 391 (484) Q Consensus 313 k~ilGqtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~-~~~~~~ae~ 391 (484) +++.-.....+...+|. ..+........-+.-.++.|+..||+.|+.+- ....-+|+|+... .+.+..+++ T Consensus 264 ~~fgVPp~~lg~~~~sn-~e~~~~~~~~~~l~P~~~~ie~~l~~kll~~~-------~~~~~~~~fd~~~~~~~~~~~~~ 335 (394) T protein:vir:62 264 KFLGINVDTYTELIKED-IEKAMMYIHNKAVRPIMKNFEDHLSLLFYAQN-------SGKRIKFKINILDFVTYSNKTNI 335 (394) T ss_pred HHhCCCHHHcCCCCCcC-HHHHHHHHHHHHHHHHHHHHHHHHhhhhcCcc-------ccCceEEEechhhhcCHHHHHHH Confidence 99766543332111222 22233344456677788888888887665431 1112367776433 345677889 Q ss_pred HHHHHhcCcccCCcccHHHHHHHhCCCCCCCCc-ccccccCCCcCCCccccCCCCccccccccccc Q lcl|NC_021302. 392 LQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDA-DDDESTADTGQDEPETDEPALPNTSGTTSTTN 456 (484) Q Consensus 392 ~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 456 (484) +.++++.|+.. .+++|+.+|+|.-.++. +..-...+- .+......... +.+|.....+ T Consensus 336 ~~~~~~~g~~T-----~NE~R~~~gl~p~~~~~gd~~~~~~n~-~~~~~~~~~~~-~~kgge~~en 394 (394) T protein:vir:62 336 GYNLVRTAITS-----PDNVADMLGFPKQNTKESQAIYISNDV-TEIGKKEATDG-SLGGGEENEN 394 (394) T ss_pred HHHHHhCCCcC-----HHHHHHHhCCCCCCCCCCCeeeccccc-ccccccccccc-cCCCCCCCCC Confidence 99999999754 47899999998653221 111111000 11000000000 0011111111 No 89 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=99.61 E-value=1e-13 Score=91.80 Aligned_cols=430 Identities=13% Similarity=0.076 Sum_probs=213.6 Q ss_pred CCCCC-----CCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEE Q lcl|NC_021302. 1 MAPKT-----VAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRI 75 (484) Q Consensus 1 ~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v 75 (484) +|=++ +..+.+.+... .+......+ -++ |-...-+.++.+..+.|.+|+..+...|.+++|.+ T Consensus 6 ~~i~s~~~~~~i~~~~~~s~~----~~~~~~~~~--~~p------p~~~~~la~l~~~n~~v~scI~~ia~~IA~l~~~~ 73 (542) T protein:vir:41 6 LSIRSLEKYKAIKREEVESQA----LGETRFEEY--VEP------KVNPLVLLSLLQVNPYHASACSIKANDIIRTGYIL 73 (542) T ss_pred ccccccccchhhhhccccccc----cccccCCcc--ccC------CCCHHHHHHHHhhcHHHHHHHHHHHHHHhhCceee Confidence 11111 11111111100 000000000 011 12233455677788999999999999999999999 Q ss_pred ecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCcc Q lcl|NC_021302. 76 RPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQS 154 (484) Q Consensus 76 ~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~ 154 (484) .+.... . +...+ .....++.++++.++ +.+.+|.+.+++++...+ .+..|.+.++. T Consensus 74 ~~~~~~--~---l~~~l---------------pN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G---~~~~L~~l~~~ 130 (542) T protein:vir:41 74 EGDDEG--V---VDEFI---------------RACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRG---DPIRFEYIPSH 130 (542) T ss_pred ecccch--h---hhhhc---------------CCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCC---cEEEEEEEcCc Confidence 643221 1 11111 112346778888877 678899999999865433 46788888888 Q ss_pred ceeeeeecCCCceeeeeccccc----ccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHH Q lcl|NC_021302. 155 SIAYWNVDRDGGLISIQQWPAG----TFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDE 230 (484) Q Consensus 155 ~~~~~~~~~dg~l~~~~q~~~~----~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~ 230 (484) ++.. ..+.++ .+........ .+.....+....+.....+|....|++++....+.+||.|.+..+......-.. T Consensus 131 ~v~v-~~d~~~-~~~~~~~~~~~~~~~y~~~~~~~~~~g~~~~~~~~~eIiHir~~~~~~~~~Glspi~~~~~~i~~~~~ 208 (542) T protein:vir:41 131 TIRV-HKDGSR-YRQTWDGVNITHFKDYRYEGEINPETGEDQDSVGANELVFIHIPSPVCSYYGVPRYVSAAPAILAMQK 208 (542) T ss_pred ceEE-EEcCCe-eEeeecCCcceeEEeecccccccccccccccccCcccEEEecCCCCCCCcccccHHHHHHHHHHHHHH Confidence 7743 222211 1111111100 001111111222333445777788887777667789999999999988888888 Q ss_pred HHHHHHHHHHHhcCCcceEEecCC----------CCCCHHHHHHHHHHHHHHhcC----CceEEEcc------CCceEEE Q lcl|NC_021302. 231 LIRIEAAAIRRHGIGVPYLKGNEA----------DSEDDDRMDELLEIASNYSGG----ESAGLALT------AGEEAGI 290 (484) Q Consensus 231 ~~~~w~~f~Er~~~G~P~~~gk~~----------~~~~~~~~~~l~~~l~~~~~g----~~a~~vip------~~~~ie~ 290 (484) ..++-..|... .++|-.+.+.+ ...++++++.+.+.+.+...| ....++++ .|++++- T Consensus 209 ~~~~~~~~f~N--g~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~p 286 (542) T protein:vir:41 209 IDEYNYAFFDN--YTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTFTP 286 (542) T ss_pred HHHHHHHHHhc--cCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCcccceeEEE Confidence 88888888774 35675444332 234566667777666554322 11234443 4555555 Q ss_pred ecccCCchhHHHHHHHHHHHHHHHHhhhh-hcccccccch--hhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_021302. 291 LSPNGTPLDPRRAIEYHDHQMALVALAHF-LNLDGKGGSY--ALASVQA-DTFVQSVQTVADEIRDVAQAHVVEDIVDVN 366 (484) Q Consensus 291 ~~~~~~~~~~~~li~~~d~~Isk~ilGqt-lt~~~~gGs~--A~~evh~-~v~~~~~~aD~~~i~~~ln~qli~~l~~~N 366 (484) ++.+.....|.+..++..++|++++.-.. +.....++++ +-.+... ......+.-.++.|+..||+.|++. T Consensus 287 l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~----- 361 (542) T protein:vir:41 287 LNTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVTRRTYYESVVRPQQNIISSILTDFFQVK----- 361 (542) T ss_pred cCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc----- Confidence 55554455688888999999999975543 2222222322 3333333 3456667888999999999866553 Q ss_pred CCCccccceEEecCCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHh-CCCCCCCCcccccccCCCcCC------Ccc Q lcl|NC_021302. 367 WGEDEPAPLLVFDEIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAA-GLPGPDPDADDDESTADTGQD------EPE 439 (484) Q Consensus 367 f~~~~~~P~~~~~~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~-glp~p~~~e~~~~~~~~~~~~------~~~ 439 (484) ++. . -+|+|+..........+.++.+++.|+..+ +++|+.+ |++. .++ ....+.-..... ..+ T Consensus 362 ~~~-~--~~~~f~~~~ll~~d~~~~~~~~v~~GilT~-----NE~Re~L~g~~p-gdd-~~l~p~~~~~~~~~~~~~n~~ 431 (542) T protein:vir:41 362 FNP-K--TRFKFNDETLLESDSVRNCALLVQSGVLTP-----AEARERLFGLDG-GPD-IFMVPSKGAAKSVKRQERNYE 431 (542) T ss_pred cCC-c--eEEEecchhhcchHHHHHHHHHHhCCCCCH-----HHHHHhhCCCCC-CCc-cccccccccccccccCCcCCC Confidence 221 1 256775433222234456778899998653 6788864 6642 221 111110000000 000 Q ss_pred ccCC---CCcccccccc----ccccccccccccccchH-HHhcC---cccCcccC-----------C Q lcl|NC_021302. 440 TDEP---ALPNTSGTTS----TTNAPQARKRPRGRSPR-DRRKT---PDGAMPLW-----------D 484 (484) Q Consensus 440 ~~~~---~~~~~~~~~~----~~~~~~~~~~~~~~~~~-~~~~~---~~~~~~~~-----------~ 484 (484) .... ..-..++.+. .+....+.+.....+.+ ..+++ +-|+.-|. + T Consensus 432 ~~~~~~~~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (542) T protein:vir:41 432 KNQIREIRKIYAKYRPRFNEIISSKLSAEEKKKKIDESLAEFRAEAYEAGKKMLIIGGDMGSMSALN 498 (542) T ss_pred CCchhhhhhcccccCccccccccccccchhhcccccchhhhhHHhHHhcCceEEEeecCchhhhhhh Confidence 0000 0000000000 00000000000001100 11111 11111111 1 No 90 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=99.59 E-value=4.9e-14 Score=93.58 Aligned_cols=375 Identities=10% Similarity=0.015 Sum_probs=188.4 Q ss_pred chh-hhhhhccccc-c----cccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHh Q lcl|NC_021302. 21 FGT-FLAQGLDQFE-Q----VDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGL 94 (484) Q Consensus 21 ~~~-~~~~~~~~~~-~----~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~ 94 (484) ||. .++.++.... . .....+...-.++.+...+-+.|.+|+..+...|.+++|.+...++. ...-+...|.. T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~~~~~--~~~~~~~lL~~ 78 (395) T protein:vir:40 1 MGFKSWVSGFFNEEQRTLNLTDTVWCSIPSEKLKELSIKKWAIDSCANKIANTLSCAEVLTYEKGEE--VRKKNWYMFNV 78 (395) T ss_pred CchHHHHHhhhcccccccccccchhhccccccchhhhhhhHHHHHHHHHHHHHHhhCceeeccCCcc--ccchHHHHHHh Confidence 332 1111111100 0 00001111112233333346899999999999999999999654322 11112222211 Q ss_pred hhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecCCCceeeeecc Q lcl|NC_021302. 95 PVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQW 173 (484) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~ 173 (484) +-.......++++.++ +.+.+|.+.+.+. .++.+.+..+..... .. ... ...+ T Consensus 79 ------------~PN~~~t~~~f~~~~~~~lll~Gnay~~~~---~~~~~~~~~~~~~~~-~~-------~~~--~~~~- 132 (395) T protein:vir:40 79 ------------EANQNQNATEFWKKAIYKLVYDNEALIFMQ---DEYIYVADSFTKNDK-SL-------YEN--TYTE- 132 (395) T ss_pred ------------cCCCCCCHHHHHHHHHHHHhhcCceEEEEe---cCceeecCCcccccc-cc-------ccc--eeee- Confidence 1112334566766654 6777999886543 233222222111100 00 000 0000 Q ss_pred cccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecC Q lcl|NC_021302. 174 PAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNE 253 (484) Q Consensus 174 ~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~ 253 (484) ...........+|+...+++++....+.+++.++...+.-. +... ...+. +.+..-+.++.++ T Consensus 133 ----------v~~~~~~~~~~~~~~evih~r~~~~~~~~~~~~l~~~~~~~--~~~~----~~~~~-~~~~~~~~l~~~~ 195 (395) T protein:vir:40 133 ----------VTLKDLTLKKEFKESEVLHLTLNNESIKSIIDGFYLLYGDL--LTAA----VNKYK-KLNSRKIIVKLKA 195 (395) T ss_pred ----------eeecCceeeeeeccccEEEeecCCCCccccchhHHHHHHHH--HHHH----HHHHH-hcCCCCceEEEec Confidence 00011112235788888998888778888887776544321 1111 11222 2234456777777 Q ss_pred CCCCCHHHHHHHHHHHHHHh----cCCceEEEccCCceEEEecccCCchhHHHHH---HHHHHHHHHHHhhhhhcccccc Q lcl|NC_021302. 254 ADSEDDDRMDELLEIASNYS----GGESAGLALTAGEEAGILSPNGTPLDPRRAI---EYHDHQMALVALAHFLNLDGKG 326 (484) Q Consensus 254 ~~~~~~~~~~~l~~~l~~~~----~g~~a~~vip~~~~ie~~~~~~~~~~~~~li---~~~d~~Isk~ilGqtlt~~~~g 326 (484) +...+++..+++.+.+.+.. ++...+++++.|++++-++.+.....|.++- +.+-++|++++.-..--. | T Consensus 196 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l---~ 272 (395) T protein:vir:40 196 MFGQTPEAEEKLRLMLSERMKKFLAEGDSALPVEDGMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLA---K 272 (395) T ss_pred ccCCCHHHHHHHHHHHHHHHHHhhccCCceeecCCCceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHh---c Confidence 77777777777777666543 2333467899999998887765555565544 444579999966543222 2 Q ss_pred cchhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEec--CC-CCcHHHHHHHHHHHHhcCccc Q lcl|NC_021302. 327 GSYAL-ASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFD--EI-GSRQDATAAALQMLVNAGLLT 402 (484) Q Consensus 327 Gs~A~-~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~--~~-~~~~~~~ae~~~~L~~~G~~~ 402 (484) |+++- .+........-+.-.++.|++.||+.|++.--.. .+. .|+|+ .. ..|.+++++++.++++.|+.. T Consensus 273 ~~~sn~e~~~~~f~~~~L~P~~~~ie~~l~~kLl~~~~~~----~g~--~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t 346 (395) T protein:vir:40 273 GDTVGLSEQVNSFLMFSINPIAEMFTDEGNRKFYGRDSVL----ERT--YMKLDTTRIKVQDIQEIASSMDVLFHIGVNT 346 (395) T ss_pred CCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhc----CCc--eEEEechhhhccCHHHHHHHHHHHHhCCCCC Confidence 33332 2223344445667778888888888776632211 111 44554 32 347789999999999999754 Q ss_pred CCcccHHHHHHHhCCCCCCC--CcccccccCCCcCCCccccCCCCccccccccccc Q lcl|NC_021302. 403 PDPRLEAFLRDAAGLPGPDP--DADDDESTADTGQDEPETDEPALPNTSGTTSTTN 456 (484) Q Consensus 403 ~~~~~~~~i~e~~glp~p~~--~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 456 (484) .+++|+.+|+|.-++ ++....+ .+- .+..............+....+ T Consensus 347 -----~NE~R~~~g~~pi~~~~gD~~~~~-~n~-~~~~~~~~~~kgge~~~~~~~~ 395 (395) T protein:vir:40 347 -----IDDNLRMIGREPVMSPETQERFVT-KNY-APLGENEEDLKGGDINENKGDS 395 (395) T ss_pred -----HHHHHHHhCCCCCCCCCCceeeec-ccc-ccccccccccCCCCCCCCcCCC Confidence 478999999986533 2221111 010 0000000000000000000000 No 91 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=99.58 E-value=7.2e-14 Score=92.64 Aligned_cols=366 Identities=11% Similarity=0.049 Sum_probs=194.7 Q ss_pred chhhhhhhccccccccc-ccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccc Q lcl|NC_021302. 21 FGTFLAQGLDQFEQVDE-LRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGD 99 (484) Q Consensus 21 ~~~~~~~~~~~~~~~~~-lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~ 99 (484) ||..... +.....+.. +...-.-.++.....+.+.|.+|+..+...|.+++|++...+...+ .-+...|.. T Consensus 1 Mg~f~~~-f~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~~~--~~l~~lL~~----- 72 (385) T protein:vir:95 1 MGLFDSV-FKRHSELSWMYDLEFLQDKSKKAYLKQIALNTVVEMVARTISQSEFRVMKNNTKEK--GTLYYLLNV----- 72 (385) T ss_pred Cchhhhh-hccCcccccccchhhhhccchhhhhhhHHHHHHHHHHHHHHcccceeeeecCcccc--chHHHHHhc----- Confidence 3321100 000000000 0000001122232235789999999999999999999975443211 112222210 Q ss_pred hhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecCCCceeeeeccccccc Q lcl|NC_021302. 100 ESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTF 178 (484) Q Consensus 100 ~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~ 178 (484) +.....++.++++.++ +.+.+|.+.+.++ ++ +.+.+.....++.. ..... ..+ T Consensus 73 -------~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~--~~-~~~~~~~~~~~~~~-~~~~~---------------~~~ 126 (385) T protein:vir:95 73 -------RPNRNQNAVDFWQKFIFKLIMDNEVLVVKN--DE-GHFFVADDFEKEDE-LGLYS---------------HRF 126 (385) T ss_pred -------ccCcCCCHHHHHHHHHHHHhhcCceEEEEe--cC-CCeeeccccccccc-ccccc---------------ccc Confidence 1112335667777765 5778999986542 23 33333222222111 11000 000 Q ss_pred ccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc-eEEecCCCCC Q lcl|NC_021302. 179 GGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVP-YLKGNEADSE 257 (484) Q Consensus 179 ~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P-~~~gk~~~~~ 257 (484) . ............+|....+++++....+..+|.|++..+....- ..+.. .+++.+++ +++.+..... T Consensus 127 ~---~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~------~~~~~--~~~~~~~~g~l~~~~~~~~ 195 (385) T protein:vir:95 127 T---NVLVNDFEFKRVFTMDDVIYLKYNNQKLDAFSLGLFEDYGEIFG------RMIDL--QMLNNQIRGILKVDATKFY 195 (385) T ss_pred e---eeeecccceeeeeccccEEEecCCCCCcccccchHHHHHHHHHH------HHHHH--HHhcCCCceEEEeCCccCC Confidence 0 00111122234578888888888877778899999988764321 11221 22344444 3434444456 Q ss_pred CHHHHHHHHHHHHHHhcC----CceEEEccCCceEEEecccCC------chhHHHHHHHHHHHHHHHHhhhhhccccccc Q lcl|NC_021302. 258 DDDRMDELLEIASNYSGG----ESAGLALTAGEEAGILSPNGT------PLDPRRAIEYHDHQMALVALAHFLNLDGKGG 327 (484) Q Consensus 258 ~~~~~~~l~~~l~~~~~g----~~a~~vip~~~~ie~~~~~~~------~~~~~~li~~~d~~Isk~ilGqtlt~~~~gG 327 (484) +++..+++.+.+.++.+| ....++++.|++++-++.... ...|.+..++...+|+++..-..-.. +| T Consensus 196 ~~e~~~~~~~~~~~~~~g~~~~~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l---~~ 272 (385) T protein:vir:95 196 NKEKQKELQAYIDTLFDAFQNNTIAVVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLV---LG 272 (385) T ss_pred CHHHHHHHHHHHHHHhhhhhhcCCceEEcCCCceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHh---cC Confidence 777777888887776443 233577999999987764322 23588889999999999976643222 23 Q ss_pred chhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEec--C-CCCcHHHHHHHHHHHHhcCcccC Q lcl|NC_021302. 328 SYALAS-VQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFD--E-IGSRQDATAAALQMLVNAGLLTP 403 (484) Q Consensus 328 s~A~~e-vh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~--~-~~~~~~~~ae~~~~L~~~G~~~~ 403 (484) +++-.+ ........-+.-.++.|+..||+.|+++--..+ .+|+|+ . ...|.++++++++++++.|+.. T Consensus 273 ~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~L~~~~~~~~-------~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt- 344 (385) T protein:vir:95 273 EMADLEKTIESYLQFCINPLLRKIEAELNSKFFYQDEYLN-------DDMHIKVVGIDKRDPLKLSEAIDKLVASGTFT- 344 (385) T ss_pred CCcCHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhccc-------ceEEEechhhhccCHHHHHHHHHHHHhCCCcC- Confidence 444323 334555667778889999999988877532221 145554 3 2357788999999999999764 Q ss_pred CcccHHHHHHHhCCCCCC--CCcccccccCCCcCCCccccCCCCcccccc Q lcl|NC_021302. 404 DPRLEAFLRDAAGLPGPD--PDADDDESTADTGQDEPETDEPALPNTSGT 451 (484) Q Consensus 404 ~~~~~~~i~e~~glp~p~--~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (484) .+++|+.+|+|+-. .++....+ .+-.. .. ........++ T Consensus 345 ----~NE~R~~~g~~p~~~~~gd~~~~~-~n~~~---~~-~~kgge~~~e 385 (385) T protein:vir:95 345 ----RNQVRIMTGEEPADDPELDKFIIT-KNLQS---AD-AFKGGESNEE 385 (385) T ss_pred ----HHHHHHHhCCCCCCCCCCceeeec-cccee---cc-cccCCCCCCC Confidence 47899999997532 22222111 00000 00 0000000111 No 92 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=99.58 E-value=1.3e-13 Score=91.18 Aligned_cols=440 Identities=8% Similarity=-0.018 Sum_probs=200.9 Q ss_pred CCCCCCCcccee-eeecccccchhhhhh--hcccccccccccc--cch--H-HHHHHHHhcchHHHHHHHHHHHHhhCCC Q lcl|NC_021302. 1 MAPKTVAPRTER-GYVNPLAGFGTFLAQ--GLDQFEQVDELRW--PNS--V-YTYTRMCREEARIASVLRAIGLPIRRTD 72 (484) Q Consensus 1 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~--~~~~~~~~~~lr~--~~~--~-~~y~~m~~~D~~v~s~l~~r~~~v~~~~ 72 (484) |-|.-.+...+. |+-...+... ++.. .+........++. ... + ..-+-+ .++..+.+|+......+.+++ T Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~l~~~~~~~~~~~~~i~t~-~~~va~~~~i~~~s~~~~~~~ 113 (535) T protein:vir:10 36 IRPGRASARDTVDGIDIADGNVA-GQYSVASISDVLSTKKLLKAYADNDIVQAIIRTR-TNQVLTYSNPSRYNRNGVGFK 113 (535) T ss_pred hhhhhhhhhccccccccccCCcc-cccccCccccccCHHHHHHHhccChhHHHHHHHH-HHHHHHHHHHHHHhcccCcce Confidence 333222211111 1000000000 0000 0000000000100 000 0 001112 123444455554444455556 Q ss_pred cEEecCCCC---HHHHHH--HHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhc-ceeeeEEEeecCCeeee Q lcl|NC_021302. 73 WRIRPNGAR---PEVVEH--VAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFG-HAVFEQTYFYEGGRFWL 145 (484) Q Consensus 73 ~~v~p~~~~---~e~~~~--~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G-~s~~Eivw~~~~g~~~~ 145 (484) +.+.-...+ .++++. +...|... -+.. ......|..+++.++ +.+.+| .+..++++. ..| .+ T Consensus 114 i~l~~~~~~~~~~~~~~~~~l~~lL~~~-PN~~-------~~~~~~~~~~~~~lv~d~l~~~g~ay~~i~r~-~~G--~~ 182 (535) T protein:vir:10 114 VELKDATKVMSKAQIKRAHEIEDFIYNT-GSEY-------YEWRDTFPRLLTKIINDMYVQDQINIERIFKN-DSN--EL 182 (535) T ss_pred eEEEeccCCCcchhhhhhhHHHHHHHhC-CCCC-------CChhHHHHHHHHHHHHHHHhhCCceEEEEEEC-CCC--cE Confidence 555422211 111111 11112100 0000 001113556777766 455565 456655433 334 36 Q ss_pred eeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCc---cCccccchhHHHHH Q lcl|NC_021302. 146 KRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMD---PGVWTGNSLLRPAY 222 (484) Q Consensus 146 ~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~---~~~p~G~gll~~~~ 222 (484) ..|.+++|.++. +..+.++.- ..........+.....++...+|++++... .+.+||.|.+..+. T Consensus 183 ~~L~~l~p~~V~-v~~d~~~~~-----------~~~~~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~~~~G~Spi~~~~ 250 (535) T protein:vir:10 183 DHFNAVDASKVV-ISYSPRSKD-----------QPRKFEQFVSETKSVKFSERNLTFINYWNLSDTDRRGYGYSPVEASI 250 (535) T ss_pred EEEEEeCCceeE-EEEcCcccc-----------CceEEEEEecCceeEEECcccEEEEeccCCCCcccccccccHHHHHH Confidence 789999998875 334433321 011122223344556788889888887553 33578999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCcceEEecCC----CCCCHHHHHHHHHHHHHHhcCCc-eE-EEcc--CCceEEEeccc Q lcl|NC_021302. 223 KNWKLKDELIRIEAAAIRRHGIGVPYLKGNEA----DSEDDDRMDELLEIASNYSGGES-AG-LALT--AGEEAGILSPN 294 (484) Q Consensus 223 ~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~----~~~~~~~~~~l~~~l~~~~~g~~-a~-~vip--~~~~ie~~~~~ 294 (484) ...-.-....++-..|...- +.|-.+.+++ ...++++++++.+.+.+..+|.. ++ +.+. .|++++-++.+ T Consensus 251 ~~i~~~~aa~~~~~~~f~ng--~~p~giL~~~~~~~~~ls~e~~e~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~ 328 (535) T protein:vir:10 251 PLIRAIYDTEQFNARFFSQG--GTTRGILVIDQDGDAQANQMMLAGIRRQWTSQGSGLGGAWKIPILAAKDAKFVNMTQN 328 (535) T ss_pred HHHHHHHHHHHHHHHHHhcc--CCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCcccccccccccCCCceEEecCCC Confidence 88888888888888888853 5664444442 23567888999999888766632 22 1232 56666555555 Q ss_pred CCchhHHHHHHHHHHHHHHHHhhhh-hcccccccchh------------hHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 295 GTPLDPRRAIEYHDHQMALVALAHF-LNLDGKGGSYA------------LASVQADT-FVQSVQTVADEIRDVAQAHVVE 360 (484) Q Consensus 295 ~~~~~~~~li~~~d~~Isk~ilGqt-lt~~~~gGs~A------------~~evh~~v-~~~~~~aD~~~i~~~ln~qli~ 360 (484) .....|.+..++..++|++++.-.. +....+.++|+ ..+..... ...-+.-.++.|+..||+.|++ T Consensus 329 ~~D~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ln~~Ll~ 408 (535) T protein:vir:10 329 SRDMEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQVINDKIMR 408 (535) T ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Confidence 5555688888999999999976543 22222223322 11222222 2345677888899999988775 Q ss_pred HHHHhCCCCccccceEEecCCC-CcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccc-cCC---CcC Q lcl|NC_021302. 361 DIVDVNWGEDEPAPLLVFDEIG-SRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDES-TAD---TGQ 435 (484) Q Consensus 361 ~l~~~Nf~~~~~~P~~~~~~~~-~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~-~~~---~~~ 435 (484) . ++. . -+|.|+... .+.+..+++.+... .|.. ..+++|+.+|+|.-+.++..... +.. ..+ T Consensus 409 ~-----~~~--~-~~f~f~~l~~~d~~~r~~~~~~~~-~g~l-----T~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~ 474 (535) T protein:vir:10 409 Y-----VDT--D-YRFSFTLGDAQDKLQEEQVWKLKL-ANGY-----FINEYRKDHGLKTVDGLDVPGFIGSAENFINAT 474 (535) T ss_pred c-----cCC--e-EEEEeccccccCHHHHHHHHHHHH-cCCC-----CHHHHHHHhCCCCCCCccccccccchhhccccc Confidence 3 221 1 267776533 45566666665444 4543 35899999999876655431110 000 000 Q ss_pred CCccccCCCCccccccccc----ccccc-ccccccccchH-------HHhcC---cccCcc Q lcl|NC_021302. 436 DEPETDEPALPNTSGTTST----TNAPQ-ARKRPRGRSPR-------DRRKT---PDGAMP 481 (484) Q Consensus 436 ~~~~~~~~~~~~~~~~~~~----~~~~~-~~~~~~~~~~~-------~~~~~---~~~~~~ 481 (484) ...++..+.....++.+.. ....+ ......+.+.. ..+.+ -+.+.| T Consensus 475 ~~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 535 (535) T protein:vir:10 475 GFGQPNVPDSSDDSGSTLGERERQERIQHSKDYEKGKDDPKSPLPKPSESDDVSNNEDADT 535 (535) T ss_pred ccccccCCCCCCCccccCCccccCcccccccccccCCCCCCCCCCcCCCCCccccccccCC Confidence 0000000000000000000 00000 00000000000 00001 111112 No 93 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=99.53 E-value=5.4e-13 Score=87.85 Aligned_cols=360 Identities=13% Similarity=0.063 Sum_probs=190.4 Q ss_pred chhhhhhhccccc--ccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhcc Q lcl|NC_021302. 21 FGTFLAQGLDQFE--QVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEG 98 (484) Q Consensus 21 ~~~~~~~~~~~~~--~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~ 98 (484) ||......-+... ....... .-.++.+...+-+.|.+|+..+...+.+++|.+...+.. .-..+...|.. T Consensus 1 Mg~f~~l~~~~~~~~~~~~~~~--~~~~~~~~~l~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~--~~~~l~~ll~~---- 72 (376) T protein:vir:78 1 MGFFSELFKRNKEIEWMWDLDF--LEDKTTKVYLKKMALNTCVKHIARTIAKSDFRLKNGETS--VRDKLYYKLNI---- 72 (376) T ss_pred CchhhhhhccCCccccccchhh--ccccchhhhhhhHHHHHHHHHHHHhhcccceeecccccc--ccchHHHHHhh---- Confidence 3321100000000 0000000 001112111235789999999999999999998643211 11111111110 Q ss_pred chhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecCCCceeeeecccccc Q lcl|NC_021302. 99 DESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGT 177 (484) Q Consensus 99 ~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~ 177 (484) +......+.++++.++ +.+.+|.+...+++ ++...+..+.++.+..+....+. . T Consensus 73 --------~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r---~~~~~~~~~~~~~~~~~~~~~~~---~----------- 127 (376) T protein:vir:78 73 --------RPNTDMSSSSFWEKVIYKLIYDNECLIVLSD---TDDFLIADSYVRKEFAFFPDVFE---G----------- 127 (376) T ss_pred --------ccccCCCHHHHHHHHHHHHhHcCcEEEEEEe---CCCeeeccceeecccceeeeeee---e----------- Confidence 1112234566666665 56778998876643 33344555555555443211110 0 Q ss_pred cccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc-eEEecCCCC Q lcl|NC_021302. 178 FGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVP-YLKGNEADS 256 (484) Q Consensus 178 ~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P-~~~gk~~~~ 256 (484) ...........+|....+++++....+.+++.++...+.- ..-..+..+ +++.|.+ .++.+++.. T Consensus 128 ------~~~~~~~~~~~~~~~evih~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~--~~~~~~~~~~~~~~~~~ 193 (376) T protein:vir:78 128 ------VTVKDYRYNRNFSMDDVIFLEYGNERLSAFTDGMFEDYGE------LFGKMIRAQ--MRNFQIRGAVNFKMAGV 193 (376) T ss_pred ------eeeecceeeeeeccccEEEeccCCCCchhhhhHHHHHHHH------HHHHHHHHH--HhcCCCceeEEEccCCC Confidence 0001111123477788888888777777776666544321 111111222 2234544 555666667 Q ss_pred CCHHHHHHHHHHHHHHhcC----CceEEEccCCceEEEecccCCc-----hhHHHHHHHHHHHHHHHHhhhhhccccccc Q lcl|NC_021302. 257 EDDDRMDELLEIASNYSGG----ESAGLALTAGEEAGILSPNGTP-----LDPRRAIEYHDHQMALVALAHFLNLDGKGG 327 (484) Q Consensus 257 ~~~~~~~~l~~~l~~~~~g----~~a~~vip~~~~ie~~~~~~~~-----~~~~~li~~~d~~Isk~ilGqtlt~~~~gG 327 (484) .++++.+++.+.+.+..++ ...+++++.|++++-++.+... ..|.+..++...+|++++.-..--.. | T Consensus 194 ~~~e~~~~~~~~~~~~~~g~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~---~ 270 (376) T protein:vir:78 194 ADKDKQTKLQEYIDKVYASFNNNEIAIVPQLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLH---G 270 (376) T ss_pred CCHHHHHHHHHHHHHHhccccccCcceEEcCCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhC---C Confidence 7888888888888877554 2234568999999888765322 25788888999999999765442222 2 Q ss_pred chhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC-CCCcHHHHHHHHHHHHhcCcccCCc Q lcl|NC_021302. 328 SYALA-SVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE-IGSRQDATAAALQMLVNAGLLTPDP 405 (484) Q Consensus 328 s~A~~-evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~-~~~~~~~~ae~~~~L~~~G~~~~~~ 405 (484) +++-. +........-+.-.++.|++.||+.|+++ ......|.++. ...|.++++++++++++.|+.. T Consensus 271 ~~s~~e~~~~~f~~~~l~P~~~~ie~~l~~kll~~--------~~~~~~~~~~~ll~~d~~~~~~~~~~~~~~G~~t--- 339 (376) T protein:vir:78 271 DMADLSNNMKAYMEYCIDPLTKKLEDELNAKLFTF--------SEFLAGEHIKIIHKKDIIENAEAVDKLVASGSFN--- 339 (376) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHhhhCCc--------ccceecccchhhcccCHHHHHHHHHHHHhCCCcC--- Confidence 33322 22234455567777888888888877543 12111222222 2357788999999999999754 Q ss_pred ccHHHHHHHhCCCCCCCCcc-cccccCCCcCCCccccCCCCccccc Q lcl|NC_021302. 406 RLEAFLRDAAGLPGPDPDAD-DDESTADTGQDEPETDEPALPNTSG 450 (484) Q Consensus 406 ~~~~~i~e~~glp~p~~~e~-~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (484) .+++|+.+|+|.-+++.- ..-.+.+- ++ . ...+..| T Consensus 340 --~NE~R~~lg~~p~~~g~~d~~~~~~n~-~~-~-----~~~~e~g 376 (376) T protein:vir:78 340 --RNEVRELLGAERVDNPELDKYLITKNY-QS-A-----DEGGEDG 376 (376) T ss_pred --HHHHHHHhCCCCCCCCCCceeeeccCc-ee-h-----hccccCC Confidence 478999999986554431 11111110 00 0 0011111 No 94 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=99.53 E-value=8.8e-13 Score=86.67 Aligned_cols=375 Identities=13% Similarity=0.071 Sum_probs=200.0 Q ss_pred CCCCC-CCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCC Q lcl|NC_021302. 1 MAPKT-VAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNG 79 (484) Q Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~ 79 (484) ...++ ..+.++.. .+........ -....|..+. +.+.|.+|+..+...|.++++++.-.. T Consensus 7 f~~k~~~~~~~~~~--------------~~~~~~~~~~----~~~~~~~~~~-~~~~V~~~I~~ia~~iA~~p~~~~~~~ 67 (403) T protein:vir:80 7 FRRKTRSEPTNAIS--------------WFLTQEAYDT----LAIPGYTRLS-DNPEVRMAVHKIAELISSMTIHLMQNT 67 (403) T ss_pred ccccccccccchhh--------------hhcccccccc----cccchhhhhh-hhHHHHHHHHHHHHhhhhCceEEEEec Confidence 11111 11111100 0000000000 0111233454 368899999999999999999984332 Q ss_pred CCH--HHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHH-HHh--hcceeeeEEEeecCCeeeeeeeeeeCcc Q lcl|NC_021302. 80 ARP--EVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALK-SLQ--FGHAVFEQTYFYEGGRFWLKRLAPRPQS 154 (484) Q Consensus 80 ~~~--e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~-a~~--~G~s~~Eivw~~~~g~~~~~~l~~r~~~ 154 (484) ++. ++...+...|.. +-....+..++++.++. .+. +|++.++++|.. .-.+..|.+.+|. T Consensus 68 ~~g~~~~~~~~~~lL~~------------~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~---~g~~~~L~~l~p~ 132 (403) T protein:vir:80 68 DNGDIRIKNELSRKIDI------------NPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTT---SGLIDELIPLAPS 132 (403) T ss_pred CCceeecCChHHHHHhc------------cCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcC---CCcEEEEEEEcCC Confidence 221 111111122210 11122245667777653 444 688999887643 2346788899998 Q ss_pred ceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCc-cccchhHHHHHHHHHHHHHHHH Q lcl|NC_021302. 155 SIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGV-WTGNSLLRPAYKNWKLKDELIR 233 (484) Q Consensus 155 ~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~-p~G~gll~~~~~~~~~K~~~~~ 233 (484) ++.. ..+.+|..+.. .+..++.+..++++....+.+ .+|.|.+..+....-.-....+ T Consensus 133 ~v~~-~~~~~g~~~~y--------------------~~~~~~~~eiih~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~ 191 (403) T protein:vir:80 133 KVSF-VDTDTGYQIWY--------------------QGKAYNYDEVLHFIVNPDPEKPYMGRGYRVVLKDIVNNLKQATT 191 (403) T ss_pred eeEE-EEcCCceEEEE--------------------eecccchhhEEEEeccCCCcCccccccHHHHHHHHHHHHHHHHH Confidence 8753 34444432211 123466777887775544444 4599998888777766667777 Q ss_pred HHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcC-Cc--eEEEccCCc-eEE-EecccCCchhHHHHHHHHH Q lcl|NC_021302. 234 IEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGG-ES--AGLALTAGE-EAG-ILSPNGTPLDPRRAIEYHD 308 (484) Q Consensus 234 ~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g-~~--a~~vip~~~-~ie-~~~~~~~~~~~~~li~~~d 308 (484) +...|... .++|-.+.+.+...+++..+++.+.+.+...+ .+ ..+++|.+. +.+ +...+.....+.+..++.. T Consensus 192 ~~~~~~~n--g~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~d~q~~e~~~~~~ 269 (403) T protein:vir:80 192 TKKSFMSG--KYMPSLIVKVDAATAELSSEEGRNAVFKKYLEASEAGQPWIIPAELLDVEQVKPLSLKDLAIHETVELDK 269 (403) T ss_pred HHHHHHhc--cCCcceEEEeCCCCChHHHHHHHHHHHHHHhhhhhcCCeeeecccccccceeccCCHHHHHHHHHHHHhH Confidence 77788764 36786666666666666556666554433222 12 234566553 332 2223333345778888899 Q ss_pred HHHHHHHhhhh-hcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC---CCc Q lcl|NC_021302. 309 HQMALVALAHF-LNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI---GSR 384 (484) Q Consensus 309 ~~Isk~ilGqt-lt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~---~~~ 384 (484) .+|++++.-.. +...+++. .+ ........-+.-.++.|+..||+.|+. +.+ + .|+|+.. ..| T Consensus 270 ~~Ia~~fgVPp~~lg~~~~~-~~---~~~~f~~~~l~P~~~~ie~~l~~kll~--------~~~-~-~~~f~~~~ll~~d 335 (403) T protein:vir:80 270 RTVAGIFGVPAFLLGVGKYD-KD---EYNNFINSTILPIAKGIEQELTRKLLI--------SPD-L-YFKFNPRSLYAYD 335 (403) T ss_pred HHHHHHhCCCHHHcCCCCcc-HH---HHHHHHHHHHHHHHHHHHHHHHHhccC--------CCC-c-EEEeechhhhccC Confidence 99999876554 22112222 11 223344556677778888888775543 122 2 5667532 347 Q ss_pred HHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCC---ccccCC---CCcccccccc Q lcl|NC_021302. 385 QDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDE---PETDEP---ALPNTSGTTS 453 (484) Q Consensus 385 ~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~---~~~~~~---~~~~~~~~~~ 453 (484) .+++++++.++++.|+.. .+++|+.+|+|.-+.+++..... + -.+. .+.... ...++.++.. T Consensus 336 ~~~~~~~~~~~~~~Gi~t-----~NE~R~~~gl~p~~ggd~~~~~~-n-~~pl~~~~~~~~~k~ge~~~~~~~~~ 403 (403) T protein:vir:80 336 LKELAEVGSNMYVRGLME-----GNEVRDWLGLSPKEGLSELVILE-N-YIPLDKIGDQNKLKGGEKGGADGQTD 403 (403) T ss_pred HHHHHHHHHHHHhCCCcC-----HHHHHHHhCCCCCCCCCeEeecc-c-ccchhhccchhhccCCCCCCCCCCCC Confidence 788999999999999864 47899999998655444322110 0 0000 000000 0000011111 No 95 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=99.49 E-value=7.6e-13 Score=87.02 Aligned_cols=376 Identities=11% Similarity=0.066 Sum_probs=184.8 Q ss_pred chhhhhhhcccccccccccccchH-HHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccc Q lcl|NC_021302. 21 FGTFLAQGLDQFEQVDELRWPNSV-YTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGD 99 (484) Q Consensus 21 ~~~~~~~~~~~~~~~~~lr~~~~~-~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~ 99 (484) ||..-.............-....+ .++.++-.+-+.|.+|+..+...|.+++|+|...+.+......+...|.. T Consensus 1 Mgl~d~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~~~~~~~~~~lL~~----- 75 (395) T protein:vir:96 1 MGILDFFSFKKSGTLSDDDSGSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRIKAPEKLTENQKDWLYWINT----- 75 (395) T ss_pred CcchhhhcCCCCccccccccccchhhhcchhhhhhHHHHHHHHHHHHhhccceeEEEeCCccccccchHHHHHhh----- Confidence 332111000000000000001111 12233333467999999999999999999997554332211112222211 Q ss_pred hhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecCCCceeeeeccccccc Q lcl|NC_021302. 100 ESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTF 178 (484) Q Consensus 100 ~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~ 178 (484) +........++++.++ +.+.+|.+.+.+++. +...+.....+... . .+... .+ T Consensus 76 -------~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~---~~~~~~~~~~~~~~------~--~~~~~--~~------ 129 (395) T protein:vir:96 76 -------KANPNQSASQFWVEVVQKLLVDGETLIFVIPG---KGIYVADAFTQDKK------L--SGNKF--KV------ 129 (395) T ss_pred -------cCCCCCCHHHHHHHHHHHHhhcCceEEEEEcC---CceecCCccccccc------c--cccee--ee------ Confidence 0111224556666654 566789988776542 22222211111100 0 00000 00 Q ss_pred ccccceeccCCCCcccccccceEEEeecCccCccccchhHHHH------HHHHHHHHHHHHHHHHHHHHhcCCcceEEec Q lcl|NC_021302. 179 GGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPA------YKNWKLKDELIRIEAAAIRRHGIGVPYLKGN 252 (484) Q Consensus 179 ~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~------~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk 252 (484) ...........+++...+++++......+++.|+.... ......+....++...+... .|.|..+-+ T Consensus 130 -----v~~~~~~~~~~~~~~dvih~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~--~~~~~~~~~ 202 (395) T protein:vir:96 130 -----SRVQGQTYEKIFTFDQVIYLKNDNSDLMLKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKD--KVRERAQEN 202 (395) T ss_pred -----eeeccceeeeEeccCceEEecccCCccccccccccchHHHHHHHHHHHHHHHHHHHHHhhhccc--ccccceeec Confidence 00001111234677888888877666666665554322 12222333444555555543 255555555 Q ss_pred CCCCCCHHHHHHHHHH-HHHHhcCCceEEEccCCceEEEecccCCch------hHHHHHHHHHHHHHHHHhhhhhccccc Q lcl|NC_021302. 253 EADSEDDDRMDELLEI-ASNYSGGESAGLALTAGEEAGILSPNGTPL------DPRRAIEYHDHQMALVALAHFLNLDGK 325 (484) Q Consensus 253 ~~~~~~~~~~~~l~~~-l~~~~~g~~a~~vip~~~~ie~~~~~~~~~------~~~~li~~~d~~Isk~ilGqtlt~~~~ 325 (484) ......++..++..+. .....++..++++++.|++++-++.+.... .|.++....-++|++++.-..--. T Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l--- 279 (395) T protein:vir:96 203 SDGGRQPKSDKDFFKRTIEKIRTESVVGIPVTANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLL--- 279 (395) T ss_pred cCchhhHHHHHHHHHHHHHHhhcCCcceEEccCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHh--- Confidence 4444444444444333 334444544566788999888776654332 344455566789999976544322 Q ss_pred ccchhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC-CCcHHHHHHHHHHHHhcCcccC Q lcl|NC_021302. 326 GGSYALA-SVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI-GSRQDATAAALQMLVNAGLLTP 403 (484) Q Consensus 326 gGs~A~~-evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~-~~~~~~~ae~~~~L~~~G~~~~ 403 (484) ||+++-. +........-+.-.++.|+..||+.|++.--. .. .-+|.++.. ..|.++++++++++++.|+.. T Consensus 280 ~~~~sn~e~~~~~f~~~~L~P~~~~ie~~l~~~Ll~~~e~---~~---~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T- 352 (395) T protein:vir:96 280 HGDIADNQKNYELLLEGPIESLITNIVDGLEYAIFDKSET---LE---GSFIKVTGLKNYDLFSISSQADKLISSGFVF- 352 (395) T ss_pred cCCCccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhh---cC---ceeEeecchhccCHHHHHHHHHHHHhCCCcC- Confidence 2333322 23334455567788888999998877764211 11 114566543 357889999999999999764 Q ss_pred CcccHHHHHHHhCCCCCCC--CcccccccCCCcCCCccccCCCCccccccccccccccc Q lcl|NC_021302. 404 DPRLEAFLRDAAGLPGPDP--DADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQA 460 (484) Q Consensus 404 ~~~~~~~i~e~~glp~p~~--~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (484) .+++|+.+|+|+-++ ++....+ .+- .+..+. .|+ .....+. T Consensus 353 ----~NE~R~~~gl~pi~~~~gD~~~~~-~N~-~~~~~~--------gge--~~~~~~~ 395 (395) T protein:vir:96 353 ----IDEVREEIGLPELPDGLGKVLYMT-KNY-ESVLER--------GGE--VDEEVET 395 (395) T ss_pred ----HHHHHHHhCCCCCCCCCCceeeec-ccc-eechhc--------cCC--CCCCCCC Confidence 478999999986543 2222111 000 000000 000 0001111 No 96 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=99.49 E-value=9.5e-13 Score=86.50 Aligned_cols=348 Identities=14% Similarity=0.097 Sum_probs=197.3 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) |+=-.+=-+ +....+... +. .+...- ....+..+ .-+..+ +-+.|.+|+......|.++++.- T Consensus 1 M~~~~~f~~--r~~~~~~~~----~~-~~~~~~---~~~~~~~v-~~~~al-~~~av~~cv~~ia~~ia~~p~~~----- 63 (359) T protein:vir:10 1 MSILNPFER--RSSITPNNY----YP-FMVQNG---SIVPNSLV-DATEAL-KNSDLYAVTSLISSDIAGTRFIG----- 63 (359) T ss_pred Ccccchhhc--cccCCCCcc----hh-hhhccc---cccCCccc-CHHHhh-cchHHHHHHHHHHHhhhcCcccc----- Confidence 221110000 000000000 00 000000 00000111 112333 46889999999999999998732 Q ss_pred CHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeee Q lcl|NC_021302. 81 RPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYW 159 (484) Q Consensus 81 ~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~ 159 (484) ++ . ....+.. -....+..++++.+. +.+.+|-+..++++.. + -.+..|.+.++.++.. T Consensus 64 ~~-~---~~~L~~~-------------PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-~--g~~~~l~~l~~~~v~i- 122 (359) T protein:vir:10 64 NQ-V---FTSVLNN-------------PSHLTNAFSFWQTAILNLLLNGNVFLAILKGD-N--SLMKELRLIPSNAITI- 122 (359) T ss_pred ch-H---HHHHhhc-------------ccccCCHHHHHHHHHHhccccCceEEEEEECC-C--CeEEEEEEeCCceEEE- Confidence 11 1 1111111 111224556666665 5678899999987643 2 3577888988887753 Q ss_pred eecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCc----cCccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 160 NVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMD----PGVWTGNSLLRPAYKNWKLKDELIRIE 235 (484) Q Consensus 160 ~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~----~~~p~G~gll~~~~~~~~~K~~~~~~w 235 (484) ..+ ++++.+... .........++....+++++... .+..+|.|.+..+....-......++. T Consensus 123 ~~~-~~~~~y~~~-------------~~~~~~~~~~~~~evih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~ 188 (359) T protein:vir:10 123 DLT-DDTLTYEVN-------------QFDDYPSAKYNASEMIHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLS 188 (359) T ss_pred EEc-CCeEEEEEE-------------ecCCceEEEEcccceEEeccCCCCCCccCccccccHHHHHHHHHHHHHHHHHHH Confidence 333 344432211 11233456678888888776543 234579999999888888888888888 Q ss_pred HHHHHHhcCCcceEEecCCC-CCCHHHHHHHHHHHHHHhcCCce--EEEccCCceEEEecccCCchhHHHHHHHHHHHHH Q lcl|NC_021302. 236 AAAIRRHGIGVPYLKGNEAD-SEDDDRMDELLEIASNYSGGESA--GLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMA 312 (484) Q Consensus 236 ~~f~Er~~~G~P~~~gk~~~-~~~~~~~~~l~~~l~~~~~g~~a--~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Is 312 (484) ..+... .+.|-.+.+++. ..++++++.+.+.++++.++.++ .++++.|++++-++.+.....|.+..++..++|+ T Consensus 189 ~~~f~n--g~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia 266 (359) T protein:vir:10 189 LSTLKG--ALNPTSVVKVPQGTLSSEAKDSIRKEFEKANGGNNSGRVMVLDQSADFSTVSINADVANYLNSMNWGRTQIA 266 (359) T ss_pred HHHHhc--cCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCceecCCCcceeeecCCHHHHHHHHHHHHHHHHHH Confidence 888864 356766777754 56888889999999988766554 4789999999888766555568888999999999 Q ss_pred HHHhhhhhcccccc---cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCCCcHHHHH Q lcl|NC_021302. 313 LVALAHFLNLDGKG---GSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIGSRQDATA 389 (484) Q Consensus 313 k~ilGqtlt~~~~g---Gs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~~~~~~~a 389 (484) +++.-..--.+..+ .+++. + ++.....+.--..-+++.|+..|.+.+ .++.+ . .+.+ +...+. T Consensus 267 ~~fgVPp~~lg~~~~~~~~~~~--~-e~~~~~~l~~~l~p~~~~l~~~l~~~~-~~~~~---~--~~~~-----d~~~~~ 332 (359) T protein:vir:10 267 KAFGVSDSYLNGTGDQQSSLDQ--I-KDLYVNALNRFIEPLISELRIKCDSSI-GVDMS---P--ITDY-----SNSVFK 332 (359) T ss_pred HHhCCCHHHhCCCCcccccHHH--H-HHHHHHHHHHHHHHHHHHHHHHhhhhh-cccch---h--hhhc-----CHHHHH Confidence 99765543332111 23322 2 222222233333444455554443321 12211 0 1222 234455 Q ss_pred HHHHHHHhcCcccCCcccHHHHHHHhCCCCCC Q lcl|NC_021302. 390 AALQMLVNAGLLTPDPRLEAFLRDAAGLPGPD 421 (484) Q Consensus 390 e~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~ 421 (484) ..+.++++.|+.. .+++|+.+|+|.-- T Consensus 333 ~~~~~~~~~G~~t-----~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 333 ADILNWVKEGIIE-----PTEAKTLLESKGII 359 (359) T ss_pred HHHHHHHhCCCcC-----HHHHHHHhCCCCCC Confidence 6677889999864 47899999996432 No 97 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=99.47 E-value=4e-13 Score=88.55 Aligned_cols=275 Identities=8% Similarity=-0.036 Sum_probs=180.3 Q ss_pred hhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeee Q lcl|NC_021302. 68 IRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLK 146 (484) Q Consensus 68 v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~ 146 (484) |.+++|.+...+...+ ..+...|.. +-....++.++++.++ +.+.+|-++++++.... | .+. T Consensus 1 ia~l~~~~~~~~~~~~--~~l~~lL~~------------~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~-G--~~~ 63 (278) T protein:vir:78 1 MASLPLKMYEDYKVVN--TEVSDLLTV------------SPNNSLSSFDFINQIETIRNEKGNAYVLIERDIY-H--QPS 63 (278) T ss_pred CccceeEEEecCcccc--cHHHHHHHh------------cCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCC-C--cEE Confidence 9999999865433211 112222210 1112335777888887 78889999999986433 3 367 Q ss_pred eeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHH Q lcl|NC_021302. 147 RLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWK 226 (484) Q Consensus 147 ~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~ 226 (484) .|.+.+|.++. ...+.+++.+.... ....+....++.+..+++++....+.++|.|.+..+....- T Consensus 64 ~l~~l~~~~v~-v~~~~~~~~~~y~~-------------~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~ 129 (278) T protein:vir:78 64 KLFLLNPDVVE-MLIENQSRELYYSI-------------HAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTD 129 (278) T ss_pred EEEEECCceeE-EEEcCCCceEEEEE-------------EcCCceEEEEccccEEEECCCCCCCCeeeccHHHHHHHHHH Confidence 89999999886 45555555442211 11233445678888787776656677899999999887665 Q ss_pred HHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHH Q lcl|NC_021302. 227 LKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEY 306 (484) Q Consensus 227 ~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~ 306 (484) ....... |.. .+++.| |-.+.+.+...++++++++.+.+++...+....++++.|++++-++.+.....|.+..++ T Consensus 130 ~~~~~~~-~~~--~~~~~~-~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~ 205 (278) T protein:vir:78 130 FDNAVRT-FNL--TEMQKP-DSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENL 205 (278) T ss_pred HHHHHHH-HHH--HHhcCC-CcEEEEeCCCCCHHHHHHHHHHHHHHhccCCCceecCCCceEEEccCChhHHHHHHHHHH Confidence 5444433 332 333434 556667777888999999999988887665567889999999888877666779999999 Q ss_pred HHHHHHHHHhhhhh-cccccccchhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCCC Q lcl|NC_021302. 307 HDHQMALVALAHFL-NLDGKGGSYALASVQA-DTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIGS 383 (484) Q Consensus 307 ~d~~Isk~ilGqtl-t~~~~gGs~A~~evh~-~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~~ 383 (484) ..++|++++.-... ....++++++..+-+. ......+.-.++.|++.||+.|+++--.. .+. +|+|+...- T Consensus 206 ~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~~~~~~~~l~P~~~~i~~~ln~~L~~~~e~~----~g~--~~~f~~~~l 278 (278) T protein:vir:78 206 TRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDRE----KIG--ILNLTLNLI 278 (278) T ss_pred HHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhc----CCc--eEEEecccC Confidence 99999999766543 3333456666555544 45566788999999999999887642111 111 455642111 No 98 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.46 E-value=3.4e-12 Score=83.43 Aligned_cols=441 Identities=10% Similarity=0.002 Sum_probs=214.0 Q ss_pred CCCCCCCcccee---eeecccccchhhhhhhccccc-cc-ccc--cccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCc Q lcl|NC_021302. 1 MAPKTVAPRTER---GYVNPLAGFGTFLAQGLDQFE-QV-DEL--RWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDW 73 (484) Q Consensus 1 ~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~-~~-~~l--r~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~ 73 (484) +||..++..... +|....+ . .....++.+.- .. .++ .+.....--+++.+++++++++++.....|.+.-+ T Consensus 12 ~a~~~~~~~~~~~~~~y~gA~~-~-~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG~Gi 89 (553) T protein:vir:63 12 VTSGRPEQSASLGGGGLEGASR-L-SRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGYQRDSIVGAQY 89 (553) T ss_pred cccccchhhhhhhccccccccc-C-CCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhccCCc Confidence 566555444322 1111111 0 11122222211 10 111 12233455678889999999999999999999988 Q ss_pred EEecC-------CCCHHHHHHHHHHHHhhhccchhhhh-HHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeee Q lcl|NC_021302. 74 RIRPN-------GARPEVVEHVAACLGLPVEGDESDKP-TPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFW 144 (484) Q Consensus 74 ~v~p~-------~~~~e~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~ 144 (484) .+.+. +.+++.++...+.+...+..-..... .-+..+..+|..+...++ ..+.-|=+++-+.|....|... T Consensus 90 ~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~~~~~ 169 (553) T protein:vir:63 90 RLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLATAEWDRAANRPY 169 (553) T ss_pred eeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEEeeeccCCCCcc Confidence 87654 44555555555555433221111000 012346678999999888 4577888888888987766555 Q ss_pred eeeeeeeCccceeeeeecCCCcee--eeecccccccccccceeccC-CC----------------CcccccccceEEEee Q lcl|NC_021302. 145 LKRLAPRPQSSIAYWNVDRDGGLI--SIQQWPAGTFGGPGMVVMAP-NS----------------MGPAIPVEQLVVYTH 205 (484) Q Consensus 145 ~~~l~~r~~~~~~~~~~~~dg~l~--~~~q~~~~~~~~~~~~~~~~-~~----------------~~~~lp~~k~l~~~~ 205 (484) +-+|..++|..|.......+|+.+ ++.-...+. ....+.... .+ ....+|... |+|.+ T Consensus 170 ~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~~Gr--~vaY~i~~~hPgd~~~~~~~~~~~~r~~~~~~v~a~~-vlH~f 246 (553) T protein:vir:63 170 ATCFQMVSTDRLSNPYQQLDTPTLRRGVQYDKRGR--PQGYWIQVAHPGDLYQMAPDMYKWKFVQQSKPWGRRQ-VIHIL 246 (553) T ss_pred cceEEEechhhcCCCCCCCCCCeeEeeeEECCCCc--eEEEEeeccCCCccccccccccceeeeccccccChhH-heecc Confidence 556777777776543222333321 221111110 111111111 11 122355444 55665 Q ss_pred cC-ccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHH------------------- Q lcl|NC_021302. 206 DM-DPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDEL------------------- 265 (484) Q Consensus 206 ~~-~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l------------------- 265 (484) .. ..+..-|.+.|.++.....-........+.-... ..-+. ...+.+.+.. ...+.+ T Consensus 247 ~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i-~A~~a-~fi~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (553) T protein:vir:63 247 EPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVI-NASYA-AAIESELPPE-FIHSQMSGGSPNADMVGIFGKYMDA 323 (553) T ss_pred cccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHH-hhhhe-eeeecCCChh-hhhhhcccccccccccccccccccc Confidence 54 5888889999998876655444433333332221 11112 2222221110 000000 Q ss_pred ------HHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhh--hhhcccccccchhhHHHHHH Q lcl|NC_021302. 266 ------LEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALA--HFLNLDGKGGSYALASVQAD 337 (484) Q Consensus 266 ------~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilG--qtlt~~~~gGs~A~~evh~~ 337 (484) ......|..| ....++.|.+|++.+++..+..|..|.+..-+.|+..+.- +.||.|-.+.||+.+-.-.. T Consensus 324 ~~~~~~~~~~~~l~pG--~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R~~~~ 401 (553) T protein:vir:63 324 LKAYVGGANNIQIDGA--KIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANYSSIQAGIA 401 (553) T ss_pred cccccccccceeecCc--eeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHH Confidence 0111234334 4667899999999998877889999999999999999533 34666654567877666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh-------CCCCcc------------ccceEEe--cCCC-CcHHHHHHHHHHH Q lcl|NC_021302. 338 TFVQSVQTVADEIRDVAQAHVVEDIVDV-------NWGEDE------------PAPLLVF--DEIG-SRQDATAAALQML 395 (484) Q Consensus 338 v~~~~~~aD~~~i~~~ln~qli~~l~~~-------Nf~~~~------------~~P~~~~--~~~~-~~~~~~ae~~~~L 395 (484) .+...++.....+...+-+-+..++++. ..+... .+-..+| .... -|..+=+++.... T Consensus 402 e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~ 481 (553) T protein:vir:63 402 MTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMR 481 (553) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHHHHHHHH Confidence 6666666666666665555555544442 111100 0001122 1111 2333334555566 Q ss_pred HhcCcccCCcc----------------cHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccc Q lcl|NC_021302. 396 VNAGLLTPDPR----------------LEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGT 451 (484) Q Consensus 396 ~~~G~~~~~~~----------------~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (484) +..|+.....+ .+....+++||+.+.+..............+++.+....++..++ T Consensus 482 i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 482 IDAGLSTYEREIARLGGDFRKSFAQRAREDALLKKYGLTFNLSAKRSLGDGRDAATGIAEDPAAAQTSQQGE 553 (553) T ss_pred HHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCCCccccCCCcccCCCCCCCCCCCCcccccC Confidence 66676432100 111234445655432211111110000111111111111111111 No 99 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=99.43 E-value=1.2e-12 Score=85.95 Aligned_cols=354 Identities=12% Similarity=0.022 Sum_probs=178.0 Q ss_pred chhhhh-----hhccccccccccc-ccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCCC----HHHHH---- Q lcl|NC_021302. 21 FGTFLA-----QGLDQFEQVDELR-WPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGAR----PEVVE---- 86 (484) Q Consensus 21 ~~~~~~-----~~~~~~~~~~~lr-~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~----~e~~~---- 86 (484) ||.+.. ....... ..... +.. -.++ .+-+.|.+|+..+-..|.++++.+...... .+... T Consensus 1 Mg~f~~~~~~~~~~~~~~-~~~~~~~~~-~~~~----~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~~~~~~~~~~ 74 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNND-TQRVTAWQN-EAVE----YTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGS 74 (378) T ss_pred CCccccchhcccccccCC-cceeeeecc-chhH----HHHHHHHHHHHHHHhhhhhCceeeEEEcccCcccccccccccc Confidence 332110 0000010 00111 111 1111 123579999999999999999987432211 11111 Q ss_pred HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecCCC Q lcl|NC_021302. 87 HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDG 165 (484) Q Consensus 87 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg 165 (484) -+.+.|.. +-.......++++.++ +.+.+|.+.+.++|....|.+. .+. | T Consensus 75 ~l~~lL~~------------~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~--~l~---p------------ 125 (378) T protein:vir:94 75 DLDEVLNW------------SPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELL--DLL---F------------ 125 (378) T ss_pred hHHHHHhh------------cCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEE--EEE---e------------ Confidence 11122210 1112334566777665 6788999998887754433321 110 0 Q ss_pred ceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021302. 166 GLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIG 245 (484) Q Consensus 166 ~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G 245 (484) ......+|++..|++++ ..++ -.|.|.+..+.... ..++. .| T Consensus 126 -----------------------~~~~~~~~~~diiH~~~-~~~~-~~g~s~l~~~~~~i----------~~~~~---~~ 167 (378) T protein:vir:94 126 -----------------------ADDKKEYKPEELVRLTS-PFYI-NEDTSILDNALASI----------QTKLE---QG 167 (378) T ss_pred -----------------------cCCeeEeeeeeeEEecC-cCCc-cchhHHHHHHHHHH----------HHHHh---cc Confidence 01122355666665553 2222 34677776654321 11122 13 Q ss_pred cceEEecCCCCCCHH----HHHHHHHHHHHHhcCCce--EEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhh Q lcl|NC_021302. 246 VPYLKGNEADSEDDD----RMDELLEIASNYSGGESA--GLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHF 319 (484) Q Consensus 246 ~P~~~gk~~~~~~~~----~~~~l~~~l~~~~~g~~a--~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqt 319 (484) .|-.+.+.+...+++ .++++.+.+++...+.++ .++++.|++++-++.+.....+.. .++..++|++++.-.. T Consensus 168 ~~~gil~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~-~~~~~~~Ia~~fgVP~ 246 (378) T protein:vir:94 168 KLRGLLKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDE-IDLIKSELLTGYFMNE 246 (378) T ss_pred cccceeeeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceecCCCceEEEccCChhhhhHHH-HHHHHHHHHHHhCCCH Confidence 443344554444433 345566666665555444 478889998887776554444543 4778899999976643 Q ss_pred hcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc--ccceEEecCC-CCcHHHHHHHHHHHH Q lcl|NC_021302. 320 LNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDE--PAPLLVFDEI-GSRQDATAAALQMLV 396 (484) Q Consensus 320 lt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~--~~P~~~~~~~-~~~~~~~ae~~~~L~ 396 (484) -.. +|+++. +-.......-+.-.++.|+..||+.|+++--...+-... .-++|.++.. ..+.++.+++++++. T Consensus 247 ~~l---~~~~se-~~~~~f~~~tL~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~ 322 (378) T protein:vir:94 247 NIL---LGTASQ-EQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENI 322 (378) T ss_pred HHh---cCChHH-HHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeecchhhhhcCHHHHHHHHHHHH Confidence 222 234432 334455566778888999999999888763332211111 1123433332 347788999999999 Q ss_pred hcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 397 NAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 397 ~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) +.|+.. .+++|+.+|+|.-+++++...+.--.....+....+...+..++....+ + T Consensus 323 ~~G~~T-----~NE~R~~~gl~p~~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~n--~ 378 (378) T protein:vir:94 323 NGPIFT-----QNQLLVKMGEQPIEGGDVYIANLNAVAVKNLSDLQGSRKDVTSTDETNN--Q 378 (378) T ss_pred hCCCcC-----HHHHHHHhCCCCCCCCCeeeecccccccccchhhcCCcCCCCCCCCCCC--C Confidence 999864 4789999999877766654322100000000000000000000000000 0 No 100 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=99.42 E-value=4e-12 Score=83.08 Aligned_cols=376 Identities=10% Similarity=0.049 Sum_probs=179.9 Q ss_pred chhhhhhhccccccccc-ccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccc Q lcl|NC_021302. 21 FGTFLAQGLDQFEQVDE-LRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGD 99 (484) Q Consensus 21 ~~~~~~~~~~~~~~~~~-lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~ 99 (484) ||..-...-........ ..+...-.++.+.-.+-+.|.+|+...-..|.+++|.+...+.+.....-+...|.. T Consensus 1 MGlf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~~~~~~~~~~lL~~----- 75 (395) T protein:vir:98 1 MGILDFFSFKKSGTLSDDDSGSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRLKTPEKLTENQKDWLYWINT----- 75 (395) T ss_pred CcchhhhcCCCcccccccccchhhhhhcchhhhhhHHHHHHHHHHHHHHhhCceeEEecCCcccccchHHHHHhh----- Confidence 44321111111110000 000011112223222467899999999999999999997544332211112222211 Q ss_pred hhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecCCCceeeeeccccccc Q lcl|NC_021302. 100 ESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTF 178 (484) Q Consensus 100 ~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~ 178 (484) +........++++.++ +.+.+|.+.+.++.. .+.+.+..+ .+... + .+.. .. T Consensus 76 -------~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~--~~~~~~~~~-~~~~~-~-------~~~~--~~------- 128 (395) T protein:vir:98 76 -------KANPNQSASQFWVEVIQKLLVDGETLIFVIPG--KGIYVADSF-TQDKK-I-------SGSQ--FK------- 128 (395) T ss_pred -------cCCCCCCHHHHHHHHHHHHhhcCceEEEEEeC--CceecCCcc-ccccc-c-------cCcc--cc------- Confidence 0111224455666654 567789998877542 222211111 10000 0 0000 00 Q ss_pred ccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHH--HHHHHHHHHHHHHHHhcCCcceEEecCCCC Q lcl|NC_021302. 179 GGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWK--LKDELIRIEAAAIRRHGIGVPYLKGNEADS 256 (484) Q Consensus 179 ~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~--~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~ 256 (484) ............+++...+++++....+.+++.|+........- ........-..+... ++.+..+.+.... T Consensus 129 ----~~~~~~~~~~~~~~~~evih~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 202 (395) T protein:vir:98 129 ----VSRVQGQTYEKTFTFDQVIYLKNDNSDLMSKVESLWEEYGELLGHVINNQKIANQIRFTMI--PPKDKVRERAQEN 202 (395) T ss_pred ----eeeecCceeeeEecCccEEEecCCCCCccccccchhhhHHHHHHHHHHHHHHHHHHHHhhc--ccccccccccccc Confidence 00000111134567788888888777777777777654332210 111111111122222 1222222222121 Q ss_pred -CCHHHHHHHHHH----HHHHhcCCceEEEccCCceEEEecccCCc------hhHHHHHHHHHHHHHHHHhhhhhccccc Q lcl|NC_021302. 257 -EDDDRMDELLEI----ASNYSGGESAGLALTAGEEAGILSPNGTP------LDPRRAIEYHDHQMALVALAHFLNLDGK 325 (484) Q Consensus 257 -~~~~~~~~l~~~----l~~~~~g~~a~~vip~~~~ie~~~~~~~~------~~~~~li~~~d~~Isk~ilGqtlt~~~~ 325 (484) .+++..+...+. .....++...+++++.|++++-++.+... ..|.+..++.-++|++++.-..--. T Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l--- 279 (395) T protein:vir:98 203 SDGGRQSKSDKDFFKRTVEKIRTESVVGIPVTANTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLL--- 279 (395) T ss_pred CCcHHHHHHHHHHHHHHHhhhhcCCcceeecCCCceeEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHh--- Confidence 222222333333 33333343445668899998877654321 2477777888899999976644322 Q ss_pred ccchhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCC-CCcHHHHHHHHHHHHhcCcccC Q lcl|NC_021302. 326 GGSYALAS-VQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEI-GSRQDATAAALQMLVNAGLLTP 403 (484) Q Consensus 326 gGs~A~~e-vh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~-~~~~~~~ae~~~~L~~~G~~~~ 403 (484) ||+++-.+ ........-+.-.++.|++.||+.|+++-.. .. .-+|.|+.. ..|.++.+++++++.+.|+.. T Consensus 280 ~~~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~kll~~~~~-~~-----g~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T- 352 (395) T protein:vir:98 280 HGDIADNQKNYELLLEGPIESLITNIVDGLEYAIFDKSET-LQ-----GSFIKVTGLKNYDLFSISNQADKLISSGFVF- 352 (395) T ss_pred cCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhh-cC-----cceeeehhhhccCHHHHHHHHHHHHhCCCcC- Confidence 24444222 2234456667888899999999888764221 11 125666543 357888999999999999754 Q ss_pred CcccHHHHHHHhCCCCCCC--CcccccccCCCcCCCccccCCCCccccccccccccccc Q lcl|NC_021302. 404 DPRLEAFLRDAAGLPGPDP--DADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQA 460 (484) Q Consensus 404 ~~~~~~~i~e~~glp~p~~--~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (484) .+++|+.+|+|.-++ +.....+ .+-.......+. .....+. T Consensus 353 ----~NE~R~~~g~~Pi~~~~gD~~~~~-~n~~~~~~~gge-----------~~~~~~~ 395 (395) T protein:vir:98 353 ----IDEVREEIGLPELPDGLGKVLYMT-KNYESVLERGGE-----------VDEEVET 395 (395) T ss_pred ----HHHHHHHhCCCCCCCCCCceeeec-ccceecccccCC-----------CCCCCCC Confidence 489999999986543 2222211 010000000000 0000111 No 101 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.42 E-value=2.4e-12 Score=84.30 Aligned_cols=436 Identities=15% Similarity=0.093 Sum_probs=206.0 Q ss_pred CCCCCCCccce-eeeecccccchhhhhhhcccc-cccc-cc--cccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEE Q lcl|NC_021302. 1 MAPKTVAPRTE-RGYVNPLAGFGTFLAQGLDQF-EQVD-EL--RWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRI 75 (484) Q Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~-~~~~-~l--r~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v 75 (484) +.|....+... .+|.....+.+ ....++.+. .... +. .+.....--+++.++++++.++++.....|.+..+.+ T Consensus 10 ~~~~~~~~~~~~~~y~~~a~~~~-~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~Gi~~ 88 (533) T protein:vir:34 10 LGPDGMTSLREYAGYHGGGSGFG-GQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQDHIVGSFFRL 88 (533) T ss_pred hcccccchHHHHHhhhhccCCCC-CcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhhCCCcee Confidence 11111111111 11111110111 112222221 1111 11 1223345567888999999999999999999998888 Q ss_pred ecC------CCCHHHHHHHHHHHHhhhccchhhhh-HHHhhcCCCHHHHHHHHHHH-HhhcceeeeEEEeecCCeeeeee Q lcl|NC_021302. 76 RPN------GARPEVVEHVAACLGLPVEGDESDKP-TPRTRGRFSWDQHLRLALKS-LQFGHAVFEQTYFYEGGRFWLKR 147 (484) Q Consensus 76 ~p~------~~~~e~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~i~~~l~a-~~~G~s~~Eivw~~~~g~~~~~~ 147 (484) .+. +-+++.++...+.+...+..--.... .-+.....+|..+...++.+ +.-|=+++-+.|....|...+-+ T Consensus 89 ~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~g~~~~~~ 168 (533) T protein:vir:34 89 SHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDTSSSRLFRTQ 168 (533) T ss_pred eeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeeeccCCCCccceE Confidence 763 22344444444444333221111100 01344667899999888854 77898989888988766555556 Q ss_pred eeeeCccceeeeeecCCCcee--eeecccccccccccceec---cCCC----C------cccccccceEEEeecC-ccCc Q lcl|NC_021302. 148 LAPRPQSSIAYWNVDRDGGLI--SIQQWPAGTFGGPGMVVM---APNS----M------GPAIPVEQLVVYTHDM-DPGV 211 (484) Q Consensus 148 l~~r~~~~~~~~~~~~dg~l~--~~~q~~~~~~~~~~~~~~---~~~~----~------~~~lp~~k~l~~~~~~-~~~~ 211 (484) |..+++..|.-.....+|..+ ++.-. ..+....+.. ...+ . ...+|. .-|+|.+.. ..+. T Consensus 169 lq~ie~d~l~~~~~~~~~~~i~~GIe~d---~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a-~~VlH~f~~~r~gQ 244 (533) T protein:vir:34 169 FRMVSPKRISNPNNTGDSRNCRAGVQIN---DSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGR-ASFIHVFEPVEDGQ 244 (533) T ss_pred EEEechhhcCCCCCCCCCCceEeeeEEC---CCCCeEEEEEeecCCCCccccccceeeeeeccCh-hHeeeeccccCCCc Confidence 777777776543222233321 12111 1111111111 1111 0 112332 346666655 4888 Q ss_pred cccchhHHHHHHHHHHHHHHHHHH----------HHHHHHhcCCc----ceEEecCCCCCCHHHHHHHHHH--------H Q lcl|NC_021302. 212 WTGNSLLRPAYKNWKLKDELIRIE----------AAAIRRHGIGV----PYLKGNEADSEDDDRMDELLEI--------A 269 (484) Q Consensus 212 p~G~gll~~~~~~~~~K~~~~~~w----------~~f~Er~~~G~----P~~~gk~~~~~~~~~~~~l~~~--------l 269 (484) .-|.+.|.++.....-........ +.|++.- .|- ....+....... +........ - T Consensus 245 ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~-~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 322 (533) T protein:vir:34 245 TRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESE-LDTQSAMDFILGANSQEQR-ERLTGWIGEIAAYYAAAP 322 (533) T ss_pred ccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecC-CCcccccccccCCCccccc-ccccccchhhhhccCcce Confidence 999999988876554433333332 2333321 110 000011000000 000000000 0 Q ss_pred HHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhh--hhhcccccccchhhHHHHHHHHHHHHHHHH Q lcl|NC_021302. 270 SNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALA--HFLNLDGKGGSYALASVQADTFVQSVQTVA 347 (484) Q Consensus 270 ~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilG--qtlt~~~~gGs~A~~evh~~v~~~~~~aD~ 347 (484) ..|..| ....++.|.+|++++++..+..|..|.+..-+.|+..+.- +.||.|-.+.||+.+-.-...+...++... T Consensus 323 ~~l~pG--~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q 400 (533) T protein:vir:34 323 VRLGGA--KVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRR 400 (533) T ss_pred eeccCc--eeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHH Confidence 124334 4567899999999999888889999999999999999532 346666445688777666666666666666 Q ss_pred HHHHHHHHHHHHHHHHH---hC-CCC--cc----------ccceEEec--CCC-CcHHHHHHHHHHHHhcCcccCCcc-- Q lcl|NC_021302. 348 DEIRDVAQAHVVEDIVD---VN-WGE--DE----------PAPLLVFD--EIG-SRQDATAAALQMLVNAGLLTPDPR-- 406 (484) Q Consensus 348 ~~i~~~ln~qli~~l~~---~N-f~~--~~----------~~P~~~~~--~~~-~~~~~~ae~~~~L~~~G~~~~~~~-- 406 (484) ..+...+-+-+.+.+++ ++ .-+ .. .+.+..|. ..+ -|..+-+++....++.|+.....+ T Consensus 401 ~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a 480 (533) T protein:vir:34 401 KFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECA 480 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHH Confidence 66666555555555443 22 111 00 00112221 111 243344456666777776432100 Q ss_pred --------------cHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccccccc Q lcl|NC_021302. 407 --------------LEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTN 456 (484) Q Consensus 407 --------------~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 456 (484) .+....+++||+.+..... ......+..++...+..+.+ T Consensus 481 ~~G~D~~ev~~q~a~e~~~~~~~gl~~~~~~~~-----------~~~s~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:34 481 KRGDDYQEIFAQQVRETMERRAAGLKPPAWAAA-----------AFESGLRQSTEEEKSDSRAA 533 (533) T ss_pred HcCCCHHHHHHHHHHHHHHHHhcCCCCCCCCCc-----------CccCCCCCCCCCCcccCCCC Confidence 0111233444443221100 00000000000000000000 No 102 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=99.37 E-value=1.4e-12 Score=85.55 Aligned_cols=355 Identities=12% Similarity=0.028 Sum_probs=175.9 Q ss_pred chhhhh-----hhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCCC----HHHH----HH Q lcl|NC_021302. 21 FGTFLA-----QGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGAR----PEVV----EH 87 (484) Q Consensus 21 ~~~~~~-----~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~----~e~~----~~ 87 (484) ||.+.- .............+... .+..+-+.|.+|+..+...|.+++|.+...... .... .- T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~~~~~~~~~ 75 (378) T protein:vir:16 1 MNLFGKVVSFSRGKLNNDTQRVTAWQNE-----AVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSD 75 (378) T ss_pred CccchhhhhhhcccccCCcceeeecccc-----hhhHHHHHHHHHHHHHHhhhhhCceeEEEEcccccccccccccccch Confidence 332100 01111111000111111 111234589999999999999999987432211 1111 11 Q ss_pred HHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecCCCc Q lcl|NC_021302. 88 VAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGG 166 (484) Q Consensus 88 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~ 166 (484) +.+.|.. +-....+..++++.++ +.+.+|-+.+.++|.-..|.+. .+.+ . T Consensus 76 l~~lL~~------------~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~--~l~~-----------~---- 126 (378) T protein:vir:16 76 LDEVLNW------------SPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELL--DLLF-----------A---- 126 (378) T ss_pred HHHHHhh------------cCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEE--EEEe-----------c---- Confidence 2222211 1112334566666664 6777999999888754333321 1100 0 Q ss_pred eeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Q lcl|NC_021302. 167 LISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGV 246 (484) Q Consensus 167 l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~ 246 (484) .....+|++..|++++. ...-.|.|++..+... ...+.. .|. T Consensus 127 -----------------------~~~~~~~~~diih~r~~--~~~~~~~s~l~~~~~~----------i~~~~~---~~~ 168 (378) T protein:vir:16 127 -----------------------DDKKEYKPEELVRLTSP--FYINEDTSILDNALAS----------IQTKLE---QGK 168 (378) T ss_pred -----------------------CCeeEecccceEEecCc--cCccchhHHHHHHHHH----------HHHHHh---cCc Confidence 01223455555555522 1122355555554422 122332 234 Q ss_pred ceEEecCCCCCCH----HHHHHHHHHHHHHhcCCce--EEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_021302. 247 PYLKGNEADSEDD----DRMDELLEIASNYSGGESA--GLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFL 320 (484) Q Consensus 247 P~~~gk~~~~~~~----~~~~~l~~~l~~~~~g~~a--~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtl 320 (484) |-.+.+.+...++ +.++.+.+.++++..+.++ .++++.|++++-++.+.....+.. .++..++|++++.-..- T Consensus 169 ~~g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~-~~~~~~~Ia~~fgVPp~ 247 (378) T protein:vir:16 169 LRGLLKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDE-IDLIKSELLTGYFMNEN 247 (378) T ss_pred cceeeEeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhHHH-HHHHHHHHHHHhCCCHH Confidence 4333444444343 3455666666666555444 477889998887776554445544 47888999999766542 Q ss_pred cccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CCc-cccceEEecCC-CCcHHHHHHHHHHHHh Q lcl|NC_021302. 321 NLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNW-GED-EPAPLLVFDEI-GSRQDATAAALQMLVN 397 (484) Q Consensus 321 t~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf-~~~-~~~P~~~~~~~-~~~~~~~ae~~~~L~~ 397 (484) .. +|+++. +-.......-+.-.++.|+..||+.|+++--...+ +.. ..-.+|.++.. ..+.++.++++.++++ T Consensus 248 ~l---~g~~~e-~~~~~f~~~tl~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~ 323 (378) T protein:vir:16 248 IL---LGTASQ-EQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENIN 323 (378) T ss_pred Hh---cCCchH-HHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHh Confidence 22 234432 22334445667888899999999988865332221 111 11123433333 3577889999999999 Q ss_pred cCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 398 AGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 398 ~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) .|+.. .+++|+.+|+|.-++++....+.--..........+...+..++....+ + T Consensus 324 ~G~~T-----~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~n--e 378 (378) T protein:vir:16 324 GPIFT-----QNQLLVKMGEQPIEGGDVYIANLNAVAVKNLSDLQGSRKDVTSTDETNN--Q 378 (378) T ss_pred CCCcC-----HHHHHHHhCCCCCCCCCeEeeccccccccchhhhcCccCCCCCCCCCCC--C Confidence 99764 4789999999877665543322100000000000000000000000000 0 No 103 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.36 E-value=4e-11 Score=77.59 Aligned_cols=437 Identities=13% Similarity=0.047 Sum_probs=208.0 Q ss_pred CCCCCCCcc-ceeeeecccccchhhhhhhccccc-cc-ccc--cccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEE Q lcl|NC_021302. 1 MAPKTVAPR-TERGYVNPLAGFGTFLAQGLDQFE-QV-DEL--RWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRI 75 (484) Q Consensus 1 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~l--r~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v 75 (484) ++|.+-... ...++.....+.+ ....++.+.- .. .++ .+......-+++.++++++.++++.....|.+..+.+ T Consensus 7 ~~~~~~~~~~~~~~~~~~a~~~~-~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~Gi~~ 85 (530) T protein:vir:38 7 VGPDGKTSLREYAGYHGGGGGFG-GQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQDHIVGSFFRL 85 (530) T ss_pred ecCccccchHHHhhhhcccCCCC-CcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhhCCCcee Confidence 444432221 1222221111111 1122222211 11 111 1233345567888999999999999999999998888 Q ss_pred ecC------CCCHHHHHHHHHHHHhhhccchhhhh-HHHhhcCCCHHHHHHHHHH-HHhhcceeeeEEEeecCCeeeeee Q lcl|NC_021302. 76 RPN------GARPEVVEHVAACLGLPVEGDESDKP-TPRTRGRFSWDQHLRLALK-SLQFGHAVFEQTYFYEGGRFWLKR 147 (484) Q Consensus 76 ~p~------~~~~e~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~i~~~l~-a~~~G~s~~Eivw~~~~g~~~~~~ 147 (484) .+. +.+++.++...+.+...+..--.... .-+.....+|..+.+.++. .+.-|=.++-+.|..++|.-.+-+ T Consensus 86 ~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~g~~~~~~ 165 (530) T protein:vir:38 86 SYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWDSDSTRLFRTQ 165 (530) T ss_pred eeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEeeeccCCCCccceE Confidence 763 33344444444443333211000000 0123456689999888874 577898888888988777656667 Q ss_pred eeeeCccceeeeeecCCCcee--eeecccccccccccceec-c--CC----CCccccccc-----ceEEEeecC-ccCcc Q lcl|NC_021302. 148 LAPRPQSSIAYWNVDRDGGLI--SIQQWPAGTFGGPGMVVM-A--PN----SMGPAIPVE-----QLVVYTHDM-DPGVW 212 (484) Q Consensus 148 l~~r~~~~~~~~~~~~dg~l~--~~~q~~~~~~~~~~~~~~-~--~~----~~~~~lp~~-----k~l~~~~~~-~~~~p 212 (484) |..+++.+|.-.....+|+.+ ++.- +..+....+.. . .. .....+|.. .-|+|.+.. ..+.. T Consensus 166 lq~ie~d~l~~~~~~~~~~~i~~GIe~---d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~vlH~f~~~r~gQ~ 242 (530) T protein:vir:38 166 FKMVSPKRVSNPNNIGDTRNCRAGVKI---NDSGAALGYYVSDDGYPGWMAQNWTYIPRELPGGRPSFIHVFEPMEDGQT 242 (530) T ss_pred EEEechhhcCCCCCCCCCCeeEeeeEE---CCCCceEEEEEeeccCCCccccccceeeeeeccChhHeEeeccccCCCcc Confidence 777777776533222233311 1111 11111111111 1 11 111122322 246666655 47889 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCC-----------CHHHHHHH-----------HHHHH Q lcl|NC_021302. 213 TGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSE-----------DDDRMDEL-----------LEIAS 270 (484) Q Consensus 213 ~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~-----------~~~~~~~l-----------~~~l~ 270 (484) -|.+.|.++.....-........+.-... ..-+... .|...+. .+.+...+ ..... T Consensus 243 RGis~lapvl~~l~~l~~y~dael~~a~i-~A~~a~f-i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (530) T protein:vir:38 243 RGANAFYSVMEQMKMLDTLQNTQLQSAIV-KAMYAAT-IESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAPV 320 (530) T ss_pred cCCchHHHHHHHHHHHhHHHHHHHHHHHH-hhhheee-eeccCCccccccccccCCcccccccccccchhhhhcccccce Confidence 99999998876554443333333222111 0111111 1111000 00000000 00111 Q ss_pred HHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhh--hhhcccccccchhhHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 271 NYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALA--HFLNLDGKGGSYALASVQADTFVQSVQTVAD 348 (484) Q Consensus 271 ~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilG--qtlt~~~~gGs~A~~evh~~v~~~~~~aD~~ 348 (484) .|..| ....++.|.+|++++++..+..|..|.+..-+.|+..+.- +.||.|-.+.||+.+-.-..-+...++.... T Consensus 321 ~l~pG--~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~ 398 (530) T protein:vir:38 321 RLGGA--RVPHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSYSTARASANESWAYFMGRRK 398 (530) T ss_pred eccCc--eeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHH Confidence 24344 4667899999999999888889999999999999999532 2356664456788776666666666777766 Q ss_pred HHHHHHHHHHHHHHHHh---C-C---CCccc---------cceEEe--cCCC-CcHHHHHHHHHHHHhcCcccCCcc--- Q lcl|NC_021302. 349 EIRDVAQAHVVEDIVDV---N-W---GEDEP---------APLLVF--DEIG-SRQDATAAALQMLVNAGLLTPDPR--- 406 (484) Q Consensus 349 ~i~~~ln~qli~~l~~~---N-f---~~~~~---------~P~~~~--~~~~-~~~~~~ae~~~~L~~~G~~~~~~~--- 406 (484) .+...+-+-+...+++. + . +.... +....| ...+ -|..+-+++....++.|+.....+ T Consensus 399 ~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~ 478 (530) T protein:vir:38 399 FVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAK 478 (530) T ss_pred HHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHH Confidence 66665545455544431 1 1 10000 111222 1111 233333455566667776432100 Q ss_pred -------------cHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccccccc Q lcl|NC_021302. 407 -------------LEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTN 456 (484) Q Consensus 407 -------------~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 456 (484) .+....+++||+.+.+... .+.......+..+.++.+.+ T Consensus 479 ~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~-----------~~~~~~~~~~~~~~d~~~~a 530 (530) T protein:vir:38 479 RGDDYQEIFAQQVRESMERRAAGLNPPAWAAA-----------AFEAGVKKSNEEEQDGARAA 530 (530) T ss_pred cCCCHHHHHHHHHHHHHHHHHcCCCCCCCccc-----------ccCCCCCCCCCCCCCCCCCC Confidence 0111233445543321100 00111111111111111111 No 104 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.34 E-value=8.7e-11 Score=75.74 Aligned_cols=428 Identities=10% Similarity=0.029 Sum_probs=205.9 Q ss_pred CCCCCCCccceeeeeccc---ccchh----hhhhhccccc--------ccccccccchHHHHHHHHhcchHHHHHHHHHH Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPL---AGFGT----FLAQGLDQFE--------QVDELRWPNSVYTYTRMCREEARIASVLRAIG 65 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~---~~~~~----~~~~~~~~~~--------~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~ 65 (484) |+.+.+.+... ++++.. .-+++ ....+..... .....++. +++++. +.++.+-+..++++.- T Consensus 48 ~~~~~~~~~~~-~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~l~a-~Y~~~~l~r~iVd~~A 124 (537) T protein:vir:10 48 MAIRDHAIAMM-PKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAFI-GHQMCA-LIATHWLVNKACSQMP 124 (537) T ss_pred CCCCCccCccc-ccccccccchhccccccchhhhhhhccccccchhhhhccccCCc-cHHHHH-HHHhCchhhhhhhhhh Confidence 44444433322 111110 00000 0000000000 00000111 233343 3356899999999998 Q ss_pred HHhhCCCcEEecCCCC---HHHHHHHHHHHHhhhccchhhhhHHHhhcCCC-HHHHHHHHHHHHhhcceeeeEEEeecCC Q lcl|NC_021302. 66 LPIRRTDWRIRPNGAR---PEVVEHVAACLGLPVEGDESDKPTPRTRGRFS-WDQHLRLALKSLQFGHAVFEQTYFYEGG 141 (484) Q Consensus 66 ~~v~~~~~~v~p~~~~---~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~i~~~l~a~~~G~s~~Eivw~~~~g 141 (484) .-.++..|.|.-.+.+ ++.++.+.+.+.. .. +..+...+-.+.+||.+++=+.=...++ T Consensus 125 ~d~~r~~~~i~~~~~~~~~~~~~~~l~~~~~~-----------------l~~~~~l~~a~~~~rlyG~~~i~i~v~~~D~ 187 (537) T protein:vir:10 125 RDAMRKGYKIISDDGNELDPKDAKFIDRYDRA-----------------FNIKKHAIQFVRKGRIFGIRIALFKVDSPDP 187 (537) T ss_pred HHhhcCCceeecCCcccccHHHHHHHHHHHHH-----------------hhHHHHHHHHHHhcccccceEEEEeecCcCC Confidence 8889999999765432 2333444333321 12 3445555556789998766432121222 Q ss_pred e-------------eeeeeeeeeCccceeeeeecCCCceeeeec-ccccccccccceeccCCCCcccccccceEEEeecC Q lcl|NC_021302. 142 R-------------FWLKRLAPRPQSSIAYWNVDRDGGLISIQQ-WPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDM 207 (484) Q Consensus 142 ~-------------~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q-~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~ 207 (484) . ..++.|..++|.|+....++. +.+ -....++.+..+.. .+..+-+.+++++.... T Consensus 188 ~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~------~~~dp~sp~fg~P~~y~v----~g~~iH~SRli~f~g~~ 257 (537) T protein:vir:10 188 YYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQ------ASSNPVSMHFYEPTYWLI----NGKKYHRSHLAIYINDE 257 (537) T ss_pred cccccccccccccccceeEEEEechhhcccccchh------hhccCCccccCCceeeee----cCeEecceeEEEecCCC Confidence 1 123445555655554211110 000 01123333333332 23456677777765332 Q ss_pred ------ccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecC-CCCCCHHHHHHHHHHHHHHhcCCceEE Q lcl|NC_021302. 208 ------DPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNE-ADSEDDDRMDELLEIASNYSGGESAGL 280 (484) Q Consensus 208 ------~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~-~~~~~~~~~~~l~~~l~~~~~g~~a~~ 280 (484) ...+.+|.|++..+|....--.....--+..+.++ .++++..+- ..-.+++.+..-.+++..++.. ...+ T Consensus 258 ~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~--~~~v~k~~~~~~l~~~~~~~~r~~~~~~~r~n-~g~~ 334 (537) T protein:vir:10 258 VVDFLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTK--RQTVLKVDAAQVLANKQQFDETMSWWTATRDN-YQVR 334 (537) T ss_pred CchhhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhc--CCceeeechHHhhcCHHHHHHHHHHHHhhcCC-ccee Confidence 23456799999999877654444444445566664 455543332 1223344444445555555443 3456 Q ss_pred EccC-CceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhc--cc-ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 281 ALTA-GEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLN--LD-GKGGSYALASVQADTFVQSVQTVADEIRDVAQA 356 (484) Q Consensus 281 vip~-~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt--~~-~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~ 356 (484) +++. +.+++.+..+-++ -..+++..-+.||-+. |-.+| .+ +.+|-.|.|+--...+-+.+++.+..+...+++ T Consensus 335 ~id~e~e~~e~~~~~lsg--l~~~l~~~~~~iAa~~-~IP~t~L~G~sp~GlnatGe~D~~~yyd~I~~~Qe~l~p~l~~ 411 (537) T protein:vir:10 335 VVDKDNEDVVQIDTTLND--LDKVIMNQYQLVCAIA-RTPAPKMLGTVPTGFNSTGDYEEASYHEECESTQDDMRPLIDR 411 (537) T ss_pred EecCCCceeEEEeccCCC--HHHHHHHHHHHHHhhh-CCCceeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 6776 5889888866443 4567777777787773 33333 22 335666778878888999999888888777764 Q ss_pred HHHHHHHHhCCCCccccceEEecCCC-CcHHH-------HHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCC------ Q lcl|NC_021302. 357 HVVEDIVDVNWGEDEPAPLLVFDEIG-SRQDA-------TAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDP------ 422 (484) Q Consensus 357 qli~~l~~~Nf~~~~~~P~~~~~~~~-~~~~~-------~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~------ 422 (484) |++-|+...|+... --.|+|...- .+.++ .+++++++.+.|++. .+++|+.++...... T Consensus 412 -l~~ll~~~~~~~~~-~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~-----~~Evr~~L~~~~~~g~~~l~~ 484 (537) T protein:vir:10 412 -HHQLVCRSHLRKRI-RVKVEFPPMDAPKESERADTFLKKMQAAKLAFEMGAVD-----GVDVNEYLRMDPTLGFTSITP 484 (537) T ss_pred -HHHHHHHhcCCCCc-ceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCC-----HHHHHHHHhccCccccccccC Confidence 78877777776532 2356676532 23333 456688888888653 567888875421110 Q ss_pred ---CcccccccC-CCcCCCccccCCCCccccccccccccccccccccccchHHHhcC Q lcl|NC_021302. 423 ---DADDDESTA-DTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSPRDRRKT 475 (484) Q Consensus 423 ---~e~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 475 (484) .++...... ....+... .+..+...+. .....+..+....++..+++++ T Consensus 485 ~~~~ed~e~~~~~~~~~~~~~--~~~~~~~~~~--~~~~~~~~~~~~~~~~~a~~~~ 537 (537) T protein:vir:10 485 AMRPTDAEDIDVDDEGKPVRI--IEDQPAPSEM--FGATSSGESANDPRDSGAAFED 537 (537) T ss_pred CCChhhhhcccCCccCCcCCC--CCCCCCcccc--CCCCccccccCCCccCccccCC Confidence 000000000 00000000 0000000000 0001111111112222222222 No 105 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=99.30 E-value=6.9e-12 Score=81.79 Aligned_cols=359 Identities=12% Similarity=0.027 Sum_probs=176.0 Q ss_pred chhh-----hhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCCCH----HHHHHHHHH Q lcl|NC_021302. 21 FGTF-----LAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGARP----EVVEHVAAC 91 (484) Q Consensus 21 ~~~~-----~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~----e~~~~~~~~ 91 (484) ||.+ ......+........+.. -.++ + +-+.|.+|+..+...|.+++|.+....... ......... T Consensus 1 Mg~f~~~~~f~~~~~~~~~~~~~~~~~-~~~~--~--~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~ 75 (378) T protein:vir:93 1 MNLFGKVVSFSRGKLNNDTQRVTAWQN-EAVE--Y--TSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSD 75 (378) T ss_pred CccchhhhhhhccccCCCcceeeeccc-chhH--H--HHHHHHHHHHHHHhhhhhCceeeEEEcccccccccccccccch Confidence 3321 000111111100011111 1111 2 235799999999999999999874322111 111111111 Q ss_pred HHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecCCCceeee Q lcl|NC_021302. 92 LGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISI 170 (484) Q Consensus 92 l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~ 170 (484) +...+. .+.....+..++++.++ +.+.+|-+.+.+++....|... .+ + + T Consensus 76 l~~lL~--------~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~--~l----------~-~--------- 125 (378) T protein:vir:93 76 LDEVLN--------WSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELL--DL----------L-F--------- 125 (378) T ss_pred HHHHHh--------hcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEE--EE----------E-e--------- Confidence 111110 01112234566777664 6788999988776543333211 11 0 0 Q ss_pred ecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEE Q lcl|NC_021302. 171 QQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLK 250 (484) Q Consensus 171 ~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~ 250 (484) ...+..+|++..+++++ +. +--.|.|++..+.-. ...+.. .|.|-.+ T Consensus 126 ------------------~~~~~~~~~~diih~r~-~~-~~~~~~s~l~~~~~~----------i~~~~~---~~~~~g~ 172 (378) T protein:vir:93 126 ------------------ADDKKEYKTEELVRLTS-PF-YINEDTSILDNALAS----------IQTKLE---QGKLRGL 172 (378) T ss_pred ------------------cCCeeEeccceeEEecC-cc-ccchhhHHHHHHHHH----------HHHHHh---cCcccce Confidence 11123456666665552 21 122356666655422 122333 2444344 Q ss_pred ecCCCCCCHH----HHHHHHHHHHHHhcCCce--EEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccc Q lcl|NC_021302. 251 GNEADSEDDD----RMDELLEIASNYSGGESA--GLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDG 324 (484) Q Consensus 251 gk~~~~~~~~----~~~~l~~~l~~~~~g~~a--~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~ 324 (484) .+.+...+++ .++++.+.++++..+.++ .++++.|++++-++.+.....+ +..++..++|++++.-..-.. T Consensus 173 l~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l-- 249 (378) T protein:vir:93 173 LKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENIL-- 249 (378) T ss_pred eeeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHh-- Confidence 4444443443 344555556555555443 4778888888877765444445 345788899999976653222 Q ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hCCCCc-cccceEEecCC-CCcHHHHHHHHHHHHhcCcc Q lcl|NC_021302. 325 KGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVD-VNWGED-EPAPLLVFDEI-GSRQDATAAALQMLVNAGLL 401 (484) Q Consensus 325 ~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~-~Nf~~~-~~~P~~~~~~~-~~~~~~~ae~~~~L~~~G~~ 401 (484) +|+++. +........-+.-.++.|+..||+.|+..--. ..++.. ....+|.++.. ..|.++++++++++++.|+. T Consensus 250 -~g~~~e-~~~~~f~~~tl~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~ 327 (378) T protein:vir:93 250 -LGTATQ-EQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIF 327 (378) T ss_pred -cCCcHH-HHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCc Confidence 234332 22344455677888999999999988865322 111111 11123433333 35778999999999999976 Q ss_pred cCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccccccccc Q lcl|NC_021302. 402 TPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNA 457 (484) Q Consensus 402 ~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (484) . .+++|+.+|+|.-++++....+.--.....+....+...+..++....+- T Consensus 328 t-----~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:93 328 T-----QNQLLVKMGEQPIEGGDVYIANLNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred C-----HHHHHHHhCCCCCCCCCeeeeccccccccchhhhcCccCCCCCCCCCCCC Confidence 4 47899999998776655433221000000000000000000000000000 No 106 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.25 E-value=1.6e-10 Score=74.27 Aligned_cols=426 Identities=12% Similarity=0.027 Sum_probs=203.5 Q ss_pred CCCCCCCccc-----eeeeecccccchhhhhhhccccccccc-c--cccchHHHHHHHHhcchHHHHHHHHHHHHhhCC- Q lcl|NC_021302. 1 MAPKTVAPRT-----ERGYVNPLAGFGTFLAQGLDQFEQVDE-L--RWPNSVYTYTRMCREEARIASVLRAIGLPIRRT- 71 (484) Q Consensus 1 ~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~-l--r~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~- 71 (484) +||+..+... -.+|-....+ ....+....-.... . .+.....--+++.++++++.+++......|.+. T Consensus 11 ~sP~~~~~R~~ar~~~~~y~aa~~~---r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~g 87 (502) T protein:vir:79 11 FSPGWKAARLRSRAVIQAYEAVKTT---RTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLEERVVGKN 87 (502) T ss_pred cChHHHHHHHhhHHHHhhccccCcc---cccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhccCC Confidence 6765544321 1111111111 11111111111111 1 112223445688899999999999999999987 Q ss_pred CcEEec--CCCCHHHHH----HHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEee----cC Q lcl|NC_021302. 72 DWRIRP--NGARPEVVE----HVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFY----EG 140 (484) Q Consensus 72 ~~~v~p--~~~~~e~~~----~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~----~~ 140 (484) -+.+.| ...+....+ .+.+....+..+ -+..++.+|..+...++ ..+.-|=.++-++|.. .+ T Consensus 88 gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~-------~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~~ 160 (502) T protein:vir:79 88 GIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVS-------PEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLTP 160 (502) T ss_pred ceeeeeccCCCChhHHHHHHHHHHHHHHHhhcC-------cCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCccCC Confidence 455543 334433333 333333333221 13346778999998887 4577898999888865 34 Q ss_pred CeeeeeeeeeeCccceee-----------eeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCcc Q lcl|NC_021302. 141 GRFWLKRLAPRPQSSIAY-----------WNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDP 209 (484) Q Consensus 141 g~~~~~~l~~r~~~~~~~-----------~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~ 209 (484) |.-.+-+|..++|..+.. +++|+.|+.+...=... ..+ .....+...+|...++++-..... T Consensus 161 g~~~~l~lq~iepd~l~~~~~~~~~i~~GVe~d~~Gr~~aY~i~~~-hPg------d~~~~~~~rvpA~~vlH~f~~~r~ 233 (502) T protein:vir:79 161 SAGVHFWLEALEPDFIPMTSDESNRLNQGVFVDDWGRPEKYLVYKS-RPV------SGRQMETKEVDAERMLHLKFVRRL 233 (502) T ss_pred CcccceEEEEecchhcCCCCCCCCeeEeeeEECCCCceEEEEEeec-CCC------CCcccceeEechhheEEeecccCC Confidence 555566777777776642 22344444332210000 000 011223356777654444444558 Q ss_pred CccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHH--HH-HHHHHHHHHHhcCCceEE-EccCC Q lcl|NC_021302. 210 GVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDD--RM-DELLEIASNYSGGESAGL-ALTAG 285 (484) Q Consensus 210 ~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~--~~-~~l~~~l~~~~~g~~a~~-vip~~ 285 (484) +..-|.+.|.++.....-........+.-... ..-+... .+.+.+.+.. .. ..-......|..| +.+ .++.| T Consensus 234 gQ~RGis~lapvl~~l~~l~~~~dael~~a~i-~A~~~~f-i~~~~~~~~~~~~~~~~~~~~~~~l~pG--~i~~~L~pG 309 (502) T protein:vir:79 234 HQMRGTSLLSGVLIRLSALKEYEDSELTAARI-AAALGMY-IRKGDGQSYEPDGNGSKENERELTIQPG--IIYDDLKPG 309 (502) T ss_pred ccccCCchHHHHHHHHHHHhHHHHHHHHHHHH-hhhheee-eecCCCcccccccCCCCCccccccccCC--ccccccCCC Confidence 88899999998876655444333333332221 1111212 2211111000 00 0000001223333 223 36899 Q ss_pred ceEEEecccCCchhHHHHHHHHHHHHHHHHhh--hhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 286 EEAGILSPNGTPLDPRRAIEYHDHQMALVALA--HFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIV 363 (484) Q Consensus 286 ~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilG--qtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~ 363 (484) .+|++++++..+..|..|.+..-++|+..+.- +.||.+.. +||+.+-.-...+...++....++...+-+-+...++ T Consensus 310 e~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s-~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l 388 (502) T protein:vir:79 310 EEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYN-GTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWL 388 (502) T ss_pred ceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999998888889999999999999999533 35677653 4887776666666666666766666666555555444 Q ss_pred H---hC----CC---CccccceEEecC--CC-CcHHHHHHHHHHHHhcCcccCCcc----------------cHHHHHHH Q lcl|NC_021302. 364 D---VN----WG---EDEPAPLLVFDE--IG-SRQDATAAALQMLVNAGLLTPDPR----------------LEAFLRDA 414 (484) Q Consensus 364 ~---~N----f~---~~~~~P~~~~~~--~~-~~~~~~ae~~~~L~~~G~~~~~~~----------------~~~~i~e~ 414 (484) + ++ .+ ....+....|.. .. -|..+-+++....++.|+.....+ .+....++ T Consensus 389 ~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~ 468 (502) T protein:vir:79 389 KQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRAGGRNPDDVKRRRKAEIDENRK 468 (502) T ss_pred HHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHH Confidence 3 11 11 111121223321 11 233333455556667776432100 11112334 Q ss_pred hCCCCCCCCcccccccCCCcCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 415 AGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 415 ~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) +||+.+.+-.... ...........+.. .+ ....+ T Consensus 469 ~Gl~~~~~~~~~~----~~~~~~~~~~e~~~---~~----~~~e~ 502 (502) T protein:vir:79 469 LDLVFDTDPASDK----GGSSAATKRQEPQH---TD----DQSEE 502 (502) T ss_pred cCCCCCCCCCCCC----CCCCCCCCCCCCCC---CC----CCCCC Confidence 4554332100000 00000000000000 00 00000 No 107 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.24 E-value=3.6e-10 Score=72.34 Aligned_cols=440 Identities=13% Similarity=0.078 Sum_probs=203.4 Q ss_pred CCCCCCCccceeee----------------------------------------eccc---ccchhhhhhhccccccccc Q lcl|NC_021302. 1 MAPKTVAPRTERGY----------------------------------------VNPL---AGFGTFLAQGLDQFEQVDE 37 (484) Q Consensus 1 ~~~~~~~~~~~~~~----------------------------------------~~~~---~~~~~~~~~~~~~~~~~~~ 37 (484) ||-+.|.|..++-| -+-. .+.|+...... ....... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~-~~~~~~~ 79 (532) T protein:vir:94 1 MADTDPTPRPEITYATLQQAQRVDAKRATHTSLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRN-ALSFVEA 79 (532) T ss_pred CCCCCCCCCcceehhhhhhHhhhhhhhhhhhhhhhhhhhhhcccccccccccccccccccccccCccccccc-ccccccc Confidence 33333333322221 1100 00110000000 0000000 Q ss_pred ccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCCCH---HHHHHHHHHHHhhhccchhhhhHHHhhcCCCH Q lcl|NC_021302. 38 LRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGARP---EVVEHVAACLGLPVEGDESDKPTPRTRGRFSW 114 (484) Q Consensus 38 lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~---e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 114 (484) ..+ .++.++. +.++.+-+..++++.-.-.++..|+|.-.++++ +..+.+.+.+.. + + -| T Consensus 80 ~~~-~~~~l~a-~Y~~~~l~r~~Vd~~aed~~r~~~~i~~~~~~~~~~~~~~~i~~~~~~-------------l--~-v~ 141 (532) T protein:vir:94 80 TSW-PGFPTLA-LLAQLPEYRTMHETPADECVRAWGKITCSSKDELAADKATRITQKLEQ-------------Y--N-VR 141 (532) T ss_pred ccc-chHHHHH-HHHcCchhhhhhccchHHHhhCCceEeeCCccccchHHHHHHHHHHHh-------------h--h-HH Confidence 011 1233332 335688999999999999999999996544332 223333332211 0 1 14 Q ss_pred HHHHHHHHHHHhhcceeeeEEEeecCC----------------eeeeeeeeeeCccceeeeeecCCCceeeeeccccccc Q lcl|NC_021302. 115 DQHLRLALKSLQFGHAVFEQTYFYEGG----------------RFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTF 178 (484) Q Consensus 115 ~~~i~~~l~a~~~G~s~~Eivw~~~~g----------------~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~ 178 (484) ..+...+-.+.+||.+++=+.=.-++. ...++.|...+|.|+.--.++.+ ......+ T Consensus 142 ~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~-------dp~sp~f 214 (532) T protein:vir:94 142 TLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLLLSPSFVQRGCLIGFATIEPMWLSPNAYNAT-------DPTLPSF 214 (532) T ss_pred HHHHHHHHhhhcccceEEEEEeccCCccccccccccccccccccceeeEEEeechheecccccccc-------ccccccc Confidence 555556667889999875432111111 11234555556655432111100 0012233 Q ss_pred ccccceeccCCCCcccccccceEEEeecC------ccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEec Q lcl|NC_021302. 179 GGPGMVVMAPNSMGPAIPVEQLVVYTHDM------DPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGN 252 (484) Q Consensus 179 ~~~~~~~~~~~~~~~~lp~~k~l~~~~~~------~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk 252 (484) +.+..+... .+..+.+.+++++.... ...+.+|.|++..+|....--.....--+..+.++ .+.++.-. T Consensus 215 g~P~~y~v~---~g~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~--~~~v~k~~ 289 (532) T protein:vir:94 215 YKPDSWIAT---SGKKIHSSRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQF--SMTNLATD 289 (532) T ss_pred CCceeEEEc---cCeeeccceEEEecCCCchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhc--CCceeeec Confidence 333333222 23467788888775432 23455799999999877655444444445556654 55654322 Q ss_pred CCCCCCHHHHHHH---HHHHHHHhcCCceEEEccC-CceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhc---cccc Q lcl|NC_021302. 253 EADSEDDDRMDEL---LEIASNYSGGESAGLALTA-GEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLN---LDGK 325 (484) Q Consensus 253 ~~~~~~~~~~~~l---~~~l~~~~~g~~a~~vip~-~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt---~~~~ 325 (484) .....+.+..+.+ .+.+..++.. ...++++. +.+++.++.+-++ -+.+++..-++||-+. |-.+| ..+. T Consensus 290 ~a~~ls~~~~~~~~~r~~~~~~~~~n-~g~~~id~~~e~~e~~~~~lsg--l~~~l~~~~~~iAaa~-~IP~t~LfG~sp 365 (532) T protein:vir:94 290 MAQLLAPGGAQSLDARLQLFNLYRDN-RNIGALDKGTEEIQQTNTPLSG--LDSLQAQSQEQMAAVS-HIPLVKLLGITP 365 (532) T ss_pred hHHhhcchhHHHHHHHHHHHHhhcCC-ccceEEcCCCceeEEEecccCC--HHHHHHHHHHHHHhHh-CCCeeeeecCCc Confidence 1111111122333 2333333332 34567775 4788888765333 4667777777888663 33333 2233 Q ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCC-CcHH-------HHHHHHHHHHh Q lcl|NC_021302. 326 GGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIG-SRQD-------ATAAALQMLVN 397 (484) Q Consensus 326 gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~-~~~~-------~~ae~~~~L~~ 397 (484) +|-.|.|+--...+-+.+++-......-+.+.|++.|+...|+...+--.|+|...- .+.+ ..+++++++.+ T Consensus 366 ~GlnstGe~D~~~yyd~I~s~Qe~~l~p~le~l~~~l~~s~~g~~~~d~~~~f~pL~~~s~kEkAei~~~~a~a~~~~~~ 445 (532) T protein:vir:94 366 NGLNASSDGEIRVWYDFIAGYQATNLTPLMEWIIDLIQLSEYGQIDPGLAWEWSPLMELDDKELAEVRQLNASTDSTLME 445 (532) T ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHh Confidence 454566676778889999988766555555668887776667654332367776432 2233 35677788888 Q ss_pred cCcccCCcccHHHHHHHhCCCCCCCCccccc------ccCCCcCC-Ccc-ccCCCCccccccc-cccccccccccccccc Q lcl|NC_021302. 398 AGLLTPDPRLEAFLRDAAGLPGPDPDADDDE------STADTGQD-EPE-TDEPALPNTSGTT-STTNAPQARKRPRGRS 468 (484) Q Consensus 398 ~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~------~~~~~~~~-~~~-~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 468 (484) .|++ +.+.+++.++............ .+...... ... .+.+.......++ ......++...+.+.. T Consensus 446 ~Gvi-----~~~Evr~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~ 520 (532) T protein:vir:94 446 LGVI-----DAKMVQQRLAADPTSGYAGALGERDELDDVEEIAKQLMAAALNPPATAPQTPNPQPDSEDDQTDNQPDAQA 520 (532) T ss_pred cCCC-----CHHHHHHHHhcCCccccccccccccccccccchhhhhcccccCCCCCCCCCCCCCCCCCCCCCCCccCCCc Confidence 8864 4578999988643211111100 00000000 000 0000000000000 0111112222222222 Q ss_pred hHHHhcCcccCc Q lcl|NC_021302. 469 PRDRRKTPDGAM 480 (484) Q Consensus 469 ~~~~~~~~~~~~ 480 (484) .+-.+..|-|-. T Consensus 521 ~~~~~~~~~~~~ 532 (532) T protein:vir:94 521 DPAQNDQPVGNR 532 (532) T ss_pred cccccCCCcCCC Confidence 222333333333 No 108 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.24 E-value=3.9e-10 Score=72.18 Aligned_cols=413 Identities=13% Similarity=0.096 Sum_probs=209.3 Q ss_pred CCCCCCCcccee-----eeecccccchhh---hhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCC Q lcl|NC_021302. 1 MAPKTVAPRTER-----GYVNPLAGFGTF---LAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTD 72 (484) Q Consensus 1 ~~~~~~~~~~~~-----~~~~~~~~~~~~---~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~ 72 (484) |+.-..++...+ ++-+-..++|+. ........-.+..+ .+..++.+-+.++-+..++++--.-.++.. T Consensus 1 ~~~~~~a~~~~~~~~a~~~~~~~~~~g~~~~~d~~~~~~~~~~~~~----~~~~l~~lY~~~~l~r~iVd~~a~d~~r~g 76 (461) T protein:vir:80 1 MYSIDKAKQAKIDSKIVNRNDFMVGHGKANSRDKLTRQTPGNGQKL----DLKACENLYASNSIAMNIVDIISEDMVRAG 76 (461) T ss_pred CccchhhhhhhhhhhhhhhhHHHhhcCCcchhhhhhccccCccccc----CHHHHHHHHHhCCccchhhccchHHhhcCC Confidence 655544443321 111111111110 00000000011111 233344444678888889999888888888 Q ss_pred cEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCeee-------- Q lcl|NC_021302. 73 WRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGRFW-------- 144 (484) Q Consensus 73 ~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~~~-------- 144 (484) |.|.. ++++..+.+.+.+... + -|..+...+-.+..||.+.+=+.=. +++.+. T Consensus 77 ~~i~~--~~~~~~~~~~~~~~~l---------------~-~~~~l~~~~~~~rl~G~a~i~i~v~-d~~~~~~~~~~pl~ 137 (461) T protein:vir:80 77 WSLKT--DNKEMKKNIESKWRKL---------------K-TKDRFQKLYADKRLYGDGFLSIGVV-SSNREQADLSTAID 137 (461) T ss_pred eeeec--CCHHHHHHHHHHHHHh---------------h-HHHHHHHHHHhhcccccEEEEEEee-cCCccccCccCCcc Confidence 88863 4555555554443211 1 2566677777899999987655321 111111 Q ss_pred ---eeeeeeeCccceeeeeecCCCceeeeecc-cccccccccceeccC-------------CCCcccccccceEEEeecC Q lcl|NC_021302. 145 ---LKRLAPRPQSSIAYWNVDRDGGLISIQQW-PAGTFGGPGMVVMAP-------------NSMGPAIPVEQLVVYTHDM 207 (484) Q Consensus 145 ---~~~l~~r~~~~~~~~~~~~dg~l~~~~q~-~~~~~~~~~~~~~~~-------------~~~~~~lp~~k~l~~~~~~ 207 (484) +..|..+.+.|..... .. ...+. ....++.+..+.... ......+.+.+++++.... T Consensus 138 ~~~~~~~~~l~~~~~~~i~----~~--~~~~dp~sp~fg~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SRii~~~~~~ 211 (461) T protein:vir:80 138 PKTIKSIPYINTFNTQKVT----QL--YLNQDMFSEHFGEVEFFEVNRVSQLGEEILSGTTASTSEQIHRSRIIHEQGLR 211 (461) T ss_pred cccccceeEEEeccccccc----hh--hhcccCcCcccccceEEEEeccccccccccccccCccceEEccccEEEecCCC Confidence 1122222221111000 00 00011 112333333333322 2334567788888888887 Q ss_pred ccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCce Q lcl|NC_021302. 208 DPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEE 287 (484) Q Consensus 208 ~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ 287 (484) -.+..+|.|++..++....--......=++.+.++ .+++..-+-......+....+.+.+..+..+ .+.+++..+.+ T Consensus 212 ~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~--~~~v~k~~~l~~~~~~~~~~~~~~~~~~~~~-~g~~~~d~~e~ 288 (461) T protein:vir:80 212 FEGETKGRSIFESLYDIITVMDTSLWSVGQILYDF--AFKVYKTDDIDALNKDDKANLTAMLDFMFRT-EALAIIKGDEQ 288 (461) T ss_pred CCccccCcchHHHHHHHHHHHHHHHHHHHHHHHHh--CCCceecchHHhhhchHHHHHHHHHHHhcCC-ceEEEEcCCcc Confidence 77888999999999988766555665556677665 5566544321111223334455555555544 45778899999 Q ss_pred EEEecccCCchhHHHHHHHHHHHHHHHHhhhhhc-ccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_021302. 288 AGILSPNGTPLDPRRAIEYHDHQMALVALAHFLN-LDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVN 366 (484) Q Consensus 288 ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt-~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~N 366 (484) ++.++.+-+ ..+.+++..-.+|+-+.--+..- .+...|..|.|+-......+.+++.+.....-+.+.|++.|+.-- T Consensus 289 ~e~~~~~ls--gl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asge~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~ 366 (461) T protein:vir:80 289 LTKESTNVS--GMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGAQYDVMNYYARVSSIQENRLRPQLEYLTRLLMWAS 366 (461) T ss_pred eEEEecCcC--CHHHHHHHHHHHHhhhhcCCeeeeecccCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 998887644 35667777777888774333211 122235677788778889999999987655555566888776543 Q ss_pred CC--Cc-cc---cceEEecCCC-CcHH-------HHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCC Q lcl|NC_021302. 367 WG--ED-EP---APLLVFDEIG-SRQD-------ATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTAD 432 (484) Q Consensus 367 f~--~~-~~---~P~~~~~~~~-~~~~-------~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~ 432 (484) ++ +. .+ --.|+|.+.- .+.+ ..+++++++++.|++.+. ...+.++.++++.++..-..+.... . T Consensus 367 ~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~is~~-e~r~~l~~~~~~~~~~~~~~~~~~~-~ 444 (461) T protein:vir:80 367 DDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVNGVLDPD-EVKETRFGRFGLENSSKFSGDSAEI-D 444 (461) T ss_pred cccccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHH-HHHHHHHHhcCCCCCccCCCCCchh-h Confidence 33 11 11 1256776532 2333 345778888899975432 1223344566664432111000000 0 Q ss_pred CcCCCccccCCCCccccccc Q lcl|NC_021302. 433 TGQDEPETDEPALPNTSGTT 452 (484) Q Consensus 433 ~~~~~~~~~~~~~~~~~~~~ 452 (484) . ........+. ...+++ T Consensus 445 ~-~~~~~~~~~~--~e~~~g 461 (461) T protein:vir:80 445 K-LAKLVYDAYA--KKNADG 461 (461) T ss_pred h-hhhhcccccc--ccCCCC Confidence 0 0000000001 111111 No 109 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.17 E-value=7.7e-10 Score=70.54 Aligned_cols=460 Identities=13% Similarity=0.060 Sum_probs=210.7 Q ss_pred CCCCCCCcc-----ceeeeecccccchhhhhhhccccccc-cccc--ccchHHHHHHHHhcchHHHHHHHHHHHHhhCC- Q lcl|NC_021302. 1 MAPKTVAPR-----TERGYVNPLAGFGTFLAQGLDQFEQV-DELR--WPNSVYTYTRMCREEARIASVLRAIGLPIRRT- 71 (484) Q Consensus 1 ~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~-~~lr--~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~- 71 (484) +||....+. .-.+|-....+ ....++...... .+++ +.....--+++.+++++++++++.....|.+. T Consensus 11 ~sP~~a~~R~~ar~~~~~y~aa~~~---r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG~~ 87 (548) T protein:vir:95 11 LAPELVARRLAAREAIQAYEAARPG---RTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLEERVVGGS 87 (548) T ss_pred cchHHHHHHHHhHHHhccccccCcc---ccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhccCcc Confidence 666544322 11122111111 111222221111 1111 22334556788899999999999999999984 Q ss_pred CcEEec--CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecC----Ceee Q lcl|NC_021302. 72 DWRIRP--NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEG----GRFW 144 (484) Q Consensus 72 ~~~v~p--~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~----g~~~ 144 (484) -+.+.| -+.+++.++...+.+...+..-..+ -+...+.+|..+.+.++ ..+.-|=+++-+.|.... |... T Consensus 88 G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~---~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~~g~~~ 164 (548) T protein:vir:95 88 GIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLS---PETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNYTFATSV 164 (548) T ss_pred ccceeeeecCCCHHHHHHHHHHHHHHHHHhhcC---ccccccCCHHHHHHHHHHHHHhCCceEEEeeecccccccCCccc Confidence 455554 4455554444444443332211111 12345678999998877 457789888888897643 3344 Q ss_pred eeeeeeeCccceee------------eeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCcc Q lcl|NC_021302. 145 LKRLAPRPQSSIAY------------WNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVW 212 (484) Q Consensus 145 ~~~l~~r~~~~~~~------------~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p 212 (484) +-+|..++|..+.. +++|..|+.+...=... ..+. ............+|...++|+-...+.+.. T Consensus 165 ~~~lqliepd~l~~~~~~~~~~i~~GIE~D~~Grp~aY~i~~~-hPgd--~~~~~~~~~~~rvpA~~VlHif~~~r~gQ~ 241 (548) T protein:vir:95 165 PFALELLEPDYLPFSYNNLSKGIVQGIERDTWRRKRAYHLLKD-HPGN--LQTLGGSLAVKRVEAERIIHIAYRKRIGQN 241 (548) T ss_pred ceEEEEechhhcCCCCCCCCCceeeeeEECCCCceEEEEEeec-CCCc--ccccccccceeeechhHheecccccCCccc Confidence 55677777776642 22222333221110000 0000 000111223445777665544444568888 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCH--HHHHHHHHHHHHHhcCCceEE-EccCCceEE Q lcl|NC_021302. 213 TGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDD--DRMDELLEIASNYSGGESAGL-ALTAGEEAG 289 (484) Q Consensus 213 ~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~--~~~~~l~~~l~~~~~g~~a~~-vip~~~~ie 289 (484) -|.+.|.++.....-........++-... ..-+... .+.+..... +.-..-......+..| ..+ .++.|.+|+ T Consensus 242 RGvs~lapvl~~l~~l~~y~dael~~aki-~A~~a~f-i~~~~~~~~~~~~~~~~~~~~~~~~pG--~iv~~L~pGe~i~ 317 (548) T protein:vir:95 242 RGVPMLHAVLIRLADLKDYEESERVAARI-SAALAMY-IKKGNPDSYTVEPGKDRKNRTIPIAPG--MVFDDLEPGEDVG 317 (548) T ss_pred cCcchHHHHHHHHHHHhHHHHHHHHHHHH-hhhheee-eecCCCccccCCCCcccccccccccCC--ccccccCCCceee Confidence 99999999876665444444443333221 1111222 222111100 0000000011223333 223 378899999 Q ss_pred EecccCCchhHHHHHHHHHHHHHHHHhh--hhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-- Q lcl|NC_021302. 290 ILSPNGTPLDPRRAIEYHDHQMALVALA--HFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDV-- 365 (484) Q Consensus 290 ~~~~~~~~~~~~~li~~~d~~Isk~ilG--qtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~-- 365 (484) +++++..+..|..|.+..-+.|+..+.- +.||.+.. +||+.+-.-..-+...++....++...|-+-+..++++. T Consensus 318 ~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s-~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~ 396 (548) T protein:vir:95 318 MIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYD-GTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYL 396 (548) T ss_pred ecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9998877889999999999999999522 45676753 588776665555555666665555555555444443331 Q ss_pred -C----CCC---ccccceEEecC--CC-CcHHHHHHHHHHHHhcCcccCCcc----------------cHHHHHHHhCCC Q lcl|NC_021302. 366 -N----WGE---DEPAPLLVFDE--IG-SRQDATAAALQMLVNAGLLTPDPR----------------LEAFLRDAAGLP 418 (484) Q Consensus 366 -N----f~~---~~~~P~~~~~~--~~-~~~~~~ae~~~~L~~~G~~~~~~~----------------~~~~i~e~~glp 418 (484) + .+. ...+-...|.. .. -|..+-+++...+++.|+.....+ .+....+++||+ T Consensus 397 l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~~G~D~~ev~~q~a~E~~~~~~~GL~ 476 (548) T protein:vir:95 397 LARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEAEVARARGRDPRELKKSRETEIKANRAAGLV 476 (548) T ss_pred HcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCC Confidence 1 111 11111222311 11 233333455555566665332110 122245667776 Q ss_pred CCCCCcccccccCCCcCC--CccccCC----CCc--------cccccccccccccc--cccccccchHHHhcCc Q lcl|NC_021302. 419 GPDPDADDDESTADTGQD--EPETDEP----ALP--------NTSGTTSTTNAPQA--RKRPRGRSPRDRRKTP 476 (484) Q Consensus 419 ~p~~~e~~~~~~~~~~~~--~~~~~~~----~~~--------~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~ 476 (484) .+.+..... ...+..+ ++++... ..+ +.-|.+.+..++.- -....+++..-.|-+| T Consensus 477 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 548 (548) T protein:vir:95 477 FSSDAYHQL--VKSGMDPVEAVQKVYLGVGKMLTADEARELVNRYGAGLPVPGPDFPNESNNGGADGQPSNPDP 548 (548) T ss_pred CCCcccccc--cccccCCCCchhhhccccccccccchhHHhhccCCCCCcCCCCCCCcccccCCCCCCCCCCCC Confidence 543221110 0000000 0000000 000 00011111111100 0001111111112222 No 110 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.16 E-value=1e-09 Score=69.92 Aligned_cols=395 Identities=14% Similarity=0.115 Sum_probs=192.5 Q ss_pred CCCCCCCccceeeeecccccc-hhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGF-GTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNG 79 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~ 79 (484) ||.++-.-.+.-|+.+...+. |+.. ..... ..+ -.......|-++.+-+..++++--.-.++..|+|+-. T Consensus 5 m~~~~~~~~~~D~~~~~~~~~~g~~~------~~~~~-~~~-~~~~~l~~~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~- 75 (435) T protein:vir:79 5 MSDKVKAITKEDGYNEIFGSKDGTFR------PNAFY-MQR-AAFKALSQFYEEDGMARRIVDVIPEEMVTPGFKVDGV- 75 (435) T ss_pred cccccccchhhcchhhhhcccccccc------cCccc-CCc-CCHHHHHHHHhcCchhhhhhccchHHhhcCCceecCC- Confidence 999865544555655543221 2111 01000 011 1122233443578888999998888888888888632 Q ss_pred CCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCe--------eeeeeeeee Q lcl|NC_021302. 80 ARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGR--------FWLKRLAPR 151 (484) Q Consensus 80 ~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~--------~~~~~l~~r 151 (484) ++.+ .....+.. + + -++.+...+-.+.+||++.+=+.=. ++.. -.++.|... T Consensus 76 ~~~~---~~~~~~~~-------------l--~-~~~~l~~a~~~~rl~G~~~i~i~~~-d~~~~~~Pl~~~g~i~~i~v~ 135 (435) T protein:vir:79 76 KNEK---SFKSRWDE-------------L--R-LNAKIIDALSWSRLFGGSAILAVVA-DNKMLKSPVKPGAQLEDIRVY 135 (435) T ss_pred ChHH---HHHHHHHH-------------h--h-HHHHHHHHHHhhhccccEEEEEEec-CCCCcccccccCCceeeEEee Confidence 1222 12211110 0 1 1455555666799999986655321 1111 123345555 Q ss_pred CccceeeeeecCCCceeeeecccccccccccceeccCC--CCcccccccceEEEee------cCccCccccchhH-HHHH Q lcl|NC_021302. 152 PQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPN--SMGPAIPVEQLVVYTH------DMDPGVWTGNSLL-RPAY 222 (484) Q Consensus 152 ~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~--~~~~~lp~~k~l~~~~------~~~~~~p~G~gll-~~~~ 222 (484) ++.++.--.++.| -....++.+..+..... ..+..+.+.+++++.. ....++++|.|.| +.+| T Consensus 136 d~~~i~~~~~~~d--------p~sp~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~e~~~ 207 (435) T protein:vir:79 136 DRYQITIHERETN--------ARSVRYGEPKLYKISPGGDIPEFFVHYSRICIIDGERVSNEKRRQNDGWGASILNKRLI 207 (435) T ss_pred chhhccchhhccC--------CcccccCcceEEEEecCCCCCceEEcceeEEEecCCcchhhhccccCcccchHHHHHHH Confidence 5544421111110 11223444444444322 2345677778777752 2346789999965 7888 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCcceEEecC-----CCCCCHHHH-HHHHHHHHHHhcCCceEEEccCCceEEEecccCC Q lcl|NC_021302. 223 KNWKLKDELIRIEAAAIRRHGIGVPYLKGNE-----ADSEDDDRM-DELLEIASNYSGGESAGLALTAGEEAGILSPNGT 296 (484) Q Consensus 223 ~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~-----~~~~~~~~~-~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~ 296 (484) ....--......=+..+.|+ .+.+...+- +......+. .++ +.+...++...+.++...+.+++.++.+-+ T Consensus 208 ~~l~~~~~~~~~~~~l~~~~--~~~v~~~~~l~~~~~~~~~~~~~~~r~-~~~~~~~~~~~~~~i~~~~e~~e~~~~~ls 284 (435) T protein:vir:79 208 EAIVDYNYCQELATQLLRRK--QQAVWKARDLALMCDDEEGRYAARLRL-AQVDDESGVGKAIGIDATDEEYEVLNSDVS 284 (435) T ss_pred HHHHHHHHHHHHHHHHHHHh--cCccccchhHHHhhcCccchHHHHHHH-HHHHHhcCCCCceeEecCCcceEEEecccC Confidence 76655555555556666664 445443321 111112111 222 223333333335556666678998886543 Q ss_pred chhHHHHHHHHHHHHHHHHhhhhhc--ccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccc Q lcl|NC_021302. 297 PLDPRRAIEYHDHQMALVALAHFLN--LDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAP 374 (484) Q Consensus 297 ~~~~~~li~~~d~~Isk~ilGqtlt--~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P 374 (484) ....+++..-.+||.+.--+..- ..+.+|-.|.|+--...+-+.+++.+.....-+.+.|++-++. + .+ - T Consensus 285 --gl~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~li~~-s---~d--~ 356 (435) T protein:vir:79 285 --GVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNTALETFYKLIDRKRVEDYKPILEFLLPFMIS-E---TE--W 356 (435) T ss_pred --CHHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-C---CC--C Confidence 35677777777888875443311 2233444466666677788888887655444333334443321 1 11 2 Q ss_pred eEEecCCC-Cc-------HHHHHHHHHHHHhcCcccCCcccHHHHHHHh-C-CCCCCCCcccccccCCCcCCCccccCCC Q lcl|NC_021302. 375 LLVFDEIG-SR-------QDATAAALQMLVNAGLLTPDPRLEAFLRDAA-G-LPGPDPDADDDESTADTGQDEPETDEPA 444 (484) Q Consensus 375 ~~~~~~~~-~~-------~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~-g-lp~p~~~e~~~~~~~~~~~~~~~~~~~~ 444 (484) .|+|...- .+ .+..+++++++.+.|+.. .+.+++.+ . .|.-.-...... .-++++...+. T Consensus 357 ~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g~i~-----~~e~r~~L~~~~~~~~~~~~~~~-----~~~~~~d~~~~ 426 (435) T protein:vir:79 357 SIEFEPLSVPSDKDKAEIMAKNVESVVKLKAEQAIN-----LKETRDTLRSICPDLKIMDNDNI-----ELPEPEDLDPE 426 (435) T ss_pred eEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCC-----HHHHHHHHHHhccccCCCCcccc-----cCCccccCCCC Confidence 55665422 12 245678888899999754 35566655 1 111111100000 01111222222 Q ss_pred Ccccccccc Q lcl|NC_021302. 445 LPNTSGTTS 453 (484) Q Consensus 445 ~~~~~~~~~ 453 (484) .+...|... T Consensus 427 ~~~e~g~~~ 435 (435) T protein:vir:79 427 PGQEGGLNK 435 (435) T ss_pred CCCCCCCCC Confidence 222222222 No 111 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=99.16 E-value=1.1e-10 Score=75.16 Aligned_cols=359 Identities=11% Similarity=-0.001 Sum_probs=167.8 Q ss_pred eeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCCCH----HHHHH Q lcl|NC_021302. 12 RGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGARP----EVVEH 87 (484) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~----e~~~~ 87 (484) =+..+..+ +.+.........+.....++.+ + | +-+.|.+|+..+-..|.++++.+.....+. ...+. T Consensus 1 M~~f~k~~---~~~~~~~~~~~~~~~~~~~~~~--~--~--~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~ 71 (378) T protein:vir:85 1 MNLFGKVV---SFSRGKLNNDTQRVTAWQNEAV--E--Y--TSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISM 71 (378) T ss_pred Cchhhhhh---hhhhcccccCCcceeeeeccch--h--h--hhHHHHHHHHHHHHhHhhCceeEEEEecccccccccccc Confidence 11111000 0000000000010000111212 1 2 235799999999999999999875322111 00011 Q ss_pred HHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecCCCc Q lcl|NC_021302. 88 VAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGG 166 (484) Q Consensus 88 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~ 166 (484) ....+...+. .+-....+..++...++ +.+.+|-+.+.+++...+|.+.- ..+.. T Consensus 72 ~~~~l~~lL~--------~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~-------------~~~~~--- 127 (378) T protein:vir:85 72 AGSDLDEVLN--------WSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSETGELLD-------------LLFAN--- 127 (378) T ss_pred ccchHHHHHh--------ccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCCCceEEE-------------EEecC--- Confidence 1111111110 00112224555666554 57779999888777655553311 00111 Q ss_pred eeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Q lcl|NC_021302. 167 LISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGV 246 (484) Q Consensus 167 l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~ 246 (484) .+....+...|+|+..-..+.+ .+.+..+.- .-..+.. .|. T Consensus 128 ------------------------~~~~~~~~dvih~~~~~~~~~~--~~~~~~a~~----------~~~~~~~---~~~ 168 (378) T protein:vir:85 128 ------------------------DKKEYKPEELVRLVSPFYINED--TSILDNALA----------SIQTKLE---QGK 168 (378) T ss_pred ------------------------CCEEEcccceEEEecCcCccch--hhHHHHHHH----------HHHHHHh---cCC Confidence 1122334556666532222211 233332221 1112222 245 Q ss_pred ceEEecCCCCCCHHHH----HHHHHHHHHHhcCCc--eEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_021302. 247 PYLKGNEADSEDDDRM----DELLEIASNYSGGES--AGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFL 320 (484) Q Consensus 247 P~~~gk~~~~~~~~~~----~~l~~~l~~~~~g~~--a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtl 320 (484) |-.+.+.+...+++.. +.+.+.+.+...+.+ ..++++.|++++-++.+....++ ...++..++|++++.-..- T Consensus 169 ~~g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~ 247 (378) T protein:vir:85 169 LRGLLKINAFLDIDNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIELIKSELLTGYFMNEN 247 (378) T ss_pred cceEEEeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceecCCCceEEeccCChhhhhH-HHHHHHHHHHHHHhCCCHH Confidence 5444555554454443 334444444444333 35788899998877765544455 3457888999998766532 Q ss_pred cccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hCCCCccccceEEecC--C-CCcHHHHHHHHHHHH Q lcl|NC_021302. 321 NLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVD-VNWGEDEPAPLLVFDE--I-GSRQDATAAALQMLV 396 (484) Q Consensus 321 t~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~-~Nf~~~~~~P~~~~~~--~-~~~~~~~ae~~~~L~ 396 (484) .. +|+++. +-.......-+.-.++.|+..||+.|+.+--. ..++.. ..-++.|+. . ..|.++.++++.++. T Consensus 248 ~l---~~s~~e-~~~~~f~~~tL~P~~~~ie~~l~~kLl~~~er~~~~~~~-~~~~~~f~~~~l~~~d~~~~~~~~~~~~ 322 (378) T protein:vir:85 248 IL---LGTATQ-EQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNL-YYERIIVDNQLFKFATLKELIDLYHENI 322 (378) T ss_pred Hh---cCCchH-HHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhcc-ccceeeecchhhhhcCHHHHHHHHHHHH Confidence 22 244432 22233455566777888888888877754211 111111 111344542 2 357889999999999 Q ss_pred hcCcccCCcccHHHHHHHhCCCCCCCCccccccc--CCCcCCCcccc--CCCCcccccccc Q lcl|NC_021302. 397 NAGLLTPDPRLEAFLRDAAGLPGPDPDADDDEST--ADTGQDEPETD--EPALPNTSGTTS 453 (484) Q Consensus 397 ~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~--~~~~~~~~~~~--~~~~~~~~~~~~ 453 (484) +.|+.. .+++|+++|+|.-++++....+. .........+. ....++.++.+. T Consensus 323 ~~G~~T-----~NE~R~~lgl~p~~gGD~~~~~~N~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:85 323 NGPIFT-----QNQLLVKMGEQPIEGGDIYIANLNAVAVKNLSDLQGSRKDVASTDETNNQ 378 (378) T ss_pred hCCCcC-----HHHHHHHhCCCCCCCCCeEeecccccccccchhhcCccCCCCCCCCCCCC Confidence 999864 47899999998766655433221 00000000000 000001111111 No 112 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.15 E-value=1.2e-09 Score=69.46 Aligned_cols=440 Identities=10% Similarity=0.006 Sum_probs=198.3 Q ss_pred CCCCCCCccceeeeecccccch----hhhhhhcc--c-cccc-cccc--ccchHHHHHHHHhcchHHHHHHHHHHHHhhC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFG----TFLAQGLD--Q-FEQV-DELR--WPNSVYTYTRMCREEARIASVLRAIGLPIRR 70 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~--~-~~~~-~~lr--~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~ 70 (484) ++|..+....... ...+.+. .....++. + .... .+++ +.....--+++.+++++++++++.....|.+ T Consensus 15 i~~~~~~~~~~~~--~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG 92 (505) T protein:vir:96 15 VNWAWYRYVEPQK--NAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYAKRFYQLLKNNVIG 92 (505) T ss_pred cchhhhhhHHHHH--HhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhcC Confidence 3332221111100 0001111 11112221 0 0110 1111 2233455678889999999999999999998 Q ss_pred C-CcEEecC------CCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHH-HHhhcceeeeEEEeecCCe Q lcl|NC_021302. 71 T-DWRIRPN------GARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALK-SLQFGHAVFEQTYFYEGGR 142 (484) Q Consensus 71 ~-~~~v~p~------~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~-a~~~G~s~~Eivw~~~~g~ 142 (484) . -+.+.+. +.+++..+.+.+....+....+ -+..++.+|..+...++. .+.-|=+++-++|. .+. T Consensus 93 ~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~-----~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~--~~~ 165 (505) T protein:vir:96 93 PKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGN-----CDVTGRYHFVTLLHLWMETLARDGEVLVREHRG--YPN 165 (505) T ss_pred CCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcC-----cceeccCCHHHHHHHHHHHHhhCCceEEEEeec--CCC Confidence 4 6777653 2244555556555555432111 134466789998888775 45667666554443 333 Q ss_pred eeeeeeeeeCccceeeeee--cCCCcee--eeecccccccccccceec----------cCCCCcccccccceEEEeec-C Q lcl|NC_021302. 143 FWLKRLAPRPQSSIAYWNV--DRDGGLI--SIQQWPAGTFGGPGMVVM----------APNSMGPAIPVEQLVVYTHD-M 207 (484) Q Consensus 143 ~~~~~l~~r~~~~~~~~~~--~~dg~l~--~~~q~~~~~~~~~~~~~~----------~~~~~~~~lp~~k~l~~~~~-~ 207 (484) -.+-+|..++|..+..... ..+|+.+ ++.-...+......+... .......-+|... |+|.++ . T Consensus 166 ~~~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~hPgd~~~~~~~~~~~~~rvpa~~-vlH~f~~~ 244 (505) T protein:vir:96 166 KWGYALQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHLLVNHPGDNSYCYHYAGQTYERVPADE-IIHTFVPW 244 (505) T ss_pred CcceEEEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEEeecCCCccccccccccccccccCHhH-hhhhhccc Confidence 2334567777766643211 1122211 222111111111111100 1112233466554 555554 4 Q ss_pred ccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCC-CHHHHHHHHHHHHHHhcCCceEEEccCCc Q lcl|NC_021302. 208 DPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSE-DDDRMDELLEIASNYSGGESAGLALTAGE 286 (484) Q Consensus 208 ~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~-~~~~~~~l~~~l~~~~~g~~a~~vip~~~ 286 (484) ..+..-|.+.|.++.....-........+.-...- .=+... .+.+.+. .....+.-......|..| .+..++.|. T Consensus 245 r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~-A~~a~f-i~~~~~~~~~~~~~~~~~~~~~l~pG--~i~~L~pGe 320 (505) T protein:vir:96 245 RPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELG-AKKVGF-YEQDPEAYDQPPEDDQGEIVEEVEAG--TYQLLPYGI 320 (505) T ss_pred CCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHh-hhheee-eecCCccCCCccccccCccccccCCc--eeeecCCCC Confidence 58888899999988766654444444444332211 111222 2221111 111111111223455555 467789999 Q ss_pred eEEEecccCCchhHHHHHHHHHHHHHHHHhh--hhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 287 EAGILSPNGTPLDPRRAIEYHDHQMALVALA--HFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVD 364 (484) Q Consensus 287 ~ie~~~~~~~~~~~~~li~~~d~~Isk~ilG--qtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~ 364 (484) +|++++++..+..|..|.+..-++|+..+.- +.||.+-.+.||+.+-.-..-+...++.....+...+-+-+..++++ T Consensus 321 ~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~ 400 (505) T protein:vir:96 321 RFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLIS 400 (505) T ss_pred eeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999998887889999999999999999532 34666644567776655555555555555555555454444444333 Q ss_pred h----C---CCC--ccccceEEecC--CC-CcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccc---cc Q lcl|NC_021302. 365 V----N---WGE--DEPAPLLVFDE--IG-SRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDD---ES 429 (484) Q Consensus 365 ~----N---f~~--~~~~P~~~~~~--~~-~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~---~~ 429 (484) . + .+. ...+....|-. .. -|..+-+++....++.|+... ++.+++ .|....+-.+... .. T Consensus 401 ~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~----~~~~a~-~G~D~~~v~~q~a~e~~~ 475 (505) T protein:vir:96 401 MSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNRTRSR----SSIIRA-AGDDPEDVFDEIAWEEQL 475 (505) T ss_pred HHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcCCCCH----HHHHHH-cCCCHHHHHHHHHHHHHH Confidence 1 1 111 11121233321 11 244444556666777776532 222322 3432111000000 00 Q ss_pred cCCCcCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 430 TADTGQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 430 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) ....+-.......+.......+....+..+ T Consensus 476 ~~~~Gl~~~~~~~~~~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 476 MRDKGVNPTPPEQESKDATTDEEDDSASDD 505 (505) T ss_pred HHHcCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 000000000000000000000000000000 No 113 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=99.15 E-value=2.7e-10 Score=73.01 Aligned_cols=364 Identities=12% Similarity=0.004 Sum_probs=169.1 Q ss_pred eeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCC----CCHHHHHH Q lcl|NC_021302. 12 RGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNG----ARPEVVEH 87 (484) Q Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~----~~~e~~~~ 87 (484) =|..+..+. ++.........+.....+..+. + . =+.|.+|+......|.++++.+.-.. ...+..+. T Consensus 1 M~if~~~~~---~~~~~~~~~~~~~~~~~~~~~~----~-~-~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~ 71 (378) T protein:vir:94 1 MNLFGKVVS---FSRGKLNNDTQRVTAWQNEAVE----Y-T-SAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISM 71 (378) T ss_pred CchhHHhHh---hhhcccccCcceeeeeecchhh----h-h-hHHHHHHHHHHHHhHhhCceeeeeeccccccccccccc Confidence 111111100 0000000011111111122221 2 2 25799999999999999999763211 11111111 Q ss_pred HHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecCCCc Q lcl|NC_021302. 88 VAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGG 166 (484) Q Consensus 88 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~ 166 (484) ....+...+. .+.....+..++.+.++ +.+.+|.+.+-.+|...+|.+. .+ +.+ . T Consensus 72 ~~~~l~~lLn--------~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~--~~----------~~~-~--- 127 (378) T protein:vir:94 72 AGSDLDEVLN--------WSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETGELL--DL----------LFA-N--- 127 (378) T ss_pred ccchHHHHHh--------hcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCCcEE--EE----------EEe-c--- Confidence 1111111100 01112234556666554 5667898877666654444321 00 000 0 Q ss_pred eeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Q lcl|NC_021302. 167 LISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGV 246 (484) Q Consensus 167 l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~ 246 (484) .+..++.+..+++++.. +.+. +.+++..+.-.. ..+.. .|. T Consensus 128 ------------------------~~~~~~~~dvih~~~~~-~~~~-~~~~~~~~~~~~----------~~~~~---~~~ 168 (378) T protein:vir:94 128 ------------------------DKKEYKPEELVRLTSPF-YINE-DTSILDNALASI----------QTKLE---QGK 168 (378) T ss_pred ------------------------CcEEechhceeeecCcC-Cccc-chhHHHHHHHHH----------HHHHh---hCC Confidence 12235555655554322 2221 345555543211 11112 133 Q ss_pred ceEEecCCCCCCH----HHHHHHHHHHHHHhcCCce--EEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_021302. 247 PYLKGNEADSEDD----DRMDELLEIASNYSGGESA--GLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFL 320 (484) Q Consensus 247 P~~~gk~~~~~~~----~~~~~l~~~l~~~~~g~~a--~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtl 320 (484) |-.+.+.+...++ +.++++.+.+++...+.++ .++++.|++++-++.+...... +..++..++|++++.-..- T Consensus 169 ~~g~l~~~~~l~~~~~~~~~e~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgvPp~ 247 (378) T protein:vir:94 169 LRGLLKINAFLDIDNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNEN 247 (378) T ss_pred cccceeeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceeccCCceEEEccCChHHhhH-HHHHHHHHHHHHHhCCCHH Confidence 3223344444443 3445566666665554443 4788889998877765444455 3457888999998766432 Q ss_pred cccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEec--CC-CCcHHHHHHHHHHHHh Q lcl|NC_021302. 321 NLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFD--EI-GSRQDATAAALQMLVN 397 (484) Q Consensus 321 t~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~--~~-~~~~~~~ae~~~~L~~ 397 (484) .. +|+++- +-.......-+.-.++.|+..||+.|+..--..-+-.....-.++|+ .. ..|.+++++++.++.+ T Consensus 248 ~l---~g~~~e-~~~~~f~~~tl~P~~~~ie~~l~~~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~ 323 (378) T protein:vir:94 248 IL---LGTATQ-EQQIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENIN 323 (378) T ss_pred Hh---cCCchH-HHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhhcccceeEeecchhhhcCHHHHHHHHHHHHh Confidence 22 233331 11123334556777888889998888754222111111112244554 32 3577899999999999 Q ss_pred cCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 398 AGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 398 ~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) .|+.. .+++|+.+|+|+-+.++....+.--..........+...+..+.....+ + T Consensus 324 ~G~~t-----~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~n--~ 378 (378) T protein:vir:94 324 GPIFT-----QNQLLVKMGEQPIEGGDVYIANLNAVAVKNLSDLQGNRKDVTSTDETNN--Q 378 (378) T ss_pred CCCcC-----HHHHHHHhCCCCCCCCCeeeecccccchhcchhcccccCCCCCCCCCCC--C Confidence 99764 4789999999876665543322100000000000000000000000000 0 No 114 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=99.10 E-value=2.2e-09 Score=68.09 Aligned_cols=322 Identities=12% Similarity=0.053 Sum_probs=171.1 Q ss_pred CCCCCCCccceeeeec-ccccch------------hhhhh-hcccccccccccccchHHHHHHHHhcchHHHHHHHHHHH Q lcl|NC_021302. 1 MAPKTVAPRTERGYVN-PLAGFG------------TFLAQ-GLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGL 66 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~-~~~~~~------------~~~~~-~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~ 66 (484) |.+......++..... ....+| +.+.. +........+. |-...-..++.+..+|.+++|..++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~ep--p~~~~~La~l~~~n~~h~~~i~~k~N 78 (348) T protein:vir:26 1 MTEQLIHSHTTDGTESKSVYSFDPNPEPVDTNSWMTRYCELFYNDFDDYWEP--PISLKGLAEIANANGYHGSLLKARAN 78 (348) T ss_pred CCccccchhhccccCCceEEEecCCCeeecCcchHHHHHHHHhcCCCccccC--CCCHHHHHHHHhhhhhhhhhHhhhhh Confidence 4432222222111000 011111 11111 11111111010 11122223555678999999998887 Q ss_pred HhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCeeeee Q lcl|NC_021302. 67 PIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGRFWLK 146 (484) Q Consensus 67 ~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~~~~~ 146 (484) -+.+ .+.|+..- ...++.+.+++.+.+|.+.+|++....+ .+. T Consensus 79 ~l~~---~~~Pn~~~-------------------------------t~~~f~~~~~d~ll~Gnay~~~~rn~~G---~~~ 121 (348) T protein:vir:26 79 YVAG---RFMNGGGL-------------------------------PMYKMNSACWDYFGLGMSAFVKIRSYLK---NVI 121 (348) T ss_pred HHhh---cccCCCCC-------------------------------CHHHHHHHHHHHHhcCCeEEEEEEcCCC---cEE Confidence 7765 34565321 1223334445667789999999754333 467 Q ss_pred eeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHH Q lcl|NC_021302. 147 RLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWK 226 (484) Q Consensus 147 ~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~ 226 (484) .|.+.|+.++.. ..++..... ...+....++++..++++...-.+..||.+.+..+..... T Consensus 122 ~L~~l~~~~v~~---~~d~~~~~~----------------~~~g~~~~f~~~dIiHir~~~~~~~~~Gls~~~~a~~si~ 182 (348) T protein:vir:26 122 ALEPLPMVHMRK---RKNGDFVQL----------------LRNNEQKVFKAKDVIFIPQYDPQQQIYGLPDYLGSIQSSL 182 (348) T ss_pred EEEEecCceeEe---eecCcEEEE----------------EecCeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHH Confidence 888888876643 334443222 2233455677888777765443466789999888888777 Q ss_pred HHHHHHHHHHHHHHHhcCCcceEEe-cCCCCCCHHHHHHHHHHHHHHhcCCce--EEEc-----cCCceEEEecccCCch Q lcl|NC_021302. 227 LKDELIRIEAAAIRRHGIGVPYLKG-NEADSEDDDRMDELLEIASNYSGGESA--GLAL-----TAGEEAGILSPNGTPL 298 (484) Q Consensus 227 ~K~~~~~~w~~f~Er~~~G~P~~~g-k~~~~~~~~~~~~l~~~l~~~~~g~~a--~~vi-----p~~~~ie~~~~~~~~~ 298 (484) .-.....+-..|..- .++|-.|. ..++..++++++++.+++++..++.++ .+++ +.|+++.-++.+.... T Consensus 183 l~~~a~~~~~~~f~N--Ga~pg~Il~~~~~~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~Gi~~~pis~~~~d~ 260 (348) T protein:vir:26 183 LNRDATLFRRRYYLN--GAHMGFIFYATDPNLSEADEKALKEKIASSKGIGNFRSMFVNIPNGKEKGIQLIPVGDIATKD 260 (348) T ss_pred HHHHHHHHHHHHHhc--cCCCceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceeEEcCCCCccceeEEEccCChhHH Confidence 777777777777763 35674444 345668899999999999886533322 2333 3455555555454555 Q ss_pred hHHHHHHHHHHHHHHHHhhhh-hcc--cccccchhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccc Q lcl|NC_021302. 299 DPRRAIEYHDHQMALVALAHF-LNL--DGKGGSYALASVQADT-FVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAP 374 (484) Q Consensus 299 ~~~~li~~~d~~Isk~ilGqt-lt~--~~~gGs~A~~evh~~v-~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P 374 (484) .|.+.-+.-..+|+.+..-+. |.. +..+|+++-.+-...+ ...-+.-.++.|++.||+.+ .++. .. T Consensus 261 qf~e~k~~t~~dIa~af~VPp~llGi~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~l-------~~~~-~~-- 330 (348) T protein:vir:26 261 EFERIKNITAQDIFVGHRFPAGMGGMLPQQGANVPDPLKVSQVYDFYEVIPVCKRFMDAVNNDP-------EIPD-NL-- 330 (348) T ss_pred HHHHHHHhhHHHHHHHhCCCHHHccccCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhh-------CCCC-cc-- Confidence 688888888889999865443 221 2223444433333332 23445566667777777632 1222 22 Q ss_pred eEEecCCCCcHHHHHHHH Q lcl|NC_021302. 375 LLVFDEIGSRQDATAAAL 392 (484) Q Consensus 375 ~~~~~~~~~~~~~~ae~~ 392 (484) +|+|+-....+...+.++ T Consensus 331 ~~~fdl~~~~e~~~~~a~ 348 (348) T protein:vir:26 331 KLKFNLNPGVESANGSAV 348 (348) T ss_pred EEEEecCcccccchhhcC Confidence 455532111122222222 No 115 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=99.05 E-value=3.6e-09 Score=66.88 Aligned_cols=316 Identities=12% Similarity=0.054 Sum_probs=157.8 Q ss_pred CCCCCCCccc-----------ee--eeecccccch----------hhhhhhcccccccccccccchHHHHHHHHhcchHH Q lcl|NC_021302. 1 MAPKTVAPRT-----------ER--GYVNPLAGFG----------TFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARI 57 (484) Q Consensus 1 ~~~~~~~~~~-----------~~--~~~~~~~~~~----------~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v 57 (484) |+.+--.... +. +.+. ...+| ..-........+..+. |=...-..++.+..+|. T Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~f~fg~p~~v~~~~~~~~~~~~~~~~~~~~p--p~~~~~La~~~~~~~~h 102 (376) T protein:vir:10 26 MSKRRSRAPRTFAAAPNPSAGSAAPARAE-VFTFDDPTPVMNRAEILDYVECWSNGEWFEP--PVSFAGLAKSFRASTHH 102 (376) T ss_pred chhccCCCcccchhhhhHhhhccCcceeE-EEEcCCceeccCcchhhhhhhhhhcCceecC--CCCHHHHHHHHhhhHHh Confidence 2211110000 00 0000 01111 0000000000000000 11111122455667888 Q ss_pred HHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEe Q lcl|NC_021302. 58 ASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYF 137 (484) Q Consensus 58 ~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~ 137 (484) +++|..++.-+.+ .+.|+..- ...++-+.+++.+.+|.+.+|++.. T Consensus 103 ~s~l~~k~n~l~~---~~~Pnp~l-------------------------------T~~~f~~~v~d~ll~Gnay~~~~rn 148 (376) T protein:vir:10 103 SSALFFKANVLAS---TFRPHRWL-------------------------------SRHAFERWALDFLTFGNGYLERRRN 148 (376) T ss_pred hhhHHHHhHHHHh---ccCCCCCC-------------------------------CHHHHHHHHHHHHhcCCeEEEEEEC Confidence 8888877766655 24454221 1233334445667789999999865 Q ss_pred ecCCeeeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchh Q lcl|NC_021302. 138 YEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSL 217 (484) Q Consensus 138 ~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gl 217 (484) ..+ .+..|.++++.++... .+.+ +... ....+....++....++++.....+..||.+. T Consensus 149 ~~G---~~~~L~pl~~~~vr~~-~d~~-~~~~----------------~~~~~~~~~~~~~eViHir~~~~~~~~yGls~ 207 (376) T protein:vir:10 149 MVG---GTLRLEPALAKYVRRK-ADFN-GFVY----------------VNGWQERHEFEPDSVFQLVRPDINQEVYGLPE 207 (376) T ss_pred CCC---CEEEEEEeCCcceEEE-eeCC-eEEE----------------EEcCCeEEEEccccEEEecCCCCCCCcccccH Confidence 433 4678999998877532 2222 2211 12233445677788777765544567899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEec-CCCCCCHHHHHHHHHHHHHHhcCCce--EEEc-c----CCceEE Q lcl|NC_021302. 218 LRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGN-EADSEDDDRMDELLEIASNYSGGESA--GLAL-T----AGEEAG 289 (484) Q Consensus 218 l~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk-~~~~~~~~~~~~l~~~l~~~~~g~~a--~~vi-p----~~~~ie 289 (484) +..+......-.....+-..|.+- .+.|-.+.. .+...++++++++.+++++..+..++ .+++ | .|+++. T Consensus 208 ~~~a~~si~l~~aa~~f~~~~f~N--Ga~pggIl~~~d~~l~~e~~~~lr~~~~~~~G~~N~~~~~vl~~~g~~~Gi~~~ 285 (376) T protein:vir:10 208 YLSSLHSAWLNESSTLFRRKYYEN--GSHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLI 285 (376) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhc--cCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEE Confidence 888887777766666676777763 356744443 34567899999999999886432221 2333 3 455555 Q ss_pred EecccCCchhHHHHHHHHHHHHHHHHhhhhhcc---cccccchhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021302. 290 ILSPNGTPLDPRRAIEYHDHQMALVALAHFLNL---DGKGGSYALASVQADTFV-QSVQTVADEIRDVAQAHVVEDIVDV 365 (484) Q Consensus 290 ~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~---~~~gGs~A~~evh~~v~~-~~~~aD~~~i~~~ln~qli~~l~~~ 365 (484) -++.+.....|.+.-+.-..+|+.+..-...-. +..+|+++-.+-...++. .-+.--++.|++ +|+.| T Consensus 286 pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~eq~~~~f~~~~L~Pl~~~iee-ln~~L------- 357 (376) T protein:vir:10 286 PVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWL------- 357 (376) T ss_pred EccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhc------- Confidence 555555556799999999999999965543222 222344443333222222 222333333332 33222 Q ss_pred CCCCccccceEEecCCCCcHHHHHHHHHHH Q lcl|NC_021302. 366 NWGEDEPAPLLVFDEIGSRQDATAAALQML 395 (484) Q Consensus 366 Nf~~~~~~P~~~~~~~~~~~~~~ae~~~~L 395 (484) +. . .++|+... +...+.+- T Consensus 358 ---~~-~--~~~F~~~~-----Llr~d~ka 376 (376) T protein:vir:10 358 ---GE-E--VVRFDDYE-----IPPAPVAA 376 (376) T ss_pred ---cc-c--ccccChhH-----hhcccccC Confidence 11 1 23443211 11111110 No 116 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=99.03 E-value=4.1e-09 Score=66.57 Aligned_cols=310 Identities=14% Similarity=0.076 Sum_probs=165.5 Q ss_pred CCCCCCCccceeeee-cccccchh-------h---hhhh-ccc-ccccccccccchHHHHHHHHhcchHHHHHHHHHHHH Q lcl|NC_021302. 1 MAPKTVAPRTERGYV-NPLAGFGT-------F---LAQG-LDQ-FEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLP 67 (484) Q Consensus 1 ~~~~~~~~~~~~~~~-~~~~~~~~-------~---~~~~-~~~-~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~ 67 (484) |+.+...+.+....- .....+|. + -... +.. ..+-.+. |=.+.-..++.+..+|.+++|..|... T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~p--P~~~~~La~l~~~~~~h~~~L~~k~N~ 78 (337) T protein:vir:78 1 MTKRQQQPAQAAASSPRPSVVFSMPEAIDPTAWMTDYTGVFYNPYGEYYQP--PIDRKGLAKVARANAHHGAILMARRNM 78 (337) T ss_pred CCCcccCcccccccCceeEEEecCcccccCcchhHhhhhhhhccCcceecC--CCCHHHHHHHhhcchhhhhHHHhhhcc Confidence 776655554432100 00111110 0 0000 000 0000000 001111123445567777777777665 Q ss_pred hhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCeeeeee Q lcl|NC_021302. 68 IRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGRFWLKR 147 (484) Q Consensus 68 v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~~~~~~ 147 (484) +.+. +.|. +..|..++ ++-+.+|.+.+|+++...+ .+.. T Consensus 79 ~~~~---f~~~--------------------------------~~~~~~~~---~d~ll~GNay~~~~rn~~G---~~~~ 117 (337) T protein:vir:78 79 VAGR---FTNQ--------------------------------RATITAFV---HNYLQFGDGGLLKLRNSFG---QVVG 117 (337) T ss_pred cccc---CcCc--------------------------------HHHHHHHH---HHHHhhCCeEEEEEECCCC---cEEE Confidence 4431 1110 00133333 4566789999999875433 4678 Q ss_pred eeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHH Q lcl|NC_021302. 148 LAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKL 227 (484) Q Consensus 148 l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~ 227 (484) |.+.|+.++.+ ..++....+.. ......++++..++.+.....+..||.+.+..+...... T Consensus 118 L~pl~~~~v~~---~~d~~~~~~~~----------------~~~~~~~~~~eIiHik~~~~~~~~~Gls~~~~a~~si~l 178 (337) T protein:vir:78 118 LHPLSSVYLRR---REDGCFVYLQQ----------------GKPNLIYRPDDVIWLAQYDPEQQVYGMPDYLGGLQSALL 178 (337) T ss_pred EEEeCCceeEe---eeCCeEEEEEc----------------CCceEEECCccEEEECCCCCCCCcccccHHHHHHHHHHH Confidence 99999887643 34555433221 234456777777666644435667899998888888877 Q ss_pred HHHHHHHHHHHHHHhcCCcceEEec-CCCCCCHHHHHHHHHHHHHHhcCCceE---EEcc----CCceEEEecccCCchh Q lcl|NC_021302. 228 KDELIRIEAAAIRRHGIGVPYLKGN-EADSEDDDRMDELLEIASNYSGGESAG---LALT----AGEEAGILSPNGTPLD 299 (484) Q Consensus 228 K~~~~~~w~~f~Er~~~G~P~~~gk-~~~~~~~~~~~~l~~~l~~~~~g~~a~---~vip----~~~~ie~~~~~~~~~~ 299 (484) -.....+-..|..- .+.|-.+.. .+...++++.+++.+.+++..+..++. +..| .|+++.-++.+..... T Consensus 179 ~~aa~~~~~~~f~N--Ga~p~~il~~~~~~l~~e~~~~lk~~~~~~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~~~d~q 256 (337) T protein:vir:78 179 NQDATLFRRRYFLN--GAHMGFIFYATDPNMDDDTEEEMKEMIANSKGVGNFRSMFVNIPDGKPDGIKLIPVGDIATKDE 256 (337) T ss_pred HHHHHHHHHHHHhc--cCCCceeEEcCCCCCCHHHHHHHHHHHHHhcCcccccceEEEcCCCCccceeEEEcCCChhHHH Confidence 77777777777763 356644443 445678899999999998865432222 2333 4455544555555567 Q ss_pred HHHHHHHHHHHHHHHHhhhh-hcc---cccccchhhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccc Q lcl|NC_021302. 300 PRRAIEYHDHQMALVALAHF-LNL---DGKGGSYALASVQAD-TFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAP 374 (484) Q Consensus 300 ~~~li~~~d~~Isk~ilGqt-lt~---~~~gGs~A~~evh~~-v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P 374 (484) |.+.-++-..+|+.+..-.. |.. ++.+|+++-.+-... ....-+.-.++.|++.+|..+++.... T Consensus 257 fle~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~n~e~~~~~f~~~~L~P~~~~ie~~~n~~ll~~~~~---------- 326 (337) T protein:vir:78 257 FAAIKGITAQDVLTAHRYPPALAGIIPTNGGGGLGDPEKYDATYARNEVLPLCELVQDAINSAGLPRALW---------- 326 (337) T ss_pred HHHHHHHhHHHHHHHhCCCHHHcccccCCCcCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhcCChhhc---------- Confidence 88888889999999965543 211 223445543333333 334555666777777777655443211 Q ss_pred eEEecC-CCCcH Q lcl|NC_021302. 375 LLVFDE-IGSRQ 385 (484) Q Consensus 375 ~~~~~~-~~~~~ 385 (484) ++|+. ...-+ T Consensus 327 -~~f~~~~~~~~ 337 (337) T protein:vir:78 327 -VTFRETIGAAV 337 (337) T ss_pred -eeccccccccC Confidence 22221 11111 No 117 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.02 E-value=5.3e-09 Score=65.93 Aligned_cols=425 Identities=12% Similarity=-0.009 Sum_probs=200.9 Q ss_pred CCCCCCC--ccceeeeecccccchhhhhhhccccccccccc--ccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEe Q lcl|NC_021302. 1 MAPKTVA--PRTERGYVNPLAGFGTFLAQGLDQFEQVDELR--WPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIR 76 (484) Q Consensus 1 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr--~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~ 76 (484) .||.... +....+|-....+.+ + ..+.......+++ +.....--+++.+++++++++++.....|.+..+... T Consensus 9 ~a~~~~~~~~~~~~~y~aa~~~~~--~-~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~~vVG~Gi~p~ 85 (495) T protein:vir:10 9 QSLASGLLVPVGASAYEGASGGHR--W-QDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVAAAVGNGLTPR 85 (495) T ss_pred cccchhhhhHHHhhhhhccccCcc--c-CCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcCCCcccc Confidence 2232221 111112211111111 1 1111111111111 1223345668889999999999999999999988777 Q ss_pred cCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHH-HHhhcceeeeEEEee-cCCeeeeeeeeeeCcc Q lcl|NC_021302. 77 PNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALK-SLQFGHAVFEQTYFY-EGGRFWLKRLAPRPQS 154 (484) Q Consensus 77 p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~-a~~~G~s~~Eivw~~-~~g~~~~~~l~~r~~~ 154 (484) +...+++..+.+.+....+... -+..++.+|..+...++. .+.-|=+++=+.|.. .+|.-.+-+|..++|. T Consensus 86 ~~~~~~~~~~~ie~~w~~wa~~-------~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~lqliepd 158 (495) T protein:vir:10 86 WRMKEQELRQELQELWGDWVNE-------ADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQLQIIEPD 158 (495) T ss_pred cCCchHHHHHHHHHHHHHhhcC-------cccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccceEEEEechh Confidence 7666667777777776666432 134467789998887774 466677777666754 3454455567777776 Q ss_pred ceee----------------eeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhH Q lcl|NC_021302. 155 SIAY----------------WNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLL 218 (484) Q Consensus 155 ~~~~----------------~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll 218 (484) ++.- +.+|..|+.+...=.. ...+. ............||... |+|.+....+..-|.++| T Consensus 159 ~l~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~-~hpgd--~~~~~~~~~~~rvpA~~-vlH~f~~r~gQ~RGis~l 234 (495) T protein:vir:10 159 MLASDIPDETLPSGGYVKGGIRFSNGGKRKAYCFYR-NHPAE--SSLIGDPVDTVWIKAEH-VLHVTVLTVRSDAGAPWF 234 (495) T ss_pred hcCCCCCCCCCCCCCEEEeceEECCCCceEEEEEee-cCCCc--ccccccccceeeechhh-eEeccccCCCcccCcchh Confidence 6642 1222222222111000 00000 00011122335577665 556677778888898887 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCC-------HHHHHHHHHHHHHHhcCCceEEEccCCceEEEe Q lcl|NC_021302. 219 RPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSED-------DDRMDELLEIASNYSGGESAGLALTAGEEAGIL 291 (484) Q Consensus 219 ~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~-------~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~ 291 (484) .++...-.+..+.- ..++-. |-..-+... .+.+.+.. .++.+.--....++..| ....++.|.+|+++ T Consensus 235 a~i~~l~~l~~y~d-ael~~a-~i~A~~~~f-i~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG--~i~~L~pGe~i~~~ 309 (495) T protein:vir:10 235 QLLLRLNELDQYED-AELVRK-KTAALFAAF-IQEATADSTGGPTIGQPKRSKGGKRITGLNPG--TLQYLQPGQEVKFS 309 (495) T ss_pred HHHHHHHHhhHHHH-HHHHHH-HHhhhheee-eecCCCccccccccCccccccCcccceecCCc--eeeecCCCCeeeee Confidence 65443222221111 111111 101111111 11110000 00001111112344444 56678999999999 Q ss_pred cccCCchhHHHHHHHHHHHHHHHHhh--hhhcccccccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHh--- Q lcl|NC_021302. 292 SPNGTPLDPRRAIEYHDHQMALVALA--HFLNLDGKGGSYALASVQADTFVQSVQTVAD-EIRDVAQAHVVEDIVDV--- 365 (484) Q Consensus 292 ~~~~~~~~~~~li~~~d~~Isk~ilG--qtlt~~~~gGs~A~~evh~~v~~~~~~aD~~-~i~~~ln~qli~~l~~~--- 365 (484) +++..+..|..|.+..-+.|+..+.- +.||.|-.+.||+.+-.-...+...++.... ++...|.+-+.+++++. T Consensus 310 ~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l 389 (495) T protein:vir:10 310 NPADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVA 389 (495) T ss_pred CCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 98888889999999999999999533 3466665456777665555555555555543 45555544455544441 Q ss_pred C----CCC---cc-ccceEEec--CCC-CcHHHHHHHHHHHHhcCcccCCcc----------------cHHHHHHHhCCC Q lcl|NC_021302. 366 N----WGE---DE-PAPLLVFD--EIG-SRQDATAAALQMLVNAGLLTPDPR----------------LEAFLRDAAGLP 418 (484) Q Consensus 366 N----f~~---~~-~~P~~~~~--~~~-~~~~~~ae~~~~L~~~G~~~~~~~----------------~~~~i~e~~glp 418 (484) + .++ .. .+....|. ..+ -|..+-+++....++.|+.....+ .+....+++||+ T Consensus 390 ~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~ 469 (495) T protein:vir:10 390 SGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAERGYDMEELFDMISDANQLIDEYDLR 469 (495) T ss_pred cCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCC Confidence 1 111 00 01112221 111 244444455566667776432100 111224445554 Q ss_pred CCCCCcccccccCCCcCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 419 GPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 419 ~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) .+.+...... .+..+.+.+ +...++ + T Consensus 470 ~~~~p~~~~~---~~~~~~~~~-----~~~~~~-------e 495 (495) T protein:vir:10 470 LDSDPRYVNG---SGAEQKSVM-----EAALNN-------E 495 (495) T ss_pred CCCCCCcCCC---ccCCCCCCC-----CCCCCC-------C Confidence 3221110000 000000000 000000 0 No 118 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=99.01 E-value=2.9e-09 Score=67.36 Aligned_cols=314 Identities=11% Similarity=-0.010 Sum_probs=153.9 Q ss_pred CCCCCCCccceeeeeccccc-----chhhhhhhcccccccccccccc------------------hHHHHHHHHhcchHH Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAG-----FGTFLAQGLDQFEQVDELRWPN------------------SVYTYTRMCREEARI 57 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~lr~~~------------------~~~~y~~m~~~D~~v 57 (484) |+.+.-.+.+....+....- ........+.--++-+.+.+.. ...-..++.+.-+|- T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~y~~~~~~~~~~~pp~~~~~la~~~~~~~~h 80 (350) T protein:vir:11 1 MSKRRSHRRQQPVTVQSAQEGEFIPRQGGRAEAFTFGDPMPVLDGRGILDYLECWPNGRWYEPPLSMEGLAKSVGSSVYL 80 (350) T ss_pred CCccccCCCcCccccCCcchhhhccccccceEEEEeCCceeecCcchhhHHHHHhhcCccccCCCCHHHHHHHHhhhhhh Confidence 54443322221111100000 0000000000000000011100 000111223334555 Q ss_pred HHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEe Q lcl|NC_021302. 58 ASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYF 137 (484) Q Consensus 58 ~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~ 137 (484) +++|..++.-+.+ .+.|+..- ...++-+.+++-+.+|-+.+|++.. T Consensus 81 ~~~l~~k~n~l~~---~~~Pn~~~-------------------------------t~~~f~~~v~d~ll~Gnay~~~~rn 126 (350) T protein:vir:11 81 QSGLKFKRNMLAK---TFIPHRLL-------------------------------SRATFEQFSLDWLTFGSAYLEQPRS 126 (350) T ss_pred ccchhhhhhhhhh---cccCCCCC-------------------------------CHHHHHHHHHHHHhcCCeEEEEEEc Confidence 6666555444433 23343211 1222333345667789999999754 Q ss_pred ecCCeeeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchh Q lcl|NC_021302. 138 YEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSL 217 (484) Q Consensus 138 ~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gl 217 (484) .. | .+..|.+.++.++.. ..+++.. +.....+....+++...++++.....+..||.+. T Consensus 127 ~~-G--~~~~L~~l~~~~vr~---~~~~~~~---------------~~~~~~~~~~~~~~~eVihir~~~~~~~~yGls~ 185 (350) T protein:vir:11 127 RL-G--TRMPLQAPLAKYMRR---GTDLETF---------------YQVRSWKDEHEFEKGSVIQLREADINQEIYGVPE 185 (350) T ss_pred CC-C--CEEEEEEeCCceeEe---eecCCeE---------------EEEeeCCeEEEECcccEEEeCCCCCCCCcccccH Confidence 43 3 467888998887653 2333322 1122334456788888777765444566789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecC-CCCCCHHHHHHHHHHHHHHhcCCceE--EE-cc----CCceEE Q lcl|NC_021302. 218 LRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNE-ADSEDDDRMDELLEIASNYSGGESAG--LA-LT----AGEEAG 289 (484) Q Consensus 218 l~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~-~~~~~~~~~~~l~~~l~~~~~g~~a~--~v-ip----~~~~ie 289 (484) +..+......-.....+-..|... .+.|-.+... +...++++++++.+++++..+..+++ ++ .| .|+++. T Consensus 186 ~~~a~~si~l~~~a~~~~~~~f~N--Ga~~~gil~~~~~~ls~e~~~~l~~~~~~~~G~~N~~~~~v~~~~g~~~g~~~~ 263 (350) T protein:vir:11 186 WFCALQSALLNESATLFRRKYYNN--GSHAGFILYMTDAAQNEEDIDALRTALKTAKGPGNFRNLFVYAPNGKKEGIQLI 263 (350) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhc--cCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeeecCCCCccceEEE Confidence 998888887777777777777763 3566444443 45678999999999998864333322 23 33 345555 Q ss_pred EecccCCchhHHHHHHHHHHHHHHHHhhhhh-cc--cccccchhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021302. 290 ILSPNGTPLDPRRAIEYHDHQMALVALAHFL-NL--DGKGGSYALASVQADT-FVQSVQTVADEIRDVAQAHVVEDIVDV 365 (484) Q Consensus 290 ~~~~~~~~~~~~~li~~~d~~Isk~ilGqtl-t~--~~~gGs~A~~evh~~v-~~~~~~aD~~~i~~~ln~qli~~l~~~ 365 (484) -++.+.....|.+.-++-..+|+.+..-..- .. +..+|+++-.+-...+ ...-+.--++.+++ +|+.|.+.++ T Consensus 264 pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f~~~~L~P~~~~ie~-ln~~l~~~~~-- 340 (350) T protein:vir:11 264 PVSEVAAKDEFGSIKNISRDDQLAGLRVYPQLMGVVPQNAGGFGSISDAAAVWASLELAPMQTRLQQ-VNEMIGEEVV-- 340 (350) T ss_pred EcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcCCHHHHHHHHHHHHHHHHHHHHHH-HHhhcCcccc-- Confidence 4554545557999999999999999655432 21 1223444433332222 22233344444442 4443322222 Q ss_pred CCCCccccceEEecCCC-CcH Q lcl|NC_021302. 366 NWGEDEPAPLLVFDEIG-SRQ 385 (484) Q Consensus 366 Nf~~~~~~P~~~~~~~~-~~~ 385 (484) +|.+.. ..+ T Consensus 341 -----------~F~~~~~~~l 350 (350) T protein:vir:11 341 -----------RFAQFDAPGL 350 (350) T ss_pred -----------ccCcccccCC Confidence 222111 111 No 119 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=98.99 E-value=4.7e-09 Score=66.24 Aligned_cols=314 Identities=16% Similarity=0.087 Sum_probs=159.5 Q ss_pred CCCCCCCccceeeeecc----cccch-----hhhhhhccccccccccc---ccchHHHHHHHHhcchHHHHHHHHHHHHh Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNP----LAGFG-----TFLAQGLDQFEQVDELR---WPNSVYTYTRMCREEARIASVLRAIGLPI 68 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~----~~~~~-----~~~~~~~~~~~~~~~lr---~~~~~~~y~~m~~~D~~v~s~l~~r~~~v 68 (484) |+.+...+......-.+ ...+| +.....+...+-...-+ -|=...-..++.+..+|.+++|..++..+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~a~~~h~s~i~~k~n~l 80 (340) T protein:vir:98 1 MSKRKPRKAVAMTASAPQKMEAFTFGEPVPVLDKRDILDYVECISNGKWYEPPVSFSGLAKSLRSAVHHSSPIYVKRNVL 80 (340) T ss_pred CCCCCCCccccccccCccceeEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHHhccccchhhhhhhhHH Confidence 76655444321110000 01111 00000000111000000 01112223345566788888888877766 Q ss_pred hCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCeeeeeee Q lcl|NC_021302. 69 RRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGRFWLKRL 148 (484) Q Consensus 69 ~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~~~~~~l 148 (484) .+ .+.|+..- ...++-+-+++-+.+|.+.+|+++...+ .+..| T Consensus 81 ~~---~~~Pn~~l-------------------------------t~~~f~~~~~d~ll~Gnay~~~~rn~~G---~~~~L 123 (340) T protein:vir:98 81 AS---TYIPHPLL-------------------------------SRQDFSRFALDYLVFGNAFLEQRHSVTG---QLIKL 123 (340) T ss_pred hh---ccCCCCCC-------------------------------CHHHHHHHHHHHHhcCCeEEEEEECCCC---cEEEE Confidence 55 24554321 1122233344566789999999875433 35678 Q ss_pred eeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHH Q lcl|NC_021302. 149 APRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLK 228 (484) Q Consensus 149 ~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K 228 (484) .+.++.++.. ..+++... .....+....+++...++++...-.+..||.+.+..+......- T Consensus 124 ~pl~~~~vr~---~~~~~~~~---------------~~~~~~~~~~~~~~eViHir~~~~~~~~~Gls~~~~a~~si~l~ 185 (340) T protein:vir:98 124 LTSPAKYTRR---GVDDSVFW---------------FVENFTQPHEFAPDTVFHLLEPDINQEIYGLPEYLSALNSAWLN 185 (340) T ss_pred EEeCCceEEE---cccCcEEE---------------EEecCCeEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHH Confidence 8888876542 33444322 12223345567788877776543346689999998888877777 Q ss_pred HHHHHHHHHHHHHhcCCcceEEec-CCCCCCHHHHHHHHHHHHHHhcCCce--EEEc-----cCCceEEEecccCCchhH Q lcl|NC_021302. 229 DELIRIEAAAIRRHGIGVPYLKGN-EADSEDDDRMDELLEIASNYSGGESA--GLAL-----TAGEEAGILSPNGTPLDP 300 (484) Q Consensus 229 ~~~~~~w~~f~Er~~~G~P~~~gk-~~~~~~~~~~~~l~~~l~~~~~g~~a--~~vi-----p~~~~ie~~~~~~~~~~~ 300 (484) .....+-..|.+- .+.|-.+.. .++..++++++++.+++++..+..++ .+++ +.|+++.-++.+.....| T Consensus 186 ~aa~~~~~~~f~N--Ga~pg~il~~~~~~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~g~~~~pls~~~~d~qf 263 (340) T protein:vir:98 186 ESATLFRRKYYQN--GAHAGYIMYVTDPAQSATDVESLRDAMRNSKGLGNFKNLFFYSPNGKPDGIKIVPLSEVATKDDF 263 (340) T ss_pred HHHHHHHHHHHhc--cCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHHH Confidence 7777777777763 256744433 34567899999999999885322221 2333 345555555555555679 Q ss_pred HHHHHHHHHHHHHHHhhhhhcc---cccccchhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceE Q lcl|NC_021302. 301 RRAIEYHDHQMALVALAHFLNL---DGKGGSYALASVQADTFV-QSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLL 376 (484) Q Consensus 301 ~~li~~~d~~Isk~ilGqtlt~---~~~gGs~A~~evh~~v~~-~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~ 376 (484) .+.-+.-..+|+.+..-+.--. +..+|+++-.+-...++. .-+.--++.|++ +|+ +...+ .+ T Consensus 264 ~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f~~~~l~Pl~~~iee-~n~----------~L~~e---~~ 329 (340) T protein:vir:98 264 FNIKKASAADLMDAHRVPFQLMGGKPENIGSLGDVEKVAKVFVRNELSPLQDRFRE-VND----------WLGME---VI 329 (340) T ss_pred HHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHHHHHHHHHHHHHHHHHH-HHh----------ccccc---cc Confidence 9998999999999965543222 222344443222222221 111222333322 332 21111 13 Q ss_pred EecCCC--CcH Q lcl|NC_021302. 377 VFDEIG--SRQ 385 (484) Q Consensus 377 ~~~~~~--~~~ 385 (484) +|++.. ..+ T Consensus 330 rF~~~~l~~~d 340 (340) T protein:vir:98 330 RFKEYTLDNPE 340 (340) T ss_pred ccCccccccCC Confidence 343221 111 No 120 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=98.98 E-value=8.1e-09 Score=64.95 Aligned_cols=438 Identities=12% Similarity=0.100 Sum_probs=185.1 Q ss_pred CCC-----------------------------------CCCCcc-----ceeeeecccccchhhhhhhcccc--cccccc Q lcl|NC_021302. 1 MAP-----------------------------------KTVAPR-----TERGYVNPLAGFGTFLAQGLDQF--EQVDEL 38 (484) Q Consensus 1 ~~~-----------------------------------~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~--~~~~~l 38 (484) ..| ...... .-..++...+..+......+... ....-. T Consensus 53 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~ 132 (862) T protein:vir:99 53 PNPIIRSVKDFPFVEISDSVNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQ 132 (862) T ss_pred CCCCCCcccccccccccccccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccccccccccchhcccccccc Confidence 111 000000 00001111111111000000000 000000 Q ss_pred cccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC----CHHHHHHHHHHHHhhhccchhhhhHHHhhcCCC- Q lcl|NC_021302. 39 RWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA----RPEVVEHVAACLGLPVEGDESDKPTPRTRGRFS- 113 (484) Q Consensus 39 r~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~----~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~- 113 (484) .+ .++.++. |.++.+-+..++++...-.++..|+|.-.++ +++..+.+.+.+.. .. T Consensus 133 ~f-~gyql~a-lY~~~~larkiVd~pAeDatR~g~~I~~~~d~~e~~~e~~~~ie~~~~r-----------------L~v 193 (862) T protein:vir:99 133 GF-IGHQACA-LIAQHWLVDKACSLAGEDAIRNGWHLKSLGEGEEIDEESLEKFKAIDVE-----------------FKV 193 (862) T ss_pred Cc-ccHHHHH-HHHhCchhhhhhhhhhHHHhhCCceEeecCcccccCHHHHHHHHHHHHH-----------------hhH Confidence 11 1234443 3357899999999999999999999975443 23334444333321 12 Q ss_pred HHHHHHHHHHHHhhcceee-eEEEeecCCe-------------eeeeeeeeeCccceeeeeecCCCceeeeecc-ccccc Q lcl|NC_021302. 114 WDQHLRLALKSLQFGHAVF-EQTYFYEGGR-------------FWLKRLAPRPQSSIAYWNVDRDGGLISIQQW-PAGTF 178 (484) Q Consensus 114 ~~~~i~~~l~a~~~G~s~~-Eivw~~~~g~-------------~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~-~~~~~ 178 (484) +..+...+..+.+||-+++ -++ ..+++. ..++.|...+|.|+....+. ...+. ....+ T Consensus 194 ~~~l~eair~~RLyGga~ililv-~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v~------~~~~Dp~sp~y 266 (862) T protein:vir:99 194 KENLIEFNRFKNVFGIRVAIFVV-DSEDPDYYEKPFNPDGITPGSYRGISQIDPYWMMPMLTA------ESTADPSSQFF 266 (862) T ss_pred HHHHHHHHHhcccccceEEEEEe-cCcCchhhhcCcCcccccccceeEEEEechhhhcccccc------ccccccccccc Confidence 3444444455888985543 222 112211 12344555555544321110 00111 12233 Q ss_pred ccccceeccCCCCcccccccceEEEeecC------ccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEec Q lcl|NC_021302. 179 GGPGMVVMAPNSMGPAIPVEQLVVYTHDM------DPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGN 252 (484) Q Consensus 179 ~~~~~~~~~~~~~~~~lp~~k~l~~~~~~------~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk 252 (484) +.+..+.. .+..|-+.+++++.... ...+++|.|++..||....--......=...+.++ .+.++.-+ T Consensus 267 GkP~~y~I----~g~~IH~SRliif~g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka--~l~v~ktd 340 (862) T protein:vir:99 267 YEPEFWII----SGQKYHRSHLIIARGPQPADILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNK--RTTAIHTD 340 (862) T ss_pred CCceeeee----cCeeeccceeEEecCCCchhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHh--ccceeech Confidence 33333322 22345566666664432 34457899999999876654444444445556664 44444322 Q ss_pred CCCC-CCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhc-cc-ccccch Q lcl|NC_021302. 253 EADS-EDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLN-LD-GKGGSY 329 (484) Q Consensus 253 ~~~~-~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt-~~-~~gGs~ 329 (484) -... .+++.+..=.+.+..++ +..+.+++..+.+++.++.+-+ ....+++..-.+||-+.--...- .+ +..|-. T Consensus 341 ~l~~l~~ed~l~~r~~~~~~~r-dN~Gi~liD~eEe~e~ls~slS--GL~dll~~~~q~IAaas~IP~tiLfGqspaGln 417 (862) T protein:vir:99 341 TAKAIANEDKFIQRLMFWVRYR-DNHAVKVLGTDETMEQFDTSLA--DFDAVIMGQYQLVASIAKTPATKLLGTAPKGFN 417 (862) T ss_pred hHhhhccHHHHHHHHHHHHhcc-CcceeEEecCCCceeEEecccC--ChHHHHHHHHHHHHhhhCCCceeecccCccccc Confidence 1111 12222222223333333 3345788999999999887633 33456666666788774333211 12 224555 Q ss_pred hhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhCCCCccccceEEecCCC-CcH-------HHHHHHHHHHHhcCc Q lcl|NC_021302. 330 ALASVQADTFVQSVQTVAD-EIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIG-SRQ-------DATAAALQMLVNAGL 400 (484) Q Consensus 330 A~~evh~~v~~~~~~aD~~-~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~-~~~-------~~~ae~~~~L~~~G~ 400 (484) |.|+--..++-+.+++... .+...|++ |+. |+.+-++.... -.|+|.... .+. +..+++++++++.|+ T Consensus 418 ATGE~D~~nYyD~I~s~QE~~L~P~Ler-L~~-li~~~lg~~~d-~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~sGv 494 (862) T protein:vir:99 418 STGEFETISYHEELESIQEHVYMPFLQR-HYL-ISRLSLGIQHE-IDVVMEPVASMTAQQQADLNKTKAEGGKVLIDGGV 494 (862) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHH-HHH-HHHHhcCCCCc-ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCC Confidence 6666666778888887764 35555543 444 33333432222 266675432 222 335577888999997 Q ss_pred ccCCcccHHHHHHHh------CCCCCCCCcccc-c--------ccCCCcCCCccccCCCCccccccc--------ccccc Q lcl|NC_021302. 401 LTPDPRLEAFLRDAA------GLPGPDPDADDD-E--------STADTGQDEPETDEPALPNTSGTT--------STTNA 457 (484) Q Consensus 401 ~~~~~~~~~~i~e~~------glp~p~~~e~~~-~--------~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~ 457 (484) +. .+.+|+++ |++.-.+++... . .....+.+ ....+.....+++. ..... T Consensus 495 is-----pdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~~~~e~~g~a--~~~ap~de~~aga~~~~~e~d~~~~p~ 567 (862) T protein:vir:99 495 IS-----PDEERNRIRDDKRSGYNRLTKEDAEETPGASPENLAAYQKAGAA--QETASAKETQAGAAVTTAEGDQPNVQM 567 (862) T ss_pred CC-----HHHHHHHHHhcCCcCCCCCCcccccccCCCCcccccccccCCcc--cccccccccccccCCccccCCcccccc Confidence 54 45677654 332111111000 0 00000000 00000111111110 00000 Q ss_pred cccccccc----ccchHHHhcCcccCc--ccC----C Q lcl|NC_021302. 458 PQARKRPR----GRSPRDRRKTPDGAM--PLW----D 484 (484) Q Consensus 458 ~~~~~~~~----~~~~~~~~~~~~~~~--~~~----~ 484 (484) .+ ..++. ..+...+.+.|+-+. --| + T Consensus 568 ~~-~~~~g~~~~~t~~~~a~~p~~~~~~~~~~~~~~e 603 (862) T protein:vir:99 568 VP-SMKPGQMVGPEVGITAPMPEDDAPVAGVVAKLAE 603 (862) T ss_pred cC-CCCCCCccccccccccCCCccccccCcccccchh Confidence 00 00000 011122222221111 011 1 No 121 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=98.97 E-value=8.9e-09 Score=64.73 Aligned_cols=384 Identities=14% Similarity=0.079 Sum_probs=181.9 Q ss_pred ccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHH Q lcl|NC_021302. 8 PRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGARPEVVEH 87 (484) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~ 87 (484) -...-|+.+-.-+.... ..............++ .+-++.+-+..++++--.-.++.-|+|+ +++++. . T Consensus 1 ~~~~D~~~n~~~gg~~~-------~~~~~~~~~~~~~~l~-a~Y~~~~l~~~~Vd~~aed~~r~g~~i~--~~~~~~--~ 68 (422) T protein:vir:10 1 MVKTDSYANIFLGGSDG-------SEIYGSLQNQAPTILA-SLYADNALVRRIIDTIPETALAAGFHID--GIDDEP--A 68 (422) T ss_pred CccchhhHHHHcCCCCC-------ccccCcccccCHHHHH-HHHHhChhhHHHHhhhhHHHhcCCcccc--CCCHHH--H Confidence 11122233322111100 0000111111122233 3346789999999999988898889985 333321 1 Q ss_pred HHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCe--------eeeeeeeeeCccceeee Q lcl|NC_021302. 88 VAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGR--------FWLKRLAPRPQSSIAYW 159 (484) Q Consensus 88 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~--------~~~~~l~~r~~~~~~~~ 159 (484) +.+.+.. + + -|..+...+-.+.+||++++=+.=. +++. ..++.|...++.++.-. T Consensus 69 ~~~~~~~-------------l--~-~~~~l~~a~~~~rl~G~a~i~i~v~-d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~ 131 (422) T protein:vir:10 69 FWSRWDD-------------L--E-MTQNINDAWSWARLFGGAAIVAIVK-DNRALTSPVREGAELETVRVYDRTQVKVQ 131 (422) T ss_pred HHHHHHH-------------h--h-HHHHHHHHHHhhccccceEEEEEec-CCCCccccccccCceeeEEeeccccccch Confidence 2222111 1 1 1455566666799999997755421 1111 12345555555444311 Q ss_pred eecCCCceeeeecccccccccccceeccCCCC--cccccccceEEEeec------CccCccccchhHHH-HHHHHHHHHH Q lcl|NC_021302. 160 NVDRDGGLISIQQWPAGTFGGPGMVVMAPNSM--GPAIPVEQLVVYTHD------MDPGVWTGNSLLRP-AYKNWKLKDE 230 (484) Q Consensus 160 ~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~--~~~lp~~k~l~~~~~------~~~~~p~G~gll~~-~~~~~~~K~~ 230 (484) .++. --....++.+..+....... ...+-+.+++++... ....++||.|+|.. ||....--.. T Consensus 132 ~~~~--------dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~ 203 (422) T protein:vir:10 132 TREE--------NPRNARFGEPLTYRITTNESDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIKDYTN 203 (422) T ss_pred hccc--------CccccccCcceEEEEecCCCCcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHHHHHH Confidence 1111 11122344545544433322 245667777776432 24667789997754 7765554444 Q ss_pred HHHHHHHHHHHhcCCcceEEecC-----CCCCCHHH-HHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHH Q lcl|NC_021302. 231 LIRIEAAAIRRHGIGVPYLKGNE-----ADSEDDDR-MDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAI 304 (484) Q Consensus 231 ~~~~w~~f~Er~~~G~P~~~gk~-----~~~~~~~~-~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li 304 (484) ....-+..+.|+ .+.+..-+- +.+....+ +.++ +.+...++...+.+++..+.+++.++.+-++ ...++ T Consensus 204 ~~~~~~~l~~~~--~~~v~~~~~l~~~~~~~~~~~~~~~r~-~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsg--l~~~~ 278 (422) T protein:vir:10 204 CERLATQLLKRK--QQAVWKAKGLAELCDDSEGFGAARLRL-AQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFL 278 (422) T ss_pred HHHHHHHHHHHh--ccccccchhHHHhcCCccchHHHHHHH-HHHHHhcCCccceeEecCCcceEEEecccCC--hHHHH Confidence 444445656664 445443321 11111111 2222 2233333333455566777889988876443 56667 Q ss_pred HHHHHHHHHHHhhhhh--cccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCC Q lcl|NC_021302. 305 EYHDHQMALVALAHFL--NLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIG 382 (484) Q Consensus 305 ~~~d~~Isk~ilGqtl--t~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~ 382 (484) +..-.+||-+.--+.. ...+.+|-.|.|+--...+-+.+++.+.....-+.+.|++-++. + .+ -.|+|...- T Consensus 279 ~~~~~~iaaa~~IP~t~L~G~s~~Glnatgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~~i~~-s---~~--~~~~f~pL~ 352 (422) T protein:vir:10 279 DKKFDRIVALSGIHEIILKNKNVGGVSSSQNTALETFHKLVDRKRNAELLPILEFLIPFIVN-A---EE--WSVEFNPLA 352 (422) T ss_pred HHHHHHHHhhhCCCeeeeccCCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-c---CC--cEEEeCCCC Confidence 7777788877433321 12223444455666677888888887754333333446554432 2 21 245554321 Q ss_pred --------CcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCC-----CcccccccCCCcCCCccccCCCC Q lcl|NC_021302. 383 --------SRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDP-----DADDDESTADTGQDEPETDEPAL 445 (484) Q Consensus 383 --------~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~-----~e~~~~~~~~~~~~~~~~~~~~~ 445 (484) +..+..+++++++++.|+.. .+.+|+.+.=..+.. ..+.+... ....+.|...++.. T Consensus 353 ~~sekekaei~~~~a~a~~~~~~~g~i~-----~~e~r~~L~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~d 422 (422) T protein:vir:10 353 QESSKDKAEILEKNVNSIAALIAAGAMD-----IDEARDTLRTIAPEVKINDGSVETEVTI-SETSNDPLEVPTDD 422 (422) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhcCCCC-----HHHHHHHhhhhcccccCCCCCCccccch-hhcCCCCCCCCCCC Confidence 12245678889999999754 456776652111110 00000000 00001111111000 No 122 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=98.93 E-value=5.8e-09 Score=65.73 Aligned_cols=314 Identities=13% Similarity=0.002 Sum_probs=157.3 Q ss_pred CCCCCCCcccee--------eeecccccch----------hhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHH Q lcl|NC_021302. 1 MAPKTVAPRTER--------GYVNPLAGFG----------TFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLR 62 (484) Q Consensus 1 ~~~~~~~~~~~~--------~~~~~~~~~~----------~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~ 62 (484) |+.+--++.++. +.+.. ..+| ..-..++.....-.+.-. ....| .++.+.-+|.+++|. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~-~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~-~~~~l-a~~~~a~~~h~~~i~ 77 (344) T protein:vir:60 1 MSKKKGKTLQPAAKKMTASAPKMEA-FTFGEPVPVLDRRDILDYVECISNGRWYEPPI-SFTGL-AKSLRAAVHHSSPIY 77 (344) T ss_pred CCcccCCCCCchHHhhcCCcCcEEE-EEcCCceeecCCcchhHHHHhhhcCccccCCC-CHHHH-HHHHHhhhhhccchh Confidence 444422221110 00000 0111 000001100000000000 00111 233345667777777 Q ss_pred HHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCe Q lcl|NC_021302. 63 AIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGR 142 (484) Q Consensus 63 ~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~ 142 (484) .++.-+.+ .+.|+..-. +..|..+ +++-+.+|-+.+|++.... T Consensus 78 ~k~n~l~~---~~~Pn~~~t----------------------------~~~f~~~---~~d~ll~Gnay~~i~rn~~--- 120 (344) T protein:vir:60 78 VKRNILAS---TFIPHPWLS----------------------------QQDFSRF---VLDFLVFGNAFLEKRYSTT--- 120 (344) T ss_pred hhhhHHHh---hccCCCCCC----------------------------HHHHHHH---HHHHHhcCCeEEEEEECCC--- Confidence 66665554 244543211 1113333 3456678999999986433 Q ss_pred eeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHH Q lcl|NC_021302. 143 FWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAY 222 (484) Q Consensus 143 ~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~ 222 (484) ..+..|.+.|+.++.+ ..+++.. +.....+....++++..++.+...-.+..||.+.+..+. T Consensus 121 G~~~~L~~l~~~~vr~---~~~~~~~---------------~~v~~~~~~~~~~~~eIiHir~~~~~~~~yGlsp~~~a~ 182 (344) T protein:vir:60 121 GKVIRLETSPAKYTRR---GVEEDVY---------------WWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSAL 182 (344) T ss_pred CcEEEEEEcCcceEEE---eecCCeE---------------EEEccCCeEEEEcCccEEEEcCCCCCCCcccccHHHHHH Confidence 3467888998887643 2222221 112223445567777777666444356679999999888 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCcceEEec-CCCCCCHHHHHHHHHHHHHHhcCCc---eEEEcc----CCceEEEeccc Q lcl|NC_021302. 223 KNWKLKDELIRIEAAAIRRHGIGVPYLKGN-EADSEDDDRMDELLEIASNYSGGES---AGLALT----AGEEAGILSPN 294 (484) Q Consensus 223 ~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk-~~~~~~~~~~~~l~~~l~~~~~g~~---a~~vip----~~~~ie~~~~~ 294 (484) .....-.....+-..|.+- .++|-.+.. .++..++++++++.+.+++..++.+ ..+.+| +|+++.-++.+ T Consensus 183 ~si~l~~~a~~~~~~~f~N--G~~pg~il~~~~~~ls~e~~~~ik~~~~~~~g~~~~r~~~l~~p~g~~~g~~~~pis~~ 260 (344) T protein:vir:60 183 NSAWLNESATLFRRKYYEN--GAHAGYIMYVTDAVQDRNDIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEV 260 (344) T ss_pred HHHHHHHHHHHHHHHHHhc--cCCCceEEEecCcCCCHHHHHHHHHHHHHhcCCCCCcceEEecCCCCccceeEEEcCCC Confidence 8877777777777777763 356744444 3466889999999999988653321 222233 45555555555 Q ss_pred CCchhHHHHHHHHHHHHHHHHhhhhhcc---cccccchhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc Q lcl|NC_021302. 295 GTPLDPRRAIEYHDHQMALVALAHFLNL---DGKGGSYALASVQADTF-VQSVQTVADEIRDVAQAHVVEDIVDVNWGED 370 (484) Q Consensus 295 ~~~~~~~~li~~~d~~Isk~ilGqtlt~---~~~gGs~A~~evh~~v~-~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~ 370 (484) .....|.+.-++-..+|+.+..-..-.. +..+|+++-.+-...++ ..-+.-.++.+++ ||+ || .. T Consensus 261 ~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~~f~~~~L~Pl~~~~e~-ln~----~l------g~ 329 (344) T protein:vir:60 261 ATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVRNELIPLQDRIRE-ING----WL------GQ 329 (344) T ss_pred hhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHHHHHHHHHHHHHHHHHH-HHH----hc------CC Confidence 5555688999999999999975543222 22234454333222222 1222222233321 222 21 11 Q ss_pred cccceEEecCCCCcHH Q lcl|NC_021302. 371 EPAPLLVFDEIGSRQD 386 (484) Q Consensus 371 ~~~P~~~~~~~~~~~~ 386 (484) ...+|.....+.++. T Consensus 330 -~~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 330 -EVIRFKNYSLDTDNG 344 (344) T ss_pred -cccccCccccCCCCC Confidence 222444434444443 No 123 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=98.91 E-value=1.6e-08 Score=63.29 Aligned_cols=317 Identities=14% Similarity=0.074 Sum_probs=156.2 Q ss_pred CCCCCCCccc---------------eeeee----ccc---ccchhhhhhhcccccccccccccchHHHHHHHHhcchHHH Q lcl|NC_021302. 1 MAPKTVAPRT---------------ERGYV----NPL---AGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIA 58 (484) Q Consensus 1 ~~~~~~~~~~---------------~~~~~----~~~---~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~ 58 (484) ||.+-..+.. ....+ +|. ...+......+.......+. |-...--.++.+..+|.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~p--p~~~~~la~~~~~~~~h~ 78 (351) T protein:vir:79 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEP--PVSFAGLAKSFRASTHHS 78 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhhcCceecC--CCCHHHHHHHHhhhHhhh Confidence 4422111100 00000 010 00000000010001000000 111222224446688889 Q ss_pred HHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEee Q lcl|NC_021302. 59 SVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFY 138 (484) Q Consensus 59 s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~ 138 (484) ++|..++..+.+ .+.|+..- ...++-.-+++-+.+|-+.+|++... T Consensus 79 ~~l~~k~n~l~~---~~~Pnp~~-------------------------------t~~~f~~~v~d~ll~Gnay~~~~r~~ 124 (351) T protein:vir:79 79 SALFFKANVLAS---TFRPHRWL-------------------------------SRHAFERWALDFLTFGNGYLERRRNM 124 (351) T ss_pred hhhhhhhhHHhh---cccCCCCC-------------------------------CHHHHHHHHHHHHhcCCeEEEEEECC Confidence 998877776655 24454321 12223333456677999999998654 Q ss_pred cCCeeeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhH Q lcl|NC_021302. 139 EGGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLL 218 (484) Q Consensus 139 ~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll 218 (484) .+ .+..|.+.++.++... .+.+ +.. .....+....+++...++.+.....+..||.+.+ T Consensus 125 ~G---~~~~L~~l~~~~v~~~-~~~~-~~~----------------~~~~~g~~~~~~~~eIihir~~~~~~~~yGl~~~ 183 (351) T protein:vir:79 125 VG---GTLRLEPALAKYVRRK-ADFS-GFV----------------YVNGWQERHEFEPDSVFQLVRPDINQEVYGLPEY 183 (351) T ss_pred CC---CEEEEEEeCCcceeee-ecCC-eEE----------------EEecCceEEEEcCccEEEeCCCCCCCCcccccHH Confidence 33 4678889998877532 2222 111 1112333455777777666554445678899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEec-CCCCCCHHHHHHHHHHHHHHhcCCce--EEE-cc----CCceEEE Q lcl|NC_021302. 219 RPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGN-EADSEDDDRMDELLEIASNYSGGESA--GLA-LT----AGEEAGI 290 (484) Q Consensus 219 ~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk-~~~~~~~~~~~~l~~~l~~~~~g~~a--~~v-ip----~~~~ie~ 290 (484) ..+......-.....+-..|.+- .++|-.+.. .++..++++.+++.+.+++..+..++ .++ .| .|+++.- T Consensus 184 ~~a~~si~l~~~a~~~~~~~f~N--Ga~pg~il~~~~~~ls~e~~~~lk~~~~~~~G~~N~~~~~v~~~~g~~~gi~~~p 261 (351) T protein:vir:79 184 LSSLHSAWLNESSTLFRRKYYEN--GSHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIP 261 (351) T ss_pred HHHHHHHHHHHHHHHHHHHHHhc--cCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEE Confidence 88888887777777777777763 356644433 34567899999999999886432222 223 33 3445544 Q ss_pred ecccCCchhHHHHHHHHHHHHHHHHhhhhhcc---cccccchhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_021302. 291 LSPNGTPLDPRRAIEYHDHQMALVALAHFLNL---DGKGGSYALASVQADTFV-QSVQTVADEIRDVAQAHVVEDIVDVN 366 (484) Q Consensus 291 ~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~---~~~gGs~A~~evh~~v~~-~~~~aD~~~i~~~ln~qli~~l~~~N 366 (484) ++.+.....|.+.-++-..+|+.+..-..--. +..+|+++-.+-...++. .-+.--++.|++ +|. T Consensus 262 l~~~~~d~ef~e~k~~s~~eI~~a~~VPp~llGi~~~~t~~~~n~e~~~~~f~~~~l~Pl~~~ie~-ln~---------- 330 (351) T protein:vir:79 262 VSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LND---------- 330 (351) T ss_pred cCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHh---------- Confidence 45444555699999999999999965543221 222233433332222222 222222333322 222 Q ss_pred CCCccccceEEecCCCCcHHHHHHHHHHH Q lcl|NC_021302. 367 WGEDEPAPLLVFDEIGSRQDATAAALQML 395 (484) Q Consensus 367 f~~~~~~P~~~~~~~~~~~~~~ae~~~~L 395 (484) +-+.. .++|+..+ +...+.+- T Consensus 331 ~lg~~---~~~F~~~~-----llr~d~~a 351 (351) T protein:vir:79 331 WLGDE---VVTFDDYE-----IPPAPVAA 351 (351) T ss_pred hcCcc---eeeeChhh-----hccccccC Confidence 11111 34554321 11111000 No 124 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=98.88 E-value=2.2e-08 Score=62.61 Aligned_cols=439 Identities=13% Similarity=0.098 Sum_probs=194.4 Q ss_pred CCCCCC-------Ccccee-----------eeecccccchhhhhhhcccccccccc-------cccchHHHHHHHHhcch Q lcl|NC_021302. 1 MAPKTV-------APRTER-----------GYVNPLAGFGTFLAQGLDQFEQVDEL-------RWPNSVYTYTRMCREEA 55 (484) Q Consensus 1 ~~~~~~-------~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~l-------r~~~~~~~y~~m~~~D~ 55 (484) ..|..+ -...+. ++......+.. ++. ....... +.--+++++. |.++.+ T Consensus 48 ~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~----~~~-~~~~~~~~~~~~~~~~f~gyql~a-lY~~~~ 121 (765) T protein:vir:96 48 VEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAG----GQN-PYVVPTMLQDWYNSQGFIGYQACA-IISQHW 121 (765) T ss_pred cccccCCCCCCCCcccCcccceeccccccccccchHHHhhh----ccC-ccchhhHHHhhhcccCCccHHHHH-HHHhCc Confidence 111000 000000 01111110000 000 0000000 0111234443 445788 Q ss_pred HHHHHHHHHHHHhhCCCcEEecCCCC--HHHHHHHHHHHHhhhccchhhhhHHHhhcCCC-HHHHHHHHHHHHhhcceee Q lcl|NC_021302. 56 RIASVLRAIGLPIRRTDWRIRPNGAR--PEVVEHVAACLGLPVEGDESDKPTPRTRGRFS-WDQHLRLALKSLQFGHAVF 132 (484) Q Consensus 56 ~v~s~l~~r~~~v~~~~~~v~p~~~~--~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~i~~~l~a~~~G~s~~ 132 (484) -+..++++...-.++..|+|+..+++ ++..+++...+.. .. ++.+...+-.+.+||-+++ T Consensus 122 l~rkiVd~pAeDa~R~g~~I~~~~~e~~~~~~~~l~~~~~r-----------------l~v~~~l~ea~~~~RlyGga~i 184 (765) T protein:vir:96 122 LVDKACSMSGEDAARNGWELKSDGRKLSDEQSALIARRDME-----------------FRVKDNLVELNRFKNVFGVRIA 184 (765) T ss_pred hhhhhhhcchHHhhcCCceeecCccccCHHHHHHHHHHHHH-----------------hhHHHHHHHHHHHhhhceeeEE Confidence 99999999988888888999764432 3333333333221 12 4445555556999997654 Q ss_pred eEEEeecCCe-------------eeeeeeeeeCccceeeeeecCCCceeeeecc-cccccccccceeccCCCCccccccc Q lcl|NC_021302. 133 EQTYFYEGGR-------------FWLKRLAPRPQSSIAYWNVDRDGGLISIQQW-PAGTFGGPGMVVMAPNSMGPAIPVE 198 (484) Q Consensus 133 Eivw~~~~g~-------------~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~-~~~~~~~~~~~~~~~~~~~~~lp~~ 198 (484) =+.=.-+++. ..++.|...+|.|+.-..+ + ...+. ....++.+..+... +..|-+. T Consensus 185 ~i~i~~~D~~~l~~PL~~~~I~kg~~kgl~vldp~~~~~~~v---~---e~~~Dp~sp~fg~P~~y~i~----g~~IH~S 254 (765) T protein:vir:96 185 LFVVESDDPDYYEKPFNPDGIAPGSYKGISQIDPYWAMPQLT---A---ESTADPSAEHFYEPDFWIIS----GKKYHRS 254 (765) T ss_pred EEEecccCcchhhccccccccccceeeEEEEechhhcccccc---h---hccccccccccCcceeeeec----Cceeccc Confidence 3322211211 1223344444433321100 0 00011 12233333333322 2356667 Q ss_pred ceEEEeecC------ccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCC-CCCCHHHHHHHHHHHHH Q lcl|NC_021302. 199 QLVVYTHDM------DPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEA-DSEDDDRMDELLEIASN 271 (484) Q Consensus 199 k~l~~~~~~------~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~-~~~~~~~~~~l~~~l~~ 271 (484) ++|++.... ...+.+|.|++..||....--......=+..+.++ .+.++.-+.. ...+++.+..-.+.+.. T Consensus 255 Rli~~~g~~lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~--~~~v~k~~~~~~l~~~~~l~~r~~~~~~ 332 (765) T protein:vir:96 255 HLVVVRGPQPPDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSK--RTSTIHVDVEKAIANEDAFNARLAFWIA 332 (765) T ss_pred eEEEecCCCchhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHh--ccceeeechHhhhccHHHHHHHHHHHHH Confidence 777664332 34556799999999877655444554555666664 3444432221 11233444433444444 Q ss_pred HhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhh-hccc-ccccchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 272 YSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHF-LNLD-GKGGSYALASVQADTFVQSVQTVADE 349 (484) Q Consensus 272 ~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqt-lt~~-~~gGs~A~~evh~~v~~~~~~aD~~~ 349 (484) ++. ..+.+++..+.+++.++.+-+ ....+++..-++|+-+.--.. ...+ +-.|-.|.|+--...+-+.+++.+.. T Consensus 333 ~r~-n~g~~~id~ee~~e~~s~~ls--gl~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~D~~nYyD~I~s~Qe~ 409 (765) T protein:vir:96 333 NRD-NHGVKVIGIDETMEQFDTNLS--DFDSVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEHETISYHEELESIQEH 409 (765) T ss_pred hcC-CceeEEecCCcceeEEecccC--CHHHHHHHHHHHHHhhhCCCeeeeccCCcccccCcchHHHHHHHHHHHHHHHH Confidence 443 346788999999999887633 356667777777877743332 1122 22466677777778888999988866 Q ss_pred HHHHHHHHHHHHHHHhCCCCccccceEEecCCC-CcH-------HHHHHHHHHHHhcCcccCCcccHHHHHHHhCCC--- Q lcl|NC_021302. 350 IRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIG-SRQ-------DATAAALQMLVNAGLLTPDPRLEAFLRDAAGLP--- 418 (484) Q Consensus 350 i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~-~~~-------~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp--- 418 (484) ...-+.+.|++.|+.....+ +--.|+|...- .+. +..+++++++++.|++. .+.+|+++.-. T Consensus 410 ~l~p~le~L~~li~~s~~i~--~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis-----~dEvR~~L~~~~~~ 482 (765) T protein:vir:96 410 IFDPLLERHYLLLAKSESID--VQLEIVWNPVDSTTSQQQAELNNKKAATDEIYINSGVVS-----PDEVRERLRDDPRS 482 (765) T ss_pred HHHHHHHHHHHHHHHhcCCC--CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCC-----HHHHHHHHhccccC Confidence 55545556888776543221 12366675432 222 33567788899999753 46788876421 Q ss_pred ---CCCCCcccccccC--------------CCcC-CCccccCCCCccccccccccc-cccc--------------ccccc Q lcl|NC_021302. 419 ---GPDPDADDDESTA--------------DTGQ-DEPETDEPALPNTSGTTSTTN-APQA--------------RKRPR 465 (484) Q Consensus 419 ---~p~~~e~~~~~~~--------------~~~~-~~~~~~~~~~~~~~~~~~~~~-~~~~--------------~~~~~ 465 (484) ...+++....+.. .... .+.....+.++...+.+.+.. .+++ +..+. T Consensus 483 g~~~l~d~~~e~~~~~~pe~~~~~~~~~~~~~~~~~e~~~~~a~p~~~eg~~~~~~~~p~~~~p~~~~~~~~~g~~~~~p 562 (765) T protein:vir:96 483 GYNRLTDDQAETEPGMSPENLAELEKAGAQSAKAKGEAERAEAQAGAVEGAGDPVPAAPRGTKPLAKAAEEGAGEAATPP 562 (765) T ss_pred CCCCCCccccccccCCCccccccccCCCcccccccCccccccCCCCccCCCCcccccCCcccCCccccccccCccccCcc Confidence 1111110000000 0000 000000001111111100000 0000 00000 Q ss_pred cc-chHHHhcC----cc---cCc---ccCC Q lcl|NC_021302. 466 GR-SPRDRRKT----PD---GAM---PLWD 484 (484) Q Consensus 466 ~~-~~~~~~~~----~~---~~~---~~~~ 484 (484) .+ .+..++++ .+ .++ +-++ T Consensus 563 ~~~~p~~~~~~~~~~~~~~~~~~~~~a~~~ 592 (765) T protein:vir:96 563 SRPNPRAELRNLLSDLLSKLEALDDAQAPD 592 (765) T ss_pred ccccccccchhcccchhhhhhccccccccC Confidence 00 00001110 00 001 1111 No 125 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=98.87 E-value=2.3e-09 Score=67.95 Aligned_cols=243 Identities=14% Similarity=0.024 Sum_probs=146.4 Q ss_pred CC----CCCCCccceeeeecccccchhhhhhhcccccccccccc--cchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcE Q lcl|NC_021302. 1 MA----PKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRW--PNSVYTYTRMCREEARIASVLRAIGLPIRRTDWR 74 (484) Q Consensus 1 ~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~--~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~ 74 (484) |. +.......+ ...... .... ...+.+ ...+ ..+..+ +=+.|.+|+..+...|.+++|+ T Consensus 1 MglF~~~~~r~~~~~------~~~~~~-~~~~------~~~~~~~~~~~v-~~~~al-~~~~v~~~i~~ia~~iA~lp~~ 65 (251) T protein:vir:46 1 MGIFYKNEKRDLQYN------EDDLQM-MVQT------LPSFQGTKLRQY-KDIEAI-RHSDIFTAVMMIASDLARMPIR 65 (251) T ss_pred CCccccccccccCCC------ccchhh-hhhh------hccccCcCccee-chhhhh-ccHHHHHHHHHHHHhHhhCceE Confidence 11 100000000 000000 0000 000000 0111 122333 4678999999999999999999 Q ss_pred EecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCc Q lcl|NC_021302. 75 IRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQ 153 (484) Q Consensus 75 v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~ 153 (484) +...++.... .-+...|. .+.....++.++++.+. +.+.+|-+.++++.... | .+..|.+++| T Consensus 66 ~~~~~~~~~~-~~~~~ll~------------~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~-G--~~~~L~~i~~ 129 (251) T protein:vir:46 66 VTVNGQINYS-DRIVNLLN------------TRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKT-G--EPMNLTFRKT 129 (251) T ss_pred EeeCcccccc-chHHHHHh------------ccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCC-C--cEEEEEEECC Confidence 9764432211 01111111 11123345677887776 57999999999976433 3 4789999999 Q ss_pred cceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHH Q lcl|NC_021302. 154 SSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIR 233 (484) Q Consensus 154 ~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~ 233 (484) .++. ...+.+|.+.+..+... ....+....++++.+|++++.+ .+..+|.|.+..+....-......+ T Consensus 130 ~~v~-v~~~~~g~~~~~~~~~~----------~~~~g~~~~~~~~diiH~r~~~-~dg~~G~spi~~~~~~i~~~~~~~~ 197 (251) T protein:vir:46 130 SEIE-LKSDARGRLYYFHQRID----------SNGNNIERNVKFEDMLDIKFYS-LDGINGLSLLDTLSRTIESDNNGKD 197 (251) T ss_pred ceEE-EEECCCCcEEEEEEEec----------cCCcceeEEECCccEEEecCcC-CCCeeecCHHHHHHHHHHHHHHHHH Confidence 9885 45666777654433211 1122344678888988888765 4458999999999988888888888 Q ss_pred HHHHHHHHhcCCcceEEecCCCCC-CHHHHHHHHHHHHHHhcCCceEEEccCCceE Q lcl|NC_021302. 234 IEAAAIRRHGIGVPYLKGNEADSE-DDDRMDELLEIASNYSGGESAGLALTAGEEA 288 (484) Q Consensus 234 ~w~~f~Er~~~G~P~~~gk~~~~~-~~~~~~~l~~~l~~~~~g~~a~~vip~~~~i 288 (484) +...+... .+.|-.+.+++... ++++++++.+.+.++.+|.+.++.++-||+= T Consensus 198 ~~~~~f~n--g~~p~gil~~~~~l~~~e~~~~~~~~~~~~~~g~~n~g~~~~gm~~ 251 (251) T protein:vir:46 198 FLNNFLRN--GTHAGGILKMKGVLDNKKARDRAREEFPKVLVELNKLGKLSYSMNQ 251 (251) T ss_pred HHHHHHHc--cCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCcccccccccccCC Confidence 88888886 25676666776554 4566788888888887775445556666665 No 126 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=98.87 E-value=2.4e-08 Score=62.40 Aligned_cols=315 Identities=11% Similarity=0.049 Sum_probs=154.4 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccc-------------------hHHHHHHHHhcchHHHHHH Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPN-------------------SVYTYTRMCREEARIASVL 61 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~-------------------~~~~y~~m~~~D~~v~s~l 61 (484) |+.+-..+....-...+......+.+ | . .+ +.|.+.. ...-..++.+..+|-++++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~-~-~-p~--~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~~~~~h~~~i 75 (346) T protein:vir:10 1 MKKQLRKNLTQNDRLQPQAQTEIFSF-G-D-PI--PVLDRADILNYLECSAMYEKWYNPPMSFDGLAKSLRSSTHHESAI 75 (346) T ss_pred CCcccCCCCCcccccccccCeEEEec-C-C-cc--eecCchhHHHHHHHhhcCCceEecCCCHHHHHHHHHhhhhcchhh Confidence 66653332221111111111000000 0 0 11 1111110 0111112223344444444 Q ss_pred HHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCC Q lcl|NC_021302. 62 RAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGG 141 (484) Q Consensus 62 ~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g 141 (484) ..++..+..+- . .|+ ...+..++.+-+++-+.+|.+.+|+++...+ T Consensus 76 ~~k~n~l~~l~-~-~Pn-------------------------------~~~t~~~f~~~~~d~ll~Gnay~~i~r~~~G- 121 (346) T protein:vir:10 76 ITKANILLSTC-E-VDS-------------------------------RYLSRRDLSSFVKDYLVFGNAYFEVVRNRLG- 121 (346) T ss_pred hhhhhhHHHHH-h-CCC-------------------------------CCCCHHHHHHHHHHHHhcCCeEEEEEEcCCC- Confidence 44333322210 0 111 1112344555556778899999999875433 Q ss_pred eeeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHH Q lcl|NC_021302. 142 RFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPA 221 (484) Q Consensus 142 ~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~ 221 (484) .+..|.+.++.++.. ..+ ++++.... ...++....+++...++.+...-.+..||.+.+..+ T Consensus 122 --~~~~L~pl~~~~v~~-~~~-~~~~~~~~--------------~~~~g~~~~~~~~dIih~r~~~~~~~~~G~~~~~~a 183 (346) T protein:vir:10 122 --QVQRIESPLAKYVRK-GLE-AGQFYYVP--------------QRFDHQEHEFAKGSIYHLLEPDINQDIYGLPQYLSA 183 (346) T ss_pred --cEEEEEEecCCceEE-EEc-CCeEEEEE--------------EccCCeEEEEecccEEEecCCCCCCCeeeccHHHHH Confidence 356788999887753 222 22222211 122334566778887777655545678999999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCcceEEec-CCCCCCHHHHHHHHHHHHHHhcCCceE--EEccC-----CceEEEecc Q lcl|NC_021302. 222 YKNWKLKDELIRIEAAAIRRHGIGVPYLKGN-EADSEDDDRMDELLEIASNYSGGESAG--LALTA-----GEEAGILSP 293 (484) Q Consensus 222 ~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk-~~~~~~~~~~~~l~~~l~~~~~g~~a~--~vip~-----~~~ie~~~~ 293 (484) ......-.....+...|... .+.|-.+.. .+...++++++++.+.+++..+..+++ +++.. |+++.-++. T Consensus 184 ~~si~l~~~a~~~~~~~~~N--G~~~~~il~~~d~~l~~e~~~~i~~~~~~~~g~~n~~~~~vl~~~~~~~gi~~~pis~ 261 (346) T protein:vir:10 184 LQSAWLNESATLFRRKYFLN--GAHAGFVFYMSDASQKQEDVENIRQQLKQSKGVGNFKNLFVHAPNGKKDGIQIIPIAD 261 (346) T ss_pred HHHHHHHHHHHHHHHHHHhc--cCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceeEEecCC Confidence 88888888888888888874 255644443 345678999999999998775443322 34433 334433443 Q ss_pred cCCchhHHHHHHHHHHHHHHHHhhhhhcc---cccccchhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_021302. 294 NGTPLDPRRAIEYHDHQMALVALAHFLNL---DGKGGSYALASVQADTFV-QSVQTVADEIRDVAQAHVVEDIVDVNWGE 369 (484) Q Consensus 294 ~~~~~~~~~li~~~d~~Isk~ilGqtlt~---~~~gGs~A~~evh~~v~~-~~~~aD~~~i~~~ln~qli~~l~~~Nf~~ 369 (484) +..-..|.+.-++-..+|+.+..-+.--. +..+|+++-.+....++. .-+.-.++.|++ +|+.| . T Consensus 262 ~~~d~qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~~~~~f~~~~l~P~~~~iee-~n~~L----------~ 330 (346) T protein:vir:10 262 VSAKDEFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVADAAEVFFITEIEPLQERLKE-FNQWL----------G 330 (346) T ss_pred ChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhc----------c Confidence 44445688888899999999966543222 222344543333322222 222333333432 22211 1 Q ss_pred ccccceEEecCCCCcHHHHHHHHH Q lcl|NC_021302. 370 DEPAPLLVFDEIGSRQDATAAALQ 393 (484) Q Consensus 370 ~~~~P~~~~~~~~~~~~~~ae~~~ 393 (484) .. .++|...+ +....+ T Consensus 331 ~e---~i~F~~~~-----ll~~~~ 346 (346) T protein:vir:10 331 QE---VIKFKPSK-----LLQRTQ 346 (346) T ss_pred cc---eeeechhh-----hcccCC Confidence 11 34453211 000000 No 127 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=98.85 E-value=2.7e-08 Score=62.04 Aligned_cols=323 Identities=11% Similarity=0.022 Sum_probs=167.5 Q ss_pred CCCCCCCccc----eeeeecccccchhh-hhhhccccc----ccccccc-cchHHHHHHHHhcchHHHHHHHHHHHHhhC Q lcl|NC_021302. 1 MAPKTVAPRT----ERGYVNPLAGFGTF-LAQGLDQFE----QVDELRW-PNSVYTYTRMCREEARIASVLRAIGLPIRR 70 (484) Q Consensus 1 ~~~~~~~~~~----~~~~~~~~~~~~~~-~~~~~~~~~----~~~~lr~-~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~ 70 (484) |.....++.. .....-...+++.- ....++..+ ...+.-. |-...--.++.+..+|-+++|.-++.-+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~epp~~~~~la~~~~~~~~h~~~i~~k~n~l~~ 80 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLSEITASPALDYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGILHSRANMVSA 80 (345) T ss_pred CCccccccchhhhcCCCceEEEeecCCcccchhhcccceeeecCCccccCCCCHHHHHHHhhcchhhcchhhhhhhHHhh Confidence 3222222210 00000000111100 000011100 0000000 111111224446789999999888777765 Q ss_pred CCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCeeeeeeeee Q lcl|NC_021302. 71 TDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGRFWLKRLAP 150 (484) Q Consensus 71 ~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~~~~~~l~~ 150 (484) .+.|+..- ...++.+-+++-+.+|.+.+|++....+ .+..|.+ T Consensus 81 ---~~~Pn~~~-------------------------------t~~~f~~~v~d~ll~Gnay~~i~rn~~G---~~~~L~p 123 (345) T protein:vir:37 81 ---TYEGGKAL-------------------------------SKMEMRALCLNLIQFGDVGLLKVRNGFG---QVVRLVP 123 (345) T ss_pred ---ccCCCCCC-------------------------------CHHHHHHHHHHHHhcCCeEEEEEECCCC---CEEEEEE Confidence 34565322 1233333445667789999999865433 3568888 Q ss_pred eCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHH Q lcl|NC_021302. 151 RPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDE 230 (484) Q Consensus 151 r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~ 230 (484) .++.++.+ ..+++.....+ .......+....+++...++++.....+..||.+-+..+......-.. T Consensus 124 l~~~~vr~---~~d~~~~~~~~----------~~~~~~~g~~~~~~~~eViHir~~~~~~~~~Gl~~~~~a~~si~l~~~ 190 (345) T protein:vir:37 124 LSSLYLRV---HKDGGYSYLMK----------KSLYDTAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSD 190 (345) T ss_pred ecCceeEE---eecCCeeEEEe----------eeeeccCceEEEEccccEEEEcCCCCCCCcccchHHHHHHHHHHHHHH Confidence 88877643 23333221111 011122334456778887777654445667899888887777777666 Q ss_pred HHHHHHHHHHHhcCCcceEEec-CCCCCCHHHHHHHHHHHHHHhcCCc---eEEEcc----CCceEEEecccCCchhHHH Q lcl|NC_021302. 231 LIRIEAAAIRRHGIGVPYLKGN-EADSEDDDRMDELLEIASNYSGGES---AGLALT----AGEEAGILSPNGTPLDPRR 302 (484) Q Consensus 231 ~~~~w~~f~Er~~~G~P~~~gk-~~~~~~~~~~~~l~~~l~~~~~g~~---a~~vip----~~~~ie~~~~~~~~~~~~~ 302 (484) ...+-..|..- .++|-.|.. .++..++++.+++.+.+++..++.+ ..+.+| +|+++.-++.+.....|.+ T Consensus 191 a~~~~~~~f~N--Ga~~~~Il~~t~~~l~~e~~~~lk~~~~~~~g~~n~~~~~i~~~~g~~~G~~~~pl~~~~~d~qf~e 268 (345) T protein:vir:37 191 ATVFRRRYFSN--GAHMGFILYSTDPDLTEEMEEEIARKISESKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGTKDEFAN 268 (345) T ss_pred HHHHHHHHHhc--cCCcceEEEeCCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceeEEEccCChhHHHHHH Confidence 66676777763 256744443 4456789999999999998765433 223344 3455555555555556888 Q ss_pred HHHHHHHHHHHHHhhhh-hcc--cccccchhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEe Q lcl|NC_021302. 303 AIEYHDHQMALVALAHF-LNL--DGKGGSYALASVQADTFV-QSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVF 378 (484) Q Consensus 303 li~~~d~~Isk~ilGqt-lt~--~~~gGs~A~~evh~~v~~-~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~ 378 (484) .-+.-..+|+.+..-.. |.. +..+|+++-.+-...++. .-+.--++.|++.+|+ +.+ ++. -..++| T Consensus 269 ~k~~~~~dI~~a~~VPp~liGi~~~~t~~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~-----~~e--~~~---~~~i~F 338 (345) T protein:vir:37 269 IKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYHYDEVMPLQEIIAETINQ-----DPE--IKN---LLKIKF 338 (345) T ss_pred HHHHhHHHHHHHhCCCHHHhccccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHhhh-----hhc--cCC---cceEEE Confidence 88889999999865543 221 222344544343333332 3345556666666664 111 111 236777 Q ss_pred cCCCCcHHH Q lcl|NC_021302. 379 DEIGSRQDA 387 (484) Q Consensus 379 ~~~~~~~~~ 387 (484) ++. ++.. T Consensus 339 ~~~--~l~k 345 (345) T protein:vir:37 339 REQ--NFAK 345 (345) T ss_pred Cch--hhcC Confidence 542 1111 No 128 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=98.85 E-value=9.4e-09 Score=64.59 Aligned_cols=314 Identities=15% Similarity=0.022 Sum_probs=156.4 Q ss_pred CCCCCCCccc--------eeeeecccccchh----------hhhhhcccccccccccccchHHHHHHHHhcchHHHHHHH Q lcl|NC_021302. 1 MAPKTVAPRT--------ERGYVNPLAGFGT----------FLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLR 62 (484) Q Consensus 1 ~~~~~~~~~~--------~~~~~~~~~~~~~----------~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~ 62 (484) |+.+-..+.. +.+.+.. ..+|. .-..+......-.++ |=...--.++.+..+|.+++|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~f~~p~~v~~~~~~~~~~~~~~~~~~~~p--p~~~~~la~~~~a~~~h~~~i~ 77 (344) T protein:vir:20 1 MSKKKGKTPQPAAKTMTASGPKMEA-FTFGEPVPVLDRRDILDYVECISNGRWYEP--PVSFTGLAKSLRAAVHHSSPIY 77 (344) T ss_pred CCcccCCCCcchhhhhhccCCceEE-EEcCCceEecCcchhhhhhhhhhcCceecC--CCCHHHHHHHHhhhhhhCccce Confidence 5554322211 0000000 01110 000000000000000 0011111234455677777776 Q ss_pred HHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCe Q lcl|NC_021302. 63 AIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGR 142 (484) Q Consensus 63 ~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~ 142 (484) .++.-+.+ .+.|+..- ...++-+-+++-+.+|-+.+|++.... T Consensus 78 ~k~n~l~~---~~~Pn~~l-------------------------------t~~~f~~~~~d~ll~Gnay~~i~rn~~--- 120 (344) T protein:vir:20 78 VKRNILAS---TFIPHPWL-------------------------------SQQDFSRFVLDFLVFGNAFLEKRYSTT--- 120 (344) T ss_pred ehhhhHHH---hccCCCCC-------------------------------CHHHHHHHHHHHHhcCCeEEEEEECCC--- Confidence 66555544 23444211 112232334466778999999976433 Q ss_pred eeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHH Q lcl|NC_021302. 143 FWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAY 222 (484) Q Consensus 143 ~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~ 222 (484) ..+..|.+.++.++.+ ..+++.. +.....+....+++...++.+...-.+..||.+.+..+. T Consensus 121 G~~~~L~pl~~~~vr~---~~~~~~~---------------~~~~~~~~~~~~~~~eIiHir~~~~~~~~yGls~~~~a~ 182 (344) T protein:vir:20 121 GKVIRLETSPAKYTRR---GVEEDVY---------------WWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSAL 182 (344) T ss_pred CcEEEEEEcCCceeEe---eecCCEE---------------EEEccCCeEEEEcCccEEEeCCCCCCCCcccccHHHHHH Confidence 3578899998877643 2223221 112223445567777777666544456789999998888 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCcceEEec-CCCCCCHHHHHHHHHHHHHHhcCCc---eEEEcc----CCceEEEeccc Q lcl|NC_021302. 223 KNWKLKDELIRIEAAAIRRHGIGVPYLKGN-EADSEDDDRMDELLEIASNYSGGES---AGLALT----AGEEAGILSPN 294 (484) Q Consensus 223 ~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk-~~~~~~~~~~~~l~~~l~~~~~g~~---a~~vip----~~~~ie~~~~~ 294 (484) .....-.....+-..|.+- .+.|-.+.. .+...++++++++.+.+++..++.+ ..+.+| .|+++.-++.+ T Consensus 183 ~si~l~~~a~~~~~~~f~N--Ga~p~~Il~~~d~~l~~e~~~~ik~~~~~~~g~~n~r~l~l~~p~g~~~gi~~~pis~~ 260 (344) T protein:vir:20 183 NSAWLNESATLFRRKYYEN--GAHAGYIMYVTDAVQDRNDIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEV 260 (344) T ss_pred HHHHHHHHHHHHHHHHHhc--cCCCceEEEecCcCCCHHHHHHHHHHHHHhcCCCCccceEEecCCCCccceeEEEcCCC Confidence 8777777777777777763 356744443 3466889999999999988643311 222333 35556555555 Q ss_pred CCchhHHHHHHHHHHHHHHHHhhhhhcc---cccccchhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhCCCCc Q lcl|NC_021302. 295 GTPLDPRRAIEYHDHQMALVALAHFLNL---DGKGGSYALASVQADTFV-QSVQTVADEIRDVAQAHVVEDIVDVNWGED 370 (484) Q Consensus 295 ~~~~~~~~li~~~d~~Isk~ilGqtlt~---~~~gGs~A~~evh~~v~~-~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~ 370 (484) .....|.+.-++-..+|+.+..-+.--. +..+|+++-.+-...++. .-+.--++.++ .+|. + -.. T Consensus 261 ~~d~qf~e~k~~s~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~~f~~~~l~P~~~~~e-~in~----~------lg~ 329 (344) T protein:vir:20 261 ATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVRNELIPLQDRIR-EING----W------LGQ 329 (344) T ss_pred hhHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHHHHHHHHHHHHHHHHH-HHHH----h------cCC Confidence 5555689999999999999975543222 222344543333333222 11222223332 1222 1 111 Q ss_pred cccceEEecCCCCcHH Q lcl|NC_021302. 371 EPAPLLVFDEIGSRQD 386 (484) Q Consensus 371 ~~~P~~~~~~~~~~~~ 386 (484) . .-+|.+...+.+++ T Consensus 330 ~-~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 330 E-VIRFKNYSLDTDND 344 (344) T ss_pred c-ccccCccccccCCC Confidence 1 12344333332222 No 129 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=98.83 E-value=3.2e-08 Score=61.68 Aligned_cols=317 Identities=10% Similarity=-0.000 Sum_probs=167.3 Q ss_pred CCCCCCCccceeeeecc----cccchh-------hhhh-hc----ccccccccccccchHHHHHHHHhcchHHHHHHHHH Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNP----LAGFGT-------FLAQ-GL----DQFEQVDELRWPNSVYTYTRMCREEARIASVLRAI 64 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~----~~~~~~-------~~~~-~~----~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r 64 (484) |-+...++........+ ..++|. .+.. +. +--|++.. ..--.++.+..+|-+++|..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~y~~~~~~~~~~~~epp~~------~~~la~l~~~~~~h~~~i~~k 74 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLNEISASPALDYVGIGFDENYNCYLPPVN------RHALAKLPHQNAQHGGILHSR 74 (345) T ss_pred CCCCccccchhhcccCcceeEEeecCCcccccchhhhhhhhcCCccccCCCCC------HHHHHHHhhcccccccceeee Confidence 33333222221111111 122221 1100 00 00111111 111124445678888888876 Q ss_pred HHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCeee Q lcl|NC_021302. 65 GLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGRFW 144 (484) Q Consensus 65 ~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~~~ 144 (484) +.-+.+ .+.|+..- ...++.+.+++.+.+|.+.+|++.... -. T Consensus 75 ~n~l~~---~~~Pn~~l-------------------------------t~~~f~~~~~d~ll~Gnay~~~~rn~~---G~ 117 (345) T protein:vir:37 75 ANMVSS---LYEGGKAL-------------------------------SRMDMRALCLNLIQFGDVGLLKVRNGF---GQ 117 (345) T ss_pred chHHHh---hccCCCCC-------------------------------CHHHHHHHHHHHHhcCCeEEEEEEcCC---Cc Confidence 665554 23454321 223334444567789999999986543 24 Q ss_pred eeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHH Q lcl|NC_021302. 145 LKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKN 224 (484) Q Consensus 145 ~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~ 224 (484) +..|.+.++.++.. . .+++.....+ .......+....+|+...++.+.....+..||.+.+..+... T Consensus 118 ~~~L~pl~~~~vr~-~--~d~~~~~~~~----------~~~~~~~g~~~~~~~~dVihir~~~~~~~~~Gls~~~~a~~s 184 (345) T protein:vir:37 118 VVRLVPLSSLYLRV-R--KDGGYSYLMK----------KSLYDTAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQS 184 (345) T ss_pred EEEEEEEcCceeEE-E--EeCCeeEEEE----------EeEecCCceEEEEccccEEEecCCCCCCCcccccHHHHHHHH Confidence 67888998887643 2 2333221111 011122334456778887766654445567999999988888 Q ss_pred HHHHHHHHHHHHHHHHHhcCCcceEEec-CCCCCCHHHHHHHHHHHHHHhcCCce---EEEc----cCCceEEEecccCC Q lcl|NC_021302. 225 WKLKDELIRIEAAAIRRHGIGVPYLKGN-EADSEDDDRMDELLEIASNYSGGESA---GLAL----TAGEEAGILSPNGT 296 (484) Q Consensus 225 ~~~K~~~~~~w~~f~Er~~~G~P~~~gk-~~~~~~~~~~~~l~~~l~~~~~g~~a---~~vi----p~~~~ie~~~~~~~ 296 (484) ...-.....+-..|.+- .+.|-.|.. .+...++++++++.+++++..+..++ .+.. +.|+++.-++.+.. T Consensus 185 i~l~~~a~~~~~~~f~N--G~~p~~Il~~~d~~l~~e~~~~lk~~~~~~~g~~n~~~~~i~~p~g~~~G~~~~pls~~~~ 262 (345) T protein:vir:37 185 ALLNSDATVFRRRYFSN--GAHMGFILYSTDPDLTEEMEEEIARKISESKGVGNFRSMFVNIANGHPDGLKVIPIGDTGT 262 (345) T ss_pred HHHHHHHHHHHHHHHhc--cCCcceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceEEEcCCCcccceEEEEccCChh Confidence 77777777777777763 256744443 34567889999999999886433222 2223 35666655555555 Q ss_pred chhHHHHHHHHHHHHHHHHhhhhhcc---cccccchhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccc Q lcl|NC_021302. 297 PLDPRRAIEYHDHQMALVALAHFLNL---DGKGGSYALASVQADT-FVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEP 372 (484) Q Consensus 297 ~~~~~~li~~~d~~Isk~ilGqtlt~---~~~gGs~A~~evh~~v-~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~ 372 (484) ...|.+.-++...+|+.+..-..--. +..+|+++-.+....+ ...-+.-.++.|++.+|+. .. ++.. T Consensus 263 d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~~~~~~~~e~~~~~f~~~~l~P~~~~ie~~ln~~-----~~--~~~~-- 333 (345) T protein:vir:37 263 KDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYHYDEVMPLQEIIAETINQD-----PE--IKNL-- 333 (345) T ss_pred HHHHHHHHHHhHHHHHHHhCCCHHHhCccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHhhhh-----cc--CCCc-- Confidence 56789999999999999965543221 2223445433333333 2334455666777777651 11 2221 Q ss_pred cceEEecCCCCcHHH Q lcl|NC_021302. 373 APLLVFDEIGSRQDA 387 (484) Q Consensus 373 ~P~~~~~~~~~~~~~ 387 (484) ..++|++. ++.+ T Consensus 334 -~~i~F~~~--~L~~ 345 (345) T protein:vir:37 334 -LKIKFREQ--NFAK 345 (345) T ss_pred -ceEEecch--hhcC Confidence 25667532 1211 No 130 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=98.83 E-value=1.3e-08 Score=63.75 Aligned_cols=313 Identities=14% Similarity=0.051 Sum_probs=158.6 Q ss_pred CCCCCCCccceeeeec-------ccccchh----------hhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHH Q lcl|NC_021302. 1 MAPKTVAPRTERGYVN-------PLAGFGT----------FLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRA 63 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~-------~~~~~~~----------~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~ 63 (484) |+.+...+..+..... ....+|. .-..+......-.+. |=...--.++.+..+|.+++|.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~p--p~~~~~la~~~~a~~~h~s~i~~ 78 (344) T protein:vir:56 1 MSKKKGKTPQPAAKTMTASAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEP--PVSFTGLAKSLRAAVHHSSPIYV 78 (344) T ss_pred CCCCCCCCCchhhHHhhcCCCceEEEEcCCceeecCcchhhhHHHhhhcCccccC--CCCHHHHHHHHhhhhhhCcccee Confidence 6555433221110000 0001110 000000000000000 00111122344556777777776 Q ss_pred HHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCee Q lcl|NC_021302. 64 IGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGRF 143 (484) Q Consensus 64 r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~~ 143 (484) ++.-+.+ .+.|+.-- ...++-+-+++-+.+|.+.+|++.... - T Consensus 79 k~n~l~~---~~~Pnp~~-------------------------------t~~~f~~~~~d~ll~Gnay~~~~rn~~---G 121 (344) T protein:vir:56 79 KRNILAS---TFIPHPWL-------------------------------SQQDFSRFVLDFLVFGNAFLEKRYSTT---G 121 (344) T ss_pred hhhhHHh---hcCCCCCC-------------------------------CHHHHHHHHHHHHhcCCeEEEEEECCC---C Confidence 6655544 23444211 122232334566778999999986433 3 Q ss_pred eeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHH Q lcl|NC_021302. 144 WLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYK 223 (484) Q Consensus 144 ~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~ 223 (484) .+..|.+.++.++.. ..+++... .....+....+++...++.+...-.+..||.+.+..+.. T Consensus 122 ~~~~L~pl~~~~v~~---~~~~~~~~---------------~~~~~g~~~~~~~~dIiHir~~~~~~~~~Gls~~~~a~~ 183 (344) T protein:vir:56 122 KVIRLETSPAKYTRR---GVEEDVYW---------------WVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALN 183 (344) T ss_pred cEEEEEEeCCceeEE---eecCCEEE---------------EEecCCeEEEEcCccEEEECCCCCCCCcccccHHHHHHH Confidence 467888998887643 23333221 122234455677777766664443456799999998888 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCcceEEec-CCCCCCHHHHHHHHHHHHHHhcCCc-eEEE--cc----CCceEEEecccC Q lcl|NC_021302. 224 NWKLKDELIRIEAAAIRRHGIGVPYLKGN-EADSEDDDRMDELLEIASNYSGGES-AGLA--LT----AGEEAGILSPNG 295 (484) Q Consensus 224 ~~~~K~~~~~~w~~f~Er~~~G~P~~~gk-~~~~~~~~~~~~l~~~l~~~~~g~~-a~~v--ip----~~~~ie~~~~~~ 295 (484) ....-.....+-..|.+- .++|-.+.. .++..++++++++.+.+++..++.+ -.++ +| +|+++.-++.+. T Consensus 184 si~l~~~a~~~~~~~f~N--Ga~pg~Il~~~d~~ls~e~~~~lk~~~~~~~g~~~~r~l~l~~p~g~~~G~~~~pis~~~ 261 (344) T protein:vir:56 184 SAWLNESATLFRRKYYEN--GAHAGYIMYVTDAVQDRNDIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVA 261 (344) T ss_pred HHHHHHHHHHHHHHHHhc--cCCCceEEEecCCCCCHHHHHHHHHHHHHhcCCCCccceEEecCCCCccceeEEEcCCCh Confidence 888777777777788773 356755443 3456788999999999988643311 1123 33 456665555555 Q ss_pred CchhHHHHHHHHHHHHHHHHhhhhhcc---cccccchhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc Q lcl|NC_021302. 296 TPLDPRRAIEYHDHQMALVALAHFLNL---DGKGGSYALASVQADTF-VQSVQTVADEIRDVAQAHVVEDIVDVNWGEDE 371 (484) Q Consensus 296 ~~~~~~~li~~~d~~Isk~ilGqtlt~---~~~gGs~A~~evh~~v~-~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~ 371 (484) ....|.+.-++-..+|+.+..-..--. +..+|+++-.+-...++ ..-+.--++.+++ +|+.|...+ T Consensus 262 ~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~~~~n~eq~~~~f~~~tL~Pl~~~ie~-~n~~l~~~~--------- 331 (344) T protein:vir:56 262 TKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVRNELIPLQDRIRE-INGWIGQEV--------- 331 (344) T ss_pred HHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHHHHHHHHHHHHHHHHHH-HHhhhcccc--------- Confidence 556689999999999999965543222 22234454333223222 2222233344432 443332222 Q ss_pred ccceEEecC--CCCcHH Q lcl|NC_021302. 372 PAPLLVFDE--IGSRQD 386 (484) Q Consensus 372 ~~P~~~~~~--~~~~~~ 386 (484) ++|++ .+.++. T Consensus 332 ----~~F~~y~l~~~~~ 344 (344) T protein:vir:56 332 ----IRFKNYSLDTDNG 344 (344) T ss_pred ----ccCCCccccccCC Confidence 22322 122222 No 131 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=98.80 E-value=4.3e-08 Score=60.99 Aligned_cols=316 Identities=14% Similarity=0.072 Sum_probs=155.2 Q ss_pred CCCCCCCccc---------------eeeee----ccc---ccchhhhhhhcccccccccccccchHHHHHHHHhcchHHH Q lcl|NC_021302. 1 MAPKTVAPRT---------------ERGYV----NPL---AGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIA 58 (484) Q Consensus 1 ~~~~~~~~~~---------------~~~~~----~~~---~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~ 58 (484) ||.+-..+.. ....+ +|. ...+......+.......+. |-...--.++.+..+|.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~p--p~~~~~la~~~~~~~~h~ 78 (351) T protein:vir:78 1 MSKRRSRAPRTFAAAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEP--PVSFAGLAKSFRASTHHS 78 (351) T ss_pred CCCCCCCCCCCCCCCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhccCceecC--CCCHHHHHHHHhhhHhhh Confidence 4422111110 00000 010 00000000010011100000 111112224445678888 Q ss_pred HHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEee Q lcl|NC_021302. 59 SVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFY 138 (484) Q Consensus 59 s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~ 138 (484) ++|..++.-+.+ .+.|+..- ...++-+.+++.+.+|-+.+|++-.. T Consensus 79 ~~l~~k~n~l~~---~~~Pn~~~-------------------------------t~~~f~~~~~d~ll~Gnay~~~~rn~ 124 (351) T protein:vir:78 79 SALFFKANVLAS---TFRPHRWL-------------------------------SRHAFERWALDFLTFGNGYLERRRNM 124 (351) T ss_pred hhhhhhhhHHhh---cccCCCCC-------------------------------CHHHHHHHHHHHHhcCCeEEEEEECC Confidence 888877766655 24454321 12233344456777899999987543 Q ss_pred cCCeeeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhH Q lcl|NC_021302. 139 EGGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLL 218 (484) Q Consensus 139 ~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll 218 (484) . -.+..|.+.++.++.. ..+.+ +... ....+....+++...++++.....+..||.+.+ T Consensus 125 ~---G~~~~L~pl~~~~v~~-~~~~~-~~~~----------------~~~~~~~~~~~~~eVihir~~~~~~~~yGl~~~ 183 (351) T protein:vir:78 125 V---GGTLRLEPALAKYVRR-KADFS-GFVY----------------VNGWQERHEFAPDSVFQLVRPDINQEVYGLPEY 183 (351) T ss_pred C---CCEEEEEEecCcceEE-eeeCC-eEEE----------------EecCCeEEEEccccEEEEcCCCCCCCcccccHH Confidence 2 2467888998887643 22222 1111 112334456777777766644435678999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEec-CCCCCCHHHHHHHHHHHHHHhcCCceE--EE-cc----CCceEEE Q lcl|NC_021302. 219 RPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGN-EADSEDDDRMDELLEIASNYSGGESAG--LA-LT----AGEEAGI 290 (484) Q Consensus 219 ~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk-~~~~~~~~~~~~l~~~l~~~~~g~~a~--~v-ip----~~~~ie~ 290 (484) ..+......-.....+-..|..- .+.|-.+.. .+...++++++++.+.+++..+..+++ ++ .| .|+++.- T Consensus 184 ~~a~~si~l~~~a~~~~~~~f~N--Ga~pggIl~~~~~~ls~e~~~~lr~~~~~~~G~~N~~~~~v~~~~g~~~g~k~~p 261 (351) T protein:vir:78 184 LSSLHSAWLNESSTLFRRKYYEN--GSHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIP 261 (351) T ss_pred HHHHHHHHHHHHHHHHHHHHHhc--cCCCceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceeeecCCCCccceeEEE Confidence 88888777766666676777763 356644433 345678999999999998864333322 22 33 3444544 Q ss_pred ecccCCchhHHHHHHHHHHHHHHHHhhhhhcc---cccccchhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_021302. 291 LSPNGTPLDPRRAIEYHDHQMALVALAHFLNL---DGKGGSYALASVQADTFV-QSVQTVADEIRDVAQAHVVEDIVDVN 366 (484) Q Consensus 291 ~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~---~~~gGs~A~~evh~~v~~-~~~~aD~~~i~~~ln~qli~~l~~~N 366 (484) ++.+.....|.+.-++-..+|+.+..-..-.. +..+|+++-.+-...++. .-+.--++.|++ +|+ T Consensus 262 ls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f~~~~l~P~~~~iee-~n~---------- 330 (351) T protein:vir:78 262 VSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LND---------- 330 (351) T ss_pred cCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHh---------- Confidence 44444455699999999999999965543222 122233432222222221 222223333332 222 Q ss_pred CCCccccceEEecCCC-CcHHHHH Q lcl|NC_021302. 367 WGEDEPAPLLVFDEIG-SRQDATA 389 (484) Q Consensus 367 f~~~~~~P~~~~~~~~-~~~~~~a 389 (484) +.+.. .|+|+..+ ......| T Consensus 331 ~l~~~---~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 331 WLGDE---VVRFDDYEIPPAPVAA 351 (351) T ss_pred hcCcc---ceecChhhhccccccC Confidence 11111 35554321 0111111 No 132 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=98.68 E-value=1.2e-07 Score=58.57 Aligned_cols=392 Identities=13% Similarity=0.067 Sum_probs=177.8 Q ss_pred Ccc-ceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCCCHHHH Q lcl|NC_021302. 7 APR-TERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGARPEVV 85 (484) Q Consensus 7 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~ 85 (484) .+. ..-|+.+-.++. .. ....+. .+...++.++. +-++.+-+..++++--.-.++.-|+|+- ++++ T Consensus 1 ~~~~~~d~~~~~~~~~-~~------~~~~~~-~~~~~~~~l~a-~Y~~~~l~~~~Vd~~aed~~r~g~~i~g--~~~~-- 67 (427) T protein:vir:10 1 MKIVKHDGYNDIFNGG-AD------GSPKPF-FMSDASYHVGS-FYNDNATAKRIVDVIPEEMVTAGFKMSG--VKDE-- 67 (427) T ss_pred CCccccchHHHHhhcC-CC------CcccCc-cccCchHHHHH-HHHcCchhhhhhccchHHhhcCCccccC--ccHH-- Confidence 111 222333222110 00 001111 11123345553 3457888999999888888888888863 2221 Q ss_pred HHHHHHHHhhhccchhhhhHHHhhcCCC-HHHHHHHHHHHHhhcceeeeEEEeec-------CCeeeeeeeeeeCcccee Q lcl|NC_021302. 86 EHVAACLGLPVEGDESDKPTPRTRGRFS-WDQHLRLALKSLQFGHAVFEQTYFYE-------GGRFWLKRLAPRPQSSIA 157 (484) Q Consensus 86 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~i~~~l~a~~~G~s~~Eivw~~~-------~g~~~~~~l~~r~~~~~~ 157 (484) +.+...+. +.. |..+...+-.+.+||++++=+.=.-. .+.-.++.|...++.++. T Consensus 68 ~~~~~~~~-----------------~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~ 130 (427) T protein:vir:10 68 KEFKSLWD-----------------SYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAIT 130 (427) T ss_pred HHHHHHHH-----------------HhhHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhccc Confidence 22222211 112 44455555579999999885532110 112234455555554432 Q ss_pred eeeecCCCceeeeecccccccccccceeccCCC--CcccccccceEEEeec------CccCccccchhHH-HHHHHHHHH Q lcl|NC_021302. 158 YWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNS--MGPAIPVEQLVVYTHD------MDPGVWTGNSLLR-PAYKNWKLK 228 (484) Q Consensus 158 ~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~--~~~~lp~~k~l~~~~~------~~~~~p~G~gll~-~~~~~~~~K 228 (484) -...+. .-....++.+..+...... ....+.+.+++++... ....++||.|+|. .+|....-- T Consensus 131 ~~~~~~--------dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~ 202 (427) T protein:vir:10 131 VEKRVT--------NARSPRYGEPEIYKVSPGDNMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSLIDAICDY 202 (427) T ss_pred cccccc--------CccccccCcceEEEEecCCCCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHHHHHHHHH Confidence 111110 1122334444444443222 2356777787877532 2467788999775 566544433 Q ss_pred HHHHHHHHHHHHHhcCCcceEEecC-----CCCC-CHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHH Q lcl|NC_021302. 229 DELIRIEAAAIRRHGIGVPYLKGNE-----ADSE-DDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRR 302 (484) Q Consensus 229 ~~~~~~w~~f~Er~~~G~P~~~gk~-----~~~~-~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~ 302 (484) ......=+..+.|+ .+.++.-+- ..+. ....+.++.. +..+.+...+.+++..+.+++.++.+-++ ... T Consensus 203 ~~~~~~~~~l~~k~--~~~v~k~~~l~~~~~~~~~~~~~~~r~~~-~~~~~~~~~~~~l~~~~e~~e~~~~~lsg--l~~ 277 (427) T protein:vir:10 203 DYCESLATQILRRK--QQAVWKVKGLAEMCDDDDAQYAARLRLAQ-VDDNSGVGRAIGIDAETEEYDVLNSDISG--VPE 277 (427) T ss_pred HHHHHHHHHHHHHh--ccccccchhHHHHhcCccchHHHHHHHHH-HHHhcCcccceeeecCCCceeEEecccCC--hHH Confidence 33333334555554 344433221 0111 1122222222 22233323355666677889888765433 455 Q ss_pred HHHHHHHHHHHHHhhhhh--cccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC Q lcl|NC_021302. 303 AIEYHDHQMALVALAHFL--NLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE 380 (484) Q Consensus 303 li~~~d~~Isk~ilGqtl--t~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~ 380 (484) +++..-.+||-+.--+.- ...+.+|-.|.|+--...+-+.+++.+.....-+.+.|++-++ ++ .+ -.|+|.. T Consensus 278 ~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~D~~nyyd~i~~~Qe~~l~p~l~~l~~~i~-~s---~~--~~~~f~p 351 (427) T protein:vir:10 278 FLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV-DE---EE--WSIEFEP 351 (427) T ss_pred HHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-cC---CC--cEEEeCC Confidence 677777788877433321 1223344446666667778888888765543333344555433 22 11 2555643 Q ss_pred CC-C-------cHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCccc-ccccCCCcCCCccccCCCCcccccc Q lcl|NC_021302. 381 IG-S-------RQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADD-DESTADTGQDEPETDEPALPNTSGT 451 (484) Q Consensus 381 ~~-~-------~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (484) .- . ..+..+++++++++.|+..+. ...++++...+...-.+.... +..+... .++++ +++.... T Consensus 352 L~~~s~kEkaei~~~~a~a~~~~~~~gvi~~~-e~r~~L~~~~~~~~~~~~~~~~~e~~~~~--~e~~p----~~~e~~~ 424 (427) T protein:vir:10 352 LSVPSKKEESEITKNNVESVTKAITEQIIDLE-EARDTLRSIAPEFKLKDGNNINIREPEET--TEPEP----GLGEKLE 424 (427) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHH-HHHHHHHhhhccccCCCCccccccccchh--cCCCC----CCCCCCC Confidence 21 1 124567889999999976542 112233322122111111110 0000000 00000 0000000 Q ss_pred ccc Q lcl|NC_021302. 452 TST 454 (484) Q Consensus 452 ~~~ 454 (484) ... T Consensus 425 d~~ 427 (427) T protein:vir:10 425 DEN 427 (427) T ss_pred CCC Confidence 000 No 133 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=98.65 E-value=1.5e-07 Score=57.98 Aligned_cols=387 Identities=12% Similarity=0.016 Sum_probs=163.9 Q ss_pred cccccchHHHHHHHHhc--chHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCH Q lcl|NC_021302. 37 ELRWPNSVYTYTRMCRE--EARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSW 114 (484) Q Consensus 37 ~lr~~~~~~~y~~m~~~--D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 114 (484) -|. ++.-.-|..+.+. -.+..-++.+....+.-..|+. .+. +.-+.+ .+....-+| T Consensus 1 ~l~-~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~~--~d~--~~~~~~-----------------~~i~~~N~~ 58 (434) T protein:vir:98 1 MLP-KNAEQAFLDFQRKARTNFCGLIANASVHRLLALGVTG--PDG--EPDTRA-----------------SRWWQANRL 58 (434) T ss_pred CCC-CCccHHHHHhhhhhhccchHHHHHHHHhhhccCceec--CCC--chHHHH-----------------HHHHHhcCh Confidence 221 1222334333211 1122223332222221112221 111 111111 122223357 Q ss_pred HHHHHHH-HHHHhhcceeeeEEEeecCCeee----eeeeeeeCccceeeeeecCC-Ccee-eeecccccccccc--cce- Q lcl|NC_021302. 115 DQHLRLA-LKSLQFGHAVFEQTYFYEGGRFW----LKRLAPRPQSSIAYWNVDRD-GGLI-SIQQWPAGTFGGP--GMV- 184 (484) Q Consensus 115 ~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~~----~~~l~~r~~~~~~~~~~~~d-g~l~-~~~q~~~~~~~~~--~~~- 184 (484) +.....+ .+|..||.| ++++|...++... -..|..++|+++. ..+|.. +++. .+..+.....+.. ..+ T Consensus 59 d~~~~~~~~~a~i~G~a-y~~v~~~~~~~~~~~~~~~~I~~~~p~~~~-~i~D~~~~~~~~ai~~~~~~~~~~~~~~~~~ 136 (434) T protein:vir:98 59 DSRQKLVWRMAMAQSAG-YMLVGAHPTRTEDNGRPSPLITMEHPSECI-VEYDPETGEPLVGLKVWHNDIDGFGYARVFF 136 (434) T ss_pred hHHHHHHHHHHhhcCce-EEEEecCCCcccccCCceeEEEEeccceeE-EEEeCCCCceEEEEEEEEeccCCceEEEEEE Confidence 7776665 589999966 6677764433211 1225556666542 122211 1111 1100000000000 000 Q ss_pred ------------------------ec---cCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 185 ------------------------VM---APNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAA 237 (484) Q Consensus 185 ------------------------~~---~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~ 237 (484) .. ......+++..--++.|.++...+. .|.|-+..+-...-.=...+...+. T Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~-~g~sd~e~vi~liDa~~~~~s~~~~ 215 (434) T protein:vir:98 137 DDTSFPYRTRERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGE-DPEPEFAGVLDIQDRVNLGILNRMA 215 (434) T ss_pred eCcEEEEEEeeccccccccccccceecccccccccCCCCccceEEeccCCCcCc-CCcchhhhHHHHHHHHHHHHHHHHH Confidence 00 0001112233334555666655433 5888888776655555667777888 Q ss_pred HHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccC-CchhHHHHHHHHHHHHHHHHh Q lcl|NC_021302. 238 AIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNG-TPLDPRRAIEYHDHQMALVAL 316 (484) Q Consensus 238 f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~-~~~~~~~li~~~d~~Isk~il 316 (484) ..|-|.++..++.|-......++.. .....-..+..+. .++...++.+.++.+... ....|...++.+-.+|+..-- T Consensus 216 ~~~~~a~p~~~i~G~~~~~~~~~~~-~~~~~~~~~~~~~-~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~ 293 (434) T protein:vir:98 216 ASRFSGFRQKWIKGHKFAKRTDPAT-GMTVVDQPFVPSP-SAVWASEGENTQFGQLDATDLSGFLKEHASDVRDMLTISQ 293 (434) T ss_pred HHHHhcchhhhhcCCCccccccccc-ccchhhhhhhccc-cccccCCCCCceEEEecCcchHHHHHHHHHHHHHHhcccC Confidence 8888877766776643222222111 1111111111121 233444556666666432 334577777777677765522 Q ss_pred hh--hhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC-ccccceEEecC-CCCcHHHHHHHH Q lcl|NC_021302. 317 AH--FLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGE-DEPAPLLVFDE-IGSRQDATAAAL 392 (484) Q Consensus 317 Gq--tlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~-~~~~P~~~~~~-~~~~~~~~ae~~ 392 (484) -. .+..+..+.|-..-..........++.-.+.+...+. ++++.++.++... ...-.+++|.. ......+.++++ T Consensus 294 ~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~-~~~rl~~~~~g~~~~~~~~~v~w~~~~~~s~~~~ada~ 372 (434) T protein:vir:98 294 TPTYLYATDLVNISADTIGALDILHVAKVREHIASFSEGLE-SVLALAAAQAGVPEDYTEAEVRWANPAHVTMAVKADAA 372 (434) T ss_pred CCHHHhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCCChhheeeeEEecCCCCCCHHHHHHHH Confidence 11 1111111122222233444455555555666777774 4777666665322 22223566754 345678899999 Q ss_pred HHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccC---CCcCCCccccCCCCccccccccccccccccccccc Q lcl|NC_021302. 393 QMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTA---DTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRG 466 (484) Q Consensus 393 ~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 466 (484) .+|+.+|+ +.+.+++.+|++..+-......... ......++...+.... .+......+| T Consensus 373 ~kl~~~g~------~~e~~~~~lg~~~~e~~r~~~e~~~~~~~~~~~~~~~~~~~~g~---------~~~~~~~~dg 434 (434) T protein:vir:98 373 TKLKSIGY------PLDVIAEELDESPARVRRIVAGAASQALLAASLLPAPGAPSAGN---------VPDSGGAVDG 434 (434) T ss_pred HHHHhcCC------cHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCC---------CCcccCCCCC Confidence 99998885 2467889988864321100000000 0000000000000000 0011111111 No 134 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=98.59 E-value=2.3e-07 Score=57.02 Aligned_cols=365 Identities=13% Similarity=-0.006 Sum_probs=163.9 Q ss_pred chhhhhhhcccccccccccccchHHHHHHHHhcchHH----HHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhh Q lcl|NC_021302. 21 FGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARI----ASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPV 96 (484) Q Consensus 21 ~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v----~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~ 96 (484) +....+..|...-.....|.....+.|+ ......++ -..+..+.+.+. +| ...+++.+++.+.... T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~-g~~~~~~~~~~~p~~~~~~~~~v~--nw-------~~~iVds~a~rl~~~G 70 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHKRRAEMRYDQYA-MKYVDRFKGITIPQALSQQYRSIL--GW-------CAKGVDSLADRLVFRE 70 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHhc-ccCchhhcChhhhHHHHHHHhhhc--ch-------hHHHHHHhHhhcccCc Confidence 3332222222221111111111222332 21111121 122222222222 33 2234444444332221 Q ss_pred ccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecC------------ Q lcl|NC_021302. 97 EGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDR------------ 163 (484) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~------------ 163 (484) .. ..+....+....-+|+....++. +|+.||.| +..||.-.+|.. .|.+++|++... .+|+ T Consensus 71 f~-~~d~~l~~i~~~N~ld~~~~~~~~~aliyG~s-f~~v~~~~dg~~---~i~~~sp~~~~~-i~D~~~~~~~~a~~~~ 144 (409) T protein:vir:94 71 FE-NDDFTVNEIFEENNPDIFFDSAVLSSLIASCS-FTYISKGENDAV---RLQVIEAVNATG-IIDPITGLLTEGYAVL 144 (409) T ss_pred cc-CCchHHHHHHHhcChhHHHHHHHHHHHHhcce-eEEEecCCCCce---EEEEeccceEEE-EEecCCCceeeeEEEE Confidence 11 11122333344446776666664 79999995 557787667753 344555554321 1222 Q ss_pred ----CCceeeeecccccccccccceeccCCCC----cccccccceEEEeecCccCccccchhH-HHHHHHHHHHHHHHHH Q lcl|NC_021302. 164 ----DGGLISIQQWPAGTFGGPGMVVMAPNSM----GPAIPVEQLVVYTHDMDPGVWTGNSLL-RPAYKNWKLKDELIRI 234 (484) Q Consensus 164 ----dg~l~~~~q~~~~~~~~~~~~~~~~~~~----~~~lp~~k~l~~~~~~~~~~p~G~gll-~~~~~~~~~K~~~~~~ 234 (484) .+..+....+..+... ......+. ..++...-++.|.++.+-+.|+|.|-+ +.+-...---+..+.. T Consensus 145 ~~d~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~ 220 (409) T protein:vir:94 145 ERDENNNVVLEAHFLPDRTD----YYYRDSRNNISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLER 220 (409) T ss_pred EecCCCceEEEEEEecCcEE----EEEecCceeEeeeCCCCCcceEEeccccccccccCccccchhHHHHHHHHHHHHHH Confidence 2221111111111000 00011111 122333347778888888899998865 3333222222334444 Q ss_pred HHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCC---ceEEEeccc-CCchhHHHHHHHHHHH Q lcl|NC_021302. 235 EAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAG---EEAGILSPN-GTPLDPRRAIEYHDHQ 310 (484) Q Consensus 235 w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~---~~ie~~~~~-~~~~~~~~li~~~d~~ 310 (484) -+.-.|-|.++..+++|-...+... +.+...+. ....+|++ ..+++-+.+ .+...|...++.+-++ T Consensus 221 ~~~~~e~~a~pqr~i~G~d~d~~~~---~~~~~~~~-------~i~~~~~d~dg~~~~v~q~~~~~l~~~~~~l~~~~~~ 290 (409) T protein:vir:94 221 ADVTAEFYSFPQKYVTGLSDDAEPM---ETWKATVS-------SMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAG 290 (409) T ss_pred HHHHHHHhcChhheeEecCCCCccc---chhhhhHH-------HhhcCCCCCCCCCceEEecCCCChhHHHHHHHHHHHH Confidence 4555677777777888754332222 22222222 23345533 334554433 2334566666666666 Q ss_pred HHHHHhhhhhcc-c--ccc-cc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-C-CCCcc---ccceEEecC Q lcl|NC_021302. 311 MALVALAHFLNL-D--GKG-GS-YALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDV-N-WGEDE---PAPLLVFDE 380 (484) Q Consensus 311 Isk~ilGqtlt~-~--~~g-Gs-~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~-N-f~~~~---~~P~~~~~~ 380 (484) +|-. .+=.... + +.+ +| -|+.. ..+-.....+.-.+.+...+. ++++.++.+ + .+... .-.+++|.+ T Consensus 291 ~a~~-t~lP~~~lg~~~~NpsSa~Al~a-~~~~L~~~a~~k~~~fg~~~~-~~~rla~~i~~~~~~~~~~~~~~~v~W~p 367 (409) T protein:vir:94 291 FAGE-TGLTLDDLGFVSDNPSSVEAIKA-SHENLRLAGRKAQRSLGAGLL-NVAYLAACLRDDAPYLREQFRKTKPKWEP 367 (409) T ss_pred Hhhh-cCCCHHHhccccCchhHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCCccccccccceEEecc Confidence 6643 2211111 1 111 12 22222 222333344445566777785 477765554 2 22111 113566763 Q ss_pred C-CCc---HHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCC Q lcl|NC_021302. 381 I-GSR---QDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPD 421 (484) Q Consensus 381 ~-~~~---~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~ 421 (484) . +.+ ....|+++.||+++|..+. ..+.+++.+|+..++ T Consensus 368 ~~~~~~~~~a~~aDa~~Kl~~ag~~~~---~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 368 LFEADASMLSLIGDGAIKLNQAIPEFI---NKDTIRDLTGIEGGE 409 (409) T ss_pred CCCcchHHHHHHHHHHHHHHHhccccc---chhHHHHHcCCCCCC Confidence 3 222 3567899999999996432 356899999997665 No 135 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=98.58 E-value=2.4e-07 Score=56.90 Aligned_cols=366 Identities=13% Similarity=0.005 Sum_probs=166.7 Q ss_pred chhhhhhhcccccccccccccchHHHHHHHHhcchH----HHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhh Q lcl|NC_021302. 21 FGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEAR----IASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPV 96 (484) Q Consensus 21 ~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~----v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~ 96 (484) +....+..|.........|.....+.|+ ....... +-..+..+.+.+. +| ...+++.+++.+.... T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~-g~~~~~~~~~~~p~~~~~~~~~v~--nw-------~~~iVds~a~rl~~~G 70 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHKRRAEMRYEQYA-MKHVDRFKGITIPQALSQQYRSIL--GW-------CAKGVDSLADRLVFRE 70 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHHh-ccCchhhcchhhhHHHHHHHhhhc--Ch-------hHHHHHHhHhhccccc Confidence 3332222222221111111112222332 2111111 2222322233232 33 2234444444332221 Q ss_pred ccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCeeeeeeeeeeCccceee---------------ee Q lcl|NC_021302. 97 EGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAY---------------WN 160 (484) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~---------------~~ 160 (484) .. ..+....+....-+|+....++ .+|+.||.| +..||.-.+|.. .|..++|++..- +. T Consensus 71 f~-~~d~~l~~i~~~N~ld~~~~~~~~~al~yG~s-f~~v~~~~dg~~---~i~~~sP~~~~~i~D~~~~~~~~a~~~~~ 145 (409) T protein:vir:16 71 FE-NDDFTVNEIFEENNPDIFFDSTVLSALIASCS-FTYISKGENDAV---RLQVIEATNATGIIDPITGLLTEGYAVLE 145 (409) T ss_pred cc-CcchHHHHHHHhcChhHHHHHHHHHHHHhCce-eEEEecCCCCce---EEEEEcccceEEEeecccccceeeeEEEE Confidence 11 1112233344445677766665 489999996 457787666753 344555544321 22 Q ss_pred ecCCCceeeeecccccccccccceeccCCC----CcccccccceEEEeecCccCccccchhH-HHHHHHHHHHHHHHHHH Q lcl|NC_021302. 161 VDRDGGLISIQQWPAGTFGGPGMVVMAPNS----MGPAIPVEQLVVYTHDMDPGVWTGNSLL-RPAYKNWKLKDELIRIE 235 (484) Q Consensus 161 ~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~----~~~~lp~~k~l~~~~~~~~~~p~G~gll-~~~~~~~~~K~~~~~~w 235 (484) -+.++..+...-+..+... ......+ ...++...-++.|.++.+-+.|+|.|-+ +.+-...---+..+... T Consensus 146 ~d~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~~~~ 221 (409) T protein:vir:16 146 RDENNNVVLEAHFLPDRTD----YYYRDSRNNISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERA 221 (409) T ss_pred ecCCCceEEEEEEecCcEE----EEEecCccccceecCCCCcceEEecccccccccCCccccchhHHHHHHHHHHHHHHH Confidence 2222222111111111000 0001111 1233334457888888888899998744 44332222223334444 Q ss_pred HHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCC---ceEEEecccC-CchhHHHHHHHHHHHH Q lcl|NC_021302. 236 AAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAG---EEAGILSPNG-TPLDPRRAIEYHDHQM 311 (484) Q Consensus 236 ~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~---~~ie~~~~~~-~~~~~~~li~~~d~~I 311 (484) ..-.|=|.++..+++|-...+...+..+ ..+. ....+|++ ..+++-+.++ +...|...++.+-+++ T Consensus 222 ~~~~e~~a~pqr~i~G~d~d~~~~~~~~---~~~~-------~i~~~~~d~~g~~~~v~q~~~~~l~~~~~~l~~~~~~~ 291 (409) T protein:vir:16 222 DVTAEFYSFPQKYVTGLSDDAEPMETWK---ATVS-------SMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGF 291 (409) T ss_pred HHHHHHhcChhheeEecCCCCCccchhh---hhhh-------HhhccCCCCCCCCceEEecCCCChhHHHHHHHHHHHHH Confidence 5556767777778887543332222221 1111 23445533 3355544332 3346777777766776 Q ss_pred HHHHhhhhhcc-c--ccc-cc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--CCCccc---cceEEecCC Q lcl|NC_021302. 312 ALVALAHFLNL-D--GKG-GS-YALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVN--WGEDEP---APLLVFDEI 381 (484) Q Consensus 312 sk~ilGqtlt~-~--~~g-Gs-~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~N--f~~~~~---~P~~~~~~~ 381 (484) |-. .+=.+.. + +.+ +| -|+. ....-+....+.-.+.+...+. ++.+.++.+- ++.... --+++|.+. T Consensus 292 a~~-s~lP~~~lg~~~~NpsSa~Ai~-a~~~~L~~ka~~k~~~fg~~l~-~~~rla~~~~~~~~~~~~~~~~~~v~W~~~ 368 (409) T protein:vir:16 292 AGE-TGLTLDDLGFVSDNPSSVEAIK-ASHENLRLAGRKAQRSLGAGLL-NVAYLAACLRDDVPYLREQFSKTKPKWEPL 368 (409) T ss_pred hhh-cCCCHHHcccccCchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCCccchhhccceEEecCC Confidence 654 2211111 1 111 22 2222 2233333344445566777774 5777656552 222111 125567543 Q ss_pred C-C---cHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCC Q lcl|NC_021302. 382 G-S---RQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPD 421 (484) Q Consensus 382 ~-~---~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~ 421 (484) . . .....|+++.||+++|.... ..+.+++.+|+..++ T Consensus 369 ~~~~~~s~a~~aDa~~Kl~~a~~~~~---~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 369 FEADASMLSLIGDGAIKLNQAIPEFI---NKDTIRDLTGIKGAE 409 (409) T ss_pred CCcchhhHHHHHHHHHHHHhhccccc---chhHHHHhccCCCCC Confidence 2 1 24677899999999986432 346789999997665 No 136 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=98.58 E-value=2.4e-07 Score=56.85 Aligned_cols=403 Identities=11% Similarity=0.105 Sum_probs=182.5 Q ss_pred CCCCCCCccceee----eecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEe Q lcl|NC_021302. 1 MAPKTVAPRTERG----YVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIR 76 (484) Q Consensus 1 ~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~ 76 (484) |+-+++.-..-.. +..-..|.......+......... .....++-|-.-.-.-+++...++.....|.+.++.++ T Consensus 3 V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~-E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~vf~k~p~~~ 81 (452) T protein:vir:94 3 IETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSG-QTDDMYNAYKQRALFYSITSKTLSALSGMVLDQPPVIT 81 (452) T ss_pred CCCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCC-CCHHHHHHHHhhccCCchHHHHHHHHhchhhcCCceec Confidence 2221111111000 000011111000011111111100 00112222222112246677777777777777777664 Q ss_pred cCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccc Q lcl|NC_021302. 77 PNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSS 155 (484) Q Consensus 77 p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~ 155 (484) -+ +.. +.... + ..+.+++.+++.++ .++.||.+.+=+.|-..+++ -.+..+++.. T Consensus 82 ~p---~~l-~~~~~----------------D-~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~r---Py~~~~~~~~ 137 (452) T protein:vir:94 82 HP---DAM-SKYFE----------------D-QSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGD---PYISVYTTEN 137 (452) T ss_pred cc---HHH-HHHHh----------------c-ccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCc---eEEEEechhh Confidence 21 111 11100 0 23567899999887 79999998877777655553 2355555666 Q ss_pred eeeeeecCCCcee--eeec-----cccccccc-----c----------cceeccC-CCC-------------cccccccc Q lcl|NC_021302. 156 IAYWNVDRDGGLI--SIQQ-----WPAGTFGG-----P----------GMVVMAP-NSM-------------GPAIPVEQ 199 (484) Q Consensus 156 ~~~~~~~~dg~l~--~~~q-----~~~~~~~~-----~----------~~~~~~~-~~~-------------~~~lp~~k 199 (484) |-=|+++.+|++. .++. .+.+.++. . ....+.. ... +..+..=- T Consensus 138 Ii~W~~~~~g~l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~IP 217 (452) T protein:vir:94 138 ILNWEEDEDGRLLMVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAKTSTIQNVGVTMDYIP 217 (452) T ss_pred hcCccccccCCeeEEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeeeccceeecCCCcccceeE Confidence 5557777777642 1111 11111110 0 0001110 110 01111111 Q ss_pred eEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceE Q lcl|NC_021302. 200 LVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAG 279 (484) Q Consensus 200 ~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~ 279 (484) |+++..... +--.+.+-|..++..-+---....+.-..+..-++|+|++.|-.. .+ .+.-|..++ T Consensus 218 ~v~~~~~~~-~~~~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~~~--~~------------~i~iG~~~~ 282 (452) T protein:vir:94 218 FFCITPSGL-SMTPAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWITGAES--QS------------TMHIGSTKA 282 (452) T ss_pred EEEEcCCCC-CCCCCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEeecCcC--CC------------ceEeccccc Confidence 333222221 222355555566544322222222233333333567777665321 11 234487888 Q ss_pred EEccC-CceEEEecccCCc-hhHHHHHHHHHHHHHHHHhhhhhcccccccchhhH-HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 280 LALTA-GEEAGILSPNGTP-LDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALA-SVQADTFVQSVQTVADEIRDVAQA 356 (484) Q Consensus 280 ~vip~-~~~ie~~~~~~~~-~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~-evh~~v~~~~~~aD~~~i~~~ln~ 356 (484) +.+|+ |.++.+++.+|++ ...++.++....+|..+ .+..+...+.+-.-+.+ ..........+.+-+..+++.++ T Consensus 283 ~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~~m~~~-Ga~ll~~~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al~- 360 (452) T protein:vir:94 283 WVIPEVAAKVGFLEFTGQGLQSLEKALSEKQAQLASL-SARLIDNSTRGSEATETVKLRYMSETASLKSVTRAVEALLN- 360 (452) T ss_pred ccCCCCCCcceEEccCchhHHHHHHHHHHHHHHHHHH-HHHhhccCCCcchHHHHHHHHHHHhhHHHHHHHHHHHHHHH- Confidence 99996 9999999998776 45777888777777433 33444443222112222 12333335777888888999996 Q ss_pred HHHHHHHHhCCCCccccceEEecC--C-CCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCC Q lcl|NC_021302. 357 HVVEDIVDVNWGEDEPAPLLVFDE--I-GSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADT 433 (484) Q Consensus 357 qli~~l~~~Nf~~~~~~P~~~~~~--~-~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~ 433 (484) +++++++.+-. .+ .-.+|++.. . ........+++.++...|.+-. ..--.++ ++.|++.++.+++........ T Consensus 361 ~~l~~~a~w~g-~~-~~~~v~~n~dF~~~~~~~~~~~al~~~~~~G~is~-~t~~~~L-~~~gvl~~~~e~~~i~~E~~~ 436 (452) T protein:vir:94 361 KAYSCIMDMES-MG-GTLNIKLNSAFLDSKLTAAELKAWVEAYLSGGISK-EIYIHAL-KVGKVLPPPGESMGVIPDPPA 436 (452) T ss_pred HHHHHHHHHcC-CC-CceEEEeccccccccCCHHHHHHHHHHHhcCCCcH-HHHHHHH-HhCCCCCCccCHHHHHHHhhc Confidence 58898888653 22 223565532 1 1223445566667888886431 1111222 334888775544332211111 Q ss_pred cCCCccccCCCCccccc Q lcl|NC_021302. 434 GQDEPETDEPALPNTSG 450 (484) Q Consensus 434 ~~~~~~~~~~~~~~~~~ 450 (484) ..+. ..+.|..++++. T Consensus 437 ~~~~-~~~~~~~~~~~~ 452 (452) T protein:vir:94 437 PEPS-PSNTPPNPSSKA 452 (452) T ss_pred cCcc-cCCCCCCCccCC Confidence 1111 112222222222 No 137 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=98.56 E-value=2.8e-07 Score=56.47 Aligned_cols=319 Identities=13% Similarity=0.046 Sum_probs=147.5 Q ss_pred CCCCCCCccc-----------eee----eecccccchhhhhhhcccccccccccccchHHHHHHHHhcc----------- Q lcl|NC_021302. 1 MAPKTVAPRT-----------ERG----YVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREE----------- 54 (484) Q Consensus 1 ~~~~~~~~~~-----------~~~----~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D----------- 54 (484) ||.+-.++.. +-. ..... +....+.- -++.+.+.+ ..+.-|-++...+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~f----g~p~~~~~~-~~~~~~~~~~~~~~~~~~pi~~~~ 74 (368) T protein:vir:79 1 MSRNKTRRAARAASAHVRTANTDAPTEHHTDRA-AQAEVFSF----GDPVEVLDR-RELLDYVECMRMGQWYEPPMPWDG 74 (368) T ss_pred CCccccccchhccCcccccccccCcchhhcccc-CceEEEEc----CCceeecch-hhHHHHHHHHhccchhccCcCHHH Confidence 3322211100 000 00000 00000000 011111111 1122222332222 Q ss_pred --------hHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHh Q lcl|NC_021302. 55 --------ARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQ 126 (484) Q Consensus 55 --------~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~ 126 (484) +|-++++..+...+ .+ ...|+. .....++-.-+.+-+. T Consensus 75 la~~~~~~~~h~~~~~~~~n~l-~l--~~~Pn~-------------------------------~~t~~~f~~l~~d~ll 120 (368) T protein:vir:79 75 LARSFRAAAHHSSAVYVKRNIL-VS--TFIPHP-------------------------------LLSRATFERLVLDWQV 120 (368) T ss_pred HHHHHhhccccchhhhhhcchh-hh--hcCCCc-------------------------------CCCHHHHHHHHHHHhh Confidence 22222222211111 00 001111 1123344444556778 Q ss_pred hcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeec Q lcl|NC_021302. 127 FGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHD 206 (484) Q Consensus 127 ~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~ 206 (484) +|.+.+|++....+ .+..|.+.++.++.. ..+++.... ....+....+++...++.+.. T Consensus 121 ~Gnay~~~~r~~~G---~~~~L~~l~~~~v~~---~~~~~~~~~---------------~~~~~~~~~~~~~dIihir~~ 179 (368) T protein:vir:79 121 FGNAYLERRENVLG---GTIRLDTPLAKYVRR---GLDLNTYFF---------------VQNWQQPYTFAAGSVFHLQEP 179 (368) T ss_pred cCCeEEEEEEcCCC---CEEEEEEeCccccee---eccCCEEEE---------------EecCCeEEEEccccEEEecCC Confidence 99999999865432 366888888877642 233332211 112334556778887766644 Q ss_pred CccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecC-CCCCCHHHHHHHHHHHHHHhcCCceE--EEc- Q lcl|NC_021302. 207 MDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNE-ADSEDDDRMDELLEIASNYSGGESAG--LAL- 282 (484) Q Consensus 207 ~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~-~~~~~~~~~~~l~~~l~~~~~g~~a~--~vi- 282 (484) ...+..||.+.+..+......-.....+-..|... .+.|-.+..+ +...++++++++.+.+++..+..+++ +++ T Consensus 180 ~~~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~N--Ga~~~gil~~~~~~l~~e~~~~lk~~~~~~~G~~N~g~~~vl~ 257 (368) T protein:vir:79 180 DINQEVYGLPEYLSALNATWLNESATLFRRRYYKN--GSHAGFILYMTDAAQKQEDVDTLREAMKSAKGPGNFRNLFMYA 257 (368) T ss_pred CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhc--cCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCCcccCceeEec Confidence 44566799999999888877777777777777764 2556444433 45678999999999998864333332 333 Q ss_pred ----cCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcc---cccccchhhHHHHHHHH-HHHHHHHHHHHHHHH Q lcl|NC_021302. 283 ----TAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNL---DGKGGSYALASVQADTF-VQSVQTVADEIRDVA 354 (484) Q Consensus 283 ----p~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~---~~~gGs~A~~evh~~v~-~~~~~aD~~~i~~~l 354 (484) +.|++++-++.+.....|.+.-++-.++|+.+......-. +..+|+++-.+-...++ ..-+.-.++.|+ .+ T Consensus 258 ~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~e~~~~~f~~~~l~Pl~~~ie-~l 336 (368) T protein:vir:79 258 PNGKKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPPQLMGIIPNNTGGFGDVEKAAMVFARNEVKPLQDRLL-AI 336 (368) T ss_pred CCCCccceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCHHHccccCCCCCccccHHHHHHHHHHHHHHHHHHHHH-HH Confidence 3455555555555556799999999999999975543222 11223343222222222 122222333333 22 Q ss_pred HHHHHHHHHHhCCCCccccceEEecCCC---CcHHHHHHHHHHHHhcCcccC Q lcl|NC_021302. 355 QAHVVEDIVDVNWGEDEPAPLLVFDEIG---SRQDATAAALQMLVNAGLLTP 403 (484) Q Consensus 355 n~qli~~l~~~Nf~~~~~~P~~~~~~~~---~~~~~~ae~~~~L~~~G~~~~ 403 (484) |..| ... .++|++.. .|.+..|+. |.+-. T Consensus 337 n~~l----------~~e---~~rF~~~~l~~~D~~a~a~~-------~~rsa 368 (368) T protein:vir:79 337 NDWI----------GDE---VVRFAPYALGGHDQPAAAPG-------GQRSA 368 (368) T ss_pred Hhcc----------Ccc---eeeechhHhhcccccccCCc-------ccccC Confidence 2211 111 34454311 122222221 11100 No 138 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=98.50 E-value=4.2e-07 Score=55.53 Aligned_cols=397 Identities=14% Similarity=0.120 Sum_probs=166.6 Q ss_pred CCCC------CCCccce-----eeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhh Q lcl|NC_021302. 1 MAPK------TVAPRTE-----RGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIR 69 (484) Q Consensus 1 ~~~~------~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~ 69 (484) |+.+ ++..... -+..+..-+.|+..-..+.+.-.+..+........| +++.-...++++-..... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~rd~l~~~~~glg~~r~~~~~~~g~~~~~~~~~l~~~Y----r~~~ia~~iVd~~~d~~~ 76 (449) T protein:vir:10 1 MTDKLTLAVNHALNDARMARARMGLMVPTMGLDNKRHSAWCEYGFPELVTYENLYSLY----RRGGIAHGAVEKLVGKCW 76 (449) T ss_pred CchhhHHHHhhhcchhHHHHHHHHHHHHHhcCCcccchhhhhcCCcccCCHHHHHHHH----hcCchhHHHHHhhhhhhh Confidence 4433 2222222 122333333333222222222222222222223333 345556666665544332 Q ss_pred CCCcEEecCCCCHHHH--HHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcceeeeEEEeecCCe-e--- Q lcl|NC_021302. 70 RTDWRIRPNGARPEVV--EHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAVFEQTYFYEGGR-F--- 143 (484) Q Consensus 70 ~~~~~v~p~~~~~e~~--~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~~Eivw~~~~g~-~--- 143 (484) ..-..|..+.+.++.. ......+ .+++...-|..+....--+.+||++++=+. ..+|+ + T Consensus 77 ~~~~~i~~g~~~~~~~~~~~~e~~~-------------~~l~~~~~~~~l~ea~~~~rl~Gga~i~i~--v~d~~~l~~P 141 (449) T protein:vir:10 77 QTNPEIIEGDDADDSEDETSWEKKS-------------KQVFTNRLWRSFAEADRRRLVGRYAGILLH--IRDEKDWNLP 141 (449) T ss_pred hcCcccccCccccchhhhHHHHHHH-------------HHHHHHHHHHHHHHHHHhhhccCcEEEEEE--ecCCCCCCcc Confidence 2222333222222111 1111111 011111114444444445778999987443 22221 1 Q ss_pred -----eeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCC-----CCcccccccceEEEeecCccCccc Q lcl|NC_021302. 144 -----WLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPN-----SMGPAIPVEQLVVYTHDMDPGVWT 213 (484) Q Consensus 144 -----~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~-----~~~~~lp~~k~l~~~~~~~~~~p~ 213 (484) .+.+|.+.....+. .. .. ..-.....++.+..+..... ..+..|-+.+++++...+ .. T Consensus 142 l~~~~~i~~i~v~~~~~i~-----~~-~~--~~dp~sp~yg~P~~y~v~~~~~g~~~~~~~iH~SRl~~~~~~~----~~ 209 (449) T protein:vir:10 142 ATKGRGLQKVSVSWAGSLK-----VA-EW--DTGINSKTYGQPKLWKYTERLPNGSSRRVDIHPDRVFILGDYS----ED 209 (449) T ss_pred cccCcceeeEEeeccccCC-----hh-hh--hcCCCCCCCCCceEEEEeeeccCCCccceeeccceeEeecCCC----CC Confidence 12233322211110 00 00 01112344555555543321 222234455555443221 12 Q ss_pred cchhHHHHHHHHH-HHH-----------HHHHHHHHHHHH--hcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceE Q lcl|NC_021302. 214 GNSLLRPAYKNWK-LKD-----------ELIRIEAAAIRR--HGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAG 279 (484) Q Consensus 214 G~gll~~~~~~~~-~K~-----------~~~~~w~~f~Er--~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~ 279 (484) |.++|+++|-..+ +-+ ...+....-.++ ...++.-.++ ...++-.+.+.+.++.+..+.+ . T Consensus 210 g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~~----~~~e~~~~~~~~~~~~~~~~~~-~ 284 (449) T protein:vir:10 210 AIGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLYG----VSIDELQDKFNEVAGEINRGND-V 284 (449) T ss_pred ChhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHhh----CCchHHHHHHHHHHHHHhccch-h Confidence 7788998875321 100 000111111111 1112222211 1123334455556666655654 4 Q ss_pred EEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhc---ccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 280 LALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLN---LDGKGGSYALASVQADTFVQSVQTVADEIRDVAQA 356 (484) Q Consensus 280 ~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt---~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~ 356 (484) +++.++.+++.++.+-++ ...+++..-.++|-+. +-.+| ..+-||-.|.++ ...+.+.+.+-...+...|.+ T Consensus 285 ~~i~~~~d~~~~~~~~sg--l~d~l~~~~q~iaaa~-~IP~t~L~Gqsp~glnst~D--~~nyyd~i~~~Q~~l~p~le~ 359 (449) T protein:vir:10 285 LMTTQGATVTPLVTSVAD--PTATYNVNLQTAAAGV-DIPTRILIGNQQAERSSTED--QKYFNARCQSRRVDLSFEIED 359 (449) T ss_pred eeecCCcceEEEecccCC--hhHHHHHHHHHHHHHh-CCCeeeeeccCccccccchh--HHHHHHHHHHHHHhhhHHHHH Confidence 456777788888765332 3334444444455553 33322 222233223233 456777777776667777754 Q ss_pred HHHHHHHHhCCCCccccceEEecCCC-CcHH-------HHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCccccc Q lcl|NC_021302. 357 HVVEDIVDVNWGEDEPAPLLVFDEIG-SRQD-------ATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDE 428 (484) Q Consensus 357 qli~~l~~~Nf~~~~~~P~~~~~~~~-~~~~-------~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~ 428 (484) |++.|+...|+....--.|+|.+.- .+.+ ..|++++++++.|... .++.+++|+..|...+..... .. T Consensus 360 -l~~~l~~s~~g~~~~d~~i~f~pL~~~t~kEkAei~k~~A~a~~~~~~ag~~~--~~~~~EiR~~~~~~~~~~~~~-~~ 435 (449) T protein:vir:10 360 -FCDKLIELKIIDAVAKKAVIWDDLNEQTGTEKLTNAKTMGEINQTMLGSGDNP--AFSREEIRTAAGYDNDDEEPL-GE 435 (449) T ss_pred -HHHHHHHhhcCCCCCceeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHccccC--CcCHHHHHHHhcccCCCCCCC-CC Confidence 8888888877654322366665532 2223 3567788888887532 346789999999865432211 00 Q ss_pred ccCCCcCCCccccCCCCccccccccccccc Q lcl|NC_021302. 429 STADTGQDEPETDEPALPNTSGTTSTTNAP 458 (484) Q Consensus 429 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (484) .. .+ +. .....+++ T Consensus 436 e~----~d--e~----------~~~~d~~a 449 (449) T protein:vir:10 436 ED----GD--EE----------DKATDSAA 449 (449) T ss_pred CC----Cc--cc----------cccCCcCC Confidence 00 00 00 00000000 No 139 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=98.45 E-value=5.9e-07 Score=54.72 Aligned_cols=446 Identities=13% Similarity=0.025 Sum_probs=164.5 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHH----HHHHHHHHHhhCCCcEEe Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIA----SVLRAIGLPIRRTDWRIR 76 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~----s~l~~r~~~v~~~~~~v~ 76 (484) |.|-.-...+-...+..+-.-...++.-|...-.....|..+..+.|+ ....-..+. ..+. +...| .+| T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~e~~~i~~L~~~~~~~~~r~~~l~~YY~-G~~~i~~~~~~~p~~~~-~~~~v--~n~--- 73 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDDVVDKVNGLYQQLVDRTPRNLLRASFYD-GKYAIRQIGNLIPPEYL-RTATV--LGW--- 73 (504) T ss_pred CCccCCcccccccccCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHHh-ccccchhccccccHHHH-HHhhc--cCc--- Confidence 332221111111111111000000111000000000001111112222 111001111 1111 11111 122 Q ss_pred cCCCCHHHHHHHHHHHHhh---h-ccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeee Q lcl|NC_021302. 77 PNGARPEVVEHVAACLGLP---V-EGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPR 151 (484) Q Consensus 77 p~~~~~e~~~~~~~~l~~~---~-~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r 151 (484) ...+++..++.|... + ..++.+....+....-+|+....+++ +|+.||.|. ++||.-.+|.-.+ .|..+ T Consensus 74 ----~~~iVd~~a~rl~~~Gf~~~d~~~~~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af-~~v~~~~d~~~~~-~I~~~ 147 (504) T protein:vir:99 74 ----SAKAVDTLARRCNLESFVWPDGDYGSIGGPDVWDENFFATKANNAMVSSLIHGPAF-LINTEGGAGEPDS-LIHVK 147 (504) T ss_pred ----HHHHHHHHHhhhccceeeCCCCChhhHHHHHHHHhcChhhHHHHHHHHHHhhCcee-EEEecCCCCCcee-EEEEe Confidence 112333333322111 0 11111222333334446777666664 899999965 7888766553211 13333 Q ss_pred Cccce---------------eeeeecCCCceeeeeccccccccc-----ccceeccCCCCcccccccceEEEeecCccCc Q lcl|NC_021302. 152 PQSSI---------------AYWNVDRDGGLISIQQWPAGTFGG-----PGMVVMAPNSMGPAIPVEQLVVYTHDMDPGV 211 (484) Q Consensus 152 ~~~~~---------------~~~~~~~dg~l~~~~q~~~~~~~~-----~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~ 211 (484) +|++. .++..+.+|......-+..+..-. ...+..........+| ++.|.++.+.+. T Consensus 148 sP~~~~~iyD~~~~~~~~a~~~~~~d~~g~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~gvP---vV~~~n~~~~~~ 224 (504) T protein:vir:99 148 SAMQATGEWNSRRNAMDSLLSITSRDAEGHPTGIALYEDGVTVTADMDDDGDWHADVRTHKLGVP---VEVLPYKPREDR 224 (504) T ss_pred ccceeEEEEeCCCCceeEEEEEEEecCCCeEEEEEEEcCCcEEEEEEcCCceeeeccccCCCCcc---eEEecccccCcc Confidence 44332 122233333322211111111000 0000000011111233 677788888889 Q ss_pred cccchhHH-HHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHH------HHHHHHHHHHHHhcCCceEEEccC Q lcl|NC_021302. 212 WTGNSLLR-PAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDD------RMDELLEIASNYSGGESAGLALTA 284 (484) Q Consensus 212 p~G~gll~-~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~------~~~~l~~~l~~~~~g~~a~~vip~ 284 (484) |+|.+-+. .+-...=--+..+...+.-.|-|.++..+++|-......++ ..+.....+-.+..+ .=+.++. T Consensus 225 ~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~--~~~~~~~ 302 (504) T protein:vir:99 225 PLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILLGADAKNFRNKDGSMKPAWQIALARVFALPDD--EDEPDAA 302 (504) T ss_pred ccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhccCCccccccccccccchhhhhhhhhhcCCCc--ccccccc Confidence 99977542 33222222233333444556767676667766432211111 111111112222111 1122333 Q ss_pred CceEEEecccC-CchhHHHHHHHHHHHHHHHHhh--hhh--cccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 285 GEEAGILSPNG-TPLDPRRAIEYHDHQMALVALA--HFL--NLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVV 359 (484) Q Consensus 285 ~~~ie~~~~~~-~~~~~~~li~~~d~~Isk~ilG--qtl--t~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli 359 (484) +.+.++-+... +...|...++.+-.+||..-.- +.| .++...+|-..-.....-....++.-.+.+...+. +++ T Consensus 303 ~~~~~~~q~~~~~l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~-~~~ 381 (504) T protein:vir:99 303 RARADVKQFPASSPQPHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPAFR-RSM 381 (504) T ss_pred CccceeeecCCCChHHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHH Confidence 44455544332 3345766677666777654211 112 11111122212223344445555566677778885 466 Q ss_pred HHHHHh--CCCC--c-cccceEEecCC-CCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCc--cc-cc-- Q lcl|NC_021302. 360 EDIVDV--NWGE--D-EPAPLLVFDEI-GSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDA--DD-DE-- 428 (484) Q Consensus 360 ~~l~~~--Nf~~--~-~~~P~~~~~~~-~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e--~~-~~-- 428 (484) +..+.+ |++. . ..-.+++|.+. .....+.|+++.||+..|.... ...+.+.+.+|+...+-.. .. .. T Consensus 382 rla~~~~~~~~~~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~--~~~~~l~~~lg~~~~ei~r~~~e~~~~~ 459 (504) T protein:vir:99 382 IRALAIKNGLDRIPPEWKTIDSKFRSPLYLSKAAQADAGAKMLGAGPEWL--KETEVGLELLGLTPQQAKRALAERRRAS 459 (504) T ss_pred HHHHHHhcCCCccccccccceeEecCCCccCHHHHHHHHHHHHhhccccc--cchHHHHhhcCCCHHHHHHHHHHHHHHh Confidence 665544 3321 1 12235667543 3466788999999999885221 1235678888986442110 00 00 Q ss_pred ------ccCCCcCCCccccCCCCccccccccccccccccccccccchHHHhcCccc Q lcl|NC_021302. 429 ------STADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSPRDRRKTPDG 478 (484) Q Consensus 429 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (484) ...+.. +.+..+........+.+.......+..+| ...| T Consensus 460 ~~~~~~~l~~~~-~~~~~~~~~~~~~~~e~a~~~~~~~~~~p----------~~~~ 504 (504) T protein:vir:99 460 SVSIIEALNRRQ-QEAATAGEDQDQGAGEPPANEPPAALGRP----------TLVG 504 (504) T ss_pred hHHHHHHHhccc-CCCCCCCCCCCcCCCCCCCCCCCccCCCc----------ccCC Confidence 000000 00001100000101110000000111111 0111 No 140 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=98.38 E-value=9.4e-07 Score=53.63 Aligned_cols=413 Identities=12% Similarity=0.028 Sum_probs=167.6 Q ss_pred CCCCCCCccceeeeec---ccccch--hhhhhhcccccccccc-c-ccchHHH-HHHHHhcchHHHHHHHHHHHHhhCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVN---PLAGFG--TFLAQGLDQFEQVDEL-R-WPNSVYT-YTRMCREEARIASVLRAIGLPIRRTD 72 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~---~~~~~~--~~~~~~~~~~~~~~~l-r-~~~~~~~-y~~m~~~D~~v~s~l~~r~~~v~~~~ 72 (484) |.|.++.-.-...+.. ...... ..+..|= + ....+ + .+..++- +..+ ......-++......+.+.+ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~-~--~i~~~~~~~~~~~~~~~~k~--~~n~~~~ivd~~~~~l~~~~ 75 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGD-A--PLPELTRNTSAAWRSFQREA--RTNWGLMVRDSVADRIIPNG 75 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-C--CchhcCcccChhhhhhhhhh--hcchHHHHHHHHHhhhccCC Confidence 7666653321111000 000000 1111120 0 00000 0 1111111 1112 12344555555555566666 Q ss_pred cEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCeeeeeeeeee Q lcl|NC_021302. 73 WRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRFWLKRLAPR 151 (484) Q Consensus 73 ~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~~~~~l~~r 151 (484) +.+... ++.+..+.+.+. ...-+|+....++ .+|..||.+ ++++|.-.+|...+. .. T Consensus 76 ~~~~~~-~d~~~~~~~~~i-----------------~~~N~~d~~~~~~~~~a~i~G~a-y~~v~~d~~g~~~i~---~~ 133 (456) T protein:vir:10 76 ITVGGS-ADSDLALRARRI-----------------WRDNRMDSVCKQWVKYGLDFGES-YLTCWRRDDGTATIT---AD 133 (456) T ss_pred eecCCC-CCcchHHHHHHH-----------------HHhcChhhHHHHHHHHHhhcCee-EEEEeeCCCCceEEE---EE Confidence 665322 222222222222 1222466666666 578999996 589998777765433 33 Q ss_pred Ccccee---------------eeeecCCCceeeeeccccccccc-----------ccceeccCCCCcc--c-ccc-cceE Q lcl|NC_021302. 152 PQSSIA---------------YWNVDRDGGLISIQQWPAGTFGG-----------PGMVVMAPNSMGP--A-IPV-EQLV 201 (484) Q Consensus 152 ~~~~~~---------------~~~~~~dg~l~~~~q~~~~~~~~-----------~~~~~~~~~~~~~--~-lp~-~k~l 201 (484) +|+... ++..+.|+...+..-+....... .........+... . .+. ..++ T Consensus 134 ~p~~~~~i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (456) T protein:vir:10 134 SPETMVVSVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPP 213 (456) T ss_pred ccceeEEEEcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCce Confidence 333321 11112222211110000000000 0000000000000 0 000 0111 Q ss_pred EEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCC-HHHHHHHHHHHHHHhcCCceEE Q lcl|NC_021302. 202 VYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSED-DDRMDELLEIASNYSGGESAGL 280 (484) Q Consensus 202 ~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~-~~~~~~l~~~l~~~~~g~~a~~ 280 (484) ... ...|+.|.|.+..+....---...+.+.+...+-+.+++.+++|...+... ++.-..+ +....+.++..... T Consensus 214 pvv---~~~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~-~~~~~~~~~~~~~~ 289 (456) T protein:vir:10 214 PVV---VYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAI-DYASIFEAAPGALW 289 (456) T ss_pred eEE---EecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccccccccccccc-chhhhhhhhccccc Confidence 111 135788999988876544444445555666677666666666664322211 1111111 11112222323445 Q ss_pred EccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhccccc--ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 281 ALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGK--GGSYALASVQADTFVQSVQTVADEIRDVAQAHV 358 (484) Q Consensus 281 vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~--gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~ql 358 (484) .+|.+.+|..+.. .+...|...++.+-.+|+..-.-.....++. +.|-..-.....-....+..-.+.+...+. ++ T Consensus 290 ~~~~~~~~~q~~~-~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~-~~ 367 (456) T protein:vir:10 290 ELPPGVDIWESQA-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLE-AI 367 (456) T ss_pred cCCCCcceEEecc-cChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH Confidence 5788888755432 2334577777777777766522211111111 111111122233344444455566666664 46 Q ss_pred HHHHHHhCCCCccccceEEecC-CCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCC Q lcl|NC_021302. 359 VEDIVDVNWGEDEPAPLLVFDE-IGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDE 437 (484) Q Consensus 359 i~~l~~~Nf~~~~~~P~~~~~~-~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~ 437 (484) ++-++.+.-.......++.|.. ......+.++++.+|+++|+. +..-+++.+|+...+-.+............. T Consensus 368 ~rl~~~~~g~~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~-----~~~~~~~~lg~~~~~i~~~e~er~~~e~~~~ 442 (456) T protein:vir:10 368 LVKALQIEGESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGES-----WASIRRNILNYNADQIKQDDLDRAREQITLF 442 (456) T ss_pred HHHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHHcCCC-----hHHHHHhhCCCCHHHHHHHHHHHHHHHHHHH Confidence 7766666533222234667754 345678889999999999873 2445677788753211000000000000000 Q ss_pred ccccCCCCcccccccccccccccccc Q lcl|NC_021302. 438 PETDEPALPNTSGTTSTTNAPQARKR 463 (484) Q Consensus 438 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 463 (484) +. +....|. +.+++ T Consensus 443 ~~-~~~~~~~-----------~~~~~ 456 (456) T protein:vir:10 443 AG-NPVQRPQ-----------EDGSR 456 (456) T ss_pred hh-hhhhcCC-----------CCCCC Confidence 00 0000000 00000 No 141 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=98.38 E-value=9.4e-07 Score=53.63 Aligned_cols=413 Identities=12% Similarity=0.028 Sum_probs=167.6 Q ss_pred CCCCCCCccceeeeec---ccccch--hhhhhhcccccccccc-c-ccchHHH-HHHHHhcchHHHHHHHHHHHHhhCCC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVN---PLAGFG--TFLAQGLDQFEQVDEL-R-WPNSVYT-YTRMCREEARIASVLRAIGLPIRRTD 72 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~---~~~~~~--~~~~~~~~~~~~~~~l-r-~~~~~~~-y~~m~~~D~~v~s~l~~r~~~v~~~~ 72 (484) |.|.++.-.-...+.. ...... ..+..|= + ....+ + .+..++- +..+ ......-++......+.+.+ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~-~--~i~~~~~~~~~~~~~~~~k~--~~n~~~~ivd~~~~~l~~~~ 75 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGD-A--PLPELTRNTSAAWRSFQREA--RTNWGLMVRDSVADRIIPNG 75 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-C--CchhcCcccChhhhhhhhhh--hcchHHHHHHHHHhhhccCC Confidence 7666653321111000 000000 1111120 0 00000 0 1111111 1112 12344555555555566666 Q ss_pred cEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCeeeeeeeeee Q lcl|NC_021302. 73 WRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRFWLKRLAPR 151 (484) Q Consensus 73 ~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~~~~~l~~r 151 (484) +.+... ++.+..+.+.+. ...-+|+....++ .+|..||.+ ++++|.-.+|...+. .. T Consensus 76 ~~~~~~-~d~~~~~~~~~i-----------------~~~N~~d~~~~~~~~~a~i~G~a-y~~v~~d~~g~~~i~---~~ 133 (456) T protein:vir:10 76 ITVGGS-ADSDLALRARRI-----------------WRDNRMDSVCKQWVKYGLDFGES-YLTCWRRDDGTATIT---AD 133 (456) T ss_pred eecCCC-CCcchHHHHHHH-----------------HHhcChhhHHHHHHHHHhhcCee-EEEEeeCCCCceEEE---EE Confidence 665322 222222222222 1222466666666 578999996 589998777765433 33 Q ss_pred Ccccee---------------eeeecCCCceeeeeccccccccc-----------ccceeccCCCCcc--c-ccc-cceE Q lcl|NC_021302. 152 PQSSIA---------------YWNVDRDGGLISIQQWPAGTFGG-----------PGMVVMAPNSMGP--A-IPV-EQLV 201 (484) Q Consensus 152 ~~~~~~---------------~~~~~~dg~l~~~~q~~~~~~~~-----------~~~~~~~~~~~~~--~-lp~-~k~l 201 (484) +|+... ++..+.|+...+..-+....... .........+... . .+. ..++ T Consensus 134 ~p~~~~~i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (456) T protein:vir:10 134 SPETMVVSVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPP 213 (456) T ss_pred ccceeEEEEcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCce Confidence 333321 11112222211110000000000 0000000000000 0 000 0111 Q ss_pred EEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCC-HHHHHHHHHHHHHHhcCCceEE Q lcl|NC_021302. 202 VYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSED-DDRMDELLEIASNYSGGESAGL 280 (484) Q Consensus 202 ~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~-~~~~~~l~~~l~~~~~g~~a~~ 280 (484) ... ...|+.|.|.+..+....---...+.+.+...+-+.+++.+++|...+... ++.-..+ +....+.++..... T Consensus 214 pvv---~~~N~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~-~~~~~~~~~~~~~~ 289 (456) T protein:vir:10 214 PVV---VYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAI-DYASIFEAAPGALW 289 (456) T ss_pred eEE---EecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccccccccccccc-chhhhhhhhccccc Confidence 111 135788999988876544444445555666677666666666664322211 1111111 11112222323445 Q ss_pred EccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhccccc--ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 281 ALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGK--GGSYALASVQADTFVQSVQTVADEIRDVAQAHV 358 (484) Q Consensus 281 vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~--gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~ql 358 (484) .+|.+.+|..+.. .+...|...++.+-.+|+..-.-.....++. +.|-..-.....-....+..-.+.+...+. ++ T Consensus 290 ~~~~~~~~~q~~~-~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~-~~ 367 (456) T protein:vir:10 290 ELPPGVDIWESQA-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLE-AI 367 (456) T ss_pred cCCCCcceEEecc-cChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH Confidence 5788888755432 2334577777777777766522211111111 111111122233344444455566666664 46 Q ss_pred HHHHHHhCCCCccccceEEecC-CCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCC Q lcl|NC_021302. 359 VEDIVDVNWGEDEPAPLLVFDE-IGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDE 437 (484) Q Consensus 359 i~~l~~~Nf~~~~~~P~~~~~~-~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~ 437 (484) ++-++.+.-.......++.|.. ......+.++++.+|+++|+. +..-+++.+|+...+-.+............. T Consensus 368 ~rl~~~~~g~~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~-----~~~~~~~~lg~~~~~i~~~e~er~~~e~~~~ 442 (456) T protein:vir:10 368 LVKALQIEGESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGES-----WASIRRNILNYNADQIKQDDLDRAREQITLF 442 (456) T ss_pred HHHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHHcCCC-----hHHHHHhhCCCCHHHHHHHHHHHHHHHHHHH Confidence 7766666533222234667754 345678889999999999873 2445677788753211000000000000000 Q ss_pred ccccCCCCcccccccccccccccccc Q lcl|NC_021302. 438 PETDEPALPNTSGTTSTTNAPQARKR 463 (484) Q Consensus 438 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 463 (484) +. +....|. +.+++ T Consensus 443 ~~-~~~~~~~-----------~~~~~ 456 (456) T protein:vir:10 443 AG-NPVQRPQ-----------EDGSR 456 (456) T ss_pred hh-hhhhcCC-----------CCCCC Confidence 00 0000000 00000 No 142 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=98.28 E-value=1.8e-06 Score=52.12 Aligned_cols=378 Identities=11% Similarity=0.009 Sum_probs=162.3 Q ss_pred chhhhhhhcccccccccccccchHHHHHHHHhcchHHH----HHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhh Q lcl|NC_021302. 21 FGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIA----SVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPV 96 (484) Q Consensus 21 ~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~----s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~ 96 (484) |....+..|...-.....|..+..+.| +.......+. ..+..+...+. +| ...+++.+++.+.... T Consensus 1 m~~~~i~~L~~~~~~~~~r~~~~~~yy-~g~~~~~~~~~~~p~~~~~~~~~v~--nw-------~~~~Vd~~a~rl~~~G 70 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFKTGVDKRYRYY-AMDDRDDTRSIVMPNNVREMYRSVL--EW-------TAKGVDSLADRIIFRE 70 (422) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHH-hcCCChhhcCccccHHHHHHHHhhc--ch-------hHHHHHHHHhccccce Confidence 333222222221111111111112222 2211111111 11212211221 33 1233444443332211 Q ss_pred ccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEee-cCCeeeeeeeeeeCccceee---------------e Q lcl|NC_021302. 97 EGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFY-EGGRFWLKRLAPRPQSSIAY---------------W 159 (484) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~-~~g~~~~~~l~~r~~~~~~~---------------~ 159 (484) .. ..+....+....-+|+.....+ .+|+.||.|. .+||.- ++|.. .+..++|++..- + T Consensus 71 f~-~~d~~l~~~w~~N~ld~~~~~~~~~al~~G~sf-~~v~~~~~~~~p---~i~~~sp~~~~~i~D~~~~~~~~a~~~~ 145 (422) T protein:vir:97 71 FT-NDDFNAWEIFKANNPDIFFDTAIQSALIASCCF-VYIMPGAEDGLP---KMQVIEASKATGILDPTTFLLTEGYAIL 145 (422) T ss_pred ee-CCchhHHHHHHhcChHHHHHHHHHHHHHhccee-EEEeeCCCCCee---EEEEechhhEEEEEeCCCCcceeeEEEE Confidence 11 0111122333334577666665 4799999974 456653 34542 345555555422 2 Q ss_pred eecCCCceeeeecccccccccccceeccCCCCc----ccccccceEEEeecCccCccccchhH-HHHHHHHHHHHHHHHH Q lcl|NC_021302. 160 NVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMG----PAIPVEQLVVYTHDMDPGVWTGNSLL-RPAYKNWKLKDELIRI 234 (484) Q Consensus 160 ~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~----~~lp~~k~l~~~~~~~~~~p~G~gll-~~~~~~~~~K~~~~~~ 234 (484) ..+.+|........... ........+.. .+++.--++.+.++.+.+.|+|.|-+ +.+-...---+..+.. T Consensus 146 ~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~ 220 (422) T protein:vir:97 146 ESDSNGNPTLEAYFTDK-----DIWYYPKKGKPYNIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTLER 220 (422) T ss_pred EecCCCcEEEEEEEcCc-----eEEEEcCCCccccccCCCCCcceEEecccCCCccccCccccchhHHHHHHHHHHHHHH Confidence 22223322111111100 11111111111 12222346778888888999998855 3333222222344444 Q ss_pred HHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCc---eEEEeccc-CCchhHHHHHHHHHHH Q lcl|NC_021302. 235 EAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGE---EAGILSPN-GTPLDPRRAIEYHDHQ 310 (484) Q Consensus 235 w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~---~ie~~~~~-~~~~~~~~li~~~d~~ 310 (484) .....|-|.++..+++|-...+...+..+. .+ + ....+|.+. .+++-+.+ .+...|...++.+-.+ T Consensus 221 ~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~---~~-----~--~i~~~~~de~~~~~~v~q~~~~~l~~~~~~l~~~~~~ 290 (422) T protein:vir:97 221 AEVTAEFYSFPQKYVLGMDPDAKPMEKWRA---TV-----S--TLLEISKDEDGDKPTVGQFTTASMAPFMEHLKMYASL 290 (422) T ss_pred HHHHHHHhcchhhhhcccCcccccCchhhh---hh-----h--hhhccCCCCCCCcceeeecCCCChhHHHHHHHHHHHH Confidence 455667677777777775433322222111 11 1 234455432 34554433 2234566666665555 Q ss_pred HHHHHhhhhhc---ccccc-cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccc-----cceEEecCC Q lcl|NC_021302. 311 MALVALAHFLN---LDGKG-GSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEP-----APLLVFDEI 381 (484) Q Consensus 311 Isk~ilGqtlt---~~~~g-Gs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~-----~P~~~~~~~ 381 (484) ||-. .+=+.. ..+.+ .|-..-.....-+...++.-.+.+...+. ++++.++.+.-+.... -..++|... T Consensus 291 ~a~~-s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~-~~~rla~~~~~~~~~~~~~~~~~~~~w~p~ 368 (422) T protein:vir:97 291 FAGG-SGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFL-NVAYIAVCLRDEFPYLRNQFMDTVIKWEPL 368 (422) T ss_pred Hhcc-cCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCcccchhhccceEEEccC Confidence 5544 111111 11111 12111133344455555666777888885 4777666664322111 124566532 Q ss_pred -CCc---HHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCccc-ccccCCC Q lcl|NC_021302. 382 -GSR---QDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADD-DESTADT 433 (484) Q Consensus 382 -~~~---~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~-~~~~~~~ 433 (484) +.+ ....|+++.||+++|-. ..+.+.+++.+|+..+...... ....+++ T Consensus 369 ~~~~~~s~a~~aDa~~Kl~~a~~~---~~~~~~~~~~lg~~~~~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 369 FEADANMLTLVGDGAIKLNQAIPG---FMDADVIRDLTGVKGADKPIPAITEVTTDG 422 (422) T ss_pred CCCChHHHHHHHHHHHHHHhhccc---cccHHHHHHHcCCCchhHHHHHHHhhhccC Confidence 223 45678889999988632 2346789999999544322111 1112222 No 143 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=98.27 E-value=1.8e-06 Score=52.06 Aligned_cols=414 Identities=12% Similarity=0.028 Sum_probs=162.5 Q ss_pred CCCCCCCccceeeeec---ccccch--hhhhhhcccccccccc--cccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCc Q lcl|NC_021302. 1 MAPKTVAPRTERGYVN---PLAGFG--TFLAQGLDQFEQVDEL--RWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDW 73 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~---~~~~~~--~~~~~~~~~~~~~~~l--r~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~ 73 (484) |-|.++.-.-...+-. ...... ..+..|-. ....+ +.+..++...... ......-++.+....+.+.++ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~---~i~~~~~~~~~~~~~~~~~~-~~n~~~~ivd~~~~~l~~~g~ 76 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDA---PLPELTRNTSAAWRSFQREA-RTNWGLMVRDSVADRIIPNGI 76 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccC---ChhhcCcccChhhchhhhhh-hcchHHHHHHHHHhhhccCCe Confidence 5555543221110000 000000 01111100 00000 0011111111111 123444455555555555566 Q ss_pred EEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeC Q lcl|NC_021302. 74 RIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRP 152 (484) Q Consensus 74 ~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~ 152 (484) .+... ++.+..+.+.+. ...-+|+.+..+++ +|..||.+ ++++|..++|... +...+ T Consensus 77 ~~~~~-~d~~~~~~~~~~-----------------~~~n~~d~~~~~~~~~a~~~G~a-~~~~~~~edg~~~---i~~~~ 134 (456) T protein:vir:79 77 TVGGS-ADSDLALRARRI-----------------WRDNRMDSVCKQWVKYGLDFGES-YLTCWRRDDGTAT---ITADS 134 (456) T ss_pred ecCCC-CCccHHHHHHHH-----------------HHhcChhHHHHHHHHHHhhcCee-EEEEeeCCCCceE---EEEec Confidence 54321 222222222222 22235777777665 78999985 6899987777654 33444 Q ss_pred ccceeeeeecC----------------CCceeeeecccccccccc-----------cceeccCCCCccc---cc-ccceE Q lcl|NC_021302. 153 QSSIAYWNVDR----------------DGGLISIQQWPAGTFGGP-----------GMVVMAPNSMGPA---IP-VEQLV 201 (484) Q Consensus 153 ~~~~~~~~~~~----------------dg~l~~~~q~~~~~~~~~-----------~~~~~~~~~~~~~---lp-~~k~l 201 (484) |+.+. ..+++ ++......-+........ ........+.... .+ ....+ T Consensus 135 p~~~~-~i~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (456) T protein:vir:79 135 PETMV-VSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPP 213 (456) T ss_pred cceeE-EEEcCCCCCceEEEEEEEEecCCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCce Confidence 44331 11121 111100000000000000 0000000000000 00 11111 Q ss_pred EEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEE Q lcl|NC_021302. 202 VYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLA 281 (484) Q Consensus 202 ~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~v 281 (484) .+. .+.|+.|.|.+..+-...---...+..-....+-|.+++-++.|-.......++..........+.++...... T Consensus 214 pvv---~~~N~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~~~~~~~~~~~ 290 (456) T protein:vir:79 214 PVV---VYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASIFEAAPGALWE 290 (456) T ss_pred eEE---EecCCCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccccccccchhhhhhhhcccccc Confidence 111 23578888888876433222233334444556666665555555321111111111111122222223334556 Q ss_pred ccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhccccc--ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 282 LTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGK--GGSYALASVQADTFVQSVQTVADEIRDVAQAHVV 359 (484) Q Consensus 282 ip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~--gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli 359 (484) +|.+.++..+.. .+...|...++..-.+|+..-.-..-..++. +.|...-+....-+...++.-.+.+...|+ +++ T Consensus 291 ~~~~~~~~q~~~-~~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~-~~~ 368 (456) T protein:vir:79 291 LPPGVDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLE-AIL 368 (456) T ss_pred CCCCcceeeecc-cChHHHHHHHHHHHHHHHhhcCCChhHhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHH Confidence 788887744432 2334577777777777765522111111111 112211122333344444555566777775 477 Q ss_pred HHHHHhCCCCccccceEEecC-CCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCc Q lcl|NC_021302. 360 EDIVDVNWGEDEPAPLLVFDE-IGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEP 438 (484) Q Consensus 360 ~~l~~~Nf~~~~~~P~~~~~~-~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~ 438 (484) +.++.+.-.......+++|.. ......+.|+++.+|+.+|+. +..-+++.+|+...+-........... T Consensus 369 ~l~~~~~g~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~G~~-----~~~~~~~~lg~~~~~i~~~e~~r~~~e----- 438 (456) T protein:vir:79 369 VKALQIEGESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGES-----WASIRRNILNYNADQIKQDDLDRAREQ----- 438 (456) T ss_pred HHHHHhcCCCccccceEEeCCCCCcCHHHHHHHHHHHHhcCCC-----hHHHHHhcCCCCHHHHHHHHHHHHHHH----- Confidence 766666533322234666743 345678889999999999973 234466778874331100000000000 Q ss_pred cccCCCCcccccccccccccccccc Q lcl|NC_021302. 439 ETDEPALPNTSGTTSTTNAPQARKR 463 (484) Q Consensus 439 ~~~~~~~~~~~~~~~~~~~~~~~~~ 463 (484) .... .+. ......+.+++ T Consensus 439 ~~~~------~~~-~~~~~~~~~~~ 456 (456) T protein:vir:79 439 ITLF------AGN-PVQRPQEDGSR 456 (456) T ss_pred HHHH------hhh-HhhcCCCCCCC Confidence 0000 000 00000000000 No 144 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=98.20 E-value=2.8e-06 Score=51.05 Aligned_cols=370 Identities=14% Similarity=0.071 Sum_probs=158.8 Q ss_pred cccccccccchHHHHHHHHhcchHHH----HHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHh Q lcl|NC_021302. 33 EQVDELRWPNSVYTYTRMCREEARIA----SVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRT 108 (484) Q Consensus 33 ~~~~~lr~~~~~~~y~~m~~~D~~v~----s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~ 108 (484) ......|.....+.|+ +...-..+. ..+..+.+.+. +| ...+++.+++.+....... .+...... T Consensus 1 l~~~~~r~~~~~~yY~-g~~~~~~~~~~~p~~~~~~~~~v~--nw-------~~~~Vds~a~rl~~~Gf~~-~d~~l~~i 69 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYA-MQHYEAPTGITIPAHIRAKYQAVL--GW-------AAKGVDSLADRLIFRAFAN-DDFNVTEI 69 (410) T ss_pred CCcchhhHHHHHHHhc-CCCCccccchhccHHHHhHHHhhc--ch-------hHHHHHHhHhhhccccccC-CCchHHHH Confidence 0001111111112221 111111111 11221212221 22 1122333333222111110 11112233 Q ss_pred hcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecC-CCceee---eecccc-cc----- Q lcl|NC_021302. 109 RGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDR-DGGLIS---IQQWPA-GT----- 177 (484) Q Consensus 109 ~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~-dg~l~~---~~q~~~-~~----- 177 (484) ...-+|+....++ .+|+.||.| +..||.-.+|.- .|..++|++..-. +|+ ++++.. +..... +. T Consensus 70 ~~~N~ld~~~~~~~~~al~~G~s-f~~v~~~~d~~~---~i~~~sP~~~~~i-~Dp~~~~~~~al~~~~~~~~~~~~~~~ 144 (410) T protein:vir:95 70 FDRNNPDIFFDSAILSALIGSCS-FVYISKGEDDEV---RLQVIESSNATGV-IDPITGLLVEGYAVLARDDYNRPTLEA 144 (410) T ss_pred HhhcChHHHHHHHHHHHHHhCce-eEEEecCCCCce---EEEEEcccceEEE-EeCCCCceEEEEEEEEecCCCeEEEEE Confidence 3344677766665 489999996 556787666643 3555555554321 122 222211 000000 00 Q ss_pred -cccccceeccCCCC----cccccccceEEEeecCccCccccchh-HHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEe Q lcl|NC_021302. 178 -FGGPGMVVMAPNSM----GPAIPVEQLVVYTHDMDPGVWTGNSL-LRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKG 251 (484) Q Consensus 178 -~~~~~~~~~~~~~~----~~~lp~~k~l~~~~~~~~~~p~G~gl-l~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~g 251 (484) ......+.....+. ..+++..-++.|.++.+.+.|+|.|- .+.+-...---+..+.....-.|=|.++..+++| T Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G 224 (410) T protein:vir:95 145 YFEPNATHFIPKDGEPYSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSWPQKYILG 224 (410) T ss_pred EEeCCcEEEEeeCCccccccCCCCCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeec Confidence 00011111111111 23334446788888888889999874 3544433322334444445566767777778888 Q ss_pred cCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCc---eEEEecccC-CchhHHHHHHHHHHHHHHHHhhhhhc---ccc Q lcl|NC_021302. 252 NEADSEDDDRMDELLEIASNYSGGESAGLALTAGE---EAGILSPNG-TPLDPRRAIEYHDHQMALVALAHFLN---LDG 324 (484) Q Consensus 252 k~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~---~ie~~~~~~-~~~~~~~li~~~d~~Isk~ilGqtlt---~~~ 324 (484) -...+...+..+ ..+ .....+|++. .+++-+.++ +...|...++.+-++||-. .+=++. ..+ T Consensus 225 ~d~d~~~~~~~~---~~~-------~~i~~~~~~~~~~~~~v~q~~~~~l~~~~~~l~~l~~~~a~~-s~lP~~~lg~~~ 293 (410) T protein:vir:95 225 LDPDAEPMEKWK---ATV-------SSLLTISSSDKGVKPSVGQFTTASMSPFTEQLRTAAAGFAGE-MGLTLDDLGFVS 293 (410) T ss_pred cCCCCCcCchhh---hhh-------hhheeccCCCCCCcceEEecCCCChHHHHHHHHHHHHHHhhh-cCCCHHHhcccc Confidence 543332222111 111 1345566543 355544332 3345766666666666654 111111 111 Q ss_pred cc-cc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--CCCcc---ccceEEec---CCC-CcHHHHHHHHH Q lcl|NC_021302. 325 KG-GS-YALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVN--WGEDE---PAPLLVFD---EIG-SRQDATAAALQ 393 (484) Q Consensus 325 ~g-Gs-~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~N--f~~~~---~~P~~~~~---~~~-~~~~~~ae~~~ 393 (484) .+ +| -|+.. ..+-+....+.-.+.+...+. ++++..+.+- ++... .-..++|. +.+ ......++++. T Consensus 294 ~NpsSa~Al~a-~~~~L~~ka~~k~~~fg~~l~-~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~d~~~~s~a~~aDa~~ 371 (410) T protein:vir:95 294 DNPSSVEAIKA-SHENLRLAGRKAQRSLGAGLL-NVAYVAACLRDEFRYTRSQFVRTAVKWEPLFEADANTMTMIGDGVV 371 (410) T ss_pred CchhHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCCCcccccceeeEEeeecCCcchhhHHHHHHHHH Confidence 11 22 23332 233334444455667778885 5777655552 32211 11244454 222 24577889999 Q ss_pred HHHhcCcccCCcccHHHHHHHhCCCCCCCCcccc-cccCCCc Q lcl|NC_021302. 394 MLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDD-ESTADTG 434 (484) Q Consensus 394 ~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~-~~~~~~~ 434 (484) ||+++|--+ .+.+-+++.+|+..++...... ...+.+. T Consensus 372 Kl~~a~~g~---~~~~~~~~~lg~~~~~~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 372 KLNQALPGY---INAETIRDLTGIAGDMSAKPVVSEGGSNGE 410 (410) T ss_pred HHHHhccCC---ccHHHHHHhcCCChHHHHHHHHHHHHhCCC Confidence 999985322 2456799999996432111000 0011111 No 145 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=98.12 E-value=1.1e-06 Score=53.19 Aligned_cols=206 Identities=8% Similarity=-0.041 Sum_probs=115.3 Q ss_pred eeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 159 WNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAA 238 (484) Q Consensus 159 ~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f 238 (484) .+...||++.+..... .....+....++++..++++.....+..||.+.+..+......-....++-..| T Consensus 1 ~r~~~dg~~~y~~~~~----------~~~~~g~~~~~~~~eilH~r~~~~~~~~~Glspi~~a~~~i~~~~aa~~~~~~~ 70 (219) T protein:vir:98 1 MRVCKDGNYKYLMKKS----------LYDTKSEIYEYNKNDVIFIKLYDPMQQVYGSPDYVGGITSALLNSDATIFRRRY 70 (219) T ss_pred CceeecCeEEEEEecc----------eecCCceeEEeccccEEEecCCCCCCCcceecHHHHHHHHHHHHHHHHHHHHHH Confidence 2233344443322110 111223455677888777665444556789999888776666555555555567 Q ss_pred HHHhcCCcceEEecC-CCCCCHHHHHHHHHHHHHHhcCCceE-EEc------cCCceEEEecccCCchhHHHHHHHHHHH Q lcl|NC_021302. 239 IRRHGIGVPYLKGNE-ADSEDDDRMDELLEIASNYSGGESAG-LAL------TAGEEAGILSPNGTPLDPRRAIEYHDHQ 310 (484) Q Consensus 239 ~Er~~~G~P~~~gk~-~~~~~~~~~~~l~~~l~~~~~g~~a~-~vi------p~~~~ie~~~~~~~~~~~~~li~~~d~~ 310 (484) ... .++|-.+..+ +...++++++++.+.+++..++.++. +++ +.|++++-+..+.....|.+.-++-..+ T Consensus 71 f~N--g~~p~gil~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~qfle~rk~~~~e 148 (219) T protein:vir:98 71 YSN--GAHMGFILYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGQKDEFANIKNISAQD 148 (219) T ss_pred Hhc--CCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEEccCCHHHHHHHHHHHhhHHH Confidence 764 3678555543 34578889999999998764433221 223 3467777666666656788888888999 Q ss_pred HHHHHhhhhhcc---cccccchhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCC-CcH Q lcl|NC_021302. 311 MALVALAHFLNL---DGKGGSYAL-ASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIG-SRQ 385 (484) Q Consensus 311 Isk~ilGqtlt~---~~~gGs~A~-~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~-~~~ 385 (484) |+.+.....-.. +..+++++- .+........-+.--+..|++.||+++ .+ +. ..++.|++.. +|. T Consensus 149 Ia~~fgVPp~~lG~~~~~~~~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~~-----~~--~~---~~~~~F~~~~~~d~ 218 (219) T protein:vir:98 149 VLTSHRFPPGLSGIIPVNTAGLGDPLKIREAYQADEVLPLQEIIAESINSDY-----EI--KS---ALKVNFKQPEKRDK 218 (219) T ss_pred HHHHhCCCHHHcccccCCCCCccCHHHHHHHHHHHHHHHHHHHHHHHhhhhh-----cC--CC---ccEEeecCcccccC Confidence 999965543221 111233432 222333444445566666777777531 11 11 1367775422 333 Q ss_pred H Q lcl|NC_021302. 386 D 386 (484) Q Consensus 386 ~ 386 (484) . T Consensus 219 ~ 219 (219) T protein:vir:98 219 N 219 (219) T ss_pred C Confidence 3 No 146 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=98.04 E-value=6.1e-06 Score=49.16 Aligned_cols=429 Identities=12% Similarity=0.066 Sum_probs=153.4 Q ss_pred CCCCCCCccceee--eecccccchhhhhhhcccccccccccccchHHHHHHHHhcc-hHHHHHHHHHHHHh-hCCCcEEe Q lcl|NC_021302. 1 MAPKTVAPRTERG--YVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREE-ARIASVLRAIGLPI-RRTDWRIR 76 (484) Q Consensus 1 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D-~~v~s~l~~r~~~v-~~~~~~v~ 76 (484) |+--.+.-+..-. .+..+.. .+. .... |..+.-+.|+ -. .+ .++...+......+ ...+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~-------~~~--~~~~--rl~~l~~Yy~-G~-~~i~~~~~~~~~~~~~~~~~~n~--- 64 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLN-------LFT--ERTQ--DLGDNTAYYE-SE-RRPDAVGVTVPQQMQKLLAHVGY--- 64 (484) T ss_pred CCCcccccCCCCHHHHHHHHHH-------HHH--HHHH--HHHHHHHHHh-cc-ccchhcccccchhHHhhhhhcCc--- Confidence 2211111110000 0000000 000 0000 0000011111 00 00 00000000000000 00011 Q ss_pred cCCCCHHHHHHHHHHHHhh----hccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeee-----ee Q lcl|NC_021302. 77 PNGARPEVVEHVAACLGLP----VEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFW-----LK 146 (484) Q Consensus 77 p~~~~~e~~~~~~~~l~~~----~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~-----~~ 146 (484) ...+++..++.|... -..++.+....+....-+|+....+++ +|..||.| +++||.-.+|... .. T Consensus 65 ----~~~ivd~~~~~l~~~g~~~~~~~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a-~~~v~~~~~~~~~~~~~~~~ 139 (484) T protein:vir:77 65 ----PRLYIDAIAARQELEGFRLGGADKADEQLWDWWQANDLDIESTLGHTDSLVHGRS-YITISKPDPNIDPGVDPEVP 139 (484) T ss_pred ----HHHHHHHHHhhhccCceecCCcchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCce-EEEEecCCCCcccccccccc Confidence 011222222211000 011111223333444456877777764 79999995 6778866554321 11 Q ss_pred eeeeeCccceee---------------eeecCCCceeeeeccccccccc----ccceeccCCCCcccccccceEEEeecC Q lcl|NC_021302. 147 RLAPRPQSSIAY---------------WNVDRDGGLISIQQWPAGTFGG----PGMVVMAPNSMGPAIPVEQLVVYTHDM 207 (484) Q Consensus 147 ~l~~r~~~~~~~---------------~~~~~dg~l~~~~q~~~~~~~~----~~~~~~~~~~~~~~lp~~k~l~~~~~~ 207 (484) .|...+|+.+.. ++.+.++......-+..+.... ...+. .......++..--++.|+++. T Consensus 140 ~i~~~~p~~~~~~~D~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~-~~~~~~~~~g~vPvv~f~N~~ 218 (484) T protein:vir:77 140 IIRVEPPTNLYAQIDPRTRQVMRAIRAIEDEEGNEVIGATLYLPNNTVIWNREDGQWV-QVANVAHNLEMVPVIPIPNRT 218 (484) T ss_pred eEEEeccceeEEEecCCCCceEEEEEEEEeecCCcEEEEEEEecCeEEEEEecCCceE-eeccccCCCCCcceEEecccc Confidence 234444444321 1111112211111111110000 00000 001112223333467788888 Q ss_pred ccCccccchhHHHHHHHHH-HHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCc Q lcl|NC_021302. 208 DPGVWTGNSLLRPAYKNWK-LKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGE 286 (484) Q Consensus 208 ~~~~p~G~gll~~~~~~~~-~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~ 286 (484) +.+.|+|.|-+.......+ -=...+..++..+|-|.+++.+++|-.......+. ..-...+.. .......+| +. T Consensus 219 ~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~-~~~~~~~~~---~~~~~~~~~-~~ 293 (484) T protein:vir:77 219 RLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEELGVDP-ETGQTLFDA---YLARILAFE-DH 293 (484) T ss_pred ccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhcccc-cccchhhhh---hhhhhcccC-CC Confidence 8899999887754222222 22456667777888777766666653222111111 111111111 111233444 33 Q ss_pred eEEEecccC-CchhHHHHHHHHHHHHHHHHhhh--hhcccccc-cc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 287 EAGILSPNG-TPLDPRRAIEYHDHQMALVALAH--FLNLDGKG-GS-YALASVQADTFVQSVQTVADEIRDVAQAHVVED 361 (484) Q Consensus 287 ~ie~~~~~~-~~~~~~~li~~~d~~Isk~ilGq--tlt~~~~g-Gs-~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~ 361 (484) +.++.+... +...|...++.+-.+|+...--. .+...+.+ +| -|+ .....-+...++.-.+.+...+. ++++. T Consensus 294 ~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al-~~~~~~l~~ka~~k~~~f~~~l~-~~~~l 371 (484) T protein:vir:77 294 ESKAQQFSAAELRNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAI-RSSESRLVKTVERKNKIFGGAWE-QAMRV 371 (484) T ss_pred CceeEeecCCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHH-HHHHHHHHHHHHHHHHHHHHHHH-HHHHH Confidence 444544332 22345555555555554432111 11111111 11 121 22233333444445556666664 35665 Q ss_pred HHHhCCCCcc----ccceEEecC-CCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcc--c-ccccCCC Q lcl|NC_021302. 362 IVDVNWGEDE----PAPLLVFDE-IGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDAD--D-DESTADT 433 (484) Q Consensus 362 l~~~Nf~~~~----~~P~~~~~~-~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~--~-~~~~~~~ 433 (484) ++.+--+... ..-.++|.. ......+.++++.+|++.|.-+. +.+.+.+.+|+-.....+- . .+..+.. T Consensus 372 ~~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~---s~et~~~~l~~~~~~~~e~~~~~~ee~~~~ 448 (484) T protein:vir:77 372 AYKVMNGGDIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVI---PKERARIDMGYSITEREEMRKWDEEEQAQG 448 (484) T ss_pred HHHHhCCCCcccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCC---CHHHHHhcCCCChhHHHHHHHHHHHHHHHH Confidence 5544222111 112556754 34567888999999999885322 4567888888743321110 0 0000000 Q ss_pred -------cCCCccc-cCCCCccccccccccccccccccccccchHHH Q lcl|NC_021302. 434 -------GQDEPET-DEPALPNTSGTTSTTNAPQARKRPRGRSPRDR 472 (484) Q Consensus 434 -------~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (484) ....++. ..+..++ ...+.+.+..++ .+ T Consensus 449 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~----------~~ 484 (484) T protein:vir:77 449 LGLMGTMFGTDPSGGGNPDNPE-TPEPQPNPAEEA----------AA 484 (484) T ss_pred HHHHhhhccccccCCCCCCCCC-cccccCCCcccc----------CC Confidence 0000000 0000000 000011111111 11 No 147 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=98.03 E-value=6.6e-06 Score=49.00 Aligned_cols=409 Identities=8% Similarity=-0.014 Sum_probs=172.8 Q ss_pred CCCCCCCccceee--eeccccc-chhhhhhhcccccccccccccchHHHHHH---HH-h-------------------cc Q lcl|NC_021302. 1 MAPKTVAPRTERG--YVNPLAG-FGTFLAQGLDQFEQVDELRWPNSVYTYTR---MC-R-------------------EE 54 (484) Q Consensus 1 ~~~~~~~~~~~~~--~~~~~~~-~~~~~~~~~~~~~~~~~lr~~~~~~~y~~---m~-~-------------------~D 54 (484) |.|..+..+...- .+.+... .-...+.-+.........|..+..+.|+- +. + .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~ 80 (472) T protein:vir:93 1 MYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMIT 80 (472) T ss_pred CCCCCCcchhhhhceeeecCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhcccccccccccccccc Confidence 9999988874321 1111101 10111111100000000011111222211 00 0 01 Q ss_pred hHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeee Q lcl|NC_021302. 55 ARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFE 133 (484) Q Consensus 55 ~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~E 133 (484) +...-++.+....+.+-+..+.. +++++.+++.+. +. -+|++.+.++ .++..||. +++ T Consensus 81 n~~~~ivd~~~~~l~g~~~~~~~--~d~~~~~~l~~~-----------------~~-n~~~~~~~~~~~~~~~~G~-~~~ 139 (472) T protein:vir:93 81 NFHANLVDQKVSYIVGKPIAFKH--TDDEVVKRIDEV-----------------LG-NRFDDKLHSVLTGASNKGI-EWL 139 (472) T ss_pred chHHHHHHHHhhhhcccCeeecc--CChHHHHHHHHH-----------------Hh-ccHHHHHHHHHHHHhhcCe-EEE Confidence 22333333444444444444432 223333333222 12 2566766665 57899998 467 Q ss_pred EEEeecCCeeeeeeeeeeCccceee-eeecCCCceeeeecc-cccccc------cccceec--cC--------------- Q lcl|NC_021302. 134 QTYFYEGGRFWLKRLAPRPQSSIAY-WNVDRDGGLISIQQW-PAGTFG------GPGMVVM--AP--------------- 188 (484) Q Consensus 134 ivw~~~~g~~~~~~l~~r~~~~~~~-~~~~~dg~l~~~~q~-~~~~~~------~~~~~~~--~~--------------- 188 (484) ++|.-.+|... +...+|+.+.. |.....+.++..... ...... ......+ .. T Consensus 140 ~v~~d~d~~~~---i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 216 (472) T protein:vir:93 140 HPYLDEEGEFK---LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSK 216 (472) T ss_pred EEEECCCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEEeecceeEEEEecCeEEEEEEecCeeeecccccccccc Confidence 88876666653 44445555422 221222333221110 000000 0000000 00 Q ss_pred -CCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHH Q lcl|NC_021302. 189 -NSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLE 267 (484) Q Consensus 189 -~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~ 267 (484) .....++..--++.|+ +|++|.|.+..+-...---...+..++.-++.|. .|+++++-....+. ..... T Consensus 217 ~~~~~~~~~~vPvv~~~-----nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~--~~~~~~~g~~~~~~---~~~~~ 286 (472) T protein:vir:93 217 THFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSN--ELTYVLTNYDDQEL---PEFKR 286 (472) T ss_pred cccccCCCCCcceEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhc--CceeEeecCCcccc---hhhHH Confidence 0000111111123222 3678999998765444455666777777778664 45555432222221 22222 Q ss_pred HHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH-HH--HHHHHHHHH Q lcl|NC_021302. 268 IASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALAS-VQ--ADTFVQSVQ 344 (484) Q Consensus 268 ~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~e-vh--~~v~~~~~~ 344 (484) .+. .. ..+.++.+.+++++........+..+++.+.+.|...--...++.++-+|.- .|. .+ ..-....+. T Consensus 287 ~~~---~~--~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~-Sg~Al~~~~~~l~~ka~ 360 (472) T protein:vir:93 287 LLR---YY--GAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAP-SGVALEFLYTNLNLKAD 360 (472) T ss_pred HHh---hc--cccccCCCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCc-hHHHHHHHHHHHHHHHH Confidence 222 22 3456789999999887666778999999999988887544444444333211 121 11 122233334 Q ss_pred HHHHHHHHHHHHHHHHHHHHhC-CCCccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCCC Q lcl|NC_021302. 345 TVADEIRDVAQAHVVEDIVDVN-WGEDEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGPD 421 (484) Q Consensus 345 aD~~~i~~~ln~qli~~l~~~N-f~~~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p~ 421 (484) .-.+.+...+ +++++.++.+. ......-..+.|. ..+.+..+.++++.+|+ |+ + +.+.+.+.++. +.+. T Consensus 361 ~~~~~~~~~l-~~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~~~~~~~~k~~--gi-i----s~et~l~~l~~~~d~~ 432 (472) T protein:vir:93 361 KLARKAKVAI-QELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSM--GI-V----SHETVLENHPFVEDLQ 432 (472) T ss_pred HHHHHHHHHH-HHHHHHHHHHhCCCcccceeeEEeCCCCCCCHHHHHHHHHHHh--cc-C----chHHHHHhCCCCCCHH Confidence 4445566666 34666666653 2222122255665 34567888888888874 64 2 45566666653 3222 Q ss_pred CCcccccc----cCCCcCCCccccCCCCccccccccccccccccc Q lcl|NC_021302. 422 PDADDDES----TADTGQDEPETDEPALPNTSGTTSTTNAPQARK 462 (484) Q Consensus 422 ~~e~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 462 (484) .+-+.... ........ ........... ..+..+... T Consensus 433 ~E~~ri~~E~~~~~~~~~~~--~~~~~d~~~~~---~~~~~~~~e 472 (472) T protein:vir:93 433 AELERIEQEQMEYNKQLPNL--DDGGADGAQQQ---ERSNNKESE 472 (472) T ss_pred HHHHHHHHHHHHHHHhccCc--CcccCCCCCCC---CCCCcccCC Confidence 11000000 00000000 00000000000 000000000 No 148 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=97.98 E-value=8.3e-06 Score=48.43 Aligned_cols=400 Identities=8% Similarity=-0.056 Sum_probs=168.3 Q ss_pred eeccccc-----ch--hhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC-CHHHH Q lcl|NC_021302. 14 YVNPLAG-----FG--TFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA-RPEVV 85 (484) Q Consensus 14 ~~~~~~~-----~~--~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~-~~e~~ 85 (484) -+..... +. ..++.|-.........+..+....+ .+ ......-++.+....+.+.+-.+...+. +.+.. T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~-ki--~~n~~~~ivd~~~~~l~g~~~~~~~~~~~~~~~~ 77 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQGDNFSILSGHRRLDDEKADY-RV--RHKWGGYISSFATGYVIGNPVSIGVMEGGSADQL 77 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhccCCcccccccccccccCCcc-ee--ecchHHHHHHhhhhheeccCceEeeCCCccHHHH Confidence 0000000 00 0011110000000000000000000 11 1233344555555556666656654332 22222 Q ss_pred HHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecCC Q lcl|NC_021302. 86 EHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRD 164 (484) Q Consensus 86 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~d 164 (484) +.+.+. ...-+|+.....+ .++..||.+ ++++|.-.+|... +...+|+.+. ..+++. T Consensus 78 ~~l~~~-----------------~~~n~~~~~~~~~~~~~~~~G~a-~~~~~~d~~~~~~---i~~~~p~~~~-~~~d~~ 135 (440) T protein:vir:95 78 STIKDI-----------------EWQNDINALNSDLAFDASVYGRA-YEYHFRDKDKVDR---VVLISPLEMF-VIRDLT 135 (440) T ss_pred HHHHHH-----------------HHhcCHhHHHHHHHHHHhhcCeE-EEEEEecCCCceE---EEEEcccceE-EEEcCC Confidence 222222 2233567666555 578999996 5666765666543 3344454432 222322 Q ss_pred --Cceeeeec-cccccc------ccccceecc-----------CCCCcccccccceEEEeecCccCccccchhHHHHHHH Q lcl|NC_021302. 165 --GGLISIQQ-WPAGTF------GGPGMVVMA-----------PNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKN 224 (484) Q Consensus 165 --g~l~~~~q-~~~~~~------~~~~~~~~~-----------~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~ 224 (484) +.++.... +..... .......+. ......++..--++.|+ +|..|.|.+..+... T Consensus 136 ~~~~~~~~i~~~~~~~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~l 210 (440) T protein:vir:95 136 VEQNIIAAVHLPIYADKVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVEWW-----NNRFRMGDYESEISL 210 (440) T ss_pred CCCceEEEEEEEEecCceEEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEEEee-----CCCCCCCchhhhHHH Confidence 22221111 000000 000000000 00000111111223332 356788998887766 Q ss_pred HHHHHHHHHHHHHHHHHhcCCcceEEecCC-CCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHH Q lcl|NC_021302. 225 WKLKDELIRIEAAAIRRHGIGVPYLKGNEA-DSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRA 303 (484) Q Consensus 225 ~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~-~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~l 303 (484) .---...+..++..++.|..++.++.|... ...++++...+.+...-+............+.+++++........+... T Consensus 211 ida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~ 290 (440) T protein:vir:95 211 IDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLFLKTGISTTGQQTTADASYIYKQYDVNGTEAY 290 (440) T ss_pred HHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCccchhhhhhccceecccccccccCCCCcceeEEeecCCHHHHHHH Confidence 666677778888889988776667766422 2223343443333211111110111123455678888776666778999 Q ss_pred HHHHHHHHHHHHhhhhhcccccccchhhHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHh---CCCCc--cccc Q lcl|NC_021302. 304 IEYHDHQMALVALAHFLNLDGKGGSYALASVQADT----FVQSVQTVADEIRDVAQAHVVEDIVDV---NWGED--EPAP 374 (484) Q Consensus 304 i~~~d~~Isk~ilGqtlt~~~~gGs~A~~evh~~v----~~~~~~aD~~~i~~~ln~qli~~l~~~---Nf~~~--~~~P 374 (484) ++.+.+.|...--...++.++-+|. ..| +..+. ....+..-.+.+...+. ++++.++.+ ..+.. ..-. T Consensus 291 ~~~l~~~i~~~s~~p~~~~~~~~~n-~Sg-~Al~~~~~~l~~k~~~k~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~v 367 (440) T protein:vir:95 291 KNRLANDIHRFSRIPNLDDDRFNST-SSG-IALLYKMIGLEQVRKDKETYFTKALR-RRYELISNIHKAINGPVIEANKL 367 (440) T ss_pred HHHHHHHHHHHhCCccccccccccc-chH-HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhcCCcccccccc Confidence 9999999988765555555432221 112 22222 22223333445555553 355554443 22221 1234 Q ss_pred eEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccccc Q lcl|NC_021302. 375 LLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTS 453 (484) Q Consensus 375 ~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (484) .+.|. ....+..+.++++.+|+ |+ + +.+.+.+.++.-.+..+...................+ ...+.+ T Consensus 368 ~i~f~~~~p~~~~~~ad~~~kl~--g~-i----S~et~~~~l~~~d~~~E~~ri~~E~~~~~~~~~~~~~---~~~~~~- 436 (440) T protein:vir:95 368 TFTFHPNIPQDVWTEIKAYIEAG--GE-I----SQETLMENASFTDYKTEHSRILKQGGSSDLEIGQIVG---DADVGQ- 436 (440) T ss_pred eEEeCCCCCCCHHHHHHHHHHHh--cc-C----cHHHHHHhCCCCCcHHHHHHHHHHHHHhhhhHHhhcc---CCCCCC- Confidence 67775 45677888999999984 54 2 3556667766532211111111100000000000000 000000 Q ss_pred cccccc Q lcl|NC_021302. 454 TTNAPQ 459 (484) Q Consensus 454 ~~~~~~ 459 (484) ..+| T Consensus 437 --~~~e 440 (440) T protein:vir:95 437 --ADTE 440 (440) T ss_pred --cCCC Confidence 0001 No 149 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=97.83 E-value=1.6e-05 Score=46.90 Aligned_cols=435 Identities=15% Similarity=0.102 Sum_probs=166.7 Q ss_pred CCCCCCCccceee-------eeccc-ccchhhh--------------hhhcccccccc-cccccchHHHHHHHHh---cc Q lcl|NC_021302. 1 MAPKTVAPRTERG-------YVNPL-AGFGTFL--------------AQGLDQFEQVD-ELRWPNSVYTYTRMCR---EE 54 (484) Q Consensus 1 ~~~~~~~~~~~~~-------~~~~~-~~~~~~~--------------~~~~~~~~~~~-~lr~~~~~~~y~~m~~---~D 54 (484) ++|-++-|++-.+ +.++. ..+-..| ..+-.....+. +-...+.-+-|+.=+. .- T Consensus 17 ~~~~~~~~~~~~~~~m~dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~ 96 (535) T protein:vir:80 17 LIPPQAPPTSGLGPSLPNVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFY 96 (535) T ss_pred ccCCCCcCCCCCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCC Confidence 4444433443222 22221 0000000 00000011100 0000111223433322 23 Q ss_pred hHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeee Q lcl|NC_021302. 55 ARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFE 133 (484) Q Consensus 55 ~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~E 133 (484) +++...++.....|.+.+..++ .++..+.+.+++. ..+.+++.+++.+. .++.||.+.+= T Consensus 97 n~~~~tl~~l~G~vfrk~p~~~----~p~~l~~l~~d~D---------------~~G~~L~~f~~~~~~~~l~~G~~~iL 157 (535) T protein:vir:80 97 NVTARTLDGMMGQVFSRDPIRQ----LPPALEAIVEDID---------------GEGVSLDQQAKKALGYTMGFGRAAIF 157 (535) T ss_pred ChhHHHHHHHhchhhcCCccee----ccHHHHHHHhccC---------------CCCCCHHHHHHHHHHHHHhcCeEEEE Confidence 4555555555555555554442 2222222222221 13457888888886 57789988665 Q ss_pred EEEeecCCeee---------eeeeeeeCccceeeeeecCCCc-----eeeeeccc---ccccc----------------c Q lcl|NC_021302. 134 QTYFYEGGRFW---------LKRLAPRPQSSIAYWNVDRDGG-----LISIQQWP---AGTFG----------------G 180 (484) Q Consensus 134 ivw~~~~g~~~---------~~~l~~r~~~~~~~~~~~~dg~-----l~~~~q~~---~~~~~----------------~ 180 (484) +.|-..++... .-.|..+.+..|-=|+++..++ ++.++... .+.++ . T Consensus 158 VD~P~~~~~~t~ade~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~~~G~ 237 (535) T protein:vir:80 158 TDYPNVGRPVTVLEQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLNAEGN 237 (535) T ss_pred EeecCCCCcccHHHHHhcCCCcEEEEechhhccCccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEecCCce Confidence 55532211100 0112222222222222222211 00110000 00000 0 Q ss_pred ccceec----------------cCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021302. 181 PGMVVM----------------APNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGI 244 (484) Q Consensus 181 ~~~~~~----------------~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~ 244 (484) .....+ .....+..+..=-|+++... ..+--.+...|..++..-+---....+.-..+-.-++ T Consensus 238 y~v~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~-~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~il~~~~~ 316 (535) T protein:vir:80 238 YQVERWRRETQEEMYYSYSKHVPTDGNGNPFKEIPFQFIGPL-DNNADIDHPPLLDLCEVNIGHYRNSADYEEMAFVAGQ 316 (535) T ss_pred EEEEEEEeecCCccccccceeecccCCCcccCeeEEEEeecC-CCCCCCCccchHHHHHHHHHHhhchhHHHHHHHHhcC Confidence 000000 00111222222234433222 2233345555656654433221222222222333256 Q ss_pred CcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhccc- Q lcl|NC_021302. 245 GVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLD- 323 (484) Q Consensus 245 G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~- 323 (484) |+|++.|..... .++. .+ -..+.-|.++++.+|++.+..+++.++++.... .++....+|..+ .+..+... T Consensus 317 P~l~i~G~~~~~--~~~~---~~-~~~i~iG~~~~~~lP~~~~~~~~e~~~~~~a~~-~l~~~e~qM~~l-Ga~ll~~~~ 388 (535) T protein:vir:80 317 PTAFFTGLTKDW--VEDV---FK-DFKVHLGSRAIIPLPQGATAGILQITPNSVPFE-AMTHKESQMIAM-GANLLVKSG 388 (535) T ss_pred ceeeeecCchhh--hhcC---CC-CcceEecCcccccCCCCCCcceeeeccchhHHH-HHHHHHHHHHHH-HHHhhccCc Confidence 777776643111 1100 00 011344777888999999999999888766543 345555555543 22333322 Q ss_pred -ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-CccccceEEecC--CCCc-HHHHHHHHHHHHhc Q lcl|NC_021302. 324 -GKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWG-EDEPAPLLVFDE--IGSR-QDATAAALQMLVNA 398 (484) Q Consensus 324 -~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~-~~~~~P~~~~~~--~~~~-~~~~ae~~~~L~~~ 398 (484) +.+.+.+. .........+.+-+..+++.++ ++++++.++-.. .++.-+.|++.. ...+ .....+++.++.+. T Consensus 389 ~~~Ta~~a~--~~~~~~~S~L~~~a~~le~al~-~aL~~~A~w~G~~~~~~~~~i~~n~dF~~~~ld~~~~~all~~~~~ 465 (535) T protein:vir:80 389 GNRTFGEAQ--QEEASEQSILSACTKNVSMAFR-KALRWANQFQTGIVNDETVEYNLNTDFPAARLTPNERAELILEWQQ 465 (535) T ss_pred ccccHHHHH--HHHHHHhHHHHHHHHHHHHHHH-HHHHHHHHHcCCccCCCceEEEeccccccccCCHHHHHHHHHHHhc Confidence 12222222 2233335667888899999996 599998888532 122334555432 1111 22334556677778 Q ss_pred CcccCCcccHHHHHHHhCCCCCCC-Cccccccc-CCCcCCCccccCCCCccccccccccccccccccccccchHHHhcCc Q lcl|NC_021302. 399 GLLTPDPRLEAFLRDAAGLPGPDP-DADDDEST-ADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSPRDRRKTP 476 (484) Q Consensus 399 G~~~~~~~~~~~i~e~~glp~p~~-~e~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 476 (484) |.+-. ..--.++ ++.|+..|.. .+++.... ..........+.+..+..+|.+......+. .-+|++. T Consensus 466 G~Is~-et~~~~L-~r~gvl~~~~~~eee~~ri~~E~~~~~~~~g~~~d~~~~g~~~~~~~~~~---------~~~~~~~ 534 (535) T protein:vir:80 466 GAITF-KEMRAGL-RRAGVASEDDAKAETEGKATVEFIAKTAAAGKVGDAASGGTNKAKLNNGN---------GGGNQAG 534 (535) T ss_pred CCCCH-HHHHHHH-HhCCCCCcccchHHHHHHHHhhhhhccccCCCCCCCCCCCCCcCcccCCc---------cccccCC Confidence 86432 1111223 4557755432 11111100 000000111111111222222211111111 1111111 Q ss_pred c Q lcl|NC_021302. 477 D 477 (484) Q Consensus 477 ~ 477 (484) . T Consensus 535 ~ 535 (535) T protein:vir:80 535 N 535 (535) T ss_pred C Confidence 1 No 150 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=97.81 E-value=1.8e-05 Score=46.62 Aligned_cols=419 Identities=13% Similarity=0.032 Sum_probs=158.0 Q ss_pred hhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCc------EEecC--------CCCHHHHHH Q lcl|NC_021302. 22 GTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDW------RIRPN--------GARPEVVEH 87 (484) Q Consensus 22 ~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~------~v~p~--------~~~~e~~~~ 87 (484) -+.-+.++...+.....+ .. |.+... .|+ .-+.++.+--.+... .+.+. +=...+++. T Consensus 1 ~~~~i~~~~~~~~~~~~~-~~---L~~~~~---~~~-~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~ 72 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIAR-DE---MVSAFE---DQN-QNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRLYVDS 72 (485) T ss_pred CCCCCCCCCcccchHHHH-HH---HHHHHH---HHH-HHHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHHHHHH Confidence 011111111111111000 00 000000 000 111111111111100 00000 001223333 Q ss_pred HHHHHHhh----hccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeee-----eeeeeeCcccee Q lcl|NC_021302. 88 VAACLGLP----VEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWL-----KRLAPRPQSSIA 157 (484) Q Consensus 88 ~~~~l~~~----~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~-----~~l~~r~~~~~~ 157 (484) .++.|... -..++.+..+.+....-+|+.+..++. ++..||.| ++++|.-+++.... ..|...+|+.+. T Consensus 73 ~~~~l~~~g~~~~~~~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~a-y~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~ 151 (485) T protein:vir:24 73 IAERQAVEGFRLGDADEADEELWQWWQANNLDIEAPLGYTDAYVHGRS-YITISRPDPQIDLGWDPNVPLIRVEPPTRMY 151 (485) T ss_pred HhhhhccCceecCCCchhHHHHHHHHHhcChhHHHHHHHHHHhhcCce-EEEEecCCcccccccCCCcceEEEeccceeE Confidence 33222100 011122233444444456877777764 78999997 77888654332110 123444444331 Q ss_pred eeeecC----------------CCceeeeecccccccccc---cceeccCCCCcccccccceEEEeecCccCccccchhH Q lcl|NC_021302. 158 YWNVDR----------------DGGLISIQQWPAGTFGGP---GMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLL 218 (484) Q Consensus 158 ~~~~~~----------------dg~l~~~~q~~~~~~~~~---~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll 218 (484) ..++. ++......-+..+..-.. ..-.........+++.--++.|+++.+.+.|+|.|-+ T Consensus 152 -~i~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i 230 (485) T protein:vir:24 152 -AEIDPRIGRPAKAIRVAYDAEGNEIQAATLYTPNETFGWFRAEGEWVEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSEI 230 (485) T ss_pred -EEeeCCcCceeEEEEEEEeecCCeEEEEEEEcCCcEEEEEecCCceEeecccccCCCcccEEEeccCcccCCcCCcccc Confidence 11221 111111111111100000 0000000112233444456778888888889999877 Q ss_pred HHHHHHH-HHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCc Q lcl|NC_021302. 219 RPAYKNW-KLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTP 297 (484) Q Consensus 219 ~~~~~~~-~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~ 297 (484) ....... =.-...+...+...+-|.+++.++.|-.......++. .-...+. +..+ +...+| +.+.++.+... T Consensus 231 ~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-~~~~~~~-~~~~--~i~~~~-~~~~~~~q~~~-- 303 (485) T protein:vir:24 231 TPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPE-TGQTLFD-AYLA--RILAFE-DAEGKIQQFSA-- 303 (485) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCccccccccc-cccchhh-hccc--ceeccC-CCCceEEeecc-- Confidence 6422222 2234556677778888877777777643221111110 0011111 1112 223333 44555554332 Q ss_pred hhHHHHHHHHHHHHHHHHhhhhhcccccccc---hhhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCC- Q lcl|NC_021302. 298 LDPRRAIEYHDHQMALVALAHFLNLDGKGGS---YALA---SVQADTFVQSVQTVADEIRDVAQAHVVEDIVDV-NWGE- 369 (484) Q Consensus 298 ~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs---~A~~---evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~-Nf~~- 369 (484) .+.+.++++++.-|.+.--...++...=||+ .+.| .....-....++.-.+.+...|+ ++++-++.+ |... T Consensus 304 ~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~-~~~~l~~~~~~~~~~ 382 (485) T protein:vir:24 304 AELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIFGGAWE-EAMRLAYRLMKGGDV 382 (485) T ss_pred cchHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCCC Confidence 2234445555544444321111221111111 1112 22333444444555566666775 366655554 3221 Q ss_pred --ccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCccc--c-cccCCC--------cC Q lcl|NC_021302. 370 --DEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADD--D-ESTADT--------GQ 435 (484) Q Consensus 370 --~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~--~-~~~~~~--------~~ 435 (484) +..-.+++|. .....+.+.++.+.+|+..|.. .++.+.+++.+|+......+-. . ...+.. .+ T Consensus 383 ~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~---~~s~et~~~~l~~~~d~~~e~~~~~ee~~~~~~~~~~~~~~~ 459 (485) T protein:vir:24 383 PPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQG---VIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLLGTMVDA 459 (485) T ss_pred ccccceeeEEecCCCCCCHHHHHHHHHHHHhcccc---cCCHHHHHhhCCCCHhHHHHHHHHHHHHhhhhhhHHHhhccc Confidence 1122366774 3446778889999999988742 2356778888888533211100 0 000000 00 Q ss_pred CCccccCCCCccccccccccccccccccccccchH Q lcl|NC_021302. 436 DEPETDEPALPNTSGTTSTTNAPQARKRPRGRSPR 470 (484) Q Consensus 436 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (484) ....++.+...+.. ..+.+.. +.+.+ T Consensus 460 ~~~~~~~~~~~e~~-------~~~~~~~--~~~~a 485 (485) T protein:vir:24 460 DPTVPGSPNPTPAP-------KPQPAIE--GGDSA 485 (485) T ss_pred CCCCCCCCCCCCCC-------CCccCCC--CCCCC Confidence 00001111111101 1111111 11111 No 151 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=97.75 E-value=2.2e-05 Score=46.12 Aligned_cols=420 Identities=16% Similarity=0.072 Sum_probs=158.4 Q ss_pred ccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHH---HHHHhhCCCcEEecCCCCHHH Q lcl|NC_021302. 8 PRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRA---IGLPIRRTDWRIRPNGARPEV 84 (484) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~---r~~~v~~~~~~v~p~~~~~e~ 84 (484) -+|+.-.+..+.. .-.....|..+.-+.|+ -.....+....+.. ..+.+ .+| ...+ T Consensus 1 ~~t~~d~i~~L~~-----------~~~~~~~r~~~~~~Yy~-G~~~i~~~~~~~~~~~~~~~~~--~n~-------~~~i 59 (480) T protein:vir:78 1 MTTYHEHVERLQG-----------LLARDLPNLLEAEAYRN-GTRRLKTIGIGAPPELAYLDVQ--PGW-------VATY 59 (480) T ss_pred CCCHHHHHHHHHH-----------HHHHHHHHHHHHHHHHh-ccccchhcccccchhhhhhhhh--cch-------HHHH Confidence 1111111111110 00000001111111111 10000000000000 00000 011 1112 Q ss_pred HHHHHHHHHhh----hccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEee------cCCeeeeeeeeeeCc Q lcl|NC_021302. 85 VEHVAACLGLP----VEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFY------EGGRFWLKRLAPRPQ 153 (484) Q Consensus 85 ~~~~~~~l~~~----~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~------~~g~~~~~~l~~r~~ 153 (484) ++..+..|... -.+++......+....-+|+..+..+ .++..||.| +++||.- .+|... |..++| T Consensus 60 vd~~~~~l~~~g~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~~~G~a-y~~v~~~~~~~~d~~~~~~---i~~~~p 135 (480) T protein:vir:78 60 LRTLSDRLDIEGFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPL---IRVESP 135 (480) T ss_pred HHHHHhhhccCceecCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCce-EEEeecCccccCCCCCeeE---EEEEcc Confidence 22222211000 00111122233334444677777776 589999997 6778852 234333 334444 Q ss_pred cceeeeeecC--CCceeeeecccc---cc--------cccccceec------------cCCCCcccccccceEEEeecCc Q lcl|NC_021302. 154 SSIAYWNVDR--DGGLISIQQWPA---GT--------FGGPGMVVM------------APNSMGPAIPVEQLVVYTHDMD 208 (484) Q Consensus 154 ~~~~~~~~~~--dg~l~~~~q~~~---~~--------~~~~~~~~~------------~~~~~~~~lp~~k~l~~~~~~~ 208 (484) +.+. ..+|+ ++.+........ .. ......+.+ ........++.--++.|.++.+ T Consensus 136 ~~~~-~i~D~~~~~~~~~~i~~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~ 214 (480) T protein:vir:78 136 LYMY-AELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPR 214 (480) T ss_pred cceE-EEEcCCCccceEEEEEEEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEeecccc Confidence 4432 12221 122211110000 00 000000000 0011112233445677788888 Q ss_pred cCccccchhHHH-HHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCce Q lcl|NC_021302. 209 PGVWTGNSLLRP-AYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEE 287 (484) Q Consensus 209 ~~~p~G~gll~~-~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ 287 (484) .+.++|.|-+.. +-...=-=...+...+..+|.|.+++-++.|-......++.......+ + .+ ... ...+.+ T Consensus 215 ~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~---~-~~--~~~-~~~~~~ 287 (480) T protein:vir:78 215 LGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDI---Y-YG--RIL-TLASEA 287 (480) T ss_pred cCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhCCCccccccccccchhhh---h-hh--hhc-cCCCCC Confidence 889999887654 322222224455566677887776665666533222122111111111 1 11 112 223455 Q ss_pred EEEecccC-CchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 288 AGILSPNG-TPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALA---SVQADTFVQSVQTVADEIRDVAQAHVVEDIV 363 (484) Q Consensus 288 ie~~~~~~-~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~---evh~~v~~~~~~aD~~~i~~~ln~qli~~l~ 363 (484) .++.+... +...|...++.+-.+|+..---..-..++.+.+.+.| .....-....++.-.+.+...|.+ +++-++ T Consensus 288 ~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~-~~rl~~ 366 (480) T protein:vir:78 288 AKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWER-AMRIAM 366 (480) T ss_pred ceEEecCccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH Confidence 66666443 2344555555555555543111000111111111112 222333444445555666666654 666666 Q ss_pred HhCCCCc---cccceEEecC-CCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCccc------ccc---- Q lcl|NC_021302. 364 DVNWGED---EPAPLLVFDE-IGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADD------DES---- 429 (484) Q Consensus 364 ~~Nf~~~---~~~P~~~~~~-~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~------~~~---- 429 (484) .++-... ..--.++|.. ....+.+.++.+.+|+.+|.. ..+.+.+++.+|+......+-. ... T Consensus 367 ~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~---~~s~et~~~~lg~~~d~~~e~~~~~~~~~~~~~~~ 443 (480) T protein:vir:78 367 QIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQG---PIPKEQARIDLGYTATQREQMRDWDKQETEDMIDT 443 (480) T ss_pred HHcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhccc---CCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHH Confidence 6653221 1123566743 345677889999999988853 2356788999998643211100 000 Q ss_pred --cCCCcCCCccccCCCCccccccccccccccccccccccchHH Q lcl|NC_021302. 430 --TADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSPRD 471 (484) Q Consensus 430 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (484) .+..+.+.+. +.+....+. ...+.+....+|...- T Consensus 444 ~~~~~~~~~~~~---~~~~~~~~~----~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 444 LYSTTKAQADAT---PKPTVTETK----TETQTSPSGFNRTKTR 480 (480) T ss_pred hhccccCCCccc---cCCCCCCCC----CccCCCcccCCCcCCC Confidence 0000000000 000000111 1111112222222111 No 152 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=97.74 E-value=2.3e-05 Score=45.98 Aligned_cols=425 Identities=15% Similarity=0.065 Sum_probs=158.7 Q ss_pred ccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcc-hHHHHHHHH---HHHHhhCCCcEEecCCCCHH Q lcl|NC_021302. 8 PRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREE-ARIASVLRA---IGLPIRRTDWRIRPNGARPE 83 (484) Q Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D-~~v~s~l~~---r~~~v~~~~~~v~p~~~~~e 83 (484) -.|+...+..+ ...-.....|..+.-+.|+ .. .+ .+....+.. ..+.++ +| ... T Consensus 1 ~~t~~~~i~~L-----------~~~~~~~~~r~~~l~~Yy~-G~-~~i~~~~~~~~~~~~~~~~~~--n~-------~~~ 58 (480) T protein:vir:78 1 MTTYHEHVERL-----------QGLLARDLPNLLEAEAYRN-GT-RRLKTIGIGAPPELAYLDVQP--GW-------VAT 58 (480) T ss_pred CCCHHHHHHHH-----------HHHHHHHHHHHHHHHHHHh-cc-ccccccccccchhHhhhhhhc--ch-------HHH Confidence 11111111111 1100000001111111121 10 00 000000000 000000 11 112 Q ss_pred HHHHHHHHHHhh-h---ccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeec------CCeeeeeeeeeeC Q lcl|NC_021302. 84 VVEHVAACLGLP-V---EGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYE------GGRFWLKRLAPRP 152 (484) Q Consensus 84 ~~~~~~~~l~~~-~---~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~------~g~~~~~~l~~r~ 152 (484) +++..+..|... + ..++......+....-+|+..+..+ .+|..||.| +++||.-. +|... +...+ T Consensus 59 ivd~~~~~l~~~g~~~~~d~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a-y~~v~~~~~~~~d~~g~~~---i~~~~ 134 (480) T protein:vir:78 59 YLRTLSDRLDIEGFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRS-YITVSHPDVESGDPAGIPL---IRVES 134 (480) T ss_pred HHHHHHhhhccCceecCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCce-EEEEecCccccCCCCCeeE---EEEEc Confidence 222222211000 0 0111222333344444677777776 589999985 67888521 33332 33444 Q ss_pred ccceeeeeecC--CCceeeeeccc-----cccc------ccccceec------------cCCCCcccccccceEEEeecC Q lcl|NC_021302. 153 QSSIAYWNVDR--DGGLISIQQWP-----AGTF------GGPGMVVM------------APNSMGPAIPVEQLVVYTHDM 207 (484) Q Consensus 153 ~~~~~~~~~~~--dg~l~~~~q~~-----~~~~------~~~~~~~~------------~~~~~~~~lp~~k~l~~~~~~ 207 (484) |..+. ..+|. .+++....... .+.. .......+ ........++.--++.|+++. T Consensus 135 p~~~~-~~~D~~~~~~~~~~i~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~ 213 (480) T protein:vir:78 135 PLYMY-AELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDP 213 (480) T ss_pred ccceE-EEEcCCCccceEEEEEEEEeecCCCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEEeeccc Confidence 44432 12221 22222111000 0000 00000000 001112234445567788888 Q ss_pred ccCccccchhHHH-HHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCc Q lcl|NC_021302. 208 DPGVWTGNSLLRP-AYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGE 286 (484) Q Consensus 208 ~~~~p~G~gll~~-~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~ 286 (484) +.+.|+|.|-+.. +-...=--...+..++...|.|.+++-++.|-......++..... +..+ .+ .... ..|. T Consensus 214 ~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~---~~~~-~~--~~~~-~~~~ 286 (480) T protein:vir:78 214 RLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT---LDIY-YG--RILT-LASE 286 (480) T ss_pred ccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhhcCCccccccccccch---hhhh-hh--hhcc-CCCC Confidence 8889999887764 322222235566677888888777665666543222222111111 1111 11 1122 2345 Q ss_pred eEEEecccC-CchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 287 EAGILSPNG-TPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALAS---VQADTFVQSVQTVADEIRDVAQAHVVEDI 362 (484) Q Consensus 287 ~ie~~~~~~-~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~e---vh~~v~~~~~~aD~~~i~~~ln~qli~~l 362 (484) +.++.+... +...|...++.+-.+|+....-..-..++.+.+.+.|+ ....-....++.-.+.+...|.+ +++-+ T Consensus 287 ~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~-~~~l~ 365 (480) T protein:vir:78 287 AAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWER-AMRIA 365 (480) T ss_pred CceEEecCccCHHHHHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Confidence 566666443 33456666666666665431111111111111112222 22233333334444555666643 66666 Q ss_pred HHhCCCC-cccc--ceEEecC-CCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccc--cc----cCC Q lcl|NC_021302. 363 VDVNWGE-DEPA--PLLVFDE-IGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDD--ES----TAD 432 (484) Q Consensus 363 ~~~Nf~~-~~~~--P~~~~~~-~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~--~~----~~~ 432 (484) +.+.-.. ...+ -.++|.. ....+.+.++.+.+|+.+|.. .++.+.+++.+|+...+..+-.. .. +.. T Consensus 366 ~~~~g~~~~~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~---~~s~et~~~~lg~~~d~~~~~~~~~~e~~~~~~~ 442 (480) T protein:vir:78 366 MQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQG---PIPKEQARIDLGYTATQREQMRDWDKQETEDMID 442 (480) T ss_pred HHHcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhccc---cCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHH Confidence 6665321 1111 2456643 335677888899999988742 23567788888885332111000 00 000 Q ss_pred CcCCCccccCCCCccccccccccccccccccccccchHHHhc Q lcl|NC_021302. 433 TGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSPRDRRK 474 (484) Q Consensus 433 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 474 (484) ......+......++.... ......+.+....+|+ ..| T Consensus 443 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~---~~~ 480 (480) T protein:vir:78 443 TLYSTTKAQADATPKPTVT-ETKTETQTSPSGFNRT---KTR 480 (480) T ss_pred HhhccccccCCCCCCCCCC-CCCCccccccCCCCcc---cCC Confidence 0000000000000000000 0000111111111121 111 No 153 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=97.70 E-value=2.7e-05 Score=45.64 Aligned_cols=423 Identities=14% Similarity=0.067 Sum_probs=149.8 Q ss_pred hhhhhhhcccccccccc-------------cccchHHHHHHHHhcc-hHHHHHHH-H--HHHHhhCCCcEEecCCCCHHH Q lcl|NC_021302. 22 GTFLAQGLDQFEQVDEL-------------RWPNSVYTYTRMCREE-ARIASVLR-A--IGLPIRRTDWRIRPNGARPEV 84 (484) Q Consensus 22 ~~~~~~~~~~~~~~~~l-------------r~~~~~~~y~~m~~~D-~~v~s~l~-~--r~~~v~~~~~~v~p~~~~~e~ 84 (484) -+..+.++...+....+ |..+.-+.|+ -. .+ .++...+. + +...+. +| ...+ T Consensus 1 ~~~~~~~~~e~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~-G~-~~i~~~~~~~~~~~~~~~~v~--n~-------~~~i 69 (486) T protein:vir:42 1 MTAPLPGMEEIEDPAVVREEMISAFEDASKDLASNTSYYD-AE-RRPEAIGVTVPREMQQLLAHV--GY-------PRLY 69 (486) T ss_pred CCCCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhc-cc-CcchhcccccchhHhhhhhcc--ch-------HHHH Confidence 01111122112211110 0000001111 00 00 00000000 0 000000 01 0112 Q ss_pred HHHHHHHHHhh---hc-cchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeee-----eeeeeeeCcc Q lcl|NC_021302. 85 VEHVAACLGLP---VE-GDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFW-----LKRLAPRPQS 154 (484) Q Consensus 85 ~~~~~~~l~~~---~~-~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~-----~~~l~~r~~~ 154 (484) ++..++.|... +. .++.+....+....-+|+....+++ +|..||.| +++||.-.++... ...+..++|+ T Consensus 70 Vd~~~~~l~~~g~~~~~~~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~a-y~~v~~~e~~~~~~~~~~~~~i~~~~p~ 148 (486) T protein:vir:42 70 VDSVAERQAVEGFRLGDADEADEELWQWWQANNLDIEAPLGYTDAYVHGRS-FITISKPDPQLDLGWDQNVPIIRVEPPT 148 (486) T ss_pred HHHHHhhhcccceecCCCchhHHHHHHHHHhcChhHHHHHHHHHHhhcCce-EEEEecCCcccccccCCCeeEEEEeccc Confidence 22222111000 00 0111122233333345777666664 79999997 7788864322211 0133344444 Q ss_pred ceee-e-------------eecCCCc-eeeeeccccccccc----ccceeccCCCCcccccccceEEEeecCccCccccc Q lcl|NC_021302. 155 SIAY-W-------------NVDRDGG-LISIQQWPAGTFGG----PGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTGN 215 (484) Q Consensus 155 ~~~~-~-------------~~~~dg~-l~~~~q~~~~~~~~----~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~ 215 (484) ++.- | ..+.+++ .....-+..+..-. ...+. ........++.--++.|+++.+.+.|+|. T Consensus 149 ~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~-~~~~~~h~~g~vPvv~~~n~~~~~~~~G~ 227 (486) T protein:vir:42 149 RMHAEIDPRINRVSKAIRVAYDKEGNEIQAATLYTPMETIGWFRADGEWA-EWFNVPHGLGVVPVVPLPNRTRLSDLYGT 227 (486) T ss_pred ceEEEEeCCCCCeEEEEEEEEecCCCeEEEEEEEcCCcEEEEEecCCcEE-eecceecCCCCceEEEeccccccCCCCCc Confidence 3221 1 1111121 11111111110000 00000 00111233333456778888888889998 Q ss_pred hhHHHHHHH-HHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEeccc Q lcl|NC_021302. 216 SLLRPAYKN-WKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPN 294 (484) Q Consensus 216 gll~~~~~~-~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~ 294 (484) |-+..-... .=--...+...+...|-|.+++.++.|........+.- ....... ...| ...+++ +.+.++.+-. T Consensus 228 s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~-~~~~~~~-~~~~--~~~~~~-~~~~~~~q~~ 302 (486) T protein:vir:42 228 SEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDSE-TGQTLFD-AYLA--RILAFE-DAEGKIQQFS 302 (486) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcCCccccccccc-cccchhh-hhhc--hhcccC-CCCceEEeec Confidence 877642211 11223445566677787777666666643221111100 0000111 1111 222333 3456665543 Q ss_pred CCchhHHHHHHHHHHHHHHHHhhhhhcccccccc---hhhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_021302. 295 GTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGS---YALA---SVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWG 368 (484) Q Consensus 295 ~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs---~A~~---evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~ 368 (484) .. +.+.++++++.-|.+.-....++...=||+ .+.| .....-.....+.-.+.+...|.+ +++.++.+-.+ T Consensus 303 ~~--~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~-~~~l~~~~~~~ 379 (486) T protein:vir:42 303 AA--ELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGGAWEE-AMRIAYRIMKG 379 (486) T ss_pred cc--CHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcC Confidence 22 344555555555544422222221111111 1112 122333333444455566666644 55655454322 Q ss_pred Cc----cccceEEecC-CCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCccc---ccccCCCcCCCcc- Q lcl|NC_021302. 369 ED----EPAPLLVFDE-IGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADD---DESTADTGQDEPE- 439 (484) Q Consensus 369 ~~----~~~P~~~~~~-~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~---~~~~~~~~~~~~~- 439 (484) .. ..--+++|.. ....+.+.++++.+|++.|.-+ .+.+.+++.+|+-.....+-. .+..+........ T Consensus 380 ~~~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~---~s~et~~~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~~ 456 (486) T protein:vir:42 380 GDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGV---IPRERARIDMGYSVKEREEMRRWDEEEAAMGLGLLGTM 456 (486) T ss_pred CCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcccCC---CCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHHHh Confidence 11 1112556753 3466788899999999876422 256778888887433211100 0000000000000 Q ss_pred -ccCCCCccccccccccccccccccccccch Q lcl|NC_021302. 440 -TDEPALPNTSGTTSTTNAPQARKRPRGRSP 469 (484) Q Consensus 440 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (484) ...+..+... .+......+.+..+.+.+. T Consensus 457 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 486 (486) T protein:vir:42 457 VDADPTVPGSP-SPTAPPKPQPAIESSGGDA 486 (486) T ss_pred hcCCCCCCCCC-CCCCCCCCCcccCCCCCCC Confidence 0000000000 0000000011111111111 No 154 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=97.70 E-value=2.7e-05 Score=45.64 Aligned_cols=401 Identities=9% Similarity=0.044 Sum_probs=160.1 Q ss_pred CCCCCCCccceeeeecc-cccchhhhhhhcccccccccccccchHHHHH---HHH-------------hcchHHHHHHHH Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNP-LAGFGTFLAQGLDQFEQVDELRWPNSVYTYT---RMC-------------REEARIASVLRA 63 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~---~m~-------------~~D~~v~s~l~~ 63 (484) |=|. +....+ ...+....+..+...-.....|..+..+.|+ ++. ...+...-++.+ T Consensus 3 ~~~~-------~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~ 75 (453) T protein:vir:73 3 LKPI-------KLMTYSRDEEITDKVVNDFMKKHQEEVERYEYLGNMYKGIMEISSQKAKDSWKPDNRLTNNFAKYIVDT 75 (453) T ss_pred cccc-------eeeeccccccCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCCCCccCccceeecchHHHHHHH Confidence 1100 000000 0000000000000000000000000011111 000 001222222233 Q ss_pred HHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCe Q lcl|NC_021302. 64 IGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGR 142 (484) Q Consensus 64 r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~ 142 (484) ...-+.+-+..+.+. ++++.++ .......-+|+..+..+. ++..||.+ ++++|.-.+|. T Consensus 76 ~~~~l~g~~~~~~~~--d~~~~~~-----------------l~~~~~~n~~~~~~~~~~~~~~~~G~~-~~~v~~d~~~~ 135 (453) T protein:vir:73 76 FVGYFNGIPIKKTHD--DKSVLEA-----------------MQLFDNLNDMEDEESELAKIACVYGRA-YELMYQNESTE 135 (453) T ss_pred hhhhhcccCceeecC--ChHHHHH-----------------HHHHHHhcChhHHHHHHHHHHHhcCeE-EEEEEeCCCCc Confidence 333333333333321 1122222 222223335777666664 78999975 56777656665 Q ss_pred eeeeeeeeeCccceee---------------eeecCCCceeeeecccccccccccceeccCCC------Ccc--cccccc Q lcl|NC_021302. 143 FWLKRLAPRPQSSIAY---------------WNVDRDGGLISIQQWPAGTFGGPGMVVMAPNS------MGP--AIPVEQ 199 (484) Q Consensus 143 ~~~~~l~~r~~~~~~~---------------~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~------~~~--~lp~~k 199 (484) ..+. ..+|+.... +..+.++... ..-+.. ...+.+.... ... .+..-- T Consensus 136 ~~i~---~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~-~~vyt~-----~~i~~~~~~~~~~~~~~~~~~~~g~vP 206 (453) T protein:vir:73 136 SEVI---YCSPLNVFMVYDDSIKQKPLFAVYYGFDEEGNLS-GTVYTL-----LETISITGKAGEVKFGESTYNVYSDLP 206 (453) T ss_pred eEEE---EEcccceEEEEeCCCCceeEEEEEEEEecCceEE-EEEEeC-----CeEEEEEecCCceEEccceeccCCcee Confidence 5433 333333311 1122222210 000000 0001110000 011 111111 Q ss_pred eEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHH-HHHH-hcCCc Q lcl|NC_021302. 200 LVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEI-ASNY-SGGES 277 (484) Q Consensus 200 ~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~-l~~~-~~g~~ 277 (484) ++.|+ +|+.|.|.+..+-...=--...+..++..++.|.+++.++.|-. .+++....+... +..+ ..... T Consensus 207 vv~~~-----n~~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~---~~~~~~~~~~~~~~~~~~~~~~~ 278 (453) T protein:vir:73 207 IVEYN-----FNEERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGAE---VDEEDAKNIKDNRLINFFDKNSN 278 (453) T ss_pred EEEec-----CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC---CCchhhhcccccccccccccccc Confidence 23332 46788899987665554556777888888898866555555532 222222222111 0000 00111 Q ss_pred eEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhH-HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 278 AGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALA-SVQADTFVQSVQTVADEIRDVAQA 356 (484) Q Consensus 278 a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~-evh~~v~~~~~~aD~~~i~~~ln~ 356 (484) +....+.+.++++++.......++.+++.+.+.|...-.+..++.++.|.+.|.+ +....-....+..-.+.+...+. T Consensus 279 ~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~- 357 (453) T protein:vir:73 279 GQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAANISDENFGNSSGVALAYKLQAMSNLALSFQRKFQSALN- 357 (453) T ss_pred cccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHH- Confidence 2334566788999887666778899999999999886555555544322221111 11122222333344455556664 Q ss_pred HHHHHHHHh-CCCC---ccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCCCCCccccccc Q lcl|NC_021302. 357 HVVEDIVDV-NWGE---DEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGPDPDADDDEST 430 (484) Q Consensus 357 qli~~l~~~-Nf~~---~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p~~~e~~~~~~ 430 (484) ++++.++.+ +... ...-..++|. ....+..+.++.+.+++ |+ + +.+.+.+.++. +.+..+-+-.... T Consensus 358 ~~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~k~~--gi-i----s~et~~~~~~~~~d~~~E~~ri~~E 430 (453) T protein:vir:73 358 RRYSLWSSLSTNASNKDAWKDIEYTFTRNEPKDIKEQAETANILK--GI-T----SEETALSVISVIPDVQAEMEKIKKK 430 (453) T ss_pred HHHHHHHHHHhccCCccccccceEEeCCCCCCCHHHHHHHHHHHh--cc-C----cHHHHHHhCCCCCCHHHHHHHHHHH Confidence 466655554 2221 1122467775 35567888999999886 64 2 34556666654 2221110000000 Q ss_pred CCCcCCCccccCCCCcccccccc Q lcl|NC_021302. 431 ADTGQDEPETDEPALPNTSGTTS 453 (484) Q Consensus 431 ~~~~~~~~~~~~~~~~~~~~~~~ 453 (484) ........+......+...-... T Consensus 431 ~~~~~~~~~~~~~~~~~~~~~~~ 453 (453) T protein:vir:73 431 KLLQLSLTRTSNLVRMKQMRGNL 453 (453) T ss_pred HHHHHHHHHhccCCcchhhhcCC Confidence 00000000000111111111101 No 155 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=97.69 E-value=2.8e-05 Score=45.57 Aligned_cols=391 Identities=13% Similarity=0.004 Sum_probs=149.2 Q ss_pred cccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCC------CcEEecC--------CCCHHHHHHHHHHHHh Q lcl|NC_021302. 29 LDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRT------DWRIRPN--------GARPEVVEHVAACLGL 94 (484) Q Consensus 29 ~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~------~~~v~p~--------~~~~e~~~~~~~~l~~ 94 (484) +...+. ..+ .+.+..++.- ..-+.++++--.+. ...+.+. +=...+++..+..+.. T Consensus 1 ~~~~~~-~~i--~~l~~~~~~~-------~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~ 70 (441) T protein:vir:80 1 MNSDEL-ALI--EGMYDRIQRL-------SSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERLDW 70 (441) T ss_pred CCccHH-HHH--HHHHHHHHHH-------HHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhhcc Confidence 111110 000 0000000000 00011111111110 0000000 0011222322222211 Q ss_pred hhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCeeeeeeeeeeCccceeeeeecCC-Ccee---- Q lcl|NC_021302. 95 PVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRD-GGLI---- 168 (484) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~d-g~l~---- 168 (484) .......+....+....-+|+.++.++ .++..||.| ++++|.-.+|... +...+|+.+. ..++++ +++. T Consensus 71 ~g~~~~d~~~l~~i~~~n~~~~~~~~~~~~~~~~G~a-~~~v~~d~~g~~~---i~~~~p~~~~-~i~d~~~~~~~~~~~ 145 (441) T protein:vir:80 71 LGWTNGDGYGLDGVYAANRLATASCDVHLDALIFGLS-FVAIIPHGDGTVS---VRPQSPKNCT-GKFSADGSRLDAGLV 145 (441) T ss_pred ccccCCChHHHHHHHHhcCHHHHHHHHHHHHhhcCee-EEEEEeCCCCceE---EEEEccceEE-EEEeCCCCceeEEEE Confidence 111111122233344445688888777 589999986 6788876677654 3444444432 112221 1111 Q ss_pred eeeccccc-----ccccccceec---------cCCCCcccccccceEEEeecCccCccccchhHHH-HHHHHHHHHHHHH Q lcl|NC_021302. 169 SIQQWPAG-----TFGGPGMVVM---------APNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRP-AYKNWKLKDELIR 233 (484) Q Consensus 169 ~~~q~~~~-----~~~~~~~~~~---------~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~-~~~~~~~K~~~~~ 233 (484) ........ .......+.. .......++..--++.|.++.+.+.|+|.|-+.. +-...=.-...+. T Consensus 146 ~~~~~~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~s 225 (441) T protein:vir:80 146 VQQTCDPEVVEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTLL 225 (441) T ss_pred EEEEecCceEEEEEEecCeEEEEEEcCCcceeeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHHH Confidence 00000000 0000010100 0011222233334566777888889999886543 2222222355666 Q ss_pred HHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCce---EEEecccCCchhHHHHHHHHHHH Q lcl|NC_021302. 234 IEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEE---AGILSPNGTPLDPRRAIEYHDHQ 310 (484) Q Consensus 234 ~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~---ie~~~~~~~~~~~~~li~~~d~~ 310 (484) .++...|.|++++.++.|-......++.. ++..+ ....+|.+.+ +++.+... .+.+.++++++.- T Consensus 226 ~~~~~~~~~~~~~~~i~G~~~~~~~~~~~--------~~~~~--~i~~~~~~~~~~~~~~~~~~~--~~~~~~~~~l~~~ 293 (441) T protein:vir:80 226 GQSVNRDFYAYPQRWVTGVSADEFSQPGW--------VLSMA--SVWAVDKDDDGDTPNVGSFPV--NSPTPYSDQMRLL 293 (441) T ss_pred HHHHHHHhhcCceeeeecCCccccccchh--------hhccc--ccccCCCCCCCCcceeEecCc--cchHHHHHHHHHH Confidence 77778888877666666532222111111 11112 3444554433 44443322 2334444444444 Q ss_pred HHHHHhhhhhccc--ccccc-hhhHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-C-CCCcc---ccceEEec Q lcl|NC_021302. 311 MALVALAHFLNLD--GKGGS-YALAS---VQADTFVQSVQTVADEIRDVAQAHVVEDIVDV-N-WGEDE---PAPLLVFD 379 (484) Q Consensus 311 Isk~ilGqtlt~~--~~gGs-~A~~e---vh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~-N-f~~~~---~~P~~~~~ 379 (484) |.+..-...+... +..|. .+.|+ ....-....+..-.+.+...|. ++++-++.+ + .+... .-.+++|. T Consensus 294 i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~-~~~~l~~~~~~~~~~~~~~~~~i~~~f~ 372 (441) T protein:vir:80 294 AQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWL-SVGFLAAKALDSRVDEADFFGDVGLRWR 372 (441) T ss_pred HHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCCcccccceeeeEEeC Confidence 4333211112111 11111 11122 1222233333333444555553 355544444 2 22111 12356675 Q ss_pred -CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccccc Q lcl|NC_021302. 380 -EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTS 453 (484) Q Consensus 380 -~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (484) ....+..+.++++.+|+..|... .+...+++.+|....+- +........ .+ .+............++. T Consensus 373 ~~~~~~~~e~ad~~~kl~~~g~~~---~s~~~~~~~l~~~~~e~-~~~~~e~~e-~~-~~~~~~~~~~~~~~~~~ 441 (441) T protein:vir:80 373 DASTPTRAATADAVTKLVGAGILP---ADSRTVLEMLGLDDVQV-EAVMRHRAE-SS-DPLAVLAGAISRQTNEV 441 (441) T ss_pred CCCCcCHHHHHHHHHHHHhcCccc---ccHHHHHHhCCCCHHHH-HHHHHHHHH-HH-HHHHHHhhhhhcccccC Confidence 34567788999999999999742 24556788888753221 111000000 00 00000000000000001 No 156 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=97.63 E-value=3.5e-05 Score=45.04 Aligned_cols=409 Identities=10% Similarity=0.023 Sum_probs=163.3 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhccccccccccc---ccchHHHHH---HHHh--------------------cc Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELR---WPNSVYTYT---RMCR--------------------EE 54 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr---~~~~~~~y~---~m~~--------------------~D 54 (484) --|.+-...++ ++..+..........|.....-...+ ..+..+.|+ +++. .. T Consensus 6 ~~~~~~~~~~~--~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki~~ 83 (474) T protein:vir:95 6 RMPWDKPYGEE--VVEQLKPQFETQEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRITT 83 (474) T ss_pred ecCCCCchhhH--HHHhhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhccccccccccccccccccceecc Confidence 11222111111 22222221110000000000000000 000011111 0000 01 Q ss_pred hHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHH-HHHHHhhcceeee Q lcl|NC_021302. 55 ARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRL-ALKSLQFGHAVFE 133 (484) Q Consensus 55 ~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~-~l~a~~~G~s~~E 133 (484) ....-++.+....+.+.+..+.. ++++..+++.+. +. -+|+..+.. ..++.-||.+ ++ T Consensus 84 n~~~~Ivd~~~~~l~g~p~~~~~--~d~~~~~~l~~~-----------------~~-n~~~~~~~e~~~~~~~~G~~-~~ 142 (474) T protein:vir:95 84 NFHQNLVDQKVSYVASKPVTYSC--EDESVLKIIHDV-----------------LD-TRWDNKLIDILTATSNKGID-WL 142 (474) T ss_pred chHHHHHHHHHhhhccCCceecc--CchHHHHHHHHH-----------------Hh-ccHHHHHHHHHHHHhhcCcE-EE Confidence 22222333333444444444432 222222222222 12 246665554 4579999975 57 Q ss_pred EEEeecCCeeeeeeeeeeCccceee-eeecCCCceeeee-cccccccc------cccceeccCCC--------------- Q lcl|NC_021302. 134 QTYFYEGGRFWLKRLAPRPQSSIAY-WNVDRDGGLISIQ-QWPAGTFG------GPGMVVMAPNS--------------- 190 (484) Q Consensus 134 ivw~~~~g~~~~~~l~~r~~~~~~~-~~~~~dg~l~~~~-q~~~~~~~------~~~~~~~~~~~--------------- 190 (484) ++|...+|.+.+ ...+|..+.. |.....+.++..- .+...... ......+.... T Consensus 143 ~v~~d~~~~~~i---~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~ 219 (474) T protein:vir:95 143 QVYINENGEMKL---FRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQ 219 (474) T ss_pred EEEecCCCceEE---EEEcccceEEEEcCCCCCceEEEEEEEEEcCeeEEEEEeCCeEEEEEEcCCccccccccCccccc Confidence 787666676543 3344444321 1111122222111 11000000 00000000000 Q ss_pred ---CcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHH Q lcl|NC_021302. 191 ---MGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLE 267 (484) Q Consensus 191 ---~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~ 267 (484) ....+..--++.| .+|+.|.|.+..+-...---...+..++..++.+.+++.++.|-. +.+.++ T Consensus 220 ~~~~~~~~g~iPvv~~-----~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~--~~~~~~------ 286 (474) T protein:vir:95 220 SHFSNGNWGRVPFIAF-----KNNPEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYE--GQDLEE------ 286 (474) T ss_pred ccccccCCCccceEee-----cCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC--cccchh------ Confidence 0011111112222 246889999988665555557778888888888777665655532 222111 Q ss_pred HHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH-H--HHHHHHHHHH Q lcl|NC_021302. 268 IASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALAS-V--QADTFVQSVQ 344 (484) Q Consensus 268 ~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~e-v--h~~v~~~~~~ 344 (484) ...++... ..+.++.+.+++++........++..++.+.+.|...-.+..++.++.+|.- .|. . ........+. T Consensus 287 ~~~~~~~~--~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~-Sg~Alk~~~~~l~~k~~ 363 (474) T protein:vir:95 287 FMRGLKYY--KAINVDGDGGVETIQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTDKFGSAP-SGIALKFLYGNLDLKAN 363 (474) T ss_pred hhhhhhcc--ceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccc-hHHHHHHHHHHHHHHHH Confidence 12233222 3556788999999887766778999999999999887555555555332211 111 1 1122222333 Q ss_pred HHHHHHHHHHHHHHHHHHHHhCCCC-ccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCCC Q lcl|NC_021302. 345 TVADEIRDVAQAHVVEDIVDVNWGE-DEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGPD 421 (484) Q Consensus 345 aD~~~i~~~ln~qli~~l~~~Nf~~-~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p~ 421 (484) .-.+.+...+ +++++.++.+.-.. +..-..+.|. ..+.++.+.+++ |+++|+. +.+.+.+.++. +.++ T Consensus 364 ~k~~~~~~~l-~~~~~li~~~~g~~~d~~~i~v~f~~~~p~d~~e~a~~---~~~~g~i-----S~et~i~~l~~v~d~~ 434 (474) T protein:vir:95 364 KLKNKATVAI-QELIGFIIDFNNLKMDVKDIEISFNFNRMMNDAEQSQI---IAQSQYL-----SRETLVKSSPLVDDYK 434 (474) T ss_pred HHHHHHHHHH-HHHHHHHHHHhCCCcccceeeEEeccCCCcCHHHHHHH---HHhcCCC-----chHHHHHhCCCCCCHH Confidence 4445566666 44666666654221 1122356664 344556555554 4456752 45666667653 3322 Q ss_pred CCcccccc-cCCCcCCCccccCCCCccccccccccccccccc Q lcl|NC_021302. 422 PDADDDES-TADTGQDEPETDEPALPNTSGTTSTTNAPQARK 462 (484) Q Consensus 422 ~~e~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 462 (484) .+-+.... .....+..... .... .....+......+... T Consensus 435 ~E~~ri~~E~~~~~~~~~~~-~~~~-~d~~~~~~~~~~~~~~ 474 (474) T protein:vir:95 435 AELERIEQEQMEYNKQLPNL-DDGG-ADGAQQQERSNDKESE 474 (474) T ss_pred HHHHHHHHHHHHHHhccccc-cccc-CCCCcCCCCCccCCCC Confidence 11000000 00000000000 0000 0000000000000000 No 157 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=97.59 E-value=4e-05 Score=44.68 Aligned_cols=417 Identities=12% Similarity=0.028 Sum_probs=171.1 Q ss_pred CCCCCCCccceee-eeccc--c--cchhhhhhhccccccccc-cc--ccchHH----HHHHHHhcchHHHHHHHHHHHHh Q lcl|NC_021302. 1 MAPKTVAPRTERG-YVNPL--A--GFGTFLAQGLDQFEQVDE-LR--WPNSVY----TYTRMCREEARIASVLRAIGLPI 68 (484) Q Consensus 1 ~~~~~~~~~~~~~-~~~~~--~--~~~~~~~~~~~~~~~~~~-lr--~~~~~~----~y~~m~~~D~~v~s~l~~r~~~v 68 (484) .-+.+.-....+- .++.. . .....+..|-..-..... .. .+.... .-..+ ..+...-++.+...-+ T Consensus 23 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ri--~~n~~~~ivd~~~~yl 100 (503) T protein:vir:59 23 AKEIAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTNNRT--SHAWHKLFVDQKTQYL 100 (503) T ss_pred hhhccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccchhccccccccccccccccee--ecchHHHHHHHHHhhh Confidence 0011100000000 00000 0 000111111000000000 00 000000 00011 1234455555666666 Q ss_pred hCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCeeeeee Q lcl|NC_021302. 69 RRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRFWLKR 147 (484) Q Consensus 69 ~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~~~~~ 147 (484) .+-+..+.. ++++..+++.+.. .-+|+..+..+ .++..||.+. +++|.-.+|.+. T Consensus 101 ~g~~~~~~~--~d~~~~~~l~~~~------------------~n~~~~~~~~~~~~~~~~G~~~-~~v~~d~dg~~~--- 156 (503) T protein:vir:59 101 VGEPVTFTS--DNKTLLEYVNELA------------------DDDFDDILNETVKNMSNKGIEY-WHPFVDEEGEFD--- 156 (503) T ss_pred hcCCeeecc--CcHHHHHHHHHHH------------------hcCHHHHHHHHHHHHhhCCeEE-EEEeecCCCceE--- Confidence 777766643 3344444443322 11466655544 4788899975 566655566654 Q ss_pred eeeeCccceee-eeecCCCceeeeec-ccc--cccc---------cccceeccCC--------------------CCccc Q lcl|NC_021302. 148 LAPRPQSSIAY-WNVDRDGGLISIQQ-WPA--GTFG---------GPGMVVMAPN--------------------SMGPA 194 (484) Q Consensus 148 l~~r~~~~~~~-~~~~~dg~l~~~~q-~~~--~~~~---------~~~~~~~~~~--------------------~~~~~ 194 (484) +...+|+.+.- |.-...+.++.... +.. .... ......+... ..+.+ T Consensus 157 i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (503) T protein:vir:59 157 YVIFPAEEMIVVYKDNTRRDILFALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQA 236 (503) T ss_pred EEEEccceeEEEEeCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeeccee Confidence 34444444321 11111222221110 000 0000 0000000000 00111 Q ss_pred ccccc--eEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHH Q lcl|NC_021302. 195 IPVEQ--LVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNY 272 (484) Q Consensus 195 lp~~k--~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~ 272 (484) .+..+ ++.| .+|+.|.|.+..+-...-.-...+..++..++.+.+++.++.|- .+.+.++ ...++ T Consensus 237 ~~~~~vPiv~~-----~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~--~~~~~~~------~~~~~ 303 (503) T protein:vir:59 237 IGWGRVPIIPF-----KNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNY--DGENPKE------FTANL 303 (503) T ss_pred ccCCccceEEe-----cCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecC--Cccccch------hhhhh Confidence 11111 2222 24678999998865555455666777777788776655554442 2222221 12234 Q ss_pred hcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhccccccc-chhhH-HHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 273 SGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGG-SYALA-SVQADTFVQSVQTVADEI 350 (484) Q Consensus 273 ~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gG-s~A~~-evh~~v~~~~~~aD~~~i 350 (484) .. ..++.++.+.+++++........++..++.+.+.|.+.-.+..++.+..+| +.|.+ +....-....+..-.+.+ T Consensus 304 ~~--~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~ 381 (503) T protein:vir:59 304 RY--HSVIKVSGDGGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETIGGGATGPALENLYALLDLKANMAERKI 381 (503) T ss_pred hc--ccceeccCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcccccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 22 245678888899998877777789999999999988876555455443222 11111 112222333334444555 Q ss_pred HHHHHHHHHHHHHHh---CCCCc---cccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCCCC Q lcl|NC_021302. 351 RDVAQAHVVEDIVDV---NWGED---EPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGPDP 422 (484) Q Consensus 351 ~~~ln~qli~~l~~~---Nf~~~---~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p~~ 422 (484) ...|. ++++.++.+ ..+.. ..-..+.|. ....+..+.++++.+|+++|+. +.+.+.+.++. +.|+. T Consensus 382 ~~~l~-~~~~~i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~Gii-----S~et~l~~l~~v~d~~~ 455 (503) T protein:vir:59 382 RAGLR-LFFWFFAEYLRNTGKGDFNPDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIM-----SKETAVARNPFVQDPEE 455 (503) T ss_pred HHHHH-HHHHHHHHHHHhccCcccccccceeEEeCCCCCCCHHHHHHHHHHHHhCCCC-----chHHHHHhCCCCCCHHH Confidence 66664 355544432 22211 111367775 4567888999999999999963 35566666643 22211 Q ss_pred C-ccccc-----ccCCCcCCCccccCCCCccccccccccccccccccccccch Q lcl|NC_021302. 423 D-ADDDE-----STADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSP 469 (484) Q Consensus 423 ~-e~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (484) + +.... .........+...........+.. ...+ ....|+.. T Consensus 456 E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~--~~~~g~~~ 503 (503) T protein:vir:59 456 ELARIEEEMNQYAEMQGNLLDDEGGDDDLEEDDPNA---GAAE--SGGAGQVS 503 (503) T ss_pred HHHHHHHHHHHHHhhhccccCccCCCCCCCcCCCCC---Cccc--CCCCCCcC Confidence 1 00000 000000000000000000000000 0000 00000000 No 158 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=97.56 E-value=4.4e-05 Score=44.45 Aligned_cols=421 Identities=10% Similarity=0.041 Sum_probs=167.3 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccc---cccccccccchHHHHH---HHH----------------------- Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQF---EQVDELRWPNSVYTYT---RMC----------------------- 51 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~lr~~~~~~~y~---~m~----------------------- 51 (484) |-|+--... +- ..+.....-+... .....+ ....+.|+ +++ T Consensus 1 ~~~~~~~~~--~~------~~~~~~~~~i~~~~~~~~~~~~--~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnk 70 (537) T protein:vir:78 1 MTSPLLNKP--ID------QLGGLLNTEITTYMASNHIKWA--HIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVK 70 (537) T ss_pred CCccccccc--HH------HHHHHHHHHHHHHHHHHHHHHH--HHHHHHhcccchhhhcccccccccccccccccccccc Confidence 221111110 00 0010000000000 000000 00011111 110 Q ss_pred hcchHHHHHHHHHHHHhhCCCcEEecCCCCH-HHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHH-HHHHHhhcc Q lcl|NC_021302. 52 REEARIASVLRAIGLPIRRTDWRIRPNGARP-EVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRL-ALKSLQFGH 129 (484) Q Consensus 52 ~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~-e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~-~l~a~~~G~ 129 (484) -......-.+.+...-+.+.+-.+...+++. +..+.+.+. ++ ..|++.+.. ..++..||. T Consensus 71 i~~nf~k~Ivd~~~~yl~G~Pv~~~~~d~~~~e~~~~l~~~-----------------~~-~~~~~~~~el~~~~s~~G~ 132 (537) T protein:vir:78 71 ISHGFFTELVDQLAQYLLSNGVEVKVKDEDNTQLDEILQEY-----------------FD-EDFQATIDTLVTNASKKGF 132 (537) T ss_pred cccchHHHHHHHHhhhhcccCceeecCcchhHHHHHHHHHH-----------------hh-ccHHHHHHHHHHHHhhcCe Confidence 0122333344444555556665555433221 111111111 11 246655544 457889998 Q ss_pred eeeeEEEeecCCeeeeeeeeeeCccceeeeeecCCCceeeeecccc-------ccc--c--------cccceeccCCCCc Q lcl|NC_021302. 130 AVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPA-------GTF--G--------GPGMVVMAPNSMG 192 (484) Q Consensus 130 s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~-------~~~--~--------~~~~~~~~~~~~~ 192 (484) + .|++|.-.+|.+... ..+|+.+ +..+++.+.+..+-.... ... . ......+.....+ T Consensus 133 a-y~~~y~de~~~~~~~---~i~p~~~-~pv~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~ 207 (537) T protein:vir:78 133 E-GIFARTTSEGKLKFQ---TVDGLTL-IPVFDDYGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEG 207 (537) T ss_pred e-EEEeeecCCCceEEE---EEcccee-EEEEcCCCCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCc Confidence 6 677787677776544 3444443 233444444332111000 000 0 0000000000000 Q ss_pred -------------ccc-------------------------ccc--ceEEEeecCccCccccchhHHHHHHHHHHHHHHH Q lcl|NC_021302. 193 -------------PAI-------------------------PVE--QLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELI 232 (484) Q Consensus 193 -------------~~l-------------------------p~~--k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~ 232 (484) .++ +.. -++.| .+|..|.|.+..+-...-.-...+ T Consensus 208 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~f-----~nn~~~~sd~e~v~~LiDayd~~~ 282 (537) T protein:vir:78 208 VSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSKFPFQLL-----YNNKDGMSDVKRVKSIIDDYDVMN 282 (537) T ss_pred ccccccccccccccccceeeeccccccccccccccccccccCCcceeEEEe-----ccCccCCCchhhhHHHHHHHHHHH Confidence 000 000 11222 236678888888776666667788 Q ss_pred HHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHH Q lcl|NC_021302. 233 RIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMA 312 (484) Q Consensus 233 ~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Is 312 (484) .+.+..++.|..++.++.|-. ..+.++ ...++... .+..+-..+.+++++.-......++.+++++.+.|- T Consensus 283 S~~an~~~~~~~~ilvi~g~~--~~~~~~------~~~~l~~~-~~i~v~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~ 353 (537) T protein:vir:78 283 CFLSNNLQDFSEAIYVVKGFS--GDSTDK------LRQNIKAK-KMIGVNGDNAGMEIQTVSIPYEARKAKMDIDVENIY 353 (537) T ss_pred HhhhhHHHHhcCceeeeecCC--Cccchh------HHHHHhhc-CceeecCCCCceeEEEecCCHHHHHHHHHHHHHHHH Confidence 888889998877666665522 222221 12233221 122233467889999877777788899999999997 Q ss_pred HHHhhhhhcccccccchhhHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHh---CCCC--ccccceEEec-CCC Q lcl|NC_021302. 313 LVALAHFLNLDGKGGSYALASVQADTFVQSV----QTVADEIRDVAQAHVVEDIVDV---NWGE--DEPAPLLVFD-EIG 382 (484) Q Consensus 313 k~ilGqtlt~~~~gGs~A~~evh~~v~~~~~----~aD~~~i~~~ln~qli~~l~~~---Nf~~--~~~~P~~~~~-~~~ 382 (484) +. +++..++..++|.+.| +....+...+ ..-.+.+...|.+ +++.++.+ .... +.....+.|. ..+ T Consensus 354 ~~--s~~~~~~~~~~gn~SG-vAlk~~~~~l~~ka~~ke~~f~~~l~~-~~~~i~~~~~~~~~~~~d~~~i~i~f~~~~P 429 (537) T protein:vir:78 354 RS--GMGFNSTAVGDGNVTN-VVIKSRYTLLAMKARKMETSLRKVLRW-CADMVVSDIALRGLGEYDSNDICFEIEPHVL 429 (537) T ss_pred Hh--cCCCCCccccccCCcH-HHHHHHHhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhcCCcccccceeeEEeccCCC Confidence 65 3333222223333333 3333333222 2333444455533 34433332 2111 1123467775 467 Q ss_pred CcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcc------------c----ccccCCC--cCCCccccCCC Q lcl|NC_021302. 383 SRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDAD------------D----DESTADT--GQDEPETDEPA 444 (484) Q Consensus 383 ~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~------------~----~~~~~~~--~~~~~~~~~~~ 444 (484) .++.+.++.+++|++.|+. |.+.+.+.++.-...+.+. . ....++. ..+.++ .... T Consensus 430 ~n~~e~a~~~~~l~~~gii-----S~eT~l~~~p~vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 503 (537) T protein:vir:78 430 ANELDIATTRKTEAETEAL-----KIGNIMTVAPRIGDDETLKLIAEELDLDYNELKDALAEQDAQSLDVSPDVQ-AMLD 503 (537) T ss_pred CCHHHHHHHHHHHHhcCcc-----hHHHHHHhCCCCCCHHHHHHHHHHHHhhhhhhhhhhhhhcccccCcCcchh-hhcC Confidence 7888899999999988863 3444444443311100000 0 0000000 000000 0000 Q ss_pred CccccccccccccccccccccccchHHHhcCcccCcc Q lcl|NC_021302. 445 LPNTSGTTSTTNAPQARKRPRGRSPRDRRKTPDGAMP 481 (484) Q Consensus 445 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (484) ......++.+..-.++...|...-+.+-+--|+ | T Consensus 504 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~---~ 537 (537) T protein:vir:78 504 GLPVNANQPPVDPNQPVADPNVVPPTDPNAVPQ---T 537 (537) T ss_pred CCCCCCCCCCCCccCCCCCCCCCCCCCCccCCC---C Confidence 000000111111111111111111111111111 1 No 159 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=97.50 E-value=5.5e-05 Score=43.93 Aligned_cols=416 Identities=9% Similarity=-0.036 Sum_probs=153.8 Q ss_pred hhccccccc--ccccccchH--HHHHHHHhcchHHHHHHHHHHHHhhCCCcEEe-cC----------------CCCHHHH Q lcl|NC_021302. 27 QGLDQFEQV--DELRWPNSV--YTYTRMCREEARIASVLRAIGLPIRRTDWRIR-PN----------------GARPEVV 85 (484) Q Consensus 27 ~~~~~~~~~--~~lr~~~~~--~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~-p~----------------~~~~e~~ 85 (484) |.=.+.+.. ..++ +.+ ++++.+.. =..-+.++..--.+..-... +. +=...++ T Consensus 1 ~~~~p~~~l~~~~~~--~~~~~~l~~~~~~----~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iV 74 (479) T protein:vir:99 1 MIDLPDEDLSSEGLA--KYLETKVFPKMNT----ECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMV 74 (479) T ss_pred CccCCcccCChhHHH--HHHHHHHHHHHHH----HhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHH Confidence 100001100 0000 000 00100000 00111111111111110000 00 0001122 Q ss_pred HHHHHHHHhh---hccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEee-----cCCeeeeeeeeeeCccce Q lcl|NC_021302. 86 EHVAACLGLP---VEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFY-----EGGRFWLKRLAPRPQSSI 156 (484) Q Consensus 86 ~~~~~~l~~~---~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~-----~~g~~~~~~l~~r~~~~~ 156 (484) +..+..+... ..+.+......+....-+|+.....+ .++..||. .++++|.. ..|.. .+..++|+.+ T Consensus 75 d~~~~~l~~~gf~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~-af~~v~~~~~~~d~~g~~---~i~~~~p~~~ 150 (479) T protein:vir:99 75 NSFAQQLIVDGYRKTGTNENAKGWDTWRLNQMDKQQFWLNRAVLTFGY-AFIKVTSGISPLDGTTVA---RIKCIDPRDA 150 (479) T ss_pred HHHHhhcccccccCCCchhhHHHHHHHHhcChhHHHHHHHHHHhhcCc-eEEEEecCCCCcCCCCce---EEEEechhhe Confidence 2222111100 11111122233333344677777776 48999997 57788852 12332 2444455543 Q ss_pred eeeeecCCCc--eeeeecccccccc----cccceecc--------CCCCcccccccceEEEeecCccCccccchhHHHHH Q lcl|NC_021302. 157 AYWNVDRDGG--LISIQQWPAGTFG----GPGMVVMA--------PNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAY 222 (484) Q Consensus 157 ~~~~~~~dg~--l~~~~q~~~~~~~----~~~~~~~~--------~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~ 222 (484) .....+...+ .++.......... ......+. .......+...-++.|+++.+. .++|.|.+..+- T Consensus 151 ~~iydd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~-~~~g~sd~e~v~ 229 (479) T protein:vir:99 151 FAIWEDPYWDEWPKYLLERQPNGQYWWWTEEDYSIFEFKQGKFIYRETVSHDYGHIPFVRYVNVMDL-RGVCYGDVEPLV 229 (479) T ss_pred EEEecCCcccceeeEEEeecCceeEEEEecceEEEEEecCCceeeccccccCCCCcceEEeecCCCc-CcCCcchhHHHH Confidence 2211111111 1111000000000 00000000 0111222333446777777665 467999888755 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccC-CchhHH Q lcl|NC_021302. 223 KNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNG-TPLDPR 301 (484) Q Consensus 223 ~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~-~~~~~~ 301 (484) ...=-=...+.......+.|.+++.+++|-.....+..+... ..+..+ .+....+.++++.+-+. +...|. T Consensus 230 ~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~-----~~~~~~---~i~~~~~~~~~~~q~~~~~~~~~~ 301 (479) T protein:vir:99 230 TVAKAIDKTGLDILLVQHHQSFQIRWATGLMLPEGANADQEK-----MRFAQE---SMLISQNEKASFGAIPAAPLDGLL 301 (479) T ss_pred HHHHHHHHHHHHHHHHHHHhhchhhhhcCCCcccccccchhc-----cccccc---cceeecCCCceEEEecccchHHHH Confidence 444334556667777788888877777774322222111111 112111 22333455666665432 334566 Q ss_pred HHHHHHHHHHHHHHhhhhhccc-ccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc--c-cceEE Q lcl|NC_021302. 302 RAIEYHDHQMALVALAHFLNLD-GKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDE--P-APLLV 377 (484) Q Consensus 302 ~li~~~d~~Isk~ilGqtlt~~-~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~--~-~P~~~ 377 (484) ..++.+-.+|+....-..-..+ ..+.|...-.....-.....+.-.+.+...|.+ +++.++.+.-.... . --.++ T Consensus 302 ~~l~~~i~~i~~~t~~p~~~~g~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~-~~~l~~~~~~~~~~~~~~~i~~~ 380 (479) T protein:vir:99 302 NAYKESLLEFLALAQLPPHIAGQIVNVAADALAAGTRQTMQKLFEKQATWKASHNQ-TMRLVNKIEGRTEEATDLDFTIT 380 (479) T ss_pred HHHHHHHHHHhccCCCCHHHcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHcCCCccccceeeeEE Confidence 6666666666543211111111 111121111222333334444445566666743 66665555422111 1 12455 Q ss_pred ecCC-CCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHh-CCCCCCCCc--cc--cc--------ccCCCcCCCccccCC Q lcl|NC_021302. 378 FDEI-GSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAA-GLPGPDPDA--DD--DE--------STADTGQDEPETDEP 443 (484) Q Consensus 378 ~~~~-~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~-glp~p~~~e--~~--~~--------~~~~~~~~~~~~~~~ 443 (484) |... .....+.++++.+|+++|.. +.+.+.+.+ |+..++-+. .. .. ....+..+..+.+.+ T Consensus 381 w~~~~~~s~~~~ad~~~kl~~ag~i-----s~et~l~~l~gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (479) T protein:vir:99 381 WQDVTIQSLAQFADAWAKMVESLKI-----PAEGVWDMIPNLDQSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGP 455 (479) T ss_pred ecCCCCCCHHHHHHHHHHHHhcCCC-----CHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCC Confidence 6543 34667889999999998852 345555655 775442110 00 00 000000000111111 Q ss_pred CCccccccccccccccccccccccchHHHhcCcc Q lcl|NC_021302. 444 ALPNTSGTTSTTNAPQARKRPRGRSPRDRRKTPD 477 (484) Q Consensus 444 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (484) ..........+..+. ++.-|++.- T Consensus 456 ~~~~~~~~~~~~~~~----------~~~~~~~~~ 479 (479) T protein:vir:99 456 NGATNMQQANNKTGE----------PASLNKSGA 479 (479) T ss_pred CCCCCCCCCCCCCcc----------hhccCCCCC Confidence 111111111111111 111111111 No 160 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=97.49 E-value=5.7e-05 Score=43.86 Aligned_cols=412 Identities=10% Similarity=0.022 Sum_probs=165.0 Q ss_pred CCCCCCCcccee---eeecccccchhhhhhhcccccccccc-cccchHHHHHH---HHh--------------------c Q lcl|NC_021302. 1 MAPKTVAPRTER---GYVNPLAGFGTFLAQGLDQFEQVDEL-RWPNSVYTYTR---MCR--------------------E 53 (484) Q Consensus 1 ~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~l-r~~~~~~~y~~---m~~--------------------~ 53 (484) |=|-++.++... -+.+...-.-...+..+..... ... |..+..+.|+- ++. . T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~e~~~~~i~~~i~~~~-~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~ 90 (483) T protein:vir:12 12 LYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHL-EKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMI 90 (483) T ss_pred eecCcchhhhhhhcccccCCchhhHHHHHHHHHHHHH-HHHHHHHHHHHHhccccccccccccccccccccccccccccc Confidence 333333333211 1111110000001000000000 000 01111122210 000 0 Q ss_pred chHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceee Q lcl|NC_021302. 54 EARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVF 132 (484) Q Consensus 54 D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~ 132 (484) .....-++.+....+.+-+..+.. ++++..+++.+. .. -+|++.+.++ .++..||.+ . T Consensus 91 ~n~~k~Ivd~~~~~l~G~p~~~~~--~d~~~~~~l~~~-----------------~~-n~~~~~~~~~~~~~~~~G~~-y 149 (483) T protein:vir:12 91 TNFHANLVDQKVSYIVGKPIAFKH--TDDEVVKRIDEV-----------------LG-NRFDDKLHSVLTGASNKGIE-W 149 (483) T ss_pred cchHHHHHHHHhhhhcccCceecc--CChHHHHHHHHH-----------------Hh-ccHHHHHHHHHHHHhhCCeE-E Confidence 122222333333334444444432 222222222222 12 2466666665 578999985 5 Q ss_pred eEEEeecCCeeeeeeeeeeCcccee-eeeecCCCceeeeecc-cccccc------cccceeccCCC-------------- Q lcl|NC_021302. 133 EQTYFYEGGRFWLKRLAPRPQSSIA-YWNVDRDGGLISIQQW-PAGTFG------GPGMVVMAPNS-------------- 190 (484) Q Consensus 133 Eivw~~~~g~~~~~~l~~r~~~~~~-~~~~~~dg~l~~~~q~-~~~~~~------~~~~~~~~~~~-------------- 190 (484) +++|.-.+|... +...+|+.+. .|.....+.++...+. ...... ......+.... T Consensus 150 ~~v~~d~d~~~~---i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~ 226 (483) T protein:vir:12 150 LHPYLDEEGEFK---LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENS 226 (483) T ss_pred EEEEEcCCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEEeecceEEEEEecCeEEEEEEeCCeeeeccccccccc Confidence 677766667654 3344454432 1221122333321111 000000 00000000000 Q ss_pred --Cc--ccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHH Q lcl|NC_021302. 191 --MG--PAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELL 266 (484) Q Consensus 191 --~~--~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~ 266 (484) .. .++..--++.|+ +|+.|.|.+..+-...---...+..++..++-|.. |.++.+-....+.++ .. T Consensus 227 ~~~~~~~~~g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~--~~lv~~g~~~~~~~~---~~ 296 (483) T protein:vir:12 227 KTHFSTGSWGKIPFIPFK-----NNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNE--LTYVLTNYDDQELPE---FK 296 (483) T ss_pred ccccccCCCCccceEEec-----CCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcC--ceeeeecCCcccchh---HH Confidence 00 001111122222 36788999987665555556678888888887655 554443222222222 12 Q ss_pred HHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH-HH--HHHHHHHH Q lcl|NC_021302. 267 EIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALAS-VQ--ADTFVQSV 343 (484) Q Consensus 267 ~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~e-vh--~~v~~~~~ 343 (484) ..++ .. ..+.++.+.+++++........++.+++.+.+.|...--...++.++-+|.- .|. .. ..-....+ T Consensus 297 ~~~~---~~--~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~-Sg~Al~~~~~~l~~k~ 370 (483) T protein:vir:12 297 RLLR---YY--GAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAP-SGVALEFLYTNLNLKA 370 (483) T ss_pred Hhhh---hc--cccccCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCc-HHHHHHHHHHHHHHHH Confidence 2222 11 3456788999999987766778999999999988887555445544323211 121 11 12222233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhC-CCCccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCC Q lcl|NC_021302. 344 QTVADEIRDVAQAHVVEDIVDVN-WGEDEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGP 420 (484) Q Consensus 344 ~aD~~~i~~~ln~qli~~l~~~N-f~~~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p 420 (484) ..-.+.+...+. ++++-++.+. ......-..+.|. ..+.+..+.++.+.+|+ |+ ++++.+.+.++. +.+ T Consensus 371 ~~~~~~f~~~l~-~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl~--Gi-----iS~et~~~~~~~v~d~ 442 (483) T protein:vir:12 371 DKLARKAKVAIQ-ELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSM--GI-----VSHETVLENHPFVEDL 442 (483) T ss_pred HHHHHHHHHHHH-HHHHHHHHHhcCCCccceeeEEeCCCCCCCHHHHHHHHHHHh--cc-----CchHHHHHhCCCCCCH Confidence 344455555553 3666555543 2222222356675 45677888899999884 64 245667777654 322 Q ss_pred CCCcccccccCCCcCCCccccCCCCccccccccccccccccccccccch Q lcl|NC_021302. 421 DPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSP 469 (484) Q Consensus 421 ~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (484) ..+-+..... ..+............ .....+.......-.. T Consensus 443 ~~E~~ri~~E----~~~~~~~~~~~~~~~----~d~~~~~~~~~~~e~e 483 (483) T protein:vir:12 443 QAELERIEQE----QMEYNKQLPNLDDGG----ADGAQQQERSNNKESE 483 (483) T ss_pred HHHHHHHHHH----HHHHHhhcccccccc----cCCcccCCCCCcccCC Confidence 2110000000 000000000000000 0000000000000000 No 161 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=97.48 E-value=5.8e-05 Score=43.80 Aligned_cols=408 Identities=11% Similarity=0.055 Sum_probs=162.8 Q ss_pred CCC----CCCCccceeeeecccccchhhhhhhccccccc--ccc-cccchHHHHH---HHHh------------------ Q lcl|NC_021302. 1 MAP----KTVAPRTERGYVNPLAGFGTFLAQGLDQFEQV--DEL-RWPNSVYTYT---RMCR------------------ 52 (484) Q Consensus 1 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~l-r~~~~~~~y~---~m~~------------------ 52 (484) |+- -......| ++......+......|...... ..+ +..+..+.|+ +++. T Consensus 1 ~~~~~~~~~~~~~~e--~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ 78 (478) T protein:vir:10 1 MISINWPWDKPYHEQ--VVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKPDW 78 (478) T ss_pred CccccCCCCchhHHH--HHHHHhhccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccccccccccccccccc Confidence 110 00000000 0111100000000000000000 000 0001111121 0000 Q ss_pred --cchHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcc Q lcl|NC_021302. 53 --EEARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGH 129 (484) Q Consensus 53 --~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~ 129 (484) ..+...-++.+...-+.+-+..+... +++..+.+ ...++ -+|.+.+..+ .++..||. T Consensus 79 ki~~n~~~~ivd~~~~~l~g~~~~~~~~--~d~~~~~l-----------------~~~~~-n~~~~~~~~~~~~~~~~G~ 138 (478) T protein:vir:10 79 RMYTNYHQNLVDQKVAYAVANPVTFGVD--NDKALKQI-----------------QHTLN-HKWDDKLVDILTAASNKGI 138 (478) T ss_pred eeccchHHHHHHHHHhhhccCCeeeecC--ChHHHHHH-----------------HHHHh-cCHHHHHHHHHHHHHhcCe Confidence 01111222222222333333333221 11222221 22222 2566666655 47899998 Q ss_pred eeeeEEEeecCCeeeeeeeeeeCcccee-eeeecCCCceeee-eccccccccc------ccc--eecc------------ Q lcl|NC_021302. 130 AVFEQTYFYEGGRFWLKRLAPRPQSSIA-YWNVDRDGGLISI-QQWPAGTFGG------PGM--VVMA------------ 187 (484) Q Consensus 130 s~~Eivw~~~~g~~~~~~l~~r~~~~~~-~~~~~~dg~l~~~-~q~~~~~~~~------~~~--~~~~------------ 187 (484) +. +++|.-.+|.+.+. ..+|..+. .|.....+.++.. +.+....... ... +... T Consensus 139 ~~-~~~~~d~~g~~~~~---~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~ 214 (478) T protein:vir:10 139 EW-VQPYVDEEGEFKTF---RVPAEQAVPIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTYYELKEGQLIPDFYRSD 214 (478) T ss_pred EE-EEEEecCCCeeEEE---EEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCeeeccccccc Confidence 64 67776666765433 33444332 1222222222211 1110000000 000 0000 Q ss_pred --------CCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCH Q lcl|NC_021302. 188 --------PNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDD 259 (484) Q Consensus 188 --------~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~ 259 (484) ......++..--++.|+ +|++|.|.+..+....-.-...+..++..++.|..++.++.|-......+ T Consensus 215 ~~~~~~~~~~~~~~~~~~vPvv~~~-----n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~~ 289 (478) T protein:vir:10 215 DHIQPHYYQGNKLMSWGRVPFIPFK-----NNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKD 289 (478) T ss_pred cccccceecccccccCCccceEEec-----cCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccch Confidence 00011111111233332 47889999988666665666778888888898877666665532211111 Q ss_pred HHHHHHHHHHHHHhcCCceEEEcc--CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHH--- Q lcl|NC_021302. 260 DRMDELLEIASNYSGGESAGLALT--AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASV--- 334 (484) Q Consensus 260 ~~~~~l~~~l~~~~~g~~a~~vip--~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~ev--- 334 (484) ...++... ..+.++ .|.+++++........++..++.+.+.|.+.-.+..++.++.+|. ..|.. T Consensus 290 --------~~~~~~~~--~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n-~Sg~Al~~ 358 (478) T protein:vir:10 290 --------FMHNLKYY--KAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNS-PSGIALKF 358 (478) T ss_pred --------hhhhhhhc--ceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCccccccc-cHHHHHHH Confidence 12223222 233343 567888887666677899999999999998866555555443332 11211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-CccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHH Q lcl|NC_021302. 335 QADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWG-EDEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLR 412 (484) Q Consensus 335 h~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~-~~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~ 412 (484) ...........-.+.+...+. ++++.++.+... ....-+.++|. ..+.+..+.++++.+| .|+ ++.+.+. T Consensus 359 ~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~g~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~-----iS~et~~ 430 (478) T protein:vir:10 359 MYSNLDLKANKLKNKTLTALQ-ELLQYIIDFYRLDVKVQDIEITFNFNVMVNELENSQIAMNS--TGL-----LSKETIL 430 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCcccccceEEecCCCCCCHHHHHHHHHHH--hCC-----CChHHHH Confidence 112222223334455566664 466666665422 12223467775 3556788889999888 454 2466788 Q ss_pred HHhCC-CCCCCCcccccc-cCCCcCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 413 DAAGL-PGPDPDADDDES-TADTGQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 413 e~~gl-p~p~~~e~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) +.++. ..++.+-+.... .....+...... .............+.++ T Consensus 431 ~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 431 SNHAWVEDPVAEMERIEQENIELNQQLPDIE-EGLNGEQQRQSENNQPE 478 (478) T ss_pred HhCCCCCCHHHHHHHHHHHHHHHHhhccccc-cccCCCCCCCCCCCCCC Confidence 88865 222111000000 000000000000 00000000000000001 No 162 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=97.48 E-value=5.9e-05 Score=43.77 Aligned_cols=410 Identities=9% Similarity=0.023 Sum_probs=161.9 Q ss_pred CCCCCCCc---cceeeeeccccc-chhh-hhhhcccccccccccccchHHHHHH---HH------------hcchHHHHH Q lcl|NC_021302. 1 MAPKTVAP---RTERGYVNPLAG-FGTF-LAQGLDQFEQVDELRWPNSVYTYTR---MC------------REEARIASV 60 (484) Q Consensus 1 ~~~~~~~~---~~~~~~~~~~~~-~~~~-~~~~~~~~~~~~~lr~~~~~~~y~~---m~------------~~D~~v~s~ 60 (484) |+=-.-.+ +....++.|... +... ....+........-|..+..+.|+- ++ .......-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~ki~~n~~~~I 80 (470) T protein:vir:99 1 MKDINYGRDKVTGNSSFIFPKGEKLTSNELLGFIAYNETVLKPRYRENMKLYLGKHKILTAPEKETGADNRIVVNSAKYV 80 (470) T ss_pred CccccCCcccccCCceEEeCCCCCcCHHHHHHHHHHHHHhhHHHHHHHHHHhccccccccCcccccCCcceeecchHHHH Confidence 22111111 111111111000 0000 0000000000000000011111110 00 001122222 Q ss_pred HHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeec Q lcl|NC_021302. 61 LRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYE 139 (484) Q Consensus 61 l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~ 139 (484) +.....-+.+-+..+...+++ +. ...+.+....-+|+..+..+. ++..||.+ ++++|.-. T Consensus 81 vd~~~~~l~g~p~~~~~~~d~-~~-----------------~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~-~~~v~~d~ 141 (470) T protein:vir:99 81 VDVYNGYFCGIEPKLALLNDS-SK-----------------IDEIARWNRQENFFDTINEISKQCDIFGRS-IASIYQGE 141 (470) T ss_pred HHHHhhhhccCCeeEeeCCch-hH-----------------HHHHHHHHHhcCHhHHHHHHHHHHHhcCee-EEEEEeCC Confidence 222223333333333221111 11 112233333446777666664 79999975 77888766 Q ss_pred CCeeeeeeeeeeCccceeeeeecCCCc--eeeeeccc---cccc--------ccccceeccCC----------CCccccc Q lcl|NC_021302. 140 GGRFWLKRLAPRPQSSIAYWNVDRDGG--LISIQQWP---AGTF--------GGPGMVVMAPN----------SMGPAIP 196 (484) Q Consensus 140 ~g~~~~~~l~~r~~~~~~~~~~~~dg~--l~~~~q~~---~~~~--------~~~~~~~~~~~----------~~~~~lp 196 (484) +|.+. +...+|+.+. ..+++.+. ++...+.. .+.. .....+.+... ....++. T Consensus 142 dg~~~---i~~~~p~~~~-~i~d~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 217 (470) T protein:vir:99 142 DARPH---LMYSSPNHAF-IIYDDTVQRQPLAFVHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEEDTNAAGYAINPYG 217 (470) T ss_pred CCeEE---EEEEccceeE-EEEcCCCCcceEEEEEEEEEecCCeeEEEEEEEecCeEEEEEecccccccccccccccCCC Confidence 77654 3444555432 12222211 11110000 0000 00011111100 0111111 Q ss_pred ccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCC Q lcl|NC_021302. 197 VEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGE 276 (484) Q Consensus 197 ~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~ 276 (484) .--++.+ .+|+.|.|.+..+-...---...+..++..++.|.+++.++.|-.... ++..+ .+..+... T Consensus 218 ~vPvv~~-----~n~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~--~~~g~----~~~~~~~~- 285 (470) T protein:vir:99 218 LVPAVEF-----FENEERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPE--DDEGN----PKFDFKNN- 285 (470) T ss_pred ccceEee-----cCCCCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccc--ccccc----hhhhhhhc- Confidence 1112222 246789999988665555556677888888898877666666633222 11111 12223211 Q ss_pred ceEEEcc-----CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH-HH--HHHHHHHHHHHHH Q lcl|NC_021302. 277 SAGLALT-----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALAS-VQ--ADTFVQSVQTVAD 348 (484) Q Consensus 277 ~a~~vip-----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~e-vh--~~v~~~~~~aD~~ 348 (484) ..+.+| .+.+++++........++..++.+.+.|...-....++.++.+|+- .|. .+ ..-....+..-.+ T Consensus 286 -~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~-Sg~Ai~~~~~~l~~k~~~~~~ 363 (470) T protein:vir:99 286 -RVLYVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNS-SGVALQYKLFAMKNKADSKER 363 (470) T ss_pred -ceeeecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCccccccccccCc-hHHHHHHHHHHHHHHHHHHHH Confidence 223333 4567888877666778889999999999887655555544433321 121 11 1222233334445 Q ss_pred HHHHHHHHHHHHHHHHh-CC--CCc--cccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCC Q lcl|NC_021302. 349 EIRDVAQAHVVEDIVDV-NW--GED--EPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDP 422 (484) Q Consensus 349 ~i~~~ln~qli~~l~~~-Nf--~~~--~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~ 422 (484) .+...|. ++++.++.+ +. ... ..-..+.|. ....+..+.++++.+|+ |+ + +.+.+.+.++.-.+. T Consensus 364 ~~~~~l~-~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~--gi-i----s~et~l~~l~~vd~~- 434 (470) T protein:vir:99 364 KFDKSLM-QLYRIVLATLFNNKQDQELWSELDFKFTRNLPEDMASAIDNAKNAE--GI-V----SKKTQLGMIPDIEPD- 434 (470) T ss_pred HHHHHHH-HHHHHHHHHHhccCCcccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-C----CHHHHHHhCCCCCHH- Confidence 5556663 466655443 21 111 112467775 34567888899999885 54 2 345666666543221 Q ss_pred CcccccccCCC--cCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 423 DADDDESTADT--GQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 423 ~e~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) ++...-.... ......+.....+....+ +....+ T Consensus 435 -~E~eri~~E~~~~~~~~~~~~~~~d~~~~d--~~~ee~ 470 (470) T protein:vir:99 435 -AEMKQIAKEKADAIKQTQQLSMPIDILKRD--NNAEEE 470 (470) T ss_pred -HHHHHHHHHHHHHHHHHHhhcCCCCcCCCC--CCccCC Confidence 1110000000 000000000000000000 000000 No 163 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=97.45 E-value=6.4e-05 Score=43.57 Aligned_cols=422 Identities=11% Similarity=0.014 Sum_probs=159.8 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHH----HHHHHHHHHhhCCCcEEe Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIA----SVLRAIGLPIRRTDWRIR 76 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~----s~l~~r~~~v~~~~~~v~ 76 (484) |-|.+-++-. |.-... ..++..|...-.....|.. .+.-|=+......++. -.+..+ +.+ .+| T Consensus 1 ~~~~~~~~~~--gl~~~~----~~~~~~L~~~~~~~~~~~~-~~~~Yy~G~~~~~~~~~~~p~~~r~~-~~v--~nw--- 67 (474) T protein:vir:81 1 MIQQQTVRIP--SLSNDE----NALINGLLAQIENLRWKNL-LRTSYYENKRTIQYVGTLIPPQYFNL-GLV--LGW--- 67 (474) T ss_pred CcCCCcCcCC--CCChhH----HHHHHHHHHHHHHHhhHHH-HHHHHhccCCChhhccccccHHHHHH-Hhh--cCh--- Confidence 3333222211 000000 0000000000000000000 0111111111111111 111111 011 122 Q ss_pred cCCCCHHHHHHHHHHHHhhhc---c-chhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCeeeeeeeeee Q lcl|NC_021302. 77 PNGARPEVVEHVAACLGLPVE---G-DESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRFWLKRLAPR 151 (484) Q Consensus 77 p~~~~~e~~~~~~~~l~~~~~---~-~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~~~~~l~~r 151 (484) ...+++.+++.+..... + ++.+....+....-+|+.....+ .+|+.||.|. .+||.-++|.-. ..|..+ T Consensus 68 ----~~~~Vd~~a~rl~~~Gf~~~d~~~~~~~l~~iw~~N~ld~~~~~~~~~al~~G~sf-~~V~~~~d~~~~-~~i~~~ 141 (474) T protein:vir:81 68 ----TGKAVDALARRCNLEGFVWPDGDLDSLGGTEVVDDNHLLSEIDSAIVAAMQHGPAF-LINTVGEDDEPE-ALIHVK 141 (474) T ss_pred ----HHHHHHHHHhhhcccceECCCCCccchHHHHHHHhcChhHHHHHHHHHHHhhCcee-EEEecCCCCCce-eEEEEe Confidence 12233333332221111 0 01111122333334577666665 5899999995 777765544321 113333 Q ss_pred Cccc---------------eeeeeecCCCceeeeecccccccc------cccceeccCCCCcccccccceEEEeecCccC Q lcl|NC_021302. 152 PQSS---------------IAYWNVDRDGGLISIQQWPAGTFG------GPGMVVMAPNSMGPAIPVEQLVVYTHDMDPG 210 (484) Q Consensus 152 ~~~~---------------~~~~~~~~dg~l~~~~q~~~~~~~------~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~ 210 (484) +|++ +.++..+.+|......-+..+..- ....+..........+| ++.+.++.+.+ T Consensus 142 sp~~~~~~~D~~~~~~~~al~~~~~~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~~~gvP---vV~~~n~~~~~ 218 (474) T protein:vir:81 142 DASEATGEWNRRRRGLNNLLSIIDKDKEGKVLSLALYLDNETVTAQRDKATLKWQVDRDEHVYGVP---AQVLPYKPAPK 218 (474) T ss_pred ccceEEEEEeCCCCcceeeeEEEEEcCCCcEEEEEEEeCCcEEEEEEcCccceeeeccCCCCCCcc---eEEeccccccc Confidence 3333 223444455543222111111100 00000111111111233 68888888888 Q ss_pred ccccchhH-HHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHH---HHHHHHHHH---HHhcCCceEEEcc Q lcl|NC_021302. 211 VWTGNSLL-RPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDR---MDELLEIAS---NYSGGESAGLALT 283 (484) Q Consensus 211 ~p~G~gll-~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~---~~~l~~~l~---~~~~g~~a~~vip 283 (484) .|+|.|-+ +.+-...---+..+.....-.|-|.|+..+++|-......+++ ...+...+. .+..+.++ -+| T Consensus 219 ~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~--~~~ 296 (474) T protein:vir:81 219 RPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWEARLGRIKGLPDDADA--DIP 296 (474) T ss_pred CcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCChhhcccccccccchhhhhHHHHhcCCCcccc--ccc Confidence 99997743 4443222222334444455667777777788775432221111 112222222 22222111 122 Q ss_pred CCceEEEeccc-CCchhHHHHHHHHHHHHHHHHhh--hhh--cc-cccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 284 AGEEAGILSPN-GTPLDPRRAIEYHDHQMALVALA--HFL--NL-DGKGGSYALASVQADTFVQSVQTVADEIRDVAQAH 357 (484) Q Consensus 284 ~~~~ie~~~~~-~~~~~~~~li~~~d~~Isk~ilG--qtl--t~-~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~q 357 (484) .....++-+.+ .+...|...++.+-.+||-.-.- +.| ++ ++-.+.-|+... .+-+....+.-.+.+...+. + T Consensus 297 ~~~~~~~~q~~~a~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~-~~~l~~kae~k~~~fg~~l~-~ 374 (474) T protein:vir:81 297 QLARADVKQFPAASPDAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDAS-QYELIAEAEGAVDDFTPALR-K 374 (474) T ss_pred ccccccccccCCCChhHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHH-HHHHHHHHHHHHHHHHHHHH-H Confidence 22223333322 22334555555555555543111 111 11 111122233332 33334445556677888885 5 Q ss_pred HHHHHHHhC--CCCccc-----cceEEecCCC-CcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCc-cccc Q lcl|NC_021302. 358 VVEDIVDVN--WGEDEP-----APLLVFDEIG-SRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDA-DDDE 428 (484) Q Consensus 358 li~~l~~~N--f~~~~~-----~P~~~~~~~~-~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e-~~~~ 428 (484) +++..+.+. +..+.. --+.+|.+.. ......++++.||+++|..++ +.+-+++.+|+...+-.. .... T Consensus 375 ~~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~---~~~~~~~~lg~t~~~i~~~~~~~ 451 (474) T protein:vir:81 375 AFIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAAVPWLA---ETEVGLELIGLTPQQARRAMADK 451 (474) T ss_pred HHHHHHHHhCCCCccccchhhccceeEecCCCccCHHHHHHHHHHHHhcccCCC---cHHHHHhhcCCCHHHHHHHHHHH Confidence 777766664 222211 1245675443 456888999999999986443 345678888986332110 0000 Q ss_pred ccCCCcCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 429 STADTGQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 429 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) ..+...... ....+.....+ + +| T Consensus 452 ~~~~~~~~~--~~l~~~~~~~~---~---aq 474 (474) T protein:vir:81 452 RRVQGRGTL--QALIDRSNNGA---T---AQ 474 (474) T ss_pred HHHhHHHHH--HHHHhcCCCCC---C---CC Confidence 000000000 00000000000 0 00 No 164 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=97.42 E-value=7e-05 Score=43.36 Aligned_cols=410 Identities=9% Similarity=0.023 Sum_probs=163.7 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccc-cccchHHHHHH---HH---------------h-----cchH Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDEL-RWPNSVYTYTR---MC---------------R-----EEAR 56 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-r~~~~~~~y~~---m~---------------~-----~D~~ 56 (484) ..||+.--.....+.+.........+.-+... ..... |..+..+.|+- +. + .... T Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~-~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~ 102 (492) T protein:vir:97 24 SQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQ-HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNF 102 (492) T ss_pred cchhhhhHhhhcccCCCchhhHHHHHHHHHHH-HHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccch Confidence 22222211111111110000000000000000 00000 00011111110 00 0 0112 Q ss_pred HHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEE Q lcl|NC_021302. 57 IASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQT 135 (484) Q Consensus 57 v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eiv 135 (484) ..-++.+...-+.+-+..+.. ++++..+++.+ ... -+|++.+.++ .++..||. +++++ T Consensus 103 ~k~Ivd~~~~yl~g~p~~~~~--~d~~~~~~l~~-----------------~~~-n~~~~~~~~~~~~~~~~G~-a~~~v 161 (492) T protein:vir:97 103 HANLVDQKVSYIVGKPIAFKH--TDDEVVKRIDE-----------------VLG-NRFDDKLHSVLTGASNKGI-EWLHP 161 (492) T ss_pred HHHHHHHHhhhhcccCceecc--CchHHHHHHHH-----------------HHh-ccHHHHHHHHHHHHhhcCe-EEEEE Confidence 222222333333333333322 22222222222 112 2566666665 57899997 56788 Q ss_pred EeecCCeeeeeeeeeeCcccee-eeeecCCCceeeeecccc-cccc------cccceeccCCC----------------- Q lcl|NC_021302. 136 YFYEGGRFWLKRLAPRPQSSIA-YWNVDRDGGLISIQQWPA-GTFG------GPGMVVMAPNS----------------- 190 (484) Q Consensus 136 w~~~~g~~~~~~l~~r~~~~~~-~~~~~~dg~l~~~~q~~~-~~~~------~~~~~~~~~~~----------------- 190 (484) |.-.+|.+. +...+|+.+. .|.....+.++....... .... ......+.... T Consensus 162 ~~d~dg~~~---~~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 238 (492) T protein:vir:97 162 YLDEEGEFK---LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTH 238 (492) T ss_pred EecCCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeecccccccccccc Confidence 876677654 3344555432 122122333332111100 0000 00000000000 Q ss_pred -CcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHH Q lcl|NC_021302. 191 -MGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIA 269 (484) Q Consensus 191 -~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l 269 (484) ....+..--++.| .+|+.|.|.+..+-...-.-...+..++..++.+.. |+++++-....+.. .....+ T Consensus 239 ~~~~~~g~vPvv~~-----~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~--~~l~~~g~~~~~~~---~~~~~~ 308 (492) T protein:vir:97 239 FSTGSWGKIPFIPF-----KNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNE--LTYVLKNYDDQELP---EFKRLL 308 (492) T ss_pred cccCCCCCcceEEe-----cCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcc--ceeeeecCCcccch---hHHHHH Confidence 0000111112222 236778999987655554556677888888887654 55554322222222 222222 Q ss_pred HHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH-HH--HHHHHHHHHHH Q lcl|NC_021302. 270 SNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALAS-VQ--ADTFVQSVQTV 346 (484) Q Consensus 270 ~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~e-vh--~~v~~~~~~aD 346 (484) . . ..++.++.+.+++++........++.+++.+.+.|.+.--...++.++-+|.. .|. .+ ..-....+..- T Consensus 309 ~---~--~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~-Sg~Al~~~~~~l~~ka~~~ 382 (492) T protein:vir:97 309 R---Y--YGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSAP-SGVALEFLYTNLNLKADKL 382 (492) T ss_pred h---h--ccceecCCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCCCCCccccccCc-HHHHHHHHHHHHHHHHHHH Confidence 2 1 23566789999999987766778999999999999888555555554433321 122 11 12222233344 Q ss_pred HHHHHHHHHHHHHHHHHHhC-CCCccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCCCCC Q lcl|NC_021302. 347 ADEIRDVAQAHVVEDIVDVN-WGEDEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGPDPD 423 (484) Q Consensus 347 ~~~i~~~ln~qli~~l~~~N-f~~~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p~~~ 423 (484) .+.+...+. ++++-++.+. ...+..--.+.|. ..+.++.+.++++.+|+ |+ ++.+.+.+.++. +.+..+ T Consensus 383 ~~~f~~~l~-~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~--G~-----iS~et~l~~l~~v~d~~~E 454 (492) T protein:vir:97 383 ARKAKVAIQ-ELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSM--GI-----VSHETVLENHPFVEDLQAE 454 (492) T ss_pred HHHHHHHHH-HHHHHHHHHhcCCcccceeeEEecCCCCCCHHHHHHHHHHHh--cc-----CchHHHHHhCCCCCCHHHH Confidence 455555553 3566555543 2222122256665 35567888899999884 64 245667777764 322211 Q ss_pred cccccc-cCCCcCCCcc-ccCCCCcccccccccccccc Q lcl|NC_021302. 424 ADDDES-TADTGQDEPE-TDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 424 e~~~~~-~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 459 (484) -+.... .....+..+. ...................+ T Consensus 455 leri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 492 (492) T protein:vir:97 455 LERIEQEQTEYNKQLPNLDDGGADSAQQQERSNNKESE 492 (492) T ss_pred HHHHHHHHHHHHHhhhccccCCCCCCcccccccccccC Confidence 000000 0000000000 00000000000000000001 No 165 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=97.35 E-value=8.7e-05 Score=42.85 Aligned_cols=420 Identities=12% Similarity=0.077 Sum_probs=167.2 Q ss_pred CC---CCCCCccc--eee--eecccccchhhhhhhcccccccc-cccccchHHHHHHHHhc---chHHHHHHHHHHHHhh Q lcl|NC_021302. 1 MA---PKTVAPRT--ERG--YVNPLAGFGTFLAQGLDQFEQVD-ELRWPNSVYTYTRMCRE---EARIASVLRAIGLPIR 69 (484) Q Consensus 1 ~~---~~~~~~~~--~~~--~~~~~~~~~~~~~~~~~~~~~~~-~lr~~~~~~~y~~m~~~---D~~v~s~l~~r~~~v~ 69 (484) |. .++|.-.. +.- +..-..|...-...+-.....+. +-.-.++-+.|+.=+.+ -.++...+......|. T Consensus 1 m~~V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~G~vf 80 (501) T protein:vir:95 1 MPNVSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLVGQVF 80 (501) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHhhhhh Confidence 33 22111100 000 00001111110111111111100 00011122334433322 3444445555555555 Q ss_pred CCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCC--eee-- Q lcl|NC_021302. 70 RTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGG--RFW-- 144 (484) Q Consensus 70 ~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g--~~~-- 144 (484) ..+-.++ .++..+.+.+++. ..+.+++.+++.+. .++.||.+.+=+.+-..++ ... T Consensus 81 ~k~p~~~----~p~~l~~l~~d~D---------------~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a 141 (501) T protein:vir:95 81 MRDPVVK----VPALLNPLVANAT---------------GSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIA 141 (501) T ss_pred cCCccee----CcHHHHHHHhccC---------------CCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHH Confidence 4444442 2222222222221 13557888888886 5778998866554422111 000 Q ss_pred -------eeeeeeeCccceeeeeecCCCc-----eeeeecc---cccccc----------------c--ccceecc---- Q lcl|NC_021302. 145 -------LKRLAPRPQSSIAYWNVDRDGG-----LISIQQW---PAGTFG----------------G--PGMVVMA---- 187 (484) Q Consensus 145 -------~~~l~~r~~~~~~~~~~~~dg~-----l~~~~q~---~~~~~~----------------~--~~~~~~~---- 187 (484) .-.|..+.+..|-=|+++..|+ ++.++.. ..+.++ . .+.+... T Consensus 142 ~~~~~~~rPy~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~ 221 (501) T protein:vir:95 142 DLEAGRIRPTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTK 221 (501) T ss_pred HHHhccCCcEEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCcc Confidence 0113333333332333333332 0111100 000000 0 0000000 Q ss_pred -------------------CCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHH-HHHHHHHHhcCCcc Q lcl|NC_021302. 188 -------------------PNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIR-IEAAAIRRHGIGVP 247 (484) Q Consensus 188 -------------------~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~-~w~~f~Er~~~G~P 247 (484) ....+..++.=-|+++..... +--.+...|..++.. -.+++... +.-.-+-.-++|+| T Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~-~~~~~~pPLl~lA~l-ni~hy~~ssd~~~~l~~~~~P~l 299 (501) T protein:vir:95 222 ADGSKIPKGNYQQYVVYKPTDAQGKRLTEIPFMFIGSENN-DSNPDNPNFYDLASL-NMAHYRNSADYEESCYIVGQPTP 299 (501) T ss_pred cCcceecCCcccccceeeeeccCCCcCCeeeEEEEecCCC-CCCCCccchHHHHHH-HHHHHhhhhHHHHHHHHccccee Confidence 000112222222443322222 212233444455532 23333332 22222332245666 Q ss_pred eEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccc--- Q lcl|NC_021302. 248 YLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDG--- 324 (484) Q Consensus 248 ~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~--- 324 (484) ++.|.... ....... ..+.-|.++++.+|++.++.+++.++++- .+..++....+|..+ |..|...+ T Consensus 300 ~i~G~~~~-----~~~~~~~--~~i~~G~~~~~~lP~~~~~~~ie~~~~~i-~~~~l~~l~~~m~~~--Ga~ll~~~~~~ 369 (501) T protein:vir:95 300 VLIGLTEE-----WVTNVLK--GSVNFGSRGGIPLPVGADAKLLQASENTM-LKEAMDTKERQMVAL--GAKLVEQKEVQ 369 (501) T ss_pred eeeCCccc-----ccccCCC--CceeecccccccCCCCCceeEEecChhhH-HHHHHHHHHHHHHHH--HHhhccCCccc Confidence 66553321 1110000 12344778889999999999999987654 356666666776553 44443322 Q ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC--CC-CcHHHHHHHHHHHHhcCcc Q lcl|NC_021302. 325 KGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE--IG-SRQDATAAALQMLVNAGLL 401 (484) Q Consensus 325 ~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~--~~-~~~~~~ae~~~~L~~~G~~ 401 (484) .+++.+ ..........+..-+..+++.++ +++++++.+-.-.+. -++|++.. .. ......++++.++.+.|.+ T Consensus 370 ~Ta~~~--~~~~~~~~S~L~~~a~~le~al~-~~l~~~a~w~g~~~~-~~~v~i~~df~~~~~~~~~~~al~~~~~~G~i 445 (501) T protein:vir:95 370 RTATEA--ELEAASEGSTLSSATKNVSAAFE-WALKWAARWVGQADS-GVKFELNTDFDIARMTPDERRSLVEEWQKGAI 445 (501) T ss_pred hhHHHH--HHHHHHHhHHHHHHHHHHHHHHH-HHHHHHHHHcCCCCC-ceEEEEecccccccCCHHHHHHHHHHHhCCCC Confidence 122222 23344445677888899999996 599998887532222 23565532 11 2234456777788888874 Q ss_pred cCCcccHHHHHHHhCCCCCCCCcccc--cccCCCcCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 402 TPDPRLEAFLRDAAGLPGPDPDADDD--ESTADTGQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 402 ~~~~~~~~~i~e~~glp~p~~~e~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) .. ...-.++ .+.|++.+....+.. ....+.....+..+....+...|+....+ + T Consensus 446 s~-~t~~~~L-~~~~v~~~~~~~e~e~i~~~~~~~~~~~~~~~~~~~~~gg~~~~~~--~ 501 (501) T protein:vir:95 446 TF-EEMRTGL-RKAGVATEDDSKAKEKIAKDTAEAMALATPANVPGDGSGGDNVGNS--E 501 (501) T ss_pred cH-HHHHHHH-HhCCCCChhHHHHHHHHHhhhcCcccccccCCCCCCCcccccccCC--C Confidence 32 1112233 455887664322211 11111111111111111112222221111 1 No 166 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=97.22 E-value=0.00012 Score=41.99 Aligned_cols=411 Identities=9% Similarity=-0.018 Sum_probs=159.2 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHH---HHh-------------cchHHHHHHHHH Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTR---MCR-------------EEARIASVLRAI 64 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~---m~~-------------~D~~v~s~l~~r 64 (484) || -..+.-.+..........+.-+...-....-|..+..+.|+- ++. ...+..-++.+. T Consensus 1 ~~-----~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~Iv~~~ 75 (499) T protein:vir:10 1 MA-----VVIDKDLLDDVNEPNIEAINYAIRELQNRKKRLDKLSDYYNGKQEIEKHEFDNATVEAANVMVNHAKYITDMN 75 (499) T ss_pred Cc-----cchhhhHHhhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCcCcCCCCcceeecchHHHHHHHH Confidence 10 000000000000000000000000000000000011111110 000 011112222222 Q ss_pred HHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCee Q lcl|NC_021302. 65 GLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRF 143 (484) Q Consensus 65 ~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~ 143 (484) ...+.+.+..+.. ..+.....+.+.....+|+..+..+ .++..||. +++++|...+|.. T Consensus 76 ~~~l~g~p~~~~~-------------------~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~-~~~~v~~~~~g~~ 135 (499) T protein:vir:10 76 VGFMTGNPVKYVA-------------------EKGKNIDDILEVFNQIDIHKHDIELEKDLSVFGY-GYELLYLKKTDPI 135 (499) T ss_pred hhhhcccCceeec-------------------CChhHHHHHHHHHhhcCHhHHHHHHHHHHHhcCc-eEEEEEecccccc Confidence 2222222222221 1122222233344444677655555 57999997 5677777666533 Q ss_pred ee--------------eeeeeeCccceeeeeecCCCc--ee------------------eeecccccccccccceeccC- Q lcl|NC_021302. 144 WL--------------KRLAPRPQSSIAYWNVDRDGG--LI------------------SIQQWPAGTFGGPGMVVMAP- 188 (484) Q Consensus 144 ~~--------------~~l~~r~~~~~~~~~~~~dg~--l~------------------~~~q~~~~~~~~~~~~~~~~- 188 (484) .+ .++...+|+.... .++.... ++ ++.-+... ....+.. T Consensus 136 ~~~~~~~~~~~~~~~~~~~~~v~p~~~~~-v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~-----~i~~~~~~ 209 (499) T protein:vir:10 136 SVRDELGNEKLTPNTELKIEVIDPRATVV-VCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQ-----RIVEYRTK 209 (499) T ss_pred cccccccccccccccceEEEEEcccceEE-EecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCC-----eEEEEEec Confidence 21 1234444443211 1111110 11 11000000 0000000 Q ss_pred ------------CCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCC Q lcl|NC_021302. 189 ------------NSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADS 256 (484) Q Consensus 189 ------------~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~ 256 (484) ......+..--++.|+ +|+.|.|.+..+-...-.-...+..++..++.+.+++.++.|... T Consensus 210 ~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~-- 282 (499) T protein:vir:10 210 TTMEVSANDPIVYDGENLFGAVPIIEFR-----NNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGL-- 282 (499) T ss_pred CCccccCcceecccccCCCCccceEEec-----CCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcc-- Confidence 0001111111233333 356788888876665555567778889999988776666665321 Q ss_pred CCHHHHHHHHHHHHHHhcCCceEEE--ccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH- Q lcl|NC_021302. 257 EDDDRMDELLEIASNYSGGESAGLA--LTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALAS- 333 (484) Q Consensus 257 ~~~~~~~~l~~~l~~~~~g~~a~~v--ip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~e- 333 (484) .++.+. ...+..+ .... .+.+.++++++.......++.+++.+.+.|...--...++.+.-+|. ..|. T Consensus 283 ~~~~~~------~~~~~~~--~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn-~Sg~A 353 (499) T protein:vir:10 283 GDDKDD------IQRLKRG--AIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKFMGN-VSGEA 353 (499) T ss_pred ccccch------hhhhhhc--ceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhhccc-chHHH Confidence 111111 1111112 1222 34667788888766667899999999999988644444444332221 1111 Q ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CC-CC--ccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcc Q lcl|NC_021302. 334 --VQADTFVQSVQTVADEIRDVAQAHVVEDIVDV-NW-GE--DEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPR 406 (484) Q Consensus 334 --vh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~-Nf-~~--~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~ 406 (484) ....-....+..-.+.+...++ ++++.++.+ |. +. +..-..+.|. ..+.+..+.++.+++|. |+ + T Consensus 354 l~~~~~~l~~k~~~k~~~~~~~l~-~~~~li~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~--g~-----i 425 (499) T protein:vir:10 354 MKFKLFGLENLLSIKQRYFFDGLR-RRLKLIQTIVNIKGANDDASGCKISLVANIPSNLSDVVNNVKNAD--GI-----I 425 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCCccccccceEEeCCCCCCCHHHHHHHHHHHh--cc-----C Confidence 1122223334444566666664 466666654 21 11 1122467775 45677888999999983 54 2 Q ss_pred cHHHHHHHhCC-CCCCCC-ccccc----------ccCCCcCC-C-ccccCCCCccccccccccccccccccccccch Q lcl|NC_021302. 407 LEAFLRDAAGL-PGPDPD-ADDDE----------STADTGQD-E-PETDEPALPNTSGTTSTTNAPQARKRPRGRSP 469 (484) Q Consensus 407 ~~~~i~e~~gl-p~p~~~-e~~~~----------~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (484) +.+.+.+.++. +.+..+ +.... ....+..+ . ........++....... .+..+.-.+|+. T Consensus 426 S~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~ 499 (499) T protein:vir:10 426 PRKYTYSWLPDVDNPQDVIDEMNQQDAETIKKNQEALRGQDPDRLELEDKQDDSSENDKEAG---SNHNQSHRTRAV 499 (499) T ss_pred ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCCCcccCCCCCCCc---cccccCCCCCCC Confidence 45566666543 212110 00000 00000000 0 00000000000000000 011111122222 No 167 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=97.22 E-value=0.00013 Score=41.94 Aligned_cols=431 Identities=13% Similarity=0.102 Sum_probs=186.7 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcc-hHHHHHHHHHH------------HH Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREE-ARIASVLRAIG------------LP 67 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D-~~v~s~l~~r~------------~~ 67 (484) |+ ++.+ .-+.-.|.+++..+...... + .+-++-..+++|+++=.++ .++.++++-+. .. T Consensus 1 ~~--~~~~--~~~~~~~~~~g~~~~p~~v~---~-~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~ 72 (527) T protein:vir:10 1 MG--QDKR--QYGSTQQLRAGEANFPNAVT---D-FDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEK 72 (527) T ss_pred CC--cccc--ccCCCcCcCCccccCcccCC---H-HHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHH Confidence 22 1111 11112222222211111111 1 1122334577777775443 24444444333 34 Q ss_pred hhCCCcEEecCCCC---HHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEee---cC Q lcl|NC_021302. 68 IRRTDWRIRPNGAR---PEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFY---EG 140 (484) Q Consensus 68 v~~~~~~v~p~~~~---~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~---~~ 140 (484) |.+..-+|.-++.+ ++-.+.|.+.|+.+.. +.+|+....+. .++..-|=.|+=+.|.. .+ T Consensus 73 ~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~-------------~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~ 139 (527) T protein:vir:10 73 LIEAKMRFLGQGLKWEFSKKDAKVDDAIKVLFD-------------RENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEG 139 (527) T ss_pred hhCCcceeeccCccccccchhHHHHHHHHHHHH-------------HhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcC Confidence 55555555443322 2233345555544322 22455444444 47899999999999984 34 Q ss_pred CeeeeeeeeeeCccceeeeeecCCCc-eeeee--------------------------cccccc--ccc-----ccceec Q lcl|NC_021302. 141 GRFWLKRLAPRPQSSIAYWNVDRDGG-LISIQ--------------------------QWPAGT--FGG-----PGMVVM 186 (484) Q Consensus 141 g~~~~~~l~~r~~~~~~~~~~~~dg~-l~~~~--------------------------q~~~~~--~~~-----~~~~~~ 186 (484) ++..+.. .+|..+.-+.-+.+.+ .+++. ....+. .+. ...+.. T Consensus 140 ~R~~v~~---~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~l 216 (527) T protein:vir:10 140 SRLSLHE---VDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEP 216 (527) T ss_pred CCceEee---cCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeec Confidence 4554433 2333221111110111 11110 000010 000 000000 Q ss_pred cCCC---------------------Cccc--ccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021302. 187 APNS---------------------MGPA--IPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHG 243 (484) Q Consensus 187 ~~~~---------------------~~~~--lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~ 243 (484) .... +..+ |..=-+++|...+..+..||.|-|..+--..---+....+....++- T Consensus 217 g~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~-- 294 (527) T protein:vir:10 217 GKWDDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVF-- 294 (527) T ss_pred cccccccccccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHH-- Confidence 0000 0111 11123556677778899999999998877777777777777777774 Q ss_pred CCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhccc Q lcl|NC_021302. 244 IGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLD 323 (484) Q Consensus 244 ~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~ 323 (484) .|.|+++.+--+..+ .+.+. ..+.-+.-+..-+|++.++..++.......|...++++.++|+..---.-..++ T Consensus 295 sG~Pi~~~tg~~~vd--~~G~~----~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G 368 (527) T protein:vir:10 295 GGLGFYATDSAPPRD--SRGNM----VPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVG 368 (527) T ss_pred hCCceeeeccccccc--ccCCc----CccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeec Confidence 377777664222211 11111 112123223444889999999988777778999999999988877433333332 Q ss_pred ccccchhhH----HHHHHHHHHHHHHHHHHHHHHHH---H-HHHHHHHHhC-CCCccc----cceEEec-CCCCcHHHHH Q lcl|NC_021302. 324 GKGGSYALA----SVQADTFVQSVQTVADEIRDVAQ---A-HVVEDIVDVN-WGEDEP----APLLVFD-EIGSRQDATA 389 (484) Q Consensus 324 ~~gGs~A~~----evh~~v~~~~~~aD~~~i~~~ln---~-qli~~l~~~N-f~~~~~----~P~~~~~-~~~~~~~~~a 389 (484) .-+.+.+.+ ++...-...........+..++. + -+++||-.+- ++..+. .-++.|. ..+.+.+... T Consensus 369 ~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avi 448 (527) T protein:vir:10 369 VVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNSEKRF 448 (527) T ss_pred cccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHH Confidence 111122222 33333334444444332333331 1 1223322211 222211 2267886 4567888888 Q ss_pred HHHHHHHhcCcccCCcccHHHHHHHh----CCCCCCCCcccc-cccCCCcCCCccccCCCCccccccccccccccccccc Q lcl|NC_021302. 390 AALQMLVNAGLLTPDPRLEAFLRDAA----GLPGPDPDADDD-ESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRP 464 (484) Q Consensus 390 e~~~~L~~~G~~~~~~~~~~~i~e~~----glp~p~~~e~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 464 (484) +.+.+|+..|++- ....-+++ |+..++.+-... ...+......++...+ .....++. ...+ T Consensus 449 e~v~tL~~aGi~S-----~~tAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~A~~~-~~a~~~~~--------~g~~ 514 (527) T protein:vir:10 449 NQLLQLWEAGLIP-----AKKLTEELSKIMGFELTEEDFKQATEDKKTQGIAQAEAADP-FGAQMAAE--------QGIP 514 (527) T ss_pred HHHHHHHHcCchh-----HHHHHHHHHhccCCCChHHHHHHHHHHHHHHhHHhhhhcCc-hhhhhccc--------cCCC Confidence 9999999999853 34555555 654443221110 0000000000000000 00000000 0000 Q ss_pred cccchHHHhcCcccCccc Q lcl|NC_021302. 465 RGRSPRDRRKTPDGAMPL 482 (484) Q Consensus 465 ~~~~~~~~~~~~~~~~~~ 482 (484) .......-|-. +| T Consensus 515 ~~~~d~~~~~~-----~~ 527 (527) T protein:vir:10 515 DEEDDQALNGQ-----PL 527 (527) T ss_pred CCCcccccCCC-----CC Confidence 00000000100 11 No 168 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=97.20 E-value=0.00013 Score=41.82 Aligned_cols=431 Identities=14% Similarity=0.103 Sum_probs=186.7 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcc-hHHHHHHHHHH------------HH Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREE-ARIASVLRAIG------------LP 67 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D-~~v~s~l~~r~------------~~ 67 (484) |+ ++.+ .-+.-.|.+++..+...... + .+-++-..+++|+++=.++ .++.++++-+. .. T Consensus 1 ~~--~~~~--~~~~~~~~~~g~~~~p~~v~---~-~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~ 72 (527) T protein:vir:10 1 MG--QDKR--QYGSTQQLRAGEANFPNAVT---D-FDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEK 72 (527) T ss_pred CC--cccc--ccCCCcCcCCccccCcccCC---H-HHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhhHH Confidence 22 1111 11112222222211111111 1 1122334577777775443 24444444333 34 Q ss_pred hhCCCcEEecCCCC---HHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEee---cC Q lcl|NC_021302. 68 IRRTDWRIRPNGAR---PEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFY---EG 140 (484) Q Consensus 68 v~~~~~~v~p~~~~---~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~---~~ 140 (484) |.+..-+|.-++.+ ++-.+.|.+.|+.+.. +.+|+....+. .++..-|=.|+=+.|.. .+ T Consensus 73 ~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~-------------~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~ 139 (527) T protein:vir:10 73 LIEAKMRFLGQGLKWEFSKKDAKVDDAIRVLFD-------------RENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEG 139 (527) T ss_pred hhCCcceeeccCccccccchhHHHHHHHHHHHH-------------HhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcC Confidence 55555555443322 2233345555544322 22455444444 47899999999999984 34 Q ss_pred CeeeeeeeeeeCccceeeeeecCCCc-eeeee--------------------------cccccc--ccc-----ccceec Q lcl|NC_021302. 141 GRFWLKRLAPRPQSSIAYWNVDRDGG-LISIQ--------------------------QWPAGT--FGG-----PGMVVM 186 (484) Q Consensus 141 g~~~~~~l~~r~~~~~~~~~~~~dg~-l~~~~--------------------------q~~~~~--~~~-----~~~~~~ 186 (484) ++..+.. .+|..+.-+.-+.+.+ .+++. ....+. .+. ...+.. T Consensus 140 ~R~~v~~---~DP~~~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~l 216 (527) T protein:vir:10 140 SRLSLHE---VDPSTYFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEP 216 (527) T ss_pred CCceEee---cCcceeeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeec Confidence 4554433 2333221111110111 11110 000010 000 000000 Q ss_pred cCCC---------------------Cccc--ccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021302. 187 APNS---------------------MGPA--IPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHG 243 (484) Q Consensus 187 ~~~~---------------------~~~~--lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~ 243 (484) .... +..+ |..=-+++|...+..+..||.|-|..+--..---+....+....++- T Consensus 217 g~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~-- 294 (527) T protein:vir:10 217 GKWDDRPESPLEPDDIKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVF-- 294 (527) T ss_pred cccccccccccchhhhhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHH-- Confidence 0000 0111 11123556677778899999999998877777777777777777774 Q ss_pred CCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhccc Q lcl|NC_021302. 244 IGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLD 323 (484) Q Consensus 244 ~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~ 323 (484) .|.|+++.+--+..+ .+.+. ..+.-+.-+..-+|++.++..++.......|...++++.++|+..---.-..++ T Consensus 295 sG~Pi~~~tg~~~vd--~~G~~----~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G 368 (527) T protein:vir:10 295 GGLGFYATDSAPPRD--SRGNM----VPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVG 368 (527) T ss_pred hCCceeeeccccccc--ccCCc----CccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeec Confidence 377777664222211 11111 112223223444889999999988777778999999999988877433333332 Q ss_pred ccccchhhH----HHHHHHHHHHHHHHHHHHHHHHH---H-HHHHHHHHhC-CCCccc----cceEEec-CCCCcHHHHH Q lcl|NC_021302. 324 GKGGSYALA----SVQADTFVQSVQTVADEIRDVAQ---A-HVVEDIVDVN-WGEDEP----APLLVFD-EIGSRQDATA 389 (484) Q Consensus 324 ~~gGs~A~~----evh~~v~~~~~~aD~~~i~~~ln---~-qli~~l~~~N-f~~~~~----~P~~~~~-~~~~~~~~~a 389 (484) .-+.+.+.+ ++...-...........+..++. + -+++||-.+- ++..+. .-++.|. ..+.+.+... T Consensus 369 ~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avi 448 (527) T protein:vir:10 369 VVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNNEKRF 448 (527) T ss_pred cccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHH Confidence 111122222 33333334444444332333331 1 1223322211 222211 2267886 4567888888 Q ss_pred HHHHHHHhcCcccCCcccHHHHHHHh----CCCCCCCCc-ccccccCCCcCCCccccCCCCccccccccccccccccccc Q lcl|NC_021302. 390 AALQMLVNAGLLTPDPRLEAFLRDAA----GLPGPDPDA-DDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRP 464 (484) Q Consensus 390 e~~~~L~~~G~~~~~~~~~~~i~e~~----glp~p~~~e-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 464 (484) +.+.+|+..|++- ....-+++ |+..++.+- .+....+......++...+ .....++.. ..+ T Consensus 449 e~v~tL~~aGiiS-----~etAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~a~~~-~~a~~~~~~--------g~~ 514 (527) T protein:vir:10 449 AQLLELWEAGLIP-----AKKLTEELSKIMGFELTEEDFRQATEDKKTQGIAQAEAADP-FGAQMAAEQ--------GIP 514 (527) T ss_pred HHHHHHHHcCchh-----HHHHHHHHHhccCCCchHHHHHHHHHHHHHHhHHhhhhcCc-hhhhhcccc--------CCC Confidence 9999999999853 34555555 654443221 1111000000000000000 000000000 000 Q ss_pred cccchHHHhcCcccCccc Q lcl|NC_021302. 465 RGRSPRDRRKTPDGAMPL 482 (484) Q Consensus 465 ~~~~~~~~~~~~~~~~~~ 482 (484) .......-|-. +| T Consensus 515 ~~~~d~~~~~~-----~~ 527 (527) T protein:vir:10 515 DEEDDQALNGQ-----PL 527 (527) T ss_pred CCCcccccCCC-----CC Confidence 00000000100 11 No 169 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=97.17 E-value=0.00014 Score=41.64 Aligned_cols=426 Identities=12% Similarity=0.053 Sum_probs=169.0 Q ss_pred CCCCC---CCccceeeeec----------ccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHH Q lcl|NC_021302. 1 MAPKT---VAPRTERGYVN----------PLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLP 67 (484) Q Consensus 1 ~~~~~---~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~ 67 (484) |+-+. +..+.+ .|.. -..|.......+-........- ....++-|-.-.-.-+++...++..... T Consensus 1 m~~~~~~~v~~~h~-~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E-~~~~Y~~rl~rA~~~n~~~~tl~~l~G~ 78 (513) T protein:vir:97 1 MADKDPKSPATTSG-AYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEE-TDKGYQERLASAVLLNMVEQTLDTLSGK 78 (513) T ss_pred CCCCCCCCCCcCCH-HHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCC-CHHHHHHHHhcccCCChHHHHHHHHhhh Confidence 54443 222222 1111 1111111111111111111100 0112333322222345556666666666 Q ss_pred hhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCC----- Q lcl|NC_021302. 68 IRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGG----- 141 (484) Q Consensus 68 v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g----- 141 (484) |.+.+-.++.. ..++..+.+.+++. ..+.+++.+++.+. .++.||.+.+=+.+-...+ T Consensus 79 vf~k~p~~~~~-~p~~~~~~l~~d~D---------------~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~ 142 (513) T protein:vir:97 79 PFSEPIKLNED-VPKAIEETILPDVD---------------LQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQ 142 (513) T ss_pred hhhcCcccCcC-chHHHHHHHhhccC---------------CCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchh Confidence 66555444321 11222222222211 13557889998887 4899998765554421110 Q ss_pred ----------eeeeeeeeeeCccceeeeeecCCCc--------------------eeeeecccccccccccceecc---- Q lcl|NC_021302. 142 ----------RFWLKRLAPRPQSSIAYWNVDRDGG--------------------LISIQQWPAGTFGGPGMVVMA---- 187 (484) Q Consensus 142 ----------~~~~~~l~~r~~~~~~~~~~~~dg~--------------------l~~~~q~~~~~~~~~~~~~~~---- 187 (484) ... -.+..+.+..|-=|+++..++ -....|.---..+....+... T Consensus 143 ~~T~Ade~~~~~r-Py~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~~~~~~ 221 (513) T protein:vir:97 143 PRTLADDRREGLR-PYWVMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEPVKKSN 221 (513) T ss_pred HHhHHHHHhhccC-ceEEEecHhhhcCcceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEEEeCceEEEEEeecCCC Confidence 000 112222222222222222111 000000000000111110000 Q ss_pred --------CCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCH Q lcl|NC_021302. 188 --------PNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDD 259 (484) Q Consensus 188 --------~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~ 259 (484) ....+..|..=-|+++... ..+-..+.+.|..++..-+---....+.-..+..-++|+|++.|-... T Consensus 222 ~~~~e~~~~~~g~~~l~~IP~v~~~~~-~~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G~~~~---- 296 (513) T protein:vir:97 222 AQKEEWALADEWATGLNYVPLVTFYAD-RQGFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACSGASGE---- 296 (513) T ss_pred ccccceEEecCCCCcCCceeEEEEecC-CCCCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeeecCCcC---- Confidence 0011122222234443322 223334555566665443322233333333333335677776553211 Q ss_pred HHHHHHHHHHHHHhcCCceEEEccC-CceEEEecccCCc-hhHHHHHHHHHHHHHHHHhhhhhcccccc-cchhhHHHHH Q lcl|NC_021302. 260 DRMDELLEIASNYSGGESAGLALTA-GEEAGILSPNGTP-LDPRRAIEYHDHQMALVALAHFLNLDGKG-GSYALASVQA 336 (484) Q Consensus 260 ~~~~~l~~~l~~~~~g~~a~~vip~-~~~ie~~~~~~~~-~~~~~li~~~d~~Isk~ilGqtlt~~~~g-Gs~A~~evh~ 336 (484) ..+ .+.-|.++++.+|. +.++.+++.++++ ......++...++|.. +|-.|...+.+ -|--...... T Consensus 297 -~~~-------~i~iG~~~~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~qm~~--~Ga~ll~~~~~~~Ta~a~~~~~ 366 (513) T protein:vir:97 297 -DSD-------PVVVGPNKVLYNPDPAGRFYYVEHTGQAIAAGRTDLKDLEEQMAG--YGAEFLKRKTGGQTATARALDS 366 (513) T ss_pred -CCC-------ceEeeccccccCCCCCCcceeeccCchhHHHHHHHHHHHHHHHHH--HHHHhhccCCccccHHHHHHHH Confidence 111 13448788888995 8899999999776 4577788888888843 34443332221 1222223445 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC-C-CCc-HHHHHHHHHHHHhcCcccCCcccHHHHHH Q lcl|NC_021302. 337 DTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE-I-GSR-QDATAAALQMLVNAGLLTPDPRLEAFLRD 413 (484) Q Consensus 337 ~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~-~-~~~-~~~~ae~~~~L~~~G~~~~~~~~~~~i~e 413 (484) ......+.+-+..+++.++ ++++++..+-.. ....++|++.. . ... .....+++.++.+.|.+.. +.++++ T Consensus 367 ~~~~S~L~~~a~~le~al~-~~l~~~a~wlg~-~~~~~~v~in~dF~~~~~~~~~~~al~~a~~~G~is~----~t~~~~ 440 (513) T protein:vir:97 367 AEATSDLSAMTGLFEDALA-QALDITADWLRL-GPNGGTVELVKDYDLEEMDAPGLQALQVAREKRDISR----KTYLNG 440 (513) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCC-CCCccEEEeccccCcccCCHHHHHHHHHHHhCCCCCH----HHHHHH Confidence 5566677778888999996 599998887532 22234666532 1 112 2334456667777776431 222222 Q ss_pred --HhCCCCCCCC--c---cc-ccccCCCc---CC-CccccCC--------CCccccccccccccccccccccccc Q lcl|NC_021302. 414 --AAGLPGPDPD--A---DD-DESTADTG---QD-EPETDEP--------ALPNTSGTTSTTNAPQARKRPRGRS 468 (484) Q Consensus 414 --~~glp~p~~~--e---~~-~~~~~~~~---~~-~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~ 468 (484) +-|+=.|+.. + .+ .......+ .+ .+....+ ..++..++. .........|.+.+ T Consensus 441 L~r~gvl~~d~d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 513 (513) T protein:vir:97 441 LRLRGVLPEDFDEDEDWEELMEEISEAMGRAGLDLDPAQKNPPEGGEGEGEGEGEGGEG--GEGGEGGGNPGGES 513 (513) T ss_pred HHhccCCCccCCHHHHHHHHHHhhhhccCCCCccccccCCCCCCCCCCCCCCCCCCCCC--CCccccCCCCCCCC Confidence 2344212111 1 11 10000000 00 0000000 001111111 01111222333332 No 170 >protein:vir:105154 Length: 525 # NCBI annotation: conserved phage-related protein # Family: family:all:6660 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398597;genbank:gi:80159853;genbank:GeneID:3772992 Probab=97.16 E-value=0.00015 Score=41.60 Aligned_cols=435 Identities=13% Similarity=0.123 Sum_probs=187.0 Q ss_pred CCC---CCCCcc-----c-eeeeecccccchh-------hhhhhccccccccccc-ccchHHHHHHHHhcchHHHHHHHH Q lcl|NC_021302. 1 MAP---KTVAPR-----T-ERGYVNPLAGFGT-------FLAQGLDQFEQVDELR-WPNSVYTYTRMCREEARIASVLRA 63 (484) Q Consensus 1 ~~~---~~~~~~-----~-~~~~~~~~~~~~~-------~~~~~~~~~~~~~~lr-~~~~~~~y~~m~~~D~~v~s~l~~ 63 (484) |-- .|=.+. + -.|.+....+.|. ..-.+ ...|++-. .-..+..|=-+ .|+.|.... . T Consensus 27 ~~n~~~~~y~ty~~~~~~f~~gfv~~~~~ng~i~~v~~~~l~~~---f~npd~~~~~i~~l~~y~yi--~~~~v~ql~-~ 100 (525) T protein:vir:10 27 HINELERQYNTYDDVVDAFIDGFVMDLCNNGKIKTVNLDTLQLW---FNNPDKYINNIVNLLTYYYI--IDGNVFQLY-D 100 (525) T ss_pred HHhhhhhhcchhhhHHHHHHHHHHHHhhcCCceeeeeHHHHHhh---hcChHHHHHHHHHHHHHhhh--hcchHHHHH-H Confidence 000 000000 0 0001111100000 00000 01111110 01123334344 377777644 4 Q ss_pred HHHHhhCCCcEEecCC---CCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHH-----Hhhc------- Q lcl|NC_021302. 64 IGLPIRRTDWRIRPNG---ARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKS-----LQFG------- 128 (484) Q Consensus 64 r~~~v~~~~~~v~p~~---~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a-----~~~G------- 128 (484) .+.++-.+++.|..-. +..+....+.-.|+..+. .+.+ =.+++.++..| +|-| T Consensus 101 li~~lp~l~y~i~~~~~~k~~~~~~s~~n~~l~k~i~--------hk~l----trdll~q~a~~gtlig~wlg~~~~py~ 168 (525) T protein:vir:10 101 LIFSLPPLDYQIKVLKRDKDYKEDLSTINLYLEKKIQ--------HKQL----TRDLLVQLAHSGTLIGTWLGSKREPYF 168 (525) T ss_pred HHHhcCCcceeehhhhhccchhhHHHHHHHHHHHhHH--------HHHH----HHHHHHHhhccCceeEeeecCCCCcch Confidence 5567778888885432 223333333322222111 0000 12333333321 1111 Q ss_pred ceeeeEEE----eecCCeeeeeeeeeeCccceeeeeecCCCcee-------eeecccccccccccceeccCCCCcccccc Q lcl|NC_021302. 129 HAVFEQTY----FYEGGRFWLKRLAPRPQSSIAYWNVDRDGGLI-------SIQQWPAGTFGGPGMVVMAPNSMGPAIPV 197 (484) Q Consensus 129 ~s~~Eivw----~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~-------~~~q~~~~~~~~~~~~~~~~~~~~~~lp~ 197 (484) |-+-|+-| .+..|.|. ..++-.||.. +.++-+-. .+.|..-..+-+.. -........++||. T Consensus 169 ~vf~~~kyvfp~~r~~g~~v----~vid~~~f~~--~~~~~r~~~~~~lsp~i~~~~y~~~~~~~-~~~~~~~r~i~LP~ 241 (525) T protein:vir:10 169 NVFNNLKYVFPYGRAKGKMV----AVIDLQWFDE--MSELERKLTFENLSPLITENKYKKWKEYN-GENEDALRYIMLPI 241 (525) T ss_pred hhhhhhhhhccccccCCceE----EEEehHHhhh--hhHHHHHHHHHhhchhhhhhhhhHHhhcc-cccchhheeeeccc Confidence 11112221 12233321 1222223321 11111100 11111100000000 00111223467999 Q ss_pred cceEEEeecCccCccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEe--cCCCCC--CHHHHHHHHHHHHHH Q lcl|NC_021302. 198 EQLVVYTHDMDPGVWT-GNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKG--NEADSE--DDDRMDELLEIASNY 272 (484) Q Consensus 198 ~k~l~~~~~~~~~~p~-G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~g--k~~~~~--~~~~~~~l~~~l~~~ 272 (484) ++.++.+...-..||- |.++.-+.+.....|+.........+.|...++-++.. +.+... .+..+++++.-|+.+ T Consensus 242 e~t~~lr~~tl~rnqrlG~s~vtp~l~dI~hk~klrd~EqsIA~kii~a~avLk~gg~~gn~mk~p~~~kqkil~gVk~a 321 (525) T protein:vir:10 242 SKTLVARIHTLSRNQRLGIPYGTQTLFDIQHKQKLRDLEQSIADKIIKAMAVLKFRGKDDNDSKVKESAKRKVLAGVKRA 321 (525) T ss_pred ceeEEeeecccccCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHhhhhheeeeeccccCccccCchHHHHHHHHHHHHH Confidence 9999999888777777 99999999999999999999999999998765554432 222221 233344444444443 Q ss_pred hcC---C---ceEEEccCCceEEEecccC--CchhHHHHHHHHHHHHHHHH-hhhhhcccccccchhhHHHHHHHHHHHH Q lcl|NC_021302. 273 SGG---E---SAGLALTAGEEAGILSPNG--TPLDPRRAIEYHDHQMALVA-LAHFLNLDGKGGSYALASVQADTFVQSV 343 (484) Q Consensus 273 ~~g---~---~a~~vip~~~~ie~~~~~~--~~~~~~~li~~~d~~Isk~i-lGqtlt~~~~gGs~A~~evh~~v~~~~~ 343 (484) ... . -+++++|.-.+|+|-+... .+-+-. -.+..+..|.-+. +.+.|++ ++||.||.+.+..++|-..+ T Consensus 322 leK~~kdK~Gi~vi~~Pdfa~~efp~ik~~~~glDg~-K~d~I~~DI~~A~GlS~sL~n-GdggNyAtaslnld~fykki 399 (525) T protein:vir:10 322 LEKGVKDKNGIACIAMPDFATFEFPEIKNGDKTLDPK-KYDSIDNDITNATGISQVLTN-GTKGNYASAKLNLDVFYKKI 399 (525) T ss_pred HhcccccccCeEEEeccceeecccccccCcccCCCch-hhhhhhhhhhhhhccceeeec-CCCCceeeeeeeHHHHHHHH Confidence 221 1 2455679999999876533 222222 4566788888775 4566766 47899999999999998888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEe--cCC-CCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCC Q lcl|NC_021302. 344 QTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVF--DEI-GSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGP 420 (484) Q Consensus 344 ~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~--~~~-~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p 420 (484) ---.+.|.++-| +|+. +-|+.+.. ..+.| +.. +-+.++..+.+-+|.+.|+.. .++-+..|+.-. T Consensus 400 gVm~e~Iee~y~-kL~d----~Vl~~~k~-~nyifnydkd~pi~~kkk~d~LIkL~d~g~s~------k~vldl~gis~e 467 (525) T protein:vir:10 400 GVMLEIIEEIYN-QLID----IILGEEKG-CNYIFQYNKDTPIEREKKLDTLIKLEAQGYSA------KYVLDILGISSE 467 (525) T ss_pred HHHHHHHHHHHH-HHHh----hhcCcccC-cceEEecCCCchhhhhhhhhhhhhhhccchhh------hhhhhhhccCcc Confidence 777777775554 3555 44554332 23444 322 234466667788888888743 344444555322 Q ss_pred CCCcccccccC----CCcCCCccccCCCCccccccccccccccccccccccchHHHhcCcccC Q lcl|NC_021302. 421 DPDADDDESTA----DTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSPRDRRKTPDGA 479 (484) Q Consensus 421 ~~~e~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (484) +.-|...-... +...-.|..... .++..+...-....+......+.-.+. .+|- T Consensus 468 ~y~E~s~yEtE~lkl~EKi~pp~~~~v-~SGk~~n~iG~P~~dd~~~~dati~s~----~~~~ 525 (525) T protein:vir:10 468 EYFEESIYEIEKLKLREKIMPPLNTNV-LSGKDGNDIGSPKLDDSDSSDATIESK----ERGV 525 (525) T ss_pred hHHHHHHHHHHHHHHhhhcccccccee-eeccccccccCCccCCCcchhhhhhhh----hcCC Confidence 22111100000 000000010000 000000000000000000000000000 1111 No 171 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=97.15 E-value=0.00015 Score=41.54 Aligned_cols=407 Identities=8% Similarity=0.011 Sum_probs=167.0 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhc-ccc-cccccccccchHHHHH---HHH---------------h-----cch Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGL-DQF-EQVDELRWPNSVYTYT---RMC---------------R-----EEA 55 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~lr~~~~~~~y~---~m~---------------~-----~D~ 55 (484) ..||+.--.....+.+-....-.-.+.-+ ... +.. .|..+..+.|+ ++. + ..+ T Consensus 24 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~--~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n 101 (492) T protein:vir:94 24 SQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKL--PEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITN 101 (492) T ss_pred CccchhhhhhcccccCCchhhHHHHHHHHHHHHHHHH--HHHHHHHHHhccccccccccccccccccccccccccccccc Confidence 33333211111111111100000000000 000 000 01111112221 000 0 012 Q ss_pred HHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeE Q lcl|NC_021302. 56 RIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQ 134 (484) Q Consensus 56 ~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Ei 134 (484) ...-++.+...-+.+-+..+.. ++++..+.+.+. .. -+|++.+.++ .++..||.+ +++ T Consensus 102 ~~k~Ivd~~~~yl~G~p~~~~~--~d~~~~~~l~~~-----------------~~-n~~~~~~~~~~~~a~~~G~a-~~~ 160 (492) T protein:vir:94 102 FHANLVDQKVSYIVGKPIAFKH--TDDEVVKRIDEV-----------------LG-NRFDDKLHSVLTGASNKGIE-WLH 160 (492) T ss_pred hHHHHHHHHHhhhcccCceecc--CchHHHHHHHHH-----------------Hh-ccHHHHHHHHHHHHhhCCeE-EEE Confidence 3333334444444444444432 222222222221 11 2466666555 478899977 567 Q ss_pred EEeecCCeeeeeeeeeeCcccee-eeeecCCCceeeee-cccccccc------cccceeccCCC---------------- Q lcl|NC_021302. 135 TYFYEGGRFWLKRLAPRPQSSIA-YWNVDRDGGLISIQ-QWPAGTFG------GPGMVVMAPNS---------------- 190 (484) Q Consensus 135 vw~~~~g~~~~~~l~~r~~~~~~-~~~~~~dg~l~~~~-q~~~~~~~------~~~~~~~~~~~---------------- 190 (484) +|.-.+|... +...+|+.+. .|.....+.++... .+...... ......+.... T Consensus 161 v~~d~dg~~~---~~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 237 (492) T protein:vir:94 161 PYLDEEGEFK---LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKT 237 (492) T ss_pred EEecCCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeeccccccccccc Confidence 7766667654 3344554431 22222233333211 11100000 00000000000 Q ss_pred Cc--ccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHH Q lcl|NC_021302. 191 MG--PAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEI 268 (484) Q Consensus 191 ~~--~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~ 268 (484) .. ..+..--++.| .+|+.|.|.+..+....-.-...+..++..++.|.. |+++.+-....+..+ .... T Consensus 238 ~~~~~~~g~vPvv~~-----~nn~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~--p~lv~~g~~~~~~~~---~~~~ 307 (492) T protein:vir:94 238 HFSTGSWGKIPFIPF-----KNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNE--LTYVLKNYDDQELPE---FKRL 307 (492) T ss_pred cccccCCCccceEEe-----cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcC--ceeeeecCCcccchh---hHHH Confidence 00 00111112222 236778999987665555556677888888887655 555443222222222 2222 Q ss_pred HHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH-HH--HHHHHHHHHH Q lcl|NC_021302. 269 ASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALAS-VQ--ADTFVQSVQT 345 (484) Q Consensus 269 l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~e-vh--~~v~~~~~~a 345 (484) +.. ..++.++.+.+++++........++.+++++.+.|.+.--...++.++-+|. ..|+ .. ..-....+.. T Consensus 308 ---~~~--~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n-~Sg~Al~~~~~~l~~k~~~ 381 (492) T protein:vir:94 308 ---LRY--YGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGSA-PSGVALEFLYTNLNLKADK 381 (492) T ss_pred ---Hhh--ccceecCCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCcCCCccccccC-chHHHHHHHHHHHHHHHHH Confidence 221 2356678999999988776677889999999999888865555555433332 1222 11 1222333344 Q ss_pred HHHHHHHHHHHHHHHHHHHhC-CCCccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCCCC Q lcl|NC_021302. 346 VADEIRDVAQAHVVEDIVDVN-WGEDEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGPDP 422 (484) Q Consensus 346 D~~~i~~~ln~qli~~l~~~N-f~~~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p~~ 422 (484) -.+.+...+. ++++.++.+. ......--.+.|. ..+.++.+.++++.+|+ |+ + +.+.+.+.++. +.+.. T Consensus 382 k~~~f~~~l~-~~~~li~~~~~~~~~~~~i~v~f~~~~p~~~~e~~~~~~kl~--gi-i----S~et~~~~l~~v~d~~~ 453 (492) T protein:vir:94 382 LARKAKVAIQ-ELLWFVFEHFDIKGEHKDVDISFNYNKVANTELQVQTAQQSM--GI-V----SHETVLENHPFVEDLQA 453 (492) T ss_pred HHHHHHHHHH-HHHHHHHHHhcCCcccceeeEEecCCCCCCHHHHHHHHHHHh--cc-C----chHHHHHhCCCCCCHHH Confidence 4555566663 3666555543 2222122356665 45677888889998885 64 2 45677777764 32221 Q ss_pred Cccccc----ccCCCcCCCccccCCCCccccccccccccccccccc Q lcl|NC_021302. 423 DADDDE----STADTGQDEPETDEPALPNTSGTTSTTNAPQARKRP 464 (484) Q Consensus 423 ~e~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 464 (484) +-+... ......+... ...+.... . ....+.+... T Consensus 454 E~eri~~E~~~~~~~~~~~~-~~~~~~~~-~-----~~~~~~~e~e 492 (492) T protein:vir:94 454 ELERIEQEQMEYNKQLPNLD-DGGADSAQ-Q-----QERSNNKESE 492 (492) T ss_pred HHHHHHHHHHHHHhhccccc-cccCCCCc-c-----ccCCccccCC Confidence 110000 0000000000 00000000 0 0000000000 No 172 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=97.12 E-value=0.00016 Score=41.35 Aligned_cols=407 Identities=12% Similarity=0.027 Sum_probs=157.5 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHH--------------HHHHhcchHHHHHHHHHHH Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTY--------------TRMCREEARIASVLRAIGL 66 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y--------------~~m~~~D~~v~s~l~~r~~ 66 (484) |++..+-.... .+..+ ...-.....|..+.-+.| +++. |-.+.. T Consensus 1 ~~~~~~~d~~~--~i~~L-----------~~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~--~~~~~~------- 58 (488) T protein:vir:23 1 MAETESIDPEK--LRDQL-----------LDAFENKQNELKSSKAYYDAERRPDAIGLAVPLDMR--KYLAHV------- 58 (488) T ss_pred CCcccCCCHHH--HHHHH-----------HHHHHHHHHHHHHHHHHHhcccchhhcCcccchhhh--hhhhhc------- Confidence 33332222111 00100 000000000000111111 1110 101100 Q ss_pred HhhCCCcEEecCCCCHHHHHHHHHHHHhh--------------hccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhccee Q lcl|NC_021302. 67 PIRRTDWRIRPNGARPEVVEHVAACLGLP--------------VEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAV 131 (484) Q Consensus 67 ~v~~~~~~v~p~~~~~e~~~~~~~~l~~~--------------~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~ 131 (484) +| ...+++..++.|... ...++......+....-+|+.....+ .++..||.| T Consensus 59 -----n~-------~~~ivd~~a~~l~~~Gf~~~~~~~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a- 125 (488) T protein:vir:23 59 -----GY-------PRTYVDAIAERQELEGFRIPSANGEEPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTA- 125 (488) T ss_pred -----ch-------HHHHHHHHHHhhhccceeccCCcccccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCce- Confidence 00 011222222211100 00011111222333344677777765 478999997 Q ss_pred eeEEEeec--------CCeeeeeeeeeeCccceee---------------eeecCCCceeeeecccccccccc---ccee Q lcl|NC_021302. 132 FEQTYFYE--------GGRFWLKRLAPRPQSSIAY---------------WNVDRDGGLISIQQWPAGTFGGP---GMVV 185 (484) Q Consensus 132 ~Eivw~~~--------~g~~~~~~l~~r~~~~~~~---------------~~~~~dg~l~~~~q~~~~~~~~~---~~~~ 185 (484) ++++|... ++.. .|...+|+.+.- +...+++......-+..+..... ..-. T Consensus 126 ~~~v~~~~~~~~~~~~~~~~---~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~ 202 (488) T protein:vir:23 126 YITISMPDPEVDFDVDPEVP---LIRVEPPTALYAEVDPRTRKVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEW 202 (488) T ss_pred EEEEecCCcccccCCCCCcc---eEEEeccceeEEEEecCCCceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecCCce Confidence 67777532 1211 233334333211 11111111111111111100000 0000 Q ss_pred ccCCCCcccccccceEEEeecCccCccccchhHHHHHHHHH-HHHHHHHHHHHHHHHhcCCcceEEecCCCCCCH--HHH Q lcl|NC_021302. 186 MAPNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWK-LKDELIRIEAAAIRRHGIGVPYLKGNEADSEDD--DRM 262 (484) Q Consensus 186 ~~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~-~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~--~~~ 262 (484) .........+..--++.|+++.+.+.++|.|-+.......+ --...+..++...+-|.+++.++.|-....... ... T Consensus 203 ~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~ 282 (488) T protein:vir:23 203 EAPTSTPHGLEMVPVIPISNRTRLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAETG 282 (488) T ss_pred EeccccccCCCCcceEEeccccccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCccccccccccc Confidence 00112223344445677888888888999887754322222 224555666777787777666666633221111 111 Q ss_pred HHHHHHHHHHhcCCceEEEccCCceEEEecccCC-chhHHHHHHHHHHHHHHHHhhhh--hcccccc-cc-hhhHHHHHH Q lcl|NC_021302. 263 DELLEIASNYSGGESAGLALTAGEEAGILSPNGT-PLDPRRAIEYHDHQMALVALAHF--LNLDGKG-GS-YALASVQAD 337 (484) Q Consensus 263 ~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~-~~~~~~li~~~d~~Isk~ilGqt--lt~~~~g-Gs-~A~~evh~~ 337 (484) ..+.+ + +.....++++|.+.++.+-... ...|...++.+-.+|+....-.. +.....+ +| -|+ ..... T Consensus 283 ~~~~~----~--~~~~v~~~~~g~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al-~~~~~ 355 (488) T protein:vir:23 283 QRMFD----A--YMARILAFEGGEGAHAEQFSAAELRNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAI-KAAES 355 (488) T ss_pred chhhh----h--hhhhhccCCCCCCceeEecCCCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHH-HHHHH Confidence 11111 1 1124566788877777764432 23455555555555553311111 1111111 11 121 22233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc--c--ccceEEecC-CCCcHHHHHHHHHHHHhcCcccCCcccHHHHH Q lcl|NC_021302. 338 TFVQSVQTVADEIRDVAQAHVVEDIVDVNWGED--E--PAPLLVFDE-IGSRQDATAAALQMLVNAGLLTPDPRLEAFLR 412 (484) Q Consensus 338 v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~--~--~~P~~~~~~-~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~ 412 (484) -+....+.-.+.+...|. ++++-++.+.-+.. . .--.++|.. ....+.+.++++.+|++.|..+ .+.+.++ T Consensus 356 ~l~~k~~~~~~~f~~~l~-~~~~l~~~~~~~~~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~---~s~et~~ 431 (488) T protein:vir:23 356 RLVKKVERKNKIFGGAWE-QAMRLAYKMVKGGDIPTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGL---IPRERGW 431 (488) T ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHHHhcCCCcchhhccceEEecCCCCCCHHHHHHHHHHHHhccccc---CCHHHHH Confidence 344444455566666774 46666665532211 1 112456643 3456788899999999988522 2467788 Q ss_pred HHhCCCCCCCCccc----cccc---------CCCcCCCccccCCCCcccccccccccc Q lcl|NC_021302. 413 DAAGLPGPDPDADD----DEST---------ADTGQDEPETDEPALPNTSGTTSTTNA 457 (484) Q Consensus 413 e~~glp~p~~~e~~----~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (484) +.+|+-.....+-. .... ....++. .......+....++.+.++ T Consensus 432 ~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~e~~~a 488 (488) T protein:vir:23 432 VDMGYTIVEREQMRQWLEQDQKQGLGLIGSLYGASTPE-GKPGEAPVGEPPAPEPDAA 488 (488) T ss_pred HhCCCCchHHHHHHHHHHHHHHHHHHHHHHHhccCCCc-ccCCCCCCCCCCCCCCCCC Confidence 88887332211000 0000 0000000 0000001111111111111 No 173 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=97.11 E-value=0.00017 Score=41.29 Aligned_cols=428 Identities=14% Similarity=0.042 Sum_probs=155.2 Q ss_pred hhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEe-cC-------------CCCHHHHHH Q lcl|NC_021302. 22 GTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIR-PN-------------GARPEVVEH 87 (484) Q Consensus 22 ~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~-p~-------------~~~~e~~~~ 87 (484) -+..+.+..-.+....+. . .+++.... |. .-+.++.+--.+...... +. +=...+++. T Consensus 1 ~~~~i~~~~~~~~~~~~~-~---~l~~~~~~---~~-~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~ 72 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIAR-D---EMVSAFED---ST-QNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVDS 72 (485) T ss_pred CCCCCCCCCCCCCHHHHH-H---HHHHHHHH---HH-HHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHHH Confidence 000111111111100000 0 00000000 00 001111111111100000 00 001123333 Q ss_pred HHHHHHhh----hccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeee-----eeeeeeeCcccee Q lcl|NC_021302. 88 VAACLGLP----VEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFW-----LKRLAPRPQSSIA 157 (484) Q Consensus 88 ~~~~l~~~----~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~-----~~~l~~r~~~~~~ 157 (484) .++.|... -..++.+....+....-+|+.....+. +|+.||.| ++++|.-.++... -..|..++|+.+. T Consensus 73 ~~~~l~~~g~~~~~~~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~a-y~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~ 151 (485) T protein:vir:10 73 IAERQAVEGFRFGDADEADEELWQWWQANNLDIEAPLGYTDAYVHGRS-YITISRPDPQIDLGWDPNTPIIRVEPPTRMY 151 (485) T ss_pred HHhhhcccceecCCCchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCce-EEEEeeCCcccccccCCCeeEEEEEccceeE Confidence 33322100 011122223334444456877777664 78999987 6677764322110 0124445555432 Q ss_pred eeeecCC-Ccee-eeecccccccc---------cccceecc--------CCCCcccccccceEEEeecCccCccccchhH Q lcl|NC_021302. 158 YWNVDRD-GGLI-SIQQWPAGTFG---------GPGMVVMA--------PNSMGPAIPVEQLVVYTHDMDPGVWTGNSLL 218 (484) Q Consensus 158 ~~~~~~d-g~l~-~~~q~~~~~~~---------~~~~~~~~--------~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll 218 (484) ..+|.. +... .+........+ ....+.+. ......+++.--++.|.++.+.+.++|.|-+ T Consensus 152 -~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~i 230 (485) T protein:vir:10 152 -AEIDPRIGRVSKAIRVAYDAEGNEIQAATLYTPNDIFGWYRVENEWQEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSEI 230 (485) T ss_pred -EEEcCCCCceeEEEEEEEeeCCCeEEEEEEEeCCeEEEEEEcCCceEEeccccCCCCcccEEEeccccccCCCCCccch Confidence 122221 1111 01100000000 00001000 0112233444456778888888889998876 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccC-C Q lcl|NC_021302. 219 RP-AYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNG-T 296 (484) Q Consensus 219 ~~-~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~-~ 296 (484) .. +-...=--...+...+...|-|.+++.++.|-.......++ ..-...+ ....+ +...+ .+.+.+|.+... + T Consensus 231 ~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~-~~~~~~~-~~~~~--~i~~~-~~~d~k~~q~~~~~ 305 (485) T protein:vir:10 231 TPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDP-ETGQTLF-DAYLA--RILAF-EDAEGKIQQFSAAE 305 (485) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCcccccccc-cccchhh-hhccc--ceecc-CCCCceEEeecccc Confidence 54 22111122445566667778777766666664322211110 0001111 11112 22333 445666665432 2 Q ss_pred chhHHHHHHHHHHHHHHHHhh--hhhcccccc-cc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc- Q lcl|NC_021302. 297 PLDPRRAIEYHDHQMALVALA--HFLNLDGKG-GS-YALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDE- 371 (484) Q Consensus 297 ~~~~~~li~~~d~~Isk~ilG--qtlt~~~~g-Gs-~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~- 371 (484) ...|...++-.-.+|+..--- ..+...+.+ .| .|+ .....-....++.-.+.+...|++ +++-++.+.-+... T Consensus 306 ~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al-~~~~~~l~~k~~~k~~~f~~~l~~-~~~l~~~~~~~~~~~ 383 (485) T protein:vir:10 306 LANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAI-RAAESRLIKKVERKNSIFGGAWEE-AMRLAYRMMKGGDVP 383 (485) T ss_pred hHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhCCCCCc Confidence 344555555555555443111 011111111 11 222 222333344444555666667743 66655554322211 Q ss_pred ---ccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCccc--cc-ccCCCcCCCccccCCC Q lcl|NC_021302. 372 ---PAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADD--DE-STADTGQDEPETDEPA 444 (484) Q Consensus 372 ---~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~--~~-~~~~~~~~~~~~~~~~ 444 (484) ..-.++|. .....+.+.++++.+|++.|..+ .+.+.+++.+|+....-.+-. .. ..+...........+. T Consensus 384 ~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~---~s~et~~~~lg~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~ 460 (485) T protein:vir:10 384 PDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGV---IPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLIGTMVDPN 460 (485) T ss_pred ccceeeeEEecCCCCCCHHHHHHHHHHHHhccccC---CCHHHHHHhCCCCHhHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 11245674 44567888999999999988422 246677888888543211100 00 0000000000000000 Q ss_pred CccccccccccccccccccccccchH Q lcl|NC_021302. 445 LPNTSGTTSTTNAPQARKRPRGRSPR 470 (484) Q Consensus 445 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (484) . ...+.+.....+..+...++.+.+ T Consensus 461 ~-~~~~~~~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 461 P-TVPGSPSPAPAPKPAALESGGDAA 485 (485) T ss_pred C-CCCCCCCccccccCcCCCCCCCCC Confidence 0 000000000000000000011111 No 174 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=97.07 E-value=0.00018 Score=41.07 Aligned_cols=410 Identities=7% Similarity=0.004 Sum_probs=163.9 Q ss_pred CCCCCCCccceeeee-cc----cccch--hhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCc Q lcl|NC_021302. 1 MAPKTVAPRTERGYV-NP----LAGFG--TFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDW 73 (484) Q Consensus 1 ~~~~~~~~~~~~~~~-~~----~~~~~--~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~ 73 (484) |.+.+.-....+..+ +. ...+. ..+..|-. +....+.......-.++ ..+...-++.+...-+.+-+. T Consensus 11 ~p~d~~~~~~~l~~~i~~~~~~~~r~~~~~~yy~g~~---~i~~~~~~~~~~~~~ki--~~n~~~~ivd~~~~~l~g~~~ 85 (453) T protein:vir:39 11 FPKDEPITNEVVTKFMEKHRLEVARYEYLKNMYRGIM---AIDAEPTKDLWKPDNRL--TVNFTKYIVDTFTGYFNGIPV 85 (453) T ss_pred cCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhhccC---chhcCCCccccCcccee--ecchHHHHHHHHhhhhcccCc Confidence 333332111110000 00 00000 00111100 00000000000000011 122333334444444455554 Q ss_pred EEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCeeeeeeeeeeC Q lcl|NC_021302. 74 RIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRFWLKRLAPRP 152 (484) Q Consensus 74 ~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~ 152 (484) .+.+. ++++.+.+.+ ....-+|+..+..+ .++..||. +++++|.-.+|... +...+ T Consensus 86 ~~~~~--d~~~~~~l~~-----------------i~~~N~~~~~~~~~~~~~~~~G~-~~~~v~~d~~g~~~---i~~~~ 142 (453) T protein:vir:39 86 KKSHS--DKETLSKLQE-----------------FDNLNDMEDEESELAKMACIYGR-AFELLYQNEETQTN---VIYNT 142 (453) T ss_pred eeccC--ChHHHHHHHH-----------------HHHhcChhHHHHHHHHHHhhcCe-EEEEEEecCCCceE---EEEEc Confidence 44432 2222222222 22333577666555 57999997 56777765666543 33334 Q ss_pred ccceeeeeecC--CCceeeeecccccccc--------cccceeccCCCC--------cccccccceEEEeecCccCcccc Q lcl|NC_021302. 153 QSSIAYWNVDR--DGGLISIQQWPAGTFG--------GPGMVVMAPNSM--------GPAIPVEQLVVYTHDMDPGVWTG 214 (484) Q Consensus 153 ~~~~~~~~~~~--dg~l~~~~q~~~~~~~--------~~~~~~~~~~~~--------~~~lp~~k~l~~~~~~~~~~p~G 214 (484) |+.+. ..+++ ...++........... ......+..... ...+..--++.|+ +++.| T Consensus 143 p~~~~-~v~d~~~~~~~~~~ir~~~~~~~~~~~~~yt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g 216 (453) T protein:vir:39 143 PENMF-MVYDDTIKQEPLFAVRYGYDDDYKLYGEVYTKETTYALNGTMGFYNMTEQAPNPFDDLPVVEFY-----FNEER 216 (453) T ss_pred ccceE-EEecCCCCCeEEEEEEEEEeCCeEEEEEEEeCCeEEEEEecCCceeeecccccCCCceeEEEec-----CCCCC Confidence 44432 11221 1111111111000000 011111111100 1111111122222 36788 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEeccc Q lcl|NC_021302. 215 NSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPN 294 (484) Q Consensus 215 ~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~ 294 (484) .|.+..+....---...+..++..++.|.+++.++.|. +.+++....+... ..+ ........+.+.++++++.. T Consensus 217 ~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~---~~~~~~~~~~~~~-~~~--~~~~~~~~~~~~~~~~lt~~ 290 (453) T protein:vir:39 217 MSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGA---AVEEEDLKNIRSN-RVI--NYYGESSEAKNVDVKFLEKP 290 (453) T ss_pred CcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecC---CCCchhhhhhhhc-cee--eecCCCCCCCCCceeEEeec Confidence 99998766665566778888888899876655555553 2333333222111 000 00011112356778888876 Q ss_pred CCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--CCCc- Q lcl|NC_021302. 295 GTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALA-SVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVN--WGED- 370 (484) Q Consensus 295 ~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~-evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~N--f~~~- 370 (484) .....++..++.+.+.|...-....++.++-|++.+.+ +....-....+..-.+.+...+. ++++.++.+. .+.. T Consensus 291 ~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~-~~~~li~~~~~~~~~~~ 369 (453) T protein:vir:39 291 DSDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLN-SRYKLYCELSTNVSNKE 369 (453) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCcccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCCcc Confidence 66678899999999999886444444443323221111 11122223333444555566664 3666555542 1111 Q ss_pred -cccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCCCCCcccccccCCCcCCCccccCCCCcc Q lcl|NC_021302. 371 -EPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGPDPDADDDESTADTGQDEPETDEPALPN 447 (484) Q Consensus 371 -~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p~~~e~~~~~~~~~~~~~~~~~~~~~~~ 447 (484) ..-..+.|. ....++...++++.+|+ |+ + +.+.+.+.++. +.+..+-+.............+........ T Consensus 370 ~~~~i~v~f~~~~p~~~~~~a~~~~kl~--g~-i----s~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 442 (453) T protein:vir:39 370 AWKDIEYTFTRNEPKDIKEQAETANILM--GI-T----SQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPSEKG 442 (453) T ss_pred ccccceEEeCCCCCcCHHHHHHHHHHHh--cc-C----ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCCCCC Confidence 112356775 45567888899999884 54 2 45677777764 322111000000000000000000000000 Q ss_pred cccccccccccc Q lcl|NC_021302. 448 TSGTTSTTNAPQ 459 (484) Q Consensus 448 ~~~~~~~~~~~~ 459 (484) .... .+....+ T Consensus 443 ~~~~-~~~~~~e 453 (453) T protein:vir:39 443 TDTV-VPETNEE 453 (453) T ss_pred CCCC-CCCcCCC Confidence 0000 0001111 No 175 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=97.02 E-value=0.00021 Score=40.80 Aligned_cols=417 Identities=13% Similarity=0.062 Sum_probs=167.9 Q ss_pred CCCCCCccceeeeeccccc-c-------------hhhhhhhccccccccccccc--chHHHHHHHHhcchHHHHHHHHHH Q lcl|NC_021302. 2 APKTVAPRTERGYVNPLAG-F-------------GTFLAQGLDQFEQVDELRWP--NSVYTYTRMCREEARIASVLRAIG 65 (484) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~-~-------------~~~~~~~~~~~~~~~~lr~~--~~~~~y~~m~~~D~~v~s~l~~r~ 65 (484) --+++...+.+.+.++.-. + +....+.-.....++ +.+ ..++-|-.-.-.=+++...++... T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~--~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~ 78 (491) T protein:vir:95 1 MLTANGQGSGVKTKHREWLHYAPKWQKVRHALAGDLVGYLRNVGLNEPD--KAYGEARQAEYEAGGIVYNFTRRTLSGMV 78 (491) T ss_pred CcccCCccCCCCccCHHHHHHHHHHHHHHHHhcCcchhhcccCCCcCCC--CCCCHHHHHHHHhcccCCChHHHHHHHHh Confidence 1122222222222222100 0 000000000000000 111 112222222222355666666666 Q ss_pred HHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCC--- Q lcl|NC_021302. 66 LPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGG--- 141 (484) Q Consensus 66 ~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g--- 141 (484) ..|.+.+..++ .++..+.+.+++. ..+.+++.+++.++ .++.||.+.+=+.+-..++ T Consensus 79 G~vfrk~p~~~----~p~~l~~l~~d~D---------------~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ 139 (491) T protein:vir:95 79 GSVMRKEPEIN----IPKELEYLLKNAD---------------GSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATA 139 (491) T ss_pred chhhcCCceee----ccHHHHHHHhccC---------------CCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCH Confidence 66666666653 2222222222221 13557888888886 6888998877665532221 Q ss_pred ------eeeeeeeeeeCccceeeeeecCCCc---e--eeeecc-----cccccccc----------------cceecc-- Q lcl|NC_021302. 142 ------RFWLKRLAPRPQSSIAYWNVDRDGG---L--ISIQQW-----PAGTFGGP----------------GMVVMA-- 187 (484) Q Consensus 142 ------~~~~~~l~~r~~~~~~~~~~~~dg~---l--~~~~q~-----~~~~~~~~----------------~~~~~~-- 187 (484) ... -.+..+.+..|-=|+++..++ | +.++.. +.+.++.. ....+. T Consensus 140 Ade~~~~~r-Py~~~~~~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~ 218 (491) T protein:vir:95 140 AEQNAGLLN-PTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQRLFRFD 218 (491) T ss_pred HHHHHhcCC-cEEEEechhhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeecCCCceEEEEEEEc Confidence 001 123333333333333333221 1 111111 00111100 000000 Q ss_pred -------------CCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHH-HHHHHHhcCCcceEEecC Q lcl|NC_021302. 188 -------------PNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIE-AAAIRRHGIGVPYLKGNE 253 (484) Q Consensus 188 -------------~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w-~~f~Er~~~G~P~~~gk~ 253 (484) ....+..++.=-|+++... ..+-..+...|..++..- .+++-.... -..+-.-++|+|++.|- T Consensus 219 ~~g~~~~~~~~~~~~~g~~~l~~IPfv~~~~~-~~~~~~~~pPLl~LA~ln-i~Hy~~ssd~~~~l~~~~~P~l~~~G~- 295 (491) T protein:vir:95 219 AEGGAQEEVVEIYPDLGESLRGVIPFTFIGAT-NNDATIDDAPLLPLAELN-IGHYRNSADNEESSFVVGQPTLFIYPG- 295 (491) T ss_pred CCCcceeeeeeeeecCCCcccCeeEEEEEecC-CCCCCCCcCchHHHHHHH-HHHhhhhhHHHHHHHHcccceeeeecC- Confidence 0111222222223333322 223334555566665442 233222222 22222224566666552 Q ss_pred CCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccc--cccchhh Q lcl|NC_021302. 254 ADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDG--KGGSYAL 331 (484) Q Consensus 254 ~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~--~gGs~A~ 331 (484) ...+++-.... .. ..++-|.+++..+|.+.+..++++++++.....|.+ ...+|.. +|-.|...+ .+++. T Consensus 296 -d~~~~~~~~~~-~~-~~i~~g~~~~~~lP~~~~~~~ie~~~~~~~~~~l~~-~e~qm~~--~Ga~l~~~~~~~Ta~~-- 367 (491) T protein:vir:95 296 -DNLTPQSFKEA-NP-NGIKFGSRCGHNLGYGGSAQLIQAGENNLARQNMLD-KEQQAIQ--IGAQLITPSQQITAES-- 367 (491) T ss_pred -cccCcchhhcc-Cc-ceeEecCcCCcCCCCCCccceeecCcchHHHHHHHH-HHHHHHH--HHHHhccCCcchhHHH-- Confidence 22222222211 11 224457778888999999999999877654333333 3333333 343333321 22222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEec----CCCCcHHHHHHHHHHHHhcCcccCCccc Q lcl|NC_021302. 332 ASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFD----EIGSRQDATAAALQMLVNAGLLTPDPRL 407 (484) Q Consensus 332 ~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~----~~~~~~~~~ae~~~~L~~~G~~~~~~~~ 407 (484) ...........+.+-+..+++.++ +++++++.+-.-....-+.|++. ...-+ ....+++-++.+.|.+.. ..- T Consensus 368 ~~~~~~~~~S~L~~~a~~~e~al~-~~l~~~a~w~G~~~~~~v~i~~n~dF~~~~~~-~~~~~all~~~~~G~is~-~t~ 444 (491) T protein:vir:95 368 ARIQRGADTSVMATIARNVSQAYT-DALRWVAMMLGKPEDSEVEFQLNMDFFLQPMT-AQDRAAWMADINAGLLPA-TAY 444 (491) T ss_pred HHHHHHHhhHHHHHHHHHHHHHHH-HHHHHHHHHcCCCCCCceEEEeecccccccCC-HHHHHHHHHHHhcCCCCH-HHH Confidence 223344456777888899999996 58999999853222222345432 12222 223455667777886432 111 Q ss_pred HHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccccccccccccc Q lcl|NC_021302. 408 EAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARK 462 (484) Q Consensus 408 ~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 462 (484) -.++ ++.||+.+..++..+. ..+.+ .+.+......|+ ..+++...-. T Consensus 445 ~~~L-~~~~vl~~~~e~~~~~-ie~~~-----~~~~~~~~~~~~-~~~~~~~~~~ 491 (491) T protein:vir:95 445 YAAL-RKAGVTDWTDEDILNA-IEDAP-----LPSGAVTQVAGE-IPQAAQQQQE 491 (491) T ss_pred HHHH-HhCCCCCccHHHHHHH-HHhcC-----CCCCcccccccc-chhhhhhccC Confidence 2233 4558875543322221 11111 000111111111 1111111101 No 176 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=96.93 E-value=0.00025 Score=40.29 Aligned_cols=379 Identities=10% Similarity=0.013 Sum_probs=160.8 Q ss_pred chhhhhhhcccccccccccccchHHHHHHHHh----------------------------------cchHHHHHHHHHHH Q lcl|NC_021302. 21 FGTFLAQGLDQFEQVDELRWPNSVYTYTRMCR----------------------------------EEARIASVLRAIGL 66 (484) Q Consensus 21 ~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~----------------------------------~D~~v~s~l~~r~~ 66 (484) +..-....+.+. ......+.+.-|..+.+ ......-.+.+... T Consensus 1 ~~~~~~~~~i~~---~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~ 77 (470) T protein:vir:10 1 MELDALKKLIQN---TSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAG 77 (470) T ss_pred CchHHHHHHHHH---HHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhh Confidence 110000000000 00000111111111110 01112223333344 Q ss_pred HhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCeeee Q lcl|NC_021302. 67 PIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRFWL 145 (484) Q Consensus 67 ~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~~~ 145 (484) -+.+-+..+... +++..+.+.+.+ . .+|.+.+..+ .++.-+|.+.. ++|.-.+|.+.. T Consensus 78 yl~G~p~~~~~~--d~~~~~~l~~~~-----------------~-~~~~~~~~~l~~~~~~~G~a~~-~~y~d~~~~~~~ 136 (470) T protein:vir:10 78 YVASVFPDIDVG--KDADNKKIIDVL-----------------G-DDRALTLNGLLVDSSNAGRAWL-HYWIDEDGNFRY 136 (470) T ss_pred heeccceeeecC--chHHHHHHHHHH-----------------h-hhHHHHHHHHHHHHhhcCeeEE-EEEecCCCceEE Confidence 444555454432 222223332222 1 2466655555 47888998764 556555565543 Q ss_pred eeeeeeCccceeeeeecC--CCceeeeecccccc--cc-c----------ccceeccC--CC------------------ Q lcl|NC_021302. 146 KRLAPRPQSSIAYWNVDR--DGGLISIQQWPAGT--FG-G----------PGMVVMAP--NS------------------ 190 (484) Q Consensus 146 ~~l~~r~~~~~~~~~~~~--dg~l~~~~q~~~~~--~~-~----------~~~~~~~~--~~------------------ 190 (484) ...+|.... ..+++ .+.++..-.+.... .+ . .....+.. .. T Consensus 137 ---~~~~p~~~~-~v~d~~~~~~~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (470) T protein:vir:10 137 ---GIIQPDQIT-PIYATTLDNKLLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAG 212 (470) T ss_pred ---EEEcccceE-EEEcCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccc Confidence 333444321 11221 22222211110000 00 0 00000000 00 Q ss_pred --------CcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHH Q lcl|NC_021302. 191 --------MGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRM 262 (484) Q Consensus 191 --------~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~ 262 (484) ....+..--++.++ +|..|.|.+..+-...---...+.+++..++.+..++.++.|-.. .+.++ T Consensus 213 ~~~~~~~~~~~~~g~vPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~--~~~~~- 284 (470) T protein:vir:10 213 YETGQSNTLKHNFGRVPFIEFS-----KNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGG--ADLHQ- 284 (470) T ss_pred cccccccccccCCCeeeEEEee-----cCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCc--cccch- Confidence 00000111122222 367789999987766666688889999999988776666655322 22111 Q ss_pred HHHHHHHHHHhcCCceEEEccC-----CceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHH Q lcl|NC_021302. 263 DELLEIASNYSGGESAGLALTA-----GEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASVQAD 337 (484) Q Consensus 263 ~~l~~~l~~~~~g~~a~~vip~-----~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~evh~~ 337 (484) ...++... ..+.++. +.+++++........++..++++.+.|.+.--+..++.++. | . ++.+... T Consensus 285 -----~~~~~~~~--~~i~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~-g-n-~Sg~Alk 354 (470) T protein:vir:10 285 -----FMNDLRKY--KSIKINNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDPANFES-S-N-ASGVAIK 354 (470) T ss_pred -----hhhhhhhc--CeEeccCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCCCcccc-c-c-chHHHHH Confidence 12233222 2344443 46788888777777899999999999988766666655432 2 2 2222333 Q ss_pred HHHHHH----HHHHHHHHHHHHHHHHHHHHH-hCCCC-ccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHH Q lcl|NC_021302. 338 TFVQSV----QTVADEIRDVAQAHVVEDIVD-VNWGE-DEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAF 410 (484) Q Consensus 338 v~~~~~----~aD~~~i~~~ln~qli~~l~~-~Nf~~-~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~ 410 (484) .+...+ ..-.+.+...|. ++++.++. +|... +.....+.|. ..+.+..+.++.+++++ |+ +|.+. T Consensus 355 ~~~~~l~~k~~~~~~~~~~~l~-~~~~~i~~~l~~~~~d~~~i~i~f~~~~p~d~~e~~~~~~~~~--g~-----iS~et 426 (470) T protein:vir:10 355 MLYSHLELKAAKTQTYFEHAIN-ELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTVA--NY-----SSKEA 426 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcccCcccceeeEEeccCCCCCHHHHHHHHHHHh--cc-----CcHHH Confidence 333333 333344455553 35555544 33321 1223466775 46678888899988874 53 25666 Q ss_pred HHHHhCC-CCCCCCcccccccCCCcCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 411 LRDAAGL-PGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 411 i~e~~gl-p~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) +.+.++. ..+..+-+..........+..+.. ++. ...+. ...+ T Consensus 427 ~l~~~p~v~D~~~E~eri~~E~~e~~~~~~~~-~~~-~~~~~----dde~ 470 (470) T protein:vir:10 427 VAKANPIVDDWQQELKDLAKDKEENDPYSNQA-DEL-NGKGV----NDEQ 470 (470) T ss_pred HHHhCCCCCCHHHHHHHHHHHHHHHHHhhccc-ccc-CCCCC----CCCC Confidence 7777763 222111000000000000000000 000 00000 0001 No 177 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=96.88 E-value=0.00028 Score=40.04 Aligned_cols=387 Identities=10% Similarity=0.021 Sum_probs=164.5 Q ss_pred chhh-hhhhccccccccc-c------------cccchHHHHHHHH---------------------------------hc Q lcl|NC_021302. 21 FGTF-LAQGLDQFEQVDE-L------------RWPNSVYTYTRMC---------------------------------RE 53 (484) Q Consensus 21 ~~~~-~~~~~~~~~~~~~-l------------r~~~~~~~y~~m~---------------------------------~~ 53 (484) +.+. .+..+...+--.+ + |..+..+.|+..- -. T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 1110 0000000000000 0 0000111111100 00 Q ss_pred chHHHHHHHHHHHHhhCCCcEEecCCC---CHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcc Q lcl|NC_021302. 54 EARIASVLRAIGLPIRRTDWRIRPNGA---RPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGH 129 (484) Q Consensus 54 D~~v~s~l~~r~~~v~~~~~~v~p~~~---~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~ 129 (484) .+...-++.+....+.+-+..+....+ +++..+++.+ ....-.|+.....+ .++..||. T Consensus 81 ~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~-----------------~~~~n~~~~~~~~~~~~~~~~G~ 143 (474) T protein:vir:10 81 NSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITN-----------------FAIRNSVDDEDSEIGKMAAICGY 143 (474) T ss_pred cchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHH-----------------HHhhcCHhHHHHHHHHHHhhcCe Confidence 223333333344444444444433211 1112222222 22223566666554 57999997 Q ss_pred eeeeEEEeecCCeeeeeeeeeeCccceeeeeecCCCceeee-ecccc--ccc----------ccccceeccCCC------ Q lcl|NC_021302. 130 AVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISI-QQWPA--GTF----------GGPGMVVMAPNS------ 190 (484) Q Consensus 130 s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~-~q~~~--~~~----------~~~~~~~~~~~~------ 190 (484) +++++|.-.+|.+.+ ...+|+.+ +..+++.+..+.. +.+.. ... .....+.+...+ T Consensus 144 -a~~~~~~d~~~~~~~---~~i~p~~~-~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~ 218 (474) T protein:vir:10 144 -GARLAYIDTNGDIRI---KNIDPYNV-IFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQE 218 (474) T ss_pred -EEEEEEeCCCCeeEE---EEEcccce-EEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccc Confidence 567888766776543 34444443 1223333332211 00000 000 000111111100 Q ss_pred ---CcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHH Q lcl|NC_021302. 191 ---MGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLE 267 (484) Q Consensus 191 ---~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~ 267 (484) ...++..--++.|+ +|+.|.|.+..+-...---...+...+..++.|..++.++.|- ..+++... T Consensus 219 ~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~---~~~~~~~~---- 286 (474) T protein:vir:10 219 VGRYEHLFDYNPLFGVP-----NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGM---GMSEEMIQ---- 286 (474) T ss_pred cccccCCCCccceEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccC---CCCchhhh---- Confidence 11111111223332 4678999998866555555667788888888877666666652 22333222 Q ss_pred HHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHHH----HHHHH Q lcl|NC_021302. 268 IASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASVQADT----FVQSV 343 (484) Q Consensus 268 ~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~evh~~v----~~~~~ 343 (484) ++... .+..+.+.+.+++++........++.+++.+.+.|...--+..++.++.+|. ..| +.... ....+ T Consensus 287 ---~~~~~-~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n-~Sg-~Al~~~~~~l~~k~ 360 (474) T protein:vir:10 287 ---ETQKS-GAFELFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGN-VPI-IGMKLKLMALENKC 360 (474) T ss_pred ---hhhhc-ceeEecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCccccccccccc-chH-HHHHHHHHHHHHHH Confidence 22111 2445568889999998776667889999999999988644444444432222 122 22222 22233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHh-CC--CC----ccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHh Q lcl|NC_021302. 344 QTVADEIRDVAQAHVVEDIVDV-NW--GE----DEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAA 415 (484) Q Consensus 344 ~aD~~~i~~~ln~qli~~l~~~-Nf--~~----~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~ 415 (484) ..-.+.+...+. ++++-++.+ +. .. .-.-.++.|. ..+.+..+.++++.+|+ |+ ++.+.+.+.+ T Consensus 361 ~~~~~~~~~~l~-~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~--g~-----iS~et~~~~l 432 (474) T protein:vir:10 361 MTFERKMTAMLR-YQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK--GQ-----VSERTRLGQS 432 (474) T ss_pred HHHHHHHHHHHH-HHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh--cc-----CchHHHHHhC Confidence 333445555553 455555442 21 11 1112466775 45678889999999985 54 2567777777 Q ss_pred CC-CCCCCCcccccc-cCCCcCCCccccCCCCccccccccccccccccccc Q lcl|NC_021302. 416 GL-PGPDPDADDDES-TADTGQDEPETDEPALPNTSGTTSTTNAPQARKRP 464 (484) Q Consensus 416 gl-p~p~~~e~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 464 (484) +. +.+..+-+.... ........+... .....+ ..+...+- T Consensus 433 ~~v~d~~~E~eri~~E~~e~~~~~~~~~---~~~~~~------~~~~~~s~ 474 (474) T protein:vir:10 433 QLVDDVDYELDEMEKESLEFNDKLPDID---EGDAND------KSQNNQSE 474 (474) T ss_pred CCCCCHHHHHHHHHHHHHHHHhhccccc---CCCcCC------CCccccCC Confidence 64 322110000000 000000000000 000000 00000000 No 178 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=96.88 E-value=0.00028 Score=40.04 Aligned_cols=387 Identities=10% Similarity=0.021 Sum_probs=164.5 Q ss_pred chhh-hhhhccccccccc-c------------cccchHHHHHHHH---------------------------------hc Q lcl|NC_021302. 21 FGTF-LAQGLDQFEQVDE-L------------RWPNSVYTYTRMC---------------------------------RE 53 (484) Q Consensus 21 ~~~~-~~~~~~~~~~~~~-l------------r~~~~~~~y~~m~---------------------------------~~ 53 (484) +.+. .+..+...+--.+ + |..+..+.|+..- -. T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 1110 0000000000000 0 0000111111100 00 Q ss_pred chHHHHHHHHHHHHhhCCCcEEecCCC---CHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcc Q lcl|NC_021302. 54 EARIASVLRAIGLPIRRTDWRIRPNGA---RPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGH 129 (484) Q Consensus 54 D~~v~s~l~~r~~~v~~~~~~v~p~~~---~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~ 129 (484) .+...-++.+....+.+-+..+....+ +++..+++.+ ....-.|+.....+ .++..||. T Consensus 81 ~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~-----------------~~~~n~~~~~~~~~~~~~~~~G~ 143 (474) T protein:vir:94 81 NSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITN-----------------FAIRNSVDDEDSEIGKMAAICGY 143 (474) T ss_pred cchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHH-----------------HHhhcCHhHHHHHHHHHHhhcCe Confidence 223333333344444444444433211 1112222222 22223566666554 57999997 Q ss_pred eeeeEEEeecCCeeeeeeeeeeCccceeeeeecCCCceeee-ecccc--ccc----------ccccceeccCCC------ Q lcl|NC_021302. 130 AVFEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISI-QQWPA--GTF----------GGPGMVVMAPNS------ 190 (484) Q Consensus 130 s~~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~-~q~~~--~~~----------~~~~~~~~~~~~------ 190 (484) +++++|.-.+|.+.+ ...+|+.+ +..+++.+..+.. +.+.. ... .....+.+...+ T Consensus 144 -a~~~~~~d~~~~~~~---~~i~p~~~-~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~ 218 (474) T protein:vir:94 144 -GARLAYIDTNGDIRI---KNIDPYNV-IFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQE 218 (474) T ss_pred -EEEEEEeCCCCeeEE---EEEcccce-EEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccc Confidence 567888766776543 34444443 1223333332211 00000 000 000111111100 Q ss_pred ---CcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHH Q lcl|NC_021302. 191 ---MGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLE 267 (484) Q Consensus 191 ---~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~ 267 (484) ...++..--++.|+ +|+.|.|.+..+-...---...+...+..++.|..++.++.|- ..+++... T Consensus 219 ~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~---~~~~~~~~---- 286 (474) T protein:vir:94 219 VGRYEHLFDYNPLFGVP-----NNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGM---GMSEEMIQ---- 286 (474) T ss_pred cccccCCCCccceEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccC---CCCchhhh---- Confidence 11111111223332 4678999998866555555667788888888877666666652 22333222 Q ss_pred HHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHHH----HHHHH Q lcl|NC_021302. 268 IASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASVQADT----FVQSV 343 (484) Q Consensus 268 ~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~evh~~v----~~~~~ 343 (484) ++... .+..+.+.+.+++++........++.+++.+.+.|...--+..++.++.+|. ..| +.... ....+ T Consensus 287 ---~~~~~-~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n-~Sg-~Al~~~~~~l~~k~ 360 (474) T protein:vir:94 287 ---ETQKS-GAFELFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGN-VPI-IGMKLKLMALENKC 360 (474) T ss_pred ---hhhhc-ceeEecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCccccccccccc-chH-HHHHHHHHHHHHHH Confidence 22111 2445568889999998776667889999999999988644444444432222 122 22222 22233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHh-CC--CC----ccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHh Q lcl|NC_021302. 344 QTVADEIRDVAQAHVVEDIVDV-NW--GE----DEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAA 415 (484) Q Consensus 344 ~aD~~~i~~~ln~qli~~l~~~-Nf--~~----~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~ 415 (484) ..-.+.+...+. ++++-++.+ +. .. .-.-.++.|. ..+.+..+.++++.+|+ |+ ++.+.+.+.+ T Consensus 361 ~~~~~~~~~~l~-~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~--g~-----iS~et~~~~l 432 (474) T protein:vir:94 361 MTFERKMTAMLR-YQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK--GQ-----VSERTRLGQS 432 (474) T ss_pred HHHHHHHHHHHH-HHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh--cc-----CchHHHHHhC Confidence 333445555553 455555442 21 11 1112466775 45678889999999985 54 2567777777 Q ss_pred CC-CCCCCCcccccc-cCCCcCCCccccCCCCccccccccccccccccccc Q lcl|NC_021302. 416 GL-PGPDPDADDDES-TADTGQDEPETDEPALPNTSGTTSTTNAPQARKRP 464 (484) Q Consensus 416 gl-p~p~~~e~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 464 (484) +. +.+..+-+.... ........+... .....+ ..+...+- T Consensus 433 ~~v~d~~~E~eri~~E~~e~~~~~~~~~---~~~~~~------~~~~~~s~ 474 (474) T protein:vir:94 433 QLVDDVDYELDEMEKESLEFNDKLPDID---EGDAND------KSQNNQSE 474 (474) T ss_pred CCCCCHHHHHHHHHHHHHHHHhhccccc---CCCcCC------CCccccCC Confidence 64 322110000000 000000000000 000000 00000000 No 179 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=96.85 E-value=0.0003 Score=39.91 Aligned_cols=412 Identities=11% Similarity=0.091 Sum_probs=157.2 Q ss_pred cceeeeecccccchhhhhhhc----cccccccccc----ccchHHHHHHHHhcchHHHHHHHHHHHHhhCC---CcEEec Q lcl|NC_021302. 9 RTERGYVNPLAGFGTFLAQGL----DQFEQVDELR----WPNSVYTYTRMCREEARIASVLRAIGLPIRRT---DWRIRP 77 (484) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~lr----~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~---~~~v~p 77 (484) +.+.. ..-.+..- .-++. .+..+..... ....|..+..+-+ .-| ..+.+|....... +-.+. T Consensus 1 m~~~~-~~~~~~~~--~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~-g~~--~~~~~~~~~~~~~~~~~~~~s- 73 (499) T protein:vir:80 1 MINQI-IAGVKGVM--RRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQ-GNY--AEWHNLNYEHNGNPVNRRQLS- 73 (499) T ss_pred ChhHH-HHHHHHHH--HHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhc-CCc--chhhccccccCCCccccceee- Confidence 11100 00000000 00010 0110000000 0133445544432 111 0111111100000 00000 Q ss_pred CCCCHHHHHHHHHHHHh-----hhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeee Q lcl|NC_021302. 78 NGARPEVVEHVAACLGL-----PVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPR 151 (484) Q Consensus 78 ~~~~~e~~~~~~~~l~~-----~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r 151 (484) -+=...+++..+..|.. .+.++..+....+.+..-.|...++.++ .|..+|-+++-+.|..+ |.+. +... T Consensus 74 ~n~~~~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~-~~~~---i~~v 149 (499) T protein:vir:80 74 MNLPKVTAKYMSKLLFNEKVKINIDDETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGN-KNVK---VSFA 149 (499) T ss_pred cchHHHHHHHHHHhhhCCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEECCC-CcEE---EEEE Confidence 00011122222222110 0111112222223333345777666665 69999999998877654 4332 3444 Q ss_pred CccceeeeeecCCCceeeee---------------c---ccccccccccce--ec---cCCCCcccc---------c--- Q lcl|NC_021302. 152 PQSSIAYWNVDRDGGLISIQ---------------Q---WPAGTFGGPGMV--VM---APNSMGPAI---------P--- 196 (484) Q Consensus 152 ~~~~~~~~~~~~dg~l~~~~---------------q---~~~~~~~~~~~~--~~---~~~~~~~~l---------p--- 196 (484) ++..+.-..++ .+++..+. + +.....+..... .+ .....|.++ + T Consensus 150 ~a~~~~Pi~~d-~~~~~~~~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~ 228 (499) T protein:vir:80 150 TADCMYPLSND-SENVDECLIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVV 228 (499) T ss_pred cCCceEEEEec-CCCeEEEEEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCce Confidence 44443211222 22221110 0 000000000000 00 000001111 1 Q ss_pred ------ccceEEEeec----CccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEE-------ecCCCCCCH Q lcl|NC_021302. 197 ------VEQLVVYTHD----MDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLK-------GNEADSEDD 259 (484) Q Consensus 197 ------~~k~l~~~~~----~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~-------gk~~~~~~~ 259 (484) .--|++++.. ...++|+|.|.+..+-...---...+..|+.-+|. ....+.+ .+...+... T Consensus 229 ~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~--~~~~i~v~~~~l~~~~~~~g~~~ 306 (499) T protein:vir:80 229 PLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL--GKKKVLVPSSFVKTAVNLDGSTT 306 (499) T ss_pred eecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHh--cccceecchhhhhccCCCCCCcc Confidence 1114555432 24578999999999877666666666667666663 2333333 111111110 Q ss_pred HHHHHHHHHHHHHhcCCceEEEcc---C--CceEEEecccCCchhHHHHHHHHHHHHHHHH-hhhh-hcccccccchhhH Q lcl|NC_021302. 260 DRMDELLEIASNYSGGESAGLALT---A--GEEAGILSPNGTPLDPRRAIEYHDHQMALVA-LAHF-LNLDGKGGSYALA 332 (484) Q Consensus 260 ~~~~~l~~~l~~~~~g~~a~~vip---~--~~~ie~~~~~~~~~~~~~li~~~d~~Isk~i-lGqt-lt~~~~gGs~A~~ 332 (484) ..- ..+...+..++ . +..|+..+..-....|...++.+-++|+..+ +++. ++.++ +|....- T Consensus 307 ~~~----------~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~-~g~~TAt 375 (499) T protein:vir:80 307 QYF----------DSTDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDE-NGLKTAT 375 (499) T ss_pred cCC----------CcccceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCc-ccchhHH Confidence 000 00111111111 1 1124444443333456666666666776655 3322 22222 2222122 Q ss_pred HHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHh-------CCCC-ccccceEEecC-CCCcHHHHHHHHHHHHhcCcc Q lcl|NC_021302. 333 SVQAD--TFVQSVQTVADEIRDVAQAHVVEDIVDV-------NWGE-DEPAPLLVFDE-IGSRQDATAAALQMLVNAGLL 401 (484) Q Consensus 333 evh~~--v~~~~~~aD~~~i~~~ln~qli~~l~~~-------Nf~~-~~~~P~~~~~~-~~~~~~~~ae~~~~L~~~G~~ 401 (484) ++..+ -....+..-.+.+...|. +|++.++.+ +... ....+.+.|+. ...+.++.++.+.+++.+|+. T Consensus 376 ei~s~~~~l~~~~~~~~~~~~~~l~-~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~ 454 (499) T protein:vir:80 376 EVVSEKSETYQTKNSHSQLIEQGIK-EMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMI 454 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCC Confidence 22211 122233444556666664 466665543 1111 22345778864 567878888999999999974 Q ss_pred cCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccccc Q lcl|NC_021302. 402 TPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTS 453 (484) Q Consensus 402 ~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (484) . .+.++.+.+|+++++-.+++..-........|.. +..+..|+.. T Consensus 455 S----~et~l~~~~~~~d~ea~~el~~i~~E~~~~~~~~---d~~g~~ge~e 499 (499) T protein:vir:80 455 P----LKIALQRAWNITEAEADEWAEMLAKEKQAEIPNN---DMTGIFGEEE 499 (499) T ss_pred C----HHHHHhhcCCCChHHHHHHHHHHHHHhhcCCCCC---CccccCCCCC Confidence 3 2467788889865432222211111111111111 1112222222 No 180 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=96.81 E-value=0.00032 Score=39.72 Aligned_cols=409 Identities=11% Similarity=0.024 Sum_probs=159.6 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccc---cchHHHHH---HHHh--------------------cc Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRW---PNSVYTYT---RMCR--------------------EE 54 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~---~~~~~~y~---~m~~--------------------~D 54 (484) =-|.+....++ ++.............|.........|. .+..+.|+ +++. .. T Consensus 6 ~~~~~~~~~~~--~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~ 83 (474) T protein:vir:94 6 RMPWDKPYGEE--VVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITT 83 (474) T ss_pred cccCCCchhhH--HHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeec Confidence 01111111111 111111110000000000000000000 00111111 0000 01 Q ss_pred hHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeee Q lcl|NC_021302. 55 ARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFE 133 (484) Q Consensus 55 ~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~E 133 (484) ....-++......+.+-+..+.. ++++..+++ ....+ -+|...+..+ .++.-||. +.+ T Consensus 84 n~~k~Ivd~~~~~l~g~p~~~~~--~d~~~~~~l-----------------~~~~~-n~~~~~~~e~~~~~~~~G~-~~~ 142 (474) T protein:vir:94 84 NFHQNLVDQKVSYVASKPVTYSC--EDENVLKVI-----------------HDVLD-TRWDNKLIDILTATSNKGI-DWL 142 (474) T ss_pred chHHHHHHHHHhhhhcCCceecc--CcHHHHHHH-----------------HHHHh-ccHHHHHHHHHHHHhhcCc-eEE Confidence 11122222222333333333321 111111111 11222 2566666555 57888997 567 Q ss_pred EEEeecCCeeeeeeeeeeCccceeeeeecC--CCceeeeec-ccccccc------cccceeccCCC-------------- Q lcl|NC_021302. 134 QTYFYEGGRFWLKRLAPRPQSSIAYWNVDR--DGGLISIQQ-WPAGTFG------GPGMVVMAPNS-------------- 190 (484) Q Consensus 134 ivw~~~~g~~~~~~l~~r~~~~~~~~~~~~--dg~l~~~~q-~~~~~~~------~~~~~~~~~~~-------------- 190 (484) ++|.-.+|.+. +...+|+.+. ..+++ .+.++..-. +...... ......+.... T Consensus 143 ~~~~d~~~~~~---i~~~~p~~~~-~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~ 218 (474) T protein:vir:94 143 QVYINENGEMK---LFRVPAEQAI-PIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHV 218 (474) T ss_pred EEEecCCCeeE---EEEEcccceE-EEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcc Confidence 88876677654 4444555442 12222 233321111 1000000 00000000000 Q ss_pred ----CcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHH Q lcl|NC_021302. 191 ----MGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELL 266 (484) Q Consensus 191 ----~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~ 266 (484) ....+..--++.| .+|++|.|.+..+-...---...+..++..++.+.+++.++.|-. +.+.++ T Consensus 219 ~~~~~~~~~g~vPvv~~-----~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~--~~~~~~----- 286 (474) T protein:vir:94 219 QSHFSNGNWGRVPFIAF-----KNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYE--GEDLEE----- 286 (474) T ss_pred cccccccCCCccceEEe-----cCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC--cccchh----- Confidence 0011111112322 247889999988665555556778888888898877666655522 222111 Q ss_pred HHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH-HHH--HHHHHHH Q lcl|NC_021302. 267 EIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALAS-VQA--DTFVQSV 343 (484) Q Consensus 267 ~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~e-vh~--~v~~~~~ 343 (484) ...++.. ...+.++.+.+++++........++..++.+.+.|...-.+..++.++.+|+ ..|. ... .-....+ T Consensus 287 -~~~~~~~--~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n-~Sg~Al~~~~~~l~~k~ 362 (474) T protein:vir:94 287 -FMRGLKY--YKAINVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSA-PSGIALKFLYGNLDLKA 362 (474) T ss_pred -hhhhhhc--cceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccc-cHHHHHHHHHHHHHHHH Confidence 1233322 2456788899999998777777899999999999888755544554433222 1121 111 1222223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCC-ccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCC Q lcl|NC_021302. 344 QTVADEIRDVAQAHVVEDIVDVNWGE-DEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGP 420 (484) Q Consensus 344 ~aD~~~i~~~ln~qli~~l~~~Nf~~-~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p 420 (484) ..-.+.+...+. ++++.++.+.-.. +..--.+.|. ..+.+..+.++.+ +..|+. +.+.+.+.++. +.+ T Consensus 363 ~~k~~~~~~~l~-~~~~li~~~~~~~~d~~~i~v~f~~~~p~~~~e~a~~~---~~~g~i-----S~et~l~~l~~v~D~ 433 (474) T protein:vir:94 363 NKLKNKATVAIQ-ELISFIIDFNNLKTDVKDIEISFNFNRMMNDAEQSQII---AQSQYL-----SRETLVKSSPLVDDY 433 (474) T ss_pred HHHHHHHHHHHH-HHHHHHHHHhCCCcccceeeEEeccCcccCHHHHHHHH---HHcCCC-----CHHHHHHhCCCCCCH Confidence 333455555553 4666666653211 1111245564 3345555555554 445652 45677777753 332 Q ss_pred CCCcccccccCCCcCCCccccCCCCccccccccccccccccccccccch Q lcl|NC_021302. 421 DPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSP 469 (484) Q Consensus 421 ~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (484) ..+-+...... .+.....+.... .......+.........+ T Consensus 434 ~~E~eri~~E~----~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:94 434 KAELERIEQEQ----MEYNKQLPNLDD----GGADGAQQQEGSNNKESE 474 (474) T ss_pred HHHHHHHHHHH----HHHHhhccccCC----CCCCCcccCCCCcccccC Confidence 21111000000 000000000000 000000000000000000 No 181 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=96.81 E-value=0.00032 Score=39.72 Aligned_cols=409 Identities=11% Similarity=0.024 Sum_probs=159.6 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccc---cchHHHHH---HHHh--------------------cc Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRW---PNSVYTYT---RMCR--------------------EE 54 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~---~~~~~~y~---~m~~--------------------~D 54 (484) =-|.+....++ ++.............|.........|. .+..+.|+ +++. .. T Consensus 6 ~~~~~~~~~~~--~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~ 83 (474) T protein:vir:97 6 RMPWDKPYGEE--VVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITT 83 (474) T ss_pred cccCCCchhhH--HHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeec Confidence 01111111111 111111110000000000000000000 00111111 0000 01 Q ss_pred hHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeee Q lcl|NC_021302. 55 ARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFE 133 (484) Q Consensus 55 ~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~E 133 (484) ....-++......+.+-+..+.. ++++..+++ ....+ -+|...+..+ .++.-||. +.+ T Consensus 84 n~~k~Ivd~~~~~l~g~p~~~~~--~d~~~~~~l-----------------~~~~~-n~~~~~~~e~~~~~~~~G~-~~~ 142 (474) T protein:vir:97 84 NFHQNLVDQKVSYVASKPVTYSC--EDENVLKVI-----------------HDVLD-TRWDNKLIDILTATSNKGI-DWL 142 (474) T ss_pred chHHHHHHHHHhhhhcCCceecc--CcHHHHHHH-----------------HHHHh-ccHHHHHHHHHHHHhhcCc-eEE Confidence 11122222222333333333321 111111111 11222 2566666555 57888997 567 Q ss_pred EEEeecCCeeeeeeeeeeCccceeeeeecC--CCceeeeec-ccccccc------cccceeccCCC-------------- Q lcl|NC_021302. 134 QTYFYEGGRFWLKRLAPRPQSSIAYWNVDR--DGGLISIQQ-WPAGTFG------GPGMVVMAPNS-------------- 190 (484) Q Consensus 134 ivw~~~~g~~~~~~l~~r~~~~~~~~~~~~--dg~l~~~~q-~~~~~~~------~~~~~~~~~~~-------------- 190 (484) ++|.-.+|.+. +...+|+.+. ..+++ .+.++..-. +...... ......+.... T Consensus 143 ~~~~d~~~~~~---i~~~~p~~~~-~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~ 218 (474) T protein:vir:97 143 QVYINENGEMK---LFRVPAEQAI-PIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHV 218 (474) T ss_pred EEEecCCCeeE---EEEEcccceE-EEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcc Confidence 88876677654 4444555442 12222 233321111 1000000 00000000000 Q ss_pred ----CcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHH Q lcl|NC_021302. 191 ----MGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELL 266 (484) Q Consensus 191 ----~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~ 266 (484) ....+..--++.| .+|++|.|.+..+-...---...+..++..++.+.+++.++.|-. +.+.++ T Consensus 219 ~~~~~~~~~g~vPvv~~-----~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~--~~~~~~----- 286 (474) T protein:vir:97 219 QSHFSNGNWGRVPFIAF-----KNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYE--GEDLEE----- 286 (474) T ss_pred cccccccCCCccceEEe-----cCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC--cccchh----- Confidence 0011111112322 247889999988665555556778888888898877666655522 222111 Q ss_pred HHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH-HHH--HHHHHHH Q lcl|NC_021302. 267 EIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALAS-VQA--DTFVQSV 343 (484) Q Consensus 267 ~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~e-vh~--~v~~~~~ 343 (484) ...++.. ...+.++.+.+++++........++..++.+.+.|...-.+..++.++.+|+ ..|. ... .-....+ T Consensus 287 -~~~~~~~--~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n-~Sg~Al~~~~~~l~~k~ 362 (474) T protein:vir:97 287 -FMRGLKY--YKAINVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFGSA-PSGIALKFLYGNLDLKA 362 (474) T ss_pred -hhhhhhc--cceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccccc-cHHHHHHHHHHHHHHHH Confidence 1233322 2456788899999998777777899999999999888755544554433222 1121 111 1222223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCC-ccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCC Q lcl|NC_021302. 344 QTVADEIRDVAQAHVVEDIVDVNWGE-DEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGP 420 (484) Q Consensus 344 ~aD~~~i~~~ln~qli~~l~~~Nf~~-~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p 420 (484) ..-.+.+...+. ++++.++.+.-.. +..--.+.|. ..+.+..+.++.+ +..|+. +.+.+.+.++. +.+ T Consensus 363 ~~k~~~~~~~l~-~~~~li~~~~~~~~d~~~i~v~f~~~~p~~~~e~a~~~---~~~g~i-----S~et~l~~l~~v~D~ 433 (474) T protein:vir:97 363 NKLKNKATVAIQ-ELISFIIDFNNLKTDVKDIEISFNFNRMMNDAEQSQII---AQSQYL-----SRETLVKSSPLVDDY 433 (474) T ss_pred HHHHHHHHHHHH-HHHHHHHHHhCCCcccceeeEEeccCcccCHHHHHHHH---HHcCCC-----CHHHHHHhCCCCCCH Confidence 333455555553 4666666653211 1111245564 3345555555554 445652 45677777753 332 Q ss_pred CCCcccccccCCCcCCCccccCCCCccccccccccccccccccccccch Q lcl|NC_021302. 421 DPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSP 469 (484) Q Consensus 421 ~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (484) ..+-+...... .+.....+.... .......+.........+ T Consensus 434 ~~E~eri~~E~----~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~e 474 (474) T protein:vir:97 434 KAELERIEQEQ----MEYNKQLPNLDD----GGADGAQQQEGSNNKESE 474 (474) T ss_pred HHHHHHHHHHH----HHHHhhccccCC----CCCCCcccCCCCcccccC Confidence 21111000000 000000000000 000000000000000000 No 182 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=96.80 E-value=0.00033 Score=39.67 Aligned_cols=397 Identities=12% Similarity=0.065 Sum_probs=157.0 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcc-cccc---cccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEe Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLD-QFEQ---VDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIR 76 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~---~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~ 76 (484) =-+.-+.+...+.++.....+=.+-...+. .... ...-+....+.+...+. .+-..-+++-+-.|. T Consensus 27 ~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k~i~----------~~~a~~l~~~p~~i~ 96 (496) T protein:vir:38 27 DHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPKVTA----------KYMSKLLFNEKVKIN 96 (496) T ss_pred hcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecchHHHHH----------HHHhhhhhCCcceEe Confidence 134444555444433221110000000000 0000 00001111122232332 222234556555554 Q ss_pred cCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCeeeeeeeeeeCccc Q lcl|NC_021302. 77 PNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSS 155 (484) Q Consensus 77 p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~ 155 (484) . ++++..+++.+.+ ....|.+.++.+ ..|..+|-+++=+.|.. +|.+. +..+++.. T Consensus 97 ~--~d~~~~e~l~~~~-----------------~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~-~~~~~---i~~v~~~~ 153 (496) T protein:vir:38 97 I--DDKAAEEFVLNVL-----------------KTNGFTKNMERYIEYGEAMGGFVIKVYHDG-NKNVK---VSFATADC 153 (496) T ss_pred e--CChHHHHHHHHHH-----------------hccCHHHHHHHHHHHHhhhCcEEEEEEEcC-CCcEE---EEEEcccc Confidence 3 2334444444433 223577766665 47999998777665543 34432 34444444 Q ss_pred eeeeeecCCCcee---eeeccccccc------------ccc----cceecc-CCCCcccc---------cc-------cc Q lcl|NC_021302. 156 IAYWNVDRDGGLI---SIQQWPAGTF------------GGP----GMVVMA-PNSMGPAI---------PV-------EQ 199 (484) Q Consensus 156 ~~~~~~~~dg~l~---~~~q~~~~~~------------~~~----~~~~~~-~~~~~~~l---------p~-------~k 199 (484) +.-...+ .+++. .+.+...+.. +.. ..+... ....+.++ ++ .+ T Consensus 154 ~~P~~~~-~~~~~~~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~~~~~~~ 232 (496) T protein:vir:38 154 MYPLSND-SENVDECVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPVVPLPDFTR 232 (496) T ss_pred eEEEEec-CCcEEEEEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCccccccccccccccceeecCCCc Confidence 3211111 12221 0000000000 000 000000 00001111 11 11 Q ss_pred --eEEEe----ecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_021302. 200 --LVVYT----HDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYS 273 (484) Q Consensus 200 --~l~~~----~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~ 273 (484) |++++ .....++|+|.|.+..+-...-.-...+..|+.-++. |.+.+..+. .++....+.. T Consensus 233 ~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~---~~~~i~v~~----------~~l~~~~~~~ 299 (496) T protein:vir:38 233 PTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL---GKKKVLVPS----------SFVKTAVNLD 299 (496) T ss_pred ceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhh---cccceecch----------HHhhccCCCC Confidence 22332 2335678999999999876665555556666655552 444333211 0000000000 Q ss_pred --------cCCceEEEc---c--CCceEEEecccCCchhHHHHHHHHHHHHHHHH-hhh-hhcccccccchhhHHHHHHH Q lcl|NC_021302. 274 --------GGESAGLAL---T--AGEEAGILSPNGTPLDPRRAIEYHDHQMALVA-LAH-FLNLDGKGGSYALASVQADT 338 (484) Q Consensus 274 --------~g~~a~~vi---p--~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~i-lGq-tlt~~~~gGs~A~~evh~~v 338 (484) .+.....++ + .+..|+.+...-....|...++.+-++|+..+ +++ +++.++ +|.....++.... T Consensus 300 g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~-~g~~tAtei~~~~ 378 (496) T protein:vir:38 300 GSTTQYFDSTDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDE-NGLKTATEVVSEK 378 (496) T ss_pred CccccCCCCccceEEEeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCc-cccchHHHHHHHH Confidence 000001111 1 11123333333333456666666666666554 222 222222 2222122332222 Q ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHHHh-------CC-CCccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCccc Q lcl|NC_021302. 339 --FVQSVQTVADEIRDVAQAHVVEDIVDV-------NW-GEDEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRL 407 (484) Q Consensus 339 --~~~~~~aD~~~i~~~ln~qli~~l~~~-------Nf-~~~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~ 407 (484) ....+..-.+.+...|. ++++.++.+ +. .....-+.+.|+ ....+.++.++.+.+++++|+.- . T Consensus 379 ~~l~~~~~~~~~~~~~~l~-~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS----~ 453 (496) T protein:vir:38 379 SETYQTKNSHSQLIEQGIK-EMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIP----L 453 (496) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCC----H Confidence 22223334555666663 466665533 11 112223578886 45677888889999999999743 2 Q ss_pred HHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccccc Q lcl|NC_021302. 408 EAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTS 453 (484) Q Consensus 408 ~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (484) +.++.+.+|+++++-.++...-....+...|.. +..+..++.. T Consensus 454 et~l~~~~~~~d~ea~~el~ri~~E~~~~~~~~---d~~~~~~~~e 496 (496) T protein:vir:38 454 KIALQRAWNITEAEADEWAEMLAKEKQAEMPNN---DMNGIFGEEE 496 (496) T ss_pred HHHHHhcCCCChHHHHHHHHHHHHhhhccCccc---cccCCCCCCC Confidence 456777778865432222111111111111111 1111112211 No 183 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=96.76 E-value=0.00036 Score=39.47 Aligned_cols=394 Identities=8% Similarity=0.015 Sum_probs=166.6 Q ss_pred CCCCCCCccceeeeecccccch--hhhhhhcccccccccc-cccchHHHHH---HHHh---------------------- Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFG--TFLAQGLDQFEQVDEL-RWPNSVYTYT---RMCR---------------------- 52 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~l-r~~~~~~~y~---~m~~---------------------- 52 (484) +-|+.-. .-..... ......+........+ +..+..+.|+ +++. T Consensus 6 ~~~~~~~--------~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki 77 (479) T protein:vir:79 6 ISETDLI--------KVQLKKESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKA 77 (479) T ss_pred ecccceE--------eeccccCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccccccCccee Confidence 2221111 1110000 0011111111000000 0011112221 0000 Q ss_pred cchHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhccee Q lcl|NC_021302. 53 EEARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAV 131 (484) Q Consensus 53 ~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~ 131 (484) ..+...-++.+....+.+-+..+.. ++++..+.+... . .-+|++.+..+ .++..||.+ T Consensus 78 ~~~~~~~Ivd~~~~~l~g~p~~~~~--~~~~~~~~~~~~-----------------~-~n~~~~~~~~~~~~~~~~G~~- 136 (479) T protein:vir:79 78 INNYHKLLVDQKVGYSVGNPIVFNA--DDDNLTKLLNDL-----------------L-GEEFDDTITELYLNASNKGVE- 136 (479) T ss_pred ecchHHHHHHHHHhhhhcCCceecc--CCHHHHHHHHHH-----------------H-hcCHHHHHHHHHHHHHhcCeE- Confidence 0111222333444444555544432 222222222211 1 12577766665 478889976 Q ss_pred eeEEEeecCCeeeeeeeeeeCccceeeeeecC--CCceeee-eccc---cccccc--------ccceeccC--C------ Q lcl|NC_021302. 132 FEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDR--DGGLISI-QQWP---AGTFGG--------PGMVVMAP--N------ 189 (484) Q Consensus 132 ~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~--dg~l~~~-~q~~---~~~~~~--------~~~~~~~~--~------ 189 (484) ++++|.-.+|.+. +...+|+.+. ..+++ .+.++.. +.+. ...... .....+.. . T Consensus 137 ~~~v~~d~~~~~~---i~~~~p~~~~-~v~d~~~~~~~~~~ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~ 212 (479) T protein:vir:79 137 WLHPYINRKGEFK---YVIIPAEEAI-PIWDSKRQRELVAFIRFYYIEDIDGNKIKRVEYYTENDITYFIERGNSFIQEF 212 (479) T ss_pred EEEEEeCCCCceE---EEEEccceeE-EEEeCCCCCceEEEEEEEEEeecCCceEEEEEEEeCCcEEEEEecCCcccccc Confidence 5677765666654 3334444432 11221 1222211 0000 000000 00000000 0 Q ss_pred -------------------CCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEE Q lcl|NC_021302. 190 -------------------SMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLK 250 (484) Q Consensus 190 -------------------~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~ 250 (484) .....+..--++.++ +|++|.|.+..+....-.-...+.+++..++.|..++.++. T Consensus 213 ~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~-----nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~ 287 (479) T protein:vir:79 213 LYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFK-----NNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLK 287 (479) T ss_pred cccccccccccccccccccccccCCCcccEEEec-----CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeee Confidence 001111111233332 46789999987666555556677889999998877666665 Q ss_pred ecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchh Q lcl|NC_021302. 251 GNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYA 330 (484) Q Consensus 251 gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A 330 (484) |-. +...++ ...++..+ ..+.++.+.++++++.......++..++.+.+.|...--+..++.++. |. + T Consensus 288 g~~--~~~~~~------~~~~~~~~--~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-gn-~ 355 (479) T protein:vir:79 288 EYP--GTSLQE------FIDNIRYY--KSIKVDGGGGVDKLEINIPVEAKKELLDRLEKNIIIFGQGVNPESQNT-GD-K 355 (479) T ss_pred cCC--cccccc------chhhhhhc--cceecCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCccccccccc-cc-h Confidence 522 112111 12233222 345678899999998777777899999999999988866666655433 22 1 Q ss_pred hHHH-H--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCC--C--ccccceEEec-CCCCcHHHHHHHHHHHHhcCcc Q lcl|NC_021302. 331 LASV-Q--ADTFVQSVQTVADEIRDVAQAHVVEDIVDV-NWG--E--DEPAPLLVFD-EIGSRQDATAAALQMLVNAGLL 401 (484) Q Consensus 331 ~~ev-h--~~v~~~~~~aD~~~i~~~ln~qli~~l~~~-Nf~--~--~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~ 401 (484) .|.. + ..-....+..-.+.+...+. ++++.++.+ +.. . +..-+.+.|. ..+.++.+.++.+.+|+ |+ T Consensus 356 Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~i~i~f~~~~p~~~~~~a~~~~kl~--g~- 431 (479) T protein:vir:79 356 SGVALKFLYSLLDLKCSKTEKKFKKAIR-ELLWFVCEYLKISGNKSYDYKTVQITFNHSMIINEAEKIDMAAKST--GI- 431 (479) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCCCccccccceEEeCCCCCcCHHHHHHHHHHHh--cc- Confidence 1211 1 12222233333444555553 355555443 221 1 1223477775 35677888899998885 64 Q ss_pred cCCcccHHHHHHHhCC-CCCCCCcccccccCCCcCCCccccCCCCccccccccccccccc Q lcl|NC_021302. 402 TPDPRLEAFLRDAAGL-PGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQA 460 (484) Q Consensus 402 ~~~~~~~~~i~e~~gl-p~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (484) ++.+.+.+.++. +.+..+-+..........+ .....+ . ..++. ..++ T Consensus 432 ----iS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~-~~~~~~---~-~~~~~---~~e~ 479 (479) T protein:vir:79 432 ----VSDETIVSNHPWVEDVNDELERLKKQEDTQKE-YDDLIP---N-NQDGV---IDET 479 (479) T ss_pred ----CcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHH-HHhccC---c-ccCCC---cCcC Confidence 245667777764 2221110000000000000 000000 0 00000 0001 No 184 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=96.71 E-value=0.00039 Score=39.26 Aligned_cols=379 Identities=9% Similarity=-0.004 Sum_probs=160.0 Q ss_pred chhhhhhhcccccccccccccchHHHHHH---HH--------------------hcchHHHHHHHHHHHHhhCCCcEEec Q lcl|NC_021302. 21 FGTFLAQGLDQFEQVDELRWPNSVYTYTR---MC--------------------REEARIASVLRAIGLPIRRTDWRIRP 77 (484) Q Consensus 21 ~~~~~~~~~~~~~~~~~lr~~~~~~~y~~---m~--------------------~~D~~v~s~l~~r~~~v~~~~~~v~p 77 (484) ...--+.-+.........|-.+..+.|+- ++ -......-++.+...-+.+-+..+.. T Consensus 1 l~~~~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~~ 80 (451) T protein:vir:10 1 MELEKIRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFTYPVLFDI 80 (451) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheecccceeec Confidence 00000000000000000000011111110 00 00223333344444444555544432 Q ss_pred CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecC--------Ceeeeeee Q lcl|NC_021302. 78 NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEG--------GRFWLKRL 148 (484) Q Consensus 78 ~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~--------g~~~~~~l 148 (484) . ++.+..+++.+.+ .-+|+.....+ .++.-||.+. +++|...+ |.+ ++ T Consensus 81 ~-~~~~~~~~~~~~~------------------~n~~~~~~~~~~~~~~~~G~a~-~~~y~de~~~~~~~~~~~~---~~ 137 (451) T protein:vir:10 81 D-NNKELNEKVTDVL------------------GNEFTRKAKNLAIEASNCGSAW-LHYWIDEEYSGEQVTNQTF---KY 137 (451) T ss_pred C-CcHHHHHHHHHHh------------------ccCHHHHHHHHHHHHhhcCeEE-EEEeecCCcccccccccce---eE Confidence 2 1222222222221 12577776665 5788899775 55554332 333 24 Q ss_pred eeeCcccee-eeeecCCCceeeeeccc---ccccc--------------cccceeccC---C---------CCccccccc Q lcl|NC_021302. 149 APRPQSSIA-YWNVDRDGGLISIQQWP---AGTFG--------------GPGMVVMAP---N---------SMGPAIPVE 198 (484) Q Consensus 149 ~~r~~~~~~-~~~~~~dg~l~~~~q~~---~~~~~--------------~~~~~~~~~---~---------~~~~~lp~~ 198 (484) ..++|+.+. .|.-..++.+...-.+. ....+ ......+.. + .....+..- T Consensus 138 ~~i~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v 217 (451) T protein:vir:10 138 GVVNTEEIIPIYRNGIERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSV 217 (451) T ss_pred EEEcccceEEEEcCCCCCceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCee Confidence 444444432 12111222322111100 00000 000000000 0 000111111 Q ss_pred ceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCce Q lcl|NC_021302. 199 QLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESA 278 (484) Q Consensus 199 k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a 278 (484) -++.|. +|..|.|.+..+-...-.-...+...+..++.|..++.++.|- .+.+.++ .+.++... . T Consensus 218 Pvv~~~-----nn~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~--~~~~~~~------~~~~~~~~--~ 282 (451) T protein:vir:10 218 PFVEFS-----NNIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENF--GGEDTSE------FLKELKRY--K 282 (451) T ss_pred eEEEec-----cCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecC--Ccccchh------hHHHHhhC--C Confidence 123222 3566888888776666556677888888888876655555442 1222111 12233222 2 Q ss_pred EEEcc-----CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHH---HHHHHHHHHHHHHHH Q lcl|NC_021302. 279 GLALT-----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASVQ---ADTFVQSVQTVADEI 350 (484) Q Consensus 279 ~~vip-----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~evh---~~v~~~~~~aD~~~i 350 (484) .+.++ .+.++++++.......++..++++.+.|...--+..++.++.|. +.|..- ..-....+..-.+.+ T Consensus 283 ~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn--~Sg~Alk~~~~~l~~k~~~k~~~f 360 (451) T protein:vir:10 283 TIKTETDSEGDSGGLKTMQIEIPTEARKIILEILKKQIYESGQGLQQDTENFGN--ASGVALKFFYRKLELKSGLLETEF 360 (451) T ss_pred eEEecCcCCccCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCccccccccccc--ccHHHHHHHHHHHHHHHHHHHHHH Confidence 33444 34578888877667789999999999999986555555543322 222211 222222333444555 Q ss_pred HHHHHHHHHHHHHHhCCCCccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCC-CCCCccccc Q lcl|NC_021302. 351 RDVAQAHVVEDIVDVNWGEDEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPG-PDPDADDDE 428 (484) Q Consensus 351 ~~~ln~qli~~l~~~Nf~~~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~-p~~~e~~~~ 428 (484) ...+ +++++.++.+.-..+..-..+.|. ..+.++.+.++.+.+|+ |+ +|++.+.+.++.-. ++....... T Consensus 361 ~~~l-~~~~~li~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~--g~-----iS~et~~~~~p~v~d~~~e~~~~~ 432 (451) T protein:vir:10 361 RTSF-DKLIKAILYFLGVTDYKKIQQTYTRNMMSNDLEDADIATKSV--GI-----IPTKIILRHHPWVDDVEEAEKLYL 432 (451) T ss_pred HHHH-HHHHHHHHHHhCCCCccceeEEecCCCCCCHHHHHHHHHHHh--cc-----CchHHHHHhCCCCCCHHHHHHHHH Confidence 5666 346666665432112222356775 45677888899999985 53 25667777776532 211111100 Q ss_pred ccCCCcCCCccccCCCCcc Q lcl|NC_021302. 429 STADTGQDEPETDEPALPN 447 (484) Q Consensus 429 ~~~~~~~~~~~~~~~~~~~ 447 (484) ..............+.... T Consensus 433 ee~~~~~~~~~~~~~~~~~ 451 (451) T protein:vir:10 433 EEKKIQASKVSDDYNNFTE 451 (451) T ss_pred HHHHHHHHHHHhhcCCCCC Confidence 0000000000111111111 No 185 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=96.66 E-value=0.00043 Score=39.03 Aligned_cols=438 Identities=12% Similarity=0.043 Sum_probs=187.5 Q ss_pred CCCCCCCccceeeeecccccchhh-hhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEec-- Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTF-LAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRP-- 77 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p-- 77 (484) -+|+.+...+++-. +....+... .+++.-. ...-.....|+.|++|...++.|.++++.....+.-.+=.-.| T Consensus 27 ~~p~~~dG~s~i~~-~~~~~~~~~~~~~~~~g---g~~~n~~eLI~~YR~ma~~~pEVd~AideIvneaiv~d~~~~pV~ 102 (533) T protein:vir:58 27 GAPHGAGGSSMIPI-NMYHPFATAGYASRFYG---GIEFNRFFLYDMYDRMDYTDPLISTVLDIIADECTIPNENGNIVD 102 (533) T ss_pred cCccCCCCCccccC-CCCcchhhhhhhhhhhc---cccccHHHHHHHHHHhhccCcchhhHHHhhhceeeEecCCCceeE Confidence 45666666665531 111111111 1111100 0111124579999999767999999999887655433211111 Q ss_pred -CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccc Q lcl|NC_021302. 78 -NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSS 155 (484) Q Consensus 78 -~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~ 155 (484) +-+..+..+.+.+-+.. -.+|+.-..++. .-..+|--.+.++= ++..-.+.+|...+|+. T Consensus 103 v~l~~~e~s~~iK~kI~~----------------lldf~~~~~~~fR~WYVDGriy~Hkii--k~~k~GI~elr~lDPr~ 164 (533) T protein:vir:58 103 VVTKDIELAKAILSYLDY----------------VINIEKNAYPIIRNMIKYGDMFLHILE--KGSDGTIEKFQVVSPYI 164 (533) T ss_pred eecccccccHHHHHHHHH----------------HhcchhhhhHHHHhhhhcceeEEEecc--CCcccchhhheecCCee Confidence 11111223333222211 112333222222 12334544454431 23345567888888888 Q ss_pred eeeeeecCCCceeeeecccccccccccceeccCCCCcccccccceEEEeec-CccCccccchhHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 156 IAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHD-MDPGVWTGNSLLRPAYKNWKLKDELIRI 234 (484) Q Consensus 156 ~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~-~~~~~p~G~gll~~~~~~~~~K~~~~~~ 234 (484) +.+++- ...+..+..-.+ ......+....+.||.+-.+++.+. .....+++.|.|+++..++=--+...-. T Consensus 165 i~~vr~-~~t~~eyyvy~~-------~~~~~~s~~~~~kI~~daI~y~~SGl~d~~~~~iisyLhkAiKp~NQLkmiEDA 236 (533) T protein:vir:58 165 FSKRYN-PETDTWYYVITD-------VYRNVVSGYFNEDIPEEDVIHFSHKIDTNFFPYGRSYLESARAIWNQLRLMEDA 236 (533) T ss_pred eEEEEe-eccceEEEeecc-------cccccccCccccccchhheeeeeeccccCCCCceehhhhHHHHHHHHHHHHHHH Confidence 865432 222211111111 1122334556688887655544444 3456799999999999888666665555 Q ss_pred HHHH-HHHhcCCcc-eEEecCCCCCCHHHHHHHHHHHHHHhcC----CceEEE-----------------cc-----CCc Q lcl|NC_021302. 235 EAAA-IRRHGIGVP-YLKGNEADSEDDDRMDELLEIASNYSGG----ESAGLA-----------------LT-----AGE 286 (484) Q Consensus 235 w~~f-~Er~~~G~P-~~~gk~~~~~~~~~~~~l~~~l~~~~~g----~~a~~v-----------------ip-----~~~ 286 (484) .+.| +-|- |-. +.-...+.-...+.-+.|.+++..+++. .++|=| +| .|+ T Consensus 237 lVIYRisRA--PeRRvFYIDVGNlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgT 314 (533) T protein:vir:58 237 LMLYRVVRS--VDRRVFYVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAV 314 (533) T ss_pred HHHHhhcCC--hhheEEEEeecCCCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhcccccCCCccc Confidence 5554 1120 000 2222223323333334455555555432 111211 12 468 Q ss_pred eEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHH-HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 287 EAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASV-QADT-FVQSVQTVADEIRDVAQAHVVEDIVD 364 (484) Q Consensus 287 ~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~ev-h~~v-~~~~~~aD~~~i~~~ln~qli~~l~~ 364 (484) +|+++.++ + -.-.+-|+|..+++-+++....--.+.++|....+++ ..++ |...+......+...|.+||+ T Consensus 315 EI~TLpGg-~-lgemeDV~YF~kkLy~ALnVP~sRl~~e~~fgr~~eItRDEiKF~KFI~rLR~rF~~ll~~qLi----- 387 (533) T protein:vir:58 315 EIDILQGS-K-VDLAEDVEYMLNRLISALKVPKAFIGYEGDVNAKNTLATQDIKFNNTIKRIQGFFVEELERMVR----- 387 (533) T ss_pred eeeecCCC-C-CCcHHHHHHHHHHHHHHhCCCeeecCCCCCCccchhhhHHHHHHHHHHHHHHHHHHHHHhcccc----- Confidence 99999853 2 3445779999999999987765444433333222333 3333 445556666677777776654 Q ss_pred hCCC-CccccceEEecCCC--C---cHHHHHHHHHHHHhcCcccCCcccHHHHHHH-hCCCCCCC--------------- Q lcl|NC_021302. 365 VNWG-EDEPAPLLVFDEIG--S---RQDATAAALQMLVNAGLLTPDPRLEAFLRDA-AGLPGPDP--------------- 422 (484) Q Consensus 365 ~Nf~-~~~~~P~~~~~~~~--~---~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~-~glp~p~~--------------- 422 (484) ++-- .... -+|.|.... + +.+.+.+++..|..+- | .+.+.||++. +.++.... T Consensus 388 lk~iit~ee-w~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~d---p-yvgk~yi~k~ILr~tdei~~q~e~ie~E~~~~~~ 462 (533) T protein:vir:58 388 MNKEFADQD-FRLVMNRSNSIVEGERFAVIEQRIGIAERLK---G-WVREDWIYSNILQIPYDLKPQEEVAEAAGGGGLF 462 (533) T ss_pred cccCcchhh-eeeeeeccchHHHHHHHHHHHHHHHHHHHhc---c-hhhHHHHHHHHhcCChhhhHHHHHHHHhhcCCCC Confidence 3311 1111 144442211 1 2233445555554421 1 3445676443 45443110 Q ss_pred -----CcccccccCCCcCCCccccCCC-Cccccccc---cccccc--cccccccccchHHHhcCcccCccc Q lcl|NC_021302. 423 -----DADDDESTADTGQDEPETDEPA-LPNTSGTT---STTNAP--QARKRPRGRSPRDRRKTPDGAMPL 482 (484) Q Consensus 423 -----~e~~~~~~~~~~~~~~~~~~~~-~~~~~~~~---~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~ 482 (484) +++..+..-.+....|.+.++. .....|+. ..+... +.+..........+++.|.-+-.- T Consensus 463 ~~~~~~~e~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~g~~~~~~~~p~~~ 533 (533) T protein:vir:58 463 DTGGFGEETTPADFLGERGSPIESPRGRTEFDFGTEGGEELGGELNLGGAFEEFEEETGGGEEELPFPEEE 533 (533) T ss_pred CCCCcccccCCcccCccccCcccCCCChhhHhcccCCcccccccccccccchhhhhhcCCcccCCCCCCCC Confidence 0000000000000000000000 00000000 000000 000000000111122221111111 No 186 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=96.66 E-value=0.00043 Score=39.03 Aligned_cols=407 Identities=11% Similarity=0.029 Sum_probs=169.3 Q ss_pred CCCCCC-----Ccccee-eeecccccchhhhhhhccc----c-cccccccccchHHHHHHHHhcchHHHHHHHHHHHHhh Q lcl|NC_021302. 1 MAPKTV-----APRTER-GYVNPLAGFGTFLAQGLDQ----F-EQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIR 69 (484) Q Consensus 1 ~~~~~~-----~~~~~~-~~~~~~~~~~~~~~~~~~~----~-~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~ 69 (484) +-++-. .+.+.+ -++..+ ..+..|--. . ......+ +. ..+ ......-++.+....+. T Consensus 40 ~~~~~i~~~i~~~~~~~~~r~~~l----~~Yy~g~~~i~~~~~~~~~~~~-~~-----~ki--~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:99 40 QNVNEVSKYIEHHMDYQRPRLKVL----SDYYEGKTKNLVELTRRKEEYM-AD-----NRV--AHDYASYISDFINGYFL 107 (511) T ss_pred ccHHHHHHHHHHHHHhhHHHHHHH----HHHhcccCccccccCccccccc-Cc-----cee--ecchHHHHHHHHHhhhc Confidence 000000 000000 000000 011111000 0 0000000 00 011 12344445555566667 Q ss_pred CCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeee Q lcl|NC_021302. 70 RTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRL 148 (484) Q Consensus 70 ~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l 148 (484) +-+..+... +++..+++.+.+ ...+|+.....+. ++..||. +.+++|.-.+|.+.+ T Consensus 108 g~p~~~~~~--d~~~~~~l~~~~-----------------~~n~~~~~~~~~~~~~~i~G~-a~~~vy~ded~~~~i--- 164 (511) T protein:vir:99 108 GNPIQYQDD--DKDVLEAIEAFN-----------------DLNDVESHNRSLGLDLSIYGK-AYELMIRNQDDETRL--- 164 (511) T ss_pred ccCceeecC--chHHHHHHHHHH-----------------hhcCHhHHHHHHHHHHHhcCe-eEEEEEeCCCCceEE--- Confidence 777777643 333334443332 2224666666654 7888996 567888766776543 Q ss_pred eeeCccceeeeeecC--CCceee-eecccc----c----------ccccccceeccCC-------------CCccccccc Q lcl|NC_021302. 149 APRPQSSIAYWNVDR--DGGLIS-IQQWPA----G----------TFGGPGMVVMAPN-------------SMGPAIPVE 198 (484) Q Consensus 149 ~~r~~~~~~~~~~~~--dg~l~~-~~q~~~----~----------~~~~~~~~~~~~~-------------~~~~~lp~~ 198 (484) ...+|+.+. ..+++ .+.++. ++.+.. + .........+... ....++..- T Consensus 165 ~~~~p~~~~-~vyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~v 243 (511) T protein:vir:99 165 YKSDAMSTF-VIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERM 243 (511) T ss_pred EEEccceeE-EEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCcc Confidence 334444432 22222 123221 111000 0 0000011111000 001111111 Q ss_pred ceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHH----HHHHhc Q lcl|NC_021302. 199 QLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEI----ASNYSG 274 (484) Q Consensus 199 k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~----l~~~~~ 274 (484) -++.|+ +|+.|.|.+..+-...---...+..++..++.|..++.++.|... .+..+...+.+. +..+.. T Consensus 244 Pvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~--~~~~~~~~~~~~~~~~~~~~~~ 316 (511) T protein:vir:99 244 PITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN--LDPVEVRKQKEANVLFLEPTVY 316 (511) T ss_pred ceEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcc--cCchhhcccccccceecccccc Confidence 133333 367789999887666655677788888888887666666666322 222222111110 000000 Q ss_pred CCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHH----HHHHHHHHHHHHH Q lcl|NC_021302. 275 GESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASVQAD----TFVQSVQTVADEI 350 (484) Q Consensus 275 g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~evh~~----v~~~~~~aD~~~i 350 (484) ....+.-...+.++++++.......++..++++.+.|...--...++.++.+|.- +.+... .....+..-.+.+ T Consensus 317 ~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~--Sg~Alk~~~~~l~~ka~~k~~~~ 394 (511) T protein:vir:99 317 ADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQ--SGEAMKYKLFGLEQRTKTKEGLF 394 (511) T ss_pred cccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccc--hHHHHHHHHHHHHHHHHHHHHHH Confidence 0011122345778999887766678899999999999877655555554332221 122222 2222333444556 Q ss_pred HHHHHHHHHHHHHHh---CCCC----ccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCCC Q lcl|NC_021302. 351 RDVAQAHVVEDIVDV---NWGE----DEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGPD 421 (484) Q Consensus 351 ~~~ln~qli~~l~~~---Nf~~----~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p~ 421 (484) ...|++ +++.++.+ +... +-.-.++.|. ..+.+..+.++.+.+|+ |+ ++.+.+.+.++. +.++ T Consensus 395 ~~~l~~-~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl~--Gi-----iS~et~l~~l~~v~D~~ 466 (511) T protein:vir:99 395 TKGLRR-RAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GK-----ISQTTLMSLFSFFQDPE 466 (511) T ss_pred HHHHHH-HHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-----CCHHHHHHhCCCCCCHH Confidence 666643 55554443 2111 1112467775 45677888889988885 64 245667777643 2221 Q ss_pred CCccccc----c-cCCC-cCCCccccCCCCccccccccccccccccccccccch Q lcl|NC_021302. 422 PDADDDE----S-TADT-GQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSP 469 (484) Q Consensus 422 ~~e~~~~----~-~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (484) .+-+... . .... ................... .......+ T Consensus 467 ~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~d~~e 511 (511) T protein:vir:99 467 LEVKKIEEDEKESIKKAQKNMYQDPRNINDDEQDDST---------KDSIDKKE 511 (511) T ss_pred HHHHHHHHHHHHHHHHHhhcccccCCCCCCCCCCCCC---------cCcccccC Confidence 1000000 0 0000 0000000000000000000 00000000 No 187 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=96.61 E-value=0.00047 Score=38.82 Aligned_cols=394 Identities=10% Similarity=0.053 Sum_probs=164.7 Q ss_pred CCCCCCCc-----cceeeeecccccchhhhhhhccc-ccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcE Q lcl|NC_021302. 1 MAPKTVAP-----RTERGYVNPLAGFGTFLAQGLDQ-FEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWR 74 (484) Q Consensus 1 ~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~ 74 (484) |.++-..- ...+.++... ..+..|-.. ............ ..+ ..+...-++......+.+-+.. T Consensus 1 l~~~~l~~~i~~~~~~~~r~~~l----~~yy~g~~~il~~~~~~~~~~~----~ki--~~n~~~~ivd~~~~~l~g~~~~ 70 (429) T protein:vir:98 1 MTKDLLSELIQKHRSFNLSYSAY----KQLYEGDHAILQQKQKEQYKPD----NRL--VVNFAKYIVDTFNGYFIGVPVQ 70 (429) T ss_pred CCHHHHHHHHHHHHHHHHHHHHH----HHHhccccccccccccccCCCc----cee--ecchHHHHHHHHhhhhcccCce Confidence 11100000 0000000000 000000000 000000000000 011 1234444455555555565555 Q ss_pred EecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCeeeeeeeeeeCc Q lcl|NC_021302. 75 IRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQ 153 (484) Q Consensus 75 v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~ 153 (484) +.+.+ ++..+.+.+. ...-+|+..+..+ .++..||. +++++|...+|... +...+| T Consensus 71 ~~~~~--~~~~~~l~~~-----------------~~~n~~~~~~~~~~~~~~~~G~-~~~~v~~d~~g~~~---~~~~~p 127 (429) T protein:vir:98 71 TSHEN--KQVSNYLELL-----------------DGYNDQDDNNAELSKICSIYGH-GYELVFNDENAEAG---ITYLTP 127 (429) T ss_pred eecCC--hHHHHHHHHH-----------------HhhcCHhHHHHHHHHHHhhcCe-EEEEEEecCCCcEE---EEEEcc Confidence 54322 2222222222 2223566656555 47888997 56777776677654 334444 Q ss_pred ccee-eeeecCCCceeeeecccccccccc--------cceeccCCCCc--------ccccccceEEEeecCccCccccch Q lcl|NC_021302. 154 SSIA-YWNVDRDGGLISIQQWPAGTFGGP--------GMVVMAPNSMG--------PAIPVEQLVVYTHDMDPGVWTGNS 216 (484) Q Consensus 154 ~~~~-~~~~~~dg~l~~~~q~~~~~~~~~--------~~~~~~~~~~~--------~~lp~~k~l~~~~~~~~~~p~G~g 216 (484) +.+. .|....+..++............. ....+.....+ .++..--++.+ .+|+.|.| T Consensus 128 ~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~-----~n~~~g~s 202 (429) T protein:vir:98 128 LEAFIVYDDSIRQKPLFAVRYFYNKGGVLEGSYSDASNITYFKDGEKGIEIGESEPHPFDGVPMIEY-----VENEERQS 202 (429) T ss_pred cceEEEEeCCCCCceEEEEEEEEecCceEEEEEEeCceEEEEEecCCceEecccccccCCccceEEe-----cCCCCCCC Confidence 4432 111111122221111100000000 00000001111 11111112222 24678999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccC----CceEEEec Q lcl|NC_021302. 217 LLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTA----GEEAGILS 292 (484) Q Consensus 217 ll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~----~~~ie~~~ 292 (484) .+..+....---...+..++..++.|.+++.++.|-. .+++... ++... .++.++. +.+++++. T Consensus 203 d~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~---~~~~~~~-------~~~~~--~~~~~~~~~~~~~~~~~l~ 270 (429) T protein:vir:98 203 LLASVVTLINAFNKAISEKANDVEYFADAYLKILGAE---LDDETLK-------SLRDT--RIINLKDTDAQQLTVEFLQ 270 (429) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC---CCcchhh-------hHhhC--ceeeccCCCCCCcceeEEe Confidence 9998777666667788888888998877666665532 2222222 22212 2344443 34678887 Q ss_pred ccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CC--C Q lcl|NC_021302. 293 PNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYAL-ASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDV-NW--G 368 (484) Q Consensus 293 ~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~-~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~-Nf--~ 368 (484) .......++..++.+.+.|.+.-.+..++.++.|.+.|. -.....-....+..-.+.+...+. ++++-++.+ +. . T Consensus 271 ~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~~~~~~ 349 (429) T protein:vir:98 271 KPDADATQEHLLDRLENLIFRTAMVANISDESFGTASGIALRYRLQAMDNLAKTKERKFMSGMN-RRYKLIASYPTSKIG 349 (429) T ss_pred ecCCHHHHHHHHHHHHHHHHHHhCccccCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhccCCC Confidence 666666788899999999988865555554432211111 111222233334444556666664 366655554 21 1 Q ss_pred Ccc-ccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCCCCCcccccccCCCcCCCccccCCCC Q lcl|NC_021302. 369 EDE-PAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGPDPDADDDESTADTGQDEPETDEPAL 445 (484) Q Consensus 369 ~~~-~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p~~~e~~~~~~~~~~~~~~~~~~~~~ 445 (484) ... .-..++|. ..+.+..+.++.+.+|+ |+ + +.+.+.+.++. +.|+ ++...-..... ...+...... T Consensus 350 ~~d~~~i~v~f~~~~p~~~~~~a~~~~kl~--g~-i----s~et~~~~l~~v~d~~--~E~~ri~~E~~-~~~~~~~~~~ 419 (429) T protein:vir:98 350 PKDWIGIKYKFTRNLPANLLEESQIAGNLA--GI-V----SEETQVGVLSIVENPQ--KEIERKNSDKS-TLISRQAGGL 419 (429) T ss_pred ccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-C----chHHHHHhCCCCCCHH--HHHHHHHHHHH-HHHHHHHhhh Confidence 111 11256675 45567888889998884 54 2 45677788764 3221 11100000000 0000000000 Q ss_pred cccccccccccccc Q lcl|NC_021302. 446 PNTSGTTSTTNAPQ 459 (484) Q Consensus 446 ~~~~~~~~~~~~~~ 459 (484) ......+.. + T Consensus 420 ~~~~~~~~~----~ 429 (429) T protein:vir:98 420 NGQNTTTIL----E 429 (429) T ss_pred cCCCCCCCC----C Confidence 000000000 0 No 188 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=96.54 E-value=0.00053 Score=38.53 Aligned_cols=412 Identities=9% Similarity=0.024 Sum_probs=162.4 Q ss_pred CCCCCCCcccee---eeecccccchhhhhhhcccccccc--cc-cccchHHHHH---HHHh------------------- Q lcl|NC_021302. 1 MAPKTVAPRTER---GYVNPLAGFGTFLAQGLDQFEQVD--EL-RWPNSVYTYT---RMCR------------------- 52 (484) Q Consensus 1 ~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~--~l-r~~~~~~~y~---~m~~------------------- 52 (484) |--+-..|-++- .++.............|.....-. .+ |-.+..+.|+ +++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~k 80 (474) T protein:vir:95 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWR 80 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccc Confidence 332222222211 011111111100000010000000 00 0011111111 1100 Q ss_pred -cchHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcce Q lcl|NC_021302. 53 -EEARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHA 130 (484) Q Consensus 53 -~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s 130 (484) ...-..-++......+.+.+..+... ++++.+.+.+. .+ -+|.+.+..+ .++.-||.+ T Consensus 81 i~~n~~k~Iv~~~~~yl~g~p~~~~~~--~~~~~~~l~~~-----------------~~-n~~~~~~~~l~~~~~~~G~~ 140 (474) T protein:vir:95 81 ITTNFHQNLVDQKVSYVAGKPVTYAHD--DDKVLDVIHQV-----------------LD-TRWDNKLIDILTAASNKGID 140 (474) T ss_pred cccchHHHHHHhhhhhhcccCceeccC--ChHHHHHHHHH-----------------Hh-ccHHHHHHHHHHHHhhCCeE Confidence 01111222223333334444444322 22222222221 12 2466666555 478899995 Q ss_pred eeeEEEeecCCeeeeeeeeeeCcccee-eeeecCCCceeeee-cccccccc------cccceeccCCC------------ Q lcl|NC_021302. 131 VFEQTYFYEGGRFWLKRLAPRPQSSIA-YWNVDRDGGLISIQ-QWPAGTFG------GPGMVVMAPNS------------ 190 (484) Q Consensus 131 ~~Eivw~~~~g~~~~~~l~~r~~~~~~-~~~~~~dg~l~~~~-q~~~~~~~------~~~~~~~~~~~------------ 190 (484) ++++|.-.+|.+. +...+|+.+. .|.....+.++... .+...... ......+.... T Consensus 141 -~~~~~~d~~~~~~---i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~ 216 (474) T protein:vir:95 141 -WLQVYINEDGELK---LFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDE 216 (474) T ss_pred -EEEeeeCCCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccc Confidence 5788876667654 3344454432 12111122322111 11100000 00000000000 Q ss_pred ----Ccccccccc--eEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHH Q lcl|NC_021302. 191 ----MGPAIPVEQ--LVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDE 264 (484) Q Consensus 191 ----~~~~lp~~k--~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~ 264 (484) ...+-+..+ ++.+ .+|+.|.|.+..+-...=--...+..++..++.|..++.++.|-.. + +... T Consensus 217 ~~~~~~~~~~~~~vPvv~~-----~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~---~--~~~~ 286 (474) T protein:vir:95 217 HIQTHFSTGSWERVPFIAF-----KNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEG---E--DLSE 286 (474) T ss_pred cccCcccccCCCccceEEe-----cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCc---c--cccc Confidence 000001111 1222 2467789999886555545567888889899988776656555221 1 1111 Q ss_pred HHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHH-HHH--HHHH Q lcl|NC_021302. 265 LLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASV-QAD--TFVQ 341 (484) Q Consensus 265 l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~ev-h~~--v~~~ 341 (484) ...++.. ...+.++.+.+++++........++.+++.+.+.|...--...++.++.+|+ ..|.. ... -... T Consensus 287 ---~~~~~~~--~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n-~Sg~Alk~~~~~l~~ 360 (474) T protein:vir:95 287 ---FMEGLKY--YKAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSA-TSGIALKFLYTNLNL 360 (474) T ss_pred ---hhhhhhc--cceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccc-cHHHHHHHHHHHHHH Confidence 1222322 2356688899999998777777899999999999988765555555443332 22221 111 1222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCC-CccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCC- Q lcl|NC_021302. 342 SVQTVADEIRDVAQAHVVEDIVDVNWG-EDEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLP- 418 (484) Q Consensus 342 ~~~aD~~~i~~~ln~qli~~l~~~Nf~-~~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp- 418 (484) .+..-.+.+...+. ++++.++.+.-. .+..-..+.|. ..+.+..+.++.+ ++.|+. +++.+.+.++.- T Consensus 361 k~~~~~~~~~~~l~-~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a~~~---~~~gii-----S~et~~~~lp~v~ 431 (474) T protein:vir:95 361 KANKLKNKANVALQ-ELMQFILDFNKIKLDAKEIEITFNFNVMVNDLEQSQIG---AQSQYL-----SKETLVRHHPWVD 431 (474) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHhCCCcccceeeEEecCCCccCHHHHHHHH---HHcCCC-----ChHHHHHhCCCCC Confidence 22233345555553 466666665311 11122356664 3445565555554 446752 456677777642 Q ss_pred CCCCCcccccccCCCcCCCccccCCCCcccccccccccccccc Q lcl|NC_021302. 419 GPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQAR 461 (484) Q Consensus 419 ~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (484) .+..+-+....................+..........+.+.. T Consensus 432 D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:95 432 DPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENNQSK 474 (474) T ss_pred CHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCccccC Confidence 2211100000000000000000000000000000000111111 No 189 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=96.54 E-value=0.00053 Score=38.53 Aligned_cols=412 Identities=9% Similarity=0.024 Sum_probs=162.4 Q ss_pred CCCCCCCcccee---eeecccccchhhhhhhcccccccc--cc-cccchHHHHH---HHHh------------------- Q lcl|NC_021302. 1 MAPKTVAPRTER---GYVNPLAGFGTFLAQGLDQFEQVD--EL-RWPNSVYTYT---RMCR------------------- 52 (484) Q Consensus 1 ~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~--~l-r~~~~~~~y~---~m~~------------------- 52 (484) |--+-..|-++- .++.............|.....-. .+ |-.+..+.|+ +++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~k 80 (474) T protein:vir:96 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWR 80 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccc Confidence 332222222211 011111111100000010000000 00 0011111111 1100 Q ss_pred -cchHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcce Q lcl|NC_021302. 53 -EEARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHA 130 (484) Q Consensus 53 -~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s 130 (484) ...-..-++......+.+.+..+... ++++.+.+.+. .+ -+|.+.+..+ .++.-||.+ T Consensus 81 i~~n~~k~Iv~~~~~yl~g~p~~~~~~--~~~~~~~l~~~-----------------~~-n~~~~~~~~l~~~~~~~G~~ 140 (474) T protein:vir:96 81 ITTNFHQNLVDQKVSYVAGKPVTYAHD--DDKVLDVIHQV-----------------LD-TRWDNKLIDILTAASNKGID 140 (474) T ss_pred cccchHHHHHHhhhhhhcccCceeccC--ChHHHHHHHHH-----------------Hh-ccHHHHHHHHHHHHhhCCeE Confidence 01111222223333334444444322 22222222221 12 2466666555 478899995 Q ss_pred eeeEEEeecCCeeeeeeeeeeCcccee-eeeecCCCceeeee-cccccccc------cccceeccCCC------------ Q lcl|NC_021302. 131 VFEQTYFYEGGRFWLKRLAPRPQSSIA-YWNVDRDGGLISIQ-QWPAGTFG------GPGMVVMAPNS------------ 190 (484) Q Consensus 131 ~~Eivw~~~~g~~~~~~l~~r~~~~~~-~~~~~~dg~l~~~~-q~~~~~~~------~~~~~~~~~~~------------ 190 (484) ++++|.-.+|.+. +...+|+.+. .|.....+.++... .+...... ......+.... T Consensus 141 -~~~~~~d~~~~~~---i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~ 216 (474) T protein:vir:96 141 -WLQVYINEDGELK---LFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDE 216 (474) T ss_pred -EEEeeeCCCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccc Confidence 5788876667654 3344454432 12111122322111 11100000 00000000000 Q ss_pred ----Ccccccccc--eEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHH Q lcl|NC_021302. 191 ----MGPAIPVEQ--LVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDE 264 (484) Q Consensus 191 ----~~~~lp~~k--~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~ 264 (484) ...+-+..+ ++.+ .+|+.|.|.+..+-...=--...+..++..++.|..++.++.|-.. + +... T Consensus 217 ~~~~~~~~~~~~~vPvv~~-----~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~---~--~~~~ 286 (474) T protein:vir:96 217 HIQTHFSTGSWERVPFIAF-----KNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEG---E--DLSE 286 (474) T ss_pred cccCcccccCCCccceEEe-----cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCc---c--cccc Confidence 000001111 1222 2467789999886555545567888889899988776656555221 1 1111 Q ss_pred HHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHH-HHH--HHHH Q lcl|NC_021302. 265 LLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASV-QAD--TFVQ 341 (484) Q Consensus 265 l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~ev-h~~--v~~~ 341 (484) ...++.. ...+.++.+.+++++........++.+++.+.+.|...--...++.++.+|+ ..|.. ... -... T Consensus 287 ---~~~~~~~--~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n-~Sg~Alk~~~~~l~~ 360 (474) T protein:vir:96 287 ---FMEGLKY--YKAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFGSA-TSGIALKFLYTNLNL 360 (474) T ss_pred ---hhhhhhc--cceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccccc-cHHHHHHHHHHHHHH Confidence 1222322 2356688899999998777777899999999999988765555555443332 22221 111 1222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCC-CccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCC- Q lcl|NC_021302. 342 SVQTVADEIRDVAQAHVVEDIVDVNWG-EDEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLP- 418 (484) Q Consensus 342 ~~~aD~~~i~~~ln~qli~~l~~~Nf~-~~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp- 418 (484) .+..-.+.+...+. ++++.++.+.-. .+..-..+.|. ..+.+..+.++.+ ++.|+. +++.+.+.++.- T Consensus 361 k~~~~~~~~~~~l~-~~~~~i~~~~g~~~d~~~i~i~f~~~~p~~~~e~a~~~---~~~gii-----S~et~~~~lp~v~ 431 (474) T protein:vir:96 361 KANKLKNKANVALQ-ELMQFILDFNKIKLDAKEIEITFNFNVMVNDLEQSQIG---AQSQYL-----SKETLVRHHPWVD 431 (474) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHhCCCcccceeeEEecCCCccCHHHHHHHH---HHcCCC-----ChHHHHHhCCCCC Confidence 22233345555553 466666665311 11122356664 3445565555554 446752 456677777642 Q ss_pred CCCCCcccccccCCCcCCCccccCCCCcccccccccccccccc Q lcl|NC_021302. 419 GPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQAR 461 (484) Q Consensus 419 ~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (484) .+..+-+....................+..........+.+.. T Consensus 432 D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:96 432 DPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENNQSK 474 (474) T ss_pred CHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCccccC Confidence 2211100000000000000000000000000000000111111 No 190 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=96.50 E-value=0.00056 Score=38.40 Aligned_cols=401 Identities=7% Similarity=0.018 Sum_probs=163.9 Q ss_pred Cccceeeeecccccchh--hhhhhcccccccccccccchHHHHHH---H-------------HhcchHHHHHHHHHHHHh Q lcl|NC_021302. 7 APRTERGYVNPLAGFGT--FLAQGLDQFEQVDELRWPNSVYTYTR---M-------------CREEARIASVLRAIGLPI 68 (484) Q Consensus 7 ~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~lr~~~~~~~y~~---m-------------~~~D~~v~s~l~~r~~~v 68 (484) -...+.++..-..+... -.+..+.........|..+..+.|+- + ....+...-++.+....+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l 80 (452) T protein:vir:36 1 MKYKPPKLMTFSKDEPITVEVVTKFMEKHKLEVARYEYLKNMYLGIMAIDDEPAKDSWKPDNRLAVNFTKYIVDTFTGYF 80 (452) T ss_pred CcccCceeEEcCCccCCCHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCccccccCccceeecchHHHHHHHHhhhh Confidence 11112222111111110 00000000000000000111111110 0 001233344444444445 Q ss_pred hCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCeeeeee Q lcl|NC_021302. 69 RRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRFWLKR 147 (484) Q Consensus 69 ~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~~~~~ 147 (484) .+-+..+.+.+ +++.+++. +....-+|+..+..+ .++..||.+ ++++|.-.+|...+ T Consensus 81 ~g~~~~~~~~d--~~~~~~l~-----------------~~~~~n~~~~~~~~~~~~~~~~G~~-~~~v~~d~~g~~~i-- 138 (452) T protein:vir:36 81 NGIPVKKSHSD--KEILTKLQ-----------------EFDNLNDMEDEESELAKMACIYGRA-FEFLYQDEDTQTNV-- 138 (452) T ss_pred cccCceeecCC--hhHHHHHH-----------------HHHhhcChhHHHHHHHHHHHhcCeE-EEEEEecCCCeeEE-- Confidence 55555554322 22222222 222333577766665 478999975 57777655666543 Q ss_pred eeeeCccceeeeeecC--CCceeeeecccccccc--------cccceeccCCCC------cc--cccccceEEEeecCcc Q lcl|NC_021302. 148 LAPRPQSSIAYWNVDR--DGGLISIQQWPAGTFG--------GPGMVVMAPNSM------GP--AIPVEQLVVYTHDMDP 209 (484) Q Consensus 148 l~~r~~~~~~~~~~~~--dg~l~~~~q~~~~~~~--------~~~~~~~~~~~~------~~--~lp~~k~l~~~~~~~~ 209 (484) ...+|+.+. ..+++ ...++........... ....+.+..... +. .++.--++.| . T Consensus 139 -~~~~p~~~~-~v~d~~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~iPvv~~-----~ 211 (452) T protein:vir:36 139 -VYNSPENMF-MVYDDTVKQEPLFAVRYGVDEDKKLQGEVYTLLETIKISGENDEISFGEGTYNPYPDLPVVEF-----Y 211 (452) T ss_pred -EEEcccceE-EEEcCCCCCceEEEEEEEEecCceEEEEEEecCeEEEEEEcCCceEEecceeccCCcccEEEe-----c Confidence 333444432 11222 1222211100000000 001111110000 11 1111112332 2 Q ss_pred CccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCC---- Q lcl|NC_021302. 210 GVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAG---- 285 (484) Q Consensus 210 ~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~---- 285 (484) +|+.|.|.+..+....=--...+..++..++.+.+++.++.|- ..+++.... +..+ ..+.++.+ T Consensus 212 n~~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~---~~~~~~~~~-------~~~~--~~~~~~~~~~~~ 279 (452) T protein:vir:36 212 FNEERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGA---AVEEEDLKN-------IRSN--RVINYYADGEGK 279 (452) T ss_pred CCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecC---CcCchhhhh-------hhhc--ceEEecCCCCcc Confidence 3567888888766655555677788888899887765565552 223332222 2111 23344432 Q ss_pred -ceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 286 -EEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALA-SVQADTFVQSVQTVADEIRDVAQAHVVEDIV 363 (484) Q Consensus 286 -~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~-evh~~v~~~~~~aD~~~i~~~ln~qli~~l~ 363 (484) .+++++........++..++.+.+.|...--+..++.++.|.+.|.| +....-....+..-.+.+...+. ++++-++ T Consensus 280 ~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~ 358 (452) T protein:vir:36 280 NVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLN-SRYKLFC 358 (452) T ss_pred CCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccccCcccccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH Confidence 36888876666677889999999999877555545444333221111 11122233333444455566664 4666555 Q ss_pred HhC--CCCc--cccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCC-CCCCCcccccccC-CCcC- Q lcl|NC_021302. 364 DVN--WGED--EPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLP-GPDPDADDDESTA-DTGQ- 435 (484) Q Consensus 364 ~~N--f~~~--~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp-~p~~~e~~~~~~~-~~~~- 435 (484) .+. .+.. ..-..+.|. ....+..+.++.+.+++ |+ ++.+.+.+.++.- .+. ++...-.. .... T Consensus 359 ~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~a~~~~k~~--g~-----iS~et~~~~~~~~~d~~--~E~~ri~~E~~~~~ 429 (452) T protein:vir:36 359 ELSTNVSNKDSWKDIEYTFTRNEPKDIKEQAETANILM--GI-----TSQETALSVISVIPDVQ--AEMEKIKKEEASTA 429 (452) T ss_pred HHHhccCCccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-----CChHHHHHhCCCCCCHH--HHHHHHHHHHHHHH Confidence 543 1111 112356675 45567888899998874 54 2456777777642 211 11100000 0000 Q ss_pred CCccccCCCCcccccccccccccc Q lcl|NC_021302. 436 DEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 436 ~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) .......+........... ...+ T Consensus 430 ~~~~~~~~~~~~~~~~~~~-~~~e 452 (452) T protein:vir:36 430 IFDKDKQPSEKGTDTVVSE-TNEE 452 (452) T ss_pred HHHhhccCCCCcccccCcc-ccCC Confidence 0000000000000000000 0111 No 191 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=96.46 E-value=0.0006 Score=38.23 Aligned_cols=438 Identities=13% Similarity=0.076 Sum_probs=187.4 Q ss_pred CCCCCCCccce---eeeeccc--ccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEE Q lcl|NC_021302. 1 MAPKTVAPRTE---RGYVNPL--AGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRI 75 (484) Q Consensus 1 ~~~~~~~~~~~---~~~~~~~--~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v 75 (484) ..++.+++..+ -|.+.-. +.+|+. .++.+ ........|+.|++|+ .++.|-++++.....+.-.+-.- T Consensus 15 ~~~~~~s~~~~~~~dg~~~~~~~~~~g~~--~~~e~----~~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~d~~~ 87 (537) T protein:vir:10 15 KVPKGPSFVQKDSLDGSQPIVGGGYFGYS--VDFDG----TIRNDHELITRYREMV-LNPECDSAVDDVVNETICGNFDD 87 (537) T ss_pred ccccCCcccCCCcccccceeecccccccc--ccccc----ccchHHHHHHHHHHHh-hccchhhHHHHhhcceeEecCCC Confidence 33333333221 1111111 112211 11111 1112356799999998 59999999998887665443221 Q ss_pred ec---CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcce-----------eeeEEEeecCC Q lcl|NC_021302. 76 RP---NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHA-----------VFEQTYFYEGG 141 (484) Q Consensus 76 ~p---~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s-----------~~Eivw~~~~g 141 (484) .| +=+.-+..+.+.+-+.. -|+.++ .+|+.--+||. .+.++=...+. T Consensus 88 ~pV~i~Ld~~~~s~~iK~kI~e------------------EF~~Il-~ll~F~~~~~e~fR~WYVDgRi~fhKiid~k~p 148 (537) T protein:vir:10 88 VPISIDLHNLKQSEKIKKLIRS------------------EFDEIL-RLLDFDNRAYEIFRRWYVDGRLFFHKVIDPKKP 148 (537) T ss_pred ceEEEEecccccchHHHHHHHH------------------HHHHHH-HHhccchhhhHHHhhheeeeEEEEEEEEeCCCc Confidence 11 00111111222222211 133333 33443344443 33333332333 Q ss_pred eeeeeeeeeeCccceeeeee---cCCCceee------eeccccccc-ccccceeccCCCCcccccccceEEEeecC--cc Q lcl|NC_021302. 142 RFWLKRLAPRPQSSIAYWNV---DRDGGLIS------IQQWPAGTF-GGPGMVVMAPNSMGPAIPVEQLVVYTHDM--DP 209 (484) Q Consensus 142 ~~~~~~l~~r~~~~~~~~~~---~~dg~l~~------~~q~~~~~~-~~~~~~~~~~~~~~~~lp~~k~l~~~~~~--~~ 209 (484) .-.+.+|...+|+.+.+.+. ..+..... +.+.....+ ..+... ...+..++.||. ..|+|.|.. .. T Consensus 149 k~GI~ELr~lDPr~i~~vR~i~~~~~~~~~~~~~~~~v~~~~~eyf~ynp~g~-~~~~~~~vkI~~-dAI~y~hSGl~d~ 226 (537) T protein:vir:10 149 RQGLVELRYVDPRKIRKVTEYEAKRPEALRTQDLNQQLTQQSASYFLYNPKGL-KNSTNQGMKIAP-DSIAYCHSGIQDL 226 (537) T ss_pred cccceeeeeeCCccceeeEeecccCCccceEEecceeeeecccceeeeccccc-cccCCCceeccH-hheeeecccceeC Confidence 33455666777777755433 11212111 111111011 111111 233566777877 678888843 45 Q ss_pred CccccchhHHHHHHHHHHHHHHHHHHHHH-HHHhcCCcc-eEEecCCCCCCHHHHHHHHHHHHHHhcC----CceEEE-- Q lcl|NC_021302. 210 GVWTGNSLLRPAYKNWKLKDELIRIEAAA-IRRHGIGVP-YLKGNEADSEDDDRMDELLEIASNYSGG----ESAGLA-- 281 (484) Q Consensus 210 ~~p~G~gll~~~~~~~~~K~~~~~~w~~f-~Er~~~G~P-~~~gk~~~~~~~~~~~~l~~~l~~~~~g----~~a~~v-- 281 (484) ++.+..|.|.++..++==-++..-..+.| +-|- |-. +.-...+.-...+.-..|.+++..+++. .++|-+ T Consensus 227 n~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRA--PeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~d 304 (537) T protein:vir:10 227 NKNMVLSHLHKAIKAVNQLRMIEDSLVIYRLSRA--PERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKD 304 (537) T ss_pred CCCeeeeeehhhhHHHHhhHHHHhhHHHHhhhcc--ccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecc Confidence 66888999999998886555555555544 2221 110 1111223334444445666776666542 122222 Q ss_pred ------------cc-----CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhccccccc-c--hhhHHHHHHH-HH Q lcl|NC_021302. 282 ------------LT-----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGG-S--YALASVQADT-FV 340 (484) Q Consensus 282 ------------ip-----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gG-s--~A~~evh~~v-~~ 340 (484) +| .|++|.++.++.+. .-.+=++|..+.+-+++..+.--.+.++| + ++..-+..++ |. T Consensus 305 drk~msMlEDyWLPRReGgrgTEItTLpGgqnl-gem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~ 383 (537) T protein:vir:10 305 DKKFMSMLEDFWLPRREGGRGTEISTLPGGQNL-GELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQ 383 (537) T ss_pred cchhhhhhhhhcccccCCCcccceeeccccCCc-ChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHH Confidence 22 47889888765332 33445789999999997765533333332 1 2222233343 33 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhC------CCCccccceEEecCCC--C---cHHHHHHHHHHHHhcCcccCCcccHH Q lcl|NC_021302. 341 QSVQTVADEIRDVAQAHVVEDIVDVN------WGEDEPAPLLVFDEIG--S---RQDATAAALQMLVNAGLLTPDPRLEA 409 (484) Q Consensus 341 ~~~~aD~~~i~~~ln~qli~~l~~~N------f~~~~~~P~~~~~~~~--~---~~~~~ae~~~~L~~~G~~~~~~~~~~ 409 (484) ..+......++..|..-|-..|+--+ |..-...-+|.|.... . +.+.+.+++..|..+-=.+....+.+ T Consensus 384 KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~d 463 (537) T protein:vir:10 384 KFIARLRKRFSELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSAN 463 (537) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchH Confidence 44555566666666554444444333 2111122244442211 1 22334456655554432333455778 Q ss_pred HHHHHh-CCCCCCCCcc--------cccccCCCcCCC-----ccccCCCCccccccccccccccccccccccch Q lcl|NC_021302. 410 FLRDAA-GLPGPDPDAD--------DDESTADTGQDE-----PETDEPALPNTSGTTSTTNAPQARKRPRGRSP 469 (484) Q Consensus 410 ~i~e~~-glp~p~~~e~--------~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (484) |+++.+ .+...+-.+. ..+.-.++.... .....+.++.........++.++...|-+... T Consensus 464 yi~k~ILr~tDeeI~~~~k~I~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 537 (537) T protein:vir:10 464 YIRTKVLKQTESEIKEIDKEIKQEIADGVIMDPQAMQAMEMGIGDEEPVPEGGEEPQTDPNSAVSPADQKRGEL 537 (537) T ss_pred HHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCcccccccccCCCCcccCCCCCCCcccCCccCCCCCCccCCCC Confidence 876553 3321110000 000000000000 00001111111111111112222222222222 No 192 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=96.44 E-value=0.00062 Score=38.16 Aligned_cols=426 Identities=12% Similarity=0.050 Sum_probs=158.9 Q ss_pred CCCccceeeeecccccch-------hh----hhhhcccccccccccccchHHHHHHHHhcch-HHHH----HHHH-HHHH Q lcl|NC_021302. 5 TVAPRTERGYVNPLAGFG-------TF----LAQGLDQFEQVDELRWPNSVYTYTRMCREEA-RIAS----VLRA-IGLP 67 (484) Q Consensus 5 ~~~~~~~~~~~~~~~~~~-------~~----~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~-~v~s----~l~~-r~~~ 67 (484) --.|.-+| .+.|...+. .. ....+...-.....|..+..+.|+ - +.|. .+.- .+.. .+.+ T Consensus 1 ~~~~~~~~-~~~~~~~~~~p~~~~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~-G-~~~~~~~~~~~~~~~~~~~~~~ 77 (501) T protein:vir:25 1 MTVPVDVI-ADAPAADVEFPEDSMSREQLGALVADMWRLHISERQWLDRIYEYTK-G-LRGRPEVPEGASDEVKELAKLS 77 (501) T ss_pred Ccccchhh-hccCcccccCCcccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-c-CCCchhccccCChhhhhhHhhh Confidence 00111111 111111111 00 000000000000000001111111 0 0000 0000 0000 0000 Q ss_pred hhCCCcEEecCCCCHHHHHHHHHHHHhh---hccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCee Q lcl|NC_021302. 68 IRRTDWRIRPNGARPEVVEHVAACLGLP---VEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRF 143 (484) Q Consensus 68 v~~~~~~v~p~~~~~e~~~~~~~~l~~~---~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~ 143 (484) +. +| ...+++..++.+... ..+.+......+....-+|+....++ .+|..||.| ++++|.-+++. T Consensus 78 v~--n~-------~~~ivd~~a~~l~~~gf~~~d~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~a-y~~v~~de~~~- 146 (501) T protein:vir:25 78 VK--NV-------LSLVRDSFAQNLSVVGYRNALAKENDPAWEMWQRNRMDARQAEVHRPALTYGAS-YVTVTPTDEGP- 146 (501) T ss_pred hc--Ch-------HHHHHHHHHhhhcccceecCCccchHHHHHHHHhcChhHHHHHHHHHHhhcCce-EEEEecCCCCC- Confidence 00 11 011111111111000 11111222233333444577777665 589999996 58888766652 Q ss_pred eeeeeeeeCccceeeeeecC--CCceee-ee----cccccccc------cccceecc----------------------- Q lcl|NC_021302. 144 WLKRLAPRPQSSIAYWNVDR--DGGLIS-IQ----QWPAGTFG------GPGMVVMA----------------------- 187 (484) Q Consensus 144 ~~~~l~~r~~~~~~~~~~~~--dg~l~~-~~----q~~~~~~~------~~~~~~~~----------------------- 187 (484) .+..++|+...-...|. +...+. ++ ....+... ........ T Consensus 147 ---~i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (501) T protein:vir:25 147 ---VFRTRSPRQILAVYADPSVDAWPQYALETWVAQKDAKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVRE 223 (501) T ss_pred ---eEEEeccccEEEEEecCCCCcceeEEEEEEeeccccCcceeEEEecCeeEEEEecCceeeeeccccccccccccccc Confidence 34455665542111111 111111 00 00000000 00000000 Q ss_pred ------CCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHH Q lcl|NC_021302. 188 ------PNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDR 261 (484) Q Consensus 188 ------~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~ 261 (484) ......+++..-++.|.+... .+++|.|-+..+-...=-=...+...+...|-|.+++.+++|-.. ++.+ T Consensus 224 ~~~~~~~~~~~~~~~~vPiv~f~N~~~-~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G~~~---~~~~ 299 (501) T protein:vir:25 224 VTDVIEHGATFEGKPVCPVVRFVNGRD-ADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVISGWTG---SKAE 299 (501) T ss_pred cccccccccccCCccceeeEeccCccc-cCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHhCCCC---Cccc Confidence 000112222223444554443 367888877765433333344555667788877766666666322 1111 Q ss_pred HHHHHHHHHHHhcCCceEEEccCCceEEEeccc-CCchhHHHHHHHHHHHHHHHHhhhhhccccccc--chhhHHHHHHH Q lcl|NC_021302. 262 MDELLEIASNYSGGESAGLALTAGEEAGILSPN-GTPLDPRRAIEYHDHQMALVALAHFLNLDGKGG--SYALASVQADT 338 (484) Q Consensus 262 ~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~-~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gG--s~A~~evh~~v 338 (484) . .++..+ ...++| |.+.++.+-. .+...|...++.+-.+|++.-.-.....++.++ |-..-.....- T Consensus 300 ~-------~~~~~~--~i~~~~-~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~Al~~~~~~ 369 (501) T protein:vir:25 300 V-------LKASAL--RVWTFE-DPEVKAQAFPPASVEPYNLILEEMLQHVAMVAQISPAQVTGKMINVSAEALAAAEAN 369 (501) T ss_pred h-------hhhccc--ceeccC-CCCceEEEecccChHHHHHHHHHHHHHHHhhcCCChhhhccccCChHHHHHHHHHHH Confidence 1 112112 333444 4455555533 233467777777778887764333222222111 21111223344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc---ccceEEecCC-CCcHHHHHHHHHHHHhcCcccCCcccHHHHHHH Q lcl|NC_021302. 339 FVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDE---PAPLLVFDEI-GSRQDATAAALQMLVNAGLLTPDPRLEAFLRDA 414 (484) Q Consensus 339 ~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~---~~P~~~~~~~-~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~ 414 (484) .....+.-.+.+...++ ++++-++.+...... .-..+.|... .....+.++++.+|+.+|+ + .+.-+.+. T Consensus 370 l~~ka~~k~~~f~~~l~-~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gi--s---~et~~~~~ 443 (501) T protein:vir:25 370 QQRKLAAKRESFGESWE-QLLRLAAEMDDDPDTAADSGAEVLWRDTEARSFGAVVDGITKLASAGI--P---IEHLLSMV 443 (501) T ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCC--C---HHHHHHHc Confidence 44455556677777775 477766666643321 1235667543 4577899999999999885 2 12334555 Q ss_pred hCCCCCCCCc--ccc-cccCCC--cCCCccccCCCCccccccccccccccccccccccc Q lcl|NC_021302. 415 AGLPGPDPDA--DDD-ESTADT--GQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRS 468 (484) Q Consensus 415 ~glp~p~~~e--~~~-~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 468 (484) .|+..++-.+ +.. ...+.+ .+.......+.. ...+........+....+.+.+ T Consensus 444 ~g~~~~~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~g~ 501 (501) T protein:vir:25 444 PGMTQQTIQAIKDSLRGGEVKSLVDKLLSNEPAPVP-PPPPQAAAQALNEGGVNGNGGA 501 (501) T ss_pred CCCCHHHHHHHHHHHHHHhHHHHHHHhhccCcCCCC-CCCCCCCccccccccCCCCCCC Confidence 7886432110 000 000000 000000000000 0011111111112222222222 No 193 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=96.40 E-value=0.00066 Score=38.02 Aligned_cols=435 Identities=10% Similarity=0.019 Sum_probs=163.8 Q ss_pred CCCCCC----Cccceee--eecccccchhhhhhhcccccccccccccchHHHHHHHHhc-chHHHHHHHHHHHHhhCCCc Q lcl|NC_021302. 1 MAPKTV----APRTERG--YVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCRE-EARIASVLRAIGLPIRRTDW 73 (484) Q Consensus 1 ~~~~~~----~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~-D~~v~s~l~~r~~~v~~~~~ 73 (484) |= |.+ ...-+.+ .+-+...-.. +.+ ...+. . ....++.-.++... ..+-...+++...-..+... T Consensus 1 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~--~~~e~---~-~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~ 72 (512) T protein:vir:97 1 ML-KANEFETDTDLRENRNYLFNDEANVV-YTY--DGTES---D-LLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTK 72 (512) T ss_pred Cc-cceeccCceeeeeCceeeeccccccc-ccc--Cchhh---h-hhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCc Confidence 00 000 0000000 0000000000 000 00000 0 00000001011100 01111222333333222222 Q ss_pred EEecC---------------CCCHHHHHHHHHHH-Hhh----hccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceee Q lcl|NC_021302. 74 RIRPN---------------GARPEVVEHVAACL-GLP----VEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVF 132 (484) Q Consensus 74 ~v~p~---------------~~~~e~~~~~~~~l-~~~----~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~ 132 (484) .+... +=...+++..+..+ ..+ ...++.+....+....-+|+.....+. ++..||.+ . T Consensus 73 i~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a-y 151 (512) T protein:vir:97 73 NLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKA-Y 151 (512) T ss_pred cccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeccCChHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeE-E Confidence 11100 00012333332222 111 122222333444444556877777664 78999974 6 Q ss_pred eEEEeecCCeeeeeeeeeeCccceeeeeecCC--Cceeeeecc-----ccc----------ccccccceeccCCC----- Q lcl|NC_021302. 133 EQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRD--GGLISIQQW-----PAG----------TFGGPGMVVMAPNS----- 190 (484) Q Consensus 133 Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~d--g~l~~~~q~-----~~~----------~~~~~~~~~~~~~~----- 190 (484) +++|.-.+|.+.+ ...+|+.+. ..+++. +.++...+. ..+ .........+.... T Consensus 152 ~~vy~ded~~~~i---~~~~p~~~~-~iyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~ 227 (512) T protein:vir:97 152 ELMIRNQDDETRL---YKSDAMSTF-VIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLK 227 (512) T ss_pred EEEEeCCCCceEE---EEEcccceE-EEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccc Confidence 7888766676544 334444432 122221 222211100 000 00000111111000 Q ss_pred --------CcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHH Q lcl|NC_021302. 191 --------MGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRM 262 (484) Q Consensus 191 --------~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~ 262 (484) ...++..--++.|+ .|+.|.|.+..+-...=--...+..++..++.+..++.++.|.... +.++. T Consensus 228 ~~~~~~~~~~~~~g~vPvv~~~-----nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~--~~~~~ 300 (512) T protein:vir:97 228 LTPRENGFESHSFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL--DPVEV 300 (512) T ss_pred ccccccccccccCcccceEeec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccC--Cchhh Confidence 00111111123322 3577899998876666566677888888889876666666653322 22222 Q ss_pred HHHHHH-HHHHh----cCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH-HH- Q lcl|NC_021302. 263 DELLEI-ASNYS----GGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALAS-VQ- 335 (484) Q Consensus 263 ~~l~~~-l~~~~----~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~e-vh- 335 (484) ...... +..+. .+.........+.+++++........++.+++++.+.|.+.-....++.++.+|. ..|. .. T Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn-~Sg~Al~~ 379 (512) T protein:vir:97 301 RKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGT-QSGEAMKY 379 (512) T ss_pred hhhhhcccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCccccccc-chHHHHHH Confidence 211110 00000 0101111234567788888766667789999999999987755555555433222 1122 11 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-C-CC-----CccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcc Q lcl|NC_021302. 336 -ADTFVQSVQTVADEIRDVAQAHVVEDIVDV-N-WG-----EDEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPR 406 (484) Q Consensus 336 -~~v~~~~~~aD~~~i~~~ln~qli~~l~~~-N-f~-----~~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~ 406 (484) ..-.......-.+.+...|++ +++.++.+ + .+ .+-.-.++.|. ..+.+..+.++++.+|+ |+ + T Consensus 380 ~~~~l~~ka~~k~~~f~~~l~~-~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl~--gi-i---- 451 (512) T protein:vir:97 380 KLFGLEQRTKTKEGLFTKGLRR-RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GK-I---- 451 (512) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-C---- Confidence 122222333444555566643 55554443 1 11 11112366775 45677888889998885 64 2 Q ss_pred cHHHHHHHhCC-CCCCCCcccccccCCCcCCCccccCCCCcc--ccccccccccccccccccccch Q lcl|NC_021302. 407 LEAFLRDAAGL-PGPDPDADDDESTADTGQDEPETDEPALPN--TSGTTSTTNAPQARKRPRGRSP 469 (484) Q Consensus 407 ~~~~i~e~~gl-p~p~~~e~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~ 469 (484) +.+.+.+.++. +.|..+-+-...................+. ..+.+.......+.+ .. T Consensus 452 S~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~ 512 (512) T protein:vir:97 452 SQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQDDDTKDTVDK-----KE 512 (512) T ss_pred chHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccCCCCCCCCCCCCCCccccccc-----cC Confidence 45667777664 322111000000000000000000000000 000000000000000 00 No 194 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=96.35 E-value=0.00072 Score=37.82 Aligned_cols=424 Identities=9% Similarity=-0.025 Sum_probs=159.3 Q ss_pred CCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCC------------ Q lcl|NC_021302. 4 KTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRT------------ 71 (484) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~------------ 71 (484) -+...+ ..++..+..-.+- .+. .....+.--+..+ +.-+.+...+-..-+.+.+.--.+. T Consensus 1 ~~~~~~---~~~~~~~~~~~~~-~~~--~~~~~~~~~~~~i--~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~ 72 (481) T protein:vir:10 1 MTVYTI---NNINTKFSPLAND-DFV--VSDLAELLKEENL--RNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQ 72 (481) T ss_pred CeeEee---ehhchhcccccCc-eee--eecchhhcCHHHH--HHHHHHHHHHHHHHHHHHHHHhcCCCcccccCccccc Confidence 111111 1111111110000 000 0000000000000 0001000001111111111111111 Q ss_pred ------CcEEecCCCCHHHHHHHHHHHH-hh----hccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeec Q lcl|NC_021302. 72 ------DWRIRPNGARPEVVEHVAACLG-LP----VEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYE 139 (484) Q Consensus 72 ------~~~v~p~~~~~e~~~~~~~~l~-~~----~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~ 139 (484) +.++. .+=...+++..+..+. .. ...+..+....++...-+|+..+..+. ++..+|.+ ++++|... T Consensus 73 ~~~~~~~~ki~-~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~-~~~~~~d~ 150 (481) T protein:vir:10 73 KYGDKADHRAV-HNYAKYVSRFIVGYLTGNPITITHQDNQTNDKIIELNDLNDADEVNSDLALNLSIYGRA-YEIVYRDF 150 (481) T ss_pred cccccccceee-cchHHHHHHHHHhhhccCCceEecCChhHHHHHHHHHHhcChhHHHHHHHHHHHhcCeE-EEEEEeCC Confidence 11110 0001223333333221 00 122333444455555556877777664 79999965 45777666 Q ss_pred CCeeeeeeeeeeCccceeeeeecCC--Cceeee-ecc---ccc--------ccccccceeccCCCCc------ccccccc Q lcl|NC_021302. 140 GGRFWLKRLAPRPQSSIAYWNVDRD--GGLISI-QQW---PAG--------TFGGPGMVVMAPNSMG------PAIPVEQ 199 (484) Q Consensus 140 ~g~~~~~~l~~r~~~~~~~~~~~~d--g~l~~~-~q~---~~~--------~~~~~~~~~~~~~~~~------~~lp~~k 199 (484) +|... +...+|+.+. ..+++. ++++.. +.+ ... .........+...+.+ .+-+..+ T Consensus 151 dg~~~---i~~~~p~~~~-~v~d~~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~g~ 226 (481) T protein:vir:10 151 EDRDT---FKVLDPKSTF-VVYDQTLDKKVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGTYHRVEEVEHYYND 226 (481) T ss_pred CCeEE---EEEEcccceE-EEEcCCCCCceEEEEEEEEEeeCCCceEEEEEEEecCeEEEEEecCCceeecccccccCCc Confidence 67654 4444555442 122221 222211 000 000 0001111111111111 0111111 Q ss_pred --eEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCc Q lcl|NC_021302. 200 --LVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGES 277 (484) Q Consensus 200 --~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~ 277 (484) ++.+. +|+.|.|.+..+....---...+..++..++.| ..|+++.+-....++++...+.....-...... T Consensus 227 vPvv~~~-----n~~~g~~~~~~v~~lida~~~~~s~~~~~~~~~--~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 299 (481) T protein:vir:10 227 VPIIEYL-----NDQFKQGDFENVIALIDLYDSAQSDTANYMTDL--NDAMLAIIGNVDLDSEDAKAFRDANMIHLEPGT 299 (481) T ss_pred eeEEEee-----cCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHh--cCceeEeecCcCCCccchhhhhhccceeccccc Confidence 22222 357788888765433333344566777777876 456555442223333332222211000000000 Q ss_pred eEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH---HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 278 AGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALAS---VQADTFVQSVQTVADEIRDVA 354 (484) Q Consensus 278 a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~e---vh~~v~~~~~~aD~~~i~~~l 354 (484) .......+.+++++........++..++.+.+.|...--...++.++.+|. ..|. ....-....+..-.+.+...+ T Consensus 300 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l 378 (481) T protein:vir:10 300 NANGSEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGV-QSGESMKYKLFGLEQVRAIKERLFKKGL 378 (481) T ss_pred cccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccc-cHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111122356788887766667788899999888887754444444432221 1222 222233333444456666666 Q ss_pred HHHHHHHHHHh-CCCCcc----ccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCCCCCcccc Q lcl|NC_021302. 355 QAHVVEDIVDV-NWGEDE----PAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGPDPDADDD 427 (484) Q Consensus 355 n~qli~~l~~~-Nf~~~~----~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p~~~e~~~ 427 (484) . ++++.++.+ |..... .-..+.|. ....+..+.++++.+|+ |+ + +.+.+.+.++. ..+..+-+.. T Consensus 379 ~-~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl~--g~-i----s~et~~~~l~~i~d~~~E~~ri 450 (481) T protein:vir:10 379 M-KRYKLLLNNVNLTGLKQHNYAELTITFTPNLPKSMMESINAFNALS--GG-V----SESTRLSLLDFIDNPKEELEKM 450 (481) T ss_pred H-HHHHHHHHHHhccCCCccccceeeEEeCCCCCcCHHHHHHHHHHHh--cc-C----ChHHHHHhCCCCCCHHHHHHHH Confidence 4 466655554 322211 12366675 45667888899998885 54 2 45566677664 2211110000 Q ss_pred cc-cCCCcCCCccccCCCCcccccccccccc Q lcl|NC_021302. 428 ES-TADTGQDEPETDEPALPNTSGTTSTTNA 457 (484) Q Consensus 428 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (484) .. .....+.......+......+....+.+ T Consensus 451 ~~E~~~~~~~~~~~~~~~~~~~~~~~dd~~g 481 (481) T protein:vir:10 451 QEEEAQREKQADKRGYGEAFENHLNVDDSNG 481 (481) T ss_pred HHHHHHHHhhhhhccCCccCCCCCCCCCCCC Confidence 00 0000000000001111000111111111 No 195 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=96.15 E-value=0.00094 Score=37.18 Aligned_cols=408 Identities=10% Similarity=0.044 Sum_probs=162.5 Q ss_pred CCC----CCCCccceeeeecccccchhhhhhhccccccccccc---ccchHHHHH---HHHh------------------ Q lcl|NC_021302. 1 MAP----KTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELR---WPNSVYTYT---RMCR------------------ 52 (484) Q Consensus 1 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr---~~~~~~~y~---~m~~------------------ 52 (484) |+- -+.....+ ++......+......|.........+ ..+..+.|+ +++. T Consensus 1 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ 78 (478) T protein:vir:10 1 MISINWPWDKPYHEQ--VVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPDW 78 (478) T ss_pred CccccccCCchhhhH--HHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhcccccccccccc Confidence 211 11100000 01110000000000000000000000 000111111 0000 Q ss_pred --cchHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcc Q lcl|NC_021302. 53 --EEARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGH 129 (484) Q Consensus 53 --~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~ 129 (484) ..+...-++.+...-+.+-+..+.. ++++..+.+.+ .++ -+|++.+..+ .++..||. T Consensus 79 ki~~n~~k~ivd~~~~yl~g~p~~~~~--~~~~~~~~l~~-----------------~~~-n~~~~~~~~~~~~~~~~G~ 138 (478) T protein:vir:10 79 RMYTNYHQNLVDQKVAYAVANPVTFGV--DNDKALKQIQH-----------------TLN-HKWDDKLVDILTAASNKGI 138 (478) T ss_pred eeccchHHHHHHHHhhhhcccCceeec--CChHHHHHHHH-----------------HHh-ccHHHHHHHHHHHHhhCCe Confidence 0122233333333344444444332 12222222221 222 2566666655 58999998 Q ss_pred eeeeEEEeecCCeeeeeeeeeeCcccee-eeeecCCCceeeee-cccccccc------cccceeccCC------------ Q lcl|NC_021302. 130 AVFEQTYFYEGGRFWLKRLAPRPQSSIA-YWNVDRDGGLISIQ-QWPAGTFG------GPGMVVMAPN------------ 189 (484) Q Consensus 130 s~~Eivw~~~~g~~~~~~l~~r~~~~~~-~~~~~~dg~l~~~~-q~~~~~~~------~~~~~~~~~~------------ 189 (484) + ++++|.-.+|.+. +...+|+.+. .|.....+.++... .+...... ......+... T Consensus 139 ~-~~~v~~d~~~~~~---~~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~ 214 (478) T protein:vir:10 139 E-WVQPYVDEEGEFK---TFRVPAEQAVPIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTFYELKEGQLIPDFYRSE 214 (478) T ss_pred E-EEEEEecCCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEeeeCceEEEEEeCCcEEEEEecCCeeeccccccc Confidence 6 5777766666654 3344444432 22222233333111 11100000 0000000000 Q ss_pred --------CCccccccc--ceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCH Q lcl|NC_021302. 190 --------SMGPAIPVE--QLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDD 259 (484) Q Consensus 190 --------~~~~~lp~~--k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~ 259 (484) ....+-+.. -++.|+ +|+.|.|.+..+....---...+..++..++.|.+++.++.|-......+ T Consensus 215 ~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~~ 289 (478) T protein:vir:10 215 DHIQPHYYQGNKLMSWGRVPFIPFK-----NNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKD 289 (478) T ss_pred cccccceecccccccCCcceEEEec-----cCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCcccccc Confidence 000111111 123332 36778999888655555556677888888888877776766632211111 Q ss_pred HHHHHHHHHHHHHhcCCceEEEc--cCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH-H-- Q lcl|NC_021302. 260 DRMDELLEIASNYSGGESAGLAL--TAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALAS-V-- 334 (484) Q Consensus 260 ~~~~~l~~~l~~~~~g~~a~~vi--p~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~e-v-- 334 (484) ...++... .++.+ ..+.+++++........++..++.+.+.|...--+..++.++.+|+- .|. . T Consensus 290 --------~~~~~~~~--~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~-Sg~Ai~~ 358 (478) T protein:vir:10 290 --------FMHNLKYY--KAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSP-SGIALKF 358 (478) T ss_pred --------hhhhhhhC--ceeEecCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCccccccch-HHHHHHH Confidence 12223222 23333 35678999887777778999999999998888555555554433321 121 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC-ccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHH Q lcl|NC_021302. 335 QADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGE-DEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLR 412 (484) Q Consensus 335 h~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~-~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~ 412 (484) ...-....+..-.+.+...+. ++++-++.+.-.. +..-..++|. ....++.+.++.+.++ .|+ ++.+.+. T Consensus 359 ~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~~~~~d~~~i~i~f~~~~p~~~~e~~~~~~~~--~g~-----iS~et~i 430 (478) T protein:vir:10 359 MYSNLDLKANKLKNKTLTALQ-ELLQYIIDFYRLDVRVQDIEITFNFNVMVNELENSQIAMNS--TGL-----LSKETIL 430 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCcccccceEEeCCCCCCCHHHHHHHHHHH--hCC-----CChHHHH Confidence 112223333444555666663 4666555544221 1122466775 3456778888888877 464 2456666 Q ss_pred HHhCC-CCCCCCcccccc-cCCCcCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 413 DAAGL-PGPDPDADDDES-TADTGQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 413 e~~gl-p~p~~~e~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) +.++. ..+..+-+.... ........+....+.... ..........+ T Consensus 431 ~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~-~~~~~~d~~~e 478 (478) T protein:vir:10 431 GNHSWVQDPVAEMERIEQENIELNQQLPDIEEGLNDE-QQRQSEDNQSE 478 (478) T ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHhccccCCCCccc-ccccCcCCCCC Confidence 76653 322111000000 000000000000000000 00000000001 No 196 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=96.02 E-value=0.0011 Score=36.79 Aligned_cols=415 Identities=10% Similarity=0.024 Sum_probs=166.8 Q ss_pred CCCCC-----CCcccee-eeecccccchhhhhhhccc----c-cccccccccchHHHHHHHHhcchHHHHHHHHHHHHhh Q lcl|NC_021302. 1 MAPKT-----VAPRTER-GYVNPLAGFGTFLAQGLDQ----F-EQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIR 69 (484) Q Consensus 1 ~~~~~-----~~~~~~~-~~~~~~~~~~~~~~~~~~~----~-~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~ 69 (484) +-++- ..+.+.+ -++..+ ..+..|-.. . ......+ +. ..+ ......-++......+. T Consensus 40 ~~~~~i~~~i~~~~~~~~~r~~~l----~~Yy~g~~~il~~~~~~~~~~~-~~-----~ki--~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:78 40 QNVNEVSKYIEHHMDYQRPRLKVL----SDYYEGKTKNLVELTRRKEEYM-AD-----NRV--AHDYASYISDFINGYFL 107 (511) T ss_pred cCHHHHHHHHHHHHHhhhHHHHHH----HHHhhccCccccccCccccccc-Cc-----cee--ecchHHHHHHHHhhhhc Confidence 00000 0000000 000000 001111000 0 0000000 00 011 12344444555556667 Q ss_pred CCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeee Q lcl|NC_021302. 70 RTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRL 148 (484) Q Consensus 70 ~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l 148 (484) +-+..+... +++..+.+.+.+. ..+|+.....+. ++..||. +++++|.-.+|.+.+. T Consensus 108 g~p~~~~~~--d~~~~~~l~~~~~-----------------~n~~~~~~~~~~~~~~~~G~-a~~~vy~d~dg~~~i~-- 165 (511) T protein:vir:78 108 GNPIQYQDD--DKDVLEAIEAFND-----------------LNDVESHNRSLGLDLSIYGK-AYELMIRNQDDETRLY-- 165 (511) T ss_pred ccCceeecC--chHHHHHHHHHHh-----------------hcChhHHHHHHHHHHHhcCe-eEEEEEeCCCCceEEE-- Confidence 777777643 3334444443332 224666666554 7888996 5678887667765443 Q ss_pred eeeCccceeeeeecC--CCceeee-eccc----cc----------ccccccceeccCC-C------------Cccccccc Q lcl|NC_021302. 149 APRPQSSIAYWNVDR--DGGLISI-QQWP----AG----------TFGGPGMVVMAPN-S------------MGPAIPVE 198 (484) Q Consensus 149 ~~r~~~~~~~~~~~~--dg~l~~~-~q~~----~~----------~~~~~~~~~~~~~-~------------~~~~lp~~ 198 (484) ..+|+.+. ..+++ .++++.. +-+. .+ .........+... + ...++..- T Consensus 166 -~~~p~~~~-~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~v 243 (511) T protein:vir:78 166 -KSDAMSTF-IIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERM 243 (511) T ss_pred -EEcccceE-EEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCccc Confidence 34444432 22222 1222211 1000 00 0000001111000 0 01111111 Q ss_pred ceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHH-HHHHhcC-- Q lcl|NC_021302. 199 QLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEI-ASNYSGG-- 275 (484) Q Consensus 199 k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~-l~~~~~g-- 275 (484) -++.|+ +++.|.|.+..+-...=.-...+..++..++.|..++.++.|... .+.++....... +..+... T Consensus 244 Pvv~~~-----n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~--~~~~~~~~~~~~~~~~~~~~~~ 316 (511) T protein:vir:78 244 PITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN--LDPVEVRKQKEANVLFLEPTVY 316 (511) T ss_pred ceEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCcc--CCchhhcccccccceeccccce Confidence 123322 356788998887655555566788888888888777777766322 222222111110 0000000 Q ss_pred -CceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHH--HHHHHHHHHHHHHHHHH Q lcl|NC_021302. 276 -ESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASVQ--ADTFVQSVQTVADEIRD 352 (484) Q Consensus 276 -~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~evh--~~v~~~~~~aD~~~i~~ 352 (484) ...+.-...+.++++++.......++.+++++.+.|...--...++.++-+|.-+.-... ..........-.+.+.. T Consensus 317 ~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~ 396 (511) T protein:vir:78 317 VDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTK 396 (511) T ss_pred eccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000111234677888887666677899999999999887655555554332221111111 12222223333455566 Q ss_pred HHHHHHHHHHHHh---CCCC----ccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCCCCC Q lcl|NC_021302. 353 VAQAHVVEDIVDV---NWGE----DEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGPDPD 423 (484) Q Consensus 353 ~ln~qli~~l~~~---Nf~~----~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p~~~ 423 (484) .|++ +++.++.+ ..+. +-.-.++.|. ..+.+..+.++.+.+|+ |+ ++.+.+.+.++. +.+..+ T Consensus 397 ~l~~-~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~-----iS~et~l~~l~~v~d~~~E 468 (511) T protein:vir:78 397 GLRR-RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GK-----ISQTTLMSLFSFFQDPELE 468 (511) T ss_pred HHHH-HHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-----CChHHHHHhCCCCCCHHHH Confidence 6643 55544443 1111 1112467775 45677888889999885 64 245666677654 222111 Q ss_pred cccccccCCCcCCCccccCCCCccccccccccccccccccccccch Q lcl|NC_021302. 424 ADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSP 469 (484) Q Consensus 424 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (484) -+....................+..... ....++. ........ T Consensus 469 l~ri~~E~~~~~~~~~~~~~~~~~~~~~--~~~~~~~-~~~~~e~~ 511 (511) T protein:vir:78 469 VKKIEEDEKESIKKAQKGIYKDPRDIND--DEQDDDT-KDTVDKKE 511 (511) T ss_pred HHHHHHHHHHHHHHHhhccccCCCCCCC--CCCCCCc-cCcccccC Confidence 0000000000000000000000000000 0000000 00000000 No 197 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=96.02 E-value=0.0011 Score=36.79 Aligned_cols=415 Identities=10% Similarity=0.024 Sum_probs=166.8 Q ss_pred CCCCC-----CCcccee-eeecccccchhhhhhhccc----c-cccccccccchHHHHHHHHhcchHHHHHHHHHHHHhh Q lcl|NC_021302. 1 MAPKT-----VAPRTER-GYVNPLAGFGTFLAQGLDQ----F-EQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIR 69 (484) Q Consensus 1 ~~~~~-----~~~~~~~-~~~~~~~~~~~~~~~~~~~----~-~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~ 69 (484) +-++- ..+.+.+ -++..+ ..+..|-.. . ......+ +. ..+ ......-++......+. T Consensus 40 ~~~~~i~~~i~~~~~~~~~r~~~l----~~Yy~g~~~il~~~~~~~~~~~-~~-----~ki--~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:96 40 QNVNEVSKYIEHHMDYQRPRLKVL----SDYYEGKTKNLVELTRRKEEYM-AD-----NRV--AHDYASYISDFINGYFL 107 (511) T ss_pred cCHHHHHHHHHHHHHhhhHHHHHH----HHHhhccCccccccCccccccc-Cc-----cee--ecchHHHHHHHHhhhhc Confidence 00000 0000000 000000 001111000 0 0000000 00 011 12344444555556667 Q ss_pred CCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeee Q lcl|NC_021302. 70 RTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRL 148 (484) Q Consensus 70 ~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l 148 (484) +-+..+... +++..+.+.+.+. ..+|+.....+. ++..||. +++++|.-.+|.+.+. T Consensus 108 g~p~~~~~~--d~~~~~~l~~~~~-----------------~n~~~~~~~~~~~~~~~~G~-a~~~vy~d~dg~~~i~-- 165 (511) T protein:vir:96 108 GNPIQYQDD--DKDVLEAIEAFND-----------------LNDVESHNRSLGLDLSIYGK-AYELMIRNQDDETRLY-- 165 (511) T ss_pred ccCceeecC--chHHHHHHHHHHh-----------------hcChhHHHHHHHHHHHhcCe-eEEEEEeCCCCceEEE-- Confidence 777777643 3334444443332 224666666554 7888996 5678887667765443 Q ss_pred eeeCccceeeeeecC--CCceeee-eccc----cc----------ccccccceeccCC-C------------Cccccccc Q lcl|NC_021302. 149 APRPQSSIAYWNVDR--DGGLISI-QQWP----AG----------TFGGPGMVVMAPN-S------------MGPAIPVE 198 (484) Q Consensus 149 ~~r~~~~~~~~~~~~--dg~l~~~-~q~~----~~----------~~~~~~~~~~~~~-~------------~~~~lp~~ 198 (484) ..+|+.+. ..+++ .++++.. +-+. .+ .........+... + ...++..- T Consensus 166 -~~~p~~~~-~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~v 243 (511) T protein:vir:96 166 -KSDAMSTF-IIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERM 243 (511) T ss_pred -EEcccceE-EEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCccc Confidence 34444432 22222 1222211 1000 00 0000001111000 0 01111111 Q ss_pred ceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHH-HHHHhcC-- Q lcl|NC_021302. 199 QLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEI-ASNYSGG-- 275 (484) Q Consensus 199 k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~-l~~~~~g-- 275 (484) -++.|+ +++.|.|.+..+-...=.-...+..++..++.|..++.++.|... .+.++....... +..+... T Consensus 244 Pvv~~~-----n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~--~~~~~~~~~~~~~~~~~~~~~~ 316 (511) T protein:vir:96 244 PITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN--LDPVEVRKQKEANVLFLEPTVY 316 (511) T ss_pred ceEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCcc--CCchhhcccccccceeccccce Confidence 123322 356788998887655555566788888888888777777766322 222222111110 0000000 Q ss_pred -CceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHH--HHHHHHHHHHHHHHHHH Q lcl|NC_021302. 276 -ESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASVQ--ADTFVQSVQTVADEIRD 352 (484) Q Consensus 276 -~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~evh--~~v~~~~~~aD~~~i~~ 352 (484) ...+.-...+.++++++.......++.+++++.+.|...--...++.++-+|.-+.-... ..........-.+.+.. T Consensus 317 ~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~ 396 (511) T protein:vir:96 317 VDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTK 396 (511) T ss_pred eccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000111234677888887666677899999999999887655555554332221111111 12222223333455566 Q ss_pred HHHHHHHHHHHHh---CCCC----ccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCCCCC Q lcl|NC_021302. 353 VAQAHVVEDIVDV---NWGE----DEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGPDPD 423 (484) Q Consensus 353 ~ln~qli~~l~~~---Nf~~----~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p~~~ 423 (484) .|++ +++.++.+ ..+. +-.-.++.|. ..+.+..+.++.+.+|+ |+ ++.+.+.+.++. +.+..+ T Consensus 397 ~l~~-~~~li~~~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~-----iS~et~l~~l~~v~d~~~E 468 (511) T protein:vir:96 397 GLRR-RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GK-----ISQTTLMSLFSFFQDPELE 468 (511) T ss_pred HHHH-HHHHHHHHHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-----CChHHHHHhCCCCCCHHHH Confidence 6643 55544443 1111 1112467775 45677888889999885 64 245666677654 222111 Q ss_pred cccccccCCCcCCCccccCCCCccccccccccccccccccccccch Q lcl|NC_021302. 424 ADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSP 469 (484) Q Consensus 424 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (484) -+....................+..... ....++. ........ T Consensus 469 l~ri~~E~~~~~~~~~~~~~~~~~~~~~--~~~~~~~-~~~~~e~~ 511 (511) T protein:vir:96 469 VKKIEEDEKESIKKAQKGIYKDPRDIND--DEQDDDT-KDTVDKKE 511 (511) T ss_pred HHHHHHHHHHHHHHHhhccccCCCCCCC--CCCCCCc-cCcccccC Confidence 0000000000000000000000000000 0000000 00000000 No 198 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=95.77 E-value=0.0015 Score=36.09 Aligned_cols=383 Identities=8% Similarity=-0.014 Sum_probs=158.0 Q ss_pred chhhhhh-hccccccc--ccc-cccchHHHHH---HHHh----------------------------cchHHHHHHHHHH Q lcl|NC_021302. 21 FGTFLAQ-GLDQFEQV--DEL-RWPNSVYTYT---RMCR----------------------------EEARIASVLRAIG 65 (484) Q Consensus 21 ~~~~~~~-~~~~~~~~--~~l-r~~~~~~~y~---~m~~----------------------------~D~~v~s~l~~r~ 65 (484) +..-.+. .+...... ... +..+..+.|+ +++. ..+-..-.+.+.. T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 2211100 00000000 000 0011111111 0000 0111222223333 Q ss_pred HHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEee-cCCee Q lcl|NC_021302. 66 LPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFY-EGGRF 143 (484) Q Consensus 66 ~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~-~~g~~ 143 (484) .-+.+-+-.+.+. +++..+.+.+ ... -+|+.....+. ++..||.+. +++|.. .+|.+ T Consensus 81 ~yl~G~p~~~~~~--~~~~~~~l~~-----------------~~~-n~~~~~~~~~~~~~~~~G~~~-~~v~~d~~~g~~ 139 (471) T protein:vir:10 81 AYALTYPPTFDVD--DKKVNDMIVD-----------------VLG-DDYERISKQLCVNAGNAGIAW-LHVWKDASDNSF 139 (471) T ss_pred hhhcccCceeccC--ChHHHHHHHH-----------------HHh-cCHHHHHHHHHHHHhhCCeEE-EEEEeeCCCCee Confidence 3333434333321 2222222211 112 25777777654 788999665 566543 46765 Q ss_pred eeeeeeeeCccceeeeeecC--CCceeee-ecccc----ccccccc--------ceeccCC------------------- Q lcl|NC_021302. 144 WLKRLAPRPQSSIAYWNVDR--DGGLISI-QQWPA----GTFGGPG--------MVVMAPN------------------- 189 (484) Q Consensus 144 ~~~~l~~r~~~~~~~~~~~~--dg~l~~~-~q~~~----~~~~~~~--------~~~~~~~------------------- 189 (484) .+. ..+|+.+. ..+++ ++.++.. +.+.. +...... ...+... T Consensus 140 ~~~---~~~p~~~~-~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~ 215 (471) T protein:vir:10 140 RYA---CVDSKEVI-PIYSKSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDT 215 (471) T ss_pred EEE---EEcccceE-EEEcCCCCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCccccccccccccccccc Confidence 444 34455432 12222 2223211 11100 0000000 0000000 Q ss_pred ---------CCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHH Q lcl|NC_021302. 190 ---------SMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDD 260 (484) Q Consensus 190 ---------~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~ 260 (484) .....+..--++.|+ +|..|.|.+..+-...-.-...+..++..++.|..++.++.|-. +...+ T Consensus 216 ~~~~~~~~~~~~~~~g~iPvv~~~-----n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~--~~~~~ 288 (471) T protein:vir:10 216 MNGDRSSDNSFKHDFGLVPFIPFK-----NNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYG--GQDKQ 288 (471) T ss_pred ccccccccccccCCCCceeEEEec-----cCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCC--ccccc Confidence 000011111123332 24567888877665555556678888988998877666665532 22211 Q ss_pred HHHHHHHHHHHHhcCCceEEEcc-----CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhH-HH Q lcl|NC_021302. 261 RMDELLEIASNYSGGESAGLALT-----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALA-SV 334 (484) Q Consensus 261 ~~~~l~~~l~~~~~g~~a~~vip-----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~-ev 334 (484) +.+.++..+ ..+.++ .+.+++++........++..++.+.+.|...--+..++.++.|.+.+.| +. T Consensus 289 ------~~~~~~~~~--~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~gn~Sg~Alk~ 360 (471) T protein:vir:10 289 ------EFLEDLKRY--KMIKMDNDGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDKLGNSSGVALKF 360 (471) T ss_pred ------hhHHHhhcC--CeEEecCCCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCcccccCccHHHHHH Confidence 122333222 223333 3457888887777778999999999999887555545444322221111 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHH Q lcl|NC_021302. 335 QADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRD 413 (484) Q Consensus 335 h~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e 413 (484) ...-....+..-.+.+...+ +++++.++.+.-..+..-..+.|. ..+.+..+.++.+++|. |+ +|.+.+.+ T Consensus 361 ~~~~l~~k~~~~~~~~~~~l-~~~~~li~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~--g~-----iS~et~~~ 432 (471) T protein:vir:10 361 LYSLLELKAGNMETQFRSGY-ATLVKMILKHLGLSDKLKIKQTWTRNSINNDTEMAQVVSTLA--TI-----TSRENVAK 432 (471) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhccCCCceeEEEeCCCCCCCHHHHHHHHHHHh--cc-----CchHHHHH Confidence 12222223444455566666 346666665432222222366675 45677888889998874 54 25667777 Q ss_pred HhCC-CCCCCCcccccccCCCcCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 414 AAGL-PGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 414 ~~gl-p~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) .++. ..+. ++ ...... .+.......+...+...+. -.+ T Consensus 433 ~~p~v~D~~--~E-~eri~~-E~~~~~~~~~~~~~~~~~~----e~~ 471 (471) T protein:vir:10 433 SNPIVEDWQ--DE-LRLQKA-EQEGRSEKLYDMEEVEHES----EVE 471 (471) T ss_pred hCCCCCCHH--HH-HHHHHH-HHHHHHhcccccCCCCCcc----ccC Confidence 7644 2111 11 000000 0000000000000101000 000 No 199 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=95.73 E-value=0.0015 Score=35.99 Aligned_cols=415 Identities=11% Similarity=0.036 Sum_probs=169.6 Q ss_pred CCCCCC-----Ccccee-eeecccccchhhhhhhcccc-----cccccccccchHHHHHHHHhcchHHHHHHHHHHHHhh Q lcl|NC_021302. 1 MAPKTV-----APRTER-GYVNPLAGFGTFLAQGLDQF-----EQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIR 69 (484) Q Consensus 1 ~~~~~~-----~~~~~~-~~~~~~~~~~~~~~~~~~~~-----~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~ 69 (484) +-++-. .+.+.+ -++..+ ..+..|-... ......+ +. ..+ ......-++.+...-+. T Consensus 40 ~~~~~i~~~i~~~~~~~~~r~~~l----~~Yy~g~~~i~~~~~~~~~~~~-~~-----~ki--~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:96 40 QNVNEVSKYIEHHMDYQRPRLKVL----SDYYEGKTKNLVELTRRKEEYM-AD-----NRV--AHDYASYISDFINGYFL 107 (511) T ss_pred ccHHHHHHHHHHHHHhhHHHHHHH----HHHhcccCccccccCcCccccc-Cc-----cee--ecchHHHHHHHHHhhhc Confidence 000000 000000 000000 0011110000 0000000 00 011 12344445555566677 Q ss_pred CCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeee Q lcl|NC_021302. 70 RTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRL 148 (484) Q Consensus 70 ~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l 148 (484) +-+..+.+.+ ++..+++.+.+ ..-.|+.....+. ++..||. +.+++|.-.+|.+. + T Consensus 108 g~p~~~~~~~--~~~~~~l~~~~-----------------~~n~~~~~~~~~~~~~~i~G~-a~~~vy~ded~~~~---i 164 (511) T protein:vir:96 108 GNPIQYQDDD--KDVLEAIEAFN-----------------DLNDVESHNRSLGLDLSIYGK-AYELMIRNQDDETR---L 164 (511) T ss_pred cCCceeecCc--hHHHHHHHHHH-----------------hhcCHHHHHHHHHHHHHhcCe-eEEEEEeCCCCceE---E Confidence 7777776433 33334443333 2225777666664 7889997 57788876677654 3 Q ss_pred eeeCccceeeeeecC--CCceeee-ecccc----c----------ccccccceeccC-CCC----------ccccccc-- Q lcl|NC_021302. 149 APRPQSSIAYWNVDR--DGGLISI-QQWPA----G----------TFGGPGMVVMAP-NSM----------GPAIPVE-- 198 (484) Q Consensus 149 ~~r~~~~~~~~~~~~--dg~l~~~-~q~~~----~----------~~~~~~~~~~~~-~~~----------~~~lp~~-- 198 (484) ...+|+.+. ..+++ .++++.. +.+.. + .........+.. .+. ..+-|.. T Consensus 165 ~~~~p~~~~-~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v 243 (511) T protein:vir:96 165 YKSDAMSTF-VIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERM 243 (511) T ss_pred EEEccceeE-EEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCCce Confidence 444455432 22222 1222211 11000 0 000000011000 000 0111111 Q ss_pred ceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHH-H---HHHHhc Q lcl|NC_021302. 199 QLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLE-I---ASNYSG 274 (484) Q Consensus 199 k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~-~---l~~~~~ 274 (484) -++.|+ .|..|.|.+..+-...---...+..++..++.+..++.++.|... .+.++.....+ . +..... T Consensus 244 Pvv~~~-----nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~--~~~~~~~~~~~~~~~~~~~~~~ 316 (511) T protein:vir:96 244 PITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN--LDPVEVRKQKEANVLFLEPTVY 316 (511) T ss_pred eeEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCcc--CCchhhcccccccceecccccc Confidence 123332 356789999888766666677888888888887666656655322 22222111100 0 000000 Q ss_pred CCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccch-hhH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 275 GESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSY-ALA-SVQADTFVQSVQTVADEIRD 352 (484) Q Consensus 275 g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~-A~~-evh~~v~~~~~~aD~~~i~~ 352 (484) ....+.-...+.++++++.......++..++++.+.|...--...++.++-+|.- |.| .....-....+..-.+.+.. T Consensus 317 ~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~ 396 (511) T protein:vir:96 317 ADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTK 396 (511) T ss_pred cccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0001112345678888887666678899999999999887666666654332221 111 11112222333334455566 Q ss_pred HHHHHHHHHHHHh---CCCC----ccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCCCCC Q lcl|NC_021302. 353 VAQAHVVEDIVDV---NWGE----DEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGPDPD 423 (484) Q Consensus 353 ~ln~qli~~l~~~---Nf~~----~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p~~~ 423 (484) .|++ +++.++.+ +... +-.-.++.|. ..+.+..+.++++.+| .|+ ++.+.+.+.++. +.|..+ T Consensus 397 ~l~~-~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~G~-----iS~et~l~~l~~v~D~~~E 468 (511) T protein:vir:96 397 GLRR-RAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGK-----ISQTTLMSLFSFFQDPELE 468 (511) T ss_pred HHHH-HHHHHHHHHHhhcCcccccccccceEEeCCCCCCCHHHHHHHHHHH--hcc-----CChHHHHHhCCCCCCHHHH Confidence 6643 45544443 2111 1112467775 4567788888888887 464 245667777764 222111 Q ss_pred cccccccCCCcCCCccccCCCCccccccccccccccccccccccch Q lcl|NC_021302. 424 ADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSP 469 (484) Q Consensus 424 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (484) -+....................+........ ..+. ........ T Consensus 469 ~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-~~~~~~~~ 511 (511) T protein:vir:96 469 VKKIEEDEKESIKKAQKGIYKDPRDINDDEQ--DDDT-KDTVDKKE 511 (511) T ss_pred HHHHHHHHHHHHHHHhhccccCCCCCCCCCC--CCcc-cccccccC Confidence 0000000000000000000000000000000 0000 00000000 No 200 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=95.71 E-value=0.0016 Score=35.93 Aligned_cols=415 Identities=13% Similarity=0.056 Sum_probs=161.9 Q ss_pred CCCCCCccceeeeeccccc-c-------------hhhhhhhcccccccccccccchHHHHHHHHh---cchHHHHHHHHH Q lcl|NC_021302. 2 APKTVAPRTERGYVNPLAG-F-------------GTFLAQGLDQFEQVDELRWPNSVYTYTRMCR---EEARIASVLRAI 64 (484) Q Consensus 2 ~~~~~~~~~~~~~~~~~~~-~-------------~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~---~D~~v~s~l~~r 64 (484) --+++...+.+.+.++.-. + +....+.-.....++ +.+. -.-|+.=+. .=+++...++.. T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~--~~~~-e~~Y~~rl~rA~~~n~~~~tl~~l 77 (489) T protein:vir:78 1 MLTENGQGSGVKTKHREWLHYAPKWQKVRHALAGELVSYLRNVGLNEPD--KAYG-EARQAEYEAGGIVYNFTRRTLSGM 77 (489) T ss_pred CccCCCccCCCCccCHHHHHHHHHHHHHHHHhcCcccccccCCCCCCCC--CCCC-hHHHHHHHhccccCChHHHHHHHH Confidence 1112222222222222100 0 000011000000100 1111 112443332 234444444444 Q ss_pred HHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCC-- Q lcl|NC_021302. 65 GLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGG-- 141 (484) Q Consensus 65 ~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g-- 141 (484) ...|.+.+-.++ .++..+.+.+++. ..+.+++.+++.++ .++.||.+.+=+.+-..++ T Consensus 78 ~G~vfrk~p~~~----~p~~l~~l~~d~D---------------~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T 138 (489) T protein:vir:78 78 VGSVMRKEPEIN----IPKELEYLLKNAD---------------GSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAAT 138 (489) T ss_pred hchhhcCCccee----ccHHHHHHHhccC---------------CCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcC Confidence 444444444342 2222222222221 13557888888886 6888998876655532221 Q ss_pred -------eeeeeeeeeeCccceeeeeecCCCc---e--eeeecc-----ccccccc----------------ccceec-- Q lcl|NC_021302. 142 -------RFWLKRLAPRPQSSIAYWNVDRDGG---L--ISIQQW-----PAGTFGG----------------PGMVVM-- 186 (484) Q Consensus 142 -------~~~~~~l~~r~~~~~~~~~~~~dg~---l--~~~~q~-----~~~~~~~----------------~~~~~~-- 186 (484) ... -.+..+.+..|-=|+++..++ | +.++.. +.+.++. .+...+ T Consensus 139 ~ade~~~~~r-Py~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~ 217 (489) T protein:vir:78 139 AAEQNAGLLN-PTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRF 217 (489) T ss_pred HHHHHHhcCC-cEEEEechhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEe Confidence 001 123333333333344433331 1 112211 0011110 000000 Q ss_pred -------------cCCCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHH-HHHHHHHHHhcCCcceEEec Q lcl|NC_021302. 187 -------------APNSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELI-RIEAAAIRRHGIGVPYLKGN 252 (484) Q Consensus 187 -------------~~~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~-~~w~~f~Er~~~G~P~~~gk 252 (484) .....+..++.=-|+++... ..+--.+...|..++..-+ +++-. .+.-..+-.-++|+|++.|- T Consensus 218 ~~~g~~~~~~~~~~~~~g~~~l~~IPfv~~~~~-~~~~~~~~pPLl~LA~lni-~Hy~~ssd~~~~l~~~~~P~l~i~G~ 295 (489) T protein:vir:78 218 DAEGGAQEDVVEIYPDLGESLRGVIPFTFIGAT-NNDATIDDAPLLPLAELNI-GHYRNSADNEESSFVVGQPTLFIYPG 295 (489) T ss_pred ecCCcccceeeEEeccCCCCccCeeeEEEEecC-CCCCCCCcCchHHHHHHHH-HHhhhhhHHHHHHHHcccceeeeecC Confidence 00111222222223333322 2222334555555554422 22222 22222222224566666552 Q ss_pred CCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhh-hhhccc-ccccchh Q lcl|NC_021302. 253 EADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALA-HFLNLD-GKGGSYA 330 (484) Q Consensus 253 ~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilG-qtlt~~-~~gGs~A 330 (484) ...+++....... ..++-|.+++..+|.+.+..+++.++++... +.++-...+|. .+| ..++.. +.+++. T Consensus 296 --d~~~~~~~~~~~~--~~i~~g~~~~~~lp~~~~~~~ie~~~~~~~r-~~l~~le~qm~--~lGa~l~~~~~~~Ta~~- 367 (489) T protein:vir:78 296 --ENLTPQAFKEANP--NGIKFGSRRGHNLGYGGSAQLIQAGENNLAR-QNMLDKEQQAI--QIGAQLITPTQQITAQS- 367 (489) T ss_pred --ccCCcccccccCc--cceeeCCcccccCCCCCCcceeccCcchHHH-HHHHHHHHHHH--HHhhhhccCCcchhHHH- Confidence 2222221111111 2234477788899999999999998766543 33333333433 234 333321 122222 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecC--CCCcH-HHHHHHHHHHHhcCcccCCccc Q lcl|NC_021302. 331 LASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDE--IGSRQ-DATAAALQMLVNAGLLTPDPRL 407 (484) Q Consensus 331 ~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~--~~~~~-~~~ae~~~~L~~~G~~~~~~~~ 407 (484) ...........+.+-+..+++.++ +++++++.+-.-.+..-+.|.+.. ...++ ....+++-++.+.|.+-. ..- T Consensus 368 -~~~~~~~~~S~L~~~a~~~e~al~-~~l~~~a~w~G~~~~~~~~i~~n~dF~~~~~d~~~~~al~~~~~~G~is~-~t~ 444 (489) T protein:vir:78 368 -ARIQRGADTSVMATIARNVSQAYT-DALRWVAVMLGKPEDTEVEFRLNMDFFLEPMTAQDRAAWMADINAGLLPA-TAY 444 (489) T ss_pred -HHHHHHHhhHHHHHHHHHHHHHHH-HHHHHHHHHcCCCCCCceEEEeecccCcccCCHHHHHHHHHHHhcCCCCH-HHH Confidence 223344456777888899999996 599999998432222223454321 11111 223455566777886431 111 Q ss_pred HHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccccccccccccccccc Q lcl|NC_021302. 408 EAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARK 462 (484) Q Consensus 408 ~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 462 (484) -.++ ++-|+..+.+. ++.....+.+.+ ...+..|+ .+....+.-+ T Consensus 445 ~~~L-~~~gv~d~~~e-~~~~ei~~~~~~-------~~~~~~g~-~~~~~q~~~~ 489 (489) T protein:vir:78 445 YAAL-RKAGVTDWTDA-DIKDAVADQPLP-------VATEVQGE-IPQSAQQQEK 489 (489) T ss_pred HHHH-HhCCCCCccHH-HHHHHHhhcCCC-------cccCCccc-CCCCcccccC Confidence 2233 34466544322 221111111000 00000111 1111111111 No 201 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=95.56 E-value=0.0019 Score=35.57 Aligned_cols=451 Identities=10% Similarity=0.039 Sum_probs=180.3 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcE---Eec Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWR---IRP 77 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~---v~p 77 (484) ++|+-..+.++++ .+.+|+..-.. ..... ......|+.|++|+ .++.|-++++.....+.-.+-. |+- T Consensus 21 vpp~~~~~~~~i~----~g~~g~~v~~~--g~~~~--~n~~eLI~~YR~ma-~~pEVd~Av~eIVneaIv~d~~~~pV~v 91 (564) T protein:vir:10 21 VPPNDEASVSTVA----GGYFGTYVDTS--GGQNS--RNEYELIRRYRDMS-LHPEVDSAIDEIVNEFVVNDGDDKPVEV 91 (564) T ss_pred ccCCcCCChhhhh----ccccceeeecc--cccch--hhHHHHHHHHHHHh-hccchhhHHHHhhcceeEecCCCceEEE Confidence 4444444444431 12223221110 00011 12346789999997 5999999999887754433211 111 Q ss_pred CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccce Q lcl|NC_021302. 78 NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSI 156 (484) Q Consensus 78 ~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~ 156 (484) +=+.-+..+.+.+-+.. .+...+.-.+|+.-..++. .-...|--.+.++=..++..-.+.+|...+|+.+ T Consensus 92 dL~~~~~s~siK~kI~e---------EF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~eLr~lDPr~i 162 (564) T protein:vir:10 92 DLQNLEIGSGVKKKIRD---------EFNRILRMMNFNVNAHEIIRNWYVDGRSHYHKVIDLDNPKKGILELRYIDSLKI 162 (564) T ss_pred EecccCcchHHHHHHHH---------HHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeeCCChhhhhhhhhhhcccce Confidence 11111222332222211 0111111112222111111 1112233334444333333334567777788877 Q ss_pred eeeeecC-----CCc-eeeeecccccccccccceecc-----------------CCCCcccccccceEEEeecCc--cCc Q lcl|NC_021302. 157 AYWNVDR-----DGG-LISIQQWPAGTFGGPGMVVMA-----------------PNSMGPAIPVEQLVVYTHDMD--PGV 211 (484) Q Consensus 157 ~~~~~~~-----dg~-l~~~~q~~~~~~~~~~~~~~~-----------------~~~~~~~lp~~k~l~~~~~~~--~~~ 211 (484) .+.+... .+. +.+.....-..........+. ....++.||. ..|+|.|..- .++ T Consensus 163 ~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~~g~~~~~~~~~~~~~~~~ikI~~-daI~y~hSGL~d~~~ 241 (564) T protein:vir:10 163 RKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGFAGNIPMVTGSMDWSNQEGIKIAS-DAIAQSTSGLMDLNK 241 (564) T ss_pred eeeeeeccccccccceeeeeeeeeccccccccceeeccccccCcccccccccccccccceeech-hhcceecccceeCCC Confidence 6544111 111 111110000000000111111 1233455544 4677777642 233 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHH-HHHhcCCcc-eEEecCCCCCCHHHHHHHHHHHHHHhcC----CceEEE---- Q lcl|NC_021302. 212 WTGNSLLRPAYKNWKLKDELIRIEAAA-IRRHGIGVP-YLKGNEADSEDDDRMDELLEIASNYSGG----ESAGLA---- 281 (484) Q Consensus 212 p~G~gll~~~~~~~~~K~~~~~~w~~f-~Er~~~G~P-~~~gk~~~~~~~~~~~~l~~~l~~~~~g----~~a~~v---- 281 (484) ..=.|.|.++..++==-++..-..+.| +-|- |-. +.-...+.-...+.-..|.+++..+++. .++|-| T Consensus 242 ~~i~gyLhkAIKp~NQLkmlEDAlVIYRitRA--PeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGevrddr 319 (564) T protein:vir:10 242 KMTLSFLHKAIKSLNQLRMIEDSLVIYRLSRA--PERRIFYIDVGNLPKVKAEQYLRDVMSRYRNKLVYDGQTGEIRDDK 319 (564) T ss_pred CceeccchhhhHhHHhhHHHHhhHHHHhhhcc--ccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceecccc Confidence 334678888888875555555555444 2221 110 1111223334444445666776666532 122222 Q ss_pred ----------cc-----CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccc----hhhHHHHHHH-HHH Q lcl|NC_021302. 282 ----------LT-----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGS----YALASVQADT-FVQ 341 (484) Q Consensus 282 ----------ip-----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs----~A~~evh~~v-~~~ 341 (484) +| .|++|.++.++.+. .-.+=++|..+.+-+++..+.--.+.++++ ++..-+..++ |.. T Consensus 320 k~msMlEDyWLPRReGgrgTEItTLpGgqnL-gem~DV~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~~EItRDEiKF~K 398 (564) T protein:vir:10 320 KHMSMLEDFWLPRREGGRGTEITTLPGGQNL-GELKDVEYFKKKLYNSLNLPPSRLTDDNKAFNLGKSTEILRDELKFTK 398 (564) T ss_pred hhhhhHhhhcccccCCCcccceeeccccCCc-chHHHHHHHHHHHHHHhCCCcccccCCCceeecccccchhHHHHHHHH Confidence 22 47889888765432 233458899999999977665333333322 2222223333 334 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhC------CCCccccceEEecCCC--C---cHHHHHHHHHHHHhcCcccCCcccHHH Q lcl|NC_021302. 342 SVQTVADEIRDVAQAHVVEDIVDVN------WGEDEPAPLLVFDEIG--S---RQDATAAALQMLVNAGLLTPDPRLEAF 410 (484) Q Consensus 342 ~~~aD~~~i~~~ln~qli~~l~~~N------f~~~~~~P~~~~~~~~--~---~~~~~ae~~~~L~~~G~~~~~~~~~~~ 410 (484) .+......++..|..-|-..|+--+ |..-...-+|.|.... . +.+.+.+++..|..+-=.+....+.+| T Consensus 399 FI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dy 478 (564) T protein:vir:10 399 FIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTEY 478 (564) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHH Confidence 4555666666666554444444333 2111122244442211 1 223345666666655223444567888 Q ss_pred HHHHh-CCC---------------------CCCCCcccccccCCCcCCCccccCCCCccccccccccccccccccccccc Q lcl|NC_021302. 411 LRDAA-GLP---------------------GPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRS 468 (484) Q Consensus 411 i~e~~-glp---------------------~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 468 (484) +++.+ .+. .|.+.+..+....+ .....|..++..+...+....++...+.... T Consensus 479 i~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~~~~~~~~-----~~~~~p~~~~~~~~~~~~~~~~~~~~a~~~~ 553 (564) T protein:vir:10 479 IRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNMLDDMEKQ-----NQAFAPELQAAQDDLAAEREIKKLNSAPKPP 553 (564) T ss_pred HHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhcCCCccCC-----CCcCCcchhhhccccccccChhhhccCCCCC Confidence 76553 322 12111111100000 0111111111111111111111000000000 Q ss_pred hHHHhc-Cccc Q lcl|NC_021302. 469 PRDRRK-TPDG 478 (484) Q Consensus 469 ~~~~~~-~~~~ 478 (484) +...++ ++.- T Consensus 554 ~~~~~~~~~~~ 564 (564) T protein:vir:10 554 PSQQSKSQSNK 564 (564) T ss_pred CCCCCcCcCCC Confidence 111111 0111 No 202 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=95.42 E-value=0.0021 Score=35.26 Aligned_cols=449 Identities=12% Similarity=0.080 Sum_probs=188.0 Q ss_pred CCCCCCCccceee---ee--cccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEE Q lcl|NC_021302. 1 MAPKTVAPRTERG---YV--NPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRI 75 (484) Q Consensus 1 ~~~~~~~~~~~~~---~~--~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v 75 (484) .+|+.+++..+.. .+ ...+.+|+ ..++.+ ........|+.|++|+ .++.|-++++.....+.-.+-.- T Consensus 14 ~~~~~~s~~~~~~~dg~~~i~~~~~~~~--~~~~e~----~~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~d~~~ 86 (533) T protein:vir:10 14 KAPKGPSFVQKDNLDGSQPVSGGGYYGY--TVDFDG----QVRNEYQLISRYREMV-LQPECDSAVDDIVNETICGNFDD 86 (533) T ss_pred ccccCCCCCCCCcccccceeecccccce--eeeccc----ccchHHHHHHHHHHHh-hccchhhHHHHhhcceeeecCCC Confidence 5555555543322 11 11111111 112211 1112456799999998 59999999998887665443221 Q ss_pred ec---CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeee Q lcl|NC_021302. 76 RP---NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPR 151 (484) Q Consensus 76 ~p---~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r 151 (484) .| +=+.-+..+.+.+-+.. .+...+.-.+|+.-..++. .-...|--.+.++=..++..-.+.+|... T Consensus 87 ~pV~i~Ld~~~~s~~iK~kI~e---------EF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~ELr~l 157 (533) T protein:vir:10 87 VPVSVELSNLKVSDKIKKLIRE---------EFGEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPDNPQGGLIELRYI 157 (533) T ss_pred ceEEEEecccccchHHHHHHHH---------HHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCCCccccceeeeec Confidence 11 11111222222222211 0111111112222111111 11112333344443434444445667777 Q ss_pred CccceeeeeecC---CCceee--eecccccccc-----cccceeccCCCCcccccccceEEEeecCc--cCccccchhHH Q lcl|NC_021302. 152 PQSSIAYWNVDR---DGGLIS--IQQWPAGTFG-----GPGMVVMAPNSMGPAIPVEQLVVYTHDMD--PGVWTGNSLLR 219 (484) Q Consensus 152 ~~~~~~~~~~~~---dg~l~~--~~q~~~~~~~-----~~~~~~~~~~~~~~~lp~~k~l~~~~~~~--~~~p~G~gll~ 219 (484) +|+.+.+++.-. .....+ .......... .+... ......++.||. ..|+|.|..- .++..=.|.|. T Consensus 158 DPr~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~g~-~~~~~~~vkI~~-dAI~y~hSGl~d~~~~~i~syLh 235 (533) T protein:vir:10 158 DPRKIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLYDPKGL-KNSTTQGLKIAP-DSICYVHSGIMDLNKNMTLSHLH 235 (533) T ss_pred cccceeeeeeeeccCCCccceeecchhhhccceeeeeeccccc-cccCCCceecch-hheeeeeccceeCCCCceeccch Confidence 777776533221 111100 0000000000 01111 122456677877 6788888543 22222357888 Q ss_pred HHHHHHHHHHHHHHHHHHH-HHHhcCCcc-eEEecCCCCCCHHHHHHHHHHHHHHhcC----CceEEE------------ Q lcl|NC_021302. 220 PAYKNWKLKDELIRIEAAA-IRRHGIGVP-YLKGNEADSEDDDRMDELLEIASNYSGG----ESAGLA------------ 281 (484) Q Consensus 220 ~~~~~~~~K~~~~~~w~~f-~Er~~~G~P-~~~gk~~~~~~~~~~~~l~~~l~~~~~g----~~a~~v------------ 281 (484) ++..++==-++..-..+.| +-|- |-. +.-...+.-...+.-..|.+++..+++. .++|-+ T Consensus 236 kAiKp~NQLkm~EDAlVIYRitRA--PeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlED 313 (533) T protein:vir:10 236 KAIKAVNQLRMIEDSLVIYRLSRA--PERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLED 313 (533) T ss_pred HhHHHHHhhHHHHhhHHHHhhhcc--ccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhh Confidence 8888875555555554444 2221 110 1111223334444445666776666532 122222 Q ss_pred --cc-----CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhccccccc-c--hhhHHHHHHH-HHHHHHHHHHHH Q lcl|NC_021302. 282 --LT-----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGG-S--YALASVQADT-FVQSVQTVADEI 350 (484) Q Consensus 282 --ip-----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gG-s--~A~~evh~~v-~~~~~~aD~~~i 350 (484) +| .|++|.++.++.+. .-.+=++|..+.+-+++..+.--.+.+|| + ++..-+..++ |...+......+ T Consensus 314 yWLPRReGgrgTEItTLpGgqnL-gem~DV~YF~kKLY~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rF 392 (533) T protein:vir:10 314 FWLPRREGGRGTEITTLPGGQNL-GELEDVKYFQKKLYKSLNVPGSRLETETTFNVGRAAEITRDEVKFQKFVARLRKRF 392 (533) T ss_pred hcccccCCCCccceeeccccCCc-ChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHH Confidence 22 47889888765332 33445789999999997665533333332 1 2222233343 334455556666 Q ss_pred HHHHHHHHHHHHHHhC------CCCccccceEEecCCC--C---cHHHHHHHHHHHHhcCcccCCcccHHHHHHHh-CCC Q lcl|NC_021302. 351 RDVAQAHVVEDIVDVN------WGEDEPAPLLVFDEIG--S---RQDATAAALQMLVNAGLLTPDPRLEAFLRDAA-GLP 418 (484) Q Consensus 351 ~~~ln~qli~~l~~~N------f~~~~~~P~~~~~~~~--~---~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~-glp 418 (484) +..|..-|-..|+--+ |..-...-+|.|.... . +.+.+.+++..|..+--.+....+.+|+++.+ .+. T Consensus 393 s~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~t 472 (533) T protein:vir:10 393 SELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQT 472 (533) T ss_pred HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccC Confidence 6666554444444333 2111122244442211 1 22344566666665532344456888886553 433 Q ss_pred CCCCCcc--------cccccCCCcCC-CccccCCCCccccccccccccccccccccccchHHHhcCcc Q lcl|NC_021302. 419 GPDPDAD--------DDESTADTGQD-EPETDEPALPNTSGTTSTTNAPQARKRPRGRSPRDRRKTPD 477 (484) Q Consensus 419 ~p~~~e~--------~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (484) ..+-.+. ..+.-+++..+ +|++..+ .|...|.+ ....+|..-++.+.+++.- T Consensus 473 Deei~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~-~~~~~~~~------~~~~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:10 473 DVEMKEIDKQIESEMESGIIADPAAEMDPAMAAG-DPDAGGAP------AEEVAPEGPDPSDERKAEF 533 (533) T ss_pred HHHHHHHHHHHHHHHhCCCCCCCcchhhHHhcCC-CCCcCCcc------cccCCCCCCCcchhhccCC Confidence 2110000 00111111100 1111111 11111111 0112233333444433322 No 203 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=95.31 E-value=0.0023 Score=35.04 Aligned_cols=424 Identities=10% Similarity=-0.010 Sum_probs=171.9 Q ss_pred CCCCCCCccceee----eecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEe Q lcl|NC_021302. 1 MAPKTVAPRTERG----YVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIR 76 (484) Q Consensus 1 ~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~ 76 (484) +.+....-..-+. ...+.-.....++.|-........ +....-.....+ ...+..-++.+...-+.+-+..+. T Consensus 37 ~~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~-~~~~~~~~~~ki--~~n~~k~Ivd~~~~yl~g~p~~~~ 113 (501) T protein:vir:27 37 MVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLQFG-RRKDREMADKRA--VHNYGRMISKFKTGYLAGNPIRVE 113 (501) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccC-ccCcccccccee--ccchHHHHHHHHhhhhcccCeeEe Confidence 1111000000000 000000001111111000000000 000000000112 245666666677777777777775 Q ss_pred cCCCC--HHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCc Q lcl|NC_021302. 77 PNGAR--PEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQ 153 (484) Q Consensus 77 p~~~~--~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~ 153 (484) ..+.+ .+..+++.+.. ..-+|+..+..+. ++..||.+ ++++|.-.+|...+. ..+| T Consensus 114 ~~d~~~~~~~~~~l~~~~-----------------~~n~~~~~~~~~~~~~~~~G~a-~~~vy~ded~~~~i~---~~~p 172 (501) T protein:vir:27 114 YDDNDNNSQNDDTIKRIG-----------------RINDIDSHNRTLIRDLSQTGRA-YEVIYRNEYDETRIK---RLNP 172 (501) T ss_pred cCCccchHHHHHHHHHHH-----------------HhcChhHHHHHHHHHHhhCCeE-EEEEEeCCCCceEEE---EEcc Confidence 43322 12222222221 2235777777764 78899985 678887667765433 3444 Q ss_pred cceeeeeecC--CCceeee-eccccc--ccc--------cccceeccCCC-------CcccccccceEEEeecCccCccc Q lcl|NC_021302. 154 SSIAYWNVDR--DGGLISI-QQWPAG--TFG--------GPGMVVMAPNS-------MGPAIPVEQLVVYTHDMDPGVWT 213 (484) Q Consensus 154 ~~~~~~~~~~--dg~l~~~-~q~~~~--~~~--------~~~~~~~~~~~-------~~~~lp~~k~l~~~~~~~~~~p~ 213 (484) +.+. ..+++ .+.++.. +.+... ... ......+...+ ....+..--++.| .+|+. T Consensus 173 ~~~~-~v~d~~~~~~~~~~ir~~~~~~~~~~~~~~~vyt~~~v~~~~~~~~~~~~~~~~~~~g~vPvv~~-----~nn~~ 246 (501) T protein:vir:27 173 LETF-VIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNEHIYTLDASDDFNEISVTTHAFGTVPITEF-----LNNVD 246 (501) T ss_pred ceeE-EEecCCCCCceEEEEEEEEeeecCCcEEEEEEEeCCeEEEEEeCCceeeccccccCCCcccEEEe-----cCCCC Confidence 4432 11222 1222211 110000 000 00111111111 0011111112332 24678 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcC-CceEEEccCCceEEEec Q lcl|NC_021302. 214 GNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGG-ESAGLALTAGEEAGILS 292 (484) Q Consensus 214 G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g-~~a~~vip~~~~ie~~~ 292 (484) |.|.+..+....---...+..++..++.+..++.++.|..... ..+....+... ..+... ..++.....+.+++++. T Consensus 247 g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~-~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~ 324 (501) T protein:vir:27 247 GIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALP-KGMQASDMKRT-RLMQLKPPKSADGKEGTVKAEYLT 324 (501) T ss_pred CCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCC-cccchhhhhhc-CceeecccccccCCCCCcceeeee Confidence 9999998766666666777888888887766555555532222 12222222111 001000 00112234456788887 Q ss_pred ccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CC-- Q lcl|NC_021302. 293 PNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASVQA--DTFVQSVQTVADEIRDVAQAHVVEDIVDV-NW-- 367 (484) Q Consensus 293 ~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~evh~--~v~~~~~~aD~~~i~~~ln~qli~~l~~~-Nf-- 367 (484) .......++.+++.+.+.|...-....++.++.+|.-+....+. ......+..-.+.+...|. ++++-++.+ +. T Consensus 325 ~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~-~~~~li~~~~~~~~ 403 (501) T protein:vir:27 325 KSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTQGLK-RRYRLAARIGSLVN 403 (501) T ss_pred ccCCHHHHHHHHHHHHHHHHHHhCCcccCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhcc Confidence 76666679999999999998885555555544333211111111 2223333444456666664 355554443 21 Q ss_pred C-C--ccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCCCCCccccccc---CCCcCCCcc Q lcl|NC_021302. 368 G-E--DEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGPDPDADDDEST---ADTGQDEPE 439 (484) Q Consensus 368 ~-~--~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p~~~e~~~~~~---~~~~~~~~~ 439 (484) . . +.....+.|. ..+.+..+.++++.+|+ |+ ++.+.+.+.++. ..|..+-+-.... .+.... . T Consensus 404 ~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl~--g~-----iS~et~l~~l~~v~D~~~E~eri~~E~~e~~~~~~--~ 474 (501) T protein:vir:27 404 EFKDFDESLLKITFTPNLPKSLNEQVSILTGLG--GQ-----VSQETALSLSGLVESPNEELDKINKEVSEIDFKGY--S 474 (501) T ss_pred cccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-----CcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhHhhh--c Confidence 1 1 1112367775 45667888899999884 54 245566666643 3222110000000 000000 0 Q ss_pred ccCCCCccccccccccccccccccccccchHHH Q lcl|NC_021302. 440 TDEPALPNTSGTTSTTNAPQARKRPRGRSPRDR 472 (484) Q Consensus 440 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (484) .+........++.......+...... + T Consensus 475 ~~~~~~~~~~~d~~~~~~~d~~e~~~------~ 501 (501) T protein:vir:27 475 NDFNEHVGKYTDEVKETHTDDFERAY------E 501 (501) T ss_pred CccccccccccCCCCCCccccccccC------C Confidence 00000000000000000000000000 0 No 204 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=94.98 E-value=0.003 Score=34.39 Aligned_cols=415 Identities=12% Similarity=0.110 Sum_probs=182.8 Q ss_pred CCCCCCCccceee--eecccccchh-hhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEec Q lcl|NC_021302. 1 MAPKTVAPRTERG--YVNPLAGFGT-FLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRP 77 (484) Q Consensus 1 ~~~~~~~~~~~~~--~~~~~~~~~~-~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p 77 (484) .+|+.....+++- .-++.-+.|. ....+. .... ......|+.|++|+ .++.|-++++.....+.-.+-.-.| T Consensus 32 ~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~--~~~~--~n~~eLI~~YR~ma-~~pEvd~Av~eIvneaiv~d~~~~p 106 (521) T protein:vir:10 32 AVPDTADGAIEVDKQIDTTAPKTAIVQSVLGY--APKI--QNTKDLINQYRSLS-KYHEVDNAIDEIINDAIVQEDNRDT 106 (521) T ss_pred ccccCCCCceeeccCCCccccccchhhhhhcc--cccc--chHHHHHHHHHHHh-hccchhhHHHhhhcceEEecCCCce Confidence 3444443333331 1111111111 111111 1111 11346799999997 5999999999888766544311111 Q ss_pred ---CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhccee-----------eeEEEeecCCee Q lcl|NC_021302. 78 ---NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAV-----------FEQTYFYEGGRF 143 (484) Q Consensus 78 ---~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~-----------~Eivw~~~~g~~ 143 (484) +=+..+..+.+.+-+.. -|+.++ .+|+.--+||.. +.++=...+..- T Consensus 107 V~i~Ld~~~~s~~iK~kI~e------------------eF~~Il-~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~~pk~ 167 (521) T protein:vir:10 107 VYLDLDKTDWNESVKEMVRE------------------EFRTIL-KLLKFEREGKRHFRRWYVDSRIYFHKMIDPARPKD 167 (521) T ss_pred EEEEecCcccchHHHHHHHH------------------HHHHHH-HHhccchhhhHHHhhheeeeeEEEEEEeeCCCccc Confidence 00111112222222211 133333 334444444433 333323334334 Q ss_pred eeeeeeeeCccceeeeeec---CCCcee---eeeccccc-ccccccceeccCCCCcccccccceEEEeecC--ccCcccc Q lcl|NC_021302. 144 WLKRLAPRPQSSIAYWNVD---RDGGLI---SIQQWPAG-TFGGPGMVVMAPNSMGPAIPVEQLVVYTHDM--DPGVWTG 214 (484) Q Consensus 144 ~~~~l~~r~~~~~~~~~~~---~dg~l~---~~~q~~~~-~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~--~~~~p~G 214 (484) .+.+|...+|+.+.+.+.. .+++.. ....+.-- ..+....-.......++.||. .-|+|.|.. ..++++. T Consensus 168 GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~~g~~~~~vkI~~-daI~y~hSGL~d~~~~~i 246 (521) T protein:vir:10 168 GIKELRLLDPRNVEYYRVNLKSNENGNDVYKGVKEFFTYGATEDNRYNISGNSNNLVQIPI-DAIVYSHSGKVDIDGKTI 246 (521) T ss_pred cceeeeeeCCcceeeeeeecCCCCCcchhhccceeeeeeccCCCceecCCCCCCcceeech-hheeeecccceeCCCCce Confidence 4556777777777553322 122211 00000000 000000111122344556766 678999854 3457888 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHH-HHHhcCCcceEE---ecCCCCCCHHHHHHHHHHHHHHhcC----CceEEE----- Q lcl|NC_021302. 215 NSLLRPAYKNWKLKDELIRIEAAA-IRRHGIGVPYLK---GNEADSEDDDRMDELLEIASNYSGG----ESAGLA----- 281 (484) Q Consensus 215 ~gll~~~~~~~~~K~~~~~~w~~f-~Er~~~G~P~~~---gk~~~~~~~~~~~~l~~~l~~~~~g----~~a~~v----- 281 (484) .|.|.++..++==-++..-..+.| +-| -|=+. ...+.-...+.-+.|.+++..+++. .++|-| T Consensus 247 ~syLhkAiKp~NQLkm~EDAlVIYRitR----APeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk 322 (521) T protein:vir:10 247 VGYLHNVIKPANQLKMLEDAMVIYRITR----APERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYDSSTGKVKNSSN 322 (521) T ss_pred eccchhhhHhHHhhHHHHhhHHHHhhhc----cccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchh Confidence 999999998886555555555544 222 12111 1223334444445566666665442 122222 Q ss_pred ---------cc-----CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhccccccc--c--hhhHHHHHHH-HHHH Q lcl|NC_021302. 282 ---------LT-----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGG--S--YALASVQADT-FVQS 342 (484) Q Consensus 282 ---------ip-----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gG--s--~A~~evh~~v-~~~~ 342 (484) +| .|++|.++.++.+. .-.+=++|..+.+-+++..+.--.+.+++ + ++..-+..++ |... T Consensus 323 ~msMlEDyWLpRReGgrgTEI~TLpggqnl-gem~DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr~~EItRDEikF~KF 401 (521) T protein:vir:10 323 NLAMTEDYWLMRRDGKATTEVSTLPGAQSM-GEMDDVRWFNRKLYESMKIPLSRLPQEGAGVTFGAGNDITRDELQFTKY 401 (521) T ss_pred hhhhHhhhcccccCCCCccceeeccccCCc-ChHHHHHHHHHHHHHHhCCCccccCCCCCceecccccchhHHHHHHHHH Confidence 22 47889888765332 33445789999999997665533332321 1 1211223333 3344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhC------CCCccccceEEecCCC--C---cHHHHHHHHHHHHhcCc--ccCCcccHH Q lcl|NC_021302. 343 VQTVADEIRDVAQAHVVEDIVDVN------WGEDEPAPLLVFDEIG--S---RQDATAAALQMLVNAGL--LTPDPRLEA 409 (484) Q Consensus 343 ~~aD~~~i~~~ln~qli~~l~~~N------f~~~~~~P~~~~~~~~--~---~~~~~ae~~~~L~~~G~--~~~~~~~~~ 409 (484) +......++..|..-|-..|+--+ |..-...-+|.|.... . +.+.+.+++..|..+-- .+....+.+ T Consensus 402 I~rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGky~s~d 481 (521) T protein:vir:10 402 IRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGKYLSHE 481 (521) T ss_pred HHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccccccccchH Confidence 555566666666554444444333 2111122244442211 1 22334456655554422 344456888 Q ss_pred HHHHH-hCCCCCCCC--cccccc-cCCCcCCCccccCCCC Q lcl|NC_021302. 410 FLRDA-AGLPGPDPD--ADDDES-TADTGQDEPETDEPAL 445 (484) Q Consensus 410 ~i~e~-~glp~p~~~--e~~~~~-~~~~~~~~~~~~~~~~ 445 (484) |+++. +.++..+-. ++.... ...+--+.|+....+- T Consensus 482 yi~k~ILr~tDeeik~~~k~I~~E~~~~~~~~p~~e~~df 521 (521) T protein:vir:10 482 YVMKNILRMSDEDIKTEREKIDGELKDSVYKNPEDPMEEF 521 (521) T ss_pred HHHHHHhcCCHhHHHHHHHHHHHhhhCCCCCCCcchhhcC Confidence 98654 455432211 111110 0000000111110000 No 205 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=94.91 E-value=0.0032 Score=34.25 Aligned_cols=395 Identities=11% Similarity=0.026 Sum_probs=156.4 Q ss_pred CCCC----CCCccceeeeecccccch---hhhhhhcccccccccc-cccchHHHHH------------------------ Q lcl|NC_021302. 1 MAPK----TVAPRTERGYVNPLAGFG---TFLAQGLDQFEQVDEL-RWPNSVYTYT------------------------ 48 (484) Q Consensus 1 ~~~~----~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~l-r~~~~~~~y~------------------------ 48 (484) |+-. .+..--+. +...-... .-.+..+..... ..+ |..+..+.|+ T Consensus 1 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~i~~~i~~~~-~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~ 77 (468) T protein:vir:96 1 MIDIFWPNEKPYHERV--VEQIKPQYETQEEMILRLITKHK-ENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPD 77 (468) T ss_pred CccccCCcCceeehhe--eecccccccCcHHHHHHHHHHHH-HHHHHHHHHHHHhcCCCccccccccccccccccccccc Confidence 2222 21111110 00000000 000000000000 000 0000001110 Q ss_pred -HHHhcchHHHHHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHh Q lcl|NC_021302. 49 -RMCREEARIASVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQ 126 (484) Q Consensus 49 -~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~ 126 (484) .+ ..+...-++.+...-+.+-+..+... +++..+.+. ..++ -+|.+.+..+ .++.. T Consensus 78 ~ki--~~n~~~~Iv~~~~~~l~g~p~~~~~~--d~~~~~~l~-----------------~~~~-n~~~~~~~~~~~~~~~ 135 (468) T protein:vir:96 78 WRM--YTNYHQNLVDQKVAYAVANPVTYGTE--DEKSLKTIQ-----------------EVLN-HKWDDKLVDILTAASN 135 (468) T ss_pred ccc--ccchHHHHHHHHHhhhccCCceeccC--ChHHHHHHH-----------------HHHh-cCHHHHHHHHHHHHhh Confidence 01 12233333333334444444444322 222222222 2222 2466655555 57889 Q ss_pred hcceeeeEEEeecCCeeeeeeeeeeCcccee-eeeecCCCceeeee-cccccccc------cccceeccC---------- Q lcl|NC_021302. 127 FGHAVFEQTYFYEGGRFWLKRLAPRPQSSIA-YWNVDRDGGLISIQ-QWPAGTFG------GPGMVVMAP---------- 188 (484) Q Consensus 127 ~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~~-~~~~~~dg~l~~~~-q~~~~~~~------~~~~~~~~~---------- 188 (484) ||.+. +++|.-.+|.+. +...+|..+. .|.....+.++... .+...... ......+.. T Consensus 136 ~G~~~-~~v~~d~~~~~~---i~~~~p~~~~~v~~~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (468) T protein:vir:96 136 KGVEW-IQPYVDEQGEFK---TFRVPAEQAIPIWTNKERDELKAFIRLYELDGGERVEYWTANDVTFYELKDGQLIPDYY 211 (468) T ss_pred cCeEE-EEEEEcCCCceE---EEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCceeeccc Confidence 99974 567765566654 3334444432 12211223322111 00000000 000000000 Q ss_pred ------------CCCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCC Q lcl|NC_021302. 189 ------------NSMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADS 256 (484) Q Consensus 189 ------------~~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~ 256 (484) .....++..--++.|. +|+.|.|.+..+-...---...+..++..++.+.+++.++.|-. . T Consensus 212 ~~~~~~~~~~~~~~~~~~~~~iPvv~~~-----n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~--~ 284 (468) T protein:vir:96 212 QGEEHVQAHYYVGNKSMSWNRVPFIPFK-----NNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYE--G 284 (468) T ss_pred ccccccccceeeccccccCCcccEEEec-----CCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC--c Confidence 0000111111222222 36789999887665555556677888888888766544444422 2 Q ss_pred CCHHHHHHHHHHHHHHhcCCceEEEcc--CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHH Q lcl|NC_021302. 257 EDDDRMDELLEIASNYSGGESAGLALT--AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASV 334 (484) Q Consensus 257 ~~~~~~~~l~~~l~~~~~g~~a~~vip--~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~ev 334 (484) .+.+ ....++..+ ..+.++ .+.+++++........++..++.+.+.|...-.+..++.++.+| ...|.. T Consensus 285 ~~~~------~~~~~~~~~--~~i~~~~d~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~-n~Sg~A 355 (468) T protein:vir:96 285 EDLE------EFMYNLKYY--KAINVDGDGSGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQDKFGN-SPSGIA 355 (468) T ss_pred cccc------hhhhhhhcC--ceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccccccccc-chHHHH Confidence 1111 112233222 234444 34678888877667789999999999998886555555443332 222221 Q ss_pred -HH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-CccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHH Q lcl|NC_021302. 335 -QA--DTFVQSVQTVADEIRDVAQAHVVEDIVDVNWG-EDEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEA 409 (484) Q Consensus 335 -h~--~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~-~~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~ 409 (484) .. .-....+..-.+.+...+. ++++.++.+.-. .+..-..+.|. ..+.+..+.++. ++++|+. |++ T Consensus 356 lk~~~~~l~~k~~~k~~~~~~~l~-~~~~li~~~~g~~~d~~~i~i~f~~~~p~d~~e~a~~---~~~~g~i-----S~e 426 (468) T protein:vir:96 356 LKFMYSNLDLKANKLKNKTLTALQ-ELLQYIIDFYKLSIKVQDVEITFNFNVMVNELEQSQI---GVNSQYL-----SKE 426 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCcccceeeEEecCCCCcCHHHHHHH---HHhcCCC-----chH Confidence 11 1222233344555666664 466666665321 12222466675 345566655554 4456752 456 Q ss_pred HHHHHhCC-CCCCCCcccccccCCCc-CCCccccCCCCcccccccccc Q lcl|NC_021302. 410 FLRDAAGL-PGPDPDADDDESTADTG-QDEPETDEPALPNTSGTTSTT 455 (484) Q Consensus 410 ~i~e~~gl-p~p~~~e~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 455 (484) .+.+.++. ..|. ++...-..... ....+... ...+...++ T Consensus 427 t~i~~l~~v~D~~--~E~~ri~~E~~~~~~~~~~~----~~~~~~~~~ 468 (468) T protein:vir:96 427 TVVTNHPWVDDPV--AEMERIDQEELALPSIEEGL----NGKENNEPT 468 (468) T ss_pred HHHHhCCCCCCHH--HHHHHHHHHHHHHHHHhhcc----CCCCCCCCC Confidence 66677643 2221 11110000000 00000000 001111111 No 206 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=94.76 E-value=0.0036 Score=34.00 Aligned_cols=432 Identities=9% Similarity=0.008 Sum_probs=165.7 Q ss_pred eeeecccccc-hhh--hhhhccccccccccccc-------chHH-HHHHHHhcchHHHHHHHHHHHHhhCCCcEEe---- Q lcl|NC_021302. 12 RGYVNPLAGF-GTF--LAQGLDQFEQVDELRWP-------NSVY-TYTRMCREEARIASVLRAIGLPIRRTDWRIR---- 76 (484) Q Consensus 12 ~~~~~~~~~~-~~~--~~~~~~~~~~~~~lr~~-------~~~~-~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~---- 76 (484) ..-|+....+ ++. .-..+... .+....+. ...+ +.+-|.....+-..-+.+.+.-..+....+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~ 79 (511) T protein:vir:93 1 MLKVNEFETDTDLRGNINYLFNDE-ANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR 79 (511) T ss_pred Cccccchhhhhhhhhhhhhhhhhh-hCCcccccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCc Confidence 1111111100 000 00000000 00000000 0000 1111100001111122222322222221110 Q ss_pred -cC----------CCCHHHHHHHHHHHH-hh----hccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeec Q lcl|NC_021302. 77 -PN----------GARPEVVEHVAACLG-LP----VEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYE 139 (484) Q Consensus 77 -p~----------~~~~e~~~~~~~~l~-~~----~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~ 139 (484) +. +=...++++....|. .+ ...++.+....+....-+|+.....+. ++..||. +.+++|.-. T Consensus 80 ~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~-ay~~vy~de 158 (511) T protein:vir:93 80 RKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDVLEVIEAFNDLNDVESHNRSLGLDLSIYGK-AYELMIRNQ 158 (511) T ss_pred CcccccCcceeecchHHHHHHHHhhhhcccCeeeccCChHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCe-eEEEEEeCC Confidence 00 001123333333221 11 122222233444444556877777775 7889996 567788766 Q ss_pred CCeeeeeeeeeeCccceeeeeecC--CCceeeeec-ccc----cc----------cccccceeccCC------------- Q lcl|NC_021302. 140 GGRFWLKRLAPRPQSSIAYWNVDR--DGGLISIQQ-WPA----GT----------FGGPGMVVMAPN------------- 189 (484) Q Consensus 140 ~g~~~~~~l~~r~~~~~~~~~~~~--dg~l~~~~q-~~~----~~----------~~~~~~~~~~~~------------- 189 (484) +|... +...+|+.+. ..+++ .+.++.... +.. +. ........+... T Consensus 159 ~~~~~---i~~~~p~~~~-~vydd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~ 234 (511) T protein:vir:93 159 DDETR---LYKSDAMSTF-VIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENG 234 (511) T ss_pred CCceE---EEEEccceeE-EEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccc Confidence 67654 3344555432 22222 233221111 000 00 000001111000 Q ss_pred CCcccccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHH- Q lcl|NC_021302. 190 SMGPAIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEI- 268 (484) Q Consensus 190 ~~~~~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~- 268 (484) ....++..--++.|+ .|+.|.|.+..+-...-.-...+..++..++.|..++.++.|... .+.++.....+. T Consensus 235 ~~~~~~g~vPvv~~~-----nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~--~~~~~~~~~~~~~ 307 (511) T protein:vir:93 235 FESHSFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN--LDPVEVRKQKEAN 307 (511) T ss_pred ccccCCCccceEEec-----CCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcc--cCchhhccccccc Confidence 001111111133333 356788999887666555567788888888887666666665332 222222111110 Q ss_pred HHHHhcC---CceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHHH----HHH Q lcl|NC_021302. 269 ASNYSGG---ESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASVQADT----FVQ 341 (484) Q Consensus 269 l~~~~~g---~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~evh~~v----~~~ 341 (484) +..+..+ ...+.-...+.++++++.......++.+++++.+.|.+.--...++.++.+|.- +.+.... ... T Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~--Sg~Al~~~~~~l~~ 385 (511) T protein:vir:93 308 VLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQ--SGEAMKYKLFGLEQ 385 (511) T ss_pred ceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccc--hHHHHHHHHHHHHH Confidence 0000000 001112345778888887766778999999999999887666666655433321 1122222 222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh-C--CCCc----cccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHH Q lcl|NC_021302. 342 SVQTVADEIRDVAQAHVVEDIVDV-N--WGED----EPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRD 413 (484) Q Consensus 342 ~~~aD~~~i~~~ln~qli~~l~~~-N--f~~~----~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e 413 (484) .+..-.+.+...|. ++++-++.+ + .... -.-.++.|. ..+.+..+.++++.+|+ |+ + +.+.+.+ T Consensus 386 k~~~k~~~f~~~l~-~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~--g~-i----S~et~~~ 457 (511) T protein:vir:93 386 RTKTKEGLFTKGLR-RRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GK-I----SQTTLMS 457 (511) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHhccCcccccccccceEEeCCCCCCCHHHHHHHHHHHh--cc-C----chHHHHH Confidence 33333455555564 355554443 2 2111 112367775 35667888889888884 64 2 4566777 Q ss_pred HhCC-CCCCCCcccccccCCCcCCCccccCCCCccccccccccccccccccccccch Q lcl|NC_021302. 414 AAGL-PGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSP 469 (484) Q Consensus 414 ~~gl-p~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (484) .++. +.+..+-+....................+..... ...+........... T Consensus 458 ~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 458 LFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDIND---DEQDDDTKDTVDKKE 511 (511) T ss_pred hCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCCCCCCC---CCCCCcccccccccC Confidence 7754 2221110000000000000000000000000000 000000000000000 No 207 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=94.48 E-value=0.0043 Score=33.56 Aligned_cols=432 Identities=9% Similarity=0.001 Sum_probs=163.5 Q ss_pred eeeecccccc-hhh--hhhhcccccccccccc-------cchHH-HHHHHHhcchHHHHHHHHHHHHhhCCCcEEec-CC Q lcl|NC_021302. 12 RGYVNPLAGF-GTF--LAQGLDQFEQVDELRW-------PNSVY-TYTRMCREEARIASVLRAIGLPIRRTDWRIRP-NG 79 (484) Q Consensus 12 ~~~~~~~~~~-~~~--~~~~~~~~~~~~~lr~-------~~~~~-~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p-~~ 79 (484) ..-|+....+ ++. .-..+... .+....+ ...++ +.+-|...-.+-..-+++.+.-..+....+.. .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-~n~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~ 79 (511) T protein:vir:10 1 MLKVNEFETDTDLRGNINYLFNDE-ANVVYTYDGTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTR 79 (511) T ss_pred Cccccchhhhhhhhhhhhhhhhhh-hcCCccCchhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCc Confidence 1111111100 000 00000000 0000000 00000 11111000001111222222222222111110 00 Q ss_pred ------C--------CHHHHHHHHHHH-Hhh----hccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeec Q lcl|NC_021302. 80 ------A--------RPEVVEHVAACL-GLP----VEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYE 139 (484) Q Consensus 80 ------~--------~~e~~~~~~~~l-~~~----~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~ 139 (484) . ...++++....+ ..+ ...++.+....+....-+|+.....+. ++..||. +.+++|.-. T Consensus 80 ~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~-ay~~vy~de 158 (511) T protein:vir:10 80 RKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGK-AYEIMIRNQ 158 (511) T ss_pred ccccccCcceeecchHHHHHHHHhhhhcccCceeecCchHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCe-eEEEEEeCC Confidence 0 012222222221 111 122223334444444556877777664 7899997 568888766 Q ss_pred CCeeeeeeeeeeCccceeeeeecCC--Cceeee-eccc----cc----------ccccccceeccCCC-C---------- Q lcl|NC_021302. 140 GGRFWLKRLAPRPQSSIAYWNVDRD--GGLISI-QQWP----AG----------TFGGPGMVVMAPNS-M---------- 191 (484) Q Consensus 140 ~g~~~~~~l~~r~~~~~~~~~~~~d--g~l~~~-~q~~----~~----------~~~~~~~~~~~~~~-~---------- 191 (484) +|.+.+ ...+|+.+. ..+++. +.++.. +.+. .+ .........+.... . T Consensus 159 dg~~~i---~~~~p~~~~-~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~ 234 (511) T protein:vir:10 159 DDETRL---YKSDAMSTF-VIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENG 234 (511) T ss_pred CCceEE---EEEccceeE-EEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCCcccccccccc Confidence 776543 344444432 222221 222211 1100 00 00000111110000 0 Q ss_pred ccccccc--ceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHH- Q lcl|NC_021302. 192 GPAIPVE--QLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEI- 268 (484) Q Consensus 192 ~~~lp~~--k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~- 268 (484) ..+-|.. -++.|. .|..|.|.+..+-...-.-...+..++..++.+..++.++.|... .+.++.....+. T Consensus 235 ~~~~~~~~vPvv~f~-----nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~--~~~~~~~~~~~~~ 307 (511) T protein:vir:10 235 FESHSFERMPITEFS-----NNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLN--LDPVEVRKQKEAN 307 (511) T ss_pred cccccCcceeEEEec-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecccc--CCchhhccchhcc Confidence 0111111 123322 356788999887766655667778888888887666666666332 222222211110 Q ss_pred HHHHhcC---CceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHH----HHHH Q lcl|NC_021302. 269 ASNYSGG---ESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASVQAD----TFVQ 341 (484) Q Consensus 269 l~~~~~g---~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~evh~~----v~~~ 341 (484) +-.+... ...+.....+.+++++........++.+++.+.+.|...--...++.++.+|.- +.+... .... T Consensus 308 ~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~--Sg~Al~~~~~~l~~ 385 (511) T protein:vir:10 308 VLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQ--SGEAMKYKLFGLEQ 385 (511) T ss_pred ceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccc--hHHHHHHHHHHHHH Confidence 0001000 001112334678888887666678999999999999887555555554322211 112222 2222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh---CCCC----ccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHH Q lcl|NC_021302. 342 SVQTVADEIRDVAQAHVVEDIVDV---NWGE----DEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRD 413 (484) Q Consensus 342 ~~~aD~~~i~~~ln~qli~~l~~~---Nf~~----~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e 413 (484) .+..-.+.+...|.+ +++-++.+ .-+. +-.-.++.|. ..+.+..+.++++.+|+ |+ ++.+.+.+ T Consensus 386 k~~~k~~~f~~~l~~-~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~--G~-----iS~et~~~ 457 (511) T protein:vir:10 386 RTKTKEGLFTKGLRR-RAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GK-----ISQTTLMS 457 (511) T ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHhhCCcccccccceeeEEeCCCCCcCHHHHHHHHHHHh--cc-----CcHHHHHH Confidence 233334455555543 44444443 1111 1112366775 45677888899999885 64 24566777 Q ss_pred HhCC-CCCCCCcccccccCCCcCCCccccCCCCccccccccccccccccccccccch Q lcl|NC_021302. 414 AAGL-PGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSP 469 (484) Q Consensus 414 ~~gl-p~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (484) .++. +.|..+-+....................+........ ..+.... ..... T Consensus 458 ~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~-~~~~~ 511 (511) T protein:vir:10 458 LFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQ--DDDTKDT-VDKKE 511 (511) T ss_pred hCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCCC--CCcccCc-ccccC Confidence 7754 2221110000000000000000000000000000000 0000000 00000 No 208 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=94.12 E-value=0.0053 Score=33.04 Aligned_cols=398 Identities=11% Similarity=0.043 Sum_probs=169.5 Q ss_pred CCCCCCCccceee-eecc----ccc--chhhhhhhcccccccccccccc----hHHHH-----HHHHhcchHHHHHHHHH Q lcl|NC_021302. 1 MAPKTVAPRTERG-YVNP----LAG--FGTFLAQGLDQFEQVDELRWPN----SVYTY-----TRMCREEARIASVLRAI 64 (484) Q Consensus 1 ~~~~~~~~~~~~~-~~~~----~~~--~~~~~~~~~~~~~~~~~lr~~~----~~~~y-----~~m~~~D~~v~s~l~~r 64 (484) +.++..-....+- .++. ... ....+..|- + +.+..+. ....+ ..+ ..+...-++.+. T Consensus 20 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~-~----~i~~~~~~~~~~~~~~~~~~~~ki--~~n~~~~Ivd~~ 92 (474) T protein:vir:96 20 IKPKYETQEEMIIRLINDHKPKIDDITVGERYYNHD-P----DVLRLAPKLDNKGEIDPLKPDWRM--FTNYHQNLVDQK 92 (474) T ss_pred hhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhccC-C----cchhccchhcccccccccccchhc--ccchHHHHHHhh Confidence 3343322111110 0000 000 001111110 0 0000000 00000 011 123444555566 Q ss_pred HHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCee Q lcl|NC_021302. 65 GLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRF 143 (484) Q Consensus 65 ~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~ 143 (484) ...+.+-+..+.+. +++..+.+.+.+ . -+|.+.+..+ .++..||.+ .+++|.-.+|.+ T Consensus 93 ~~~l~g~p~~~~~~--d~~~~~~l~~~~-----------------~-n~~~~~~~~~~~~~~~~G~~-~~~~y~d~~~~~ 151 (474) T protein:vir:96 93 VAYAVANPVTFSSD--DDKSLKTIQEVL-----------------N-HKWDDKLVDILTAASNKGIE-WLQPYIDENGEF 151 (474) T ss_pred hhhhcccCceeecC--chHHHHHHHHHH-----------------h-cCHHHHHHHHHHHHHhcCee-EEEEEecCCCce Confidence 66677777777643 233333333322 1 1355444444 578889996 567777667765 Q ss_pred eeeeeeeeCccceeeeeecC--CCceeeee-cccccccccc------cceeccCC--------------------CCccc Q lcl|NC_021302. 144 WLKRLAPRPQSSIAYWNVDR--DGGLISIQ-QWPAGTFGGP------GMVVMAPN--------------------SMGPA 194 (484) Q Consensus 144 ~~~~l~~r~~~~~~~~~~~~--dg~l~~~~-q~~~~~~~~~------~~~~~~~~--------------------~~~~~ 194 (484) . +...+|+.+. ..+++ .+.++... .+........ ....+... ....+ T Consensus 152 ~---i~~~~p~~~~-~v~d~~~~~~~~~~vr~~~~~~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (474) T protein:vir:96 152 K---TFRVPAEQAI-PIWTNKERDTLKAFIRYYRLDGAERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKR 227 (474) T ss_pred E---EEEEcccceE-EEEcCCCCCceEEEEEEEeecCceEEEEEeCCeEEEEEecCCceeeccccccccccccccccccc Confidence 4 4444555432 22222 23332111 1111100000 00000000 00111 Q ss_pred cccc--ceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHH Q lcl|NC_021302. 195 IPVE--QLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNY 272 (484) Q Consensus 195 lp~~--k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~ 272 (484) -+.. -++.|+ +|+.|.|.+..+-...=--...+..++..++.+..++.++.|-.+ .+.+ +...++ T Consensus 228 ~~~g~iPvv~~~-----nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~--~~~~------~~~~~~ 294 (474) T protein:vir:96 228 VSWGRVPFIPFK-----NNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEG--QDLD------EFMRNL 294 (474) T ss_pred cCCCceeEEEec-----cCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCc--cccc------chhhhh Confidence 1111 122222 367889999886655555566788899999988665555554322 1111 123344 Q ss_pred hcCCceEEEcc-CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHHH----HHHHHHHHH Q lcl|NC_021302. 273 SGGESAGLALT-AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASVQADT----FVQSVQTVA 347 (484) Q Consensus 273 ~~g~~a~~vip-~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~evh~~v----~~~~~~aD~ 347 (484) ..+ ..+.++ +|.+++++........++..++.+.+.|...--+..++.++.|| ...|. ..+. .......-. T Consensus 295 ~~~--~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~Sg~-Al~~~~~~l~~k~~~k~ 370 (474) T protein:vir:96 295 KYY--KAINVDGDGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKFGN-SPSGI-ALKFMYSNLDLKANKLK 370 (474) T ss_pred hcC--ceEEecCCCCceeEEeecCChHHHHHHHHHHHHHHHHHhCCcccccccccc-ccHHH-HHHHHHHHHHHHHHHHH Confidence 322 344455 57889999877667788999999999999876666665544332 22222 2222 222333444 Q ss_pred HHHHHHHHHHHHHHHHHhCCCC-ccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCCCCCc Q lcl|NC_021302. 348 DEIRDVAQAHVVEDIVDVNWGE-DEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGPDPDA 424 (484) Q Consensus 348 ~~i~~~ln~qli~~l~~~Nf~~-~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p~~~e 424 (484) +.+...|. ++++.++.+.... ...-..+.|. ..+.++.+.++. ++.+|+ +|.+.+.+.++. +.++.+- T Consensus 371 ~~~~~~l~-~~~~~i~~~~~~~~~~~~i~i~f~~~~p~~~~e~~~~---~~~ag~-----iS~et~~~~~~~v~d~~~E~ 441 (474) T protein:vir:96 371 NKTLTALQ-ELLQYIIDFYKLNIKVQDVEITFNFNVMVNELEQSQI---GVQSQY-----LSKETVVTNHPWVDDPVAEL 441 (474) T ss_pred HHHHHHHH-HHHHHHHHHhCCCcccceeeEEeccCCCcCHHHHHHH---HHhcCC-----CchHHHHHhCCCCCCHHHHH Confidence 56666664 4667666654221 1112356664 345566555554 556675 255667777653 2221110 Q ss_pred ccc-cccCCCcCCCccccCCCCcccccccccccccccc Q lcl|NC_021302. 425 DDD-ESTADTGQDEPETDEPALPNTSGTTSTTNAPQAR 461 (484) Q Consensus 425 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (484) +.. .......+..+.. .............+.- T Consensus 442 ~ri~~E~~e~~~~~~~~-----~~~~~~~~~d~~~e~~ 474 (474) T protein:vir:96 442 ERIEQDNIDFNKQLPPL-----EGDANGRAQDNESETN 474 (474) T ss_pred HHHHHHHHHHHhccccc-----ccccccccCCCcccCC Confidence 000 0000000000000 0000000000000110 No 209 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=93.91 E-value=0.006 Score=32.77 Aligned_cols=412 Identities=10% Similarity=-0.018 Sum_probs=169.3 Q ss_pred CCCC----------CCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhC Q lcl|NC_021302. 1 MAPK----------TVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRR 70 (484) Q Consensus 1 ~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~ 70 (484) ..+. +.....+ ++... ..++.|-........-+...... -..+ ....-.-++.....-+.+ T Consensus 38 ~~~~~~~i~~~i~~h~~~~~~--rl~~l----~~yY~g~~~~i~~~~~~~~~~~~-~~ki--~~n~~k~Ivd~~~~yl~g 108 (502) T protein:vir:48 38 MVNNWELLKNFINHHKLRQAP--RIQEL----LDYARGENHDVLKSGRRKDNEMA-DKRA--VHNYGRMISKFKTGYLAG 108 (502) T ss_pred ccccHHHHHHHHHHHHHHHHH--HHHHH----HHHhcCCCccccccccccccccc-ccee--ecchHHHHHHHHhhhhcc Confidence 1111 1100000 00000 11111100000000000000000 0011 234555666666777788 Q ss_pred CCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCeeeeeeee Q lcl|NC_021302. 71 TDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRFWLKRLA 149 (484) Q Consensus 71 ~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~~~~~l~ 149 (484) -+..+...++. ..+.+.+.|... ...-+|+..+..+ .++..||.+ ++++|.-.+|.+.+. T Consensus 109 ~p~~~~~~d~~--~~~~~~~~l~~~-------------~~~N~~~~~~~~~~~~~~~~G~a-~~~v~~dedg~~~i~--- 169 (502) T protein:vir:48 109 NPIRVEYDDNE--DNSQNDDAIKRI-------------GRINDIDTHNRNLIRDLSQTGRA-YEVIYRSEYDETRIK--- 169 (502) T ss_pred cCeeEecCCcc--chhHHHHHHHHH-------------HhhcCHhHHHHHHHHHHhhcCeE-EEEEEeCCCCceEEE--- Confidence 88888764432 112222222211 1222577777766 478899975 578887666765433 Q ss_pred eeCccceeeeeecC--CCceee-eecccc--cccc--------cccceeccCCCCcc-------cccccceEEEeecCcc Q lcl|NC_021302. 150 PRPQSSIAYWNVDR--DGGLIS-IQQWPA--GTFG--------GPGMVVMAPNSMGP-------AIPVEQLVVYTHDMDP 209 (484) Q Consensus 150 ~r~~~~~~~~~~~~--dg~l~~-~~q~~~--~~~~--------~~~~~~~~~~~~~~-------~lp~~k~l~~~~~~~~ 209 (484) ..+|+... ..+++ .+.++. ++-+.. .... ....+.+...+... .+..--++.| . T Consensus 170 ~~~p~~~~-~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv~~-----~ 243 (502) T protein:vir:48 170 RLSPLETF-VIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNQHIYTLDASDSFNEISVTPHAFGTVPITEF-----L 243 (502) T ss_pred EEcccceE-EEEcCCCCCceEEEEEEEEEeecCCcEEEEEEEeCCeEEEEEeCCceeeccceecCCCccceEEe-----c Confidence 34444431 12221 222221 110000 0000 00111111111101 1111112222 2 Q ss_pred CccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcC-CceEEEccCCceE Q lcl|NC_021302. 210 GVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGG-ESAGLALTAGEEA 288 (484) Q Consensus 210 ~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g-~~a~~vip~~~~i 288 (484) +|+.|.|.+..+....=.-...+..++..++.|..++.++.|...... ++....+.+. ..+... ..+.-..+.+.++ T Consensus 244 nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~-~~~~~~~~~~-~~~~~~~~~~~~~~~~~~d~ 321 (502) T protein:vir:48 244 NNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQ-GMQASDMKRT-RLMQLKPPKSADGKEGTVKA 321 (502) T ss_pred CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccc-ccchhhhhhc-ceeeccccccccccccCcce Confidence 477899999887666556667788888888877555445544322211 1111111110 000000 0011112356788 Q ss_pred EEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH-HHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021302. 289 GILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALAS-VQA--DTFVQSVQTVADEIRDVAQAHVVEDIVDV 365 (484) Q Consensus 289 e~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~e-vh~--~v~~~~~~aD~~~i~~~ln~qli~~l~~~ 365 (484) +++........++.+++.+.+.|.+.--...++.++.+|.- .|. ... ......+..-.+.+...+.+ +++-++.+ T Consensus 322 ~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~-Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~-~~~li~~~ 399 (502) T protein:vir:48 322 EYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNA-SGEALKYKLFGLDQDRVDTQSQFTQGLKR-RYRLAARI 399 (502) T ss_pred eEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccccCc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Confidence 88876666667888999999999877544445544322221 221 121 12223333344555666643 55554443 Q ss_pred -CC---CC--ccccceEEecC-CCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCCCCCccccc-----cc-- Q lcl|NC_021302. 366 -NW---GE--DEPAPLLVFDE-IGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGPDPDADDDE-----ST-- 430 (484) Q Consensus 366 -Nf---~~--~~~~P~~~~~~-~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p~~~e~~~~-----~~-- 430 (484) +. +. +.....+.|.. ...+..+.++++.+|+ |+ + +.+.+.+.++. ..+. ++... .. T Consensus 400 ~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~--g~-i----S~et~l~~l~~v~D~~--~E~~ri~~E~~~~~ 470 (502) T protein:vir:48 400 GSLVNEFKDFDESRLKITFTPNLPKSLYEQVSILNDLG--GQ-V----SQETALSLSGLVENPT--EELDKINEESSKID 470 (502) T ss_pred HhhcccccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-C----cHHHHHHhCCCCCCHH--HHHHHHHHHHHhhh Confidence 11 11 11224677753 4567888899999884 54 2 45667777764 2221 11100 00 Q ss_pred CCCcCCCccccCCCCccc---ccccccccccc Q lcl|NC_021302. 431 ADTGQDEPETDEPALPNT---SGTTSTTNAPQ 459 (484) Q Consensus 431 ~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~ 459 (484) ............+..... ..........+ T Consensus 471 ~~~~~~~~~~~~~~~~d~~~e~~~~~~~~~~~ 502 (502) T protein:vir:48 471 FKGYPSYFYDNVGKYTDEVKETHTDDFERVYE 502 (502) T ss_pred hhcccccccccccccCCCccCCCCcCcCCCCC Confidence 000000000000000000 00000001111 No 210 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=93.16 E-value=0.0086 Score=31.92 Aligned_cols=434 Identities=12% Similarity=0.036 Sum_probs=167.0 Q ss_pred CCCCCCCccceeeeecccccchhhhhhh-cccccccccccccchHH-HHHHHHhcchHHHHHHHHHHHHhhCCCcEEecC Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQG-LDQFEQVDELRWPNSVY-TYTRMCREEARIASVLRAIGLPIRRTDWRIRPN 78 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~lr~~~~~~-~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~ 78 (484) |-.+.=+-.+-...+...+..-...... ....+. +- ....+ +..-+..-..+...-+.+.+.--.+..+.|... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~-~~~~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~ 76 (501) T protein:vir:96 1 MEQTLFTDSTGQERVLNLRFHRESRIRYRADNLEE---LM-VNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKS 76 (501) T ss_pred CceeeeeecccceeccccccchhHHhhhccccccc---cc-CChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCc Confidence 2111111111111110000000000000 001111 00 00011 111111111222223333333333333333110 Q ss_pred C-C--------------CHHHHHHHHHHHH-hh--hcc--ch----hhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeee Q lcl|NC_021302. 79 G-A--------------RPEVVEHVAACLG-LP--VEG--DE----SDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFE 133 (484) Q Consensus 79 ~-~--------------~~e~~~~~~~~l~-~~--~~~--~~----~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~E 133 (484) . . ...++++.+..+. .. +.. .. .+....+....-+|+..+..+. ++..||.+ ++ T Consensus 77 ~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a-~~ 155 (501) T protein:vir:96 77 GRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNDDNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRA-YE 155 (501) T ss_pred cccCccccccceeecchHHHHHHHHhhhhcccCeeEeeCCccchhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeE-EE Confidence 0 0 0112332222221 11 000 11 1112233333446777766664 78999975 57 Q ss_pred EEEeecCCeeeeeeeeeeCccceeeeeecC--CCceeee-ecccc--cccc--------cccceeccCCC-------Ccc Q lcl|NC_021302. 134 QTYFYEGGRFWLKRLAPRPQSSIAYWNVDR--DGGLISI-QQWPA--GTFG--------GPGMVVMAPNS-------MGP 193 (484) Q Consensus 134 ivw~~~~g~~~~~~l~~r~~~~~~~~~~~~--dg~l~~~-~q~~~--~~~~--------~~~~~~~~~~~-------~~~ 193 (484) ++|.-.+|.+. +..++|+.+. ..+++ .++++.. +-+.. .... ......+...+ ... T Consensus 156 ~v~~dedg~~~---i~~~~p~~~~-~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~ 231 (501) T protein:vir:96 156 VIYRSEYDETR---IKRLSPLETF-VIYDNSLEDNSIAAVRYYNRGTLQSAKDVVEIYTDEHIYTLDASDDFNEISVTTH 231 (501) T ss_pred EEEEcCCCceE---EEEEccceeE-EEEcCCCCCceEEEEEEEEeecCCCcEEEEEEEcCCcEEEEeeCCCceecccccc Confidence 77766667654 4444555542 22222 2333211 11110 0000 00111111111 111 Q ss_pred cccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHH-HHH Q lcl|NC_021302. 194 AIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIA-SNY 272 (484) Q Consensus 194 ~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l-~~~ 272 (484) .+..--++.| .+|+.|.|.+..+-...-.-...+..++..++.+..++.++.|-...... +....+...- -.+ T Consensus 232 ~~g~vPvv~~-----~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~-~~~~~~~~~~~~~~ 305 (501) T protein:vir:96 232 AFGTVPITEY-----LNNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKG-MQASDMKRTRLMQL 305 (501) T ss_pred CCCccceEEe-----cCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcc-cchhhhhhcCeeee Confidence 1111112222 24788999999876555556677888888888887766666664322221 1122111110 000 Q ss_pred hcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH-HH--HHHHHHHHHHHHHH Q lcl|NC_021302. 273 SGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALAS-VQ--ADTFVQSVQTVADE 349 (484) Q Consensus 273 ~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~e-vh--~~v~~~~~~aD~~~ 349 (484) . ...+.-....+.+++++........++.+++.+.+.|...--...++.++.+|.- .|. .. ..-....+..-.+. T Consensus 306 ~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~-Sg~Al~~~~~~l~~ka~~~~~~ 383 (501) T protein:vir:96 306 K-PPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNT-SGEALKYKLFGLDQDRVDTQSQ 383 (501) T ss_pred c-ccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccc-hHHHHHHHHHHHHHHHHHHHHH Confidence 0 0001111234567888877766678899999999988887554444444322221 121 11 12222333344455 Q ss_pred HHHHHHHHHHHHHHHh-C---CCCc--cccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCC-CCCC Q lcl|NC_021302. 350 IRDVAQAHVVEDIVDV-N---WGED--EPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGL-PGPD 421 (484) Q Consensus 350 i~~~ln~qli~~l~~~-N---f~~~--~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~gl-p~p~ 421 (484) +...|. ++++.++.+ + .+.. ..-.++.|. ..+.+..+.++++.+|+ |+ + +.+.+.+.++. ..|. T Consensus 384 ~~~~l~-~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl~--g~-i----S~et~~~~l~~v~D~~ 455 (501) T protein:vir:96 384 FTKGLK-RRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLG--GQ-V----SQETALSLSGLVESPN 455 (501) T ss_pred HHHHHH-HHHHHHHHHHHhcccccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-C----chHHHHHhCCCCCCHH Confidence 666663 355554443 1 1111 112367775 45677888999999985 54 2 45556666643 2221 Q ss_pred CCccccccc---CC--C--cCCCccccCCCCc-ccccccccccccc Q lcl|NC_021302. 422 PDADDDEST---AD--T--GQDEPETDEPALP-NTSGTTSTTNAPQ 459 (484) Q Consensus 422 ~~e~~~~~~---~~--~--~~~~~~~~~~~~~-~~~~~~~~~~~~~ 459 (484) .+-+-.... .. . ..-.+........ +.......-...+ T Consensus 456 ~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~e~~~d~~e~~~~ 501 (501) T protein:vir:96 456 EELDKINKEMSEIDFKGYSNDFNEHVGKYTDEVKETHTDDFEREYE 501 (501) T ss_pred HHHHHHHHHHHHhhccccccchhhcccccCCcCCCCCCCccccccC Confidence 110000000 00 0 0000000000000 0000000001111 No 211 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=92.71 E-value=0.01 Score=31.48 Aligned_cols=409 Identities=12% Similarity=0.089 Sum_probs=178.5 Q ss_pred CCCCCCCccceeee--ecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEec- Q lcl|NC_021302. 1 MAPKTVAPRTERGY--VNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRP- 77 (484) Q Consensus 1 ~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p- 77 (484) .+|+.....+++.. .++. ..|......+ ... ...+....|+.|++|+ .++.|-++++.....+.-.+-.-.| T Consensus 24 ~~p~~~DGa~~i~~~~~~~~-~~g~~~~~~~--~~~-~~~~~~eLI~~YR~ma-~~pEvd~Av~eIvne~iv~d~~~~pV 98 (511) T protein:vir:56 24 SAPDNVDGAKEIHTNLLAPQ-LGHAIIPSDA--QSE-GTIPVKELIKSYRALA-EYHEVDDAIQEIVDEAIVYENDKEVV 98 (511) T ss_pred cCCCCCCCceEEecccccce-ecceeccccc--ccc-CccchHHHHHHHHHHh-hccchhhHHHHhhcceeEecCCCceE Confidence 33444444333321 1111 1111111111 111 1122347899999998 5999999999888766543221111 Q ss_pred --CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcce-----------eeeEEEeecCCeee Q lcl|NC_021302. 78 --NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHA-----------VFEQTYFYEGGRFW 144 (484) Q Consensus 78 --~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s-----------~~Eivw~~~~g~~~ 144 (484) +=+.-+..+.+.+-+.. -|+.++ .+|+.--+||. .+.++=...+ . T Consensus 99 ~l~ld~~~~s~~iK~kI~e------------------eF~~Il-~ll~F~~~~~~~fR~WYVDgRi~fHkiid~k~---G 156 (511) T protein:vir:56 99 WLNLDNTDFSENIKAKINE------------------EFDRVV-SLLQMRKHGYKWFRKWYVDSRIYFHKILDKDN---N 156 (511) T ss_pred EEEecccCcchHHHHHHHH------------------HHHHHH-HHhccchhhhHHHhhhhhcceEEEEEEecccc---c Confidence 00111122222222211 133222 33333333333 3333323333 3 Q ss_pred eeeeeeeCccceeeee---ecC-CCce------eeeecccccccccccceeccCCCCcccccccceEEEeecCc----cC Q lcl|NC_021302. 145 LKRLAPRPQSSIAYWN---VDR-DGGL------ISIQQWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMD----PG 210 (484) Q Consensus 145 ~~~l~~r~~~~~~~~~---~~~-dg~l------~~~~q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~----~~ 210 (484) +.+|...+|+.+.+.+ ++. +|.- -...-.+.+...............++.||.. -|+|.|..- .+ T Consensus 157 I~eLr~lDPr~i~~vr~i~~~~~~~~~v~~~~~ey~~Y~~~~~~~~~~~~~~~~~~~~vkI~~d-aI~y~hSGL~d~~~~ 235 (511) T protein:vir:56 157 IIELRPLNPMKMELVREIQKETIDGVEVVKGTLEYYVYKQSDYKMPSWMSATNRAQTSFRIPKD-AIVFAHSGLMRGCAD 235 (511) T ss_pred eeehhhcCcccchhhhhhhcccccccccccceeeeeEecCCCcccCcccccccccccceeechh-heeeecccceeccCC Confidence 4555556666554422 111 1110 0111111111111111111112344556554 577777652 57 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHH-HHHhcCCcceEE---ecCCCCCCHHHHHHHHHHHHHHhcC----CceEEE- Q lcl|NC_021302. 211 VWTGNSLLRPAYKNWKLKDELIRIEAAA-IRRHGIGVPYLK---GNEADSEDDDRMDELLEIASNYSGG----ESAGLA- 281 (484) Q Consensus 211 ~p~G~gll~~~~~~~~~K~~~~~~w~~f-~Er~~~G~P~~~---gk~~~~~~~~~~~~l~~~l~~~~~g----~~a~~v- 281 (484) +++..|.|.++..++==-++..-..+.| +-| -|=+. ...+.-...+.-+.|.+++..+++. .++|-| T Consensus 236 ~g~i~syLhkAiKp~NQLkm~EDAlVIYRitR----APeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TGev~ 311 (511) T protein:vir:56 236 DPYIIGYLDRAIKPANQLKMLEDALVIYRLAR----APERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQVK 311 (511) T ss_pred CCeeeccchhhhHHHHhhHHHHhhHHHHhhhc----cccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceec Confidence 7889999999998886555555555544 222 12111 1223334444445566666666442 122222 Q ss_pred -------------cc-----CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhh--hcccccccc----hhhHHHHHH Q lcl|NC_021302. 282 -------------LT-----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHF--LNLDGKGGS----YALASVQAD 337 (484) Q Consensus 282 -------------ip-----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqt--lt~~~~gGs----~A~~evh~~ 337 (484) +| .|++|.++.++.+. .-.+=++|..+.+-+++..+. |.+++++++ ++..-+..+ T Consensus 312 ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnl-gem~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDE 390 (511) T protein:vir:56 312 NTTNAMSMLEDYYLPRREGSKGTEVSTLPGGQSL-GDIEDVLYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDE 390 (511) T ss_pred cchhhhhhHhhhcccccCCCCccceeeccccCCc-ChHHHHHHHHHHHHHHhCCCcccccCCCCccccccccchhhhHHH Confidence 22 47889888765332 334458899999999977654 333433222 332333344 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCccccceEEecCCC--C---cHHHHHHHHHHHHhcCcccCCc Q lcl|NC_021302. 338 T-FVQSVQTVADEIRDVAQAHVVEDIVDVN------WGEDEPAPLLVFDEIG--S---RQDATAAALQMLVNAGLLTPDP 405 (484) Q Consensus 338 v-~~~~~~aD~~~i~~~ln~qli~~l~~~N------f~~~~~~P~~~~~~~~--~---~~~~~ae~~~~L~~~G~~~~~~ 405 (484) + |...+......++..|..-|-..|+--+ |..-...-+|.|.... . +.+.+.+++..|..+-=.+... T Consensus 391 iKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky 470 (511) T protein:vir:56 391 LKFTKFVKRLQTKFETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKY 470 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccc Confidence 3 3344555566666666554444444333 2111122244442211 1 2233456666665543334445 Q ss_pred ccHHHHHHH-hCCCCCCCCccc--cc-ccCCCcCCCccccC Q lcl|NC_021302. 406 RLEAFLRDA-AGLPGPDPDADD--DE-STADTGQDEPETDE 442 (484) Q Consensus 406 ~~~~~i~e~-~glp~p~~~e~~--~~-~~~~~~~~~~~~~~ 442 (484) .+.+|+++. +.+...+-.+.. .. ....+--..++.+. T Consensus 471 ~S~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~e~~f 511 (511) T protein:vir:56 471 YSHKYIQKNILRLSDDQITAMQSEIDEEETNPRFQQDDQGF 511 (511) T ss_pred cchHHHHHHHhccCHHHHHHHHHHHHHhhcCCCCCCcccCC Confidence 688898665 444322111100 00 00000000111111 No 212 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=91.97 E-value=0.013 Score=30.85 Aligned_cols=419 Identities=10% Similarity=0.063 Sum_probs=174.6 Q ss_pred CCCCCCCccceeeee--ccc-ccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEec Q lcl|NC_021302. 1 MAPKTVAPRTERGYV--NPL-AGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRP 77 (484) Q Consensus 1 ~~~~~~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p 77 (484) ++|+.....+++.+- .+. ...|.........+. ........|+.|++|+ .++.|-++++.....+.-.+-.-.| T Consensus 32 ~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~--~~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~d~~~~p 108 (523) T protein:vir:68 32 TSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEP--GLKSTRELIDTYRNLM-TNYEVDNAVSEIVSDAIVYEDDTEV 108 (523) T ss_pred cccCCCCcceeeeccccccccccchhhhhhhhcccc--ccchHHHHHHHHHHHh-hccchhhHHHHhhcceeeecCCCce Confidence 333333333333211 111 111111111111111 1112346799999997 5999999999888766544311111 Q ss_pred ---CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhcce-----------eeeEEEeecCCee Q lcl|NC_021302. 78 ---NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHA-----------VFEQTYFYEGGRF 143 (484) Q Consensus 78 ---~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s-----------~~Eivw~~~~g~~ 143 (484) +=+.-+..+.+.+-+.. -|+.++ .+|+.--+||. .+.++=...+..- T Consensus 109 V~i~Ld~~~~s~~iK~kI~e------------------eF~~Il-~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~ 169 (523) T protein:vir:68 109 VSINLDNTKFSPNIKSMMLD------------------EFNEVL-NHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKE 169 (523) T ss_pred EEEEecccccchHHHHHHHH------------------HHHHHH-HHhccchhhhHHHHhheeeeEEEEEEEeeCCCccc Confidence 11111122222222211 133333 33443334443 3333333333333 Q ss_pred eeeeeeeeCccceeeee---ecCCCceeeeec---cc---ccccccccceeccCCCCcccccccceEEEeecCccC--cc Q lcl|NC_021302. 144 WLKRLAPRPQSSIAYWN---VDRDGGLISIQQ---WP---AGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPG--VW 212 (484) Q Consensus 144 ~~~~l~~r~~~~~~~~~---~~~dg~l~~~~q---~~---~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~--~p 212 (484) .+.+|...+|+.+.+.+ ...+++...+.- +. ....+..-.-.......++.||.. .|+|.|..-.+ .- T Consensus 170 GI~Elr~lDPr~i~~vr~i~~~~~~g~~vi~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~d-AI~y~hSGL~d~~~~ 248 (523) T protein:vir:68 170 GIKELRRLDPRQVQYVREVITTTEAGVKIVKGYKEYFIYDTSHESYACDGRIYEAGTKIKIPKA-AIVYAHSGLVDCCGK 248 (523) T ss_pred cceeeeeeCCcceeEEEeecCCCCcchhhhhhhhhheeeccccccccccccccCCCcceecchh-heeeeeccceeCCCC Confidence 45566677777775532 222222111100 00 000000000001122345555544 58888854211 11 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHH-HHHhcCCcc-eEEecCCCCCCHHHHHHHHHHHHHHhcC----CceEEE----- Q lcl|NC_021302. 213 TGNSLLRPAYKNWKLKDELIRIEAAA-IRRHGIGVP-YLKGNEADSEDDDRMDELLEIASNYSGG----ESAGLA----- 281 (484) Q Consensus 213 ~G~gll~~~~~~~~~K~~~~~~w~~f-~Er~~~G~P-~~~gk~~~~~~~~~~~~l~~~l~~~~~g----~~a~~v----- 281 (484) .=.|.|.++..++==-++..-..+.| +-|- |-. +.-...+.-...+.-+.|.+++..+++. .++|=| T Consensus 249 ~i~gyLhkAiKp~NQLkmlEDAlVIYRitRA--PeRRvFYIDvGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk 326 (523) T protein:vir:68 249 NIIGYLHRAIKPANQLKLLEDAVVIYRITRA--PDRRVWYVDTGNMPSRKAAEHMQHVMNTMKNRIAYDATTGKIKNQQH 326 (523) T ss_pred ceeccchhhhHHHHhhHHHHhhHHHHhhhcc--ccceEEEEecCCCCchhHHHHHHHHHHhhcceeEEeccCCeeccchh Confidence 22478888887775555554444444 2221 110 1111223344444445666776666543 112211 Q ss_pred ---------cc-----CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccc----hhhHHHHHHH-HHHH Q lcl|NC_021302. 282 ---------LT-----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGS----YALASVQADT-FVQS 342 (484) Q Consensus 282 ---------ip-----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs----~A~~evh~~v-~~~~ 342 (484) +| .|++|.++.++.+. .-.+=++|..+.+-+++..+.--.+.++|+ ++..-+..++ |... T Consensus 327 ~msMlEDyWLpRReGgrgTEItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEikF~KF 405 (523) T protein:vir:68 327 IMSMTEDYWLQRRDGKAVTEVDTLPGADNT-GNMEDVRWFRNALYMALRIPITRIPSDQGGIQFDAGTSITRDELSFGKF 405 (523) T ss_pred hhhhHhhhcccccCCCcccceeeccccCCc-ChHHHHHHHHHHHHHHhCCcceeecCCCcceecccccchhHHHHHHHHH Confidence 22 47889888765332 234457899999999977665433333222 2222233333 3344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhC------CCCccccceEEecCCC--C---cHHHHHHHHHHHHhcCcccCCcccHHHH Q lcl|NC_021302. 343 VQTVADEIRDVAQAHVVEDIVDVN------WGEDEPAPLLVFDEIG--S---RQDATAAALQMLVNAGLLTPDPRLEAFL 411 (484) Q Consensus 343 ~~aD~~~i~~~ln~qli~~l~~~N------f~~~~~~P~~~~~~~~--~---~~~~~ae~~~~L~~~G~~~~~~~~~~~i 411 (484) +......++..|..-|-..|+--+ |..-...-+|.|.... . +.+.+.+++..|..+-=.+....+.+|+ T Consensus 406 I~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi 485 (523) T protein:vir:68 406 IRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMAEPFIGKYISHRTA 485 (523) T ss_pred HHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHH Confidence 555566666666554444444333 1111122244442211 1 2233445555555433233345678888 Q ss_pred HHH-hCCCCCCCCcc--ccc-ccCCCcCCCccccCCCC Q lcl|NC_021302. 412 RDA-AGLPGPDPDAD--DDE-STADTGQDEPETDEPAL 445 (484) Q Consensus 412 ~e~-~glp~p~~~e~--~~~-~~~~~~~~~~~~~~~~~ 445 (484) ++. +.+...+-.+. ... ....+--+.|+....+- T Consensus 486 ~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~e~~~f 523 (523) T protein:vir:68 486 MKDILQMSDEEIEQEAKQIEEESKEARFQDPDQEQEDF 523 (523) T ss_pred HHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 655 44432211110 000 00000000111100000 No 213 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=91.17 E-value=0.017 Score=30.27 Aligned_cols=189 Identities=15% Similarity=0.073 Sum_probs=83.5 Q ss_pred HHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHH Q lcl|NC_021302. 226 KLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIE 305 (484) Q Consensus 226 ~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~ 305 (484) .+|-. |+--+ ...++++..+..+.+.++++-..+.++...+.+++.++.+-++ ...++. T Consensus 1 V~k~~--------------~l~~~-----~~~~~~~~~~r~~~~~~~~~~~~~~~ld~~~e~~e~~~~~lsG--l~d~l~ 59 (201) T protein:vir:10 1 MWKAK--------------GLADL-----CDDSDGAARLRLAQVDNNSGVGQAIGIDADSEEYNVLNSDIGG--IDTFLS 59 (201) T ss_pred Cccch--------------HHHHH-----hcCChHHHHHHHHHHHHhhhhhhhheeecCCcceeeeecCcCC--hHHHHH Confidence 00100 00000 1112334444445555554322233444455788888876443 445666 Q ss_pred HHHHHHHHHHhhhhhc---ccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCC Q lcl|NC_021302. 306 YHDHQMALVALAHFLN---LDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIG 382 (484) Q Consensus 306 ~~d~~Isk~ilGqtlt---~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~ 382 (484) ..-.+||-+ .|-.+| ..+-+|=.|.|+.-..++-+.+++.+.....-+.+.|++ +...+ .--.|+|.+.- T Consensus 60 ~~~~~iaa~-s~iP~t~LfG~sp~Glnatge~d~~nyyd~i~~~Qe~~l~p~le~l~~----~~~~~--~~~~~~f~pL~ 132 (201) T protein:vir:10 60 QKFDRIVAL-SGIHEIILKGKNVGGVSASQNTALETFYGYVDRKRKAELLPLLEFLLP----FIVTE--QEWSVEFNPLS 132 (201) T ss_pred HHHHHHHhH-hcCchhhhcCCCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHH----hhcCC--CCceEeeCCCC Confidence 666666665 333333 223345455677777788888888875443333333444 22111 11255564422 Q ss_pred --------CcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCcc-ccCCCCcccc Q lcl|NC_021302. 383 --------SRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPE-TDEPALPNTS 449 (484) Q Consensus 383 --------~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~-~~~~~~~~~~ 449 (484) +..+..+++++++++.|+.. .+.+++++---.....-.... .+..-...+ .+++..|... T Consensus 133 ~~s~kekAei~~~~a~a~~~~~~~g~i~-----~~e~r~~L~~~~~~~~~~~~~--~~~~~~~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 133 QVSDKDKSEILEKNVNSVAALIAAGIID-----ADEARDTLRAISTEVKIGEGS--IQTEVVINESEDPLDVSANN 201 (201) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHcCCCC-----HHHHHHHHHhcCCcCCCCCCC--CCccccccccCCCCCCCCCC Confidence 12345678888999999754 356666653211100000000 000000000 0111111111 No 214 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=90.83 E-value=0.019 Score=30.04 Aligned_cols=425 Identities=11% Similarity=0.020 Sum_probs=186.0 Q ss_pred CCCCCCCccceeeeeccccc---ch-hhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEe Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAG---FG-TFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIR 76 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~---~~-~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~ 76 (484) ++|+.....+++..-..... +| ...+.++ +.. .......|+.|++|+ .++.|-++++.....+.-.+-.-. T Consensus 31 ~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~--e~~--~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~d~~~~ 105 (521) T protein:vir:81 31 AAPKNNDGATEVEINDNLPASAWNSLTQQFYST--DQK--ISTTKQLVNTYRGLM-NNHEVENAVQNIVNDAIVFEEGHE 105 (521) T ss_pred ccCCCCCCceEecccCCCcceeecceeeeeccc--ccc--hhhHHHHHHHHHHHh-hccchhhHHHHhhcceeEecCCCc Confidence 45555554444422111000 11 0111111 111 112356799999997 599999999988876654432111 Q ss_pred c---CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeC Q lcl|NC_021302. 77 P---NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRP 152 (484) Q Consensus 77 p---~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~ 152 (484) | +=++-+..+.+.+-+.. .+...+.-.+|+.-..++. .-...|--.+.++.. .+..-.+.+|...+ T Consensus 106 pV~l~L~~~~~s~~iK~kI~e---------eF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid-~~pk~GI~Elr~lD 175 (521) T protein:vir:81 106 VVSLNLEATGFSESVKERIHE---------EFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIG-KNPKDGIVELRQLD 175 (521) T ss_pred eEEEEecccccchHHHHHHHH---------HHHHHHHHhccchhhhHHHhhhhhcceEEEEEEEc-CCccccceeeeeeC Confidence 1 10111122222222211 1111111122333222222 122335555666655 33344556777777 Q ss_pred ccceeeeeecCCCce-----e-eeec----ccccccccccceeccCCCCcccccccceEEEeecCc--cCccccchhHHH Q lcl|NC_021302. 153 QSSIAYWNVDRDGGL-----I-SIQQ----WPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMD--PGVWTGNSLLRP 220 (484) Q Consensus 153 ~~~~~~~~~~~dg~l-----~-~~~q----~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~--~~~p~G~gll~~ 220 (484) |+.+.+.+....... + .... .+.+........ ......++.||. ..|+|.|..- .++..=.|.|.+ T Consensus 176 Pr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~-~~~~~~~vkI~~-dAI~y~hSGl~d~~~~~i~syLhk 253 (521) T protein:vir:81 176 PRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQ-VFSPNSRVKIPR-SAITYAHSGLMDCDDKYIIGYLHR 253 (521) T ss_pred CcceeeeeeecccccCccceecceeeeeeeecCCccccccce-eecCCcceeech-hheeeeeccceeCCCCeeeecchh Confidence 877766543332111 0 0000 000000000000 112223344443 4677777543 222223578888 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHhcCCcc-eEEecCCCCCCHHHHHHHHHHHHHHhc----CCceEEE------------- Q lcl|NC_021302. 221 AYKNWKLKDELIRIEAAA-IRRHGIGVP-YLKGNEADSEDDDRMDELLEIASNYSG----GESAGLA------------- 281 (484) Q Consensus 221 ~~~~~~~K~~~~~~w~~f-~Er~~~G~P-~~~gk~~~~~~~~~~~~l~~~l~~~~~----g~~a~~v------------- 281 (484) +..++==-++..-..+.| +-|- |-. +.-...+.-...+.-+.|.+++..+++ +..+|=+ T Consensus 254 AiKp~NQLkm~EDAlVIYRitRA--PeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDy 331 (521) T protein:vir:81 254 AVKPANQLKLLEDAMVVYRITRA--PERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTEDY 331 (521) T ss_pred hhHhHHhhHHHHhhHHHHhhhcc--ccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccchhhhh Confidence 888875555555554444 2221 110 111122344444445667777777766 4334333 Q ss_pred -cc-----CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcc--cccc---cchhhHHHHHHH-HHHHHHHHHHH Q lcl|NC_021302. 282 -LT-----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNL--DGKG---GSYALASVQADT-FVQSVQTVADE 349 (484) Q Consensus 282 -ip-----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~--~~~g---Gs~A~~evh~~v-~~~~~~aD~~~ 349 (484) +| .|++|.++.++.+. .-.+=++|..+.+-+++..+.--. ++++ ++++..-+..++ |...+...... T Consensus 332 WLpRReGgrgTEItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~r 410 (521) T protein:vir:81 332 WLQRRDGKAITDVTTLPGASGM-SDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTRQSQ 410 (521) T ss_pred cccccCCCcccceeecccCCCC-ChHHHHHHHHHHHHHHhCCccccccCCCCcceeccccchhhHHHHHHHHHHHHHHHH Confidence 23 47889888764332 334457899999999977655333 4332 234444444454 33445566666 Q ss_pred HHHHHHHHHHHHHHHhCCCCc------cccceEEecCCC--C---cHHHHHHHHHHHHhcCcccCCcccHHHHHHH-hCC Q lcl|NC_021302. 350 IRDVAQAHVVEDIVDVNWGED------EPAPLLVFDEIG--S---RQDATAAALQMLVNAGLLTPDPRLEAFLRDA-AGL 417 (484) Q Consensus 350 i~~~ln~qli~~l~~~Nf~~~------~~~P~~~~~~~~--~---~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~-~gl 417 (484) ++..|..-|-..|+--+.-.. ...-+|.|.... . +.+.+.+++..|..+-=.+....+.+|+++. +.+ T Consensus 411 Fs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~ 490 (521) T protein:vir:81 411 FSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKY 490 (521) T ss_pred HHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcc Confidence 666666544444544443211 122244442211 1 2233445555555433233345678888655 444 Q ss_pred CCCCCCcc--ccc-ccCCCcCCCccccCCCC Q lcl|NC_021302. 418 PGPDPDAD--DDE-STADTGQDEPETDEPAL 445 (484) Q Consensus 418 p~p~~~e~--~~~-~~~~~~~~~~~~~~~~~ 445 (484) ...+-.+. ... ....+--+.|+.+...- T Consensus 491 tDeei~~~~k~I~~E~~~~~~~~p~~~~~~f 521 (521) T protein:vir:81 491 TDDQMDTEKKQIEEEANDPRFKQTPDEIEDF 521 (521) T ss_pred CHHHHHHHHHHHHHHhhCCCCCCCcccccCC Confidence 32211110 000 00011011111111011 No 215 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=90.72 E-value=0.019 Score=29.97 Aligned_cols=408 Identities=10% Similarity=0.047 Sum_probs=152.0 Q ss_pred cceeeeecccc-cchhhh-hhhccccccccccc----ccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC-- Q lcl|NC_021302. 9 RTERGYVNPLA-GFGTFL-AQGLDQFEQVDELR----WPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA-- 80 (484) Q Consensus 9 ~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~lr----~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~-- 80 (484) +.=+-.+.... .++... .+-+.+....+... ....|..|..|- .+..+.+.+-.. T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y-----------------~g~~~~~~~~~~~~ 63 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQY-----------------EGDYPQVEYINSQG 63 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHh-----------------cCCCcccccccccc Confidence 10000000000 000000 00001000000000 011233333332 233222211000 Q ss_pred --------CHHHHHHHHHHHHhhhccch--------------------hhhhHHHhhcCCCHHHHHHHH-HHHHhhccee Q lcl|NC_021302. 81 --------RPEVVEHVAACLGLPVEGDE--------------------SDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAV 131 (484) Q Consensus 81 --------~~e~~~~~~~~l~~~~~~~~--------------------~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~ 131 (484) +=..++.++..++..+.++. .+....+.+..-+|...+... .+++..|=.+ T Consensus 64 ~~~~~~~~sl~~~~~i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a 143 (517) T protein:vir:98 64 KIQERDYMTLNLRKLSADVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLT 143 (517) T ss_pred cccccceeecCcHHHHHHHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEE Confidence 00122333344443333321 112233333444566655444 5788889888 Q ss_pred eeEEEeecCCeeeeeeeeeeCccceeeeeecCCCce---------------------eeeecccccc--ccccc----ce Q lcl|NC_021302. 132 FEQTYFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGL---------------------ISIQQWPAGT--FGGPG----MV 184 (484) Q Consensus 132 ~Eivw~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l---------------------~~~~q~~~~~--~~~~~----~~ 184 (484) +=+.|.. |.. +|.++++..|.-..++.++.. +....+..-. .+... .+ T Consensus 144 ~k~~~d~--~~~---~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly 218 (517) T protein:vir:98 144 VRPYVDN--GEI---EFSWALANAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELY 218 (517) T ss_pred EEEEEeC--Cee---EEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEE Confidence 8777752 222 133333333211122221111 0000000000 00000 00 Q ss_pred e-ccCCCCccc---------ccccc---------eEEEee----cCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 185 V-MAPNSMGPA---------IPVEQ---------LVVYTH----DMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRR 241 (484) Q Consensus 185 ~-~~~~~~~~~---------lp~~k---------~l~~~~----~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er 241 (484) . .....-|.+ |++.. |.+++. +...++|+|.|.+..+....-.-...+..|..-++. T Consensus 219 ~s~~~~~lG~~v~L~~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~ 298 (517) T protein:vir:98 219 KSDNEGEIGKRIPLEELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKM 298 (517) T ss_pred ecCCCccccccccccccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHh Confidence 0 000111111 11111 223322 223467999999999987776667666666665553 Q ss_pred hcCCcceEEecCC-----CCCCHHHHHHHHHHHHHHhcCCceEEEcc---CCceEEEecccCCchhHHHHHHHHHHHHHH Q lcl|NC_021302. 242 HGIGVPYLKGNEA-----DSEDDDRMDELLEIASNYSGGESAGLALT---AGEEAGILSPNGTPLDPRRAIEYHDHQMAL 313 (484) Q Consensus 242 ~~~G~P~~~gk~~-----~~~~~~~~~~l~~~l~~~~~g~~a~~vip---~~~~ie~~~~~~~~~~~~~li~~~d~~Isk 313 (484) ....+.+ +.. ............+ .+...+..+. .+.-|+..+..--...|..-++.+=++|+. T Consensus 299 --g~~~i~v-p~~~l~~~~~~~g~~~~~~~d------~~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~ 369 (517) T protein:vir:98 299 --GQRTVFV-SDVMLRTVPDESGMPPPQVFD------PDVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALRTLEM 369 (517) T ss_pred --CCcceec-ChhhhccccCCCCcccCCCCC------cccceeeeccCCCCCCceeeeccccchHHHHHHHHHHHHHHHH Confidence 1222322 110 0000000000000 0000011010 111122222222233455555555555554 Q ss_pred HH-hh-hhhcccccccchhhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCCc---cccceEEecC- Q lcl|NC_021302. 314 VA-LA-HFLNLDGKGGSYALASVQA--DTFVQSVQTVADEIRDVAQAHVVEDIVDVN-----WGED---EPAPLLVFDE- 380 (484) Q Consensus 314 ~i-lG-qtlt~~~~gGs~A~~evh~--~v~~~~~~aD~~~i~~~ln~qli~~l~~~N-----f~~~---~~~P~~~~~~- 380 (484) .+ ++ ++++.+++ |.....++.. .-....+.+-.+.+...| ++|++-++.+- ++.. ...+.+.|++ T Consensus 370 ~~Gls~~t~~~~~~-~~kTATEi~s~~~~~~~t~~~~~~~~~~aL-~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~ 447 (517) T protein:vir:98 370 ELKLSVGTFSFDGR-SMKTATEIVSENDLTYRTRNDHVYEVEQFI-KGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDG 447 (517) T ss_pred HhCCCccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCC Confidence 43 33 34444433 2221223322 233334455566677777 45777665321 3221 1225788875 Q ss_pred CCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCcccc-CCCCcccccccc Q lcl|NC_021302. 381 IGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETD-EPALPNTSGTTS 453 (484) Q Consensus 381 ~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 453 (484) ...|.++.++...+++..|+.. .+.++.+.||+.+.+-.+.+...........+... .+......|+.. T Consensus 448 i~~D~~~~~~~~~~~v~aG~ms----~~~~i~~~~g~~eeeA~~e~~~i~~E~~~~~~~~~~~~~~~~~~gd~e 517 (517) T protein:vir:98 448 VFQDRSALLRFYGQAKTFGFIP----TVEAIQRIFKVPKKTAEQWLEEIRKDQIELDPVTISQRAQKRMFGDEE 517 (517) T ss_pred CCCCHHHHHHHHHHHHhcCCCC----HHHHHHHhCCCChHHHHHHHHHHHHhccccCCCCccccccCCCCCCCC Confidence 5678888889999999999743 35789999998654322222111111111111110 001111111111 No 216 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=90.69 E-value=0.02 Score=29.95 Aligned_cols=417 Identities=11% Similarity=0.055 Sum_probs=177.6 Q ss_pred CCCCCCCccceeee--eccc--ccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEe Q lcl|NC_021302. 1 MAPKTVAPRTERGY--VNPL--AGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIR 76 (484) Q Consensus 1 ~~~~~~~~~~~~~~--~~~~--~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~ 76 (484) ++|+.....+++.. .++. +.+++. +...... ......|+.|++|+ .++.|-++++.....+.-.+-.-. T Consensus 30 ~~p~~~dGa~~i~~~~~~~~~~g~~~~~----~~~~~~~--~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~d~~~~ 102 (516) T protein:vir:10 30 ATPKKDDGATEIETREGEATYNAVMQQF----FGIDNNI--SGTKDLINTYRQLI-NNPEVERAVANIVNEAIVYERGHK 102 (516) T ss_pred cCCCCCCCceeeecCCCcccccceeeee----ecccccc--chHHHHHHHHHHHh-hccchhhHHHHhhcceeEecCCCc Confidence 44444444444321 1111 111111 1111111 12346799999997 599999999988876654322111 Q ss_pred c---CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeC Q lcl|NC_021302. 77 P---NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRP 152 (484) Q Consensus 77 p---~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~ 152 (484) | +=+.-+..+.+.+-+.. .+...+.-.+|+.-..++. .-...|--.+.++= ++..-.+.+|...+ T Consensus 103 pV~l~L~~~~~s~~ik~kI~e---------eF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKii--d~~k~GI~Elr~lD 171 (516) T protein:vir:10 103 VVSLDLDDTDFGSNVKEKILE---------EFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIM--PNPKKGIAELRRLD 171 (516) T ss_pred eEEEEecccCcchHHHHHHHH---------HHHHHHHHhccchhhhHHHhhhhhcceEEEEEEe--cCccccceeeeeeC Confidence 1 10111122222222211 0111111122322221111 11112333333221 13333455666667 Q ss_pred ccceeeeeec----CCCcee--ee-eccccccccccccee--ccCCCCcccccccceEEEeecCc---cCccccchhHHH Q lcl|NC_021302. 153 QSSIAYWNVD----RDGGLI--SI-QQWPAGTFGGPGMVV--MAPNSMGPAIPVEQLVVYTHDMD---PGVWTGNSLLRP 220 (484) Q Consensus 153 ~~~~~~~~~~----~dg~l~--~~-~q~~~~~~~~~~~~~--~~~~~~~~~lp~~k~l~~~~~~~---~~~p~G~gll~~ 220 (484) |+.+.+.+.- .+|..+ .. ..+.-+......... ....+..+.||. ..|+|.|..- .++.+ .|.|.+ T Consensus 172 Pr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~-dAI~y~hSGL~d~~~~~i-~syLhk 249 (516) T protein:vir:10 172 PRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPR-SAVVYASSGLMDCSDRGI-IGYLHN 249 (516) T ss_pred CcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeech-hheeeecccceeCCCCce-eeeehh Confidence 7777654432 112110 00 000000000000000 011223344544 4588888542 33444 788999 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHhcCCcceEE---ecCCCCCCHHHHHHHHHHHHHHhcC----CceEEE----------- Q lcl|NC_021302. 221 AYKNWKLKDELIRIEAAA-IRRHGIGVPYLK---GNEADSEDDDRMDELLEIASNYSGG----ESAGLA----------- 281 (484) Q Consensus 221 ~~~~~~~K~~~~~~w~~f-~Er~~~G~P~~~---gk~~~~~~~~~~~~l~~~l~~~~~g----~~a~~v----------- 281 (484) +..++==-++..-..+.| +-| -|=+. ...+.-...+.-+.|.+++..+++. .++|=| T Consensus 250 AiKp~NQLkm~EDAlVIYRitR----APeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlE 325 (516) T protein:vir:10 250 AVKPANQLKLLEDAMVIYRITR----APERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTE 325 (516) T ss_pred hhHhHHhhHHHHhhHHHHhhhc----cccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHh Confidence 888885555555555544 222 12111 1223334444445666666666543 112211 Q ss_pred ---cc-----CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhccccccc-----chhhHHHHHHH-HHHHHHHHH Q lcl|NC_021302. 282 ---LT-----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGG-----SYALASVQADT-FVQSVQTVA 347 (484) Q Consensus 282 ---ip-----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gG-----s~A~~evh~~v-~~~~~~aD~ 347 (484) +| .|++|.++.++.+. .-.+=++|..+.+-+++..+.--.+.++| +++..-+..++ |...+.... T Consensus 326 DyWLpRReGgrgTEItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR 404 (516) T protein:vir:10 326 DYWLMRRDGKSVTEVSSLPGAQTM-GDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQ 404 (516) T ss_pred hhcccccCCCCccceeeccccCCc-ChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHH Confidence 22 47889888765332 33445889999999997776543332222 34433344444 344455666 Q ss_pred HHHHHHHHHHHHHHHHHhC------CCCccccceEEecCCC--C---cHHHHHHHHHHHHhcCcccCCcccHHHHHHH-h Q lcl|NC_021302. 348 DEIRDVAQAHVVEDIVDVN------WGEDEPAPLLVFDEIG--S---RQDATAAALQMLVNAGLLTPDPRLEAFLRDA-A 415 (484) Q Consensus 348 ~~i~~~ln~qli~~l~~~N------f~~~~~~P~~~~~~~~--~---~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~-~ 415 (484) ..++..|..-|-..|+--+ |..-...-+|.|.... . +.+.+.+++..|..+-=.+....+.+|+++. + T Consensus 405 ~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~IL 484 (516) T protein:vir:10 405 HDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNIL 484 (516) T ss_pred HHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHh Confidence 6666666654444444333 2111122244442211 1 2233445555555432223345678898654 5 Q ss_pred CCCCCCCCccc--ccc-cCCC--cCCCccccC Q lcl|NC_021302. 416 GLPGPDPDADD--DES-TADT--GQDEPETDE 442 (484) Q Consensus 416 glp~p~~~e~~--~~~-~~~~--~~~~~~~~~ 442 (484) .++..+-.+.. ... ...+ ..|+.+.+. T Consensus 485 r~tDeei~~e~k~I~~E~~~~~~~~p~~~~~f 516 (516) T protein:vir:10 485 QMTEEQIAQEEKQIEQEAGIKRFQNPENEDDF 516 (516) T ss_pred cCCHhhHHHHHHHHHHhhhCCCCCCCCccccC Confidence 55433211111 000 0000 011111111 No 217 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=90.69 E-value=0.02 Score=29.95 Aligned_cols=417 Identities=11% Similarity=0.055 Sum_probs=177.6 Q ss_pred CCCCCCCccceeee--eccc--ccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEe Q lcl|NC_021302. 1 MAPKTVAPRTERGY--VNPL--AGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIR 76 (484) Q Consensus 1 ~~~~~~~~~~~~~~--~~~~--~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~ 76 (484) ++|+.....+++.. .++. +.+++. +...... ......|+.|++|+ .++.|-++++.....+.-.+-.-. T Consensus 30 ~~p~~~dGa~~i~~~~~~~~~~g~~~~~----~~~~~~~--~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~d~~~~ 102 (516) T protein:vir:10 30 ATPKKDDGATEIETREGEATYNAVMQQF----FGIDNNI--SGTKDLINTYRQLI-NNPEVERAVANIVNEAIVYERGHK 102 (516) T ss_pred cCCCCCCCceeeecCCCcccccceeeee----ecccccc--chHHHHHHHHHHHh-hccchhhHHHHhhcceeEecCCCc Confidence 44444444444321 1111 111111 1111111 12346799999997 599999999988876654322111 Q ss_pred c---CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeC Q lcl|NC_021302. 77 P---NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRP 152 (484) Q Consensus 77 p---~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~ 152 (484) | +=+.-+..+.+.+-+.. .+...+.-.+|+.-..++. .-...|--.+.++= ++..-.+.+|...+ T Consensus 103 pV~l~L~~~~~s~~ik~kI~e---------eF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKii--d~~k~GI~Elr~lD 171 (516) T protein:vir:10 103 VVSLDLDDTDFGSNVKEKILE---------EFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIM--PNPKKGIAELRRLD 171 (516) T ss_pred eEEEEecccCcchHHHHHHHH---------HHHHHHHHhccchhhhHHHhhhhhcceEEEEEEe--cCccccceeeeeeC Confidence 1 10111122222222211 0111111122322221111 11112333333221 13333455666667 Q ss_pred ccceeeeeec----CCCcee--ee-eccccccccccccee--ccCCCCcccccccceEEEeecCc---cCccccchhHHH Q lcl|NC_021302. 153 QSSIAYWNVD----RDGGLI--SI-QQWPAGTFGGPGMVV--MAPNSMGPAIPVEQLVVYTHDMD---PGVWTGNSLLRP 220 (484) Q Consensus 153 ~~~~~~~~~~----~dg~l~--~~-~q~~~~~~~~~~~~~--~~~~~~~~~lp~~k~l~~~~~~~---~~~p~G~gll~~ 220 (484) |+.+.+.+.- .+|..+ .. ..+.-+......... ....+..+.||. ..|+|.|..- .++.+ .|.|.+ T Consensus 172 Pr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~-dAI~y~hSGL~d~~~~~i-~syLhk 249 (516) T protein:vir:10 172 PRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPR-SAVVYASSGLMDCSDRGI-IGYLHN 249 (516) T ss_pred CcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeech-hheeeecccceeCCCCce-eeeehh Confidence 7777654432 112110 00 000000000000000 011223344544 4588888542 33444 788999 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHhcCCcceEE---ecCCCCCCHHHHHHHHHHHHHHhcC----CceEEE----------- Q lcl|NC_021302. 221 AYKNWKLKDELIRIEAAA-IRRHGIGVPYLK---GNEADSEDDDRMDELLEIASNYSGG----ESAGLA----------- 281 (484) Q Consensus 221 ~~~~~~~K~~~~~~w~~f-~Er~~~G~P~~~---gk~~~~~~~~~~~~l~~~l~~~~~g----~~a~~v----------- 281 (484) +..++==-++..-..+.| +-| -|=+. ...+.-...+.-+.|.+++..+++. .++|=| T Consensus 250 AiKp~NQLkm~EDAlVIYRitR----APeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlE 325 (516) T protein:vir:10 250 AVKPANQLKLLEDAMVIYRITR----APERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTE 325 (516) T ss_pred hhHhHHhhHHHHhhHHHHhhhc----cccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHh Confidence 888885555555555544 222 12111 1223334444445666666666543 112211 Q ss_pred ---cc-----CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhccccccc-----chhhHHHHHHH-HHHHHHHHH Q lcl|NC_021302. 282 ---LT-----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGG-----SYALASVQADT-FVQSVQTVA 347 (484) Q Consensus 282 ---ip-----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gG-----s~A~~evh~~v-~~~~~~aD~ 347 (484) +| .|++|.++.++.+. .-.+=++|..+.+-+++..+.--.+.++| +++..-+..++ |...+.... T Consensus 326 DyWLpRReGgrgTEItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR 404 (516) T protein:vir:10 326 DYWLMRRDGKSVTEVSSLPGAQTM-GDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQ 404 (516) T ss_pred hhcccccCCCCccceeeccccCCc-ChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHH Confidence 22 47889888765332 33445889999999997776543332222 34433344444 344455666 Q ss_pred HHHHHHHHHHHHHHHHHhC------CCCccccceEEecCCC--C---cHHHHHHHHHHHHhcCcccCCcccHHHHHHH-h Q lcl|NC_021302. 348 DEIRDVAQAHVVEDIVDVN------WGEDEPAPLLVFDEIG--S---RQDATAAALQMLVNAGLLTPDPRLEAFLRDA-A 415 (484) Q Consensus 348 ~~i~~~ln~qli~~l~~~N------f~~~~~~P~~~~~~~~--~---~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~-~ 415 (484) ..++..|..-|-..|+--+ |..-...-+|.|.... . +.+.+.+++..|..+-=.+....+.+|+++. + T Consensus 405 ~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~IL 484 (516) T protein:vir:10 405 HDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNIL 484 (516) T ss_pred HHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHh Confidence 6666666654444444333 2111122244442211 1 2233445555555432223345678898654 5 Q ss_pred CCCCCCCCccc--ccc-cCCC--cCCCccccC Q lcl|NC_021302. 416 GLPGPDPDADD--DES-TADT--GQDEPETDE 442 (484) Q Consensus 416 glp~p~~~e~~--~~~-~~~~--~~~~~~~~~ 442 (484) .++..+-.+.. ... ...+ ..|+.+.+. T Consensus 485 r~tDeei~~e~k~I~~E~~~~~~~~p~~~~~f 516 (516) T protein:vir:10 485 QMTEEQIAQEEKQIEQEAGIKRFQNPENEDDF 516 (516) T ss_pred cCCHhhHHHHHHHHHHhhhCCCCCCCCccccC Confidence 55433211111 000 0000 011111111 No 218 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=90.35 E-value=0.021 Score=29.75 Aligned_cols=448 Identities=11% Similarity=0.087 Sum_probs=177.1 Q ss_pred CCCCCCCccce---eeeeccc--ccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcE- Q lcl|NC_021302. 1 MAPKTVAPRTE---RGYVNPL--AGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWR- 74 (484) Q Consensus 1 ~~~~~~~~~~~---~~~~~~~--~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~- 74 (484) ++++..++..+ -|...-. +.++. +.++.+. .......|+.|++|+ .++.|-++++.....+.-.+-. T Consensus 15 ~~~~~~s~~~p~~ddg~~~~~~~g~~~~--~~~~~~~----~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~d~~~ 87 (558) T protein:vir:10 15 KSTSIISPVPKNNEDGVDNFISSGFYGQ--YVDIEGA----YRSEYDLIRRYREMA-LHPEADGAIEDVVNEAIVSDLYD 87 (558) T ss_pred hccCCccccCCCccccccceeccceeee--eecccch----hhhHHHHHHHHHHHh-hccchhhHHHHhhcceeEecCCC Confidence 33333333222 1211111 11111 1122111 122356799999997 5999999999888766543221 Q ss_pred ----EecCC--CCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeee Q lcl|NC_021302. 75 ----IRPNG--ARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKR 147 (484) Q Consensus 75 ----v~p~~--~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~ 147 (484) |+-.+ .++.+.+.+.+.... .+.-.+|+.-..++. .-...|--.+.++=...+..-.+.+ T Consensus 88 ~pV~i~Ld~~~~s~~iK~kI~eEF~~-------------Il~ll~F~~~~~e~fR~WYVDgRiyfHKiid~k~pk~GI~E 154 (558) T protein:vir:10 88 SPVEVELSNLNASNTLKKKIREEFRY-------------IKEMMDFDKKSHEIFRNWYVDGRVFYLKVIDTKNPQEGIQD 154 (558) T ss_pred ceEEEEecccCcchHHHHHHHHHHHH-------------HHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCcccccee Confidence 11111 122233333333221 111112322222221 1122233344443333333334556 Q ss_pred eeeeCccceeeeeec----CCCcee-eee------------c---ccccccccccceeccCCCCcccccccceEEEeecC Q lcl|NC_021302. 148 LAPRPQSSIAYWNVD----RDGGLI-SIQ------------Q---WPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDM 207 (484) Q Consensus 148 l~~r~~~~~~~~~~~----~dg~l~-~~~------------q---~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~ 207 (484) |...+|+.+.+.+-- .+++-+ .++ . +..................++.|| ...|+|.|.. T Consensus 155 Lr~lDPr~i~~Vr~i~~~~~~~~~~~~~~~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~~~~~~~~vkI~-~dAI~y~hSG 233 (558) T protein:vir:10 155 LRYIDPLKIKFIRQEKRKPGNQDPAIRVRSEQDVVPNPEFEEFYIYTPKVQHPTGMVGQMGGKNSIKIA-KDSITMCTSG 233 (558) T ss_pred eeeeCcccceeeeeeccccccccceeeeecccceeeccceeEeeeecCCcccccccceeecCCCceeec-hhheeeeccc Confidence 666777776543221 011000 000 0 000000000000011122223333 3567888763 Q ss_pred ----ccCccccchhHHHHHHHHHHHHHHHHHHHHH-HHHhcCCcc-eEEecCCCCCCHHHHHHHHHHHHHHhcC----Cc Q lcl|NC_021302. 208 ----DPGVWTGNSLLRPAYKNWKLKDELIRIEAAA-IRRHGIGVP-YLKGNEADSEDDDRMDELLEIASNYSGG----ES 277 (484) Q Consensus 208 ----~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f-~Er~~~G~P-~~~gk~~~~~~~~~~~~l~~~l~~~~~g----~~ 277 (484) ..++ =.|.|.++..++==-++..-..+.| +-|- |-. +.-...+.-...+.-..|.+++..+++. .+ T Consensus 234 L~d~~~~~--i~syLhkAIKp~NQLkmlEDAlVIYRitRA--PERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~ 309 (558) T protein:vir:10 234 LVDRNKNR--VLSYLHKAIKALNQLRMIEDSLVIYRLSRA--PERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDAN 309 (558) T ss_pred ceecCCCe--eeecchHhhHhHHhhHHHHhhHHHHhhhcc--ccceEEEEecCCCCchhHHHHHHHHHHhccceEEEecc Confidence 2222 2478888887775555555544444 2221 110 1111223334444445666776666532 12 Q ss_pred eEEE--------------cc-----CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhccccccc---chhhHHHH Q lcl|NC_021302. 278 AGLA--------------LT-----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGG---SYALASVQ 335 (484) Q Consensus 278 a~~v--------------ip-----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gG---s~A~~evh 335 (484) +|-| +| .|++|.++.++.+. .-..=++|..+.+-+++..+.--.+.+|| .++..-+. T Consensus 310 TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnL-gem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItR 388 (558) T protein:vir:10 310 TGEVRDDRKFMSMMEDFWLPRREGGRGTEITTLPGGQNL-GELSDVDYFQKKLYRALGVPESRIAAEGGFNLGRSSEILR 388 (558) T ss_pred CceecccchhhhhHhhhcccccCCCCccceeeccccCCc-chHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhH Confidence 2222 22 47889888765432 23345789999999997665433333332 12222233 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCccccceEEecCCC--C---cHHHHHHHHHHHHhcCcccC Q lcl|NC_021302. 336 ADT-FVQSVQTVADEIRDVAQAHVVEDIVDVN------WGEDEPAPLLVFDEIG--S---RQDATAAALQMLVNAGLLTP 403 (484) Q Consensus 336 ~~v-~~~~~~aD~~~i~~~ln~qli~~l~~~N------f~~~~~~P~~~~~~~~--~---~~~~~ae~~~~L~~~G~~~~ 403 (484) .++ |...+......++..|..-|-..|+--+ |..-...-+|.|.... . +.+.+.+++..|..+-=.+. T Consensus 389 DEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvG 468 (558) T protein:vir:10 389 DELKFAKFVGRLRKRFAAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIG 468 (558) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhc Confidence 343 3344555666666666554444444333 2111122244442211 1 22334455555554332333 Q ss_pred CcccHHHHHHHh-CCCCCCCC--------cccccccCCCcCCCccccC----------CCCccccccccccccccccccc Q lcl|NC_021302. 404 DPRLEAFLRDAA-GLPGPDPD--------ADDDESTADTGQDEPETDE----------PALPNTSGTTSTTNAPQARKRP 464 (484) Q Consensus 404 ~~~~~~~i~e~~-glp~p~~~--------e~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~ 464 (484) ...+.+|+++.+ .+...+-. |...+.-+++.+.+|.+.. +..+....++.....+..+.+. T Consensus 469 ky~S~dyi~k~ILr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 548 (558) T protein:vir:10 469 KYYSTEYVRKRVLRQTDMEIEEIDTQIEDEIQKGIIPDPSQIDPITGEPLPQEGDPAMEGMGEQPVDPDLEAQAQAVDAQ 548 (558) T ss_pred cccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCccccChhhccccCccCCchhccCCCCCcccccccchhhhhhh Confidence 456788886553 33211100 0000111111111111111 1111111111111111111111 Q ss_pred cccchHHHhc Q lcl|NC_021302. 465 RGRSPRDRRK 474 (484) Q Consensus 465 ~~~~~~~~~~ 474 (484) ..++...+-+ T Consensus 549 ~~~~~~~~~~ 558 (558) T protein:vir:10 549 YSKDTKKAEL 558 (558) T ss_pred hhhhhhhhcC Confidence 1222222222 No 219 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=90.21 E-value=0.022 Score=29.67 Aligned_cols=428 Identities=11% Similarity=0.027 Sum_probs=188.6 Q ss_pred CCCCCCCccceeeee--cccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEec- Q lcl|NC_021302. 1 MAPKTVAPRTERGYV--NPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRP- 77 (484) Q Consensus 1 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p- 77 (484) .+|+.....+++-+- ++....|....+.+..+.. .......|+.|++|+ .++.|-++++.....+.-.+-.-.| T Consensus 31 ~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~--~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~d~~~~pV 107 (521) T protein:vir:65 31 AAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQK--ISTTKQLVNTYRGLM-NNHEVENAVQNIVNDAIVFEEGHEVV 107 (521) T ss_pred cCCCCCCCceeecccCCccccccccceeeeccccch--hhhHHHHHHHHHHHh-hccchhhHHHHhhcceeEecCCCceE Confidence 556666665555321 1111111111111111111 112356799999997 5999999999888766543221111 Q ss_pred --CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCcc Q lcl|NC_021302. 78 --NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQS 154 (484) Q Consensus 78 --~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~ 154 (484) +=++-+..+.+.+-+.. .+...+.-.+|+.-..++. .-...|--.+.++.. .+..-.+.+|...+|+ T Consensus 108 ~l~L~~~~~s~~iK~kI~e---------eF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid-~~pk~GI~ELr~lDPr 177 (521) T protein:vir:65 108 SLNLEATGFSESVKERIHE---------EFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIG-KNPKDGIVELRQLDPR 177 (521) T ss_pred EEEecccccchHHHHHHHH---------HHHHHHHHhccchhhhHHHhhhhhcceeEEEEEEc-CCccccceeeeeeCCc Confidence 10111122222222211 1111111222333222222 123345556666665 3334455677777787 Q ss_pred ceeeeeecCCCce-----e-eeec-ccccccccccc--eeccCCCCcccccccceEEEeecCc--cCccccchhHHHHHH Q lcl|NC_021302. 155 SIAYWNVDRDGGL-----I-SIQQ-WPAGTFGGPGM--VVMAPNSMGPAIPVEQLVVYTHDMD--PGVWTGNSLLRPAYK 223 (484) Q Consensus 155 ~~~~~~~~~dg~l-----~-~~~q-~~~~~~~~~~~--~~~~~~~~~~~lp~~k~l~~~~~~~--~~~p~G~gll~~~~~ 223 (484) .+.+.+....... + .... +.-...+.... -.......++.||. ..|+|.|..- .++..=.|.|.++.. T Consensus 178 ~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~-dAI~y~hSGl~d~~~~~i~syLhkAiK 256 (521) T protein:vir:65 178 NLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPR-SAITYAHSGLMDCDDKYIIGYLHRAVK 256 (521) T ss_pred ceeeeeeecccccCCcceecceeeeeeeecCCcceeccceeecCCcceeech-hheeeeeccceeCCCCeeeecchhhhH Confidence 7766543332211 0 0000 00000000000 00111223344443 4677777543 222223578888888 Q ss_pred HHHHHHHHHHHHHHH-HHHhcCCcc-eEEecCCCCCCHHHHHHHHHHHHHHhc----CCceEEE--------------cc Q lcl|NC_021302. 224 NWKLKDELIRIEAAA-IRRHGIGVP-YLKGNEADSEDDDRMDELLEIASNYSG----GESAGLA--------------LT 283 (484) Q Consensus 224 ~~~~K~~~~~~w~~f-~Er~~~G~P-~~~gk~~~~~~~~~~~~l~~~l~~~~~----g~~a~~v--------------ip 283 (484) ++==-++..-..+.| +-|- |-. +.-...+.-...+.-+.|.+++..+++ +..+|=+ +| T Consensus 257 p~NQLkm~EDAlVIYRitRA--PeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLp 334 (521) T protein:vir:65 257 PANQLKLLEDAMVVYRITRA--PERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTEDYWLQ 334 (521) T ss_pred hHHhhHHHHhhHHHHhhhcc--ccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccchhhhhccc Confidence 875555555554444 2221 110 111122344444445667777777766 4334333 23 Q ss_pred -----CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhc--cccccc---chhhHHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_021302. 284 -----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLN--LDGKGG---SYALASVQADT-FVQSVQTVADEIRD 352 (484) Q Consensus 284 -----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt--~~~~gG---s~A~~evh~~v-~~~~~~aD~~~i~~ 352 (484) .|++|.++.++.+. .-.+=++|..+.+-+++..+.-- .++++| +++..-+..++ |...+......++. T Consensus 335 RReGgrgTEItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EItRDEiKF~KFI~rLR~rFs~ 413 (521) T protein:vir:65 335 RRDGKAITDVTTLPGASGM-SDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTLQSQFSE 413 (521) T ss_pred ccCCCCccceeecccCCCc-ChHHHHHHHHHHHHHHhCCCceeccCCCCcceeccccchhhHHHHHHHHHHHHHHHHHHH Confidence 47889888764332 33445789999999997765533 343322 34444444454 34445566666666 Q ss_pred HHHHHHHHHHHHhCCCCc------cccceEEecCCC--C---cHHHHHHHHHHHHhcCcccCCcccHHHHHHH-hCCCCC Q lcl|NC_021302. 353 VAQAHVVEDIVDVNWGED------EPAPLLVFDEIG--S---RQDATAAALQMLVNAGLLTPDPRLEAFLRDA-AGLPGP 420 (484) Q Consensus 353 ~ln~qli~~l~~~Nf~~~------~~~P~~~~~~~~--~---~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~-~glp~p 420 (484) .|..-|-..|+--+.-.. ...-+|.|.... . +.+.+.+++..|..+-=.+....+.+|+++. +.+... T Consensus 414 lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~S~dyi~k~ILr~tDe 493 (521) T protein:vir:65 414 VLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKYTDD 493 (521) T ss_pred HHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHH Confidence 666544444544443211 122244442211 1 2233445565555443233445688898665 444322 Q ss_pred CCCcc--ccc-ccCCCcCCCccccCCCC Q lcl|NC_021302. 421 DPDAD--DDE-STADTGQDEPETDEPAL 445 (484) Q Consensus 421 ~~~e~--~~~-~~~~~~~~~~~~~~~~~ 445 (484) +-.+. ... ....+--+.|+.+...- T Consensus 494 ei~~~~k~I~~E~~~~~~~~p~~~~~~f 521 (521) T protein:vir:65 494 QMDTEKKQIEEEANDPRFKQTPDEIEDF 521 (521) T ss_pred HHHHHHHHHHHhhhCCCCCCCcccccCC Confidence 11110 000 00000000111111000 No 220 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=90.17 E-value=0.022 Score=29.64 Aligned_cols=457 Identities=12% Similarity=0.086 Sum_probs=171.7 Q ss_pred CCCCCCCccceeeeecc-cccchhh---hhhhcccccccccccccchHHHHHH-H-----------------------Hh Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNP-LAGFGTF---LAQGLDQFEQVDELRWPNSVYTYTR-M-----------------------CR 52 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~-~~~~~~~---~~~~~~~~~~~~~lr~~~~~~~y~~-m-----------------------~~ 52 (484) =+|++..-.-|.+-... .-+.++. .+..+...-.+.+..|...++.|.. . .- T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ki 83 (641) T protein:vir:94 4 EMPTPIIEDKESAKRKLSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWRHRI 83 (641) T ss_pred CCCcccccCCcchhhcCCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcccccc Confidence 23333332222221111 1112222 2222333333344444333222210 0 01 Q ss_pred cchHHHHHHHHHHHHhhC-----CCc-EEecCC-CCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HH Q lcl|NC_021302. 53 EEARIASVLRAIGLPIRR-----TDW-RIRPNG-ARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KS 124 (484) Q Consensus 53 ~D~~v~s~l~~r~~~v~~-----~~~-~v~p~~-~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a 124 (484) .|+++..+++.....+.+ .+| +++|.+ ++.+.++.+.+.+... + ...+|.+.+..++ ++ T Consensus 84 ~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~------------l-~~~~~~~~~~~~~~d~ 150 (641) T protein:vir:94 84 NTGHTFEVVETLVAYFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTK------------L-EAASIRDIFETYVRNL 150 (641) T ss_pred cchhHHHHHHHHhhHHhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHH------------H-hhcchHHHHHHHHHHH Confidence 355555555555554443 355 677754 3444455444433321 1 2334566555554 78 Q ss_pred HhhcceeeeEEEeec-----------CCee-------------eeeeeeeeCccceee-------------ee-ec---- Q lcl|NC_021302. 125 LQFGHAVFEQTYFYE-----------GGRF-------------WLKRLAPRPQSSIAY-------------WN-VD---- 162 (484) Q Consensus 125 ~~~G~s~~Eivw~~~-----------~g~~-------------~~~~l~~r~~~~~~~-------------~~-~~---- 162 (484) +.+|.+++.+-|... ++.+ .-.++.+++|..|.+ ++ +- T Consensus 151 ~~~g~~iv~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~dps~~~~~~~f~~~r~t~~t~~ 230 (641) T protein:vir:94 151 VLYGVSTYRLGWDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWLDTSGGKNTGTFVRLRHTREELH 230 (641) T ss_pred hhcCceEEEeehhhHHHHhhhhhcccchhhcccccccceecccceeeEEecchhheeecCCCCcccccceehhhhHHHHH Confidence 889988877766421 1110 001122222222210 00 00 Q ss_pred ---CCCc----eeeeecccccc------------ccc--ccce-----------------eccCCC-----Cccccc-cc Q lcl|NC_021302. 163 ---RDGG----LISIQQWPAGT------------FGG--PGMV-----------------VMAPNS-----MGPAIP-VE 198 (484) Q Consensus 163 ---~dg~----l~~~~q~~~~~------------~~~--~~~~-----------------~~~~~~-----~~~~lp-~~ 198 (484) .+|. .+......... .+. ...+ ....+. .+.+.. .. T Consensus 231 ~l~~eg~~~~d~v~~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g~~il~~~~~~~~d~~ 310 (641) T protein:vir:94 231 ELVTSGYYDLDLTQVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGS 310 (641) T ss_pred HHHhcCCCChhhcchhhcccccccccccccccccccccccceeeeeeeeccCCCceeeEEEEEeCCEEeecccccccCcC Confidence 0000 00000000000 000 0000 000000 011100 12 Q ss_pred ceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCce Q lcl|NC_021302. 199 QLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESA 278 (484) Q Consensus 199 k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a 278 (484) -|+++++....++.||.|....|......++...+.-+..+++...| |+. ...+......++ ....| + T Consensus 311 Pf~~~r~~~~~~~~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p-~~~-~~~~~~~~~~~l--------~~~PG--~ 378 (641) T protein:vir:94 311 PFVTTTLLPDRDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINK-MWT-LVEDGILKREDV--------KAKPG--A 378 (641) T ss_pred CeEEecceecCCcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCC-eee-ecccccccccee--------eccCC--c Confidence 58999999999999999999999999999999999999998887554 333 332222222111 11122 2 Q ss_pred EEEccCCceEEEecccCC-chhHHHHHHHHHHHHHHHHhhhhhcccc--cccch-hhHHHH--HHHHHHHHHHHHHHHHH Q lcl|NC_021302. 279 GLALTAGEEAGILSPNGT-PLDPRRAIEYHDHQMALVALAHFLNLDG--KGGSY-ALASVQ--ADTFVQSVQTVADEIRD 352 (484) Q Consensus 279 ~~vip~~~~ie~~~~~~~-~~~~~~li~~~d~~Isk~ilGqtlt~~~--~gGs~-A~~evh--~~v~~~~~~aD~~~i~~ 352 (484) .+.......+..+..... -......+++++..|.+++....+.+.. ..|.. -..+|. .+.....+..-.+.+.+ T Consensus 379 ii~~~~~~~v~pl~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~ 458 (641) T protein:vir:94 379 VFKVAQHGSLQPIDMGRQDFVVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIED 458 (641) T ss_pred ceeeCCCCcceeecCCccccchhHHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 333333344555533221 1234567888888888886555443322 12221 223343 22333344455566665 Q ss_pred HHHHHHHHHHHHhC--CCCc-----------------cccc-eE--Ee--cCCCCc-HHHHHHHHHHHH---hcCcccCC Q lcl|NC_021302. 353 VAQAHVVEDIVDVN--WGED-----------------EPAP-LL--VF--DEIGSR-QDATAAALQMLV---NAGLLTPD 404 (484) Q Consensus 353 ~ln~qli~~l~~~N--f~~~-----------------~~~P-~~--~~--~~~~~~-~~~~ae~~~~L~---~~G~~~~~ 404 (484) .+-..|+.+++.++ +... ...| .+ .+ ...... ....+..++.|. +.....|. T Consensus 459 e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L~~~~~iv~l~~~q~~~~~~~i~~l~~~~~~~a~~P~ 538 (641) T protein:vir:94 459 SSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEYLHYPYKFLALGANYVVERERMVTDLLQLLDISGRVPQ 538 (641) T ss_pred HHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCccceeeeeeEeecchhHHHHHHHHHHHHHHHHHHhhcChh Confidence 55555666655544 1100 0011 11 11 111111 111122222221 21111221 Q ss_pred -------cccHHHHHHHhCCCCCCCCcccccccCCCc---CCCccccCCCCcccccccccc--------ccccccccccc Q lcl|NC_021302. 405 -------PRLEAFLRDAAGLPGPDPDADDDESTADTG---QDEPETDEPALPNTSGTTSTT--------NAPQARKRPRG 466 (484) Q Consensus 405 -------~~~~~~i~e~~glp~p~~~e~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~ 466 (484) ...-..+.+..|++.|...-......+.+. +.+.+........+-+..... ...++.....+ T Consensus 539 v~d~~d~~~~~~~~~~~~g~~~p~~~ir~~~~~~~~~~~~~~~~q~~~~~~a~~~~~~~~~~a~~~~~~~~~~~~~~~~~ 618 (641) T protein:vir:94 539 IGQSLDYALILEDLLRQMRFTDPMRYIKKAEAPPAAPPIAPAEPGALPPEMMNSVGGGLNDQAIAGMTPEDVSDLASRIG 618 (641) T ss_pred hhhcCCHHHHHHHHHHHhCCCCchhhccCccCchhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHHHHHHhhc Confidence 111244567778887753211000000000 000000000000000000000 00000000111 Q ss_pred cchHHHhcC------cccCc-cc Q lcl|NC_021302. 467 RSPRDRRKT------PDGAM-PL 482 (484) Q Consensus 467 ~~~~~~~~~------~~~~~-~~ 482 (484) .++.+-.++ |+... .| T Consensus 619 ~~~~~~~~~~~~~~~~~~~~~~~ 641 (641) T protein:vir:94 619 IDTSDVAPEAMAAATQQITSGAL 641 (641) T ss_pred CCchhhhHHHHhcccccccccCC Confidence 111100000 00000 11 No 221 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=90.12 E-value=0.023 Score=29.61 Aligned_cols=411 Identities=11% Similarity=0.017 Sum_probs=162.2 Q ss_pred CCCCCCCc-----cceeeeecccccchhhhhhhcccc-----cccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhC Q lcl|NC_021302. 1 MAPKTVAP-----RTERGYVNPLAGFGTFLAQGLDQF-----EQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRR 70 (484) Q Consensus 1 ~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~ 70 (484) |-+..... .+.+ .+.-.....++.|-... ......-.+. . .+ ..+...-++.+...-+.+ T Consensus 22 l~~~~i~~li~~~~~~~---~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~-~----ki--~~n~~~~Iv~~~~~~l~G 91 (506) T protein:vir:94 22 LTPNKIMKFITHHFNYQ---RPRLEMLDDYYQGYNLKILDKQSRRHEDGKAD-H----RA--THSFAKYIADFQTSYSVG 91 (506) T ss_pred CCHHHHHHHHHHHHHHH---HHHHHHHHHHhcCCCccccccccccccccCCc-c----ee--ecchHHHHHHHhhhhhcc Confidence 11111000 0000 00000000111110000 0000000000 0 12 345666667777777788 Q ss_pred CCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCeeeeeeee Q lcl|NC_021302. 71 TDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRFWLKRLA 149 (484) Q Consensus 71 ~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~~~~~l~ 149 (484) -+..+.+.++. ..+.+.+.+ ..-+|+..+..+ .++..+|.+ ++++|.-.+|... +. T Consensus 92 ~p~~~~~~d~~--~~~~l~~~~-----------------~~N~~~~~~~~~~~~~~~~G~a-~~~v~~ded~~~~---i~ 148 (506) T protein:vir:94 92 NPINVKLPDDG--SNSGFDTFN-----------------KANDVDAENYDLFLDMSRYGRA-YEYVYRGEDNEEH---LA 148 (506) T ss_pred cCceeecCcch--HHHHHHHHH-----------------hccCHhHHHHHHHHHHHhcCeE-EEEEEecCCCeeE---EE Confidence 88777654332 222222222 122465555554 578889985 5677765667654 33 Q ss_pred eeCccceeeeeecC--CCceee-eeccc---ccc------------cccccceeccCC--------CCcccccccceEEE Q lcl|NC_021302. 150 PRPQSSIAYWNVDR--DGGLIS-IQQWP---AGT------------FGGPGMVVMAPN--------SMGPAIPVEQLVVY 203 (484) Q Consensus 150 ~r~~~~~~~~~~~~--dg~l~~-~~q~~---~~~------------~~~~~~~~~~~~--------~~~~~lp~~k~l~~ 203 (484) ..+|+.+. ..+++ .+.++. ++.+. ... ............ ....++..--++.| T Consensus 149 ~~~p~~~~-~v~dd~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 227 (506) T protein:vir:94 149 KLDPLDTF-VIYSTDVDPKPIMAVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIMGKMQVDTTKPITTFPVVEF 227 (506) T ss_pred EEcccceE-EEecCCCCCceEEEEEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCccceeccccccCCccceEEe Confidence 44555432 22221 122221 11000 000 000000000000 00111111112333 Q ss_pred eecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHH----------------HHHHHHH Q lcl|NC_021302. 204 THDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDD----------------RMDELLE 267 (484) Q Consensus 204 ~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~----------------~~~~l~~ 267 (484) + +|..|.|.+..+-...=.-...+..++..++.+..++.++.|......... ....... T Consensus 228 ~-----n~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (506) T protein:vir:94 228 K-----NSNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLE 302 (506) T ss_pred c-----CCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhH Confidence 2 245577777766555444466667777777765544444443211110000 0011112 Q ss_pred HHHHHhcCCceEEEcc---------CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHHH Q lcl|NC_021302. 268 IASNYSGGESAGLALT---------AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASVQADT 338 (484) Q Consensus 268 ~l~~~~~g~~a~~vip---------~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~evh~~v 338 (484) .+..+..+ ..+.++ .+.+++++........++..++.+.+.|...-.+..++.++.+|. ..| +.... T Consensus 303 ~~~~~~~~--~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n-~Sg-~Aik~ 378 (506) T protein:vir:94 303 LIKEMKDA--NMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFASN-SSG-VAMQY 378 (506) T ss_pred HHhhhhhc--CeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccc-chH-HHHHH Confidence 23333221 122333 244677777666667889999999999988866665655432221 112 12222 Q ss_pred H----HHHHHHHHHHHHHHHHHHHHHHHHHh----CCCC--ccccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCccc Q lcl|NC_021302. 339 F----VQSVQTVADEIRDVAQAHVVEDIVDV----NWGE--DEPAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRL 407 (484) Q Consensus 339 ~----~~~~~aD~~~i~~~ln~qli~~l~~~----Nf~~--~~~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~ 407 (484) . ...+..-.+.+...+. ++++.++.+ +... +....++.|. ..+.+..+.++++.+|+ |+ ++ T Consensus 379 ~~~~l~~k~~~k~~~~~~~l~-~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~--g~-----iS 450 (506) T protein:vir:94 379 KVLGTVELASTKRRMFERGLY-ARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKALVQAG--AT-----LP 450 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-----CC Confidence 2 2222333444555553 355554443 2111 1122467785 45678888999999884 54 24 Q ss_pred HHHHHHHhCC-CCCCCCcccccccCCCcCCCccccCCCCcccccccccccccccccccccc Q lcl|NC_021302. 408 EAFLRDAAGL-PGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGR 467 (484) Q Consensus 408 ~~~i~e~~gl-p~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 467 (484) .+.+.+.++. +.|..+-+..... .....+....... .+........+.......+ T Consensus 451 ~et~~~~lp~v~d~~~E~~ri~~E--~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~e~~ 506 (506) T protein:vir:94 451 QKYLYQQLPGVTNPQDIVDMMKEQ--SANGDYSFDQNGV---ISNDGQTNTTATQTDEEVR 506 (506) T ss_pred hHHHHHhCCCCCCHHHHHHHHHHH--HHHHhhcchhhcC---CCcccCccccccccccCCC Confidence 5677777643 3222110000000 0000000000000 0000000000000000001 No 222 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=89.99 E-value=0.023 Score=29.54 Aligned_cols=423 Identities=12% Similarity=0.065 Sum_probs=180.0 Q ss_pred CCCCCCCccceeeee-cccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEec-- Q lcl|NC_021302. 1 MAPKTVAPRTERGYV-NPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRP-- 77 (484) Q Consensus 1 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p-- 77 (484) .+|+.....+++..- +.....|.. +.+-.......-.....|+.|++|+ .++.|-++++.....+.-.+-.-.| T Consensus 36 ~~p~~~dGa~~i~~~~~~~~~~g~~--~~~y~~~e~~~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaIv~~~~~~pV~ 112 (524) T protein:vir:98 36 APPKNNDGAYEIETDLNNQKYAGVF--QQFYSGQDPAIQNKEQLINTYRGIM-SYPEVENAVSEIIDDAIVNEQGKDIIT 112 (524) T ss_pred cCCCCCCCceeecCCCCcceeccee--eeeccccccccchHHHHHHHHHHHh-hccchhhHHHhhhcceeEecCCCceEE Confidence 455555555444310 001011110 0000111111111346799999997 5999999999888765433211111 Q ss_pred -CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccc Q lcl|NC_021302. 78 -NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSS 155 (484) Q Consensus 78 -~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~ 155 (484) +=++-+..+.+.+-+.. .+...+.-.+|+.-..++. .-...|--.+.++...+.. -.+.+|...+|+. T Consensus 113 l~L~~~~~s~~iK~kI~e---------eF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~~~~-kGI~ELr~lDPr~ 182 (524) T protein:vir:98 113 MDLAKTNFSKAIQDKIVE---------EFDNVLNIYDFDNMGARLFRDWYVDSRIYFHKIMHKDES-KGIRELRQLDPRC 182 (524) T ss_pred EEecccccchHHHHHHHH---------HHHHHHHHhccchhhhHHHhhhhhcceeEEEEEEcCCCC-cceeeeeeeCCcc Confidence 11111222222222211 1111111222333222222 1223355556666553332 2567777888888 Q ss_pred eeeee---ec-CCCceee---eec----ccccccccccceeccCCCCcccccccceEEEeecCccCcccc---chhHHHH Q lcl|NC_021302. 156 IAYWN---VD-RDGGLIS---IQQ----WPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPGVWTG---NSLLRPA 221 (484) Q Consensus 156 ~~~~~---~~-~dg~l~~---~~q----~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~~p~G---~gll~~~ 221 (484) +.+.+ +. .+++... ... .+...... ..-.......++.||.. -|+|.|..-.+ ++ .|.|.++ T Consensus 183 i~~vr~~~~~~~~~~~~v~~~~~e~f~Y~~~~~~~~-~~g~~~~~~~~ikI~~d-AIvy~hSGL~d--~~~~iisyLhkA 258 (524) T protein:vir:98 183 MELIRESITETLDGGVKVFRGYREFFVYSAPKAGYT-YNGQIYQANQKIKIPRS-AIVYAHSGLED--CSNNIIGYLHRA 258 (524) T ss_pred ceeeeeccccccccchhhccceeeeeeeccCCCccc-cccceecCCCceeechh-heeeeccCccc--CCCCeeeehhHh Confidence 75532 11 1222110 000 00000000 00011122334566655 47777765432 22 3778888 Q ss_pred HHHHHHHHHHHHHHHHH-HHHhcCCcceEE---ecCCCCCCHHHHHHHHHHHHHHhcC----CceEEE------------ Q lcl|NC_021302. 222 YKNWKLKDELIRIEAAA-IRRHGIGVPYLK---GNEADSEDDDRMDELLEIASNYSGG----ESAGLA------------ 281 (484) Q Consensus 222 ~~~~~~K~~~~~~w~~f-~Er~~~G~P~~~---gk~~~~~~~~~~~~l~~~l~~~~~g----~~a~~v------------ 281 (484) ..++==-++..-..+.| +-| -|=+. ...+.-...+.-+.|.+++..+++. .++|=| T Consensus 259 iKp~NQLkm~EDAlVIYRitR----APeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGevrddrk~msMlED 334 (524) T protein:vir:98 259 VKPANQLRLLEDAMVIYRITR----APERRVFYIDVGQMGGNKATQYVNNIAQGLKNRVVYDARTGTVKNQQNNLSMTED 334 (524) T ss_pred hHhHHhhHHHHhhHHHHhhhc----cccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccCceeeccccccchhhh Confidence 77775555554444444 222 12111 1223344444445677777666532 112222 Q ss_pred --cc-----CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccch----hhHHHHHHH-HHHHHHHHHHH Q lcl|NC_021302. 282 --LT-----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSY----ALASVQADT-FVQSVQTVADE 349 (484) Q Consensus 282 --ip-----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~----A~~evh~~v-~~~~~~aD~~~ 349 (484) +| .|++|.++.++.+. .-.+=++|..+.+-+++..+.--.+.++|++ +..-+..++ |...+...... T Consensus 335 yWLpRReGgrgTEItTLpggqnl-gem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEiKF~KFI~rLR~r 413 (524) T protein:vir:98 335 YWLMRRDGKAITEVSTLPGGQNF-SDMDDIKWFNRKLYEALRVPLSRMPRDDGGMQIGGGGEITRDELKFSKFIRTLQIQ 413 (524) T ss_pred hcccccCCCCccceeeccccCCc-ChHHHHHHHHHHHHHHhCCCceeccCCCCccccccccchhHHHHHHHHHHHHHHHH Confidence 23 47889888765332 2344578999999999766543332112222 222223333 33445556666 Q ss_pred HHHHHHHHHHHHHHHhCCCC------ccccceEEecCCC--C---cHHHHHHHHHHHHhcCcccCCcccHHHHHHH-hCC Q lcl|NC_021302. 350 IRDVAQAHVVEDIVDVNWGE------DEPAPLLVFDEIG--S---RQDATAAALQMLVNAGLLTPDPRLEAFLRDA-AGL 417 (484) Q Consensus 350 i~~~ln~qli~~l~~~Nf~~------~~~~P~~~~~~~~--~---~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~-~gl 417 (484) ++..|..-|-..|+--+.-. -...-+|.|.... . +.+.+.+++..|..+-=.+....+.+|+++. +.+ T Consensus 414 Fs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~ 493 (524) T protein:vir:98 414 FSPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGVVGKYVSHKYIMKEILRM 493 (524) T ss_pred HHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhccccccccchHHHHHHHhcc Confidence 66666654444444444321 1122244442211 1 2233445565555433334446688888655 454 Q ss_pred CCCCCCcc--cc-cccCCCcCCCccccCCCC Q lcl|NC_021302. 418 PGPDPDAD--DD-ESTADTGQDEPETDEPAL 445 (484) Q Consensus 418 p~p~~~e~--~~-~~~~~~~~~~~~~~~~~~ 445 (484) ...+-.+. .. .....+--+.|+....+- T Consensus 494 tDeei~~~~k~I~~E~k~~~~~~p~~e~~~f 524 (524) T protein:vir:98 494 SDEDIDEQAKLIEEESKEERFKNPEAEEENF 524 (524) T ss_pred CHHHHHHHHHHHHHHHhCCCCcCCccccccC Confidence 32211110 00 000011111111111111 No 223 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=89.72 E-value=0.025 Score=29.39 Aligned_cols=418 Identities=11% Similarity=0.085 Sum_probs=176.4 Q ss_pred CCCCCCCccceeeeeccc--ccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEec- Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPL--AGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRP- 77 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p- 77 (484) .+|+......++.+.... ++++... +.+.....+........|+.|++|+ .++.|-++++.....+.-.+-.-.| T Consensus 32 ~~p~~~Dga~e~~~~~~~~a~~~~g~~-~~~~g~~e~~~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~d~~~~pV 109 (524) T protein:vir:10 32 TAPKLDDGAREFEVSSNEAASPYNAAF-QTIFGSYEPGMKTTRELIDTYRNLM-NNYEVDNAVSEIVSDAIVYEDDTEVV 109 (524) T ss_pred cCccCCCCceeeeecccccccccceee-eehhcccccccchHHHHHHHHHHHh-hccchhhHHHHhhcceeEecCCCceE Confidence 333333333444332211 1211111 0000001111112356799999997 5999999999888766543221111 Q ss_pred --CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhccee-----------eeEEEeecCCeee Q lcl|NC_021302. 78 --NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAV-----------FEQTYFYEGGRFW 144 (484) Q Consensus 78 --~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~-----------~Eivw~~~~g~~~ 144 (484) +=+..+..+.+.+-+.. -|+.++ .+|+.--+||.. +.++=...+..-. T Consensus 110 ~l~L~~~~~s~~iK~kI~e------------------eF~~Il-~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~G 170 (524) T protein:vir:10 110 ALNLDKSKFSPKIKNMMLD------------------EFNDVL-NHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEG 170 (524) T ss_pred EEEecCcCcchHHHHHHHH------------------HHHHHH-HHhccchhhhHHHhhheeeeEEEEEEEeeCCCcccc Confidence 11111222222222211 133333 334444444433 3333232333334 Q ss_pred eeeeeeeCccceeeee---ecCCCceeeeec---cc---ccccccccceeccCCCCcccccccceEEEeecCccC--ccc Q lcl|NC_021302. 145 LKRLAPRPQSSIAYWN---VDRDGGLISIQQ---WP---AGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPG--VWT 213 (484) Q Consensus 145 ~~~l~~r~~~~~~~~~---~~~dg~l~~~~q---~~---~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~--~p~ 213 (484) +.+|...+|+.+.+.+ ...+++...+.- +. .+.....-.-.......++.||.. .|+|.|..-.+ .-. T Consensus 171 I~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~d-AI~y~hSGL~d~~~~~ 249 (524) T protein:vir:10 171 IKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKA-AIVYAHSGLVDCCGKN 249 (524) T ss_pred ceeeeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchh-heeeeeccceeCCCCc Confidence 5566677777775522 222222111110 00 000000000001122345555544 58888854211 112 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHH-HHHhcCCcceEE---ecCCCCCCHHHHHHHHHHHHHHhcC----CceEEE---- Q lcl|NC_021302. 214 GNSLLRPAYKNWKLKDELIRIEAAA-IRRHGIGVPYLK---GNEADSEDDDRMDELLEIASNYSGG----ESAGLA---- 281 (484) Q Consensus 214 G~gll~~~~~~~~~K~~~~~~w~~f-~Er~~~G~P~~~---gk~~~~~~~~~~~~l~~~l~~~~~g----~~a~~v---- 281 (484) =.|.|.++..++==-++..-..+.| +-| -|=+. ...+.-...+.-+.|.+++..+++. .++|=| T Consensus 250 i~gyLhkAiKp~NQLkmlEDAlVIYRitR----APeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddr 325 (524) T protein:vir:10 250 IIGYLHRAVKPANQLKLLEDAVVIYRITR----APDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQ 325 (524) T ss_pred eeccchhhhHHHHhhhHHHhhHHHHhhhc----cccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccch Confidence 2478888887775555554444444 222 12111 1223334444445666776666543 112211 Q ss_pred ----------cc-----CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhc--ccccc---cchhhHHHHHHH-HH Q lcl|NC_021302. 282 ----------LT-----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLN--LDGKG---GSYALASVQADT-FV 340 (484) Q Consensus 282 ----------ip-----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt--~~~~g---Gs~A~~evh~~v-~~ 340 (484) +| .|++|.++.++.+. .-.+=++|..+.+-+++..+.-- +++.| ++++..-+..++ |. T Consensus 326 k~msMlEDyWLpRReGgrgTEItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~ 404 (524) T protein:vir:10 326 HNMSMTEDYWLQRRDGKAVTEVDTLPGADNT-GNMEDVRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFA 404 (524) T ss_pred hhhhhHhhhcccccCCCcccceeeccccCCc-ChHHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHH Confidence 22 47889888765332 23445789999999997665433 33222 234433344444 33 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhC------CCCccccceEEecCCC--C---cHHHHHHHHHHHHhcCcccCCcccHH Q lcl|NC_021302. 341 QSVQTVADEIRDVAQAHVVEDIVDVN------WGEDEPAPLLVFDEIG--S---RQDATAAALQMLVNAGLLTPDPRLEA 409 (484) Q Consensus 341 ~~~~aD~~~i~~~ln~qli~~l~~~N------f~~~~~~P~~~~~~~~--~---~~~~~ae~~~~L~~~G~~~~~~~~~~ 409 (484) ..+......++..|..-|-..|+--+ |..-...-+|.|.... . +.+.+.+++..|..+-=.+....+.+ T Consensus 405 KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~ 484 (524) T protein:vir:10 405 KFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLTMAEPFIGKYISHR 484 (524) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhH Confidence 44556666666666554444444333 1111122244442211 1 22334455655554332333456788 Q ss_pred HHHHH-hCCCCCCCCcc--ccc-ccCCCcCCCccccCCCC Q lcl|NC_021302. 410 FLRDA-AGLPGPDPDAD--DDE-STADTGQDEPETDEPAL 445 (484) Q Consensus 410 ~i~e~-~glp~p~~~e~--~~~-~~~~~~~~~~~~~~~~~ 445 (484) |+++. +.+...+-.+. ... ....+--+.|+.....- T Consensus 485 yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 485 TAMKDILQMTDEEIEQEAKQIEEESKEARFQDPDQEQEDF 524 (524) T ss_pred HHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 88655 44432211100 000 00000000111100000 No 224 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=89.38 E-value=0.027 Score=29.21 Aligned_cols=418 Identities=12% Similarity=0.084 Sum_probs=176.4 Q ss_pred CCCCCCCccceeeeeccc--ccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEec- Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPL--AGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRP- 77 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p- 77 (484) .+|+......++.+.... ++++... +.+.....+........|+.|++|+ .++.|-++++.....+.-.+-.-.| T Consensus 32 ~~p~~~Dga~e~~~~~~~~a~~~~g~~-~~~~g~~e~~~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~d~~~~pV 109 (524) T protein:vir:72 32 TAPKLDDGAREFEVSSNEAASPYNAAF-QTIFGSYEPGMKTTRELIDTYRNLM-NNYEVDNAVSEIVSDAIVYEDDTEVV 109 (524) T ss_pred cCccCCCCceeeeecccccccccceee-eehhcccccccchHHHHHHHHHHHh-hccchhhHHHHhhcceeEecCCCceE Confidence 333333333444332211 1211111 0000001111112356799999997 5999999999888766543221111 Q ss_pred --CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhccee-----------eeEEEeecCCeee Q lcl|NC_021302. 78 --NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAV-----------FEQTYFYEGGRFW 144 (484) Q Consensus 78 --~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~-----------~Eivw~~~~g~~~ 144 (484) +=+..+..+.+.+-+.. -|+.++ .+|+.--+||.. +.++=...+..-. T Consensus 110 ~l~L~~~~~s~~iK~kI~e------------------eF~~Il-~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~G 170 (524) T protein:vir:72 110 ALNLDKSKFSPKIKNMMLD------------------EFSDVL-NHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEG 170 (524) T ss_pred EEEecCcCcchHHHHHHHH------------------HHHHHH-HHhccchhhhHHHhhheeeeEEEEEEEEeCCCcccc Confidence 11111222222222211 133333 334444444433 3333222333334 Q ss_pred eeeeeeeCccceeeee---ecCCCceeeeec---cc---ccccccccceeccCCCCcccccccceEEEeecCccC--ccc Q lcl|NC_021302. 145 LKRLAPRPQSSIAYWN---VDRDGGLISIQQ---WP---AGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDMDPG--VWT 213 (484) Q Consensus 145 ~~~l~~r~~~~~~~~~---~~~dg~l~~~~q---~~---~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~~~~--~p~ 213 (484) +.+|...+|+.+.+.+ ...+++...+.- +. .+.....-.-.......++.||.. .|+|.|..-.+ .-. T Consensus 171 I~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~d-AI~y~hSGL~d~~~~~ 249 (524) T protein:vir:72 171 IKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKA-AVVYAHSGLVDCCGKN 249 (524) T ss_pred ceeeeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchh-heeeeeccceeCCCCc Confidence 5566677777775522 222222111110 00 000000000001122345555544 58888854211 111 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHH-HHHhcCCcceEE---ecCCCCCCHHHHHHHHHHHHHHhcC----CceEEE---- Q lcl|NC_021302. 214 GNSLLRPAYKNWKLKDELIRIEAAA-IRRHGIGVPYLK---GNEADSEDDDRMDELLEIASNYSGG----ESAGLA---- 281 (484) Q Consensus 214 G~gll~~~~~~~~~K~~~~~~w~~f-~Er~~~G~P~~~---gk~~~~~~~~~~~~l~~~l~~~~~g----~~a~~v---- 281 (484) =.|.|.++..++==-++..-..+.| +-| -|=+. ...+.-...+.-+.|.+++..+++. .++|=| T Consensus 250 i~gyLhkAiKp~NQLkmlEDAlVIYRitR----APeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddr 325 (524) T protein:vir:72 250 IIGYLHRAVKPANQLKLLEDAVVIYRITR----APDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQ 325 (524) T ss_pred eeccchhhhHhHHhhhHHHhhHHHHhhhc----cccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccch Confidence 2478888887775555554444444 222 12111 1223334444445666776666543 112211 Q ss_pred ----------cc-----CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhc--ccccc---cchhhHHHHHHH-HH Q lcl|NC_021302. 282 ----------LT-----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLN--LDGKG---GSYALASVQADT-FV 340 (484) Q Consensus 282 ----------ip-----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt--~~~~g---Gs~A~~evh~~v-~~ 340 (484) +| .|++|.++.++.+. .-.+=++|..+.+-+++..+.-- +++.| ++++..-+..++ |. T Consensus 326 k~msMlEDyWLpRReGgrgTEItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~ 404 (524) T protein:vir:72 326 HNMSMTEDYWLQRRDGKAVTEVDTLPGADNT-GNMEDIRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFA 404 (524) T ss_pred hhhhhHhhhcccccCCCcccceeeccccCCc-ChHHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHH Confidence 22 47889888765332 23445789999999997665433 33222 234433344444 33 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhC------CCCccccceEEecCCC--C---cHHHHHHHHHHHHhcCcccCCcccHH Q lcl|NC_021302. 341 QSVQTVADEIRDVAQAHVVEDIVDVN------WGEDEPAPLLVFDEIG--S---RQDATAAALQMLVNAGLLTPDPRLEA 409 (484) Q Consensus 341 ~~~~aD~~~i~~~ln~qli~~l~~~N------f~~~~~~P~~~~~~~~--~---~~~~~ae~~~~L~~~G~~~~~~~~~~ 409 (484) ..+......++..|..-|-..|+--+ |..-...-+|.|.... . +.+.+.+++..|..+-=.+....+.+ T Consensus 405 KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~ 484 (524) T protein:vir:72 405 KFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLTMAEPFIGKYISHR 484 (524) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhH Confidence 44556666666666554444444333 1111122244442211 1 22334455655554332333456788 Q ss_pred HHHHH-hCCCCCCCCcc--ccc-ccCCCcCCCccccCCCC Q lcl|NC_021302. 410 FLRDA-AGLPGPDPDAD--DDE-STADTGQDEPETDEPAL 445 (484) Q Consensus 410 ~i~e~-~glp~p~~~e~--~~~-~~~~~~~~~~~~~~~~~ 445 (484) |+++. +.+...+-.+. ... ....+--+.|+.....- T Consensus 485 yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:72 485 TAMKDILQMTDEEIEQEAKQIEEESKEARFQDPDQEQEDF 524 (524) T ss_pred HHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 88655 44432211110 000 00000000011100000 No 225 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=88.40 E-value=0.033 Score=28.74 Aligned_cols=393 Identities=11% Similarity=0.070 Sum_probs=151.7 Q ss_pred ccchh---hhhhhcccccccccccccchHHHHHHHHhcchH-----------------------------HHH-HHHHHH Q lcl|NC_021302. 19 AGFGT---FLAQGLDQFEQVDELRWPNSVYTYTRMCREEAR-----------------------------IAS-VLRAIG 65 (484) Q Consensus 19 ~~~~~---~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~-----------------------------v~s-~l~~r~ 65 (484) -++++ .++++|..-++... -...+..|..+.+ |.. +.+ +..+.- T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~--~~~~~~~~~~~~~-~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~l~~~i~~~~A 77 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGS--EPELIPKYLPLVP-DNQKEWSKDSYLTSLWAQGYVPTVHDKLMNSGTGNEIVVVAA 77 (518) T ss_pred CcchhhHHHHHHHhhcCCCCcc--chhccHHHhhhcc-cchhhhhhhhhhhhhcccCCCCccccccccCChHHHHHHHHH Confidence 11111 23444333222110 0112222222211 110 011 112222 Q ss_pred HHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHH-HHHHHHHhhcceeeeEEEeecCCeee Q lcl|NC_021302. 66 LPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHL-RLALKSLQFGHAVFEQTYFYEGGRFW 144 (484) Q Consensus 66 ~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i-~~~l~a~~~G~s~~Eivw~~~~g~~~ 144 (484) +-|.+-+-.|+-.+.+....+.+ +....+.+....|...+ ..+..|+..|=.++=+.|. +|.+ T Consensus 78 ~ll~~e~~~i~v~~~~~~d~e~~-------------~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d--~~~~- 141 (518) T protein:vir:78 78 EYISGKPLSIDVTGVNGSKDENL-------------TKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINIL--NGRP- 141 (518) T ss_pred HhhcCCCceEEecCccccCcHHH-------------HHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEE--CCee- Confidence 22333332332111100000111 11122222233455544 4456788899888877664 3443 Q ss_pred eeeeeeeCccceeeeeecCCCceeeeec---ccccccc-ccc--------------------c---eec-cCCCCc---- Q lcl|NC_021302. 145 LKRLAPRPQSSIAYWNVDRDGGLISIQQ---WPAGTFG-GPG--------------------M---VVM-APNSMG---- 192 (484) Q Consensus 145 ~~~l~~r~~~~~~~~~~~~dg~l~~~~q---~~~~~~~-~~~--------------------~---~~~-~~~~~~---- 192 (484) ++.++++..|.-. ..+|+++.+.- ...+... ... . ..+ ...+.+ T Consensus 142 --~i~~v~ad~~~P~--~~~g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~ 217 (518) T protein:vir:78 142 --SISVHSSSQFWID--FKNNEPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPIS 217 (518) T ss_pred --EEEEEcCCeeEEE--eecCcEEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCccccccc Confidence 2344444433211 11222221100 0000000 000 0 000 000000 Q ss_pred --------------cccc---------ccceEEEeec-----CccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021302. 193 --------------PAIP---------VEQLVVYTHD-----MDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGI 244 (484) Q Consensus 193 --------------~~lp---------~~k~l~~~~~-----~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~ 244 (484) ..++ +.-|++|... ...++|+|.|.+..+.-..-.-...+..|+.-++. T Consensus 218 ~~~~~~~l~~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~--- 294 (518) T protein:vir:78 218 AERLPEQITSYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEK--- 294 (518) T ss_pred ccccccccccccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHh--- Confidence 0000 1124444332 23578999999999887776667777777766663 Q ss_pred CcceEEecC------CCCCCHHHHHHHHHHHHHHhcCCceEEEccC----C----ceEEEecccCCchhHHHHHHHHHHH Q lcl|NC_021302. 245 GVPYLKGNE------ADSEDDDRMDELLEIASNYSGGESAGLALTA----G----EEAGILSPNGTPLDPRRAIEYHDHQ 310 (484) Q Consensus 245 G~P~~~gk~------~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~----~----~~ie~~~~~~~~~~~~~li~~~d~~ 310 (484) |-+-+.++. ..+...... ..+..+.+.+..++- + ..|+.++..=....|...++.+-++ T Consensus 295 g~~~i~v~~~~l~~~~~~~~~~~~-------~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~ 367 (518) T protein:vir:78 295 TKTKIAASERMFRKKVNKSTDKEE-------WSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQK 367 (518) T ss_pred CCceeeechhHhccCCCCCCCccc-------cccCCCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHH Confidence 555444431 111111000 001112223333221 1 1244444444445666666666666 Q ss_pred HHHHH-hh-hhhcccccccchhhHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHhC---CC------Ccc-ccceE Q lcl|NC_021302. 311 MALVA-LA-HFLNLDGKGGSYALASVQADT--FVQSVQTVADEIRDVAQAHVVEDIVDVN---WG------EDE-PAPLL 376 (484) Q Consensus 311 Isk~i-lG-qtlt~~~~gGs~A~~evh~~v--~~~~~~aD~~~i~~~ln~qli~~l~~~N---f~------~~~-~~P~~ 376 (484) |...+ ++ +++..+ +|.....++..+- .-..+..-.+.+...| ++|+..++.+- ++ ... .-+.+ T Consensus 368 ~~~~~G~s~~tfg~~--~~~~TATei~s~~~~~~~t~~~~~~~~e~al-~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i 444 (518) T protein:vir:78 368 AVSKSGYNPATFNLG--NREVKATEIWSLQDATVRKIEKKKRLIQNVY-EQMLWDFLYLLTGGTNNKEKAIMRDEIRVII 444 (518) T ss_pred HHHhhCCChhhcCcc--cccccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhcCccccccCCCceeEEE Confidence 65554 22 233222 2222223333222 2223444455555555 45776665531 11 111 22578 Q ss_pred EecC-CCCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHh-CCCCCCCCccccccc-CCCcCCCccccCCCCcccccc Q lcl|NC_021302. 377 VFDE-IGSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAA-GLPGPDPDADDDEST-ADTGQDEPETDEPALPNTSGT 451 (484) Q Consensus 377 ~~~~-~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~-glp~p~~~e~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 451 (484) .|++ ...|.++.++..++++.+|+.-. +.++++.+ +..+.+..+++..-. .+.....+++..-..-+.+|. T Consensus 445 ~f~D~i~~D~~~~~~~~~~~v~aGimS~----e~~i~~~~~~~~deea~~e~~ri~~E~~~~~~~~p~~~~g~~~~~g 518 (518) T protein:vir:78 445 EFPDPMSVNLNELSSTLNNMNSALAMSV----EEKVKLIHPKWEDEEIQAEVKRIYLENAIGEVPDPEAIGGMETKGG 518 (518) T ss_pred EeCCCCCCCHHHHHHHHHHHHhcCCCCH----HHHHHHhCCCCCHHHHHHHHHHHHHHhcccCCCCCccccCCCCCCC Confidence 8864 56788889999999999997431 34566643 553322121111100 011011111110000111111 No 226 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=88.17 E-value=0.034 Score=28.63 Aligned_cols=432 Identities=14% Similarity=0.070 Sum_probs=166.4 Q ss_pred CCC-CCCCccceeeeecccccch-----------hhhhhhcccccccccccc-----cchHHHHHHHHhcchHHHHHHHH Q lcl|NC_021302. 1 MAP-KTVAPRTERGYVNPLAGFG-----------TFLAQGLDQFEQVDELRW-----PNSVYTYTRMCREEARIASVLRA 63 (484) Q Consensus 1 ~~~-~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~lr~-----~~~~~~y~~m~~~D~~v~s~l~~ 63 (484) +|| .-|.| | +.+....-.. ..+.+.+... ..+.|.| =-++..-..|. +-+.+.++... T Consensus 56 ~~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~F~Gy~~la~la-Q~~eyr~~~~~ 130 (694) T protein:vir:10 56 AAPVAEPSP-S--LRLARQFEVDVSNYTPRERRAASYALDFNGT-SMDALSFVTSSGFPGFPTLVLLA-QLPEYRAMHEV 130 (694) T ss_pred ccccCCCCc-c--hhhhhhccccccCCCccccchhhhhhccCcc-cccchhhhhccCcchHHHHHHHh-hccchhhHHHH Confidence 221 11222 1 1111100000 0011110000 0011111 01233334454 45667777777 Q ss_pred HHHHhhCCCcEEecCC-------------------CCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCC-HHHHHHHHHH Q lcl|NC_021302. 64 IGLPIRRTDWRIRPNG-------------------ARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFS-WDQHLRLALK 123 (484) Q Consensus 64 r~~~v~~~~~~v~p~~-------------------~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~i~~~l~ 123 (484) .-....+. |.-.-.+ .+++..+.+..++.. .. |+.+..-+-- T Consensus 131 ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~er-----------------l~V~~~l~eaik~ 192 (694) T protein:vir:10 131 LADECIRT-WGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIER-----------------LRIRDAVRTTVIH 192 (694) T ss_pred HHHHhhcc-cceeccccchhhhhhcccccccccccccHHHHHHHHHHHHH-----------------HHHHHHHHHHHHh Confidence 66655554 6321111 111333334333321 12 3445555557 Q ss_pred HHhhcceeeeEEEeecCCe--------------eeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCC Q lcl|NC_021302. 124 SLQFGHAVFEQTYFYEGGR--------------FWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPN 189 (484) Q Consensus 124 a~~~G~s~~Eivw~~~~g~--------------~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~ 189 (484) +.+||-+++=+.=.-++.. -.++.|..++|.|+.--.++. ..-....++.+..+... T Consensus 193 aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~-------~dP~spdfgkP~~y~V~-- 263 (694) T protein:vir:10 193 DQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNS-------INPVADDFYKPSTWWMI-- 263 (694) T ss_pred hccccceEEEEEeecCccccccccccccccccCcceeeeEeecccccccchhhh-------ccchhhccCCCceEEEe-- Confidence 9999999844332222210 112224444444432110000 00011122222222221 Q ss_pred CCcccccccceEEEeecC------ccCccccchhHHHHHHHHH---HHHHHHHHHHHHHHHhcCCcceEE-e--cCCCCC Q lcl|NC_021302. 190 SMGPAIPVEQLVVYTHDM------DPGVWTGNSLLRPAYKNWK---LKDELIRIEAAAIRRHGIGVPYLK-G--NEADSE 257 (484) Q Consensus 190 ~~~~~lp~~k~l~~~~~~------~~~~p~G~gll~~~~~~~~---~K~~~~~~w~~f~Er~~~G~P~~~-g--k~~~~~ 257 (484) +..+-..+++.++-.. ..-+.+|.++...++..+. -.+.....-+ + ++ -+..+. . ..-.+. T Consensus 264 --G~~IH~SRL~~f~g~plPd~LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~v~~Li-~-~~---~v~~lk~dla~~L~~g 336 (694) T protein:vir:10 264 --GTEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIV-K-QF---SVSGILMDLAQALMPG 336 (694) T ss_pred --ceEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHH-H-hh---hhHHHHHHHHHhhcCh Confidence 1123333333333221 1235678998888875432 1122222111 1 11 111110 0 000111 Q ss_pred CHHHHHHHHHHHHHHhcCCceEEEccC-CceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhc---ccccccchhhHH Q lcl|NC_021302. 258 DDDRMDELLEIASNYSGGESAGLALTA-GEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLN---LDGKGGSYALAS 333 (484) Q Consensus 258 ~~~~~~~l~~~l~~~~~g~~a~~vip~-~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt---~~~~gGs~A~~e 333 (484) .+.++..-.++++.++.. ....++.+ +.+++.++++-+ ....++...-.+||-+. +-.+| ..+-.|=.|.|+ T Consensus 337 ~~~~l~~R~eli~~~Rsn-~G~~llDk~~Eefeq~stslS--GLddVi~qf~q~VAgaa-~IPltkLfGqSPkGlNATGE 412 (694) T protein:vir:10 337 ANVDLSMRAELINRYRDN-RNILFLDKATEEFFQFNTPLS--GLDALQAQAQEQMSAVS-HIPLIKLLGITPTGLNASSE 412 (694) T ss_pred hHHHHHHHHHHHHHhcCc-cceEEEecCCcceEEEecccC--CHHHHHHHHHHHHHhhh-cCchhhhhccCcccccccch Confidence 223344344666666544 45667874 678888776433 35566666666666652 22222 122235456677 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCC-CcH-------HHHHHHHHHHHhcCcccCCc Q lcl|NC_021302. 334 VQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIG-SRQ-------DATAAALQMLVNAGLLTPDP 405 (484) Q Consensus 334 vh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~-~~~-------~~~ae~~~~L~~~G~~~~~~ 405 (484) .-..+.-+.+++.....-..+-+.++.-|..--||...+-..|+|...- -+. ++.|+.++.+.+.|++.+ T Consensus 413 ~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idp~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~-- 490 (694) T protein:vir:10 413 GEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRP-- 490 (694) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCH-- Confidence 7777778888777654444443445555555446654433456665432 122 345677888999998654 Q ss_pred ccHHHHHHHhCCCCCCCC-----cccc-----------cccCC-CcCCCcccc--------CCCCccccccccccccccc Q lcl|NC_021302. 406 RLEAFLRDAAGLPGPDPD-----ADDD-----------ESTAD-TGQDEPETD--------EPALPNTSGTTSTTNAPQA 460 (484) Q Consensus 406 ~~~~~i~e~~glp~p~~~-----e~~~-----------~~~~~-~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~ 460 (484) +.++.++.-.....- ..+. ..+.. +....++.+ ...+|+........+. T Consensus 491 ---~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~~~~~~--- 564 (694) T protein:vir:10 491 ---DQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGATAPPTVANVNANVNP--- 564 (694) T ss_pred ---HHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCcccccccCCCcccccccccCc--- Confidence 578888654321110 0000 00000 000000000 0111111100000000 Q ss_pred cccccccchHH----HhcCcccC-------cccCC Q lcl|NC_021302. 461 RKRPRGRSPRD----RRKTPDGA-------MPLWD 484 (484) Q Consensus 461 ~~~~~~~~~~~----~~~~~~~~-------~~~~~ 484 (484) +.+.+.++.. +-...+|. .-.|. T Consensus 565 -~~ag~~~~~~~~ag~v~~~~g~vLl~kr~~g~W~ 598 (694) T protein:vir:10 565 -REAGAQDAAMRAAGAVYVVDGKVLLMKRPAGDWG 598 (694) T ss_pred -cccCCCCccceeeEEEEEeCCEEEEEEecCCCcc Confidence 0000000000 00000111 12244 No 227 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=88.01 E-value=0.035 Score=28.56 Aligned_cols=398 Identities=12% Similarity=0.055 Sum_probs=135.9 Q ss_pred CCCCCCCccc-------------------------------eeeeecccccchhhhhhhcccccccccccccchHHHHHH Q lcl|NC_021302. 1 MAPKTVAPRT-------------------------------ERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTR 49 (484) Q Consensus 1 ~~~~~~~~~~-------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~ 49 (484) --|+-+.+.. .+..++-....-.-++..+..+...-.+.....-+.+++ T Consensus 28 ~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~~~~A~lv~~e~~~i~~~d~~~~~~l~~ 107 (500) T protein:vir:30 28 DHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAAKKIASLVFNEQAEIKVDDDAANEFISE 107 (500) T ss_pred ccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHHHHHhhhhcCCcceEecCChHHHHHHHH Confidence 0011111110 111111111111111111111111111112223334445 Q ss_pred HHhcchHHHHHHHHHHHHhh--CC-CcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHh Q lcl|NC_021302. 50 MCREEARIASVLRAIGLPIR--RT-DWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQ 126 (484) Q Consensus 50 m~~~D~~v~s~l~~r~~~v~--~~-~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~ 126 (484) ++. |......+..-....+ |- =|++...+.... .+++...-..++ .| -. T Consensus 108 il~-~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~-I~~v~ad~~~P~----------------~~----------d~ 159 (500) T protein:vir:30 108 TLK-NDRFNKNFERYLESCLALGGLAMRPYVDGDKVR-VAFVQAPVFLPL----------------QS----------NT 159 (500) T ss_pred HHh-hccHHHHHHHHHHHHhhcCCEEEEEEEeCCceE-EEEEcCCeeEEE----------------EE----------cC Confidence 543 3333333322211111 11 111111111100 000000000000 00 00 Q ss_pred hcceeeeEE----EeecCCeeeeeeeeeeCccceeeeeecCCCc--eeeeeccc---cccccccccee--cc---CCCCc Q lcl|NC_021302. 127 FGHAVFEQT----YFYEGGRFWLKRLAPRPQSSIAYWNVDRDGG--LISIQQWP---AGTFGGPGMVV--MA---PNSMG 192 (484) Q Consensus 127 ~G~s~~Eiv----w~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~--l~~~~q~~---~~~~~~~~~~~--~~---~~~~~ 192 (484) -|-..+-+. ...+++......+..+ .. .+|+ .++..-+. ....|..-.+. +. ..... T Consensus 160 ~~~~~~a~~~~~~~~~~~~~~~yt~lE~h--------~~-~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~ 230 (500) T protein:vir:30 160 QDVSSAAVVIKSVKTINGKEVYYTLIEFH--------EW-QSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKV 230 (500) T ss_pred CCeEEEEEEEEEeeeecCCceEEEEEEEE--------EE-eCCceeEEEEEEEecccccccCcccccccccCCcCcceEe Confidence 011100011 0111111111111111 00 1111 11100000 00011000000 00 00000 Q ss_pred ccccccceEEEe----ecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecC------CCCCCHHHH Q lcl|NC_021302. 193 PAIPVEQLVVYT----HDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNE------ADSEDDDRM 262 (484) Q Consensus 193 ~~lp~~k~l~~~----~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~------~~~~~~~~~ 262 (484) ..++.--|.+++ .....++|+|.|.+..+....-.-...+..|+.-++. |-+.+..+. ..+.+.+.. T Consensus 231 ~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~---g~~~i~v~~~~l~~~~~~~~g~~~ 307 (500) T protein:vir:30 231 TDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM---GQRRVAVPESLTALTVRTTDGDVV 307 (500) T ss_pred ccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh---CcceeeechHHhcccCCCCCcccc Confidence 111111133433 2334678999999999988777777777777776663 333333221 001010000 Q ss_pred HHHHHHHHHHhcCCceEEEccC----CceEEEecccCCchhHHHHHHHHHHHHHHHH-hh-hhhcccccccchhhHHH-- Q lcl|NC_021302. 263 DELLEIASNYSGGESAGLALTA----GEEAGILSPNGTPLDPRRAIEYHDHQMALVA-LA-HFLNLDGKGGSYALASV-- 334 (484) Q Consensus 263 ~~l~~~l~~~~~g~~a~~vip~----~~~ie~~~~~~~~~~~~~li~~~d~~Isk~i-lG-qtlt~~~~gGs~A~~ev-- 334 (484) ....-+ .+...+..++. +..|+.++..-....|...++.+=++|+..+ ++ ++++.++.|-.-|. ++ T Consensus 308 ---~~~~~d--~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAt-ei~s 381 (500) T protein:vir:30 308 ---PRPRFE--SDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTAT-EIVS 381 (500) T ss_pred ---CCcccC--CCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHH-HHHH Confidence 000000 01112222221 1235444443334456666666666665543 33 23433333222232 33 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CC-CC---ccccceEEecC-CCCcHHHHHHHHHHHHhcCcccCCc Q lcl|NC_021302. 335 QADTFVQSVQTVADEIRDVAQAHVVEDIVDV----NW-GE---DEPAPLLVFDE-IGSRQDATAAALQMLVNAGLLTPDP 405 (484) Q Consensus 335 h~~v~~~~~~aD~~~i~~~ln~qli~~l~~~----Nf-~~---~~~~P~~~~~~-~~~~~~~~ae~~~~L~~~G~~~~~~ 405 (484) .+.-....+.+-.+.+..+|. +|++.++.+ ++ +. ...-+.+.|++ ...|.++.++.+.+++.+|+.. T Consensus 382 ~~~~~~~t~~~~~~~~~~al~-~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s--- 457 (500) T protein:vir:30 382 ENSDTYQMRNSIVALVEQSLK-ELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGT--- 457 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCC--- Confidence 223333444556667777774 577777643 22 11 11124677865 5677788888899999999743 Q ss_pred ccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 406 RLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 406 ~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) .+.++.+.||+++.+-.+.+... ...+ .++.+....... ...+ T Consensus 458 -~~~~i~~~~g~~eeea~~~l~~i--~~E~-~~~~~~~~~~~~-------~~g~ 500 (500) T protein:vir:30 458 -REMAIQKVLNVTEEKAQEIAAEI--NTGI-VDEINQQRTDTH-------LYGE 500 (500) T ss_pred -HHHHHHhcCCCCHHHHHHHHHHH--HHhc-cccCCCCCcccc-------ccCC Confidence 24678888898644221111110 0000 011111111000 0011 No 228 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=88.01 E-value=0.035 Score=28.56 Aligned_cols=398 Identities=12% Similarity=0.055 Sum_probs=135.9 Q ss_pred CCCCCCCccc-------------------------------eeeeecccccchhhhhhhcccccccccccccchHHHHHH Q lcl|NC_021302. 1 MAPKTVAPRT-------------------------------ERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTR 49 (484) Q Consensus 1 ~~~~~~~~~~-------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~ 49 (484) --|+-+.+.. .+..++-....-.-++..+..+...-.+.....-+.+++ T Consensus 28 ~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~~~~A~lv~~e~~~i~~~d~~~~~~l~~ 107 (500) T protein:vir:98 28 DHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAAKKIASLVFNEQAEIKVDDDAANEFISE 107 (500) T ss_pred ccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHHHHHhhhhcCCcceEecCChHHHHHHHH Confidence 0011111110 111111111111111111111111111112223334445 Q ss_pred HHhcchHHHHHHHHHHHHhh--CC-CcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHh Q lcl|NC_021302. 50 MCREEARIASVLRAIGLPIR--RT-DWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQ 126 (484) Q Consensus 50 m~~~D~~v~s~l~~r~~~v~--~~-~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~ 126 (484) ++. |......+..-....+ |- =|++...+.... .+++...-..++ .| -. T Consensus 108 il~-~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~-I~~v~ad~~~P~----------------~~----------d~ 159 (500) T protein:vir:98 108 TLK-NDRFNKNFERYLESCLALGGLAMRPYVDGDKVR-VAFVQAPVFLPL----------------QS----------NT 159 (500) T ss_pred HHh-hccHHHHHHHHHHHHhhcCCEEEEEEEeCCceE-EEEEcCCeeEEE----------------EE----------cC Confidence 543 3333333322211111 11 111111111100 000000000000 00 00 Q ss_pred hcceeeeEE----EeecCCeeeeeeeeeeCccceeeeeecCCCc--eeeeeccc---cccccccccee--cc---CCCCc Q lcl|NC_021302. 127 FGHAVFEQT----YFYEGGRFWLKRLAPRPQSSIAYWNVDRDGG--LISIQQWP---AGTFGGPGMVV--MA---PNSMG 192 (484) Q Consensus 127 ~G~s~~Eiv----w~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~--l~~~~q~~---~~~~~~~~~~~--~~---~~~~~ 192 (484) -|-..+-+. ...+++......+..+ .. .+|+ .++..-+. ....|..-.+. +. ..... T Consensus 160 ~~~~~~a~~~~~~~~~~~~~~~yt~lE~h--------~~-~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~ 230 (500) T protein:vir:98 160 QDVSSAAVVIKSVKTINGKEVYYTLIEFH--------EW-QSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKV 230 (500) T ss_pred CCeEEEEEEEEEeeeecCCceEEEEEEEE--------EE-eCCceeEEEEEEEecccccccCcccccccccCCcCcceEe Confidence 011100011 0111111111111111 00 1111 11100000 00011000000 00 00000 Q ss_pred ccccccceEEEe----ecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecC------CCCCCHHHH Q lcl|NC_021302. 193 PAIPVEQLVVYT----HDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNE------ADSEDDDRM 262 (484) Q Consensus 193 ~~lp~~k~l~~~----~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~------~~~~~~~~~ 262 (484) ..++.--|.+++ .....++|+|.|.+..+....-.-...+..|+.-++. |-+.+..+. ..+.+.+.. T Consensus 231 ~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~---g~~~i~v~~~~l~~~~~~~~g~~~ 307 (500) T protein:vir:98 231 TDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKM---GQRRVAVPESLTALTVRTTDGDVV 307 (500) T ss_pred ccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh---CcceeeechHHhcccCCCCCcccc Confidence 111111133433 2334678999999999988777777777777776663 333333221 001010000 Q ss_pred HHHHHHHHHHhcCCceEEEccC----CceEEEecccCCchhHHHHHHHHHHHHHHHH-hh-hhhcccccccchhhHHH-- Q lcl|NC_021302. 263 DELLEIASNYSGGESAGLALTA----GEEAGILSPNGTPLDPRRAIEYHDHQMALVA-LA-HFLNLDGKGGSYALASV-- 334 (484) Q Consensus 263 ~~l~~~l~~~~~g~~a~~vip~----~~~ie~~~~~~~~~~~~~li~~~d~~Isk~i-lG-qtlt~~~~gGs~A~~ev-- 334 (484) ....-+ .+...+..++. +..|+.++..-....|...++.+=++|+..+ ++ ++++.++.|-.-|. ++ T Consensus 308 ---~~~~~d--~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAt-ei~s 381 (500) T protein:vir:98 308 ---PRPRFE--SDQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTAT-EIVS 381 (500) T ss_pred ---CCcccC--CCcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHH-HHHH Confidence 000000 01112222221 1235444443334456666666666665543 33 23433333222232 33 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CC-CC---ccccceEEecC-CCCcHHHHHHHHHHHHhcCcccCCc Q lcl|NC_021302. 335 QADTFVQSVQTVADEIRDVAQAHVVEDIVDV----NW-GE---DEPAPLLVFDE-IGSRQDATAAALQMLVNAGLLTPDP 405 (484) Q Consensus 335 h~~v~~~~~~aD~~~i~~~ln~qli~~l~~~----Nf-~~---~~~~P~~~~~~-~~~~~~~~ae~~~~L~~~G~~~~~~ 405 (484) .+.-....+.+-.+.+..+|. +|++.++.+ ++ +. ...-+.+.|++ ...|.++.++.+.+++.+|+.. T Consensus 382 ~~~~~~~t~~~~~~~~~~al~-~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s--- 457 (500) T protein:vir:98 382 ENSDTYQMRNSIVALVEQSLK-ELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGT--- 457 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCC--- Confidence 223333444556667777774 577777643 22 11 11124677865 5677788888899999999743 Q ss_pred ccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccccccccccc Q lcl|NC_021302. 406 RLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQ 459 (484) Q Consensus 406 ~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (484) .+.++.+.||+++.+-.+.+... ...+ .++.+....... ...+ T Consensus 458 -~~~~i~~~~g~~eeea~~~l~~i--~~E~-~~~~~~~~~~~~-------~~g~ 500 (500) T protein:vir:98 458 -REMAIQKVLNVTEEKAQEIAAEI--NTGI-VDEINQQRTDTH-------LYGE 500 (500) T ss_pred -HHHHHHhcCCCCHHHHHHHHHHH--HHhc-cccCCCCCcccc-------ccCC Confidence 24678888898644221111110 0000 011111111000 0011 No 229 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=87.91 E-value=0.036 Score=28.52 Aligned_cols=415 Identities=11% Similarity=0.073 Sum_probs=177.8 Q ss_pred CCCCCCCccceeeee-cc--cccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEec Q lcl|NC_021302. 1 MAPKTVAPRTERGYV-NP--LAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRP 77 (484) Q Consensus 1 ~~~~~~~~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p 77 (484) ++|+.....+++..- +. -++.+...+.++.+ ........|+.|++|+ .++.|-++++.....+.-.+-.-.| T Consensus 34 ~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~----~~~~~~eLI~~YR~ma-~~pEvd~Av~eIVneaiv~d~~~~p 108 (524) T protein:vir:10 34 TAPKLDDGAREIETQEQNIPYNALMQQMFGSNEP----EVKNTRELIDTYRNLM-NNYEVDNAVQEIVSDAIVYEDDKEV 108 (524) T ss_pred ccCCCCCCceeeccCcccccchhhhhhhhhcccc----hhhhHHHHHHHHHHHh-hccchhhHHHHhhcceeEecCCCce Confidence 334433333333211 11 11111111112211 1112356799999997 5999999999888766543221111 Q ss_pred ---CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHHHHHhhccee-----------eeEEEeecCCee Q lcl|NC_021302. 78 ---NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLALKSLQFGHAV-----------FEQTYFYEGGRF 143 (484) Q Consensus 78 ---~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~a~~~G~s~-----------~Eivw~~~~g~~ 143 (484) +=+.-+..+.+.+-+.. -|+.++ .+|+.--+||.. +.++=...+..- T Consensus 109 V~l~Ld~~~~s~siK~kI~e------------------eF~~Il-~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~~pk~ 169 (524) T protein:vir:10 109 VALNLDGTDFSQSIKDKILA------------------EFSEVL-NLLNFQRKGTDHFQRWYVDSRIFFHKIINPKKMKD 169 (524) T ss_pred EEEEecccCcchHHHHHHHH------------------HHHHHH-HHhccchhhhHHHhhheeeceEEEEEEeeCCCccc Confidence 00111122222222211 133333 334444444433 333323333333 Q ss_pred eeeeeeeeCccceeeee---ecCCCceeeee---ccccccccccc---ceeccCCCCcccccccceEEEeecCccC--cc Q lcl|NC_021302. 144 WLKRLAPRPQSSIAYWN---VDRDGGLISIQ---QWPAGTFGGPG---MVVMAPNSMGPAIPVEQLVVYTHDMDPG--VW 212 (484) Q Consensus 144 ~~~~l~~r~~~~~~~~~---~~~dg~l~~~~---q~~~~~~~~~~---~~~~~~~~~~~~lp~~k~l~~~~~~~~~--~p 212 (484) .+.+|...+|+.+.+.+ ...+++...+. .+.--..+... .-.......++.||.. -|+|.|..-.+ .- T Consensus 170 GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~~~~~~~~~~~~~~ikI~~d-AIvy~~SGL~d~~~~ 248 (524) T protein:vir:10 170 GVQELRRLDPRQVQYIREIVTRMEDGVKIVDGYREFFVYDTGHESYCADGRIYSAGTKVKIPRA-AVVYAHSGLLDCCGK 248 (524) T ss_pred cceeeeeeCCccceeeeeecccCcccchhhcchhhheeecCCCcccccCcceecCCcceecchh-heeeeccCcccCCCC Confidence 45566677777775422 23333321111 00000000000 0011234455566655 47777754321 11 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHH-HHHhcCCcceEE---ecCCCCCCHHHHHHHHHHHHHHhcC----CceEEE--- Q lcl|NC_021302. 213 TGNSLLRPAYKNWKLKDELIRIEAAA-IRRHGIGVPYLK---GNEADSEDDDRMDELLEIASNYSGG----ESAGLA--- 281 (484) Q Consensus 213 ~G~gll~~~~~~~~~K~~~~~~w~~f-~Er~~~G~P~~~---gk~~~~~~~~~~~~l~~~l~~~~~g----~~a~~v--- 281 (484) .=.|.|.++..++==-++..-..+.| +-| -|=+. ...+.-...+.-+.|.+++..+++. .++|=| T Consensus 249 ~i~syLhkAiKp~NQLkm~EDAlVIYRitR----APeRRvFYIDVGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev~dd 324 (524) T protein:vir:10 249 NIIGYLQRAIKPANQLKLMEDAMVIYRITR----APDRRVFYIDTGNMPSRKAAAQMQHIMNTMKNRVVYDASTGKIKNQ 324 (524) T ss_pred ceeccchHhhHHHHhhHHHHhhHHHHhhhc----cccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeccCCeeccc Confidence 22478888887775555555554444 222 12111 1223334444445666776666543 112211 Q ss_pred -----------cc-----CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcc--cccc---cchhhHHHHHHH-H Q lcl|NC_021302. 282 -----------LT-----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNL--DGKG---GSYALASVQADT-F 339 (484) Q Consensus 282 -----------ip-----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~--~~~g---Gs~A~~evh~~v-~ 339 (484) +| .|++|.++.++.+. .-.+=++|..+.+-+++..+.--. ++++ ++++..-+..++ | T Consensus 325 rk~msMlEDyWLpRReGgrgTEItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~sRl~~e~~~~f~~gr~~EItRDEiKF 403 (524) T protein:vir:10 325 QHNMSMTEDYWLQRRDGKAVTEVDTMPGATGM-SDMDDVLYFRTALYRALRIPESRIPSESNSGVMFDAGTAITRDELKF 403 (524) T ss_pred hhhhhhHhhhcccccCCCCccceeeccccCCc-ChHHHHHHHHHHHHHHhCCCchhccCCCCccccccccchhhHHHHHH Confidence 22 47889888765332 334457899999999977655333 3321 234333344444 3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhC------CCCccccceEEecCCC--C---cHHHHHHHHHHHHhcCcccCCcccH Q lcl|NC_021302. 340 VQSVQTVADEIRDVAQAHVVEDIVDVN------WGEDEPAPLLVFDEIG--S---RQDATAAALQMLVNAGLLTPDPRLE 408 (484) Q Consensus 340 ~~~~~aD~~~i~~~ln~qli~~l~~~N------f~~~~~~P~~~~~~~~--~---~~~~~ae~~~~L~~~G~~~~~~~~~ 408 (484) ...+......++..|..-|-..|+--+ |..-...-+|.|.... . +.+.+.+++..|..+-=.+....+. T Consensus 404 ~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~ 483 (524) T protein:vir:10 404 AKWIRQLQNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLTMAEPFIGKYISH 483 (524) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchh Confidence 344555666666666554444444333 2111122244442211 1 2233445555555433233345678 Q ss_pred HHHHHH-hCCCCCCCCcc--cc-cccCCCcCCCccccCCCC Q lcl|NC_021302. 409 AFLRDA-AGLPGPDPDAD--DD-ESTADTGQDEPETDEPAL 445 (484) Q Consensus 409 ~~i~e~-~glp~p~~~e~--~~-~~~~~~~~~~~~~~~~~~ 445 (484) +|+++. +.+...+-.+. .. .....+--+.|+.....- T Consensus 484 ~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 484 QTAMKDFLQMTDEEINQEAKQIEEESKEARFQNPDEEEEDF 524 (524) T ss_pred HHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCChhhhcC Confidence 888655 44432211110 00 000001000111100000 No 230 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=87.64 E-value=0.037 Score=28.41 Aligned_cols=453 Identities=11% Similarity=0.059 Sum_probs=188.4 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhccccc---ccccccccchHHHHHH----H-------H-------------hc Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFE---QVDELRWPNSVYTYTR----M-------C-------------RE 53 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~lr~~~~~~~y~~----m-------~-------------~~ 53 (484) ||-++--+..+. .+...+ ++......+.... .+.+..|...+++|.. + . -. T Consensus 3 ~~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~rs~~~ 80 (651) T protein:vir:80 3 LATTTTDKNRQT-YDETHD-VSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGDVNADWRHKIT 80 (651) T ss_pred ccccccchhhhh-hhhhHH-HHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCCCCCCCCcccc Confidence 777776665543 222221 1111111111111 1111111111111110 0 0 02 Q ss_pred chHHHHHHHHHHHHhhCC-----C-cEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHh Q lcl|NC_021302. 54 EARIASVLRAIGLPIRRT-----D-WRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQ 126 (484) Q Consensus 54 D~~v~s~l~~r~~~v~~~-----~-~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~ 126 (484) ++.|..+++.+...+... + +.|+|.++. ++++...+.+...+. .-+...+|...+..+ .+++. T Consensus 81 ~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~-d~a~~~~~~~~~~~~---------~~l~~~~~~~~~~~~~~d~l~ 150 (651) T protein:vir:80 81 TGKAFEAIETIHAYLMSATFPNKNWFDVVPAKPG-QDNLLVSRLIKRYVQ---------DKLTEGKFRAAYANFLRQLLI 150 (651) T ss_pred ChhHHHHHHHHHHHHHHhhcCCCceeEeccCCch-hHHHHHHHHHHHHHH---------HHhhccCcHHHHHHHHHhhcc Confidence 456777777776666653 3 556775433 344444444443322 112345788888776 68999 Q ss_pred hcceeeeEEEeec---------------CC--eee----------eeeeeeeCccceeeeeecC----CCceeeee---- Q lcl|NC_021302. 127 FGHAVFEQTYFYE---------------GG--RFW----------LKRLAPRPQSSIAYWNVDR----DGGLISIQ---- 171 (484) Q Consensus 127 ~G~s~~Eivw~~~---------------~g--~~~----------~~~l~~r~~~~~~~~~~~~----dg~l~~~~---- 171 (484) +|.++.=+.|+.. ++ .+. -..+..+||..|. +.... |+..+... T Consensus 151 ~G~~i~kv~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~-~dp~a~~~~d~~~v~~~~~t~ 229 (651) T protein:vir:80 151 TGNSVLALPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCF-YDPNVTDPNRGAFIRKLTKTK 229 (651) T ss_pred cCceEEEEeecceeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHee-ecCCCcCccccceeeeeeeeH Confidence 9999998888532 00 000 0123334444332 11110 11110000 Q ss_pred ----------------------cccc------------------------------------cccc-cccceeccCCCCc Q lcl|NC_021302. 172 ----------------------QWPA------------------------------------GTFG-GPGMVVMAPNSMG 192 (484) Q Consensus 172 ----------------------q~~~------------------------------------~~~~-~~~~~~~~~~~~~ 192 (484) ..+. ...+ ....+.....+.. T Consensus 230 ~~l~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~~~~~v~~~g~~ 309 (651) T protein:vir:80 230 ADILNLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVVVTIMGNE 309 (651) T ss_pred HHHHHHHhcccccchhhHHHHhhhccccccCCccccccccCCCccccccccceEEEEEEEEeeccCCceEEEEEEEcCcE Confidence 0000 0000 0000000001100 Q ss_pred ----ccccc---cceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHH Q lcl|NC_021302. 193 ----PAIPV---EQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDEL 265 (484) Q Consensus 193 ----~~lp~---~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l 265 (484) ...|+ .-|+++++....+..||.|....+...-...+...+.....+.+...| ++.+ ..+.....++ T Consensus 310 il~~~~~~~~~~~Pf~~~~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~-~~~v-~~d~~~~~~~---- 383 (651) T protein:vir:80 310 VLRFEQNPYWCGRPFVIGTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQ-MYTL-RSDGLLQPED---- 383 (651) T ss_pred EecccccCCCCCCCeeeecceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCC-cEEe-cCCccccHHH---- Confidence 01221 368999999999999999999999999999999999999999986543 3333 2222222222 Q ss_pred HHHHHHHhcCCceEEEccCCceEEEecccC-CchhHHHHHHHHHHHHHHHHhhhhhcccc--cc-cchhhHHHHH--HHH Q lcl|NC_021302. 266 LEIASNYSGGESAGLALTAGEEAGILSPNG-TPLDPRRAIEYHDHQMALVALAHFLNLDG--KG-GSYALASVQA--DTF 339 (484) Q Consensus 266 ~~~l~~~~~g~~a~~vip~~~~ie~~~~~~-~~~~~~~li~~~d~~Isk~ilGqtlt~~~--~g-Gs~A~~evh~--~v~ 339 (484) +.+ ..| +++......++..+.... .......++++++..|..+.+-..+..+. .+ +..-.++++. +.. T Consensus 384 ---l~~-~pg--~vi~~~~~~~~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~ 457 (651) T protein:vir:80 384 ---VYT-EPG--KVFLVSDHGDLQPLANQSSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAG 457 (651) T ss_pred ---hhc-CCC--ceEEecCCCCceeeccCcccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHH Confidence 111 223 344455555566555432 22234568999999998886554443321 11 2222244443 344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhC--CCCccccceEE-----------e-----------cCCCC-----cHHHHHH Q lcl|NC_021302. 340 VQSVQTVADEIRDVAQAHVVEDIVDVN--WGEDEPAPLLV-----------F-----------DEIGS-----RQDATAA 390 (484) Q Consensus 340 ~~~~~aD~~~i~~~ln~qli~~l~~~N--f~~~~~~P~~~-----------~-----------~~~~~-----~~~~~ae 390 (484) ...+..-.+.+..++...|++.++.++ |+.....|++. + ..... .....++ T Consensus 458 ~~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~ 537 (651) T protein:vir:80 458 GNRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIED 537 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeeeeeeccHHHHHHHHHHHHH Confidence 555666667777766666777776666 44333333320 0 00000 1111222 Q ss_pred HHHHHHhcCcccCCcc-------cHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCccc-ccccccc-cccccc Q lcl|NC_021302. 391 ALQMLVNAGLLTPDPR-------LEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNT-SGTTSTT-NAPQAR 461 (484) Q Consensus 391 ~~~~L~~~G~~~~~~~-------~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~ 461 (484) ..+ +.+++...+... ....+.+..|++.+..-- ..+.....+.+++..-..... ....... ...++. T Consensus 538 l~~-~~q~~~~~p~~~~~~~~~~~~~~l~~~~g~~~~~~~l---~~~~q~~~~~~~~~~~~q~~~~~~~a~~~~~~~~~~ 613 (651) T protein:vir:80 538 RLT-FIQAVAQVPEMGQLVDYKRILVDLLQHWGFEEPEAYL---KQQDQQAPANPQEALLSQAKDVGGQAMSNMLQNQLQ 613 (651) T ss_pred HHH-HHHhhccCCccchhhhHHHHHHHHHHHcCCCCcHHhc---CCCccchhhhhhHHHHhhHHHHHHHHHHHHHHHHHH Confidence 222 223232222111 123456778987664321 111000000001000000000 0000000 000000 Q ss_pred ccccccchHHH---------hcC--------cccCccc Q lcl|NC_021302. 462 KRPRGRSPRDR---------RKT--------PDGAMPL 482 (484) Q Consensus 462 ~~~~~~~~~~~---------~~~--------~~~~~~~ 482 (484) +........+. +++ ..-.+|- T Consensus 614 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 651 (651) T protein:vir:80 614 ADGGTQMMSEMYGTPNADQMQQELMATTPNVSEQQLTQ 651 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 00000000000 000 0000000 No 231 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=87.12 E-value=0.041 Score=28.20 Aligned_cols=421 Identities=8% Similarity=0.031 Sum_probs=148.7 Q ss_pred cceeeeecccccchhhhhhhcccccc----c-cc--ccccchHHHHHHHHhcchHH------HHHHHHHHHHhhCCCcEE Q lcl|NC_021302. 9 RTERGYVNPLAGFGTFLAQGLDQFEQ----V-DE--LRWPNSVYTYTRMCREEARI------ASVLRAIGLPIRRTDWRI 75 (484) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~~----~-~~--lr~~~~~~~y~~m~~~D~~v------~s~l~~r~~~v~~~~~~v 75 (484) +.=+-.+......+.... .....-. + .. -..-..|..+..|-+.+.+- -+-+.+|+. .++ T Consensus 1 m~~~~~~k~~~~k~~~~~-~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~--~sl---- 73 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYM-QTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPM--NHL---- 73 (522) T ss_pred CchHHHHHHHHHHHHHHh-hcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccc--eec---- Confidence 111101111100000000 0000000 0 00 00112233343443221110 000000000 000 Q ss_pred ecCCCCHHHHHHHHHHHHh-h----hccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCeeeeeeee Q lcl|NC_021302. 76 RPNGARPEVVEHVAACLGL-P----VEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRFWLKRLA 149 (484) Q Consensus 76 ~p~~~~~e~~~~~~~~l~~-~----~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~~~~~l~ 149 (484) +=...+++..+..+.. + +.++..++...+.+....|...+... ..|...|=.++=+.|. +|.+. +. T Consensus 74 ---nl~~~i~~~~A~lv~~e~~~i~v~d~~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d--~~~~~---i~ 145 (522) T protein:vir:47 74 ---PIARTASKKIASLVYNEQATITTKNEILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYID--GDKVR---VA 145 (522) T ss_pred ---chHHHHHHHHhhhhcCCcceeecCChHHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEc--CCceE---EE Confidence 0001111111111100 0 11111222222333333566655554 5788889888877775 23221 22 Q ss_pred eeCccceeeeeecCCCc-----------------e----eeeecccc--------------c----------ccccc-cc Q lcl|NC_021302. 150 PRPQSSIAYWNVDRDGG-----------------L----ISIQQWPA--------------G----------TFGGP-GM 183 (484) Q Consensus 150 ~r~~~~~~~~~~~~dg~-----------------l----~~~~q~~~--------------~----------~~~~~-~~ 183 (484) ++++..|.-..++.++. . +...++.. . +.... .. T Consensus 146 ~v~ad~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~ 225 (522) T protein:vir:47 146 FIQAPVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQR 225 (522) T ss_pred EEcCCceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCcc Confidence 22222221111111111 0 00000000 0 00000 00 Q ss_pred eeccCCCCccccccc---------ceEEEee----cCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEE Q lcl|NC_021302. 184 VVMAPNSMGPAIPVE---------QLVVYTH----DMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLK 250 (484) Q Consensus 184 ~~~~~~~~~~~lp~~---------k~l~~~~----~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~ 250 (484) +......+...|++. =|.+++. ....++|+|.|.+..|....-.-...+..|..=++- .-..+.+ T Consensus 226 v~l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~--g~~~i~v 303 (522) T protein:vir:47 226 VNLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRM--GQRRVIV 303 (522) T ss_pred ccccccccccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHh--ccceeec Confidence 000000011122211 1333332 234578999999999886665555555555544442 1222222 Q ss_pred ----ecCC-CCCCHHHHHHHHHHHHHHhcCCceEEEcc----CCceEEEecccCCchhHHHHHHHHHHHHHHHH-hh-hh Q lcl|NC_021302. 251 ----GNEA-DSEDDDRMDELLEIASNYSGGESAGLALT----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVA-LA-HF 319 (484) Q Consensus 251 ----gk~~-~~~~~~~~~~l~~~l~~~~~g~~a~~vip----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~i-lG-qt 319 (484) -+.. .....+.. .. ..+-.+...+..++ .+..|+.++..-....|...++.+-+.|+..+ ++ ++ T Consensus 304 ~~~~l~~~~~~~~g~~~--~~---~~fd~~~~~f~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~t 378 (522) T protein:vir:47 304 PEHLTQRQYQRPDGTID--FR---PRFDVEQNVYMQIGGSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGM 378 (522) T ss_pred chHHhccCCCCCCcccc--cc---cccCcccceEeecCCCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccc Confidence 1110 11010000 00 00000111111111 22234444443334456666666666665543 44 33 Q ss_pred hcccccccchhhHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CCC-ccccceEEecC-CCCcHHHH Q lcl|NC_021302. 320 LNLDGKGGSYALASV--QADTFVQSVQTVADEIRDVAQAHVVEDIVDVN-------WGE-DEPAPLLVFDE-IGSRQDAT 388 (484) Q Consensus 320 lt~~~~gGs~A~~ev--h~~v~~~~~~aD~~~i~~~ln~qli~~l~~~N-------f~~-~~~~P~~~~~~-~~~~~~~~ 388 (484) ++.+++|..-| .++ .+.-....+..-.+.+..+| ++|+..++.+- ... ...-+.+.|++ ...|.++. T Consensus 379 f~~~~~~~kTA-tEi~s~~~~~~~t~~~~~~~~~~al-~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~ 456 (522) T protein:vir:47 379 FTFDGQGMKTA-TEIVSENSDTYQMRSSIVALVEQSI-KELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAE 456 (522) T ss_pred cCccccccccH-HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHH Confidence 44333322222 244 22333334455666777777 45777776442 111 22225677875 56777888 Q ss_pred HHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCC-CcCCCccccCCCCccccccccccccccccccccc Q lcl|NC_021302. 389 AAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTAD-TGQDEPETDEPALPNTSGTTSTTNAPQARKRPRG 466 (484) Q Consensus 389 ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 466 (484) ++.+.+++.+|+.. .+.++.+.||+.+.+-.+++...... .....++.+.... .+ ..+......+ T Consensus 457 ~~~~~~~v~aG~~s----~e~~i~~~~g~~eeea~~el~ri~~E~~~~~~~~~~~~~~--------~~-~~~~~~d~~~ 522 (522) T protein:vir:47 457 LDYWAKMVAAGFST----KKRAIGKTLNISGVEAEKELNAINSELLPMNDAELAIYGM--------HD-QNEEKADDKG 522 (522) T ss_pred HHHHHHHHhcCCCC----HHHHHHhcCCCChHHHHHHHHHHHHhhccCCCCCCCCCCC--------CC-cccccCCCCC Confidence 88999999999743 35778899998654322222111100 0000001000000 00 0000000001 No 232 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=86.84 E-value=0.043 Score=28.09 Aligned_cols=408 Identities=11% Similarity=0.088 Sum_probs=152.7 Q ss_pred cceeeeecccccchhhhhhh----cccccccccc----cccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCCC Q lcl|NC_021302. 9 RTERGYVNPLAGFGTFLAQG----LDQFEQVDEL----RWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWRIRPNGA 80 (484) Q Consensus 9 ~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~l----r~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~~ 80 (484) +.=+..+......+.. .++ +.+..+-... -....|+.+..|-+.+.+.. .. +...+.. T Consensus 1 m~~~~~~k~~~~~~~~-~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~---~~----------~~~~~~~ 66 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAA-ATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYI---HY----------QASDGIK 66 (508) T ss_pred CChHHHHHHHHHHHHH-HhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCccc---cc----------ccCCCCc Confidence 1111111111000000 000 0000000000 01122344444433211100 00 0000000 Q ss_pred ------CHHHHHHHHHHHHhhhccc---------h-hhhhHHHhhcCCCHHHHHH-HHHHHHhhcceeeeEEEeecCCee Q lcl|NC_021302. 81 ------RPEVVEHVAACLGLPVEGD---------E-SDKPTPRTRGRFSWDQHLR-LALKSLQFGHAVFEQTYFYEGGRF 143 (484) Q Consensus 81 ------~~e~~~~~~~~l~~~~~~~---------~-~~~~~~~~~~~~~~~~~i~-~~l~a~~~G~s~~Eivw~~~~g~~ 143 (484) +--.++.++..++..+.++ + .+....+.+..-.|...+. .+.+|..+|-.++=+.|. ++.. T Consensus 67 ~~~~~~sln~~~~i~~~~A~lv~~e~~~i~v~~~~~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d--~~~~ 144 (508) T protein:vir:15 67 KKRLKNTINMAKTAARRIASVVFNEKAEIHVKDNNEADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYID--GNHI 144 (508) T ss_pred cccceeecchHHHHHHHHHhhhhCCCceEEeCCchHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEe--CCee Confidence 0001122222222221111 1 1112233333345666555 446899999988877775 2222 Q ss_pred eeeeeeeeCccceeeeeecCCC-----------------cee-e---eecccccccccccceeccC-C--CCcccc---- Q lcl|NC_021302. 144 WLKRLAPRPQSSIAYWNVDRDG-----------------GLI-S---IQQWPAGTFGGPGMVVMAP-N--SMGPAI---- 195 (484) Q Consensus 144 ~~~~l~~r~~~~~~~~~~~~dg-----------------~l~-~---~~q~~~~~~~~~~~~~~~~-~--~~~~~l---- 195 (484) +|.++++..|.-..++..+ +.. + ...+..+..+......+.. + ..|.++ T Consensus 145 ---~i~~v~ad~~~P~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~ 221 (508) T protein:vir:15 145 ---KIAWVRADQFYPLQSNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLST 221 (508) T ss_pred ---EEEEEcCCeeEEEEEcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhh Confidence 2233333322111111111 000 0 0000000000000000000 0 001111 Q ss_pred -------cc---------cceEEEee----cCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCC Q lcl|NC_021302. 196 -------PV---------EQLVVYTH----DMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEAD 255 (484) Q Consensus 196 -------p~---------~k~l~~~~----~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~ 255 (484) .+ --|++++. +...++|+|.|.+..+.-..-.-...+..|+.-++ . |-+-+.++... T Consensus 222 ~~e~~~l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~-~--~~~~i~v~~~~ 298 (508) T protein:vir:15 222 LPVYKELAPQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIR-L--GQKHIAVQPGM 298 (508) T ss_pred cccccCCCcceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHH-h--cccceeechHH Confidence 11 11334332 23457899999999998776666666666666664 2 44433332110 Q ss_pred -CCCHHHHHHHHHHHHHHhcCCceEEEccC----CceEEEecccCCchhHHHHHHHHHHHHHHHH-hhh-hhcccccccc Q lcl|NC_021302. 256 -SEDDDRMDELLEIASNYSGGESAGLALTA----GEEAGILSPNGTPLDPRRAIEYHDHQMALVA-LAH-FLNLDGKGGS 328 (484) Q Consensus 256 -~~~~~~~~~l~~~l~~~~~g~~a~~vip~----~~~ie~~~~~~~~~~~~~li~~~d~~Isk~i-lGq-tlt~~~~gGs 328 (484) ..+.+.. . .+..+...+..++. +..|+.++..-....|...++.+-+.|...+ ++. +++.+++ |. T Consensus 299 l~~d~~~~-~------~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~-~~ 370 (508) T protein:vir:15 299 LRFDDEHK-P------TFDTEQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSND-GV 370 (508) T ss_pred hcCCCCCc-c------ccCCCCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccC-cc Confidence 0000000 0 01112223333331 2235555544334456666666666666665 332 2222222 22 Q ss_pred hhhHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---CC----C---------CccccceEEecC-CCCcHHHHH Q lcl|NC_021302. 329 YALASV--QADTFVQSVQTVADEIRDVAQAHVVEDIVDV---NW----G---------EDEPAPLLVFDE-IGSRQDATA 389 (484) Q Consensus 329 ~A~~ev--h~~v~~~~~~aD~~~i~~~ln~qli~~l~~~---Nf----~---------~~~~~P~~~~~~-~~~~~~~~a 389 (484) ...-++ ...-....+..-.+.+...|. +|++.++.+ +. + ....-+.+.|++ ...|.++.+ T Consensus 371 ~TAtei~s~~~~~~~t~~~~~~~~~~al~-~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~ 449 (508) T protein:vir:15 371 KTATEVVSNNSMTYQTRSSYLTMVEKAID-ELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQL 449 (508) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHH Confidence 212222 223344444556677777774 576665543 21 1 011124677865 457777778 Q ss_pred HHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCc-cccCCCCcccccc Q lcl|NC_021302. 390 AALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEP-ETDEPALPNTSGT 451 (484) Q Consensus 390 e~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 451 (484) +...+++.+|+.. .+.++.+.+|+.+.+-.+++..-......+.+ ...........|+ T Consensus 450 ~~~~~~v~aGi~s----~e~~i~~~~g~~deea~~el~ri~~E~~~~~~~~~~~~~~~g~~ge 508 (508) T protein:vir:15 450 EEDAKVLAIGALS----KQTFLQRNYGMTDEQAAEELAKIQSEAPTDTFEGGRSAILNGGDGE 508 (508) T ss_pred HHHHHHHhcCCCC----HHHHHHhcCCCChHHHHHHHHHHHHhccccCccccccccCCCCCCC Confidence 8888999999743 25678888898654322222111111100000 0100111111111 No 233 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=84.86 E-value=0.057 Score=27.40 Aligned_cols=414 Identities=10% Similarity=0.027 Sum_probs=143.6 Q ss_pred Cccceeeeecccccchhhhhhhc-ccccccccc-cccchHHHHH---HHHh--------------cchHHHHHHHHHHHH Q lcl|NC_021302. 7 APRTERGYVNPLAGFGTFLAQGL-DQFEQVDEL-RWPNSVYTYT---RMCR--------------EEARIASVLRAIGLP 67 (484) Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l-r~~~~~~~y~---~m~~--------------~D~~v~s~l~~r~~~ 67 (484) ....-.-+....-......+.-+ ..... ... |..+..+.|+ ++.. ..+...-++.+.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~~~iv~~~~~~ 79 (489) T protein:vir:99 1 MLQEDFEAIDYESKLWIDQLKNYISRFKA-EQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDFAKYITVFEQGY 79 (489) T ss_pred CCccceeeeCCCCCCCHHHHHHHHHHHHH-HHHHHHHHHHHHhcccCccccccccccccCCcceeecchHHHHHHHHhhh Confidence 11111001111000000000000 00000 000 0000011111 0000 001111111111122 Q ss_pred hhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEe-e---cCCe Q lcl|NC_021302. 68 IRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYF-Y---EGGR 142 (484) Q Consensus 68 v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~-~---~~g~ 142 (484) +.+-+..+.+ ..+..+....+....-+|+.....+ .++..||.+. +++|- . .++. T Consensus 80 l~g~~~~~~~-------------------~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~-~~v~~~~~~d~~~~ 139 (489) T protein:vir:99 80 MLGVPVEYKN-------------------ENKDLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAY-ELLTVEKIDDKKTE 139 (489) T ss_pred hccCCceeec-------------------CChhHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEE-EEEeeccCcCCCcc Confidence 2222222221 1111222223333344676655544 5788899764 44442 1 2333 Q ss_pred eeeeeeeeeCccceeeeeecCC--Cceee-eeccc----ccc-------cccccceeccC---CCC--------cccccc Q lcl|NC_021302. 143 FWLKRLAPRPQSSIAYWNVDRD--GGLIS-IQQWP----AGT-------FGGPGMVVMAP---NSM--------GPAIPV 197 (484) Q Consensus 143 ~~~~~l~~r~~~~~~~~~~~~d--g~l~~-~~q~~----~~~-------~~~~~~~~~~~---~~~--------~~~lp~ 197 (484) +. +...+|+.+. ..+++. +.++. ++.+. .+. ........+.. ... ...+.. T Consensus 140 ~~---i~~~~p~~~~-~v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~ 215 (489) T protein:vir:99 140 VK---LYQLPAEQTF-VIYDDTYQRNSLMAVHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKG 215 (489) T ss_pred eE---EEEEcccceE-EEEcCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEecCCCcccceecccccccCCc Confidence 33 3333444331 112211 11111 00000 000 00000000000 000 111111 Q ss_pred cceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHHHHHHH----H- Q lcl|NC_021302. 198 EQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELLEIASN----Y- 272 (484) Q Consensus 198 ~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~----~- 272 (484) --++.|+ +|+.|.|.+..+....=.-...+..++..++.+.+++.++.|-.....+..+.......-.+ + T Consensus 216 vPvv~~~-----n~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 290 (489) T protein:vir:99 216 VPVNEYA-----NNEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTGADENDYLDDGRLNPNGRLAIS 290 (489) T ss_pred eeEEEee-----cCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCcccccchhhhhhcccccccccccc Confidence 1233333 35678888877654444445667788888887766555655533222222111111110000 0 Q ss_pred -hcCCceEEEcc-------CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHH---HHHHH Q lcl|NC_021302. 273 -SGGESAGLALT-------AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASVQA---DTFVQ 341 (484) Q Consensus 273 -~~g~~a~~vip-------~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~evh~---~v~~~ 341 (484) .....-.+.+. .+.+++++........++..++++.+.|.+.--+..++.++.+| .+.|..-. .-... T Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~Sg~Al~~~~~~l~~ 369 (489) T protein:vir:99 291 IGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMKFSG-VQSGESMKYKLMASDN 369 (489) T ss_pred cccccceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcccccccccc-cchHHHHHHHHHHHHH Confidence 00000111111 23456666655455678888999999998775544455443222 22222211 11222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh---CCCCcc-----ccceEEec-CCCCcHHHHHHHHHHHHhcCcccCCcccHHHHH Q lcl|NC_021302. 342 SVQTVADEIRDVAQAHVVEDIVDV---NWGEDE-----PAPLLVFD-EIGSRQDATAAALQMLVNAGLLTPDPRLEAFLR 412 (484) Q Consensus 342 ~~~aD~~~i~~~ln~qli~~l~~~---Nf~~~~-----~~P~~~~~-~~~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~ 412 (484) .+..-.+.+...+ +++++-++.+ ..+... .-..+.|. ....+..+.++++.+|+ |+ + +.+.+. T Consensus 370 k~~~k~~~~~~~l-~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~--gi-i----s~et~~ 441 (489) T protein:vir:99 370 YREKQERLFKKGL-MRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLY--GI-V----SDQTIF 441 (489) T ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHHh--cc-C----CHHHHH Confidence 2333344555555 3455554443 122111 11356675 45677888889998885 54 2 345555 Q ss_pred HHhC-CCCCCCCcccccccCCCcCCCccccCCCCccccccccccccccccccc Q lcl|NC_021302. 413 DAAG-LPGPDPDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRP 464 (484) Q Consensus 413 e~~g-lp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 464 (484) +.++ +..+...++...-....................+.. .+....| T Consensus 442 ~~l~~v~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~-----~~~~~~p 489 (489) T protein:vir:99 442 EILNTVTGVDAEAELKRLKEEADKKQSLPEPRLVGDASGQE-----EPTAEKP 489 (489) T ss_pred HhcCCCCchhHHHHHHHHHHHHHHHhccccccccCCCCCCc-----CCCCCCC Confidence 5543 322211111100000000000000000000001110 0111111 No 234 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=84.64 E-value=0.059 Score=27.33 Aligned_cols=435 Identities=13% Similarity=0.065 Sum_probs=166.6 Q ss_pred CCCCCCCc-------------------cceeeeecc-----------cccchhhhhhhcccccccccccc-----cchHH Q lcl|NC_021302. 1 MAPKTVAP-------------------RTERGYVNP-----------LAGFGTFLAQGLDQFEQVDELRW-----PNSVY 45 (484) Q Consensus 1 ~~~~~~~~-------------------~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~lr~-----~~~~~ 45 (484) -|-.||+| .|+.+.+.. .......+.+.+... ..+.|.| =-++. T Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~F~Gy~ 114 (695) T protein:vir:36 36 AAAAQPVPADFARRGALNALDAAPVVEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGT-SMDALSFVTSSGFPGFP 114 (695) T ss_pred hccccccchhhhhcccccccccccccCCCcccccceeceecccccCccccchhhhhhccccc-ccccchhhhccCcchHH Confidence 11111111 011111111 000000011110000 0011111 01233 Q ss_pred HHHHHHhcchHHHHHHHHHHHHhhCCCcEEecCC-------------------CCHHHHHHHHHHHHhhhccchhhhhHH Q lcl|NC_021302. 46 TYTRMCREEARIASVLRAIGLPIRRTDWRIRPNG-------------------ARPEVVEHVAACLGLPVEGDESDKPTP 106 (484) Q Consensus 46 ~y~~m~~~D~~v~s~l~~r~~~v~~~~~~v~p~~-------------------~~~e~~~~~~~~l~~~~~~~~~~~~~~ 106 (484) .-..|. +-+.+.++....-....+. |.-.-.+ .+++..+.+.+++.. T Consensus 115 ~la~la-Q~~eyr~~~~~ia~e~~R~-w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~er------------ 180 (695) T protein:vir:36 115 TLVLLA-QLPEYRAMHEVLADECIRT-WGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIER------------ 180 (695) T ss_pred HHHHHh-hccchhhHHHHHHHHhhcc-cceecccchhhhhhccccccccccccCchHHHHHHHHHHHH------------ Confidence 334454 4567777777766665554 6321111 111333444433321 Q ss_pred HhhcCCC-HHHHHHHHHHHHhhcceeeeEEEeecCC--------------eeeeeeeeeeCccceeeeeecCCCceeeee Q lcl|NC_021302. 107 RTRGRFS-WDQHLRLALKSLQFGHAVFEQTYFYEGG--------------RFWLKRLAPRPQSSIAYWNVDRDGGLISIQ 171 (484) Q Consensus 107 ~~~~~~~-~~~~i~~~l~a~~~G~s~~Eivw~~~~g--------------~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~ 171 (484) .. |+.+..-+--+.+||-+++=+.=.-++. .-.++.|..++|.|+.--.++. . T Consensus 181 -----L~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~-------~ 248 (695) T protein:vir:36 181 -----LRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNS-------I 248 (695) T ss_pred -----HHHHHHHHHHHHhhccccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhh-------c Confidence 12 3445555557999999984432222221 0112224444444432110000 0 Q ss_pred cccccccccccceeccCCCCcccccccceEEEeecC------ccCccccchhHHHHHHHHH---HHHHHHHHHHHHHHHh Q lcl|NC_021302. 172 QWPAGTFGGPGMVVMAPNSMGPAIPVEQLVVYTHDM------DPGVWTGNSLLRPAYKNWK---LKDELIRIEAAAIRRH 242 (484) Q Consensus 172 q~~~~~~~~~~~~~~~~~~~~~~lp~~k~l~~~~~~------~~~~p~G~gll~~~~~~~~---~K~~~~~~w~~f~Er~ 242 (484) .-....++.+..+... +..+-..+++.++-.. ..-+.+|.++...++..+. -.+.....-+ + ++ T Consensus 249 dP~spdfgkP~~y~V~----G~kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li-~-~~- 321 (695) T protein:vir:36 249 NPVADDFYKPSTWWMI----GTEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIV-K-QF- 321 (695) T ss_pred cchhhccCCCceEEEe----ceEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHH-H-hh- Confidence 0011122222222221 1123333333333221 1235678998888875432 1122222111 1 11 Q ss_pred cCCcceEE-e--cCCCCCCHHHHHHHHHHHHHHhcCCceEEEccC-CceEEEecccCCchhHHHHHHHHHHHHHHHHhhh Q lcl|NC_021302. 243 GIGVPYLK-G--NEADSEDDDRMDELLEIASNYSGGESAGLALTA-GEEAGILSPNGTPLDPRRAIEYHDHQMALVALAH 318 (484) Q Consensus 243 ~~G~P~~~-g--k~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~-~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGq 318 (484) -+..+. . ..-.+..+.++..-.++++.++.. ....++.+ +.+++.++++-+ ....++...-.+||-+. +- T Consensus 322 --~v~~lk~dla~aL~~g~~~~l~~R~eli~~~Rsn-~G~~llDk~~Eefeq~stslS--GLddVi~qf~q~VAgaa-~I 395 (695) T protein:vir:36 322 --SVSGILMDLAQALMPGANVDLSMRAELINRYRDN-RNILFLDKATEEFFQFNTPLS--GLDALQAQAQEQMSAVS-HI 395 (695) T ss_pred --hHHHHHHHHHHhhcChhHHHHHHHHHHHHHhcCc-cceEEEecCCcceEEEecccC--CHHHHHHHHHHHHHhhh-cC Confidence 111110 0 000111223344344666666544 45667874 678888776433 35566666666666652 22 Q ss_pred hhc---ccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCC-CcH-------HH Q lcl|NC_021302. 319 FLN---LDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIG-SRQ-------DA 387 (484) Q Consensus 319 tlt---~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~-~~~-------~~ 387 (484) .+| ..+-.|=.|.|+.-..+.-+.+++.....-..+-+.++.-|..--||...+-..|+|...- -+. ++ T Consensus 396 PltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idpdi~~~fnPL~qmtd~EkAeI~~k 475 (695) T protein:vir:36 396 PLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNALRELDDLEVAESRYK 475 (695) T ss_pred chhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCCCCcCHHHHHHHHhh Confidence 222 1222354566777777778888777654444444445555555446654433456665432 122 34 Q ss_pred HHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCC-----cccc-----------cccCC-CcCCCcccc--------C Q lcl|NC_021302. 388 TAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPD-----ADDD-----------ESTAD-TGQDEPETD--------E 442 (484) Q Consensus 388 ~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~-----e~~~-----------~~~~~-~~~~~~~~~--------~ 442 (484) .|+.++.+.+.|++.+ +.++.++.-.....- ..+. ..+.. +....++.+ . T Consensus 476 ~A~~d~~~~~~gvI~~-----~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 550 (695) T protein:vir:36 476 QAQSDVLYVQEQVIRP-----DQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGA 550 (695) T ss_pred hhHHHHHHHHhcCCCH-----HHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCcccccc Confidence 5677888999998654 578888654321110 0000 00000 000000000 0 Q ss_pred CCCccccccccccccccccccccccchHH----HhcCcccC-------cccCC Q lcl|NC_021302. 443 PALPNTSGTTSTTNAPQARKRPRGRSPRD----RRKTPDGA-------MPLWD 484 (484) Q Consensus 443 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~-------~~~~~ 484 (484) ..+|+........+. +.+.+.++.. +-...+|. .-.|. T Consensus 551 ~~~~~v~~~~~~~~~----~~ag~~~~~~~aag~v~~~~g~vLl~kr~~g~W~ 599 (695) T protein:vir:36 551 TAPPTVANVNANVNP----REAGAQDAAMRAAGAVYVVDGKVLLMKRPAGDWG 599 (695) T ss_pred cCCCcccccccccCc----cccCCCCccceeeEEEEEeCCEEEEEEecCCCcc Confidence 011110000000000 0000000000 00000011 12244 No 235 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=83.23 E-value=0.07 Score=26.91 Aligned_cols=438 Identities=14% Similarity=0.102 Sum_probs=166.2 Q ss_pred CCCCCCCccceeeeecccc-----cch----hhhhhhccc-ccccccccc-----cchHHHHHHHHhcchHHHHHHHHHH Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLA-----GFG----TFLAQGLDQ-FEQVDELRW-----PNSVYTYTRMCREEARIASVLRAIG 65 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~-----~~~----~~~~~~~~~-~~~~~~lr~-----~~~~~~y~~m~~~D~~v~s~l~~r~ 65 (484) -||-+- .++.+.+.... .++ ....+++.- -...+.|.| =-++.+-..|. +-+.+.++....- T Consensus 57 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~la-Q~~eyr~~~~~ia 133 (698) T protein:vir:10 57 AAPVAE--PSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLA-QLPEYRAMHEVLA 133 (698) T ss_pred cccccC--CCccccccccceeccccCCccccchhhhhhcccccccccchhhhccCcchHHHHHHHh-hccchhhHHHHHH Confidence 122110 01111111100 000 000011110 000011111 01233344454 4667777777766 Q ss_pred HHhhCCCcEEecCCC-------------------CHHHHHHHHHHHHhhhccchhhhhHHHhhcCCC-HHHHHHHHHHHH Q lcl|NC_021302. 66 LPIRRTDWRIRPNGA-------------------RPEVVEHVAACLGLPVEGDESDKPTPRTRGRFS-WDQHLRLALKSL 125 (484) Q Consensus 66 ~~v~~~~~~v~p~~~-------------------~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~i~~~l~a~ 125 (484) ....+. |.-.-.+. +++..+.+..++. +.. |+.+..-+--+. T Consensus 134 ~e~~R~-w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~e-----------------rl~V~~~l~eai~~aR 195 (698) T protein:vir:10 134 DECIRT-WGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIE-----------------RLRIRDAVRTTVIHDQ 195 (698) T ss_pred HHhhcc-cceeccccchhhhhhcccccccccccccHHHHHHHHHHHH-----------------HHHHHHHHHHHHHhcc Confidence 665554 63211111 1133333333322 112 344555555799 Q ss_pred hhcceeeeEEEeecCCe----e----------eeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCC Q lcl|NC_021302. 126 QFGHAVFEQTYFYEGGR----F----------WLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSM 191 (484) Q Consensus 126 ~~G~s~~Eivw~~~~g~----~----------~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~ 191 (484) +||-+++=++=.-++.. + .++.|...+|.|+.--.++. ..-....++.+..+... T Consensus 196 lfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGL~ViDp~~vtP~~~n~-------~dP~spdfgkP~~y~V~---- 264 (698) T protein:vir:10 196 AFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNS-------INPVADDFYKPSTWWMI---- 264 (698) T ss_pred cccceEEEEEeecCccccccccccccccccCccceeeeeecccccccchhhh-------ccchhhccCCCceEEEe---- Confidence 99999844332222210 1 12224444444432110000 00111223333333222 Q ss_pred cccccccceEEEeecC------ccCccccchhHHHHHHHHHH---HHHHHHHHHHHHHHhcCCcceEE-e--cCCCCCCH Q lcl|NC_021302. 192 GPAIPVEQLVVYTHDM------DPGVWTGNSLLRPAYKNWKL---KDELIRIEAAAIRRHGIGVPYLK-G--NEADSEDD 259 (484) Q Consensus 192 ~~~lp~~k~l~~~~~~------~~~~p~G~gll~~~~~~~~~---K~~~~~~w~~f~Er~~~G~P~~~-g--k~~~~~~~ 259 (484) +..+...+++.++-.. ..-+.+|.|+...++..+.- .+..... + +-+ +-+..+. . .--.+..+ T Consensus 265 G~~IH~SRL~~~vg~pvpd~LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~-L--i~~--~~~~~l~~dla~aL~~g~~ 339 (698) T protein:vir:10 265 GSEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSD-I--VKQ--FSVSGILMDLAQALTPGAN 339 (698) T ss_pred cceecceeEEEecCCCchhhhcchhccCCccHHHHHHHHHHHHHHHhhhHHH-H--HHH--hhHHHHHHHHHHhcCChhh Confidence 1134444444443321 12356789988888765431 1111111 1 110 0111110 0 00011122 Q ss_pred HHHHHHHHHHHHHhcCCceEEEccC-CceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhc---ccccccchhhHHHH Q lcl|NC_021302. 260 DRMDELLEIASNYSGGESAGLALTA-GEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLN---LDGKGGSYALASVQ 335 (484) Q Consensus 260 ~~~~~l~~~l~~~~~g~~a~~vip~-~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt---~~~~gGs~A~~evh 335 (484) .++..-.+++..++.. ....++.+ +.+++.++++-+ ....++...-.+||-+. +-.+| ..+-.|=.|.|+.- T Consensus 340 ~~l~~R~eli~~~Rsn-~G~~llDk~~Eefeq~st~lS--GLddVi~qf~q~VAgaa-~IPltkLfGqSPkGlNATGE~D 415 (698) T protein:vir:10 340 VDLSMRAELINRYRDN-RNILFLDKATEEFFQFNTPLS--GLDALQAQAQEQMSAVS-HIPLIKLLGITPTGLNASSEGE 415 (698) T ss_pred HHHHHHHHHHHHhcCc-cceEEEecCCcceEEEecCcC--CHHHHHHHHHHHHHhhh-cCchhhhhccCCcccCccchhh Confidence 2333334555666544 45667874 678887776433 35566666666666652 22222 12223445666766 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCC-CcH-------HHHHHHHHHHHhcCcccCCccc Q lcl|NC_021302. 336 ADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIG-SRQ-------DATAAALQMLVNAGLLTPDPRL 407 (484) Q Consensus 336 ~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~-~~~-------~~~ae~~~~L~~~G~~~~~~~~ 407 (484) ..+.-+.+++....--..+-+.|+.-|..--||...+-..|+|...- -+. ++.|+.++.+.+.|++.+ T Consensus 416 ~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idp~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~---- 491 (698) T protein:vir:10 416 IRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNALRELDDLEVAEARYKQAQSDVLYVQEQVIRP---- 491 (698) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCH---- Confidence 77777777777654433333445555444446654333456665432 122 345677888889998654 Q ss_pred HHHHHHHhCCCCCCCC-----ccccc-ccCC-----------CcCCCccccC--CCCcccccccccccccccccc--ccc Q lcl|NC_021302. 408 EAFLRDAAGLPGPDPD-----ADDDE-STAD-----------TGQDEPETDE--PALPNTSGTTSTTNAPQARKR--PRG 466 (484) Q Consensus 408 ~~~i~e~~glp~p~~~-----e~~~~-~~~~-----------~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~--~~~ 466 (484) +.++.++.-.....- +++++ .+++ ....-++... .......|...+.++.+.... +.- T Consensus 492 -~evr~rL~~d~~s~Y~~~~d~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 570 (698) T protein:vir:10 492 -DQVAARLNTEPDGPYAGKLDANDDPGAPADDDIDGVLTYVQRMAEGGDTGAPTAPGGARAGATAPPAAANVNANANPRE 570 (698) T ss_pred -HHHHHHHhccCCCccccccCCcccCCCCCCCcchHHHhhhcCCcCCCCcccccccccccCCCCCCcccccccCCCCccc Confidence 567776643211100 00000 0000 0000000001 111111222111111111100 000 Q ss_pred cchHHHhcCcccC--------------cccCC Q lcl|NC_021302. 467 RSPRDRRKTPDGA--------------MPLWD 484 (484) Q Consensus 467 ~~~~~~~~~~~~~--------------~~~~~ 484 (484) ....+.....-+. .-.|. T Consensus 571 ~~~~~~~~~a~giv~~~g~~vLL~~r~~g~W~ 602 (698) T protein:vir:10 571 AGAQDAAMRAAGIVFRAGDKVLLMKRPAGDWG 602 (698) T ss_pred cCcccceeeEEEEEEEcCCeEEEEEecCCCcc Confidence 0000000000000 11233 No 236 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=81.74 E-value=0.083 Score=26.51 Aligned_cols=393 Identities=11% Similarity=0.092 Sum_probs=153.4 Q ss_pred CCCCCCCccceeeee-cccccchhhhhhhcccccccccccccchHHHHHHHHhcchH--------------------H-H Q lcl|NC_021302. 1 MAPKTVAPRTERGYV-NPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEAR--------------------I-A 58 (484) Q Consensus 1 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~--------------------v-~ 58 (484) |.++..-.++-..+. .+. ...+.. .-..|+.|+.|-+.+.+ + . T Consensus 14 ~~~~~~~~~~~~~i~d~~~------------i~~~~~---~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~slnl~~ 78 (505) T protein:vir:79 14 GSAAVGMTKSLGQIIDDPR------------INLPAD---EVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSVNVTK 78 (505) T ss_pred hhhhhcchhhhhhhhcccC------------CCCCHH---HHHHHHHHHHHhcCCCccccccccCCCccccceeecchHH Confidence 222111111000000 000 000000 01223333333322211 0 1 Q ss_pred HHHHHHHHHhhCCCcEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEe Q lcl|NC_021302. 59 SVLRAIGLPIRRTDWRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYF 137 (484) Q Consensus 59 s~l~~r~~~v~~~~~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~ 137 (484) .+..+.-.-|.+-+-.|... +.+..+++.+. +..-.|...+... ..|..+|=.++=+.|. T Consensus 79 ~i~~~~A~ll~~e~~~i~~~--d~~~~e~l~~i-----------------~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D 139 (505) T protein:vir:79 79 LASAKLASLIFNEQCQVTVS--DETANDFLDDV-----------------FQQNDFYTTFEEKLEEWIALGSGCVRPYVD 139 (505) T ss_pred HHHHHHHhhhcCCCceeecC--ChHHHHHHHHH-----------------HHhccHHHHHHHHHHHHhhcCCeEEEEEEe Confidence 11222222333333333221 22233333332 2233466666555 5788899888877775 Q ss_pred ecCCeeeeeeeeeeCccceeeeeecCCCc--eeeeecc---cccc-------------cccccc--eecc---CCCCcc- Q lcl|NC_021302. 138 YEGGRFWLKRLAPRPQSSIAYWNVDRDGG--LISIQQW---PAGT-------------FGGPGM--VVMA---PNSMGP- 193 (484) Q Consensus 138 ~~~g~~~~~~l~~r~~~~~~~~~~~~dg~--l~~~~q~---~~~~-------------~~~~~~--~~~~---~~~~~~- 193 (484) +|.+ +|.++++..|.-..++..+. +.-...+ .... .+.... ..+. ...-|. T Consensus 140 --~~~~---~i~~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~ 214 (505) T protein:vir:79 140 --SGKI---KLAWATADQVYPLQADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGIN 214 (505) T ss_pred --CCce---EEEEEcCCeeEEEEEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcc Confidence 3332 23334443332111222111 0000000 0000 000000 0000 000010 Q ss_pred -------------------cccccceEEEee----cCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEE Q lcl|NC_021302. 194 -------------------AIPVEQLVVYTH----DMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLK 250 (484) Q Consensus 194 -------------------~lp~~k~l~~~~----~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~ 250 (484) .++.--|.+++. .....+|+|.|.+..+--..-.-...+..|+.-++. .-..+.+ T Consensus 215 v~l~~~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~--g~~~i~v 292 (505) T protein:vir:79 215 VPLNSLEQYEGLEPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKK--GQRRLIV 292 (505) T ss_pred cchhhcccccccCcceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHh--cccceee Confidence 111112444432 234578999999999886665556666666555552 1222332 Q ss_pred ----ec-CCC--CCCHHHHHHHHHHHHHHhcCCceEEEcc---CCceEEEecccCCchhHHHHHHHHHHHHHHHH-hh-h Q lcl|NC_021302. 251 ----GN-EAD--SEDDDRMDELLEIASNYSGGESAGLALT---AGEEAGILSPNGTPLDPRRAIEYHDHQMALVA-LA-H 318 (484) Q Consensus 251 ----gk-~~~--~~~~~~~~~l~~~l~~~~~g~~a~~vip---~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~i-lG-q 318 (484) .+ .+. +........+. ..+...+..+. .+..|+.++..-....|...++.+=++|+..+ ++ + T Consensus 293 ~~~~l~~~~~~~~~~~~~~~~~f------d~~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~ 366 (505) T protein:vir:79 293 PAEWLKTGSSYGGQASETHPPMF------DPDETVYQAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQG 366 (505) T ss_pred chHHhcccCCCCcccccccccCC------CccceeeeeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChh Confidence 01 111 11100000000 00111111111 12335555554444566666666666666554 33 2 Q ss_pred hhcccccccchhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CC-------Cc--cccceEEecC-C Q lcl|NC_021302. 319 FLNLDGKGGSYALASVQ--ADTFVQSVQTVADEIRDVAQAHVVEDIVDVN-----WG-------ED--EPAPLLVFDE-I 381 (484) Q Consensus 319 tlt~~~~gGs~A~~evh--~~v~~~~~~aD~~~i~~~ln~qli~~l~~~N-----f~-------~~--~~~P~~~~~~-~ 381 (484) +++.++. |....-++. +.-....+..-.+.+...| ++|++.++.+. |. .. ..-+.+.|++ . T Consensus 367 ~~~~~~~-~~~TAtei~s~~~~l~~t~~~~~~~~~~al-~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i 444 (505) T protein:vir:79 367 TFTTSPS-GIQTATEVVTNNSQTYQTRSSYITQVEKTI-KALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGV 444 (505) T ss_pred hcCCCcc-ccchHHHHHHHHhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCC Confidence 3333332 222122332 2234444455566677777 45777766532 11 11 1123577864 5 Q ss_pred CCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHhCCCCCCCCcccccccCCCcCCCccccCCCCcccccc Q lcl|NC_021302. 382 GSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAAGLPGPDPDADDDESTADTGQDEPETDEPALPNTSGT 451 (484) Q Consensus 382 ~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~glp~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (484) ..|.++.++...+++..|+.. .+.++.+.+|+.+.+-.+++..-..... ...|....--|+ T Consensus 445 ~~d~~~~~~~~~~~v~~Gi~s----~e~~l~~~~~~~eeea~~el~ri~~E~~-----~~~p~~~~~gg~ 505 (505) T protein:vir:79 445 FVDQESKRAADLQAVQAQVMP----KKQFLMRNYGLDEEEADEWLAQIDAENS-----TAEPEFNQFGGD 505 (505) T ss_pred CCCHHHHHHHHHHHHHcCCCC----HHHHHHhcCCCChHHHHHHHHHHHHhcc-----ccCCCchhccCC Confidence 567777788888999999743 2567888889865332222211111111 111111111111 No 237 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=80.99 E-value=0.09 Score=26.32 Aligned_cols=422 Identities=11% Similarity=0.054 Sum_probs=172.5 Q ss_pred CCCCCCCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhCCCcE---Eec Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRTDWR---IRP 77 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~~~~---v~p 77 (484) ++|+......++..-......+....+.+..... .......|+.|++|+ ..+.|-++++.....+.-.+-. |+- T Consensus 30 ~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~--~~~~~~LI~~YR~ma-~~pEvd~Av~eIvneaiv~d~~~~pV~l 106 (516) T protein:vir:10 30 ATPKKDDGATEIEAREGESSYNALMQQFFGIDNN--ISGTKDLINTYRQLT-NNPEVERAVANIVNEAVVYEKGHKVVSL 106 (516) T ss_pred cCCCCccCceeeecCcccccccceeeeeecccCc--cccHHHHHHHHHHhh-hccchhHHHHHhhcceeEecCCCceEEE Confidence 3333333333332100000111111111111111 112356799999998 4999999999888766544321 111 Q ss_pred CCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeeeeeeeeeCccce Q lcl|NC_021302. 78 NGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWLKRLAPRPQSSI 156 (484) Q Consensus 78 ~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~~ 156 (484) +-++-+..+.+.+-+.. .+...+.-.+|+.-..++. .-...|--.+.++= ++..-.+.+|...+|+.+ T Consensus 107 ~l~~~e~s~sik~kI~e---------eF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKii--d~~k~GI~elr~lDPr~i 175 (516) T protein:vir:10 107 DLDDTEFSSSIKDKILE---------EFDEICRLLDASRKLDTLFRRWYIDSRIFFHKIM--PNPKEGIVELRRLDPRHV 175 (516) T ss_pred EecccccchHHHHHHHH---------HHHHHHHHhccchhhhHHHHhhhhcceEEEEEEe--cCcccceeeeeeeCCcce Confidence 11121222222222211 1111111122322211111 11112333333221 122333456666677766 Q ss_pred eeeeec---CCCceeeeec----ccccccccccceec--cCCCCcccccccceEEEeecCc--cCccccchhHHHHHHHH Q lcl|NC_021302. 157 AYWNVD---RDGGLISIQQ----WPAGTFGGPGMVVM--APNSMGPAIPVEQLVVYTHDMD--PGVWTGNSLLRPAYKNW 225 (484) Q Consensus 157 ~~~~~~---~dg~l~~~~q----~~~~~~~~~~~~~~--~~~~~~~~lp~~k~l~~~~~~~--~~~p~G~gll~~~~~~~ 225 (484) .+.+.- ..++...+.. +.-........... ......+.|| ...|+|.|..- .++..=.|.|.++..++ T Consensus 176 ~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~-~daI~y~hSGl~d~~~~~i~syLhkAiKp~ 254 (516) T protein:vir:10 176 EYYREIVTSDVGGTSVVKGYREFFVYTTGNEGYAYNGRLFEPNTRIKIP-RSAIVYAHSGLQDCSDRGIVGYLHNAVKPA 254 (516) T ss_pred eeEEeeecccCcchhhhhceeeeeeeecCccceeccccccCCCCceecc-hhheeeeecCcccCCCCceeceehhhhHhH Confidence 654432 1111110000 00000000000000 0112223343 34677777532 11122257888888777 Q ss_pred HHHHHHHHHHHHH-HHHhcCCcceEE---ecCCCCCCHHHHHHHHHHHHHHhcC----CceEEE--------------cc Q lcl|NC_021302. 226 KLKDELIRIEAAA-IRRHGIGVPYLK---GNEADSEDDDRMDELLEIASNYSGG----ESAGLA--------------LT 283 (484) Q Consensus 226 ~~K~~~~~~w~~f-~Er~~~G~P~~~---gk~~~~~~~~~~~~l~~~l~~~~~g----~~a~~v--------------ip 283 (484) ==-++..-..+.| +-| -|=+. ...+.-...+.-+.|.+++..+++. .++|=| +| T Consensus 255 NQLkm~EDAlVIYRitR----APeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TGev~ddrk~msMlEDyWLp 330 (516) T protein:vir:10 255 NQLKLLEDALVIYRITR----APERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLM 330 (516) T ss_pred HhhHHHHhhHHHHhhhc----cccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhccc Confidence 5555555544444 222 12111 1223334444445666776666543 112211 22 Q ss_pred -----CCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhccccccc-----chhhHHHHHHH-HHHHHHHHHHHHHH Q lcl|NC_021302. 284 -----AGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGG-----SYALASVQADT-FVQSVQTVADEIRD 352 (484) Q Consensus 284 -----~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gG-----s~A~~evh~~v-~~~~~~aD~~~i~~ 352 (484) .|++|.++.++.+. .-.+=++|..+.+-+++..+.--.+.++| +++..-+..++ |...+......++. T Consensus 331 RReGgrgTEItTLpGgqnl-gem~DV~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~ 409 (516) T protein:vir:10 331 RRDGKSVTEVTSLPGAQTM-GEMDDVRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDELDFRKFIVQLQHNFEE 409 (516) T ss_pred ccCCCcccceeeccccCCc-ChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHH Confidence 47889888765332 33445889999999997776533332221 34433334444 34445555565555 Q ss_pred HHHHHHHHHHHHhC------CCCccccceEEecCCC--C---cHHHHHHHHHHHHhcCcccCCcccHHHHHHH-hCCCCC Q lcl|NC_021302. 353 VAQAHVVEDIVDVN------WGEDEPAPLLVFDEIG--S---RQDATAAALQMLVNAGLLTPDPRLEAFLRDA-AGLPGP 420 (484) Q Consensus 353 ~ln~qli~~l~~~N------f~~~~~~P~~~~~~~~--~---~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~-~glp~p 420 (484) .|..-|-..|+--+ |......-+|.|.... . +.+.+.+++..|..+-=.+....+.+|+++. +.++.. T Consensus 410 lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~~yi~k~ILr~tDe 489 (516) T protein:vir:10 410 IFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKYVSHDYVMKNILQMTDE 489 (516) T ss_pred HHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHh Confidence 55544444444333 2221222244442211 1 2233445555555433223345678898654 455433 Q ss_pred CCCccc--cccc-CCC--cCCCccccC Q lcl|NC_021302. 421 DPDADD--DEST-ADT--GQDEPETDE 442 (484) Q Consensus 421 ~~~e~~--~~~~-~~~--~~~~~~~~~ 442 (484) +-.+.. .... ..+ ..|+.+... T Consensus 490 ei~~~~k~I~~E~~~~~~~~p~~e~~f 516 (516) T protein:vir:10 490 QIAQEEKQIEKEANVKRFQNPENEDDF 516 (516) T ss_pred HHHHHHHHHHHhhhCCCCCCCCccccC Confidence 211111 1000 000 011111111 No 238 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=75.63 E-value=0.14 Score=25.19 Aligned_cols=437 Identities=14% Similarity=0.065 Sum_probs=167.9 Q ss_pred CCCCCCCccceeeeeccc-----ccchh----hhhhhccc-ccccccccc-----cchHHHHHHHHhcchHHHHHHHHHH Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPL-----AGFGT----FLAQGLDQ-FEQVDELRW-----PNSVYTYTRMCREEARIASVLRAIG 65 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~-----~~~~~----~~~~~~~~-~~~~~~lr~-----~~~~~~y~~m~~~D~~v~s~l~~r~ 65 (484) -||-+- .|+.+.+... ..++. ...+++.- -...+.|.| =-++..-..|. +-+.+.++....- T Consensus 57 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~la-Q~~eyr~~~~~ia 133 (695) T protein:vir:78 57 AAPVAE--PSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLA-QLPEYRAMHEVLA 133 (695) T ss_pred cccccC--CCcccccceeceeccccCCccccchhhhhhcccccccccchhhhccCcchHHHHHHHh-hccchhhHHHHHH Confidence 122110 0111111110 00000 00011110 000011111 01233344454 4677777777766 Q ss_pred HHhhCCCcEEecCC-------------------CCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCC-HHHHHHHHHHHH Q lcl|NC_021302. 66 LPIRRTDWRIRPNG-------------------ARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFS-WDQHLRLALKSL 125 (484) Q Consensus 66 ~~v~~~~~~v~p~~-------------------~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~i~~~l~a~ 125 (484) ....+. |.-.-.+ .+++..+.+..++.. .. |+.+..-+--+. T Consensus 134 ~e~~R~-w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~er-----------------L~V~~~l~eaik~aR 195 (695) T protein:vir:78 134 DECIRT-WGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIER-----------------LRIRDAVRTTVIHDQ 195 (695) T ss_pred HHhhcc-cceeccccchhhhhhcccccccccccccHHHHHHHHHHHHH-----------------HHHHHHHHHHHHhhc Confidence 665554 6321111 111333333333221 12 344555555799 Q ss_pred hhcceeeeEEEeecCC--------------eeeeeeeeeeCccceeeeeecCCCceeeeecccccccccccceeccCCCC Q lcl|NC_021302. 126 QFGHAVFEQTYFYEGG--------------RFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSM 191 (484) Q Consensus 126 ~~G~s~~Eivw~~~~g--------------~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~ 191 (484) +||-+++=+.=.-++. .-.++.|..++|.|+.--.++. ..-....++.+..+... T Consensus 196 lfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~-------~dP~spdfgkP~~y~V~---- 264 (695) T protein:vir:78 196 AFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNS-------INPVADDFYKPSTWWMI---- 264 (695) T ss_pred cccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhh-------ccchhhccCCCceEEEe---- Confidence 9999984432222221 0112224444444432110000 00011122222222221 Q ss_pred cccccccceEEEeecC------ccCccccchhHHHHHHHHH---HHHHHHHHHHHHHHHhcCCcceEE-e--cCCCCCCH Q lcl|NC_021302. 192 GPAIPVEQLVVYTHDM------DPGVWTGNSLLRPAYKNWK---LKDELIRIEAAAIRRHGIGVPYLK-G--NEADSEDD 259 (484) Q Consensus 192 ~~~lp~~k~l~~~~~~------~~~~p~G~gll~~~~~~~~---~K~~~~~~w~~f~Er~~~G~P~~~-g--k~~~~~~~ 259 (484) +..+-..+++.++-.. ..-+.+|.++...++..+. -.+.....-+ + ++ -+..+. . ..-.+..+ T Consensus 265 G~kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li-~-~~---~v~~lk~dla~~L~~g~~ 339 (695) T protein:vir:78 265 GTEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIV-K-QF---SVSGILMDLAQALMPGAN 339 (695) T ss_pred ceEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHH-H-hh---hhHHHHHHHHHhhcChhH Confidence 1123333333333221 1235678998888875532 1122222111 1 11 111110 0 00011122 Q ss_pred HHHHHHHHHHHHHhcCCceEEEccC-CceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhc---ccccccchhhHHHH Q lcl|NC_021302. 260 DRMDELLEIASNYSGGESAGLALTA-GEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLN---LDGKGGSYALASVQ 335 (484) Q Consensus 260 ~~~~~l~~~l~~~~~g~~a~~vip~-~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt---~~~~gGs~A~~evh 335 (484) .++..-.++++.++.. ....++.+ +.+++.++++-+ ....++...-.+||-+. +-.+| ..+-.|=.|.|+.- T Consensus 340 ~~l~~R~eli~~~Rsn-~G~~llDk~~Eefeq~stslS--GLddVi~qf~q~VAgaa-~IPltkLfGqSPkGlNATGE~D 415 (695) T protein:vir:78 340 VDLSMRAELINRYRDN-RNILFLDKATEEFFQFNTPLS--GLDALQAQAQEQMSAVS-HIPLIKLLGITPTGLNASSEGE 415 (695) T ss_pred HHHHHHHHHHHHhcCc-cceEEEecCCcceEEEecccC--CHHHHHHHHHHHHHhhh-cCchhhhhccCCccccccchhh Confidence 3344344666666544 45667874 678888776433 35566666666666652 22222 12223545667777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccceEEecCCC-CcH-------HHHHHHHHHHHhcCcccCCccc Q lcl|NC_021302. 336 ADTFVQSVQTVADEIRDVAQAHVVEDIVDVNWGEDEPAPLLVFDEIG-SRQ-------DATAAALQMLVNAGLLTPDPRL 407 (484) Q Consensus 336 ~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~~~~P~~~~~~~~-~~~-------~~~ae~~~~L~~~G~~~~~~~~ 407 (484) ..+.-+.+++.....-..+-+.++.-|..--||...+-..|+|...- -+. ++.|+.++.+.+.|++.+ T Consensus 416 ~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~---- 491 (695) T protein:vir:78 416 IRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRP---- 491 (695) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCH---- Confidence 77788888777654444444445555555446654433456665432 122 345677888999998654 Q ss_pred HHHHHHHhCCCCCCCC---------------cccc-cccCC-CcCCCccccCCCCccccccccccccc-----ccccccc Q lcl|NC_021302. 408 EAFLRDAAGLPGPDPD---------------ADDD-ESTAD-TGQDEPETDEPALPNTSGTTSTTNAP-----QARKRPR 465 (484) Q Consensus 408 ~~~i~e~~glp~p~~~---------------e~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~ 465 (484) +.++.++.-.....- .+.. ..+.. +....++.+.+.. ...|...+.... .....+. T Consensus 492 -~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~~~~~~~~~~~~~~~ag 569 (695) T protein:vir:78 492 -DQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGG-ARAGATAPPTVANVNANVKPREAG 569 (695) T ss_pred -HHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCC-CCCCCCCCCceeeeeccccccccC Confidence 578888654321110 0000 00000 0000011111000 111111000000 0000011 Q ss_pred ccchHHH----hcCcccC-------cccCC Q lcl|NC_021302. 466 GRSPRDR----RKTPDGA-------MPLWD 484 (484) Q Consensus 466 ~~~~~~~----~~~~~~~-------~~~~~ 484 (484) +.++... -...+|. .-.|. T Consensus 570 ~~~~~~~aag~v~~~~g~vLl~kr~~g~W~ 599 (695) T protein:vir:78 570 AQDAAMRAAGAVYVVDGKVLLMKRPAGDWG 599 (695) T ss_pred CCCcccceeEEEEEeCCEEEEEEecCCCcc Confidence 1111000 0000111 12244 No 239 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=37.49 E-value=1.1 Score=20.32 Aligned_cols=427 Identities=14% Similarity=0.123 Sum_probs=165.1 Q ss_pred CCCCCCCccceeeeecc-----cccchhhhhhhcccccccc-cccccchHHHHHHHHhcchHHHHHHHHHHHHhhCC--C Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNP-----LAGFGTFLAQGLDQFEQVD-ELRWPNSVYTYTRMCREEARIASVLRAIGLPIRRT--D 72 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~-~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~~--~ 72 (484) +.++..=|...--.|.. +..+.........+..... .+++.....+| |+-=+-.+.+ ...+++. . T Consensus 9 ~p~~~~fp~~~a~wV~~~D~~RlaaY~ly~d~y~n~~~el~~il~G~dr~~~~------~ps~r~~V~~-~~~~Lg~~~~ 81 (563) T protein:vir:74 9 DPAKPFLRGGDDNIVDENDKNRVRAYDLYENIYLNSAETLKLVLRGDDSVPIL------MPSGRKIVEA-VHRFLGVGFD 81 (563) T ss_pred CCCcccccccccccCCHHHHHHHHHHHHHHHhhcCchhhhhhhcCCCceeeec------cchHHHHHHH-HHHhcCCCcE Confidence 22222222221111111 1111111111111111111 12232222222 1111122333 3344455 5 Q ss_pred cEEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEee---cCCeeeeeee Q lcl|NC_021302. 73 WRIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFY---EGGRFWLKRL 148 (484) Q Consensus 73 ~~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~---~~g~~~~~~l 148 (484) |.|+|..+++...+.+...|+.+.+. .+|.....+. .+|+.-|=.|+=+.|.. .+++..+..+ T Consensus 82 ~~Ve~~~~de~~~~avq~~Lr~~~~~-------------e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~~g~R~rv~~v 148 (563) T protein:vir:74 82 YLVEPDMGDEGIRQSLNAYFRTTFKR-------------EAIKAKFTSNKRWGLIRGDAHFYIHADPNKKAGERISVDEV 148 (563) T ss_pred EecCccccCcchHHHHHHHHHHHHHH-------------hhhHHHHHHHHHhhhhhcceeEEEeeccccccCCCceEeec Confidence 55577666665556676776655432 2344433333 57999999999999984 3556655544 Q ss_pred eeeCccceeeee-------------------ecC-CCceeeee-----cccccccccc---------------------- Q lcl|NC_021302. 149 APRPQSSIAYWN-------------------VDR-DGGLISIQ-----QWPAGTFGGP---------------------- 181 (484) Q Consensus 149 ~~r~~~~~~~~~-------------------~~~-dg~l~~~~-----q~~~~~~~~~---------------------- 181 (484) . |.++.-+. .++ .-.+.+.+ .+..+....- T Consensus 149 D---P~~~fp~~dpd~v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae~w~lg~wd~r~~~~~~ 225 (563) T protein:vir:74 149 D---PRQIFLIEDGSTVVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELTHWTLGNWDDRGAISDE 225 (563) T ss_pred C---CceeeeccCCCCcccceeeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccchhccccccccCccchh Confidence 3 33221100 000 00111111 0111110000 Q ss_pred ------cceeccCCCCccccc--cc--ceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEe Q lcl|NC_021302. 182 ------GMVVMAPNSMGPAIP--VE--QLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKG 251 (484) Q Consensus 182 ------~~~~~~~~~~~~~lp--~~--k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~g 251 (484) .........+...+| .. -+++++..+..+..||.|-|..+--...--+....+-...++-.|.||.++.+ T Consensus 226 ~~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~~ 305 (563) T protein:vir:74 226 QARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMYVTNA 305 (563) T ss_pred hhcccchhhhhhhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeEEecc Confidence 000000011111112 11 24456666788999999999988888777777777777777766555555543 Q ss_pred cCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCc---eEEEecccCCchhHHHHHHHHHH-HHHHHHhh-hhh---ccc Q lcl|NC_021302. 252 NEADSEDDDRMDELLEIASNYSGGESAGLALTAGE---EAGILSPNGTPLDPRRAIEYHDH-QMALVALA-HFL---NLD 323 (484) Q Consensus 252 k~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~---~ie~~~~~~~~~~~~~li~~~d~-~Isk~ilG-qtl---t~~ 323 (484) -. ..+.-...+.. -++ |.-+.+-+|... -++.+++..+...++.-+++++. .|+.. -+ .-. |.+ T Consensus 306 ~~---p~d~~~g~~~~--w~v--gpG~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~-s~tPavA~G~vD 377 (563) T protein:vir:74 306 SA---PVDPNTGELTD--WNI--GPMQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEG-SGTPEVAIGRVD 377 (563) T ss_pred cc---ccccccccccc--ccc--CCceeEeccCCccccceeeecchhhhHHHHHHHHHHHHHHHHhh-ccCcceeecccc Confidence 21 11111111100 112 211222344332 24444443333344444444443 33321 11 001 112 Q ss_pred c--cccchhhHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHhC--------CCCccc----cceEEecC-C Q lcl|NC_021302. 324 G--KGGSYALASVQADTFVQS-------VQTVADEIRDVAQAHVVEDIVDVN--------WGEDEP----APLLVFDE-I 381 (484) Q Consensus 324 ~--~gGs~A~~evh~~v~~~~-------~~aD~~~i~~~ln~qli~~l~~~N--------f~~~~~----~P~~~~~~-~ 381 (484) . .-+++|+ ++...-.... +.+-++++..-+...|++.+-.+- ||.... .-.++|.. . T Consensus 378 ~~~~~SGiAL-eL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~~v~ivf~p~~ 456 (563) T protein:vir:74 378 VTSAESGISL-ELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNECSVVCIFADPM 456 (563) T ss_pred cccccchhhh-hhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCceEEEEEeCCCC Confidence 1 1133443 3333333332 344455555555556665544531 222111 11345754 4 Q ss_pred CCcHHHHHHHHHHHHhcCcccCCcccHHHHHHHh---CCCCCCCCccc---------c--------------cccCCCcC Q lcl|NC_021302. 382 GSRQDATAAALQMLVNAGLLTPDPRLEAFLRDAA---GLPGPDPDADD---------D--------------ESTADTGQ 435 (484) Q Consensus 382 ~~~~~~~ae~~~~L~~~G~~~~~~~~~~~i~e~~---glp~p~~~e~~---------~--------------~~~~~~~~ 435 (484) +.+.+...+-+..|+..|++-. .-.-+++ |.|.|+-+.+. + ....+++. T Consensus 457 P~d~~~vv~~~~tl~~aGiiSr-----etAv~~L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~~~a~~~~g~ 531 (563) T protein:vir:74 457 PVNKTQVTQDTLLLQQAHLILR-----KMAVAKLRSIGWEYPEVDDQGNALTDDDIADMLLAEAEADASLGLSAMDNGGA 531 (563) T ss_pred CccHHHHHHHHHHHHHcCchhH-----HHHHHHHHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccCcccceecccCCC Confidence 5666666666667777776421 2222233 55544311100 0 00111111 Q ss_pred CCccccCC-CCccccccc--cccccccccccc Q lcl|NC_021302. 436 DEPETDEP-ALPNTSGTT--STTNAPQARKRP 464 (484) Q Consensus 436 ~~~~~~~~-~~~~~~~~~--~~~~~~~~~~~~ 464 (484) ++.+.+.. .+-..-|.+ .+....+..-+| T Consensus 532 ~~~~~dd~g~p~~~~~~~~~~~~~~~~~~~~~ 563 (563) T protein:vir:74 532 GEQQFDDQGNPIDQFGNPVEIPPDVTQVPLSP 563 (563) T ss_pred CcccccccCCchhHcCCcccCCccccccCCCC Confidence 11111100 000000100 000011111111 No 240 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=35.09 E-value=1.3 Score=20.05 Aligned_cols=437 Identities=12% Similarity=0.050 Sum_probs=160.2 Q ss_pred CCCCCCCcc--ceeeeeccc---ccc-hhh---hhhhccccccccc--ccccchHHHHHHHHhcchHHHHHHHHHHHHhh Q lcl|NC_021302. 1 MAPKTVAPR--TERGYVNPL---AGF-GTF---LAQGLDQFEQVDE--LRWPNSVYTYTRMCREEARIASVLRAIGLPIR 69 (484) Q Consensus 1 ~~~~~~~~~--~~~~~~~~~---~~~-~~~---~~~~~~~~~~~~~--lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~ 69 (484) ||.+--... +-..+...+ |.. -.. ......+.-.+.. ....+ +..+ -|++-+-++++....+. T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~----~~~~--~dst~~~a~~~Laa~l~ 74 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTD----YQTP--WQAVGARGLNNLASKLM 74 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccccc----cccc--ccccHHHHHHHHHHHHH Confidence 766221111 101111111 110 000 0001111000000 00101 1122 35666666666555555 Q ss_pred CC-----Cc-EEecCC-------CCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEE Q lcl|NC_021302. 70 RT-----DW-RIRPNG-------ARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQT 135 (484) Q Consensus 70 ~~-----~~-~v~p~~-------~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eiv 135 (484) +. +| ++...+ .++.....+.+.|.. .+......+.+.+|..-+.+++ +-+.+|-++. T Consensus 75 ~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~------ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--- 145 (536) T protein:vir:21 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSM------VERIIMNYIESNSYRVTLFEALKQLVVAGNVLL--- 145 (536) T ss_pred HhhcCCCcccccccChhhhhccccchhhHHHHHHHHHH------HHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeE--- Confidence 43 34 122211 111222233333321 2223344455667887776665 5556786653 Q ss_pred EeecCCeeeeeeeeeeCccceeeeeecCCCc-------------------------------------eeeeeccccccc Q lcl|NC_021302. 136 YFYEGGRFWLKRLAPRPQSSIAYWNVDRDGG-------------------------------------LISIQQWPAGTF 178 (484) Q Consensus 136 w~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~-------------------------------------l~~~~q~~~~~~ 178 (484) |-..+..-.+..+..+|-..+ .+.-|.+|+ ++.......+ . T Consensus 146 y~~e~~~~~~~~f~~~pl~~~-~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~-~ 223 (536) T protein:vir:21 146 YLPEPEGSNYNPMKLYRLSSY-VVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDED-S 223 (536) T ss_pred EEeeCCCCceeeEEEEEcCeE-EEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEecC-C Confidence 211110000001111110100 011111111 1100000000 0 Q ss_pred ccccceeccCCCCcc---------cccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceE Q lcl|NC_021302. 179 GGPGMVVMAPNSMGP---------AIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYL 249 (484) Q Consensus 179 ~~~~~~~~~~~~~~~---------~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~ 249 (484) .. +.+....++. +....-|++.|+...+|+.||.|....+..-..--+...+.-+...++-.-+ |+. T Consensus 224 ~~---~~~~~e~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~-~~l 299 (536) T protein:vir:21 224 GE---YLRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKV-IGL 299 (536) T ss_pred Cc---EEEEeccCCeeeccccCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcC-Ccc Confidence 00 0000111111 2334568999999999999999999999999888888888888887775433 444 Q ss_pred EecCCCCCCHHHHHHHHHHHHHHhcCCceEEEcc-CCceEEEecc--cCCchhHHHHHHHHHHHHHHHHhhhhhcccccc Q lcl|NC_021302. 250 KGNEADSEDDDRMDELLEIASNYSGGESAGLALT-AGEEAGILSP--NGTPLDPRRAIEYHDHQMALVALAHFLNLDGKG 326 (484) Q Consensus 250 ~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip-~~~~ie~~~~--~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~g 326 (484) +.+.+ ... . .++..+.. +.++| ...++..+.. .+.-..-...|+.+...|.++++...++.- ++ T Consensus 300 v~p~g--~~~--~-------~~~~~~~~-g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~-~~ 366 (536) T protein:vir:21 300 VNPAG--ITQ--P-------RRLTKAQT-GDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQR-TG 366 (536) T ss_pred cCccc--ccc--h-------hhhccCCC-cceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccC-CC Confidence 43322 111 1 11111212 23344 2334555442 222223467899999999999877654432 22 Q ss_pred cchhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc--cccc----eEEecC-CC-----CcHHHHHHHH Q lcl|NC_021302. 327 GSYALASVQADTF--VQSVQTVADEIRDVAQAHVVEDIVDVNWGED--EPAP----LLVFDE-IG-----SRQDATAAAL 392 (484) Q Consensus 327 Gs~A~~evh~~v~--~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~--~~~P----~~~~~~-~~-----~~~~~~ae~~ 392 (484) ...-..||+.... ...+-.-...+...|-.=||.+++.+-+... .+.| +..+.. .. .+.+.+..++ T Consensus 367 ~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~ 446 (536) T protein:vir:21 367 ERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCV 446 (536) T ss_pred CCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHH Confidence 2233345543222 2222222222222222223333222211110 1111 222211 11 1333344556 Q ss_pred HHHHhcCcc-----cCCcccHHHHHHHhCC-CCC--CCCcccccccCCCcCCCc-cccCCCCccccccccccc--ccccc Q lcl|NC_021302. 393 QMLVNAGLL-----TPDPRLEAFLRDAAGL-PGP--DPDADDDESTADTGQDEP-ETDEPALPNTSGTTSTTN--APQAR 461 (484) Q Consensus 393 ~~L~~~G~~-----~~~~~~~~~i~e~~gl-p~p--~~~e~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~--~~~~~ 461 (484) +.|..+|=. +......+++.+.+|+ |.. ...+++...-++...... ++...+..+........+ ..+++ T Consensus 447 ~~la~~~Pe~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 526 (536) T protein:vir:21 447 TAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAA 526 (536) T ss_pred HHHHhhchhhhcccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhHHhh Confidence 666665511 1112233567788898 421 122222111000000000 000000000000000000 00000 Q ss_pred ccccccchHHHhcCcccCc Q lcl|NC_021302. 462 KRPRGRSPRDRRKTPDGAM 480 (484) Q Consensus 462 ~~~~~~~~~~~~~~~~~~~ 480 (484) ....+-.+. + T Consensus 527 ~~~~g~~~~---------~ 536 (536) T protein:vir:21 527 ADSVGLQPG---------I 536 (536) T ss_pred hhccccCCC---------C Confidence 011111111 1 No 241 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=34.65 E-value=1.3 Score=20.00 Aligned_cols=413 Identities=8% Similarity=0.056 Sum_probs=155.3 Q ss_pred CCCCCCCccc---eeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhh------CC Q lcl|NC_021302. 1 MAPKTVAPRT---ERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIR------RT 71 (484) Q Consensus 1 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~------~~ 71 (484) +-|.-..... ..+..++. |.. .+ .|++-+-++++.-..+. +. T Consensus 31 ~lP~~~~~~~~~~~~~~~~~~--------------------~~~-------~i--~dst~~~a~~~Las~L~~~ltPp~~ 81 (547) T protein:vir:10 31 IMPMRSDFFSDLRSEGSINWN--------------------QNR-------EV--FDSTAGDGLETLSSSLHGSLTSPAT 81 (547) T ss_pred hcccccccccCCCCCcccccc--------------------ccc-------cc--ccchHHHHHHHHHHHHHHhhcCCCC Confidence 2232111000 00000000 000 00 12222222222222221 11 Q ss_pred Cc-EEecCCCCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHH-HHHHhhcceeeeEEEeecCCeeeeeeee Q lcl|NC_021302. 72 DW-RIRPNGARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLA-LKSLQFGHAVFEQTYFYEGGRFWLKRLA 149 (484) Q Consensus 72 ~~-~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-l~a~~~G~s~~Eivw~~~~g~~~~~~l~ 149 (484) +| ++.+.+.+......+...|.. .+......+.+.+|...+.++ ++-+.+|-++.=+....+ .-....+. T Consensus 82 ~WF~l~~~d~~~~~~~~v~~~L~~------ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~~--~~~~~r~~ 153 (547) T protein:vir:10 82 KWFELAFRDKELNSDDECRKWLEN------ATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDED--EEGSVVFQ 153 (547) T ss_pred cccccccCCccccchHHHHHHHHH------HHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCCC--CCCceeEE Confidence 22 233333221111222233221 122233344456787766655 466678877543322111 00111122 Q ss_pred eeCccceeeeeecCCCceeeeec------------ccccc------------ccc------------------c-----c Q lcl|NC_021302. 150 PRPQSSIAYWNVDRDGGLISIQQ------------WPAGT------------FGG------------------P-----G 182 (484) Q Consensus 150 ~r~~~~~~~~~~~~dg~l~~~~q------------~~~~~------------~~~------------------~-----~ 182 (484) .+|-..+ .+.-|.+|++..+-+ ++... .+. . + T Consensus 154 ~~pl~~~-~v~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~~ 232 (547) T protein:vir:10 154 SSPIQDS-YFEEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAGT 232 (547) T ss_pred EeecceE-EEeeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCccccc Confidence 3332322 233333443321110 00000 000 0 0 Q ss_pred ----------ceeccCCCCcc-----cccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc Q lcl|NC_021302. 183 ----------MVVMAPNSMGP-----AIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVP 247 (484) Q Consensus 183 ----------~~~~~~~~~~~-----~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P 247 (484) -+.....+..+ ..+..-|++.|+....|+.||.|....+..-..--+...+.-+..+++-. -.| T Consensus 233 ~~~~~~~p~~s~~~e~~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~-~pp 311 (547) T protein:vir:10 233 VLAPTERPFGKKWILKEGAVQLGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVI-DPA 311 (547) T ss_pred eeeccccceeEEEEEecCceeeeecCCcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHh-cCc Confidence 00000000001 12345699999999999999999999999999888888888899999854 335 Q ss_pred eEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhccccccc Q lcl|NC_021302. 248 YLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGG 327 (484) Q Consensus 248 ~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gG 327 (484) +.+... +.... .++..| +..+......+.-++.++.-..-...|+.+...|..+++...+... ++. T Consensus 312 ~~v~~~--g~~~~---------~~~~pg--g~~~~~~~~~v~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~-~~~ 377 (547) T protein:vir:10 312 IMVTER--GLISD---------IDLGAS--GLTVVRDMESMKPFESRARFDVSSIQLTDLRSAVRRIYYVDQLQMK-DSP 377 (547) T ss_pred eecccc--ccccc---------ceecCC--eeeecCCcccceeeecccchHHHHHHHHHHHHHHHHHhhhhhhhcC-CCc Confidence 554322 21211 112223 2233333344554554433333457899999999999887655442 233 Q ss_pred chhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCccccc-----------eEEecCC-CC-----cHH Q lcl|NC_021302. 328 SYALASVQADTF--VQSVQTVADEIRDVAQAHVVEDIVDVNW--GEDEPAP-----------LLVFDEI-GS-----RQD 386 (484) Q Consensus 328 s~A~~evh~~v~--~~~~~aD~~~i~~~ln~qli~~l~~~Nf--~~~~~~P-----------~~~~~~~-~~-----~~~ 386 (484) ..-..||+.... ...+-.....+...|-.-+|...+.+-+ +.-.+.| ++++... .. +.. T Consensus 378 ~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~~~ 457 (547) T protein:vir:10 378 AMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAA 457 (547) T ss_pred cccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEEeccHHHHHHHHHHHH Confidence 344455554322 2222222222222222223222222221 1111111 1222110 00 011 Q ss_pred H---HHHHHHHHHhcCc----ccCCcccHHHHHHHhCCCCCC--CCcccccccCCCcCCCccccCCCCccccccc-cccc Q lcl|NC_021302. 387 A---TAAALQMLVNAGL----LTPDPRLEAFLRDAAGLPGPD--PDADDDESTADTGQDEPETDEPALPNTSGTT-STTN 456 (484) Q Consensus 387 ~---~ae~~~~L~~~G~----~~~~~~~~~~i~e~~glp~p~--~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 456 (484) . +.+.+..|.+++= .+.......++.+.+|+|..- .++++...- ..+...++.. .... ..++ T Consensus 458 ~i~~~~~~v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r--~qr~~~~q~~------~qaa~~~~~ 529 (547) T protein:vir:10 458 SIERWAGSTAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQTLMRPKAKVTSIR--KNRSQTQQKA------EQAAIAEAE 529 (547) T ss_pred HHHHHHHHHHHhhccChhhhhcCCHHHHHHHHHHHhCCChhccCCHHHHHHHH--HHHHHHHHHH------HHHHHHHHH Confidence 1 1122222222221 111122345678889998431 122211100 0000000000 0000 0001 Q ss_pred cccccccccccchHHHhc Q lcl|NC_021302. 457 APQARKRPRGRSPRDRRK 474 (484) Q Consensus 457 ~~~~~~~~~~~~~~~~~~ 474 (484) +...++...+.++...|+ T Consensus 530 g~~m~~~~~~~a~~~~~~ 547 (547) T protein:vir:10 530 GNAMEAQGKGQAALKENQ 547 (547) T ss_pred HHHHHhhcCcccchhccC Confidence 111111222222223333 No 242 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=34.03 E-value=1.3 Score=19.93 Aligned_cols=438 Identities=12% Similarity=0.052 Sum_probs=160.5 Q ss_pred CCCCCCCcc--ceeeeeccc---ccch-hh---hhhhccccccccc--ccccchHHHHHHHHhcchHHHHHHHHHHHHhh Q lcl|NC_021302. 1 MAPKTVAPR--TERGYVNPL---AGFG-TF---LAQGLDQFEQVDE--LRWPNSVYTYTRMCREEARIASVLRAIGLPIR 69 (484) Q Consensus 1 ~~~~~~~~~--~~~~~~~~~---~~~~-~~---~~~~~~~~~~~~~--lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~ 69 (484) ||.+--... +-..+...+ |... .. ......+.-.+.. .+..+ +..+ -|++-+-++++....+. T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~----~~~~--~dst~~~a~~~Laa~l~ 74 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTD----YQTP--WQAVGARGLNNLASKLM 74 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCCCCccccc----cccc--ccccHHHHHHHHHHHHH Confidence 766221111 111111111 1100 00 0001111000000 01111 1122 35666666666555555 Q ss_pred CC-----Cc-EEecCC-------CCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEE Q lcl|NC_021302. 70 RT-----DW-RIRPNG-------ARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQT 135 (484) Q Consensus 70 ~~-----~~-~v~p~~-------~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eiv 135 (484) +. +| ++...+ .++.....+.+.|.. .+......+.+.+|..-+.+++ +-+.+|-++. T Consensus 75 ~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~------ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--- 145 (536) T protein:vir:10 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSM------VERIIMNYIESNSYRVTLFEALKQLVVAGNVLL--- 145 (536) T ss_pred hhhcCCCcccccccChhhhhccccchhhHHHHHHHHHH------HHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeE--- Confidence 43 34 122211 111222233333321 2223344455667887776665 5556786653 Q ss_pred EeecCCeeeeeeeeeeCccceeeeeecCCCceeeee----------------------------------c--ccccccc Q lcl|NC_021302. 136 YFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQ----------------------------------Q--WPAGTFG 179 (484) Q Consensus 136 w~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~----------------------------------q--~~~~~~~ 179 (484) |-..+..-.+..+..+|-..+ .+.-|.+|++..+- + .+....+ T Consensus 146 y~~e~~~~~~~~~~~~pl~~~-~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~~~~ 224 (536) T protein:vir:10 146 YLPEPEGSNYNPMKLYRLSSY-VVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEASG 224 (536) T ss_pred EEeeCCCCceeeEEEEEcCeE-EEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEEecCCC Confidence 211110000011111111111 01111111111000 0 0000000 Q ss_pred cccceeccCCCCcc---------cccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEE Q lcl|NC_021302. 180 GPGMVVMAPNSMGP---------AIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLK 250 (484) Q Consensus 180 ~~~~~~~~~~~~~~---------~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~ 250 (484) . +.+....++. ++...-|++.|+...+|+.||.|....+..-..--+...+.-+...++-.-+ |+.+ T Consensus 225 --~-~~~~~e~~g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~-~~lv 300 (536) T protein:vir:10 225 --E-YLRYEEVEGMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKV-IGLV 300 (536) T ss_pred --c-EEEEEeecCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcC-Cccc Confidence 0 0011111122 2234568999999999999999999999999888888888888887775433 4544 Q ss_pred ecCCCCCCHHHHHHHHHHHHHHhcCCceEEEcc-CCceEEEecc--cCCchhHHHHHHHHHHHHHHHHhhhhhccccccc Q lcl|NC_021302. 251 GNEADSEDDDRMDELLEIASNYSGGESAGLALT-AGEEAGILSP--NGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGG 327 (484) Q Consensus 251 gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip-~~~~ie~~~~--~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gG 327 (484) .+.+ ... . .++..+.. +.++| ...++..+.. .+.-..-...|+.+...|.++++...++.- ++. T Consensus 301 ~p~g--~~~--~-------~~~~~~~~-g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~-~~~ 367 (536) T protein:vir:10 301 NPAG--ITQ--P-------RRLTKAQT-GDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQR-TGE 367 (536) T ss_pred Cccc--ccc--h-------hhhccCCC-cceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccC-CCC Confidence 3322 111 1 11111212 23344 2334555442 222223467899999999999877655432 222 Q ss_pred chhhHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc--cccc----eEEecC-CC-----CcHHHHHHHHH Q lcl|NC_021302. 328 SYALASVQADTF--VQSVQTVADEIRDVAQAHVVEDIVDVNWGED--EPAP----LLVFDE-IG-----SRQDATAAALQ 393 (484) Q Consensus 328 s~A~~evh~~v~--~~~~~aD~~~i~~~ln~qli~~l~~~Nf~~~--~~~P----~~~~~~-~~-----~~~~~~ae~~~ 393 (484) ..-..||+.... ...+-.-...+...|-.=||.+++.+-+... .+.| +..+.. .. .+.+.+..+++ T Consensus 368 r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~ 447 (536) T protein:vir:10 368 RVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVT 447 (536) T ss_pred CccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHH Confidence 233345543222 2222222222222222223333222211110 1111 222211 11 12333445566 Q ss_pred HHHhcCcc-----cCCcccHHHHHHHhCC-CCC--CCCcccccccCCCcCC-CccccCCCCccccccccccc--cccccc Q lcl|NC_021302. 394 MLVNAGLL-----TPDPRLEAFLRDAAGL-PGP--DPDADDDESTADTGQD-EPETDEPALPNTSGTTSTTN--APQARK 462 (484) Q Consensus 394 ~L~~~G~~-----~~~~~~~~~i~e~~gl-p~p--~~~e~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~--~~~~~~ 462 (484) .|..+|=. +......+++.+.+|+ |.. ...+++...-++.... ..++...+...........+ ..+++. T Consensus 448 ~la~~~P~~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 527 (536) T protein:vir:10 448 AWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAA 527 (536) T ss_pred HHHhhchhhhcccCCHHHHHHHHHHHcCCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhHHhhh Confidence 66655411 1111233557788898 421 2222221110000000 00000000000000000000 000000 Q ss_pred cccccchHH Q lcl|NC_021302. 463 RPRGRSPRD 471 (484) Q Consensus 463 ~~~~~~~~~ 471 (484) ...+-.+.. T Consensus 528 ~~~g~~~~~ 536 (536) T protein:vir:10 528 DSVGLQPGI 536 (536) T ss_pred hccccCCCC Confidence 111111111 No 243 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=30.28 E-value=1.6 Score=19.48 Aligned_cols=425 Identities=11% Similarity=0.066 Sum_probs=154.0 Q ss_pred CCCCCCCccceeeeeccccc---chh-h---hhhhccccccc--ccccccchHHHHHHHHhcchHHHHHHHHHHHHhhC- Q lcl|NC_021302. 1 MAPKTVAPRTERGYVNPLAG---FGT-F---LAQGLDQFEQV--DELRWPNSVYTYTRMCREEARIASVLRAIGLPIRR- 70 (484) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~---~~~-~---~~~~~~~~~~~--~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~- 70 (484) |-.+.- .+...+.. ... . ......+.--+ ...+..+ ...+ .|++-+-++++....+.+ T Consensus 1 mk~~a~------~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~----~~~~--~dstg~~a~~~Laa~l~~~ 68 (542) T protein:vir:78 1 MKGLAQ------ARYSAMRADREDFLDMARRCAALTLPYLLTEDGHASGGR----LQQP--YQSLGSKGVNALSSKLMLS 68 (542) T ss_pred ChhHHH------HHHHHHHHHhhHHHHHHHHHHHHhccccCCCCCCccccc----cccc--ccchHHHHHHHHHHHHHHh Confidence 111000 00000000 000 0 00000000000 0000000 0111 244444444444443332 Q ss_pred -----CCc-EEecCC--------CCHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceeeeEE Q lcl|NC_021302. 71 -----TDW-RIRPNG--------ARPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVFEQT 135 (484) Q Consensus 71 -----~~~-~v~p~~--------~~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~Eiv 135 (484) .+| ++.+.+ .+++....++..|.. .+......+...+|..-+.+++ +-+.+|-+++ T Consensus 69 ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~------ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--- 139 (542) T protein:vir:78 69 LFPIQTSFFKLQINDAEIASVPELTPEVRSEIDMNLSK------MEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLV--- 139 (542) T ss_pred hcCCCCccccccCCHHHHHhhccCChhhHHHHHHHHHH------HHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEE--- Confidence 234 233321 122222223333321 1112223334456766555554 6677887643 Q ss_pred EeecCCeeeeeeeeeeCccceeeeeecCCCceeeeeccc----------------------------------------c Q lcl|NC_021302. 136 YFYEGGRFWLKRLAPRPQSSIAYWNVDRDGGLISIQQWP----------------------------------------A 175 (484) Q Consensus 136 w~~~~g~~~~~~l~~r~~~~~~~~~~~~dg~l~~~~q~~----------------------------------------~ 175 (484) |.-.+. +..+|-..+ .+.-|.+|++..+-... . T Consensus 140 ~~~~~~------~~~~pl~~y-~v~~d~~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr 212 (542) T protein:vir:78 140 FAGKKT------LKVYPLDRY-VIERDGDGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGR 212 (542) T ss_pred EecCCC------ceEEeccee-EEeeCCCCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecc Confidence 432221 111111111 12233333322111000 0 Q ss_pred cccc-------cccceeccCCCCcc---------cccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021302. 176 GTFG-------GPGMVVMAPNSMGP---------AIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAI 239 (484) Q Consensus 176 ~~~~-------~~~~~~~~~~~~~~---------~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~ 239 (484) .... ....+.+....++. +....-|++.|+...+|+.||.|....+..-..--+...+.-+..+ T Consensus 213 ~~~~~~~~~~~~~~~~s~~~e~~g~~v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~ 292 (542) T protein:vir:78 213 NDAEVFTCCKLVDGQHRWHQECDGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEGS 292 (542) T ss_pred cCCccccccccCCCeEEEEEEeccccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 00001111111111 2333568999999999999999999999999998889999999999 Q ss_pred HHhcCCcceEEecCCCCCCHHHHHHHHHHHHHHhcCCceEEEccCCceEEEeccc--CCchhHHHHHHHHHHHHHHHHhh Q lcl|NC_021302. 240 RRHGIGVPYLKGNEADSEDDDRMDELLEIASNYSGGESAGLALTAGEEAGILSPN--GTPLDPRRAIEYHDHQMALVALA 317 (484) Q Consensus 240 Er~~~G~P~~~gk~~~~~~~~~~~~l~~~l~~~~~g~~a~~vip~~~~ie~~~~~--~~~~~~~~li~~~d~~Isk~ilG 317 (484) ++-.- .|+++.+. +..+ ..++..+...+++-....+|..+... +.-..-...|+.+...|.++++. T Consensus 293 ~~a~~-pp~lv~~~--g~~~---------~~~~~~~~~g~iv~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~ 360 (542) T protein:vir:78 293 AAAAK-VVFMVSPS--ATTK---------PQSLARAGTGAIIQGRAEDVSVVQANKGADFRTVQEMIRDLSQRISDAFLI 360 (542) T ss_pred HHHhc-Cceeeccc--cccc---------hhhcccCCCceeecCCccceeeeecccccchhHHHHHHHHHHHHHHHHhcc Confidence 98543 35554322 1111 11222222233443445567666532 22223467899999999999865 Q ss_pred hhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC------CCccccc----eEEecC-CC---- Q lcl|NC_021302. 318 HFLNLDGKGGSYALASVQADTFVQSVQTVADEIRDVAQAHVVEDIVDVNW------GEDEPAP----LLVFDE-IG---- 382 (484) Q Consensus 318 qtlt~~~~gGs~A~~evh~~v~~~~~~aD~~~i~~~ln~qli~~l~~~Nf------~~~~~~P----~~~~~~-~~---- 382 (484) ... -++...-..||+....+. ... .-=+-+.|+..++-|++..-| +.-.+.| +++|.. .. T Consensus 361 ~~~---~d~~rvTAtEV~~r~~E~-~~~-LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~lv~~~~~s~La~~~r 435 (542) T protein:vir:78 361 LNV---RQSERTTATEVREVQMEL-DRQ-LSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSLPKGLVMPTVVAGLGGVGR 435 (542) T ss_pred ccc---CCcccccHHHHHHHHHHH-HHH-hhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhceeeeeechHHHHHH Confidence 322 122222334554322211 111 112222333334444433222 1111122 233321 10 Q ss_pred -CcHHHHHHHHHHHHhc-Ccc-----cCCcccHHHHHHHhCCCCC---CCCcccccccCCCcCCCcc-----c-cCCCCc Q lcl|NC_021302. 383 -SRQDATAAALQMLVNA-GLL-----TPDPRLEAFLRDAAGLPGP---DPDADDDESTADTGQDEPE-----T-DEPALP 446 (484) Q Consensus 383 -~~~~~~ae~~~~L~~~-G~~-----~~~~~~~~~i~e~~glp~p---~~~e~~~~~~~~~~~~~~~-----~-~~~~~~ 446 (484) .+...+...++.+.++ |-. +......+++.+.+|+|.. ...+++....++..+...+ + ...+.. T Consensus 436 ~~~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~~q~q~~~~~~al~~~a~~~a~~ 515 (542) T protein:vir:78 436 GEDRAALIEFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQQQMTASLMGQAGQLAKS 515 (542) T ss_pred HHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Confidence 0112223333443332 210 1111123567788899843 2222211110000000000 0 000000 Q ss_pred ---cccccccccccccccccccccchH Q lcl|NC_021302. 447 ---NTSGTTSTTNAPQARKRPRGRSPR 470 (484) Q Consensus 447 ---~~~~~~~~~~~~~~~~~~~~~~~~ 470 (484) +....+....+....+.|.+.... T Consensus 516 ~~~~~~~~~~~a~~~~~~~~~~~~~~~ 542 (542) T protein:vir:78 516 PIGEKMMQQINAPGQEAPAGPQTGEDL 542 (542) T ss_pred ccccchhhhcCCCCcCCCCCCcccccC Confidence 000000000011111111111111 No 244 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=25.83 E-value=2 Score=18.92 Aligned_cols=408 Identities=12% Similarity=0.111 Sum_probs=148.8 Q ss_pred CCCCCC------CccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhC---- Q lcl|NC_021302. 1 MAPKTV------APRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRR---- 70 (484) Q Consensus 1 ~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~---- 70 (484) .-|.-. .+.+..|. -|. ..+ -|++-+-++.+.-..+.+ T Consensus 37 ~lP~~~~~~~~~~~~~~~~~-----------------------~~~-------~~~--~dstg~~a~~~LAs~l~~~ltp 84 (549) T protein:vir:10 37 LMPRLDKFGQLPRPDSEKGR-----------------------ERS-------QKM--FDSTAPLALRNFVAAMDSMITP 84 (549) T ss_pred hccccccccccCCCCCCccc-----------------------ccc-------ccc--ccchHHHHHHHHHHHHHhhccC Confidence 223210 00000000 000 001 122222222222222221 Q ss_pred --CCc-EEecCCCCHHHHHHHHHHHHhhhccchhhhhHH-HhhcCCCHHHHHHHHH-HHHhhcceeeeEEEeecCCeeee Q lcl|NC_021302. 71 --TDW-RIRPNGARPEVVEHVAACLGLPVEGDESDKPTP-RTRGRFSWDQHLRLAL-KSLQFGHAVFEQTYFYEGGRFWL 145 (484) Q Consensus 71 --~~~-~v~p~~~~~e~~~~~~~~l~~~~~~~~~~~~~~-~~~~~~~~~~~i~~~l-~a~~~G~s~~Eivw~~~~g~~~~ 145 (484) .+| ++..++.+......+++.|... + +.... ......+|..-+.+++ +-+.+|-+++=+... .+.. T Consensus 85 p~~~wF~l~~~~~~~~e~~~v~~~l~~v----e-~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~--~~~~-- 155 (549) T protein:vir:10 85 ATQLWHRLKTGNDALNEIASVKAYLQGV----V-RTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHD--VGKG-- 155 (549) T ss_pred CCCccccccCCccchhhhhHHHHHHHHH----H-HHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeec--CCCe-- Confidence 122 2333322211111222222110 0 00011 1223456777666554 556778776543322 1111 Q ss_pred eeeeeeCccceeeeeecCCCceeeeec------------cccccc----------c-cccc------------------- Q lcl|NC_021302. 146 KRLAPRPQSSIAYWNVDRDGGLISIQQ------------WPAGTF----------G-GPGM------------------- 183 (484) Q Consensus 146 ~~l~~r~~~~~~~~~~~~dg~l~~~~q------------~~~~~~----------~-~~~~------------------- 183 (484) ..+..+|-..+ .+.-|..|++..+-. ++.... + .... T Consensus 156 ~~f~~~pl~~~-~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~~V~pr~~~~~~~~~~ 234 (549) T protein:vir:10 156 IVYRNVPMQRL-WFAENNSGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHAVEPRADRDPRKLDG 234 (549) T ss_pred eEEEEEEcCeE-EEeeCCCCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEEEeecCCCCCcccccc Confidence 01222222222 123333343322110 000000 0 0000 Q ss_pred ------eeccCCCCcccc-----cccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEec Q lcl|NC_021302. 184 ------VVMAPNSMGPAI-----PVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGN 252 (484) Q Consensus 184 ------~~~~~~~~~~~l-----p~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk 252 (484) ..+...+....+ ...-|++.|+....|+.||.|....+..-..--+...+.-+..+++-.. .|+.+-. T Consensus 235 ~~~pf~sv~~e~~~~~il~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~-p~~~v~~ 313 (549) T protein:vir:10 235 RNMQFASYWLDEGRDRIVQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVD-PPLLANE 313 (549) T ss_pred ccCceEEEEEEecCCEeeccCCcccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhc-Cceeecc Confidence 000111111111 2356899999999999999999999999998888888999999998644 2555422 Q ss_pred CCCCCCHHHHHHHHHHHHHHhcCCceEEEc-cCC-ceEEEecccCCchhHHHHHHHHHHHHHHHHhhhhhcccccccchh Q lcl|NC_021302. 253 EADSEDDDRMDELLEIASNYSGGESAGLAL-TAG-EEAGILSPNGTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYA 330 (484) Q Consensus 253 ~~~~~~~~~~~~l~~~l~~~~~g~~a~~vi-p~~-~~ie~~~~~~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A 330 (484) . +.... .++..|...+++. +.+ ..+.-+..++.-..-...|+.+...|..+++...+....++...- T Consensus 314 ~--g~~~~---------~~l~pgg~~~~~~~~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~~T 382 (549) T protein:vir:10 314 D--GVLDG---------FDLRSGALNWGGLNDKGEEMVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQILVDSGDMT 382 (549) T ss_pred c--ccccc---------ceeccCCccccccCCCCccceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCcc Confidence 2 21111 1111222222222 222 223333333333345677999999999999876543322333344 Q ss_pred hHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhCC------CCccccc----------eEEecC-CC-----CcHH Q lcl|NC_021302. 331 LASVQADTFV--QSVQTVADEIRDVAQAHVVEDIVDVNW------GEDEPAP----------LLVFDE-IG-----SRQD 386 (484) Q Consensus 331 ~~evh~~v~~--~~~~aD~~~i~~~ln~qli~~l~~~Nf------~~~~~~P----------~~~~~~-~~-----~~~~ 386 (484) ..||+....+ ..+-.. -+.|...++-||+..-| +.-.++| ++++.. .. .+.. T Consensus 383 AtEV~~r~~E~~~~LGpv----~~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~~~~~~~i~yis~La~aq~~~~~~ 458 (549) T protein:vir:10 383 ATEVLQRAQEKGVLLAPT----LGRTQSELLGPMIAREVDILAEAGQLPDMPQELIDAGADVDVEYDSPLNKAMRAGEGA 458 (549) T ss_pred HHHHHHHHHHHHHHhhHH----HHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhcCCceeEEEeecHHHHHHHHHHHH Confidence 4555543222 222222 22222334444433211 1101111 122211 00 0111 Q ss_pred H---HHHHHHHHHhcCc----ccCCcccHHHHHHHhCCCCC--CCCcccccccC--CCcCCCccccCCCCcccccccccc Q lcl|NC_021302. 387 A---TAAALQMLVNAGL----LTPDPRLEAFLRDAAGLPGP--DPDADDDESTA--DTGQDEPETDEPALPNTSGTTSTT 455 (484) Q Consensus 387 ~---~ae~~~~L~~~G~----~~~~~~~~~~i~e~~glp~p--~~~e~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~ 455 (484) . +.+.+..|.++|= .+......+++.+.+|+|.. ..++++..--. ...+..++. .. T Consensus 459 ~i~~~~~~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~~~~~qqq~~~~-------------~~ 525 (549) T protein:vir:10 459 AILQWLQQLGIVSQFDPAAAKVPNGARIARLLADYGGVPVEAMSTDEELQAQQAAEAQAAQMQQM-------------LA 525 (549) T ss_pred HHHHHHHHHHHHhccChhHHhcCCHHHHHHHHHHhcCCCccccCCHHHHHHHHHHHHHHHHHHHH-------------HH Confidence 1 1122233333331 11112234667888999853 12222111000 000000000 00 Q ss_pred ccccccccccccchHHHhcCcccCccc Q lcl|NC_021302. 456 NAPQARKRPRGRSPRDRRKTPDGAMPL 482 (484) Q Consensus 456 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 482 (484) +++. ....+.+.+.++-+ .....+ T Consensus 526 ~a~~--a~~~a~~~~~~~ta-~~~~~~ 549 (549) T protein:vir:10 526 AAPV--AAGAIKDLSDAQTA-AQTARV 549 (549) T ss_pred HHHH--HHHHHHhhhhhcCC-CcccCC Confidence 0000 00011111111111 111111 No 245 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=21.86 E-value=2.5 Score=18.37 Aligned_cols=439 Identities=10% Similarity=0.040 Sum_probs=146.7 Q ss_pred CCCCC--CCccceeeeecccccchhhhhhhcccccccccccccchHHHHHHHHhcchHHHHHHHHHHHHhhC------CC Q lcl|NC_021302. 1 MAPKT--VAPRTERGYVNPLAGFGTFLAQGLDQFEQVDELRWPNSVYTYTRMCREEARIASVLRAIGLPIRR------TD 72 (484) Q Consensus 1 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~y~~m~~~D~~v~s~l~~r~~~v~~------~~ 72 (484) |==++ ..-++.+...-.. ....+-.............+. +-.-+..+ -|++-+-++.+....+.+ .+ T Consensus 1 m~~~~r~~~L~~~R~~~e~~-w~e~~~~tlP~~~~~~~~~~~--~~~~~~~~--~dstg~~a~~~LAa~l~~~ltpp~~~ 75 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDK-AVECSELTLPYLIDDDISSRP--NHKSLTVP--WQSVGAKCCVTLAAKLMLAVLPPQTS 75 (522) T ss_pred CchHHHHHHHHHHhhHHHHH-HHHHHHHhhhcccCCCCCCCc--cccccccc--ccchHHHHHHHHHHHHHHhhcCCCCc Confidence 00000 0000000000000 000000000000000000000 00000011 123333333333332221 12 Q ss_pred c-EEecCCC------CHHHHHHHHHHHHhhhccchhhhhHHHhhcCCCHHHHHHHHH-HHHhhcceee------------ Q lcl|NC_021302. 73 W-RIRPNGA------RPEVVEHVAACLGLPVEGDESDKPTPRTRGRFSWDQHLRLAL-KSLQFGHAVF------------ 132 (484) Q Consensus 73 ~-~v~p~~~------~~e~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l-~a~~~G~s~~------------ 132 (484) | ++.+.+. ++++...+.+.|. ..+......+...+|..-+.+++ +-+.+|.+++ T Consensus 76 WF~l~~~d~~l~~~~~~~~~~~v~~~l~------~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~~pl 149 (522) T protein:vir:10 76 FFKLQVRDDKLGEELDPQIRSELDLSFS------KMERMIMDYIAASNDRVAVHQALKHLIVGGNALIFMGKDGLKTFPL 149 (522) T ss_pred cccccCChHHHhhhcChhhHHHHHHHHH------HHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEEEcCCCceEEEc Confidence 3 2222211 1111111222221 11122223334456766555554 5566776553 Q ss_pred -eEEEeec-CCeee--eeeeeee----Cc--------ccee-eeeecCCCceeeeecccccccccccceeccCCCCcc-- Q lcl|NC_021302. 133 -EQTYFYE-GGRFW--LKRLAPR----PQ--------SSIA-YWNVDRDGGLISIQQWPAGTFGGPGMVVMAPNSMGP-- 193 (484) Q Consensus 133 -Eivw~~~-~g~~~--~~~l~~r----~~--------~~~~-~~~~~~dg~l~~~~q~~~~~~~~~~~~~~~~~~~~~-- 193 (484) +.++..+ .|++. ..++..- +. .... ..+.+++-.++.... +....+ +.. +.....+. T Consensus 150 ~~y~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~-p~~~~~--~~~-~~~~~~~~~~ 225 (522) T protein:vir:10 150 TRYVINRDGDGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVK-LDKSSG--RWV-WHQEAFDKII 225 (522) T ss_pred ceEEEeeCCCCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEE-eeccCC--ceE-EEEccCCccc Confidence 3333322 12111 0000000 00 0000 000000001111100 000000 011 11111111 Q ss_pred -------cccccceEEEeecCccCccccchhHHHHHHHHHHHHHHHHHHHHHHHHhcCCcceEEecCCCCCCHHHHHHHH Q lcl|NC_021302. 194 -------AIPVEQLVVYTHDMDPGVWTGNSLLRPAYKNWKLKDELIRIEAAAIRRHGIGVPYLKGNEADSEDDDRMDELL 266 (484) Q Consensus 194 -------~lp~~k~l~~~~~~~~~~p~G~gll~~~~~~~~~K~~~~~~w~~f~Er~~~G~P~~~gk~~~~~~~~~~~~l~ 266 (484) +....-|++.|+...+|+.||.|....+..-..--+...+.-+..+++-. -.|+++.+.+.... T Consensus 226 ~~~~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~-~p~~lv~~~~~~~~-------- 296 (522) T protein:vir:10 226 PDSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAAS-KVVFLVSPSSTTKP-------- 296 (522) T ss_pred cccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhc-CCceeecccccccc-------- Confidence 22233689999999999999999999999999888888899999999853 44666533221111 Q ss_pred HHHHHHhcCCceEEEccCCceEEEeccc--CCchhHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHHHHH--HHH Q lcl|NC_021302. 267 EIASNYSGGESAGLALTAGEEAGILSPN--GTPLDPRRAIEYHDHQMALVALAHFLNLDGKGGSYALASVQADTF--VQS 342 (484) Q Consensus 267 ~~l~~~~~g~~a~~vip~~~~ie~~~~~--~~~~~~~~li~~~d~~Isk~ilGqtlt~~~~gGs~A~~evh~~v~--~~~ 342 (484) .++..+...+++-....++..++.. +.-......|+.+.+.|..+++ +++.-+++.--..||+.... ... T Consensus 297 ---~~l~~~~~~~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl---~~~~~d~~rvTAtEV~~r~~E~~~~ 370 (522) T protein:vir:10 297 ---ATIAKAGNGAIVQGRPEDVAVIQVGKTADFSTAANMATAIEKRLLEAFL---VMNVRNAERVTAEEVRLTQLELEQQ 370 (522) T ss_pred ---ccccCCCCcceecCCCccceeecccccccchHHHHHHHHHHHHHHHHHh---hccCCCCCCCCHHHHHHHHHHHHHH Confidence 1122222334444444556666543 2223357789999999999875 22222334334455553322 222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCC------Cccccce-E------EecC-C--CCcHHHHHHHHHHHHh-cCcc---- Q lcl|NC_021302. 343 VQTVADEIRDVAQAHVVEDIVDVNWG------EDEPAPL-L------VFDE-I--GSRQDATAAALQMLVN-AGLL---- 401 (484) Q Consensus 343 ~~aD~~~i~~~ln~qli~~l~~~Nf~------~~~~~P~-~------~~~~-~--~~~~~~~ae~~~~L~~-~G~~---- 401 (484) +-.-... |+..++-|++..-|. --.+.|. + ++-. . ..+.+.+..+++.|.. +|-. T Consensus 371 LGpv~~r----l~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~~v~~is~Laraq~~~~l~~~~~~i~~~~~p~~~~~ 446 (522) T protein:vir:10 371 LGGIFSL----LVIEFLIPYLNRTLLVLQRSNQIPKLPKDIVRPTIVAGVNALGRGQDRESLTAFVGTIAQTLGPEALMQ 446 (522) T ss_pred hhHHHHH----HHHHHHHHHHHHHHHHHHhcCCCCCCCccccccccccchhHHHHHHHHHHHHHHHHHHHHhhCchhhhh Confidence 2222222 222333333322211 0011121 1 1100 0 0122233344444432 2211 Q ss_pred -cCCcccHHHHHHHhCCCCCC---CCcccccccCCCcCCCccccCCCCccccccccccccccccccccccchHHHhcCcc Q lcl|NC_021302. 402 -TPDPRLEAFLRDAAGLPGPD---PDADDDESTADTGQDEPETDEPALPNTSGTTSTTNAPQARKRPRGRSPRDRRKTPD 477 (484) Q Consensus 402 -~~~~~~~~~i~e~~glp~p~---~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (484) +......+++.+.+|+|.+. ..+++........+..... ......+.-..+......+++ +..++-. +. T Consensus 447 ~id~d~~~~~~a~~~Gvp~~~ivrt~eev~~~~q~~q~~~~~~---~~~~~a~~~~~~~~~~~~~~~---~~~~~~~-~~ 519 (522) T protein:vir:10 447 YLNPLEAIKRLAAAQGIDVLNLVKTEQQLAEEQQAAQQQAAQQ---SLVDQAGQMTGSPLMDPTKNP---QLMDEEQ-PP 519 (522) T ss_pred cCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHH---HHHHHHHHHhcccccCccccH---HHHHHhC-CC Confidence 11111245678888998431 2211111000000000000 000000000000000000111 1111112 22 Q ss_pred cCc Q lcl|NC_021302. 478 GAM 480 (484) Q Consensus 478 ~~~ 480 (484) ++- T Consensus 520 ~~~ 522 (522) T protein:vir:10 520 MEE 522 (522) T ss_pred CCC Confidence 222 Done!