Query lcl|NC_021305.1_cdsid_YP_008051489.1 [gene=10] [protein=portal protein] [protein_id=YP_008051489.1] [location=4072..5628]
Match_columns 518
No_of_seqs 192 out of 1115
Neff 9.3
Searched_HMMs 1612
Date Thu Nov 7 17:40:56 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_10 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_10_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:7853 Length: 518 # 100.0 2E-141 1E-144 791.6 50.2 518 1-518 1-518 (518)
2 protein:vir:101648 Length: 518 100.0 8E-141 5E-144 788.6 50.1 518 1-518 1-518 (518)
3 protein:vir:93610 Length: 454 100.0 5E-100 3E-103 565.1 45.4 448 2-462 1-454 (454)
4 protein:vir:102118 Length: 409 100.0 4.7E-97 3E-100 548.7 43.4 402 1-417 1-409 (409)
5 protein:vir:6240 Length: 457 # 100.0 1.3E-96 8E-100 546.3 44.9 443 1-468 6-457 (457)
6 protein:vir:1266 Length: 416 # 100.0 7E-96 4.3E-99 542.3 44.5 410 1-436 1-416 (416)
7 protein:vir:105064 Length: 421 100.0 2.2E-95 1.4E-98 539.5 43.9 414 1-444 1-421 (421)
8 protein:vir:1326 Length: 457 # 100.0 3E-95 1.9E-98 538.8 44.3 442 1-448 6-457 (457)
9 protein:vir:4337 Length: 434 # 100.0 1.3E-95 7.8E-99 540.8 41.8 415 1-429 4-434 (434)
10 protein:vir:1431 Length: 419 # 100.0 2.6E-95 1.6E-98 539.1 43.4 412 1-434 1-419 (419)
11 protein:vir:105002 Length: 432 100.0 4.7E-95 2.9E-98 537.7 44.7 418 1-430 9-432 (432)
12 protein:vir:102855 Length: 432 100.0 4.7E-95 2.9E-98 537.7 44.7 418 1-430 9-432 (432)
13 protein:vir:107605 Length: 432 100.0 4.7E-95 2.9E-98 537.7 44.7 418 1-430 9-432 (432)
14 protein:vir:102080 Length: 429 100.0 5.5E-95 3.4E-98 537.4 44.8 418 1-430 6-429 (429)
15 protein:vir:483 Length: 413 # 100.0 4.9E-95 3E-98 537.6 44.5 411 1-423 1-413 (413)
16 protein:vir:100249 Length: 431 100.0 2.1E-95 1.3E-98 539.6 42.4 406 1-424 2-431 (431)
17 protein:vir:4509 Length: 424 # 100.0 8.5E-95 5.3E-98 536.3 43.1 401 1-431 17-424 (424)
18 protein:vir:4454 Length: 414 # 100.0 2.5E-94 1.6E-97 533.7 44.3 409 1-434 2-414 (414) 19 protein:vir:1884 Length: 424 # 100.0 3.8E-94 2.3E-97 532.8 43.4 395 1-416 22-424 (424) 20 protein:vir:1380 Length: 422 # 100.0 3.2E-94 2E-97 533.2 41.5 409 1-420 6-422 (422) 21 protein:vir:5737 Length: 419 # 100.0 6.3E-94 3.9E-97 531.5 42.8 411 1-448 1-419 (419) 22 protein:vir:81152 Length: 411 100.0 7E-94 4.3E-97 531.3 41.8 400 1-415 6-411 (411) 23 protein:vir:189 Length: 424 # 100.0 1.7E-93 1.1E-96 529.2 42.8 396 1-423 22-424 (424) 24 protein:vir:80333 Length: 419 100.0 1.1E-93 6.7E-97 530.3 41.5 412 1-461 1-419 (419) 25 protein:vir:100150 Length: 437 100.0 2.5E-93 1.6E-96 528.2 42.5 422 1-442 1-437 (437) 26 protein:vir:10362 Length: 432 100.0 3.4E-93 2.1E-96 527.5 42.7 416 1-435 8-432 (432) 27 protein:vir:97060 Length: 432 100.0 5E-93 3.1E-96 526.6 42.8 416 1-435 8-432 (432) 28 protein:vir:81072 Length: 432 100.0 1.1E-92 6.6E-96 524.8 42.9 416 1-437 8-432 (432) 29 protein:vir:98396 Length: 441 100.0 1.3E-91 7.9E-95 518.9 44.1 412 1-431 23-441 (441) 30 protein:vir:9408 Length: 441 # 100.0 1.6E-91 9.8E-95 518.4 43.1 412 1-431 23-441 (441) 31 protein:vir:79984 Length: 441 100.0 1.6E-91 9.8E-95 518.4 43.1 412 1-431 23-441 (441) 32 protein:vir:81095 Length: 416 100.0 1.3E-90 8.1E-94 513.4 44.1 413 1-431 2-416 (416) 33 protein:vir:4598 Length: 416 # 100.0 1.3E-90 8.1E-94 513.4 44.1 413 1-431 2-416 (416) 34 protein:vir:2683 Length: 412 # 100.0 9E-91 5.6E-94 514.2 42.6 408 1-430 1-412 (412) 35 protein:vir:81218 Length: 423 100.0 1E-90 6.4E-94 513.9 42.5 408 1-424 1-423 (423) 36 protein:vir:8418 Length: 409 # 100.0 2.1E-90 1.3E-93 512.2 43.3 403 1-428 2-409 (409) 37 protein:vir:93943 Length: 409 100.0 1.8E-90 1.1E-93 512.6 42.9 402 1-422 5-409 (409) 38 protein:vir:94426 Length: 409 100.0 2.3E-90 1.5E-93 512.0 42.8 402 1-422 5-409 (409) 39 protein:vir:96980 Length: 409 100.0 2.7E-90 1.7E-93 511.6 42.4 402 1-422 5-409 (409) 40 protein:vir:3868 Length: 417 # 100.0 1.1E-89 6.9E-93 
508.3 42.7 415 1-445 1-417 (417) 41 protein:vir:102727 Length: 945 100.0 2.9E-88 1.8E-91 500.5 48.9 505 1-518 63-630 (945) 42 protein:vir:94666 Length: 723 100.0 2.5E-88 1.5E-91 500.9 45.2 486 17-518 1-543 (723) 43 protein:vir:101647 Length: 460 100.0 8E-89 5E-92 503.5 41.7 410 2-435 1-460 (460) 44 protein:vir:9702 Length: 406 # 100.0 2.5E-88 1.5E-91 500.9 43.0 404 1-432 1-406 (406) 45 protein:vir:8100 Length: 466 # 100.0 5.7E-86 3.5E-89 487.9 42.2 419 1-432 6-466 (466) 46 protein:vir:960 Length: 413 # 100.0 1.4E-85 8.4E-89 485.8 41.0 395 1-424 14-413 (413) 47 protein:vir:8317 Length: 409 # 100.0 8E-86 5E-89 487.1 38.6 374 1-400 6-409 (409) 48 protein:vir:80796 Length: 574 100.0 2.3E-84 1.4E-87 479.1 42.2 484 1-489 40-574 (574) 49 protein:vir:3843 Length: 397 # 100.0 1.3E-83 8E-87 475.0 41.5 396 1-439 1-397 (397) 50 protein:vir:95378 Length: 406 100.0 2.7E-83 1.6E-86 473.3 42.4 399 1-437 2-406 (406) 51 protein:vir:104259 Length: 403 100.0 3.6E-83 2.2E-86 472.6 40.6 385 1-431 1-403 (403) 52 protein:vir:80134 Length: 403 100.0 2E-82 1.3E-85 468.4 40.6 394 1-433 3-403 (403) 53 protein:vir:80644 Length: 551 100.0 1.9E-82 1.2E-85 468.6 38.5 466 1-493 39-551 (551) 54 protein:vir:96579 Length: 576 100.0 7.2E-82 4.5E-85 465.4 39.4 484 1-494 33-576 (576) 55 protein:vir:9359 Length: 348 # 100.0 6.8E-82 4.2E-85 465.6 39.2 346 63-422 1-348 (348) 56 protein:vir:63755 Length: 547 100.0 1.8E-80 1.1E-83 457.8 39.2 471 1-493 30-547 (547) 57 protein:vir:6210 Length: 394 # 100.0 2.4E-80 1.5E-83 457.0 38.6 386 1-432 2-394 (394) 58 protein:vir:7407 Length: 392 # 100.0 3.2E-80 2E-83 456.4 38.2 388 1-416 1-392 (392) 59 protein:vir:1023 Length: 392 # 100.0 4.4E-79 2.7E-82 450.2 38.3 385 1-434 1-392 (392) 60 protein:vir:3989 Length: 392 # 100.0 4.4E-79 2.7E-82 450.2 38.3 385 1-434 1-392 (392) 61 protein:vir:4854 Length: 386 # 100.0 1.2E-78 7.7E-82 447.7 40.2 383 1-428 2-386 (386) 62 protein:vir:100187 Length: 385 100.0 1.4E-78 8.8E-82 447.4 40.1 375 1-408 3-385 (385) 63 protein:vir:95599 
Length: 563 100.0 1.4E-78 8.7E-82 447.4 39.5 467 1-493 41-563 (563) 64 protein:vir:99312 Length: 563 100.0 1.4E-78 8.7E-82 447.4 39.5 467 1-493 41-563 (563) 65 protein:vir:100691 Length: 535 100.0 1.1E-77 6.5E-81 442.6 39.8 444 1-477 53-535 (535) 66 protein:vir:4194 Length: 540 # 100.0 9.2E-77 5.7E-80 437.4 43.2 481 2-518 1-529 (540) 67 protein:vir:3153 Length: 467 # 100.0 5.7E-77 3.5E-80 438.6 39.0 399 42-444 1-467 (467) 68 protein:vir:9507 Length: 395 # 100.0 1.2E-76 7.2E-80 436.9 39.5 387 1-434 2-395 (395) 69 protein:vir:101289 Length: 395 100.0 1.2E-76 7.2E-80 436.9 39.5 387 1-434 2-395 (395) 70 protein:vir:100650 Length: 395 100.0 1.2E-76 7.2E-80 436.9 39.5 387 1-434 2-395 (395) 71 protein:vir:95965 Length: 385 100.0 5.1E-77 3.2E-80 438.9 36.3 370 1-413 6-385 (385) 72 protein:vir:100882 Length: 383 100.0 6.6E-76 4.1E-79 432.7 39.7 372 1-431 2-383 (383) 73 protein:vir:4995 Length: 384 # 100.0 1.5E-76 9.4E-80 436.3 35.4 375 1-394 2-384 (384) 74 protein:vir:4952 Length: 386 # 100.0 4.3E-75 2.6E-78 428.3 39.7 385 1-430 2-386 (386) 75 protein:vir:4828 Length: 382 # 100.0 7.1E-75 4.4E-78 427.1 39.2 378 1-433 2-382 (382) 76 protein:vir:4089 Length: 395 # 100.0 4.2E-75 2.6E-78 428.3 37.2 386 1-431 1-395 (395) 77 protein:vir:9641 Length: 395 # 100.0 5.9E-75 3.6E-78 427.5 35.4 379 1-415 5-395 (395) 78 protein:vir:78310 Length: 376 100.0 7E-75 4.3E-78 427.1 35.2 369 1-409 2-376 (376) 79 protein:vir:1082 Length: 359 # 100.0 4.2E-74 2.6E-77 422.9 37.2 350 1-386 2-359 (359) 80 protein:vir:4156 Length: 542 # 100.0 3E-73 1.8E-76 418.2 41.9 486 2-517 1-542 (542) 81 protein:vir:98643 Length: 395 100.0 3.8E-73 2.4E-76 417.6 38.2 384 1-415 2-395 (395) 82 protein:vir:94002 Length: 378 100.0 3E-73 1.8E-76 418.2 35.9 359 1-428 2-378 (378) 83 protein:vir:93867 Length: 378 100.0 5.8E-73 3.6E-76 416.6 36.2 359 1-428 1-378 (378) 84 protein:vir:1661 Length: 378 # 100.0 9.2E-73 5.7E-76 415.5 36.5 359 1-428 1-378 (378) 85 protein:vir:79772 Length: 648 100.0 1.8E-70 1.1E-73 403.0 45.5 504 
1-518 42-615 (648) 86 protein:vir:858 Length: 378 # 100.0 4.7E-71 2.9E-74 406.2 35.7 359 1-428 2-378 (378) 87 protein:vir:94869 Length: 378 100.0 1.3E-70 7.8E-74 403.8 35.2 359 1-428 2-378 (378) 88 protein:vir:99452 Length: 651 100.0 1.2E-70 7.2E-74 404.0 33.6 482 1-518 12-619 (651) 89 protein:vir:78641 Length: 278 100.0 7.7E-63 4.8E-66 361.1 31.9 276 63-350 1-278 (278) 90 protein:vir:103971 Length: 376 100.0 3.7E-58 2.3E-61 335.5 31.2 315 1-357 54-376 (376) 91 protein:vir:79150 Length: 368 100.0 1.2E-58 7.6E-62 338.1 26.6 339 1-368 1-368 (368) 92 protein:vir:100328 Length: 346 100.0 5.6E-57 3.5E-60 329.0 31.9 317 1-355 22-346 (346) 93 protein:vir:79207 Length: 351 100.0 3.5E-57 2.2E-60 330.1 30.5 315 1-357 29-351 (351) 94 protein:vir:78191 Length: 351 100.0 4.9E-57 3.1E-60 329.3 30.7 315 1-357 29-351 (351) 95 protein:vir:267 Length: 348 # 100.0 7E-57 4.4E-60 328.4 31.1 320 1-361 18-348 (348) 96 protein:vir:98567 Length: 340 100.0 8.2E-56 5.1E-59 322.6 30.2 311 2-354 1-340 (340) 97 protein:vir:1150 Length: 350 # 100.0 9.8E-56 6.1E-59 322.2 30.0 311 1-350 32-350 (350) 98 protein:vir:3780 Length: 345 # 100.0 2.2E-55 1.3E-58 320.3 29.9 316 1-352 21-345 (345) 99 protein:vir:5691 Length: 344 # 100.0 3E-55 1.8E-58 319.5 29.3 312 1-355 24-344 (344) 100 protein:vir:6058 Length: 344 # 100.0 5.1E-55 3.2E-58 318.2 30.2 312 2-355 1-344 (344) 101 protein:vir:3743 Length: 345 # 100.0 8.4E-55 5.2E-58 317.1 31.3 316 1-352 21-345 (345) 102 protein:vir:2013 Length: 344 # 100.0 1.1E-54 6.8E-58 316.4 28.6 312 1-355 24-344 (344) 103 protein:vir:78749 Length: 337 100.0 3.6E-53 2.3E-56 308.1 30.4 321 2-351 1-337 (337) 104 protein:vir:4698 Length: 251 # 100.0 5.8E-50 3.6E-53 290.5 26.8 248 1-261 1-251 (251) 105 protein:vir:98853 Length: 219 100.0 8.5E-44 5.3E-47 256.7 21.7 210 139-354 1-219 (219) 106 protein:vir:5249 Length: 437 # 100.0 9.2E-28 5.7E-31 168.8 33.6 396 1-441 1-437 (437) 107 protein:vir:107742 Length: 537 99.9 9.4E-27 5.9E-30 163.3 31.4 419 1-446 35-537 (537) 108 
protein:vir:94049 Length: 532 99.9 3.1E-26 1.9E-29 160.4 31.0 439 1-467 33-532 (532) 109 protein:vir:99563 Length: 862 99.9 3.4E-24 2.1E-27 149.3 29.7 492 1-518 76-669 (862) 110 protein:vir:108215 Length: 469 99.9 3.2E-22 2E-25 138.4 38.4 422 3-456 1-469 (469) 111 protein:vir:80040 Length: 461 99.9 2.2E-23 1.4E-26 144.8 27.8 397 2-436 1-461 (461) 112 protein:vir:389 Length: 530 # 99.9 9.6E-24 6E-27 146.8 25.5 430 1-436 5-530 (530) 113 protein:vir:99232 Length: 526 99.9 4.8E-21 3E-24 132.0 39.8 472 1-518 3-526 (526) 114 protein:vir:3420 Length: 533 # 99.9 7E-24 4.4E-27 147.5 23.7 424 6-436 1-533 (533) 115 protein:vir:103860 Length: 528 99.9 5.2E-21 3.2E-24 131.8 39.3 475 1-518 3-528 (528) 116 protein:vir:79233 Length: 526 99.9 9E-21 5.6E-24 130.5 39.8 472 1-518 3-526 (526) 117 protein:vir:79538 Length: 502 99.9 9.2E-24 5.7E-27 146.9 23.4 420 1-442 24-502 (502) 118 protein:vir:104338 Length: 422 99.9 2E-22 1.2E-25 139.6 29.8 376 14-428 1-422 (422) 119 protein:vir:6382 Length: 553 # 99.9 2E-23 1.3E-26 145.0 24.2 430 1-441 1-553 (553) 120 protein:vir:79647 Length: 435 99.9 3.2E-22 2E-25 138.4 30.2 385 1-431 2-435 (435) 121 protein:vir:96738 Length: 505 99.9 1.7E-23 1.1E-26 145.4 22.6 419 3-436 1-505 (505) 122 protein:vir:96068 Length: 765 99.9 8.9E-23 5.5E-26 141.5 26.0 495 1-518 55-631 (765) 123 protein:vir:99853 Length: 488 99.9 1.3E-19 8.2E-23 124.1 39.6 447 7-518 1-484 (488) 124 protein:vir:107662 Length: 427 99.8 8.9E-21 5.5E-24 130.5 29.1 377 11-434 1-427 (427) 125 protein:vir:10321 Length: 495 99.8 3.5E-21 2.2E-24 132.8 24.6 418 1-431 1-495 (495) 126 protein:vir:79063 Length: 491 99.8 1E-18 6.4E-22 119.2 37.4 456 1-518 5-491 (491) 127 protein:vir:107880 Length: 491 99.8 1.6E-18 1E-21 118.1 37.4 456 1-518 5-491 (491) 128 protein:vir:95542 Length: 548 99.8 2.1E-20 1.3E-23 128.4 23.8 458 1-468 24-548 (548) 129 protein:vir:77981 Length: 448 99.8 1.8E-18 1.1E-21 117.9 33.0 409 1-460 1-448 (448) 130 protein:vir:1986 Length: 512 # 99.8 4.8E-17 3E-20 110.0 40.1 461 
1-516 3-512 (512) 131 protein:vir:79511 Length: 448 99.8 2.4E-18 1.5E-21 117.1 32.4 410 1-447 1-448 (448) 132 protein:vir:98816 Length: 446 99.8 1.5E-18 9.5E-22 118.2 28.4 376 1-389 3-446 (446) 133 protein:vir:95254 Length: 488 99.7 2.7E-16 1.7E-19 105.9 34.3 419 1-462 1-488 (488) 134 protein:vir:105782 Length: 449 99.6 1.4E-15 8.6E-19 102.0 26.6 375 1-436 28-449 (449) 135 protein:vir:106716 Length: 698 99.6 5.1E-16 3.2E-19 104.4 22.3 493 1-518 71-642 (698) 136 protein:vir:78589 Length: 695 99.6 2.5E-15 1.6E-18 100.6 22.6 485 1-518 71-634 (695) 137 protein:vir:78161 Length: 355 99.6 1.5E-14 9.3E-18 96.4 26.3 325 117-468 1-355 (355) 138 protein:vir:101541 Length: 694 99.6 4.6E-15 2.9E-18 99.2 23.0 456 1-518 70-595 (694) 139 protein:vir:3648 Length: 695 # 99.5 7.3E-15 4.5E-18 98.1 22.8 455 1-518 71-596 (695) 140 protein:vir:106491 Length: 646 99.3 1.4E-11 8.7E-15 80.1 28.3 500 1-518 2-601 (646) 141 protein:vir:102426 Length: 631 99.3 8.4E-12 5.2E-15 81.3 24.0 489 1-518 1-607 (631) 142 protein:vir:99916 Length: 504 99.2 4.5E-10 2.8E-13 71.8 28.7 425 1-442 9-504 (504) 143 protein:vir:5839 Length: 533 # 99.1 5.5E-11 3.4E-14 76.8 21.7 433 1-468 22-533 (533) 144 protein:vir:5961 Length: 503 # 99.1 6.7E-10 4.1E-13 70.9 27.1 414 1-449 14-503 (503) 145 protein:vir:7768 Length: 484 # 99.1 1.6E-09 1E-12 68.8 28.7 403 7-442 1-484 (484) 146 protein:vir:8654 Length: 629 # 99.1 2E-10 1.3E-13 73.7 23.3 491 1-518 1-591 (629) 147 protein:vir:106027 Length: 629 99.1 4.8E-10 3E-13 71.7 24.7 491 1-518 1-601 (629) 148 protein:vir:2427 Length: 485 # 99.1 1.4E-09 8.4E-13 69.2 26.9 404 1-446 20-485 (485) 149 protein:vir:98444 Length: 434 99.1 1.2E-09 7.4E-13 69.5 25.5 371 16-436 1-434 (434) 150 protein:vir:99088 Length: 629 99.0 1.3E-10 8E-14 74.8 19.6 492 1-518 1-605 (629) 151 protein:vir:104082 Length: 485 99.0 1.6E-09 9.9E-13 68.8 24.0 409 2-449 1-485 (485) 152 protein:vir:4223 Length: 486 # 98.9 3.3E-09 2.1E-12 67.1 23.9 411 1-440 6-486 (486) 153 protein:vir:107517 Length: 639 98.9 
3.1E-09 2E-12 67.2 23.4 491 1-518 1-596 (639) 154 protein:vir:97900 Length: 639 98.9 3.1E-09 2E-12 67.2 23.4 491 1-518 1-596 (639) 155 protein:vir:2341 Length: 488 # 98.9 3.9E-09 2.4E-12 66.7 23.8 406 3-436 1-488 (488) 156 protein:vir:38 Length: 496 # N 98.9 9.7E-09 6E-12 64.5 24.8 392 1-430 15-496 (496) 157 protein:vir:4898 Length: 502 # 98.9 1.9E-08 1.2E-11 62.9 28.7 416 1-450 44-502 (502) 158 protein:vir:99072 Length: 479 98.9 1.4E-08 8.9E-12 63.6 25.3 408 1-446 21-479 (479) 159 protein:vir:94742 Length: 409 98.8 2.9E-08 1.8E-11 61.9 25.6 347 1-386 4-409 (409) 160 protein:vir:7987 Length: 456 # 98.8 1.8E-08 1.1E-11 63.1 24.0 387 1-433 12-456 (456) 161 protein:vir:96494 Length: 501 98.8 6.2E-08 3.9E-11 60.1 29.3 415 1-450 37-501 (501) 162 protein:vir:1634 Length: 409 # 98.7 5.2E-08 3.2E-11 60.5 24.2 350 1-386 4-409 (409) 163 protein:vir:8184 Length: 474 # 98.7 7.3E-08 4.5E-11 59.7 25.9 402 1-419 1-474 (474) 164 protein:vir:105819 Length: 456 98.7 7.8E-08 4.8E-11 59.6 27.8 387 1-431 12-456 (456) 165 protein:vir:102602 Length: 456 98.7 7.8E-08 4.8E-11 59.6 27.8 387 1-431 12-456 (456) 166 protein:vir:80959 Length: 499 98.7 8.1E-08 5E-11 59.5 26.2 393 1-433 15-499 (499) 167 protein:vir:103219 Length: 201 98.7 1.1E-09 6.8E-13 69.7 14.2 181 220-428 1-201 (201) 168 protein:vir:105889 Length: 474 98.6 1.5E-07 9E-11 58.1 29.7 391 1-439 40-474 (474) 169 protein:vir:94101 Length: 474 98.6 1.5E-07 9E-11 58.1 29.7 391 1-439 40-474 (474) 170 protein:vir:2500 Length: 501 # 98.6 1.6E-07 9.8E-11 57.9 25.5 397 1-440 34-501 (501) 171 protein:vir:9306 Length: 511 # 98.6 1.6E-07 1E-10 57.8 28.6 410 1-453 48-511 (511) 172 protein:vir:2732 Length: 501 # 98.6 1.8E-07 1.1E-10 57.5 28.9 412 1-441 43-501 (501) 173 protein:vir:9751 Length: 422 # 98.6 2.1E-07 1.3E-10 57.1 23.9 362 1-412 4-422 (422) 174 protein:vir:1236 Length: 483 # 98.6 2.5E-07 1.5E-10 56.8 29.8 381 1-439 58-483 (483) 175 protein:vir:95806 Length: 440 98.6 2.6E-07 1.6E-10 56.7 23.4 393 1-432 1-440 (440) 176 
protein:vir:93747 Length: 472 98.6 3E-07 1.8E-10 56.4 30.2 389 1-439 23-472 (472) 177 protein:vir:97171 Length: 512 98.5 3.7E-07 2.3E-10 55.9 28.5 413 1-453 48-512 (512) 178 protein:vir:103951 Length: 511 98.5 3.9E-07 2.4E-10 55.7 28.9 413 1-453 48-511 (511) 179 protein:vir:96240 Length: 511 98.5 4.2E-07 2.6E-10 55.6 29.5 409 1-453 48-511 (511) 180 protein:vir:3964 Length: 453 # 98.5 4.7E-07 2.9E-10 55.3 28.0 401 1-439 9-453 (453) 181 protein:vir:97336 Length: 492 98.5 5.1E-07 3.2E-10 55.1 29.8 381 1-439 67-492 (492) 182 protein:vir:3609 Length: 452 # 98.4 6.5E-07 4E-10 54.5 28.3 392 1-439 9-452 (452) 183 protein:vir:79043 Length: 479 98.4 8.4E-07 5.2E-10 53.9 27.3 375 1-436 45-479 (479) 184 protein:vir:94805 Length: 492 98.4 8.4E-07 5.2E-10 53.9 30.4 397 1-439 43-492 (492) 185 protein:vir:9568 Length: 410 # 98.4 8.5E-07 5.3E-10 53.9 28.8 358 3-413 1-410 (410) 186 protein:vir:4782 Length: 522 # 98.4 9.5E-07 5.9E-10 53.6 28.0 403 1-439 14-522 (522) 187 protein:vir:99781 Length: 511 98.4 1.1E-06 6.5E-10 53.4 29.0 409 1-453 48-511 (511) 188 protein:vir:95113 Length: 474 98.4 1.1E-06 6.7E-10 53.3 29.7 387 1-439 35-474 (474) 189 protein:vir:1587 Length: 508 # 98.3 1.3E-06 8.1E-10 52.8 26.5 396 1-437 17-508 (508) 190 protein:vir:95899 Length: 474 98.3 1.4E-06 8.6E-10 52.7 28.7 379 1-439 50-474 (474) 191 protein:vir:96266 Length: 474 98.3 1.4E-06 8.6E-10 52.7 28.7 379 1-439 50-474 (474) 192 protein:vir:106639 Length: 481 98.3 1.4E-06 8.7E-10 52.7 29.3 403 1-438 28-481 (481) 193 protein:vir:99522 Length: 470 98.3 1.4E-06 8.9E-10 52.6 26.7 392 1-431 17-470 (470) 194 protein:vir:94546 Length: 506 98.3 1.5E-06 9.1E-10 52.6 24.5 406 1-444 22-506 (506) 195 protein:vir:78907 Length: 518 98.3 1.5E-06 9.3E-10 52.5 26.3 400 1-436 12-518 (518) 196 protein:vir:79703 Length: 505 98.3 1.6E-06 1E-09 52.3 28.3 393 1-428 14-505 (505) 197 protein:vir:98883 Length: 517 98.3 2E-06 1.2E-09 51.8 25.9 400 1-431 17-517 (517) 198 protein:vir:78537 Length: 480 98.2 2.1E-06 1.3E-09 51.8 29.5 402 
14-447 1-480 (480) 199 protein:vir:80680 Length: 441 98.2 2.4E-06 1.5E-09 51.4 25.6 380 1-433 11-441 (441) 200 protein:vir:96366 Length: 511 98.2 2.4E-06 1.5E-09 51.4 29.2 414 1-446 39-511 (511) 201 protein:vir:78805 Length: 511 98.2 2.4E-06 1.5E-09 51.4 29.2 414 1-446 39-511 (511) 202 protein:vir:9815 Length: 500 # 98.2 3.3E-06 2E-09 50.6 25.8 390 1-429 17-500 (500) 203 protein:vir:3028 Length: 500 # 98.2 3.3E-06 2E-09 50.6 25.8 390 1-429 17-500 (500) 204 protein:vir:106571 Length: 499 98.2 3.5E-06 2.2E-09 50.5 28.8 412 1-443 9-499 (499) 205 protein:vir:78227 Length: 480 98.1 3.9E-06 2.4E-09 50.2 28.8 411 2-447 1-480 (480) 206 protein:vir:96839 Length: 474 98.1 5.9E-06 3.7E-09 49.2 28.4 385 1-435 20-474 (474) 207 protein:vir:105292 Length: 478 98.0 6.1E-06 3.8E-09 49.2 30.7 376 1-437 50-478 (478) 208 protein:vir:733 Length: 453 # 98.0 6.4E-06 4E-09 49.0 27.8 398 1-440 9-453 (453) 209 protein:vir:94498 Length: 474 98.0 6.5E-06 4E-09 49.0 30.8 373 1-439 50-474 (474) 210 protein:vir:97447 Length: 474 98.0 6.5E-06 4E-09 49.0 30.8 373 1-439 50-474 (474) 211 protein:vir:78083 Length: 537 98.0 7.5E-06 4.6E-09 48.7 33.2 407 1-444 33-537 (537) 212 protein:vir:9871 Length: 429 # 97.9 1E-05 6.3E-09 48.0 29.8 377 1-437 8-429 (429) 213 protein:vir:4073 Length: 279 # 97.7 9.6E-07 5.9E-10 53.6 10.0 266 36-391 1-279 (279) 214 protein:vir:104892 Length: 558 97.6 4.4E-05 2.7E-08 44.5 22.7 438 1-458 3-558 (558) 215 protein:vir:107112 Length: 478 97.4 7.4E-05 4.6E-08 43.2 30.3 382 1-439 47-478 (478) 216 protein:vir:105461 Length: 470 97.2 0.00013 8.1E-08 41.9 29.1 375 1-433 29-470 (470) 217 protein:vir:106999 Length: 564 97.2 0.00014 8.7E-08 41.7 25.1 428 1-462 3-564 (564) 218 protein:vir:96179 Length: 468 97.0 0.00021 1.3E-07 40.8 29.7 371 1-434 50-468 (468) 219 protein:vir:105154 Length: 525 96.9 0.00025 1.6E-07 40.3 15.5 411 1-449 52-525 (525) 220 protein:vir:101189 Length: 516 96.9 0.00029 1.8E-07 40.0 23.5 403 1-431 5-516 (516) 221 protein:vir:101806 Length: 516 96.9 0.00029 
1.8E-07 40.0 23.5 403 1-431 5-516 (516) 222 protein:vir:98265 Length: 524 96.8 0.0003 1.9E-07 39.9 24.0 405 1-433 27-524 (524) 223 protein:vir:97265 Length: 513 96.6 0.0005 3.1E-07 38.7 26.0 413 2-446 1-513 (513) 224 protein:vir:102239 Length: 527 96.5 0.00054 3.3E-07 38.5 23.7 412 1-438 1-527 (527) 225 protein:vir:103177 Length: 533 96.5 0.00054 3.4E-07 38.5 22.4 426 1-458 3-533 (533) 226 protein:vir:102950 Length: 471 96.5 0.00057 3.5E-07 38.4 27.8 379 1-441 29-471 (471) 227 protein:vir:101494 Length: 527 96.5 0.00058 3.6E-07 38.3 23.9 412 1-438 1-527 (527) 228 protein:vir:9922 Length: 489 # 96.4 0.00061 3.8E-07 38.2 26.2 399 1-438 9-489 (489) 229 protein:vir:104500 Length: 537 96.4 0.00063 3.9E-07 38.1 25.5 425 1-440 4-537 (537) 230 protein:vir:81017 Length: 521 95.8 0.0014 8.9E-07 36.2 24.1 404 1-433 10-521 (521) 231 protein:vir:108049 Length: 524 95.7 0.0016 9.9E-07 35.9 22.2 405 1-433 9-524 (524) 232 protein:vir:103765 Length: 549 95.7 0.0012 7.5E-07 36.6 12.6 430 1-488 1-549 (549) 233 protein:vir:106282 Length: 521 94.9 0.0032 2E-06 34.3 23.8 404 1-433 7-521 (521) 234 protein:vir:100598 Length: 516 94.6 0.0039 2.4E-06 33.8 25.1 403 1-431 5-516 (516) 235 protein:vir:6596 Length: 521 # 94.4 0.0046 2.8E-06 33.4 26.0 405 1-433 22-521 (521) 236 protein:vir:6896 Length: 523 # 94.0 0.0057 3.5E-06 32.9 20.8 405 1-433 7-523 (523) 237 protein:vir:3361 Length: 535 # 92.7 0.01 6.5E-06 31.5 15.8 393 1-485 42-535 (535) 238 protein:vir:1538 Length: 535 # 92.6 0.011 6.6E-06 31.4 15.1 399 1-485 42-535 (535) 239 protein:vir:102668 Length: 547 91.8 0.014 8.8E-06 30.7 24.4 424 1-490 12-547 (547) 240 protein:vir:7208 Length: 524 # 91.5 0.016 9.7E-06 30.5 23.3 405 1-433 1-524 (524) 241 protein:vir:103458 Length: 524 91.3 0.016 1E-05 30.4 23.4 405 1-433 1-524 (524) 242 protein:vir:94956 Length: 452 90.8 0.019 1.2E-05 30.0 27.1 377 1-432 1-452 (452) 243 protein:vir:102330 Length: 451 90.4 0.021 1.3E-05 29.8 30.0 369 1-431 25-451 (451) 244 protein:vir:5665 Length: 511 # 90.4 0.021 
1.3E-05 29.8 23.4 402 1-431 1-511 (511) 245 protein:vir:103330 Length: 517 88.4 0.032 2E-05 28.7 16.9 407 1-438 3-517 (517) 246 protein:vir:80453 Length: 535 86.8 0.043 2.7E-05 28.1 27.5 407 1-438 28-535 (535) 247 protein:vir:7017 Length: 515 # 85.3 0.054 3.3E-05 27.5 17.5 400 1-479 43-515 (515) 248 protein:vir:78696 Length: 542 83.3 0.069 4.3E-05 26.9 17.4 398 1-440 1-542 (542) 249 protein:vir:2198 Length: 536 # 81.5 0.085 5.3E-05 26.5 16.7 414 1-470 15-536 (536) 250 protein:vir:96988 Length: 516 80.3 0.096 6E-05 26.2 15.5 382 1-468 44-516 (516) 251 protein:vir:95149 Length: 501 79.7 0.1 6.3E-05 26.0 27.5 404 1-437 1-501 (501) 252 protein:vir:10447 Length: 536 79.6 0.1 6.4E-05 26.0 17.4 414 1-470 15-536 (536) 253 protein:vir:99672 Length: 532 71.7 0.19 0.00012 24.5 17.8 382 1-467 42-532 (532) 254 protein:vir:100039 Length: 522 69.4 0.22 0.00014 24.2 13.8 407 1-467 9-522 (522) 255 protein:vir:105641 Length: 516 68.8 0.23 0.00014 24.1 13.6 404 1-479 18-516 (516) 256 protein:vir:94709 Length: 522 67.5 0.25 0.00016 23.9 25.0 420 1-494 14-522 (522) 257 protein:vir:8883 Length: 543 # 64.6 0.3 0.00018 23.5 18.7 396 1-476 42-543 (543) 258 protein:vir:80165 Length: 651 62.0 0.34 0.00021 23.1 23.1 442 1-518 27-650 (651) 259 protein:vir:78393 Length: 489 58.4 0.41 0.00026 22.7 27.1 396 1-438 1-489 (489) 260 protein:vir:94599 Length: 641 57.6 0.43 0.00027 22.6 12.3 447 1-506 67-641 (641) 261 protein:vir:97376 Length: 320 57.4 0.43 0.00027 22.6 12.6 308 1-395 2-320 (320) 262 protein:vir:78942 Length: 510 56.2 0.46 0.00029 22.4 22.8 397 1-483 31-510 (510) 263 protein:vir:7430 Length: 563 # 54.9 0.49 0.0003 22.3 25.2 428 1-451 1-563 (563) 264 protein:vir:95014 Length: 491 54.2 0.51 0.00032 22.2 26.3 398 1-441 1-491 (491) 265 protein:vir:8846 Length: 705 # 53.1 0.54 0.00033 22.1 19.0 447 1-518 44-691 (705) 266 protein:vir:80211 Length: 514 30.6 1.6 0.00097 19.5 18.6 419 1-503 7-514 (514) No 1 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME 
annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=100.00 E-value=2.3e-141 Score=791.56 Aligned_cols=518 Identities=99% Similarity=1.441 Sum_probs=502.1 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) ||+.+|++..+|+....++|+.+++++.|..+.+..+.+.++.++|+++++|++||++||++||++||++|++++++..+ T Consensus 1 ~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~~~~~~~~~~ 80 (518) T protein:vir:78 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) T ss_pred CcccCceeeccchhhhhhhhhhhcccccceeceecccccchhhHHhhhhHHHHHHHHHHHHhhccCceEEEEEcCCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999888888 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~ 160 (518) ..+++++.|+.+||++||+++||+.++.+++++||+|++++|+..|++++||||+|++|++..+.++....|.+...... T Consensus 81 ~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv~~~~~~~~~~y~~~~~~~~ 160 (518) T protein:vir:78 81 EHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) T ss_pred ccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEEEEcCCCCEEEEEEEecCCc Confidence 88899999999999999999999999999999999999999999999999999999999999998888888888777777 Q ss_pred CceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHH Q lcl|NC_021305. 
161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFD 240 (518) Q Consensus 161 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~ 240 (518) ++..+.|++++||||+++++++..+|+||+.++...|....++++++.++|+||++|++||++++.+++++.+++++.|+ T Consensus 161 ~~~~~~~~~~eIiHir~~~~dg~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~ls~e~~~~~k~~~~ 240 (518) T protein:vir:78 161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSPEAQQRLREQFD 240 (518) T ss_pred cceeEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHH Confidence 78889999999999999999988899999999999999999999999999999999999999999999999999999999 Q ss_pred HHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhH Q lcl|NC_021305. 241 RAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAI 320 (518) Q Consensus 241 ~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P 320 (518) +.++|..|+|+++||++|++|++++.++.|+||++++++++++||++|||||++||+.+++|++|.+++.+.|+++||.| T Consensus 241 ~~~~G~~nag~~~vL~~G~~~~~l~~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~~~~f~~~tL~P 320 (518) T protein:vir:78 241 RAHAGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAI 320 (518) T ss_pred HHhcCcccCCceeEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccccc Q lcl|NC_021305. 
321 PIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQ 400 (518) Q Consensus 321 ~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~ 400 (518) |+.+||++||++|++..++.++++||++.+++.|.+++++++.+++++|++|+||+|+++|++|++++|||+++++.|++ T Consensus 321 ~~~~ie~eln~~L~~~~~~~~~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~~~gD~~~v~~n~~ 400 (518) T protein:vir:78 321 PIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQ 400 (518) T ss_pred HHHHHHHHHHHhhcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccce Confidence 99999999999999988888899999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcccCCchhhHHHHHHHHHhh Q lcl|NC_021305. 401 PLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAVKGAM 480 (518) Q Consensus 401 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~ 480 (518) |++...++...+++++++.+++.++.++.++.++++.++++++++++.++++++++++.++|+.++|++.||+++|++|| T Consensus 401 pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (518) T protein:vir:78 401 PLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPASVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAVKGAM 480 (518) T ss_pred ecccccccccCCCCCCCCCCCCcccccccccCccccCCCCCcccccccccccccchhcccCCCCcccccchHHHHHHHHh Confidence 99998888888888888889999998888999999999999999999999999999999999999999999999999999 Q ss_pred ccccCcCchhHHHHHHHHHHHhHHHHhhhhhhhcccCC Q lcl|NC_021305. 
481 GRGKDIKGFALQLAEKYPDDLEDILLAVQLALAERKDN 518 (518) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (518) |++|+.++|+||+++||+|+|++|+||+|+|||||||| T Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (518) T protein:vir:78 481 GRGKDIKGFALQLAEKYPDDLEDILLAVQLALAERKDN 518 (518) T ss_pred hcCCcchhhhhhhhhhcchhHHHHHHHHHHhhhhccCC Confidence 99999999999999999999999999999999999999 No 2 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=100.00 E-value=8.3e-141 Score=788.57 Aligned_cols=518 Identities=100% Similarity=1.449 Sum_probs=501.9 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) ||+.+|+.+.+|+....++|+.++|++.|..+.+..+.++++.++|+++++|++||++||++||+|||++|++++++..+ T Consensus 1 ~~~~~~~~~~~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~~~~~~~~~~ 80 (518) T protein:vir:10 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) T ss_pred CcccCceeecCchhhhhhhhhhcccccccccceecccccchhhHHHhhhHHHHHHHHHHHHhhccCceEEEEEcCCCcee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999888888 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~ 160 (518) ..+|+++.|+.+||++||+++||+.++.+++++||+|++++|+..|++++|+||+|++|++..+.++..+.|.+...... 
T Consensus 81 ~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v~~~~~~~~~~y~~~~~~~~ 160 (518) T protein:vir:10 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) T ss_pred ccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEEEEcCCCCEEEEEEEecCCc Confidence 88999999999999999999999999999999999999999999999999999999999999998888888888777777 Q ss_pred CceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHH Q lcl|NC_021305. 161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFD 240 (518) Q Consensus 161 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~ 240 (518) +++.+.|++++||||+++++++..+|+||+..+..+|....++++++.++|+||++|+|||++++.+++++.+++++.|+ T Consensus 161 ~~~~~~~~~~eViHir~~s~dg~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~k~~~~ 240 (518) T protein:vir:10 161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFD 240 (518) T ss_pred cceEEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHH Confidence 78889999999999999999998899999999999999999999999999999999999999999999999999999999 Q ss_pred HHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhH Q lcl|NC_021305. 
241 RAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAI 320 (518) Q Consensus 241 ~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P 320 (518) +.++|..|+|+++||++|++|+++++++.|+||++++++++++||++|||||++||+.+++|++|.+++.+.|+++||.| T Consensus 241 ~~~~G~~nag~v~vL~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P 320 (518) T protein:vir:10 241 RAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAI 320 (518) T ss_pred HHhcCccccCcceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccccc Q lcl|NC_021305. 321 PIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQ 400 (518) Q Consensus 321 ~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~ 400 (518) |+..||++||++|++..++.++++||++.+++.|.+++++++.+++++|++|+||+|+++|++|++++|||+++++.|++ T Consensus 321 ~l~~ie~~ln~~L~~~~~~~~~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~~~gD~~~~~~n~~ 400 (518) T protein:vir:10 321 PIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQ 400 (518) T ss_pred HHHHHHHHHHHhhcccccCCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeecccce Confidence 99999999999999988888899999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcccCCchhhHHHHHHHHHhh Q lcl|NC_021305. 
401 PLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAVKGAM 480 (518) Q Consensus 401 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~ 480 (518) |++...++...+++++++++++.++.++.+++++.+.+.++++++++++++++.++++.+.|+.++|++.||+++|++|| T Consensus 401 pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (518) T protein:vir:10 401 PLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAVKGAM 480 (518) T ss_pred ecccccccccCCCCCCCCCCCCccccccccccccccCCCCCcccccccccccccchhccccCCCcccccchHHHHHHHHh Confidence 99988888888888888889999998888999999999999999999999999999999999999999999999999999 Q ss_pred ccccCcCchhHHHHHHHHHHHhHHHHhhhhhhhcccCC Q lcl|NC_021305. 481 GRGKDIKGFALQLAEKYPDDLEDILLAVQLALAERKDN 518 (518) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (518) |++|+.++|+||+++||+|+|++|+||+|+|||||||| T Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (518) T protein:vir:10 481 GRGKDIKGFALQLAEKYPDDLEDILLAVQLALAERKDN 518 (518) T ss_pred hcCccchhHhhhhhhhcchhHHHHHHHHHHhhhhccCC Confidence 99999999999999999999999999999999999999 No 3 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=100.00 E-value=4.8e-100 Score=565.09 Aligned_cols=448 Identities=18% Similarity=0.201 Sum_probs=364.0 Q ss_pred cCCCCCC-CC-ccccccc--chhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_021305. 2 LLANGQT-LS-APAMAEL--SPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDT 77 (518) Q Consensus 2 ~f~~~~~-~~-~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~ 77 (518) ||..-+. ++ ++..+.. 
..|....-..++..+..+.++..++.+.++++++|++||++||++||+|||++|+++.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~v~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~g 80 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAWQQGVKADPEAVLSFHAVFACISLISQDIAKMRLRLMQTDAQG 80 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcchhhcCcccChHHhhccHHHHHHHHHHHHhhccCceEEEEeccCC Confidence 4442221 11 1111111 123221111111122234455678889999999999999999999999999999988766 Q ss_pred c-eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeec Q lcl|NC_021305. 78 E-TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQA 156 (518) Q Consensus 78 ~-~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~ 156 (518) . ++..+|++++|+.+||++||+++||+.++.+++++||+|++++|+..|++.+|||++|++|++..+.++.. .|.+.. T Consensus 81 ~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~-~y~~~~ 159 (454) T protein:vir:93 81 IRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILDWNRVEPLVADDGEV-FYRITP 159 (454) T ss_pred ccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCcceEEEEcCCCcE-EEEEEe Confidence 4 45678889999999999999999999999999999999999999999999999999999999998876543 444443 Q ss_pred cc-ccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHH Q lcl|NC_021305. 157 GA-GVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRL 235 (518) Q Consensus 157 ~~-~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~ 235 (518) .. 
...+..+.|++++|||++.....+..+|+||+..+...+....+++++..++|+||++|++||++++.+++++.+++ T Consensus 160 ~~~~~~~~~~~~~~~eViH~k~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~ 239 (454) T protein:vir:93 160 DRNCGITEAVTVPAREVIHDRFNCFFHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPGSITEENAKKL 239 (454) T ss_pred ccccccceeEEecCcceEEeccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHH Confidence 32 23355678999999999976666667999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHH Q lcl|NC_021305. 236 REQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYR 315 (518) Q Consensus 236 ~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~ 315 (518) ++.|++.++| .|+|+++||++|++|++++.++.|+||+|++++.+++||++|||||++||+.+++|++|++++.+.|++ T Consensus 240 ~~~~~~~~~g-~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~ 318 (454) T protein:vir:93 240 KSNWDSGYTG-ENAGKTAILSNGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALEQQYYS 318 (454) T ss_pred HHHHHHHhcc-cccCCceeccCCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHHHHHHH Confidence 9999999988 789999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeee Q lcl|NC_021305. 316 DTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYA 395 (518) Q Consensus 316 ~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~ 395 (518) +||.||+..||++||++|++.. 
.++++|+++.+++.|.+++++.+.+++++|++|+||+|+++|++|++ |||++++ T Consensus 319 ~~l~P~~~~ie~~ln~~L~~~~--~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~--ggD~~~~ 394 (454) T protein:vir:93 319 QCLQTLIESIELLLDEALETGE--NESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLA--GGDALYL 394 (454) T ss_pred HHHHHHHHHHHHHHHHhhcCCC--CcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CCCeeee Confidence 9999999999999999998754 45799999999999999999999999999999999999999999995 8999999 Q ss_pred cccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcc Q lcl|NC_021305. 396 NSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQK 462 (518) Q Consensus 396 ~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k 462 (518) +.|+++++....+....++....+++..++.+......+....+.+. +...+.-++.+.| T Consensus 395 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~e~~~-------d~~~~~~~~~~~~ 454 (454) T protein:vir:93 395 QQQNYSLEALSRRDAREDPFASSGKTASVPQAVAASDGNKAITETEH-------DAVKAMFRGILKK 454 (454) T ss_pred ccCccchHhhhccCcccCCCCCCccCCCCCCCCCCCCCCCCccCCcc-------chhhhhhhhhhcC Confidence 99999998776666555554444555444433322222222222222 2223333343434 No 4 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=100.00 E-value=4.7e-97 Score=548.71 Aligned_cols=402 Identities=22% Similarity=0.300 Sum_probs=350.5 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) |||.....+++......++.+..+++.. .++..++.+.++++++|++||++||++||++||++|++++++.+. 
T Consensus 1 m~f~~~~~~~~~~~~~~~~~~~~~~g~~-------~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~~~ 73 (409) T protein:vir:10 1 MLFRKGFKNQSQEISIDDKKILEWLGIN-------PSETYVNGKSCLKQATVFGCIRILSDNISKLPIKIYQKKDGIKRV 73 (409) T ss_pred CcccccccCcCCCCCCChHHHHHHhcCC-------cCcceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCeeec Confidence 9999776665554443333333333321 234567788899999999999999999999999999987666655 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCcee-----eEEeee Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGR-----YEYYFQ 155 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~-----~~~~~~ 155 (518) ..|++.++|+.+||++||+++||+.++.+++++||+|++++|+..|++++|||++|++|++..+.++.. ..|.+. T Consensus 74 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~~~~~y~~~ 153 (409) T protein:vir:10 74 PDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSDGMKIFVDDTGLLNSENNVWYLYT 153 (409) T ss_pred cCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCCceEEEEcCCccccccceEEEEEE Confidence 666677778889999999999999999999999999999999999999999999999999988765432 223222 Q ss_pred cccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHH Q lcl|NC_021305. 
156 AGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRL 235 (518) Q Consensus 156 ~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~ 235 (518) ...+..+.|++++|||++++++++ .+|+||+..+.+++....++++++.++|+||++|++||++++.+++++.+++ T Consensus 154 ---~~~g~~~~~~~~evih~r~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~ 229 (409) T protein:vir:10 154 ---DDLGQRHKFMSDEILHFKGLTADG-LAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYAGDLNPEAEEVF 229 (409) T ss_pred ---eCCceeEEeccccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCHHHHHHH Confidence 234567789999999999998876 6899999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHH Q lcl|NC_021305. 236 REQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYR 315 (518) Q Consensus 236 ~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~ 315 (518) ++.|++.+.|..|+|+++|+++|++|++++.++.|+||++++++..++||++|||||.+||+.++++++|.+++.+.|++ T Consensus 230 ~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~~~f~~ 309 (409) T protein:vir:10 230 KENFERMSSGLKNAHRIAMLPIGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQNREFYI 309 (409) T ss_pred HHHHHHHhccccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhHHHHHHHHHHHHhhhhhhc--ccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCccee Q lcl|NC_021305. 
316 DTMAIPIARIQSAMDKYVGQYWV--RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADEL 393 (518) Q Consensus 316 ~~l~P~~~~ie~~l~~~l~~~~~--~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~ 393 (518) +||.|+++.||++||++|++..+ .+++++||++.+++.|.+++++.+.+++++|++|+||+|+++|+||++ |||++ T Consensus 310 ~~l~P~~~~ie~~ln~kL~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~--ggD~~ 387 (409) T protein:vir:10 310 DTLQSILNMYELEINYKLFLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPLE--GGDVL 387 (409) T ss_pred HHHHHHHHHHHHHHHHhhcCchhccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CcCee Confidence 99999999999999999997654 457899999999999999999999999999999999999999999995 89999 Q ss_pred eecccccccccccccCCCCCCCCC Q lcl|NC_021305. 394 YANSALQPLGATPDGAVEWEEAPA 417 (518) Q Consensus 394 ~~~~n~~~~~~~~~~~~~~~~~~~ 417 (518) ++|+|++|++...++.. +.+++ T Consensus 388 ~~~~n~~~~~~~~~~~~--kgGe~ 409 (409) T protein:vir:10 388 LINGNMIPVKMAGEQYS--KGGEK 409 (409) T ss_pred eeccCccchhhcccccc--ccCCC Confidence 99999999876533211 11111 No 5 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=100.00 E-value=1.3e-96 Score=546.27 Aligned_cols=443 Identities=17% Similarity=0.200 Sum_probs=351.7 Q ss_pred CcCCCCCCCCccc--ccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 1 MLLANGQTLSAPA--MAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) =||+++..+..+. .+..+++... .+..+..+.++..++.+.++++++|++||++||++||+|||++|++.+++. 
T Consensus 6 ~l~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~~~ 81 (457) T protein:vir:62 6 ALFGRGHSPALDAAEGRAWEPYDPS----IYNLGATASSGERVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGTR 81 (457) T ss_pred hhhccccccccccccccccccchhh----hhhccccccCCceechHHhhccHHHHHHHHHHHHhHhhCceEEEEecCCcc Confidence 3455433322111 1111221111 111223344556788899999999999999999999999999999988778 Q ss_pred eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCce---eeEEeee Q lcl|NC_021305. 79 TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTG---RYEYYFQ 155 (518) Q Consensus 79 ~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~---~~~~~~~ 155 (518) ++..++.++.|+.+||+.||+++||+.++.+++++||+|++|.++ .|.+.+||||+|.+|++..+..+. ..++.|. T Consensus 82 ~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~y~ 160 (457) T protein:vir:62 82 KEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA-GPNIAGLDVLDPTKIHVHMVMVDGLRRKVFEAYD 160 (457) T ss_pred ccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEeccCCccceeEEEEE Confidence 888899999999999999999999999999999999999998665 689999999999999987765433 2233333 Q ss_pred cccc-cCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHH Q lcl|NC_021305. 156 AGAG-VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQR 234 (518) Q Consensus 156 ~~~~-~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~ 234 (518) +... 
.......|++++||||+++++++..+|+||+..+...|....++++++.++|+||++|++||++++.++++++++ T Consensus 161 ~~~~g~~~~~~~~~~~eiih~r~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~ 240 (457) T protein:vir:62 161 IDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPGTMSEEGLAR 240 (457) T ss_pred EccCCceeEEEeeCccceEEecCCCCCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCCCCCHHHHHH Confidence 2221 122345689999999999999988899999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccccc--CCHHHHHHH Q lcl|NC_021305. 235 LREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATF--SNISAQMRA 312 (518) Q Consensus 235 ~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--sn~e~~~~~ 312 (518) +++.|++.++|.+|+|+++||++|++|++++.++.|+||++++++++++||++|||||++||..+++++ +|.|++.+. T Consensus 241 ~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~ 320 (457) T protein:vir:62 241 AREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIA 320 (457) T ss_pred HHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHH Confidence 999999999999999999999999999999999999999999999999999999999999999888765 889999999 Q ss_pred HHHHHhhHHHHHHHHHHHHhhhhhhcc-cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcc Q lcl|NC_021305. 313 FYRDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKAD 391 (518) Q Consensus 313 ~~~~~l~P~~~~ie~~l~~~l~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD 391 (518) |+++||.||+++||++||++|+++.+. 
.++++||++.+++.|.+++++++.+++++|+||+||+|+++|+||+++++|| T Consensus 321 f~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~D 400 (457) T protein:vir:62 321 FTMFSLRPWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGE 400 (457) T ss_pred HHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcc Confidence 999999999999999999999987664 4578999999999999999999999999999999999999999999988889 Q ss_pred eeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcccCCchh Q lcl|NC_021305. 392 ELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKES 468 (518) Q Consensus 392 ~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~ 468 (518) ++++|+|+.+++.....+....+. ....+.+.+.++.. +. ..+..++... ++. +.|. T Consensus 401 ~~~~~~n~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~--~~--~~~~~~d~~~-------~~~--------~~~~ 457 (457) T protein:vir:62 401 KYRVPLNLGEIGEEPEPEPAPAPP-AIDPPAEEPADDEE--PD--NAEGDPDEGE-------TED--------DDDA 457 (457) T ss_pred eeeeccccccccccccccccCCCc-cCCCCccCCCCCCC--CC--CCCCCCcccc-------ccc--------cccC Confidence 999999999987665433222111 11111111111110 00 1111111111 000 0000 No 6 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=100.00 E-value=7e-96 Score=542.27 Aligned_cols=410 Identities=19% Similarity=0.267 Sum_probs=353.4 Q ss_pred CcCCC---CCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_021305. 
1 MLLAN---GQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDT 77 (518) Q Consensus 1 ~~f~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~ 77 (518) |||.+ +++..+......++++.+.|++ ..+.++..++.+.++++++|++||++||++||+|||++|++++++ T Consensus 1 m~~~~~f~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~v~~~~al~~~~v~~~i~~Ia~~ia~l~~~~~~~~~~~ 75 (416) T protein:vir:12 1 MLLERMFEKRSGSSDHEDGFNNILLNMFGG-----RKTASGERVSESNSLVQPDIFACVNVLSDDIAKLPIHTYKRTDGG 75 (416) T ss_pred CccchhcccccCccccCccchhHHHHhhcC-----cccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCc Confidence 99985 4444444444455555555543 334555678889999999999999999999999999999988777 Q ss_pred ceec-cchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeec Q lcl|NC_021305. 78 ETEE-SDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQA 156 (518) Q Consensus 78 ~~~~-~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~ 156 (518) .+.. .|+..++|+.+||++||+++||+.++.+++++|++|+++.|+..|.+.+||||+|++|++..+.+++.++|.+.. T Consensus 76 ~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~ 155 (416) T protein:vir:12 76 IERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRPDYTNAYVHPTTGMLWYQTVL 155 (416) T ss_pred cccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEeCCCcEEEEEEec Confidence 6554 466667788999999999999999999999999999999999999999999999999999998888887776643 Q ss_pred ccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHH Q lcl|NC_021305. 
157 GAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLR 236 (518) Q Consensus 157 ~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~ 236 (518) ++..+++++++|||++++++++ .+|+||+.++..++....+++++..++|+||+.|++||++++.+++++.++++ T Consensus 156 ----~g~~~~~~~~eiih~~~~~~~~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~ 230 (416) T protein:vir:12 156 ----NGKAIELYDYEVLHFKGLSTDG-IHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPAFLDEKPKENVR 230 (416) T ss_pred ----CCeEEEecCccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCHHHHHHHH Confidence 4567899999999999888776 68999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHH Q lcl|NC_021305. 237 EQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRD 316 (518) Q Consensus 237 ~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~ 316 (518) +.|+... ++++++|+++|++|++++.++.|+||++++++..++||++|||||+++|..+.+|++|.+++.+.|+++ T Consensus 231 ~~~~~~~----~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~ 306 (416) T protein:vir:12 231 KEWKRVN----KVENIAIIDYGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQSIEYVRN 306 (416) T ss_pred HHHHHHh----cCCCeeecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHHHHHHHH Confidence 9998653 568899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhHHHHHHHHHHHHhhhhhhcc--cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceee Q lcl|NC_021305. 317 TMAIPIARIQSAMDKYVGQYWVR--KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELY 394 (518) Q Consensus 317 ~l~P~~~~ie~~l~~~l~~~~~~--~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~ 394 (518) ||.|++++||++||++|+++.+. 
+++++||++.+++.|.+++++++.+++++|++|+||+|+++|+||++ |||+++ T Consensus 307 ~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~--ggd~~~ 384 (416) T protein:vir:12 307 TLQPWIVNFEQELNVKLFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIE--NGDKYI 384 (416) T ss_pred HHHHHHHHHHHHHHHhhcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--Ccceee Confidence 99999999999999999976643 57899999999999999999999999999999999999999999995 799999 Q ss_pred ecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCC Q lcl|NC_021305. 395 ANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTS 436 (518) Q Consensus 395 ~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 436 (518) +|+|+++++.....+...... ...+++ +.+.+ T Consensus 385 ~~~n~~~~~~~~~~~~~~~~~-------~~~gge---~~~~g 416 (416) T protein:vir:12 385 SSLNYVFLDFLEEYQRLKAGG-------AMKGGD---NKNEG 416 (416) T ss_pred eccccccccccchhhcccccc-------ccCCCC---CcCCC Confidence 999999998765443221111 111111 00000 No 7 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=100.00 E-value=2.2e-95 Score=539.50 Aligned_cols=414 Identities=18% Similarity=0.238 Sum_probs=345.4 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) |+|.+.-...+.+....+.| ...+++.. 
+..+.++..++.+.++++++|++||++||++||+|||++|+++.++..+ T Consensus 1 m~~~~~~~~~~~~~s~~~~w-~~~~~~~~--~~~~~~g~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~g~~~ 77 (421) T protein:vir:10 1 MFIPQMFEGKKRSVSGGGFW-EAMLGGVR--SSHSKAGVMITPETALALSAVRACVTLLAESVAQLPVELYRRDKNGGRQ 77 (421) T ss_pred CCCcchhcccccccCcchhh-HHHhhhhc--cCcccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCcee Confidence 77765544443333322223 22333221 2334456678899999999999999999999999999999987666443 Q ss_pred --ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccc Q lcl|NC_021305. 81 --ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGA 158 (518) Q Consensus 81 --~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~ 158 (518) ..|+.+++|+.+||++||+++||+.++.+++++|++|++++|+..|++.+||||+|+.|++..+.++..+ |.+.. T Consensus 78 ~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~g~~~-y~~~~-- 154 (421) T protein:vir:10 78 RATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKELIPINPKKVIVLKGPDGMPY-YEIPE-- 154 (421) T ss_pred ecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEECCCceEE-EEEcC-- Confidence 3456667788899999999999999999999999999999999999999999999999999888776543 33322 Q ss_pred ccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccC----CHHHHHH Q lcl|NC_021305. 
159 GVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRL----SEAAQQR 234 (518) Q Consensus 159 ~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~----~~~~~~~ 234 (518) .+ ..++.++|||+++++.++ .+|+||+..+..++....+++++..++|+||++|+|+|++++.+ ++++.++ T Consensus 155 --~g--~~~~~~eiih~~~~~~d~-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~ 229 (421) T protein:vir:10 155 --IG--ETLPMRMMHHVKVFSLDG-YIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQ 229 (421) T ss_pred --CC--cEEchhhEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHH Confidence 12 258899999999988776 68999999999999999999999999999999999999987654 8999999 Q ss_pred HHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHH Q lcl|NC_021305. 235 LREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFY 314 (518) Q Consensus 235 ~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~ 314 (518) +++.|++.++|..|+|+++||++|++|++++.++.|+||+|.++++.++||++|||||++||+.+.+|++|.|++.+.|+ T Consensus 230 ~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~ 309 (421) T protein:vir:10 230 LLAKWTDRYSGINNMFSVALLQEGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNNIEHQGLQFV 309 (421) T ss_pred HHHHHHHHhcCccccCcceecCCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCccccHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhHHHHHHHHHHHHhhhhhhcc-cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCccee Q lcl|NC_021305. 
315 RDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADEL 393 (518) Q Consensus 315 ~~~l~P~~~~ie~~l~~~l~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~ 393 (518) ++||.|++.+||++||++|+++.++ +++++||++.+++.|.+++++.+.+++++|++|+||+|+++|+||++ |||++ T Consensus 310 ~~tl~P~~~~ie~~ln~kL~~~~~~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~--ggD~~ 387 (421) T protein:vir:10 310 MYTLLAWLKRHEGALQRDLLLPSERRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLPPIA--GGDKY 387 (421) T ss_pred HHHHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--Cccee Confidence 9999999999999999999986554 56789999999999999999999999999999999999999999995 89999 Q ss_pred eecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccch Q lcl|NC_021305. 394 YANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTN 444 (518) Q Consensus 394 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (518) ++|+|+++++....+.. ++...+++++++=.++ | T Consensus 388 ~~~~n~~~~~~~~~~~~---------~~~~~~~~e~d~~~~~--------~ 421 (421) T protein:vir:10 388 LTPLNMVDSAQIIPGDK---------KPTAQQMAEIDTILSR--------T 421 (421) T ss_pred eeccccccccccccCCC---------CcccccCccccccccc--------C Confidence 99999987664432211 1111111111111111 1 No 8 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=100.00 E-value=3e-95 Score=538.79 Aligned_cols=442 Identities=16% Similarity=0.186 Sum_probs=350.7 Q ss_pred CcCCCCCCCC--cccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 1 MLLANGQTLS--APAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) =||+++..+. ....+..++.....+. .+....++..++.+.++++++|++||++||++||+|||++|++++++. 
T Consensus 6 ~l~~r~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~~ 81 (457) T protein:vir:13 6 ALFGRGHSPALDGIEARAWEPYDPSIYN----LGAVAASGETVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGSR 81 (457) T ss_pred hhhcccccccccccccccccccchHHHh----hcccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEecCCcc Confidence 3566555432 2223322222111111 122334456788899999999999999999999999999999988877 Q ss_pred eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCcee---eEEeee Q lcl|NC_021305. 79 TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGR---YEYYFQ 155 (518) Q Consensus 79 ~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~---~~~~~~ 155 (518) ++..++.++.++..||..||+++||+.++.+++++||+|++|.++ .|.+++||||+|.+|++..+..+.. .++.|. T Consensus 82 ~~~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~y~ 160 (457) T protein:vir:13 82 KEIVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQ-GPNIVGLDVLDPTKIHVHMVMVDGLRRKVFEAYD 160 (457) T ss_pred cccccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEccCceEEEEecCCCccceeEEEEE Confidence 777777777777766668999999999999999999999999776 6899999999999999887654432 222332 Q ss_pred cccc-cCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHH Q lcl|NC_021305. 156 AGAG-VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQR 234 (518) Q Consensus 156 ~~~~-~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~ 234 (518) +... 
.......|++++|||++++++++.++|+||+..+...|....++++++.++|+||++|++||++++.++++++++ T Consensus 161 ~~~~~~~~~~~~~~~~diih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~ 240 (457) T protein:vir:13 161 IDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPGTMSEEGLAR 240 (457) T ss_pred EecCCceeeEEeeCccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCCHHHHHH Confidence 2221 122345689999999999999988899999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccccc--CCHHHHHHH Q lcl|NC_021305. 235 LREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATF--SNISAQMRA 312 (518) Q Consensus 235 ~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--sn~e~~~~~ 312 (518) +++.|++.++|..|+|+++||++|++|++++.++.|+||++++++.+++||++|||||++||..+++++ +|.+++.+. T Consensus 241 ~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~ 320 (457) T protein:vir:13 241 AREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIA 320 (457) T ss_pred HHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHH Confidence 999999999999999999999999999999999999999999999999999999999999999887765 889999999 Q ss_pred HHHHHhhHHHHHHHHHHHHhhhhhhcc-cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcc Q lcl|NC_021305. 
313 FYRDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKAD 391 (518) Q Consensus 313 ~~~~~l~P~~~~ie~~l~~~l~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD 391 (518) |+++||.||++.||++||++|+++.++ .++++||++.+++.|.+++++++.+++++|+||+||+|+++|++|++++.|| T Consensus 321 f~~~tl~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~~g~~d 400 (457) T protein:vir:13 321 FTMFSLRPWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGE 400 (457) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCccccCceeEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCccc Confidence 999999999999999999999987665 4578999999999999999999999999999999999999999999988889 Q ss_pred eeeecccccccccccccCCCCCCCCCCCCCccCCCCC-CCccccCCccccccchhcch Q lcl|NC_021305. 392 ELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVAS-LDQSPPTSVPGLSPTNSDRS 448 (518) Q Consensus 392 ~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 448 (518) ++++|+|+++++.....+....+. ..+.+...+.++ +....++.....+.+..+.+ T Consensus 401 ~~~~~~n~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~g~~d~~~~~~~~~~~~~ 457 (457) T protein:vir:13 401 KYRVPLNLGEVGEEPEPEPAPAPP-AIEPPAEEPDEEPEPEGKPDDEGATEEDDEDDA 457 (457) T ss_pred ceeeccccccccccccccccCCCC-CCCCCccccCCCCCCCCCCccccCCCCcccccC Confidence 999999999987654432221111 111111111111 11111111111122222221 No 9 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=100.00 E-value=1.3e-95 Score=540.85 Aligned_cols=415 Identities=20% Similarity=0.281 Sum_probs=340.8 Q ss_pred CcCCC---CCCCC--------cccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceE Q lcl|NC_021305. 
1 MLLAN---GQTLS--------APAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVK 69 (518) Q Consensus 1 ~~f~~---~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~ 69 (518) +|+.- ..+.+ .++....++++... ..|..+.++..++.+.++++++|++||++||++||++||+ T Consensus 4 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~g~~~~~g~~v~~~~al~~~~V~~~i~~ia~~ia~lp~~ 78 (434) T protein:vir:43 4 SLGKVLSSATSAPRSSLFGWGGKTIRLTDGAFWSQ-----FLGRESSSGKKVTVDKAMKLSAVWACVRLISTSVAGLPLG 78 (434) T ss_pred chhhhhhhcccccchhhhcccccccccCchHHHHH-----HhcCCccCCceechhhhhccHHHHHHHHHHHHhhhhCceE Confidence 22221 11111 11122223333222 2334455667788899999999999999999999999999 Q ss_pred EEEecCCcce-e-ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCc Q lcl|NC_021305. 70 CMFTSGDTET-E-ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT 147 (518) Q Consensus 70 v~~~~~~~~~-~-~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~ 147 (518) +|+++.++.+ . ..|+++++|+.+||++||+++||+.++.+++++||+|++|.++ .|++++|+||+|+.|++..+.++ T Consensus 79 ~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~-~G~~~~L~~l~p~~v~~~~~~~g 157 (434) T protein:vir:43 79 VYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA-AGRPAALDFLLPSRVDLECDENG 157 (434) T ss_pred EEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEcCCC Confidence 9998866643 3 3455666777899999999999999999999999999998877 69999999999999999988776 Q ss_pred eeeEEeeecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccC Q lcl|NC_021305. 148 GRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRL 227 (518) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~ 227 (518) ...+ .+.. 
.++..+.|++++|||+++++.++ .+|+||+..+...+....+++++..++|+||++|+++|++++.+ T Consensus 158 ~~~y-~~~~---~~g~~~~~~~~eVih~~~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l 232 (434) T protein:vir:43 158 RLKY-FYTT---KKGARREIERTNMLHIPAFTLDG-RIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRIL 232 (434) T ss_pred eEEE-EEEe---cCceEEEEccccEEEecCcCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCCC Confidence 5443 3322 34567899999999999987776 68999999999999999999999999999999999999999999 Q ss_pred CHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccc--cCC Q lcl|NC_021305. 228 SEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRAT--FSN 305 (518) Q Consensus 228 ~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~--~sn 305 (518) ++++.+++++.|++ +.|..|+|+++||++|++|++++.++.|+||++.++++.++||++|||||++||..+.++ ++| T Consensus 233 ~~e~~~~~r~~~~~-~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~ 311 (434) T protein:vir:43 233 QPAQREEFREYVKS-VSGAMNSGRSPVLEQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTG 311 (434) T ss_pred CHHHHHHHHHHHHH-hcCccccCCccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccch Confidence 99999999999975 567789999999999999999999999999999999999999999999999999887654 789 Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcc-cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Q lcl|NC_021305. 
306 ISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPR 384 (518) Q Consensus 306 ~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p 384 (518) .+++...|+++||.||+.+||++||++|++..++ .++++||++.+++.|.+++++.+.+++++|++|+||+|+++|+|| T Consensus 312 ~e~~~~~f~~~~L~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p 391 (434) T protein:vir:43 312 LEQQMLAFLTFSISSITNQIQQCVNKRLLTAPERIRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRKENLPE 391 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhcCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Confidence 9999999999999999999999999999987654 567999999999999999999999999999999999999999999 Q ss_pred CCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCC Q lcl|NC_021305. 385 SDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASL 429 (518) Q Consensus 385 ~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (518) ++ |||++++|+|++|++...+.+..............+|..++ T Consensus 392 ~~--ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 392 LP--GGDILTVQSNLVPIDQLGQSNKSQAVRAALMNWFSQPEPQE 434 (434) T ss_pred CC--CCCeEeeccCccchhhhhccCCCcchhhhhhccCCCCCCCC Confidence 95 89999999999999866543332221111111111111111 No 10 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=100.00 E-value=2.6e-95 Score=539.10 Aligned_cols=412 Identities=18% Similarity=0.246 Sum_probs=346.6 Q ss_pred CcCCCCCCCCccc-ccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANGQTLSAPA-MAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) |+|++........ .....+|+...+|... +.++..++.+.++++++|++||++||++||++||++|++++++.. 
T Consensus 1 ~~~~r~~~~~~~~~~~~~~~~~~~~~g~~~-----s~~~~~vt~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~ 75 (419) T protein:vir:14 1 MFFSRQLLSNLGQTQMSAGGWVSALLGSSR-----SDSGQVVTPASALALTVLQNCVTLLAESIAQLPIELYERSGEDRK 75 (419) T ss_pred CcccccccccccccccCcchhhHHhhcCCC-----ccCCcccchHHhhccHHHHHHHHHHHHhhccCceEEEEecCCccc Confidence 9999877655443 3333457665554332 344566888999999999999999999999999999999887766 Q ss_pred eccc-hHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccc Q lcl|NC_021305. 80 EESD-TGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGA 158 (518) Q Consensus 80 ~~~~-~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~ 158 (518) +..+ ++.++|+.+||++||+++||+.++.+++++||+|++|+|+..|.+++|||++|++|++..+.++.. .|.+... T Consensus 76 ~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~v~v~~~~~~~~-~y~~~~~- 153 (419) T protein:vir:14 76 PATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEAVTVMRGSDLKP-VYRVRGS- 153 (419) T ss_pred cccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCceE-EEEEccC- Confidence 6554 455567779999999999999999999999999999999999999999999999999988776654 3333221 Q ss_pred ccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccC----CHHHHHH Q lcl|NC_021305. 
159 GVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRL----SEAAQQR 234 (518) Q Consensus 159 ~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~----~~~~~~~ 234 (518) ..++.++|+|+++++.++ .+|+||+..+.+++....+++++..++|+||+.|+|+|++++.+ ++++.++ T Consensus 154 ------~~~~~~~i~h~~~~~~dg-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~ 226 (419) T protein:vir:14 154 ------DPMPQRLVHHVRWMSING-YTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDR 226 (419) T ss_pred ------cccchhheeEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHH Confidence 136789999999988776 68999999999999999999999999999999999999998765 5888999 Q ss_pred HHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHH Q lcl|NC_021305. 235 LREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFY 314 (518) Q Consensus 235 ~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~ 314 (518) +++.|++.++|..|+|+++|+++|++|++++.++.|+||+|+++++.++||++|||||++||..++++++|.|++.+.|+ T Consensus 227 ~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~E~~~~~f~ 306 (419) T protein:vir:14 227 ITDGWNAKFGGSGNAKKVALLQEGMTFRPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFV 306 (419) T ss_pred HHHHHHHHhcCccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhHHHHHHHHHHHHhhhhhhcc-cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCccee Q lcl|NC_021305. 
315 RDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADEL 393 (518) Q Consensus 315 ~~~l~P~~~~ie~~l~~~l~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~ 393 (518) ++||.|++++||++|+++|+++.++ .++++||++.+++.|.+++++++.+++++|++|+||+|+++|+||++ |||++ T Consensus 307 ~~~L~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~--gGD~~ 384 (419) T protein:vir:14 307 IYTLLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVK--GGDIY 384 (419) T ss_pred HHHHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CcCee Confidence 9999999999999999999976544 57789999999999999999999999999999999999999999995 89999 Q ss_pred eecccccccccccccCCCCCCCCCCCCCccCCCCCCCcccc Q lcl|NC_021305. 394 YANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPP 434 (518) Q Consensus 394 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (518) ++|+|+++++......... +. +......+.++--. T Consensus 385 ~~~~n~~~~~~~~~~~~~~-~~-----~~~~~~~e~~~~l~ 419 (419) T protein:vir:14 385 LSPMNMVDASKPQQLPVGK-SE-----PTKAAIDEIGRILS 419 (419) T ss_pred eeccccccccccccccCCC-CC-----CccccccchhcccC Confidence 9999998876432211100 00 00000000000000 No 11 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=100.00 E-value=4.7e-95 Score=537.74 Aligned_cols=418 Identities=18% Similarity=0.240 Sum_probs=346.1 Q ss_pred CcCCCCCCC--CcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 1 MLLANGQTL--SAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) =+|+..+.. +........+++...++. ..+...++.+.++++++|++||++||++||++||++|++++++. 
T Consensus 9 ~~~~~~~r~~~~~~~~~~~~~~~~~~~g~-------~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~ 81 (432) T protein:vir:10 9 KFFNFEKRQTSQVIELNKDDEKLLEWLGI-------SPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYGI 81 (432) T ss_pred HhcCccccCcccccccCCchHHHHHHhCC-------CcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCce Confidence 145532221 222222222333222221 22345677889999999999999999999999999999988876 Q ss_pred eecc-chHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeE-Eeeec Q lcl|NC_021305. 79 TEES-DTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYE-YYFQA 156 (518) Q Consensus 79 ~~~~-~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~-~~~~~ 156 (518) ++.. |++.++|+.+||++||+++||+.++.+++++||+|++++|+..|++++||||+|++|++..+..+.... +.+.+ T Consensus 82 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~~~y 161 (432) T protein:vir:10 82 QRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMWY 161 (432) T ss_pred eeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEEE Confidence 6554 455566778999999999999999999999999999999999999999999999999998876442211 11111 Q ss_pred ccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHH Q lcl|NC_021305. 
157 GAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLR 236 (518) Q Consensus 157 ~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~ 236 (518) ....++..+.|++++|||++++.+.+..+|+||+..+..++....+++++..++|+||+.|++||++++.+++++.++++ T Consensus 162 ~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~ 241 (432) T protein:vir:10 162 VVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFR 241 (432) T ss_pred EEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHH Confidence 12245667889999999999877666788999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHH Q lcl|NC_021305. 237 EQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRD 316 (518) Q Consensus 237 ~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~ 316 (518) +.|++.++|..|+|+++|+++|++|++++.++.|+||++.+++++++||++|||||++||..+.++++|.+++.+.|+++ T Consensus 242 ~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~ 321 (432) T protein:vir:10 242 ENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTD 321 (432) T ss_pred HHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhHHHHHHHHHHHHhhhhhhc--ccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceee Q lcl|NC_021305. 
317 TMAIPIARIQSAMDKYVGQYWV--RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELY 394 (518) Q Consensus 317 ~l~P~~~~ie~~l~~~l~~~~~--~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~ 394 (518) ||.|+++.||++||++|++..+ .+++++||++.+++.|.+++++++++++++|++|+||+|+++|+||++ |||+++ T Consensus 322 ~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~--ggD~~~ 399 (432) T protein:vir:10 322 TLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEA--GGDRLL 399 (432) T ss_pred HHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CCCeEe Confidence 9999999999999999997654 357889999999999999999999999999999999999999999995 899999 Q ss_pred ecccccccccccccCCCCCCCCCCCCCccCCCCCCC Q lcl|NC_021305. 395 ANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLD 430 (518) Q Consensus 395 ~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 430 (518) +|+|++|++...+....+.. .....+...++++ T Consensus 400 ~~~n~~~~~~~~~~~~k~~~---~~~~~~~~~~~~~ 432 (432) T protein:vir:10 400 VNGNMLPIDMAGQAYLKGGD---TNGEVSKEGNEGN 432 (432) T ss_pred ecccccchhhccccccCCCC---CCCCCCCCCCCCC Confidence 99999999876543322111 1111111111111 No 12 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=100.00 E-value=4.7e-95 Score=537.74 Aligned_cols=418 Identities=18% Similarity=0.240 Sum_probs=346.1 Q ss_pred CcCCCCCCC--CcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 1 MLLANGQTL--SAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) =+|+..+.. +........+++...++. ..+...++.+.++++++|++||++||++||++||++|++++++. 
T Consensus 9 ~~~~~~~r~~~~~~~~~~~~~~~~~~~g~-------~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~ 81 (432) T protein:vir:10 9 KFFNFEKRQTSQVIELNKDDEKLLEWLGI-------SPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYGI 81 (432) T ss_pred HhcCccccCcccccccCCchHHHHHHhCC-------CcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCce Confidence 145532221 222222222333222221 22345677889999999999999999999999999999988876 Q ss_pred eecc-chHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeE-Eeeec Q lcl|NC_021305. 79 TEES-DTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYE-YYFQA 156 (518) Q Consensus 79 ~~~~-~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~-~~~~~ 156 (518) ++.. |++.++|+.+||++||+++||+.++.+++++||+|++++|+..|++++||||+|++|++..+..+.... +.+.+ T Consensus 82 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~~~y 161 (432) T protein:vir:10 82 QRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMWY 161 (432) T ss_pred eeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEEE Confidence 6554 455566778999999999999999999999999999999999999999999999999998876442211 11111 Q ss_pred ccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHH Q lcl|NC_021305. 
157 GAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLR 236 (518) Q Consensus 157 ~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~ 236 (518) ....++..+.|++++|||++++.+.+..+|+||+..+..++....+++++..++|+||+.|++||++++.+++++.++++ T Consensus 162 ~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~ 241 (432) T protein:vir:10 162 VVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFR 241 (432) T ss_pred EEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHH Confidence 12245667889999999999877666788999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHH Q lcl|NC_021305. 237 EQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRD 316 (518) Q Consensus 237 ~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~ 316 (518) +.|++.++|..|+|+++|+++|++|++++.++.|+||++.+++++++||++|||||++||..+.++++|.+++.+.|+++ T Consensus 242 ~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~ 321 (432) T protein:vir:10 242 ENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTD 321 (432) T ss_pred HHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhHHHHHHHHHHHHhhhhhhc--ccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceee Q lcl|NC_021305. 
317 TMAIPIARIQSAMDKYVGQYWV--RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELY 394 (518) Q Consensus 317 ~l~P~~~~ie~~l~~~l~~~~~--~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~ 394 (518) ||.|+++.||++||++|++..+ .+++++||++.+++.|.+++++++++++++|++|+||+|+++|+||++ |||+++ T Consensus 322 ~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~--ggD~~~ 399 (432) T protein:vir:10 322 TLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEA--GGDRLL 399 (432) T ss_pred HHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CCCeEe Confidence 9999999999999999997654 357889999999999999999999999999999999999999999995 899999 Q ss_pred ecccccccccccccCCCCCCCCCCCCCccCCCCCCC Q lcl|NC_021305. 395 ANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLD 430 (518) Q Consensus 395 ~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 430 (518) +|+|++|++...+....+.. .....+...++++ T Consensus 400 ~~~n~~~~~~~~~~~~k~~~---~~~~~~~~~~~~~ 432 (432) T protein:vir:10 400 VNGNMLPIDMAGQAYLKGGD---TNGEVSKEGNEGN 432 (432) T ss_pred ecccccchhhccccccCCCC---CCCCCCCCCCCCC Confidence 99999999876543322111 1111111111111 No 13 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=100.00 E-value=4.7e-95 Score=537.74 Aligned_cols=418 Identities=18% Similarity=0.240 Sum_probs=346.1 Q ss_pred CcCCCCCCC--CcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 1 MLLANGQTL--SAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) =+|+..+.. +........+++...++. ..+...++.+.++++++|++||++||++||++||++|++++++. 
T Consensus 9 ~~~~~~~r~~~~~~~~~~~~~~~~~~~g~-------~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~ 81 (432) T protein:vir:10 9 KFFNFEKRQTSQVIELNKDDEKLLEWLGI-------SPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYGI 81 (432) T ss_pred HhcCccccCcccccccCCchHHHHHHhCC-------CcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCce Confidence 145532221 222222222333222221 22345677889999999999999999999999999999988876 Q ss_pred eecc-chHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeE-Eeeec Q lcl|NC_021305. 79 TEES-DTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYE-YYFQA 156 (518) Q Consensus 79 ~~~~-~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~-~~~~~ 156 (518) ++.. |++.++|+.+||++||+++||+.++.+++++||+|++++|+..|++++||||+|++|++..+..+.... +.+.+ T Consensus 82 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~~~y 161 (432) T protein:vir:10 82 QRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMWY 161 (432) T ss_pred eeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEEE Confidence 6554 455566778999999999999999999999999999999999999999999999999998876442211 11111 Q ss_pred ccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHH Q lcl|NC_021305. 
157 GAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLR 236 (518) Q Consensus 157 ~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~ 236 (518) ....++..+.|++++|||++++.+.+..+|+||+..+..++....+++++..++|+||+.|++||++++.+++++.++++ T Consensus 162 ~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~ 241 (432) T protein:vir:10 162 VVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFR 241 (432) T ss_pred EEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHH Confidence 12245667889999999999877666788999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHH Q lcl|NC_021305. 237 EQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRD 316 (518) Q Consensus 237 ~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~ 316 (518) +.|++.++|..|+|+++|+++|++|++++.++.|+||++.+++++++||++|||||++||..+.++++|.+++.+.|+++ T Consensus 242 ~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~~~~~ 321 (432) T protein:vir:10 242 ENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTD 321 (432) T ss_pred HHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhHHHHHHHHHHHHhhhhhhc--ccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceee Q lcl|NC_021305. 
317 TMAIPIARIQSAMDKYVGQYWV--RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELY 394 (518) Q Consensus 317 ~l~P~~~~ie~~l~~~l~~~~~--~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~ 394 (518) ||.|+++.||++||++|++..+ .+++++||++.+++.|.+++++++++++++|++|+||+|+++|+||++ |||+++ T Consensus 322 ~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~--ggD~~~ 399 (432) T protein:vir:10 322 TLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEA--GGDRLL 399 (432) T ss_pred HHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CCCeEe Confidence 9999999999999999997654 357889999999999999999999999999999999999999999995 899999 Q ss_pred ecccccccccccccCCCCCCCCCCCCCccCCCCCCC Q lcl|NC_021305. 395 ANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLD 430 (518) Q Consensus 395 ~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 430 (518) +|+|++|++...+....+.. .....+...++++ T Consensus 400 ~~~n~~~~~~~~~~~~k~~~---~~~~~~~~~~~~~ 432 (432) T protein:vir:10 400 VNGNMLPIDMAGQAYLKGGD---TNGEVSKEGNEGN 432 (432) T ss_pred ecccccchhhccccccCCCC---CCCCCCCCCCCCC Confidence 99999999876543322111 1111111111111 No 14 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=100.00 E-value=5.5e-95 Score=537.37 Aligned_cols=418 Identities=18% Similarity=0.241 Sum_probs=348.9 Q ss_pred CcCCCCCCC--CcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 1 MLLANGQTL--SAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) =+|++.+.. +.......++++.+.+|.. .+...++.+.++++++|++||++||++||++||++|++++++. 
T Consensus 6 ~~f~~~~r~~~~~~~~~~~~~~~~~~~g~~-------~~~~~v~~~~al~~~~v~~~i~~ia~~ia~l~~~~~~~~~~~~ 78 (429) T protein:vir:10 6 KFFNFEKRQTSQVIELNKDDEKLLEWLGIS-------PSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYGI 78 (429) T ss_pred hhhcccccCcccccccCCChHHHHHHhcCC-------CCcceechhhhhccHHHHHHHHHHHHhhccCceEEEEecCCce Confidence 345543322 2222222344444433321 2345577888999999999999999999999999999988776 Q ss_pred eecc-chHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeE-Eeeec Q lcl|NC_021305. 79 TEES-DTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYE-YYFQA 156 (518) Q Consensus 79 ~~~~-~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~-~~~~~ 156 (518) ++.. |++.++|+.+||++||+++||+.++.+++++||+|++++|+..|++++|||++|++|++..+..+.... +...+ T Consensus 79 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~~~~~~~~ 158 (429) T protein:vir:10 79 QRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMWY 158 (429) T ss_pred eeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEEE Confidence 6554 455666778999999999999999999999999999999999999999999999999998876543211 11111 Q ss_pred ccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHH Q lcl|NC_021305. 
157 GAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLR 236 (518) Q Consensus 157 ~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~ 236 (518) ....++..+.|++++||||++..+.+..+|+||+..+..++....+++++..++|+||+.|+++|++++.+++++.++++ T Consensus 159 ~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~ 238 (429) T protein:vir:10 159 VVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFR 238 (429) T ss_pred EEccCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHH Confidence 22245667889999999999877767788999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHH Q lcl|NC_021305. 237 EQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRD 316 (518) Q Consensus 237 ~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~ 316 (518) +.|++.++|..|+|+++|+++|++|++++.++.|+||++++++.+++||++|||||++||..+.++++|.+++...|+++ T Consensus 239 ~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~~~~f~~~ 318 (429) T protein:vir:10 239 ENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTD 318 (429) T ss_pred HHHHHHhccccccCceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhHHHHHHHHHHHHhhhhhhc--ccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceee Q lcl|NC_021305. 
317 TMAIPIARIQSAMDKYVGQYWV--RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELY 394 (518) Q Consensus 317 ~l~P~~~~ie~~l~~~l~~~~~--~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~ 394 (518) ||.||++.|+++||++|+++.+ .+++++||++.+++.|.+++++.+.+++++|++|+||+|+++|+||++ |||+++ T Consensus 319 ~l~P~~~~ie~~ln~kl~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~--ggD~~~ 396 (429) T protein:vir:10 319 TLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEA--GGDRLL 396 (429) T ss_pred HHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CcCeee Confidence 9999999999999999998654 457899999999999999999999999999999999999999999995 899999 Q ss_pred ecccccccccccccCCCCCCCCCCCCCccCCCCCCC Q lcl|NC_021305. 395 ANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLD 430 (518) Q Consensus 395 ~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 430 (518) +|+|++|++.....+..+.. +.+ ..+.+.++++ T Consensus 397 ~~~n~~~~d~~~~~~~k~g~--~~~-~~~~~~~e~~ 429 (429) T protein:vir:10 397 VNGNMLPIDMAGQAYLKGGD--TNG-EVSKEGNEGN 429 (429) T ss_pred ecccccchhhccccccCCCC--CCC-CCCCCCCCCC Confidence 99999999865543332111 111 1111111111 No 15 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=100.00 E-value=4.9e-95 Score=537.62 Aligned_cols=411 Identities=22% Similarity=0.304 Sum_probs=347.3 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) |+|.+-..+.+........++.+.++++. 
.+.++..++.+.|+++++|++||++||++||++|+++|+.++++.++ T Consensus 1 ~~f~~~f~r~~~~~~~~~~~~~~~~~~~~----~~~~g~~v~~~~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~ 76 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVTTPAELAEAIGLSY----DTYTGKRISSQRAMRLTAVYSCVRVLAESVGMLPCSLYKISGTLKTR 76 (413) T ss_pred CccchhhccCccCCccchHHHHHhhhcCc----ccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCccee Confidence 98885543322222222223334444322 23344567788999999999999999999999999999998777665 Q ss_pred c-cchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeecccc Q lcl|NC_021305. 81 E-SDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAG 159 (518) Q Consensus 81 ~-~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~ 159 (518) . .|++.++|+.+||++||+++||+.++.+++++|++|++++|+ .|++++|||++|++|++..+.++... |.+.. T Consensus 77 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~~~~-y~~~~--- 151 (413) T protein:vir:48 77 VVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA-LGEVVELLPIDPGCVEPKLNSQWQPV-YQVTF--- 151 (413) T ss_pred ecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC-CCcEEEEEEEcCceEEEEEcCCceEE-EEEEe--- Confidence 5 455666777899999999999999999999999999999987 68999999999999999888765443 33332 Q ss_pred cCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHH Q lcl|NC_021305. 
160 VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQF 239 (518) Q Consensus 160 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~ 239 (518) .++....|++++|||+++++.++ ++|+||+..+..++....+++++..++|+||+.|++||++++.+++++.+++++.| T Consensus 152 ~~g~~~~~~~~evih~~~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~~~~e~~~~~~~~~ 230 (413) T protein:vir:48 152 PDGSVDVLTQDEIWHVRTLTLDG-LVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQKLTPDAYERLKKDF 230 (413) T ss_pred cCceEEEEccccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHH Confidence 24556789999999999988776 68999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhh Q lcl|NC_021305. 240 DRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMA 319 (518) Q Consensus 240 ~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~ 319 (518) ++.++|..|+|+++|+++|++|++++.++.|+||.+++++.+++||++|||||++||..+++|++|.+++...|+++||. T Consensus 231 ~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~i~ 310 (413) T protein:vir:48 231 EERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLV 310 (413) T ss_pred HHHhcCccccCcceecCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhhhhhcc-cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccc Q lcl|NC_021305. 320 IPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSA 398 (518) Q Consensus 320 P~~~~ie~~l~~~l~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n 398 (518) |++++|+++||++|+++.+. 
+++++||++.+++.|.+++++++++++++|++|+||+|+++|+||+| |||++++|+| T Consensus 311 P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~--ggD~~~~~~n 388 (413) T protein:vir:48 311 PYLTRIEQRINTGLVRESKQGKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRP--GGDVYLTPMN 388 (413) T ss_pred HHHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--Ccceeecccc Confidence 99999999999999976654 57789999999999999999999999999999999999999999995 8999999999 Q ss_pred ccccccccccCCCCCCCCCCCCCcc Q lcl|NC_021305. 399 LQPLGATPDGAVEWEEAPAPKRPAS 423 (518) Q Consensus 399 ~~~~~~~~~~~~~~~~~~~~~~~~~ 423 (518) ++++....+......+.+...+++. T Consensus 389 ~~~~~~~~~~~~~~~~~~~~~~~~~ 413 (413) T protein:vir:48 389 MTTSPSAGDDNGKKKESGDADKTAS 413 (413) T ss_pred ccccccccccCCCCCCCCCccccCC Confidence 9887654332211111111111111 No 16 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=100.00 E-value=2.1e-95 Score=539.61 Aligned_cols=406 Identities=20% Similarity=0.230 Sum_probs=339.3 Q ss_pred CcCC---CCCCCCccc----------ccccchhhhhhhcc------cccccccccccchhhhHHHhhcHHHHHHHHHHHH Q lcl|NC_021305. 1 MLLA---NGQTLSAPA----------MAELSPQMQDSYYY------APAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQ 61 (518) Q Consensus 1 ~~f~---~~~~~~~~~----------~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~ 61 (518) =||. +...+.+.. 
......|..+.+.+ ..+.+....++..++...++++++|++||++||+ T Consensus 2 gl~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~ci~~Ia~ 81 (431) T protein:vir:10 2 GLFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTGRETRALRNMAVLRCVTLISG 81 (431) T ss_pred cchhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCccCcceechhhhhccHHHHHHHHHHHH Confidence 1333 222221111 11111121111110 0011222334456778899999999999999999 Q ss_pred hhccCceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEE Q lcl|NC_021305. 62 ALARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAI 141 (518) Q Consensus 62 ~ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v 141 (518) +||++|+++|++++++.+...|+..++|+.+||++||+++||+.++.+++++||+|++|+|+. |.+++|+|++|.+|++ T Consensus 82 ~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g~~~~L~pl~~~~v~~ 160 (431) T protein:vir:10 82 TIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSG-NRPIRLIPMDRGSAKG 160 (431) T ss_pred hhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-CceEEEEEEcCceeEE Confidence 999999999998776666667777778888999999999999999999999999999999984 8999999999999999 Q ss_pred EEcCCceeeEEeeecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCccccc Q lcl|NC_021305. 142 KRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVL 221 (518) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il 221 (518) ..+.++.. .|.+.. 
.++..+.|++++||||++++.++ .+|+||+..+.++|....+++++..++|+||++|+||| T Consensus 161 ~~~~~~~~-~y~~~~---~~g~~~~~~~~dViHir~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil 235 (431) T protein:vir:10 161 RLTSTWQI-VYDYTT---PTGDKIELPAREVFHLRDLSIDG-VSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAI 235 (431) T ss_pred EEcCCCeE-EEEEEe---CCceEEEEchhhEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEE Confidence 88766544 344332 24567889999999999988776 68999999999999999999999999999999999999 Q ss_pred ccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccc Q lcl|NC_021305. 222 RHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRA 301 (518) Q Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 301 (518) ++++.+++++.+++++.|++.++|.+|+|+++||++|++|++++.++.|+||+|++++++++||++|||||++||+.+++ T Consensus 236 ~~~~~ls~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~ 315 (431) T protein:vir:10 236 EVPKELSDNAYGRMKASVQENHTGSENAGSWMLLEEGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTS 315 (431) T ss_pred ecCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhc-ccccceecchhhhhcCHHHHHHHHHHHHhCC----CcCHHHH Q lcl|NC_021305. 
302 TFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWV-RKNRMKFDIDDVIQPDWEAKSESTQKMVNSG----VATPNEG 376 (518) Q Consensus 302 ~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G----~~T~NE~ 376 (518) +++|.|++.+.|+++||.||+++||++||++|+++.+ .+++++||++.+++.|.+++++.+.+++..| |||+||+ T Consensus 316 t~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~lT~NE~ 395 (431) T protein:vir:10 316 WGSGIEQLAIFFIQYGLSHWFVSWEQAAARAFLPEKMLGQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPWMKQNEV 395 (431) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhcCCceEEEechhhhccCHHHHHHHHHHHHhcccccCccCHHHH Confidence 9999999999999999999999999999999998654 3678999999999999999999999998655 5999999 Q ss_pred HHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccC Q lcl|NC_021305. 377 REIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPAST 424 (518) Q Consensus 377 R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (518) |+++|+||+++|+||++++|.|+.+.+.. ++.| +.+ T Consensus 396 R~~~gl~p~~~~~gD~~~~p~n~~~~~~~-------~~~p-----~~~ 431 (431) T protein:vir:10 396 REMLDLPRADDPVADQLRNPMTQKQKGSG-------DEPP-----ATT 431 (431) T ss_pred HHHhCCCCCCCccccceecccccccCCCC-------CCCC-----CCC Confidence 99999999999999999999997654321 1111 111 No 17 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=100.00 E-value=8.5e-95 Score=536.30 Aligned_cols=401 Identities=18% Similarity=0.272 Sum_probs=337.5 Q ss_pred CcCCC---CCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_021305. 1 MLLAN---GQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDT 77 (518) Q Consensus 1 ~~f~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~ 77 (518) |+|.. +++...|+.. .+++..... 
..+.++..++.+.++++++|++||++||++||++||++|++++++ T Consensus 17 ~~~~~lf~~~~~~~~~~~-~~~~~~~~~-------~~~~~~~~vs~~~al~~~~v~~cv~~Ia~~iA~lp~~v~~~~~~~ 88 (424) T protein:vir:45 17 VLLDALFRSKSLENPSTP-ITGDAVDTD-------GLFRADVYVSPETAMKLAAVYSCIYVLSSSLAQMPLHVMRRHKGK 88 (424) T ss_pred HHHHhhccccCCCCCccc-cchhhhhhh-------ccccCCceechHHhhccHHHHHHHHHHHHHHhhCceEEEEecCCc Confidence 66553 2333444433 223221111 122334568889999999999999999999999999999987766 Q ss_pred ceecc-chHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeec Q lcl|NC_021305. 78 ETEES-DTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQA 156 (518) Q Consensus 78 ~~~~~-~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~ 156 (518) .++.. |+.+++|+.+||++||+++||+.++.+++++||+|++|+|+..|++++|+|++|..|++..+.+ .+.|.+.. T Consensus 89 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~i~~~~~--~~~y~~~~ 166 (424) T protein:vir:45 89 VEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDCCMPWETTLMNTGG--RYTYGLYN 166 (424) T ss_pred eeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEEcCC--eEEEEEEe Confidence 65554 4555667789999999999999999999999999999999999999999999999999876543 34444432 Q ss_pred ccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHH Q lcl|NC_021305. 
157 GAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLR 236 (518) Q Consensus 157 ~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~ 236 (518) .+....+++++||||+++++++ .+|+||+..+.+.|....++++++.++|+||++|++||++++.+++++.++++ T Consensus 167 ----~~~~~~~~~~eVih~r~~~~d~-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~ 241 (424) T protein:vir:45 167 ----EYGAFAISPDDMIHIRALGNNQ-KMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSGLNKESWGWLK 241 (424) T ss_pred ----cCceEEECcccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHH Confidence 1234579999999999988876 58999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhcCc-cccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHH Q lcl|NC_021305. 237 EQFDRAHSGS-SNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYR 315 (518) Q Consensus 237 ~~~~~~~~g~-~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~ 315 (518) +.|++.+.|. +|+|+++|+++|++|++++.++.|+||++++++.+++||++|||||++||+.++++++|.|++.+.|++ T Consensus 242 ~~~~~~~~g~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~ 321 (424) T protein:vir:45 242 DQWQKASQALRRQENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISAQAIQFVR 321 (424) T ss_pred HHHHHHhccccccCCceeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHH Confidence 9999999885 589999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhHHHHHHHHHHHHhhhhhhc--ccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCccee Q lcl|NC_021305. 
316 DTMAIPIARIQSAMDKYVGQYWV--RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADEL 393 (518) Q Consensus 316 ~~l~P~~~~ie~~l~~~l~~~~~--~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~ 393 (518) +||.||++.||++||++|++..+ .+++++||++.+++.|.+++++.+.+++++|++|+||+|+++|+||++ |||++ T Consensus 322 ~tL~P~~~~ie~~ln~kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~pi~--ggD~~ 399 (424) T protein:vir:45 322 YTMMPWVTNWEQELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNPVE--GLDEM 399 (424) T ss_pred HHHHHHHHHHHHHHHHhcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--Cccee Confidence 99999999999999999998654 357899999999999999999999999999999999999999999995 89999 Q ss_pred eecccccccccccccCCCCCCCCCCCCCccCCCCCCCc Q lcl|NC_021305. 394 YANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQ 431 (518) Q Consensus 394 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (518) ++|.|+.+.... ..+...+. +++++ T Consensus 400 ~~~~n~~~~~~~--------~~~~~~~~-----~~~~~ 424 (424) T protein:vir:45 400 LVSVNAANPAGD--------FKPPKNDE-----GKTNE 424 (424) T ss_pred eecccccccccc--------cCCCCCCC-----CCCCC Confidence 999998753211 00000000 00000 No 18 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=100.00 E-value=2.5e-94 Score=533.74 Aligned_cols=409 Identities=21% Similarity=0.315 Sum_probs=340.9 Q ss_pred CcCCC--CCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 1 MLLAN--GQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) +||.+ ++..+++... ..++.+.+++. ....++..++.+.++++++|++||++||++||++||++|+.++++. 
T Consensus 2 g~f~~lf~r~~~~~~~~--~~~~~~~~~~~----~~~~~g~~v~~~~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~ 75 (414) T protein:vir:44 2 VFFSGLFQRKSDAPVTT--PAELADAIGLS----YDTYTGKQISSQRAMRLTAVFSCVRVLAESVGMLPCNLYHLNGSLK 75 (414) T ss_pred chhhhhhccCccCcccc--hhhHhHhhccC----ccccCCceechhhhhccHHHHHHHHHHHHHhccCceEEEEecCCce Confidence 55552 3322332221 11222223321 2233445677789999999999999999999999999999888776 Q ss_pred eecc-chHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeecc Q lcl|NC_021305. 79 TEES-DTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAG 157 (518) Q Consensus 79 ~~~~-~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~ 157 (518) +... |+++++|+.+||++||+++||+.++.+++++|++|++++++ .|++.+|+||+|..|++..+.++.. .|.+.. T Consensus 76 ~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~~~-~y~~~~- 152 (414) T protein:vir:44 76 QRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-FGEVAELLPVDPGCVVPKLNSSWEP-VYQVTF- 152 (414) T ss_pred eecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCceEEEEECCCCcE-EEEEEe- Confidence 5554 55566777899999999999999999999999999999987 6999999999999999988776544 333332 Q ss_pred cccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHH Q lcl|NC_021305. 
158 AGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLRE 237 (518) Q Consensus 158 ~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~ 237 (518) .++....|++++||||++++.++ ++|+||+..+..++....+++++..++|+||++|+++|++++.+++++.+++++ T Consensus 153 --~~g~~~~~~~~evih~~~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~ 229 (414) T protein:vir:44 153 --PDGSTDVLSQEDIWHVRTLTLDG-LVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKK 229 (414) T ss_pred --cCceEEEEccccEEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHH Confidence 24566789999999999887776 689999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHH Q lcl|NC_021305. 238 QFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDT 317 (518) Q Consensus 238 ~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~ 317 (518) .|++.++|..|+|+++|+++|++|++++.++.|+||+|.++++.++||++|||||++||+.++++++|.+++...|+++| T Consensus 230 ~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~~~~~~~~ 309 (414) T protein:vir:44 230 DFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYS 309 (414) T ss_pred HHHHHhcCccccCcceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHHHHHhhhhhhcc-cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeec Q lcl|NC_021305. 
318 MAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYAN 396 (518) Q Consensus 318 l~P~~~~ie~~l~~~l~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~ 396 (518) |.|++++||++||++|++..++ .++++||++.+++.|.+++++.+++++++|++|+||+|+++|+||++ |||++++| T Consensus 310 l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~--ggD~~~~~ 387 (414) T protein:vir:44 310 LVPYLTRIEQRINTGLVRKSKQGVFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRP--GGDVYLTP 387 (414) T ss_pred HHHHHHHHHHHHHhhcCCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--Ccceeccc Confidence 9999999999999999987654 46789999999999999999999999999999999999999999995 89999999 Q ss_pred ccccccccccccCCCCCCCCCCCCCccCCCCCCCcccc Q lcl|NC_021305. 397 SALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPP 434 (518) Q Consensus 397 ~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (518) .|+.+..........+. +.++ ++++.. T Consensus 388 ~n~~~~~~~~~~~~~~~------~~~~-----~d~~~~ 414 (414) T protein:vir:44 388 MNMTTKPSDGSKAGKQK------DNAN-----ADETTS 414 (414) T ss_pred ccccccCCccccCCCCC------CCCC-----CCCCCC Confidence 99875532221111111 1000 111100 No 19 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=100.00 E-value=3.8e-94 Score=532.76 Aligned_cols=395 Identities=17% Similarity=0.261 Sum_probs=334.5 Q ss_pred CcCCCC-CCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANG-QTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) -.|.++ +..+..... ..| + ++. 
...++..++.+.|+++++|++||++||++||+|||++|+.+.++.+ T Consensus 22 ~~~~~~~~~~~~~~~~-~~~-~----~~~-----~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~~~~~~~~~~~ 90 (424) T protein:vir:18 22 SWFVGGRLVTPNQGSQ-TGP-V----SAH-----GHLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNR 90 (424) T ss_pred hhhccccccccccccc-ccc-c----ccc-----cccccccccHHHhhccHHHHHHHHHHHHhhccCceEEEEeecCCce Confidence 344332 222332221 112 1 111 1123445788999999999999999999999999999998765533 Q ss_pred e---ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeec Q lcl|NC_021305. 80 E---ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQA 156 (518) Q Consensus 80 ~---~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~ 156 (518) + ..|+++++|+.+||++||+++||+.++.+++++||+|++|+|+..|++++|||++|.+|++..+.+ ...|.|.. T Consensus 91 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~pl~~~~V~v~~~~~--~~~y~~~~ 168 (424) T protein:vir:18 91 KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK--KVVYRYQR 168 (424) T ss_pred eeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEcCC--eEEEEEEe Confidence 2 356666777789999999999999999999999999999999999999999999999999987643 44455432 Q ss_pred ccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCcc-CCHHHHHHH Q lcl|NC_021305. 157 GAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKR-LSEAAQQRL 235 (518) Q Consensus 157 ~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~-~~~~~~~~~ 235 (518) +++.+.|++++|||+|++++++ .+|+||+..+.+++....+++++..++|+||++|++||++++. 
+++++.+++ T Consensus 169 ----~g~~~~~~~~eIih~r~~~~dg-~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~ 243 (424) T protein:vir:18 169 ----DSEYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) T ss_pred ----CCeEEEeccccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHHHHH Confidence 4567789999999999988776 6799999999999999999999999999999999999999765 799999999 Q ss_pred HHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccccc--CCHHHHHHHH Q lcl|NC_021305. 236 REQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATF--SNISAQMRAF 313 (518) Q Consensus 236 ~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--sn~e~~~~~~ 313 (518) ++.|++.++| .|+|+++||++|++|++++.++.|+||++++++++++||++|||||++||+.+++++ +|.|++.+.| T Consensus 244 ~~~~~~~~~g-~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f 322 (424) T protein:vir:18 244 EENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF 322 (424) T ss_pred HHHHHHHhCC-cccCCceeccCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHH Confidence 9999988765 689999999999999999999999999999999999999999999999999887765 8999999999 Q ss_pred HHHHhhHHHHHHHHHHHHhhhhhhcc-cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcce Q lcl|NC_021305. 
314 YRDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADE 392 (518) Q Consensus 314 ~~~~l~P~~~~ie~~l~~~l~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~ 392 (518) +++||.||++.||++|+++|++..++ +++++||++.+++.|.+++++.+.+++++|+||+||+|+++|+||++ |||+ T Consensus 323 ~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~--gGD~ 400 (424) T protein:vir:18 323 LQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLP--GGDV 400 (424) T ss_pred HHHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CcCe Confidence 99999999999999999999987664 57899999999999999999999999999999999999999999995 8999 Q ss_pred eeecccccccccccccCCCCCCCC Q lcl|NC_021305. 393 LYANSALQPLGATPDGAVEWEEAP 416 (518) Q Consensus 393 ~~~~~n~~~~~~~~~~~~~~~~~~ 416 (518) +++++|++|++....+..+.+.+. T Consensus 401 ~~~~~n~~~l~~~~~~~~p~~~ga 424 (424) T protein:vir:18 401 AMRQSQYVPITDLGTNKEPRNNGA 424 (424) T ss_pred eeeccCccchHhhhccCCCccCCC Confidence 999999999876533211100000 No 20 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=100.00 E-value=3.2e-94 Score=533.15 Aligned_cols=409 Identities=17% Similarity=0.247 Sum_probs=345.2 Q ss_pred CcCCCCCCCCccccc-ccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANGQTLSAPAMA-ELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) =||+++...++.... ..++++.-+..+.+ ..........++...++++++|++||++||++||++|+++|+++.. 
T Consensus 6 ~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~~~v~~~~al~~~~v~~ci~~ia~~iA~lp~~~~~~~~~--- 81 (422) T protein:vir:13 6 GLFNKKNNNDEKRSNYDEDIGIDISDSNFW-EKFGIKLNFSVRGKRALKENTVYVCTKIRAESIGKLSLKIYKDKEE--- 81 (422) T ss_pred hhhhccCCccchhhhhhhccccccCcchhh-hhccccCCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEecCcc--- Confidence 256655544332211 11111110000111 1122334556888899999999999999999999999999986532 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCcee-----eEEee Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGR-----YEYYF 154 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~-----~~~~~ 154 (518) ...|+.+++|+.+||++||+++||+.++.+++++||+|++|+|+..|++++|+|++|++|++..+.++.. .+|.+ T Consensus 82 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~~~~~~~~~~~~~~~~~y~~ 161 (422) T protein:vir:13 82 YKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNVTKIIDDDNFLSSLSKVWYVV 161 (422) T ss_pred cccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCcceeccceEEEEE Confidence 3456777888889999999999999999999999999999999999999999999999999999887643 23332 Q ss_pred ecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHH Q lcl|NC_021305. 155 QAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQR 234 (518) Q Consensus 155 ~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~ 234 (518) . 
..++....+++++|||++.+.+.+..+|+||+..+..++....+++++..++|+||++|+|+|++++.+++++.++ T Consensus 162 ~---~~~g~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~ 238 (422) T protein:vir:13 162 T---DKNGKEHKLLPDEMLHFIGDITLDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVGDLDEKAKKI 238 (422) T ss_pred E---eCCCeEEEEcccceEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHH Confidence 2 2356678899999999998766666799999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHH Q lcl|NC_021305. 235 LREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFY 314 (518) Q Consensus 235 ~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~ 314 (518) +++.|++.++|.+|+|+++|+++|++|++++.++.|+||++++++.+++||++|||||++||..++++++|.+++...|+ T Consensus 239 ~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~~~~f~ 318 (422) T protein:vir:13 239 FKKEFESMSNGLENAHSISLLPFGYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQQKDFY 318 (422) T ss_pred HHHHHHHHhcCccccCCceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhHHHHHHHHHHHHhhhhhhcc--cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcce Q lcl|NC_021305. 315 RDTMAIPIARIQSAMDKYVGQYWVR--KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADE 392 (518) Q Consensus 315 ~~~l~P~~~~ie~~l~~~l~~~~~~--~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~ 392 (518) ++||.|++++||++|+++|+++.+. 
+++++||++.+++.|.+++++++++++++|++|+||+|+++|+||++ |||+ T Consensus 319 ~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~--ggD~ 396 (422) T protein:vir:13 319 VTTLQSSLTVYEQEIQDKLFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENLPPVE--GGDR 396 (422) T ss_pred HHHHHHHHHHHHHHHHHhhCChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CcCe Confidence 9999999999999999999987754 56889999999999999999999999999999999999999999995 8999 Q ss_pred eeecccccccccccccCCCCCCCCCCCC Q lcl|NC_021305. 393 LYANSALQPLGATPDGAVEWEEAPAPKR 420 (518) Q Consensus 393 ~~~~~n~~~~~~~~~~~~~~~~~~~~~~ 420 (518) +++|+|++|++.....+. +.+++.++ T Consensus 397 ~~~~~n~~~l~~~~~~~~--~~g~~~g~ 422 (422) T protein:vir:13 397 LLVNGNMIPIEMAGEQYK--KGGEKGGK 422 (422) T ss_pred eeeccCccchhhcccccc--cCCCcCCC Confidence 999999999986643221 11111111 No 21 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=100.00 E-value=6.3e-94 Score=531.55 Aligned_cols=411 Identities=18% Similarity=0.166 Sum_probs=340.7 Q ss_pred CcCCCCCC-CCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANGQT-LSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) |.|..... ++..++. .|.. ... 
...+.++.++..++.+.++++++|++||++||++||+|||++|++++++.+ T Consensus 1 m~~~~~~~~~~~~~~~---~~~~--~~~-~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~~ 74 (419) T protein:vir:57 1 MFIPQFWKGRPSENRV---NWQV--VPG-GMRSSSSQAGVIITPETALALSAVRACVTLLAESVAQLPCVLYRRTENGGR 74 (419) T ss_pred CcchhhhccCCccccc---cccc--ccc-ccccccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCce Confidence 77775432 2333222 1211 111 111233455667888999999999999999999999999999998877654 Q ss_pred e-c-cchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeecc Q lcl|NC_021305. 80 E-E-SDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAG 157 (518) Q Consensus 80 ~-~-~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~ 157 (518) + . .|++.++|+.+||++||+++||+.++.+++++|++|++|+|+..|++++|||++|++|++..+.++.. +|.+.. T Consensus 75 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v~v~~~~~g~~-~y~~~~- 152 (419) T protein:vir:57 75 EIAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPHKVIVLKGPDGMP-YYDIPS- 152 (419) T ss_pred eccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcceEEEECCCceE-EEEEcC- Confidence 3 3 44556667789999999999999999999999999999999999999999999999999988876653 333321 Q ss_pred cccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCc----cCCHHHHH Q lcl|NC_021305. 
158 AGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK----RLSEAAQQ 233 (518) Q Consensus 158 ~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~----~~~~~~~~ 233 (518) .+ ..++.++|||+++++.++ .+|+||+..+..++....++++++.++|+||++|+++|+.++ .+++++.+ T Consensus 153 ---~~--~~~~~~~vih~r~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~ 226 (419) T protein:vir:57 153 ---IG--EILPMRMVHHIKSFSLDG-YIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVD 226 (419) T ss_pred ---Cc--eEEchhhEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHH Confidence 12 358899999999987776 689999999999999999999999999999999999998854 56889999 Q ss_pred HHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHH Q lcl|NC_021305. 234 RLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAF 313 (518) Q Consensus 234 ~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~ 313 (518) ++++.|.+.++|..|+|+++|+++|++|++++.++.|+||++++++.+++||++|||||.+||..+.++++|+|++.+.| T Consensus 227 ~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f 306 (419) T protein:vir:57 227 AILAKWTERYGGVRNAFSVGMLQEGMTYKQLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNIEHQGLQY 306 (419) T ss_pred HHHHHHHHHhccccccccceecCCCceEEEcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccccHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhHHHHHHHHHHHHhhhhhhc-ccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcce Q lcl|NC_021305. 
314 YRDTMAIPIARIQSAMDKYVGQYWV-RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADE 392 (518) Q Consensus 314 ~~~~l~P~~~~ie~~l~~~l~~~~~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~ 392 (518) +++||.|+++.|+++|+++|+++.+ .+++++||++.+++.|.+++++++++++++|++|+||+|+++|+||++ |||+ T Consensus 307 ~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~--ggD~ 384 (419) T protein:vir:57 307 VIYTMLAILKRHESAMMRDLLLPSERRDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLTPIP--GGDK 384 (419) T ss_pred HHHHHHHHHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CcCe Confidence 9999999999999999999997654 367899999999999999999999999999999999999999999995 8999 Q ss_pred eeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcch Q lcl|NC_021305. 393 LYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRS 448 (518) Q Consensus 393 ~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (518) +++|+|++++....++.... +.+. +..++.++-|. T Consensus 385 ~~~~~n~~~~~~~~~~~~~~-----~~~~----------------~~~~~~~~~~~ 419 (419) T protein:vir:57 385 YLTPLNMVDSKALTGIGKAT-----PQQL----------------KDIEAILCTRN 419 (419) T ss_pred eeeccccccccccccccCCC-----cccC----------------cchhhhhhccC Confidence 99999998875543322111 1100 00011111111 No 22 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=100.00 E-value=7e-94 Score=531.29 Aligned_cols=400 Identities=20% Similarity=0.281 Sum_probs=340.7 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) =+|+..+. ..++.....+++..++|. 
..++.+.++++++|++||++||++||++||++|++++++.++ T Consensus 6 ~~~~~~~~-~~~~~~~~~~~~~~~~g~-----------~~~~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~~~~ 73 (411) T protein:vir:81 6 RLTRFFRP-RNETVDMTNPLLLQWLGV-----------DPDTPRNQLSEATYFACLKILSESLGKLPLKMYQKTERGIVK 73 (411) T ss_pred HHHhhccC-cccccccchHHHHHHhcC-----------cccChhhhhccHHHHHHHHHHHHhHhhCceeEEEecCCceee Confidence 12222221 122222223443322221 124567789999999999999999999999999998887665 Q ss_pred c-cchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceee---EEeeec Q lcl|NC_021305. 81 E-SDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRY---EYYFQA 156 (518) Q Consensus 81 ~-~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~---~~~~~~ 156 (518) . .|+++++|+.+||++||+++||+.++.+++++||+|++++|+ .|++.+|||++|+.|++..+..+... .+.|.+ T Consensus 74 ~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~-~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~ 152 (411) T protein:vir:81 74 SDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS-GPQLQALWILPSQYVTIVVDDRGLLGEKNAIWYRY 152 (411) T ss_pred ecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCceEEEEEECCceEEEEEcCcccccccceEEEEE Confidence 5 455566777899999999999999999999999999999998 68999999999999999988765321 122333 Q ss_pred ccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHH Q lcl|NC_021305. 
157 GAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLR 236 (518) Q Consensus 157 ~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~ 236 (518) ....++..+.|++++|||++++.+.+..+|+||+..+..++....+++++..++|+||+.|+|+|++++.+++++.++++ T Consensus 153 ~~~~~g~~~~~~~~eiih~k~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~ 232 (411) T protein:vir:81 153 NDPYDGKMYVFRNDEILHFKTSVTFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGDLNQEARDRLV 232 (411) T ss_pred EecCCceEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHH Confidence 34456778889999999999776656679999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHH Q lcl|NC_021305. 237 EQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRD 316 (518) Q Consensus 237 ~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~ 316 (518) +.|++.++|.+|+|+++|+++|++|++++.++.|+||+|++++..++||++|||||++||+.+.+|++|.+++.+.|+++ T Consensus 233 ~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~ 312 (411) T protein:vir:81 233 KGFEQFANGSKNAGKIIPVPLGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQNLAFYVD 312 (411) T ss_pred HHHHHHhcCccccCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhHHHHHHHHHHHHhhhhhhc--ccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceee Q lcl|NC_021305. 
317 TMAIPIARIQSAMDKYVGQYWV--RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELY 394 (518) Q Consensus 317 ~l~P~~~~ie~~l~~~l~~~~~--~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~ 394 (518) ||.|+++.||++|+++|++..+ .+++++||++.+++.|.+++++.+.+++++|++|+||+|+++|+||+| |||+++ T Consensus 313 ~l~P~~~~ie~~l~~~ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~~--ggD~~~ 390 (411) T protein:vir:81 313 TLLYVLKQYEEEITYKILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPADD--YGNNLM 390 (411) T ss_pred HHHHHHHHHHHHHHhhcCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CCCeee Confidence 9999999999999999998654 467899999999999999999999999999999999999999999995 899999 Q ss_pred ecccccccccccccCCCCCCC Q lcl|NC_021305. 395 ANSALQPLGATPDGAVEWEEA 415 (518) Q Consensus 395 ~~~n~~~~~~~~~~~~~~~~~ 415 (518) +++|++|++....+...+.++ T Consensus 391 ~~~n~~pl~~~~~~~~kgGd~ 411 (411) T protein:vir:81 391 ANGNYIPLSMLGANYGKGGDS 411 (411) T ss_pred eccCccchhhhhhhhccCCCC Confidence 999999997654322111111 No 23 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=100.00 E-value=1.7e-93 Score=529.17 Aligned_cols=396 Identities=17% Similarity=0.261 Sum_probs=333.0 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce- Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET- 79 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~- 79 (518) -+|..++. ..|....... 
++++ ....++..++.+.|+++++|++||++||++||+|||++|+...++.+ T Consensus 22 ~~f~~~~~-~~~~~~~~~~----~~~~-----~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~vy~~~~~~~~~ 91 (424) T protein:vir:18 22 SWFVGGRL-VTPNQGSQTG----PVSA-----HGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK 91 (424) T ss_pred hhcccccc-ccccchhhcc----cccc-----ccccccccccHHHhhccHHHHHHHHHHHHhhccCceEEEEeccCCcee Confidence 33433221 1221111111 1111 11122345778899999999999999999999999999998765533 Q ss_pred e--ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeecc Q lcl|NC_021305. 80 E--ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAG 157 (518) Q Consensus 80 ~--~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~ 157 (518) + ..|+++++|+.+||++||+++||+.++.+++++||+|++|+|+..|++++|||++|.+|++..+.+ ...|.+.. T Consensus 92 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~--~~~y~~~~- 168 (424) T protein:vir:18 92 KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK--KVVYRYQR- 168 (424) T ss_pred eeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEcCC--eEEEEEEe- Confidence 2 356666778889999999999999999999999999999999999999999999999999987643 44454432 Q ss_pred cccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCcc-CCHHHHHHHH Q lcl|NC_021305. 158 AGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKR-LSEAAQQRLR 236 (518) Q Consensus 158 ~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~-~~~~~~~~~~ 236 (518) ++..+.|+++||||+|+++.++ .+|+||+..+...+....+++++..++|+||+.|+++|++++. 
+++++.++++ T Consensus 169 ---~g~~~~~~~~eVihir~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~ 244 (424) T protein:vir:18 169 ---DSEYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVE 244 (424) T ss_pred ---CCeEEEeccccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCCHHHHHHHH Confidence 4567789999999999988776 6899999999999999999999999999999999999999875 7999999999 Q ss_pred HHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccccc--CCHHHHHHHHH Q lcl|NC_021305. 237 EQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATF--SNISAQMRAFY 314 (518) Q Consensus 237 ~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--sn~e~~~~~~~ 314 (518) +.|++.+++ .|+|+++||++|++|++++.++.|+||++++++++++||++|||||++||+.+++++ +|.|++.+.|+ T Consensus 245 ~~~~~~~~~-~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~ 323 (424) T protein:vir:18 245 ENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) T ss_pred HHHHHHhCC-cccCCceeccCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHH Confidence 999987655 788999999999999999999999999999999999999999999999999887765 89999999999 Q ss_pred HHHhhHHHHHHHHHHHHhhhhhhcc-cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCccee Q lcl|NC_021305. 315 RDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADEL 393 (518) Q Consensus 315 ~~~l~P~~~~ie~~l~~~l~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~ 393 (518) ++||.||+++||++||++|++..+. 
.++++||++.+++.|.+++++.+.+++++|+||+||+|+++|+||++ |||++ T Consensus 324 ~~tl~P~~~~ie~~ln~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~--ggD~~ 401 (424) T protein:vir:18 324 QYTLQPYISRWENSIQRWLIPSKDVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNMPPLP--GGDVA 401 (424) T ss_pred HHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CcCee Confidence 9999999999999999999987654 57899999999999999999999999999999999999999999995 89999 Q ss_pred eecccccccccccccCCCCCCCCCCCCCcc Q lcl|NC_021305. 394 YANSALQPLGATPDGAVEWEEAPAPKRPAS 423 (518) Q Consensus 394 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~ 423 (518) ++++|++|++...+...+.+.+ . T Consensus 402 ~~~~n~~~l~~~~~~~~~~~n~-------a 424 (424) T protein:vir:18 402 MRQAQYVPITDLGTNKEPRNNG-------A 424 (424) T ss_pred eeccCccchhhhhccCCccccC-------C Confidence 9999999987653321100000 0 No 24 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=100.00 E-value=1.1e-93 Score=530.26 Aligned_cols=412 Identities=18% Similarity=0.247 Sum_probs=345.3 Q ss_pred CcCCCCCCC-CcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANGQTL-SAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) |+|++-..+ -.+.....++|+...++.. 
++.++..++.+.++++++|++||++||++||++||++|++++++.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~g~~-----~s~~~~~v~~~~al~~~~v~~cv~~ia~~ia~lp~~~~~~~~~~~~ 75 (419) T protein:vir:80 1 MFFSRQLLSNLGQTQPGSGGWVSALLGSA-----RSEAGQVVTPASALSLTVLQNCVTLLAESIAQLPVELYERSGDDRK 75 (419) T ss_pred CCcccccccccCcCCCCcchhhHHhhccc-----ccccCcccChHHhhccHHHHHHHHHHHHhhccCceEEEEecCCCcc Confidence 888864333 2334444577877666533 2345567888999999999999999999999999999999888766 Q ss_pred ecc-chHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccc Q lcl|NC_021305. 80 EES-DTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGA 158 (518) Q Consensus 80 ~~~-~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~ 158 (518) +.. |+..++|+.+||++||+++||+.++.+++++||+|++++|+..|++.+||||+|++|++..+.++... |.+. . T Consensus 76 ~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~~~v~i~~~~~~~~~-y~~~--~ 152 (419) T protein:vir:80 76 PATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDNEAVTVMKGPDLKPM-YRVA--G 152 (419) T ss_pred cccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCceEE-EEEc--C Confidence 654 55556677899999999999999999999999999999999999999999999999999988765433 3321 1 Q ss_pred ccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCcc----CCHHHHHH Q lcl|NC_021305. 159 GVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKR----LSEAAQQR 234 (518) Q Consensus 159 ~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~----~~~~~~~~ 234 (518) . ..++.++|+|+++++.++ .+|+||+..+...|....+++++..++|+||+.|+++|++++. 
.++++.++ T Consensus 153 ---~--~~~~~~~i~h~~~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~ 226 (419) T protein:vir:80 153 ---A--DPLPQRLVHHVRWMSING-YTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDR 226 (419) T ss_pred ---c--cccchhheEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHH Confidence 1 247899999999988776 6899999999999999999999999999999999999998754 46888999 Q ss_pred HHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHH Q lcl|NC_021305. 235 LREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFY 314 (518) Q Consensus 235 ~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~ 314 (518) +++.|++.++|..|+|+++|+++|++|++++.++.|+||++.+++..++||++|||||++||..+++|++|.|++.+.|+ T Consensus 227 ~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~~f~ 306 (419) T protein:vir:80 227 ITDGWNAKFGGSGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFV 306 (419) T ss_pred HHHHHHHHhcCccccCCceecCCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhHHHHHHHHHHHHhhhhhhc-ccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCccee Q lcl|NC_021305. 
315 RDTMAIPIARIQSAMDKYVGQYWV-RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADEL 393 (518) Q Consensus 315 ~~~l~P~~~~ie~~l~~~l~~~~~-~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~ 393 (518) ++||.|+++.||++|+++|+++.+ ..++++||++.+++.|.+++++.+++++++|++|+||+|+++|+||++ |||++ T Consensus 307 ~~~l~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~--gGD~~ 384 (419) T protein:vir:80 307 IYTLLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVK--GGDIY 384 (419) T ss_pred HHHHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--Cccee Confidence 999999999999999999997654 367789999999999999999999999999999999999999999995 89999 Q ss_pred eecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhc Q lcl|NC_021305. 394 YANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQ 461 (518) Q Consensus 394 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (518) ++|+|+++++.....+. .+++ +.++ ..++.++... T Consensus 385 ~~~~n~~~~~~~~~~~~-----~~~~-~~~~---------------------------~~~~~~~~l~ 419 (419) T protein:vir:80 385 LSPMNMVDASKPQPIPM-----GKTE-PTKA---------------------------ALDEIGRILS 419 (419) T ss_pred eeccccccccccccccC-----CCCC-chhh---------------------------hHHHHHhhcC Confidence 99999887543221100 0000 0000 0011111111 No 25 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=100.00 E-value=2.5e-93 Score=528.22 Aligned_cols=422 Identities=21% Similarity=0.295 Sum_probs=338.9 Q ss_pred CcCCCCC--CCCcccc--------cccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEE Q lcl|NC_021305. 
1 MLLANGQ--TLSAPAM--------AELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKC 70 (518) Q Consensus 1 ~~f~~~~--~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v 70 (518) |=-++.+ .+.+++. ...++++.+. +.+..+.++..++.+.++++++|++||++||++||+|||++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~~~s~~~~~~~~~-----~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~lp~~~ 75 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGVPISLTDGSFWSA-----WGGMGSSSGETVTADSALQLSAVWSCVRLIAETIATLPLNL 75 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCCcccCCchhHHHh-----hcccccCCCceechHhhhccHHHHHHHHHHHHHHhhCceeE Confidence 4311111 1111111 1112222222 22333445566888999999999999999999999999999 Q ss_pred EEecCCccee--ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCce Q lcl|NC_021305. 71 MFTSGDTETE--ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTG 148 (518) Q Consensus 71 ~~~~~~~~~~--~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~ 148 (518) |+++.++.++ ..|+++++|+.+||++||+++||+.++.+++++||+|++|+|+ .|++++|||++|+.|++..+.++. T Consensus 76 ~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~-~g~~~~L~~l~p~~v~i~~~~~g~ 154 (437) T protein:vir:10 76 YQTKPDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRS-AGVLIGLELMLPQRTTVKRLTSGA 154 (437) T ss_pred EEEcCCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEECCCCe Confidence 9988766433 3455667778899999999999999999999999999999998 599999999999999998877654 Q ss_pred eeEEeeecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCC Q lcl|NC_021305. 149 RYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLS 228 (518) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~ 228 (518) .. |.+.. 
.++....|++++|||||+++.++ .+|+||+..+..++....+++++..++|+||++|++||++++.++ T Consensus 155 ~~-y~~~~---~~g~~~~~~~~dIih~r~~~~d~-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~ 229 (437) T protein:vir:10 155 LQ-YTYRN---VDGTVSTLAEDDVFHVRGFSLDG-LMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQILQ 229 (437) T ss_pred EE-EEEEe---cCceEEEEccccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCC Confidence 33 33322 24567789999999999988766 689999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccccc--CCH Q lcl|NC_021305. 229 EAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATF--SNI 306 (518) Q Consensus 229 ~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--sn~ 306 (518) +++.+++++.|++.++|..|+|+++||++|++|++++.++.|+||+++++++.++||++|||||++||+.+++++ +|. T Consensus 230 ~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~ 309 (437) T protein:vir:10 230 KEKRAEIRTDLAEQFGGAMQAGKTMVLEAGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGI 309 (437) T ss_pred HHHHHHHHHHHHHHhcCccccCcceeccCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchH Confidence 999999999999999999999999999999999999999999999999999999999999999999999887654 899 Q ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcc-cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Q lcl|NC_021305. 
307 SAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRS 385 (518) Q Consensus 307 e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~ 385 (518) +++.+.|+++||.||+..||++|+++|+++.++ .++++||++.+++.|.+++++++.+++++|++|+||+|+++|+||+ T Consensus 310 e~~~~~f~~~tl~P~~~~ie~~l~~kll~~~e~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi 389 (437) T protein:vir:10 310 EQQTLGFLTFTLRPWLTRIEQAARRSLLRPGERDQFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTRDECRAKENLPPM 389 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 999999999999999999999999999987554 4678999999999999999999999999999999999999999999 Q ss_pred CCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCcccccc Q lcl|NC_021305. 386 DDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSP 442 (518) Q Consensus 386 ~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 442 (518) + ++++.++++.|++|++.........+ .....+.++. ++++ .....+. T Consensus 390 ~-gg~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~---~~~~----~~~~~e~ 437 (437) T protein:vir:10 390 G-GNAAVLTVQSALLPIDKLGEHTTATA-AQDALKAWLY---QEEK----TRATQER 437 (437) T ss_pred C-CCcceEeecCcccchhhccCcCCCcc-hhccccccCC---CCCC----CCccccC Confidence 7 34455678999999876433221111 0000000000 0000 0001010 No 26 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=100.00 E-value=3.4e-93 Score=527.54 Aligned_cols=416 Identities=21% Similarity=0.305 Sum_probs=334.5 Q ss_pred CcCCCCCC---CCcccccccchhhhh-hhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_021305. 1 MLLANGQT---LSAPAMAELSPQMQD-SYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGD 76 (518) Q Consensus 1 ~~f~~~~~---~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~ 76 (518) ++|++-++ ++.|.....+..+.. 
...+....+..+.++..++.+.++++++|++||++||++||+|||++|+++.+ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~~ 87 (432) T protein:vir:10 8 GLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLTMYMRTPD 87 (432) T ss_pred chhhhhHhhcCCccccccccccccccCcchhhhhcccccccCcccchhhhhcchHHHHHHHHHHHhhhhCceeEEEecCC Confidence 55654332 122211110010000 00111112233456677889999999999999999999999999999998877 Q ss_pred cceec-cchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeee Q lcl|NC_021305. 77 TETEE-SDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQ 155 (518) Q Consensus 77 ~~~~~-~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~ 155 (518) +.++. .|+.+++|+.+||++||+++||+.++.+++++||+|++++++ .|++.+||||+|+.|++..+.++.. .|.+. T Consensus 88 g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~~~~v~v~~~~~g~~-~y~~~ 165 (432) T protein:vir:10 88 GRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTKGNT-AYRYR 165 (432) T ss_pred CcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEEcCCCcE-EEEEE Confidence 76554 455566777899999999999999999999999999999997 5899999999999999998877654 34433 Q ss_pred cccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHH Q lcl|NC_021305. 156 AGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRL 235 (518) Q Consensus 156 ~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~ 235 (518) . 
.++..+.|++++|||+++++.++ .+|+||+..+.+.+....+++++..++|+||++|++|+++++.+++++++++ T Consensus 166 ~---~~g~~~~~~~~~iih~~~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~ 241 (432) T protein:vir:10 166 R---TDGQMIDIPKQQIWKIMGYSLDG-ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSF 241 (432) T ss_pred e---cCceEEEEcCccEEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHH Confidence 2 34677899999999999888776 6899999999999999999999999999999999999999999999998888 Q ss_pred HHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccccc---CCHHHHHHH Q lcl|NC_021305. 236 REQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATF---SNISAQMRA 312 (518) Q Consensus 236 ~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~---sn~e~~~~~ 312 (518) ++.|. |..|+|+++||++|++|++++.++.|+||++++++++++||++|||||++||+.+.+++ +|.|++.+. T Consensus 242 ~~~~~----~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~~~~ 317 (432) T protein:vir:10 242 AKKVS----GSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLG 317 (432) T ss_pred HHHHh----hhhhCCCceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHHHHH Confidence 77764 56788999999999999999999999999999999999999999999999999876554 789999999 Q ss_pred HHHHHhhHHHHHHHHHHHHhhhhhhcc-cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcc Q lcl|NC_021305. 
313 FYRDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKAD 391 (518) Q Consensus 313 ~~~~~l~P~~~~ie~~l~~~l~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD 391 (518) |+++||.||++.||++|+++|+++.++ .++++||++.+++.|.+++++++.+++++|+||+||+|+++|+||++ ++++ T Consensus 318 f~~~tl~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi~-g~~~ 396 (432) T protein:vir:10 318 FLSMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLG-GNAA 396 (432) T ss_pred HHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC-CCcc Confidence 999999999999999999999986553 57889999999999999999999999999999999999999999997 3456 Q ss_pred eeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccC Q lcl|NC_021305. 392 ELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPT 435 (518) Q Consensus 392 ~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (518) .++++.|++|++...... .+.+. +..+++.+.+... T Consensus 397 ~~~~~~~~~pl~~~~~~~-----~~~~~---~~~~~~~~~~~~~ 432 (432) T protein:vir:10 397 VLTVQSAMVPLDSIGLQA-----SPEPA---SGLGNQQQDKVSK 432 (432) T ss_pred eEeecCcccchhhhcccC-----CCCCC---CCCCCcccccccC Confidence 677899999987653211 11111 1111111110000 No 27 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=100.00 E-value=5e-93 Score=526.59 Aligned_cols=416 Identities=21% Similarity=0.298 Sum_probs=334.6 Q ss_pred CcCCCCCC---CCcccccccchhhhh-hhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_021305. 1 MLLANGQT---LSAPAMAELSPQMQD-SYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGD 76 (518) Q Consensus 1 ~~f~~~~~---~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~ 76 (518) ++|++-+. ++.|........+.. 
...+....+..+.++..++.+.++++++|++||++||++||+|||++|+++.+ T Consensus 8 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~a~~~~aV~~~v~~Ia~~ia~lp~~~y~~~~~ 87 (432) T protein:vir:97 8 GLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAVAAMPLMMYMRTPD 87 (432) T ss_pred chhhhhHhhcCCccccccccccccccCchhhhhhcccccccCcccchHhhhcchHHHHHHHHHHHhhccCceEEEEecCC Confidence 56655332 122211111111110 00111112233455677889999999999999999999999999999998877 Q ss_pred cceec-cchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeee Q lcl|NC_021305. 77 TETEE-SDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQ 155 (518) Q Consensus 77 ~~~~~-~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~ 155 (518) +.++. .|+.+++|+.+||++||+++||+.++.+++++||+|++++++ .|++.+||||+|+.|++..+.++. ..|.+. T Consensus 88 g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~p~~v~v~~~~~g~-~~y~~~ 165 (432) T protein:vir:97 88 GRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTKGN-TAYRYR 165 (432) T ss_pred CcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEEcCCCc-EEEEEE Confidence 76554 455666777899999999999999999999999999999997 589999999999999999887665 344443 Q ss_pred cccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHH Q lcl|NC_021305. 156 AGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRL 235 (518) Q Consensus 156 ~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~ 235 (518) . 
.++..+.+++++|||+|+++.++ .+|+||+..+.+.+....+++++..++|+||++|++||++++.+++++++++ T Consensus 166 ~---~~g~~~~~~~~~iih~r~~~~dg-~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~ 241 (432) T protein:vir:97 166 R---TDGQMIDIPRQQIWKIMGYSLDG-ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSF 241 (432) T ss_pred e---cCceEEEEccccEEEecCcCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCCCCCHHHHHHH Confidence 2 34677889999999999888776 6899999999999999999999999999999999999999999999998877 Q ss_pred HHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccccc---CCHHHHHHH Q lcl|NC_021305. 236 REQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATF---SNISAQMRA 312 (518) Q Consensus 236 ~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~---sn~e~~~~~ 312 (518) ++.| .|..|+|+++||++|++|++++.++.|+||+|++++++++||++|||||++||..+.+++ +|.|++.+. T Consensus 242 ~~~~----~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~~~~ 317 (432) T protein:vir:97 242 SKKV----SGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLG 317 (432) T ss_pred HHHH----hhhhcCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHHHHHH Confidence 7665 456789999999999999999999999999999999999999999999999999876654 788999999 Q ss_pred HHHHHhhHHHHHHHHHHHHhhhhhhcc-cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcc Q lcl|NC_021305. 
313 FYRDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKAD 391 (518) Q Consensus 313 ~~~~~l~P~~~~ie~~l~~~l~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD 391 (518) |+++||.||++.||++|+++|+++.++ .++++||++.+++.|.+++++++.+++++|++|+||+|+++|+||++ ++++ T Consensus 318 f~~~tl~P~~~~ie~~ln~kLl~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~~-g~~~ 396 (432) T protein:vir:97 318 FLTMTLSPWLRRIEQSIALNLLTPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLG-GNAA 396 (432) T ss_pred HHHHHHHHHHHHHHHHHhhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC-CCcc Confidence 999999999999999999999986554 56899999999999999999999999999999999999999999996 3455 Q ss_pred eeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccC Q lcl|NC_021305. 392 ELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPT 435 (518) Q Consensus 392 ~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (518) .++++.|++|++....+.. ++++.+ ..++.+.+... T Consensus 397 ~~~~~~~~~pl~~~~~~~~-----~~~~~~---~~~~~~~~~~~ 432 (432) T protein:vir:97 397 VLTVQSAMVPLDSIGLQAS-----PEPASG---LGNQQQDKVSK 432 (432) T ss_pred eEeecccccchhhhcccCC-----CCCCCC---CCCcccccccC Confidence 6678999999876533211 111111 11111111100 No 28 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=100.00 E-value=1.1e-92 Score=524.80 Aligned_cols=416 Identities=21% Similarity=0.299 Sum_probs=335.7 Q ss_pred CcCCCCCC---CCcccccccchhhhhhhc-ccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_021305. 1 MLLANGQT---LSAPAMAELSPQMQDSYY-YAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGD 76 (518) Q Consensus 1 ~~f~~~~~---~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~ 76 (518) -||++.+. 
++.+.....+..+....+ ........+.++..++.+.|+++++|++||++||++||+|||++|+++++ T Consensus 8 g~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~~y~~~~~ 87 (432) T protein:vir:81 8 GLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLTMYMRTPD 87 (432) T ss_pred chhhhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHHHHHhhhhCceeeEEecCC Confidence 46665332 122211100000110001 11112234556677889999999999999999999999999999998877 Q ss_pred cceec-cchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeee Q lcl|NC_021305. 77 TETEE-SDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQ 155 (518) Q Consensus 77 ~~~~~-~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~ 155 (518) +.++. .|+++++|+.+||++||+++||+.++.+++++||+|++++++ .|++.+||||+|+.|++..+.++.. .|.+. T Consensus 88 g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-~g~~~~L~~l~~~~v~v~~~~~g~~-~y~~~ 165 (432) T protein:vir:81 88 GRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDPKGNT-AYRYR 165 (432) T ss_pred cceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEECCCCcE-EEEEE Confidence 76654 455566777899999999999999999999999999999987 5899999999999999998876643 34433 Q ss_pred cccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHH Q lcl|NC_021305. 156 AGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRL 235 (518) Q Consensus 156 ~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~ 235 (518) . 
.++..+.+++++|||+++++.++ .+|+||+..+.++|....+++++..++|+||++|++|+++++.+++++++++ T Consensus 166 ~---~~g~~~~~~~~~iih~r~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~ 241 (432) T protein:vir:81 166 R---TDGQMIDIPKQQIWKIMGYSLDG-ENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSF 241 (432) T ss_pred e---cCceEEEEccccEEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHH Confidence 2 34677889999999999988777 6899999999999999999999999999999999999999999999998888 Q ss_pred HHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccccc---CCHHHHHHH Q lcl|NC_021305. 236 REQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATF---SNISAQMRA 312 (518) Q Consensus 236 ~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~---sn~e~~~~~ 312 (518) ++.| .|..|+|+++||++|++|++++.++.|+||++.+++++++||++|||||++||+.+.+++ +|.||+.+. T Consensus 242 ~~~~----~~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~ 317 (432) T protein:vir:81 242 AKKV----SGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLG 317 (432) T ss_pred HHHH----hhhhcCCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHHH Confidence 8776 456788999999999999999999999999999999999999999999999999877654 788999999 Q ss_pred HHHHHhhHHHHHHHHHHHHhhhhhhcc-cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcc Q lcl|NC_021305. 313 FYRDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKAD 391 (518) Q Consensus 313 ~~~~~l~P~~~~ie~~l~~~l~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD 391 (518) |+++||.||++.||++|+++|+++.+. 
.++++||++.+++.|.+++++++.+++++|++|+||+|+++|+||++ ++++ T Consensus 318 f~~~tl~P~~~~ie~~l~~kLl~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~-g~~~ 396 (432) T protein:vir:81 318 FLTMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLG-GNAA 396 (432) T ss_pred HHHHHHHHHHHHHHHHHHhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC-CCcc Confidence 999999999999999999999986543 57899999999999999999999999999999999999999999997 3567 Q ss_pred eeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCc Q lcl|NC_021305. 392 ELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSV 437 (518) Q Consensus 392 ~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (518) .++++.|++|++....... +++... ..++.++.. .. T Consensus 397 ~~~~~~~~~pl~~~~~~~~-----~~~~~~---~~n~~~~~~--~~ 432 (432) T protein:vir:81 397 VLTVQSAMVPLDSIGLQAS-----PEPASG---LGNQQQDKV--SK 432 (432) T ss_pred eEeecCcccchhhhccCCC-----CCCCCC---CCCcccccc--cC Confidence 7779999999876533211 111100 001111110 00 No 29 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=100.00 E-value=1.3e-91 Score=518.89 Aligned_cols=412 Identities=18% Similarity=0.240 Sum_probs=332.5 Q ss_pred Cc----CCC--CCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_021305. 1 ML----LAN--GQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTS 74 (518) Q Consensus 1 ~~----f~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~ 74 (518) |- |.. +++...+. ...+.|+.... 
+....+...++...++++++|++||++||++||++|+++|+++ T Consensus 23 ~~~~~~f~~~e~r~~~~~~-~~~~~~~~~~~------~~~~~~~~~~~~~~al~~~~V~acv~~Ia~~iA~lpl~~~~~~ 95 (441) T protein:vir:98 23 LVVVGIFYKNEKRDLQYNE-DDLQMMVQTLP------GFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNG 95 (441) T ss_pred hhccccccccccccccCCC-cchHHHHHHhh------cccccCccccchhhhhccHHHHHHHHHHHHhhccCceEEecCC Confidence 22 221 11111111 11122222111 1222233457778899999999999999999999999999643 Q ss_pred CCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEee Q lcl|NC_021305. 75 GDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYF 154 (518) Q Consensus 75 ~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~ 154 (518) .....|+.+++|+.+||++||+++||+.++.+++++||+|++|+|+..|++++|+|++|+.|++..+.++...++.+ T Consensus 96 ---~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~ 172 (441) T protein:vir:98 96 ---QINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKLDARGRLYYFHQ 172 (441) T ss_pred ---cccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCceeEEEECCCCcEEEEEE Confidence 44556777888889999999999999999999999999999999999999999999999999999988776665554 Q ss_pred ecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCC-HHHHH Q lcl|NC_021305. 
155 QAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLS-EAAQQ 233 (518) Q Consensus 155 ~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~-~~~~~ 233 (518) .......+....+++++|||+++++.++ .+|+||+..+.+++....+++++..++|+||++|+|||++++.++ +++.+ T Consensus 173 ~~~~~~~~~~~~~~~~dviHir~~~~dg-~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~~~~e~~~ 251 (441) T protein:vir:98 173 RIDSNGNNIERNVKFEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARD 251 (441) T ss_pred EeccCcceeeEEEccccEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHH Confidence 4444445667889999999999887776 689999999999999999999999999999999999999999875 67789 Q ss_pred HHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHH Q lcl|NC_021305. 234 RLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAF 313 (518) Q Consensus 234 ~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~ 313 (518) ++++.|++.++|..|+|+++||++|++|++++.+++|+||++.+++++++||++|||||++||.... .++.+++...| T Consensus 252 ~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~--~~s~~q~~~~y 329 (441) T protein:vir:98 252 RAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA--NMSITDANLDY 329 (441) T ss_pred HHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCC--CccHHHHHHHH Confidence 9999999999999999999999999999999999999999999999999999999999999986332 34556666655 Q ss_pred HHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCccee Q lcl|NC_021305. 314 YRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADEL 393 (518) Q Consensus 314 ~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~ 393 (518) . +||.||+++||++||++|++.. 
.+++++||++.+++.|.+++++++++++++|++|+||+|+++|+||++++.++++ T Consensus 330 ~-~tl~P~~~~ie~~ln~~L~~~~-~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGd~~~~ 407 (441) T protein:vir:98 330 L-STLKPYITCVCAELNFKFNDEY-VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIH 407 (441) T ss_pred H-HHHHHHHHHHHHHHHhhccccc-cCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceE Confidence 5 6999999999999999998765 3568999999999999999999999999999999999999999999986555688 Q ss_pred eecccccccccccccCCCCCCCCCCCCCccCCCCCCCc Q lcl|NC_021305. 394 YANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQ 431 (518) Q Consensus 394 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (518) ++|+|++|++.....+....... . ....++++++ T Consensus 408 ~~~~n~~~~~~~~~~q~~~~~~~-~---~~~kgGe~ne 441 (441) T protein:vir:98 408 RVDLNHVNIELVDEYQMNKSRAT-D---KKLKGGEENE 441 (441) T ss_pred eeccccccccccccccccccccc-c---cccCCCCCCC Confidence 99999999987644332111110 0 0011111111 No 30 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=100.00 E-value=1.6e-91 Score=518.39 Aligned_cols=412 Identities=19% Similarity=0.244 Sum_probs=331.2 Q ss_pred Cc----CCCC--CCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_021305. 1 ML----LANG--QTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTS 74 (518) Q Consensus 1 ~~----f~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~ 74 (518) |. |... |+...+. 
...+.|+....++ +......++...++++++|++||++||++||++|+++|+++ T Consensus 23 ~~~~~lf~~~e~R~~~~~~-~~~~~~~~~~~~~------~~~~~~~~~~~~al~~~~V~~cv~~Ia~~iA~lp~~~~~~~ 95 (441) T protein:vir:94 23 LVVVGIFYKNEKRDLQYNE-DDLQMMVQTLPGF------QGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNG 95 (441) T ss_pred hhccccccccccccccCCC-cchHHHHHHhccc------CcccccccchhhhhccHHHHHHHHHHHHhhccCceeeecCc Confidence 22 2211 1111111 1122333222221 11223356677899999999999999999999999998643 Q ss_pred CCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEee Q lcl|NC_021305. 75 GDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYF 154 (518) Q Consensus 75 ~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~ 154 (518) .....|+.+++|+.+||++||+++||+.++.+++++||+|++|+|+..|++++|+|++|+.|++..+.++...++.+ T Consensus 96 ---~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~g~~~~~~~ 172 (441) T protein:vir:94 96 ---QINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQ 172 (441) T ss_pred ---cccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEE Confidence 44556777788889999999999999999999999999999999999999999999999999999988776665554 Q ss_pred ecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccC-CHHHHH Q lcl|NC_021305. 
155 QAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRL-SEAAQQ 233 (518) Q Consensus 155 ~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~-~~~~~~ 233 (518) .......+....+++++||||++++.++ ++|+||+..+.++|....+++++..++|+||++|+|||++++.+ ++++.+ T Consensus 173 ~~~~~~~~~~~~~~~~dvih~k~~~~dg-~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e 251 (441) T protein:vir:94 173 RIDSNGNNIERNVKFEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARD 251 (441) T ss_pred EeccCCceeEEEEccccEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHH Confidence 4444445667789999999999887776 68999999999999999999999999999999999999999987 467789 Q ss_pred HHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHH Q lcl|NC_021305. 234 RLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAF 313 (518) Q Consensus 234 ~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~ 313 (518) ++++.|++.++|..|+|+++||++|++|++++.+++|+||++.+++++++||++|||||++||... .+ ++.+++... T Consensus 252 ~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~-~s~~q~~~~- 328 (441) T protein:vir:94 252 RAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET-AN-MSITDANLD- 328 (441) T ss_pred HHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC-CC-ccHHHHHHH- Confidence 999999999999999999999999999999999999999999999999999999999999998643 23 345665554 Q ss_pred HHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCccee Q lcl|NC_021305. 314 YRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADEL 393 (518) Q Consensus 314 ~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~ 393 (518) +.+||.|++++||++||++|++.. 
.+++++||++.+++.|.+++++++++++++|++|+||+|+++|+||++++.++++ T Consensus 329 ~~~tl~P~~~~ie~eln~kl~~~~-~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~ 407 (441) T protein:vir:94 329 YLSTLKPYITCVCAELNFKFNDEY-VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIH 407 (441) T ss_pred HHHHHHHHHHHHHHHHhhhccccc-cCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceE Confidence 557999999999999999998764 4578999999999999999999999999999999999999999999986556679 Q ss_pred eecccccccccccccCCCCCCCCCCCCCccCCCCCCCc Q lcl|NC_021305. 394 YANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQ 431 (518) Q Consensus 394 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (518) ++++|++|++.....+...... +. ....++++++ T Consensus 408 ~~~~n~~~~~~~~~~~~~~~~~-~~---~~~kgGe~~e 441 (441) T protein:vir:94 408 RVDLNHVNIELVDEYQMNKSRA-TD---KKLKGGEENE 441 (441) T ss_pred eecccccccccccccccccccc-cc---cccCCCCCCC Confidence 9999999998764322211110 00 0001111111 No 31 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=100.00 E-value=1.6e-91 Score=518.39 Aligned_cols=412 Identities=19% Similarity=0.244 Sum_probs=331.2 Q ss_pred Cc----CCCC--CCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_021305. 1 ML----LANG--QTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTS 74 (518) Q Consensus 1 ~~----f~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~ 74 (518) |. |... |+...+. 
...+.|+....++ +......++...++++++|++||++||++||++|+++|+++ T Consensus 23 ~~~~~lf~~~e~R~~~~~~-~~~~~~~~~~~~~------~~~~~~~~~~~~al~~~~V~~cv~~Ia~~iA~lp~~~~~~~ 95 (441) T protein:vir:79 23 LVVVGIFYKNEKRDLQYNE-DDLQMMVQTLPGF------QGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNG 95 (441) T ss_pred hhccccccccccccccCCC-cchHHHHHHhccc------CcccccccchhhhhccHHHHHHHHHHHHhhccCceeeecCc Confidence 22 2211 1111111 1122333222221 11223356677899999999999999999999999998643 Q ss_pred CCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEee Q lcl|NC_021305. 75 GDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYF 154 (518) Q Consensus 75 ~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~ 154 (518) .....|+.+++|+.+||++||+++||+.++.+++++||+|++|+|+..|++++|+|++|+.|++..+.++...++.+ T Consensus 96 ---~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~g~~~~~~~ 172 (441) T protein:vir:79 96 ---QINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQ 172 (441) T ss_pred ---cccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEE Confidence 44556777788889999999999999999999999999999999999999999999999999999988776665554 Q ss_pred ecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccC-CHHHHH Q lcl|NC_021305. 
155 QAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRL-SEAAQQ 233 (518) Q Consensus 155 ~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~-~~~~~~ 233 (518) .......+....+++++||||++++.++ ++|+||+..+.++|....+++++..++|+||++|+|||++++.+ ++++.+ T Consensus 173 ~~~~~~~~~~~~~~~~dvih~k~~~~dg-~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e 251 (441) T protein:vir:79 173 RIDSNGNNIERNVKFEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARD 251 (441) T ss_pred EeccCCceeEEEEccccEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHH Confidence 4444445667789999999999887776 68999999999999999999999999999999999999999987 467789 Q ss_pred HHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHH Q lcl|NC_021305. 234 RLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAF 313 (518) Q Consensus 234 ~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~ 313 (518) ++++.|++.++|..|+|+++||++|++|++++.+++|+||++.+++++++||++|||||++||... .+ ++.+++... T Consensus 252 ~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~-~s~~q~~~~- 328 (441) T protein:vir:79 252 RAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIET-AN-MSITDANLD- 328 (441) T ss_pred HHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC-CC-ccHHHHHHH- Confidence 999999999999999999999999999999999999999999999999999999999999998643 23 345665554 Q ss_pred HHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCccee Q lcl|NC_021305. 314 YRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADEL 393 (518) Q Consensus 314 ~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~ 393 (518) +.+||.|++++||++||++|++.. 
.+++++||++.+++.|.+++++++++++++|++|+||+|+++|+||++++.++++ T Consensus 329 ~~~tl~P~~~~ie~eln~kl~~~~-~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~ 407 (441) T protein:vir:79 329 YLSTLKPYITCVCAELNFKFNDEY-VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIH 407 (441) T ss_pred HHHHHHHHHHHHHHHHhhhccccc-cCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceE Confidence 557999999999999999998764 4578999999999999999999999999999999999999999999986556679 Q ss_pred eecccccccccccccCCCCCCCCCCCCCccCCCCCCCc Q lcl|NC_021305. 394 YANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQ 431 (518) Q Consensus 394 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (518) ++++|++|++.....+...... +. ....++++++ T Consensus 408 ~~~~n~~~~~~~~~~~~~~~~~-~~---~~~kgGe~~e 441 (441) T protein:vir:79 408 RVDLNHVNIELVDEYQMNKSRA-TD---KKLKGGEENE 441 (441) T ss_pred eecccccccccccccccccccc-cc---cccCCCCCCC Confidence 9999999998764322211110 00 0001111111 No 32 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=100.00 E-value=1.3e-90 Score=513.36 Aligned_cols=413 Identities=18% Similarity=0.226 Sum_probs=334.5 Q ss_pred CcCCCCCCCCc-ccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANGQTLSA-PAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) =||.+...+.. .....+..|+...+++ .......++...++++++|++||++||+++|++||++++++ .. 
T Consensus 2 g~f~~~~~r~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~---~~ 72 (416) T protein:vir:81 2 GIFYKNEKRDLQYNEDDLQMMVQTLPGF------QGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNG---QI 72 (416) T ss_pred CcccccccccccCCCcchhHHHHHhccc------cccCccccchhhhhcchHHHHHHHHHHHhhccCceEEecCc---cc Confidence 23433221111 1111122333332222 22233456677899999999999999999999999998643 44 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeecccc Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAG 159 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~ 159 (518) ...|+.+++|+.+||++||+++||+.++.+++++||+|++++|+..|++++|||++|++|++..+.++...++....... T Consensus 73 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~ 152 (416) T protein:vir:81 73 NYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSN 152 (416) T ss_pred cccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEecCC Confidence 56677788888999999999999999999999999999999999999999999999999999998877665554444444 Q ss_pred cCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCC-HHHHHHHHHH Q lcl|NC_021305. 160 VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLS-EAAQQRLREQ 238 (518) Q Consensus 160 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~-~~~~~~~~~~ 238 (518) ..+....|++++|||||+++.++ .+|+||+..+.+++....+++++..++|+||+.|++||++++.++ +++.+++++. 
T Consensus 153 ~~~~~~~~~~~evihir~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~ 231 (416) T protein:vir:81 153 GNNIERNVKFEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREE 231 (416) T ss_pred CceeEEEEccccEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHH Confidence 45566789999999999887766 689999999999999999999999999999999999999998874 6778999999 Q ss_pred HHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHh Q lcl|NC_021305. 239 FDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTM 318 (518) Q Consensus 239 ~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l 318 (518) |++.++|..|+|+++||++|++|++++.+++|+||++.+++++++||++|||||+++|.... + ++.+++.. ++.+|| T Consensus 232 ~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~-~~~~~~~~-~~~~~l 308 (416) T protein:vir:81 232 FHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA-N-MSITDANL-DYLSTL 308 (416) T ss_pred HHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC-C-ccHHHHHH-HHHHHH Confidence 99999999999999999999999999999999999999999999999999999999986332 2 34555554 456799 Q ss_pred hHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccc Q lcl|NC_021305. 319 AIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSA 398 (518) Q Consensus 319 ~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n 398 (518) .|++++||++||++|++.. 
.+++++||++.+++.|.+++++++.+++++|++|+||+|+++|+||++++.++++++++| T Consensus 309 ~P~~~~ie~~ln~~l~~~~-~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n 387 (416) T protein:vir:81 309 KPYITCVCAELNFKFNDEY-VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLN 387 (416) T ss_pred HHHHHHHHHHHhhhccccc-cCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeeccc Confidence 9999999999999998765 457899999999999999999999999999999999999999999999777779999999 Q ss_pred ccccccccccCCCCCCCCCCCCCccCCCCCCCc Q lcl|NC_021305. 399 LQPLGATPDGAVEWEEAPAPKRPASTPVASLDQ 431 (518) Q Consensus 399 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (518) ++|++.....+........ ....++++++ T Consensus 388 ~~~~~~~~~~~~~~~~~~~----~~~kgGe~n~ 416 (416) T protein:vir:81 388 HVNIELVDEYQMNKSRATD----KKLKGGEENE 416 (416) T ss_pred ccccccccccCcccccccc----cccCCCCCCC Confidence 9999876443322111110 0011111111 No 33 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=100.00 E-value=1.3e-90 Score=513.36 Aligned_cols=413 Identities=18% Similarity=0.226 Sum_probs=334.5 Q ss_pred CcCCCCCCCCc-ccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANGQTLSA-PAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) =||.+...+.. .....+..|+...+++ .......++...++++++|++||++||+++|++||++++++ .. 
T Consensus 2 g~f~~~~~r~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~---~~ 72 (416) T protein:vir:45 2 GIFYKNEKRDLQYNEDDLQMMVQTLPGF------QGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNG---QI 72 (416) T ss_pred CcccccccccccCCCcchhHHHHHhccc------cccCccccchhhhhcchHHHHHHHHHHHhhccCceEEecCc---cc Confidence 23433221111 1111122333332222 22233456677899999999999999999999999998643 44 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeecccc Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAG 159 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~ 159 (518) ...|+.+++|+.+||++||+++||+.++.+++++||+|++++|+..|++++|||++|++|++..+.++...++....... T Consensus 73 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~ 152 (416) T protein:vir:45 73 NYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDSN 152 (416) T ss_pred cccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEecCC Confidence 56677788888999999999999999999999999999999999999999999999999999998877665554444444 Q ss_pred cCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCC-HHHHHHHHHH Q lcl|NC_021305. 160 VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLS-EAAQQRLREQ 238 (518) Q Consensus 160 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~-~~~~~~~~~~ 238 (518) ..+....|++++|||||+++.++ .+|+||+..+.+++....+++++..++|+||+.|++||++++.++ +++.+++++. 
T Consensus 153 ~~~~~~~~~~~evihir~~~~d~-~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~ 231 (416) T protein:vir:45 153 GNNIERNVKFEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREE 231 (416) T ss_pred CceeEEEEccccEEEeccCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHH Confidence 45566789999999999887766 689999999999999999999999999999999999999998874 6778999999 Q ss_pred HHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHh Q lcl|NC_021305. 239 FDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTM 318 (518) Q Consensus 239 ~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l 318 (518) |++.++|..|+|+++||++|++|++++.+++|+||++.+++++++||++|||||+++|.... + ++.+++.. ++.+|| T Consensus 232 ~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~-~~~~~~~~-~~~~~l 308 (416) T protein:vir:45 232 FHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA-N-MSITDANL-DYLSTL 308 (416) T ss_pred HHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC-C-ccHHHHHH-HHHHHH Confidence 99999999999999999999999999999999999999999999999999999999986332 2 34555554 456799 Q ss_pred hHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccc Q lcl|NC_021305. 319 AIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSA 398 (518) Q Consensus 319 ~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n 398 (518) .|++++||++||++|++.. 
.+++++||++.+++.|.+++++++.+++++|++|+||+|+++|+||++++.++++++++| T Consensus 309 ~P~~~~ie~~ln~~l~~~~-~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n 387 (416) T protein:vir:45 309 KPYITCVCAELNFKFNDEY-VNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLN 387 (416) T ss_pred HHHHHHHHHHHhhhccccc-cCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeeccc Confidence 9999999999999998765 457899999999999999999999999999999999999999999999777779999999 Q ss_pred ccccccccccCCCCCCCCCCCCCccCCCCCCCc Q lcl|NC_021305. 399 LQPLGATPDGAVEWEEAPAPKRPASTPVASLDQ 431 (518) Q Consensus 399 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (518) ++|++.....+........ ....++++++ T Consensus 388 ~~~~~~~~~~~~~~~~~~~----~~~kgGe~n~ 416 (416) T protein:vir:45 388 HVNIELVDEYQMNKSRATD----KKLKGGEENE 416 (416) T ss_pred ccccccccccCcccccccc----cccCCCCCCC Confidence 9999876443322111110 0011111111 No 34 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=100.00 E-value=9e-91 Score=514.25 Aligned_cols=408 Identities=16% Similarity=0.228 Sum_probs=338.9 Q ss_pred CcCCCCCCCCcc-cccccchhhhhhhcc-cccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 1 MLLANGQTLSAP-AMAELSPQMQDSYYY-APAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~~~~~~~~~-~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) |=|-.+...-+- ...-...|+....+. 
....++...+...++.+.++++|+|++||++||++||++||++|++++ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~lp~~~~~~~~--- 77 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLIDNWIDQSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYK--- 77 (412) T ss_pred CccchhhhhhhhhhhhHhhhhhcccccccccccccCCccccccchhhhhccHHHHHHHHHHHHhHhhCceeEeeccc--- Confidence 777655221110 111123343222221 112233334556678889999999999999999999999999998653 Q ss_pred eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccc Q lcl|NC_021305. 79 TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGA 158 (518) Q Consensus 79 ~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~ 158 (518) ...|+..++|+.+||++||+++||+.++.+++++||+|++++|+..|++.+|+|++|+.|++..+.++..++|.+... T Consensus 78 -~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~- 155 (412) T protein:vir:26 78 -VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHAA- 155 (412) T ss_pred -cccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcC- Confidence 345667778888999999999999999999999999999999999999999999999999999998888777766433 Q ss_pred ccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHH Q lcl|NC_021305. 159 GVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQ 238 (518) Q Consensus 159 ~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~ 238 (518) ++..+.|++++||||+++++.+..+|+||+.++...+....+++++. ++.++..++++++.++.+++++.+++++. 
T Consensus 156 --~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~a~~~~~--~~~~~~~~~~i~~~~~~l~~e~~~~~~~~ 231 (412) T protein:vir:26 156 --TGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSNVGKEKRQQVLED 231 (412) T ss_pred --CceEEEEccccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCceEEecCCCCCHHHHHHHHHH Confidence 35567899999999999877777899999999999999999998884 56666677888899999999999999999 Q ss_pred HHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHh Q lcl|NC_021305. 239 FDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTM 318 (518) Q Consensus 239 ~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l 318 (518) |++.+. ++|+++|+++|++|++++.++.|+||++++++++++||++|||||.+||..++++++|.|++.+.|+++|| T Consensus 232 ~~~~~~---~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l 308 (412) T protein:vir:26 232 FKQYYE---ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTL 308 (412) T ss_pred HHHHhh---cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHH Confidence 998764 57889999999999999999999999999999999999999999999999989999999999999999999 Q ss_pred hHHHHHHHHHHHHhhhhhhcc--cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeec Q lcl|NC_021305. 319 AIPIARIQSAMDKYVGQYWVR--KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYAN 396 (518) Q Consensus 319 ~P~~~~ie~~l~~~l~~~~~~--~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~ 396 (518) .|++++|+++||++|++..+. 
+++++||++.+++.|.+++++.+++++++|++|+||+|+++|+||+| |||+++++ T Consensus 309 ~P~~~~ie~~ln~kLl~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~--ggD~~~~~ 386 (412) T protein:vir:26 309 LPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVE--GGDKPLIS 386 (412) T ss_pred HHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CcCeeeec Confidence 999999999999999987654 57899999999999999999999999999999999999999999995 89999999 Q ss_pred ccccccccccccCCCCCCCCCCCCCccCCCCCCC Q lcl|NC_021305. 397 SALQPLGATPDGAVEWEEAPAPKRPASTPVASLD 430 (518) Q Consensus 397 ~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 430 (518) +|++|++.....+.....+++..+ ++ T Consensus 387 ~n~~~~~~~~~~~~~~~gG~~n~~--------e~ 412 (412) T protein:vir:26 387 GDLYPIDTPLELRKSLKGGDKNVN--------ES 412 (412) T ss_pred ccccccccchhhcccccCCCCCcC--------CC Confidence 999998764333221111111100 00 No 35 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=100.00 E-value=1e-90 Score=513.92 Aligned_cols=408 Identities=21% Similarity=0.290 Sum_probs=337.0 Q ss_pred CcC-CCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc- Q lcl|NC_021305. 1 MLL-ANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE- 78 (518) Q Consensus 1 ~~f-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~- 78 (518) |=| ++...++.......+.|+... ...+....+...+....++++|+|++||++||++||++|+++|+++.++. 
T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~~~dg~~ 76 (423) T protein:vir:81 1 MGFLQKLGLAPSVVATPEPIELVGP----IFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQLQAFERVEDGGR 76 (423) T ss_pred CchhHhhccccccccCccccccccc----cccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEEecCCce Confidence 432 221111222122222233221 11122222223356777899999999999999999999999998876664 Q ss_pred eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCC--CceEEEEeeCCceeEEEEcCCc-eeeEEeee Q lcl|NC_021305. 79 TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS--GTPEKLMPMHPSRVAIKRNSRT-GRYEYYFQ 155 (518) Q Consensus 79 ~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~--G~~~~l~~l~p~~v~v~~~~~~-~~~~~~~~ 155 (518) ++..+|.++.|+.+||++||+++||+.++.+++++||+|+++.|+.. +.++.|+|+++..+++....++ ..+.|.+. T Consensus 77 ~~~~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~~~~~v~~~~~~~~~~~~~Y~~~ 156 (423) T protein:vir:81 77 ERVREGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDIRPIPVSWVQRRAYKDGWGSLDYIII 156 (423) T ss_pred eeeccchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEEeecccceeeeeeccCCCcceEEEEE Confidence 45567777788889999999999999999999999999999999753 4678899999998887765443 45566665 Q ss_pred cccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-----ccCCHH Q lcl|NC_021305. 
156 AGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-----KRLSEA 230 (518) Q Consensus 156 ~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-----~~~~~~ 230 (518) .....++..+.+++++|||++.+++++..+|+||+..+.+++....+++++..++|+||+.|+++|+++ +.++++ T Consensus 157 ~~~~~~g~~~~~~~~evih~r~~~~~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e 236 (423) T protein:vir:81 157 ESGDNDGRSVKVPGERVIHRHGYNPKTMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAE 236 (423) T ss_pred EecCCCceEEEEcccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHH Confidence 555567788899999999999999999889999999999999999999999999999999999999765 358999 Q ss_pred HHHHHHHHHHHHhc-CccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHH Q lcl|NC_021305. 231 AQQRLREQFDRAHS-GSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQ 309 (518) Q Consensus 231 ~~~~~~~~~~~~~~-g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~ 309 (518) +.+++++.|++.++ +..|+|+++||++|++|+++++++.|+||++++++..++||++|||||+++|+.++++++|.|++ T Consensus 237 ~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn~e~~ 316 (423) T protein:vir:81 237 SRTRFMANLRASFSPKSSDVGGTLLLEDGMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSNVREF 316 (423) T ss_pred HHHHHHHHHHHHhccccccCCcceecCCCceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCcccHHHH Confidence 99999999999985 56789999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhHHHHHHHHHHHHhhhhhhc---ccccceecchhhhhcCHHHHHHHHHHHHh-CCCcCHHHHHHHhCCCCC Q lcl|NC_021305. 310 MRAFYRDTMAIPIARIQSAMDKYVGQYWV---RKNRMKFDIDDVIQPDWEAKSESTQKMVN-SGVATPNEGREIMGLPRS 385 (518) Q Consensus 310 ~~~~~~~~l~P~~~~ie~~l~~~l~~~~~---~~~~~~fd~~~l~~~d~~~~~~~~~~~~~-~G~~T~NE~R~~~g~~p~ 385 (518) .+.|+++||.|++..||++|+++|++..+ .+++++||.+.+++.|.+++++++.+++. 
+||+|+||+|+++|+||+ T Consensus 317 ~~~f~~~~L~P~~~~ie~~l~~~L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R~~~gl~p~ 396 (423) T protein:vir:81 317 RKALYGDNLGSWIRIIQDVMNLFLLPRVGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVRAMDNLPSI 396 (423) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhcCccccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHHHHhCCCCC Confidence 99999999999999999999999998754 46789999999999999999999999885 699999999999999999 Q ss_pred CCCCcceeeecccccccccccccCCCCCCCCCCCCCccC Q lcl|NC_021305. 386 DDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPAST 424 (518) Q Consensus 386 ~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (518) + |||++++|.|+.+.+.... +++..++ T Consensus 397 ~--gGD~~~~p~n~~~~~~~~~----------~~~~~~t 423 (423) T protein:vir:81 397 D--GGDDLARPLNTEFGDSEDA----------PGEEVET 423 (423) T ss_pred C--CcceeecccccccCccCCC----------CCCCCCC Confidence 5 8999999999877543211 1111111 No 36 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=100.00 E-value=2.1e-90 Score=512.20 Aligned_cols=403 Identities=17% Similarity=0.217 Sum_probs=328.6 Q ss_pred CcCCC--CCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 1 MLLAN--GQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) =||.+ ++..++......+.+... 
+ .....++..++.+.++++++|++||++||++||++||++|++++++ T Consensus 2 gl~~~~f~~~~~~~~~~~~~~~~~~--~-----~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~- 73 (409) T protein:vir:84 2 SLFTRIFSGPSEERTLTKISGIPSP--A-----EDWAMHGDRPGANSAMTLGAFYACVTLLADTVASLSIDAYRKKDNV- 73 (409) T ss_pred chhhhhhcCCCcccccccccccccc--c-----chhhccCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCc- Confidence 13331 111111121111111111 0 1111223456778899999999999999999999999999987654 Q ss_pred eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEE-EcCCCceEEEEeeCCceeEEEEcCCceeeEEeeecc Q lcl|NC_021305. 79 TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQ-KNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAG 157 (518) Q Consensus 79 ~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~-r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~ 157 (518) +...|+.+++|+.+||++||+++||+.++.+++++||+|++|. ++..|++.+||||+|++|++....+.....+.+.+. T Consensus 74 ~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~~~~~~~~~~~~~~~~ 153 (409) T protein:vir:84 74 RIPVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDCIHVTDAKDEDGDWIEPVYR 153 (409) T ss_pred ccccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCceeEEEEcCCCcceEEEEEec Confidence 3445777777888999999999999999999999999999986 688899999999999999988655444333332221 Q ss_pred cccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHH Q lcl|NC_021305. 
158 AGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLRE 237 (518) Q Consensus 158 ~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~ 237 (518) .++ ..|++++|||++++++++..+|+||+..+..++....++++++.++|+||++|+|||++++.+++++.+++++ T Consensus 154 --~~g--~~~~~~dvih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~ 229 (409) T protein:vir:84 154 --IDG--KVVPNHRIMHIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSDADLTPDQVKQTQK 229 (409) T ss_pred --CCc--eEEchhhEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHH Confidence 122 3588999999999999988899999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccccc--CCHHHHHHHHHH Q lcl|NC_021305. 238 QFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATF--SNISAQMRAFYR 315 (518) Q Consensus 238 ~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--sn~e~~~~~~~~ 315 (518) .|.+.+ .|+|+++||++|++|++++.++.|+||++++++++++||++|||||++||..+.+++ +|.|++.+.|++ T Consensus 230 ~~~~~~---~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~ 306 (409) T protein:vir:84 230 QWIQSH---HNRRLPAVMSAGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVR 306 (409) T ss_pred HHHHHh---ccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHH Confidence 998876 467899999999999999999999999999999999999999999999998876664 889999999999 Q ss_pred HHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeee Q lcl|NC_021305. 316 DTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYA 395 (518) Q Consensus 316 ~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~ 395 (518) +||.||++.||++|+++|. 
.+++++||++.+++.|.+++++++.+++++|++|+||+|+++|+||++ |||++++ T Consensus 307 ~~l~P~~~~ie~~l~~~L~----~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~--ggD~~~~ 380 (409) T protein:vir:84 307 HTLLPWLRCIEQALDTFLP----RGQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPIP--EGDIHLQ 380 (409) T ss_pred HHHHHHHHHHHHHHHHhcc----CCCeEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--Ccceeee Confidence 9999999999999999883 467899999999999999999999999999999999999999999995 7999999 Q ss_pred cccccccccccccCCCCCCCCCCCCCccCCCCC Q lcl|NC_021305. 396 NSALQPLGATPDGAVEWEEAPAPKRPASTPVAS 428 (518) Q Consensus 396 ~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (518) |+|+++++.....+...++. ..+.++.+. T Consensus 381 ~~n~~~~~~~~~~~~~~~~~----~~~~~~gn~ 409 (409) T protein:vir:84 381 PMNFVPLGYVPPEEPAQEPQ----PNSATEGNK 409 (409) T ss_pred cccccccccCCccccCcCCC----CCCccCCCC Confidence 99999988654432221111 111111111 No 37 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=100.00 E-value=1.8e-90 Score=512.56 Aligned_cols=402 Identities=16% Similarity=0.238 Sum_probs=334.2 Q ss_pred CcCCCCCCCCcccccccchhhhhhh-cccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSY-YYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) =+|++..+ .. .+.|+..+. +....+++.......++.+.|+++++|++||++||++||++||++|+++. 
T Consensus 5 ~~~~~~~~----~~--~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~V~~ci~~Ia~~ia~lp~~~~~~~~---- 74 (409) T protein:vir:93 5 NIVTRIKK----KL--IDNWIDQSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYK---- 74 (409) T ss_pred chhhhhhh----hh--hhhhhccccccccccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeeccc---- Confidence 23333221 11 112221111 11112222233345677888999999999999999999999999998663 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeecccc Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAG 159 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~ 159 (518) ...|++.++|+.+||++||+++||+.++.+++++||+|+++.|+..|++.+|||++|+.|++..+.++..+.|.+... T Consensus 75 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~~-- 152 (409) T protein:vir:93 75 VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHAA-- 152 (409) T ss_pred cccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcC-- Confidence 335667778889999999999999999999999999999999999999999999999999999988887777766432 Q ss_pred cCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHH Q lcl|NC_021305. 160 VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQF 239 (518) Q Consensus 160 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~ 239 (518) ++..+.|++++|||++++++.+..+|+||+.++...+....+++++. 
++.++..++++++.++.+++++.+++++.| T Consensus 153 -~g~~~~~~~~eVih~r~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~ 229 (409) T protein:vir:93 153 -TGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSNVGKEKRQQVLEDF 229 (409) T ss_pred -CceEEEEccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCceEEecCCCCCHHHHHHHHHHH Confidence 35667899999999998877777899999999999999999998884 566666678888999999999999999999 Q ss_pred HHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhh Q lcl|NC_021305. 240 DRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMA 319 (518) Q Consensus 240 ~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~ 319 (518) ++.+. ++|+++|+++|++|++++.++.|+||+|++++++++||++|||||++||..++++++|.|++.+.|+++||. T Consensus 230 ~~~~~---~~g~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~ 306 (409) T protein:vir:93 230 KQYYE---ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLL 306 (409) T ss_pred HHHhh---cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHH Confidence 98764 678899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhhhhhcc--cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecc Q lcl|NC_021305. 320 IPIARIQSAMDKYVGQYWVR--KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANS 397 (518) Q Consensus 320 P~~~~ie~~l~~~l~~~~~~--~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~ 397 (518) |+++.||++|+++|+++.+. 
+++++||++.+++.|.+++++++++++++|++|+||+|+++|+||++ |||++++++ T Consensus 307 P~~~~ie~~l~~~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~--ggD~~~~~~ 384 (409) T protein:vir:93 307 PIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVE--GGDKPLISG 384 (409) T ss_pred HHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CcCeeeecc Confidence 99999999999999987654 57899999999999999999999999999999999999999999995 899999999 Q ss_pred cccccccccccCCCCCCCCCCCCCc Q lcl|NC_021305. 398 ALQPLGATPDGAVEWEEAPAPKRPA 422 (518) Q Consensus 398 n~~~~~~~~~~~~~~~~~~~~~~~~ 422 (518) |++|++.....+.....+++..+.+ T Consensus 385 n~~~~~~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:93 385 DLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred cccccccchhhcccccCCCCCcCCC Confidence 9999976544332211111111000 No 38 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=100.00 E-value=2.3e-90 Score=511.97 Aligned_cols=402 Identities=16% Similarity=0.233 Sum_probs=334.9 Q ss_pred CcCCCCCCCCcccccccchhhhhhhccc-ccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYA-PAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) =+|++.++ ..+..|+..+++.. ..+.+...+...++.+.|+++++|++||++||++||+|||++|+++. 
T Consensus 5 ~~~~~~k~------~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~~~~---- 74 (409) T protein:vir:94 5 NIVTRIKK------KLIDNWIDQSASKLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYK---- 74 (409) T ss_pred ccchhhhh------HHhhhhhcCCcccccccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeeccc---- Confidence 22332221 11233332222211 11122223345577888999999999999999999999999998654 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeecccc Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAG 159 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~ 159 (518) ...|+..++|+.+||++||+++||+.++.+++++||+|++++|+..|.+.+|||++|+.|++..+.++..++|.+... T Consensus 75 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~-- 152 (409) T protein:vir:94 75 VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHAA-- 152 (409) T ss_pred ccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEcC-- Confidence 335667778889999999999999999999999999999999999999999999999999999988887777766432 Q ss_pred cCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHH Q lcl|NC_021305. 160 VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQF 239 (518) Q Consensus 160 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~ 239 (518) ++..+.|++++|||||++++.+..+|+||+..+...+....+++++. 
++.++..++++++.++.+++++.+++++.| T Consensus 153 -~g~~~~~~~~dvih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~ 229 (409) T protein:vir:94 153 -TGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSNVGKEKRQQVLEDF 229 (409) T ss_pred -CceEEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCeeEEecCCCCCHHHHHHHHHHH Confidence 35667899999999998877777899999999999999999998885 555666677899999999999999999999 Q ss_pred HHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhh Q lcl|NC_021305. 240 DRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMA 319 (518) Q Consensus 240 ~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~ 319 (518) ++.+. ++|+++|+++|++|++++.++.|+||++.++++.++||++|||||++||..++++++|.|++.+.|+++||. T Consensus 230 ~~~~~---~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~ 306 (409) T protein:vir:94 230 KQYYE---ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLL 306 (409) T ss_pred HHHhh---cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHH Confidence 98874 678999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhhhhhcc--cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecc Q lcl|NC_021305. 320 IPIARIQSAMDKYVGQYWVR--KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANS 397 (518) Q Consensus 320 P~~~~ie~~l~~~l~~~~~~--~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~ 397 (518) |+++.||++||++|+++.+. 
+++++||++.+++.|.+++++++++++++|++|+||+|+++|+||+| |||++++++ T Consensus 307 P~~~~ie~~ln~~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~--ggD~~~~~~ 384 (409) T protein:vir:94 307 PIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVE--GGDKPLISG 384 (409) T ss_pred HHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CcCeEeecc Confidence 99999999999999987654 57899999999999999999999999999999999999999999995 899999999 Q ss_pred cccccccccccCCCCCCCCCCCCCc Q lcl|NC_021305. 398 ALQPLGATPDGAVEWEEAPAPKRPA 422 (518) Q Consensus 398 n~~~~~~~~~~~~~~~~~~~~~~~~ 422 (518) |++|++.....+...+.+.+.++.+ T Consensus 385 n~~~~~~~~~~~~~~kGG~~n~~e~ 409 (409) T protein:vir:94 385 DLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred cccccccchhhcccccCCCCCcCCC Confidence 9999976543332111111111000 No 39 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=100.00 E-value=2.7e-90 Score=511.64 Aligned_cols=402 Identities=16% Similarity=0.231 Sum_probs=332.2 Q ss_pred CcCCCCCCCCcccccccchhhhhhhccc-ccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYA-PAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) =+|++.++. -++.|+..+.+.. ....+...+...++.+.|+++++|++||++||++||+|||++|+++. 
T Consensus 5 ~~~~~~k~~------~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~V~~ci~~ia~~ia~lp~~~~~~~~---- 74 (409) T protein:vir:96 5 NIVTRIKKK------LIDNWIDQSASKLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYK---- 74 (409) T ss_pred cchhhhhhH------HhhhhhccccccccccccccCccccccchhhHhhhHHHHHHHHHHHHhhhhCceEEeeccc---- Confidence 234432221 1112221111110 01112222334577788999999999999999999999999998653 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeecccc Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAG 159 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~ 159 (518) ...|++.++|+.+||++||+++||+.++.+++++||+|++|+|+..|.+.+|||++|+.|++..+.++...+|.+.. T Consensus 75 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~--- 151 (409) T protein:vir:96 75 VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIHA--- 151 (409) T ss_pred ccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEEc--- Confidence 34567777888999999999999999999999999999999999999999999999999999998888777776543 Q ss_pred cCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHH Q lcl|NC_021305. 160 VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQF 239 (518) Q Consensus 160 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~ 239 (518) .++..+.|++++||||+++++.+..+|+||+..+...+....+++++. 
++.++..++++++.++.+++++++++++.| T Consensus 152 ~~g~~~~~~~~evih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~ 229 (409) T protein:vir:96 152 ATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSNVSTEKRQQVLEDF 229 (409) T ss_pred CCceEEEEccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCceeEEecCCCCCHHHHHHHHHHH Confidence 335677899999999998877777899999999999999999888774 555555667788899999999999999999 Q ss_pred HHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhh Q lcl|NC_021305. 240 DRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMA 319 (518) Q Consensus 240 ~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~ 319 (518) ++.++ ++|+++|+++|++|++++.++.|+||+++++++.++||++|||||++||..++++++|.|++.+.|+++||. T Consensus 230 ~~~~~---n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~~~f~~~~l~ 306 (409) T protein:vir:96 230 KQYYE---ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHTLL 306 (409) T ss_pred HHHhh---cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHH Confidence 98874 678999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhhhhhcc--cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecc Q lcl|NC_021305. 320 IPIARIQSAMDKYVGQYWVR--KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANS 397 (518) Q Consensus 320 P~~~~ie~~l~~~l~~~~~~--~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~ 397 (518) |+++.|+++|+++|+++.+. 
+++++||++.+++.|.+++++++++++++|++|+||+|+++|+||+| |||++++++ T Consensus 307 P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~--ggD~~~~~~ 384 (409) T protein:vir:96 307 PIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVE--GGDKPLISG 384 (409) T ss_pred HHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCC--Ccceeeecc Confidence 99999999999999987654 67899999999999999999999999999999999999999999995 899999999 Q ss_pred cccccccccccCCCCCCCCCCCCCc Q lcl|NC_021305. 398 ALQPLGATPDGAVEWEEAPAPKRPA 422 (518) Q Consensus 398 n~~~~~~~~~~~~~~~~~~~~~~~~ 422 (518) |++|++.....+...+.+.+.++.+ T Consensus 385 n~~~~~~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:96 385 DLYPIDTPLELRKSLKGGDKNVNES 409 (409) T ss_pred cccccccchhhcccccCCCCCcCCC Confidence 9999876543322111111111100 No 40 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=100.00 E-value=1.1e-89 Score=508.26 Aligned_cols=415 Identities=17% Similarity=0.191 Sum_probs=330.0 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) |-|.+++..... +.|.......+. .++.++. .....++++++||+||++||+.||++|+++|+.+.++. . 
T Consensus 1 m~~~~~~~~~~~-----~~~~~~~~~~~~---~~~~~g~-~~~~~Al~~~~V~~cv~~ia~~iA~lp~~~~~~~~~~~-~ 70 (417) T protein:vir:38 1 MKLFRGLATEVD-----PHWADHLLDSGV---IPSFRGG-YLGISALRNSDVLTAVSIVSGDVSRFPLVITDSSTDEV-I 70 (417) T ss_pred CccccccccCCC-----ccchhhhccccc---ccccCCc-eechhhcccHHHHHHHHHHHHhhccCeeEEEEcCCcce-e Confidence 888765543322 223221110000 1111111 22345899999999999999999999999998876543 3 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCC-CceEEEEeeCCceeEEEEcCCceeeEEeeecccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS-GTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAG 159 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~-G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~ 159 (518) ..+++.++|+.+||++||+++||+.++.+++++||+|++|+|+.. |.+..|+|++|++|++.....+. ..|.+.. . T Consensus 71 ~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~~~-~~y~~~~--~ 147 (417) T protein:vir:38 71 DLANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDPDN-IIYRFTP--Y 147 (417) T ss_pred ccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCCCe-EEEEEEE--c Confidence 345677788899999999999999999999999999999999864 67999999999999998766554 3344332 3 Q ss_pred cCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHH Q lcl|NC_021305. 
160 VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQF 239 (518) Q Consensus 160 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~ 239 (518) .++....+++++||||++++.++ ++|+||+.++..+|....+++++..++|+||++|++|++.++.+++++.+++++.| T Consensus 148 ~~~~~~~~~~~dviH~r~~~~d~-~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~l~~e~~~~~~~~~ 226 (417) T protein:vir:38 148 NSSMQKVCGFEDVIHWKFFSYDT-IMGRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKAKESRLSAEARQKIREDF 226 (417) T ss_pred CCcEEEEecCcceEEecCCCCCC-ccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCHHHHHHHHHHH Confidence 45666789999999999987766 68999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhh Q lcl|NC_021305. 240 DRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMA 319 (518) Q Consensus 240 ~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~ 319 (518) ++.++| .|+|+++||++|++|++++.++.|+||++++++++++||++|||||++||. .++++|.+++.+.|+++||. T Consensus 227 ~~~~~g-~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~--~~~~s~~e~~~~~~~~~tl~ 303 (417) T protein:vir:38 227 ERAQAG-ADAGSPIIVDATMDYQPLEVDTNVLNLINSNNYSTAQIAKALRVPAYRLAQ--NSPNQSVKQLADDYIRNDLP 303 (417) T ss_pred HHHhcc-cccCCceeccCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHhCC--CCcchhHHHHHHHHHHHHHH Confidence 999987 489999999999999999999999999999999999999999999999984 56899999999999999999 Q ss_pred HHHHHHHHHHHHhhhhhhcc-cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccc Q lcl|NC_021305. 320 IPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSA 398 (518) Q Consensus 320 P~~~~ie~~l~~~l~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n 398 (518) ||++.||++|+++|+++.++ .++++||.+.+...+. 
..+++++++|++|+||+|+++|+||++++++|++++|+| T Consensus 304 P~~~~ie~~l~~~Ll~~~~~~~~~~~fd~~~l~~~~~----~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~~~n 379 (417) T protein:vir:38 304 FYFEPITSEFELKLLDDAQRHQYCIGFDTKSVNGLPI----ADVNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQSTLN 379 (417) T ss_pred HHHHHHHHHHHhhhcChhhcccceEEechhhhhHHHH----HHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeeccc Confidence 99999999999999986553 5678999887754433 347788999999999999999999999888899999999 Q ss_pred ccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchh Q lcl|NC_021305. 399 LQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNS 445 (518) Q Consensus 399 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 445 (518) +++++.....+.....+.+.++...+ ++++.++.... + T Consensus 380 ~~~~d~~~~~~~~~~~~~kgg~~~~~----~~~~~~~~~~~-----~ 417 (417) T protein:vir:38 380 TVFLDQKEAYQAEHAAELKGGDTNAK----GNQNGSGTNAN-----S 417 (417) T ss_pred ccccccccccccccccccCCCCCCCC----CCCcCCCCcCC-----C Confidence 99998766554433333222221111 11110000000 0 No 41 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=100.00 E-value=2.9e-88 Score=500.46 Aligned_cols=505 Identities=16% Similarity=0.180 Sum_probs=351.6 Q ss_pred CcCCCCCC-----CCcccccccchhhhhhhcccccc--------cccccccchhhhHHHhhcHHHHHHHHHHHHhhccCc Q lcl|NC_021305. 1 MLLANGQT-----LSAPAMAELSPQMQDSYYYAPAV--------GMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLP 67 (518) Q Consensus 1 ~~f~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~ 67 (518) ..|.--+. .-.|=.....||..+.|...... 
.........++...++++++|++||++||++||++| T Consensus 63 ~~~~~~~~~kk~~i~~pfkkk~~~~~~d~f~~s~es~s~vtsls~pdaf~~vnVs~~~AlknsaV~scI~~IA~sIAsLP 142 (945) T protein:vir:10 63 IIFRKNQVLKKEKIIVPYNHQEPPFKFNLFEYSPESLMYLPSISDPDAFFLINLFRKYRFNNDSKLIKVSEIPKKLTSKE 142 (945) T ss_pred eeehhhhHHHhhcccccccccccchhhhhhhccCccceecccccCccceeeehhhhhhhhccHHHHHHHHHHHhhhccCc Confidence 55653332 22332223344554443322211 111112345778889999999999999999999999 Q ss_pred eEEEEecCCcc------eeccchHHHHHHhcCCcCCCHHH----HHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCc Q lcl|NC_021305. 68 VKCMFTSGDTE------TEESDTGYAKLLADPCEYLDPFA----FWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPS 137 (518) Q Consensus 68 ~~v~~~~~~~~------~~~~~~~~~~L~~~PN~~~s~~~----f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~ 137 (518) +++|++..++. +...+|+++.|+.+||++||+++ |++.++.+++++||+|++++|+..|++++|+|++|+ T Consensus 143 lklYrr~edG~~~~~~kk~~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~G~ii~L~pLdPs 222 (945) T protein:vir:10 143 LEIYKHIEDKHVNYYLKRIRDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDEQGNLVAITPVDGT 222 (945) T ss_pred eEEEEecccCcccccccccccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCc Confidence 99999876653 23467788888889999999988 556788999999999999999999999999999999 Q ss_pred eeEEEEcCCceeeEEeeecccccCceeEEeccccEE-EEeccCCCCcc--cCchHHHHHHHHHHHHHHHHHHHHHHHH-c Q lcl|NC_021305. 138 RVAIKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVV-PIRFFNPDGLE--RGLSLMESLKSTIFSEDSSRNATAAMWK-N 213 (518) Q Consensus 138 ~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evi-h~~~~~~~~~~--~G~s~l~~~~~~i~~~~~~~~~~~~~~~-n 213 (518) +|++..+.++...++.... ..+.....++++++| |++.+++++.. +|+||+.++++++....++++++.++|. 
| T Consensus 223 ~Vti~~ddDG~~~y~Yv~~--idG~~~~~v~a~DvIlhirn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~aar~FskN 300 (945) T protein:vir:10 223 TIKPILSEDTGIVVGYVQE--VDGAIVAHFDKRDVVLFRQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGNLDYYRKG 300 (945) T ss_pred ceEEEEcCCCcEEEEEEEe--cCCceEEEecCCceEEEeccCCCCcccccCCchHHHHHHHHHHHHHHHHHHHHHHHHhC Confidence 9999988877654332221 234455678888865 56777777643 6999999999999999999999999995 7 Q ss_pred cCCcccccccC----------ccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHH Q lcl|NC_021305. 214 AGRPNLVLRHE----------KRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREE 283 (518) Q Consensus 214 g~~p~~il~~~----------~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~ 283 (518) |++|+|+|+++ +.+++++.+++++.|++.++| .+.|+++|+++|++|++++.++.|+||++++++++++ T Consensus 301 Ga~PsGILsvkg~~~~d~k~~~~LseEq~erlKe~wee~~sG-~NnG~piVLdeGmef~pLs~s~~DaQfLEsrkfs~ee 379 (945) T protein:vir:10 301 GSIPEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQAIMMG-DYTQVPILSGGKFTWIDFKGKRRDMQFKELAEFVARK 379 (945) T ss_pred CCccceEEEecCccccccccccccCHHHHHHHHHHHHHHhCC-cccccceecCCCceEEEccCChhHHHHHHHHHHHHHH Confidence 88999999754 568999999999999999988 5678889999999999999999999999999999999 Q ss_pred HHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHH Q lcl|NC_021305. 284 VCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQ 363 (518) Q Consensus 284 Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~ 363 (518) ||++|||||++||+.++++++|++++...|+++||.|++.+||++||++|+.... +.+++|+++.....|.+++++++. 
T Consensus 380 IArAFGVPP~lLG~~e~st~SNiEqq~~~Fv~~tL~Pil~~IEqeLNrkLl~~~e-g~~i~fdFd~ldl~D~ksraEal~ 458 (945) T protein:vir:10 380 ICAVYQVSPQDVGILEGSNKATAEVMASLTKAKGLEPLMATISKGFDEVVSEFRN-EKDIKLWFKEDDLEKERDWWNIIQ 458 (945) T ss_pred HHHHhCCCHHHcccCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-CceeEEEecchhccCHHHHHHHHH Confidence 9999999999999999999999999999999999999999999999999875443 345667777777789999999999 Q ss_pred HHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecc-cccccccccccCCCCCCCCCCCCCccCCCCCCCccc-cCCccccc Q lcl|NC_021305. 364 KMVNSGVATPNEGREIMGLPRSDDPKADELYANS-ALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSP-PTSVPGLS 441 (518) Q Consensus 364 ~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~-n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 441 (518) +++++|+||+||+|+++|+||++ |||+++++. |+.|.+....+.....+.+.....++++...+++.. +...+. + T Consensus 459 kli~sGiLTiNEvRe~lGLpPIe--GGD~lli~~nn~~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGGe~dEns~~ps-E 535 (945) T protein:vir:10 459 GQLNTGFRSINEARMEKGLEPVP--WGDVPFSGLRNWKPEDEQAKAQQGAMPPQLAQAMADQPSQQGGGVDENSSVPS-E 535 (945) T ss_pred HHHhCCCcCHHHHHHHhCCCCCC--CcceeeeccccccccccccccccCCCCcccccCCCCCCCCCCCCCCCCCCCCC-c Confidence 99999999999999999999995 899999987 567776654443333332222222222222111111 111110 0 Q ss_pred cchhc--------chhhHHHHHHHHhhcccCCchhhHHHHHHHHHhhc-cccCcCchhHHHHHHH---------HHHHhH Q lcl|NC_021305. 442 PTNSD--------RSTDSGKTEPRRLMQKPPPKESSPKHLRAVKGAMG-RGKDIKGFALQLAEKY---------PDDLED 503 (518) Q Consensus 442 ~~~~~--------~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~---------~~~~~~ 503 (518) +++.. +..+..+.+.-++...-.+.+++.++++.++..+- .|.+ .-.+-.+-. .|+| T Consensus 536 ~kda~~e~~~~l~~~~~~~a~e~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~-- 610 (945) T protein:vir:10 536 QKNAGLEVLRNLFKSLDANASENLKQVIELTNDDNYLKEKELLTRVLKSVGLD---SVSEFIENNSQTDVEVSAKDIL-- 610 (945) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCchhHHHHHHHHHHHHhhhH---HHHHHHhcCCccceeechhhhh-- Confidence 00000 00011111111112222445555555555555542 1111 000000000 0000 Q ss_pred HHHhh------hhhhhcccCC Q lcl|NC_021305. 
504 ILLAV------QLALAERKDN 518 (518) Q Consensus 504 ~~~~~------~~~~~~~~~~ 518 (518) +|-- ..-.|..||- T Consensus 611 -~~~~~~~~~~~~~~~~~~~~ 630 (945) T protein:vir:10 611 -SFKYNSLVEDETIYATEKDI 630 (945) T ss_pred -hhhhhhhccccceeecchhh Confidence 0000 0001111111 No 42 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=100.00 E-value=2.5e-88 Score=500.89 Aligned_cols=486 Identities=17% Similarity=0.141 Sum_probs=332.5 Q ss_pred cchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHHhcCCcC Q lcl|NC_021305. 17 LSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETEESDTGYAKLLADPCEY 96 (518) Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~ 96 (518) .|.. +.+.+....+.......++.+.|+++++|++||++||++||++||++|+.++ .....|+++++|+.+||++ T Consensus 1 ~~~~---~~~~g~~~~~~~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~--~~~~~~~l~~lL~~~PN~~ 75 (723) T protein:vir:94 1 MTTF---PSGAGGWNAWSADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPDG--ELDELHPLSQLWNVMPNRA 75 (723) T ss_pred Cccc---ccCCCccccccccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCCC--ccchhhHHHHHHhhCCCCC Confidence 1111 1111111123333445667788999999999999999999999999987543 3344567777778899999 Q ss_pred CCHHHHHHHHHHHHHHcCCeEEEEEEc---CCCceEEEEeeCCceeEEEEcCCceeeE----EeeecccccCceeEEecc Q lcl|NC_021305. 97 LDPFAFWEWVASTLDIYGETYLAIQKN---KSGTPEKLMPMHPSRVAIKRNSRTGRYE----YYFQAGAGVGTQLVSFAD 169 (518) Q Consensus 97 ~s~~~f~~~~v~~ll~~G~~~~~i~r~---~~G~~~~l~~l~p~~v~v~~~~~~~~~~----~~~~~~~~~~~~~~~~~~ 169 (518) ||+++||+.++.+|+++||+|++++|+ ..|.+.+|+++++..+.+....+..... +.|.+. 
..++..+.+++ T Consensus 76 ~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~~~~~y~~~-~~~G~~~~~~~ 154 (723) T protein:vir:94 76 MPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVPQAQIIGYVIE-RTDGVRVPVLA 154 (723) T ss_pred CCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccceeeeeeEEEEE-ecCceeEEecc Confidence 999999999999999999999999965 4588999999999888776554432211 112122 23566788999 Q ss_pred ccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCcccc Q lcl|NC_021305. 170 DEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNT 249 (518) Q Consensus 170 ~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~ 249 (518) ++||||+.+++.+.++|+||+..+..+|....++++++.++|+||++|+|||+.+ .+++++.+++++.|++.++|..|+ T Consensus 155 ~dIiHir~~~~~dg~~G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~~-~l~~e~~~~~~~~~~~~~~G~~Na 233 (723) T protein:vir:94 155 DEMLWLRFSDPYDPLAVMAPWKAARAAVDADFYAATWQRQSFKNGARPGGVVNLG-DMDEQTFTKTVAAFRSQVEGVQNA 233 (723) T ss_pred cceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcC-CCCHHHHHHHHHHHHHHhhchhhc Confidence 9999999998777789999999999999999999999999999999999999986 589999999999999999999999 Q ss_pred CCeeecC----------CCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhh Q lcl|NC_021305. 250 GKTMVVE----------EGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMA 319 (518) Q Consensus 250 g~~~vl~----------~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~ 319 (518) |+++||+ .|++|++++.+++|+||++++++++++||++|||||++|+. .++++|.+++.+.|+++||. 
T Consensus 234 gk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~--~st~sN~e~~~~~f~~~tL~ 311 (723) T protein:vir:94 234 GRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRKDALLG--GSTYENQAEAKAAVWTETLI 311 (723) T ss_pred CcceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCChhHcCC--CCCcccHHHHHHHHHHHHHH Confidence 9999985 58999999999999999999999999999999999999964 56899999999999999999 Q ss_pred HHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccc- Q lcl|NC_021305. 320 IPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSA- 398 (518) Q Consensus 320 P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n- 398 (518) ||++.||++||++|++..+...+++||...+++.|.+++++++.+++++|++|+||+|+++|+||+++..|+.++.|.+ T Consensus 312 P~~~~ie~~ln~~Ll~~~g~~~~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lglpPi~gGd~~~~~~p~~~ 391 (723) T protein:vir:94 312 PQMEVMASITDLQLLPDIGWTVEWDFNSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGLDPLPGGIGQMTLTPYRA 391 (723) T ss_pred HHHHHHHHHHhHhhcccccCceEEeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceeccccc Confidence 9999999999999998877777788998899999999999999999999999999999999999997544556666653 Q ss_pred -ccccccccccCCCCCCC---CCCC-CCccCCCCCCCccccCCccccccchhcchhhHH-HHHHHHh-------hccc-- Q lcl|NC_021305. 399 -LQPLGATPDGAVEWEEA---PAPK-RPASTPVASLDQSPPTSVPGLSPTNSDRSTDSG-KTEPRRL-------MQKP-- 463 (518) Q Consensus 399 -~~~~~~~~~~~~~~~~~---~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~-------~~k~-- 463 (518) ..|.+... ...++... +... ..++.| ..+.+..+....+.+....+++.. ++.+..+ +.+. 
T Consensus 392 ~~a~~~~~~-p~~~e~~~~~~~~~~~~~~~~p---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 467 (723) T protein:vir:94 392 QFAPAPAPA-PAVEEGAARMLALLERVAADRP---LPELPVRATTVLHHDPGPDPQQTLYERLEALLQPLLVELGRRQAA 467 (723) T ss_pred cccCCCCCC-ccchhhhHhhhhhccccccccC---cCCCCCCCCCCCCCCcccCCchhHHHHHHHHHhhhHHHHHHHHHH Confidence 33322111 00000000 0000 000001 001111111111111111111111 1111000 0000 Q ss_pred ----------CC---chhhHHHHHHHHHh-hccccC-cCchhHHHHHHHHHHHhHHHHhhhhh-----hh----cccCC Q lcl|NC_021305. 464 ----------PP---KESSPKHLRAVKGA-MGRGKD-IKGFALQLAEKYPDDLEDILLAVQLA-----LA----ERKDN 518 (518) Q Consensus 464 ----------~~---~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~----~~~~~ 518 (518) .+ .+.+...++.+.+- ..++-= ....+-.. -.+.++.|+.+.+.+ ++ .|+.- T Consensus 468 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 543 (723) T protein:vir:94 468 VTLREFDLLMRGERAAALWLADVRAVASEAYERGALLAPPDAEEV---PPARLTRLDLAPEELAVRINVKRIFNARKWV 543 (723) T ss_pred HHHHhhchhhcchHHHHHHHHHHHHHHHhccccceeccccccchh---hHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH Confidence 01 11111233333221 111100 00000000 012222233222211 11 12211 No 43 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=100.00 E-value=8e-89 Score=503.55 Aligned_cols=410 Identities=14% Similarity=0.168 Sum_probs=335.3 Q ss_pred cCCCCCC----CCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_021305. 2 LLANGQT----LSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDT 77 (518) Q Consensus 2 ~f~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~ 77 (518) |+..-.. ....+....+.|+. .+|..+ ++. 
..+...++.+.|+++|+|++||++||++||+|||++|+...++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~-~~~-~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~lp~~v~~~~~~g 77 (460) T protein:vir:10 1 MANRIIRALRELTGLDNKFNDAFIK-YIGQTF-TKY-DNNGKTYLEQGYNINPDVYSCISQMAAKTVAVPYTIKVVKDTK 77 (460) T ss_pred CchhHHHHHhhhhccCCCchHHHHH-hhcccc-CCC-ccchhhhhHHHHhcchHHHHHHHHHHHhhhhCceEEEeccCCc Confidence 3332111 11122222334542 233221 222 2334567788899999999999999999999999999987765 Q ss_pred ce----------------------------eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcC----C Q lcl|NC_021305. 78 ET----------------------------EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK----S 125 (518) Q Consensus 78 ~~----------------------------~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~----~ 125 (518) .. ...++..+.|+.+||++||+++||+.++.+++++|++|++++|+. . T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~~~~~ 157 (460) T protein:vir:10 78 AYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSPDDGINA 157 (460) T ss_pred cchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCccC Confidence 32 234566778999999999999999999999999999999999964 4 Q ss_pred CceEEEEeeCCceeEEEEcCCceeeEEeee---cccccCceeEEeccccEEEEeccCCC-----CcccCchHHHHHHHHH Q lcl|NC_021305. 126 GTPEKLMPMHPSRVAIKRNSRTGRYEYYFQ---AGAGVGTQLVSFADDEVVPIRFFNPD-----GLERGLSLMESLKSTI 197 (518) Q Consensus 126 G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~evih~~~~~~~-----~~~~G~s~l~~~~~~i 197 (518) |.+.+||||+|+.|++..+.++....|.+. 
+....++..+.|++++|||||+++++ +..+|+||+..+...+ T Consensus 158 G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i 237 (460) T protein:vir:10 158 GVPSQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLIQGDQFIEFNEDEVIHTKYANPNFDLQGSHLYGMSPIRAILRNI 237 (460) T ss_pred ceeEEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEecCceeEEecccceEEEecCCCCcccccCccccccHHHHHHHHH Confidence 789999999999999999888766554432 22234677889999999999988765 3468999999999999 Q ss_pred HHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHH Q lcl|NC_021305. 198 FSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEAR 277 (518) Q Consensus 198 ~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~ 277 (518) ....+++++..++|+||+.|+++++.++.+++++.+++++.|++.++|.+|+|+++||++|++|++++.++.|+||++.+ T Consensus 238 ~~~~~~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~ 317 (460) T protein:vir:10 238 NSQNSTIDNNVKTMQNGGVFGFIHGGSTGLTQPQADSLKQRLTEMDKSPDRLSQIAGASGEIAFTKISLNTDELKPFDYL 317 (460) T ss_pred HHHHHHHHHHHHHHhcCCCcceeeecCCCCCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCHHHhcccccc--ccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcc--cccceecchhh--h Q lcl|NC_021305. 278 QLNREEVCGVYDIAPPIVHILDRA--TFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVR--KNRMKFDIDDV--I 351 (518) Q Consensus 278 ~~~~~~Ia~~fgVPp~~lg~~~~~--~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~--~~~~~fd~~~l--~ 351 (518) ++..++||++|||||++||+.+++ +++|.|++.+.|+++||.|++..||++||++|+++.+. 
+++++||++.+ + T Consensus 318 ~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~~~~i~~d~~~l~~l 397 (460) T protein:vir:10 318 KYDQKAICNALGWSDKLLNNNEGGGLNTGNLEEERKRVVTDNIQPDLVILKQAFDKKFIKRFKGYENAVIEWDISELPEM 397 (460) T ss_pred HHHHHHHHHHhCCCHHHhCCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccCCceEEeecchhhhH Confidence 999999999999999999987654 68999999999999999999999999999999987654 46778988877 5 Q ss_pred hcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCc Q lcl|NC_021305. 352 QPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQ 431 (518) Q Consensus 352 ~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (518) +.|.+++ ..++++|++|+||+|+++|+||++++|||++++|+|++|++...++......+ + T Consensus 398 ~~d~~~~----~~~~~~g~~T~NE~R~~~g~~pi~~~~gD~~~~~~n~~~~~~~~~~~~~~~~n---------------q 458 (460) T protein:vir:10 398 QTDMVAM----ASWLNTIPVTPNEIRIAMKYETLNQDGMDIVFMPSNKVRIDDVSNNLIDSAFN---------------Q 458 (460) T ss_pred HHHHHHH----HHHHhCCCCCHHHHHHHhCCCCCCCCCCCeeeecccccchhhcccccCCCccc---------------C Confidence 5555544 45678999999999999999999999999999999999987543321111000 0 Q ss_pred cccC Q lcl|NC_021305. 432 SPPT 435 (518) Q Consensus 432 ~~~~ 435 (518) + + T Consensus 459 ~--~ 460 (460) T protein:vir:10 459 N--Q 460 (460) T ss_pred C--C Confidence 0 0 No 44 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=100.00 E-value=2.5e-88 Score=500.88 Aligned_cols=404 Identities=17% Similarity=0.185 Sum_probs=331.5 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 
1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) |=|-+++... ....++++...+++.. .. .+....++++++|++||++||++||+||+++++.++ .+. T Consensus 1 m~~f~~~~~~---~~~~~~~~~~~~~~~~-----~~---~~~~~~Al~~~~V~~~i~~Ia~~iA~lp~~~~~~~g--~~~ 67 (406) T protein:vir:97 1 MSFFQPLGTS---KVSYDDYISSVLAGDV-----SQ---KYLGVSALKNSDILTATSIIAGDIARFPLVKKDVNG--DII 67 (406) T ss_pred CccccccCCC---CCCcchHHHHHhcCCC-----Cc---ccccchhhccHHHHHHHHHHHHhhhhCeeEEEecCc--ccc Confidence 8776654332 2234555554443321 11 233345899999999999999999999998876554 344 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcC-CCceEEEEeeCCceeEEEEcCCceeeEEeeecccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK-SGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAG 159 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~-~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~ 159 (518) ..|++.++|+.+||++||+++||+.++.+++++||+|++++|+. .|.+.+|+|++|+.|++..+.++. ..|.+. .. T Consensus 68 ~~~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~~~~-~~y~~~--~~ 144 (406) T protein:vir:97 68 HDEDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETDNHE-IVYTFT--DM 144 (406) T ss_pred ccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcCCce-EEEEEE--ec Confidence 55667777778999999999999999999999999999999984 689999999999999998776553 444433 33 Q ss_pred cCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHH Q lcl|NC_021305. 
160 VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQF 239 (518) Q Consensus 160 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~ 239 (518) .++..+.+++++|||||+++.++ .+|+||+.++..++....+++++..++|+||+.|++++..++.+++++.+++++.| T Consensus 145 ~~~~~~~~~~~evih~r~~~~dg-~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~ 223 (406) T protein:vir:97 145 LTAKQVKCFAHDVIHWKFFSHDT-ILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSSGILTMKGAQLSGDARQRARQEF 223 (406) T ss_pred CCceEEEEccccEEEecCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEecCCCCCHHHHHHHHHHH Confidence 46777899999999999887666 67999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhh Q lcl|NC_021305. 240 DRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMA 319 (518) Q Consensus 240 ~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~ 319 (518) ++.++| .|+|+++||+.|++|++++.++.|+||+|.+++++++||++|||||++||. .++++|.+++.+.|+++||. T Consensus 224 ~~~~~g-~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~--~~~~~~~e~~~~~f~~~~l~ 300 (406) T protein:vir:97 224 EKMREG-SVGGSPLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGV--NSPNQSVAQLMEDYVTNDLP 300 (406) T ss_pred HHHhcc-cccCceeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCC--CCCcchHHHHHHHHHHHHHH Confidence 999887 688999999999999999999999999999999999999999999999985 45688999999999999999 Q ss_pred HHHHHHHHHHHHhhhhhhcc-cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccc Q lcl|NC_021305. 320 IPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSA 398 (518) Q Consensus 320 P~~~~ie~~l~~~l~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n 398 (518) ||++.||++|+++|++..+. .++++||++. 
+.+.+++.+.+++++|++|+||+|+++|++|+++++||++++|+| T Consensus 301 P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~----~~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~gD~~~~~~n 376 (406) T protein:vir:97 301 FYFDAITSELGLKTLNDKDRRLYHIEFDTRS----VTGRNVDEIVKLVNNQILTPNQGLVELGKQKSTDPNMDRYQSSLN 376 (406) T ss_pred HHHHHHHHHHhhhhcChhhccceeEEEecCc----cchhhHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeEeeccC Confidence 99999999999999986543 4678898654 456667788899999999999999999999999999999999999 Q ss_pred ccccccccccCCCCCCCCCCCCCccCCCCCCCcc Q lcl|NC_021305. 399 LQPLGATPDGAVEWEEAPAPKRPASTPVASLDQS 432 (518) Q Consensus 399 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 432 (518) ++|++...+++.......+. ....++++++ T Consensus 377 ~~~~~~~~~~~~~~~~~~~g----g~~~~~~~~~ 406 (406) T protein:vir:97 377 YVFLDKKEEYQDKVGIKGKG----GEVNAEEDKS 406 (406) T ss_pred ccchhcccccccccccccCC----CCCCCCCCCC Confidence 99998765444322221111 1111111111 No 45 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=100.00 E-value=5.7e-86 Score=487.94 Aligned_cols=419 Identities=15% Similarity=0.200 Sum_probs=333.0 Q ss_pred CcCCCCCCCCcccccc-------------------cchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHH Q lcl|NC_021305. 1 MLLANGQTLSAPAMAE-------------------LSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQ 61 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~ 61 (518) -|+++....++.+... ..|++....++.+. 
.+....+..++.+.|+++++|++||++||+ T Consensus 6 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-~~~~~~g~~v~~~~a~~~~~v~~~i~~Ia~ 84 (466) T protein:vir:81 6 RLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPST-ELAPDTFVGLATQAYQANGPVFACMLVRQL 84 (466) T ss_pred HHhhccCcccccchhhhhhhhhhhhccccccccccccHHHHHhhccccc-cccCccccccchhhhhccHHHHHHHHHHHH Confidence 2344433332211100 11222222222111 122334567889999999999999999999 Q ss_pred hhccCceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCC--------CceEEEEe Q lcl|NC_021305. 62 ALARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS--------GTPEKLMP 133 (518) Q Consensus 62 ~ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~--------G~~~~l~~ 133 (518) +||+|||++|++++++.++..+|+++.|+.+||++||+++||+.++.+++++||+|++|+|+.. |.+++|+| T Consensus 85 ~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~~~g~~~~l~~ 164 (466) T protein:vir:81 85 VFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPDWVDVVVEERM 164 (466) T ss_pred hhccCceEEEEecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCccccccccCcceeEEEE Confidence 9999999999998888888889999999999999999999999999999999999999999765 45899999 Q ss_pred eCCceeEEEEcCCcee-eEEeeeccc-ccCceeEEeccccEEEEecc-CCCCcccCchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 134 MHPSRVAIKRNSRTGR-YEYYFQAGA-GVGTQLVSFADDEVVPIRFF-NPDGLERGLSLMESLKSTIFSEDSSRNATAAM 210 (518) Q Consensus 134 l~p~~v~v~~~~~~~~-~~~~~~~~~-~~~~~~~~~~~~evih~~~~-~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~ 210 (518) ++|++|++..+.++.. ..|.+.... 
..+...+.|++++||||+.+ ++.+..+|+||+..+.++|....+++++..++ T Consensus 165 l~~~~v~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~dviHir~~~~~~d~~~G~s~i~~~~~~i~~~~a~~~~~~~~ 244 (466) T protein:vir:81 165 VRGGRGELGGGQLGWRKVGYLYTEGGRQSGNESVGFLAEDVVHFAPIPDPLASYRGMSWLTPILREIRADQAMSKHQAKF 244 (466) T ss_pred ecCcceEEEEcCCCceEEEEEEEecCcccccceeeeccccEEEEcCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999998776643 444444332 23456778999999999975 45566789999999999999999999999999 Q ss_pred HHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021305. 211 WKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDI 290 (518) Q Consensus 211 ~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgV 290 (518) |+||+.|+|||++++.+++++.+++++.|++.++|..|+|+++||++|++|++++.++.|+||+++++++.++||++||| T Consensus 245 f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgV 324 (466) T protein:vir:81 245 FDNGATVNLVIKHNPMADPAAVKKWADEVNSKHAGVDNAWKNLNLYPGADADVVGSNLQEIDFKNVRGGGETRIAAAAGV 324 (466) T ss_pred HhcCCCcceEEecCCCCCHHHHHHHHHHHHHHhcCccccccceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CHHHhcccc---ccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcc-cccceecchhhhhcCHHHHHHH----- Q lcl|NC_021305. 
291 APPIVHILD---RATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSES----- 361 (518) Q Consensus 291 Pp~~lg~~~---~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~-~~~~~fd~~~l~~~d~~~~~~~----- 361 (518) ||++||+.+ +++|+|.|++.+.|+++||.||+++||++|+++|++..++ .++++||..++++.|.++++++ T Consensus 325 Pp~~lG~~~~~~~st~sn~eq~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~llr~d~~~r~~~~~~~~ 404 (466) T protein:vir:81 325 PPVIVGLSEGLAAATYSNYGQARRRLADGTAHPLWQNLSGCIGHVMPDMGPDVRLWYDADDVPFLREDEKDAADIQKVRA 404 (466) T ss_pred CHHHcccccCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCcceEEEecchhhhccCHHHHHHHHHHHH Confidence 999999763 5789999999999999999999999999999999986543 5678999999999999998876 Q ss_pred --HHHHHhCCCcCHHHHHHHhCCCCCCCCCcceee-ecccccccccccccCCCCCCCCCCCCCccCCCCCCCcc Q lcl|NC_021305. 362 --TQKMVNSGVATPNEGREIMGLPRSDDPKADELY-ANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQS 432 (518) Q Consensus 362 --~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~-~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 432 (518) +..++++|+ |+||+|+.+ ++||.++ .+.++.+++....++.....++++...+ ++++.+ T Consensus 405 ~~~~~~~~~g~-t~nE~r~~~-------~~gd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G----g~~ngn 466 (466) T protein:vir:81 405 ETINTLITAGY-EPESVVAAV-------NSGDLRLLKHTGLTSVQLLPPGVSASASSDTPTSGG----ADDNGN 466 (466) T ss_pred HHHHHHHHcCC-Chhhccccc-------cCCccccccCCCcchhhhcccccccccCCCCcccCC----CCcCCC Confidence 667889995 999999643 2666654 4456666655443333222222211111 011110 No 46 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=100.00 E-value=1.4e-85 Score=485.85 Aligned_cols=395 Identities=14% Similarity=0.202 Sum_probs=323.0 Q ss_pred CcCCCCCCCCcccc-cccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 
1 MLLANGQTLSAPAM-AELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) =+|++++.+..+.. ....++............... .......++++++|++||++||++||++||++|++++++.+ T Consensus 14 ~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~v~~cI~~ia~~ia~~~~~~~~~~~~~~~ 90 (413) T protein:vir:96 14 KFFNNKRSPTEESKAKDEIPKAPQVVMTLPNFFKEL---ISDGYTKLSDSPEVRMAVDCIADLVSNMTIQLMQNGETGDK 90 (413) T ss_pred CccccCCCcchhhhhhccccccccccccchhhHhhh---ccchhHHHhhchHHHHHHHHHHHhhccCceEEEEecCCCcc Confidence 35666554332211 111111100000000000000 01112347889999999999999999999999999988888 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCC-ceEEEEeeCCceeEEEEcCCceeeEEeeeccc Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSG-TPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGA 158 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G-~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~ 158 (518) ...|+..++|+.+||++||+++||+.++.+++++|++|++++|+..| .+.+|||++|.+|++..+.+. +.|.+...+ T Consensus 91 ~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~~~~L~~l~~~~v~~~~~~~~--~~y~~~~~~ 168 (413) T protein:vir:96 91 RIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDKIIGLTPISPYKVTFNVSDDD--LDYSITFDN 168 (413) T ss_pred ccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCceEEEEEecCceeEEEEcCCe--EEEEEeecC Confidence 88888888999999999999999999999999999999999999877 578999999999999887654 334433221 Q ss_pred ccCceeEEeccccEEEEec-cCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHH Q lcl|NC_021305. 159 GVGTQLVSFADDEVVPIRF-FNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLRE 237 (518) Q Consensus 159 ~~~~~~~~~~~~evih~~~-~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~ 237 (518) .++++++||||+. 
+++++..+|+||+.++...+....+++++..++|+||+.|+|+|++++.+++++.+++++ T Consensus 169 ------~~~~~~evih~k~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~ 242 (413) T protein:vir:96 169 ------KEYDPSTLLHFVLNPSIERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDSDSDELSDEEGRE 242 (413) T ss_pred ------cEEchhhEEEEeccCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHH Confidence 3578999999996 456667789999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhcCccccCCeeecCCCcc-eeec-cCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHH Q lcl|NC_021305. 238 QFDRAHSGSSNTGKTMVVEEGME-PIPL-QLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYR 315 (518) Q Consensus 238 ~~~~~~~g~~n~g~~~vl~~g~~-~~~l-~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~ 315 (518) +|++.++|..|+|+++|+++|.. +.++ ..++.|+||++++++++++||++|||||.+||.. ++.+++...|++ T Consensus 243 ~~~~~~~g~~n~g~~~vl~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~-----~~~~~~~~~~~~ 317 (413) T protein:vir:96 243 NFEEMYLKRKEAGKPWIIPEGMVNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLLGVG-----TYNKDEFNNFIN 317 (413) T ss_pred HHHHHhcCccccCceeeecCCcccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCC-----cchHHHHHHHHH Confidence 99999999999999999977654 4555 4689999999999999999999999999999752 345888899999 Q ss_pred HHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeee Q lcl|NC_021305. 
316 DTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYA 395 (518) Q Consensus 316 ~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~ 395 (518) +||.||++.||++||++|+++ +++++||++.+++.|.+++++++.+++++|++|+||+|+++|+||++ |||++++ T Consensus 318 ~~l~P~~~~ie~~ln~~ll~~---~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~--~gd~~~~ 392 (413) T protein:vir:96 318 TKIMSIAQVIQQTYNKLIVEE---DMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNEFRNWVGMPPDA--EMDDLLV 392 (413) T ss_pred HHHHHHHHHHHHHHHHhhCCC---CcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--Ccceeee Confidence 999999999999999999863 67899999999999999999999999999999999999999999995 7999999 Q ss_pred cccccccccccccCCCCCCCCCCCCCccC Q lcl|NC_021305. 396 NSALQPLGATPDGAVEWEEAPAPKRPAST 424 (518) Q Consensus 396 ~~n~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (518) |+|++|++...+.+...++. | T Consensus 393 ~~n~~~~~~~~~~~~~~~~d--------t 413 (413) T protein:vir:96 393 LENYLQQKDLVNQKKLIQDE--------T 413 (413) T ss_pred cccccchhhcccccCCCCCC--------C Confidence 99999998765544321111 1 No 47 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=100.00 E-value=8e-86 Score=487.12 Aligned_cols=374 Identities=18% Similarity=0.212 Sum_probs=314.1 Q ss_pred CcCCC-------------------------CCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHH Q lcl|NC_021305. 1 MLLAN-------------------------GQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTV 55 (518) Q Consensus 1 ~~f~~-------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ 55 (518) =||+. 
.+....++.+ ..+|+.-..+.++...+...++..++.+.++++++|++| T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~g~~~~~~~~~~~~~t~~~~~~~~~v~ac 84 (409) T protein:vir:83 6 NLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEAR-ALPWIRPTAWSGYPESWATPSWGSAQDKLRTLIDVAWAC 84 (409) T ss_pred hhcccccCCCcccccccccccCCCCceeeccCCCcchhhh-hcccccccccccccccccccCccccchhhHhhhHHHHHH Confidence 22331 1222222222 245554332222223345556778889999999999999 Q ss_pred HHHHHHhhccCceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEE-EEcCCCceEEEEee Q lcl|NC_021305. 56 IAKRAQALARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAI-QKNKSGTPEKLMPM 134 (518) Q Consensus 56 v~~ia~~ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i-~r~~~G~~~~l~~l 134 (518) |++||++||+||+++|+++. ..+...++|+.+||+.||+.+||+.++.+|++ ||+|+++ .++..|.+++|+|| T Consensus 85 V~~Ia~~iA~lpl~~~~~~~-----~~~~~~~ll~~~PN~~~t~~~f~~~l~~~lll-Gnay~~~i~r~~~G~~~~L~pl 158 (409) T protein:vir:83 85 IDLNASVLSSMPIYRMRNGR-----IIDSVAWMSNPDPEVYTSWQEFAKQLFWDFQL-GEAFVLPMAHGSDGYPIRFRVV 158 (409) T ss_pred HHHHHHhhccCceEEeeCCc-----cccchhhhcccCCCCCCCHHHHHHHHHHHHhh-CCcEEEEEEECCCCcEEEEEEE Confidence 99999999999999997542 23556668889999999999999999999987 9999875 58999999999999 Q ss_pred CCceeEEEEcCCceeeEEeeecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHcc Q lcl|NC_021305. 135 HPSRVAIKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNA 214 (518) Q Consensus 135 ~p~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng 214 (518) +|+.|++..+.++. ..|.+. . 
.+.+++|||+|++++.+..+|+||+..+..++....++++++.++|+|| T Consensus 159 ~p~~v~v~~~~~g~-~~y~~~--~-------~~~~~eiiHir~~~~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng 228 (409) T protein:vir:83 159 PPWLVNVELKKGAR-REYRIG--G-------LNVTDEILHIRYQGNTADAHGHGPLESAAPRQVVIGLLQKYVQNLAETG 228 (409) T ss_pred CCcceEEEEcCCce-EEEEEc--c-------ccCccceEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 99999998887653 344332 1 2346899999998887778999999999999999999999999999999 Q ss_pred CCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCccee-eccCChhhHHHHHHHHHHHHHHHHHhcCCHH Q lcl|NC_021305. 215 GRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPI-PLQLTAVEMQFIEARQLNREEVCGVYDIAPP 293 (518) Q Consensus 215 ~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~-~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~ 293 (518) ++|+|+|++++.+++++.++++++|++.+.+ |+|+++++.+|+++. +++.+++|+||+|++++++++||++|||||+ T Consensus 229 a~p~gil~~~~~ls~e~~~~~~~~~~~~~~~--nag~~~il~~g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp~ 306 (409) T protein:vir:83 229 GVPLYWLGVERRLSETEAVDLMDRWIESRSK--YAGHPALVTGGATLNQAKSMSAQDLSLMELTQFNEARIAILLGVPPF 306 (409) T ss_pred CCcceEeecCCCCCHHHHHHHHHHHHHhhCC--ccCccceecCCcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCCHH Confidence 9999999999999999999999999998865 789999999999974 6899999999999999999999999999999 Q ss_pred Hhcccc---ccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCC Q lcl|NC_021305. 294 IVHILD---RATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGV 370 (518) Q Consensus 294 ~lg~~~---~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~ 370 (518) +||+.+ +.+|+|+|++...|+++||.||+++||++|+++|++. 
.++++||++.+++.|.+++++++++++++|+ T Consensus 307 llg~~~~~~~~tysn~eq~~~~f~~~tL~P~~~~ie~~l~~~Ll~~---~~~~~f~~~~llr~d~~~r~~~~~~~~~~G~ 383 (409) T protein:vir:83 307 LVGLPGATGSLTYSNIEQLFSFHDRSSLRPKATAVMAALDRWALPS---PQHLELNRDDYTRPSLVERATAYKIMIEAGV 383 (409) T ss_pred HccCCCCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCC---CcEEEeehhhhhccCHHHHHHHHHHHHhCCC Confidence 999754 4579999999999999999999999999999999975 4578999999999999999999999999999 Q ss_pred cCHHHHHHHhCCCCCCCCCcceeeeccccc Q lcl|NC_021305. 371 ATPNEGREIMGLPRSDDPKADELYANSALQ 400 (518) Q Consensus 371 ~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~ 400 (518) ||+||+|+++||||++ |||++- .+-+ T Consensus 384 lT~NE~R~~~glpp~~--ggd~l~--~~gv 409 (409) T protein:vir:83 384 MEPNEARAMERLHSEA--AAVRLS--GGGV 409 (409) T ss_pred cCHHHHHHHhCCCCCC--CCcccC--CCCC Confidence 9999999999999985 788762 1111 No 48 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=100.00 E-value=2.3e-84 Score=479.12 Aligned_cols=484 Identities=12% Similarity=0.093 Sum_probs=337.8 Q ss_pred CcCCCCC---CCCcccccccchhhh-----hhhcccccccc---------cccccchhhhHHHhhcHHHHHHHHHHHHhh Q lcl|NC_021305. 1 MLLANGQ---TLSAPAMAELSPQMQ-----DSYYYAPAVGM---------QLERQFSLYGGIYKNQPWVRTVIAKRAQAL 63 (518) Q Consensus 1 ~~f~~~~---~~~~~~~~~~~~~~~-----~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~v~~~v~~ia~~i 63 (518) =+|+... +.........++.+. ..++..+.... 
....+..+..-..+....|++||.+|+.++ T Consensus 40 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~i 119 (574) T protein:vir:80 40 EPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRNSQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSE 119 (574) T ss_pred cCCCHHHHHHhHhhhcccccchhhhhccccccccCcCccCCcccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhh Confidence 1133110 000000000111100 00000000000 000112233444556667888888888899 Q ss_pred ccCceEEEEecCCcc----eeccchHHHHHHhc----CCcCC-CHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEee Q lcl|NC_021305. 64 ARLPVKCMFTSGDTE----TEESDTGYAKLLAD----PCEYL-DPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPM 134 (518) Q Consensus 64 a~l~~~v~~~~~~~~----~~~~~~~~~~L~~~----PN~~~-s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l 134 (518) |+|||+|++++.++. .....|+++.|+.. |||++ |+.+||+.++.+++++|++|++++|+..|++++|||| T Consensus 120 a~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~G~~~~L~pl 199 (574) T protein:vir:80 120 TGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYDQVNFEKVFDKDGNFIKFDTV 199 (574) T ss_pred ccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEE Confidence 999999998765432 23456777777653 56664 8889999999999999999999999999999999999 Q ss_pred CCceeEEEEcCCceeeEE-eeecccccCceeEEeccccEEEEeccCCCC---cccCchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 135 HPSRVAIKRNSRTGRYEY-YFQAGAGVGTQLVSFADDEVVPIRFFNPDG---LERGLSLMESLKSTIFSEDSSRNATAAM 210 (518) Q Consensus 135 ~p~~v~v~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~evih~~~~~~~~---~~~G~s~l~~~~~~i~~~~~~~~~~~~~ 210 (518) +|.+|++..+.++..... 
..++....++..+.|++++|||++++...+ ..||+|||.++..+|....++++++.++ T Consensus 200 ~p~~V~v~~d~~~~~~~~~~~y~~~~~g~~~~~~~~~eiih~~~~~~~~~~~~~~G~spi~~a~~~i~~~~~a~~~~~~~ 279 (574) T protein:vir:80 200 DPTTIFLATNGEGKLIKNGERFVQVIDNRIVAKFNERELAFAVRNPRADIEVGQYGYPELEIALKQFIAHENTEVFNDRF 279 (574) T ss_pred cCceeEEEEcCccccccCceEEEEEeCCceEEEEccccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHH Confidence 999999998876643110 111112345677889999999999765433 4579999999999999999999999999 Q ss_pred HHccCCcccccccC--ccCCHHHHHHHHHHHHHHhcCccccCCeee-cCCCcceeeccCChhhHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 211 WKNAGRPNLVLRHE--KRLSEAAQQRLREQFDRAHSGSSNTGKTMV-VEEGMEPIPLQLTAVEMQFIEARQLNREEVCGV 287 (518) Q Consensus 211 ~~ng~~p~~il~~~--~~~~~~~~~~~~~~~~~~~~g~~n~g~~~v-l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~ 287 (518) |+||++|+|||+++ ..+++++.+++++.|++.++|..|+|++++ +++|++|++++.++.|+||++++++++++||++ T Consensus 280 f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~~s~~D~qfle~~~~~~~~Ia~a 359 (574) T protein:vir:80 280 FSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMTPSANDMQFEKWLNYLINVISAL 359 (574) T ss_pred HhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHH Confidence 99999999999875 458999999999999999999999999755 478999999999999999999999999999999 Q ss_pred hcCCHHHhcccccc----------ccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHH Q lcl|NC_021305. 288 YDIAPPIVHILDRA----------TFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEA 357 (518) Q Consensus 288 fgVPp~~lg~~~~~----------~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~ 357 (518) |||||++||+.+.+ |++|.|++.+.|+++||.|++.+||++||++|++..+..++++|+..+++..+... 
T Consensus 360 fgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~f~~~d~~~~~~~~ 439 (574) T protein:vir:80 360 YGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTYIVAEFGEKYQFQFRGGDLSAQLDKL 439 (574) T ss_pred hCCCHHHhcccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCCceEEEecccchhhHHHHH Confidence 99999999987653 57899999999999999999999999999999998887788888766665433222 Q ss_pred HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCC-ccCCCCCCCccccCC Q lcl|NC_021305. 358 KSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRP-ASTPVASLDQSPPTS 436 (518) Q Consensus 358 ~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 436 (518) .+..++.+||||+||+|+++|+||++ |||++++|.|+++++....+.....+....... ...+.+..++.+... T Consensus 440 ---~~~~~~~~G~lT~NE~R~~lgl~Pi~--gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (574) T protein:vir:80 440 ---KIIEQEGKVFRTVNEIRHDKGLEPIK--GGDVILNGVHIQAIGQALQEEQLEYQRSQDRLNRLLELSGGDVEQPEPE 514 (574) T ss_pred ---HHHHHHhCCccCHHHHHHHhCCCCCC--CCCEeeeccceeecccccccccCCccchhccccccccccCCCCCCCCCC Confidence 23457889999999999999999995 899999999999987654332222111111100 000001111111001 Q ss_pred cc-ccccchhcchhhH------HHHHHHHhhcccCCchhhHHHHHHHHHhhccccCcCch Q lcl|NC_021305. 437 VP-GLSPTNSDRSTDS------GKTEPRRLMQKPPPKESSPKHLRAVKGAMGRGKDIKGF 489 (518) Q Consensus 437 ~~-~~~~~~~~~~~~~------~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (518) ++ +.+.+......+. +..+......-.+++|++.|.+++.++..++++..+-. 
T Consensus 515 ~p~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 574 (574) T protein:vir:80 515 EPKDSQNDTDVSFQDEQQGLNGKSKKVNGKVDDNVGKDGQLKSEENTNSTKHGTDGIKKE 574 (574) T ss_pred CCCCccccccchhhhhhhhhccchhhhcCCcccccccccccccccccccccccCccccCC Confidence 11 0000000000000 01111122223477999999999999999888777664 No 49 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=100.00 E-value=1.3e-83 Score=475.01 Aligned_cols=396 Identities=15% Similarity=0.128 Sum_probs=319.6 Q ss_pred CcCCCCCCCCcccccccc-hhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELS-PQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) |=|.+.....+......+ .|+...++ ..++..++.+.++++++|++||++||++||++||++ T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~v~~~~al~~~~V~~~v~~ia~~ia~~p~~~--------- 63 (397) T protein:vir:38 1 MPLLKLNKSHSQGFSLNDPDWVNFLTG--------GEAQKYVSADTALKNSDIFSLIMQLSGDLAMVRYTS--------- 63 (397) T ss_pred CcchhhhhcccCcccCCchhhhhhhcC--------CcCCceechHHhhccHHHHHHHHHHHHHHhhCcccc--------- Confidence 755433221111111111 23322111 122445778889999999999999999999999964 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeecccc Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAG 159 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~ 159 (518) .++.+++|+.+||++||+++||+.++.+++++|++|++++|+..|.+++|+|++|++|++..+.++....|.+..... 
T Consensus 64 --~~~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~~~~~y~~~~~~~ 141 (397) T protein:vir:38 64 --ESDRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQDGSGLIYNINFDEP 141 (397) T ss_pred --cccHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEeccc Confidence 467889999999999999999999999999999999999999999999999999999999999888888888877766 Q ss_pred cCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHH Q lcl|NC_021305. 160 VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQF 239 (518) Q Consensus 160 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~ 239 (518) .++..+.|++++|||++++++++..+|+||+.++..++....++++++.++|+||++|+++|++++.+++++.+++++.| T Consensus 142 ~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~e~~~~~~~~~ 221 (397) T protein:vir:38 142 AIGYMENVPAADVIHIRLLSKNGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKGGLLDAETRIARSK 221 (397) T ss_pred cccceeEecCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHH Confidence 77778899999999999999999889999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhh Q lcl|NC_021305. 240 DRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMA 319 (518) Q Consensus 240 ~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~ 319 (518) +..+.+ .|+|+++|+++|++|++++.++.++||.+++++.+++||++|||||.+||+.+.+ ++|.+++ ..||.+||. 
T Consensus 222 ~~~~~~-~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~-~~~~e~~-~~~~~~~l~ 298 (397) T protein:vir:38 222 EISKQI-HNSDGPVVIDALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQ-QSSITQI-SGQYAKSLN 298 (397) T ss_pred HHHhcc-cccCCceecCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc-ccHHHHH-HHHHHHHHH Confidence 887655 7899999999999999999999999999999999999999999999999987654 4666654 678899999 Q ss_pred HHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccc Q lcl|NC_021305. 320 IPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSAL 399 (518) Q Consensus 320 P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~ 399 (518) |++..|+++||++|++.. +|++..+++.|.+++++.+++++++|++|+||+|+++|++|++ +||.+...... T Consensus 299 P~~~~ie~~ln~~l~~~~------~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~--~~d~~~~~~~~ 370 (397) T protein:vir:38 299 RYVQAIVGELNDKLHANI------SANIRFAIDAMGDQYASTISSSVKGGTIAGNQARFILQNSGYL--AKDLPDPEKEP 370 (397) T ss_pred HHHHHHHHHHHHhccChh------cccccccccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CCccccccccc Confidence 999999999999999753 3555566889999999999999999999999999999999996 67765433333 Q ss_pred cccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccc Q lcl|NC_021305. 400 QPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPG 439 (518) Q Consensus 400 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 439 (518) .+........ + ++...+++++.. 
.+++ T Consensus 371 ~~~~~~~~~~----~-------g~~~~~~~~e~~--~~~~ 397 (397) T protein:vir:38 371 QQAIQLIQQE----G-------GENDGNNSDERG--SDPE 397 (397) T ss_pred cccccccccc----c-------CCCCCCCCCCCC--CCCC Confidence 2221111000 0 000000000000 0011 No 50 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=100.00 E-value=2.7e-83 Score=473.29 Aligned_cols=399 Identities=16% Similarity=0.162 Sum_probs=324.1 Q ss_pred CcCCCC-CCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANG-QTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) =||.+- +...+........+.. .++.. ...+...+....++++++|++||++||++||++||++|+.++++.+ T Consensus 2 g~f~~~~~~~~~~~~~~~~~~~~-~~~~~-----~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 75 (406) T protein:vir:95 2 GLFDRWRRTKRKSKIRADTGYVG-LFMSG-----EDVSFLVPGYVRLSDNPEVRMAVHKIADLISSMTIYLMQNTEDGDI 75 (406) T ss_pred cchhhhccccccccccccchhhh-hhccC-----cccCccccCHHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Confidence 133221 1112222222222222 22111 1122344567788999999999999999999999999999998888 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCe--EEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeecc Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGET--YLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAG 157 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~--~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~ 157 (518) +..+++.++|+.+||++||+++||+.++.+++++|++ |+++.|+..|.+.+|||++|++|++..+.++..+. +. 
T Consensus 76 ~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l~~i~~~~v~~~~~~~~~~~~----~~ 151 (406) T protein:vir:95 76 RIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDELVPLTPSKVNFLDTPDGYQVL----YG 151 (406) T ss_pred eecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEEEEEcCceeEEEEcCCeEEEE----ec Confidence 8889999999999999999999999999999999765 55677899999999999999999999888753332 21 Q ss_pred cccCceeEEeccccEEEEecc-CCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHH Q lcl|NC_021305. 158 AGVGTQLVSFADDEVVPIRFF-NPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLR 236 (518) Q Consensus 158 ~~~~~~~~~~~~~evih~~~~-~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~ 236 (518) + ..|++++|||++++ ++....+|+||+..+..++....++++++.++|+||+.|+++++.++.+++++.++++ T Consensus 152 ----~--~~~~~~evih~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~ 225 (406) T protein:vir:95 152 ----G--QTFNYDEVLHFIYNPDPERPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDAATAELSSEEGR 225 (406) T ss_pred ----c--EEEchhHEEEeeccCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHH Confidence 2 36899999999964 4445568999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhcCccccCCeeecCC-Ccceeecc-CChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHH Q lcl|NC_021305. 237 EQFDRAHSGSSNTGKTMVVEE-GMEPIPLQ-LTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFY 314 (518) Q Consensus 237 ~~~~~~~~g~~n~g~~~vl~~-g~~~~~l~-~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~ 314 (518) ++|.+.+.|..|+|+++|++. +.+++++. .++.|+||+|+++++.++||++|||||++||.. 
++.+++..+|+ T Consensus 226 ~~~~~~~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~-----~~~~~~~~~~~ 300 (406) T protein:vir:95 226 NAVFKKYLQATEAGQPWIIPAELLEVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLLGIG-----EFNRDEYNNFI 300 (406) T ss_pred HHHHHHhccccccCCceeecCCCccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCC-----CchHHHHHHHH Confidence 999999999999999988865 45666764 689999999999999999999999999999753 35688889999 Q ss_pred HHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceee Q lcl|NC_021305. 315 RDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELY 394 (518) Q Consensus 315 ~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~ 394 (518) ++||.|+++.|+++|+++|+++. .++++||++.+++.|.+++++.+.+++++|++|+||+|+++|++|++ |||+++ T Consensus 301 ~~~l~P~~~~ie~~l~~~l~~~~--~~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~~--~gd~~~ 376 (406) T protein:vir:95 301 NSTILPIAKGIEQELTRKLLISP--DLYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPKE--GLSELV 376 (406) T ss_pred HHHHHHHHHHHHHHHHHhcCCCC--CcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--Ccceee Confidence 99999999999999999999753 46899999999999999999999999999999999999999999995 799999 Q ss_pred ecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCc Q lcl|NC_021305. 395 ANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSV 437 (518) Q Consensus 395 ~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (518) +|+|++|++.....+... .++...++ ++. 
+ T Consensus 377 ~~~n~~~~~~~~~~~~~k--------~g~~~~~~-~~~----~ 406 (406) T protein:vir:95 377 ILENYIPLDKIGDQSKLK--------GGDNSGAD-GQT----D 406 (406) T ss_pred eccCccchhhcccccccC--------CCCCCCCC-CCC----C Confidence 999999987654322211 01100000 000 0 No 51 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=100.00 E-value=3.6e-83 Score=472.55 Aligned_cols=385 Identities=15% Similarity=0.218 Sum_probs=311.0 Q ss_pred CcCCCCCCCCcccccccchhhhhhhccc-------ccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEe Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYA-------PAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFT 73 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~ 73 (518) |=| ..|+.+....+ ......+......+.+.|+++++|++||++||++||+|||+++++ T Consensus 1 mg~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~v~~cv~~Ia~~ia~~p~~v~~~ 66 (403) T protein:vir:10 1 MGF--------------KSWITEKLNPGQRIIRDMEPVSHRTNRKPFTTGQAYSKIEILNRTANMVIDSAAECSYTVGDK 66 (403) T ss_pred Ccc--------------hhhhhhccchhhhhhhcccccccccCCcccccHHHHHHHHHHHHHHHHHHHHHhhCceeEeec Confidence 322 22222211110 001111223344567889999999999999999999999999976 Q ss_pred cCCc---ceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceee Q lcl|NC_021305. 74 SGDT---ETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRY 150 (518) Q Consensus 74 ~~~~---~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~ 150 (518) .... .....|++.++|+.+||++||+++||+.++.+++++||+|+++.+ ..|++++++.+++..+.++... 
T Consensus 67 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~~------~~l~~l~~~~~~v~~~~~~~~~ 140 (403) T protein:vir:10 67 YNIVTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWDG------TSLYHVPAALMQVEADANKFIK 140 (403) T ss_pred ccccccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEeC------ceeEeecCcceEEEEcCCceEE Confidence 5422 234467777888999999999999999999999999999988753 3589999999999887665544 Q ss_pred EEeeecccccCceeEEeccccEEEEeccCC----CCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCcc Q lcl|NC_021305. 151 EYYFQAGAGVGTQLVSFADDEVVPIRFFNP----DGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKR 226 (518) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~evih~~~~~~----~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~ 226 (518) .|.+ . ....+.+++|+||+..++ .+..+|+||+.++..++....+++++..++|+||++|++||+.++. T Consensus 141 ~~~~--~-----~~~~~~~~eiih~~~~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~ 213 (403) T protein:vir:10 141 KFIF--N-----NQINYRVDEIIFIKDNSYVCGTNSQISGQSRVATVIDSLEKRSKMLNFKEKFLDNGTVIGLILETDEI 213 (403) T ss_pred EEEe--c-----CceeecccceEEecccccccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC Confidence 3322 1 124678899999996543 3557899999999999999999999999999999999999999999 Q ss_pred CCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccC--ChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccC Q lcl|NC_021305. 227 LSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQL--TAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFS 304 (518) Q Consensus 227 ~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s 304 (518) +++++.+++++.|++.++|..|+|+++||++|++|++++. ++.|+||+++++++.++||++|||||++||. 
++++ T Consensus 214 l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~---~~~s 290 (403) T protein:vir:10 214 LNKKLRERKQEELQLDYNPSTGQSSVLILDGGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDG---GNNA 290 (403) T ss_pred CCHHHHHHHHHHHHHHhCCcccCcceeecCCCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCC---CCCc Confidence 9999999999999999999999999999999999999975 5789999999999999999999999999974 5689 Q ss_pred CHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhh--hhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Q lcl|NC_021305. 305 NISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDV--IQPDWEAKSESTQKMVNSGVATPNEGREIMGL 382 (518) Q Consensus 305 n~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l--~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~ 382 (518) |.+++.+.|+++||.||++.|+++|+++|. ++++||++.+ ++.|.+++++++.+++++|++|+||+|+++|+ T Consensus 291 n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~------~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl 364 (403) T protein:vir:10 291 NIRPNIELFYYMTIIPMLNKLTSSLTFFFG------YKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNL 364 (403) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHhcC------ceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 999999999999999999999999999983 4567777755 89999999999999999999999999999999 Q ss_pred CCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCc Q lcl|NC_021305. 383 PRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQ 431 (518) Q Consensus 383 ~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (518) +|+++++||++++|+|+.......+++. 
.++++.+.+++ T Consensus 365 ~pi~~~~~d~~~~p~n~~~~~~~~~~~e----------~~~~~~~~~g~ 403 (403) T protein:vir:10 365 EPLDDEQMNKIRIPANVAGSATGVSGQE----------GGRPKGSTEGD 403 (403) T ss_pred CCCCcccccccccccccccccccCCCCc----------CCCCCCCcCCC Confidence 9999999999999999864322211111 11111111111 No 52 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=100.00 E-value=2e-82 Score=468.43 Aligned_cols=394 Identities=15% Similarity=0.186 Sum_probs=315.8 Q ss_pred Cc--CCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 1 ML--LANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~--f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) |+ |+++ +...+.. . ..|+. +......... .. ...+.++|+|++||++||++||++|+++|++.+++. T Consensus 3 ~~~~f~~k-~~~~~~~-~-~~~~~---~~~~~~~~~~----~~-~~~~~~~~~V~~~I~~ia~~iA~~p~~~~~~~~~g~ 71 (403) T protein:vir:80 3 LFNFFRRK-TRSEPTN-A-ISWFL---TQEAYDTLAI----PG-YTRLSDNPEVRMAVHKIAELISSMTIHLMQNTDNGD 71 (403) T ss_pred cccccccc-ccccccc-h-hhhhc---cccccccccc----ch-hhhhhhhHHHHHHHHHHHHhhhhCceEEEEecCCce Confidence 32 4432 2111111 1 11111 1111101110 11 123567899999999999999999999999988887 Q ss_pred eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHc--CCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeec Q lcl|NC_021305. 79 TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIY--GETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQA 156 (518) Q Consensus 79 ~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~--G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~ 156 (518) +...+++.++|+.+||+.||+++||+.++.++++. |++|+++.++..|++.+||||+|+.|++..+.++..++|. 
T Consensus 72 ~~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~~~L~~l~p~~v~~~~~~~g~~~~y~--- 148 (403) T protein:vir:80 72 IRIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLIDELIPLAPSKVSFVDTDTGYQIWYQ--- 148 (403) T ss_pred eecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcEEEEEEEcCCeeEEEEcCCceEEEEe--- Confidence 77788888889999999999999999999999985 7799999999999999999999999999988877554432 Q ss_pred ccccCceeEEeccccEEEEec-cCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHH Q lcl|NC_021305. 157 GAGVGTQLVSFADDEVVPIRF-FNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRL 235 (518) Q Consensus 157 ~~~~~~~~~~~~~~evih~~~-~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~ 235 (518) ...|+.++||||+. +++.+..+|+||+..+..++....+++++..++|+||+.|++||++++.+++++.+++ T Consensus 149 -------~~~~~~~eiih~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~ 221 (403) T protein:vir:80 149 -------GKAYNYDEVLHFIVNPDPEKPYMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDAATAELSSEEG 221 (403) T ss_pred -------ecccchhhEEEEeccCCCcCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCChHHHHHH Confidence 13578999999995 4556667899999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhcCccccCCeeecCCCc-ceeecc-CChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHH Q lcl|NC_021305. 236 REQFDRAHSGSSNTGKTMVVEEGM-EPIPLQ-LTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAF 313 (518) Q Consensus 236 ~~~~~~~~~g~~n~g~~~vl~~g~-~~~~l~-~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~ 313 (518) +++|.+.+.+..++|++++++.+. ++.++. .++.|+||+|.++++..+||++|||||++||.. 
++.+++..+| T Consensus 222 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~-----~~~~~~~~~f 296 (403) T protein:vir:80 222 RNAVFKKYLEASEAGQPWIIPAELLDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPAFLLGVG-----KYDKDEYNNF 296 (403) T ss_pred HHHHHHHHhhhhhcCCeeeecccccccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHcCCC-----CccHHHHHHH Confidence 999999999999999999987654 455543 578999999999999999999999999999852 2335666789 Q ss_pred HHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCccee Q lcl|NC_021305. 314 YRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADEL 393 (518) Q Consensus 314 ~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~ 393 (518) +.+||.|+++.||++|+++|+++. .++++||.+.+++.|.+++++++.+++++|++|+||+|+++|+||++ |||++ T Consensus 297 ~~~~l~P~~~~ie~~l~~kll~~~--~~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~~--ggd~~ 372 (403) T protein:vir:80 297 INSTILPIAKGIEQELTRKLLISP--DLYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPKE--GLSEL 372 (403) T ss_pred HHHHHHHHHHHHHHHHHHhccCCC--CcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CCCeE Confidence 999999999999999999999753 46889999999999999999999999999999999999999999995 89999 Q ss_pred eecccccccccccccCCCCCCCCCCCCCccCCCCCCCccc Q lcl|NC_021305. 394 YANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSP 433 (518) Q Consensus 394 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (518) ++++|++|++........ +.++..++ +++.. 
T Consensus 373 ~~~~n~~pl~~~~~~~~~-----k~ge~~~~----~~~~~ 403 (403) T protein:vir:80 373 VILENYIPLDKIGDQNKL-----KGGEKGGA----DGQTD 403 (403) T ss_pred eecccccchhhccchhhc-----cCCCCCCC----CCCCC Confidence 999999999865443211 11111110 01000 No 53 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=100.00 E-value=1.9e-82 Score=468.64 Aligned_cols=466 Identities=11% Similarity=0.069 Sum_probs=321.4 Q ss_pred CcCCCCCCCCcccccccchhhhhhhccccccccc----ccccchhhhHHHhhcHHHHHHHHHHHHhhccCc--------- Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQ----LERQFSLYGGIYKNQPWVRTVIAKRAQALARLP--------- 67 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~--------- 67 (518) |.+.-.+..-++.....++.+. .....+..+.+ ...........|.++|+|++||+.||+.||+++ T Consensus 39 ~~~~~~k~~~~~~~a~~~~~~~-~~~~~~~~~~r~~~~~~~~l~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g 117 (551) T protein:vir:80 39 EQEQISKAMNNKEVAYSQPVIG-SMSANPGFKTKPSIRNNQDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKG 117 (551) T ss_pred cHHHHHHhhccCcceeeccccc-ceecCcccccCccccChhHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCC Confidence 2222111111111111222221 00111111111 111112334568889999999999999999854 Q ss_pred --eEEEEecCCccee----ccchHHHHHHhcCCcC-----CCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCC Q lcl|NC_021305. 68 --VKCMFTSGDTETE----ESDTGYAKLLADPCEY-----LDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHP 136 (518) Q Consensus 68 --~~v~~~~~~~~~~----~~~~~~~~L~~~PN~~-----~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p 136 (518) |.+.-++.+.... ...+.+..++.+||+. 
+|+.+|++.++.+++++|++|++++|+..|++++||||+| T Consensus 118 ~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s~~~f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p 197 (551) T protein:vir:80 118 VGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDP 197 (551) T ss_pred CCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCccchHHHHHHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCC Confidence 4443222221111 1123455678899987 4888999999999999999999999999999999999999 Q ss_pred ceeEEEEcCCceeeEEe-eecccccCceeEEeccccEEEEeccCCC---CcccCchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 137 SRVAIKRNSRTGRYEYY-FQAGAGVGTQLVSFADDEVVPIRFFNPD---GLERGLSLMESLKSTIFSEDSSRNATAAMWK 212 (518) Q Consensus 137 ~~v~v~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~evih~~~~~~~---~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ 212 (518) .+|++..+.++...... +++....++..+.|++++|||+++++.. ...||+|||.++..+|....++++++.++|+ T Consensus 198 ~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~eiiH~~~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ 277 (551) T protein:vir:80 198 TTIFFATTADGKIPDNGNRFVQVIDQKIVATFNAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFS 277 (551) T ss_pred ceeEEEECCccccccCceEEEEEeCCcEEEEEcccceEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999988776542111 1112233556778999999999976543 3568999999999999999999999999999 Q ss_pred ccCCcccccccCc--cCCHHHHHHHHHHHHHHhcCccccCCeeec-CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhc Q lcl|NC_021305. 
213 NAGRPNLVLRHEK--RLSEAAQQRLREQFDRAHSGSSNTGKTMVV-EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYD 289 (518) Q Consensus 213 ng~~p~~il~~~~--~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl-~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fg 289 (518) ||++|+|||++++ .+++++.+++++.|++.++|..|+|+++|| ++|++|++++.++.|+||++++++.+++||++|| T Consensus 278 Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFg 357 (551) T protein:vir:80 278 HGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYG 357 (551) T ss_pred cCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCccccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhc Confidence 9999999998754 489999999999999999999999998776 6899999999999999999999999999999999 Q ss_pred CCHHHhccccc----------cccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHH Q lcl|NC_021305. 290 IAPPIVHILDR----------ATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKS 359 (518) Q Consensus 290 VPp~~lg~~~~----------~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~ 359 (518) |||++||+.+. .|++|++++...|+++||.||+.+||++||++|++.++..+ +|+++.+...+..+++ T Consensus 358 VPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~~~~--~f~f~~~~~~~~~~~~ 435 (551) T protein:vir:80 358 IDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIVAEFGDKY--TFQFVGGDIKSELESV 435 (551) T ss_pred CCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccCCce--EEEeeccChhhHHHHH Confidence 99999998654 37899999999999999999999999999999998766544 4555567777777777 Q ss_pred HHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCC-----CC-CccCCCCCCCccc Q lcl|NC_021305. 360 ESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAP-----KR-PASTPVASLDQSP 433 (518) Q Consensus 360 ~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~-----~~-~~~~~~~~~~~~~ 433 (518) ++++ ++.+|+||+||+|+++|++|.. +|||+++.|.++.+++........+...... .. 
.+.....++.+++ T Consensus 436 ~~~~-~~~~g~lT~NE~R~~~gl~P~~-egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p 513 (551) T protein:vir:80 436 KILA-EKAKVAMTVNEVRKELNLPGDV-IGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIP 513 (551) T ss_pred HHHH-HHhcCCcCHHHHHHHhCCCCCC-CCCceeecccccccccccccccCcchhhhhhccccccCcCCCCCCCCCCCCC Confidence 7654 6678999999999999999843 5999999999998876544332211111000 00 0000111111111 Q ss_pred cCCccccccchhcchhhHHHHHHHHhhcccCCchhhHHHHHHHHHhhccccCcCchhHHH Q lcl|NC_021305. 434 PTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAVKGAMGRGKDIKGFALQL 493 (518) Q Consensus 434 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (518) ..++.. ..++++...+..+..++.+++.+......++- T Consensus 514 ~~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 551 (551) T protein:vir:80 514 DGKDTT----------------------GDIGKDGQRKDKDNANAGKQGMKGDKPNDWQT 551 (551) T ss_pred CccccC----------------------CCccccccccCccccchhhhhcCCCCccccCC Confidence 111110 01112222222223333333333333333332 No 54 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=100.00 E-value=7.2e-82 Score=465.43 Aligned_cols=484 Identities=14% Similarity=0.119 Sum_probs=329.0 Q ss_pred CcCCCC----CCCCc----ccccccchhhhhhhccccccccccc----ccchhhhHHHhhcHHHHHHHHHHHHhhccC-- Q lcl|NC_021305. 1 MLLANG----QTLSA----PAMAELSPQMQDSYYYAPAVGMQLE----RQFSLYGGIYKNQPWVRTVIAKRAQALARL-- 66 (518) Q Consensus 1 ~~f~~~----~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l-- 66 (518) -.|.+- ...++ .......|.+-...+.......+.. 
.+...+...+.++|+|++||++||++||++ T Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~~~p~~~~~~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~~~ 112 (576) T protein:vir:96 33 ANIRNIEEKSKELNKSLYGKQQAYAEPFLEVMDTNPEFRTKRSYMKNSDNLHDVLKQFGNNPILNAIILTRSNQVAMYCQ 112 (576) T ss_pred HHHHHhhhhhhhhccccCCccchhhcceeeeeecCCCccccCcchhhhhhhHHHHHHhhcCHHHHHHHHHHHHHHHhhhh Confidence 111111 00111 1111122211111111111111111 111222345678999999999999999973 Q ss_pred ---------ceEEEEecCCcc--e--ecc----chHHHHHHhcCCcC-CCHHHHHHHHHHHHHHcCCeEEEEEEcC--CC Q lcl|NC_021305. 67 ---------PVKCMFTSGDTE--T--EES----DTGYAKLLADPCEY-LDPFAFWEWVASTLDIYGETYLAIQKNK--SG 126 (518) Q Consensus 67 ---------~~~v~~~~~~~~--~--~~~----~~~~~~L~~~PN~~-~s~~~f~~~~v~~ll~~G~~~~~i~r~~--~G 126 (518) +|.+..+..++. . ... ++.+..++..|||+ +|+.+||+.++.+++++||+|++++++. .| T Consensus 113 ~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~~g 192 (576) T protein:vir:96 113 PSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIVRDTYTYDQVNFEKVFNKKNAT 192 (576) T ss_pred hhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHHHHHHhcCCeEEEEEEecCCCC Confidence 333333222211 1 111 22334445566766 5999999999999999999999998654 57 Q ss_pred ceEEEEeeCCceeEEEEcCCceeeEEeee-cccccCceeEEeccccEEEEeccCCC---CcccCchHHHHHHHHHHHHHH Q lcl|NC_021305. 127 TPEKLMPMHPSRVAIKRNSRTGRYEYYFQ-AGAGVGTQLVSFADDEVVPIRFFNPD---GLERGLSLMESLKSTIFSEDS 202 (518) Q Consensus 127 ~~~~l~~l~p~~v~v~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~evih~~~~~~~---~~~~G~s~l~~~~~~i~~~~~ 202 (518) ++++||||+|.+|++..+.++..+.+... +....++....+++++|||++++... 
...||+|||.++..+|....+ T Consensus 193 ~~~~L~pl~p~~V~v~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~dii~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~ 272 (576) T protein:vir:96 193 TMDKFIAVDPSTIFYATDKNGKIIKGGKRFVQVINKKVVASFTSREMAMGIRNPRTELSSSGYGLSEVEIAMKQFIAYNN 272 (576) T ss_pred ceEEEEEeCCceeEEEECCCCceeeeeeEEEEecCCceEEEecccceEEEeecCCCCcccCcccccHHHHHHHHHHHHHH Confidence 89999999999999999988876544322 22334566788999999887654332 246899999999999999999 Q ss_pred HHHHHHHHHHccCCcccccccCc--cCCHHHHHHHHHHHHHHhcCccccCCe-eecCCCcceeeccCChhhHHHHHHHHH Q lcl|NC_021305. 203 SRNATAAMWKNAGRPNLVLRHEK--RLSEAAQQRLREQFDRAHSGSSNTGKT-MVVEEGMEPIPLQLTAVEMQFIEARQL 279 (518) Q Consensus 203 ~~~~~~~~~~ng~~p~~il~~~~--~~~~~~~~~~~~~~~~~~~g~~n~g~~-~vl~~g~~~~~l~~~~~d~~~~e~~~~ 279 (518) +++++.++|+||++|+|||++++ .+++++++++++.|++.++|..|+|++ +|+++|++|+++++++.|+||++++++ T Consensus 273 ~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~~~~ls~~~~d~qfle~~~~ 352 (576) T protein:vir:96 273 TETFNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGINGSWQVPVVMADDIKFVNMTPTANDMQFEKWLTY 352 (576) T ss_pred HHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEeccCChhhHHHHHHHHH Confidence 99999999999999999999865 579999999999999999999999995 889999999999999999999999999 Q ss_pred HHHHHHHHhcCCHHHhcccccc-----------ccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecch Q lcl|NC_021305. 
280 NREEVCGVYDIAPPIVHILDRA-----------TFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDID 348 (518) Q Consensus 280 ~~~~Ia~~fgVPp~~lg~~~~~-----------~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~ 348 (518) .+++||++|||||++||+.+.+ |++|++++.+.|+++||.||+.+||++||++|++..+..++++| T Consensus 353 ~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~f--- 429 (576) T protein:vir:96 353 LINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDLINTHIISEYSDKYVFQF--- 429 (576) T ss_pred hHHHHHHHhCCCHHHccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhccCceEEEe--- Confidence 9999999999999999987644 78999999999999999999999999999999987766554454 Q ss_pred hhhhcCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCC-CC-CccC Q lcl|NC_021305. 349 DVIQPDWEAKSESTQKM--VNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAP-KR-PAST 424 (518) Q Consensus 349 ~l~~~d~~~~~~~~~~~--~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~-~~-~~~~ 424 (518) ++.|.+++++.+..+ +.+|+||+||+|+++|+||++ |||+++.|.++.+++..........+.++. .. ..+. T Consensus 430 --~r~d~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~pie--gGD~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~ 505 (576) T protein:vir:96 430 --VGGDTKSELDKIKILQEEVKTYKTVNEARKEKGLKPIE--GGDVLLDGSFIQSMSLNTQKEQYEDTKQKERFDMIQQF 505 (576) T ss_pred --ccCCHHHHHHHHHHHHHHhcCccCHHHHHHHhCCCCCC--CcceeccccccccccccccCCCCCCccccccccccccc Confidence 577888888877654 567999999999999999995 899999999998876543322211111111 00 0000 Q ss_pred CCCCCCccccCCcccc-ccchhcchhhHHHHHH---HHhhcccCCchhhHHHHHHHHHhhccccCcCchhHHHH Q lcl|NC_021305. 425 PVASLDQSPPTSVPGL-SPTNSDRSTDSGKTEP---RRLMQKPPPKESSPKHLRAVKGAMGRGKDIKGFALQLA 494 (518) Q Consensus 425 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~---~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 494 (518) ...+....+ .+... ..+......+....+. 
+.-..|...+..+.||...+.++||+|++ ++..|+-- T Consensus 506 ~~~~~~~~~--~~~s~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 576 (576) T protein:vir:96 506 LNSPDDEEP--QQESTEDKVDGRESNDPTKIDSPVGTDGQLKDQDNVKSQEGSNKGQGTKGKGNE-KPSDFKNN 576 (576) T ss_pred cCCCCCCCC--CCCCCCCcccccccccCCCCCCccccccccCCCCcccccccccccccccccCCC-CcccccCC Confidence 000000000 00000 0000001111111111 12234555566667777777788887774 44455521 No 55 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=100.00 E-value=6.8e-82 Score=465.58 Aligned_cols=346 Identities=18% Similarity=0.270 Sum_probs=303.0 Q ss_pred hccCceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEE Q lcl|NC_021305. 63 LARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIK 142 (518) Q Consensus 63 ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~ 142 (518) ||+|||++|+++. ...|++.++|+.+||++||+.+||+.++.+++++||+|++++|+..|++++|+||+|++|++. T Consensus 1 ia~lp~~~~~~~~----~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~l~~~~v~~~ 76 (348) T protein:vir:93 1 MASLPLKMYEDYK----VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEML 76 (348) T ss_pred CcccceEeEecCc----CcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCCceEEE Confidence 9999999998653 335666667778999999999999999999999999999999999999999999999999999 Q ss_pred EcCCceeeEEeeecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccc Q lcl|NC_021305. 143 RNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLR 222 (518) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~ 222 (518) .+.++..+.|.+... 
++..+.|++++||||+++++.+..+|+||+..+..++....++++++ ++.++..++++++ T Consensus 77 ~~~~~~~~~y~~~~~---~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~ 151 (348) T protein:vir:93 77 IENQSRELYYSIHAA---TGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLK 151 (348) T ss_pred EeCCCcEEEEEEEcC---CCeEEEEccccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHH--HHhcCCCceeEEe Confidence 988888877765433 35567899999999999888777899999999999999999998886 4444555678889 Q ss_pred cCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccc Q lcl|NC_021305. 223 HEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRAT 302 (518) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~ 302 (518) .++.+++++.++++++|++.+. |+|+++|+++|++|++++.+++|+||.|+++++.++||++|||||.+||..+++| T Consensus 152 ~~~~l~~e~~~~~~~~~~~~~~---n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~ 228 (348) T protein:vir:93 152 YGSNVSTEKRQQVLEDFKQYYE---ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTN 228 (348) T ss_pred cCCCCCHHHHHHHHHHHHHHhh---cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC Confidence 9999999999999999999873 6789999999999999999999999999999999999999999999999999999 Q ss_pred cCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcc--cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh Q lcl|NC_021305. 
303 FSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVR--KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIM 380 (518) Q Consensus 303 ~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~--~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~ 380 (518) ++|.+++.+.|+++||.|+++.|+++||++|++..++ +++++||.+.+++.|.+++++++.+++++|++|+||+|+++ T Consensus 229 ~~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~~ 308 (348) T protein:vir:93 229 FAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWE 308 (348) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh Confidence 9999999999999999999999999999999987654 67899999999999999999999999999999999999999 Q ss_pred CCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCc Q lcl|NC_021305. 381 GLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPA 422 (518) Q Consensus 381 g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~ 422 (518) |++|+| |||++++++|++|++.....+...+...+.++.+ T Consensus 309 g~~p~~--ggD~~~~~~n~~~~~~~~~~~~~~~gg~~n~~~~ 348 (348) T protein:vir:93 309 DLPPVE--GGDKPLISGDLYPIDTPLELRKSLKGGDKNVNES 348 (348) T ss_pred CCCCCC--CcCeEeecccccccccchhhcccccCCCCCcCCC Confidence 999996 7999999999999876544332111111111100 No 56 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=100.00 E-value=1.8e-80 Score=457.76 Aligned_cols=471 Identities=12% Similarity=0.063 Sum_probs=318.6 Q ss_pred CcCCCCC-----CCCcccccccchhhh-hhhccccccc--ccccccchhhhHHHhhcHHHHHHHHHHHHhhccCc----- Q lcl|NC_021305. 1 MLLANGQ-----TLSAPAMAELSPQMQ-DSYYYAPAVG--MQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLP----- 67 (518) Q Consensus 1 ~~f~~~~-----~~~~~~~~~~~~~~~-~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~----- 67 (518) -|+...+ ..-+......++.+. +.++.+-... ...........+.|..+|+|++||+.||+.||++. 
T Consensus 30 ~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~~~~~~npiv~~~I~~~a~~ia~~~~~~~~ 109 (547) T protein:vir:63 30 AIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARH 109 (547) T ss_pred hhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhh Confidence 1111100 000011111122111 1111110000 00000112234567889999999999999999742 Q ss_pred ------eEEEEecCCcc----eeccchHHHHHHhcCCcCC-----CHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEE Q lcl|NC_021305. 68 ------VKCMFTSGDTE----TEESDTGYAKLLADPCEYL-----DPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLM 132 (518) Q Consensus 68 ------~~v~~~~~~~~----~~~~~~~~~~L~~~PN~~~-----s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~ 132 (518) |.+.-++.+.. .....+.+..++.+||+++ |+.+|++.++.+++++|++|++++|+..|++++|| T Consensus 110 ~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~~f~~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~ 189 (547) T protein:vir:63 110 SEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFV 189 (547) T ss_pred hccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchHHHHHHHHHHHHHhhCCEEEEEEECCCCcEEEEE Confidence 22211111111 1112244556778888874 88999999999999999999999999999999999 Q ss_pred eeCCceeEEEEcCCceeeEEe-eecccccCceeEEeccccEEEEeccCCCC---cccCchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 133 PMHPSRVAIKRNSRTGRYEYY-FQAGAGVGTQLVSFADDEVVPIRFFNPDG---LERGLSLMESLKSTIFSEDSSRNATA 208 (518) Q Consensus 133 ~l~p~~v~v~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~evih~~~~~~~~---~~~G~s~l~~~~~~i~~~~~~~~~~~ 208 (518) ||+|.+|++..+.++...... +++....++..+.|++++|||+++++..+ ..||+|||..+..+|....++++++. 
T Consensus 190 ~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~eiih~r~n~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~ 269 (547) T protein:vir:63 190 AKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFNAREMAFAVRNPRSDIYATGYGYPELEIALKQFIAHENTEAFND 269 (547) T ss_pred EecCceeEEEECCccccccCceEEEEEcCCcEEEEeccccEEEecccCCCCcccccccccHHHHHHHHHHHHHHHHHHHH Confidence 999999999987776432111 11112345567789999999999876543 45799999999999999999999999 Q ss_pred HHHHccCCcccccccCc--cCCHHHHHHHHHHHHHHhcCccccCCeeec-CCCcceeeccCChhhHHHHHHHHHHHHHHH Q lcl|NC_021305. 209 AMWKNAGRPNLVLRHEK--RLSEAAQQRLREQFDRAHSGSSNTGKTMVV-EEGMEPIPLQLTAVEMQFIEARQLNREEVC 285 (518) Q Consensus 209 ~~~~ng~~p~~il~~~~--~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl-~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia 285 (518) ++|+||++|+|||.+++ .+++++++++++.|++.++|..|+|+++|+ ++|++|++++.++.|+||++++++++++|| T Consensus 270 ~~f~Ng~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~G~~nagk~~vl~~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia 349 (547) T protein:vir:63 270 RFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVIS 349 (547) T ss_pred HHHHcCCCcceEEEecCCCCCCHHHHHHHHHHHHHHhcCcccccccccccCCCceEEEcCCChhHHHHHHHHHHHHHHHH Confidence 99999999999998764 489999999999999999999999998766 688999999999999999999999999999 Q ss_pred HHhcCCHHHhccccc----------cccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCH Q lcl|NC_021305. 286 GVYDIAPPIVHILDR----------ATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDW 355 (518) Q Consensus 286 ~~fgVPp~~lg~~~~----------~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~ 355 (518) ++|||||++||+.+. .|++|++++...|+++||.|++..||++||++|++.++..+ +|+++.+...+. 
T Consensus 350 ~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~~~ie~~ln~~L~~~~~~~~--~~~f~~~~~~~~ 427 (547) T protein:vir:63 350 ALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHIVAEFGDKY--TFQFVGGDIKSE 427 (547) T ss_pred HHhCCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCce--EEEeeccccccH Confidence 999999999998654 37899999999999999999999999999999998766544 455556677777 Q ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCC-CC-CCCccCCCCCCCccc Q lcl|NC_021305. 356 EAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAP-AP-KRPASTPVASLDQSP 433 (518) Q Consensus 356 ~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~ 433 (518) .++++++ +++.+|+||+||+|+++|++|.. +|||+++.|.++.+++............. .. ..+........+. T Consensus 428 ~~~~~~~-~~~~~g~lT~NE~R~~~gl~P~~-egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 503 (547) T protein:vir:63 428 LESVKIL-AEKAKVAMTVNEVRKELNLPGDV-IGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVST-- 503 (547) T ss_pred HHHHHHH-HHHhCCCcCHHHHHHHhCCCCCC-CCCceeecccccccccccccccCCccccchhhccccccccCCCCCC-- Confidence 7777655 57788999999999999999843 59999999999888765332221111100 00 0000000000000 Q ss_pred cCCccccccchhcchhhHHHHHHHHhhcccCCchhhHHHHHHHHHhhccccCcCchhHHH Q lcl|NC_021305. 434 PTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAVKGAMGRGKDIKGFALQL 493 (518) Q Consensus 434 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (518) .+..+++.++.. 
..++++...+..+..++.+++.+..+...++- T Consensus 504 ---~~~~~~~~~~~~-------------~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 547 (547) T protein:vir:63 504 ---DVEDIPDGKDTT-------------GDIGKDGQRKDKDNANAGKQGMKGDKPNDWQT 547 (547) T ss_pred ---CCCCCCCCcccC-------------CCcCccccccCccccchhhhhcCCCCccccCC Confidence 001111100000 12223333333333344444444444444442 No 57 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=100.00 E-value=2.4e-80 Score=457.04 Aligned_cols=386 Identities=11% Similarity=0.132 Sum_probs=305.5 Q ss_pred CcCCCCCC--CCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 1 MLLANGQT--LSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) =||.+-+. .+..+ ...|+.+.+++. ...++..++.+.++++++|++||++||++||+|||++|+++++ T Consensus 2 Gl~~~~~~~~~~~~~---~~~~~~~~~~~~-----~~~~~~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~v~~~~g~-- 71 (394) T protein:vir:62 2 GLRDRFSNYLFKKAE---KRGYLDNVLGKS-----IRYSGVYVTDSNILQSSDVYELLQDISNQMVLADIVVEDEFGN-- 71 (394) T ss_pred chhhhhhhhccCCCC---chhhhhhhhhcc-----cccCccccChhhhhccHHHHHHHHHHHHhhcccceEEEcCCCc-- Confidence 13332221 12222 234565555442 2233455778889999999999999999999999999986542 Q ss_pred eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccc Q lcl|NC_021305. 79 TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGA 158 (518) Q Consensus 79 ~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~ 158 (518) +..+|+.+.|+.+||++||+++||+.++.+++++|++|+++.++..+. +..+++..+..+.. .+.. 
T Consensus 72 -~~~~~~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~~~~--------~~~~~~~~~~~~~~---~~~~-- 137 (394) T protein:vir:62 72 -EIKDDIALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPILNGAQIHL--------ASNVFTELDDNLVE---HFNI-- 137 (394) T ss_pred -ccchhhHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEecceeec--------cccceEEECCceEE---EEee-- Confidence 456778889999999999999999999999999999999987654332 23455555444322 1211 Q ss_pred ccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCC--HHHHHHHH Q lcl|NC_021305. 159 GVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLS--EAAQQRLR 236 (518) Q Consensus 159 ~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~--~~~~~~~~ 236 (518) ..++|++++|||+|+++.++ .+|+||+..+..+|....+++++..++|+||+.|+++|++++.++ +++.++++ T Consensus 138 ----~~~~~~~~eiih~r~~~~d~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~ 212 (394) T protein:vir:62 138 ----GGHEIPPCMIRHVKNIGADH-LRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLI 212 (394) T ss_pred ----CCEEechhheEEecCcCCCC-ccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHH Confidence 12579999999999988776 689999999999999999999999999999999999999998776 45578999 Q ss_pred HHHHHHhcCccccCCeeecCCCc--ceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHH Q lcl|NC_021305. 237 EQFDRAHSGSSNTGKTMVVEEGM--EPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFY 314 (518) Q Consensus 237 ~~~~~~~~g~~n~g~~~vl~~g~--~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~ 314 (518) +.|++.++|..|+|+++|++.|. ++.+++.++.|+||+|+++++.++||++|||||.+||.. 
+++|.|++.+.|+ T Consensus 213 ~~~~~~~~g~~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~---~~sn~e~~~~~~~ 289 (394) T protein:vir:62 213 NAILDQLESIDEARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTEL---IKEDIEKAMMYIH 289 (394) T ss_pred HHHHHHhccccccCceeEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCC---CCcCHHHHHHHHH Confidence 99999999999999999998776 566888899999999999999999999999999999853 5689999999999 Q ss_pred HHHhhHHHHHHHHHHHHhhhhhhcc-cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCccee Q lcl|NC_021305. 315 RDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADEL 393 (518) Q Consensus 315 ~~~l~P~~~~ie~~l~~~l~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~ 393 (518) ++||.|++++||++|+++|+++.+. .++++||...++. ..++++++.+++++|++|+||+|+++|++|+++++||++ T Consensus 290 ~~~l~P~~~~ie~~l~~kll~~~~~~~~~~~fd~~~~~~--~~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~~gd~~ 367 (394) T protein:vir:62 290 NKAVRPIMKNFEDHLSLLFYAQNSGKRIKFKINILDFVT--YSNKTNIGYNLVRTAITSPDNVADMLGFPKQNTKESQAI 367 (394) T ss_pred HHHHHHHHHHHHHHHhhhhcCccccCceEEEechhhhcC--HHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCee Confidence 9999999999999999999876543 4556777666655 456788999999999999999999999999999999999 Q ss_pred eecccccccccccccCCCCCCCCCCCCCccCCCCCCCcc Q lcl|NC_021305. 394 YANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQS 432 (518) Q Consensus 394 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 432 (518) ++++|+++++....... 
+.+ +++++++ T Consensus 368 ~~~~n~~~~~~~~~~~~-------~~k-----gge~~en 394 (394) T protein:vir:62 368 YISNDVTEIGKKEATDG-------SLG-----GGEENEN 394 (394) T ss_pred ecccccccccccccccc-------cCC-----CCCCCCC Confidence 99999998864322111 001 1111111 No 58 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=100.00 E-value=3.2e-80 Score=456.42 Aligned_cols=388 Identities=14% Similarity=0.144 Sum_probs=311.4 Q ss_pred CcCCC---CC-CCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_021305. 1 MLLAN---GQ-TLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGD 76 (518) Q Consensus 1 ~~f~~---~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~ 76 (518) ||-+. -+ ...+++....+.|+...............++..++...++++++|++||++||++||++|++++++.. T Consensus 1 m~m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~~~~~~~- 79 (392) T protein:vir:74 1 MILPILNFINQTNDPPEAGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN- 79 (392) T ss_pred CcchhhhhhhcccCcccccccccccccCchhhhhhhccCCCCcccchhhhhcchHHHHHHHHHHHhhccCceeeccchh- Confidence 54332 11 12233333334443221111111222233456788889999999999999999999999999987543 Q ss_pred cceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeec Q lcl|NC_021305. 77 TETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQA 156 (518) Q Consensus 77 ~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~ 156 (518) ..|+.+||+.||+++||+.++.+++++|++|++++|+..|++++|+||+|++|++..+.+++...|.+.. 
T Consensus 80 ----------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~~y~~~~ 149 (392) T protein:vir:74 80 ----------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYNITF 149 (392) T ss_pred ----------hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEe Confidence 3588999999999999999999999999999999999999999999999999999999888888888887 Q ss_pred ccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHH Q lcl|NC_021305. 157 GAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLR 236 (518) Q Consensus 157 ~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~ 236 (518) .+...+..+.+++++|||++++++++..+|+||+.++..+|....++++++.++|+||+.|+++|++++....++ +.+ T Consensus 150 ~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~--~~~ 227 (392) T protein:vir:74 150 DDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSD--KDK 227 (392) T ss_pred cCCccceeEEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchH--HHH Confidence 776777788999999999999999998899999999999999999999999999999999999999987654443 345 Q ss_pred HHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHH Q lcl|NC_021305. 237 EQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRD 316 (518) Q Consensus 237 ~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~ 316 (518) +.|.+.+.|..|+|+++||++|++|++++.++.|+||++++++..++||++|||||++||+.+.. 
++.+++.+.|+++ T Consensus 228 ~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~--~~~~e~~~~~~~~ 305 (392) T protein:vir:74 228 ASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQ--QSSIQQISGMYAS 305 (392) T ss_pred HHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc--ccHHHHHHHHHHH Confidence 66777788888999999999999999999999999999999999999999999999999976543 3556778899999 Q ss_pred HhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeec Q lcl|NC_021305. 317 TMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYAN 396 (518) Q Consensus 317 ~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~ 396 (518) ||.|+++.|+++++++|++. ++||...+++.|.+++++.+.+++++|++|+||+|+++....+. + ++.... T Consensus 306 ~l~p~~~~ie~~l~~~l~~~------~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~-p--ne~r~~ 376 (392) T protein:vir:74 306 ALNRYLRPAISELEYKLSDH------ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYI-P--KDLPAP 376 (392) T ss_pred HHHHHHHHHHHHHHHhccch------hcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCC-c--cccchh Confidence 99999999999999999763 57888999999999999999999999999999999987332221 1 122222 Q ss_pred ccccccccccccCCCCCCCC Q lcl|NC_021305. 397 SALQPLGATPDGAVEWEEAP 416 (518) Q Consensus 397 ~n~~~~~~~~~~~~~~~~~~ 416 (518) .|+-|+. ++ .+.++.| T Consensus 377 enl~~~~---~G-d~~~p~p 392 (392) T protein:vir:74 377 ENTNKKT---TG-QSNEPVP 392 (392) T ss_pred cCCCCCC---CC-CCCCCCC Confidence 2322211 11 1111111 No 59 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=100.00 E-value=4.4e-79 Score=450.18 Aligned_cols=385 Identities=14% Similarity=0.149 Sum_probs=311.5 Q ss_pred CcCCCCC----CCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_021305. 
1 MLLANGQ----TLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGD 76 (518) Q Consensus 1 ~~f~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~ 76 (518) ||.+... ..+++.....+.|+...............++..++...++++++|++||++||++||++|++++++.. T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~- 79 (392) T protein:vir:10 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN- 79 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchh- Confidence 6655322 22344444444444321111111122223345678888999999999999999999999999986542 Q ss_pred cceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeec Q lcl|NC_021305. 77 TETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQA 156 (518) Q Consensus 77 ~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~ 156 (518) ..|+.+||++||+++||+.++.+++++|++|++++|+..|++++|+|++|++|++..+.++..+.|.+.. T Consensus 80 ----------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~ 149 (392) T protein:vir:10 80 ----------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYNITF 149 (392) T ss_pred ----------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEe Confidence 3588999999999999999999999999999999999999999999999999999999888888888887 Q ss_pred ccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHH Q lcl|NC_021305. 
157 GAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLR 236 (518) Q Consensus 157 ~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~ 236 (518) .+..++..+.|+++||||++++++++..+|+||+.++..++....++++++.++|+||+.|+|+|++++....++ +.+ T Consensus 150 ~~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~--~~~ 227 (392) T protein:vir:10 150 DDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSD--KDK 227 (392) T ss_pred cCcccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchH--HHH Confidence 777777788999999999999999988899999999999999999999999999999999999999987654432 335 Q ss_pred HHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHH Q lcl|NC_021305. 237 EQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRD 316 (518) Q Consensus 237 ~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~ 316 (518) +.|.+.+.+..|+|+++|+++|++|++++.++.|+||++++++++++||++|||||++||+... +++.+++.+.|+++ T Consensus 228 ~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~--~~~~~~~~~~f~~~ 305 (392) T protein:vir:10 228 ASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGD--QQSSIQQISGMYAS 305 (392) T ss_pred HHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC--cccHHHHHHHHHHH Confidence 6677778888899999999999999999999999999999999999999999999999997543 33556778899999 Q ss_pred HhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCCCCCCCccee Q lcl|NC_021305. 317 TMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIM---GLPRSDDPKADEL 393 (518) Q Consensus 317 ~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~---g~~p~~~~~gD~~ 393 (518) ||.|+++.|+++++++|++. 
++||...+++.|..++++.+.+++++|++|+||+|+++ |+.|.+ + T Consensus 306 ~l~P~~~~ie~~l~~~L~~~------~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e------~ 373 (392) T protein:vir:10 306 ALNRYLRPAISELEYKLSDH------ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKD------L 373 (392) T ss_pred HHHHHHHHHHHHHHHhcccc------ccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccc------c Confidence 99999999999999999753 57888888999999999999999999999999999987 554422 1 Q ss_pred eecccccccccccccCCCCCCCCCCCCCccCCCCCCCcccc Q lcl|NC_021305. 394 YANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPP 434 (518) Q Consensus 394 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (518) ....|+-|+. + ++++++++ T Consensus 374 r~~e~l~~~~---~-------------------Gd~~~p~p 392 (392) T protein:vir:10 374 PAPENTNKKT---T-------------------GQSNEPVP 392 (392) T ss_pred chhcCCCCCC---C-------------------CCCCCCCC Confidence 1112221111 0 11111111 No 60 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=100.00 E-value=4.4e-79 Score=450.18 Aligned_cols=385 Identities=14% Similarity=0.149 Sum_probs=311.5 Q ss_pred CcCCCCC----CCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_021305. 1 MLLANGQ----TLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGD 76 (518) Q Consensus 1 ~~f~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~ 76 (518) ||.+... ..+++.....+.|+...............++..++...++++++|++||++||++||++|++++++.. 
T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~- 79 (392) T protein:vir:39 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN- 79 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchh- Confidence 6655322 22344444444444321111111122223345678888999999999999999999999999986542 Q ss_pred cceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeec Q lcl|NC_021305. 77 TETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQA 156 (518) Q Consensus 77 ~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~ 156 (518) ..|+.+||++||+++||+.++.+++++|++|++++|+..|++++|+|++|++|++..+.++..+.|.+.. T Consensus 80 ----------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~ 149 (392) T protein:vir:39 80 ----------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYNITF 149 (392) T ss_pred ----------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEe Confidence 3588999999999999999999999999999999999999999999999999999999888888888887 Q ss_pred ccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHH Q lcl|NC_021305. 
157 GAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLR 236 (518) Q Consensus 157 ~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~ 236 (518) .+..++..+.|+++||||++++++++..+|+||+.++..++....++++++.++|+||+.|+|+|++++....++ +.+ T Consensus 150 ~~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~--~~~ 227 (392) T protein:vir:39 150 DDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSD--KDK 227 (392) T ss_pred cCcccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchH--HHH Confidence 777777788999999999999999988899999999999999999999999999999999999999987654432 335 Q ss_pred HHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHH Q lcl|NC_021305. 237 EQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRD 316 (518) Q Consensus 237 ~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~ 316 (518) +.|.+.+.+..|+|+++|+++|++|++++.++.|+||++++++++++||++|||||++||+... +++.+++.+.|+++ T Consensus 228 ~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~--~~~~~~~~~~f~~~ 305 (392) T protein:vir:39 228 ASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGD--QQSSIQQISGMYAS 305 (392) T ss_pred HHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC--cccHHHHHHHHHHH Confidence 6677778888899999999999999999999999999999999999999999999999997543 33556778899999 Q ss_pred HhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCCCCCCCccee Q lcl|NC_021305. 317 TMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIM---GLPRSDDPKADEL 393 (518) Q Consensus 317 ~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~---g~~p~~~~~gD~~ 393 (518) ||.|+++.|+++++++|++. 
++||...+++.|..++++.+.+++++|++|+||+|+++ |+.|.+ + T Consensus 306 ~l~P~~~~ie~~l~~~L~~~------~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e------~ 373 (392) T protein:vir:39 306 ALNRYLRPAISELEYKLSDH------ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKD------L 373 (392) T ss_pred HHHHHHHHHHHHHHHhcccc------ccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCccc------c Confidence 99999999999999999753 57888888999999999999999999999999999987 554422 1 Q ss_pred eecccccccccccccCCCCCCCCCCCCCccCCCCCCCcccc Q lcl|NC_021305. 394 YANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPP 434 (518) Q Consensus 394 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (518) ....|+-|+. + ++++++++ T Consensus 374 r~~e~l~~~~---~-------------------Gd~~~p~p 392 (392) T protein:vir:39 374 PAPENTNKKT---T-------------------GQSNEPVP 392 (392) T ss_pred chhcCCCCCC---C-------------------CCCCCCCC Confidence 1112221111 0 11111111 No 61 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=100.00 E-value=1.2e-78 Score=447.71 Aligned_cols=383 Identities=14% Similarity=0.136 Sum_probs=317.3 Q ss_pred CcCCCCCC-CCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANGQT-LSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) =||++... +++|... .+.|+... ...+ .....++..++.+.++++|+|++||++||++||+||+++|+. 
T Consensus 2 ~~f~~~~~~~~~~~~~-~~~~~~~~-~~~~--~~~~~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~p~~~~~~------ 71 (386) T protein:vir:48 2 PIFNITNLATESPPIS-QGGFFDIT-DPDF--LSTLNGSEWVSAESALRNSDLFSIINQLSNDLATVKLTASRK------ 71 (386) T ss_pred cccccccccccccccc-cccccccc-cchh--cccccCCceechhhhhcchHHHHHHHHHHHhhccCceeeccc------ Confidence 25554332 3333322 22222111 1111 112234556788889999999999999999999999999753 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeecccc Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAG 159 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~ 159 (518) ..+.|+.+||+.||+++||+.++.+++++|++|++++|+..|++++|+|++|++|++..+.++...+|.+...+. T Consensus 72 -----~~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~~ 146 (386) T protein:vir:48 72 -----QLQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDGIYYNITFDDP 146 (386) T ss_pred -----hhHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEEcCCCceEEEEEEecCc Confidence 356799999999999999999999999999999999999999999999999999999999888888888877766 Q ss_pred cCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHH Q lcl|NC_021305. 
160 VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQF 239 (518) Q Consensus 160 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~ 239 (518) ..+..+.|++++|||++++++++.++|+||+..+..++....++++++.++|+||++|+++|+.++.+++++.+++++.| T Consensus 147 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~~e~~~~~~~~~ 226 (386) T protein:vir:48 147 RIPPKQHVPQGDVLHFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGGGLLDFKTKLSRSR 226 (386) T ss_pred cccceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHH Confidence 66777899999999999999999889999999999999999999999999999999999999999999999999999998 Q ss_pred HHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhh Q lcl|NC_021305. 240 DRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMA 319 (518) Q Consensus 240 ~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~ 319 (518) .... .|+|+++||++|++|++++.++.++||+++++++.++||++|||||.++|+ .+++++.+++.+.|+++||. T Consensus 227 ~~~~---~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~--~~~~~~~e~~~~~~~~~~l~ 301 (386) T protein:vir:48 227 QAMK---QMQGGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGG--QGDQQSSLEMSLDLYNKAVS 301 (386) T ss_pred HHhh---cCCCCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCC--CCCcccHHHHHHHHHHHHHH Confidence 7643 578999999999999999999999999999999999999999999999986 45788999999999999999 Q ss_pred HHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceee-eccc Q lcl|NC_021305. 320 IPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELY-ANSA 398 (518) Q Consensus 320 P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~-~~~n 398 (518) |+++.||++|+++|++. +++++...+..|...++..+.+++++|++|+||+|+++|++|++ ++|... 
...| T Consensus 302 P~~~~ie~~l~~~l~~~------~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~--~~~~~~~~~~~ 373 (386) T protein:vir:48 302 RYLRPFLSELSQKLSCD------VDADILPAVDPTGSNSVSRINSMVKSGTLAQNQGLYILQQAEIL--PKELPEGENPN 373 (386) T ss_pred HHHHHHHHHHHHhhcch------hhcchhhhhccChHHHHHHHHHHHhCCCcCHHHHHHHhhcCCCC--CccchhhcCCC Confidence 99999999999999874 35666667788888899999999999999999999999999986 355321 1112 Q ss_pred ccccccccccCCCCCCCCCCCCCccCCCCC Q lcl|NC_021305. 399 LQPLGATPDGAVEWEEAPAPKRPASTPVAS 428 (518) Q Consensus 399 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (518) ..|+. .++.+.++ T Consensus 374 ~~~~~-----------------gGd~~~~~ 386 (386) T protein:vir:48 374 KTTLK-----------------GGEINGED 386 (386) T ss_pred CCccC-----------------CCCCCCCC Confidence 22211 00000000 No 62 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=100.00 E-value=1.4e-78 Score=447.38 Aligned_cols=375 Identities=16% Similarity=0.204 Sum_probs=305.2 Q ss_pred CcCCCCC--CCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 1 MLLANGQ--TLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) +|..++. ...........+.+...++.. 
.....++.+.|+++++|++||++||++||++||++++ T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~v~~------ 69 (385) T protein:vir:10 3 LLTPRNFNKRKAKNMVYPSNPAFFTTTVGG-------MQLSYVSALSALQNTNVYSVINRIASDVASAHFKTEN------ 69 (385) T ss_pred cccchhcccccccccccccchhhhhhhccc-------cCccccCHHHhhccHHHHHHHHHHHHHHhhCceeeec------ Confidence 3322222 222222222222222222211 1234577888999999999999999999999999964 Q ss_pred eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccc Q lcl|NC_021305. 79 TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGA 158 (518) Q Consensus 79 ~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~ 158 (518) |..+.|+.+||++||+++||+.++.+++++|++|++++++ ..+++|+++.+|++..+..+..+ .+.. T Consensus 70 -----~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~----~~~~~p~~~~~v~~~~~~~~~~~--~~~~-- 136 (385) T protein:vir:10 70 -----TATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ----NLEHIPNSDVQINYLPGNMGIVY--TVLE-- 136 (385) T ss_pred -----cchhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC----ceeEeecCCceEEEEEcCCceEE--EEEE-- Confidence 2344577899999999999999999999999999999875 46788999988888776655433 2222 Q ss_pred ccCceeEEeccccEEEEeccCCCC--cccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccC-CHHHHHHH Q lcl|NC_021305. 
159 GVGTQLVSFADDEVVPIRFFNPDG--LERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRL-SEAAQQRL 235 (518) Q Consensus 159 ~~~~~~~~~~~~evih~~~~~~~~--~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~-~~~~~~~~ 235 (518) ..++..+.|++++||||++.++++ ..+|+||+..+..++....+++++..++|+||++|+++|++++.+ ++++.+++ T Consensus 137 ~~~~~~~~~~~~eiihik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~ 216 (385) T protein:vir:10 137 SNDRPQMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESA 216 (385) T ss_pred cCCceEEEEccccEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHH Confidence 345677889999999999877654 568999999999999999999999999999999999999999766 57889999 Q ss_pred HHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHH-HHHHHHHHHHHHHhcCCHHHhcccc--ccccCCHHHHHHH Q lcl|NC_021305. 236 REQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFI-EARQLNREEVCGVYDIAPPIVHILD--RATFSNISAQMRA 312 (518) Q Consensus 236 ~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~-e~~~~~~~~Ia~~fgVPp~~lg~~~--~~~~sn~e~~~~~ 312 (518) ++.|++.++| .|+|+++|+++|++|++++.++.++|++ +.+++++++||++|||||++||..+ +.+++|.|++. . T Consensus 217 ~~~~~~~~~~-~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~-~ 294 (385) T protein:vir:10 217 REEFEKANTG-DNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIK-A 294 (385) T ss_pred HHHHHHHhCc-cccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHH-H Confidence 9999999877 7899999999999999999999999975 9999999999999999999999754 56788988764 4 Q ss_pred HHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcce Q lcl|NC_021305. 
313 FYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADE 392 (518) Q Consensus 313 ~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~ 392 (518) ++..||.|+++.|+++++++|+++ .++|+++.+++.|.+++++.+.+++++|++|+||+|+++|++|++.++||+ T Consensus 295 ~~~~~l~P~~~~ie~~l~~~l~~~-----~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~~~~~~ 369 (385) T protein:vir:10 295 TYLANLNSYVNPIVDELRLKMNAP-----DLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPE 369 (385) T ss_pred HHHHHHHHHHHHHHHHHHHhhCCc-----eEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccCCCCCcc Confidence 556799999999999999999753 489999999999999999999999999999999999999999999888888 Q ss_pred eeeccccccccccccc Q lcl|NC_021305. 393 LYANSALQPLGATPDG 408 (518) Q Consensus 393 ~~~~~n~~~~~~~~~~ 408 (518) +.++.+.+..+...+. T Consensus 370 ~~~~~~~~~~g~~~dn 385 (385) T protein:vir:10 370 FKPLTTQVKGGDEGDN 385 (385) T ss_pred ccCcccccCCCCCCCC Confidence 8877775432211110 No 63 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=100.00 E-value=1.4e-78 Score=447.41 Aligned_cols=467 Identities=14% Similarity=0.127 Sum_probs=317.3 Q ss_pred CcCCC---CCCCCcccccccchhhhhhhcccc-ccc----ccccccchhhhHHHhhcHHHHHHHHHHHHhhcc------- Q lcl|NC_021305. 1 MLLAN---GQTLSAPAMAELSPQMQDSYYYAP-AVG----MQLERQFSLYGGIYKNQPWVRTVIAKRAQALAR------- 65 (518) Q Consensus 1 ~~f~~---~~~~~~~~~~~~~~~~~~~~~~~~-~~~----~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~------- 65 (518) -+|.. +.....++ ...|.+. .+.... 
..+ .....+.......+..+++|++||+.+++.||+ T Consensus 41 ~~~~~~~~~~~~~~~a--~~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~ 117 (563) T protein:vir:95 41 KEYQDLTKSLYGQQQA--YAEPFIE-MMDTNPEFRDKRSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARY 117 (563) T ss_pred hhHHHHHhhhccCCCc--chhhhHh-hhcccccccccccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhh Confidence 22221 11112222 1233332 111111 111 111112233455677899999999999999985 Q ss_pred ------CceEEEEecCCcce--eccchHHHHHHh----cCCcC-CCHHHHHHHHHHHHHHcCCeEEEEE--EcCCCceEE Q lcl|NC_021305. 66 ------LPVKCMFTSGDTET--EESDTGYAKLLA----DPCEY-LDPFAFWEWVASTLDIYGETYLAIQ--KNKSGTPEK 130 (518) Q Consensus 66 ------l~~~v~~~~~~~~~--~~~~~~~~~L~~----~PN~~-~s~~~f~~~~v~~ll~~G~~~~~i~--r~~~G~~~~ 130 (518) +++++++.+..+.. ....++++.++. .|||+ +|+.+||+.++.+++++|++|++++ |+..|++++ T Consensus 118 ~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~ 197 (563) T protein:vir:95 118 SEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEK 197 (563) T ss_pred hcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEE Confidence 57777766654432 223344433332 33343 5899999999999999999999876 777899999 Q ss_pred EEeeCCceeEEEEcCCceeeEEeee-cccccCceeEEeccccEEEEecc-CCC--CcccCchHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 131 LMPMHPSRVAIKRNSRTGRYEYYFQ-AGAGVGTQLVSFADDEVVPIRFF-NPD--GLERGLSLMESLKSTIFSEDSSRNA 206 (518) Q Consensus 131 l~~l~p~~v~v~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~evih~~~~-~~~--~~~~G~s~l~~~~~~i~~~~~~~~~ 206 (518) ||||+|++|++..+.++..+..... 
+....++....|.++++||++.+ +.+ ...||+|||.++..+|....+++++ T Consensus 198 L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~ 277 (563) T protein:vir:95 198 FIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESF 277 (563) T ss_pred EEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHH Confidence 9999999999999887765432111 11223556678899998866544 332 2568999999999999999999999 Q ss_pred HHHHHHccCCcccccccCc--cCCHHHHHHHHHHHHHHhcCccccCCe-eecCCCcceeeccCChhhHHHHHHHHHHHHH Q lcl|NC_021305. 207 TAAMWKNAGRPNLVLRHEK--RLSEAAQQRLREQFDRAHSGSSNTGKT-MVVEEGMEPIPLQLTAVEMQFIEARQLNREE 283 (518) Q Consensus 207 ~~~~~~ng~~p~~il~~~~--~~~~~~~~~~~~~~~~~~~g~~n~g~~-~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~ 283 (518) +.++|+||++|+|||++++ .+++++++++++.|++.++|..|+|++ +|+++|++|++++.++.|+||++++++++++ T Consensus 278 ~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~ 357 (563) T protein:vir:95 278 NDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINI 357 (563) T ss_pred HHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHH Confidence 9999999999999999864 479999999999999999999999996 7899999999999999999999999999999 Q ss_pred HHHHhcCCHHHhcccccc-----------ccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhh Q lcl|NC_021305. 
284 VCGVYDIAPPIVHILDRA-----------TFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQ 352 (518) Q Consensus 284 Ia~~fgVPp~~lg~~~~~-----------~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~ 352 (518) ||++|||||++||+.+.+ +++|.+++.+.|+++||.||+.+||++||++|++..+..+.++| ++ T Consensus 358 Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~~~~~~~f-----~r 432 (563) T protein:vir:95 358 ISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEYGDKYTFQF-----VG 432 (563) T ss_pred HHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhcccccEEEe-----cc Confidence 999999999999987654 55889999999999999999999999999999987665544444 67 Q ss_pred cCHHHHHHHHH--HHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCC Q lcl|NC_021305. 353 PDWEAKSESTQ--KMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLD 430 (518) Q Consensus 353 ~d~~~~~~~~~--~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 430 (518) .|.+++++.+. +++++|+||+||+|+++|++|++ |||+++.|.++++++..........+..... .....+.. T Consensus 433 ~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~--gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~ 507 (563) T protein:vir:95 433 GDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIE--GGDIILDASFLQGTAQLQQDKQYNDGKQKER---LQMMMSLL 507 (563) T ss_pred CCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCC--CcceeecccccccccccccccCCCccccchh---hhhccccc Confidence 78888888765 46889999999999999999995 8999999999988775543332222211111 00000000 Q ss_pred ccccCCccccccchhcchhhHHHH-----HHHHh-hcccCCchhhHHHHHHHHHhhccccCcCchhHHH Q lcl|NC_021305. 431 QSPPTSVPGLSPTNSDRSTDSGKT-----EPRRL-MQKPPPKESSPKHLRAVKGAMGRGKDIKGFALQL 493 (518) Q Consensus 431 ~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~-~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (518) ..+.+ .+..++++....++..+. +.+.. -.++++++.. ..|-.++..+.. 
T Consensus 508 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~ 563 (563) T protein:vir:95 508 EGDND-DSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSNKGQ------------GRKGEKSSDFKH 563 (563) T ss_pred CCCCC-CCCCCCCCCCCCCccccccccccccccccccccCccccc------------cccCcCcccccC Confidence 10100 111111111111111111 11110 0112222221 111223333332 No 64 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=100.00 E-value=1.4e-78 Score=447.41 Aligned_cols=467 Identities=14% Similarity=0.127 Sum_probs=317.3 Q ss_pred CcCCC---CCCCCcccccccchhhhhhhcccc-ccc----ccccccchhhhHHHhhcHHHHHHHHHHHHhhcc------- Q lcl|NC_021305. 1 MLLAN---GQTLSAPAMAELSPQMQDSYYYAP-AVG----MQLERQFSLYGGIYKNQPWVRTVIAKRAQALAR------- 65 (518) Q Consensus 1 ~~f~~---~~~~~~~~~~~~~~~~~~~~~~~~-~~~----~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~------- 65 (518) -+|.. +.....++ ...|.+. .+.... ..+ .....+.......+..+++|++||+.+++.||+ T Consensus 41 ~~~~~~~~~~~~~~~a--~~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~ 117 (563) T protein:vir:99 41 KEYQDLTKSLYGQQQA--YAEPFIE-MMDTNPEFRDKRSYMKNEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARY 117 (563) T ss_pred hhHHHHHhhhccCCCc--chhhhHh-hhcccccccccccCCCCcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhh Confidence 22221 11112222 1233332 111111 111 111112233455677899999999999999985 Q ss_pred ------CceEEEEecCCcce--eccchHHHHHHh----cCCcC-CCHHHHHHHHHHHHHHcCCeEEEEE--EcCCCceEE Q lcl|NC_021305. 66 ------LPVKCMFTSGDTET--EESDTGYAKLLA----DPCEY-LDPFAFWEWVASTLDIYGETYLAIQ--KNKSGTPEK 130 (518) Q Consensus 66 ------l~~~v~~~~~~~~~--~~~~~~~~~L~~----~PN~~-~s~~~f~~~~v~~ll~~G~~~~~i~--r~~~G~~~~ 130 (518) +++++++.+..+.. ....++++.++. 
.|||+ +|+.+||+.++.+++++|++|++++ |+..|++++ T Consensus 118 ~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~ 197 (563) T protein:vir:99 118 SEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEK 197 (563) T ss_pred hcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEE Confidence 57777766654432 223344433332 33343 5899999999999999999999876 777899999 Q ss_pred EEeeCCceeEEEEcCCceeeEEeee-cccccCceeEEeccccEEEEecc-CCC--CcccCchHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 131 LMPMHPSRVAIKRNSRTGRYEYYFQ-AGAGVGTQLVSFADDEVVPIRFF-NPD--GLERGLSLMESLKSTIFSEDSSRNA 206 (518) Q Consensus 131 l~~l~p~~v~v~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~evih~~~~-~~~--~~~~G~s~l~~~~~~i~~~~~~~~~ 206 (518) ||||+|++|++..+.++..+..... +....++....|.++++||++.+ +.+ ...||+|||.++..+|....+++++ T Consensus 198 L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~ 277 (563) T protein:vir:99 198 FIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRTELSSSGYGLSEVEIAMKEFIAYNNTESF 277 (563) T ss_pred EEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEecCcceEEEeccCCCCcccCcccchHHHHHHHHHHHHHHHHHH Confidence 9999999999999887765432111 11223556678899998866544 332 2568999999999999999999999 Q ss_pred HHHHHHccCCcccccccCc--cCCHHHHHHHHHHHHHHhcCccccCCe-eecCCCcceeeccCChhhHHHHHHHHHHHHH Q lcl|NC_021305. 
207 TAAMWKNAGRPNLVLRHEK--RLSEAAQQRLREQFDRAHSGSSNTGKT-MVVEEGMEPIPLQLTAVEMQFIEARQLNREE 283 (518) Q Consensus 207 ~~~~~~ng~~p~~il~~~~--~~~~~~~~~~~~~~~~~~~g~~n~g~~-~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~ 283 (518) +.++|+||++|+|||++++ .+++++++++++.|++.++|..|+|++ +|+++|++|++++.++.|+||++++++++++ T Consensus 278 ~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~ 357 (563) T protein:vir:99 278 NDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINI 357 (563) T ss_pred HHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHH Confidence 9999999999999999864 479999999999999999999999996 7899999999999999999999999999999 Q ss_pred HHHHhcCCHHHhcccccc-----------ccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhh Q lcl|NC_021305. 284 VCGVYDIAPPIVHILDRA-----------TFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQ 352 (518) Q Consensus 284 Ia~~fgVPp~~lg~~~~~-----------~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~ 352 (518) ||++|||||++||+.+.+ +++|.+++.+.|+++||.||+.+||++||++|++..+..+.++| ++ T Consensus 358 Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~~~~~~~~f-----~r 432 (563) T protein:vir:99 358 ISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVNRHIISEYGDKYTFQF-----VG 432 (563) T ss_pred HHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhcccccEEEe-----cc Confidence 999999999999987654 55889999999999999999999999999999987665544444 67 Q ss_pred cCHHHHHHHHH--HHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCC Q lcl|NC_021305. 353 PDWEAKSESTQ--KMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLD 430 (518) Q Consensus 353 ~d~~~~~~~~~--~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 430 (518) .|.+++++.+. +++++|+||+||+|+++|++|++ |||+++.|.++++++..........+..... .....+.. 
T Consensus 433 ~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~--gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~ 507 (563) T protein:vir:99 433 GDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIE--GGDIILDASFLQGTAQLQQDKQYNDGKQKER---LQMMMSLL 507 (563) T ss_pred CCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCC--CcceeecccccccccccccccCCCccccchh---hhhccccc Confidence 78888888765 46889999999999999999995 8999999999988775543332222211111 00000000 Q ss_pred ccccCCccccccchhcchhhHHHH-----HHHHh-hcccCCchhhHHHHHHHHHhhccccCcCchhHHH Q lcl|NC_021305. 431 QSPPTSVPGLSPTNSDRSTDSGKT-----EPRRL-MQKPPPKESSPKHLRAVKGAMGRGKDIKGFALQL 493 (518) Q Consensus 431 ~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~-~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (518) ..+.+ .+..++++....++..+. +.+.. -.++++++.. ..|-.++..+.. T Consensus 508 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~ 563 (563) T protein:vir:99 508 EGDND-DSEEGQSTDSSNDDKEIGTDAQIKGDDNVYRTQTSNKGQ------------GRKGEKSSDFKH 563 (563) T ss_pred CCCCC-CCCCCCCCCCCCCccccccccccccccccccccCccccc------------cccCcCcccccC Confidence 10100 111111111111111111 11110 0112222221 111223333332 No 65 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=100.00 E-value=1.1e-77 Score=442.61 Aligned_cols=444 Identities=12% Similarity=0.096 Sum_probs=302.9 Q ss_pred Cc----CCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_021305. 1 ML----LANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGD 76 (518) Q Consensus 1 ~~----f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~ 76 (518) |. |...++.+.... .+..+ ...+...+. ......++.++.+|++||.++++.++++|+++++.+.. 
T Consensus 53 ~~~~~~g~~~~~~~~~~~-~~~~l-~~~~~~~~~--------~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~ 122 (535) T protein:vir:10 53 ADGNVAGQYSVASISDVL-STKKL-LKAYADNDI--------VQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKV 122 (535) T ss_pred ccCCcccccccCcccccc-CHHHH-HHHhccChh--------HHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCC Confidence 22 111111111100 01111 111111111 11233566778888999999999999999999977654 Q ss_pred cc--eeccch-HHHHHHhcCCcCCCHHH----HHHHHHHHHHHc-CCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCce Q lcl|NC_021305. 77 TE--TEESDT-GYAKLLADPCEYLDPFA----FWEWVASTLDIY-GETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTG 148 (518) Q Consensus 77 ~~--~~~~~~-~~~~L~~~PN~~~s~~~----f~~~~v~~ll~~-G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~ 148 (518) +. ....+| +.+.|+.+||+.|++++ |+++++.+++++ |++|++|+|+..|++++||||+|.+|++..+..+. T Consensus 123 ~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv~d~l~~~g~ay~~i~r~~~G~~~~L~~l~p~~V~v~~d~~~~ 202 (535) T protein:vir:10 123 MSKAQIKRAHEIEDFIYNTGSEYYEWRDTFPRLLTKIINDMYVQDQINIERIFKNDSNELDHFNAVDASKVVISYSPRSK 202 (535) T ss_pred CcchhhhhhhHHHHHHHhCCCCCCChhHHHHHHHHHHHHHHHhhCCceEEEEEECCCCcEEEEEEeCCceeEEEEcCccc Confidence 32 233444 45567789999998875 556677776665 57899999999999999999999999998876553 Q ss_pred e---eEEeeecccccCceeEEeccccEEEEeccCCCC---cccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccc Q lcl|NC_021305. 149 R---YEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDG---LERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLR 222 (518) Q Consensus 149 ~---~~~~~~~~~~~~~~~~~~~~~evih~~~~~~~~---~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~ 222 (518) . 
.+|.+ ..++....|++++||||++++..+ ..+|+||+.++..+|....++++++.++|+||++|+|||+ T Consensus 203 ~~~~~~~~~----~~~~~~~~~~~~eiih~~~~~~~~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~ 278 (535) T protein:vir:10 203 DQPRKFEQF----VSETKSVKFSERNLTFINYWNLSDTDRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILV 278 (535) T ss_pred cCceEEEEE----ecCceeEEECcccEEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEE Confidence 2 22222 234566789999999999876543 4579999999999999999999999999999999999999 Q ss_pred cCc----cCCHHHHHHHHHHHHHHhcCccccCCeeecC-CCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc Q lcl|NC_021305. 223 HEK----RLSEAAQQRLREQFDRAHSGSSNTGKTMVVE-EGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHI 297 (518) Q Consensus 223 ~~~----~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~-~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~ 297 (518) +++ .+++++.+++++.|++.++|.+|+|+++|+. +|++|++++.++.|+||+|++++++++||++|||||++||+ T Consensus 279 ~~~~~~~~ls~e~~e~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~eIa~afgVPp~~lG~ 358 (535) T protein:vir:10 279 IDQDGDAQANQMMLAGIRRQWTSQGSGLGGAWKIPILAAKDAKFVNMTQNSRDMEFDKFLNFMIYDTAAIFQMQPEEINF 358 (535) T ss_pred ecCCCCcccCHHHHHHHHHHHHHHhcCcccccccccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhcc Confidence 875 4789999999999999999999999987776 79999999999999999999999999999999999999999 Q ss_pred ccccccCC------------HHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHH Q lcl|NC_021305. 298 LDRATFSN------------ISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKM 365 (518) Q Consensus 298 ~~~~~~sn------------~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~ 365 (518) .+++||+| .|++...|+++||.||++.||++||++|++..+. 
+++|+++.+++.|.++++++++.+ T Consensus 359 ~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ln~~Ll~~~~~--~~~f~f~~l~~~d~~~r~~~~~~~ 436 (535) T protein:vir:10 359 PNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQVINDKIMRYVDT--DYRFSFTLGDAQDKLQEEQVWKLK 436 (535) T ss_pred ccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccCC--eEEEEeccccccCHHHHHHHHHHH Confidence 98887765 4667778999999999999999999999987654 457777899999999999988765 Q ss_pred HhCCCcCHHHHHHHhCCCCCCCCCcceeeecc---cccccccccccCCCCCCCCCCCCCccCCCCCCCc-cccCCccccc Q lcl|NC_021305. 366 VNSGVATPNEGREIMGLPRSDDPKADELYANS---ALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQ-SPPTSVPGLS 441 (518) Q Consensus 366 ~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~---n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 441 (518) + +|+||+||+|+++|+||++ |||++++.. ++........ ...+.+.++...+..+... +..+.....+ T Consensus 437 ~-~g~lT~NE~R~~~gl~pie--gGD~~~~~~~~~~~~~~~~~~~-----~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~ 508 (535) T protein:vir:10 437 L-ANGYFINEYRKDHGLKTVD--GLDVPGFIGSAENFINATGFGQ-----PNVPDSSDDSGSTLGERERQERIQHSKDYE 508 (535) T ss_pred H-cCCCCHHHHHHHhCCCCCC--Cccccccccchhhccccccccc-----ccCCCCCCCccccCCccccCcccccccccc Confidence 5 6789999999999999995 899866433 2221111000 0011111111100000000 0000000000 Q ss_pred cchhcchhhHHHHHHHHhhcccCCchhhHHHHHHHH Q lcl|NC_021305. 
442 PTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAVK 477 (518) Q Consensus 442 ~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~ 477 (518) .++..+++..-|+.-++.--...++.+ T Consensus 509 ---------~g~~~~~~~~~~~~~~~~~~~~~~~~~ 535 (535) T protein:vir:10 509 ---------KGKDDPKSPLPKPSESDDVSNNEDADT 535 (535) T ss_pred ---------cCCCCCCCCCCcCCCCCccccccccCC Confidence 000000000000000000000000000 No 66 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=100.00 E-value=9.2e-77 Score=437.45 Aligned_cols=481 Identities=14% Similarity=0.141 Sum_probs=329.1 Q ss_pred cCCCCCCCCcccc------cccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_021305. 2 LLANGQTLSAPAM------AELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSG 75 (518) Q Consensus 2 ~f~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~ 75 (518) ||+.+-+...-.+ ..-++.+....++.| ...+. ++...+.++..+++|++||++||++||++|++++.+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~pp~--~~~~La~~~~~n~~v~scI~~ia~~ia~~~~~i~~~~~ 77 (540) T protein:vir:41 1 MFNYHLSIKSLEKYRAIKGDTDSQALKEDRFEEY-VEPKV--HPLVLLSLLQVNPYHASACSIKANDILRTGYLIDGDDG 77 (540) T ss_pred CCCcccChhhccchhhhhccccccccccCCCCcc-ccCCC--CHHHHHHHHHhcHHHHHHHHHHHHHHhcCCceEecCcc Confidence 8998876543211 112333333222222 23332 34556788999999999999999999999999865432 Q ss_pred CcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceee----- Q lcl|NC_021305. 76 DTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRY----- 150 (518) Q Consensus 76 ~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~----- 150 (518) . ... ..||++||+++||++++.+++++||+|++++|+..|++++|+||+|.+|++..+..+... 
T Consensus 78 ~---------~~~--~lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~d~~ 146 (540) T protein:vir:41 78 G---------VEE--LLRACRPSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVHRDGSRYMQTWDGI 146 (540) T ss_pred c---------hhh--hccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCcceEEeEcCceeEeeecCc Confidence 1 111 249999999999999999999999999999999999999999999999998776543221 Q ss_pred --EEe--eec----ccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccc Q lcl|NC_021305. 151 --EYY--FQA----GAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLR 222 (518) Q Consensus 151 --~~~--~~~----~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~ 222 (518) .|. +.+ ....+...+.+++++|||+|.+++.+.++|+||+..+..++....++++++.++|+||++|++||+ T Consensus 147 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~eViHir~~~~~~~~~G~Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~ 226 (540) T protein:vir:41 147 HVTYFKDYRYEGEVNPDNGEDQDGVGANEIIFIHLPSPICSYYGVPRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVIT 226 (540) T ss_pred eeeeeecccccceeeccccccceeecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Confidence 111 100 111234456799999999999988888899999999999999999999999999999999999999 Q ss_pred cCccCCHHH----------HHHHHHHHHHHhcCc-cccCCeeecC------CCcceeeccCChhhHHHHHHHHHHHHHHH Q lcl|NC_021305. 223 HEKRLSEAA----------QQRLREQFDRAHSGS-SNTGKTMVVE------EGMEPIPLQLTAVEMQFIEARQLNREEVC 285 (518) Q Consensus 223 ~~~~~~~~~----------~~~~~~~~~~~~~g~-~n~g~~~vl~------~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia 285 (518) +++.+++++ .+++++.|++.++|. 
.|+|+++||+ +|++|++++.++.|+||++++++++++|| T Consensus 227 ~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa 306 (540) T protein:vir:41 227 VTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTVEVTFTPLNTSQKELSFREYAAEKKHDIA 306 (540) T ss_pred eCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCcccceeEEecccchhHHHHHHHHHHHHHHHH Confidence 987765442 356777787777774 5789999984 79999999999999999999999999999 Q ss_pred HHhcCCHHHhccccc--cccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHH Q lcl|NC_021305. 286 GVYDIAPPIVHILDR--ATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQ 363 (518) Q Consensus 286 ~~fgVPp~~lg~~~~--~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~ 363 (518) ++|||||++||+.+. .|++|.+++.+.|+++||.|+++.|+++||++|++..+.+++++|+.+.+++.|.+++ +. T Consensus 307 ~afgVPp~~lG~~~~~~~n~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~~~~~i~f~~~~ll~~D~~~~---~~ 383 (540) T protein:vir:41 307 AAHMIDPYRLGITDVGPLGGNFAEVARRTYYESVVRPQQEIVSSVLTDFIQLKLDPGARFVFNEEILMESEFVHN---YA 383 (540) T ss_pred HHhCCCHHHcCcccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecchhhcchHHHHH---HH Confidence 999999999998764 4678999999999999999999999999999999887888899999999999876654 66 Q ss_pred HHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccc Q lcl|NC_021305. 364 KMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPT 443 (518) Q Consensus 364 ~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 443 (518) +++++|++|+||+|+.+ ++++ +++|.++.|.|+...+.........+..+...+... ...++....... 
T Consensus 384 ~lv~~G~lT~NE~Re~L--~g~e-~gdd~~l~p~n~~~~~~~~~~~~~~~~~~~~~~k~~----~~~~~~~~~~~~---- 452 (540) T protein:vir:41 384 LLVQCGVLTPSEVREKL--FGLD-GGPDMFMVPSSIGKSAMKRQKRNYEKNQINEIKRTY----AKYKPRIQEIIS---- 452 (540) T ss_pred HHHhCCCCCHHHHHHHh--CcCc-CCCcccccccccccccccccccccCCCCcccccccc----chhcccccCccc---- Confidence 78999999999999854 3343 356777778887654332221111111111100000 000000000000 Q ss_pred hhcchhhHHHHHH-HHhhcccCCchhhHHHHHHHHHhhccccC---------cCchhHHHHHHHHHHHhHHHHhhhhhhh Q lcl|NC_021305. 444 NSDRSTDSGKTEP-RRLMQKPPPKESSPKHLRAVKGAMGRGKD---------IKGFALQLAEKYPDDLEDILLAVQLALA 513 (518) Q Consensus 444 ~~~~~~~~~~~~~-~~~~~k~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (518) +..+.+.+..+. +....+....+-..||.-.+..+||...+ .++-+ +++|+|-| .|.---+- T Consensus 453 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~----~~~~~~~~ 524 (540) T protein:vir:41 453 -SESPLEDKKKKIDEVLSDFRAEAYENGKKMLSIAGDMGTMSAINRGVSMIPPKPSN---LEAYEDLL----AASVDDIV 524 (540) T ss_pred -cccccccccccccccccccCCccccchhHHHHHhhhhhhhhhhhcCceecCCCCcc---hHHHHHHH----HhhHHHHH Confidence 000000000000 00011111123334555555666653332 23323 34554433 22222222 Q ss_pred cccCC Q lcl|NC_021305. 514 ERKDN 518 (518) Q Consensus 514 ~~~~~ 518 (518) +|-.. T Consensus 525 ~~~~~ 529 (540) T protein:vir:41 525 ERIRH 529 (540) T ss_pred HHHHH Confidence 33222 No 67 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=100.00 E-value=5.7e-77 Score=438.58 Aligned_cols=399 Identities=14% Similarity=0.153 Sum_probs=300.6 Q ss_pred hhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce----ec-cchHHHHHHhcCCcCC--------CHHHHHHHHHH Q lcl|NC_021305. 
42 YGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET----EE-SDTGYAKLLADPCEYL--------DPFAFWEWVAS 108 (518) Q Consensus 42 ~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~----~~-~~~~~~~L~~~PN~~~--------s~~~f~~~~v~ 108 (518) .++++..+++|++||++||++||++||+++.+.+.... .. ......++..+||+.| ++.+||+.++. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 67788889999999999999999999999865433211 12 2223345667888765 67789999999 Q ss_pred HHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCcee-------eEEee------------------ecccccCce Q lcl|NC_021305. 109 TLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGR-------YEYYF------------------QAGAGVGTQ 163 (518) Q Consensus 109 ~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~-------~~~~~------------------~~~~~~~~~ 163 (518) +++++||+|++++|+..|++++|+||+|++|++..+..... .++.+ .......+. T Consensus 81 ~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNGDLDPVFVDADDGSTGT 160 (467) T ss_pred HHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecceeEeecCCceeeEEeccccceeecccceeeeeeeeccccccc Confidence 99999999999999999999999999999999877654321 11111 111223456 Q ss_pred eEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHHHHH Q lcl|NC_021305. 164 LVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRA 242 (518) Q Consensus 164 ~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~~~~ 242 (518) .+.+++++|||++.+++.+..+|+||+.+++.++....++++++.++|+||+.|+|||+++ +.+++++.+++++.|++. 
T Consensus 161 ~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~ 240 (467) T protein:vir:31 161 SVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKGAELTEKGREEMRNLIEDN 240 (467) T ss_pred eeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHhh Confidence 6789999999999998888889999999999999999999999999999999999999875 579999999999999987 Q ss_pred hc-----------CccccCCeeecCCCcceeecc--------CChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccccc Q lcl|NC_021305. 243 HS-----------GSSNTGKTMVVEEGMEPIPLQ--------LTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATF 303 (518) Q Consensus 243 ~~-----------g~~n~g~~~vl~~g~~~~~l~--------~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~ 303 (518) +. |..|++++++++.|+++.+++ .++.|+||.+++++.+++||++|||||++||+.+++++ T Consensus 241 ~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~ 320 (467) T protein:vir:31 241 NEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVHDVPPVIAGVVESGAF 320 (467) T ss_pred hcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCc Confidence 75 456788999998876655543 36789999999999999999999999999999877765 Q ss_pred -CCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhc--ccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh Q lcl|NC_021305. 304 -SNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWV--RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIM 380 (518) Q Consensus 304 -sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~--~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~ 380 (518) +|.+++...|+++||.|+++.|+++||++|++... 
..++++|+++.+++.|.++++++++.++++|++|+||+|+++ T Consensus 321 ~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~ 400 (467) T protein:vir:31 321 STDAEEQRKEFAEETIQPKQHDFGELLYELVHKQGLDAPDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDEF 400 (467) T ss_pred ccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccCCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh Confidence 68999999999999999999999999999998654 356789999999999999999999999999999999999999 Q ss_pred CCCCCCCCCcceeeecccccccc----cccccCCCCCCCCCCCCCccCCCCCCCcc---ccCCccccccch Q lcl|NC_021305. 381 GLPRSDDPKADELYANSALQPLG----ATPDGAVEWEEAPAPKRPASTPVASLDQS---PPTSVPGLSPTN 444 (518) Q Consensus 381 g~~p~~~~~gD~~~~~~n~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~ 444 (518) |++|+++ ..+.+.+..... ..+.+....+..+......++........ +-.-+.+..+|+ T Consensus 401 Gl~pi~d----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 467 (467) T protein:vir:31 401 GFEPFPE----EHVYGGETLVAEVTGGSGPGGGIGDQIEQLVEDRADEIIDSYQADLETEQLIEIGANADS 467 (467) T ss_pred CCCCCCc----ccccCCcccccccccccCCCCcccCcCCCCCCCcccchHhhhhhccccchhhhhccccCC Confidence 9999842 223332211111 11111111111100000000000000000 000000000111 No 68 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=100.00 E-value=1.2e-76 Score=436.87 Aligned_cols=387 Identities=16% Similarity=0.113 Sum_probs=298.2 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) =||.+-....+.. ..|. .+ .....++...|+++++|++||++||++||++||++|+++ +. 
T Consensus 2 g~f~~lf~~~~~~----~~~~------~~------~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~----~~ 61 (395) T protein:vir:95 2 SILEKIFKTRKDI----TYML------DL------DMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGN----RI 61 (395) T ss_pred chhhhhhccCccc----cccc------cc------hhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCC----cc Confidence 1222211111100 1111 11 122456677899999999999999999999999999743 34 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~ 160 (518) ..++..++|+.+||+.||+++||+.++.++++.|++|+++.++. .++++++..+++....+.....+.+ .. T Consensus 62 ~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~----~~ 132 (395) T protein:vir:95 62 QKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDDIFKDVTV----KD 132 (395) T ss_pred ccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCCccceeEeecCcceeEEEE----cC Confidence 56778888889999999999999999999999999988765542 2456655555544433332222221 12 Q ss_pred CceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCcc-CCHHHHHHHHHHH Q lcl|NC_021305. 161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKR-LSEAAQQRLREQF 239 (518) Q Consensus 161 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~-~~~~~~~~~~~~~ 239 (518) .+....+++++|||++++++.+..+|+||+..+..++.... +.|.+|+.++++|..++. 
+++++.+++++.| T Consensus 133 ~~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~ 205 (395) T protein:vir:95 133 YTYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFT 205 (395) T ss_pred ceeeeeeccccEEEEccCCCCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHH Confidence 34456799999999999998888899999999988876544 346778888999988755 6899999999999 Q ss_pred HHHhcCccccC-CeeecCCCcceeeccCChhhH-----HHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHH Q lcl|NC_021305. 240 DRAHSGSSNTG-KTMVVEEGMEPIPLQLTAVEM-----QFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAF 313 (518) Q Consensus 240 ~~~~~g~~n~g-~~~vl~~g~~~~~l~~~~~d~-----~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~ 313 (518) ++.+++.++.+ .++++++|++|++++.++.++ ||+|++++..++||++|||||++|| ++++|.+++.+.| T Consensus 206 ~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~----~~~sn~e~~~~~~ 281 (395) T protein:vir:95 206 NKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY----GETADLEKNTLVF 281 (395) T ss_pred HHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CcccCHHHHHHHH Confidence 99888754322 355689999999999888765 8999999999999999999999996 5789999999999 Q ss_pred HHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCccee Q lcl|NC_021305. 
314 YRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADEL 393 (518) Q Consensus 314 ~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~ 393 (518) +++||.|++..||++||++|+++.++..+++|+++.+++.|.+++++++.+++++|++|+||+|+++|+||+++++||++ T Consensus 282 ~~~~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~ 361 (395) T protein:vir:95 282 EKFCLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEY 361 (395) T ss_pred HHHHHHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee Confidence 99999999999999999999998877778899999999999999999999999999999999999999999998899999 Q ss_pred eecccccccccccccCCCCCCCCCCCCCccCCCCCCCcccc Q lcl|NC_021305. 394 YANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPP 434 (518) Q Consensus 394 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (518) ++|+|+++++.........++. ...+++++++.+ T Consensus 362 ~~~~n~~~~~~~~~~~~~~~~~-------~~kgg~~~~~g~ 395 (395) T protein:vir:95 362 LITKNYEKANSGENDEKEKDEN-------TLKGGDEDESGD 395 (395) T ss_pred eeccccccccccccccCccccc-------ccCCCCCCCCCC Confidence 9999999987654433222211 111111111110 No 69 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=100.00 E-value=1.2e-76 Score=436.87 Aligned_cols=387 Identities=16% Similarity=0.113 Sum_probs=298.2 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) =||.+-....+.. ..|. .+ .....++...|+++++|++||++||++||++||++|+++ +. 
T Consensus 2 g~f~~lf~~~~~~----~~~~------~~------~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~----~~ 61 (395) T protein:vir:10 2 SILEKIFKTRKDI----TYML------DL------DMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGN----RI 61 (395) T ss_pred chhhhhhccCccc----cccc------cc------hhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCC----cc Confidence 1222211111100 1111 11 122456677899999999999999999999999999743 34 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~ 160 (518) ..++..++|+.+||+.||+++||+.++.++++.|++|+++.++. .++++++..+++....+.....+.+ .. T Consensus 62 ~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~----~~ 132 (395) T protein:vir:10 62 QKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDDIFKDVTV----KD 132 (395) T ss_pred ccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCCccceeEeecCcceeEEEE----cC Confidence 56778888889999999999999999999999999988765542 2456655555544433332222221 12 Q ss_pred CceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCcc-CCHHHHHHHHHHH Q lcl|NC_021305. 161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKR-LSEAAQQRLREQF 239 (518) Q Consensus 161 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~-~~~~~~~~~~~~~ 239 (518) .+....+++++|||++++++.+..+|+||+..+..++.... +.|.+|+.++++|..++. 
+++++.+++++.| T Consensus 133 ~~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~ 205 (395) T protein:vir:10 133 YTYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFT 205 (395) T ss_pred ceeeeeeccccEEEEccCCCCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHH Confidence 34456799999999999998888899999999988876544 346778888999988755 6899999999999 Q ss_pred HHHhcCccccC-CeeecCCCcceeeccCChhhH-----HHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHH Q lcl|NC_021305. 240 DRAHSGSSNTG-KTMVVEEGMEPIPLQLTAVEM-----QFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAF 313 (518) Q Consensus 240 ~~~~~g~~n~g-~~~vl~~g~~~~~l~~~~~d~-----~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~ 313 (518) ++.+++.++.+ .++++++|++|++++.++.++ ||+|++++..++||++|||||++|| ++++|.+++.+.| T Consensus 206 ~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~----~~~sn~e~~~~~~ 281 (395) T protein:vir:10 206 NKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY----GETADLEKNTLVF 281 (395) T ss_pred HHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CcccCHHHHHHHH Confidence 99888754322 355689999999999888765 8999999999999999999999996 5789999999999 Q ss_pred HHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCccee Q lcl|NC_021305. 
314 YRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADEL 393 (518) Q Consensus 314 ~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~ 393 (518) +++||.|++..||++||++|+++.++..+++|+++.+++.|.+++++++.+++++|++|+||+|+++|+||+++++||++ T Consensus 282 ~~~~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~ 361 (395) T protein:vir:10 282 EKFCLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEY 361 (395) T ss_pred HHHHHHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee Confidence 99999999999999999999998877778899999999999999999999999999999999999999999998899999 Q ss_pred eecccccccccccccCCCCCCCCCCCCCccCCCCCCCcccc Q lcl|NC_021305. 394 YANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPP 434 (518) Q Consensus 394 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (518) ++|+|+++++.........++. ...+++++++.+ T Consensus 362 ~~~~n~~~~~~~~~~~~~~~~~-------~~kgg~~~~~g~ 395 (395) T protein:vir:10 362 LITKNYEKANSGENDEKEKDEN-------TLKGGDEDESGD 395 (395) T ss_pred eeccccccccccccccCccccc-------ccCCCCCCCCCC Confidence 9999999987654433222211 111111111110 No 70 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=100.00 E-value=1.2e-76 Score=436.87 Aligned_cols=387 Identities=16% Similarity=0.113 Sum_probs=298.2 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) =||.+-....+.. ..|. .+ .....++...|+++++|++||++||++||++||++|+++ +. 
T Consensus 2 g~f~~lf~~~~~~----~~~~------~~------~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~----~~ 61 (395) T protein:vir:10 2 SILEKIFKTRKDI----TYML------DL------DMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGN----RI 61 (395) T ss_pred chhhhhhccCccc----cccc------cc------hhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCC----cc Confidence 1222211111100 1111 11 122456677899999999999999999999999999743 34 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~ 160 (518) ..++..++|+.+||+.||+++||+.++.++++.|++|+++.++. .++++++..+++....+.....+.+ .. T Consensus 62 ~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~----~~ 132 (395) T protein:vir:10 62 QKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDDIFKDVTV----KD 132 (395) T ss_pred ccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCCccceeEeecCcceeEEEE----cC Confidence 56778888889999999999999999999999999988765542 2456655555544433332222221 12 Q ss_pred CceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCcc-CCHHHHHHHHHHH Q lcl|NC_021305. 161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKR-LSEAAQQRLREQF 239 (518) Q Consensus 161 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~-~~~~~~~~~~~~~ 239 (518) .+....+++++|||++++++.+..+|+||+..+..++.... +.|.+|+.++++|..++. 
+++++.+++++.| T Consensus 133 ~~~~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~ 205 (395) T protein:vir:10 133 YTYQRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFT 205 (395) T ss_pred ceeeeeeccccEEEEccCCCCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHH Confidence 34456799999999999998888899999999988876544 346778888999988755 6899999999999 Q ss_pred HHHhcCccccC-CeeecCCCcceeeccCChhhH-----HHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHH Q lcl|NC_021305. 240 DRAHSGSSNTG-KTMVVEEGMEPIPLQLTAVEM-----QFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAF 313 (518) Q Consensus 240 ~~~~~g~~n~g-~~~vl~~g~~~~~l~~~~~d~-----~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~ 313 (518) ++.+++.++.+ .++++++|++|++++.++.++ ||+|++++..++||++|||||++|| ++++|.+++.+.| T Consensus 206 ~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~----~~~sn~e~~~~~~ 281 (395) T protein:vir:10 206 NKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY----GETADLEKNTLVF 281 (395) T ss_pred HHHhccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CcccCHHHHHHHH Confidence 99888754322 355689999999999888765 8999999999999999999999996 5789999999999 Q ss_pred HHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCccee Q lcl|NC_021305. 
314 YRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADEL 393 (518) Q Consensus 314 ~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~ 393 (518) +++||.|++..||++||++|+++.++..+++|+++.+++.|.+++++++.+++++|++|+||+|+++|+||+++++||++ T Consensus 282 ~~~~l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~ 361 (395) T protein:vir:10 282 EKFCLTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEY 361 (395) T ss_pred HHHHHHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee Confidence 99999999999999999999998877778899999999999999999999999999999999999999999998899999 Q ss_pred eecccccccccccccCCCCCCCCCCCCCccCCCCCCCcccc Q lcl|NC_021305. 394 YANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPP 434 (518) Q Consensus 394 ~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (518) ++|+|+++++.........++. ...+++++++.+ T Consensus 362 ~~~~n~~~~~~~~~~~~~~~~~-------~~kgg~~~~~g~ 395 (395) T protein:vir:10 362 LITKNYEKANSGENDEKEKDEN-------TLKGGDEDESGD 395 (395) T ss_pred eeccccccccccccccCccccc-------ccCCCCCCCCCC Confidence 9999999987654433222211 111111111110 No 71 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=100.00 E-value=5.1e-77 Score=438.85 Aligned_cols=370 Identities=16% Similarity=0.124 Sum_probs=294.6 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) =||++.. +++.. + . ......++...|+++++|++||++||++||++||++|+++. . 
T Consensus 6 ~~f~~~~---~~~~~----~-------~------~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~----~ 61 (385) T protein:vir:95 6 SVFKRHS---ELSWM----Y-------D------LEFLQDKSKKAYLKQIALNTVVEMVARTISQSEFRVMKNNT----K 61 (385) T ss_pred hhhccCc---ccccc----c-------c------hhhhhccchhhhhhhHHHHHHHHHHHHHHcccceeeeecCc----c Confidence 2333211 11110 0 0 00112355678999999999999999999999999998653 3 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~ 160 (518) ..|+..++|+.+||++||+++||+.++.+++++|++|+++.++. +.+..++++.+..+.+.... ++.... .. T Consensus 62 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~~~-~~~~~~~~~~~~~~~~~~~~-----~~~~~~--~~ 133 (385) T protein:vir:95 62 EKGTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKNDEG-HFFVADDFEKEDELGLYSHR-----FTNVLV--ND 133 (385) T ss_pred ccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEEecCC-Ceeecccccccccccccccc-----ceeeee--cc Confidence 35667777888999999999999999999999999999887653 44556666666655433221 111111 12 Q ss_pred CceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCc--cCCHHHHHHHHHH Q lcl|NC_021305. 161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK--RLSEAAQQRLREQ 238 (518) Q Consensus 161 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~--~~~~~~~~~~~~~ 238 (518) .+....+++++|||++++++++..+|.||+..+..++....++.. +++.|+++++++. .+++++.+++++. 
T Consensus 134 ~~~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~-------~~~~~~g~l~~~~~~~~~~e~~~~~~~~ 206 (385) T protein:vir:95 134 FEFKRVFTMDDVIYLKYNNQKLDAFSLGLFEDYGEIFGRMIDLQM-------LNNQIRGILKVDATKFYNKEKQKELQAY 206 (385) T ss_pred cceeeeeccccEEEecCCCCCcccccchHHHHHHHHHHHHHHHHH-------hcCCCceEEEeCCccCCCHHHHHHHHHH Confidence 234567999999999999998888999999999998877655432 2345788887754 5789999999999 Q ss_pred HHHHhcCccc-cCCeeecCCCcceeeccC------ChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHH Q lcl|NC_021305. 239 FDRAHSGSSN-TGKTMVVEEGMEPIPLQL------TAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMR 311 (518) Q Consensus 239 ~~~~~~g~~n-~g~~~vl~~g~~~~~l~~------~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~ 311 (518) |++.++|..+ .++++++++|++|++++. ++.|+||++.++++.++||++|||||++|+ ++++|.+++.. T Consensus 207 ~~~~~~g~~~~~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~----~~~sn~e~~~~ 282 (385) T protein:vir:95 207 IDTLFDAFQNNTIAVVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVL----GEMADLEKTIE 282 (385) T ss_pred HHHHhhhhhhcCCceEEcCCCceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhc----CCCcCHHHHHH Confidence 9999998755 456888999999999874 667999999999999999999999999995 58999999999 Q ss_pred HHHHHHhhHHHHHHHHHHHHhhhhhhcc-cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCc Q lcl|NC_021305. 
312 AFYRDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKA 390 (518) Q Consensus 312 ~~~~~~l~P~~~~ie~~l~~~l~~~~~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~g 390 (518) .|+++||.|++..||++||++|+++.++ ..+++||++.+++.|.+++++++.+++++|++|+||+|+++|++|++++|| T Consensus 283 ~~~~~~l~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~~~g 362 (385) T protein:vir:95 283 SYLQFCINPLLRKIEAELNSKFFYQDEYLNDDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMTGEEPADDPEL 362 (385) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCChhhcccceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC Confidence 9999999999999999999999997654 457899999999999999999999999999999999999999999999999 Q ss_pred ceeeecccccccccccccCCCCC Q lcl|NC_021305. 391 DELYANSALQPLGATPDGAVEWE 413 (518) Q Consensus 391 D~~~~~~n~~~~~~~~~~~~~~~ 413 (518) |++++|+|+++++...++...++ T Consensus 363 d~~~~~~n~~~~~~~kgge~~~e 385 (385) T protein:vir:95 363 DKFIITKNLQSADAFKGGESNEE 385 (385) T ss_pred ceeeecccceecccccCCCCCCC Confidence 99999999999875422221111 No 72 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=100.00 E-value=6.6e-76 Score=432.75 Aligned_cols=372 Identities=15% Similarity=0.182 Sum_probs=295.5 Q ss_pred CcCCC---CCCCCccccccc-chhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_021305. 1 MLLAN---GQTLSAPAMAEL-SPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGD 76 (518) Q Consensus 1 ~~f~~---~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~ 76 (518) =+|.+ .+..++...... ..|+...++ ......++.+.|+++++|++||++||++||++||++++. 
T Consensus 2 g~~~~~~~~k~~~~~~~~~~~~~~~~~~~~--------~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~--- 70 (383) T protein:vir:10 2 GLLTPKNFSKRNAKNMVYPSNPAFFTTTVG--------GMQLSYVSALSALQNTNVYSVINRIASDVSSAHFKTENT--- 70 (383) T ss_pred Ccccccccccccccccccccchhhhhhhcc--------CccccccchhHhhcchHHHHHHHHHHHhhccCceeeccc--- Confidence 12321 111111111111 122222221 122345778889999999999999999999999998642 Q ss_pred cceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeec Q lcl|NC_021305. 77 TETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQA 156 (518) Q Consensus 77 ~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~ 156 (518) ..+.|+.+||++||+.+||+.++.+++++|++|++++++ ..+++|+++.+|++..+.++..+ .+.. T Consensus 71 --------~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~----~~~~~p~~~~~v~~~~~~~~~~~--~~~~ 136 (383) T protein:vir:10 71 --------ATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ----NLEHIPNSDVQINYLPGNMGIVY--TVLE 136 (383) T ss_pred --------chhhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC----ceeEeecCcceEEEEEcCCceEE--EEEE Confidence 233477899999999999999999999999999999875 46788888888887776654333 2222 Q ss_pred ccccCceeEEeccccEEEEeccCCCC--cccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccC-CHHHHH Q lcl|NC_021305. 
157 GAGVGTQLVSFADDEVVPIRFFNPDG--LERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRL-SEAAQQ 233 (518) Q Consensus 157 ~~~~~~~~~~~~~~evih~~~~~~~~--~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~-~~~~~~ 233 (518) ..++..++|++++|||||+.++++ ..+|+||+..+...+.....+++++.++|+||++|+++|++++.+ ++++.+ T Consensus 137 --~~~~~~~~~~~~evih~r~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~~e~~~ 214 (383) T protein:vir:10 137 --SNDRPKMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLE 214 (383) T ss_pred --cCCceEEEEcccceEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHH Confidence 245677899999999999877654 457999999999999999999999999999999999999999876 578899 Q ss_pred HHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHH-HHHHHHHHHHHHHhcCCHHHhcccc--ccccCCHHHHH Q lcl|NC_021305. 234 RLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFI-EARQLNREEVCGVYDIAPPIVHILD--RATFSNISAQM 310 (518) Q Consensus 234 ~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~-e~~~~~~~~Ia~~fgVPp~~lg~~~--~~~~sn~e~~~ 310 (518) ++++.|++.++| .|+|+++|+++|++|++++.++.++|++ +++++.+++||++|||||++||..+ +.+++|.+++. T Consensus 215 ~~~~~~~~~~~~-~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~~ 293 (383) T protein:vir:10 215 SAREEFEKANTG-DNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIK 293 (383) T ss_pred HHHHHHHHHhCc-cccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHHHH Confidence 999999998877 6899999999999999999999999975 8999999999999999999999754 56788988886 Q ss_pred HHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCc Q lcl|NC_021305. 311 RAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKA 390 (518) Q Consensus 311 ~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~g 390 (518) . 
++..||.|+++.||++|+++|+.+ +++||++.+++.|.+++++.+.+++++|++|+||+|+++|++|++ +| T Consensus 294 ~-~~~~~l~P~~~~ie~~l~~~l~~~-----~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~--~~ 365 (383) T protein:vir:10 294 A-TYLANLNSYVNPIVDELRLKMNAP-----DLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQAQFILTRSGFL--PD 365 (383) T ss_pred H-HHHHHHHHHHHHHHHHHHHhhCCc-----eEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccc--CC Confidence 6 455799999999999999999753 589999999999999999999999999999999999999999996 56 Q ss_pred ceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCc Q lcl|NC_021305. 391 DELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQ 431 (518) Q Consensus 391 D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (518) |......+..++. ++.. + T Consensus 366 d~~~~~~~~~~~~---gGd~--------------------e 383 (383) T protein:vir:10 366 NLPEFKPLTNETK---GGDD--------------------K 383 (383) T ss_pred cccccCCCcccCC---CCCC--------------------C Confidence 6543322221110 0000 0 No 73 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=100.00 E-value=1.5e-76 Score=436.26 Aligned_cols=375 Identities=13% Similarity=0.114 Sum_probs=314.8 Q ss_pred CcCCCCC-CCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANGQ-TLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) =||.+.. .+++|.... +.|+ ... .....+ .+.++..++...++++++|++||++||++||++||+++++.. 
T Consensus 2 glf~~~~~~~~~~~~~~-~~~~-~~~-~~~~~~-~~~~~~~v~~~~al~~~~V~~~i~~Ia~~ia~l~~~~~~~~~---- 73 (384) T protein:vir:49 2 PIFNITNLATESPPSNQ-DSFF-DIT-DPEFLD-ALNGSEWVSAETALKNSDLFSIISQLSNDLATAKITTSRKQL---- 73 (384) T ss_pred ccccccccCcccccccc-hhhc-ccc-chhhcc-cccCCceechhhhhccHHHHHHHHHHHHHHhhCceeeecchh---- Confidence 2555432 222332211 1121 111 111111 123345677888999999999999999999999999986542 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeecccc Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAG 159 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~ 159 (518) ..|+.+||++||+++||+.++.+++++||+|++++|+..|++.+|+||+|++|++..+.++..++|.+...+. T Consensus 74 -------~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~~ 146 (384) T protein:vir:49 74 -------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNQNGLYYNITFDDP 146 (384) T ss_pred -------hhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEecCc Confidence 3589999999999999999999999999999999999999999999999999999988888888888887776 Q ss_pred cCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHH Q lcl|NC_021305. 
160 VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQF 239 (518) Q Consensus 160 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~ 239 (518) ..+..+.|++++|||++++++++..+|+||+.++...+....+++++..++|+||+.|+++|++++.+++++.++ .+ T Consensus 147 ~~~~~~~~~~~eVih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~---~~ 223 (384) T protein:vir:49 147 RIPPKQHVPQGDILHFRLLSVDGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKTK---QS 223 (384) T ss_pred cccceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHH---HH Confidence 777888999999999999999988899999999999999999999999999999999999999999888776543 34 Q ss_pred HHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc--cccCCHHHHHHHHHHHH Q lcl|NC_021305. 240 DRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDR--ATFSNISAQMRAFYRDT 317 (518) Q Consensus 240 ~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~~~~~~~ 317 (518) .+.+.+..|+|+++|+++|++|++++.++.++||++.++++.++||++|||||++||.... +++++.++....|+..+ T Consensus 224 ~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i~~~ 303 (384) T protein:vir:49 224 RSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAVSRF 303 (384) T ss_pred HHHHhcccCCccceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHHHHHH Confidence 5556677899999999999999999999999999999999999999999999999998543 45677889999999999 Q ss_pred hhHHHHHHHHHHHHhhhhhh-----cccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcce Q lcl|NC_021305. 318 MAIPIARIQSAMDKYVGQYW-----VRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADE 392 (518) Q Consensus 318 l~P~~~~ie~~l~~~l~~~~-----~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~ 392 (518) +.|++..|+++|++++.... 
...++++|+++.+++.|..++.+++..+...|+++ ||+|+.+|++|+++...|+ T Consensus 304 l~pi~~~i~~~l~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~-ne~r~~~~~~p~~gGd~~~ 382 (384) T protein:vir:49 304 LRPFVSELSKKLSCEVDADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILP-KDLPEGETDSTLKGGETNE 382 (384) T ss_pred HHHHHHHHHHHhchhhhhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCC-hhHHHHcCCCCCCCCCCCC Confidence 99999999999999874321 23467899999999999999999999999999986 9999999999998544555 Q ss_pred ee Q lcl|NC_021305. 393 LY 394 (518) Q Consensus 393 ~~ 394 (518) .| T Consensus 383 ~~ 384 (384) T protein:vir:49 383 QY 384 (384) T ss_pred CC Confidence 55 No 74 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=100.00 E-value=4.3e-75 Score=428.30 Aligned_cols=385 Identities=15% Similarity=0.143 Sum_probs=310.6 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) =||.+.....++ .....+++........ .....++..++.+.++++++|++||++||++||++|+++++.. T Consensus 2 ~~f~~~~~~~~~-~~~~~~~~~~~~~~~~--~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~~~~~~------ 72 (386) T protein:vir:49 2 PIFNITNLATES-PPINQESFFDIADSDF--LASLNSSEWVSAENALKNSDLFSIISQLSNDLATAKITTSRKQ------ 72 (386) T ss_pred chhhhhccCCCC-cccchhhhhhhhhccc--cccccCCceechhhhhccHHHHHHHHHHHHHhhhCceeeccch------ Confidence 244443322222 2122233222211111 1223344567888999999999999999999999999998754 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccccc Q lcl|NC_021305. 
81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~ 160 (518) ...|+.+||+.||+++||+.++.+++++||+|++|+|+..|++++|+|++|++|++..+.++....|.+...... T Consensus 73 -----~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~~~~~~~~~~y~~~~~~~~ 147 (386) T protein:vir:49 73 -----LQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNITFDDPH 147 (386) T ss_pred -----hhhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEEEcCCCceEEEEEEEcCcc Confidence 235899999999999999999999999999999999999999999999999999999998888888888777667 Q ss_pred CceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHH Q lcl|NC_021305. 161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFD 240 (518) Q Consensus 161 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~ 240 (518) ++..+.|++++||||+.+++++..+|+||+.++...+....++++++.++|+||+.|+++|++++.+++++.+++++.|+ T Consensus 148 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~ 227 (386) T protein:vir:49 148 IAPKQHVPQNDILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLDFKTKVSRSRQ 227 (386) T ss_pred ccceeEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHHHHHHHHHHH Confidence 77888999999999999999988899999999999999999999999999999999999999999999999999999887 Q ss_pred HHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhH Q lcl|NC_021305. 241 RAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAI 320 (518) Q Consensus 241 ~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P 320 (518) .. ..|+|+++|+++|++|++++.++.|+||+++++++.++||++|||||++||... 
.++++.+ +...|+..++.| T Consensus 228 ~~---~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~~~~~-~~~~~~~~~i~~ 302 (386) T protein:vir:49 228 AM---KQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDG-DQQSSLE-MIYNIYFKSVSR 302 (386) T ss_pred Hh---ccCCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC-CccchHH-HHHHHHHHHHHH Confidence 64 368899999999999999999999999999999999999999999999999643 3556655 456889999999 Q ss_pred HHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccccc Q lcl|NC_021305. 321 PIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQ 400 (518) Q Consensus 321 ~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~ 400 (518) ++..|+++|+.+|+. +++|+...+++.|..+++..+.+++++|++|+||+|++++..++.. .+.. .... T Consensus 303 ~l~~i~~~~~~~l~~------~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~--~~~~---~~~~ 371 (386) T protein:vir:49 303 YLRPFVSEMSKKLSC------EVDVDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILP--KELP---DGKN 371 (386) T ss_pred HHHHHHHHHHHHhcc------hhcccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCC--CcCc---chhc Confidence 999999999999964 4689999999999999999999999999999999999997665421 1110 0000 Q ss_pred ccccccccCCCCCCCCCCCCCccCCCCCCC Q lcl|NC_021305. 401 PLGATPDGAVEWEEAPAPKRPASTPVASLD 430 (518) Q Consensus 401 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 430 (518) + ..+..+. ++. 
++++ T Consensus 372 ~----------~~~~~~g---Gd~--~~~~ 386 (386) T protein:vir:49 372 P----------NRTSLKG---GEI--NEQD 386 (386) T ss_pred c----------CCCCCCC---CCC--CCCC Confidence 0 0000000 000 0000 No 75 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=100.00 E-value=7.1e-75 Score=427.10 Aligned_cols=378 Identities=14% Similarity=0.149 Sum_probs=302.3 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) =||.++...+..........+...+.. .+.++..++...++++++|++||++||++||++||++++... T Consensus 2 g~f~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~~----- 70 (382) T protein:vir:48 2 PIFNLATESPPDNQGGFFDVVDSDFLA------SLKGNEWVSAETALRNSDLFSIINQLSNDLATVKLITSRKKL----- 70 (382) T ss_pred ccccccccCCcccccccccchhhhccc------cccCCcccchHhhhccHHHHHHHHHHHHhhccCceeeecchh----- Confidence 245544332222221111111111211 223345678888999999999999999999999999987542 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~ 160 (518) ..|+.+||++||+++||+.++.+++++||+|++++|+..|++++|+|++|++|++..+.++..+.|.+...+.. 
T Consensus 71 ------~~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~~~~~~y~~~~~~~~ 144 (382) T protein:vir:48 71 ------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDGIYYNITFDDPR 144 (382) T ss_pred ------hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCCeEEEEEEecCcc Confidence 35899999999999999999999999999999999999999999999999999999998888888888777766 Q ss_pred CceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHH Q lcl|NC_021305. 161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFD 240 (518) Q Consensus 161 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~ 240 (518) .+..+.|++++||||+++++++..+|+||+.++..++....+++++..++|+||+.|+++|++++.+++++.+++++.|. T Consensus 145 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~ 224 (382) T protein:vir:48 145 IPPKQHVPQNDVLHFRLLSVDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLLDFKTKLSRSRQ 224 (382) T ss_pred ccceeEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHH Confidence 67788999999999999999998899999999999999999999999999999999999999999999999999999887 Q ss_pred HHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhH Q lcl|NC_021305. 241 RAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAI 320 (518) Q Consensus 241 ~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P 320 (518) .. ..|+|+++|+++|++|++++.++.|+||++.+++..++||++|||||.+||..+. 
+++.+++.+.|++.||.| T Consensus 225 ~~---~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~--~~~~~~~~~~~~~~~l~p 299 (382) T protein:vir:48 225 AM---KQMQGGPLVLDDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGD--QQSSLEMSSDLYSKAVSR 299 (382) T ss_pred hh---ccCCCCeeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC--cccHHHHHHHHHHHHHHH Confidence 64 3578999999999999999999999999999999999999999999999997544 457788899999999999 Q ss_pred HHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC---CCCCCcceeeecc Q lcl|NC_021305. 321 PIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPR---SDDPKADELYANS 397 (518) Q Consensus 321 ~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p---~~~~~gD~~~~~~ 397 (518) +++.|+++|+++|++..+......++ .+.......+.+++++|++|+||+|+.++... -+.+.++.+..+ T Consensus 300 ~~~~i~~~l~~~l~~~~~~~~~~~~~------~~~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~~~~~~~~- 372 (382) T protein:vir:48 300 YLRPFLSELSQKLSCDVDADIFPAVD------PTGSNYISRINSLVKTGTLAQNQGLYILQQAEILPKELPNGENPNST- 372 (382) T ss_pred HHHHHHHHHHHHhcChhhhhhhhhhc------cchhHHHHHHHHHhhcCccCHHHHHHHHhhCCCCCcchhhhhcCCCC- Confidence 99999999999999876554333333 33444555677899999999999999885332 222222221100 Q ss_pred cccccccccccCCCCCCCCCCCCCccCCCCCCCccc Q lcl|NC_021305. 398 ALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSP 433 (518) Q Consensus 398 n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (518) + +.+ +++++. 
T Consensus 373 ----~-----------------~GG-----d~~~~~ 382 (382) T protein:vir:48 373 ----L-----------------KGG-----EEDGQD 382 (382) T ss_pred ----C-----------------CCC-----CCCCCC Confidence 0 000 000000 No 76 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=100.00 E-value=4.2e-75 Score=428.33 Aligned_cols=386 Identities=13% Similarity=0.081 Sum_probs=284.9 Q ss_pred CcC-CCCCCC--CcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_021305. 1 MLL-ANGQTL--SAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDT 77 (518) Q Consensus 1 ~~f-~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~ 77 (518) |=| .+-... ..........|. ++ .....++.++++++++|++||++||+++|++||++++++ T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~~----~~--------~~~~~~~~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~~~--- 65 (395) T protein:vir:40 1 MGFKSWVSGFFNEEQRTLNLTDTV----WC--------SIPSEKLKELSIKKWAIDSCANKIANTLSCAEVLTYEKG--- 65 (395) T ss_pred CchHHHHHhhhcccccccccccch----hh--------ccccccchhhhhhhHHHHHHHHHHHHHHhhCceeeccCC--- Confidence 422 110000 011111111221 11 112346677899999999999999999999999999754 Q ss_pred ceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeecc Q lcl|NC_021305. 78 ETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAG 157 (518) Q Consensus 78 ~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~ 157 (518) ++..++..++|+.+||+.||+++||+.++.+++++|++|+++.++.. ++.+. +.+.........++.+... 
T Consensus 66 -~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~~------~~~~~--~~~~~~~~~~~~~~~v~~~ 136 (395) T protein:vir:40 66 -EEVRKKNWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDEYI------YVADS--FTKNDKSLYENTYTEVTLK 136 (395) T ss_pred -ccccchHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecCce------eecCC--ccccccccccceeeeeeec Confidence 24567788889999999999999999999999999999999887642 22221 1111111111111111111 Q ss_pred cccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHH Q lcl|NC_021305. 158 AGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLRE 237 (518) Q Consensus 158 ~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~ 237 (518) . .+..+.|++++||||++++..+..++.+.+..+...+.... ...++.++..+.++++.+..+++++.+++++ T Consensus 137 ~--~~~~~~~~~~evih~r~~~~~~~~~~~~l~~~~~~~~~~~~-----~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 209 (395) T protein:vir:40 137 D--LTLKKEFKESEVLHLTLNNESIKSIIDGFYLLYGDLLTAAV-----NKYKKLNSRKIIVKLKAMFGQTPEAEEKLRL 209 (395) T ss_pred C--ceeeeeeccccEEEeecCCCCccccchhHHHHHHHHHHHHH-----HHHHhcCCCCceEEEecccCCCHHHHHHHHH Confidence 1 12245789999999998766554444455554444443322 2334456666666677788899999999999 Q ss_pred HHHHHhcCc-cccCCeeecCCCcceeeccCChhhHHHHHHHHHHH---HHHHHHhcCCHHHhccccccccCCHHHHHHHH Q lcl|NC_021305. 238 QFDRAHSGS-SNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNR---EEVCGVYDIAPPIVHILDRATFSNISAQMRAF 313 (518) Q Consensus 238 ~~~~~~~g~-~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~---~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~ 313 (518) .|++.+.+. .+.++++++++|++|++++.++.++||++++++.. 
++||++|||||++|| ++++|.+++...| T Consensus 210 ~~~~~~~~~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~----~~~sn~e~~~~~f 285 (395) T protein:vir:40 210 MLSERMKKFLAEGDSALPVEDGMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLAK----GDTVGLSEQVNSF 285 (395) T ss_pred HHHHHHHHhhccCCceeecCCCceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CCCcCHHHHHHHH Confidence 999998774 56788999999999999999999999999998874 799999999999996 5789999999999 Q ss_pred HHHHhhHHHHHHHHHHHHhhhhhhcc--cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcc Q lcl|NC_021305. 314 YRDTMAIPIARIQSAMDKYVGQYWVR--KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKAD 391 (518) Q Consensus 314 ~~~~l~P~~~~ie~~l~~~l~~~~~~--~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD 391 (518) +++||.|++++||++|+++|+++.++ +++++||++.+++.|.+++++.+.+++++|++|+||+|+++|+||+++|+|| T Consensus 286 ~~~~L~P~~~~ie~~l~~kLl~~~~~~~g~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~~~~gD 365 (395) T protein:vir:40 286 LMFSINPIAEMFTDEGNRKFYGRDSVLERTYMKLDTTRIKVQDIQEIASSMDVLFHIGVNTIDDNLRMIGREPVMSPETQ 365 (395) T ss_pred HHHHHHHHHHHHHHHHHHhcCChhhhcCCceEEEechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCc Confidence 99999999999999999999987554 6789999999999999999999999999999999999999999999999999 Q ss_pred eeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCc Q lcl|NC_021305. 392 ELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQ 431 (518) Q Consensus 392 ~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (518) ++++|+|+++++.....+..++.... +++. 
T Consensus 366 ~~~~~~n~~~~~~~~~~~kgge~~~~----------~~~~ 395 (395) T protein:vir:40 366 ERFVTKNYAPLGENEEDLKGGDINEN----------KGDS 395 (395) T ss_pred eeeeccccccccccccccCCCCCCCC----------cCCC Confidence 99999999998765433221111110 0000 No 77 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=100.00 E-value=5.9e-75 Score=427.55 Aligned_cols=379 Identities=12% Similarity=0.048 Sum_probs=281.9 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) ..|++++....+. .+ + + .....++...|+++++|++||++||++||+|||++++++. ... T Consensus 5 d~~~~~~~~~~~~-----~~------~----~---~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~--~~~ 64 (395) T protein:vir:96 5 DFFSFKKSGTLSD-----DD------S----G---STTSEKLTNVVLKEDALYKCVNYLARIISKSTFRIKAPEK--LTE 64 (395) T ss_pred hhhcCCCCccccc-----cc------c----c---cchhhhcchhhhhhHHHHHHHHHHHHhhccceeEEEeCCc--ccc Confidence 2222322111100 00 0 0 0112355678999999999999999999999999997643 344 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~ 160 (518) ..+++.++|+.+||++||+++||+.++.+++++|++|+++.++..+.+...++.. ..-.... ++.+... . 
T Consensus 65 ~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~~~~~~~~~~~~-------~~~~~~~-~~~v~~~--~ 134 (395) T protein:vir:96 65 NQKDWLYWINTKANPNQSASQFWVEVVQKLLVDGETLIFVIPGKGIYVADAFTQD-------KKLSGNK-FKVSRVQ--G 134 (395) T ss_pred ccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCCceecCCccccc-------cccccce-eeeeeec--c Confidence 5666777888899999999999999999999999999999987643222222111 1101111 1111111 1 Q ss_pred CceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHH------HHHHHHHHHHHHccCCcccccccCccCCHHHHHH Q lcl|NC_021305. 161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSE------DSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQR 234 (518) Q Consensus 161 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~------~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~ 234 (518) ......+++++|||||++++++..++.+++......+... ..+.++..++|.+++.+.+++..++...++..++ T Consensus 135 ~~~~~~~~~~dvih~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (395) T protein:vir:96 135 QTYEKIFTFDQVIYLKNDNSDLMLKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKDKVRERAQENSDGGRQPKSDKD 214 (395) T ss_pred ceeeeEeccCceEEecccCCccccccccccchHHHHHHHHHHHHHHHHHHHHHhhhcccccccceeeccCchhhHHHHHH Confidence 2234578999999999887765555555444444433333 3344678889999999999998887777766666 Q ss_pred HHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHH------HHHHHHHhcCCHHHhccccccccCCHHH Q lcl|NC_021305. 235 LREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLN------REEVCGVYDIAPPIVHILDRATFSNISA 308 (518) Q Consensus 235 ~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~------~~~Ia~~fgVPp~~lg~~~~~~~sn~e~ 308 (518) +.+++..... .+.++++++++|++|++++.++.++|+++.+++. 
.++||++|||||++|| ++++|.|+ T Consensus 215 ~~~~~~~~~~--~~~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~----~~~sn~e~ 288 (395) T protein:vir:96 215 FFKRTIEKIR--TESVVGIPVTANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLH----GDIADNQK 288 (395) T ss_pred HHHHHHHHhh--cCCcceEEccCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CCCccHHH Confidence 5555544443 2456688899999999999999999998887765 5799999999999996 57899999 Q ss_pred HHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Q lcl|NC_021305. 309 QMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDP 388 (518) Q Consensus 309 ~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~ 388 (518) +.+.|+++||.||+.+||++|+++|+++.+...+++|+++.+++.|.+++++++++++++|++|+||+|+++|+||++++ T Consensus 289 ~~~~f~~~~L~P~~~~ie~~l~~~Ll~~~e~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~~ 368 (395) T protein:vir:96 289 NYELLLEGPIESLITNIVDGLEYAIFDKSETLEGSFIKVTGLKNYDLFSISSQADKLISSGFVFIDEVREEIGLPELPDG 368 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCChhhhcCceeEeecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 99999999999999999999999999987776677899999999999999999999999999999999999999999999 Q ss_pred CcceeeecccccccccccccCCCCCCC Q lcl|NC_021305. 
389 KADELYANSALQPLGATPDGAVEWEEA 415 (518) Q Consensus 389 ~gD~~~~~~n~~~~~~~~~~~~~~~~~ 415 (518) +||++++|+|++|++..++....++++ T Consensus 369 ~gD~~~~~~N~~~~~~~gge~~~~~~~ 395 (395) T protein:vir:96 369 LGKVLYMTKNYESVLERGGEVDEEVET 395 (395) T ss_pred CCceeeecccceechhccCCCCCCCCC Confidence 999999999999986521111100000 No 78 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=100.00 E-value=7e-75 Score=427.13 Aligned_cols=369 Identities=16% Similarity=0.127 Sum_probs=288.0 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) =||++.-.+.+.. .+..+ + .....++.+.|+++++|++||++||+++|++||++|+++ ++ T Consensus 2 g~f~~l~~~~~~~-----~~~~~-----~------~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~~p~~~~~~~----~~ 61 (376) T protein:vir:78 2 GFFSELFKRNKEI-----EWMWD-----L------DFLEDKTTKVYLKKMALNTCVKHIARTIAKSDFRLKNGE----TS 61 (376) T ss_pred chhhhhhccCCcc-----ccccc-----h------hhccccchhhhhhhHHHHHHHHHHHHhhcccceeecccc----cc Confidence 1333211111110 11100 0 011235667899999999999999999999999999643 44 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~ 160 (518) ..|+.+++|+.+||++||+++||+.++.+++++|++|+++.|+..|.+..++|+.+..+...... .+... . 
T Consensus 62 ~~~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~--~ 132 (376) T protein:vir:78 62 VRDKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIADSYVRKEFAFFPDVFE-------GVTVK--D 132 (376) T ss_pred ccchHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeeccceeecccceeeeeee-------eeeee--c Confidence 56777888889999999999999999999999999999999999999999999998876433211 11111 1 Q ss_pred CceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHH Q lcl|NC_021305. 161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFD 240 (518) Q Consensus 161 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~ 240 (518) .+....|++++|||+++....+..++.+++..+...+... ....++.++.++.++++.++.+++++.+++++.|+ T Consensus 133 ~~~~~~~~~~evih~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~ 207 (376) T protein:vir:78 133 YRYNRNFSMDDVIFLEYGNERLSAFTDGMFEDYGELFGKM-----IRAQMRNFQIRGAVNFKMAGVADKDKQTKLQEYID 207 (376) T ss_pred ceeeeeeccccEEEeccCCCCchhhhhHHHHHHHHHHHHH-----HHHHHhcCCCceeEEEccCCCCCHHHHHHHHHHHH Confidence 2234568999999999876655444444433333332221 12234455556666677788899999999999999 Q ss_pred HHhcCcc-ccCCeeecCCCcceeeccCChhhH-----HHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHH Q lcl|NC_021305. 241 RAHSGSS-NTGKTMVVEEGMEPIPLQLTAVEM-----QFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFY 314 (518) Q Consensus 241 ~~~~g~~-n~g~~~vl~~g~~~~~l~~~~~d~-----~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~ 314 (518) +.++|.. 
+.++++++++|++|++++.++.++ ||+|.++++.++||++|||||++|| ++++|.+++.+.|+ T Consensus 208 ~~~~g~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~----~~~s~~e~~~~~f~ 283 (376) T protein:vir:78 208 KVYASFNNNEIAIVPQLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLH----GDMADLSNNMKAYM 283 (376) T ss_pred HHhccccccCcceEEcCCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhC----CCCCCHHHHHHHHH Confidence 9999864 455688899999999999888665 9999999999999999999999996 47899999999999 Q ss_pred HHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceee Q lcl|NC_021305. 315 RDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELY 394 (518) Q Consensus 315 ~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~ 394 (518) ++||.|++..||++||++|+++.+ ++++|+++.+++.|.+++++++.+++++|++|+||+|+++|+||+++++||+++ T Consensus 284 ~~~l~P~~~~ie~~l~~kll~~~~--~~~~~~~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~p~~~g~~d~~~ 361 (376) T protein:vir:78 284 EYCIDPLTKKLEDELNAKLFTFSE--FLAGEHIKIIHKKDIIENAEAVDKLVASGSFNRNEVRELLGAERVDNPELDKYL 361 (376) T ss_pred HHHHHHHHHHHHHHHHhhhCCccc--ceecccchhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceee Confidence 999999999999999999998643 467788889999999999999999999999999999999999999988899999 Q ss_pred ecccccccccccccC Q lcl|NC_021305. 395 ANSALQPLGATPDGA 409 (518) Q Consensus 395 ~~~n~~~~~~~~~~~ 409 (518) +|+|++|++...+.. 
T Consensus 362 ~~~n~~~~~~~~e~g 376 (376) T protein:vir:78 362 ITKNYQSADEGGEDG 376 (376) T ss_pred eccCceehhccccCC Confidence 999999987432211 No 79 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=100.00 E-value=4.2e-74 Score=422.89 Aligned_cols=350 Identities=15% Similarity=0.216 Sum_probs=282.3 Q ss_pred CcCC-CCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLA-NGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) =+|+ +++ + +....++|...... +..+..+..++...++++++|++||++||++||++|+. T Consensus 2 ~~~~~f~~-r---~~~~~~~~~~~~~~-----~~~~~~~~~v~~~~al~~~av~~cv~~ia~~ia~~p~~---------- 62 (359) T protein:vir:10 2 SILNPFER-R---SSITPNNYYPFMVQ-----NGSIVPNSLVDATEALKNSDLYAVTSLISSDIAGTRFI---------- 62 (359) T ss_pred cccchhhc-c---ccCCCCcchhhhhc-----cccccCCcccCHHHhhcchHHHHHHHHHHHhhhcCccc---------- Confidence 1232 111 1 11111222221111 12234455678888999999999999999999999983 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeecccc Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAG 159 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~ 159 (518) .++.++.|+.+||++||+++||+.++.+++++||+|++|+|+..|.+.+|+|++|+.|++..+.++ +.|.+.. . 
T Consensus 63 --~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~--~~y~~~~--~ 136 (359) T protein:vir:10 63 --GNQVFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIPSNAITIDLTDDT--LTYEVNQ--F 136 (359) T ss_pred --cchHHHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeCCceEEEEEcCCe--EEEEEEe--c Confidence 578899999999999999999999999999999999999999999999999999999999877654 3344332 3 Q ss_pred cCceeEEeccccEEEEeccCC----CCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCc-cCCHHHHHH Q lcl|NC_021305. 160 VGTQLVSFADDEVVPIRFFNP----DGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK-RLSEAAQQR 234 (518) Q Consensus 160 ~~~~~~~~~~~evih~~~~~~----~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~~~~~~~ 234 (518) .++..+.|+++|||||+.++. .+..+|+||+..+..++....+++++..++|+||++|+|+|++++ .+++++.++ T Consensus 137 ~~~~~~~~~~~evih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~ 216 (359) T protein:vir:10 137 DDYPSAKYNASEMIHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDS 216 (359) T ss_pred CCceEEEEcccceEEeccCCCCCCccCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHH Confidence 346678899999999998754 244689999999999999999999999999999999999999975 789999999 Q ss_pred HHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc--cccCCHHHHHHH Q lcl|NC_021305. 235 LREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDR--ATFSNISAQMRA 312 (518) Q Consensus 235 ~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~~ 312 (518) +++.|++.++| .|+|+++||++|++|++++.++.|+||+|.++++.++||++|||||++||..++ +++++.++.... 
T Consensus 217 ~~~~~~~~~~~-~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~ 295 (359) T protein:vir:10 217 IRKEFEKANGG-NNSGRVMVLDQSADFSTVSINADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQIKDLYVN 295 (359) T ss_pred HHHHHHHHhCc-cccCCceecCCCcceeeecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccHHHHHHHHHH Confidence 99999887654 899999999999999999999999999999999999999999999999987643 356667777777 Q ss_pred HHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Q lcl|NC_021305. 313 FYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSD 386 (518) Q Consensus 313 ~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ 386 (518) |+..++.|+...|+..|++.+. ++...+...|.......+.+++++|++|+||+|+++|++|+= T Consensus 296 ~l~~~l~p~~~~l~~~l~~~~~----------~~~~~~~~~d~~~~~~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 296 ALNRFIEPLISELRIKCDSSIG----------VDMSPITDYSNSVFKADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred HHHHHHHHHHHHHHHHhhhhhc----------ccchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 7777777777777766665542 333333444445555667789999999999999999999994 No 80 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=100.00 E-value=3e-73 Score=418.22 Aligned_cols=486 Identities=14% Similarity=0.135 Sum_probs=316.2 Q ss_pred cCCCC---CCC--Cccc--ccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_021305. 2 LLANG---QTL--SAPA--MAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTS 74 (518) Q Consensus 2 ~f~~~---~~~--~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~ 74 (518) ||.-- ++. +++. ....+..+....+ ..+...+. +......++..+++|++||++||++||++||++++.. 
T Consensus 1 ~~~~~~~i~s~~~~~~i~~~~~~s~~~~~~~~-~~~~~pp~--~~~~la~l~~~n~~v~scI~~ia~~IA~l~~~~~~~~ 77 (542) T protein:vir:41 1 MFNYHLSIRSLEKYKAIKREEVESQALGETRF-EEYVEPKV--NPLVLLSLLQVNPYHASACSIKANDIIRTGYILEGDD 77 (542) T ss_pred CccccccccccccchhhhhccccccccccccC-CccccCCC--CHHHHHHHHhhcHHHHHHHHHHHHHHhhCceeeeccc Confidence 77611 111 1111 1111122111111 11222222 3445567888999999999999999999999996543 Q ss_pred CCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEE-- Q lcl|NC_021305. 75 GDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEY-- 152 (518) Q Consensus 75 ~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~-- 152 (518) .. .++...||++||+++||+.++.+++++||+|++++|+..|++.+|+||+|.+|++..+.......+ T Consensus 78 ~~----------~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v~~d~~~~~~~~~~ 147 (542) T protein:vir:41 78 EG----------VVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRVHKDGSRYRQTWDG 147 (542) T ss_pred ch----------hhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceEEEEcCCeeEeeecC Confidence 21 123445999999999999999999999999999999999999999999999999988765432211 Q ss_pred -------eeec----ccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCccccc Q lcl|NC_021305. 
153 -------YFQA----GAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVL 221 (518) Q Consensus 153 -------~~~~----~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il 221 (518) .|.+ ....+.....+++++|||+|.+++.+..+|+||+..+..++....++++++.++|+||++|++|| T Consensus 148 ~~~~~~~~y~~~~~~~~~~g~~~~~~~~~eIiHir~~~~~~~~~Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL 227 (542) T protein:vir:41 148 VNITHFKDYRYEGEINPETGEDQDSVGANELVFIHIPSPVCSYYGVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVI 227 (542) T ss_pred CcceeEEeecccccccccccccccccCcccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEE Confidence 1111 11123344568899999999998877789999999999999999999999999999999999999 Q ss_pred ccCc----------cCCHHHHHHHHHHHHHHhcCc-cccCCeeecC------CCcceeeccCChhhHHHHHHHHHHHHHH Q lcl|NC_021305. 222 RHEK----------RLSEAAQQRLREQFDRAHSGS-SNTGKTMVVE------EGMEPIPLQLTAVEMQFIEARQLNREEV 284 (518) Q Consensus 222 ~~~~----------~~~~~~~~~~~~~~~~~~~g~-~n~g~~~vl~------~g~~~~~l~~~~~d~~~~e~~~~~~~~I 284 (518) ++++ .+++++.+++++.|++.+.|. .|+|+++||+ +|++|++++.++.|++|++++++.+++| T Consensus 228 ~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~I 307 (542) T protein:vir:41 228 TVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDI 307 (542) T ss_pred EeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCcccceeEEEcCCChhHHHHHHHHHHHHHHH Confidence 8764 467889999999999999886 5788999984 7999999999999999999999999999 Q ss_pred HHHhcCCHHHhcccccc--ccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHH Q lcl|NC_021305. 
285 CGVYDIAPPIVHILDRA--TFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSEST 362 (518) Q Consensus 285 a~~fgVPp~~lg~~~~~--~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~ 362 (518) |++|||||++||+.+.+ +++|+|++...|+++||.|++++|+++||++|+++.+..++++|+...+++.|..+ .+ T Consensus 308 a~afgVPp~~lG~~~~~t~n~sn~Eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~~ll~~d~~~---~~ 384 (542) T protein:vir:41 308 AAAHMIDPYRLGIADTGPLGGNFAEVTRRTYYESVVRPQQNIISSILTDFFQVKFNPKTRFKFNDETLLESDSVR---NC 384 (542) T ss_pred HHHhCCCHHHhCcCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCceEEEecchhhcchHHHH---HH Confidence 99999999999998765 45899999999999999999999999999999998888889999999999887544 46 Q ss_pred HHHHhCCCcCHHHHHHHh-CCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccC-Ccccc Q lcl|NC_021305. 363 QKMVNSGVATPNEGREIM-GLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPT-SVPGL 440 (518) Q Consensus 363 ~~~~~~G~~T~NE~R~~~-g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 440 (518) ..++++|++|+||+|+.+ |++| ++|.++.|.|+.... ...++...+..+ ..+ .........+ -.... T Consensus 385 ~~~v~~GilT~NE~Re~L~g~~p----gdd~~l~p~~~~~~~-~~~~~~n~~~~~-~~~-----~~k~~~k~~~~~~~~~ 453 (542) T protein:vir:41 385 ALLVQSGVLTPAEARERLFGLDG----GPDIFMVPSKGAAKS-VKRQERNYEKNQ-IRE-----IRKIYAKYRPRFNEII 453 (542) T ss_pred HHHHhCCCCCHHHHHHhhCCCCC----CCccccccccccccc-cccCCcCCCCCc-hhh-----hhhcccccCccccccc Confidence 779999999999999853 5443 445555666654321 111111111110 000 0000000000 00000 Q ss_pred ccchhcchhhHH--HHHHH-HhhcccCCchhhH--HHHHHHHHhhcccc--CcCchhHHHHHHHHHHHh----H----HH Q lcl|NC_021305. 441 SPTNSDRSTDSG--KTEPR-RLMQKPPPKESSP--KHLRAVKGAMGRGK--DIKGFALQLAEKYPDDLE----D----IL 505 (518) Q Consensus 441 ~~~~~~~~~~~~--~~~~~-~~~~k~~~~~~~~--~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~----~----~~ 505 (518) ....+++..+.+ +.+++ ++-++++||.-.- ....-+++.-|+.. 
-.++-+| +.|++-|+ + |- T Consensus 454 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~ 530 (542) T protein:vir:41 454 SSKLSAEEKKKKIDESLAEFRAEAYEAGKKMLIIGGDMGSMSALNQGVSVIPSKPLNL---ERYEELLEASVEDMIGRIR 530 (542) T ss_pred cccccchhhcccccchhhhhHHhHHhcCceEEEeecCchhhhhhhccceeccCCCcCh---HHHHHHHHhhHHHHHHHHH Confidence 000011111111 11111 2223333322110 00000111111111 1122222 33433221 1 00 Q ss_pred HhhhhhhhcccC Q lcl|NC_021305. 506 LAVQLALAERKD 517 (518) Q Consensus 506 ~~~~~~~~~~~~ 517 (518) --+--.+.-|.- T Consensus 531 ~~~~~~~~~~~~ 542 (542) T protein:vir:41 531 HYLYKVIGWREL 542 (542) T ss_pred HHHHHHhhhccC Confidence 000000000111 No 81 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=100.00 E-value=3.8e-73 Score=417.59 Aligned_cols=384 Identities=13% Similarity=0.047 Sum_probs=286.7 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) =||.+-..+ .+.. . .++ ..+ .....++.+.|+++++|++||++||++||++||++|+.+.+ .. T Consensus 2 Glf~~~~~~-~~~~--~-~~~---~~~--------~~~~~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~--~~ 64 (395) T protein:vir:98 2 GILDFFSFK-KSGT--L-SDD---DSG--------STTSEKLTNVVLKEDALYKCVNYLARIISKSTFRLKTPEKL--TE 64 (395) T ss_pred cchhhhcCC-Cccc--c-ccc---ccc--------hhhhhhcchhhhhhHHHHHHHHHHHHHHhhCceeEEecCCc--cc Confidence 133221111 1110 0 000 000 01123566778999999999999999999999999986533 34 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccccc Q lcl|NC_021305. 
81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~ 160 (518) ..++..++|+.+||+.||+++||+.++.+++++|++|++++++..+. +++..+...... ... .+..... . T Consensus 65 ~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~~~------~~~~~~~~~~~~-~~~-~~~~~~~--~ 134 (395) T protein:vir:98 65 NQKDWLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKGIY------VADSFTQDKKIS-GSQ-FKVSRVQ--G 134 (395) T ss_pred ccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCcee------cCCccccccccc-Ccc-cceeeec--C Confidence 46777788889999999999999999999999999999999875432 222222211111 111 1111111 1 Q ss_pred CceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHH--HHHHHHHHHccCCcccccccCccC-CHHHHHHHHH Q lcl|NC_021305. 161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSS--RNATAAMWKNAGRPNLVLRHEKRL-SEAAQQRLRE 237 (518) Q Consensus 161 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~--~~~~~~~~~ng~~p~~il~~~~~~-~~~~~~~~~~ 237 (518) ....+++++++|||||+.++++..++.+++......+...... .....+++.++..+.+++...... ++++.+..++ T Consensus 135 ~~~~~~~~~~evih~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (395) T protein:vir:98 135 QTYEKTFTFDQVIYLKNDNSDLMSKVESLWEEYGELLGHVINNQKIANQIRFTMIPPKDKVRERAQENSDGGRQSKSDKD 214 (395) T ss_pred ceeeeEecCccEEEecCCCCCccccccchhhhHHHHHHHHHHHHHHHHHHHHhhccccccccccccccCCcHHHHHHHHH Confidence 2234678999999999988777666767777666666544433 345567888888888888766554 4556677777 Q ss_pred HHHHHhcCcc-ccCCeeecCCCcceeeccC------ChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHH Q lcl|NC_021305. 238 QFDRAHSGSS-NTGKTMVVEEGMEPIPLQL------TAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM 310 (518) Q Consensus 238 ~~~~~~~g~~-n~g~~~vl~~g~~~~~l~~------~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~ 310 (518) .|++.+.+.. +.+++++++.|++|++++. ++.++||.+.+++++++||++|||||++|| ++++|.|++. 
T Consensus 215 ~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~----~~~sn~e~~~ 290 (395) T protein:vir:98 215 FFKRTVEKIRTESVVGIPVTANTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLLH----GDIADNQKNY 290 (395) T ss_pred HHHHHHhhhhcCCcceeecCCCceeEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHhc----CCcccHHHHH Confidence 7777766533 4556788999999999985 467789999999999999999999999996 6799999999 Q ss_pred HHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCc Q lcl|NC_021305. 311 RAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKA 390 (518) Q Consensus 311 ~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~g 390 (518) +.|+++||.||+.+||++|+++|+++.+...+++|+++.+++.|.+++++++.+++++|++|+||+|+++|+||+++++| T Consensus 291 ~~f~~~tl~P~~~~ie~~l~~kll~~~~~~~g~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~Pi~~~~g 370 (395) T protein:vir:98 291 ELLLEGPIESLITNIVDGLEYAIFDKSETLQGSFIKVTGLKNYDLFSISNQADKLISSGFVFIDEVREEIGLPELPDGLG 370 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCChhhhcCcceeeehhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC Confidence 99999999999999999999999998877777889999999999999999999999999999999999999999998899 Q ss_pred ceeeecccccccccccccCCCCCCC Q lcl|NC_021305. 391 DELYANSALQPLGATPDGAVEWEEA 415 (518) Q Consensus 391 D~~~~~~n~~~~~~~~~~~~~~~~~ 415 (518) |++++++|++|++...+.....+++ T Consensus 371 D~~~~~~n~~~~~~~gge~~~~~~~ 395 (395) T protein:vir:98 371 KVLYMTKNYESVLERGGEVDEEVET 395 (395) T ss_pred ceeeecccceecccccCCCCCCCCC Confidence 9999999999987432211111111 No 82 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=100.00 E-value=3e-73 Score=418.20 Aligned_cols=359 Identities=13% Similarity=0.072 Sum_probs=271.4 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc-- Q lcl|NC_021305. 
1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE-- 78 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~-- 78 (518) =||+...+...... +.....+ . .......++++++|++||++||++||++||++|+...++. T Consensus 2 g~f~~~~~~~~~~~----~~~~~~~----------~--~~~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~ 65 (378) T protein:vir:94 2 NLFGKVVSFSRGKL----NNDTQRV----------T--AWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGS 65 (378) T ss_pred Cccccchhcccccc----cCCccee----------e--eeccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccCccc Confidence 24444332111100 0000000 0 1122345678899999999999999999999887765432 Q ss_pred ---e-eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEc-CCCceEEEEeeCCceeEEEEcCCceeeEEe Q lcl|NC_021305. 79 ---T-EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKN-KSGTPEKLMPMHPSRVAIKRNSRTGRYEYY 153 (518) Q Consensus 79 ---~-~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~-~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~ 153 (518) . ...|+++++|+.+||++||+++||+.++.+++++|++|++++++ ..|+++.++|.. T Consensus 66 ~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~~l~p~~------------------ 127 (378) T protein:vir:94 66 DTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELLDLLFAD------------------ 127 (378) T ss_pred ccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEEEEEecC------------------ Confidence 1 23455666777799999999999999999999999999997765 457776666521 Q ss_pred eecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHH Q lcl|NC_021305. 154 FQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQ 233 (518) Q Consensus 154 ~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~ 233 (518) ..++|++++|||++.+ .+...|+||+..+.+.+... 
+++ +.++|+|++++.+++++.+ T Consensus 128 ---------~~~~~~~~diiH~~~~--~~~~~g~s~l~~~~~~i~~~----------~~~-~~~~gil~~~~~l~~~~~~ 185 (378) T protein:vir:94 128 ---------DKKEYKPEELVRLTSP--FYINEDTSILDNALASIQTK----------LEQ-GKLRGLLKINAFLDIDNTQ 185 (378) T ss_pred ---------CeeEeeeeeeEEecCc--CCccchhHHHHHHHHHHHHH----------Hhc-ccccceeeeCCcCCHHHHH Confidence 1235778999999964 34457999999988876432 333 5689999999999998877 Q ss_pred HHHHHHHHHhc---CccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHH Q lcl|NC_021305. 234 RLREQFDRAHS---GSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM 310 (518) Q Consensus 234 ~~~~~~~~~~~---g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~ 310 (518) +++++|.+.+. +..++|+++||++|++|++++.++.++++ +.++++.++||++|||||++|+ .++.+++. T Consensus 186 ~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVP~~~l~------~~~se~~~ 258 (378) T protein:vir:94 186 EYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILL------GTASQEQQ 258 (378) T ss_pred HHHHHHHHHHHHhhcccccccceecCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhc------CChHHHHH Confidence 77777766553 23578899999999999999999999997 5678999999999999999994 34558899 Q ss_pred HHHHHHHhhHHHHHHHHHHHHhhhhhhcccc--------cceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Q lcl|NC_021305. 311 RAFYRDTMAIPIARIQSAMDKYVGQYWVRKN--------RMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGL 382 (518) Q Consensus 311 ~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~--------~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~ 382 (518) ..|+++||.||+++||++|+++|+++.++.. 
.++|+++.+++.|.+++++.+.+++++||+|+||+|+++|+ T Consensus 259 ~~f~~~tL~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl 338 (378) T protein:vir:94 259 IYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGE 338 (378) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 9999999999999999999999998755422 36799999999999999999999999999999999999999 Q ss_pred CCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCC Q lcl|NC_021305. 383 PRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVAS 428 (518) Q Consensus 383 ~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (518) ||++ |||++++|+|++|++.....+...+... +++.+.++ T Consensus 339 ~p~~--gGD~~~~~~n~~~~~~~~~~~~~~~~~~----~~~e~~n~ 378 (378) T protein:vir:94 339 QPIE--GGDVYIANLNAVAVKNLSDLQGSRKDVT----STDETNNQ 378 (378) T ss_pred CCCC--CCCeeeecccccccccchhhcCCcCCCC----CCCCCCCC Confidence 9996 8999999999999986654432211111 11111111 No 83 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=100.00 E-value=5.8e-73 Score=416.61 Aligned_cols=359 Identities=14% Similarity=0.068 Sum_probs=271.3 Q ss_pred CcCCCC-CCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANG-QTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) |=|-++ .+.... ..+...... ........+++.++|++||++||++||++||++|+++.++.. 
T Consensus 1 Mg~f~~~~~f~~~--------------~~~~~~~~~--~~~~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~ 64 (378) T protein:vir:93 1 MNLFGKVVSFSRG--------------KLNNDTQRV--TAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVG 64 (378) T ss_pred Cccchhhhhhhcc--------------ccCCCccee--eecccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccccc Confidence 533332 111000 000000000 011223457788999999999999999999999987655421 Q ss_pred ------eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcC-CCceEEEEeeCCceeEEEEcCCceeeEE Q lcl|NC_021305. 80 ------EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK-SGTPEKLMPMHPSRVAIKRNSRTGRYEY 152 (518) Q Consensus 80 ------~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~-~G~~~~l~~l~p~~v~v~~~~~~~~~~~ 152 (518) ...|++.++|+.+||++||+++||+.++.+++++|++|++++++. .|+++.++|.. T Consensus 65 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~~l~~~~----------------- 127 (378) T protein:vir:93 65 SDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFAD----------------- 127 (378) T ss_pred cccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEecC----------------- Confidence 234566677777999999999999999999999999999988764 36666555421 Q ss_pred eeecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHH Q lcl|NC_021305. 153 YFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQ 232 (518) Q Consensus 153 ~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~ 232 (518) ..++|++++|||++.+ .+...|.|++..+...+. .++++ +.++|+|+.++.+++++. T Consensus 128 ----------~~~~~~~~diih~r~~--~~~~~~~s~l~~~~~~i~----------~~~~~-~~~~g~l~~~~~l~~~~~ 184 (378) T protein:vir:93 128 ----------DKKEYKTEELVRLTSP--FYINEDTSILDNALASIQ----------TKLEQ-GKLRGLLKINAFLDIDNT 184 (378) T ss_pred ----------CeeEeccceeEEecCc--cccchhhHHHHHHHHHHH----------HHHhc-CcccceeeeCCcCCHHHH Confidence 1246788999999964 344568999988876653 34444 468999999999999887 Q ss_pred HHHHHHHHHHhc---CccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHH Q lcl|NC_021305. 
233 QRLREQFDRAHS---GSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQ 309 (518) Q Consensus 233 ~~~~~~~~~~~~---g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~ 309 (518) ++++++|++.+. +..++|++++|++|++|++++.++.++|+ +.++++.++||++|||||++|+ +++.+++ T Consensus 185 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~------g~~~e~~ 257 (378) T protein:vir:93 185 QEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILL------GTATQEQ 257 (378) T ss_pred HHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhc------CCcHHHH Confidence 777777766543 33578899999999999999999999997 6678999999999999999984 3456899 Q ss_pred HHHHHHHHhhHHHHHHHHHHHHhhhhhhccc--------ccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhC Q lcl|NC_021305. 310 MRAFYRDTMAIPIARIQSAMDKYVGQYWVRK--------NRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMG 381 (518) Q Consensus 310 ~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~--------~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g 381 (518) ...|+++||.|++++||++|+++|+++.++. ..++||++.+++.|.+++++++.+++++|++|+||+|+++| T Consensus 258 ~~~f~~~tl~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g 337 (378) T protein:vir:93 258 QIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMG 337 (378) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhC Confidence 9999999999999999999999999876542 23789999999999999999999999999999999999999 Q ss_pred CCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCC Q lcl|NC_021305. 382 LPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVAS 428 (518) Q Consensus 382 ~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (518) +||++ |||++++|+|++|++.....+...+... 
+++.+.++ T Consensus 338 l~p~~--ggD~~~~~~n~~~~~~~~~~~~~~~~~~----~~~e~~n~ 378 (378) T protein:vir:93 338 EQPIE--GGDVYIANLNAVAVKNLSDLQGSRKDVT----STDETNNQ 378 (378) T ss_pred CCCCC--CCCeeeeccccccccchhhhcCccCCCC----CCCCCCCC Confidence 99996 7999999999999986654432221111 11111111 No 84 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=100.00 E-value=9.2e-73 Score=415.51 Aligned_cols=359 Identities=13% Similarity=0.056 Sum_probs=271.5 Q ss_pred Cc-CCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 ML-LANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~-f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) |= |+...+.... ..+... ... ........++++++|++||++||++||++||++|++..++.. T Consensus 1 Mg~f~~~~~~~~~----~~~~~~----------~~~--~~~~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~ 64 (378) T protein:vir:16 1 MNLFGKVVSFSRG----KLNNDT----------QRV--TAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVG 64 (378) T ss_pred Cccchhhhhhhcc----cccCCc----------cee--eecccchhhHHHHHHHHHHHHHHhhhhhCceeEEEEcccccc Confidence 53 3332211000 000000 000 011223456788999999999999999999999987655421 Q ss_pred ------eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCC-CceEEEEeeCCceeEEEEcCCceeeEE Q lcl|NC_021305. 80 ------EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS-GTPEKLMPMHPSRVAIKRNSRTGRYEY 152 (518) Q Consensus 80 ------~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~-G~~~~l~~l~p~~v~v~~~~~~~~~~~ 152 (518) ...|+++++|+.+||++||+++||+.++.+++++|++|++++|+.. |+++.++|.. 
T Consensus 65 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~~l~~~~----------------- 127 (378) T protein:vir:16 65 SDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFAD----------------- 127 (378) T ss_pred cccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEecC----------------- Confidence 2346667777789999999999999999999999999999988754 5665555421 Q ss_pred eeecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHH Q lcl|NC_021305. 153 YFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQ 232 (518) Q Consensus 153 ~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~ 232 (518) ..+.|++++|||+|.+ .+...|.|++..+...+.. ++. ++.++|+|+.++.+++++. T Consensus 128 ----------~~~~~~~~diih~r~~--~~~~~~~s~l~~~~~~i~~----------~~~-~~~~~g~l~~~~~l~~~~~ 184 (378) T protein:vir:16 128 ----------DKKEYKPEELVRLTSP--FYINEDTSILDNALASIQT----------KLE-QGKLRGLLKINAFLDIDNT 184 (378) T ss_pred ----------CeeEecccceEEecCc--cCccchhHHHHHHHHHHHH----------HHh-cCccceeeEeCCcCCHHHH Confidence 1245778999999964 3345689999888876542 233 4568999999999999877 Q ss_pred HHHHHHHHHHhc---CccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHH Q lcl|NC_021305. 233 QRLREQFDRAHS---GSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQ 309 (518) Q Consensus 233 ~~~~~~~~~~~~---g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~ 309 (518) ++.+++|++.+. +..++|+++||++|++|++++.++.++++. 
.++++.++||++|||||.+|+ +++.+++ T Consensus 185 ~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~-~~~~~~~~Ia~~fgVPp~~l~------g~~~e~~ 257 (378) T protein:vir:16 185 QEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKD-EIDLIKSELLTGYFMNENILL------GTASQEQ 257 (378) T ss_pred HHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhHH-HHHHHHHHHHHHhCCCHHHhc------CCchHHH Confidence 777777766553 345789999999999999999999999974 568999999999999999984 3556899 Q ss_pred HHHHHHHHhhHHHHHHHHHHHHhhhhhhcccc--------cceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhC Q lcl|NC_021305. 310 MRAFYRDTMAIPIARIQSAMDKYVGQYWVRKN--------RMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMG 381 (518) Q Consensus 310 ~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~--------~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g 381 (518) ...|+++||.||++.||++|+++|+++.++.. .++|+++.+++.|.+++++++.+++++|++|+||+|+++| T Consensus 258 ~~~f~~~tl~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g 337 (378) T protein:vir:16 258 QIYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMG 337 (378) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhC Confidence 99999999999999999999999998765422 3679999999999999999999999999999999999999 Q ss_pred CCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCC Q lcl|NC_021305. 382 LPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVAS 428 (518) Q Consensus 382 ~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (518) +||++ |||++++|+|++|++.....+...+.. 
.+++...++ T Consensus 338 ~~p~~--ggD~~~~~~n~~~~~~~~~~~~~~~~~----~~~~e~~ne 378 (378) T protein:vir:16 338 EQPIE--GGDVYIANLNAVAVKNLSDLQGSRKDV----TSTDETNNQ 378 (378) T ss_pred CCCCC--CCCeEeeccccccccchhhhcCccCCC----CCCCCCCCC Confidence 99995 899999999999998665543221111 111111111 No 85 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=100.00 E-value=1.8e-70 Score=403.01 Aligned_cols=504 Identities=12% Similarity=0.080 Sum_probs=316.5 Q ss_pred Cc-------CCCCCCCCcccccccchhhhhhhcccc-cccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_021305. 1 ML-------LANGQTLSAPAMAELSPQMQDSYYYAP-AVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMF 72 (518) Q Consensus 1 ~~-------f~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~ 72 (518) +| |..+++++.-........+....++.. ....+. ++....+++..+|+|++||++||++||++||.++. T Consensus 42 ~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~g~~~~~epp~--d~~~l~~l~~~np~V~~aI~iia~~ia~l~~~i~~ 119 (648) T protein:vir:79 42 AMPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGGGGRDFEEPEF--DFNEITSAYNTEGYVRQAVDKYIEMMFKADWDFVS 119 (648) T ss_pred ccCCCCcccccccccchhHHHHHhHHHHHhhcCCccccccCCc--CHHHHHHHHhcChHHHHHHHHHHHHHhhCcceEEe Confidence 22 222222222222222223333333222 222222 34555678889999999999999999999999876 Q ss_pred ecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCc---------------eEEEEeeCCc Q lcl|NC_021305. 73 TSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGT---------------PEKLMPMHPS 137 (518) Q Consensus 73 ~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~---------------~~~l~~l~p~ 137 (518) +++. .........++.+||++||.++||+.++.+++++||+|++++|+..|. +..+||++|. 
T Consensus 120 ~~~~---~~~~~~~~~ll~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~ 196 (648) T protein:vir:79 120 KNPN---AVEYIRMRFTLMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLA 196 (648) T ss_pred cCCc---cchhhHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCc Confidence 5533 233334556778999999999999999999999999999999998873 4789999999 Q ss_pred eeEEEEcCCceeeEEeeecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCc Q lcl|NC_021305. 138 RVAIKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRP 217 (518) Q Consensus 138 ~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p 217 (518) +|++..+.++....|.|.. ..++..+.|++++||||+.+++.+.++|+|||.++..+|....+++++..++|.||++| T Consensus 197 ~v~v~~d~~g~~~~Y~y~~--~g~~~~~~~~~~dIIHik~~~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P 274 (648) T protein:vir:79 197 SMKVKRDKFGMIKGWQQEQ--EGQDKPQKFKPEDIVHIYYKREKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHP 274 (648) T ss_pred eeEEEEcCCCceeeeEEEe--cCCceeEEecCccEEEEccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCc Confidence 9999998888776666543 34566788999999999987766678999999999999999999999999999999999 Q ss_pred ccccccC-ccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhc Q lcl|NC_021305. 218 NLVLRHE-KRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVH 296 (518) Q Consensus 218 ~~il~~~-~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg 296 (518) +++|+++ +....++.++.++.|.+.+.+..-. 
+..+....+.+.+ ..+++|+||++++++++++||++|||||++|| T Consensus 275 ~gil~~~~~~~~~e~~k~~~e~~~~~~~~~~i~-gg~v~~~~~~i~~-~~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG 352 (648) T protein:vir:79 275 LWHVKVGLEQEGFGAEEGEVDLVRGEVENMDVE-GGMVTTERVNISS-IASNQIIDAKEYLKHFEQRAFTVLGVSELMMG 352 (648) T ss_pred cEEEEeCCCccchHHHHHHHHHHHHhccccccc-ccccccceeeccc-cCCHHHHHHHHHHHHHHHHHHHHhCCCHhHcc Confidence 9999875 3344566667777777776543211 1112222233322 23668999999999999999999999999999 Q ss_pred cccccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhh----hhh------cccccceecchhhhhcCHHHHHHHHHHHH Q lcl|NC_021305. 297 ILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVG----QYW------VRKNRMKFDIDDVIQPDWEAKSESTQKMV 366 (518) Q Consensus 297 ~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~----~~~------~~~~~~~fd~~~l~~~d~~~~~~~~~~~~ 366 (518) +.+++++++.+++.. ++..++.|+...++..++..+. .+. ...++++|+++++++.|.+++++.+.+++ T Consensus 353 ~~~~ss~stae~~~~-~~~~~i~~l~~~i~~~le~~~~~~ll~e~~l~~~l~~d~~ieF~~~~Llr~D~~~~a~~~~~l~ 431 (648) T protein:vir:79 353 RGGTASRSTGDNLSS-DFKDRIKALQKVMATFINEFMVKEILMEGGFDPVLNPDDKVEFRFNEIDMDSKIKLENQAVFLY 431 (648) T ss_pred cCCCccchHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccceEEEeecccchhhHHHHHHHHHHHH Confidence 988888888877655 4566777777666655554332 211 12356899999999999999999999999 Q ss_pred hCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCC-CCCCCCCCCCCccC-----CCCCCCcccc--CCcc Q lcl|NC_021305. 367 NSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAV-EWEEAPAPKRPAST-----PVASLDQSPP--TSVP 438 (518) Q Consensus 367 ~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~-~~~~~~~~~~~~~~-----~~~~~~~~~~--~~~~ 438 (518) ++||||+||+|+++|++|++++ .+..++..++.+......... ...+......+++. 
..+..+++.+ ...+ T Consensus 432 ~~GilT~NEaR~~lGlpPi~~g-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~eg~~~e~~~~~~~~~~~g~~~ 510 (648) T protein:vir:79 432 EHNAISEDEMRELIGRDPVDDG-EGRAKMHLQMVTIAQATALAALAPTPAGGSSASASGDKKKKATDNKTKPTNQHGTKT 510 (648) T ss_pred hCCCcCHHHHHHHhCCCCCCCC-CCccccccccccchhccccccCCCCCCCCCCCCccccccccccCCCCCCCCCCCcCC Confidence 9999999999999999999853 344455566555432211110 00000000000000 0000001111 1111 Q ss_pred ccccchhcchh------hHHHH---HHHHhhcccCCchhhHHHHHHHHHhhccccCcCchhHHHHHHH------------ Q lcl|NC_021305. 439 GLSPTNSDRST------DSGKT---EPRRLMQKPPPKESSPKHLRAVKGAMGRGKDIKGFALQLAEKY------------ 497 (518) Q Consensus 439 ~~~~~~~~~~~------~~~~~---~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------ 497 (518) ...+.+..+.. -...+ ++-+...+...+...-.|++.+++.|... ++..+..+.|| T Consensus 511 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~ 587 (648) T protein:vir:79 511 SPKKQTNGRHVRYMQEMLLEYTTLNEAIKALIERYYQYGSKEHLKSINGSLMYT---EGRLLELTTQYWGEEVTEKVRIP 587 (648) T ss_pred CCccccchhhhhhhhhhhhcchhhhHHHhhHHHHHHHHhHHHHHHhhhhhheec---cchhHHHHHHHhhhhhhceeeee Confidence 11111111100 01111 11111112222334444555666555432 34445555555 Q ss_pred -HHHHhHHHHhhhhhh------hcccCC Q lcl|NC_021305. 498 -PDDLEDILLAVQLAL------AERKDN 518 (518) Q Consensus 498 -~~~~~~~~~~~~~~~------~~~~~~ 518 (518) .+..+-+-=+.+.-+ ||-.+. T Consensus 588 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 615 (648) T protein:vir:79 588 FHRMTENLREEVMSTIDKVEGVAEASDI 615 (648) T ss_pred HHHHHHHHHHHHHhhhhhhhhhHHHHHH Confidence 111111222222221 111111 No 86 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=100.00 E-value=4.7e-71 Score=406.16 Aligned_cols=359 Identities=13% Similarity=0.066 Sum_probs=265.4 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc-- Q lcl|NC_021305. 
1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE-- 78 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~-- 78 (518) =||++-.+...-... +.............++++++|++||++||++||+|||++|+++.++. T Consensus 2 ~~f~k~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~ 65 (378) T protein:vir:85 2 NLFGKVVSFSRGKLN----------------NDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGS 65 (378) T ss_pred chhhhhhhhhhcccc----------------cCCcceeeeeccchhhhhHHHHHHHHHHHHhHhhCceeEEEEecccccc Confidence 133322111000000 00000011223345788999999999999999999999998876543 Q ss_pred ----eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEE-EcCCCceEEEEeeCCceeEEEEcCCceeeEEe Q lcl|NC_021305. 79 ----TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQ-KNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYY 153 (518) Q Consensus 79 ----~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~-r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~ 153 (518) ....|++.++|+.+||++||+++||+.++.+++++|++|++++ ++..|++..+++.. T Consensus 66 ~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~~~------------------ 127 (378) T protein:vir:85 66 DTLISMAGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSETGELLDLLFAN------------------ 127 (378) T ss_pred ccccccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCCCceEEEEEecC------------------ Confidence 2345667778888999999999999999999999999999865 44555544333211 Q ss_pred eecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHH Q lcl|NC_021305. 154 FQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQ 233 (518) Q Consensus 154 ~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~ 233 (518) + .+.|.+++|||++.+.. ...+.+.+..+...+. 
.+++ ++.++|+|+.++.+++++.+ T Consensus 128 -------~--~~~~~~~dvih~~~~~~--~~~~~~~~~~a~~~~~----------~~~~-~~~~~g~l~~~~~l~~~~~~ 185 (378) T protein:vir:85 128 -------D--KKEYKPEELVRLVSPFY--INEDTSILDNALASIQ----------TKLE-QGKLRGLLKINAFLDIDNTQ 185 (378) T ss_pred -------C--CEEEcccceEEEecCcC--ccchhhHHHHHHHHHH----------HHHh-cCCcceEEEeCCcCCHHHHH Confidence 1 23567889999985432 1224555554444332 2344 45789999999999999888 Q ss_pred HHHHHHHHHh---cCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHH Q lcl|NC_021305. 234 RLREQFDRAH---SGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM 310 (518) Q Consensus 234 ~~~~~~~~~~---~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~ 310 (518) +++++|++.+ .+..++|+++||++|++|++++.++.++++ +.++++.++||++|||||++|+ +++.+++. T Consensus 186 ~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~------~s~~e~~~ 258 (378) T protein:vir:85 186 EYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIELIKSELLTGYFMNENILL------GTATQEQQ 258 (378) T ss_pred HHHHHHHHHHHHhhcccccccceecCCCceEEeccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhc------CCchHHHH Confidence 8888776654 344678999999999999999999999996 6778999999999999999994 35568899 Q ss_pred HHHHHHHhhHHHHHHHHHHHHhhhhhhccc--c------cceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Q lcl|NC_021305. 311 RAFYRDTMAIPIARIQSAMDKYVGQYWVRK--N------RMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGL 382 (518) Q Consensus 311 ~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~--~------~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~ 382 (518) ..|+.+||.||+.+||++|+++|+++.++. 
+ .++|+++.+++.|.+++++.+.+++++|++|+||+|+++|+ T Consensus 259 ~~f~~~tL~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl 338 (378) T protein:vir:85 259 IYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGE 338 (378) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhccccceeeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 999999999999999999999999875442 1 25789999999999999999999999999999999999999 Q ss_pred CCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCC Q lcl|NC_021305. 383 PRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVAS 428 (518) Q Consensus 383 ~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (518) ||++ |||++++|+|++|++.....+...+...+ ++.+.++ T Consensus 339 ~p~~--gGD~~~~~~N~~~~~~~~~~~~~~~~~~~----~~e~~n~ 378 (378) T protein:vir:85 339 QPIE--GGDIYIANLNAVAVKNLSDLQGSRKDVAS----TDETNNQ 378 (378) T ss_pred CCCC--CCCeEeecccccccccchhhcCccCCCCC----CCCCCCC Confidence 9996 89999999999999876554332222111 1111111 No 87 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=100.00 E-value=1.3e-70 Score=403.80 Aligned_cols=359 Identities=13% Similarity=0.064 Sum_probs=266.4 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc-- Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE-- 78 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~-- 78 (518) =||++-++...-.. ..+. ......+....+++.++|++||++||++||++|+++|++...+. 
T Consensus 2 ~if~~~~~~~~~~~---------~~~~-------~~~~~~~~~~~~~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~ 65 (378) T protein:vir:94 2 NLFGKVVSFSRGKL---------NNDT-------QRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGS 65 (378) T ss_pred chhHHhHhhhhccc---------ccCc-------ceeeeeecchhhhhhHHHHHHHHHHHHhHhhCceeeeeeccccccc Confidence 24553322100000 0000 01111223445788899999999999999999999998764432 Q ss_pred ----eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEE-EcCCCceEEEEeeCCceeEEEEcCCceeeEEe Q lcl|NC_021305. 79 ----TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQ-KNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYY 153 (518) Q Consensus 79 ----~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~-r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~ 153 (518) ....|++.++|+.+||++||+++||+.++.++++.|++|++++ ++..|++..+++.. T Consensus 66 ~~~~~~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~~~------------------ 127 (378) T protein:vir:94 66 DTLISMAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETGELLDLLFAN------------------ 127 (378) T ss_pred ccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCCcEEEEEEec------------------ Confidence 2345777788888999999999999999999999999999855 45556655444321 Q ss_pred eecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHH Q lcl|NC_021305. 154 FQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQ 233 (518) Q Consensus 154 ~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~ 233 (518) + ..+|++++|+|++.+...+ .+.+++..+...+.. .++ ++.++|+|+.++.+++++.+ T Consensus 128 -------~--~~~~~~~dvih~~~~~~~~--~~~~~~~~~~~~~~~----------~~~-~~~~~g~l~~~~~l~~~~~~ 185 (378) T protein:vir:94 128 -------D--KKEYKPEELVRLTSPFYIN--EDTSILDNALASIQT----------KLE-QGKLRGLLKINAFLDIDNTQ 185 (378) T ss_pred -------C--cEEechhceeeecCcCCcc--cchhHHHHHHHHHHH----------HHh-hCCcccceeeCCcCCHHHHH Confidence 1 1357889999999655433 356777766654432 233 34688999999999988766 Q ss_pred HHHHHHHHHh---cCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHH Q lcl|NC_021305. 
234 RLREQFDRAH---SGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM 310 (518) Q Consensus 234 ~~~~~~~~~~---~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~ 310 (518) +++++|++.+ .+..++|+++||++|++|++++.++.++++ +.++++.++||++|||||++|+ .+..+++. T Consensus 186 ~~~e~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgvPp~~l~------g~~~e~~~ 258 (378) T protein:vir:94 186 EYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILL------GTATQEQQ 258 (378) T ss_pred HHHHHHHHHHHHhhcccccccceeccCCceEEEccCChHHhhH-HHHHHHHHHHHHHhCCCHHHhc------CCchHHHH Confidence 6666555543 233577889999999999999999999996 6678999999999999999984 23447888 Q ss_pred HHHHHHHhhHHHHHHHHHHHHhhhhhhccc--------ccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Q lcl|NC_021305. 311 RAFYRDTMAIPIARIQSAMDKYVGQYWVRK--------NRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGL 382 (518) Q Consensus 311 ~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~--------~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~ 382 (518) ..|+++||.||++.||++|+++|+++.++. ..++|+++.+++.|.+++++++.+++++|++|+||+|+++|+ T Consensus 259 ~~f~~~tl~P~~~~ie~~l~~~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~NE~R~~~g~ 338 (378) T protein:vir:94 259 IYFYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGE 338 (378) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhhcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 999999999999999999999999865432 236799999999999999999999999999999999999999 Q ss_pred CCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCC Q lcl|NC_021305. 
383 PRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVAS 428 (518) Q Consensus 383 ~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (518) ||++ |||++++|+|++|++.....+...+...+.++ +.++ T Consensus 339 ~p~~--ggd~~~~~~n~~~~~~~~~~~~~~~~~~~~~e----~~n~ 378 (378) T protein:vir:94 339 QPIE--GGDVYIANLNAVAVKNLSDLQGNRKDVTSTDE----TNNQ 378 (378) T ss_pred CCCC--CCCeeeecccccchhcchhcccccCCCCCCCC----CCCC Confidence 9995 89999999999999876655433222111111 1111 No 88 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=100.00 E-value=1.2e-70 Score=403.98 Aligned_cols=482 Identities=15% Similarity=0.135 Sum_probs=319.4 Q ss_pred CcCCCCCCCC-----cccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_021305. 1 MLLANGQTLS-----APAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSG 75 (518) Q Consensus 1 ~~f~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~ 75 (518) -++..+.... ++..+.++ ....++......++. +.......+..+++|++||+++++.||+++|.+....+ T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~p~~--~~~~L~~~~e~~~~~~~~i~~~~~~iag~g~~~~~~~~ 87 (651) T protein:vir:99 12 KVHVEGLGGEADLAKSPNSTQIP--DHRIQSHNVGVNPPY--NPDRLAAFLELNETLATGIRKKSRYEVGFGFDLVPAQG 87 (651) T ss_pred EEEeecccccccccccccccccc--hhhhcccCCCCCCCC--CHHHHHHHHhcChHHHHHHHHHhhhhhccCceeeeccc Confidence 2233222111 22222221 111122222222222 45566777788999999999999999999999864221 Q ss_pred -Cc---c-eec---------cchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEE Q lcl|NC_021305. 76 -DT---E-TEE---------SDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAI 141 (518) Q Consensus 76 -~~---~-~~~---------~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v 141 (518) +. . .+. 
.++.+..+...+|+.+++.+|++.++.|++.+|++|++++++..|.++.++++++..+++ T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~~g~pv~L~~lp~~~~Rv 167 (651) T protein:vir:99 88 VDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDIEGRPVGLAYVPARTVRV 167 (651) T ss_pred CCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcCccchhhhhhcChhheee Confidence 11 1 111 122233445667899999999999999999999999999999999999999999988766 Q ss_pred EEcCCce--------------------------------eeEEee----------------------------------- Q lcl|NC_021305. 142 KRNSRTG--------------------------------RYEYYF----------------------------------- 154 (518) Q Consensus 142 ~~~~~~~--------------------------------~~~~~~----------------------------------- 154 (518) ..+.... .+.+.+ T Consensus 168 ~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~~~~~~d~~~~~~~~~~ 247 (651) T protein:vir:99 168 RRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEPTIRYREDEESEREPIFV 247 (651) T ss_pred ecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcceeEEeccCcceeeeeecc Confidence 4432110 000000 Q ss_pred -----ecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCc-cCC Q lcl|NC_021305. 
155 -----QAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK-RLS 228 (518) Q Consensus 155 -----~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~ 228 (518) .+.....+....+++++|||||.+++.+..+|+||+..+..++....++++++.++|+||++|++||++++ .++ T Consensus 248 ~~~~g~~~~~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i~~a~~a~~~~~~~f~NG~~p~gil~~~~~~ls 327 (651) T protein:vir:99 248 DRETGDVTTGDANGLENRPANELIFIPNPSILEDDYGVPDWVSAIRTISADEAAKDYNRDFFDNDTIPRMVIKVTGGELS 327 (651) T ss_pred cceeeeEEEcCCCceeEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCC Confidence 00001122345678999999999987677899999999999999999999999999999999999999875 699 Q ss_pred HHHHHHHHHHHHHHhcCccccCCeeecCC-----------CcceeeccCCh-hhHHHHHHHHHHHHHHHHHhcCCHHHhc Q lcl|NC_021305. 229 EAAQQRLREQFDRAHSGSSNTGKTMVVEE-----------GMEPIPLQLTA-VEMQFIEARQLNREEVCGVYDIAPPIVH 296 (518) Q Consensus 229 ~~~~~~~~~~~~~~~~g~~n~g~~~vl~~-----------g~~~~~l~~~~-~d~~~~e~~~~~~~~Ia~~fgVPp~~lg 296 (518) +++.+++++.|++.+ .|+|+++||+. |++|++++.++ +|+||++++++++.+||++|||||++|| T Consensus 328 ~e~~~~lr~~~~~~~---~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfle~r~~~~~eIa~afgVPp~~lG 404 (651) T protein:vir:99 328 EESKRDLRQMLNGLR---EESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFRQFREKNEHEIAKVLEVPPVKIG 404 (651) T ss_pred HHHHHHHHHHHHHHh---ccCCceEEeecccccccccccCCceEEEcCcCchhhHHHHHHHHHHHHHHHHHhCCCHHHhc Confidence 999999999999865 36789998865 89999999876 5999999999999999999999999999 Q ss_pred cccccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhccc----ccceecchhhhhcCHHHHHHHHHHHHhCCCcC Q lcl|NC_021305. 297 ILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRK----NRMKFDIDDVIQPDWEAKSESTQKMVNSGVAT 372 (518) Q Consensus 297 ~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~----~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T 372 (518) +.+++|++|+|++.+.|+++||.|++..||++||++|++..... 
.+++|+...+++.|.+++++.+..++++|++| T Consensus 405 ~~~~~~~sn~E~~~~~f~~~tL~P~~~~ie~eln~kLl~~~e~~~~~~i~~ef~~~~llr~D~~~~~e~~~~~i~~G~~T 484 (651) T protein:vir:99 405 VTDSANRSNSDQQDKDFALEVIQPEQHTFAEWLYQIIHQQALGVTDWTIEYELRGADQPKQEAQLAEQRVRAMRLAGVGL 484 (651) T ss_pred cCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCceEEEEeccchhhhccHHHHHHHHHHHHhCCCcC Confidence 99999999999999999999999999999999999999876542 46788889999999999999999999999999 Q ss_pred HHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHH Q lcl|NC_021305. 373 PNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSG 452 (518) Q Consensus 373 ~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (518) +||+|+++|+||+++++||..+.+.+...++...++.. .+ +..+.+++.+ ....+.+. .... T Consensus 485 ~NE~R~~lglppi~~~~gd~~l~~~~~~~~g~~~~gge----~~-----~~~~~~~~~~---~~~~e~~~------~~~~ 546 (651) T protein:vir:99 485 VDEAREELGLDPLGEPYGEMTLSEFEAEVAGDVAGGGE----TE-----AVHEPPEENK---IGEREWDT------VKSE 546 (651) T ss_pred HHHHHHHhCCCCCCCccccccccccccccccccccCCC----Cc-----ccccCccccc---cccchhhh------hhhh Confidence 99999999999999889999888777655443211110 00 0000000000 00000000 0000 Q ss_pred HHHHHHhhcccCCchhhHHHHHHHHHhh-ccccCcCchhHH----------HHHHH----HHHHhHHHHhhh---hhhhc Q lcl|NC_021305. 453 KTEPRRLMQKPPPKESSPKHLRAVKGAM-GRGKDIKGFALQ----------LAEKY----PDDLEDILLAVQ---LALAE 514 (518) Q Consensus 453 ~~~~~~~~~k~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~----------~~~~~----~~~~~~~~~~~~---~~~~~ 514 (518) ....+.....+ -.++.+ +.|=+....-|. ..-.| ...-+.+.-|.+ ---.. T Consensus 547 ~~~~e~~~~~~-----------v~ss~~~~~gyd~~~~~l~~~f~~~~~~~~~y~y~~v~~~~~~~~~~a~s~g~~~~~~ 615 (651) T protein:vir:99 547 LTTKDPIEQMQ-----------FSSSNLDEGLYDFGENELYLSFLRDEGQSSLYAYVDVPASEWSALANAGSHGGYHYDN 615 (651) T ss_pred hcccchhhhhh-----------HHHHHHHhhcCCCccceEEEEEeecCCCCceeeeeCCCHHHHHHHhcCcccceeehhc Confidence 00000000000 011111 011111111111 00011 111111111111 01111 Q ss_pred ccCC Q lcl|NC_021305. 
515 RKDN 518 (518) Q Consensus 515 ~~~~ 518 (518) -|++ T Consensus 616 i~~~ 619 (651) T protein:vir:99 616 IRLE 619 (651) T ss_pred cccc Confidence 1111 No 89 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=100.00 E-value=7.7e-63 Score=361.10 Aligned_cols=276 Identities=16% Similarity=0.269 Sum_probs=252.5 Q ss_pred hccCceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEE Q lcl|NC_021305. 63 LARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIK 142 (518) Q Consensus 63 ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~ 142 (518) ||++||++|+++. ...++..++|+.+||++||+.+||+.++.+++++|++|++++|+..|.+++|+|++|++|++. T Consensus 1 ia~l~~~~~~~~~----~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~~l~~~~v~v~ 76 (278) T protein:vir:78 1 MASLPLKMYEDYK----VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEML 76 (278) T ss_pred CccceeEEEecCc----ccccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEEEECCceeEEE Confidence 9999999998653 335777888889999999999999999999999999999999999999999999999999999 Q ss_pred EcCCceeeEEeeecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccc Q lcl|NC_021305. 143 RNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLR 222 (518) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~ 222 (518) .+.++...+|.+.. 
.++..+.|++++|||++++++.+..+|+||+.++..++....++++++...+.+ .|+++++ T Consensus 77 ~~~~~~~~~y~~~~---~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~~--~~~~i~~ 151 (278) T protein:vir:78 77 IENQSRELYYSIHA---ATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQK--PDSFMLK 151 (278) T ss_pred EcCCCceEEEEEEc---CCceEEEEccccEEEECCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHhcC--CCcEEEE Confidence 99888888776643 345678899999999999988777899999999999999999999887655555 4789999 Q ss_pred cCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccc Q lcl|NC_021305. 223 HEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRAT 302 (518) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~ 302 (518) .++.+++++.+++++.|++.+ .++|+++++++|++|++++.++.|++|.+++++..++||++|||||.++|..+++| T Consensus 152 ~~~~l~~e~~~~~~~~~~~~~---~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~ 228 (278) T protein:vir:78 152 YGSNVGKEKRQQVLEDFKQYY---EENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTN 228 (278) T ss_pred eCCCCCHHHHHHHHHHHHHHh---ccCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC Confidence 999999999999999999876 36789999999999999999999999999999999999999999999999999999 Q ss_pred cCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhc--ccccceecchhh Q lcl|NC_021305. 
303 FSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWV--RKNRMKFDIDDV 350 (518) Q Consensus 303 ~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~--~~~~~~fd~~~l 350 (518) ++|.+++.+.|+++||.|+++.|+++||++|+++.+ .+++++||++.| T Consensus 229 ~sn~~~~~~~~~~~~l~P~~~~i~~~ln~~L~~~~e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 229 FAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKIGILNLTLNLI 278 (278) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhcCCceEEEecccC Confidence 999999999999999999999999999999998755 468899999999 No 90 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=100.00 E-value=3.7e-58 Score=335.45 Aligned_cols=315 Identities=17% Similarity=0.222 Sum_probs=260.0 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) -.|++|.+-|.-..+.+.+.+...+.+ .+...++. ......++..++.+.+||...++.+++ T Consensus 54 ~~f~fg~p~~v~~~~~~~~~~~~~~~~-~~~~pp~~--~~~La~~~~~~~~h~s~l~~k~n~l~~--------------- 115 (376) T protein:vir:10 54 EVFTFDDPTPVMNRAEILDYVECWSNG-EWFEPPVS--FAGLAKSFRASTHHSSALFFKANVLAS--------------- 115 (376) T ss_pred EEEEcCCceeccCcchhhhhhhhhhcC-ceecCCCC--HHHHHHHHhhhHHhhhhHHHHhHHHHh--------------- Confidence 356666544443433333433332222 22333433 223457778888889998888776544 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~ 160 (518) .-+||+.||+.+|++ ++.+++++||+|++++|+..|++++|+|++|.+|++..+.++.. +. .. 
T Consensus 116 ---------~~~Pnp~lT~~~f~~-~v~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~d~~~~~--~~-----~~ 178 (376) T protein:vir:10 116 ---------TFRPHRWLSRHAFER-WALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFNGFV--YV-----NG 178 (376) T ss_pred ---------ccCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCEEEEEEeCCcceEEEeeCCeEE--EE-----Ec Confidence 236999999999985 56799999999999999999999999999999999988765322 11 12 Q ss_pred CceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHH Q lcl|NC_021305. 161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQF 239 (518) Q Consensus 161 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~ 239 (518) ++..+.|++++|||++.+++....||+|++.+++.++....+++.|+.++|+||++|++||.++ ..+++++.++++++| T Consensus 179 ~~~~~~~~~~eViHir~~~~~~~~yGls~~~~a~~si~l~~aa~~f~~~~f~NGa~pggIl~~~d~~l~~e~~~~lr~~~ 258 (376) T protein:vir:10 179 WQERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDAL 258 (376) T ss_pred CCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHH Confidence 4567889999999999999888889999999999999999999999999999999999999876 479999999999999 Q ss_pred HHHhcCccccCCeeec-----CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc--cccCCHHHHHHH Q lcl|NC_021305. 240 DRAHSGSSNTGKTMVV-----EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDR--ATFSNISAQMRA 312 (518) Q Consensus 240 ~~~~~g~~n~g~~~vl-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~~ 312 (518) ++ ..|..|.++++|+ ++|++|++++.++.|+||.+.+++++++||++|||||.++|+.++ ++++|.|++.+. 
T Consensus 259 ~~-~~G~~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~eq~~~~ 337 (376) T protein:vir:10 259 KN-AKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARV 337 (376) T ss_pred HH-hcCccccCceeEecCCCCccceEEEEccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHH Confidence 87 5788999999988 578999999999999999999999999999999999999999765 468999999999 Q ss_pred HHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHH Q lcl|NC_021305. 313 FYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEA 357 (518) Q Consensus 313 ~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~ 357 (518) |++++|.|+++.|+ ++|.+|.. ..++|+...|++.|.+. T Consensus 338 f~~~~L~Pl~~~ie-eln~~L~~-----~~~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 338 FGRNEIRPLQARFA-ELNDWLGE-----EVVRFDDYEIPPAPVAA 376 (376) T ss_pred HHHHHHHHHHHHHH-HHHhhccc-----cccccChhHhhcccccC Confidence 99999999999998 57877743 25899999999999988 No 91 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=100.00 E-value=1.2e-58 Score=338.09 Aligned_cols=339 Identities=16% Similarity=0.201 Sum_probs=253.6 Q ss_pred CcCCCCCCCC-------cc---------cccccchhhhhhhcccccccccccccchhhh-----HHHhhcHHHHHHHHHH Q lcl|NC_021305. 1 MLLANGQTLS-------AP---------AMAELSPQMQDSYYYAPAVGMQLERQFSLYG-----GIYKNQPWVRTVIAKR 59 (518) Q Consensus 1 ~~f~~~~~~~-------~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~v~~~v~~i 59 (518) |==++++... .. .....++..-.+||. +.. +...+....+. 
+.|++.|.-+.|+..+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~fg~-p~~-~~~~~~~~~~~~~~~~~~~~~~pi~~~~la~~ 78 (368) T protein:vir:79 1 MSRNKTRRAARAASAHVRTANTDAPTEHHTDRAAQAEVFSFGD-PVE-VLDRRELLDYVECMRMGQWYEPPMPWDGLARS 78 (368) T ss_pred CCccccccchhccCcccccccccCcchhhccccCceEEEEcCC-cee-ecchhhHHHHHHHHhccchhccCcCHHHHHHH Confidence 3222211110 00 001111211122332 211 11111111112 2244455555555444 Q ss_pred HHhhccCceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCcee Q lcl|NC_021305. 60 AQALARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRV 139 (518) Q Consensus 60 a~~ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v 139 (518) .+.-+ .+ +......+.+..|+.+||+.||+.+|++ ++.+++++||+|++++|+..|++++|+|++|.+| T Consensus 79 ~~~~~---~h-------~~~~~~~~n~l~l~~~Pn~~~t~~~f~~-l~~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v 147 (368) T protein:vir:79 79 FRAAA---HH-------SSAVYVKRNILVSTFIPHPLLSRATFER-LVLDWQVFGNAYLERRENVLGGTIRLDTPLAKYV 147 (368) T ss_pred Hhhcc---cc-------chhhhhhcchhhhhcCCCcCCCHHHHHH-HHHHHhhcCCeEEEEEEcCCCCEEEEEEeCcccc Confidence 33322 11 1222334455678889999999999975 7889999999999999999999999999999999 Q ss_pred EEEEcCCceeeEEeeecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCccc Q lcl|NC_021305. 140 AIKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNL 219 (518) Q Consensus 140 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~ 219 (518) ++..+.+. +++. 
..++..++|++++|||++.+++.+.+||+||+.++..++....+++.|+.++|+||++|++ T Consensus 148 ~~~~~~~~--~~~~-----~~~~~~~~~~~~dIihir~~~~~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~NGa~~~g 220 (368) T protein:vir:79 148 RRGLDLNT--YFFV-----QNWQQPYTFAAGSVFHLQEPDINQEVYGLPEYLSALNATWLNESATLFRRRYYKNGSHAGF 220 (368) T ss_pred eeeccCCE--EEEE-----ecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCce Confidence 88765432 2221 1246678899999999999998888899999999999999999999999999999999999 Q ss_pred ccccC-ccCCHHHHHHHHHHHHHHhcCccccCCeeec-----CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHH Q lcl|NC_021305. 220 VLRHE-KRLSEAAQQRLREQFDRAHSGSSNTGKTMVV-----EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPP 293 (518) Q Consensus 220 il~~~-~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~ 293 (518) ||.++ ..+++++.++++++|++ +.|..|.|+++|+ ++|++|++++.++.|+||.+.+++++++||++|||||. T Consensus 221 il~~~~~~l~~e~~~~lk~~~~~-~~G~~N~g~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~ 299 (368) T protein:vir:79 221 ILYMTDAAQKQEDVDTLREAMKS-AKGPGNFRNLFMYAPNGKKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPPQ 299 (368) T ss_pred EEEeCCCCCCHHHHHHHHHHHHH-hcCCcccCceeEecCCCCccceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCHH Confidence 99876 57999999999999987 5788999999998 67899999999999999999999999999999999999 Q ss_pred Hhcccccc--ccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhC Q lcl|NC_021305. 294 IVHILDRA--TFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNS 368 (518) Q Consensus 294 ~lg~~~~~--~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~ 368 (518) ++|+.++. +++|++++.+.|++++|.|+++.|+ ++|.+|.. 
..++|+...+++.|.+.++.....- + T Consensus 300 llGi~~~~t~~~sn~e~~~~~f~~~~l~Pl~~~ie-~ln~~l~~-----e~~rF~~~~l~~~D~~a~a~~~~rs--a 368 (368) T protein:vir:79 300 LMGIIPNNTGGFGDVEKAAMVFARNEVKPLQDRLL-AINDWIGD-----EVVRFAPYALGGHDQPAAAPGGQRS--A 368 (368) T ss_pred HccccCCCCCccccHHHHHHHHHHHHHHHHHHHHH-HHHhccCc-----ceeeechhHhhcccccccCCccccc--C Confidence 99997654 4899999999999999999999998 68887743 3578999999999998887622211 1 No 92 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=100.00 E-value=5.6e-57 Score=328.99 Aligned_cols=317 Identities=14% Similarity=0.230 Sum_probs=243.9 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) =.|++|.+-|.-..+.+++.+......+.+...++. ......++..++.+.+|+.. T Consensus 22 ~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~pp~~--~~~la~l~~~~~~h~~~i~~---------------------- 77 (346) T protein:vir:10 22 EIFSFGDPIPVLDRADILNYLECSAMYEKWYNPPMS--FDGLAKSLRSSTHHESAIIT---------------------- 77 (346) T ss_pred EEEecCCcceecCchhHHHHHHHhhcCCceEecCCC--HHHHHHHHHhhhhcchhhhh---------------------- Confidence 133333322222222222222111000111111111 11112233333333333332 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~ 160 (518) +.+.+..|+.+||+.||+.+|++ ++.+++++||+|++++|+..|++++|+|++|.+|++..+.++..+ .+ ... 
T Consensus 78 -k~n~l~~l~~~Pn~~~t~~~f~~-~~~d~ll~Gnay~~i~r~~~G~~~~L~pl~~~~v~~~~~~~~~~~--~~---~~~ 150 (346) T protein:vir:10 78 -KANILLSTCEVDSRYLSRRDLSS-FVKDYLVFGNAYFEVVRNRLGQVQRIESPLAKYVRKGLEAGQFYY--VP---QRF 150 (346) T ss_pred -hhhhHHHHHhCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCcEEEEEEecCCceEEEEcCCeEEE--EE---Ecc Confidence 23456667889999999999987 568999999999999999999999999999999998877654322 11 123 Q ss_pred CceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHH Q lcl|NC_021305. 161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQF 239 (518) Q Consensus 161 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~ 239 (518) +++.++|++++|||++.+++....||+|++..+..++....++++++.++|+||++|++||+++ ..+++++.++++++| T Consensus 151 ~g~~~~~~~~dIih~r~~~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~d~~l~~e~~~~i~~~~ 230 (346) T protein:vir:10 151 DHQEHEFAKGSIYHLLEPDINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFYMSDASQKQEDVENIRQQL 230 (346) T ss_pred CCeEEEEecccEEEecCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHHHHHHH Confidence 5677899999999999999887889999999999999999999999999999999999999875 578999999999999 Q ss_pred HHHhcCccccCCeeecC-----CCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc--cccCCHHHHHHH Q lcl|NC_021305. 240 DRAHSGSSNTGKTMVVE-----EGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDR--ATFSNISAQMRA 312 (518) Q Consensus 240 ~~~~~g~~n~g~~~vl~-----~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~~ 312 (518) ++.+ |..|.++++|+. .|+++++++.++.|+||.+.+++++++||++|||||.++|+.++ +++++.+++.+. 
T Consensus 231 ~~~~-g~~n~~~~~vl~~~~~~~gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~~~~~ 309 (346) T protein:vir:10 231 KQSK-GVGNFKNLFVHAPNGKKDGIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVADAAEV 309 (346) T ss_pred HHhc-CccccCceeEecCCCCccceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHH Confidence 8874 678999999985 47899999999999999999999999999999999999998765 458999999999 Q ss_pred HHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCH Q lcl|NC_021305. 313 FYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDW 355 (518) Q Consensus 313 ~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~ 355 (518) |++++|.|+++.||+ +|.+|.. ..++|+...+++.|. T Consensus 310 f~~~~l~P~~~~iee-~n~~L~~-----e~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 310 FFITEIEPLQERLKE-FNQWLGQ-----EVIKFKPSKLLQRTQ 346 (346) T ss_pred HHHHHHHHHHHHHHH-HHhhccc-----ceeeechhhhcccCC Confidence 999999999999985 7777743 357999999999998 No 93 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=100.00 E-value=3.5e-57 Score=330.10 Aligned_cols=315 Identities=17% Similarity=0.220 Sum_probs=255.1 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) -.|++|.+-|.-..+.+.+.+...+. +.+...++. 
......++..++.+.+||...++.+++ T Consensus 29 ~~~~~~~p~~v~~~~~~~~~~~~~~~-~~~~~pp~~--~~~la~~~~~~~~h~~~l~~k~n~l~~--------------- 90 (351) T protein:vir:79 29 EVFTFDDPTPVMNRAEILDYVECWSN-GEWFEPPVS--FAGLAKSFRASTHHSSALFFKANVLAS--------------- 90 (351) T ss_pred EEEEcCCceeecCcchhhhhhhhhhc-CceecCCCC--HHHHHHHHhhhHhhhhhhhhhhhHHhh--------------- Confidence 24555543333233333333322222 222233332 233456677788888888776665544 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~ 160 (518) .-+||+.||..+|+ .++.+++++||+|++++|+..|.+++|+|++|.+|++..+.++..+ . .. T Consensus 91 ---------~~~Pnp~~t~~~f~-~~v~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~~~~~~~~~~----~---~~ 153 (351) T protein:vir:79 91 ---------TFRPHRWLSRHAFE-RWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFSGFVY----V---NG 153 (351) T ss_pred ---------cccCCCCCCHHHHH-HHHHHHHhcCCeEEEEEECCCCCEEEEEEeCCcceeeeecCCeEEE----E---ec Confidence 23699999999997 4678999999999999999999999999999999998876654211 1 13 Q ss_pred CceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHH Q lcl|NC_021305. 
161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQF 239 (518) Q Consensus 161 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~ 239 (518) ++..+.|++++|||++.+++....||+|++..++.++....+++.|+.++|+||++|++||..+ ..+++++.++++++| T Consensus 154 ~g~~~~~~~~eIihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~il~~~~~~ls~e~~~~lk~~~ 233 (351) T protein:vir:79 154 WQERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDAL 233 (351) T ss_pred CceEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHH Confidence 4667889999999999999988899999999999999999999999999999999999999876 479999999999999 Q ss_pred HHHhcCccccCCeeec-----CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc--cccCCHHHHHHH Q lcl|NC_021305. 240 DRAHSGSSNTGKTMVV-----EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDR--ATFSNISAQMRA 312 (518) Q Consensus 240 ~~~~~g~~n~g~~~vl-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~~ 312 (518) ++ ..|..|.++++|+ ++|+++++++.++.|+||.+++++++++||++|||||.++|+.+. ++++|.|++.+. T Consensus 234 ~~-~~G~~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~ef~e~k~~s~~eI~~a~~VPp~llGi~~~~t~~~~n~e~~~~~ 312 (351) T protein:vir:79 234 KN-AKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARV 312 (351) T ss_pred HH-hcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHH Confidence 87 5788899999988 578999999999999999999999999999999999999999765 458999999999 Q ss_pred HHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHH Q lcl|NC_021305. 313 FYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEA 357 (518) Q Consensus 313 ~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~ 357 (518) |+++||.|+++.|++ +|.+|.. ..++|+...+++.|.+. 
T Consensus 313 f~~~~l~Pl~~~ie~-ln~~lg~-----~~~~F~~~~llr~d~~a 351 (351) T protein:vir:79 313 FGRNEIRPLQARFAE-LNDWLGD-----EVVTFDDYEIPPAPVAA 351 (351) T ss_pred HHHHHHHHHHHHHHH-HHhhcCc-----ceeeeChhhhccccccC Confidence 999999999999985 7777632 35799999999999988 No 94 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=100.00 E-value=4.9e-57 Score=329.27 Aligned_cols=315 Identities=17% Similarity=0.228 Sum_probs=255.4 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) -.|++|.+-|.-..+.+.+.+...+. +.+...++. ......++..++.+.+||...++.+++ T Consensus 29 ~~~~~~~p~~v~~~~~~~~~~~~~~~-~~~~~pp~~--~~~la~~~~~~~~h~~~l~~k~n~l~~--------------- 90 (351) T protein:vir:78 29 EVFTFDDPTPVMNRAEILDYVECWSN-GEWFEPPVS--FAGLAKSFRASTHHSSALFFKANVLAS--------------- 90 (351) T ss_pred EEEEcCCceeecCcchhhhhhhhhcc-CceecCCCC--HHHHHHHHhhhHhhhhhhhhhhhHHhh--------------- Confidence 24555544333333333333322222 222233332 233446667788888888777665544 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~ 160 (518) .-+||+.||..+|++ ++.+++++||+|++++|+..|++++|+|+++.+|++..+.++..+ . .. 
T Consensus 91 ---------~~~Pn~~~t~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~~~~~~~~--~-----~~ 153 (351) T protein:vir:78 91 ---------TFRPHRWLSRHAFER-WALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFSGFVY--V-----NG 153 (351) T ss_pred ---------cccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCEEEEEEecCcceEEeeeCCeEEE--E-----ec Confidence 236999999999975 567999999999999999999999999999999999887654221 1 12 Q ss_pred CceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHH Q lcl|NC_021305. 161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQF 239 (518) Q Consensus 161 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~ 239 (518) ++..+.|++++|||++.+++....||+|++..++.++....++..|+.++|+||++|++||..+ ..+++++.++++++| T Consensus 154 ~~~~~~~~~~eVihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pggIl~~~~~~ls~e~~~~lr~~~ 233 (351) T protein:vir:78 154 WQERHEFAPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDAL 233 (351) T ss_pred CCeEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHH Confidence 4667889999999999999888899999999999999999999999999999999999999876 479999999999999 Q ss_pred HHHhcCccccCCeeec-----CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc--cccCCHHHHHHH Q lcl|NC_021305. 240 DRAHSGSSNTGKTMVV-----EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDR--ATFSNISAQMRA 312 (518) Q Consensus 240 ~~~~~g~~n~g~~~vl-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~~ 312 (518) ++ ..|..|.++++|+ ++|+++++++.++.|+||.+.+++++++||++|||||.++|+.++ ++++|.|++.+. 
T Consensus 234 ~~-~~G~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~ 312 (351) T protein:vir:78 234 KN-AKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARV 312 (351) T ss_pred HH-hcCcccccceeeecCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHH Confidence 86 5788999999988 578999999999999999999999999999999999999999765 458999999999 Q ss_pred HHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHH Q lcl|NC_021305. 313 FYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEA 357 (518) Q Consensus 313 ~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~ 357 (518) |++++|.|+++.|++ ++.+|.. .+++|+...|++.|.+. T Consensus 313 f~~~~l~P~~~~iee-~n~~l~~-----~~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 313 FGRNEIRPLQARFAE-LNDWLGD-----EVVRFDDYEIPPAPVAA 351 (351) T ss_pred HHHHHHHHHHHHHHH-HHhhcCc-----cceecChhhhccccccC Confidence 999999999999985 6666633 25899999999999988 No 95 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=100.00 E-value=7e-57 Score=328.43 Aligned_cols=320 Identities=12% Similarity=0.089 Sum_probs=254.7 Q ss_pred CcCCCC-CCCCcccccccchhhhhhh-cccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 
1 MLLANG-QTLSAPAMAELSPQMQDSY-YYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~~~-~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) -.|++| ++-|.-..+.+.+.+...+ +.+.+...+++ ....+.++..++.+.+||....+.+++ T Consensus 18 ~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~epp~~--~~~La~l~~~n~~h~~~i~~k~N~l~~------------- 82 (348) T protein:vir:26 18 SVYSFDPNPEPVDTNSWMTRYCELFYNDFDDYWEPPIS--LKGLAEIANANGYHGSLLKARANYVAG------------- 82 (348) T ss_pred eEEEecCCCeeecCcchHHHHHHHHhcCCCccccCCCC--HHHHHHHHhhhhhhhhhHhhhhhHHhh------------- Confidence 456666 3222223333344333322 22233344443 234466778888899998888876654 Q ss_pred eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccc Q lcl|NC_021305. 79 TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGA 158 (518) Q Consensus 79 ~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~ 158 (518) .-+||+.||..+|++. +.+++++||+|++++|+..|++++|+|++|.+|++..+.. . |.+. T Consensus 83 -----------~~~Pn~~~t~~~f~~~-~~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~v~~~~d~~--~--~~~~--- 143 (348) T protein:vir:26 83 -----------RFMNGGGLPMYKMNSA-CWDYFGLGMSAFVKIRSYLKNVIALEPLPMVHMRKRKNGD--F--VQLL--- 143 (348) T ss_pred -----------cccCCCCCCHHHHHHH-HHHHHhcCCeEEEEEEcCCCcEEEEEEecCceeEeeecCc--E--EEEE--- Confidence 1259999999999765 6799999999999999999999999999999999876532 1 2121 Q ss_pred ccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHH Q lcl|NC_021305. 
159 GVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLRE 237 (518) Q Consensus 159 ~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~ 237 (518) .++..+.|++++|||++.+++....||+|++..+++++....+++.|+.++|+||++|++||..+ ..+++++.+++++ T Consensus 144 -~~g~~~~f~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~~~~ls~e~~~~lk~ 222 (348) T protein:vir:26 144 -RNNEQKVFKAKDVIFIPQYDPQQQIYGLPDYLGSIQSSLLNRDATLFRRRYYLNGAHMGFIFYATDPNLSEADEKALKE 222 (348) T ss_pred -ecCeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHH Confidence 24567889999999999999888889999999999999999999999999999999999999865 4799999999999 Q ss_pred HHHHHhcCccccCCeeec-----CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc--ccccCCHHHHH Q lcl|NC_021305. 238 QFDRAHSGSSNTGKTMVV-----EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILD--RATFSNISAQM 310 (518) Q Consensus 238 ~~~~~~~g~~n~g~~~vl-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~--~~~~sn~e~~~ 310 (518) +|++. .|..|.++++|+ ++|+++++++.++.++||++.+++++++||++|||||.++|+.+ .++++|++++. T Consensus 223 ~~~~~-~G~~n~~~~~vl~~~g~~~Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~~~~~sn~e~~~ 301 (348) T protein:vir:26 223 KIASS-KGIGNFRSMFVNIPNGKEKGIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQGANVPDPLKVS 301 (348) T ss_pred HHHHh-cCcccccceeEEcCCCCccceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHHccccCCCCCccccHHHHH Confidence 99986 577889999998 78999999999999999999999999999999999999999864 46799999999 Q ss_pred HHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhh-hhcCHHHHHHH Q lcl|NC_021305. 311 RAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDV-IQPDWEAKSES 361 (518) Q Consensus 311 ~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l-~~~d~~~~~~~ 361 (518) +.|++++|.|+++.|+++||++|..+ ...+++|+++.. .+.|... 
+ T Consensus 302 ~~f~~~~l~P~~~~ie~~ln~~l~~~--~~~~~~fdl~~~~e~~~~~a---~ 348 (348) T protein:vir:26 302 QVYDFYEVIPVCKRFMDAVNNDPEIP--DNLKLKFNLNPGVESANGSA---V 348 (348) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhCCC--CccEEEEecCcccccchhhc---C Confidence 99999999999999999999998643 344677776643 2222222 1 No 96 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=100.00 E-value=8.2e-56 Score=322.59 Aligned_cols=311 Identities=14% Similarity=0.168 Sum_probs=241.0 Q ss_pred cCCCCCCCCccccc--ccchhhhhhhcccc-------------------cccccccccchhhhHHHhhcHHHHHHHHHHH Q lcl|NC_021305. 2 LLANGQTLSAPAMA--ELSPQMQDSYYYAP-------------------AVGMQLERQFSLYGGIYKNQPWVRTVIAKRA 60 (518) Q Consensus 2 ~f~~~~~~~~~~~~--~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia 60 (518) |.+++..+ +.+.. ..+.....+||... +...++. ....+.++..++.+.+||...+ T Consensus 1 m~~~~~~~-~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~pp~~--~~~la~l~~a~~~h~s~i~~k~ 77 (340) T protein:vir:98 1 MSKRKPRK-AVAMTASAPQKMEAFTFGEPVPVLDKRDILDYVECISNGKWYEPPVS--FSGLAKSLRSAVHHSSPIYVKR 77 (340) T ss_pred CCCCCCCc-cccccccCccceeEEEcCCceeecCcchhhhhhhhhhcCceecCCCC--HHHHHHHHHhccccchhhhhhh Confidence 44443222 11111 11111111222110 1111111 1112334444555555555555 Q ss_pred HhhccCceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeE Q lcl|NC_021305. 
61 QALARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVA 140 (518) Q Consensus 61 ~~ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~ 140 (518) +.+++ .-+||+.||..+|++ ++.+++++||+|++++|+..|++++|+|+++.+|+ T Consensus 78 n~l~~------------------------~~~Pn~~lt~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr 132 (340) T protein:vir:98 78 NVLAS------------------------TYIPHPLLSRQDFSR-FALDYLVFGNAFLEQRHSVTGQLIKLLTSPAKYTR 132 (340) T ss_pred hHHhh------------------------ccCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceEE Confidence 44433 237999999999975 66899999999999999999999999999999998 Q ss_pred EEEcCCceeeEEeeecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccc Q lcl|NC_021305. 141 IKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLV 220 (518) Q Consensus 141 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~i 220 (518) +..+.+ .+|.+. .++..+.|++++|||++.+++....||+|++..++.++....+++.|+.++|+||++|++| T Consensus 133 ~~~~~~---~~~~~~----~~~~~~~~~~~eViHir~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~pg~i 205 (340) T protein:vir:98 133 RGVDDS---VFWFVE----NFTQPHEFAPDTVFHLLEPDINQEIYGLPEYLSALNSAWLNESATLFRRKYYQNGAHAGYI 205 (340) T ss_pred EcccCc---EEEEEe----cCCeEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE Confidence 866443 222222 2456788999999999998887788999999999999999999999999999999999999 Q ss_pred cccCc-cCCHHHHHHHHHHHHHHhcCccccCCeeec-----CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHH Q lcl|NC_021305. 
221 LRHEK-RLSEAAQQRLREQFDRAHSGSSNTGKTMVV-----EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPI 294 (518) Q Consensus 221 l~~~~-~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~ 294 (518) |.+++ .+++++.++++++|++ .+|..|.++++|+ ++|++|++++.++.|+||.+++++++.+||++|||||.+ T Consensus 206 l~~~~~~ls~e~~~~lk~~~~~-~~G~~n~~~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~l 284 (340) T protein:vir:98 206 MYVTDPAQSATDVESLRDAMRN-SKGLGNFKNLFFYSPNGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQL 284 (340) T ss_pred EEecCCCCCHHHHHHHHHHHHH-hcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHH Confidence 98764 7999999999999987 4788899999988 578999999999999999999999999999999999999 Q ss_pred hccccc--cccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcC Q lcl|NC_021305. 295 VHILDR--ATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPD 354 (518) Q Consensus 295 lg~~~~--~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d 354 (518) +|+.++ ++++|.+++.+.|+++||.|+++.||+ +|.+|..+ .++|+...+++.| T Consensus 285 lGi~~~~t~~~sn~e~~~~~f~~~~l~Pl~~~iee-~n~~L~~e-----~~rF~~~~l~~~d 340 (340) T protein:vir:98 285 MGGKPENIGSLGDVEKVAKVFVRNELSPLQDRFRE-VNDWLGME-----VIRFKEYTLDNPE 340 (340) T ss_pred hcccCCCCCccccHHHHHHHHHHHHHHHHHHHHHH-HHhccccc-----ccccCccccccCC Confidence 999764 458999999999999999999999984 88887543 3688888899988 No 97 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=100.00 E-value=9.8e-56 Score=322.16 Aligned_cols=311 Identities=15% Similarity=0.179 Sum_probs=239.7 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) -.|++|++-|.-..+...+.+...+ -+.+...++.. 
...++++..++.+.+||....+.++. T Consensus 32 ~~~~~~~p~~v~~~~~~~~y~~~~~-~~~~~~pp~~~--~~la~~~~~~~~h~~~l~~k~n~l~~--------------- 93 (350) T protein:vir:11 32 EAFTFGDPMPVLDGRGILDYLECWP-NGRWYEPPLSM--EGLAKSVGSSVYLQSGLKFKRNMLAK--------------- 93 (350) T ss_pred EEEEeCCceeecCcchhhHHHHHhh-cCccccCCCCH--HHHHHHHhhhhhhccchhhhhhhhhh--------------- Confidence 2344443322222222222221111 11122222211 11234444455555555444332221 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~ 160 (518) .-+||+.||..+|++ ++.+++++||+|++++|+..|++++|+|++|.+|++..+.+. +|.+. . T Consensus 94 ---------~~~Pn~~~t~~~f~~-~v~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~vr~~~~~~~---~~~~~----~ 156 (350) T protein:vir:11 94 ---------TFIPHRLLSRATFEQ-FSLDWLTFGSAYLEQPRSRLGTRMPLQAPLAKYMRRGTDLET---FYQVR----S 156 (350) T ss_pred ---------cccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCCEEEEEEeCCceeEeeecCCe---EEEEe----e Confidence 237999999999986 678999999999999999999999999999999998775542 22222 3 Q ss_pred CceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCc-cCCHHHHHHHHHHH Q lcl|NC_021305. 
161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK-RLSEAAQQRLREQF 239 (518) Q Consensus 161 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~~~~~~~~~~~~ 239 (518) ++..+.|++++|||++.+++.+.+||+||+.+++.++....++..|+.++|+||++|++||++++ .+++++.+++++.| T Consensus 157 ~~~~~~~~~~eVihir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~~~gil~~~~~~ls~e~~~~l~~~~ 236 (350) T protein:vir:11 157 WKDEHEFEKGSVIQLREADINQEIYGVPEWFCALQSALLNESATLFRRKYYNNGSHAGFILYMTDAAQNEEDIDALRTAL 236 (350) T ss_pred CCeEEEECcccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHH Confidence 46678999999999999998888999999999999999999999999999999999999999864 79999999999999 Q ss_pred HHHhcCccccCCeeec-----CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc--cccCCHHHHHHH Q lcl|NC_021305. 240 DRAHSGSSNTGKTMVV-----EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDR--ATFSNISAQMRA 312 (518) Q Consensus 240 ~~~~~g~~n~g~~~vl-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~~ 312 (518) ++. .|..|+|+++|+ ++|+++++++.++.|+||++.+++++++||++|||||.++|+.++ ++++|+|++.+. T Consensus 237 ~~~-~G~~N~~~~~v~~~~g~~~g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~ 315 (350) T protein:vir:11 237 KTA-KGPGNFRNLFVYAPNGKKEGIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVYPQLMGVVPQNAGGFGSISDAAAV 315 (350) T ss_pred HHh-cCccccCceeeecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcCCHHHHHHH Confidence 885 677899999988 468999999999999999999999999999999999999999765 568999999999 Q ss_pred HHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhh Q lcl|NC_021305. 313 FYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDV 350 (518) Q Consensus 313 ~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l 350 (518) |++++|.|+++.|+ ++|.+|..+.. 
.+.+|++.+| T Consensus 316 f~~~~L~P~~~~ie-~ln~~l~~~~~--~F~~~~~~~l 350 (350) T protein:vir:11 316 WASLELAPMQTRLQ-QVNEMIGEEVV--RFAQFDAPGL 350 (350) T ss_pred HHHHHHHHHHHHHH-HHHhhcCcccc--ccCcccccCC Confidence 99999999999998 58888865432 3456787777 No 98 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=100.00 E-value=2.2e-55 Score=320.28 Aligned_cols=316 Identities=10% Similarity=0.124 Sum_probs=240.6 Q ss_pred CcCCCCCCCCcccccccchhhhhhh-cccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSY-YYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) -.|+++++.+. .+.++....+ ..+.+...++. ....+.++..++.+.+||...++ T Consensus 21 ~~f~~~~~~~~----~~~~y~~~~~~~~~~~~epp~~--~~~la~l~~~~~~h~~~i~~k~n------------------ 76 (345) T protein:vir:37 21 RTFSLNEISAS----PALDYVGIGFDENYNCYLPPVN--RHALAKLPHQNAQHGGILHSRAN------------------ 76 (345) T ss_pred EEeecCCcccc----cchhhhhhhhcCCccccCCCCC--HHHHHHHhhcccccccceeeech------------------ Confidence 23444432222 1111111111 00111112211 11222333344444444422222 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeecccc Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAG 159 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~ 159 (518) .+ ...-+||+.||+++|++ ++.+++++||+|++++|+..|++++|+|++|..|++..+.+.......+ ... 
T Consensus 77 -----~l-~~~~~Pn~~lt~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~d~~~~~~~~~~--~~~ 147 (345) T protein:vir:37 77 -----MV-SSLYEGGKALSRMDMRA-LCLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVRKDGGYSYLMKKS--LYD 147 (345) T ss_pred -----HH-HhhccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCcEEEEEEEcCceeEEEEeCCeeEEEEEe--Eec Confidence 11 23347999999999985 5679999999999999999999999999999999988776554332222 223 Q ss_pred cCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHH Q lcl|NC_021305. 160 VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQ 238 (518) Q Consensus 160 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~ 238 (518) .++..++|++++|||++.+++.+..||+|++..++.++....++++|+.++|+||++|++||.++ ..+++++.++++++ T Consensus 148 ~~g~~~~~~~~dVihir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NG~~p~~Il~~~d~~l~~e~~~~lk~~ 227 (345) T protein:vir:37 148 TAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDPDLTEEMEEEIARK 227 (345) T ss_pred CCceEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEecCCCCCHHHHHHHHHH Confidence 45677899999999999999888889999999999999999999999999999999999999875 57999999999999 Q ss_pred HHHHhcCccccCCeeec-----CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc--cccCCHHHHHH Q lcl|NC_021305. 239 FDRAHSGSSNTGKTMVV-----EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDR--ATFSNISAQMR 311 (518) Q Consensus 239 ~~~~~~g~~n~g~~~vl-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~ 311 (518) |++. 
.|..|.++++|+ ++|++|++++.++.|+||.+.+++++++||++|||||.++|+.++ ++++|+|++.+ T Consensus 228 ~~~~-~g~~n~~~~~i~~p~g~~~G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~~~~~~~~e~~~~ 306 (345) T protein:vir:37 228 ISES-KGVGNFRSMFVNIANGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYRE 306 (345) T ss_pred HHHh-cCcccccceEEEcCCCcccceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCccCCCCCCcccHHHHHH Confidence 9885 677888888887 579999999999999999999999999999999999999998654 56899999999 Q ss_pred HHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhh Q lcl|NC_021305. 312 AFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQ 352 (518) Q Consensus 312 ~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~ 352 (518) .|++++|.|+++.|++++|+.+ +......++|+..++.+ T Consensus 307 ~f~~~~l~P~~~~ie~~ln~~~--~~~~~~~i~F~~~~L~~ 345 (345) T protein:vir:37 307 VYHYDEVMPLQEIIAETINQDP--EIKNLLKIKFREQNFAK 345 (345) T ss_pred HHHHHHHHHHHHHHHHHhhhhc--cCCCcceEEecchhhcC Confidence 9999999999999999999743 34556678898777766 No 99 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=100.00 E-value=3e-55 Score=319.53 Aligned_cols=312 Identities=13% Similarity=0.162 Sum_probs=231.4 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) -.|++|.+-|.-..+.+.+.+...+. +.+...++. 
....+.++..++.+.+||...++.+++ T Consensus 24 ~~~~~~~p~~v~~~~~~~~~~~~~~~-~~~~~pp~~--~~~la~~~~a~~~h~s~i~~k~n~l~~--------------- 85 (344) T protein:vir:56 24 EAFTFGEPVPVLDRRDILDYVECISN-GRWYEPPVS--FTGLAKSLRAAVHHSSPIYVKRNILAS--------------- 85 (344) T ss_pred EEEEcCCceeecCcchhhhHHHhhhc-CccccCCCC--HHHHHHHHhhhhhhCccceehhhhHHh--------------- Confidence 22222222221111111111111111 111111111 111223333444444444443332222 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~ 160 (518) .-+||+.||+.+| +.++.+++++||+|++++|+..|++++|+|+++.+|++..+.+. +|.+ .. T Consensus 86 ---------~~~Pnp~~t~~~f-~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~~~~~---~~~~----~~ 148 (344) T protein:vir:56 86 ---------TFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEEDV---YWWV----PS 148 (344) T ss_pred ---------hcCCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEEeecCCE---EEEE----ec Confidence 3479999999999 67789999999999999999999999999999999998776543 1222 13 Q ss_pred CceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCc-cCCHHHHHHHHHHH Q lcl|NC_021305. 
161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK-RLSEAAQQRLREQF 239 (518) Q Consensus 161 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~~~~~~~~~~~~ 239 (518) ++..+.|++++|||++.+++.+.+||+||+..++.++....++++|+.++|+||++|++||++++ .+++++.++++++| T Consensus 149 ~g~~~~~~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~d~~ls~e~~~~lk~~~ 228 (344) T protein:vir:56 149 FNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLRENM 228 (344) T ss_pred CCeEEEEcCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHH Confidence 46778899999999999998888899999999999999999999999999999999999998764 79999999999999 Q ss_pred HHHhcCccccCCeeec------CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc--cccCCHHHHHH Q lcl|NC_021305. 240 DRAHSGSSNTGKTMVV------EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDR--ATFSNISAQMR 311 (518) Q Consensus 240 ~~~~~g~~n~g~~~vl------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~ 311 (518) ++.. | .++|+.++| ++|+++++++.++.|+||++++++++++||++|||||.++|+.++ ++++|.+++.+ T Consensus 229 ~~~~-g-~~~~r~l~l~~p~g~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~~~~n~eq~~~ 306 (344) T protein:vir:56 229 VKSK-G-RNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAK 306 (344) T ss_pred HHhc-C-CCCccceEEecCCCCccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHH Confidence 9875 4 367888888 479999999999999999999999999999999999999998765 45899999999 Q ss_pred HHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCH Q lcl|NC_021305. 312 AFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDW 355 (518) Q Consensus 312 ~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~ 355 (518) .|++++|.|+++.|| ++|.+|..+. 
++|+--.+...|- T Consensus 307 ~f~~~tL~Pl~~~ie-~~n~~l~~~~-----~~F~~y~l~~~~~ 344 (344) T protein:vir:56 307 VFVRNELIPLQDRIR-EINGWIGQEV-----IRFKNYSLDTDNG 344 (344) T ss_pred HHHHHHHHHHHHHHH-HHHhhhcccc-----ccCCCccccccCC Confidence 999999999999998 4888886443 3343333333333 No 100 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=100.00 E-value=5.1e-55 Score=318.23 Aligned_cols=312 Identities=13% Similarity=0.147 Sum_probs=232.3 Q ss_pred cCCCCCCCCcccccc-c-ch--hhhhhhcccc-------------------cccccccccchhhhHHHhhcHHHHHHHHH Q lcl|NC_021305. 2 LLANGQTLSAPAMAE-L-SP--QMQDSYYYAP-------------------AVGMQLERQFSLYGGIYKNQPWVRTVIAK 58 (518) Q Consensus 2 ~f~~~~~~~~~~~~~-~-~~--~~~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ 58 (518) |+.+++.+..|.... . .. ..-.+||... +...+... ...+.++..++.+.+||.. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~--~~la~~~~a~~~h~~~i~~ 78 (344) T protein:vir:60 1 MSKKKGKTLQPAAKKMTASAPKMEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPISF--TGLAKSLRAAVHHSSPIYV 78 (344) T ss_pred CCcccCCCCCchHHhhcCCcCcEEEEEcCCceeecCCcchhHHHHhhhcCccccCCCCH--HHHHHHHHhhhhhccchhh Confidence 555544332221110 0 00 0111222111 11111110 1112223333333333333 Q ss_pred HHHhhccCceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCce Q lcl|NC_021305. 
59 RAQALARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSR 138 (518) Q Consensus 59 ia~~ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~ 138 (518) .++.++ .+-+||+.||+.+| +.++.+++++||+|++++|+..|++++|+|++|.+ T Consensus 79 k~n~l~------------------------~~~~Pn~~~t~~~f-~~~~~d~ll~Gnay~~i~rn~~G~~~~L~~l~~~~ 133 (344) T protein:vir:60 79 KRNILA------------------------STFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKY 133 (344) T ss_pred hhhHHH------------------------hhccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCcce Confidence 332221 13479999999999 57889999999999999999999999999999999 Q ss_pred eEEEEcCCceeeEEeeecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcc Q lcl|NC_021305. 139 VAIKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPN 218 (518) Q Consensus 139 v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~ 218 (518) |++..+.+. +|.+ ..++..+.|++++|||++.+++.+.+||+||+..+..++....+++.|+.++|+||++|+ T Consensus 134 vr~~~~~~~---~~~v----~~~~~~~~~~~~eIiHir~~~~~~~~yGlsp~~~a~~si~l~~~a~~~~~~~f~NG~~pg 206 (344) T protein:vir:60 134 TRRGVEEDV---YWWV----PSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAG 206 (344) T ss_pred EEEeecCCe---EEEE----ccCCeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCc Confidence 998876543 2222 234667889999999999999888889999999999999999999999999999999999 Q ss_pred cccccC-ccCCHHHHHHHHHHHHHHhcCccccCCeeec------CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021305. 219 LVLRHE-KRLSEAAQQRLREQFDRAHSGSSNTGKTMVV------EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIA 291 (518) Q Consensus 219 ~il~~~-~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP 291 (518) +||+++ ..+++++.++++++|++.+ |. 
++++.++| ++|++|++++.++.++||++++++++++||++|||| T Consensus 207 ~il~~~~~~ls~e~~~~ik~~~~~~~-g~-~~~r~~~l~~p~g~~~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VP 284 (344) T protein:vir:60 207 YIMYVTDAVQDRNDIEMLRENMVKSK-GR-NNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIP 284 (344) T ss_pred eEEEecCcCCCHHHHHHHHHHHHHhc-CC-CCCcceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCC Confidence 999876 4799999999999999876 43 56777776 479999999999999999999999999999999999 Q ss_pred HHHhcccccc--ccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCH Q lcl|NC_021305. 292 PPIVHILDRA--TFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDW 355 (518) Q Consensus 292 p~~lg~~~~~--~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~ 355 (518) |.++|+.++. +++|.+++.+.|++++|.|+++.|| +||.+|..+ .++|+.-.+...|. T Consensus 285 p~llGi~~~~t~~~~n~e~~~~~f~~~~L~Pl~~~~e-~ln~~lg~~-----~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 285 FQLMGGKPENVGSLGDIEKVAKVFVRNELIPLQDRIR-EINGWLGQE-----VIRFKNYSLDTDNG 344 (344) T ss_pred HHHhcccCCCCCccccHHHHHHHHHHHHHHHHHHHHH-HHHHhcCCc-----ccccCccccCCCCC Confidence 9999987654 5899999999999999999999998 588888532 24555555555554 No 101 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=100.00 E-value=8.4e-55 Score=317.06 Aligned_cols=316 Identities=9% Similarity=0.119 Sum_probs=245.7 Q ss_pred CcCCCCCCCCcccccccchhhhhhh-cccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSY-YYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) --|+.+.+.+ +.+.......+ ..+.+...++. 
....+.++..++.+.+||...++.+++ T Consensus 21 ~~~~~~~~~~----~~~~~y~~~~~~~~~~~~epp~~--~~~la~~~~~~~~h~~~i~~k~n~l~~-------------- 80 (345) T protein:vir:37 21 RTFSLSEITA----SPALDYVGIGFDENYNCYLPPVN--RHALAKLPHQNAQHGGILHSRANMVSA-------------- 80 (345) T ss_pred EEeecCCccc----chhhcccceeeecCCccccCCCC--HHHHHHHhhcchhhcchhhhhhhHHhh-------------- Confidence 2333333221 11111111000 01111222221 223345566777777777766664432 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeecccc Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAG 159 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~ 159 (518) .-+||+.||+.+|++ ++.+++++||+|++++|+..|++++|+|++|.+|++..+.+.......+ ... T Consensus 81 ----------~~~Pn~~~t~~~f~~-~v~d~ll~Gnay~~i~rn~~G~~~~L~pl~~~~vr~~~d~~~~~~~~~~--~~~ 147 (345) T protein:vir:37 81 ----------TYEGGKALSKMEMRA-LCLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVHKDGGYSYLMKKS--LYD 147 (345) T ss_pred ----------ccCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCEEEEEEecCceeEEeecCCeeEEEeee--eec Confidence 237999999999975 5679999999999999999999999999999999987765443322222 222 Q ss_pred cCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHH Q lcl|NC_021305. 
160 VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQ 238 (518) Q Consensus 160 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~ 238 (518) ..+...+|++++||||+.+++.+..||+|++..++.++....++++|+.++|+||++|++||.++ ..+++++.++++++ T Consensus 148 ~~g~~~~~~~~eViHir~~~~~~~~~Gl~~~~~a~~si~l~~~a~~~~~~~f~NGa~~~~Il~~t~~~l~~e~~~~lk~~ 227 (345) T protein:vir:37 148 TAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDPDLTEEMEEEIARK 227 (345) T ss_pred cCceEEEEccccEEEEcCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHH Confidence 34677899999999999999888889999999999999999999999999999999999999865 47999999999999 Q ss_pred HHHHhcCccccCCeeec-----CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc--cccCCHHHHHH Q lcl|NC_021305. 239 FDRAHSGSSNTGKTMVV-----EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDR--ATFSNISAQMR 311 (518) Q Consensus 239 ~~~~~~g~~n~g~~~vl-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~ 311 (518) |++.+.+ .|.+.++++ ++|+++++++.++.++||.+++++++++||++|||||.++|+.++ ++++|+|++.+ T Consensus 228 ~~~~~g~-~n~~~~~i~~~~g~~~G~~~~pl~~~~~d~qf~e~k~~~~~dI~~a~~VPp~liGi~~~~t~~~s~~e~~~~ 306 (345) T protein:vir:37 228 ISESKGV-GNFRSMFVNIAGGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYRE 306 (345) T ss_pred HHHhcCc-cccCceeEecCCCCccceeEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhccccCCCCCcccHHHHHH Confidence 9998744 565566555 568999999999999999999999999999999999999998764 56899999999 Q ss_pred HHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhh Q lcl|NC_021305. 312 AFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQ 352 (518) Q Consensus 312 ~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~ 352 (518) .|+++||.|++++|++++|+. 
.+....++++|+..+|++ T Consensus 307 ~f~~~~l~P~~~~ie~~ln~~--~e~~~~~~i~F~~~~l~k 345 (345) T protein:vir:37 307 VYHYDEVMPLQEIIAETINQD--PEIKNLLKIKFREQNFAK 345 (345) T ss_pred HHHHHHHHHHHHHHHHHhhhh--hccCCcceEEECchhhcC Confidence 999999999999999999974 344567889999999988 No 102 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=100.00 E-value=1.1e-54 Score=316.41 Aligned_cols=312 Identities=13% Similarity=0.154 Sum_probs=233.4 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) -.|++|.+.|.-..+.+.+.+...+.+ .+...++. ....+.++..++.+.+||...++.+++ T Consensus 24 ~~~~f~~p~~v~~~~~~~~~~~~~~~~-~~~~pp~~--~~~la~~~~a~~~h~~~i~~k~n~l~~--------------- 85 (344) T protein:vir:20 24 EAFTFGEPVPVLDRRDILDYVECISNG-RWYEPPVS--FTGLAKSLRAAVHHSSPIYVKRNILAS--------------- 85 (344) T ss_pred EEEEcCCceEecCcchhhhhhhhhhcC-ceecCCCC--HHHHHHHHhhhhhhCccceehhhhHHH--------------- Confidence 123333322222222222222111111 11111211 111223333444444444333332222 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGV 160 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~ 160 (518) .-+||+.||+.+| +.++.+++++||+|++++|+..|++++|+|+++.+|++..+.+. +|.+ .. 
T Consensus 86 ---------~~~Pn~~lt~~~f-~~~~~d~ll~Gnay~~i~rn~~G~~~~L~pl~~~~vr~~~~~~~---~~~~----~~ 148 (344) T protein:vir:20 86 ---------TFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEEDV---YWWV----PS 148 (344) T ss_pred ---------hccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCCceeEeeecCCE---EEEE----cc Confidence 2379999999999 57889999999999999999999999999999999998776543 1221 23 Q ss_pred CceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHH Q lcl|NC_021305. 161 GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQF 239 (518) Q Consensus 161 ~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~ 239 (518) ++..+.|++++|||++.+++.+..||+||+..+..++....+++.|+.++|+||++|++||+++ ..+++++.++++++| T Consensus 149 ~~~~~~~~~~eIiHir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~p~~Il~~~d~~l~~e~~~~ik~~~ 228 (344) T protein:vir:20 149 FNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLRENM 228 (344) T ss_pred CCeEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHH Confidence 4667899999999999999888889999999999999999999999999999999999999875 579999999999999 Q ss_pred HHHhcCccccCCeeec------CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc--cccCCHHHHHH Q lcl|NC_021305. 240 DRAHSGSSNTGKTMVV------EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDR--ATFSNISAQMR 311 (518) Q Consensus 240 ~~~~~g~~n~g~~~vl------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~ 311 (518) ++.. 
| .++|+.++| ++|++|++++.++.++||.+++++++++||++|||||.++|+.++ ++++|++++.+ T Consensus 229 ~~~~-g-~~n~r~l~l~~p~g~~~gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~ 306 (344) T protein:vir:20 229 VKSK-G-RNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAK 306 (344) T ss_pred HHhc-C-CCCccceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHH Confidence 9875 3 356777776 469999999999999999999999999999999999999998765 45899999999 Q ss_pred HHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCH Q lcl|NC_021305. 312 AFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDW 355 (518) Q Consensus 312 ~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~ 355 (518) .|++++|.|+++.|+ ++|.+|..+ .++|+...+...|. T Consensus 307 ~f~~~~l~P~~~~~e-~in~~lg~~-----~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 307 VFVRNELIPLQDRIR-EINGWLGQE-----VIRFKNYSLDTDND 344 (344) T ss_pred HHHHHHHHHHHHHHH-HHHHhcCCc-----ccccCccccccCCC Confidence 999999999999998 588877532 34566556655555 No 103 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=100.00 E-value=3.6e-53 Score=308.07 Aligned_cols=321 Identities=13% Similarity=0.160 Sum_probs=237.5 Q ss_pred cCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhcc---CceEEEEecCCcc Q lcl|NC_021305. 2 LLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALAR---LPVKCMFTSGDTE 78 (518) Q Consensus 2 ~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~---l~~~v~~~~~~~~ 78 (518) |=.++..+ .+....++..-.+||.... + +...++..|+.+.-+.++. 
-|+....- ..- T Consensus 1 m~~~~~~~--~~~~~~~~~~~~~~~~p~~----~-----------~~~~~~~~~~~~~~~~~~~~~~pP~~~~~L--a~l 61 (337) T protein:vir:78 1 MTKRQQQP--AQAAASSPRPSVVFSMPEA----I-----------DPTAWMTDYTGVFYNPYGEYYQPPIDRKGL--AKV 61 (337) T ss_pred CCCcccCc--ccccccCceeEEEecCccc----c-----------cCcchhHhhhhhhhccCcceecCCCCHHHH--HHH Confidence 22222211 1222222222223332211 1 1122233444444333322 22211000 000 Q ss_pred eeccchHHHHHHhcCCcCCCHH----HHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEee Q lcl|NC_021305. 79 TEESDTGYAKLLADPCEYLDPF----AFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYF 154 (518) Q Consensus 79 ~~~~~~~~~~L~~~PN~~~s~~----~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~ 154 (518) .....+....|..+||+.++++ ++++.++.|++++||+|++++|+..|++++|+|++|.+|++..+.. .+|.. T Consensus 62 ~~~~~~h~~~L~~k~N~~~~~f~~~~~~~~~~~~d~ll~GNay~~~~rn~~G~~~~L~pl~~~~v~~~~d~~---~~~~~ 138 (337) T protein:vir:78 62 ARANAHHGAILMARRNMVAGRFTNQRATITAFVHNYLQFGDGGLLKLRNSFGQVVGLHPLSSVYLRRREDGC---FVYLQ 138 (337) T ss_pred hhcchhhhhHHHhhhccccccCcCcHHHHHHHHHHHHhhCCeEEEEEECCCCcEEEEEEeCCceeEeeeCCe---EEEEE Confidence 0011112335777999876654 6889999999999999999999999999999999999998876432 22221 Q ss_pred ecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCc-cCCHHHHH Q lcl|NC_021305. 
155 QAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK-RLSEAAQQ 233 (518) Q Consensus 155 ~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~~~~~~ 233 (518) .++..+.|++++|||++.+++.+..||+|++..++.++....++++++.++|+||++|++||..++ .+++++.+ T Consensus 139 -----~~~~~~~~~~~eIiHik~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~~~l~~e~~~ 213 (337) T protein:vir:78 139 -----QGKPNLIYRPDDVIWLAQYDPEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLNGAHMGFIFYATDPNMDDDTEE 213 (337) T ss_pred -----cCCceEEECCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHH Confidence 235567899999999999998888899999999999999999999999999999999999998765 79999999 Q ss_pred HHHHHHHHHhcCccccCCeeec-----CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc---ccccCC Q lcl|NC_021305. 234 RLREQFDRAHSGSSNTGKTMVV-----EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILD---RATFSN 305 (518) Q Consensus 234 ~~~~~~~~~~~g~~n~g~~~vl-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~---~~~~sn 305 (518) ++++.|++ ..|..|.++++|+ ++|++|++++.++.|+||++++++++++||++|||||.++|+.. .++++| T Consensus 214 ~lk~~~~~-~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~n 292 (337) T protein:vir:78 214 EMKEMIAN-SKGVGNFRSMFVNIPDGKPDGIKLIPVGDIATKDEFAAIKGITAQDVLTAHRYPPALAGIIPTNGGGGLGD 292 (337) T ss_pred HHHHHHHH-hcCcccccceEEEcCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcccccCCCcCcccc Confidence 99999986 5777888898887 67899999999999999999999999999999999999999764 356889 Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhh Q lcl|NC_021305. 306 ISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVI 351 (518) Q Consensus 306 ~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~ 351 (518) +|++.+.|+++||.|+++.||+++|.++++... 
...++++...++ T Consensus 293 ~e~~~~~f~~~~L~P~~~~ie~~~n~~ll~~~~-~~~f~~~~~~~~ 337 (337) T protein:vir:78 293 PEKYDATYARNEVLPLCELVQDAINSAGLPRAL-WVTFRETIGAAV 337 (337) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhcCChhh-ceeccccccccC Confidence 999999999999999999999999998876432 234566666666 No 104 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=100.00 E-value=5.8e-50 Score=290.52 Aligned_cols=248 Identities=20% Similarity=0.239 Sum_probs=197.2 Q ss_pred CcCCCCCCC--CcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 1 MLLANGQTL--SAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) |=|.++... ..+......+++.... +........++.+.|+++++|++||++||++||++||++++++ . T Consensus 1 MglF~~~~~r~~~~~~~~~~~~~~~~~------~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~---~ 71 (251) T protein:vir:46 1 MGIFYKNEKRDLQYNEDDLQMMVQTLP------SFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNG---Q 71 (251) T ss_pred CCccccccccccCCCccchhhhhhhhc------cccCcCcceechhhhhccHHHHHHHHHHHHhHhhCceEEeeCc---c Confidence 533322211 1111111222221111 1112223456778899999999999999999999999999754 2 Q ss_pred eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeeccc Q lcl|NC_021305. 79 TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGA 158 (518) Q Consensus 79 ~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~ 158 (518) ....|+.+.+|+.+||+.||+++||+.++.+++++||+|++++|+..|++++|+||+|++|++..+.++...++...... 
T Consensus 72 ~~~~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~ 151 (251) T protein:vir:46 72 INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRIDS 151 (251) T ss_pred ccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCceEEEEECCCCcEEEEEEEecc Confidence 34456777788899999999999999999999999999999999999999999999999999999887666554444444 Q ss_pred ccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCC-HHHHHHHHH Q lcl|NC_021305. 159 GVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLS-EAAQQRLRE 237 (518) Q Consensus 159 ~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~-~~~~~~~~~ 237 (518) ..++..+.|+++|||||++++.++ .+|+||+.++..+|....+++++..++|+||++|+|+|++++.++ +++.+++++ T Consensus 152 ~~~g~~~~~~~~diiH~r~~~~dg-~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~~e~~~~~~~ 230 (251) T protein:vir:46 152 NGNNIERNVKFEDMLDIKFYSLDG-INGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRARE 230 (251) T ss_pred CCcceeEEECCccEEEecCcCCCC-eeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCCCCHHHHHHHHH Confidence 456777899999999999988776 689999999999999999999999999999999999999998874 566899999 Q ss_pred HHHHHhcCccccCCeeecCCCcce Q lcl|NC_021305. 238 QFDRAHSGSSNTGKTMVVEEGMEP 261 (518) Q Consensus 238 ~~~~~~~g~~n~g~~~vl~~g~~~ 261 (518) .|++.++|.+|+|++++ ||+- T Consensus 231 ~~~~~~~g~~n~g~~~~---gm~~ 251 (251) T protein:vir:46 231 EFPKVLVELNKLGKLSY---SMNQ 251 (251) T ss_pred HHHHHhcCccccccccc---ccCC Confidence 99999999999998776 3332 No 105 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=100.00 E-value=8.5e-44 Score=256.72 Aligned_cols=210 Identities=13% Similarity=0.241 Sum_probs=171.9 Q ss_pred eEEEEcCCceeeEEeeeccc-ccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCc Q lcl|NC_021305. 
139 VAIKRNSRTGRYEYYFQAGA-GVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRP 217 (518) Q Consensus 139 v~v~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p 217 (518) |++..+ +.++|.+.... ..++..++|.++||||||.+++.+.++|+||+..++.++....++++|+.+||+||++| T Consensus 1 ~r~~~d---g~~~y~~~~~~~~~~g~~~~~~~~eilH~r~~~~~~~~~Glspi~~a~~~i~~~~aa~~~~~~~f~Ng~~p 77 (219) T protein:vir:98 1 MRVCKD---GNYKYLMKKSLYDTKSEIYEYNKNDVIFIKLYDPMQQVYGSPDYVGGITSALLNSDATIFRRRYYSNGAHM 77 (219) T ss_pred Cceeec---CeEEEEEecceecCCceeEEeccccEEEecCCCCCCCcceecHHHHHHHHHHHHHHHHHHHHHHHhcCCCC Confidence 333332 22333332222 23456788999999999999987778999999999999999999999999999999999 Q ss_pred ccccccCc-cCCHHHHHHHHHHHHHHhcCccccCCeeec-----CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021305. 218 NLVLRHEK-RLSEAAQQRLREQFDRAHSGSSNTGKTMVV-----EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIA 291 (518) Q Consensus 218 ~~il~~~~-~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl-----~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP 291 (518) +|||++++ .+++++.+++++.|++. .|..|+++++|+ ++|++|++++.+++|+||+|++++++++||++|||| T Consensus 78 ~gil~~~~~~l~~e~~~~~~~~~~~~-~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~qfle~rk~~~~eIa~~fgVP 156 (219) T protein:vir:98 78 GFILYSTDPDMTEEMEDEIAERIRDS-KGVGNFRSMFVNIAGGHPDGLKVIPIGDTGQKDEFANIKNISAQDVLTSHRFP 156 (219) T ss_pred ceEEEeCCCCCCHHHHHHHHHHHHHh-cCcccccceeEecCCCCccceeEEEccCCHHHHHHHHHHHhhHHHHHHHhCCC Confidence 99998765 79999999999999885 566777777666 578999999999999999999999999999999999 Q ss_pred HHHhcccc--ccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcC Q lcl|NC_021305. 292 PPIVHILD--RATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPD 354 (518) Q Consensus 292 p~~lg~~~--~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d 354 (518) |++||+.+ +++++|+|++.+.|+++||.||+++||++||++++.+. 
...++|+-....-.+ T Consensus 157 p~~lG~~~~~~~~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~~~~~~--~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 157 PGLSGIIPVNTAGLGDPLKIREAYQADEVLPLQEIIAESINSDYEIKS--ALKVNFKQPEKRDKN 219 (219) T ss_pred HHHcccccCCCCCccCHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCC--ccEEeecCcccccCC Confidence 99999864 46799999999999999999999999999998765432 234566543332222 No 106 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.95 E-value=9.2e-28 Score=168.80 Aligned_cols=396 Identities=11% Similarity=0.093 Sum_probs=226.3 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) |=|.-+=... .. .+. .-++--... .+.+..........+|..++.++++|+.+|+.+-+.++.+...+. .. T Consensus 1 ~~~~D~~~~~--~~-~~g-~~~~~~~~~--~~~~~~~~~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~d~---~~ 71 (437) T protein:vir:52 1 MKFFDGIKSL--AL-KLG-SKQEQTYYS--PSLSLTDDLVQLEALWRDNWIANKVCIKRPEDMVRNWREIYSNDL---NS 71 (437) T ss_pred CchhhhhHhH--Hh-cCC-Cccccceee--cCccccccHHHHHHHHHhCchhhHHhhcchHHhhcCCceEecCCC---CH Confidence 2211110000 00 000 000000000 112222233445567999999999999999999999999853211 11 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcC---------CCceEEEEeeCCceeEEEEcCC----- Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK---------SGTPEKLMPMHPSRVAIKRNSR----- 146 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~---------~G~~~~l~~l~p~~v~v~~~~~----- 146 (518) ..-..+...+.+=+ ..+-+...+.+.-++|.++++++.+. .|.+..+.++++..+++..... 
T Consensus 72 ~~~~~~~~~~~~l~----~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~~dp~s 147 (437) T protein:vir:52 72 KQLDLFTKFERSLK----LRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKDDDVLS 147 (437) T ss_pred HHHHHHHHHHHhhc----HHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhccccccccccccc Confidence 11111222222222 23334444555558999999998865 3678899999998887432221 Q ss_pred ---ceeeEEeeecccccCceeEEeccccEEEEecc---CCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccc Q lcl|NC_021305. 147 ---TGRYEYYFQAGAGVGTQLVSFADDEVVPIRFF---NPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLV 220 (518) Q Consensus 147 ---~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~---~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~i 220 (518) +.+..|.+. .++..+.++++.||||... .+.+..+|.|.++.+++.|.....+......++.+...+. T Consensus 148 ~~fg~p~~y~v~----~~~~~~~iH~SRii~~~~~~~~~~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~v-- 221 (437) T protein:vir:52 148 PNFGRYSEYSIL----GGSQSITVHHSRLIILNANDAPLSDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKIDI-- 221 (437) T ss_pred cccCcceEEEEe----cCCcceeEccceeEEecCccCCCccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCCc-- Confidence 223333332 2345568999999999743 2345567999999999999999999888888877654443 Q ss_pred cccCc---cCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc Q lcl|NC_021305. 221 LRHEK---RLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHI 297 (518) Q Consensus 221 l~~~~---~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~ 297 (518) +++++ .++....+.+.+.++..... .+.+++++++.+.+|+.++.++.+.. +...+...+||++++||.+.|.. 
T Consensus 222 ~k~~~l~~~l~~~~~~~~~~~~~~~~~~-~~~~~~~~~d~~~~~e~~~~~~sgl~--~~l~~~~~~iaaa~~iP~t~L~G 298 (437) T protein:vir:52 222 FKIAGLSDKIAAGMENEVASVISAVQEI-KSATNSLLLDAENEYDRKELTFTGLK--DLLTEFRNAVAGAADMPVTILFG 298 (437) T ss_pred eecchHHHHhcCCcHHHHHHHHHHHHHh-cCCCceEEEcCCcceEEEecCcCCHH--HHHHHHHHHHHHHhcCchhhhcC Confidence 33432 23322223333333332222 34577899999999999988877654 78888899999999999877655 Q ss_pred ccccccCCHHHHHHHHHH-------HHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHH-------HHHH Q lcl|NC_021305. 298 LDRATFSNISAQMRAFYR-------DTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKS-------ESTQ 363 (518) Q Consensus 298 ~~~~~~sn~e~~~~~~~~-------~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~-------~~~~ 363 (518) ...+..++.++..+.||. .-+.|+++.+-..|....+..... .+.|.+.+|...+.++++ +++. T Consensus 299 ~s~~Glasge~D~~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~~~~--~~~~~f~pL~~~s~kekae~~~~~a~a~~ 376 (437) T protein:vir:52 299 QSVSGLASGDEDIQNYHEAIRRLQETRLRPIFEIIDPLICNELFGGLPA--DWWFEFVPLTTVKQEQQINMLNTFATAAN 376 (437) T ss_pred cCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC--cceEEeCCcCCcCHHHHHHHHHHHHHHHH Confidence 545555777888888887 357777777766665544433222 355666688777766554 4577 Q ss_pred HHHhCCCcCHHHHHHHhC----CCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccc Q lcl|NC_021305. 364 KMVNSGVATPNEGREIMG----LPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPG 439 (518) Q Consensus 364 ~~~~~G~~T~NE~R~~~g----~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 439 (518) +++++|+++++|+|+++. ++.+++. |. ....+.....+..+.+.+...++.++ +. T Consensus 377 ~~~~~g~i~~~e~r~~L~~~g~~~~i~~~--~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~ 435 (437) T protein:vir:52 377 TLIQNGVLNEYQIANELRESGLFANISAE--HI----------EELKNADEFAGNFEEPEKMEGAQVQN---------SE 435 (437) T ss_pred HHHhcCCCCHHHHHHHHHhcCCCCCCCcc--cc----------ccccCCCCCCCccCCCCCCCCCCCCC---------CC Confidence 889999999999999873 2222211 10 00000000000000000000000000 00 Q ss_pred cc Q lcl|NC_021305. 
440 LS 441 (518) Q Consensus 440 ~~ 441 (518) .+ T Consensus 436 ~~ 437 (437) T protein:vir:52 436 DQ 437 (437) T ss_pred CC Confidence 00 No 107 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.94 E-value=9.4e-27 Score=163.26 Aligned_cols=419 Identities=10% Similarity=0.076 Sum_probs=223.5 Q ss_pred CcCC------------CCCCCCcccccc-cchh-hhhh-------hccccccc-------ccc--cccchhhhHHHhhcH Q lcl|NC_021305. 1 MLLA------------NGQTLSAPAMAE-LSPQ-MQDS-------YYYAPAVG-------MQL--ERQFSLYGGIYKNQP 50 (518) Q Consensus 1 ~~f~------------~~~~~~~~~~~~-~~~~-~~~~-------~~~~~~~~-------~~~--~~~~~~~~~~~~~~~ 50 (518) |.|. +.+.+.-+.+.. +.+. .+++ ++..-... +-. .-.......+|..++ T Consensus 35 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~ 114 (537) T protein:vir:10 35 KPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAFIGHQMCALIATHW 114 (537) T ss_pred hHHHHHHhhhhccCCCCCccCcccccccccccchhccccccchhhhhhhccccccchhhhhccccCCccHHHHHHHHhCc Confidence 1111 011111111101 1111 1111 11000000 000 001123346789999 Q ss_pred HHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEc-C----- Q lcl|NC_021305. 51 WVRTVIAKRAQALARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKN-K----- 124 (518) Q Consensus 51 ~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~-~----- 124 (518) .++.+|+++|+.+.+-++++.-.+++. .+......|....+...-+..|.+. +...-++|.+++++.-. . 
T Consensus 115 l~r~iVd~~A~d~~r~~~~i~~~~~~~---~~~~~~~~l~~~~~~l~~~~~l~~a-~~~~rlyG~~~i~i~v~~~D~~~~ 190 (537) T protein:vir:10 115 LVNKACSQMPRDAMRKGYKIISDDGNE---LDPKDAKFIDRYDRAFNIKKHAIQF-VRKGRIFGIRIALFKVDSPDPYYY 190 (537) T ss_pred hhhhhhhhhhHHhhcCCceeecCCccc---ccHHHHHHHHHHHHHhhHHHHHHHH-HHhcccccceEEEEeecCcCCccc Confidence 999999999999999999885433222 1222222333333333334445554 44444578888776532 2 Q ss_pred ----------CCceEEEEeeCCceeEEEEcC----C------ceeeEEeeecccccCceeEEeccccEEEEeccCC---- Q lcl|NC_021305. 125 ----------SGTPEKLMPMHPSRVAIKRNS----R------TGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNP---- 180 (518) Q Consensus 125 ----------~G~~~~l~~l~p~~v~v~~~~----~------~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~---- 180 (518) .|.+..|.+++|..+.+.... + +....|.+ .+ ..+.++.|+||..... T Consensus 191 ~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P~~y~v------~g--~~iH~SRli~f~g~~~p~~~ 262 (537) T protein:vir:10 191 EKPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEPTYWLI------NG--KKYHRSHLAIYINDEVVDFL 262 (537) T ss_pred ccccccccccccceeEEEEechhhcccccchhhhccCCccccCCceeeee------cC--eEecceeEEEecCCCCchhh Confidence 223567778888777653211 1 11122211 12 3678999999965432 Q ss_pred --CCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccC-CHHHHHHHHHHHHHHhcCccccCCeeecCC Q lcl|NC_021305. 181 --DGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRL-SEAAQQRLREQFDRAHSGSSNTGKTMVVEE 257 (518) Q Consensus 181 --~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~-~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~ 257 (518) ....+|.|.++.+++.|.....+.......+.........+..-..+ ++++ +.+.++...++.+|. ++++++. 
T Consensus 263 ~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~~~~---~~~r~~~~~~~r~n~-g~~~id~ 338 (537) T protein:vir:10 263 KPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLANKQQ---FDETMSWWTATRDNY-QVRVVDK 338 (537) T ss_pred hcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcCHHH---HHHHHHHHHhhcCCc-ceeEecC Confidence 22346999999999999998888888888777665554333222233 3333 333333333333344 4566665 Q ss_pred -CcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHH-HhccccccccCCHHHHHHHHHHHH------hhHHHHHHHHHH Q lcl|NC_021305. 258 -GMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPP-IVHILDRATFSNISAQMRAFYRDT------MAIPIARIQSAM 329 (518) Q Consensus 258 -g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~sn~e~~~~~~~~~~------l~P~~~~ie~~l 329 (518) +.+|+.+..+.... .+++....+.||.+.|||.. |+|....+..++.+.....|+..+ +.|.++.+.+.+ T Consensus 339 e~e~~e~~~~~lsgl--~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe~D~~~yyd~I~~~Qe~l~p~l~~l~~ll 416 (537) T protein:vir:10 339 DNEDVVQIDTTLNDL--DKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTGDYEEASYHEECESTQDDMRPLIDRHHQLV 416 (537) T ss_pred CCceeEEEeccCCCH--HHHHHHHHHHHHhhhCCCceeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 58898888777764 47888889999999999987 566655666677777777777543 788888887777 Q ss_pred HHhhhhhhcccccceecchhhhhcCHHHHHHH-------HHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccccccc Q lcl|NC_021305. 330 DKYVGQYWVRKNRMKFDIDDVIQPDWEAKSES-------TQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPL 402 (518) Q Consensus 330 ~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~-------~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~ 402 (518) .+..+.. ...+.|.+.+|...|.++++++ +.+++.+|++++||+|+.++.+|.. +-+-+. + +.. . 
T Consensus 417 ~~~~~~~---~~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~--g~~~l~-~-~~~-~ 488 (537) T protein:vir:10 417 CRSHLRK---RIRVKVEFPPMDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTL--GFTSIT-P-AMR-P 488 (537) T ss_pred HHhcCCC---CcceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCcc--cccccc-C-CCC-h Confidence 6655432 3457788889998888887764 7889999999999999999987642 222221 1 110 1 Q ss_pred ccccccCCCCCCCCCCCCCccCCCCCCCccccCC------ccccccchhc Q lcl|NC_021305. 403 GATPDGAVEWEEAPAPKRPASTPVASLDQSPPTS------VPGLSPTNSD 446 (518) Q Consensus 403 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~ 446 (518) +.......+.+..+.. .....+...+..+.... .....+.+.+ T Consensus 489 ed~e~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~ 537 (537) T protein:vir:10 489 TDAEDIDVDDEGKPVR-IIEDQPAPSEMFGATSSGESANDPRDSGAAFED 537 (537) T ss_pred hhhhcccCCccCCcCC-CCCCCCCccccCCCCccccccCCCccCccccCC Confidence 1111100001000000 00001111111111000 0000111111 No 108 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.93 E-value=3.1e-26 Score=160.44 Aligned_cols=439 Identities=12% Similarity=0.060 Sum_probs=230.4 Q ss_pred CcCCCCCCC-C---ccc-ccccchhhhhhhcccc--------cccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCc Q lcl|NC_021305. 1 MLLANGQTL-S---APA-MAELSPQMQDSYYYAP--------AVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLP 67 (518) Q Consensus 1 ~~f~~~~~~-~---~~~-~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~ 67 (518) +-..+.... + .|- +......+....|+.. ..+............+|..++.++.+|+.+|+.+-+-. 
T Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~r~~Vd~~aed~~r~~ 112 (532) T protein:vir:94 33 LGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEATSWPGFPTLALLAQLPEYRTMHETPADECVRAW 112 (532) T ss_pred hhhhhhhhhcccccccccccccccccccccccCcccccccccccccccccchHHHHHHHHcCchhhhhhccchHHHhhCC Confidence 111111000 0 000 0001111110001100 00001111122334678899999999999999999999 Q ss_pred eEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcC-------------------CCce Q lcl|NC_021305. 68 VKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK-------------------SGTP 128 (518) Q Consensus 68 ~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~-------------------~G~~ 128 (518) +++...+++.. .......|...-... ...+-+...+....++|.+++++.... .|.+ T Consensus 113 ~~i~~~~~~~~---~~~~~~~i~~~~~~l-~v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~~g~~ 188 (532) T protein:vir:94 113 GKITCSSKDEL---AADKATRITQKLEQY-NVRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLLLSPSFVQRGCL 188 (532) T ss_pred ceEeeCCcccc---chHHHHHHHHHHHhh-hHHHHHHHHHHhhhcccceEEEEEeccCCcccccccccccccccccccee Confidence 99864333221 112222222111111 223344445555668898888765422 2335 Q ss_pred EEEEeeCCceeEEEEcCCcee---eEEeeecccccCceeEEeccccEEEEeccCC------CCcccCchHHHHHHHHHHH Q lcl|NC_021305. 129 EKLMPMHPSRVAIKRNSRTGR---YEYYFQAGAGVGTQLVSFADDEVVPIRFFNP------DGLERGLSLMESLKSTIFS 199 (518) Q Consensus 129 ~~l~~l~p~~v~v~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~evih~~~~~~------~~~~~G~s~l~~~~~~i~~ 199 (518) ..|.+++|..|++........ .++........++ ..++++.|+||..... ....+|.|.++.+++.|.. 
T Consensus 189 ~~l~vld~~~v~p~~~~~~dp~sp~fg~P~~y~v~~g--~~iH~SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~ 266 (532) T protein:vir:94 189 IGFATIEPMWLSPNAYNATDPTLPSFYKPDSWIATSG--KKIHSSRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYVDN 266 (532) T ss_pred eEEEeechheecccccccccccccccCCceeEEEccC--eeeccceEEEecCCCchhhhccccccccccHHHHHHHHHHH Confidence 678889998887654321111 1111000001122 3588999999975432 1234699999999999999 Q ss_pred HHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCC-CcceeeccCChhhHHHHHHHH Q lcl|NC_021305. 200 EDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEE-GMEPIPLQLTAVEMQFIEARQ 278 (518) Q Consensus 200 ~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~-g~~~~~l~~~~~d~~~~e~~~ 278 (518) ...+......+..........+...+.++.+..+.+.+++.....+.+| .++++++. +.+|++++.+..+. .+... T Consensus 267 ~~~t~~~~~~l~~~~~~~v~k~~~a~~ls~~~~~~~~~r~~~~~~~~~n-~g~~~id~~~e~~e~~~~~lsgl--~~~l~ 343 (532) T protein:vir:94 267 WLRTRQSVSDTVKQFSMTNLATDMAQLLAPGGAQSLDARLQLFNLYRDN-RNIGALDKGTEEIQQTNTPLSGL--DSLQA 343 (532) T ss_pred HHHHHHHHHHHHHhcCCceeeechHHhhcchhHHHHHHHHHHHHhhcCC-ccceEEcCCCceeEEEecccCCH--HHHHH Confidence 9888888877666544333222222345566667777777655444334 34566664 57888888777764 57788 Q ss_pred HHHHHHHHHhcCCHH-HhccccccccCCHHHHHHHHHHH-------HhhHHHHHHHHHHHHhhhhhhcccccceecchhh Q lcl|NC_021305. 279 LNREEVCGVYDIAPP-IVHILDRATFSNISAQMRAFYRD-------TMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDV 350 (518) Q Consensus 279 ~~~~~Ia~~fgVPp~-~lg~~~~~~~sn~e~~~~~~~~~-------~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l 350 (518) .....||++.+||.. ++|....+-.++.+.....|+.. .+.|.++.+-+.|.+..+..... 
.+.|.+.+| T Consensus 344 ~~~~~iAaa~~IP~t~LfG~sp~GlnstGe~D~~~yyd~I~s~Qe~~l~p~le~l~~~l~~s~~g~~~~--d~~~~f~pL 421 (532) T protein:vir:94 344 QSQEQMAAVSHIPLVKLLGITPNGLNASSDGEIRVWYDFIAGYQATNLTPLMEWIIDLIQLSEYGQIDP--GLAWEWSPL 421 (532) T ss_pred HHHHHHHhHhCCCeeeeecCCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC--CceEEeCCC Confidence 889999999999987 55655555556667667777764 47888888877776554433223 355666678 Q ss_pred hhcCHHHHHH-------HHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCC----CCCCCCC Q lcl|NC_021305. 351 IQPDWEAKSE-------STQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEW----EEAPAPK 419 (518) Q Consensus 351 ~~~d~~~~~~-------~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~----~~~~~~~ 419 (518) ...+.+++++ ++.+++.+|++++||+|++++..|.. +.+......+ .++......... ...+.+. T Consensus 422 ~~~s~kEkAei~~~~a~a~~~~~~~Gvi~~~Evr~~l~~~~~~--~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ 497 (532) T protein:vir:94 422 MELDDKELAEVRQLNASTDSTLMELGVIDAKMVQQRLAADPTS--GYAGALGERD--ELDDVEEIAKQLMAAALNPPATA 497 (532) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhcCCcc--cccccccccc--ccccccchhhhhcccccCCCCCC Confidence 8787777654 46789999999999999999988763 3332211111 111111111000 0001110 Q ss_pred CCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcccCCch Q lcl|NC_021305. 420 RPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKE 467 (518) Q Consensus 420 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~ 467 (518) ..+.++..+....+++..+.. +....+.-.|+|+. 
T Consensus 498 ~~~~~~~~~~~~d~~~~~~~~-------------~~~~~~~~~~~~~~ 532 (532) T protein:vir:94 498 PQTPNPQPDSEDDQTDNQPDA-------------QADPAQNDQPVGNR 532 (532) T ss_pred CCCCCCCCCCCCCCCCCccCC-------------CccccccCCCcCCC Confidence 001111111111111111110 00111122333333 No 109 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.90 E-value=3.4e-24 Score=149.27 Aligned_cols=492 Identities=10% Similarity=0.054 Sum_probs=229.7 Q ss_pred CcCCCCCC---CCcccccccchhhhhhh---------------------cccccccccccccchhhhHHHhhcHHHHHHH Q lcl|NC_021305. 1 MLLANGQT---LSAPAMAELSPQMQDSY---------------------YYAPAVGMQLERQFSLYGGIYKNQPWVRTVI 56 (518) Q Consensus 1 ~~f~~~~~---~~~~~~~~~~~~~~~~~---------------------~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v 56 (518) -+++..-+ ...+..+.....+.+.. ...|+ ....-.+. ....+|..++.++.+| T Consensus 76 ~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~-~~~~f~gy-ql~alY~~~~larkiV 153 (862) T protein:vir:99 76 SVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWY-LSQGFIGH-QACALIAQHWLVDKAC 153 (862) T ss_pred hhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccccccccccchhccccc-cccCcccH-HHHHHHHhCchhhhhh Confidence 00000000 00010000001111110 00000 00000111 2345799999999999 Q ss_pred HHHHHhhccCceEEEEecCCccee-ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEE-cCC--------- Q lcl|NC_021305. 57 AKRAQALARLPVKCMFTSGDTETE-ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQK-NKS--------- 125 (518) Q Consensus 57 ~~ia~~ia~l~~~v~~~~~~~~~~-~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r-~~~--------- 125 (518) +.+|+.+-+-.+.+...+++.... ..-..+...+.+=+ -+..|.. .+...-++|.+++++.. 
..+ T Consensus 154 d~pAeDatR~g~~I~~~~d~~e~~~e~~~~ie~~~~rL~---v~~~l~e-air~~RLyGga~ililv~~~D~~~LsqPLn 229 (862) T protein:vir:99 154 SLAGEDAIRNGWHLKSLGEGEEIDEESLEKFKAIDVEFK---VKENLIE-FNRFKNVFGIRVAIFVVDSEDPDYYEKPFN 229 (862) T ss_pred hhhhHHHhhCCceEeecCcccccCHHHHHHHHHHHHHhh---HHHHHHH-HHHhcccccceEEEEEecCcCchhhhcCcC Confidence 999999999999986533221111 01111222222211 1233333 34444467766666542 122 Q ss_pred ------CceEEEEeeCCceeEEEE----cCC-ceeeEEeeecccccCceeEEeccccEEEEeccCC------CCcccCch Q lcl|NC_021305. 126 ------GTPEKLMPMHPSRVAIKR----NSR-TGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNP------DGLERGLS 188 (518) Q Consensus 126 ------G~~~~l~~l~p~~v~v~~----~~~-~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~------~~~~~G~s 188 (518) |.+..|.+++|..+.+.. ..+ ....++...... ..+ ..+.++.||||..... ....+|+| T Consensus 230 ~e~I~kG~lkgl~vlDp~w~~p~~v~~~~~Dp~sp~yGkP~~y~-I~g--~~IH~SRliif~g~~vpd~lk~ay~f~G~S 306 (862) T protein:vir:99 230 PDGITPGSYRGISQIDPYWMMPMLTAESTADPSSQFFYEPEFWI-ISG--QKYHRSHLIIARGPQPADILKPTYIFGGIP 306 (862) T ss_pred cccccccceeEEEEechhhhcccccccccccccccccCCceeee-ecC--eeeccceeEEecCCCchhhhhccCCccCcc Confidence 345677788887665422 111 111111110000 112 2577888888865432 22346999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCCh Q lcl|NC_021305. 189 LMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTA 268 (518) Q Consensus 189 ~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~ 268 (518) .++.+++.|.....+......++.+......-+..-..+..+ +.+.+++.....+.+| .++++++.+-+|+.++.+. 
T Consensus 307 vLe~iyd~L~~~d~t~~saa~Ll~ka~l~v~ktd~l~~l~~e--d~l~~r~~~~~~~rdN-~Gi~liD~eEe~e~ls~sl 383 (862) T protein:vir:99 307 LVQRIYERVYAAERTANEAPLLAMNKRTTAIHTDTAKAIANE--DKFIQRLMFWVRYRDN-HAVKVLGTDETMEQFDTSL 383 (862) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccceeechhHhhhccH--HHHHHHHHHHHhccCc-ceeEEecCCCceeEEeccc Confidence 999999999999999888888887755433322222223322 2344444333333333 4588899999999998887 Q ss_pred hhHHHHHHHHHHHHHHHHHhcCCHH-HhccccccccCCHHHHHHHHHHH-------HhhHHHHHHHHHHHHhhhhhhccc Q lcl|NC_021305. 269 VEMQFIEARQLNREEVCGVYDIAPP-IVHILDRATFSNISAQMRAFYRD-------TMAIPIARIQSAMDKYVGQYWVRK 340 (518) Q Consensus 269 ~d~~~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~sn~e~~~~~~~~~-------~l~P~~~~ie~~l~~~l~~~~~~~ 340 (518) .+.. +........||++.+||.. |+|....+.+++.+....+||.. -+.|+++.+...+...+. .. T Consensus 384 SGL~--dll~~~~q~IAaas~IP~tiLfGqspaGlnATGE~D~~nYyD~I~s~QE~~L~P~LerL~~li~~~lg----~~ 457 (862) T protein:vir:99 384 ADFD--AVIMGQYQLVASIAKTPATKLLGTAPKGFNSTGEFETISYHEELESIQEHVYMPFLQRHYLISRLSLG----IQ 457 (862) T ss_pred CChH--HHHHHHHHHHHhhhCCCceeecccCcccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----CC Confidence 7654 7788888899999999987 56655567777878777878774 477888888776654442 22 Q ss_pred ccceecchhhhhcCHHHHHHH-------HHHHHhCCCcCHHHHHHHh------CCCCCCCCCcce--eeecccccccccc Q lcl|NC_021305. 341 NRMKFDIDDVIQPDWEAKSES-------TQKMVNSGVATPNEGREIM------GLPRSDDPKADE--LYANSALQPLGAT 405 (518) Q Consensus 341 ~~~~fd~~~l~~~d~~~~~~~-------~~~~~~~G~~T~NE~R~~~------g~~p~~~~~gD~--~~~~~n~~~~~~~ 405 (518) ..+.|.+.+|...+.++++++ +.+++.+|+++++|+|+++ |++.++++...+ ...+.++..+... 
T Consensus 458 ~d~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~sGvispdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~~~~e~~ 537 (862) T protein:vir:99 458 HEIDVVMEPVASMTAQQQADLNKTKAEGGKVLIDGGVISPDEERNRIRDDKRSGYNRLTKEDAEETPGASPENLAAYQKA 537 (862) T ss_pred CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCcCCCCCCcccccccCCCCcccccccccC Confidence 346677778888888887655 6789999999999999976 444443221110 0011111111100 Q ss_pred cccCCCCCCCCCCCCCccCCC-CCCCccccCCccccccchhcchhhHHHHHHHHhhcccCCc----hhhHHHHHHHHHhh Q lcl|NC_021305. 406 PDGAVEWEEAPAPKRPASTPV-ASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPK----ESSPKHLRAVKGAM 480 (518) Q Consensus 406 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~----~~~~~~~~~~~~~~ 480 (518) .. ...+......+.+.... .+.+++..+.. + +.++.+....+..+.+..+... ..+.+-.+.-...| T Consensus 538 g~--a~~~ap~de~~aga~~~~~e~d~~~~p~~---~---~~~~g~~~~~t~~~~a~~p~~~~~~~~~~~~~~e~~~~~~ 609 (862) T protein:vir:99 538 GA--AQETASAKETQAGAAVTTAEGDQPNVQMV---P---SMKPGQMVGPEVGITAPMPEDDAPVAGVVAKLAELQQAQM 609 (862) T ss_pred Cc--ccccccccccccccCCccccCCccccccc---C---CCCCCCccccccccccCCCccccccCcccccchhhhcCcc Confidence 00 00000000000000000 00000000000 0 0000000000011111111100 01111111112222 Q ss_pred ccccCcCchhHHHHHHHHHHHhHH--H--------Hhhhh-hhhcc-----------cCC Q lcl|NC_021305. 481 GRGKDIKGFALQLAEKYPDDLEDI--L--------LAVQL-ALAER-----------KDN 518 (518) Q Consensus 481 ~~~~~~~~~~~~~~~~~~~~~~~~--~--------~~~~~-~~~~~-----------~~~ 518 (518) +-..+..+--+.++++.++.-+-. . 
.+..+ +.|.+ ..| T Consensus 610 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 669 (862) T protein:vir:99 610 GAVTGVLARLVEQLDRMHDRTIAEGADIGQYDASGRTVKPGTIATIRPSVSGNHVGEQPT 669 (862) T ss_pred hhhcchhhhhHHHHHhhhhhhhhhhcchhhhccccccccccccCCCCCcccccccccCCc Confidence 222233333344444444332100 0 00001 00000 001 No 110 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=99.90 E-value=3.2e-22 Score=138.39 Aligned_cols=422 Identities=12% Similarity=0.042 Sum_probs=249.9 Q ss_pred CCCCCCCCcccccccchhhh--hhhcccc----c-ccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_021305. 3 LANGQTLSAPAMAELSPQMQ--DSYYYAP----A-VGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSG 75 (518) Q Consensus 3 f~~~~~~~~~~~~~~~~~~~--~~~~~~~----~-~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~ 75 (518) -...-.++.|..+....-.+ -+++..+ . ...+......++.++..+.+.|.+|++.+...|.+++|+|...++ T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~~~~~~~~~~~~e~~~~lr~~~~~~ly~~m~e~D~~i~s~l~~rk~av~~~~w~v~p~~~ 80 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSGVVDGWTVWDPFEQTPELQWPQSVAVYSRMDNEDSRVTSLLEAISLPIRSTPWRIRANGA 80 (469) T ss_pred CCCcccCCCCccchhhhhhcccccchhhccccccccccccccchHHHHHHHhhChHHHHHHHHHHHHHhcCCceEecCCC Confidence 22223333332111000000 0111100 0 001112234466776778999999999999999999999964333 Q ss_pred CcceeccchHHHHHHh----cC--------CcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCC-----C--ceEEEEeeCC Q lcl|NC_021305. 76 DTETEESDTGYAKLLA----DP--------CEYLDPFAFWEWVASTLDIYGETYLAIQKNKS-----G--TPEKLMPMHP 136 (518) Q Consensus 76 ~~~~~~~~~~~~~L~~----~P--------N~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~-----G--~~~~l~~l~p 136 (518) + .+..+.....|.. .+ +...+|.+++..++.+.+.+|.++.++++... 
| .+..|.+.++ T Consensus 81 ~--~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG~~~~~~l~~rp~ 158 (469) T protein:vir:10 81 S--DEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDGRFWLRKLAPRPQ 158 (469) T ss_pred C--HHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCCceeeeeeeecCc Confidence 2 2221111111111 11 12347889999988889999999999998643 3 2556777777 Q ss_pred ceeE-EEEcCCceeeEEeeecc--------cccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 137 SRVA-IKRNSRTGRYEYYFQAG--------AGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNAT 207 (518) Q Consensus 137 ~~v~-v~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~ 207 (518) .++. ...+.+++...+..... ...+...+.+++..+|++++....+..+|.|.+..++..+.......++. T Consensus 159 ~~i~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lp~~k~i~~~~~~~~g~p~g~gLlr~~~~~~~fK~~~~~~w 238 (469) T protein:vir:10 159 WTISKFNVAPDGGLESIEQIAPPARTRGSLYVANIAPPEIPVNRLVVYTRNKRPGQWQGKSILRSAYKHWLLKDKLLRIE 238 (469) T ss_pred ccceeeeeccCCceeeeeecCcccccccccccCCCCccccccCcEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHH Confidence 6552 33333333333221110 01122345678888888888777788899999999999999999999999 Q ss_pred HHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 208 AAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGV 287 (518) Q Consensus 208 ~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~ 287 (518) ..|....|.|--+.+.+...++++++.+.+.......|. 
+ ..+|++.|++++-+..+.+...|.++.++..++|+.+ T Consensus 239 ~~f~EryG~P~~vgky~~~a~~~ek~~l~~a~~~~~~g~-~--a~~iip~~~~ie~~ea~g~~~~~~~li~~~d~~Isk~ 315 (469) T protein:vir:10 239 AATAERNGMGIPVGTASSATDEDEVRKMAALARSVRGGI-N--AGVGLAQGQILELLGVSGNLPDIRRAIEGHDRSIALS 315 (469) T ss_pred HHHHHHcCCcceEEecCCCCCHHHHHHHHHHHHHHhcCC-c--eEEEccCCceEEEeecCCCchHHHHHHHHHHHHHHHH Confidence 999999999999999998889999988888777654442 2 2467899998887776666667999999999999887 Q ss_pred hcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcc------cccceecchhhhhcCHHHHHHH Q lcl|NC_021305. 288 YDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVR------KNRMKFDIDDVIQPDWEAKSES 361 (518) Q Consensus 288 fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~------~~~~~fd~~~l~~~d~~~~~~~ 361 (518) .-- ..+-.....++++. .+.........+.-.+..|+..||+.|+...-. ..+.+|.+..+. .+.+..++. T Consensus 316 iLG-~tlTs~~~gGS~a~-~~vh~ev~~d~~~sDa~~i~~tln~~li~~l~~lN~g~~~~~P~~~~~~~e-~~~~~~a~~ 392 (469) T protein:vir:10 316 GLA-HFLNLDGKGGSYAL-ASVLEDPFTQAVHAYATSICRIANQHIIEDLVDINFGVDTPAPVLTFDPIG-SRQDLTAAA 392 (469) T ss_pred Hhc-ccccccCccchhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEecCCC-CcHHHHHHH Confidence 622 22222222333333 344556777788889999999999988764311 223455555543 566778999 Q ss_pred HHHHHhCCCc-----CHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCC-CCCCCCCccCCCCCCCccccC Q lcl|NC_021305. 362 TQKMVNSGVA-----TPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEE-APAPKRPASTPVASLDQSPPT 435 (518) Q Consensus 362 ~~~~~~~G~~-----T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 435 (518) +++++..|++ +.+.+|+.+|+|+-++ ++....+.. | . ....+. .+....++ ...+.. 
T Consensus 393 i~~l~~~G~~~~~~~~~~~~~e~~gip~~~~--~~~~~~~~~--~-----~-~~~~~~~~~~~~~~~--~~~~~~----- 455 (469) T protein:vir:10 393 VKLLYDAGVFDDDPAVKRAIRQRFNLPSELN--DTPSAEPEE--P-----A-AVPNQSAAPARTRSS--GNADAR----- 455 (469) T ss_pred HHHHHhcCCccCccccHHHHHHHhCCCCCCC--Ccccccchh--c-----c-cCCCCCccccccCCC--CCcccc----- Confidence 9999999984 5577999999986542 222111100 0 0 000000 00000000 000000 Q ss_pred CccccccchhcchhhHHHHHH Q lcl|NC_021305. 436 SVPGLSPTNSDRSTDSGKTEP 456 (518) Q Consensus 436 ~~~~~~~~~~~~~~~~~~~~~ 456 (518) ...+.+ +...-.++ T Consensus 456 -~~~~~~------~~~~l~da 469 (469) T protein:vir:10 456 -ARAPKA------DQGVLFDA 469 (469) T ss_pred -cccCCC------hHHhhccC Confidence 000000 00000000 No 111 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.89 E-value=2.2e-23 Score=144.78 Aligned_cols=397 Identities=10% Similarity=0.093 Sum_probs=220.4 Q ss_pred cCCCCCCCCcccccccch---h--------hhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEE Q lcl|NC_021305. 2 LLANGQTLSAPAMAELSP---Q--------MQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKC 70 (518) Q Consensus 2 ~f~~~~~~~~~~~~~~~~---~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v 70 (518) |-.+-..+.+...+.... . -++...+.+ .+.+..-+......+|..++.++.+|+.+|+.+-+-++++ T Consensus 1 ~~~~~~a~~~~~~~~a~~~~~~~~~~g~~~~~d~~~~~~-~~~~~~~~~~~l~~lY~~~~l~r~iVd~~a~d~~r~g~~i 79 (461) T protein:vir:80 1 MYSIDKAKQAKIDSKIVNRNDFMVGHGKANSRDKLTRQT-PGNGQKLDLKACENLYASNSIAMNIVDIISEDMVRAGWSL 79 (461) T ss_pred CccchhhhhhhhhhhhhhhhHHHhhcCCcchhhhhhccc-cCcccccCHHHHHHHHHhCCccchhhccchHHhhcCCeee Confidence 333222221111111111 0 011111111 1111112345556789999999999999999999988877 Q ss_pred EEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCC-------------C---ceEEEEee Q lcl|NC_021305. 
71 MFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS-------------G---TPEKLMPM 134 (518) Q Consensus 71 ~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~-------------G---~~~~l~~l 134 (518) .-.+++ .. ..+...+.+ . ...+-+...+.+..++|.+++++..... + .+..|.|+ T Consensus 80 ~~~~~~-~~----~~~~~~~~~---l-~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl~~~~~~~~~~l~~~ 150 (461) T protein:vir:80 80 KTDNKE-MK----KNIESKWRK---L-KTKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLSTAIDPKTIKSIPYINTF 150 (461) T ss_pred ecCCHH-HH----HHHHHHHHH---h-hHHHHHHHHHHhhcccccEEEEEEeecCCccccCccCCcccccccceeEEEec Confidence 322110 00 011111221 1 2233445556667789998888753211 1 12223333 Q ss_pred CCceeEEEE---c----CCceeeEEeeec---------ccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHH Q lcl|NC_021305. 135 HPSRVAIKR---N----SRTGRYEYYFQA---------GAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIF 198 (518) Q Consensus 135 ~p~~v~v~~---~----~~~~~~~~~~~~---------~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~ 198 (518) .+..+.+.. + ..+.+..|.+.. .+..+...+.+.++.||||.+....+..+|.|.++.+++.+. T Consensus 151 ~~~~i~~~~~~~dp~sp~fg~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SRii~~~~~~~~~~~~G~S~le~~~~~l~ 230 (461) T protein:vir:80 151 NTQKVTQLYLNQDMFSEHFGEVEFFEVNRVSQLGEEILSGTTASTSEQIHRSRIIHEQGLRFEGETKGRSIFESLYDIIT 230 (461) T ss_pred cccccchhhhcccCcCcccccceEEEEeccccccccccccccCccceEEccccEEEecCCCCCccccCcchHHHHHHHHH Confidence 333322111 1 112333333321 122344557799999999998877777889999999999999 Q ss_pred HHHHHHHHHHHHHHccCCcccccccCc--cCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHH Q lcl|NC_021305. 199 SEDSSRNATAAMWKNAGRPNLVLRHEK--RLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEA 276 (518) Q Consensus 199 ~~~~~~~~~~~~~~ng~~p~~il~~~~--~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~ 276 (518) ....+......+..+-..+ +++.++ .+..+....+.+.++... +..++++++.+-+++.++.+..+. .+. 
T Consensus 231 ~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~~~~~----~~~g~~~~d~~e~~e~~~~~lsgl--~~~ 302 (461) T protein:vir:80 231 VMDTSLWSVGQILYDFAFK--VYKTDDIDALNKDDKANLTAMLDFMF----RTEALAIIKGDEQLTKESTNVSGM--KDL 302 (461) T ss_pred HHHHHHHHHHHHHHHhCCC--ceecchHHhhhchHHHHHHHHHHHhc----CCceEEEEcCCcceEEEecCcCCH--HHH Confidence 9998888888877665443 334442 233344445555565433 233578889889999998887764 488 Q ss_pred HHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHH-------hhHHHHHHHHHHHHhhhhhh----ccccccee Q lcl|NC_021305. 277 RQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDT-------MAIPIARIQSAMDKYVGQYW----VRKNRMKF 345 (518) Q Consensus 277 ~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~-------l~P~~~~ie~~l~~~l~~~~----~~~~~~~f 345 (518) .+.....||++.+||...|.....+..++.++....|+..+ +.|+++.+...|-+..+... ...+.+.| T Consensus 303 l~~~~~~iaa~s~iP~t~L~G~s~g~~asge~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i 382 (461) T protein:vir:80 303 LDYGWDYLAGAVRMPKTVLKGQEAGTLTGAQYDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEWAI 382 (461) T ss_pred HHHHHHHHhhhhcCCeeeeecccCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccceEE Confidence 89999999999999987654444456677777777776643 56777776665554333211 11246778 Q ss_pred cchhhhhcCHHHHHHH-------HHHHHhCCCcCHHHHHHHh-CCCCCCCCCcceeeecccccccccccccCCCCCCCCC Q lcl|NC_021305. 346 DIDDVIQPDWEAKSES-------TQKMVNSGVATPNEGREIM-GLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPA 417 (518) Q Consensus 346 d~~~l~~~d~~~~~~~-------~~~~~~~G~~T~NE~R~~~-g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~ 417 (518) .+.+|...|.+++++. +.+++.+|++|++|+|+.+ +.-.++++. ++..++........ . T Consensus 383 ~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~~~~~~~~~~--------~~~~~~~~~~~~~~-----~ 449 (461) T protein:vir:80 383 EFNPLWNLDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRFGRFGLENSS--------KFSGDSAEIDKLAK-----L 449 (461) T ss_pred EeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCCCCc--------cCCCCCchhhhhhh-----h Confidence 8889988888887654 7789999999999999865 322221110 00000000000000 0 Q ss_pred CCCCccCCCCCCCccccCC Q lcl|NC_021305. 
418 PKRPASTPVASLDQSPPTS 436 (518) Q Consensus 418 ~~~~~~~~~~~~~~~~~~~ 436 (518) ..++ +..++. ++ T Consensus 450 ~~~~---~~~e~~----~g 461 (461) T protein:vir:80 450 VYDA---YAKKNA----DG 461 (461) T ss_pred cccc---ccccCC----CC Confidence 0000 000000 00 No 112 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.89 E-value=9.6e-24 Score=146.77 Aligned_cols=430 Identities=11% Similarity=0.036 Sum_probs=244.4 Q ss_pred CcCC--CCCCCCcc---------cccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceE Q lcl|NC_021305. 1 MLLA--NGQTLSAP---------AMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVK 69 (518) Q Consensus 1 ~~f~--~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~ 69 (518) =+++ ...+.+.. ..+..+.|.....+................++++.+++.+..||+.+.+.+-...|+ T Consensus 5 ~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~Gi~ 84 (530) T protein:vir:38 5 SLVGPDGKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQDHIVGSFFR 84 (530) T ss_pred eeecCccccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhhCCCce Confidence 1111 00000000 000111111100000000011112234566788999999999999999999888887 Q ss_pred EEEecC------Ccc--eecc---chHHHHHHhcCCc------CCCHHHHHHHHHHHHHHcCCeEEEEEEcCC-C--ceE Q lcl|NC_021305. 70 CMFTSG------DTE--TEES---DTGYAKLLADPCE------YLDPFAFWEWVASTLDIYGETYLAIQKNKS-G--TPE 129 (518) Q Consensus 70 v~~~~~------~~~--~~~~---~~~~~~L~~~PN~------~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~-G--~~~ 129 (518) +.-+-+ ++. ++.. ......+...|+. .+|++++...++..++..|++|+.+.+... | .+. 
T Consensus 85 ~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~g~~~~~ 164 (530) T protein:vir:38 85 LSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWDSDSTRLFRT 164 (530) T ss_pred eeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEeeeccCCCCccce Confidence 754311 111 1111 1122333345543 468999999999999999999999887644 3 256 Q ss_pred EEEeeCCceeE--------------EEEcCCceeeEEeeecccccC---c------eeEEeccccEEEEeccCCCCcccC Q lcl|NC_021305. 130 KLMPMHPSRVA--------------IKRNSRTGRYEYYFQAGAGVG---T------QLVSFADDEVVPIRFFNPDGLERG 186 (518) Q Consensus 130 ~l~~l~p~~v~--------------v~~~~~~~~~~~~~~~~~~~~---~------~~~~~~~~evih~~~~~~~~~~~G 186 (518) .|..|+|++|. |+.+..|..+.|.+......+ . ....+++.+|||+......+..+| T Consensus 165 ~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~vlH~f~~~r~gQ~RG 244 (530) T protein:vir:38 165 QFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTYIPRELPGGRPSFIHVFEPMEDGQTRG 244 (530) T ss_pred EEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEeeccCCCccccccceeeeeeccChhHeEeeccccCCCcccC Confidence 88999998875 444556666777665332111 1 124466779999999888889999 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCC-----------HHHHHHHHHHHH---HHhcC---cccc Q lcl|NC_021305. 187 LSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLS-----------EAAQQRLREQFD---RAHSG---SSNT 249 (518) Q Consensus 187 ~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~-----------~~~~~~~~~~~~---~~~~g---~~n~ 249 (518) +|.+..++..+.......+....-.+-.+...++|+.+..-. .++...+..... ..+.. .-.. 
T Consensus 245 is~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~p 324 (530) T protein:vir:38 245 ANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGG 324 (530) T ss_pred CchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCcccccccccccchhhhhcccccceeccC Confidence 999999999998888888887777777777778877543210 111111111100 00000 1245 Q ss_pred CCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHh-ccccccccCCHHHHHHHHHHH-----------H Q lcl|NC_021305. 250 GKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIV-HILDRATFSNISAQMRAFYRD-----------T 317 (518) Q Consensus 250 g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~sn~e~~~~~~~~~-----------~ 317 (518) |.+..|..|.+++.+..+-....|.++.+...+.||+.+|||-+.| |+.++.|||+..+....+... - T Consensus 325 G~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~ 404 (530) T protein:vir:38 325 ARVPHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSYSTARASANESWAYFMGRRKFVASRQ 404 (530) T ss_pred ceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6788899999998888776777899999999999999999997755 777888999887666555443 2 Q ss_pred hhHHHHH-HHHHHHHhhhhhhc---------cc--ccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Q lcl|NC_021305. 318 MAIPIAR-IQSAMDKYVGQYWV---------RK--NRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRS 385 (518) Q Consensus 318 l~P~~~~-ie~~l~~~l~~~~~---------~~--~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~ 385 (518) +.|+... ++.++..-.++... +. 
..+++-.-.....|+.+.+++...++.+|+.|+-++-.+.|.++- T Consensus 405 ~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G~D~~ 484 (530) T protein:vir:38 405 ACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRGDDYQ 484 (530) T ss_pred hhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHH Confidence 2333332 33333332222100 00 112333345567799999999999999999999999989998774 Q ss_pred CCCCcceeeeccc-ccccccccccCCCCCCCCCCCCCccCCCCCCCccccCC Q lcl|NC_021305. 386 DDPKADELYANSA-LQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTS 436 (518) Q Consensus 386 ~~~~gD~~~~~~n-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 436 (518) + .-+++..=.. +..++-.. .......++.....+..++.+..+.+ T Consensus 485 ~--v~~q~a~e~~~~~~~Gl~~----~~~~~~~~~~~~~~~~~~~~d~~~~a 530 (530) T protein:vir:38 485 E--IFAQQVRESMERRAAGLNP----PAWAAAAFEAGVKKSNEEEQDGARAA 530 (530) T ss_pred H--HHHHHHHHHHHHHHcCCCC----CCCcccccCCCCCCCCCCCCCCCCCC Confidence 2 2221110000 00000000 00000000000000000111111111 No 113 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=99.88 E-value=4.8e-21 Score=131.99 Aligned_cols=472 Identities=13% Similarity=0.098 Sum_probs=265.0 Q ss_pred CcCC-CCCCCC-----cccccccchhhhhhhcccccccccccc-----------cch----hhhHHHhhcHHHHHHHHHH Q lcl|NC_021305. 1 MLLA-NGQTLS-----APAMAELSPQMQDSYYYAPAVGMQLER-----------QFS----LYGGIYKNQPWVRTVIAKR 59 (518) Q Consensus 1 ~~f~-~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~----~~~~~~~~~~~v~~~v~~i 59 (518) -++. -|++.+ .++...++ +++..+...+..|..... +.. 
++.++..+.+.|.+|++.+ T Consensus 3 ~~~d~~g~p~~~~~~~~~~~~~~~-~~~~~~~~~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e~m~e~D~~i~s~l~~R 81 (526) T protein:vir:99 3 QIVDVYGNPIRTQQLREPQTSRLA-GLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMSKR 81 (526) T ss_pred eeECCCCCccccccccchhhhhhh-hhhhhhcccCcCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHHHH Confidence 1222 233222 12221111 122333333333321111 122 3333334789999999999 Q ss_pred HHhhccCceEEEEecCCcce--eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCC---ceEEEEee Q lcl|NC_021305. 60 AQALARLPVKCMFTSGDTET--EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSG---TPEKLMPM 134 (518) Q Consensus 60 a~~ia~l~~~v~~~~~~~~~--~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G---~~~~l~~l 134 (518) ...|.+++|.|....++... ...+.....|...| ++.+++..++ +.+.+|.+++++++...| .+..+.+. T Consensus 82 k~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~----~~~~~i~~~l-da~~~G~s~~Eivw~~~~g~~~~~~l~~r 156 (526) T protein:vir:99 82 KRAILGLDWAVEPPRNASAAEKADADYLHELLLDLE----GLEDLLLDAL-DGIGHGYSCIELEWALQGREWMPLAFHHR 156 (526) T ss_pred HHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhccc----CHHHHHHHHH-HhhhhcceeEEEEEeecCCceeEEEeeee Confidence 99999999999654333221 12222222333333 4777777765 578899999999986643 46688899 Q ss_pred CCceeEEEEcCCceeeEEeeecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHcc Q lcl|NC_021305. 135 HPSRVAIKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNA 214 (518) Q Consensus 135 ~p~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng 214 (518) ++.++.+..+... .+.+ .. + ......+++...+..++....+..+|.+.+..+...+.......++...|.... 
T Consensus 157 ~~~~f~~~~~~~~-~l~~--~~-~--~~~g~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~y 230 (526) T protein:vir:99 157 PQSWFQLNPEDQN-ELRL--RD-N--SPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIY 230 (526) T ss_pred cccceeeccCCCc-EEEe--cC-C--CCCceeecCCCeEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHc Confidence 9988876554432 2222 11 1 122345677767666666667788999999999999999999999999999999 Q ss_pred CCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCC-hhhHHHHHHHHHHHHHHHHHhcCCHH Q lcl|NC_021305. 215 GRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLT-AVEMQFIEARQLNREEVCGVYDIAPP 293 (518) Q Consensus 215 ~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~-~~d~~~~e~~~~~~~~Ia~~fgVPp~ 293 (518) |.|--+.+++...++++++++.+.+.+..+ + ..+|++.|++++-+..+ .....|.++.++..++|+.++ +-.. T Consensus 231 G~P~~igky~~~a~~~ek~~L~~av~~i~~---d--~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LGqt 304 (526) T protein:vir:99 231 GLPIRLGKYPPGTADEEKATLLRAVTGLGH---A--AAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV-LGGT 304 (526) T ss_pred CCceEEEecCCCCCHHHHHHHHHHHHHHhh---C--cEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH-hhhh Confidence 999999999988899999998888766532 2 35778888777666532 233457888899999998875 2222 Q ss_pred Hhcccc---ccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcc---------cccceecchhhhhcCHHHHHHH Q lcl|NC_021305. 294 IVHILD---RATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVR---------KNRMKFDIDDVIQPDWEAKSES 361 (518) Q Consensus 294 ~lg~~~---~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~---------~~~~~fd~~~l~~~d~~~~~~~ 361 (518) +-.... .++++.. +.........+.-.+..|+..||+.|+...-. ..+.+|.+......|.+.+++. 
T Consensus 305 lTs~~~~g~~gS~a~g-~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~ 383 (526) T protein:vir:99 305 LTSTTSQSGGGAFALG-QVHNEVRHDLLASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQS 383 (526) T ss_pred hccccccCcchhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHH Confidence 221111 1223222 23345566777888999999999887654321 1133555556678899999999 Q ss_pred HHHHHhCCC-cCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCcccc Q lcl|NC_021305. 362 TQKMVNSGV-ATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGL 440 (518) Q Consensus 362 ~~~~~~~G~-~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (518) +.+++..|+ ++..++|+++|+|.-. .++.++.+.... . .+.... +.......+.. ... T Consensus 384 ~~~L~~~G~~i~~~~i~e~~Gip~~~--~~e~~l~~~~~~--------~---~~~~~~--~~~~~~~~~~~----~~~-- 442 (526) T protein:vir:99 384 IPALVNVGLEIPSAWVYDKLGIPQPA--KNEPVLRSAAQP--------A---ILSRQH--GQRVAALATIV----GPR-- 442 (526) T ss_pred HHHHHhCCCccCHHHHHHHhCCCCCC--CcccccCCCCCC--------c---cccccc--ccccccccccc----ccc-- Confidence 999999997 8999999999996543 233332111100 0 000000 00000000000 000 Q ss_pred ccchhcchhhHHHHHHHHhhcccCCchhhHHHHHHHHHhhccccCcCchhHHHHHHHH----HHHhH----HH----Hhh Q lcl|NC_021305. 441 SPTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAVKGAMGRGKDIKGFALQLAEKYP----DDLED----IL----LAV 508 (518) Q Consensus 441 ~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~----~~----~~~ 508 (518) .+ ..+..++.......+. .........+.|.+.+...++.+...-++++-|. ++|+. .. ++| T Consensus 443 ~~-~~~~~d~~l~~~~~~~-----~~~~~~~~l~~i~~~l~~~~s~ee~~~~L~~l~~~ld~~~l~~~l~~a~~~A~l~G 516 (526) T protein:vir:99 443 YG-DQQALDKALADLPAKD-----MQNQANDLLAPLLEAVNRGDSETELLGALAEAFPDMDDSALTDALHRLLFAADTWG 516 (526) T ss_pred Cc-chhhHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHhh Confidence 00 0011111111110000 0122233455566666666666655555444442 22222 11 112 Q ss_pred hhhhhcccCC Q lcl|NC_021305. 
509 QLALAERKDN 518 (518) Q Consensus 509 ~~~~~~~~~~ 518 (518) ..+...--+. T Consensus 517 r~~~~~e~~~ 526 (526) T protein:vir:99 517 RLHGNLDRID 526 (526) T ss_pred hhhhhhcccC Confidence 2111111111 No 114 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.88 E-value=7e-24 Score=147.52 Aligned_cols=424 Identities=11% Similarity=0.050 Sum_probs=242.8 Q ss_pred CCCCCcccc-----------------------cccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHh Q lcl|NC_021305. 6 GQTLSAPAM-----------------------AELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQA 62 (518) Q Consensus 6 ~~~~~~~~~-----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ 62 (518) =+.|..+.. +....|.....+................++++.+++.+..||+.+.+. T Consensus 1 ~~~p~~~~~~~~~~~~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~n 80 (533) T protein:vir:34 1 MKTPTIPTLLGPDGMTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQDH 80 (533) T ss_pred CCCchhhhhhcccccchHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHH Confidence 111111100 001111110000000001111222445677899999999999999999 Q ss_pred hccCceEEEEecC------Cc--ceecc---chHHHHHHhcCC------cCCCHHHHHHHHHHHHHHcCCeEEEEEEcCC Q lcl|NC_021305. 63 LARLPVKCMFTSG------DT--ETEES---DTGYAKLLADPC------EYLDPFAFWEWVASTLDIYGETYLAIQKNKS 125 (518) Q Consensus 63 ia~l~~~v~~~~~------~~--~~~~~---~~~~~~L~~~PN------~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~ 125 (518) +-...|++.-.-+ ++ .++.. ......+...|+ ..++++++...++..++..|++|+.+.+... 
T Consensus 81 vVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~ 160 (533) T protein:vir:34 81 IVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDTS 160 (533) T ss_pred hhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeeeccC Confidence 9777887753311 01 11111 112233334444 3468999999999999999999999886554 Q ss_pred -C--ceEEEEeeCCceeE--------------EEEcCCceeeEEeeecccccCc---------eeEEeccccEEEEeccC Q lcl|NC_021305. 126 -G--TPEKLMPMHPSRVA--------------IKRNSRTGRYEYYFQAGAGVGT---------QLVSFADDEVVPIRFFN 179 (518) Q Consensus 126 -G--~~~~l~~l~p~~v~--------------v~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~evih~~~~~ 179 (518) | .+..|..|+|+++. |+.+..|..+.|.+......+. ....+++.+|||+.... T Consensus 161 ~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~VlH~f~~~ 240 (533) T protein:vir:34 161 SSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFIHVFEPV 240 (533) T ss_pred CCCccceEEEEechhhcCCCCCCCCCCceEeeeEECCCCCeEEEEEeecCCCCccccccceeeeeeccChhHeeeecccc Confidence 2 25688889887775 4445555666776643222111 23446788999999988 Q ss_pred CCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccC-----------CHHHHHHHH---HHHHHHhcC Q lcl|NC_021305. 180 PDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRL-----------SEAAQQRLR---EQFDRAHSG 245 (518) Q Consensus 180 ~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~-----------~~~~~~~~~---~~~~~~~~g 245 (518) ..+..+|+|.+..++..+.......+....-.+-.+...++|+.+..- ..+..+.+. 
..-...+.+ T Consensus 241 r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (533) T protein:vir:34 241 EDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAA 320 (533) T ss_pred CCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhccCc Confidence 888999999999999999888888888877777777788888764210 011111111 111111111 Q ss_pred ---ccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHH-hccccccccCCHHHHHHHHHHH----- Q lcl|NC_021305. 246 ---SSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPI-VHILDRATFSNISAQMRAFYRD----- 316 (518) Q Consensus 246 ---~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~~~sn~e~~~~~~~~~----- 316 (518) .-+.|.+..|..|.+++.+..+-....|.++.+...+.||+.+|||-+. .|+.++.|||+..+....+... T Consensus 321 ~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q 400 (533) T protein:vir:34 321 APVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRR 400 (533) T ss_pred ceeeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHH Confidence 1245778889999999888877777889999999999999999999764 5777788999886655544433 Q ss_pred ------HhhHHHHH-HHHHHHHhhhhhhc---------cc--ccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHH Q lcl|NC_021305. 317 ------TMAIPIAR-IQSAMDKYVGQYWV---------RK--NRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGRE 378 (518) Q Consensus 317 ------~l~P~~~~-ie~~l~~~l~~~~~---------~~--~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~ 378 (518) .++|+... ++.++..-.++... +. ..+.+-.-.....|+.+.+++...++.+|+.|.-|+-. 
T Consensus 401 ~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a 480 (533) T protein:vir:34 401 KFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECA 480 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHH Confidence 33444443 33333222221100 01 12344445567789999999999999999999999999 Q ss_pred HhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCC-CCCccCCCC-CCCccccCC Q lcl|NC_021305. 379 IMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAP-KRPASTPVA-SLDQSPPTS 436 (518) Q Consensus 379 ~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~-~~~~~~~~~ 436 (518) +.|.++-+ .-+++..=... +... +.. ....+.. ...+..+.. ++.+....+ T Consensus 481 ~~G~D~~e--v~~q~a~e~~~--~~~~--gl~-~~~~~~~~~~s~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:34 481 KRGDDYQE--IFAQQVRETME--RRAA--GLK-PPAWAAAAFESGLRQSTEEEKSDSRAA 533 (533) T ss_pred HcCCCHHH--HHHHHHHHHHH--HHhc--CCC-CCCCCCcCccCCCCCCCCCCcccCCCC Confidence 99988742 22221100000 0000 000 0000000 000000000 000000000 No 115 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=99.88 E-value=5.2e-21 Score=131.77 Aligned_cols=475 Identities=13% Similarity=0.115 Sum_probs=261.8 Q ss_pred CcCC-CCCCCCc-----ccccccchhhhhhhcccccccccccc-----------cch----hhhHHHhhcHHHHHHHHHH Q lcl|NC_021305. 1 MLLA-NGQTLSA-----PAMAELSPQMQDSYYYAPAVGMQLER-----------QFS----LYGGIYKNQPWVRTVIAKR 59 (518) Q Consensus 1 ~~f~-~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~----~~~~~~~~~~~v~~~v~~i 59 (518) -++. -|++.+. |+...+ .+++..+...+..|..... +.. 
++.++..+.+.|.+|++.+ T Consensus 3 ~~~d~~g~p~~~~~~~~~~~~~~-~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~~~m~e~D~~i~s~l~~R 81 (528) T protein:vir:10 3 AIVDIYGNPLRTQQLRKQQTAHL-AGLAKEFANHPAKGLTPAKLAHILIEAEQGHLQAQAELFMDMEERDAHLFAEMSKR 81 (528) T ss_pred eeECCCCCccccccccchhhhhh-hhhhhhhcccCCCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHHHH Confidence 1222 2222211 221111 1223333333333321111 122 2223334799999999999 Q ss_pred HHhhccCceEEEEecCCcceec-cchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCC---ceEEEEeeC Q lcl|NC_021305. 60 AQALARLPVKCMFTSGDTETEE-SDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSG---TPEKLMPMH 135 (518) Q Consensus 60 a~~ia~l~~~v~~~~~~~~~~~-~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G---~~~~l~~l~ 135 (518) ...|.+++|.|....++..... .-..+..++.+ ...+.+++..+ .+.+.+|.+++++++...| .+..+.+++ T Consensus 82 k~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~---~~~f~~~i~~~-lda~~~G~s~~Ei~w~~~~g~~~~~~~~~r~ 157 (528) T protein:vir:10 82 KRAVLGLDWTIEPPRNASAAEKADAEYLHELLLD---LEGIEDLMLDC-MDGVGHGYSAIELDWSLQGREWLPQAFDHRP 157 (528) T ss_pred HHHHhcCCceEecCCCCCHHHHHHHHHHHHHHhC---CccHHHHHHHH-HhhhhhcceeEEEEEeecCCceeEEEeeeec Confidence 9999999999965433322111 11122222222 12356666653 4477899999999986543 466888889 Q ss_pred CceeEEEEcCCceeeEEeeecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccC Q lcl|NC_021305. 136 PSRVAIKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAG 215 (518) Q Consensus 136 p~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~ 215 (518) +.++.+..+.. ..+ .... .. 
.....+++...++.++....+..+|.+.+..+...+.......++...|....| T Consensus 158 ~~~f~~~~~~~-~~l--~~~~-~~--~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG 231 (528) T protein:vir:10 158 QSWFQLNPDDQ-DEL--RLRD-NS--IAGEVLQPFGWIMHKPRSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYG 231 (528) T ss_pred ccceeeccCCC-cEE--eccC-CC--CCceeecCCCeEEEeecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcC Confidence 88877654432 222 1111 11 123456777777777777778889999999999999999999999999999999 Q ss_pred CcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCC-hhhHHHHHHHHHHHHHHHHHhcCCHHH Q lcl|NC_021305. 216 RPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLT-AVEMQFIEARQLNREEVCGVYDIAPPI 294 (518) Q Consensus 216 ~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~-~~d~~~~e~~~~~~~~Ia~~fgVPp~~ 294 (518) .|--+.+++...++++++.+.+.+.+..+ + ..+|++.|++++-+..+ .....|.++.++..++|+.+. +-..+ T Consensus 232 ~P~~igky~~~a~~~ek~~L~~al~~i~~---~--~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LGqtl 305 (528) T protein:vir:10 232 LPIRLGKYPPGTPDEEKVTLLRAVTGLGH---A--AAGIIPESMSIDFQEASKGSAEPFMAMMRWCDDSMSKAI-LGGTL 305 (528) T ss_pred CCeEEEecCCCCCHHHHHHHHHHHHHHhh---C--cEEEecCCceeEEeecCCCChhHHHHHHHHHHHHHHHHH-hhhhh Confidence 99999999988899999998888766532 2 35677888777665532 223347888999999998876 22333 Q ss_pred hccc-c--ccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhcc---------cccceecchhhhhcCHHHHHHHH Q lcl|NC_021305. 295 VHIL-D--RATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVR---------KNRMKFDIDDVIQPDWEAKSEST 362 (518) Q Consensus 295 lg~~-~--~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~---------~~~~~fd~~~l~~~d~~~~~~~~ 362 (518) -... + .++++- .+.........+.-.+..|+..||+.|+...-. 
..+.+|.+......|.+++++.+ T Consensus 306 Ts~~~~g~~gS~Al-g~vh~~v~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~ 384 (528) T protein:vir:10 306 TSQTSESGGGAYAL-GQVHNEVRHDLLAADARQLAATLSRDLLWPLLVLNRSGNLDARRAPRLVFDLKDRADLAAMATSL 384 (528) T ss_pred hccccccccchhhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCCcccHHHHHHHH Confidence 2211 1 122222 223345667788888999999999888654321 12345555666788999999999 Q ss_pred HHHHhCCC-cCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccc Q lcl|NC_021305. 363 QKMVNSGV-ATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLS 441 (518) Q Consensus 363 ~~~~~~G~-~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (518) .+++..|+ ++..++|+++|+|.-. .++.+..+....+. ......+.+...+... . .... T Consensus 385 ~~L~~~G~~i~~~~i~e~~gip~p~--~~e~~~~~~~~~~~-------~~~~~~~~~~~~~~~~------~-----~~~~ 444 (528) T protein:vir:10 385 PPLVKLGVQVPVNWVQEQLGIPLPA--NGEAVLGDQAGAGI-------AQLSRRPGPRIAALAQ------V-----IGPR 444 (528) T ss_pred HHHHhCCCCCCHHHHHHHhCCCCCC--CCcccccCCCcccc-------cccCcccccccccccc------c-----cccc Confidence 99999998 8999999999996543 23433322211100 0000000000000000 0 0000 Q ss_pred cchhcchhhHHHHHHHHhhcccCCchhhHHHHHHHHHhhccccCcCchhHHHHHHHH----HHHhHH--------HHhhh Q lcl|NC_021305. 442 PTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAVKGAMGRGKDIKGFALQLAEKYP----DDLEDI--------LLAVQ 509 (518) Q Consensus 442 ~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~--------~~~~~ 509 (518) ....+..++.......+.. ........+.+...+...++.+...-++++-|. ++++.+ .++|. T Consensus 445 ~~~~~~~d~~~~~~~~~~~-----~~~~~~~l~~i~~~l~~~~s~ee~~~~L~~l~~~~d~~~l~~~l~~a~~~A~l~G~ 519 (528) T protein:vir:10 445 YRDQEALDQVLASLPAQDM-----QNQADSLVAPLLDVISRGGSEAELLGALAEAFPDMDDSALADALHRLLFVADTWGR 519 (528) T ss_pred ccccchHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHhhh Confidence 0001111111111111100 011223344555555555555544444444442 222211 12222 Q ss_pred hhhhcccCC Q lcl|NC_021305. 
510 LALAERKDN 518 (518) Q Consensus 510 ~~~~~~~~~ 518 (518) .+..+--+. T Consensus 520 ~~~~~e~~~ 528 (528) T protein:vir:10 520 LNGTLDRID 528 (528) T ss_pred hhccccccC Confidence 222111111 No 116 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=99.88 E-value=9e-21 Score=130.47 Aligned_cols=472 Identities=13% Similarity=0.115 Sum_probs=266.9 Q ss_pred CcCC-CCCCCC-----cccccccchhhhhhhcccccccccccc-----------cc----hhhhHHHhhcHHHHHHHHHH Q lcl|NC_021305. 1 MLLA-NGQTLS-----APAMAELSPQMQDSYYYAPAVGMQLER-----------QF----SLYGGIYKNQPWVRTVIAKR 59 (518) Q Consensus 1 ~~f~-~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~----~~~~~~~~~~~~v~~~v~~i 59 (518) -++. .|++.. +++...+. +++..+...+..|..... +. .++.++..+.+.|.+|+..+ T Consensus 3 ~~~d~~g~p~~~~~~~~~~~~~~~-~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~L~edm~e~D~~i~s~l~~R 81 (526) T protein:vir:79 3 QIVDVYGNPIRPQQLREPQTSRLA-GLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMSKR 81 (526) T ss_pred eeeCCCCCccCccccchhhhhhhh-hhhhhcccCCCCCcCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHHHHH Confidence 1222 233221 22221111 123333333333321111 11 23333334789999999999 Q ss_pred HHhhccCceEEEEecCCccee--ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCC---ceEEEEee Q lcl|NC_021305. 60 AQALARLPVKCMFTSGDTETE--ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSG---TPEKLMPM 134 (518) Q Consensus 60 a~~ia~l~~~v~~~~~~~~~~--~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G---~~~~l~~l 134 (518) ...|.+++|.|....++.... ..+.....|...| ++.+++..++. .+.+|.+++++++...| .+..+.+. 
T Consensus 82 k~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~----~~~~~i~~~ld-A~~~G~s~~Ei~w~~~~g~~~~~~l~~r 156 (526) T protein:vir:79 82 KRAILGLDWAVEPPRNASAAEKADADYLHELLLDLE----GLEDLLLDALD-GIGHGYSCIELEWALQGREWMPLAFHHR 156 (526) T ss_pred HHHHhCCCceEecCCCCChHHHHHHHHHHHHHhccc----CHHHHHHHHHh-hhhhcceeEEEEEeecCCceeEEEeeee Confidence 999999999997544332221 1222222333333 47777777554 77899999999987653 36678888 Q ss_pred CCceeEEEEcCCceeeEEeeecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHcc Q lcl|NC_021305. 135 HPSRVAIKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNA 214 (518) Q Consensus 135 ~p~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng 214 (518) ++.++.+..+... .+.+ ... ......+++...+..++....+..+|.+.+..+...........++...|.+.. T Consensus 157 ~~~~F~~~~~~~~-~l~~--~~~---~~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~y 230 (526) T protein:vir:79 157 PQSWFQLNPEDQN-ELRL--RDN---SPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIY 230 (526) T ss_pred cccceEeccCCCc-EEEe--cCC---CCCceeecCCceEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHc Confidence 9888776554432 2221 111 122346777777777777777888999999999999999999999999999999 Q ss_pred CCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccC-ChhhHHHHHHHHHHHHHHHHHhcCCHH Q lcl|NC_021305. 215 GRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQL-TAVEMQFIEARQLNREEVCGVYDIAPP 293 (518) Q Consensus 215 ~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~-~~~d~~~~e~~~~~~~~Ia~~fgVPp~ 293 (518) |.|--+.+++...++++++++.+.+.+..+ ...+|++.|++++-+.. +.....|.++.++..++|+.+. +-.. 
T Consensus 231 G~P~~igky~~~a~~~ek~~L~~av~~i~~-----da~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LGqt 304 (526) T protein:vir:79 231 GLPIRLGKYPPGTADEEKATLLRAVTGLGH-----AAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV-LGGT 304 (526) T ss_pred CCceEEEecCCCCCHHHHHHHHHHHHHHhc-----CcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH-hhhh Confidence 999999999988899999888887776532 23577888887766653 2333458888999999998875 2222 Q ss_pred Hhccc---cccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhccc---------ccceecchhhhhcCHHHHHHH Q lcl|NC_021305. 294 IVHIL---DRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRK---------NRMKFDIDDVIQPDWEAKSES 361 (518) Q Consensus 294 ~lg~~---~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~---------~~~~fd~~~l~~~d~~~~~~~ 361 (518) +-... ..++++.. +.........+.-.+..|+..||+.|+...-.- .+.+|.++.....|.+++++. T Consensus 305 lTs~~~~g~~gS~a~g-~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~ 383 (526) T protein:vir:79 305 LTSTTSQSGGGAFALG-QVHNEVRHDILASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQS 383 (526) T ss_pred hccccccCcchhhhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHH Confidence 22211 11233332 333555677788899999999998886543211 133455556678899999999 Q ss_pred HHHHHhCCC-cCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCcccc Q lcl|NC_021305. 362 TQKMVNSGV-ATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGL 440 (518) Q Consensus 362 ~~~~~~~G~-~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (518) +.+++..|+ ++..++|+.+|+|.-. .++.++.|.. ++.+.+..++..... ... .... 
T Consensus 384 ~~~L~~~G~~i~~~~i~e~~gip~~~--~~e~~l~~~~--------------~~~~~~~~~~~~~~~---~~~---~~~~ 441 (526) T protein:vir:79 384 IPALVNVGLEIPSAWVYDKLGIPQPA--KNEPVLRPAA--------------QPAILSRQHGQRVAA---LAT---IVGP 441 (526) T ss_pred HHHHHhCCCcCCHHHHHHHhCCCCCC--CchhhccccC--------------Ccccccccccccccc---ccc---cccc Confidence 999999997 8999999999996432 2333321111 000000000000000 000 0000 Q ss_pred ccchhcchhhHHHHHHHHhhcccCCchhhHHHHHHHHHhhccccCcCchhHHHHHHHH----HHHhH----HH----Hhh Q lcl|NC_021305. 441 SPTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAVKGAMGRGKDIKGFALQLAEKYP----DDLED----IL----LAV 508 (518) Q Consensus 441 ~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~----~~----~~~ 508 (518) .....+..++.......+. ..+......+.|.+++..+++.+...-++++-|. ++|+. .. ++| T Consensus 442 ~~~~~~~~d~~l~~~~~~~-----~~~~~~~~~~~i~~~~~~~~s~ee~~~~L~~l~~~ld~~~l~~~l~~a~~~A~l~G 516 (526) T protein:vir:79 442 RYGDQQALDKALADLPAKD-----MQNQANDLLAPLLDAVNRGDSETELLGALAEAFPDMDDSALTDALHRLLFAADTWG 516 (526) T ss_pred cCchhhHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHhh Confidence 0000111111000000000 0122334455666666666666655555444442 22222 11 222 Q ss_pred hhhhhcccCC Q lcl|NC_021305. 509 QLALAERKDN 518 (518) Q Consensus 509 ~~~~~~~~~~ 518 (518) ..+....-+. T Consensus 517 r~~~~~e~~~ 526 (526) T protein:vir:79 517 RLHGNLDRID 526 (526) T ss_pred hhhhhhcccC Confidence 2222111111 No 117 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.88 E-value=9.2e-24 Score=146.89 Aligned_cols=420 Identities=10% Similarity=0.019 Sum_probs=239.2 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccC-ceEEEEec--CCc Q lcl|NC_021305. 
1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARL-PVKCMFTS--GDT 77 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l-~~~v~~~~--~~~ 77 (518) +.+..-.... ..+ ...|.....+................++++.++|.+..+|+.+.+.+-.. .+.+.-+- .+. T Consensus 24 ~~~~~y~aa~-~~r--~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG~ggi~~~~~~~~~~~ 100 (502) T protein:vir:79 24 AVIQAYEAVK-TTR--THKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLEERVVGKNGIIVEPHPVLRNG 100 (502) T ss_pred HHHhhccccC-ccc--ccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhccCCceeeeeccCCCCh Confidence 1111100000 000 11111110000001111112234456788999999999999888888754 44432211 111 Q ss_pred --ceeccc---hHHHHHHhcC--CcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCC-------ceEEEEeeCCceeE--- Q lcl|NC_021305. 78 --ETEESD---TGYAKLLADP--CEYLDPFAFWEWVASTLDIYGETYLAIQKNKSG-------TPEKLMPMHPSRVA--- 140 (518) Q Consensus 78 --~~~~~~---~~~~~L~~~P--N~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G-------~~~~l~~l~p~~v~--- 140 (518) .++... .....+...+ +..++++.+...++..++..|++|+.+++...+ .+..|..|+|+++. T Consensus 101 ~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~~g~~~~l~lq~iepd~l~~~~ 180 (502) T protein:vir:79 101 AIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLTPSAGVHFWLEALEPDFIPMTS 180 (502) T ss_pred hHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCccCCCcccceEEEEecchhcCCCC Confidence 111111 1122222222 235789999999999999999999998765432 36789999998886 Q ss_pred ---------EEEcCCceeeEEeeecccc---cCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 141 ---------IKRNSRTGRYEYYFQAGAG---VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATA 208 (518) Q Consensus 141 ---------v~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~ 208 (518) |+.+..|..+.|.+....+ .....+.+++++|+|+......+..+|+|.+..++..+.......+... 
T Consensus 181 ~~~~~i~~GVe~d~~Gr~~aY~i~~~hPgd~~~~~~~rvpA~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~~~dael 260 (502) T protein:vir:79 181 DESNRLNQGVFVDDWGRPEKYLVYKSRPVSGRQMETKEVDAERMLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYEDSEL 260 (502) T ss_pred CCCCeeEeeeEECCCCceEEEEEeecCCCCCcccceeEechhheEEeecccCCccccCCchHHHHHHHHHHHhHHHHHHH Confidence 5566677777777653322 2344578999999999998888899999999999999988888888887 Q ss_pred HHHHccCCcccccccCccC--CHHHHHHHHHHHHHHhcCccccCCee-ecCCCcceeeccCChhhHHHHHHHHHHHHHHH Q lcl|NC_021305. 209 AMWKNAGRPNLVLRHEKRL--SEAAQQRLREQFDRAHSGSSNTGKTM-VVEEGMEPIPLQLTAVEMQFIEARQLNREEVC 285 (518) Q Consensus 209 ~~~~ng~~p~~il~~~~~~--~~~~~~~~~~~~~~~~~g~~n~g~~~-vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia 285 (518) ...+-.+...++|+.+..- ..+.... .-..... .-..|.++ +|..|.+++.+..+.....|.++.+...+.|| T Consensus 261 ~~a~i~A~~~~fi~~~~~~~~~~~~~~~---~~~~~~~-~l~pG~i~~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~ia 336 (502) T protein:vir:79 261 TAARIAAALGMYIRKGDGQSYEPDGNGS---KENEREL-TIQPGIIYDDLKPGEEIGMVKSDRPNPNLETFRNGQLRAVA 336 (502) T ss_pred HHHHHhhhheeeeecCCCcccccccCCC---CCccccc-cccCCccccccCCCceeeeeCCCCCCCCHHHHHHHHHHHHH Confidence 7777778888888764321 1100000 0000000 11345544 58999999888877667789999999999999 Q ss_pred HHhcCCHHH-hccccccccCCHHHHHHHHHHH-----------HhhHHHHH-HHHHHHHhhhhh--h-ccccc--ceecc Q lcl|NC_021305. 286 GVYDIAPPI-VHILDRATFSNISAQMRAFYRD-----------TMAIPIAR-IQSAMDKYVGQY--W-VRKNR--MKFDI 347 (518) Q Consensus 286 ~~fgVPp~~-lg~~~~~~~sn~e~~~~~~~~~-----------~l~P~~~~-ie~~l~~~l~~~--~-~~~~~--~~fd~ 347 (518) +.+|||-+. .|+. ++|||+.......|... .++|+... ++.++-.-.++. . .+..+ ++|-. 
T Consensus 337 aglGi~ye~lt~D~-s~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~ 415 (502) T protein:vir:79 337 AGSRLSFSSTARNY-NGTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSG 415 (502) T ss_pred hhcCCCHHHHhccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeec Confidence 999999665 4554 45899887666555443 33333332 222222222211 0 11112 23333 Q ss_pred hhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeec------ccccccccccccCCCCCCCCCCCCC Q lcl|NC_021305. 348 DDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYAN------SALQPLGATPDGAVEWEEAPAPKRP 421 (518) Q Consensus 348 ~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~------~n~~~~~~~~~~~~~~~~~~~~~~~ 421 (518) -.....|+.+.+++...++.+|+.|+-++-++.|.++-+ .-+++..= .++ +++..+...... T Consensus 416 p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a~~G~D~~~--v~~q~a~e~~~~~~~Gl-~~~~~~~~~~~~--------- 483 (502) T protein:vir:79 416 PVMPWIDPVKEAEAWKIQIRGGAATESDWVRAGGRNPDD--VKRRRKAEIDENRKLDL-VFDTDPASDKGG--------- 483 (502) T ss_pred CCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHH--HHHHHHHHHHHHHHcCC-CCCCCCCCCCCC--------- Confidence 455678999999999999999999999999999988743 22221100 000 111100000000 Q ss_pred ccCCCCCCCccccCCcccccc Q lcl|NC_021305. 422 ASTPVASLDQSPPTSVPGLSP 442 (518) Q Consensus 422 ~~~~~~~~~~~~~~~~~~~~~ 442 (518) .+...++ ++++++....+. T Consensus 484 -~~~~~~~-~e~~~~~~~~e~ 502 (502) T protein:vir:79 484 -SSAATKR-QEPQHTDDQSEE 502 (502) T ss_pred -CCCCCCC-CCCCCCCCCCCC Confidence 0000000 000000000000 No 118 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.88 E-value=2e-22 Score=139.57 Aligned_cols=376 Identities=10% Similarity=0.060 Sum_probs=213.9 Q ss_pred ccccchhhhhhhccc---ccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHH Q lcl|NC_021305. 
14 MAELSPQMQDSYYYA---PAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETEESDTGYAKLL 90 (518) Q Consensus 14 ~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~~~~~~~~~L~ 90 (518) ....+.......|+. ...+.+....+.....+|..++.++++|+.+|+.+-+-.|.+-.. +. +......+.+| T Consensus 1 ~~~~D~~~n~~~gg~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~~~--~~-~~~~~~~~~~l- 76 (422) T protein:vir:10 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDGI--DD-EPAFWSRWDDL- 76 (422) T ss_pred CccchhhHHHHcCCCCCccccCcccccCHHHHHHHHHhChhhHHHHhhhhHHHhcCCccccCC--CH-HHHHHHHHHHh- Confidence 111111111111111 111222233344556789999999999999999999999987322 11 11111111222 Q ss_pred hcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEE-c---------CCCceEEEEeeCCceeEEEEcC-------CceeeEEe Q lcl|NC_021305. 91 ADPCEYLDPFAFWEWVASTLDIYGETYLAIQK-N---------KSGTPEKLMPMHPSRVAIKRNS-------RTGRYEYY 153 (518) Q Consensus 91 ~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r-~---------~~G~~~~l~~l~p~~v~v~~~~-------~~~~~~~~ 153 (518) ...+-+...+....++|.+++++.. + ..|.+..+.++++..|++.... .+.+..|. T Consensus 77 -------~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~~dp~s~~fg~P~~y~ 149 (422) T protein:vir:10 77 -------EMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQTREENPRNARFGEPLTYR 149 (422) T ss_pred -------hHHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccchhcccCccccccCcceEEE Confidence 2234445556666688988888764 2 2456778889998888754321 12333333 Q ss_pred eecccccCceeEEeccccEEEEeccC------CCCcccCchHHHH-HHHHHHHHHHHHHHHHHHHHccCCcccccccCc- Q lcl|NC_021305. 154 FQAGAGVGTQLVSFADDEVVPIRFFN------PDGLERGLSLMES-LKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK- 225 (518) Q Consensus 154 ~~~~~~~~~~~~~~~~~evih~~~~~------~~~~~~G~s~l~~-~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~- 225 (518) +. ...++....+.++.||||.... +....+|.|++.. +++.|.....+.......+....... 
+++++ T Consensus 150 v~--~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~~~~~v--~~~~~l 225 (422) T protein:vir:10 150 IT--TNESDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKRKQQAV--WKAKGL 225 (422) T ss_pred Ee--cCCCCcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc--ccchhH Confidence 32 2233445678999999996442 3344579999986 67989888888888888766654333 33332 Q ss_pred --cCC-HHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHh-cccccc Q lcl|NC_021305. 226 --RLS-EAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIV-HILDRA 301 (518) Q Consensus 226 --~~~-~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~ 301 (518) .++ .......++++........+.+.+++..++.+|++++.+..+. .+.......+||++.+||...| |...++ T Consensus 226 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsgl--~~~~~~~~~~iaaa~~IP~t~L~G~s~~G 303 (422) T protein:vir:10 226 AELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGGI--DAFLDKKFDRIVALSGIHEIILKNKNVGG 303 (422) T ss_pred HHhcCCccchHHHHHHHHHHHHhcCCccceeEecCCcceEEEecccCCh--HHHHHHHHHHHHhhhCCCeeeeccCCccc Confidence 122 2233344444444333333445566667788999998888764 5888999999999999998754 555554 Q ss_pred ccCCHHHHHHHHHHHH-------hhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHH-------HHHHHHHh Q lcl|NC_021305. 302 TFSNISAQMRAFYRDT-------MAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKS-------ESTQKMVN 367 (518) Q Consensus 302 ~~sn~e~~~~~~~~~~-------l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~-------~~~~~~~~ 367 (518) -+++.+...+.||..+ +.|.++.+-..| . 
....+.|.+.+|...+.++++ +++.++++ T Consensus 304 lnatgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~~i----~----~s~~~~~~f~pL~~~sekekaei~~~~a~a~~~~~~ 375 (422) T protein:vir:10 304 VSSSQNTALETFHKLVDRKRNAELLPILEFLIPFI----V----NAEEWSVEFNPLAQESSKDKAEILEKNVNSIAALIA 375 (422) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----c----ccCCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHh Confidence 4566677777777743 455554443222 1 223456666788887777654 55677899 Q ss_pred CCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCC Q lcl|NC_021305. 368 SGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVAS 428 (518) Q Consensus 368 ~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (518) +|+++++|+|+.+--.... .+ +..+..+.+.. .......|.+.|..+ T Consensus 376 ~g~i~~~e~r~~L~~~~~~-~~-----~~~~~~~~~~~--------~~~~~~~~~~~~~~d 422 (422) T protein:vir:10 376 AGAMDIDEARDTLRTIAPE-VK-----INDGSVETEVT--------ISETSNDPLEVPTDD 422 (422) T ss_pred cCCCCHHHHHHHhhhhccc-cc-----CCCCCCccccc--------hhhcCCCCCCCCCCC Confidence 9999999999988322111 11 01111111100 000000000111000 No 119 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.88 E-value=2e-23 Score=144.97 Aligned_cols=430 Identities=10% Similarity=0.020 Sum_probs=243.3 Q ss_pred CcCCCCCCCC--cccccccchhhh-hh-----------hcccccc-------cccccccchhhhHHHhhcHHHHHHHHHH Q lcl|NC_021305. 1 MLLANGQTLS--APAMAELSPQMQ-DS-----------YYYAPAV-------GMQLERQFSLYGGIYKNQPWVRTVIAKR 59 (518) Q Consensus 1 ~~f~~~~~~~--~~~~~~~~~~~~-~~-----------~~~~~~~-------~~~~~~~~~~~~~~~~~~~~v~~~v~~i 59 (518) |+=..++... ++.+........ .. -+|.+.. 
...........++++.+++.+..+|+.+ T Consensus 1 m~~~~~r~~~~~a~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~~ 80 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGYQ 80 (553) T ss_pred CcchhhhhhcccccccchhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 3322222211 111100000000 00 0111100 0111112445677899999999999999 Q ss_pred HHhhccCceEEEEecCC-----cceecc-------chHHHHHHhcCC------cCCCHHHHHHHHHHHHHHcCCeEEEEE Q lcl|NC_021305. 60 AQALARLPVKCMFTSGD-----TETEES-------DTGYAKLLADPC------EYLDPFAFWEWVASTLDIYGETYLAIQ 121 (518) Q Consensus 60 a~~ia~l~~~v~~~~~~-----~~~~~~-------~~~~~~L~~~PN------~~~s~~~f~~~~v~~ll~~G~~~~~i~ 121 (518) ...+-...|++.-.-+. ...+.. ......+...++ ..++++.+...++..++..|++|+.+. T Consensus 81 ~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~ 160 (553) T protein:vir:63 81 RDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLATAE 160 (553) T ss_pred HHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEEee Confidence 99888778877533110 011111 112233344443 346899999999999999999999887 Q ss_pred EcCC-C--ceEEEEeeCCceeE--------------EEEcCCceeeEEeeecccccC--------------ceeEEeccc Q lcl|NC_021305. 122 KNKS-G--TPEKLMPMHPSRVA--------------IKRNSRTGRYEYYFQAGAGVG--------------TQLVSFADD 170 (518) Q Consensus 122 r~~~-G--~~~~l~~l~p~~v~--------------v~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~ 170 (518) +... | .+..|..|+|+++. |+.+..|..+.|.+....+.. .....+++. 
T Consensus 161 ~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~~~~~~~r~~~~~~v~a~ 240 (553) T protein:vir:63 161 WDRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMAPDMYKWKFVQQSKPWGRR 240 (553) T ss_pred eccCCCCcccceEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEeeccCCCccccccccccceeeeccccccChh Confidence 6543 2 24678899998875 444555666677664332211 023357899 Q ss_pred cEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHH-------------- Q lcl|NC_021305. 171 EVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLR-------------- 236 (518) Q Consensus 171 evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~-------------- 236 (518) +|||+......+..+|+|.+..++..+.......+......+-.+...++|+.+..- ....+.+. T Consensus 241 ~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 319 (553) T protein:vir:63 241 QVIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELPP-EFIHSQMSGGSPNADMVGIFGK 319 (553) T ss_pred HheecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCh-hhhhhhcccccccccccccccc Confidence 999999988888999999999999999988888888887777788888888765321 11111111 Q ss_pred --HHHHHHhcC----ccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHH-HhccccccccCCHHHH Q lcl|NC_021305. 237 --EQFDRAHSG----SSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPP-IVHILDRATFSNISAQ 309 (518) Q Consensus 237 --~~~~~~~~g----~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~sn~e~~ 309 (518) +.....+.+ .-+.|.|..|..|.+++.+..+-....|.++.+...+.||+.+|||-+ +.|+.++.|||+..+. 
T Consensus 320 ~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R~~ 399 (553) T protein:vir:63 320 YMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANYSSIQAG 399 (553) T ss_pred cccccccccccccceeecCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHH Confidence 011111111 124577888999999988887767778999999999999999999976 5577788899998766 Q ss_pred HHHHHHH-----------HhhHHHHH-HHHHHHHhhhh--hhc----------cccc--ceecchhhhhcCHHHHHHHHH Q lcl|NC_021305. 310 MRAFYRD-----------TMAIPIAR-IQSAMDKYVGQ--YWV----------RKNR--MKFDIDDVIQPDWEAKSESTQ 363 (518) Q Consensus 310 ~~~~~~~-----------~l~P~~~~-ie~~l~~~l~~--~~~----------~~~~--~~fd~~~l~~~d~~~~~~~~~ 363 (518) ...|... .++|+... ++.++-.-.++ ... +..+ +++-.-.....|+.+.+++.. T Consensus 400 ~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~ 479 (553) T protein:vir:63 400 IAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAV 479 (553) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHHHHHH Confidence 5554443 33443333 22222211111 000 0011 223334456779999999999 Q ss_pred HHHhCCCcCHHHHHHHhCCCCCCCCCcceeee------cccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCc Q lcl|NC_021305. 364 KMVNSGVATPNEGREIMGLPRSDDPKADELYA------NSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSV 437 (518) Q Consensus 364 ~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~------~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (518) .++.+|+.|+-++-.+.|.++-+ --+++.. -.++ +.+..+....... ....+..++..+ ..+.. T Consensus 480 ~~i~~G~~t~~~~~a~~G~D~~~--v~~q~a~e~~~~~~~Gl-~~~~~~~~~~~~~------~~~~~~~~~~~~-~~~~~ 549 (553) T protein:vir:63 480 MRIDAGLSTYEREIARLGGDFRK--SFAQRAREDALLKKYGL-TFNLSAKRSLGDG------RDAATGIAEDPA-AAQTS 549 (553) T ss_pred HHHHcCCCCHHHHHHHhCCCHHH--HHHHHHHHHHHHHHcCC-CCCCCCccccCCC------cccCCCCCCCCC-CCCcc Confidence 99999999999999888987643 1111110 0000 1111111000000 000000000000 00000 Q ss_pred cccc Q lcl|NC_021305. 
438 PGLS 441 (518) Q Consensus 438 ~~~~ 441 (518) .+.| T Consensus 550 ~~~e 553 (553) T protein:vir:63 550 QQGE 553 (553) T ss_pred cccC Confidence 0000 No 120 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.87 E-value=3.2e-22 Score=138.42 Aligned_cols=385 Identities=10% Similarity=0.061 Sum_probs=213.7 Q ss_pred CcCCCCCCCCcccccccchhhhhhhccccccc-----ccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVG-----MQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSG 75 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~ 75 (518) =||-.++..+....- -+...|....... ......+.....+|..++.++.+|+.+|+.+-+..|++.... T Consensus 2 ~~~m~~~~~~~~~~D----~~~~~~~~~~g~~~~~~~~~~~~~~~~l~~~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~- 76 (435) T protein:vir:79 2 GVFMSDKVKAITKED----GYNEIFGSKDGTFRPNAFYMQRAAFKALSQFYEEDGMARRIVDVIPEEMVTPGFKVDGVK- 76 (435) T ss_pred Ccccccccccchhhc----chhhhhcccccccccCcccCCcCCHHHHHHHHhcCchhhhhhccchHHhhcCCceecCCC- Confidence 366555533322221 2222222211110 111112345567789999999999999999999998873211 Q ss_pred CcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEE-c---------CCCceEEEEeeCCceeEEEEcC Q lcl|NC_021305. 76 DTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQK-N---------KSGTPEKLMPMHPSRVAIKRNS 145 (518) Q Consensus 76 ~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r-~---------~~G~~~~l~~l~p~~v~v~~~~ 145 (518) ... .+...+.+ . ...+-+...+....++|.+++++.. + ..|.+..+.++++..|++.... 
T Consensus 77 --~~~----~~~~~~~~---l-~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~~~i~~~~~~ 146 (435) T protein:vir:79 77 --NEK----SFKSRWDE---L-RLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDRYQITIHERE 146 (435) T ss_pred --hHH----HHHHHHHH---h-hHHHHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeechhhccchhhc Confidence 111 11111121 1 2223444455566678888777764 2 2345668888888877653321 Q ss_pred -------CceeeEEeeecccccCceeEEeccccEEEEecc------CCCCcccCchHH-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 146 -------RTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFF------NPDGLERGLSLM-ESLKSTIFSEDSSRNATAAMW 211 (518) Q Consensus 146 -------~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~------~~~~~~~G~s~l-~~~~~~i~~~~~~~~~~~~~~ 211 (518) .+.+..|.+. ...+.....+.++.||||... .+.+..+|.|++ +.+++.+.....+.......+ T Consensus 147 ~dp~sp~fg~P~~y~v~--~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~ 224 (435) T protein:vir:79 147 TNARSVRYGEPKLYKIS--PGGDIPEFFVHYSRICIIDGERVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQELATQLL 224 (435) T ss_pred cCCcccccCcceEEEEe--cCCCCCceEEcceeEEEecCCcchhhhccccCcccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1223333332 222334567899999999643 234456799998 688898988888888887776 Q ss_pred HccCCcccccccCc---cCC-HHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 212 KNAGRPNLVLRHEK---RLS-EAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGV 287 (518) Q Consensus 212 ~ng~~p~~il~~~~---~~~-~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~ 287 (518) ....... +++++ .++ +.....+++++........+.+.+++..++.+|+.++.+..+. 
.+.......+||++ T Consensus 225 ~~~~~~v--~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~i~~~~e~~e~~~~~lsgl--~~~~~~~~~~iaaa 300 (435) T protein:vir:79 225 RRKQQAV--WKARDLALMCDDEEGRYAARLRLAQVDDESGVGKAIGIDATDEEYEVLNSDVSGV--PEFLQEKIDRIVAL 300 (435) T ss_pred HHhcCcc--ccchhHHHhhcCccchHHHHHHHHHHHHhcCCCCceeEecCCcceEEEecccCCH--HHHHHHHHHHHHhh Confidence 5544332 23322 121 2223333333333222223445566666677899888887764 58888899999999 Q ss_pred hcCCHHH-hccccccccCCHHHHHHHHHHHH-------hhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHH Q lcl|NC_021305. 288 YDIAPPI-VHILDRATFSNISAQMRAFYRDT-------MAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKS 359 (518) Q Consensus 288 fgVPp~~-lg~~~~~~~sn~e~~~~~~~~~~-------l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~ 359 (518) .+||... +|...++-.++.+.....||..+ +.|.+..+-.. +. ....+.|.+++|...|.++++ T Consensus 301 ~~IP~t~L~G~s~~glnstgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~l----i~----~s~d~~~~f~pL~~~sekEkA 372 (435) T protein:vir:79 301 TGIHEIIIKNKNTGGVSASQNTALETFYKLIDRKRVEDYKPILEFLLPF----MI----SETEWSIEFEPLSVPSDKDKA 372 (435) T ss_pred hCCCeeeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hh----cCCCCeEEeCCCCCCCHHHHH Confidence 9999865 56555555566677777777754 44444443222 21 223456677788888876654 Q ss_pred -------HHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCc Q lcl|NC_021305. 360 -------ESTQKMVNSGVATPNEGREIM-GLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQ 431 (518) Q Consensus 360 -------~~~~~~~~~G~~T~NE~R~~~-g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (518) +++.+++++|+++++|+|+++ ...+.-.-.++.. 
..+ +.++...+...+.+++++ T Consensus 373 ei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~------~~~-----------~~~~d~~~~~~~e~g~~~ 435 (435) T protein:vir:79 373 EIMAKNVESVVKLKAEQAINLKETRDTLRSICPDLKIMDNDN------IEL-----------PEPEDLDPEPGQEGGLNK 435 (435) T ss_pred HHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhccccCCCCccc------ccC-----------CccccCCCCCCCCCCCCC Confidence 456678999999999999977 2111110001000 000 000111111111122222 No 121 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.87 E-value=1.7e-23 Score=145.40 Aligned_cols=419 Identities=10% Similarity=0.017 Sum_probs=244.5 Q ss_pred CCCCCCCCc-------cc--------------------ccccchhhhh-hh-cccccccccccccchhhhHHHhhcHHHH Q lcl|NC_021305. 3 LANGQTLSA-------PA--------------------MAELSPQMQD-SY-YYAPAVGMQLERQFSLYGGIYKNQPWVR 53 (518) Q Consensus 3 f~~~~~~~~-------~~--------------------~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 53 (518) -++-+..+. ++ .+..+.|... +. +................++++.+++.+. T Consensus 1 ~~r~~~~~~~~dr~i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~ 80 (505) T protein:vir:96 1 MKRAEKKPSLAQRMVNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYAK 80 (505) T ss_pred CCCCccccchhhcccchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHH Confidence 011000000 00 0000111000 00 0000000011112456678899999999 Q ss_pred HHHHHHHHhhcc-CceEEEEecCC----cceecc---chHHHHHHhcCCc----CCCHHHHHHHHHHHHHHcCCeEEEEE Q lcl|NC_021305. 54 TVIAKRAQALAR-LPVKCMFTSGD----TETEES---DTGYAKLLADPCE----YLDPFAFWEWVASTLDIYGETYLAIQ 121 (518) Q Consensus 54 ~~v~~ia~~ia~-l~~~v~~~~~~----~~~~~~---~~~~~~L~~~PN~----~~s~~~f~~~~v~~ll~~G~~~~~i~ 121 (518) .+|+.+.+.+-. ..+.+.-.-.. -.++.. ......+..++|+ .++++++...++..++..|++|+.+. 
T Consensus 81 ~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~ 160 (505) T protein:vir:96 81 RFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVLVREH 160 (505) T ss_pred HHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceEEEEe Confidence 999988888875 56666433211 111111 1222334445554 46799999999999999999999887 Q ss_pred EcCCC-ceEEEEeeCCceeE----------------EEEcCCceeeEEeeecccc---------cCceeEEeccccEEEE Q lcl|NC_021305. 122 KNKSG-TPEKLMPMHPSRVA----------------IKRNSRTGRYEYYFQAGAG---------VGTQLVSFADDEVVPI 175 (518) Q Consensus 122 r~~~G-~~~~l~~l~p~~v~----------------v~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~evih~ 175 (518) +...+ .+..|.+|+|+++. |+.+..|..+.|.+....+ .......+++++|+|+ T Consensus 161 ~~~~~~~~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~hPgd~~~~~~~~~~~~~rvpa~~vlH~ 240 (505) T protein:vir:96 161 RGYPNKWGYALQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHLLVNHPGDNSYCYHYAGQTYERVPADEIIHT 240 (505) T ss_pred ecCCCCcceEEEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEEeecCCCccccccccccccccccCHhHhhhh Confidence 65443 46688999988874 3344455666776643221 1123456889999999 Q ss_pred eccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccC-CHHHHHHHHHHHHHHhcCccccCCeee Q lcl|NC_021305. 176 RFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRL-SEAAQQRLREQFDRAHSGSSNTGKTMV 254 (518) Q Consensus 176 ~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~-~~~~~~~~~~~~~~~~~g~~n~g~~~v 254 (518) ......+..+|+|.+..++..+.......+....-.+-.+...++|+.+... .+...+. ..... -.-..|.+.. 
T Consensus 241 f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~----~~~~~-~~l~pG~i~~ 315 (505) T protein:vir:96 241 FVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQPPEDD----QGEIV-EEVEAGTYQL 315 (505) T ss_pred hcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCccccc----cCccc-cccCCceeee Confidence 9988888999999999999999988888888887777788888888765331 1111000 00011 1124577888 Q ss_pred cCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHH-hccccccccCCHHHHHHHHHH-----------HHhhHHH Q lcl|NC_021305. 255 VEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPI-VHILDRATFSNISAQMRAFYR-----------DTMAIPI 322 (518) Q Consensus 255 l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~~~sn~e~~~~~~~~-----------~~l~P~~ 322 (518) |..|.+++.+..+-....|.++.+...+.||+.+|||-+. .|+.++.|||+..+....+.. ..++|+. T Consensus 316 L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~ 395 (505) T protein:vir:96 316 LPYGIRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNFSSLRSGELDERDLYKLLQFFVVTELLERVA 395 (505) T ss_pred cCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999998887777889999999999999999999764 577778899988766655444 3344444 Q ss_pred HH-HHHHHHHhhhhh--hccccc--ceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecc Q lcl|NC_021305. 323 AR-IQSAMDKYVGQY--WVRKNR--MKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANS 397 (518) Q Consensus 323 ~~-ie~~l~~~l~~~--~~~~~~--~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~ 397 (518) .. ++.++..-.++. .....+ +.+-.-.....|+.+.+++...++.+|+.|+-++-.+.|.++-+ .-+++..=. 
T Consensus 396 ~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~G~D~~~--v~~q~a~e~ 473 (505) T protein:vir:96 396 GNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNRTRSRSSIIRAAGDDPED--VFDEIAWEE 473 (505) T ss_pred HHHHHHHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHH--HHHHHHHHH Confidence 43 333332222221 111112 33434455777999999999999999999999998889988743 222211000 Q ss_pred cc-cccccccccCCCCCCCCCCCCCccCCCCCCCccccCC Q lcl|NC_021305. 398 AL-QPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTS 436 (518) Q Consensus 398 n~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 436 (518) .. ..++ .....+...+ ...+.+++++++++. T Consensus 474 ~~~~~~G-----l~~~~~~~~~---~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 474 QLMRDKG-----VNPTPPEQES---KDATTDEEDDSASDD 505 (505) T ss_pred HHHHHcC-----CCCCCCCCCC---CCCCCCCCCCCCCCC Confidence 00 0000 0000000000 000111111111111 No 122 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.87 E-value=8.9e-23 Score=141.47 Aligned_cols=495 Identities=11% Similarity=0.053 Sum_probs=227.9 Q ss_pred CcCCCCCCC-Ccc--ccccc--chhhhhhhccccc----cc----------ccccccchhhhHHHhhcHHHHHHHHHHHH Q lcl|NC_021305. 1 MLLANGQTL-SAP--AMAEL--SPQMQDSYYYAPA----VG----------MQLERQFSLYGGIYKNQPWVRTVIAKRAQ 61 (518) Q Consensus 1 ~~f~~~~~~-~~~--~~~~~--~~~~~~~~~~~~~----~~----------~~~~~~~~~~~~~~~~~~~v~~~v~~ia~ 61 (518) -.+.-..-+ |.. +.-+. +.+. +.+..... .. 
.+.-.+ .....+|..++.++.+|+.+|+ T Consensus 55 ~~~~~~~~~~~~~~~a~ds~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~f~g-yql~alY~~~~l~rkiVd~pAe 132 (765) T protein:vir:96 55 VIRSVKDFLEPGLSVAMDSAYGDGPT-PAAKAAAGGQNPYVVPTMLQDWYNSQGFIG-YQACAIISQHWLVDKACSMSGE 132 (765) T ss_pred CCCCCCcccCcccceecccccccccc-chHHHhhhccCccchhhHHHhhhcccCCcc-HHHHHHHHhCchhhhhhhcchH Confidence 111100000 000 00000 0110 01110000 00 000011 1234579999999999999999 Q ss_pred hhccCceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEc-CC--------------- Q lcl|NC_021305. 62 ALARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKN-KS--------------- 125 (518) Q Consensus 62 ~ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~-~~--------------- 125 (518) .+-+-.|.|.-.+++ ..+.....|...-... ...+-+...+.+.-++|.+|+++.-+ .+ T Consensus 133 Da~R~g~~I~~~~~e----~~~~~~~~l~~~~~rl-~v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l~~PL~~~~I~k 207 (765) T protein:vir:96 133 DAARNGWELKSDGRK----LSDEQSALIARRDMEF-RVKDNLVELNRFKNVFGVRIALFVVESDDPDYYEKPFNPDGIAP 207 (765) T ss_pred HhhcCCceeecCccc----cCHHHHHHHHHHHHHh-hHHHHHHHHHHHhhhceeeEEEEEecccCcchhhcccccccccc Confidence 999988888432211 1111111221111111 23444455566666788888776532 11 Q ss_pred CceEEEEeeCCceeEEEEc----CC-ceeeEEeeecccccCceeEEeccccEEEEeccCC------CCcccCchHHHHHH Q lcl|NC_021305. 126 GTPEKLMPMHPSRVAIKRN----SR-TGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNP------DGLERGLSLMESLK 194 (518) Q Consensus 126 G~~~~l~~l~p~~v~v~~~----~~-~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~------~~~~~G~s~l~~~~ 194 (518) |.+..|..++|..+.+... .+ ....++...... ..+ ..+.++.||||..... 
....+|.|.++.++ T Consensus 208 g~~kgl~vldp~~~~~~~v~e~~~Dp~sp~fg~P~~y~-i~g--~~IH~SRli~~~g~~lpd~lk~~~~~~G~Svlq~~y 284 (765) T protein:vir:96 208 GSYKGISQIDPYWAMPQLTAESTADPSAEHFYEPDFWI-ISG--KKYHRSHLVVVRGPQPPDILKPTYIFGGIPLTQRIY 284 (765) T ss_pred ceeeEEEEechhhcccccchhccccccccccCcceeee-ecC--ceeccceEEEecCCCchhhhccccCccCccHHHHHH Confidence 2345666777665554221 11 111111110001 112 2578888999865442 22346999999999 Q ss_pred HHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHH Q lcl|NC_021305. 195 STIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFI 274 (518) Q Consensus 195 ~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~ 274 (518) +.|.....+......++........-+..-..+..+ +.+++++.......+| .++++++.+.+|+.++.+..+. . T Consensus 285 d~I~~~~~t~~~~a~Ll~k~~~~v~k~~~~~~l~~~--~~l~~r~~~~~~~r~n-~g~~~id~ee~~e~~s~~lsgl--~ 359 (765) T protein:vir:96 285 ERVYAAERTANEAPLLAMSKRTSTIHVDVEKAIANE--DAFNARLAFWIANRDN-HGVKVIGIDETMEQFDTNLSDF--D 359 (765) T ss_pred HHHHHHHHHHHHHHHHHHHhccceeeechHhhhccH--HHHHHHHHHHHHhcCC-ceeEEecCCcceeEEecccCCH--H Confidence 999999988888888877655443322222222222 2344444443333233 4578899999999998887764 5 Q ss_pred HHHHHHHHHHHHHhcCCHH-HhccccccccCCHHHHHHHHHHH-------HhhHHHHHHHHHHHHhhhhhhcccccceec Q lcl|NC_021305. 275 EARQLNREEVCGVYDIAPP-IVHILDRATFSNISAQMRAFYRD-------TMAIPIARIQSAMDKYVGQYWVRKNRMKFD 346 (518) Q Consensus 275 e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~sn~e~~~~~~~~~-------~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd 346 (518) +.......+||++.+||.. |+|....+.+++.+.....||.. .+.|.++.+-+.|-.. ......+.|. 
T Consensus 360 d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~D~~nYyD~I~s~Qe~~l~p~le~L~~li~~s----~~i~~d~~i~ 435 (765) T protein:vir:96 360 SVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEHETISYHEELESIQEHIFDPLLERHYLLLAKS----ESIDVQLEIV 435 (765) T ss_pred HHHHHHHHHHHhhhCCCeeeeccCCcccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCCCCcceEE Confidence 7888889999999999975 55555466667777777888874 4566666555444322 1222356777 Q ss_pred chhhhhcCHHHHHHH-------HHHHHhCCCcCHHHHHHHhCCCC------CCCCCc--ceeeecccccccccccccCCC Q lcl|NC_021305. 347 IDDVIQPDWEAKSES-------TQKMVNSGVATPNEGREIMGLPR------SDDPKA--DELYANSALQPLGATPDGAVE 411 (518) Q Consensus 347 ~~~l~~~d~~~~~~~-------~~~~~~~G~~T~NE~R~~~g~~p------~~~~~g--D~~~~~~n~~~~~~~~~~~~~ 411 (518) +.+|...|.++++++ +.+++.+|+++++|+|+++..++ +++... +....|.+...+........ T Consensus 436 FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~~~~~pe~~~~~~~~~~~~~- 514 (765) T protein:vir:96 436 WNPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRSGYNRLTDDQAETEPGMSPENLAELEKAGAQSA- 514 (765) T ss_pred eCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccCCCCCCCccccccccCCCccccccccCCCcccc- Confidence 789988888877654 77899999999999999986543 221110 01111111111111000000 Q ss_pred CCCCCCCCCCccCCCCCCCc-cccC-Cccccccchhc--chhhHHHHHHHH-----hhcccCCchhhHHHHHHH--HHhh Q lcl|NC_021305. 412 WEEAPAPKRPASTPVASLDQ-SPPT-SVPGLSPTNSD--RSTDSGKTEPRR-----LMQKPPPKESSPKHLRAV--KGAM 480 (518) Q Consensus 412 ~~~~~~~~~~~~~~~~~~~~-~~~~-~~~~~~~~~~~--~~~~~~~~~~~~-----~~~k~~~~~~~~~~~~~~--~~~~ 480 (518) ....+...++ .++...++. ++.. .....++.... .+..+.++++.+ .+++... ...++.-+.- ... 
T Consensus 515 ~~~~e~~~~~-a~p~~~eg~~~~~~~~p~~~~p~~~~~~~~~g~~~~~p~~~~p~~~~~~~~~-~~~~~~~~~~~~a~~- 591 (765) T protein:vir:96 515 KAKGEAERAE-AQAGAVEGAGDPVPAAPRGTKPLAKAAEEGAGEAATPPSRPNPRAELRNLLS-DLLSKLEALDDAQAP- 591 (765) T ss_pred cccCcccccc-CCCCccCCCCcccccCCcccCCccccccccCccccCccccccccccchhccc-chhhhhhcccccccc- Confidence 0000000000 000001111 1110 11111111110 000011111110 0111111 0000000000 000 Q ss_pred ccc--cCcCchhHHHHHHHHHHHhHHHHhhhhhhhcccCC Q lcl|NC_021305. 481 GRG--KDIKGFALQLAEKYPDDLEDILLAVQLALAERKDN 518 (518) Q Consensus 481 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (518) .+. -+....++..+.|+.+..-+=...++..++.-.+| T Consensus 592 ~g~~v~~~~~~a~~~a~~ps~a~~~~~~~~~~~~~~P~~~ 631 (765) T protein:vir:96 592 DGVDIEQDDAPGLKRTSKPSVSGMEPSVFSSNRIVGPRDH 631 (765) T ss_pred CCCCCCCCccchhhhhhccccCCCCCcccCCCCCCCCccc Confidence 011 01112233344444333222222233333332222 No 123 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=99.86 E-value=1.3e-19 Score=124.08 Aligned_cols=447 Identities=13% Similarity=0.066 Sum_probs=243.9 Q ss_pred CCCCccc--ccccchhhhhhhcccccccc--ccc--------ccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_021305. 7 QTLSAPA--MAELSPQMQDSYYYAPAVGM--QLE--------RQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTS 74 (518) Q Consensus 7 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~--~~~--------~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~ 74 (518) -.+|+.. ......+ .+.+. .+.++. +.. 
.+..++.+ .++.+.|.+|++.+...|.+++|.|...+ T Consensus 1 v~~~~l~~e~at~~~~-~d~~~-~~~~~l~~~~~~il~~a~~g~~~~y~~-l~~D~~i~s~l~~rk~av~~~~w~i~p~~ 77 (488) T protein:vir:99 1 MEKPALGREIATSGDG-RDITR-PFISGLQVPNDSILQRRGGNDLRVYEE-ILSDAQVKTVWGQRQLAVVSREWKVEAGG 77 (488) T ss_pred CCccchhHHHHHHHhh-hhhhc-cccCCCCCCChHHHHhhccCCHHHHHH-HhhChHHHHHHHHHHHHHhcCCceEEcCC Confidence 1111110 0001011 01110 011110 000 01223333 36789999999999999999999996433 Q ss_pred CCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCC---ceEEEEeeCCceeEEEEcCCceeeE Q lcl|NC_021305. 75 GDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSG---TPEKLMPMHPSRVAIKRNSRTGRYE 151 (518) Q Consensus 75 ~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G---~~~~l~~l~p~~v~v~~~~~~~~~~ 151 (518) ++......-..+..++.+ ..+.++++.++ +.+.+|.+++++++...| .+..+.+.++.++.+..+. .. . T Consensus 78 ~~~~~~~~ae~v~~~l~~----~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d~~~--~l-~ 149 (488) T protein:vir:99 78 DRPIDQAAAEHLEQQLQR----VGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKVRNRRRFRYDQDG--GL-R 149 (488) T ss_pred CChHHHHHHHHHHHHHhC----CCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeeeecccceeecCCC--ce-E Confidence 211111111233444444 36788888876 578899999999986543 3568889999887754432 21 1 Q ss_pred EeeecccccCceeEEeccccEEEE-eccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCc-cCCH Q lcl|NC_021305. 152 YYFQAGAGVGTQLVSFADDEVVPI-RFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK-RLSE 229 (518) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~evih~-~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~~ 229 (518) +... ..... ...++...-+++ ++....+..+|.|.+..+...+.......++...|....|.|-.+.+++. 
..++ T Consensus 150 ~~~~-~~~~~--g~~lp~~~~~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~~ 226 (488) T protein:vir:99 150 LLTP-NNMFE--GEPCPAPYFWHFSTGADNDDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTATP 226 (488) T ss_pred Eecc-CCCCC--ccccccCceEEEEeecCCCCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCCH Confidence 1111 11112 234443322333 33444567899999999999999999999999999999999998888874 6788 Q ss_pred HHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCC-hhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH Q lcl|NC_021305. 230 AAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLT-AVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA 308 (518) Q Consensus 230 ~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~-~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~ 308 (518) ++++++.+.+.+..+ ...+|++.|++++-+..+ .....|.++.++..++|+.++ +-..+-+...+++++..+ T Consensus 227 ~ek~~l~~av~~~~~-----~~~~viP~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~i-LGqtlts~~~~Gs~a~~~- 299 (488) T protein:vir:99 227 EDKAKLLAALHAIQT-----DSAIIMPAGMQAELLEAGRSGTADYKTLHDTMDATIAKVG-LGQVASTQGTPGRLGNDD- 299 (488) T ss_pred HHHHHHHHHHHHHhc-----CcEEEecCCceeEEeecCCCChHHHHHHHHHHHHHHHHHH-hhhhhcccccccchhhHH- Confidence 888888877766532 235677888777665432 222357889999999998874 122333332333444333 Q ss_pred HHHHHHHHHhhHHHHHHHHHHHHhhhhhhcc-----cccceecchhhhhcCHHHHHHHHHHHHhC-CC-cCHHHHHHHhC Q lcl|NC_021305. 309 QMRAFYRDTMAIPIARIQSAMDKYVGQYWVR-----KNRMKFDIDDVIQPDWEAKSESTQKMVNS-GV-ATPNEGREIMG 381 (518) Q Consensus 309 ~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~-----~~~~~fd~~~l~~~d~~~~~~~~~~~~~~-G~-~T~NE~R~~~g 381 (518) .........+.-.+..|+..+|+.|+...-. ..+.+|-+......|.+++++.+.++++. 
|+ ++..++|+.+| T Consensus 300 vh~~v~~d~~~aDa~~i~~tln~~li~~l~~~N~~~~~~p~~~~~~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~~G 379 (488) T protein:vir:99 300 LQADVRLDLVKADADLICESFNLGPARWLTEWNFPGAQPPRVYRVIEEPEDITAKAERDEKVFRMSGFRPTRGYVQETYG 379 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcCCcCCceeEecCCCcccHHHHHHHHHHHHhhcCCCCCHHHHHHHcC Confidence 3445677788899999999999887654322 12233444455778999999999999996 75 78888999999 Q ss_pred CCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhc Q lcl|NC_021305. 382 LPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQ 461 (518) Q Consensus 382 ~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (518) +|+-. .+ +....|. +.... +. ..+ ..+..+...+....+.+ T Consensus 380 ip~~~-~~-~~~~~~~----------------~~~~~---~~-----~~~---------~~~~~~~~~~~~~~~~~---- 420 (488) T protein:vir:99 380 VEVES-TQ-AEATAPT----------------PSTEF---AE-----GDQ---------PSDPAAAMAPQLAEAMQ---- 420 (488) T ss_pred CCCcc-cc-cccccCC----------------CcccC---CC-----CCC---------CCCchHHHHHHHHHHHH---- Confidence 98643 22 2111000 00000 00 000 00000111111111111 Q ss_pred ccCCchhhHHHHHHHHHhhccccCcCchhHHHHHHH------------HHHHhHHHHhhhhhhhcccCC Q lcl|NC_021305. 462 KPPPKESSPKHLRAVKGAMGRGKDIKGFALQLAEKY------------PDDLEDILLAVQLALAERKDN 518 (518) Q Consensus 462 k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~ 518 (518) .......+.+.++++..++.+...-++.+-| .+.+.-=.++|..+.+.--+. 
T Consensus 421 -----~~~~~~~~~i~~~l~~a~s~ee~~~~L~~l~~~~d~~~l~~~l~~a~~~a~l~G~~~~~~e~~~ 484 (488) T protein:vir:99 421 -----PVVGNWTTQLRTLIEQASSLEDLRERLLDLAPQLSLDQYAQAMAEGLEAAHLAGRNDVQEELDG 484 (488) T ss_pred -----HHHHHHHHHHHHHHHhcCCHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHhhhhhHhhhhcc Confidence 1111223334444444444443333333322 222221222333222221111 No 124 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.84 E-value=8.9e-21 Score=130.51 Aligned_cols=377 Identities=10% Similarity=0.068 Sum_probs=208.6 Q ss_pred cccccccchhhhhhhccc-cc--ccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHH Q lcl|NC_021305. 11 APAMAELSPQMQDSYYYA-PA--VGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETEESDTGYA 87 (518) Q Consensus 11 ~~~~~~~~~~~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~~~~~~~~ 87 (518) -+... .+. ....+++. .. ..............+|..++.++.+|+.+|+.+-+..|++...+ ... .+. T Consensus 1 ~~~~~-~d~-~~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~---~~~----~~~ 71 (427) T protein:vir:10 1 MKIVK-HDG-YNDIFNGGADGSPKPFFMSDASYHVGSFYNDNATAKRIVDVIPEEMVTAGFKMSGVK---DEK----EFK 71 (427) T ss_pred CCccc-cch-HHHHhhcCCCCcccCccccCchHHHHHHHHcCchhhhhhccchHHhhcCCccccCcc---HHH----HHH Confidence 00000 011 11111111 00 01111111223356799999999999999999999999874211 111 111 Q ss_pred HHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEE----------cCCCceEEEEeeCCceeEEEEcCC-------ceee Q lcl|NC_021305. 88 KLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQK----------NKSGTPEKLMPMHPSRVAIKRNSR-------TGRY 150 (518) Q Consensus 88 ~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r----------~~~G~~~~l~~l~p~~v~v~~~~~-------~~~~ 150 (518) ..+.+ ....+-+...+...-++|.+++++.- +..|.+..|.++++..+++..... +.+. 
T Consensus 72 ~~~~~----l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~~~~dp~s~~fg~P~ 147 (427) T protein:vir:10 72 SLWDS----YKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVEKRVTNARSPRYGEPE 147 (427) T ss_pred HHHHH----hhHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhcccccccccCccccccCcce Confidence 11111 12233455556666788988888743 235678899999998876643221 2233 Q ss_pred EEeeecccccCceeEEeccccEEEEeccC------CCCcccCchHHH-HHHHHHHHHHHHHHHHHHHHHccCCccccccc Q lcl|NC_021305. 151 EYYFQAGAGVGTQLVSFADDEVVPIRFFN------PDGLERGLSLME-SLKSTIFSEDSSRNATAAMWKNAGRPNLVLRH 223 (518) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~evih~~~~~------~~~~~~G~s~l~-~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~ 223 (518) .|.+ ....+...+.++++.+|||.... +....+|.|++. .+++.|.....+.......+....... +++ T Consensus 148 ~y~v--~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~v--~k~ 223 (427) T protein:vir:10 148 IYKV--SPGDNMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSLIDAICDYDYCESLATQILRRKQQAV--WKV 223 (427) T ss_pred EEEE--ecCCCCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHHHHHHHHHHHHHHHHHHHHHHhcccc--ccc Confidence 3333 22234455789999999996442 234467999985 577888888888777777666543332 333 Q ss_pred Cc---cCC-HHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHh-ccc Q lcl|NC_021305. 224 EK---RLS-EAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIV-HIL 298 (518) Q Consensus 224 ~~---~~~-~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~l-g~~ 298 (518) ++ .++ .+.....++++........+.+.+++...+.+|++++.+.... .+.......+||++.+||...| |.. 
T Consensus 224 ~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsgl--~~~~~~~~~~iaaa~~IP~t~L~G~s 301 (427) T protein:vir:10 224 KGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNSDISGV--PEFLSSKMDRIVSLSGIHEIIIKNKN 301 (427) T ss_pred hhHHHHhcCccchHHHHHHHHHHHHhcCcccceeeecCCCceeEEecccCCh--HHHHHHHHHHHHhhhCCCeeeeccCC Confidence 32 111 1222233333333322223445667777778899888887764 4788888999999999998754 555 Q ss_pred cccccCCHHHHHHHHHHHH-------hhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHH-------HHHHH Q lcl|NC_021305. 299 DRATFSNISAQMRAFYRDT-------MAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKS-------ESTQK 364 (518) Q Consensus 299 ~~~~~sn~e~~~~~~~~~~-------l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~-------~~~~~ 364 (518) .++-.++.+.....||..+ +.|.++.+-+.+ . ....+.|.+.++...+.++++ +++.+ T Consensus 302 p~Glnstgd~D~~nyyd~i~~~Qe~~l~p~l~~l~~~i----~----~s~~~~~~f~pL~~~s~kEkaei~~~~a~a~~~ 373 (427) T protein:vir:10 302 VGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFI----V----DEEEWSIEFEPLSVPSKKEESEITKNNVESVTK 373 (427) T ss_pred ccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----h----cCCCcEEEeCCCCCCCHHHHHHHHHHHHHHHHH Confidence 5555566677777777743 555555443222 1 223456666788777777665 55677 Q ss_pred HHhCCCcCHHHHHHHhC----CCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCcccc Q lcl|NC_021305. 365 MVNSGVATPNEGREIMG----LPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPP 434 (518) Q Consensus 365 ~~~~G~~T~NE~R~~~g----~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (518) ++++|+++++|+|+.+- ...+. +.+....+..+. 
..+ ...+.+++....+ T Consensus 374 ~~~~gvi~~~e~r~~L~~~~~~~~~~---------~~~~~~~e~~~~----------~~e-~~p~~~e~~~d~~ 427 (427) T protein:vir:10 374 AITEQIIDLEEARDTLRSIAPEFKLK---------DGNNINIREPEE----------TTE-PEPGLGEKLEDEN 427 (427) T ss_pred HHhcCCCCHHHHHHHHHhhhccccCC---------CCccccccccch----------hcC-CCCCCCCCCCCCC Confidence 99999999999998772 21111 111111110000 000 0000001111111 No 125 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.83 E-value=3.5e-21 Score=132.75 Aligned_cols=418 Identities=14% Similarity=0.119 Sum_probs=231.3 Q ss_pred Cc-CCCCCCCCcccccccchhhhhhh-------cccccccc--------cccccchhhhHHHhhcHHHHHHHHHHHHhhc Q lcl|NC_021305. 1 ML-LANGQTLSAPAMAELSPQMQDSY-------YYAPAVGM--------QLERQFSLYGGIYKNQPWVRTVIAKRAQALA 64 (518) Q Consensus 1 ~~-f~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~--------~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia 64 (518) |= +..+-..+++..+. +.....| .+....+. .........++++.+++.+..+|+.+.+.+- T Consensus 1 m~~~~~~~~a~~~~~~~--~~~~~~y~aa~~~~~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~~vV 78 (495) T protein:vir:10 1 MNMTPSGYQSLASGLLV--PVGASAYEGASGGHRWQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVAAAV 78 (495) T ss_pred CCcccccccccchhhhh--HHHhhhhhccccCcccCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhc Confidence 32 22333333322111 1100000 01000111 1111234566789999999999999999997 Q ss_pred cCceEEEEecCCcc-eeccchHHHHHHhcCC--cCCCHHHHHHHHHHHHHHcCCeEEEEEEcC--CC--ceEEEEeeCCc Q lcl|NC_021305. 65 RLPVKCMFTSGDTE-TEESDTGYAKLLADPC--EYLDPFAFWEWVASTLDIYGETYLAIQKNK--SG--TPEKLMPMHPS 137 (518) Q Consensus 65 ~l~~~v~~~~~~~~-~~~~~~~~~~L~~~PN--~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~--~G--~~~~l~~l~p~ 137 (518) ...|+..-...+.. ...-......+..++. ..++++.+...+++.++..|++|+.+.+.. 
.| .+..|..|+|+ T Consensus 79 G~Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~lqliepd 158 (495) T protein:vir:10 79 GNGLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQLQIIEPD 158 (495) T ss_pred CCCcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccceEEEEechh Confidence 76776543322211 1111122223333332 357899999999999999999999887543 23 36789999998 Q ss_pred eeEE-----------------EEcCCceeeEEeeeccccc-------CceeEEeccccEEEEeccCCCCcccCchHHHHH Q lcl|NC_021305. 138 RVAI-----------------KRNSRTGRYEYYFQAGAGV-------GTQLVSFADDEVVPIRFFNPDGLERGLSLMESL 193 (518) Q Consensus 138 ~v~v-----------------~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~ 193 (518) ++.. +.+..|..+.|.+....+. ....+.+++++|+|+.. ...+...|+|.+.. T Consensus 159 ~l~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~~rvpA~~vlH~f~-~r~gQ~RGis~la~- 236 (495) T protein:vir:10 159 MLASDIPDETLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIGDPVDTVWIKAEHVLHVTV-LTVRSDAGAPWFQL- 236 (495) T ss_pred hcCCCCCCCCCCCCCEEEeceEECCCCceEEEEEeecCCCcccccccccceeeechhheEeccc-cCCCcccCcchhHH- Confidence 8852 2233445566665433221 22456799999999964 45678899997654 Q ss_pred HHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHH----HHHHHHHhcCccccCCeeecCCCcceeeccCChh Q lcl|NC_021305. 194 KSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRL----REQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAV 269 (518) Q Consensus 194 ~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~----~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~ 269 (518) ...+.......+......+-.+...++|+.+.. +....... .+.-..... .-+.|.+..|..|.+++.++.+.. 
T Consensus 237 i~~l~~l~~y~dael~~a~i~A~~~~fi~~~~~-~~~~~~~~~~~~~~~~~~~~~-~l~pG~i~~L~pGe~i~~~~p~~p 314 (495) T protein:vir:10 237 LLRLNELDQYEDAELVRKKTAALFAAFIQEATA-DSTGGPTIGQPKRSKGGKRIT-GLNPGTLQYLQPGQEVKFSNPADV 314 (495) T ss_pred HHHHHHhhHHHHHHHHHHHHhhhheeeeecCCC-ccccccccCccccccCcccce-ecCCceeeecCCCCeeeeeCCCCC Confidence 445666666666666666666777777765421 00000000 000000001 124577888999999998887766 Q ss_pred hHHHHHHHHHHHHHHHHHhcCCHHH-hccccccccCCHHHHHHHHHHHH------------hhHHHHH-HHHHHHHhhhh Q lcl|NC_021305. 270 EMQFIEARQLNREEVCGVYDIAPPI-VHILDRATFSNISAQMRAFYRDT------------MAIPIAR-IQSAMDKYVGQ 335 (518) Q Consensus 270 d~~~~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~~~sn~e~~~~~~~~~~------------l~P~~~~-ie~~l~~~l~~ 335 (518) ...|.++.+...+.||+.+|||.+. .|+.++.|||+..+....|...+ ++|+... ++.++-.-.++ T Consensus 315 ~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~ 394 (495) T protein:vir:10 315 GTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVV 394 (495) T ss_pred CCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCC Confidence 7789999999999999999999775 57888889999877665554433 2232222 22222221111 Q ss_pred --hhc--ccc--cceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeee------cccccccc Q lcl|NC_021305. 336 --YWV--RKN--RMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYA------NSALQPLG 403 (518) Q Consensus 336 --~~~--~~~--~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~------~~n~~~~~ 403 (518) .+. +.. .+++-.-.....|+.+.+++...++.+|+.|+-++-.+.|.++-+ .-+++.. -.+ .+++ T Consensus 395 ~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~G~D~~~--v~~q~a~e~~~~~~~G-l~~~ 471 (495) T protein:vir:10 395 IPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAERGYDMEE--LFDMISDANQLIDEYD-LRLD 471 (495) T ss_pred CCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHH--HHHHHHHHHHHHHHcC-CCCC Confidence 000 111 123333455777999999999999999999999999889988742 1121110 000 0111 Q ss_pred cccccCCCCCCCCCCCCCccCCCCCCCc Q lcl|NC_021305. 
404 ATPDGAVEWEEAPAPKRPASTPVASLDQ 431 (518) Q Consensus 404 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (518) ..+.... ..+...++ ..+.++.++ T Consensus 472 ~~p~~~~---~~~~~~~~-~~~~~~~~e 495 (495) T protein:vir:10 472 SDPRYVN---GSGAEQKS-VMEAALNNE 495 (495) T ss_pred CCCCcCC---CccCCCCC-CCCCCCCCC Confidence 1111000 00000000 000000000 No 126 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=99.82 E-value=1e-18 Score=119.18 Aligned_cols=456 Identities=15% Similarity=0.090 Sum_probs=255.8 Q ss_pred CcCCCCCCCCccc-ccccchhh---hhhhcccccccccc---------cccchhhhHHHhhcHHHHHHHHHHHHhhccCc Q lcl|NC_021305. 1 MLLANGQTLSAPA-MAELSPQM---QDSYYYAPAVGMQL---------ERQFSLYGGIYKNQPWVRTVIAKRAQALARLP 67 (518) Q Consensus 1 ~~f~~~~~~~~~~-~~~~~~~~---~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~ 67 (518) .+-.-|+..+... .+.++.-+ ...+...+..+... ..+..++.++ ++.+.|.+|++.+...|.+++ T Consensus 5 i~~~~g~~~~~~~~~~~~~~~ia~~~~~~~~~~~~~~~p~~~~il~~~~~~~~~y~~m-~~D~~i~s~l~~Rk~av~~~~ 83 (491) T protein:vir:79 5 LWVSPTEFVKFGEPDKSLSSQIATRARSIDFFALGMYLPNPDPVLKALGKDIRVYREL-RADAHVGGCVRRRKAAVKALE 83 (491) T ss_pred eeCCCCCcccccccchhHHHHHhhhccccccccccccCcchhHHHhhccCCHHHHHHH-hhChHHHHHHHHHHHHHhCCC Confidence 2333333322111 11111111 10111111111000 0112344454 589999999999999999999 Q ss_pred eEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCC---ceEEEEeeCCceeEEEEc Q lcl|NC_021305. 68 VKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSG---TPEKLMPMHPSRVAIKRN 144 (518) Q Consensus 68 ~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G---~~~~l~~l~p~~v~v~~~ 144 (518) |.|...+++. 
+ ....+..++.++ .+.++++.++ +.+.+|.+++++++...| .+..+.++++.++.+..+ T Consensus 84 w~i~~~~~~~--~-~a~~i~e~l~~~----~~~~~i~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d~~ 155 (491) T protein:vir:79 84 WGLDRGKAKS--R-VAKSIADVFADL----DLSRIATEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVYDPE 155 (491) T ss_pred cEEecCCCCH--H-HHHHHHHHHhcC----CHHHHHHHHH-HhhhhcceeEEEEEeecCCeeeEEeeeeecccceeeccC Confidence 9996544331 1 123344455544 5777887764 578899999999986653 356888999988876543 Q ss_pred CCceeeEEeeecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC Q lcl|NC_021305. 145 SRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE 224 (518) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~ 224 (518) . . ..+... . .......+++..+|++++....+..+|.+.+..+...+.......++...|.+..|.|--+.+++ T Consensus 156 ~--~-l~l~~~-~--~~~~g~~lp~~k~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~ 229 (491) T protein:vir:79 156 N--Q-LRFRSK-E--HWVQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHP 229 (491) T ss_pred C--c-eEEeec-C--CCCCceeecCCCeEEEEecCCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecC Confidence 2 2 222211 1 12233567888888888877778889999999999999999999999999999999999999999 Q ss_pred ccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCC---hhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccc Q lcl|NC_021305. 225 KRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLT---AVEMQFIEARQLNREEVCGVYDIAPPIVHILDRA 301 (518) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~---~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 301 (518) ...++++++++.+.+.+..+ + ..+|++.|++++-+..+ .....|.++.++..++|+.+.- --++-.. 
.++ T Consensus 230 ~~a~~~ek~~l~~al~~~~~---~--a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iL-GqtlTt~-~~g 302 (491) T protein:vir:79 230 RSASDAETNLLLDRLEDMVQ---D--AVAVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIALL-GQNQTTE-ATS 302 (491) T ss_pred CCCCHHHHHHHHHHHHHHhc---C--eEEEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHHh-hhhhccC-ccc Confidence 88899999888887776532 2 35677888777665332 2223478888888888888651 1111111 233 Q ss_pred ccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhh-----cccccceecchhhhhcCHHHHHHHHHHHHhCCC-cCHHH Q lcl|NC_021305. 302 TFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYW-----VRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGV-ATPNE 375 (518) Q Consensus 302 ~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~-----~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~-~T~NE 375 (518) +++..+ .........+.-.+..++..||+ |+... .....++|.+..... +.+.+++.+++++..|+ ++.++ T Consensus 303 s~a~~~-vh~~v~~~i~~~D~~~i~~tln~-li~~l~~~N~~~~~~p~f~~~e~ee-~~~~~a~~~~~L~~~G~~i~~~~ 379 (491) T protein:vir:79 303 TRASAQ-AGLEVTDDIRDGDKAIVVEAMNM-LIRWICDLNFDGAARPVFDMWEQEQ-VDEIQAGRDEKLTRAGARFTPAY 379 (491) T ss_pred chhhHH-HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCCCcceEeecCcCc-hhHHHHHHHHHHHhCCCccCHHH Confidence 444433 33445566777788888888885 54332 112234565544332 23567899999999997 78999 Q ss_pred HHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHH Q lcl|NC_021305. 376 GREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTE 455 (518) Q Consensus 376 ~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (518) +|+.+|+|+-+ .++....+.. ......... .+...+ ..+..+...... 
T Consensus 380 ~~e~~Gip~~~--~~e~~~~~~~------------~~~~~~~~~--------~~~~~~----------~~~~~d~~~~~~ 427 (491) T protein:vir:79 380 FKRAYNLQDGD--LDERPLPVSA------------VDAVGAASF--------AEFEAP----------DQDALDAALNAL 427 (491) T ss_pred HHHHhCCCCCC--CCccccCcCc------------ccccccccc--------cccCCC----------CCcchHHHHHHH Confidence 99999997543 2222110000 000000000 000000 000001111111 Q ss_pred HHHhhcccCCchhhHHHHHHHHHhhccccCcCchhHHHHHHHH----HHHhHHHH-hhhhh-hhcccCC Q lcl|NC_021305. 456 PRRLMQKPPPKESSPKHLRAVKGAMGRGKDIKGFALQLAEKYP----DDLEDILL-AVQLA-LAERKDN 518 (518) Q Consensus 456 ~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~-~~~~~-~~~~~~~ 518 (518) +.+.. .+......+.|.+.++..++.+...-++++-|. ++++.++. |...| |+-|-+- T Consensus 428 ~~~~~-----~~~~~~~~~~i~~~l~~~~s~~e~~~~L~~l~~~~d~~~l~~~l~~a~~~A~l~Gr~~a 491 (491) T protein:vir:79 428 SARDL-----NADAQALVAPLLKRIANGASADELLGMLAELYPSLDTDALQERLARAIFVANLWGRLHA 491 (491) T ss_pred HHHHH-----HHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHhhhccC Confidence 11100 122233345566667666666655555555552 22222111 11111 2223333 No 127 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=99.82 E-value=1.6e-18 Score=118.11 Aligned_cols=456 Identities=15% Similarity=0.079 Sum_probs=255.2 Q ss_pred CcCCCCCCCCccc-ccccch-------hhhh-hhccccccccc----ccccchhhhHHHhhcHHHHHHHHHHHHhhccCc Q lcl|NC_021305. 1 MLLANGQTLSAPA-MAELSP-------QMQD-SYYYAPAVGMQ----LERQFSLYGGIYKNQPWVRTVIAKRAQALARLP 67 (518) Q Consensus 1 ~~f~~~~~~~~~~-~~~~~~-------~~~~-~~~~~~~~~~~----~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~ 67 (518) .+=--|+..+.+. .+.++. ++.. 
+++..+..--+ ...+..++.++ ++.+.|.+|++.+...|.+++ T Consensus 5 i~~~~g~p~~~~~~~~~~~~~ia~~~~~~~~~~~~~~~~~~~~iLr~~~~~~~~y~~m-~~D~~i~s~l~~Rk~av~~~~ 83 (491) T protein:vir:10 5 LWVSPTEFVTFGEPDKSLSSQIATRARSIDFFALGMYLPNPDPVLKALGKDIRVYREL-RADAHVGGCVRRRKAAVKALE 83 (491) T ss_pred eeCCCCCccCcccCChHHHHHHHhhhcccccccccCCccchHHHHHhcCCCHHHHHHH-hhChHHHHHHHHHHHHHhCCC Confidence 2222333322111 111111 1110 01110000000 00123345554 589999999999999999999 Q ss_pred eEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCC---ceEEEEeeCCceeEEEEc Q lcl|NC_021305. 68 VKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSG---TPEKLMPMHPSRVAIKRN 144 (518) Q Consensus 68 ~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G---~~~~l~~l~p~~v~v~~~ 144 (518) |.|...+++. ........++.++ .+.++++.++ +.+.+|.+++++++...| .+..+.++++.++.+..+ T Consensus 84 w~i~~~~~~~---~~~e~v~e~l~~~----~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d~~ 155 (491) T protein:vir:10 84 WGLDRGKAKS---RVAKSIADVFADL----DLSRIVTEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVYDPE 155 (491) T ss_pred cEEecCCCCH---HHHHHHHHHHhcC----CHHHHHHHHH-HhhhhcceeEEEEEeecCCeeEEEEeeeecccceeeccC Confidence 9996543321 1123344455543 5788888876 678999999999987654 356888999988876443 Q ss_pred CCceeeEEeeecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC Q lcl|NC_021305. 145 SRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE 224 (518) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~ 224 (518) . . ..+.... 
.......+++..+|++++....+..+|.+.+..+...+.......++...|....|.|--+.+++ T Consensus 156 ~--~-l~~~~~~---~~~~g~~l~~~k~i~~~~~~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~ 229 (491) T protein:vir:10 156 N--Q-LRFRSKD---HWMQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHP 229 (491) T ss_pred C--c-eEEecCC---CCCCcceecCCCEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecC Confidence 2 2 2222111 12233567888888888777777889999999999999999999999999999999999999999 Q ss_pred ccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccC--Chh-hHHHHHHHHHHHHHHHHHhcCCHHHhcccccc Q lcl|NC_021305. 225 KRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQL--TAV-EMQFIEARQLNREEVCGVYDIAPPIVHILDRA 301 (518) Q Consensus 225 ~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~--~~~-d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~ 301 (518) ...++++++++.+.+.+..+ + ..+|++.|++++-+.. +.. ...|.++.++..++|+.+.-= -++-.. .++ T Consensus 230 ~~a~~~ek~~l~~al~~~~~---~--a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLG-qtlTt~-~~g 302 (491) T protein:vir:10 230 RSASDGEKNLLLDCLEDMVQ---D--AVAVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLG-QNQTTE-ATS 302 (491) T ss_pred CCCCHHHHHHHHHHHHHHhc---C--cEEEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhh-hhcccC-ccc Confidence 88899999998888777532 2 3577888877766543 222 234778888888888876310 112111 233 Q ss_pred ccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhh-----cccccceecchhhhhcCHHHHHHHHHHHHhCCC-cCHHH Q lcl|NC_021305. 302 TFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYW-----VRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGV-ATPNE 375 (518) Q Consensus 302 ~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~-----~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~-~T~NE 375 (518) +++..+ .........+.-.+..++..+|+ |+... ....+.+|.+.... 
.+.+++++.+.+++..|+ ++..+ T Consensus 303 s~a~~~-vh~~v~~di~~~D~~~i~~tln~-li~~l~~~N~~~~~~p~f~~~~~~-e~~~~~a~~~~~L~~~G~~i~~~~ 379 (491) T protein:vir:10 303 TRASAQ-AGLEVTDDIRDGDKAVVSEAMNM-LIRWICDLNFDGADRPVFDMWEQE-QVDEIQAGRDQKLTQAGARFTPAY 379 (491) T ss_pred chhHHH-HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCCCcceEEecCcC-chhHHHHHHHHHHHhCCCcCCHHH Confidence 443333 33445566677778888888885 54322 12234456555433 334778999999999997 78899 Q ss_pred HHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHH Q lcl|NC_021305. 376 GREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTE 455 (518) Q Consensus 376 ~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (518) +|+++|+|.-+. +.... ... .....+.. ..+ +...+.+ +..++..... T Consensus 380 i~e~~Gip~~~~--~~~~~--------~~~-----~~~~~~~~-~~~------~~~~~~~----------~~~d~~~~~~ 427 (491) T protein:vir:10 380 FKRAYNLQDGDL--DERPL--------PVS-----AVDTVGAA-SFA------EFEAPDQ----------DALDAALNTL 427 (491) T ss_pred HHHHhCCCCCCc--Ccccc--------ccC-----CCCCcccc-ccc------ccCCCCC----------CchHHHHHHH Confidence 999999975431 22110 000 00000000 000 0000000 0000111111 Q ss_pred HHHhhcccCCchhhHHHHHHHHHhhccccCcCchhHHHHHHHHH----HHhHHH-Hhhhhh-hhcccCC Q lcl|NC_021305. 456 PRRLMQKPPPKESSPKHLRAVKGAMGRGKDIKGFALQLAEKYPD----DLEDIL-LAVQLA-LAERKDN 518 (518) Q Consensus 456 ~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~-~~~~~~-~~~~~~~ 518 (518) ..+.. 
.+......+.|.+.++..++.+...-++.+-|.+ +++.++ -|...| |+-|-+- T Consensus 428 ~~~~~-----~~~~~~~~~~i~~~l~~~~s~~e~~~~L~~l~~~~d~~~l~~~l~~a~~~A~l~G~~~a 491 (491) T protein:vir:10 428 SARDL-----NADAQALVAPLLKRIANGASADELLGMLAELYPSLDADALQERLARAIFVANLWGRLHA 491 (491) T ss_pred HHHHH-----HHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHhhhccC Confidence 00100 1122333455666666666665544444444422 222111 111111 2223333 No 128 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.80 E-value=2.1e-20 Score=128.45 Aligned_cols=458 Identities=9% Similarity=-0.004 Sum_probs=237.4 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhcc-CceEEEEe--cCCc Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALAR-LPVKCMFT--SGDT 77 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~-l~~~v~~~--~~~~ 77 (518) |.+..=... ... +....| ....+................++++.+++.+..+|+.+.+.+.. ..+.+.-. ..+. T Consensus 24 ~~~~~y~aa-~~~-r~~~~~-~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG~~G~~i~p~~l~~d~ 100 (548) T protein:vir:95 24 EAIQAYEAA-RPG-RTHKAK-RQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLEERVVGGSGIGVEPLPLRLDG 100 (548) T ss_pred HHhcccccc-Ccc-cccccc-CCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhccCccccceeeeecCCCH Confidence 111100000 000 001111 11111111111112223456778899999999999998777754 23332211 1111 Q ss_pred c--eecc---chHHHHHHhcCC--cCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCC-------ceEEEEeeCCceeE--- Q lcl|NC_021305. 78 E--TEES---DTGYAKLLADPC--EYLDPFAFWEWVASTLDIYGETYLAIQKNKSG-------TPEKLMPMHPSRVA--- 140 (518) Q Consensus 78 ~--~~~~---~~~~~~L~~~PN--~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G-------~~~~l~~l~p~~v~--- 140 (518) . ++.. ......+..++. ..++++.+...+++.++..|++|+.+.+...+ .+..|..|+|+++. 
T Consensus 101 ~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~~g~~~~~~lqliepd~l~~~~ 180 (548) T protein:vir:95 101 SVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNYTFATSVPFALELLEPDYLPFSY 180 (548) T ss_pred HHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeecccccccCCcccceEEEEechhhcCCCC Confidence 1 1111 111222333332 35789999999999999999999998865432 35688999998875 Q ss_pred ----------EEEcCCceeeEEeeeccccc-------CceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHH Q lcl|NC_021305. 141 ----------IKRNSRTGRYEYYFQAGAGV-------GTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSS 203 (518) Q Consensus 141 ----------v~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~ 203 (518) |+.+..|..+.|.+....+. ....+.+++++|+|+......+..+|+|.+..++..+...... T Consensus 181 ~~~~~~i~~GIE~D~~Grp~aY~i~~~hPgd~~~~~~~~~~~rvpA~~VlHif~~~r~gQ~RGvs~lapvl~~l~~l~~y 260 (548) T protein:vir:95 181 NNLSKGIVQGIERDTWRRKRAYHLLKDHPGNLQTLGGSLAVKRVEAERIIHIAYRKRIGQNRGVPMLHAVLIRLADLKDY 260 (548) T ss_pred CCCCCceeeeeEECCCCceEEEEEeecCCCcccccccccceeeechhHheecccccCCccccCcchHHHHHHHHHHHhHH Confidence 44455566677766543221 2345679999999999988888999999999999999988888 Q ss_pred HHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCe-eecCCCcceeeccCChhhHHHHHHHHHHHH Q lcl|NC_021305. 204 RNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKT-MVVEEGMEPIPLQLTAVEMQFIEARQLNRE 282 (518) Q Consensus 204 ~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~-~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~ 282 (518) +.......+-.+...++|+.+..-... .+.....-... 
-.-..|.+ ..|..|.+++.+..+.....|.++.+...+ T Consensus 261 ~dael~~aki~A~~a~fi~~~~~~~~~-~~~~~~~~~~~--~~~~pG~iv~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr 337 (548) T protein:vir:95 261 EESERVAARISAALAMYIKKGNPDSYT-VEPGKDRKNRT--IPIAPGMVFDDLEPGEDVGMIESNRPNPFLEGFRNGQLR 337 (548) T ss_pred HHHHHHHHHHhhhheeeeecCCCcccc-CCCCccccccc--ccccCCccccccCCCceeeecCCCCCCCCHHHHHHHHHH Confidence 888887777778888888765321100 00000000000 01123554 358889998888877667789999999999 Q ss_pred HHHHHhcCCHHH-hccccccccCCHHHHHHHHHHH-----------HhhHHHHH-HHHHHHHhhhh--hh-cccccceec Q lcl|NC_021305. 283 EVCGVYDIAPPI-VHILDRATFSNISAQMRAFYRD-----------TMAIPIAR-IQSAMDKYVGQ--YW-VRKNRMKFD 346 (518) Q Consensus 283 ~Ia~~fgVPp~~-lg~~~~~~~sn~e~~~~~~~~~-----------~l~P~~~~-ie~~l~~~l~~--~~-~~~~~~~fd 346 (518) .||+.+|||-+. .|+. ++|||+..+....|... .++|+... ++.++-.-.++ .. .+..++... T Consensus 338 ~IAaglGipYe~ltgD~-s~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~ 416 (548) T protein:vir:95 338 MIGAGTRSTYSSVSRAY-DGTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAV 416 (548) T ss_pred HHHhhcCCCHHHHhccc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeee Confidence 999999999665 4554 46899887666555443 33443332 22222222221 00 111122333 Q ss_pred c--hhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--------CCcceeeecccccccccccccC-CCCCCC Q lcl|NC_021305. 347 I--DDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDD--------PKADELYANSALQPLGATPDGA-VEWEEA 415 (518) Q Consensus 347 ~--~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~--------~~gD~~~~~~n~~~~~~~~~~~-~~~~~~ 415 (518) + -.....|+.+.+++...++.+|+.|.-|+-.+.|.++-+. .-.+++=++....+........ .+..+. 
T Consensus 417 W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~~G~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~~~~ 496 (548) T protein:vir:95 417 YQGPVMPWINPMHEANAWELLVKAGFADEAEVARARGRDPRELKKSRETEIKANRAAGLVFSSDAYHQLVKSGMDPVEAV 496 (548) T ss_pred eecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCcccccccccccCCCCch Confidence 2 3456679999999999999999999999888888876320 0011111111111111100000 000000 Q ss_pred CCCCCCcc--CCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcccCCchh Q lcl|NC_021305. 416 PAPKRPAS--TPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKES 468 (518) Q Consensus 416 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~ 468 (518) .+....+. .+..+..+..|.-..+..-..-+=.. +.+.....-++...+- T Consensus 497 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~ 548 (548) T protein:vir:95 497 QKVYLGVGKMLTADEARELVNRYGAGLPVPGPDFPN---ESNNGGADGQPSNPDP 548 (548) T ss_pred hhhccccccccccchhHHhhccCCCCCcCCCCCCCc---ccccCCCCCCCCCCCC Confidence 00000000 00001111111000000000000000 0000000001111111 No 129 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=99.79 E-value=1.8e-18 Score=117.89 Aligned_cols=409 Identities=11% Similarity=0.016 Sum_probs=228.2 Q ss_pred CcCCCCCCCCc---ccccccchh-------hhhhhccccccccc---------ccccchhhhHHHhhcHHHHHHHHHHHH Q lcl|NC_021305. 1 MLLANGQTLSA---PAMAELSPQ-------MQDSYYYAPAVGMQ---------LERQFSLYGGIYKNQPWVRTVIAKRAQ 61 (518) Q Consensus 1 ~~f~~~~~~~~---~~~~~~~~~-------~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~v~~~v~~ia~ 61 (518) |==. ..+|+ |....+.+. ++..+...+..|.. ......++.++ +..+.|.+|++.+.. 
T Consensus 1 m~kk--~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly~~m-~~D~hi~s~l~~Rk~ 77 (448) T protein:vir:77 1 MAKR--GRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDGLLVYHKM-LSDGTVKNALNYIFG 77 (448) T ss_pred CCCC--CCCCcccCCcccccchhhhhhhccchhhhcccccccccccchhHhhccccchHHHHHH-hhChHHHHHHHHHHH Confidence 4322 22222 211111111 11111111111111 11234455555 458999999999999 Q ss_pred hhccCceEEEEecCCcceeccchHHHHHHhcCC---cCCCHHHHHHHHHHHHHHcCCeEEEEEEcC--CCc--eEEEEee Q lcl|NC_021305. 62 ALARLPVKCMFTSGDTETEESDTGYAKLLADPC---EYLDPFAFWEWVASTLDIYGETYLAIQKNK--SGT--PEKLMPM 134 (518) Q Consensus 62 ~ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN---~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~--~G~--~~~l~~l 134 (518) .|.+++|.|-..+++.........+...+..+. ...++.+++..+ .+.+.+|.+++++++.. +|. +..|.+. T Consensus 78 av~~~~w~v~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~-lda~~~G~s~~Eivw~~~~dg~~~~~~l~~r 156 (448) T protein:vir:77 78 RIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTLGADGKLILDKIVPI 156 (448) T ss_pred HHhcCCceEecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHH-HHhhhhcceeEEEEEeecCCCceeecccccc Confidence 999999998543222111111122333344333 234688888886 57899999999999853 454 3456666 Q ss_pred CCceeE-EEEcCCceeeEEeeecccc----cCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 135 HPSRVA-IKRNSRTGRYEYYFQAGAG----VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAA 209 (518) Q Consensus 135 ~p~~v~-v~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~ 209 (518) ++..++ +..+.++... +....... .+...+.++...++|.++ ...+..+|.+.+..+...+.......++... 
T Consensus 157 ~~~~~~~f~~~~~~~l~-~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~-~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~ 234 (448) T protein:vir:77 157 HPFNIDEVLYDEEGGPK-ALKLSGEVKGGSQFVNGLEIPIWKTVVFLH-NDDGSFTGQSALRAAVPHWLAKRALILLINH 234 (448) T ss_pred CCCccceeeeecCCceE-EEecCCcccccccCCCccccccceEEEEec-CCcCCcccchHHHHHHHHHHHHHhhHHHHHH Confidence 765432 2222222221 11111111 112234567788898876 4567789999999999999999999999999 Q ss_pred HHHccCCcccccccCccCC--HHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 210 MWKNAGRPNLVLRHEKRLS--EAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGV 287 (518) Q Consensus 210 ~~~ng~~p~~il~~~~~~~--~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~ 287 (518) |.+.-|.|--+.+.+...+ +++++.+.+...+...| .++ .+|++.|++++-+..+.....+.+..++..++|+.+ T Consensus 235 f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g-~~a--~~iiP~g~~ie~~ea~~~~~~~~~~i~~~d~~Isk~ 311 (448) T protein:vir:77 235 GLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQK-PRH--GIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARA 311 (448) T ss_pred HHHHcCCceeEEecCCCCCCCHHHHHHHHHHHHHHhcC-Cce--EEEecCCceEEEEecCCCccCHHHHHHHHHHHHHHH Confidence 9999999999988876544 45666676665554333 233 467888888776665555556778888889999887 Q ss_pred hcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhh-----cc-cccceecchhhhhcCHHHHHHH Q lcl|NC_021305. 288 YDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYW-----VR-KNRMKFDIDDVIQPDWEAKSES 361 (518) Q Consensus 288 fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~-----~~-~~~~~fd~~~l~~~d~~~~~~~ 361 (518) ..- ..+--....++++.............+.-.++.|++.||+.|+... +. ..+.+|-++.....|.+..++. 
T Consensus 312 iLG-qtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~lNfg~~~~~P~~~f~~~e~eDl~~~a~~ 390 (448) T protein:vir:77 312 LGI-DFNTVQLNMGVQAVNIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPGATRFPRLTFEMEERNDFSAAANL 390 (448) T ss_pred Hhc-cccccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCEEEecCCChhhHHHHHHH Confidence 632 2222122223334433343456667778899999999999887644 11 1233444455567788889998 Q ss_pred HHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccc Q lcl|NC_021305. 362 TQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLS 441 (518) Q Consensus 362 ~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (518) +.+++ +-+|+.+|+|.-. .++ . + . .+.++.+.+ ...+.+. . T Consensus 391 ~~~l~-------~~~~~~~~ip~~~-~~~---~------~----~---~~~~~~~~~-------~~~~~~~--------~ 431 (448) T protein:vir:77 391 MGMLI-------NAVKDSEDIPTEL-KAL---I------D----A---LPSKMRRAL-------GVVDEVR--------E 431 (448) T ss_pred hHHHH-------HHHHHHhcCCccC-CcC---C------C----C---Cchhccccc-------CCCCCCC--------c Confidence 88886 4589999986321 110 0 0 0 000000000 0000000 0 Q ss_pred cchhcchhhHHHHHHHHhh Q lcl|NC_021305. 442 PTNSDRSTDSGKTEPRRLM 460 (518) Q Consensus 442 ~~~~~~~~~~~~~~~~~~~ 460 (518) + .....+.+....++.. T Consensus 432 ~--~~~~~~~~~~~~r~~~ 448 (448) T protein:vir:77 432 A--VRQPADSRYLYTRRRR 448 (448) T ss_pred h--hhcchhhHHHHhhhcC Confidence 0 0011111111111111 No 130 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=99.79 E-value=4.8e-17 Score=110.04 Aligned_cols=461 Identities=12% Similarity=0.046 Sum_probs=248.5 Q ss_pred CcCC-CCCCCCcc-cccccch---hhhhhhcccccccccc-----------cccch----hhhHHHhhcHHHHHHHHHHH Q lcl|NC_021305. 
1 MLLA-NGQTLSAP-AMAELSP---QMQDSYYYAPAVGMQL-----------ERQFS----LYGGIYKNQPWVRTVIAKRA 60 (518) Q Consensus 1 ~~f~-~~~~~~~~-~~~~~~~---~~~~~~~~~~~~~~~~-----------~~~~~----~~~~~~~~~~~v~~~v~~ia 60 (518) =++. -|++.+.+ -.+...+ +++..+...+..|... ..+.. ++.+..++.+.|.+|++.+. T Consensus 3 ~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~iL~~a~~gd~~~~~~L~~dm~~~D~hi~s~l~~Rk 82 (512) T protein:vir:19 3 RILDISGQPFDFDDEMQSRSDELAMVMKRTQEHPSSGVTPNRAAQMLRDAERGDLTAQADLAFDMEEKDTHLFSELSKRR 82 (512) T ss_pred ceeCCCCCccccccccccccchhcccchhhccccccCCCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHHHHHH Confidence 1222 22222111 0111111 2222332223222211 11122 23334467899999999999 Q ss_pred HhhccCceEEEEecCCcce--eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCC---CceEEEEeeC Q lcl|NC_021305. 61 QALARLPVKCMFTSGDTET--EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS---GTPEKLMPMH 135 (518) Q Consensus 61 ~~ia~l~~~v~~~~~~~~~--~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~---G~~~~l~~l~ 135 (518) ..|.+++|.|....+.... +..+.....|...| ++.++++.++ +.+.+|.+++++++... ..+..+.+.+ T Consensus 83 ~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~----~f~~~~~~ll-dA~~~G~s~~Ei~w~~~~g~~~~~~~~~r~ 157 (512) T protein:vir:19 83 LAIQALEWRIAPARDASAQEKKDADMLNEYLHDAA----WFEDALFDAG-DAILKGYSMQEIEWGWLGKMRVPVALHHRD 157 (512) T ss_pred HHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcCC----CHHHHHHHHH-hhhhhcceeeeeEeeeeCCceeeeeeeeec Confidence 9999999999654332211 11122223333344 4777777755 57889999999998543 3567888999 Q ss_pred CceeEEEEcCCceeeEEeeecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccC Q lcl|NC_021305. 136 PSRVAIKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAG 215 (518) Q Consensus 136 p~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~ 215 (518) +..+.+..+... .+.+ ... 
......+++...++.++....+..+|.+.+..+...+.......++...|....| T Consensus 158 ~~~f~~~~~~~~-~lr~--~~~---~~~G~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG 231 (512) T protein:vir:19 158 PALFCANPDNLN-ELRL--RDA---SYHGLELQPFGWFMHRAKSRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYG 231 (512) T ss_pred cccceeccCCCc-EEEe--cCC---CCCceeecCCceEEEeccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcC Confidence 988776554332 2221 111 1223457777777777777778889999999999999999999999999999999 Q ss_pred CcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCC-hhhHHHHHHHHHHHHHHHHHhcCCHHH Q lcl|NC_021305. 216 RPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLT-AVEMQFIEARQLNREEVCGVYDIAPPI 294 (518) Q Consensus 216 ~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~-~~d~~~~e~~~~~~~~Ia~~fgVPp~~ 294 (518) .|--+.+++...++++++.+.+.+.+..+ ...+|++.|++++-+..+ .....|.++.++..++|+.+. +-.++ T Consensus 232 ~P~~igky~~~a~~~ek~~L~~al~~~~~-----~a~~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~i-LGqtl 305 (512) T protein:vir:19 232 LPMRVGKYPTGSTNREKATLMQAVMDIGR-----RAGGIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAI-LGGTL 305 (512) T ss_pred CCeeEEecCCCCCHHHHHHHHHHHHHHhh-----CcEEEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHH-hhhhh Confidence 99999999988899999888888777532 235778888877655432 333458888999999999873 11111 Q ss_pred hcc-ccccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhccc---------ccceecchhhhhcCHHHHHHHHHH Q lcl|NC_021305. 295 VHI-LDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRK---------NRMKFDIDDVIQPDWEAKSESTQK 364 (518) Q Consensus 295 lg~-~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~---------~~~~fd~~~l~~~d~~~~~~~~~~ 364 (518) -.. ..+++++. 
.+.........+.-.+..++..||+.|+...-.- .+.+|.+......|.+..++.+.+ T Consensus 306 Ts~~g~~Gs~a~-~~vh~ev~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~f~~~e~eDl~~~a~~~~~ 384 (512) T protein:vir:19 306 TTEAGDKGARSL-GEVHDEVRREIRNADVGQLARSINRDLIYPLLALNSDSTIDINRLPGIVFDTSEAGDITALSDAIPK 384 (512) T ss_pred cccccccchhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCChhhHHHHHHHHHH Confidence 111 11222332 3344566777888999999999999887644211 123444455577888899999988 Q ss_pred HHhCC-CcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccc Q lcl|NC_021305. 365 MVNSG-VATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPT 443 (518) Q Consensus 365 ~~~~G-~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 443 (518) +. .| -++..++|+++|+|.-. ++.+.+..+.. . +............+.. + ..+ T Consensus 385 l~-~G~~i~~~~i~e~~Gip~~~-~~e~~~~~~~~-~---------------~~~~~~~~~~~~~~~~------~--~~~ 438 (512) T protein:vir:19 385 LA-AGMRIPVSWIQEKLHIPQPV-GDEAVFTIQPV-V---------------PDNGSQKEAALSAEDI------P--QED 438 (512) T ss_pred Hh-cCCCCCHHHHHHHhCCCCCC-CccccccCCCc-c---------------ccccccccccccccCC------C--chh Confidence 86 45 47899999999996432 22222211000 0 0000000000000000 0 000 Q ss_pred hhcchhhHHHHHHHHhhcccCCchhhHHHHHHHHHhhccccCcCchhHHHHHHH------------HHHHhHHHHhhhhh Q lcl|NC_021305. 444 NSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAVKGAMGRGKDIKGFALQLAEKY------------PDDLEDILLAVQLA 511 (518) Q Consensus 444 ~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~ 511 (518) .-++..+. ..+.+. ......+.+..++. ..+.+...-++++-| ...+.-=.+.|... T Consensus 439 ~~d~~~~~-~~~~~~---------~~~~~~~~i~~~~~-~~s~ee~~~~L~~l~~~ld~~~l~~~l~~a~~~A~l~G~~~ 507 (512) T protein:vir:19 439 DIDRMGVS-PEDWQR---------SVDPLLKPVIFSVL-KDGPEAAMNKAASLYPQMDDAELIDMLTRAIFVADIWGRLD 507 (512) T ss_pred hHhHHhhh-HHHHHH---------HHHHHHHHHHHHHH-hCCHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHhhhhh Confidence 00000000 000000 00111111111111 112222111222222 11111112222222 Q ss_pred hhccc Q lcl|NC_021305. 
512 LAERK 516 (518) Q Consensus 512 ~~~~~ 516 (518) ..+.. T Consensus 508 ~~~e~ 512 (512) T protein:vir:19 508 AAADH 512 (512) T ss_pred hhccC Confidence 22222 No 131 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=99.79 E-value=2.4e-18 Score=117.14 Aligned_cols=410 Identities=11% Similarity=0.012 Sum_probs=227.7 Q ss_pred CcCCCCCCC----Cccccccc--ch---hhhhhhccccccccc---------ccccchhhhHHHhhcHHHHHHHHHHHHh Q lcl|NC_021305. 1 MLLANGQTL----SAPAMAEL--SP---QMQDSYYYAPAVGMQ---------LERQFSLYGGIYKNQPWVRTVIAKRAQA 62 (518) Q Consensus 1 ~~f~~~~~~----~~~~~~~~--~~---~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~v~~~v~~ia~~ 62 (518) |- .+++.+ |.++.... .+ .++..+...+..|.. ...+..++.++ ++.+.|.+|++.+... T Consensus 1 m~-k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~~~ly~~m-~~D~hi~s~l~~Rk~a 78 (448) T protein:vir:79 1 MA-KRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDGLLVYHKM-LSDGTVKNALNYIFGR 78 (448) T ss_pred CC-CCCCCCccccCcccccccccchhhhhhhhhhcccccccccccchhHhhccccchHHHHHH-hhChHHHHHHHHHHHH Confidence 33 322222 22211110 01 011111112222211 11123455554 4589999999999999 Q ss_pred hccCceEEEEecCCcceeccchHHHHHHhcCCc---CCCHHHHHHHHHHHHHHcCCeEEEEEEcC--CCc--eEEEEeeC Q lcl|NC_021305. 63 LARLPVKCMFTSGDTETEESDTGYAKLLADPCE---YLDPFAFWEWVASTLDIYGETYLAIQKNK--SGT--PEKLMPMH 135 (518) Q Consensus 63 ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~---~~s~~~f~~~~v~~ll~~G~~~~~i~r~~--~G~--~~~l~~l~ 135 (518) |.+++|.|-..+++......-..+...+..++. ..++.+++.. +.+.+.+|.+++++++.. +|. 
+..|.+.+ T Consensus 79 v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~~~-~lda~~~G~s~~Eivw~~~~~g~~~~~~l~~r~ 157 (448) T protein:vir:79 79 IRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAI-YENAYIYGMAAGEIVLTLGADGKLILDKIVPIH 157 (448) T ss_pred HhcCCceEecCCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHHHH-HHHhhhhcceeEEEEeeecCCCceecccccccC Confidence 999999995422221111111223334444443 2456677766 445779999999999753 453 34566667 Q ss_pred CceeE-EEEcCCceeeEEeeecccc----cCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 136 PSRVA-IKRNSRTGRYEYYFQAGAG----VGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAM 210 (518) Q Consensus 136 p~~v~-v~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~ 210 (518) +..+. +..+.++...... ..... .+...+.++...++|+.+ ...+..+|.+.+..+...+.......++...| T Consensus 158 ~~~~~~f~~~~d~~l~~~~-~~~~~~~~~~~~~~~~lP~~~~i~~~~-~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f 235 (448) T protein:vir:79 158 PFNIDEVLYDEEGGPKALK-LSGEVKGGSQFVSGLEIPIWKTVVFLH-NDDGSFTGQSALRAAVPHWLAKRALILLINHG 235 (448) T ss_pred CccccceeeecCCceEEee-cCCcccccccCCCccccccceEEEEec-CccCCcccchhHHHHHHHHHHHHHHHHHHHHH Confidence 66432 1222222222111 11111 112234567788888875 45677899999999999999999999999999 Q ss_pred HHccCCcccccccCccCC--HHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHh Q lcl|NC_021305. 211 WKNAGRPNLVLRHEKRLS--EAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVY 288 (518) Q Consensus 211 ~~ng~~p~~il~~~~~~~--~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~f 288 (518) .+..|.|--+.+.+...+ +++++.+.+...+...| .++ .+|++.|++++-+.......++.++.++..++|+.+. 
T Consensus 236 ~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g-~~a--~~iiP~~~~ie~~ea~~~~~~~~~~i~~~d~~Isk~i 312 (448) T protein:vir:79 236 LERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQK-PRH--GIILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARAL 312 (448) T ss_pred HHHcCCceEEEecCCCCCcCHHHHHHHHHHHHHHhcC-Cce--EEEecCCceEEEEecCCCcccHHHHHHHHHHHHHHHH Confidence 999999988888876544 45666666655544333 233 4678999887777655555567788888888888865 Q ss_pred cCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhh-----cc-cccceecchhhhhcCHHHHHHHH Q lcl|NC_021305. 289 DIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYW-----VR-KNRMKFDIDDVIQPDWEAKSEST 362 (518) Q Consensus 289 gVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~-----~~-~~~~~fd~~~l~~~d~~~~~~~~ 362 (518) - -..+-.....+++++............+.-.+..|+..||+.|+... +. ..+.+|.+......|.+..++.+ T Consensus 313 L-GqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~li~~l~~lNfg~~~~~P~~~f~~~e~~Dl~~~a~~~ 391 (448) T protein:vir:79 313 G-IDFNTVQLNMGVQAINIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPSATRFPRLTFEMEERNDFSAAANLM 391 (448) T ss_pred h-hhhhccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCCCcEEEecCCChHHHHHHHHHh Confidence 2 12221112223333333333455667778889999999999887644 11 12335555566777889999999 Q ss_pred HHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCcccccc Q lcl|NC_021305. 363 QKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSP 442 (518) Q Consensus 363 ~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 442 (518) .+++..+-..-+-+|+.+|+|.- .++.+ .. .+...+...+.... ++.....- T Consensus 392 ~~l~~~~~~~~~~~~~~~~~p~~-~~~~~-~~-------------------------a~~~~~~~~~~~~~-~~~~~~~~ 443 (448) T protein:vir:79 392 GMLINAVKDSEDIPTELKALIDA-LPSKM-RR-------------------------ALGVVDEVREAVRQ-PADSRYLY 443 (448) T ss_pred hhhhccchhhHHHHHHhhcCCCC-CCCcc-cc-------------------------ccCCCCcccccccC-Cccccchh Confidence 99987765444446777777531 11110 00 00000000000000 01111111 Q ss_pred chhcc Q lcl|NC_021305. 
443 TNSDR 447 (518) Q Consensus 443 ~~~~~ 447 (518) .+..| T Consensus 444 ~~~~~ 448 (448) T protein:vir:79 444 TRRRR 448 (448) T ss_pred hcccC Confidence 11111 No 132 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=99.77 E-value=1.5e-18 Score=118.25 Aligned_cols=376 Identities=14% Similarity=0.100 Sum_probs=222.0 Q ss_pred CcCCCCCCCCcc---cccccchhhhhhhc-cccc---ccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEe Q lcl|NC_021305. 1 MLLANGQTLSAP---AMAELSPQMQDSYY-YAPA---VGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFT 73 (518) Q Consensus 1 ~~f~~~~~~~~~---~~~~~~~~~~~~~~-~~~~---~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~ 73 (518) |=+++.+++.-- +.+.-+.-+...+- ..+. .|-.+.....++.++..+.+.|.+|+..+...|.+++|.|-- T Consensus 3 ~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~av~~~~w~V~p- 81 (446) T protein:vir:98 3 MEVRNAPTPAIRRRTIYAMEHLGLATSYLSEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDSIALSVLNKVGPYQH- 81 (446) T ss_pred ccccCCCchhhhhhhhhccccchhhcccCCcchHhhhcCCChHHHHHHHHHHHhcchHHHHHHHHHHHHhhcCCceecC- Confidence 777765544211 11000000100000 0000 010011112456666678999999999999999999999942 Q ss_pred cCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCC-c--eE----EEEeeCCceeEEEEcCC Q lcl|NC_021305. 74 SGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSG-T--PE----KLMPMHPSRVAIKRNSR 146 (518) Q Consensus 74 ~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G-~--~~----~l~~l~p~~v~v~~~~~ 146 (518) + .++..+ .+..++... . .++....+.+.+.+|.++.++++...+ . +. 
.+..+.|..+....+.+ T Consensus 82 -~--~~~~a~-~v~~~l~~~--~---~~~~~~~~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~~~~~~~~~r~~~~~~ 152 (446) T protein:vir:98 82 -G--DKRIKK-FIDDQLRNR--A---KTWISHCVKSIMTYGFSLSEQIYAHGARDNMPATVLDDIVNYHPLQVMLIANDN 152 (446) T ss_pred -c--cHHHHH-HHHHHHhhc--C---chhHHHHHHHHHhhCceeeeEEEeecccccccchhhccccccccccceeeeccC Confidence 1 122222 233333322 1 233444477899999999999985432 1 11 11122222222222222 Q ss_pred ceeeEEe---------e-------------ecccccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHH Q lcl|NC_021305. 147 TGRYEYY---------F-------------QAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSR 204 (518) Q Consensus 147 ~~~~~~~---------~-------------~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~ 204 (518) +...... . .......+..+.++...++++++....+..||.|.+..++.......... T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~iP~~kfi~~~~~~~~~~p~G~gLlr~~~w~~~fK~~~~ 232 (446) T protein:vir:98 153 GRIVDGDTVTASQYKSGYWVPLPPYRIGDPPKKVDVVGSHVRLPSHKRLFINYNTKGNNPWGTSCLTSVLDYSIFKRAFR 232 (446) T ss_pred CccccccccchhhcccccccCcccchhhhhhhhcccCcccccccccceEEEEecCCCCCccccchHHHHHHHHHHHHhhH Confidence 1110000 0 00001123345688889999998888888999999999999999999999 Q ss_pred HHHHHHHHccCCcccccccCccCCHHHH---------HHHHHHHHHHhcCc-cccCCee---ecCCCcceeeccCChh-h Q lcl|NC_021305. 205 NATAAMWKNAGRPNLVLRHEKRLSEAAQ---------QRLREQFDRAHSGS-SNTGKTM---VVEEGMEPIPLQLTAV-E 270 (518) Q Consensus 205 ~~~~~~~~ng~~p~~il~~~~~~~~~~~---------~~~~~~~~~~~~g~-~n~g~~~---vl~~g~~~~~l~~~~~-d 270 (518) ++...|....|.|--+.+.+...++++. +...+.+..++... .+++.++ ++++|++++-+..... . 
T Consensus 233 ~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~da~~ii~~~~~P~g~eie~~ea~~~~~ 312 (446) T protein:vir:98 233 DMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRLSTDSGLVLTQLSKEQPVQVGALTTGNNFS 312 (446) T ss_pred HHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhccccceeeeecccCCCCceEEeeccccCCh Confidence 9999999999999999988765543222 12223344444322 2222222 2488988876654322 2 Q ss_pred HHHHHHHHHHHHHHHHHhcCCHHHhccccc--cccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhccc-------- Q lcl|NC_021305. 271 MQFIEARQLNREEVCGVYDIAPPIVHILDR--ATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRK-------- 340 (518) Q Consensus 271 ~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~-------- 340 (518) ..|.++.++..++|+.+.....-.+|.... ++++-. +.........+.-.++.|++.||+.|+...-.- T Consensus 313 ~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala-~vh~~V~~d~~~aDa~~i~~tln~~Li~~l~~lNf~~~~~~ 391 (446) T protein:vir:98 313 DSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRAS-EIQLELFDGKINSIFDTVIHAFTEQVIGNLIRLNFDPALYP 391 (446) T ss_pred hhHHHHHHHHHHHHHHHHhcccccccccccccchhhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccc Confidence 358899999999999988665434443322 333322 333455667788899999999999887544211 Q ss_pred -----ccceecchhhhhcCHHHHHHHHHHHHhCCCcCH---HHHHHHhCCCCCCCCC Q lcl|NC_021305. 341 -----NRMKFDIDDVIQPDWEAKSESTQKMVNSGVATP---NEGREIMGLPRSDDPK 389 (518) Q Consensus 341 -----~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~---NE~R~~~g~~p~~~~~ 389 (518) .+++|++. 
...|.+..++.+.+++..|++++ +.+|+.+|+|+-++.- T Consensus 392 ~~~~~~~~~~~~~--e~eDl~~~a~~~~~L~~~G~~~p~~~~~ire~~giP~~~~~~ 446 (446) T protein:vir:98 392 LASNTGYITRLPG--RATDLAALVEAIKQMHDMGFLVDGDKDHIRSITGLPDAISST 446 (446) T ss_pred cccccccceeccC--ChhhHHHHHHHHHHHHhCCccccccHHHHHHHhCcCCCCCCC Confidence 12233332 46788999999999999998765 4599999998664321 No 133 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=99.73 E-value=2.7e-16 Score=105.94 Aligned_cols=419 Identities=13% Similarity=0.084 Sum_probs=217.3 Q ss_pred CcCCCCCCCCcccccccchhhh-hhhcccc--ccc---------ccccccchhhhHHHhhcHHHHHHHHHHHHhhccCce Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQ-DSYYYAP--AVG---------MQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPV 68 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~-~~~~~~~--~~~---------~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~ 68 (518) |-=.. ...+.+.|.-. .....+. ..+ .+......++.++. +.+.|.+|++.+...|.+++| T Consensus 1 ~~~~~------~~~~gl~p~rl~~i~~~~~~~~~~~~~~~~~~~Lr~~~~~~ly~~m~-~D~hi~s~l~~Rk~av~~~~w 73 (488) T protein:vir:95 1 MADIT------ETQESLPPFRMGEVGSLGLKVKNGRIYEEPRQALRFPESIKTFQLMM-RDPAVAASVNIIKMFVRKVNW 73 (488) T ss_pred CCCcc------ccCCCCCHHHHHHHHHHhhccccchhhccchhhhcccchHHHHHHHh-hChHHHHHHHHHHHHHhcCCc Confidence 32111 11222333211 1110000 000 01122344666654 689999999999999999999 Q ss_pred EEEEecCCcceecc---chHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCC-------------Cc--eEE Q lcl|NC_021305. 69 KCMFTSGDTETEES---DTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS-------------GT--PEK 130 (518) Q Consensus 69 ~v~~~~~~~~~~~~---~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~-------------G~--~~~ 130 (518) +|...++....... -..+...+. |...++.+++..++ +.+.+|.+++++++... |. +.. 
T Consensus 74 ~v~p~~~~~~d~~~~~~a~~v~~~l~--~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~~~~~ 150 (488) T protein:vir:95 74 RFVPPKGKEQDPKMLERADFFNSLMD--DMEHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLIGWAK 150 (488) T ss_pred eEecCCCCchhHHHHHHHHHHHHHHh--ccCccHHHHHHHHH-HhhcccceeeeeeeeccccccccccccccCCeeeeee Confidence 99643322111111 111222221 23346778888865 67899999999998542 22 445 Q ss_pred EEeeCCc---eeEEEEcCCceeeEEeeecc------c----ccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHH Q lcl|NC_021305. 131 LMPMHPS---RVAIKRNSRTGRYEYYFQAG------A----GVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTI 197 (518) Q Consensus 131 l~~l~p~---~v~v~~~~~~~~~~~~~~~~------~----~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i 197 (518) +.+.++. ++.+..+.. .......... . ......+.+++..+|+.++....+..+|.+.+..+.... T Consensus 151 i~~Rpq~~~~~f~~d~d~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~kfi~~~~~~~~g~p~g~gLlr~~~w~~ 229 (488) T protein:vir:95 151 LPIRNQSTLDKWYFDEDFR-RVTGVRQNLRNVSHIAGAINLGERPLTRKLPRAKFMLFKYDDEYGNPEGRSPLLNAYVPW 229 (488) T ss_pred eeecCcccccceeeccCCC-ceeecccccccccccccccccccccccccccccceEEEeecCCCCccchhhHHHHHHHHH Confidence 5555553 233222221 1111000000 0 011223457778887777777778889999999999999 Q ss_pred HHHHHHHHHHHHHHHccCCcccccccC----ccCCHHHHHHHHHHHHHHhcCc-cccCCeeecCCCccee---------e Q lcl|NC_021305. 198 FSEDSSRNATAAMWKNAGRPNLVLRHE----KRLSEAAQQRLREQFDRAHSGS-SNTGKTMVVEEGMEPI---------P 263 (518) Q Consensus 198 ~~~~~~~~~~~~~~~ng~~p~~il~~~----~~~~~~~~~~~~~~~~~~~~g~-~n~g~~~vl~~g~~~~---------~ 263 (518) .......++...|....+.|--+...+ ...++++...+.+...+..... .+...-++++.|+.+. 
- T Consensus 230 ~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag~iiP~g~~~~~k~~~~e~~l 309 (488) T protein:vir:95 230 KYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRAGLIWPRYIDPDTKEDIFEFSL 309 (488) T ss_pred HHHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchhheeeccccccccchhhhhhhc Confidence 999999999999998765544444332 2334444554544444432111 1111224666665432 2 Q ss_pred ccC-ChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhc---- Q lcl|NC_021305. 264 LQL-TAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWV---- 338 (518) Q Consensus 264 l~~-~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~---- 338 (518) ++. ......|.++.++..++|+.+.--.---.+....++++. .+.........+.-.++.|++.||+.|+...- T Consensus 310 ~~~~~~~~~~~~~li~~~d~~Isk~iLGqtLT~~~~~~Gs~Al-~~vh~ev~~~i~~aDa~~i~~tln~~li~~l~~~Nf 388 (488) T protein:vir:95 310 VSRQGAKAYDTGSIIDRYSKQIMMAFMSDVLAMGQSKYGSFSL-ADSKTSLLAMSVDILLKQIKNVINRDLVAQTYALNM 388 (488) T ss_pred cccccCCchhHHHHHHHHHHHHHHHHhccccccccCcchhhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 222 223334777888888888886522110011111223332 33445667778888999999999998876541 Q ss_pred --ccccceecchhhhhcCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCCCcceeeecccccccccccccCCC Q lcl|NC_021305. 339 --RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATP-----NEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVE 411 (518) Q Consensus 339 --~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~-----NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~ 411 (518) ...+.+|.++.....|.++.++.+.+++..|+.-+ +.+|+.+|+|+-+ .+.....+.. 
+ T Consensus 389 g~~~~~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gip~~~--~~e~~~~~~~--~---------- 454 (488) T protein:vir:95 389 WDDEEHVQITYDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIGLPPAD--ESQPVSEKLS--P---------- 454 (488) T ss_pred CCCCCccEEEecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCC--CCccccccCC--C---------- Confidence 12345666667778899999999999999998664 5699999998543 2222211110 0 Q ss_pred CCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcc Q lcl|NC_021305. 412 WEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQK 462 (518) Q Consensus 412 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k 462 (518) +..+.. +......++......+. + +. ..+.+++ | T Consensus 455 -~~~~~~--~~~~~~~~~~~~~~~~~---~----~~---~~a~~~~----~ 488 (488) T protein:vir:95 455 -NSQSRS--GDGYKTAGEGTAKTPSA---K----DP---STANKAN----K 488 (488) T ss_pred -CCCCCC--CcccCCCcccCCccccc---c----cc---hhhhhcc----C Confidence 000000 00000000000000000 0 00 0000010 1 No 134 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=99.62 E-value=1.4e-15 Score=102.05 Aligned_cols=375 Identities=13% Similarity=0.120 Sum_probs=182.3 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) ++-+-|-.+.+ .| ..| |.+..........+|..+.....+|+.+++.+-.--..+. .+.+.... 
T Consensus 28 ~~~glg~~r~~-------~~--~~~------g~~~~~~~~~l~~~Yr~~~ia~~iVd~~~d~~~~~~~~i~-~g~~~~~~ 91 (449) T protein:vir:10 28 PTMGLDNKRHS-------AW--CEY------GFPELVTYENLYSLYRRGGIAHGAVEKLVGKCWQTNPEII-EGDDADDS 91 (449) T ss_pred HHhcCCcccch-------hh--hhc------CCcccCCHHHHHHHHhcCchhHHHHHhhhhhhhhcCcccc-cCccccch Confidence 11111110000 00 011 2222223334456788899999999999987622211222 11111111 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEE-EcC---------CCceEEEEeeCCceeEEEEc------ Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQ-KNK---------SGTPEKLMPMHPSRVAIKRN------ 144 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~-r~~---------~G~~~~l~~l~p~~v~v~~~------ 144 (518) .....+...+.+=+...-|..+.... ..-.++|-+++++. ++. .+.+..+.|+....+++... T Consensus 92 ~~~~~~e~~~~~l~~~~~~~~l~ea~-~~~rl~Gga~i~i~v~d~~~l~~Pl~~~~~i~~i~v~~~~~i~~~~~~~dp~s 170 (449) T protein:vir:10 92 EDETSWEKKSKQVFTNRLWRSFAEAD-RRRLVGRYAGILLHIRDEKDWNLPATKGRGLQKVSVSWAGSLKVAEWDTGINS 170 (449) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHH-HhhhccCcEEEEEEecCCCCCCcccccCcceeeEEeeccccCChhhhhcCCCC Confidence 11111111111100001122222222 22345777766654 332 22456666666554443221 Q ss_pred -CCceeeEEeeeccc-ccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHH-HHHHHHHHccCCcc--- Q lcl|NC_021305. 145 -SRTGRYEYYFQAGA-GVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSR-NATAAMWKNAGRPN--- 218 (518) Q Consensus 145 -~~~~~~~~~~~~~~-~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~-~~~~~~~~ng~~p~--- 218 (518) ..+.+..|.+.... ...+..+.+.++.|+||-... ..|.|.++.+++.+.....+. .+...++++-.+-. 
T Consensus 171 p~yg~P~~y~v~~~~~g~~~~~~~iH~SRl~~~~~~~----~~g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~ 246 (449) T protein:vir:10 171 KTYGQPKLWKYTERLPNGSSRRVDIHPDRVFILGDYS----EDAIGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVN 246 (449) T ss_pred CCCCCceEEEEeeeccCCCccceeeccceeEeecCCC----CCChhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhh Confidence 12333344333211 112344568889999885432 237888988887653333322 23333333322111 Q ss_pred --------cccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021305. 219 --------LVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDI 290 (518) Q Consensus 219 --------~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgV 290 (518) ++....+....+..+++.+......+|. + .+++..+-+|+.+..++.+.. +.......+||++-+| T Consensus 247 ~~~~~~~~~l~~~~~~~~e~~~~~~~~~~~~~~~~~---~-~~~i~~~~d~~~~~~~~sgl~--d~l~~~~q~iaaa~~I 320 (449) T protein:vir:10 247 FEKEIDFTNLASLYGVSIDELQDKFNEVAGEINRGN---D-VLMTTQGATVTPLVTSVADPT--ATYNVNLQTAAAGVDI 320 (449) T ss_pred hhhhhhhhhhhHHhhCCchHHHHHHHHHHHHHhccc---h-heeecCCcceEEEecccCChh--HHHHHHHHHHHHHhCC Confidence 1111111112233344544444433332 2 344567778988888887754 6677788889999999 Q ss_pred CHH-HhccccccccCCHHHHHHHHHHHH------hhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHH-- Q lcl|NC_021305. 291 APP-IVHILDRATFSNISAQMRAFYRDT------MAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSES-- 361 (518) Q Consensus 291 Pp~-~lg~~~~~~~sn~e~~~~~~~~~~------l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~-- 361 (518) |.. |+|...++..++ + ....||..+ +.|.++.+-+.|-+.-+... 
...+.|.+.+|...+.++++++ T Consensus 321 P~t~L~Gqsp~glnst-~-D~~nyyd~i~~~Q~~l~p~le~l~~~l~~s~~g~~--~~d~~i~f~pL~~~t~kEkAei~k 396 (449) T protein:vir:10 321 PTRILIGNQQAERSST-E-DQKYFNARCQSRRVDLSFEIEDFCDKLIELKIIDA--VAKKAVIWDDLNEQTGTEKLTNAK 396 (449) T ss_pred CeeeeeccCccccccc-h-hHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC--CCceeEEeCCCCCCCHHHHHHHHH Confidence 976 555554443333 3 345555433 56777666655543322211 2357788889999998888654 Q ss_pred -----HHHHHhCC---CcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccc Q lcl|NC_021305. 362 -----TQKMVNSG---VATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSP 433 (518) Q Consensus 362 -----~~~~~~~G---~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (518) +++++++| +++.+|+|+.+|++|.+. +.+ .. ....+.+++ .+++ T Consensus 397 ~~A~a~~~~~~ag~~~~~~~~EiR~~~~~~~~~~---~~~---------~~--e~~de~~~~---~d~~----------- 448 (449) T protein:vir:10 397 TMGEINQTMLGSGDNPAFSREEIRTAAGYDNDDE---EPL---------GE--EDGDEEDKA---TDSA----------- 448 (449) T ss_pred HHHHHHHHHHHccccCCcCHHHHHHHhcccCCCC---CCC---------CC--CCCcccccc---CCcC----------- Confidence 44567666 899999999999988642 100 00 000000000 0000 Q ss_pred cCC Q lcl|NC_021305. 434 PTS 436 (518) Q Consensus 434 ~~~ 436 (518) + T Consensus 449 --a 449 (449) T protein:vir:10 449 --A 449 (449) T ss_pred --C Confidence 0 No 135 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=99.60 E-value=5.1e-16 Score=104.40 Aligned_cols=493 Identities=13% Similarity=0.066 Sum_probs=225.1 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccc--cccccccccc--hhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEec-- Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAP--AVGMQLERQF--SLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTS-- 74 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~--~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~-- 74 (518) -+|---.+.-.|..+..-..-.+ |++.. 
...+-..++| .-+.....++|-+++|+..|++.+.+- |.-...+ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~-w~~~~~~~~ 148 (698) T protein:vir:10 71 RQFEVDVSNYTPRERRAASYALD-FNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRT-WGEAIGGTK 148 (698) T ss_pred ccceeccccCCccccchhhhhhc-ccccccccchhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcc-cceeccccc Confidence 11111111111111000000000 00000 0001111111 122345677888999999999998776 5332111 Q ss_pred -----------CCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEE-c----------------CCC Q lcl|NC_021305. 75 -----------GDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQK-N----------------KSG 126 (518) Q Consensus 75 -----------~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r-~----------------~~G 126 (518) ++.....+.....+|...-....-+..|.+.+.++. +||-+.+++.- . ..| T Consensus 149 e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eai~~aR-lfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kG 227 (698) T protein:vir:10 149 EKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQ-AFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKG 227 (698) T ss_pred hhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcc-cccceEEEEEeecCccccccccccccccccCc Confidence 111111111233334333233333344445444444 55655544432 1 134 Q ss_pred ceEEEEeeCCceeEEEEcCCcee---eEEeeecccccCceeEEeccccEEEEeccCC------CCcccCchHHHHHHHHH Q lcl|NC_021305. 127 TPEKLMPMHPSRVAIKRNSRTGR---YEYYFQAGAGVGTQLVSFADDEVVPIRFFNP------DGLERGLSLMESLKSTI 197 (518) Q Consensus 127 ~~~~l~~l~p~~v~v~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~evih~~~~~~------~~~~~G~s~l~~~~~~i 197 (518) ....|.+++|..|++........ .+|...+....+. .+..+.++.|..... 
.....|+|..+.+.+.+ T Consensus 228 slKGL~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~G~---~IH~SRL~~~vg~pvpd~LKp~y~f~G~Sv~q~~~e~V 304 (698) T protein:vir:10 228 SFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMIGS---EVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYI 304 (698) T ss_pred cceeeeeecccccccchhhhccchhhccCCCceEEEecc---eecceeEEEecCCCchhhhcchhccCCccHHHHHHHHH Confidence 56678899998888754321111 1111111111122 356666665543321 12246999999999999 Q ss_pred HHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHH--HHHHHhcCccccCCeeecC-CCcceeeccCChhhHHHH Q lcl|NC_021305. 198 FSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLRE--QFDRAHSGSSNTGKTMVVE-EGMEPIPLQLTAVEMQFI 274 (518) Q Consensus 198 ~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~--~~~~~~~g~~n~g~~~vl~-~g~~~~~l~~~~~d~~~~ 274 (518) .....+..........-.........-..+++.....+.. ++.+++++ |. ++.+++ ++.+|++.+.+....+ T Consensus 305 ~~~~rT~~~v~~Li~~~~~~~l~~dla~aL~~g~~~~l~~R~eli~~~Rs--n~-G~~llDk~~Eefeq~st~lSGLd-- 379 (698) T protein:vir:10 305 DNWLRTRQSVSDIVKQFSVSGILMDLAQALTPGANVDLSMRAELINRYRD--NR-NILFLDKATEEFFQFNTPLSGLD-- 379 (698) T ss_pred HHHHHHhhhHHHHHHHhhHHHHHHHHHHhcCChhhHHHHHHHHHHHHhcC--cc-ceEEEecCCcceEEEecCcCCHH-- Confidence 9888777777666544222221111112223333223333 34445543 33 456677 5789999988877755 Q ss_pred HHHHHHHHHHHHHhcCCHH-HhccccccccCCHHHHHHHHHHH-------HhhHHHHHHHHHHHHhhhhhhcccccceec Q lcl|NC_021305. 275 EARQLNREEVCGVYDIAPP-IVHILDRATFSNISAQMRAFYRD-------TMAIPIARIQSAMDKYVGQYWVRKNRMKFD 346 (518) Q Consensus 275 e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~sn~e~~~~~~~~~-------~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd 346 (518) +........||.+-+||.. |+|.+-.+-+++.+...++||.. -++|.++.+-+.|-+..+..... .+.|. 
T Consensus 380 dVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idp--~i~~~ 457 (698) T protein:vir:10 380 ALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDP--SIKWQ 457 (698) T ss_pred HHHHHHHHHHHhhhcCchhhhhccCCcccCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC--cceEE Confidence 7788888999999999965 67777777778888888888875 58899988877776666554433 45667 Q ss_pred chhhhhcCHHHHHHH-------HHHHHhCCCcCHHHHHHHhCCCCCC-----CCCcceeeecc-cccccccc-cccCCCC Q lcl|NC_021305. 347 IDDVIQPDWEAKSES-------TQKMVNSGVATPNEGREIMGLPRSD-----DPKADELYANS-ALQPLGAT-PDGAVEW 412 (518) Q Consensus 347 ~~~l~~~d~~~~~~~-------~~~~~~~G~~T~NE~R~~~g~~p~~-----~~~gD~~~~~~-n~~~~~~~-~~~~~~~ 412 (518) +.+|.+.+.++++++ ...++..|+++++|+|.++.-+|-- ...-|++..|. |.+..... ......+ T Consensus 458 fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~d~~d~p~~~~~~~~~~~~~~~~~~~~~ 537 (698) T protein:vir:10 458 WNALRELDDLEVAEARYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGAPADDDIDGVLTYVQRMAEG 537 (698) T ss_pred eCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhccCCCccccccCCcccCCCCCCCcchHHHhhhcCCcCC Confidence 778888887777665 4467889999999999998554311 00112222222 22211100 0011111 Q ss_pred CCCCCCCC-----CccC-CCCCCCccccCCccccccchhc-chh--hHHHHHHHHhhcccCCchhhHHHHHHHHHhhccc Q lcl|NC_021305. 413 EEAPAPKR-----PAST-PVASLDQSPPTSVPGLSPTNSD-RST--DSGKTEPRRLMQKPPPKESSPKHLRAVKGAMGRG 483 (518) Q Consensus 413 ~~~~~~~~-----~~~~-~~~~~~~~~~~~~~~~~~~~~~-~~~--~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~ 483 (518) .+.+.+.. ++.+ |++-.+..++............ +.. --.....-+...++.+ ++. 
.-.+.+-.| T Consensus 538 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~giv~~~g~~vLL~~r~~g-~W~-----lPgG~ie~G 611 (698) T protein:vir:10 538 GDTGAPTAPGGARAGATAPPAAANVNANANPREAGAQDAAMRAAGIVFRAGDKVLLMKRPAG-DWG-----LPAGKVEDG 611 (698) T ss_pred CCcccccccccccCCCCCCcccccccCCCCccccCcccceeeEEEEEEEcCCeEEEEEecCC-Ccc-----cCccccCCC Confidence 11111111 1111 1111111221111111110000 000 0000000011122211 110 111111112 Q ss_pred cCcCchhHHHHHHHHHHHhHHHHhhhhhhhccc--CC Q lcl|NC_021305. 484 KDIKGFALQLAEKYPDDLEDILLAVQLALAERK--DN 518 (518) Q Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 518 (518) ++.+. .+.|+-.|..-+.+...++.-. +. T Consensus 612 Et~~~------aa~RE~~EEtG~~~~~~l~~~g~~de 642 (698) T protein:vir:10 612 ETPEE------AARRETLEETGHAGDYVLAPLGKYDE 642 (698) T ss_pred CCHHH------HHHHHHHhhcccccchhhhcccccce Confidence 22211 1223333333333332222211 11 No 136 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=99.56 E-value=2.5e-15 Score=100.59 Aligned_cols=485 Identities=11% Similarity=0.030 Sum_probs=218.4 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccc--cccccccccc--hhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEec-- Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAP--AVGMQLERQF--SLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTS-- 74 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~--~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~-- 74 (518) -+|---.+.-.|..+..-..-.+ |++.. 
...+-..++| .-+.....++|-+++|+..|++.+.+- |.-...+ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~-w~~~~~~~~ 148 (695) T protein:vir:78 71 RQFEVDVSNYTPRERRAASYALD-FNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRT-WGEAIGGTK 148 (695) T ss_pred eeceeccccCCccccchhhhhhc-ccccccccchhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcc-cceeccccc Confidence 11111111111111000000000 00000 0001111111 122345677888999999999998776 5332111 Q ss_pred -----------CCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEE-c----------------CCC Q lcl|NC_021305. 75 -----------GDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQK-N----------------KSG 126 (518) Q Consensus 75 -----------~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r-~----------------~~G 126 (518) ++.....+.....+|...-....-+..|.+.+.+ --+||-+.+++.- . ..| T Consensus 149 e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL~V~~~l~eaik~-aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kG 227 (695) T protein:vir:78 149 EKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIH-DQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKG 227 (695) T ss_pred hhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHh-hccccceEEEEEeccCccccccccccccccccCc Confidence 1111111112333343332333333344444444 4456666555432 1 134 Q ss_pred ceEEEEeeCCceeEEEEcCCcee---eEEeeecccccCceeEEeccccEEEEeccCC------CCcccCchHHHHHHHHH Q lcl|NC_021305. 127 TPEKLMPMHPSRVAIKRNSRTGR---YEYYFQAGAGVGTQLVSFADDEVVPIRFFNP------DGLERGLSLMESLKSTI 197 (518) Q Consensus 127 ~~~~l~~l~p~~v~v~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~evih~~~~~~------~~~~~G~s~l~~~~~~i 197 (518) .+..|.+++|..|++........ .+|...+.... + ..+..+.++.|..... 
.....|+|..+.+.+.+ T Consensus 228 slKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~-G--~kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V 304 (695) T protein:vir:78 228 SFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI-G--TEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYI 304 (695) T ss_pred ceeeeEeecccccccchhhhccchhhccCCCceEEEe-c--eEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHH Confidence 56678899998888754321111 11111111111 1 2456666665653322 12346999999999999 Q ss_pred HHHHHHHHHHHHHHHccCCcccccc-cCccCCHHHHHHHH--HHHHHHhcCccccCCeeecC-CCcceeeccCChhhHHH Q lcl|NC_021305. 198 FSEDSSRNATAAMWKNAGRPNLVLR-HEKRLSEAAQQRLR--EQFDRAHSGSSNTGKTMVVE-EGMEPIPLQLTAVEMQF 273 (518) Q Consensus 198 ~~~~~~~~~~~~~~~ng~~p~~il~-~~~~~~~~~~~~~~--~~~~~~~~g~~n~g~~~vl~-~g~~~~~l~~~~~d~~~ 273 (518) .....+..........-. ..++.. .-..+.+.....+. -++.+++++ |. ++.+++ +..+|.+.+.+....+ T Consensus 305 ~~~~rT~~~v~~Li~~~~-v~~lk~dla~~L~~g~~~~l~~R~eli~~~Rs--n~-G~~llDk~~Eefeq~stslSGLd- 379 (695) T protein:vir:78 305 DNWLRTRQSVSDIVKQFS-VSGILMDLAQALMPGANVDLSMRAELINRYRD--NR-NILFLDKATEEFFQFNTPLSGLD- 379 (695) T ss_pred HHHHHHHhHHHHHHHhhh-hHHHHHHHHHhhcChhHHHHHHHHHHHHHhcC--cc-ceEEEecCCcceEEEecccCCHH- Confidence 888877777776665422 222211 11122222222222 334445543 33 466678 5789999888777755 Q ss_pred HHHHHHHHHHHHHHhcCCHH-HhccccccccCCHHHHHHHHHHH-------HhhHHHHHHHHHHHHhhhhhhccccccee Q lcl|NC_021305. 274 IEARQLNREEVCGVYDIAPP-IVHILDRATFSNISAQMRAFYRD-------TMAIPIARIQSAMDKYVGQYWVRKNRMKF 345 (518) Q Consensus 274 ~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~sn~e~~~~~~~~~-------~l~P~~~~ie~~l~~~l~~~~~~~~~~~f 345 (518) +......+.||.+-+||.. |+|.+-++-+++.|...++||.. -++|.++.+-+.|-+..+..... 
.+.| T Consensus 380 -dVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idp--di~~ 456 (695) T protein:vir:78 380 -ALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDP--SIKW 456 (695) T ss_pred -HHHHHHHHHHHhhhcCchhhhhccCCccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC--cceE Confidence 7788888999999999965 67777777778888888888875 58899988877776666554433 3566 Q ss_pred cchhhhhcCHHHHHHH-------HHHHHhCCCcCHHHHHHHhCCCCCCC-----CCcceeeecccc-cccc-cccccCCC Q lcl|NC_021305. 346 DIDDVIQPDWEAKSES-------TQKMVNSGVATPNEGREIMGLPRSDD-----PKADELYANSAL-QPLG-ATPDGAVE 411 (518) Q Consensus 346 d~~~l~~~d~~~~~~~-------~~~~~~~G~~T~NE~R~~~g~~p~~~-----~~gD~~~~~~n~-~~~~-~~~~~~~~ 411 (518) .+.+|.+.+.++++++ ...++..|+++++|+|.++.-+|--. .-.|++-+|... ++.. ....+..+ T Consensus 457 ~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~ 536 (695) T protein:vir:78 457 QWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAE 536 (695) T ss_pred EeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCccc Confidence 6778888777777655 45688999999999999987654210 011222222211 0000 00000000 Q ss_pred CCCCCCCCCCccCCCCCCCccccCCccccccch--hcchhhHHH-------HHHH-HhhcccCCchhhHHHHHHHHHhhc Q lcl|NC_021305. 412 WEEAPAPKRPASTPVASLDQSPPTSVPGLSPTN--SDRSTDSGK-------TEPR-RLMQKPPPKESSPKHLRAVKGAMG 481 (518) Q Consensus 412 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~-------~~~~-~~~~k~~~~~~~~~~~~~~~~~~~ 481 (518) + +..+++++ +.++...+++-++....... .+..+-... ...+ +..+++.+. 
+ .+=.+.+- T Consensus 537 ~---~~~~~~~~-~~~g~~~~~~~~~~~~~~~~~~ag~~~~~~~aag~v~~~~g~vLl~kr~~g~--W----~lPgG~vE 606 (695) T protein:vir:78 537 G---GDTGAPGG-ARAGATAPPTVANVNANVKPREAGAQDAAMRAAGAVYVVDGKVLLMKRPAGD--W----GLPAGKVE 606 (695) T ss_pred c---cccCCCCC-CCCCCCCCCceeeeeccccccccCCCCcccceeEEEEEeCCEEEEEEecCCC--c----cCCccccC Confidence 0 00011110 00000000000000000000 000000000 0000 001111111 1 01111222 Q ss_pred cccCcCchhHHHHHHHHHHHhHHHHhhhhhhhcccCC Q lcl|NC_021305. 482 RGKDIKGFALQLAEKYPDDLEDILLAVQLALAERKDN 518 (518) Q Consensus 482 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (518) .+++.+..++| +.+++. ++... .+-... T Consensus 607 ~gEt~~~aa~R---E~~EEt-----Gl~~~-~el~~~ 634 (695) T protein:vir:78 607 GNETPEEAARR---ETREET-----GYDHD-GELVPL 634 (695) T ss_pred CCCCHHHHHHH---HHHHHh-----CCccc-cceeee Confidence 23333332222 222222 22110 000001 No 137 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=99.56 E-value=1.5e-14 Score=96.37 Aligned_cols=325 Identities=11% Similarity=0.053 Sum_probs=177.7 Q ss_pred EEEEEEcCCC---ceEEEEeeCCceeE-EEEcCCceeeEEeeecccccCceeEEeccccEEEEeccCCCCcccCchHHHH Q lcl|NC_021305. 117 YLAIQKNKSG---TPEKLMPMHPSRVA-IKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMES 192 (518) Q Consensus 117 ~~~i~r~~~G---~~~~l~~l~p~~v~-v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~ 192 (518) +.++++...+ .+..|.+.++.++. ...+.++....... ....+...+.+++..+|++++....+..+|.+.+.. 
T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~~~~~~~f~~~~~~~l~~~~~--~~~~g~~~~~lp~~kfi~~~~~~~~g~p~G~gLlr~ 78 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRPPRTISRFDVAPDGGLVAIEQ--WGVFGKATVRIPVDRLVVFVNEREGANWLGQSLLRQ 78 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecCccceeeeeeccCCceeEEEe--cCCCCCCcceeccCCEEEEEeCCCCCCccchhhHHH Confidence 7888876544 36678888887554 33344443332221 122334456788888888887777778899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHccCC--cccccccCccCCHHH-------HHHHHHHHHHHhcCc-cccCCeeecCCCccee Q lcl|NC_021305. 193 LKSTIFSEDSSRNATAAMWKNAGR--PNLVLRHEKRLSEAA-------QQRLREQFDRAHSGS-SNTGKTMVVEEGMEPI 262 (518) Q Consensus 193 ~~~~i~~~~~~~~~~~~~~~ng~~--p~~il~~~~~~~~~~-------~~~~~~~~~~~~~g~-~n~g~~~vl~~g~~~~ 262 (518) ++..+.......++...|.+..+. |-++.......++++ .+..++......... ......+|++.|++++ T Consensus 79 ~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a~~iip~g~~ie 158 (355) T protein:vir:78 79 AYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEAAGGYIPHGANFT 158 (355) T ss_pred HHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcceeEeecCCceEE Confidence 999999999999999999998744 444444332222111 112222222222110 0111356789999888 Q ss_pred eccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc--cccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhh--- Q lcl|NC_021305. 263 PLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDR--ATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYW--- 337 (518) Q Consensus 263 ~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~--- 337 (518) -+.......++.++.++..++|+.++.-. .+-...+. ++++- .+.........+.-.+..|++.||+.|+... 
T Consensus 159 ~~ea~g~~~~~~~~i~~~d~~Isk~iLGq-tlTs~~~~~gGS~Al-g~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~l 236 (355) T protein:vir:78 159 LTGVQGKLPEMDGPIRYHDEQIARAVLAH-FLTLGGDKSTGSYAL-GDTFASFFTGSLNAVMKHIADVTQQHVVEDLVDQ 236 (355) T ss_pred EeecCCCcccHHHHHHHHHHHHHHHHhhh-hhccccCCccchhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 77666666678889999999998877332 33221111 22222 3344567777888889999999998877643 Q ss_pred --cc-cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHH-----HHHHHhCCCCCCCCCcceeeecccccccccccccC Q lcl|NC_021305. 338 --VR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPN-----EGREIMGLPRSDDPKADELYANSALQPLGATPDGA 409 (518) Q Consensus 338 --~~-~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~N-----E~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~ 409 (518) +. ..+.+|.+.... .+.++.++.+.+++..|+..++ .+|+.+|+|.-. .++....+.. .. T Consensus 237 N~~~~~~~P~~~~~~~~-~~~~~~a~~~~~l~~~G~~~~~~~~~~~~~e~~gip~p~--~~~~~~~~~~---------~~ 304 (355) T protein:vir:78 237 NWGPEEPAPRLVPAQLG-KEQPVTAEAIRALVECGAFTADPELEKDLRARYGLPAPA--ERDDGADAAA---------AK 304 (355) T ss_pred cCCCCCCCCEEEecCcC-hhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCC--CCCcccCCcc---------cc Confidence 11 223445554444 4556789999999999987654 479999997432 2222211100 00 Q ss_pred CCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhccc---CCchh Q lcl|NC_021305. 410 VEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKP---PPKES 468 (518) Q Consensus 410 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~---~~~~~ 468 (518) .......... ++... 
..+.....+ +.+..++..-...+.+..+ .+.++ T Consensus 305 ~~~~~~~~~~-~~~~~----~~~~~a~~~------~a~~~~~~~~~~~~~~~~~~~~~~~~~ 355 (355) T protein:vir:78 305 AAGRRRAKRL-PGQRQ----GAALPSRSP------RADPPRRRGPLRRRPRHPAHRRCAPDG 355 (355) T ss_pred cccccccccc-CCccc----cccccccCC------CCCChhhhHHHHHHhhccccCCCCCCC Confidence 0000000000 00000 000000000 1111111111111222222 33444 No 138 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=99.55 E-value=4.6e-15 Score=99.17 Aligned_cols=456 Identities=12% Similarity=0.052 Sum_probs=212.3 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccc--cccccccccc--hhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEec-- Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAP--AVGMQLERQF--SLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTS-- 74 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~--~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~-- 74 (518) -+|---.+.-.|..+..-....+ |++.. ...+-..++| .-+.....++|-+++|+..|++.+.+- |.-...+ T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~-w~~~~~~~~ 147 (694) T protein:vir:10 70 RQFEVDVSNYTPRERRAASYALD-FNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRT-WGEAIGGTK 147 (694) T ss_pred hhccccccCCCccccchhhhhhc-cCcccccchhhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcc-cceeccccc Confidence 22221111111111110001111 00000 0001111111 122345677888999999999998776 5332111 Q ss_pred -----------CCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEE-c----------------CCC Q lcl|NC_021305. 75 -----------GDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQK-N----------------KSG 126 (518) Q Consensus 75 -----------~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r-~----------------~~G 126 (518) +++....+.....+|...-....-+..|.+.+. +--+||-+.+++.- . 
..| T Consensus 148 e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eaik-~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kG 226 (694) T protein:vir:10 148 EKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVI-HDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKG 226 (694) T ss_pred hhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHH-hhccccceEEEEEeecCccccccccccccccccCc Confidence 111111111233334333233333334444444 44456666555432 1 134 Q ss_pred ceEEEEeeCCceeEEEEcCCcee---eEEeeecccccCceeEEeccccEEEEeccCC------CCcccCchHHHHHHHHH Q lcl|NC_021305. 127 TPEKLMPMHPSRVAIKRNSRTGR---YEYYFQAGAGVGTQLVSFADDEVVPIRFFNP------DGLERGLSLMESLKSTI 197 (518) Q Consensus 127 ~~~~l~~l~p~~v~v~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~evih~~~~~~------~~~~~G~s~l~~~~~~i 197 (518) .+..|.+++|..|++........ .+|...+.... + ..+..+.++.|..... .....|+|..+.+...+ T Consensus 227 slKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~-G--~~IH~SRL~~f~g~plPd~LKp~y~~~G~Sv~q~~~e~V 303 (694) T protein:vir:10 227 SFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI-G--TEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYI 303 (694) T ss_pred ceeeeEeecccccccchhhhccchhhccCCCceEEEe-c--eEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHH Confidence 56678899998888754321111 11111111111 1 2456666665653322 12246999999999999 Q ss_pred HHHHHHHHHHHHHHHccCCcccccc-cCccCCHHHHHHH--HHHHHHHhcCccccCCeeecC-CCcceeeccCChhhHHH Q lcl|NC_021305. 198 FSEDSSRNATAAMWKNAGRPNLVLR-HEKRLSEAAQQRL--REQFDRAHSGSSNTGKTMVVE-EGMEPIPLQLTAVEMQF 273 (518) Q Consensus 198 ~~~~~~~~~~~~~~~ng~~p~~il~-~~~~~~~~~~~~~--~~~~~~~~~g~~n~g~~~vl~-~g~~~~~l~~~~~d~~~ 273 (518) .....+..........-. ..++.. .-..+.+.....+ |-++.+++++ |. 
++.+++ +..+|.+.+.+....+ T Consensus 304 ~~~~rT~~~v~~Li~~~~-v~~lk~dla~~L~~g~~~~l~~R~eli~~~Rs--n~-G~~llDk~~Eefeq~stslSGLd- 378 (694) T protein:vir:10 304 DNWLRTRQSVSDIVKQFS-VSGILMDLAQALMPGANVDLSMRAELINRYRD--NR-NILFLDKATEEFFQFNTPLSGLD- 378 (694) T ss_pred HHHHHHHhHHHHHHHhhh-hHHHHHHHHHhhcChhHHHHHHHHHHHHHhcC--cc-ceEEEecCCcceEEEecccCCHH- Confidence 888877777776664422 222211 1112222222222 2334445543 33 466678 5789999888777755 Q ss_pred HHHHHHHHHHHHHHhcCCHH-HhccccccccCCHHHHHHHHHHH-------HhhHHHHHHHHHHHHhhhhhhccccccee Q lcl|NC_021305. 274 IEARQLNREEVCGVYDIAPP-IVHILDRATFSNISAQMRAFYRD-------TMAIPIARIQSAMDKYVGQYWVRKNRMKF 345 (518) Q Consensus 274 ~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~sn~e~~~~~~~~~-------~l~P~~~~ie~~l~~~l~~~~~~~~~~~f 345 (518) +........||.+-+||.. |+|.+-++-+++.|...++||.. -++|.++.+-+.|-+..+..... .+.| T Consensus 379 -dVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idp--~i~~ 455 (694) T protein:vir:10 379 -ALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDP--SIKW 455 (694) T ss_pred -HHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC--cceE Confidence 7788888999999999965 67777777778888888888875 58899888877776666554433 4566 Q ss_pred cchhhhhcCHHHHHHH-------HHHHHhCCCcCHHHHHHHhCCCCCCCCC------cceeeecccc-cccc-cccccCC Q lcl|NC_021305. 346 DIDDVIQPDWEAKSES-------TQKMVNSGVATPNEGREIMGLPRSDDPK------ADELYANSAL-QPLG-ATPDGAV 410 (518) Q Consensus 346 d~~~l~~~d~~~~~~~-------~~~~~~~G~~T~NE~R~~~g~~p~~~~~------gD~~~~~~n~-~~~~-~~~~~~~ 410 (518) .+.+|.+.+.++++++ ...++..|+++++|+|.++.-+|-- +. .|++-+|... +... ....+.. 
T Consensus 456 ~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s-~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~ 534 (694) T protein:vir:10 456 QWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDG-PYAGKLDANDDPGVPADDDIDGVLTYVQRLA 534 (694) T ss_pred EeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCc-ccccccccccCCCcCccchhhhhHhhhcCcc Confidence 6778877777776655 4568899999999999998765421 11 1111111110 0000 0000000 Q ss_pred CCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcccCCchhhHHHHHHHHHhhccccCcCchh Q lcl|NC_021305. 411 EWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAVKGAMGRGKDIKGFA 490 (518) Q Consensus 411 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 490 (518) + ++..+++++ ..++. .+.+..... ..+.. ++++. .+....+..-+++-+ T Consensus 535 ~---~~~~~~~~~---~~~g~---~~~~~v~~~-~~~~~------~~~ag-------~~~~~~~~ag~v~~~-------- 583 (694) T protein:vir:10 535 E---GGDTGAPGG---ARAGA---TAPPTVANV-NANVN------PREAG-------AQDAAMRAAGAVYVV-------- 583 (694) T ss_pred c---ccccCCCCc---ccccc---cCCCccccc-ccccC------ccccC-------CCCccceeeEEEEEe-------- Confidence 0 000000000 00000 000000000 00000 00000 000000000000000 Q ss_pred HHHHHHHHHHHhHHHHhhhhhhhcccCC Q lcl|NC_021305. 491 LQLAEKYPDDLEDILLAVQLALAERKDN 518 (518) Q Consensus 491 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (518) -|+.-|..|..+ T Consensus 584 ----------------~g~vLl~kr~~g 595 (694) T protein:vir:10 584 ----------------DGKVLLMKRPAG 595 (694) T ss_pred ----------------CCEEEEEEecCC Confidence 011112222222 No 139 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=99.53 E-value=7.3e-15 Score=98.08 Aligned_cols=455 Identities=12% Similarity=0.049 Sum_probs=210.8 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccc--cccccccccc--hhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEec-- Q lcl|NC_021305. 
1 MLLANGQTLSAPAMAELSPQMQDSYYYAP--AVGMQLERQF--SLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTS-- 74 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~--~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~-- 74 (518) -+|---.+.-.|..+..-..-.+ |++.. ...+-..+++ .-+.....++|-+++|+..|++.+.+- |.-...+ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~-w~~~~~~~~ 148 (695) T protein:vir:36 71 RQFEVDVSNYTPRERRAASYALD-FNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRT-WGEAIGGTK 148 (695) T ss_pred eeceecccccCccccchhhhhhc-ccccccccchhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcc-cceecccch Confidence 11211111111111100000001 00000 0001111111 122345677888999999999998776 5332111 Q ss_pred -----------CCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEc-----------------CCC Q lcl|NC_021305. 75 -----------GDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKN-----------------KSG 126 (518) Q Consensus 75 -----------~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~-----------------~~G 126 (518) +++....+.....+|...-....-+..|.+. +.+--+||-+.+++.-+ ..| T Consensus 149 e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL~V~~~l~ea-ik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kG 227 (695) T protein:vir:36 149 EKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTT-VIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKG 227 (695) T ss_pred hhhhhccccccccccccCchHHHHHHHHHHHHHHHHHHHHHH-HHhhccccceEEEEEeccCccccccccccccccccCc Confidence 1111111112233333322222223334444 44445666665555321 134 Q ss_pred ceEEEEeeCCceeEEEEcCCcee---eEEeeecccccCceeEEeccccEEEEeccCC------CCcccCchHHHHHHHHH Q lcl|NC_021305. 127 TPEKLMPMHPSRVAIKRNSRTGR---YEYYFQAGAGVGTQLVSFADDEVVPIRFFNP------DGLERGLSLMESLKSTI 197 (518) Q Consensus 127 ~~~~l~~l~p~~v~v~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~evih~~~~~~------~~~~~G~s~l~~~~~~i 197 (518) .+..|.+++|..|++........ .+|...+.... + ..+..+.++.|..... 
.....|+|..+.+.+.+ T Consensus 228 slKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~-G--~kIH~SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V 304 (695) T protein:vir:36 228 SFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMI-G--TEVHATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYI 304 (695) T ss_pred ceeeeEeecccccccchhhhccchhhccCCCceEEEe-c--eEEeeeeEEEecCCCchhhhhcccccCcccHHHHHHHHH Confidence 56678899998888754321111 11111111111 1 2456666665653322 12346999999999999 Q ss_pred HHHHHHHHHHHHHHHccCCcccccccC--ccCCHHHHHHH--HHHHHHHhcCccccCCeeecC-CCcceeeccCChhhHH Q lcl|NC_021305. 198 FSEDSSRNATAAMWKNAGRPNLVLRHE--KRLSEAAQQRL--REQFDRAHSGSSNTGKTMVVE-EGMEPIPLQLTAVEMQ 272 (518) Q Consensus 198 ~~~~~~~~~~~~~~~ng~~p~~il~~~--~~~~~~~~~~~--~~~~~~~~~g~~n~g~~~vl~-~g~~~~~l~~~~~d~~ 272 (518) .....+..........-. ..++ +.+ ..+.+.....+ |-++.+++++ |. ++.+++ +..+|.+.+.+....+ T Consensus 305 ~~~~rT~~~v~~Li~~~~-v~~l-k~dla~aL~~g~~~~l~~R~eli~~~Rs--n~-G~~llDk~~Eefeq~stslSGLd 379 (695) T protein:vir:36 305 DNWLRTRQSVSDIVKQFS-VSGI-LMDLAQALMPGANVDLSMRAELINRYRD--NR-NILFLDKATEEFFQFNTPLSGLD 379 (695) T ss_pred HHHHHHHhHHHHHHHhhh-HHHH-HHHHHHhhcChhHHHHHHHHHHHHHhcC--cc-ceEEEecCCcceEEEecccCCHH Confidence 888877777766654422 2222 111 12222222222 2334445543 33 466678 5789999888777755 Q ss_pred HHHHHHHHHHHHHHHhcCCHH-HhccccccccCCHHHHHHHHHHH-------HhhHHHHHHHHHHHHhhhhhhcccccce Q lcl|NC_021305. 273 FIEARQLNREEVCGVYDIAPP-IVHILDRATFSNISAQMRAFYRD-------TMAIPIARIQSAMDKYVGQYWVRKNRMK 344 (518) Q Consensus 273 ~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~sn~e~~~~~~~~~-------~l~P~~~~ie~~l~~~l~~~~~~~~~~~ 344 (518) +......+.||.+-+||.. |+|.+-++-+++.|...++||.. -++|.++.+-+.|-+..+..... .+. 
T Consensus 380 --dVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idp--di~ 455 (695) T protein:vir:36 380 --ALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDP--SIK 455 (695) T ss_pred --HHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC--cce Confidence 7788888999999999965 67777777778888888888775 58899888877776666554433 456 Q ss_pred ecchhhhhcCHHHHHHH-------HHHHHhCCCcCHHHHHHHhCCCCCCCCC------cceeeecccc-cccc-cccccC Q lcl|NC_021305. 345 FDIDDVIQPDWEAKSES-------TQKMVNSGVATPNEGREIMGLPRSDDPK------ADELYANSAL-QPLG-ATPDGA 409 (518) Q Consensus 345 fd~~~l~~~d~~~~~~~-------~~~~~~~G~~T~NE~R~~~g~~p~~~~~------gD~~~~~~n~-~~~~-~~~~~~ 409 (518) |.+.+|.+.+.++++++ ...++..|+++++|+|.++.-+|-- +. .|++-+|... ++.. ....+. T Consensus 456 ~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s-~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~ 534 (695) T protein:vir:36 456 WQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDG-PYAGKLDANDDPGVPADDDIDGVLTYVQRL 534 (695) T ss_pred EEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCc-ccccccccccCCCcCccchhhhhHhhhcCc Confidence 66778888777777655 4568899999999999998765421 11 1111111110 0000 000000 Q ss_pred CCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcccCCchhhHHHHHHHHHhhccccCcCch Q lcl|NC_021305. 410 VEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAVKGAMGRGKDIKGF 489 (518) Q Consensus 410 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (518) .+ ++..+++++ ..++. .+.+..... ..+.. ++++. .+....+..-+++-+ T Consensus 535 ~~---~~~~~~~~~---~~~g~---~~~~~v~~~-~~~~~------~~~ag-------~~~~~~~aag~v~~~------- 584 (695) T protein:vir:36 535 AE---GGDTGAPGG---ARAGA---TAPPTVANV-NANVN------PREAG-------AQDAAMRAAGAVYVV------- 584 (695) T ss_pred cc---ccccCCCCc---ccccc---cCCCccccc-ccccC------ccccC-------CCCccceeeEEEEEe------- Confidence 00 000000000 00000 000000000 00000 00000 000000000000000 Q ss_pred hHHHHHHHHHHHhHHHHhhhhhhhcccCC Q lcl|NC_021305. 
490 ALQLAEKYPDDLEDILLAVQLALAERKDN 518 (518) Q Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (518) -|+.-|..|..+ T Consensus 585 -----------------~g~vLl~kr~~g 596 (695) T protein:vir:36 585 -----------------DGKVLLMKRPAG 596 (695) T ss_pred -----------------CCEEEEEEecCC Confidence 011112222222 No 140 >protein:vir:106491 Length: 646 # NCBI annotation: Pas4 # Family: family:all:2798 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024790;genbank:gi:48697405;genbank:GeneID:2846148 Probab=99.34 E-value=1.4e-11 Score=80.08 Aligned_cols=500 Identities=13% Similarity=0.077 Sum_probs=239.9 Q ss_pred CcCCCCCCCCcccccccch---------hhhhhhcccccccccccccchhhh-HHHhhcHHHHHHHHHHHHhhccCceEE Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSP---------QMQDSYYYAPAVGMQLERQFSLYG-GIYKNQPWVRTVIAKRAQALARLPVKC 70 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~~~v~~ia~~ia~l~~~v 70 (518) -|+.-+-.+++|..+..++ .+...-...+-.+....+....-+ +.|...|.++..|..|+++|+++.+.. T Consensus 2 ~~~rPk~~p~~p~~~~~arrr~LtaAsa~l~~~~~~~~kt~~~~~~~WQ~eAW~~~d~vpELry~vgW~~~a~SR~rL~a 81 (646) T protein:vir:10 2 ALLKPKSAPPEPFGAEVARRIALAGATAQVDLGASSSWKTWKFGNKDWQTEGWRLYDIIPEHHFLAGRIGDSVAQARLYV 81 (646) T ss_pred cccCCCCCCCCcccccccchhhhhhccccccCCCcceeecCCCcchhhhHHHHHHHhhhhhHhhHhhhhhhhhceeeeee Confidence 2344333344443222211 110000000111111111222111 234555889999999999999999998 Q ss_pred EEecCCcce--eccchHHHHHHhcCCcCC-CHHHHHHHHHHHHHHcCCeEEEEE---EcCCCceEEEEeeCCceeEEEEc Q lcl|NC_021305. 71 MFTSGDTET--EESDTGYAKLLADPCEYL-DPFAFWEWVASTLDIYGETYLAIQ---KNKSGTPEKLMPMHPSRVAIKRN 144 (518) Q Consensus 71 ~~~~~~~~~--~~~~~~~~~L~~~PN~~~-s~~~f~~~~v~~ll~~G~~~~~i~---r~~~G~~~~l~~l~p~~v~v~~~ 144 (518) -+.++.|.. ...++....+-..+-... -..++++.+..++-+-|++|+... ....+.--.++++..+.|.. . 
T Consensus 82 seiddtG~~tg~v~~~~v~~iv~~~~Gg~~gQ~qlLkr~~~~ltV~GE~wiv~~~~~~~~~~~~~~W~vvt~~Ev~~--t 159 (646) T protein:vir:10 82 TEVDDTGEETGEVQDERIKRLAAVPLGTGSQRDDNLRLAGLDLAVGGECWIVGEGAATSPEAAEGSWFVVTGSAISR--T 159 (646) T ss_pred eeecCCCCCcCccchHHHHHHhhhhccchhhHHHHHHHHHhheecccceEEeeccccCCCCCCccceeeecHHHhcc--C Confidence 887765543 234455555555554443 456789999999999999998641 11122122344555555522 1 Q ss_pred CCceeeEEeeeccc-ccCceeEEeccccEEEEeccCCC--CcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCccccc Q lcl|NC_021305. 145 SRTGRYEYYFQAGA-GVGTQLVSFADDEVVPIRFFNPD--GLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVL 221 (518) Q Consensus 145 ~~~~~~~~~~~~~~-~~~~~~~~~~~~evih~~~~~~~--~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il 221 (518) +... .+.... ..++..+.+...++ .||..+|. .-.+--||+.+++..+.......+...+..+...+..||| T Consensus 160 --g~~~--~i~~p~~~~g~~~v~~~~~d~-lvRiW~P~Prr~~epDSpvra~l~~l~Ei~~lt~~I~aaakSRL~GnGvL 234 (646) T protein:vir:10 160 --GDEI--AVRRPQQRGGSKLVLVDGQDI-LIRCWRPHPNDTDQADSFTRSAIVPLREIELLTKREFAELDSRLTGAGIM 234 (646) T ss_pred --CCee--eeecCccCCCCCcceecCCce-EEEEecCCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHHHHHhcCcee Confidence 1111 111111 12444555556666 45766553 3346789999999999999988888888888888888998 Q ss_pred ccCccCC-------HHHHHHHHHHHHH----HhcCcc--ccCCeeecCC-Cc------ceeeccC-ChhhHHHHHHHHHH Q lcl|NC_021305. 222 RHEKRLS-------EAAQQRLREQFDR----AHSGSS--NTGKTMVVEE-GM------EPIPLQL-TAVEMQFIEARQLN 280 (518) Q Consensus 222 ~~~~~~~-------~~~~~~~~~~~~~----~~~g~~--n~g~~~vl~~-g~------~~~~l~~-~~~d~~~~e~~~~~ 280 (518) -+|..++ +-....|.+.|.+ .+...+ .+--++|+.. |. +++.+.. +.-+.--+++++.. 
T Consensus 235 fvP~e~s~p~~~~~~a~~~~l~~~l~qaa~tAi~De~S~aA~vPiia~~P~E~i~~~~~ik~l~f~~eite~aiktR~da 314 (646) T protein:vir:10 235 FLPEGVDFPRGEEDPAGLAGFMAYLQRAAAASMADQSRASAMVPIMATIPNEMMEHLDKIKPLTFWSELSAEITPMKDKA 314 (646) T ss_pred eeccccccCCCCCCCcchhHHHHHHHHHHHhhhcCCCCccceeeeEEeeChHHHhhhhcceeeccCchhhHHHhhhHHHH Confidence 7775432 2334445544433 222111 1222233321 11 2333332 23334467899999 Q ss_pred HHHHHHHhcCCHHH-hccccccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhh----c----ccccceecchhhh Q lcl|NC_021305. 281 REEVCGVYDIAPPI-VHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYW----V----RKNRMKFDIDDVI 351 (518) Q Consensus 281 ~~~Ia~~fgVPp~~-lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~----~----~~~~~~fd~~~l~ 351 (518) +..||....|||+. +|+. ++|.=+.=+-...-++ .|.|.+..|+++|++.++.+. + ..|-+.||.+.|. T Consensus 315 I~RlA~glDIppE~LLGlg-d~NHWtAWqI~de~vr-HI~P~l~~ic~AlT~~~Lrp~Le~eGi~dp~kyvvW~DaS~Lt 392 (646) T protein:vir:10 315 IARLASSAEIPGEVLTGIG-DANHWTAWLISDEGIR-WIRGYLGLIADALTRGFLRRALESMGVTNPERYAFAFDTSTLA 392 (646) T ss_pred HHHHHhccCCchhheeecc-ccceeeeeeeccccch-hhhhHHHHHHHHHHhhHHHHHHHHcCCCChhHeEEeecCcccc Confidence 99999999999875 5554 5554333333334445 699999999999999877432 1 2355689988884 Q ss_pred h-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeec---------ccc--cccccccccC----CCCCCC Q lcl|NC_021305. 352 Q-PDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYAN---------SAL--QPLGATPDGA----VEWEEA 415 (518) Q Consensus 352 ~-~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~---------~n~--~~~~~~~~~~----~~~~~~ 415 (518) . .|..+ -...+...|.+|-...|+.+|+.--+.+.-++.++. .++ .|.-+..-+. ..+-+. 
T Consensus 393 ~~pd~~d---eA~qa~drGAIt~eAlrk~~Gf~~dd~pt~~E~~~~~~~~~v~~~P~Lil~P~~qa~~~~P~~~~~~lpp 469 (646) T protein:vir:10 393 SKPNRLD---EAIQLHERNLIKDEEVVKAGAFSVDQMPTVQERAVQILLGLVKTQPDLILDPAIQAALGLPAVQSVGLPP 469 (646) T ss_pred cCCCCcH---HHHHHHHcCCccHHHHHHHhcccccccCChHHHHHHHHHHHhcCCccccccchhhccccCCCcCccccCC Confidence 3 33333 345677899999999999999976554433222211 111 1111111111 000000 Q ss_pred CCCCCCccCCCCCCCccccCCccccccchhcc---------hhhHHHHHHHHhhcccCCch-hhHH-HHHHHHHhhc--c Q lcl|NC_021305. 416 PAPKRPASTPVASLDQSPPTSVPGLSPTNSDR---------STDSGKTEPRRLMQKPPPKE-SSPK-HLRAVKGAMG--R 482 (518) Q Consensus 416 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~k~~~~~-~~~~-~~~~~~~~~~--~ 482 (518) +...++.. +..+++...+..+.++++.+. +.+++-+.+.+...-...++ .+.. -.-.+..+|. + T Consensus 470 ~~~~~~dg---~~~~~e~~g~~~~~E~~~~pda~~~~a~~~~~~~r~~~~~~~~~~~~~p~a~~~aav~l~v~RAL~lAG 546 (646) T protein:vir:10 470 TAAQRTDG---DLDDDESEGAPNGGEAPDQPDADEARAITAALDRRIALAARPVLALPSPEAVFNASAKLMILRALELAG 546 (646) T ss_pred cccccccC---CCCChhhcCCCCCCccCCCCCCCccccccccccccchhhhhhhhccccchhHHHHHHHHHHHHHHHhcc Confidence 11111110 000011111111112211111 11111122222211111111 1111 1112233331 1 Q ss_pred ccCcCchhHHHHHHH----------------HHHHhHHHHhh-----hhhhhccc-CC Q lcl|NC_021305. 483 GKDIKGFALQLAEKY----------------PDDLEDILLAV-----QLALAERK-DN 518 (518) Q Consensus 483 ~~~~~~~~~~~~~~~----------------~~~~~~~~~~~-----~~~~~~~~-~~ 518 (518) +.---+-..+ .++ ++.+ ..+++| -.++|+-. 
|+ T Consensus 547 ~Rlrt~~~~~--a~~r~vp~he~h~~l~Pv~~~~~-~rl~~G~wd~~~~v~~~lg~D~ 601 (646) T protein:vir:10 547 GRLTTPAERR--GRWSDVPRHELHHHVGPITPDKA-RRVTEGAWNHVAVAAADLGVDA 601 (646) T ss_pred ccccCchhhh--HHhhcCChhhceeecCCCChhhH-HHHHhcccccHHHHHHhcCCCh Confidence 1111111111 111 1111 111111 11222222 11 No 141 >protein:vir:102426 Length: 631 # NCBI annotation: gp11 # Family: family:all:2798 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655288;genbank:gi:109521851;genbank:GeneID:4157741 Probab=99.29 E-value=8.4e-12 Score=81.31 Aligned_cols=489 Identities=14% Similarity=0.098 Sum_probs=236.0 Q ss_pred CcC-------CCCC-CCCcccc--cccchhhhh---hhcccccccccccccchhhh-HHHhhcHHHHHHHHHHHHhhccC Q lcl|NC_021305. 1 MLL-------ANGQ-TLSAPAM--AELSPQMQD---SYYYAPAVGMQLERQFSLYG-GIYKNQPWVRTVIAKRAQALARL 66 (518) Q Consensus 1 ~~f-------~~~~-~~~~~~~--~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~~~v~~ia~~ia~l 66 (518) |-- .+-+ +.|+..+ ...+..+.+ .+.+. .|.+.......-+ +.|...+.++-.|..|+++|+++ T Consensus 1 ~~a~~~lr~~rrpkg~~~a~~r~L~aAs~~~~dpg~~~~~~--~g~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~sr~ 78 (631) T protein:vir:10 1 MAATQSLRLVRRPKGGRPAPSRALTAASQPLPDPSQVFSKS--TGISRNSDWQTDAWEAVDLVGELRYYVGWRASSCSRC 78 (631) T ss_pred CCcccceeeeecCCCCCccchhhhhhhhccccchhhhhhhh--cCCcccchhhHHHHHHHHhhhhHHHHhhhhhhhhcee Confidence 211 1111 1111111 011111211 11111 1212222221111 23455588899999999999999 Q ss_pred ceEEEEecCC-----cceeccc---hHHHHH-HhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEE-cCCCc--------- Q lcl|NC_021305. 67 PVKCMFTSGD-----TETEESD---TGYAKL-LADPCEYLDPFAFWEWVASTLDIYGETYLAIQK-NKSGT--------- 127 (518) Q Consensus 67 ~~~v~~~~~~-----~~~~~~~---~~~~~L-~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r-~~~G~--------- 127 (518) .+..-+.+.+ +..+..+ .....+ ..-+...+...++++.++.++-+-|++|+.++- ..+|. 
T Consensus 79 rL~as~idpDtg~ptg~iee~~~~~~~v~~~~~~i~gG~lgQ~~llkrl~~~ltV~GE~wiv~l~~p~~~~~~~pd~~~r 158 (631) T protein:vir:10 79 RLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKGAPAQPDGSVR 158 (631) T ss_pred eeEeeeeccCCCCCccccccCCchhHHHHHHHHhcCCCcchHHHHHHHHHhheecccceEEEEEeccCcCCCCCcccccc Confidence 9988877744 2222211 222222 335667788999999999999999999998752 22211 Q ss_pred -eEEEEeeCCceeEEEEcCCceeeEEeeecccccCceeEEeccccEEEEeccCC--CCcccCchHHHHHHHHHHHHHHHH Q lcl|NC_021305. 128 -PEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNP--DGLERGLSLMESLKSTIFSEDSSR 204 (518) Q Consensus 128 -~~~l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~--~~~~~G~s~l~~~~~~i~~~~~~~ 204 (518) .-.++++....|......++..+. ... +..-+|-.+-=+.||..+| ..-.+--||+.+++..+....... T Consensus 159 ~~~~W~~vt~~ei~~~~~g~g~~v~--lp~-----g~~h~~~~~~D~l~RiW~P~prr~~e~dSpvra~l~~l~Ei~~~t 231 (631) T protein:vir:10 159 TRQEWYAVSKEEIKKSNKGSGTNIV--LPT-----GEEHEFVKGTDIIFRVWIPKPRKASEPDSPVRAVLDSIREIVRTT 231 (631) T ss_pred cccceeeccHHHHhcccCcccceee--cCC-----CCccceecCCceEEEeeCCCcccccCCcchhHHHHHHHHHHHHhh Confidence 224555555555433323222221 111 1122222222244554444 334467899999999999999888 Q ss_pred HHHHHHHHccCCcccccccCccCC---------------------HHHHHHHHHHHH----HHhcCcc--ccCCeeecC- Q lcl|NC_021305. 205 NATAAMWKNAGRPNLVLRHEKRLS---------------------EAAQQRLREQFD----RAHSGSS--NTGKTMVVE- 256 (518) Q Consensus 205 ~~~~~~~~ng~~p~~il~~~~~~~---------------------~~~~~~~~~~~~----~~~~g~~--n~g~~~vl~- 256 (518) +...+..+...+..|||-+|..++ .-...+|.+.+- ..+...+ .+--++|+. 
T Consensus 232 ~~i~aaakSRl~gnGvlflP~els~P~~~~~~~~~~g~~v~~~~g~pa~~~l~~~l~q~a~tai~De~S~aA~vPii~~~ 311 (631) T protein:vir:10 232 KTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPVIAGV 311 (631) T ss_pred hHHHHHHHHHHhhCceeEeccccccCCCCCCCCCcCCccCCccccchhHHHHHHHHHHHHhhhhcCCCCccceeeeeEee Confidence 888888888888888887765433 124555555443 2222111 111223322 Q ss_pred -----CCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHH-HhccccccccCCHHHHHHHHHHHHhhHHHHHHHHHHH Q lcl|NC_021305. 257 -----EGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPP-IVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMD 330 (518) Q Consensus 257 -----~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~ 330 (518) ++++...+.... +.--+++++..+..||....|||+ ++|+..++|.=+.=+-...-++-.|.|.+..|+++|+ T Consensus 312 p~E~i~~i~hlkf~~ei-~e~aiktR~daI~RlA~glDi~pE~LLGlGsd~NHWsAWqI~dedVrlHI~P~l~lic~AlT 390 (631) T protein:vir:10 312 PGEQIKDVKHIRFDNEI-TEVAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHWSAWQISDEDVQLHIAPVMEIFCQALT 390 (631) T ss_pred chHHhcCeeEEeecCch-hHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchHHHHHHHHHH Confidence 234444444433 334578999999999999999987 5666556554333333445566789999999999999 Q ss_pred Hhhhhhh----c---ccccceecchhhhh-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC-----Cc------- Q lcl|NC_021305. 331 KYVGQYW----V---RKNRMKFDIDDVIQ-PDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDP-----KA------- 390 (518) Q Consensus 331 ~~l~~~~----~---~~~~~~fd~~~l~~-~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~-----~g------- 390 (518) +.+|.+. + ..|-+.||.+.|.. 
.|..+ -...+...|.+|-...|+.+|+.-.+.+ .| T Consensus 391 ~q~Lrp~Le~eGvDp~kYvvW~DaS~Lt~dPdr~d---eA~qa~drGAIt~eAlrk~lGf~eDd~yd~~t~e~~~~~a~~ 467 (631) T protein:vir:10 391 DQILRVTLAREGIDPSKYVVWYDPSQLTIDPDKSD---EAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWVMWAQD 467 (631) T ss_pred hhHHHHHHHHhCCCHHHhEeeecCcccccCCCCcH---HHHHHHHcCCcCHHHHHHHhcCchhcccCcCchHHHHHHHHH Confidence 9877432 1 24567899888843 33333 3455778999999999999999753211 11 Q ss_pred ----ceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcccCCc Q lcl|NC_021305. 391 ----DELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPK 466 (518) Q Consensus 391 ----D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~ 466 (518) |.-++ .++.|+.. ++...-..|.++.+...-..+.+++.+....+.++++.+......+..... T Consensus 468 av~~dpaLi-p~lApl~~---~~~~~v~~P~~~a~~~~g~ed~~~~~~~~~g~~epdt~d~~p~~~~a~~~~-------- 535 (631) T protein:vir:10 468 AVSKDPTLI-PMLAPLIA---GVLKQIEFPQQQAIDSGGNEDTSDADDLDDGEQEPDTEDDDDGTQKAGLET-------- 535 (631) T ss_pred HhhcccCcc-hhhHHHHH---HHhhhccCCCCCCCCCCCCCccccccccccCCCCCCCCCCCCccccccchH-------- Confidence 11111 12222211 111111111111110000011122222333344444433322111110000 Q ss_pred hhhHHHHH-HHHHhh--cccc--C---cCchhHHHHHHH----------HHHHhHHHHhhhhhhhc------ccCC Q lcl|NC_021305. 467 ESSPKHLR-AVKGAM--GRGK--D---IKGFALQLAEKY----------PDDLEDILLAVQLALAE------RKDN 518 (518) Q Consensus 467 ~~~~~~~~-~~~~~~--~~~~--~---~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~------~~~~ 518 (518) +.++ .+..+| +++. . 
....-++.+..+ ++++-.+.-+-+-+|-+ --|+ T Consensus 536 ----~iv~llv~RALelAGkRl~~r~r~~~ar~~~v~~he~H~~~~Pv~~~ev~rli~gwd~~ld~~~~~~Lg~d~ 607 (631) T protein:vir:10 536 ----GIVDLMVDRALELVGKRRRGRDRETLARLSGVRERDYHRYMDPVPESEVDRLMSGWDSALDDKILLRLGLDP 607 (631) T ss_pred ----HHHHHHHHHHHHhhcchhcCCcccchhHHhcccccccccccCCCCHHHHHHHHHHHHHHHHHHHHHHhCCCH Confidence 0111 011111 0000 0 000001111111 12222121111111100 0011 No 142 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=99.19 E-value=4.5e-10 Score=71.83 Aligned_cols=425 Identities=12% Similarity=0.096 Sum_probs=170.7 Q ss_pred CcCC-CCCCCCcccccccch----------hhh--hhhcccc--cccccccccchhhhHHHhhcHHHHHHHHHHHHhhcc Q lcl|NC_021305. 1 MLLA-NGQTLSAPAMAELSP----------QMQ--DSYYYAP--AVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALAR 65 (518) Q Consensus 1 ~~f~-~~~~~~~~~~~~~~~----------~~~--~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~ 65 (518) --|. +-...+..+...+.. ... ..+.-+. ....+.. -....+.......+...+|+.+++.+.- T Consensus 9 ~~~~~~~~~l~~~e~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~-~p~~~~~~~~v~n~~~~iVd~~a~rl~~ 87 (504) T protein:vir:99 9 SKFTFRIPELNDDVVDKVNGLYQQLVDRTPRNLLRASFYDGKYAIRQIGNL-IPPEYLRTATVLGWSAKAVDTLARRCNL 87 (504) T ss_pred cccccccCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHHhccccchhcccc-ccHHHHHHhhccCcHHHHHHHHHhhhcc Confidence 0010 000011111000000 000 0000000 0000000 0011112222334456677777775544 Q ss_pred CceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceE-EEEeeCCceeEEEEc Q lcl|NC_021305. 66 LPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPE-KLMPMHPSRVAIKRN 144 (518) Q Consensus 66 l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~-~l~~l~p~~v~v~~~ 144 (518) -.|.+ .++ ...+..+..++.+ |. .......+..+.+++|.+|+.+..+.+|.+. 
.+.+++|..+.++++ T Consensus 88 ~Gf~~---~d~---~~~~~~l~~i~~~-N~---ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~sP~~~~~iyD 157 (504) T protein:vir:99 88 ESFVW---PDG---DYGSIGGPDVWDE-NF---FATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVKSAMQATGEWN 157 (504) T ss_pred ceeeC---CCC---ChhhHHHHHHHHh-cC---hhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceeEEEEe Confidence 44432 211 1112233444443 33 3345677888999999999999888888754 567889999988887 Q ss_pred CCceeeEEee--ecccccCce--eEEecccc------------------------EEEEeccCCCCcccCchHH----HH Q lcl|NC_021305. 145 SRTGRYEYYF--QAGAGVGTQ--LVSFADDE------------------------VVPIRFFNPDGLERGLSLM----ES 192 (518) Q Consensus 145 ~~~~~~~~~~--~~~~~~~~~--~~~~~~~e------------------------vih~~~~~~~~~~~G~s~l----~~ 192 (518) .........+ ......+.. ...|.++. |++|.+....+..+|.|.+ .. T Consensus 158 ~~~~~~~~a~~~~~~d~~g~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~gvPvV~~~n~~~~~~~~G~sei~~~v~~ 237 (504) T protein:vir:99 158 SRRNAMDSLLSITSRDAEGHPTGIALYEDGVTVTADMDDDGDWHADVRTHKLGVPVEVLPYKPREDRPLGSSRITRPVMS 237 (504) T ss_pred CCCCceeEEEEEEEecCCCeEEEEEEEcCCcEEEEEEcCCceeeeccccCCCCcceEEecccccCccccCcccchhhHHH Confidence 6443322211 111111111 11223333 3444433222334676643 33 Q ss_pred HHHHHHHHHHHHHHHHHHHHccCCcc-cccccC-ccCCHHHHHHHHHHHHHHhc---CccccCCeeec-CCCcceeeccC Q lcl|NC_021305. 193 LKSTIFSEDSSRNATAAMWKNAGRPN-LVLRHE-KRLSEAAQQRLREQFDRAHS---GSSNTGKTMVV-EEGMEPIPLQL 266 (518) Q Consensus 193 ~~~~i~~~~~~~~~~~~~~~ng~~p~-~il~~~-~~~~~~~~~~~~~~~~~~~~---g~~n~g~~~vl-~~g~~~~~l~~ 266 (518) +.+.+.....-......||.. |. .++-.. ....+++.+ -...|+.... .........+. ..+.++.++.. 
T Consensus 238 l~Da~~~~~~~~~~~~e~~a~---p~r~i~G~~~~~~~~~d~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~ 313 (504) T protein:vir:99 238 LQQRALKGCIRMDGHADVYSF---PQLILLGADAKNFRNKDGS-MKPAWQIALARVFALPDDEDEPDAARARADVKQFPA 313 (504) T ss_pred HHHHHHHHHHHHHHHHHHhcc---hhhhhccCCcccccccccc-ccchhhhhhhhhhcCCCccccccccCccceeeecCC Confidence 444433333333333334332 22 222211 111111111 1112222111 11111111111 12355555543 Q ss_pred ChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHH--HHHHhhHHHHHHHHHHHHh----hh--hhh- Q lcl|NC_021305. 267 TAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAF--YRDTMAIPIARIQSAMDKY----VG--QYW- 337 (518) Q Consensus 267 ~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~--~~~~l~P~~~~ie~~l~~~----l~--~~~- 337 (518) ..-+ .|.+.++..+..|+..-++|++.+|+....+.++.+...... +...+.-....+...+.+. +. ... T Consensus 314 ~~l~-~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~~ 392 (504) T protein:vir:99 314 SSPQ-PHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALAIKNGLD 392 (504) T ss_pred CChH-HHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 3222 378889999999999999999999987655555544332111 1111111222222223221 11 100 Q ss_pred ---cccccceecchhhhhcCHHHHHHHHHHHHhCCCcC--H-HHHHHHhCCCCCCCC-CcceeeecccccccccccccCC Q lcl|NC_021305. 338 ---VRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVAT--P-NEGREIMGLPRSDDP-KADELYANSALQPLGATPDGAV 410 (518) Q Consensus 338 ---~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T--~-NE~R~~~g~~p~~~~-~gD~~~~~~n~~~~~~~~~~~~ 410 (518) .....+++.|.+....+..+.++++.+++++|... . .-+.+++|+.+-+-. .-++...-.....++...+... 
T Consensus 393 ~~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~~~ei~r~~~e~~~~~~~~~~~~l~~~~~ 472 (504) T protein:vir:99 393 RIPPEWKTIDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLGLTPQQAKRALAERRRASSVSIIEALNRRQQ 472 (504) T ss_pred ccccccccceeEecCCCccCHHHHHHHHHHHHhhccccccchHHHHhhcCCCHHHHHHHHHHHHHHhhHHHHHHHhcccC Confidence 11134556677888889999999999999988632 2 335577788653210 0000000000000111111110 Q ss_pred CCCCCCCC-CCCccCCCCCCCccccCCcccccc Q lcl|NC_021305. 411 EWEEAPAP-KRPASTPVASLDQSPPTSVPGLSP 442 (518) Q Consensus 411 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 442 (518) ....++.. .+++..+ +.+..+..++.+.++. T Consensus 473 ~~~~~~~~~~~~~~e~-a~~~~~~~~~~p~~~~ 504 (504) T protein:vir:99 473 EAATAGEDQDQGAGEP-PANEPPAALGRPTLVG 504 (504) T ss_pred CCCCCCCCCCcCCCCC-CCCCCCccCCCcccCC Confidence 00000000 0000000 0000011111111111 No 143 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=99.14 E-value=5.5e-11 Score=76.83 Aligned_cols=433 Identities=12% Similarity=0.082 Sum_probs=194.9 Q ss_pred CcCCCCCCCCccccccc-----chhhh----hhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccC----- Q lcl|NC_021305. 1 MLLANGQTLSAPAMAEL-----SPQMQ----DSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARL----- 66 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~-----~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l----- 66 (518) =||+...+......... ++... ..|.++. ..........|+++++.+|.|..||+.|++.+.-+ T Consensus 22 ~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~--~~n~~eLI~~YR~ma~~~pEVd~AideIvneaiv~d~~~~ 99 (533) T protein:vir:58 22 PMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYGGI--EFNRFFLYDMYDRMDYTDPLISTVLDIIADECTIPNENGN 99 (533) T ss_pred hhhcccCccCCCCCccccCCCCcchhhhhhhhhhhccc--cccHHHHHHHHHHhhccCcchhhHHHhhhceeeEecCCCc Confidence 34554321111111001 11000 0111110 01111235678888899999999999999987543 Q ss_pred ceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEc-CCCceEEEEeeCCceeEEEEcC Q lcl|NC_021305. 
67 PVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKN-KSGTPEKLMPMHPSRVAIKRNS 145 (518) Q Consensus 67 ~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~-~~G~~~~l~~l~p~~v~v~~~~ 145 (518) |+.+-.++-+ ........++. .+++..--..+++.|+++|..|..++-+ ..+.+.+|+.|+|..++.+.+. T Consensus 100 pV~v~l~~~e----~s~~iK~kI~~----lldf~~~~~~~fR~WYVDGriy~Hkiik~~k~GI~elr~lDPr~i~~vr~~ 171 (533) T protein:vir:58 100 IVDVVTKDIE----LAKAILSYLDY----VINIEKNAYPIIRNMIKYGDMFLHILEKGSDGTIEKFQVVSPYIFSKRYNP 171 (533) T ss_pred eeEeeccccc----ccHHHHHHHHH----HhcchhhhhHHHHhhhhcceeEEEeccCCcccchhhheecCCeeeEEEEee Confidence 2333221111 11111222222 2333333445577899999999998743 4556789999999999988877 Q ss_pred CceeeEEeeeccc---ccCceeEEeccccEEEEeccCCC-CcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCccccc Q lcl|NC_021305. 146 RTGRYEYYFQAGA---GVGTQLVSFADDEVVPIRFFNPD-GLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVL 221 (518) Q Consensus 146 ~~~~~~~~~~~~~---~~~~~~~~~~~~evih~~~~~~~-~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il 221 (518) .....+|.|.... ..+...+.++.+.|+|+..-..+ ...+++|-|..+...+.....++....-+--..+.-+=|+ T Consensus 172 ~t~~eyyvy~~~~~~~~s~~~~~kI~~daI~y~~SGl~d~~~~~iisyLhkAiKp~NQLkmiEDAlVIYRisRAPeRRvF 251 (533) T protein:vir:58 172 ETDTWYYVITDVYRNVVSGYFNEDIPEEDVIHFSHKIDTNFFPYGRSYLESARAIWNQLRLMEDALMLYRVVRSVDRRVF 251 (533) T ss_pred ccceEEEeecccccccccCccccccchhheeeeeeccccCCCCceehhhhHHHHHHHHHHHHHHHHHHHhhcCChhheEE Confidence 6666555543222 22334477889999999865332 2457889999998877776666666655544444444344 Q ss_pred ccC-ccCCHHHH----HHHHHHHHHHhcCccccCCee----------ec----------CCCcceeeccCChhhHHHHHH Q lcl|NC_021305. 222 RHE-KRLSEAAQ----QRLREQFDRAHSGSSNTGKTM----------VV----------EEGMEPIPLQLTAVEMQFIEA 276 (518) Q Consensus 222 ~~~-~~~~~~~~----~~~~~~~~~~~~g~~n~g~~~----------vl----------~~g~~~~~l~~~~~d~~~~e~ 276 (518) ..+ +++.+... ..+...++..+.=..+.|.+. +| ..|.+++.|... . 
+.-++- T Consensus 252 YIDVGNlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~TLpGg-~-lgemeD 329 (533) T protein:vir:58 252 YVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDILQGS-K-VDLAED 329 (533) T ss_pred EEeecCCCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhcccccCCCccceeeecCCC-C-CCcHHH Confidence 333 33433333 333334433332222344441 22 135677777643 2 445567 Q ss_pred HHHHHHHHHHHhcCCHHHhccccccccCCHHHHHH--HHHHHHhhHHHHHHHHHHHHhhhhhhc-ccccceecc--h--- Q lcl|NC_021305. 277 RQLNREEVCGVYDIAPPIVHILDRATFSNISAQMR--AFYRDTMAIPIARIQSAMDKYVGQYWV-RKNRMKFDI--D--- 348 (518) Q Consensus 277 ~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~--~~~~~~l~P~~~~ie~~l~~~l~~~~~-~~~~~~fd~--~--- 348 (518) ..+..+.+..+++||.+-++..... +...+-.+ .-+...|.-+-..+.+.|.+.|+.... ....|+|++ + T Consensus 330 V~YF~kkLy~ALnVP~sRl~~e~~f--gr~~eItRDEiKF~KFI~rLR~rF~~ll~~qLilk~iit~eew~~~f~~Dn~f 407 (533) T protein:vir:58 330 VEYMLNRLISALKVPKAFIGYEGDV--NAKNTLATQDIKFNNTIKRIQGFFVEELERMVRMNKEFADQDFRLVMNRSNSI 407 (533) T ss_pred HHHHHHHHHHHhCCCeeecCCCCCC--ccchhhhHHHHHHHHHHHHHHHHHHHHHhcccccccCcchhheeeeeeccchH Confidence 7788999999999999988764432 22222211 123445566666677777777653221 111233332 2 Q ss_pred -hhhhcC-HHHHHHHHHHH---HhC-----CC--cCHHHHHH------HhCCCCC-CC--CCcceeeecccccccccccc Q lcl|NC_021305. 349 -DVIQPD-WEAKSESTQKM---VNS-----GV--ATPNEGRE------IMGLPRS-DD--PKADELYANSALQPLGATPD 407 (518) Q Consensus 349 -~l~~~d-~~~~~~~~~~~---~~~-----G~--~T~NE~R~------~~g~~p~-~~--~~gD~~~~~~n~~~~~~~~~ 407 (518) ++.... +..|+.++..+ ++. -+ +| +|+.+ ..+..++ +. .+++. .|+. 
T Consensus 408 ~ElKe~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~t-dei~~q~e~ie~E~~~~~~~~~~~~~e~-------~~~~---- 475 (533) T protein:vir:58 408 VEGERFAVIEQRIGIAERLKGWVREDWIYSNILQIP-YDLKPQEEVAEAAGGGGLFDTGGFGEET-------TPAD---- 475 (533) T ss_pred HHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHhcCC-hhhhHHHHHHHHhhcCCCCCCCCccccc-------CCcc---- Confidence 111111 12233333221 111 11 12 22221 1111111 11 11111 1111 Q ss_pred cCCCCCCCCCCCCCccCCCCCCCcc----ccCCccccccchhcchhhHHHHHHHHhhc--ccCCchh Q lcl|NC_021305. 408 GAVEWEEAPAPKRPASTPVASLDQS----PPTSVPGLSPTNSDRSTDSGKTEPRRLMQ--KPPPKES 468 (518) Q Consensus 408 ~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--k~~~~~~ 468 (518) -.+...+|...+....... +.....+..+. .+..+...++-..+. .+..++- T Consensus 476 ------~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~---~~a~~~~~~~~g~~~~~~~~p~~~ 533 (533) T protein:vir:58 476 ------FLGERGSPIESPRGRTEFDFGTEGGEELGGELNL---GGAFEEFEEETGGGEEELPFPEEE 533 (533) T ss_pred ------cCccccCcccCCCChhhHhcccCCcccccccccc---cccchhhhhhcCCcccCCCCCCCC Confidence 1111111111111110000 00000000000 000000000000000 1111111 No 144 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=99.13 E-value=6.7e-10 Score=70.89 Aligned_cols=414 Identities=10% Similarity=0.030 Sum_probs=181.3 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcc----------ccccccc-ccccc--------hh-----hhHHHhhcHHHHHHH Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYY----------APAVGMQ-LERQF--------SL-----YGGIYKNQPWVRTVI 56 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~-~~~~~--------~~-----~~~~~~~~~~v~~~v 56 (518) |+++..+....+........+.....- ..+.|-. +.... .. 
....=..++....+| T Consensus 14 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~~~iv 93 (503) T protein:vir:59 14 ELNEIIVESAKEIAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTNNRTSHAWHKLFV 93 (503) T ss_pred hHHHhhhhhhhhccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccchhcccccccccccccccceeecchHHHHH Confidence 555554443333222111112110000 0000000 00000 00 000011245667888 Q ss_pred HHHHHhhccCceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCC Q lcl|NC_021305. 57 AKRAQALARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHP 136 (518) Q Consensus 57 ~~ia~~ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p 136 (518) +..+.-+-.-|+.+-..+ ......+..++. | +.......+..+.+.+|.+|+.+-.+.+|++ .+..++| T Consensus 94 d~~~~yl~g~~~~~~~~d-----~~~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~-~i~~~~p 162 (503) T protein:vir:59 94 DQKTQYLVGEPVTFTSDN-----KTLLEYVNELAD--D---DFDDILNETVKNMSNKGIEYWHPFVDEEGEF-DYVIFPA 162 (503) T ss_pred HHHHhhhhcCCeeeccCc-----HHHHHHHHHHHh--c---CHHHHHHHHHHHHhhCCeEEEEEeecCCCce-EEEEEcc Confidence 888888888888762111 111122233332 2 4556677789999999999999999888875 5888999 Q ss_pred ceeEEEEcCCc-e-eeEE--eeecccccC---ceeEEeccccEEEEeccC------------------------------ Q lcl|NC_021305. 137 SRVAIKRNSRT-G-RYEY--YFQAGAGVG---TQLVSFADDEVVPIRFFN------------------------------ 179 (518) Q Consensus 137 ~~v~v~~~~~~-~-~~~~--~~~~~~~~~---~~~~~~~~~evih~~~~~------------------------------ 179 (518) ..+.+.++... . .... .+......+ .....+.+..+.++.... 
T Consensus 163 ~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 242 (503) T protein:vir:59 163 EEMIVVYKDNTRRDILFALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRV 242 (503) T ss_pred ceeEEEEeCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceeccCCcc Confidence 99888776542 1 1111 011100000 111123333333322100 Q ss_pred C----CCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeec Q lcl|NC_021305. 180 P----DGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVV 255 (518) Q Consensus 180 ~----~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl 255 (518) | .....|.|-+..+...+.....+..-..+.+...+.|-.+++--. .+....+...+ ..++++.+ T Consensus 243 Piv~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~---~~~~~~~~~~~--------~~~~~~~~ 311 (503) T protein:vir:59 243 PIIPFKNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYD---GENPKEFTANL--------RYHSVIKV 311 (503) T ss_pred ceEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCC---ccccchhhhhh--------hcccceec Confidence 0 012357787777777777666555555555666666665554211 11111111111 11235556 Q ss_pred CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhcc-ccccccCCHH----------HHHHHHHHHHhhHHHHH Q lcl|NC_021305. 256 EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHI-LDRATFSNIS----------AQMRAFYRDTMAIPIAR 324 (518) Q Consensus 256 ~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~-~~~~~~sn~e----------~~~~~~~~~~l~P~~~~ 324 (518) +++.+...+..+.....+....+.+.+.|...-++|..-.+. ..+.++.... +.....+...+.-++.. 
T Consensus 312 ~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~ 391 (503) T protein:vir:59 312 SGDGGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETIGGGATGPALENLYALLDLKANMAERKIRAGLRLFFWF 391 (503) T ss_pred cCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 665555555444445555666677777776666666321111 1111111111 11122222333333333 Q ss_pred HHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccccccccc Q lcl|NC_021305. 325 IQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGA 404 (518) Q Consensus 325 ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~ 404 (518) +...++..-.........+.+.+..-+..|..+.++.+.+++.+|+++...+.++++.- +++..+ + ..+.. T Consensus 392 i~~~~~~~~~~~~~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~et~l~~l~~v--~d~~~E-~------~ri~~ 462 (503) T protein:vir:59 392 FAEYLRNTGKGDFNPDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMSKETAVARNPFV--QDPEEE-L------ARIEE 462 (503) T ss_pred HHHHHHhccCcccccccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCchHHHHHhCCCC--CCHHHH-H------HHHHH Confidence 33222211111111122356777888999999999999999999999999899887653 322111 1 11100 Q ss_pred ccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchh Q lcl|NC_021305. 405 TPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRST 449 (518) Q Consensus 405 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (518) ......+.........++.....++...++++... .+++.. 
T Consensus 463 E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~g~~~ 503 (503) T protein:vir:59 463 EMNQYAEMQGNLLDDEGGDDDLEEDDPNAGAAESG----GAGQVS 503 (503) T ss_pred HHHHHHhhhccccCccCCCCCCCcCCCCCCcccCC----CCCCcC Confidence 00000000000000000000000000000001100 011110 No 145 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=99.12 E-value=1.6e-09 Score=68.75 Aligned_cols=403 Identities=14% Similarity=0.104 Sum_probs=172.0 Q ss_pred CCCCcccccccch--hh---h-------------hhhccc--ccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccC Q lcl|NC_021305. 7 QTLSAPAMAELSP--QM---Q-------------DSYYYA--PAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARL 66 (518) Q Consensus 7 ~~~~~~~~~~~~~--~~---~-------------~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l 66 (518) =+.+.|....+++ ++ . ..+.-+ .....+.. .....+.......+...+|+..+..+--. T Consensus 1 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~~~i~~~~~~-~~~~~~~~~~~~n~~~~ivd~~~~~l~~~ 79 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREEMLNLFTERTQDLGDNTAYYESERRPDAVGVT-VPQQMQKLLAHVGYPRLYIDAIAARQELE 79 (484) T ss_pred CCCcccccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccc-cchhHHhhhhhcCcHHHHHHHHHhhhccC Confidence 1111111111111 00 0 001000 00000000 00111122233455667777777766545 Q ss_pred ceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCce-------EEEEeeCCcee Q lcl|NC_021305. 67 PVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTP-------EKLMPMHPSRV 139 (518) Q Consensus 67 ~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~-------~~l~~l~p~~v 139 (518) +|.+ .++ ...+..+..++.+ | ........+..+.+.+|.+|+.+.++..|.. 
..+.+++|..+ T Consensus 80 g~~~---~~~---~~~~~~l~~i~~~-N---~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~ 149 (484) T protein:vir:77 80 GFRL---GGA---DKADEQLWDWWQA-N---DLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPTNL 149 (484) T ss_pred ceec---CCc---chhHHHHHHHHHh-c---CHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEecccee Confidence 5543 211 1112333444432 2 3456677889999999999999988887753 24778889988 Q ss_pred EEEEcCCceeeEEee--ecccccCc--eeEE-------------------------eccccEEEEeccCCCCcccCchHH Q lcl|NC_021305. 140 AIKRNSRTGRYEYYF--QAGAGVGT--QLVS-------------------------FADDEVVPIRFFNPDGLERGLSLM 190 (518) Q Consensus 140 ~v~~~~~~~~~~~~~--~~~~~~~~--~~~~-------------------------~~~~evih~~~~~~~~~~~G~s~l 190 (518) .+.++.......+.+ ......+. .... +..=.|++|.++...+..+|.|.+ T Consensus 150 ~~~~D~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s~i 229 (484) T protein:vir:77 150 YAQIDPRTRQVMRAIRAIEDEEGNEVIGATLYLPNNTVIWNREDGQWVQVANVAHNLEMVPVIPIPNRTRLSDLYGTTEI 229 (484) T ss_pred EEEecCCCCceEEEEEEEEeecCCcEEEEEEEecCeEEEEEecCCceEeeccccCCCCCcceEEeccccccCccCCcccc Confidence 877775432221111 10000000 0001 111234666654444445677765 Q ss_pred H----HHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHH--HHHHHHHHHhcCccccCCeeecC-CCcceee Q lcl|NC_021305. 191 E----SLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQ--RLREQFDRAHSGSSNTGKTMVVE-EGMEPIP 263 (518) Q Consensus 191 ~----~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~--~~~~~~~~~~~g~~n~g~~~vl~-~g~~~~~ 263 (518) . .+.+.+.....-......++ +.|.-++.- ...++...+ .-...|+. 
..+.++.++ ++.++.+ T Consensus 230 ~~~v~~L~Da~~~~~s~~~~~~~~~---a~p~~~i~G-~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~q 299 (484) T protein:vir:77 230 TPELRSVTDAAARTLMLMQATAELM---GVPQRLLFG-VKGEELGVDPETGQTLFDA------YLARILAFEDHESKAQQ 299 (484) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhh---hhhHHHHhC-CCcchhcccccccchhhhh------hhhhhcccCCCCceeEe Confidence 4 33333332222222222333 334433321 111111000 01111211 123455555 4677777 Q ss_pred ccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHH--------HHHHHHHhhhh Q lcl|NC_021305. 264 LQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIAR--------IQSAMDKYVGQ 335 (518) Q Consensus 264 l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~--------ie~~l~~~l~~ 335 (518) +....-+ -|++.++..+..|+..-++|++.+|.... |.++.++.. +....+.-.+.. +++.+...+.- T Consensus 300 ~~~~~~e-~~~~~l~~~i~~~s~~~~~p~~~fg~~~~-n~~Sg~Al~--~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~ 375 (484) T protein:vir:77 300 FSAAELR-NFVDALDALDRKAAAYTGLPPYYLSFSSE-NPASAEAIR--SSESRLVKTVERKNKIFGGAWEQAMRVAYKV 375 (484) T ss_pred ecCCChH-HHHHHHHHHHHHHhcccCCCHHHhccccC-cchHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6644333 37788888888899999999999875432 223332222 111111111111 22121111110 Q ss_pred hhc-----ccccceecchhhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceeeeccc---ccccccc Q lcl|NC_021305. 336 YWV-----RKNRMKFDIDDVIQPDWEAKSESTQKMVNSG--VATPNEGREIMGLPRSDDPKADELYANSA---LQPLGAT 405 (518) Q Consensus 336 ~~~-----~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~g~~p~~~~~gD~~~~~~n---~~~~~~~ 405 (518) ... ....+++.|.+....+..+.++.+.+++++| +++..-+++++|+.+-+-.....+.--.. ...++.. 
T Consensus 376 ~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~~~~~~ 455 (484) T protein:vir:77 376 MNGGDIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWDEEEQAQGLGLMGTM 455 (484) T ss_pred hCCCCcccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHhhh Confidence 111 1124566777778889999999999999876 88988899999885432111110000000 0000000 Q ss_pred cccCCCCCCCCCCCCCccCCCCCCCccccCCcccccc Q lcl|NC_021305. 406 PDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSP 442 (518) Q Consensus 406 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 442 (518) .+.. .+.++.+ ...+++++.+++..+... T Consensus 456 ~~~~--~~~~~~~------~~~~~~~~~~~~~~~~~~ 484 (484) T protein:vir:77 456 FGTD--PSGGGNP------DNPETPEPQPNPAEEAAA 484 (484) T ss_pred cccc--ccCCCCC------CCCCcccccCCCccccCC Confidence 0000 0000000 000000000000000000 No 146 >protein:vir:8654 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817773;genbank:gi:29566205;genbank:GeneID:1259465 Probab=99.11 E-value=2e-10 Score=73.73 Aligned_cols=491 Identities=14% Similarity=0.103 Sum_probs=232.5 Q ss_pred CcCCCCC------CCCccccc----ccchhhhhh---hcccccccccccccchhhh-HHHhhcHHHHHHHHHHHHhhccC Q lcl|NC_021305. 
1 MLLANGQ------TLSAPAMA----ELSPQMQDS---YYYAPAVGMQLERQFSLYG-GIYKNQPWVRTVIAKRAQALARL 66 (518) Q Consensus 1 ~~f~~~~------~~~~~~~~----~~~~~~~~~---~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~~~v~~ia~~ia~l 66 (518) |-=...+ ..|.++++ ..+..+.++ +...+ +.+..+....-+ +.|.-.+.++-.|..|+++|+++ T Consensus 1 ma~~~lr~~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks~--~~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~Sr~ 78 (629) T protein:vir:86 1 MAPTSLRIVRRPKSEPVSTRQRALVAASQPVENPGKAFRKAM--GSSTRTDWQEDAWKAYDAVGELRYYVGWRSSSASRV 78 (629) T ss_pred CCccceeeeecCCCCChhhhhhhhhhhhhccccccchhhhhc--CCCchhhhhHHHHHHHHhhhhHHHHhhhhhhhhcee Confidence 4332222 22222211 111222111 11111 111111122111 23444788899999999999999 Q ss_pred ceEEEEecCCcce---eccc-----hHHHHHHhcCCc-CCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCc------eE-E Q lcl|NC_021305. 67 PVKCMFTSGDTET---EESD-----TGYAKLLADPCE-YLDPFAFWEWVASTLDIYGETYLAIQKNKSGT------PE-K 130 (518) Q Consensus 67 ~~~v~~~~~~~~~---~~~~-----~~~~~L~~~PN~-~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~------~~-~ 130 (518) .+..-+.+.++.. ...+ .....+...+-. .+-..++++.+..++-+-|++|+.+.--..|. ++ + T Consensus 79 rL~as~idpDtg~ptg~i~e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~~~d~~~~~~~e 158 (629) T protein:vir:86 79 RLIASAIDPDTGLPTGSIDEDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKSRLDSNGNPVPE 158 (629) T ss_pred eeEeeeecCCCCCCccccCCCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCCCccCCCCcchhh Confidence 9988877744322 1222 123334444443 45677899999999999999999887433332 22 2 Q ss_pred EEeeCCceeEEEEcCCceeeEEeeecccccCceeEEeccccEEEEeccCC--CCcccCchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 131 LMPMHPSRVAIKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNP--DGLERGLSLMESLKSTIFSEDSSRNATA 208 (518) Q Consensus 131 l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~--~~~~~G~s~l~~~~~~i~~~~~~~~~~~ 208 (518) ++.+.++.|.-. .++.. . .-..+...+.....+++ ||..+| ..-.+--||+.+++..+.......+... 
T Consensus 159 W~~vt~~ei~~~---~~~~~-i----~lP~g~~~e~~~~~d~l-~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~ 229 (629) T protein:vir:86 159 WLALTPEEVRAS---EKKTI-I----ELPTGDKHEFRDGLDGM-FRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIA 229 (629) T ss_pred heeechHHhhhc---cCcee-e----EcCCCCcceeeCCCceE-EEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHH Confidence 334444444311 11111 1 11223333334444554 776555 3334678999999999998888888877 Q ss_pred HHHHccCCcccccccCccCC----------------------HHHHHHHHHHHH----HHhcCcc--ccCCeeecC---- Q lcl|NC_021305. 209 AMWKNAGRPNLVLRHEKRLS----------------------EAAQQRLREQFD----RAHSGSS--NTGKTMVVE---- 256 (518) Q Consensus 209 ~~~~ng~~p~~il~~~~~~~----------------------~~~~~~~~~~~~----~~~~g~~--n~g~~~vl~---- 256 (518) +..+...+..|||-++..++ ....++|.+.|. ..+...+ .+--++|+. T Consensus 230 aaakSRL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E 309 (629) T protein:vir:86 230 NASKSRLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGE 309 (629) T ss_pred HHHHHHHhhCceeeeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechH Confidence 77777777777765543211 113445555554 3332211 122223322 Q ss_pred --CCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHH-HhccccccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhh Q lcl|NC_021305. 257 --EGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPP-IVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYV 333 (518) Q Consensus 257 --~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l 333 (518) ++++...+.... 
+.--+++++..+..||....|||+ ++|+..++|.=+.=+-...-++-.|.|.+..|+++|++.+ T Consensus 310 ~i~~i~hlkf~~ei-~e~aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~AlT~~~ 388 (629) T protein:vir:86 310 LIKNVTHLKFDNQV-TEVAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVRLHILPPVEMLCEAITNQV 388 (629) T ss_pred HhcCeeEEeecCch-hHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchHHHHHHHHHHhhH Confidence 234444444433 334578999999999999999987 5666556554333334445566789999999999999987 Q ss_pred hhhh----c---ccccceecchhhhh-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC-----C------cceee Q lcl|NC_021305. 334 GQYW----V---RKNRMKFDIDDVIQ-PDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDP-----K------ADELY 394 (518) Q Consensus 334 ~~~~----~---~~~~~~fd~~~l~~-~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~-----~------gD~~~ 394 (518) |.+. + ..|-+.||.+.|.. .|..+ -...+...|.+|-...|+.+|+.-.+.. . .|.+. T Consensus 389 Lrp~Le~eGiDp~kYvvW~DaS~Lt~dPd~~d---eA~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~ 465 (629) T protein:vir:86 389 LRTVLMREGIDPNAYVVWHDASQLTVDPDKTD---EARDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVG 465 (629) T ss_pred HHHHHHHhCCCHHHhEeeecCcccccCCCCcH---HHHHHHHcCCcCHHHHHHHhcCccccccCCCchHHHHHHHHHhhh Confidence 7432 1 24567899888843 33333 3456778999999999999999753211 1 11111 Q ss_pred ecccc----cccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcccCCchhhH Q lcl|NC_021305. 395 ANSAL----QPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSP 470 (518) Q Consensus 395 ~~~n~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~ 470 (518) ...++ .++.... .....+.+.+.-+.. ...+++++..++..+.++++++..++...- ++....... 
T Consensus 466 ~~P~Li~~~a~l~~~~--a~~~~P~~~~~~pp~-~e~~~~dE~sga~~~~ep~te~d~~~~~a~-------~aa~~~~~~ 535 (629) T protein:vir:86 466 QDPNLLPTLAVLIPEL--ADVEFPTPTVALPPA-EEQDGDEEASGASRREEPDTEDDAGTDDSD-------QASLDSRET 535 (629) T ss_pred hCcchhhhhhhhhhhh--cccccCccCCCCCcc-ccCCCcccccCCCcCCCCCCCCCCcccccC-------CCCCCCcHH Confidence 11111 1110000 000111111110000 000111111122222233333222211110 000000011 Q ss_pred HHHHH-HHHhhc-cccCcCchhHHHHHHHHHHHhHHH-Hhhhh---hhhccc----CC Q lcl|NC_021305. 471 KHLRA-VKGAMG-RGKDIKGFALQLAEKYPDDLEDIL-LAVQL---ALAERK----DN 518 (518) Q Consensus 471 ~~~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~---~~~~~~----~~ 518 (518) ..++. +..+|. -||.-.+-..+ .++++.-.+-. ..+-| .-+.|- |. T Consensus 536 a~V~llv~RALelAGkR~r~r~~~--a~~r~v~~he~h~~l~Pv~~~~v~rli~gwd~ 591 (629) T protein:vir:86 536 AMVEALVFRALELAGKRSRTRSLP--YELRQLSDRELVRRLEPVRREHVADLIRGWDS 591 (629) T ss_pred HHHHHHHHHHHHhcCCcCCChhhH--HHHhccChhhcceecCCCChHHHHHHHHHHHH Confidence 11222 233331 22222221222 22221110000 00000 000011 00 No 147 >protein:vir:106027 Length: 629 # NCBI annotation: gp9 # Family: family:all:2798 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654906;genbank:gi:109392362;genbank:GeneID:4157055 Probab=99.09 E-value=4.8e-10 Score=71.69 Aligned_cols=491 Identities=14% Similarity=0.111 Sum_probs=229.1 Q ss_pred CcCCCCC----CCCcccccccchhhh--hhh--cccccccccccccchhhh-HHHhhcHHHHHHHHHHHHhhccCceEEE Q lcl|NC_021305. 
1 MLLANGQ----TLSAPAMAELSPQMQ--DSY--YYAPAVGMQLERQFSLYG-GIYKNQPWVRTVIAKRAQALARLPVKCM 71 (518) Q Consensus 1 ~~f~~~~----~~~~~~~~~~~~~~~--~~~--~~~~~~~~~~~~~~~~~~-~~~~~~~~v~~~v~~ia~~ia~l~~~v~ 71 (518) |-=...+ +...|+++.+.+.-+ ++- +.....|.........-+ +.|.-.+.++-.|..++++|+++.+..- T Consensus 1 ma~~~lrv~rrpk~~p~~r~l~aasqp~~P~~~~~~~~~g~~~~~~WQ~eAW~~~d~VgElryyvgW~~ss~Sr~rL~as 80 (629) T protein:vir:10 1 MAASTLRVSRRPKGSPARRSLTAASQPMEPGRTPSRQVAGTVVRTSWQNEAWECMDLVGELRYYVGWRASSCSRVELIAS 80 (629) T ss_pred CCccceeEEecCCCccceeeeccccCCCCcchhhchhhhhhhhhhhhhHHHHHHHHhhhhHHHHhhhhhhhheeeeEEEe Confidence 4333222 222344444333211 000 000001111111111111 2234447788899999999999999887 Q ss_pred EecCCcce-----eccchHHH----HHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCC----ceE-EEEeeCCc Q lcl|NC_021305. 72 FTSGDTET-----EESDTGYA----KLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSG----TPE-KLMPMHPS 137 (518) Q Consensus 72 ~~~~~~~~-----~~~~~~~~----~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G----~~~-~l~~l~p~ 137 (518) +.+.+... +.+++.-. ....--..-+...++++.+..++-+-|+.|+.++--..+ .+. ..+.+..+ T Consensus 81 ~idpDtg~ptg~i~ed~p~~~~v~~~v~~iagG~lGqaqLlkr~~~~ltV~GE~~i~il~~~~~~pd~~~r~~W~vVt~~ 160 (629) T protein:vir:10 81 ELDPDTGKPTGGIRDDDPDGLRFLEIVKTMAGGPLGQAQLQKRAAECLTVPGEHRICLLDQGDKNPDGSVRHNWYVVTND 160 (629) T ss_pred eecCCCCCCccccccCchhHHHHHHHHHHhcCccchHHHHHHHHHhheeccCceEEEEeecCCCCCCcccccceeeecHH Confidence 77644321 11222111 122233445677889999999999999999998743333 233 22333333 Q ss_pred eeEEEEcCCceeeEEeeecccccCceeEEeccccEEEEeccCC--CCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccC Q lcl|NC_021305. 138 RVAIKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNP--DGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAG 215 (518) Q Consensus 138 ~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~--~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~ 215 (518) .|. ...++...... .++...+|..+.-+.||..+| ..-..--||+.+++..+.......+...+..+... 
T Consensus 161 Ei~---~kg~g~~~i~l-----pdg~~he~~~~~D~l~RvW~P~Prr~~e~DSpvra~l~~lrEi~r~tk~i~~aakSRL 232 (629) T protein:vir:10 161 EVK---NKGAGKTDIEL-----PDGTIHEYSKGRDVMFRVWNPRPRRAKEPDSPVRACLDSLREIIRTTKKIRNASKSRL 232 (629) T ss_pred Hhc---cccCceeEEEc-----CCCceeeeeCCCeeEEEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHHhHH Confidence 332 11112111111 123344554444445565554 33346789999999999988888887777777777 Q ss_pred CcccccccCccCC------------H----------HHHHHHHHHHHHH----hcCcc--ccCCeeec--C----CCcce Q lcl|NC_021305. 216 RPNLVLRHEKRLS------------E----------AAQQRLREQFDRA----HSGSS--NTGKTMVV--E----EGMEP 261 (518) Q Consensus 216 ~p~~il~~~~~~~------------~----------~~~~~~~~~~~~~----~~g~~--n~g~~~vl--~----~g~~~ 261 (518) +..|||-++..++ + ...+.|.+.|.+. +...+ .+--++|+ + ++++. T Consensus 233 ~gnGvlflP~e~slp~~~ap~~~~~Pg~~~p~~~g~aa~d~l~~~l~q~a~aAi~De~S~aA~vPiia~vP~E~l~~ikh 312 (629) T protein:vir:10 233 IGNGVVFLPQELSLPRATAPVADNQPGAPVPIVDGVAAADELSNLLFQTAAAAVDDEDSQAALIPLLATVPGEHLQKIFH 312 (629) T ss_pred hhCceeEeccCcccccccCCCCCCCCcccccccCCCcchHHHHHHHHHHHHhhhcCCCCccceeeeEEeechHHhcCeee Confidence 7777765543211 0 1344455544332 21111 11112222 2 22333 Q ss_pred eeccCChhhHHHHHHHHHHHHHHHHHhcCCHH-HhccccccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhh--- Q lcl|NC_021305. 262 IPLQLTAVEMQFIEARQLNREEVCGVYDIAPP-IVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYW--- 337 (518) Q Consensus 262 ~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~--- 337 (518) -.+..... ---+++++..+..+|....|||+ ++|+..++|.=+.=|-...-++..|.|.+..|+++|++.++... 
T Consensus 313 Lkf~~eit-e~~iktR~daI~RlAmglDispErLLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~Ait~~~Lrp~L~~ 391 (629) T protein:vir:10 313 LKIGNEIT-EVEIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVQLHIKPVMEVLCAAIYREVLVATLRA 391 (629) T ss_pred eeecCchh-HHHHhhHHHHHHHHHhccCCChhheeeccCCccceeeEEecccceeeecchHHHHHHHHHHhHHHHHHHHH Confidence 33343333 33578899999999999999987 56665566643433444455667899999999999999876432 Q ss_pred -c---ccccceecchhhhh-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCccee-----------eeccccc- Q lcl|NC_021305. 338 -V---RKNRMKFDIDDVIQ-PDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADEL-----------YANSALQ- 400 (518) Q Consensus 338 -~---~~~~~~fd~~~l~~-~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~-----------~~~~n~~- 400 (518) + ..|-+.||.+.|.. .|..+ -...+...|.+|-...|+.+|+..-+..--+.+ ..+.++. T Consensus 392 eGiDp~~Yvvw~DaS~Lt~dPd~~d---eA~~a~drGaIt~eAlRr~lG~~~dd~y~~~t~~~~q~~A~~~v~~~P~Li~ 468 (629) T protein:vir:10 392 EGIDPDRYVLWYDASGLTVDPDKTD---EATAAKEQGAITHEAYRRYLGLADEDGYDLETLEGAQAWARDAIVADPSLIK 468 (629) T ss_pred hCCCHHHhEeeecCcccccCCCCcH---HHHHHHHcCCccHHHHHHHhccccccCCCcCCcHHHHHHHHHHhcCCCchhh Confidence 1 24567899888733 33333 344567899999999999999965432111111 1111111 Q ss_pred ---ccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcccCCchhhHHHHH-HH Q lcl|NC_021305. 
401 ---PLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLR-AV 476 (518) Q Consensus 401 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~-~~ 476 (518) |+....-+..+..+.+....+++.+..+++++++..++..+.+...- ....+-....+-.+ .+ T Consensus 469 ~~apll~~~l~~i~~P~p~~a~~~~~~~~~~~E~~~~~~e~~~e~dA~~a-------------~~~~~~aa~~~A~rllv 535 (629) T protein:vir:10 469 VLAPLLTDELAEIDWPEPPAALPPGEDDQADEEQDTTGSEPSTEDDAEAA-------------ARISSVADMVLAERLLT 535 (629) T ss_pred hhhhhcCCccccccccCCCCcCCCCCcccCccccCCCCCCcCCCcchhhc-------------ccCCchhhHHHHHHHHH Confidence 11111001111111111111222222222222222222111111110 01111111111111 12 Q ss_pred HHhh--ccccC--cCchhHHHHHHH----------------HHHHhHHHHhhhhhhhc-----c-cCC Q lcl|NC_021305. 477 KGAM--GRGKD--IKGFALQLAEKY----------------PDDLEDILLAVQLALAE-----R-KDN 518 (518) Q Consensus 477 ~~~~--~~~~~--~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~-----~-~~~ 518 (518) ..+| +++.= ..+-+.+ .++ ++++-.+--+---+|-+ - -|+ T Consensus 536 ~RALelAGkRl~~~rdR~~~--ar~~~vp~he~h~~l~Pv~~~~v~rli~gwd~~l~~~~~a~lg~D~ 601 (629) T protein:vir:10 536 VRALGLAGKRRVNTNDRAQK--ARLAGIAPHDYHRVMGPVADADIPRLIAGWDEGLEEEALALLGVDS 601 (629) T ss_pred HHHHHHccccccCCCchhhH--HHhhcCChhhceeecCCCChhHHHHHHHhhhhHHHHHHHHHhCCCh Confidence 2333 11111 1111111 111 11111110111111110 0 011 No 148 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=99.08 E-value=1.4e-09 Score=69.20 Aligned_cols=404 Identities=14% Similarity=0.113 Sum_probs=169.4 Q ss_pred CcCCCCCCCCcccccccchhhhhhhccccc--ccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPA--VGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) =|...-.. ..+....+..+.. |.... .+.... 
...+.....+.+...+|+..+..+...+|.+ .++ T Consensus 20 ~L~~~~~~-~~~r~~~~~~YY~---G~~~i~~~~~~~~---~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~---~~~-- 87 (485) T protein:vir:24 20 EMVSAFED-QNQNLRSNTSYYE---AERRPEAIGVTVP---VQMQSLLAHVGYPRLYVDSIAERQAVEGFRL---GDA-- 87 (485) T ss_pred HHHHHHHH-HHHHHHHHHHHHh---ccCchhhcCcccc---hhhhhhhhccchHHHHHHHHhhhhccCceec---CCC-- Confidence 01111000 0000000000000 00000 000000 0111122223455667777776665555543 111 Q ss_pred eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCce-------EEEEeeCCceeEEEEcCCceeeE Q lcl|NC_021305. 79 TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTP-------EKLMPMHPSRVAIKRNSRTGRYE 151 (518) Q Consensus 79 ~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~-------~~l~~l~p~~v~v~~~~~~~~~~ 151 (518) ...+..+..++.+ | +...+...+..+++.+|.+|+++-++..+.. ..+.+++|..+.+.++....... T Consensus 88 -~~~~~~l~~i~~~-N---~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~i~D~~~~~~~ 162 (485) T protein:vir:24 88 -DEADEELWQWWQA-N---NLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPTRMYAEIDPRIGRPA 162 (485) T ss_pred -chhHHHHHHHHHh-c---ChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccceeEEEeeCCcCcee Confidence 1122233444433 2 3456778889999999999999988766532 25778899988887765432211 Q ss_pred E--eeecccccC--ceeEEec-------------------------cccEEEEeccCCCCcccCchHHHH-HHHHHHHHH Q lcl|NC_021305. 152 Y--YFQAGAGVG--TQLVSFA-------------------------DDEVVPIRFFNPDGLERGLSLMES-LKSTIFSED 201 (518) Q Consensus 152 ~--~~~~~~~~~--~~~~~~~-------------------------~~evih~~~~~~~~~~~G~s~l~~-~~~~i~~~~ 201 (518) + .+......+ .....|. .=.|+||++....+..+|.|.+.- +...+.... 
T Consensus 163 ~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~liDa~~ 242 (485) T protein:vir:24 163 KAIRVAYDAEGNEIQAATLYTPNETFGWFRAEGEWVEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAA 242 (485) T ss_pred EEEEEEEeecCCeEEEEEEEcCCcEEEEEecCCceEeecccccCCCcccEEEeccCcccCCcCCcccchhhHHHHHHHHH Confidence 1 111110000 0011111 223455554433344578876542 333333333 Q ss_pred HHHHHHHHHHHccCCcccccccCccCCHHHHHH----HHHHHHHHhcCccccCCeeecC-CCcceeeccCChhhHHHHHH Q lcl|NC_021305. 202 SSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQR----LREQFDRAHSGSSNTGKTMVVE-EGMEPIPLQLTAVEMQFIEA 276 (518) Q Consensus 202 ~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~----~~~~~~~~~~g~~n~g~~~vl~-~g~~~~~l~~~~~d~~~~e~ 276 (518) .+..-......-.+.|.-++.- .++++... -...|+. ..+.++.++ ++.++.++....-+ .+.+. T Consensus 243 ~~~s~~~~~~~~~a~p~~~i~G---~~~~~~~~~~~~~~~~~~~------~~~~i~~~~~~~~~~~q~~~~~~e-~~~~~ 312 (485) T protein:vir:24 243 RILMLMQATAELMGVPQRLIFG---IKPEEIGVDPETGQTLFDA------YLARILAFEDAEGKIQQFSAAELA-NFTNA 312 (485) T ss_pred HHHHHHHHHHHhhcchhhhhcc---CCccccccccccccchhhh------cccceeccCCCCceEEeecccchH-HHHHH Confidence 3322222223333445444431 11111100 0111211 223455554 56777666543322 36777 Q ss_pred HHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhh------hhhh-------cccccc Q lcl|NC_021305. 277 RQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYV------GQYW-------VRKNRM 343 (518) Q Consensus 277 ~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l------~~~~-------~~~~~~ 343 (518) ++..+..++..=++|+..+|.... |.++.++ ..+....+.-.+...+..+...| +... 
.....+ T Consensus 313 l~~~i~~~s~~~~~p~~~fg~~~~-n~~Sg~A--l~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~~~i 389 (485) T protein:vir:24 313 LDQIAKQVAAYTGLPPQYLSTAAD-NPASAEA--IRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKGGDVPPDMLRM 389 (485) T ss_pred HHHHHHHHhcccCCCHHHhccccC-cchHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccee Confidence 777788888888999998875432 2222222 11222222222222222111111 0011 111345 Q ss_pred eecchhhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceee---ecccccccccccccCCCCCCCCCC Q lcl|NC_021305. 344 KFDIDDVIQPDWEAKSESTQKMVNSG--VATPNEGREIMGLPRSDDPKADELY---ANSALQPLGATPDGAVEWEEAPAP 418 (518) Q Consensus 344 ~fd~~~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~g~~p~~~~~gD~~~---~~~n~~~~~~~~~~~~~~~~~~~~ 418 (518) ++.|......+..+.++.+.+++.+| +++..-+++.+|+.+-+-....... .......++...+.....+.++.. T Consensus 390 ~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (485) T protein:vir:24 390 ETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPNP 469 (485) T ss_pred eEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHHHHHHHHhhhhhhHHHhhcccCCCCCCCCCC Confidence 56667777888999999999998876 7888778888887543211111100 000000111111111111111111 Q ss_pred CCCccCCCCCCCccccCCccccccchhc Q lcl|NC_021305. 419 KRPASTPVASLDQSPPTSVPGLSPTNSD 446 (518) Q Consensus 419 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (518) ++..+. .+...+.... T Consensus 470 ~e~~~~------------~~~~~~~~~a 485 (485) T protein:vir:24 470 TPAPKP------------QPAIEGGDSA 485 (485) T ss_pred CCCCCC------------ccCCCCCCCC Confidence 110000 0000000000 No 149 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=99.06 E-value=1.2e-09 Score=69.50 Aligned_cols=371 Identities=11% Similarity=0.038 Sum_probs=173.1 Q ss_pred ccchhhhhhhcccccccccccccchhhhHHH--hhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHHhcC Q lcl|NC_021305. 
16 ELSPQMQDSYYYAPAVGMQLERQFSLYGGIY--KNQPWVRTVIAKRAQALARLPVKCMFTSGDTETEESDTGYAKLLADP 93 (518) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~P 93 (518) -+.+ .....+.... ....+...+|+.+++.+--..|.+ . +. . .+..+..++.+ T Consensus 1 ~l~~-----------------~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~~---~-d~--~-~~~~~~~i~~~- 55 (434) T protein:vir:98 1 MLPK-----------------NAEQAFLDFQRKARTNFCGLIANASVHRLLALGVTG---P-DG--E-PDTRASRWWQA- 55 (434) T ss_pred CCCC-----------------CccHHHHHhhhhhhccchHHHHHHHHhhhccCceec---C-CC--c-hHHHHHHHHHh- Confidence 0000 0001111111 112345678887777654444432 1 11 1 12233344443 Q ss_pred CcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCc------eEEEEeeCCceeEEEEcCCceeeEEee--ecccccCce-- Q lcl|NC_021305. 94 CEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGT------PEKLMPMHPSRVAIKRNSRTGRYEYYF--QAGAGVGTQ-- 163 (518) Q Consensus 94 N~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~------~~~l~~l~p~~v~v~~~~~~~~~~~~~--~~~~~~~~~-- 163 (518) | +.......+..+.+.+|.+|+.+.++..+. -..+.+++|..+.+.++.......+.+ +.....+.. T Consensus 56 N---~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~~~~~~ 132 (434) T protein:vir:98 56 N---RLDSRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDIDGFGYA 132 (434) T ss_pred c---ChhHHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEeccCCceEE Confidence 3 344566678889999999999988766543 223677899998888865432211111 000000000 Q ss_pred --------------------------------------eEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 164 --------------------------------------LVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRN 205 (518) Q Consensus 164 --------------------------------------~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~ 205 (518) ...+..=.|+||.++.. ....|.|-++.....+.....+.. 
T Consensus 133 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~-~~~~g~sd~e~vi~liDa~~~~~s 211 (434) T protein:vir:98 133 RVFFDDTSFPYRTRERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPD-LGEDPEPEFAGVLDIQDRVNLGIL 211 (434) T ss_pred EEEEeCcEEEEEEeeccccccccccccceecccccccccCCCCccceEEeccCCC-cCcCCcchhhhHHHHHHHHHHHHH Confidence 00112223566654432 223588888888887777776665 Q ss_pred HHHHHHHccCCccccccc-C-ccCCHHHHHHHHHHHHHHhcCccccCCeeecC-CCcceeeccCChhhHHHHHHHHHHHH Q lcl|NC_021305. 206 ATAAMWKNAGRPNLVLRH-E-KRLSEAAQQRLREQFDRAHSGSSNTGKTMVVE-EGMEPIPLQLTAVEMQFIEARQLNRE 282 (518) Q Consensus 206 ~~~~~~~ng~~p~~il~~-~-~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~-~g~~~~~l~~~~~d~~~~e~~~~~~~ 282 (518) -......-.+.|.-++.- . ....++.. .....++. +. ...+.+++++ ++.++.++..+.. ..+.+.++..+. T Consensus 212 ~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-~~~~~~~~-~~--~~~~~i~~~~~~~~~~~q~~~~~~-~~~~~~l~~~i~ 286 (434) T protein:vir:98 212 NRMAASRFSGFRQKWIKGHKFAKRTDPAT-GMTVVDQP-FV--PSPSAVWASEGENTQFGQLDATDL-SGFLKEHASDVR 286 (434) T ss_pred HHHHHHHHhcchhhhhcCCCccccccccc-ccchhhhh-hh--ccccccccCCCCCceEEEecCcch-HHHHHHHHHHHH Confidence 555555545555544431 1 11111111 11111111 11 1223455555 4577766654322 237788888899 Q ss_pred HHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHH----HHHHHHHhh---hhhhc---ccccceecchhhhh Q lcl|NC_021305. 283 EVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIAR----IQSAMDKYV---GQYWV---RKNRMKFDIDDVIQ 352 (518) Q Consensus 283 ~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~----ie~~l~~~l---~~~~~---~~~~~~fd~~~l~~ 352 (518) .|+..-++|++.+|.. .+.++.++. .+....+.-.+.. +...+.+.+ +...+ ....+++.|.+... 
T Consensus 287 ~~~~~~~~p~~~~~~~--~~n~Sg~Al--~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~~~~~~~~~v~w~~~~~ 362 (434) T protein:vir:98 287 DMLTISQTPTYLYATD--LVNISADTI--GALDILHVAKVREHIASFSEGLESVLALAAAQAGVPEDYTEAEVRWANPAH 362 (434) T ss_pred HHhcccCCCHHHhccc--cCChHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCChhheeeeEEecCCCC Confidence 9999999999998742 122222221 1222222222222 222222111 11111 12345666778889 Q ss_pred cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCcc Q lcl|NC_021305. 353 PDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQS 432 (518) Q Consensus 353 ~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 432 (518) .+..+.++++.+++..|+ +..-+++++|+++-+ -+.+..-..-..+.... ..+..+++......++ +. T Consensus 363 ~s~~~~ada~~kl~~~g~-~~e~~~~~lg~~~~e---~~r~~~e~~~~~~~~~~-------~~~~~~~~~~g~~~~~-~~ 430 (434) T protein:vir:98 363 VTMAVKADAATKLKSIGY-PLDVIAEELDESPAR---VRRIVAGAASQALLAAS-------LLPAPGAPSAGNVPDS-GG 430 (434) T ss_pred CCHHHHHHHHHHHHhcCC-cHHHHHHhCCCCHHH---HHHHHHHHHHHHHHHHh-------hhccCCCCCCCCCCcc-cC Confidence 999999999999999886 677788888875421 11110000000000000 0000000000000000 00 Q ss_pred ccCC Q lcl|NC_021305. 433 PPTS 436 (518) Q Consensus 433 ~~~~ 436 (518) .+++ T Consensus 431 ~~dg 434 (434) T protein:vir:98 431 AVDG 434 (434) T ss_pred CCCC Confidence 0011 No 150 >protein:vir:99088 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655692;genbank:gi:109521770;genbank:GeneID:4157810 Probab=99.04 E-value=1.3e-10 Score=74.81 Aligned_cols=492 Identities=15% Similarity=0.117 Sum_probs=228.0 Q ss_pred CcCCCCC------CCCccccc----ccchhhhhh---hcccccccccccccchhhh-HHHhhcHHHHHHHHHHHHhhccC Q lcl|NC_021305. 1 MLLANGQ------TLSAPAMA----ELSPQMQDS---YYYAPAVGMQLERQFSLYG-GIYKNQPWVRTVIAKRAQALARL 66 (518) Q Consensus 1 ~~f~~~~------~~~~~~~~----~~~~~~~~~---~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~~~v~~ia~~ia~l 66 (518) |-=...+ ..|.++++ ..+..+.+. 
+...+ +.+..+....-+ +.|.-.+.++-.|..|+++|+++ T Consensus 1 ma~~~lr~~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks~--~~~~~~~WQ~eAW~~~d~v~Elry~vgW~~~s~Sr~ 78 (629) T protein:vir:99 1 MAPTSLRIVRRPKSEPVSTRQRALVAASQPVENPGKAFRKAM--GSSTRTDWQDDAWKAYDAVGELRYYVGWRSSSASRV 78 (629) T ss_pred CCccceeeeecCCCCChhhhhhhhhhhhhcccccchhhhhhc--CCCchhhhhHHHHHHHHhhhhHHHHhhhhhhhhcee Confidence 4332222 22221211 111222111 11111 111111121111 23444788899999999999999 Q ss_pred ceEEEEecCCcce---eccc-----hHHHHHHhcCCc-CCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCc------eE-E Q lcl|NC_021305. 67 PVKCMFTSGDTET---EESD-----TGYAKLLADPCE-YLDPFAFWEWVASTLDIYGETYLAIQKNKSGT------PE-K 130 (518) Q Consensus 67 ~~~v~~~~~~~~~---~~~~-----~~~~~L~~~PN~-~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~------~~-~ 130 (518) .+..-+.+.++.. ...+ .....+...+-. .+-..++++.+..++-+-|++|+.+.--..|. ++ + T Consensus 79 rL~as~idpDtg~ptg~i~e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~~~d~~~~~~~e 158 (629) T protein:vir:99 79 RLIASAIDPDTGLPTGSIDEDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKSRLDSNGNPVPE 158 (629) T ss_pred eeEeeeecCCCCCCccccCCCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCCCccCCCCcchhh Confidence 9988877744322 1222 123334444443 45677899999999999999999887433332 22 2 Q ss_pred EEeeCCceeEEEEcCCceeeEEeeecccccCceeEEeccccEEEEeccCC--CCcccCchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 131 LMPMHPSRVAIKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNP--DGLERGLSLMESLKSTIFSEDSSRNATA 208 (518) Q Consensus 131 l~~l~p~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~--~~~~~G~s~l~~~~~~i~~~~~~~~~~~ 208 (518) ++.+.++.|.-. .++.. . .-..+...+.....+++ ||..+| ..-.+--||+.+++..+.......+... 
T Consensus 159 W~~vt~~ei~~~---~~~~~-i----~lP~g~~~e~~~~~d~l-~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~ 229 (629) T protein:vir:99 159 WLALTPEEVRAS---EKKTI-I----ELPTGDKHEFRDGLDGM-FRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIA 229 (629) T ss_pred heeechHHhhhc---cCcee-E----EcCCCCccceeCCCceE-EEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHH Confidence 334444444311 11111 1 11223333334444544 776554 3334678999999999998888888877 Q ss_pred HHHHccCCcccccccCccCC----------------------HHHHHHHHHHHH----HHhcCcc--ccCCeeecC---- Q lcl|NC_021305. 209 AMWKNAGRPNLVLRHEKRLS----------------------EAAQQRLREQFD----RAHSGSS--NTGKTMVVE---- 256 (518) Q Consensus 209 ~~~~ng~~p~~il~~~~~~~----------------------~~~~~~~~~~~~----~~~~g~~--n~g~~~vl~---- 256 (518) +..+...+..|||-++..++ ....++|.+.|. ..+...+ .+--++|+. T Consensus 230 aaakSRL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E 309 (629) T protein:vir:99 230 NASKSRLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGE 309 (629) T ss_pred HHHHHHHhhCceeEeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechH Confidence 77777777777765543211 113445555554 3332211 122223322 Q ss_pred --CCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHH-HhccccccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhh Q lcl|NC_021305. 257 --EGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPP-IVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYV 333 (518) Q Consensus 257 --~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~-~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l 333 (518) ++++...+.... 
+.--+++++..+..||....|||+ ++|+..++|.=+.=+-...-++-.|.|.+..|+++|++.+ T Consensus 310 ~i~~i~hlkf~~ei-~e~aiktR~daI~RlA~glDippE~LLGlGsd~NHWsAWqI~dedvrlHI~P~l~~ic~AlT~~~ 388 (629) T protein:vir:99 310 LIKNVTHLKFDNQV-TEVAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHWSAWQIGDEDVRLHILPPVEMLCEAITNQV 388 (629) T ss_pred HhcCeeEEeecCch-hHHHHhhHHHHHHHHHhccCCchhhheeccCCccceEEEEecccceeeecchhHHHHHHHHHhhH Confidence 234444444433 334578999999999999999987 5666556554333334445566789999999999999987 Q ss_pred hhhh----c---ccccceecchhhhh-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC-----C------cceee Q lcl|NC_021305. 334 GQYW----V---RKNRMKFDIDDVIQ-PDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDP-----K------ADELY 394 (518) Q Consensus 334 ~~~~----~---~~~~~~fd~~~l~~-~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~-----~------gD~~~ 394 (518) |.+. + ..|-+.||.+.|.. .|..+ -...+...|.+|-...|+.+|+.-.+.. . .|.+. T Consensus 389 Lrp~Le~eGiDp~kYvvW~DaS~Lt~dPd~~d---eA~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~ 465 (629) T protein:vir:99 389 LRTVLMREGIDPNAYVVWHDASQLTVDPDKTD---EARDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVG 465 (629) T ss_pred HHHHHHHhCCCHHHhEeeecCcccccCCCCcH---HHHHHHHcCCccHHHHHHHhcCccccccCCCchHHHHHHHHHhhh Confidence 7432 1 23567899888843 33333 3456778999999999999999753211 1 11111 Q ss_pred ecccc----cccccccccCCCCCCCCCCCC-CccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcccCCchhh Q lcl|NC_021305. 395 ANSAL----QPLGATPDGAVEWEEAPAPKR-PASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESS 469 (518) Q Consensus 395 ~~~n~----~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~ 469 (518) ...++ .++.... .....+.+.+.- |+..+ +++++..++..+.++++++..++...- ++...... 
T Consensus 466 ~~P~Li~~~a~l~~~~--a~~~~P~~~~~~pp~~e~--~~~dE~sga~~~~ep~te~d~~~~~a~-------~aa~~~~~ 534 (629) T protein:vir:99 466 QDPNLLPTLAVLIPEL--ADVEFPTPTVALPPAEEQ--DGDEEASGASRREEPDTEDDAGTDDSD-------QASLDSRE 534 (629) T ss_pred hCcchhhhhhhhhhhh--cccccCccCCCCCccccC--CCcccccCCCcCCCCCCCCCCcccccC-------CCCCCCcH Confidence 11111 1110000 000001000000 00000 001111111111122222111100000 00000000 Q ss_pred HHHHHH-HHHhhc-cccCcCchhHH----HHHHH----------HHHHhHHHHhhhhhhhcc------cCC Q lcl|NC_021305. 470 PKHLRA-VKGAMG-RGKDIKGFALQ----LAEKY----------PDDLEDILLAVQLALAER------KDN 518 (518) Q Consensus 470 ~~~~~~-~~~~~~-~~~~~~~~~~~----~~~~~----------~~~~~~~~~~~~~~~~~~------~~~ 518 (518) ...++. +..+|. -||.-.+-..+ .+-.| ++++..|--+.+-+|-+. -|+ T Consensus 535 ~a~V~llv~RALelAGkR~r~r~~~ar~r~v~~he~h~~l~Pv~~~~i~rli~gwd~~ld~~~~~~Lg~d~ 605 (629) T protein:vir:99 535 TAMVEALVFRALELAGKRSRTRSLPYELRQLSDRELVRRLEPVRREHVADLIRGWDSMLEERAVQALNMNI 605 (629) T ss_pred HHHHHHHHHHHHHhcCCcCCChhhHHHHhcCchhhceeecCCCCHHHHHHHHHHHHHHHHHHHHHHhCCCH Confidence 000111 111220 11111111111 00001 122222211111111110 011 No 151 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=99.00 E-value=1.6e-09 Score=68.82 Aligned_cols=409 Identities=15% Similarity=0.121 Sum_probs=169.2 Q ss_pred cCCCCCCCCcccccc-----c-chh---------hhhhh-cccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhcc Q lcl|NC_021305. 2 LLANGQTLSAPAMAE-----L-SPQ---------MQDSY-YYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALAR 65 (518) Q Consensus 2 ~f~~~~~~~~~~~~~-----~-~~~---------~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~ 65 (518) |=++=+-...+.... + .-| +..-+ |-.+....+.. 
-....+.......+...+|+.++..+-- T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~-~~~~~~~~~~~~n~~~~ivd~~~~~l~~ 79 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFEDSTQNLKTNTSYYEAERRPEAIGVT-VPIQMQSLLAHVGYPRLYVDSIAERQAV 79 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCCC-CChhhhhhhhhcCcHHHHHHHHHhhhcc Confidence 111111010010000 0 000 00000 00110000000 0011122222234557777777776643 Q ss_pred CceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCc-------eEEEEeeCCce Q lcl|NC_021305. 66 LPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGT-------PEKLMPMHPSR 138 (518) Q Consensus 66 l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~-------~~~l~~l~p~~ 138 (518) .+|.+ .++ . ..+..+..++.+ -+...+...+..+++.+|.+|+.+.++..+. ...+.+++|.. T Consensus 80 ~g~~~---~~~--~-~~~~~~~~i~~~----N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~ 149 (485) T protein:vir:10 80 EGFRF---GDA--D-EADEELWQWWQA----NNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVEPPTR 149 (485) T ss_pred cceec---CCC--c-hhHHHHHHHHHh----cCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEEccce Confidence 34432 211 1 112233344432 2445677788999999999999988876532 23577888988 Q ss_pred eEEEEcCCceeeE--EeeecccccCc--eeEEeccc-------------------------cEEEEeccCCCCcccCchH Q lcl|NC_021305. 139 VAIKRNSRTGRYE--YYFQAGAGVGT--QLVSFADD-------------------------EVVPIRFFNPDGLERGLSL 189 (518) Q Consensus 139 v~v~~~~~~~~~~--~~~~~~~~~~~--~~~~~~~~-------------------------evih~~~~~~~~~~~G~s~ 189 (518) +.+.++....... +.+......+. 
....+.++ .|++|.+....+..+|.|- T Consensus 150 ~~~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~ 229 (485) T protein:vir:10 150 MYAEIDPRIGRVSKAIRVAYDAEGNEIQAATLYTPNDIFGWYRVENEWQEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSE 229 (485) T ss_pred eEEEEcCCCCceeEEEEEEEeeCCCeEEEEEEEeCCeEEEEEEcCCceEEeccccCCCCcccEEEeccccccCCCCCccc Confidence 8887765433221 11111111110 01112222 3455554333334567775 Q ss_pred HH----HHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHH--HHHHHHHHHhcCccccCCeeecC-CCccee Q lcl|NC_021305. 190 ME----SLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQ--RLREQFDRAHSGSSNTGKTMVVE-EGMEPI 262 (518) Q Consensus 190 l~----~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~--~~~~~~~~~~~g~~n~g~~~vl~-~g~~~~ 262 (518) +. .+.+.+.....-......+| +.|.-++.-- ...+...+ .-...|+. ..+.++.++ ++.++. T Consensus 230 i~~~v~~liDa~~~~~s~~~~~~~~~---a~p~~~i~G~-~~~~~~~~~~~~~~~~~~------~~~~i~~~~~~d~k~~ 299 (485) T protein:vir:10 230 ITPELRSMTDAAARILMLMQATAELM---GVPQRLIFGI-KPEEIGVDPETGQTLFDA------YLARILAFEDAEGKIQ 299 (485) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhh---cchHHHHhcC-Ccccccccccccchhhhh------cccceeccCCCCceEE Confidence 54 33333333332222222332 3343333210 01110000 00111211 123455555 567776 Q ss_pred eccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHH---HHHHHHHhhHHHHHHHHHHHHh--hhhhh Q lcl|NC_021305. 263 PLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM---RAFYRDTMAIPIARIQSAMDKY--VGQYW 337 (518) Q Consensus 263 ~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~---~~~~~~~l~P~~~~ie~~l~~~--l~~~~ 337 (518) ++....-+ -+.+.++..+.+|+..-++|++.+|.... |.++.++.. ..+...+ .-....+...+.+. |+... 
T Consensus 300 q~~~~~~~-~~~~~l~~~i~~~~~~~~~p~~~fg~~~~-n~~Sg~Al~~~~~~l~~k~-~~k~~~f~~~l~~~~~l~~~~ 376 (485) T protein:vir:10 300 QFSAAELA-NFTNALDQIAKQVAAYTGLPPQYLSTAAD-NPASAEAIRAAESRLIKKV-ERKNSIFGGAWEEAMRLAYRM 376 (485) T ss_pred eecccchH-HHHHHHHHHHHHHhcccCCCHHHhccccC-chhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH Confidence 66543322 37777888888899999999998875432 222222211 1111111 11111122222211 11100 Q ss_pred --c-----ccccceecchhhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceeeecc---cccccccc Q lcl|NC_021305. 338 --V-----RKNRMKFDIDDVIQPDWEAKSESTQKMVNSG--VATPNEGREIMGLPRSDDPKADELYANS---ALQPLGAT 405 (518) Q Consensus 338 --~-----~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~g~~p~~~~~gD~~~~~~---n~~~~~~~ 405 (518) . ....+++.|.+.+..+..+.++++.+++++| +++..-+++.+|+.+-+-.......--. ....++.. T Consensus 377 ~~~~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~~~~~~~ee~~~~~~~~~~~~ 456 (485) T protein:vir:10 377 MKGGDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLIGTM 456 (485) T ss_pred hCCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHhHHHHHHHHHHHHHHHHHHHHHHh Confidence 1 1134566777888899999999999999876 8888889999988653211111000000 00000100 Q ss_pred cccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchh Q lcl|NC_021305. 406 PDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRST 449 (518) Q Consensus 406 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (518) .. +..+.++..+..++.+++....++ .+. 
T Consensus 457 ~~--------~~~~~~~~~~~~~~~~~~~~~~~~-------~~~ 485 (485) T protein:vir:10 457 VD--------PNPTVPGSPSPAPAPKPAALESGG-------DAA 485 (485) T ss_pred hc--------cCCCCCCCCCccccccCcCCCCCC-------CCC Confidence 00 011101000000111111000011 000 No 152 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=98.95 E-value=3.3e-09 Score=67.06 Aligned_cols=411 Identities=14% Similarity=0.094 Sum_probs=166.8 Q ss_pred CcCCCCCCCCcccccccchhhh--------hhhcccc--cccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEE Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQ--------DSYYYAP--AVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKC 70 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~--------~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v 70 (518) ++...-.....-....+.-|.. ..+.-+. ....+.. -....+.......+...+|+.++..+--.+|.+ T Consensus 6 ~~~~e~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~-~~~~~~~~~~v~n~~~~iVd~~~~~l~~~g~~~ 84 (486) T protein:vir:42 6 PGMEEIEDPAVVREEMISAFEDASKDLASNTSYYDAERRPEAIGVT-VPREMQQLLAHVGYPRLYVDSVAERQAVEGFRL 84 (486) T ss_pred CCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcccc-cchhHhhhhhccchHHHHHHHHHhhhcccceec Confidence 1111111000000000000000 0010000 0000000 001111122233456777777777665455542 Q ss_pred EEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCc-------eEEEEeeCCceeEEEE Q lcl|NC_021305. 71 MFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGT-------PEKLMPMHPSRVAIKR 143 (518) Q Consensus 71 ~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~-------~~~l~~l~p~~v~v~~ 143 (518) .+ ....+..+..++.+ | +.......+..+++.+|.+|+.+.++..|. 
...+.+++|..+.+++ T Consensus 85 ---~~---~~~~~~~~~~i~~~-N---~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~i~ 154 (486) T protein:vir:42 85 ---GD---ADEADEELWQWWQA-N---NLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPTRMHAEI 154 (486) T ss_pred ---CC---CchhHHHHHHHHHh-c---ChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEecccceEEEE Confidence 11 11122334444443 3 234556778899999999999998776443 2356788999888887 Q ss_pred cCCceeeEEee--ecccccCc--eeEEeccc-------------------------cEEEEeccCCCCcccCchHHHH-H Q lcl|NC_021305. 144 NSRTGRYEYYF--QAGAGVGT--QLVSFADD-------------------------EVVPIRFFNPDGLERGLSLMES-L 193 (518) Q Consensus 144 ~~~~~~~~~~~--~~~~~~~~--~~~~~~~~-------------------------evih~~~~~~~~~~~G~s~l~~-~ 193 (518) +.......+.+ ......+. ....|.++ .|++|.++...+..+|.|-+.- + T Consensus 155 d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s~i~~~v 234 (486) T protein:vir:42 155 DPRINRVSKAIRVAYDKEGNEIQAATLYTPMETIGWFRADGEWAEWFNVPHGLGVVPVVPLPNRTRLSDLYGTSEITPEL 234 (486) T ss_pred eCCCCCeEEEEEEEEecCCCeEEEEEEEcCCcEEEEEecCCcEEeecceecCCCCceEEEeccccccCCCCCcccchhhH Confidence 74332211111 11100000 00111222 3444443322333467775542 2 Q ss_pred HHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHH----HHHHHHHHHhcCccccCCeeecC-CCcceeeccCCh Q lcl|NC_021305. 194 KSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQ----RLREQFDRAHSGSSNTGKTMVVE-EGMEPIPLQLTA 268 (518) Q Consensus 194 ~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~----~~~~~~~~~~~g~~n~g~~~vl~-~g~~~~~l~~~~ 268 (518) ...+.....+..-......-.+.|.-++.- .++++.. +-...|.. ..+++++++ ++.++.++.... 
T Consensus 235 ~~liDa~~~~~s~~~~~~e~~a~p~~~i~G---~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~q~~~~~ 305 (486) T protein:vir:42 235 RSMTDAAARILMLMQATAELMGVPQRLIFG---IKPEEIGVDSETGQTLFDA------YLARILAFEDAEGKIQQFSAAE 305 (486) T ss_pred HHHHHHHHHHHHHHHHHHHhhcchHHHhhc---CCccccccccccccchhhh------hhchhcccCCCCceEEeecccC Confidence 222222222222222222223334433331 1111110 00111211 224455554 556776665432 Q ss_pred hhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHH----HHHHHHHhh--hhhh--c-- Q lcl|NC_021305. 269 VEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIAR----IQSAMDKYV--GQYW--V-- 338 (518) Q Consensus 269 ~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~----ie~~l~~~l--~~~~--~-- 338 (518) .+ .+++.++..+..++..-++|++.+|.... |.++.++. .+....+.-.+.. +...|.+.+ +... . T Consensus 306 ~e-~~~~~l~~~i~~~s~~~~~p~~~fg~~~~-n~~Sg~Al--~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~ 381 (486) T protein:vir:42 306 LA-NFTNALDQIAKQVAAYTGLPPQYLSTAAD-NPASAEAI--RAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKGGD 381 (486) T ss_pred HH-HHHHHHHHHHHHHhcccCCCHHHhccccC-chhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 22 37778888888888889999998875432 22222221 1111111111111 111111111 0000 0 Q ss_pred ---ccccceecchhhhhcCHHHHHHHHHHHHhC--CCcCHHHHHHHhCCCCCCCCCcceeee---cccccccccccccCC Q lcl|NC_021305. 339 ---RKNRMKFDIDDVIQPDWEAKSESTQKMVNS--GVATPNEGREIMGLPRSDDPKADELYA---NSALQPLGATPDGAV 410 (518) Q Consensus 339 ---~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~--G~~T~NE~R~~~g~~p~~~~~gD~~~~---~~n~~~~~~~~~~~~ 410 (518) ....+++.|.+....+..+.++.+.+++++ |+++..-+++.+|+.+-+......+.- ......++...+... 
T Consensus 382 ~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~~~~~~~ 461 (486) T protein:vir:42 382 VPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDMGYSVKEREEMRRWDEEEAAMGLGLLGTMVDADP 461 (486) T ss_pred ccccceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHHHhhcCCC Confidence 113455667777888999999999999986 678888888888875432111111000 000011111111111 Q ss_pred CCCCCCCCCCCccCCCCCCCccccCCcccc Q lcl|NC_021305. 411 EWEEAPAPKRPASTPVASLDQSPPTSVPGL 440 (518) Q Consensus 411 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (518) ..+.++.++++...+ +.. +++.... T Consensus 462 ~~~~~~~~~~~~~~~---~~~--~~~~~~~ 486 (486) T protein:vir:42 462 TVPGSPSPTAPPKPQ---PAI--ESSGGDA 486 (486) T ss_pred CCCCCCCCCCCCCCC---ccc--CCCCCCC Confidence 111111111111100 000 0000000 No 153 >protein:vir:107517 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943786;genbank:gi:38638411;genbank:GeneID:2657197 Probab=98.93 E-value=3.1e-09 Score=67.19 Aligned_cols=491 Identities=14% Similarity=0.102 Sum_probs=226.1 Q ss_pred CcCCCCCC-------CCcccccccchhhhhh----hc-ccccccccccccchhh-hHHHhhcHHHHHHHHHHHHhhccCc Q lcl|NC_021305. 1 MLLANGQT-------LSAPAMAELSPQMQDS----YY-YAPAVGMQLERQFSLY-GGIYKNQPWVRTVIAKRAQALARLP 67 (518) Q Consensus 1 ~~f~~~~~-------~~~~~~~~~~~~~~~~----~~-~~~~~~~~~~~~~~~~-~~~~~~~~~v~~~v~~ia~~ia~l~ 67 (518) |-=...+. ++.+.++.++-.-+.. -. ..+..+ +.......- =+.|...+.++-.|..|+++|+++. T Consensus 1 ma~~~lr~~rrpk~~p~~~rr~~ltaAsq~~~~p~~~~kt~~~~-~ar~~WQ~eAW~~~d~v~Elry~vgW~~~s~sr~r 79 (639) T protein:vir:10 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMG-TARNEWQSEAWDFSESIGELSYYVSWRANSCSRTT 79 (639) T ss_pred CCccceeeeecCCCCCcchhhHHHhhhhhccCCcccchhhhccc-cchhhhhhhhhhhhhhhhhHHHHhhhhhhhhceee Confidence 43332221 2222222222111100 00 000001 111111111 1345556888999999999999999 Q ss_pred eEEEEecCCcc-----ee-ccc---hHHHH-HHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEE-EcCCCc------eEE Q lcl|NC_021305. 
68 VKCMFTSGDTE-----TE-ESD---TGYAK-LLADPCEYLDPFAFWEWVASTLDIYGETYLAIQ-KNKSGT------PEK 130 (518) Q Consensus 68 ~~v~~~~~~~~-----~~-~~~---~~~~~-L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~-r~~~G~------~~~ 130 (518) +..-+.+.+.- .. ..+ +.... ...--..-+...++++.+..++-+-|++|+.++ +..++. +.. T Consensus 80 L~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~~~~~~~~~ 159 (639) T protein:vir:10 80 LIPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRA 159 (639) T ss_pred eEeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecCccccCccccccc Confidence 98877664332 11 111 11111 112233456778899999999999999998765 333332 233 Q ss_pred EE-eeCCceeEEEEcCCceeeEEeeecccccCceeEEeccccEEEEeccCC--CCcccCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 131 LM-PMHPSRVAIKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNP--DGLERGLSLMESLKSTIFSEDSSRNAT 207 (518) Q Consensus 131 l~-~l~p~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~--~~~~~G~s~l~~~~~~i~~~~~~~~~~ 207 (518) -| .+..+.|. . ..++....... . +...+|..+.=+.||..+| ..-.+--||+.+++..+.......+.. T Consensus 160 ~W~vvs~~Ei~--~-~~~~~~~i~lP----d-G~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i 231 (639) T protein:vir:10 160 RWYAVTREEIK--S-KAGETAEISLP----D-GKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKI 231 (639) T ss_pred ceeeeeHHHhc--c-cCCCeeEeecC----C-CCCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHHHhhhHH Confidence 33 33333332 1 11111111111 1 2222333333333555444 334467899999999999888888888 Q ss_pred HHHHHccCCcccccccCccCCH-------------------------HHHHHHHHHHHHH----hcCcc--ccCCeeecC Q lcl|NC_021305. 208 AAMWKNAGRPNLVLRHEKRLSE-------------------------AAQQRLREQFDRA----HSGSS--NTGKTMVVE 256 (518) Q Consensus 208 ~~~~~ng~~p~~il~~~~~~~~-------------------------~~~~~~~~~~~~~----~~g~~--n~g~~~vl~ 256 (518) .+..+...+..|||-+|..++- ...+.|.+.|.+. +...+ .+--++|+. 
T Consensus 232 ~aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vPiia~ 311 (639) T protein:vir:10 232 KNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVAS 311 (639) T ss_pred HHHHHHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccceeeeeEe Confidence 7777777777777766543221 1234455544332 22111 111223322 Q ss_pred ------CCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHH-hccccccccCCHHHHHHHHHHHHhhHHHHHHHHHH Q lcl|NC_021305. 257 ------EGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPI-VHILDRATFSNISAQMRAFYRDTMAIPIARIQSAM 329 (518) Q Consensus 257 ------~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l 329 (518) ++++...+... -+.--+++++..+..+|....|||+. +|+ +++|.=+.=+-...-++..|.|.+..|+++| T Consensus 312 ~p~E~l~~ikhl~f~~e-i~e~aiktR~daI~RlA~glDi~pE~LLGl-~d~NHWsAWqI~dedvrlHI~P~l~~icdAl 389 (639) T protein:vir:10 312 VAAEHLEKVQHIKFGNE-VTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIKPVMDLICQAI 389 (639) T ss_pred echHHhcCeeeeeecCc-hhHHHHhhHHHHHHHHHhccCCchhheeec-ccccceEEEEecccceeeecchhHHHHHHHH Confidence 22333333333 33345789999999999999999875 565 5555433333444556678999999999999 Q ss_pred HHhhhhhh----c---ccccceecchhhhh-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcc---------- Q lcl|NC_021305. 330 DKYVGQYW----V---RKNRMKFDIDDVIQ-PDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKAD---------- 391 (518) Q Consensus 330 ~~~l~~~~----~---~~~~~~fd~~~l~~-~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD---------- 391 (518) ++.+|.+. + ..|-+.||.+.|.. 
.|..+ -...+...|.+|-.-.|+.+|+..-+ +=| T Consensus 390 T~~~Lrp~Le~eGvDp~kYvvW~DaS~Lt~dPd~~d---eA~qa~drGAIt~eAlR~~lG~~edd--~yd~~t~e~~~~~ 464 (639) T protein:vir:10 390 YNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSD---EAVEAHDRGAITSAALRRLLNVGEDS--GYDLTTLDGCREF 464 (639) T ss_pred HhhHHHHHHHHhCCCHHHhEeeecCcccccCCCCcH---HHHHHHHcCCccHHHHHHHhcccccc--CCCCCCcHHHHHH Confidence 99877432 1 23567899888843 33333 34567789999999999999986442 112 Q ss_pred ---eeeecccc----cccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcccC Q lcl|NC_021305. 392 ---ELYANSAL----QPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPP 464 (518) Q Consensus 392 ---~~~~~~n~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~ 464 (518) .+-.+.++ .|+....-+..+..+.+....+++.+ +.+++.+.+..+.++++++........ +.. T Consensus 465 A~~~V~~~P~li~~~apl~~P~lq~~e~ptp~~a~~~a~~~--~~~de~~ga~~~~ePdte~~~~~~~a~-------~~~ 535 (639) T protein:vir:10 465 AADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTRED--EEDDEDSGARQQREPQTEDERSTEEAA-------SLN 535 (639) T ss_pred HHHHhcCCcchhhhhhhccCccceecccCCCCCCCCCCCCC--CCcccccCCCCCcCCCcccccCCcccc-------CcC Confidence 11111111 12211111111111111111122221 222222233333344443322111110 000 Q ss_pred CchhhH-HHHHHHHHhhc-ccc---CcCchhHHHHHHHHHHHhHHH-Hhhhhh---hhcccCC Q lcl|NC_021305. 465 PKESSP-KHLRAVKGAMG-RGK---DIKGFALQLAEKYPDDLEDIL-LAVQLA---LAERKDN 518 (518) Q Consensus 465 ~~~~~~-~~~~~~~~~~~-~~~---~~~~~~~~~~~~~~~~~~~~~-~~~~~~---~~~~~~~ 518 (518) .-+.+- ...-.+..+|. -|| ...+-..+ .++++.-.... ..+-|. 
-+.|--| T Consensus 536 ~~a~~v~a~~llv~RALelAGkRr~~~~~r~~~--a~~r~vp~he~H~~l~Pv~~~~~~rli~ 596 (639) T protein:vir:10 536 DRAAYLVAERLLVNRALDLAGKRRFKVNDAALK--TKLRDVPAHEYHRVLPPVRSSEIPRLIA 596 (639) T ss_pred chhHHHHHHHHHHHHHHHhhcccccCCCChhhH--HHhhcCChhHceeecCCCChHHHHHHHH Confidence 001110 00111223331 111 11111111 22221110000 000000 0000000 No 154 >protein:vir:97900 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655104;genbank:gi:109391854;genbank:GeneID:4157263 Probab=98.93 E-value=3.1e-09 Score=67.19 Aligned_cols=491 Identities=14% Similarity=0.102 Sum_probs=226.1 Q ss_pred CcCCCCCC-------CCcccccccchhhhhh----hc-ccccccccccccchhh-hHHHhhcHHHHHHHHHHHHhhccCc Q lcl|NC_021305. 1 MLLANGQT-------LSAPAMAELSPQMQDS----YY-YAPAVGMQLERQFSLY-GGIYKNQPWVRTVIAKRAQALARLP 67 (518) Q Consensus 1 ~~f~~~~~-------~~~~~~~~~~~~~~~~----~~-~~~~~~~~~~~~~~~~-~~~~~~~~~v~~~v~~ia~~ia~l~ 67 (518) |-=...+. ++.+.++.++-.-+.. -. ..+..+ +.......- =+.|...+.++-.|..|+++|+++. T Consensus 1 ma~~~lr~~rrpk~~p~~~rr~~ltaAsq~~~~p~~~~kt~~~~-~ar~~WQ~eAW~~~d~v~Elry~vgW~~~s~sr~r 79 (639) T protein:vir:97 1 MAATSLRVVRRPKGSAPAARRRSLTAASQLITDPQKQMKTSLMG-TARNEWQSEAWDFSESIGELSYYVSWRANSCSRTT 79 (639) T ss_pred CCccceeeeecCCCCCcchhhHHHhhhhhccCCcccchhhhccc-cchhhhhhhhhhhhhhhhhHHHHhhhhhhhhceee Confidence 43332221 2222222222111100 00 000001 111111111 1345556888999999999999999 Q ss_pred eEEEEecCCcc-----ee-ccc---hHHHH-HHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEE-EcCCCc------eEE Q lcl|NC_021305. 68 VKCMFTSGDTE-----TE-ESD---TGYAK-LLADPCEYLDPFAFWEWVASTLDIYGETYLAIQ-KNKSGT------PEK 130 (518) Q Consensus 68 ~~v~~~~~~~~-----~~-~~~---~~~~~-L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~-r~~~G~------~~~ 130 (518) +..-+.+.+.- .. ..+ +.... ...--..-+...++++.+..++-+-|++|+.++ +..++. +.. 
T Consensus 80 L~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~k~~~~~~~~~~~ 159 (639) T protein:vir:97 80 LIPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQEKDPVTGLAAPRA 159 (639) T ss_pred eEeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecCccccCccccccc Confidence 98877664332 11 111 11111 112233456778899999999999999998765 333332 233 Q ss_pred EE-eeCCceeEEEEcCCceeeEEeeecccccCceeEEeccccEEEEeccCC--CCcccCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 131 LM-PMHPSRVAIKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNP--DGLERGLSLMESLKSTIFSEDSSRNAT 207 (518) Q Consensus 131 l~-~l~p~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~--~~~~~G~s~l~~~~~~i~~~~~~~~~~ 207 (518) -| .+..+.|. . ..++....... . +...+|..+.=+.||..+| ..-.+--||+.+++..+.......+.. T Consensus 160 ~W~vvs~~Ei~--~-~~~~~~~i~lP----d-G~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~~~t~~i 231 (639) T protein:vir:97 160 RWYAVTREEIK--S-KAGETAEISLP----D-GKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIERTTRKI 231 (639) T ss_pred ceeeeeHHHhc--c-cCCCeeEeecC----C-CCCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHHHhhhHH Confidence 33 33333332 1 11111111111 1 2222333333333555444 334467899999999999888888888 Q ss_pred HHHHHccCCcccccccCccCCH-------------------------HHHHHHHHHHHHH----hcCcc--ccCCeeecC Q lcl|NC_021305. 208 AAMWKNAGRPNLVLRHEKRLSE-------------------------AAQQRLREQFDRA----HSGSS--NTGKTMVVE 256 (518) Q Consensus 208 ~~~~~ng~~p~~il~~~~~~~~-------------------------~~~~~~~~~~~~~----~~g~~--n~g~~~vl~ 256 (518) .+..+...+..|||-+|..++- ...+.|.+.|.+. +...+ .+--++|+. 
T Consensus 232 ~aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~vPiia~ 311 (639) T protein:vir:97 232 KNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAYIPLVAS 311 (639) T ss_pred HHHHHHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccceeeeeEe Confidence 7777777777777766543221 1234455544332 22111 111223322 Q ss_pred ------CCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHH-hccccccccCCHHHHHHHHHHHHhhHHHHHHHHHH Q lcl|NC_021305. 257 ------EGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPI-VHILDRATFSNISAQMRAFYRDTMAIPIARIQSAM 329 (518) Q Consensus 257 ------~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l 329 (518) ++++...+... -+.--+++++..+..+|....|||+. +|+ +++|.=+.=+-...-++..|.|.+..|+++| T Consensus 312 ~p~E~l~~ikhl~f~~e-i~e~aiktR~daI~RlA~glDi~pE~LLGl-~d~NHWsAWqI~dedvrlHI~P~l~~icdAl 389 (639) T protein:vir:97 312 VAAEHLEKVQHIKFGNE-VTEVEIKTRIDAITRLAMGLDVSPERLLGM-SKGNHWSAWAIGDEDVQLHIKPVMDLICQAI 389 (639) T ss_pred echHHhcCeeeeeecCc-hhHHHHhhHHHHHHHHHhccCCchhheeec-ccccceEEEEecccceeeecchhHHHHHHHH Confidence 22333333333 33345789999999999999999875 565 5555433333444556678999999999999 Q ss_pred HHhhhhhh----c---ccccceecchhhhh-cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcc---------- Q lcl|NC_021305. 330 DKYVGQYW----V---RKNRMKFDIDDVIQ-PDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKAD---------- 391 (518) Q Consensus 330 ~~~l~~~~----~---~~~~~~fd~~~l~~-~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD---------- 391 (518) ++.+|.+. + ..|-+.||.+.|.. 
.|..+ -...+...|.+|-.-.|+.+|+..-+ +=| T Consensus 390 T~~~Lrp~Le~eGvDp~kYvvW~DaS~Lt~dPd~~d---eA~qa~drGAIt~eAlR~~lG~~edd--~yd~~t~e~~~~~ 464 (639) T protein:vir:97 390 YNDILTPLLAREGIDPTKYILWYDASGLTSDPDLSD---EAVEAHDRGAITSAALRRLLNVGEDS--GYDLTTLDGCREF 464 (639) T ss_pred HhhHHHHHHHHhCCCHHHhEeeecCcccccCCCCcH---HHHHHHHcCCccHHHHHHHhcccccc--CCCCCCcHHHHHH Confidence 99877432 1 23567899888843 33333 34567789999999999999986442 112 Q ss_pred ---eeeecccc----cccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcccC Q lcl|NC_021305. 392 ---ELYANSAL----QPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPP 464 (518) Q Consensus 392 ---~~~~~~n~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~ 464 (518) .+-.+.++ .|+....-+..+..+.+....+++.+ +.+++.+.+..+.++++++........ +.. T Consensus 465 A~~~V~~~P~li~~~apl~~P~lq~~e~ptp~~a~~~a~~~--~~~de~~ga~~~~ePdte~~~~~~~a~-------~~~ 535 (639) T protein:vir:97 465 AADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTRED--EEDDEDSGARQQREPQTEDERSTEEAA-------SLN 535 (639) T ss_pred HHHHhcCCcchhhhhhhccCccceecccCCCCCCCCCCCCC--CCcccccCCCCCcCCCcccccCCcccc-------CcC Confidence 11111111 12211111111111111111122221 222222233333344443322111110 000 Q ss_pred CchhhH-HHHHHHHHhhc-ccc---CcCchhHHHHHHHHHHHhHHH-Hhhhhh---hhcccCC Q lcl|NC_021305. 465 PKESSP-KHLRAVKGAMG-RGK---DIKGFALQLAEKYPDDLEDIL-LAVQLA---LAERKDN 518 (518) Q Consensus 465 ~~~~~~-~~~~~~~~~~~-~~~---~~~~~~~~~~~~~~~~~~~~~-~~~~~~---~~~~~~~ 518 (518) .-+.+- ...-.+..+|. -|| ...+-..+ .++++.-.... ..+-|. 
-+.|--| T Consensus 536 ~~a~~v~a~~llv~RALelAGkRr~~~~~r~~~--a~~r~vp~he~H~~l~Pv~~~~~~rli~ 596 (639) T protein:vir:97 536 DRAAYLVAERLLVNRALDLAGKRRFKVNDAALK--TKLRDVPAHEYHRVLPPVRSSEIPRLIA 596 (639) T ss_pred chhHHHHHHHHHHHHHHHhhcccccCCCChhhH--HHhhcCChhHceeecCCCChHHHHHHHH Confidence 001110 00111223331 111 11111111 22221110000 000000 0000000 No 155 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=98.93 E-value=3.9e-09 Score=66.70 Aligned_cols=406 Identities=14% Similarity=0.140 Sum_probs=163.0 Q ss_pred CCCCCCCCcccccccchhhhh----------------hhccc--ccccccccccchhhhHHHhhcHHHHHHHHHHHHhhc Q lcl|NC_021305. 3 LANGQTLSAPAMAELSPQMQD----------------SYYYA--PAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALA 64 (518) Q Consensus 3 f~~~~~~~~~~~~~~~~~~~~----------------~~~~~--~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia 64 (518) ........ ...|+.. .+.-+ .....+.. .....+..-....+...+|+.+++.+- T Consensus 1 ~~~~~~~d------~~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~-~~~~~~~~~~~~n~~~~ivd~~a~~l~ 73 (488) T protein:vir:23 1 MAETESID------PEKLRDQLLDAFENKQNELKSSKAYYDAERRPDAIGLA-VPLDMRKYLAHVGYPRTYVDAIAERQE 73 (488) T ss_pred CCcccCCC------HHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhcCcc-cchhhhhhhhhcchHHHHHHHHHHhhh Confidence 00000000 0001110 00000 00000000 011111222334556677787777665 Q ss_pred cCceEEEEecCCcce----eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCc-------eEEEEe Q lcl|NC_021305. 65 RLPVKCMFTSGDTET----EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGT-------PEKLMP 133 (518) Q Consensus 65 ~l~~~v~~~~~~~~~----~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~-------~~~l~~ 133 (518) --+|.+-........ ......+..++.+ | +.......+..+++.+|.+|+.+.++.... 
...+.+ T Consensus 74 ~~Gf~~~~~~~~~~~~~~d~~~~~~l~~i~~~-N---~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~~i~~ 149 (488) T protein:vir:23 74 LEGFRIPSANGEEPESGGENDPASELWDWWQA-N---NLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVPLIRV 149 (488) T ss_pred ccceeccCCcccccccccchhHHHHHHHHHHh-c---ChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcceEEE Confidence 445554221111000 1111122333321 2 456677778899999999999887643211 124667 Q ss_pred eCCceeEEEEcCCceeeEEee--ecccccC--ceeEEeccc-------------------------cEEEEeccCCCCcc Q lcl|NC_021305. 134 MHPSRVAIKRNSRTGRYEYYF--QAGAGVG--TQLVSFADD-------------------------EVVPIRFFNPDGLE 184 (518) Q Consensus 134 l~p~~v~v~~~~~~~~~~~~~--~~~~~~~--~~~~~~~~~-------------------------evih~~~~~~~~~~ 184 (518) ++|..+.+.++.......+.+ .+....+ .....|.++ .|++|+++...+.. T Consensus 150 ~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~ 229 (488) T protein:vir:23 150 EPPTALYAEVDPRTRKVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEWEAPTSTPHGLEMVPVIPISNRTRLSDL 229 (488) T ss_pred eccceeEEEEecCCCceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecCCceEeccccccCCCCcceEEeccccccCCc Confidence 888888877765332221111 1110000 001112222 34555544333445 Q ss_pred cCchHHHH-HHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHH--HHHHHHHHHHhcCccccCCeeecCCC--c Q lcl|NC_021305. 185 RGLSLMES-LKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQ--QRLREQFDRAHSGSSNTGKTMVVEEG--M 259 (518) Q Consensus 185 ~G~s~l~~-~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~--~~~~~~~~~~~~g~~n~g~~~vl~~g--~ 259 (518) +|.|-+.- +...+.....+..-......-.+.|.-++.- ...++... +.-...|+. ..+++..+++| . 
T Consensus 230 ~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G-~~~~~~~~~~~~~~~~~~~------~~~~v~~~~~g~~~ 302 (488) T protein:vir:23 230 YGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFG-AKPEELGINAETGQRMFDA------YMARILAFEGGEGA 302 (488) T ss_pred CCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhC-CCcccccccccccchhhhh------hhhhhccCCCCCCc Confidence 67776542 2222222222221111212222233333321 00111000 000111211 12346666665 4 Q ss_pred ceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHHHHH----HHHHhh-- Q lcl|NC_021305. 260 EPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQS----AMDKYV-- 333 (518) Q Consensus 260 ~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~----~l~~~l-- 333 (518) ++.++..... -.+.+.++..+..|+..-++|++.+|.... |.++.++. .+....+.-.+...+. .+.+.+ T Consensus 303 ~~~q~~~~~~-~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~-n~~Sg~Al--~~~~~~l~~k~~~~~~~f~~~l~~~~~l 378 (488) T protein:vir:23 303 HAEQFSAAEL-RNFVDALDALDRKAASYSGLPPQYLSSSSD-NPASAEAI--KAAESRLVKKVERKNKIFGGAWEQAMRL 378 (488) T ss_pred eeEecCCCCh-HHHHHHHHHHHHHHhcccCCCHHHhccccC-cchHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5655554332 237788888888999999999998875432 22222221 1111122111122111 111111 Q ss_pred hhhh-c------ccccceecchhhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceeeecccc---cc Q lcl|NC_021305. 334 GQYW-V------RKNRMKFDIDDVIQPDWEAKSESTQKMVNSG--VATPNEGREIMGLPRSDDPKADELYANSAL---QP 401 (518) Q Consensus 334 ~~~~-~------~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~---~~ 401 (518) +... + ....+++.+.+....+..+.++.+.+++++| +++..-+++++|+-+.+....+...--... 
.- T Consensus 379 ~~~~~~~~~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~~~~~~~~~~~~~~~~~~~ 458 (488) T protein:vir:23 379 AYKMVKGGDIPTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVEREQMRQWLEQDQKQGLGL 458 (488) T ss_pred HHHHhcCCCcchhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchHHHHHHHHHHHHHHHHHHH Confidence 0000 1 1124556666777888999999999999876 788888889998754321111110000000 00 Q ss_pred cccccccCCCCCCCCCCCCCccCCCCCC-CccccCC Q lcl|NC_021305. 402 LGATPDGAVEWEEAPAPKRPASTPVASL-DQSPPTS 436 (518) Q Consensus 402 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 436 (518) ++...+.. ++ ...++.++.++. +..+..+ T Consensus 459 ~~~~~~~~---~~---~~~~~~~~~~~~~~~e~~~a 488 (488) T protein:vir:23 459 IGSLYGAS---TP---EGKPGEAPVGEPPAPEPDAA 488 (488) T ss_pred HHHHhccC---CC---cccCCCCCCCCCCCCCCCCC Confidence 00000000 00 000111111111 1111111 No 156 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=98.90 E-value=9.7e-09 Score=64.50 Aligned_cols=392 Identities=11% Similarity=0.015 Sum_probs=179.3 Q ss_pred CcCCCCCCCC---cccccccch----hhhhhhccccccccc--cc-----ccchhhhHHHhhcHHHHHHHHHHHHhhccC Q lcl|NC_021305. 1 MLLANGQTLS---APAMAELSP----QMQDSYYYAPAVGMQ--LE-----RQFSLYGGIYKNQPWVRTVIAKRAQALARL 66 (518) Q Consensus 1 ~~f~~~~~~~---~~~~~~~~~----~~~~~~~~~~~~~~~--~~-----~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l 66 (518) +||-.+.... .+.. ..++ .+.... .++.|-. .. 
...............-..+++..|+-+..- T Consensus 15 ~~~~~~~~~~~~~~~~~-~~~~~~~~~i~~~~--~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k~i~~~~a~~l~~~ 91 (496) T protein:vir:38 15 RMGLLKALKDVKDHKKV-NANDEDYKYIDMWK--RLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPKVTAKYMSKLLFNE 91 (496) T ss_pred HhccchhhHHHHhcCCC-cCCHHHHHHHHHHH--HHhcCCCchhhcchhccCCCccccceeecchHHHHHHHHhhhhhCC Confidence 4443221100 0000 0011 111100 1111100 00 000000001122344567888888888777 Q ss_pred ceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCC Q lcl|NC_021305. 67 PVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSR 146 (518) Q Consensus 67 ~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~ 146 (518) |..+--.+ ......+..++. .-....-...++.+.+.+|.+|+.+..|.+|.+ .+..++|..+.+..... T Consensus 92 p~~i~~~d-----~~~~e~l~~~~~----~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~-~i~~v~~~~~~P~~~~~ 161 (496) T protein:vir:38 92 KVKINIDD-----KAAEEFVLNVLK----TNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNV-KVSFATADCMYPLSNDS 161 (496) T ss_pred cceEeeCC-----hHHHHHHHHHHh----ccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcE-EEEEEcccceEEEEecC Confidence 77653211 111112222222 234556667788899999999999999888764 56677888776644433 Q ss_pred ceee-------------EE-ee----------------e-ccc-ccCceeEE-------------e---ccccEEEEecc Q lcl|NC_021305. 147 TGRY-------------EY-YF----------------Q-AGA-GVGTQLVS-------------F---ADDEVVPIRFF 178 (518) Q Consensus 147 ~~~~-------------~~-~~----------------~-~~~-~~~~~~~~-------------~---~~~evih~~~~ 178 (518) +... .| .+ + ... ...+..+. 
+ ..--+.||+.+ T Consensus 162 ~~~~~~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~~~~~~~~~f~~~~~~ 241 (496) T protein:vir:38 162 ENVDECVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPVVPLPDFTRPTFIYIKPN 241 (496) T ss_pred CcEEEEEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCccccccccccccccceeecCCCcceEEEecCC Confidence 3221 00 00 0 000 00000010 0 11113344433 Q ss_pred CC----CCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCccccc-----ccCccCCHHHHHHHHHHHHHHhcCcccc Q lcl|NC_021305. 179 NP----DGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVL-----RHEKRLSEAAQQRLREQFDRAHSGSSNT 249 (518) Q Consensus 179 ~~----~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il-----~~~~~~~~~~~~~~~~~~~~~~~g~~n~ 249 (518) -+ ....+|+|.+..+...+.....+..-..+-|.. +.+..++ ......+.+... .|.. .... T Consensus 242 ~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~-~~~~i~v~~~~l~~~~~~~g~~~~----~~~~----~~~~ 312 (496) T protein:vir:38 242 IANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKL-GKKKVLVPSSFVKTAVNLDGSTTQ----YFDS----TDEA 312 (496) T ss_pred cccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhh-cccceecchHHhhccCCCCCcccc----CCCC----ccce Confidence 22 233579999999998888887665555555654 3444333 111111100000 0000 0000 Q ss_pred CCee---ecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHH---HHHH---------- Q lcl|NC_021305. 250 GKTM---VVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQ---MRAF---------- 313 (518) Q Consensus 250 g~~~---vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~---~~~~---------- 313 (518) .... -.+++..++.+......-++.+..+...+.|+...|+||..+|...++. .+..+. .... 
T Consensus 313 ~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~-~tAtei~~~~~~l~~~~~~~~~~ 391 (496) T protein:vir:38 313 FFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGL-KTATEVVSEKSETYQTKNSHSQL 391 (496) T ss_pred EEEeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCcccc-chHHHHHHHHHHHHHHHHHHHHH Confidence 0011 1123345666666666777888899999999999999999998765443 222221 1111 Q ss_pred HHHHhhHHHHHHHHHHHHhhhh--hhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCc Q lcl|NC_021305. 314 YRDTMAIPIARIQSAMDKYVGQ--YWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIM-GLPRSDDPKA 390 (518) Q Consensus 314 ~~~~l~P~~~~ie~~l~~~l~~--~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~-g~~p~~~~~g 390 (518) +..++..++..+....+..... .......+.|.++.-+..|..+.++.+.+++.+|++|.-.++..+ |.+ ++.. T Consensus 392 ~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS~et~l~~~~~~~---d~ea 468 (496) T protein:vir:38 392 IEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIPLKIALQRAWNIT---EAEA 468 (496) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCC---hHHH Confidence 2223333333332222111110 111234567777788889999999999999999999988887654 432 2222 Q ss_pred ceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCC Q lcl|NC_021305. 391 DELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLD 430 (518) Q Consensus 391 D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 430 (518) ++.+ ..+. . 
+++..-+..+.....++++ T Consensus 469 ~~el-----~ri~---~----E~~~~~~~~d~~~~~~~~e 496 (496) T protein:vir:38 469 DEWA-----EMLA---K----EKQAEMPNNDMNGIFGEEE 496 (496) T ss_pred HHHH-----HHHH---H----hhhccCccccccCCCCCCC Confidence 1110 0110 0 0000001000011111111 No 157 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=98.89 E-value=1.9e-08 Score=62.90 Aligned_cols=416 Identities=9% Similarity=0.047 Sum_probs=178.8 Q ss_pred CcCCC---CCCCCcccccccchhhhhhhcccccc-cccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_021305. 1 MLLAN---GQTLSAPAMAELSPQMQDSYYYAPAV-GMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGD 76 (518) Q Consensus 1 ~~f~~---~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~ 76 (518) +|-.. -+....+....+..+.. |..... ........ .....-...+.....|+..+.-+-.-|+.+--.++. T Consensus 44 ~i~~~i~~h~~~~~~rl~~l~~yY~---g~~~~i~~~~~~~~~-~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~ 119 (502) T protein:vir:48 44 LLKNFINHHKLRQAPRIQELLDYAR---GENHDVLKSGRRKDN-EMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNE 119 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc---CCCcccccccccccc-ccccceeecchHHHHHHHHhhhhcccCeeEecCCcc Confidence 00000 00000011011111110 000000 00000000 000001223556778888888888888877433221 Q ss_pred cceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCc--eeeE-Ee Q lcl|NC_021305. 77 TETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT--GRYE-YY 153 (518) Q Consensus 77 ~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~--~~~~-~~ 153 (518) . ......++.+....-........+..+++.+|.+|+.+.++.+|.+ .+..++|..+.++.+... .... .. 
T Consensus 120 ~-----~~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~-~i~~~~p~~~~~vydd~~~~~~~~~ir 193 (502) T protein:vir:48 120 D-----NSQNDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYDET-RIKRLSPLETFVIYDNSLEDNSIAAVR 193 (502) T ss_pred c-----hhHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEE Confidence 1 1112222222222335666788889999999999999999888864 567789999988776532 1111 11 Q ss_pred eec-ccc-cCce-eEEeccccEEEEecc----------CC---------CCcccCchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 154 FQA-GAG-VGTQ-LVSFADDEVVPIRFF----------NP---------DGLERGLSLMESLKSTIFSEDSSRNATAAMW 211 (518) Q Consensus 154 ~~~-~~~-~~~~-~~~~~~~evih~~~~----------~~---------~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~ 211 (518) ++. ... .... ...+.++.++++... ++ .....|.|.+..+...+.....+..-..+.+ T Consensus 194 ~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~ 273 (502) T protein:vir:48 194 YYNRGTLQNAKDVVEIYTNQHIYTLDASDSFNEISVTPHAFGTVPITEFLNNADGIGDYETELYLIDLYDSAESDTANHM 273 (502) T ss_pred EEEEeecCCcEEEEEEEeCCeEEEEEeCCceeeccceecCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHH Confidence 111 111 1111 112333333333211 00 0123588888888877777776666666666 Q ss_pred HccCCcccccccCccC-CHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021305. 212 KNAGRPNLVLRHEKRL-SEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDI 290 (518) Q Consensus 212 ~ng~~p~~il~~~~~~-~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgV 290 (518) .....|-.++.-.... .++....+++.. 
..+ ....+..-..+.+.++..+..+.....+....+.+.+.|+..-++ T Consensus 274 ~~~~~~~lv~~g~~~~~~~~~~~~~~~~~-~~~--~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~ 350 (502) T protein:vir:48 274 SDMADAILAIYGDLALPQGMQASDMKRTR-LMQ--LKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNT 350 (502) T ss_pred HHhcCceeeeecCcccccccchhhhhhcc-eee--ccccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCC Confidence 6666666555432222 122222222110 000 000000011223445555555444455667788889999999999 Q ss_pred CHHHhccccccccCCHHHHH-------------HHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHH Q lcl|NC_021305. 291 APPIVHILDRATFSNISAQM-------------RAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEA 357 (518) Q Consensus 291 Pp~~lg~~~~~~~sn~e~~~-------------~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~ 357 (518) |..-.+... ++ .+.++.. ...+...+.-.+..+...++..--........+++.+.+.+..|..+ T Consensus 351 p~~~~~~~~-~n-~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e 428 (502) T protein:vir:48 351 PDMSDNHFS-GN-ASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESRLKITFTPNLPKSLYE 428 (502) T ss_pred CCcCccccc-cC-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHH Confidence 965443221 12 2222211 12222233333333322222111001111224566777888999999 Q ss_pred HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCc Q lcl|NC_021305. 358 KSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSV 437 (518) Q Consensus 358 ~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (518) .++.+.++ .|+++..-+.+.+++- +++.. + +..+..-.. .......+...........++. 
T Consensus 429 ~a~~~~kl--~g~iS~et~l~~l~~v--~D~~~-E------~~ri~~E~~-~~~~~~~~~~~~~~~~~~~d~~------- 489 (502) T protein:vir:48 429 QVSILNDL--GGQVSQETALSLSGLV--ENPTE-E------LDKINEESS-KIDFKGYPSYFYDNVGKYTDEV------- 489 (502) T ss_pred HHHHHHHH--hccCcHHHHHHhCCCC--CCHHH-H------HHHHHHHHH-hhhhhcccccccccccccCCCc------- Confidence 99999988 5889988888887653 22211 1 111100000 0000000000000000000000 Q ss_pred cccccchhcchhh Q lcl|NC_021305. 438 PGLSPTNSDRSTD 450 (518) Q Consensus 438 ~~~~~~~~~~~~~ 450 (518) ++..++...++.+ T Consensus 490 ~e~~~~~~~~~~~ 502 (502) T protein:vir:48 490 KETHTDDFERVYE 502 (502) T ss_pred cCCCCcCcCCCCC Confidence 0000000001101 No 158 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=98.88 E-value=1.4e-08 Score=63.58 Aligned_cols=408 Identities=10% Similarity=0.040 Sum_probs=168.8 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchh--hhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSL--YGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) =|+..-. ...+....+..+.. |............... .......+.+...+|+.++..+--.+|.+ .++ T Consensus 21 ~l~~~~~-~~~~r~~~~~~YY~---g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~gf~~---~d~-- 91 (479) T protein:vir:99 21 KVFPKMN-TECERLDDFEAWTK---NGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFAQQLIVDGYRK---TGT-- 91 (479) T ss_pred HHHHHHH-HHhHHHHHHHHHHh---cCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHHhhcccccccC---CCc-- Confidence 0111000 00111111111110 0000000000000000 00111123455677887776553333322 211 Q ss_pred eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEE-----cCCCceEEEEeeCCceeEEEEcCCceee--E Q lcl|NC_021305. 
79 TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQK-----NKSGTPEKLMPMHPSRVAIKRNSRTGRY--E 151 (518) Q Consensus 79 ~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r-----~~~G~~~~l~~l~p~~v~v~~~~~~~~~--~ 151 (518) . ....+..++.. | +.......+..+++.+|.+|+++-. +..|. ..+..++|..+.+.++...... . T Consensus 92 -~-~~~~~~~i~~~-N---~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~-~~i~~~~p~~~~~iydd~~~~~~~~ 164 (479) T protein:vir:99 92 -N-ENAKGWDTWRL-N---QMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTV-ARIKCIDPRDAFAIWEDPYWDEWPK 164 (479) T ss_pred -h-hhHHHHHHHHh-c---ChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCc-eEEEEechhheEEEecCCcccceee Confidence 1 12223344443 3 2335667788899999999988764 33343 3566778888887765433221 1 Q ss_pred Eeeecccc----------------cCceeE-------EeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 152 YYFQAGAG----------------VGTQLV-------SFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATA 208 (518) Q Consensus 152 ~~~~~~~~----------------~~~~~~-------~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~ 208 (518) |.+..... .++... .+..=.|++|.++... ..+|.|-+..+...+.....+..-.. T Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~-~~~g~sd~e~v~~liDa~~~~~s~~~ 243 (479) T protein:vir:99 165 YLLERQPNGQYWWWTEEDYSIFEFKQGKFIYRETVSHDYGHIPFVRYVNVMDL-RGVCYGDVEPLVTVAKAIDKTGLDIL 243 (479) T ss_pred EEEeecCceeEEEEecceEEEEEecCCceeeccccccCCCCcceEEeecCCCc-CcCCcchhHHHHHHHHHHHHHHHHHH Confidence 11111000 000000 0122245666644322 24688988877777777666555544 Q ss_pred HHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeee-cCCCcceeeccCChhhHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 209 AMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMV-VEEGMEPIPLQLTAVEMQFIEARQLNREEVCGV 287 (518) Q Consensus 209 ~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~v-l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~ 287 (518) ..+.-.+.|..++.--. ..+... .-...|.. ..++++. -+++.++.++.... -..+.+.++....+|+.. 
T Consensus 244 ~~~~~~a~p~~~i~G~~-~~~~~~-~~~~~~~~------~~~~i~~~~~~~~~~~q~~~~~-~~~~~~~l~~~i~~i~~~ 314 (479) T protein:vir:99 244 LVQHHQSFQIRWATGLM-LPEGAN-ADQEKMRF------AQESMLISQNEKASFGAIPAAP-LDGLLNAYKESLLEFLAL 314 (479) T ss_pred HHHHHhhchhhhhcCCC-cccccc-cchhcccc------ccccceeecCCCceEEEecccc-hHHHHHHHHHHHHHHhcc Confidence 55555555655443211 111100 00011111 1123333 35667776665322 223667777788889999 Q ss_pred hcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHHHHH----HHHHhh---hhhh-----cccccceecchhhhhcCH Q lcl|NC_021305. 288 YDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQS----AMDKYV---GQYW-----VRKNRMKFDIDDVIQPDW 355 (518) Q Consensus 288 fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~----~l~~~l---~~~~-----~~~~~~~fd~~~l~~~d~ 355 (518) -++|++.+|...+. +.++ ..+....+.-.+...+. .|.+.+ +.-. .....+++.|.+....+. T Consensus 315 t~~p~~~~g~~~n~---Sg~A--l~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~~~~~~~~~i~~~w~~~~~~s~ 389 (479) T protein:vir:99 315 AQLPPHIAGQIVNV---AADA--LAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGRTEEATDLDFTITWQDVTIQSL 389 (479) T ss_pred CCCCHHHcccccch---HHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeeeEEecCCCCCCH Confidence 99999999864332 2211 11222222222222111 111111 1100 111235556666667788 Q ss_pred HHHHHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCcceeeecc----cccc-cccccccCCCCCCCCCCCCCccCCCCCC Q lcl|NC_021305. 356 EAKSESTQKMVNSGVATPNEGREIM-GLPRSDDPKADELYANS----ALQP-LGATPDGAVEWEEAPAPKRPASTPVASL 429 (518) Q Consensus 356 ~~~~~~~~~~~~~G~~T~NE~R~~~-g~~p~~~~~gD~~~~~~----n~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (518) .+.++.+.+++++|+++...+.+++ |+.+-+ -+.+..-. .... ......+....++.+.+....+.+..++ T Consensus 390 ~~~ad~~~kl~~ag~is~et~l~~l~gv~~~~---~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 466 (479) T protein:vir:99 390 AQFADAWAKMVESLKIPAEGVWDMIPNLDQST---VNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNMQQANN 466 (479) T ss_pred HHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHH---HHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCCCCCCC Confidence 9999999999999999998888777 665421 11110000 0000 0000011110000000000000000000 Q ss_pred CccccCCccccccchhc Q lcl|NC_021305. 
430 DQSPPTSVPGLSPTNSD 446 (518) Q Consensus 430 ~~~~~~~~~~~~~~~~~ 446 (518) .. +.+...+.+.+ T Consensus 467 ~~----~~~~~~~~~~~ 479 (479) T protein:vir:99 467 KT----GEPASLNKSGA 479 (479) T ss_pred CC----cchhccCCCCC Confidence 00 00111111111 No 159 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=98.84 E-value=2.9e-08 Score=61.88 Aligned_cols=347 Identities=11% Similarity=0.036 Sum_probs=162.5 Q ss_pred CcCCCCCC---CCcccccccchhhhhhhcccc--cccccccccchhhhHHH-hhcHHHHHHHHHHHHhhccCceEEEEec Q lcl|NC_021305. 1 MLLANGQT---LSAPAMAELSPQMQDSYYYAP--AVGMQLERQFSLYGGIY-KNQPWVRTVIAKRAQALARLPVKCMFTS 74 (518) Q Consensus 1 ~~f~~~~~---~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~-~~~~~v~~~v~~ia~~ia~l~~~v~~~~ 74 (518) ++...... ...+.-..+ ..+.-+. ....+.. -.......+ +...+..-+|+.++..+.--.|.. T Consensus 4 ~~i~~L~~~~~~~~~r~~~~-----~~yY~g~~~~~~~~~~-~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~---- 73 (409) T protein:vir:94 4 KGIGYLRFKLSVHKRRAEMR-----YDQYAMKYVDRFKGIT-IPQALSQQYRSILGWCAKGVDSLADRLVFREFEN---- 73 (409) T ss_pred HHHHHHHHHHHHHhHHHHHH-----HHHhcccCchhhcChh-hhHHHHHHHhhhcchhHHHHHHhHhhcccCcccC---- Confidence 11111000 000000000 0010000 0000000 000111111 122345667777776544333321 Q ss_pred CCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEee Q lcl|NC_021305. 
75 GDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYF 154 (518) Q Consensus 75 ~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~ 154 (518) .+..+..++.+ | +.......+..+.+++|.+|+.+..+..|.+ .+.+++|..+.+.++.....+.+.+ T Consensus 74 -------~d~~l~~i~~~-N---~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~-~i~~~sp~~~~~i~D~~~~~~~~a~ 141 (409) T protein:vir:94 74 -------DDFTVNEIFEE-N---NPDIFFDSAVLSSLIASCSFTYISKGENDAV-RLQVIEAVNATGIIDPITGLLTEGY 141 (409) T ss_pred -------CchHHHHHHHh-c---ChhHHHHHHHHHHHHhcceeEEEecCCCCce-EEEEeccceEEEEEecCCCceeeeE Confidence 11223444433 2 2344556788899999999999999888875 6778999999888877554432222 Q ss_pred ec--ccccCce--eEEeccc----------------------cEEEEeccCCCCcccCchHH----HHHHHHHHHHHHHH Q lcl|NC_021305. 155 QA--GAGVGTQ--LVSFADD----------------------EVVPIRFFNPDGLERGLSLM----ESLKSTIFSEDSSR 204 (518) Q Consensus 155 ~~--~~~~~~~--~~~~~~~----------------------evih~~~~~~~~~~~G~s~l----~~~~~~i~~~~~~~ 204 (518) .+ ....+.. ...+.++ .|++|.+....+..+|.|.| ..+.+.+.....-. T Consensus 142 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~ 221 (409) T protein:vir:94 142 AVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERA 221 (409) T ss_pred EEEEecCCCceEEEEEEecCcEEEEEecCceeEeeeCCCCCcceEEeccccccccccCccccchhHHHHHHHHHHHHHHH Confidence 11 1111111 1112222 23444433223345787754 34444444444333 Q ss_pred HHHHHHHHccCCcc-cccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecC-----CCcceeeccCChhhHHHHHHHH Q lcl|NC_021305. 205 NATAAMWKNAGRPN-LVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVE-----EGMEPIPLQLTAVEMQFIEARQ 278 (518) Q Consensus 205 ~~~~~~~~ng~~p~-~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~-----~g~~~~~l~~~~~d~~~~e~~~ 278 (518) .....||.+ |. .++-.+...+ ..+. |+... 
++++.++ .+.++.++....-+ .|++.++ T Consensus 222 ~~~~e~~a~---pqr~i~G~d~d~~--~~~~----~~~~~------~~i~~~~~d~dg~~~~v~q~~~~~l~-~~~~~l~ 285 (409) T protein:vir:94 222 DVTAEFYSF---PQKYVTGLSDDAE--PMET----WKATV------SSMLQFTKDEDGDKPTLGQFTQPSMS-PFTEQLR 285 (409) T ss_pred HHHHHHhcC---hhheeEecCCCCc--ccch----hhhhH------HHhhcCCCCCCCCCceEEecCCCChh-HHHHHHH Confidence 444444443 33 3333322111 1222 32211 2233343 23556555433222 3889999 Q ss_pred HHHHHHHHHhcCCHHHhccccccccCCHHHH---HHHHHHHH---hhHHHHHHHHHHHHhhh--hhhc----ccccceec Q lcl|NC_021305. 279 LNREEVCGVYDIAPPIVHILDRATFSNISAQ---MRAFYRDT---MAIPIARIQSAMDKYVG--QYWV----RKNRMKFD 346 (518) Q Consensus 279 ~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~---~~~~~~~~---l~P~~~~ie~~l~~~l~--~~~~----~~~~~~fd 346 (518) ....++|+.-++|++.+|.... |.++.+.. ...+...+ -+-+-..+++.+...+. .... ....+++. T Consensus 286 ~~~~~~a~~t~lP~~~lg~~~~-NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~~~~~~~v~ 364 (409) T protein:vir:94 286 TAAAGFAGETGLTLDDLGFVSD-NPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPYLREQFRKTKPK 364 (409) T ss_pred HHHHHHhhhcCCCHHHhccccC-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccccccceEE Confidence 9999999999999999987543 32332221 11111111 11112222222211111 1100 11234555 Q ss_pred chhhhhcCH---HHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC Q lcl|NC_021305. 347 IDDVIQPDW---EAKSESTQKMVNSG--VATPNEGREIMGLPRSD 386 (518) Q Consensus 347 ~~~l~~~d~---~~~~~~~~~~~~~G--~~T~NE~R~~~g~~p~~ 386 (518) |.++...+. 
...++++.+++++| +...+-+++++|+..-+ T Consensus 365 W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 365 WEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred eccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 665555554 55678889999998 55668899999998654 No 160 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=98.83 E-value=1.8e-08 Score=63.10 Aligned_cols=387 Identities=10% Similarity=0.052 Sum_probs=172.1 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhh--HHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYG--GIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) +|...-. ...+.-..+..+.. |.......+... ..... .......+...+|+..+..+-.-|+.+...+ + T Consensus 12 ~l~~~~~-~~~~r~~~l~~Yy~---g~~~i~~~~~~~-~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~~-d-- 83 (456) T protein:vir:79 12 VLTKRID-DGMSRVRLLARYSN---GDAPLPELTRNT-SAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSA-D-- 83 (456) T ss_pred HHHHHHH-HHHHHHHHHHHHHh---ccCChhhcCccc-ChhhchhhhhhhcchHHHHHHHHHhhhccCCeecCCCC-C-- Confidence 2222100 00000000111110 000000000000 00111 1112234668899999998888888753211 1 Q ss_pred eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCcee-e--EEeee Q lcl|NC_021305. 79 TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGR-Y--EYYFQ 155 (518) Q Consensus 79 ~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~-~--~~~~~ 155 (518) .+. ...+..++.+ | ....+...+..+++.+|.+|+.+-.+..|.+ .+..++|..+.+.++..... 
+ ...++ T Consensus 84 ~~~-~~~~~~~~~~-n---~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~-~i~~~~p~~~~~i~d~~~~~~~~~~~~~~ 157 (456) T protein:vir:79 84 SDL-ALRARRIWRD-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWW 157 (456) T ss_pred ccH-HHHHHHHHHh-c---ChhHHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEeccceeEEEEcCCCCCceEEEEEEE Confidence 111 1223344443 3 3446677889999999999999888888876 57888999888877653221 1 00000 Q ss_pred cccccCc-eeEEeccc-------------------------------cEEE-------EeccCCCCcccCchHHHHHHHH Q lcl|NC_021305. 156 AGAGVGT-QLVSFADD-------------------------------EVVP-------IRFFNPDGLERGLSLMESLKST 196 (518) Q Consensus 156 ~~~~~~~-~~~~~~~~-------------------------------evih-------~~~~~~~~~~~G~s~l~~~~~~ 196 (518) ....... ....+..+ ++-| ++..+ ..|+|-++..... T Consensus 158 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N----~~~~gd~e~v~~l 233 (456) T protein:vir:79 158 RDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN----PDGMGEVEPHIDI 233 (456) T ss_pred EecCCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCceeEEEecC----CCCCchhhhhHHH Confidence 0000000 00000000 0001 11111 2466767666555 Q ss_pred HHHHHHHHHHHHHHHHccCCcccccccC---ccCCHHHHHHH--HHHHHHHhcCccccCCeeecCCCcceeeccCChhhH Q lcl|NC_021305. 197 IFSEDSSRNATAAMWKNAGRPNLVLRHE---KRLSEAAQQRL--REQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEM 271 (518) Q Consensus 197 i~~~~~~~~~~~~~~~ng~~p~~il~~~---~~~~~~~~~~~--~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~ 271 (518) +.....+..-........+.|.-++.-. ....++..+.+ ...|.. ..+.++.++++.++.++....- . 
T Consensus 234 iD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~~~~------~~~~~~~~~~~~~~~q~~~~~~-~ 306 (456) T protein:vir:79 234 INRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASIFEA------APGALWELPPGVDIWESQTNDF-T 306 (456) T ss_pred HHHHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccccccccchhhhhhh------hccccccCCCCcceeeecccCh-H Confidence 5444433222222222222333222111 00111111111 112221 2244566788888877654322 2 Q ss_pred HHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHHHHHHH----HHh---hhhhhc--cccc Q lcl|NC_021305. 272 QFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAM----DKY---VGQYWV--RKNR 342 (518) Q Consensus 272 ~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l----~~~---l~~~~~--~~~~ 342 (518) .+.+.++..+.+|+..-++|++.+|.... | .+.+ ...+....+.-.+...+..| .+. ++.-.+ .... T Consensus 307 ~~~~~l~~~i~~i~~~t~~p~~~~~~~~~-N-~Sg~--Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~g~~~~~~ 382 (456) T protein:vir:79 307 PMLSAIKEHIRQLSSATKTPLPMLMPDSA-N-QSAE--GAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDT 382 (456) T ss_pred HHHHHHHHHHHHHHhhcCCChhHhccccc-C-cHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc Confidence 37888899999999999999999875321 1 1222 11122112211222222111 111 111111 1124 Q ss_pred ceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCc Q lcl|NC_021305. 343 MKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPA 422 (518) Q Consensus 343 ~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~ 422 (518) +++.|.+....+..+.++++.++++.|+++..-+++.+|+.+.+-+.. .+........+ ..+.+. T Consensus 383 i~v~w~~~~~~s~~~~ada~~kl~~~G~~~~~~~~~~lg~~~~~i~~~-------e~~r~~~e~~~--------~~~~~~ 447 (456) T protein:vir:79 383 VDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQD-------DLDRAREQITL--------FAGNPV 447 (456) T ss_pred ceEEeCCCCCcCHHHHHHHHHHHHhcCCChHHHHHhcCCCCHHHHHHH-------HHHHHHHHHHH--------HhhhHh Confidence 566667777888999999999999999999888888888865321100 00000000000 000000 Q ss_pred cCCCCCCCccc Q lcl|NC_021305. 
423 STPVASLDQSP 433 (518) Q Consensus 423 ~~~~~~~~~~~ 433 (518) ..+ +++.+- T Consensus 448 ~~~--~~~~~~ 456 (456) T protein:vir:79 448 QRP--QEDGSR 456 (456) T ss_pred hcC--CCCCCC Confidence 000 000000 No 161 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=98.76 E-value=6.2e-08 Score=60.09 Aligned_cols=415 Identities=10% Similarity=0.026 Sum_probs=176.7 Q ss_pred CcCC------CC---CCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEE Q lcl|NC_021305. 1 MLLA------NG---QTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCM 71 (518) Q Consensus 1 ~~f~------~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~ 71 (518) |+-. .- +....+....+..+.. |..+..-.+...........-...+....+|+..+.-+-.-|+++. T Consensus 37 ~~~~~~~i~~~i~~~~~~~~~r~~~~~~yY~---g~~~~i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~ 113 (501) T protein:vir:96 37 MVNNWELLKNFINHHKLRQAPRIQELLDYAR---GENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVE 113 (501) T ss_pred cCChHHHHHHHHHHHHHHHHHHHHHHHHHhc---CCCCcccCccccCccccccceeecchHHHHHHHHhhhhcccCeeEe Confidence 0000 00 0000000000100000 0000000000000000001113345667788888888888888774 Q ss_pred EecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCc-e-e Q lcl|NC_021305. 72 FTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT-G-R 149 (518) Q Consensus 72 ~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~-~-~ 149 (518) -.+++. ......++.+....-+.......+..+++.+|.+|+.+.++.+|.+ .+..++|..+.++++... . . 
T Consensus 114 ~~~~~~-----~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~-~i~~~~p~~~~~v~d~~~~~~~ 187 (501) T protein:vir:96 114 YDDNDD-----NSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEYDET-RIKRLSPLETFVIYDNSLEDNS 187 (501) T ss_pred eCCccc-----hhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCce-EEEEEccceeEEEEcCCCCCce Confidence 333211 1222222333333345667788889999999999999999988864 577789999988887542 1 1 Q ss_pred eEE-eeec-ccccCc--eeEEeccccEEEEecc----------CC---------CCcccCchHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 150 YEY-YFQA-GAGVGT--QLVSFADDEVVPIRFF----------NP---------DGLERGLSLMESLKSTIFSEDSSRNA 206 (518) Q Consensus 150 ~~~-~~~~-~~~~~~--~~~~~~~~evih~~~~----------~~---------~~~~~G~s~l~~~~~~i~~~~~~~~~ 206 (518) ... .++. ....++ ....+.++.+.++... ++ .....|.|.+..+...+.....+..- T Consensus 188 ~~~v~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~s~ 267 (501) T protein:vir:96 188 IAAVRYYNRGTLQSAKDVVEIYTDEHIYTLDASDDFNEISVTTHAFGTVPITEYLNNIDGIGDYETELYLIDLYDSAESD 267 (501) T ss_pred EEEEEEEEeecCCCcEEEEEEEcCCcEEEEeeCCCceeccccccCCCccceEEecCCccCCCchhhhHHHHHHHHHHHHH Confidence 111 1111 011111 1111223333222210 00 01135788888777777777666655 Q ss_pred HHHHHHccCCcccccccCccCC-HHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHH Q lcl|NC_021305. 207 TAAMWKNAGRPNLVLRHEKRLS-EAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVC 285 (518) Q Consensus 207 ~~~~~~ng~~p~~il~~~~~~~-~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia 285 (518) ..+.+...+.|-.++.-....+ ++....++. ...-.....+.....+.+.++..+........+....+.+.+.|. 
T Consensus 268 ~~~~~~~~~~~~l~i~G~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~ 344 (501) T protein:vir:96 268 TANHMSDMADAILAIYGDLALPKGMQASDMKR---TRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIH 344 (501) T ss_pred HHHHHHHhcCceeeeecccccCcccchhhhhh---cCeeeecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHH Confidence 5666666666655553321111 111222211 100001111111122344455555555555566777888889999 Q ss_pred HHhcCCHHHhccccccccCCHHHH-------------HHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhh Q lcl|NC_021305. 286 GVYDIAPPIVHILDRATFSNISAQ-------------MRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQ 352 (518) Q Consensus 286 ~~fgVPp~~lg~~~~~~~sn~e~~-------------~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~ 352 (518) ..-++|..-.+... ++- +.++. ....+..+++-.+..+...++..--........+++.+...+. T Consensus 345 ~~s~~p~~~~~~~~-~n~-Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p 422 (501) T protein:vir:96 345 IFTNTPDMSDTNFS-GNT-SGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLP 422 (501) T ss_pred HHhCCcccCccccc-ccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccceEEeCCCCC Confidence 99899865443221 121 11111 1112222222332222222211100001111235666778889 Q ss_pred cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccccccccccccc--CCCCCCCCCCCCCccCCCCCCC Q lcl|NC_021305. 353 PDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDG--AVEWEEAPAPKRPASTPVASLD 430 (518) Q Consensus 353 ~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~ 430 (518) .|..+.++.+.++. |+++..-+.+++++ ++++.. + +..+...... ......+-.+.. + ...++ T Consensus 423 ~n~~e~ad~~~kl~--g~iS~et~~~~l~~--v~D~~~-E------~~ri~~E~~~~~~~~~~~~~~~~~-~--~~~~~- 487 (501) T protein:vir:96 423 KSLNEQVSILTGLG--GQVSQETALSLSGL--VESPNE-E------LDKINKEMSEIDFKGYSNDFNEHV-G--KYTDE- 487 (501) T ss_pred cCHHHHHHHHHHHh--ccCchHHHHHhCCC--CCCHHH-H------HHHHHHHHHHhhccccccchhhcc-c--ccCCc- Confidence 99999999999984 78998888888754 222211 1 1111100000 000000000000 0 00000 Q ss_pred ccccCCccccccchhcchhh Q lcl|NC_021305. 
431 QSPPTSVPGLSPTNSDRSTD 450 (518) Q Consensus 431 ~~~~~~~~~~~~~~~~~~~~ 450 (518) ..+.+++...++.+ T Consensus 488 ------~~e~~~d~~e~~~~ 501 (501) T protein:vir:96 488 ------VKETHTDDFEREYE 501 (501) T ss_pred ------CCCCCCCccccccC Confidence 00001100001000 No 162 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=98.75 E-value=5.2e-08 Score=60.50 Aligned_cols=350 Identities=10% Similarity=0.009 Sum_probs=162.5 Q ss_pred CcCCCCC---CCCcccccccchhhhhhhcccccccccccccchhhhHHH-hhcHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_021305. 1 MLLANGQ---TLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIY-KNQPWVRTVIAKRAQALARLPVKCMFTSGD 76 (518) Q Consensus 1 ~~f~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~ 76 (518) ++..... ....+.-..+..+.. |-......+.. -.......+ ....+..-+|+.++..+.--.|.. T Consensus 4 ~~i~~L~~~~~~~~~r~~~~~~yY~---g~~~~~~~~~~-~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~------ 73 (409) T protein:vir:16 4 KGIGYLRFKLSVHKRRAEMRYEQYA---MKHVDRFKGIT-IPQALSQQYRSILGWCAKGVDSLADRLVFREFEN------ 73 (409) T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHh---ccCchhhcchh-hhHHHHHHHhhhcChhHHHHHHhHhhcccccccC------ Confidence 1111100 000000000000000 00000000000 000111111 122445667777766554344321 Q ss_pred cceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEE--ee Q lcl|NC_021305. 77 TETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEY--YF 154 (518) Q Consensus 77 ~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~--~~ 154 (518) .+..+..++.+ | +.......+..+.+++|.+|+.+..+..|.+ .+.+++|..+.+.++........ 
.+ T Consensus 74 -----~d~~l~~i~~~-N---~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~-~i~~~sP~~~~~i~D~~~~~~~~a~~~ 143 (409) T protein:vir:16 74 -----DDFTVNEIFEE-N---NPDIFFDSTVLSALIASCSFTYISKGENDAV-RLQVIEATNATGIIDPITGLLTEGYAV 143 (409) T ss_pred -----cchHHHHHHHh-c---ChhHHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEcccceEEEeecccccceeeeEE Confidence 11223444433 2 3344556788899999999999998888864 67788998888877664433221 11 Q ss_pred ecccccCce--eEEeccc----------------------cEEEEeccCCCCcccCchH----HHHHHHHHHHHHHHHHH Q lcl|NC_021305. 155 QAGAGVGTQ--LVSFADD----------------------EVVPIRFFNPDGLERGLSL----MESLKSTIFSEDSSRNA 206 (518) Q Consensus 155 ~~~~~~~~~--~~~~~~~----------------------evih~~~~~~~~~~~G~s~----l~~~~~~i~~~~~~~~~ 206 (518) ......+.. ...+.++ .|++|.+.......+|.|- +..+.+.+.....-... T Consensus 144 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~~~~~~ 223 (409) T protein:vir:16 144 LERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTLERADV 223 (409) T ss_pred EEecCCCceEEEEEEecCcEEEEEecCccccceecCCCCcceEEecccccccccCCccccchhHHHHHHHHHHHHHHHHH Confidence 111111111 1112222 2444443322234568774 44555555555444444 Q ss_pred HHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecC-----CCcceeeccCChhhHHHHHHHHHHH Q lcl|NC_021305. 207 TAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVE-----EGMEPIPLQLTAVEMQFIEARQLNR 281 (518) Q Consensus 207 ~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~-----~g~~~~~l~~~~~d~~~~e~~~~~~ 281 (518) ...||.+ .-+.++-.+...++ .+ .|+.. .++++.++ .+.++.++....-+ .|.+.++... 
T Consensus 224 ~~e~~a~--pqr~i~G~d~d~~~--~~----~~~~~------~~~i~~~~~d~~g~~~~v~q~~~~~l~-~~~~~l~~~~ 288 (409) T protein:vir:16 224 TAEFYSF--PQKYVTGLSDDAEP--ME----TWKAT------VSSMLQFTKDEDGDKPTLGQFTQPSMS-PFTEQLRTAA 288 (409) T ss_pred HHHHhcC--hhheeEecCCCCCc--cc----hhhhh------hhHhhccCCCCCCCCceEEecCCCChh-HHHHHHHHHH Confidence 5555533 22333333222111 11 23221 12344443 23566555443322 4899999999 Q ss_pred HHHHHHhcCCHHHhccccccccCCHH---HHHHHHHHHH---hhHHHHHHHHHHHHhhhhhhc--c----cccceecchh Q lcl|NC_021305. 282 EEVCGVYDIAPPIVHILDRATFSNIS---AQMRAFYRDT---MAIPIARIQSAMDKYVGQYWV--R----KNRMKFDIDD 349 (518) Q Consensus 282 ~~Ia~~fgVPp~~lg~~~~~~~sn~e---~~~~~~~~~~---l~P~~~~ie~~l~~~l~~~~~--~----~~~~~fd~~~ 349 (518) ..+|+.-++|++.+|.... |-++.+ .+...+...+ -+-+-..+++.+...+.-... . ...+++.|.+ T Consensus 289 ~~~a~~s~lP~~~lg~~~~-NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~v~W~~ 367 (409) T protein:vir:16 289 AGFAGETGLTLDDLGFVSD-NPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYLREQFSKTKPKWEP 367 (409) T ss_pred HHHhhhcCCCHHHcccccC-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchhhccceEEecC Confidence 9999999999999986543 222222 2222222211 111222222222211111111 0 1234555665 Q ss_pred hhhc---CHHHHHHHHHHHHhCCC-c-CHHHHHHHhCCCCCC Q lcl|NC_021305. 350 VIQP---DWEAKSESTQKMVNSGV-A-TPNEGREIMGLPRSD 386 (518) Q Consensus 350 l~~~---d~~~~~~~~~~~~~~G~-~-T~NE~R~~~g~~p~~ 386 (518) .... +....++++.|++++|. 
+ .-+-+++++|+..-+ T Consensus 368 ~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 368 LFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred CCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 5433 36778889999999973 3 346679999997654 No 163 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=98.74 E-value=7.3e-08 Score=59.71 Aligned_cols=402 Identities=10% Similarity=0.043 Sum_probs=174.9 Q ss_pred CcCCCCCCCCcccccccchhhhhhhc------------ccccccccccc-----cchhhhHHHhhcHHHHHHHHHHHHhh Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYY------------YAPAVGMQLER-----QFSLYGGIYKNQPWVRTVIAKRAQAL 63 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~-----~~~~~~~~~~~~~~v~~~v~~ia~~i 63 (518) |.=-++-..|-..-.. ..|+..... ...+.|-.... -....+.......+...||+.+++.+ T Consensus 1 ~~~~~~~~~~gl~~~~-~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw~~~~Vd~~a~rl 79 (474) T protein:vir:81 1 MIQQQTVRIPSLSNDE-NALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVLGWTGKAVDALARRC 79 (474) T ss_pred CcCCCcCcCCCCChhH-HHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHHHHHHhhcChHHHHHHHHHhhh Confidence 3322222221110000 011110000 00011110000 01112222233455577888888766 Q ss_pred ccCceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCce-EEEEeeCCceeEEE Q lcl|NC_021305. 64 ARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTP-EKLMPMHPSRVAIK 142 (518) Q Consensus 64 a~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~-~~l~~l~p~~v~v~ 142 (518) .--.|.+ .++ ...+..++.++.+ |. .......+..+.+++|.+|+.+..+.+|.+ ..+.+++|..+.+. 
T Consensus 80 ~~~Gf~~---~d~---~~~~~~l~~iw~~-N~---ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~~~~~ 149 (474) T protein:vir:81 80 NLEGFVW---PDG---DLDSLGGTEVVDD-NH---LLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASEATGE 149 (474) T ss_pred cccceEC---CCC---CccchHHHHHHHh-cC---hhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceEEEE Confidence 6555543 211 1122223444432 22 234566678899999999999988777764 56778999999888 Q ss_pred EcCCceeeEEeee--cccccCc--eeEEecccc-------------------------EEEEeccCCCCcccCchHH--- Q lcl|NC_021305. 143 RNSRTGRYEYYFQ--AGAGVGT--QLVSFADDE-------------------------VVPIRFFNPDGLERGLSLM--- 190 (518) Q Consensus 143 ~~~~~~~~~~~~~--~~~~~~~--~~~~~~~~e-------------------------vih~~~~~~~~~~~G~s~l--- 190 (518) +|.......+.+. .....+. ....|.++. |++|.+...-...+|.|.+ T Consensus 150 ~D~~~~~~~~al~~~~~~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~~~gvPvV~~~n~~~~~~~~G~s~i~e~ 229 (474) T protein:vir:81 150 WNRRRRGLNNLLSIIDKDKEGKVLSLALYLDNETVTAQRDKATLKWQVDRDEHVYGVPAQVLPYKPAPKRPFGQSRITKP 229 (474) T ss_pred EeCCCCcceeeeEEEEEcCCCcEEEEEEEeCCcEEEEEEcCccceeeeccCCCCCCcceEEecccccccCcCCccccchh Confidence 7765443222111 1111111 011122222 3444333222233677744 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHccCCcccccccCc-cCCHHH---HHHHHHHHHHHhcCccccCCeeecCCCcceeecc Q lcl|NC_021305. 191 -ESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK-RLSEAA---QQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQ 265 (518) Q Consensus 191 -~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~~~~---~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~ 265 (518) ..+.+.+.....-......|+.. .-+.++-... ...+++ ...++..+...+.-..+...-.....+.++.++. 
T Consensus 230 v~~l~da~~r~~~~~~~~~e~~a~--pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~ 307 (474) T protein:vir:81 230 MMGLQDAGVRELARREGHMDVFSY--PEFWLLGADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFP 307 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcc--hhheeecCChhhcccccccccchhhhhHHHHhcCCCcccccccccccccccccC Confidence 34444444433333444444433 2233332221 111111 1223322222211111111111112345665555 Q ss_pred CChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHH---HHHHH---HhhHHHHHHHHHHHHhhhhhhc- Q lcl|NC_021305. 266 LTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMR---AFYRD---TMAIPIARIQSAMDKYVGQYWV- 338 (518) Q Consensus 266 ~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~---~~~~~---~l~P~~~~ie~~l~~~l~~~~~- 338 (518) ...-+ -|.+.++.....||..-+||++.+|+....|-++.+.... .+... ..+-+-..+++.+...+.-..+ T Consensus 308 ~a~l~-~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~ 386 (474) T protein:vir:81 308 AASPD-AHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKV 386 (474) T ss_pred CCChh-HHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC Confidence 43222 3888999999999999999999999765455454443221 11111 1122222233322222211111 Q ss_pred -------ccccceecchhhhhcCHHHHHHHHHHHHhCCC--cCHHHHHHHhCCCCCCCC-Ccceeeeccccccccccccc Q lcl|NC_021305. 339 -------RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGV--ATPNEGREIMGLPRSDDP-KADELYANSALQPLGATPDG 408 (518) Q Consensus 339 -------~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~--~T~NE~R~~~g~~p~~~~-~gD~~~~~~n~~~~~~~~~~ 408 (518) ..+.+++.|.+....+..+.++++.+++++|. .+..=+++++|+.+-+-. +-+....-....+++..... T Consensus 387 ~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t~~~i~~~~~~~~~~~~~~~~~~l~~~ 466 (474) T protein:vir:81 387 AIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAAVPWLAETEVGLELIGLTPQQARRAMADKRRVQGRGTLQALIDR 466 (474) T ss_pred CccccchhhccceeEecCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcCCCHHHHHHHHHHHHHHhHHHHHHHHHhc Confidence 11245566777788889999999999999874 333446788898754210 00000000011111111111 Q ss_pred CCCCCCCCCCC Q lcl|NC_021305. 
409 AVEWEEAPAPK 419 (518) Q Consensus 409 ~~~~~~~~~~~ 419 (518) .. ++++.+ T Consensus 467 ~~---~~~~aq 474 (474) T protein:vir:81 467 SN---NGATAQ 474 (474) T ss_pred CC---CCCCCC Confidence 00 000000 No 164 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=98.73 E-value=7.8e-08 Score=59.55 Aligned_cols=387 Identities=10% Similarity=0.061 Sum_probs=177.1 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhh--HHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYG--GIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) .|...-.. ..+....+..+.. |.......+.. ...... ..-..+.+...+|+..+..+-.-||.+-..++. T Consensus 12 ~l~~~~~~-~~~r~~~l~~Yy~---g~~~i~~~~~~-~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~~~d~-- 84 (456) T protein:vir:10 12 VLTKRIDD-GMSRVRLLARYSN---GDAPLPELTRN-TSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADS-- 84 (456) T ss_pred HHHHHHHH-HHHHHHHHHHHHh---cCCCchhcCcc-cChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCCCCCc-- Confidence 22211100 0111111111110 00000000000 000111 111234466889999999888888876321111 Q ss_pred eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceee--EE--ee Q lcl|NC_021305. 79 TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRY--EY--YF 154 (518) Q Consensus 79 ~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~--~~--~~ 154 (518) . .+..+..++.+ | +...+...+..+++.+|.+|..+-.+..|.+ .+..++|..+.++++...... .. 
.+ T Consensus 85 -~-~~~~~~~i~~~-N---~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~i~~~ 157 (456) T protein:vir:10 85 -D-LALRARRIWRD-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRAAMRWW 157 (456) T ss_pred -c-hHHHHHHHHHh-c---ChhhHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEEccceeEEEEcCCCCcceEEEEEEE Confidence 1 12233444443 3 3445567788999999999999888888865 467788998888877543210 00 00 Q ss_pred ecccccC-------------------------ceeEEeccc------------cEEE-EeccCCCCcccCchHHHHHHHH Q lcl|NC_021305. 155 QAGAGVG-------------------------TQLVSFADD------------EVVP-IRFFNPDGLERGLSLMESLKST 196 (518) Q Consensus 155 ~~~~~~~-------------------------~~~~~~~~~------------evih-~~~~~~~~~~~G~s~l~~~~~~ 196 (518) ...+... ......... .++. ...+| ..|+|-++..... T Consensus 158 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N----~~g~gd~e~vi~l 233 (456) T protein:vir:10 158 RDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN----PDGMGEVEPHIDI 233 (456) T ss_pred EecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecC----CCCCchhhhhHHH Confidence 0000000 000000000 0001 11112 2477777776666 Q ss_pred HHHHHHHHHHHHHHHHccCCcccccccC---ccCCHHHHHHH--HHHHHHHhcCccccCCeeecCCCcceeeccCChhhH Q lcl|NC_021305. 197 IFSEDSSRNATAAMWKNAGRPNLVLRHE---KRLSEAAQQRL--REQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEM 271 (518) Q Consensus 197 i~~~~~~~~~~~~~~~ng~~p~~il~~~---~~~~~~~~~~~--~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~ 271 (518) +.....+..-........+.|.-++.-. ....++....+ ...|+. ..+.++.++++.++.++....- . T Consensus 234 iDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~------~~~~~~~~~~~~~~~q~~~~~~-~ 306 (456) T protein:vir:10 234 INRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEA------APGALWELPPGVDIWESQANDF-T 306 (456) T ss_pred HHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhh------hccccccCCCCcceEEecccCh-h Confidence 6655544433223323333333333211 00111111111 112222 2245666788888877764322 2 Q ss_pred HHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHHHHH----HHHHhh---hhhhc--cccc Q lcl|NC_021305. 
272 QFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQS----AMDKYV---GQYWV--RKNR 342 (518) Q Consensus 272 ~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~----~l~~~l---~~~~~--~~~~ 342 (518) .|.+.++..+.+|++.-++|++.+|... +| .+.+ ...+....+.-.+...+. .+.+.+ +.-.+ .... T Consensus 307 ~~~~~l~~~i~~~~~~s~~p~~~~~~~~-~N-~Sg~--Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~~~~~ 382 (456) T protein:vir:10 307 PMLSAIKEHIRQLSSATKTPLPMLMPDS-AN-QSAE--GAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDT 382 (456) T ss_pred HHHHHHHHHHHHHHhccCCChHHhcccc-cC-hHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc Confidence 3788899999999999999999987532 12 1222 111222222222222222 222111 11111 1234 Q ss_pred ceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCc Q lcl|NC_021305. 343 MKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPA 422 (518) Q Consensus 343 ~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~ 422 (518) +++.|.+....+..+.++++.++++.|+++..-+++++|+.+-+-+.. + +..+.....+ ...++. T Consensus 383 ~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~-e------~er~~~e~~~--------~~~~~~ 447 (456) T protein:vir:10 383 VDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQD-D------LDRAREQITL--------FAGNPV 447 (456) T ss_pred eeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHH-H------HHHHHHHHHH--------Hhhhhh Confidence 566777778889999999999999999999888888898865311000 0 0000000000 000111 Q ss_pred cCCCCCCCc Q lcl|NC_021305. 423 STPVASLDQ 431 (518) Q Consensus 423 ~~~~~~~~~ 431 (518) ..|..+... 
T Consensus 448 ~~~~~~~~~ 456 (456) T protein:vir:10 448 QRPQEDGSR 456 (456) T ss_pred hcCCCCCCC Confidence 111101111 No 165 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=98.73 E-value=7.8e-08 Score=59.55 Aligned_cols=387 Identities=10% Similarity=0.061 Sum_probs=177.1 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhh--HHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYG--GIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) .|...-.. ..+....+..+.. |.......+.. ...... ..-..+.+...+|+..+..+-.-||.+-..++. T Consensus 12 ~l~~~~~~-~~~r~~~l~~Yy~---g~~~i~~~~~~-~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~~~d~-- 84 (456) T protein:vir:10 12 VLTKRIDD-GMSRVRLLARYSN---GDAPLPELTRN-TSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADS-- 84 (456) T ss_pred HHHHHHHH-HHHHHHHHHHHHh---cCCCchhcCcc-cChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCCCCCc-- Confidence 22211100 0111111111110 00000000000 000111 111234466889999999888888876321111 Q ss_pred eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceee--EE--ee Q lcl|NC_021305. 79 TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRY--EY--YF 154 (518) Q Consensus 79 ~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~--~~--~~ 154 (518) . .+..+..++.+ | +...+...+..+++.+|.+|..+-.+..|.+ .+..++|..+.++++...... .. 
.+ T Consensus 85 -~-~~~~~~~i~~~-N---~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~i~~~ 157 (456) T protein:vir:10 85 -D-LALRARRIWRD-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRAAMRWW 157 (456) T ss_pred -c-hHHHHHHHHHh-c---ChhhHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEEccceeEEEEcCCCCcceEEEEEEE Confidence 1 12233444443 3 3445567788999999999999888888865 467788998888877543210 00 00 Q ss_pred ecccccC-------------------------ceeEEeccc------------cEEE-EeccCCCCcccCchHHHHHHHH Q lcl|NC_021305. 155 QAGAGVG-------------------------TQLVSFADD------------EVVP-IRFFNPDGLERGLSLMESLKST 196 (518) Q Consensus 155 ~~~~~~~-------------------------~~~~~~~~~------------evih-~~~~~~~~~~~G~s~l~~~~~~ 196 (518) ...+... ......... .++. ...+| ..|+|-++..... T Consensus 158 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N----~~g~gd~e~vi~l 233 (456) T protein:vir:10 158 RDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN----PDGMGEVEPHIDI 233 (456) T ss_pred EecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecC----CCCCchhhhhHHH Confidence 0000000 000000000 0001 11112 2477777776666 Q ss_pred HHHHHHHHHHHHHHHHccCCcccccccC---ccCCHHHHHHH--HHHHHHHhcCccccCCeeecCCCcceeeccCChhhH Q lcl|NC_021305. 197 IFSEDSSRNATAAMWKNAGRPNLVLRHE---KRLSEAAQQRL--REQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEM 271 (518) Q Consensus 197 i~~~~~~~~~~~~~~~ng~~p~~il~~~---~~~~~~~~~~~--~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~ 271 (518) +.....+..-........+.|.-++.-. ....++....+ ...|+. ..+.++.++++.++.++....- . T Consensus 234 iDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~------~~~~~~~~~~~~~~~q~~~~~~-~ 306 (456) T protein:vir:10 234 INRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEA------APGALWELPPGVDIWESQANDF-T 306 (456) T ss_pred HHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhh------hccccccCCCCcceEEecccCh-h Confidence 6655544433223323333333333211 00111111111 112222 2245666788888877764322 2 Q ss_pred HHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHHHHH----HHHHhh---hhhhc--cccc Q lcl|NC_021305. 
272 QFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQS----AMDKYV---GQYWV--RKNR 342 (518) Q Consensus 272 ~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~----~l~~~l---~~~~~--~~~~ 342 (518) .|.+.++..+.+|++.-++|++.+|... +| .+.+ ...+....+.-.+...+. .+.+.+ +.-.+ .... T Consensus 307 ~~~~~l~~~i~~~~~~s~~p~~~~~~~~-~N-~Sg~--Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~~~~~ 382 (456) T protein:vir:10 307 PMLSAIKEHIRQLSSATKTPLPMLMPDS-AN-QSAE--GAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDT 382 (456) T ss_pred HHHHHHHHHHHHHHhccCCChHHhcccc-cC-hHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc Confidence 3788899999999999999999987532 12 1222 111222222222222222 222111 11111 1234 Q ss_pred ceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCc Q lcl|NC_021305. 343 MKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPA 422 (518) Q Consensus 343 ~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~ 422 (518) +++.|.+....+..+.++++.++++.|+++..-+++++|+.+-+-+.. + +..+.....+ ...++. T Consensus 383 ~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~-e------~er~~~e~~~--------~~~~~~ 447 (456) T protein:vir:10 383 VDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQD-D------LDRAREQITL--------FAGNPV 447 (456) T ss_pred eeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHH-H------HHHHHHHHHH--------Hhhhhh Confidence 566777778889999999999999999999888888898865311000 0 0000000000 000111 Q ss_pred cCCCCCCCc Q lcl|NC_021305. 423 STPVASLDQ 431 (518) Q Consensus 423 ~~~~~~~~~ 431 (518) ..|..+... 
T Consensus 448 ~~~~~~~~~ 456 (456) T protein:vir:10 448 QRPQEDGSR 456 (456) T ss_pred hcCCCCCCC Confidence 111101111 No 166 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=98.72 E-value=8.1e-08 Score=59.47 Aligned_cols=393 Identities=11% Similarity=0.029 Sum_probs=181.6 Q ss_pred CcCCCCCCCCcccccc--cch----hhhhhhccccccccc--cc-c----cchhhhHHHhhcHHHHHHHHHHHHhhccCc Q lcl|NC_021305. 1 MLLANGQTLSAPAMAE--LSP----QMQDSYYYAPAVGMQ--LE-R----QFSLYGGIYKNQPWVRTVIAKRAQALARLP 67 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~--~~~----~~~~~~~~~~~~~~~--~~-~----~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~ 67 (518) +||-.+...+.-.... .++ .+.. +..++.|-. .. . .................+|+..|+-+..-| T Consensus 15 ~~~~~~~~~~~~~~~~i~~~~~~~~~i~~--~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~~~~iv~~~a~~l~~ep 92 (499) T protein:vir:80 15 RMGLLKSLKDVTDHKKVNANDEDYKYIDM--WKRLYQGNYAEWHNLNYEHNGNPVNRRQLSMNLPKVTAKYMSKLLFNEK 92 (499) T ss_pred HhccccchhhhhcCCCCcCCHHHHHHHHH--HHHHhcCCcchhhccccccCCCccccceeecchHHHHHHHHHHhhhCCc Confidence 4444322211110000 111 0000 001111100 00 0 000000111223344667788888887766 Q ss_pred eEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCc Q lcl|NC_021305. 68 VKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT 147 (518) Q Consensus 68 ~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~ 147 (518) ..+--. 
+......+.+-...-....-+..++...+..|.+++.+..|.+|.+ .+..++|..+.+...+.+ T Consensus 93 ~~i~~~---------d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~-~i~~v~a~~~~Pi~~d~~ 162 (499) T protein:vir:80 93 VKINID---------DETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNV-KVSFATADCMYPLSNDSE 162 (499) T ss_pred ceEeeC---------CHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcE-EEEEEcCCceEEEEecCC Confidence 665321 1122222222222233555566778888999999999999888764 467788888766443333 Q ss_pred eeeEEe---------------------------eecc--------cccCceeEE----------------eccccEEEEe Q lcl|NC_021305. 148 GRYEYY---------------------------FQAG--------AGVGTQLVS----------------FADDEVVPIR 176 (518) Q Consensus 148 ~~~~~~---------------------------~~~~--------~~~~~~~~~----------------~~~~evih~~ 176 (518) ...... |.+. ....+..+. +..--+.||+ T Consensus 163 ~~~~~~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~p~f~~~~ 242 (499) T protein:vir:80 163 NVDECLIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVPLPSLTRPTFIYIK 242 (499) T ss_pred CeEEEEEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCceeecCCCccceEeec Confidence 210000 0000 000011110 0111245666 Q ss_pred ccCCC----CcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCccccc-----ccCccCCHHHHHHHHHHHHHHhcCcc Q lcl|NC_021305. 177 FFNPD----GLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVL-----RHEKRLSEAAQQRLREQFDRAHSGSS 247 (518) Q Consensus 177 ~~~~~----~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il-----~~~~~~~~~~~~~~~~~~~~~~~g~~ 247 (518) .+-++ +.+.|+|.+.-+...+...........+-|..+ ....++ ......+.+... .|.. .. 
T Consensus 243 ~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~-~~~i~v~~~~l~~~~~~~g~~~~----~~~~----~~ 313 (499) T protein:vir:80 243 PNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLG-KKKVLVPSSFVKTAVNLDGSTTQ----YFDS----TD 313 (499) T ss_pred CCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhc-ccceecchhhhhccCCCCCCccc----CCCc----cc Confidence 54332 335699999999998888887766666666653 333332 111111100000 0100 00 Q ss_pred ccCCee-ec--CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHH--HHHHHHHhhHHH Q lcl|NC_021305. 248 NTGKTM-VV--EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM--RAFYRDTMAIPI 322 (518) Q Consensus 248 n~g~~~-vl--~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~--~~~~~~~l~P~~ 322 (518) ...+.. .. +++-.++.+......-++.+..+...++|....|+++..+|...++.. +..+.. ..-...++.-.. T Consensus 314 ~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~-TAtei~s~~~~l~~~~~~~~ 392 (499) T protein:vir:80 314 EAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLK-TATEVVSEKSETYQTKNSHS 392 (499) T ss_pred ceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccch-hHHHHHHHHHHHHHHHHHHH Confidence 000111 11 223346667777777778888999999999999999999987655432 222221 111111222222 Q ss_pred HHHHHHHHHhh-----------hhh--hcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh-CCCCCCCC Q lcl|NC_021305. 323 ARIQSAMDKYV-----------GQY--WVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIM-GLPRSDDP 388 (518) Q Consensus 323 ~~ie~~l~~~l-----------~~~--~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~-g~~p~~~~ 388 (518) ..++..|...+ ... ......+.|++++-+..|..+.++.+.+++.+|+|+.-.++... 
|.+ ++ T Consensus 393 ~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~---d~ 469 (499) T protein:vir:80 393 QLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMIPLKIALQRAWNIT---EA 469 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHhhcCCCC---hH Confidence 23333322211 111 11234577888888899999999999999999999999887654 432 22 Q ss_pred CcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccc Q lcl|NC_021305. 389 KADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSP 433 (518) Q Consensus 389 ~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (518) ..++.+ ..+. . ++...-+. ....+..++.. T Consensus 470 ea~~el-----~~i~---~----E~~~~~~~---~d~~g~~ge~e 499 (499) T protein:vir:80 470 EADEWA-----EMLA---K----EKQAEIPN---NDMTGIFGEEE 499 (499) T ss_pred HHHHHH-----HHHH---H----HhhcCCCC---CCccccCCCCC Confidence 222111 0000 0 00000000 00000001100 No 167 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=98.72 E-value=1.1e-09 Score=69.69 Aligned_cols=181 Identities=10% Similarity=0.108 Sum_probs=100.6 Q ss_pred ccccCc---cCCHHHHHHHHHHHH--HHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHH- Q lcl|NC_021305. 220 VLRHEK---RLSEAAQQRLREQFD--RAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPP- 293 (518) Q Consensus 220 il~~~~---~~~~~~~~~~~~~~~--~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~- 293 (518) |++.++ .++.. ...+++++. ..+.+ +.+.+.+...+-+|..+..+..... +........||++-|||.. 
T Consensus 1 V~k~~~l~~~~~~~-~~~~~~r~~~~~~~~~--~~~~~~ld~~~e~~e~~~~~lsGl~--d~l~~~~~~iaa~s~iP~t~ 75 (201) T protein:vir:10 1 MWKAKGLADLCDDS-DGAARLRLAQVDNNSG--VGQAIGIDADSEEYNVLNSDIGGID--TFLSQKFDRIVALSGIHEII 75 (201) T ss_pred CccchHHHHHhcCC-hHHHHHHHHHHHHhhh--hhhhheeecCCcceeeeecCcCChH--HHHHHHHHHHHhHhcCchhh Confidence 555443 11111 123333333 33333 2344556666688888888877654 7788888999999999966 Q ss_pred HhccccccccCCHHHHHHHHHHH-------HhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHH------ Q lcl|NC_021305. 294 IVHILDRATFSNISAQMRAFYRD-------TMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSE------ 360 (518) Q Consensus 294 ~lg~~~~~~~sn~e~~~~~~~~~-------~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~------ 360 (518) ++|...++-+++.+...++||.. -++|.++++-. .+ .....+.|.|.+|...+.+++++ T Consensus 76 LfG~sp~Glnatge~d~~nyyd~i~~~Qe~~l~p~le~l~~----~~----~~~~~~~~~f~pL~~~s~kekAei~~~~a 147 (201) T protein:vir:10 76 LKGKNVGGVSASQNTALETFYGYVDRKRKAELLPLLEFLLP----FI----VTEQEWSVEFNPLSQVSDKDKSEILEKNV 147 (201) T ss_pred hcCCCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHH----hh----cCCCCceEeeCCCCCCCHHHHHHHHHHHH Confidence 55655555556667677777764 35666655433 11 22335677788888888877654 Q ss_pred -HHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCC Q lcl|NC_021305. 361 -STQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVAS 428 (518) Q Consensus 361 -~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (518) ++.+++.+|+++++|+|+.+--.+.. 
+ +++.+.+..+ ........|.+.+.++ T Consensus 148 ~a~~~~~~~g~i~~~e~r~~L~~~~~~--~----~~~~~~~~~~---------~~~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 148 NSVAALIAAGIIDADEARDTLRAISTE--V----KIGEGSIQTE---------VVINESEDPLDVSANN 201 (201) T ss_pred HHHHHHHHcCCCCHHHHHHHHHhcCCc--C----CCCCCCCCcc---------ccccccCCCCCCCCCC Confidence 45678999999999999988543321 1 1111111000 0000000111111111 No 168 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=98.65 E-value=1.5e-07 Score=58.06 Aligned_cols=391 Identities=10% Similarity=0.006 Sum_probs=177.7 Q ss_pred CcCCCCCCCC---cccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_021305. 1 MLLANGQTLS---APAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDT 77 (518) Q Consensus 1 ~~f~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~ 77 (518) +.-+.-...+ .+....+.+... +....-....+ ... ..++....+|+..+.-+-+-|+.+--..+.. T Consensus 40 ~y~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~----~~k--i~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~ 109 (474) T protein:vir:10 40 RYKTHIDYVPIFKRRPIEEKEDFET----GGNVRRLDVSV----NNK--LNNSFDSEIVDTRVGYLHGVPVTYDLDENAE 109 (474) T ss_pred HHhhhcchhhhhcchhhhhhhhhhh----cccccccccCc----ccc--cccchHHHHHHhHhhheeccceeEeeCCCCc Confidence 1111000000 000000111000 00000000000 001 1244567778888888878888764322211 Q ss_pred ceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEe--ee Q lcl|NC_021305. 78 ETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYY--FQ 155 (518) Q Consensus 78 ~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~--~~ 155 (518) .+..+..++.+-............+..+.+.+|.+|.++..+.+|.+ .+..++|..+.++.+..+...... +. 
T Consensus 110 ----~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~~~~i~p~~~~~v~d~~~~~~~~i~~~~ 184 (474) T protein:vir:10 110 ----KNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDI-RIKNIDPYNVIFVGDNILEPTYSLRYFY 184 (474) T ss_pred ----chHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCee-EEEEEcccceEEEEcCCCceEEEEEEEE Confidence 11122222222222335666778889999999999999888888864 677888988887776544332111 00 Q ss_pred cccccCc----eeEEeccccEEEEecc------------CC---------CCcccCchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 156 AGAGVGT----QLVSFADDEVVPIRFF------------NP---------DGLERGLSLMESLKSTIFSEDSSRNATAAM 210 (518) Q Consensus 156 ~~~~~~~----~~~~~~~~evih~~~~------------~~---------~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~ 210 (518) .....++ ....+....+.+++.. ++ .....|.|-+..+...+.....+..-..+. T Consensus 185 ~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~ 264 (474) T protein:vir:10 185 EKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSE 264 (474) T ss_pred EeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHH Confidence 0000011 0111122222222211 00 011247777777777776666555555555 Q ss_pred HHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021305. 
211 WKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDI 290 (518) Q Consensus 211 ~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgV 290 (518) +...+.|-.+++- ..++++....++ ..+.+.+.+++.++..+........+....+.+.+.|...-++ T Consensus 265 ~~~~~~~~l~i~g-~~~~~~~~~~~~-----------~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~ 332 (474) T protein:vir:10 265 ISQTRLAYLVLRG-MGMSEEMIQETQ-----------KSGAFELFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKS 332 (474) T ss_pred HHHhhcchhhhcc-CCCCchhhhhhh-----------hcceeEecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCC Confidence 5555556555532 233443322221 2234555566666666665555666778888889999998888 Q ss_pred CHHHhccccccccCCHHHH-------------HHHHHHHHhhHHHHHHHHHHHHhhhhhh-cccccceecchhhhhcCHH Q lcl|NC_021305. 291 APPIVHILDRATFSNISAQ-------------MRAFYRDTMAIPIARIQSAMDKYVGQYW-VRKNRMKFDIDDVIQPDWE 356 (518) Q Consensus 291 Pp~~lg~~~~~~~sn~e~~-------------~~~~~~~~l~P~~~~ie~~l~~~l~~~~-~~~~~~~fd~~~l~~~d~~ 356 (518) |..-.+... ++ .+..+. ....+..++.-.++.|...++..-.... .....+++.+..-+..|.. T Consensus 333 p~~~~~~~~-~n-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~ 410 (474) T protein:vir:10 333 VNFNSDEFN-GN-VPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKL 410 (474) T ss_pred ccccccccc-cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHH Confidence 864332111 11 111111 1112222333333333322222111000 0112456777788889999 Q ss_pred HHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCC Q lcl|NC_021305. 357 AKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTS 436 (518) Q Consensus 357 ~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 436 (518) +.++.+.++ .|++|..-+.++++.-+-+....+ .+........ +..+...++ +.++.+++. 
T Consensus 411 e~a~~~~kl--~g~iS~et~~~~l~~v~d~~~E~e---------ri~~E~~e~~--~~~~~~~~~------~~~~~~~~~ 471 (474) T protein:vir:10 411 EESQVLINL--KGQVSERTRLGQSQLVDDVDYELD---------EMEKESLEFN--DKLPDIDEG------DANDKSQNN 471 (474) T ss_pred HHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHH---------HHHHHHHHHH--hhcccccCC------CcCCCCccc Confidence 999999988 488999888888865321111111 1100000000 000000000 000000000 Q ss_pred ccc Q lcl|NC_021305. 437 VPG 439 (518) Q Consensus 437 ~~~ 439 (518) +.+ T Consensus 472 ~s~ 474 (474) T protein:vir:10 472 QSE 474 (474) T ss_pred cCC Confidence 000 No 169 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=98.65 E-value=1.5e-07 Score=58.06 Aligned_cols=391 Identities=10% Similarity=0.006 Sum_probs=177.7 Q ss_pred CcCCCCCCCC---cccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_021305. 1 MLLANGQTLS---APAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDT 77 (518) Q Consensus 1 ~~f~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~ 77 (518) +.-+.-...+ .+....+.+... +....-....+ ... ..++....+|+..+.-+-+-|+.+--..+.. T Consensus 40 ~y~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~----~~k--i~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~ 109 (474) T protein:vir:94 40 RYKTHIDYVPIFKRRPIEEKEDFET----GGNVRRLDVSV----NNK--LNNSFDSEIVDTRVGYLHGVPVTYDLDENAE 109 (474) T ss_pred HHhhhcchhhhhcchhhhhhhhhhh----cccccccccCc----ccc--cccchHHHHHHhHhhheeccceeEeeCCCCc Confidence 1111000000 000000111000 00000000000 001 1244567778888888878888764322211 Q ss_pred ceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEe--ee Q lcl|NC_021305. 
78 ETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYY--FQ 155 (518) Q Consensus 78 ~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~--~~ 155 (518) .+..+..++.+-............+..+.+.+|.+|.++..+.+|.+ .+..++|..+.++.+..+...... +. T Consensus 110 ----~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~~~~i~p~~~~~v~d~~~~~~~~i~~~~ 184 (474) T protein:vir:94 110 ----KNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDI-RIKNIDPYNVIFVGDNILEPTYSLRYFY 184 (474) T ss_pred ----chHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCee-EEEEEcccceEEEEcCCCceEEEEEEEE Confidence 11122222222222335666778889999999999999888888864 677888988887776544332111 00 Q ss_pred cccccCc----eeEEeccccEEEEecc------------CC---------CCcccCchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 156 AGAGVGT----QLVSFADDEVVPIRFF------------NP---------DGLERGLSLMESLKSTIFSEDSSRNATAAM 210 (518) Q Consensus 156 ~~~~~~~----~~~~~~~~evih~~~~------------~~---------~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~ 210 (518) .....++ ....+....+.+++.. ++ .....|.|-+..+...+.....+..-..+. T Consensus 185 ~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~ 264 (474) T protein:vir:94 185 EKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSE 264 (474) T ss_pred EeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHH Confidence 0000011 0111122222222211 00 011247777777777776666555555555 Q ss_pred HHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021305. 
211 WKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDI 290 (518) Q Consensus 211 ~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgV 290 (518) +...+.|-.+++- ..++++....++ ..+.+.+.+++.++..+........+....+.+.+.|...-++ T Consensus 265 ~~~~~~~~l~i~g-~~~~~~~~~~~~-----------~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~ 332 (474) T protein:vir:94 265 ISQTRLAYLVLRG-MGMSEEMIQETQ-----------KSGAFELFDKDMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKS 332 (474) T ss_pred HHHhhcchhhhcc-CCCCchhhhhhh-----------hcceeEecCCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCC Confidence 5555556555532 233443322221 2234555566666666665555666778888889999998888 Q ss_pred CHHHhccccccccCCHHHH-------------HHHHHHHHhhHHHHHHHHHHHHhhhhhh-cccccceecchhhhhcCHH Q lcl|NC_021305. 291 APPIVHILDRATFSNISAQ-------------MRAFYRDTMAIPIARIQSAMDKYVGQYW-VRKNRMKFDIDDVIQPDWE 356 (518) Q Consensus 291 Pp~~lg~~~~~~~sn~e~~-------------~~~~~~~~l~P~~~~ie~~l~~~l~~~~-~~~~~~~fd~~~l~~~d~~ 356 (518) |..-.+... ++ .+..+. ....+..++.-.++.|...++..-.... .....+++.+..-+..|.. T Consensus 333 p~~~~~~~~-~n-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~ 410 (474) T protein:vir:94 333 VNFNSDEFN-GN-VPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKL 410 (474) T ss_pred ccccccccc-cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHH Confidence 864332111 11 111111 1112222333333333322222111000 0112456777788889999 Q ss_pred HHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCC Q lcl|NC_021305. 357 AKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTS 436 (518) Q Consensus 357 ~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 436 (518) +.++.+.++ .|++|..-+.++++.-+-+....+ .+........ +..+...++ +.++.+++. 
T Consensus 411 e~a~~~~kl--~g~iS~et~~~~l~~v~d~~~E~e---------ri~~E~~e~~--~~~~~~~~~------~~~~~~~~~ 471 (474) T protein:vir:94 411 EESQVLINL--KGQVSERTRLGQSQLVDDVDYELD---------EMEKESLEFN--DKLPDIDEG------DANDKSQNN 471 (474) T ss_pred HHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHH---------HHHHHHHHHH--hhcccccCC------CcCCCCccc Confidence 999999988 488999888888865321111111 1100000000 000000000 000000000 Q ss_pred ccc Q lcl|NC_021305. 437 VPG 439 (518) Q Consensus 437 ~~~ 439 (518) +.+ T Consensus 472 ~s~ 474 (474) T protein:vir:94 472 QSE 474 (474) T ss_pred cCC Confidence 000 No 170 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=98.64 E-value=1.6e-07 Score=57.88 Aligned_cols=397 Identities=10% Similarity=0.023 Sum_probs=160.9 Q ss_pred CcCCCCCCCCcccccccchhhhhhhccccc--ccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPA--VGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) =|+..-. ...+.-..+..+.. |..+. .+............. ....+...+|+..+..+--.+|. ..++. T Consensus 34 ~l~~~~~-~~~~rl~~l~~YY~---G~~~~~~~~~~~~~~~~~~~~~-~v~n~~~~ivd~~a~~l~~~gf~---~~d~~- 104 (501) T protein:vir:25 34 DMWRLHI-SERQWLDRIYEYTK---GLRGRPEVPEGASDEVKELAKL-SVKNVLSLVRDSFAQNLSVVGYR---NALAK- 104 (501) T ss_pred HHHHHHH-HHHHHHHHHHHHHh---cCCCchhccccCChhhhhhHhh-hhcChHHHHHHHHHhhhccccee---cCCcc- Confidence 0111100 00111111111111 11110 011110000000011 11234566777777655333332 22211 Q ss_pred eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcC-Cce-eeEE--ee Q lcl|NC_021305. 79 TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNS-RTG-RYEY--YF 154 (518) Q Consensus 79 ~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~-~~~-~~~~--~~ 154 (518) ....+..++.+ |. 
.......+..+.+.+|.+|+.+.++..|. .+..++|..+.+.+++ ... ...+ .+ T Consensus 105 ---~~~~l~~i~~~-N~---~d~~~~~~~~~a~i~G~ay~~v~~de~~~--~i~~~sp~~~~~iy~D~~~~~~~~~ai~~ 175 (501) T protein:vir:25 105 ---ENDPAWEMWQR-NR---MDARQAEVHRPALTYGASYVTVTPTDEGP--VFRTRSPRQILAVYADPSVDAWPQYALET 175 (501) T ss_pred ---chHHHHHHHHh-cC---hhHHHHHHHHHHhhcCceEEEEecCCCCC--eEEEeccccEEEEEecCCCCcceeEEEEE Confidence 12223334332 32 34555678889999999999998888874 3556788888866533 211 1111 00 Q ss_pred e--ccc-ccCceeEEe----------------------------------------------ccccEEEEeccCCCCccc Q lcl|NC_021305. 155 Q--AGA-GVGTQLVSF----------------------------------------------ADDEVVPIRFFNPDGLER 185 (518) Q Consensus 155 ~--~~~-~~~~~~~~~----------------------------------------------~~~evih~~~~~~~~~~~ 185 (518) . ... ...+....+ ..=.|+||.+.. ....+ T Consensus 176 ~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~-~~~~~ 254 (501) T protein:vir:25 176 WVAQKDAKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGR-DADDM 254 (501) T ss_pred EeeccccCcceeEEEecCeeEEEEecCceeeeeccccccccccccccccccccccccccCCccceeeEeccCcc-ccCcc Confidence 0 000 000000001 111344444322 22235 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecC-CCcceeec Q lcl|NC_021305. 186 GLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVE-EGMEPIPL 264 (518) Q Consensus 186 G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~-~g~~~~~l 264 (518) |.|-++.+...+.....+.........-.+.|.-++.- .+.++.+ .|+. 
..+++++++ ++.++.++ T Consensus 255 g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G---~~~~~~~----~~~~------~~~~i~~~~~~~~~~~q~ 321 (501) T protein:vir:25 255 IVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVISG---WTGSKAE----VLKA------SALRVWTFEDPEVKAQAF 321 (501) T ss_pred ccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHhC---CCCCccc----hhhh------cccceeccCCCCceEEEe Confidence 77766655544444444333333333333334332211 1111111 1211 224566665 46666665 Q ss_pred cCChhhHH-HHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHH----HHHHHHHh--hhhh- Q lcl|NC_021305. 265 QLTAVEMQ-FIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIAR----IQSAMDKY--VGQY- 336 (518) Q Consensus 265 ~~~~~d~~-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~----ie~~l~~~--l~~~- 336 (518) ... +++ |.+.++....+|+..-++|++.+|.... | .+.+ ...+....+.-.+.. +...+.+. |+.. T Consensus 322 ~~~--~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~-N-~Sg~--Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~ 395 (501) T protein:vir:25 322 PPA--SVEPYNLILEEMLQHVAMVAQISPAQVTGKMI-N-VSAE--ALAAAEANQQRKLAAKRESFGESWEQLLRLAAEM 395 (501) T ss_pred ccc--ChHHHHHHHHHHHHHHHhhcCCChhhhccccC-C-hHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 432 333 7888999999999999999998875322 1 1222 112222222222222 22222211 1111 Q ss_pred hc-----ccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHH-HhCCCCCCCCCcceeeecccccc-cccccccC Q lcl|NC_021305. 337 WV-----RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGRE-IMGLPRSDDPKADELYANSALQP-LGATPDGA 409 (518) Q Consensus 337 ~~-----~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~-~~g~~p~~~~~gD~~~~~~n~~~-~~~~~~~~ 409 (518) .+ ....+++.|.+....+..+.++++.++++.|+ +.-.+.. 
+.|+.+-+-.............+ ++....++ T Consensus 396 ~~~~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gi-s~et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~~~~~~ 474 (501) T protein:vir:25 396 DDDPDTAADSGAEVLWRDTEARSFGAVVDGITKLASAGI-PIEHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDKLLSNE 474 (501) T ss_pred hCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCC-CHHHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHHhhccC Confidence 11 11345677788889999999999999999885 5544443 45775421000000000000000 01110100 Q ss_pred CCCCCCCCCCCCccCCCCCCCccccCCcccc Q lcl|NC_021305. 410 VEWEEAPAPKRPASTPVASLDQSPPTSVPGL 440 (518) Q Consensus 410 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (518) . .+..+.+.+....+ .+++.+....+. T Consensus 475 ~-~~~~~~~~~~~~~~---~~~~~~~~~~g~ 501 (501) T protein:vir:25 475 P-APVPPPPPQAAAQA---LNEGGVNGNGGA 501 (501) T ss_pred c-CCCCCCCCCCCccc---cccccCCCCCCC Confidence 0 00000010000000 011111111111 No 171 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=98.63 E-value=1.6e-07 Score=57.77 Aligned_cols=410 Identities=9% Similarity=0.012 Sum_probs=176.5 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccc---cccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGM---QLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDT 77 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~ 77 (518) ++-.. .....|....+..+.. |..+.... ...... .... 
...+.....|+..+.-+-.-|+++--.+ T Consensus 48 ~i~~~-~~~~~~r~~~l~~Yy~---g~~~il~~~~~~~~~~~-~~~k--i~~n~~k~Iv~~~~~yl~g~p~~~~~~d--- 117 (511) T protein:vir:93 48 YIEHH-MDYQRPRLKVLSDYYE---GKTKNLVELTRRKEEYM-ADNR--VAHDYASYISDFINGYFLGNPIQYQDDD--- 117 (511) T ss_pred HHHHH-HHhhHHHHHHHHHHhc---ccCccccccCcCccccc-Ccce--eecchHHHHHHHHhhhhcccCeeeccCC--- Confidence 00000 0000000000111100 00000000 000000 0000 1234556778888887777787762111 Q ss_pred ceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCc--eeeEE-ee Q lcl|NC_021305. 78 ETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT--GRYEY-YF 154 (518) Q Consensus 78 ~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~--~~~~~-~~ 154 (518) +.....+..++. .-........+..+++.+|.+|.++.++.+|.+ .+..++|..+.++.+... ..... .+ T Consensus 118 --~~~~~~l~~~~~----~n~~~~~~~~~~~~~~~~G~ay~~vy~de~~~~-~i~~~~p~~~~~vydd~~~~~~~~~vr~ 190 (511) T protein:vir:93 118 --KDVLEVIEAFND----LNDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRY 190 (511) T ss_pred --hHHHHHHHHHHh----hcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCCCCCceEEEEEE Confidence 111122233332 235566777888999999999999999888864 577889999988876542 21111 11 Q ss_pred eccc-cc--C-c---eeEEeccccEEEEeccCC-------------------------CCcccCchHHHHHHHHHHHHHH Q lcl|NC_021305. 155 QAGA-GV--G-T---QLVSFADDEVVPIRFFNP-------------------------DGLERGLSLMESLKSTIFSEDS 202 (518) Q Consensus 155 ~~~~-~~--~-~---~~~~~~~~evih~~~~~~-------------------------~~~~~G~s~l~~~~~~i~~~~~ 202 (518) +... .. . . ....+.++.+.+++.... .....|.|-++.+...+..... 
T Consensus 191 ~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~gd~e~v~~liDa~d~ 270 (511) T protein:vir:93 191 LRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDN 270 (511) T ss_pred EEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccCCCccceEEecCCCCCCCchhhHHHHHHHHHH Confidence 1100 00 0 0 011234444444321110 0112577888888777777776 Q ss_pred HHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccc-cCCeeecCCCcceeeccCChhhHHHHHHHHHHH Q lcl|NC_021305. 203 SRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSN-TGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNR 281 (518) Q Consensus 203 ~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n-~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~ 281 (518) +..-..+.+...+.|-.+++-....+.++....++...-....... .+...-.+.+.++..+........+....+.+. T Consensus 271 ~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~ 350 (511) T protein:vir:93 271 AESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLN 350 (511) T ss_pred HHHHHHHHHHHhhCcceeeecCcccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHH Confidence 6655555666666666555433333444333322211000000000 001111344555555655555566677888889 Q ss_pred HHHHHHhcCCHHHhccccccccCCHHHH-------------HHHHHHHHhhHHHHHHHHHHHHhhhhhhc-ccccceecc Q lcl|NC_021305. 282 EEVCGVYDIAPPIVHILDRATFSNISAQ-------------MRAFYRDTMAIPIARIQSAMDKYVGQYWV-RKNRMKFDI 347 (518) Q Consensus 282 ~~Ia~~fgVPp~~lg~~~~~~~sn~e~~-------------~~~~~~~~l~P~~~~ie~~l~~~l~~~~~-~~~~~~fd~ 347 (518) +.|...-++|..-.+... +|- +..+. ....+..++.-.+..|...+....-.... 
....+++.+ T Consensus 351 ~~I~~~s~~P~~~~~~~~-~n~-Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~~~f 428 (511) T protein:vir:93 351 SDIHMFTNTPNMKDDNFS-GTQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVY 428 (511) T ss_pred HHHHHHhCCccccccccc-ccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccceEEe Confidence 999999999864332211 121 11111 11122222333332222221111100000 112356677 Q ss_pred hhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCcc-CCC Q lcl|NC_021305. 348 DDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPAS-TPV 426 (518) Q Consensus 348 ~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 426 (518) ..-+..|..+.++.+.++ .|+++..-+++++++- +++. .++ ..+.................++.. .+. T Consensus 429 ~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~l~~v--~d~~-~E~------~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 497 (511) T protein:vir:93 429 NRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFF--QDPE-LEV------KKIEEDEKESIKKAQKGIYKDPRDINDD 497 (511) T ss_pred CCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCC--CCHH-HHH------HHHHHHHHHHHHHHhhhcccCCCCCCCC Confidence 788889999999999888 5889988888887543 2221 111 111100000000000000000000 000 Q ss_pred CCCCccccCCccccccchhcchhhHHH Q lcl|NC_021305. 427 ASLDQSPPTSVPGLSPTNSDRSTDSGK 453 (518) Q Consensus 427 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (518) .+++++...+..+ + T Consensus 498 ~~~~~~~~~~~~~-------------~ 511 (511) T protein:vir:93 498 EQDDDTKDTVDKK-------------E 511 (511) T ss_pred CCCCccccccccc-------------C Confidence 0000000000000 0 No 172 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=98.62 E-value=1.8e-07 Score=57.55 Aligned_cols=412 Identities=10% Similarity=0.026 Sum_probs=177.1 Q ss_pred CcCCCC---CCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCc Q lcl|NC_021305. 
1 MLLANG---QTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDT 77 (518) Q Consensus 1 ~~f~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~ 77 (518) +|-..- .....+..+.+..+.. |-......+...........-...+....+|+..+.-+-.-|+.+.-.+.+. T Consensus 43 ~l~~~i~~~~~~~~~r~~~l~~yY~---g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~ 119 (501) T protein:vir:27 43 LLKNFINHHKLRQAPRIQELLDYAR---GENHDVLQFGRRKDREMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNDN 119 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc---CCCccccccCccCccccccceeccchHHHHHHHHhhhhcccCeeEecCCccc Confidence 000000 0000111111111100 0000000000000000001112345667888888888888888774333221 Q ss_pred ceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCc-e-eeEE-ee Q lcl|NC_021305. 78 ETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT-G-RYEY-YF 154 (518) Q Consensus 78 ~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~-~-~~~~-~~ 154 (518) ... ....+..++. .-+.......+..+++.+|.+|.++-++.+|.+ .+..++|..+.++++... . .... .+ T Consensus 120 ~~~-~~~~l~~~~~----~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~ 193 (501) T protein:vir:27 120 NSQ-NDDTIKRIGR----INDIDSHNRTLIRDLSQTGRAYEVIYRNEYDET-RIKRLNPLETFVIYDNSLEDNSIAAVRY 193 (501) T ss_pred hHH-HHHHHHHHHH----hcChhHHHHHHHHHHhhCCeEEEEEEeCCCCce-EEEEEccceeEEEecCCCCCceEEEEEE Confidence 111 1122222322 235567788889999999999999999888864 567789999888776532 1 1111 11 Q ss_pred ecc-cccC-c-eeEEeccccEEEEecc----------CC---------CCcccCchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 155 QAG-AGVG-T-QLVSFADDEVVPIRFF----------NP---------DGLERGLSLMESLKSTIFSEDSSRNATAAMWK 212 (518) Q Consensus 155 ~~~-~~~~-~-~~~~~~~~evih~~~~----------~~---------~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ 212 (518) +.. ...+ . ....+..+.+.++... ++ .....|.|.+..+...+.....+..-..+.+. 
T Consensus 194 ~~~~~~~~~~~~~~vyt~~~v~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~ 273 (501) T protein:vir:27 194 YNRGTLQNAKDVVEIYTNEHIYTLDASDDFNEISVTTHAFGTVPITEFLNNVDGIGDYETELYLIDLYDSAESDTANHMS 273 (501) T ss_pred EEeeecCCcEEEEEEEeCCeEEEEEeCCceeeccccccCCCcccEEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHH Confidence 110 0001 1 1111222222222110 00 01135788888877777777766666666666 Q ss_pred ccCCcccccccCcc-CCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021305. 213 NAGRPNLVLRHEKR-LSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIA 291 (518) Q Consensus 213 ng~~p~~il~~~~~-~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVP 291 (518) ....|-.+++-... -.++....++.. ........+.....+.+.++..+..+..+..+....+.+.+.|+..-++| T Consensus 274 ~~~~~~~v~~g~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p 350 (501) T protein:vir:27 274 DMADAILAIYGDLALPKGMQASDMKRT---RLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIP 350 (501) T ss_pred HhcCceeeeecCccCCcccchhhhhhc---CceeecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCc Confidence 55556555543221 122222222211 00001111111223445555556555555566777888889999999998 Q ss_pred HHHhccccccccCCHHHHH-------------HHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHH Q lcl|NC_021305. 292 PPIVHILDRATFSNISAQM-------------RAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAK 358 (518) Q Consensus 292 p~~lg~~~~~~~sn~e~~~-------------~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~ 358 (518) ..-.+.. .+| .+..+.. ...+...+.-.+..+...++..--........+++.+...+..+..+. 
T Consensus 351 ~~~~~~~-~~n-~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~e~ 428 (501) T protein:vir:27 351 DMSDTNF-SGN-TSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQ 428 (501) T ss_pred ccCcccc-ccC-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccceEEeCCCCCcCHHHH Confidence 6443221 112 1222111 112222222222222222211110000111235677778888999999 Q ss_pred HHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccccccccccc-----ccCCCCCCCCCCCCCccCCCCCCCccc Q lcl|NC_021305. 359 SESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATP-----DGAVEWEEAPAPKRPASTPVASLDQSP 433 (518) Q Consensus 359 ~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (518) ++.+.++ .|+++..-+++++++- +++.. + +..+.... .+... .-....++..+.+.+..+++. T Consensus 429 ad~~~kl--~g~iS~et~l~~l~~v--~D~~~-E------~eri~~E~~e~~~~~~~~-~~~~~~~~~~d~~~~~~~d~~ 496 (501) T protein:vir:27 429 VSILTGL--GGQVSQETALSLSGLV--ESPNE-E------LDKINKEVSEIDFKGYSN-DFNEHVGKYTDEVKETHTDDF 496 (501) T ss_pred HHHHHHH--hccCcHHHHHHhCCCC--CCHHH-H------HHHHHHHHHhhhHhhhcC-ccccccccccCCCCCCccccc Confidence 9999887 5889988888877542 22111 1 01110000 00000 000000000000100000000 Q ss_pred cCCccccc Q lcl|NC_021305. 434 PTSVPGLS 441 (518) Q Consensus 434 ~~~~~~~~ 441 (518) .++ .| T Consensus 497 e~~---~~ 501 (501) T protein:vir:27 497 ERA---YE 501 (501) T ss_pred ccc---CC Confidence 000 00 No 173 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=98.60 E-value=2.1e-07 Score=57.14 Aligned_cols=362 Identities=11% Similarity=0.081 Sum_probs=160.4 Q ss_pred CcCCCCCC---CCcccccccchhhhhhhcccccccccccccchhhhHHH-hhcHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_021305. 
1 MLLANGQT---LSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIY-KNQPWVRTVIAKRAQALARLPVKCMFTSGD 76 (518) Q Consensus 1 ~~f~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~ 76 (518) |+..+... ...+.-..+..+.. |.......+.. -.......+ ....+...+|+.++..+.-..|.+ T Consensus 4 ~~i~~L~~~~~~~~~r~~~~~~yy~---g~~~~~~~~~~-~p~~~~~~~~~v~nw~~~~Vd~~a~rl~~~Gf~~------ 73 (422) T protein:vir:97 4 MGMGYLRRKLALFKTGVDKRYRYYA---MDDRDDTRSIV-MPNNVREMYRSVLEWTAKGVDSLADRIIFREFTN------ 73 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh---cCCChhhcCcc-ccHHHHHHHHhhcchhHHHHHHHHhccccceeeC------ Confidence 11111100 00000000011100 00000000000 001111111 112344666777666443333321 Q ss_pred cceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcC-CCceEEEEeeCCceeEEEEcCCceeeEEeee Q lcl|NC_021305. 77 TETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK-SGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQ 155 (518) Q Consensus 77 ~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~-~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~ 155 (518) .+..++.++.+ |. .......+..+.+.+|.+|+.+.++. .|.+ .+.+++|..+.+.++.........+. T Consensus 74 -----~d~~l~~~w~~-N~---ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p-~i~~~sp~~~~~i~D~~~~~~~~a~~ 143 (422) T protein:vir:97 74 -----DDFNAWEIFKA-NN---PDIFFDTAIQSALIASCCFVYIMPGAEDGLP-KMQVIEASKATGILDPTTFLLTEGYA 143 (422) T ss_pred -----CchhHHHHHHh-cC---hHHHHHHHHHHHHHhcceeEEEeeCCCCCee-EEEEechhhEEEEEeCCCCcceeeEE Confidence 11123344443 33 34455577889999999999998875 5654 68889999999888765443322111 Q ss_pred --cccccCce--eEEecc---------------------ccEEEEeccCCCCcccCchHH----HHHHHHHHHHHHHHHH Q lcl|NC_021305. 156 --AGAGVGTQ--LVSFAD---------------------DEVVPIRFFNPDGLERGLSLM----ESLKSTIFSEDSSRNA 206 (518) Q Consensus 156 --~~~~~~~~--~~~~~~---------------------~evih~~~~~~~~~~~G~s~l----~~~~~~i~~~~~~~~~ 206 (518) .....+.. ...+.. -.|++|.+....+..+|.|.+ ..+.+.+.....-... 
T Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~ 223 (422) T protein:vir:97 144 ILESDSNGNPTLEAYFTDKDIWYYPKKGKPYNIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTLERAEV 223 (422) T ss_pred EEEecCCCcEEEEEEEcCceEEEEcCCCccccccCCCCCcceEEecccCCCccccCccccchhHHHHHHHHHHHHHHHHH Confidence 11111111 111111 134555544333445787754 3333443333333333 Q ss_pred HHHHHHccCCccc-ccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCC-----CcceeeccCChhhHHHHHHHHHH Q lcl|NC_021305. 207 TAAMWKNAGRPNL-VLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEE-----GMEPIPLQLTAVEMQFIEARQLN 280 (518) Q Consensus 207 ~~~~~~ng~~p~~-il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~-----g~~~~~l~~~~~d~~~~e~~~~~ 280 (518) ...|+. .|.- ++-.+.... ..+ .|+.. .++++.++. +.++.++..+.-+ .|.+.++.. T Consensus 224 ~~e~~a---~pqr~i~G~d~d~~--~~~----~~~~~------~~~i~~~~~de~~~~~~v~q~~~~~l~-~~~~~l~~~ 287 (422) T protein:vir:97 224 TAEFYS---FPQKYVLGMDPDAK--PME----KWRAT------VSTLLEISKDEDGDKPTVGQFTTASMA-PFMEHLKMY 287 (422) T ss_pred HHHHhc---chhhhhcccCcccc--cCc----hhhhh------hhhhhccCCCCCCCcceeeecCCCChh-HHHHHHHHH Confidence 334433 3333 332221111 111 23222 124455542 3456555443322 388999999 Q ss_pred HHHHHHHhcCCHHHhccccccccCCHHH---HHHHHHHHH---hhHHHHHHHHHHHHhhh--hhhc--c--cccceecch Q lcl|NC_021305. 281 REEVCGVYDIAPPIVHILDRATFSNISA---QMRAFYRDT---MAIPIARIQSAMDKYVG--QYWV--R--KNRMKFDID 348 (518) Q Consensus 281 ~~~Ia~~fgVPp~~lg~~~~~~~sn~e~---~~~~~~~~~---l~P~~~~ie~~l~~~l~--~~~~--~--~~~~~fd~~ 348 (518) ...|++.-++|++.+|.... |.++.+. +...+...+ -+-+-..+++.+...+. .... . ...+++.|. 
T Consensus 288 ~~~~a~~s~lP~~~lg~~~~-NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~~~w~ 366 (422) T protein:vir:97 288 ASLFAGGSGLTLDDLGFPSD-NPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPYLRNQFMDTVIKWE 366 (422) T ss_pred HHHHhcccCCCHHHhccccC-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccchhhccceEEEc Confidence 99999999999999987553 2233222 111111111 11122222222211111 1001 0 112345555 Q ss_pred hhhhcC---HHHHHHHHHHHHhC--CCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCC Q lcl|NC_021305. 349 DVIQPD---WEAKSESTQKMVNS--GVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEW 412 (518) Q Consensus 349 ~l~~~d---~~~~~~~~~~~~~~--G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~ 412 (518) .....+ ....++++.+++++ |+++.+-+++++|+...+.+ .. .++.. ...+ T Consensus 367 p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~~~~~~~-~~---------~~~~~---~~d~ 422 (422) T protein:vir:97 367 PLFEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLTGVKGADKP-IP---------AITEV---TTDG 422 (422) T ss_pred cCCCCChHHHHHHHHHHHHHHhhccccccHHHHHHHcCCCchhHH-HH---------HHHhh---hccC Confidence 555556 45556777888888 78888889999999643211 00 01100 0000 No 174 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=98.58 E-value=2.5e-07 Score=56.81 Aligned_cols=381 Identities=8% Similarity=-0.031 Sum_probs=170.8 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) +-+-.|+. ....++.- .+ ......... ...-...+....+|+..+.-+-.-|+.+--.++ . 
T Consensus 58 ~~YY~g~~-~i~~~~~~-~~-------~~~~~~~~~------~~~ki~~n~~k~Ivd~~~~~l~G~p~~~~~~d~----~ 118 (483) T protein:vir:12 58 QEYYEQRP-DIVKEPKP-VD-------ATGAVDPLK------PDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD----E 118 (483) T ss_pred HHHhcccc-cccccccc-cc-------ccccccccc------cccccccchHHHHHHHHhhhhcccCceeccCCh----H Confidence 01111110 00000000 00 000000000 000122456677888888888777777622111 1 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCc-e-eeEEeeeccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT-G-RYEYYFQAGA 158 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~-~-~~~~~~~~~~ 158 (518) ....+..++. | +.......+..+.+.+|.+|..+-.+.+|.+ .+..++|..+.+.++... . .......+.. T Consensus 119 -~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~ 191 (483) T protein:vir:12 119 -VVKRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKL 191 (483) T ss_pred -HHHHHHHHHh--c---cHHHHHHHHHHHHhhCCeEEEEEEEcCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEe Confidence 1112223332 2 2344556678899999999999999888875 577899999988876431 1 1111111111 Q ss_pred ccCceeEEeccccEEEEecc---------------------CC---------CCcccCchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 159 GVGTQLVSFADDEVVPIRFF---------------------NP---------DGLERGLSLMESLKSTIFSEDSSRNATA 208 (518) Q Consensus 159 ~~~~~~~~~~~~evih~~~~---------------------~~---------~~~~~G~s~l~~~~~~i~~~~~~~~~~~ 208 (518) ........+.+..+.|+... ++ .....|.|-+..+...+.....+..-.. 
T Consensus 192 ~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~ 271 (483) T protein:vir:12 192 ENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLS 271 (483) T ss_pred ecceEEEEEecCeEEEEEEeCCeeeecccccccccccccccCCCCccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHH Confidence 11111111222222222100 00 0112477778777777766665555555 Q ss_pred HHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHh Q lcl|NC_021305. 209 AMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVY 288 (518) Q Consensus 209 ~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~f 288 (518) +.+...+.|..+++-- +.+....+...+. .++++.++++.+...+..+.....+....+.+.+.|+..- T Consensus 272 ~~~~~~~~~~lv~~g~---~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s 340 (483) T protein:vir:12 272 NTFKDSNELTYVLTNY---DDQELPEFKRLLR--------YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFG 340 (483) T ss_pred HHHHHhcCceeeeecC---CcccchhHHHhhh--------hccccccCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHh Confidence 5555566665555422 2222222222211 1234445555555555555556667788888888899988 Q ss_pred cCCHHHhccccccccCCHHHH-------------HHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCH Q lcl|NC_021305. 289 DIAPPIVHILDRATFSNISAQ-------------MRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDW 355 (518) Q Consensus 289 gVPp~~lg~~~~~~~sn~e~~-------------~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~ 355 (518) ++|..-.+.. +++ .+..+. ....+...++-.++.+...+ ........+++.+..-+..|. 
T Consensus 341 ~~p~~~~~~~-~~n-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~-----~~~~~~~~i~v~f~~~~p~~~ 413 (483) T protein:vir:12 341 QAVDFSSDKF-GSA-PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF-----DIKGEHKDVDISFNYNKVANT 413 (483) T ss_pred CCCCCCcccc-ccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----cCCCccceeeEEeCCCCCCCH Confidence 8885432211 111 112111 11122222222222222211 111122345666778888999 Q ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccC Q lcl|NC_021305. 356 EAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPT 435 (518) Q Consensus 356 ~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (518) .+.++.+.++ .|++|..-++++++.-.-+....+. +........ +..+.. ..+..+...+++.++. T Consensus 414 ~~~a~~~~kl--~GiiS~et~~~~~~~v~d~~~E~~r---------i~~E~~~~~--~~~~~~-~~~~~d~~~~~~~~~~ 479 (483) T protein:vir:12 414 ELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELER---------IEQEQMEYN--KQLPNL-DDGGADGAQQQERSNN 479 (483) T ss_pred HHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHH---------HHHHHHHHH--hhcccc-cccccCCcccCCCCCc Confidence 9999999988 5899998888888653211111111 110000000 000000 0011111111111111 Q ss_pred Cccc Q lcl|NC_021305. 436 SVPG 439 (518) Q Consensus 436 ~~~~ 439 (518) .+.+ T Consensus 480 ~e~e 483 (483) T protein:vir:12 480 KESE 483 (483) T ss_pred ccCC Confidence 1111 No 175 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=98.57 E-value=2.6e-07 Score=56.66 Aligned_cols=393 Identities=12% Similarity=0.031 Sum_probs=174.8 Q ss_pred CcCCCCCCCCcccccccchhhhhhhccccccccc-ccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQ-LERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) ||=.... ...|....+..+.. 
|..+..-.+ ....... ...-...+.....|+..+.-+-+-|+++--.+ .+.. T Consensus 1 ~~~~~~~-~~~~r~~~l~~yy~---g~~~~~~~~~~~~~~~~-~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~-~~~~ 74 (440) T protein:vir:95 1 MLAAFLG-SQKQRLAILASYAQ---GDNFSILSGHRRLDDEK-ADYRVRHKWGGYISSFATGYVIGNPVSIGVME-GGSA 74 (440) T ss_pred ChhhHHH-HHHHHHHHHHHHhc---cCCcccccccccccccC-CcceeecchHHHHHHhhhhheeccCceEeeCC-CccH Confidence 3222211 11111111111110 100000000 0000000 00112345567778888877777777653222 1111 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCce-eeEEeee-cc Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTG-RYEYYFQ-AG 157 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~-~~~~~~~-~~ 157 (518) +.. ..+..++.+ -........+..+.+.+|.+|..+..+.+|.+ .+..++|..+.+..+.... ...+.+. .. T Consensus 75 ~~~-~~l~~~~~~----n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~~~~~i~~~~ 148 (440) T protein:vir:95 75 DQL-STIKDIEWQ----NDINALNSDLAFDASVYGRAYEYHFRDKDKVD-RVVLISPLEMFVIRDLTVEQNIIAAVHLPI 148 (440) T ss_pred HHH-HHHHHHHHh----cCHhHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEE Confidence 111 112222222 24455566778899999999999999888875 4667899999888876532 1111111 11 Q ss_pred cccCceeEEeccccE----------------------------EEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 158 AGVGTQLVSFADDEV----------------------------VPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAA 209 (518) Q Consensus 158 ~~~~~~~~~~~~~ev----------------------------ih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~ 209 (518) .........+..+.+ ++|+. 
...|.|-++.+...+.....+.....+ T Consensus 149 ~~~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~lida~~~~~s~~~~ 223 (440) T protein:vir:95 149 YADKVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVEWWN-----NRFRMGDYESEISLIDAYDAGQSDTAN 223 (440) T ss_pred ecCceEEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEEEeeC-----CCCCCCchhhhHHHHHHHHHHHHHHHH Confidence 111111112233333 33332 124777777777766666665555555 Q ss_pred HHHccCCcccccccC---ccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHH Q lcl|NC_021305. 210 MWKNAGRPNLVLRHE---KRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCG 286 (518) Q Consensus 210 ~~~ng~~p~~il~~~---~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~ 286 (518) .....+.|-.+++-. ...+++....+++.-.-.. .........+.+.++..+..+.....+....+.+.+.|+. T Consensus 224 ~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~ 300 (440) T protein:vir:95 224 YMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLFL---KTGISTTGQQTTADASYIYKQYDVNGTEAYKNRLANDIHR 300 (440) T ss_pred HHHHhhcceeeeecccccCCCCccchhhhhhccceec---ccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHH Confidence 555566666665432 2335555555443211111 0000111123333444444444455567788888999999 Q ss_pred HhcCCHHHhccccccccCCHHH-------------HHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhc Q lcl|NC_021305. 287 VYDIAPPIVHILDRATFSNISA-------------QMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQP 353 (518) Q Consensus 287 ~fgVPp~~lg~~~~~~~sn~e~-------------~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~ 353 (518) .-++|..-.+... ++ .+..+ .....+...+...+..+...+...-.. ......+++.+..-... 
T Consensus 301 ~s~~p~~~~~~~~-~n-~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~-~~~~~~v~i~f~~~~p~ 377 (440) T protein:vir:95 301 FSRIPNLDDDRFN-ST-SSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAINGP-VIEANKLTFTFHPNIPQ 377 (440) T ss_pred HhCCccccccccc-cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCc-ccccccceEEeCCCCCC Confidence 9999864332211 11 11111 111222223333333322222111100 11123456777788899 Q ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCcc Q lcl|NC_021305. 354 DWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQS 432 (518) Q Consensus 354 d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 432 (518) |..+.++.+.++ .|+++..-+.++++.-..+ ++ +..+. ..+..... +.....+....++++++ T Consensus 378 ~~~~~ad~~~kl--~g~iS~et~~~~l~~~d~~----~E------~~ri~---~E~~~~~~-~~~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 378 DVWTEIKAYIEA--GGEISQETLMENASFTDYK----TE------HSRIL---KQGGSSDL-EIGQIVGDADVGQADTE 440 (440) T ss_pred CHHHHHHHHHHH--hccCcHHHHHHhCCCCCcH----HH------HHHHH---HHHHHhhh-hHHhhccCCCCCCcCCC Confidence 999999999988 5789987777777542111 11 11111 00000000 00000011111111111 No 176 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=98.55 E-value=3e-07 Score=56.37 Aligned_cols=389 Identities=8% Similarity=0.000 Sum_probs=170.4 Q ss_pred CcCCCCCC--CCcccccccchhhhhhhcc--cc--ccccc--ccccc--hhh-hHHHhhcHHHHHHHHHHHHhhccCceE Q lcl|NC_021305. 1 MLLANGQT--LSAPAMAELSPQMQDSYYY--AP--AVGMQ--LERQF--SLY-GGIYKNQPWVRTVIAKRAQALARLPVK 69 (518) Q Consensus 1 ~~f~~~~~--~~~~~~~~~~~~~~~~~~~--~~--~~~~~--~~~~~--~~~-~~~~~~~~~v~~~v~~ia~~ia~l~~~ 69 (518) ++...--. ......+ . +.+....-+ +. ...-. ..... ... ...-...+....+|+..+.-+-.-|+. 
T Consensus 23 ~~~~~~i~~~i~~~~~~-~-~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~~~~ 100 (472) T protein:vir:93 23 ETLEEMIVRYIKQHLEK-L-PEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA 100 (472) T ss_pred hhHHHHHHHHHHHHHHH-H-HHHHHHHHHhccccccccccchhhccccccccccccccccchHHHHHHHHhhhhcccCee Confidence 00000000 0000000 0 000000000 00 00000 00000 000 000112356778888888888777777 Q ss_pred EEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCC-ce Q lcl|NC_021305. 70 CMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSR-TG 148 (518) Q Consensus 70 v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~-~~ 148 (518) +--.++ . ....+..++. | ........+..+.+.+|.+|+.+..+.+|.+ .+..++|..+.+.++.. .. T Consensus 101 ~~~~d~----~-~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~-~i~~~~p~~~~~i~d~~~~~ 169 (472) T protein:vir:93 101 FKHTDD----E-VVKRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHE 169 (472) T ss_pred eccCCh----H-HHHHHHHHHh--c---cHHHHHHHHHHHHhhcCeEEEEEEECCCCce-EEEEEcccceEEEEcCCCCC Confidence 632111 1 1122223332 2 2445566678899999999999988888864 57778999988887642 22 Q ss_pred eeEEee-ecccccCceeEEecccc-----------------------------------EEEEeccCCCCcccCchHHHH Q lcl|NC_021305. 149 RYEYYF-QAGAGVGTQLVSFADDE-----------------------------------VVPIRFFNPDGLERGLSLMES 192 (518) Q Consensus 149 ~~~~~~-~~~~~~~~~~~~~~~~e-----------------------------------vih~~~~~~~~~~~G~s~l~~ 192 (518) ...+.+ .+.....+....+.... |++|+. ...|.|-+.. T Consensus 170 ~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----n~~g~s~~e~ 244 (472) T protein:vir:93 170 ELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKN-----NDLEISDIFM 244 (472) T ss_pred ceEEEEEEEEeecceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecC-----CCCCCCchhh Confidence 111111 00000001111111111 233322 1357888887 Q ss_pred HHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHH Q lcl|NC_021305. 
193 LKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQ 272 (518) Q Consensus 193 ~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~ 272 (518) +...+.....+..-..+.+...+.|..+++-- +.++...+...+ ...+++.++++.+...+..+..... T Consensus 245 v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~---~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~l~~~~~~~~ 313 (472) T protein:vir:93 245 YKTLIDAYNRRLSDLSNTFKDSNELTYVLTNY---DDQELPEFKRLL--------RYYGAIKVSDNGGVDTIQVEVPVEN 313 (472) T ss_pred hHHHHHHHHHHHHHHHHHHHHhcCceeEeecC---CcccchhhHHHH--------hhccccccCCCCcceeEeecCCHHH Confidence 77777766665555555566666676665432 222222222211 1223455565555555555556677 Q ss_pred HHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHH-------------HHHHHHHhhHHHHHHHHHHHHhhhhhhcc Q lcl|NC_021305. 273 FIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM-------------RAFYRDTMAIPIARIQSAMDKYVGQYWVR 339 (518) Q Consensus 273 ~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~-------------~~~~~~~l~P~~~~ie~~l~~~l~~~~~~ 339 (518) +....+.+.+.|+..-++|..-.+... ++ .+..+.. ...+...+.-.++.+...+ ..... T Consensus 314 ~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n-~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~-----~~~~~ 386 (472) T protein:vir:93 314 SKKYLDELYQKIMLFGQAVDFSSDKFG-SA-PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF-----DIKGE 386 (472) T ss_pred HHHHHHHHHHHHHHHhCCCCCCccccc-cC-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCcc Confidence 788888889999999999854332211 11 1222111 1111112222222221111 11112 Q ss_pred cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCC Q lcl|NC_021305. 340 KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPK 419 (518) Q Consensus 340 ~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~ 419 (518) ...+++.+..-+..|..+.++.+.++ .|+++..-+.+++++-.-+....+. +........ +..... 
T Consensus 387 ~~~i~v~f~~~~p~~~~~~~~~~~k~--~giis~et~l~~l~~~~d~~~E~~r---------i~~E~~~~~--~~~~~~- 452 (472) T protein:vir:93 387 HKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELER---------IEQEQMEYN--KQLPNL- 452 (472) T ss_pred cceeeEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHH---------HHHHHHHHH--HhccCc- Confidence 23455666777888999999999887 5889988888887653211111111 100000000 000000 Q ss_pred CCccCCCCCCCccccCCccc Q lcl|NC_021305. 420 RPASTPVASLDQSPPTSVPG 439 (518) Q Consensus 420 ~~~~~~~~~~~~~~~~~~~~ 439 (518) ..+.++..++++..++.+.+ T Consensus 453 ~~~~~d~~~~~~~~~~~~~e 472 (472) T protein:vir:93 453 DDGGADGAQQQERSNNKESE 472 (472) T ss_pred CcccCCCCCCCCCCCcccCC Confidence 00011111111111111111 No 177 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=98.52 E-value=3.7e-07 Score=55.85 Aligned_cols=413 Identities=9% Similarity=0.025 Sum_probs=177.3 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) ++-.. .....|....+..+.. |..+................-...+.....|+..+.-+-+-|+.+--.++ T Consensus 48 ~i~~~-~~~~~~r~~~l~~YY~---g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~----- 118 (512) T protein:vir:97 48 YIEHH-MDYQRPRLKVLSDYYE---GKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQCQDDDK----- 118 (512) T ss_pred HHHHH-HHhhHHHHHHHHHHhc---ccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeccCCh----- Confidence 00000 0000011111111110 00000000000000000000012345567788888877778877632111 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCc--eeeE-Eeeec- Q lcl|NC_021305. 
81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT--GRYE-YYFQA- 156 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~--~~~~-~~~~~- 156 (518) .....+..++.. -........+..+++.+|.+|.++.++.+|.+ .+..++|..+.++++... .... .+++. T Consensus 119 ~~~~~l~~~~~~----n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~-~i~~~~p~~~~~iyd~~~~~~~~~~vr~~~~ 193 (512) T protein:vir:97 119 DVLEAIEAFNDL----NDVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRT 193 (512) T ss_pred HHHHHHHHHHhh----cCHHHHHHHHHHHHHhcCeEEEEEEeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEe Confidence 111223333332 34556667788899999999999999888864 577899999988887543 1111 11111 Q ss_pred ccccC-----ce-eEEeccccEEEEeccC----------------C---------CCcccCchHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 157 GAGVG-----TQ-LVSFADDEVVPIRFFN----------------P---------DGLERGLSLMESLKSTIFSEDSSRN 205 (518) Q Consensus 157 ~~~~~-----~~-~~~~~~~evih~~~~~----------------~---------~~~~~G~s~l~~~~~~i~~~~~~~~ 205 (518) ....+ .. ...+.++.+.+++... + .....|.|-+..+...+.....+.. T Consensus 194 ~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~gd~e~v~~liDa~d~~~S 273 (512) T protein:vir:97 194 KPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAES 273 (512) T ss_pred eeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcccceEeecCCCCCCCchhhhHHHHHHHHHHHH Confidence 00000 01 1123444444443110 0 0113578888888888877776665 Q ss_pred HHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCC--eeecCCCcceeeccCChhhHHHHHHHHHHHHH Q lcl|NC_021305. 206 ATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGK--TMVVEEGMEPIPLQLTAVEMQFIEARQLNREE 283 (518) Q Consensus 206 ~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~--~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~ 283 (518) -..+.+...+.|-.+++-....+++.....+....-........+. ..-.++|.++..+........+....+.+.+. 
T Consensus 274 ~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~ 353 (512) T protein:vir:97 274 DTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSD 353 (512) T ss_pred HHHHHHHHhcCceeeeecCccCCchhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHH Confidence 5555666666666665543334444444333322111111111111 11124556665565555555567778888899 Q ss_pred HHHHhcCCHHHhccccccccCCHHHHH-------------HHHHHHHhhHHHHHHHHHHHHhhhhhhc-ccccceecchh Q lcl|NC_021305. 284 VCGVYDIAPPIVHILDRATFSNISAQM-------------RAFYRDTMAIPIARIQSAMDKYVGQYWV-RKNRMKFDIDD 349 (518) Q Consensus 284 Ia~~fgVPp~~lg~~~~~~~sn~e~~~-------------~~~~~~~l~P~~~~ie~~l~~~l~~~~~-~~~~~~fd~~~ 349 (518) |...-++|..-.+... +| .+..+.. ...+..++.-.+..|...+...--.... ....+++.+.. T Consensus 354 I~~~s~~p~~~~~~~~-gn-~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~ 431 (512) T protein:vir:97 354 IHMFTNTPNMKDDNFS-GT-QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNR 431 (512) T ss_pred HHHHhCCcccCccccc-cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCC Confidence 9998888865433221 12 1222111 1111112222222221111110000000 11135566677 Q ss_pred hhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCcc-CCCCC Q lcl|NC_021305. 350 VIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPAS-TPVAS 428 (518) Q Consensus 350 l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 428 (518) -+..|..+.++.+.++ .|++|..-+++++++- +++. .++ ..+................+++.. 
.+..+ T Consensus 432 ~~p~~~~e~~~~~~kl--~giiS~et~~~~l~~v--~d~~-~E~------eri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 500 (512) T protein:vir:97 432 NLPKSLIEELKAYIDS--GGKISQTTLMSLFSFF--QDPE-LEV------KKIEEDEKESIKKAQKGIYKDPRDINDDEQ 500 (512) T ss_pred CCCcCHHHHHHHHHHH--hccCchHHHHHhCCCC--CCHH-HHH------HHHHHHHHHHHHHHhhcccCCCCCCCCCCC Confidence 7888999999998888 4889988888887643 2221 111 111100000000000000000000 00000 Q ss_pred CCccccCCccccccchhcchhhHHH Q lcl|NC_021305. 429 LDQSPPTSVPGLSPTNSDRSTDSGK 453 (518) Q Consensus 429 ~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (518) ++++.+.+..+ + T Consensus 501 ~~~~~~~~~~~-------------~ 512 (512) T protein:vir:97 501 DDDTKDTVDKK-------------E 512 (512) T ss_pred CCCcccccccc-------------C Confidence 00000000000 0 No 178 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=98.52 E-value=3.9e-07 Score=55.73 Aligned_cols=413 Identities=9% Similarity=0.005 Sum_probs=178.0 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) ++... .....+....+..+.. |..+..-......-......-...+.....|+..+.-+-+-|+.+--.++ T Consensus 48 ~i~~~-~~~~~~r~~~l~~Yy~---g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~----- 118 (511) T protein:vir:10 48 CIEHH-MDYQRPRLKVLSDYYE---GKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDK----- 118 (511) T ss_pred HHHHH-HHhhHHHHHHHHHHhc---ccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeecCch----- Confidence 11100 0000011111111110 00000000000000000000112345567778888777778887632111 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCce--eeE-Eeeecc Q lcl|NC_021305. 
81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTG--RYE-YYFQAG 157 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~--~~~-~~~~~~ 157 (518) .....+..++.. -+.......+..+++.+|.+|.++.++.+|.+ .+.+++|..+.++.+.... ... .+++.. T Consensus 119 ~~~~~l~~~~~~----n~~~~~~~~~~~~~~i~G~ay~~vy~dedg~~-~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~ 193 (511) T protein:vir:10 119 DVLEAIEAFNDL----NDVESHNRSLGLDLSIYGKAYEIMIRNQDDET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRT 193 (511) T ss_pred HHHHHHHHHHhh----cCHHHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEe Confidence 111223333333 24455667788899999999999999888864 5777899998888765431 111 111110 Q ss_pred -ccc--C---ce-eEEeccccEEEEeccCC-------------------------CCcccCchHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 158 -AGV--G---TQ-LVSFADDEVVPIRFFNP-------------------------DGLERGLSLMESLKSTIFSEDSSRN 205 (518) Q Consensus 158 -~~~--~---~~-~~~~~~~evih~~~~~~-------------------------~~~~~G~s~l~~~~~~i~~~~~~~~ 205 (518) ... . .. ...+.++.+.++..... .....|.|-++.+...+.....+.. T Consensus 194 ~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~nn~~g~gd~e~v~~liDa~d~~~S 273 (511) T protein:vir:10 194 KPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAES 273 (511) T ss_pred eecccCccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcceeEEEecCCCCCCCchhhhHHHHHHHHHHHH Confidence 000 0 00 11233444444321100 0012578888877777776666655 Q ss_pred HHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCcccc-CCeeecCCCcceeeccCChhhHHHHHHHHHHHHHH Q lcl|NC_021305. 206 ATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNT-GKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEV 284 (518) Q Consensus 206 ~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~-g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~I 284 (518) -..+.+...+.|-.+++-....++++....++...-........ 
+...-.+.+.++..+........+....+.+.+.| T Consensus 274 ~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I 353 (511) T protein:vir:10 274 DTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDI 353 (511) T ss_pred HHHHHHHHhhCceeeeeccccCCchhhccchhccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHH Confidence 55555666666666654433344444433332211111000000 01111244556655665556666778888888999 Q ss_pred HHHhcCCHHHhccccccccCCHHH-------------HHHHHHHHHhhHHHHHHHHHHHHhhhhhh-cccccceecchhh Q lcl|NC_021305. 285 CGVYDIAPPIVHILDRATFSNISA-------------QMRAFYRDTMAIPIARIQSAMDKYVGQYW-VRKNRMKFDIDDV 350 (518) Q Consensus 285 a~~fgVPp~~lg~~~~~~~sn~e~-------------~~~~~~~~~l~P~~~~ie~~l~~~l~~~~-~~~~~~~fd~~~l 350 (518) +..-++|..-.+... +|- +..+ .....+..++.-.++.|...+........ .....+++.+.+- T Consensus 354 ~~~s~~P~~~~~~~~-~n~-Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~i~f~~~ 431 (511) T protein:vir:10 354 HMFTNTPNMKDDNFS-GTQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNTVRYVYNRN 431 (511) T ss_pred HHHhCCccccccccc-ccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccccceeeEEeCCC Confidence 998899864332211 121 1111 11112222222222222222221110000 1112456777888 Q ss_pred hhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCc-cCCCCCC Q lcl|NC_021305. 351 IQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPA-STPVASL 429 (518) Q Consensus 351 ~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 429 (518) +..|..+.++.+.++. |+++..-+.+++++- +++. +++ ..+.................++. 
..+..++ T Consensus 432 ~p~d~~~~~~~~~kl~--G~iS~et~~~~l~~v--~d~~-~E~------~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (511) T protein:vir:10 432 LPKSLIEELKAYIDSG--GKISQTTLMSLFSFF--QDPE-LEV------KKIEEDEKESIKKAQKGIYKDPRDINDDEQD 500 (511) T ss_pred CCcCHHHHHHHHHHHh--ccCcHHHHHHhCCCC--CCHH-HHH------HHHHHHHHHHHHHHhhhcccCCCCCCCCCCC Confidence 8999999999999984 889988888887542 3221 111 11110000000000000000000 0000000 Q ss_pred CccccCCccccccchhcchhhHHH Q lcl|NC_021305. 430 DQSPPTSVPGLSPTNSDRSTDSGK 453 (518) Q Consensus 430 ~~~~~~~~~~~~~~~~~~~~~~~~ 453 (518) +++.+....+ + T Consensus 501 ~~~~~~~~~~-------------~ 511 (511) T protein:vir:10 501 DDTKDTVDKK-------------E 511 (511) T ss_pred CcccCccccc-------------C Confidence 0000000000 0 No 179 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=98.50 E-value=4.2e-07 Score=55.56 Aligned_cols=409 Identities=9% Similarity=0.004 Sum_probs=177.6 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) ++-.. .....+....+..+.. |..+..-.............-.........|+..+.-+-+-|+.+--.++ T Consensus 48 ~i~~~-~~~~~~r~~~l~~Yy~---g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~----- 118 (511) T protein:vir:96 48 YIEHH-MDYQRPRLKVLSDYYE---GKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDK----- 118 (511) T ss_pred HHHHH-HHhhHHHHHHHHHHhc---ccCccccccCcCcccccCcceeecchHHHHHHHHHhhhccCCceeecCch----- Confidence 11000 0000011111111110 00000000000000000000112345567778888888788887632111 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCce--eeE-Eeeecc Q lcl|NC_021305. 
81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTG--RYE-YYFQAG 157 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~--~~~-~~~~~~ 157 (518) .....+..++.. -........+..+++.+|.+|.++-++.+|. +.+.+++|..+.++.+.... ... ..++.. T Consensus 119 ~~~~~l~~~~~~----n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~-~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~ 193 (511) T protein:vir:96 119 DVLEAIEAFNDL----NDVESHNRSLGLDLSIYGKAYELMIRNQDDE-TRLYKSDAMSTFVIYDNTIERNSIAGVRYLRT 193 (511) T ss_pred HHHHHHHHHHhh----cCHHHHHHHHHHHHHhcCeeEEEEEeCCCCc-eEEEEEccceeEEEEcCCCCCceEEEEEEEEe Confidence 111223333332 3455666778889999999999999988886 46778899999887765421 111 111110 Q ss_pred -ccc--C-c---eeEEeccccEEEEeccCC-------------------------CCcccCchHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 158 -AGV--G-T---QLVSFADDEVVPIRFFNP-------------------------DGLERGLSLMESLKSTIFSEDSSRN 205 (518) Q Consensus 158 -~~~--~-~---~~~~~~~~evih~~~~~~-------------------------~~~~~G~s~l~~~~~~i~~~~~~~~ 205 (518) ... . . ....+.++.+.++..... .....|+|-++.+...+.....+.. T Consensus 194 ~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S 273 (511) T protein:vir:96 194 KPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAES 273 (511) T ss_pred eeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCCceeeEEecCCCCCCCchhhhHHHHHHHHHHHH Confidence 000 0 0 011233444444321100 0012578888888877777776665 Q ss_pred HHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccc-cCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHH Q lcl|NC_021305. 206 ATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSN-TGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEV 284 (518) Q Consensus 206 ~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n-~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~I 284 (518) -..+.+...+.|-.+++-....+.++.....+.-.-....... 
.+...-.+.+.++..+........+....+.+.+.| T Consensus 274 ~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I 353 (511) T protein:vir:96 274 DTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDI 353 (511) T ss_pred HHHHHHHHhhCceeeeecCccCCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHH Confidence 5556666666666665443333444333322211000000000 000111234555555655555666778888889999 Q ss_pred HHHhcCCHHHhccccccccCCHHHH-------------HHHHHHHHhhHHHHHHHHHHHHhhhhhhc-ccccceecchhh Q lcl|NC_021305. 285 CGVYDIAPPIVHILDRATFSNISAQ-------------MRAFYRDTMAIPIARIQSAMDKYVGQYWV-RKNRMKFDIDDV 350 (518) Q Consensus 285 a~~fgVPp~~lg~~~~~~~sn~e~~-------------~~~~~~~~l~P~~~~ie~~l~~~l~~~~~-~~~~~~fd~~~l 350 (518) ...-++|..-.+... ++- +..+. ....+..++.-.++.|...+....-.... ....+++.+..- T Consensus 354 ~~~s~~p~~~~~~~~-~n~-Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~ 431 (511) T protein:vir:96 354 HMFTNTPNMKDDNFS-GTQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRN 431 (511) T ss_pred HHHhCCccccccccc-ccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccccccceEEeCCC Confidence 999999865432211 121 11111 11122222333332222222211100001 112456667777 Q ss_pred hhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccccccccccccc-----CCCCCCCCCCCCCccCC Q lcl|NC_021305. 351 IQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDG-----AVEWEEAPAPKRPASTP 425 (518) Q Consensus 351 ~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~-----~~~~~~~~~~~~~~~~~ 425 (518) +..|..+.++.+.++ .|++|...+++++++-. ++. .++ ..+...... +......+.+..... 
T Consensus 432 ~p~n~~e~~~~~~kl--~G~iS~et~l~~l~~v~--D~~-~E~------~ri~~E~~~~~~~~~~~~~~~~~~~~~~~-- 498 (511) T protein:vir:96 432 LPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQ--DPE-LEV------KKIEEDEKESIKKAQKGIYKDPRDINDDE-- 498 (511) T ss_pred CCCCHHHHHHHHHHH--hccCChHHHHHhCCCCC--CHH-HHH------HHHHHHHHHHHHHHhhccccCCCCCCCCC-- Confidence 888999999999887 68999988888886532 211 111 111100000 000000000000000 Q ss_pred CCCCCccccCCccccccchhcchhhHHH Q lcl|NC_021305. 426 VASLDQSPPTSVPGLSPTNSDRSTDSGK 453 (518) Q Consensus 426 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (518) +++++.+....+ + T Consensus 499 --~~~~~~~~~~~~-------------~ 511 (511) T protein:vir:96 499 --QDDDTKDTVDKK-------------E 511 (511) T ss_pred --CCCccccccccc-------------C Confidence 000000000000 0 No 180 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=98.49 E-value=4.7e-07 Score=55.28 Aligned_cols=401 Identities=8% Similarity=0.064 Sum_probs=171.9 Q ss_pred CcCCCCCCCCcccccc--------cchhh--hhhh-cccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceE Q lcl|NC_021305. 1 MLLANGQTLSAPAMAE--------LSPQM--QDSY-YYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVK 69 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~--------~~~~~--~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~ 69 (518) |.|..-......-... ...+- ..-+ |......-+.......... ...+.....|+..+.-+-.-|+. T Consensus 9 ~~~p~d~~~~~~~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~k--i~~n~~~~ivd~~~~~l~g~~~~ 86 (453) T protein:vir:39 9 MTFPKDEPITNEVVTKFMEKHRLEVARYEYLKNMYRGIMAIDAEPTKDLWKPDNR--LTVNFTKYIVDTFTGYFNGIPVK 86 (453) T ss_pred eEcCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhhccCchhcCCCccccCccce--eecchHHHHHHHHhhhhcccCce Confidence 4444322211110000 00000 0000 1011111110000001111 22356677888888887777776 Q ss_pred EEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCce- Q lcl|NC_021305. 
70 CMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTG- 148 (518) Q Consensus 70 v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~- 148 (518) +--.+ +.....+..++.. | ........+..+.+.+|.+|+.+.++.+|.+ .+..++|..+.+..+.... T Consensus 87 ~~~~d-----~~~~~~l~~i~~~-N---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~ 156 (453) T protein:vir:39 87 KSHSD-----KETLSKLQEFDNL-N---DMEDEESELAKMACIYGRAFELLYQNEETQT-NVIYNTPENMFMVYDDTIKQ 156 (453) T ss_pred eccCC-----hHHHHHHHHHHHh-c---ChhHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccceEEEecCCCCC Confidence 63211 1111223334333 2 4455677788999999999999999988864 4667888888887765332 Q ss_pred eeEE--eeecccccCceeEEeccccEEEEeccC-----------C---------CCcccCchHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 149 RYEY--YFQAGAGVGTQLVSFADDEVVPIRFFN-----------P---------DGLERGLSLMESLKSTIFSEDSSRNA 206 (518) Q Consensus 149 ~~~~--~~~~~~~~~~~~~~~~~~evih~~~~~-----------~---------~~~~~G~s~l~~~~~~i~~~~~~~~~ 206 (518) ...+ .++...........+.++.+.++.... + .....|.|-++.+...+.....+..- T Consensus 157 ~~~~~ir~~~~~~~~~~~~~yt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~s~ 236 (453) T protein:vir:39 157 EPLFAVRYGYDDDYKLYGEVYTKETTYALNGTMGFYNMTEQAPNPFDDLPVVEFYFNEERMSIFESVISLVNAFNKAISE 236 (453) T ss_pred eEEEEEEEEEeCCeEEEEEEEeCCeEEEEEecCCceeeecccccCCCceeEEEecCCCCCCcchhhhHHHHHHHHHHHHH Confidence 1111 111111000001112222222222110 0 01135777787777666666555544 Q ss_pred HHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHH Q lcl|NC_021305. 207 TAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCG 286 (518) Q Consensus 207 ~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~ 286 (518) ..+.+...+.|..+++- ..++++....++.. ......+ ....+.+.++..+..+.....+.+..+.+.+.|+. 
T Consensus 237 ~~~~~~~~~~p~~~~~g-~~~~~~~~~~~~~~---~~~~~~~---~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~ 309 (453) T protein:vir:39 237 KANDVDYFSDQYLTFLG-AAVEEEDLKNIRSN---RVINYYG---ESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQ 309 (453) T ss_pred HHHHHHHhhCceeeeec-CCCCchhhhhhhhc---ceeeecC---CCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHH Confidence 44555555666655542 33455444443321 0000000 00112333344444444455566778888888888 Q ss_pred HhcCCHHHhccccccccCCHHH----------HHHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCHH Q lcl|NC_021305. 287 VYDIAPPIVHILDRATFSNISA----------QMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWE 356 (518) Q Consensus 287 ~fgVPp~~lg~~~~~~~sn~e~----------~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~ 356 (518) .-++|..-.+..++.++...+. .....+..++...+..+...++..-. ......+++.+..-+..|.. T Consensus 310 ~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~--~~~~~~i~v~f~~~~p~~~~ 387 (453) T protein:vir:39 310 TTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVSN--KEAWKDIEYTFTRNEPKDIK 387 (453) T ss_pred HhCCcccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC--ccccccceEEeCCCCCcCHH Confidence 8888742221111111111111 11112222333333333222211100 01112445666777888999 Q ss_pred HHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCC Q lcl|NC_021305. 357 AKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTS 436 (518) Q Consensus 357 ~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 436 (518) +.++.+.++ .|+++..-+.+++++- +++. +++ ..+................ +.. +.+++.+++. T Consensus 388 ~~a~~~~kl--~g~is~et~l~~l~~v--~D~~-~E~------~ri~~E~~~~~~~~~~~~~---~~~--~~~~~~~~~~ 451 (453) T protein:vir:39 388 EQAETANIL--MGITSQETALSVISVI--PDVQ-AEM------EKIKKEEASTAIFDKDKQP---SEK--GTDTVVPETN 451 (453) T ss_pred HHHHHHHHH--hccCChHHHHHhCCCC--CCHH-HHH------HHHHHHHHHHHHHHHhccC---CCC--CCCCCCCCcC Confidence 999999887 5789998888888653 2221 111 1111000000000000000 000 0000000000 Q ss_pred ccc Q lcl|NC_021305. 
437 VPG 439 (518) Q Consensus 437 ~~~ 439 (518) .+ T Consensus 452 -~e 453 (453) T protein:vir:39 452 -EE 453 (453) T ss_pred -CC Confidence 00 No 181 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=98.47 E-value=5.1e-07 Score=55.07 Aligned_cols=381 Identities=8% Similarity=-0.026 Sum_probs=170.9 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) +-+-.|+. +....... .+ ......... ...-..++....+|+..+.-+-.-|+.+--.++ . T Consensus 67 ~~YY~g~~-~i~~~~~~-~~-------~~~~~~~~~------~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~d~----~ 127 (492) T protein:vir:97 67 QEYYEQRP-DIVKEPKP-VD-------ATGAVDPLK------PDDRMITNFHANLVDQKVSYIVGKPIAFKHTDD----E 127 (492) T ss_pred HHHhcccC-cccccccc-cc-------ccccccccc------cccccccchHHHHHHHHhhhhcccCceeccCch----H Confidence 00101110 00000000 00 000000000 000112355677888888888777876522111 1 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCC-ce-eeEEeeeccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSR-TG-RYEYYFQAGA 158 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~-~~-~~~~~~~~~~ 158 (518) ....+..++. | +.......+..+++.+|.+|.++..+.+|.+ .+..++|..+.+.++.. .. .......+.. 
T Consensus 128 -~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~-~~~~~~p~~~~~i~d~~~~~~~~~~vr~~~~ 200 (492) T protein:vir:97 128 -VVKRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKL 200 (492) T ss_pred -HHHHHHHHHh--c---cHHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEee Confidence 1122223332 2 2345556678899999999999999888864 57778999998887642 11 1111111111 Q ss_pred ccCceeEEeccccEEEEecc---------------------CC---------CCcccCchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 159 GVGTQLVSFADDEVVPIRFF---------------------NP---------DGLERGLSLMESLKSTIFSEDSSRNATA 208 (518) Q Consensus 159 ~~~~~~~~~~~~evih~~~~---------------------~~---------~~~~~G~s~l~~~~~~i~~~~~~~~~~~ 208 (518) ........+.+..+.|+... ++ .....|.|-+..+...+.....+..-.. T Consensus 201 ~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~ 280 (492) T protein:vir:97 201 ENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLS 280 (492) T ss_pred ccceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCCCCCCchHhHHHHHHHHHHHHHHHH Confidence 11111111222222222110 00 0012478888877777777666655555 Q ss_pred HHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHh Q lcl|NC_021305. 
209 AMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVY 288 (518) Q Consensus 209 ~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~f 288 (518) +.+...+.|-.+++-- +.++...++..+ ...+++.++++.+...+........+....+.+.+.|+..- T Consensus 281 ~~~~~~~~~~l~~~g~---~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s 349 (492) T protein:vir:97 281 NTFKDSNELTYVLKNY---DDQELPEFKRLL--------RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFG 349 (492) T ss_pred HHHHHhccceeeeecC---CcccchhHHHHH--------hhccceecCCCCcceeEeccCCHHHHHHHHHHHHHHHHHHh Confidence 6666666666555422 222222222211 11234556666555555555556667788888889999998 Q ss_pred cCCHHHhccccccccCCHHHHH-------------HHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcCH Q lcl|NC_021305. 289 DIAPPIVHILDRATFSNISAQM-------------RAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDW 355 (518) Q Consensus 289 gVPp~~lg~~~~~~~sn~e~~~-------------~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~ 355 (518) ++|..-.+... ++ .+.++.. ...+...+...++.+...+ ........+++.+..-+..|. T Consensus 350 ~~p~~~~~~~~-~n-~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~-----~~~~~~~~i~v~f~~~~p~~~ 422 (492) T protein:vir:97 350 QAVDFSSDKFG-SA-PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF-----DIKGEHKDVDISFNYNKVANT 422 (492) T ss_pred CCCCCCccccc-cC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----cCCcccceeeEEecCCCCCCH Confidence 88853321111 11 1222111 1111122222222222111 111122345566677788899 Q ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccC Q lcl|NC_021305. 356 EAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPT 435 (518) Q Consensus 356 ~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (518) .+.++.+.++ .|++|..-+.++++.-.-+....+. +........ +..+... 
....+...+++.+++ T Consensus 423 ~e~a~~~~kl--~G~iS~et~l~~l~~v~d~~~Eler---------i~~E~~~~~--~~~~~~~-~~~~~~~~~~~~~~~ 488 (492) T protein:vir:97 423 ELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELER---------IEQEQTEYN--KQLPNLD-DGGADSAQQQERSNN 488 (492) T ss_pred HHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHH---------HHHHHHHHH--Hhhhccc-cCCCCCCcccccccc Confidence 9999999988 5889988888888653211111111 100000000 0000000 000011111111111 Q ss_pred Cccc Q lcl|NC_021305. 436 SVPG 439 (518) Q Consensus 436 ~~~~ 439 (518) ...+ T Consensus 489 ~~~e 492 (492) T protein:vir:97 489 KESE 492 (492) T ss_pred cccC Confidence 1111 No 182 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=98.44 E-value=6.5e-07 Score=54.52 Aligned_cols=392 Identities=10% Similarity=0.035 Sum_probs=166.6 Q ss_pred CcCCCCCCCC-cccccccc-------hhh--hhhh-cccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceE Q lcl|NC_021305. 1 MLLANGQTLS-APAMAELS-------PQM--QDSY-YYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVK 69 (518) Q Consensus 1 ~~f~~~~~~~-~~~~~~~~-------~~~--~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~ 69 (518) +-|....... ..-...+. ... ..-+ |......-... .. .....-...+.....|+..+.-+-.-|+. T Consensus 9 ~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~-~~-~~~~~ki~~n~~~~ivd~~~~~l~g~~~~ 86 (452) T protein:vir:36 9 MTFSKDEPITVEVVTKFMEKHKLEVARYEYLKNMYLGIMAIDDEPAK-DS-WKPDNRLAVNFTKYIVDTFTGYFNGIPVK 86 (452) T ss_pred EEcCCccCCCHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCccc-cc-cCccceeecchHHHHHHHHhhhhcccCce Confidence 1111111110 00000000 000 0000 00000000000 00 00000122345667788888777777776 Q ss_pred EEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCce- Q lcl|NC_021305. 70 CMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTG- 148 (518) Q Consensus 70 v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~- 148 (518) +--.++ .....+..++.. 
-........+..+.+.+|.+|..+.++.+|.+ .+..++|..+.++.+.... T Consensus 87 ~~~~d~-----~~~~~l~~~~~~----n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~~~~~v~d~~~~~ 156 (452) T protein:vir:36 87 KSHSDK-----EILTKLQEFDNL----NDMEDEESELAKMACIYGRAFEFLYQDEDTQT-NVVYNSPENMFMVYDDTVKQ 156 (452) T ss_pred eecCCh-----hHHHHHHHHHhh----cChhHHHHHHHHHHHhcCeEEEEEEecCCCee-EEEEEcccceEEEEcCCCCC Confidence 532211 111223333332 24555667788999999999999999888865 5677899988887765421 Q ss_pred -eeEEeeecccccCc-eeEEeccccEEEEecc-----------CC---------CCcccCchHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 149 -RYEYYFQAGAGVGT-QLVSFADDEVVPIRFF-----------NP---------DGLERGLSLMESLKSTIFSEDSSRNA 206 (518) Q Consensus 149 -~~~~~~~~~~~~~~-~~~~~~~~evih~~~~-----------~~---------~~~~~G~s~l~~~~~~i~~~~~~~~~ 206 (518) .......+...... ....+.++.++++... ++ .....|.|-+......+.....+..- T Consensus 157 ~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~s~ 236 (452) T protein:vir:36 157 EPLFAVRYGVDEDKKLQGEVYTLLETIKISGENDEISFGEGTYNPYPDLPVVEFYFNEERMSIFESVISLVNAFNKAISE 236 (452) T ss_pred ceEEEEEEEEecCceEEEEEEecCeEEEEEEcCCceEEecceeccCCcccEEEecCCCCCCcchHHHHHHHHHHHHHHHH Confidence 11111111111111 1111222222222110 00 01124777777666666666655555 Q ss_pred HHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCC-----CcceeeccCChhhHHHHHHHHHHH Q lcl|NC_021305. 207 TAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEE-----GMEPIPLQLTAVEMQFIEARQLNR 281 (518) Q Consensus 207 ~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~-----g~~~~~l~~~~~d~~~~e~~~~~~ 281 (518) ..+.+...+.|-.+++ ...++++....++. ++++.++. +.++..+..+.....+....+.+. 
T Consensus 237 ~~~~~~~~~~p~~~~~-g~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~ 303 (452) T protein:vir:36 237 KANDVDYFSDQYLTFL-GAAVEEEDLKNIRS------------NRVINYYADGEGKNVDVKFLEKPDSDSQTENLLDRLT 303 (452) T ss_pred HHHHHHHhcCceeEee-cCCcCchhhhhhhh------------cceEEecCCCCccCCcceeEeecCCHHHHHHHHHHHH Confidence 5555555566655553 23344433332221 12222221 223333444444555677788888 Q ss_pred HHHHHHhcCCHHHhccccccccCCHHHH-------------HHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecch Q lcl|NC_021305. 282 EEVCGVYDIAPPIVHILDRATFSNISAQ-------------MRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDID 348 (518) Q Consensus 282 ~~Ia~~fgVPp~~lg~~~~~~~sn~e~~-------------~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~ 348 (518) +.|+..-++|..-.+ ..++ .+..+. ....+..++...+..|...++..- .......+++.+. T Consensus 304 ~~I~~~s~~p~~~~~--~~gn-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~--~~~~~~~i~i~f~ 378 (452) T protein:vir:36 304 KLIFQTTMVANISDE--SFGS-SSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVS--NKDSWKDIEYTFT 378 (452) T ss_pred HHHHHHhCccccCcc--cccC-CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC--CccccccceEEeC Confidence 889888888843221 1112 111111 111222222333222222111110 0001123556667 Q ss_pred hhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCC Q lcl|NC_021305. 349 DVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVAS 428 (518) Q Consensus 349 ~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (518) .-+..|..+.++.+.++ .|+++..-+.++++.-. ++. ++ +..+..................+......+ T Consensus 379 ~~~p~d~~~~a~~~~k~--~g~iS~et~~~~~~~~~--d~~-~E------~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 447 (452) T protein:vir:36 379 RNEPKDIKEQAETANIL--MGITSQETALSVISVIP--DVQ-AE------MEKIKKEEASTAIFDKDKQPSEKGTDTVVS 447 (452) T ss_pred CCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCC--CHH-HH------HHHHHHHHHHHHHHHhhccCCCCcccccCc Confidence 77888999999999887 57899888888886532 221 11 111110000000000000010110000000 Q ss_pred CCccccCCccc Q lcl|NC_021305. 
429 LDQSPPTSVPG 439 (518) Q Consensus 429 ~~~~~~~~~~~ 439 (518) ++ ..+ T Consensus 448 ~~------~~e 452 (452) T protein:vir:36 448 ET------NEE 452 (452) T ss_pred cc------cCC Confidence 00 000 No 183 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=98.40 E-value=8.4e-07 Score=53.90 Aligned_cols=375 Identities=10% Similarity=-0.016 Sum_probs=171.0 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) =++.+............ ... +.... ... ...-..++....+|+..+.-+-.-|+.+--.+ . T Consensus 45 ~yy~g~~~i~~~~~~~~----------~~~-~~~~~-~~~--~~~ki~~~~~~~Ivd~~~~~l~g~p~~~~~~~-----~ 105 (479) T protein:vir:79 45 EYYYGNTDVNNKRRYYL----------LDG-AKVDD-FTK--VNNKAINNYHKLLVDQKVGYSVGNPIVFNADD-----D 105 (479) T ss_pred HHhccCCcccccccccc----------ccc-ccccc-ccc--CcceeecchHHHHHHHHHhhhhcCCceeccCC-----H Confidence 01111100000000000 000 00000 000 00012245567788888888878887762211 1 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCce--eeEE-e-eec Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTG--RYEY-Y-FQA 156 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~--~~~~-~-~~~ 156 (518) .....+..++. | ........++.+.+.+|.+|..+..+..|.+ .+..++|..+.++.+.... .... + |.. 
T Consensus 106 ~~~~~~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~y~~ 179 (479) T protein:vir:79 106 NLTKLLNDLLG--E---EFDDTITELYLNASNKGVEWLHPYINRKGEF-KYVIIPAEEAIPIWDSKRQRELVAFIRFYYI 179 (479) T ss_pred HHHHHHHHHHh--c---CHHHHHHHHHHHHHhcCeEEEEEEeCCCCce-EEEEEccceeEEEEeCCCCCceEEEEEEEEE Confidence 11122223322 2 4556667788899999999999988888865 4777899998888765321 1111 0 111 Q ss_pred ccccCc---eeEEeccccEEEEeccCC---------------------------------------CCcccCchHHHHHH Q lcl|NC_021305. 157 GAGVGT---QLVSFADDEVVPIRFFNP---------------------------------------DGLERGLSLMESLK 194 (518) Q Consensus 157 ~~~~~~---~~~~~~~~evih~~~~~~---------------------------------------~~~~~G~s~l~~~~ 194 (518) ....+. ....+..+.+.|++.... .+..+|.|-+..+. T Consensus 180 ~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~sd~~~v~ 259 (479) T protein:vir:79 180 EDIDGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFKNNEKCVSDLTFYK 259 (479) T ss_pred eecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCcccEEEecCCCCCCcchhhhH Confidence 111110 111122333333221000 01124777787777 Q ss_pred HHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHH Q lcl|NC_021305. 195 STIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQF 273 (518) Q Consensus 195 ~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~ 273 (518) ..+.....+..-..+.+...+.|-.+++-- +...++ +... ...++++.++++.++..+..+.....+ T Consensus 260 ~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~----~~~~--------~~~~~~i~~~~~~~~~~l~~~~~~~~~ 327 (479) T protein:vir:79 260 SLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQE----FIDN--------IRYYKSIKVDGGGGVDKLEINIPVEAK 327 (479) T ss_pred HHHHHHHHHHHHHHHHHHHhhCceeeeecCCcccccc----chhh--------hhhccceecCCCCcceEEeccCCHHHH Confidence 777666666555555566666666665431 111111 1111 123445666766665555555555666 Q ss_pred HHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH-------------HHHHHHHHHhhHHHHHHHHHHHHhhhhhhccc Q lcl|NC_021305. 
274 IEARQLNREEVCGVYDIAPPIVHILDRATFSNISA-------------QMRAFYRDTMAIPIARIQSAMDKYVGQYWVRK 340 (518) Q Consensus 274 ~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~-------------~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~ 340 (518) ....+...+.|...-++|..-.+.. ++ .+..+ .....+...+.-.++.+...++..-.. .... T Consensus 328 ~~~~~~l~~~i~~~s~~p~~~~~~~--gn-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~-~~~~ 403 (479) T protein:vir:79 328 KELLDRLEKNIIIFGQGVNPESQNT--GD-KSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISGNK-SYDY 403 (479) T ss_pred HHHHHHHHHHHHHHhCccccccccc--cc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCC-cccc Confidence 7778888888888888885433221 12 11111 111122222333333322222211100 0112 Q ss_pred ccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCC Q lcl|NC_021305. 341 NRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKR 420 (518) Q Consensus 341 ~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~ 420 (518) ..+++.+..-+..|.++.++.+.++ .|+++...+.++++. ++++.. + +..+. .......+....-. T Consensus 404 ~~i~i~f~~~~p~~~~~~a~~~~kl--~g~iS~et~l~~l~~--v~d~~~-E------~~ri~---~E~~~~~~~~~~~~ 469 (479) T protein:vir:79 404 KTVQITFNHSMIINEAEKIDMAAKS--TGIVSDETIVSNHPW--VEDVND-E------LERLK---KQEDTQKEYDDLIP 469 (479) T ss_pred ccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCC--CCCHHH-H------HHHHH---HHHHHHHHHHhccC Confidence 3456777777888999999999887 488998888888764 222211 1 11111 00000000000000 Q ss_pred CccCCCCCCCccccCC Q lcl|NC_021305. 
421 PASTPVASLDQSPPTS 436 (518) Q Consensus 421 ~~~~~~~~~~~~~~~~ 436 (518) ....+..++ + T Consensus 470 ~~~~~~~~e------~ 479 (479) T protein:vir:79 470 NNQDGVIDE------T 479 (479) T ss_pred cccCCCcCc------C Confidence 000000000 0 No 184 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=98.40 E-value=8.4e-07 Score=53.88 Aligned_cols=397 Identities=9% Similarity=0.008 Sum_probs=170.3 Q ss_pred CcCCCC--CCCCcccccccchhhhhh--hcccc--ccccccc--cc---chhhhHHHhhcHHHHHHHHHHHHhhccCceE Q lcl|NC_021305. 1 MLLANG--QTLSAPAMAELSPQMQDS--YYYAP--AVGMQLE--RQ---FSLYGGIYKNQPWVRTVIAKRAQALARLPVK 69 (518) Q Consensus 1 ~~f~~~--~~~~~~~~~~~~~~~~~~--~~~~~--~~~~~~~--~~---~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~ 69 (518) |+...- +-..+...+ . +.+... +..+. ...-+.. .. .......-..++....+|+..+.-+-+-|+. T Consensus 43 ~~~~~~i~~~i~~~~~~-~-~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~G~p~~ 120 (492) T protein:vir:94 43 ETLEEMIVRYIKQHLEK-L-PEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIA 120 (492) T ss_pred hhHHHHHHHHHHHHHHH-H-HHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHHhhhcccCce Confidence 000000 000000000 0 000000 00000 0000000 00 0000001122456677888888888777877 Q ss_pred EEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCC-ce Q lcl|NC_021305. 70 CMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSR-TG 148 (518) Q Consensus 70 v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~-~~ 148 (518) +--.++ . ....+..++. | +.......+..+.+.+|.+|.++-.+.+|.+ .+..++|..+.+.++.. .. 
T Consensus 121 ~~~~d~----~-~~~~l~~~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~~~~~~p~~~~~v~d~~~~~ 189 (492) T protein:vir:94 121 FKHTDD----E-VVKRIDEVLG--N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHE 189 (492) T ss_pred eccCch----H-HHHHHHHHHh--c---cHHHHHHHHHHHHhhCCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCC Confidence 632111 1 1112223332 2 3445566788999999999999988888864 57778999988877642 11 Q ss_pred -eeEEeeecccccCceeEEeccccEEEEecc---------------------CC---------CCcccCchHHHHHHHHH Q lcl|NC_021305. 149 -RYEYYFQAGAGVGTQLVSFADDEVVPIRFF---------------------NP---------DGLERGLSLMESLKSTI 197 (518) Q Consensus 149 -~~~~~~~~~~~~~~~~~~~~~~evih~~~~---------------------~~---------~~~~~G~s~l~~~~~~i 197 (518) ......++.....+....+....+.++... ++ .....|.|-++.+...+ T Consensus 190 ~~~a~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~sd~e~v~~li 269 (492) T protein:vir:94 190 ELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLI 269 (492) T ss_pred ceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeeccccccccccccccccCCCccceEEecCCCCCCCchHHHHHHH Confidence 111111111111111111222222222110 00 01125788888877777 Q ss_pred HHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHH Q lcl|NC_021305. 198 FSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEAR 277 (518) Q Consensus 198 ~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~ 277 (518) .....+..-..+.+...+.|..+++-- +.++...++..+ ...+++.++++.+...+........+.... 
T Consensus 270 Da~d~~~S~~~~~~~~~~~p~lv~~g~---~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 338 (492) T protein:vir:94 270 DAYNRRLSDLSNTFKDSNELTYVLKNY---DDQELPEFKRLL--------RYYGAIKVSDNGGVDTIQVEVPVENSKKYL 338 (492) T ss_pred HHHHHHHHHHHHHHHHhcCceeeeecC---CcccchhhHHHH--------hhccceecCCCCcceeEeccCCHHHHHHHH Confidence 777766656666666666666555321 222222222221 122345556555554454444555566777 Q ss_pred HHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHH----HHHHHHH---hhhhh---hcccccceecc Q lcl|NC_021305. 278 QLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIAR----IQSAMDK---YVGQY---WVRKNRMKFDI 347 (518) Q Consensus 278 ~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~----ie~~l~~---~l~~~---~~~~~~~~fd~ 347 (518) +.+.+.|+..-++|..-.+.. +++ .+.++ ..+....+.-.+.. +...+.+ .++.. ......+++.+ T Consensus 339 ~~l~~~I~~~s~~p~~~~~~~-~~n-~Sg~A--l~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~f 414 (492) T protein:vir:94 339 DELYQKIMLFGQAVDFSSDKF-GSA-PSGVA--LEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISF 414 (492) T ss_pred HHHHHHHHHHhCCcCCCcccc-ccC-chHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccceeeEEe Confidence 888888888888884322111 111 12222 11111111111111 1111111 11111 11223456667 Q ss_pred hhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCC Q lcl|NC_021305. 348 DDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVA 427 (518) Q Consensus 348 ~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (518) ..-+..|..+.++.+.++. |+++..-++++++.-+-+....+.+ ...........+..... ..... T Consensus 415 ~~~~p~~~~e~~~~~~kl~--giiS~et~~~~l~~v~d~~~E~eri---------~~E~~~~~~~~~~~~~~---~~~~~ 480 (492) T protein:vir:94 415 NYNKVANTELQVQTAQQSM--GIVSHETVLENHPFVEDLQAELERI---------EQEQMEYNKQLPNLDDG---GADSA 480 (492) T ss_pred cCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHHHH---------HHHHHHHHhhccccccc---cCCCC Confidence 7778899999999999884 8899888888886532211111111 00000000000000000 00000 Q ss_pred CCCccccCCccc Q lcl|NC_021305. 
428 SLDQSPPTSVPG 439 (518) Q Consensus 428 ~~~~~~~~~~~~ 439 (518) .+++..++++.+ T Consensus 481 ~~~~~~~~~e~e 492 (492) T protein:vir:94 481 QQQERSNNKESE 492 (492) T ss_pred ccccCCccccCC Confidence 011111111111 No 185 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=98.40 E-value=8.5e-07 Score=53.86 Aligned_cols=358 Identities=10% Similarity=0.039 Sum_probs=165.6 Q ss_pred CCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHH-hhcHHHHHHHHHHHHhhccCceEEEEecCCcceec Q lcl|NC_021305. 3 LANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIY-KNQPWVRTVIAKRAQALARLPVKCMFTSGDTETEE 81 (518) Q Consensus 3 f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~~ 81 (518) +.. -.+.-..+..+.. |-......+.. -....+..+ +...+...+|+.++..+.--.|. .. T Consensus 1 l~~----~~~r~~~~~~yY~---g~~~~~~~~~~-~p~~~~~~~~~v~nw~~~~Vds~a~rl~~~Gf~---~~------- 62 (410) T protein:vir:95 1 MNL----YQSRVNLRYKHYA---MQHYEAPTGIT-IPAHIRAKYQAVLGWAAKGVDSLADRLIFRAFA---ND------- 62 (410) T ss_pred CCc----chhhHHHHHHHhc---CCCCccccchh-ccHHHHhHHHhhcchhHHHHHHhHhhhcccccc---CC------- Confidence 100 0111100111110 00000000000 001111111 22345566777777655433332 11 Q ss_pred cchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeec--ccc Q lcl|NC_021305. 82 SDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQA--GAG 159 (518) Q Consensus 82 ~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~--~~~ 159 (518) +..+..++.+ | +.......+..+.+++|.+|+.+..+.+|.+ .+.+++|..+.+.++.....+.+.+.. ... 
T Consensus 63 -d~~l~~i~~~-N---~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~-~i~~~sP~~~~~i~Dp~~~~~~~al~~~~~~~ 136 (410) T protein:vir:95 63 -DFNVTEIFDR-N---NPDIFFDSAILSALIGSCSFVYISKGEDDEV-RLQVIESSNATGVIDPITGLLVEGYAVLARDD 136 (410) T ss_pred -CchHHHHHhh-c---ChHHHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEcccceEEEEeCCCCceEEEEEEEEecC Confidence 1123344432 2 3344566778899999999999998888865 678899999998887765544332221 111 Q ss_pred cCc--eeEEeccc---------------------cEEEEeccCCCCcccCchH----HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 160 VGT--QLVSFADD---------------------EVVPIRFFNPDGLERGLSL----MESLKSTIFSEDSSRNATAAMWK 212 (518) Q Consensus 160 ~~~--~~~~~~~~---------------------evih~~~~~~~~~~~G~s~----l~~~~~~i~~~~~~~~~~~~~~~ 212 (518) .+. ....+.++ .|++|.+....+..+|.|- +..+.+.+.....-......||. T Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a 216 (410) T protein:vir:95 137 YNRPTLEAYFEPNATHFIPKDGEPYSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYS 216 (410) T ss_pred CCeEEEEEEEeCCcEEEEeeCCccccccCCCCCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhc Confidence 111 11122223 3344443322233467773 44555555444444444555543 Q ss_pred ccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCC-----CcceeeccCChhhHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 213 NAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEE-----GMEPIPLQLTAVEMQFIEARQLNREEVCGV 287 (518) Q Consensus 213 ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~-----g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~ 287 (518) + .-+.++-.+...+ ..+ .|+.. .++++.++. +.++.++....-+ .|++.++.....||.. 
T Consensus 217 ~--pqr~i~G~d~d~~--~~~----~~~~~------~~~i~~~~~~~~~~~~~v~q~~~~~l~-~~~~~l~~l~~~~a~~ 281 (410) T protein:vir:95 217 W--PQKYILGLDPDAE--PME----KWKAT------VSSLLTISSSDKGVKPSVGQFTTASMS-PFTEQLRTAAAGFAGE 281 (410) T ss_pred c--hhheeeccCCCCC--cCc----hhhhh------hhhheeccCCCCCCcceEEecCCCChH-HHHHHHHHHHHHHhhh Confidence 3 2223332222111 111 23222 234555542 2556555443222 4889999999999999 Q ss_pred hcCCHHHhccccccccCCHHH---HHHHHHHH---HhhHHHHHHHHHHHHhhhhhh--c----ccccceecch---hhhh Q lcl|NC_021305. 288 YDIAPPIVHILDRATFSNISA---QMRAFYRD---TMAIPIARIQSAMDKYVGQYW--V----RKNRMKFDID---DVIQ 352 (518) Q Consensus 288 fgVPp~~lg~~~~~~~sn~e~---~~~~~~~~---~l~P~~~~ie~~l~~~l~~~~--~----~~~~~~fd~~---~l~~ 352 (518) -++|++.+|.... |-++.+. ....+... .-+-+-..+++.+...+.-.. . .....++.|. +... T Consensus 282 s~lP~~~lg~~~~-NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~d~~~ 360 (410) T protein:vir:95 282 MGLTLDDLGFVSD-NPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVRTAVKWEPLFEADA 360 (410) T ss_pred cCCCHHHhccccC-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccceeeEEeeecCCcch Confidence 9999999986543 2233222 11111111 111122223322221111111 0 1122344454 3334 Q ss_pred cCHHHHHHHHHHHHhC--CCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCC Q lcl|NC_021305. 353 PDWEAKSESTQKMVNS--GVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWE 413 (518) Q Consensus 353 ~d~~~~~~~~~~~~~~--G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~ 413 (518) .+....++++.+++++ |+.+..-+++++|+.+-+. . .... 
......+ + T Consensus 361 ~s~a~~aDa~~Kl~~a~~g~~~~~~~~~~lg~~~~~~-----~-~~~~---~e~~~~g----~ 410 (410) T protein:vir:95 361 NTMTMIGDGVVKLNQALPGYINAETIRDLTGIAGDMS-----A-KPVV---SEGGSNG----E 410 (410) T ss_pred hhHHHHHHHHHHHHHhccCCccHHHHHHhcCCChHHH-----H-HHHH---HHHHhCC----C Confidence 4567788888899988 7777788999999964321 0 0000 0000000 0 No 186 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=98.38 E-value=9.5e-07 Score=53.59 Aligned_cols=403 Identities=10% Similarity=0.008 Sum_probs=180.1 Q ss_pred Cc---CCCC--CC--CCcccccc-cchhhhhhhcccccccccc-----cccchhhhHHHhhcHHHHHHHHHHHHhhccCc Q lcl|NC_021305. 1 ML---LANG--QT--LSAPAMAE-LSPQMQDSYYYAPAVGMQL-----ERQFSLYGGIYKNQPWVRTVIAKRAQALARLP 67 (518) Q Consensus 1 ~~---f~~~--~~--~~~~~~~~-~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~ 67 (518) |+ +.+. .. .+.++... ....+. .+..++.|-.. ................-..+++..|+-+..=| T Consensus 14 ~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~--~~~~~y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~~~~A~lv~~e~ 91 (522) T protein:vir:47 14 GRYYMQTSNLNSILEHPKIAVTQEEYDRIK--RNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPIARTASKKIASLVYNEQ 91 (522) T ss_pred HHHHhhcccchhccccCCCCCCHHHHHHHH--HHHHHhcCCcccccccccCcchhcccceecchHHHHHHHHhhhhcCCc Confidence 22 2210 00 01111100 000010 01111111100 00111111122333444667777777776655 Q ss_pred eEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEE-cCC Q lcl|NC_021305. 68 VKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKR-NSR 146 (518) Q Consensus 68 ~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~-~~~ 146 (518) ..+--.+ ......+..++. .-.+...+...+...+..|.+++.+..+. |. +.+..+++..+.+.. +.. 
T Consensus 92 ~~i~v~d-----~~~~~~l~~~l~----~n~f~~~~~~~~e~a~a~G~~a~k~~~d~-~~-~~i~~v~ad~~~P~~~~~~ 160 (522) T protein:vir:47 92 ATITTKN-----EILQKFLDDMLT----NDRFNKNFERYLESCLALGGLAMRPYIDG-DK-VRVAFIQAPVFFPLESNTQ 160 (522) T ss_pred ceeecCC-----hHHHHHHHHHHh----hcchHHHHHHHHHHhhccCCEEEEEEEcC-Cc-eEEEEEcCCceEEEEEcCC Confidence 4432111 111122222332 23345556667778888888888877764 32 445556666555432 211 Q ss_pred ce----------------eeEEe-----------------------------eeccc--ccCceeEEe------------ Q lcl|NC_021305. 147 TG----------------RYEYY-----------------------------FQAGA--GVGTQLVSF------------ 167 (518) Q Consensus 147 ~~----------------~~~~~-----------------------------~~~~~--~~~~~~~~~------------ 167 (518) +. ..+|. .+... ...|..+.+ T Consensus 161 ~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~ 240 (522) T protein:vir:47 161 DVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSELDKYKNLEPV 240 (522) T ss_pred ceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCccccccccccccCCCCc Confidence 11 00111 00000 000111100 Q ss_pred ------ccccEEEEeccCCC----CcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcc----cccccCccCCHHHHH Q lcl|NC_021305. 168 ------ADDEVVPIRFFNPD----GLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPN----LVLRHEKRLSEAAQQ 233 (518) Q Consensus 168 ------~~~evih~~~~~~~----~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~----~il~~~~~~~~~~~~ 233 (518) ..--+.||+.+.++ +.++|+|.+..+...+.........-..-|+.|-..- .+++........... T Consensus 241 ~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~ 320 (522) T protein:vir:47 241 TVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEHLTQRQYQRPDGTID 320 (522) T ss_pred eEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeecchHHhccCCCCCCcccc Confidence 01113466654332 3457999999999999888876666666666554322 223322111100000 Q ss_pred HHHHHH---HHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHH- Q lcl|NC_021305. 
234 RLREQF---DRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQ- 309 (518) Q Consensus 234 ~~~~~~---~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~- 309 (518) ....| ...|.+... -.+++-+++.+.....+.++.+..+...+.|+...|+++..+|...++. .+..+. T Consensus 321 -~~~~fd~~~~~f~~~~~-----~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~-kTAtEi~ 393 (522) T protein:vir:47 321 -FRPRFDVEQNVYMQIGG-----SSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGM-KTATEIV 393 (522) T ss_pred -cccccCcccceEeecCC-----CCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCcccccc-ccHHHHH Confidence 00001 111211110 0123345667777778889999999999999999999999998765532 334333 Q ss_pred ------------HHHHHHHHhhHHHHHHHHHHHHh-hhh-hhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHH Q lcl|NC_021305. 310 ------------MRAFYRDTMAIPIARIQSAMDKY-VGQ-YWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNE 375 (518) Q Consensus 310 ------------~~~~~~~~l~P~~~~ie~~l~~~-l~~-~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE 375 (518) ....+..+|..++..+....+.. ++. .....+.+.|++++-+..|..+.++...+++.+|+|++-+ T Consensus 394 s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~aG~~s~e~ 473 (522) T protein:vir:47 394 SENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELDYWAKMVAAGFSTKKR 473 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHHHHHHHHhcCCCCHHH Confidence 12223333333333333222110 111 1113455778899999999999999999999999999999 Q ss_pred HHHHh-CCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccc Q lcl|NC_021305. 376 GREIM-GLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPG 439 (518) Q Consensus 376 ~R~~~-g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 439 (518) ++.+. |+ +++..+.-+ ..+. ..+... 
.+.+.+-.+.++..++..+ +.+ T Consensus 474 ~i~~~~g~---~eeea~~el-----~ri~---~E~~~~----~~~~~~~~~~~~~~~~~~d-~~~ 522 (522) T protein:vir:47 474 AIGKTLNI---SGVEAEKEL-----NAIN---SELLPM----NDAELAIYGMHDQNEEKAD-DKG 522 (522) T ss_pred HHHhcCCC---ChHHHHHHH-----HHHH---HhhccC----CCCCCCCCCCCCcccccCC-CCC Confidence 87654 43 322222111 1111 000000 0000000011110111000 111 No 187 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=98.36 E-value=1.1e-06 Score=53.36 Aligned_cols=409 Identities=9% Similarity=0.010 Sum_probs=172.2 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) ++-.. .....+....+..+.. |..+..-.............-...+.....|+..+.-+-+-|+.+--.++ T Consensus 48 ~i~~~-~~~~~~r~~~l~~Yy~---g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~----- 118 (511) T protein:vir:99 48 YIEHH-MDYQRPRLKVLSDYYE---GKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDK----- 118 (511) T ss_pred HHHHH-HHhhHHHHHHHHHHhc---ccCccccccCcccccccCcceeecchHHHHHHHHHhhhcccCceeecCch----- Confidence 00000 0000011111111110 00000000000000000000012244566777778777777877621111 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCc--eeeE-Eeeec- Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT--GRYE-YYFQA- 156 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~--~~~~-~~~~~- 156 (518) .....+..++.+ -........+..+++.+|.+|.++.++.+|. ..+..++|..+.++.+... .... ..++. 
T Consensus 119 ~~~~~l~~~~~~----n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~-~~i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~ 193 (511) T protein:vir:99 119 DVLEAIEAFNDL----NDVESHNRSLGLDLSIYGKAYELMIRNQDDE-TRLYKSDAMSTFVIYDNTIERNSIAGVRYLRT 193 (511) T ss_pred HHHHHHHHHHhh----cCHhHHHHHHHHHHHhcCeeEEEEEeCCCCc-eEEEEEccceeEEEEcCCCCCceEEEEEEEEe Confidence 111233333333 2455666778889999999999999988886 4677889999988876542 1111 11110 Q ss_pred cccc----Cc--eeEEeccccEEEEeccCC-------------------------CCcccCchHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 157 GAGV----GT--QLVSFADDEVVPIRFFNP-------------------------DGLERGLSLMESLKSTIFSEDSSRN 205 (518) Q Consensus 157 ~~~~----~~--~~~~~~~~evih~~~~~~-------------------------~~~~~G~s~l~~~~~~i~~~~~~~~ 205 (518) .... .. ....+.++.+.+++.... .....|.|.+..+...+.....+.. T Consensus 194 ~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S 273 (511) T protein:vir:99 194 KPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERRKGDYEKVITLIDLYDNAES 273 (511) T ss_pred eecccCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCccceEEecCCCCCCCchhhhHHHHHHHHHHHH Confidence 0000 00 111234444444432110 0012577778777777776665555 Q ss_pred HHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCcc-ccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHH Q lcl|NC_021305. 206 ATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSS-NTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEV 284 (518) Q Consensus 206 ~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~-n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~I 284 (518) -..+.+...+.|-.+++-....+.++....++.-.-...... 
..+...-.++|.++..+........+....+.+.+.| T Consensus 274 ~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I 353 (511) T protein:vir:99 274 DTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDI 353 (511) T ss_pred HHHHHHHHhhchhhhhccCcccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHH Confidence 555555555555555543333344443333221000000000 0001112344556655665555666777888889999 Q ss_pred HHHhcCCHHHhccccccccCCHHHHH-------------HHHHHHHhhHHHHHHHHHHHHhhhhhhc-ccccceecchhh Q lcl|NC_021305. 285 CGVYDIAPPIVHILDRATFSNISAQM-------------RAFYRDTMAIPIARIQSAMDKYVGQYWV-RKNRMKFDIDDV 350 (518) Q Consensus 285 a~~fgVPp~~lg~~~~~~~sn~e~~~-------------~~~~~~~l~P~~~~ie~~l~~~l~~~~~-~~~~~~fd~~~l 350 (518) +..-++|..-.+... +|- +..+.. ...+..++.-.+..|...+...--.... ....+++.+..- T Consensus 354 ~~~s~~P~~~~~~~~-gn~-Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~ 431 (511) T protein:vir:99 354 HMFTNTPNMKDDNFS-GTQ-SGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNRN 431 (511) T ss_pred HHHhCCccccccccc-ccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCC Confidence 999899865432211 121 111111 1111112222222222111111000000 112356667777 Q ss_pred hhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccc-----cccCCCCCCCCCCCCCccCC Q lcl|NC_021305. 351 IQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGAT-----PDGAVEWEEAPAPKRPASTP 425 (518) Q Consensus 351 ~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~-----~~~~~~~~~~~~~~~~~~~~ 425 (518) +..|..+.++.+.++. |++|..-++++++. ++++. .++ ..+... 
...+......+........+ T Consensus 432 ~p~n~~e~~~~~~kl~--GiiS~et~l~~l~~--v~D~~-~E~------~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~ 500 (511) T protein:vir:99 432 LPKSLIEELKAYIDSG--GKISQTTLMSLFSF--FQDPE-LEV------KKIEEDEKESIKKAQKNMYQDPRNINDDEQD 500 (511) T ss_pred CCcCHHHHHHHHHHHh--ccCCHHHHHHhCCC--CCCHH-HHH------HHHHHHHHHHHHHHhhcccccCCCCCCCCCC Confidence 8889999999998884 88998888888754 22221 111 011000 00000000000000000000 Q ss_pred CCCCCccccCCccccccchhcchhhHHH Q lcl|NC_021305. 426 VASLDQSPPTSVPGLSPTNSDRSTDSGK 453 (518) Q Consensus 426 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (518) . + . ....+.++ T Consensus 501 ~--~-~--------------~~~~d~~e 511 (511) T protein:vir:99 501 D--S-T--------------KDSIDKKE 511 (511) T ss_pred C--C-C--------------cCcccccC Confidence 0 0 0 00000000 No 188 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=98.36 E-value=1.1e-06 Score=53.29 Aligned_cols=387 Identities=11% Similarity=-0.024 Sum_probs=165.9 Q ss_pred CcCCCCCCCCcccccccchhhhhhhccccc--------ccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPA--------VGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMF 72 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~ 72 (518) ++-.... ..+....+..+.. |.... .+..... .....-..++....+|+..+.-+-.-|+.+-- T Consensus 35 ~i~~~~~--~~~~~~~~~~Yy~---g~~~i~~r~~~~~~~~~~~~---~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~ 106 (474) T protein:vir:95 35 LIDDHRK--QLDKITVGQRYYD---KDNDIVKQMKKVDVYGNIDY---DKPDWRITTNFHQNLVDQKVSYVASKPVTYSC 106 (474) T ss_pred HHHHHHH--HHHHHHHHHHHhc---ccCchhcccccccccccccc---ccccceeccchHHHHHHHHHhhhccCCceecc Confidence 0000000 0000000000000 00000 0000000 00001122355667888888888888877632 Q ss_pred ecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCc-e-ee Q lcl|NC_021305. 
73 TSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT-G-RY 150 (518) Q Consensus 73 ~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~-~-~~ 150 (518) .+ +.....+..++. | +.......+..+.+.+|.+|..+-++.+|++ .+..++|..+.+..+... . .. T Consensus 107 ~d-----~~~~~~l~~~~~--n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~ 175 (474) T protein:vir:95 107 ED-----ESVLKIIHDVLD--T---RWDNKLIDILTATSNKGIDWLQVYINENGEM-KLFRVPAEQAIPIWVDKEREELK 175 (474) T ss_pred Cc-----hHHHHHHHHHHh--c---cHHHHHHHHHHHHhhcCcEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceE Confidence 11 111222333332 2 3455566778999999999999888888864 577788888887776531 1 11 Q ss_pred EEeeecccccCceeEEeccccEEEEeccC---------------------C---------CCcccCchHHHHHHHHHHHH Q lcl|NC_021305. 151 EYYFQAGAGVGTQLVSFADDEVVPIRFFN---------------------P---------DGLERGLSLMESLKSTIFSE 200 (518) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~evih~~~~~---------------------~---------~~~~~G~s~l~~~~~~i~~~ 200 (518) .....+..........+....+.+++... + .....|.|-+..+...+... T Consensus 176 ~~i~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liDa~ 255 (474) T protein:vir:95 176 SFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSLIDAI 255 (474) T ss_pred EEEEEEEEcCeeEEEEEeCCeEEEEEEcCCccccccccCcccccccccccCCCccceEeecCCCCCCCcHHHHHHHHHHH Confidence 11111111111111222223333322100 0 01134777777777777766 Q ss_pred HHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHH Q lcl|NC_021305. 201 DSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLN 280 (518) Q Consensus 201 ~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~ 280 (518) ..+..-..+.+...+.|..+++--. .. +.+.+.. . 
...++++.++++.+...+..+.....+....+.+ T Consensus 256 d~~~S~~~~~~~~~~~p~lv~~g~~-~~--~~~~~~~----~----~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l 324 (474) T protein:vir:95 256 DKRLSDAQNMFDESVELIYILKGYE-GQ--DLEEFMR----G----LKYYKAINVDGDGGVETIQVEVPVSSTKEYIDLM 324 (474) T ss_pred HHHHHHHHHHHHHhcCceeeeecCC-cc--cchhhhh----h----hhccceeeccCCCceeEEeecCCHHHHHHHHHHH Confidence 6555555555565666655543211 11 1111111 1 1234566677666666666555666677888888 Q ss_pred HHHHHHHhcCCHHHhccccccccCCHHHHH-------------HHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecc Q lcl|NC_021305. 281 REEVCGVYDIAPPIVHILDRATFSNISAQM-------------RAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDI 347 (518) Q Consensus 281 ~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~-------------~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~ 347 (518) .+.|+..-++|..-.+.. .++ .+..+.. ...+...+...+..|...+ ........+++.+ T Consensus 325 ~~~i~~~s~~p~~~~~~~-~~n-~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~-----g~~~d~~~i~v~f 397 (474) T protein:vir:95 325 RAYIMEFGQGVDFQTDKF-GSA-PSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFN-----NLKMDVKDIEISF 397 (474) T ss_pred HHHHHHHhCCcccccccc-ccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----CCCcccceeeEEe Confidence 999999999985322111 111 1211111 1122222222222222111 1111122344555 Q ss_pred hhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCC Q lcl|NC_021305. 348 DDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVA 427 (518) Q Consensus 348 ~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (518) +.-...|..+.++. +.+.|++|...+.+++++- +++. .++ ..+........ +..+.. .....+.. 
T Consensus 398 ~~~~p~d~~e~a~~---~~~~g~iS~et~i~~l~~v--~d~~-~E~------~ri~~E~~~~~--~~~~~~-~~~~~d~~ 462 (474) T protein:vir:95 398 NFNRMMNDAEQSQI---IAQSQYLSRETLVKSSPLV--DDYK-AEL------ERIEQEQMEYN--KQLPNL-DDGGADGA 462 (474) T ss_pred ccCCCcCHHHHHHH---HHhcCCCchHHHHHhCCCC--CCHH-HHH------HHHHHHHHHHH--hccccc-ccccCCCC Confidence 55566676666655 4567999988888887543 2211 111 11110000000 000000 00001111 Q ss_pred CCCccccCCccc Q lcl|NC_021305. 428 SLDQSPPTSVPG 439 (518) Q Consensus 428 ~~~~~~~~~~~~ 439 (518) ++++++.+..++ T Consensus 463 ~~~~~~~~~~~~ 474 (474) T protein:vir:95 463 QQQERSNDKESE 474 (474) T ss_pred cCCCCCccCCCC Confidence 111111111111 No 189 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=98.33 E-value=1.3e-06 Score=52.85 Aligned_cols=396 Identities=10% Similarity=0.031 Sum_probs=178.8 Q ss_pred CcCCCCC---CC--Ccccc-----cccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEE Q lcl|NC_021305. 1 MLLANGQ---TL--SAPAM-----AELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKC 70 (518) Q Consensus 1 ~~f~~~~---~~--~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v 70 (518) +||-.+. .. +..+. ....-|-...-|-.+...-... ..............-..+++..|+-+..=|..+ T Consensus 17 ~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~-~~~~~~~~~~sln~~~~i~~~~A~lv~~e~~~i 95 (508) T protein:vir:15 17 ATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQAS-DGIKKKRLKNTINMAKTAARRIASVVFNEKAEI 95 (508) T ss_pred HhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccC-CCCccccceeecchHHHHHHHHHhhhhCCCceE Confidence 2222111 00 11110 0011121111111111100000 000011111122334667777777776555444 Q ss_pred EEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCcee- Q lcl|NC_021305. 
71 MFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGR- 149 (518) Q Consensus 71 ~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~- 149 (518) .-.+++ ..+..+..++.. -.+..-....+.+.+..|.+++.+..+..+ +.+..++|..+.+.....+.. T Consensus 96 ~v~~~~----~~~e~l~~il~~----n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~--~~i~~v~ad~~~P~~~d~~~~~ 165 (508) T protein:vir:15 96 HVKDNN----EADKFLNDVLED----NDFKNKFEEALEKGVALGGFAMRPYIDGNH--IKIAWVRADQFYPLQSNTNDIS 165 (508) T ss_pred EeCCch----HHHHHHHHHHHh----ccHHHHHHHHHHHHhhcCceEEEEEEeCCe--eEEEEEcCCeeEEEEEcCCCeE Confidence 321211 112222233321 223444556678888999999888876443 456667777765432222211 Q ss_pred ----------------eEEe---eec--------------ccc---cCceeEEe------------------ccccEEEE Q lcl|NC_021305. 150 ----------------YEYY---FQA--------------GAG---VGTQLVSF------------------ADDEVVPI 175 (518) Q Consensus 150 ----------------~~~~---~~~--------------~~~---~~~~~~~~------------------~~~evih~ 175 (518) .+|. ++. ... ..+..+.+ ..--+.|| T Consensus 166 ~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~g~~~p~f~y~ 245 (508) T protein:vir:15 166 EAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQVTISGLQRPLFAYF 245 (508) T ss_pred EEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCCCcceEecCCCcceeEEe Confidence 0110 000 000 00111110 00123455 Q ss_pred eccCCC----CcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccc---cCccCCHHHHHHHHHHHHHHhcCccc Q lcl|NC_021305. 176 RFFNPD----GLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLR---HEKRLSEAAQQRLREQFDRAHSGSSN 248 (518) Q Consensus 176 ~~~~~~----~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~---~~~~~~~~~~~~~~~~~~~~~~g~~n 248 (518) +.+.++ +..+|+|.+..+...+............-|+. +.+..++. ++. +++....+... ...|.+.. 
T Consensus 246 ~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~-~~~~i~v~~~~l~~--d~~~~~~~~~~-~~~~~~~~- 320 (508) T protein:vir:15 246 KTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRL-GQKHIAVQPGMLRF--DDEHKPTFDTE-QNVYVGVL- 320 (508) T ss_pred cCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHh-cccceeechHHhcC--CCCCccccCCC-CeeEEecc- Confidence 544332 23579999999999998888777666666754 44444441 111 11110001000 00011000 Q ss_pred cCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHH--HHHHHHHhhHHHHHHH Q lcl|NC_021305. 249 TGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM--RAFYRDTMAIPIARIQ 326 (518) Q Consensus 249 ~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~--~~~~~~~l~P~~~~ie 326 (518) + =.++|..++.++....+-++.+..+...+.|....|++|.-+|...++. .+..+.. ..-.-.++.-....++ T Consensus 321 -~---~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~-~TAtei~s~~~~~~~t~~~~~~~~~ 395 (508) T protein:vir:15 321 -S---DDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGV-KTATEVVSNNSMTYQTRSSYLTMVE 395 (508) T ss_pred -C---CCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCcc-ccHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 0123455777777778888999999999999999999999998765543 2333321 1111112222222333 Q ss_pred HHHHHhh---h--------hhh----------cccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh-CCCC Q lcl|NC_021305. 327 SAMDKYV---G--------QYW----------VRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIM-GLPR 384 (518) Q Consensus 327 ~~l~~~l---~--------~~~----------~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~-g~~p 384 (518) ..|...+ + ... ...+.+.|++++-+..|..+.++.+.+++.+|+++.-+++.+. 
|+ T Consensus 396 ~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aGi~s~e~~i~~~~g~-- 473 (508) T protein:vir:15 396 KAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIGALSKQTFLQRNYGM-- 473 (508) T ss_pred HHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCC-- Confidence 3332221 0 000 1123466888888999999999999999999999999988654 43 Q ss_pred CCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCc Q lcl|NC_021305. 385 SDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSV 437 (518) Q Consensus 385 ~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (518) +++..++.+ ..+. .+++...+..+...+.++.+- + T Consensus 474 -~deea~~el-----~ri~-------~E~~~~~~~~~~~~~~~g~~g-----e 508 (508) T protein:vir:15 474 -TDEQAAEEL-----AKIQ-------SEAPTDTFEGGRSAILNGGDG-----E 508 (508) T ss_pred -ChHHHHHHH-----HHHH-------HhccccCccccccccCCCCCC-----C Confidence 222222111 0010 011111111111111111100 0 No 190 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=98.32 E-value=1.4e-06 Score=52.71 Aligned_cols=379 Identities=11% Similarity=0.008 Sum_probs=166.2 Q ss_pred CcCCCCC-CCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANGQ-TLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) +-+-.|+ ........ ....+..+ ... 
...-..++.....|+..+.-+-.-|+.+--.++ T Consensus 50 ~~Yy~g~~~i~~~~~~------~~~~~~~~----~~~------~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~---- 109 (474) T protein:vir:95 50 QKYYDKDNDINYQAYK------QDLHGNID----YTK------PDWRITTNFHQNLVDQKVSYVAGKPVTYAHDDD---- 109 (474) T ss_pred HHHhcccCccccccch------hhhccccc----ccc------cccccccchHHHHHHhhhhhhcccCceeccCCh---- Confidence 0011111 00000000 00000000 000 000112345677888888888888887632211 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCC--ceeeEEeeecc Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSR--TGRYEYYFQAG 157 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~--~~~~~~~~~~~ 157 (518) .....+..++. | +.......+..+++.+|.+|..+-++.+|.+ .+..++|..+.++++.. +....+...+. T Consensus 110 -~~~~~l~~~~~--n---~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~ 182 (474) T protein:vir:95 110 -KVLDVIHQVLD--T---RWDNKLIDILTAASNKGIDWLQVYINEDGEL-KLFRVPAEQAIPIWTDKEREQLNAFIRIFT 182 (474) T ss_pred -HHHHHHHHHHh--c---cHHHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEe Confidence 11122223332 2 3555666788999999999999989888864 57778999988887643 22221111111 Q ss_pred cccCceeEEeccccEEEEeccC---------------------C---------CCcccCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 158 AGVGTQLVSFADDEVVPIRFFN---------------------P---------DGLERGLSLMESLKSTIFSEDSSRNAT 207 (518) Q Consensus 158 ~~~~~~~~~~~~~evih~~~~~---------------------~---------~~~~~G~s~l~~~~~~i~~~~~~~~~~ 207 (518) .........+.+..+.++.... + .....|.|-+......+.....+..-. 
T Consensus 183 ~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~ 262 (474) T protein:vir:95 183 FNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDV 262 (474) T ss_pred ecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHH Confidence 1111111222333333332110 0 011347777777777776666555445 Q ss_pred HHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 208 AAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGV 287 (518) Q Consensus 208 ~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~ 287 (518) .+.+...+.|-.+++- .+.++...+...+ ...+++.++++.+...+..+.....+....+.+.+.|... T Consensus 263 ~~~~~~~~~p~lv~~g---~~~~~~~~~~~~~--------~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~ 331 (474) T protein:vir:95 263 QNMFDESVELIYILRG---YEGEDLSEFMEGL--------KYYKAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEF 331 (474) T ss_pred HHHHHHhhcchhhhcC---CCcccccchhhhh--------hccceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHH Confidence 5555555556555432 1212222222221 1234566666666555655556666778888888999999 Q ss_pred hcCCHHHhccccccccCCHHHH-------------HHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcC Q lcl|NC_021305. 288 YDIAPPIVHILDRATFSNISAQ-------------MRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPD 354 (518) Q Consensus 288 fgVPp~~lg~~~~~~~sn~e~~-------------~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d 354 (518) -++|..-.... .++ .+..+. ....+...+...++.+.. 
.+ ........+++.+..-+..+ T Consensus 332 s~~p~~~~~~~-~~n-~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~----~~-g~~~d~~~i~i~f~~~~p~~ 404 (474) T protein:vir:95 332 GQGVDFQTDKF-GSA-TSGIALKFLYTNLNLKANKLKNKANVALQELMQFILD----FN-KIKLDAKEIEITFNFNVMVN 404 (474) T ss_pred hCCcCcccccc-ccc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----Hh-CCCcccceeeEEecCCCccC Confidence 99884322111 111 121111 111111222222222211 11 11111233455556667777 Q ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCcccc Q lcl|NC_021305. 355 WEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPP 434 (518) Q Consensus 355 ~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (518) ..+.++.+ ..+|++|...+++++++- +++. .+ +..+...........+ ...+ +......+..++. T Consensus 405 ~~e~a~~~---~~~giiS~et~~~~lp~v--~D~~-~E------~eri~~E~~~~~~~~~--~~~~-~~~~~~~~~~~~~ 469 (474) T protein:vir:95 405 DLEQSQIG---AQSQYLSKETLVRHHPWV--DDPK-AE------LERLDEEQLELNKQLP--NLDD-GGADGAQQQQQSE 469 (474) T ss_pred HHHHHHHH---HHcCCCChHHHHHhCCCC--CCHH-HH------HHHHHHHHHHHHhhcc--cccc-ccCCCCCCcCCCC Confidence 77776654 457999998898888653 2211 11 1111100000000000 0000 0000000000000 Q ss_pred CCccc Q lcl|NC_021305. 435 TSVPG 439 (518) Q Consensus 435 ~~~~~ 439 (518) +.+.+ T Consensus 470 ~~e~~ 474 (474) T protein:vir:95 470 NNQSK 474 (474) T ss_pred ccccC Confidence 01111 No 191 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=98.32 E-value=1.4e-06 Score=52.71 Aligned_cols=379 Identities=11% Similarity=0.008 Sum_probs=166.2 Q ss_pred CcCCCCC-CCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 
1 MLLANGQ-TLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) +-+-.|+ ........ ....+..+ ... ...-..++.....|+..+.-+-.-|+.+--.++ T Consensus 50 ~~Yy~g~~~i~~~~~~------~~~~~~~~----~~~------~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~~~---- 109 (474) T protein:vir:96 50 QKYYDKDNDINYQAYK------QDLHGNID----YTK------PDWRITTNFHQNLVDQKVSYVAGKPVTYAHDDD---- 109 (474) T ss_pred HHHhcccCccccccch------hhhccccc----ccc------cccccccchHHHHHHhhhhhhcccCceeccCCh---- Confidence 0011111 00000000 00000000 000 000112345677888888888888887632211 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCC--ceeeEEeeecc Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSR--TGRYEYYFQAG 157 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~--~~~~~~~~~~~ 157 (518) .....+..++. | +.......+..+++.+|.+|..+-++.+|.+ .+..++|..+.++++.. +....+...+. T Consensus 110 -~~~~~l~~~~~--n---~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~ 182 (474) T protein:vir:96 110 -KVLDVIHQVLD--T---RWDNKLIDILTAASNKGIDWLQVYINEDGEL-KLFRVPAEQAIPIWTDKEREQLNAFIRIFT 182 (474) T ss_pred -HHHHHHHHHHh--c---cHHHHHHHHHHHHhhCCeEEEEeeeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEe Confidence 11122223332 2 3555666788999999999999989888864 57778999988887643 22221111111 Q ss_pred cccCceeEEeccccEEEEeccC---------------------C---------CCcccCchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 158 AGVGTQLVSFADDEVVPIRFFN---------------------P---------DGLERGLSLMESLKSTIFSEDSSRNAT 207 (518) Q Consensus 158 ~~~~~~~~~~~~~evih~~~~~---------------------~---------~~~~~G~s~l~~~~~~i~~~~~~~~~~ 207 (518) .........+.+..+.++.... + .....|.|-+......+.....+..-. 
T Consensus 183 ~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~ 262 (474) T protein:vir:96 183 FNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDV 262 (474) T ss_pred ecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHH Confidence 1111111222333333332110 0 011347777777777776666555445 Q ss_pred HHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 208 AAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGV 287 (518) Q Consensus 208 ~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~ 287 (518) .+.+...+.|-.+++- .+.++...+...+ ...+++.++++.+...+..+.....+....+.+.+.|... T Consensus 263 ~~~~~~~~~p~lv~~g---~~~~~~~~~~~~~--------~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~ 331 (474) T protein:vir:96 263 QNMFDESVELIYILRG---YEGEDLSEFMEGL--------KYYKAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEF 331 (474) T ss_pred HHHHHHhhcchhhhcC---CCcccccchhhhh--------hccceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHH Confidence 5555555556555432 1212222222221 1234566666666555655556666778888888999999 Q ss_pred hcCCHHHhccccccccCCHHHH-------------HHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhcC Q lcl|NC_021305. 288 YDIAPPIVHILDRATFSNISAQ-------------MRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPD 354 (518) Q Consensus 288 fgVPp~~lg~~~~~~~sn~e~~-------------~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d 354 (518) -++|..-.... .++ .+..+. ....+...+...++.+.. 
.+ ........+++.+..-+..+ T Consensus 332 s~~p~~~~~~~-~~n-~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~----~~-g~~~d~~~i~i~f~~~~p~~ 404 (474) T protein:vir:96 332 GQGVDFQTDKF-GSA-TSGIALKFLYTNLNLKANKLKNKANVALQELMQFILD----FN-KIKLDAKEIEITFNFNVMVN 404 (474) T ss_pred hCCcCcccccc-ccc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----Hh-CCCcccceeeEEecCCCccC Confidence 99884322111 111 121111 111111222222222211 11 11111233455556667777 Q ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCcccc Q lcl|NC_021305. 355 WEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPP 434 (518) Q Consensus 355 ~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (518) ..+.++.+ ..+|++|...+++++++- +++. .+ +..+...........+ ...+ +......+..++. T Consensus 405 ~~e~a~~~---~~~giiS~et~~~~lp~v--~D~~-~E------~eri~~E~~~~~~~~~--~~~~-~~~~~~~~~~~~~ 469 (474) T protein:vir:96 405 DLEQSQIG---AQSQYLSKETLVRHHPWV--DDPK-AE------LERLDEEQLELNKQLP--NLDD-GGADGAQQQQQSE 469 (474) T ss_pred HHHHHHHH---HHcCCCChHHHHHhCCCC--CCHH-HH------HHHHHHHHHHHHhhcc--cccc-ccCCCCCCcCCCC Confidence 77776654 457999998898888653 2211 11 1111100000000000 0000 0000000000000 Q ss_pred CCccc Q lcl|NC_021305. 435 TSVPG 439 (518) Q Consensus 435 ~~~~~ 439 (518) +.+.+ T Consensus 470 ~~e~~ 474 (474) T protein:vir:96 470 NNQSK 474 (474) T ss_pred ccccC Confidence 01111 No 192 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=98.32 E-value=1.4e-06 Score=52.68 Aligned_cols=403 Identities=11% Similarity=0.046 Sum_probs=171.5 Q ss_pred CcCCCCCCC---Ccccccccchh--hhhhhccc-ccc--cccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_021305. 
1 MLLANGQTL---SAPAMAELSPQ--MQDSYYYA-PAV--GMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMF 72 (518) Q Consensus 1 ~~f~~~~~~---~~~~~~~~~~~--~~~~~~~~-~~~--~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~ 72 (518) +++.-.... ..-.......+ +..-+-+. +.. ..............-...+....+|+..+.-+..-|+.+-- T Consensus 28 ~~~~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~ 107 (481) T protein:vir:10 28 ELLKEENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHNYAKYVSRFIVGYLTGNPITITH 107 (481) T ss_pred hhcCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCccccccccccccceeecchHHHHHHHHHhhhccCCceEec Confidence 222211100 00000000000 00000000 000 00000000000001123456677888888878777776532 Q ss_pred ecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCce-eeE Q lcl|NC_021305. 73 TSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTG-RYE 151 (518) Q Consensus 73 ~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~-~~~ 151 (518) .++ ..+..+..++.+ .....+...+..+.+.+|.+|+.+.++.+|.+ .+..++|..+.+..+.... ... T Consensus 108 ~d~-----~~~~~l~~~~~~----n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~-~i~~~~p~~~~~v~d~~~~~~~~ 177 (481) T protein:vir:10 108 QDN-----QTNDKIIELNDL----NDADEVNSDLALNLSIYGRAYEIVYRDFEDRD-TFKVLDPKSTFVVYDQTLDKKVV 177 (481) T ss_pred CCh-----hHHHHHHHHHHh----cChhHHHHHHHHHHHhcCeEEEEEEeCCCCeE-EEEEEcccceEEEEcCCCCCceE Confidence 211 122234444443 24557888899999999999999999888875 5778899998887765431 111 Q ss_pred E---eeecccccCc---eeEEeccccEEEEeccC-----------C---------CCcccCchHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 152 Y---YFQAGAGVGT---QLVSFADDEVVPIRFFN-----------P---------DGLERGLSLMESLKSTIFSEDSSRN 205 (518) Q Consensus 152 ~---~~~~~~~~~~---~~~~~~~~evih~~~~~-----------~---------~~~~~G~s~l~~~~~~i~~~~~~~~ 205 (518) . .+......++ ....+.++.+.|+.... + .....|.|-+..+...+........ 
T Consensus 178 ~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~lida~~~~~s 257 (481) T protein:vir:10 178 AGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGTYHRVEEVEHYYNDVPIIEYLNDQFKQGDFENVIALIDLYDSAQS 257 (481) T ss_pred EEEEEEEEeeCCCceEEEEEEEecCeEEEEEecCCceeecccccccCCceeEEEeecCCCCCCchhhHHHHHHHHHHHHH Confidence 1 0111111111 11123334443332110 0 0112467777666666655544433 Q ss_pred HHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeee--cCCCcceeeccCChhhHHHHHHHHHHHHH Q lcl|NC_021305. 206 ATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMV--VEEGMEPIPLQLTAVEMQFIEARQLNREE 283 (518) Q Consensus 206 ~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~v--l~~g~~~~~l~~~~~d~~~~e~~~~~~~~ 283 (518) -....+...+.|..++.-....+++..+.++..- .+.. ..+... .+++.++..+........+.+..+...+. T Consensus 258 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~--~~~~---~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~ 332 (481) T protein:vir:10 258 DTANYMTDLNDAMLAIIGNVDLDSEDAKAFRDAN--MIHL---EPGTNANGSEGKAEVKYVYKQYDVAGVEAYKKRLQND 332 (481) T ss_pred HHHHHHHHhcCceeEeecCcCCCccchhhhhhcc--ceec---cccccccCCCCCcceeEEeecCCHHHHHHHHHHHHHH Confidence 3333444445566555433333444444443310 0000 001111 12334444444444556677888888999 Q ss_pred HHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHH----HHHHHHHHH---hhhh---hhc----ccccceecchh Q lcl|NC_021305. 284 VCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPI----ARIQSAMDK---YVGQ---YWV----RKNRMKFDIDD 349 (518) Q Consensus 284 Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~----~~ie~~l~~---~l~~---~~~----~~~~~~fd~~~ 349 (518) |...-++|....+... ++ .+.++. .+....+.-.+ ..+...+.+ .++. 
..+ ....+++.+.+ T Consensus 333 i~~~s~~p~~~~~~~~-~n-~Sg~Al--~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~ 408 (481) T protein:vir:10 333 IHKYTNTPDLNDEQFS-GV-QSGESM--KYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQHNYAELTITFTP 408 (481) T ss_pred HHHHhCCccccccccc-cc-cHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccceeeEEeCC Confidence 9999999976554322 22 122211 11111111111 111122111 1111 001 12245667778 Q ss_pred hhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCC Q lcl|NC_021305. 350 VIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASL 429 (518) Q Consensus 350 l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (518) ....|..+.++.+.++ .|+++...+.+++++- +++. +++ ..+.......... ......+...+ . T Consensus 409 ~~~~~~~~~a~~~~kl--~g~is~et~~~~l~~i--~d~~-~E~------~ri~~E~~~~~~~--~~~~~~~~~~~---~ 472 (481) T protein:vir:10 409 NLPKSMMESINAFNAL--SGGVSESTRLSLLDFI--DNPK-EEL------EKMQEEEAQREKQ--ADKRGYGEAFE---N 472 (481) T ss_pred CCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHH-HHH------HHHHHHHHHHHhh--hhhccCCccCC---C Confidence 8889999999999988 4789887788887652 2211 111 1111000000000 00000000000 0 Q ss_pred CccccCCcc Q lcl|NC_021305. 430 DQSPPTSVP 438 (518) Q Consensus 430 ~~~~~~~~~ 438 (518) ..++-++++ T Consensus 473 ~~~~dd~~g 481 (481) T protein:vir:10 473 HLNVDDSNG 481 (481) T ss_pred CCCCCCCCC Confidence 000000011 No 193 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=98.31 E-value=1.4e-06 Score=52.61 Aligned_cols=392 Identities=10% Similarity=0.047 Sum_probs=175.1 Q ss_pred CcCCCCCC---------------CCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhcc Q lcl|NC_021305. 
1 MLLANGQT---------------LSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALAR 65 (518) Q Consensus 1 ~~f~~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~ 65 (518) -++..... ...+....+..+. -|............. ...-+.++....+|+..+.-+-. T Consensus 17 ~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy---~g~~~i~~~~~~~~~---~~~ki~~n~~~~Ivd~~~~~l~g 90 (470) T protein:vir:99 17 FIFPKGEKLTSNELLGFIAYNETVLKPRYRENMKLY---LGKHKILTAPEKETG---ADNRIVVNSAKYVVDVYNGYFCG 90 (470) T ss_pred EEeCCCCCcCHHHHHHHHHHHHHhhHHHHHHHHHHh---ccccccccCcccccC---CcceeecchHHHHHHHHhhhhcc Confidence 00000000 0000000000000 000100000000000 01112334567788888887777 Q ss_pred CceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcC Q lcl|NC_021305. 66 LPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNS 145 (518) Q Consensus 66 l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~ 145 (518) -|+.+.-.+++. ....+..++. ..+.......+..+.+.+|.+|..+..+.+|.+ .+..++|..+.+..+. T Consensus 91 ~p~~~~~~~d~~----~~~~l~~~~~----~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~-~i~~~~p~~~~~i~d~ 161 (470) T protein:vir:99 91 IEPKLALLNDSS----KIDEIARWNR----QENFFDTINEISKQCDIFGRSIASIYQGEDARP-HLMYSSPNHAFIIYDD 161 (470) T ss_pred CCeeEeeCCchh----HHHHHHHHHH----hcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeE-EEEEEccceeEEEEcC Confidence 787764322111 1112223333 235667778889999999999999988888875 5777899999888776 Q ss_pred Ccee-e-E-EeeecccccCc---eeEEeccccEEEEecc-------------CC---------CCcccCchHHHHHHHHH Q lcl|NC_021305. 146 RTGR-Y-E-YYFQAGAGVGT---QLVSFADDEVVPIRFF-------------NP---------DGLERGLSLMESLKSTI 197 (518) Q Consensus 146 ~~~~-~-~-~~~~~~~~~~~---~~~~~~~~evih~~~~-------------~~---------~~~~~G~s~l~~~~~~i 197 (518) .... . . .+++.....+. ....+..+.++++... 
++ .....|.|-+..+...+ T Consensus 162 ~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~li 241 (470) T protein:vir:99 162 TVQRQPLAFVHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEEDTNAAGYAINPYGLVPAVEFFENEERQGIFDSIKTLI 241 (470) T ss_pred CCCcceEEEEEEEEEecCCeeEEEEEEEecCeEEEEEecccccccccccccccCCCccceEeecCCCCCCcchHhHHHHH Confidence 5321 1 1 11111111100 0112223333332210 00 11235788888777777 Q ss_pred HHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeec-----CCCcceeeccCChhhHH Q lcl|NC_021305. 198 FSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVV-----EEGMEPIPLQLTAVEMQ 272 (518) Q Consensus 198 ~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl-----~~g~~~~~l~~~~~d~~ 272 (518) .....+..-....+...+.|-.++.--....++..+.. ..+.. .+++.+ +.+.++..+........ T Consensus 242 Da~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g~~~-~~~~~--------~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 312 (470) T protein:vir:99 242 NALDKVISQKANQVEYFDNAYMYMIGFKLPEDDEGNPK-FDFKN--------NRVLYVSQLDPDTNPQIGFIAKPDADQM 312 (470) T ss_pred HHHHHHHHHHHHHHHHhcCceeeeecCCcccccccchh-hhhhh--------cceeeecCCCCCCCCcceEEeecCChHH Confidence 77776665555566666666666543221111211111 11111 122222 23444555555555555 Q ss_pred HHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHH-------------HHHHHHHhhHHHHHHHHHHHHhhhhhhcc Q lcl|NC_021305. 273 FIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM-------------RAFYRDTMAIPIARIQSAMDKYVGQYWVR 339 (518) Q Consensus 273 ~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~-------------~~~~~~~l~P~~~~ie~~l~~~l~~~~~~ 339 (518) +....+.+.+.|+..-++|....+.. .++ .+..+.. ...+..++.-.+..+...+...-... .. 
T Consensus 313 ~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n-~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~-~~ 389 (470) T protein:vir:99 313 QENLIQHLTDFIFMMAMVPNIQDKNF-AGN-SSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQ-EL 389 (470) T ss_pred HHHHHHHHHHHHHHHhCCcccccccc-ccC-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcc-cc Confidence 66778888999999999996543221 122 1222211 11222222222222222221111110 11 Q ss_pred cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCC-CCCCCCCC Q lcl|NC_021305. 340 KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAV-EWEEAPAP 418 (518) Q Consensus 340 ~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~-~~~~~~~~ 418 (518) ...+++.+..-+..|..+.++.+.++. |+++...++++++.-. ++...+ .+........ ..+....+ T Consensus 390 ~~~i~v~f~~~~p~~~~e~a~~~~kl~--giis~et~l~~l~~vd-~~~E~e---------ri~~E~~~~~~~~~~~~~~ 457 (470) T protein:vir:99 390 WSELDFKFTRNLPEDMASAIDNAKNAE--GIVSKKTQLGMIPDIE-PDAEMK---------QIAKEKADAIKQTQQLSMP 457 (470) T ss_pred cccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhCCCCC-HHHHHH---------HHHHHHHHHHHHHHhhcCC Confidence 234567777888899999999999885 7899888888875421 111111 1110000000 00000000 Q ss_pred CCCccCCCCCCCc Q lcl|NC_021305. 419 KRPASTPVASLDQ 431 (518) Q Consensus 419 ~~~~~~~~~~~~~ 431 (518) ...+..+++++++ T Consensus 458 ~d~~~~d~~~ee~ 470 (470) T protein:vir:99 458 IDILKRDNNAEEE 470 (470) T ss_pred CCcCCCCCCccCC Confidence 0011111111111 No 194 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=98.31 E-value=1.5e-06 Score=52.56 Aligned_cols=406 Identities=9% Similarity=0.039 Sum_probs=160.3 Q ss_pred CcCCC-------CCCCCcccccccchhhhhhhcccccccccccccch-hhhHHHhhcHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_021305. 
1 MLLAN-------GQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFS-LYGGIYKNQPWVRTVIAKRAQALARLPVKCMF 72 (518) Q Consensus 1 ~~f~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~ 72 (518) |-+.. -.....+....+..+.. |.....-........ .....-...+.....|+..+.-+-.-|+.+-- T Consensus 22 l~~~~i~~li~~~~~~~~~r~~~l~~YY~---g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~G~p~~~~~ 98 (506) T protein:vir:94 22 LTPNKIMKFITHHFNYQRPRLEMLDDYYQ---GYNLKILDKQSRRHEDGKADHRATHSFAKYIADFQTSYSVGNPINVKL 98 (506) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhc---CCCccccccccccccccCCcceeecchHHHHHHHhhhhhcccCceeec Confidence 00000 00000010000100000 000000000000000 00001123456677888888887777777532 Q ss_pred ecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCce-eeE Q lcl|NC_021305. 73 TSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTG-RYE 151 (518) Q Consensus 73 ~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~-~~~ 151 (518) .++ .....+..++.. -+.......+..+.+.+|.+|..+..+.+|.+ .+..++|..+.++.+.... ... T Consensus 99 ~d~-----~~~~~l~~~~~~----N~~~~~~~~~~~~~~~~G~a~~~v~~ded~~~-~i~~~~p~~~~~v~dd~~~~~~~ 168 (506) T protein:vir:94 99 PDD-----GSNSGFDTFNKA----NDVDAENYDLFLDMSRYGRAYEYVYRGEDNEE-HLAKLDPLDTFVIYSTDVDPKPI 168 (506) T ss_pred Ccc-----hHHHHHHHHHhc----cCHhHHHHHHHHHHHhcCeEEEEEEecCCCee-EEEEEcccceEEEecCCCCCceE Confidence 211 112223333332 24455666788899999999999999888864 5667899988887765321 111 Q ss_pred E--eeec-ccccCc-------eeEEeccc-------------------------cEEEEeccCCCCcccCchHHHHHHHH Q lcl|NC_021305. 152 Y--YFQA-GAGVGT-------QLVSFADD-------------------------EVVPIRFFNPDGLERGLSLMESLKST 196 (518) Q Consensus 152 ~--~~~~-~~~~~~-------~~~~~~~~-------------------------evih~~~~~~~~~~~G~s~l~~~~~~ 196 (518) + .++. ....+. ....+... .|++++.+ ..|.|.+...... 
T Consensus 169 ~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~~~sd~e~~~~l 243 (506) T protein:vir:94 169 MAVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIMGKMQVDTTKPITTFPVVEFKNS-----NFRLGDFENVLPL 243 (506) T ss_pred EEEEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCccceeccccccCCccceEEecCC-----CCCCCchhhhHHH Confidence 1 0000 000000 00001111 22333321 1355555555555 Q ss_pred HHHHHHHHHHHHHHHHccCCcccccccCcc---------------------CCHHHHHHHHHHHHH-HhcCccccCCeee Q lcl|NC_021305. 197 IFSEDSSRNATAAMWKNAGRPNLVLRHEKR---------------------LSEAAQQRLREQFDR-AHSGSSNTGKTMV 254 (518) Q Consensus 197 i~~~~~~~~~~~~~~~ng~~p~~il~~~~~---------------------~~~~~~~~~~~~~~~-~~~g~~n~g~~~v 254 (518) +.....+..-..+.......|-.+++-... ........+...+.. ..-.....+.+.. T Consensus 244 iDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (506) T protein:vir:94 244 IDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGMTVNG 323 (506) T ss_pred HHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHHHhhhhhcCeeeecccccccC Confidence 544443332222222222222222221000 000111111111111 1111111111122 Q ss_pred cCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH-------------HHHHHHHHHhhHH Q lcl|NC_021305. 255 VEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA-------------QMRAFYRDTMAIP 321 (518) Q Consensus 255 l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~-------------~~~~~~~~~l~P~ 321 (518) .+.+.++.-+..+.....+....+.+.+.|...-++|..-.... .++ .+..+ .....+...+... T Consensus 324 ~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n-~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~ 401 (506) T protein:vir:94 324 TQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENF-ASN-SSGVAMQYKVLGTVELASTKRRMFERGLYAR 401 (506) T ss_pred ccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccccc-ccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23344555566555666677888888999999999985322111 111 11111 1122233333333 Q ss_pred HHHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccc Q lcl|NC_021305. 
322 IARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQP 401 (518) Q Consensus 322 ~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~ 401 (518) +..+...++..--........+++.+..-+..|..+.++.+.++ .|++|...++++++.- +++-- + +.. T Consensus 402 ~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~lp~v--~d~~~-E------~~r 470 (506) T protein:vir:94 402 YQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKALVQA--GATLPQKYLYQQLPGV--TNPQD-I------VDM 470 (506) T ss_pred HHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHH-H------HHH Confidence 33333222211000000112356677788889999999999988 5899999999887542 22211 1 111 Q ss_pred cccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccch Q lcl|NC_021305. 402 LGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTN 444 (518) Q Consensus 402 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (518) +.. ....... ...... ....++.....+.+ +.+..+ T Consensus 471 i~~---E~~~~~~--~~~~~~-~~~~~~~~~~~~~~-~~~e~~ 506 (506) T protein:vir:94 471 MKE---QSANGDY--SFDQNG-VISNDGQTNTTATQ-TDEEVR 506 (506) T ss_pred HHH---HHHHHhh--cchhhc-CCCcccCccccccc-cccCCC Confidence 110 0000000 000000 00000000000000 000001 No 195 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=98.30 E-value=1.5e-06 Score=52.51 Aligned_cols=400 Identities=11% Similarity=0.034 Sum_probs=175.4 Q ss_pred CcCCCCCCCC-ccccc--------ccch-hhhhhh-cccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceE Q lcl|NC_021305. 1 MLLANGQTLS-APAMA--------ELSP-QMQDSY-YYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVK 69 (518) Q Consensus 1 ~~f~~~~~~~-~~~~~--------~~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~ 69 (518) +-.-+|.+.. ..+.- ..-+ |-...+ +..|....+ . .+. ......+.-..+++.+|+-+..-+.. 
T Consensus 12 ~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~--~--~~~-~~~~~~~l~~~i~~~~A~ll~~e~~~ 86 (518) T protein:vir:78 12 KGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYV--P--TVH-DKLMNSGTGNEIVVVAAEYISGKPLS 86 (518) T ss_pred HHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCC--C--ccc-cccccCChHHHHHHHHHHhhcCCCce Confidence 3333333221 11110 0000 100000 001111100 0 011 11122233456888888888665554 Q ss_pred EEEecCCc-ceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCc- Q lcl|NC_021305. 70 CMFTSGDT-ETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT- 147 (518) Q Consensus 70 v~~~~~~~-~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~- 147 (518) +--.+.+. ..+.....+..++.. -.+..-+...+.+.+..|.+++.+..+ +|+ +.+..++++.+.+....+. T Consensus 87 i~v~~~~~~d~e~~~~~l~~il~~----n~f~~~~~~~~e~a~a~G~~~~k~~~d-~~~-~~i~~v~ad~~~P~~~~g~~ 160 (518) T protein:vir:78 87 IDVTGVNGSKDENLTKQLKEALRI----DNFDSKSVKIVELAGGSGVSAVKINIL-NGR-PSISVHSSSQFWIDFKNNEP 160 (518) T ss_pred EEecCccccCcHHHHHHHHHHHHh----ccHHHHHHHHHHHhhccCceEEEEEEE-CCe-eEEEEEcCCeeEEEeecCcE Confidence 42222111 111111222222221 233444455677888889888877665 354 4566777777766543211 Q ss_pred -------------eeeEEe-eeccc----------------------ccCceeEE------------------------- Q lcl|NC_021305. 148 -------------GRYEYY-FQAGA----------------------GVGTQLVS------------------------- 166 (518) Q Consensus 148 -------------~~~~~~-~~~~~----------------------~~~~~~~~------------------------- 166 (518) ...+|. ..... ...+..+. T Consensus 161 ~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~~e~~~~ 240 (518) T protein:vir:78 161 FRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTNDIQLNHSV 240 (518) T ss_pred EEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCcccccccccccccccccccccccCccceee Confidence 000110 00000 00000000 Q ss_pred --ecccc-EEEEeccCCC----CcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccc-----cCccC-CHHHHH Q lcl|NC_021305. 
167 --FADDE-VVPIRFFNPD----GLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLR-----HEKRL-SEAAQQ 233 (518) Q Consensus 167 --~~~~e-vih~~~~~~~----~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~-----~~~~~-~~~~~~ 233 (518) ..... +.|++...++ +...|+|.+..+...+............-|+. +.+..++. ..... ...... T Consensus 241 ~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~~~~~~~~~ 319 (518) T protein:vir:78 241 SIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEK-TKTKIAASERMFRKKVNKSTDKEEW 319 (518) T ss_pred ccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHh-CCceeeechhHhccCCCCCCCcccc Confidence 00011 2333332222 33469999999999998888877666666775 44454442 11110 000000 Q ss_pred HHHHHHHHHhcCccccCCeeecCCCc----ceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH- Q lcl|NC_021305. 234 RLREQFDRAHSGSSNTGKTMVVEEGM----EPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA- 308 (518) Q Consensus 234 ~~~~~~~~~~~g~~n~g~~~vl~~g~----~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~- 308 (518) .+... .+.|.... .-.+.|. .++.+....++.++.+..+...+.|....|++|..+|... +.-+..|- T Consensus 320 ~fd~~-~~~y~~i~-----~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~-~~~TATei~ 392 (518) T protein:vir:78 320 SMNVD-EDYFMQFK-----GTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGN-REVKATEIW 392 (518) T ss_pred ccCCC-CceEEEec-----CcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCccc-ccccHHHHH Confidence 00000 00011000 0012222 3666777788889999999999999999999999887643 22122211 Q ss_pred HHHHHHHHHhhHHHHHHHHHHHHhh-------hhh--------hcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCH Q lcl|NC_021305. 309 QMRAFYRDTMAIPIARIQSAMDKYV-------GQY--------WVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATP 373 (518) Q Consensus 309 ~~~~~~~~~l~P~~~~ie~~l~~~l-------~~~--------~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~ 373 (518) ..++-.-.++.-....++..|...+ -.. 
....+.+.|++++-+..|..+.++.+.+++.+|+|++ T Consensus 393 s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~aGimS~ 472 (518) T protein:vir:78 393 SLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLNNMNSALAMSV 472 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhcCCCCH Confidence 1111111222233333333222211 000 0112357788899999999999999999999999999 Q ss_pred HHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCC Q lcl|NC_021305. 374 NEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTS 436 (518) Q Consensus 374 NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 436 (518) .++.+++... .+++..++-+ ..+. ..+... ..+.+.+-+. .++.++ T Consensus 473 e~~i~~~~~~-~~deea~~e~-----~ri~---~E~~~~-~~~~p~~~~g-------~~~~~g 518 (518) T protein:vir:78 473 EEKVKLIHPK-WEDEEIQAEV-----KRIY---LENAIG-EVPDPEAIGG-------METKGG 518 (518) T ss_pred HHHHHHhCCC-CCHHHHHHHH-----HHHH---HHhccc-CCCCCccccC-------CCCCCC Confidence 9976665322 2222222111 1110 000000 0000100000 111001 No 196 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=98.29 E-value=1.6e-06 Score=52.34 Aligned_cols=393 Identities=13% Similarity=0.031 Sum_probs=178.4 Q ss_pred Cc----CC--CCCC--CCcccccc-----cchhhhhhhcc-cccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccC Q lcl|NC_021305. 1 ML----LA--NGQT--LSAPAMAE-----LSPQMQDSYYY-APAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARL 66 (518) Q Consensus 1 ~~----f~--~~~~--~~~~~~~~-----~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l 66 (518) |+ +. .+.. .+..+... +.-|-. .+.+ .+....... 
..............-..+++..|+-+..= T Consensus 14 ~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~-~Y~g~~~~l~~~~~-~~~~~~~~~~slnl~~~i~~~~A~ll~~e 91 (505) T protein:vir:79 14 GSAAVGMTKSLGQIIDDPRINLPADEVERIARDKR-YYMDDFKQVTHKNS-YGDTQKHELQSVNVTKLASAKLASLIFNE 91 (505) T ss_pred hhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHH-HhcCCCcccccccc-CCCccccceeecchHHHHHHHHHhhhcCC Confidence 21 11 1100 01111100 011111 1111 000000000 00000001122233356777778777665 Q ss_pred ceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCC Q lcl|NC_021305. 67 PVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSR 146 (518) Q Consensus 67 ~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~ 146 (518) |..+--. +......+.+-...-.+..-....+.+.+..|.+++.+..+. |. +.+..++|..+.+...+. T Consensus 92 ~~~i~~~---------d~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D~-~~-~~i~~v~ad~~~P~~~d~ 160 (505) T protein:vir:79 92 QCQVTVS---------DETANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVDS-GK-IKLAWATADQVYPLQADT 160 (505) T ss_pred CceeecC---------ChHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEeC-Cc-eEEEEEcCCeeEEEEEcC Confidence 5544211 111122222211122345555677888889999988887763 33 456667777766543222 Q ss_pred cee--e---------------EEe-----------eecc--------cccCceeE---------------E---eccccE Q lcl|NC_021305. 147 TGR--Y---------------EYY-----------FQAG--------AGVGTQLV---------------S---FADDEV 172 (518) Q Consensus 147 ~~~--~---------------~~~-----------~~~~--------~~~~~~~~---------------~---~~~~ev 172 (518) +.. . +|. +.+. ....|..+ . ++.-.+ T Consensus 161 ~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~~p~f 240 (505) T protein:vir:79 161 NQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLKHPLF 240 (505) T ss_pred CCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhcccccccCcceeecCCCcceE Confidence 211 0 110 0000 00001100 1 111234 Q ss_pred EEEeccCCC----CcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccc----cccCccCCHHHHHHHHHHHHHHhc Q lcl|NC_021305. 
173 VPIRFFNPD----GLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLV----LRHEKRLSEAAQQRLREQFDRAHS 244 (518) Q Consensus 173 ih~~~~~~~----~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~i----l~~~~~~~~~~~~~~~~~~~~~~~ 244 (518) .||+.+.++ ..+.|+|.+..+...+............-|+.|-..-.+ ++.......+....-...|.. T Consensus 241 ~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~~~~~~~~fd~--- 317 (505) T protein:vir:79 241 AFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWLKTGSSYGGQASETHPPMFDP--- 317 (505) T ss_pred EEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcccCCCCcccccccccCCCc--- Confidence 566654333 234799999999999988887766666666655332222 322211111100000000100 Q ss_pred CccccCCeeec-CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHH------------- Q lcl|NC_021305. 245 GSSNTGKTMVV-EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM------------- 310 (518) Q Consensus 245 g~~n~g~~~vl-~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~------------- 310 (518) .......+-. +++..++.++....+.++.+..+...++|+...|+++..+|...++.. +..+.. T Consensus 318 -~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~-TAtei~s~~~~l~~t~~~~ 395 (505) T protein:vir:79 318 -DETVYQAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQ-TATEVVTNNSQTYQTRSSY 395 (505) T ss_pred -cceeeeeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccc-hHHHHHHHHhHHHHHHHHH Confidence 0000000111 223457778888888889999999999999999999999987655432 332221 Q ss_pred HHHHHHHhhHHHHHHHHHHHHhhhhhh--------cccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh-C Q lcl|NC_021305. 311 RAFYRDTMAIPIARIQSAMDKYVGQYW--------VRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIM-G 381 (518) Q Consensus 311 ~~~~~~~l~P~~~~ie~~l~~~l~~~~--------~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~-g 381 (518) ...++.+|..++..|........+... ...+.+.|++++-+..|..+.++...+++.+|+++.-+++.+. 
| T Consensus 396 ~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~Gi~s~e~~l~~~~~ 475 (505) T protein:vir:79 396 ITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAVQAQVMPKKQFLMRNYG 475 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCC Confidence 112222333333333222111111111 1123567888898999999999999999999999999888654 4 Q ss_pred CCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCC Q lcl|NC_021305. 382 LPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVAS 428 (518) Q Consensus 382 ~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (518) + +++..++.+ ..+. ..+.. ..|...+-+ ++ T Consensus 476 ~---~eeea~~el-----~ri~---~E~~~--~~p~~~~~g----g~ 505 (505) T protein:vir:79 476 L---DEEEADEWL-----AQID---AENST--AEPEFNQFG----GD 505 (505) T ss_pred C---ChHHHHHHH-----HHHH---Hhccc--cCCCchhcc----CC Confidence 3 222222111 1111 11110 011111111 11 No 197 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=98.25 E-value=2e-06 Score=51.82 Aligned_cols=400 Identities=10% Similarity=-0.003 Sum_probs=178.4 Q ss_pred CcCCCCCCC---Ccccccccchhhhh--hhcccccccccccc-cchh----hhHHHhhcHHHHHHHHHHHHhhccCceEE Q lcl|NC_021305. 1 MLLANGQTL---SAPAMAELSPQMQD--SYYYAPAVGMQLER-QFSL----YGGIYKNQPWVRTVIAKRAQALARLPVKC 70 (518) Q Consensus 1 ~~f~~~~~~---~~~~~~~~~~~~~~--~~~~~~~~~~~~~~-~~~~----~~~~~~~~~~v~~~v~~ia~~ia~l~~~v 70 (518) +||. +... ..+.. ..++.... ..+..|+.|-...- .... 
..........-..++..+|+-+..=+..+ T Consensus 17 ~~~~-~~~~~~~~~~~i-~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~~~~A~Ll~~e~~~i 94 (517) T protein:vir:98 17 ALSG-QTLKSINDHEKI-NIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRKLSADVLSGLVFNEQCEV 94 (517) T ss_pred Hhcc-cchhHhhcCCce-ecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHHHHHHHhhhhhcCCcceE Confidence 3443 2211 11111 11110000 00011111111000 0000 00011111222445566666554433333 Q ss_pred EEecCCccee------ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEc Q lcl|NC_021305. 71 MFTSGDTETE------ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRN 144 (518) Q Consensus 71 ~~~~~~~~~~------~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~ 144 (518) .-.+.+..+. .....+..++. .-.....++..+.+.+..|.+++.+..+..+ +.+..++++.+.+... T Consensus 95 ~v~d~~~~~~~~~~~~~~~e~l~~i~~----~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~~--~~I~~v~ad~~~Pl~~ 168 (517) T protein:vir:98 95 YVSDAKDEEKKDNSFKTAHEFIQHVFQ----HNKFIKNLSDYLEPTFALGGLTVRPYVDNGE--IEFSWALANAFYPLRS 168 (517) T ss_pred EecccccccccccchhHHHHHHHHHHH----hccHHHHHHHHHHHHhhhCCEEEEEEEeCCe--eEEEEEcCCeeEEEEe Confidence 2222221111 11122222222 2234455556777888889999888776432 4466677776654222 Q ss_pred CCcee-----------------eEEe---e-------------------ec--ccccCceeE-------------Eec-- Q lcl|NC_021305. 145 SRTGR-----------------YEYY---F-------------------QA--GAGVGTQLV-------------SFA-- 168 (518) Q Consensus 145 ~~~~~-----------------~~~~---~-------------------~~--~~~~~~~~~-------------~~~-- 168 (518) ..++. .+|. + +. .....|..+ .+. T Consensus 169 ~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~~~~e~l~~~~~~~g~ 248 (517) T protein:vir:98 169 NSNGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRIPLEELYEGMQEKTYIQGL 248 (517) T ss_pred cCCCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCccccccccccccccCCCcceeECCC Confidence 11111 0110 0 00 000001111 111 Q ss_pred c-ccEEEEeccCCC----CcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCccccc-----ccCccCCHHHHHHHHHH Q lcl|NC_021305. 
169 D-DEVVPIRFFNPD----GLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVL-----RHEKRLSEAAQQRLREQ 238 (518) Q Consensus 169 ~-~evih~~~~~~~----~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il-----~~~~~~~~~~~~~~~~~ 238 (518) . --+.||+.+.++ +..+|+|.+..+...+............-|+.|-. +.++ ....+.. .. ..... T Consensus 249 ~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~-~i~vp~~~l~~~~~~~--g~-~~~~~ 324 (517) T protein:vir:98 249 SRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQR-TVFVSDVMLRTVPDES--GM-PPPQV 324 (517) T ss_pred CcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCc-ceecChhhhccccCCC--Cc-ccCCC Confidence 0 112356554333 34579999999999998888776666666666443 3222 1111000 00 00000 Q ss_pred HH---HHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHH--HHH Q lcl|NC_021305. 239 FD---RAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM--RAF 313 (518) Q Consensus 239 ~~---~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~--~~~ 313 (518) |. ..|.+.. .-+++-.++.++...++-++.+..+...+.|+...|+++..+|....+. .++.+.. ..- T Consensus 325 ~d~~~~~y~~~~------~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~-kTATEi~s~~~~ 397 (517) T protein:vir:98 325 FDPDVNVYKSIR------MGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSM-KTATEIVSENDL 397 (517) T ss_pred CCcccceeeecc------CCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCccccccccccc-ccHHHHHHHHHH Confidence 00 0011000 0012334666777778889999999999999999999999999876543 3443321 112 Q ss_pred HHHHhhHHHHHHHHHHHHhh------------hhh-hcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh Q lcl|NC_021305. 314 YRDTMAIPIARIQSAMDKYV------------GQY-WVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIM 380 (518) Q Consensus 314 ~~~~l~P~~~~ie~~l~~~l------------~~~-~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~ 380 (518) .-.++.-+...++..|...+ +.. 
....+.+.+++++-+..|.++.++...+++.+|+|++-+++.++ T Consensus 398 ~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~aG~ms~~~~i~~~ 477 (517) T protein:vir:98 398 TYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRSALLRFYGQAKTFGFIPTVEAIQRI 477 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHHh Confidence 22334444444444443321 111 11234577889999999999999999999999999999987655 Q ss_pred -CCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCc Q lcl|NC_021305. 381 -GLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQ 431 (518) Q Consensus 381 -g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (518) |+. ++..++.+ ..+. ....+..+.+. .++...+..++++ T Consensus 478 ~g~~---eeeA~~e~-----~~i~---~E~~~~~~~~~-~~~~~~~~~gd~e 517 (517) T protein:vir:98 478 FKVP---KKTAEQWL-----EEIR---KDQIELDPVTI-SQRAQKRMFGDEE 517 (517) T ss_pred CCCC---hHHHHHHH-----HHHH---HhccccCCCCc-cccccCCCCCCCC Confidence 653 22332221 0110 00000000000 0000001110110 No 198 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=98.25 E-value=2.1e-06 Score=51.75 Aligned_cols=402 Identities=13% Similarity=0.117 Sum_probs=165.2 Q ss_pred ccccchhhhh----------------hhcccc--cccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecC Q lcl|NC_021305. 14 MAELSPQMQD----------------SYYYAP--AVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSG 75 (518) Q Consensus 14 ~~~~~~~~~~----------------~~~~~~--~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~ 75 (518) ......|+.. .+.-+. ....+.. 
-....+..-....+...+|+..+..+--.+|.+ .+ T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~-~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~---~~ 76 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG-APPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SE 76 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccc-cchhhhhhhhhcchHHHHHHHHHhhhccCceec---CC Confidence 0000011100 000000 0000000 000011111223445667777777664444432 21 Q ss_pred CcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEc------CCCceEEEEeeCCceeEEEEcCCce- Q lcl|NC_021305. 76 DTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKN------KSGTPEKLMPMHPSRVAIKRNSRTG- 148 (518) Q Consensus 76 ~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~------~~G~~~~l~~l~p~~v~v~~~~~~~- 148 (518) + .. ....+..++.+ | ........+..+.+.+|.+|+.+.++ ..|. ..+.+++|..+.+.++.... T Consensus 77 d--~~-~~~~l~~i~~~-N---~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~-~~i~~~~p~~~~~i~D~~~~~ 148 (480) T protein:vir:78 77 D--SE-GLEELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGI-PLIRVESPLYMYAELDPRNTR 148 (480) T ss_pred C--ch-hHHHHHHHHHh-c---CHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCe-eEEEEEcccceEEEEcCCCcc Confidence 1 11 12333444433 2 34556777899999999999887653 3444 35778899998888865321 Q ss_pred -ee-EEeeecccccC---ceeEE-----------------------------eccccEEEEeccCCCCcccCchHHHH-H Q lcl|NC_021305. 149 -RY-EYYFQAGAGVG---TQLVS-----------------------------FADDEVVPIRFFNPDGLERGLSLMES-L 193 (518) Q Consensus 149 -~~-~~~~~~~~~~~---~~~~~-----------------------------~~~~evih~~~~~~~~~~~G~s~l~~-~ 193 (518) .. ...++...... ..... 
+..=.|+||.++...+..+|.|-+.- + T Consensus 149 ~~~~~i~~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i 228 (480) T protein:vir:78 149 RVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPEL 228 (480) T ss_pred ceEEEEEEEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEeecccccCCccCccchhHHH Confidence 11 11111000000 00001 11124566655443444568776542 3 Q ss_pred HHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecC-CCcceeeccCChhhHH Q lcl|NC_021305. 194 KSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVE-EGMEPIPLQLTAVEMQ 272 (518) Q Consensus 194 ~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~-~g~~~~~l~~~~~d~~ 272 (518) ...+.....+..-......-.+.|.-++. .....+...+.-...|.. ..+.++.++ ++.++.++.....+ . T Consensus 229 ~~l~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~-~ 300 (480) T protein:vir:78 229 RKVTDAASRTLMNLQSASQILGTPLRVIS-GVTTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAELR-N 300 (480) T ss_pred HHHHHHHHHHHHHHHHHHHhhcchhhhhh-CCCccccccccccchhhh------hhhhhccCCCCCceEEecCccCHH-H Confidence 33333333222222222222334443332 111111100100111211 112344444 45667666544322 3 Q ss_pred HHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHHHHH----HHHHhh------hhhhc--cc Q lcl|NC_021305. 273 FIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQS----AMDKYV------GQYWV--RK 340 (518) Q Consensus 273 ~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~----~l~~~l------~~~~~--~~ 340 (518) +.+..+..+.+|+..-++|++.+|.... |.++.++ ..+....+.-.+...+. .|.+.+ ..... .. 
T Consensus 301 ~~~~l~~~i~~~~~~~~~p~~~fg~~~~-n~~Sg~A--l~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~~~~~~~~~ 377 (480) T protein:vir:78 301 FAEEMEVFRKEAASITGLPPQYLSSSSE-NPASAEA--IIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEY 377 (480) T ss_pred HHHHHHHHHHHHhcccCCCHHHhccccC-chhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccc Confidence 7787888889999999999999975332 2222222 11222222222222211 121111 11101 12 Q ss_pred ccceecchhhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceeeeccccccccccc---ccCCCCCCC Q lcl|NC_021305. 341 NRMKFDIDDVIQPDWEAKSESTQKMVNSG--VATPNEGREIMGLPRSDDPKADELYANSALQPLGATP---DGAVEWEEA 415 (518) Q Consensus 341 ~~~~fd~~~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~---~~~~~~~~~ 415 (518) ..+++.|.+....+..+.++.+.+++.+| +++..-+++++|+.+-+.....+........+++... .++...+ T Consensus 378 ~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 455 (480) T protein:vir:78 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADAT-- 455 (480) T ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccCCCccc-- Confidence 34566667777888889999999998876 6777677888888654311111111111111111110 1111111 Q ss_pred CCCCCCccCCCCCCCccccCCccccccchhcc Q lcl|NC_021305. 416 PAPKRPASTPVASLDQSPPTSVPGLSPTNSDR 447 (518) Q Consensus 416 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (518) +++ ...+..++.++.+.+.. .+..| T Consensus 456 ~~~---~~~~~~~~~~~~~~~~~----~~~~~ 480 (480) T protein:vir:78 456 PKP---TVTETKTETQTSPSGFN----RTKTR 480 (480) T ss_pred cCC---CCCCCCCccCCCcccCC----CcCCC Confidence 111 00111111111111111 11111 No 199 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=98.22 E-value=2.4e-06 Score=51.40 Aligned_cols=380 Identities=12% Similarity=0.056 Sum_probs=159.8 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 
1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) .|...-... .+.-+.+..+.. |.......+... ....+..-..+.+...+|+..+..+--..| +.. T Consensus 11 ~l~~~~~~~-~~r~~~l~~Yy~---G~~~i~~~~~~~-~~~~~~~k~~~n~~~~ivd~~~~~l~~~g~---~~~------ 76 (441) T protein:vir:80 11 GMYDRIQRL-SSWHCCIEGYYE---GSNRVRDLGVAI-PPELQRVQTVVSWPGIAVDALEERLDWLGW---TNG------ 76 (441) T ss_pred HHHHHHHHH-HHHHHHHHHHHh---cCCcchhcCccc-chhhhhhhhhcchHHHHHHHHHhhhccccc---cCC------ Confidence 111100000 000000000000 000000000000 001112222334455666666655422222 111 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeE--Eeeeccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYE--YYFQAGA 158 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~--~~~~~~~ 158 (518) .+..+..++.. -+.......+..+++.+|.+|+.+.++..|.+ .+.+++|..+.++++....... +.+++.. T Consensus 77 -d~~~l~~i~~~----n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~ 150 (441) T protein:vir:80 77 -DGYGLDGVYAA----NRLATASCDVHLDALIFGLSFVAIIPHGDGTV-SVRPQSPKNCTGKFSADGSRLDAGLVVQQTC 150 (441) T ss_pred -ChHHHHHHHHh----cCHHHHHHHHHHHHhhcCeeEEEEEeCCCCce-EEEEEccceEEEEEeCCCCceeEEEEEEEEe Confidence 11123333332 24677778889999999999999999888876 5788999999887775432211 1111100 Q ss_pred cc--------------------Ccee-------EEeccccEEEEeccCCCCcccCchHHHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 159 GV--------------------GTQL-------VSFADDEVVPIRFFNPDGLERGLSLMES-LKSTIFSEDSSRNATAAM 210 (518) Q Consensus 159 ~~--------------------~~~~-------~~~~~~evih~~~~~~~~~~~G~s~l~~-~~~~i~~~~~~~~~~~~~ 210 (518) .. .+.. ..+..=.|+||.+....+..+|.|-+.- +...+........-.... 
T Consensus 151 ~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~ 230 (441) T protein:vir:80 151 DPEVVEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVN 230 (441) T ss_pred cCceEEEEEEecCeEEEEEEcCCcceeeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHHHHHHHH Confidence 00 0000 0111224566655444444568775532 333333333332222223 Q ss_pred HHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCC-----cceeeccCChhhHHHHHHHHHHHHHHH Q lcl|NC_021305. 211 WKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEG-----MEPIPLQLTAVEMQFIEARQLNREEVC 285 (518) Q Consensus 211 ~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g-----~~~~~l~~~~~d~~~~e~~~~~~~~Ia 285 (518) ..-.+.|--+++ ...+++...+. |+. ..+++.-++.+ .++.++..+.. -.+.+.++..+..|+ T Consensus 231 ~~~~~~~~~~i~-G~~~~~~~~~~----~~~------~~~~i~~~~~~~~~~~~~~~~~~~~~~-~~~~~~l~~~i~~~~ 298 (441) T protein:vir:80 231 RDFYAYPQRWVT-GVSADEFSQPG----WVL------SMASVWAVDKDDDGDTPNVGSFPVNSP-TPYSDQMRLLAQLTA 298 (441) T ss_pred HHhhcCceeeee-cCCccccccch----hhh------cccccccCCCCCCCCcceeEecCccch-HHHHHHHHHHHHHHh Confidence 333344544442 11222221111 111 12233333322 34444433222 236777788889999 Q ss_pred HHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHHHHH----HHHH---hhhhh---hc----ccccceecchhhh Q lcl|NC_021305. 286 GVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQS----AMDK---YVGQY---WV----RKNRMKFDIDDVI 351 (518) Q Consensus 286 ~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~----~l~~---~l~~~---~~----~~~~~~fd~~~l~ 351 (518) ..-++|++.+|.... +.++.++.. +....+.-.+...+. .|.+ .++.- .. 
....+++.+.+.+ T Consensus 299 ~~~~~p~~~~g~~~~-~~~Sg~Al~--~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~~i~~~f~~~~ 375 (441) T protein:vir:80 299 GEAAVPERYFGFITS-NPPSGEALA--AEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDEADFFGDVGLRWRDAS 375 (441) T ss_pred cccCCCHHHhccCCC-cchHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccceeeeEEeCCCC Confidence 999999998876443 212222211 111111111111111 1111 11111 01 1134567777888 Q ss_pred hcCHHHHHHHHHHHHhCCCcC--HHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCC Q lcl|NC_021305. 352 QPDWEAKSESTQKMVNSGVAT--PNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASL 429 (518) Q Consensus 352 ~~d~~~~~~~~~~~~~~G~~T--~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 429 (518) ..+..+.++.+.+++.+|+++ ..-+++.+|+.+-+ ..+ +............+..+ ...++ T Consensus 376 ~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~e---~~~---------~~~e~~e~~~~~~~~~~--~~~~~---- 437 (441) T protein:vir:80 376 TPTRAATADAVTKLVGAGILPADSRTVLEMLGLDDVQ---VEA---------VMRHRAESSDPLAVLAG--AISRQ---- 437 (441) T ss_pred CcCHHHHHHHHHHHHhcCcccccHHHHHHhCCCCHHH---HHH---------HHHHHHHHHHHHHHHhh--hhhcc---- Confidence 999999999999999999764 34467777765321 111 10000000000000000 00000 Q ss_pred Cccc Q lcl|NC_021305. 430 DQSP 433 (518) Q Consensus 430 ~~~~ 433 (518) .++. T Consensus 438 ~~~~ 441 (441) T protein:vir:80 438 TNEV 441 (441) T ss_pred cccC Confidence 0000 No 200 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=98.22 E-value=2.4e-06 Score=51.38 Aligned_cols=414 Identities=9% Similarity=0.014 Sum_probs=173.0 Q ss_pred CcCCCC--------CCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_021305. 1 MLLANG--------QTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMF 72 (518) Q Consensus 1 ~~f~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~ 72 (518) |+-... .....+....+..+.. 
|..+................-...+.....|+..+.-+-+-|+.+-- T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~---g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~ 115 (511) T protein:vir:96 39 LQNVNEVSKYIEHHMDYQRPRLKVLSDYYE---GKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQD 115 (511) T ss_pred hcCHHHHHHHHHHHHHhhhHHHHHHHHHhh---ccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeec Confidence 110000 0000000000111110 01100000000000000000112345567778888777777777621 Q ss_pred ecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCc--eee Q lcl|NC_021305. 73 TSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT--GRY 150 (518) Q Consensus 73 ~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~--~~~ 150 (518) .+ ......+..++.. -....+...+..+++.+|.+|.++-++.+|. ..+..++|..+.++.+... ... T Consensus 116 ~d-----~~~~~~l~~~~~~----n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~-~~i~~~~p~~~~~v~dd~~~~~~~ 185 (511) T protein:vir:96 116 DD-----KDVLEAIEAFNDL----NDVESHNRSLGLDLSIYGKAYELMIRNQDDE-TRLYKSDAMSTFIIYDNTVERNSI 185 (511) T ss_pred Cc-----hHHHHHHHHHHhh----cChhHHHHHHHHHHHhcCeeEEEEEeCCCCc-eEEEEEcccceEEEEcCCCCCceE Confidence 11 1111223333333 2344566778889999999999999988886 4577889999988877543 111 Q ss_pred EE-eeecc-ccc--Cc----eeEEeccccEEEEeccCC-------------------------CCcccCchHHHHHHHHH Q lcl|NC_021305. 151 EY-YFQAG-AGV--GT----QLVSFADDEVVPIRFFNP-------------------------DGLERGLSLMESLKSTI 197 (518) Q Consensus 151 ~~-~~~~~-~~~--~~----~~~~~~~~evih~~~~~~-------------------------~~~~~G~s~l~~~~~~i 197 (518) .. .++.. ... .. ....+.++.+.++..... 
.....|.|-++.+...+ T Consensus 186 ~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~li 265 (511) T protein:vir:96 186 AGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLI 265 (511) T ss_pred EEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchhhhHHHH Confidence 11 11110 000 00 111234444444432110 01125778888777777 Q ss_pred HHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccc-cCCeeecCCCcceeeccCChhhHHHHHH Q lcl|NC_021305. 198 FSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSN-TGKTMVVEEGMEPIPLQLTAVEMQFIEA 276 (518) Q Consensus 198 ~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n-~g~~~vl~~g~~~~~l~~~~~d~~~~e~ 276 (518) .....+..-..+.+...+.|-.+++-....+.++.....+...-......- .+.-.-.+.+.++..+........+... T Consensus 266 Da~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~ 345 (511) T protein:vir:96 266 DLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAY 345 (511) T ss_pred HHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHH Confidence 766665555555555555565555443334444433332211000000000 0000011234444445544455556778 Q ss_pred HHHHHHHHHHHhcCCHHHhccccccccCCHHHHH-------------HHHHHHHhhHHHHHHHHHHHHhhhhh-hccccc Q lcl|NC_021305. 277 RQLNREEVCGVYDIAPPIVHILDRATFSNISAQM-------------RAFYRDTMAIPIARIQSAMDKYVGQY-WVRKNR 342 (518) Q Consensus 277 ~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~-------------~~~~~~~l~P~~~~ie~~l~~~l~~~-~~~~~~ 342 (518) .+.+.+.|+..-++|..-.+... ++ .+..+.. ...+...+.-.+..|...+...-... ...... 
T Consensus 346 ~~~L~~~I~~~s~~P~~~~~~~~-~n-~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~ 423 (511) T protein:vir:96 346 KDRLNSDIHMFTNTPNMKDDNFS-GT-QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNT 423 (511) T ss_pred HHHHHHHHHHHhCCccccccccc-cc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc Confidence 88888999999999865433221 12 1111111 11222222222222222211110000 001124 Q ss_pred ceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCc Q lcl|NC_021305. 343 MKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPA 422 (518) Q Consensus 343 ~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~ 422 (518) +++.+..-+..|..+.++.+.++. |+++..-+.+++++ ++++. +++ ..+.................++. T Consensus 424 i~~~f~~~~p~n~~e~~d~~~kl~--G~iS~et~l~~l~~--v~d~~-~El------~ri~~E~~~~~~~~~~~~~~~~~ 492 (511) T protein:vir:96 424 VRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSF--FQDPE-LEV------KKIEEDEKESIKKAQKGIYKDPR 492 (511) T ss_pred ceEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCC--CCCHH-HHH------HHHHHHHHHHHHHHhhccccCCC Confidence 567777888899999999999884 88998888887754 22211 111 11111000000000000000000 Q ss_pred c-CCCCCCCccccCCccccccchhc Q lcl|NC_021305. 423 S-TPVASLDQSPPTSVPGLSPTNSD 446 (518) Q Consensus 423 ~-~~~~~~~~~~~~~~~~~~~~~~~ 446 (518) . .+...++++.+.+..+ . T Consensus 493 ~~~~~~~~~~~~~~~~e~------~ 511 (511) T protein:vir:96 493 DINDDEQDDDTKDTVDKK------E 511 (511) T ss_pred CCCCCCCCCCccCccccc------C Confidence 0 0000000000000000 0 No 201 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=98.22 E-value=2.4e-06 Score=51.38 Aligned_cols=414 Identities=9% Similarity=0.014 Sum_probs=173.0 Q ss_pred CcCCCC--------CCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEE Q lcl|NC_021305. 
1 MLLANG--------QTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMF 72 (518) Q Consensus 1 ~~f~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~ 72 (518) |+-... .....+....+..+.. |..+................-...+.....|+..+.-+-+-|+.+-- T Consensus 39 ~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~---g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~ 115 (511) T protein:vir:78 39 LQNVNEVSKYIEHHMDYQRPRLKVLSDYYE---GKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQD 115 (511) T ss_pred hcCHHHHHHHHHHHHHhhhHHHHHHHHHhh---ccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeec Confidence 110000 0000000000111110 01100000000000000000112345567778888777777777621 Q ss_pred ecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCc--eee Q lcl|NC_021305. 73 TSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT--GRY 150 (518) Q Consensus 73 ~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~--~~~ 150 (518) .+ ......+..++.. -....+...+..+++.+|.+|.++-++.+|. ..+..++|..+.++.+... ... T Consensus 116 ~d-----~~~~~~l~~~~~~----n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~-~~i~~~~p~~~~~v~dd~~~~~~~ 185 (511) T protein:vir:78 116 DD-----KDVLEAIEAFNDL----NDVESHNRSLGLDLSIYGKAYELMIRNQDDE-TRLYKSDAMSTFIIYDNTVERNSI 185 (511) T ss_pred Cc-----hHHHHHHHHHHhh----cChhHHHHHHHHHHHhcCeeEEEEEeCCCCc-eEEEEEcccceEEEEcCCCCCceE Confidence 11 1111223333333 2344566778889999999999999988886 4577889999988877543 111 Q ss_pred EE-eeecc-ccc--Cc----eeEEeccccEEEEeccCC-------------------------CCcccCchHHHHHHHHH Q lcl|NC_021305. 151 EY-YFQAG-AGV--GT----QLVSFADDEVVPIRFFNP-------------------------DGLERGLSLMESLKSTI 197 (518) Q Consensus 151 ~~-~~~~~-~~~--~~----~~~~~~~~evih~~~~~~-------------------------~~~~~G~s~l~~~~~~i 197 (518) .. .++.. ... .. ....+.++.+.++..... 
.....|.|-++.+...+ T Consensus 186 ~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~li 265 (511) T protein:vir:78 186 AGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERRKGDYEKVITLI 265 (511) T ss_pred EEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCCCCchhhhHHHH Confidence 11 11110 000 00 111234444444432110 01125778888777777 Q ss_pred HHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccc-cCCeeecCCCcceeeccCChhhHHHHHH Q lcl|NC_021305. 198 FSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSN-TGKTMVVEEGMEPIPLQLTAVEMQFIEA 276 (518) Q Consensus 198 ~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n-~g~~~vl~~g~~~~~l~~~~~d~~~~e~ 276 (518) .....+..-..+.+...+.|-.+++-....+.++.....+...-......- .+.-.-.+.+.++..+........+... T Consensus 266 Da~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~ 345 (511) T protein:vir:78 266 DLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAY 345 (511) T ss_pred HHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHH Confidence 766665555555555555565555443334444433332211000000000 0000011234444445544455556778 Q ss_pred HHHHHHHHHHHhcCCHHHhccccccccCCHHHHH-------------HHHHHHHhhHHHHHHHHHHHHhhhhh-hccccc Q lcl|NC_021305. 277 RQLNREEVCGVYDIAPPIVHILDRATFSNISAQM-------------RAFYRDTMAIPIARIQSAMDKYVGQY-WVRKNR 342 (518) Q Consensus 277 ~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~-------------~~~~~~~l~P~~~~ie~~l~~~l~~~-~~~~~~ 342 (518) .+.+.+.|+..-++|..-.+... ++ .+..+.. ...+...+.-.+..|...+...-... ...... 
T Consensus 346 ~~~L~~~I~~~s~~P~~~~~~~~-~n-~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~~ 423 (511) T protein:vir:78 346 KDRLNSDIHMFTNTPNMKDDNFS-GT-QSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFNT 423 (511) T ss_pred HHHHHHHHHHHhCCccccccccc-cc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc Confidence 88888999999999865433221 12 1111111 11222222222222222211110000 001124 Q ss_pred ceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCc Q lcl|NC_021305. 343 MKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPA 422 (518) Q Consensus 343 ~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~ 422 (518) +++.+..-+..|..+.++.+.++. |+++..-+.+++++ ++++. +++ ..+.................++. T Consensus 424 i~~~f~~~~p~n~~e~~d~~~kl~--G~iS~et~l~~l~~--v~d~~-~El------~ri~~E~~~~~~~~~~~~~~~~~ 492 (511) T protein:vir:78 424 VRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSF--FQDPE-LEV------KKIEEDEKESIKKAQKGIYKDPR 492 (511) T ss_pred ceEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCC--CCCHH-HHH------HHHHHHHHHHHHHHhhccccCCC Confidence 567777888899999999999884 88998888887754 22211 111 11111000000000000000000 Q ss_pred c-CCCCCCCccccCCccccccchhc Q lcl|NC_021305. 423 S-TPVASLDQSPPTSVPGLSPTNSD 446 (518) Q Consensus 423 ~-~~~~~~~~~~~~~~~~~~~~~~~ 446 (518) . .+...++++.+.+..+ . T Consensus 493 ~~~~~~~~~~~~~~~~e~------~ 511 (511) T protein:vir:78 493 DINDDEQDDDTKDTVDKK------E 511 (511) T ss_pred CCCCCCCCCCccCccccc------C Confidence 0 0000000000000000 0 No 202 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=98.16 E-value=3.3e-06 Score=50.65 Aligned_cols=390 Identities=11% Similarity=0.049 Sum_probs=177.0 Q ss_pred CcCCCC--CCCCcccccccch--------hhhhhhccccccccc-ccccchhhhHHHhhcHHHHHHHHHHHHhhccCceE Q lcl|NC_021305. 
1 MLLANG--QTLSAPAMAELSP--------QMQDSYYYAPAVGMQ-LERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVK 69 (518) Q Consensus 1 ~~f~~~--~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~ 69 (518) -||+.. .....+.. ..++ |.. .+.+.+ .... .................-..+++..|+-+..-|.. T Consensus 17 ~~~~~~~~~~~~~~~i-~~~~~~~~~i~~~~~-~Y~g~~-~~~~~~~~~~~~~~~~~~slnl~~~i~~~~A~lv~~e~~~ 93 (500) T protein:vir:98 17 VMTTQSLTNITDHPKI-AISKLEYDRITTNLK-YYKSDW-DSVLYLNTDGETKKRDLNHLPIARTAAKKIASLVFNEQAE 93 (500) T ss_pred Hhhcchhhhhhccccc-cCCHHHHHHHHHHHH-HhcCCC-CCcccccCCCCcccCceeecchHHHHHHHHhhhhcCCcce Confidence 123311 11111111 1222 211 111111 0000 00000011111222233456777777777665544 Q ss_pred EEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCcee Q lcl|NC_021305. 70 CMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGR 149 (518) Q Consensus 70 v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~ 149 (518) +.-. +......+.+--..-.+..-+...+...+..|.+++.+..+. |. +.+..++|+.+.+.....++. T Consensus 94 i~~~---------d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~-~~I~~v~ad~~~P~~~d~~~~ 162 (500) T protein:vir:98 94 IKVD---------DDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DK-VRVAFVQAPVFLPLQSNTQDV 162 (500) T ss_pred EecC---------ChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEEEcCCeeEEEEEcCCCe Confidence 3211 112222222211223345556667788888999988887764 33 446667777766533221111 Q ss_pred -----------------eEEe---eec-ccc----------------cCceeEE----------------eccccEEEEe Q lcl|NC_021305. 150 -----------------YEYY---FQA-GAG----------------VGTQLVS----------------FADDEVVPIR 176 (518) Q Consensus 150 -----------------~~~~---~~~-~~~----------------~~~~~~~----------------~~~~evih~~ 176 (518) .+|. ++. ... ..+..+. 
++.--+.||+ T Consensus 163 ~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~~~ 242 (500) T protein:vir:98 163 SSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTYLK 242 (500) T ss_pred EEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccEEEec Confidence 0110 000 000 0010000 0011234665 Q ss_pred ccCCC----CcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCccccc-----ccCccC-CHHHHHHHHHHHH---HHh Q lcl|NC_021305. 177 FFNPD----GLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVL-----RHEKRL-SEAAQQRLREQFD---RAH 243 (518) Q Consensus 177 ~~~~~----~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il-----~~~~~~-~~~~~~~~~~~~~---~~~ 243 (518) .+.++ +.++|+|.+..+...+............-++.|- ...++ ...... +.+.... -.|. ..| T Consensus 243 ~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~-~~i~v~~~~l~~~~~~~~g~~~~~--~~~d~~~~~~ 319 (500) T protein:vir:98 243 TPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQ-RRVAVPESLTALTVRTTDGDVVPR--PRFESDQNVY 319 (500) T ss_pred CCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCc-ceeeechHHhcccCCCCCccccCC--cccCCCcceE Confidence 44332 3356999999999999888877766666676543 33332 211110 1000000 0000 001 Q ss_pred cCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHH--HHHHHHHHhhHH Q lcl|NC_021305. 244 SGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQ--MRAFYRDTMAIP 321 (518) Q Consensus 244 ~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~--~~~~~~~~l~P~ 321 (518) ..... -.+++..++.++....+-++.+..+...++|+...|+++..+|...++. .+..+. ...-.-.++.-. 
T Consensus 320 ~~~~~-----~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~-~TAtei~s~~~~~~~t~~~~ 393 (500) T protein:vir:98 320 IRMGG-----RDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSM-KTATEIVSENSDTYQMRNSI 393 (500) T ss_pred EEcCC-----CCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCcc-ccHHHHHHHHHHHHHHHHHH Confidence 00000 0123345667777778888999999999999999999999998766543 233332 111111122222 Q ss_pred HHHHHHHHHHh------------hhh-hhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh-CCCCCCC Q lcl|NC_021305. 322 IARIQSAMDKY------------VGQ-YWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIM-GLPRSDD 387 (518) Q Consensus 322 ~~~ie~~l~~~------------l~~-~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~-g~~p~~~ 387 (518) ...++.+|... +.. .....+.+.+++++-+..|..+.++...+++.+|+|+.-+++.+. |++ + T Consensus 394 ~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~~g~~---e 470 (500) T protein:vir:98 394 VALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKVLNVT---E 470 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCC---H Confidence 22233322221 111 011234567888888899999999999999999999999988654 542 2 Q ss_pred CCcceeeecccccccccccccCCCCCCCCCCCCC-ccCCCCCC Q lcl|NC_021305. 388 PKADELYANSALQPLGATPDGAVEWEEAPAPKRP-ASTPVASL 429 (518) Q Consensus 388 ~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 429 (518) +...+.+ ..+ .. 
++ .+..+.+ ..++..++ T Consensus 471 eea~~~l-----~~i---~~----E~-~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:98 471 EKAQEIA-----AEI---NT----GI-VDEINQQRTDTHLYGE 500 (500) T ss_pred HHHHHHH-----HHH---HH----hc-cccCCCCCccccccCC Confidence 2222111 000 00 00 0000000 00111111 No 203 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=98.16 E-value=3.3e-06 Score=50.65 Aligned_cols=390 Identities=11% Similarity=0.049 Sum_probs=177.0 Q ss_pred CcCCCC--CCCCcccccccch--------hhhhhhccccccccc-ccccchhhhHHHhhcHHHHHHHHHHHHhhccCceE Q lcl|NC_021305. 1 MLLANG--QTLSAPAMAELSP--------QMQDSYYYAPAVGMQ-LERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVK 69 (518) Q Consensus 1 ~~f~~~--~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~ 69 (518) -||+.. .....+.. ..++ |.. .+.+.+ .... .................-..+++..|+-+..-|.. T Consensus 17 ~~~~~~~~~~~~~~~i-~~~~~~~~~i~~~~~-~Y~g~~-~~~~~~~~~~~~~~~~~~slnl~~~i~~~~A~lv~~e~~~ 93 (500) T protein:vir:30 17 VMTTQSLTNITDHPKI-AISKLEYDRITTNLK-YYKSDW-DSVLYLNTDGETKKRDLNHLPIARTAAKKIASLVFNEQAE 93 (500) T ss_pred Hhhcchhhhhhccccc-cCCHHHHHHHHHHHH-HhcCCC-CCcccccCCCCcccCceeecchHHHHHHHHhhhhcCCcce Confidence 123311 11111111 1222 211 111111 0000 00000011111222233456777777777665544 Q ss_pred EEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCcee Q lcl|NC_021305. 70 CMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGR 149 (518) Q Consensus 70 v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~ 149 (518) +.-. +......+.+--..-.+..-+...+...+..|.+++.+..+. |. +.+..++|+.+.+.....++. 
T Consensus 94 i~~~---------d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~-~~I~~v~ad~~~P~~~d~~~~ 162 (500) T protein:vir:30 94 IKVD---------DDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DK-VRVAFVQAPVFLPLQSNTQDV 162 (500) T ss_pred EecC---------ChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEEEcCCeeEEEEEcCCCe Confidence 3211 112222222211223345556667788888999988887764 33 446667777766533221111 Q ss_pred -----------------eEEe---eec-ccc----------------cCceeEE----------------eccccEEEEe Q lcl|NC_021305. 150 -----------------YEYY---FQA-GAG----------------VGTQLVS----------------FADDEVVPIR 176 (518) Q Consensus 150 -----------------~~~~---~~~-~~~----------------~~~~~~~----------------~~~~evih~~ 176 (518) .+|. ++. ... ..+..+. ++.--+.||+ T Consensus 163 ~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~~~ 242 (500) T protein:vir:30 163 SSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTYLK 242 (500) T ss_pred EEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccEEEec Confidence 0110 000 000 0010000 0011234665 Q ss_pred ccCCC----CcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCccccc-----ccCccC-CHHHHHHHHHHHH---HHh Q lcl|NC_021305. 177 FFNPD----GLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVL-----RHEKRL-SEAAQQRLREQFD---RAH 243 (518) Q Consensus 177 ~~~~~----~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il-----~~~~~~-~~~~~~~~~~~~~---~~~ 243 (518) .+.++ +.++|+|.+..+...+............-++.|- ...++ ...... +.+.... -.|. ..| T Consensus 243 ~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~-~~i~v~~~~l~~~~~~~~g~~~~~--~~~d~~~~~~ 319 (500) T protein:vir:30 243 TPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQ-RRVAVPESLTALTVRTTDGDVVPR--PRFESDQNVY 319 (500) T ss_pred CCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCc-ceeeechHHhcccCCCCCccccCC--cccCCCcceE Confidence 44332 3356999999999999888877766666676543 33332 211110 1000000 0000 001 Q ss_pred cCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHH--HHHHHHHHhhHH Q lcl|NC_021305. 
244 SGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQ--MRAFYRDTMAIP 321 (518) Q Consensus 244 ~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~--~~~~~~~~l~P~ 321 (518) ..... -.+++..++.++....+-++.+..+...++|+...|+++..+|...++. .+..+. ...-.-.++.-. T Consensus 320 ~~~~~-----~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~-~TAtei~s~~~~~~~t~~~~ 393 (500) T protein:vir:30 320 IRMGG-----RDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSM-KTATEIVSENSDTYQMRNSI 393 (500) T ss_pred EEcCC-----CCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCcc-ccHHHHHHHHHHHHHHHHHH Confidence 00000 0123345667777778888999999999999999999999998766543 233332 111111122222 Q ss_pred HHHHHHHHHHh------------hhh-hhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh-CCCCCCC Q lcl|NC_021305. 322 IARIQSAMDKY------------VGQ-YWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIM-GLPRSDD 387 (518) Q Consensus 322 ~~~ie~~l~~~------------l~~-~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~-g~~p~~~ 387 (518) ...++.+|... +.. .....+.+.+++++-+..|..+.++...+++.+|+|+.-+++.+. |++ + T Consensus 394 ~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~~g~~---e 470 (500) T protein:vir:30 394 VALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKVLNVT---E 470 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCC---H Confidence 22233322221 111 011234567888888899999999999999999999999988654 542 2 Q ss_pred CCcceeeecccccccccccccCCCCCCCCCCCCC-ccCCCCCC Q lcl|NC_021305. 388 PKADELYANSALQPLGATPDGAVEWEEAPAPKRP-ASTPVASL 429 (518) Q Consensus 388 ~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 429 (518) +...+.+ ..+ .. 
++ .+..+.+ ..++..++ T Consensus 471 eea~~~l-----~~i---~~----E~-~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:30 471 EKAQEIA-----AEI---NT----GI-VDEINQQRTDTHLYGE 500 (500) T ss_pred HHHHHHH-----HHH---HH----hc-cccCCCCCccccccCC Confidence 2222111 000 00 00 0000000 00111111 No 204 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=98.15 E-value=3.5e-06 Score=50.50 Aligned_cols=412 Identities=11% Similarity=0.023 Sum_probs=167.2 Q ss_pred CcCCCCCC-------------CCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCc Q lcl|NC_021305. 1 MLLANGQT-------------LSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLP 67 (518) Q Consensus 1 ~~f~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~ 67 (518) +|..-... ...+....+..+.. |......-.... ......-...+....+|+..+.-+-+-| T Consensus 9 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~---g~~~i~~~~~~~--~~~~~~ki~~n~~~~Iv~~~~~~l~g~p 83 (499) T protein:vir:10 9 LLDDVNEPNIEAINYAIRELQNRKKRLDKLSDYYN---GKQEIEKHEFDN--ATVEAANVMVNHAKYITDMNVGFMTGNP 83 (499) T ss_pred HHhhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhc---cccchhcCCcCc--CCCCcceeecchHHHHHHHHhhhhcccC Confidence 11110000 00000000000000 000000000000 0000111223456678888888777778 Q ss_pred eEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCce----------------EEE Q lcl|NC_021305. 68 VKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTP----------------EKL 131 (518) Q Consensus 68 ~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~----------------~~l 131 (518) +.+--.++ . ....+..++.. 
| ....+...+..+.+.+|.+|.++..+.+|.+ ..+ T Consensus 84 ~~~~~~~~----~-~~~~l~~~~~~-n---~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~~ 154 (499) T protein:vir:10 84 VKYVAEKG----K-NIDDILEVFNQ-I---DIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNTELKI 154 (499) T ss_pred ceeecCCh----h-HHHHHHHHHhh-c---CHhHHHHHHHHHHHhcCceEEEEEecccccccccccccccccccccceEE Confidence 76532211 1 11122233322 2 3445677888999999999999988887743 346 Q ss_pred EeeCCceeEEEEcCCcee-eEE---eeecccccCc----eeEEeccccEEEEecc----------------CC------- Q lcl|NC_021305. 132 MPMHPSRVAIKRNSRTGR-YEY---YFQAGAGVGT----QLVSFADDEVVPIRFF----------------NP------- 180 (518) Q Consensus 132 ~~l~p~~v~v~~~~~~~~-~~~---~~~~~~~~~~----~~~~~~~~evih~~~~----------------~~------- 180 (518) ..++|..+.+..+..... ..+ .+......+. ....+.++.+.++... ++ T Consensus 155 ~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv 234 (499) T protein:vir:10 155 EVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDPIVYDGENLFGAVPII 234 (499) T ss_pred EEEcccceEEEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCcceecccccCCCCccceE Confidence 778888877766543321 100 0111100001 1112233333333210 00 Q ss_pred --CCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeec--C Q lcl|NC_021305. 181 --DGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVV--E 256 (518) Q Consensus 181 --~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl--~ 256 (518) .....|.|-+..+...+.....+..-..+.+...+.|..+++-. .+.+.. . 
....+ ..+.+..+ + T Consensus 235 ~~~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~-~~~~~~-~-~~~~~--------~~~~~~~~~~~ 303 (499) T protein:vir:10 235 EFRNNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGF-GLGDDK-D-DIQRL--------KRGAIEAPPRE 303 (499) T ss_pred EecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC-cccccc-c-hhhhh--------hhcceeccCCC Confidence 01124667777766666666655555555555566666665422 122111 0 00000 11223332 3 Q ss_pred CCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH-------------HHHHHHHHHhhHHHH Q lcl|NC_021305. 257 EGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA-------------QMRAFYRDTMAIPIA 323 (518) Q Consensus 257 ~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~-------------~~~~~~~~~l~P~~~ 323 (518) ++.++..+........+....+.+.+.|...-++|..-.+.. .++- +..+ .....+..++.-.+. T Consensus 304 ~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~gn~-Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~ 381 (499) T protein:vir:10 304 EGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKF-MGNV-SGEAMKFKLFGLENLLSIKQRYFFDGLRRRLK 381 (499) T ss_pred CCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhh-cccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 555555565544555566777788888888888773211110 1111 1111 111222222222333 Q ss_pred HHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecc-ccccc Q lcl|NC_021305. 324 RIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANS-ALQPL 402 (518) Q Consensus 324 ~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~-n~~~~ 402 (518) .+...++..- .......+++.+..-+..|..+.++.+.++ .|+++..-++++++.-.-+....+.+.--. ..... 
T Consensus 382 li~~~~~~~~--~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~ 457 (499) T protein:vir:10 382 LIQTIVNIKG--ANDDASGCKISLVANIPSNLSDVVNNVKNA--DGIIPRKYTYSWLPDVDNPQDVIDEMNQQDAETIKK 457 (499) T ss_pred HHHHHHhccC--CccccccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Confidence 3222221110 000112456666777888999999999998 688999888888764221111111110000 00000 Q ss_pred cc-ccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccc Q lcl|NC_021305. 403 GA-TPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPT 443 (518) Q Consensus 403 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 443 (518) .. ..++..+........+...++..+++.+.+.+.....+. T Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (499) T protein:vir:10 458 NQEALRGQDPDRLELEDKQDDSSENDKEAGSNHNQSHRTRAV 499 (499) T ss_pred HHhhhccCCCCCCCCCCCCcccCCCCCCCccccccCCCCCCC Confidence 00 000000000000000011111111111111111111111 No 205 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=98.13 E-value=3.9e-06 Score=50.24 Aligned_cols=411 Identities=13% Similarity=0.125 Sum_probs=162.5 Q ss_pred cCCCCCCCCcccccccchhh---------hhhhcccc--cccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEE Q lcl|NC_021305. 2 LLANGQTLSAPAMAELSPQM---------QDSYYYAP--AVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKC 70 (518) Q Consensus 2 ~f~~~~~~~~~~~~~~~~~~---------~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v 70 (518) |-..-. -...+..+. ...+.-+. ....+.. 
.....+..-....+...+|+..+..+--.+|.+ T Consensus 1 ~~t~~~-----~i~~L~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~-~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~ 74 (480) T protein:vir:78 1 MTTYHE-----HVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG-APPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) T ss_pred CCCHHH-----HHHHHHHHHHHHHHHHHHHHHHHhccccccccccc-cchhHhhhhhhcchHHHHHHHHHhhhccCceec Confidence 111000 000000000 00000000 0000000 000011111223345667777777654444432 Q ss_pred EEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEc------CCCceEEEEeeCCceeEEEEc Q lcl|NC_021305. 71 MFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKN------KSGTPEKLMPMHPSRVAIKRN 144 (518) Q Consensus 71 ~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~------~~G~~~~l~~l~p~~v~v~~~ 144 (518) .++ .. ....+..++.+ | ........+..+.+.+|.+|..+-++ ..|. ..+.+++|..+.+.++ T Consensus 75 ---~~d--~~-~~~~l~~i~~~-N---~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~-~~i~~~~p~~~~~~~D 143 (480) T protein:vir:78 75 ---SED--SE-GLEELWNWWQA-N---DLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGI-PLIRVESPLYMYAELD 143 (480) T ss_pred ---CCC--ch-hHHHHHHHHHh-c---CHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCe-eEEEEEcccceEEEEc Confidence 111 11 12233334432 2 34566777889999999999888753 3443 4467888988888886 Q ss_pred CCc--eeeE-Eeeeccccc-C----------ceeEE---------------------eccccEEEEeccCCCCcccCchH Q lcl|NC_021305. 145 SRT--GRYE-YYFQAGAGV-G----------TQLVS---------------------FADDEVVPIRFFNPDGLERGLSL 189 (518) Q Consensus 145 ~~~--~~~~-~~~~~~~~~-~----------~~~~~---------------------~~~~evih~~~~~~~~~~~G~s~ 189 (518) ... .... ..++..... + +..+. 
+..=.|++|.+....+..+|.|- T Consensus 144 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~ 223 (480) T protein:vir:78 144 PRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE 223 (480) T ss_pred CCCccceEEEEEEEEeecCCCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEEeecccccCCccCccc Confidence 531 1111 111000000 0 00000 11224566654433344567776 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecC-CCcceeeccCC Q lcl|NC_021305. 190 MES-LKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVE-EGMEPIPLQLT 267 (518) Q Consensus 190 l~~-~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~-~g~~~~~l~~~ 267 (518) +.- +...+........-......-.+.|.-+|. .....+...+.-...|... .+.++.++ ++.++.++... T Consensus 224 i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~ 296 (480) T protein:vir:78 224 ISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-GVTTDELTNDGENTTLDIY------YGRILTLASEAAKISEFKAA 296 (480) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhh-cCCccccccccccchhhhh------hhhhccCCCCCceEEecCcc Confidence 543 333333333222222222222334444432 1111111111111112111 12333443 45677666643 Q ss_pred hhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHHHH----HHHHHhh--hhhh-c-- Q lcl|NC_021305. 268 AVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQ----SAMDKYV--GQYW-V-- 338 (518) Q Consensus 268 ~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie----~~l~~~l--~~~~-~-- 338 (518) ..+ .+.+..+..+..|+..-++|+..+|.... |.++.++.. +....+.-.+...+ ..|.+.+ +... 
+ T Consensus 297 ~~~-~~~~~l~~~i~~~~~~~~~p~~~~g~~~~-n~~Sg~Alk--~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~g~~ 372 (480) T protein:vir:78 297 ELR-NFAEEMEVFRKEAASITGLPPQYLSSSSE-NPASAEAII--ATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE 372 (480) T ss_pred CHH-HHHHHHHHHHHHHhcccCCChHHhccccC-cchHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCC Confidence 322 26777788888899999999999876432 222222221 11122221122211 1121111 1111 1 Q ss_pred ---ccccceecchhhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccC-CCC Q lcl|NC_021305. 339 ---RKNRMKFDIDDVIQPDWEAKSESTQKMVNSG--VATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGA-VEW 412 (518) Q Consensus 339 ---~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~-~~~ 412 (518) ....+++.|.+....+..+.++.+.+++.+| +++..-+++.+|+.+-+....+..-....-.+++...... ... T Consensus 373 ~~~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~~~~~~~~e~~~~~~~~~~~~~~~~~ 452 (480) T protein:vir:78 373 VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQA 452 (480) T ss_pred ccccceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccccC Confidence 1123455666667778889999999998876 6777778888888653211111100000000111110000 000 Q ss_pred CCCCCCCCCccCCCCCCCccccCCccccccchhcc Q lcl|NC_021305. 413 EEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDR 447 (518) Q Consensus 413 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 447 (518) ...+.+.. ++. .++.++.+.+.... 
..| T Consensus 453 ~~~~~~~~-~~~--~~~~~~~~~~~~~~----~~~ 480 (480) T protein:vir:78 453 DATPKPTV-TET--KTETQTSPSGFNRT----KTR 480 (480) T ss_pred CCCCCCCC-CCC--CCccccccCCCCcc----cCC Confidence 00000000 000 01111111111111 011 No 206 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=98.05 E-value=5.9e-06 Score=49.25 Aligned_cols=385 Identities=10% Similarity=0.052 Sum_probs=165.4 Q ss_pred CcCCCCCCC------------Ccccccccchhhhhhhccccccccccc--c--cchh-hhHHHhhcHHHHHHHHHHHHhh Q lcl|NC_021305. 1 MLLANGQTL------------SAPAMAELSPQMQDSYYYAPAVGMQLE--R--QFSL-YGGIYKNQPWVRTVIAKRAQAL 63 (518) Q Consensus 1 ~~f~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~--~--~~~~-~~~~~~~~~~v~~~v~~ia~~i 63 (518) |-+...... ..+....+..+.. |.......... . .... ....=..++....+|+..+.-+ T Consensus 20 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~---g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l 96 (474) T protein:vir:96 20 IKPKYETQEEMIIRLINDHKPKIDDITVGERYYN---HDPDVLRLAPKLDNKGEIDPLKPDWRMFTNYHQNLVDQKVAYA 96 (474) T ss_pred hhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhc---cCCcchhccchhcccccccccccchhcccchHHHHHHhhhhhh Confidence 111100000 0000000000000 00000000000 0 0000 0000122455677888888888 Q ss_pred ccCceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEE Q lcl|NC_021305. 64 ARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKR 143 (518) Q Consensus 64 a~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~ 143 (518) -.-|+.+--.++ .....+..++. | +.......+..++..+|.+|..+-.+..|++ .+..++|..+.++. 
T Consensus 97 ~g~p~~~~~~d~-----~~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~-~i~~~~p~~~~~v~ 165 (474) T protein:vir:96 97 VANPVTFSSDDD-----KSLKTIQEVLN--H---KWDDKLVDILTAASNKGIEWLQPYIDENGEF-KTFRVPAEQAIPIW 165 (474) T ss_pred cccCceeecCch-----HHHHHHHHHHh--c---CHHHHHHHHHHHHHhcCeeEEEEEecCCCce-EEEEEcccceEEEE Confidence 788877632111 11222333332 2 3344556677889999999999888888875 47789999998887 Q ss_pred cCCc--eeeEEeeecccccCceeEEeccc---------------------------------------cEEEEeccCCCC Q lcl|NC_021305. 144 NSRT--GRYEYYFQAGAGVGTQLVSFADD---------------------------------------EVVPIRFFNPDG 182 (518) Q Consensus 144 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~---------------------------------------evih~~~~~~~~ 182 (518) +... .......++..........+... .|++|+. T Consensus 166 d~~~~~~~~~~vr~~~~~~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n----- 240 (474) T protein:vir:96 166 TNKERDTLKAFIRYYRLDGAERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKRVSWGRVPFIPFKN----- 240 (474) T ss_pred cCCCCCceEEEEEEEeecCceEEEEEeCCeEEEEEecCCceeeccccccccccccccccccccCCCceeEEEecc----- Confidence 6421 11111111110000111111122 2233332 Q ss_pred cccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecC-CCcce Q lcl|NC_021305. 183 LERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVE-EGMEP 261 (518) Q Consensus 183 ~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~-~g~~~ 261 (518) ...|.|-+......+.....+..-..+.+...+.|-.+++--. .++.+.+. ... ..++++.++ +|.++ T Consensus 241 n~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~---~~~~~~~~----~~~----~~~~~i~~~~~~~~~ 309 (474) T protein:vir:96 241 NPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYE---GQDLDEFM----RNL----KYYKAINVDGDGSGV 309 (474) T ss_pred CCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC---cccccchh----hhh----hcCceEEecCCCCce Confidence 1247787877777777776666566666666666665554221 11111111 111 123455554 45566 Q ss_pred eeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHh----h----HHHHHHHHHHHHhh Q lcl|NC_021305. 
262 IPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTM----A----IPIARIQSAMDKYV 333 (518) Q Consensus 262 ~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l----~----P~~~~ie~~l~~~l 333 (518) ..+..+.....+....+...+.|+..-++|..-.+.. +++ .+..+ . .+....+ . -+-..+.+.+. .+ T Consensus 310 ~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n-~Sg~A-l-~~~~~~l~~k~~~k~~~~~~~l~~~~~-~i 384 (474) T protein:vir:96 310 DTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKF-GNS-PSGIA-L-KFMYSNLDLKANKLKNKTLTALQELLQ-YI 384 (474) T ss_pred eEEeecCChHHHHHHHHHHHHHHHHHhCCcccccccc-ccc-cHHHH-H-HHHHHHHHHHHHHHHHHHHHHHHHHHH-HH Confidence 5565555555677788888999999999985432111 111 11211 1 1111111 1 11111221111 11 Q ss_pred hhhhc---ccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCC Q lcl|NC_021305. 334 GQYWV---RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAV 410 (518) Q Consensus 334 ~~~~~---~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~ 410 (518) +.-.+ ....+.+.+..-+..|..+.++. +..+|++|...++++++. ++++. .++ ..+. +... T Consensus 385 ~~~~~~~~~~~~i~i~f~~~~p~~~~e~~~~---~~~ag~iS~et~~~~~~~--v~d~~-~E~------~ri~---~E~~ 449 (474) T protein:vir:96 385 IDFYKLNIKVQDVEITFNFNVMVNELEQSQI---GVQSQYLSKETVVTNHPW--VDDPV-AEL------ERIE---QDNI 449 (474) T ss_pred HHHhCCCcccceeeEEeccCCCcCHHHHHHH---HHhcCCCchHHHHHhCCC--CCCHH-HHH------HHHH---HHHH Confidence 11111 11234455566667777666654 556899999999988754 33221 111 1111 0000 Q ss_pred CCCCCCCCCCCccCCCCCCCccccC Q lcl|NC_021305. 
411 EWEEAPAPKRPASTPVASLDQSPPT 435 (518) Q Consensus 411 ~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (518) .......+.........+.+++.++ T Consensus 450 e~~~~~~~~~~~~~~~~~d~~~e~~ 474 (474) T protein:vir:96 450 DFNKQLPPLEGDANGRAQDNESETN 474 (474) T ss_pred HHHhcccccccccccccCCCcccCC Confidence 0000000000000001111111111 No 207 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=98.04 E-value=6.1e-06 Score=49.16 Aligned_cols=376 Identities=7% Similarity=0.015 Sum_probs=161.2 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) -.+.+......... ...+- .... ... ...-..++....+|+..+.-+-.-|+.+--.++. T Consensus 50 ~yY~g~~~i~~~~~--~~~~~-----~~~~---~~~------~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~d~---- 109 (478) T protein:vir:10 50 RYYNHHPDILDAPP--KRDVN-----GDYD---ETK------PDWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDK---- 109 (478) T ss_pred HHhcCCCchhcccc--ccccc-----cccc---ccc------ccceeccchHHHHHHHHHhhhccCCeeeecCChH---- Confidence 11111111000000 00000 0000 000 0001223455678888888877778776321111 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCc-eeeEEee-eccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT-GRYEYYF-QAGA 158 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~-~~~~~~~-~~~~ 158 (518) ....+..++. | +..+....+..+.+.+|.+|+.+..+..|.+ .+..++|..+.+..+... ......+ .+.. 
T Consensus 110 -~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~~~~d~~g~~-~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~ 182 (478) T protein:vir:10 110 -ALKQIQHTLN--H---KWDDKLVDILTAASNKGIEWVQPYVDEEGEF-KTFRVPAEQAVPIWTNKERDELQAFIRVYEL 182 (478) T ss_pred -HHHHHHHHHh--c---CHHHHHHHHHHHHHhcCeEEEEEEecCCCee-EEEEEcccceEEEEcCCCCCceEEEEEEEEe Confidence 1112223332 2 4566667788999999999999988888865 577789988888776431 1111110 0000 Q ss_pred ccCceeEEeccc---------------------------------------cEEEEeccCCCCcccCchHHHHHHHHHHH Q lcl|NC_021305. 159 GVGTQLVSFADD---------------------------------------EVVPIRFFNPDGLERGLSLMESLKSTIFS 199 (518) Q Consensus 159 ~~~~~~~~~~~~---------------------------------------evih~~~~~~~~~~~G~s~l~~~~~~i~~ 199 (518) ........+.++ .|++|+. ..+|.|-+..+...+.. T Consensus 183 ~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n-----~~~g~sd~~~v~~liDa 257 (478) T protein:vir:10 183 DGAERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNKLMSWGRVPFIPFKN-----NPQEVSDLFMYKTIIDA 257 (478) T ss_pred cCceEEEEEeCCeEEEEEEcCCeeeccccccccccccceecccccccCCccceEEecc-----CCCCCCcHHHHHHHHHH Confidence 000011111122 2333332 23577777766666666 Q ss_pred HHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeec--CCCcceeeccCChhhHHHHHHH Q lcl|NC_021305. 200 EDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVV--EEGMEPIPLQLTAVEMQFIEAR 277 (518) Q Consensus 200 ~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl--~~g~~~~~l~~~~~d~~~~e~~ 277 (518) ...+..-..+.+...+.|..+++--. .. ......... ..++++.+ ++|.+...+........+.... T Consensus 258 ~~~~~S~~~~~~~~~~~p~~~~~g~~-~~--~~~~~~~~~--------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 326 (478) T protein:vir:10 258 LDKRLSDTQNTFDESVELIYILKGYE-GE--DMKDFMHNL--------KYYKAISVAGESGSGVDTIKVEVPIDSVKEYT 326 (478) T ss_pred HHHHHHHHHHHHHHhhCceeeeecCC-cc--ccchhhhhh--------hhcceEEecCCCCCcceEEeecCChHHHHHHH Confidence 66555555555555555655543211 11 111111111 11223333 2334443344444455566778 Q ss_pred HHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHH----HHHHHHH---hhhhhhc---ccccceecc Q lcl|NC_021305. 
278 QLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIAR----IQSAMDK---YVGQYWV---RKNRMKFDI 347 (518) Q Consensus 278 ~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~----ie~~l~~---~l~~~~~---~~~~~~fd~ 347 (518) +.+.+.|...-++|..-.+.. . +|.......+....+.-.+.. +...+.+ .++...+ ....+++.+ T Consensus 327 ~~l~~~i~~~s~~p~~~~~~~-~---~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~g~~~~~~~i~i~f 402 (478) T protein:vir:10 327 KMLRDYIIEFGQGVDFQQDKF-G---NSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRLDVKVQDIEITF 402 (478) T ss_pred HHHHHHHHHHhCccccCcccc-c---cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccceEEe Confidence 888888888888884322111 1 111111111111122111111 1111111 1111111 112345666 Q ss_pred hhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCC Q lcl|NC_021305. 348 DDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVA 427 (518) Q Consensus 348 ~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (518) ..-+..|..+.++.+.++ +|+++...+++++++-.-++...+. +....... .+......+...... T Consensus 403 ~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~r---------i~~E~~~~--~~~~~~~~~~~~~~~- 468 (478) T protein:vir:10 403 NFNVMVNELENSQIAMNS--TGLLSKETILSNHAWVEDPVAEMER---------IEQENIEL--NQQLPDIEEGLNGEQ- 468 (478) T ss_pred cCCCCCCHHHHHHHHHHH--hCCCChHHHHHhCCCCCCHHHHHHH---------HHHHHHHH--HhhccccccccCCCC- Confidence 677888999999998887 7899998899888753211111111 11000000 000000000000000 Q ss_pred CCCccccCCc Q lcl|NC_021305. 
428 SLDQSPPTSV 437 (518) Q Consensus 428 ~~~~~~~~~~ 437 (518) +..++.++++ T Consensus 469 ~~~~~~~~~~ 478 (478) T protein:vir:10 469 QRQSENNQPE 478 (478) T ss_pred CCCCCCCCCC Confidence 0000000111 No 208 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=98.03 E-value=6.4e-06 Score=49.05 Aligned_cols=398 Identities=10% Similarity=0.045 Sum_probs=172.2 Q ss_pred CcCCCCCCCCccccccc-------chhhhh--hh--cccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceE Q lcl|NC_021305. 1 MLLANGQTLSAPAMAEL-------SPQMQD--SY--YYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVK 69 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~-------~~~~~~--~~--~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~ 69 (518) |-|......-......+ -+.+.. .+ |............... ..-...+.....|+..+.-+-.-|+. T Consensus 9 ~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~--~~ki~~n~~~~ivd~~~~~l~g~~~~ 86 (453) T protein:vir:73 9 MTYSRDEEITDKVVNDFMKKHQEEVERYEYLGNMYKGIMEISSQKAKDSWKP--DNRLTNNFAKYIVDTFVGYFNGIPIK 86 (453) T ss_pred eeccccccCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCCCCccCc--cceeecchHHHHHHHhhhhhcccCce Confidence 22222221111000000 001100 00 0000000000000000 01122345667777777777666766 Q ss_pred EEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCcee Q lcl|NC_021305. 70 CMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGR 149 (518) Q Consensus 70 v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~ 149 (518) +--.+ ......+..++. ..........+..+.+.+|.+|+.+.++.+|.+ .+..++|..+.+.++..... 
T Consensus 87 ~~~~d-----~~~~~~l~~~~~----~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~v~dd~~~~ 156 (453) T protein:vir:73 87 KTHDD-----KSVLEAMQLFDN----LNDMEDEESELAKIACVYGRAYELMYQNESTES-EVIYCSPLNVFMVYDDSIKQ 156 (453) T ss_pred eecCC-----hHHHHHHHHHHH----hcChhHHHHHHHHHHHhcCeEEEEEEeCCCCce-EEEEEcccceEEEEeCCCCc Confidence 52211 111122223322 234555667788999999999999999888876 46678998888777654321 Q ss_pred -eEEee-ecccccCc-eeEEeccccEEEEecc-----------CC---------CCcccCchHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 150 -YEYYF-QAGAGVGT-QLVSFADDEVVPIRFF-----------NP---------DGLERGLSLMESLKSTIFSEDSSRNA 206 (518) Q Consensus 150 -~~~~~-~~~~~~~~-~~~~~~~~evih~~~~-----------~~---------~~~~~G~s~l~~~~~~i~~~~~~~~~ 206 (518) ..+.+ +.....+. ....+..+.++++... ++ .....|.|-+..+...+.....+..- T Consensus 157 ~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~s~~~~v~~liDa~~~~~S~ 236 (453) T protein:vir:73 157 KPLFAVYYGFDEEGNLSGTVYTLLETISITGKAGEVKFGESTYNVYSDLPIVEYNFNEERQSIFEPVHSLINSYNKVTSE 236 (453) T ss_pred eeEEEEEEEEecCceEEEEEEeCCeEEEEEecCCceEEccceeccCCceeEEEecCCCCCCcchhhHHHHHHHHHHHHHH Confidence 11111 11111111 1122333333333211 00 01125777787777777666655555 Q ss_pred HHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHH Q lcl|NC_021305. 207 TAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCG 286 (518) Q Consensus 207 ~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~ 286 (518) ..+.....+.|..+++- ..+.++....++..-. ........+.....+.+.++..+.....+..+....+.+.+.|+. 
T Consensus 237 ~~~~~~~~~~~~l~~~g-~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~ 314 (453) T protein:vir:73 237 KANDVEYFSDQYLVFLG-AEVDEEDAKNIKDNRL-INFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQ 314 (453) T ss_pred HHHHHHHhccceeeeec-CCCCchhhhccccccc-ccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHH Confidence 55555555556655532 2344444444433110 000111122223334455555555555556667778888888888 Q ss_pred HhcCCHHHhccccccccCCHHHH-------------HHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecchhhhhc Q lcl|NC_021305. 287 VYDIAPPIVHILDRATFSNISAQ-------------MRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQP 353 (518) Q Consensus 287 ~fgVPp~~lg~~~~~~~sn~e~~-------------~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~ 353 (518) .-++|..-. ...++ .+..+. ....+...+.-.+..+...++..-. ......+++.+..-+.. T Consensus 315 ~s~~p~~~~--~~~gn-~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~--~~~~~~i~v~f~~~~p~ 389 (453) T protein:vir:73 315 FTMAANISD--ENFGN-SSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNASN--KDAWKDIEYTFTRNEPK 389 (453) T ss_pred HhCCcccCc--ccccC-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC--ccccccceEEeCCCCCC Confidence 888884221 11111 122111 1112222222222222211111100 00112456667788889 Q ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccc Q lcl|NC_021305. 354 DWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSP 433 (518) Q Consensus 354 d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (518) |..+.++.+.++. |+++..-+.+++++- +++.. ++ ..+..........+..... ...++. T Consensus 390 ~~~~~a~~~~k~~--giis~et~~~~~~~~--~d~~~-E~------~ri~~E~~~~~~~~~~~~~--------~~~~~~- 449 (453) T protein:vir:73 390 DIKEQAETANILK--GITSEETALSVISVI--PDVQA-EM------EKIKKKKLLQLSLTRTSNL--------VRMKQM- 449 (453) T ss_pred CHHHHHHHHHHHh--ccCcHHHHHHhCCCC--CCHHH-HH------HHHHHHHHHHHHHHHhccC--------Ccchhh- Confidence 9999999999885 789887788877652 22211 10 1111000000000000000 000000 Q ss_pred cCCcccc Q lcl|NC_021305. 
434 PTSVPGL 440 (518) Q Consensus 434 ~~~~~~~ 440 (518) .... T Consensus 450 ---~~~~ 453 (453) T protein:vir:73 450 ---RGNL 453 (453) T ss_pred ---hcCC Confidence 0000 No 209 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=98.03 E-value=6.5e-06 Score=49.03 Aligned_cols=373 Identities=10% Similarity=-0.012 Sum_probs=163.2 Q ss_pred CcCCCCC-CCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANGQ-TLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) .-.-.|+ .......+ .. ..+....+ . ...-...+....+|+..+.-+-.-|+.+--.+ T Consensus 50 ~~YY~g~~~i~~~~~~-----~~--~~~~~~~~---~------~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d----- 108 (474) T protein:vir:94 50 QRYYDKDNDIVKQMKK-----VD--VHGNIDYD---K------PDWRITTNFHQNLVDQKVSYVASKPVTYSCED----- 108 (474) T ss_pred HHHhccccchhcccch-----hc--cccccccc---c------CcceeecchHHHHHHHHHhhhhcCCceeccCc----- Confidence 0000111 00000000 00 00000000 0 00011234567788888888888888763211 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCc--eeeEEeeecc Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT--GRYEYYFQAG 157 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~--~~~~~~~~~~ 157 (518) +.....+..++. | +.......+..+.+.+|.+|+.+..+.+|.+ .+..++|..+.+.++... ........+. 
T Consensus 109 ~~~~~~l~~~~~--n---~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~ 182 (474) T protein:vir:94 109 ENVLKVIHDVLD--T---RWDNKLIDILTATSNKGIDWLQVYINENGEM-KLFRVPAEQAIPIWVDKEREELKSFIRYYK 182 (474) T ss_pred HHHHHHHHHHHh--c---cHHHHHHHHHHHHhhcCceEEEEEecCCCee-EEEEEcccceEEEEcCCCCCceEEEEEEEE Confidence 111222223332 2 3455666778999999999999988888864 577789999888876532 1111111111 Q ss_pred cccCceeEEecccc-----------------------------------EEEEeccCCCCcccCchHHHHHHHHHHHHHH Q lcl|NC_021305. 158 AGVGTQLVSFADDE-----------------------------------VVPIRFFNPDGLERGLSLMESLKSTIFSEDS 202 (518) Q Consensus 158 ~~~~~~~~~~~~~e-----------------------------------vih~~~~~~~~~~~G~s~l~~~~~~i~~~~~ 202 (518) .........+.... |++|+. ..+|.|-+..+...+..... T Consensus 183 ~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~n~ 257 (474) T protein:vir:94 183 FNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKN-----NPEEVSDIWMYKSIIDAIDK 257 (474) T ss_pred ecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEecC-----CcCCCCcHHHHHHHHHHHHH Confidence 00001111122222 333322 13577878777777776665 Q ss_pred HHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHH Q lcl|NC_021305. 203 SRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNRE 282 (518) Q Consensus 203 ~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~ 282 (518) +..-..+.+...+.|..+++-- +.+..+.+... ....+++.++++.+...+........+....+...+ T Consensus 258 ~~s~~~~~~~~~~~~~lv~~g~---~~~~~~~~~~~--------~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~ 326 (474) T protein:vir:94 258 RLSDAQNMFDESVELIYILKGY---EGEDLEEFMRG--------LKYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRV 326 (474) T ss_pred HHHHHHHHHHHhcCceeeeecC---Ccccchhhhhh--------hhccceeeccCCCceeEEeecCCHHHHHHHHHHHHH Confidence 5555555555555565554322 11111122111 123456667766666556555555666777788888 Q ss_pred HHHHHhcCCHHHh-ccccccccCCHHHH-------------HHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecch Q lcl|NC_021305. 
283 EVCGVYDIAPPIV-HILDRATFSNISAQ-------------MRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDID 348 (518) Q Consensus 283 ~Ia~~fgVPp~~l-g~~~~~~~sn~e~~-------------~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~ 348 (518) .|...-++|..-. +.. ++ .+..+. ....+...+..++..|...+. .......+++.++ T Consensus 327 ~I~~~s~~p~~~~~~~~--~n-~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~-----~~~d~~~i~v~f~ 398 (474) T protein:vir:94 327 YIMEFGQGVDFQTDKFG--SA-PSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNN-----LKTDVKDIEISFN 398 (474) T ss_pred HHHHHhCccccCccccc--cc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCcccceeeEEec Confidence 8888888884221 111 11 121111 111222222323222221111 0111122444455 Q ss_pred hhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCC Q lcl|NC_021305. 349 DVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVAS 428 (518) Q Consensus 349 ~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (518) .-...+..+.++. +..+|+++..-++++++. ++++. .++ ..+........ +..+.. .....+.+. T Consensus 399 ~~~p~~~~e~a~~---~~~~g~iS~et~l~~l~~--v~D~~-~E~------eri~~E~~~~~--~~~~~~-~~~~~~~~~ 463 (474) T protein:vir:94 399 FNRMMNDAEQSQI---IAQSQYLSRETLVKSSPL--VDDYK-AEL------ERIEQEQMEYN--KQLPNL-DDGGADGAQ 463 (474) T ss_pred cCcccCHHHHHHH---HHHcCCCCHHHHHHhCCC--CCCHH-HHH------HHHHHHHHHHH--hhcccc-CCCCCCCcc Confidence 5555666665554 555799999888888864 22211 110 11100000000 000000 000001111 Q ss_pred CCccccCCccc Q lcl|NC_021305. 
429 LDQSPPTSVPG 439 (518) Q Consensus 429 ~~~~~~~~~~~ 439 (518) +++++.+...+ T Consensus 464 ~~~~~~~~~~e 474 (474) T protein:vir:94 464 QQEGSNNKESE 474 (474) T ss_pred cCCCCcccccC Confidence 11111111111 No 210 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=98.03 E-value=6.5e-06 Score=49.03 Aligned_cols=373 Identities=10% Similarity=-0.012 Sum_probs=163.2 Q ss_pred CcCCCCC-CCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANGQ-TLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) .-.-.|+ .......+ .. ..+....+ . ...-...+....+|+..+.-+-.-|+.+--.+ T Consensus 50 ~~YY~g~~~i~~~~~~-----~~--~~~~~~~~---~------~~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~~d----- 108 (474) T protein:vir:97 50 QRYYDKDNDIVKQMKK-----VD--VHGNIDYD---K------PDWRITTNFHQNLVDQKVSYVASKPVTYSCED----- 108 (474) T ss_pred HHHhccccchhcccch-----hc--cccccccc---c------CcceeecchHHHHHHHHHhhhhcCCceeccCc----- Confidence 0000111 00000000 00 00000000 0 00011234567788888888888888763211 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCc--eeeEEeeecc Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT--GRYEYYFQAG 157 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~--~~~~~~~~~~ 157 (518) +.....+..++. | +.......+..+.+.+|.+|+.+..+.+|.+ .+..++|..+.+.++... ........+. 
T Consensus 109 ~~~~~~l~~~~~--n---~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~ 182 (474) T protein:vir:97 109 ENVLKVIHDVLD--T---RWDNKLIDILTATSNKGIDWLQVYINENGEM-KLFRVPAEQAIPIWVDKEREELKSFIRYYK 182 (474) T ss_pred HHHHHHHHHHHh--c---cHHHHHHHHHHHHhhcCceEEEEEecCCCee-EEEEEcccceEEEEcCCCCCceEEEEEEEE Confidence 111222223332 2 3455666778999999999999988888864 577789999888876532 1111111111 Q ss_pred cccCceeEEecccc-----------------------------------EEEEeccCCCCcccCchHHHHHHHHHHHHHH Q lcl|NC_021305. 158 AGVGTQLVSFADDE-----------------------------------VVPIRFFNPDGLERGLSLMESLKSTIFSEDS 202 (518) Q Consensus 158 ~~~~~~~~~~~~~e-----------------------------------vih~~~~~~~~~~~G~s~l~~~~~~i~~~~~ 202 (518) .........+.... |++|+. ..+|.|-+..+...+..... T Consensus 183 ~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----n~~g~sd~e~v~~liDa~n~ 257 (474) T protein:vir:97 183 FNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKN-----NPEEVSDIWMYKSIIDAIDK 257 (474) T ss_pred ecCeEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEecC-----CcCCCCcHHHHHHHHHHHHH Confidence 00001111122222 333322 13577878777777776665 Q ss_pred HHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHH Q lcl|NC_021305. 203 SRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNRE 282 (518) Q Consensus 203 ~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~ 282 (518) +..-..+.+...+.|..+++-- +.+..+.+... ....+++.++++.+...+........+....+...+ T Consensus 258 ~~s~~~~~~~~~~~~~lv~~g~---~~~~~~~~~~~--------~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~ 326 (474) T protein:vir:97 258 RLSDAQNMFDESVELIYILKGY---EGEDLEEFMRG--------LKYYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRV 326 (474) T ss_pred HHHHHHHHHHHhcCceeeeecC---Ccccchhhhhh--------hhccceeeccCCCceeEEeecCCHHHHHHHHHHHHH Confidence 5555555555555565554322 11111122111 123456667766666556555555666777788888 Q ss_pred HHHHHhcCCHHHh-ccccccccCCHHHH-------------HHHHHHHHhhHHHHHHHHHHHHhhhhhhcccccceecch Q lcl|NC_021305. 
283 EVCGVYDIAPPIV-HILDRATFSNISAQ-------------MRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDID 348 (518) Q Consensus 283 ~Ia~~fgVPp~~l-g~~~~~~~sn~e~~-------------~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~~~fd~~ 348 (518) .|...-++|..-. +.. ++ .+..+. ....+...+..++..|...+. .......+++.++ T Consensus 327 ~I~~~s~~p~~~~~~~~--~n-~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~-----~~~d~~~i~v~f~ 398 (474) T protein:vir:97 327 YIMEFGQGVDFQTDKFG--SA-PSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNN-----LKTDVKDIEISFN 398 (474) T ss_pred HHHHHhCccccCccccc--cc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCcccceeeEEec Confidence 8888888884221 111 11 121111 111222222323222221111 0111122444455 Q ss_pred hhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCC Q lcl|NC_021305. 349 DVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVAS 428 (518) Q Consensus 349 ~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (518) .-...+..+.++. +..+|+++..-++++++. ++++. .++ ..+........ +..+.. .....+.+. T Consensus 399 ~~~p~~~~e~a~~---~~~~g~iS~et~l~~l~~--v~D~~-~E~------eri~~E~~~~~--~~~~~~-~~~~~~~~~ 463 (474) T protein:vir:97 399 FNRMMNDAEQSQI---IAQSQYLSRETLVKSSPL--VDDYK-AEL------ERIEQEQMEYN--KQLPNL-DDGGADGAQ 463 (474) T ss_pred cCcccCHHHHHHH---HHHcCCCCHHHHHHhCCC--CCCHH-HHH------HHHHHHHHHHH--hhcccc-CCCCCCCcc Confidence 5555666665554 555799999888888864 22211 110 11100000000 000000 000001111 Q ss_pred CCccccCCccc Q lcl|NC_021305. 
429 LDQSPPTSVPG 439 (518) Q Consensus 429 ~~~~~~~~~~~ 439 (518) +++++.+...+ T Consensus 464 ~~~~~~~~~~e 474 (474) T protein:vir:97 464 QQEGSNNKESE 474 (474) T ss_pred cCCCCcccccC Confidence 11111111111 No 211 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=98.00 E-value=7.5e-06 Score=48.68 Aligned_cols=407 Identities=12% Similarity=0.033 Sum_probs=167.6 Q ss_pred Cc----CCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_021305. 1 ML----LANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGD 76 (518) Q Consensus 1 ~~----f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~ 76 (518) |. +-.|+. ....++. -+ ++. + +......... ..=..+....-.|+..+.-+-+-|+++--.+++ T Consensus 33 ~~~~~~YY~g~h-~Il~r~~--~~----~~~-~--~~~~~d~~~~--nnki~~nf~k~Ivd~~~~yl~G~Pv~~~~~d~~ 100 (537) T protein:vir:78 33 AHIGENYYNQEN-DIEKSRI--FY----MND-K--GQLREDNYAS--NVKISHGFFTELVDQLAQYLLSNGVEVKVKDED 100 (537) T ss_pred HHHHHHHhcccc-hhhhccc--cc----ccc-c--cccccccccc--ccccccchHHHHHHHHhhhhcccCceeecCcch Confidence 00 000000 0000000 00 000 0 0000000000 001223345567777777777888876322211 Q ss_pred cceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEE-eee Q lcl|NC_021305. 77 TETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEY-YFQ 155 (518) Q Consensus 77 ~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~-~~~ 155 (518) .......|.. -+. .........+..++..+|.+|.++-.+.+|.+ .+..++|..+.++.+..+..... 
.++ T Consensus 101 -----~~e~~~~l~~-~~~-~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~-~~~~i~p~~~~pv~d~~~~~~~~~~~y 172 (537) T protein:vir:78 101 -----NTQLDEILQE-YFD-EDFQATIDTLVTNASKKGFEGIFARTTSEGKL-KFQTVDGLTLIPVFDDYGVLKMIIRWY 172 (537) T ss_pred -----hHHHHHHHHH-Hhh-ccHHHHHHHHHHHHhhcCeeEEEeeecCCCce-EEEEEccceeEEEEcCCCCceeEEEEE Confidence 1112222222 121 23344556778899999999999999988865 46778999888877654432111 110 Q ss_pred ----ccc-ccC----ceeEEeccccEEEEeccC-------------------------------------------C--- Q lcl|NC_021305. 156 ----AGA-GVG----TQLVSFADDEVVPIRFFN-------------------------------------------P--- 180 (518) Q Consensus 156 ----~~~-~~~----~~~~~~~~~evih~~~~~-------------------------------------------~--- 180 (518) ... ... .....+.++.+.+++... + T Consensus 173 ~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 252 (537) T protein:vir:78 173 SEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDGYQVLGRSYSK 252 (537) T ss_pred eeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeeccccccccccccccccccccCCcc Confidence 000 000 011123344444432110 0 Q ss_pred ------CCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeee Q lcl|NC_021305. 181 ------DGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMV 254 (518) Q Consensus 181 ------~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~v 254 (518) .....|.|-+......+.....+..-.++.+...+.|-.+++-- .+.+ ...++..++. .+-+.+ T Consensus 253 iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~-~~~~--~~~~~~~l~~-------~~~i~v 322 (537) T protein:vir:78 253 FPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGF-SGDS--TDKLRQNIKA-------KKMIGV 322 (537) T ss_pred eeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecC-CCcc--chhHHHHHhh-------cCceee Confidence 01124777788777777777766666666666555555444321 1211 1122222211 112223 Q ss_pred cCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCH------------HHHHHHHHHHHhhHHH Q lcl|NC_021305. 
255 VEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNI------------SAQMRAFYRDTMAIPI 322 (518) Q Consensus 255 l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~------------e~~~~~~~~~~l~P~~ 322 (518) -+++.++..+............++.+.+.|...-.+|. +.....+|-|.. .......+...++-.+ T Consensus 323 ~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~--~~~~~~gn~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~ 400 (537) T protein:vir:78 323 NGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFN--STAVGDGNVTNVVIKSRYTLLAMKARKMETSLRKVLRWCA 400 (537) T ss_pred cCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCC--CccccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH Confidence 33343333344333333344556666666655433331 111111222221 1122223333444444 Q ss_pred HHHHHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeee------c Q lcl|NC_021305. 323 ARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYA------N 396 (518) Q Consensus 323 ~~ie~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~------~ 396 (518) +.|...++.+-.... ....+.|.+..-+..|..+.++.+.+++..|++|..-+.+.+++ ++++.-....- . T Consensus 401 ~~i~~~~~~~~~~~~-d~~~i~i~f~~~~P~n~~e~a~~~~~l~~~giiS~eT~l~~~p~--vdd~e~ek~~~ee~~~~~ 477 (537) T protein:vir:78 401 DMVVSDIALRGLGEY-DSNDICFEIEPHVLANELDIATTRKTEAETEALKIGNIMTVAPR--IGDDETLKLIAEELDLDY 477 (537) T ss_pred HHHHHHHhhcCCccc-ccceeeEEeccCCCCCHHHHHHHHHHHHhcCcchHHHHHHhCCC--CCCHHHHHHHHHHHHhhh Confidence 444433322211111 12345677777789999999999999999999998888887643 33221000000 0 Q ss_pred ccccc-c--------ccccccC-----CCCCCCCCCCCCccCCCCCCCccccCCccccccch Q lcl|NC_021305. 397 SALQP-L--------GATPDGA-----VEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTN 444 (518) Q Consensus 397 ~n~~~-~--------~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (518) .+... + +..+..+ ...++.+++.++.+. 
.++ ....+.+.+...|.+ T Consensus 478 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~-~~~-~~~~~~~~~~~~~~~ 537 (537) T protein:vir:78 478 NELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQP-VAD-PNVVPPTDPNAVPQT 537 (537) T ss_pred hhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccCC-CCC-CCCCCCCCCccCCCC Confidence 00000 0 0000000 000001111000000 000 000111122222222 No 212 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=97.94 E-value=1e-05 Score=47.97 Aligned_cols=377 Identities=10% Similarity=0.056 Sum_probs=166.3 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) =|...-+. ..+....+..+.. |......-........ ..-..++....+|+..+.-+-.-|+.+--.+ + T Consensus 8 ~~i~~~~~-~~~r~~~l~~yy~---g~~~il~~~~~~~~~~--~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~-----~ 76 (429) T protein:vir:98 8 ELIQKHRS-FNLSYSAYKQLYE---GDHAILQQKQKEQYKP--DNRLVVNFAKYIVDTFNGYFIGVPVQTSHEN-----K 76 (429) T ss_pred HHHHHHHH-HHHHHHHHHHHhc---cccccccccccccCCC--cceeecchHHHHHHHHhhhhcccCceeecCC-----h Confidence 00000000 0000000000000 0000000000000000 0112345677888888888878887753211 1 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCce--eeEEeeeccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTG--RYEYYFQAGA 158 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~--~~~~~~~~~~ 158 (518) .....+..++.+ | +.......+..+.+.+|.+|+.+..+.+|.+ .+..++|..+.+..+.... ......++.. 
T Consensus 77 ~~~~~l~~~~~~-n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~~~~~~p~~~~~v~dd~~~~~~~~~i~~~~~ 151 (429) T protein:vir:98 77 QVSNYLELLDGY-N---DQDDNNAELSKICSIYGHGYELVFNDENAEA-GITYLTPLEAFIVYDDSIRQKPLFAVRYFYN 151 (429) T ss_pred HHHHHHHHHHhh-c---CHhHHHHHHHHHHhhcCeEEEEEEecCCCcE-EEEEEcccceEEEEeCCCCCceEEEEEEEEe Confidence 112233334333 2 3445677788999999999999999988875 5677889888877765322 1111111111 Q ss_pred ccCceeEEecc--------------------------ccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021305. 159 GVGTQLVSFAD--------------------------DEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWK 212 (518) Q Consensus 159 ~~~~~~~~~~~--------------------------~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ 212 (518) ........+.. =.|++++. ...|.|-+..+...+.....+..-..+.+. T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~sd~e~v~~liD~~d~~~s~~~~~~~ 226 (429) T protein:vir:98 152 KGGVLEGSYSDASNITYFKDGEKGIEIGESEPHPFDGVPMIEYVE-----NEERQSLLASVVTLINAFNKAISEKANDVE 226 (429) T ss_pred cCceEEEEEEeCceEEEEEecCCceEecccccccCCccceEEecC-----CCCCCCcHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111111111 12333332 135778888777777777666655655566 Q ss_pred ccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCC----CcceeeccCChhhHHHHHHHHHHHHHHHHHh Q lcl|NC_021305. 213 NAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEE----GMEPIPLQLTAVEMQFIEARQLNREEVCGVY 288 (518) Q Consensus 213 ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~----g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~f 288 (518) ..+.|-.+++- ...+++....++. ++++.++. 
+.+...+..+.....+....+.+.+.|+..- T Consensus 227 ~~~~p~~~i~g-~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s 293 (429) T protein:vir:98 227 YFADAYLKILG-AELDDETLKSLRD------------TRIINLKDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRTA 293 (429) T ss_pred HhcCceeeeec-CCCCcchhhhHhh------------CceeeccCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHh Confidence 66667666542 2233332222111 12333331 2233334444444446667788889999988 Q ss_pred cCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHH----HHHHHHHh------hhhhhc---ccccceecchhhhhcCH Q lcl|NC_021305. 289 DIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIAR----IQSAMDKY------VGQYWV---RKNRMKFDIDDVIQPDW 355 (518) Q Consensus 289 gVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~----ie~~l~~~------l~~~~~---~~~~~~fd~~~l~~~d~ 355 (518) ++|..-.+ . .+|.......+....+.-.+.. +...+.+. ++...+ ....+++.+.+.+..|. T Consensus 294 ~~p~~~~~--~---~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~v~f~~~~p~~~ 368 (429) T protein:vir:98 294 MVANISDE--S---FGTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIGPKDWIGIKYKFTRNLPANL 368 (429) T ss_pred CccccCcc--c---cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCcCH Confidence 88843221 1 1222111111111111111111 11111110 111111 11235667778888999 Q ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccC Q lcl|NC_021305. 356 EAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPT 435 (518) Q Consensus 356 ~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 435 (518) .+.++.+.++ .|+++..-+.++++.-+ ++..+ + ..+........+.+...- . .+++.++ T Consensus 369 ~~~a~~~~kl--~g~is~et~~~~l~~v~--d~~~E-~------~ri~~E~~~~~~~~~~~~--------~--~~~~~~~ 427 (429) T protein:vir:98 369 LEESQIAGNL--AGIVSEETQVGVLSIVE--NPQKE-I------ERKNSDKSTLISRQAGGL--------N--GQNTTTI 427 (429) T ss_pred HHHHHHHHHH--hccCchHHHHHhCCCCC--CHHHH-H------HHHHHHHHHHHHHHHhhh--------c--CCCCCCC Confidence 9999999887 68899877888886532 22111 0 011000000000000000 0 0000011 Q ss_pred Cc Q lcl|NC_021305. 
436 SV 437 (518) Q Consensus 436 ~~ 437 (518) .+ T Consensus 428 ~~ 429 (429) T protein:vir:98 428 LE 429 (429) T ss_pred CC Confidence 11 No 213 >protein:vir:4073 Length: 279 # NCBI annotation: minor structural protein # Family: family:all:11744 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043552;genbank:gi:9628686;genbank:GeneID:1261159 Probab=97.70 E-value=9.6e-07 Score=53.58 Aligned_cols=266 Identities=10% Similarity=-0.002 Sum_probs=121.5 Q ss_pred cccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcceeccchHHHHHHh----cCCcCCCHHHHHHHHHHHHH Q lcl|NC_021305. 36 ERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETEESDTGYAKLLA----DPCEYLDPFAFWEWVASTLD 111 (518) Q Consensus 36 ~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~~~~~~~~~L~~----~PN~~~s~~~f~~~~v~~ll 111 (518) .+. -.+...|.+++-..|.+.. ...+-++-+|+. --|...+-..-++.+.. |. T Consensus 1 ~~~---------------~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 57 (279) T protein:vir:40 1 MSL---------------FNLSRRAEDVSFSTFTVQD-------PTTDLLLGKLLGLVSYFDNVDYSEASKLEDLFY-WA 57 (279) T ss_pred Ccc---------------cccchhhcccceeeeeecC-------cchhHHHHHHHHHHHHhhcccchhhhhhhhhhh-hh Confidence 000 0112233333333333210 011111111211 11222222222222222 12 Q ss_pred HcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeec--c------cccCceeEEeccccEEEEeccCCCCc Q lcl|NC_021305. 112 IYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQA--G------AGVGTQLVSFADDEVVPIRFFNPDGL 183 (518) Q Consensus 112 ~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~--~------~~~~~~~~~~~~~evih~~~~~~~~~ 183 (518) +.| ..|..++. |+..+|--.+ . 
.......+++|-.++..|-.+ T Consensus 58 ~~~----------------------~~~~~~~~--~~~~~~~~~~~~d~fn~~vr~~~~~~vtVP~~Dv~IieNP----- 108 (279) T protein:vir:40 58 LQG----------------------KEVYRVWY--GGFKYYAQRVNADQFNIVVREPNRREVTIRTNDYEMLLNP----- 108 (279) T ss_pred hcc----------------------ceeehhhh--hhHHHHHhhcCcchhhhheecCCcceeEeecchhhhhhcc----- Confidence 222 22211111 1111110000 0 001112233444444333322 Q ss_pred ccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccC-CHHHHHHHHHHHHHHhcCccccCCeeecCCCccee Q lcl|NC_021305. 184 ERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRL-SEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPI 262 (518) Q Consensus 184 ~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~-~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~ 262 (518) .+|.-+- ....-++.. ......-+.+.+..+++++++-.. ..+..++.+.+++.+..++++=+++.+++.|-+++ T Consensus 109 lv~v~~e-e~~kM~~la---~nai~~KLD~~~qIk~fIKTd~d~glee~kekaR~rIk~mlalAk~~nGityid~~ddIt 184 (279) T protein:vir:40 109 FYGANPQ-RFGVMFGMA---SNGIGRRLDSQAQIKIYWKTKVSSGLKEVWDRIRERLTQQQQLAREFNGVSVIGSDDDIK 184 (279) T ss_pred hheeccc-hhhHHHHHH---HhhhhhhhcccceeeeEEecCcchhHHHHHHHHHHHHHHHHHHHHhcCCeeeecCCceeE Confidence 2333222 111112222 122222336677888888888653 45667888899999888877768899999999999 Q ss_pred eccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhhhhccccc Q lcl|NC_021305. 263 PLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQYWVRKNR 342 (518) Q Consensus 263 ~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~ 342 (518) ++..+-... ..+-.++.+.+.+..+|||..++- ++..|.+...|+..+|.|++++.+..|.. 
.+ T Consensus 185 QL~kDYSts-lk~die~lkS~l~Sq~GinekIL~------GsAtE~q~iAyy~rtVePILkQyek~liY---~~------ 248 (279) T protein:vir:40 185 QIQPDYSGS-LQNDANLAIEIALSEYGMPRELLY------GQSNEVTIIAFAIQKVLPLLKQHDKNIIF---NQ------ 248 (279) T ss_pred eeccccccc-cHHHHHHHHHHHHhhcCCchhhcc------ccCchhhhhhHHHhhHHHHHHHhcccccc---hh------ Confidence 998654443 245567778889999999998873 45568899999999999999997763321 10 Q ss_pred ceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcc Q lcl|NC_021305. 343 MKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKAD 391 (518) Q Consensus 343 ~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD 391 (518) +|.+.-+ .+-...|.+ |---...+-+|+. -| T Consensus 249 -E~fv~y~------------ttta~gg~~--~s~~~~~~~~~~~---~~ 279 (279) T protein:vir:40 249 -ENFVAYI------------STTAKGGAI--ESKSSKRDSEPVG---ND 279 (279) T ss_pred -hhhhhhh------------eecccCccc--ccccccccCCCCC---CC Confidence 1110000 000011111 0000111222331 12 No 214 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=97.57 E-value=4.4e-05 Score=44.45 Aligned_cols=438 Identities=10% Similarity=0.086 Sum_probs=174.6 Q ss_pred CcCCCCC--------CCCcccccccchhhhhhhcc---ccccccc-----ccccchhhhHHHhhcHHHHHHHHHHHHhhc Q lcl|NC_021305. 1 MLLANGQ--------TLSAPAMAELSPQMQDSYYY---APAVGMQ-----LERQFSLYGGIYKNQPWVRTVIAKRAQALA 64 (518) Q Consensus 1 ~~f~~~~--------~~~~~~~~~~~~~~~~~~~~---~~~~~~~-----~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia 64 (518) =|||+.= ....|+.....+-....-++ ....+.. .......|+++ +.+|.|..||+.|.+.+. 
T Consensus 3 ~lfgf~~~~~~~~~~~~~s~~~p~~ddg~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~m-a~~pEvd~Av~eIVneai 81 (558) T protein:vir:10 3 KLFGFSIEETQKKSTSIISPVPKNNEDGVDNFISSGFYGQYVDIEGAYRSEYDLIRRYREM-ALHPEADGAIEDVVNEAI 81 (558) T ss_pred chhcchhhhhhhhccCCccccCCCccccccceeccceeeeeecccchhhhHHHHHHHHHHH-hhccchhhHHHHhhccee Confidence 3566422 11122111111111100001 1111100 00112234443 778999999999999875 Q ss_pred cC-----ceEEEEecCCcceecc---chHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCC---CceEEEEe Q lcl|NC_021305. 65 RL-----PVKCMFTSGDTETEES---DTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS---GTPEKLMP 133 (518) Q Consensus 65 ~l-----~~~v~~~~~~~~~~~~---~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~---G~~~~l~~ 133 (518) -+ |+.|--++-+.....+ ...+..++.-=|-...++ .+++.|.+.|..|+.++-|.. ..+.+|+. T Consensus 82 v~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~----e~fR~WYVDgRiyfHKiid~k~pk~GI~ELr~ 157 (558) T protein:vir:10 82 VSDLYDSPVEVELSNLNASNTLKKKIREEFRYIKEMMDFDKKSH----EIFRNWYVDGRVFYLKVIDTKNPQEGIQDLRY 157 (558) T ss_pred EecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhh----HHHhhheeeeEEEEEEEEeCCCccccceeeee Confidence 43 2222212111111011 111111222122233344 456678999999999876533 35899999 Q ss_pred eCCceeEEEEcCC----------------c------eeeEEeeecccc---------cCceeEEeccccEEEEec--cCC Q lcl|NC_021305. 134 MHPSRVAIKRNSR----------------T------GRYEYYFQAGAG---------VGTQLVSFADDEVVPIRF--FNP 180 (518) Q Consensus 134 l~p~~v~v~~~~~----------------~------~~~~~~~~~~~~---------~~~~~~~~~~~evih~~~--~~~ 180 (518) |+|..++.+.... + ...+|.|..... ..+..+.++.+-|.+... ... T Consensus 158 lDPr~i~~Vr~i~~~~~~~~~~~~~~~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~~~~~~~~vkI~~dAI~y~hSGL~d~ 237 (558) T protein:vir:10 158 IDPLKIKFIRQEKRKPGNQDPAIRVRSEQDVVPNPEFEEFYIYTPKVQHPTGMVGQMGGKNSIKIAKDSITMCTSGLVDR 237 (558) T ss_pred eCcccceeeeeeccccccccceeeeecccceeeccceeEeeeecCCcccccccceeecCCCceeechhheeeecccceec Confidence 9999986544320 1 112222221110 122234455444433332 112 Q ss_pred CCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHHHHHhcC----ccccCCe--- Q lcl|NC_021305. 
181 DGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSG----SSNTGKT--- 252 (518) Q Consensus 181 ~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~~~~~~g----~~n~g~~--- 252 (518) ++. .-+|-|..+...+.....++....-+--..+.-+=|+.++ +++.+...++.-..+-..|+. ....|.+ T Consensus 238 ~~~-~i~syLhkAIKp~NQLkmlEDAlVIYRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~dd 316 (558) T protein:vir:10 238 NKN-RVLSYLHKAIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDANTGEVRDD 316 (558) T ss_pred CCC-eeeecchHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceeccc Confidence 222 2367788888888777777766665544445555454443 455554444433333333321 0111211 Q ss_pred ---e-ec----------CCCcceeeccC--ChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHH----- Q lcl|NC_021305. 253 ---M-VV----------EEGMEPIPLQL--TAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMR----- 311 (518) Q Consensus 253 ---~-vl----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~----- 311 (518) + +| ..|.++..|.. +..++ +-..+..+.+..+++||.+-|+....-+.+...+-.+ T Consensus 317 rk~msMlEDyWLpRReGgrgTEItTLpGgqnLgem---~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF 393 (558) T protein:vir:10 317 RKFMSMMEDFWLPRREGGRGTEITTLPGGQNLGEL---SDVDYFQKKLYRALGVPESRIAAEGGFNLGRSSEILRDELKF 393 (558) T ss_pred chhhhhHhhhcccccCCCCccceeeccccCCcchH---HHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHH Confidence 1 11 13566666654 33333 3445667889999999999886544333333322222 Q ss_pred -HHHHHHhhHHHHHHHHHHHHhhh-----hhhc--c-cccceecc--hh----hhhc-CHHHHHHHHHHHHh--CCCcCH Q lcl|NC_021305. 312 -AFYRDTMAIPIARIQSAMDKYVG-----QYWV--R-KNRMKFDI--DD----VIQP-DWEAKSESTQKMVN--SGVATP 373 (518) Q Consensus 312 -~~~~~~l~P~~~~ie~~l~~~l~-----~~~~--~-~~~~~fd~--~~----l~~~-d~~~~~~~~~~~~~--~G~~T~ 373 (518) -|+..--.-+...+.+.|...|+ ++.+ . ...++|++ +. +... =+..|+.++..+-. .-++|. 
T Consensus 394 ~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~ 473 (558) T protein:vir:10 394 AKFVGRLRKRFAAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYST 473 (558) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccch Confidence 22222222333334444444432 1111 1 12333433 21 1111 12234444443311 124466 Q ss_pred HHHHH-HhCCCCCC---------CCCcceeee-cccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCcccccc Q lcl|NC_021305. 374 NEGRE-IMGLPRSD---------DPKADELYA-NSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSP 442 (518) Q Consensus 374 NE~R~-~~g~~p~~---------~~~gD~~~~-~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 442 (518) +=+|+ .+.+...+ .+-.+-++. |....|+. ++..+.+. +....+....+.+.+.+..+ T Consensus 474 dyi~k~ILr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~~~~---~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~ 542 (558) T protein:vir:10 474 EYVRKRVLRQTDMEIEEIDTQIEDEIQKGIIPDPSQIDPIT---GEPLPQEG--------DPAMEGMGEQPVDPDLEAQA 542 (558) T ss_pred HHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCccccChhh---ccccCccC--------CchhccCCCCCcccccccch Confidence 66654 34432211 000000010 11111110 00000000 00000001111111111111 Q ss_pred chhcchhhHHHHHHHH Q lcl|NC_021305. 443 TNSDRSTDSGKTEPRR 458 (518) Q Consensus 443 ~~~~~~~~~~~~~~~~ 458 (518) ...+...+++-.+++. T Consensus 543 ~~~~~~~~~~~~~~~~ 558 (558) T protein:vir:10 543 QAVDAQYSKDTKKAEL 558 (558) T ss_pred hhhhhhhhhhhhhhcC Confidence 1111111222222222 No 215 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=97.41 E-value=7.4e-05 Score=43.24 Aligned_cols=382 Identities=9% Similarity=0.025 Sum_probs=161.0 Q ss_pred CcCC--CCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcc Q lcl|NC_021305. 
1 MLLA--NGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTE 78 (518) Q Consensus 1 ~~f~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~ 78 (518) +|.. .|.. ........ .+. ... .... ....-..++....+|+..+.-+-+-|+.+--.++ T Consensus 47 ~~~~Yy~g~~-~i~~~~~~-~~~----~~~---~~~~------~~~~ki~~n~~k~ivd~~~~yl~g~p~~~~~~~~--- 108 (478) T protein:vir:10 47 MGERYYNHHP-DILDAPFK-RDV----NGD---YDET------KPDWRMYTNYHQNLVDQKVAYAVANPVTFGVDND--- 108 (478) T ss_pred HHHHHhcccc-cccccchh-hhc----ccc---cccc------cccceeccchHHHHHHHHhhhhcccCceeecCCh--- Confidence 1110 1110 00000000 000 000 0000 0000122456678888888888888887632111 Q ss_pred eeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCC-ceeeEEee-ec Q lcl|NC_021305. 79 TEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSR-TGRYEYYF-QA 156 (518) Q Consensus 79 ~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~-~~~~~~~~-~~ 156 (518) +. ...+..++. | +.......+..+.+.+|.+|..+-.+.+|.+ .+..++|..+.+..+.. .+.+.+.+ .+ T Consensus 109 -~~-~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~~~~~~p~~~~~v~d~~~~~~~~~~ir~~ 180 (478) T protein:vir:10 109 -KA-LKQIQHTLN--H---KWDDKLVDILTAASNKGIEWVQPYVDEEGEF-KTFRVPAEQAVPIWTNKERDELQAFIRVY 180 (478) T ss_pred -HH-HHHHHHHHh--c---cHHHHHHHHHHHHhhCCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEE Confidence 11 111222222 2 4556667778999999999999888888864 57778999888777542 22211111 11 Q ss_pred ccccCceeEEeccccEEEEeccC-------------------------C---------CCcccCchHHHHHHHHHHHHHH Q lcl|NC_021305. 157 GAGVGTQLVSFADDEVVPIRFFN-------------------------P---------DGLERGLSLMESLKSTIFSEDS 202 (518) Q Consensus 157 ~~~~~~~~~~~~~~evih~~~~~-------------------------~---------~~~~~G~s~l~~~~~~i~~~~~ 202 (518) ..........+..+.|.+++... + .....|.|-+..+...+..... 
T Consensus 181 ~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~ 260 (478) T protein:vir:10 181 ELDGAERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDK 260 (478) T ss_pred eeeCceEEEEEeCCcEEEEEecCCeeeccccccccccccceecccccccCCcceEEEeccCCCCCCcHHHHHHHHHHHHH Confidence 11011111112233333222110 0 0112477777777666666665 Q ss_pred HHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeec--CCCcceeeccCChhhHHHHHHHHHH Q lcl|NC_021305. 203 SRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVV--EEGMEPIPLQLTAVEMQFIEARQLN 280 (518) Q Consensus 203 ~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl--~~g~~~~~l~~~~~d~~~~e~~~~~ 280 (518) +..-..+.+...+.|-.+++--. . +....+...+.. .+++.+ +.|.++..+..+.....+.+..+.. T Consensus 261 ~~S~~~~~~~~~~~~~~~~~g~~-~--~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l 329 (478) T protein:vir:10 261 RLSDTQNTFDESVELIYILKGYE-G--EDMKDFMHNLKY--------YKAISVAGESGSGVDTIKVEVPIDSVKEYTKML 329 (478) T ss_pred HHHHHHHHHHHhhCcceeeecCC-c--ccccchhhhhhh--------CceeEecCCCCCcceEEeecCCHHHHHHHHHHH Confidence 55444444444455544443211 1 111111111111 123333 2334444444445556677888888 Q ss_pred HHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhH----HHHHHHHHHHH---hhhhhhc---ccccceecchhh Q lcl|NC_021305. 281 REEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAI----PIARIQSAMDK---YVGQYWV---RKNRMKFDIDDV 350 (518) Q Consensus 281 ~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P----~~~~ie~~l~~---~l~~~~~---~~~~~~fd~~~l 350 (518) .+.|...-++|..-.+.. 
.++ .+..+ ..+....+.- ....++..+.+ .++...+ ....+++.+..- T Consensus 330 ~~~I~~~s~~p~~~~~~~-~~n-~Sg~A--i~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~d~~~i~i~f~~~ 405 (478) T protein:vir:10 330 RDYIIEFGQGVDFQQDKF-GNS-PSGIA--LKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRLDVRVQDIEITFNFN 405 (478) T ss_pred HHHHHHHhCCcCcCcccc-ccc-hHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccceEEeCCC Confidence 889999888884322111 111 11111 1111111111 11111111111 1111111 122355666677 Q ss_pred hhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCC Q lcl|NC_021305. 351 IQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLD 430 (518) Q Consensus 351 ~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 430 (518) +..|..+.++.+.++ .|++|...+.+.++. ++++..+ +..+........ +..+...++.+.+ +. T Consensus 406 ~p~~~~e~~~~~~~~--~g~iS~et~i~~~~~--v~d~~~E-------~~ri~~E~~~~~--~~~~~~~~~~~d~---~~ 469 (478) T protein:vir:10 406 VMVNELENSQIAMNS--TGLLSKETILGNHSW--VQDPVAE-------MERIEQENIELN--QQLPDIEEGLNDE---QQ 469 (478) T ss_pred CCCCHHHHHHHHHHH--hCCCChHHHHHhCCC--CCCHHHH-------HHHHHHHHHHHH--HhccccCCCCccc---cc Confidence 888899998888776 688988778877754 2221110 000100000000 0000000000000 00 Q ss_pred ccccCCccc Q lcl|NC_021305. 431 QSPPTSVPG 439 (518) Q Consensus 431 ~~~~~~~~~ 439 (518) .+..+.+.+ T Consensus 470 ~~~~d~~~e 478 (478) T protein:vir:10 470 RQSEDNQSE 478 (478) T ss_pred ccCcCCCCC Confidence 000001111 No 216 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=97.21 E-value=0.00013 Score=41.89 Aligned_cols=375 Identities=10% Similarity=0.042 Sum_probs=164.5 Q ss_pred CcCCCCCC-CCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 
1 MLLANGQT-LSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) =.+.+.-. ...+.... .....+.....+ . ...-..++.....|+..+.-+-+-|+.+--.++. T Consensus 29 ~Yy~g~~~I~~~~~~~~----~~~~~~~~~~~~---~------~~~ki~~n~~k~Iv~~~~~yl~G~p~~~~~~d~~--- 92 (470) T protein:vir:10 29 NYYENKTDITTRNNGKA----KLNKEGKKDPLR---S------ADNRIPSNFYQLLVDQEAGYVASVFPDIDVGKDA--- 92 (470) T ss_pred HHhccccchhccccchh----cccccccccccc---c------CCcccccchHHHHHHhhhhheeccceeeecCchH--- Confidence 01111110 00000000 000000000000 0 0011223445667788888887888776322211 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCc--eeeEE--eee Q lcl|NC_021305. 80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRT--GRYEY--YFQ 155 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~--~~~~~--~~~ 155 (518) .. ..+..++.. +..+-...+..++..+|.+|.++-++..|.+ .+..++|..+.++.+... ..... .|. T Consensus 93 -~~-~~l~~~~~~-----~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~-~~~~~~p~~~~~v~d~~~~~~~~a~ir~y~ 164 (470) T protein:vir:10 93 -DN-KKIIDVLGD-----DRALTLNGLLVDSSNAGRAWLHYWIDEDGNF-RYGIIQPDQITPIYATTLDNKLLGILRSYK 164 (470) T ss_pred -HH-HHHHHHHhh-----hHHHHHHHHHHHHhhcCeeEEEEEecCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEE Confidence 11 122222221 3444455678899999999999999988864 577789999888876542 11111 111 Q ss_pred cccccCce----eEEeccccEEEEeccCC----------------------------------------CCcccCchHHH Q lcl|NC_021305. 156 AGAGVGTQ----LVSFADDEVVPIRFFNP----------------------------------------DGLERGLSLME 191 (518) Q Consensus 156 ~~~~~~~~----~~~~~~~evih~~~~~~----------------------------------------~~~~~G~s~l~ 191 (518) .....+.. ...+....+.|++.... .....|.|-+. 
T Consensus 165 ~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e 244 (470) T protein:vir:10 165 QLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSKNKYRLPELN 244 (470) T ss_pred eeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccCCCeeeEEEeecCCCCCCchh Confidence 11111111 11122333333221000 00124777787 Q ss_pred HHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecC-----CCcceeeccC Q lcl|NC_021305. 192 SLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVE-----EGMEPIPLQL 266 (518) Q Consensus 192 ~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~-----~g~~~~~l~~ 266 (518) .....+.....+..-..+.+...+.|-.+++--...+.+ ++...+.. .+++.++ .+.++.-+.. T Consensus 245 ~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~---~~~~~~~~--------~~~i~~~~~~~~~~~~~~~lt~ 313 (470) T protein:vir:10 245 KYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLH---QFMNDLRK--------YKSIKINNTGNGDNSGVDKLQI 313 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccc---hhhhhhhh--------cCeEeccCCCCCcCceeEEEee Confidence 777777776666655556666556666665432211111 12221111 1222232 1233333443 Q ss_pred ChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH-------------HHHHHHHHHhhHHHHHHHHHHHHhh Q lcl|NC_021305. 267 TAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA-------------QMRAFYRDTMAIPIARIQSAMDKYV 333 (518) Q Consensus 267 ~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~-------------~~~~~~~~~l~P~~~~ie~~l~~~l 333 (518) ......+....+.+.+.|...-++|..- ....++ .+..+ .....+..+++-.+..|...++ T Consensus 314 ~~~~~~~~~~~~~L~~~I~~~s~~p~~~--~~~~gn-~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~--- 387 (470) T protein:vir:10 314 DIPVEARDDALKITRKNIFLFGQGIDPA--NFESSN-ASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLN--- 387 (470) T ss_pred cCChHHHHHHHHHHHHHHHHHhCCCCCC--cccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--- Confidence 3344455677788888888888887421 111111 11111 1112222233333333322111 Q ss_pred hhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCC Q lcl|NC_021305. 
334 GQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWE 413 (518) Q Consensus 334 ~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~ 413 (518) ........+.+.+..-+..|..+.++.+.++ .|++|..-+++++++ ++++. .+ +..+... ..+.. T Consensus 388 -~~~~d~~~i~i~f~~~~p~d~~e~~~~~~~~--~g~iS~et~l~~~p~--v~D~~-~E------~eri~~E---~~e~~ 452 (470) T protein:vir:10 388 -FSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQ-QE------LKDLAKD---KEEND 452 (470) T ss_pred -ccCcccceeeEEeccCCCCCHHHHHHHHHHH--hccCcHHHHHHhCCC--CCCHH-HH------HHHHHHH---HHHHH Confidence 1111223566777888999999999999887 689998888888754 33221 11 1111100 00000 Q ss_pred CCCCCCCCccCCCCCCCccc Q lcl|NC_021305. 414 EAPAPKRPASTPVASLDQSP 433 (518) Q Consensus 414 ~~~~~~~~~~~~~~~~~~~~ 433 (518) + ...+....+..+.+++. T Consensus 453 ~--~~~~~~~~~~~~~dde~ 470 (470) T protein:vir:10 453 P--YSNQADELNGKGVNDEQ 470 (470) T ss_pred H--hhccccccCCCCCCCCC Confidence 0 00000000000111100 No 217 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=97.18 E-value=0.00014 Score=41.71 Aligned_cols=428 Identities=9% Similarity=0.058 Sum_probs=171.3 Q ss_pred CcCCCCCC------CCcccccccchhhhhhhccccccccccc---------ccchhhhHHHhhcHHHHHHHHHHHHhhcc Q lcl|NC_021305. 1 MLLANGQT------LSAPAMAELSPQMQDSYYYAPAVGMQLE---------RQFSLYGGIYKNQPWVRTVIAKRAQALAR 65 (518) Q Consensus 1 ~~f~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~v~~~v~~ia~~ia~ 65 (518) -|||+.-. .+++......+.+...-++.-..-+... 
.....|+++ +.+|.|..||+.|.+.+.- T Consensus 3 ~lfgf~i~~~~~~~~~S~vpp~~~~~~~~i~~g~~g~~v~~~g~~~~~n~~eLI~~YR~m-a~~pEVd~Av~eIVneaIv 81 (564) T protein:vir:10 3 QLFGFLINEKEGQKGQSPVPPNDEASVSTVAGGYFGTYVDTSGGQNSRNEYELIRRYRDM-SLHPEVDSAIDEIVNEFVV 81 (564) T ss_pred chhcceeeeeccCCCCCcccCCcCCChhhhhccccceeeecccccchhhHHHHHHHHHHH-hhccchhhHHHHhhcceeE Confidence 57775321 1111111111111111111100000010 112234444 7789999999999998643 Q ss_pred C-----ceEEEEecCCccee---ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCC---CceEEEEee Q lcl|NC_021305. 66 L-----PVKCMFTSGDTETE---ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS---GTPEKLMPM 134 (518) Q Consensus 66 l-----~~~v~~~~~~~~~~---~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~---G~~~~l~~l 134 (518) + |+.|--.+-+-... .-...+..++.-=|-...++ .+++.|.+.|..|+.++-+.. ..+.+|+.| T Consensus 82 ~d~~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~ll~F~~~~~----e~fR~WYVDgRi~fHkiid~~~pk~GI~eLr~l 157 (564) T protein:vir:10 82 NDGDDKPVEVDLQNLEIGSGVKKKIRDEFNRILRMMNFNVNAH----EIIRNWYVDGRSHYHKVIDLDNPKKGILELRYI 157 (564) T ss_pred ecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhh----HHHhhhhhcceEEEEEEeeCCChhhhhhhhhhh Confidence 2 22221111111011 00111111222222333444 446678899999999876532 238899999 Q ss_pred CCceeEEEEcC------Ccee---------------eEEeeeccccc-------------CceeEEeccccEEEEecc-- Q lcl|NC_021305. 135 HPSRVAIKRNS------RTGR---------------YEYYFQAGAGV-------------GTQLVSFADDEVVPIRFF-- 178 (518) Q Consensus 135 ~p~~v~v~~~~------~~~~---------------~~~~~~~~~~~-------------~~~~~~~~~~evih~~~~-- 178 (518) +|..++.++.. .+.. -+|.|...... 
.+..+.++.+.|.|.+.- T Consensus 158 DPr~i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~~g~~~~~~~~~~~~~~~~ikI~~daI~y~hSGL~ 237 (564) T protein:vir:10 158 DSLKIRKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGFAGNIPMVTGSMDWSNQEGIKIASDAIAQSTSGLM 237 (564) T ss_pred cccceeeeeeeccccccccceeeeeeeeeccccccccceeeccccccCcccccccccccccccceeechhhcceecccce Confidence 99977655421 1111 12222211111 123466777777776642 Q ss_pred CCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHHHHHhcC----ccccCCe- Q lcl|NC_021305. 179 NPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSG----SSNTGKT- 252 (518) Q Consensus 179 ~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~~~~~~g----~~n~g~~- 252 (518) +.++. .-+|-|..+...+.....++....-+--..+.-+=|+.++ +++.+...++....+-..|+. ....|.+ T Consensus 238 d~~~~-~i~gyLhkAIKp~NQLkmlEDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGevr 316 (564) T protein:vir:10 238 DLNKK-MTLSFLHKAIKSLNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSRYRNKLVYDGQTGEIR 316 (564) T ss_pred eCCCC-ceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceec Confidence 22222 2467788888888777777766665544445555454443 455554444433333333321 0111211 Q ss_pred -----e-ec----------CCCcceeeccC--ChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc-ccccCCHHHHH--- Q lcl|NC_021305. 253 -----M-VV----------EEGMEPIPLQL--TAVEMQFIEARQLNREEVCGVYDIAPPIVHILD-RATFSNISAQM--- 310 (518) Q Consensus 253 -----~-vl----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~~sn~e~~~--- 310 (518) + +| ..|.+++.|.. +..++ +-..+..+.+..+++||.+-|.... .-+.+...+-. 
T Consensus 317 ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem---~DV~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~~EItRDE 393 (564) T protein:vir:10 317 DDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGEL---KDVEYFKKKLYNSLNLPPSRLTDDNKAFNLGKSTEILRDE 393 (564) T ss_pred ccchhhhhHhhhcccccCCCcccceeeccccCCcchH---HHHHHHHHHHHHHhCCCcccccCCCceeecccccchhHHH Confidence 1 11 13566666654 33333 3445667889999999999886532 12222122221 Q ss_pred ---HHHHHHHhhHHHHHHHHHHHHhhh-----hhhc--c-cccceecc--hh----hhhcC-HHHHHHHHHHH--HhCCC Q lcl|NC_021305. 311 ---RAFYRDTMAIPIARIQSAMDKYVG-----QYWV--R-KNRMKFDI--DD----VIQPD-WEAKSESTQKM--VNSGV 370 (518) Q Consensus 311 ---~~~~~~~l~P~~~~ie~~l~~~l~-----~~~~--~-~~~~~fd~--~~----l~~~d-~~~~~~~~~~~--~~~G~ 370 (518) .-|+..--.-+...+.+.|...|+ ++.+ . ...++|++ +. +.... +..|+.++..+ +-.-+ T Consensus 394 iKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky 473 (564) T protein:vir:10 394 LKFTKFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKY 473 (564) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 222222223333334444444433 2111 1 12333333 21 11111 22333333332 11123 Q ss_pred cCHHHHHH-HhCC----------------------CCCCCCCcceeeec-ccccccccccccCCCCCCCCCCCCCccCCC Q lcl|NC_021305. 371 ATPNEGRE-IMGL----------------------PRSDDPKADELYAN-SALQPLGATPDGAVEWEEAPAPKRPASTPV 426 (518) Q Consensus 371 ~T~NE~R~-~~g~----------------------~p~~~~~gD~~~~~-~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 426 (518) +|.+=+|+ .|.+ +|.+...||-.-+. ..+.|.+....+....++......+ T Consensus 474 ~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~----- 548 (564) T protein:vir:10 474 FSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNMLDDMEKQNQAFAPELQAAQDDLAAEREIKKLNS----- 548 (564) T ss_pred cchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhcCCCccCCCCcCCcchhhhccccccccChhhhcc----- Confidence 35544443 2222 22222223211000 0111111110000000000000000 Q ss_pred CCCCccccCCccccccchhcchhhHHHHHHHHhhcc Q lcl|NC_021305. 
427 ASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQK 462 (518) Q Consensus 427 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k 462 (518) ..+.++....+..+ + | T Consensus 549 --a~~~~~~~~~~~~~--------~----------~ 564 (564) T protein:vir:10 549 --APKPPPSQQSKSQS--------N----------K 564 (564) T ss_pred --CCCCCCCCCCcCcC--------C----------C Confidence 00000000000000 0 0 No 218 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=97.02 E-value=0.00021 Score=40.76 Aligned_cols=371 Identities=11% Similarity=0.019 Sum_probs=155.9 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) -.+.+.............. .......+ ..=..++....+|+..+.-+-+-|+.+--.+. . T Consensus 50 ~yY~g~~~i~~~~~~~~~~----------~~~~~~~~------~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~----~ 109 (468) T protein:vir:96 50 RYYNHQPDVLFNAPKRNVK----------GEIDPFKP------DWRMYTNYHQNLVDQKVAYAVANPVTYGTEDE----K 109 (468) T ss_pred HHhcCCCcccccccccccc----------cccccccc------ccccccchHHHHHHHHHhhhccCCceeccCCh----H Confidence 1111111110000000000 00000000 00112345566777777777777777532111 1 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCC--ceeeEEeeeccc Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSR--TGRYEYYFQAGA 158 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~--~~~~~~~~~~~~ 158 (518) ....+..++. | +.......+..+.+.+|.+|+.+-.+.+|. ..+..++|..+.++.+.. +....+...+.. 
T Consensus 110 -~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~-~~i~~~~p~~~~~v~~~~~~~~~~~~ir~~~~ 182 (468) T protein:vir:96 110 -SLKTIQEVLN--H---KWDDKLVDILTAASNKGVEWIQPYVDEQGE-FKTFRVPAEQAIPIWTNKERDELKAFIRLYEL 182 (468) T ss_pred -HHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeEEEEEEEcCCCc-eEEEEEcccceEEEEcCCCCCceEEEEEEEEe Confidence 1222333332 2 445566778899999999999888888876 457778898888776543 111111111111 Q ss_pred ccCceeEEeccccEEEEecc-------------------------CC---------CCcccCchHHHHHHHHHHHHHHHH Q lcl|NC_021305. 159 GVGTQLVSFADDEVVPIRFF-------------------------NP---------DGLERGLSLMESLKSTIFSEDSSR 204 (518) Q Consensus 159 ~~~~~~~~~~~~evih~~~~-------------------------~~---------~~~~~G~s~l~~~~~~i~~~~~~~ 204 (518) ........+....+.|++.. ++ .....|.|-+..+...+.....+. T Consensus 183 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~ 262 (468) T protein:vir:96 183 DGGERVEYWTANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIPFKNNPQEVSDLFMYKTIIDAMDKRL 262 (468) T ss_pred cCceEEEEEeCCeEEEEEEcCCceeecccccccccccceeeccccccCCcccEEEecCCCCCCCchHHHHHHHHHHHHHH Confidence 01111111222222222110 00 011347777777666666666555 Q ss_pred HHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecC--CCcceeeccCChhhHHHHHHHHHHHH Q lcl|NC_021305. 
205 NATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVE--EGMEPIPLQLTAVEMQFIEARQLNRE 282 (518) Q Consensus 205 ~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~--~g~~~~~l~~~~~d~~~~e~~~~~~~ 282 (518) .-..+.+...+.|..+++-- ...+ .+.+...+ ..++++.++ ++.+...+..+.....+....+.+.+ T Consensus 263 S~~~~~~~~~~~p~lv~~g~-~~~~--~~~~~~~~--------~~~~~i~~~~d~~~~~~~l~~~~~~~~~~~~~~~l~~ 331 (468) T protein:vir:96 263 SDTQNTFDEATELIYVLKGY-EGED--LEEFMYNL--------KYYKAINVDGDGSGGVDTIQIDVPVQSAKEYLDMLRD 331 (468) T ss_pred HHHHHHHHHhcCceeeeecC-Cccc--cchhhhhh--------hcCceEEecCCCCCcceEEeecCChHHHHHHHHHHHH Confidence 55555556666665555421 1111 11111111 112344443 33334444444445556677888888 Q ss_pred HHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHh----hHHHHHHHHHHHH---hhhhhhc---ccccceecchhhhh Q lcl|NC_021305. 283 EVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTM----AIPIARIQSAMDK---YVGQYWV---RKNRMKFDIDDVIQ 352 (518) Q Consensus 283 ~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l----~P~~~~ie~~l~~---~l~~~~~---~~~~~~fd~~~l~~ 352 (518) .|...-++|..-.. ...++ .+.++. .+....+ .-....+...+.+ .++.-.+ ....+.+.++.-+. T Consensus 332 ~I~~~s~~p~~~~~-~~~~n-~Sg~Al--k~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~~d~~~i~i~f~~~~p 407 (468) T protein:vir:96 332 YVIEFGQGVDFQQD-KFGNS-PSGIAL--KFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLSIKVQDVEITFNFNVM 407 (468) T ss_pred HHHHHhCccccccc-ccccc-hHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCC Confidence 88888888843211 11111 222211 1111111 1111111111111 1111111 11234555566667 Q ss_pred cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCcc Q lcl|NC_021305. 353 PDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQS 432 (518) Q Consensus 353 ~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 432 (518) .|..+.++. +...|++|.-.+++.++. ++++.. + +..+.. . +.. ..... ..-.+.+++. 
T Consensus 408 ~d~~e~a~~---~~~~g~iS~et~i~~l~~--v~D~~~-E------~~ri~~---E----~~~-~~~~~-~~~~~~~~~~ 466 (468) T protein:vir:96 408 VNELEQSQI---GVNSQYLSKETVVTNHPW--VDDPVA-E------MERIDQ---E----ELA-LPSIE-EGLNGKENNE 466 (468) T ss_pred cCHHHHHHH---HHhcCCCchHHHHHhCCC--CCCHHH-H------HHHHHH---H----HHH-HHHHh-hccCCCCCCC Confidence 777666554 456799998888887743 222210 1 111110 0 000 00000 0000111111 Q ss_pred cc Q lcl|NC_021305. 433 PP 434 (518) Q Consensus 433 ~~ 434 (518) |+ T Consensus 467 ~~ 468 (468) T protein:vir:96 467 PT 468 (468) T ss_pred CC Confidence 11 No 219 >protein:vir:105154 Length: 525 # NCBI annotation: conserved phage-related protein # Family: family:all:6660 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398597;genbank:gi:80159853;genbank:GeneID:3772992 Probab=96.93 E-value=0.00025 Score=40.30 Aligned_cols=411 Identities=9% Similarity=0.063 Sum_probs=168.2 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhc-HHHHHHHHHHHHhhccCceEEEEecCCcce Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQ-PWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~ 79 (518) =||++|.......- .+-.|+. ..+- +.......++. ...-+-|--+=+.|-+||.--|+.+.-... T Consensus 52 ~~~~ng~i~~v~~~-~l~~~f~----npd~--------~~~~i~~l~~y~yi~~~~v~ql~~li~~lp~l~y~i~~~~~~ 118 (525) T protein:vir:10 52 DLCNNGKIKTVNLD-TLQLWFN----NPDK--------YINNIVNLLTYYYIIDGNVFQLYDLIFSLPPLDYQIKVLKRD 118 (525) T ss_pred HhhcCCceeeeeHH-HHHhhhc----ChHH--------HHHHHHHHHHHhhhhcchHHHHHHHHHhcCCcceeehhhhhc Confidence 45666654332221 1222221 1110 00000000000 111122233334444555433433221111 Q ss_pred eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceE-----EEEeeCC------ceeEEEEc---- Q lcl|NC_021305. 
80 EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPE-----KLMPMHP------SRVAIKRN---- 144 (518) Q Consensus 80 ~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~-----~l~~l~p------~~v~v~~~---- 144 (518) ...+.-+..++..-....-..++-+.+..++...|.-.-.-. ...-.|. .+.++-| ..|-|+.- T Consensus 119 k~~~~~~s~~n~~l~k~i~hk~ltrdll~q~a~~gtlig~wl-g~~~~py~~vf~~~kyvfp~~r~~g~~v~vid~~~f~ 197 (525) T protein:vir:10 119 KDYKEDLSTINLYLEKKIQHKQLTRDLLVQLAHSGTLIGTWL-GSKREPYFNVFNNLKYVFPYGRAKGKMVAVIDLQWFD 197 (525) T ss_pred cchhhHHHHHHHHHHHhHHHHHHHHHHHHHhhccCceeEeee-cCCCCcchhhhhhhhhhccccccCCceEEEEehHHhh Confidence 111111222222111111112222333333333343111000 0000010 1111111 11111110 Q ss_pred ----C---------------CceeeEEeeecccccCceeEEeccccEEEEeccCCCCcc-cCchHHHHHHHHHHHHHHHH Q lcl|NC_021305. 145 ----S---------------RTGRYEYYFQAGAGVGTQLVSFADDEVVPIRFFNPDGLE-RGLSLMESLKSTIFSEDSSR 204 (518) Q Consensus 145 ----~---------------~~~~~~~~~~~~~~~~~~~~~~~~~evih~~~~~~~~~~-~G~s~l~~~~~~i~~~~~~~ 204 (518) . .....+-.|...+...-....+|-+.++|.|...+.... -|.|....+...|....... T Consensus 198 ~~~~~~r~~~~~~lsp~i~~~~y~~~~~~~~~~~~~~r~i~LP~e~t~~lr~~tl~rnqrlG~s~vtp~l~dI~hk~klr 277 (525) T protein:vir:10 198 EMSELERKLTFENLSPLITENKYKKWKEYNGENEDALRYIMLPISKTLVARIHTLSRNQRLGIPYGTQTLFDIQHKQKLR 277 (525) T ss_pred hhhHHHHHHHHHhhchhhhhhhhhHHhhcccccchhheeeecccceeEEeeecccccCcccCcchhhhHHHHHHHHHHHH Confidence 0 000000011111112234567888999999977664333 38888888888787777777 Q ss_pred HHHHHHHHccCCcccccccCcc------CCHHHHHHHHHHHHHHhcC-ccccCCeeec--CCCccee--eccC--ChhhH Q lcl|NC_021305. 205 NATAAMWKNAGRPNLVLRHEKR------LSEAAQQRLREQFDRAHSG-SSNTGKTMVV--EEGMEPI--PLQL--TAVEM 271 (518) Q Consensus 205 ~~~~~~~~ng~~p~~il~~~~~------~~~~~~~~~~~~~~~~~~g-~~n~g~~~vl--~~g~~~~--~l~~--~~~d~ 271 (518) +...+....=..+-.+|++.+. +.+...+++-+..+.+... .+...+++++ |.=++++ .+.. ..-|. 
T Consensus 278 d~EqsIA~kii~a~avLk~gg~~gn~mk~p~~~kqkil~gVk~aleK~~kdK~Gi~vi~~Pdfa~~efp~ik~~~~glDg 357 (525) T protein:vir:10 278 DLEQSIADKIIKAMAVLKFRGKDDNDSKVKESAKRKVLAGVKRALEKGVKDKNGIACIAMPDFATFEFPEIKNGDKTLDP 357 (525) T ss_pred HHHHHHHHHhhhhheeeeeccccCccccCchHHHHHHHHHHHHHHhcccccccCeEEEeccceeecccccccCcccCCCc Confidence 7777777766777788877543 2333445555555444422 3333455553 3322332 2221 11222 Q ss_pred HHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhh---hhcccccceecch Q lcl|NC_021305. 272 QFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQ---YWVRKNRMKFDID 348 (518) Q Consensus 272 ~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~---~~~~~~~~~fd~~ 348 (518) + --+...++|-.|+|++.++++. .++||+++.-....||.. +.-+++.|++..+ +|+. ....+.-+-|+++ T Consensus 358 ~---K~d~I~~DI~~A~GlS~sL~nG-dggNyAtaslnld~fykk-igVm~e~Iee~y~-kL~d~Vl~~~k~~nyifnyd 431 (525) T protein:vir:10 358 K---KYDSIDNDITNATGISQVLTNG-TKGNYASAKLNLDVFYKK-IGVMLEIIEEIYN-QLIDIILGEEKGCNYIFQYN 431 (525) T ss_pred h---hhhhhhhhhhhhhccceeeecC-CCCceeeeeeeHHHHHHH-HHHHHHHHHHHHH-HHHhhhcCcccCcceEEecC Confidence 2 2334467899999999999874 556888877777777764 5556777774444 4332 1223333446666 Q ss_pred hhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC----------CCcc-eeeecccccccccccccCCCCCCCCC Q lcl|NC_021305. 349 DVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDD----------PKAD-ELYANSALQPLGATPDGAVEWEEAPA 417 (518) Q Consensus 349 ~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~----------~~gD-~~~~~~n~~~~~~~~~~~~~~~~~~~ 417 (518) .-...+.+++.+.+-++...||. .--+....|+.--+. -..+ ....|.+...+....+ . . 
T Consensus 432 kd~pi~~kkk~d~LIkL~d~g~s-~k~vldl~gis~e~y~E~s~yEtE~lkl~EKi~pp~~~~v~SGk~~-----n---~ 502 (525) T protein:vir:10 432 KDTPIEREKKLDTLIKLEAQGYS-AKYVLDILGISSEEYFEESIYEIEKLKLREKIMPPLNTNVLSGKDG-----N---D 502 (525) T ss_pred CCchhhhhhhhhhhhhhhccchh-hhhhhhhhccCcchHHHHHHHHHHHHHHhhhccccccceeeecccc-----c---c Confidence 65667788888888888888874 223333444321110 0011 1222222221111000 0 0 Q ss_pred CCCCccCCCCCCCccccCCccccccchhcchh Q lcl|NC_021305. 418 PKRPASTPVASLDQSPPTSVPGLSPTNSDRST 449 (518) Q Consensus 418 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (518) -+.|.. + +...+++ ..+ +..|+. T Consensus 503 iG~P~~----d-d~~~~da--ti~--s~~~~~ 525 (525) T protein:vir:10 503 IGSPKL----D-DSDSSDA--TIE--SKERGV 525 (525) T ss_pred ccCCcc----C-CCcchhh--hhh--hhhcCC Confidence 000000 0 0000000 000 000110 No 220 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=96.87 E-value=0.00029 Score=40.00 Aligned_cols=403 Identities=9% Similarity=0.053 Sum_probs=175.2 Q ss_pred CcCCC-------------CC---CCCcccccccchhhh-----hhhcccccccccccc-------cchhhhHHHhhcHHH Q lcl|NC_021305. 1 MLLAN-------------GQ---TLSAPAMAELSPQMQ-----DSYYYAPAVGMQLER-------QFSLYGGIYKNQPWV 52 (518) Q Consensus 1 ~~f~~-------------~~---~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~v 52 (518) =||++ +. ++.+|...+.+..+. ...++.-..-..... ....++++ +.+|.| T Consensus 5 ~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~m-a~~pEv 83 (516) T protein:vir:10 5 DLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQL-INNPEV 83 (516) T ss_pred HhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHH-hhccch Confidence 25665 11 122222221111110 011111100011111 12234444 778999 Q ss_pred HHHHHHHHHhhccC-----ceEEEEecCCccee---ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEc- Q lcl|NC_021305. 
53 RTVIAKRAQALARL-----PVKCMFTSGDTETE---ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKN- 123 (518) Q Consensus 53 ~~~v~~ia~~ia~l-----~~~v~~~~~~~~~~---~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~- 123 (518) ..||+.|.+.+.-+ |+.+--++-+-... .-...+..++.--+-...++ .+++.|.+.|..|+.++-+ T Consensus 84 d~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~----~~fR~WYVDgRi~fhKiid~ 159 (516) T protein:vir:10 84 ERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLD----TLFRRWYVDSRIFFHKIMPN 159 (516) T ss_pred hhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhh----HHHhhhhhcceEEEEEEecC Confidence 99999999987543 22222111110011 00111111222122233444 4466788999999986654 Q ss_pred CCCceEEEEeeCCceeEEEEcC-----Ccee------eEEeeecc---------cccCceeEEeccccEEEEe--ccCCC Q lcl|NC_021305. 124 KSGTPEKLMPMHPSRVAIKRNS-----RTGR------YEYYFQAG---------AGVGTQLVSFADDEVVPIR--FFNPD 181 (518) Q Consensus 124 ~~G~~~~l~~l~p~~v~v~~~~-----~~~~------~~~~~~~~---------~~~~~~~~~~~~~evih~~--~~~~~ 181 (518) ....+.+|+.|+|..+..+..- .+.. .+|.|... ....+..+.++.+-|.|.. ....+ T Consensus 160 ~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL~d~~ 239 (516) T protein:vir:10 160 PKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYASSGLMDCS 239 (516) T ss_pred ccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhheeeecccceeCC Confidence 3445899999999998765432 1111 11211110 0112234566666665555 22333 Q ss_pred CcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHHHHHhcC----ccccCC----- Q lcl|NC_021305. 182 GLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSG----SSNTGK----- 251 (518) Q Consensus 182 ~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~~~~~~g----~~n~g~----- 251 (518) +... +|-|..|...+.....++....-+--..+.-+=|+..+ +++.+...++.-..+...++. ..+.|. 
T Consensus 240 ~~~i-~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddr 318 (516) T protein:vir:10 240 DRGI-IGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQK 318 (516) T ss_pred CCce-eeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccch Confidence 3333 78888888888777777766666544445555454443 455544444433333332221 111222 Q ss_pred -ee-ec----------CCCcceeeccC--ChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccccc--CCHHHHHHH--H Q lcl|NC_021305. 252 -TM-VV----------EEGMEPIPLQL--TAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATF--SNISAQMRA--F 313 (518) Q Consensus 252 -~~-vl----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--sn~e~~~~~--~ 313 (518) .+ +| ..|.+++.|.. +..++ +-..+..+.+..+++||.+-|+.-...+. +...+-.+. - T Consensus 319 k~msMlEDyWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiK 395 (516) T protein:vir:10 319 RNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDM---DDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELD 395 (516) T ss_pred hhhhhHhhhcccccCCCCccceeeccccCCcChH---HHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHH Confidence 11 11 13566666653 33343 34456678899999999998865433221 222222222 1 Q ss_pred HHHHhhHHH----HHHHHHHHHhhh-----hhhc--c-cccceecc--hh----hhhcC-HHHHHHHHHHHH--hCCCcC Q lcl|NC_021305. 314 YRDTMAIPI----ARIQSAMDKYVG-----QYWV--R-KNRMKFDI--DD----VIQPD-WEAKSESTQKMV--NSGVAT 372 (518) Q Consensus 314 ~~~~l~P~~----~~ie~~l~~~l~-----~~~~--~-~~~~~fd~--~~----l~~~d-~~~~~~~~~~~~--~~G~~T 372 (518) +...|.-+- ..+.+.|...|+ ++.+ . ...++|++ +. +.... 
+..|+.++..+- -..+++ T Consensus 396 F~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s 475 (516) T protein:vir:10 396 FRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVS 475 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 222233333 334444444433 2111 1 12333433 21 11111 233444444432 235777 Q ss_pred HHHHHH-HhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCc Q lcl|NC_021305. 373 PNEGRE-IMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQ 431 (518) Q Consensus 373 ~NE~R~-~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (518) .+=+|+ .+.+...+-..-++ .+... ....-.+.+.+.. +- T Consensus 476 ~~yi~k~ILr~tDeei~~e~k--------~I~~E----~~~~~~~~p~~~~-------~f 516 (516) T protein:vir:10 476 HDYVMKNILQMTEEQIAQEEK--------QIEQE----AGIKRFQNPENED-------DF 516 (516) T ss_pred hHHHHHHHhcCCHhhHHHHHH--------HHHHh----hhCCCCCCCCccc-------cC Confidence 777775 45543221000000 00000 0000001111111 11 No 221 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=96.87 E-value=0.00029 Score=40.00 Aligned_cols=403 Identities=9% Similarity=0.053 Sum_probs=175.2 Q ss_pred CcCCC-------------CC---CCCcccccccchhhh-----hhhcccccccccccc-------cchhhhHHHhhcHHH Q lcl|NC_021305. 1 MLLAN-------------GQ---TLSAPAMAELSPQMQ-----DSYYYAPAVGMQLER-------QFSLYGGIYKNQPWV 52 (518) Q Consensus 1 ~~f~~-------------~~---~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~v 52 (518) =||++ +. ++.+|...+.+..+. ...++.-..-..... 
....++++ +.+|.| T Consensus 5 ~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~m-a~~pEv 83 (516) T protein:vir:10 5 DLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQL-INNPEV 83 (516) T ss_pred HhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHH-hhccch Confidence 25665 11 122222221111110 011111100011111 12234444 778999 Q ss_pred HHHHHHHHHhhccC-----ceEEEEecCCccee---ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEc- Q lcl|NC_021305. 53 RTVIAKRAQALARL-----PVKCMFTSGDTETE---ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKN- 123 (518) Q Consensus 53 ~~~v~~ia~~ia~l-----~~~v~~~~~~~~~~---~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~- 123 (518) ..||+.|.+.+.-+ |+.+--++-+-... .-...+..++.--+-...++ .+++.|.+.|..|+.++-+ T Consensus 84 d~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~----~~fR~WYVDgRi~fhKiid~ 159 (516) T protein:vir:10 84 ERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLD----TLFRRWYVDSRIFFHKIMPN 159 (516) T ss_pred hhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhh----HHHhhhhhcceEEEEEEecC Confidence 99999999987543 22222111110011 00111111222122233444 4466788999999986654 Q ss_pred CCCceEEEEeeCCceeEEEEcC-----Ccee------eEEeeecc---------cccCceeEEeccccEEEEe--ccCCC Q lcl|NC_021305. 124 KSGTPEKLMPMHPSRVAIKRNS-----RTGR------YEYYFQAG---------AGVGTQLVSFADDEVVPIR--FFNPD 181 (518) Q Consensus 124 ~~G~~~~l~~l~p~~v~v~~~~-----~~~~------~~~~~~~~---------~~~~~~~~~~~~~evih~~--~~~~~ 181 (518) ....+.+|+.|+|..+..+..- .+.. .+|.|... ....+..+.++.+-|.|.. 
....+ T Consensus 160 ~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL~d~~ 239 (516) T protein:vir:10 160 PKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYASSGLMDCS 239 (516) T ss_pred ccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhheeeecccceeCC Confidence 3445899999999998765432 1111 11211110 0112234566666665555 22333 Q ss_pred CcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHHHHHhcC----ccccCC----- Q lcl|NC_021305. 182 GLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSG----SSNTGK----- 251 (518) Q Consensus 182 ~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~~~~~~g----~~n~g~----- 251 (518) +... +|-|..|...+.....++....-+--..+.-+=|+..+ +++.+...++.-..+...++. ..+.|. T Consensus 240 ~~~i-~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddr 318 (516) T protein:vir:10 240 DRGI-IGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQK 318 (516) T ss_pred CCce-eeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccch Confidence 3333 78888888888777777766666544445555454443 455544444433333332221 111222 Q ss_pred -ee-ec----------CCCcceeeccC--ChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccccc--CCHHHHHHH--H Q lcl|NC_021305. 252 -TM-VV----------EEGMEPIPLQL--TAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATF--SNISAQMRA--F 313 (518) Q Consensus 252 -~~-vl----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--sn~e~~~~~--~ 313 (518) .+ +| ..|.+++.|.. +..++ +-..+..+.+..+++||.+-|+.-...+. +...+-.+. 
- T Consensus 319 k~msMlEDyWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiK 395 (516) T protein:vir:10 319 RNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDM---DDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELD 395 (516) T ss_pred hhhhhHhhhcccccCCCCccceeeccccCCcChH---HHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHH Confidence 11 11 13566666653 33343 34456678899999999998865433221 222222222 1 Q ss_pred HHHHhhHHH----HHHHHHHHHhhh-----hhhc--c-cccceecc--hh----hhhcC-HHHHHHHHHHHH--hCCCcC Q lcl|NC_021305. 314 YRDTMAIPI----ARIQSAMDKYVG-----QYWV--R-KNRMKFDI--DD----VIQPD-WEAKSESTQKMV--NSGVAT 372 (518) Q Consensus 314 ~~~~l~P~~----~~ie~~l~~~l~-----~~~~--~-~~~~~fd~--~~----l~~~d-~~~~~~~~~~~~--~~G~~T 372 (518) +...|.-+- ..+.+.|...|+ ++.+ . ...++|++ +. +.... +..|+.++..+- -..+++ T Consensus 396 F~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s 475 (516) T protein:vir:10 396 FRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVS 475 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 222233333 334444444433 2111 1 12333433 21 11111 233444444432 235777 Q ss_pred HHHHHH-HhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCc Q lcl|NC_021305. 373 PNEGRE-IMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQ 431 (518) Q Consensus 373 ~NE~R~-~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (518) .+=+|+ .+.+...+-..-++ .+... ....-.+.+.+.. 
+- T Consensus 476 ~~yi~k~ILr~tDeei~~e~k--------~I~~E----~~~~~~~~p~~~~-------~f 516 (516) T protein:vir:10 476 HDYVMKNILQMTEEQIAQEEK--------QIEQE----AGIKRFQNPENED-------DF 516 (516) T ss_pred hHHHHHHHhcCCHhhHHHHHH--------HHHHh----hhCCCCCCCCccc-------cC Confidence 777775 45543221000000 00000 0000001111111 11 No 222 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=96.84 E-value=0.0003 Score=39.86 Aligned_cols=405 Identities=9% Similarity=0.027 Sum_probs=174.6 Q ss_pred CcCCCCCCCCcccccccchhhhh-----hhcccc---cccc----c-ccccchhhhHHHhhcHHHHHHHHHHHHhhccC- Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQD-----SYYYAP---AVGM----Q-LERQFSLYGGIYKNQPWVRTVIAKRAQALARL- 66 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~-----~~~~~~---~~~~----~-~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l- 66 (518) -|-....++.+|...+.+..+.. .+++.. .++. + .......++++ +.+|.|..||+.|.+.+.-+ T Consensus 27 ~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e~~~~~~~eLI~~YR~m-a~~pEvd~Av~eIVneaIv~~ 105 (524) T protein:vir:98 27 QLKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQFYSGQDPAIQNKEQLINTYRGI-MSYPEVENAVSEIIDDAIVNE 105 (524) T ss_pred hhcCCcccccCCCCCCCceeecCCCCcceecceeeeeccccccccchHHHHHHHHHHH-hhccchhhHHHhhhcceeEec Confidence 22223333333333322222210 011100 0000 0 11112234444 77899999999999987432 Q ss_pred ----ceEEEEecCCccee---ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCC--ceEEEEeeCCc Q lcl|NC_021305. 67 ----PVKCMFTSGDTETE---ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSG--TPEKLMPMHPS 137 (518) Q Consensus 67 ----~~~v~~~~~~~~~~---~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G--~~~~l~~l~p~ 137 (518) |+.|--.+-+-... .-...+..++.--+-...++ .+++.|.+.|..|+.++-+.+. .+.+|+.|+|. 
T Consensus 106 ~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~----~~fR~WYVDgRi~fhkiid~~~~kGI~ELr~lDPr 181 (524) T protein:vir:98 106 QGKDIITMDLAKTNFSKAIQDKIVEEFDNVLNIYDFDNMGA----RLFRDWYVDSRIYFHKIMHKDESKGIRELRQLDPR 181 (524) T ss_pred CCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhh----HHHhhhhhcceeEEEEEEcCCCCcceeeeeeeCCc Confidence 22222111110011 00111111222122233344 4466789999999999965443 38999999999 Q ss_pred eeEEEEc------CCcee------eEEeeeccc---------ccCceeEEeccccEEEEeccCCCCcccCchHHHHHHHH Q lcl|NC_021305. 138 RVAIKRN------SRTGR------YEYYFQAGA---------GVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKST 196 (518) Q Consensus 138 ~v~v~~~------~~~~~------~~~~~~~~~---------~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~ 196 (518) .++.+.. ..+.. .+|.|.... ...+..+.++.+.|.|....-.+...-=+|-|..|... T Consensus 182 ~i~~vr~~~~~~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAIvy~hSGL~d~~~~iisyLhkAiKp 261 (524) T protein:vir:98 182 CMELIRESITETLDGGVKVFRGYREFFVYSAPKAGYTYNGQIYQANQKIKIPRSAIVYAHSGLEDCSNNIIGYLHRAVKP 261 (524) T ss_pred cceeeeeccccccccchhhccceeeeeeeccCCCccccccceecCCCceeechhheeeeccCcccCCCCeeeehhHhhHh Confidence 9976541 11111 122221100 01233477888888887633222111125778888888 Q ss_pred HHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHHHHHhc---------C-ccccCCee-ec--------- Q lcl|NC_021305. 
197 IFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHS---------G-SSNTGKTM-VV--------- 255 (518) Q Consensus 197 i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~~~~~~---------g-~~n~g~~~-vl--------- 255 (518) +.....++....-+--..+.-+=|+..+ +++.+...++.-..+...++ | ..+..+.+ +| T Consensus 262 ~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGevrddrk~msMlEDyWLpRRe 341 (524) T protein:vir:98 262 ANQLRLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKNRVVYDARTGTVKNQQNNLSMTEDYWLMRRD 341 (524) T ss_pred HHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccCceeeccccccchhhhhcccccC Confidence 8777777766665544445555455443 55555555544433333332 1 11111111 22 Q ss_pred -CCCcceeeccC--ChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc-ccccCCHHHH------HHHHHHHHhhHHHHHH Q lcl|NC_021305. 256 -EEGMEPIPLQL--TAVEMQFIEARQLNREEVCGVYDIAPPIVHILD-RATFSNISAQ------MRAFYRDTMAIPIARI 325 (518) Q Consensus 256 -~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~~sn~e~~------~~~~~~~~l~P~~~~i 325 (518) ..|.+++.|.. +..++ +-..+..+.+..+++||.+-|+..+ .-+.+-..+- ..-|+..--.-+...+ T Consensus 342 GgrgTEItTLpggqnlgem---~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf 418 (524) T protein:vir:98 342 GKAITEVSTLPGGQNFSDM---DDIKWFNRKLYEALRVPLSRMPRDDGGMQIGGGGEITRDELKFSKFIRTLQIQFSPVL 418 (524) T ss_pred CCCccceeeccccCCcChH---HHHHHHHHHHHHHhCCCceeccCCCCccccccccchhHHHHHHHHHHHHHHHHHHHHH Confidence 13566666653 33343 3445667889999999999886432 1122111111 1223332233333344 Q ss_pred HHHHHHhhh-----hhhcc---cccceecc--hh----hhhcC-HHHHHHHHHHHHh--CCCcCHHHHHH-HhCCCCCCC Q lcl|NC_021305. 326 QSAMDKYVG-----QYWVR---KNRMKFDI--DD----VIQPD-WEAKSESTQKMVN--SGVATPNEGRE-IMGLPRSDD 387 (518) Q Consensus 326 e~~l~~~l~-----~~~~~---~~~~~fd~--~~----l~~~d-~~~~~~~~~~~~~--~G~~T~NE~R~-~~g~~p~~~ 387 (518) .+.|...|+ ++.+. ...++|++ +. +.... +..|+.++..+-. 
.-+++.+=+|+ .+.+...+ T Consensus 419 ~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDee- 497 (524) T protein:vir:98 419 SDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGVVGKYVSHKYIMKEILRMSDED- 497 (524) T ss_pred HHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhccccccccchHHHHHHHhccCHHH- Confidence 444444443 22111 12333433 21 11111 2233444443322 22566666665 44432111 Q ss_pred CCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccc Q lcl|NC_021305. 388 PKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSP 433 (518) Q Consensus 388 ~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (518) - ....++..++... +--.++..+.+.- T Consensus 498 -i-------------~~~~k~I~~E~k~-----~~~~~p~~e~~~f 524 (524) T protein:vir:98 498 -I-------------DEQAKLIEEESKE-----ERFKNPEAEEENF 524 (524) T ss_pred -H-------------HHHHHHHHHHHhC-----CCCcCCccccccC Confidence 0 0000000000000 0000111111111 No 223 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=96.57 E-value=0.0005 Score=38.67 Aligned_cols=413 Identities=10% Similarity=-0.012 Sum_probs=155.3 Q ss_pred cCCCCC-CC--Cccc---ccccchhhhhhhccccccc---c----cccccchhhhHHHhh----cHHHHHHHHHHHHhhc Q lcl|NC_021305. 2 LLANGQ-TL--SAPA---MAELSPQMQDSYYYAPAVG---M----QLERQFSLYGGIYKN----QPWVRTVIAKRAQALA 64 (518) Q Consensus 2 ~f~~~~-~~--~~~~---~~~~~~~~~~~~~~~~~~~---~----~~~~~~~~~~~~~~~----~~~v~~~v~~ia~~ia 64 (518) |-.+.. ++ +-|. ...-...+++..++....- . ............++. .+++...++.++..+. 
T Consensus 1 m~~~~~~~v~~~h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vf 80 (513) T protein:vir:97 1 MADKDPKSPATTSGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQTLDTLSGKPF 80 (513) T ss_pred CCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhhhhh Confidence 222211 11 1111 1112223445554432211 0 000111111122222 3444555555555444 Q ss_pred cCceEEEEecCCcceeccchHHHHHHhcCC-cCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCce--------------- Q lcl|NC_021305. 65 RLPVKCMFTSGDTETEESDTGYAKLLADPC-EYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTP--------------- 128 (518) Q Consensus 65 ~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN-~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~--------------- 128 (518) +-|..+- ..........|+.... ...+..+|.+.++...+.+|.+++++.....+.+ T Consensus 81 ~k~p~~~-------~~~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~~~~ 153 (513) T protein:vir:97 81 SEPIKLN-------EDVPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLADDRREG 153 (513) T ss_pred hcCcccC-------cCchHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHHHHHhhc Confidence 4444321 0011222223555444 3568999999999999999999999976543311 Q ss_pred --EEEEeeCCceeE----------------------EEEcCCceeeEEeeeccccc--------------CceeEE---- Q lcl|NC_021305. 129 --EKLMPMHPSRVA----------------------IKRNSRTGRYEYYFQAGAGV--------------GTQLVS---- 166 (518) Q Consensus 129 --~~l~~l~p~~v~----------------------v~~~~~~~~~~~~~~~~~~~--------------~~~~~~---- 166 (518) -.+..+.|..|. ...+..+......|..-... ....+. T Consensus 154 ~rPy~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~~~~~~~~~~e~~~~~~g 233 (513) T protein:vir:97 154 LRPYWVMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEPVKKSNAQKEEWALADEW 233 (513) T ss_pred cCceEEEecHhhhcCcceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEEEeCceEEEEEeecCCCccccceEEecCC Confidence 112233332221 11111111111111000000 000000 Q ss_pred ---eccccEEEEeccCCCCcccCchHHHHHHHH-HHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHH Q lcl|NC_021305. 
167 ---FADDEVVPIRFFNPDGLERGLSLMESLKST-IFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRA 242 (518) Q Consensus 167 ---~~~~evih~~~~~~~~~~~G~s~l~~~~~~-i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~ 242 (518) +..=.++.+- ...++...|.||+..+... +.+......+....+ ..+.|-.++.- ++++.. +.. T Consensus 234 ~~~l~~IP~v~~~-~~~~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~-~~~~P~l~~~G---~~~~~~-------~~i 301 (513) T protein:vir:97 234 ATGLNYVPLVTFY-ADRQGFMMGKPPLLDLAHLNVAHWQSASDQRHILT-VSRFPILACSG---ASGEDS-------DPV 301 (513) T ss_pred CCcCCceeEEEEe-cCCCCCCCCccchHHHHHHHHHHHhhhhhHHHHHH-hcccceeeeec---CCcCCC-------Cce Confidence 0000112221 1223445688888776654 444444444444443 44566666642 111110 012 Q ss_pred hcCccccCCeeecCC-Cc--ceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHH--HHHHHHH Q lcl|NC_021305. 243 HSGSSNTGKTMVVEE-GM--EPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM--RAFYRDT 317 (518) Q Consensus 243 ~~g~~n~g~~~vl~~-g~--~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~--~~~~~~~ 317 (518) .-|+ ..++.+++ |. .|.+.+.+.-... .+..+...+++ +..|. .++... .++ .+.++.. ..-.... T Consensus 302 ~iG~---~~~~~lpe~~~~~~yie~~g~~i~~~-~~~l~~le~qm-~~~Ga--~ll~~~-~~~-~Ta~a~~~~~~~~~S~ 372 (513) T protein:vir:97 302 VVGP---NKVLYNPDPAGRFYYVEHTGQAIAAG-RTDLKDLEEQM-AGYGA--EFLKRK-TGG-QTATARALDSAEATSD 372 (513) T ss_pred Eeec---cccccCCCCCCcceeeccCchhHHHH-HHHHHHHHHHH-HHHHH--HhhccC-Ccc-ccHHHHHHHHHHHHHH Confidence 2333 34566664 44 4555554433322 23333333333 22332 233221 122 2333332 2334456 Q ss_pred hhHHHHHHHHHHHHhhhh--hhc----ccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhC---C-CCCCC Q lcl|NC_021305. 318 MAIPIARIQSAMDKYVGQ--YWV----RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMG---L-PRSDD 387 (518) Q Consensus 318 l~P~~~~ie~~l~~~l~~--~~~----~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g---~-~p~~~ 387 (518) |.-++.+++++++..|-- .+. ....++++.+.....-....++++-+++..|.+|....++.+- . 
+|-.+ T Consensus 373 L~~~a~~le~al~~~l~~~a~wlg~~~~~~~v~in~dF~~~~~~~~~~~al~~a~~~G~is~~t~~~~L~r~gvl~~d~d 452 (513) T protein:vir:97 373 LSAMTGLFEDALAQALDITADWLRLGPNGGTVELVKDYDLEEMDAPGLQALQVAREKRDISRKTYLNGLRLRGVLPEDFD 452 (513) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCCccEEEeccccCcccCCHHHHHHHHHHHhCCCCCHHHHHHHHHhccCCCccCC Confidence 777888888888876531 111 1123444333333332344567777888999998877766552 2 11100 Q ss_pred C---Ccceeeecc-cc--cccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhc Q lcl|NC_021305. 388 P---KADELYANS-AL--QPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSD 446 (518) Q Consensus 388 ~---~gD~~~~~~-n~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (518) + +.++.-... .. .-++..+..+.+++.....++ .+.+..+..+.++.+..+..+. T Consensus 453 ~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~ 513 (513) T protein:vir:97 453 EDEDWEELMEEISEAMGRAGLDLDPAQKNPPEGGEGEGE----GEGEGGEGGEGGEGGGNPGGES 513 (513) T ss_pred HHHHHHHHHHhhhhccCCCCccccccCCCCCCCCCCCCC----CCCCCCCCCCccccCCCCCCCC Confidence 0 000000000 00 000000000000000000000 0000111111111111110000 No 224 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=96.53 E-value=0.00054 Score=38.51 Aligned_cols=412 Identities=11% Similarity=-0.008 Sum_probs=180.3 Q ss_pred CcCCCCCC-CCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHH--------------HHHHHhhcc Q lcl|NC_021305. 1 MLLANGQT-LSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVI--------------AKRAQALAR 65 (518) Q Consensus 1 ~~f~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v--------------~~ia~~ia~ 65 (518) |=...++= +++|- +..+.-|... -.+.-.-+. 
..+-++.+.|.++..=...| ...+..+.+ T Consensus 1 ~~~~~~~~~~~~~~-~~g~~~~p~~--v~~~d~~Rl-~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~~~ 76 (527) T protein:vir:10 1 MGQDKRQYGSTQQL-RAGEANFPNA--VTDFDKARL-ASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKLIEA 76 (527) T ss_pred CCccccccCCCcCc-CCccccCccc--CCHHHHHHH-HHHHHHHHHhcCchhheeeecCCccccccceeeehhhHHhhCC Confidence 55554332 22222 2111111000 000000000 00011111111110000000 000000100 Q ss_pred CceEEEEecCC----cceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcC---CCceEEEEeeCCce Q lcl|NC_021305. 66 LPVKCMFTSGD----TETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK---SGTPEKLMPMHPSR 138 (518) Q Consensus 66 l~~~v~~~~~~----~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~---~G~~~~l~~l~p~~ 138 (518) +-++.-.+.+ +.-+.-...+..+..+ .++.....+.-.+.++.|.+.+.+.+|. .|.-+.+..++|.. T Consensus 77 -~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~----e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~ 151 (527) T protein:vir:10 77 -KMRFLGQGLKWEFSKKDAKVDDAIRVLFDR----ENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPST 151 (527) T ss_pred -cceeeccCccccccchhHHHHHHHHHHHHH----hhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcce Confidence 0011000000 0001112223333333 4455566677778888999999999884 34457788888877 Q ss_pred eEEEEcCCceeeEEeeecc--------cc----------------------cCce------------------------- Q lcl|NC_021305. 139 VAIKRNSRTGRYEYYFQAG--------AG----------------------VGTQ------------------------- 163 (518) Q Consensus 139 v~v~~~~~~~~~~~~~~~~--------~~----------------------~~~~------------------------- 163 (518) +....+.++......++.. .. .++. T Consensus 152 ~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~~~~ 231 (527) T protein:vir:10 152 YFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLEPDD 231 (527) T ss_pred eeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccchhh Confidence 7766665443221111000 00 0000 Q ss_pred --------eEE-----eccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHH Q lcl|NC_021305. 
164 --------LVS-----FADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEA 230 (518) Q Consensus 164 --------~~~-----~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~ 230 (518) .+. +.-=.|+||+...+.+..+|.|-|+-+...+........-......=+|.|-.+++.-...+ T Consensus 232 ~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd-- 309 (527) T protein:vir:10 232 IKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRD-- 309 (527) T ss_pred hhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeeccccccc-- Confidence 000 00114578877777777899999987777766665444444444444666655553322111 Q ss_pred HHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHH Q lcl|NC_021305. 231 AQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM 310 (518) Q Consensus 231 ~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~ 310 (518) . +-+.... .-..|.++=|+++.++..+...+.-..+....+.+.+.|+..=++|.+-+|..+.++..+.- T Consensus 310 -~---~G~~~~~---~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~--- 379 (527) T protein:vir:10 310 -S---RGNMVPW---TISPLGMVEHGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAVAESGI--- 379 (527) T ss_pred -c---cCCcCcc---ccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHH--- Confidence 0 0000000 01234455588888998888766666677778888899999999999999866654422221 Q ss_pred HHHHHHHhhHHHHHHH----------HHHHH-----hh-------hhhhcccccceecchhhhhcCHHHHHHHHHHHHhC Q lcl|NC_021305. 
311 RAFYRDTMAIPIARIQ----------SAMDK-----YV-------GQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNS 368 (518) Q Consensus 311 ~~~~~~~l~P~~~~ie----------~~l~~-----~l-------~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~ 368 (518) -+.-.+.|++...+ ..+.+ +| +...+..+.+++.+...+..|.++..+...+++.+ T Consensus 380 --ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~a 457 (527) T protein:vir:10 380 --ALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNNEKRFAQLLELWEA 457 (527) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHc Confidence 12223334333211 11111 11 11112223456777888999999999999999999 Q ss_pred CCcCHHHHHHHhCCCC-CCCCCcceeeec--ccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCcc Q lcl|NC_021305. 369 GVATPNEGREIMGLPR-SDDPKADELYAN--SALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVP 438 (518) Q Consensus 369 G~~T~NE~R~~~g~~p-~~~~~gD~~~~~--~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (518) |+++.--+-++++--. ++++..+.--+- ...+.+..+......+..+ .+...-+..+.++..|+-.. T Consensus 458 GiiS~etAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~a~~~~~a~~---~~~~g~~~~~~d~~~~~~~~ 527 (527) T protein:vir:10 458 GLIPAKKLTEELSKIMGFELTEEDFRQATEDKKTQGIAQAEAADPFGAQM---AAEQGIPDEEDDQALNGQPL 527 (527) T ss_pred CchhHHHHHHHHHhccCCCchHHHHHHHHHHHHHHhHHhhhhcCchhhhh---ccccCCCCCCcccccCCCCC Confidence 9999999988872110 233333311000 0000000000000000000 00000000000110010000 No 225 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=96.53 E-value=0.00054 Score=38.49 Aligned_cols=426 Identities=10% Similarity=0.080 Sum_probs=170.4 Q ss_pred CcCCCCCCC-----Ccccc--cc---cchhhhhhhcccccccccccccchhhh------HHHhhcHHHHHHHHHHHHhhc Q lcl|NC_021305. 1 MLLANGQTL-----SAPAM--AE---LSPQMQDSYYYAPAVGMQLERQFSLYG------GIYKNQPWVRTVIAKRAQALA 64 (518) Q Consensus 1 ~~f~~~~~~-----~~~~~--~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~v~~~v~~ia~~ia 64 (518) -|||+.=.. 
..|+. .. ....+...-.++.. ..+...+.... ...+.+|.|..||+.|.+.+. T Consensus 3 ~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~i~~~~~~~~~--~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneai 80 (533) T protein:vir:10 3 QLFGFSLERAKKAPKGPSFVQKDNLDGSQPVSGGGYYGYT--VDFDGQVRNEYQLISRYREMVLQPECDSAVDDIVNETI 80 (533) T ss_pred cccccccccccccccCCCCCCCCcccccceeeccccccee--eecccccchHHHHHHHHHHHhhccchhhHHHHhhccee Confidence 577753221 11111 11 11111110000111 11111222111 224678999999999999875 Q ss_pred cC-----ceEEEEecCCcce---eccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcC---CCceEEEEe Q lcl|NC_021305. 65 RL-----PVKCMFTSGDTET---EESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK---SGTPEKLMP 133 (518) Q Consensus 65 ~l-----~~~v~~~~~~~~~---~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~---~G~~~~l~~ 133 (518) -+ |+.|--++-+-.+ ..-...+..++.-=+-...++ .+++.|.+.|..|+.++-+. ...+.+|+. T Consensus 81 v~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~----e~fR~WYVDgRi~fHkiid~~~pk~GI~ELr~ 156 (533) T protein:vir:10 81 CGNFDDVPVSVELSNLKVSDKIKKLIREEFGEILRLLDFENRSY----EIFRRWYVDGRLFYHKVIDPDNPQGGLIELRY 156 (533) T ss_pred eecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhh----HHHhhhhhcceEEEEEEecCCCccccceeeee Confidence 43 2222111100000 000111111222122233344 44667889999999987653 335899999 Q ss_pred eCCceeEEEEcC-----Cce-------------eeEEeeecc--cccCceeEEeccccEEEEeccCCC-CcccCchHHHH Q lcl|NC_021305. 134 MHPSRVAIKRNS-----RTG-------------RYEYYFQAG--AGVGTQLVSFADDEVVPIRFFNPD-GLERGLSLMES 192 (518) Q Consensus 134 l~p~~v~v~~~~-----~~~-------------~~~~~~~~~--~~~~~~~~~~~~~evih~~~~~~~-~~~~G~s~l~~ 192 (518) |+|..++.+... ++. .-+|.|... ...++..+.++.+-|.+.+..-.+ +...-+|-|.. 
T Consensus 157 lDPr~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~g~~~~~~~~vkI~~dAI~y~hSGl~d~~~~~i~syLhk 236 (533) T protein:vir:10 157 IDPRKIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLYDPKGLKNSTTQGLKIAPDSICYVHSGIMDLNKNMTLSHLHK 236 (533) T ss_pred ccccceeeeeeeeccCCCccceeecchhhhccceeeeeeccccccccCCCceecchhheeeeeccceeCCCCceeccchH Confidence 999998864422 111 111112111 011233455666544444321111 11123688888 Q ss_pred HHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHHHHHhcC----ccccCCe------e-ec----- Q lcl|NC_021305. 193 LKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSG----SSNTGKT------M-VV----- 255 (518) Q Consensus 193 ~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~~~~~~g----~~n~g~~------~-vl----- 255 (518) +...+.....++....-+--..+.-+=|+..+ +++.+...++....+...|+. ...+|.+ + +| T Consensus 237 AiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWL 316 (533) T protein:vir:10 237 AIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWL 316 (533) T ss_pred hHHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhcc Confidence 88888777777766665544445555454443 455554444433333333321 0111211 1 11 Q ss_pred -----CCCcceeeccC--ChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHH------HHHHHhhHHH Q lcl|NC_021305. 256 -----EEGMEPIPLQL--TAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRA------FYRDTMAIPI 322 (518) Q Consensus 256 -----~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~------~~~~~l~P~~ 322 (518) ..|.+++.|.. +..++ +-..+..+.+..+++||.+-|+....-+.+...+-.+. |+..--.-+. 
T Consensus 317 PRReGgrgTEItTLpGgqnLgem---~DV~YF~kKLY~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs 393 (533) T protein:vir:10 317 PRREGGRGTEITTLPGGQNLGEL---EDVKYFQKKLYKSLNVPGSRLETETTFNVGRAAEITRDEVKFQKFVARLRKRFS 393 (533) T ss_pred cccCCCCccceeeccccCCcChH---HHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHH Confidence 13566666643 33343 44456678899999999998865433333332222222 2222222333 Q ss_pred HHHHHHHHHhhh-----hhhc--c-cccceecc--hh----hhhcC-HHHHHHHHHHH--HhCCCcCHHHHHH-HhCCCC Q lcl|NC_021305. 323 ARIQSAMDKYVG-----QYWV--R-KNRMKFDI--DD----VIQPD-WEAKSESTQKM--VNSGVATPNEGRE-IMGLPR 384 (518) Q Consensus 323 ~~ie~~l~~~l~-----~~~~--~-~~~~~fd~--~~----l~~~d-~~~~~~~~~~~--~~~G~~T~NE~R~-~~g~~p 384 (518) ..+.+.|...|+ ++.+ . ...++|++ +. +.... +..|+.++..+ +-.-++|.+=+|+ .+.+.. T Consensus 394 ~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tD 473 (533) T protein:vir:10 394 ELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQTD 473 (533) T ss_pred HHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCH Confidence 334444444432 1111 1 12333433 21 11111 22334444333 1122446666654 333321 Q ss_pred CC---------CCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHH Q lcl|NC_021305. 385 SD---------DPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTE 455 (518) Q Consensus 385 ~~---------~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (518) .+ .+-.+-++.... ...+.. .+...|..+.. +.++.+|....+.. +++ T Consensus 474 eei~~~~kqI~~E~k~~~~~~p~-~~~~~~-----~~~~~~~~~~~-----~~~~~~~~~~~~~~------------~~~ 530 (533) T protein:vir:10 474 VEMKEIDKQIESEMESGIIADPA-AEMDPA-----MAAGDPDAGGA-----PAEEVAPEGPDPSD------------ERK 530 (533) T ss_pred HHHHHHHHHHHHHHhCCCCCCCc-chhhHH-----hcCCCCCcCCc-----ccccCCCCCCCcch------------hhc Confidence 11 000000000000 000000 00000000000 00000010011111 111 Q ss_pred HHH Q lcl|NC_021305. 
456 PRR 458 (518) Q Consensus 456 ~~~ 458 (518) ++- T Consensus 531 ~~~ 533 (533) T protein:vir:10 531 AEF 533 (533) T ss_pred cCC Confidence 100 No 226 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=96.50 E-value=0.00057 Score=38.38 Aligned_cols=379 Identities=9% Similarity=-0.004 Sum_probs=159.7 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) =.+.+.......... ... .+....... ..........-..++....+|+..+.-+-+-|+.+--.++ T Consensus 29 ~Yy~g~hdi~~~~~~-~~~-----~~~~~~~~~--~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~~~~~----- 95 (471) T protein:vir:10 29 KYYRNENDIKRKRKP-ADK-----KGAENEAKA--EDNAFRNADNRISHNWHQLLLDQKKAYALTYPPTFDVDDK----- 95 (471) T ss_pred HHhccccccccccch-hhh-----hcccccccc--cccccccccceeccchhHHHHHhhhhhhcccCceeccCCh----- Confidence 111111111111000 000 000000000 0000000011123445667778878777777877632111 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcC-CCceEEEEeeCCceeEEEEcCCce--eeE-Eeee- Q lcl|NC_021305. 81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK-SGTPEKLMPMHPSRVAIKRNSRTG--RYE-YYFQ- 155 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~-~G~~~~l~~l~p~~v~v~~~~~~~--~~~-~~~~- 155 (518) .....+..++. | ........+..+++.+|.+|.++.++. +|. ..+..++|..+.+..+.... ... 
..++ T Consensus 96 ~~~~~l~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~-~~~~~~~p~~~~~i~d~~~~~~~~~~ir~~~ 169 (471) T protein:vir:10 96 KVNDMIVDVLG--D---DYERISKQLCVNAGNAGIAWLHVWKDASDNS-FRYACVDSKEVIPIYSKSLDKKSIGVLRVYS 169 (471) T ss_pred HHHHHHHHHHh--c---CHHHHHHHHHHHHhhCCeEEEEEEeeCCCCe-eEEEEEcccceEEEEcCCCCCceEEEEEEEE Confidence 11122222222 2 344556667889999999999998875 465 56778899998888765431 111 1111 Q ss_pred cccccCc----eeEEeccccEEEEeccC-------------------------------CC---------CcccCchHHH Q lcl|NC_021305. 156 AGAGVGT----QLVSFADDEVVPIRFFN-------------------------------PD---------GLERGLSLME 191 (518) Q Consensus 156 ~~~~~~~----~~~~~~~~evih~~~~~-------------------------------~~---------~~~~G~s~l~ 191 (518) .....++ ....+..+.+.|++... +- +...|.|-+. T Consensus 170 ~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~sd~e 249 (471) T protein:vir:10 170 SIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPFKNNEIETNDLK 249 (471) T ss_pred eeccCCCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCCceeEEEeccCCCCCCchH Confidence 1100111 11112333344432110 00 0124667777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecC-----CCcceeeccC Q lcl|NC_021305. 192 SLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVE-----EGMEPIPLQL 266 (518) Q Consensus 192 ~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~-----~g~~~~~l~~ 266 (518) .....+.....+..-..+.+...+.|-.+++-..... .+.+...+.. ++++.++ .+.++..+.. T Consensus 250 ~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~---~~~~~~~~~~--------~~~i~~~~~~~~~~~~~~~l~~ 318 (471) T protein:vir:10 250 PIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQD---KQEFLEDLKR--------YKMIKMDNDGMGDQSGVTTIAI 318 (471) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccc---cchhHHHhhc--------CCeEEecCCCCccCccceEEee Confidence 6666666666555445555555555544443321111 1111111111 1222221 2223333443 Q ss_pred ChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHH----------HHHHHHHHhhHHHHHHHHHHHHhhhhh Q lcl|NC_021305. 
267 TAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQ----------MRAFYRDTMAIPIARIQSAMDKYVGQY 336 (518) Q Consensus 267 ~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~----------~~~~~~~~l~P~~~~ie~~l~~~l~~~ 336 (518) +.....+....+.+.+.|...-++|..-....++.++...+.. ....+...+.-.++.+... +.. T Consensus 319 ~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~-----~~~ 393 (471) T protein:vir:10 319 DIPTEARNLILERTKKQIFISGQGVNPETDKLGNSSGVALKFLYSLLELKAGNMETQFRSGYATLVKMILKH-----LGL 393 (471) T ss_pred cCChHHHHHHHHHHHHHHHHHhCCcCCCcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hcc Confidence 3344456677788888888888888432211111111111111 1111111222222222111 111 Q ss_pred hcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCC Q lcl|NC_021305. 337 WVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAP 416 (518) Q Consensus 337 ~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~ 416 (518) .....+++.+...+..|..+.++.+.++ .|++|...++++++. ++++. .+ +..+. ........ T Consensus 394 -~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~~p~--v~D~~-~E------~eri~---~E~~~~~~-- 456 (471) T protein:vir:10 394 -SDKLKIKQTWTRNSINNDTEMAQVVSTL--ATITSRENVAKSNPI--VEDWQ-DE------LRLQK---AEQEGRSE-- 456 (471) T ss_pred -CCCceeEEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCC--CCCHH-HH------HHHHH---HHHHHHHh-- Confidence 1123456777888899999999999887 588998888888754 22211 00 11111 10000000 Q ss_pred CCCCCccCCCCCCCccccCCccccc Q lcl|NC_021305. 417 APKRPASTPVASLDQSPPTSVPGLS 441 (518) Q Consensus 417 ~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (518) ........+++. +. 
+ T Consensus 457 ---~~~~~~~~~~~~-----e~--~ 471 (471) T protein:vir:10 457 ---KLYDMEEVEHES-----EV--E 471 (471) T ss_pred ---cccccCCCCCcc-----cc--C Confidence 000000000000 00 0 No 227 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=96.49 E-value=0.00058 Score=38.34 Aligned_cols=412 Identities=11% Similarity=-0.007 Sum_probs=180.3 Q ss_pred CcCCCCCC-CCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHH--------------HHHHHhhcc Q lcl|NC_021305. 1 MLLANGQT-LSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVI--------------AKRAQALAR 65 (518) Q Consensus 1 ~~f~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v--------------~~ia~~ia~ 65 (518) |=...++= +++|- +..+.-|... -.+.-.-+. ..+-++.+.|.++..=...| ...+..+.+ T Consensus 1 ~~~~~~~~~~~~~~-~~g~~~~p~~--v~~~d~~Rl-~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~~~~~~~ 76 (527) T protein:vir:10 1 MGQDKRQYGSTQQL-RAGEANFPNA--VTDFDKARL-ASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNGEKLIEA 76 (527) T ss_pred CCccccccCCCcCc-CCccccCccc--CCHHHHHHH-HHHHHHHHHhcCchhheeeecCCccccccceeeehhhHHhhCC Confidence 55554332 22222 2111111000 000000000 00011111111110000000 000000100 Q ss_pred CceEEEEecCC----cceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcC---CCceEEEEeeCCce Q lcl|NC_021305. 66 LPVKCMFTSGD----TETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK---SGTPEKLMPMHPSR 138 (518) Q Consensus 66 l~~~v~~~~~~----~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~---~G~~~~l~~l~p~~ 138 (518) +-++.-.+.+ +.-+.-...+..+..+ .++.....+.-.+.++.|.+.+.+.+|. .|.-+.+..++|.. 
T Consensus 77 -~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~----e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~ 151 (527) T protein:vir:10 77 -KMRFLGQGLKWEFSKKDAKVDDAIKVLFDR----ENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPST 151 (527) T ss_pred -cceeeccCccccccchhHHHHHHHHHHHHH----hhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcce Confidence 0011000000 0001112223333333 4455566677778888999999999884 34457788888877 Q ss_pred eEEEEcCCceeeEEeeecc--------cc----------------------cCce------------------------- Q lcl|NC_021305. 139 VAIKRNSRTGRYEYYFQAG--------AG----------------------VGTQ------------------------- 163 (518) Q Consensus 139 v~v~~~~~~~~~~~~~~~~--------~~----------------------~~~~------------------------- 163 (518) +....+.++......++.. .. .++. T Consensus 152 ~f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~~~~ 231 (527) T protein:vir:10 152 YFPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLEPDD 231 (527) T ss_pred eeeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccchhh Confidence 7766665443221111000 00 0000 Q ss_pred --------eEE-----eccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHH Q lcl|NC_021305. 164 --------LVS-----FADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEA 230 (518) Q Consensus 164 --------~~~-----~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~ 230 (518) .+. +.-=.|+||+...+.+..+|.|-|+-+...+........-......=+|.|-.+++.-...+ T Consensus 232 ~~~~~~~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd-- 309 (527) T protein:vir:10 232 IKKLSTLTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRD-- 309 (527) T ss_pred hhhhcCceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeeccccccc-- Confidence 000 00114578877777777899999987777766665444444444444666655553322111 Q ss_pred HHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHH Q lcl|NC_021305. 
231 AQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM 310 (518) Q Consensus 231 ~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~ 310 (518) . +-+.... .-..|.++=|+++.++..+...+.-..+......+.+.|+..=++|.+-+|..+.++..+.- T Consensus 310 -~---~G~~~~~---~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~--- 379 (527) T protein:vir:10 310 -S---RGNMVPW---TISPLGMVEHGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAVAESGI--- 379 (527) T ss_pred -c---cCCcCcc---ccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHH--- Confidence 0 0000000 01234455588888998888766656677778888899999999999999866654422221 Q ss_pred HHHHHHHhhHHHHHHH----------HHHHH-----hh-------hhhhcccccceecchhhhhcCHHHHHHHHHHHHhC Q lcl|NC_021305. 311 RAFYRDTMAIPIARIQ----------SAMDK-----YV-------GQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNS 368 (518) Q Consensus 311 ~~~~~~~l~P~~~~ie----------~~l~~-----~l-------~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~ 368 (518) -+.-.+.|++...+ ..+.+ +| +...+..+.+++.+...+..|.++..+...+++.+ T Consensus 380 --ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~a 457 (527) T protein:vir:10 380 --ALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNSEKRFNQLLQLWEA 457 (527) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHc Confidence 12223334333211 11111 11 11112223456777888999999999999999999 Q ss_pred CCcCHHHHHHHhCCCC-CCCCCcce--eeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCcc Q lcl|NC_021305. 369 GVATPNEGREIMGLPR-SDDPKADE--LYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVP 438 (518) Q Consensus 369 G~~T~NE~R~~~g~~p-~~~~~gD~--~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (518) |+++.--+-++++--. ++++..+. +..-...+.+..+......+..+ .+...-+..+.++..|+-.. 
T Consensus 458 Gi~S~~tAv~~L~~~~g~eD~E~E~~~I~~era~~a~a~a~A~~~~~a~~---~~~~g~~~~~~d~~~~~~~~ 527 (527) T protein:vir:10 458 GLIPAKKLTEELSKIMGFELTEEDFKQATEDKKTQGIAQAEAADPFGAQM---AAEQGIPDEEDDQALNGQPL 527 (527) T ss_pred CchhHHHHHHHHHhccCCCChHHHHHHHHHHHHHHhHHhhhhcCchhhhh---ccccCCCCCCcccccCCCCC Confidence 9999999988872110 23333331 10000000000000000000000 00000000000110010000 No 228 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=96.45 E-value=0.00061 Score=38.19 Aligned_cols=399 Identities=11% Similarity=0.053 Sum_probs=156.8 Q ss_pred CcCCCCCCC-------------Ccccccccchhhhhhhcccccccccccccch-hhhHHHhhcHHHHHHHHHHHHhhccC Q lcl|NC_021305. 1 MLLANGQTL-------------SAPAMAELSPQMQDSYYYAPAVGMQLERQFS-LYGGIYKNQPWVRTVIAKRAQALARL 66 (518) Q Consensus 1 ~~f~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~v~~~v~~ia~~ia~l 66 (518) +=+...-++ ..+....+..+.. |.......+...... .... ...+....+|+..+.-+-.- T Consensus 9 ~~~~~~~~~~~~~~~i~~~~~~~~~r~~~~~~yy~---g~~~i~~~~~~~~~~~~~~k--i~~n~~~~iv~~~~~~l~g~ 83 (489) T protein:vir:99 9 IDYESKLWIDQLKNYISRFKAEQLERLKELKRYYL---GDNNIKYRPAKTDKYAADNR--IASDFAKYITVFEQGYMLGV 83 (489) T ss_pred eCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhc---ccCccccccccccccCCcce--eecchHHHHHHHHhhhhccC Confidence 111110000 0000000000000 000000000000000 0001 22455677888888877777 Q ss_pred ceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEE----cCCCceEEEEeeCCceeEEE Q lcl|NC_021305. 67 PVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQK----NKSGTPEKLMPMHPSRVAIK 142 (518) Q Consensus 67 ~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r----~~~G~~~~l~~l~p~~v~v~ 142 (518) |+.+--.++ .....+..++.. -....+...+..+++.+|.+|..+.. +..| -..+..++|..+.+. 
T Consensus 84 ~~~~~~~d~-----~~~~~l~~~~~~----n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~-~~~i~~~~p~~~~~v 153 (489) T protein:vir:99 84 PVEYKNENK-----DLQAAIDLMSVR----NNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKT-EVKLYQLPAEQTFVI 153 (489) T ss_pred CceeecCCh-----hHHHHHHHHHhh----cChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCc-ceEEEEEcccceEEE Confidence 776532111 112233333333 24445667788899999999987754 2333 356778888888877 Q ss_pred EcCCc-e-eeEE-e-eecccccCc---eeEEecccc----------------------------EEEEeccCCCCcccCc Q lcl|NC_021305. 143 RNSRT-G-RYEY-Y-FQAGAGVGT---QLVSFADDE----------------------------VVPIRFFNPDGLERGL 187 (518) Q Consensus 143 ~~~~~-~-~~~~-~-~~~~~~~~~---~~~~~~~~e----------------------------vih~~~~~~~~~~~G~ 187 (518) .+... . .... . +......+. ....+.++. |+||+++ ..|. T Consensus 154 ~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~~~ 228 (489) T protein:vir:99 154 YDDTYQRNSLMAVHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGVPVNEYANN-----EERT 228 (489) T ss_pred EcCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEecCCCcccceecccccccCCceeEEEeecC-----CCCC Confidence 76432 1 1111 1 100000000 111122223 3333321 2366 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhc------CccccCCeeecCCC--- Q lcl|NC_021305. 188 SLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHS------GSSNTGKTMVVEEG--- 258 (518) Q Consensus 188 s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~------g~~n~g~~~vl~~g--- 258 (518) |.+..+...+.....+..-..+.....+.|-.+++- ..............+..... 
.....++++.++.+ T Consensus 229 s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 307 (489) T protein:vir:99 229 GAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAG-NAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNP 307 (489) T ss_pred CchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhcc-CCcccccchhhhhhcccccccccccccccccceeeeeccccCc Confidence 666665555555444433333333333333333321 11222222222222211110 11122334444332 Q ss_pred ----cceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHH-hccccccccCCHHHHH-------------HHHHHHHhhH Q lcl|NC_021305. 259 ----MEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPI-VHILDRATFSNISAQM-------------RAFYRDTMAI 320 (518) Q Consensus 259 ----~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~-lg~~~~~~~sn~e~~~-------------~~~~~~~l~P 320 (518) .+...+........+....+.+.+.|...-++|..- .+.. ++ .+..+.. ...+...+.- T Consensus 308 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~--~n-~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~ 384 (489) T protein:vir:99 308 NGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMKFS--GV-QSGESMKYKLMASDNYREKQERLFKKGLMR 384 (489) T ss_pred cccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCccccccccc--cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 223334444444455667778888898888888432 2221 12 1222111 1122222222 Q ss_pred HHHHHHHHHHHhhhhhhcc--cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccc Q lcl|NC_021305. 321 PIARIQSAMDKYVGQYWVR--KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSA 398 (518) Q Consensus 321 ~~~~ie~~l~~~l~~~~~~--~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n 398 (518) .+..+...+...-...... ...+.+.+..-+..|..+.++.+.++. |+++...+.++++.=.-+++. 
++ T Consensus 385 ~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~--giis~et~~~~l~~v~~~d~~-~E------ 455 (489) T protein:vir:99 385 RLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLY--GIVSDQTIFEILNTVTGVDAE-AE------ 455 (489) T ss_pred HHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhcCCCCchhHH-HH------ Confidence 3322222221110000000 113556667778889999999999874 889988888876431111110 11 Q ss_pred ccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCcc Q lcl|NC_021305. 399 LQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVP 438 (518) Q Consensus 399 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (518) +..+... +....... ++.......+++++.+..| T Consensus 456 ~~ri~~E---~~~~~~~~---~~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 456 LKRLKEE---ADKKQSLP---EPRLVGDASGQEEPTAEKP 489 (489) T ss_pred HHHHHHH---HHHHhccc---cccccCCCCCCcCCCCCCC Confidence 1111110 00000000 0000000001111111111 No 229 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=96.43 E-value=0.00063 Score=38.14 Aligned_cols=425 Identities=11% Similarity=0.079 Sum_probs=171.6 Q ss_pred CcCCCCCCCCcccccccch----------hh-hhhhccccccccccccc-------chhhhHHHhhcHHHHHHHHHHHHh Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSP----------QM-QDSYYYAPAVGMQLERQ-------FSLYGGIYKNQPWVRTVIAKRAQA 62 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~----------~~-~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~v~~~v~~ia~~ 62 (518) -|||+.=-.........|+ .+ ...+++.. ..+... ...|+ ..+.+|.|..||+.|.+. 
T Consensus 4 ~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~g~~---~~~e~~~~~~~eLI~~YR-~ma~~pEvd~Av~eIVne 79 (537) T protein:vir:10 4 QLFGFSLQRAKKVPKGPSFVQKDSLDGSQPIVGGGYFGYS---VDFDGTIRNDHELITRYR-EMVLNPECDSAVDDVVNE 79 (537) T ss_pred ccccceeecccccccCCcccCCCcccccceeecccccccc---cccccccchHHHHHHHHH-HHhhccchhhHHHHhhcc Confidence 5777432111110000111 11 11111111 111111 11222 246789999999999998 Q ss_pred hccC-----ceEEEEecCCccee---ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCC---CceEEE Q lcl|NC_021305. 63 LARL-----PVKCMFTSGDTETE---ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS---GTPEKL 131 (518) Q Consensus 63 ia~l-----~~~v~~~~~~~~~~---~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~---G~~~~l 131 (518) +.-+ |+.+--++-+..+. .-...+..++.-=+-...++ .+++.|.+.|..|+.++-|.. ..+.+| T Consensus 80 aiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~----e~fR~WYVDgRi~fhKiid~k~pk~GI~EL 155 (537) T protein:vir:10 80 TICGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILRLLDFDNRAY----EIFRRWYVDGRLFFHKVIDPKKPRQGLVEL 155 (537) T ss_pred eeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhh----HHHhhheeeeEEEEEEEEeCCCccccceee Confidence 7543 22222111111111 11111111222122233344 446678999999999876533 358999 Q ss_pred EeeCCceeEEEEcC-----Ccee-------e------EEeeecc--cccCceeEEeccccEEEEe--ccCCCCcccCchH Q lcl|NC_021305. 132 MPMHPSRVAIKRNS-----RTGR-------Y------EYYFQAG--AGVGTQLVSFADDEVVPIR--FFNPDGLERGLSL 189 (518) Q Consensus 132 ~~l~p~~v~v~~~~-----~~~~-------~------~~~~~~~--~~~~~~~~~~~~~evih~~--~~~~~~~~~G~s~ 189 (518) +.|+|..++.+..- .+.. + +|.|... ....+..+.++.+-|.+.. 
....++ .+.+|- T Consensus 156 r~lDPr~i~~vR~i~~~~~~~~~~~~~~~~v~~~~~eyf~ynp~g~~~~~~~~vkI~~dAI~y~hSGl~d~n~-~~i~sy 234 (537) T protein:vir:10 156 RYVDPRKIRKVTEYEAKRPEALRTQDLNQQLTQQSASYFLYNPKGLKNSTNQGMKIAPDSIAYCHSGIQDLNK-NMVLSH 234 (537) T ss_pred eeeCCccceeeEeecccCCccceEEecceeeeecccceeeeccccccccCCCceeccHhheeeecccceeCCC-Ceeeee Confidence 99999998654431 1111 0 1111110 0112334556665444433 122233 357888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHHHHHhcC----ccccCCe------e-ec-- Q lcl|NC_021305. 190 MESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSG----SSNTGKT------M-VV-- 255 (518) Q Consensus 190 l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~~~~~~g----~~n~g~~------~-vl-- 255 (518) |..|...+.....++....-+--..+.-+=|+..+ +++.+...++....+...|+. ...+|.+ + +| T Consensus 235 LhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlED 314 (537) T protein:vir:10 235 LHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLED 314 (537) T ss_pred ehhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhhhh Confidence 99999888877777776666644455555455443 455554444433333333321 0111211 1 11 Q ss_pred --------CCCcceeeccC--ChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHH------HHHHHhh Q lcl|NC_021305. 256 --------EEGMEPIPLQL--TAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRA------FYRDTMA 319 (518) Q Consensus 256 --------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~------~~~~~l~ 319 (518) ..|.+++.|.. +..++ +-..+..+.+..+++||.+-|.....-+.+...+-.+. |+..--. 
T Consensus 315 yWLPRReGgrgTEItTLpGgqnlgem---~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~ 391 (537) T protein:vir:10 315 FWLPRREGGRGTEISTLPGGQNLGEL---EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRK 391 (537) T ss_pred hcccccCCCcccceeeccccCCcChH---HHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHH Confidence 13566666643 33343 44456678899999999998865443333333222222 2222222 Q ss_pred HHHHHHHHHHHHhhh-----hhhc--c-cccceecc--hh----hhhcC-HHHHHHHHHHHH--hCCCcCHHHHHH-HhC Q lcl|NC_021305. 320 IPIARIQSAMDKYVG-----QYWV--R-KNRMKFDI--DD----VIQPD-WEAKSESTQKMV--NSGVATPNEGRE-IMG 381 (518) Q Consensus 320 P~~~~ie~~l~~~l~-----~~~~--~-~~~~~fd~--~~----l~~~d-~~~~~~~~~~~~--~~G~~T~NE~R~-~~g 381 (518) -+...+.+.|...|+ ++.+ . ...++|++ +. +.... +..|+.++..+- -.-+++.+=+|+ .+. T Consensus 392 rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr 471 (537) T protein:vir:10 392 RFSELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLK 471 (537) T ss_pred HHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhc Confidence 333334444444432 2111 1 12333433 21 11111 223344433321 112345555543 333 Q ss_pred CCCC---------CCCCcceeee-cccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCcccc Q lcl|NC_021305. 382 LPRS---------DDPKADELYA-NSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGL 440 (518) Q Consensus 382 ~~p~---------~~~~gD~~~~-~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (518) +... +.+-.+-++. |.....++. +..+..+.+..+.....+++....+..+...+. 
T Consensus 472 ~tDeeI~~~~k~I~~E~k~~~~~~p~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 537 (537) T protein:vir:10 472 QTESEIKEIDKEIKQEIADGVIMDPQAMQAMEM---GIGDEEPVPEGGEEPQTDPNSAVSPADQKRGEL 537 (537) T ss_pred cCHHHHHHHHHHHHHHhhCCCCCCccccccccc---CCCCcccCCCCCCCcccCCccCCCCCCccCCCC Confidence 3211 0000000110 001000000 000000000000000000000000000011111 No 230 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=95.80 E-value=0.0014 Score=36.17 Aligned_cols=404 Identities=8% Similarity=0.022 Sum_probs=169.0 Q ss_pred CcCCCC------------CCCCcccccccchhhhhhhcccc------ccc--ccccc-------cchhhhHHHhhcHHHH Q lcl|NC_021305. 1 MLLANG------------QTLSAPAMAELSPQMQDSYYYAP------AVG--MQLER-------QFSLYGGIYKNQPWVR 53 (518) Q Consensus 1 ~~f~~~------------~~~~~~~~~~~~~~~~~~~~~~~------~~~--~~~~~-------~~~~~~~~~~~~~~v~ 53 (518) -||+.. .++.+|...+...-+ +....++ ..+ ..+.. ....++++ +.+|.|. T Consensus 10 ~~~~~~~~~~~~~~~~~~~s~~~P~~~dGa~~i-~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~YR~m-a~~pEvd 87 (521) T protein:vir:81 10 RWADFDNDKYEEQIKDKAESIAAPKNNDGATEV-EINDNLPASAWNSLTQQFYSTDQKISTTKQLVNTYRGL-MNNHEVE 87 (521) T ss_pred hhcCchhhhHHhhhccCccccccCCCCCCceEe-cccCCCcceeecceeeeecccccchhhHHHHHHHHHHH-hhccchh Confidence 123321 112222221111111 0100100 001 01111 12233443 7789999 Q ss_pred HHHHHHHHhhccC-----ceEEEEecCCccee---ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcC- Q lcl|NC_021305. 54 TVIAKRAQALARL-----PVKCMFTSGDTETE---ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK- 124 (518) Q Consensus 54 ~~v~~ia~~ia~l-----~~~v~~~~~~~~~~---~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~- 124 (518) .||+.|.+.+.-+ |+.+--++-+-... .-...+..++.-=+-...++ .+++.|.+.|..|+.++-+. 
T Consensus 88 ~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~----~~fR~WYVDgRi~fhkiid~~ 163 (521) T protein:vir:81 88 NAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQ----DMFRRWYVDSRIFFHKIIGKN 163 (521) T ss_pred hHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhh----HHHhhhhhcceEEEEEEEcCC Confidence 9999999987543 23222111111111 00111111222122233344 44667899999999998553 Q ss_pred -CCceEEEEeeCCceeEEEEcCC-----c------eeeEEeeecc---------cccCceeEEeccccEEEEeccCCC-C Q lcl|NC_021305. 125 -SGTPEKLMPMHPSRVAIKRNSR-----T------GRYEYYFQAG---------AGVGTQLVSFADDEVVPIRFFNPD-G 182 (518) Q Consensus 125 -~G~~~~l~~l~p~~v~v~~~~~-----~------~~~~~~~~~~---------~~~~~~~~~~~~~evih~~~~~~~-~ 182 (518) ...+.+|+.|+|..+..+.... + ...+|.|... ....+..+.++.+-|.+.+..-.+ + T Consensus 164 pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~hSGl~d~~ 243 (521) T protein:vir:81 164 PKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYAHSGLMDCD 243 (521) T ss_pred ccccceeeeeeCCcceeeeeeecccccCccceecceeeeeeeecCCccccccceeecCCcceeechhheeeeeccceeCC Confidence 3458999999999987554321 1 1112222111 012233455555555444321111 1 Q ss_pred cccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHHHHHhcC----ccccC------C Q lcl|NC_021305. 183 LERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSG----SSNTG------K 251 (518) Q Consensus 183 ~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~~~~~~g----~~n~g------~ 251 (518) ...-+|-|..|...+.....++....-+--..+.-+=|+..+ +++.+...++.-..+...++. 
....| + T Consensus 244 ~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk 323 (521) T protein:vir:81 244 DKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQA 323 (521) T ss_pred CCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccccccccccc Confidence 112367888888888777777766666544445555454443 455555444443333333322 11112 1 Q ss_pred ee-ec----------CCCcceeeccC--ChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccc--cCCHHHHH------ Q lcl|NC_021305. 252 TM-VV----------EEGMEPIPLQL--TAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRAT--FSNISAQM------ 310 (518) Q Consensus 252 ~~-vl----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~--~sn~e~~~------ 310 (518) .+ +| ..|.+++.|.. +..++ +-..+..+.+..+++||.+-|+.-..+. .+...+-. T Consensus 324 ~msMlEDyWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF 400 (521) T protein:vir:81 324 NLSMTEDYWLQRRDGKAITDVTTLPGASGMSDI---DDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEF 400 (521) T ss_pred ccchhhhhcccccCCCcccceeecccCCCCChH---HHHHHHHHHHHHHhCCccccccCCCCcceeccccchhhHHHHHH Confidence 11 22 13566666643 44444 3445667889999999999885433222 11222221 Q ss_pred HHHHHHHhhHHHHHHHHHHHHhhh-----hhhcc---cccceecc--hh----hhhcC-HHHHHHHHHHHHh--CCCcCH Q lcl|NC_021305. 311 RAFYRDTMAIPIARIQSAMDKYVG-----QYWVR---KNRMKFDI--DD----VIQPD-WEAKSESTQKMVN--SGVATP 373 (518) Q Consensus 311 ~~~~~~~l~P~~~~ie~~l~~~l~-----~~~~~---~~~~~fd~--~~----l~~~d-~~~~~~~~~~~~~--~G~~T~ 373 (518) .-|+..--.-+...+.+.|...|+ ++.+. ...++|++ +. +.... +..|+.++..+-. .-+++. 
T Consensus 401 ~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~ 480 (521) T protein:vir:81 401 SKFIRTRQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSN 480 (521) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhccccch Confidence 223322233333344444444443 22111 12333433 21 11111 2233444443321 124466 Q ss_pred HHHHH-HhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccc Q lcl|NC_021305. 374 NEGRE-IMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSP 433 (518) Q Consensus 374 NE~R~-~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (518) +=+|+ .+.+...+-..-| .+..++...+-- .++.++.++- T Consensus 481 dyi~k~ILr~tDeei~~~~---------------k~I~~E~~~~~~-----~~p~~~~~~f 521 (521) T protein:vir:81 481 QTVMRDILKYTDDQMDTEK---------------KQIEEEANDPRF-----KQTPDEIEDF 521 (521) T ss_pred HHHHHHHhccCHHHHHHHH---------------HHHHHHhhCCCC-----CCCcccccCC Confidence 66664 4443221100000 000000000000 0001111111 No 231 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=95.70 E-value=0.0016 Score=35.92 Aligned_cols=405 Identities=10% Similarity=0.035 Sum_probs=171.7 Q ss_pred CcCCCC----------------CCCCcccccccchhhhhhhccccccc------ccccc-------cchhhhHHHhhcHH Q lcl|NC_021305. 1 MLLANG----------------QTLSAPAMAELSPQMQDSYYYAPAVG------MQLER-------QFSLYGGIYKNQPW 51 (518) Q Consensus 1 ~~f~~~----------------~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~-------~~~~~~~~~~~~~~ 51 (518) =||++- +++..|...+.+..+..........+ ..... ....++++ +.+|. 
T Consensus 9 ~lf~f~~~~de~~~~~~~~~~~~S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~m-a~~pE 87 (524) T protein:vir:10 9 SFLKPWANEDEKEYKQQINNNLESVTAPKLDDGAREIETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTYRNL-MNNYE 87 (524) T ss_pred HHhhhhhcchhhhhhhhhccCCCccccCCCCCCceeeccCcccccchhhhhhhhhcccchhhhHHHHHHHHHHH-hhccc Confidence 234431 11111211111111111000000000 00111 12233443 77899 Q ss_pred HHHHHHHHHHhhccC-----ceEEEEecCCccee---ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEc Q lcl|NC_021305. 52 VRTVIAKRAQALARL-----PVKCMFTSGDTETE---ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKN 123 (518) Q Consensus 52 v~~~v~~ia~~ia~l-----~~~v~~~~~~~~~~---~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~ 123 (518) |..||+.|.+.+.-+ |+.|--++-+-... .-...+..++.-=+-...++ .+++.|.+.|..|+.++-+ T Consensus 88 vd~Av~eIVneaiv~d~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~ll~F~~~~~----~~fR~WYVDgRi~fHkiid 163 (524) T protein:vir:10 88 VDNAVQEIVSDAIVYEDDKEVVALNLDGTDFSQSIKDKILAEFSEVLNLLNFQRKGT----DHFQRWYVDSRIFFHKIIN 163 (524) T ss_pred hhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhh----HHHhhheeeceEEEEEEee Confidence 999999999987543 23322111111111 00111111222122233344 4466789999999998765 Q ss_pred C---CCceEEEEeeCCceeEEEEcC----Cce-------eeEEeeecc---------cccCceeEEeccccEEEEeccCC Q lcl|NC_021305. 124 K---SGTPEKLMPMHPSRVAIKRNS----RTG-------RYEYYFQAG---------AGVGTQLVSFADDEVVPIRFFNP 180 (518) Q Consensus 124 ~---~G~~~~l~~l~p~~v~v~~~~----~~~-------~~~~~~~~~---------~~~~~~~~~~~~~evih~~~~~~ 180 (518) . ...+.+|+.|+|..++.+... .++ ..+|.|... ....+..+.++.+.|.|....-. 
T Consensus 164 ~~~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~~~~~~~~~~~~~~ikI~~dAIvy~~SGL~ 243 (524) T protein:vir:10 164 PKKMKDGVQELRRLDPRQVQYIREIVTRMEDGVKIVDGYREFFVYDTGHESYCADGRIYSAGTKVKIPRAAVVYAHSGLL 243 (524) T ss_pred CCCccccceeeeeeCCccceeeeeecccCcccchhhcchhhheeecCCCcccccCcceecCCcceecchhheeeeccCcc Confidence 3 235899999999998653321 111 111222111 11334567788888888764322 Q ss_pred CC-cccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHHHHHhcC----ccccC---- Q lcl|NC_021305. 181 DG-LERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSG----SSNTG---- 250 (518) Q Consensus 181 ~~-~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~~~~~~g----~~n~g---- 250 (518) +. ...-+|-|..|...+.....++....-+--..+.-+=|+..+ +++.+...++....+...++. ....| T Consensus 244 d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev~d 323 (524) T protein:vir:10 244 DCCGKNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTMKNRVVYDASTGKIKN 323 (524) T ss_pred cCCCCceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeccCCeecc Confidence 21 113467888888888777777766665544445555454443 455554444433333322211 00112 Q ss_pred --Cee-ec----------CCCcceeeccC--ChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc--cccCCHHHHHH-- Q lcl|NC_021305. 251 --KTM-VV----------EEGMEPIPLQL--TAVEMQFIEARQLNREEVCGVYDIAPPIVHILDR--ATFSNISAQMR-- 311 (518) Q Consensus 251 --~~~-vl----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~--~~~sn~e~~~~-- 311 (518) +.+ +| ..|.+++.|.. +..++ +-..+..+.+..+++||.+-|+.-.. 
-+.+...+-.+ T Consensus 324 drk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~e~~~~f~~gr~~EItRDE 400 (524) T protein:vir:10 324 QQHNMSMTEDYWLQRRDGKAVTEVDTMPGATGMSDM---DDVLYFRTALYRALRIPESRIPSESNSGVMFDAGTAITRDE 400 (524) T ss_pred chhhhhhHhhhcccccCCCCccceeeccccCCcChH---HHHHHHHHHHHHHhCCCchhccCCCCccccccccchhhHHH Confidence 111 11 13566666653 33343 34456678899999999998853222 12222222222 Q ss_pred ----HHHHHHhhHHHHHHHHHHHHhhh-----hhhc--c-cccceecc--hh----hhhcC-HHHHHHHHHHHHh--CCC Q lcl|NC_021305. 312 ----AFYRDTMAIPIARIQSAMDKYVG-----QYWV--R-KNRMKFDI--DD----VIQPD-WEAKSESTQKMVN--SGV 370 (518) Q Consensus 312 ----~~~~~~l~P~~~~ie~~l~~~l~-----~~~~--~-~~~~~fd~--~~----l~~~d-~~~~~~~~~~~~~--~G~ 370 (518) -|+..--.-+...+.+.|...|+ ++.+ . ...++|++ +. +.... +..|+.++..+-. .-+ T Consensus 401 iKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky 480 (524) T protein:vir:10 401 LKFAKWIRQLQNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLTMAEPFIGKY 480 (524) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 22222223333334444444443 2111 1 12333433 21 11111 2233444433321 124 Q ss_pred cCHHHHHH-HhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccc Q lcl|NC_021305. 371 ATPNEGRE-IMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSP 433 (518) Q Consensus 371 ~T~NE~R~-~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (518) ++.+=+|+ .+.+..-+-..-| .+...+.. 
++--.++.++.+.- T Consensus 481 ~s~~yi~k~ILr~tDeei~~~~---------------k~I~~E~k-----~~~~~~~~~~~~~f 524 (524) T protein:vir:10 481 ISHQTAMKDFLQMTDEEINQEA---------------KQIEEESK-----EARFQNPDEEEEDF 524 (524) T ss_pred chhHHHHHHHhccCHHHHHHHH---------------HHHHHHhh-----cCCCCCCChhhhcC Confidence 46666664 4443221100000 00000000 00000111111111 No 232 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=95.70 E-value=0.0012 Score=36.59 Aligned_cols=430 Identities=12% Similarity=0.002 Sum_probs=155.8 Q ss_pred CcCCCCCCCCcc------cccccchhhhhhhcc----ccccc-----ccccccchhh-hHHHhhcHHHHHHHHHHHHhh- Q lcl|NC_021305. 1 MLLANGQTLSAP------AMAELSPQMQDSYYY----APAVG-----MQLERQFSLY-GGIYKNQPWVRTVIAKRAQAL- 63 (518) Q Consensus 1 ~~f~~~~~~~~~------~~~~~~~~~~~~~~~----~~~~~-----~~~~~~~~~~-~~~~~~~~~v~~~v~~ia~~i- 63 (518) |.=...+...+- -....++|...+.-. .+..+ .+.+...... .......... .|++.+|..+ T Consensus 1 m~~d~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~~~dstg~-~a~~~LAs~l~ 79 (549) T protein:vir:10 1 MTNDDAKILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQKMFDSTAP-LALRNFVAAMD 79 (549) T ss_pred CCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCcccccccccccchHH-HHHHHHHHHHH Confidence 221110000000 000012332221111 11100 0000000000 0112223333 4555555554 Q ss_pred c-----cCceEEEEecCCcceec-c-chHHH----HHHhcCCc-CCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEE Q lcl|NC_021305. 64 A-----RLPVKCMFTSGDTETEE-S-DTGYA----KLLADPCE-YLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKL 131 (518) Q Consensus 64 a-----~l~~~v~~~~~~~~~~~-~-~~~~~----~L~~~PN~-~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l 131 (518) + .-||.=..-.+....+. . ..++. .+...-+. .-+++.-+..+..+++++|++.+++..+.. 
..+.+ T Consensus 80 ~~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~~~-~~~~f 158 (549) T protein:vir:10 80 SMITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDVG-KGIVY 158 (549) T ss_pred hhccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeecCC-CeeEE Confidence 2 23443222222111110 0 11111 11111111 123445566678899999999999876543 45566 Q ss_pred EeeCCceeEEEEcCCceeeEEeee-----------cc--c----------ccCcee------------------------ Q lcl|NC_021305. 132 MPMHPSRVAIKRNSRTGRYEYYFQ-----------AG--A----------GVGTQL------------------------ 164 (518) Q Consensus 132 ~~l~p~~v~v~~~~~~~~~~~~~~-----------~~--~----------~~~~~~------------------------ 164 (518) ..++-..+.+..+..|........ +. . ...... T Consensus 159 ~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~~V~pr~~~~~~~~~~~~~p 238 (549) T protein:vir:10 159 RNVPMQRLWFAENNSGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHAVEPRADRDPRKLDGRNMQ 238 (549) T ss_pred EEEEcCeEEEeeCCCCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEEEeecCCCCCccccccccCc Confidence 666666776766666543221100 00 0 000000 Q ss_pred ---E--E-----------eccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCC Q lcl|NC_021305. 165 ---V--S-----------FADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLS 228 (518) Q Consensus 165 ---~--~-----------~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~ 228 (518) + + |..-..+-.|....+|..||.||...++..+.......+.......-...|..++...+.++ T Consensus 239 f~sv~~e~~~~~il~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~ 318 (549) T protein:vir:10 239 FASYWLDEGRDRIVQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPPLLANEDGVLD 318 (549) T ss_pred eEEEEEEecCCEeeccCCcccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccc Confidence 0 0 01112233333444677899999999999999999999999888888888888876555544 Q ss_pred HHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHH-HHHHHHHHHHHHHHHhcCCHHHh-ccccccccCCH Q lcl|NC_021305. 
229 EAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQ-FIEARQLNREEVCGVYDIAPPIV-HILDRATFSNI 306 (518) Q Consensus 229 ~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~sn~ 306 (518) +... ..|..+.+ ...-.+...+.++.... +.+ ..+..+.....|..+|-+....+ -..+. -+. T Consensus 319 ~~~l----------~pgg~~~~-~~~~~~~~~~~pl~~~~-~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~~---~TA 383 (549) T protein:vir:10 319 GFDL----------RSGALNWG-GLNDKGEEMVKPLLTGK-QAQIGIEFAQDTRQTINQWFYVTLFQILVDSGD---MTA 383 (549) T ss_pred ccee----------ccCCcccc-ccCCCCccceeeecccc-chhHHHHHHHHHHHHHHHHHhhhhhhhhcCCCC---ccH Confidence 3221 12222111 01112334566665442 333 33456677888999998775322 12222 233 Q ss_pred HH--HHHHHHHHHhhHHHHHHHHHHHHhh-------------hhhhccc---cc--ceecc-hhhhhcCHHHHHHHHHHH Q lcl|NC_021305. 307 SA--QMRAFYRDTMAIPIARIQSAMDKYV-------------GQYWVRK---NR--MKFDI-DDVIQPDWEAKSESTQKM 365 (518) Q Consensus 307 e~--~~~~~~~~~l~P~~~~ie~~l~~~l-------------~~~~~~~---~~--~~fd~-~~l~~~d~~~~~~~~~~~ 365 (518) ++ ....-....|.|.+..+.++|-.-| +++.... .+ +++.+ +.|-+.-.......+... T Consensus 384 tEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~~~~~~~i~yis~La~aq~~~~~~~i~~~ 463 (549) T protein:vir:10 384 TEVLQRAQEKGVLLAPTLGRTQSELLGPMIAREVDILAEAGQLPDMPQELIDAGADVDVEYDSPLNKAMRAGEGAAILQW 463 (549) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhcCCceeEEEeecHHHHHHHHHHHHHHHHH Confidence 22 2233344556666666665544332 2221111 11 22221 112111001111111111 Q ss_pred HhC-CC---cCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccc Q lcl|NC_021305. 366 VNS-GV---ATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLS 441 (518) Q Consensus 366 ~~~-G~---~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (518) ++. |. +.| |+-..++.+.+-+.-++.+=+|.+++.- ..+.. 
T Consensus 464 ~~~~~~laq~~P-e~ld~id~d~~~~~~a~~~Gvp~~~irs----------------------------------~eev~ 508 (549) T protein:vir:10 464 LQQLGIVSQFDP-AAAKVPNGARIARLLADYGGVPVEAMST----------------------------------DEELQ 508 (549) T ss_pred HHHHHHHhccCh-hHHhcCCHHHHHHHHHHhcCCCccccCC----------------------------------HHHHH Confidence 110 00 111 1222222111100000000011110000 00000 Q ss_pred cchhcchhhHHHHHHHHhhcccCCchhhHHHHHHHHHhhccccCcCc Q lcl|NC_021305. 442 PTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAVKGAMGRGKDIKG 488 (518) Q Consensus 442 ~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (518) ...+. +.+.+....+.++. ..+...++.++.++.-.-++-- T Consensus 509 ~~r~~----~~~qqq~~~~~~~a--~~a~~~a~~~~~~~ta~~~~~~ 549 (549) T protein:vir:10 509 AQQAA----EAQAAQMQQMLAAA--PVAAGAIKDLSDAQTAAQTARV 549 (549) T ss_pred HHHHH----HHHHHHHHHHHHHH--HHHHHHHHhhhhhcCCCcccCC Confidence 00000 11111000000000 0000011111111100000000 No 233 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=94.93 E-value=0.0032 Score=34.30 Aligned_cols=404 Identities=11% Similarity=0.039 Sum_probs=174.6 Q ss_pred CcCCCC----------------CCCCcccccccchhhhhh------hcc--cccccc-----cccccchhhhHHHhhcHH Q lcl|NC_021305. 1 MLLANG----------------QTLSAPAMAELSPQMQDS------YYY--APAVGM-----QLERQFSLYGGIYKNQPW 51 (518) Q Consensus 1 ~~f~~~----------------~~~~~~~~~~~~~~~~~~------~~~--~~~~~~-----~~~~~~~~~~~~~~~~~~ 51 (518) =||++- .++..|...+.+.-+... .++ +...+. ...-....++++ +.+|. 
T Consensus 7 ~lf~f~~k~~e~~~~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR~m-a~~pE 85 (521) T protein:vir:10 7 KLLQPWMKDDEKRVQSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQNTKDLINQYRSL-SKYHE 85 (521) T ss_pred HHhhhhhhhhhhHHhhhhccCccccccccCCCCceeeccCCCccccccchhhhhhccccccchHHHHHHHHHHH-hhccc Confidence 123322 111122211111111000 000 000000 011112334444 77899 Q ss_pred HHHHHHHHHHhhccC-----ceEEEEecCCcceecc---chHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEc Q lcl|NC_021305. 52 VRTVIAKRAQALARL-----PVKCMFTSGDTETEES---DTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKN 123 (518) Q Consensus 52 v~~~v~~ia~~ia~l-----~~~v~~~~~~~~~~~~---~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~ 123 (518) |..||+.|.+.+.-+ |+.|--++-+.....+ ...+..++.--+-...++ .+++.|.+.|..|+.++-+ T Consensus 86 vd~Av~eIvneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~----~~fR~WYVDgRi~fHkiid 161 (521) T protein:vir:10 86 VDNAIDEIINDAIVQEDNRDTVYLDLDKTDWNESVKEMVREEFRTILKLLKFEREGK----RHFRRWYVDSRIYFHKMID 161 (521) T ss_pred hhhHHHhhhcceEEecCCCceEEEEecCcccchHHHHHHHHHHHHHHHHhccchhhh----HHHhhheeeeeEEEEEEee Confidence 999999999988543 2222211111111111 111111222122233344 4466789999999998765 Q ss_pred C---CCceEEEEeeCCceeEEEEcCC-----------ceeeEEeeec-------ccccCceeEEeccccEEEEec--cCC Q lcl|NC_021305. 124 K---SGTPEKLMPMHPSRVAIKRNSR-----------TGRYEYYFQA-------GAGVGTQLVSFADDEVVPIRF--FNP 180 (518) Q Consensus 124 ~---~G~~~~l~~l~p~~v~v~~~~~-----------~~~~~~~~~~-------~~~~~~~~~~~~~~evih~~~--~~~ 180 (518) . ...+.+|+.|+|..++.+.... +...+|.|.. .+...+..+.++.+.|.|... .+. 
T Consensus 162 ~~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~~g~~~~~vkI~~daI~y~hSGL~d~ 241 (521) T protein:vir:10 162 PARPKDGIKELRLLDPRNVEYYRVNLKSNENGNDVYKGVKEFFTYGATEDNRYNISGNSNNLVQIPIDAIVYSHSGKVDI 241 (521) T ss_pred CCCccccceeeeeeCCcceeeeeeecCCCCCcchhhccceeeeeeccCCCceecCCCCCCcceeechhheeeecccceeC Confidence 3 2358999999999986544211 1111222211 011223446677766666552 233 Q ss_pred CCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHHHHHhcC----ccccCC---- Q lcl|NC_021305. 181 DGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSG----SSNTGK---- 251 (518) Q Consensus 181 ~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~~~~~~g----~~n~g~---- 251 (518) ++ .+.+|-|..|...+.....++....-+--..+.-+=|+..+ +++.+...++.-..+...++. ....|. T Consensus 242 ~~-~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYDa~TGev~dd 320 (521) T protein:vir:10 242 DG-KTIVGYLHNVIKPANQLKMLEDAMVIYRITRAPERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYDSSTGKVKNS 320 (521) T ss_pred CC-CceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccc Confidence 33 46789999999988887777777666644455555455443 455544444433333222211 011121 Q ss_pred --ee-ec----------CCCcceeeccC--ChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc-cccCCHHHHH----- Q lcl|NC_021305. 252 --TM-VV----------EEGMEPIPLQL--TAVEMQFIEARQLNREEVCGVYDIAPPIVHILDR-ATFSNISAQM----- 310 (518) Q Consensus 252 --~~-vl----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~~~sn~e~~~----- 310 (518) .+ +| ..|.+++.|.. +..++ +-..+..+.+..+++||.+-|+.... -+.+-..+-. 
T Consensus 321 rk~msMlEDyWLpRReGgrgTEI~TLpggqnlgem---~DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr~~EItRDEik 397 (521) T protein:vir:10 321 SNNLAMTEDYWLMRRDGKATTEVSTLPGAQSMGEM---DDVRWFNRKLYESMKIPLSRLPQEGAGVTFGAGNDITRDELQ 397 (521) T ss_pred hhhhhhHhhhcccccCCCCccceeeccccCCcChH---HHHHHHHHHHHHHhCCCccccCCCCCceecccccchhHHHHH Confidence 11 11 13566666653 33343 44456678899999999998865321 1222111222 Q ss_pred -HHHHHHHhhHHHHHHHHHHHHhhh-----hhhc--c-cccceecc--hh----hhhc-CHHHHHHHHHHHH----hCCC Q lcl|NC_021305. 311 -RAFYRDTMAIPIARIQSAMDKYVG-----QYWV--R-KNRMKFDI--DD----VIQP-DWEAKSESTQKMV----NSGV 370 (518) Q Consensus 311 -~~~~~~~l~P~~~~ie~~l~~~l~-----~~~~--~-~~~~~fd~--~~----l~~~-d~~~~~~~~~~~~----~~G~ 370 (518) .-|+..--.-+...+.+.|...|+ ++.+ . ...++|++ +. +... =+..|+.++..+- -.-+ T Consensus 398 F~KFI~rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGky 477 (521) T protein:vir:10 398 FTKYIRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGKY 477 (521) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccccccc Confidence 222222223333334444444433 2111 1 12333433 21 1111 1233455555442 2236 Q ss_pred cCHHHHHH-HhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccc Q lcl|NC_021305. 371 ATPNEGRE-IMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSP 433 (518) Q Consensus 371 ~T~NE~R~-~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (518) ++.+=+|+ .+.+...+-..-+ ... ..+...+--. 
++.++.+.- T Consensus 478 ~s~dyi~k~ILr~tDeeik~~~------------k~I---~~E~~~~~~~-----~p~~e~~df 521 (521) T protein:vir:10 478 LSHEYVMKNILRMSDEDIKTER------------EKI---DGELKDSVYK-----NPEDPMEEF 521 (521) T ss_pred cchHHHHHHHhcCCHhHHHHHH------------HHH---HHhhhCCCCC-----CCcchhhcC Confidence 67777765 4444321100000 000 0000000000 000111111 No 234 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=94.63 E-value=0.0039 Score=33.79 Aligned_cols=403 Identities=10% Similarity=0.027 Sum_probs=171.8 Q ss_pred CcCCC-------------CCCCCcccccccch---hhh-----hhhccccccccccccc-------chhhhHHHhhcHHH Q lcl|NC_021305. 1 MLLAN-------------GQTLSAPAMAELSP---QMQ-----DSYYYAPAVGMQLERQ-------FSLYGGIYKNQPWV 52 (518) Q Consensus 1 ~~f~~-------------~~~~~~~~~~~~~~---~~~-----~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~v 52 (518) =||++ +.....++.+...+ -+. ...++.-..-...... ...++ ..+.+|.| T Consensus 5 ~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~~LI~~YR-~ma~~pEv 83 (516) T protein:vir:10 5 DLFKFWDRVDQNEYDERLKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDNNISGTKDLINTYR-QLTNNPEV 83 (516) T ss_pred HhcccccchhhHHHHhhhcCCCCcccCCCCccCceeeecCcccccccceeeeeecccCccccHHHHHHHHH-Hhhhccch Confidence 25665 22222222222111 110 0111110000011111 22222 24668999 Q ss_pred HHHHHHHHHhhccC-----ceEEEEecCCcceecc---chHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEc- Q lcl|NC_021305. 
53 RTVIAKRAQALARL-----PVKCMFTSGDTETEES---DTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKN- 123 (518) Q Consensus 53 ~~~v~~ia~~ia~l-----~~~v~~~~~~~~~~~~---~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~- 123 (518) ..||+.|.+.+.-+ |+.+--++-+-....+ ...+..++.--+-...++ .+++.|.+.|..|+.++-+ T Consensus 84 d~Av~eIvneaiv~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~ll~F~~~~~----~~fR~WYVDgRi~fhKiid~ 159 (516) T protein:vir:10 84 ERAVANIVNEAVVYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICRLLDASRKLD----TLFRRWYIDSRIFFHKIMPN 159 (516) T ss_pred hHHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhh----HHHHhhhhcceEEEEEEecC Confidence 99999999987543 3333211111001100 111111222122233344 4466788999999986654 Q ss_pred CCCceEEEEeeCCceeEEEEcC-----C------ceeeEEeeeccc---------ccCceeEEeccccEEEEecc--CCC Q lcl|NC_021305. 124 KSGTPEKLMPMHPSRVAIKRNS-----R------TGRYEYYFQAGA---------GVGTQLVSFADDEVVPIRFF--NPD 181 (518) Q Consensus 124 ~~G~~~~l~~l~p~~v~v~~~~-----~------~~~~~~~~~~~~---------~~~~~~~~~~~~evih~~~~--~~~ 181 (518) ....+.+|+.|+|..+..+..- + +...+|.|...+ ...+..+.++.+-|.+.+.. ..+ T Consensus 160 ~k~GI~elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~daI~y~hSGl~d~~ 239 (516) T protein:vir:10 160 PKEGIVELRRLDPRHVEYYREIVTSDVGGTSVVKGYREFFVYTTGNEGYAYNGRLFEPNTRIKIPRSAIVYAHSGLQDCS 239 (516) T ss_pred cccceeeeeeeCCcceeeEEeeecccCcchhhhhceeeeeeeecCccceeccccccCCCCceecchhheeeeecCcccCC Confidence 3445899999999998765432 1 111122221111 01123355555544444321 112 Q ss_pred CcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHHHHHhcC----ccccCC----- Q lcl|NC_021305. 182 GLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSG----SSNTGK----- 251 (518) Q Consensus 182 ~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~~~~~~g----~~n~g~----- 251 (518) +. .=+|-|..|...+.....++....-+--..+.-+=|+..+ +++.+...++.-..+...++. ..+.|. 
T Consensus 240 ~~-~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TGev~ddr 318 (516) T protein:vir:10 240 DR-GIVGYLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQK 318 (516) T ss_pred CC-ceeceehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccch Confidence 21 1267788888887777777766655544445545444443 455544444433333332221 111222 Q ss_pred -ee-ec----------CCCcceeeccC--ChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccccc--CCHHHHHH---- Q lcl|NC_021305. 252 -TM-VV----------EEGMEPIPLQL--TAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATF--SNISAQMR---- 311 (518) Q Consensus 252 -~~-vl----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--sn~e~~~~---- 311 (518) .+ +| ..|.+++.|.. +..++ +-..+..+.+..+++||.+-|+.-...+. +...+-.+ T Consensus 319 k~msMlEDyWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDEiK 395 (516) T protein:vir:10 319 RNLSMTEDYWLMRRDGKSVTEVTSLPGAQTMGEM---DDVRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDELD 395 (516) T ss_pred hhhhhHhhhcccccCCCcccceeeccccCCcChH---HHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHH Confidence 11 11 13566666653 33343 34456678899999999998865433221 22222222 Q ss_pred --HHHHHHhhHHHHHHHHHHHHhhh-----hhhcc---cccceecc--hh----hhhcC-HHHHHHHHHHHH--hCCCcC Q lcl|NC_021305. 312 --AFYRDTMAIPIARIQSAMDKYVG-----QYWVR---KNRMKFDI--DD----VIQPD-WEAKSESTQKMV--NSGVAT 372 (518) Q Consensus 312 --~~~~~~l~P~~~~ie~~l~~~l~-----~~~~~---~~~~~fd~--~~----l~~~d-~~~~~~~~~~~~--~~G~~T 372 (518) -|+..--.-+...+.+.|.+.|+ ++.+. ...++|++ +. +.... 
+..|+.++..+- -..+++ T Consensus 396 F~KFI~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s 475 (516) T protein:vir:10 396 FRKFIVQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKYVS 475 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 22222223333345555555443 22111 12333433 21 11111 233444444432 235777 Q ss_pred HHHHHH-HhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCc Q lcl|NC_021305. 373 PNEGRE-IMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQ 431 (518) Q Consensus 373 ~NE~R~-~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (518) .+=+|+ .+.+...+-..-++ .+. .+...+--.+|.... +- T Consensus 476 ~~yi~k~ILr~tDeei~~~~k--------~I~-------~E~~~~~~~~p~~e~----~f 516 (516) T protein:vir:10 476 HDYVMKNILQMTDEQIAQEEK--------QIE-------KEANVKRFQNPENED----DF 516 (516) T ss_pred hHHHHHHHhcCCHhHHHHHHH--------HHH-------HhhhCCCCCCCCccc----cC Confidence 777775 45543221000000 000 000010001111000 00 No 235 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=94.38 E-value=0.0046 Score=33.41 Aligned_cols=405 Identities=9% Similarity=0.030 Sum_probs=170.2 Q ss_pred CcCCCCCCCCcccccccchhhhh-------hhcccccccccccc-------cchhhhHHHhhcHHHHHHHHHHHHhhccC Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQD-------SYYYAPAVGMQLER-------QFSLYGGIYKNQPWVRTVIAKRAQALARL 66 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~v~~~v~~ia~~ia~l 66 (518) -|-....+..+|...+.+..+.. .+|+....-..+.. 
....++++ +.+|.|..||+.|.+.+.-+ T Consensus 22 ~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~m-a~~pEvd~Av~eIVneaiv~ 100 (521) T protein:vir:65 22 QIKDKAESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGL-MNNHEVENAVQNIVNDAIVF 100 (521) T ss_pred hhccCCCcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHHHHHHHH-hhccchhhHHHHhhcceeEe Confidence 11111222222322222222210 01111111111111 12233333 77899999999999987543 Q ss_pred -----ceEEEEecCCccee---ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcC--CCceEEEEeeCC Q lcl|NC_021305. 67 -----PVKCMFTSGDTETE---ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK--SGTPEKLMPMHP 136 (518) Q Consensus 67 -----~~~v~~~~~~~~~~---~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~--~G~~~~l~~l~p 136 (518) |+.+--++-+-... .-...+..++.-=|-...++ .+++.|.+.|..|+.++-+. ...+.+|+.|+| T Consensus 101 d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~----~~fR~WYVDgRi~fhkiid~~pk~GI~ELr~lDP 176 (521) T protein:vir:65 101 EEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQ----DMFRRWYVDSRIFFHKIIGKNPKDGIVELRQLDP 176 (521) T ss_pred cCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhh----HHHhhhhhcceeEEEEEEcCCccccceeeeeeCC Confidence 23222111111111 01111111222122233444 44667899999999998553 345899999999 Q ss_pred ceeEEEEcCC-----c------eeeEEeeeccc---------ccCceeEEeccccEEEEeccCCC-CcccCchHHHHHHH Q lcl|NC_021305. 137 SRVAIKRNSR-----T------GRYEYYFQAGA---------GVGTQLVSFADDEVVPIRFFNPD-GLERGLSLMESLKS 195 (518) Q Consensus 137 ~~v~v~~~~~-----~------~~~~~~~~~~~---------~~~~~~~~~~~~evih~~~~~~~-~~~~G~s~l~~~~~ 195 (518) ..+..+.... + ...+|.|...+ ...+..+.++.+-|.+.+..-.+ +...-+|-|..|.. 
T Consensus 177 r~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~hSGl~d~~~~~i~syLhkAiK 256 (521) T protein:vir:65 177 RNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYAHSGLMDCDDKYIIGYLHRAVK 256 (521) T ss_pred cceeeeeeecccccCCcceecceeeeeeeecCCcceeccceeecCCcceeechhheeeeeccceeCCCCeeeecchhhhH Confidence 9987654321 1 11122221100 11233455555555444321111 11123678888888 Q ss_pred HHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHHHHHhcC----ccccC------Cee-ec-------- Q lcl|NC_021305. 196 TIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSG----SSNTG------KTM-VV-------- 255 (518) Q Consensus 196 ~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~~~~~~g----~~n~g------~~~-vl-------- 255 (518) .+.....++....-+--..+.-+=|+..+ +++.+...++.-..+...++. ....| +.+ +| T Consensus 257 p~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRR 336 (521) T protein:vir:65 257 PANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTEDYWLQRR 336 (521) T ss_pred hHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccchhhhhccccc Confidence 88777777766666544445555455443 555555444443333333322 11112 111 22 Q ss_pred --CCCcceeeccC--ChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccccccc--CCHHHHH------HHHHHHHhhHHHH Q lcl|NC_021305. 256 --EEGMEPIPLQL--TAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATF--SNISAQM------RAFYRDTMAIPIA 323 (518) Q Consensus 256 --~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~--sn~e~~~------~~~~~~~l~P~~~ 323 (518) ..|.+++.|.. +..++ +-..+..+.+..+++||.+-++..+.+.+ +...+-. .-|+..--.-+.. 
T Consensus 337 eGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EItRDEiKF~KFI~rLR~rFs~ 413 (521) T protein:vir:65 337 DGKAITDVTTLPGASGMSDI---DDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTLQSQFSE 413 (521) T ss_pred CCCCccceeecccCCCcChH---HHHHHHHHHHHHHhCCCceeccCCCCcceeccccchhhHHHHHHHHHHHHHHHHHHH Confidence 13566666643 44444 34456678899999999998754433221 2222222 2222222233333 Q ss_pred HHHHHHHHhhh-----hhhcc---cccceecc--hh----hhhcC-HHHHHHHHHHHHh--CCCcCHHHHHH-HhCCCCC Q lcl|NC_021305. 324 RIQSAMDKYVG-----QYWVR---KNRMKFDI--DD----VIQPD-WEAKSESTQKMVN--SGVATPNEGRE-IMGLPRS 385 (518) Q Consensus 324 ~ie~~l~~~l~-----~~~~~---~~~~~fd~--~~----l~~~d-~~~~~~~~~~~~~--~G~~T~NE~R~-~~g~~p~ 385 (518) .+.+.|...|+ ++.+. ...++|++ +. +.... +..|+.++..+-. .-++|.+=+|+ .+.+... T Consensus 414 lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~S~dyi~k~ILr~tDe 493 (521) T protein:vir:65 414 VLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKYTDD 493 (521) T ss_pred HHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHH Confidence 44444444443 22111 12333433 21 11111 2234444443321 22456666665 4444221 Q ss_pred CCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccc Q lcl|NC_021305. 386 DDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSP 433 (518) Q Consensus 386 ~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (518) + - .....+...+...+--. 
++.++.++- T Consensus 494 e--i-------------~~~~k~I~~E~~~~~~~-----~p~~~~~~f 521 (521) T protein:vir:65 494 Q--M-------------DTEKKQIEEEANDPRFK-----QTPDEIEDF 521 (521) T ss_pred H--H-------------HHHHHHHHHhhhCCCCC-----CCcccccCC Confidence 1 0 00000000000000000 000011111 No 236 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=94.01 E-value=0.0057 Score=32.90 Aligned_cols=405 Identities=9% Similarity=-0.001 Sum_probs=167.0 Q ss_pred CcCCCCC-------------CCCcccccccch---hhh-hhhccc---c------cccc-----cccccchhhhHHHhhc Q lcl|NC_021305. 1 MLLANGQ-------------TLSAPAMAELSP---QMQ-DSYYYA---P------AVGM-----QLERQFSLYGGIYKNQ 49 (518) Q Consensus 1 ~~f~~~~-------------~~~~~~~~~~~~---~~~-~~~~~~---~------~~~~-----~~~~~~~~~~~~~~~~ 49 (518) =||++-- ..++++.+...+ -+. ..+++. . +++. ........++++ +.+ T Consensus 7 ~lf~f~~~~de~~~~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~m-a~~ 85 (523) T protein:vir:68 7 SLFAPWAKMDERDYKDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTRELIDTYRNL-MTN 85 (523) T ss_pred hhhhhhhhhhhhhhhhhhhccCCCccccCCCCcceeeeccccccccccchhhhhhhhccccccchHHHHHHHHHHH-hhc Confidence 2333211 111222111111 111 000000 0 0000 011112234443 788 Q ss_pred HHHHHHHHHHHHhhccCc-----eEEEEecCCccee---ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEE Q lcl|NC_021305. 50 PWVRTVIAKRAQALARLP-----VKCMFTSGDTETE---ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQ 121 (518) Q Consensus 50 ~~v~~~v~~ia~~ia~l~-----~~v~~~~~~~~~~---~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~ 121 (518) |.|..||+.|.+.+.-+. +.|--++-+-... 
.-...+..++.--+-...++ .+++.|.+.|..|+.++ T Consensus 86 pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~----~~fR~WYVDgRi~fhKi 161 (523) T protein:vir:68 86 YEVDNAVSEIVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLNHLSFQRKGS----DHFRRWYVDSRIFFHKI 161 (523) T ss_pred cchhhHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhh----HHHHhheeeeEEEEEEE Confidence 999999999999875432 2221111110010 00111111222222233344 44667899999999987 Q ss_pred EcCC---CceEEEEeeCCceeEEEEc-----CCce------eeEEeeeccc---------ccCceeEEeccccEEEEecc Q lcl|NC_021305. 122 KNKS---GTPEKLMPMHPSRVAIKRN-----SRTG------RYEYYFQAGA---------GVGTQLVSFADDEVVPIRFF 178 (518) Q Consensus 122 r~~~---G~~~~l~~l~p~~v~v~~~-----~~~~------~~~~~~~~~~---------~~~~~~~~~~~~evih~~~~ 178 (518) -|.. ..+.+|+.|+|..|+.+.. ..+. ..+|.|.... ...+..+.++.+-|.|.... T Consensus 162 id~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~g~~vi~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSG 241 (523) T protein:vir:68 162 IDPKRPKEGIKELRRLDPRQVQYVREVITTTEAGVKIVKGYKEYFIYDTSHESYACDGRIYEAGTKIKIPKAAIVYAHSG 241 (523) T ss_pred eeCCCccccceeeeeeCCcceeEEEeecCCCCcchhhhhhhhhheeeccccccccccccccCCCcceecchhheeeeecc Confidence 6533 3589999999999865331 1111 1112221111 11134566666666555422 Q ss_pred CCCC-cccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHHHHHhcC----ccccCC- Q lcl|NC_021305. 179 NPDG-LERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSG----SSNTGK- 251 (518) Q Consensus 179 ~~~~-~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~~~~~~g----~~n~g~- 251 (518) -.+. ...-+|-|..|...+.....++....-+--..+.-+=|+.++ +++.+...++.-..+...++. ....|. 
T Consensus 242 L~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev 321 (523) T protein:vir:68 242 LVDCCGKNIIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKNRIAYDATTGKI 321 (523) T ss_pred ceeCCCCceeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhhcceeEEeccCCee Confidence 1111 113367888888888777777766665544445555454443 455554444433333222211 001121 Q ss_pred -----ee-ec----------CCCcceeeccC--ChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc-ccccCCHHHHH-- Q lcl|NC_021305. 252 -----TM-VV----------EEGMEPIPLQL--TAVEMQFIEARQLNREEVCGVYDIAPPIVHILD-RATFSNISAQM-- 310 (518) Q Consensus 252 -----~~-vl----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~~sn~e~~~-- 310 (518) .+ +| ..|.++..|.. +..++ +-..+..+.+..+++||.+-|.... .-+.+-..+-. T Consensus 322 ~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRD 398 (523) T protein:vir:68 322 KNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNM---EDVRWFRNALYMALRIPITRIPSDQGGIQFDAGTSITRD 398 (523) T ss_pred ccchhhhhhHhhhcccccCCCcccceeeccccCCcChH---HHHHHHHHHHHHHhCCcceeecCCCcceecccccchhHH Confidence 11 11 13566666653 33343 3445667889999999998884322 12322222222 Q ss_pred ----HHHHHHHhhHHHHHHHHHHHHhhh-----hhhc--c-cccceecc--hh----hhhcC-HHHHHHHHHHHHh--CC Q lcl|NC_021305. 311 ----RAFYRDTMAIPIARIQSAMDKYVG-----QYWV--R-KNRMKFDI--DD----VIQPD-WEAKSESTQKMVN--SG 369 (518) Q Consensus 311 ----~~~~~~~l~P~~~~ie~~l~~~l~-----~~~~--~-~~~~~fd~--~~----l~~~d-~~~~~~~~~~~~~--~G 369 (518) .-|+..--.-+...+.+.|...|+ ++.+ . ...++|++ +. +.... +..|+.++..+-. 
.- T Consensus 399 EikF~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGk 478 (523) T protein:vir:68 399 ELSFGKFIRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMAEPFIGK 478 (523) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcc Confidence 222222223333334444444433 2111 1 12333433 21 11111 2233444433311 12 Q ss_pred CcCHHHHHH-HhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccc Q lcl|NC_021305. 370 VATPNEGRE-IMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSP 433 (518) Q Consensus 370 ~~T~NE~R~-~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (518) +++.+=+++ .+.+...+ - .....+..++. +++--.++.++.+.- T Consensus 479 y~s~~yi~k~ILr~tDee--i-------------~~~~kqI~~E~-----k~~~~~~p~~e~~~f 523 (523) T protein:vir:68 479 YISHRTAMKDILQMSDEE--I-------------EQEAKQIEEES-----KEARFQDPDQEQEDF 523 (523) T ss_pred cchhHHHHHHHhccCHHH--H-------------HHHHHHHHHHh-----hcCCCCCCchhhhcC Confidence 446666664 44432211 0 00000000000 001111111111111 No 237 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=92.68 E-value=0.01 Score=31.45 Aligned_cols=393 Identities=8% Similarity=0.004 Sum_probs=144.8 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcc--------ccccc-ccccccc--hhhhHHHh-------hcHHHHHHHHHHHHh Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYY--------APAVG-MQLERQF--SLYGGIYK-------NQPWVRTVIAKRAQA 62 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~-~~~~~~~--~~~~~~~~-------~~~~v~~~v~~ia~~ 62 (518) .+|........... ..+| ++.+. .-.++ .|..+-+ .+....+. ....|..-.+.+. 
T Consensus 42 ~~~~~~~~~~~~~~--~~~~--dst~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve-- 115 (535) T protein:vir:33 42 SLFPKESDNESTDY--TTPW--QAVGARGLNNLASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVE-- 115 (535) T ss_pred cccCCCCCcccccc--cccc--cccHHHHHHHHHHHHHHhhcCCCcccccccChHHHhccccCcchHHHHHHHHHHHH-- Confidence 56643221111110 0010 00000 00000 0000000 00000000 0000111111111 Q ss_pred hccCceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEE Q lcl|NC_021305. 63 LARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIK 142 (518) Q Consensus 63 ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~ 142 (518) ..++..+.+-| ++.-+..+..+++.+|++++++..+.. ....+..++-..+.+. T Consensus 116 ---------------------~~~~~~~~~sn----f~~~~~~~~~~L~~~G~a~l~~~~~~~-~~~~f~~~pl~~~~v~ 169 (535) T protein:vir:33 116 ---------------------RIIMNYIESNS----YRVTLFECLKQLIVAGNALLYLPEPEG-SYNPMKLYRLSSYVVQ 169 (535) T ss_pred ---------------------HHHHHHHHhcC----cHHHHHHHHHHHHhhCceeEEeecCCC-CceeeEEEEcCeeEEe Confidence 11122223323 444555667789999999999876543 3333434444455555 Q ss_pred EcCCceeeEE-----------------------------------eeecccccCcee-----------------EEeccc Q lcl|NC_021305. 143 RNSRTGRYEY-----------------------------------YFQAGAGVGTQL-----------------VSFADD 170 (518) Q Consensus 143 ~~~~~~~~~~-----------------------------------~~~~~~~~~~~~-----------------~~~~~~ 170 (518) .+..|..... ........++.. ..|..- T Consensus 170 ~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 249 (535) T protein:vir:33 170 RDAYGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVYLDEESGDYLKYEEVEDVEIDGSDATYPTDAM 249 (535) T ss_pred eCCCCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEEEEeeCCCCcEEEEEEEeCccccccccccccccC Confidence 5554422100 000000000000 012222 Q ss_pred cEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccC Q lcl|NC_021305. 
171 EVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTG 250 (518) Q Consensus 171 evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g 250 (518) ..+..|....++..||.||..-++..+.......+.......-...|..++..++...+... ..+.. T Consensus 250 P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~~----------~~~~~--- 316 (535) T protein:vir:33 250 PYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRL----------TKAQT--- 316 (535) T ss_pred CceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhhc----------ccCCc--- Confidence 45566666667888999999999999999999999999998888888877765554443321 11111 Q ss_pred Ceee--cCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH--HHHHHHHHHhhHHHHHHH Q lcl|NC_021305. 251 KTMV--VEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA--QMRAFYRDTMAIPIARIQ 326 (518) Q Consensus 251 ~~~v--l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~--~~~~~~~~~l~P~~~~ie 326 (518) +.++ -++++...++...++-.-..+..+.....|..+|-+. .+...+... -+.++ .+..-....+.|.+..+. T Consensus 317 g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~r-~TAtEV~~r~~E~~~~LG~v~~rl~ 393 (535) T protein:vir:33 317 GDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLN--SAVQRTGER-VTAEEIRYVASELEDTLGGVYSILS 393 (535) T ss_pred eeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhh--hcccCCCcc-ccHHHHHHHHHHHHHHHhHHHHHHH Confidence 1222 2344555555544433334556667788888888554 222122221 23332 223344445566666655 Q ss_pred HHHHHhhh-------------hhhcccccceecc-hhhhh----cCHHHHHHHHHHHHhCCCcCHHHHHH-HhCCCCCCC Q lcl|NC_021305. 327 SAMDKYVG-------------QYWVRKNRMKFDI-DDVIQ----PDWEAKSESTQKMVNSGVATPNEGRE-IMGLPRSDD 387 (518) Q Consensus 327 ~~l~~~l~-------------~~~~~~~~~~fd~-~~l~~----~d~~~~~~~~~~~~~~G~~T~NE~R~-~~g~~p~~~ 387 (518) ++|-.-|+ ++... ..+++.+ +.|-. .+.......+..+-. +.| |+.. 
.++.+.+-+ T Consensus 394 ~Ell~Pli~r~~~il~r~g~lP~~p~-~~v~~~yis~La~aqr~~~~~~l~~~~~~la~---~~P-~~~d~~id~d~~~~ 468 (535) T protein:vir:33 394 QELQLPLVRVLLKQLQATSQIPELPK-EAVEPTISTGLEAIGRGQDLDKLERCISAWAA---LAP-MQGDPDINLAVIKL 468 (535) T ss_pred HHHHHHHHHHHHHHHHhcCCCCCCCc-cceeEEEecHHHHHHHHHHHHHHHHHHHHHHh---hCh-hhhhccCCHHHHHH Confidence 55543332 22111 1223322 22211 011111111111100 111 1100 011100000 Q ss_pred CCcceeeecc-cccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhccc--- Q lcl|NC_021305. 388 PKADELYANS-ALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKP--- 463 (518) Q Consensus 388 ~~gD~~~~~~-n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~--- 463 (518) .-++.+-+|. .+.. + +...........+......+..+ T Consensus 469 ~~a~~~Gvp~~~i~~-------------------~-------------------~ee~~~~~~q~~~~~~~~~~~~~~g~ 510 (535) T protein:vir:33 469 RIANAIGIDTSGILL-------------------T-------------------DEQKQALMMQDAAQTGVENAAAAGGA 510 (535) T ss_pred HHHHHcCCCHhHhcC-------------------C-------------------HHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 0000000000 0000 0 00000000000010000000000 Q ss_pred ----CCchhhHHHHHHHHHhhccccC Q lcl|NC_021305. 464 ----PPKESSPKHLRAVKGAMGRGKD 485 (518) Q Consensus 464 ----~~~~~~~~~~~~~~~~~~~~~~ 485 (518) ..+++. .....+.+.-|..-+ T Consensus 511 ~~~~~~~~~~-~~~~~~~~~~g~~~~ 535 (535) T protein:vir:33 511 GVGALATSSP-EAMQGAAAKAGLNAT 535 (535) T ss_pred hhcchhhcCC-hhHHHHHHhccCCCC Confidence 011111 112233333333322 No 238 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=92.61 E-value=0.011 Score=31.39 Aligned_cols=399 Identities=9% Similarity=0.038 Sum_probs=141.9 Q ss_pred CcCCCCCCCCcccccccchhhhhhhccc--------cccc-ccccc--cchhhhHHH-------hhcHHHHHHHHHHHHh Q lcl|NC_021305. 
1 MLLANGQTLSAPAMAELSPQMQDSYYYA--------PAVG-MQLER--QFSLYGGIY-------KNQPWVRTVIAKRAQA 62 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~-~~~~~--~~~~~~~~~-------~~~~~v~~~v~~ia~~ 62 (518) .+|..-....... ...+ + ++.+.. -.++ .|..+ ...+....+ .....|..-.+.+. T Consensus 42 ~~~~~~~~~~~~~--~~~~-~-dst~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve-- 115 (535) T protein:vir:15 42 SLFPKESDNESTD--YTTP-W-QAVGARGLNNLASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVE-- 115 (535) T ss_pred cccCCCCCccccc--cccc-c-cccHHHHHHHHHHHHHHhhcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHH-- Confidence 5664322111110 0011 0 110000 0000 00000 000000000 00001111111111 Q ss_pred hccCceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCC-ceEEEEeeCCceeEE Q lcl|NC_021305. 63 LARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSG-TPEKLMPMHPSRVAI 141 (518) Q Consensus 63 ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G-~~~~l~~l~p~~v~v 141 (518) ..++..+.+- +++.-+..+..+++.+|++.+++..+..+ .....||+ .++.+ T Consensus 116 ---------------------~~~~~~l~~s----nf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~f~~~pl--~~~~v 168 (535) T protein:vir:15 116 ---------------------RIIMNYIESN----SYRVTLFECLKQLIVAGNALLYLPEPEGSYNPMKLYRL--SSYVV 168 (535) T ss_pred ---------------------HHHHHHHHhc----CcHHHHHHHHHHHHhhCceeEEeecCCCCceeeEEEEc--CeeEE Confidence 1122222332 34445566677899999999888765433 22334444 44444 Q ss_pred EEcCCceeeE-----------------------------------EeeecccccCce-----------------eEEecc Q lcl|NC_021305. 142 KRNSRTGRYE-----------------------------------YYFQAGAGVGTQ-----------------LVSFAD 169 (518) Q Consensus 142 ~~~~~~~~~~-----------------------------------~~~~~~~~~~~~-----------------~~~~~~ 169 (518) ..+..|.... |........++. ...|.. 
T Consensus 169 ~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~~~~~~~~~~~~~ 248 (535) T protein:vir:15 169 QRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEESGDYLKYEEVEDVEIDGSDATYPTDA 248 (535) T ss_pred eeCCCCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEecCCCcEEEEEEeeCcccccccccccccc Confidence 4444442110 000000000000 001222 Q ss_pred ccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCcccc Q lcl|NC_021305. 170 DEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNT 249 (518) Q Consensus 170 ~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~ 249 (518) -..+..|....++..||.||..-++..+.......+.......-...|..++..++...+... ..+.. T Consensus 249 ~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g~~~~~~l----------~~~~~-- 316 (535) T protein:vir:15 249 MPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAGITQPRRL----------TKAQT-- 316 (535) T ss_pred CCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecccccccchhc----------ccCCc-- Confidence 345666666667888999999999999999999999999998888888877765554443321 11111 Q ss_pred CCeee--cCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH--HHHHHHHHHhhHHHHHH Q lcl|NC_021305. 250 GKTMV--VEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA--QMRAFYRDTMAIPIARI 325 (518) Q Consensus 250 g~~~v--l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~--~~~~~~~~~l~P~~~~i 325 (518) +.++ -++++...++...++-.-..+..+.....|..+|-+. .+...+... 
-+.++ .+..-....+.|.+..+ T Consensus 317 -g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~r-~TAtEV~~r~~E~~~~LG~v~~rl 392 (535) T protein:vir:15 317 -GDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLN--SAVQRTGER-VTAEEIRYVASELEDTLGGVYSIL 392 (535) T ss_pred -eeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhh--hcccCCCcc-ccHHHHHHHHHHHHHHHhHHHHHH Confidence 1122 2344555555544433334556667788888888554 222122221 23332 22334444556666665 Q ss_pred HHHHHHhhh-------------hhhcccccceecc-hhhhh----cCHHHHHHHHHHHHhCCCcCHHHHHH-HhCCCCCC Q lcl|NC_021305. 326 QSAMDKYVG-------------QYWVRKNRMKFDI-DDVIQ----PDWEAKSESTQKMVNSGVATPNEGRE-IMGLPRSD 386 (518) Q Consensus 326 e~~l~~~l~-------------~~~~~~~~~~fd~-~~l~~----~d~~~~~~~~~~~~~~G~~T~NE~R~-~~g~~p~~ 386 (518) .++|-.-|+ ++... ..+++.+ +.|-. .+.......+..+-. +.| |+.. .++.+.+- T Consensus 393 ~~Ell~Pli~r~~~il~r~g~lP~~p~-~~v~~~yis~La~aqr~~~~~~l~~~~~~la~---~~P-~~ld~~id~d~~~ 467 (535) T protein:vir:15 393 SQELQLPLVRVLLKQLQATSQIPELPK-EAVEPTISTGLEAIGRGQDLDKLERCISAWAA---LAP-MQGDPDINLAVIK 467 (535) T ss_pred HHHHHHHHHHHHHHHHHhcCCCCCCCc-cceeEEEecHHHHHHHHHHHHHHHHHHHHHHh---cCh-hhhhccCCHHHHH Confidence 555543332 22111 1223322 22211 111111111111110 111 1100 01110000 Q ss_pred CCCcceeeec-ccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcccCC Q lcl|NC_021305. 387 DPKADELYAN-SALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPP 465 (518) Q Consensus 387 ~~~gD~~~~~-~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~ 465 (518) +.-++.+-+| ..++.-. ++. +.... ..............+...+.++. T Consensus 468 ~~~a~~~Gvp~~~i~~~~---------------------------eev-~~~~~--q~~~~~~~~~~a~~~g~~~~~~~- 516 (535) T protein:vir:15 468 LRIANAIGIDTSGILLTD---------------------------EQK-QALMM--QDAAQTGIENAAATGGAGVGALA- 516 (535) T ss_pred HHHHHHcCCChhhhcCCH---------------------------HHH-HHHHH--HHHHHHHHHHHHHHHHhhccchh- Confidence 0000000000 0000000 000 00000 00000000000000000010110 Q ss_pred chhhHHHHHHHHHhhccccC Q lcl|NC_021305. 
466 KESSPKHLRAVKGAMGRGKD 485 (518) Q Consensus 466 ~~~~~~~~~~~~~~~~~~~~ 485 (518) +. .+.....+...-|..-+ T Consensus 517 ~~-~p~~~~~~~~~~g~~~~ 535 (535) T protein:vir:15 517 TS-SPEAMQGAAAQAGLDAT 535 (535) T ss_pred cc-ChHHHHHHHhccCCCCC Confidence 00 11112222333322111 No 239 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=91.80 E-value=0.014 Score=30.72 Aligned_cols=424 Identities=8% Similarity=-0.005 Sum_probs=159.6 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccc--hhhhHHHhhcHHHHHHHHHHHHhh-c-----cCceEEEE Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQF--SLYGGIYKNQPWVRTVIAKRAQAL-A-----RLPVKCMF 72 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~v~~~v~~ia~~i-a-----~l~~~v~~ 72 (518) .|.. .|.+=...++..+.++....+.. .+.....+. ............ ..|++.+|..+ + .-||.=.. T Consensus 12 ~l~~-~R~~~e~~w~e~~~~~lP~~~~~--~~~~~~~~~~~~~~~~~i~dst~-~~a~~~Las~L~~~ltPp~~~WF~l~ 87 (547) T protein:vir:10 12 FLKT-DRKNVEQIWDCIRKYIMPMRSDF--FSDLRSEGSINWNQNREVFDSTA-GDGLETLSSSLHGSLTSPATKWFELA 87 (547) T ss_pred HHHH-HhhHHHHHHHHHHHHhccccccc--ccCCCCCcccccccccccccchH-HHHHHHHHHHHHHhhcCCCCcccccc Confidence 1111 01000111111111111000000 000000000 000011122223 34555555554 2 33443222 Q ss_pred ecCCc-ce--ecc------chHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCC-CceEEEEeeCCceeEEE Q lcl|NC_021305. 73 TSGDT-ET--EES------DTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS-GTPEKLMPMHPSRVAIK 142 (518) Q Consensus 73 ~~~~~-~~--~~~------~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~-G~~~~l~~l~p~~v~v~ 142 (518) ..+.. .+ +.. ...++..+.+-| ++.-+..+..+++++|++.+++..+.. ...+.+..++..++.+. 
T Consensus 88 ~~d~~~~~~~~v~~~L~~ve~~i~~~l~~sn----f~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~~~pl~~~~v~ 163 (547) T protein:vir:10 88 FRDKELNSDDECRKWLENATHDVYSALQDSN----FNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQSSPIQDSYFE 163 (547) T ss_pred cCCccccchHHHHHHHHHHHHHHHHHHHhcC----cHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEEEeecceEEEe Confidence 21111 00 000 112223334433 444466678899999999999876542 23455656666666666 Q ss_pred EcCCceeeEEe----------------------------------------ee-c---ccccCc--------------ee Q lcl|NC_021305. 143 RNSRTGRYEYY----------------------------------------FQ-A---GAGVGT--------------QL 164 (518) Q Consensus 143 ~~~~~~~~~~~----------------------------------------~~-~---~~~~~~--------------~~ 164 (518) .+..|...... +. + .....+ .. T Consensus 164 ~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~~p~~s 243 (547) T protein:vir:10 164 EDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAGTVLAPTERPFGK 243 (547) T ss_pred eCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCccccceeeccccceeE Confidence 66655321100 00 0 000000 00 Q ss_pred EE--------------eccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHH Q lcl|NC_021305. 165 VS--------------FADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEA 230 (518) Q Consensus 165 ~~--------------~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~ 230 (518) +. |..-.++.+|....++..||.||...++..+.......+.......-...|..++..++.+.+ T Consensus 244 ~~~e~~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~- 322 (547) T protein:vir:10 244 KWILKEGAVQLGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPAIMVTERGLISD- 322 (547) T ss_pred EEEEecCceeeeecCCcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeccccccccc- Confidence 00 111234555555567888999999999999999999998888888888888876655444332 Q ss_pred HHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHH-HHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH- Q lcl|NC_021305. 
231 AQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQ-FIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA- 308 (518) Q Consensus 231 ~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~- 308 (518) ++ . ..|++.+.+..-.++++...+ +.+ ..+..+.....|-.+|-++...+-..+. -+.++ T Consensus 323 --------~~-~-----~pgg~~~~~~~~~v~pl~~~~-~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~~---~TAtEV 384 (547) T protein:vir:10 323 --------ID-L-----GASGLTVVRDMESMKPFESRA-RFDVSSIQLTDLRSAVRRIYYVDQLQMKDSPA---MTATEV 384 (547) T ss_pred --------ce-e-----cCCeeeecCCcccceeeeccc-chHHHHHHHHHHHHHHHHHhhhhhhhcCCCcc---ccHHHH Confidence 11 1 124555556555666665443 333 3466777788899999776544332222 22222 Q ss_pred -HHHHHHHHHhhHHHHHHHHHHHHhhh-------------hhhccc----ccceecchhhhhcCHHHHHHHHHHHHhCCC Q lcl|NC_021305. 309 -QMRAFYRDTMAIPIARIQSAMDKYVG-------------QYWVRK----NRMKFDIDDVIQPDWEAKSESTQKMVNSGV 370 (518) Q Consensus 309 -~~~~~~~~~l~P~~~~ie~~l~~~l~-------------~~~~~~----~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~ 370 (518) ....-....+.|.+..+.++|-.-|+ ++.... ....+++. ..+...++.....+.+. . T Consensus 385 ~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~---~is~Laraq~~~~~~~i-~ 460 (547) T protein:vir:10 385 QVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIV---YTGPLSRAQKIDQAASI-E 460 (547) T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEE---eccHHHHHHHHHHHHHH-H Confidence 22334444556666655555543332 211110 11122211 22222332221111000 0 Q ss_pred cCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhh Q lcl|NC_021305. 371 ATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTD 450 (518) Q Consensus 371 ~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (518) -..+-+-...+..|- -.|.+ |. +......... -+.+......+..... 
T Consensus 461 ~~~~~v~~laq~~P~---vld~i----d~---d~~~~~~a~~------------------~Gvp~~~irs~eev~~---- 508 (547) T protein:vir:10 461 RWAGSTAQLAEINPE---VLDIP----DW---DEMVRMLGSL------------------LGAPQTLMRPKAKVTS---- 508 (547) T ss_pred HHHHHHHHhhccChh---hhhcC----CH---HHHHHHHHHH------------------hCCChhccCCHHHHHH---- Confidence 001111111122110 11111 00 0000000000 0000000000000000 Q ss_pred HHHHHHHHhhcccCCch-hhHHHHHHHHHhhccccCc-Cchh Q lcl|NC_021305. 451 SGKTEPRRLMQKPPPKE-SSPKHLRAVKGAMGRGKDI-KGFA 490 (518) Q Consensus 451 ~~~~~~~~~~~k~~~~~-~~~~~~~~~~~~~~~~~~~-~~~~ 490 (518) -..++...++.-.. ......-..-..+|.+.++ +-+- T Consensus 509 ---~r~qr~~~~q~~~qaa~~~~~g~~m~~~~~~~a~~~~~~ 547 (547) T protein:vir:10 509 ---IRKNRSQTQQKAEQAAIAEAEGNAMEAQGKGQAALKENQ 547 (547) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccchhccC Confidence 00111111111000 0111111122223322222 1111 No 240 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=91.48 E-value=0.016 Score=30.48 Aligned_cols=405 Identities=9% Similarity=0.026 Sum_probs=165.9 Q ss_pred CcC------C----------------CCCCCCcccccccchhh--hhhhccccccc--cc-----------ccccchhhh Q lcl|NC_021305. 1 MLL------A----------------NGQTLSAPAMAELSPQM--QDSYYYAPAVG--MQ-----------LERQFSLYG 43 (518) Q Consensus 1 ~~f------~----------------~~~~~~~~~~~~~~~~~--~~~~~~~~~~~--~~-----------~~~~~~~~~ 43 (518) |=| . 
...++.+|...+...-+ ...+..+...| .+ .......++ T Consensus 1 m~~~~L~~~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~YR 80 (524) T protein:vir:72 1 MKFNVLSLFAPWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTYR 80 (524) T ss_pred CCCchhhHhhccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHHH Confidence 322 1 00111111111100000 00000001111 00 111122344 Q ss_pred HHHhhcHHHHHHHHHHHHhhccC-----ceEEEEecCCcceecc---chHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCC Q lcl|NC_021305. 44 GIYKNQPWVRTVIAKRAQALARL-----PVKCMFTSGDTETEES---DTGYAKLLADPCEYLDPFAFWEWVASTLDIYGE 115 (518) Q Consensus 44 ~~~~~~~~v~~~v~~ia~~ia~l-----~~~v~~~~~~~~~~~~---~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~ 115 (518) ++ +.+|.|..||+.|.+.+.-+ |+.+--++-+.....+ ...+..++.--+-...++ .+++.|.+.|. T Consensus 81 ~m-a~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~----~~fR~WYVDgR 155 (524) T protein:vir:72 81 NL-MNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFSDVLNHLSFQRKGS----DHFRRWYVDSR 155 (524) T ss_pred HH-hhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhh----HHHhhheeeeE Confidence 44 78899999999999987543 2222211111111111 111111222222233344 44667899999 Q ss_pred eEEEEEEcCC---CceEEEEeeCCceeEEEEcC-----Cce------eeEEeeeccc---------ccCceeEEeccccE Q lcl|NC_021305. 116 TYLAIQKNKS---GTPEKLMPMHPSRVAIKRNS-----RTG------RYEYYFQAGA---------GVGTQLVSFADDEV 172 (518) Q Consensus 116 ~~~~i~r~~~---G~~~~l~~l~p~~v~v~~~~-----~~~------~~~~~~~~~~---------~~~~~~~~~~~~ev 172 (518) .|+.++-|.. ..+.+|+.|+|..++.+..- .+. ..+|.|.... 
...+..+.++.+-| T Consensus 156 i~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dAI 235 (524) T protein:vir:72 156 IFFHKIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAAV 235 (524) T ss_pred EEEEEEEeCCCccccceeeeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchhhe Confidence 9999876533 35899999999998653321 111 1112221110 11234556666666 Q ss_pred EEEeccCCCC-cccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHHHHHhcC----c Q lcl|NC_021305. 173 VPIRFFNPDG-LERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSG----S 246 (518) Q Consensus 173 ih~~~~~~~~-~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~~~~~~g----~ 246 (518) .|....-.+. ...-+|-|..|...+.....++....-+--..+.-+=|+.++ +++.+...++....+...++. . T Consensus 236 ~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYD 315 (524) T protein:vir:72 236 VYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYD 315 (524) T ss_pred eeeeccceeCCCCceeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEe Confidence 5554221111 112367788888888777777766665544445545444443 455554444433333332221 1 Q ss_pred cccCC------ee-ec----------CCCcceeeccC--ChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc--ccccCC Q lcl|NC_021305. 247 SNTGK------TM-VV----------EEGMEPIPLQL--TAVEMQFIEARQLNREEVCGVYDIAPPIVHILD--RATFSN 305 (518) Q Consensus 247 ~n~g~------~~-vl----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~--~~~~sn 305 (518) .+.|. .+ +| ..|.++..|.. +..++ +-..+..+.+..+++||.+-|..-. .-+.+. 
T Consensus 316 a~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr 392 (524) T protein:vir:72 316 ASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNM---EDIRWFRQALYMALRVPLSRIPQDQQGGVMFDS 392 (524) T ss_pred CCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChH---HHHHHHHHHHHHHhCCchhhcCCCCCccccccc Confidence 11222 11 11 13566666653 33343 3445667889999999999883211 112222 Q ss_pred HHHHHH------HHHHHHhhHHHHHHHHHHHHhhh-----hhhc--c-cccceecc--hh----hhhcC-HHHHHHHHHH Q lcl|NC_021305. 306 ISAQMR------AFYRDTMAIPIARIQSAMDKYVG-----QYWV--R-KNRMKFDI--DD----VIQPD-WEAKSESTQK 364 (518) Q Consensus 306 ~e~~~~------~~~~~~l~P~~~~ie~~l~~~l~-----~~~~--~-~~~~~fd~--~~----l~~~d-~~~~~~~~~~ 364 (518) ..+-.+ -|+..--.-+...+.+.|...|+ ++.+ . ...++|++ +. +.... +..|+.++.. T Consensus 393 ~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~ 472 (524) T protein:vir:72 393 GTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLTM 472 (524) T ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHH Confidence 222222 22222223333334444444433 2111 1 12333433 21 11111 2233444433 Q ss_pred HHh--CCCcCHHHHHH-HhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccc Q lcl|NC_021305. 365 MVN--SGVATPNEGRE-IMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSP 433 (518) Q Consensus 365 ~~~--~G~~T~NE~R~-~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (518) +-. .-+++.+=+++ .+.+...+ - .....+..++.. 
++--.++.++.+.- T Consensus 473 ~dpyvGky~s~~yi~k~ILr~tDee--i-------------~~~~k~I~~E~k-----~~~~~~~~~~~~~f 524 (524) T protein:vir:72 473 AEPFIGKYISHRTAMKDILQMTDEE--I-------------EQEAKQIEEESK-----EARFQDPDQEQEDF 524 (524) T ss_pred hhhhhcccchhHHHHHHHhccCHHH--H-------------HHHHHHHHHHhh-----cCCCCCCchhhhcC Confidence 311 12446666664 44432211 0 000000000000 00000111111111 No 241 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=91.32 E-value=0.016 Score=30.37 Aligned_cols=405 Identities=9% Similarity=0.027 Sum_probs=166.0 Q ss_pred CcC------C----------------CCCCCCcccccccchhh--hhhhccccccc--cc-----------ccccchhhh Q lcl|NC_021305. 1 MLL------A----------------NGQTLSAPAMAELSPQM--QDSYYYAPAVG--MQ-----------LERQFSLYG 43 (518) Q Consensus 1 ~~f------~----------------~~~~~~~~~~~~~~~~~--~~~~~~~~~~~--~~-----------~~~~~~~~~ 43 (518) |=| . ...++.+|...+...-+ ...+..+...| .+ .......++ T Consensus 1 m~~~~L~~~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~YR 80 (524) T protein:vir:10 1 MKFNVLSLFAPWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDTYR 80 (524) T ss_pred CCCchhhHhhccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHHHH Confidence 322 1 00111111111100000 00000001111 00 111122344 Q ss_pred HHHhhcHHHHHHHHHHHHhhccC-----ceEEEEecCCcceecc---chHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCC Q lcl|NC_021305. 44 GIYKNQPWVRTVIAKRAQALARL-----PVKCMFTSGDTETEES---DTGYAKLLADPCEYLDPFAFWEWVASTLDIYGE 115 (518) Q Consensus 44 ~~~~~~~~v~~~v~~ia~~ia~l-----~~~v~~~~~~~~~~~~---~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~ 115 (518) ++ +.+|.|..||+.|.+.+.-+ |+.+--++-+.....+ ...+..++.--+-...++ .+++.|.+.|. 
T Consensus 81 ~m-a~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~----~~fR~WYVDgR 155 (524) T protein:vir:10 81 NL-MNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFNDVLNHLSFQRKGS----DHFRRWYVDSR 155 (524) T ss_pred HH-hhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhh----HHHhhheeeeE Confidence 44 78899999999999987543 2222211111111111 111111222222233344 44667899999 Q ss_pred eEEEEEEcCC---CceEEEEeeCCceeEEEEcC-----Cce------eeEEeeeccc---------ccCceeEEeccccE Q lcl|NC_021305. 116 TYLAIQKNKS---GTPEKLMPMHPSRVAIKRNS-----RTG------RYEYYFQAGA---------GVGTQLVSFADDEV 172 (518) Q Consensus 116 ~~~~i~r~~~---G~~~~l~~l~p~~v~v~~~~-----~~~------~~~~~~~~~~---------~~~~~~~~~~~~ev 172 (518) .|+.++-+.. ..+.+|+.|+|..++.+..- .+. ..+|.|.... ...+..+.++.+-| T Consensus 156 i~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dAI 235 (524) T protein:vir:10 156 IFFHKIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAAI 235 (524) T ss_pred EEEEEEeeCCCccccceeeeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchhhe Confidence 9999876533 35899999999998653321 111 1112221110 11234556666665 Q ss_pred EEEeccCCCC-cccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHHHHHhcC----c Q lcl|NC_021305. 173 VPIRFFNPDG-LERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSG----S 246 (518) Q Consensus 173 ih~~~~~~~~-~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~~~~~~g----~ 246 (518) .|....-.+. ...-+|-|..|...+.....++....-+--..+.-+=|+.++ +++.+...++....+...++. . 
T Consensus 236 ~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYD 315 (524) T protein:vir:10 236 VYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYD 315 (524) T ss_pred eeeeccceeCCCCceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEe Confidence 5554221111 112367788888888777777766665544445545454443 455554444433333332221 1 Q ss_pred cccCC------ee-ec----------CCCcceeeccC--ChhhHHHHHHHHHHHHHHHHHhcCCHHHhcccc--ccccCC Q lcl|NC_021305. 247 SNTGK------TM-VV----------EEGMEPIPLQL--TAVEMQFIEARQLNREEVCGVYDIAPPIVHILD--RATFSN 305 (518) Q Consensus 247 ~n~g~------~~-vl----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~--~~~~sn 305 (518) .+.|. .+ +| ..|.++..|.. +..++ +-..+..+.+..+++||.+-|..-. .-+.+. T Consensus 316 a~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr 392 (524) T protein:vir:10 316 ASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNM---EDVRWFRQALYMALRVPLSRIPQDQQGGVMFDS 392 (524) T ss_pred CCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChH---HHHHHHHHHHHHHhCCchhhcCCCCCccccccc Confidence 11222 11 11 13566666653 33343 3445667889999999999883211 112222 Q ss_pred HHHHHH------HHHHHHhhHHHHHHHHHHHHhhh-----hhhc--c-cccceecc--hh----hhhcC-HHHHHHHHHH Q lcl|NC_021305. 306 ISAQMR------AFYRDTMAIPIARIQSAMDKYVG-----QYWV--R-KNRMKFDI--DD----VIQPD-WEAKSESTQK 364 (518) Q Consensus 306 ~e~~~~------~~~~~~l~P~~~~ie~~l~~~l~-----~~~~--~-~~~~~fd~--~~----l~~~d-~~~~~~~~~~ 364 (518) ..+-.+ -|+..--.-+...+.+.|...|+ ++.+ . ...++|++ +. +.... +..|+.++.. 
T Consensus 393 ~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~ 472 (524) T protein:vir:10 393 GTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLTM 472 (524) T ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHH Confidence 222222 22222223333334444444433 2111 1 12333433 21 11111 2233444433 Q ss_pred HHh--CCCcCHHHHHH-HhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccc Q lcl|NC_021305. 365 MVN--SGVATPNEGRE-IMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSP 433 (518) Q Consensus 365 ~~~--~G~~T~NE~R~-~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 433 (518) +-. .-+++.+=+++ .+.+...+ - .....+..++. +++--.++.++.+.- T Consensus 473 ~dpyvGky~s~~yi~k~ILr~tDee--i-------------~~~~k~I~~E~-----k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 473 AEPFIGKYISHRTAMKDILQMTDEE--I-------------EQEAKQIEEES-----KEARFQDPDQEQEDF 524 (524) T ss_pred hhhhhcccchhHHHHHHHhccCHHH--H-------------HHHHHHHHHHh-----hcCCCCCCchhhhcC Confidence 311 12446666664 44432211 0 00000000000 001111111111111 No 242 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=90.76 E-value=0.019 Score=30.00 Aligned_cols=377 Identities=11% Similarity=0.042 Sum_probs=156.7 Q ss_pred CcCCCCCCCCcc---cccccchhhhhhhccccccc------cc-ccccchhhhHHHhh----cHHHHHHHHHHHHhhccC Q lcl|NC_021305. 1 MLLANGQTLSAP---AMAELSPQMQDSYYYAPAVG------MQ-LERQFSLYGGIYKN----QPWVRTVIAKRAQALARL 66 (518) Q Consensus 1 ~~f~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~------~~-~~~~~~~~~~~~~~----~~~v~~~v~~ia~~ia~l 66 (518) |= .+.+-| +...-...+++..++....- .+ ...........++. 
.+++...++.++..+-+- T Consensus 1 m~----V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~vf~k 76 (452) T protein:vir:94 1 MP----IETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSGMVLDQ 76 (452) T ss_pred CC----CCCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhchhhcC Confidence 21 111111 11111122333333322110 00 00000011122232 455666666666666555 Q ss_pred ceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeE------ Q lcl|NC_021305. 67 PVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVA------ 140 (518) Q Consensus 67 ~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~------ 140 (518) |+.+ +..+.+.+ ++.. ....+..+|.+.++...+.+|.+++++.....|.--.+..+.|..|. T Consensus 77 ~p~~---------~~p~~l~~-~~~D-~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~~~~~~~Ii~W~~~~ 145 (452) T protein:vir:94 77 PPVI---------THPDAMSK-YFED-QSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYISVYTTENILNWEEDE 145 (452) T ss_pred Ccee---------cccHHHHH-HHhc-ccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEEEechhhhcCccccc Confidence 5543 11222222 3322 34678999999999999999999999998877642333333333221 Q ss_pred -------------EEEcCC---ce--eeEEeee-cc---------cccCceeEE------e-------ccccEEEEeccC Q lcl|NC_021305. 141 -------------IKRNSR---TG--RYEYYFQ-AG---------AGVGTQLVS------F-------ADDEVVPIRFFN 179 (518) Q Consensus 141 -------------v~~~~~---~~--~~~~~~~-~~---------~~~~~~~~~------~-------~~~evih~~~~~ 179 (518) ...+.. +. ...|+.. .. ...++.... . ..=.++.+ +.. T Consensus 146 ~g~l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~IP~v~~-~~~ 224 (452) T protein:vir:94 146 DGRLLMVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAKTSTIQNVGVTMDYIPFFCI-TPS 224 (452) T ss_pred cCCeeEEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeeeccceeecCCCcccceeEEEEE-cCC Confidence 000110 00 0011000 00 000010000 0 00011111 122 Q ss_pred CCCcccCchHHHHHHHH-HHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCC- Q lcl|NC_021305. 
180 PDGLERGLSLMESLKST-IFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEE- 257 (518) Q Consensus 180 ~~~~~~G~s~l~~~~~~-i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~- 257 (518) .++...|.||+..++.. +.+......+...+ ...+.|-.++.--...+ ...-|+ +.++.+++ T Consensus 225 ~~~~~~~~pPLl~LA~ln~~hy~~~sd~~~~l-~~~~~P~l~~~g~~~~~------------~i~iG~---~~~~~lpe~ 288 (452) T protein:vir:94 225 GLSMTPAKPPMIDIVDINYSHYRTSADLEHGR-HFTGLPTPWITGAESQS------------TMHIGS---TKAWVIPEV 288 (452) T ss_pred CCCCCCCccchHHHHHHHHHHhcchhHHHHHH-HHcccceeEeecCcCCC------------ceEecc---cccccCCCC Confidence 23445688998877654 44444444444443 44566766654322111 112233 34566774 Q ss_pred Cc--ceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHH--HHHHHHHHhhHHHHHHHHHHHHhh Q lcl|NC_021305. 258 GM--EPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQ--MRAFYRDTMAIPIARIQSAMDKYV 333 (518) Q Consensus 258 g~--~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~--~~~~~~~~l~P~~~~ie~~l~~~l 333 (518) |. .|.+.+.+.-... .+..+....++ ...|. .++-....+ ..+.+.. ...-.+..|.-+...++++++..| T Consensus 289 ~~~~~yie~~g~~i~~~-~~~l~~le~~m-~~~Ga--~ll~~~~~~-~~s~ea~~~~~~~~~s~L~~~a~~~e~al~~~l 363 (452) T protein:vir:94 289 AAKVGFLEFTGQGLQSL-EKALSEKQAQL-ASLSA--RLIDNSTRG-SEATETVKLRYMSETASLKSVTRAVEALLNKAY 363 (452) T ss_pred CCcceEEccCchhHHHH-HHHHHHHHHHH-HHHHH--HhhccCCCc-chHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH Confidence 64 4555544443322 12222222222 11221 122111111 1122222 222335677888888888887654 Q ss_pred h--hhh-cc--cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCCCCCCCcceeeecccccccccc Q lcl|NC_021305. 334 G--QYW-VR--KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIM---GLPRSDDPKADELYANSALQPLGAT 405 (518) Q Consensus 334 ~--~~~-~~--~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~---g~~p~~~~~gD~~~~~~n~~~~~~~ 405 (518) - ..+ +. 
...++++.+...........+++-+++..|.+|....++.+ |....+.+.-+ + T Consensus 364 ~~~a~w~g~~~~~~v~~n~dF~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~gvl~~~~e~~~----------i--- 430 (452) T protein:vir:94 364 SCIMDMESMGGTLNIKLNSAFLDSKLTAAELKAWVEAYLSGGISKEIYIHALKVGKVLPPPGESMG----------V--- 430 (452) T ss_pred HHHHHHcCCCCceEEEeccccccccCCHHHHHHHHHHHhcCCCcHHHHHHHHHhCCCCCCccCHHH----------H--- Confidence 2 111 11 22344444444444334566667788999999999988877 44322211100 0 Q ss_pred cccCCCCCCCCCCCCCccCCCCCCCcc Q lcl|NC_021305. 406 PDGAVEWEEAPAPKRPASTPVASLDQS 432 (518) Q Consensus 406 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 432 (518) ..+.+++.+. +.+.|++++.+. T Consensus 431 ----~~E~~~~~~~-~~~~~~~~~~~~ 452 (452) T protein:vir:94 431 ----IPDPPAPEPS-PSNTPPNPSSKA 452 (452) T ss_pred ----HHHhhccCcc-cCCCCCCCccCC Confidence 0001111111 112222222111 No 243 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=90.44 E-value=0.021 Score=29.80 Aligned_cols=369 Identities=9% Similarity=0.031 Sum_probs=155.5 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCccee Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~~~~~ 80 (518) =.+.+............ ......... ...-..++....+|+..+.-+-+-|+.+.-.+++ . T Consensus 25 ~YY~g~~~i~~~~~~~~----------~~~~~~~~~------~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~~~~~~---~ 85 (451) T protein:vir:10 25 SYYYNKNDILKKGVVVQ----------NRDENPLRN------ADNRISHNFHEILVDEKASYMFTYPVLFDIDNNK---E 85 (451) T ss_pred HHhcccCcccccccccc----------ccccccccc------cccccccchHHHHHHhhhhheecccceeecCCcH---H Confidence 00111100000000000 000000000 0001223456778888888887778765321111 1 Q ss_pred ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCC-------ceEEEEeeCCceeEEEEcCCc-eeeEE Q lcl|NC_021305. 
81 ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSG-------TPEKLMPMHPSRVAIKRNSRT-GRYEY 152 (518) Q Consensus 81 ~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G-------~~~~l~~l~p~~v~v~~~~~~-~~~~~ 152 (518) ....+..++. | +.......+..+.+.+|.+|.++-++... ....+..++|..+.+.++... ....+ T Consensus 86 -~~~~~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~vydd~~~~~~~~ 159 (451) T protein:vir:10 86 -LNEKVTDVLG--N---EFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPIYRNGIERELEA 159 (451) T ss_pred -HHHHHHHHhc--c---CHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEEEcCCCCCceEE Confidence 1111112221 2 45566677888999999999988877541 234577788888877765432 11111 Q ss_pred e--eec-ccccCc--------eeEEeccccEEEEeccC---------------C---------CCcccCchHHHHHHHHH Q lcl|NC_021305. 153 Y--FQA-GAGVGT--------QLVSFADDEVVPIRFFN---------------P---------DGLERGLSLMESLKSTI 197 (518) Q Consensus 153 ~--~~~-~~~~~~--------~~~~~~~~evih~~~~~---------------~---------~~~~~G~s~l~~~~~~i 197 (518) . ++. ....++ ....+..+.+.+++... + .....|.|-+..+...+ T Consensus 160 ~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~~d~e~v~~li 239 (451) T protein:vir:10 160 VIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVPFVEFSNNIKKQSDLSKYKKIL 239 (451) T ss_pred EEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeeeEEEeccCCCCCCchhhHHHHH Confidence 1 110 000000 01112333333332100 0 00123666676666666 Q ss_pred HHHHHHHHHHHHHHHccCCccccccc-CccCCHHHHHHHHHHHHHHhcCccccCCeeecC-----CCcceeeccCChhhH Q lcl|NC_021305. 198 FSEDSSRNATAAMWKNAGRPNLVLRH-EKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVE-----EGMEPIPLQLTAVEM 271 (518) Q Consensus 198 ~~~~~~~~~~~~~~~ng~~p~~il~~-~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~-----~g~~~~~l~~~~~d~ 271 (518) .....+..-..+.+...+.|-.+++- .+..+.+.... ++. .+++++. .|.+...+..+.... 
T Consensus 240 Da~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~----~~~--------~~~i~~~~~~~~~~~~~~~l~~~~~~~ 307 (451) T protein:vir:10 240 DLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFLKE----LKR--------YKTIKTETDSEGDSGGLKTMQIEIPTE 307 (451) T ss_pred HHHHHHHHHHHHHHHHhccceeeeecCCcccchhhHHH----Hhh--------CCeEEecCcCCccCCcceEEeecCCHH Confidence 66655544444445544555444432 12122222221 111 1223332 222333344333445 Q ss_pred HHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHh-------hhhhhc--cccc Q lcl|NC_021305. 272 QFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKY-------VGQYWV--RKNR 342 (518) Q Consensus 272 ~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~-------l~~~~~--~~~~ 342 (518) .+....+...+.|...-++|.. ... +.+|.......+.-..+.-.+...+..+... ++...+ .... T Consensus 308 ~~~~~~~~l~~~I~~~s~~p~~--~~~---~~gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~d~~~ 382 (451) T protein:vir:10 308 ARKIILEILKKQIYESGQGLQQ--DTE---NFGNASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLGVTDYKK 382 (451) T ss_pred HHHHHHHHHHHHHHHHhCcccc--ccc---ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccc Confidence 5667788888999999888842 111 1122222222222222211122211111111 111111 1234 Q ss_pred ceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCc Q lcl|NC_021305. 343 MKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPA 422 (518) Q Consensus 343 ~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~ 422 (518) +++.+..-+..|..+.++.+.++. |+++...+.++++.- +++. .+. ..+. . .+. ....+ . T Consensus 383 i~i~f~~~~p~n~~e~~~~~~kl~--g~iS~et~~~~~p~v--~d~~-~e~------~~~~---e----e~~-~~~~~-~ 442 (451) T protein:vir:10 383 IQQTYTRNMMSNDLEDADIATKSV--GIIPTKIILRHHPWV--DDVE-EAE------KLYL---E----EKK-IQASK-V 442 (451) T ss_pred eeEEecCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCC--CCHH-HHH------HHHH---H----HHH-HHHHH-H Confidence 556667778899999999999984 789988888887543 2211 000 0000 0 000 00000 0 Q ss_pred cCCCCCCCc Q lcl|NC_021305. 
423 STPVASLDQ 431 (518) Q Consensus 423 ~~~~~~~~~ 431 (518) ....++-++ T Consensus 443 ~~~~~~~~~ 451 (451) T protein:vir:10 443 SDDYNNFTE 451 (451) T ss_pred HhhcCCCCC Confidence 000000000 No 244 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=90.43 E-value=0.021 Score=29.79 Aligned_cols=402 Identities=10% Similarity=0.058 Sum_probs=173.2 Q ss_pred CcCCCCC--------------CCCcccccccchhhh-----hhhccc---ccccc----cccccchhhhHHHhhcHHHHH Q lcl|NC_021305. 1 MLLANGQ--------------TLSAPAMAELSPQMQ-----DSYYYA---PAVGM----QLERQFSLYGGIYKNQPWVRT 54 (518) Q Consensus 1 ~~f~~~~--------------~~~~~~~~~~~~~~~-----~~~~~~---~~~~~----~~~~~~~~~~~~~~~~~~v~~ 54 (518) |=|-.++ ++..|...+...-+. ...++. ...+. ++......++ ..+.+|.|.. T Consensus 1 ~~~w~~~de~~~~~~~~~~~~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR-~ma~~pEvd~ 79 (511) T protein:vir:56 1 MKFWTKEEEQDIQKIEKNPVRSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTIPVKELIKSYR-ALAEYHEVDD 79 (511) T ss_pred CCCccchhhhhhhhhccCCcccccCCCCCCCceEEecccccceecceeccccccccCccchHHHHHHHH-HHhhccchhh Confidence 2222222 111111111110110 001110 00000 0011112222 3467899999 Q ss_pred HHHHHHHhhccC-----ceEEEEecCCccee---ccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCC Q lcl|NC_021305. 55 VIAKRAQALARL-----PVKCMFTSGDTETE---ESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSG 126 (518) Q Consensus 55 ~v~~ia~~ia~l-----~~~v~~~~~~~~~~---~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G 126 (518) ||+.|.+.+.-+ |+.+--++-+-... .-...+..++.--+-...++ .+++.|.+.|..|+.++-+... 
T Consensus 80 Av~eIvne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~----~~fR~WYVDgRi~fHkiid~k~ 155 (511) T protein:vir:56 80 AIQEIVDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVVSLLQMRKHGY----KWFRKWYVDSRIYFHKILDKDN 155 (511) T ss_pred HHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhh----HHHhhhhhcceEEEEEEecccc Confidence 999999987543 23332111111111 01111111222122233344 4466788999999999887766 Q ss_pred ceEEEEeeCCceeEEEEcCC-----------ceeeEEeeecccc-----c-----CceeEEeccccEEEEeccCC---CC Q lcl|NC_021305. 127 TPEKLMPMHPSRVAIKRNSR-----------TGRYEYYFQAGAG-----V-----GTQLVSFADDEVVPIRFFNP---DG 182 (518) Q Consensus 127 ~~~~l~~l~p~~v~v~~~~~-----------~~~~~~~~~~~~~-----~-----~~~~~~~~~~evih~~~~~~---~~ 182 (518) .+.+|+.|+|..++.+.... +...+|.|...+. . ....+.++.+.|.|...--. .. T Consensus 156 GI~eLr~lDPr~i~~vr~i~~~~~~~~~v~~~~~ey~~Y~~~~~~~~~~~~~~~~~~~~vkI~~daI~y~hSGL~d~~~~ 235 (511) T protein:vir:56 156 NIIELRPLNPMKMELVREIQKETIDGVEVVKGTLEYYVYKQSDYKMPSWMSATNRAQTSFRIPKDAIVFAHSGLMRGCAD 235 (511) T ss_pred ceeehhhcCcccchhhhhhhcccccccccccceeeeeEecCCCcccCcccccccccccceeechhheeeecccceeccCC Confidence 79999999999887544221 1122222221110 0 12447788889976654322 22 Q ss_pred cccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccC-ccCCHHHHHHHHHHHHHHhcC----ccccCC------ Q lcl|NC_021305. 183 LERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSEAAQQRLREQFDRAHSG----SSNTGK------ 251 (518) Q Consensus 183 ~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~~~~~~~~~~~~~~~~g----~~n~g~------ 251 (518) ..+.+|-|..|...+.....++....-+--..+.-+=|+..+ +++.+...++....+...++. ....|. 
T Consensus 236 ~g~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk 315 (511) T protein:vir:56 236 DPYIIGYLDRAIKPANQLKMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQVKNTTN 315 (511) T ss_pred CCeeeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchh Confidence 335789999999988887777776666644445555454443 455544444433333222211 011121 Q ss_pred ee-ec----------CCCcceeeccC--ChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccc---cccCCHHHHHH---- Q lcl|NC_021305. 252 TM-VV----------EEGMEPIPLQL--TAVEMQFIEARQLNREEVCGVYDIAPPIVHILDR---ATFSNISAQMR---- 311 (518) Q Consensus 252 ~~-vl----------~~g~~~~~l~~--~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~---~~~sn~e~~~~---- 311 (518) .+ +| ..|.++..|.. +..+| +-..+..+.+..+++||.+-|+.-+. -+.+...+-.+ T Consensus 316 ~msMlEDyWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEiK 392 (511) T protein:vir:56 316 AMSMLEDYYLPRREGSKGTEVSTLPGGQSLGDI---EDVLYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDELK 392 (511) T ss_pred hhhhHhhhcccccCCCCccceeeccccCCcChH---HHHHHHHHHHHHHhCCCcccccCCCCccccccccchhhhHHHHH Confidence 11 11 13566666653 33343 34456678899999999998863321 12112222222 Q ss_pred --HHHHHHhhHHHHHHHHHHHHhhh-----hhhc--c-cccceecc--hh----hhhcC-HHHHHHHHHHHHh--CCCcC Q lcl|NC_021305. 312 --AFYRDTMAIPIARIQSAMDKYVG-----QYWV--R-KNRMKFDI--DD----VIQPD-WEAKSESTQKMVN--SGVAT 372 (518) Q Consensus 312 --~~~~~~l~P~~~~ie~~l~~~l~-----~~~~--~-~~~~~fd~--~~----l~~~d-~~~~~~~~~~~~~--~G~~T 372 (518) -|+..--.-+...+.+.|...|+ ++.+ . ...++|++ +. +.... +..|+.++..+-. 
.-++| T Consensus 393 F~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S 472 (511) T protein:vir:56 393 FTKFVKRLQTKFETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYS 472 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccc Confidence 22222223333334444444433 2111 1 12333433 21 11111 2223333333211 12446 Q ss_pred HHHHHH-HhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCc Q lcl|NC_021305. 373 PNEGRE-IMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQ 431 (518) Q Consensus 373 ~NE~R~-~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (518) .+=+++ .+.+...+-..-+ .+...+...+- -.+.. ++- T Consensus 473 ~~yi~k~ILr~tDeei~~~~---------------k~I~~E~k~~~-----~~~~e-~~f 511 (511) T protein:vir:56 473 HKYIQKNILRLSDDQITAMQ---------------SEIDEEETNPR-----FQQDD-QGF 511 (511) T ss_pred hHHHHHHHhccCHHHHHHHH---------------HHHHHhhcCCC-----CCCcc-cCC Confidence 666664 4443221100000 00000000000 00000 000 No 245 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=88.42 E-value=0.032 Score=28.75 Aligned_cols=407 Identities=11% Similarity=0.046 Sum_probs=151.8 Q ss_pred CcCCCCCCCCcc----cccccchhhhhhh----cccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhc------cC Q lcl|NC_021305. 1 MLLANGQTLSAP----AMAELSPQMQDSY----YYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALA------RL 66 (518) Q Consensus 1 ~~f~~~~~~~~~----~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia------~l 66 (518) |-|+.-+..-+. -....++|...+- ...+........+ ...+....... 
-.|++.+|..+- .- T Consensus 3 ~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~--~~~~~~~dstg-~~a~~~LAa~l~~~ltpp~~ 79 (517) T protein:vir:10 3 MRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYLMADVNDD--LSSQNAWQDDG-ASATNFLSNKLSQVLFPAQR 79 (517) T ss_pred ccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccccccCCCCC--ccccccccchH-HHHHHHHHHHHHHhhcCCCC Confidence 777654321110 0011233332211 1111111111111 11122333333 345666665552 23 Q ss_pred ceEEEEecCCccee---------ccchHH-------HHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEE Q lcl|NC_021305. 67 PVKCMFTSGDTETE---------ESDTGY-------AKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEK 130 (518) Q Consensus 67 ~~~v~~~~~~~~~~---------~~~~~~-------~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~ 130 (518) ||.=..-.+....+ .-..++ +.-+.+ -+++.-+..+..+++.+|++.+++. ..+.... T Consensus 80 ~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~----snf~~~~~~~~~~L~~~G~a~ly~~--~~~~~~~ 153 (517) T protein:vir:10 80 SFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGES----LQFRPAVVEAFKHLIVTGNVMMYHP--DKTSPIQ 153 (517) T ss_pred ccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHh----cCcHHHHHHHHHHHHhHCeEEEEEe--CCCCcEE Confidence 44322221110000 001111 112222 2455556677788999999987753 3334556 Q ss_pred EEeeCCceeEEEEcCCceeeEE--eee-----------------------------------ccccc---------Ccee Q lcl|NC_021305. 131 LMPMHPSRVAIKRNSRTGRYEY--YFQ-----------------------------------AGAGV---------GTQL 164 (518) Q Consensus 131 l~~l~p~~v~v~~~~~~~~~~~--~~~-----------------------------------~~~~~---------~~~~ 164 (518) .||+. .+.+..+..|..... +.. ..... ++.. T Consensus 154 ~~pl~--~y~v~~d~~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~~d~~~ 231 (517) T protein:vir:10 154 AVPLH--HYCVRRDNNGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKRTKDGKYLIRQSADDVP 231 (517) T ss_pred EEEcC--eEEEeeCCCcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEEEEeCCCceEEEEEeCcee Confidence 66663 344445444432100 000 00000 0100 Q ss_pred ------EEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHH Q lcl|NC_021305. 
165 ------VSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQ 238 (518) Q Consensus 165 ------~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~ 238 (518) +.+..-..+-+|....++..||.||..-++..+.......+.......-...|..++..++.+.+... T Consensus 232 ~~~~s~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~~~~~~~l------ 305 (517) T protein:vir:10 232 VGKESTVTEDKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVKPGSYTDINQF------ 305 (517) T ss_pred eccccccccccCCeeeeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccCcccccchhhc------ Confidence 00112234445555557788999999999999999998888888887777777777655554443221 Q ss_pred HHHHhcCccccCCeeec--CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH--HHHHHH Q lcl|NC_021305. 239 FDRAHSGSSNTGKTMVV--EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA--QMRAFY 314 (518) Q Consensus 239 ~~~~~~g~~n~g~~~vl--~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~--~~~~~~ 314 (518) ..+..+ .++- .+++...++.....=....+..+.....|..+|-+.. ++..+... -+.++ .+..-. T Consensus 306 ----~~~~~g---~~~~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~--l~~~~~~r-vTAtEV~~r~~E~ 375 (517) T protein:vir:10 306 ----VEGGSG---AVLHGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMMEA--MTRRDAER-VTAYEIQRDAMLV 375 (517) T ss_pred ----cCCCcc---ccccCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh--hhccCCcc-ccHHHHHHHHHHH Confidence 111111 1111 1233333333322222234566677788888886553 22222211 12322 233344 Q ss_pred HHHhhHHHHHHHHHHHHhhhhh-------hcccccceecchhhhhcCHHHHHH---HHHHH---HhCCCcCHHH-HHHHh Q lcl|NC_021305. 315 RDTMAIPIARIQSAMDKYVGQY-------WVRKNRMKFDIDDVIQPDWEAKSE---STQKM---VNSGVATPNE-GREIM 380 (518) Q Consensus 315 ~~~l~P~~~~ie~~l~~~l~~~-------~~~~~~~~fd~~~l~~~d~~~~~~---~~~~~---~~~G~~T~NE-~R~~~ 380 (518) ...|.|.+..+.++|-.-|+.. ......++.++... .....+.. .+... +.. 
+.-..+ +...+ T Consensus 376 ~~~LGpv~~rl~~Ell~Pli~r~~~~l~~~l~~~~v~~~~~s~--la~l~r~~~~~~i~~~~~~i~~-~a~~~~~~~~~i 452 (517) T protein:vir:10 376 EQSLGGVYSLFATTFQGPLARWFMNGISSILTSKNVSPTILTG--IEALGRMAELDKLGTFNGYVSM-TAQWPEPLQQAI 452 (517) T ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHhhhhcCCCCccceeecc--HHHHHHHHHHHHHHHHHHHHHH-hhcCChHHHhcC Confidence 5567777777776643332211 00111122221110 01111111 11111 110 000111 11111 Q ss_pred CCCCCCCCCcceeeecccccccccc----cccCCCCC----CCCCCCCCccCCCCCCCccccCCcc Q lcl|NC_021305. 381 GLPRSDDPKADELYANSALQPLGAT----PDGAVEWE----EAPAPKRPASTPVASLDQSPPTSVP 438 (518) Q Consensus 381 g~~p~~~~~gD~~~~~~n~~~~~~~----~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (518) +.+.+-+.-++.+=+|.+++.-+.. ...+...+ .....++....+..+ ++..+++.. T Consensus 453 d~d~~~~~~a~~~Gvp~~~irs~~ev~~~~~~~~~~~~~~~~~~~ag~~~~~~~~~-~~~~~~~~~ 517 (517) T protein:vir:10 453 KWPDFTDWVQGQISANFPFFKTQDELNAEAQAQQEQEATKYAAEQAGKAIPDMVKN-GQINPQGGQ 517 (517) T ss_pred CHHHHHHHHHHHhCCChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-CCCCCCCCC Confidence 1111100001111112111110000 00000000 000000000011111 111111111 No 246 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=86.83 E-value=0.043 Score=28.08 Aligned_cols=407 Identities=9% Similarity=-0.008 Sum_probs=149.0 Q ss_pred CcCCCC-CCCCcc---cccccchhhhhhhcccccc---cccc---------cccchhhhHHHhhc----HHHHHHHHHHH Q lcl|NC_021305. 1 MLLANG-QTLSAP---AMAELSPQMQDSYYYAPAV---GMQL---------ERQFSLYGGIYKNQ----PWVRTVIAKRA 60 (518) Q Consensus 1 ~~f~~~-~~~~~~---~~~~~~~~~~~~~~~~~~~---~~~~---------~~~~~~~~~~~~~~----~~v~~~v~~ia 60 (518) .-+.-- .+.+-| +.......+++..++.... |... 
..........++.- +++...++.++ T Consensus 28 ~~~~m~dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~ 107 (535) T protein:vir:80 28 LGPSLPNVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLDGMM 107 (535) T ss_pred CCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHHHHh Confidence 111111 001111 1111122233333332210 0000 00000112223333 33444444444 Q ss_pred HhhccCceEEEEecCCcceeccchHHHHHHhcCC-cCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCce----------- Q lcl|NC_021305. 61 QALARLPVKCMFTSGDTETEESDTGYAKLLADPC-EYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTP----------- 128 (518) Q Consensus 61 ~~ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN-~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~----------- 128 (518) ..+-+-|..+ . ..+.+..|+.... ...+..+|.+.++...+.+|.+++++.....|.. T Consensus 108 G~vfrk~p~~---------~-~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~ade~~~~~ 177 (535) T protein:vir:80 108 GQVFSRDPIR---------Q-LPPALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVLEQKLGLY 177 (535) T ss_pred chhhcCCcce---------e-ccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHHHHHhcCC Confidence 4333333211 1 1233444555444 3468999999999999999999999987655431 Q ss_pred -EEEEeeCCceeE----------------------EEEc-CCceeeE--Eee--------------eccccc---CceeE Q lcl|NC_021305. 129 -EKLMPMHPSRVA----------------------IKRN-SRTGRYE--YYF--------------QAGAGV---GTQLV 165 (518) Q Consensus 129 -~~l~~l~p~~v~----------------------v~~~-~~~~~~~--~~~--------------~~~~~~---~~~~~ 165 (518) -.+..+.|..|- ...+ ..+.... |++ ...... ..... T Consensus 178 rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~~~G~y~v~~~~~~~~~~~~~~~~~ 257 (535) T protein:vir:80 178 RPTITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLNAEGNYQVERWRRETQEEMYYSYSK 257 (535) T ss_pred CcEEEEechhhccCccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEecCCceEEEEEEEeecCCccccccce Confidence 112222221110 0111 1111110 100 000000 00000 Q ss_pred Eec---------cccEEEEeccCCCCcccCchHHHHHHHH-HHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHH Q lcl|NC_021305. 
166 SFA---------DDEVVPIRFFNPDGLERGLSLMESLKST-IFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRL 235 (518) Q Consensus 166 ~~~---------~~evih~~~~~~~~~~~G~s~l~~~~~~-i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~ 235 (518) .++ .=.++++- ...++...|.||+..+... +.+......+...+ ...+.|-.+++-.....++. . T Consensus 258 ~~~~~~g~~~l~~IPfv~~~-~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~il-~~~~~P~l~i~G~~~~~~~~---~ 332 (535) T protein:vir:80 258 HVPTDGNGNPFKEIPFQFIG-PLDNNADIDHPPLLDLCEVNIGHYRNSADYEEMA-FVAGQPTAFFTGLTKDWVED---V 332 (535) T ss_pred eecccCCCcccCeeEEEEee-cCCCCCCCCccchHHHHHHHHHHhhchhHHHHHH-HHhcCceeeeecCchhhhhc---C Confidence 000 00122222 2234455688888776654 44444444444333 34456666664222111110 0 Q ss_pred HHHHHHHhcCccccCCeeecCCCc--ceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHH--H Q lcl|NC_021305. 236 REQFDRAHSGSSNTGKTMVVEEGM--EPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM--R 311 (518) Q Consensus 236 ~~~~~~~~~g~~n~g~~~vl~~g~--~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~--~ 311 (518) .+ -....-|+ ...+.++.+. +|.+++.+.-..+.++ ....++++ .|. .++. ...++. +..+.. . T Consensus 333 ~~-~~~i~iG~---~~~~~lP~~~~~~~~e~~~~~~a~~~l~---~~e~qM~~-lGa--~ll~-~~~~~~-Ta~~a~~~~ 400 (535) T protein:vir:80 333 FK-DFKVHLGS---RAIIPLPQGATAGILQITPNSVPFEAMT---HKESQMIA-MGA--NLLV-KSGGNR-TFGEAQQEE 400 (535) T ss_pred CC-CcceEecC---cccccCCCCCCcceeeeccchhHHHHHH---HHHHHHHH-HHH--Hhhc-cCcccc-cHHHHHHHH Confidence 00 00012232 2345677664 4444443333333222 22333322 221 1121 111121 122222 2 Q ss_pred HHHHHHhhHHHHHHHHHHHHhhh--hhhc------ccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Q lcl|NC_021305. 312 AFYRDTMAIPIARIQSAMDKYVG--QYWV------RKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLP 383 (518) Q Consensus 312 ~~~~~~l~P~~~~ie~~l~~~l~--~~~~------~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~ 383 (518) .-....|.-++..++++++..|- ..+. 
....++++.+.....-.......+-++++.|.++....++.+..- T Consensus 401 ~~~~S~L~~~a~~le~al~~aL~~~A~w~G~~~~~~~~~i~~n~dF~~~~ld~~~~~all~~~~~G~Is~et~~~~L~r~ 480 (535) T protein:vir:80 401 ASEQSILSACTKNVSMAFRKALRWANQFQTGIVNDETVEYNLNTDFPAARLTPNERAELILEWQQGAITFKEMRAGLRRA 480 (535) T ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCceEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhC Confidence 22245577778888888876542 2221 112233443433333223346667788889999998888776332 Q ss_pred CCC---CCCcceee-ecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCcc Q lcl|NC_021305. 384 RSD---DPKADELY-ANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVP 438 (518) Q Consensus 384 p~~---~~~gD~~~-~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (518) -+- ..+-++.- +......+....+.........++.++.+.-..+. |++.. T Consensus 481 gvl~~~~~~eee~~ri~~E~~~~~~~~g~~~d~~~~g~~~~~~~~~~~~~----~~~~~ 535 (535) T protein:vir:80 481 GVASEDDAKAETEGKATVEFIAKTAAAGKVGDAASGGTNKAKLNNGNGGG----NQAGN 535 (535) T ss_pred CCCCcccchHHHHHHHHhhhhhccccCCCCCCCCCCCCCcCcccCCcccc----ccCCC Confidence 111 11111110 00000000000000000011111111111111111 11111 No 247 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=85.31 E-value=0.054 Score=27.54 Aligned_cols=400 Identities=11% Similarity=0.070 Sum_probs=139.8 Q ss_pred CcCCCCCCCCcccccccchhhhhh---hcc--cccccccccccch--hh---hHHHhhcHHHHHHHHHHHHhhccCceEE Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDS---YYY--APAVGMQLERQFS--LY---GGIYKNQPWVRTVIAKRAQALARLPVKC 70 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~---~~~--~~~~~~~~~~~~~--~~---~~~~~~~~~v~~~v~~ia~~ia~l~~~v 70 (518) -+|..-.. .....+.......+. +.. .....++..+-+. +. 
.......+..++-|+..=..+ T Consensus 43 ~~~~~~~~-~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~v------- 114 (515) T protein:vir:70 43 YLMNNKGD-NETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARV------- 114 (515) T ss_pred cccCCCCC-cccccccccchHHHHHHHHHHHHHHhhcCCCCcccccccChhhhhccccchhHHHHHHHHHHHH------- Confidence 55532111 000111111100000 000 0000000000000 00 000011111111111100000 Q ss_pred EEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceee Q lcl|NC_021305. 71 MFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRY 150 (518) Q Consensus 71 ~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~ 150 (518) . ..++.-+.+- +++.-+..+..+++.+|++.+++.. .+ +...||+ ..+.+..+..|... T Consensus 115 -----------e-~~~~~~l~~s----nf~~~~~~~~~~L~~~G~a~l~~d~--~~-~~~~~pl--~~y~v~~d~~G~v~ 173 (515) T protein:vir:70 115 -----------E-TTAMKALEQR----QFRPAIVEVFKHLIVAGNCLLYKPS--KG-AMSAVPM--HHYVVNRDTNGDLM 173 (515) T ss_pred -----------H-HHHHHHHHhc----CchHHHHHHHHHHHhHCeEEEEEeC--CC-CeEEEEc--CeEEEeeCCCcCee Confidence 0 1111112222 3444455566778899999887742 22 2445665 33444444444221 Q ss_pred EEe-------------------------------------eeccc---------ccCceeE------EeccccEEEEecc Q lcl|NC_021305. 151 EYY-------------------------------------FQAGA---------GVGTQLV------SFADDEVVPIRFF 178 (518) Q Consensus 151 ~~~-------------------------------------~~~~~---------~~~~~~~------~~~~~evih~~~~ 178 (518) ... ..... ..++..+ .|..-..+-+|.. T Consensus 174 ~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~~~~~e~d~~~~~~es~y~~~e~P~~~~Rw~ 253 (515) T protein:vir:70 174 DVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFWKINQSADDIPVGKESRIKSEKLPFIPLTWK 253 (515) T ss_pred EEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEEecCCCceEEEEecCceeeccccccccccCCceeeeee Confidence 000 00000 0000000 0111233444555 Q ss_pred CCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecC-- Q lcl|NC_021305. 
179 NPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVE-- 256 (518) Q Consensus 179 ~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~-- 256 (518) ..+|..||.||..-++..+.......+.......-...|..++..++...+.. + ..|.. +.++.. T Consensus 254 ~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~~g~~~~~~-------l---~~~~~---g~iv~g~~ 320 (515) T protein:vir:70 254 RSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIRPGSQTDVDH-------F---VNSGT---GEVITGVA 320 (515) T ss_pred ecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCcccccchhh-------c---cccCC---ceeecCCc Confidence 55677899999999999999999999888888888888877776555444322 1 11111 122222 Q ss_pred CCcceeeccCChhhHH-HHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHHHHHhhHHHHHHHHHHHHhhhh Q lcl|NC_021305. 257 EGMEPIPLQLTAVEMQ-FIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIARIQSAMDKYVGQ 335 (518) Q Consensus 257 ~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~ 335 (518) +++...+++.. .+.+ ..+..+.....|..+|-+........++.|-.-+ ..+..-....+.|.+..+.++|-.-|+. T Consensus 321 ~~v~~~~~~~~-~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rvTAtEV-~~r~~E~~~~LGpv~srL~~Ell~Pli~ 398 (515) T protein:vir:70 321 EDIHIVQLGKY-ADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVTAVEI-QRDALEIEQNMGGVYSLFAMTMQTPIAM 398 (515) T ss_pred ccceeeecCcc-cchhHHHHHHHHHHHHHHHHHhhhhhhccCCccccHHHH-HHHHHHHHHHhhHHHHHHHHHHHHHHHH Confidence 33334343332 2333 3355667778888888776433333232221111 2333445557788888887777655532 Q ss_pred hhccc-------ccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeecccccc-cccccc Q lcl|NC_021305. 336 YWVRK-------NRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQP-LGATPD 407 (518) Q Consensus 336 ~~~~~-------~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~-~~~~~~ 407 (518) ....+ ..++.+. +.......+......+.+ +.+-+=-....+| +-.|.+ |... ++.... 
T Consensus 399 r~~~~~~p~~P~~~v~~~~--vs~l~~L~r~q~~~~i~~----~~q~i~~~~~~~p---~~~~~i----d~d~~~~~~a~ 465 (515) T protein:vir:70 399 WGLQEAGDSFTSELVDPVI--VTGIEALGRMAELDKLAN----FAQYMSLPQTWPE---PAQRAI----RWGDYMDWVRG 465 (515) T ss_pred HHHHhhCCCCChhhcccce--ehhHHHHHHHHHHHHHHH----HHHHHHHHhccCh---hHHhhC----CHHHHHHHHHH Confidence 11000 0011110 011111111111111100 0000000011111 011111 0000 000000 Q ss_pred cCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcccCCchhhHHHHHHHHHh Q lcl|NC_021305. 408 GAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAVKGA 479 (518) Q Consensus 408 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~ 479 (518) ..+.|.. ....+.........+.+.+.+.+.+.++++-.-.-..+..+.. T Consensus 466 ---------~~g~p~~-------------~~rs~eev~~~r~q~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 515 (515) T protein:vir:70 466 ---------QISAELP-------------FLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMKEG 515 (515) T ss_pred ---------HhCCCcc-------------ccCCHHHHHHHHHHHHHHHHHHHHHHhhhhhcccchhhhhccC Confidence 0000000 0000000111112222222222222222211111111111111 No 248 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=83.34 E-value=0.069 Score=26.94 Aligned_cols=398 Identities=10% Similarity=0.037 Sum_probs=148.6 Q ss_pred CcCCC------CCC---CCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhc------c Q lcl|NC_021305. 1 MLLAN------GQT---LSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALA------R 65 (518) Q Consensus 1 ~~f~~------~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia------~ 65 (518) |-... .++ +=...++..+.++....+..+. ............... ..|++.+|..+- . 
T Consensus 1 mk~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~------~~~~~~~~~~~dstg-~~a~~~Laa~l~~~ltpp~ 73 (542) T protein:vir:78 1 MKGLAQARYSAMRADREDFLDMARRCAALTLPYLLTEDG------HASGGRLQQPYQSLG-SKGVNALSSKLMLSLFPIQ 73 (542) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCCC------CcccccccccccchH-HHHHHHHHHHHHHhhcCCC Confidence 33211 111 0011111222222111111110 000011112233333 345566665552 3 Q ss_pred CceEEEEecCCc-------ceec---cch-------HHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCce Q lcl|NC_021305. 66 LPVKCMFTSGDT-------ETEE---SDT-------GYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTP 128 (518) Q Consensus 66 l~~~v~~~~~~~-------~~~~---~~~-------~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~ 128 (518) -||.=....+.. ..+. -.. ..+.-+.+- +++.-+..+..+++.+|++++++..+ + T Consensus 74 ~~WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s----nf~~~~~~~~~~L~~~G~a~l~~~~~----~ 145 (542) T protein:vir:78 74 TSFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAES----SDRVQLTAAMKHLIVTGNVLVFAGKK----T 145 (542) T ss_pred CccccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhc----CcHHHHHHHHHHHHhhCeEEEEecCC----C Confidence 344322221110 0000 011 111222222 45555667788899999999886543 2 Q ss_pred EEEEeeCCceeEEEEcCCceeeEE-------------------------------------------------------- Q lcl|NC_021305. 129 EKLMPMHPSRVAIKRNSRTGRYEY-------------------------------------------------------- 152 (518) Q Consensus 129 ~~l~~l~p~~v~v~~~~~~~~~~~-------------------------------------------------------- 152 (518) ...||+. .+.+..+..|..... T Consensus 146 ~~~~pl~--~y~v~~d~~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~~~ 223 (542) T protein:vir:78 146 LKVYPLD--RYVIERDGDGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCKL 223 (542) T ss_pred ceEEecc--eeEEeeCCCCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCcccccccc Confidence 3344442 233333333321100 Q ss_pred -----eeec--ccc-cCc--eeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccc Q lcl|NC_021305. 
153 -----YFQA--GAG-VGT--QLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLR 222 (518) Q Consensus 153 -----~~~~--~~~-~~~--~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~ 222 (518) .++. .+. ..+ ....|..-..+-.|....++..||.||..-++..+.......+.......-...|..++. T Consensus 224 ~~~~~s~~~e~~g~~v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~ 303 (542) T protein:vir:78 224 VDGQHRWHQECDGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVS 303 (542) T ss_pred CCCeEEEEEEeccccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec Confidence 0000 000 000 001122234455555556778899999999999999999999999999888888887776 Q ss_pred cCccCCHHHHHHHHHHHHHHhcCccccCCeeecC--CCcceeeccCChhhHH-HHHHHHHHHHHHHHHhcCCHHHhcccc Q lcl|NC_021305. 223 HEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVE--EGMEPIPLQLTAVEMQ-FIEARQLNREEVCGVYDIAPPIVHILD 299 (518) Q Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~--~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVPp~~lg~~~ 299 (518) .++.+.+... ..+.. +.++.. +++...++.... +.+ ..+..+.....|..+|-+. . .. T Consensus 304 ~~g~~~~~~~----------~~~~~---g~iv~g~~~~v~~~~~~~~~-~~~~~~~~i~~~~~rI~~aFl~~----~-~~ 364 (542) T protein:vir:78 304 PSATTKPQSL----------ARAGT---GAIIQGRAEDVSVVQANKGA-DFRTVQEMIRDLSQRISDAFLIL----N-VR 364 (542) T ss_pred cccccchhhc----------ccCCC---ceeecCCccceeeeeccccc-chhHHHHHHHHHHHHHHHHhccc----c-cC Confidence 6554443321 11111 112222 333344444332 332 3455666677788888432 1 11 Q ss_pred ccccCCHHH--HHHHHHHHHhhHHHHHHHHHHHHhhh-------------hhhcccccceecchhhh-----hcCHHHHH Q lcl|NC_021305. 300 RATFSNISA--QMRAFYRDTMAIPIARIQSAMDKYVG-------------QYWVRKNRMKFDIDDVI-----QPDWEAKS 359 (518) Q Consensus 300 ~~~~sn~e~--~~~~~~~~~l~P~~~~ie~~l~~~l~-------------~~~~~~~~~~fd~~~l~-----~~d~~~~~ 359 (518) ++..-+.++ .+..-....|.|.+..++++|-.-|+ ++.... .+++.+...+ ..+..... 
T Consensus 365 d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~-lv~~~~~s~La~~~r~~~~~~l~ 443 (542) T protein:vir:78 365 QSERTTATEVREVQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSLPKG-LVMPTVVAGLGGVGRGEDRAALI 443 (542) T ss_pred CcccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchh-ceeeeeechHHHHHHHHHHHHHH Confidence 222223332 23334445667777776666554332 221111 1333322211 11111111 Q ss_pred HHHHHHHhC-C------CcCHHHH----HHHhCCCCCCCCCcceeeeccccccc----------ccccccCCCCCCCCCC Q lcl|NC_021305. 360 ESTQKMVNS-G------VATPNEG----REIMGLPRSDDPKADELYANSALQPL----------GATPDGAVEWEEAPAP 418 (518) Q Consensus 360 ~~~~~~~~~-G------~~T~NE~----R~~~g~~p~~~~~gD~~~~~~n~~~~----------~~~~~~~~~~~~~~~~ 418 (518) ..+..+-+. | .+..+++ .+.+|.|+.. .+..+.-+... .....+.......+.. T Consensus 444 ~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gvp~~~-----i~~s~e~~~~~~~q~q~~~~~~al~~~a~~~a~~~~~ 518 (542) T protein:vir:78 444 EFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGIDTLN-----LVKSPETMANEAQQAQQQQMTASLMGQAGQLAKSPIG 518 (542) T ss_pred HHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCCCHhh-----ccCCHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Confidence 111111110 0 1222222 2233443210 00000000000 0000000000000000 Q ss_pred CCCccCCCCCCCcccc--CCcccc Q lcl|NC_021305. 419 KRPASTPVASLDQSPP--TSVPGL 440 (518) Q Consensus 419 ~~~~~~~~~~~~~~~~--~~~~~~ 440 (518) +....+..++++..|. .+..+. T Consensus 519 ~~~~~~~~a~~~~~~~~~~~~~~~ 542 (542) T protein:vir:78 519 EKMMQQINAPGQEAPAGPQTGEDL 542 (542) T ss_pred cchhhhcCCCCcCCCCCCcccccC Confidence 1111111111111110 011111 No 249 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=81.53 E-value=0.085 Score=26.46 Aligned_cols=414 Identities=9% Similarity=-0.036 Sum_probs=146.9 Q ss_pred CcCCCCCCCC---cccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhh-ccC----ceEEEE Q lcl|NC_021305. 
1 MLLANGQTLS---APAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQAL-ARL----PVKCMF 72 (518) Q Consensus 1 ~~f~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~i-a~l----~~~v~~ 72 (518) -+|.+.++.. ...++..+.++....+ .................... .|++.+|..+ +.| ||.=.. T Consensus 15 ~r~~~lk~~R~~~e~~w~e~~~~~lP~~~------~~~~~~~~~~~~~~~dst~~-~a~~~Laa~l~~~ltP~~~WFrl~ 87 (536) T protein:vir:21 15 SVYERLKNDRAPYETRAQNCAQYTIPSLF------PKDSDNASTDYQTPWQAVGA-RGLNNLASKLMLALFPMQTWMRLT 87 (536) T ss_pred HHHHHHHHHhhHHHHHHHHHHHHhccccc------CCCCCcccccccccccccHH-HHHHHHHHHHHHhhcCCCcccccc Confidence 1111111100 1111111111111111 10000111111123333333 3555555544 222 332221 Q ss_pred ecCCccee----------c------cchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCC Q lcl|NC_021305. 73 TSGDTETE----------E------SDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHP 136 (518) Q Consensus 73 ~~~~~~~~----------~------~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p 136 (518) ..+....+ . -...++..+.+- +++.-+..+..+++.+|++.+++..+..+.+..+..++- T Consensus 88 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s----nf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~f~~~pl 163 (536) T protein:vir:21 88 ISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESN----SYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRL 163 (536) T ss_pred cChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhc----CcHHHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEEEEc Confidence 11111000 0 011222233333 344555666788999999999987665544443333333 Q ss_pred ceeEEEEcCCceeeEE-----------------------------------eeeccccc----------CceeE------ Q lcl|NC_021305. 137 SRVAIKRNSRTGRYEY-----------------------------------YFQAGAGV----------GTQLV------ 165 (518) Q Consensus 137 ~~v~v~~~~~~~~~~~-----------------------------------~~~~~~~~----------~~~~~------ 165 (518) ..+.+..+..|..... ........ 
.+..+ T Consensus 164 ~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~g~~v~~~~g~ 243 (536) T protein:vir:21 164 SSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYLRYEEVEGMEVQGSDGT 243 (536) T ss_pred CeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEecCCCcEEEEeccCCeeeccccCc Confidence 5555555554422100 00000000 00100 Q ss_pred -EeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhc Q lcl|NC_021305. 166 -SFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHS 244 (518) Q Consensus 166 -~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~ 244 (518) .|..-.++.+|....++..||.||..-++..+.......+.......-...|...+..++.+.+.. + .. T Consensus 244 ~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~-------~---~~ 313 (536) T protein:vir:21 244 YPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRR-------L---TK 313 (536) T ss_pred cccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhh-------h---cc Confidence 112234566666666788899999999999999998888888777666666666655444433322 1 11 Q ss_pred CccccCCee-ecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH--HHHHHHHHHhhHH Q lcl|NC_021305. 245 GSSNTGKTM-VVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA--QMRAFYRDTMAIP 321 (518) Q Consensus 245 g~~n~g~~~-vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~--~~~~~~~~~l~P~ 321 (518) +.. |.++ -..+.+...++...+.-.-..+..+.....|..+|-+.. +...+. ..-+.++ .+..-....|.|. 
T Consensus 314 ~~~--g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~--l~~~~~-~r~TAtEV~~r~~E~~~~LG~v 388 (536) T protein:vir:21 314 AQT--GDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS--AVQRTG-ERVTAEEIRYVASELEDTLGGV 388 (536) T ss_pred CCC--cceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhh--cccCCC-CCccHHHHHHHHHHHHHHhhHH Confidence 111 1111 122334444554433322234566677788888885542 211111 1123322 2233334455555 Q ss_pred HHHHHHHHHHhh-------------hhhhccc-ccceecchhhh---h-cCHHHHHHHHHHHHhCC------CcCHHHHH Q lcl|NC_021305. 322 IARIQSAMDKYV-------------GQYWVRK-NRMKFDIDDVI---Q-PDWEAKSESTQKMVNSG------VATPNEGR 377 (518) Q Consensus 322 ~~~ie~~l~~~l-------------~~~~~~~-~~~~fd~~~l~---~-~d~~~~~~~~~~~~~~G------~~T~NE~R 377 (518) +..+.++|-.-| +++.... ...++ .+.+- + .+......++..+.+.+ .+..+++- T Consensus 389 ~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~-vs~l~~l~r~~~~~~l~~~~~~la~~~Pe~ld~~id~d~~~ 467 (536) T protein:vir:21 389 YSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTI-STGLEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIK 467 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceE-EecHHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHH Confidence 555555554333 2221111 12222 11121 1 11111122222111111 12222222 Q ss_pred ----HHhCCCCCCCCCcceeeecccccccccccc-cCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHH Q lcl|NC_021305. 378 ----EIMGLPRSDDPKADELYANSALQPLGATPD-GAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSG 452 (518) Q Consensus 378 ----~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (518) +.+|.+|.. .+..+.....+-.... .+...+.+........ ..... T Consensus 468 ~~~a~~~Gv~p~~-----~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~------------------------~~~~~ 518 (536) T protein:vir:21 468 LRIANAIGIDTSG-----ILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMA------------------------AQATA 518 (536) T ss_pred HHHHHHcCCChhh-----hcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------------HHHhc Confidence 122332210 0000000000000000 0000000000000000 00000 Q ss_pred HHHHHHhhcccCCchhhH Q lcl|NC_021305. 
453 KTEPRRLMQKPPPKESSP 470 (518) Q Consensus 453 ~~~~~~~~~k~~~~~~~~ 470 (518) ..+....+.-+.|.+-.- T Consensus 519 ~~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:21 519 SPEAMAAAADSVGLQPGI 536 (536) T ss_pred ChhhHHhhhhccccCCCC Confidence 000000000000000000 No 250 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=80.29 E-value=0.096 Score=26.16 Aligned_cols=382 Identities=12% Similarity=0.052 Sum_probs=135.8 Q ss_pred CcCCCCCCCCcccccccchhhhhh-h--ccc--ccccccccccchhhh-HHH--------hhcHHHHHHHHHHHHhhccC Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDS-Y--YYA--PAVGMQLERQFSLYG-GIY--------KNQPWVRTVIAKRAQALARL 66 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~-~--~~~--~~~~~~~~~~~~~~~-~~~--------~~~~~v~~~v~~ia~~ia~l 66 (518) -+|..-... ....+...+.-... . ... ....++..+-+.... +.. .....|..-...+. T Consensus 44 ~~~~~~~~~-~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve------ 116 (516) T protein:vir:96 44 YLMNDKGDN-ETSQNGWQGVGAQATNHLANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVE------ 116 (516) T ss_pred cccCCCCCc-cccCCcccchHHHHHHHHHHHHHhhhcCCCCcccccccChhHHhhccccCchhHHHHHHHHHHH------ Confidence 444321110 00001111000000 0 000 000000000000000 000 00001111111111 Q ss_pred ceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCC Q lcl|NC_021305. 67 PVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSR 146 (518) Q Consensus 67 ~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~ 146 (518) ..++..+.+- +++.-+..+..+++.+|++.+++.. .+ ....||+ ..+.+..+.. 
T Consensus 117 -----------------~~~~~~l~~s----nf~~~~~~~~~~L~~~G~a~l~~d~--~~-~~~~~pl--~~y~v~~d~~ 170 (516) T protein:vir:96 117 -----------------TRAMKELEQR----QFRPAVVEAFKHLIVAGSCMLYKPS--KG-AISAIPM--HHYVVNRDTN 170 (516) T ss_pred -----------------HHHHHHHHhc----CcHHHHHHHHHHHHhHCeEeEEecC--CC-CEEEEEc--CeEEEeeCCC Confidence 1111222222 3444455567788899999887643 32 2445555 3344444444 Q ss_pred ceeeEEe-------------------------------------eecccc---------cCceeE------EeccccEEE Q lcl|NC_021305. 147 TGRYEYY-------------------------------------FQAGAG---------VGTQLV------SFADDEVVP 174 (518) Q Consensus 147 ~~~~~~~-------------------------------------~~~~~~---------~~~~~~------~~~~~evih 174 (518) |...... ...... .++..+ .|..-..+- T Consensus 171 G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~~d~~~~~~es~~~~~e~P~~~ 250 (516) T protein:vir:96 171 GDLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYLGDGFWELKQSADDIPVGKVSKIKSEKLPFIP 250 (516) T ss_pred CCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeeeCCceeEEEEEeCceeeccccccccccCCeee Confidence 4221000 000000 001000 011223455 Q ss_pred EeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeee Q lcl|NC_021305. 175 IRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMV 254 (518) Q Consensus 175 ~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~v 254 (518) +|....++..||.||..-++..+.......+.......-...|..++..++.+.+.. ...|.. | .++ T Consensus 251 ~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~----------l~~~~~--g-~i~ 317 (516) T protein:vir:96 251 LTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDH----------FVNSGT--G-EVV 317 (516) T ss_pred eeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccchhh----------hccCCC--c-eee Confidence 555556778899999999999999999888888887777777776665544443322 111111 1 222 Q ss_pred cC--CCcceeeccCChhhHH-HHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH--HHHHHHHHHhhHHHHHHHHHH Q lcl|NC_021305. 
255 VE--EGMEPIPLQLTAVEMQ-FIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA--QMRAFYRDTMAIPIARIQSAM 329 (518) Q Consensus 255 l~--~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~--~~~~~~~~~l~P~~~~ie~~l 329 (518) -. +++...+++.. .+.+ ..+..+.....|..+|-+..... .+.... +.++ .+..-....|.|.+..+.++| T Consensus 318 ~g~~~~v~~~q~~~~-~d~~~~~~~i~~~~~rI~~af~~~~l~~--r~~~rv-TAtEV~~r~~E~~~~LGpv~~rl~~El 393 (516) T protein:vir:96 318 TGVEEDIHIVQLGKY-ADLTPISAVLEVYTRRIGVVFMMETMTR--RDAERV-TAVEIQRDALEIEQNMGGVYSLFATTM 393 (516) T ss_pred cCCcccceeeecCcc-cchhHHHHHHHHHHHHHHHHHhhhhhcc--CCCccc-cHHHHHHHHHHHHHHhhhHHHHHHHHH Confidence 22 23333333332 2232 33556667778888886543222 122122 2332 233445556788888877776 Q ss_pred HHhhhhhh----cc---cccceecchhhhhcCHHHHHHHHHHHHhC----C-Cc-CHHHHHHHhCCCCCCCCCcceeeec Q lcl|NC_021305. 330 DKYVGQYW----VR---KNRMKFDIDDVIQPDWEAKSESTQKMVNS----G-VA-TPNEGREIMGLPRSDDPKADELYAN 396 (518) Q Consensus 330 ~~~l~~~~----~~---~~~~~fd~~~l~~~d~~~~~~~~~~~~~~----G-~~-T~NE~R~~~g~~p~~~~~gD~~~~~ 396 (518) -.-|+... .. ...++... +.......++.....+.+. | +. -.-++...++.+.+-+.-++.+=+| T Consensus 394 l~Pli~r~l~~~~p~lp~~~v~~~~--vs~l~~l~r~~~~~~i~~~~~~i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp 471 (516) T protein:vir:96 394 QSPVAMWGLLEAGESFTSDLVDPVI--ITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAE 471 (516) T ss_pred HHHHHHHHHHhcCCCCcccccccee--echHHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHhCCC Confidence 54443210 00 01111111 1111112222111111110 0 00 0011222222111100000000011 Q ss_pred ccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcch-------hhHHHHHHHHhhcccCCchh Q lcl|NC_021305. 397 SALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRS-------TDSGKTEPRRLMQKPPPKES 468 (518) Q Consensus 397 ~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~k~~~~~~ 468 (518) .+++. ...+.+.....+. ......++....+++.++|. 
T Consensus 472 ~~~ir----------------------------------s~eev~~~~~~~~~~q~~~~~a~~~~~~~~~~~~~~~~~~ 516 (516) T protein:vir:96 472 LPFLK----------------------------------SAEEMAQEQEAQMQAQQAQMLEEGVAKAVPGVIQQELKEA 516 (516) T ss_pred ccccC----------------------------------CHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHhhcccccC Confidence 10000 0000000000000 00011111122334444444 No 251 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=79.69 E-value=0.1 Score=26.02 Aligned_cols=404 Identities=13% Similarity=0.058 Sum_probs=151.5 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccc---cccccc------cch---hhhHHHhhcHHHHHHHHHHHHhhccCce Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAV---GMQLER------QFS---LYGGIYKNQPWVRTVIAKRAQALARLPV 68 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~------~~~---~~~~~~~~~~~v~~~v~~ia~~ia~l~~ 68 (518) |=.=.-+.+.--+.......+++..++.... |...-+ ... .....++.-...+-.+..+.+.+..+.| T Consensus 1 m~~V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~G~vf 80 (501) T protein:vir:95 1 MPNVSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLVGQVF 80 (501) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHhhhhh Confidence 3210000011111211222334444443211 000000 000 1122233333333333333333333333 Q ss_pred EEEEecCCcceeccchHHHHHHhcCC-cCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCce--------------EEEEe Q lcl|NC_021305. 69 KCMFTSGDTETEESDTGYAKLLADPC-EYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTP--------------EKLMP 133 (518) Q Consensus 69 ~v~~~~~~~~~~~~~~~~~~L~~~PN-~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~--------------~~l~~ 133 (518) +- . . ..+ ....+..|+.... ...+..+|.+.++...+.+|.+++++.....+.. -.+.. 
T Consensus 81 ~k---~-p-~~~-~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~~~~rPy~~~ 154 (501) T protein:vir:95 81 MR---D-P-VVK-VPALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEAGRIRPTLYV 154 (501) T ss_pred cC---C-c-cee-CcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHhccCCcEEEE Confidence 21 0 0 001 1223334554444 3468999999999999999999999976543210 11333 Q ss_pred eCCceeE----------------------EEEcC-Cc---------------eeeEEe-eeccccc---------Cc--- Q lcl|NC_021305. 134 MHPSRVA----------------------IKRNS-RT---------------GRYEYY-FQAGAGV---------GT--- 162 (518) Q Consensus 134 l~p~~v~----------------------v~~~~-~~---------------~~~~~~-~~~~~~~---------~~--- 162 (518) +.|..|- ...++ .+ +.+.+. +...... +. T Consensus 155 ~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~~~~~~~~~~~ 234 (501) T protein:vir:95 155 YSPTEIINWRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKADGSKIPKGNYQQ 234 (501) T ss_pred ecHhhhcCcceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCcccCcceecCCcccc Confidence 3332221 00010 00 000011 1000000 00 Q ss_pred -eeE--------EeccccEEEEeccCCCCcccCchHHHHHHHH-HHHHHHHHHHHHHHHHccCCcccccccCccCCHHHH Q lcl|NC_021305. 163 -QLV--------SFADDEVVPIRFFNPDGLERGLSLMESLKST-IFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQ 232 (518) Q Consensus 163 -~~~--------~~~~~evih~~~~~~~~~~~G~s~l~~~~~~-i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~ 232 (518) ... .+..=.++++ +...++...|.||+..++.. +.+......+...+ ...+.|-.+++-. +++.. T Consensus 235 ~~~~~~~~~g~~~l~~IPfv~~-~~~~~~~~~~~pPLl~lA~lni~hy~~ssd~~~~l-~~~~~P~l~i~G~---~~~~~ 309 (501) T protein:vir:95 235 YVVYKPTDAQGKRLTEIPFMFI-GSENNDSNPDNPNFYDLASLNMAHYRNSADYEESC-YIVGQPTPVLIGL---TEEWV 309 (501) T ss_pred cceeeeeccCCCcCCeeeEEEE-ecCCCCCCCCccchHHHHHHHHHHHhhhhHHHHHH-HHcccceeeeeCC---ccccc Confidence 000 0000011222 12223444577888776643 44444434444333 3445666665421 21110 Q ss_pred HHHHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH--HH Q lcl|NC_021305. 
233 QRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA--QM 310 (518) Q Consensus 233 ~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~--~~ 310 (518) +.... ....-|+ ...+.++.|.++.-+..++.... .+..+...+++.. .| ..++... ..+ .+.++ .. T Consensus 310 ~~~~~--~~i~~G~---~~~~~lP~~~~~~~ie~~~~~i~-~~~l~~l~~~m~~-~G--a~ll~~~-~~~-~Ta~~~~~~ 378 (501) T protein:vir:95 310 TNVLK--GSVNFGS---RGGIPLPVGADAKLLQASENTML-KEAMDTKERQMVA-LG--AKLVEQK-EVQ-RTATEAELE 378 (501) T ss_pred ccCCC--Cceeecc---cccccCCCCCceeEEecChhhHH-HHHHHHHHHHHHH-HH--HhhccCC-ccc-hhHHHHHHH Confidence 00000 0111232 23456776655544444443332 2333333333332 23 2233211 111 22222 22 Q ss_pred HHHHHHHhhHHHHHHHHHHHHhhh--hhh----cccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Q lcl|NC_021305. 311 RAFYRDTMAIPIARIQSAMDKYVG--QYW----VRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPR 384 (518) Q Consensus 311 ~~~~~~~l~P~~~~ie~~l~~~l~--~~~----~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p 384 (518) ..-.+..|.-++..++++++..|- ..+ .....++++.+..........++++-+++..|.++..+.++.+-.-- T Consensus 379 ~~~~~S~L~~~a~~le~al~~~l~~~a~w~g~~~~~~~v~i~~df~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~~ 458 (501) T protein:vir:95 379 AASEGSTLSSATKNVSAAFEWALKWAARWVGQADSGVKFELNTDFDIARMTPDERRSLVEEWQKGAITFEEMRTGLRKAG 458 (501) T ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecccccccCCHHHHHHHHHHHhCCCCcHHHHHHHHHhCC Confidence 333445677788888888886653 121 11223444444334333444567778889999999999987773322 Q ss_pred CCCCCcceeeecccccccccccccCCCCCCCCCCC-CCccCCCCCCCccccCCc Q lcl|NC_021305. 385 SDDPKADELYANSALQPLGATPDGAVEWEEAPAPK-RPASTPVASLDQSPPTSV 437 (518) Q Consensus 385 ~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 437 (518) +.++--+.. ...+.....++ ...+.+. .+.+. 
.+++.....+ T Consensus 459 v~~~~~~~e-----~e~i~~~~~~~---~~~~~~~~~~~~~---~gg~~~~~~~ 501 (501) T protein:vir:95 459 VATEDDSKA-----KEKIAKDTAEA---MALATPANVPGDG---SGGDNVGNSE 501 (501) T ss_pred CCChhHHHH-----HHHHHhhhcCc---ccccccCCCCCCC---cccccccCCC Confidence 221100000 00000000000 0000000 00111 1111111111 No 252 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=79.55 E-value=0.1 Score=25.99 Aligned_cols=414 Identities=9% Similarity=-0.041 Sum_probs=146.6 Q ss_pred CcCCCCCCCC---cccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhh-ccC----ceEEEE Q lcl|NC_021305. 1 MLLANGQTLS---APAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQAL-ARL----PVKCMF 72 (518) Q Consensus 1 ~~f~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~i-a~l----~~~v~~ 72 (518) -+|.+.++.. ...++..+.++....+. ................... .|++.+|..+ +.| ||.=.. T Consensus 15 ~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~------~~~~~~~~~~~~~~dst~~-~a~~~Laa~l~~~ltP~~~WFrl~ 87 (536) T protein:vir:10 15 SVYERLKNDRAPYETRAQNCAQYTIPSLFP------KDSDNASTDYQTPWQAVGA-RGLNNLASKLMLALFPMQTWMRLT 87 (536) T ss_pred HHHHHHHHHhhHHHHHHHHHHHHhcccccC------CCCCcccccccccccccHH-HHHHHHHHHHHhhhcCCCcccccc Confidence 1111111000 11111111121111111 0000111111123333333 3555555544 222 332221 Q ss_pred ecCCccee----------c------cchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCC Q lcl|NC_021305. 73 TSGDTETE----------E------SDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHP 136 (518) Q Consensus 73 ~~~~~~~~----------~------~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p 136 (518) ..+....+ . 
-...++..+.+- +++.-+..+..+++.+|++.+++..+..+.+..+..++- T Consensus 88 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~s----nf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~~~~~~~pl 163 (536) T protein:vir:10 88 ISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESN----SYRVTLFEALKQLVVAGNVLLYLPEPEGSNYNPMKLYRL 163 (536) T ss_pred cChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhc----CcHHHHHHHHHHHHhHCcEeEEEeeCCCCceeeEEEEEc Confidence 11111000 0 011222233333 344555666788999999999987665544443333333 Q ss_pred ceeEEEEcCCceeeEEe--e---------------------------------eccccc----------Ccee------- Q lcl|NC_021305. 137 SRVAIKRNSRTGRYEYY--F---------------------------------QAGAGV----------GTQL------- 164 (518) Q Consensus 137 ~~v~v~~~~~~~~~~~~--~---------------------------------~~~~~~----------~~~~------- 164 (518) ..+.+..+..|...... + ...... .+.. T Consensus 164 ~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~~~e~~g~~v~~~~g~ 243 (536) T protein:vir:10 164 SSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEASGEYLRYEEVEGMEVQGSDGT 243 (536) T ss_pred CeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEEecCCCcEEEEEeecCccccccccc Confidence 45555555544221000 0 000000 0000 Q ss_pred EEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhc Q lcl|NC_021305. 165 VSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHS 244 (518) Q Consensus 165 ~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~ 244 (518) ..|..-.++.+|....++..||.||..-++..+.......+.......-...|...+..++.+.+.. + .. 
T Consensus 244 ~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~-------~---~~ 313 (536) T protein:vir:10 244 YPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVNPAGITQPRR-------L---TK 313 (536) T ss_pred cccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccCcccccchhh-------h---cc Confidence 0112234466666666778899999999999999998888888777666666666655444433322 1 11 Q ss_pred CccccCCee-ecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH--HHHHHHHHHhhHH Q lcl|NC_021305. 245 GSSNTGKTM-VVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA--QMRAFYRDTMAIP 321 (518) Q Consensus 245 g~~n~g~~~-vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~--~~~~~~~~~l~P~ 321 (518) +.. |.++ -..+.+...++...+.-.-..+..+.....|..+|-+.. +...+. ..-+.++ .+..-....|.|. T Consensus 314 ~~~--g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~--l~~~~~-~r~TAtEV~~r~~E~~~~LG~v 388 (536) T protein:vir:10 314 AQT--GDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS--AVQRTG-ERVTAEEIRYVASELEDTLGGV 388 (536) T ss_pred CCC--cceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhh--cccCCC-CCccHHHHHHHHHHHHHHhhHH Confidence 111 1111 122334444554433322234566677788888885542 211111 1123322 2233344455555 Q ss_pred HHHHHHHHHHhh-------------hhhhccc-ccceecchhhh---h-cCHHHHHHHHHHHHhCC------CcCHHHHH Q lcl|NC_021305. 322 IARIQSAMDKYV-------------GQYWVRK-NRMKFDIDDVI---Q-PDWEAKSESTQKMVNSG------VATPNEGR 377 (518) Q Consensus 322 ~~~ie~~l~~~l-------------~~~~~~~-~~~~fd~~~l~---~-~d~~~~~~~~~~~~~~G------~~T~NE~R 377 (518) +..+.++|-.-| +++.... 
...+| .+.+- + .+.......+..+.+.+ .+..+++- T Consensus 389 ~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~-vs~l~~l~r~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~ 467 (536) T protein:vir:10 389 YSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTI-STGLEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIK 467 (536) T ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceE-EecHHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHH Confidence 555555554333 2221111 12222 11121 1 11111122222111110 12222222 Q ss_pred ----HHhCCCCCCCCCcceeeecccccccccccc-cCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHH Q lcl|NC_021305. 378 ----EIMGLPRSDDPKADELYANSALQPLGATPD-GAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSG 452 (518) Q Consensus 378 ----~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (518) +.+|.+|.. .+..+.....+-.... .+...+.+........ ..... T Consensus 468 ~~~a~~~Gv~p~~-----~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~------------------------~~~~~ 518 (536) T protein:vir:10 468 LRIANAIGIDTSG-----ILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMA------------------------AQATA 518 (536) T ss_pred HHHHHHcCCCchh-----hcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------------HHHhc Confidence 122332210 0000000000000000 0000000000000000 00000 Q ss_pred HHHHHHhhcccCCchhhH Q lcl|NC_021305. 453 KTEPRRLMQKPPPKESSP 470 (518) Q Consensus 453 ~~~~~~~~~k~~~~~~~~ 470 (518) ..+....+.-+.|.+-.- T Consensus 519 ~~~~~~~~~~~~g~~~~~ 536 (536) T protein:vir:10 519 SPEAMAAAADSVGLQPGI 536 (536) T ss_pred CchhHHhhhhccccCCCC Confidence 000000000000000000 No 253 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=71.68 E-value=0.19 Score=24.51 Aligned_cols=382 Identities=9% Similarity=0.021 Sum_probs=131.8 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcc--------ccc--ccccccc--cchhhhHHHhh-------cHHHHHHHHHHHH Q lcl|NC_021305. 
1 MLLANGQTLSAPAMAELSPQMQDSYYY--------APA--VGMQLER--QFSLYGGIYKN-------QPWVRTVIAKRAQ 61 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~--------~~~--~~~~~~~--~~~~~~~~~~~-------~~~v~~~v~~ia~ 61 (518) ++|..-..... .+...+| ++.+. .-. ..++..+ ...+......+ ...|..-...+. T Consensus 42 ~~~~~~~~~~~--~~~~~~~--dst~~~a~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve- 116 (532) T protein:vir:99 42 SVFPSATADGS--TSYTTPW--QSIGARGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVE- 116 (532) T ss_pred cccCCCCCcch--hhccccc--cchHHHHHHHHHHHHHHhhcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHH- Confidence 55432111111 0000111 00000 000 0000000 00000000000 001111111111 Q ss_pred hhccCceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcC--CCc--eEEEEeeCCc Q lcl|NC_021305. 62 ALARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK--SGT--PEKLMPMHPS 137 (518) Q Consensus 62 ~ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~--~G~--~~~l~~l~p~ 137 (518) ...+..+.+- +++.-+..+..+++.+|++++++..+. .+. ....||+ . T Consensus 117 ----------------------~~~~~~~~~s----nf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~f~~~pl--~ 168 (532) T protein:vir:99 117 ----------------------RICMNYMESN----SFRPTLHAAIKQLLVAGNVLLYIPSTEQVEGQSNAPKLYKL--H 168 (532) T ss_pred ----------------------HHHHHHHHhc----CcHHHHHHHHHHHHhHCcEeEEecccccccCcccceEEEEc--C Confidence 1112222222 344445556778899999988875432 122 2333443 3 Q ss_pred eeEEEEcCCceeeEEe--ee---------------------------------ccccc----------CceeEE------ Q lcl|NC_021305. 138 RVAIKRNSRTGRYEYY--FQ---------------------------------AGAGV----------GTQLVS------ 166 (518) Q Consensus 138 ~v~v~~~~~~~~~~~~--~~---------------------------------~~~~~----------~~~~~~------ 166 (518) .+.+..+..|...... +. ..... .+..+. 
T Consensus 169 ~y~v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~~~~~~~~~~g~~~~~~~~~~ 248 (532) T protein:vir:99 169 NFVVERDAYDNVLQIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTHVYRDPEAMVFRSYQEIDGEIVAGTEGEY 248 (532) T ss_pred eEEEeeCCCCCeeeEeeeeeecHHhcChHHHHHhhccccccCCCcceEEEEEEEecCCCCeeEEEEeecCceeccccccc Confidence 3444444443221000 00 00000 000000 Q ss_pred -eccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcC Q lcl|NC_021305. 167 -FADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSG 245 (518) Q Consensus 167 -~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g 245 (518) +..-..+-.|....++..||.||..-++..+.......+.......-...|..++..++.+.+... ..+ T Consensus 249 ~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~~----------~~~ 318 (532) T protein:vir:99 249 PLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVNPNGVTQIRRV----------AKA 318 (532) T ss_pred ccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCceeccccccchhhh----------ccC Confidence 111233444555557778999999999999999998888888877777777777665554443321 111 Q ss_pred ccccCCeee--cCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH--HHHHHHHHHhhHH Q lcl|NC_021305. 246 SSNTGKTMV--VEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA--QMRAFYRDTMAIP 321 (518) Q Consensus 246 ~~n~g~~~v--l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~--~~~~~~~~~l~P~ 321 (518) .. | .++ -.+++...++.....-.-..+..+.....|..+|-+.. +...+ +..-+.++ .+..-....+.|. 
T Consensus 319 ~~--g-~~v~g~~~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~d-~~r~TAtEV~~r~~E~~~~LGpv 392 (532) T protein:vir:99 319 NT--G-DFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNS--AVQRG-GDRVTAEEIRYVAGELEDTLGGV 392 (532) T ss_pred CC--c-ceecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh--cccCC-CCcccHHHHHHHHHHHHHHhhHH Confidence 11 1 112 12334444444332222234556667788888885442 11111 11123332 2233444556666 Q ss_pred HHHHHHHHHHhhh-------------hhhcccccceecchhhhhcCHHHHHHHHHHHHhC------------CCcCHHHH Q lcl|NC_021305. 322 IARIQSAMDKYVG-------------QYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNS------------GVATPNEG 376 (518) Q Consensus 322 ~~~ie~~l~~~l~-------------~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~------------G~~T~NE~ 376 (518) +..+.++|-.-|+ ++..... ..-+ -+...+...+++.+..+.+. -.+..+++ T Consensus 393 ~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~-~~~~--iv~~is~Laraq~~~~l~~~~~~laq~~p~~~d~id~d~~ 469 (532) T protein:vir:99 393 YSLLSQELQLPLVKILLKELQATSKIPNLPKEA-VEPA--IATGLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDV 469 (532) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhh-cccc--eeecchHHHHHHHHHHHHHHHHHHHhhcchhhhhCCHHHH Confidence 6666666543332 2111110 1111 11222333333333222110 01111211 Q ss_pred ----HHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHH Q lcl|NC_021305. 377 ----REIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSG 452 (518) Q Consensus 377 ----R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (518) .+.+|.+|.. ++.+...+ ..... +...+.+.. .. .......-. T Consensus 470 ~~~~a~~~GV~~~~------i~r~~ee~--~~~~~-q~~~~~~~~--~a----------------------~~~~~~~~~ 516 (532) T protein:vir:99 470 KMRLANSLGMDTTG------LILTQQDK--QAKMA-EASTAAGMV--TA----------------------GQQMGAAGG 516 (532) T ss_pred HHHHHHHhCCChhh------ccCCHHHH--HHHHH-HHHHHHHHH--HH----------------------HHHHHHHHH Confidence 1122221110 00000000 00000 000000000 00 000000000 Q ss_pred HHHHHHh-hcccCCch Q lcl|NC_021305. 453 KTEPRRL-MQKPPPKE 467 (518) Q Consensus 453 ~~~~~~~-~~k~~~~~ 467 (518) ++.+.-. 
..-..+++ T Consensus 517 ~~~~~~~~~~~~~~~~ 532 (532) T protein:vir:99 517 QAAAAMMQQQAGMPTQ 532 (532) T ss_pred HhcchhHHhhcCCCCC Confidence 0000000 00001111 No 254 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=69.40 E-value=0.22 Score=24.15 Aligned_cols=407 Identities=11% Similarity=0.035 Sum_probs=142.6 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhc------cCceEEEEec Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALA------RLPVKCMFTS 74 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia------~l~~~v~~~~ 74 (518) .|.+. |.+=...++..+.++....+.....+ ................ .|++.+|..+- .-||.=..-. T Consensus 9 ~L~~~-R~~~e~~w~e~~~~tlP~~~~~~~~~----~~~~~~~~~~~dstg~-~a~~~LAa~l~~~ltpp~~~WF~l~~~ 82 (522) T protein:vir:10 9 QLTTA-RQMFLDKAVECSELTLPYLIDDDISS----RPNHKSLTVPWQSVGA-KCCVTLAAKLMLAVLPPQTSFFKLQVR 82 (522) T ss_pred HHHHH-hhHHHHHHHHHHHHhhhcccCCCCCC----CcccccccccccchHH-HHHHHHHHHHHHhhcCCCCccccccCC Confidence 12111 00001111122222211111111000 0000111122333333 45555555552 2344322211 Q ss_pred CCc-ceec-------cchH-------HHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCcee Q lcl|NC_021305. 75 GDT-ETEE-------SDTG-------YAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRV 139 (518) Q Consensus 75 ~~~-~~~~-------~~~~-------~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v 139 (518) +.. .... -+.+ ++..+.+- +++.-+..+..+++.+|++.+++..+. 
...||+ ..+ T Consensus 83 d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~s----nf~~~~~~~~~~L~~~G~a~ly~~~~~----~~~~pl--~~y 152 (522) T protein:vir:10 83 DDKLGEELDPQIRSELDLSFSKMERMIMDYIAAS----NDRVAVHQALKHLIVGGNALIFMGKDG----LKTFPL--TRY 152 (522) T ss_pred hHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhc----CcHHHHHHHHHHHHhHCceeEEEcCCC----ceEEEc--ceE Confidence 110 0000 0111 12223332 455556677889999999998865432 233444 233 Q ss_pred EEEEcCCceeeEEe--e-----------------------------------eccccc----------CceeE------- Q lcl|NC_021305. 140 AIKRNSRTGRYEYY--F-----------------------------------QAGAGV----------GTQLV------- 165 (518) Q Consensus 140 ~v~~~~~~~~~~~~--~-----------------------------------~~~~~~----------~~~~~------- 165 (518) .+..+..|...... + ...... .+..+ T Consensus 153 ~v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~~~~~~s~~ 232 (522) T protein:vir:10 153 VINRDGDGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVKLDKSSGRWVWHQEAFDKIIPDSRSTA 232 (522) T ss_pred EEeeCCCCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEEeeccCCceEEEEccCCcccccccccc Confidence 33344333221000 0 000000 00000 Q ss_pred EeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcC Q lcl|NC_021305. 166 SFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSG 245 (518) Q Consensus 166 ~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g 245 (518) .|..-..+-+|....++..||.||..-++..+.......+.......-...|..++..++...+.. ...| T Consensus 233 g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~~~----------l~~~ 302 (522) T protein:vir:10 233 PKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKPAT----------IAKA 302 (522) T ss_pred ccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeecccccccccc----------ccCC Confidence 111112344454445677899999999999999999999999888888888887775555444322 1111 Q ss_pred ccccCCeeecC--CCcceeeccCChhhHH-HHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH--HHHHHHHHHhhH Q lcl|NC_021305. 
246 SSNTGKTMVVE--EGMEPIPLQLTAVEMQ-FIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA--QMRAFYRDTMAI 320 (518) Q Consensus 246 ~~n~g~~~vl~--~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~--~~~~~~~~~l~P 320 (518) .. +.++.. +++...+++. ..+.+ ..+..+.....|..+|- ++...++..-+.++ .+..-....|.| T Consensus 303 ~~---~~~v~g~~~~v~~~~~~~-~~d~~~~~~~i~~~~~ri~~aFl-----~~~~~d~~rvTAtEV~~r~~E~~~~LGp 373 (522) T protein:vir:10 303 GN---GAIVQGRPEDVAVIQVGK-TADFSTAANMATAIEKRLLEAFL-----VMNVRNAERVTAEEVRLTQLELEQQLGG 373 (522) T ss_pred CC---cceecCCCccceeecccc-cccchHHHHHHHHHHHHHHHHHh-----hccCCCCCCCCHHHHHHHHHHHHHHhhH Confidence 11 122222 3344444433 23333 34556666777888873 22222222223332 223334445555 Q ss_pred HHHHHHHHHHHhh-------------hhhhcccccceecchhhhhcCHHHHHHHHHHHHhC-----CCcCHHHHHHHhCC Q lcl|NC_021305. 321 PIARIQSAMDKYV-------------GQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNS-----GVATPNEGREIMGL 382 (518) Q Consensus 321 ~~~~ie~~l~~~l-------------~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~-----G~~T~NE~R~~~g~ 382 (518) .+..+..+|-.-| +++..... ++ ...+...+...+++.+..+... .++-+..+...++. T Consensus 374 v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~-~~--~~~v~~is~Laraq~~~~l~~~~~~i~~~~~p~~~~~~id~ 450 (522) T protein:vir:10 374 IFSLLVIEFLIPYLNRTLLVLQRSNQIPKLPKDI-VR--PTIVAGVNALGRGQDRESLTAFVGTIAQTLGPEALMQYLNP 450 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccc-cc--cccccchhHHHHHHHHHHHHHHHHHHHHhhCchhhhhcCCH Confidence 5555555443322 22221111 01 1111222333333332222110 00001111111111 Q ss_pred CCCCCCCcceeeec-ccccccccc-c--ccCCCC-----CCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHH Q lcl|NC_021305. 383 PRSDDPKADELYAN-SALQPLGAT-P--DGAVEW-----EEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGK 453 (518) Q Consensus 383 ~p~~~~~gD~~~~~-~n~~~~~~~-~--~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (518) +.+-+.-++.+=+| .+++.-... . .++... 
+..+..++-+..+..+..+++ + T Consensus 451 d~~~~~~a~~~Gvp~~~ivrt~eev~~~~q~~q~~~~~~~~~~~a~~~~~~~~~~~~~~~-------------------~ 511 (522) T protein:vir:10 451 LEAIKRLAAAQGIDVLNLVKTEQQLAEEQQAAQQQAAQQSLVDQAGQMTGSPLMDPTKNP-------------------Q 511 (522) T ss_pred HHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCccccH-------------------H Confidence 11000000000011 111100000 0 000000 000000000000000000000 0 Q ss_pred HHHHHhhcccCCch Q lcl|NC_021305. 454 TEPRRLMQKPPPKE 467 (518) Q Consensus 454 ~~~~~~~~k~~~~~ 467 (518) .-. ...+.++| T Consensus 512 ~~~---~~~~~~~~ 522 (522) T protein:vir:10 512 LMD---EEQPPMEE 522 (522) T ss_pred HHH---HhCCCCCC Confidence 000 11122222 No 255 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=68.79 E-value=0.23 Score=24.06 Aligned_cols=404 Identities=12% Similarity=0.029 Sum_probs=148.1 Q ss_pred CcCCCCCC---CCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhc------cCceEEE Q lcl|NC_021305. 1 MLLANGQT---LSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALA------RLPVKCM 71 (518) Q Consensus 1 ~~f~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia------~l~~~v~ 71 (518) -+|...+. +=...++..+.++..+.+.....+. ..+....... -.|++.+|..+- .-||.=. T Consensus 18 ~r~~~L~~~R~~~e~~w~e~a~~~lP~~~~~~~~~~--------~~~~~~dstg-~~a~~~LAa~l~~~ltpp~~~WF~L 88 (516) T protein:vir:10 18 KLWEKFSTKRSSFLDRAKHYSKLTLPYLMNDKGDNE--------TSQNGWQGVG-AQATNHLANKLAQVLFPAQRSFFRV 88 (516) T ss_pred HHHHHHHHhhhHHHHHHHHHHHhhcccccCCCCCcc--------cccccccchH-HHHHHHHHHHHHhhhcCCCCccccc Confidence 12221111 0011112222222211111000000 0111223333 356666665552 3344332 Q ss_pred EecCCcce-------e--ccchHH-------HHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeC Q lcl|NC_021305. 
72 FTSGDTET-------E--ESDTGY-------AKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMH 135 (518) Q Consensus 72 ~~~~~~~~-------~--~~~~~~-------~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~ 135 (518) .-++.... + .-+.++ +.-+.+ -+++.-+..+..+++.+|++++++.. .+ ....||+ T Consensus 89 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~----snf~~~~~~~~~~L~~~G~a~l~~d~--~~-~~~~~pl- 160 (516) T protein:vir:10 89 DLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQ----RQFRPAVVEAFKHLIVAGSCMLYKPS--KG-AISAIPM- 160 (516) T ss_pred cCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHh----cCcHHHHHHHHHHHHhHCeEeEEecC--CC-CeEEEEc- Confidence 22211100 0 011111 112222 24555556667789999999887643 32 2445665 Q ss_pred CceeEEEEcCCceeeEEe--ee-----------------------------------ccccc---------CceeE---- Q lcl|NC_021305. 136 PSRVAIKRNSRTGRYEYY--FQ-----------------------------------AGAGV---------GTQLV---- 165 (518) Q Consensus 136 p~~v~v~~~~~~~~~~~~--~~-----------------------------------~~~~~---------~~~~~---- 165 (518) ..+.+..+..|...... .. ..... ++..+ T Consensus 161 -~~y~v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v~~~~~~~~~~~~~~d~~~~~~~s 239 (516) T protein:vir:10 161 -HHYVVNRDTNGDLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHAKYLGEGFWELKQSADDIPVGKVS 239 (516) T ss_pred -CeEEEeeCCCCCeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEEEecCCCceEEEEeeCceeecccc Confidence 33445555544321110 00 00000 00000 Q ss_pred --EeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHh Q lcl|NC_021305. 166 --SFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAH 243 (518) Q Consensus 166 --~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~ 243 (518) .|..-..+-+|....++..||.||..-++..+.......+.......-...|..++..++.+.+.. .. 
T Consensus 240 ~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~~~~----------l~ 309 (516) T protein:vir:10 240 KIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTDVDH----------FV 309 (516) T ss_pred ccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccCcccccchhh----------hc Confidence 011123344455555777899999999999999999988888888777777777765555444322 11 Q ss_pred cCccccCCeeecC--CCcceeeccCChhhHH-HHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH--HHHHHHHHHh Q lcl|NC_021305. 244 SGSSNTGKTMVVE--EGMEPIPLQLTAVEMQ-FIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA--QMRAFYRDTM 318 (518) Q Consensus 244 ~g~~n~g~~~vl~--~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~--~~~~~~~~~l 318 (518) .|.. +.++-. +++...+++.. .|.+ ..+..+.....|..+|-+.....-..+.. +.++ .+..-....+ T Consensus 310 ~~~~---g~~~~g~~~~v~~~q~~~~-~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rv---TAtEV~~r~~E~~~~L 382 (516) T protein:vir:10 310 NSGT---GEVVTGVEEDIHIVQLGKY-ADLTPISAVLEVYTRRIGVVFMMETMTRRDAERV---TAVEIQRDALEIEQNM 382 (516) T ss_pred cCCC---ceeecCCcccceeeecCcc-cchHHHHHHHHHHHHHHHHHHhhhhhhccCCccc---cHHHHHHHHHHHHHHh Confidence 1111 122222 22333333332 2333 33556677788888887664332222222 2322 3344455577 Q ss_pred hHHHHHHHHHHHHhhhhhhccc-------ccceecchhhhhcCHHHHHHHHHHHH---hC--CCc-CHHHHHHHhCCCCC Q lcl|NC_021305. 319 AIPIARIQSAMDKYVGQYWVRK-------NRMKFDIDDVIQPDWEAKSESTQKMV---NS--GVA-TPNEGREIMGLPRS 385 (518) Q Consensus 319 ~P~~~~ie~~l~~~l~~~~~~~-------~~~~fd~~~l~~~d~~~~~~~~~~~~---~~--G~~-T~NE~R~~~g~~p~ 385 (518) .|.+..+.++|-.-|+...-.. .-+..++ +.......++.....+. +. .++ -+-++...++++.. 
T Consensus 383 Gpv~~rl~~Ell~Pli~r~~~~~~p~~P~~lv~~~~--v~~i~~L~raq~~~~i~~~~q~i~~~~q~~p~v~d~id~d~~ 460 (516) T protein:vir:10 383 GGVYSLFATTMQSPVAMWGLLEAGDSFTSDLVDPVI--ITGIEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDY 460 (516) T ss_pred hhHHHHHHHHHHHHHHHHHHHhhCCCCChhhcCcce--ehhHHHHHHHHHHHHHHHHHHHHHHHhcCChHHHhhcCHHHH Confidence 7877777777654443211000 0011110 11111112211111110 00 000 00111111111100 Q ss_pred CCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcccCC Q lcl|NC_021305. 386 DDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPP 465 (518) Q Consensus 386 ~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~ 465 (518) -+.-.+.+-+|..++. ...+.+ .....+.+.+...+++.+++ T Consensus 461 ~~~~a~~~gvp~~~ir----------------------------------s~eev~----~~r~~~~~~q~~~~~~~~~~ 502 (516) T protein:vir:10 461 MDWVRGQISAELPFLK----------------------------------SAEEME----QEQEAQMQAQQAQMLEEGVA 502 (516) T ss_pred HHHHHHHhCCChhccC----------------------------------CHHHHH----HHHHHHHHHHHHHHHHHHhh Confidence 0000000000000000 000001 01111111111111111111 Q ss_pred chhhHHHHHHHHHh Q lcl|NC_021305. 466 KESSPKHLRAVKGA 479 (518) Q Consensus 466 ~~~~~~~~~~~~~~ 479 (518) +.-..-..+.++.. T Consensus 503 ~~~~~~~~~~~~~~ 516 (516) T protein:vir:10 503 KAVPGVIQQELKEA 516 (516) T ss_pred hcccchhhhhhhcC Confidence 11111111111111 No 256 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=67.53 E-value=0.25 Score=23.88 Aligned_cols=420 Identities=10% Similarity=0.002 Sum_probs=158.4 Q ss_pred CcCCCCCCC---CcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhh-ccC----ceEEEE Q lcl|NC_021305. 
1 MLLANGQTL---SAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQAL-ARL----PVKCMF 72 (518) Q Consensus 1 ~~f~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~i-a~l----~~~v~~ 72 (518) -+|.+.++. =...++..+.++..+.+..+ ................. .|++.+|..+ +.| ||.=.. T Consensus 14 ~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~------~~~~~~~~~~~~dst~~-~a~~~Las~l~~~ltP~~~WFrl~ 86 (522) T protein:vir:94 14 AVYDRLKNGRQPYETRAQNCAAVTIPSLFPKE------SDNSSTEYTTPWQAVGA-RCLNNLAAKLMLALFPQSPWMRLT 86 (522) T ss_pred HHHHHHHHHhhHHHHHHHHHHHHhcccccCCC------CCcccccccccccccHH-HHHHHHHHHHHhhcCCCCcccccc Confidence 111111110 01111112222211111100 00001111112333333 4555555555 322 332222 Q ss_pred ecCCcce----------ecc------chHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCC Q lcl|NC_021305. 73 TSGDTET----------EES------DTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHP 136 (518) Q Consensus 73 ~~~~~~~----------~~~------~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p 136 (518) ..+.... +.. ...++.-+.+- +++.-+..+..+++.+|++++++..+..|.+..+..++- T Consensus 87 ~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~s----nf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~~~~~~pl 162 (522) T protein:vir:94 87 VSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETN----SFRVPLFEALKQLIVSGNCLLYIPEPEQGTYSPMRMYRL 162 (522) T ss_pred cchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhc----CcHHHHHHHHHHHHhhCcEeEeeeccCCCceeeEEEEEc Confidence 1111000 000 01112222222 355556667889999999999988777666555444444 Q ss_pred ceeEEEEcCCceeeEEe--eec-------------------------------cc--------ccCceeE-------Eec Q lcl|NC_021305. 137 SRVAIKRNSRTGRYEYY--FQA-------------------------------GA--------GVGTQLV-------SFA 168 (518) Q Consensus 137 ~~v~v~~~~~~~~~~~~--~~~-------------------------------~~--------~~~~~~~-------~~~ 168 (518) ..+.+..+..|...... +.. .. ...+..+ .|. 
T Consensus 163 ~~y~v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 242 (522) T protein:vir:94 163 VSYVVQRDAFGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIYRQDDEYLRYEEVEGIEVTGTDGSYPLT 242 (522) T ss_pred ceEEEeeCCCcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEEEEeeCCceeEEeeccCceecccCCCCccc Confidence 55555565555331000 000 00 0000000 122 Q ss_pred cccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccc Q lcl|NC_021305. 169 DDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSN 248 (518) Q Consensus 169 ~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n 248 (518) .-.++.+|....++..||.||...++..+.......+.......-...|..++..++...+... ..+.. T Consensus 243 e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~~~~----------~~~~~- 311 (522) T protein:vir:94 243 ACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGITQPRRL----------NKAAT- 311 (522) T ss_pred cCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccccchhe----------eccCC- Confidence 2345556666667788999999999999999999999999999988888887766655554332 11111 Q ss_pred cCCeeec--CCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH--HHHHHHHHHhhHHHHH Q lcl|NC_021305. 249 TGKTMVV--EEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA--QMRAFYRDTMAIPIAR 324 (518) Q Consensus 249 ~g~~~vl--~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~--~~~~~~~~~l~P~~~~ 324 (518) +.++. ++++...++...+.-.-..+..+.....|..+|-+.. ++..+.... +.++ .+..-....+.|.+.. 
T Consensus 312 --g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~~~r~-TAtEV~~r~~E~~~~LG~v~~r 386 (522) T protein:vir:94 312 --GEFVAGRVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLNS--AVQRNAERV-TAEEIRYVAGELEATLGGVYSV 386 (522) T ss_pred --ceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh--hccCCCccc-cHHHHHHHHHHHHHHHhHHHHH Confidence 12222 2334454544332222234566677788888886652 222222222 2222 2333444566676666 Q ss_pred HHHHHHHhhh-------------hhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcc Q lcl|NC_021305. 325 IQSAMDKYVG-------------QYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKAD 391 (518) Q Consensus 325 ie~~l~~~l~-------------~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD 391 (518) +.++|-.-|+ ++... ..+++++...+. ...+..-+..+.+ ..+.+ -.+.|. ..| T Consensus 387 l~~E~l~Pli~r~~~il~r~g~lP~~p~-~~v~v~~~s~La--~~qr~~~~~~l~~----~~~~i---a~l~P~---~~~ 453 (522) T protein:vir:94 387 QSQELQLPIVRVLMNQLQSAGMIPDLPK-EAVEPTVSTGLE--ALGRGQDLEKLTQ----AVNMM---TGLQPL---SQD 453 (522) T ss_pred HHHHHHHHHHHHHHHHHHhcCCCCCCCc-ccEEeeEecHHH--HHHHHHHHHHHHH----HHHHH---Hhccch---hhh Confidence 6666653332 21111 123333221111 1111111111111 01111 111111 011 Q ss_pred eeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcccCCchhhHH Q lcl|NC_021305. 392 ELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPK 471 (518) Q Consensus 392 ~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~ 471 (518) .. .|... ..........- + +...-. +.+......+.+.+..+. ...+.. T Consensus 454 ~~---id~d~---~~~~~a~~~Gv---------~--------~~~ivr----~~ee~~~~~~q~~~~~~~----~~~~~~ 502 (522) T protein:vir:94 454 PD---INLPT---LKLRLLNALGI---------D--------TAGLLL----TQDEKIQRMAEQSSQQAV----VQGASA 502 (522) T ss_pred hc---CCHHH---HHHHHHHHcCC---------C--------hhhccC----CHHHHHHHHHHHHHHHHH----HHHHHH Confidence 00 01000 00000000000 0 000000 000000000000000000 001111 Q ss_pred HHHHHHHhhccccCcCchhHHHH Q lcl|NC_021305. 
472 HLRAVKGAMGRGKDIKGFALQLA 494 (518) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~ 494 (518) ....+.+.++ ..+ +....+| T Consensus 503 ~~~~~~a~~~-~~~--~~~~~~~ 522 (522) T protein:vir:94 503 AGANMGAAVG-QGA--GEDMAQA 522 (522) T ss_pred HHHHhhhhhh-ccc--chhhhcC Confidence 1111111111 000 0011111 No 257 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=64.61 E-value=0.3 Score=23.47 Aligned_cols=396 Identities=10% Similarity=0.048 Sum_probs=140.4 Q ss_pred CcCCCCCCCCcccc-cccchhhhh---hhccccccc-ccccc--cchhhhHHH-------hhcHHHHHHHHHHHHhhccC Q lcl|NC_021305. 1 MLLANGQTLSAPAM-AELSPQMQD---SYYYAPAVG-MQLER--QFSLYGGIY-------KNQPWVRTVIAKRAQALARL 66 (518) Q Consensus 1 ~~f~~~~~~~~~~~-~~~~~~~~~---~~~~~~~~~-~~~~~--~~~~~~~~~-------~~~~~v~~~v~~ia~~ia~l 66 (518) .+|........... +...+...+ .+...-.++ .|..+ ...+....+ .....|..-.+.+. T Consensus 42 ~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve------ 115 (543) T protein:vir:88 42 SLFPKDSDNSSTDYTTPWQAVGARGLNNLSAKVMLALFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVE------ 115 (543) T ss_pred ccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHH------ Confidence 55542211111110 000000000 000000000 00000 000000000 00011111111111 Q ss_pred ceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCC----ceEEEEeeCCceeEEE Q lcl|NC_021305. 67 PVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSG----TPEKLMPMHPSRVAIK 142 (518) Q Consensus 67 ~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G----~~~~l~~l~p~~v~v~ 142 (518) ..++..+.+-| ++.-+..+..+++.+|++.+++..+... .....||+. .+.+. 
T Consensus 116 -----------------~~~~~~~~~sn----f~~~~~~~~~~L~~~G~a~ly~~~~~~~~~~~~~~~~~pl~--~y~v~ 172 (543) T protein:vir:88 116 -----------------RILMSYMEANS----YRVTLFELIRQLALAGTALIYLPPPDASSNSYNPMKLYTLH--NHVVQ 172 (543) T ss_pred -----------------HHHHHHHHhcC----cHHHHHHHHHHHHhhCceeeeeccCccccceecceEEeEcc--eEEEe Confidence 11222223333 4444555677899999999887654321 123344442 23333 Q ss_pred EcCCceee----------------------------------EEeeeccc----------ccCceeE-------Eecccc Q lcl|NC_021305. 143 RNSRTGRY----------------------------------EYYFQAGA----------GVGTQLV-------SFADDE 171 (518) Q Consensus 143 ~~~~~~~~----------------------------------~~~~~~~~----------~~~~~~~-------~~~~~e 171 (518) .+..|... .|...... ...+..+ .+..-. T Consensus 173 ~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~~~V~pr~~~~~~~~~~~~~~~~v~~~~~~~~~~e~P 252 (543) T protein:vir:88 173 RDAFGNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVYTHIYIDDESGDFLSYQEIEGVEVDGSDGQYPQDALP 252 (543) T ss_pred eCCCCCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEEEEEEEeecCCCcccccccccCeeeecCCCccccccCC Confidence 33333211 00000000 0001111 112234 Q ss_pred EEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCC Q lcl|NC_021305. 172 VVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGK 251 (518) Q Consensus 172 vih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~ 251 (518) ++.+|....++..||.||...++..+.......+.......-...|..++..++...+.. ...|..+ T Consensus 253 ~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~~~~----------~~~~~~g--- 319 (543) T protein:vir:88 253 WIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVNPNGITQVRR----------LVKAQTG--- 319 (543) T ss_pred ceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccccchhh----------cccCCCc--- Confidence 566666666788899999999999999999999999999888888887776665544332 1122111 Q ss_pred eee--cCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH--HHHHHHHHHhhHHHHHHHH Q lcl|NC_021305. 
252 TMV--VEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA--QMRAFYRDTMAIPIARIQS 327 (518) Q Consensus 252 ~~v--l~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~--~~~~~~~~~l~P~~~~ie~ 327 (518) .++ ..+++...++...++-.-..+..+.....|..+|-+.. +...+.. .-+.++ .+..-....+.|.+..+++ T Consensus 320 ~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~~~-r~TAtEV~~r~~E~~~~LG~v~~rl~~ 396 (543) T protein:vir:88 320 DFVAGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLNS--AVQRSGE-RVTAEEIRYVASELEDTLGGVYSILSQ 396 (543) T ss_pred eeecCCCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhhh--hccCCCC-cccHHHHHHHHHHHHHHHhHHHHHHHH Confidence 122 23445555555443333345667777888888886652 2222222 123332 2233444556666666665 Q ss_pred HHHHhh-------------hhhhcccccceecch----hhhhc-CHHHHHHHHHHHHhC-C------CcCHHHHH----H Q lcl|NC_021305. 328 AMDKYV-------------GQYWVRKNRMKFDID----DVIQP-DWEAKSESTQKMVNS-G------VATPNEGR----E 378 (518) Q Consensus 328 ~l~~~l-------------~~~~~~~~~~~fd~~----~l~~~-d~~~~~~~~~~~~~~-G------~~T~NE~R----~ 378 (518) +|-.-| +++... ..++.++. .+-+. +.......+. .+.. + .+..+++- + T Consensus 397 E~l~Pli~r~~~il~r~g~lP~~p~-~~v~~~~vs~l~~l~r~~~~~~l~~~~~-~v~~~~~p~vld~id~d~~~~~~a~ 474 (543) T protein:vir:88 397 ELQLPIVRVLLNQLQATQQIPNLPQ-EAVEPTVTTGAEALGRGQDLDKLTQFLN-AVATVSQLNGDPDLNVNNIKLRLAN 474 (543) T ss_pred HHHHHHHHHHHHHHHhcCCCCCCch-hceeeeEEecHHHHHHHHHHHHHHHHHH-HHHhccchhhhccCCHHHHHHHHHH Confidence 554333 222111 12222221 11111 1111111111 1110 0 11222222 2 Q ss_pred HhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHH Q lcl|NC_021305. 379 IMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRR 458 (518) Q Consensus 379 ~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (518) .+|.+|.. .+..+.....+-. ++. .+.+... .. ...+ .+.-. 
+..+..++.- T Consensus 475 ~~Gv~~~~-----i~r~~~e~~~~~~---q~~-~q~~~~~-----~~-~~~~-------~~~~~------~~~~~~~~~~ 526 (543) T protein:vir:88 475 AIGIDTAG-----LLLTEAEKAQAQS---QEM-LKQGGLN-----AA-AGIG-------SGVAA------QATASPEAME 526 (543) T ss_pred HhCCChhh-----hcCCHHHHHHHHH---HHH-HHHHHHH-----HH-HHHh-------hchhh------hhccChHHHH Confidence 22442210 0000000000000 000 0000000 00 0000 00000 0000000000 Q ss_pred hhc----ccCCchhhHHHHHHH Q lcl|NC_021305. 459 LMQ----KPPPKESSPKHLRAV 476 (518) Q Consensus 459 ~~~----k~~~~~~~~~~~~~~ 476 (518) .+. -++++.+- .| T Consensus 527 ~~~~~~~~~~~p~~~-----~~ 543 (543) T protein:vir:88 527 SAMDTAGVQPGPIAT-----QV 543 (543) T ss_pred HHhhhcCCCCCCCCC-----CC Confidence 000 00111100 00 No 258 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=62.05 E-value=0.34 Score=23.14 Aligned_cols=442 Identities=9% Similarity=0.006 Sum_probs=158.1 Q ss_pred CcCCCCCCCCcccccccchhh------------hh-hhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccC- Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQM------------QD-SYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARL- 66 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~------------~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l- 66 (518) =+|.+.+... ......|. ++ .++..+. .. . ........-.-.+.|+.+|+.+..++... T Consensus 27 ~~~~~~~~~r---~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~-~~-~--~~~~~~rs~~~~~~v~~~ve~~~~~l~~~~ 99 (651) T protein:vir:80 27 KEYKRFCDAR---QVCEETWLEAWGMYLSTPEAQDYLRDQVLR-SV-G--DVNADWRHKITTGKAFEAIETIHAYLMSAT 99 (651) T ss_pred HHHHHHHHHh---hhhhhhHHHHHHhhcccHHHHHhhcccccc-cc-C--CCCCCCCccccChhHHHHHHHHHHHHHHhh Confidence 1111111100 00001111 00 0010000 00 0 00000001123456778886555555442 Q ss_pred --ceEEEE--ecCCcceec-cchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCC---------------- Q lcl|NC_021305. 
67 --PVKCMF--TSGDTETEE-SDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS---------------- 125 (518) Q Consensus 67 --~~~v~~--~~~~~~~~~-~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~---------------- 125 (518) +-++++ ..++..... ....+..++...-...........++.+.+.+|++++.+.++.. T Consensus 100 ~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~kv~we~~~~~~~~~~~~~~~~~~ 179 (651) T protein:vir:80 100 FPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLALPWRVETAEVKKKVQVRTPLFE 179 (651) T ss_pred cCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEEEeecceeeeeehheeccccccc Confidence 111222 111111000 11112222221111233666777788999999999886654311 Q ss_pred Cc--------------eEEEEeeCCceeEEEEcCCc-------------------------------------------- Q lcl|NC_021305. 126 GT--------------PEKLMPMHPSRVAIKRNSRT-------------------------------------------- 147 (518) Q Consensus 126 G~--------------~~~l~~l~p~~v~v~~~~~~-------------------------------------------- 147 (518) |+ -..+..++|..+.+.....+ T Consensus 180 ~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~~~d~~~v~~~~~t~~~l~~l~~~g~~~~~~~~~~~~~~~~~~~~ 259 (651) T protein:vir:80 180 DEPTFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTDPNRGAFIRKLTKTKADILNLLSEGYYYGVDPLDVVEHKCKDTSD 259 (651) T ss_pred cccceeeeccceeeeceeEEEEecHHHeeecCCCcCccccceeeeeeeeHHHHHHHHhcccccchhhHHHHhhhcccccc Confidence 10 01233343322222111000 Q ss_pred ---------------------eeeEEeeec-ccccCce----eEEecc--------------ccEEEEeccCCCCcccCc Q lcl|NC_021305. 148 ---------------------GRYEYYFQA-GAGVGTQ----LVSFAD--------------DEVVPIRFFNPDGLERGL 187 (518) Q Consensus 148 ---------------------~~~~~~~~~-~~~~~~~----~~~~~~--------------~evih~~~~~~~~~~~G~ 187 (518) ....|.++. ....+.. .+.+.. ...+|++.....+..||. 
T Consensus 260 ~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~~~~~v~~~g~~il~~~~~~~~~~~Pf~~~~~~~~~~~~yG~ 339 (651) T protein:vir:80 260 TKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVVVTIMGNEVLRFEQNPYWCGRPFVIGTYIPTARQPYAM 339 (651) T ss_pred CCccccccccCCCccccccccceEEEEEEEEeeccCCceEEEEEEEcCcEEecccccCCCCCCCeeeecceecCccccCC Confidence 000010000 0000000 001111 134555545456678999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecCCCcceeeccCC Q lcl|NC_021305. 188 SLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLT 267 (518) Q Consensus 188 s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~ 267 (518) |++..++..........+.........+.|.+++..++..++++. . ...|+++++.....+.++... T Consensus 340 g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~l-------~------~~pg~vi~~~~~~~~~~l~~~ 406 (651) T protein:vir:80 340 GALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDV-------Y------TEPGKVFLVSDHGDLQPLANQ 406 (651) T ss_pred ChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHHh-------h------cCCCceEEecCCCCceeeccC Confidence 999999999999998888888888888888888876666655542 1 123456667666666666443 Q ss_pred hhhHH-HHHHHHHHHHHHHHHhcCCHHHhccccccc-cCCHH--HHHHHHHHHHhhHHHHHHHHHHHHhhhhh------- Q lcl|NC_021305. 268 AVEMQ-FIEARQLNREEVCGVYDIAPPIVHILDRAT-FSNIS--AQMRAFYRDTMAIPIARIQSAMDKYVGQY------- 336 (518) Q Consensus 268 ~~d~~-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~-~sn~e--~~~~~~~~~~l~P~~~~ie~~l~~~l~~~------- 336 (518) ..+.+ ...........+-..+||+....|....+. ..+.. .....-....+.+++..+..++..-|+.. 
T Consensus 407 ~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~ 486 (651) T protein:vir:80 407 SSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQ 486 (651) T ss_pred cccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22322 234566667778889999988877644321 11222 23333444556666666655543322110 Q ss_pred hcc--------c------cc---------ceecchhhhhcCHHHHHHHHHHH---HhCCCcCH------------HHHHH Q lcl|NC_021305. 337 WVR--------K------NR---------MKFDIDDVIQPDWEAKSESTQKM---VNSGVATP------------NEGRE 378 (518) Q Consensus 337 ~~~--------~------~~---------~~fd~~~l~~~d~~~~~~~~~~~---~~~G~~T~------------NE~R~ 378 (518) ... . .+ ..+++..+-.....++...+..+ ++.+.-.+ -++.+ T Consensus 487 ~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~l~~~~q~~~~~p~~~~~~~~~~~~~~l~~ 566 (651) T protein:vir:80 487 FTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIEDRLTFIQAVAQVPEMGQLVDYKRILVDLLQ 566 (651) T ss_pred hcCcccceeecccccccccccccCccceeeeeeeeeccHHHHHHHHHHHHHHHHHHHhhccCCccchhhhHHHHHHHHHH Confidence 000 0 00 11111111000111122222221 11110000 00111 Q ss_pred HhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHH Q lcl|NC_021305. 379 IMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRR 458 (518) Q Consensus 379 ~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (518) ..|++ + .+.++.+.. +.. + ..+........+. ...+++..+ T Consensus 567 ~~g~~---~--~~~~l~~~~--------------q~~-----------------~--~~~~~~~~~q~~~-~~~~a~~~~ 607 (651) T protein:vir:80 567 HWGFE---E--PEAYLKQQD--------------QQA-----------------P--ANPQEALLSQAKD-VGGQAMSNM 607 (651) T ss_pred HcCCC---C--cHHhcCCCc--------------cch-----------------h--hhhhHHHHhhHHH-HHHHHHHHH Confidence 11211 0 000000000 000 0 0000000000000 001111111 Q ss_pred hhcccCCchhhHHHHHHHHHhhccccCcCchhHHHHHHHHHHHhHHHHhhhhhhhcccCC Q lcl|NC_021305. 
459 LMQKPPPKESSPKHLRAVKGAMGRGKDIKGFALQLAEKYPDDLEDILLAVQLALAERKDN 518 (518) Q Consensus 459 ~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (518) +..... .....++...... ...++.++.-..|--++|.+..-. T Consensus 608 ~~~~~~-~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~l~~~~~~ 650 (651) T protein:vir:80 608 LQNQLQ-ADGGTQMMSEMYG----------------TPNADQMQQELMATTPNVSEQQLT 650 (651) T ss_pred HHHHHH-HHHHHHHHHHHHH----------------HHHHHHHHHHHHHHHHHHHHhhcc Confidence 000000 0000011000000 000111222222222233222222 No 259 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=58.37 E-value=0.41 Score=22.68 Aligned_cols=396 Identities=12% Similarity=0.099 Sum_probs=160.8 Q ss_pred CcCCCCCCCC----ccc---ccccchhhhhhhccccccccc-----c-cccchhh-hHHHhh----cHHHHHHHHHHHHh Q lcl|NC_021305. 1 MLLANGQTLS----APA---MAELSPQMQDSYYYAPAVGMQ-----L-ERQFSLY-GGIYKN----QPWVRTVIAKRAQA 62 (518) Q Consensus 1 ~~f~~~~~~~----~~~---~~~~~~~~~~~~~~~~~~~~~-----~-~~~~~~~-~~~~~~----~~~v~~~v~~ia~~ 62 (518) ||-.+|+... -|. ...-...+++..++......+ . ....... ...++. .+++...++.++.. T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~ 80 (489) T protein:vir:78 1 MLTENGQGSGVKTKHREWLHYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVGS 80 (489) T ss_pred CccCCCccCCCCccCHHHHHHHHHHHHHHHHhcCcccccccCCCCCCCCCCCChHHHHHHHhccccCChHHHHHHHHhch Confidence 8877766322 221 111222345555553221111 1 1111111 222222 34445555555555 Q ss_pred hccCceEEEEecCCcceeccchHHHHHHhcCC-cCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCc-----------eEE Q lcl|NC_021305. 63 LARLPVKCMFTSGDTETEESDTGYAKLLADPC-EYLDPFAFWEWVASTLDIYGETYLAIQKNKSGT-----------PEK 130 (518) Q Consensus 63 ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN-~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~-----------~~~ 130 (518) +-+-|..+ +.. ..+..|+.... ...+..+|.+.++...+.+|.+++++.....|. --. 
T Consensus 81 vfrk~p~~---------~~p-~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~~~~rPy 150 (489) T protein:vir:78 81 VMRKEPEI---------NIP-KELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNAGLLNPT 150 (489) T ss_pred hhcCCcce---------ecc-HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHHhcCCcE Confidence 44444432 112 22334555444 357899999999999999999999998765542 112 Q ss_pred EEeeCCceeE-----------------EEE-----cCCc---eee--EEee-------------ecccccCc---eeEEe Q lcl|NC_021305. 131 LMPMHPSRVA-----------------IKR-----NSRT---GRY--EYYF-------------QAGAGVGT---QLVSF 167 (518) Q Consensus 131 l~~l~p~~v~-----------------v~~-----~~~~---~~~--~~~~-------------~~~~~~~~---~~~~~ 167 (518) +..+.|..|. +.. +..+ ... .|+. +.....++ ....+ T Consensus 151 ~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~~~~~~ 230 (489) T protein:vir:78 151 IAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQEDVVEI 230 (489) T ss_pred EEEechhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecCCcccceeeEE Confidence 3333333221 100 0000 000 0000 00000000 00001 Q ss_pred cc------c---cEEEEeccCCCCcccCchHHHHHHHH-HHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHH Q lcl|NC_021305. 168 AD------D---EVVPIRFFNPDGLERGLSLMESLKST-IFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLRE 237 (518) Q Consensus 168 ~~------~---evih~~~~~~~~~~~G~s~l~~~~~~-i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~ 237 (518) .+ - .++.+- ...++...|.||+..+... +.+......+... +...+.|-.++.-....+++....... T Consensus 231 ~~~~g~~~l~~IPfv~~~-~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~-l~~~~~P~l~i~G~d~~~~~~~~~~~~ 308 (489) T protein:vir:78 231 YPDLGESLRGVIPFTFIG-ATNNDATIDDAPLLPLAELNIGHYRNSADNEES-SFVVGQPTLFIYPGENLTPQAFKEANP 308 (489) T ss_pred eccCCCCccCeeeEEEEe-cCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHH-HHHcccceeeeecCccCCcccccccCc Confidence 00 0 111111 1223344578888776654 4444444444444 444567777665433334332222111 Q ss_pred HHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH--HHHHHHH Q lcl|NC_021305. 
238 QFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA--QMRAFYR 315 (518) Q Consensus 238 ~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~--~~~~~~~ 315 (518) .....| ....+.++.+.++.-+..+...+. .+..+....+.+ ..| ..++- .+..-+.++ .....-+ T Consensus 309 --~~i~~g---~~~~~~lp~~~~~~~ie~~~~~~~-r~~l~~le~qm~-~lG--a~l~~---~~~~~Ta~~~~~~~~~~~ 376 (489) T protein:vir:78 309 --NGIKFG---SRRGHNLGYGGSAQLIQAGENNLA-RQNMLDKEQQAI-QIG--AQLIT---PTQQITAQSARIQRGADT 376 (489) T ss_pred --cceeeC---CcccccCCCCCCcceeccCcchHH-HHHHHHHHHHHH-HHh--hhhcc---CCcchhHHHHHHHHHHhh Confidence 011122 223455666654433433333332 122222222211 222 22331 111122222 2334446 Q ss_pred HHhhHHHHHHHHHHHHhhh--hhh-cc--ccccee--cchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Q lcl|NC_021305. 316 DTMAIPIARIQSAMDKYVG--QYW-VR--KNRMKF--DIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDP 388 (518) Q Consensus 316 ~~l~P~~~~ie~~l~~~l~--~~~-~~--~~~~~f--d~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~ 388 (518) ..|.-++..++++++..|- ..+ +. ...+.| +.+.....-.....+.+-.+++.|.+|....++.+-.--+.++ T Consensus 377 S~L~~~a~~~e~al~~~l~~~a~w~G~~~~~~~~i~~n~dF~~~~~d~~~~~al~~~~~~G~is~~t~~~~L~~~gv~d~ 456 (489) T protein:vir:78 377 SVMATIARNVSQAYTDALRWVAVMLGKPEDTEVEFRLNMDFFLEPMTAQDRAAWMADINAGLLPATAYYAALRKAGVTDW 456 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCceEEEeecccCcccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCc Confidence 6788888889988887652 222 11 122232 3333222222334666777889999999888887643222111 Q ss_pred CcceeeecccccccccccccCCCCCCCCC-CCCCccCCCCCCCccccCCcc Q lcl|NC_021305. 389 KADELYANSALQPLGATPDGAVEWEEAPA-PKRPASTPVASLDQSPPTSVP 438 (518) Q Consensus 389 ~gD~~~~~~n~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 438 (518) - + ..+.. ....++.+. 
.+.++..| ++.++ +++ T Consensus 457 ~-~--------e~~~~----ei~~~~~~~~~~~~g~~~--~~~q~---~~~ 489 (489) T protein:vir:78 457 T-D--------ADIKD----AVADQPLPVATEVQGEIP--QSAQQ---QEK 489 (489) T ss_pred c-H--------HHHHH----HHhhcCCCcccCCcccCC--CCccc---ccC Confidence 0 0 00000 000011000 00111111 11111 011 No 260 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=57.59 E-value=0.43 Score=22.59 Aligned_cols=447 Identities=10% Similarity=0.008 Sum_probs=135.5 Q ss_pred CcCCCCCCCCcccccc-cch-----------hhhhhhccc--ccccccccccchhhhH--------HHhhcHHHHHHHHH Q lcl|NC_021305. 1 MLLANGQTLSAPAMAE-LSP-----------QMQDSYYYA--PAVGMQLERQFSLYGG--------IYKNQPWVRTVIAK 58 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~-~~~-----------~~~~~~~~~--~~~~~~~~~~~~~~~~--------~~~~~~~v~~~v~~ 58 (518) +.|...-+...+.+.. .++ .+...+... |..-.+...+....+. ...++.. +.++.- T Consensus 67 ~~~~~~~~~~~~~r~ki~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l~~~~~-~~~~~~ 145 (641) T protein:vir:94 67 RNFQTTGADDADWRHRINTGHTFEVVETLVAYFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKLEAASI-RDIFET 145 (641) T ss_pred ccccccccchhcccccccchhHHHHHHHHhhHHhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHHhhcch-HHHHHH Confidence 2322111111111111 111 111111111 1100111111111111 0111111 111111 Q ss_pred HH-------HhhccCceEEEEecC-------Cc---------ceecc------ch-HHHHHHhcCCcCCCH-----HHHH Q lcl|NC_021305. 59 RA-------QALARLPVKCMFTSG-------DT---------ETEES------DT-GYAKLLADPCEYLDP-----FAFW 103 (518) Q Consensus 59 ia-------~~ia~l~~~v~~~~~-------~~---------~~~~~------~~-~~~~L~~~PN~~~s~-----~~f~ 103 (518) .- +.|.+++|.+..... .+ .+... .+ ..+.+...|+..... +... 
T Consensus 146 ~~~d~~~~g~~iv~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~dps~~~~~~~f~~~r~t 225 (641) T protein:vir:94 146 YVRNLVLYGVSTYRLGWDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWLDTSGGKNTGTFVRLRHT 225 (641) T ss_pred HHHHHhhcCceEEEeehhhHHHHhhhhhcccchhhcccccccceecccceeeEEecchhheeecCCCCcccccceehhhh Confidence 11 222344443210000 00 00000 00 011122334433221 1111 Q ss_pred HHHHHHHHHcCCe----EEEEEEcCC----C-ceEEEEeeCCceeEEE-----EcCCceeeEEeeecccccCceeEE--- Q lcl|NC_021305. 104 EWVASTLDIYGET----YLAIQKNKS----G-TPEKLMPMHPSRVAIK-----RNSRTGRYEYYFQAGAGVGTQLVS--- 166 (518) Q Consensus 104 ~~~v~~ll~~G~~----~~~i~r~~~----G-~~~~l~~l~p~~v~v~-----~~~~~~~~~~~~~~~~~~~~~~~~--- 166 (518) +.-+..+...|.- .-+..+... . ....+...+...+.+. .+.++......+. ...++..+. T Consensus 226 ~~t~~~l~~eg~~~~d~v~~~~~~~~~~~~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~--~~~g~~il~~~~ 303 (641) T protein:vir:94 226 REELHELVTSGYYDLDLTQVEQYVDYKFADPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHA--VFYGKQLIRLSD 303 (641) T ss_pred HHHHHHHHhcCCCChhhcchhhcccccccccccccccccccccccceeeeeeeeccCCCceeeEEE--EEeCCEEeeccc Confidence 2223444433210 000000000 0 0111111222222211 1122222211111 111121111 Q ss_pred ---eccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHh Q lcl|NC_021305. 167 ---FADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAH 243 (518) Q Consensus 167 ---~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~ 243 (518) +....+++++.....+..||.||...++..+.......+...........|..++..++.++++... 
T Consensus 304 ~~~~d~~Pf~~~r~~~~~~~~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~l~---------- 373 (641) T protein:vir:94 304 SKYWCGSPFVTTTLLPDRDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKREDVK---------- 373 (641) T ss_pred ccccCcCCeEEecceecCCcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeeccccccccceee---------- Confidence 1122566667665667789999999999999999998888888777777777776666655553320 Q ss_pred cCccccCCeeecCCCcceeeccCChhhHH-HHHHHHHHHHHHHHHhcCCHHHhcccc-ccccCCHHHHH----------- Q lcl|NC_021305. 244 SGSSNTGKTMVVEEGMEPIPLQLTAVEMQ-FIEARQLNREEVCGVYDIAPPIVHILD-RATFSNISAQM----------- 310 (518) Q Consensus 244 ~g~~n~g~~~vl~~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~~sn~e~~~----------- 310 (518) .+.|+++.......+.++.....+.+ ...........|-.+|++...+.+... ++..-+..+-. T Consensus 374 ---~~PG~ii~~~~~~~v~pl~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~ 450 (641) T protein:vir:94 374 ---AKPGAVFKVAQHGSLQPIDMGRQDFVVTYQEAQVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLS 450 (641) T ss_pred ---ccCCcceeeCCCCcceeecCCccccchhHHHHHHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHH Confidence 12344555544444555532221211 122334444567778888765443321 11111222211 Q ss_pred ---HHHHHHHhhHHHHHHHHHHHHhhhhhh------------------cccccceecchhhhhcCHHHHHHHHHHH---H Q lcl|NC_021305. 311 ---RAFYRDTMAIPIARIQSAMDKYVGQYW------------------VRKNRMKFDIDDVIQPDWEAKSESTQKM---V 366 (518) Q Consensus 311 ---~~~~~~~l~P~~~~ie~~l~~~l~~~~------------------~~~~~~~fd~~~l~~~d~~~~~~~~~~~---~ 366 (518) +.|-...+.|++..+-..+.+.+..+. 
......+|++..+-......++..+..+ + T Consensus 451 ~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L~~~~~iv~l~~~q~~~~~~~i~~l~~~~ 530 (641) T protein:vir:94 451 SVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEYLHYPYKFLALGANYVVERERMVTDLLQLL 530 (641) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCccceeeeeeEeecchhHHHHHHHHHHHHHHHH Confidence 223344555666555444444322111 0111223333333222223333333322 2 Q ss_pred hCCCcCH--------HH----HHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCcccc Q lcl|NC_021305. 367 NSGVATP--------NE----GREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPP 434 (518) Q Consensus 367 ~~G~~T~--------NE----~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (518) +....-+ +. +.+..|++ . |. -+.. .. +.++++.+.. T Consensus 531 ~~~a~~P~v~d~~d~~~~~~~~~~~~g~~-~--p~--------~~ir-----~~--~~~~~~~~~~-------------- 578 (641) T protein:vir:94 531 DISGRVPQIGQSLDYALILEDLLRQMRFT-D--PM--------RYIK-----KA--EAPPAAPPIA-------------- 578 (641) T ss_pred HHhhcChhhhhcCCHHHHHHHHHHHhCCC-C--ch--------hhcc-----Cc--cCchhHHHHH-------------- Confidence 1111001 11 11111221 0 00 0000 00 0000000000 Q ss_pred CCccccccchhcchhhHHHHHHHHhhcccCCchhhHHH-HHHHHHhhcc-ccCcCchhHHHHHHHHHHHhHHHH Q lcl|NC_021305. 435 TSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPKH-LRAVKGAMGR-GKDIKGFALQLAEKYPDDLEDILL 506 (518) Q Consensus 435 ~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~-~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 506 (518) .++..... ...++-.+.++. .++..+. 
...+.+..++ +-+..+-+=|.+.+.--.+++=+| T Consensus 579 ~~~~q~~~----------~~~a~~~~~~~~-~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 641 (641) T protein:vir:94 579 PAEPGALP----------PEMMNSVGGGLN-DQAIAGMTPEDVSDLASRIGIDTSDVAPEAMAAATQQITSGAL 641 (641) T ss_pred HHHHHHHH----------HHHHHHHHhhhH-HHHHHHhhHHHHHHHHHhhcCCchhhhHHHHhcccccccccCC Confidence 00000000 000000000000 0110110 1111111111 111112122222222222322222 No 261 >protein:vir:97376 Length: 320 # NCBI annotation: putative portal protein # Family: family:all:11744 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762589;genbank:gi:115304290;genbank:GeneID:5130579 Probab=57.44 E-value=0.43 Score=22.57 Aligned_cols=308 Identities=11% Similarity=0.045 Sum_probs=130.4 Q ss_pred CcCCCCCCCCcccccccchhhhhhh----cccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhccCceEEEEecCC Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSY----YYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGD 76 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~l~~~v~~~~~~ 76 (518) =+|.+++.- .+.|-+.++. ...+.+...-..++ .+-+-+|++||.. +..|+... T Consensus 2 ~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~-~~~~~~~~- 59 (320) T protein:vir:97 2 GIFNFKKRE------TLTPELKESIIRQVTIEDESPFTGTTDF--------------NVRNEVAESIATY-LGAYKTSA- 59 (320) T ss_pred Ccccccccc------ccChhHHhhhhheeeeccCCCccccccc--------------chhhHHHHHHHHH-hhhhcccc- Confidence 355554311 1222222111 00000000001112 2223344444432 11122111 Q ss_pred cceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcCCceeeEEeeec Q lcl|NC_021305. 77 TETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQA 156 (518) Q Consensus 77 ~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~ 156 (518) ..+.+| .+-..|++.++.+.+..-..|++.-.. .|++ -.+.+++. |......+.. 
T Consensus 60 -------~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~------~~~~~~~~----~~~~~~~~~~ 114 (320) T protein:vir:97 60 -------KRLSLL-------TNNPSFLRRLVKHALHNKTTYVYKSPT-YGWL------ITDSMTIE----GLRARLTFTL 114 (320) T ss_pred -------ceeeee-------eCCHHHHHHHHHHhhcccceEEeeCCc-ccee------eecceeee----eeeeeEEEec Confidence 111112 223469999999999888888775432 2322 11222221 1111111111 Q ss_pred ccccC-ceeEEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCC-HHHHHH Q lcl|NC_021305. 157 GAGVG-TQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLS-EAAQQR 234 (518) Q Consensus 157 ~~~~~-~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~-~~~~~~ 234 (518) .++-+ -..++++-.| +.-.++..+|..+-+... -++ .+....-+-+.|.+....+++.+-+.. ++-.++ T Consensus 115 ~D~FN~~V~mtvpfyD-----~~ILdnpl~gv~tqe~gk-M~g---~a~~~v~kkL~~~~~IKafi~Tdid~GLee~kD~ 185 (320) T protein:vir:97 115 PDPFNSAVTMTVPFYD-----VGIIDSPLVEVDTEEANK-MLE---AAYSAVMKKLHNTGAIKAFISSDIDVGLEKMKEE 185 (320) T ss_pred CcccceeEEEEeeeec-----hhhhhhhhcccChHHhhH-HHH---HHhhhhhhhccccceeEEEEecccchhHHHHHHH Confidence 11000 0111111110 001233456666553222 122 222233444566777777777764432 333444 Q ss_pred HHHHHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHHHHHHH Q lcl|NC_021305. 235 LREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFY 314 (518) Q Consensus 235 ~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~~~ 314 (518) .+..++.+..-++.=.+.-+++.|-+++++..+-.-+- ..-....+...+.-|+||..+|-. 
+..+.+...|+ T Consensus 186 ~~~kIk~mq~~A~~~nG~T~i~~~dDI~Qi~pDYS~sn-~~D~~l~~t~alS~y~m~~~IL~G------sAte~~~Iaf~ 258 (320) T protein:vir:97 186 SDSKIKAMLATAELLSGYTYIQRGDDVTQMMPDYTTSN-VTDFAAMRTFAASQLSVSDKILDG------SATDGEKVAVM 258 (320) T ss_pred HHHHHHHHHHHHHHhcCcccccCCcceeeecccccccc-hhHHHHHHHHHHhhcCCchhhccc------cCCcceeeehh Confidence 55555444332222234667788888888865433221 222334466677889999887732 33366678899 Q ss_pred HHHhhHHHHHH---HHHHHHhhhhhhcccccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC--CCCCCC Q lcl|NC_021305. 315 RDTMAIPIARI---QSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLP--RSDDPK 389 (518) Q Consensus 315 ~~~l~P~~~~i---e~~l~~~l~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~--p~~~~~ 389 (518) ...+.|+++++ +..|..++-.+ .++-|-.. .|-+.-|.+- -.|.+ |-+..| T Consensus 259 ~~~V~PLL~Q~~~~Ek~Lvy~m~~E----~FVs~mtT-------------------GG~l~S~~~~-~~~~~~~~~~~~~ 314 (320) T protein:vir:97 259 FRFVEPILEQFREYEPSLIYAMRDE----FFVSFMTT-------------------GGMLNSNRVD-GWGKEKAPNESKG 314 (320) T ss_pred hHhHHHHHHHhhhcCcceeeeeccc----eeeeeeec-------------------Cceeeccccc-ccccccCCccccC Confidence 99999999997 44443333221 12222100 1222222111 11222 222334 Q ss_pred cceeee Q lcl|NC_021305. 390 ADELYA 395 (518) Q Consensus 390 gD~~~~ 395 (518) ||+--+ T Consensus 315 ~~~~~~ 320 (320) T protein:vir:97 315 GDVGDV 320 (320) T ss_pred CcccCC Confidence 444333 No 262 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=56.20 E-value=0.46 Score=22.42 Aligned_cols=397 Identities=10% Similarity=-0.020 Sum_probs=133.5 Q ss_pred CcCCCCC-CCCcccccccchhhhhh-hcc-c---ccccccccc--cchhhhHHH-------hhcHHHHHHHHHHHHhhcc Q lcl|NC_021305. 
1 MLLANGQ-TLSAPAMAELSPQMQDS-YYY-A---PAVGMQLER--QFSLYGGIY-------KNQPWVRTVIAKRAQALAR 65 (518) Q Consensus 1 ~~f~~~~-~~~~~~~~~~~~~~~~~-~~~-~---~~~~~~~~~--~~~~~~~~~-------~~~~~v~~~v~~ia~~ia~ 65 (518) .+|.... .......+.......+. ... + ....++..+ ...+..... .....|..-...+. T Consensus 31 ~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve----- 105 (510) T protein:vir:78 31 YLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVD----- 105 (510) T ss_pred ccccCCCCcccccccCcccchHHHHHHHHHHHHHHhhcCCCCcccccCCChHHhhhcccCcchHHHHHHHHHHHH----- Confidence 4554211 11111111110000000 000 0 000000000 000000000 00000111111111 Q ss_pred CceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCceeEEEEcC Q lcl|NC_021305. 66 LPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNS 145 (518) Q Consensus 66 l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~~v~v~~~~ 145 (518) ..++..+.+- +++.-+..+..+++.+|++.+++.. .+.....||+. .+.+..+. T Consensus 106 ------------------~~~~~~l~~s----nf~~~~~~~~~~L~~~G~a~l~~~~--~~~~~~~~pl~--~y~v~~d~ 159 (510) T protein:vir:78 106 ------------------RKATQRLFQN----ASLAVLTQVIKLLIVTGNALLYRNS--DEATVVAWSLR--SYAVRRDA 159 (510) T ss_pred ------------------HHHHHHHHhc----CcHHHHHHHHHHHHhhCeEEEEEeC--CCCeEEEEEcc--eeEEeeCC Confidence 1111122222 3444455567788899999887653 34445666663 34444444 Q ss_pred Cceee--EEeee---------------------------------c-ccc-----------cCceeE----E--eccccE Q lcl|NC_021305. 146 RTGRY--EYYFQ---------------------------------A-GAG-----------VGTQLV----S--FADDEV 172 (518) Q Consensus 146 ~~~~~--~~~~~---------------------------------~-~~~-----------~~~~~~----~--~~~~ev 172 (518) .|... +..+. . ... ..+..+ . +..-.. 
T Consensus 160 ~G~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~~i~~~~~~~~~e~P~ 239 (510) T protein:vir:78 160 TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGVRVGETGRWPIHLCPY 239 (510) T ss_pred CcCeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEEEEEeecCCCCcEEEEEEEecCeeeccccccccccCCe Confidence 44221 00000 0 000 000000 0 111233 Q ss_pred EEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCe Q lcl|NC_021305. 173 VPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKT 252 (518) Q Consensus 173 ih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~ 252 (518) +-+|....++..||.||..-++..+.......+...........|..++.-++.+.+.. + ..+.. +. T Consensus 240 ~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p~g~~~~~~-------l---~~~~~---g~ 306 (510) T protein:vir:78 240 IVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDEAKGAVVDD-------Y---QDAEM---GD 306 (510) T ss_pred eeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCCccccchhh-------h---ccCCC---ce Confidence 44555555778899999999999999999988888887777676666665544433332 1 11111 11 Q ss_pred eecC--CCcceeeccCChhhHH-HHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHHH--HHHHHHHHhhHHHHHHHH Q lcl|NC_021305. 253 MVVE--EGMEPIPLQLTAVEMQ-FIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQ--MRAFYRDTMAIPIARIQS 327 (518) Q Consensus 253 ~vl~--~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~--~~~~~~~~l~P~~~~ie~ 327 (518) ++-. +.+...+++. ..+.+ ..+..+.....|..+|=+. +... 
++..-+.++- +..-....|.|.+..+.+ T Consensus 307 ~v~g~~~~v~~~~~~~-~~d~~~~~~~i~~~~~rI~~aF~~~---l~~~-~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ 381 (510) T protein:vir:78 307 YVPGGAEAVRAYERGD-YNKMAAIQQSLQAVVVRLNQAFMYG---ANQR-DAERVTAEEVRITAEEAENTLGGTYSLLAE 381 (510) T ss_pred eecCCcccccccccCc-ccchHHHHHHHHHHHHHHHHHHhhc---cccC-CCCCcCHHHHHHHHHHHHHHhhHHHHHHHH Confidence 2211 2223222332 23333 2455666677788887322 1111 1111234332 233445566677766666 Q ss_pred HHHHhhhhhhc----ccccc-----eecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccc Q lcl|NC_021305. 328 AMDKYVGQYWV----RKNRM-----KFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSA 398 (518) Q Consensus 328 ~l~~~l~~~~~----~~~~~-----~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~~gD~~~~~~n 398 (518) +|-.-|+.... +..-+ .+...-+...+...++.....+ .+....-..++ ++. ..+-. .| T Consensus 382 E~l~Pli~r~~~il~r~gl~p~p~~~~~~~~v~~is~Laraq~~~~l-----~~~~q~l~~~~--~~~--q~~~~---id 449 (510) T protein:vir:78 382 NLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSM-----LNASQVIAGLA--PIA--QLDPR---IS 449 (510) T ss_pred HHHHHHHHHHHHHHHhccCCCCCcccccceeeecccHHHHHHHHHHH-----HHHHHHHHHhc--Chh--hhhhc---CC Confidence 66544432110 00000 0011111222333333322221 11111111111 110 00000 00 Q ss_pred ccccccccccCCCCCCCCCCCCCccCCCCCCCccc-cCCccccccchhcchhhHHHHHHHHhhcccCCchhhHHHHHHHH Q lcl|NC_021305. 399 LQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSP-PTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPKHLRAVK 477 (518) Q Consensus 399 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~ 477 (518) ... ...... . .-+. +..... +.+......+.+.+..+..+...++...++..+. T Consensus 450 ~d~---~~~~~a-----------------~-~~Gv~p~~ivr----s~eev~a~~~~~~~q~~~~~~~~~a~~~~~~~~~ 504 (510) T protein:vir:78 450 LPK---MMDTIW-----------------A-AFSVDTSQFYK----SADELQAEAEEQRRQAAQAQAAQETLLEGASDMT 504 (510) T ss_pred HHH---HHHHHH-----------------H-HhCCChhhhcC----CHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc Confidence 000 000000 0 0000 000000 0000100001000000111111112222222333 Q ss_pred Hhhccc Q lcl|NC_021305. 
478 GAMGRG 483 (518) Q Consensus 478 ~~~~~~ 483 (518) .++.+. T Consensus 505 ~~~~g~ 510 (510) T protein:vir:78 505 NALAGV 510 (510) T ss_pred ccCCCC Confidence 333222 No 263 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=54.93 E-value=0.49 Score=22.27 Aligned_cols=428 Identities=9% Similarity=0.001 Sum_probs=184.8 Q ss_pred CcCCCCCC-CCccccccc-chhhhhh-------------hcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhcc Q lcl|NC_021305. 1 MLLANGQT-LSAPAMAEL-SPQMQDS-------------YYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALAR 65 (518) Q Consensus 1 ~~f~~~~~-~~~~~~~~~-~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~ 65 (518) |=....+- +..+..+.. ..|+++. |..+.......-..... ..-...|.-+..|+.++.-+ . T Consensus 1 m~~~~~q~~p~~~~fp~~~a~wV~~~D~~RlaaY~ly~d~y~n~~~el~~il~G~d--r~~~~~ps~r~~V~~~~~~L-g 77 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNIVDENDKNRVRAYDLYENIYLNSAETLKLVLRGDD--SVPILMPSGRKIVEAVHRFL-G 77 (563) T ss_pred CCccccccCCCcccccccccccCCHHHHHHHHHHHHHHHhhcCchhhhhhhcCCCc--eeeeccchHHHHHHHHHHhc-C Confidence 66654442 233322221 2233210 11111111000000000 01122233345667766444 5 Q ss_pred CceEEEEecCCcceeccchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcC---CCceEEEEeeCCceeEEE Q lcl|NC_021305. 66 LPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNK---SGTPEKLMPMHPSRVAIK 142 (518) Q Consensus 66 l~~~v~~~~~~~~~~~~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~---~G~~~~l~~l~p~~v~v~ 142 (518) -++++.-....+..... .....+++.--...++.....+...+.++.|.+.+.+.+|. .|.-+.+..++|..+... 
T Consensus 78 ~~~~~~Ve~~~~de~~~-~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~~g~R~rv~~vDP~~~fp~ 156 (563) T protein:vir:74 78 VGFDYLVEPDMGDEGIR-QSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKKAGERISVDEVDPRQIFLI 156 (563) T ss_pred CCcEEecCccccCcchH-HHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccccCCCceEeecCCceeeec Confidence 56665433333222211 12334444444456666777777888899999999999884 344566677777755544 Q ss_pred EcCCceeeEE------------------------eee--cccccCcee------EE------------------------ Q lcl|NC_021305. 143 RNSRTGRYEY------------------------YFQ--AGAGVGTQL------VS------------------------ 166 (518) Q Consensus 143 ~~~~~~~~~~------------------------~~~--~~~~~~~~~------~~------------------------ 166 (518) .+.+...-.| .+. ......+.. .. T Consensus 157 ~dpd~v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae~w~lg~wd~r~~~~~~~~~~~~~~~~~ 236 (563) T protein:vir:74 157 EDGSTVVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELTHWTLGNWDDRGAISDEQARRKEQVRSA 236 (563) T ss_pred cCCCCcccceeeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccchhccccccccCccchhhhcccchhhhh Confidence 4333221111 000 000000000 00 Q ss_pred ------------eccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHH Q lcl|NC_021305. 167 ------------FADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQR 234 (518) Q Consensus 167 ------------~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~ 234 (518) +.-=.++|++...+-+..+|.|-|.-+...+...+.+..-......-+|.|-.++......+. . T Consensus 237 ~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~~~~p~d~-~--- 312 (563) T protein:vir:74 237 QHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMYVTNASAPVDP-N--- 312 (563) T ss_pred hhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeEEecccccccc-c--- Confidence 000125677776667778999999999988888876666666666666666666653222111 0 Q ss_pred HHHHHHHHhcCccccCCeeecCCC---cceeeccCChhhHHHHHHHHHHHH-HHHHHhcCCHHHhcccccccc-CCHHHH Q lcl|NC_021305. 
235 LREQFDRAHSGSSNTGKTMVVEEG---MEPIPLQLTAVEMQFIEARQLNRE-EVCGVYDIAPPIVHILDRATF-SNISAQ 309 (518) Q Consensus 235 ~~~~~~~~~~g~~n~g~~~vl~~g---~~~~~l~~~~~d~~~~e~~~~~~~-~Ia~~fgVPp~~lg~~~~~~~-sn~e~~ 309 (518) +.+.... .-..|.++-+++. ..+..++..++-..+..-++...+ .|+..=++|..-+|-.+.++- |.+ T Consensus 313 -~g~~~~w---~vgpG~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s~tPavA~G~vD~~~~~SGi--- 385 (563) T protein:vir:74 313 -TGELTDW---NIGPMQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGSGTPEVAIGRVDVTSAESGI--- 385 (563) T ss_pred -ccccccc---ccCCceeEeccCCccccceeeecchhhhHHHHHHHHHHHHHHHHhhccCcceeecccccccccchh--- Confidence 0111111 1123455555532 445556655444333333333333 567777899988884443321 111 Q ss_pred HHHHHHHHhhHHHH-----------HHHH---HHHHhhhhhhcc------------------cccceecchhhhhcCHHH Q lcl|NC_021305. 310 MRAFYRDTMAIPIA-----------RIQS---AMDKYVGQYWVR------------------KNRMKFDIDDVIQPDWEA 357 (518) Q Consensus 310 ~~~~~~~~l~P~~~-----------~ie~---~l~~~l~~~~~~------------------~~~~~fd~~~l~~~d~~~ 357 (518) -+.-.|.|++. .+-+ -....||+..++ ...+.+-+.+.+..|... T Consensus 386 ---ALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~~v~ivf~p~~P~d~~~ 462 (563) T protein:vir:74 386 ---SLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNECSVVCIFADPMPVNKTQ 462 (563) T ss_pred ---hhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCceEEEEEeCCCCCccHHH Confidence 11122333322 0111 111112211111 112344577889999999 Q ss_pred HHHHHHHHHhCCCcCHHHHHHHh---CCCCCCCCCccee--eeccccc-------ccccccccCCCCCCCCCCCCCccCC Q lcl|NC_021305. 358 KSESTQKMVNSGVATPNEGREIM---GLPRSDDPKADEL--YANSALQ-------PLGATPDGAVEWEEAPAPKRPASTP 425 (518) Q Consensus 358 ~~~~~~~~~~~G~~T~NE~R~~~---g~~p~~~~~gD~~--~~~~n~~-------~~~~~~~~~~~~~~~~~~~~~~~~~ 425 (518) ..+....++.+|+++.--+-+++ |++- ++. -++. .....+. ..+..-+.+..+++. -+.+..+.. 
T Consensus 463 vv~~~~tl~~aGiiSretAv~~L~~~g~~~-pda-e~e~~~ie~~~i~~~~~a~a~ad~~~~~~a~~~~g-~~~~~~dd~ 539 (563) T protein:vir:74 463 VTQDTLLLQQAHLILRKMAVAKLRSIGWEY-PEV-DDQGNALTDDDIADMLLAEAEADASLGLSAMDNGG-AGEQQFDDQ 539 (563) T ss_pred HHHHHHHHHHcCchhHHHHHHHHHhCCCCC-CcH-HHHHhhcCHHHHHHHHHHHhhccCcccceecccCC-CCccccccc Confidence 99999999999999999887777 6532 210 1111 0000000 000000000000000 000000000 Q ss_pred CCCCCccccCCccccccchhcchhhH Q lcl|NC_021305. 426 VASLDQSPPTSVPGLSPTNSDRSTDS 451 (518) Q Consensus 426 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (518) ++.-++=.+ ..+..++...-+-.- T Consensus 540 g~p~~~~~~--~~~~~~~~~~~~~~~ 563 (563) T protein:vir:74 540 GNPIDQFGN--PVEIPPDVTQVPLSP 563 (563) T ss_pred CCchhHcCC--cccCCccccccCCCC Confidence 000000000 000011100000000 No 264 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=54.18 E-value=0.51 Score=22.19 Aligned_cols=398 Identities=11% Similarity=0.064 Sum_probs=160.5 Q ss_pred CcCCCCCCCC----ccc---ccccchhhhhhhcccccc-----cccc-cccchh-hhHHHhhc----HHHHHHHHHHHHh Q lcl|NC_021305. 1 MLLANGQTLS----APA---MAELSPQMQDSYYYAPAV-----GMQL-ERQFSL-YGGIYKNQ----PWVRTVIAKRAQA 62 (518) Q Consensus 1 ~~f~~~~~~~----~~~---~~~~~~~~~~~~~~~~~~-----~~~~-~~~~~~-~~~~~~~~----~~v~~~v~~ia~~ 62 (518) ||-.+|+... -|. ...-...+++..++.... +.+. ...... ....++.- +++...++.++.. T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~ 80 (491) T protein:vir:95 1 MLTANGQGSGVKTKHREWLHYAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVGS 80 (491) T ss_pred CcccCCccCCCCccCHHHHHHHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhch Confidence 8887776332 221 111222344555542211 1111 001111 12223333 3445555555554 Q ss_pred hccCceEEEEecCCcceeccchHHHHHHhcCC-cCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCc-----------eEE Q lcl|NC_021305. 
63 LARLPVKCMFTSGDTETEESDTGYAKLLADPC-EYLDPFAFWEWVASTLDIYGETYLAIQKNKSGT-----------PEK 130 (518) Q Consensus 63 ia~l~~~v~~~~~~~~~~~~~~~~~~L~~~PN-~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~-----------~~~ 130 (518) +-+-|+.+ +.. ..+..|+.... ...+..+|.+.++...+.+|.+++++.....+. --. T Consensus 81 vfrk~p~~---------~~p-~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ade~~~~~rPy 150 (491) T protein:vir:95 81 VMRKEPEI---------NIP-KELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAEQNAGLLNPT 150 (491) T ss_pred hhcCCcee---------ecc-HHHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHHHHHhcCCcE Confidence 44444433 112 22334555444 356899999999999999999999998765442 112 Q ss_pred EEeeCCceeE-----------------EEE-----cC---Ccee--eEEee-------------ecccccCcee---EEe Q lcl|NC_021305. 131 LMPMHPSRVA-----------------IKR-----NS---RTGR--YEYYF-------------QAGAGVGTQL---VSF 167 (518) Q Consensus 131 l~~l~p~~v~-----------------v~~-----~~---~~~~--~~~~~-------------~~~~~~~~~~---~~~ 167 (518) +..+.|..|. +.. +. .+.. ..|+. +.....++.. ..+ T Consensus 151 ~~~~~~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~g~~~~~~~~~ 230 (491) T protein:vir:95 151 IAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQRLFRFDAEGGAQEEVVEI 230 (491) T ss_pred EEEechhhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeecCCCceEEEEEEEcCCCcceeeeeee Confidence 3333333221 000 00 0000 00000 0000000000 000 Q ss_pred cc-------c--cEEEEeccCCCCcccCchHHHHHHHH-HHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHH Q lcl|NC_021305. 168 AD-------D--EVVPIRFFNPDGLERGLSLMESLKST-IFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLRE 237 (518) Q Consensus 168 ~~-------~--evih~~~~~~~~~~~G~s~l~~~~~~-i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~ 237 (518) .+ . .++.+-. ..++...|.||+..+... +.+......+. ..+...+.|-.++.-....+++..+.... 
T Consensus 231 ~~~~g~~~l~~IPfv~~~~-~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~-~~l~~~~~P~l~~~G~d~~~~~~~~~~~~ 308 (491) T protein:vir:95 231 YPDLGESLRGVIPFTFIGA-TNNDATIDDAPLLPLAELNIGHYRNSADNE-ESSFVVGQPTLFIYPGDNLTPQSFKEANP 308 (491) T ss_pred eecCCCcccCeeEEEEEec-CCCCCCCCcCchHHHHHHHHHHhhhhhHHH-HHHHHcccceeeeecCcccCcchhhccCc Confidence 00 0 1111211 223445678888776654 44444444444 33444567777765444344333222111 Q ss_pred HHHHHhcCccccCCeeecCCCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH--HHHHHHH Q lcl|NC_021305. 238 QFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA--QMRAFYR 315 (518) Q Consensus 238 ~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~--~~~~~~~ 315 (518) ....-|. +..+.++.+.++.-+..+.+.+. .+.++....+ ++..|. .++- .++..+.++ .....-+ T Consensus 309 --~~i~~g~---~~~~~lP~~~~~~~ie~~~~~~~-~~~l~~~e~q-m~~~Ga--~l~~---~~~~~Ta~~~~~~~~~~~ 376 (491) T protein:vir:95 309 --NGIKFGS---RCGHNLGYGGSAQLIQAGENNLA-RQNMLDKEQQ-AIQIGA--QLIT---PSQQITAESARIQRGADT 376 (491) T ss_pred --ceeEecC---cCCcCCCCCCccceeecCcchHH-HHHHHHHHHH-HHHHHH--Hhcc---CCcchhHHHHHHHHHHhh Confidence 0111222 22455666555444443333332 1112221122 111221 2331 111123332 2333445 Q ss_pred HHhhHHHHHHHHHHHHhhh--hhh-cc--cccc--eecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Q lcl|NC_021305. 316 DTMAIPIARIQSAMDKYVG--QYW-VR--KNRM--KFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDP 388 (518) Q Consensus 316 ~~l~P~~~~ie~~l~~~l~--~~~-~~--~~~~--~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~~~ 388 (518) ..|.-++..++++++..|- ..+ +. ...+ .++.+.....-.......+-++++.|.++....++.+-.--+.+. 
T Consensus 377 S~L~~~a~~~e~al~~~l~~~a~w~G~~~~~~v~i~~n~dF~~~~~~~~~~~all~~~~~G~is~~t~~~~L~~~~vl~~ 456 (491) T protein:vir:95 377 SVMATIARNVSQAYTDALRWVAMMLGKPEDSEVEFQLNMDFFLQPMTAQDRAAWMADINAGLLPATAYYAALRKAGVTDW 456 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCceEEEeecccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCc Confidence 6788888888888887642 222 11 1223 334443333333345777778889999999988876632112111 Q ss_pred CcceeeecccccccccccccC-CCCCCCCCCCCCccCCCCCCCccccCCccccc Q lcl|NC_021305. 389 KADELYANSALQPLGATPDGA-VEWEEAPAPKRPASTPVASLDQSPPTSVPGLS 441 (518) Q Consensus 389 ~gD~~~~~~n~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (518) --+.. .+...+.. ..+...+..++-.. . ++.+.+ T Consensus 457 ~~e~~--------~~~ie~~~~~~~~~~~~~~~~~~-----~------~~~~~~ 491 (491) T protein:vir:95 457 TDEDI--------LNAIEDAPLPSGAVTQVAGEIPQ-----A------AQQQQE 491 (491) T ss_pred cHHHH--------HHHHHhcCCCCCccccccccchh-----h------hhhccC Confidence 00000 00000000 00000000000000 0 000000 No 265 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=53.14 E-value=0.54 Score=22.07 Aligned_cols=447 Identities=10% Similarity=0.018 Sum_probs=140.8 Q ss_pred CcCCCCCCCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhcc-----Cc-eEEEEec Q lcl|NC_021305. 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALAR-----LP-VKCMFTS 74 (518) Q Consensus 1 ~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia~-----l~-~~v~~~~ 74 (518) +-++.......+. .|+|+ .+.|+..|+.|...+.. =+ +++.... 
T Consensus 44 ~y~g~~~~~~~~~---~s~~~---------------------------~~~v~~~v~~~~~~l~~~~~~~~~~~~~~p~~ 93 (705) T protein:vir:88 44 YYFGEPFGNERPG---KSGIV---------------------------SRDVQETVDWIMPSLMKVFTSGGQVVKYEPDT 93 (705) T ss_pred HHhCCCCCcccCC---CCccc---------------------------cHHHHHHHHHHHHHHHHhhcCCCceEEEeeCC Confidence 4444322211111 11111 12344444444443322 22 2332222 Q ss_pred CCcceec----cchHHHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCC------------------------- Q lcl|NC_021305. 75 GDTETEE----SDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKS------------------------- 125 (518) Q Consensus 75 ~~~~~~~----~~~~~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~------------------------- 125 (518) .++... .....+++. .....+.++..++.+.+++|++++-+-++.. T Consensus 94 -~~D~~~a~~~~~~~~~~~~----~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~~~e~~~~~~~~~l~~~~~d~~~ 168 (705) T protein:vir:88 94 -AEDVEQAEQETEYVNYLFM----RKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFSGLSEDMVADILSDPDT 168 (705) T ss_pred -hhHHHHHHHHHHHHhHHHh----hccchhHHHHHHHHHHhhcCCeEEEeccccccchhhhhhccCChhhhhhhhhhhhh Confidence 122211 111222232 2333567788889999999999875544211 Q ss_pred -----------------------CceEEEEeeCCceeEEEEcC----Cceee---------------------------- Q lcl|NC_021305. 126 -----------------------GTPEKLMPMHPSRVAIKRNS----RTGRY---------------------------- 150 (518) Q Consensus 126 -----------------------G~~~~l~~l~p~~v~v~~~~----~~~~~---------------------------- 150 (518) |. +.+..|+|..+.+.++. +.... T Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~-i~i~~V~p~d~~~dp~a~~~~d~~~~~~~~~~t~~dl~~~g~~~~~~~~~~~~~ 247 (705) T protein:vir:88 169 SILAQSVDDDGTYTIKIRKDKKKRE-IKVLCVKPENFLVDRLATCIDDARFLCHREKYTVSDLRLLGVPEDVIEELPYDE 247 (705) T ss_pred hcccccccccceeeeEEeeeeecCc-eeeeeccHHHceecCCCCCcccCcEEEEEEeccHHHHHhhcCChhHhhhhhccc Confidence 22 23444444433322210 00000 Q ss_pred -------------------------------------EEeeecccc-c-Cce----eEEeccccE-----------EEEe Q lcl|NC_021305. 
151 -------------------------------------EYYFQAGAG-V-GTQ----LVSFADDEV-----------VPIR 176 (518) Q Consensus 151 -------------------------------------~~~~~~~~~-~-~~~----~~~~~~~ev-----------ih~~ 176 (518) .|.++.... . .+. .+.+..+.| +.+. T Consensus 248 ~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~~~d~~~d~~~~~~~~~~~g~~il~~~~~~~~PF~~~~ 327 (705) T protein:vir:88 248 YEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISELRRILYVGDYIISNEPWDCRPFADLN 327 (705) T ss_pred ccchhhhhhhccccccccccccccccccccCCceeEEEEEeeeEecccCCcceeeEEEEEeCccccccccCCCCCEEEec Confidence 000000000 0 000 000111111 1222 Q ss_pred ccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhcCccccCCeeecC Q lcl|NC_021305. 177 FFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVE 256 (518) Q Consensus 177 ~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~ 256 (518) .....+..||.|++..+...-...................|.+++. ++.+++++. + . ...|+++.+. T Consensus 328 ~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~-~g~v~~~d~------~-~-----~~pg~vv~~~ 394 (705) T protein:vir:88 328 AYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVL-DGQVNLEDL------L-T-----NEAAGIVRVK 394 (705) T ss_pred ceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceecc-ccccCcccc------c-c-----cCCCeeEEec Confidence 2222344689999999998888777777777777777677777763 444544331 1 1 1234555444 Q ss_pred CCcceeeccCChhhHHHHHHHHHHHHHHHHHhcCCHHHhccccccccCC--HH--HHH------------HHHHHHHhhH Q lcl|NC_021305. 257 EGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSN--IS--AQM------------RAFYRDTMAI 320 (518) Q Consensus 257 ~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn--~e--~~~------------~~~~~~~l~P 320 (518) .+-.+.++...........+..+....+-...||+....|....+..++ .. ... 
+.|....+.+ T Consensus 395 ~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a~~~~~~ 474 (705) T protein:vir:88 395 SMNSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFAETGVKR 474 (705) T ss_pred CCCccccccCCcCcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3333333433333333455666777788889999999998653221111 11 011 1111223444 Q ss_pred HHHHHHHHHHHhhhhhhc---ccccceec---------ch---hhhhcCHHHHHHHHHHHHh-------C----CCcCHH Q lcl|NC_021305. 321 PIARIQSAMDKYVGQYWV---RKNRMKFD---------ID---DVIQPDWEAKSESTQKMVN-------S----GVATPN 374 (518) Q Consensus 321 ~~~~ie~~l~~~l~~~~~---~~~~~~fd---------~~---~l~~~d~~~~~~~~~~~~~-------~----G~~T~N 374 (518) +++.+-..+..++-.+.. .+.++.++ +. .+-......+...+..+.. . .+++.. T Consensus 475 l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~~~~~~~eq~~a~l~~ll~~~q~l~~~~~~~~~~~~~ 554 (705) T protein:vir:88 475 LFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVGIGNMNKDQQMLHLMRIWEMAQAVVGGGGLGVLVSEQ 554 (705) T ss_pred HHHHHHHHHHHhCCCceEEeeccchhccchHhhccCCceEEeeccccchHHHHHHHHHHHHHHHHHhhcccchhhhcChH Confidence 444443322222211100 01111111 10 0111111222222222111 0 011111 Q ss_pred HHHHHhCCCCCCCCCcceeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccc--cchhcchhhHH Q lcl|NC_021305. 375 EGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLS--PTNSDRSTDSG 452 (518) Q Consensus 375 E~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 452 (518) +.++.+. .+...... .........+...+. ............... ........... 
T Consensus 555 ~~~~~~~----------------el~e~~~~-k~~~~~~~~~~~~e~-----~~~~~~~~q~e~~~~~~~~~~q~e~~k~ 612 (705) T protein:vir:88 555 NLYNILK----------------EVTENAGY-KDPDRFWTNPNSPEA-----LQAKAIREQKEAQPKPEDIKAQADAQRA 612 (705) T ss_pred HHHHHHH----------------HHHHhhhh-hhHHHHhhhhhhHHH-----HHHHHhhhhhhhhHHHHHHHHHHHHHHH Confidence 1111100 00000000 000000000000000 000000000000000 00000000000 Q ss_pred HHHHHHhhcccCC--chhhHHHHH-----HHHHhhccccCcCchhHH------HHHHHHHHHhHHHHhhhhhhhcccCC Q lcl|NC_021305. 453 KTEPRRLMQKPPP--KESSPKHLR-----AVKGAMGRGKDIKGFALQ------LAEKYPDDLEDILLAVQLALAERKDN 518 (518) Q Consensus 453 ~~~~~~~~~k~~~--~~~~~~~~~-----~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~ 518 (518) +.+.++...+... .+.+.++++ .-....++.+..+-.-++ .+++.+...+.-+-+.|.-..++++. T Consensus 613 q~e~~~~q~e~q~~q~E~q~~q~e~e~~~~~~~~~~~e~~~~~a~~~~~~~~~e~e~~~~e~e~~~e~~q~~~~~~~~~ 691 (705) T protein:vir:88 613 QSDALAKQAEAQMKQVEAQIRLAEIELKKQEAVLQQREMALKEAELQLERDRFTWERARNEAEYHLEATQARAAYIGDG 691 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0111000000000 011111000 000001111111100000 00111111111111111111111111 No 266 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=30.63 E-value=1.6 Score=19.52 Aligned_cols=419 Identities=8% Similarity=-0.018 Sum_probs=143.7 Q ss_pred CcCCCCC-CCCcccccccchhhhhhhcccccccccccccchhhhHHHhhcHHHHHHHHHHHHhhc------cCceEEEEe Q lcl|NC_021305. 1 MLLANGQ-TLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALA------RLPVKCMFT 73 (518) Q Consensus 1 ~~f~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~v~~ia~~ia------~l~~~v~~~ 73 (518) +||..-+ .+=...++..+.++....+..+..+. .+.. .... -... 
+--.|++.+|..+- .-||.=..- T Consensus 7 ~l~~k~~R~~~e~~w~e~a~~~lP~~~~~~~~~~--~~~~-~~~~-~~ds-tg~~a~~~LAa~l~~~ltpp~~~WF~l~~ 81 (514) T protein:vir:80 7 AMWAEYRDSTAIRKAEDFAKFTIASLMVDPLDKT--HQAE-VVEY-DFQS-AGAFLVNNLTAKLALTLFPPGRPSFQIEL 81 (514) T ss_pred HHHHHhhcchHHHHHHHHHHHhcccccCCCCCCc--cccc-cccc-ccch-hHHHHHHHHHHHHHhhhcCCCCccccccc Confidence 5655322 11122222333333221111111000 0000 0011 1222 33456666666662 334443332 Q ss_pred cCCccee---------ccchH-------HHHHHhcCCcCCCHHHHHHHHHHHHHHcCCeEEEEEEcCCCceEEEEeeCCc Q lcl|NC_021305. 74 SGDTETE---------ESDTG-------YAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPS 137 (518) Q Consensus 74 ~~~~~~~---------~~~~~-------~~~L~~~PN~~~s~~~f~~~~v~~ll~~G~~~~~i~r~~~G~~~~l~~l~p~ 137 (518) ++...+. .-..+ ++..+.+ -+++.-+..+..+++.+|++.+++..+. .....||+ . T Consensus 82 ~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~----snf~~~~~~~~~~L~~~G~a~l~~~~~~--~~~~~~pl--~ 153 (514) T protein:vir:80 82 DDTLQELAAANGIDQSELHSRTADLERRATRRLFV----NASLSKLHRILKLLVVTGNALFYREPGT--GKMLVWTM--Q 153 (514) T ss_pred CchhhhhccccchhHHHHHHHHHHHHHHHHHHHHh----cCcHHHHHHHHHHHHhHCeEEEEEecCC--CcEEEEEc--C Confidence 2211100 00111 1222233 2455556677788999999998875432 22445555 2 Q ss_pred eeEEEEcCCceeeE--Ee---------------------------------eeccc------------ccCcee------ Q lcl|NC_021305. 138 RVAIKRNSRTGRYE--YY---------------------------------FQAGA------------GVGTQL------ 164 (518) Q Consensus 138 ~v~v~~~~~~~~~~--~~---------------------------------~~~~~------------~~~~~~------ 164 (518) .+.+..+..|.... .. ..... ...+.. T Consensus 154 ~y~v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~g~~i~~es~ 233 (514) T protein:vir:80 154 SYTVRRTSHGDPAVVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTVIEWQPTPNGKRCAVWHELEGKRVGPESS 233 (514) T ss_pred eEEEeeCCCcCeEEEEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEEEEeecCCCCeEEEEEEeccceeecccCc Confidence 33344443332210 00 00000 000000 Q ss_pred EEeccccEEEEeccCCCCcccCchHHHHHHHHHHHHHHHHHHHHHHHHccCCcccccccCccCCHHHHHHHHHHHHHHhc Q lcl|NC_021305. 
165 VSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHS 244 (518) Q Consensus 165 ~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~ 244 (518) +.+..-..+-+|....++..||.||..-++..+.......+.......-...|..++..++.+.+.. +. . T Consensus 234 y~~~e~P~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~~g~~~~~~-------l~---~ 303 (514) T protein:vir:80 234 YPAHLCPYVPVAWNVPDGEHYGRGYVEEYSGDFARLSILSERLGLYEFEALSLLNLVDEAKGGAVDD-------YR---D 303 (514) T ss_pred cccccCCeeeeeeEecCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCCceeCcccccchhh-------hc---c Confidence 0011123344455555777899999999999999999888888887777676666665544433322 11 1 Q ss_pred CccccCCeeec--CCCcceeeccCChhhHH-HHHHHHHHHHHHHHHhcCCHHHhccccccccCCHHH--HHHHHHHHHhh Q lcl|NC_021305. 245 GSSNTGKTMVV--EEGMEPIPLQLTAVEMQ-FIEARQLNREEVCGVYDIAPPIVHILDRATFSNISA--QMRAFYRDTMA 319 (518) Q Consensus 245 g~~n~g~~~vl--~~g~~~~~l~~~~~d~~-~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~--~~~~~~~~~l~ 319 (518) +.. +.++- ++++...+++.. .+.+ ..+..+.....|..+|=+. ....++..-+.++ .+..-....|. T Consensus 304 ~~~---g~~v~g~~~~v~~~~~~~~-~d~~~~~~~i~~~~~rI~~aFml~----~~~rd~~rvTAtEV~~r~~E~~~~LG 375 (514) T protein:vir:80 304 AET---GDFVPGQVGSVASYERGDY-NKIAQASASVESIVMRLNRAFMYT----GQVRDAERVTVEEIRTVAEEAENLLG 375 (514) T ss_pred cCC---ceeecCCCccceeeecCcc-cchHHHHHHHHHHHHHHHHHHhhh----ccCCCCCCCCHHHHHHHHHHHHHHhh Confidence 111 11222 233444444432 2333 2456666777787777322 1111111113332 22334444566 Q ss_pred HHHHHHHHHHHHhhhhhhc----c---cccceecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCcc Q lcl|NC_021305. 320 IPIARIQSAMDKYVGQYWV----R---KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIM-GLPRSDDPKAD 391 (518) Q Consensus 320 P~~~~ie~~l~~~l~~~~~----~---~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~-g~~p~~~~~gD 391 (518) |.+..+.++|-.-|+.... + +.-..+. .++...+..+-.+.+..+... 
-.+..+-..+ .+.+....-.| T Consensus 376 pv~~rl~~Ell~Pli~r~~~il~r~~~g~lP~~p-~~l~~~~~vs~la~l~r~~~~--~~l~~~~~~i~~l~~~~p~v~d 452 (514) T protein:vir:80 376 GVYSLLAETLQAPLAYLTMYEASRGNGGMLLGIA-QGVYRPSIITGIPALTRNIET--ANILRATQEASAIVPALVQLSK 452 (514) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCC-chhhcceeeecHHHHHHHHHH--HHHHHHHHHHHHHhccchhhhh Confidence 6666666555433321100 0 0000000 011111111111111111100 0000000000 01111000011 Q ss_pred eeeecccccccccccccCCCCCCCCCCCCCccCCCCCCCccccCCccccccchhcchhhHHHHHHHHhhcccCCchhhHH Q lcl|NC_021305. 392 ELYANSALQPLGATPDGAVEWEEAPAPKRPASTPVASLDQSPPTSVPGLSPTNSDRSTDSGKTEPRRLMQKPPPKESSPK 471 (518) Q Consensus 392 ~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~ 471 (518) .+ |. +...+..... .+.+....- +..+..+..+++....+. +.+.- T Consensus 453 ~i----d~---d~~~~~~a~~------------------~Gvp~~~i~-------~~~e~~~~~~~~~~~~~~--~~~~~ 498 (514) T protein:vir:80 453 RF----DP---EKLVERIFAN------------------NSVDLSTLS-------KDPDVVAAEAEQEAALAQ--QQLDV 498 (514) T ss_pred cC----CH---HHHHHHHHHH------------------hCCCHhhcc-------CCHHHHHHHHHHHHHHHH--HHHHH Confidence 10 00 0000000000 000000000 000000001111000000 00000 Q ss_pred HHHHHHHhhccccCcCchhHHHHHHHHHHHhH Q lcl|NC_021305. 472 HLRAVKGAMGRGKDIKGFALQLAEKYPDDLED 503 (518) Q Consensus 472 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (518) .+-+..+..|.++ |.| T Consensus 499 ~~~~~~~~~~~~~----------------~~~ 514 (514) T protein:vir:80 499 ASGALAAETSAGV----------------LTS 514 (514) T ss_pred HHHHHHHhhhccc----------------cCC Confidence 0001111111111 111 Done!