Query lcl|NC_019710.1_cdsid_YP_007111706.1 [gene=F844_gp03] [protein=portal protein] [protein_id=YP_007111706.1] [location=2100..3374]
Match_columns 424
No_of_seqs 140 out of 1016
Neff 9.5
Searched_HMMs 1612
Date Thu Nov 7 16:13:59 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_3 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_3_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:1884 Length: 424 # 100.0 1E-121 6E-125 683.9 47.2 424 1-424 1-424 (424)
2 protein:vir:189 Length: 424 # 100.0 3E-121 2E-124 681.5 47.0 424 1-424 1-424 (424)
3 protein:vir:4337 Length: 434 # 100.0 5E-101 3E-104 570.6 43.6 419 1-424 1-421 (434)
4 protein:vir:81072 Length: 432 100.0 3E-100 2E-103 565.9 44.5 410 8-424 1-421 (432)
5 protein:vir:10362 Length: 432 100.0 9E-100 5E-103 563.7 44.8 410 8-424 1-421 (432)
6 protein:vir:81152 Length: 411 100.0 4E-100 3E-103 565.3 43.0 399 14-423 1-411 (411)
7 protein:vir:97060 Length: 432 100.0 1.3E-99 8E-103 562.7 44.9 410 8-424 1-421 (432)
8 protein:vir:105064 Length: 421 100.0 7E-100 4E-103 564.3 43.1 403 14-424 1-412 (421)
9 protein:vir:5737 Length: 419 # 100.0 1.7E-99 1E-102 562.1 44.9 404 14-424 1-408 (419)
10 protein:vir:100150 Length: 437 100.0 2E-99 1E-102 561.7 44.2 415 1-424 1-424 (437)
11 protein:vir:105002 Length: 432 100.0 3.4E-99 2E-102 560.5 45.1 406 14-424 1-423 (432)
12 protein:vir:102855 Length: 432 100.0 3.4E-99 2E-102 560.5 45.1 406 14-424 1-423 (432)
13 protein:vir:107605 Length: 432 100.0 3.4E-99 2E-102 560.5 45.1 406 14-424 1-423 (432)
14 protein:vir:4454 Length: 414 # 100.0 2.6E-99 2E-102 561.1 43.6 403 14-424 1-409 (414)
15 protein:vir:4509 Length: 424 # 100.0 9E-99 6E-102 558.1 45.6 415 1-424 1-421 (424)
16 protein:vir:1431 Length: 419 # 100.0 5.3E-99 3E-102 559.4 43.1 402 15-424 1-407 (419)
17 protein:vir:102080 Length: 429 100.0 1.3E-98 8E-102 557.3 43.6 406 14-424 1-420 (429)
18 protein:vir:1380 Length: 422 # 100.0 5.6E-98 3E-101 553.8 44.0 402 14-424 1-421 (422)
19 protein:vir:1266 Length: 416 # 100.0 1.2E-97 7E-101 551.9 42.5 402 15-424 1-406 (416)
20 protein:vir:483 Length: 413 # 100.0 1.6E-97 1E-100 551.3 43.0 401 15-424 1-406 (413)
21 protein:vir:80333 Length: 419 100.0 1.5E-97 9E-101 551.4 42.7 402 15-424 1-407 (419)
22 protein:vir:100249 Length: 431 100.0 2.3E-97 1E-100 550.4 42.7 400 14-422 1-431 (431)
23 protein:vir:93610 Length: 454 100.0 2.9E-97 2E-100 549.8 42.4 401 16-424 1-415 (454)
24 protein:vir:6240 Length: 457 # 100.0 2.5E-96 1.6E-99 544.7 43.4 406 14-424 1-425 (457)
25 protein:vir:102118 Length: 409 100.0 2.6E-96 1.6E-99 544.6 42.7 397 16-424 1-408 (409)
26 protein:vir:1326 Length: 457 # 100.0 4E-96 2.5E-99 543.6 43.0 406 14-424 1-425 (457)
27 protein:vir:8418 Length: 409 # 100.0 2.2E-94 1.4E-97 534.0 44.1 397 14-424 1-404 (409)
28 protein:vir:98396 Length: 441 100.0 2.8E-94 1.8E-97 533.5 41.9 410 1-424 1-437 (441)
29 protein:vir:2683 Length: 412 # 100.0 6.2E-94 3.9E-97 531.6 43.4 401 8-424 1-406 (412)
30 protein:vir:7853 Length: 518 # 100.0 9.6E-94 6E-97 530.5 42.8 397 21-424 1-423 (518)
31 protein:vir:101648 Length: 518 100.0 1.2E-93 7.2E-97 530.1 42.8 396 21-424 1-423 (518)
32 protein:vir:79984 Length: 441 100.0 1.2E-93 7.6E-97 530.0 42.7 410 1-424 1-437 (441)
33 protein:vir:9408 Length: 441 # 100.0 1.2E-93 7.6E-97 530.0 42.7 410 1-424 1-437 (441)
34 protein:vir:4598 Length: 416 # 100.0 1.1E-93 6.7E-97 530.3 41.7 394 14-424 1-412 (416)
35 protein:vir:81095 Length: 416 100.0 1.1E-93 6.7E-97 530.3 41.7 394 14-424 1-412 (416)
36 protein:vir:93943 Length: 409 100.0 6.3E-93 3.9E-96 526.1 43.6 398 10-424 1-403 (409)
37 protein:vir:96980 Length: 409 100.0 6.9E-93 4.3E-96 525.8 43.5 396 10-424 1-403 (409)
38 protein:vir:94426 Length: 409 100.0 1.1E-92 6.8E-96 524.7 43.4 396 8-424 1-403 (409)
39 protein:vir:81218 Length: 423 100.0 2.3E-92 1.5E-95 522.9 43.8 403 14-422 1-423 (423)
40 protein:vir:3868 Length: 417 # 100.0 2.5E-91 1.5E-94 517.3 41.3 391 8-424 1-404 (417)
41 protein:vir:101647 Length: 460 100.0 9.2E-91 5.7E-94 514.2 41.9 404 16-424 1-459 (460)
42 protein:vir:9702 Length: 406 # 100.0 1.3E-89 8.2E-93 507.9 41.7 387 14-424 1-402 (406)
43 protein:vir:8317 Length: 409 # 100.0 4.2E-89 2.6E-92 505.1 40.5 374 14-407 1-409 (409)
44 protein:vir:960 Length: 413 # 100.0 1.7E-88 1.1E-91 501.7 40.7 402 1-424 4-413 (413)
45 protein:vir:94666 Length: 723 100.0 5.8E-88 3.6E-91 498.9 40.8 380 29-424 1-408 (723)
46 protein:vir:80134 Length: 403 100.0 9.2E-88 5.7E-91 497.8 40.6 388 14-424 1-398 (403)
47 protein:vir:95378 Length: 406 100.0 4.6E-87 2.8E-90 493.9 42.8 391 14-424 1-401 (406)
48 protein:vir:8100 Length: 466 # 100.0 2.3E-86 1.4E-89 490.1 41.7 401 14-424 1-466 (466)
49 protein:vir:102727 Length: 945 100.0 1.1E-85 6.6E-89 486.4 43.0 411 1-424 73-524 (945)
50 protein:vir:6210 Length: 394 # 100.0 9E-85 5.6E-88 481.3 39.8 384 14-424 1-390 (394)
51 protein:vir:104259 Length: 403 100.0 2E-84 1.2E-87 479.5 40.6 386 14-424 1-399 (403)
52 protein:vir:9359 Length: 348 # 100.0 5.2E-84 3.3E-87 477.2 39.0 337 74-424 1-342 (348)
53 protein:vir:3843 Length: 397 # 100.0 9.2E-84 5.7E-87 475.8 40.3 380 14-424 1-387 (397)
54 protein:vir:100187 Length: 385 100.0 2.1E-83 1.3E-86 473.8 39.6 377 14-422 1-385 (385)
55 protein:vir:100882 Length: 383 100.0 8.5E-83 5.3E-86 470.5 39.8 376 14-417 1-383 (383)
56 protein:vir:1082 Length: 359 # 100.0 2.9E-82 1.8E-85 467.6 38.2 352 14-396 1-359 (359)
57 protein:vir:80796 Length: 574 100.0 1.1E-80 6.9E-84 458.9 41.3 420 1-424 27-493 (574)
58 protein:vir:95965 Length: 385 100.0 8.5E-81 5.3E-84 459.6 37.5 373 14-421 1-385 (385)
59 protein:vir:100650 Length: 395 100.0 1.1E-80 6.8E-84 459.0 38.1 372 14-424 1-384 (395)
60 protein:vir:9507 Length: 395 # 100.0 1.1E-80 6.8E-84 459.0 38.1 372 14-424 1-384 (395)
61 protein:vir:101289 Length: 395 100.0 1.1E-80 6.8E-84 459.0 38.1 372 14-424 1-384 (395)
62 protein:vir:80644 Length: 551 100.0 5.9E-80 3.7E-83 454.9 41.6 409 10-424 1-516 (551)
63 protein:vir:4854 Length: 386 # 100.0 4.5E-80 2.8E-83 455.6 38.5 377 14-424 1-385 (386)
64 protein:vir:7407 Length: 392 # 100.0 1.1E-79 6.8E-83 453.5 39.9 367 12-409 1-392 (392)
65 protein:vir:4995 Length: 384 # 100.0 1.7E-80 1E-83 457.9 35.1 368 14-402 1-384 (384)
66 protein:vir:100691 Length: 535 100.0 1.9E-79 1.2E-82 452.1 39.6 420 1-424 13-490 (535)
67 protein:vir:93867 Length: 378 100.0 5.8E-80 3.6E-83 455.0 36.4 356 14-424 1-370 (378)
68 protein:vir:94002 Length: 378 100.0 5.6E-80 3.5E-83 455.1 35.7 356 14-424 1-370 (378)
69 protein:vir:1661 Length: 378 # 100.0 1.4E-79 8.5E-83 452.9 36.7 356 14-424 1-370 (378)
70 protein:vir:4089 Length: 395 # 100.0 5.6E-79 3.5E-82 449.6 38.7 379 14-424 1-391 (395)
71 protein:vir:78310 Length: 376 100.0 2.5E-79 1.5E-82 451.5 36.8 365 14-417 1-376 (376)
72 protein:vir:63755 Length: 547 100.0 2.6E-78 1.6E-81 446.0 41.6 406 14-424 1-503 (547)
73 protein:vir:3989 Length: 392 # 100.0 1.9E-78 1.2E-81 446.7 40.0 367 12-409 1-392 (392)
74 protein:vir:1023 Length: 392 # 100.0 1.9E-78 1.2E-81 446.7 40.0 367 12-409 1-392 (392)
75 protein:vir:96579 Length: 576 100.0 5.8E-77 3.6E-80 438.6 41.5 418 1-424 18-495 (576)
76 protein:vir:4952 Length: 386 # 100.0 5.5E-77 3.4E-80 438.7 39.3 378 14-422 1-386 (386)
77 protein:vir:94869 Length: 378 100.0 3.3E-77 2.1E-80 439.9 37.4 356 14-424 1-370 (378)
78 protein:vir:98643 Length: 395 100.0 5.4E-77 3.4E-80 438.7 37.0 380 14-423 1-395 (395)
79 protein:vir:858 Length: 378 # 100.0 6.5E-77 4E-80 438.3 37.0 355 14-424 1-370 (378)
80 protein:vir:9641 Length: 395 # 100.0 3E-77 1.9E-80 440.1 34.8 376 14-423 1-395 (395)
81 protein:vir:95599 Length: 563 100.0 3.8E-76 2.4E-79 434.1 40.4 418 1-424 14-523 (563)
82 protein:vir:99312 Length: 563 100.0 3.8E-76 2.4E-79 434.1 40.4 418 1-424 14-523 (563)
83 protein:vir:4828 Length: 382 # 100.0 7.2E-77 4.4E-80 438.0 35.4 369 14-422 1-382 (382)
84 protein:vir:4194 Length: 540 # 100.0 6.8E-76 4.2E-79 432.7 39.3 391 3-424 1-441 (540)
85 protein:vir:4156 Length: 542 # 100.0 2.6E-75 1.6E-78 429.5 38.8 392 3-424 1-443 (542)
86 protein:vir:3153 Length: 467 # 100.0 3.9E-75 2.4E-78 428.5 39.6 371 53-424 1-442 (467)
87 protein:vir:79772 Length: 648 100.0 1.3E-69 8.1E-73 398.2 40.6 408 1-424 1-491 (648)
88 protein:vir:99452 Length: 651 100.0 4.2E-70 2.6E-73 400.9 34.0 413 1-424 1-535 (651)
89 protein:vir:78641 Length: 278 100.0 9E-63 5.6E-66 360.7 33.0 273 74-360 1-278 (278)
90 protein:vir:79150 Length: 368 100.0 2.4E-60 1.5E-63 347.4 29.0 353 8-376 1-368 (368)
91 protein:vir:267 Length: 348 # 100.0 1.3E-57 8.4E-61 332.4 30.4 333 1-371 1-348 (348)
92 protein:vir:100328 Length: 346 100.0 3.4E-57 2.1E-60 330.2 31.6 323 8-365 1-346 (346)
93 protein:vir:103971 Length: 376 100.0 9.6E-57 6E-60 327.7 33.7 327 1-367 26-376 (376)
94 protein:vir:79207 Length: 351 100.0 3.9E-56 2.4E-59 324.3 33.5 315 1-367 1-351 (351)
95 protein:vir:78191 Length: 351 100.0 9E-56 5.6E-59 322.4 33.3 327 1-367 1-351 (351)
96 protein:vir:78749 Length: 337 100.0 2.5E-56 1.5E-59 325.4 29.4 323 8-361 1-337 (337)
97 protein:vir:98567 Length: 340 100.0 1.6E-55 9.7E-59 321.0 32.5 324 1-364 1-340 (340)
98 protein:vir:1150 Length: 350 # 100.0 4.6E-56 2.9E-59 324.0 27.4 329 8-360 1-350 (350)
99 protein:vir:6058 Length: 344 # 100.0 3.4E-54 2.1E-57 313.7 32.7 330 8-365 1-344 (344)
100 protein:vir:3780 Length: 345 # 100.0 3.9E-54 2.4E-57 313.4 31.9 325 10-362 1-345 (345)
101 protein:vir:5691 Length: 344 # 100.0 4.2E-54 2.6E-57 313.2 31.2 324 1-365 1-344 (344)
102 protein:vir:3743 Length: 345 # 100.0 1E-53 6.2E-57 311.1 32.5 331 1-362 1-345 (345)
103 protein:vir:2013 Length: 344 # 100.0 9.1E-54 5.7E-57 311.4 30.8 324 8-365 1-344 (344)
104 protein:vir:4698 Length: 251 # 100.0 9.2E-52 5.7E-55 300.4 25.4 242 14-268 1-251 (251)
105 protein:vir:98853 Length: 219 100.0 1.2E-45 7.5E-49 266.8 22.9 208 153-364 1-219 (219)
106 protein:vir:5249 Length: 437 # 99.9 1.4E-27 8.7E-31 167.8 30.5 387 10-424 1-435 (437)
107 protein:vir:107742 Length: 537 99.9 3.2E-23 2E-26 143.9 30.2 403 1-424 41-533 (537)
108 protein:vir:94049 Length: 532 99.9 7.6E-22 4.7E-25 136.4 28.7 406 1-424 1-508 (532)
109 protein:vir:99853 Length: 488 99.8 1.2E-20 7.3E-24 129.9 30.2 397 1-424 1-409 (488)
110 protein:vir:99563 Length: 862 99.8 1.9E-20 1.2E-23 128.7 29.3 402 1-424 50-553 (862)
111 protein:vir:108215 Length: 469 99.8 4E-19 2.5E-22 121.4 33.9 403 1-424 1-455 (469)
112 protein:vir:99232 Length: 526 99.8 1.5E-19 9.5E-23 123.7 31.6 406 1-424 1-448 (526)
113 protein:vir:103860 Length: 528 99.8 2.7E-19 1.7E-22 122.4 32.2 405 1-424 1-443 (528)
114 protein:vir:79647 Length: 435 99.8 3.8E-20 2.4E-23 127.0 26.3 378 1-424 5-433 (435)
115 protein:vir:96068 Length: 765 99.8 1.8E-19 1.1E-22 123.4 29.5 395 1-424 28-533 (765)
116 protein:vir:79233 Length: 526 99.8 1.2E-18 7.4E-22 118.9 31.2 406 1-424 1-441 (526)
117 protein:vir:80040 Length: 461 99.8 3.3E-19 2E-22 121.9 26.9 393 1-423 1-461 (461)
118 protein:vir:107662 Length: 427 99.8 3.3E-19 2E-22 121.9 26.8 374 8-423 1-427 (427)
119 protein:vir:79063 Length: 491 99.8 8.1E-18 5E-21 114.3 33.5 390 1-424 15-423 (491)
120 protein:vir:104338 Length: 422 99.8 6.6E-19 4.1E-22 120.3 27.5 373 10-422 1-422 (422)
121 protein:vir:107880 Length: 491 99.7 6.3E-17 3.9E-20 109.4 33.1 389 1-424 15-423 (491)
122 protein:vir:1986 Length: 512 # 99.7 5.4E-17 3.4E-20 109.8 32.5 401 1-424 1-440 (512)
123 protein:vir:389 Length: 530 # 99.7 3.3E-17 2E-20 111.0 28.1 414 8-424 1-530 (530)
124 protein:vir:79511 Length: 448 99.7 7.2E-17 4.4E-20 109.1 28.7 405 1-424 1-441 (448)
125 protein:vir:79538 Length: 502 99.7 7.7E-17 4.8E-20 108.9 28.2 406 14-424 1-498 (502)
126 protein:vir:77981 Length: 448 99.7 1.6E-16 1E-19 107.1 29.7 398 1-424 1-441 (448)
127 protein:vir:95254 Length: 488 99.7 6.3E-16 3.9E-19 103.9 29.8 409 1-424 1-471 (488)
128 protein:vir:96738 Length: 505 99.7 4.2E-16 2.6E-19 104.9 27.0 418 1-424 1-503 (505)
129 protein:vir:3420 Length: 533 # 99.6 2.5E-15 1.5E-18 100.7 28.2 419 1-424 1-531 (533)
130 protein:vir:95542 Length: 548 99.6 2.5E-15 1.5E-18 100.7 24.9 405 14-424 1-521 (548)
131 protein:vir:98816 Length: 446 99.6 1E-14 6.3E-18 97.3 27.5 370 6-400 1-446 (446)
132 protein:vir:105782 Length: 449 99.5 6.6E-15 4.1E-18 98.3 23.6 386 1-424 1-449 (449)
133 protein:vir:10321 Length: 495 99.5 1E-14 6.4E-18 97.3 23.6 411 8-424 1-491 (495)
134 protein:vir:6382 Length: 553 # 99.5 1.2E-13 7.7E-17 91.3 27.7 419 1-424 1-549 (553)
135 protein:vir:78589 Length: 695 99.5 2.2E-13 1.4E-16 90.0 25.4 406 1-424 38-546 (695)
136 protein:vir:78161 Length: 355 99.5 2.8E-13 1.7E-16 89.4 25.8 289 131-424 1-321 (355)
137 protein:vir:106716 Length: 698 99.4 2.1E-13 1.3E-16 90.0 24.5 407 1-424 38-549 (698)
138 protein:vir:101541 Length: 694 99.4 4.3E-13 2.6E-16 88.4 24.8 408 1-424 33-545 (694)
139 protein:vir:3648 Length: 695 # 99.4 7.1E-13 4.4E-16 87.2 25.7 406 1-424 38-546 (695)
140 protein:vir:106491 Length: 646 99.3 5.3E-12 3.3E-15 82.4 22.8 407 1-424 1-487 (646)
141 protein:vir:102426 Length: 631 99.2 4.8E-12 3E-15 82.6 18.1 405 1-424 1-516 (631)
142 protein:vir:99088 Length: 629 99.2 1.1E-11 6.9E-15 80.6 19.7 413 1-424 1-508 (629)
143 protein:vir:8654 Length: 629 # 99.2 1.3E-11 7.9E-15 80.3 20.0 413 1-424 1-508 (629)
144 protein:vir:107517 Length: 639 99.0 2.7E-10 1.7E-13 73.1 21.3 405 1-424 1-514 (639)
145 protein:vir:97900 Length: 639 99.0 2.7E-10 1.7E-13 73.1 21.3 405 1-424 1-514 (639)
146 protein:vir:106027 Length: 629 98.9 1.1E-09 7E-13 69.6 20.6 408 1-424 1-515 (629)
147 protein:vir:102602 Length: 456 98.8 9.3E-09 5.8E-12 64.6 22.1 387 8-424 1-455 (456)
148 protein:vir:105819 Length: 456 98.8 9.3E-09 5.8E-12 64.6 22.1 387 8-424 1-455 (456)
149 protein:vir:98444 Length: 434 98.8 1.4E-08 8.4E-12 63.7 22.5 357 23-424 1-431 (434)
150 protein:vir:5839 Length: 533 # 98.7 3.2E-08 2E-11 61.7 23.0 407 1-424 1-506 (533)
151 protein:vir:7987 Length: 456 # 98.6 5.3E-08 3.3E-11 60.5 21.4 394 8-424 1-455 (456)
152 protein:vir:98883 Length: 517 98.6 1.2E-07 7.3E-11 58.6 21.3 389 14-424 1-512 (517)
153 protein:vir:104082 Length: 485 98.6 2.8E-07 1.8E-10 56.5 23.2 396 1-424 1-485 (485)
154 protein:vir:38 Length: 496 # N 98.5 4.4E-07 2.7E-10 55.4 23.7 389 12-421 1-496 (496)
155 protein:vir:7768 Length: 484 # 98.4 6.1E-07 3.8E-10 54.7 24.8 396 1-424 1-479 (484)
156 protein:vir:2427 Length: 485 # 98.4 7.9E-07 4.9E-10 54.1 24.3 394 1-424 1-482 (485)
157 protein:vir:1587 Length: 508 # 98.4 9.7E-07 6E-10 53.6 21.9 384 14-424 1-506 (508)
158 protein:vir:4073 Length: 279 # 98.3 1.1E-08 6.9E-12 64.2 9.9 260 58-400 1-279 (279)
159 protein:vir:4898 Length: 502 # 98.3 1.6E-06 9.7E-10 52.4 22.1 398 1-424 17-498 (502)
160 protein:vir:5961 Length: 503 # 98.3 1.7E-06 1E-09 52.3 31.1 394 1-424 1-499 (503)
161 protein:vir:96494 Length: 501 98.3 1.7E-06 1.1E-09 52.1 22.7 408 1-424 1-496 (501)
162 protein:vir:103219 Length: 201 98.3 6E-08 3.7E-11 60.2 12.7 182 227-422 1-201 (201)
163 protein:vir:4782 Length: 522 # 98.2 2.9E-06 1.8E-09 50.9 24.1 386 14-423 1-522 (522)
164 protein:vir:96240 Length: 511 98.2 3.1E-06 1.9E-09 50.8 23.3 395 1-424 24-504 (511)
165 protein:vir:103951 Length: 511 98.2 3.4E-06 2.1E-09 50.5 23.7 397 1-424 33-498 (511)
166 protein:vir:3028 Length: 500 # 98.1 3.7E-06 2.3E-09 50.4 22.5 384 14-424 1-496 (500)
167 protein:vir:9815 Length: 500 # 98.1 3.7E-06 2.3E-09 50.4 22.5 384 14-424 1-496 (500)
168 protein:vir:9751 Length: 422 # 98.1 4.1E-06 2.6E-09 50.1 21.7 360 8-423 1-422 (422)
169 protein:vir:9306 Length: 511 # 98.1 4.3E-06 2.7E-09 50.0 23.1 397 1-424 24-507 (511)
170 protein:vir:95806 Length: 440 98.1 4.6E-06 2.9E-09 49.8 23.5 389 6-424 1-436 (440)
171 protein:vir:94742 Length: 409 98.1 4.8E-06 3E-09 49.7 28.9 346 8-396 1-409 (409)
172 protein:vir:2341 Length: 488 # 98.1 4.9E-06 3.1E-09 49.7 23.1 397 1-424 1-488 (488)
173 protein:vir:97171 Length: 512 98.0 6E-06 3.8E-09 49.2 23.0 397 1-424 24-508 (512)
174 protein:vir:93747 Length: 472 98.0 6.3E-06 3.9E-09 49.1 27.1 390 1-424 2-468 (472)
175 protein:vir:4223 Length: 486 # 98.0 6.6E-06 4.1E-09 49.0 26.1 392 1-424 8-484 (486)
176 protein:vir:78227 Length: 480 98.0 8.9E-06 5.5E-09 48.3 25.2 389 10-424 1-472 (480)
177 protein:vir:80959 Length: 499 98.0 9.2E-06 5.7E-09 48.2 25.5 386 12-422 1-499 (499)
178 protein:vir:96366 Length: 511 97.9 9.7E-06 6E-09 48.1 22.9 397 1-424 33-507 (511)
179 protein:vir:78805 Length: 511 97.9 9.7E-06 6E-09 48.1 22.9 397 1-424 33-507 (511)
180 protein:vir:1236 Length: 483 # 97.9 1.2E-05 7.4E-09 47.6 27.0 389 1-424 1-477 (483)
181 protein:vir:99916 Length: 504 97.9 1.5E-05 9.1E-09 47.1 28.3 402 1-424 1-496 (504)
182 protein:vir:78537 Length: 480 97.8 1.6E-05 9.8E-09 46.9 23.9 385 10-424 1-472 (480)
183 protein:vir:2500 Length: 501 # 97.8 1.6E-05 9.9E-09 46.9 25.0 383 1-424 16-501 (501)
184 protein:vir:2732 Length: 501 # 97.8 1.7E-05 1E-08 46.7 26.2 396 1-424 38-493 (501)
185 protein:vir:79043 Length: 479 97.8 2.2E-05 1.3E-08 46.1 28.2 384 1-423 7-479 (479)
186 protein:vir:79703 Length: 505 97.8 2.2E-05 1.4E-08 46.1 25.7 383 14-424 1-504 (505)
187 protein:vir:8184 Length: 474 # 97.7 2.4E-05 1.5E-08 45.9 23.2 401 1-424 1-473 (474)
188 protein:vir:94101 Length: 474 97.7 2.7E-05 1.7E-08 45.6 27.2 392 1-424 24-473 (474)
189 protein:vir:105889 Length: 474 97.7 2.7E-05 1.7E-08 45.6 27.2 392 1-424 24-473 (474)
190 protein:vir:9568 Length: 410 # 97.7 2.8E-05 1.8E-08 45.5 27.0 351 8-418 1-410 (410)
191 protein:vir:99781 Length: 511 97.7 2.9E-05 1.8E-08 45.4 24.3 395 1-424 33-506 (511)
192 protein:vir:96839 Length: 474 97.7 3E-05 1.8E-08 45.4 26.9 387 1-422 1-474 (474)
193 protein:vir:94805 Length: 492 97.7 3.2E-05 2E-08 45.2 26.7 382 1-424 29-486 (492)
194 protein:vir:1634 Length: 409 # 97.6 3.7E-05 2.3E-08 44.9 29.4 346 8-396 1-409 (409)
195 protein:vir:95113 Length: 474 97.6 3.7E-05 2.3E-08 44.9 28.4 382 1-424 20-472 (474)
196 protein:vir:97336 Length: 492 97.6 4.6E-05 2.8E-08 44.4 26.1 386 1-424 35-486 (492)
197 protein:vir:106639 Length: 481 97.5 5E-05 3.1E-08 44.2 24.5 391 1-424 22-481 (481)
198 protein:vir:96266 Length: 474 97.5 6.2E-05 3.9E-08 43.6 25.8 388 1-424 1-468 (474)
199 protein:vir:95899 Length: 474 97.5 6.2E-05 3.9E-08 43.6 25.8 388 1-424 1-468 (474)
200 protein:vir:80680 Length: 441 97.4 6.8E-05 4.2E-08 43.4 28.6 372 10-424 1-433 (441)
201 protein:vir:97447 Length: 474 97.4 7.8E-05 4.8E-08 43.1 28.1 382 1-424 5-468 (474)
202 protein:vir:94498 Length: 474 97.4 7.8E-05 4.8E-08 43.1 28.1 382 1-424 5-468 (474)
203 protein:vir:78907 Length: 518 97.4 8.1E-05 5E-08 43.0 23.1 388 14-424 1-518 (518)
204 protein:vir:97376 Length: 320 97.4 4.9E-06 3E-09 49.7 9.9 309 14-403 1-320 (320)
205 protein:vir:99522 Length: 470 97.3 0.00011 6.8E-08 42.3 25.3 382 1-422 19-470 (470)
206 protein:vir:105292 Length: 478 97.2 0.00014 8.9E-08 41.7 29.0 390 1-424 1-477 (478)
207 protein:vir:3964 Length: 453 # 97.0 0.00023 1.4E-07 40.6 25.1 394 1-424 1-447 (453)
208 protein:vir:107112 Length: 478 97.0 0.00023 1.4E-07 40.6 30.8 387 1-424 1-477 (478)
209 protein:vir:99072 Length: 479 96.9 0.0003 1.9E-07 39.9 29.1 380 1-424 2-470 (479)
210 protein:vir:96179 Length: 468 96.8 0.00033 2.1E-07 39.6 26.4 383 1-424 1-468 (468)
211 protein:vir:94546 Length: 506 96.7 0.00039 2.4E-07 39.3 22.5 396 1-424 6-498 (506)
212 protein:vir:9871 Length: 429 # 96.6 0.00046 2.8E-07 38.9 24.7 372 1-424 1-424 (429)
213 protein:vir:733 Length: 453 # 96.5 0.00054 3.3E-07 38.5 24.4 397 1-424 3-451 (453)
214 protein:vir:105461 Length: 470 96.4 0.00066 4.1E-07 38.0 25.7 375 8-424 1-469 (470)
215 protein:vir:106571 Length: 499 96.3 0.00082 5.1E-07 37.5 27.9 381 1-424 5-488 (499)
216 protein:vir:104500 Length: 537 96.1 0.001 6.5E-07 36.9 25.8 409 1-424 1-536 (537)
217 protein:vir:3609 Length: 452 # 95.9 0.0012 7.7E-07 36.5 27.7 386 1-424 1-451 (452)
218 protein:vir:102950 Length: 471 95.2 0.0025 1.5E-06 34.9 25.1 372 8-424 1-465 (471)
219 protein:vir:78083 Length: 537 94.9 0.0033 2.1E-06 34.2 30.9 396 1-424 1-521 (537)
220 protein:vir:103177 Length: 533 94.5 0.0042 2.6E-06 33.6 22.3 400 17-424 1-513 (533)
221 protein:vir:101189 Length: 516 94.0 0.0056 3.5E-06 32.9 21.7 404 8-424 1-515 (516)
222 protein:vir:101806 Length: 516 94.0 0.0056 3.5E-06 32.9 21.7 404 8-424 1-515 (516)
223 protein:vir:100598 Length: 516 93.5 0.0072 4.5E-06 32.3 22.3 405 8-424 1-515 (516)
224 protein:vir:80453 Length: 535 89.1 0.028 1.8E-05 29.1 26.8 395 1-424 1-527 (535)
225 protein:vir:106282 Length: 521 86.1 0.048 3E-05 27.8 25.6 403 10-424 1-519 (521)
226 protein:vir:108049 Length: 524 85.5 0.053 3.3E-05 27.6 23.5 392 1-401 1-524 (524)
227 protein:vir:94956 Length: 452 84.0 0.064 4E-05 27.1 24.8 371 8-424 1-450 (452)
228 protein:vir:104892 Length: 558 83.9 0.065 4E-05 27.1 24.9 404 1-424 1-541 (558)
229 protein:vir:81017 Length: 521 82.6 0.076 4.7E-05 26.7 22.8 401 10-424 1-520 (521)
230 protein:vir:94709 Length: 522 81.9 0.081 5E-05 26.6 20.4 379 1-424 1-475 (522)
231 protein:vir:2198 Length: 536 # 76.7 0.13 8.2E-05 25.4 19.9 396 1-424 1-531 (536)
232 protein:vir:5665 Length: 511 # 75.8 0.14 8.8E-05 25.2 22.7 398 14-423 1-511 (511)
233 protein:vir:106999 Length: 564 75.4 0.15 9.2E-05 25.1 21.6 399 17-424 1-549 (564)
234 protein:vir:98265 Length: 524 74.7 0.15 9.6E-05 25.0 24.8 401 8-424 1-522 (524)
235 protein:vir:6596 Length: 521 # 72.8 0.18 0.00011 24.7 23.9 402 10-424 1-520 (521)
236 protein:vir:95149 Length: 501 70.8 0.2 0.00013 24.4 28.7 393 1-424 1-495 (501)
237 protein:vir:9922 Length: 489 # 69.2 0.23 0.00014 24.1 27.9 393 1-424 1-489 (489)
238 protein:vir:10447 Length: 536 68.6 0.24 0.00015 24.0 20.8 396 1-424 1-531 (536)
239 protein:vir:7208 Length: 524 # 64.5 0.3 0.00019 23.5 22.7 386 8-401 1-524 (524)
240 protein:vir:103458 Length: 524 62.0 0.34 0.00021 23.1 22.8 398 8-423 1-524 (524)
241 protein:vir:102330 Length: 451 60.2 0.38 0.00023 22.9 26.5 371 8-422 1-451 (451)
242 protein:vir:6896 Length: 523 # 43.8 0.83 0.00052 21.0 23.0 399 8-423 1-523 (523)
243 protein:vir:8883 Length: 543 # 38.4 1.1 0.00066 20.4 18.2 387 1-424 1-482 (543)
244 protein:vir:105154 Length: 525 32.8 1.4 0.00087 19.8 16.6 399 1-424 7-517 (525)
245 protein:vir:1538 Length:
535 # 29.4 1.7 0.001 19.4 19.5 383 1-424 1-480 (535) 246 protein:vir:101418 Length: 569 28.5 1.7 0.0011 19.3 12.7 402 1-423 1-569 (569) 247 protein:vir:102668 Length: 547 23.7 2.3 0.0014 18.6 17.4 350 1-424 1-494 (547) 248 protein:vir:6322 Length: 510 # 23.2 2.3 0.0015 18.6 19.3 373 10-424 1-467 (510) 249 protein:vir:78393 Length: 489 21.3 2.6 0.0016 18.3 25.0 395 1-424 1-484 (489) No 1 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=100.00 E-value=1e-121 Score=683.85 Aligned_cols=424 Identities=99% Similarity=1.479 Sum_probs=410.6 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCcee Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |++|||||++++++|||+++++||.++....+......++++..++.++..|+.+.|+++++||+||++||++||++||+ T Consensus 1 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~ 80 (424) T protein:vir:18 1 MEEPKYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) T ss_pred CCCCcceEeecCCCchHHHHHhhhcccccccccccccccccccccccccccccHHHhhccHHHHHHHHHHHHhhccCceE Confidence 99999999999999999999999998888888888888888888888999999999999999999999999999999999 Q ss_pred EeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCC Q lcl|NC_019710. 
81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~ 160 (424) +|+++++|+.+++..+||++++|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|||++|.+|++..+++ T Consensus 81 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~pl~~~~V~v~~~~~ 160 (424) T protein:vir:18 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) T ss_pred EEEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEcCC Confidence 99999988887777899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEecCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHH Q lcl|NC_019710. 161 KVVYRYQRDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 161 ~~~~~~~~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~ 240 (424) ...|.|..++..+.|+++||||+|+++.++++|+||+.++.++++++.++++++.++|+||++|++||+++....+++++ T Consensus 161 ~~~y~~~~~g~~~~~~~~eIih~r~~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~ 240 (424) T protein:vir:18 161 KVVYRYQRDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) T ss_pred eEEEEEEeCCeEEEeccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999998887889999 Q ss_pred HHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHH Q lcl|NC_019710. 
241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) Q Consensus 241 ~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~ 320 (424) +.+++.|+++.++.|+|++++|++|++|++++++++|+||+|+++++.++||++|||||.+||..+++++.++|+|++.+ T Consensus 241 ~~~~~~~~~~~~g~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~ 320 (424) T protein:vir:18 241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) T ss_pred HHHHHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999888889999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe Q lcl|NC_019710. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDV 400 (424) Q Consensus 321 ~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~ 400 (424) .|+++||.|++++||++|+++|+++.++.+++++||++.+++.|.++|++.+++++++|+||+||+|+++|+||+||||+ T Consensus 321 ~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGD~ 400 (424) T protein:vir:18 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDV 400 (424) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe Confidence 99999999999999999999999998887899999999999999999999999999999999999999999999999999 Q ss_pred eeecccccchhhccccCCCccCCC Q lcl|NC_019710. 
401 AMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 401 ~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) +++++|++|++.++++++|+++|| T Consensus 401 ~~~~~n~~~l~~~~~~~~p~~~ga 424 (424) T protein:vir:18 401 AMRQSQYVPITDLGTNKEPRNNGA 424 (424) T ss_pred eeeccCccchHhhhccCCCccCCC Confidence 999999999999999999999999 No 2 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=100.00 E-value=2.8e-121 Score=681.48 Aligned_cols=424 Identities=99% Similarity=1.485 Sum_probs=410.5 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCcee Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |++|+|||+|++++|||+++++||.+++...+.....++++++.+|.++..++++.|+++++|++||++||++||++||+ T Consensus 1 ~~~~~~~~~~~~~~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~iA~lp~~ 80 (424) T protein:vir:18 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) T ss_pred CCCCccccccCCCCchHHHHHhhccccccccccchhhccccccccccccccccHHHhhccHHHHHHHHHHHHhhccCceE Confidence 99999999999999999999999999988888877777888888899999999999999999999999999999999999 Q ss_pred EeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCC Q lcl|NC_019710. 
81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~ 160 (424) +|+.+++|..+++..+||++++|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|||++|.+|++..+++ T Consensus 81 vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~ 160 (424) T protein:vir:18 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) T ss_pred EEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcceEEEEcCC Confidence 99999988887777889999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEecCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHH Q lcl|NC_019710. 161 KVVYRYQRDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 161 ~~~~~~~~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~ 240 (424) ...|.|..++....|+++||||+|+++.++++|+||+..++.++.++.++++++.++|+||++|+++|+.+....+++++ T Consensus 161 ~~~y~~~~~g~~~~~~~~eVihir~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~ 240 (424) T protein:vir:18 161 KVVYRYQRDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) T ss_pred eEEEEEEeCCeEEEeccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCCHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999998887889999 Q ss_pred HHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHH Q lcl|NC_019710. 
241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) Q Consensus 241 ~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~ 320 (424) +.+++.|+++.++.|+|++++|++|++|++++++++|+||+|+++++.++||++|||||.+||..+++++.++|+|++.+ T Consensus 241 ~~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~ 320 (424) T protein:vir:18 241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) T ss_pred HHHHHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999888889999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe Q lcl|NC_019710. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDV 400 (424) Q Consensus 321 ~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~ 400 (424) .|+++||.|++++||++|+++|+++.++..++++||++.+++.|.++|++.+++++++|+||+||+|+++|+||+||||+ T Consensus 321 ~f~~~tl~P~~~~ie~~ln~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~ 400 (424) T protein:vir:18 321 GFLQYTLQPYISRWENSIQRWLIPSKDVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNMPPLPGGDV 400 (424) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe Confidence 99999999999999999999999998887899999999999999999999999999999999999999999999999999 Q ss_pred eeecccccchhhccccCCCccCCC Q lcl|NC_019710. 
401 AMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 401 ~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) +++++|++|++.++++++++++|| T Consensus 401 ~~~~~n~~~l~~~~~~~~~~~n~a 424 (424) T protein:vir:18 401 AMRQAQYVPITDLGTNKEPRNNGA 424 (424) T ss_pred eeeccCccchhhhhccCCccccCC Confidence 999999999999999999999999 No 3 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=100.00 E-value=4.8e-101 Score=570.55 Aligned_cols=419 Identities=31% Similarity=0.521 Sum_probs=353.2 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCcee Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |.+--...--+...+.-+.+++|.. +.... ........+.+.....+..++.+.++++++||+||++||++||++||+ T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~~~~~~~g~~~~~g~~v~~~~al~~~~V~~~i~~ia~~ia~lp~~ 78 (434) T protein:vir:43 1 MSKSLGKVLSSATSAPRSSLFGWGG-KTIRL-TDGAFWSQFLGRESSSGKKVTVDKAMKLSAVWACVRLISTSVAGLPLG 78 (434) T ss_pred Cccchhhhhhhcccccchhhhcccc-ccccc-CchHHHHHHhcCCccCCceechhhhhccHHHHHHHHHHHHhhhhCceE Confidence 3221111111111111111111111 01100 111111112223344677899999999999999999999999999999 Q ss_pred EeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCC Q lcl|NC_019710. 
81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~ 160 (424) +|+++.+|..++ ..+|++++||+.+||++||+++||+.++.+++++||||+++.++ .|++++||||+|.+|++..+.+ T Consensus 79 ~~~~~~~g~~~~-~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~-~G~~~~L~~l~p~~v~~~~~~~ 156 (434) T protein:vir:43 79 VYERKADGSRVD-ARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA-AGRPAALDFLLPSRVDLECDEN 156 (434) T ss_pred EEEEcCCCcccc-ccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEcCC Confidence 999988776543 56899999999999999999999999999999999999998876 6999999999999999988765 Q ss_pred ce-EE-EEEecCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHH Q lcl|NC_019710. 161 KV-VY-RYQRDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ 238 (424) Q Consensus 161 ~~-~~-~~~~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~ 238 (424) +. .| ++..++..+.|+++||||+|+++.|+++|+||+..+..++..+.++++++.++|+||++|+++++.+.. .+++ T Consensus 157 g~~~y~~~~~~g~~~~~~~~eVih~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-l~~e 235 (434) T protein:vir:43 157 GRLKYFYTTKKGARREIERTNMLHIPAFTLDGRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRI-LQPA 235 (434) T ss_pred CeEEEEEEecCceEEEEccccEEEecCcCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCC-CCHH Confidence 43 33 344556678999999999999999999999999999999999999999999999999999999999865 5577 Q ss_pred HHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHH Q lcl|NC_019710. 
239 QRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ 318 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~ 318 (424) +.++++++++++.++.|+|+++++++|++|++++++++|+||+|.++++.++||++|||||.+||..++++++++|.|++ T Consensus 236 ~~~~~r~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~ 315 (434) T protein:vir:43 236 QREEFREYVKSVSGAMNSGRSPVLEQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQ 315 (434) T ss_pred HHHHHHHHHHHhcCccccCCccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHH Confidence 78889999999999999999999999999999999999999999999999999999999999999998888888899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc Q lcl|NC_019710. 319 NLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 319 ~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~gg 398 (424) ...|+.+||.|++.+||++|+++|+++.++..++++||++.+++.|.+++++.+.+++++|+||+||+|+++|+||+||| T Consensus 316 ~~~f~~~~L~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p~~gg 395 (434) T protein:vir:43 316 MLAFLTFSISSITNQIQQCVNKRLLTAPERIRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRKENLPELPGG 395 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCChhhhcCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 99999999999999999999999999988777899999999999999999999999999999999999999999999999 Q ss_pred CeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 
399 DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) |++++++|++|++.+++++.+++--+ T Consensus 396 D~~~~~~n~~~~~~~~~~~~~~~~~~ 421 (434) T protein:vir:43 396 DILTVQSNLVPIDQLGQSNKSQAVRA 421 (434) T ss_pred CeEeeccCccchhhhhccCCCcchhh Confidence 99999999999998877654433211 No 4 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=100.00 E-value=3.4e-100 Score=565.93 Aligned_cols=410 Identities=30% Similarity=0.519 Sum_probs=357.7 Q ss_pred cccCCCccHHHHHHhhccCcccccccccccccc-------cccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCcee Q lcl|NC_019710. 8 IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGP-------VSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |.=.+.+|||+|++++|.+.++..........+ ++.....++..|+.+.|+++++||+||++||++||++||+ T Consensus 1 ~~~~~~mg~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~ 80 (432) T protein:vir:81 1 MPDEKKLGLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLT 80 (432) T ss_pred CCchhhcchhhhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHHHHHHHHhhhhCcee Confidence 777888999999999998766543222222111 2222334678899999999999999999999999999999 Q ss_pred EeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCC Q lcl|NC_019710. 
81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~ 160 (424) +|+++++|..+ ..+||++++|+.+||++||+++||+.++.+++++||||+++.++ +|++.+||||+|..|++..+.+ T Consensus 81 ~y~~~~~g~~~--~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-~g~~~~L~~l~~~~v~v~~~~~ 157 (432) T protein:vir:81 81 MYMRTPDGRKE--AVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDPK 157 (432) T ss_pred eEEecCCccee--cccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEECCC Confidence 99998876533 45799999999999999999999999999999999999999986 5999999999999999988765 Q ss_pred c-eEEEE-EecCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHH Q lcl|NC_019710. 161 K-VVYRY-QRDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ 238 (424) Q Consensus 161 ~-~~~~~-~~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~ 238 (424) + ..|.+ ..++..+.|+++||||+|+++.|+++|+||+.++..++..+.+++++..++|+||++|+++++.+.. .+++ T Consensus 158 g~~~y~~~~~~g~~~~~~~~~iih~r~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-l~~e 236 (432) T protein:vir:81 158 GNTAYRYRRTDGQMIDIPKQQIWKIMGYSLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRF-LTDD 236 (432) T ss_pred CcEEEEEEecCceEEEEccccEEEecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCC-CCHH Confidence 4 34444 4466778999999999999999999999999999999999999999999999999999999999866 4566 Q ss_pred HHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCc-ccccHHH Q lcl|NC_019710. 239 QRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS-WGSGIEQ 317 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~-~~~n~e~ 317 (424) +.+.+++. 
+.+..|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..+.+++ +++|+|+ T Consensus 237 ~~~~~~~~---~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq 313 (432) T protein:vir:81 237 QYDSFAKK---VSGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIES 313 (432) T ss_pred HHHHHHHH---HhhhhcCCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHH Confidence 66655554 4566789999999999999999999999999999999999999999999999999887765 4578999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Q lcl|NC_019710. 318 QNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG 397 (424) Q Consensus 318 ~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g 397 (424) +.+.|+++||.|++++||++|+++|+++.++..++++||++.+++.|.++|++++++++++|+||+||+|+++|+||+|| T Consensus 314 ~~~~f~~~tl~P~~~~ie~~l~~kLl~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~g 393 (432) T protein:vir:81 314 QQLGFLTMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGG 393 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Confidence 99999999999999999999999999998877789999999999999999999999999999999999999999999998 Q ss_pred cCee-eecccccchhhccccCCCccCCC Q lcl|NC_019710. 
398 GDVA-MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 398 gd~~-~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) |+.+ .+++|++|++..+.+.++++.++ T Consensus 394 ~~~~~~~~~~~~pl~~~~~~~~~~~~~~ 421 (432) T protein:vir:81 394 NAAVLTVQSAMVPLDSIGLQASPEPASG 421 (432) T ss_pred CcceEeecCcccchhhhccCCCCCCCCC Confidence 7554 58999999998887766655444 No 5 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=100.00 E-value=8.6e-100 Score=563.70 Aligned_cols=410 Identities=30% Similarity=0.514 Sum_probs=354.9 Q ss_pred cccCCCccHHHHHHhhccCcccccccccccccc-------cccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCcee Q lcl|NC_019710. 8 IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGP-------VSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |-=--..|+|+|++++|.++.+..........+ ++...+..+..|+.+.|+++++||+||++||++||++||+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~i~~Ia~~ia~lp~~ 80 (432) T protein:vir:10 1 MPDEKKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAIAAMPLT 80 (432) T ss_pred CCCCcccchhhhhHhhcCCccccccccccccccCcchhhhhcccccccCcccchhhhhcchHHHHHHHHHHHhhhhCcee Confidence 211123799999999998876543322222111 2223345678899999999999999999999999999999 Q ss_pred EeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCC Q lcl|NC_019710. 
81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~ 160 (424) +|+++.+|..+ ..+||++++|+.+||++||+++||+.++.+++++||||++++++ +|.+.+||||+|.+|++..+.+ T Consensus 81 ~y~~~~~g~~~--~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~~~~v~v~~~~~ 157 (432) T protein:vir:10 81 MYMRTPDGRKE--AVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTK 157 (432) T ss_pred EEEecCCCccc--ccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEEcCC Confidence 99998877533 45799999999999999999999999999999999999999997 5999999999999999988655 Q ss_pred c-eEEEE-EecCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHH Q lcl|NC_019710. 161 K-VVYRY-QRDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ 238 (424) Q Consensus 161 ~-~~~~~-~~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~ 238 (424) + ..|.+ ..++..+.|+++||||+|+++.++++|+||+..+.++++++.+++++..++|+||++|++|++++.. .+++ T Consensus 158 g~~~y~~~~~~g~~~~~~~~~iih~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-l~~e 236 (432) T protein:vir:10 158 GNTAYRYRRTDGQMIDIPKQQIWKIMGYSLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRF-LTDD 236 (432) T ss_pred CcEEEEEEecCceEEEEcCccEEEecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCC-CCHH Confidence 4 44444 4466678999999999999999999999999999999999999999999999999999999999866 4566 Q ss_pred HHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCc-ccccHHH Q lcl|NC_019710. 
239 QRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS-WGSGIEQ 317 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~-~~~n~e~ 317 (424) +.+.+++.+ .+..|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..+.+++ .++|+|+ T Consensus 237 ~~~~~~~~~---~~~~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~ 313 (432) T protein:vir:10 237 QYDSFAKKV---SGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIES 313 (432) T ss_pred HHHHHHHHH---hhhhhCCCceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHH Confidence 666655544 456788999999999999999999999999999999999999999999999999887665 4568999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Q lcl|NC_019710. 318 QNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG 397 (424) Q Consensus 318 ~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g 397 (424) +.+.|+++||.|++++||++|+++|+++.++..++++||++.+++.|.++|++.+++++++|+||+||+|+++|+||+|| T Consensus 314 ~~~~f~~~tl~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi~g 393 (432) T protein:vir:10 314 QQLGFLSMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGG 393 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Confidence 99999999999999999999999999998877789999999999999999999999999999999999999999999998 Q ss_pred cCee-eecccccchhhccccCCCccCCC Q lcl|NC_019710. 
398 GDVA-MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 398 gd~~-~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) |+.+ .+++|++|++.++.+.++++.++ T Consensus 394 ~~~~~~~~~~~~pl~~~~~~~~~~~~~~ 421 (432) T protein:vir:10 394 NAAVLTVQSAMVPLDSIGLQASPEPASG 421 (432) T ss_pred CcceEeecCcccchhhhcccCCCCCCCC Confidence 7654 58999999999887766655544 No 6 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=100.00 E-value=4.4e-100 Score=565.27 Aligned_cols=399 Identities=24% Similarity=0.406 Sum_probs=352.0 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +|||+|++++|.+...... .+.+ ....|.++..++.+.++++++|++||++||++||++||++|+++++|..+ T Consensus 1 MG~~~~~~~~~~~~~~~~~----~~~~-~~~~~~g~~~~~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~~~~-- 73 (411) T protein:vir:81 1 MGWWSRLTRFFRPRNETVD----MTNP-LLLQWLGVDPDTPRNQLSEATYFACLKILSESLGKLPLKMYQKTERGIVK-- 73 (411) T ss_pred CchHHHHHhhccCcccccc----cchH-HHHHHhcCcccChhhhhccHHHHHHHHHHHHhHhhCceeEEEecCCceee-- Confidence 9999999998875543322 1111 13456777888999999999999999999999999999999998776543 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCc-------eEEEE Q lcl|NC_019710. 
94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-------VVYRY 166 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~-------~~~~~ 166 (424) ..+|+++++|+.+||++||+++||+.++.+++++||||++++|+ .|.+.+|||++|++|++..++.+ .+|.| T Consensus 74 ~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~-~g~~~~l~~l~~~~v~~~~~~~~~~~~~~~~~~~~ 152 (411) T protein:vir:81 74 SDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS-GPQLQALWILPSQYVTIVVDDRGLLGEKNAIWYRY 152 (411) T ss_pred ecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCceEEEEEECCceEEEEEcCcccccccceEEEEE Confidence 45799999999999999999999999999999999999999998 59999999999999999887653 23444 Q ss_pred E--ecCceEEecHhHeeEecC-cCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHH Q lcl|NC_019710. 167 Q--RDSEYADFSQKEIFHLKG-FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) Q Consensus 167 ~--~~~~~~~~~~~evih~r~-~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~ 243 (424) . .++..+.|+++||||+|+ ++.++++|+||+.++..++.+..++++++.++|+||++|+++|+.+.. .++++.+.+ T Consensus 153 ~~~~~g~~~~~~~~eiih~k~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~e~~~~~ 231 (411) T protein:vir:81 153 NDPYDGKMYVFRNDEILHFKTSVTFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGD-LNQEARDRL 231 (411) T ss_pred EecCCceEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC-CCHHHHHHH Confidence 3 355677899999999995 567899999999999999999999999999999999999999999865 567778888 Q ss_pred HHHHHHHhCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHH Q lcl|NC_019710. 
244 EENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF 322 (424) Q Consensus 244 ~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f 322 (424) ++.|++.+++ +|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.++| T Consensus 232 ~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~~n~e~~~~~f 309 (411) T protein:vir:81 232 VKGFEQFANGSKNAGKIIPVPLGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSS--YASAEAQNLAF 309 (411) T ss_pred HHHHHHHhcCccccCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--chhHHHHHHHH Confidence 8888886654 68999999999999999999999999999999999999999999999999887765 46999999999 Q ss_pred HHHHHHHHHHHHHHHHhhhccChhhhc-cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCee Q lcl|NC_019710. 323 LQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVA 401 (424) Q Consensus 323 ~~~tl~P~~~~ie~~l~~~L~~~~~~~-~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~ 401 (424) +++||.|++++||++|+++|+++.++. +++++||++.+++.|.+++++.+++++++|+||+||+|+++|+||+||||++ T Consensus 310 ~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~~ggD~~ 389 (411) T protein:vir:81 310 YVDTLLYVLKQYEEEITYKILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPADDYGNNL 389 (411) T ss_pred HHHHHHHHHHHHHHHHHhhcCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCee Confidence 999999999999999999999988754 5889999999999999999999999999999999999999999999999999 Q ss_pred eecccccchhhccccCCCccCC Q lcl|NC_019710. 
402 MRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~~~~~g 423 (424) ++++|++|++.++++.+...++ T Consensus 390 ~~~~n~~pl~~~~~~~~kgGd~ 411 (411) T protein:vir:81 390 MANGNYIPLSMLGANYGKGGDS 411 (411) T ss_pred eeccCccchhhhhhhhccCCCC Confidence 9999999999987764322222 No 7 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=100.00 E-value=1.3e-99 Score=562.70 Aligned_cols=410 Identities=30% Similarity=0.500 Sum_probs=353.9 Q ss_pred cccCCCccHHHHHHhhccCcccccccccccccc-------cccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCcee Q lcl|NC_019710. 8 IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGP-------VSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |-=--..|||+|++++|.++.+..........+ ++...+..+..|+.+.|+++++||+||++||++||++||+ T Consensus 1 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~a~~~~aV~~~v~~Ia~~ia~lp~~ 80 (432) T protein:vir:97 1 MPDEKKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAACVKLVSQAVAAMPLM 80 (432) T ss_pred CCCcccCchhhhhHhhcCCccccccccccccccCchhhhhhcccccccCcccchHhhhcchHHHHHHHHHHHhhccCceE Confidence 211112799999999998766543222111111 2223345678899999999999999999999999999999 Q ss_pred EeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCC Q lcl|NC_019710. 
81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~ 160 (424) +|+++.+|..+ ..+||++++|+.+||++||+++||+.++.+++++||||++++++ +|++.+||||+|.+|++..+.+ T Consensus 81 ~y~~~~~g~~~--~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~p~~v~v~~~~~ 157 (432) T protein:vir:97 81 MYMRTPDGRKE--AVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITTDTK 157 (432) T ss_pred EEEecCCCccc--ccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEEcCC Confidence 99998877543 45799999999999999999999999999999999999999997 5999999999999999988654 Q ss_pred c-eEEEEE-ecCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHH Q lcl|NC_019710. 161 K-VVYRYQ-RDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ 238 (424) Q Consensus 161 ~-~~~~~~-~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~ 238 (424) + ..|.+. .++..+.|+++||||+|+++.++++|+||+..+..++++..+++++..++|+||++|++|++++.. .+++ T Consensus 158 g~~~y~~~~~~g~~~~~~~~~iih~r~~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~-l~~e 236 (432) T protein:vir:97 158 GNTAYRYRRTDGQMIDIPRQQIWKIMGYSLDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRF-LTDD 236 (432) T ss_pred CcEEEEEEecCceEEEEccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCCC-CCHH Confidence 3 445444 456678899999999999999999999999999999999999999999999999999999999866 4566 Q ss_pred HHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcc-cccHHH Q lcl|NC_019710. 239 QRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSW-GSGIEQ 317 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~-~~n~e~ 317 (424) +++.+++. 
+.+..|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..+.++++ ++|+|+ T Consensus 237 ~~~~~~~~---~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~ 313 (432) T protein:vir:97 237 QYDSFSKK---VSGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIES 313 (432) T ss_pred HHHHHHHH---HhhhhcCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHH Confidence 66555544 45667899999999999999999999999999999999999999999999999998876653 468999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Q lcl|NC_019710. 318 QNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG 397 (424) Q Consensus 318 ~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g 397 (424) +.+.|+++||.|++++||++|+++|+++.++..++++||++.+++.|.++|++.+.+++++|+||+||+|+++|+||++| T Consensus 314 ~~~~f~~~tl~P~~~~ie~~ln~kLl~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~~g 393 (432) T protein:vir:97 314 QQLGFLTMTLSPWLRRIEQSIALNLLTPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGG 393 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Confidence 99999999999999999999999999998877789999999999999999999999999999999999999999999998 Q ss_pred cCee-eecccccchhhccccCCCccCCC Q lcl|NC_019710. 
398 GDVA-MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 398 gd~~-~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) ||.+ .++.|++|++.++.+..++++++ T Consensus 394 ~~~~~~~~~~~~pl~~~~~~~~~~~~~~ 421 (432) T protein:vir:97 394 NAAVLTVQSAMVPLDSIGLQASPEPASG 421 (432) T ss_pred CcceEeecccccchhhhcccCCCCCCCC Confidence 7665 58999999998877766555444 No 8 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=100.00 E-value=6.6e-100 Score=564.31 Aligned_cols=403 Identities=25% Similarity=0.404 Sum_probs=346.7 Q ss_pred ccHHHHHHhhccCccccccccccccccc---ccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCcc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPV---SAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNR 90 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 90 (424) +++. .+|+.+..+......+...+ ......++..|+++.++++++||+||++||++||++||++|+++++|+. T Consensus 1 m~~~----~~~~~~~~~~s~~~~w~~~~~~~~~~~~~~g~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~g~~ 76 (421) T protein:vir:10 1 MFIP----QMFEGKKRSVSGGGFWEAMLGGVRSSHSKAGVMITPETALALSAVRACVTLLAESVAQLPVELYRRDKNGGR 76 (421) T ss_pred CCCc----chhcccccccCcchhhHHHhhhhccCcccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCce Confidence 3322 23333332222222222222 2233456788999999999999999999999999999999999888765 Q ss_pred ccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCceEEEEEecC Q lcl|NC_019710. 91 KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDS 170 (424) Q Consensus 91 ~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~~~~~ 170 (424) + ...+|++.++|+.+||++||+++||+.++.+++++||||++++|+.+|+|.+||||+|.+|++..+.++..|++ ... 
T Consensus 77 ~-~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~g~~~y~-~~~ 154 (421) T protein:vir:10 77 Q-RATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKELIPINPKKVIVLKGPDGMPYYE-IPE 154 (421) T ss_pred e-ecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEECCCceEEEE-EcC Confidence 4 45679999999999999999999999999999999999999999999999999999999999988876654433 333 Q ss_pred ceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCC---CCHHHHHHHHHHH Q lcl|NC_019710. 171 EYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV---LTEQQRSQVEENF 247 (424) Q Consensus 171 ~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~---~~~~~~~~~~~~~ 247 (424) ....++++||||+|+++.++++|+||+..+..+++...++++++.++|+||++|+|+|+.+... .++++.+++++.| T Consensus 155 ~g~~~~~~eiih~~~~~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~~~ 234 (421) T protein:vir:10 155 IGETLPMRMMHHVKVFSLDGYIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLAKW 234 (421) T ss_pred CCcEEchhhEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHHHHHHH Confidence 3457999999999999999999999999999999999999999999999999999999987654 3678888888888 Q ss_pred HHHhCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHH Q lcl|NC_019710. 
248 KEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 248 ~~~~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~t 326 (424) ++.+++ .|+|+++++++|++|++++++++|+||+|.++++.++||++|||||.+||..++++ ++|+|++.+.|+++| T Consensus 235 ~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~sn~e~~~~~f~~~t 312 (421) T protein:vir:10 235 TDRYSGINNMFSVALLQEGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKAT--NNNIEHQGLQFVMYT 312 (421) T ss_pred HHHhcCccccCcceecCCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCc--cccHHHHHHHHHHHH Confidence 876554 68999999999999999999999999999999999999999999999999987765 459999999999999 Q ss_pred HHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccc Q lcl|NC_019710. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQ 406 (424) Q Consensus 327 l~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n 406 (424) |.|++.+||++|+++|+++.++.+++++||++.+++.|.+++++.+++++++|+||+||+|+++|+||+||||++++|+| T Consensus 313 l~P~~~~ie~~ln~kL~~~~~~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n 392 (421) T protein:vir:10 313 LLAWLKRHEGALQRDLLLPSERRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLPPIAGGDKYLTPLN 392 (421) T ss_pred HHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccc Confidence 99999999999999999998877789999999999999999999999999999999999999999999999999999999 Q ss_pred ccchhhccccC--CCccCCC Q lcl|NC_019710. 
407 YVPITDLGTNK--EPRNNGA 424 (424) Q Consensus 407 ~~~~~~~~~~~--~~~~~g~ 424 (424) +++++.....+ +....++ T Consensus 393 ~~~~~~~~~~~~~~~~~~~~ 412 (421) T protein:vir:10 393 MVDSAQIIPGDKKPTAQQMA 412 (421) T ss_pred cccccccccCCCCcccccCc Confidence 99887764432 2222233 No 9 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=100.00 E-value=1.7e-99 Score=562.05 Aligned_cols=404 Identities=27% Similarity=0.425 Sum_probs=351.9 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +||++.+++. ............+.+....+.++..++.+.++++++|++||++||++||++||++|+++++|+.+ . T Consensus 1 m~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~~~-~ 76 (419) T protein:vir:57 1 MFIPQFWKGR---PSENRVNWQVVPGGMRSSSSQAGVIITPETALALSAVRACVTLLAESVAQLPCVLYRRTENGGRE-I 76 (419) T ss_pred CcchhhhccC---CccccccccccccccccccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCcee-c Confidence 7777765442 22222222222233444556778899999999999999999999999999999999999887644 4 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCceEEEEEecCceE Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEYA 173 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~~~~~~~~ 173 (424) ..+|++.++|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|||++|.+|++..+.++..| |...+... 
T Consensus 77 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v~v~~~~~g~~~-y~~~~~~~ 155 (419) T protein:vir:57 77 AFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPHKVIVLKGPDGMPY-YDIPSIGE 155 (419) T ss_pred cccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcceEEEECCCceEE-EEEcCCce Confidence 56899999999999999999999999999999999999999999999999999999999999988766543 33344556 Q ss_pred EecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCC---CCCHHHHHHHHHHHHHH Q lcl|NC_019710. 174 DFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK---VLTEQQRSQVEENFKEI 250 (424) Q Consensus 174 ~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~---~~~~~~~~~~~~~~~~~ 250 (424) .+++++|||+|+++.++++|+||+..+..++....++++++.++|+||++|+++|+.+.. ..++++.+.+++.|++. T Consensus 156 ~~~~~~vih~r~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~~ 235 (419) T protein:vir:57 156 ILPMRMVHHIKSFSLDGYIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAILAKWTER 235 (419) T ss_pred EEchhhEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHHHHHHHH Confidence 799999999999999999999999999999999999999999999999999999998643 34678888888888776 Q ss_pred hCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHH Q lcl|NC_019710. 
251 AGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQP 329 (424) Q Consensus 251 ~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P 329 (424) +++ .|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.+.|+++||.| T Consensus 236 ~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~sn~e~~~~~f~~~~l~P 313 (419) T protein:vir:57 236 YGGVRNAFSVGMLQEGMTYKQLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKST--NNNIEHQGLQYVIYTMLA 313 (419) T ss_pred hccccccccceecCCCceEEEcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCc--cccHHHHHHHHHHHHHHH Confidence 554 68999999999999999999999999999999999999999999999999887655 569999999999999999 Q ss_pred HHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccc Q lcl|NC_019710. 330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 330 ~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n~~~ 409 (424) ++++||++|+++|+++.++.+++++||++.+++.|.+++++.+++++++|+||+||+|+++|+||+||||++++|+|+++ T Consensus 314 ~~~~ie~~l~~~ll~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~ 393 (419) T protein:vir:57 314 ILKRHESAMMRDLLLPSERRDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLTPIPGGDKYLTPLNMVD 393 (419) T ss_pred HHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccc Confidence 99999999999999988877899999999999999999999999999999999999999999999999999999999998 Q ss_pred hhhccccCCCccCCC Q lcl|NC_019710. 
410 ITDLGTNKEPRNNGA 424 (424) Q Consensus 410 ~~~~~~~~~~~~~g~ 424 (424) +..+.+.+++..+.- T Consensus 394 ~~~~~~~~~~~~~~~ 408 (419) T protein:vir:57 394 SKALTGIGKATPQQL 408 (419) T ss_pred ccccccccCCCcccC Confidence 877655433222221 No 10 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=100.00 E-value=2e-99 Score=561.67 Aligned_cols=415 Identities=33% Similarity=0.583 Sum_probs=351.0 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCcee Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |.+++.-.- .-+-..+.+|++.+. ..........++......+..++.+.|+++++||+||++||++||++||+ T Consensus 1 ~~~~~~~~~----~~~~~~~~~~~g~~~--s~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~lp~~ 74 (437) T protein:vir:10 1 MKQGKQRAL----GRIKSSFLKWLGVPI--SLTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIATLPLN 74 (437) T ss_pred CCcchhhhh----hhhHHhhhhhcCCcc--cCCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhhCcee Confidence 333322110 122222444554322 12222222334444455678899999999999999999999999999999 Q ss_pred EeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCC Q lcl|NC_019710. 
81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~ 160 (424) +|+++++|..+ ...+|++.++|+.+||++||+++||+.++.+++++||||++++|+ .|++++|||++|.+|++..+.+ T Consensus 75 ~~~~~~~g~~~-~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~-~g~~~~L~~l~p~~v~i~~~~~ 152 (437) T protein:vir:10 75 LYQTKPDGTRV-LAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRS-AGVLIGLELMLPQRTTVKRLTS 152 (437) T ss_pred EEEEcCCCcee-eccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEECCC Confidence 99998877544 456899999999999999999999999999999999999999998 4999999999999999988654 Q ss_pred c-eEEEEE-ecCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHH Q lcl|NC_019710. 161 K-VVYRYQ-RDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ 238 (424) Q Consensus 161 ~-~~~~~~-~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~ 238 (424) + ..|.|. .++....|+++||||||+++.|+++|+||+.++..++.+..++++++.++|+||++|+++|+.+.. .+++ T Consensus 153 g~~~y~~~~~~g~~~~~~~~dIih~r~~~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~e 231 (437) T protein:vir:10 153 GALQYTYRNVDGTVSTLAEDDVFHVRGFSLDGLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQI-LQKE 231 (437) T ss_pred CeEEEEEEecCceEEEEccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCC-CCHH Confidence 4 344443 456678899999999999999999999999999999999999999999999999999999999866 4566 Q ss_pred HHHHHHHHHHHH-hCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHH Q lcl|NC_019710. 239 QRSQVEENFKEI-AGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ 317 (424) Q Consensus 239 ~~~~~~~~~~~~-~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~ 317 (424) +.+++++.|++. 
.|..|+|+++++++|++|++++++++|+||+|++++++++||++|||||.+||..++++++++|+|+ T Consensus 232 ~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~ 311 (437) T protein:vir:10 232 KRAEIRTDLAEQFGGAMQAGKTMVLEAGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQ 311 (437) T ss_pred HHHHHHHHHHHHhcCccccCcceeccCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHH Confidence 778888888765 4557899999999999999999999999999999999999999999999999999998888889999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Q lcl|NC_019710. 318 QNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG 397 (424) Q Consensus 318 ~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g 397 (424) +.+.|+++||.|++.+||++|+++||++.++..++++||++.+++.|.+++++.+++++++|+||+||+|+++|+||+|| T Consensus 312 ~~~~f~~~tl~P~~~~ie~~l~~kll~~~e~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~g 391 (437) T protein:vir:10 312 QTLGFLTFTLRPWLTRIEQAARRSLLRPGERDQFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTRDECRAKENLPPMGG 391 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 99999999999999999999999999998887789999999999999999999999999999999999999999999998 Q ss_pred cCeee-ecccccchhhccccCCCccC-----CC Q lcl|NC_019710. 
398 GDVAM-RQSQYVPITDLGTNKEPRNN-----GA 424 (424) Q Consensus 398 gd~~~-~~~n~~~~~~~~~~~~~~~~-----g~ 424 (424) ||+++ +++|++|++..+++..+..+ |+ T Consensus 392 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 424 (437) T protein:vir:10 392 NAAVLTVQSALLPIDKLGEHTTATAAQDALKAW 424 (437) T ss_pred CcceEeecCcccchhhccCcCCCcchhcccccc Confidence 87764 79999999887654322111 11 No 11 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=100.00 E-value=3.4e-99 Score=560.45 Aligned_cols=406 Identities=23% Similarity=0.392 Sum_probs=352.4 Q ss_pred ccHHHHHHhhccCccccccccccccc----ccccc-cccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccC Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTG----PVSAH-GYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQND 88 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~-~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 88 (424) +|||+|++++|+..+...+....... ...+. ....+..++.+.++++++||+||++||++||++||++|++++++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 99999999988744433322221111 01111 22356678999999999999999999999999999999998766 Q ss_pred ccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCc------- Q lcl|NC_019710. 89 NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK------- 161 (424) Q Consensus 89 ~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~------- 161 (424) .. ...+|++.+||+.+||++||+++||+.++.+++++||||++++|+..|++++||||+|.+|++..++.. 
T Consensus 81 ~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~ 158 (432) T protein:vir:10 81 IQ--RGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTK 158 (432) T ss_pred ee--eccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccce Confidence 43 356899999999999999999999999999999999999999999999999999999999999877532 Q ss_pred eEEEEEecCceEEecHhHeeEecCc-CCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHH Q lcl|NC_019710. 162 VVYRYQRDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 162 ~~~~~~~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~ 240 (424) .+|.+..++....|+++||||+|++ +.++++|+||+.++..++....++++++.++|+||++|+++|+.+.. .++++. T Consensus 159 ~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~-l~~e~~ 237 (432) T protein:vir:10 159 MWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGD-LNEDAK 237 (432) T ss_pred EEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCC-CCHHHH Confidence 3466677777889999999999964 67899999999999999999999999999999999999999999866 456677 Q ss_pred HHHHHHHHHHhCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHH Q lcl|NC_019710. 241 SQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN 319 (424) Q Consensus 241 ~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~ 319 (424) +++++.|++.+++ .|+|+++++++|++|++++++++|+||+|.++++.++||++|||||.+||..++++ ++|+|++. 
T Consensus 238 ~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~--~s~~e~~~ 315 (432) T protein:vir:10 238 KVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT--LNNIEQQQ 315 (432) T ss_pred HHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHH Confidence 8888888876555 78999999999999999999999999999999999999999999999999887765 55999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhccChhhhc-cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc Q lcl|NC_019710. 320 LGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 320 ~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~-~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~gg 398 (424) ++|+++||.|++++||++|+++|+++.++. +++++||++.+++.|.+++++.+++++++|++|+||+|+++|+||+||| T Consensus 316 ~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~gg 395 (432) T protein:vir:10 316 QQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGG 395 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 999999999999999999999999988764 6889999999999999999999999999999999999999999999999 Q ss_pred Ceeeecccccchhhcccc--CCCccCCC Q lcl|NC_019710. 399 DVAMRQSQYVPITDLGTN--KEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~--~~~~~~g~ 424 (424) |++++|+|++|++.+++. 
+++++++- T Consensus 396 D~~~~~~n~~~~~~~~~~~~k~~~~~~~ 423 (432) T protein:vir:10 396 DRLLVNGNMLPIDMAGQAYLKGGDTNGE 423 (432) T ss_pred CeEeecccccchhhccccccCCCCCCCC Confidence 999999999999876542 11211111 No 12 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=100.00 E-value=3.4e-99 Score=560.45 Aligned_cols=406 Identities=23% Similarity=0.392 Sum_probs=352.4 Q ss_pred ccHHHHHHhhccCccccccccccccc----ccccc-cccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccC Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTG----PVSAH-GYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQND 88 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~-~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 88 (424) +|||+|++++|+..+...+....... ...+. ....+..++.+.++++++||+||++||++||++||++|++++++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 99999999988744433322221111 01111 22356678999999999999999999999999999999998766 Q ss_pred ccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCc------- Q lcl|NC_019710. 89 NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK------- 161 (424) Q Consensus 89 ~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~------- 161 (424) .. ...+|++.+||+.+||++||+++||+.++.+++++||||++++|+..|++++||||+|.+|++..++.. 
T Consensus 81 ~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~ 158 (432) T protein:vir:10 81 IQ--RGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTK 158 (432) T ss_pred ee--eccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccce Confidence 43 356899999999999999999999999999999999999999999999999999999999999877532 Q ss_pred eEEEEEecCceEEecHhHeeEecCc-CCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHH Q lcl|NC_019710. 162 VVYRYQRDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 162 ~~~~~~~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~ 240 (424) .+|.+..++....|+++||||+|++ +.++++|+||+.++..++....++++++.++|+||++|+++|+.+.. .++++. T Consensus 159 ~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~-l~~e~~ 237 (432) T protein:vir:10 159 MWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGD-LNEDAK 237 (432) T ss_pred EEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCC-CCHHHH Confidence 3466677777889999999999964 67899999999999999999999999999999999999999999866 456677 Q ss_pred HHHHHHHHHHhCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHH Q lcl|NC_019710. 241 SQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN 319 (424) Q Consensus 241 ~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~ 319 (424) +++++.|++.+++ .|+|+++++++|++|++++++++|+||+|.++++.++||++|||||.+||..++++ ++|+|++. 
T Consensus 238 ~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~--~s~~e~~~ 315 (432) T protein:vir:10 238 KVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT--LNNIEQQQ 315 (432) T ss_pred HHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHH Confidence 8888888876555 78999999999999999999999999999999999999999999999999887765 55999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhccChhhhc-cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc Q lcl|NC_019710. 320 LGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 320 ~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~-~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~gg 398 (424) ++|+++||.|++++||++|+++|+++.++. +++++||++.+++.|.+++++.+++++++|++|+||+|+++|+||+||| T Consensus 316 ~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~gg 395 (432) T protein:vir:10 316 QQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGG 395 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 999999999999999999999999988764 6889999999999999999999999999999999999999999999999 Q ss_pred Ceeeecccccchhhcccc--CCCccCCC Q lcl|NC_019710. 399 DVAMRQSQYVPITDLGTN--KEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~--~~~~~~g~ 424 (424) |++++|+|++|++.+++. 
+++++++- T Consensus 396 D~~~~~~n~~~~~~~~~~~~k~~~~~~~ 423 (432) T protein:vir:10 396 DRLLVNGNMLPIDMAGQAYLKGGDTNGE 423 (432) T ss_pred CeEeecccccchhhccccccCCCCCCCC Confidence 999999999999876542 11211111 No 13 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=100.00 E-value=3.4e-99 Score=560.45 Aligned_cols=406 Identities=23% Similarity=0.392 Sum_probs=352.4 Q ss_pred ccHHHHHHhhccCccccccccccccc----ccccc-cccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccC Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTG----PVSAH-GYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQND 88 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~-~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 88 (424) +|||+|++++|+..+...+....... ...+. ....+..++.+.++++++||+||++||++||++||++|++++++ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~ 80 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYG 80 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcCccccchhhhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 99999999988744433322221111 01111 22356678999999999999999999999999999999998766 Q ss_pred ccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCc------- Q lcl|NC_019710. 89 NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK------- 161 (424) Q Consensus 89 ~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~------- 161 (424) .. ...+|++.+||+.+||++||+++||+.++.+++++||||++++|+..|++++||||+|.+|++..++.. 
T Consensus 81 ~~--~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~~~~~~~~ 158 (432) T protein:vir:10 81 IQ--RGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTK 158 (432) T ss_pred ee--eccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccce Confidence 43 356899999999999999999999999999999999999999999999999999999999999877532 Q ss_pred eEEEEEecCceEEecHhHeeEecCc-CCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHH Q lcl|NC_019710. 162 VVYRYQRDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 162 ~~~~~~~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~ 240 (424) .+|.+..++....|+++||||+|++ +.++++|+||+.++..++....++++++.++|+||++|+++|+.+.. .++++. T Consensus 159 ~~y~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~-l~~e~~ 237 (432) T protein:vir:10 159 MWYVVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGD-LNEDAK 237 (432) T ss_pred EEEEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCC-CCHHHH Confidence 3466677777889999999999964 67899999999999999999999999999999999999999999866 456677 Q ss_pred HHHHHHHHHHhCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHH Q lcl|NC_019710. 241 SQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN 319 (424) Q Consensus 241 ~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~ 319 (424) +++++.|++.+++ .|+|+++++++|++|++++++++|+||+|.++++.++||++|||||.+||..++++ ++|+|++. 
T Consensus 238 ~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~--~s~~e~~~ 315 (432) T protein:vir:10 238 KVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT--LNNIEQQQ 315 (432) T ss_pred HHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHH Confidence 8888888876555 78999999999999999999999999999999999999999999999999887765 55999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhccChhhhc-cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc Q lcl|NC_019710. 320 LGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 320 ~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~-~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~gg 398 (424) ++|+++||.|++++||++|+++|+++.++. +++++||++.+++.|.+++++.+++++++|++|+||+|+++|+||+||| T Consensus 316 ~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~gg 395 (432) T protein:vir:10 316 QQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGG 395 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 999999999999999999999999988764 6889999999999999999999999999999999999999999999999 Q ss_pred Ceeeecccccchhhcccc--CCCccCCC Q lcl|NC_019710. 399 DVAMRQSQYVPITDLGTN--KEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~--~~~~~~g~ 424 (424) |++++|+|++|++.+++. 
+++++++- T Consensus 396 D~~~~~~n~~~~~~~~~~~~k~~~~~~~ 423 (432) T protein:vir:10 396 DRLLVNGNMLPIDMAGQAYLKGGDTNGE 423 (432) T ss_pred CeEeecccccchhhccccccCCCCCCCC Confidence 999999999999876542 11211111 No 14 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=100.00 E-value=2.6e-99 Score=561.10 Aligned_cols=403 Identities=29% Similarity=0.502 Sum_probs=346.2 Q ss_pred ccHHHHHHhhccCcccccccccccccccc-cccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCcccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVS-AHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~ 92 (424) +|||++++.+........+. .....++ .....++..++.+.++++++|++||++||++||++||++|++++++. + T Consensus 1 Mg~f~~lf~r~~~~~~~~~~--~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~--~ 76 (414) T protein:vir:44 1 MVFFSGLFQRKSDAPVTTPA--ELADAIGLSYDTYTGKQISSQRAMRLTAVFSCVRVLAESVGMLPCNLYHLNGSLK--Q 76 (414) T ss_pred CchhhhhhccCccCcccchh--hHhHhhccCccccCCceechhhhhccHHHHHHHHHHHHHhccCceEEEEecCCce--e Confidence 99999986654332222221 1111121 22345678889999999999999999999999999999999876654 3 Q ss_pred ccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCc-eEEEEE-ecC Q lcl|NC_019710. 93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-VVYRYQ-RDS 170 (424) Q Consensus 93 ~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~-~~~~~~-~~~ 170 (424) ...+|+++++|+.+||++||+++||+.++.+++++||||++++++ .|++.+||||+|.+|++..++.+ ..|.+. 
.++ T Consensus 77 ~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~~~~y~~~~~~g 155 (414) T protein:vir:44 77 RATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-FGEVAELLPVDPGCVVPKLNSSWEPVYQVTFPDG 155 (414) T ss_pred ecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCceEEEEECCCCcEEEEEEecCc Confidence 456899999999999999999999999999999999999999887 69999999999999998877544 344443 455 Q ss_pred ceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019710. 171 EYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 171 ~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) ....|+++||||||+++.++++|+||+..+..++++..++++++.++|+||++|+++|+++.. .++++.+.+++.|++. T Consensus 156 ~~~~~~~~evih~~~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~e~~~~~~~~~~~~ 234 (414) T protein:vir:44 156 STDVLSQEDIWHVRTLTLDGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQT-LSDQAYERLKKDFEER 234 (414) T ss_pred eEEEEccccEEEecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC-CCHHHHHHHHHHHHHH Confidence 678899999999999999999999999999999999999999999999999999999999866 5677788888888776 Q ss_pred hCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHH Q lcl|NC_019710. 
251 AGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQP 329 (424) Q Consensus 251 ~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P 329 (424) +++ +|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.+.|+++||.| T Consensus 235 ~~g~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t--~~n~e~~~~~~~~~~l~P 312 (414) T protein:vir:44 235 HTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRAT--FNNIEELGLGFINYSLVP 312 (414) T ss_pred hcCccccCcceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHHHHHHHHHHHH Confidence 554 78999999999999999999999999999999999999999999999999877654 569999999999999999 Q ss_pred HHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccc Q lcl|NC_019710. 330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 330 ~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n~~~ 409 (424) ++++||++|+++|+++.++..++++||++.+++.|.+++++.+++++++|+||+||+|+++|+||+||||++++|+|+.+ T Consensus 313 ~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~~~~n~~~ 392 (414) T protein:vir:44 313 YLTRIEQRINTGLVRKSKQGVFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGDVYLTPMNMTT 392 (414) T ss_pred HHHHHHHHHHhhcCCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceecccccccc Confidence 99999999999999998877789999999999999999999999999999999999999999999999999999999986 Q ss_pred hhhcccc--CCCccCCC Q lcl|NC_019710. 410 ITDLGTN--KEPRNNGA 424 (424) Q Consensus 410 ~~~~~~~--~~~~~~g~ 424 (424) ....... 
++++++.+ T Consensus 393 ~~~~~~~~~~~~~~~~~ 409 (414) T protein:vir:44 393 KPSDGSKAGKQKDNANA 409 (414) T ss_pred cCCccccCCCCCCCCCC Confidence 5543332 22222222 No 15 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=100.00 E-value=9e-99 Score=558.12 Aligned_cols=415 Identities=23% Similarity=0.379 Sum_probs=350.7 Q ss_pred CCCCCcccccCC--CccHHHHHHhhccCccccccccccccccccc-ccccCCccccHHHHhhhHHHHHHHHHHHHhhhhC Q lcl|NC_019710. 1 MEEPKYTIDLRT--NNGWWARLKSWFVGGRLVTPNQGSQTGPVSA-HGYLGDSSINDERILQISTVWRCVSLISTLTACL 77 (424) Q Consensus 1 ~~~~~~~~~~~~--~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~ 77 (424) |----++--..- .+.+|++|+ .+.....+..+.....+.. ..+.++..++.+.|+++++|++||++||++||++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~lf---~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~al~~~~v~~cv~~Ia~~iA~l 77 (424) T protein:vir:45 1 MLYCWWAHWLWPEGGRVLLDALF---RSKSLENPSTPITGDAVDTDGLFRADVYVSPETAMKLAAVYSCIYVLSSSLAQM 77 (424) T ss_pred CeeEeeeceecCcchhHHHHhhc---cccCCCCCccccchhhhhhhccccCCceechHHhhccHHHHHHHHHHHHHHhhC Confidence 211111111110 134455544 3333333333322222222 2345678899999999999999999999999999 Q ss_pred ceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEE Q lcl|NC_019710. 78 PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKL 157 (424) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~ 157 (424) ||++|++++ |+.+ ...+|+++++|+.+||++||+++||+.++.+++++||||+++.|+..|++++|||++|..|++.. 
T Consensus 78 p~~v~~~~~-~~~~-~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~i~~ 155 (424) T protein:vir:45 78 PLHVMRRHK-GKVE-PARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDCCMPWETTLMN 155 (424) T ss_pred ceEEEEecC-Ccee-ecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEE Confidence 999999864 3333 35679999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCceEEEEEecCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH Q lcl|NC_019710. 158 VGKKVVYRYQRDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE 237 (424) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~ 237 (424) +++...|.+...+....|+++||||||+++.++++|+||+.++.++++++.++++++.++|+||++|++||+.+.. .++ T Consensus 156 ~~~~~~y~~~~~~~~~~~~~~eVih~r~~~~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~ 234 (424) T protein:vir:45 156 TGGRYTYGLYNEYGAFAISPDDMIHIRALGNNQKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSG-LNK 234 (424) T ss_pred cCCeEEEEEEecCceEEECcccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC-CCH Confidence 9988888888887888999999999999999999999999999999999999999999999999999999999876 567 Q ss_pred HHHHHHHHHHHHHhCC--cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccH Q lcl|NC_019710. 
238 QQRSQVEENFKEIAGG--PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGI 315 (424) Q Consensus 238 ~~~~~~~~~~~~~~~~--~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~ 315 (424) ++.+.+++.|++.+++ +|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||+.++++ ++|+ T Consensus 235 e~~~~~~~~~~~~~~g~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~sn~ 312 (424) T protein:vir:45 235 ESWGWLKDQWQKASQALRRQENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKAT--FSNI 312 (424) T ss_pred HHHHHHHHHHHHHhccccccCCceeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccH Confidence 7888888888876654 58999999999999999999999999999999999999999999999999987765 4699 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhc-cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Q lcl|NC_019710. 316 EQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPP 394 (424) Q Consensus 316 e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~-~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p 394 (424) |++.+.|+++||.|++++||++||++|+++.++. +++++||++.+++.|.+++++.+++++++|+||+||+|+++|+|| T Consensus 313 eq~~~~f~~~tL~P~~~~ie~~ln~kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~p 392 (424) T protein:vir:45 313 SAQAIQFVRYTMMPWVTNWEQELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNP 392 (424) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC Confidence 9999999999999999999999999999988764 588999999999999999999999999999999999999999999 Q ss_pred CCCcCeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 395 LPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 395 ~~ggd~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) +||||++++|+|+.+..... ++.+.++|. 
T Consensus 393 i~ggD~~~~~~n~~~~~~~~-~~~~~~~~~ 421 (424) T protein:vir:45 393 VEGLDEMLVSVNAANPAGDF-KPPKNDEGK 421 (424) T ss_pred CCCcceeeeccccccccccc-CCCCCCCCC Confidence 99999999999998744321 111122222 No 16 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=100.00 E-value=5.3e-99 Score=559.37 Aligned_cols=402 Identities=25% Similarity=0.438 Sum_probs=345.1 Q ss_pred cHHHHHHhhccCccccccccccc-ccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 15 GWWARLKSWFVGGRLVTPNQGSQ-TGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 15 G~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) =||+|......+ .+.+....+ ...++.....+++.++.+.|+++++|++||++||++||++||++|+++.++. +. T Consensus 1 ~~~~r~~~~~~~--~~~~~~~~~~~~~~g~~~s~~~~~vt~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~--~~ 76 (419) T protein:vir:14 1 MFFSRQLLSNLG--QTQMSAGGWVSALLGSSRSDSGQVVTPASALALTVLQNCVTLLAESIAQLPIELYERSGEDR--KP 76 (419) T ss_pred Cccccccccccc--ccccCcchhhHHhhcCCCccCCcccchHHhhccHHHHHHHHHHHHhhccCceEEEEecCCcc--cc Confidence 233333222211 112222222 2233344456788899999999999999999999999999999999876653 34 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCceEEEEEecCceE Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEYA 173 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~~~~~~~~ 173 (424) ..+|++.++|+.+||++||+++||+.++.+++++||||+++.|+.+|.+++|||++|.+|++..++++..++...+ .. 
T Consensus 77 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~v~v~~~~~~~~~y~~~~--~~ 154 (419) T protein:vir:14 77 ATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEAVTVMRGSDLKPVYRVRG--SD 154 (419) T ss_pred ccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCceEEEEEcc--Cc Confidence 5689999999999999999999999999999999999999999999999999999999999988876654333222 23 Q ss_pred EecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC---CHHHHHHHHHHHHHH Q lcl|NC_019710. 174 DFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL---TEQQRSQVEENFKEI 250 (424) Q Consensus 174 ~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~---~~~~~~~~~~~~~~~ 250 (424) .++.++|+|+++++.++++|+||+.++..+++...++++++.++|+||++|+|+|+.+.... ++++.+.+++.|++. T Consensus 155 ~~~~~~i~h~~~~~~dg~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (419) T protein:vir:14 155 PMPQRLVHHVRWMSINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGWNAK 234 (419) T ss_pred ccchhheeEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHH Confidence 47899999999999999999999999999999999999999999999999999999886543 578888899999876 Q ss_pred hCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHH Q lcl|NC_019710. 
251 AGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQP 329 (424) Q Consensus 251 ~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P 329 (424) +++ .|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.+.|+++||.| T Consensus 235 ~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t--~s~~E~~~~~f~~~~L~P 312 (419) T protein:vir:14 235 FGGSGNAKKVALLQEGMTFRPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERAT--FSNIEHQSLQFVIYTLLP 312 (419) T ss_pred hcCccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCC--cccHHHHHHHHHHHHHHH Confidence 655 68999999999999999999999999999999999999999999999999877665 568999999999999999 Q ss_pred HHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccc Q lcl|NC_019710. 330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 330 ~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n~~~ 409 (424) ++++||++|+++|+++.++.+++++||++.+++.|.+++++.+++++++|+||+||+|+++|+||+||||++++|+|+++ T Consensus 313 ~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD~~~~~~n~~~ 392 (419) T protein:vir:14 313 WVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGDIYLSPMNMVD 392 (419) T ss_pred HHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccc Confidence 99999999999999998887889999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccCCCccCCC Q lcl|NC_019710. 
410 ITDLGTNKEPRNNGA 424 (424) Q Consensus 410 ~~~~~~~~~~~~~g~ 424 (424) ++.....+.++.+.+ T Consensus 393 ~~~~~~~~~~~~~~~ 407 (419) T protein:vir:14 393 ASKPQQLPVGKSEPT 407 (419) T ss_pred ccccccccCCCCCCc Confidence 887655432222211 No 17 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=100.00 E-value=1.3e-98 Score=557.32 Aligned_cols=406 Identities=22% Similarity=0.382 Sum_probs=350.6 Q ss_pred ccHHHHHHhhccCcccccccc-cccccccccc-cccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQ-GSQTGPVSAH-GYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK 91 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~-~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~ 91 (424) +|||+++++++++.+...... ........+. ....+..++.+.++++++|++||++||++||++||++|++++++.. T Consensus 1 M~~~~~~f~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~l~~~~~~~~~~~~~- 79 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTISVKGKNALKVATVFACIKILSESVSKLPLKIYQEDEYGIQ- 79 (429) T ss_pred CchhhhhhcccccCcccccccCCChHHHHHHhcCCCCcceechhhhhccHHHHHHHHHHHHhhccCceEEEEecCCcee- Confidence 999999998776443211110 0111111111 1234667889999999999999999999999999999998766643 Q ss_pred cccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCc-------eEE Q lcl|NC_019710. 92 KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-------VVY 164 (424) Q Consensus 92 ~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~-------~~~ 164 (424) ...+|+++++|+.+||++||+++||+.++.+++++||||+++.|+.+|.+++|||++|++|++..++.. 
.+| T Consensus 80 -~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~~~~~~~~ 158 (429) T protein:vir:10 80 -RGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLNSKTKMWY 158 (429) T ss_pred -eccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccccceEEE Confidence 456799999999999999999999999999999999999999999999999999999999999887543 345 Q ss_pred EEEecCceEEecHhHeeEecCc-CCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHH Q lcl|NC_019710. 165 RYQRDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) Q Consensus 165 ~~~~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~ 243 (424) .+..++..+.|+++||||||++ +.++++|+||+..+..+++...++++++.++|+||++|+++|+.+.. .++++.+.+ T Consensus 159 ~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~-l~~e~~~~~ 237 (429) T protein:vir:10 159 VVNTGGQQRVLKPEEILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGD-LNEDAKKVF 237 (429) T ss_pred EEccCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCC-CCHHHHHHH Confidence 6667777889999999999964 67889999999999999999999999999999999999999999865 466778888 Q ss_pred HHHHHHHhCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHH Q lcl|NC_019710. 
244 EENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF 322 (424) Q Consensus 244 ~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f 322 (424) ++.|++.+++ .|+|+++++++|++|++++.++.|+|++|.+++++++||++|||||.+||+.++++ ++|+|++.++| T Consensus 238 ~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~--~sn~e~~~~~f 315 (429) T protein:vir:10 238 RENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKAT--LNNIEQQQQQF 315 (429) T ss_pred HHHHHHHhccccccCceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHHHHHHHH Confidence 8888876554 78999999999999999999999999999999999999999999999999887765 45999999999 Q ss_pred HHHHHHHHHHHHHHHHhhhccChhhhc-cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCee Q lcl|NC_019710. 323 LQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVA 401 (424) Q Consensus 323 ~~~tl~P~~~~ie~~l~~~L~~~~~~~-~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~ 401 (424) +++||.|++++||++|+++|+++.++. +++++||++.+++.|.+++++.+++++++|+||+||+|+++|+||+||||++ T Consensus 316 ~~~~l~P~~~~ie~~ln~kl~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~ 395 (429) T protein:vir:10 316 YTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRL 395 (429) T ss_pred HHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCee Confidence 999999999999999999999988764 6889999999999999999999999999999999999999999999999999 Q ss_pred eecccccchhhcccc--CCCccCCC Q lcl|NC_019710. 402 MRQSQYVPITDLGTN--KEPRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~~~~--~~~~~~g~ 424 (424) ++|+|++|++.+++. +++++++. 
T Consensus 396 ~~~~n~~~~d~~~~~~~k~g~~~~~ 420 (429) T protein:vir:10 396 LVNGNMLPIDMAGQAYLKGGDTNGE 420 (429) T ss_pred eecccccchhhccccccCCCCCCCC Confidence 999999999876432 22322222 No 18 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=100.00 E-value=5.6e-98 Score=553.76 Aligned_cols=402 Identities=25% Similarity=0.406 Sum_probs=351.0 Q ss_pred ccHHHHHHhhccCcccccccccc-------cccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGS-------QTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQ 86 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~ 86 (424) +|||++++++..+.......... ..+.+..++...+..++...++++++|++||++||++||++|++++++++ T Consensus 1 MG~f~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~al~~~~v~~ci~~ia~~iA~lp~~~~~~~~ 80 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRSNYDEDIGIDISDSNFWEKFGIKLNFSVRGKRALKENTVYVCTKIRAESIGKLSLKIYKDKE 80 (422) T ss_pred CchhhhhhhccCCccchhhhhhhccccccCcchhhhhccccCCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEecCc Confidence 99999998877755443322111 11122333444566788999999999999999999999999999998653 Q ss_pred cCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCc----- Q lcl|NC_019710. 87 NDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK----- 161 (424) Q Consensus 87 ~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~----- 161 (424) . 
..+|++.++|+.+||++||+++||+.++.+++++||||+++.|+.+|++++|+|++|.+|++..+.++ T Consensus 81 ~------~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~~~~~~~~~~~~~ 154 (422) T protein:vir:13 81 E------YKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNVTKIIDDDNFLSSL 154 (422) T ss_pred c------cccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCcceecc Confidence 2 34689999999999999999999999999999999999999999999999999999999999887654 Q ss_pred --eEEEEE-ecCceEEecHhHeeEecCc-CCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH Q lcl|NC_019710. 162 --VVYRYQ-RDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE 237 (424) Q Consensus 162 --~~~~~~-~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~ 237 (424) .+|.+. .++....++++||||++++ +.++++|+||+..+..++.+..++++++.++|+||++|+|+|+++.. .++ T Consensus 155 ~~~~y~~~~~~g~~~~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~ 233 (422) T protein:vir:13 155 SKVWYVVTDKNGKEHKLLPDEMLHFIGDITLDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVGD-LDE 233 (422) T ss_pred ceEEEEEEeCCCeEEEEcccceEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC-CCH Confidence 334443 4456778999999999964 67899999999999999999999999999999999999999999865 567 Q ss_pred HHHHHHHHHHHHHhCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH Q lcl|NC_019710. 
238 QQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 238 ~~~~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e 316 (424) ++.+.+++.|++.+++ .|+++++++++|++|++++++++|+||+|.+++++++||++|||||.+||+.++++ ++|+| T Consensus 234 e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~--~sn~e 311 (422) T protein:vir:13 234 KAKKIFKKEFESMSNGLENAHSISLLPFGYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERAT--FNNLT 311 (422) T ss_pred HHHHHHHHHHHHHhcCccccCCceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC--cccHH Confidence 7788888888877655 68999999999999999999999999999999999999999999999999887765 55999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhccChhhhc-cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Q lcl|NC_019710. 317 QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPL 395 (424) Q Consensus 317 ~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~-~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~ 395 (424) ++.++|+++||.|++++||++|+++|+++.++. +++++||++.+++.|.+++++.+++++++|+||+||+|+++|+||+ T Consensus 312 ~~~~~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~ 391 (422) T protein:vir:13 312 EQQKDFYVTTLQSSLTVYEQEIQDKLFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENLPPV 391 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhCChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 999999999999999999999999999998864 5889999999999999999999999999999999999999999999 Q ss_pred CCcCeeeecccccchhhcccc-CCCccCCC Q lcl|NC_019710. 
396 PGGDVAMRQSQYVPITDLGTN-KEPRNNGA 424 (424) Q Consensus 396 ~ggd~~~~~~n~~~~~~~~~~-~~~~~~g~ 424 (424) ||||++++++|++|++.++++ +++.+.|+ T Consensus 392 ~ggD~~~~~~n~~~l~~~~~~~~~~g~~~g 421 (422) T protein:vir:13 392 EGGDRLLVNGNMIPIEMAGEQYKKGGEKGG 421 (422) T ss_pred CCcCeeeeccCccchhhcccccccCCCcCC Confidence 999999999999999998765 34444444 No 19 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=100.00 E-value=1.2e-97 Score=551.95 Aligned_cols=402 Identities=23% Similarity=0.364 Sum_probs=345.9 Q ss_pred cHHHHHHhhccCcccccc-cccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 15 GWWARLKSWFVGGRLVTP-NQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 15 G~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) =||+|+++.....+.... ........++...+..+..++.+.++++|+||+||++||++||+|||++|+++++|.. . T Consensus 1 m~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~Ia~~ia~l~~~~~~~~~~~~~--~ 78 (416) T protein:vir:12 1 MLLERMFEKRSGSSDHEDGFNNILLNMFGGRKTASGERVSESNSLVQPDIFACVNVLSDDIAKLPIHTYKRTDGGIE--R 78 (416) T ss_pred CccchhcccccCccccCccchhHHHHhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCccc--c Confidence 344454433222111111 1111223333444566788999999999999999999999999999999998765543 3 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcC--CceEEEEEecCc Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG--KKVVYRYQRDSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~--~~~~~~~~~~~~ 171 (424) ..+|++.++|+.+||++||+++||+.++.+++++||||+++.|+..|.+.+||||+|.+|++..+. +..+|.+..++. 
T Consensus 79 ~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~g~ 158 (416) T protein:vir:12 79 KPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRPDYTNAYVHPTTGMLWYQTVLNGK 158 (416) T ss_pred ccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEeCCCcEEEEEEecCCe Confidence 457999999999999999999999999999999999999999999999999999999999977654 445677777888 Q ss_pred eEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_019710. 172 YADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIA 251 (424) Q Consensus 172 ~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~ 251 (424) .++|+++||||+|+++.++++|+||+.++..++.+..++++++.++|+||+.|++||+.+.. .++++.+++++.|+... T Consensus 159 ~~~~~~~eiih~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~-~~~e~~~~~~~~~~~~~ 237 (416) T protein:vir:12 159 AIELYDYEVLHFKGLSTDGIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPAF-LDEKPKENVRKEWKRVN 237 (416) T ss_pred EEEecCccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCCC-CCHHHHHHHHHHHHHHh Confidence 89999999999999999999999999999999999999999999999999999999999865 56778888888888764 Q ss_pred CCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 
252 GGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYI 331 (424) Q Consensus 252 ~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~ 331 (424) ++++++++++|++|++++++++|+||+|.++++.++||++|||||.+||...+++ ++|+|++.++|+++||.|++ T Consensus 238 ---~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~sn~e~~~~~f~~~~l~P~~ 312 (416) T protein:vir:12 238 ---KVENIAIIDYGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKAT--FSNIEHQSIEYVRNTLQPWI 312 (416) T ss_pred ---cCCCeeecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCC--cccHHHHHHHHHHHHHHHHH Confidence 4578999999999999999999999999999999999999999999999877665 56999999999999999999 Q ss_pred HHHHHHHhhhccChhhhc-cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccch Q lcl|NC_019710. 332 SRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPI 410 (424) Q Consensus 332 ~~ie~~l~~~L~~~~~~~-~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n~~~~ 410 (424) ++||++|+++|+++.++. +++++||++.+++.|.+++++.+.+++++|+||+||+|+++|+||+||||++++|+|++++ T Consensus 313 ~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~n~~~~ 392 (416) T protein:vir:12 313 VNFEQELNVKLFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIENGDKYISSLNYVFL 392 (416) T ss_pred HHHHHHHHHhhcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccc Confidence 999999999999988754 6889999999999999999999999999999999999999999999999999999999999 Q ss_pred hhccccCCCccCCC Q lcl|NC_019710. 
411 TDLGTNKEPRNNGA 424 (424) Q Consensus 411 ~~~~~~~~~~~~g~ 424 (424) +...+++....+++ T Consensus 393 ~~~~~~~~~~~~~~ 406 (416) T protein:vir:12 393 DFLEEYQRLKAGGA 406 (416) T ss_pred cccchhhccccccc Confidence 87765443322211 No 20 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=100.00 E-value=1.6e-97 Score=551.28 Aligned_cols=401 Identities=28% Similarity=0.504 Sum_probs=346.3 Q ss_pred cHHHHHHhhccCccccccccc-cccccccc-ccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCcccc Q lcl|NC_019710. 15 GWWARLKSWFVGGRLVTPNQG-SQTGPVSA-HGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 15 G~~~~~~~~~~~~~~~~~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~ 92 (424) =||+++ |++++....... .....++. .....+..++.+.|+++++|++||++||+++|++||+++++++++.. T Consensus 1 ~~f~~~---f~r~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~-- 75 (413) T protein:vir:48 1 MFFSGL---FQRKSDAPVTTPAELAEAIGLSYDTYTGKRISSQRAMRLTAVYSCVRVLAESVGMLPCSLYKISGTLKT-- 75 (413) T ss_pred Cccchh---hccCccCCccchHHHHHhhhcCcccccCceechhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcce-- Confidence 233333 333332222221 22222221 22346677899999999999999999999999999999998765543 Q ss_pred ccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCce-EEEE-EecC Q lcl|NC_019710. 93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKV-VYRY-QRDS 170 (424) Q Consensus 93 ~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~-~~~~-~~~~ 170 (424) ...+|++.++|+.+||++||+++||+.++.+++++||||++++++ .|+|.+|||++|.+|++..++.+. 
.|.+ ..++ T Consensus 76 ~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~~~~y~~~~~~g 154 (413) T protein:vir:48 76 RVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA-LGEVVELLPIDPGCVEPKLNSQWQPVYQVTFPDG 154 (413) T ss_pred eecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC-CCcEEEEEEEcCceEEEEEcCCceEEEEEEecCc Confidence 356799999999999999999999999999999999999999987 599999999999999998876543 3433 3455 Q ss_pred ceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019710. 171 EYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 171 ~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) ....|+++||||+|+++.++++|+||+..+..+++...++++++.++|+||++|+++|+.+... ++++.+++++.|++. T Consensus 155 ~~~~~~~~evih~~~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~~-~~e~~~~~~~~~~~~ 233 (413) T protein:vir:48 155 SVDVLTQDEIWHVRTLTLDGLVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQKL-TPDAYERLKKDFEER 233 (413) T ss_pred eEEEEccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCC-CHHHHHHHHHHHHHH Confidence 6678999999999999999999999999999999999999999999999999999999998664 567788888888776 Q ss_pred hCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHH Q lcl|NC_019710. 
251 AGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQP 329 (424) Q Consensus 251 ~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P 329 (424) +++ .|+|+++++++|++|++++.+++|+||.|.+++++++||++|||||.+||..++++ ++|+|++.+.|+++||.| T Consensus 234 ~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~~n~e~~~~~f~~~~i~P 311 (413) T protein:vir:48 234 HTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRAT--FNNIEELGLGFINYSLVP 311 (413) T ss_pred hcCccccCcceecCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCC--cccHHHHHHHHHHHHHHH Confidence 554 78999999999999999999999999999999999999999999999999876654 569999999999999999 Q ss_pred HHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccc Q lcl|NC_019710. 330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 330 ~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n~~~ 409 (424) ++++||++||++|+++.++.+++++||++.+++.|.+++++.+++++++|++|+||+|+++|+||+||||++++|+|+++ T Consensus 312 ~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~ 391 (413) T protein:vir:48 312 YLTRIEQRINTGLVRESKQGKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGDVYLTPMNMTT 391 (413) T ss_pred HHHHHHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeccccccc Confidence 99999999999999988877889999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccCCCccCCC Q lcl|NC_019710. 
410 ITDLGTNKEPRNNGA 424 (424) Q Consensus 410 ~~~~~~~~~~~~~g~ 424 (424) +..++++.++..+++ T Consensus 392 ~~~~~~~~~~~~~~~ 406 (413) T protein:vir:48 392 SPSAGDDNGKKKESG 406 (413) T ss_pred cccccccCCCCCCCC Confidence 998887765555555 No 21 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=100.00 E-value=1.5e-97 Score=551.39 Aligned_cols=402 Identities=24% Similarity=0.430 Sum_probs=343.7 Q ss_pred cHHHHHHhhccCcccccccccc-cccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 15 GWWARLKSWFVGGRLVTPNQGS-QTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 15 G~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) =||++.+... .....+.... ....++...+.++..|+.+.++++++||+||++||++||++||++|++++++. +. T Consensus 1 m~~~~~~~~~--~~~~~~~~~~~~~~~~g~~~s~~~~~v~~~~al~~~~v~~cv~~ia~~ia~lp~~~~~~~~~~~--~~ 76 (419) T protein:vir:80 1 MFFSRQLLSN--LGQTQPGSGGWVSALLGSARSEAGQVVTPASALSLTVLQNCVTLLAESIAQLPVELYERSGDDR--KP 76 (419) T ss_pred CCcccccccc--cCcCCCCcchhhHHhhcccccccCcccChHHhhccHHHHHHHHHHHHhhccCceEEEEecCCCc--cc Confidence 1222221111 1111111111 12223344456788899999999999999999999999999999999987764 34 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCceEEEEEecCceE Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEYA 173 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~~~~~~~~ 173 (424) ..+|++.++|+.+||++||+++||+.++.+++++||||+++.|+.+|++.+||||+|.+|++..++++..++ ...+ .. 
T Consensus 77 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~~~v~i~~~~~~~~~y-~~~~-~~ 154 (419) T protein:vir:80 77 ATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDNEAVTVMKGPDLKPMY-RVAG-AD 154 (419) T ss_pred ccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCceEEE-EEcC-cc Confidence 567999999999999999999999999999999999999999999999999999999999999887665433 2222 23 Q ss_pred EecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCC---CCHHHHHHHHHHHHHH Q lcl|NC_019710. 174 DFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV---LTEQQRSQVEENFKEI 250 (424) Q Consensus 174 ~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~---~~~~~~~~~~~~~~~~ 250 (424) .+++++|+|+++++.++++|+||+.++..+++...++++++.++|+||++|+++|+.+... .++++.+++++.|++. T Consensus 155 ~~~~~~i~h~~~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (419) T protein:vir:80 155 PLPQRLVHHVRWMSINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRITDGWNAK 234 (419) T ss_pred ccchhheEEecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHHHHHHH Confidence 5899999999999999999999999999999999999999999999999999999987543 3577888889999877 Q ss_pred hCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHH Q lcl|NC_019710. 
251 AGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQP 329 (424) Q Consensus 251 ~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P 329 (424) +++ .|+|+++++++|++|++++.+++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.+.|+++||.| T Consensus 235 ~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t--~~n~e~~~~~f~~~~l~P 312 (419) T protein:vir:80 235 FGGSGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERAT--FSNIEHQSLQFVIYTLLP 312 (419) T ss_pred hcCccccCCceecCCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCC--cccHHHHHHHHHHHHHHH Confidence 655 68999999999999999999999999999999999999999999999999877665 569999999999999999 Q ss_pred HHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccc Q lcl|NC_019710. 330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 330 ~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n~~~ 409 (424) ++++||++|+++|+++.++..++++||++.+++.|.+++++.+++++++|++|+||+|+++|+||+||||++++|+|+++ T Consensus 313 ~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gGD~~~~~~n~~~ 392 (419) T protein:vir:80 313 WVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGDIYLSPMNMVD 392 (419) T ss_pred HHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecccccc Confidence 99999999999999998887889999999999999999999999999999999999999999999999999999999998 Q ss_pred hhhccccCCCccCCC Q lcl|NC_019710. 
410 ITDLGTNKEPRNNGA 424 (424) Q Consensus 410 ~~~~~~~~~~~~~g~ 424 (424) ++...+.+.++.+-+ T Consensus 393 ~~~~~~~~~~~~~~~ 407 (419) T protein:vir:80 393 ASKPQPIPMGKTEPT 407 (419) T ss_pred ccccccccCCCCCch Confidence 877665443333322 No 22 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=100.00 E-value=2.3e-97 Score=550.43 Aligned_cols=400 Identities=25% Similarity=0.417 Sum_probs=338.3 Q ss_pred ccHHHHHHhhccCccccccc----c----cc--c-----c-------cccccccccCCccccHHHHhhhHHHHHHHHHHH Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPN----Q----GS--Q-----T-------GPVSAHGYLGDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~----~----~~--~-----~-------~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia 71 (424) +|+|++|++.........+. . +. + . ..+...++.++..++.+.++++++|++||++|| T Consensus 1 Mgl~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~ci~~Ia 80 (431) T protein:vir:10 1 MGLFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTGRETRALRNMAVLRCVTLIS 80 (431) T ss_pred CcchhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCccCcceechhhhhccHHHHHHHHHHH Confidence 99999987643322111110 0 00 0 0 001122344567789999999999999999999 Q ss_pred HhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccc Q lcl|NC_019710. 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSA 151 (424) Q Consensus 72 ~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~ 151 (424) ++||++||++|++++. .+...+|++.+||+.+||++||+++||+.++.+++++||||+++.|+. |.+++|||++|. 
T Consensus 81 ~~iA~lp~~v~~~~~~---~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g~~~~L~pl~~~ 156 (431) T protein:vir:10 81 GTIGMLPMNLISSDDS---KQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSG-NRPIRLIPMDRG 156 (431) T ss_pred HhhccCceEEEEecCc---eeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-CceEEEEEEcCc Confidence 9999999999987432 234567999999999999999999999999999999999999999985 899999999999 Q ss_pred eEEEEEcCCc-eEEEEE-ecCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEE Q lcl|NC_019710. 152 NMDVKLVGKK-VVYRYQ-RDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS 229 (424) Q Consensus 152 ~v~~~~~~~~-~~~~~~-~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~ 229 (424) +|++..+.++ .+|.+. .++....|+++||||||+++.|+++|+||+.++..++.++.+++++..++|+||++|++||+ T Consensus 157 ~v~~~~~~~~~~~y~~~~~~g~~~~~~~~dViHir~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~ 236 (431) T protein:vir:10 157 SAKGRLTSTWQIVYDYTTPTGDKIELPAREVFHLRDLSIDGVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIE 236 (431) T ss_pred eeEEEEcCCCeEEEEEEeCCceEEEEchhhEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEe Confidence 9998776443 445443 45567889999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCCCHHHHHHHHHHHHHHh-CCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCC Q lcl|NC_019710. 230 TGEKVLTEQQRSQVEENFKEIA-GGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKS 308 (424) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~-~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~ 308 (424) ++.. 
.++++.+++++.|++.+ |.+|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||+.+++ T Consensus 237 ~~~~-ls~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~ 315 (431) T protein:vir:10 237 VPKE-LSDNAYGRMKASVQENHTGSENAGSWMLLEEGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTS 315 (431) T ss_pred cCCC-CCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCC Confidence 9865 56777888888887655 457999999999999999999999999999999999999999999999999987655 Q ss_pred CcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCC----CcCHH Q lcl|NC_019710. 309 TSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESG----LRTIN 384 (424) Q Consensus 309 ~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g----~~t~N 384 (424) +++|+|++.++|+++||.|++++||++|+++|+++.++..++++||++.+++.|.++|++.++++++.| |||+| T Consensus 316 --t~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~lT~N 393 (431) T protein:vir:10 316 --WGSGIEQLAIFFIQYGLSHWFVSWEQAAARAFLPEKMLGQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPWMKQN 393 (431) T ss_pred --ccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccChhhcCCceEEEechhhhccCHHHHHHHHHHHHhcccccCccCHH Confidence 456999999999999999999999999999999988877889999999999999999999999998665 59999 Q ss_pred HHHHHhCCCCCCC--cCeeeecccccchhhccccCCCccC Q lcl|NC_019710. 385 EMRRTDNLPPLPG--GDVAMRQSQYVPITDLGTNKEPRNN 422 (424) Q Consensus 385 E~R~~lg~~p~~g--gd~~~~~~n~~~~~~~~~~~~~~~~ 422 (424) |+|+++|+||+++ ||++++|.|..+.++..+. 
|..- T Consensus 394 E~R~~~gl~p~~~~~gD~~~~p~n~~~~~~~~~~--p~~~ 431 (431) T protein:vir:10 394 EVREMLDLPRADDPVADQLRNPMTQKQKGSGDEP--PATT 431 (431) T ss_pred HHHHHhCCCCCCCccccceecccccccCCCCCCC--CCCC Confidence 9999999999955 9999999998876543221 1111 No 23 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=100.00 E-value=2.9e-97 Score=549.83 Aligned_cols=401 Identities=23% Similarity=0.295 Sum_probs=339.8 Q ss_pred HHHHHHhhccCcccccccccc-cccc------cccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccC Q lcl|NC_019710. 16 WWARLKSWFVGGRLVTPNQGS-QTGP------VSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQND 88 (424) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~-~~~~------~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 88 (424) ||+.++..........+.... +... .....+.++..|+.+.+|++++|++||++||++||++||++|+++++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~v~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~g 80 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAWQQGVKADPEAVLSFHAVFACISLISQDIAKMRLRLMQTDAQG 80 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcchhhcCcccChHHhhccHHHHHHHHHHHHhhccCceEEEEeccCC Confidence 555443211111111111111 1111 112335567889999999999999999999999999999999998887 Q ss_pred ccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCC-ceEEEEE Q lcl|NC_019710. 89 NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK-KVVYRYQ 167 (424) Q Consensus 89 ~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~-~~~~~~~ 167 (424) ..++. .+|+ .++|+.+||++||+++||+.++.+++++||||++++|+.+|.+.+|||++|.+|++..+.+ ..+|.+. 
T Consensus 81 ~~~~~-~~~~-~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~y~~~ 158 (454) T protein:vir:93 81 IRRET-RRGD-IARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILDWNRVEPLVADDGEVFYRIT 158 (454) T ss_pred ccchh-hhHH-HHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCcceEEEEcCCCcEEEEEE Confidence 65543 3444 5566679999999999999999999999999999999999999999999999999887754 4555554 Q ss_pred ecC-----ceEEecHhHeeEecC-cCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHH Q lcl|NC_019710. 168 RDS-----EYADFSQKEIFHLKG-FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRS 241 (424) Q Consensus 168 ~~~-----~~~~~~~~evih~r~-~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~ 241 (424) ... ....|+++||||+|+ ++.++++|+||+..+..++.+..++++++.++|+||++|+++|+++.. .++++.+ T Consensus 159 ~~~~~~~~~~~~~~~~eViH~k~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~e~~~ 237 (454) T protein:vir:93 159 PDRNCGITEAVTVPAREVIHDRFNCFFHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPGS-ITEENAK 237 (454) T ss_pred eccccccceeEEecCcceEEeccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCC-CCHHHHH Confidence 432 356899999999995 567899999999999999999999999999999999999999999865 5677889 Q ss_pred HHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHH Q lcl|NC_019710. 
242 QVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG 321 (424) Q Consensus 242 ~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~ 321 (424) ++++.|++..++.|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.++ T Consensus 238 ~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t--~sn~e~~~~~ 315 (454) T protein:vir:93 238 KLKSNWDSGYTGENAGKTAILSNGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPS--SDNVEALEQQ 315 (454) T ss_pred HHHHHHHHHhcccccCCceeccCCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCc--chhHHHHHHH Confidence 99999999988899999999999999999999999999999999999999999999999999887665 5699999999 Q ss_pred HHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCee Q lcl|NC_019710. 322 FLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVA 401 (424) Q Consensus 322 f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~ 401 (424) |+++||.|+++.||++|+++|+++.+ ++++||++.+++.|.+++++.+.+++++|+||+||+|+++|+||+||||++ T Consensus 316 f~~~~l~P~~~~ie~~ln~~L~~~~~---~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~ 392 (454) T protein:vir:93 316 YYSQCLQTLIESIELLLDEALETGEN---ESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDAL 392 (454) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCC---cEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCee Confidence 99999999999999999999997643 579999999999999999999999999999999999999999999999999 Q ss_pred eecccccchhhccccCCCccCCC Q lcl|NC_019710. 402 MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) ++++|+++++.+++.+..++... 
T Consensus 393 ~~~~~~~~~~~~~~~~~~~~~~~ 415 (454) T protein:vir:93 393 YLQQQNYSLEALSRRDAREDPFA 415 (454) T ss_pred eeccCccchHhhhccCcccCCCC Confidence 99999999988765443322211 No 24 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=100.00 E-value=2.5e-96 Score=544.69 Aligned_cols=406 Identities=26% Similarity=0.431 Sum_probs=339.9 Q ss_pred ccHHHHHHhhccCcccccccccccc----cc-cccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccC Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQT----GP-VSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQND 88 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~----~~-~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 88 (424) +|||++++++..++.........+. .. .......+++.|+.+.|+++++||+||++||++||++||++|++..+ T Consensus 1 Mg~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~- 79 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAEGRAWEPYDPSIYNLGATASSGERVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGG- 79 (457) T ss_pred CchhhhhhccccccccccccccccccchhhhhhccccccCCceechHHhhccHHHHHHHHHHHHhHhhCceEEEEecCC- Confidence 9999999876555443222111111 11 11222346788999999999999999999999999999999987643 Q ss_pred ccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCc-----eE Q lcl|NC_019710. 89 NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-----VV 163 (424) Q Consensus 89 ~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~-----~~ 163 (424) ..+ .. .++....|+.+||++||+++||+.++.+++++||||+++.++ .|.+.+||||+|.+|++..+... 
.+ T Consensus 80 ~~~-~~-~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~ 156 (457) T protein:vir:62 80 TRK-EI-DTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA-GPNIAGLDVLDPTKIHVHMVMVDGLRRKVF 156 (457) T ss_pred ccc-cc-cchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEeccCCccceeE Confidence 322 23 345555555689999999999999999999999999998665 69999999999999998765322 12 Q ss_pred EEEE--ecCc---eEEecHhHeeEecCcCCCC-ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH Q lcl|NC_019710. 164 YRYQ--RDSE---YADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE 237 (424) Q Consensus 164 ~~~~--~~~~---~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~ 237 (424) +.|. ..+. ...|+++||||||+++.++ ++|+||+.++..++.+..++++++.++|+||++|++||+++.. .++ T Consensus 157 ~~y~~~~~g~~~~~~~~~~~eiih~r~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-ls~ 235 (457) T protein:vir:62 157 EAYDIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPGT-MSE 235 (457) T ss_pred EEEEEccCCceeEEEeeCccceEEecCCCCCCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCCC-CCH Confidence 2232 2222 2468999999999998876 8999999999999999999999999999999999999999865 567 Q ss_pred HHHHHHHHHHHHHhCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH Q lcl|NC_019710. 
238 QQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 238 ~~~~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e 316 (424) ++.+++++.|++.+++ .|+|++++|++|++|++++++++|+||+|++++++++||++|||||.+||..++++++++|.| T Consensus 236 e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e 315 (457) T protein:vir:62 236 EGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLA 315 (457) T ss_pred HHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHH Confidence 7788899999886554 689999999999999999999999999999999999999999999999999999988888999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Q lcl|NC_019710. 317 QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLP 396 (424) Q Consensus 317 ~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ 396 (424) ++.++|+.+||.|++++||++|+++|+++.++..++++||++.+++.|.++|++.+.+++++|+||+||+|+++|+||+| T Consensus 316 q~~~~f~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ 395 (457) T protein:vir:62 316 EQNIAFTMFSLRPWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLP 395 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 99999999999999999999999999999887778899999999999999999999999999999999999999999999 Q ss_pred Cc--CeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 
397 GG--DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 397 gg--d~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) || |++++|+|+++++...+.+......+ T Consensus 396 ~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~ 425 (457) T protein:vir:62 396 DGLGEKYRVPLNLGEIGEEPEPEPAPAPPA 425 (457) T ss_pred CCCcceeeeccccccccccccccccCCCcc Confidence 87 99999999998876544321111111 No 25 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=100.00 E-value=2.6e-96 Score=544.60 Aligned_cols=397 Identities=25% Similarity=0.420 Sum_probs=338.4 Q ss_pred HHHHHHhhccCcccccccccc-cccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCcccccc Q lcl|NC_019710. 16 WWARLKSWFVGGRLVTPNQGS-QTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVD 94 (424) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~ 94 (424) |+ ++..|..++........ ...+++ ...++..++.+.++++++|++||++||++||++||++|++. +|. +.. T Consensus 1 m~--f~~~~~~~~~~~~~~~~~~~~~~g--~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~-~~~--~~~ 73 (409) T protein:vir:10 1 ML--FRKGFKNQSQEISIDDKKILEWLG--INPSETYVNGKSCLKQATVFGCIRILSDNISKLPIKIYQKK-DGI--KRV 73 (409) T ss_pred Cc--ccccccCcCCCCCCChHHHHHHhc--CCcCcceechhhhhccHHHHHHHHHHHHhhhhCceEEEEec-CCe--eec Confidence 22 11122222222111111 111111 13456788999999999999999999999999999999864 332 234 Q ss_pred ccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCc-------eEEEEE Q lcl|NC_019710. 95 LSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-------VVYRYQ 167 (424) Q Consensus 95 ~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~-------~~~~~~ 167 (424) .+|+++++|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|||++|.+|++..+..+ ..|.+. 
T Consensus 74 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~~~~~y~~~ 153 (409) T protein:vir:10 74 PDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSDGMKIFVDDTGLLNSENNVWYLYT 153 (409) T ss_pred cCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCCceEEEEcCCccccccceEEEEEE Confidence 5799999999999999999999999999999999999999999999999999999999999887543 234443 Q ss_pred -ecCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHH Q lcl|NC_019710. 168 -RDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) Q Consensus 168 -~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~ 246 (424) ..+....|+++||||+|+++.++++|+||+.++..+++...++++++.++|+||++|++||+.+.. .++++.+++++. T Consensus 154 ~~~g~~~~~~~~evih~r~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-l~~e~~~~~~~~ 232 (409) T protein:vir:10 154 DDLGQRHKFMSDEILHFKGLTADGLAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYAGD-LNPEAEEVFKEN 232 (409) T ss_pred eCCceeEEeccccEEEecCcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCC-CCHHHHHHHHHH Confidence 345678899999999999999999999999999999999999999999999999999999999865 567778889999 Q ss_pred HHHHhCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHH Q lcl|NC_019710. 
247 FKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 247 ~~~~~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~ 325 (424) |++..++ .|+|+++++++|++|++++.+++|+||+|.++++.++||++|||||.+||..++++ ++|+|++.++|+++ T Consensus 233 ~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~--~~~~e~~~~~f~~~ 310 (409) T protein:vir:10 233 FERMSSGLKNAHRIAMLPIGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRAT--HSNITEQNREFYID 310 (409) T ss_pred HHHHhccccccCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCc--cccHHHHHHHHHHH Confidence 9886655 68999999999999999999999999999999999999999999999999887664 56999999999999 Q ss_pred HHHHHHHHHHHHHhhhccChhhh-ccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeec Q lcl|NC_019710. 326 TLQPYISRWENSIQRWLIPAKDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQ 404 (424) Q Consensus 326 tl~P~~~~ie~~l~~~L~~~~~~-~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~ 404 (424) ||.|++++||++|+++|+++.++ .+++++||++.+++.|.+++++.+.+++++|+||+||+|+++|+||+||||++++| T Consensus 311 ~l~P~~~~ie~~ln~kL~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~ggD~~~~~ 390 (409) T protein:vir:10 311 TLQSILNMYELEINYKLFLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPLEGGDVLLIN 390 (409) T ss_pred HHHHHHHHHHHHHHHhhcCchhccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeec Confidence 99999999999999999988775 46889999999999999999999999999999999999999999999999999999 Q ss_pred ccccchhhccccCCCccCCC Q lcl|NC_019710. 405 SQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 405 ~n~~~~~~~~~~~~~~~~g~ 424 (424) +|++|++.++++. 
..+|- T Consensus 391 ~n~~~~~~~~~~~--~kgGe 408 (409) T protein:vir:10 391 GNMIPVKMAGEQY--SKGGE 408 (409) T ss_pred cCccchhhccccc--cccCC Confidence 9999998876532 12222 No 26 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=100.00 E-value=4e-96 Score=543.57 Aligned_cols=406 Identities=26% Similarity=0.443 Sum_probs=344.5 Q ss_pred ccHHHHHHhhccCccccccccccc----ccccc-cccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccC Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQ----TGPVS-AHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQND 88 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~----~~~~~-~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 88 (424) +|||+++++++.+..........+ ...+. .....+++.|+.+.++++++||+||++||++||+|||++|++..++ T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~ 80 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPSIYNLGAVAASGETVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGS 80 (457) T ss_pred CchhhhhhcccccccccccccccccccchHHHhhcccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 999999998877654333322111 11111 1223467889999999999999999999999999999999976544 Q ss_pred ccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCc-----eE Q lcl|NC_019710. 89 NRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-----VV 163 (424) Q Consensus 89 ~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~-----~~ 163 (424) . +....|++..+|+..|| +||+++||+.++.+++++||||+++.++ .|.+++||||+|.+|++..+... 
.+ T Consensus 81 ~--~~~~~~~l~~~ln~~~n-~~t~~~f~~~~~~~lll~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~ 156 (457) T protein:vir:13 81 R--KEIVTPEWLDYPNAEPG-GMGRIDILSQTVLSLLLQGNAFLAVRWQ-GPNIVGLDVLDPTKIHVHMVMVDGLRRKVF 156 (457) T ss_pred c--cccccchHHHhccccCC-CCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEccCceEEEEecCCCccceeE Confidence 3 33457889999886666 7999999999999999999999999776 59999999999999998765322 22 Q ss_pred --EEEEecCc---eEEecHhHeeEecCcCCCC-ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH Q lcl|NC_019710. 164 --YRYQRDSE---YADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE 237 (424) Q Consensus 164 --~~~~~~~~---~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~ 237 (424) |.+..++. ...|+++||||+++++.++ ++|+||+.++..+|.+..++++++.++|+||++|++||+++.. .++ T Consensus 157 ~~y~~~~~~~~~~~~~~~~~diih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-ls~ 235 (457) T protein:vir:13 157 EAYDIDADGNEVLLGWFTPRDVLHIPGMMLPGDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPGT-MSE 235 (457) T ss_pred EEEEEecCCceeeEEeeCccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCC-CCH Confidence 22332222 2468999999999998876 8999999999999999999999999999999999999999865 567 Q ss_pred HHHHHHHHHHHHHhCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH Q lcl|NC_019710. 
238 QQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 238 ~~~~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e 316 (424) ++.+++++.|++.+++ +|+|++++|++|++|++++++++|+||+|++++++++||++|||||.+||..+++++.++|+| T Consensus 236 e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e 315 (457) T protein:vir:13 236 EGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLA 315 (457) T ss_pred HHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHH Confidence 7888899988876654 789999999999999999999999999999999999999999999999999999888888999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Q lcl|NC_019710. 317 QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLP 396 (424) Q Consensus 317 ~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ 396 (424) ++.++|+++||.|++++||++|+++|+++.++..++++||++.+++.|.+++++.+.+++++|+||+||+|+++|+||+| T Consensus 316 q~~~~f~~~tl~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ 395 (457) T protein:vir:13 316 EQNIAFTMFSLRPWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLP 395 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCccccCceeEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 99999999999999999999999999998887778899999999999999999999999999999999999999999999 Q ss_pred Cc--CeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 
397 GG--DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 397 gg--d~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) || |++++|+|+++++...+.+......+ T Consensus 396 ~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~ 425 (457) T protein:vir:13 396 DGLGEKYRVPLNLGEVGEEPEPEPAPAPPA 425 (457) T ss_pred CCcccceeeccccccccccccccccCCCCC Confidence 87 99999999998876544322111111 No 27 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=100.00 E-value=2.2e-94 Score=534.03 Aligned_cols=397 Identities=26% Similarity=0.458 Sum_probs=336.4 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +|+|+|+++........... ............++..++.+.|+++++|++||++||++||++||++|++++++. T Consensus 1 Mgl~~~~f~~~~~~~~~~~~--~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~---- 74 (409) T protein:vir:84 1 MSLFTRIFSGPSEERTLTKI--SGIPSPAEDWAMHGDRPGANSAMTLGAFYACVTLLADTVASLSIDAYRKKDNVR---- 74 (409) T ss_pred CchhhhhhcCCCcccccccc--cccccccchhhccCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcc---- Confidence 99999986654322221111 111111122234677889999999999999999999999999999999765442 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCCCCceeEEEEeccceEEEEEc--CCceEEEEEecC Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNSAGDVISLLPLQSANMDVKLV--GKKVVYRYQRDS 170 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~-r~~~G~~~~l~~l~p~~v~~~~~--~~~~~~~~~~~~ 170 (424) ...|+++++|+.+||++||+++||+.++.+++++||+|+++. ++..|++.+||||+|.+|++... ....++.+.... 
T Consensus 75 ~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~~~~~~~~~~~~~~~~~ 154 (409) T protein:vir:84 75 IPVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDCIHVTDAKDEDGDWIEPVYRI 154 (409) T ss_pred cccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCceeEEEEcCCCcceEEEEEecC Confidence 346899999999999999999999999999999999999885 78889999999999999987653 334444443444 Q ss_pred ceEEecHhHeeEecCcCCCC-ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHH Q lcl|NC_019710. 171 EYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKE 249 (424) Q Consensus 171 ~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~ 249 (424) .+..|+++||||+++++.++ ++|+||+..+..++....++++++.++|+||++|+|+|+.+.. .++++.+.+++.|.+ T Consensus 155 ~g~~~~~~dvih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-l~~e~~~~~~~~~~~ 233 (409) T protein:vir:84 155 DGKVVPNHRIMHIKRYPVAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSDAD-LTPDQVKQTQKQWIQ 233 (409) T ss_pred CceEEchhhEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCC-CCHHHHHHHHHHHHH Confidence 45679999999999888776 6899999999999999999999999999999999999999865 456667777777766 Q ss_pred HhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHH Q lcl|NC_019710. 250 IAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQP 329 (424) Q Consensus 250 ~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P 329 (424) .. 
.|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||..++++++++|+|++.+.|+++||.| T Consensus 234 ~~--~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P 311 (409) T protein:vir:84 234 SH--HNRRLPAVMSAGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRHTLLP 311 (409) T ss_pred Hh--ccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHH Confidence 44 4678999999999999999999999999999999999999999999999999888888889999999999999999 Q ss_pred HHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccc Q lcl|NC_019710. 330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 330 ~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n~~~ 409 (424) ++++||++|+++|.. +++++||++.+++.|.+++++.+.+++++|+||+||+|+++|+||+||||++++|+|++| T Consensus 312 ~~~~ie~~l~~~L~~-----g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~ggD~~~~~~n~~~ 386 (409) T protein:vir:84 312 WLRCIEQALDTFLPR-----GQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPIPEGDIHLQPMNFVP 386 (409) T ss_pred HHHHHHHHHHHhccC-----CCeEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecccccc Confidence 999999999998832 468999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccC---CCccCCC Q lcl|NC_019710. 
410 ITDLGTNK---EPRNNGA 424 (424) Q Consensus 410 ~~~~~~~~---~~~~~g~ 424 (424) ++....++ +++.+++ T Consensus 387 ~~~~~~~~~~~~~~~~~~ 404 (409) T protein:vir:84 387 LGYVPPEEPAQEPQPNSA 404 (409) T ss_pred cccCCccccCcCCCCCCc Confidence 98765432 2333333 No 28 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=100.00 E-value=2.8e-94 Score=533.45 Aligned_cols=410 Identities=19% Similarity=0.274 Sum_probs=327.6 Q ss_pred CC---CCCcccccCCC------ccHHHHHHhhccCcccccccccc--cccccccccccCCccccHHHHhhhHHHHHHHHH Q lcl|NC_019710. 1 ME---EPKYTIDLRTN------NGWWARLKSWFVGGRLVTPNQGS--QTGPVSAHGYLGDSSINDERILQISTVWRCVSL 69 (424) Q Consensus 1 ~~---~~~~~~~~~~~------~G~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ 69 (424) |. -.-|-++.+.+ +++++-++. ..++....+.... ............+..++.+.|+++++|++||++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~-~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~acv~~ 79 (441) T protein:vir:98 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYK-NEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMM 79 (441) T ss_pred CceecCccceeccccccchhhhhhccccccc-cccccccCCCcchHHHHHHhhcccccCccccchhhhhccHHHHHHHHH Confidence 21 11222333333 122221111 1111111111111 111111111223556889999999999999999 Q ss_pred HHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEec Q lcl|NC_019710. 
70 ISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQ 149 (424) Q Consensus 70 ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~ 149 (424) ||++||++|+++|++++ ....|+++++|+.+||++||+++||+.++.+++++||||++++|+.+|+|++|||++ T Consensus 80 Ia~~iA~lpl~~~~~~~------~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~ 153 (441) T protein:vir:98 80 IASDLARMPIRVTVNGQ------INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRK 153 (441) T ss_pred HHHhhccCceEEecCCc------ccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEc Confidence 99999999999996432 345789999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEEcCCceE-EEEEe-----cCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCC Q lcl|NC_019710. 150 SANMDVKLVGKKVV-YRYQR-----DSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAK 223 (424) Q Consensus 150 p~~v~~~~~~~~~~-~~~~~-----~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~ 223 (424) |++|++..++++.. |++.. .+....|+++||||||+++.++++|+||+.++.+++++..++++++.++|+||++ T Consensus 154 ~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~dviHir~~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~ 233 (441) T protein:vir:98 154 TSEIELKLDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTH 233 (441) T ss_pred CceeEEEECCCCcEEEEEEEeccCcceeeEEEccccEEEeccCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 99999988765533 33321 2234689999999999999999999999999999999999999999999999999 Q ss_pred CceeEEcCCCCCCHHHHHHHHHHHHHHhC-CcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc Q lcl|NC_019710. 
224 SPQILSTGEKVLTEQQRSQVEENFKEIAG-GPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLV 302 (424) Q Consensus 224 p~~vl~~~~~~~~~~~~~~~~~~~~~~~~-~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l 302 (424) |+|+|+++....++++.+.+++.|++.++ .+|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+| T Consensus 234 ~~gil~~~~~~~~~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~l 313 (441) T protein:vir:98 234 AGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKF 313 (441) T ss_pred CcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHc Confidence 99999999888788888888888877655 47899999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcC Q lcl|NC_019710. 303 GDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRT 382 (424) Q Consensus 303 ~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t 382 (424) |.... + .+.+++...|. +||.|++++||++|+++|+++.+ +++++||++.+++.|.+++++.+++++++|+|| T Consensus 314 g~~~~-~---~s~~q~~~~y~-~tl~P~~~~ie~~ln~~L~~~~~--~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T 386 (441) T protein:vir:98 314 GIETA-N---MSITDANLDYL-STLKPYITCVCAELNFKFNDEYV--NREFKFDTTEIRVVDEKTQAEIDKINIDSGKMN 386 (441) T ss_pred CCCCC-C---ccHHHHHHHHH-HHHHHHHHHHHHHHHhhcccccc--CceEEEechhhhccCHHHHHHHHHHHHhCCCcC Confidence 86432 2 24566655554 69999999999999999987654 468999999999999999999999999999999 Q ss_pred HHHHHHHhCCCCCCCcC--eeeecccccchhhccccCCCcc-------CCC Q lcl|NC_019710. 383 INEMRRTDNLPPLPGGD--VAMRQSQYVPITDLGTNKEPRN-------NGA 424 (424) Q Consensus 383 ~NE~R~~lg~~p~~ggd--~~~~~~n~~~~~~~~~~~~~~~-------~g~ 424 (424) +||+|+++|+||+|||| .+++++|++|++.+.+.+..+. 
.|+ T Consensus 387 ~NE~R~~~gl~pi~gGd~~~~~~~~n~~~~~~~~~~q~~~~~~~~~~~kgG 437 (441) T protein:vir:98 387 IDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGG 437 (441) T ss_pred HHHHHHHhCCCCCCCCCcceEeecccccccccccccccccccccccccCCC Confidence 99999999999999998 5789999999987754332211 111 No 29 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=100.00 E-value=6.2e-94 Score=531.57 Aligned_cols=401 Identities=20% Similarity=0.293 Sum_probs=338.3 Q ss_pred cccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeecccc Q lcl|NC_019710. 8 IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQN 87 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~ 87 (424) |++=-.+|++.|++..+.......+... ... ++......+..++.+.|+++|+|++||++||++||++||++|++++. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~lp~~~~~~~~~ 78 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLIDNWIDQSTSK-LYD-FSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKV 78 (412) T ss_pred CccchhhhhhhhhhhhHhhhhhcccccc-ccc-ccccCCccccccchhhhhccHHHHHHHHHHHHhHhhCceeEeecccc Confidence 8888888999998876654443322211 111 11112234556788999999999999999999999999999987643 Q ss_pred CccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCc--eEEE Q lcl|NC_019710. 88 DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK--VVYR 165 (424) Q Consensus 88 ~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~--~~~~ 165 (424) .+|++.++|+.+||++||+++||+.++.+++++||||++++|+.+|.+.+|+||+|.+|++..+.+. .+|. 
T Consensus 79 -------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~ 151 (412) T protein:vir:26 79 -------VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYS 151 (412) T ss_pred -------ccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEE Confidence 3588999999999999999999999999999999999999999999999999999999999877644 3444 Q ss_pred EEe-cCceEEecHhHeeEecCc-CCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHH Q lcl|NC_019710. 166 YQR-DSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) Q Consensus 166 ~~~-~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~ 243 (424) +.. ++....|+++||||||++ +.++++|+||+.++..+++++.++++++ ++.++..++++++.+.. .++++.+.+ T Consensus 152 ~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~a~~~~~--~~~~~~~~~~i~~~~~~-l~~e~~~~~ 228 (412) T protein:vir:26 152 IHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSN-VGKEKRQQV 228 (412) T ss_pred EEcCCceEEEEccccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCceEEecCCC-CCHHHHHHH Confidence 433 345678999999999986 5688999999999999999999998885 55555566677777655 567778888 Q ss_pred HHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHH Q lcl|NC_019710. 
244 EENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 244 ~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~ 323 (424) ++.|++..+ ++|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||+.+++ +++|+|++.+.|+ T Consensus 229 ~~~~~~~~~--~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~--~~sn~e~~~~~f~ 304 (412) T protein:vir:26 229 LEDFKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNT--NFAKNEELNRFYL 304 (412) T ss_pred HHHHHHHhh--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC--CcccHHHHHHHHH Confidence 888887664 577899999999999999999999999999999999999999999999986554 5669999999999 Q ss_pred HHHHHHHHHHHHHHHhhhccChhhhc-cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeee Q lcl|NC_019710. 324 QYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAM 402 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~L~~~~~~~-~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~ 402 (424) ++||.|++++||++|+++|+++.++. +++++||++.+++.|.+++++.+++++++|++|+||+|+++|+||+||||+++ T Consensus 305 ~~~l~P~~~~ie~~ln~kLl~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~ 384 (412) T protein:vir:26 305 QHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPL 384 (412) T ss_pred HHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeee Confidence 99999999999999999999998764 57899999999999999999999999999999999999999999999999999 Q ss_pred ecccccchhhccccCCCccCCC Q lcl|NC_019710. 403 RQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 403 ~~~n~~~~~~~~~~~~~~~~g~ 424 (424) +++|++|++...++++...+|. 
T Consensus 385 ~~~n~~~~~~~~~~~~~~~gG~ 406 (412) T protein:vir:26 385 ISGDLYPIDTPLELRKSLKGGD 406 (412) T ss_pred ecccccccccchhhcccccCCC Confidence 9999999987655443222222 No 30 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=100.00 E-value=9.6e-94 Score=530.53 Aligned_cols=397 Identities=17% Similarity=0.258 Sum_probs=328.3 Q ss_pred HhhccCcccccccccc---cccccccccccCC------ccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccc Q lcl|NC_019710. 21 KSWFVGGRLVTPNQGS---QTGPVSAHGYLGD------SSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK 91 (424) Q Consensus 21 ~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~------~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~ 91 (424) .-.-++...++|.... +........|..+ ..+....|+++|+||+||++||++||++||++|+++.++..+ T Consensus 1 ~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~~~~~~~~~~ 80 (518) T protein:vir:78 1 MLLANGQTLSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTETE 80 (518) T ss_pred CcccCceeeccchhhhhhhhhhhcccccceeceecccccchhhHHhhhhHHHHHHHHHHHHhhccCceEEEEEcCCcccc Confidence 1112244455443211 1111111222222 234456689999999999999999999999999988766443 Q ss_pred cccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCC--ceEEEEEec Q lcl|NC_019710. 92 KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK--KVVYRYQRD 169 (424) Q Consensus 92 ~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~--~~~~~~~~~ 169 (424) . .++...+|+.+||++||+++||+.++.+++++||||+++.|+..|.+++||||+|.+|++..+.. ...|.|... 
T Consensus 81 ~---~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv~~~~~~~~~~y~~~~~ 157 (518) T protein:vir:78 81 E---HDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQAG 157 (518) T ss_pred c---cchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEEEEcCCCCEEEEEEEec Confidence 2 34445567779999999999999999999999999999999999999999999999999988754 344444432 Q ss_pred ----CceEEecHhHeeEecCcCCCCc-cccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHH Q lcl|NC_019710. 170 ----SEYADFSQKEIFHLKGFGFTGL-VGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVE 244 (424) Q Consensus 170 ----~~~~~~~~~evih~r~~~~~~~-~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~ 244 (424) +..+.|+++||||||++++++. +|+||+.++..++....++++++.++|+||++|++||+.+.. .++++.++++ T Consensus 158 ~~~~~~~~~~~~~eIiHir~~~~dg~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~-ls~e~~~~~k 236 (518) T protein:vir:78 158 AGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKR-LSPEAQQRLR 236 (518) T ss_pred CCccceeEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCC-CCHHHHHHHH Confidence 2457899999999999998885 799999999999999999999999999999999999999865 5677788899 Q ss_pred HHHHHHhCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHH Q lcl|NC_019710. 
245 ENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 245 ~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~ 323 (424) +.|++.+++ .|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.+.|+ T Consensus 237 ~~~~~~~~G~~nag~~~vL~~G~~~~~l~~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st--~sn~e~~~~~f~ 314 (518) T protein:vir:78 237 EQFDRAHAGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRAT--FSNISAQMRAFY 314 (518) T ss_pred HHHHHHhcCcccCCceeEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCC--chhHHHHHHHHH Confidence 989876555 78999999999999999999999999999999999999999999999999887765 569999999999 Q ss_pred HHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CcCee Q lcl|NC_019710. 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLP--GGDVA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~--ggd~~ 401 (424) ++||.|++.+||++|+++|++..++ .++++||++.+++.|.+++++.+.+++++|+||+||+|+++|+||+| |||++ T Consensus 315 ~~tL~P~~~~ie~eln~~L~~~~~~-~~~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~~~gD~~ 393 (518) T protein:vir:78 315 RDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADEL 393 (518) T ss_pred HHHHHHHHHHHHHHHHHhhcccccC-cceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee Confidence 9999999999999999999987664 46899999999999999999999999999999999999999999996 89999 Q ss_pred eecccccchhhccccCC-Ccc------CCC Q lcl|NC_019710. 402 MRQSQYVPITDLGTNKE-PRN------NGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~-~~~------~g~ 424 (424) ++++|++|++...++.. +.. 
.++ T Consensus 394 ~v~~n~~pl~~~~~~~~~g~~~~~~~~~~~ 423 (518) T protein:vir:78 394 YANSALQPLGATPDGAVEGEEAPAPKRPAS 423 (518) T ss_pred eecccceecccccccccCCCCCCCCCCCCc Confidence 99999999876433210 000 000 No 31 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=100.00 E-value=1.2e-93 Score=530.09 Aligned_cols=396 Identities=17% Similarity=0.265 Sum_probs=328.3 Q ss_pred HhhccCcccccccccccc----cccccccccC------CccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCcc Q lcl|NC_019710. 21 KSWFVGGRLVTPNQGSQT----GPVSAHGYLG------DSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNR 90 (424) Q Consensus 21 ~~~~~~~~~~~~~~~~~~----~~~~~~~~~~------~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 90 (424) .-.-++...++|...... ..++ ..+.. ..++....|+++++|++||++||++||++||++|+++.++.. T Consensus 1 ~~~~~~~~~~~p~~~e~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~~~~~~~~~ 79 (518) T protein:vir:10 1 MLLANGQTLSAPAMAELSPQMQDSYY-YAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFTSGDTET 79 (518) T ss_pred CcccCceeecCchhhhhhhhhhcccc-cccccceecccccchhhHHHhhhHHHHHHHHHHHHhhccCceEEEEEcCCCce Confidence 111224445555321111 1111 11222 223445568899999999999999999999999999877654 Q ss_pred ccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCC--ceEEEEEe Q lcl|NC_019710. 91 KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK--KVVYRYQR 168 (424) Q Consensus 91 ~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~--~~~~~~~~ 168 (424) + ..+|++ .+|+.+||++||+++||+.++.+++++||||++++|+.+|.+++||||+|.+|++..+.. ...|.|.. 
T Consensus 80 ~--~~~~~~-~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v~~~~~~~~~~y~~~~ 156 (518) T protein:vir:10 80 E--ESDTGY-AKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRYEYYFQA 156 (518) T ss_pred e--ccchHH-HHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEEEEcCCCCEEEEEEEe Confidence 3 234555 556679999999999999999999999999999999999999999999999999988754 34455543 Q ss_pred c----CceEEecHhHeeEecCcCCCCc-cccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHH Q lcl|NC_019710. 169 D----SEYADFSQKEIFHLKGFGFTGL-VGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) Q Consensus 169 ~----~~~~~~~~~evih~r~~~~~~~-~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~ 243 (424) + +..++|+++||||||++++++. +|+||+.++..++....++++++.++|+||++|+|||+.+.. .++++.+++ T Consensus 157 ~~~~~~~~~~~~~~eViHir~~s~dg~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~-ls~e~~~~~ 235 (518) T protein:vir:10 157 GAGVGTQLVSFADDEVVPIRFFNPDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKR-LSEAAQQRL 235 (518) T ss_pred cCCccceEEEecCCcEEEecCCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCC-CCHHHHHHH Confidence 2 2457899999999999998885 899999999999999999999999999999999999999866 466778889 Q ss_pred HHHHHHHhCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHH Q lcl|NC_019710. 
244 EENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF 322 (424) Q Consensus 244 ~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f 322 (424) ++.|++.+++ .|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||..++++ ++|+|++.+.| T Consensus 236 k~~~~~~~~G~~nag~v~vL~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t--~sn~eq~~~~f 313 (518) T protein:vir:10 236 REQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRAT--FSNISAQMRAF 313 (518) T ss_pred HHHHHHHhcCccccCcceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCC--chhHHHHHHHH Confidence 9988876655 79999999999999999999999999999999999999999999999999887765 56999999999 Q ss_pred HHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CcCe Q lcl|NC_019710. 323 LQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLP--GGDV 400 (424) Q Consensus 323 ~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~--ggd~ 400 (424) +++||.|++.+||++|+++|++..++ .++++||++.+++.|.+++++.+++++++|+||+||+|+++|+||++ |||+ T Consensus 314 ~~~tL~P~l~~ie~~ln~~L~~~~~~-~~~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~~~gD~ 392 (518) T protein:vir:10 314 YRDTMAIPIARIQSAMDKYVGQYWVR-KNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADE 392 (518) T ss_pred HHHHHHHHHHHHHHHHHHhhcccccC-CceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCe Confidence 99999999999999999999987664 46899999999999999999999999999999999999999999985 8999 Q ss_pred eeecccccchhhccccCC--Ccc-----CCC Q lcl|NC_019710. 401 AMRQSQYVPITDLGTNKE--PRN-----NGA 424 (424) Q Consensus 401 ~~~~~n~~~~~~~~~~~~--~~~-----~g~ 424 (424) ++++.|++|++...++.. ++. 
.++ T Consensus 393 ~~~~~n~~pl~~~~~~~~~g~~~~~~~~~~~ 423 (518) T protein:vir:10 393 LYANSALQPLGATPDGAVEGEEAPAPKRPAS 423 (518) T ss_pred eeecccceecccccccccCCCCCCCCCCCCc Confidence 999999999875433210 000 000 No 32 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=100.00 E-value=1.2e-93 Score=529.96 Aligned_cols=410 Identities=19% Similarity=0.258 Sum_probs=327.7 Q ss_pred CC---CCCcccccCCC------ccHHHHHHhhccCccccccccccc--ccccccccccCCccccHHHHhhhHHHHHHHHH Q lcl|NC_019710. 1 ME---EPKYTIDLRTN------NGWWARLKSWFVGGRLVTPNQGSQ--TGPVSAHGYLGDSSINDERILQISTVWRCVSL 69 (424) Q Consensus 1 ~~---~~~~~~~~~~~------~G~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ 69 (424) |. -.-|-|+.+.+ +++++-++ ...++....+..... ...........+..++.+.|+++++||+||++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lf~-~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~ 79 (441) T protein:vir:79 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFY-KNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMM 79 (441) T ss_pred CccccCccccccccccccchhhhhcccccc-ccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHH Confidence 21 12233444433 12222111 111111111111111 11111111223456888999999999999999 Q ss_pred HHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEec Q lcl|NC_019710. 70 ISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQ 149 (424) Q Consensus 70 ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~ 149 (424) ||++||++||++++++. 
....|+++++|+.+||++||+++||+.++.+++++||||++++|+..|+|++||||+ T Consensus 80 Ia~~iA~lp~~~~~~~~------~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~ 153 (441) T protein:vir:79 80 IASDLARMPIRVTVNGQ------INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRK 153 (441) T ss_pred HHHhhccCceeeecCcc------ccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEc Confidence 99999999999986432 345789999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEEcCCceE-EEEEe-----cCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCC Q lcl|NC_019710. 150 SANMDVKLVGKKVV-YRYQR-----DSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAK 223 (424) Q Consensus 150 p~~v~~~~~~~~~~-~~~~~-----~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~ 223 (424) |++|++..++++.. |.+.. ......++++||||||+++.++++|+||+.++..++++..++++++.++|+||++ T Consensus 154 ~~~v~v~~d~~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k~~~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~ 233 (441) T protein:vir:79 154 TSEIELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTH 233 (441) T ss_pred CceeEEEECCCccEEEEEEEeccCCceeEEEEccccEEEeccCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 99999988765543 33321 2234689999999999999999999999999999999999999999999999999 Q ss_pred CceeEEcCCCCCCHHHHHHHHHHHHHHhCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc Q lcl|NC_019710. 
224 SPQILSTGEKVLTEQQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLV 302 (424) Q Consensus 224 p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l 302 (424) |+|+|+++....++++++++++.|++.+++ .|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+| T Consensus 234 p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l 313 (441) T protein:vir:79 234 AGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKF 313 (441) T ss_pred CcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHc Confidence 999999998888888888898888776554 7899999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcC Q lcl|NC_019710. 303 GDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRT 382 (424) Q Consensus 303 ~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t 382 (424) |.... ++ +.+++.. ++.+||.|++++||++|+++|+++.. +++++||++.+++.|.+++++.+++++++|+|| T Consensus 314 g~~~~-~~---s~~q~~~-~~~~tl~P~~~~ie~eln~kl~~~~~--~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T 386 (441) T protein:vir:79 314 GIETA-NM---SITDANL-DYLSTLKPYITCVCAELNFKFNDEYV--NREFKFDTTEIRVVDEKTQAEIDKINIDSGKMN 386 (441) T ss_pred CCCCC-Cc---cHHHHHH-HHHHHHHHHHHHHHHHHhhhcccccc--CceEEeechhhhccCHHHHHHHHHHHHhCCCcC Confidence 86432 22 4455554 45679999999999999999987643 578999999999999999999999999999999 Q ss_pred HHHHHHHhCCCCCCCcCe--eeecccccchhhccccCCCcc-------CCC Q lcl|NC_019710. 383 INEMRRTDNLPPLPGGDV--AMRQSQYVPITDLGTNKEPRN-------NGA 424 (424) Q Consensus 383 ~NE~R~~lg~~p~~ggd~--~~~~~n~~~~~~~~~~~~~~~-------~g~ 424 (424) +||+|+++|+||+||||. +++++|++|++..+..+..+. 
.|+ T Consensus 387 ~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgG 437 (441) T protein:vir:79 387 IDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGG 437 (441) T ss_pred HHHHHHHhCCCCCCCCCcceEeecccccccccccccccccccccccccCCC Confidence 999999999999999985 789999999988754221111 112 No 33 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=100.00 E-value=1.2e-93 Score=529.96 Aligned_cols=410 Identities=19% Similarity=0.258 Sum_probs=327.7 Q ss_pred CC---CCCcccccCCC------ccHHHHHHhhccCccccccccccc--ccccccccccCCccccHHHHhhhHHHHHHHHH Q lcl|NC_019710. 1 ME---EPKYTIDLRTN------NGWWARLKSWFVGGRLVTPNQGSQ--TGPVSAHGYLGDSSINDERILQISTVWRCVSL 69 (424) Q Consensus 1 ~~---~~~~~~~~~~~------~G~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ 69 (424) |. -.-|-|+.+.+ +++++-++ ...++....+..... ...........+..++.+.|+++++||+||++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lf~-~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~~cv~~ 79 (441) T protein:vir:94 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFY-KNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMM 79 (441) T ss_pred CccccCccccccccccccchhhhhcccccc-ccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHHHHHHHHH Confidence 21 12233444433 12222111 111111111111111 11111111223456888999999999999999 Q ss_pred HHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEec Q lcl|NC_019710. 70 ISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQ 149 (424) Q Consensus 70 ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~ 149 (424) ||++||++||++++++. 
....|+++++|+.+||++||+++||+.++.+++++||||++++|+..|+|++||||+ T Consensus 80 Ia~~iA~lp~~~~~~~~------~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~ 153 (441) T protein:vir:94 80 IASDLARMPIRVTVNGQ------INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRK 153 (441) T ss_pred HHHhhccCceeeecCcc------ccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEc Confidence 99999999999986432 345789999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEEcCCceE-EEEEe-----cCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCC Q lcl|NC_019710. 150 SANMDVKLVGKKVV-YRYQR-----DSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAK 223 (424) Q Consensus 150 p~~v~~~~~~~~~~-~~~~~-----~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~ 223 (424) |++|++..++++.. |.+.. ......++++||||||+++.++++|+||+.++..++++..++++++.++|+||++ T Consensus 154 ~~~v~v~~d~~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k~~~~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~ 233 (441) T protein:vir:94 154 TSEIELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTH 233 (441) T ss_pred CceeEEEECCCccEEEEEEEeccCCceeEEEEccccEEEeccCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 99999988765543 33321 2234689999999999999999999999999999999999999999999999999 Q ss_pred CceeEEcCCCCCCHHHHHHHHHHHHHHhCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc Q lcl|NC_019710. 
224 SPQILSTGEKVLTEQQRSQVEENFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLV 302 (424) Q Consensus 224 p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l 302 (424) |+|+|+++....++++++++++.|++.+++ .|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+| T Consensus 234 p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l 313 (441) T protein:vir:94 234 AGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKF 313 (441) T ss_pred CcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHc Confidence 999999998888888888898888776554 7899999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcC Q lcl|NC_019710. 303 GDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRT 382 (424) Q Consensus 303 ~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t 382 (424) |.... ++ +.+++.. ++.+||.|++++||++|+++|+++.. +++++||++.+++.|.+++++.+++++++|+|| T Consensus 314 g~~~~-~~---s~~q~~~-~~~~tl~P~~~~ie~eln~kl~~~~~--~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T 386 (441) T protein:vir:94 314 GIETA-NM---SITDANL-DYLSTLKPYITCVCAELNFKFNDEYV--NREFKFDTTEIRVVDEKTQAEIDKINIDSGKMN 386 (441) T ss_pred CCCCC-Cc---cHHHHHH-HHHHHHHHHHHHHHHHHhhhcccccc--CceEEeechhhhccCHHHHHHHHHHHHhCCCcC Confidence 86432 22 4455554 45679999999999999999987643 578999999999999999999999999999999 Q ss_pred HHHHHHHhCCCCCCCcCe--eeecccccchhhccccCCCcc-------CCC Q lcl|NC_019710. 383 INEMRRTDNLPPLPGGDV--AMRQSQYVPITDLGTNKEPRN-------NGA 424 (424) Q Consensus 383 ~NE~R~~lg~~p~~ggd~--~~~~~n~~~~~~~~~~~~~~~-------~g~ 424 (424) +||+|+++|+||+||||. +++++|++|++..+..+..+. 
.|+ T Consensus 387 ~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgG 437 (441) T protein:vir:94 387 IDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGG 437 (441) T ss_pred HHHHHHHhCCCCCCCCCcceEeecccccccccccccccccccccccccCCC Confidence 999999999999999985 789999999988754221111 112 No 34 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=100.00 E-value=1.1e-93 Score=530.27 Aligned_cols=394 Identities=18% Similarity=0.261 Sum_probs=328.3 Q ss_pred ccHHHHHHhhccCccccccccccc--ccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQ--TGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK 91 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~ 91 (424) +|||++... +....+..... ...........++.++.+.|+++++||+||++||+++|++||+++++++ T Consensus 1 Mg~f~~~~~----r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~----- 71 (416) T protein:vir:45 1 MGIFYKNEK----RDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQ----- 71 (416) T ss_pred CCccccccc----ccccCCCcchhHHHHHhccccccCccccchhhhhcchHHHHHHHHHHHhhccCceEEecCcc----- Confidence 888876432 11111111111 1111111223466788999999999999999999999999999986442 Q ss_pred cccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCceE-EEEE-e- Q lcl|NC_019710. 92 KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVV-YRYQ-R- 168 (424) Q Consensus 92 ~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~-~~~~-~- 168 (424) ....|+++++|+.+||++||+++||+.++.+++++||||++++|+.+|++++||||+|++|++..++.+.. |++. . 
T Consensus 72 -~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~ 150 (416) T protein:vir:45 72 -INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRID 150 (416) T ss_pred -ccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEec Confidence 34578999999999999999999999999999999999999999999999999999999999988766543 3332 1 Q ss_pred ---cCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHH Q lcl|NC_019710. 169 ---DSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE 245 (424) Q Consensus 169 ---~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~ 245 (424) ......|+++||||||+++.++++|+||+.++.+++++..++++++.++|+||++|++||+++....++++++++++ T Consensus 151 ~~~~~~~~~~~~~evihir~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~ 230 (416) T protein:vir:45 151 SNGNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRARE 230 (416) T ss_pred CCCceeEEEEccccEEEeccCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHH Confidence 22346899999999999999999999999999999999999999999999999999999999988888888889988 Q ss_pred HHHHHhCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHH Q lcl|NC_019710. 246 NFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ 324 (424) Q Consensus 246 ~~~~~~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~ 324 (424) .|++..++ .|+|+++++++|++|++++.+++|+||+|.+++++++||++|||||.+||.... + .+.+++ ..+|. 
T Consensus 231 ~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~---~~~~~~-~~~~~ 305 (416) T protein:vir:45 231 EFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA-N---MSITDA-NLDYL 305 (416) T ss_pred HHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC-C---ccHHHH-HHHHH Confidence 88876655 789999999999999999999999999999999999999999999999986432 2 134555 44556 Q ss_pred HHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC--eee Q lcl|NC_019710. 325 YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGD--VAM 402 (424) Q Consensus 325 ~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd--~~~ 402 (424) +||.|++++||++|+++|+++.+ +++++||++.+++.|.+++++.+++++++|+||+||+|+++|+||+|||| +++ T Consensus 306 ~~l~P~~~~ie~~ln~~l~~~~~--~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~ 383 (416) T protein:vir:45 306 STLKPYITCVCAELNFKFNDEYV--NREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHR 383 (416) T ss_pred HHHHHHHHHHHHHHhhhcccccc--CceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEe Confidence 79999999999999999987653 46899999999999999999999999999999999999999999999987 688 Q ss_pred ecccccchhhccccCCCccC-------CC Q lcl|NC_019710. 
403 RQSQYVPITDLGTNKEPRNN-------GA 424 (424) Q Consensus 403 ~~~n~~~~~~~~~~~~~~~~-------g~ 424 (424) +++|++|++...+.+..+.+ |+ T Consensus 384 ~~~n~~~~~~~~~~~~~~~~~~~~~~kgG 412 (416) T protein:vir:45 384 VDLNHVNIELVDEYQMNKSRATDKKLKGG 412 (416) T ss_pred ecccccccccccccCcccccccccccCCC Confidence 99999999876543322222 22 No 35 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=100.00 E-value=1.1e-93 Score=530.27 Aligned_cols=394 Identities=18% Similarity=0.261 Sum_probs=328.3 Q ss_pred ccHHHHHHhhccCccccccccccc--ccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQ--TGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK 91 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~ 91 (424) +|||++... +....+..... ...........++.++.+.|+++++||+||++||+++|++||+++++++ T Consensus 1 Mg~f~~~~~----r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~----- 71 (416) T protein:vir:81 1 MGIFYKNEK----RDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQ----- 71 (416) T ss_pred CCccccccc----ccccCCCcchhHHHHHhccccccCccccchhhhhcchHHHHHHHHHHHhhccCceEEecCcc----- Confidence 888876432 11111111111 1111111223466788999999999999999999999999999986442 Q ss_pred cccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCceE-EEEE-e- Q lcl|NC_019710. 92 KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVV-YRYQ-R- 168 (424) Q Consensus 92 ~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~-~~~~-~- 168 (424) ....|+++++|+.+||++||+++||+.++.+++++||||++++|+.+|++++||||+|++|++..++.+.. |++. . 
T Consensus 72 -~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~ 150 (416) T protein:vir:81 72 -INYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRID 150 (416) T ss_pred -ccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCccEEEEEEEec Confidence 34578999999999999999999999999999999999999999999999999999999999988766543 3332 1 Q ss_pred ---cCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHH Q lcl|NC_019710. 169 ---DSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE 245 (424) Q Consensus 169 ---~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~ 245 (424) ......|+++||||||+++.++++|+||+.++.+++++..++++++.++|+||++|++||+++....++++++++++ T Consensus 151 ~~~~~~~~~~~~~evihir~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~ 230 (416) T protein:vir:81 151 SNGNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRARE 230 (416) T ss_pred CCCceeEEEEccccEEEeccCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHH Confidence 22346899999999999999999999999999999999999999999999999999999999988888888889988 Q ss_pred HHHHHhCC-cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHH Q lcl|NC_019710. 246 NFKEIAGG-PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ 324 (424) Q Consensus 246 ~~~~~~~~-~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~ 324 (424) .|++..++ .|+|+++++++|++|++++.+++|+||+|.+++++++||++|||||.+||.... + .+.+++ ..+|. 
T Consensus 231 ~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~---~~~~~~-~~~~~ 305 (416) T protein:vir:81 231 EFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA-N---MSITDA-NLDYL 305 (416) T ss_pred HHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC-C---ccHHHH-HHHHH Confidence 88876655 789999999999999999999999999999999999999999999999986432 2 134555 44556 Q ss_pred HHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC--eee Q lcl|NC_019710. 325 YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGD--VAM 402 (424) Q Consensus 325 ~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd--~~~ 402 (424) +||.|++++||++|+++|+++.+ +++++||++.+++.|.+++++.+++++++|+||+||+|+++|+||+|||| +++ T Consensus 306 ~~l~P~~~~ie~~ln~~l~~~~~--~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~ 383 (416) T protein:vir:81 306 STLKPYITCVCAELNFKFNDEYV--NREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHR 383 (416) T ss_pred HHHHHHHHHHHHHHhhhcccccc--CceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEe Confidence 79999999999999999987653 46899999999999999999999999999999999999999999999987 688 Q ss_pred ecccccchhhccccCCCccC-------CC Q lcl|NC_019710. 
403 RQSQYVPITDLGTNKEPRNN-------GA 424 (424) Q Consensus 403 ~~~n~~~~~~~~~~~~~~~~-------g~ 424 (424) +++|++|++...+.+..+.+ |+ T Consensus 384 ~~~n~~~~~~~~~~~~~~~~~~~~~~kgG 412 (416) T protein:vir:81 384 VDLNHVNIELVDEYQMNKSRATDKKLKGG 412 (416) T ss_pred ecccccccccccccCcccccccccccCCC Confidence 99999999876543322222 22 No 36 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=100.00 E-value=6.3e-93 Score=526.07 Aligned_cols=398 Identities=20% Similarity=0.285 Sum_probs=331.3 Q ss_pred cCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCc Q lcl|NC_019710. 10 LRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDN 89 (424) Q Consensus 10 ~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 89 (424) |+. -+++.|+++.+.......+.... ..+..+. ..+...++.+.|+++++|++||++||++||++||+++++++. T Consensus 1 ~~~-~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~v~~~~~~~~~~V~~ci~~Ia~~ia~lp~~~~~~~~~-- 75 (409) T protein:vir:93 1 MAK-ENIVTRIKKKLIDNWIDQSTSKL-YDFSPWK-NRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKV-- 75 (409) T ss_pred CCc-cchhhhhhhhhhhhhhccccccc-ccccccc-CccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeecccc-- Confidence 332 36777777655433322221111 1111111 123445788899999999999999999999999999987643 Q ss_pred cccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCc--eEEEEE Q lcl|NC_019710. 90 RKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK--VVYRYQ 167 (424) Q Consensus 90 ~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~--~~~~~~ 167 (424) .+|++.++|+.+||++||+++||+.++.+++++||||+++.|+.+|.+.+||||+|++|++..+.+. .+|.+. 
T Consensus 76 -----~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~~~~~~~~~~~~y~~~ 150 (409) T protein:vir:93 76 -----VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYSIH 150 (409) T ss_pred -----ccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEEEE Confidence 3588999999999999999999999999999999999999999999999999999999998876543 344443 Q ss_pred -ecCceEEecHhHeeEecCc-CCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHH Q lcl|NC_019710. 168 -RDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE 245 (424) Q Consensus 168 -~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~ 245 (424) .++..+.|+++||||+|++ +.++++|+||+.++..+++++.+++++. ++.++..++++++.+.. .++++.+.+++ T Consensus 151 ~~~g~~~~~~~~eVih~r~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~~~-l~~e~~~~~~~ 227 (409) T protein:vir:93 151 AATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSN-VGKEKRQQVLE 227 (409) T ss_pred cCCceEEEEccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCceEEecCCC-CCHHHHHHHHH Confidence 3455678999999999976 5688999999999999999999998885 55555566677776654 56777888889 Q ss_pred HHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHH Q lcl|NC_019710. 
246 NFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 246 ~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~ 325 (424) .|++..+ ++|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||+.+++ +++|+|++.+.|+++ T Consensus 228 ~~~~~~~--~~g~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~--~~sn~e~~~~~f~~~ 303 (409) T protein:vir:93 228 DFKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNT--NFAKNEELNRFYLQH 303 (409) T ss_pred HHHHHhh--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC--CcccHHHHHHHHHHH Confidence 9887664 578899999999999999999999999999999999999999999999986555 556999999999999 Q ss_pred HHHHHHHHHHHHHhhhccChhhhc-cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeec Q lcl|NC_019710. 326 TLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQ 404 (424) Q Consensus 326 tl~P~~~~ie~~l~~~L~~~~~~~-~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~ 404 (424) ||.|++++||++|+++|+++.++. +++++||++.+++.|.+++++.+++++++|++|+||+|+++|+||+||||+++++ T Consensus 304 ~l~P~~~~ie~~l~~~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~ 383 (409) T protein:vir:93 304 TLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLIS 383 (409) T ss_pred HHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeec Confidence 999999999999999999998864 5889999999999999999999999999999999999999999999999999999 Q ss_pred ccccchhhccccCCCccCCC Q lcl|NC_019710. 405 SQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 405 ~n~~~~~~~~~~~~~~~~g~ 424 (424) +|++|++...++++...+|. 
T Consensus 384 ~n~~~~~~~~~~~~~~~gG~ 403 (409) T protein:vir:93 384 GDLYPIDTPLELRKSLKGGD 403 (409) T ss_pred ccccccccchhhcccccCCC Confidence 99999987665543333322 No 37 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=100.00 E-value=6.9e-93 Score=525.83 Aligned_cols=396 Identities=20% Similarity=0.293 Sum_probs=328.9 Q ss_pred cCCCccHHHHHHhhccCcccccccccccccccccccc--cCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeecccc Q lcl|NC_019710. 10 LRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGY--LGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQN 87 (424) Q Consensus 10 ~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~ 87 (424) |.. ..++.|+++.+.......+.... ..+..| .+...++.+.|+++++|++||++||++||+|||++|++++. T Consensus 1 ~~~-~~~~~~~k~~~~~~~~~~~~~~~----~~~~~~~~~~~~~v~~~~a~~~~~V~~ci~~ia~~ia~lp~~~~~~~~~ 75 (409) T protein:vir:96 1 MAK-ENIVTRIKKKLIDNWIDQSASKL----YDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYKV 75 (409) T ss_pred Ccc-ccchhhhhhHHhhhhhccccccc----cccccccCccccccchhhHhhhHHHHHHHHHHHHhhhhCceEEeecccc Confidence 222 35666766665444333322111 111222 23345778899999999999999999999999999987642 Q ss_pred CccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCc--eEEE Q lcl|NC_019710. 88 DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK--VVYR 165 (424) Q Consensus 88 ~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~--~~~~ 165 (424) .+|++.++|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|||++|.+|++..+.+. .+|. 
T Consensus 76 -------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~y~ 148 (409) T protein:vir:96 76 -------VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELYYS 148 (409) T ss_pred -------cchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEEEE Confidence 3589999999999999999999999999999999999999999999999999999999999876543 3444 Q ss_pred EE-ecCceEEecHhHeeEecCc-CCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHH Q lcl|NC_019710. 166 YQ-RDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) Q Consensus 166 ~~-~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~ 243 (424) +. .++....|+++||||||++ +.++++|+||+..+..++++..+++++. ++.++..++++++.+. ..++++.+++ T Consensus 149 ~~~~~g~~~~~~~~evih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~~-~l~~e~~~~~ 225 (409) T protein:vir:96 149 IHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGS-NVSTEKRQQV 225 (409) T ss_pred EEcCCceEEEEccccEEEeCCCCCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCceeEEecCC-CCCHHHHHHH Confidence 43 3456678999999999975 5788999999999999999999998874 4444444556666654 4667778888 Q ss_pred HHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHH Q lcl|NC_019710. 
244 EENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 244 ~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~ 323 (424) ++.|++..+ ++|+++++++|++|++++++++|+||+|.++++.++||++|||||.+||+.+++ +++|+|++.+.|+ T Consensus 226 ~~~~~~~~~--n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~--~~s~~e~~~~~f~ 301 (409) T protein:vir:96 226 LEDFKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNT--NFAKNEELNRFYL 301 (409) T ss_pred HHHHHHHhh--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC--CcccHHHHHHHHH Confidence 888887764 578999999999999999999999999999999999999999999999987655 4569999999999 Q ss_pred HHHHHHHHHHHHHHHhhhccChhhhc-cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeee Q lcl|NC_019710. 324 QYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAM 402 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~L~~~~~~~-~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~ 402 (424) ++||.|++++||++|+++|+++.++. +++++||++.+++.|.+++++++++++++|+||+||+|+++|+||+||||+++ T Consensus 302 ~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~ggD~~~ 381 (409) T protein:vir:96 302 QHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPL 381 (409) T ss_pred HHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcceee Confidence 99999999999999999999998864 58899999999999999999999999999999999999999999999999999 Q ss_pred ecccccchhhccccCCCccCCC Q lcl|NC_019710. 403 RQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 403 ~~~n~~~~~~~~~~~~~~~~g~ 424 (424) +++|++|++...++++...+|. 
T Consensus 382 ~~~n~~~~~~~~~~~~~~~gG~ 403 (409) T protein:vir:96 382 ISGDLYPIDTPLELRKSLKGGD 403 (409) T ss_pred ecccccccccchhhcccccCCC Confidence 9999999987654432222222 No 38 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=100.00 E-value=1.1e-92 Score=524.74 Aligned_cols=396 Identities=20% Similarity=0.292 Sum_probs=329.3 Q ss_pred cccCCCccHHHHHHhhccCcccccccccccccccccccc--cCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeecc Q lcl|NC_019710. 8 IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGY--LGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETD 85 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~ 85 (424) |... .++.|+++.+.......+... ...+..| .....++.+.|+++++|++||++||++||++||++|+++ T Consensus 1 ~~~~---~~~~~~k~~~~~~~~~~~~~~----~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~~~~ 73 (409) T protein:vir:94 1 MAKE---NIVTRIKKKLIDNWIDQSASK----LYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDY 73 (409) T ss_pred Cccc---ccchhhhhHHhhhhhcCCccc----ccccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeecc Confidence 3332 345555555543333222111 1112222 234457888999999999999999999999999999876 Q ss_pred ccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCc--eE Q lcl|NC_019710. 86 QNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK--VV 163 (424) Q Consensus 86 ~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~--~~ 163 (424) +. .+|++.++|+.+||++||+++||+.++.+++++||||+++.|+.+|.+++||||+|++|++..+.+. 
.+ T Consensus 74 ~~-------~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~~ 146 (409) T protein:vir:94 74 KV-------VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRELY 146 (409) T ss_pred cc-------cchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcEEE Confidence 43 3588999999999999999999999999999999999999999999999999999999998876543 34 Q ss_pred EEEE-ecCceEEecHhHeeEecCc-CCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHH Q lcl|NC_019710. 164 YRYQ-RDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRS 241 (424) Q Consensus 164 ~~~~-~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~ 241 (424) |.+. .++..+.|+++||||||++ +.++++|+||+.++..++++..++.++. ++.++..++++++.+.. .++++.+ T Consensus 147 y~~~~~~g~~~~~~~~dvih~r~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~--~~~~~~~~~~i~~~~~~-l~~e~~~ 223 (409) T protein:vir:94 147 YSIHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFN--LTEMQKPDSFMLKYGSN-VGKEKRQ 223 (409) T ss_pred EEEEcCCceEEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHH--HHhcCCCCeeEEecCCC-CCHHHHH Confidence 4443 3456678999999999976 5688999999999999999999998885 44555556677776654 5677788 Q ss_pred HHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHH Q lcl|NC_019710. 242 QVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG 321 (424) Q Consensus 242 ~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~ 321 (424) .+++.|++..+ ++|+++++++|++|++++++++|+||+|.++++.++||++|||||.+||+.+++ +++|+|++.+. 
T Consensus 224 ~~~~~~~~~~~--~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~--~~sn~e~~~~~ 299 (409) T protein:vir:94 224 QVLEDFKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNT--NFAKNEELNRF 299 (409) T ss_pred HHHHHHHHHhh--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC--CcccHHHHHHH Confidence 88899988764 578899999999999999999999999999999999999999999999986554 55699999999 Q ss_pred HHHHHHHHHHHHHHHHHhhhccChhhhc-cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe Q lcl|NC_019710. 322 FLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDV 400 (424) Q Consensus 322 f~~~tl~P~~~~ie~~l~~~L~~~~~~~-~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~ 400 (424) |+++||.|++++||++|+++|+++.++. +++++||++.+++.|.+++++.+++++++|+||+||+|+++|+||+||||+ T Consensus 300 f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~ 379 (409) T protein:vir:94 300 YLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDK 379 (409) T ss_pred HHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe Confidence 9999999999999999999999998864 588999999999999999999999999999999999999999999999999 Q ss_pred eeecccccchhhccccCCCccCCC Q lcl|NC_019710. 401 AMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 401 ~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) +++++|++|++.....++...+|. 
T Consensus 380 ~~~~~n~~~~~~~~~~~~~~kGG~ 403 (409) T protein:vir:94 380 PLISGDLYPIDTPLELRKSLKGGD 403 (409) T ss_pred EeecccccccccchhhcccccCCC Confidence 999999999987654432222222 No 39 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=100.00 E-value=2.3e-92 Score=522.93 Aligned_cols=403 Identities=19% Similarity=0.284 Sum_probs=332.6 Q ss_pred ccHHHHHHhhccCccccccccccccccccccccc-CCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCcccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYL-GDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~ 92 (424) +|||++++. +....+++......+.+...... +........++++|+|++||++||++||++|+++|+++++|+.++ T Consensus 1 Mg~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~~~dg~~~~ 78 (423) T protein:vir:81 1 MGFLQKLGL--APSVVATPEPIELVGPIFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQLQAFERVEDGGRER 78 (423) T ss_pred CchhHhhcc--ccccccCccccccccccccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEEecCCceee Confidence 999999853 22333333332233333222222 223344566778999999999999999999999999988776554 Q ss_pred ccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC--CceeEEEEeccceEEEEEcCC---ceEEEEE Q lcl|NC_019710. 93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA--GDVISLLPLQSANMDVKLVGK---KVVYRYQ 167 (424) Q Consensus 93 ~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~--G~~~~l~~l~p~~v~~~~~~~---~~~~~~~ 167 (424) ..+|++.+||. +||++||+++||+.++.+++++||||+++.|+.. +.+..|+|+++..+.+....+ ...|.+. 
T Consensus 79 -~~~~~~~~ll~-~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~~~~~v~~~~~~~~~~~~~Y~~~ 156 (423) T protein:vir:81 79 -VREGHLARVCK-LANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDIRPIPVSWVQRRAYKDGWGSLDYIII 156 (423) T ss_pred -eccchHHHHhh-cCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEEeecccceeeeeeccCCCcceEEEEE Confidence 45688988886 8999999999999999999999999999998753 567788888888887654322 2344433 Q ss_pred ----ecCceEEecHhHeeEecCcCCCC-ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCC----CCCHH Q lcl|NC_019710. 168 ----RDSEYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK----VLTEQ 238 (424) Q Consensus 168 ----~~~~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~----~~~~~ 238 (424) .++....++++||||+|.++.++ .+|+||+..++.+++...++++++.++|+||++|++||+.+.. ..+++ T Consensus 157 ~~~~~~g~~~~~~~~evih~r~~~~~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e 236 (423) T protein:vir:81 157 ESGDNDGRSVKVPGERVIHRHGYNPKTMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAE 236 (423) T ss_pred EecCCCceEEEEcccceEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHH Confidence 24556789999999999888776 5799999999999999999999999999999999999987643 35688 Q ss_pred HHHHHHHHHHHHh--CCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH Q lcl|NC_019710. 
239 QRSQVEENFKEIA--GGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 239 ~~~~~~~~~~~~~--~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e 316 (424) +.+++++.|++.+ +.+|+|++++|++|++|++++++++|+||+|.++++.++||++|||||.+||..++++ ++|+| T Consensus 237 ~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t--~sn~e 314 (423) T protein:vir:81 237 SRTRFMANLRASFSPKSSDVGGTLLLEDGMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNAN--YSNVR 314 (423) T ss_pred HHHHHHHHHHHHhccccccCCcceecCCCceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCC--cccHH Confidence 8888999888765 4578999999999999999999999999999999999999999999999999887764 56999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhccChhhh--ccceeeecchhhhccCHHHHHHHHHHHHh-CCCcCHHHHHHHhCCC Q lcl|NC_019710. 317 QQNLGFLQYTLQPYISRWENSIQRWLIPAKDV--GRIHAEHNLDGLLRGDSASRAAFMKAMGE-SGLRTINEMRRTDNLP 393 (424) Q Consensus 317 ~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~--~~~~~~f~~~~~~~~d~~~~~~~~~~~~~-~g~~t~NE~R~~lg~~ 393 (424) ++.+.|+++||.|++++||++|+++|+++.+. .+++++||.+.+++.|.++|++.+.+++. .||||+||+|+++|+| T Consensus 315 ~~~~~f~~~~L~P~~~~ie~~l~~~L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R~~~gl~ 394 (423) T protein:vir:81 315 EFRKALYGDNLGSWIRIIQDVMNLFLLPRVGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVRAMDNLP 394 (423) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHHHHhCCC Confidence 99999999999999999999999999998763 46889999999999999999999999874 6999999999999999 Q ss_pred CCCCcCeeeecccccchhhccccCCCccC Q lcl|NC_019710. 
394 PLPGGDVAMRQSQYVPITDLGTNKEPRNN 422 (424) Q Consensus 394 p~~ggd~~~~~~n~~~~~~~~~~~~~~~~ 422 (424) |+||||++++|.|+.+.+......+..+- T Consensus 395 p~~gGD~~~~p~n~~~~~~~~~~~~~~~t 423 (423) T protein:vir:81 395 SIDGGDDLARPLNTEFGDSEDAPGEEVET 423 (423) T ss_pred CCCCcceeecccccccCccCCCCCCCCCC Confidence 99999999999999886643322222222 No 40 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=100.00 E-value=2.5e-91 Score=517.34 Aligned_cols=391 Identities=16% Similarity=0.217 Sum_probs=318.6 Q ss_pred cccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeecccc Q lcl|NC_019710. 8 IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQN 87 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~ 87 (424) |+ +|.+ +... ..+.........+......+.++ ...||++++||+||++||+.||++||++|+++.+ T Consensus 1 m~------~~~~----~~~~--~~~~~~~~~~~~~~~~~~~g~~~-~~~Al~~~~V~~cv~~ia~~iA~lp~~~~~~~~~ 67 (417) T protein:vir:38 1 MK------LFRG----LATE--VDPHWADHLLDSGVIPSFRGGYL-GISALRNSDVLTAVSIVSGDVSRFPLVITDSSTD 67 (417) T ss_pred Cc------cccc----cccC--CCccchhhhcccccccccCCcee-chhhcccHHHHHHHHHHHHhhccCeeEEEEcCCc Confidence 33 2211 1100 00000000000111112223333 3468999999999999999999999999988765 Q ss_pred CccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-CceeEEEEeccceEEEEEcC-CceEEE Q lcl|NC_019710. 88 DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-GDVISLLPLQSANMDVKLVG-KKVVYR 165 (424) Q Consensus 88 ~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~-G~~~~l~~l~p~~v~~~~~~-~~~~~~ 165 (424) +. ...|+++++|+.+||++||+++||+.++.+++++||||++++|+.. |.|..|++++|.+|.+..++ +...|. 
T Consensus 68 ~~----~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~~~~~y~ 143 (417) T protein:vir:38 68 EV----IDLANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDPDNIIYR 143 (417) T ss_pred ce----eccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCCCeEEEE Confidence 43 3468899999999999999999999999999999999999999875 67999999999999987654 445555 Q ss_pred EEe--cCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHH Q lcl|NC_019710. 166 YQR--DSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) Q Consensus 166 ~~~--~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~ 243 (424) |.. ++....++++||||||+++.|+++|+||+.++..++.++.++++++.++|+||++|+++++.+.. .++++.+++ T Consensus 144 ~~~~~~~~~~~~~~~dviH~r~~~~d~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~-l~~e~~~~~ 222 (417) T protein:vir:38 144 FTPYNSSMQKVCGFEDVIHWKFFSYDTIMGRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKAKESR-LSAEARQKI 222 (417) T ss_pred EEEcCCcEEEEecCcceEEecCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCC-CCHHHHHHH Confidence 543 33456789999999999999999999999999999999999999999999999999999998866 567778899 Q ss_pred HHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHH Q lcl|NC_019710. 244 EENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 244 ~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~ 323 (424) ++.|++.+++.|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||.. 
.+++|++++.++|+ T Consensus 223 ~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~----~~~s~~e~~~~~~~ 298 (417) T protein:vir:38 223 REDFERAQAGADAGSPIIVDATMDYQPLEVDTNVLNLINSNNYSTAQIAKALRVPAYRLAQN----SPNQSVKQLADDYI 298 (417) T ss_pred HHHHHHHhcccccCCceeccCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHhCCC----CcchhHHHHHHHHH Confidence 99999988888999999999999999999999999999999999999999999999999842 34568999999999 Q ss_pred HHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--Cee Q lcl|NC_019710. 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGG--DVA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~gg--d~~ 401 (424) ++||.|++++||++|+++|+++.++..++++||.+.+...+ .+.+++++++|+||+||+|+++|+||+||| |++ T Consensus 299 ~~tl~P~~~~ie~~l~~~Ll~~~~~~~~~~~fd~~~l~~~~----~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~ 374 (417) T protein:vir:38 299 RNDLPFYFEPITSEFELKLLDDAQRHQYCIGFDTKSVNGLP----IADVNTAVNGGLWTGNEGRAELGKKPLKDPNMDRI 374 (417) T ss_pred HHHHHHHHHHHHHHHHhhhcChhhcccceEEechhhhhHHH----HHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCee Confidence 99999999999999999999988877788999988875433 344778999999999999999999999987 889 Q ss_pred eecccccchhhccccCC-------CccCCC Q lcl|NC_019710. 402 MRQSQYVPITDLGTNKE-------PRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~-------~~~~g~ 424 (424) ++|+|+++++.....+. 
+.++.+ T Consensus 375 ~~~~n~~~~d~~~~~~~~~~~~~kgg~~~~ 404 (417) T protein:vir:38 375 QSTLNTVFLDQKEAYQAEHAAELKGGDTNA 404 (417) T ss_pred eecccccccccccccccccccccCCCCCCC Confidence 99999999986544211 111111 No 41 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=100.00 E-value=9.2e-91 Score=514.20 Aligned_cols=404 Identities=15% Similarity=0.155 Sum_probs=331.2 Q ss_pred HHHHHHhhccCccccccc-ccccccccc---cccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccc Q lcl|NC_019710. 16 WWARLKSWFVGGRLVTPN-QGSQTGPVS---AHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK 91 (424) Q Consensus 16 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~ 91 (424) |++.+...+++....... ...+...++ ...+..+..++.+.|+++|+||+||++||++||++||++|++..+|..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~lp~~v~~~~~~g~~~ 80 (460) T protein:vir:10 1 MANRIIRALRELTGLDNKFNDAFIKYIGQTFTKYDNNGKTYLEQGYNINPDVYSCISQMAAKTVAVPYTIKVVKDTKAYQ 80 (460) T ss_pred CchhHHHHHhhhhccCCCchHHHHHhhccccCCCccchhhhhHHHHhcchHHHHHHHHHHHhhhhCceEEEeccCCccch Confidence 555555444432222211 111211111 1223355668888999999999999999999999999999998887543 Q ss_pred c-------------------------ccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC----CCce Q lcl|NC_019710. 92 K-------------------------VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS----AGDV 142 (424) Q Consensus 92 ~-------------------------~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~----~G~~ 142 (424) + ....+++..+|+.+||++||+++||+.++.+++++||||++++|+. 
.|.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~~~~~G~~ 160 (460) T protein:vir:10 81 QLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSPDDGINAGVP 160 (460) T ss_pred hhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCccCcee Confidence 2 2234456667888999999999999999999999999999999964 4789 Q ss_pred eEEEEeccceEEEEEcCCce---------EEEEEecCceEEecHhHeeEecCcCCC------CccccchHHHHHHHHHHH Q lcl|NC_019710. 143 ISLLPLQSANMDVKLVGKKV---------VYRYQRDSEYADFSQKEIFHLKGFGFT------GLVGLSPIAFACKSAGVA 207 (424) Q Consensus 143 ~~l~~l~p~~v~~~~~~~~~---------~~~~~~~~~~~~~~~~evih~r~~~~~------~~~G~s~~~~~~~~i~~~ 207 (424) .+||||+|.+|++..+.++. .|.+..++....|+++||||||+++.+ +++|+||+.+++.++... T Consensus 161 ~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~ 240 (460) T protein:vir:10 161 SQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLIQGDQFIEFNEDEVIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQ 240 (460) T ss_pred EEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEecCceeEEecccceEEEecCCCCcccccCccccccHHHHHHHHHHHH Confidence 99999999999998876542 234455777789999999999976543 579999999999999999 Q ss_pred HHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhC-CcccCcceecCCCceeeeccCChhHHHHHHHHHH Q lcl|NC_019710. 208 VAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAG-GPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKF 286 (424) Q Consensus 208 ~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~-~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~ 286 (424) .++++++.++|+||+.|+++++.+.. 
.++++.+.+++.|++.++ .+|+|+++++++|++|++++++++|+||+|.+++ T Consensus 241 ~~~~~~~~~~f~ng~~~~~i~~~~~~-l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~ 319 (460) T protein:vir:10 241 NSTIDNNVKTMQNGGVFGFIHGGSTG-LTQPQADSLKQRLTEMDKSPDRLSQIAGASGEIAFTKISLNTDELKPFDYLKY 319 (460) T ss_pred HHHHHHHHHHHhcCCCcceeeecCCC-CCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHH Confidence 99999999999999999999887755 567778888888887655 4689999999999999999999999999999999 Q ss_pred HHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhc-cceeeecchhh--hcc Q lcl|NC_019710. 287 QVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGL--LRG 363 (424) Q Consensus 287 ~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~-~~~~~f~~~~~--~~~ 363 (424) ++++||++|||||.+||..++++++++|+|++.+.|+++||.|++.+||++|+++|+++.++. +++++||++.+ ++. T Consensus 320 ~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~~~~i~~d~~~l~~l~~ 399 (460) T protein:vir:10 320 DQKAICNALGWSDKLLNNNEGGGLNTGNLEEERKRVVTDNIQPDLVILKQAFDKKFIKRFKGYENAVIEWDISELPEMQT 399 (460) T ss_pred HHHHHHHHhCCCHHHhCCCCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccCCceEEeecchhhhHHH Confidence 999999999999999999988888899999999999999999999999999999999987754 57889999887 344 Q ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCeeeecccccchhhccccCC-CccCCC Q lcl|NC_019710. 364 DSASRAAFMKAMGESGLRTINEMRRTDNLPPL--PGGDVAMRQSQYVPITDLGTNKE-PRNNGA 424 (424) Q Consensus 364 d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~--~ggd~~~~~~n~~~~~~~~~~~~-~~~~g~ 424 (424) |.++ ...++++|++|+||+|+.+|+||+ ||||++++|+|++|+++.+++.. ..+|.. 
T Consensus 400 d~~~----~~~~~~~g~~T~NE~R~~~g~~pi~~~~gD~~~~~~n~~~~~~~~~~~~~~~~nq~ 459 (460) T protein:vir:10 400 DMVA----MASWLNTIPVTPNEIRIAMKYETLNQDGMDIVFMPSNKVRIDDVSNNLIDSAFNQN 459 (460) T ss_pred HHHH----HHHHHhCCCCCHHHHHHHhCCCCCCCCCCCeeeecccccchhhcccccCCCcccCC Confidence 4443 345778999999999999999998 58999999999999998766432 222222 No 42 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=100.00 E-value=1.3e-89 Score=507.86 Aligned_cols=387 Identities=14% Similarity=0.184 Sum_probs=321.7 Q ss_pred ccHHHHHHhhccCccccccc-ccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCcccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPN-QGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~ 92 (424) +|||+. ....... ...+....+. .....++...|+++++||+||++||++||++||++++++. + T Consensus 1 m~~f~~-------~~~~~~~~~~~~~~~~~~---~~~~~~~~~~Al~~~~V~~~i~~Ia~~iA~lp~~~~~~~g-----~ 65 (406) T protein:vir:97 1 MSFFQP-------LGTSKVSYDDYISSVLAG---DVSQKYLGVSALKNSDILTATSIIAGDIARFPLVKKDVNG-----D 65 (406) T ss_pred Cccccc-------cCCCCCCcchHHHHHhcC---CCCcccccchhhccHHHHHHHHHHHHhhhhCeeEEEecCc-----c Confidence 566553 2211111 1111111111 1223345557899999999999999999999998876543 2 Q ss_pred ccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC-CCceeEEEEeccceEEEEEcCC-ceEEEEE--e Q lcl|NC_019710. 93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS-AGDVISLLPLQSANMDVKLVGK-KVVYRYQ--R 168 (424) Q Consensus 93 ~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~-~G~~~~l~~l~p~~v~~~~~~~-~~~~~~~--~ 168 (424) ...+|++.+||+.+||++||+++||+.++.+|+++||||+++.|+. .|.+.+|||++|++|++..+.+ ...|.+. . 
T Consensus 66 ~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~~~~~~y~~~~~~ 145 (406) T protein:vir:97 66 IIHDEDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETDNHEIVYTFTDML 145 (406) T ss_pred ccccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcCCceEEEEEEecC Confidence 3456899999999999999999999999999999999999999985 6899999999999999887654 4455554 4 Q ss_pred cCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHH Q lcl|NC_019710. 169 DSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFK 248 (424) Q Consensus 169 ~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~ 248 (424) ++....++++||||||+++.++++|+||+.++..++.++.+++++..++|+||+.|++++..+ ...++++.+.+++.|+ T Consensus 146 ~~~~~~~~~~evih~r~~~~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~~-~~l~~e~~~~~~~~~~ 224 (406) T protein:vir:97 146 TAKQVKCFAHDVIHWKFFSHDTILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSSGILTMKG-AQLSGDARQRARQEFE 224 (406) T ss_pred CceEEEEccccEEEecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEecC-CCCCHHHHHHHHHHHH Confidence 566788999999999999999999999999999999999999999999999999988777665 4567888889999999 Q ss_pred HHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHH Q lcl|NC_019710. 249 EIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQ 328 (424) Q Consensus 249 ~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~ 328 (424) +..++.|+|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||... +++|++++.+.|+.+||. 
T Consensus 225 ~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~----~~~~~e~~~~~f~~~~l~ 300 (406) T protein:vir:97 225 KMREGSVGGSPLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNS----PNQSVAQLMEDYVTNDLP 300 (406) T ss_pred HHhcccccCceeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCC----CcchHHHHHHHHHHHHHH Confidence 9999899999999999999999999999999999999999999999999999998632 345889999999999999 Q ss_pred HHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCeeeeccc Q lcl|NC_019710. 329 PYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG--GDVAMRQSQ 406 (424) Q Consensus 329 P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g--gd~~~~~~n 406 (424) |++++||++|+++|+++.++..++++||++.+ .+.+++.+.+++++|+||+||+|+++|+||+++ ||++++|+| T Consensus 301 P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~~----~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~gD~~~~~~n 376 (406) T protein:vir:97 301 FYFDAITSELGLKTLNDKDRRLYHIEFDTRSV----TGRNVDEIVKLVNNQILTPNQGLVELGKQKSTDPNMDRYQSSLN 376 (406) T ss_pred HHHHHHHHHHhhhhcChhhccceeEEEecCcc----chhhHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeEeeccC Confidence 99999999999999999887778899997654 556677888999999999999999999999965 999999999 Q ss_pred ccchhhcccc--------CCCccCCC Q lcl|NC_019710. 407 YVPITDLGTN--------KEPRNNGA 424 (424) Q Consensus 407 ~~~~~~~~~~--------~~~~~~g~ 424 (424) ++|++...+. 
++++++|- T Consensus 377 ~~~~~~~~~~~~~~~~~~~gg~~~~~ 402 (406) T protein:vir:97 377 YVFLDKKEEYQDKVGIKGKGGEVNAE 402 (406) T ss_pred ccchhcccccccccccccCCCCCCCC Confidence 9999876432 22222222 No 43 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=100.00 E-value=4.2e-89 Score=505.11 Aligned_cols=374 Identities=21% Similarity=0.272 Sum_probs=313.7 Q ss_pred ccHHHHHHhh------------------------ccCcccccccccc-ccccccc------ccccCCccccHHHHhhhHH Q lcl|NC_019710. 14 NGWWARLKSW------------------------FVGGRLVTPNQGS-QTGPVSA------HGYLGDSSINDERILQIST 62 (424) Q Consensus 14 ~G~~~~~~~~------------------------~~~~~~~~~~~~~-~~~~~~~------~~~~~~~~~~~~~~~~~~~ 62 (424) +|||+++++. |.++......... +..+.++ +....+..++.+.++++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t~~~~~~~~~ 80 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQDKLRTLIDV 80 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceeeccCCCcchhhhhcccccccccccccccccccCccccchhhHhhhHH Confidence 9999999996 2222221111111 1111111 1233567789999999999 Q ss_pred HHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEE-eeCCCCc Q lcl|NC_019710. 63 VWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV-DRNSAGD 141 (424) Q Consensus 63 v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~-~r~~~G~ 141 (424) ||+||++||++||++||++|++++. .+.+..+|+.+||++||+++||+.++.+|++ ||+|+++ .++.+|. 
T Consensus 81 v~acV~~Ia~~iA~lpl~~~~~~~~--------~~~~~~ll~~~PN~~~t~~~f~~~l~~~lll-Gnay~~~i~r~~~G~ 151 (409) T protein:vir:83 81 AWACIDLNASVLSSMPIYRMRNGRI--------IDSVAWMSNPDPEVYTSWQEFAKQLFWDFQL-GEAFVLPMAHGSDGY 151 (409) T ss_pred HHHHHHHHHHhhccCceEEeeCCcc--------ccchhhhcccCCCCCCCHHHHHHHHHHHHhh-CCcEEEEEEECCCCc Confidence 9999999999999999999976532 2345668999999999999999999999987 9999875 5899999 Q ss_pred eeEEEEeccceEEEEEcCCceEEEEEecCceEEecHhHeeEecCcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019710. 142 VISLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFAN 220 (424) Q Consensus 142 ~~~l~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 220 (424) |++|+||+|.+|++..+.++.. .|..++ .+.++||||+|+++ .++++|+||+.+++.++.+..++++++.++|+| T Consensus 152 ~~~L~pl~p~~v~v~~~~~g~~-~y~~~~---~~~~~eiiHir~~~~~~~~~G~spi~~~~~~i~~~~a~~~~~~~~f~n 227 (409) T protein:vir:83 152 PIRFRVVPPWLVNVELKKGARR-EYRIGG---LNVTDEILHIRYQGNTADAHGHGPLESAAPRQVVIGLLQKYVQNLAET 227 (409) T ss_pred EEEEEEECCcceEEEEcCCceE-EEEEcc---ccCccceEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 9999999999999988876543 233332 24568999999875 467899999999999999999999999999999 Q ss_pred cCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCcee-eeccCChhHHHHHHHHHHHHHHHHHHhCCCH Q lcl|NC_019710. 221 GAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFST-SAIGVTPQDAEMMASRKFQVSELARFFGVPP 299 (424) Q Consensus 221 g~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~-~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~ 299 (424) |++|+|+|+++.. 
.++++.+++++.|+...++ |+|+++++++|+++ ++++++++|+||+|.+++++++||++||||| T Consensus 228 ga~p~gil~~~~~-ls~e~~~~~~~~~~~~~~~-nag~~~il~~g~~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp 305 (409) T protein:vir:83 228 GGVPLYWLGVERR-LSETEAVDLMDRWIESRSK-YAGHPALVTGGATLNQAKSMSAQDLSLMELTQFNEARIAILLGVPP 305 (409) T ss_pred CCCcceEeecCCC-CCHHHHHHHHHHHHHhhCC-ccCccceecCCcccccccCCCHHHHHHHHHHHhhHHHHHHHhCCCH Confidence 9999999999865 5677778888888776654 78999999999997 5689999999999999999999999999999 Q ss_pred HHcCCCCCCCc-ccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhC Q lcl|NC_019710. 300 HLVGDVEKSTS-WGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGES 378 (424) Q Consensus 300 ~~l~~~~~~~~-~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~ 378 (424) .+||..+++++ +|+|+|++.+.|+++||.|++++||++|+++|+++. .+++||++.+++.|.++|++.+++++++ T Consensus 306 ~llg~~~~~~~~tysn~eq~~~~f~~~tL~P~~~~ie~~l~~~Ll~~~----~~~~f~~~~llr~d~~~r~~~~~~~~~~ 381 (409) T protein:vir:83 306 FLVGLPGATGSLTYSNIEQLFSFHDRSSLRPKATAVMAALDRWALPSP----QHLELNRDDYTRPSLVERATAYKIMIEA 381 (409) T ss_pred HHccCCCCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCC----cEEEeehhhhhccCHHHHHHHHHHHHhC Confidence 99998776543 678999999999999999999999999999999763 4789999999999999999999999999 Q ss_pred CCcCHHHHHHHhCCCCCCCcCeeeecccc Q lcl|NC_019710. 379 GLRTINEMRRTDNLPPLPGGDVAMRQSQY 407 (424) Q Consensus 379 g~~t~NE~R~~lg~~p~~ggd~~~~~~n~ 407 (424) |+||+||+|+.+||||++|||++.-. 
.+ T Consensus 382 G~lT~NE~R~~~glpp~~ggd~l~~~-gv 409 (409) T protein:vir:83 382 GVMEPNEARAMERLHSEAAAVRLSGG-GV 409 (409) T ss_pred CCcCHHHHHHHhCCCCCCCCcccCCC-CC Confidence 99999999999999999999998422 22 No 44 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=100.00 E-value=1.7e-88 Score=501.70 Aligned_cols=402 Identities=16% Similarity=0.236 Sum_probs=325.8 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccccccccccccc-cccccccCCcccc-HHHHhhhHHHHHHHHHHHHhhhhCc Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGP-VSAHGYLGDSSIN-DERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~-~~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) |+++|..++|. ||++-++-....+... .......+ .....+......+ ...++++++|++||++||++||++| T Consensus 4 ~~~~~~~~~m~----~F~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cI~~ia~~ia~~~ 78 (413) T protein:vir:96 4 VSEIRKDKNLK----FFNNKRSPTEESKAKD-EIPKAPQVVMTLPNFFKELISDGYTKLSDSPEVRMAVDCIADLVSNMT 78 (413) T ss_pred cchhhhhhcCC----ccccCCCcchhhhhhc-cccccccccccchhhHhhhccchhHHHhhchHHHHHHHHHHHhhccCc Confidence 99999887765 3333211110000000 00000000 1111122221222 2346789999999999999999999 Q ss_pred eeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC-ceeEEEEeccceEEEEE Q lcl|NC_019710. 79 LDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG-DVISLLPLQSANMDVKL 157 (424) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G-~~~~l~~l~p~~v~~~~ 157 (424) |++|++++++.. ...|++.++|+.+||++||+++||+.++.+++++||||++++|+.+| .+.+|||++|.+|++.. 
T Consensus 79 ~~~~~~~~~~~~---~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~~~~L~~l~~~~v~~~~ 155 (413) T protein:vir:96 79 IQLMQNGETGDK---RIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDKIIGLTPISPYKVTFNV 155 (413) T ss_pred eEEEEecCCCcc---ccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCceEEEEEecCceeEEEE Confidence 999998876643 24689999999999999999999999999999999999999999887 57899999999999999 Q ss_pred cCCceEEEEEecCceEEecHhHeeEecC-cC-CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC Q lcl|NC_019710. 158 VGKKVVYRYQRDSEYADFSQKEIFHLKG-FG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL 235 (424) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~evih~r~-~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~ 235 (424) +.+...|.+..++ ..++++||||||+ ++ .++++|+||+.++..++.+..++++++.++|+||++|+++|+.+.. . T Consensus 156 ~~~~~~y~~~~~~--~~~~~~evih~k~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~-l 232 (413) T protein:vir:96 156 SDDDLDYSITFDN--KEYDPSTLLHFVLNPSIERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDSD-S 232 (413) T ss_pred cCCeEEEEEeecC--cEEchhhEEEEeccCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC-C Confidence 9888877776665 4689999999995 34 3578999999999999999999999999999999999999999866 4 Q ss_pred CHHHHHHHHHHHHHHhC-CcccCcceecCCCc-eeeec-cCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCccc Q lcl|NC_019710. 236 TEQQRSQVEENFKEIAG-GPVKKRLWILEAGF-STSAI-GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWG 312 (424) Q Consensus 236 ~~~~~~~~~~~~~~~~~-~~~ag~~~~l~~g~-~~~~l-~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~ 312 (424) ++++.+++++.|++..+ ..|+|+++++++|. 
++.++ .++++|+||+|.+++++++||++|||||.+||..+ T Consensus 233 ~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~------ 306 (413) T protein:vir:96 233 DELSDEEGRENFEEMYLKRKEAGKPWIIPEGMVNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLLGVGT------ 306 (413) T ss_pred CHHHHHHHHHHHHHHhcCccccCceeeecCCcccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCc------ Confidence 56677888888877654 57899999997665 45565 46899999999999999999999999999997531 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Q lcl|NC_019710. 313 SGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNL 392 (424) Q Consensus 313 ~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~ 392 (424) +.+++..+|+++||.|++++||++|+++|+++ +++++||++.+++.|.+++++++++++++|+||+||+|+++|+ T Consensus 307 -~~~~~~~~~~~~~l~P~~~~ie~~ln~~ll~~----~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~ 381 (413) T protein:vir:96 307 -YNKDEFNNFINTKIMSIAQVIQQTYNKLIVEE----DMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNEFRNWVGM 381 (413) T ss_pred -chHHHHHHHHHHHHHHHHHHHHHHHHHhhCCC----CcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 34788899999999999999999999999975 4689999999999999999999999999999999999999999 Q ss_pred CCCCCcCeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 
393 PPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 393 ~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) ||+||||++++++|++|++++++++....+-- T Consensus 382 ~p~~~gd~~~~~~n~~~~~~~~~~~~~~~~dt 413 (413) T protein:vir:96 382 PPDAEMDDLLVLENYLQQKDLVNQKKLIQDET 413 (413) T ss_pred CCCCCcceeeecccccchhhcccccCCCCCCC Confidence 99999999999999999998876543211111 No 45 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=100.00 E-value=5.8e-88 Score=498.86 Aligned_cols=380 Identities=19% Similarity=0.228 Sum_probs=308.1 Q ss_pred cccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCC Q lcl|NC_019710. 29 LVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPN 108 (424) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN 108 (424) .+ .-+...+.+..+.......++.+.|+++++||+||++||++||++||++|+++ + +....|+++++|+.+|| T Consensus 1 ~~--~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~--~---~~~~~~~l~~lL~~~PN 73 (723) T protein:vir:94 1 MT--TFPSGAGGWNAWSADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPD--G---ELDELHPLSQLWNVMPN 73 (723) T ss_pred Cc--ccccCCCccccccccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCC--C---ccchhhHHHHHHhhCCC Confidence 11 01111111111122234456778899999999999999999999999998653 2 22346999999999999 Q ss_pred CCCCHHHHHHHHHHHHHHcCCeEEEEeeCC---CCceeEEEEeccceEEEEEcCCc--------eEEEE-EecCceEEec Q lcl|NC_019710. 109 QYMTAQEFREAMTMQLCFYGNAYALVDRNS---AGDVISLLPLQSANMDVKLVGKK--------VVYRY-QRDSEYADFS 176 (424) Q Consensus 109 ~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~---~G~~~~l~~l~p~~v~~~~~~~~--------~~~~~-~~~~~~~~~~ 176 (424) ++||+++||+.++.+|+++||+|++++++. .|.|.+|++++|..+.+...... 
..|.+ ..++....|+ T Consensus 74 ~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~~~~~y~~~~~~G~~~~~~ 153 (723) T protein:vir:94 74 RAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVPQAQIIGYVIERTDGVRVPVL 153 (723) T ss_pred CCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccceeeeeeEEEEEecCceeEEec Confidence 999999999999999999999999999754 48999999999987776544322 12333 3455667899 Q ss_pred HhHeeEecCcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHH-hCCc Q lcl|NC_019710. 177 QKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI-AGGP 254 (424) Q Consensus 177 ~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~-~~~~ 254 (424) ++||||||+++ .++++|+||+.++..+|+...++++++.++|+||++|+|||+.+ + .++++.+++++.|++. .|.. T Consensus 154 ~~dIiHir~~~~~dg~~G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~~-~-l~~e~~~~~~~~~~~~~~G~~ 231 (723) T protein:vir:94 154 ADEMLWLRFSDPYDPLAVMAPWKAARAAVDADFYAATWQRQSFKNGARPGGVVNLG-D-MDEQTFTKTVAAFRSQVEGVQ 231 (723) T ss_pred ccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcC-C-CCHHHHHHHHHHHHHHhhchh Confidence 99999999875 78999999999999999999999999999999999999999976 3 5677788888888765 4557 Q ss_pred ccCcceecC----------CCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHH Q lcl|NC_019710. 255 VKKRLWILE----------AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ 324 (424) Q Consensus 255 ~ag~~~~l~----------~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~ 324 (424) |+|++++|+ .|++|++++++++|+||+|+++++.++||++|||||.+|++. .+++|++++.+.|+. 
T Consensus 232 Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~----st~sN~e~~~~~f~~ 307 (723) T protein:vir:94 232 NAGRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRKDALLGG----STYENQAEAKAAVWT 307 (723) T ss_pred hcCcceeecccccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCC----CCcccHHHHHHHHHH Confidence 999999986 589999999999999999999999999999999999999642 345699999999999 Q ss_pred HHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe--ee Q lcl|NC_019710. 325 YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDV--AM 402 (424) Q Consensus 325 ~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~--~~ 402 (424) +||.|++++||++|+++|++..+. .++++||...+++.|.+++++.+.+++++|+||+||+|+++|+||+||||. ++ T Consensus 308 ~tL~P~~~~ie~~ln~~Ll~~~g~-~~~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lglpPi~gGd~~~~~ 386 (723) T protein:vir:94 308 ETLIPQMEVMASITDLQLLPDIGW-TVEWDFNSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGLDPLPGGIGQMTL 386 (723) T ss_pred HHHHHHHHHHHHHHhHhhcccccC-ceEEeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCccccee Confidence 999999999999999999976543 367888888999999999999999999999999999999999999999884 34 Q ss_pred ecc--cccchhhccccCCCccCCC Q lcl|NC_019710. 403 RQS--QYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 403 ~~~--n~~~~~~~~~~~~~~~~g~ 424 (424) .|. ++.|.+... 
...+++++ T Consensus 387 ~p~~~~~a~~~~~~--p~~~e~~~ 408 (723) T protein:vir:94 387 TPYRAQFAPAPAPA--PAVEEGAA 408 (723) T ss_pred ccccccccCCCCCC--ccchhhhH Confidence 443 444433222 11122222 No 46 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=100.00 E-value=9.2e-88 Score=497.75 Aligned_cols=388 Identities=17% Similarity=0.228 Sum_probs=315.2 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccH-HHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCcccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSIND-ERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~ 92 (424) +|||+.++ .++.+.+..... .+..........++. ..+..+|+|++||++||++||++|+++|++.++|.. T Consensus 1 Mg~~~~f~----~k~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~V~~~I~~ia~~iA~~p~~~~~~~~~g~~-- 72 (403) T protein:vir:80 1 MGLFNFFR----RKTRSEPTNAIS--WFLTQEAYDTLAIPGYTRLSDNPEVRMAVHKIAELISSMTIHLMQNTDNGDI-- 72 (403) T ss_pred Cccccccc----ccccccccchhh--hhcccccccccccchhhhhhhhHHHHHHHHHHHHhhhhCceEEEEecCCcee-- Confidence 88886433 232222211111 111111111111222 234568999999999999999999999998876643 Q ss_pred ccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHc--CCeEEEEeeCCCCceeEEEEeccceEEEEEcCCceEEEEEecC Q lcl|NC_019710. 93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFY--GNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDS 170 (424) Q Consensus 93 ~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~--G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~~~~~ 170 (424) ...|++.++|+.+||++||+++||+.+++++++. ||||+++.++..|.+.+||||+|.+|++..+.++..++|. 
T Consensus 73 -~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~~~L~~l~p~~v~~~~~~~g~~~~y~--- 148 (403) T protein:vir:80 73 -RIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLIDELIPLAPSKVSFVDTDTGYQIWYQ--- 148 (403) T ss_pred -ecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcEEEEEEEcCCeeEEEEcCCceEEEEe--- Confidence 2468999999999999999999999999999984 7899999999999999999999999999988887655553 Q ss_pred ceEEecHhHeeEecC--cCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHH Q lcl|NC_019710. 171 EYADFSQKEIFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFK 248 (424) Q Consensus 171 ~~~~~~~~evih~r~--~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~ 248 (424) ...|+++|||||+. .+.++++|+||+.++..++....++++++.++|+||++|++||+.+....++...+..+++.+ T Consensus 149 -~~~~~~~eiih~~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~ 227 (403) T protein:vir:80 149 -GKAYNYDEVLHFIVNPDPEKPYMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFK 227 (403) T ss_pred -ecccchhhEEEEeccCCCcCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCChHHHHHHHHHHHH Confidence 24689999999993 456788999999999999999999999999999999999999999877655555555555566 Q ss_pred HHhCCcccCcceecCCCc-eeeecc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHH Q lcl|NC_019710. 249 EIAGGPVKKRLWILEAGF-STSAIG-VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 249 ~~~~~~~ag~~~~l~~g~-~~~~l~-~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~t 326 (424) .+.++.++|++++++++. ++.++. ++++|+|++|.+++++.+||++|||||.+||... 
+.++...+|+.+| T Consensus 228 ~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-------~~~~~~~~f~~~~ 300 (403) T protein:vir:80 228 KYLEASEAGQPWIIPAELLDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPAFLLGVGK-------YDKDEYNNFINST 300 (403) T ss_pred HHhhhhhcCCeeeecccccccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHcCCCC-------ccHHHHHHHHHHH Confidence 777888999999987664 455544 5789999999999999999999999999997532 2245567899999 Q ss_pred HHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccc Q lcl|NC_019710. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQ 406 (424) Q Consensus 327 l~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n 406 (424) |.|++++||++|+++|+++.+ ++++||.+.+++.|.+++++.+.+++++|+||+||+|+++|+||+||||++++++| T Consensus 301 l~P~~~~ie~~l~~kll~~~~---~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~~ggd~~~~~~n 377 (403) T protein:vir:80 301 ILPIAKGIEQELTRKLLISPD---LYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPKEGLSELVILEN 377 (403) T ss_pred HHHHHHHHHHHHHHhccCCCC---cEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeeccc Confidence 999999999999999998755 67899999999999999999999999999999999999999999999999999999 Q ss_pred ccchhhcccc---CCCccCCC Q lcl|NC_019710. 
407 YVPITDLGTN---KEPRNNGA 424 (424) Q Consensus 407 ~~~~~~~~~~---~~~~~~g~ 424 (424) ++|++.++++ ++++++|+ T Consensus 378 ~~pl~~~~~~~~~k~ge~~~~ 398 (403) T protein:vir:80 378 YIPLDKIGDQNKLKGGEKGGA 398 (403) T ss_pred ccchhhccchhhccCCCCCCC Confidence 9999876653 34444444 No 47 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=100.00 E-value=4.6e-87 Score=493.92 Aligned_cols=391 Identities=17% Similarity=0.195 Sum_probs=324.6 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +|||++++....+.... . .......+..........++...++++++|++||++||++||++||++|+.++++.. T Consensus 1 Mg~f~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~--- 75 (406) T protein:vir:95 1 MGLFDRWRRTKRKSKIR-A-DTGYVGLFMSGEDVSFLVPGYVRLSDNPEVRMAVHKIADLISSMTIYLMQNTEDGDI--- 75 (406) T ss_pred Ccchhhhcccccccccc-c-cchhhhhhccCcccCccccCHHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCcce--- Confidence 99999876544322221 1 111111222223345566778889999999999999999999999999998876632 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCe--EEEEeeCCCCceeEEEEeccceEEEEEcCCceEEEEEecCc Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNA--YALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a--~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~~~~~~ 171 (424) ...|++.++|+.+||++||+++||+.++.+++++|++ |+++.++..|.+++||||+|.+|++..+.++..|. 
.+ T Consensus 76 ~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l~~i~~~~v~~~~~~~~~~~~--~~-- 151 (406) T protein:vir:95 76 RIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDELVPLTPSKVNFLDTPDGYQVL--YG-- 151 (406) T ss_pred eecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEEEEEcCceeEEEEcCCeEEEE--ec-- Confidence 2468899999999999999999999999999999765 66678999999999999999999999998875443 33 Q ss_pred eEEecHhHeeEecC--cCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHH Q lcl|NC_019710. 172 YADFSQKEIFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKE 249 (424) Q Consensus 172 ~~~~~~~evih~r~--~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~ 249 (424) ...|+++||||+|+ .+.++++|+||+.++..++.+..++.+++.++|+||++|+++++.+.... +++.+++++.|.+ T Consensus 152 ~~~~~~~evih~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~-~e~~~~~~~~~~~ 230 (406) T protein:vir:95 152 GQTFNYDEVLHFIYNPDPERPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDAATA-ELSSEEGRNAVFK 230 (406) T ss_pred cEEEchhHEEEeeccCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCC-HHHHHHHHHHHHH Confidence 35799999999995 34578999999999999999999999999999999999999999987655 5555666666655 Q ss_pred -HhCCcccCcceecCC-Cceeeecc-CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHH Q lcl|NC_019710. 250 -IAGGPVKKRLWILEA-GFSTSAIG-VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 250 -~~~~~~ag~~~~l~~-g~~~~~l~-~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~t 326 (424) +.+..|+|++++++. |.+++++. 
++++|+||+|.+++++++||++|||||.+||..+ +.+++..+|+++| T Consensus 231 ~~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~-------~~~~~~~~~~~~~ 303 (406) T protein:vir:95 231 KYLQATEAGQPWIIPAELLEVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLLGIGE-------FNRDEYNNFINST 303 (406) T ss_pred HhccccccCCceeecCCCccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-------chHHHHHHHHHHH Confidence 566678999988865 45677764 6899999999999999999999999999997532 4578889999999 Q ss_pred HHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccc Q lcl|NC_019710. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQ 406 (424) Q Consensus 327 l~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n 406 (424) |.|++++||++|+++|+++.+ ++++||++.+++.|.+++++.+.+++++|+||+||+|+++|+||+||||++++|+| T Consensus 304 l~P~~~~ie~~l~~~l~~~~~---~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~~~gd~~~~~~n 380 (406) T protein:vir:95 304 ILPIAKGIEQELTRKLLISPD---LYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPKEGLSELVILEN 380 (406) T ss_pred HHHHHHHHHHHHHHhcCCCCC---cEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccC Confidence 999999999999999998754 57999999999999999999999999999999999999999999999999999999 Q ss_pred ccchhhcccc---CCCccCCC Q lcl|NC_019710. 407 YVPITDLGTN---KEPRNNGA 424 (424) Q Consensus 407 ~~~~~~~~~~---~~~~~~g~ 424 (424) ++|++.+++. +++++++. 
T Consensus 381 ~~~~~~~~~~~~~k~g~~~~~ 401 (406) T protein:vir:95 381 YIPLDKIGDQSKLKGGDNSGA 401 (406) T ss_pred ccchhhcccccccCCCCCCCC Confidence 9999876653 33334433 No 48 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=100.00 E-value=2.3e-86 Score=490.05 Aligned_cols=401 Identities=15% Similarity=0.135 Sum_probs=322.5 Q ss_pred ccHHHHHHhhccCcccccccccc------ccc---------------ccc----cccccCCccccHHHHhhhHHHHHHHH Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGS------QTG---------------PVS----AHGYLGDSSINDERILQISTVWRCVS 68 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~------~~~---------------~~~----~~~~~~~~~~~~~~~~~~~~v~~~i~ 68 (424) +||++|+++.+.+.......... ... ... ......+..++.+.|+++++|++||+ T Consensus 1 M~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~g~~v~~~~a~~~~~v~~~i~ 80 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTELAPDTFVGLATQAYQANGPVFACML 80 (466) T ss_pred CchhHHHhhccCcccccchhhhhhhhhhhhccccccccccccHHHHHhhccccccccCccccccchhhhhccHHHHHHHH Confidence 99999999988754222111000 000 000 01112466689999999999999999 Q ss_pred HHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC--------C Q lcl|NC_019710. 69 LISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA--------G 140 (424) Q Consensus 69 ~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~--------G 140 (424) +||++||++||++|++++.+ .. ...+|++..| +.+||++||+++||+.++.+++++||||++++|+.. 
| T Consensus 81 ~Ia~~ia~lp~~~~~~~~~~-~~-~~~~~~~~~L-~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~~~g 157 (466) T protein:vir:81 81 VRQLVFSSVRFRWQRLRDGK-PS-DTFGSRDLQI-LETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPDWVD 157 (466) T ss_pred HHHHhhccCceEEEEecCCc-ee-eccccHHHHH-hhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCccccccccCc Confidence 99999999999999876433 22 3456676664 469999999999999999999999999999999765 4 Q ss_pred ceeEEEEeccceEEEEEcCCc---eEEEEEecC-----ceEEecHhHeeEecCc--CCCCccccchHHHHHHHHHHHHHH Q lcl|NC_019710. 141 DVISLLPLQSANMDVKLVGKK---VVYRYQRDS-----EYADFSQKEIFHLKGF--GFTGLVGLSPIAFACKSAGVAVAM 210 (424) Q Consensus 141 ~~~~l~~l~p~~v~~~~~~~~---~~~~~~~~~-----~~~~~~~~evih~r~~--~~~~~~G~s~~~~~~~~i~~~~~~ 210 (424) .+.+|+|++|.+|++..+.++ ..|.|..++ ....|+++||||||++ +.++++|+||+.++.+++++..++ T Consensus 158 ~~~~l~~l~~~~v~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~dviHir~~~~~~d~~~G~s~i~~~~~~i~~~~a~ 237 (466) T protein:vir:81 158 VVVEERMVRGGRGELGGGQLGWRKVGYLYTEGGRQSGNESVGFLAEDVVHFAPIPDPLASYRGMSWLTPILREIRADQAM 237 (466) T ss_pred ceeEEEEecCcceEEEEcCCCceEEEEEEEecCcccccceeeeccccEEEEcCCCCcccccccccHHHHHHHHHHHHHHH Confidence 589999999999999887654 234454433 4568999999999965 468899999999999999999999 Q ss_pred HHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHh-CCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHH Q lcl|NC_019710. 211 EDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIA-GGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVS 289 (424) Q Consensus 211 ~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~-~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~ 289 (424) ++++.++|+||++|++||+.+.. 
.++++.+++++.|++.+ |..|+|+++++++|++|++++++++|+||+|.++++.+ T Consensus 238 ~~~~~~~f~ng~~p~gil~~~~~-l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~ 316 (466) T protein:vir:81 238 SKHQAKFFDNGATVNLVIKHNPM-ADPAAVKKWADEVNSKHAGVDNAWKNLNLYPGADADVVGSNLQEIDFKNVRGGGET 316 (466) T ss_pred HHHHHHHHhcCCCcceEEecCCC-CCHHHHHHHHHHHHHHhcCccccccceEcCCCceEEEccCChhHHHHHHHHHHHHH Confidence 99999999999999999998866 55777888888887755 55789999999999999999999999999999999999 Q ss_pred HHHHHhCCCHHHcCCCCC-CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHH Q lcl|NC_019710. 290 ELARFFGVPPHLVGDVEK-STSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASR 368 (424) Q Consensus 290 ~Ia~~fgVP~~~l~~~~~-~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~ 368 (424) +||++|||||.+||..++ ++++|+|+|++.+.|+++||.|++++||++|+++|+++.++..++++||.+.+++.|.+++ T Consensus 317 ~Ia~~fgVPp~~lG~~~~~~~st~sn~eq~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~llr~d~~~r 396 (466) T protein:vir:81 317 RIAAAAGVPPVIVGLSEGLAAATYSNYGQARRRLADGTAHPLWQNLSGCIGHVMPDMGPDVRLWYDADDVPFLREDEKDA 396 (466) T ss_pred HHHHHhCCCHHHcccccCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCcceEEEecchhhhccCHHHH Confidence 999999999999998765 4567889999999999999999999999999999999887777899999999999999988 Q ss_pred HHH-------HHHHHhCCCcCHHHHHHHhCCCCCCCcCeeee-cccccchhhcc------------ccCCCccCCC Q lcl|NC_019710. 369 AAF-------MKAMGESGLRTINEMRRTDNLPPLPGGDVAMR-QSQYVPITDLG------------TNKEPRNNGA 424 (424) Q Consensus 369 ~~~-------~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~-~~n~~~~~~~~------------~~~~~~~~g~ 424 (424) +++ +..++++|+ |+||+|+ ++++||.++. +.+..+++... 
..+++++||- T Consensus 397 ~~~~~~~~~~~~~~~~~g~-t~nE~r~-----~~~~gd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Gg~~ngn 466 (466) T protein:vir:81 397 ADIQKVRAETINTLITAGY-EPESVVA-----AVNSGDLRLLKHTGLTSVQLLPPGVSASASSDTPTSGGADDNGN 466 (466) T ss_pred HHHHHHHHHHHHHHHHcCC-Chhhccc-----cccCCccccccCCCcchhhhcccccccccCCCCcccCCCCcCCC Confidence 765 667888995 9999995 4567776543 34444443321 1122223333 No 49 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=100.00 E-value=1.1e-85 Score=486.43 Aligned_cols=411 Identities=15% Similarity=0.199 Sum_probs=318.8 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCcee Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) =+.-..|.+-++.. |..+++.+. +++.. ..+ .+......++..++.+.++++++|++||++||++||++|++ T Consensus 73 k~~i~~pfkkk~~~-~~~d~f~~s-~es~s-----~vt-sls~pdaf~~vnVs~~~AlknsaV~scI~~IA~sIAsLPlk 144 (945) T protein:vir:10 73 KEKIIVPYNHQEPP-FKFNLFEYS-PESLM-----YLP-SISDPDAFFLINLFRKYRFNNDSKLIKVSEIPKKLTSKELE 144 (945) T ss_pred hhcccccccccccc-hhhhhhhcc-Cccce-----ecc-cccCccceeeehhhhhhhhccHHHHHHHHHHHhhhccCceE Confidence 11111222222221 111222111 11110 000 11111222345677889999999999999999999999999 Q ss_pred EeeccccCcc----ccccccchhHHhhccCCCCCCCHHH----HHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccce Q lcl|NC_019710. 81 VFETDQNDNR----KKVDLSNPLARLLRYSPNQYMTAQE----FREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSAN 152 (424) Q Consensus 81 ~~~~~~~~~~----~~~~~~~~l~~lL~~~PN~~~s~~~----f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~ 152 (424) +|++.++|.. 
++....|++..+|+ +||++||+++ |++.++.+++++||||+++.|+.+|.+.+|||++|.+ T Consensus 145 lYrr~edG~~~~~~kk~~~~hpL~~LL~-rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~G~ii~L~pLdPs~ 223 (945) T protein:vir:10 145 IYKHIEDKHVNYYLKRIRDARNILEFLE-RPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDEQGNLVAITPVDGTT 223 (945) T ss_pred EEEecccCcccccccccccchHHHHHHh-CCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcc Confidence 9999887753 34456789999886 9999999998 5567889999999999999999999999999999999 Q ss_pred EEEEEcCCce-E--EEEEecC-ceEEecHhHee-EecCcCCCCc---cccchHHHHHHHHHHHHHHHHHHHHHHh-ccCC Q lcl|NC_019710. 153 MDVKLVGKKV-V--YRYQRDS-EYADFSQKEIF-HLKGFGFTGL---VGLSPIAFACKSAGVAVAMEDQQRDFFA-NGAK 223 (424) Q Consensus 153 v~~~~~~~~~-~--~~~~~~~-~~~~~~~~evi-h~r~~~~~~~---~G~s~~~~~~~~i~~~~~~~~~~~~~~~-ng~~ 223 (424) |++..++++. . |.+..++ ....++++|+| |+++++.++. +|+||+.+++.++..+.++++++.++|. ||++ T Consensus 224 Vti~~ddDG~~~y~Yv~~idG~~~~~v~a~DvIlhirn~s~DG~~~GyGlSPIeaa~~aI~~alAaek~aar~FskNGa~ 303 (945) T protein:vir:10 224 IKPILSEDTGIVVGYVQEVDGAIVAHFDKRDVVLFRQNLTPDVYMYGYSLPPIEILYKVILSDIFIDKGNLDYYRKGGSI 303 (945) T ss_pred eEEEEcCCCcEEEEEEEecCCceEEEecCCceEEEeccCCCCcccccCCchHHHHHHHHHHHHHHHHHHHHHHHHhCCCc Confidence 9988776542 2 3333333 44578888865 5667777764 5999999999999999999999999995 7889 Q ss_pred CceeEEcCC---------CCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 224 SPQILSTGE---------KVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARF 294 (424) Q Consensus 224 p~~vl~~~~---------~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~ 294 (424) |+|+|+.+. 
...++++.+.+++.|++..+|.++|+++++++|++|++++++++|+||+|.+++++++||++ T Consensus 304 PsGILsvkg~~~~d~k~~~~LseEq~erlKe~wee~~sG~NnG~piVLdeGmef~pLs~s~~DaQfLEsrkfs~eeIArA 383 (945) T protein:vir:10 304 PEGILAIEPPSYKEGDIYPQLSREQLESIQRQLQAIMMGDYTQVPILSGGKFTWIDFKGKRRDMQFKELAEFVARKICAV 383 (945) T ss_pred cceEEEecCccccccccccccCHHHHHHHHHHHHHHhCCcccccceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHH Confidence 999998653 34578889999999999988888899999999999999999999999999999999999999 Q ss_pred hCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHH Q lcl|NC_019710. 295 FGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKA 374 (424) Q Consensus 295 fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~ 374 (424) |||||.+||..++++ ++|+|++.+.|+++||.|++.+||++||++|+...+... ++|+++.+...|.+++++.+++ T Consensus 384 FGVPP~lLG~~e~st--~SNiEqq~~~Fv~~tL~Pil~~IEqeLNrkLl~~~eg~~--i~fdFd~ldl~D~ksraEal~k 459 (945) T protein:vir:10 384 YQVSPQDVGILEGSN--KATAEVMASLTKAKGLEPLMATISKGFDEVVSEFRNEKD--IKLWFKEDDLEKERDWWNIIQG 459 (945) T ss_pred hCCCHHHcccCCCCC--cchHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccCce--eEEEecchhccCHHHHHHHHHH Confidence 999999999876654 459999999999999999999999999999986655443 4455555566788999999999 Q ss_pred HHhCCCcCHHHHHHHhCCCCCCCcCeeeecc-cccchhhccccC--------------CCccCCC Q lcl|NC_019710. 375 MGESGLRTINEMRRTDNLPPLPGGDVAMRQS-QYVPITDLGTNK--------------EPRNNGA 424 (424) Q Consensus 375 ~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~-n~~~~~~~~~~~--------------~~~~~g~ 424 (424) ++++|+||+||+|+++|+||+||||+++++. 
|+.|.+...+.+ ++...|+ T Consensus 460 li~sGiLTiNEvRe~lGLpPIeGGD~lli~~nn~~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGG 524 (945) T protein:vir:10 460 QLNTGFRSINEARMEKGLEPVPWGDVPFSGLRNWKPEDEQAKAQQGAMPPQLAQAMADQPSQQGG 524 (945) T ss_pred HHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccccccccccCCCCcccccCCCCCCCCCCC Confidence 9999999999999999999999999999987 455655432211 1111111 No 50 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=100.00 E-value=9e-85 Score=481.34 Aligned_cols=384 Identities=14% Similarity=0.173 Sum_probs=314.8 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +|+|+++++++.+....+ ......++...+..++.++.+.++++++|++||++||++||++||++|+++. + . T Consensus 1 MGl~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~vt~~~al~~~~v~~~i~~Ia~~iA~lp~~v~~~~g--~---~ 72 (394) T protein:vir:62 1 MGLRDRFSNYLFKKAEKR---GYLDNVLGKSIRYSGVYVTDSNILQSSDVYELLQDISNQMVLADIVVEDEFG--N---E 72 (394) T ss_pred CchhhhhhhhccCCCCch---hhhhhhhhcccccCccccChhhhhccHHHHHHHHHHHHhhcccceEEEcCCC--c---c Confidence 999999987764333221 2233345555667778899999999999999999999999999999997643 2 2 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCceEEEEEecCceE Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEYA 173 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~~~~~~~~ 173 (424) ..+|++..|| .+||++||+++||+.++.+++++||||+++.++..+.+ ..+.+..++... +.|..+ .. 
T Consensus 73 ~~~~~~~~Ll-~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~~~~~--------~~~~~~~~~~~~-~~~~~~--~~ 140 (394) T protein:vir:62 73 IKDDIALQIL-RNPNNYLTQSEFIKLMTNTYLLEGETFPILNGAQIHLA--------SNVFTELDDNLV-EHFNIG--GH 140 (394) T ss_pred cchhhHHHHh-ccCCCCCCHHHHHHHHHHHHHhcCCeEEEEecceeecc--------ccceEEECCceE-EEEeeC--CE Confidence 3467777655 59999999999999999999999999999876543322 344555555543 333333 46 Q ss_pred EecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC-HHHHHHHHHHHHHHh- Q lcl|NC_019710. 174 DFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT-EQQRSQVEENFKEIA- 251 (424) Q Consensus 174 ~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~-~~~~~~~~~~~~~~~- 251 (424) +|+++||||+|+++.|+++|+||+..+..++....+++++..++|+||++|+++|+.+..... +++++++++.|++.+ T Consensus 141 ~~~~~eiih~r~~~~d~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (394) T protein:vir:62 141 EIPPCMIRHVKNIGADHLRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLINAILDQLE 220 (394) T ss_pred EechhheEEecCcCCCCccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHHHHHHHHhc Confidence 799999999999999999999999999999999999999999999999999999999877653 555677777777655 Q ss_pred CCcccCcceecCCCc--eeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHH Q lcl|NC_019710. 252 GGPVKKRLWILEAGF--STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQP 329 (424) Q Consensus 252 ~~~~ag~~~~l~~g~--~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P 329 (424) +..|+|++++++.|. ++++++.+++|+||+|.++++.++||++|||||.+||... 
++|+|++.++|+++||.| T Consensus 221 g~~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-----~sn~e~~~~~~~~~~l~P 295 (394) T protein:vir:62 221 SIDEARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTELI-----KEDIEKAMMYIHNKAVRP 295 (394) T ss_pred cccccCceeEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-----CcCHHHHHHHHHHHHHHH Confidence 457899999998776 5568888999999999999999999999999999998643 358899999999999999 Q ss_pred HHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCeeeecccc Q lcl|NC_019710. 330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPL--PGGDVAMRQSQY 407 (424) Q Consensus 330 ~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~--~ggd~~~~~~n~ 407 (424) ++++||++|+++|+++.++..++++||...+++. .++++.+.+++++|+||+||+|+++|+||+ |+||++++++|+ T Consensus 296 ~~~~ie~~l~~kll~~~~~~~~~~~fd~~~~~~~--~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~~gd~~~~~~n~ 373 (394) T protein:vir:62 296 IMKNFEDHLSLLFYAQNSGKRIKFKINILDFVTY--SNKTNIGYNLVRTAITSPDNVADMLGFPKQNTKESQAIYISNDV 373 (394) T ss_pred HHHHHHHHHhhhhcCccccCceEEEechhhhcCH--HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeccccc Confidence 9999999999999998887778888888877665 467788999999999999999999999999 789999999999 Q ss_pred cchhhccccCCCccCCC Q lcl|NC_019710. 408 VPITDLGTNKEPRNNGA 424 (424) Q Consensus 408 ~~~~~~~~~~~~~~~g~ 424 (424) .|++.....+++..+|- T Consensus 374 ~~~~~~~~~~~~~kgge 390 (394) T protein:vir:62 374 TEIGKKEATDGSLGGGE 390 (394) T ss_pred ccccccccccccCCCCC Confidence 99876543322222222 No 51 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=100.00 E-value=2e-84 Score=479.46 Aligned_cols=386 Identities=20% Similarity=0.219 Sum_probs=318.1 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 
14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +||++++...+..+........ +... ........+.+.|+++++|++||++||++||++||+++++......... T Consensus 1 mg~~~~~~~~~~~~~~~~~~~~----~~~~-~~~~~~~~t~~~~~~~~~v~~cv~~Ia~~ia~~p~~v~~~~~~~~~~~~ 75 (403) T protein:vir:10 1 MGFKSWITEKLNPGQRIIRDME----PVSH-RTNRKPFTTGQAYSKIEILNRTANMVIDSAAECSYTVGDKYNIVTYANG 75 (403) T ss_pred Ccchhhhhhccchhhhhhhccc----cccc-ccCCcccccHHHHHHHHHHHHHHHHHHHHHhhCceeEeecccccccccc Confidence 8999988877764332211110 1111 1112222456889999999999999999999999999987765554444 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCceEEEEEecCceE Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEYA 173 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~~~~~~~~ 173 (424) ...|++.++|+.+||++||+++||+.++.+++++||||+++.+. .|+++|+..|++..+.+...+.+..+ ... T Consensus 76 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~~~------~l~~l~~~~~~v~~~~~~~~~~~~~~-~~~ 148 (403) T protein:vir:10 76 VKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWDGT------SLYHVPAALMQVEADANKFIKKFIFN-NQI 148 (403) T ss_pred cccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEeCc------eeEeecCcceEEEEcCCceEEEEEec-Cce Confidence 56789999999999999999999999999999999999987532 69999999999998887776655443 346 Q ss_pred EecHhHeeEecCcC-----CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHH Q lcl|NC_019710. 174 DFSQKEIFHLKGFG-----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFK 248 (424) Q Consensus 174 ~~~~~evih~r~~~-----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~ 248 (424) .++++||+||+.++ .++++|+||+.++..+++...++.++..++|+||++|++||+.+.. 
.++++.+++++.|+ T Consensus 149 ~~~~~eiih~~~~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~-l~~e~~~~~~~~~~ 227 (403) T protein:vir:10 149 NYRVDEIIFIKDNSYVCGTNSQISGQSRVATVIDSLEKRSKMLNFKEKFLDNGTVIGLILETDEI-LNKKLRERKQEELQ 227 (403) T ss_pred eecccceEEecccccccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC-CCHHHHHHHHHHHH Confidence 78999999999654 3789999999999999999999999999999999999999998855 56777888999888 Q ss_pred HHh-CCcccCcceecCCCceeeeccC--ChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHH Q lcl|NC_019710. 249 EIA-GGPVKKRLWILEAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 249 ~~~-~~~~ag~~~~l~~g~~~~~l~~--s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~ 325 (424) +.+ |.+|+|++++|++|++|+++++ ++.|+||+|.++++.++||++|||||.+||.. +++|+|++.+.|+++ T Consensus 228 ~~~~g~~n~g~~~vl~~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~-----~~sn~e~~~~~f~~~ 302 (403) T protein:vir:10 228 LDYNPSTGQSSVLILDGGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGG-----NNANIRPNIELFYYM 302 (403) T ss_pred HHhCCcccCcceeecCCCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCC-----CCcCHHHHHHHHHHH Confidence 765 4578999999999999999985 68899999999999999999999999999753 245889999999999 Q ss_pred HHHHHHHHHHHHHhhhccChhhhccceeeecchhh--hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CcCee Q lcl|NC_019710. 326 TLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGL--LRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLP--GGDVA 401 (424) Q Consensus 326 tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~--~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~--ggd~~ 401 (424) ||.|++++||++|+++|. 
++++||.+.+ ++.|.+++++.+++++++|++|+||+|+++|+||+| +||++ T Consensus 303 tl~P~~~~ie~~l~~~L~-------~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi~~~~~d~~ 375 (403) T protein:vir:10 303 TIIPMLNKLTSSLTFFFG-------YKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPLDDEQMNKI 375 (403) T ss_pred HHHHHHHHHHHHHHHhcC-------ceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCccccccc Confidence 999999999999999883 3567777755 889999999999999999999999999999999995 79999 Q ss_pred eecccccchhhcc-ccCCCccCCC Q lcl|NC_019710. 402 MRQSQYVPITDLG-TNKEPRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~~-~~~~~~~~g~ 424 (424) ++|+|+....... .++++..+++ T Consensus 376 ~~p~n~~~~~~~~~~~e~~~~~~~ 399 (403) T protein:vir:10 376 RIPANVAGSATGVSGQEGGRPKGS 399 (403) T ss_pred ccccccccccccCCCCcCCCCCCC Confidence 9999997654432 2333333333 No 52 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=100.00 E-value=5.2e-84 Score=477.15 Aligned_cols=337 Identities=22% Similarity=0.327 Sum_probs=295.3 Q ss_pred hhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceE Q lcl|NC_019710. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v 153 (424) ||++||+++++++. 
.+|+++++|+.+||++||+++||+.++.+++++||||++++|+.+|.|++||||+|.+| T Consensus 1 ia~lp~~~~~~~~~-------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~l~~~~v 73 (348) T protein:vir:93 1 MASLPLKMYEDYKV-------VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVV 73 (348) T ss_pred CcccceEeEecCcC-------cccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCCce Confidence 99999999987643 36899999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcCCc--eEEEEE-ecCceEEecHhHeeEecCc-CCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEE Q lcl|NC_019710. 154 DVKLVGKK--VVYRYQ-RDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS 229 (424) Q Consensus 154 ~~~~~~~~--~~~~~~-~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~ 229 (424) ++..+.+. ..|.+. .++..+.|+++||||||++ +.++++|+||+.++..++++..++++++.. .++..++++++ T Consensus 74 ~~~~~~~~~~~~y~~~~~~g~~~~~~~~eiih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~--~~~~~~~~i~~ 151 (348) T protein:vir:93 74 EMLIENQSRELYYSIHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLT--EMQKPDSFMLK 151 (348) T ss_pred EEEEeCCCcEEEEEEEcCCCeEEEEccccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHH--hcCCCceeEEe Confidence 98876554 334443 4456678999999999986 457899999999999999999999988633 33444456666 Q ss_pred cCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCC Q lcl|NC_019710. 230 TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKST 309 (424) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~ 309 (424) .+. 
..++++.+.+++.|++.++ |+|+++++++|++|++++++++|+||+|++++++++||++|||||.+||+.+++ T Consensus 152 ~~~-~l~~e~~~~~~~~~~~~~~--n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~- 227 (348) T protein:vir:93 152 YGS-NVSTEKRQQVLEDFKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNT- 227 (348) T ss_pred cCC-CCCHHHHHHHHHHHHHHhh--cCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC- Confidence 664 4667778888889988764 678999999999999999999999999999999999999999999999976655 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhc-cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHH Q lcl|NC_019710. 310 SWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRR 388 (424) Q Consensus 310 ~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~-~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~ 388 (424) +++|+|++.++|+++||.|++++||++|+++|+++.++. +++++||.+.+++.|.+++++.+++++++|++|+||+|+ T Consensus 228 -~~~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~ 306 (348) T protein:vir:93 228 -NFAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIRE 306 (348) T ss_pred -CcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHH Confidence 456999999999999999999999999999999998764 688999999999999999999999999999999999999 Q ss_pred HhCCCCCCCcCeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 389 TDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 389 ~lg~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) ++|+||+||||++++++|++|++...++++...+|. 
T Consensus 307 ~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~gg~ 342 (348) T protein:vir:93 307 WEDLPPVEGGDKPLISGDLYPIDTPLELRKSLKGGD 342 (348) T ss_pred HhCCCCCCCcCeEeecccccccccchhhcccccCCC Confidence 999999999999999999999988766554333333 No 53 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=100.00 E-value=9.2e-84 Score=475.81 Aligned_cols=380 Identities=14% Similarity=0.135 Sum_probs=315.6 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +|||++.+... +.....+..+... ..+...+..++.+.++++++|++||++||++||++||++ T Consensus 1 M~~f~~~~~~~---~~~~~~~~~~~~~--~~~~~~~~~v~~~~al~~~~V~~~v~~ia~~ia~~p~~~------------ 63 (397) T protein:vir:38 1 MPLLKLNKSHS---QGFSLNDPDWVNF--LTGGEAQKYVSADTALKNSDIFSLIMQLSGDLAMVRYTS------------ 63 (397) T ss_pred Ccchhhhhccc---CcccCCchhhhhh--hcCCcCCceechHHhhccHHHHHHHHHHHHHHhhCcccc------------ Confidence 88888764322 1111111111111 123346778999999999999999999999999999964 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCC--ceEEEEEe--- Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK--KVVYRYQR--- 168 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~--~~~~~~~~--- 168 (424) .|+..++|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|||++|.+|++..+.+ ...|.+.. 
T Consensus 64 --~~~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~~~~~y~~~~~~~ 141 (397) T protein:vir:38 64 --ESDRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQDGSGLIYNINFDEP 141 (397) T ss_pred --cccHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEeccc Confidence 24567788899999999999999999999999999999999999999999999999999877544 34555543 Q ss_pred -cCceEEecHhHeeEecCcCCCC-ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHH Q lcl|NC_019710. 169 -DSEYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) Q Consensus 169 -~~~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~ 246 (424) ++..+.|+++||||+++++.++ ++|+||+.++..++....++.+++.++|+||++|+++|+.+... ++++.+.+++. T Consensus 142 ~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~-~~e~~~~~~~~ 220 (397) T protein:vir:38 142 AIGYMENVPAADVIHIRLLSKNGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKGG-LLDAETRIARS 220 (397) T ss_pred cccceeEecCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCC-CHHHHHHHHHH Confidence 3345789999999999988766 78999999999999999999999999999999999999998764 45667888999 Q ss_pred HHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHH Q lcl|NC_019710. 
247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~t 326 (424) ++...++.|+|+++++++|++|++++.++.|+||+|.+++.+++||++|||||.+||+.++++ +|.| +...||.+| T Consensus 221 ~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~---~~~e-~~~~~~~~~ 296 (397) T protein:vir:38 221 KEISKQIHNSDGPVVIDALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQ---SSIT-QISGQYAKS 296 (397) T ss_pred HHHHhcccccCCceecCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc---cHHH-HHHHHHHHH Confidence 999999999999999999999999999999999999999999999999999999999875433 3565 457789999 Q ss_pred HHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccc Q lcl|NC_019710. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQ 406 (424) Q Consensus 327 l~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n 406 (424) |.|++++||++|+++|+++.+ |++..+++.|.+++++.+++++++|+||+||+|+++|++|+++||.+..... T Consensus 297 l~P~~~~ie~~ln~~l~~~~~-------~~~~~~~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~ 369 (397) T protein:vir:38 297 LNRYVQAIVGELNDKLHANIS-------ANIRFAIDAMGDQYASTISSSVKGGTIAGNQARFILQNSGYLAKDLPDPEKE 369 (397) T ss_pred HHHHHHHHHHHHHHhccChhc-------ccccccccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccccccc Confidence 999999999999999998643 5556677889999999999999999999999999999999999998766655 Q ss_pred ccchhhccccCCCccCCC Q lcl|NC_019710. 407 YVPITDLGTNKEPRNNGA 424 (424) Q Consensus 407 ~~~~~~~~~~~~~~~~g~ 424 (424) ..+.......+++.+++. 
T Consensus 370 ~~~~~~~~~~~~g~~~~~ 387 (397) T protein:vir:38 370 PQQAIQLIQQEGGENDGN 387 (397) T ss_pred ccccccccccccCCCCCC Confidence 555444433333333333 No 54 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=100.00 E-value=2.1e-83 Score=473.81 Aligned_cols=377 Identities=15% Similarity=0.193 Sum_probs=313.5 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +|||+++. +...+.................+......++.+.|+++++|++||++||++||++||++++ T Consensus 1 Mg~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~v~~---------- 69 (385) T protein:vir:10 1 MGLLTPRN-FNKRKAKNMVYPSNPAFFTTTVGGMQLSYVSALSALQNTNVYSVINRIASDVASAHFKTEN---------- 69 (385) T ss_pred Cccccchh-cccccccccccccchhhhhhhccccCccccCHHHhhccHHHHHHHHHHHHHHhhCceeeec---------- Confidence 88888753 2222222211111111011112223456789999999999999999999999999999974 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCceEEEEEe--cCc Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQR--DSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~~~--~~~ 171 (424) |+...+| .+||++||+++||+.++.+++++||||++++|+ +.+++|+++.+|++..++++..|.+.. ++. 
T Consensus 70 ---~~~~~ll-~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~----~~~~~p~~~~~v~~~~~~~~~~~~~~~~~~~~ 141 (385) T protein:vir:10 70 ---TATLNRL-ESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ----NLEHIPNSDVQINYLPGNMGIVYTVLESNDRP 141 (385) T ss_pred ---cchhhhh-hcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC----ceeEeecCCceEEEEEcCCceEEEEEEcCCce Confidence 3344555 499999999999999999999999999999876 467999999999999888887776643 445 Q ss_pred eEEecHhHeeEecCcC---CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHH Q lcl|NC_019710. 172 YADFSQKEIFHLKGFG---FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFK 248 (424) Q Consensus 172 ~~~~~~~evih~r~~~---~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~ 248 (424) .++|+++||||||+++ .++++|+||+.++..++.+..++++++.++|+||++|+++|+.+....++++.+.+++.|+ T Consensus 142 ~~~~~~~eiihik~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~~ 221 (385) T protein:vir:10 142 QMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFE 221 (385) T ss_pred EEEEccccEEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHH Confidence 6789999999999865 3567999999999999999999999999999999999999999988888899999999999 Q ss_pred HHhCCcccCcceecCCCceeeeccCChhHHHHH-HHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHH Q lcl|NC_019710. 
249 EIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMM-ASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTL 327 (424) Q Consensus 249 ~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~-e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl 327 (424) +..++.|+|+++++++|++|++++.++.|+|++ |.+++++++||++|||||.+||+.+.++.+++|.|++ ..++..|| T Consensus 222 ~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~-~~~~~~~l 300 (385) T protein:vir:10 222 KANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQI-KATYLANL 300 (385) T ss_pred HHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHH-HHHHHHHH Confidence 999999999999999999999999999999974 9999999999999999999999987777788898865 55566799 Q ss_pred HHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCeeeecc Q lcl|NC_019710. 328 QPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG--GDVAMRQS 405 (424) Q Consensus 328 ~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g--gd~~~~~~ 405 (424) .|+++.|+++|+++|+++ +++||++.+++.|.+++++.+++++++|+||+||+|+++|++|+|+ ||++..+. T Consensus 301 ~P~~~~ie~~l~~~l~~~------~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~~~~~~~~~~~ 374 (385) T protein:vir:10 301 NSYVNPIVDELRLKMNAP------DLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPEFKPLT 374 (385) T ss_pred HHHHHHHHHHHHHhhCCc------eEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccCCCCCccccCcc Confidence 999999999999999864 4899999999999999999999999999999999999999999964 56666666 Q ss_pred cccchhhccccCCCccC Q lcl|NC_019710. 406 QYVPITDLGTNKEPRNN 422 (424) Q Consensus 406 n~~~~~~~~~~~~~~~~ 422 (424) +.+. 
..+..|| T Consensus 375 ~~~~------~g~~~dn 385 (385) T protein:vir:10 375 TQVK------GGDEGDN 385 (385) T ss_pred cccC------CCCCCCC Confidence 6433 1122222 No 55 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=100.00 E-value=8.5e-83 Score=470.52 Aligned_cols=376 Identities=15% Similarity=0.203 Sum_probs=316.1 Q ss_pred ccHHHHHHhhccCccccccccccccccc-ccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCcccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPV-SAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~ 92 (424) +|||++.. |.+..............+ ...+...+..++.+.|+++++|++||++||+++|++||++++ T Consensus 1 Mg~~~~~~--~~k~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~--------- 69 (383) T protein:vir:10 1 MGLLTPKN--FSKRNAKNMVYPSNPAFFTTTVGGMQLSYVSALSALQNTNVYSVINRIASDVSSAHFKTEN--------- 69 (383) T ss_pred CCcccccc--cccccccccccccchhhhhhhccCccccccchhHhhcchHHHHHHHHHHHhhccCceeecc--------- Confidence 88888742 222222211111111111 112234566789999999999999999999999999999864 Q ss_pred ccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCceEEEEE--ecC Q lcl|NC_019710. 93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQ--RDS 170 (424) Q Consensus 93 ~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~~--~~~ 170 (424) |+...+|. +||++||+++||+.++.+++++||||++++++ +.+++|+++.+|++..+.++.+|.+. 
.++ T Consensus 70 ----~~~~~ll~-~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~----~~~~~p~~~~~v~~~~~~~~~~~~~~~~~~~ 140 (383) T protein:vir:10 70 ----TATLNRLE-SPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ----NLEHIPNSDVQINYLPGNMGIVYTVLESNDR 140 (383) T ss_pred ----cchhhhhh-CCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC----ceeEeecCcceEEEEEcCCceEEEEEEcCCc Confidence 33445564 99999999999999999999999999999875 46789999999998888877766554 345 Q ss_pred ceEEecHhHeeEecCcCC---CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHH Q lcl|NC_019710. 171 EYADFSQKEIFHLKGFGF---TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENF 247 (424) Q Consensus 171 ~~~~~~~~evih~r~~~~---~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~ 247 (424) ..++|+++||||||+++. ++++|+||+.++..++....++++++.++|+||++|+++|+++....++++.+.+++.| T Consensus 141 ~~~~~~~~evih~r~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~~e~~~~~~~~~ 220 (383) T protein:vir:10 141 PKMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEF 220 (383) T ss_pred eEEEEcccceEEeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHH Confidence 578899999999997654 45789999999999999999999999999999999999999998888899999999999 Q ss_pred HHHhCCcccCcceecCCCceeeeccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHH Q lcl|NC_019710. 248 KEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEM-MASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 248 ~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~-~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~t 326 (424) ++..++.|+|+++++++|++|++++.+++|+|+ .|.+++++++||++|||||.+||+.+.++.+++|.|++.. 
++..| T Consensus 221 ~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~~~-~~~~~ 299 (383) T protein:vir:10 221 EKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKA-TYLAN 299 (383) T ss_pred HHHhCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHHHHH-HHHHH Confidence 999988999999999999999999999999997 5899999999999999999999987777777889888766 45579 Q ss_pred HHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccc Q lcl|NC_019710. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQ 406 (424) Q Consensus 327 l~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n 406 (424) |.|+++.||++|+++|+.+ +++||++.+++.|.+++++.+.+++++|+||+||+|+++|++|+|+||.+....+ T Consensus 300 l~P~~~~ie~~l~~~l~~~------~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~ 373 (383) T protein:vir:10 300 LNSYVNPIVDELRLKMNAP------DLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQAQFILTRSGFLPDNLPEFKPL 373 (383) T ss_pred HHHHHHHHHHHHHHhhCCc------eEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcccCCcccccCCC Confidence 9999999999999999764 5899999999999999999999999999999999999999999999998876666 Q ss_pred ccchhhccccC Q lcl|NC_019710. 407 YVPITDLGTNK 417 (424) Q Consensus 407 ~~~~~~~~~~~ 417 (424) ..++.. ++++ T Consensus 374 ~~~~~g-Gd~e 383 (383) T protein:vir:10 374 TNETKG-GDDK 383 (383) T ss_pred cccCCC-CCCC Confidence 555432 2222 No 56 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=100.00 E-value=2.9e-82 Score=467.59 Aligned_cols=352 Identities=16% Similarity=0.222 Sum_probs=303.3 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 
14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +|||++++.+ ....+. ..+........+..+..|+.+.|+++++|++||++||++||++|+. T Consensus 1 M~~~~~f~~r----~~~~~~-~~~~~~~~~~~~~~~~~v~~~~al~~~av~~cv~~ia~~ia~~p~~------------- 62 (359) T protein:vir:10 1 MSILNPFERR----SSITPN-NYYPFMVQNGSIVPNSLVDATEALKNSDLYAVTSLISSDIAGTRFI------------- 62 (359) T ss_pred Ccccchhhcc----ccCCCC-cchhhhhccccccCCcccCHHHhhcchHHHHHHHHHHHhhhcCccc------------- Confidence 8888865432 111111 1122222334466788899999999999999999999999999983 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCceEEEEEe--cCc Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQR--DSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~~~--~~~ 171 (424) .+++.++|+.+||++||+++||+.++.+++++||||++++|+.+|.+.+|||++|.+|++..+++...|.+.. ++. T Consensus 63 --~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~~~y~~~~~~~~~ 140 (359) T protein:vir:10 63 --GNQVFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIPSNAITIDLTDDTLTYEVNQFDDYP 140 (359) T ss_pred --cchHHHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeCCceEEEEEcCCeEEEEEEecCCce Confidence 3567778888999999999999999999999999999999999999999999999999999888877776653 456 Q ss_pred eEEecHhHeeEecCcCC-----CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHH Q lcl|NC_019710. 172 YADFSQKEIFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) Q Consensus 172 ~~~~~~~evih~r~~~~-----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~ 246 (424) ...++++||||||+++. ++++|+||+.++..++....+++++..++|+||++|+|+|+++....++++.+.+++. 
T Consensus 141 ~~~~~~~evih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~ 220 (359) T protein:vir:10 141 SAKYNASEMIHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSIRKE 220 (359) T ss_pred EEEEcccceEEeccCCCCCCccCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHH Confidence 78899999999997653 7789999999999999999999999999999999999999998777889999999999 Q ss_pred HHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHH Q lcl|NC_019710. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~t 326 (424) |++++++.|+|++++|++|++|++++++++|+||+|.+++++++||++|||||.+||+.++.+.++++++++...|+..+ T Consensus 221 ~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~~l~~~ 300 (359) T protein:vir:10 221 FEKANGGNNSGRVMVLDQSADFSTVSINADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQIKDLYVNALNRF 300 (359) T ss_pred HHHHhCccccCCceecCCCcceeeecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999988777778888988888888888 Q ss_pred HHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Q lcl|NC_019710. 
327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLP 396 (424) Q Consensus 327 l~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ 396 (424) |.|+++.|+..|+.++ .++...+.+.|.+.+...+.+++++|++|+||+|+++|++|+= T Consensus 301 l~p~~~~l~~~l~~~~-----------~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 301 IEPLISELRIKCDSSI-----------GVDMSPITDYSNSVFKADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred HHHHHHHHHHHhhhhh-----------cccchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 8888888888777654 2444445555566666778889999999999999999999975 No 57 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=100.00 E-value=1.1e-80 Score=458.90 Aligned_cols=420 Identities=11% Similarity=0.113 Sum_probs=312.4 Q ss_pred CCCCCcccccCCCccHH-HHHHhhccCccccccccc---------cccccc-----cc----ccccCCccccHHHHhhhH Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWW-ARLKSWFVGGRLVTPNQG---------SQTGPV-----SA----HGYLGDSSINDERILQIS 61 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~-~~~~~~~~~~~~~~~~~~---------~~~~~~-----~~----~~~~~~~~~~~~~~~~~~ 61 (424) |.-+.+-=.++.-+++= +.+.....+...+...+. ....+. .. ..+.....+.....++.. T Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~iv~~~i~~~~~ 106 (574) T protein:vir:80 27 MHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRNSQDLHKTLKKFGNNIILNAIINTRSN 106 (574) T ss_pred cccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCcCccCCcccHHHHHHhhccChhHHHHHHHHHH Confidence 22222111111111110 001111111111110000 000000 00 001111223344455667 Q ss_pred HHHHHHHHHHHhhhhCceeEeeccccCc--cccccccchhHHhhcc---CCCCCC-CHHHHHHHHHHHHHHcCCeEEEEe Q lcl|NC_019710. 
62 TVWRCVSLISTLTACLPLDVFETDQNDN--RKKVDLSNPLARLLRY---SPNQYM-TAQEFREAMTMQLCFYGNAYALVD 135 (424) Q Consensus 62 ~v~~~i~~ia~~ia~~~~~~~~~~~~~~--~~~~~~~~~l~~lL~~---~PN~~~-s~~~f~~~~~~~~l~~G~a~~~~~ 135 (424) .|++|+.+|+.++|++||+|++++.++. .++....|++.++|.. .|||++ |+.+|++.++.+++++||+|++++ T Consensus 107 ~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~~lll~Gnayi~i~ 186 (574) T protein:vir:80 107 QVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLVRATYMYDQVNFEKV 186 (574) T ss_pred HHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHHHHHHhcCCeEEEEE Confidence 7888888999999999999998765432 3445667899888854 466665 778999999999999999999999 Q ss_pred eCCCCceeEEEEeccceEEEEEcCCc-------eEEEEEecCceEEecHhHeeEecCcCC----CCccccchHHHHHHHH Q lcl|NC_019710. 136 RNSAGDVISLLPLQSANMDVKLVGKK-------VVYRYQRDSEYADFSQKEIFHLKGFGF----TGLVGLSPIAFACKSA 204 (424) Q Consensus 136 r~~~G~~~~l~~l~p~~v~~~~~~~~-------~~~~~~~~~~~~~~~~~evih~r~~~~----~~~~G~s~~~~~~~~i 204 (424) |+.+|.|++||||+|.+|++..+..+ .+|++..++....|+++||||+++++. ++.+|+||+.++..++ T Consensus 187 r~~~G~~~~L~pl~p~~V~v~~d~~~~~~~~~~~y~~~~~g~~~~~~~~~eiih~~~~~~~~~~~~~~G~spi~~a~~~i 266 (574) T protein:vir:80 187 FDKDGNFIKFDTVDPTTIFLATNGEGKLIKNGERFVQVIDNRIVAKFNERELAFAVRNPRADIEVGQYGYPELEIALKQF 266 (574) T ss_pred ECCCCcEEEEEEEcCceeEEEEcCccccccCceEEEEEeCCceEEEEccccEEEEeccCCCCcccccccccHHHHHHHHH Confidence 99999999999999999999877544 345566667778999999999996543 3578999999999999 Q ss_pred HHHHHHHHHHHHHHhccCCCceeEEcCCC-CCCHHHHHHHHHHHHHHhC-CcccCcce-ecCCCceeeeccCChhHHHHH Q lcl|NC_019710. 
205 GVAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAG-GPVKKRLW-ILEAGFSTSAIGVTPQDAEMM 281 (424) Q Consensus 205 ~~~~~~~~~~~~~~~ng~~p~~vl~~~~~-~~~~~~~~~~~~~~~~~~~-~~~ag~~~-~l~~g~~~~~l~~s~~d~~~~ 281 (424) ..+.++++++.++|+||++|+|||+.+.+ ..++++.+.+++.|++.++ ..|+|+++ ++++|++|++++++++|+||+ T Consensus 267 ~~~~~a~~~~~~~f~ng~~p~gil~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~~s~~D~qfl 346 (574) T protein:vir:80 267 IAHENTEVFNDRFFSHGGTTRGILHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMTPSANDMQFE 346 (574) T ss_pred HHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEEccCChhHHHHH Confidence 99999999999999999999999998654 4678889999999987655 47899975 557899999999999999999 Q ss_pred HHHHHHHHHHHHHhCCCHHHcCCCCCCC--------cccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhcccee Q lcl|NC_019710. 282 ASRKFQVSELARFFGVPPHLVGDVEKST--------SWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHA 353 (424) Q Consensus 282 e~~~~~~~~Ia~~fgVP~~~l~~~~~~~--------~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~ 353 (424) |++++++++||++|||||.+||..++++ .+++|+|++.+.|+++||.|++.+||++|+++|++..+. .+++ T Consensus 347 e~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~~-~~~~ 425 (574) T protein:vir:80 347 KWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNTYIVAEFGE-KYQF 425 (574) T ss_pred HHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcCC-ceEE Confidence 9999999999999999999999887654 357899999999999999999999999999999987654 4678 Q ss_pred eecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 354 EHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 354 ~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) +|+...+...+.. 
..+..++.+||||+||+|+++|+||+||||++++|+|+++++.....+..+...+ T Consensus 426 ~f~~~d~~~~~~~---~~~~~~~~~G~lT~NE~R~~lgl~Pi~gGD~~~~~~n~~~~~~~~~~~~~~~~~~ 493 (574) T protein:vir:80 426 QFRGGDLSAQLDK---LKIIEQEGKVFRTVNEIRHDKGLEPIKGGDVILNGVHIQAIGQALQEEQLEYQRS 493 (574) T ss_pred EecccchhhHHHH---HHHHHHHhCCccCHHHHHHHhCCCCCCCCCEeeeccceeecccccccccCCccch Confidence 8877666543222 2234578899999999999999999999999999999999876543322211111 No 58 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=100.00 E-value=8.5e-81 Score=459.56 Aligned_cols=373 Identities=13% Similarity=0.106 Sum_probs=300.7 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +|||++++++-.. +... ........++.+.|+++++|++||++||+++|++||++++++.. T Consensus 1 Mg~f~~~f~~~~~-----~~~~--------~~~~~~~~~~~~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~------ 61 (385) T protein:vir:95 1 MGLFDSVFKRHSE-----LSWM--------YDLEFLQDKSKKAYLKQIALNTVVEMVARTISQSEFRVMKNNTK------ 61 (385) T ss_pred CchhhhhhccCcc-----cccc--------cchhhhhccchhhhhhhHHHHHHHHHHHHHHcccceeeeecCcc------ Confidence 8999998753211 1111 11112344667889999999999999999999999999986532 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCceEEEEEecCceE Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEYA 173 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~~~~~~~~ 173 (424) ..|++.++|+.+||++||+++||+.++.+++++||||+++.++. +.+..++++.+..+...... .....+....... 
T Consensus 62 -~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 138 (385) T protein:vir:95 62 -EKGTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKNDEG-HFFVADDFEKEDELGLYSHR-FTNVLVNDFEFKR 138 (385) T ss_pred -ccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEEecCC-Ceeecccccccccccccccc-ceeeeecccceee Confidence 35899999999999999999999999999999999999987764 44555566656555433222 1111222333457 Q ss_pred EecHhHeeEecCcCCCC-ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCC-CCCCHHHHHHHHHHHHHHh Q lcl|NC_019710. 174 DFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE-KVLTEQQRSQVEENFKEIA 251 (424) Q Consensus 174 ~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~-~~~~~~~~~~~~~~~~~~~ 251 (424) .++++||||+|+++.++ .+|.||+..+..++....++.. +++.|+++++.+. ...++++.+.+++.|++.+ T Consensus 139 ~~~~~eiih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~-------~~~~~~g~l~~~~~~~~~~e~~~~~~~~~~~~~ 211 (385) T protein:vir:95 139 VFTMDDVIYLKYNNQKLDAFSLGLFEDYGEIFGRMIDLQM-------LNNQIRGILKVDATKFYNKEKQKELQAYIDTLF 211 (385) T ss_pred eeccccEEEecCCCCCcccccchHHHHHHHHHHHHHHHHH-------hcCCCceEEEeCCccCCCHHHHHHHHHHHHHHh Confidence 89999999999988775 7899999999998876554432 2344788888864 3467888899999999886 Q ss_pred CCc--ccCcceecCCCceeeeccC------ChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHH Q lcl|NC_019710. 252 GGP--VKKRLWILEAGFSTSAIGV------TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 252 ~~~--~ag~~~~l~~g~~~~~l~~------s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~ 323 (424) ++. 
+.++++++++|++|+++++ ++.|+||+|.++++.++||++|||||.+|++ +++|.|++.++|+ T Consensus 212 ~g~~~~~~~i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~------~~sn~e~~~~~~~ 285 (385) T protein:vir:95 212 DAFQNNTIAVVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVLG------EMADLEKTIESYL 285 (385) T ss_pred hhhhhcCCceEEcCCCceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhcC------CCcCHHHHHHHHH Confidence 653 4556899999999999974 6789999999999999999999999999963 3568999999999 Q ss_pred HHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCee Q lcl|NC_019710. 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPL--PGGDVA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~--~ggd~~ 401 (424) ++||.|++++||++|+++|+++.++...+++||++.+++.|.+++++.+++++++|+||+||+|+++|+||+ ||||++ T Consensus 286 ~~~l~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~~~gd~~ 365 (385) T protein:vir:95 286 QFCINPLLRKIEAELNSKFFYQDEYLNDDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMTGEEPADDPELDKF 365 (385) T ss_pred HHHHHHHHHHHHHHHHhhcCChhhcccceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee Confidence 999999999999999999999988877889999999999999999999999999999999999999999998 789999 Q ss_pred eecccccchhhccccCCCcc Q lcl|NC_019710. 
402 MRQSQYVPITDLGTNKEPRN 421 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~~~~ 421 (424) ++|+|++|++...+.+...+ T Consensus 366 ~~~~n~~~~~~~kgge~~~e 385 (385) T protein:vir:95 366 IITKNLQSADAFKGGESNEE 385 (385) T ss_pred eecccceecccccCCCCCCC Confidence 99999999887533332222 No 59 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=100.00 E-value=1.1e-80 Score=458.95 Aligned_cols=372 Identities=15% Similarity=0.145 Sum_probs=303.1 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +|||+++++.- .. + ..+..+..+..++.+.|+++++|++||++||+++|++||++|+++. T Consensus 1 Mg~f~~lf~~~---~~--~--------~~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~------- 60 (395) T protein:vir:10 1 MSILEKIFKTR---KD--I--------TYMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNR------- 60 (395) T ss_pred CchhhhhhccC---cc--c--------cccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCc------- Confidence 89999986532 11 1 0112234456778889999999999999999999999999997542 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCce--EEEEEecCc Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKV--VYRYQRDSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~--~~~~~~~~~ 171 (424) ...|++.++|+.+||++||+++||+.++.++++.|++|+++.++. .++++++..++........ .+.+...+. 
T Consensus 61 ~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (395) T protein:vir:10 61 IQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDDIFKDVTVKDYTY 135 (395) T ss_pred cccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCCccceeEeecCcceeEEEEcCcee Confidence 346899999999999999999999999999999999998876553 3566666666554333322 233334444 Q ss_pred eEEecHhHeeEecCcCCC-CccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019710. 172 YADFSQKEIFHLKGFGFT-GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 172 ~~~~~~~evih~r~~~~~-~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) ..+++++||||+|+++.+ ..+|.||+.++..++.... +.|.+|+.++++|+.+....++++++.+++.|+++ T Consensus 136 ~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~ 208 (395) T protein:vir:10 136 QRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKL 208 (395) T ss_pred eeeeccccEEEEccCCCCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHH Confidence 578999999999987654 5789999999888776544 35678888999999988888999999999999999 Q ss_pred hCCcccCc--ceecCCCceeeeccCChhHH-----HHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHH Q lcl|NC_019710. 
251 AGGPVKKR--LWILEAGFSTSAIGVTPQDA-----EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 251 ~~~~~ag~--~~~l~~g~~~~~l~~s~~d~-----~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~ 323 (424) .++.++++ ++++++|++|+++++++.++ ||+|.++++.++||++|||||.+|++ +++|.|++.++|+ T Consensus 209 ~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~------~~sn~e~~~~~~~ 282 (395) T protein:vir:10 209 FNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG------ETADLEKNTLVFE 282 (395) T ss_pred hccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC------cccCHHHHHHHHH Confidence 88876655 45579999999999888765 89999999999999999999999973 3458999999999 Q ss_pred HHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--Cee Q lcl|NC_019710. 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGG--DVA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~gg--d~~ 401 (424) ++||.|++++||++|+++|+++.++.. +++||++.+++.|.+++++++.+++++|+||+||+|+++|+||+||| |++ T Consensus 283 ~~~l~P~~~~ie~~l~~kL~~~~~~~~-~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~ 361 (395) T protein:vir:10 283 KFCLTPLLKKIQNELNAKLITQSMYLK-DTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEY 361 (395) T ss_pred HHHHHHHHHHHHHHHHHhhcChhhhcc-cceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee Confidence 999999999999999999999876543 46899999999999999999999999999999999999999999876 999 Q ss_pred eecccccchhhccccCCCccCCC Q lcl|NC_019710. 
402 MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) ++|+|+++++...+++.+.+..+ T Consensus 362 ~~~~n~~~~~~~~~~~~~~~~~~ 384 (395) T protein:vir:10 362 LITKNYEKANSGENDEKEKDENT 384 (395) T ss_pred eeccccccccccccccCcccccc Confidence 99999999886544332222111 No 60 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=100.00 E-value=1.1e-80 Score=458.95 Aligned_cols=372 Identities=15% Similarity=0.145 Sum_probs=303.1 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +|||+++++.- .. + ..+..+..+..++.+.|+++++|++||++||+++|++||++|+++. T Consensus 1 Mg~f~~lf~~~---~~--~--------~~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~------- 60 (395) T protein:vir:95 1 MSILEKIFKTR---KD--I--------TYMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNR------- 60 (395) T ss_pred CchhhhhhccC---cc--c--------cccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCc------- Confidence 89999986532 11 1 0112234456778889999999999999999999999999997542 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCce--EEEEEecCc Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKV--VYRYQRDSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~--~~~~~~~~~ 171 (424) ...|++.++|+.+||++||+++||+.++.++++.|++|+++.++. .++++++..++........ .+.+...+. 
T Consensus 61 ~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (395) T protein:vir:95 61 IQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDDIFKDVTVKDYTY 135 (395) T ss_pred cccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCCccceeEeecCcceeEEEEcCcee Confidence 346899999999999999999999999999999999998876553 3566666666554333322 233334444 Q ss_pred eEEecHhHeeEecCcCCC-CccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019710. 172 YADFSQKEIFHLKGFGFT-GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 172 ~~~~~~~evih~r~~~~~-~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) ..+++++||||+|+++.+ ..+|.||+.++..++.... +.|.+|+.++++|+.+....++++++.+++.|+++ T Consensus 136 ~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~ 208 (395) T protein:vir:95 136 QRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKL 208 (395) T ss_pred eeeeccccEEEEccCCCCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHH Confidence 578999999999987654 5789999999888776544 35678888999999988888999999999999999 Q ss_pred hCCcccCc--ceecCCCceeeeccCChhHH-----HHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHH Q lcl|NC_019710. 
251 AGGPVKKR--LWILEAGFSTSAIGVTPQDA-----EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 251 ~~~~~ag~--~~~l~~g~~~~~l~~s~~d~-----~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~ 323 (424) .++.++++ ++++++|++|+++++++.++ ||+|.++++.++||++|||||.+|++ +++|.|++.++|+ T Consensus 209 ~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~------~~sn~e~~~~~~~ 282 (395) T protein:vir:95 209 FNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG------ETADLEKNTLVFE 282 (395) T ss_pred hccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC------cccCHHHHHHHHH Confidence 88876655 45579999999999888765 89999999999999999999999973 3458999999999 Q ss_pred HHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--Cee Q lcl|NC_019710. 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGG--DVA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~gg--d~~ 401 (424) ++||.|++++||++|+++|+++.++.. +++||++.+++.|.+++++++.+++++|+||+||+|+++|+||+||| |++ T Consensus 283 ~~~l~P~~~~ie~~l~~kL~~~~~~~~-~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~ 361 (395) T protein:vir:95 283 KFCLTPLLKKIQNELNAKLITQSMYLK-DTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEY 361 (395) T ss_pred HHHHHHHHHHHHHHHHHhhcChhhhcc-cceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee Confidence 999999999999999999999876543 46899999999999999999999999999999999999999999876 999 Q ss_pred eecccccchhhccccCCCccCCC Q lcl|NC_019710. 
402 MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) ++|+|+++++...+++.+.+..+ T Consensus 362 ~~~~n~~~~~~~~~~~~~~~~~~ 384 (395) T protein:vir:95 362 LITKNYEKANSGENDEKEKDENT 384 (395) T ss_pred eeccccccccccccccCcccccc Confidence 99999999886544332222111 No 61 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=100.00 E-value=1.1e-80 Score=458.95 Aligned_cols=372 Identities=15% Similarity=0.145 Sum_probs=303.1 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +|||+++++.- .. + ..+..+..+..++.+.|+++++|++||++||+++|++||++|+++. T Consensus 1 Mg~f~~lf~~~---~~--~--------~~~~~~~~~~~v~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~------- 60 (395) T protein:vir:10 1 MSILEKIFKTR---KD--I--------TYMLDLDMIEDLSQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNR------- 60 (395) T ss_pred CchhhhhhccC---cc--c--------cccccchhccccchhhhhhhHHHHHHHHHHHHhhccceeEeccCCc------- Confidence 89999986532 11 1 0112234456778889999999999999999999999999997542 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCce--EEEEEecCc Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKV--VYRYQRDSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~--~~~~~~~~~ 171 (424) ...|++.++|+.+||++||+++||+.++.++++.|++|+++.++. .++++++..++........ .+.+...+. 
T Consensus 61 ~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (395) T protein:vir:10 61 IQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSK-----ELLIADSFYREEYALYDDIFKDVTVKDYTY 135 (395) T ss_pred cccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCC-----CeEecCCccceeEeecCcceeEEEEcCcee Confidence 346899999999999999999999999999999999998876553 3566666666554333322 233334444 Q ss_pred eEEecHhHeeEecCcCCC-CccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019710. 172 YADFSQKEIFHLKGFGFT-GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 172 ~~~~~~~evih~r~~~~~-~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) ..+++++||||+|+++.+ ..+|.||+.++..++.... +.|.+|+.++++|+.+....++++++.+++.|+++ T Consensus 136 ~~~~~~~evih~~~~~~~~~~~G~spi~~~~~~~~~~~-------~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~ 208 (395) T protein:vir:10 136 QRTFTMQEVIYLKYNNNKVTHFVESLFEDYGKIFGRMI-------GAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKL 208 (395) T ss_pred eeeeccccEEEEccCCCCcccccchHHHHHHHHHHHHH-------HHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHH Confidence 578999999999987654 5789999999888776544 35678888999999988888999999999999999 Q ss_pred hCCcccCc--ceecCCCceeeeccCChhHH-----HHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHH Q lcl|NC_019710. 
251 AGGPVKKR--LWILEAGFSTSAIGVTPQDA-----EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 251 ~~~~~ag~--~~~l~~g~~~~~l~~s~~d~-----~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~ 323 (424) .++.++++ ++++++|++|+++++++.++ ||+|.++++.++||++|||||.+|++ +++|.|++.++|+ T Consensus 209 ~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~------~~sn~e~~~~~~~ 282 (395) T protein:vir:10 209 FNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG------ETADLEKNTLVFE 282 (395) T ss_pred hccccccCcceEEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC------cccCHHHHHHHHH Confidence 88876655 45579999999999888765 89999999999999999999999973 3458999999999 Q ss_pred HHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--Cee Q lcl|NC_019710. 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGG--DVA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~gg--d~~ 401 (424) ++||.|++++||++|+++|+++.++.. +++||++.+++.|.+++++++.+++++|+||+||+|+++|+||+||| |++ T Consensus 283 ~~~l~P~~~~ie~~l~~kL~~~~~~~~-~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~ 361 (395) T protein:vir:10 283 KFCLTPLLKKIQNELNAKLITQSMYLK-DTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEY 361 (395) T ss_pred HHHHHHHHHHHHHHHHHhhcChhhhcc-cceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee Confidence 999999999999999999999876543 46899999999999999999999999999999999999999999876 999 Q ss_pred eecccccchhhccccCCCccCCC Q lcl|NC_019710. 
402 MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) ++|+|+++++...+++.+.+..+ T Consensus 362 ~~~~n~~~~~~~~~~~~~~~~~~ 384 (395) T protein:vir:10 362 LITKNYEKANSGENDEKEKDENT 384 (395) T ss_pred eeccccccccccccccCcccccc Confidence 99999999886544332222111 No 62 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=100.00 E-value=5.9e-80 Score=454.93 Aligned_cols=409 Identities=11% Similarity=0.101 Sum_probs=304.6 Q ss_pred cCCCccHHHHHHh-------hcc-------------------------Cccccccccccccccccccccc-CCcc----- Q lcl|NC_019710. 10 LRTNNGWWARLKS-------WFV-------------------------GGRLVTPNQGSQTGPVSAHGYL-GDSS----- 51 (424) Q Consensus 10 ~~~~~G~~~~~~~-------~~~-------------------------~~~~~~~~~~~~~~~~~~~~~~-~~~~----- 51 (424) |...+|+|++++- ++. +...+-..+....-.+. .++. .... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~~~~-~~~~~r~~~~~~~~ 79 (551) T protein:vir:80 1 MKNKLGLFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSAN-PGFKTKPSIRNNQD 79 (551) T ss_pred CchhhhhHHHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccccceecC-cccccCccccChhH Confidence 7788899999882 111 00000000000000000 0000 0011 Q ss_pred --ccHHHHhhhHHHHHHHHHHHHhhhhC-----------ceeEeeccccC--ccccccccchhHHhhccCCCCCC----- Q lcl|NC_019710. 52 --INDERILQISTVWRCVSLISTLTACL-----------PLDVFETDQND--NRKKVDLSNPLARLLRYSPNQYM----- 111 (424) Q Consensus 52 --~~~~~~~~~~~v~~~i~~ia~~ia~~-----------~~~~~~~~~~~--~~~~~~~~~~l~~lL~~~PN~~~----- 111 (424) ...+.+..+|+|++||++||++||++ +|.+.-++.+. 
..+.....+.+..+| .+||+++ T Consensus 80 l~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l-~~pn~~~~p~~~ 158 (551) T protein:vir:80 80 LHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFI-EKTGVDNDINRD 158 (551) T ss_pred HHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHH-HhcCCCCCCccc Confidence 11235667899999999999999984 44443222111 111111223455555 4899874 Q ss_pred CHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCce-------EEEEEecCceEEecHhHeeEec Q lcl|NC_019710. 112 TAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKV-------VYRYQRDSEYADFSQKEIFHLK 184 (424) Q Consensus 112 s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~-------~~~~~~~~~~~~~~~~evih~r 184 (424) |+.+|++.++.+++++||||++++|+.+|.|.+||||+|.+|++..+.++. ++++..++....|+++||||++ T Consensus 159 s~~~f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~eiiH~~ 238 (551) T protein:vir:80 159 SFSSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFNAREMAFAV 238 (551) T ss_pred hHHHHHHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCCceeEEEECCccccccCceEEEEEeCCcEEEEEcccceEEec Confidence 788999999999999999999999999999999999999999988776542 3334445556789999999999 Q ss_pred CcCC----CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCC-CCCHHHHHHHHHHHHHHhC-CcccCc Q lcl|NC_019710. 185 GFGF----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAG-GPVKKR 258 (424) Q Consensus 185 ~~~~----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~-~~~~~~~~~~~~~~~~~~~-~~~ag~ 258 (424) +++. ++.+|+||+.++..++..+.++++++.++|+||++|+|+|+.+.. 
..++++.+.+++.|++.++ ..|+|+ T Consensus 239 ~n~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~ 318 (551) T protein:vir:80 239 RNPRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQ 318 (551) T ss_pred ccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCc Confidence 7543 357899999999999999999999999999999999999998643 4678889999999987655 479999 Q ss_pred ceec-CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCC--------CcccccHHHHHHHHHHHHHHH Q lcl|NC_019710. 259 LWIL-EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKS--------TSWGSGIEQQNLGFLQYTLQP 329 (424) Q Consensus 259 ~~~l-~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~--------~~~~~n~e~~~~~f~~~tl~P 329 (424) ++++ ++|++|++++++++|+||+|++++++++||++|||||.+||...++ +.+++|+|++...|+++||.| T Consensus 319 ~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P 398 (551) T protein:vir:80 319 IPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQP 398 (551) T ss_pred cccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHH Confidence 8666 6899999999999999999999999999999999999999986553 346789999999999999999 Q ss_pred HHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC-CCCcCeeeeccccc Q lcl|NC_019710. 330 YISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPP-LPGGDVAMRQSQYV 408 (424) Q Consensus 330 ~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p-~~ggd~~~~~~n~~ 408 (424) ++.+||++|+++|++..+. .++|+++.+...+..+++++++ ++.+|+||+||+|+++|+|| +||||+++.|.++. 
T Consensus 399 ~~~~ie~~ln~~L~~~~~~---~~~f~f~~~~~~~~~~~~~~~~-~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~ 474 (551) T protein:vir:80 399 LLGFIEDFINKHIVAEFGD---KYTFQFVGGDIKSELESVKILA-EKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQ 474 (551) T ss_pred HHHHHHHHHHhhhccccCC---ceEEEeeccChhhHHHHHHHHH-HHhcCCcCHHHHHHHhCCCCCCCCCceeecccccc Confidence 9999999999999986543 2455556777777777777654 66789999999999999998 79999999999888 Q ss_pred chhhccccCCCcc--------------------------CCC Q lcl|NC_019710. 409 PITDLGTNKEPRN--------------------------NGA 424 (424) Q Consensus 409 ~~~~~~~~~~~~~--------------------------~g~ 424 (424) ++......+.++. +++ T Consensus 475 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 516 (551) T protein:vir:80 475 RIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGK 516 (551) T ss_pred cccccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCcc Confidence 7654322111100 000 No 63 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=100.00 E-value=4.5e-80 Score=455.59 Aligned_cols=377 Identities=13% Similarity=0.155 Sum_probs=307.2 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +|||++.+..................+.....+.++..++.+.++++|+|++||++||++||++|++++++. 
T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~p~~~~~~~-------- 72 (386) T protein:vir:48 1 MPIFNITNLATESPPISQGGFFDITDPDFLSTLNGSEWVSAESALRNSDLFSIINQLSNDLATVKLTASRKQ-------- 72 (386) T ss_pred CcccccccccccccccccccccccccchhcccccCCceechhhhhcchHHHHHHHHHHHhhccCceeeccch-------- Confidence 888887543221111111111111122223446778889999999999999999999999999999998542 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCC--ceEEEEEecC- Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK--KVVYRYQRDS- 170 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~--~~~~~~~~~~- 170 (424) ...|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|||++|.+|++..+.+ ..+|.+...+ T Consensus 73 ------~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~~ 146 (386) T protein:vir:48 73 ------LQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDGIYYNITFDDP 146 (386) T ss_pred ------hHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEEcCCCceEEEEEEecCc Confidence 2347789999999999999999999999999999999999999999999999999887754 3455554333 Q ss_pred ---ceEEecHhHeeEecCcCCCC-ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHH Q lcl|NC_019710. 171 ---EYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) Q Consensus 171 ---~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~ 246 (424) ..+.|+++||||+|+++.++ ++|+||+.++..++.+..++++++.++|+||++|+++|+.+.... +++.+.+++. 
T Consensus 147 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~-~e~~~~~~~~ 225 (386) T protein:vir:48 147 RIPPKQHVPQGDVLHFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGGGL-LDFKTKLSRS 225 (386) T ss_pred cccceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCC-HHHHHHHHHH Confidence 34679999999999988876 899999999999999999999999999999999999999987755 4455556666 Q ss_pred HHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHH Q lcl|NC_019710. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~t 326 (424) +.. +..++|+++++++|++|++++++++|+||+|+++++.++||++|||||.+||... +++|++++.++|+++| T Consensus 226 ~~~--~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~----~~~~~e~~~~~~~~~~ 299 (386) T protein:vir:48 226 RQA--MKQMQGGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQG----DQQSSLEMSLDLYNKA 299 (386) T ss_pred HHH--hhcCCCCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC----CcccHHHHHHHHHHHH Confidence 654 3467899999999999999999999999999999999999999999999998632 3458899999999999 Q ss_pred HHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeee-cc Q lcl|NC_019710. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMR-QS 405 (424) Q Consensus 327 l~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~-~~ 405 (424) |.|+++.||++|+++|++.. ++|....++.|...++..+++++++|++|+||+|+.+|++|++++|.... .. 
T Consensus 300 l~P~~~~ie~~l~~~l~~~~-------~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~~~~~~~~~~ 372 (386) T protein:vir:48 300 VSRYLRPFLSELSQKLSCDV-------DADILPAVDPTGSNSVSRINSMVKSGTLAQNQGLYILQQAEILPKELPEGENP 372 (386) T ss_pred HHHHHHHHHHHHHHhhcchh-------hcchhhhhccChHHHHHHHHHHHhCCCcCHHHHHHHhhcCCCCCccchhhcCC Confidence 99999999999999998753 46667778888899999999999999999999999999999988876532 33 Q ss_pred cccchhhccccCCCccCCC Q lcl|NC_019710. 406 QYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 406 n~~~~~~~~~~~~~~~~g~ 424 (424) +..|+. +++++|. T Consensus 373 ~~~~~~------gGd~~~~ 385 (386) T protein:vir:48 373 NKTTLK------GGEINGE 385 (386) T ss_pred CCCccC------CCCCCCC Confidence 444432 2223333 No 64 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=100.00 E-value=1.1e-79 Score=453.46 Aligned_cols=367 Identities=13% Similarity=0.127 Sum_probs=295.2 Q ss_pred CCccHHHHHHhhccCccccccccccc----ccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeecccc Q lcl|NC_019710. 12 TNNGWWARLKSWFVGGRLVTPNQGSQ----TGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQN 87 (424) Q Consensus 12 ~~~G~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~ 87 (424) =-+||++.++..-.+.....+..... ...+.......+.+++.+.|+++++|++||++||++||++|++++++.. T Consensus 1 m~m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~~~~~~~- 79 (392) T protein:vir:74 1 MILPILNFINQTNDPPEAGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN- 79 (392) T ss_pred CcchhhhhhhcccCcccccccccccccCchhhhhhhccCCCCcccchhhhhcchHHHHHHHHHHHhhccCceeeccchh- Confidence 12345544332211111111111111 1112222234577899999999999999999999999999999986542 Q ss_pred CccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcC--CceEEE Q lcl|NC_019710. 
88 DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG--KKVVYR 165 (424) Q Consensus 88 ~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~--~~~~~~ 165 (424) ..|+.+||++||+++||+.++.+++++||||++++|+.+|.+.+||||+|.+|++..+. +...|. T Consensus 80 -------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~~y~ 146 (392) T protein:vir:74 80 -------------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYN 146 (392) T ss_pred -------------hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEE Confidence 23667999999999999999999999999999999999999999999999999988764 344555 Q ss_pred EEecC----ceEEecHhHeeEecCcCCCC-ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHH Q lcl|NC_019710. 166 YQRDS----EYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 166 ~~~~~----~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~ 240 (424) +...+ ....++++||||+++++.++ ++|+||+.++..++++..++++++.++|+||++|+++|+++.+...++. T Consensus 147 ~~~~~~~~~~~~~~~~~evih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~- 225 (392) T protein:vir:74 147 ITFDDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDK- 225 (392) T ss_pred EEecCCccceeEEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHH- Confidence 55433 35689999999999998887 7899999999999999999999999999999999999999876544332 Q ss_pred HHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHH Q lcl|NC_019710. 
241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) Q Consensus 241 ~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~ 320 (424) ..+++.+.+.+..|+|++++|++|++|++++++++|+||+|.++++.++||++|||||.+||+..+++ +.+++.+ T Consensus 226 -~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~----~~~e~~~ 300 (392) T protein:vir:74 226 -DKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ----SSIQQIS 300 (392) T ss_pred -HHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc----cHHHHHH Confidence 23445556778889999999999999999999999999999999999999999999999999765433 4457789 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---------- Q lcl|NC_019710. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTD---------- 390 (424) Q Consensus 321 ~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~l---------- 390 (424) +|+.+||.|+++.||++|+++|++. ++||...+++.|.+.+++.+..++++|++|+||+|+++ T Consensus 301 ~~~~~~l~p~~~~ie~~l~~~l~~~-------~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pne~ 373 (392) T protein:vir:74 301 GMYASALNRYLRPAISELEYKLSDH-------ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDL 373 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHhccch-------hcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCcccc Confidence 9999999999999999999999864 56889999999999999999999999999999999887 Q ss_pred ----CCCCCCCcCeeeecccccc Q lcl|NC_019710. 
391 ----NLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 391 ----g~~p~~ggd~~~~~~n~~~ 409 (424) |+||++|||+ .+-+| T Consensus 374 r~~enl~~~~~Gd~----~~p~p 392 (392) T protein:vir:74 374 PAPENTNKKTTGQS----NEPVP 392 (392) T ss_pred chhcCCCCCCCCCC----CCCCC Confidence 5555555543 11222 No 65 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=100.00 E-value=1.7e-80 Score=457.94 Aligned_cols=368 Identities=14% Similarity=0.181 Sum_probs=310.8 Q ss_pred ccHHHHHHhhccCcccccccccccc---cccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCcc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQT---GPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNR 90 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 90 (424) +|||+++. .+............ .+.....|.++..++.+.|+++++|++||++||++||++||+++++.. T Consensus 1 Mglf~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~V~~~i~~Ia~~ia~l~~~~~~~~~---- 73 (384) T protein:vir:49 1 MPIFNITN---LATESPPSNQDSFFDITDPEFLDALNGSEWVSAETALKNSDLFSIISQLSNDLATAKITTSRKQL---- 73 (384) T ss_pred Cccccccc---cCcccccccchhhccccchhhcccccCCceechhhhhccHHHHHHHHHHHHHHhhCceeeecchh---- Confidence 88888643 11111111111111 122233467788899999999999999999999999999999986542 Q ss_pred ccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcC--CceEEEEEe Q lcl|NC_019710. 91 KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG--KKVVYRYQR 168 (424) Q Consensus 91 ~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~--~~~~~~~~~ 168 (424) ..|+.+||++||+++|++.++.+++++||||++++|+.+|++++||||+|.+|++..+. +...|.+.. 
T Consensus 74 ----------~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~ 143 (384) T protein:vir:49 74 ----------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNQNGLYYNITF 143 (384) T ss_pred ----------hhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEe Confidence 23678999999999999999999999999999999999999999999999999987654 444555554 Q ss_pred c----CceEEecHhHeeEecCcCCCC-ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHH Q lcl|NC_019710. 169 D----SEYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) Q Consensus 169 ~----~~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~ 243 (424) . +..+.|+++||||+|+++.++ ++|+||+.++..++++..++++++.++|+||++|+++|+.+.....++.. T Consensus 144 ~~~~~~~~~~~~~~eVih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~--- 220 (384) T protein:vir:49 144 DDPRIPPKQHVPQGDILHFRLLSVDGGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKT--- 220 (384) T ss_pred cCccccceeEecCccEEEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHH--- Confidence 3 345789999999999988776 89999999999999999999999999999999999999998776655443 Q ss_pred HHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHH Q lcl|NC_019710. 
244 EENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 244 ~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~ 323 (424) +++.+.+.+.+|+|+++++++|++|++++++++|+|++|.+++++++||++|||||.+||+..++++++++.+++...|+ T Consensus 221 ~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i 300 (384) T protein:vir:49 221 KQSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAV 300 (384) T ss_pred HHHHHHHhcccCCccceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHHH Confidence 34556677888999999999999999999999999999999999999999999999999998777778889999999999 Q ss_pred HHHHHHHHHHHHHHHhhhccC---hhh-hccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC Q lcl|NC_019710. 324 QYTLQPYISRWENSIQRWLIP---AKD-VGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGD 399 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~L~~---~~~-~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd 399 (424) ..++.|+++.|+++|+.+|.. ... ....+++|+++.+++.|..++.+++..+.+.|+++ ||+|+.+|++|+|||| T Consensus 301 ~~~l~pi~~~i~~~l~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~-ne~r~~~~~~p~~gGd 379 (384) T protein:vir:49 301 SRFLRPFVSELSKKLSCEVDADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILP-KDLPEGETDSTLKGGE 379 (384) T ss_pred HHHHHHHHHHHHHHhchhhhhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCC-hhHHHHcCCCCCCCCC Confidence 999999999999999998743 222 33577899999999999999999999999999986 9999999999999863 Q ss_pred --eee Q lcl|NC_019710. 
400 --VAM 402 (424) Q Consensus 400 --~~~ 402 (424) +.+ T Consensus 380 ~~~~~ 384 (384) T protein:vir:49 380 TNEQY 384 (384) T ss_pred CCCCC Confidence 333 No 66 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=100.00 E-value=1.9e-79 Score=452.13 Aligned_cols=420 Identities=11% Similarity=0.111 Sum_probs=314.4 Q ss_pred CCCCCcccccCCCc---cHHHHHHhhccCcccccc-c-cccccccccc------------ccc---cCCcc---ccHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTNN---GWWARLKSWFVGGRLVTP-N-QGSQTGPVSA------------HGY---LGDSS---INDERI 57 (424) Q Consensus 1 ~~~~~~~~~~~~~~---G~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~------------~~~---~~~~~---~~~~~~ 57 (424) |.-||-+..++-++ -.+++-..-.+.++..+. . ......+.++ ..+ ..+.. ...+.+ T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~~~~~~~~~~~~i~t~ 92 (535) T protein:vir:10 13 LSNKKSTSYIELGDYDKDIVNKAIRPGRASARDTVDGIDIADGNVAGQYSVASISDVLSTKKLLKAYADNDIVQAIIRTR 92 (535) T ss_pred hhhhhhhhhHHHhhhhHHHHHhhhhhhhhhhhccccccccccCCcccccccCccccccCHHHHHHHhccChhHHHHHHHH Confidence 66666665554442 122211110000000000 0 0001111000 000 00111 111345 Q ss_pred hhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHH----HHHHHHHHHHHc-CCeEE Q lcl|NC_019710. 
58 LQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQE----FREAMTMQLCFY-GNAYA 132 (424) Q Consensus 58 ~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~----f~~~~~~~~l~~-G~a~~ 132 (424) ....+||+|+.++++.++++|+++++.+..+..++....|++.++|+.+||++|++++ |++.++.+++++ |++|+ T Consensus 93 ~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv~d~l~~~g~ay~ 172 (535) T protein:vir:10 93 TNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDTFPRLLTKIINDMYVQDQINIE 172 (535) T ss_pred HHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCChhHHHHHHHHHHHHHHHhhCCceEE Confidence 5677888999999999999999999998888777778889999999999999999876 556667775555 58999 Q ss_pred EEeeCCCCceeEEEEeccceEEEEEcCCc-----eEEEEEecCceEEecHhHeeEecCcCC----CCccccchHHHHHHH Q lcl|NC_019710. 133 LVDRNSAGDVISLLPLQSANMDVKLVGKK-----VVYRYQRDSEYADFSQKEIFHLKGFGF----TGLVGLSPIAFACKS 203 (424) Q Consensus 133 ~~~r~~~G~~~~l~~l~p~~v~~~~~~~~-----~~~~~~~~~~~~~~~~~evih~r~~~~----~~~~G~s~~~~~~~~ 203 (424) +++|+..|+|.+||||+|.+|++..++.+ .+|.+..++....|+++|||||++++. ++.+|+||+.++..+ T Consensus 173 ~i~r~~~G~~~~L~~l~p~~V~v~~d~~~~~~~~~~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~~~~G~Spi~~~~~~ 252 (535) T protein:vir:10 173 RIFKNDSNELDHFNAVDASKVVISYSPRSKDQPRKFEQFVSETKSVKFSERNLTFINYWNLSDTDRRGYGYSPVEASIPL 252 (535) T ss_pred EEEECCCCcEEEEEEeCCceeEEEEcCccccCceEEEEEecCceeEEECcccEEEEeccCCCCcccccccccHHHHHHHH Confidence 99999999999999999999998876433 456666777788999999999997653 356899999999999 Q ss_pred HHHHHHHHHHHHHHHhccCCCceeEEcCCC---CCCHHHHHHHHHHHHHHhC-CcccCcceecC-CCceeeeccCChhHH Q lcl|NC_019710. 204 AGVAVAMEDQQRDFFANGAKSPQILSTGEK---VLTEQQRSQVEENFKEIAG-GPVKKRLWILE-AGFSTSAIGVTPQDA 278 (424) Q Consensus 204 i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~---~~~~~~~~~~~~~~~~~~~-~~~ag~~~~l~-~g~~~~~l~~s~~d~ 278 (424) +..+.++++++.++|+||++|+|||+++.. ..++++.+.+++.|++.++ ..|+|+++++. 
+|++|++++++++|+ T Consensus 253 i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~~~D~ 332 (535) T protein:vir:10 253 IRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQANQMMLAGIRRQWTSQGSGLGGAWKIPILAAKDAKFVNMTQNSRDM 332 (535) T ss_pred HHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCcccccccccccCCCceEEecCCChhHH Confidence 999999999999999999999999998754 3567889999999987654 46899987765 699999999999999 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcc----------cccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhh Q lcl|NC_019710. 279 EMMASRKFQVSELARFFGVPPHLVGDVEKSTSW----------GSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDV 348 (424) Q Consensus 279 ~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~----------~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~ 348 (424) ||+|+++++.++||++|||||.+||..++++++ .+++|++...|+++||.|++++||++||++|++..+. T Consensus 333 qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ln~~Ll~~~~~ 412 (535) T protein:vir:10 333 EFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQVINDKIMRYVDT 412 (535) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccCC Confidence 999999999999999999999999998887653 3457888889999999999999999999999987653 Q ss_pred ccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecc---cccchhhccccC--CCc-cC Q lcl|NC_019710. 349 GRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQS---QYVPITDLGTNK--EPR-NN 422 (424) Q Consensus 349 ~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~---n~~~~~~~~~~~--~~~-~~ 422 (424) +++|+++.+++.|.++++++++.+. .|+||+||+|+++|+||+||||+++... +++.....+... .+. +. 
T Consensus 413 ---~~~f~f~~l~~~d~~~r~~~~~~~~-~g~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 488 (535) T protein:vir:10 413 ---DYRFSFTLGDAQDKLQEEQVWKLKL-ANGYFINEYRKDHGLKTVDGLDVPGFIGSAENFINATGFGQPNVPDSSDDS 488 (535) T ss_pred ---eEEEEeccccccCHHHHHHHHHHHH-cCCCCHHHHHHHhCCCCCCCccccccccchhhcccccccccccCCCCCCCc Confidence 4667788999999999999887665 6789999999999999999999876543 222211111110 000 00 Q ss_pred CC Q lcl|NC_019710. 423 GA 424 (424) Q Consensus 423 g~ 424 (424) ++ T Consensus 489 ~~ 490 (535) T protein:vir:10 489 GS 490 (535) T ss_pred cc Confidence 11 No 67 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=100.00 E-value=5.8e-80 Score=454.99 Aligned_cols=356 Identities=13% Similarity=0.112 Sum_probs=284.5 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCcccc- Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK- 92 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~- 92 (424) +|||++++++.+........ .. ..|. +...+++.++|++||++||++||++||+++++++++.... T Consensus 1 Mg~f~~~~~f~~~~~~~~~~--~~------~~~~-----~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~ 67 (378) T protein:vir:93 1 MNLFGKVVSFSRGKLNNDTQ--RV------TAWQ-----NEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred CccchhhhhhhccccCCCcc--ee------eecc-----cchhHHHHHHHHHHHHHHHhhhhhCceeeEEEccccccccc Confidence 99999998744332222110 00 1111 1223567889999999999999999999998876654332 Q ss_pred --ccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-CceeEEEEeccceEEEEEcCCceEEEEEec Q lcl|NC_019710. 
93 --VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-GDVISLLPLQSANMDVKLVGKKVVYRYQRD 169 (424) Q Consensus 93 --~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~-G~~~~l~~l~p~~v~~~~~~~~~~~~~~~~ 169 (424) ...+|++++||+.+||++||+++||+.++.+++++||||++++++.. |.+..++|. T Consensus 68 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~~l~~~--------------------- 126 (378) T protein:vir:93 68 LISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFA--------------------- 126 (378) T ss_pred ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEec--------------------- Confidence 24579999999999999999999999999999999999999887644 555555432 Q ss_pred CceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH---HHHHHHHHH Q lcl|NC_019710. 170 SEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE---QQRSQVEEN 246 (424) Q Consensus 170 ~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~---~~~~~~~~~ 246 (424) +...+|+++||||+|++ .++..|.|++..+...+. .++.+| .++|+|+.+....++ +.++.+++. T Consensus 127 ~~~~~~~~~diih~r~~-~~~~~~~s~l~~~~~~i~----------~~~~~~-~~~g~l~~~~~l~~~~~~~~~~~~~~~ 194 (378) T protein:vir:93 127 DDKKEYKTEELVRLTSP-FYINEDTSILDNALASIQ----------TKLEQG-KLRGLLKINAFLDIDNTQEYREKALTT 194 (378) T ss_pred CCeeEeccceeEEecCc-cccchhhHHHHHHHHHHH----------HHHhcC-cccceeeeCCcCCHHHHHHHHHHHHHH Confidence 23457899999999964 466778999887766553 345555 588999998665433 234445555 Q ss_pred HHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHH Q lcl|NC_019710. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~t 326 (424) ++++.+++++|+++++++|++|++++.+++|+|+ +.++++.++||++|||||.+|++. 
++|++..+|+.+| T Consensus 195 ~~~~~~~~~~~~~~~l~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~g~--------~~e~~~~~f~~~t 265 (378) T protein:vir:93 195 IKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLGT--------ATQEQQIYFYNST 265 (378) T ss_pred HHHhhcccccccceEcCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcCC--------cHHHHHHHHHHHH Confidence 5666778889999999999999999999999997 667899999999999999999532 4588999999999 Q ss_pred HHHHHHHHHHHHhhhccChhhhccc-------eeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC Q lcl|NC_019710. 327 LQPYISRWENSIQRWLIPAKDVGRI-------HAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGD 399 (424) Q Consensus 327 l~P~~~~ie~~l~~~L~~~~~~~~~-------~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd 399 (424) |.|++++||++|+++||++.++... .++||++.+++.|.+++++++.+++++|+||+||+|+++|+||+|||| T Consensus 266 l~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD 345 (378) T protein:vir:93 266 IIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD 345 (378) T ss_pred HHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 9999999999999999998775432 378999999999999999999999999999999999999999999999 Q ss_pred eeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 400 VAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 400 ~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) ++++++|++|++.+++++.++++.- T Consensus 346 ~~~~~~n~~~~~~~~~~~~~~~~~~ 370 (378) T protein:vir:93 346 VYIANLNAVAVKNLSDLQGSRKDVT 370 (378) T ss_pred eeeeccccccccchhhhcCccCCCC Confidence 9999999999998876654333222 No 68 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=100.00 E-value=5.6e-80 Score=455.07 Aligned_cols=356 Identities=13% Similarity=0.108 Sum_probs=284.9 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccc-- Q lcl|NC_019710. 
14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK-- 91 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~-- 91 (424) +|||++++++.++....... .+ ..+. +...+++.++|++||++||++||++||+|+++.+++... T Consensus 1 Mg~f~~~~~~~~~~~~~~~~------~~--~~~~-----~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~~~ 67 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDTQ------RV--TAWQ-----NEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred CCccccchhcccccccCCcc------ee--eeec-----cchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccCccccc Confidence 99999998755433332211 00 1111 122356778999999999999999999998877665432 Q ss_pred -cccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC-CCceeEEEEeccceEEEEEcCCceEEEEEec Q lcl|NC_019710. 92 -KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS-AGDVISLLPLQSANMDVKLVGKKVVYRYQRD 169 (424) Q Consensus 92 -~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~-~G~~~~l~~l~p~~v~~~~~~~~~~~~~~~~ 169 (424) ....+|++++||+.+||++||+++||+.++.+++++||||++++++. .|.+..++|. T Consensus 68 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~~l~p~--------------------- 126 (378) T protein:vir:94 68 LISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELLDLLFA--------------------- 126 (378) T ss_pred ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEEEEEec--------------------- Confidence 23457999999999999999999999999999999999999876544 4666666542 Q ss_pred CceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH---HHHHHHHHH Q lcl|NC_019710. 170 SEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE---QQRSQVEEN 246 (424) Q Consensus 170 ~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~---~~~~~~~~~ 246 (424) +...+|+++||||+|++ .++..|+||+..+...+.. .+++ +.++|+|+.+.....+ +.++.+++. 
T Consensus 127 ~~~~~~~~~diiH~~~~-~~~~~g~s~l~~~~~~i~~----------~~~~-~~~~gil~~~~~l~~~~~~~~~~~~~~~ 194 (378) T protein:vir:94 127 DDKKEYKPEELVRLTSP-FYINEDTSILDNALASIQT----------KLEQ-GKLRGLLKINAFLDIDNTQEYREKALTT 194 (378) T ss_pred CCeeEeeeeeeEEecCc-CCccchhHHHHHHHHHHHH----------HHhc-ccccceeeeCCcCCHHHHHHHHHHHHHH Confidence 22356789999999965 5677899999988877643 2344 4588999998665443 234455555 Q ss_pred HHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHH Q lcl|NC_019710. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~t 326 (424) ++...+++++|+++++++|++|++++++++++|+ +.++++.++||++|||||.+|++. ++|++..+|+.+| T Consensus 195 ~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVP~~~l~~~--------~se~~~~~f~~~t 265 (378) T protein:vir:94 195 IKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLGT--------ASQEQQIYFYNST 265 (378) T ss_pred HHHhhcccccccceecCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcCC--------hHHHHHHHHHHHH Confidence 6666778889999999999999999999999996 667899999999999999999531 4578999999999 Q ss_pred HHHHHHHHHHHHhhhccChhhhccc-------eeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC Q lcl|NC_019710. 327 LQPYISRWENSIQRWLIPAKDVGRI-------HAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGD 399 (424) Q Consensus 327 l~P~~~~ie~~l~~~L~~~~~~~~~-------~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd 399 (424) |.|++++||++|+++||++.++... 
.++||++.+++.|.+++++.+++++++||||+||+|+++|+||+|||| T Consensus 266 L~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD 345 (378) T protein:vir:94 266 IIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD 345 (378) T ss_pred HHHHHHHHHHHHHhhcCChhHhhhhhhcccccceeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 9999999999999999998775432 378999999999999999999999999999999999999999999999 Q ss_pred eeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 400 VAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 400 ~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) ++++|+|++|+++.++++..++++- T Consensus 346 ~~~~~~n~~~~~~~~~~~~~~~~~~ 370 (378) T protein:vir:94 346 VYIANLNAVAVKNLSDLQGSRKDVT 370 (378) T ss_pred eeeecccccccccchhhcCCcCCCC Confidence 9999999999998876653332221 No 69 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=100.00 E-value=1.4e-79 Score=452.93 Aligned_cols=356 Identities=13% Similarity=0.111 Sum_probs=284.7 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCcccc- Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK- 92 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~- 92 (424) +|||++++++.++....... .. ..|. +...+++.++|++||++||++||++||+++++++.+.... T Consensus 1 Mg~f~~~~~~~~~~~~~~~~------~~--~~~~-----~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~ 67 (378) T protein:vir:16 1 MNLFGKVVSFSRGKLNNDTQ------RV--TAWQ-----NEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred CccchhhhhhhcccccCCcc------ee--eecc-----cchhhHHHHHHHHHHHHHHhhhhhCceeEEEEccccccccc Confidence 99999998754433222110 00 0111 1224567889999999999999999999998877654332 Q ss_pred --ccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-CceeEEEEeccceEEEEEcCCceEEEEEec Q lcl|NC_019710. 
93 --VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-GDVISLLPLQSANMDVKLVGKKVVYRYQRD 169 (424) Q Consensus 93 --~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~-G~~~~l~~l~p~~v~~~~~~~~~~~~~~~~ 169 (424) ....|+++++|+.+||++||+++||+.++.+++++||||++++++.. |.+..++|. T Consensus 68 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~~l~~~--------------------- 126 (378) T protein:vir:16 68 LISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFA--------------------- 126 (378) T ss_pred ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEec--------------------- Confidence 23579999999999999999999999999999999999999988754 555544432 Q ss_pred CceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH---HHHHHHHHH Q lcl|NC_019710. 170 SEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE---QQRSQVEEN 246 (424) Q Consensus 170 ~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~---~~~~~~~~~ 246 (424) +....|+++||||||++ .++..|.|++..+...+. ..+.+ +.++|+|+.+.....+ +.++.+++. T Consensus 127 ~~~~~~~~~diih~r~~-~~~~~~~s~l~~~~~~i~----------~~~~~-~~~~g~l~~~~~l~~~~~~~~~~~~~~~ 194 (378) T protein:vir:16 127 DDKKEYKPEELVRLTSP-FYINEDTSILDNALASIQ----------TKLEQ-GKLRGLLKINAFLDIDNTQEYREKALTT 194 (378) T ss_pred CCeeEecccceEEecCc-cCccchhHHHHHHHHHHH----------HHHhc-CccceeeEeCCcCCHHHHHHHHHHHHHH Confidence 22456789999999964 566778899888776653 23344 4578999988665443 334555666 Q ss_pred HHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHH Q lcl|NC_019710. 
247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~t 326 (424) ++...+++++|+++++++|++|++++++++|+|+ +.+++++++||++|||||.+|++ +++|++.++|+.+| T Consensus 195 ~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~g--------~~~e~~~~~f~~~t 265 (378) T protein:vir:16 195 IKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLG--------TASQEQQIYFYNST 265 (378) T ss_pred HHHhhcccccccceEcCCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC--------CchHHHHHHHHHHH Confidence 6666788899999999999999999999999997 55689999999999999999953 24578999999999 Q ss_pred HHHHHHHHHHHHhhhccChhhhccc-------eeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC Q lcl|NC_019710. 327 LQPYISRWENSIQRWLIPAKDVGRI-------HAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGD 399 (424) Q Consensus 327 l~P~~~~ie~~l~~~L~~~~~~~~~-------~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd 399 (424) |.|++++||++|+++||++.++..+ .++||++.+++.|.+++++.+.+++++|+||+||+|+++|+||+|||| T Consensus 266 l~P~~~~ie~~l~~kLl~~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD 345 (378) T protein:vir:16 266 IIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD 345 (378) T ss_pred HHHHHHHHHHHHHhhcCChhhhhhhhhcccccceeeccchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 9999999999999999998775432 378999999999999999999999999999999999999999999999 Q ss_pred eeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 
400 VAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 400 ~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) ++++|+|++|++...+++.+++++- T Consensus 346 ~~~~~~n~~~~~~~~~~~~~~~~~~ 370 (378) T protein:vir:16 346 VYIANLNAVAVKNLSDLQGSRKDVT 370 (378) T ss_pred eEeeccccccccchhhhcCccCCCC Confidence 9999999999998876654433322 No 70 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=100.00 E-value=5.6e-79 Score=449.58 Aligned_cols=379 Identities=14% Similarity=0.104 Sum_probs=289.3 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +||++++++||.......... ....|.....++.+.|+++++|++||++||+++|++||+++++++ T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~~~~------- 66 (395) T protein:vir:40 1 MGFKSWVSGFFNEEQRTLNLT-------DTVWCSIPSEKLKELSIKKWAIDSCANKIANTLSCAEVLTYEKGE------- 66 (395) T ss_pred CchHHHHHhhhcccccccccc-------cchhhccccccchhhhhhhHHHHHHHHHHHHHHhhCceeeccCCc------- Confidence 999999999997654433211 112233445567788999999999999999999999999997542 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCceEEEEEecC--c Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDS--E 171 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~~~~~--~ 171 (424) ...|++.++|+.+||++||+++||+.++.+++++||||+++.++. +++.++..+.........++.+...+ . 
T Consensus 67 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~------~~~~~~~~~~~~~~~~~~~~~v~~~~~~~ 140 (395) T protein:vir:40 67 EVRKKNWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDEY------IYVADSFTKNDKSLYENTYTEVTLKDLTL 140 (395) T ss_pred cccchHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecCc------eeecCCccccccccccceeeeeeecCcee Confidence 235889999999999999999999999999999999999998764 33333332221111111122222222 2 Q ss_pred eEEecHhHeeEecCcCCCCcccc-chHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019710. 172 YADFSQKEIFHLKGFGFTGLVGL-SPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 172 ~~~~~~~evih~r~~~~~~~~G~-s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) .+.|+++||||||+.+..+..+. +........+. .......+.++.++.++++.+.. .++++.+++++.|++. T Consensus 141 ~~~~~~~evih~r~~~~~~~~~~~~l~~~~~~~~~-----~~~~~~~~~~~~~~~l~~~~~~~-~~~~~~~~~~~~~~~~ 214 (395) T protein:vir:40 141 KKEFKESEVLHLTLNNESIKSIIDGFYLLYGDLLT-----AAVNKYKKLNSRKIIVKLKAMFG-QTPEAEEKLRLMLSER 214 (395) T ss_pred eeeeccccEEEeecCCCCccccchhHHHHHHHHHH-----HHHHHHHhcCCCCceEEEecccC-CCHHHHHHHHHHHHHH Confidence 45799999999997664432222 22233332222 12233344555555555555533 5677888899999887 Q ss_pred hCC--cccCcceecCCCceeeeccCChhHHHHHHHHHHHH---HHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHH Q lcl|NC_019710. 251 AGG--PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQV---SELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 251 ~~~--~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~---~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~ 325 (424) +++ .++++++++++|++|++++++++|+||+|.+++.. 
++||++|||||.+|++ +++|+|++.+.|+++ T Consensus 215 ~~~~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~~------~~sn~e~~~~~f~~~ 288 (395) T protein:vir:40 215 MKKFLAEGDSALPVEDGMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLAKG------DTVGLSEQVNSFLMF 288 (395) T ss_pred HHHhhccCCceeecCCCceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcC------CCcCHHHHHHHHHHH Confidence 655 57788999999999999999999999999999875 7999999999999963 345899999999999 Q ss_pred HHHHHHHHHHHHHhhhccChhhhc-cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCeee Q lcl|NC_019710. 326 TLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG--GDVAM 402 (424) Q Consensus 326 tl~P~~~~ie~~l~~~L~~~~~~~-~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g--gd~~~ 402 (424) ||.|++++||++|+++||++.++. +++++||++.+++.|.+++++.+.+++++|+||+||+|+++|+||+++ ||+++ T Consensus 289 ~L~P~~~~ie~~l~~kLl~~~~~~~g~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~~~~gD~~~ 368 (395) T protein:vir:40 289 SINPIAEMFTDEGNRKFYGRDSVLERTYMKLDTTRIKVQDIQEIASSMDVLFHIGVNTIDDNLRMIGREPVMSPETQERF 368 (395) T ss_pred HHHHHHHHHHHHHHHhcCChhhhcCCceEEEechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCceee Confidence 999999999999999999988764 588999999999999999999999999999999999999999999954 99999 Q ss_pred ecccccchhhccccC-CCccCCC Q lcl|NC_019710. 403 RQSQYVPITDLGTNK-EPRNNGA 424 (424) Q Consensus 403 ~~~n~~~~~~~~~~~-~~~~~g~ 424 (424) +++|++|++...+.. ++++++. 
T Consensus 369 ~~~n~~~~~~~~~~~kgge~~~~ 391 (395) T protein:vir:40 369 VTKNYAPLGENEEDLKGGDINEN 391 (395) T ss_pred eccccccccccccccCCCCCCCC Confidence 999999998765432 2222222 No 71 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=100.00 E-value=2.5e-79 Score=451.51 Aligned_cols=365 Identities=13% Similarity=0.129 Sum_probs=287.6 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +|||++++++... . .. ...+..+..++.+.|+++++|++||++||+++|++||++|+++ . T Consensus 1 Mg~f~~l~~~~~~---~-----~~-----~~~~~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~~p~~~~~~~-------~ 60 (376) T protein:vir:78 1 MGFFSELFKRNKE---I-----EW-----MWDLDFLEDKTTKVYLKKMALNTCVKHIARTIAKSDFRLKNGE-------T 60 (376) T ss_pred CchhhhhhccCCc---c-----cc-----ccchhhccccchhhhhhhHHHHHHHHHHHHhhcccceeecccc-------c Confidence 9999998764321 0 00 0122234557788999999999999999999999999998653 2 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCceEEEEEecCceE Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEYA 173 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~~~~~~~~ 173 (424) ..+|+++++|+.+||++||+++||+.++.+++++||||+++.|+..|.+.+++++.+..+.......... ....... 
T Consensus 61 ~~~~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~ 137 (376) T protein:vir:78 61 SVRDKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIADSYVRKEFAFFPDVFEGVTV---KDYRYNR 137 (376) T ss_pred cccchHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeeccceeecccceeeeeeeeeee---ecceeee Confidence 3468999999999999999999999999999999999999999999999999999988765443222111 1222246 Q ss_pred EecHhHeeEecCcCCCCccccchH-HHHHHHHHHHHHHHHHHHHHH-hccCCCceeEEcCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_019710. 174 DFSQKEIFHLKGFGFTGLVGLSPI-AFACKSAGVAVAMEDQQRDFF-ANGAKSPQILSTGEKVLTEQQRSQVEENFKEIA 251 (424) Q Consensus 174 ~~~~~evih~r~~~~~~~~G~s~~-~~~~~~i~~~~~~~~~~~~~~-~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~ 251 (424) .|+++||||+|+.+.++....+++ ..+... .......++ .++.++.++++. ....++++.+.+++.|++.. T Consensus 138 ~~~~~evih~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~ 210 (376) T protein:vir:78 138 NFSMDDVIFLEYGNERLSAFTDGMFEDYGEL------FGKMIRAQMRNFQIRGAVNFKM-AGVADKDKQTKLQEYIDKVY 210 (376) T ss_pred eeccccEEEeccCCCCchhhhhHHHHHHHHH------HHHHHHHHHhcCCCceeEEEcc-CCCCCHHHHHHHHHHHHHHh Confidence 799999999997665443222222 222111 122222333 344444444444 45567888899999999887 Q ss_pred CCc--ccCcceecCCCceeeeccCChhHH-----HHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHH Q lcl|NC_019710. 252 GGP--VKKRLWILEAGFSTSAIGVTPQDA-----EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ 324 (424) Q Consensus 252 ~~~--~ag~~~~l~~g~~~~~l~~s~~d~-----~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~ 324 (424) ++. 
+.++++++++|++|+++++++.|+ ||+|.++++.++||++|||||.+|++ +++|+|++.+.|++ T Consensus 211 ~g~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~------~~s~~e~~~~~f~~ 284 (376) T protein:vir:78 211 ASFNNNEIAIVPQLEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLHG------DMADLSNNMKAYME 284 (376) T ss_pred ccccccCcceEEcCCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhCC------CCCCHHHHHHHHHH Confidence 763 455688899999999999888664 99999999999999999999999973 34589999999999 Q ss_pred HHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--Ceee Q lcl|NC_019710. 325 YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGG--DVAM 402 (424) Q Consensus 325 ~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~gg--d~~~ 402 (424) +||.|++++||++|+++|+++.+ ++++|+++.+++.|.+++++.+++++++|++|+||+|+++|+||+||| |+++ T Consensus 285 ~~l~P~~~~ie~~l~~kll~~~~---~~~~~~~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~p~~~g~~d~~~ 361 (376) T protein:vir:78 285 YCIDPLTKKLEDELNAKLFTFSE---FLAGEHIKIIHKKDIIENAEAVDKLVASGSFNRNEVRELLGAERVDNPELDKYL 361 (376) T ss_pred HHHHHHHHHHHHHHHhhhCCccc---ceecccchhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceee Confidence 99999999999999999999765 457889999999999999999999999999999999999999999876 9999 Q ss_pred ecccccchhhccccC Q lcl|NC_019710. 403 RQSQYVPITDLGTNK 417 (424) Q Consensus 403 ~~~n~~~~~~~~~~~ 417 (424) +|+|++|+++.+++. T Consensus 362 ~~~n~~~~~~~~e~g 376 (376) T protein:vir:78 362 ITKNYQSADEGGEDG 376 (376) T ss_pred eccCceehhccccCC Confidence 999999988654333 No 72 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=100.00 E-value=2.6e-78 Score=445.96 Aligned_cols=406 Identities=12% Similarity=0.106 Sum_probs=301.1 Q ss_pred ccHHHHHHhhccCcccccc--------------------------cccccccccc-----ccccc-CCccc-------cH Q lcl|NC_019710. 
14 NGWWARLKSWFVGGRLVTP--------------------------NQGSQTGPVS-----AHGYL-GDSSI-------ND 54 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~--------------------------~~~~~~~~~~-----~~~~~-~~~~~-------~~ 54 (424) +||+.+++..++....--. .......+.. ..++. ..... .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~ 80 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVL 80 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHHHHH Confidence 6666666654431110000 0000001000 00000 00111 12 Q ss_pred HHHhhhHHHHHHHHHHHHhhhhC-------------ceeEeeccccCccccccccchhHHhhccCCCCCC-----CHHHH Q lcl|NC_019710. 55 ERILQISTVWRCVSLISTLTACL-------------PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYM-----TAQEF 116 (424) Q Consensus 55 ~~~~~~~~v~~~i~~ia~~ia~~-------------~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~-----s~~~f 116 (424) +.|..+|+|++||++||+.||++ .+++..+......+.....+.+..+|. +||+++ |+.+| T Consensus 81 ~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~-~pn~~~~p~~~s~~~f 159 (547) T protein:vir:63 81 KKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIE-KTGVDNDINRDSFSSF 159 (547) T ss_pred HHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHH-hhCCCCCCccchHHHH Confidence 35677899999999999999974 233332222222222222345556564 788874 78899 Q ss_pred HHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCc-------eEEEEEecCceEEecHhHeeEecCcCCC Q lcl|NC_019710. 
117 REAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK-------VVYRYQRDSEYADFSQKEIFHLKGFGFT 189 (424) Q Consensus 117 ~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~-------~~~~~~~~~~~~~~~~~evih~r~~~~~ 189 (424) ++.++.+++++||+|++++|+.+|.+++||||+|.+|++..+.++ .++++..+.....|+++||||+|+++.+ T Consensus 160 ~~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~eiih~r~n~~~ 239 (547) T protein:vir:63 160 VKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFNAREMAFAVRNPRS 239 (547) T ss_pred HHHHHHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCccccccCceEEEEEcCCcEEEEeccccEEEecccCCC Confidence 999999999999999999999999999999999999998766543 2334445555678999999999976532 Q ss_pred ----CccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCC-CCCHHHHHHHHHHHHHHhC-CcccCcceec- Q lcl|NC_019710. 190 ----GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAG-GPVKKRLWIL- 262 (424) Q Consensus 190 ----~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~-~~~~~~~~~~~~~~~~~~~-~~~ag~~~~l- 262 (424) +.+|+||+.++..++....++++++.++|+||++|+|+|+.+.. ..++++.+.+++.|++.++ ..|+|+++++ T Consensus 240 ~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~G~~nagk~~vl~ 319 (547) T protein:vir:63 240 DIYATGYGYPELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVS 319 (547) T ss_pred CcccccccccHHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCCHHHHHHHHHHHHHHhcCccccccccccc Confidence 56899999999999999999999999999999999999988644 4678889999999987654 5799998666 Q ss_pred CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCC--------CcccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 
263 EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKS--------TSWGSGIEQQNLGFLQYTLQPYISRW 334 (424) Q Consensus 263 ~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~--------~~~~~n~e~~~~~f~~~tl~P~~~~i 334 (424) ++|++|++++++++|+||+|++++++++||++|||||.+||...++ +.+++|+|++.+.|+++||.|++..| T Consensus 320 ~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~~~i 399 (547) T protein:vir:63 320 AEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFI 399 (547) T ss_pred CCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHH Confidence 6889999999999999999999999999999999999999986543 34678999999999999999999999 Q ss_pred HHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC-CCCcCeeeecccccchhhc Q lcl|NC_019710. 335 ENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPP-LPGGDVAMRQSQYVPITDL 413 (424) Q Consensus 335 e~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p-~~ggd~~~~~~n~~~~~~~ 413 (424) |++||++|++..+. .++|+++.+...+..++++++ +++.+|+||+||+|+++|+|| +||||+++.+.+..++... T Consensus 400 e~~ln~~L~~~~~~---~~~~~f~~~~~~~~~~~~~~~-~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~ 475 (547) T protein:vir:63 400 EDFINKHIVAEFGD---KYTFQFVGGDIKSELESVKIL-AEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQL 475 (547) T ss_pred HHHHHhhcccccCC---ceEEEeeccccccHHHHHHHH-HHHhCCCcCHHHHHHHhCCCCCCCCCceeeccccccccccc Confidence 99999999976542 234555667777777777655 577789999999999999998 7999999999888776542 Q ss_pred cccCCCccC-----------------CC Q lcl|NC_019710. 414 GTNKEPRNN-----------------GA 424 (424) Q Consensus 414 ~~~~~~~~~-----------------g~ 424 (424) ...+.+++. +. 
T Consensus 476 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 503 (547) T protein:vir:63 476 MQQEQFEHEKQQSNLQMLQEQTGNRVST 503 (547) T ss_pred ccccCCccccchhhccccccccCCCCCC Confidence 221111100 00 No 73 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=100.00 E-value=1.9e-78 Score=446.65 Aligned_cols=367 Identities=14% Similarity=0.135 Sum_probs=295.7 Q ss_pred CCccHHHHHHhhccCccccccccccccc----ccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeecccc Q lcl|NC_019710. 12 TNNGWWARLKSWFVGGRLVTPNQGSQTG----PVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQN 87 (424) Q Consensus 12 ~~~G~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~ 87 (424) =-+|||++++..-.....+.+......+ .........+..++.+.++++++|++||++||++||++|++++++.. T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~- 79 (392) T protein:vir:39 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN- 79 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchh- Confidence 2356776654322222222222221111 11222233567789999999999999999999999999999986542 Q ss_pred CccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcC--CceEEE Q lcl|NC_019710. 88 DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG--KKVVYR 165 (424) Q Consensus 88 ~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~--~~~~~~ 165 (424) ..|+.+||++||+++||+.++.+++++||||++++|+.+|++++|||++|.+|++..+. +...|. 
T Consensus 80 -------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~~y~ 146 (392) T protein:vir:39 80 -------------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYN 146 (392) T ss_pred -------------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEE Confidence 23667999999999999999999999999999999999999999999999999988764 344555 Q ss_pred EEecC----ceEEecHhHeeEecCcCCCC-ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHH Q lcl|NC_019710. 166 YQRDS----EYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 166 ~~~~~----~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~ 240 (424) +...+ ....|+++||||+|+++.++ ++|+||+.++..++++..++++++.++|+||++|+++|+++.+...++. T Consensus 147 ~~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~- 225 (392) T protein:vir:39 147 ITFDDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDK- 225 (392) T ss_pred EEecCcccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHH- Confidence 55433 34679999999999998887 7899999999999999999999999999999999999999866544332 Q ss_pred HHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHH Q lcl|NC_019710. 
241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) Q Consensus 241 ~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~ 320 (424) ..+++.+++.+..++|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||+..+++ +.+++.+ T Consensus 226 -~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~----~~~~~~~ 300 (392) T protein:vir:39 226 -DKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ----SSIQQIS 300 (392) T ss_pred -HHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc----cHHHHHH Confidence 23344556777789999999999999999999999999999999999999999999999998754432 4467789 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---------- Q lcl|NC_019710. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTD---------- 390 (424) Q Consensus 321 ~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~l---------- 390 (424) +|+++||.|+++.|+++|+++|++. ++||...+++.|.+.+++.+++++++|++|+||+|+++ T Consensus 301 ~f~~~~l~P~~~~ie~~l~~~L~~~-------~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~ 373 (392) T protein:vir:39 301 GMYASALNRYLRPAISELEYKLSDH-------ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDL 373 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccc-------ccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcccc Confidence 9999999999999999999999864 56888889999999999999999999999999999987 Q ss_pred ----CCCCCCCcCeeeecccccc Q lcl|NC_019710. 
391 ----NLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 391 ----g~~p~~ggd~~~~~~n~~~ 409 (424) |+||++|||.- +-+| T Consensus 374 r~~e~l~~~~~Gd~~----~p~p 392 (392) T protein:vir:39 374 PAPENTNKKTTGQSN----EPVP 392 (392) T ss_pred chhcCCCCCCCCCCC----CCCC Confidence 55555554431 1112 No 74 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=100.00 E-value=1.9e-78 Score=446.65 Aligned_cols=367 Identities=14% Similarity=0.135 Sum_probs=295.7 Q ss_pred CCccHHHHHHhhccCccccccccccccc----ccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeecccc Q lcl|NC_019710. 12 TNNGWWARLKSWFVGGRLVTPNQGSQTG----PVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQN 87 (424) Q Consensus 12 ~~~G~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~ 87 (424) =-+|||++++..-.....+.+......+ .........+..++.+.++++++|++||++||++||++|++++++.. T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~- 79 (392) T protein:vir:10 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKN- 79 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccchh- Confidence 2356776654322222222222221111 11222233567789999999999999999999999999999986542 Q ss_pred CccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcC--CceEEE Q lcl|NC_019710. 88 DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG--KKVVYR 165 (424) Q Consensus 88 ~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~--~~~~~~ 165 (424) ..|+.+||++||+++||+.++.+++++||||++++|+.+|++++|||++|.+|++..+. +...|. 
T Consensus 80 -------------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~~y~ 146 (392) T protein:vir:10 80 -------------QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYN 146 (392) T ss_pred -------------hhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEE Confidence 23667999999999999999999999999999999999999999999999999988764 344555 Q ss_pred EEecC----ceEEecHhHeeEecCcCCCC-ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHH Q lcl|NC_019710. 166 YQRDS----EYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR 240 (424) Q Consensus 166 ~~~~~----~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~ 240 (424) +...+ ....|+++||||+|+++.++ ++|+||+.++..++++..++++++.++|+||++|+++|+++.+...++. T Consensus 147 ~~~~~~~~~~~~~~~~~eiih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~- 225 (392) T protein:vir:10 147 ITFDDPKIEPILQAPQSDLIHMKLLSIDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDK- 225 (392) T ss_pred EEecCcccceeEEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHH- Confidence 55433 34679999999999998887 7899999999999999999999999999999999999999866544332 Q ss_pred HHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHH Q lcl|NC_019710. 
241 SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) Q Consensus 241 ~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~ 320 (424) ..+++.+++.+..++|+++++++|++|++++++++|+||+|.+++++++||++|||||.+||+..+++ +.+++.+ T Consensus 226 -~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~----~~~~~~~ 300 (392) T protein:vir:10 226 -DKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ----SSIQQIS 300 (392) T ss_pred -HHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc----cHHHHHH Confidence 23344556777789999999999999999999999999999999999999999999999998754432 4467789 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---------- Q lcl|NC_019710. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTD---------- 390 (424) Q Consensus 321 ~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~l---------- 390 (424) +|+++||.|+++.|+++|+++|++. ++||...+++.|.+.+++.+++++++|++|+||+|+++ T Consensus 301 ~f~~~~l~P~~~~ie~~l~~~L~~~-------~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~ 373 (392) T protein:vir:10 301 GMYASALNRYLRPAISELEYKLSDH-------ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPKDL 373 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccc-------ccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcccc Confidence 9999999999999999999999864 56888889999999999999999999999999999987 Q ss_pred ----CCCCCCCcCeeeecccccc Q lcl|NC_019710. 
391 ----NLPPLPGGDVAMRQSQYVP 409 (424) Q Consensus 391 ----g~~p~~ggd~~~~~~n~~~ 409 (424) |+||++|||.- +-+| T Consensus 374 r~~e~l~~~~~Gd~~----~p~p 392 (392) T protein:vir:10 374 PAPENTNKKTTGQSN----EPVP 392 (392) T ss_pred chhcCCCCCCCCCCC----CCCC Confidence 55555554431 1112 No 75 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=100.00 E-value=5.8e-77 Score=438.55 Aligned_cols=418 Identities=13% Similarity=0.113 Sum_probs=297.9 Q ss_pred CCCCCcccccC-CCccHHHHHHhhccCc-cccccccccccccc----cc-cccc---------CCccccHHHHhhhHHHH Q lcl|NC_019710. 1 MEEPKYTIDLR-TNNGWWARLKSWFVGG-RLVTPNQGSQTGPV----SA-HGYL---------GDSSINDERILQISTVW 64 (424) Q Consensus 1 ~~~~~~~~~~~-~~~G~~~~~~~~~~~~-~~~~~~~~~~~~~~----~~-~~~~---------~~~~~~~~~~~~~~~v~ 64 (424) -..-.-|..|- .-.++|.++...-..- ......+.....|+ .. .+.. ....-....+..+|+|+ T Consensus 18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~~~p~~~~~~~~~~~~l~~~~~npiv~ 97 (576) T protein:vir:96 18 YEDIIDTVPIDDGLQANIRNIEEKSKELNKSLYGKQQAYAEPFLEVMDTNPEFRTKRSYMKNSDNLHDVLKQFGNNPILN 97 (576) T ss_pred cccchhhhhcccChhHHHHHhhhhhhhhccccCCccchhhcceeeeeecCCCccccCcchhhhhhhHHHHHHhhcCHHHH Confidence 00001122221 1134455443211100 01111111111221 00 0000 00001123455689999 Q ss_pred HHHHHHHHhhhhC-------------ceeEeeccccCccccccccchhHHhh---ccCCCCC-CCHHHHHHHHHHHHHHc Q lcl|NC_019710. 
65 RCVSLISTLTACL-------------PLDVFETDQNDNRKKVDLSNPLARLL---RYSPNQY-MTAQEFREAMTMQLCFY 127 (424) Q Consensus 65 ~~i~~ia~~ia~~-------------~~~~~~~~~~~~~~~~~~~~~l~~lL---~~~PN~~-~s~~~f~~~~~~~~l~~ 127 (424) +||++||+.||++ +++++..+.....++....+++...| +..|||+ +|+++||+.++.+++++ T Consensus 98 ~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~dlll~ 177 (576) T protein:vir:96 98 AIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIVRDTYTY 177 (576) T ss_pred HHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHHHHHHhc Confidence 9999999999973 33444333332223322333333222 2345555 58999999999999999 Q ss_pred CCeEEEEee--CCCCceeEEEEeccceEEEEEcCCceEE-------EEEecCceEEecHhHeeEec-CcCCC---Ccccc Q lcl|NC_019710. 128 GNAYALVDR--NSAGDVISLLPLQSANMDVKLVGKKVVY-------RYQRDSEYADFSQKEIFHLK-GFGFT---GLVGL 194 (424) Q Consensus 128 G~a~~~~~r--~~~G~~~~l~~l~p~~v~~~~~~~~~~~-------~~~~~~~~~~~~~~evih~r-~~~~~---~~~G~ 194 (424) ||||+++++ +..|.+++||||+|.+|++..+.++..+ .+..+.....|+++||||++ +++.+ +.+|+ T Consensus 178 Gna~~~i~~~rd~~g~~~~L~pl~p~~V~v~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~dii~~~~~~~~d~~~~~~G~ 257 (576) T protein:vir:96 178 DQVNFEKVFNKKNATTMDKFIAVDPSTIFYATDKNGKIIKGGKRFVQVINKKVVASFTSREMAMGIRNPRTELSSSGYGL 257 (576) T ss_pred CCeEEEEEEecCCCCceEEEEEeCCceeEEEECCCCceeeeeeEEEEecCCceEEEecccceEEEeecCCCCcccCcccc Confidence 999999885 4457899999999999999888765432 23344556789999998776 44444 67899 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCC-CCCHHHHHHHHHHHHHHhC-CcccCcc-eecCCCceeeec Q lcl|NC_019710. 
195 SPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAG-GPVKKRL-WILEAGFSTSAI 271 (424) Q Consensus 195 s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~-~~~~~~~~~~~~~~~~~~~-~~~ag~~-~~l~~g~~~~~l 271 (424) ||+.++..++.+..++++++.++|+||++|+|||+.+.+ ..++++.+++++.|++.++ ..|+|++ +++++|++|+++ T Consensus 258 Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~~~~l 337 (576) T protein:vir:96 258 SEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGINGSWQVPVVMADDIKFVNM 337 (576) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEec Confidence 999999999999999999999999999999999998764 3578889999999987655 4788985 889999999999 Q ss_pred cCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCC---------CcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_019710. 272 GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKS---------TSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWL 342 (424) Q Consensus 272 ~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~---------~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L 342 (424) +++++|+||+|.+++++++||++|||||.+||..+.+ +.+++|+|++.+.|+++||.|++..||++|+++| T Consensus 338 s~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~L 417 (576) T protein:vir:96 338 TPTANDMQFEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDLINTHI 417 (576) T ss_pred cCChhhHHHHHHHHHhHHHHHHHhCCCHHHccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 9999999999999999999999999999999987654 3367899999999999999999999999999999 Q ss_pred cChhhhccceeeecchhhhccCHHHHHHHHHHH--HhCCCcCHHHHHHHhCCCCCCCcCeeeecccccchhhccccCCCc Q lcl|NC_019710. 343 IPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAM--GESGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPR 420 (424) Q Consensus 343 ~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~--~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n~~~~~~~~~~~~~~ 420 (424) ++..+.. 
++++| ++.|.+++++.++.+ +.+|+||+||+|+++|+||+||||+++.|.++.+++.....++.+ T Consensus 418 l~~~~~~-~~~~f-----~r~d~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~e 491 (576) T protein:vir:96 418 ISEYSDK-YVFQF-----VGGDTKSELDKIKILQEEVKTYKTVNEARKEKGLKPIEGGDVLLDGSFIQSMSLNTQKEQYE 491 (576) T ss_pred chhccCc-eEEEe-----ccCCHHHHHHHHHHHHHHhcCccCHHHHHHHhCCCCCCCcceeccccccccccccccCCCCC Confidence 9875432 33443 577888888877654 567999999999999999999999999999988776543222111 Q ss_pred cCCC Q lcl|NC_019710. 421 NNGA 424 (424) Q Consensus 421 ~~g~ 424 (424) .+.. T Consensus 492 ~~~~ 495 (576) T protein:vir:96 492 DTKQ 495 (576) T ss_pred Cccc Confidence 1111 No 76 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=100.00 E-value=5.5e-77 Score=438.65 Aligned_cols=378 Identities=12% Similarity=0.150 Sum_probs=300.3 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +|||++++....+.............+.....+..+..++.+.++++|+|++||++||++||++|++++++.. T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~~~~~~~------- 73 (386) T protein:vir:49 1 MPIFNITNLATESPPINQESFFDIADSDFLASLNSSEWVSAENALKNSDLFSIISQLSNDLATAKITTSRKQL------- 73 (386) T ss_pred CchhhhhccCCCCcccchhhhhhhhhccccccccCCceechhhhhccHHHHHHHHHHHHHhhhCceeeccchh------- Confidence 8999887543322211111111111223334566788899999999999999999999999999999987542 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCc--eEEEEEe--- Q lcl|NC_019710. 
94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK--VVYRYQR--- 168 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~--~~~~~~~--- 168 (424) ..|+.+||++||+++||+.++.+++++||||++++|+.+|++++|||++|.+|++..+.+. ..|.|.. T Consensus 74 -------~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~~~~~~~~~~y~~~~~~~ 146 (386) T protein:vir:49 74 -------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNGLYYNITFDDP 146 (386) T ss_pred -------hhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEEEcCCCceEEEEEEEcCc Confidence 2367799999999999999999999999999999999999999999999999998876543 4555543 Q ss_pred -cCceEEecHhHeeEecCcCCCC-ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHH Q lcl|NC_019710. 169 -DSEYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) Q Consensus 169 -~~~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~ 246 (424) ++..+.|+++||||+|++++++ ++|+||+.++..++++..++.+++.++|+||+.|+++|+++....+++ .+.+++. T Consensus 147 ~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~-~~~~~~~ 225 (386) T protein:vir:49 147 HIAPKQHVPQNDILHFRLLSVDGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLDF-KTKVSRS 225 (386) T ss_pred cccceeEEccccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChHH-HHHHHHH Confidence 2456789999999999988776 899999999999999999999999999999999999999987765555 4445555 Q ss_pred HHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHH Q lcl|NC_019710. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~t 326 (424) ++. +..|+|+++++++|++|++++.+++|+||+|.++++.++||++|||||.+||+...++ ++. 
++.++|+..+ T Consensus 226 ~~~--~~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~---~~~-~~~~~~~~~~ 299 (386) T protein:vir:49 226 RQA--MKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQQ---SSL-EMIYNIYFKS 299 (386) T ss_pred HHH--hccCCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc---chH-HHHHHHHHHH Confidence 543 4468899999999999999999999999999999999999999999999998754433 244 4557899999 Q ss_pred HHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecc- Q lcl|NC_019710. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQS- 405 (424) Q Consensus 327 l~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~- 405 (424) |.|+++.|+++|+++|++ +++||.+.+++.|.+.++..+.+++++|++|+||+|++++..++...+.+.... T Consensus 300 i~~~l~~i~~~~~~~l~~-------~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~~~~~~~~~~ 372 (386) T protein:vir:49 300 VSRYLRPFVSEMSKKLSC-------EVDVDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILPKELPDGKNP 372 (386) T ss_pred HHHHHHHHHHHHHHHhcc-------hhcccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCCCcCcchhcc Confidence 999999999999999975 367999999999999999999999999999999999999876653333221110 Q ss_pred cccchhhccccCCCccC Q lcl|NC_019710. 406 QYVPITDLGTNKEPRNN 422 (424) Q Consensus 406 n~~~~~~~~~~~~~~~~ 422 (424) +..++. .++ ..++| T Consensus 373 ~~~~~~-gGd--~~~~~ 386 (386) T protein:vir:49 373 NRTSLK-GGE--INEQD 386 (386) T ss_pred CCCCCC-CCC--CCCCC Confidence 111211 111 11111 No 77 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=100.00 E-value=3.3e-77 Score=439.87 Aligned_cols=356 Identities=13% Similarity=0.097 Sum_probs=281.3 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccc-- Q lcl|NC_019710. 
14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK-- 91 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~-- 91 (424) +|+|+++++++.......... .+. .++...+++.++|++||++||++||++||++|++++.+... T Consensus 1 M~if~~~~~~~~~~~~~~~~~--------~~~-----~~~~~~~~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~ 67 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDTQR--------VTA-----WQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred CchhHHhHhhhhcccccCcce--------eee-----eecchhhhhhHHHHHHHHHHHHhHhhCceeeeeeccccccccc Confidence 999999998775443322110 111 12333456788999999999999999999999887654322 Q ss_pred -cccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCCCCceeEEEEeccceEEEEEcCCceEEEEEec Q lcl|NC_019710. 92 -KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRD 169 (424) Q Consensus 92 -~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~-r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~~~~ 169 (424) +....|++.+||+.+||++||+++||+.++.+++++||||++++ ++..|.+..+++. . T Consensus 68 ~~~~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~~-------------------~- 127 (378) T protein:vir:94 68 LISMAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETGELLDLLFA-------------------N- 127 (378) T ss_pred ccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCCcEEEEEEe-------------------c- Confidence 23467999999999999999999999999999999999999854 4555665544332 1 Q ss_pred CceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH---HHHHHHHHH Q lcl|NC_019710. 170 SEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE---QQRSQVEEN 246 (424) Q Consensus 170 ~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~---~~~~~~~~~ 246 (424) ...+|+++||+|++++.... .+.+++..+...+. ..++++ .++|+|+.+.....+ ++.+.+++. 
T Consensus 128 -~~~~~~~~dvih~~~~~~~~-~~~~~~~~~~~~~~----------~~~~~~-~~~g~l~~~~~l~~~~~~~~~e~~~~~ 194 (378) T protein:vir:94 128 -DKKEYKPEELVRLTSPFYIN-EDTSILDNALASIQ----------TKLEQG-KLRGLLKINAFLDIDNTQEYREKALAT 194 (378) T ss_pred -CcEEechhceeeecCcCCcc-cchhHHHHHHHHHH----------HHHhhC-CcccceeeCCcCCHHHHHHHHHHHHHH Confidence 23568999999999654322 24455555544332 333444 578999988665433 345556666 Q ss_pred HHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHH Q lcl|NC_019710. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT 326 (424) Q Consensus 247 ~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~t 326 (424) ++.+.++.++|+++++++|++|++++++++++|+ +.++++.++||++|||||.+|++. .+|++..+|+.+| T Consensus 195 ~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgvPp~~l~g~--------~~e~~~~~f~~~t 265 (378) T protein:vir:94 195 IKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLGT--------ATQEQQIYFYNST 265 (378) T ss_pred HHHhhcccccccceeccCCceEEEccCChHHhhH-HHHHHHHHHHHHHhCCCHHHhcCC--------chHHHHHHHHHHH Confidence 6777888899999999999999999999999996 778999999999999999999632 3478889999999 Q ss_pred HHHHHHHHHHHHhhhccChhhhcc-------ceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC Q lcl|NC_019710. 327 LQPYISRWENSIQRWLIPAKDVGR-------IHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGD 399 (424) Q Consensus 327 l~P~~~~ie~~l~~~L~~~~~~~~-------~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd 399 (424) |.|++++||++|+++||++.++.. 
..++||++.+++.|.+++++.+.+++++|+||+||+|+++|+||+|||| T Consensus 266 l~P~~~~ie~~l~~~Ll~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~NE~R~~~g~~p~~ggd 345 (378) T protein:vir:94 266 IIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGD 345 (378) T ss_pred HHHHHHHHHHHHHhhcCChhHhhhhhhhcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC Confidence 999999999999999999876543 2377999999999999999999999999999999999999999999999 Q ss_pred eeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 400 VAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 400 ~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) ++++|+|++|++..++++..++++. T Consensus 346 ~~~~~~n~~~~~~~~~~~~~~~~~~ 370 (378) T protein:vir:94 346 VYIANLNAVAVKNLSDLQGNRKDVT 370 (378) T ss_pred eeeecccccchhcchhcccccCCCC Confidence 9999999999998887765554433 No 78 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=100.00 E-value=5.4e-77 Score=438.71 Aligned_cols=380 Identities=14% Similarity=0.087 Sum_probs=290.7 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +|+|+++... .........+ ......++.+.|+++++|++||++||++||++||++++++++ . 
T Consensus 1 MGlf~~~~~~----~~~~~~~~~~--------~~~~~~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~-----~ 63 (395) T protein:vir:98 1 MGILDFFSFK----KSGTLSDDDS--------GSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRLKTPEKL-----T 63 (395) T ss_pred CcchhhhcCC----Cccccccccc--------chhhhhhcchhhhhhHHHHHHHHHHHHHHhhCceeEEecCCc-----c Confidence 9999887421 1111111111 112234567788999999999999999999999999986532 2 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCceEEEEE--ecCc Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQ--RDSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~~--~~~~ 171 (424) ...|++.++|+.+||++||+++||+.++.+++++||||++++++..+.+ . +..+.........++... .... T Consensus 64 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~~~~-----~-~~~~~~~~~~~~~~~~~~~~~~~~ 137 (395) T protein:vir:98 64 ENQKDWLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKGIYV-----A-DSFTQDKKISGSQFKVSRVQGQTY 137 (395) T ss_pred cccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCceec-----C-CcccccccccCcccceeeecCcee Confidence 3468999999999999999999999999999999999999998764322 2 222222111111112222 1222 Q ss_pred eEEecHhHeeEecCcCCCCc-cccchHHHHHHHHHHHH--HHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHH Q lcl|NC_019710. 172 YADFSQKEIFHLKGFGFTGL-VGLSPIAFACKSAGVAV--AMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFK 248 (424) Q Consensus 172 ~~~~~~~evih~r~~~~~~~-~G~s~~~~~~~~i~~~~--~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~ 248 (424) ..+++++||||+|+.+.++. ++.+++......+.... 
.......+++.++..+.+++........+++.+..+++++ T Consensus 138 ~~~~~~~evih~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (395) T protein:vir:98 138 EKTFTFDQVIYLKNDNSDLMSKVESLWEEYGELLGHVINNQKIANQIRFTMIPPKDKVRERAQENSDGGRQSKSDKDFFK 217 (395) T ss_pred eeEecCccEEEecCCCCCccccccchhhhHHHHHHHHHHHHHHHHHHHHhhccccccccccccccCCcHHHHHHHHHHHH Confidence 46799999999998876654 34444455444444333 3345566788888888888887777777788888888898 Q ss_pred HHhCCc--ccCcceecCCCceeeeccC------ChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHH Q lcl|NC_019710. 249 EIAGGP--VKKRLWILEAGFSTSAIGV------TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL 320 (424) Q Consensus 249 ~~~~~~--~ag~~~~l~~g~~~~~l~~------s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~ 320 (424) ++.++. +.++++++++|++|+++++ ++.++||.+.++++.++||++|||||.+|++ +++|.|++.+ T Consensus 218 ~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~~------~~sn~e~~~~ 291 (395) T protein:vir:98 218 RTVEKIRTESVVGIPVTANTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHG------DIADNQKNYE 291 (395) T ss_pred HHHhhhhcCCcceeecCCCceeEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcC------CcccHHHHHH Confidence 887764 4456888999999999985 4678899999999999999999999999963 3558999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--c Q lcl|NC_019710. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG--G 398 (424) Q Consensus 321 ~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g--g 398 (424) +|+++||.|++++||++|+++|+++.++.. 
.++||++.+++.|.+++++.+++++++|++|+||+|+++|+||+|| | T Consensus 292 ~f~~~tl~P~~~~ie~~l~~kll~~~~~~~-g~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~Pi~~~~g 370 (395) T protein:vir:98 292 LLLEGPIESLITNIVDGLEYAIFDKSETLQ-GSFIKVTGLKNYDLFSISNQADKLISSGFVFIDEVREEIGLPELPDGLG 370 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCChhhhcC-cceeeehhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCC Confidence 999999999999999999999999877543 3458888999999999999999999999999999999999999976 9 Q ss_pred CeeeecccccchhhccccCCCccCC Q lcl|NC_019710. 399 DVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) |++++++|++|++..++....+++. T Consensus 371 D~~~~~~n~~~~~~~gge~~~~~~~ 395 (395) T protein:vir:98 371 KVLYMTKNYESVLERGGEVDEEVET 395 (395) T ss_pred ceeeecccceecccccCCCCCCCCC Confidence 9999999999998654433333333 No 79 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=100.00 E-value=6.5e-77 Score=438.28 Aligned_cols=355 Identities=13% Similarity=0.098 Sum_probs=276.7 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccc-- Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK-- 91 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~-- 91 (424) +|||++++.+++.......... + ..++...++.+++|++||++||++||++||++|++++++... T Consensus 1 M~~f~k~~~~~~~~~~~~~~~~--------~-----~~~~~~~~~~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~ 67 (378) T protein:vir:85 1 MNLFGKVVSFSRGKLNNDTQRV--------T-----AWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDT 67 (378) T ss_pred CchhhhhhhhhhcccccCCcce--------e-----eeeccchhhhhHHHHHHHHHHHHhHhhCceeEEEEecccccccc Confidence 9999999887765443322110 1 112233467889999999999999999999999888765432 Q ss_pred -cccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCCCCceeEEEEeccceEEEEEcCCceEEEEEec Q lcl|NC_019710. 
92 -KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRD 169 (424) Q Consensus 92 -~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~-r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~~~~ 169 (424) +...+|++.+||+.+||++||+++||+.++.+++++||||++++ ++..|.+..+++. T Consensus 68 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~~~--------------------- 126 (378) T protein:vir:85 68 LISMAGSDLDEVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSETGELLDLLFA--------------------- 126 (378) T ss_pred ccccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCCCceEEEEEec--------------------- Confidence 23467999999999999999999999999999999999999864 4555654433321 Q ss_pred CceEEecHhHeeEecCc-CCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH---HHHHHHHH Q lcl|NC_019710. 170 SEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE---QQRSQVEE 245 (424) Q Consensus 170 ~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~---~~~~~~~~ 245 (424) .....|.++||||++.+ +.++ +.+.+..+...+ ...+++ +.++|+++.+.....+ +.++.+++ T Consensus 127 ~~~~~~~~~dvih~~~~~~~~~--~~~~~~~a~~~~----------~~~~~~-~~~~g~l~~~~~l~~~~~~~~~~~~~~ 193 (378) T protein:vir:85 127 NDKKEYKPEELVRLVSPFYINE--DTSILDNALASI----------QTKLEQ-GKLRGLLKINAFLDIDNTQEYREKALA 193 (378) T ss_pred CCCEEEcccceEEEecCcCccc--hhhHHHHHHHHH----------HHHHhc-CCcceEEEeCCcCCHHHHHHHHHHHHH Confidence 22456788999999853 2232 234444333322 233444 4689999988664433 22344445 Q ss_pred HHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHH Q lcl|NC_019710. 
246 NFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 246 ~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~ 325 (424) .++.+.++.++|+++++++|++|++++++++++++ +.++++.++||++|||||.+|++ +++|++..+|+.+ T Consensus 194 ~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVPp~~l~~--------s~~e~~~~~f~~~ 264 (378) T protein:vir:85 194 TIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNK-DEIELIKSELLTGYFMNENILLG--------TATQEQQIYFYNS 264 (378) T ss_pred HHHHhhcccccccceecCCCceEEeccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC--------CchHHHHHHHHHH Confidence 55666788899999999999999999999999996 67899999999999999999953 2458899999999 Q ss_pred HHHHHHHHHHHHHhhhccChhhhccc-------eeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc Q lcl|NC_019710. 326 TLQPYISRWENSIQRWLIPAKDVGRI-------HAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 326 tl~P~~~~ie~~l~~~L~~~~~~~~~-------~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~gg 398 (424) ||.|++.+||++|+++|+++.++... .++||.+.+++.|.+++++.+.+++++|+||+||+|+++|+||+||| T Consensus 265 tL~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~gG 344 (378) T protein:vir:85 265 TIIPLLIQLEKELTYKLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGG 344 (378) T ss_pred HHHHHHHHHHHHHHhhcCChhhhhhhhhccccceeeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 99999999999999999998765432 37799999999999999999999999999999999999999999999 Q ss_pred CeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 399 DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) |++++|+|++|++...+++.++++.. 
T Consensus 345 D~~~~~~N~~~~~~~~~~~~~~~~~~ 370 (378) T protein:vir:85 345 DIYIANLNAVAVKNLSDLQGSRKDVA 370 (378) T ss_pred CeEeecccccccccchhhcCccCCCC Confidence 99999999999998877654433322 No 80 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=100.00 E-value=3e-77 Score=440.07 Aligned_cols=376 Identities=14% Similarity=0.086 Sum_probs=280.1 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +|+|++++..... ..+ .. .. ......++.+.|+++++|++||++||++||++||++++++++ . T Consensus 1 Mgl~d~~~~~~~~---~~~-~~-----~~---~~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~-----~ 63 (395) T protein:vir:96 1 MGILDFFSFKKSG---TLS-DD-----DS---GSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRIKAPEKL-----T 63 (395) T ss_pred CcchhhhcCCCCc---ccc-cc-----cc---ccchhhhcchhhhhhHHHHHHHHHHHHhhccceeEEEeCCcc-----c Confidence 9999987542211 000 00 00 011234567789999999999999999999999999976432 2 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCceEEEE--EecCc Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRY--QRDSE 171 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~--~~~~~ 171 (424) ..+|++.+||+.+||++||+++||+.++.+++++||||+++.++..+.+...++. .....+.. ++.+ ..... 
T Consensus 64 ~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~~~~~~~~~~~-----~~~~~~~~-~~~v~~~~~~~ 137 (395) T protein:vir:96 64 ENQKDWLYWINTKANPNQSASQFWVEVVQKLLVDGETLIFVIPGKGIYVADAFTQ-----DKKLSGNK-FKVSRVQGQTY 137 (395) T ss_pred cccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCCceecCCcccc-----ccccccce-eeeeeecccee Confidence 3578999999999999999999999999999999999999998864433322222 11111111 1111 22223 Q ss_pred eEEecHhHeeEecCcCCCC-ccccchHHHHHHH------HHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHH Q lcl|NC_019710. 172 YADFSQKEIFHLKGFGFTG-LVGLSPIAFACKS------AGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVE 244 (424) Q Consensus 172 ~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~------i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~ 244 (424) ...++++||||||+++.+. .++.+++...... +.....+.++..+++.+++.+.++++....... +..+ T Consensus 138 ~~~~~~~dvih~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~ 213 (395) T protein:vir:96 138 EKIFTFDQVIYLKNDNSDLMLKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKDKVRERAQENSDGGRQP----KSDK 213 (395) T ss_pred eeEeccCceEEecccCCccccccccccchHHHHHHHHHHHHHHHHHHHHHhhhcccccccceeeccCchhhH----HHHH Confidence 5679999999999876543 2333333332222 222334557888999999999999877655443 3444 Q ss_pred HHHHHHhCCc--ccCcceecCCCceeeeccCChhHHHHHHHHHHH------HHHHHHHhCCCHHHcCCCCCCCcccccHH Q lcl|NC_019710. 245 ENFKEIAGGP--VKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQ------VSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 245 ~~~~~~~~~~--~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~------~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e 316 (424) ++++++.++. +.++++++++|++|++++.++.|+|++|.+++. 
.++||++|||||.+|++ +++|.| T Consensus 214 ~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~~------~~sn~e 287 (395) T protein:vir:96 214 DFFKRTIEKIRTESVVGIPVTANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHG------DIADNQ 287 (395) T ss_pred HHHHHHHHHhhcCCcceEEccCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcC------CCccHH Confidence 5555554433 345688899999999999999999999988876 47899999999999963 345899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Q lcl|NC_019710. 317 QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLP 396 (424) Q Consensus 317 ~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ 396 (424) ++.++|+++||.|++++||++|+++|+++.++.. .++|+++.+++.|.+++++.+++++++|+||+||+|+++|+||+| T Consensus 288 ~~~~~f~~~~L~P~~~~ie~~l~~~Ll~~~e~~~-~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~ 366 (395) T protein:vir:96 288 KNYELLLEGPIESLITNIVDGLEYAIFDKSETLE-GSFIKVTGLKNYDLFSISSQADKLISSGFVFIDEVREEIGLPELP 366 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCChhhhcC-ceeEeecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 9999999999999999999999999999877543 345888999999999999999999999999999999999999997 Q ss_pred C--cCeeeecccccchhhccccCCCccCC Q lcl|NC_019710. 397 G--GDVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 397 g--gd~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) | ||++++|+|++|+++.++....+++. 
T Consensus 367 ~~~gD~~~~~~N~~~~~~~gge~~~~~~~ 395 (395) T protein:vir:96 367 DGLGKVLYMTKNYESVLERGGEVDEEVET 395 (395) T ss_pred CCCCceeeecccceechhccCCCCCCCCC Confidence 6 99999999999998743332222222 No 81 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=100.00 E-value=3.8e-76 Score=434.07 Aligned_cols=418 Identities=13% Similarity=0.136 Sum_probs=304.4 Q ss_pred CCCCCcccccCCCccHH-------------HHHHhhccCccccccccccccccc--ccccccCCcc------ccHHHHhh Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWW-------------ARLKSWFVGGRLVTPNQGSQTGPV--SAHGYLGDSS------INDERILQ 59 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~-------------~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~------~~~~~~~~ 59 (424) ....|++=.+.-.-|+= +-+.....+..++...+....-.. +.....+... ..-+.+.. T Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~ 93 (563) T protein:vir:95 14 YGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKNEHNLHDVLKKFGN 93 (563) T ss_pred cccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCCCcccHHHHHHHhhc Confidence 33333322222111111 111111222222211111100000 0000000000 11223445 Q ss_pred hHHHHHHHHHHHHhhhh-------------CceeEeeccccCccccccccchhHHhhc---cCCCCC-CCHHHHHHHHHH Q lcl|NC_019710. 60 ISTVWRCVSLISTLTAC-------------LPLDVFETDQNDNRKKVDLSNPLARLLR---YSPNQY-MTAQEFREAMTM 122 (424) Q Consensus 60 ~~~v~~~i~~ia~~ia~-------------~~~~~~~~~~~~~~~~~~~~~~l~~lL~---~~PN~~-~s~~~f~~~~~~ 122 (424) +++|.+||+++++.||. +++++++++..+..++....+++..+|. ..|||+ +|+++|++.++. 
T Consensus 94 n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~ 173 (563) T protein:vir:95 94 NPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVR 173 (563) T ss_pred chHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHH Confidence 78899999999998885 6788888777766666666677665543 223333 588999999999 Q ss_pred HHHHcCCeEEEEe--eCCCCceeEEEEeccceEEEEEcCCceE-------EEEEecCceEEecHhHeeEe-cCcCCC--- Q lcl|NC_019710. 123 QLCFYGNAYALVD--RNSAGDVISLLPLQSANMDVKLVGKKVV-------YRYQRDSEYADFSQKEIFHL-KGFGFT--- 189 (424) Q Consensus 123 ~~l~~G~a~~~~~--r~~~G~~~~l~~l~p~~v~~~~~~~~~~-------~~~~~~~~~~~~~~~evih~-r~~~~~--- 189 (424) +++++||+|++++ |+..|++++||||+|.+|++..+.++.. +++..+.....|+++||||+ ++++.+ T Consensus 174 ~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~d~~~ 253 (563) T protein:vir:95 174 DTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRTELSS 253 (563) T ss_pred HHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEecCcceEEEeccCCCCccc Confidence 9999999999865 7888999999999999999988776542 23334555678999998755 455554 Q ss_pred CccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCC-CCHHHHHHHHHHHHHHhC-CcccCcc-eecCCCc Q lcl|NC_019710. 190 GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV-LTEQQRSQVEENFKEIAG-GPVKKRL-WILEAGF 266 (424) Q Consensus 190 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~-~~~~~~~~~~~~~~~~~~-~~~ag~~-~~l~~g~ 266 (424) +.+|+||+.++..++.+..++++++.++|+||++|+|+|+.+.+. 
.++++.+.+++.|++.++ ..|+|++ +++++|+ T Consensus 254 ~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~ 333 (563) T protein:vir:95 254 SGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDI 333 (563) T ss_pred CcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCc Confidence 678999999999999999999999999999999999999987653 578889999999998655 4789986 7899999 Q ss_pred eeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCC---------cccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 267 STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKST---------SWGSGIEQQNLGFLQYTLQPYISRWENS 337 (424) Q Consensus 267 ~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~---------~~~~n~e~~~~~f~~~tl~P~~~~ie~~ 337 (424) +|++++++++|+||+|++++++++||++|||||.+||..++++ .+++|++++.+.|+++||.|+++.||++ T Consensus 334 ~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ 413 (563) T protein:vir:95 334 KFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDL 413 (563) T ss_pred eEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999876543 3667899999999999999999999999 Q ss_pred HhhhccChhhhccceeeecchhhhccCHHHHHHHHH--HHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccchhhccc Q lcl|NC_019710. 338 IQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMK--AMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGT 415 (424) Q Consensus 338 l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~--~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n~~~~~~~~~ 415 (424) |+++|++..+. .++++| ++.|.+++++.+. +++++|+||+||+|+++|+||+||||+++.|.++.+++.... 
T Consensus 414 ln~~L~~~~~~-~~~~~f-----~r~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~ 487 (563) T protein:vir:95 414 VNRHIISEYGD-KYTFQF-----VGGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQ 487 (563) T ss_pred HHhhhchhccc-ccEEEe-----ccCCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceeeccccccccccccc Confidence 99999987553 233333 6778888888765 468899999999999999999999999999999887754322 Q ss_pred cCCCcc---------------------------CCC Q lcl|NC_019710. 416 NKEPRN---------------------------NGA 424 (424) Q Consensus 416 ~~~~~~---------------------------~g~ 424 (424) .+...+ ++. T Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (563) T protein:vir:95 488 DKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSS 523 (563) T ss_pred ccCCCccccchhhhhcccccCCCCCCCCCCCCCCCC Confidence 111000 000 No 82 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=100.00 E-value=3.8e-76 Score=434.07 Aligned_cols=418 Identities=13% Similarity=0.136 Sum_probs=304.4 Q ss_pred CCCCCcccccCCCccHH-------------HHHHhhccCccccccccccccccc--ccccccCCcc------ccHHHHhh Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWW-------------ARLKSWFVGGRLVTPNQGSQTGPV--SAHGYLGDSS------INDERILQ 59 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~-------------~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~------~~~~~~~~ 59 (424) ....|++=.+.-.-|+= +-+.....+..++...+....-.. +.....+... ..-+.+.. T Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~ 93 (563) T protein:vir:99 14 YGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKNEHNLHDVLKKFGN 93 (563) T ss_pred cccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCCCcccHHHHHHHhhc Confidence 33333322222111111 111111222222211111100000 0000000000 11223445 Q ss_pred hHHHHHHHHHHHHhhhh-------------CceeEeeccccCccccccccchhHHhhc---cCCCCC-CCHHHHHHHHHH Q lcl|NC_019710. 
60 ISTVWRCVSLISTLTAC-------------LPLDVFETDQNDNRKKVDLSNPLARLLR---YSPNQY-MTAQEFREAMTM 122 (424) Q Consensus 60 ~~~v~~~i~~ia~~ia~-------------~~~~~~~~~~~~~~~~~~~~~~l~~lL~---~~PN~~-~s~~~f~~~~~~ 122 (424) +++|.+||+++++.||. +++++++++..+..++....+++..+|. ..|||+ +|+++|++.++. T Consensus 94 n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~ 173 (563) T protein:vir:99 94 NPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVR 173 (563) T ss_pred chHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHH Confidence 78899999999998885 6788888777766666666677665543 223333 588999999999 Q ss_pred HHHHcCCeEEEEe--eCCCCceeEEEEeccceEEEEEcCCceE-------EEEEecCceEEecHhHeeEe-cCcCCC--- Q lcl|NC_019710. 123 QLCFYGNAYALVD--RNSAGDVISLLPLQSANMDVKLVGKKVV-------YRYQRDSEYADFSQKEIFHL-KGFGFT--- 189 (424) Q Consensus 123 ~~l~~G~a~~~~~--r~~~G~~~~l~~l~p~~v~~~~~~~~~~-------~~~~~~~~~~~~~~~evih~-r~~~~~--- 189 (424) +++++||+|++++ |+..|++++||||+|.+|++..+.++.. +++..+.....|+++||||+ ++++.+ T Consensus 174 ~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~d~~~ 253 (563) T protein:vir:99 174 DTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRTELSS 253 (563) T ss_pred HHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEecCcceEEEeccCCCCccc Confidence 9999999999865 7888999999999999999988776542 23334555678999998755 455554 Q ss_pred CccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCC-CCHHHHHHHHHHHHHHhC-CcccCcc-eecCCCc Q lcl|NC_019710. 190 GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV-LTEQQRSQVEENFKEIAG-GPVKKRL-WILEAGF 266 (424) Q Consensus 190 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~-~~~~~~~~~~~~~~~~~~-~~~ag~~-~~l~~g~ 266 (424) +.+|+||+.++..++.+..++++++.++|+||++|+|+|+.+.+. 
.++++.+.+++.|++.++ ..|+|++ +++++|+ T Consensus 254 ~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~ 333 (563) T protein:vir:99 254 SGYGLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVMADDI 333 (563) T ss_pred CcccchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEcCCCc Confidence 678999999999999999999999999999999999999987653 578889999999998655 4789986 7899999 Q ss_pred eeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCC---------cccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 267 STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKST---------SWGSGIEQQNLGFLQYTLQPYISRWENS 337 (424) Q Consensus 267 ~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~---------~~~~n~e~~~~~f~~~tl~P~~~~ie~~ 337 (424) +|++++++++|+||+|++++++++||++|||||.+||..++++ .+++|++++.+.|+++||.|+++.||++ T Consensus 334 ~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ 413 (563) T protein:vir:99 334 KFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDL 413 (563) T ss_pred eEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999876543 3667899999999999999999999999 Q ss_pred HhhhccChhhhccceeeecchhhhccCHHHHHHHHH--HHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccchhhccc Q lcl|NC_019710. 338 IQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMK--AMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGT 415 (424) Q Consensus 338 l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~--~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n~~~~~~~~~ 415 (424) |+++|++..+. .++++| ++.|.+++++.+. +++++|+||+||+|+++|+||+||||+++.|.++.+++.... 
T Consensus 414 ln~~L~~~~~~-~~~~~f-----~r~D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~ 487 (563) T protein:vir:99 414 VNRHIISEYGD-KYTFQF-----VGGDTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQ 487 (563) T ss_pred HHhhhchhccc-ccEEEe-----ccCCHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceeeccccccccccccc Confidence 99999987553 233333 6778888888765 468899999999999999999999999999999887754322 Q ss_pred cCCCcc---------------------------CCC Q lcl|NC_019710. 416 NKEPRN---------------------------NGA 424 (424) Q Consensus 416 ~~~~~~---------------------------~g~ 424 (424) .+...+ ++. T Consensus 488 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 523 (563) T protein:vir:99 488 DKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSS 523 (563) T ss_pred ccCCCccccchhhhhcccccCCCCCCCCCCCCCCCC Confidence 111000 000 No 83 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=100.00 E-value=7.2e-77 Score=438.04 Aligned_cols=369 Identities=12% Similarity=0.161 Sum_probs=289.2 Q ss_pred ccHHHHHHhhccCcccccccccc-cccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCcccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGS-QTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKK 92 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~ 92 (424) +|||+++.... ........ .........+.++..++.+.++++++|++||++||++||++||++++... 
T Consensus 1 Mg~f~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~~------ 70 (382) T protein:vir:48 1 MPIFNLATESP----PDNQGGFFDVVDSDFLASLKGNEWVSAETALRNSDLFSIINQLSNDLATVKLITSRKKL------ 70 (382) T ss_pred CccccccccCC----cccccccccchhhhccccccCCcccchHhhhccHHHHHHHHHHHHhhccCceeeecchh------ Confidence 89998864422 11111111 11222334566788899999999999999999999999999999986542 Q ss_pred ccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCC--ceEEEEEecC Q lcl|NC_019710. 93 VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK--KVVYRYQRDS 170 (424) Q Consensus 93 ~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~--~~~~~~~~~~ 170 (424) ..|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|||++|++|++..+.. ..+|.+..++ T Consensus 71 --------~~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~~~~~~y~~~~~~ 142 (382) T protein:vir:48 71 --------QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDGIYYNITFDD 142 (382) T ss_pred --------hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCCeEEEEEEecC Confidence 247789999999999999999999999999999999999999999999999999887654 3455554433 Q ss_pred ----ceEEecHhHeeEecCcCCCC-ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHH Q lcl|NC_019710. 
171 ----EYADFSQKEIFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE 245 (424) Q Consensus 171 ----~~~~~~~~evih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~ 245 (424) ..+.|+++||||+|+++.++ ++|+||+.++..+++...++.+++.++|+||+.|+++|+++....+++ .+.+++ T Consensus 143 ~~~~~~~~~~~~evih~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~-~~~~~~ 221 (382) T protein:vir:48 143 PRIPPKQHVPQNDVLHFRLLSVDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLLDF-KTKLSR 221 (382) T ss_pred ccccceeEEcCccEEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHH-HHHHHH Confidence 45789999999999988876 899999999999999999999999999999999999999987755554 444545 Q ss_pred HHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHH Q lcl|NC_019710. 246 NFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY 325 (424) Q Consensus 246 ~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~ 325 (424) .+.. +..|+|+++++++|++|++++.+++|+||+|.+++.+++||++|||||.+||..+++ ++.+++.+.|++. T Consensus 222 ~~~~--~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~----~~~~~~~~~~~~~ 295 (382) T protein:vir:48 222 SRQA--MKQMQGGPLVLDDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQ----QSSLEMSSDLYSK 295 (382) T ss_pred HHHh--hccCCCCeeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc----ccHHHHHHHHHHH Confidence 4443 346789999999999999999999999999999999999999999999999976543 2568889999999 Q ss_pred HHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC-----CCCCcCe Q lcl|NC_019710. 326 TLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLP-----PLPGGDV 400 (424) Q Consensus 326 tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~-----p~~ggd~ 400 (424) ||.|++++|+++|+++|+++.+.. ....+..+.......+.+++++|++|+||+|+.++.. ++++++. 
T Consensus 296 ~l~p~~~~i~~~l~~~l~~~~~~~-------~~~~~~~~~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~~~~ 368 (382) T protein:vir:48 296 AVSRYLRPFLSELSQKLSCDVDAD-------IFPAVDPTGSNYISRINSLVKTGTLAQNQGLYILQQAEILPKELPNGEN 368 (382) T ss_pred HHHHHHHHHHHHHHHHhcChhhhh-------hhhhhccchhHHHHHHHHHhhcCccCHHHHHHHHhhCCCCCcchhhhhc Confidence 999999999999999999876532 2222333444555667788888999999999886432 2344444 Q ss_pred eeecccccchhhccccCCCccC Q lcl|NC_019710. 401 AMRQSQYVPITDLGTNKEPRNN 422 (424) Q Consensus 401 ~~~~~n~~~~~~~~~~~~~~~~ 422 (424) +..+ ++ .+++.++| T Consensus 369 ~~~~-----~~---GGd~~~~~ 382 (382) T protein:vir:48 369 PNST-----LK---GGEEDGQD 382 (382) T ss_pred CCCC-----CC---CCCCCCCC Confidence 3211 11 11111112 No 84 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=100.00 E-value=6.8e-76 Score=432.69 Aligned_cols=391 Identities=11% Similarity=0.065 Sum_probs=293.2 Q ss_pred CCCcccccCCCccHHHHHHhhccCcccccccccccccccccccccCCcccc----HHHHhhhHHHHHHHHHHHHhhhhCc Q lcl|NC_019710. 3 EPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSIN----DERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 3 ~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) -=++-|++++- +=+.++++.. .++....... ..|.. ..++ .+.+..+++|++||++||+.||++| T Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~--~~~~~~~~~~-------~~~~~-pp~~~~~La~~~~~n~~v~scI~~ia~~ia~~~ 69 (540) T protein:vir:41 1 MFNYHLSIKSL-EKYRAIKGDT--DSQALKEDRF-------EEYVE-PKVHPLVLLSLLQVNPYHASACSIKANDILRTG 69 (540) T ss_pred CCCcccChhhc-cchhhhhccc--cccccccCCC-------Ccccc-CCCCHHHHHHHHHhcHHHHHHHHHHHHHHhcCC Confidence 22344555543 1122233211 1111111100 11110 0111 3456678999999999999999999 Q ss_pred eeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEc Q lcl|NC_019710. 
79 LDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLV 158 (424) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~ 158 (424) ++++.+. +.+.. ..||++||+++||+.++.+++++||||++++|+..|.+++||||+|.+|++..+ T Consensus 70 ~~i~~~~-----------~~~~~---~lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~~~ 135 (540) T protein:vir:41 70 YLIDGDD-----------GGVEE---LLRACRPSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVHRD 135 (540) T ss_pred ceEecCc-----------cchhh---hccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCcceEEeEc Confidence 9986432 22322 249999999999999999999999999999999999999999999999998877 Q ss_pred CCceEEE--------E-----------EecCceEEecHhHeeEecCcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 159 GKKVVYR--------Y-----------QRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFF 218 (424) Q Consensus 159 ~~~~~~~--------~-----------~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~ 218 (424) +...+.. + ..+.....++++||||+|.++ .++++|+||+.++..++....++++++.++| T Consensus 136 ~~~~~~~~d~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~eViHir~~~~~~~~~G~Spi~~~~~~i~~~~~~~~~~~~~f 215 (540) T protein:vir:41 136 GSRYMQTWDGIHVTYFKDYRYEGEVNPDNGEDQDGVGANEIIFIHLPSPICSYYGVPRYLSAAPSILAMQKIDEYNYAFF 215 (540) T ss_pred CceeEeeecCceeeeeecccccceeeccccccceeecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 6543211 1 112234578999999999776 6789999999999999999999999999999 Q ss_pred hccCCCceeEEcCCCCCCHH---------HHHHHHHHHHHHhCC--cccCcceecC------CCceeeeccCChhHHHHH Q lcl|NC_019710. 
219 ANGAKSPQILSTGEKVLTEQ---------QRSQVEENFKEIAGG--PVKKRLWILE------AGFSTSAIGVTPQDAEMM 281 (424) Q Consensus 219 ~ng~~p~~vl~~~~~~~~~~---------~~~~~~~~~~~~~~~--~~ag~~~~l~------~g~~~~~l~~s~~d~~~~ 281 (424) +||++|+++|+.+....+++ .++.+++.++..+.+ .|+|++++|+ +|++|++++++++|+||+ T Consensus 216 ~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~g~~~~pl~~~~~d~qfl 295 (540) T protein:vir:41 216 DNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTVEVTFTPLNTSQKELSFR 295 (540) T ss_pred hccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCcccceeEEecccchhHHHHH Confidence 99999999999886654432 234556666554433 5889999984 799999999999999999 Q ss_pred HHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhh Q lcl|NC_019710. 282 ASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLL 361 (424) Q Consensus 282 e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~ 361 (424) |.+++++++||++|||||.+||..+.++++++|+|++.+.|+++||.|++++||++||++|++..+. +++++||.+.++ T Consensus 296 e~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~~-~~~i~f~~~~ll 374 (540) T protein:vir:41 296 EYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRTYYESVVRPQQEIVSSVLTDFIQLKLDP-GARFVFNEEILM 374 (540) T ss_pred HHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC-ceEEEecchhhc Confidence 9999999999999999999999988888888999999999999999999999999999999876553 578999999999 Q ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC-cCeeeecccccchhhccccC---CCccCC-----C Q lcl|NC_019710. 362 RGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG-GDVAMRQSQYVPITDLGTNK---EPRNNG-----A 424 (424) Q Consensus 362 ~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g-gd~~~~~~n~~~~~~~~~~~---~~~~~g-----~ 424 (424) +.|.++ .+.+++++|++|+||+|+.+ +++|+ +|.++.|.|+...+..+..+ +.+.+. 
+ T Consensus 375 ~~D~~~---~~~~lv~~G~lT~NE~Re~L--~g~e~gdd~~l~p~n~~~~~~~~~~~~~~~~~~~~~~k~~~ 441 (540) T protein:vir:41 375 ESEFVH---NYALLVQCGVLTPSEVREKL--FGLDGGPDMFMVPSSIGKSAMKRQKRNYEKNQINEIKRTYA 441 (540) T ss_pred chHHHH---HHHHHHhCCCCCHHHHHHHh--CcCcCCCcccccccccccccccccccccCCCCccccccccc Confidence 886554 46678999999999999854 44444 46667787776533322111 000000 1 No 85 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=100.00 E-value=2.6e-75 Score=429.47 Aligned_cols=392 Identities=12% Similarity=0.076 Sum_probs=291.7 Q ss_pred CCCcccccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccH----HHHhhhHHHHHHHHHHHHhhhhCc Q lcl|NC_019710. 3 EPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSIND----ERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 3 ~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) -=.|-|.+|+. .+-...+..... ......... ..+.. ..++. +.+..+++|++||++||++||++| T Consensus 1 ~~~~~~~i~s~----~~~~~i~~~~~~---s~~~~~~~~--~~~~~-pp~~~~~la~l~~~n~~v~scI~~ia~~IA~l~ 70 (542) T protein:vir:41 1 MFNYHLSIRSL----EKYKAIKREEVE---SQALGETRF--EEYVE-PKVNPLVLLSLLQVNPYHASACSIKANDIIRTG 70 (542) T ss_pred Ccccccccccc----ccchhhhhcccc---ccccccccC--Ccccc-CCCCHHHHHHHHhhcHHHHHHHHHHHHHHhhCc Confidence 11223333332 001111111100 011100000 11111 12333 335568999999999999999999 Q ss_pred eeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEc Q lcl|NC_019710. 79 LDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLV 158 (424) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~ 158 (424) |++++... 
..+++..||++||+++||+.++.+++++||||++++|+..|.+.+|+||+|.+|++..+ T Consensus 71 ~~~~~~~~-------------~~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v~~d 137 (542) T protein:vir:41 71 YILEGDDE-------------GVVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRVHKD 137 (542) T ss_pred eeeecccc-------------hhhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceEEEEc Confidence 99964321 12344569999999999999999999999999999999999999999999999999887 Q ss_pred CCceEEEE-----------E--------ecCceEEecHhHeeEecCcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 159 GKKVVYRY-----------Q--------RDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFF 218 (424) Q Consensus 159 ~~~~~~~~-----------~--------~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~ 218 (424) ++.....+ . .+.....++++||||+|+++ .++++|+||+.++..++.+..++++++.++| T Consensus 138 ~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g~~~~~~~~~eIiHir~~~~~~~~~Glspi~~~~~~i~~~~~~~~~~~~~f 217 (542) T protein:vir:41 138 GSRYRQTWDGVNITHFKDYRYEGEINPETGEDQDSVGANELVFIHIPSPVCSYYGVPRYVSAAPAILAMQKIDEYNYAFF 217 (542) T ss_pred CCeeEeeecCCcceeEEeecccccccccccccccccCcccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 66532211 1 11123468899999999876 6889999999999999999999999999999 Q ss_pred hccCCCceeEEcCCC---------CCCHHHHHHHHHHHHHHhCC--cccCcceec------CCCceeeeccCChhHHHHH Q lcl|NC_019710. 219 ANGAKSPQILSTGEK---------VLTEQQRSQVEENFKEIAGG--PVKKRLWIL------EAGFSTSAIGVTPQDAEMM 281 (424) Q Consensus 219 ~ng~~p~~vl~~~~~---------~~~~~~~~~~~~~~~~~~~~--~~ag~~~~l------~~g~~~~~l~~s~~d~~~~ 281 (424) +||++|+++|+++.. 
..++++.+.+++.|++.+.+ .|+|+++++ ++|++|++++++++|++|+ T Consensus 218 ~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~pl~~~~~d~qfl 297 (542) T protein:vir:41 218 DNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTFTPLNTSQKELSFR 297 (542) T ss_pred hccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCcccceeEEEcCCChhHHHHH Confidence 999999999988744 23567888899988876543 578889998 4799999999999999999 Q ss_pred HHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhh Q lcl|NC_019710. 282 ASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLL 361 (424) Q Consensus 282 e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~ 361 (424) |.+++++++||++|||||.+||..++++++++|+|++.+.|+++||.|+++.||++||++|+++.++ .++++|+.+.++ T Consensus 298 e~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~~-~~~~~f~~~~ll 376 (542) T protein:vir:41 298 EYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVTRRTYYESVVRPQQNIISSILTDFFQVKFNP-KTRFKFNDETLL 376 (542) T ss_pred HHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC-ceEEEecchhhc Confidence 9999999999999999999999998888888899999999999999999999999999999887665 478999999999 Q ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe-eeecccccchhhcccc---CCCcc------CCC Q lcl|NC_019710. 362 RGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDV-AMRQSQYVPITDLGTN---KEPRN------NGA 424 (424) Q Consensus 362 ~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~-~~~~~n~~~~~~~~~~---~~~~~------~g~ 424 (424) +.|. .+.+..++++|++|+||+|+.+ +++|+||+ ++.|.|..... 
...+ .+..+ .-+ T Consensus 377 ~~d~---~~~~~~~v~~GilT~NE~Re~L--~g~~pgdd~~l~p~~~~~~~-~~~~~~n~~~~~~~~~~k~~~ 443 (542) T protein:vir:41 377 ESDS---VRNCALLVQSGVLTPAEARERL--FGLDGGPDIFMVPSKGAAKS-VKRQERNYEKNQIREIRKIYA 443 (542) T ss_pred chHH---HHHHHHHHhCCCCCHHHHHHhh--CCCCCCCccccccccccccc-cccCCcCCCCCchhhhhhccc Confidence 8764 4456779999999999999753 44444554 45565543221 1111 01000 000 No 86 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=100.00 E-value=3.9e-75 Score=428.52 Aligned_cols=371 Identities=17% Similarity=0.149 Sum_probs=294.6 Q ss_pred cHHHHhhhHHHHHHHHHHHHhhhhCceeEeecccc-Cccc-cccccchhHHhhccCCCCCC--------CHHHHHHHHHH Q lcl|NC_019710. 53 NDERILQISTVWRCVSLISTLTACLPLDVFETDQN-DNRK-KVDLSNPLARLLRYSPNQYM--------TAQEFREAMTM 122 (424) Q Consensus 53 ~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~-~~~~-~~~~~~~l~~lL~~~PN~~~--------s~~~f~~~~~~ 122 (424) -.+.+..+++|++||++||++||++||+++.+... +... ....++....++..+||+.| |+.+||+.++. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 23344458999999999999999999999765422 2111 12223334556777888765 56689999999 Q ss_pred HHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCceE---------EEE----------------------EecCc Q lcl|NC_019710. 123 QLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVV---------YRY----------------------QRDSE 171 (424) Q Consensus 123 ~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~---------~~~----------------------~~~~~ 171 (424) +++++||||++++|+..|++++||||+|.+|++..+..... +.+ ...+. 
T Consensus 81 ~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNGDLDPVFVDADDGSTGT 160 (467) T ss_pred HHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecceeEeecCCceeeEEeccccceeecccceeeeeeeeccccccc Confidence 99999999999999999999999999999999877754321 110 01234 Q ss_pred eEEecHhHeeEecCcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019710. 172 YADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 172 ~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) ...++++||||+|.++ .++++|+||+.++..++.++.++.+++.++|+||++|+|+|+.+....++++.+.+++.|+.. T Consensus 161 ~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~ 240 (467) T protein:vir:31 161 SVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKGAELTEKGREEMRNLIEDN 240 (467) T ss_pred eeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHhh Confidence 5679999999999775 578999999999999999999999999999999999999999876667888899999998765 Q ss_pred h------------CCcccCcceecCCCceeeecc--------CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCc Q lcl|NC_019710. 251 A------------GGPVKKRLWILEAGFSTSAIG--------VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS 310 (424) Q Consensus 251 ~------------~~~~ag~~~~l~~g~~~~~l~--------~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~ 310 (424) . 
+..++++++++++|+++++++ .+++|+||.|++++++++||++|||||.+||..+++++ T Consensus 241 ~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~ 320 (467) T protein:vir:31 241 NEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVHDVPPVIAGVVESGAF 320 (467) T ss_pred hcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCc Confidence 4 446788899998887666654 36789999999999999999999999999998776554 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhh-ccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Q lcl|NC_019710. 311 WGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRT 389 (424) Q Consensus 311 ~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~-~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~ 389 (424) ++|+|++.+.|+++||.|++++|+++||.+|++..+. ..++++|+.+.+++.|.+++++.++.++++|++|+||+|++ T Consensus 321 -~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~ 399 (467) T protein:vir:31 321 -STDAEEQRKEFAEETIQPKQHDFGELLYELVHKQGLDAPDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDE 399 (467) T ss_pred -ccCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccCCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 3589999999999999999999999999999987664 46789999999999999999999999999999999999999 Q ss_pred hCCCCCCCcCeee-------ecccccchhhccccCCCc-cCCC Q lcl|NC_019710. 390 DNLPPLPGGDVAM-------RQSQYVPITDLGTNKEPR-NNGA 424 (424) Q Consensus 390 lg~~p~~ggd~~~-------~~~n~~~~~~~~~~~~~~-~~g~ 424 (424) +|+||+++++.+- +.++..|.+..+++.++. 
++-+ T Consensus 400 ~Gl~pi~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 442 (467) T protein:vir:31 400 FGFEPFPEEHVYGGETLVAEVTGGSGPGGGIGDQIEQLVEDRA 442 (467) T ss_pred hCCCCCCcccccCCcccccccccccCCCCcccCcCCCCCCCcc Confidence 9999996543221 111222222222211111 1111 No 87 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=100.00 E-value=1.3e-69 Score=398.24 Aligned_cols=408 Identities=13% Similarity=0.068 Sum_probs=283.6 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCccccc-c----ccccccc-c---------------------------ccccccc Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVT-P----NQGSQTG-P---------------------------VSAHGYL 47 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~-~----~~~~~~~-~---------------------------~~~~~~~ 47 (424) |+.+- -.+|||+|+..+++.+.-.. | ....... | ....+.. T Consensus 1 ~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~ 74 (648) T protein:vir:79 1 MARKV------WGRGFWSRISLMWRDEDDDKEPLVLEESMQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGG 74 (648) T ss_pred Cccch------hcchhhhhhhhhccCccccccccccccccccCCCccccCCCCcccccccccchhHHHHHhHHHHHhhcC Confidence 55432 35899999999998443222 0 0000000 0 0000000 Q ss_pred CC-----ccccH----HHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHH Q lcl|NC_019710. 48 GD-----SSIND----ERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFRE 118 (424) Q Consensus 48 ~~-----~~~~~----~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~ 118 (424) ++ ..++. +.+..+|.|++||++||++||+++|.++.++... ...++. 
.++..+||++||+++||+ T Consensus 75 g~~~~~epp~d~~~l~~l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~~-----~~~~~~-~~ll~rPn~~~t~~~f~~ 148 (648) T protein:vir:79 75 GGRDFEEPEFDFNEITSAYNTEGYVRQAVDKYIEMMFKADWDFVSKNPNA-----VEYIRM-RFTLMAEATQIPTNQLFI 148 (648) T ss_pred CccccccCCcCHHHHHHHHhcChHHHHHHHHHHHHHhhCcceEEecCCcc-----chhhHH-HHHhhccCCCCCHHHHHH Confidence 11 11222 3455699999999999999999999987654321 122333 344569999999999999 Q ss_pred HHHHHHHHcCCeEEEEeeCCCCc---------------eeEEEEeccceEEEEEcCCce--EEEEEe--cCceEEecHhH Q lcl|NC_019710. 119 AMTMQLCFYGNAYALVDRNSAGD---------------VISLLPLQSANMDVKLVGKKV--VYRYQR--DSEYADFSQKE 179 (424) Q Consensus 119 ~~~~~~l~~G~a~~~~~r~~~G~---------------~~~l~~l~p~~v~~~~~~~~~--~~~~~~--~~~~~~~~~~e 179 (424) .++.+++++||||++++|+.+|. +.++||++|.+|++..+..+. .|.|.. ++..+.|+++| T Consensus 149 ~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~~g~~~~Y~y~~~g~~~~~~~~~~d 228 (648) T protein:vir:79 149 EIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRDKFGMIKGWQQEQEGQDKPQKFKPED 228 (648) T ss_pred HHHHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeEEEEcCCCceeeeEEEecCCceeEEecCcc Confidence 99999999999999999999884 478999999999998876553 344443 33456789999 Q ss_pred eeEecCc-CCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCc Q lcl|NC_019710. 
180 IFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKR 258 (424) Q Consensus 180 vih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~ 258 (424) |||||++ +.++++|+||+.++..+|.+..++.++..++|+||++|+++++++.+....+..+..++.+...+.+.+.++ T Consensus 229 IIHik~~~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~~~~~~e~~k~~~e~~~~~~~~~~i~g 308 (648) T protein:vir:79 229 IVHIYYKREKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGLEQEGFGAEEGEVDLVRGEVENMDVEG 308 (648) T ss_pred EEEEccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCccchHHHHHHHHHHHHhcccccccc Confidence 9999965 578899999999999999999999999999999999999999986544444444444444544443322222 Q ss_pred ceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 259 LWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSI 338 (424) Q Consensus 259 ~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l 338 (424) ..+....+.+.+. .+++|+||++.+++++++||++|||||.+||..+++++ ++.++ ...++..++.|++..++..+ T Consensus 309 g~v~~~~~~i~~~-~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG~~~~ss~--stae~-~~~~~~~~i~~l~~~i~~~l 384 (648) T protein:vir:79 309 GMVTTERVNISSI-ASNQIIDAKEYLKHFEQRAFTVLGVSELMMGRGGTASR--STGDN-LSSDFKDRIKALQKVMATFI 384 (648) T ss_pred cccccceeecccc-CCHHHHHHHHHHHHHHHHHHHHhCCCHhHcccCCCccc--hHHHH-HHHHHHHHHHHHHHHHHHHH Confidence 2222223333322 26789999999999999999999999999998765544 35544 44566778888877776655 Q ss_pred hhhcc----Chhh-----hccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe-eeeccccc Q lcl|NC_019710. 339 QRWLI----PAKD-----VGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDV-AMRQSQYV 408 (424) Q Consensus 339 ~~~L~----~~~~-----~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~-~~~~~n~~ 408 (424) +..+. .+.. ...++++|+++++++.|.+++++.+.+++++||||+||+|+++|+||+|+|+. .++..+.. 
T Consensus 385 e~~~~~~ll~e~~l~~~l~~d~~ieF~~~~Llr~D~~~~a~~~~~l~~~GilT~NEaR~~lGlpPi~~g~~~~~l~~~~~ 464 (648) T protein:vir:79 385 NEFMVKEILMEGGFDPVLNPDDKVEFRFNEIDMDSKIKLENQAVFLYEHNAISEDEMRELIGRDPVDDGEGRAKMHLQMV 464 (648) T ss_pred HHHHHHHHhhhhhccccccccceEEEeecccchhhHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCccccccccc Confidence 54332 2211 12356899999999999999999999999999999999999999999998754 34555555 Q ss_pred chhhcccc----CCC-------ccCCC Q lcl|NC_019710. 409 PITDLGTN----KEP-------RNNGA 424 (424) Q Consensus 409 ~~~~~~~~----~~~-------~~~g~ 424 (424) +....... ..+ +++.+ T Consensus 465 ~~~~~~~~~~~~~~~~~~~~~~a~~eg 491 (648) T protein:vir:79 465 TIAQATALAALAPTPAGGSSASASGDK 491 (648) T ss_pred cchhccccccCCCCCCCCCCCCccccc Confidence 43322111 000 00000 No 88 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=100.00 E-value=4.2e-70 Score=400.92 Aligned_cols=413 Identities=15% Similarity=0.139 Sum_probs=303.5 Q ss_pred CCCCCcccccCC----CccHHHHHHhhccCcccccccccccccccccccccCCccccHH---HHhh-hHHHHHHHHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRT----NNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDE---RILQ-ISTVWRCVSLIST 72 (424) Q Consensus 1 ~~~~~~~~~~~~----~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~-~~~v~~~i~~ia~ 72 (424) |...|-..+-+- ..|.-...+ + ...+.+..........+... -.+++. .... ++++++||+++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~----~--~~~~~~~~~~~~~~~~~~~~-p~~~~~~L~~~~e~~~~~~~~i~~~~~ 73 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADLA----K--SPNSTQIPDHRIQSHNVGVN-PPYNPDRLAAFLELNETLATGIRKKSR 73 (651) T ss_pred CCCccceeeeeEEEeeccccccccc----c--cccccccchhhhcccCCCCC-CCCCHHHHHHHHhcChHHHHHHHHHhh Confidence 444332211100 000000000 0 00000000001111111111 112332 2333 8999999999999 Q ss_pred hhhhCceeEeecc-ccCc---cccc-------cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCc Q lcl|NC_019710. 
73 LTACLPLDVFETD-QNDN---RKKV-------DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGD 141 (424) Q Consensus 73 ~ia~~~~~~~~~~-~~~~---~~~~-------~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~ 141 (424) .||+++|.+.... -++. .++. ...++....+...+|+.+|+.++++.++.|++.+|++|+.++++..|. T Consensus 74 ~iag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~~g~ 153 (651) T protein:vir:99 74 YEVGFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDIEGR 153 (651) T ss_pred hhhccCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcCccc Confidence 9999999885422 1111 1111 123445555666789999999999999999999999999999999999 Q ss_pred eeEEEEeccceEEEEEcCCc----------------------------------eE------------------------ Q lcl|NC_019710. 142 VISLLPLQSANMDVKLVGKK----------------------------------VV------------------------ 163 (424) Q Consensus 142 ~~~l~~l~p~~v~~~~~~~~----------------------------------~~------------------------ 163 (424) +..++++++..+.+..+... +. T Consensus 154 pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~~~ 233 (651) T protein:vir:99 154 PVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEPTIR 233 (651) T ss_pred hhhhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcceeEE Confidence 99999999887754332100 00 Q ss_pred --------------------EEEEecCceEEecHhHeeEecCcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019710. 
164 --------------------YRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGA 222 (424) Q Consensus 164 --------------------~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~ 222 (424) |.+...+....++++||||||+++ .++++|+||+..+..++.++.++++++.++|+||+ T Consensus 234 ~~~d~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i~~a~~a~~~~~~~f~NG~ 313 (651) T protein:vir:99 234 YREDEESEREPIFVDRETGDVTTGDANGLENRPANELIFIPNPSILEDDYGVPDWVSAIRTISADEAAKDYNRDFFDNDT 313 (651) T ss_pred eccCcceeeeeecccceeeeEEEcCCCceeEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 111122234568899999999886 58899999999999999999999999999999999 Q ss_pred CCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCC-----------CceeeeccCCh-hHHHHHHHHHHHHHH Q lcl|NC_019710. 223 KSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA-----------GFSTSAIGVTP-QDAEMMASRKFQVSE 290 (424) Q Consensus 223 ~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~-----------g~~~~~l~~s~-~d~~~~e~~~~~~~~ 290 (424) +|++||+.+....++++.+.+++.|++..+ |+|++++|+. |++|+++++++ +|+||+|+++++.++ T Consensus 314 ~p~gil~~~~~~ls~e~~~~lr~~~~~~~~--nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfle~r~~~~~e 391 (651) T protein:vir:99 314 IPRMVIKVTGGELSEESKRDLRQMLNGLRE--ESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFRQFREKNEHE 391 (651) T ss_pred CCceEEEecCCCCCHHHHHHHHHHHHHHhc--cCCceEEeecccccccccccCCceEEEcCcCchhhHHHHHHHHHHHHH Confidence 999999988777889999999999998665 6788988865 99999999876 599999999999999 Q ss_pred HHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhc---cceeeecchhhhccCHHH Q lcl|NC_019710. 291 LARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG---RIHAEHNLDGLLRGDSAS 367 (424) Q Consensus 291 Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~---~~~~~f~~~~~~~~d~~~ 367 (424) ||++|||||.+||..++++ ++|+|++.+.|+++||.|++.+||++||++|+++.++. 
.++++|+.+.+++.|.++ T Consensus 392 Ia~afgVPp~~lG~~~~~~--~sn~E~~~~~f~~~tL~P~~~~ie~eln~kLl~~~e~~~~~~i~~ef~~~~llr~D~~~ 469 (651) T protein:vir:99 392 IAKVLEVPPVKIGVTDSAN--RSNSDQQDKDFALEVIQPEQHTFAEWLYQIIHQQALGVTDWTIEYELRGADQPKQEAQL 469 (651) T ss_pred HHHHhCCCHHHhccCCCCC--cccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCceEEEEeccchhhhccHHH Confidence 9999999999999887654 56999999999999999999999999999999987653 256788889999999999 Q ss_pred HHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CcCeeeecccccchhhccccCCC-------ccCCC Q lcl|NC_019710. 368 RAAFMKAMGESGLRTINEMRRTDNLPPLP--GGDVAMRQSQYVPITDLGTNKEP-------RNNGA 424 (424) Q Consensus 368 ~~~~~~~~~~~g~~t~NE~R~~lg~~p~~--ggd~~~~~~n~~~~~~~~~~~~~-------~~~g~ 424 (424) +++.+..++++|+||+||+|+++|+||++ +||.++.+.+........+..+. +++.. T Consensus 470 ~~e~~~~~i~~G~~T~NE~R~~lglppi~~~~gd~~l~~~~~~~~g~~~~gge~~~~~~~~~~~~~ 535 (651) T protein:vir:99 470 AEQRVRAMRLAGVGLVDEAREELGLDPLGEPYGEMTLSEFEAEVAGDVAGGGETEAVHEPPEENKI 535 (651) T ss_pred HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccccccccccCCCCcccccCcccccc Confidence 99999999999999999999999999995 48998888777655432221111 01111 No 89 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=100.00 E-value=9e-63 Score=360.73 Aligned_cols=273 Identities=20% Similarity=0.285 Sum_probs=242.1 Q ss_pred hhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceE Q lcl|NC_019710. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v 153 (424) ||++||++|++++. 
.+|++.++|+.+||++||+++||+.++.+++++||||++++|+.+|.+++|||++|++| T Consensus 1 ia~l~~~~~~~~~~-------~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~~l~~~~v 73 (278) T protein:vir:78 1 MASLPLKMYEDYKV-------VNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVV 73 (278) T ss_pred CccceeEEEecCcc-------cccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEEEECCcee Confidence 99999999987653 35899999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcCCc--eEEEEE-ecCceEEecHhHeeEecCcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEE Q lcl|NC_019710. 154 DVKLVGKK--VVYRYQ-RDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS 229 (424) Q Consensus 154 ~~~~~~~~--~~~~~~-~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~ 229 (424) ++..+.++ .+|.+. .++....|+++||||+|+++ .++++|+||+.++..++....++.+++...+.+ .|+++++ T Consensus 74 ~v~~~~~~~~~~y~~~~~~g~~~~~~~~evih~~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~~--~~~~i~~ 151 (278) T protein:vir:78 74 EMLIENQSRELYYSIHAATGNKLIVHNMDMLHFKHIVASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQK--PDSFMLK 151 (278) T ss_pred EEEEcCCCceEEEEEEcCCceEEEEccccEEEECCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHhcC--CCcEEEE Confidence 99877643 344444 34456889999999999774 678999999999999999999999987665555 4788888 Q ss_pred cCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCC Q lcl|NC_019710. 230 TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKST 309 (424) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~ 309 (424) .+.. 
.++++.+.+++.|++..+ ++|+++++++|++|++++++++|+++.|.++++.++||++|||||.+||..++++ T Consensus 152 ~~~~-l~~e~~~~~~~~~~~~~~--~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~ 228 (278) T protein:vir:78 152 YGSN-VGKEKRQQVLEDFKQYYE--ENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTN 228 (278) T ss_pred eCCC-CCHHHHHHHHHHHHHHhc--cCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCC Confidence 8765 556778888888887664 5789999999999999999999999999999999999999999999999887654 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhc-cceeeecchhh Q lcl|NC_019710. 310 SWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG-RIHAEHNLDGL 360 (424) Q Consensus 310 ~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~-~~~~~f~~~~~ 360 (424) ++|++++.++|++.||.|+++.|+++||++|+++.++. +++++||.+.| T Consensus 229 --~sn~~~~~~~~~~~~l~P~~~~i~~~ln~~L~~~~e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 229 --FAKNEELNRFYLQHTLLPIVKQYEEEFNRKLLTKTDREKIGILNLTLNLI 278 (278) T ss_pred --cccHHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhHhcCCceEEEecccC Confidence 56999999999999999999999999999999998864 68999999999 No 90 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=100.00 E-value=2.4e-60 Score=347.45 Aligned_cols=353 Identities=14% Similarity=0.172 Sum_probs=260.2 Q ss_pred cccCCCccHHHHHHhhccCccccccc-c-cccccccccccccCCccccHHHHhhhHHHHHHHHHHHHh-hhhCcee---- Q lcl|NC_019710. 8 IDLRTNNGWWARLKSWFVGGRLVTPN-Q-GSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTL-TACLPLD---- 80 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~-ia~~~~~---- 80 (424) |+.|..+..-+.-.+.........+. . .........+.+...+.+. ...++..|+.+..+. ....|+. 
T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~fg~p~~~~-----~~~~~~~~~~~~~~~~~~~~pi~~~~l 75 (368) T protein:vir:79 1 MSRNKTRRAARAASAHVRTANTDAPTEHHTDRAAQAEVFSFGDPVEVL-----DRRELLDYVECMRMGQWYEPPMPWDGL 75 (368) T ss_pred CCccccccchhccCcccccccccCcchhhccccCceEEEEcCCceeec-----chhhHHHHHHHHhccchhccCcCHHHH Confidence 66666544333222222111111110 0 0111111122222222121 111222222222221 1111111 Q ss_pred --EeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEc Q lcl|NC_019710. 81 --VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLV 158 (424) Q Consensus 81 --~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~ 158 (424) +++....... .....|++..++ .+||++||+++|++ ++.+++++||||++++|+..|++++|+|++|.+|.+..+ T Consensus 76 a~~~~~~~~h~~-~~~~~~n~l~l~-~~Pn~~~t~~~f~~-l~~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~~~~ 152 (368) T protein:vir:79 76 ARSFRAAAHHSS-AVYVKRNILVST-FIPHPLLSRATFER-LVLDWQVFGNAYLERRENVLGGTIRLDTPLAKYVRRGLD 152 (368) T ss_pred HHHHhhccccch-hhhhhcchhhhh-cCCCcCCCHHHHHH-HHHHHhhcCCeEEEEEEcCCCCEEEEEEeCcccceeecc Confidence 1111111111 122346676555 59999999999975 788999999999999999999999999999999998887 Q ss_pred CCceEEEEEecCceEEecHhHeeEecCcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH Q lcl|NC_019710. 159 GKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE 237 (424) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~ 237 (424) ++.. 
|++..++..++|+++||||+|.++ .++++|+||+.++..++.+..++..+..++|+||++|+++|+.+....++ T Consensus 153 ~~~~-~~~~~~~~~~~~~~~dIihir~~~~~~~~yGlsp~~~a~~si~l~~aa~~~~~~~~~NGa~~~gil~~~~~~l~~ 231 (368) T protein:vir:79 153 LNTY-FFVQNWQQPYTFAAGSVFHLQEPDINQEVYGLPEYLSALNATWLNESATLFRRRYYKNGSHAGFILYMTDAAQKQ 231 (368) T ss_pred CCEE-EEEecCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCH Confidence 7654 444556677899999999999876 56899999999999999999999999999999999999999887666789 Q ss_pred HHHHHHHHHHHHHhCCcccCcceec-----CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCccc Q lcl|NC_019710. 238 QQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWG 312 (424) Q Consensus 238 ~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~ 312 (424) ++.+.+++.|+++.|..|+|+++++ ++|++|++++.+++|+||.|.+++++++||++|||||.+||..++++.++ T Consensus 232 e~~~~lk~~~~~~~G~~N~g~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~ 311 (368) T protein:vir:79 232 EDVDTLREAMKSAKGPGNFRNLFMYAPNGKKDGIQLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPPQLMGIIPNNTGGF 311 (368) T ss_pred HHHHHHHHHHHHhcCCcccCceeEecCCCCccceeEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCHHHccccCCCCCcc Confidence 9999999999999999999999998 68999999999999999999999999999999999999999988888888 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHH Q lcl|NC_019710. 313 SGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMG 376 (424) Q Consensus 313 ~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~ 376 (424) +|+|++.+.|+++||.|+++.|+ +++.+|.. 
.+++|+...+++.|.+.++....+-- T Consensus 312 sn~e~~~~~f~~~~l~Pl~~~ie-~ln~~l~~------e~~rF~~~~l~~~D~~a~a~~~~rsa 368 (368) T protein:vir:79 312 GDVEKAAMVFARNEVKPLQDRLL-AINDWIGD------EVVRFAPYALGGHDQPAAAPGGQRSA 368 (368) T ss_pred ccHHHHHHHHHHHHHHHHHHHHH-HHHhccCc------ceeeechhHhhcccccccCCcccccC Confidence 99999999999999999999998 68877743 25789999999999988876322211 No 91 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=100.00 E-value=1.3e-57 Score=332.36 Aligned_cols=333 Identities=15% Similarity=0.155 Sum_probs=245.6 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhC--- Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACL--- 77 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~--- 77 (424) |-++...+.-... + .....+.+ +.. ++..+....++.|++++.+..+.. T Consensus 1 ~~~~~~~~~~~~~------------------~------~~~~~~~~-~~~---p~~~~~~~~~~~~~~~~~~~~~~~~ep 52 (348) T protein:vir:26 1 MTEQLIHSHTTDG------------------T------ESKSVYSF-DPN---PEPVDTNSWMTRYCELFYNDFDDYWEP 52 (348) T ss_pred CCccccchhhccc------------------c------CCceEEEe-cCC---CeeecCcchHHHHHHHHhcCCCccccC Confidence 4333221111111 0 00000111 100 112233445555555554443321 Q ss_pred ceeE------eeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccc Q lcl|NC_019710. 78 PLDV------FETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSA 151 (424) Q Consensus 78 ~~~~------~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~ 151 (424) |+.. ++.+.--...-..+.+-+.. ..+||++||+.+|++. +.+++++||||++++|+..|++++|+|++|. 
T Consensus 53 p~~~~~La~l~~~n~~h~~~i~~k~N~l~~--~~~Pn~~~t~~~f~~~-~~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~ 129 (348) T protein:vir:26 53 PISLKGLAEIANANGYHGSLLKARANYVAG--RFMNGGGLPMYKMNSA-CWDYFGLGMSAFVKIRSYLKNVIALEPLPMV 129 (348) T ss_pred CCCHHHHHHHHhhhhhhhhhHhhhhhHHhh--cccCCCCCCHHHHHHH-HHHHHhcCCeEEEEEEcCCCcEEEEEEecCc Confidence 2211 10000000000000111111 2379999999999765 5799999999999999999999999999999 Q ss_pred eEEEEEcCCceEEEEEecCceEEecHhHeeEecCcCC-CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEc Q lcl|NC_019710. 152 NMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILST 230 (424) Q Consensus 152 ~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~ 230 (424) +|++..++. +|++..++....|+++||||||.+++ ++++|+||+.+++.++.+..++..+.+++|+||++|++||+. T Consensus 130 ~v~~~~d~~--~~~~~~~g~~~~f~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~ 207 (348) T protein:vir:26 130 HMRKRKNGD--FVQLLRNNEQKVFKAKDVIFIPQYDPQQQIYGLPDYLGSIQSSLLNRDATLFRRRYYLNGAHMGFIFYA 207 (348) T ss_pred eeEeeecCc--EEEEEecCeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEe Confidence 999987754 45666777888999999999998774 689999999999999999999999999999999999999987 Q ss_pred CCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCC Q lcl|NC_019710. 231 GEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDV 305 (424) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~ 305 (424) +....++++++.+++.|++..|+.|+++++++ ++|++++|++.+++|+||+|.+++++++||++|||||.++|.. 
T Consensus 208 ~~~~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~ 287 (348) T protein:vir:26 208 TDPNLSEADEKALKEKIASSKGIGNFRSMFVNIPNGKEKGIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGMGGML 287 (348) T ss_pred cCCCCCHHHHHHHHHHHHHhcCcccccceeEEcCCCCccceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHHcccc Confidence 76678899999999999998888999999988 7899999999999999999999999999999999999999998 Q ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHH Q lcl|NC_019710. 306 EKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAF 371 (424) Q Consensus 306 ~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~ 371 (424) ..++.+++|+|++.+.|+.+||.|+++.||++||++|..+.+ .+++||++.... ..++.++ T Consensus 288 ~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln~~l~~~~~---~~~~fdl~~~~e--~~~~~a~ 348 (348) T protein:vir:26 288 PQQGANVPDPLKVSQVYDFYEVIPVCKRFMDAVNNDPEIPDN---LKLKFNLNPGVE--SANGSAV 348 (348) T ss_pred CCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHhhhhCCCCc---cEEEEecCcccc--cchhhcC Confidence 777778889999999999999999999999999999865433 456777764332 2222222 No 92 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=100.00 E-value=3.4e-57 Score=330.18 Aligned_cols=323 Identities=15% Similarity=0.212 Sum_probs=246.5 Q ss_pred cccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHH----------------HHH Q lcl|NC_019710. 8 IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVS----------------LIS 71 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~----------------~ia 71 (424) |+.|.++ -.+...+.. .......+.+...+.|. ...++..++. 
-+| T Consensus 1 m~~~~~~---------~~~~~~~~~----~~~~~~~~~~~~p~~~~-----~~~~~~~~~~~~~~~~~~~~pp~~~~~la 62 (346) T protein:vir:10 1 MKKQLRK---------NLTQNDRLQ----PQAQTEIFSFGDPIPVL-----DRADILNYLECSAMYEKWYNPPMSFDGLA 62 (346) T ss_pred CCcccCC---------CCCcccccc----cccCeEEEecCCcceec-----CchhHHHHHHHhhcCCceEecCCCHHHHH Confidence 4433321 000000000 00011111111111111 1111111111 122 Q ss_pred HhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccc Q lcl|NC_019710. 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSA 151 (424) Q Consensus 72 ~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~ 151 (424) +.+-..+.+ ...-....|.+..++. +||++||+++|++ ++.+++++||||++++|+..|++++|+|++|. T Consensus 63 ~l~~~~~~h--------~~~i~~k~n~l~~l~~-~Pn~~~t~~~f~~-~~~d~ll~Gnay~~i~r~~~G~~~~L~pl~~~ 132 (346) T protein:vir:10 63 KSLRSSTHH--------ESAIITKANILLSTCE-VDSRYLSRRDLSS-FVKDYLVFGNAYFEVVRNRLGQVQRIESPLAK 132 (346) T ss_pred HHHHhhhhc--------chhhhhhhhhHHHHHh-CCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCcEEEEEEecCC Confidence 222222211 0111223467777664 8999999999987 56789999999999999999999999999999 Q ss_pred eEEEEEcCCceEEEE-EecCceEEecHhHeeEecCcCC-CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEE Q lcl|NC_019710. 
152 NMDVKLVGKKVVYRY-QRDSEYADFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS 229 (424) Q Consensus 152 ~v~~~~~~~~~~~~~-~~~~~~~~~~~~evih~r~~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~ 229 (424) +|++..+.+...|.+ ..++....|+++||||+|.+++ ++++|+||+.++..++.+..+++.+..++|+||++|++||+ T Consensus 133 ~v~~~~~~~~~~~~~~~~~g~~~~~~~~dIih~r~~~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~ 212 (346) T protein:vir:10 133 YVRKGLEAGQFYYVPQRFDHQEHEFAKGSIYHLLEPDINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFY 212 (346) T ss_pred ceEEEEcCCeEEEEEEccCCeEEEEecccEEEecCCCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Confidence 999988877765544 4567788999999999998875 68999999999999999999999999999999999999999 Q ss_pred cCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC-----CCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCC Q lcl|NC_019710. 230 TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-----AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGD 304 (424) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~-----~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~ 304 (424) .+....++++.+.+++.|++..|+.|+++++++. .|++++|++.+++|+||.|.+++++++||++|||||.+||. T Consensus 213 ~~d~~l~~e~~~~i~~~~~~~~g~~n~~~~~vl~~~~~~~gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp~llG~ 292 (346) T protein:vir:10 213 MSDASQKQEDVENIRQQLKQSKGVGNFKNLFVHAPNGKKDGIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPPQLMGI 292 (346) T ss_pred eCCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcc Confidence 8766778999999999999999999999999885 47899999999999999999999999999999999999999 Q ss_pred CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCH Q lcl|NC_019710. 305 VEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDS 365 (424) Q Consensus 305 ~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~ 365 (424) .++++.+++|+|++.+.|+++||.|+++.||+ ++.+|.. ..++|+...+++.|. 
T Consensus 293 ~~~~~~~~s~~e~~~~~f~~~~l~P~~~~iee-~n~~L~~------e~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 293 IPNNTGGFGNVADAAEVFFITEIEPLQERLKE-FNQWLGQ------EVIKFKPSKLLQRTQ 346 (346) T ss_pred cCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhccc------ceeeechhhhcccCC Confidence 98888888999999999999999999999985 7767743 257899999999988 No 93 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=100.00 E-value=9.6e-57 Score=327.68 Aligned_cols=327 Identities=15% Similarity=0.201 Sum_probs=248.9 Q ss_pred CCCCCcccccCCCccHHHHHHh--------hc--cCcccccccccccccccccccccCCc----cccH----HHHhhhHH Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKS--------WF--VGGRLVTPNQGSQTGPVSAHGYLGDS----SIND----ERILQIST 62 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~--------~~--~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~----~~~~~~~~ 62 (424) |..+|++-.-+.. ---..-.. .| +.+.+. ......... ...|..+. .++. +.+-.++. T Consensus 26 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~f~fg~p~~v-~~~~~~~~~--~~~~~~~~~~~pp~~~~~La~~~~~~~~ 101 (376) T protein:vir:10 26 MSKRRSRAPRTFA-AAPNPSAGSAAPARAEVFTFDDPTPV-MNRAEILDY--VECWSNGEWFEPPVSFAGLAKSFRASTH 101 (376) T ss_pred chhccCCCcccch-hhhhHhhhccCcceeEEEEcCCceec-cCcchhhhh--hhhhhcCceecCCCCHHHHHHHHhhhHH Confidence 5555544221111 00000000 00 000000 000000000 00011111 1222 22222444 Q ss_pred HHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce Q lcl|NC_019710. 
63 VWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV 142 (424) Q Consensus 63 v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~ 142 (424) ..+||...++.+++ ..+||++||+.+|++ ++.+++++||||++++|+..|++ T Consensus 102 h~s~l~~k~n~l~~---------------------------~~~Pnp~lT~~~f~~-~v~d~ll~Gnay~~~~rn~~G~~ 153 (376) T protein:vir:10 102 HSSALFFKANVLAS---------------------------TFRPHRWLSRHAFER-WALDFLTFGNGYLERRRNMVGGT 153 (376) T ss_pred hhhhHHHHhHHHHh---------------------------ccCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCE Confidence 45555544433322 247999999999985 56789999999999999999999 Q ss_pred eEEEEeccceEEEEEcCCceEEEEEecCceEEecHhHeeEecCcCC-CCccccchHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019710. 143 ISLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANG 221 (424) Q Consensus 143 ~~l~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng 221 (424) ++|+|++|.+|++..+++..+| +..++....|+++||||||.+++ ++++|+|++.+++.++.+..++..|+.++|+|| T Consensus 154 ~~L~pl~~~~vr~~~d~~~~~~-~~~~~~~~~~~~~eViHir~~~~~~~~yGls~~~~a~~si~l~~aa~~f~~~~f~NG 232 (376) T protein:vir:10 154 LRLEPALAKYVRRKADFNGFVY-VNGWQERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENG 232 (376) T ss_pred EEEEEeCCcceEEEeeCCeEEE-EEcCCeEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 9999999999999988776544 44566678899999999998874 689999999999999999999999999999999 Q ss_pred CCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019710. 
222 AKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFG 296 (424) Q Consensus 222 ~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fg 296 (424) ++|++||+.+....++++.+.+++.|++..|..|+++++++ ++|++|+|++.+++|+||.|.+++++++||++|| T Consensus 233 a~pggIl~~~d~~l~~e~~~~lr~~~~~~~G~~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~ 312 (376) T protein:vir:10 233 SHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHR 312 (376) T ss_pred CCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEccCCHHHHHHHHHHHHhHHHHHHHhC Confidence 99999999876667899999999999998888899998888 5799999999999999999999999999999999 Q ss_pred CCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHH Q lcl|NC_019710. 297 VPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSAS 367 (424) Q Consensus 297 VP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~ 367 (424) |||.++|..++++.+++|+|++.+.|+.+||.|+++.|| +++.+|..+ .++||...+++.|.+. T Consensus 313 VPp~llGi~~~~t~~~sn~eq~~~~f~~~~L~Pl~~~ie-eln~~L~~~------~~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 313 VPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFA-ELNDWLGEE------VVRFDDYEIPPAPVAA 376 (376) T ss_pred CCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHH-HHHhhcccc------ccccChhHhhcccccC Confidence 999999999888888899999999999999999999998 588877432 4789999999999988 No 94 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=100.00 E-value=3.9e-56 Score=324.34 Aligned_cols=315 Identities=15% Similarity=0.214 Sum_probs=242.3 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccccccccc------ccccccc----------------ccccCCcc----ccH Q lcl|NC_019710. 
1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGS------QTGPVSA----------------HGYLGDSS----IND 54 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~------~~~~~~~----------------~~~~~~~~----~~~ 54 (424) |..+|++=.-... ..+.... ....+++ ..+..+.. ++. T Consensus 1 ~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~ 64 (351) T protein:vir:79 1 MSKRRSRAPRTFA----------------AAPNPSAGSAAPARAEVFTFDDPTPVMNRAEILDYVECWSNGEWFEPPVSF 64 (351) T ss_pred CCCCCCCCCCCCC----------------CCCchhhhhcccceeEEEEcCCceeecCcchhhhhhhhhhcCceecCCCCH Confidence 5555544211100 0000000 0000000 00111110 111 Q ss_pred HH----HhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCe Q lcl|NC_019710. 55 ER----ILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNA 130 (424) Q Consensus 55 ~~----~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a 130 (424) .. +-.++...+||...++ .+. -..+||++||.++|+ .++.+++++||| T Consensus 65 ~~la~~~~~~~~h~~~l~~k~n-------------------------~l~--~~~~Pnp~~t~~~f~-~~v~d~ll~Gna 116 (351) T protein:vir:79 65 AGLAKSFRASTHHSSALFFKAN-------------------------VLA--STFRPHRWLSRHAFE-RWALDFLTFGNG 116 (351) T ss_pred HHHHHHHhhhHhhhhhhhhhhh-------------------------HHh--hcccCCCCCCHHHHH-HHHHHHHhcCCe Confidence 11 1112222222222111 111 134799999999996 567899999999 Q ss_pred EEEEeeCCCCceeEEEEeccceEEEEEcCCceEEEEEecCceEEecHhHeeEecCcCC-CCccccchHHHHHHHHHHHHH Q lcl|NC_019710. 
131 YALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVA 209 (424) Q Consensus 131 ~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~~-~~~~G~s~~~~~~~~i~~~~~ 209 (424) |++++|+..|++++|+|++|.+|++..+.++++ ++..++....|+++||||+|.+++ ++++|+||+.+++.++.+..+ T Consensus 117 y~~~~r~~~G~~~~L~~l~~~~v~~~~~~~~~~-~~~~~g~~~~~~~~eIihir~~~~~~~~yGl~~~~~a~~si~l~~~ 195 (351) T protein:vir:79 117 YLERRRNMVGGTLRLEPALAKYVRRKADFSGFV-YVNGWQERHEFEPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNES 195 (351) T ss_pred EEEEEECCCCCEEEEEEeCCcceeeeecCCeEE-EEecCceEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHH Confidence 999999999999999999999999988877654 455566678899999999998875 689999999999999999999 Q ss_pred HHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeeccCChhHHHHHHHH Q lcl|NC_019710. 210 MEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASR 284 (424) Q Consensus 210 ~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~s~~d~~~~e~~ 284 (424) +..+.+++|+||++|++||+.+....++++.+.+++.|++..|..|+++++++ ++|++|+|++.+++|+||.|.+ T Consensus 196 a~~~~~~~f~NGa~pg~il~~~~~~ls~e~~~~lk~~~~~~~G~~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~ef~e~k 275 (351) T protein:vir:79 196 STLFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIK 275 (351) T ss_pred HHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHH Confidence 99999999999999999999876667899999999999998888899999888 6789999999999999999999 Q ss_pred HHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccC Q lcl|NC_019710. 285 KFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGD 364 (424) Q Consensus 285 ~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d 364 (424) ++++++||++|||||.++|..++++.+++|+|++.+.|+.+||.|+++.||+ +|.+|.. 
..++||...+++.| T Consensus 276 ~~s~~eI~~a~~VPp~llGi~~~~t~~~~n~e~~~~~f~~~~l~Pl~~~ie~-ln~~lg~------~~~~F~~~~llr~d 348 (351) T protein:vir:79 276 NVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWLGD------EVVTFDDYEIPPAP 348 (351) T ss_pred HHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcCc------ceeeeChhhhcccc Confidence 9999999999999999999998888888999999999999999999999985 7776632 25799999999999 Q ss_pred HHH Q lcl|NC_019710. 365 SAS 367 (424) Q Consensus 365 ~~~ 367 (424) .++ T Consensus 349 ~~a 351 (351) T protein:vir:79 349 VAA 351 (351) T ss_pred ccC Confidence 888 No 95 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=100.00 E-value=9e-56 Score=322.36 Aligned_cols=327 Identities=15% Similarity=0.199 Sum_probs=246.8 Q ss_pred CCCCCcccccCCCccHHH---------HHHh-hccCcccccccccccccccccccccCCcc----ccHHH----HhhhHH Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWA---------RLKS-WFVGGRLVTPNQGSQTGPVSAHGYLGDSS----INDER----ILQIST 62 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~---------~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~----~~~~~~ 62 (424) |..+|++=.-... .-=+ +... .|+.+.+.- ........ ...|..+.. ++... +-.++. T Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~p~~v~-~~~~~~~~--~~~~~~~~~~~pp~~~~~la~~~~~~~~ 76 (351) T protein:vir:78 1 MSKRRSRAPRTFA-AAPNPSAGSAAPARAEVFTFDDPTPVM-NRAEILDY--VECWSNGEWFEPPVSFAGLAKSFRASTH 76 (351) T ss_pred CCCCCCCCCCCCC-CCCchhhhhcccceeEEEEcCCceeec-Ccchhhhh--hhhhccCceecCCCCHHHHHHHHhhhHh Confidence 7766665321111 0000 0000 001111000 00000000 001111111 22221 112333 Q ss_pred HHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce Q lcl|NC_019710. 
63 VWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV 142 (424) Q Consensus 63 v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~ 142 (424) ..+||...++.+++ ..+||++||+++|++ ++.+++++||||++++|+..|++ T Consensus 77 h~~~l~~k~n~l~~---------------------------~~~Pn~~~t~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~ 128 (351) T protein:vir:78 77 HSSALFFKANVLAS---------------------------TFRPHRWLSRHAFER-WALDFLTFGNGYLERRRNMVGGT 128 (351) T ss_pred hhhhhhhhhhHHhh---------------------------cccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCE Confidence 33343332222221 247999999999975 55789999999999999999999 Q ss_pred eEEEEeccceEEEEEcCCceEEEEEecCceEEecHhHeeEecCcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019710. 143 ISLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANG 221 (424) Q Consensus 143 ~~l~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng 221 (424) ++|+|++|.+|++..+.++++|. ..++....|+++||||+|.++ .++++|+|++.+++.++.+..++..+++++|+|| T Consensus 129 ~~L~pl~~~~v~~~~~~~~~~~~-~~~~~~~~~~~~eVihir~~~~~~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NG 207 (351) T protein:vir:78 129 LRLEPALAKYVRRKADFSGFVYV-NGWQERHEFAPDSVFQLVRPDINQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENG 207 (351) T ss_pred EEEEEecCcceEEeeeCCeEEEE-ecCCeEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 99999999999999888775543 445667889999999999887 4789999999999999999999999999999999 Q ss_pred CCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019710. 
222 AKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFG 296 (424) Q Consensus 222 ~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fg 296 (424) ++|++||+.+....++++.+.+++.|++..|..|+++++++ ++|++++|++.+++|+||.|.+++++++||++|| T Consensus 208 a~pggIl~~~~~~ls~e~~~~lr~~~~~~~G~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~qf~e~k~~~~~eIa~a~~ 287 (351) T protein:vir:78 208 SHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHR 287 (351) T ss_pred CCCceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceeeecCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhC Confidence 99999999876667899999999999998888999999988 5789999999999999999999999999999999 Q ss_pred CCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHH Q lcl|NC_019710. 297 VPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSAS 367 (424) Q Consensus 297 VP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~ 367 (424) |||.++|..++++.+++|+|++.+.|+.+||.|+++.||+ ++.+|..+ +++||...+++.|.+. T Consensus 288 VPp~llGi~~~~t~~~sn~e~~~~~f~~~~l~P~~~~iee-~n~~l~~~------~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 288 VPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARFAE-LNDWLGDE------VVRFDDYEIPPAPVAA 351 (351) T ss_pred CCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHHH-HHhhcCcc------ceecChhhhccccccC Confidence 9999999998888888999999999999999999999985 77676332 5899999999999988 No 96 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=100.00 E-value=2.5e-56 Score=325.44 Aligned_cols=323 Identities=15% Similarity=0.156 Sum_probs=247.2 Q ss_pred cccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhh---CceeEeec Q lcl|NC_019710. 
8 IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTAC---LPLDVFET 84 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~---~~~~~~~~ 84 (424) |+.|..+ +.......+...+.+...+.+ +...++..|+...-+.++. .|+... T Consensus 1 m~~~~~~-----------------~~~~~~~~~~~~~~~~~p~~~-----~~~~~~~~~~~~~~~~~~~~~~pP~~~~-- 56 (337) T protein:vir:78 1 MTKRQQQ-----------------PAQAAASSPRPSVVFSMPEAI-----DPTAWMTDYTGVFYNPYGEYYQPPIDRK-- 56 (337) T ss_pred CCCcccC-----------------cccccccCceeEEEecCcccc-----cCcchhHhhhhhhhccCcceecCCCCHH-- Confidence 3333331 000011111112222223333 3344566666666554443 233221 Q ss_pred cccCccccccccchhHHhhccCCCCCCCHH----HHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCC Q lcl|NC_019710. 85 DQNDNRKKVDLSNPLARLLRYSPNQYMTAQ----EFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 85 ~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~----~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~ 160 (424) +-.+-.........+|..+||..++++ ++++.++.|++++||||++++|+..|++++|+|++|.+|++..++. T Consensus 57 ---~La~l~~~~~~h~~~L~~k~N~~~~~f~~~~~~~~~~~~d~ll~GNay~~~~rn~~G~~~~L~pl~~~~v~~~~d~~ 133 (337) T protein:vir:78 57 ---GLAKVARANAHHGAILMARRNMVAGRFTNQRATITAFVHNYLQFGDGGLLKLRNSFGQVVGLHPLSSVYLRRREDGC 133 (337) T ss_pred ---HHHHHhhcchhhhhHHHhhhccccccCcCcHHHHHHHHHHHHhhCCeEEEEEECCCCcEEEEEEeCCceeEeeeCCe Confidence 000000000112345778999877654 7899999999999999999999999999999999999999887654 Q ss_pred ceEEEEEecCceEEecHhHeeEecCcCC-CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHH Q lcl|NC_019710. 161 KVVYRYQRDSEYADFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQ 239 (424) Q Consensus 161 ~~~~~~~~~~~~~~~~~~evih~r~~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~ 239 (424) . 
+++..++....|+++||||+|.+++ ++++|+||+.+++.++.+..+++.+..++|+||++|++||+.+....++++ T Consensus 134 ~--~~~~~~~~~~~~~~~eIiHik~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~~~l~~e~ 211 (337) T protein:vir:78 134 F--VYLQQGKPNLIYRPDDVIWLAQYDPEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLNGAHMGFIFYATDPNMDDDT 211 (337) T ss_pred E--EEEEcCCceEEECCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHH Confidence 3 3344566778899999999998875 789999999999999999999999999999999999999998766678899 Q ss_pred HHHHHHHHHHHhCCcccCcceec-----CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCC-Ccccc Q lcl|NC_019710. 240 RSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKS-TSWGS 313 (424) Q Consensus 240 ~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~-~~~~~ 313 (424) .+.+++.|++..|+.|.++++++ ++|++|+|++.+++|+||+|.+++++++||++|||||.++|...++ +.+++ T Consensus 212 ~~~lk~~~~~~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~ 291 (337) T protein:vir:78 212 EEEMKEMIANSKGVGNFRSMFVNIPDGKPDGIKLIPVGDIATKDEFAAIKGITAQDVLTAHRYPPALAGIIPTNGGGGLG 291 (337) T ss_pred HHHHHHHHHHhcCcccccceEEEcCCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcccccCCCcCccc Confidence 99999999998888898888887 6789999999999999999999999999999999999999987765 45678 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhh Q lcl|NC_019710. 314 GIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLL 361 (424) Q Consensus 314 n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~ 361 (424) |+|++.+.|+++||.|+++.||++++.+|++.... 
..++++...++ T Consensus 292 n~e~~~~~f~~~~L~P~~~~ie~~~n~~ll~~~~~--~~f~~~~~~~~ 337 (337) T protein:vir:78 292 DPEKYDATYARNEVLPLCELVQDAINSAGLPRALW--VTFRETIGAAV 337 (337) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhhhcCChhhc--eeccccccccC Confidence 99999999999999999999999999988876442 34567777777 No 97 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=100.00 E-value=1.6e-55 Score=321.05 Aligned_cols=324 Identities=17% Similarity=0.244 Sum_probs=243.4 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccc-cc--ccccccc-c--c-cccccc-CCccccHHHHhhhHHHHHHHHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLV-TP--NQGSQTG-P--V-SAHGYL-GDSSINDERILQISTVWRCVSLIST 72 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~-~~--~~~~~~~-~--~-~~~~~~-~~~~~~~~~~~~~~~v~~~i~~ia~ 72 (424) |. .|..+ .+....+ .+ ...+..+ | + ...... -........+++.|.-+.++..+.+ T Consensus 1 m~-------~~~~~---------~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~pp~~~~~la~l~~ 64 (340) T protein:vir:98 1 MS-------KRKPR---------KAVAMTASAPQKMEAFTFGEPVPVLDKRDILDYVECISNGKWYEPPVSFSGLAKSLR 64 (340) T ss_pred CC-------CCCCC---------ccccccccCccceeEEEcCCceeecCcchhhhhhhhhhcCceecCCCCHHHHHHHHH Confidence 44 32221 0000000 00 0011000 0 0 000000 0000111223334444444433332 Q ss_pred hhh--hCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecc Q lcl|NC_019710. 73 LTA--CLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQS 150 (424) Q Consensus 73 ~ia--~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p 150 (424) +-+ +.++..+ .+.+.. 
..+||++||..+|++ ++.+++++||||++++|+..|++++|+|+++ T Consensus 65 a~~~h~s~i~~k-------------~n~l~~--~~~Pn~~lt~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~ 128 (340) T protein:vir:98 65 SAVHHSSPIYVK-------------RNVLAS--TYIPHPLLSRQDFSR-FALDYLVFGNAFLEQRHSVTGQLIKLLTSPA 128 (340) T ss_pred hccccchhhhhh-------------hhHHhh--ccCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCcEEEEEEeCC Confidence 222 1122211 112222 238999999999965 6679999999999999999999999999999 Q ss_pred ceEEEEEcCCceEEEEEecCceEEecHhHeeEecCcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEE Q lcl|NC_019710. 151 ANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS 229 (424) Q Consensus 151 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~ 229 (424) .+|.+..+++. +|++..++....|+++||||||.++ .++++|+|++.+++.++.+..++..++.++|+||++|++||. T Consensus 129 ~~vr~~~~~~~-~~~~~~~~~~~~~~~~eViHir~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~pg~il~ 207 (340) T protein:vir:98 129 KYTRRGVDDSV-FWFVENFTQPHEFAPDTVFHLLEPDINQEIYGLPEYLSALNSAWLNESATLFRRKYYQNGAHAGYIMY 207 (340) T ss_pred ceEEEcccCcE-EEEEecCCeEEEEccccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEE Confidence 99999877664 4566777778899999999999876 478999999999999999999999999999999999999999 Q ss_pred cCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCC Q lcl|NC_019710. 230 TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGD 304 (424) Q Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~ 304 (424) .+....++++.+.+++.|++..|..|+++++++ ++|++|+|++.+++|+||.|.+++++++||++|||||.++|. 
T Consensus 208 ~~~~~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi 287 (340) T protein:vir:98 208 VTDPAQSATDVESLRDAMRNSKGLGNFKNLFFYSPNGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQLMGG 287 (340) T ss_pred ecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcc Confidence 876678899999999999998888899999888 679999999999999999999999999999999999999999 Q ss_pred CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccC Q lcl|NC_019710. 305 VEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGD 364 (424) Q Consensus 305 ~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d 364 (424) .++++.+++|+|++.+.|+++||.|+++.||+ +|.+|..+ .++|+...+++.| T Consensus 288 ~~~~t~~~sn~e~~~~~f~~~~l~Pl~~~iee-~n~~L~~e------~~rF~~~~l~~~d 340 (340) T protein:vir:98 288 KPENIGSLGDVEKVAKVFVRNELSPLQDRFRE-VNDWLGME------VIRFKEYTLDNPE 340 (340) T ss_pred cCCCCCccccHHHHHHHHHHHHHHHHHHHHHH-HHhccccc------ccccCccccccCC Confidence 88888888999999999999999999999985 88888543 2578888999988 No 98 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=100.00 E-value=4.6e-56 Score=323.95 Aligned_cols=329 Identities=16% Similarity=0.217 Sum_probs=235.8 Q ss_pred cccCCCccHHHHHHhhccCcccccccccccccccccccccCCccc-cHHHHhhhHHHHHHHH---------HHHHhh--- Q lcl|NC_019710. 8 IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSI-NDERILQISTVWRCVS---------LISTLT--- 74 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~v~~~i~---------~ia~~i--- 74 (424) |+.+-..+--......-.......+... 
.....+.+...+.| +....+....++.|-+ -+|+.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~p~~v~~~~~~~~y~~~~~~~~~~~pp~~~~~la~~~~~~ 77 (350) T protein:vir:11 1 MSKRRSHRRQQPVTVQSAQEGEFIPRQG---GRAEAFTFGDPMPVLDGRGILDYLECWPNGRWYEPPLSMEGLAKSVGSS 77 (350) T ss_pred CCccccCCCcCccccCCcchhhhccccc---cceEEEEeCCceeecCcchhhHHHHHhhcCccccCCCCHHHHHHHHhhh Confidence 3333221100000000000000000000 00001111111111 1111111111111111 011111 Q ss_pred --hhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccce Q lcl|NC_019710. 75 --ACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSAN 152 (424) Q Consensus 75 --a~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~ 152 (424) .+.++..+ .+-+.. ..+||++||+++|++ ++.+++++||||++++|+..|++++|+|++|.+ T Consensus 78 ~~h~~~l~~k-------------~n~l~~--~~~Pn~~~t~~~f~~-~v~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~ 141 (350) T protein:vir:11 78 VYLQSGLKFK-------------RNMLAK--TFIPHRLLSRATFEQ-FSLDWLTFGSAYLEQPRSRLGTRMPLQAPLAKY 141 (350) T ss_pred hhhccchhhh-------------hhhhhh--cccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCCEEEEEEeCCce Confidence 11111111 111221 348999999999976 667999999999999999999999999999999 Q ss_pred EEEEEcCCceEEEEEecCceEEecHhHeeEecCcCC-CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC Q lcl|NC_019710. 153 MDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG 231 (424) Q Consensus 153 v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~ 231 (424) |++..+++. 
+|++..++....|+++||||+|.+++ ++++|+||+.+++.++.+..++..+..++|+||++|++||+.+ T Consensus 142 vr~~~~~~~-~~~~~~~~~~~~~~~~eVihir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~~~gil~~~ 220 (350) T protein:vir:11 142 MRRGTDLET-FYQVRSWKDEHEFEKGSVIQLREADINQEIYGVPEWFCALQSALLNESATLFRRKYYNNGSHAGFILYMT 220 (350) T ss_pred eEeeecCCe-EEEEeeCCeEEEECcccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEec Confidence 999887765 45667777888999999999998764 5799999999999999999999999999999999999999987 Q ss_pred CCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC Q lcl|NC_019710. 232 EKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE 306 (424) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~ 306 (424) ....++++.+.+++.|++..|+.|+++++++ ++|++++|++.+++|+||.|.+++++++||++|||||.++|..+ T Consensus 221 ~~~ls~e~~~~l~~~~~~~~G~~N~~~~~v~~~~g~~~g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~ 300 (350) T protein:vir:11 221 DAAQNEEDIDALRTALKTAKGPGNFRNLFVYAPNGKKEGIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVYPQLMGVVP 300 (350) T ss_pred CCCCCHHHHHHHHHHHHHhcCccccCceeeecCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccC Confidence 6668899999999999999888999999888 46899999999999999999999999999999999999999988 Q ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhh Q lcl|NC_019710. 307 KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGL 360 (424) Q Consensus 307 ~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~ 360 (424) +++.+++|+|++.+.|+.+||.|++++||+ ++.+|..+.. 
.+.+|+++.| T Consensus 301 ~~t~~~sn~e~~~~~f~~~~L~P~~~~ie~-ln~~l~~~~~---~F~~~~~~~l 350 (350) T protein:vir:11 301 QNAGGFGSISDAAAVWASLELAPMQTRLQQ-VNEMIGEEVV---RFAQFDAPGL 350 (350) T ss_pred CCCCCcCCHHHHHHHHHHHHHHHHHHHHHH-HHhhcCcccc---ccCcccccCC Confidence 888888999999999999999999999985 8888865322 2346777777 No 99 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=100.00 E-value=3.4e-54 Score=313.70 Aligned_cols=330 Identities=18% Similarity=0.229 Sum_probs=235.6 Q ss_pred cccCCCccHHHHHHhhccCccccccccccccc-c--c-cccccc-CCccccHHHHhhhHHHHHHHHHH--HHhhhhCcee Q lcl|NC_019710. 8 IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTG-P--V-SAHGYL-GDSSINDERILQISTVWRCVSLI--STLTACLPLD 80 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~-~--~-~~~~~~-~~~~~~~~~~~~~~~v~~~i~~i--a~~ia~~~~~ 80 (424) |+.|........-......... ...+..+ | + ...... -........++.-|.-+.++-.+ |+...+.++. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~---~~~~~f~~p~~v~~~~~~~~~~~~~~~~~~~~pp~~~~~la~~~~a~~~h~~~i~ 77 (344) T protein:vir:60 1 MSKKKGKTLQPAAKKMTASAPK---MEAFTFGEPVPVLDRRDILDYVECISNGRWYEPPISFTGLAKSLRAAVHHSSPIY 77 (344) T ss_pred CCcccCCCCCchHHhhcCCcCc---EEEEEcCCceeecCCcchhHHHHhhhcCccccCCCCHHHHHHHHHhhhhhccchh Confidence 4444332211111110100000 0000000 0 0 000000 00000000111111112222111 1122222232 Q ss_pred EeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCC Q lcl|NC_019710. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~ 160 (424) .++ +.+.. 
..+||++||+.+| +.++.+++++||||++++|+..|++++|+|++|.+|++..+++ T Consensus 78 ~k~-------------n~l~~--~~~Pn~~~t~~~f-~~~~~d~ll~Gnay~~i~rn~~G~~~~L~~l~~~~vr~~~~~~ 141 (344) T protein:vir:60 78 VKR-------------NILAS--TFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEED 141 (344) T ss_pred hhh-------------hHHHh--hccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCcceEEEeecCC Confidence 211 22322 3489999999999 6788999999999999999999999999999999999988877 Q ss_pred ceEEEEEecCceEEecHhHeeEecCcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHH Q lcl|NC_019710. 161 KVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQ 239 (424) Q Consensus 161 ~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~ 239 (424) . +|++..++....|+++||||+|.++ .++++|+||+.+++.++.+..+++.+..++|+||++|++||+.+....++++ T Consensus 142 ~-~~~v~~~~~~~~~~~~eIiHir~~~~~~~~yGlsp~~~a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~~~ls~e~ 220 (344) T protein:vir:60 142 V-YWWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRND 220 (344) T ss_pred e-EEEEccCCeEEEEcCccEEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHH Confidence 5 4556667778899999999999876 4789999999999999999999999999999999999999998766678889 Q ss_pred HHHHHHHHHHHhCCcccCcceec------CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccc Q lcl|NC_019710. 
240 RSQVEENFKEIAGGPVKKRLWIL------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGS 313 (424) Q Consensus 240 ~~~~~~~~~~~~~~~~ag~~~~l------~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~ 313 (424) .+.+++.|++..++ ++++.+++ ++|++|++++.+++|+||+|.+++++++||++|||||.++|..++++.+++ T Consensus 221 ~~~ik~~~~~~~g~-~~~r~~~l~~p~g~~~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~ 299 (344) T protein:vir:60 221 IEMLRENMVKSKGR-NNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLG 299 (344) T ss_pred HHHHHHHHHHhcCC-CCCcceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccc Confidence 99999999887655 66778777 579999999999999999999999999999999999999999888888889 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCH Q lcl|NC_019710. 314 GIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDS 365 (424) Q Consensus 314 n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~ 365 (424) |+|++.+.|+.+||.|+++.|| +|+.+|..+ .++|+...+...|. T Consensus 300 n~e~~~~~f~~~~L~Pl~~~~e-~ln~~lg~~------~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 300 DIEKVAKVFVRNELIPLQDRIR-EINGWLGQE------VIRFKNYSLDTDNG 344 (344) T ss_pred cHHHHHHHHHHHHHHHHHHHHH-HHHHhcCCc------ccccCccccCCCCC Confidence 9999999999999999999998 588887432 23565555555555 No 100 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=100.00 E-value=3.9e-54 Score=313.39 Aligned_cols=325 Identities=16% Similarity=0.160 Sum_probs=237.1 Q ss_pred cCCCccHHHHHHhhccCccccccccccccccccc--------ccccCCccccHHHHhhhHHHHHHHHHH--HHhhhhCce Q lcl|NC_019710. 10 LRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSA--------HGYLGDSSINDERILQISTVWRCVSLI--STLTACLPL 79 (424) Q Consensus 10 ~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~v~~~i~~i--a~~ia~~~~ 79 (424) ||+.+. ....... 
+..+ .....+++ ..+.+........+..-|.-+.++-.+ ++...+-.+ T Consensus 1 ~~~~~~-------~~~~~~~-~~~~-~~~~~f~~~~~~~~~~~~y~~~~~~~~~~~~epp~~~~~la~l~~~~~~h~~~i 71 (345) T protein:vir:37 1 MKTNVK-------TDNKKGI-VIAP-INDRTFSLNEISASPALDYVGIGFDENYNCYLPPVNRHALAKLPHQNAQHGGIL 71 (345) T ss_pred CCCCcc-------ccchhhc-ccCc-ceeEEeecCCcccccchhhhhhhhcCCccccCCCCCHHHHHHHhhcccccccce Confidence 333210 0000000 0000 00000000 000000000011122222222222222 111122222 Q ss_pred eEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcC Q lcl|NC_019710. 80 DVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVG 159 (424) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~ 159 (424) ... .+-+. ...+||++||+++|++ ++.+++++||||++++|+..|++++|+|++|.+|++..++ T Consensus 72 ~~k-------------~n~l~--~~~~Pn~~lt~~~f~~-~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~d~ 135 (345) T protein:vir:37 72 HSR-------------ANMVS--SLYEGGKALSRMDMRA-LCLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVRKDG 135 (345) T ss_pred eee-------------chHHH--hhccCCCCCCHHHHHH-HHHHHHhcCCeEEEEEEcCCCcEEEEEEEcCceeEEEEeC Confidence 221 12233 2348999999999985 5679999999999999999999999999999999999888 Q ss_pred CceEEEE----EecCceEEecHhHeeEecCcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCC Q lcl|NC_019710. 160 KKVVYRY----QRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV 234 (424) Q Consensus 160 ~~~~~~~----~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~ 234 (424) +..++.. ..++....|+++||||+|.++ .++++|+|++.+++.++.+..++..++.++|+||++|++||..+... 
T Consensus 136 ~~~~~~~~~~~~~~g~~~~~~~~dVihir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NG~~p~~Il~~~d~~ 215 (345) T protein:vir:37 136 GYSYLMKKSLYDTAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTDPD 215 (345) T ss_pred CeeEEEEEeEecCCceEEEEccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEecCCC Confidence 7754322 234566789999999999876 46799999999999999999999999999999999999999987666 Q ss_pred CCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCC Q lcl|NC_019710. 235 LTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKST 309 (424) Q Consensus 235 ~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~ 309 (424) .++++.+.+++.|++..|..|.++++++ ++|++++|++.+++|+||.|.+++++++||++|||||.++|..++++ T Consensus 216 l~~e~~~~lk~~~~~~~g~~n~~~~~i~~p~g~~~G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~~ 295 (345) T protein:vir:37 216 LTEEMEEEIARKISESKGVGNFRSMFVNIANGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNT 295 (345) T ss_pred CCHHHHHHHHHHHHHhcCcccccceEEEcCCCcccceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCccCCCC Confidence 7889999999999998888888888877 68999999999999999999999999999999999999999988888 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhc Q lcl|NC_019710. 310 SWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLR 362 (424) Q Consensus 310 ~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~ 362 (424) .+++|+|++.+.|+++||.|++++|++++|+.+.. 
.....++|+...+.+ T Consensus 296 ~~~~~~e~~~~~f~~~~l~P~~~~ie~~ln~~~~~---~~~~~i~F~~~~L~~ 345 (345) T protein:vir:37 296 GGLGDPLKYREVYHYDEVMPLQEIIAETINQDPEI---KNLLKIKFREQNFAK 345 (345) T ss_pred CCcccHHHHHHHHHHHHHHHHHHHHHHHhhhhccC---CCcceEEecchhhcC Confidence 88899999999999999999999999999964321 123567888777766 No 101 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=100.00 E-value=4.2e-54 Score=313.24 Aligned_cols=324 Identities=18% Similarity=0.222 Sum_probs=233.6 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCccccccccccccc-c--c-cc-------ccccCCccccHHHHhhhHHHHHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTG-P--V-SA-------HGYLGDSSINDERILQISTVWRCVSL 69 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~-~--~-~~-------~~~~~~~~~~~~~~~~~~~v~~~i~~ 69 (424) |..+|.. ++ -........ .+.....+..+ | + +. ..+..+. +++=|.-+.++-. T Consensus 1 ~~~~~~~----~~---~~~~~~~~~---~~~~~~~~~~~~p~~v~~~~~~~~~~~~~~~~~------~~~pp~~~~~la~ 64 (344) T protein:vir:56 1 MSKKKGK----TP---QPAAKTMTA---SAPKMEAFTFGEPVPVLDRRDILDYVECISNGR------WYEPPVSFTGLAK 64 (344) T ss_pred CCCCCCC----CC---chhhHHhhc---CCCceEEEEcCCceeecCcchhhhHHHhhhcCc------cccCCCCHHHHHH Confidence 4443331 11 000000000 00000011000 0 0 00 0011111 1111222223222 Q ss_pred HH--HhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEE Q lcl|NC_019710. 70 IS--TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 70 ia--~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~ 147 (424) +. +..-+.++..++ +.+.. 
..+||++||+.+| +.++.+++++||||++++|+..|++++|+| T Consensus 65 ~~~a~~~h~s~i~~k~-------------n~l~~--~~~Pnp~~t~~~f-~~~~~d~ll~Gnay~~~~rn~~G~~~~L~p 128 (344) T protein:vir:56 65 SLRAAVHHSSPIYVKR-------------NILAS--TFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLET 128 (344) T ss_pred HHhhhhhhCccceehh-------------hhHHh--hcCCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEE Confidence 21 222222333321 22222 3489999999999 678899999999999999999999999999 Q ss_pred eccceEEEEEcCCceEEEEEecCceEEecHhHeeEecCcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCce Q lcl|NC_019710. 148 LQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQ 226 (424) Q Consensus 148 l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~ 226 (424) +++.+|.+..+++.. |++..++....|+++||||+|.++ .++++|+||+.+++.++.+..+++.+..++|+||++|++ T Consensus 129 l~~~~v~~~~~~~~~-~~~~~~g~~~~~~~~dIiHir~~~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~ 207 (344) T protein:vir:56 129 SPAKYTRRGVEEDVY-WWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGY 207 (344) T ss_pred eCCceeEEeecCCEE-EEEecCCeEEEEcCccEEEECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCce Confidence 999999998887754 456667777899999999999876 478999999999999999999999999999999999999 Q ss_pred eEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec------CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHH Q lcl|NC_019710. 227 ILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPH 300 (424) Q Consensus 227 vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l------~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~ 300 (424) ||+.+....++++.+.+++.|++..+ .+++++++| ++|++++|++.+++|+||+|.+++++++||++|||||. 
T Consensus 208 Il~~~d~~ls~e~~~~lk~~~~~~~g-~~~~r~l~l~~p~g~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~ 286 (344) T protein:vir:56 208 IMYVTDAVQDRNDIEMLRENMVKSKG-RNNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQ 286 (344) T ss_pred EEEecCCCCCHHHHHHHHHHHHHhcC-CCCccceEEecCCCCccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHH Confidence 99987666788899999999988664 467899888 57999999999999999999999999999999999999 Q ss_pred HcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCH Q lcl|NC_019710. 301 LVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDS 365 (424) Q Consensus 301 ~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~ 365 (424) ++|..++++.+++|+|++.+.|+.+||.|+++.||+ ++.+|..+. ++|+.-.+...|- T Consensus 287 llGi~~~~t~~~~n~eq~~~~f~~~tL~Pl~~~ie~-~n~~l~~~~------~~F~~y~l~~~~~ 344 (344) T protein:vir:56 287 LMGGKPENVGSLGDIEKVAKVFVRNELIPLQDRIRE-INGWIGQEV------IRFKNYSLDTDNG 344 (344) T ss_pred HhccCCCCCCccccHHHHHHHHHHHHHHHHHHHHHH-HHhhhcccc------ccCCCccccccCC Confidence 999988888888999999999999999999999985 787886432 2343333322222 No 102 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=100.00 E-value=1e-53 Score=311.13 Aligned_cols=331 Identities=16% Similarity=0.149 Sum_probs=238.2 Q ss_pred CCCCCcccc--cCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHH--HHhhhh Q lcl|NC_019710. 1 MEEPKYTID--LRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLI--STLTAC 76 (424) Q Consensus 1 ~~~~~~~~~--~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~i--a~~ia~ 76 (424) |..-+.+-. ..+.++.-.. .|..+.+.. ........+. 
......+.+-|.-+..+-.+ ++..-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~-~~~~~y~~~~--------~~~~~~~~epp~~~~~la~~~~~~~~h~ 68 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDR---TFSLSEITA-SPALDYVGIG--------FDENYNCYLPPVNRHALAKLPHQNAQHG 68 (345) T ss_pred CCccccccchhhhcCCCceEE---EeecCCccc-chhhccccee--------eecCCccccCCCCHHHHHHHhhcchhhc Confidence 333222211 1222221000 111111110 0000000000 00001111112111111111 111111 Q ss_pred CceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEE Q lcl|NC_019710. 77 LPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVK 156 (424) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~ 156 (424) -++.+. .+-+. ...+||++||+.+|++ ++.+++++||||++++|+..|++++|+|++|.+|++. T Consensus 69 ~~i~~k-------------~n~l~--~~~~Pn~~~t~~~f~~-~v~d~ll~Gnay~~i~rn~~G~~~~L~pl~~~~vr~~ 132 (345) T protein:vir:37 69 GILHSR-------------ANMVS--ATYEGGKALSKMEMRA-LCLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVH 132 (345) T ss_pred chhhhh-------------hhHHh--hccCCCCCCCHHHHHH-HHHHHHhcCCeEEEEEECCCCCEEEEEEecCceeEEe Confidence 122111 11121 3448999999999975 5578999999999999999999999999999999998 Q ss_pred EcCCceEEE----EEecCceEEecHhHeeEecCcCC-CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC Q lcl|NC_019710. 157 LVGKKVVYR----YQRDSEYADFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG 231 (424) Q Consensus 157 ~~~~~~~~~----~~~~~~~~~~~~~evih~r~~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~ 231 (424) .+++..++. 
+...+...+|+++||||||.+++ ++++|+|++..++.++.+..+++.++.++|+||++|++||+.+ T Consensus 133 ~d~~~~~~~~~~~~~~~g~~~~~~~~eViHir~~~~~~~~~Gl~~~~~a~~si~l~~~a~~~~~~~f~NGa~~~~Il~~t 212 (345) T protein:vir:37 133 KDGGYSYLMKKSLYDTAQEIYRYDAKDIIFIKLYDPMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYST 212 (345) T ss_pred ecCCeeEEEeeeeeccCceEEEEccccEEEEcCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeC Confidence 888765432 22345667899999999998764 6799999999999999999999999999999999999999877 Q ss_pred CCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC Q lcl|NC_019710. 232 EKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE 306 (424) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~ 306 (424) ....++++.+.+++.|++..++.|.+.++++ ++|++++|++.+++|+||.+.+++++++||++|||||.++|..+ T Consensus 213 ~~~l~~e~~~~lk~~~~~~~g~~n~~~~~i~~~~g~~~G~~~~pl~~~~~d~qf~e~k~~~~~dI~~a~~VPp~liGi~~ 292 (345) T protein:vir:37 213 DPDLTEEMEEEIARKISESKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIP 292 (345) T ss_pred CCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceeEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccc Confidence 6677889999999999998888776656555 56899999999999999999999999999999999999999988 Q ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhc Q lcl|NC_019710. 307 KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLR 362 (424) Q Consensus 307 ~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~ 362 (424) +++.+++|+|++.+.|+++||.|++++|++++|+.+ . 
.....+++||...+++ T Consensus 293 ~~t~~~s~~e~~~~~f~~~~l~P~~~~ie~~ln~~~--e-~~~~~~i~F~~~~l~k 345 (345) T protein:vir:37 293 TNTGGLGDPLKYREVYHYDEVMPLQEIIAETINQDP--E-IKNLLKIKFREQNFAK 345 (345) T ss_pred CCCCCcccHHHHHHHHHHHHHHHHHHHHHHHhhhhh--c-cCCcceEEECchhhcC Confidence 888888999999999999999999999999999742 1 1234789999999998 No 103 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=100.00 E-value=9.1e-54 Score=311.36 Aligned_cols=324 Identities=17% Similarity=0.211 Sum_probs=235.5 Q ss_pred cccCCCccHHHHHHhhccCcccccccccccc-cc--c-c-------cccccCCccccHHHHhhhHHHHHHHHHH--HHhh Q lcl|NC_019710. 8 IDLRTNNGWWARLKSWFVGGRLVTPNQGSQT-GP--V-S-------AHGYLGDSSINDERILQISTVWRCVSLI--STLT 74 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~-~~--~-~-------~~~~~~~~~~~~~~~~~~~~v~~~i~~i--a~~i 74 (424) |+.|...-=........ ..+.....+.. .+ + + ...+..+. ++.=|.-+.++-.+ |+.. T Consensus 1 ~~~~~~~~~~~~~~~~~---~~~~~~~~~~f~~p~~v~~~~~~~~~~~~~~~~~------~~~pp~~~~~la~~~~a~~~ 71 (344) T protein:vir:20 1 MSKKKGKTPQPAAKTMT---ASGPKMEAFTFGEPVPVLDRRDILDYVECISNGR------WYEPPVSFTGLAKSLRAAVH 71 (344) T ss_pred CCcccCCCCcchhhhhh---ccCCceEEEEcCCceEecCcchhhhhhhhhhcCc------eecCCCCHHHHHHHHhhhhh Confidence 44443210000000000 00000000100 00 0 0 00011111 11112112222222 2222 Q ss_pred hhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEE Q lcl|NC_019710. 75 ACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMD 154 (424) Q Consensus 75 a~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~ 154 (424) .+.++..++ +-+.. 
..+||++||+.+| +.++.+++++||||++++|+..|++++|+|+++.+|+ T Consensus 72 h~~~i~~k~-------------n~l~~--~~~Pn~~lt~~~f-~~~~~d~ll~Gnay~~i~rn~~G~~~~L~pl~~~~vr 135 (344) T protein:vir:20 72 HSSPIYVKR-------------NILAS--TFIPHPWLSQQDF-SRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTR 135 (344) T ss_pred hCccceehh-------------hhHHH--hccCCCCCCHHHH-HHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCCceeE Confidence 233343321 12222 2489999999999 6788999999999999999999999999999999999 Q ss_pred EEEcCCceEEEEEecCceEEecHhHeeEecCcCC-CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCC Q lcl|NC_019710. 155 VKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK 233 (424) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~ 233 (424) +..+++.. |++..++....|+++||||+|.+++ ++++|+||+.+++.++.+..+++.++.++|+||++|++||+.+.. T Consensus 136 ~~~~~~~~-~~~~~~~~~~~~~~~eIiHir~~~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~p~~Il~~~d~ 214 (344) T protein:vir:20 136 RGVEEDVY-WWVPSFNEPTAFAPGSVFHLLEPDINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDA 214 (344) T ss_pred eeecCCEE-EEEccCCeEEEEcCccEEEeCCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCc Confidence 98887654 4566677788999999999998874 789999999999999999999999999999999999999998766 Q ss_pred CCCHHHHHHHHHHHHHHhCCcccCcceec------CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC Q lcl|NC_019710. 
234 VLTEQQRSQVEENFKEIAGGPVKKRLWIL------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK 307 (424) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~ag~~~~l------~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~ 307 (424) ..++++.+.+++.|++..++ ++++.+++ ++|++|+|++.+++|+||.|.+++++++||++|||||.++|..++ T Consensus 215 ~l~~e~~~~ik~~~~~~~g~-~n~r~l~l~~p~g~~~gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp~llGi~~~ 293 (344) T protein:vir:20 215 VQDRNDIEMLRENMVKSKGR-NNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPE 293 (344) T ss_pred CCCHHHHHHHHHHHHHhcCC-CCccceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhccCCC Confidence 68889999999999886654 66788777 469999999999999999999999999999999999999999888 Q ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCH Q lcl|NC_019710. 308 STSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDS 365 (424) Q Consensus 308 ~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~ 365 (424) ++.+++|+|++.+.|+++||.|+++.|| +++.+|..+ .++|+...+...|. T Consensus 294 ~t~~~~n~e~~~~~f~~~~l~P~~~~~e-~in~~lg~~------~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 294 NVGSLGDIEKVAKVFVRNELIPLQDRIR-EINGWLGQE------VIRFKNYSLDTDND 344 (344) T ss_pred CCCccccHHHHHHHHHHHHHHHHHHHHH-HHHHhcCCc------ccccCccccccCCC Confidence 8888889999999999999999999998 588777432 24566556655554 No 104 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=100.00 E-value=9.2e-52 Score=300.38 Aligned_cols=242 Identities=16% Similarity=0.220 Sum_probs=194.9 Q ss_pred ccHHHHHHhhccCcccccccccc--cccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGS--QTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRK 91 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~ 91 (424) +|||++...+ ....+.... 
............+..++.+.|+++|+|++||++||++||++||+++++++. T Consensus 1 MglF~~~~~r----~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~---- 72 (251) T protein:vir:46 1 MGIFYKNEKR----DLQYNEDDLQMMVQTLPSFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQI---- 72 (251) T ss_pred CCcccccccc----ccCCCccchhhhhhhhccccCcCcceechhhhhccHHHHHHHHHHHHhHhhCceEEeeCccc---- Confidence 7887654321 111111111 111111122234566889999999999999999999999999999976532 Q ss_pred cccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCce-EEEEE--- Q lcl|NC_019710. 92 KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKV-VYRYQ--- 167 (424) Q Consensus 92 ~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~-~~~~~--- 167 (424) ...|++.++|+.+||++||+++||+.++.+++++||||++++|+.+|++++|+||+|.+|++..++++. +|.+. T Consensus 73 --~~~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~ 150 (251) T protein:vir:46 73 --NYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARGRLYYFHQRID 150 (251) T ss_pred --cccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCceEEEEECCCCcEEEEEEEec Confidence 346899999999999999999999999999999999999999999999999999999999998876543 33332 Q ss_pred --ecCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHH Q lcl|NC_019710. 
168 --RDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE 245 (424) Q Consensus 168 --~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~ 245 (424) .++....|+++||||||+++.|+++|+||+.++..++.++.+++++..++|+||++|+|+|+++....++++++++++ T Consensus 151 ~~~~g~~~~~~~~diiH~r~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~~e~~~~~~~ 230 (251) T protein:vir:46 151 SNGNNIERNVKFEDMLDIKFYSLDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRARE 230 (251) T ss_pred cCCcceeEEECCccEEEecCcCCCCeeecCHHHHHHHHHHHHHHHHHHHHHHHHccCCCcEEEEeCCCCCCHHHHHHHHH Confidence 234457899999999999999999999999999999999999999999999999999999999988878888888988 Q ss_pred HHHHHhCC-cccCcceecCCCcee Q lcl|NC_019710. 246 NFKEIAGG-PVKKRLWILEAGFST 268 (424) Q Consensus 246 ~~~~~~~~-~~ag~~~~l~~g~~~ 268 (424) .|++.+++ +|+|++++. |+= T Consensus 231 ~~~~~~~g~~n~g~~~~g---m~~ 251 (251) T protein:vir:46 231 EFPKVLVELNKLGKLSYS---MNQ 251 (251) T ss_pred HHHHHhcCcccccccccc---cCC Confidence 88877665 788886653 322 No 105 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=100.00 E-value=1.2e-45 Score=266.83 Aligned_cols=208 Identities=16% Similarity=0.195 Sum_probs=176.6 Q ss_pred EEEEEcCCceEEEEE-----ecCceEEecHhHeeEecCcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCce Q lcl|NC_019710. 153 MDVKLVGKKVVYRYQ-----RDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQ 226 (424) Q Consensus 153 v~~~~~~~~~~~~~~-----~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~ 226 (424) |.+..++.. +|.+. 
.++...+|+++||+|||.++ .++++|+||+.+++.++....++++|+.++|+||++|+| T Consensus 1 ~r~~~dg~~-~y~~~~~~~~~~g~~~~~~~~eilH~r~~~~~~~~~Glspi~~a~~~i~~~~aa~~~~~~~f~Ng~~p~g 79 (219) T protein:vir:98 1 MRVCKDGNY-KYLMKKSLYDTKSEIYEYNKNDVIFIKLYDPMQQVYGSPDYVGGITSALLNSDATIFRRRYYSNGAHMGF 79 (219) T ss_pred CceeecCeE-EEEEecceecCCceeEEeccccEEEecCCCCCCCcceecHHHHHHHHHHHHHHHHHHHHHHHhcCCCCce Confidence 444444432 33332 23567889999999999876 688999999999999999999999999999999999999 Q ss_pred eEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHH Q lcl|NC_019710. 227 ILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHL 301 (424) Q Consensus 227 vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~ 301 (424) ||+++....++++.+++++.|++..|+.|+++++++ ++|++|++++++++|+||+|++++++++||++|||||.+ T Consensus 80 il~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d~qfle~rk~~~~eIa~~fgVPp~~ 159 (219) T protein:vir:98 80 ILYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGQKDEFANIKNISAQDVLTSHRFPPGL 159 (219) T ss_pred EEEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHH Confidence 998876667888999999999998888887766665 578999999999999999999999999999999999999 Q ss_pred cCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccC Q lcl|NC_019710. 
302 VGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGD 364 (424) Q Consensus 302 l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d 364 (424) ||..++++.+++|+|++.+.|+.+||.|+++.||++||++++.+.+ .+++|+.+.+...+ T Consensus 160 lG~~~~~~~~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~~~~~~~~---~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 160 SGIIPVNTAGLGDPLKIREAYQADEVLPLQEIIAESINSDYEIKSA---LKVNFKQPEKRDKN 219 (219) T ss_pred cccccCCCCCccCHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCc---cEEeecCcccccCC Confidence 9998887888899999999999999999999999999988655433 35677755554444 No 106 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.94 E-value=1.4e-27 Score=167.79 Aligned_cols=387 Identities=9% Similarity=-0.008 Sum_probs=237.1 Q ss_pred cCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCc Q lcl|NC_019710. 10 LRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDN 89 (424) Q Consensus 10 ~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 89 (424) |++.-|+.+-+.+.- .. .+..+.. ++......+..+ ...|.+++.++++|+.+|..+.+.++.+.- ++.+ T Consensus 1 ~~~~D~~~~~~~~~g---~~---~~~~~~~-~~~~~~~~~~~l-~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~--~d~~ 70 (437) T protein:vir:52 1 MKFFDGIKSLALKLG---SK---QEQTYYS-PSLSLTDDLVQL-EALWRDNWIANKVCIKRPEDMVRNWREIYS--NDLN 70 (437) T ss_pred CchhhhhHhHHhcCC---Cc---cccceee-cCccccccHHHH-HHHHHhCchhhHHhhcchHHhhcCCceEec--CCCC Confidence 777777766433211 11 1111111 111111111111 235778999999999999999999998842 1111 Q ss_pred cccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC---------CceeEEEEeccceEEEE--Ec Q lcl|NC_019710. 
90 RKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA---------GDVISLLPLQSANMDVK--LV 158 (424) Q Consensus 90 ~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~---------G~~~~l~~l~p~~v~~~--~~ 158 (424) .+.. ..+...+. +- .-.+-+...+.+.-++|.|++++..+.. |.+..+.++++++|+.. .+ T Consensus 71 ~~~~---~~~~~~~~-~l----~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~ 142 (437) T protein:vir:52 71 SKQL---DLFTKFER-SL----KLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKD 142 (437) T ss_pred HHHH---HHHHHHHH-hh----cHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhcccccccc Confidence 1111 11222222 11 1244555556666689999999988763 67889999999988732 11 Q ss_pred --------CCceEEEEEecCceEEecHhHeeEecCcC----CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCce Q lcl|NC_019710. 159 --------GKKVVYRYQRDSEYADFSQKEIFHLKGFG----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQ 226 (424) Q Consensus 159 --------~~~~~~~~~~~~~~~~~~~~evih~r~~~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~ 226 (424) +....|.+..+.....|.++.||||.+.. .+.+.|.|+++.+.+.|.....+......++.+...+ T Consensus 143 ~dp~s~~fg~p~~y~v~~~~~~~~iH~SRii~~~~~~~~~~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~-- 220 (437) T protein:vir:52 143 DDVLSPNFGRYSEYSILGGSQSITVHHSRLIILNANDAPLSDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKID-- 220 (437) T ss_pred ccccccccCcceEEEEecCCcceeEccceeEEecCccCCCccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCC-- Confidence 12235556555556789999999997532 2457899999999999999999999998888776554 Q ss_pred eEEcCC--CCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCC Q lcl|NC_019710. 227 ILSTGE--KVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGD 304 (424) Q Consensus 227 vl~~~~--~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~ 304 (424) +++.+. 
+.......+.+.+.++......+.+++++++.+.+|++++.++.++ .+.......+||++++||..+|.+ T Consensus 221 v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~e~~~~~~sgl--~~~l~~~~~~iaaa~~iP~t~L~G 298 (437) T protein:vir:52 221 IFKIAGLSDKIAAGMENEVASVISAVQEIKSATNSLLLDAENEYDRKELTFTGL--KDLLTEFRNAVAGAADMPVTILFG 298 (437) T ss_pred ceecchHHHHhcCCcHHHHHHHHHHHHHhcCCCceEEEcCCcceEEEecCcCCH--HHHHHHHHHHHHHHhcCchhhhcC Confidence 344431 1122112334445555544445557899999999999998887654 577888899999999999999977 Q ss_pred CCCCCcccccHHHHHHHHHH-------HHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHH-------H Q lcl|NC_019710. 305 VEKSTSWGSGIEQQNLGFLQ-------YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRA-------A 370 (424) Q Consensus 305 ~~~~~~~~~n~e~~~~~f~~-------~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~-------~ 370 (424) ...+..+ +.++..+.||. .-+.|+++.+-+.+-...+.... . .+.|.++++...+.++++ + T Consensus 299 ~s~~Gla--sge~D~~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~~~-~--~~~~~f~pL~~~s~kekae~~~~~a~ 373 (437) T protein:vir:52 299 QSVSGLA--SGDEDIQNYHEAIRRLQETRLRPIFEIIDPLICNELFGGLP-A--DWWFEFVPLTTVKQEQQINMLNTFAT 373 (437) T ss_pred cCccccc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-C--cceEEeCCcCCcCHHHHHHHHHHHHH Confidence 6555443 45666777776 55778888777777665554322 1 244555677777755554 4 Q ss_pred HHHHHHhCCCcCHHHHHHHhC----CCCCCCcCeeeecccccchhhcccc-----CCCccCCC Q lcl|NC_019710. 371 FMKAMGESGLRTINEMRRTDN----LPPLPGGDVAMRQSQYVPITDLGTN-----KEPRNNGA 424 (424) Q Consensus 371 ~~~~~~~~g~~t~NE~R~~lg----~~p~~ggd~~~~~~n~~~~~~~~~~-----~~~~~~g~ 424 (424) .+.+++++|+++++|+|++|. ++.++..|..-.... 
.+......+ +.+...++ T Consensus 374 a~~~~~~~g~i~~~e~r~~L~~~g~~~~i~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 435 (437) T protein:vir:52 374 AANTLIQNGVLNEYQIANELRESGLFANISAEHIEELKNA-DEFAGNFEEPEKMEGAQVQNSE 435 (437) T ss_pred HHHHHHhcCCCCHHHHHHHHHhcCCCCCCCccccccccCC-CCCCCccCCCCCCCCCCCCCCC Confidence 588899999999999999873 344443332211111 110000000 00001111 No 107 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.89 E-value=3.2e-23 Score=143.91 Aligned_cols=403 Identities=11% Similarity=0.033 Sum_probs=220.3 Q ss_pred CCCCCcccccCCCc-cHH---HHHHhhccCcccccc-----------cccccccccccccccCCccccHHHHhhhHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTNN-GWW---ARLKSWFVGGRLVTP-----------NQGSQTGPVSAHGYLGDSSINDERILQISTVWR 65 (424) Q Consensus 1 ~~~~~~~~~~~~~~-G~~---~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~ 65 (424) |.-++..|.++-.- .+. +.+.+-+..-+.... .............+.+ +--...|.+++.++. T Consensus 41 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~l~a~Y~~~~l~r~ 118 (537) T protein:vir:10 41 QLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQAFIG--HQMCALIATHWLVNK 118 (537) T ss_pred HhhhhccCCCCCccCcccccccccccchhccccccchhhhhhhccccccchhhhhccccCCcc--HHHHHHHHhCchhhh Confidence 22222222222220 011 000000000000000 0000000000000111 112235678999999 Q ss_pred HHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee-CCC----- Q lcl|NC_019710. 66 CVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR-NSA----- 139 (424) Q Consensus 66 ~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r-~~~----- 139 (424) +|+.+|..+.+-++.+.-.+.+. .+ ......|....+....+..|.+.+.+. 
.++|.+++++.- ..+ T Consensus 119 iVd~~A~d~~r~~~~i~~~~~~~--~~----~~~~~~l~~~~~~l~~~~~l~~a~~~~-rlyG~~~i~i~v~~~D~~~~~ 191 (537) T protein:vir:10 119 ACSQMPRDAMRKGYKIISDDGNE--LD----PKDAKFIDRYDRAFNIKKHAIQFVRKG-RIFGIRIALFKVDSPDPYYYE 191 (537) T ss_pred hhhhhhHHhhcCCceeecCCccc--cc----HHHHHHHHHHHHHhhHHHHHHHHHHhc-ccccceEEEEeecCcCCcccc Confidence 99999999999888884322111 11 112222332222333344444444444 457988877642 222 Q ss_pred ----------CceeEEEEeccceEEEEE------cCCceE----EEEEecCceEEecHhHeeEecCcCC-------CCcc Q lcl|NC_019710. 140 ----------GDVISLLPLQSANMDVKL------VGKKVV----YRYQRDSEYADFSQKEIFHLKGFGF-------TGLV 192 (424) Q Consensus 140 ----------G~~~~l~~l~p~~v~~~~------~~~~~~----~~~~~~~~~~~~~~~evih~r~~~~-------~~~~ 192 (424) |....|.+++|.++.+.. |..... -.|... ...|.++.|+||.+... .++. T Consensus 192 ~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P~~y~v~--g~~iH~SRli~f~g~~~p~~~~~~~~~~ 269 (537) T protein:vir:10 192 KPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEPTYWLIN--GKKYHRSHLAIYINDEVVDFLKPSYIYG 269 (537) T ss_pred cccccccccccceeEEEEechhhcccccchhhhccCCccccCCceeeeec--CeEecceeEEEecCCCCchhhhcccCcc Confidence 345678888888777432 111111 123333 34678999999975432 3467 Q ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCC-Cceeeec Q lcl|NC_019710. 193 GLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA-GFSTSAI 271 (424) Q Consensus 193 G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~-g~~~~~l 271 (424) |.|.++.+.+.|.....+......++.........+.....+.+++. +.+.++.+....+..++++++. 
+.+|+++ T Consensus 270 G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~~~~---~~~r~~~~~~~r~n~g~~~id~e~e~~e~~ 346 (537) T protein:vir:10 270 GVPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLANKQQ---FDETMSWWTATRDNYQVRVVDKDNEDVVQI 346 (537) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcCHHH---HHHHHHHHHhhcCCcceeEecCCCceeEEE Confidence 99999999999999999988888888877765333322223344443 3334444333333345777776 5889988 Q ss_pred cCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHH------HHHHHHHHHHHHHhhhccCh Q lcl|NC_019710. 272 GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY------TLQPYISRWENSIQRWLIPA 345 (424) Q Consensus 272 ~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~------tl~P~~~~ie~~l~~~L~~~ 345 (424) +.+...+ .+........||.+.|||..+|.+...+..+ ++.+...+.||.. .|.|.++.+.+.+-+..+.+ T Consensus 347 ~~~lsgl--~~~l~~~~~~iAa~~~IP~t~L~G~sp~Gln-atGe~D~~~yyd~I~~~Qe~l~p~l~~l~~ll~~~~~~~ 423 (537) T protein:vir:10 347 DTTLNDL--DKVIMNQYQLVCAIARTPAPKMLGTVPTGFN-STGDYEEASYHEECESTQDDMRPLIDRHHQLVCRSHLRK 423 (537) T ss_pred eccCCCH--HHHHHHHHHHHHhhhCCCceeeccCCccccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 8776654 5678888899999999999987554322221 1234445555533 47898888888777666553 Q ss_pred hhhccceeeecchhhhccCHHHHHHH-------HHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccc------------ Q lcl|NC_019710. 346 KDVGRIHAEHNLDGLLRGDSASRAAF-------MKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQ------------ 406 (424) Q Consensus 346 ~~~~~~~~~f~~~~~~~~d~~~~~~~-------~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n------------ 406 (424) . 
..+.|.+++|...|.++++++ +++++++|++++||+|+.|+.+|..|-+.+....+ T Consensus 424 ~----~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e~~~~~~~ 499 (537) T protein:vir:10 424 R----IRVKVEFPPMDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGFTSITPAMRPTDAEDIDVDDE 499 (537) T ss_pred C----cceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCccccccccCCCChhhhhcccCCcc Confidence 2 246677789988888887764 88999999999999999988765432222111000 Q ss_pred ---------------ccchhhccc-cCCCccCCC Q lcl|NC_019710. 407 ---------------YVPITDLGT-NKEPRNNGA 424 (424) Q Consensus 407 ---------------~~~~~~~~~-~~~~~~~g~ 424 (424) ..+.+..++ -.+++++|| T Consensus 500 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a 533 (537) T protein:vir:10 500 GKPVRIIEDQPAPSEMFGATSSGESANDPRDSGA 533 (537) T ss_pred CCcCCCCCCCCCccccCCCCccccccCCCccCcc Confidence 000111111 112233333 No 108 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.86 E-value=7.6e-22 Score=136.37 Aligned_cols=406 Identities=10% Similarity=0.036 Sum_probs=222.0 Q ss_pred CCCCCcccc--cC-CCccHHHHHHhhccCc--------------cccccccccccccc-------------ccccccCCc Q lcl|NC_019710. 1 MEEPKYTID--LR-TNNGWWARLKSWFVGG--------------RLVTPNQGSQTGPV-------------SAHGYLGDS 50 (424) Q Consensus 1 ~~~~~~~~~--~~-~~~G~~~~~~~~~~~~--------------~~~~~~~~~~~~~~-------------~~~~~~~~~ 50 (424) |+...-|-+ +- .+.|--.|.++.-+.. ...........+.+ ....|.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~~ 80 (532) T protein:vir:94 1 MADTDPTPRPEITYATLQQAQRVDAKRATHTSLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEAT 80 (532) T ss_pred CCCCCCCCCcceehhhhhhHhhhhhhhhhhhhhhhhhhhhhcccccccccccccccccccccccCccccccccccccccc Confidence 332111100 00 0112222222111100 00000000000000 000111111 Q ss_pred c----ccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019710. 
51 S----INDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCF 126 (424) Q Consensus 51 ~----~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~ 126 (424) . .....|.+++.++.+|+.+|+.+.+-.+++.-.+++... ......|...-... .-.+-+..++.+..+ T Consensus 81 ~~~~~~l~a~Y~~~~l~r~~Vd~~aed~~r~~~~i~~~~~~~~~------~~~~~~i~~~~~~l-~v~~~l~~a~~~~rl 153 (532) T protein:vir:94 81 SWPGFPTLALLAQLPEYRTMHETPADECVRAWGKITCSSKDELA------ADKATRITQKLEQY-NVRTLVRTVVIHDQA 153 (532) T ss_pred ccchHHHHHHHHcCchhhhhhccchHHHhhCCceEeeCCccccc------hHHHHHHHHHHHhh-hHHHHHHHHHHhhhc Confidence 1 112356789999999999999999999888533222111 11122222111111 233444555666667 Q ss_pred cCCeEEEEeeCC-------------------CCceeEEEEeccceEEEEEcC--C--c------eEEEEEecCceEEecH Q lcl|NC_019710. 127 YGNAYALVDRNS-------------------AGDVISLLPLQSANMDVKLVG--K--K------VVYRYQRDSEYADFSQ 177 (424) Q Consensus 127 ~G~a~~~~~r~~-------------------~G~~~~l~~l~p~~v~~~~~~--~--~------~~~~~~~~~~~~~~~~ 177 (424) +|.|++++.-.. .|.+..+.+++|.+|++.... + . ..|... ....|.+ T Consensus 154 yG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~dp~sp~fg~P~~y~v~---~g~~iH~ 230 (532) T protein:vir:94 154 YGGAHVFPHLKMDGDSVPADAPLLLSPSFVQRGCLIGFATIEPMWLSPNAYNATDPTLPSFYKPDSWIAT---SGKKIHS 230 (532) T ss_pred ccceEEEEEeccCCccccccccccccccccccceeeEEEeechheecccccccccccccccCCceeEEEc---cCeeecc Confidence 999888764322 234567888999888753211 1 1 122221 1346889 Q ss_pred hHeeEecCcCC-------CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC-CCCCCHHHHHHHHHHHHH Q lcl|NC_019710. 178 KEIFHLKGFGF-------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG-EKVLTEQQRSQVEENFKE 249 (424) Q Consensus 178 ~evih~r~~~~-------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~-~~~~~~~~~~~~~~~~~~ 249 (424) +.|+||.+... .+++|.|.++.+.+.+.....+......+....... ++++. ......+..+.+.+.++. 
T Consensus 231 SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~--v~k~~~a~~ls~~~~~~~~~r~~~ 308 (532) T protein:vir:94 231 SRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFSMT--NLATDMAQLLAPGGAQSLDARLQL 308 (532) T ss_pred ceEEEecCCCchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCc--eeeechHHhhcchhHHHHHHHHHH Confidence 99999975432 245799999999999999999888888876665543 23332 222333445556666665 Q ss_pred HhCCcccCcceecCC-CceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHH---- Q lcl|NC_019710. 250 IAGGPVKKRLWILEA-GFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ---- 324 (424) Q Consensus 250 ~~~~~~ag~~~~l~~-g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~---- 324 (424) ........++++++. +.+|++++.+..++ .+........||.+.|||..+|.+...+..+ ++.+.....||. T Consensus 309 ~~~~~~n~g~~~id~~~e~~e~~~~~lsgl--~~~l~~~~~~iAaa~~IP~t~LfG~sp~Gln-stGe~D~~~yyd~I~s 385 (532) T protein:vir:94 309 FNLYRDNRNIGALDKGTEEIQQTNTPLSGL--DSLQAQSQEQMAAVSHIPLVKLLGITPNGLN-ASSDGEIRVWYDFIAG 385 (532) T ss_pred HHhhcCCccceEEcCCCceeEEEecccCCH--HHHHHHHHHHHHhHhCCCeeeeecCCccccc-ccchHHHHHHHHHHHH Confidence 544433345777775 57899988777654 6678888999999999999987654433332 123444455555 Q ss_pred ---HHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHhCCCC Q lcl|NC_019710. 325 ---YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRA-------AFMKAMGESGLRTINEMRRTDNLPP 394 (424) Q Consensus 325 ---~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~-------~~~~~~~~~g~~t~NE~R~~lg~~p 394 (424) .-+.|+++.+.+.|-+..+.... 
..+.|.+++|...+.++++ +.+++++++|++++||+|++++..| T Consensus 386 ~Qe~~l~p~le~l~~~l~~s~~g~~~---~d~~~~f~pL~~~s~kEkAei~~~~a~a~~~~~~~Gvi~~~Evr~~l~~~~ 462 (532) T protein:vir:94 386 YQATNLTPLMEWIIDLIQLSEYGQID---PGLAWEWSPLMELDDKELAEVRQLNASTDSTLMELGVIDAKMVQQRLAADP 462 (532) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCC---CCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhcCC Confidence 44788888888888766554322 1244555678777777655 4567899999999999999999988 Q ss_pred CCCcCeeeecccccc-hh-----hcccc----------CCCccCCC Q lcl|NC_019710. 395 LPGGDVAMRQSQYVP-IT-----DLGTN----------KEPRNNGA 424 (424) Q Consensus 395 ~~ggd~~~~~~n~~~-~~-----~~~~~----------~~~~~~g~ 424 (424) ..+.+......+... .. ...++ +.+..+++ T Consensus 463 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (532) T protein:vir:94 463 TSGYAGALGERDELDDVEEIAKQLMAAALNPPATAPQTPNPQPDSE 508 (532) T ss_pred ccccccccccccccccccchhhhhcccccCCCCCCCCCCCCCCCCC Confidence 766443332211110 00 00000 00111111 No 109 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=99.84 E-value=1.2e-20 Score=129.86 Aligned_cols=397 Identities=11% Similarity=-0.024 Sum_probs=243.3 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCcee Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) +..|-..+.+.|..|+.+.+..++.+..+.+ +..... 
-.++.....+..++.+.|.+|++.+..+|.+++|+ T Consensus 1 v~~~~l~~e~at~~~~~d~~~~~~~~l~~~~--~~il~~------a~~g~~~~y~~l~~D~~i~s~l~~rk~av~~~~w~ 72 (488) T protein:vir:99 1 MEKPALGREIATSGDGRDITRPFISGLQVPN--DSILQR------RGGNDLRVYEEILSDAQVKTVWGQRQLAVVSREWK 72 (488) T ss_pred CCccchhHHHHHHHhhhhhhccccCCCCCCC--hHHHHh------hccCCHHHHHHHhhChHHHHHHHHHHHHHhcCCce Confidence 5555556677777777777766665433222 111110 01121112244567899999999999999999999 Q ss_pred EeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC---ceeEEEEeccceEEEEE Q lcl|NC_019710. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG---DVISLLPLQSANMDVKL 157 (424) Q Consensus 81 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G---~~~~l~~l~p~~v~~~~ 157 (424) |...+. +.+......-+...|. ++ .+.+++..+. +.+++|.++.++++..+| .+..+.+.|+.++.+.. T Consensus 73 i~p~~~--~~~~~~~ae~v~~~l~-~~----~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d~ 144 (488) T protein:vir:99 73 VEAGGD--RPIDQAAAEHLEQQLQ-RV----GWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKVRNRRRFRYDQ 144 (488) T ss_pred EEcCCC--ChHHHHHHHHHHHHHh-CC----CHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeeeecccceeecC Confidence 964332 2111111123444443 33 4667777776 567899999998875443 46788999998887766 Q ss_pred cCCceEEEEEecCceEEecHhH--eeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC Q lcl|NC_019710. 158 VGKKVVYRYQRDSEYADFSQKE--IFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL 235 (424) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~e--vih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~ 235 (424) ++............+..++... |+|........++|.|.+..+.-....-....++...|...-+.|-.+.+++.... 
T Consensus 145 ~~~l~~~~~~~~~~g~~lp~~~~~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a 224 (488) T protein:vir:99 145 DGGLRLLTPNNMFEGEPCPAPYFWHFSTGADNDDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTA 224 (488) T ss_pred CCceEEeccCCCCCccccccCceEEEEeecCCCCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCC Confidence 6554433222333455665433 44444445566899999999999999999999999999999999988888875444 Q ss_pred CHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCC-hhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCccccc Q lcl|NC_019710. 236 TEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVT-PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSG 314 (424) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s-~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n 314 (424) +++.++.+.+.+.++.+ + ..++++.|++++-+..+ .....|.+..++-.++|+.+.-= .. +.. +.++.+++ T Consensus 225 ~~~ek~~l~~av~~~~~--~--~~~viP~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~iLG-qt-lts-~~~~Gs~a- 296 (488) T protein:vir:99 225 TPEDKAKLLAALHAIQT--D--SAIIMPAGMQAELLEAGRSGTADYKTLHDTMDATIAKVGLG-QV-AST-QGTPGRLG- 296 (488) T ss_pred CHHHHHHHHHHHHHHhc--C--cEEEecCCceeEEeecCCCChHHHHHHHHHHHHHHHHHHhh-hh-hcc-cccccchh- Confidence 55556666666665543 2 35556666655544321 22234788888889999887411 11 211 11122222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhh----ccceeeecchhhhccCHHHHHHHHHHHHhC-CC-cCHHHHHH Q lcl|NC_019710. 315 IEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDV----GRIHAEHNLDGLLRGDSASRAAFMKAMGES-GL-RTINEMRR 388 (424) Q Consensus 315 ~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~----~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~-g~-~t~NE~R~ 388 (424) ..+........-+.-.++.|++.||++|+.+.-. ...+..|-++.....|.+.+++.++++++. 
|+ ++..++|+ T Consensus 297 ~~~vh~~v~~d~~~aDa~~i~~tln~~li~~l~~~N~~~~~~p~~~~~~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~e 376 (488) T protein:vir:99 297 NDDLQADVRLDLVKADADLICESFNLGPARWLTEWNFPGAQPPRVYRVIEEPEDITAKAERDEKVFRMSGFRPTRGYVQE 376 (488) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcCCcCCceeEecCCCcccHHHHHHHHHHHHhhcCCCCCHHHHHH Confidence 2344556678888899999999999888764321 111122333444667888999999999986 64 78888999 Q ss_pred HhCCCCCCCcCeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 389 TDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 389 ~lg~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) .+|+|+-..++....+..... .++...+.+... T Consensus 377 ~~Gip~~~~~~~~~~~~~~~~---~~~~~~~~~~~~ 409 (488) T protein:vir:99 377 TYGVEVESTQAEATAPTPSTE---FAEGDQPSDPAA 409 (488) T ss_pred HcCCCCcccccccccCCCccc---CCCCCCCCCchH Confidence 999997655555443322111 111111111111 No 110 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.83 E-value=1.9e-20 Score=128.72 Aligned_cols=402 Identities=11% Similarity=0.010 Sum_probs=211.2 Q ss_pred CCCCC--------cc-------------------cccCC------CccH-HHHHHhhccCcccc--cccccccccccccc Q lcl|NC_019710. 1 MEEPK--------YT-------------------IDLRT------NNGW-WARLKSWFVGGRLV--TPNQGSQTGPVSAH 44 (424) Q Consensus 1 ~~~~~--------~~-------------------~~~~~------~~G~-~~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 44 (424) =++|+ || |+--. ..|+ .+.+.+...+-... 
............++ T Consensus 50 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~ 129 (862) T protein:vir:99 50 KEKPNPIIRSVKDFPFVEISDSVNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWY 129 (862) T ss_pred cccCCCCCCcccccccccccccccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccccccccccchhccccc Confidence 00000 10 00000 0011 11111111100000 00000000000011 Q ss_pred cccCCc-cccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHH Q lcl|NC_019710. 45 GYLGDS-SINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQ 123 (424) Q Consensus 45 ~~~~~~-~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~ 123 (424) ...+.. +--...|.+++.++.+|+.+|+.+.+-.+.+.-.++ ++.........+...+. +- .-.+-+...+.+ T Consensus 130 ~~~~f~gyql~alY~~~~larkiVd~pAeDatR~g~~I~~~~d-~~e~~~e~~~~ie~~~~-rL----~v~~~l~eair~ 203 (862) T protein:vir:99 130 LSQGFIGHQACALIAQHWLVDKACSLAGEDAIRNGWHLKSLGE-GEEIDEESLEKFKAIDV-EF----KVKENLIEFNRF 203 (862) T ss_pred cccCcccHHHHHHHHhCchhhhhhhhhhHHHhhCCceEeecCc-ccccCHHHHHHHHHHHH-Hh----hHHHHHHHHHHh Confidence 100000 011345778999999999999999999988853222 11111111111222221 11 123333444454 Q ss_pred HHHcCCeEEEEe-eCCC---------------CceeEEEEeccceEEEEE------cCCceE----EEEEecCceEEecH Q lcl|NC_019710. 124 LCFYGNAYALVD-RNSA---------------GDVISLLPLQSANMDVKL------VGKKVV----YRYQRDSEYADFSQ 177 (424) Q Consensus 124 ~l~~G~a~~~~~-r~~~---------------G~~~~l~~l~p~~v~~~~------~~~~~~----~~~~~~~~~~~~~~ 177 (424) .-++|.+++++. ...+ |.+..|.+|+|.++.... |..... 
-.|...+ ..|-+ T Consensus 204 ~RLyGga~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v~~~~~Dp~sp~yGkP~~y~I~g--~~IH~ 281 (862) T protein:vir:99 204 KNVFGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDPYWMMPMLTAESTADPSSQFFYEPEFWIISG--QKYHR 281 (862) T ss_pred cccccceEEEEEecCcCchhhhcCcCcccccccceeEEEEechhhhcccccccccccccccccCCceeeeecC--eeecc Confidence 556777766653 2222 345678888887776421 211110 1122332 46778 Q ss_pred hHeeEecCcCC-------CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC--CCCCCHHHHHHHHHHHH Q lcl|NC_019710. 178 KEIFHLKGFGF-------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG--EKVLTEQQRSQVEENFK 248 (424) Q Consensus 178 ~evih~r~~~~-------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~--~~~~~~~~~~~~~~~~~ 248 (424) +.||||.+... ..+.|+|.++.+.+.|.....+......++.+.... +++++ ..+.+++ .+.+.++ T Consensus 282 SRliif~g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~l~--v~ktd~l~~l~~ed---~l~~r~~ 356 (862) T protein:vir:99 282 SHLIIARGPQPADILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKRTT--AIHTDTAKAIANED---KFIQRLM 356 (862) T ss_pred ceeEEecCCCchhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccc--eeechhHhhhccHH---HHHHHHH Confidence 99999975432 235799999999999999999999999988876654 33433 2222222 2333444 Q ss_pred HHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-CCCcccccHHHHHHHHHH--- Q lcl|NC_019710. 249 EIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE-KSTSWGSGIEQQNLGFLQ--- 324 (424) Q Consensus 249 ~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~-~~~~~~~n~e~~~~~f~~--- 324 (424) ......+..++++++.+.+|++++.+..++ .+........||++.+||..+|.+.. .|..+ +.++..+.||. 
T Consensus 357 ~~~~~rdN~Gi~liD~eEe~e~ls~slSGL--~dll~~~~q~IAaas~IP~tiLfGqspaGlnA--TGE~D~~nYyD~I~ 432 (862) T protein:vir:99 357 FWVRYRDNHAVKVLGTDETMEQFDTSLADF--DAVIMGQYQLVASIAKTPATKLLGTAPKGFNS--TGEFETISYHEELE 432 (862) T ss_pred HHHhccCcceeEEecCCCceeEEecccCCh--HHHHHHHHHHHHhhhCCCceeecccCcccccC--chHHHHHHHHHHHH Confidence 333333334589999999999998877654 56777888899999999999776544 33222 34555566665 Q ss_pred ----HHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHH-------HHHHHhCCCcCHHHHHHHh--- Q lcl|NC_019710. 325 ----YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAF-------MKAMGESGLRTINEMRRTD--- 390 (424) Q Consensus 325 ----~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~-------~~~~~~~g~~t~NE~R~~l--- 390 (424) .-|.|+++.+...+...+..+. .+.|.+++|...+.++++++ +++++++|+++++|+|++| T Consensus 433 s~QE~~L~P~LerL~~li~~~lg~~~-----d~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~sGvispdEvR~~L~~~ 507 (862) T protein:vir:99 433 SIQEHVYMPFLQRHYLISRLSLGIQH-----EIDVVMEPVASMTAQQQADLNKTKAEGGKVLIDGGVISPDEERNRIRDD 507 (862) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCC-----cceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhc Confidence 4577888888876655443221 24455578888888777654 6789999999999999976 Q ss_pred ---CCCCCCCcCee----eecccccchhhccc--cCCC---ccCCC Q lcl|NC_019710. 391 ---NLPPLPGGDVA----MRQSQYVPITDLGT--NKEP---RNNGA 424 (424) Q Consensus 391 ---g~~p~~ggd~~----~~~~n~~~~~~~~~--~~~~---~~~g~ 424 (424) |++.++..|.. ..+.+...+...+. 
.+.+ ...|+ T Consensus 508 ~~~g~~~l~ded~E~d~~~~~e~~~~~e~~g~a~~~ap~de~~aga 553 (862) T protein:vir:99 508 KRSGYNRLTKEDAEETPGASPENLAAYQKAGAAQETASAKETQAGA 553 (862) T ss_pred CCcCCCCCCcccccccCCCCcccccccccCCccccccccccccccc Confidence 45444332221 11122111111000 0000 00011 No 111 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=99.82 E-value=4e-19 Score=121.45 Aligned_cols=403 Identities=13% Similarity=0.012 Sum_probs=236.6 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCccccccccccccccccccccc--CCccccHHHHhhhHHHHHHHHHHHHhhhhCc Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYL--GDSSINDERILQISTVWRCVSLISTLTACLP 78 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~ 78 (424) |-+..+|=.=|.+.| .+.+... + .........-...... .+..+..+--.+.+.|.+|++.+..+|.+++ T Consensus 1 ~~~~~~~~~p~~~~g---~~~~~~~----~-~~~~~~~~~e~~~~lr~~~~~~ly~~m~e~D~~i~s~l~~rk~av~~~~ 72 (469) T protein:vir:10 1 MTERVKTAAPVSEAG---YVFGSGV----V-DGWTVWDPFEQTPELQWPQSVAVYSRMDNEDSRVTSLLEAISLPIRSTP 72 (469) T ss_pred CCCcccCCCCccchh---hhhhccc----c-cchhhccccccccccccccchHHHHHHHhhChHHHHHHHHHHHHHhcCC Confidence 433322211121111 1111100 0 0000100000000011 1222222222358999999999999999999 Q ss_pred eeEeeccccCccccccccchhHHhhcc----CC--------CCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-----C- Q lcl|NC_019710. 79 LDVFETDQNDNRKKVDLSNPLARLLRY----SP--------NQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-----G- 140 (424) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~l~~lL~~----~P--------N~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~-----G- 140 (424) |+|...+++. +.. ..+...|.. .+ +-..++.+++..++.+.+.+|-++.++++... 
| T Consensus 73 w~v~p~~~~~---e~~--~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG~ 147 (469) T protein:vir:10 73 WRIRANGASD---EVT--EFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDGR 147 (469) T ss_pred ceEecCCCCH---HHH--HHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCCc Confidence 9996544322 111 112222211 11 11236888888888888899999999987533 4 Q ss_pred -ceeEEEEeccceEE---EEEcCCceEEEE------------EecCceEEecHhHeeEecCcC-CCCccccchHHHHHHH Q lcl|NC_019710. 141 -DVISLLPLQSANMD---VKLVGKKVVYRY------------QRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKS 203 (424) Q Consensus 141 -~~~~l~~l~p~~v~---~~~~~~~~~~~~------------~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~ 203 (424) .+..|.+.|+.++. +..+++...+.. ..+.....+++...|++++.. ...++|.|.+..+.-. T Consensus 148 ~~~~~l~~rp~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lp~~k~i~~~~~~~~g~p~g~gLlr~~~~~ 227 (469) T protein:vir:10 148 FWLRKLAPRPQWTISKFNVAPDGGLESIEQIAPPARTRGSLYVANIAPPEIPVNRLVVYTRNKRPGQWQGKSILRSAYKH 227 (469) T ss_pred eeeeeeeecCcccceeeeeccCCceeeeeecCcccccccccccCCCCccccccCcEEEEEecCCCCCcccchhHHHHHHH Confidence 36667777776552 333333332221 122334667888876666554 4558999999999999 Q ss_pred HHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHH Q lcl|NC_019710. 204 AGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMAS 283 (424) Q Consensus 204 i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~ 283 (424) ...-....++...+...-+.|--+.+++.+.. ++.++.+.+....+..+.++ .++++.|++++-+..+.....|.+. 
T Consensus 228 ~~fK~~~~~~w~~f~EryG~P~~vgky~~~a~-~~ek~~l~~a~~~~~~g~~a--~~iip~~~~ie~~ea~g~~~~~~~l 304 (469) T protein:vir:10 228 WLLKDKLLRIEAATAERNGMGIPVGTASSATD-EDEVRKMAALARSVRGGINA--GVGLAQGQILELLGVSGNLPDIRRA 304 (469) T ss_pred HHHHHHHHHHHHHHHHHcCCcceEEecCCCCC-HHHHHHHHHHHHHHhcCCce--EEEccCCceEEEeecCCCchHHHHH Confidence 99999999999999999999988888886644 55566777777777666554 4567888887777665555678889 Q ss_pred HHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhh-----hccceeeecch Q lcl|NC_019710. 284 RKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKD-----VGRIHAEHNLD 358 (424) Q Consensus 284 ~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~-----~~~~~~~f~~~ 358 (424) .++-.++|+.+.--.- +-....++++ ...+........-+.-.++.|+..||++|+.+.- ....+.+|.++ T Consensus 305 i~~~d~~Isk~iLG~t-lTs~~~gGS~---a~~~vh~ev~~d~~~sDa~~i~~tln~~li~~l~~lN~g~~~~~P~~~~~ 380 (469) T protein:vir:10 305 IEGHDRSIALSGLAHF-LNLDGKGGSY---ALASVLEDPFTQAVHAYATSICRIANQHIIEDLVDINFGVDTPAPVLTFD 380 (469) T ss_pred HHHHHHHHHHHHhccc-ccccCccchh---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCccEEEec Confidence 9999999988764321 1111112222 3345566777888889999999999988876521 11122344444 Q ss_pred hhhccCHHHHHHHHHHHHhCCC-----cCHHHHHHHhCCCCCCCcCeeeecc--cccchhh-ccccCCCc--cCCC Q lcl|NC_019710. 359 GLLRGDSASRAAFMKAMGESGL-----RTINEMRRTDNLPPLPGGDVAMRQS--QYVPITD-LGTNKEPR--NNGA 424 (424) Q Consensus 359 ~~~~~d~~~~~~~~~~~~~~g~-----~t~NE~R~~lg~~p~~ggd~~~~~~--n~~~~~~-~~~~~~~~--~~g~ 424 (424) ... .+.+..++.++++++.|+ .+.+.+|+.+|+|+-+.++....+. +..|... ......+. 
+..+ T Consensus 381 ~~e-~~~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (469) T protein:vir:10 381 PIG-SRQDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNLPSELNDTPSAEPEEPAAVPNQSAAPARTRSSGNADAR 455 (469) T ss_pred CCC-CcHHHHHHHHHHHHhcCCccCccccHHHHHHHhCCCCCCCCcccccchhcccCCCCCccccccCCCCCcccc Confidence 443 456778999999999998 4567899999998765555443221 1111111 01111000 0000 No 112 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=99.82 E-value=1.5e-19 Score=123.72 Aligned_cols=406 Identities=9% Similarity=-0.045 Sum_probs=230.5 Q ss_pred CCC----CCcccc------cCCC-ccHHHHHHhhccCccccccccccc-ccccccccccCCcc----ccHHHHhhhHHHH Q lcl|NC_019710. 1 MEE----PKYTID------LRTN-NGWWARLKSWFVGGRLVTPNQGSQ-TGPVSAHGYLGDSS----INDERILQISTVW 64 (424) Q Consensus 1 ~~~----~~~~~~------~~~~-~G~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~----~~~~~~~~~~~v~ 64 (424) |.+ ---|+. -++. .+.+.+.+.. .++..-.+.. ...+. ..-.+... +..+-..+.+.|. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~----~~~~gltp~~l~~iLr-~a~~gd~~~~~~L~e~m~e~D~~i~ 75 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQLREPQTSRLAGLAKEFAQ----HPAKGLTPAKLARILV-EAEQGNLQAQAELFMDMEERDAHLF 75 (526) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcc----cCcCCCCHHHHHHHHH-hhhCCCHHHHHHHHHHHHhhChHHH Confidence 111 000000 0111 1111111110 0110000000 00000 00000000 1111112589999 Q ss_pred HHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC---c Q lcl|NC_019710. 65 RCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG---D 141 (424) Q Consensus 65 ~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G---~ 141 (424) +|++.+..+|.+++|.|.....+.. .......-+...|...|+ ..+++..+. +.+++|-++.++++..+| . 
T Consensus 76 s~l~~Rk~av~~~~w~I~p~~~~~~-~~~~~a~~v~~~l~~~~~----~~~~i~~~l-da~~~G~s~~Eivw~~~~g~~~ 149 (526) T protein:vir:99 76 AEMSKRKRAILGLDWAVEPPRNASA-AEKADADYLHELLLDLEG----LEDLLLDAL-DGIGHGYSCIELEWALQGREWM 149 (526) T ss_pred HHHHHHHHHHhCCCceEecCCCCCH-HHHHHHHHHHHHHhcccC----HHHHHHHHH-HhhhhcceeEEEEEeecCCcee Confidence 9999999999999999964332211 111112234455544342 445555555 577899999999865543 5 Q ss_pred eeEEEEeccceEEEEEcCCceEEEEEecCceEEecHhHe-eEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019710. 142 VISLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEI-FHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFAN 220 (424) Q Consensus 142 ~~~l~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~ev-ih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 220 (424) |..+.+.++.++.+..+++..............+++... +|........++|.+.+..+.-....-....++...|... T Consensus 150 ~~~l~~r~~~~f~~~~~~~~~l~~~~~~~~g~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~ 229 (526) T protein:vir:99 150 PLAFHHRPQSWFQLNPEDQNELRLRDNSPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEI 229 (526) T ss_pred EEEeeeecccceeeccCCCcEEEecCCCCCceeecCCCeEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHH Confidence 778999999988887777655433333445677887765 5544445566899999999999999888899999999999 Q ss_pred cCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCC-hhHHHHHHHHHHHHHHHHHHhCCCH Q lcl|NC_019710. 221 GAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVT-PQDAEMMASRKFQVSELARFFGVPP 299 (424) Q Consensus 221 g~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s-~~d~~~~e~~~~~~~~Ia~~fgVP~ 299 (424) -+.|-.+.+++.+.+ ++.++.+.+.+.++.+ + ..++++.|++++-+..+ .....|.+..++-.++|+.+. +-. 
T Consensus 230 yG~P~~igky~~~a~-~~ek~~L~~av~~i~~--d--~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LGq 303 (526) T protein:vir:99 230 YGLPIRLGKYPPGTA-DEEKATLLRAVTGLGH--A--AAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV-LGG 303 (526) T ss_pred cCCceEEEecCCCCC-HHHHHHHHHHHHHHhh--C--cEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH-hhh Confidence 999988889886654 4445555566665533 2 35666666655554422 222347888888899998875 111 Q ss_pred HHcCCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhc--------cceeeecchhhhccCHHHHHH Q lcl|NC_019710. 300 HLVGDV-EKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG--------RIHAEHNLDGLLRGDSASRAA 370 (424) Q Consensus 300 ~~l~~~-~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~--------~~~~~f~~~~~~~~d~~~~~~ 370 (424) .+-... .+++++++. .+........-+.-.++.|++.||+.|+.+.-.. ..+.+|.++.....|.+.+++ T Consensus 304 tlTs~~~~g~~gS~a~-g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~ 382 (526) T protein:vir:99 304 TLTSTTSQSGGGAFAL-GQVHNEVRHDLLASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQ 382 (526) T ss_pred hhccccccCcchhhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHH Confidence 111111 111222222 2334556677777888999999998886543111 112344445556778889999 Q ss_pred HHHHHHhCCC-cCHHHHHHHhCCCCCCCcCeeeecccccchhhccccCCC----ccCC-------C Q lcl|NC_019710. 371 FMKAMGESGL-RTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEP----RNNG-------A 424 (424) Q Consensus 371 ~~~~~~~~g~-~t~NE~R~~lg~~p~~ggd~~~~~~n~~~~~~~~~~~~~----~~~g-------~ 424 (424) .+.++++.|+ ++..++|+.+|+|.-..++..+.+..-.+.......... 
...+ + T Consensus 383 ~~~~L~~~G~~i~~~~i~e~~Gip~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (526) T protein:vir:99 383 SIPALVNVGLEIPSAWVYDKLGIPQPAKNEPVLRSAAQPAILSRQHGQRVAALATIVGPRYGDQQA 448 (526) T ss_pred HHHHHHhCCCccCHHHHHHHhCCCCCCCcccccCCCCCCcccccccccccccccccccccCcchhh Confidence 9999999997 899999999999866555555443222111111000000 0000 0 No 113 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=99.81 E-value=2.7e-19 Score=122.38 Aligned_cols=405 Identities=9% Similarity=-0.045 Sum_probs=226.7 Q ss_pred CCCC----Ccccc------cCCC-ccHHHHHHhhccCcccccc-cccccccccccccccCCcc-----ccHHHHhhhHHH Q lcl|NC_019710. 1 MEEP----KYTID------LRTN-NGWWARLKSWFVGGRLVTP-NQGSQTGPVSAHGYLGDSS-----INDERILQISTV 63 (424) Q Consensus 1 ~~~~----~~~~~------~~~~-~G~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~v 63 (424) |.+= .-|+. =++. .+.+.+.+. . .++.. .+..+...+.. -.+|.. +..+-..+.+.| T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~---~-~~~~gltp~~l~~il~~--a~~gd~~~~~~L~~~m~e~D~~i 74 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQQLRKQQTAHLAGLAKEFA---N-HPAKGLTPAKLAHILIE--AEQGHLQAQAELFMDMEERDAHL 74 (528) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhc---c-cCCCCCCHHHHHHHHHh--hhCCCHHHHHHHHHHHHhhChHH Confidence 1110 00000 0111 111111111 0 00000 00000000000 001110 111111258899 Q ss_pred HHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC---C Q lcl|NC_019710. 64 WRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA---G 140 (424) Q Consensus 64 ~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~---G 140 (424) .+|++.+..+|.+++|+|.....+.. .+.....-+...|..-|+ ..+++.. +.+.+++|.++.++++..+ . 
T Consensus 75 ~s~l~~Rk~av~~~~w~I~p~~~~~~-~~~~~a~~v~~~l~~~~~----f~~~i~~-~lda~~~G~s~~Ei~w~~~~g~~ 148 (528) T protein:vir:10 75 FAEMSKRKRAVLGLDWTIEPPRNASA-AEKADAEYLHELLLDLEG----IEDLMLD-CMDGVGHGYSAIELDWSLQGREW 148 (528) T ss_pred HHHHHHHHHHHhcCCceEecCCCCCH-HHHHHHHHHHHHHhCCcc----HHHHHHH-HHhhhhhcceeEEEEEeecCCce Confidence 99999999999999999964432211 111111223444443221 2233333 3446679999998886443 3 Q ss_pred ceeEEEEeccceEEEEEcCCceEEEEEecCceEEecHhHeeEecCcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019710. 141 DVISLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFA 219 (424) Q Consensus 141 ~~~~l~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ 219 (424) .|..+.+.|+.++.+..+++.....-.....+..+++...++.++.. ...++|.+.+..+.-....-....++...|.. T Consensus 149 ~~~~~~~r~~~~f~~~~~~~~~l~~~~~~~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E 228 (528) T protein:vir:10 149 LPQAFDHRPQSWFQLNPDDQDELRLRDNSIAGEVLQPFGWIMHKPRSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLE 228 (528) T ss_pred eEEEeeeecccceeeccCCCcEEeccCCCCCceeecCCCeEEEeecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHH Confidence 57789999998888777665443222223346778887755555544 45578999999999999999999999999999 Q ss_pred ccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCC-hhHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019710. 
220 NGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVT-PQDAEMMASRKFQVSELARFFGVP 298 (424) Q Consensus 220 ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s-~~d~~~~e~~~~~~~~Ia~~fgVP 298 (424) .-+.|-.+.+++.+.+ ++.++.+.+.+.++.+ ++ .++++.|++++-+..+ ..-..|.+..++-.++|+.+.-= T Consensus 229 ~yG~P~~igky~~~a~-~~ek~~L~~al~~i~~--~~--~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLG- 302 (528) T protein:vir:10 229 IYGLPIRLGKYPPGTP-DEEKVTLLRAVTGLGH--AA--AGIIPESMSIDFQEASKGSAEPFMAMMRWCDDSMSKAILG- 302 (528) T ss_pred HcCCCeEEEecCCCCC-HHHHHHHHHHHHHHhh--Cc--EEEecCCceeEEeecCCCChhHHHHHHHHHHHHHHHHHhh- Confidence 9999988889887654 4445555566655543 23 4556666655544421 22234778888889998887521 Q ss_pred HHHcCC-CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhc--------cceeeecchhhhccCHHHHH Q lcl|NC_019710. 299 PHLVGD-VEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG--------RIHAEHNLDGLLRGDSASRA 369 (424) Q Consensus 299 ~~~l~~-~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~--------~~~~~f~~~~~~~~d~~~~~ 369 (424) ..+-.. ..+.+++++ ..+........-+.-.++.|+..||+.|+.+.-.. ..+.+|.++.....|.+.++ T Consensus 303 qtlTs~~~~g~~gS~A-lg~vh~~v~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a 381 (528) T protein:vir:10 303 GTLTSQTSESGGGAYA-LGQVHNEVRHDLLAADARQLAATLSRDLLWPLLVLNRSGNLDARRAPRLVFDLKDRADLAAMA 381 (528) T ss_pred hhhhccccccccchhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCCcccHHHHH Confidence 122111 111112221 12334566777788888999999998876543111 11234444555677888999 Q ss_pred HHHHHHHhCCC-cCHHHHHHHhCCCCCCCcCeeeecccccchhhccccCCCccCC------C Q lcl|NC_019710. 370 AFMKAMGESGL-RTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNG------A 424 (424) Q Consensus 370 ~~~~~~~~~g~-~t~NE~R~~lg~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~g------~ 424 (424) +.+.+++..|+ ++..++|+.+|+|.-..++....+....+.........+.... 
+ T Consensus 382 ~~~~~L~~~G~~i~~~~i~e~~gip~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 443 (528) T protein:vir:10 382 TSLPPLVKLGVQVPVNWVQEQLGIPLPANGEAVLGDQAGAGIAQLSRRPGPRIAALAQVIGP 443 (528) T ss_pred HHHHHHHhCCCCCCHHHHHHHhCCCCCCCCcccccCCCcccccccCcccccccccccccccc Confidence 99999999997 9999999999998666666665443332221111111111100 0 No 114 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.81 E-value=3.8e-20 Score=127.03 Aligned_cols=378 Identities=9% Similarity=0.011 Sum_probs=208.8 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCcee Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |..++.++. +.-|+-+. |.+..... .+.. ......+.. .-...|.+++.+.++|+.+|..+-+..+. T Consensus 5 m~~~~~~~~--~~D~~~~~----~~~~~g~~-~~~~-----~~~~~~~~~-~l~~~Y~~~~l~~~~Vd~~aed~~r~g~~ 71 (435) T protein:vir:79 5 MSDKVKAIT--KEDGYNEI----FGSKDGTF-RPNA-----FYMQRAAFK-ALSQFYEEDGMARRIVDVIPEEMVTPGFK 71 (435) T ss_pred cccccccch--hhcchhhh----hccccccc-ccCc-----ccCCcCCHH-HHHHHHhcCchhhhhhccchHHhhcCCce Confidence 888865444 55454443 22111110 0000 000011111 12345778999999999999999999888 Q ss_pred EeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee-CC---------CCceeEEEEecc Q lcl|NC_019710. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR-NS---------AGDVISLLPLQS 150 (424) Q Consensus 81 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r-~~---------~G~~~~l~~l~p 150 (424) +.- .+ +.+ .+...+. +- ...+-+...+.+..++|.|++++.. +. 
.|.+..+.++++ T Consensus 72 i~g-~~--~~~------~~~~~~~-~l----~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~ 137 (435) T protein:vir:79 72 VDG-VK--NEK------SFKSRWD-EL----RLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDR 137 (435) T ss_pred ecC-CC--hHH------HHHHHHH-Hh----hHHHHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeech Confidence 631 11 111 1111121 11 1234455556666678988877753 32 255668888888 Q ss_pred ceEEEEE---cC------CceEEEEEecC--ceEEecHhHeeEecCcC-------CCCccccchH-HHHHHHHHHHHHHH Q lcl|NC_019710. 151 ANMDVKL---VG------KKVVYRYQRDS--EYADFSQKEIFHLKGFG-------FTGLVGLSPI-AFACKSAGVAVAME 211 (424) Q Consensus 151 ~~v~~~~---~~------~~~~~~~~~~~--~~~~~~~~evih~r~~~-------~~~~~G~s~~-~~~~~~i~~~~~~~ 211 (424) .++++.. |. ....|.+...+ ....|.++.||||.+.. ...++|.|++ +.+.+.+.....+. T Consensus 138 ~~i~~~~~~~dp~sp~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~ 217 (435) T protein:vir:79 138 YQITIHERETNARSVRYGEPKLYKISPGGDIPEFFVHYSRICIIDGERVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQ 217 (435) T ss_pred hhccchhhccCCcccccCcceEEEEecCCCCCceEEcceeEEEecCCcchhhhccccCcccchHHHHHHHHHHHHHHHHH Confidence 8886432 11 12334443332 35689999999997532 2356799998 57889899998888 Q ss_pred HHHHHHHhccCCCceeEEcC---CCCCCHHHHHHHHHHHHHHhCCcc-cCcceecCCCceeeeccCChhHHHHHHHHHHH Q lcl|NC_019710. 212 DQQRDFFANGAKSPQILSTG---EKVLTEQQRSQVEENFKEIAGGPV-KKRLWILEAGFSTSAIGVTPQDAEMMASRKFQ 287 (424) Q Consensus 212 ~~~~~~~~ng~~p~~vl~~~---~~~~~~~~~~~~~~~~~~~~~~~~-ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~ 287 (424) .....++....... ++.+ .....++......+.++......+ .+.+++..++.+|++++.+..+ +.+..... 
T Consensus 218 ~~~~~l~~~~~~~v--~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~i~~~~e~~e~~~~~lsg--l~~~~~~~ 293 (435) T protein:vir:79 218 ELATQLLRRKQQAV--WKARDLALMCDDEEGRYAARLRLAQVDDESGVGKAIGIDATDEEYEVLNSDVSG--VPEFLQEK 293 (435) T ss_pred HHHHHHHHHhcCcc--ccchhHHHhhcCccchHHHHHHHHHHHHhcCCCCceeEecCCcceEEEecccCC--HHHHHHHH Confidence 88888776555432 3332 111222223333334433332222 2345555555689998887765 46788888 Q ss_pred HHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHH-------HHHHHHHHHHHHHhhhccChhhhccceeeecchhh Q lcl|NC_019710. 288 VSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY-------TLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGL 360 (424) Q Consensus 288 ~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~-------tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~ 360 (424) ..+||++.+||..+|.+...+..+ ++.+...+.||.. -+.|.++.+-+-+ ... ..+.|.+++| T Consensus 294 ~~~iaaa~~IP~t~L~G~s~~gln-stgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~li----~~s-----~d~~~~f~pL 363 (435) T protein:vir:79 294 IDRIVALTGIHEIIIKNKNTGGVS-ASQNTALETFYKLIDRKRVEDYKPILEFLLPFM----ISE-----TEWSIEFEPL 363 (435) T ss_pred HHHHHhhhCCCeeeeccCCccccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hcC-----CCCeEEeCCC Confidence 999999999999888665554433 2334445555553 2445444433322 111 1244555778 Q ss_pred hccCHHHH-------HHHHHHHHhCCCcCHHHHHHHh-CCCC---CCCcCeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 361 LRGDSASR-------AAFMKAMGESGLRTINEMRRTD-NLPP---LPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 361 ~~~d~~~~-------~~~~~~~~~~g~~t~NE~R~~l-g~~p---~~ggd~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) ...|.+++ ++.+++++++|+++++|+|+.+ ...+ +.+.+..-++. .++ .+.+++.++|. 
T Consensus 364 ~~~sekEkAei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~----~~d-~~~~~~~e~g~ 433 (435) T protein:vir:79 364 SVPSDKDKAEIMAKNVESVVKLKAEQAINLKETRDTLRSICPDLKIMDNDNIELPE----PED-LDPEPGQEGGL 433 (435) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhccccCCCCcccccCCc----ccc-CCCCCCCCCCC Confidence 77777655 4567788999999999999977 2222 22211111111 011 11122222222 No 115 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.81 E-value=1.8e-19 Score=123.35 Aligned_cols=395 Identities=11% Similarity=0.059 Sum_probs=208.6 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCccccccc---------------c------------------------ccccccc Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPN---------------Q------------------------GSQTGPV 41 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~---------------~------------------------~~~~~~~ 41 (424) ..++ -|.+=+-+.| ++++|........+. + ....... T Consensus 28 ~~~~-~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 103 (765) T protein:vir:96 28 IPQH-DPLDPMIKLG---KIRGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAGGQNPYVVPTMLQDWY 103 (765) T ss_pred cCCC-CCcccchhHH---HHhhcccccccCCCCCCCCcccCcccceeccccccccccchHHHhhhccCccchhhHHHhhh Confidence 1110 0111111111 111111111111000 0 0000000 Q ss_pred ccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHH Q lcl|NC_019710. 42 SAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMT 121 (424) Q Consensus 42 ~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~ 121 (424) ....+.+ +--...|.+++.+..+|+++|..+-+-.+.+.-.+.+... .....|...-.. 
..-.+-+...+ T Consensus 104 ~~~~f~g--yql~alY~~~~l~rkiVd~pAeDa~R~g~~I~~~~~e~~~-------~~~~~l~~~~~r-l~v~~~l~ea~ 173 (765) T protein:vir:96 104 NSQGFIG--YQACAIISQHWLVDKACSMSGEDAARNGWELKSDGRKLSD-------EQSALIARRDME-FRVKDNLVELN 173 (765) T ss_pred cccCCcc--HHHHHHHHhCchhhhhhhcchHHhhcCCceeecCccccCH-------HHHHHHHHHHHH-hhHHHHHHHHH Confidence 0000000 0112347789999999999999999988887421111111 111112111111 12345555666 Q ss_pred HHHHHcCCeEEEEeeC-CC---------------CceeEEEEeccceEEEEE------cCCce-E---EEEEecCceEEe Q lcl|NC_019710. 122 MQLCFYGNAYALVDRN-SA---------------GDVISLLPLQSANMDVKL------VGKKV-V---YRYQRDSEYADF 175 (424) Q Consensus 122 ~~~l~~G~a~~~~~r~-~~---------------G~~~~l~~l~p~~v~~~~------~~~~~-~---~~~~~~~~~~~~ 175 (424) .+.-++|-+|+++.-. .+ |....|..++|.++.... |.... + -.|...+ ..| T Consensus 174 ~~~RlyGga~i~i~i~~~D~~~l~~PL~~~~I~kg~~kgl~vldp~~~~~~~v~e~~~Dp~sp~fg~P~~y~i~g--~~I 251 (765) T protein:vir:96 174 RFKNVFGVRIALFVVESDDPDYYEKPFNPDGIAPGSYKGISQIDPYWAMPQLTAESTADPSAEHFYEPDFWIISG--KKY 251 (765) T ss_pred HHhhhceeeEEEEEecccCcchhhccccccccccceeeEEEEechhhcccccchhccccccccccCcceeeeecC--cee Confidence 6667788888776432 12 344567777776655421 11110 1 1222332 357 Q ss_pred cHhHeeEecCcCC-------CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC--CCCCCHHHHHHHHHH Q lcl|NC_019710. 176 SQKEIFHLKGFGF-------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG--EKVLTEQQRSQVEEN 246 (424) Q Consensus 176 ~~~evih~r~~~~-------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~--~~~~~~~~~~~~~~~ 246 (424) -++.||||..... .+++|.|.++.+.+.|.....+......++...... +++++ .....++ .+.+. 
T Consensus 252 H~SRli~~~g~~lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~--v~k~~~~~~l~~~~---~l~~r 326 (765) T protein:vir:96 252 HRSHLVVVRGPQPPDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKRTS--TIHVDVEKAIANED---AFNAR 326 (765) T ss_pred ccceEEEecCCCchhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccc--eeeechHhhhccHH---HHHHH Confidence 7889999975442 345799999999999999999998888888876654 33332 1222333 23344 Q ss_pred HHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHH-- Q lcl|NC_019710. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ-- 324 (424) Q Consensus 247 ~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~-- 324 (424) ++......+..++++++.+.+|++++.+..+ +.+.......+||.+.+||..+|.+...+..+ ++.+...+.||. T Consensus 327 ~~~~~~~r~n~g~~~id~ee~~e~~s~~lsg--l~d~l~~~~~~iAaas~IP~t~LfGqsp~Gln-ATGe~D~~nYyD~I 403 (765) T protein:vir:96 327 LAFWIANRDNHGVKVIGIDETMEQFDTNLSD--FDSVIMNQYQLVAAIAKTPATKLLGTSPKGFN-ATGEHETISYHEEL 403 (765) T ss_pred HHHHHHhcCCceeEEecCCcceeEEecccCC--HHHHHHHHHHHHHhhhCCCeeeeccCCccccc-CcchHHHHHHHHHH Confidence 4443333333468899999999999888765 46778888999999999999888765422221 233445566666 Q ss_pred -----HHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhCC Q lcl|NC_019710. 325 -----YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAA-------FMKAMGESGLRTINEMRRTDNL 392 (424) Q Consensus 325 -----~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~-------~~~~~~~~g~~t~NE~R~~lg~ 392 (424) .-+.|.++.+-+.|-..- .....+.|.+++|...+.+++++ .+++++++|+++++|+|+.+.. 
T Consensus 404 ~s~Qe~~l~p~le~L~~li~~s~-----~i~~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~ 478 (765) T protein:vir:96 404 ESIQEHIFDPLLERHYLLLAKSE-----SIDVQLEIVWNPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRD 478 (765) T ss_pred HHHHHHHHHHHHHHHHHHHHHhc-----CCCCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhc Confidence 456677666655554321 11123556667888888777665 4788999999999999999865 Q ss_pred CCC------CCcCe----eeecccccchhhccccC-----CCccC--------CC Q lcl|NC_019710. 393 PPL------PGGDV----AMRQSQYVPITDLGTNK-----EPRNN--------GA 424 (424) Q Consensus 393 ~p~------~ggd~----~~~~~n~~~~~~~~~~~-----~~~~~--------g~ 424 (424) ++. +..+. ...|.+...+...+.+. +...+ |+ T Consensus 479 ~~~~g~~~l~d~~~e~~~~~~pe~~~~~~~~~~~~~~~~~e~~~~~a~p~~~eg~ 533 (765) T protein:vir:96 479 DPRSGYNRLTDDQAETEPGMSPENLAELEKAGAQSAKAKGEAERAEAQAGAVEGA 533 (765) T ss_pred cccCCCCCCCccccccccCCCccccccccCCCcccccccCccccccCCCCccCCC Confidence 542 21111 11111111111111000 00000 00 No 116 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=99.79 E-value=1.2e-18 Score=118.86 Aligned_cols=406 Identities=9% Similarity=-0.044 Sum_probs=230.1 Q ss_pred CCC----CCcc------cccCCC-c-cHHHHHHhhccCcccccccccccccccccccccCCcc----ccHHHHhhhHHHH Q lcl|NC_019710. 1 MEE----PKYT------IDLRTN-N-GWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSS----INDERILQISTVW 64 (424) Q Consensus 1 ~~~----~~~~------~~~~~~-~-G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~v~ 64 (424) |.+ ---| +..+|. . |+.+++...... --+|. .....+. ..-.+... +..+--.+.+.|. 
T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~--gltp~--~l~~il~-~a~~gd~~~~~~L~edm~e~D~~i~ 75 (526) T protein:vir:79 1 MAQIVDVYGNPIRPQQLREPQTSRLAGLAKEFAQHPAK--GLTPA--KLARILV-EAEQGNLQAQAELFMDMEERDAHLF 75 (526) T ss_pred CCeeeCCCCCccCccccchhhhhhhhhhhhhcccCCCC--CcCHH--HHHHHHH-HhhCCCHHHHHHHHHHHHhhChHHH Confidence 211 0000 111111 1 112211111100 00000 0000000 00001110 1111112578999 Q ss_pred HHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC---c Q lcl|NC_019710. 65 RCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG---D 141 (424) Q Consensus 65 ~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G---~ 141 (424) +|++.+-.+|.+++|.|.....+.. .......-+...|...|+ ..+++..+.. .+++|-++.++++..+| . T Consensus 76 s~l~~Rk~av~~~~w~I~p~~~~~~-~~~~~a~~v~~~l~~~~~----~~~~i~~~ld-A~~~G~s~~Ei~w~~~~g~~~ 149 (526) T protein:vir:79 76 AEMSKRKRAILGLDWAVEPPRNASA-AEKADADYLHELLLDLEG----LEDLLLDALD-GIGHGYSCIELEWALQGREWM 149 (526) T ss_pred HHHHHHHHHHhCCCceEecCCCCCh-HHHHHHHHHHHHHhcccC----HHHHHHHHHh-hhhhcceeEEEEEeecCCcee Confidence 9999999999999999964432211 111112234455544342 4455555444 66799999999865543 5 Q ss_pred eeEEEEeccceEEEEEcCCceEEEEEecCceEEecHhHeeEecCc-CCCCccccchHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019710. 142 VISLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFAN 220 (424) Q Consensus 142 ~~~l~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 220 (424) +..+.+.++.+..+..+++..............+++...+..++. ....++|.+.+..+.-....-....++...|... 
T Consensus 150 ~~~l~~r~~~~F~~~~~~~~~l~~~~~~~~g~~l~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~ 229 (526) T protein:vir:79 150 PLAFHHRPQSWFQLNPEDQNELRLRDNSPAGEALQPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEI 229 (526) T ss_pred EEEeeeecccceEeccCCCcEEEecCCCCCceeecCCceEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHH Confidence 778999999888877776654433333345677888865555554 4556899999999999988888899999999999 Q ss_pred cCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccC-ChhHHHHHHHHHHHHHHHHHHhCCCH Q lcl|NC_019710. 221 GAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGV-TPQDAEMMASRKFQVSELARFFGVPP 299 (424) Q Consensus 221 g~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~-s~~d~~~~e~~~~~~~~Ia~~fgVP~ 299 (424) -+.|-.+.+++.+.+++ .++.+.+.+.++.+ + ..++++.|++++-+.. +.....|.+..++-.++|+.+. +-. T Consensus 230 yG~P~~igky~~~a~~~-ek~~L~~av~~i~~--d--a~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LGq 303 (526) T protein:vir:79 230 YGLPIRLGKYPPGTADE-EKATLLRAVTGLGH--A--AAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV-LGG 303 (526) T ss_pred cCCceEEEecCCCCCHH-HHHHHHHHHHHHhc--C--cEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH-hhh Confidence 99998888988665544 45555566666543 2 3566666666555543 2223347888889999998874 111 Q ss_pred HHcCCC-CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhc--------cceeeecchhhhccCHHHHHH Q lcl|NC_019710. 300 HLVGDV-EKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG--------RIHAEHNLDGLLRGDSASRAA 370 (424) Q Consensus 300 ~~l~~~-~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~--------~~~~~f~~~~~~~~d~~~~~~ 370 (424) .+-... .+++++++. .+........-+.-.++.|++.||+.|+.+.-.. 
..+.+|.++.....|.+.+++ T Consensus 304 tlTs~~~~g~~gS~a~-g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~ 382 (526) T protein:vir:79 304 TLTSTTSQSGGGAFAL-GQVHNEVRHDILASDARQLAATLSRDLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQ 382 (526) T ss_pred hhccccccCcchhhhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHH Confidence 111111 111222222 2344566777788899999999998887543211 112344445556778899999 Q ss_pred HHHHHHhCCC-cCHHHHHHHhCCCCCCCcCeeeecccccchhhccccCC----CccCCC Q lcl|NC_019710. 371 FMKAMGESGL-RTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKE----PRNNGA 424 (424) Q Consensus 371 ~~~~~~~~g~-~t~NE~R~~lg~~p~~ggd~~~~~~n~~~~~~~~~~~~----~~~~g~ 424 (424) .+.+++..|+ ++..++|+.+|+|....++..+.|.............. ....++ T Consensus 383 ~~~~L~~~G~~i~~~~i~e~~gip~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (526) T protein:vir:79 383 SIPALVNVGLEIPSAWVYDKLGIPQPAKNEPVLRPAAQPAILSRQHGQRVAALATIVGP 441 (526) T ss_pred HHHHHHhCCCcCCHHHHHHHhCCCCCCCchhhccccCCccccccccccccccccccccc Confidence 9999999997 89999999999975555555443322111000000000 000000 No 117 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.78 E-value=3.3e-19 Score=121.91 Aligned_cols=393 Identities=13% Similarity=0.094 Sum_probs=211.7 Q ss_pred CCCCCcccc--cC-CCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhC Q lcl|NC_019710. 1 MEEPKYTID--LR-TNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACL 77 (424) Q Consensus 1 ~~~~~~~~~--~~-~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~ 77 (424) |..-...-- +. .+.|- .++....+.... .........+.....+.. 
.-...|.+++.+..+|+.+|..+-+- T Consensus 1 ~~~~~~a~~~~~~~~a~~~-~~~~~~~g~~~~---~d~~~~~~~~~~~~~~~~-~l~~lY~~~~l~r~iVd~~a~d~~r~ 75 (461) T protein:vir:80 1 MYSIDKAKQAKIDSKIVNR-NDFMVGHGKANS---RDKLTRQTPGNGQKLDLK-ACENLYASNSIAMNIVDIISEDMVRA 75 (461) T ss_pred Cccchhhhhhhhhhhhhhh-hHHHhhcCCcch---hhhhhccccCcccccCHH-HHHHHHHhCCccchhhccchHHhhcC Confidence 332111110 11 11121 111111111100 011111111111111111 11245677889999999999999998 Q ss_pred ceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee-CCC------------CceeE Q lcl|NC_019710. 78 PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR-NSA------------GDVIS 144 (424) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r-~~~------------G~~~~ 144 (424) ++.+. . ++.+.. ..+...+. + . .-.+-+...+.+..++|.|++++.- +.+ +.+.. T Consensus 76 g~~i~-~-~~~~~~-----~~~~~~~~-~---l-~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl~~~~~~~ 143 (461) T protein:vir:80 76 GWSLK-T-DNKEMK-----KNIESKWR-K---L-KTKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLSTAIDPKTIKS 143 (461) T ss_pred Ceeee-c-CCHHHH-----HHHHHHHH-H---h-hHHHHHHHHHHhhcccccEEEEEEeecCCccccCccCCcccccccc Confidence 88763 2 111110 11222221 1 1 1234455566667789999988743 221 11223 Q ss_pred EEEecc---ceEEE---EEc------CCceEEEEEe-------------cCceEEecHhHeeEecCcCC-CCccccchHH Q lcl|NC_019710. 145 LLPLQS---ANMDV---KLV------GKKVVYRYQR-------------DSEYADFSQKEIFHLKGFGF-TGLVGLSPIA 198 (424) Q Consensus 145 l~~l~p---~~v~~---~~~------~~~~~~~~~~-------------~~~~~~~~~~evih~r~~~~-~~~~G~s~~~ 198 (424) +..|.| ..+.. ..| +....|.+.. +.....|.++.||||.+... 
+..+|.|.++ T Consensus 144 ~~~l~~~~~~~i~~~~~~~dp~sp~fg~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~SRii~~~~~~~~~~~~G~S~le 223 (461) T protein:vir:80 144 IPYINTFNTQKVTQLYLNQDMFSEHFGEVEFFEVNRVSQLGEEILSGTTASTSEQIHRSRIIHEQGLRFEGETKGRSIFE 223 (461) T ss_pred eeEEEeccccccchhhhcccCcCcccccceEEEEeccccccccccccccCccceEEccccEEEecCCCCCccccCcchHH Confidence 333332 22221 111 1112333322 22346799999999987664 5578999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCC-CCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhH Q lcl|NC_019710. 199 FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE-KVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQD 277 (424) Q Consensus 199 ~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~-~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d 277 (424) .+.+.+.....+......+..+...+ +++.+. .....+....+.+.++...+ ..++++++.+.+++.++.+..+ T Consensus 224 ~~~~~l~~~~~~~~~~~~l~~~~~~~--v~k~~~l~~~~~~~~~~~~~~~~~~~~---~~g~~~~d~~e~~e~~~~~lsg 298 (461) T protein:vir:80 224 SLYDIITVMDTSLWSVGQILYDFAFK--VYKTDDIDALNKDDKANLTAMLDFMFR---TEALAIIKGDEQLTKESTNVSG 298 (461) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCC--ceecchHHhhhchHHHHHHHHHHHhcC---CceEEEEcCCcceEEEecCcCC Confidence 99999999999988888888776554 444431 11122223344445554443 2358888988899999888765 Q ss_pred HHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHH-------HHHHHHHHHHHHHHhhhccChhh--- Q lcl|NC_019710. 278 AEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ-------YTLQPYISRWENSIQRWLIPAKD--- 347 (424) Q Consensus 278 ~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~-------~tl~P~~~~ie~~l~~~L~~~~~--- 347 (424) +.+..+.....||.+-+||..+|.+...+.. ++.++..+.||. .-+.|+++.+.+.+-+..+.... 
T Consensus 299 --l~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~--asge~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~ 374 (461) T protein:vir:80 299 --MKDLLDYGWDYLAGAVRMPKTVLKGQEAGTL--TGAQYDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSID 374 (461) T ss_pred --HHHHHHHHHHHHhhhhcCCeeeeecccCCcc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccC Confidence 4678889999999999999998866544333 244555555554 34667777777766554443211 Q ss_pred hccceeeecchhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHh----CCCCCC--CcCeeeecccccchhhcc Q lcl|NC_019710. 348 VGRIHAEHNLDGLLRGDSASRAA-------FMKAMGESGLRTINEMRRTD----NLPPLP--GGDVAMRQSQYVPITDLG 414 (424) Q Consensus 348 ~~~~~~~f~~~~~~~~d~~~~~~-------~~~~~~~~g~~t~NE~R~~l----g~~p~~--ggd~~~~~~n~~~~~~~~ 414 (424) ...+.+.|.++++...|.+++++ .+++++++|+++++|+|+.+ +++|.. .|+.. ...++.... T Consensus 375 p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~~~~~~~~~~~~~~~~~----~~~~~~~~~ 450 (461) T protein:vir:80 375 PDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRFGRFGLENSSKFSGDSA----EIDKLAKLV 450 (461) T ss_pred ccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCCCCccCCCCCc----hhhhhhhhc Confidence 11245667778888888877755 57889999999999999855 333321 11111 111111111 Q ss_pred c--cCCCccCC Q lcl|NC_019710. 415 T--NKEPRNNG 423 (424) Q Consensus 415 ~--~~~~~~~g 423 (424) . .++.+.+| T Consensus 451 ~~~~~~e~~~g 461 (461) T protein:vir:80 451 YDAYAKKNADG 461 (461) T ss_pred cccccccCCCC Confidence 1 11112222 No 118 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.78 E-value=3.3e-19 Score=121.91 Aligned_cols=374 Identities=9% Similarity=0.025 Sum_probs=212.3 Q ss_pred cccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeecccc Q lcl|NC_019710. 
8 IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQN 87 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~ 87 (424) |++-+.-|..+- +++....+..... ....+. --...|.+++.+.++|+.+|..+.+..+++.- . T Consensus 1 ~~~~~~d~~~~~----~~~~~~~~~~~~~--------~~~~~~-~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~g-~-- 64 (427) T protein:vir:10 1 MKIVKHDGYNDI----FNGGADGSPKPFF--------MSDASY-HVGSFYNDNATAKRIVDVIPEEMVTAGFKMSG-V-- 64 (427) T ss_pred CCccccchHHHH----hhcCCCCcccCcc--------ccCchH-HHHHHHHcCchhhhhhccchHHhhcCCccccC-c-- Confidence 888888788653 2222211111110 011111 12345778999999999999999999888731 1 Q ss_pred CccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee----------CCCCceeEEEEeccceEEEEE Q lcl|NC_019710. 88 DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR----------NSAGDVISLLPLQSANMDVKL 157 (424) Q Consensus 88 ~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r----------~~~G~~~~l~~l~p~~v~~~~ 157 (424) ++.. .+...+. + ..-.+-+..++.+..++|.|++++.- ...|.+..+.++++.++++.. T Consensus 65 ~~~~------~~~~~~~-~----l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~ 133 (427) T protein:vir:10 65 KDEK------EFKSLWD-S----YKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVEK 133 (427) T ss_pred cHHH------HHHHHHH-H----hhHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhcccccc Confidence 1111 1111121 1 11334555666666678999987642 224678899999988886532 Q ss_pred cC---------CceEEEEEecC--ceEEecHhHeeEecCcC-------CCCccccchHH-HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 158 VG---------KKVVYRYQRDS--EYADFSQKEIFHLKGFG-------FTGLVGLSPIA-FACKSAGVAVAMEDQQRDFF 218 (424) Q Consensus 158 ~~---------~~~~~~~~~~~--~~~~~~~~evih~r~~~-------~~~~~G~s~~~-~~~~~i~~~~~~~~~~~~~~ 218 (424) .. ....|.+..++ ....|.++.||||.+.. .++++|.|++. 
.+.+.+.....+......++ T Consensus 134 ~~~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~ 213 (427) T protein:vir:10 134 RVTNARSPRYGEPEIYKVSPGDNMQPYLIHHSRVFIADGERVAQQARKQNQGWGASVLNKSLIDAICDYDYCESLATQIL 213 (427) T ss_pred cccCccccccCcceEEEEecCCCCcceEEccccEEEecCCCchhhhcccCCcccchhhhHHHHHHHHHHHHHHHHHHHHH Confidence 21 12234443322 23678999999997543 24578999986 56787888888888888777 Q ss_pred hccCCCceeEEcCC--C-CCCHHHHHHHHHHHHHHhC-CcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 219 ANGAKSPQILSTGE--K-VLTEQQRSQVEENFKEIAG-GPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARF 294 (424) Q Consensus 219 ~ng~~p~~vl~~~~--~-~~~~~~~~~~~~~~~~~~~-~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~ 294 (424) ...... +++.+. . ....+......+.++.... ..+-+.+++...+.+|++++.+... +.+.......+||++ T Consensus 214 ~k~~~~--v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsg--l~~~~~~~~~~iaaa 289 (427) T protein:vir:10 214 RRKQQA--VWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNSDISG--VPEFLSSKMDRIVSL 289 (427) T ss_pred HHhccc--cccchhHHHHhcCccchHHHHHHHHHHHHhcCcccceeeecCCCceeEEecccCC--hHHHHHHHHHHHHhh Confidence 665543 334321 1 1122222223333333222 2233446666667889998887765 467788889999999 Q ss_pred hCCCHHHcCCCCCCCcccccHHHHHHHHHH-------HHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHH Q lcl|NC_019710. 295 FGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ-------YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSAS 367 (424) Q Consensus 295 fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~-------~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~ 367 (424) .+||..+|.+...+..+ ++.+...+.||. .-+.|.++.+-+.+- .. 
..+ .|.++++...+.++ T Consensus 290 ~~IP~t~L~G~sp~Gln-stgd~D~~nyyd~i~~~Qe~~l~p~l~~l~~~i~----~s---~~~--~~~f~pL~~~s~kE 359 (427) T protein:vir:10 290 SGIHEIIIKNKNVGGVS-ASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV----DE---EEW--SIEFEPLSVPSKKE 359 (427) T ss_pred hCCCeeeeccCCccccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----cC---CCc--EEEeCCCCCCCHHH Confidence 99999988665554443 233445555555 345566555544332 11 123 44445777766655 Q ss_pred H-------HHHHHHHHhCCCcCHHHHHHHh----CCCCCCCcCeeee--cccccchhhccccCCCccCC Q lcl|NC_019710. 368 R-------AAFMKAMGESGLRTINEMRRTD----NLPPLPGGDVAMR--QSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 368 ~-------~~~~~~~~~~g~~t~NE~R~~l----g~~p~~ggd~~~~--~~n~~~~~~~~~~~~~~~~g 423 (424) + ++.+++++++|+++++|+|+.| +...+.+++..-. +......+ .+..++..+.. T Consensus 360 kaei~~~~a~a~~~~~~~gvi~~~e~r~~L~~~~~~~~~~~~~~~~~e~~~~~~e~~-p~~~e~~~d~~ 427 (427) T protein:vir:10 360 ESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPE-PGLGEKLEDEN 427 (427) T ss_pred HHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhhhccccCCCCccccccccchhcCCC-CCCCCCCCCCC Confidence 5 4567789999999999999876 3444443332211 00100000 01111111111 No 119 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=99.78 E-value=8.1e-18 Score=114.29 Aligned_cols=390 Identities=11% Similarity=0.050 Sum_probs=225.1 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCcee Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) .++++.++.-... ...+......... ..+.. .++ ..-.++..-..+..++.+.|.+|++.+..+|.+++|. 
T Consensus 15 ~~~~~~~~~~~ia--~~~~~~~~~~~~~-~~p~~----~~i--l~~~~~~~~~y~~m~~D~~i~s~l~~Rk~av~~~~w~ 85 (491) T protein:vir:79 15 FGEPDKSLSSQIA--TRARSIDFFALGM-YLPNP----DPV--LKALGKDIRVYRELRADAHVGGCVRRRKAAVKALEWG 85 (491) T ss_pred ccccchhHHHHHh--hhccccccccccc-cCcch----hHH--HhhccCCHHHHHHHhhChHHHHHHHHHHHHHhCCCcE Confidence 2233322221111 0000000000000 00000 000 0001122222344567899999999999999999999 Q ss_pred EeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC---ceeEEEEeccceEEEEE Q lcl|NC_019710. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG---DVISLLPLQSANMDVKL 157 (424) Q Consensus 81 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G---~~~~l~~l~p~~v~~~~ 157 (424) |...+.+. + . ...+...|. ++ ...+++..+. +.+++|.++.++++..+| .|..+.+.|+.++.+.. T Consensus 86 i~~~~~~~--~-~--a~~i~e~l~-~~----~~~~~i~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d~ 154 (491) T protein:vir:79 86 LDRGKAKS--R-V--AKSIADVFA-DL----DLSRIATEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVYDP 154 (491) T ss_pred EecCCCCH--H-H--HHHHHHHHh-cC----CHHHHHHHHH-HhhhhcceeEEEEEeecCCeeeEEeeeeecccceeecc Confidence 96544321 1 1 123444453 33 3556666654 577899999998765543 46789999999888777 Q ss_pred cCCceEEEEEecCceEEecHhHeeEecCcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC Q lcl|NC_019710. 158 VGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT 236 (424) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~ 236 (424) ++...+........+..+++...|++++.. 
...++|.+.+..+.-....-....++...|.+.-+.|-.+.+++.+..+ T Consensus 155 ~~~l~l~~~~~~~~g~~lp~~k~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~~a~~ 234 (491) T protein:vir:79 155 ENQLRFRSKEHWVQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSASD 234 (491) T ss_pred CCceEEeecCCCCCceeecCCCeEEEEecCCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCCCH Confidence 655443333333456788888887777654 4558999999999999999999999999999999999888898866554 Q ss_pred HHHHHHHHHHHHHHhCCcccCcceecCCCceeeec--cC-ChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCC--CCCCcc Q lcl|NC_019710. 237 EQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAI--GV-TPQDAEMMASRKFQVSELARFFGVPPHLVGDV--EKSTSW 311 (424) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l--~~-s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~--~~~~~~ 311 (424) + .++.+.+.+.++.+ ++ .++++.|++++-+ +. +..-..|.+..++-.++|+.+. ||.+ .+++.+ T Consensus 235 ~-ek~~l~~al~~~~~--~a--~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i------LGqtlTt~~~gs 303 (491) T protein:vir:79 235 A-ETNLLLDRLEDMVQ--DA--VAVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIAL------LGQNQTTEATST 303 (491) T ss_pred H-HHHHHHHHHHHHhc--Ce--EEEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHH------hhhhhccCcccc Confidence 4 44555555555532 23 5566666655544 32 2222337777788888888755 3322 223333 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhh----ccceeeecchhhhccCHHHHHHHHHHHHhCCC-cCHHHH Q lcl|NC_019710. 312 GSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDV----GRIHAEHNLDGLLRGDSASRAAFMKAMGESGL-RTINEM 386 (424) Q Consensus 312 ~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~----~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~-~t~NE~ 386 (424) ++.. +........-+.-.++.|++.||+ |+.+.-. .....+|.+.... 
.+.+.+++.++++++.|+ ++.+++ T Consensus 304 ~a~~-~vh~~v~~~i~~~D~~~i~~tln~-li~~l~~~N~~~~~~p~f~~~e~e-e~~~~~a~~~~~L~~~G~~i~~~~~ 380 (491) T protein:vir:79 304 RASA-QAGLEVTDDIRDGDKAIVVEAMNM-LIRWICDLNFDGAARPVFDMWEQE-QVDEIQAGRDEKLTRAGARFTPAYF 380 (491) T ss_pred hhhH-HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCCCcceEeecCcC-chhHHHHHHHHHHHhCCCccCHHHH Confidence 3333 334455666677778888888874 5543211 1123345444332 233568899999999986 899999 Q ss_pred HHHhCCCCCCCcCeeeecccccchhhc--ccc---CCCccCCC Q lcl|NC_019710. 387 RRTDNLPPLPGGDVAMRQSQYVPITDL--GTN---KEPRNNGA 424 (424) Q Consensus 387 R~~lg~~p~~ggd~~~~~~n~~~~~~~--~~~---~~~~~~g~ 424 (424) |+.+|+|+-+.++....+....+.... ... ++...+.+ T Consensus 381 ~e~~Gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 423 (491) T protein:vir:79 381 KRAYNLQDGDLDERPLPVSAVDAVGAASFAEFEAPDQDALDAA 423 (491) T ss_pred HHHhCCCCCCCCccccCcCcccccccccccccCCCCCcchHHH Confidence 999999876655554432211111100 000 00010111 No 120 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.78 E-value=6.6e-19 Score=120.27 Aligned_cols=373 Identities=11% Similarity=0.068 Sum_probs=206.6 Q ss_pred cCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCc Q lcl|NC_019710. 10 LRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDN 89 (424) Q Consensus 10 ~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~ 89 (424) |-..-|..+ .+.+..-.... .......... .-...|.+++.+.++|+.+|..+.+..|++-- ++. 
T Consensus 1 ~~~~D~~~n----~~~gg~~~~~~-------~~~~~~~~~~-~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~~--~~~- 65 (422) T protein:vir:10 1 MVKTDSYAN----IFLGGSDGSEI-------YGSLQNQAPT-ILASLYADNALVRRIIDTIPETALAAGFHIDG--IDD- 65 (422) T ss_pred CccchhhHH----HHcCCCCCccc-------cCcccccCHH-HHHHHHHhChhhHHHHhhhhHHHhcCCccccC--CCH- Confidence 222223333 22221110000 0000111111 11235778999999999999999998888731 111 Q ss_pred cccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee-C---------CCCceeEEEEeccceEEEEE-- Q lcl|NC_019710. 90 RKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR-N---------SAGDVISLLPLQSANMDVKL-- 157 (424) Q Consensus 90 ~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r-~---------~~G~~~~l~~l~p~~v~~~~-- 157 (424) + ......+. .|+ -.+-+...+.+..++|.|++++.- + ..|.+..+.++++.++++.. T Consensus 66 -~-~~~~~~~~-~l~--------~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~ 134 (422) T protein:vir:10 66 -E-PAFWSRWD-DLE--------MTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQTRE 134 (422) T ss_pred -H-HHHHHHHH-Hhh--------HHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccchhcc Confidence 1 11111111 222 344555556666678999888754 3 23567788899988887532 Q ss_pred -c------CCceEEEEEecC--ceEEecHhHeeEecCcC-------CCCccccchHHH-HHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019710. 158 -V------GKKVVYRYQRDS--EYADFSQKEIFHLKGFG-------FTGLVGLSPIAF-ACKSAGVAVAMEDQQRDFFAN 220 (424) Q Consensus 158 -~------~~~~~~~~~~~~--~~~~~~~~evih~r~~~-------~~~~~G~s~~~~-~~~~i~~~~~~~~~~~~~~~n 220 (424) | +....|.+..++ ....|-++.||||.+.. ....+|.|++.. +.+.+.....+......++.. 
T Consensus 135 ~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~ 214 (422) T protein:vir:10 135 ENPRNARFGEPLTYRITTNESDMFYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKR 214 (422) T ss_pred cCccccccCcceEEEEecCCCCcceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 112334443332 23678899999996543 235689999986 678899888888888887766 Q ss_pred cCCCceeEEcCC--C-CCCHHHHHHHHHHHHHHhCCc-ccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019710. 221 GAKSPQILSTGE--K-VLTEQQRSQVEENFKEIAGGP-VKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFG 296 (424) Q Consensus 221 g~~p~~vl~~~~--~-~~~~~~~~~~~~~~~~~~~~~-~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fg 296 (424) .... +++.+. . ...........++++...... +.+.+++..++.+|++++.+..+ +.+.......+||++.+ T Consensus 215 ~~~~--v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~~lsg--l~~~~~~~~~~iaaa~~ 290 (422) T protein:vir:10 215 KQQA--VWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNSDIGG--IDAFLDKKFDRIVALSG 290 (422) T ss_pred hccc--cccchhHHHhcCCccchHHHHHHHHHHHHhcCCccceeEecCCcceEEEecccCC--hHHHHHHHHHHHHhhhC Confidence 6543 333331 1 122222333334444333222 23445555667899999888775 57788899999999999 Q ss_pred CCHHHcCCCCCCCcccccHHHHHHHHHH-------HHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHH- Q lcl|NC_019710. 297 VPPHLVGDVEKSTSWGSGIEQQNLGFLQ-------YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASR- 368 (424) Q Consensus 297 VP~~~l~~~~~~~~~~~n~e~~~~~f~~-------~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~- 368 (424) ||..+|.+...+..+ ++.++..+.||. .-+.|.++.+-+.+- .. 
..+ .|.+++|...+.+++ T Consensus 291 IP~t~L~G~s~~Gln-atgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~~i~----~s---~~~--~~~f~pL~~~sekeka 360 (422) T protein:vir:10 291 IHEIILKNKNVGGVS-SSQNTALETFHKLVDRKRNAELLPILEFLIPFIV----NA---EEW--SVEFNPLAQESSKDKA 360 (422) T ss_pred CCeeeeccCCccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----cc---CCc--EEEeCCCCCCCHHHHH Confidence 999988665554443 234445555555 345565555443331 11 123 444557777776655 Q ss_pred ------HHHHHHHHhCCCcCHHHHHHHhCCCCCCCc-Ceeeecccccchhhcc-ccCCCccC Q lcl|NC_019710. 369 ------AAFMKAMGESGLRTINEMRRTDNLPPLPGG-DVAMRQSQYVPITDLG-TNKEPRNN 422 (424) Q Consensus 369 ------~~~~~~~~~~g~~t~NE~R~~lg~~p~~gg-d~~~~~~n~~~~~~~~-~~~~~~~~ 422 (424) ++.+++++++|+++++|+|+.|--.....| ..-..+......+... ..++|.++ T Consensus 361 ei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 422 (422) T protein:vir:10 361 EILEKNVNSIAALIAAGAMDIDEARDTLRTIAPEVKINDGSVETEVTISETSNDPLEVPTDD 422 (422) T ss_pred HHHHHHHHHHHHHHhcCCCCHHHHHHHhhhhcccccCCCCCCccccchhhcCCCCCCCCCCC Confidence 456778999999999999998843221110 0001111111111111 11233333 No 121 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=99.74 E-value=6.3e-17 Score=109.41 Aligned_cols=389 Identities=11% Similarity=0.046 Sum_probs=224.0 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCc-ccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCce Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGG-RLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPL 79 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~ 79 (424) +.+++-++.-.. +...+....+... .+.. ..++. 
.-.++..-..+..++.+.|.+|++.+..+|.+++| T Consensus 15 ~~~~~~~~~~~i--a~~~~~~~~~~~~~~~~~------~~~iL--r~~~~~~~~y~~m~~D~~i~s~l~~Rk~av~~~~w 84 (491) T protein:vir:10 15 FGEPDKSLSSQI--ATRARSIDFFALGMYLPN------PDPVL--KALGKDIRVYRELRADAHVGGCVRRRKAAVKALEW 84 (491) T ss_pred cccCChHHHHHH--HhhhcccccccccCCccc------hHHHH--HhcCCCHHHHHHHhhChHHHHHHHHHHHHHhCCCc Confidence 333332222110 1111111111000 0000 00000 00111111233456789999999999999999999 Q ss_pred eEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC---ceeEEEEeccceEEEE Q lcl|NC_019710. 80 DVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG---DVISLLPLQSANMDVK 156 (424) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G---~~~~l~~l~p~~v~~~ 156 (424) +|...+.+. +. ...+...|. ++ ...+++..+. +.+++|.+..++++..+| .|..+.++|+.++.+. T Consensus 85 ~i~~~~~~~--~~---~e~v~e~l~-~~----~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~d 153 (491) T protein:vir:10 85 GLDRGKAKS--RV---AKSIADVFA-DL----DLSRIVTEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFVYD 153 (491) T ss_pred EEecCCCCH--HH---HHHHHHHHh-cC----CHHHHHHHHH-HhhhhcceeEEEEEeecCCeeEEEEeeeecccceeec Confidence 996543321 11 123444453 33 3567777765 577899999998875543 4668999999988877 Q ss_pred EcCCceEEEEEecCceEEecHhHeeEecCcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC Q lcl|NC_019710. 157 LVGKKVVYRYQRDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL 235 (424) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~ 235 (424) .++...+...........+++...|+.++.. 
...++|.+.+..+.-....-....++...|...-+.|-.+.+++.+.+ T Consensus 154 ~~~~l~~~~~~~~~~g~~l~~~k~i~~~~~~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~ 233 (491) T protein:vir:10 154 PENQLRFRSKDHWMQGEELPARKFLVPRQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSAS 233 (491) T ss_pred cCCceEEecCCCCCCcceecCCCEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecCCCCC Confidence 6655433222223455778888877777654 455899999999999999999999999999999999988889887655 Q ss_pred CHHHHHHHHHHHHHHhCCcccCcceecCCCceeeecc--CChh-HHHHHHHHHHHHHHHHHHhCCCHHHcCCC--CCCCc Q lcl|NC_019710. 236 TEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIG--VTPQ-DAEMMASRKFQVSELARFFGVPPHLVGDV--EKSTS 310 (424) Q Consensus 236 ~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~--~s~~-d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~--~~~~~ 310 (424) +++ ++.+.+.+.++.+ + ..++++.|++++-+. .+.. -..|.+..++-.++|+.+. ||.+ .+++. T Consensus 234 ~~e-k~~l~~al~~~~~--~--a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i------LGqtlTt~~~g 302 (491) T protein:vir:10 234 DGE-KNLLLDCLEDMVQ--D--AVAVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIAL------LGQNQTTEATS 302 (491) T ss_pred HHH-HHHHHHHHHHHhc--C--cEEEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHH------hhhhcccCccc Confidence 444 5555555655532 2 356666666665543 2222 2237777788888887763 3322 12233 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhh----hccceeeecchhhhccCHHHHHHHHHHHHhCCC-cCHHH Q lcl|NC_019710. 311 WGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKD----VGRIHAEHNLDGLLRGDSASRAAFMKAMGESGL-RTINE 385 (424) Q Consensus 311 ~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~----~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~-~t~NE 385 (424) +++-. +........-+.-.++.|+..||+ |+.+.- ....+.+|.+.... 
.+.+.+++.++++++.|+ ++..+ T Consensus 303 s~a~~-~vh~~v~~di~~~D~~~i~~tln~-li~~l~~~N~~~~~~p~f~~~~~~-e~~~~~a~~~~~L~~~G~~i~~~~ 379 (491) T protein:vir:10 303 TRASA-QAGLEVTDDIRDGDKAVVSEAMNM-LIRWICDLNFDGADRPVFDMWEQE-QVDEIQAGRDQKLTQAGARFTPAY 379 (491) T ss_pred chhHH-HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhcCCCCCcceEEecCcC-chhHHHHHHHHHHHhCCCcCCHHH Confidence 33222 334455666677778888888874 554321 01112344444332 344778999999999986 88999 Q ss_pred HHHHhCCCCCCCcCeeeeccccc--chhhccccCCCcc---CCC Q lcl|NC_019710. 386 MRRTDNLPPLPGGDVAMRQSQYV--PITDLGTNKEPRN---NGA 424 (424) Q Consensus 386 ~R~~lg~~p~~ggd~~~~~~n~~--~~~~~~~~~~~~~---~g~ 424 (424) +|+.+|+|+-+.++......... +....+....+.+ +.+ T Consensus 380 i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 423 (491) T protein:vir:10 380 FKRAYNLQDGDLDERPLPVSAVDTVGAASFAEFEAPDQDALDAA 423 (491) T ss_pred HHHHhCCCCCCcCccccccCCCCCcccccccccCCCCCCchHHH Confidence 99999998655444433211111 1000000000000 000 No 122 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=99.74 E-value=5.4e-17 Score=109.76 Aligned_cols=401 Identities=8% Similarity=-0.022 Sum_probs=227.7 Q ss_pred CCC---------CCcccccCCC----ccHHHHHHhhccCcccccccccccccccc---cccccCCccccHHHHhhhHHHH Q lcl|NC_019710. 1 MEE---------PKYTIDLRTN----NGWWARLKSWFVGGRLVTPNQGSQTGPVS---AHGYLGDSSINDERILQISTVW 64 (424) Q Consensus 1 ~~~---------~~~~~~~~~~----~G~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~v~ 64 (424) |.+ ++.++ .++. .|.++.+...... ...+.-....+. ............+-.++.+.|. 
T Consensus 1 m~~~~d~~g~p~~~~~~-~~~~~~~~~~~~~~~~~~~~~----gltp~~l~~iL~~a~~gd~~~~~~L~~dm~~~D~hi~ 75 (512) T protein:vir:19 1 MGRILDISGQPFDFDDE-MQSRSDELAMVMKRTQEHPSS----GVTPNRAAQMLRDAERGDLTAQADLAFDMEEKDTHLF 75 (512) T ss_pred CcceeCCCCCccccccc-cccccchhcccchhhcccccc----CCCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHH Confidence 211 00111 1111 1222222111100 000000000000 0000000001122234688999 Q ss_pred HHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee---CCCCc Q lcl|NC_019710. 65 RCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR---NSAGD 141 (424) Q Consensus 65 ~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r---~~~G~ 141 (424) +|++.+-.+|.+++|+|....+.. ........-+...|...|+ ..+++..+. +.+++|-+.+++++ ++... T Consensus 76 s~l~~Rk~av~~~~w~I~p~~~~~-~~~~~~a~~v~~~l~~~~~----f~~~~~~ll-dA~~~G~s~~Ei~w~~~~g~~~ 149 (512) T protein:vir:19 76 SELSKRRLAIQALEWRIAPARDAS-AQEKKDADMLNEYLHDAAW----FEDALFDAG-DAILKGYSMQEIEWGWLGKMRV 149 (512) T ss_pred HHHHHHHHHHhCCCceEecCCCCC-HHHHHHHHHHHHHHhcCCC----HHHHHHHHH-hhhhhcceeeeeEeeeeCCcee Confidence 999999999999999996443211 1111111224444544442 445555554 46779999998876 44456 Q ss_pred eeEEEEeccceEEEEEcCCceEEEEEecCceEEecHhHeeEecCc-CCCCccccchHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019710. 142 VISLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFAN 220 (424) Q Consensus 142 ~~~l~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 220 (424) |..+.+.++.++.+..++............+..+++...+..++. ....++|.+.+..+.-....-....++...|... 
T Consensus 150 ~~~~~~r~~~~f~~~~~~~~~lr~~~~~~~G~~l~~~k~i~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~ 229 (512) T protein:vir:19 150 PVALHHRDPALFCANPDNLNELRLRDASYHGLELQPFGWFMHRAKSRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEI 229 (512) T ss_pred eeeeeeeccccceeccCCCcEEEecCCCCCceeecCCceEEEeccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 788999999988877766654433333345667887775555544 4556899999999999999999999999999999 Q ss_pred cCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccC-ChhHHHHHHHHHHHHHHHHHHhCCCH Q lcl|NC_019710. 221 GAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGV-TPQDAEMMASRKFQVSELARFFGVPP 299 (424) Q Consensus 221 g~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~-s~~d~~~~e~~~~~~~~Ia~~fgVP~ 299 (424) -+.|-.+.+++.+.+++ .++.+.+.+.++.+ + ..++++.|++++-+.. +.....|.+..++-.++|+.+. T Consensus 230 yG~P~~igky~~~a~~~-ek~~L~~al~~~~~--~--a~~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~i---- 300 (512) T protein:vir:19 230 YGLPMRVGKYPTGSTNR-EKATLMQAVMDIGR--R--AGGIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAI---- 300 (512) T ss_pred cCCCeeEEecCCCCCHH-HHHHHHHHHHHHhh--C--cEEEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHH---- Confidence 99998888888665544 45555566665532 2 3566677766654443 2233447888888899999872 Q ss_pred HHcCCCC----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhhhc--------cceeeecchhhhccCHHH Q lcl|NC_019710. 300 HLVGDVE----KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG--------RIHAEHNLDGLLRGDSAS 367 (424) Q Consensus 300 ~~l~~~~----~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~--------~~~~~f~~~~~~~~d~~~ 367 (424) ||.+- +++.+++ ..+........-+.-.++.|+..||+.|+.+.-.. ..+..|.++.....|.+. 
T Consensus 301 --LGqtlTs~~g~~Gs~a-~~~vh~ev~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~~p~~~f~~~e~eDl~~ 377 (512) T protein:vir:19 301 --LGGTLTTEAGDKGARS-LGEVHDEVRREIRNADVGQLARSINRDLIYPLLALNSDSTIDINRLPGIVFDTSEAGDITA 377 (512) T ss_pred --hhhhhcccccccchhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccccceEEecCCChhhHHH Confidence 33321 1222222 33455677788888999999999999888653111 012233334445667788 Q ss_pred HHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccchhh-ccc-----cCCCccCCC Q lcl|NC_019710. 368 RAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITD-LGT-----NKEPRNNGA 424 (424) Q Consensus 368 ~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n~~~~~~-~~~-----~~~~~~~g~ 424 (424) .++.+.++...--++..++|+.+|+|.-..++....+....+-.. ... ...+.++.. T Consensus 378 ~a~~~~~l~~G~~i~~~~i~e~~Gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (512) T protein:vir:19 378 LSDAIPKLAAGMRIPVSWIQEKLHIPQPVGDEAVFTIQPVVPDNGSQKEAALSAEDIPQEDDI 440 (512) T ss_pred HHHHHHHHhcCCCCCHHHHHHHhCCCCCCCccccccCCCccccccccccccccccCCCchhhH Confidence 888888877544679999999999975444444332211111000 000 000000000 No 123 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.72 E-value=3.3e-17 Score=110.95 Aligned_cols=414 Identities=11% Similarity=-0.019 Sum_probs=226.1 Q ss_pred cccCCCcc-----HHHHHHhhccCc-ccccccccccccccc-----cccccCCccccHHHHhhhHHHHHHHHHHHHhhhh Q lcl|NC_019710. 8 IDLRTNNG-----WWARLKSWFVGG-RLVTPNQGSQTGPVS-----AHGYLGDSSINDERILQISTVWRCVSLISTLTAC 76 (424) Q Consensus 8 ~~~~~~~G-----~~~~~~~~~~~~-~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~ 76 (424) |++..-+| -...+.++-.+. ........+.....+ ...+......+.+.+..++.+.+||+.+.+.+-. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~~~~nvVG 80 (530) T protein:vir:38 1 MKIPSLVGPDGKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQLHQDHIVG 80 (530) T ss_pred CccceeecCccccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHhhC Confidence 44444444 223333332211 111100000000000 0011122234455667789999999999998887 Q ss_pred CceeEeeccc------cCccccccccchhHHhh---ccCCC------CCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-C Q lcl|NC_019710. 77 LPLDVFETDQ------NDNRKKVDLSNPLARLL---RYSPN------QYMTAQEFREAMTMQLCFYGNAYALVDRNSA-G 140 (424) Q Consensus 77 ~~~~~~~~~~------~~~~~~~~~~~~l~~lL---~~~PN------~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~-G 140 (424) ..|++.-+.+ +++. .......+..++ -..|+ .-+|..++...++..++..|++|+.+.+... | T Consensus 81 ~Gi~~~~~p~~~~l~~~~~~-~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~g 159 (530) T protein:vir:38 81 SFFRLSYRPSWRYLGINEED-SRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWDSDST 159 (530) T ss_pred CCceeeeccchhhcCCCHhH-HHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEeeeccCCC Confidence 7887754311 1111 111112223333 33444 3468889999999999999999999876544 3 Q ss_pred --ceeEEEEeccceEEEE--------------EcCC--ceEEEEE-e--cC----------ceEEecHhHeeEecCcC-C Q lcl|NC_019710. 141 --DVISLLPLQSANMDVK--------------LVGK--KVVYRYQ-R--DS----------EYADFSQKEIFHLKGFG-F 188 (424) Q Consensus 141 --~~~~l~~l~p~~v~~~--------------~~~~--~~~~~~~-~--~~----------~~~~~~~~evih~r~~~-~ 188 (424) .+..|..|+|+.+... .|.. ...|.+. . .+ ....++.++|+|+...- . 
T Consensus 160 ~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~vlH~f~~~r~ 239 (530) T protein:vir:38 160 RLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTYIPRELPGGRPSFIHVFEPMED 239 (530) T ss_pred CccceEEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEEeeccCCCccccccceeeeeeccChhHeEeeccccCC Confidence 3567888888877521 1111 1223222 1 11 12346678999998664 5 Q ss_pred CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC----------HHHHHHHHHHHHHH---hCC-- Q lcl|NC_019710. 189 TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT----------EQQRSQVEENFKEI---AGG-- 253 (424) Q Consensus 189 ~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~----------~~~~~~~~~~~~~~---~~~-- 253 (424) ...+|+|.+..+...+.......+......+-.+.-.++|+.+.+... ++....+....... .+. T Consensus 240 gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (530) T protein:vir:38 240 GQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAP 319 (530) T ss_pred CcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCcccccccccccchhhhhcccccc Confidence 678999999999888877776666665555556666778875443211 11111111111111 110 Q ss_pred --cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCC-CCCCCcccccHHHHHH---------- Q lcl|NC_019710. 254 --PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGD-VEKSTSWGSGIEQQNL---------- 320 (424) Q Consensus 254 --~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~-~~~~~~~~~n~e~~~~---------- 320 (424) -..|.+..|..|.+++.+..+-...+|.+..+...+.||+.+|||-+.|.+ ..+. ||+++..... 
T Consensus 320 ~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~--nYSS~R~~~~e~~r~~~~~q 397 (530) T protein:vir:38 320 VRLGGARVPHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQM--SYSTARASANESWAYFMGRR 397 (530) T ss_pred eeccCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccc--cHHHHHHHHHHHHHHHHHHH Confidence 124568888999999888876556778999999999999999999988844 4344 4444433333 Q ss_pred -HHHHHHHHHHHHH-HHHHHhhhccC-hh-------h-hc-cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHH Q lcl|NC_019710. 321 -GFLQYTLQPYISR-WENSIQRWLIP-AK-------D-VG-RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRR 388 (424) Q Consensus 321 -~f~~~tl~P~~~~-ie~~l~~~L~~-~~-------~-~~-~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~ 388 (424) .|...-+.|+... +++++....++ +. . +. ...+.+-.......|+...++....++++|+.|.-|+-+ T Consensus 398 ~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a 477 (530) T protein:vir:38 398 KFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECA 477 (530) T ss_pred HHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHH Confidence 3333344554443 33333333332 10 0 00 112344444455578999999999999999999999988 Q ss_pred HhCCCCCCC----------cCeeeecc--cccch---hhccccCC--CccCCC Q lcl|NC_019710. 389 TDNLPPLPG----------GDVAMRQS--QYVPI---TDLGTNKE--PRNNGA 424 (424) Q Consensus 389 ~lg~~p~~g----------gd~~~~~~--n~~~~---~~~~~~~~--~~~~g~ 424 (424) +.|..+-+- .+++=++. ..... 
......++ ...+|| T Consensus 478 ~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~d~~~~a 530 (530) T protein:vir:38 478 KRGDDYQEIFAQQVRESMERRAAGLNPPAWAAAAFEAGVKKSNEEEQDGARAA 530 (530) T ss_pred HcCCCHHHHHHHHHHHHHHHHHcCCCCCCCcccccCCCCCCCCCCCCCCCCCC Confidence 888876321 11111111 11000 11111222 222333 No 124 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=99.71 E-value=7.2e-17 Score=109.09 Aligned_cols=405 Identities=11% Similarity=0.021 Sum_probs=226.8 Q ss_pred CCCCC-cccccCCCccH------------HHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHH Q lcl|NC_019710. 1 MEEPK-YTIDLRTNNGW------------WARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCV 67 (424) Q Consensus 1 ~~~~~-~~~~~~~~~G~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i 67 (424) |+.++ -|..+....+- ...+.+....+.. .....++- .+.++..+ .+..++.+.|.+|+ T Consensus 1 m~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~-----~~~~~~iL--r~~~~~~l-y~~m~~D~hi~s~l 72 (448) T protein:vir:79 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVV-----DREFDELL--QGKDGLLV-YHKMLSDGTVKNAL 72 (448) T ss_pred CCCCCCCCccccCcccccccccchhhhhhhhhhccccccccc-----ccchhHhh--ccccchHH-HHHHhhChHHHHHH Confidence 66543 23333333211 1111110000000 00000000 01112222 23456689999999 Q ss_pred HHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCC---CCHHHHHHHHHHHHHHcCCeEEEEeeC--CCCc- Q lcl|NC_019710. 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQY---MTAQEFREAMTMQLCFYGNAYALVDRN--SAGD- 141 (424) Q Consensus 68 ~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~---~s~~~f~~~~~~~~l~~G~a~~~~~r~--~~G~- 141 (424) +.+..+|.+++|.|-..+++....+. ..-+...| ..++.. .++.+++.. +.+.+++|-+++++++. .+|. 
T Consensus 73 ~~Rk~av~~~~w~v~p~~~~~~~~~~--ae~v~~~l-~~~~~~~~~~~f~~~~~~-~lda~~~G~s~~Eivw~~~~~g~~ 148 (448) T protein:vir:79 73 NYIFGRIRSAKWYVEPASTDPEDIAI--AAFIHAQL-GIDDASVGKYPFGRLFAI-YENAYIYGMAAGEIVLTLGADGKL 148 (448) T ss_pred HHHHHHHhcCCceEecCCCCHHHHHH--HHHHHHHh-hhhhhhhccCCHHHHHHH-HHHhhhhcceeEEEEeeecCCCce Confidence 99999999999999644332221111 11122222 233322 233444444 44567899999998864 3554 Q ss_pred -eeEEEEeccce---EEEEEcCCceEEEEEe-------cCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHH Q lcl|NC_019710. 142 -VISLLPLQSAN---MDVKLVGKKVVYRYQR-------DSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAM 210 (424) Q Consensus 142 -~~~l~~l~p~~---v~~~~~~~~~~~~~~~-------~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~ 210 (424) +..+.+.++.+ ..+..+++........ +.....+|..-++|..+.....++|.+.+..+.-....-... T Consensus 149 ~~~~l~~r~~~~~~~f~~~~d~~l~~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~ 228 (448) T protein:vir:79 149 ILDKIVPIHPFNIDEVLYDEEGGPKALKLSGEVKGGSQFVSGLEIPIWKTVVFLHNDDGSFTGQSALRAAVPHWLAKRAL 228 (448) T ss_pred ecccccccCCccccceeeecCCceEEeecCCcccccccCCCccccccceEEEEecCccCCcccchhHHHHHHHHHHHHHH Confidence 44666777763 3444554433322211 112345677888888765555589999999999999999999 Q ss_pred HHHHHHHHhccCCCceeEEcCCCCCC-HHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHH Q lcl|NC_019710. 211 EDQQRDFFANGAKSPQILSTGEKVLT-EQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVS 289 (424) Q Consensus 211 ~~~~~~~~~ng~~p~~vl~~~~~~~~-~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~ 289 (424) .++...|...-+.|--+.+++.+... 
++.++.+.+...+...+.+++ ++++.|++++-+.......++.+..++-.+ T Consensus 229 ~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~--~iiP~~~~ie~~ea~~~~~~~~~~i~~~d~ 306 (448) T protein:vir:79 229 ILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHG--IILPDDWKFDTVDLKSAMPDAIPYLTYHDA 306 (448) T ss_pred HHHHHHHHHHcCCceEEEecCCCCCcCHHHHHHHHHHHHHHhcCCceE--EEecCCceEEEEecCCCcccHHHHHHHHHH Confidence 99999999999999888888866553 555666777777776666554 567777776666544444456677888888 Q ss_pred HHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChh-----hhccceeeecchhhhccC Q lcl|NC_019710. 290 ELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK-----DVGRIHAEHNLDGLLRGD 364 (424) Q Consensus 290 ~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~-----~~~~~~~~f~~~~~~~~d 364 (424) +|+.+.-=. . +.. +.+..+++.............+.-.+++|++.||+.|+.+. +....+.+|.++.....| T Consensus 307 ~Isk~iLGq-t-lTs-~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~li~~l~~lNfg~~~~~P~~~f~~~e~~D 383 (448) T protein:vir:79 307 GIARALGID-F-NTV-QLNMGVQAINIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPSATRFPRLTFEMEERND 383 (448) T ss_pred HHHHHHhhh-h-hcc-ccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCCCcEEEecCCChHH Confidence 888765321 1 111 11112222333333455567778889999999998887653 111122233334445667 Q ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 365 SASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 365 ~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) .+..++.+.+++..+-..-+-+|+.+|+|.-..++....+...-+.. 
...+++.++-- T Consensus 384 l~~~a~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~~~a~~~~~~~~--~~~~~~~~~~~ 441 (448) T protein:vir:79 384 FSAAANLMGMLINAVKDSEDIPTELKALIDALPSKMRRALGVVDEVR--EAVRQPADSRY 441 (448) T ss_pred HHHHHHHhhhhhccchhhHHHHHHhhcCCCCCCCccccccCCCCccc--ccccCCccccc Confidence 88889999999988765555678889998432233332221111100 11122222222 No 125 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.70 E-value=7.7e-17 Score=108.93 Aligned_cols=406 Identities=12% Similarity=0.001 Sum_probs=227.4 Q ss_pred ccHHHHHHhhccCcccccc---------ccccccc-cccccccc------------CCccccHHHHhhhHHHHHHHHHHH Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTP---------NQGSQTG-PVSAHGYL------------GDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~---------~~~~~~~-~~~~~~~~------------~~~~~~~~~~~~~~~v~~~i~~ia 71 (424) ++|++|+.+++........ ......+ ...+.... ....-+.+.+..++.+..+|+.+. T Consensus 1 mn~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 80 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLE 80 (502) T ss_pred CchHhhHHhhcChHHHHHHHhhHHHHhhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 8899999988764221100 0000000 00000000 001123445666899999999888 Q ss_pred HhhhhC-ceeEeec--cccCccccccccchhHHhh-----ccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC---- Q lcl|NC_019710. 72 TLTACL-PLDVFET--DQNDNRKKVDLSNPLARLL-----RYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA---- 139 (424) Q Consensus 72 ~~ia~~-~~~~~~~--~~~~~~~~~~~~~~l~~lL-----~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~---- 139 (424) +.+-.. -+.+.-+ ..+.... ......+..++ +...+.-++.+++...++..++..|++|+.+.+... 
T Consensus 81 ~nvVG~ggi~~~~~~~~~~~~~~-~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~ 159 (502) T protein:vir:79 81 ERVVGKNGIIVEPHPVLRNGAIA-RDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLT 159 (502) T ss_pred HhhccCCceeeeeccCCCChhHH-HHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCccC Confidence 776654 3433211 1111110 01111122222 122344578889999999999999999999865432 Q ss_pred ---CceeEEEEeccceEEE------------EEcCCc--eEEEEEe-------cCceEEecHhHeeEecCcC-CCCcccc Q lcl|NC_019710. 140 ---GDVISLLPLQSANMDV------------KLVGKK--VVYRYQR-------DSEYADFSQKEIFHLKGFG-FTGLVGL 194 (424) Q Consensus 140 ---G~~~~l~~l~p~~v~~------------~~~~~~--~~~~~~~-------~~~~~~~~~~evih~r~~~-~~~~~G~ 194 (424) +.+..|..|+|+++.. +.|..+ ..|.+.. ......+++++|+|+..+. ....+|+ T Consensus 160 ~g~~~~l~lq~iepd~l~~~~~~~~~i~~GVe~d~~Gr~~aY~i~~~hPgd~~~~~~~rvpA~~vlH~f~~~r~gQ~RGi 239 (502) T protein:vir:79 160 PSAGVHFWLEALEPDFIPMTSDESNRLNQGVFVDDWGRPEKYLVYKSRPVSGRQMETKEVDAERMLHLKFVRRLHQMRGT 239 (502) T ss_pred CCcccceEEEEecchhcCCCCCCCCeeEeeeEECCCCceEEEEEeecCCCCCcccceeEechhheEEeecccCCccccCC Confidence 3467899999988752 222222 2233221 2234679999999998654 5568999 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcce-ecCCCceeeeccC Q lcl|NC_019710. 195 SPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLW-ILEAGFSTSAIGV 273 (424) Q Consensus 195 s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~-~l~~g~~~~~l~~ 273 (424) |.+..+...+.......+......+-.+...++|+.+......... .-...-+... .-..|.++ .|..|.+++.+.. 
T Consensus 240 s~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~-~~~~~~~~~~-~l~pG~i~~~L~pGe~i~~~~p 317 (502) T protein:vir:79 240 SLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDG-NGSKENEREL-TIQPGIIYDDLKPGEEIGMVKS 317 (502) T ss_pred chHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccccc-CCCCCccccc-cccCCccccccCCCceeeeeCC Confidence 9999999888877777666666666677778888865332111000 0000000000 01235454 5889999998887 Q ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHH-----------HHHHHHHHHHH-HHHHhhh Q lcl|NC_019710. 274 TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL-----------QYTLQPYISRW-ENSIQRW 341 (424) Q Consensus 274 s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~-----------~~tl~P~~~~i-e~~l~~~ 341 (424) +-....|.+..+...+.||+.+|||-+.|.+-.. .+|+++-.....|+ ..-++|+.+.+ +.++-.. T Consensus 318 ~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s--~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G 395 (502) T protein:vir:79 318 DRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYN--GTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASG 395 (502) T ss_pred CCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC Confidence 6556678999999999999999999988865433 35556554444433 33344433332 2222222 Q ss_pred ccCh--h-hhc-cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC----------cCeeeecccc Q lcl|NC_019710. 342 LIPA--K-DVG-RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG----------GDVAMRQSQY 407 (424) Q Consensus 342 L~~~--~-~~~-~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g----------gd~~~~~~n~ 407 (424) .++- . .+. ...+.+-.......|+...++....++++|+.|.-|+-++.|.+|-+- .+++=++... T Consensus 396 ~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~ 475 (502) T protein:vir:79 396 VIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRAGGRNPDDVKRRRKAEIDENRKLDLVFDT 475 (502) T ss_pred CCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCC Confidence 2221 1 111 123334444555678988999999999999999999988888877421 1111111111 Q ss_pred cc------hhhccccCCCccCCC Q lcl|NC_019710. 
408 VP------ITDLGTNKEPRNNGA 424 (424) Q Consensus 408 ~~------~~~~~~~~~~~~~g~ 424 (424) .| .+...+.+++.++++ T Consensus 476 ~~~~~~~~~~~~~~~~e~~~~~~ 498 (502) T protein:vir:79 476 DPASDKGGSSAATKRQEPQHTDD 498 (502) T ss_pred CCCCCCCCCCCCCCCCCCCCCCC Confidence 11 111122223333333 No 126 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=99.70 E-value=1.6e-16 Score=107.15 Aligned_cols=398 Identities=11% Similarity=0.045 Sum_probs=221.6 Q ss_pred CCCCCccc-------------ccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHH Q lcl|NC_019710. 1 MEEPKYTI-------------DLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCV 67 (424) Q Consensus 1 ~~~~~~~~-------------~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i 67 (424) |+.|+-.= ...+-.|....+-.....+-.. ....++- .+.++..+ .+..++.+.|.+|+ T Consensus 1 m~kk~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-----~~~~~iL--r~~~~~~l-y~~m~~D~hi~s~l 72 (448) T protein:vir:77 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVD-----REFDELL--QGKDGLLV-YHKMLSDGTVKNAL 72 (448) T ss_pred CCCCCCCCcccCCcccccchhhhhhhccchhhhcccccccccc-----cchhHhh--ccccchHH-HHHHhhChHHHHHH Confidence 66554211 0000011111111111000000 0000000 01122222 23456689999999 Q ss_pred HHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCC---CCCHHHHHHHHHHHHHHcCCeEEEEeeC--CCCc- Q lcl|NC_019710. 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQ---YMTAQEFREAMTMQLCFYGNAYALVDRN--SAGD- 141 (424) Q Consensus 68 ~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~---~~s~~~f~~~~~~~~l~~G~a~~~~~r~--~~G~- 141 (424) +.+..+|.+++|+|...+++...++. ..-+...|. .+.. ..++.+++..+ .+.+++|-++.++++. .+|. 
T Consensus 73 ~~Rk~av~~~~w~v~p~~~~~~d~~~--ae~v~~~l~-~~~~~~~~~~f~~~i~~~-lda~~~G~s~~Eivw~~~~dg~~ 148 (448) T protein:vir:77 73 NYIFGRIRSAKWYVEPASTDPEDIAI--AAFIHAQLG-IDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTLGADGKL 148 (448) T ss_pred HHHHHHHhcCCceEecCCCCHHHHHH--HHHHHHHhh-chhhhhccCCHHHHHHHH-HHhhhhcceeEEEEEeecCCCce Confidence 99999999999999644333221111 111222222 2221 23456677666 5788999999998864 3554 Q ss_pred -eeEEEEeccceE---EEEEcCCceEEEEEe-------cCceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHH Q lcl|NC_019710. 142 -VISLLPLQSANM---DVKLVGKKVVYRYQR-------DSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAM 210 (424) Q Consensus 142 -~~~l~~l~p~~v---~~~~~~~~~~~~~~~-------~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~ 210 (424) +..|.+.++.++ .+..+++........ +.....+|...++|.++.....++|.+.+..+.-....-... T Consensus 149 ~~~~l~~r~~~~~~~f~~~~~~~l~~~~~~~~~~~~~~~~~~~~lP~~~~i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~ 228 (448) T protein:vir:77 149 ILDKIVPIHPFNIDEVLYDEEGGPKALKLSGEVKGGSQFVNGLEIPIWKTVVFLHNDDGSFTGQSALRAAVPHWLAKRAL 228 (448) T ss_pred eeccccccCCCccceeeeecCCceEEEecCCcccccccCCCccccccceEEEEecCCcCCcccchHHHHHHHHHHHHHhh Confidence 446667777543 334444333222111 112345678888988765555589999999999988888899 Q ss_pred HHHHHHHHhccCCCceeEEcCCCCC-CHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHH Q lcl|NC_019710. 
211 EDQQRDFFANGAKSPQILSTGEKVL-TEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVS 289 (424) Q Consensus 211 ~~~~~~~~~ng~~p~~vl~~~~~~~-~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~ 289 (424) .++...|.+.-+.|--+.+++.+.+ +++.++.+.+...+...+.+++ ++++.|++++-+..+....++.+..++-.+ T Consensus 229 ~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~--~iiP~g~~ie~~ea~~~~~~~~~~i~~~d~ 306 (448) T protein:vir:77 229 ILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHG--IILPDDWKFDTVDLKSAMPDAIPYLTYHDA 306 (448) T ss_pred HHHHHHHHHHcCCceeEEecCCCCCCCHHHHHHHHHHHHHHhcCCceE--EEecCCceEEEEecCCCccCHHHHHHHHHH Confidence 9999999999999988888876655 3566667777777776665553 567777776655544344456677888888 Q ss_pred HHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChh-----hhc--cceeeecchhhhc Q lcl|NC_019710. 290 ELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK-----DVG--RIHAEHNLDGLLR 362 (424) Q Consensus 290 ~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~-----~~~--~~~~~f~~~~~~~ 362 (424) +|+.+..-. . |.. +.+..+++.............+.-.+++|++.||++|+.+. +.. ..++.|+. ... T Consensus 307 ~Isk~iLGq-t-lTs-~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~Li~~l~~lNfg~~~~~P~~~f~~--~e~ 381 (448) T protein:vir:77 307 GIARALGID-F-NTV-QLNMGVQAVNIGEFVSLTQQTIISLQREFASAVNLYLIPKLVLPNWPGATRFPRLTFEM--EER 381 (448) T ss_pred HHHHHHhcc-c-ccc-ccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCEEEecC--CCh Confidence 898876443 1 111 11122223333333356667778899999999999887653 111 13455544 355 Q ss_pred cCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccchhhccccC-----CCccCCC Q lcl|NC_019710. 363 GDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNK-----EPRNNGA 424 (424) Q Consensus 363 ~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n~~~~~~~~~~~-----~~~~~g~ 424 (424) .|.+..++.+.+++ +-+|+.+|+|.-.++.....+....+.....+.. 
.+.+.-+ T Consensus 382 eDl~~~a~~~~~l~-------~~~~~~~~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (448) T protein:vir:77 382 NDFSAAANLMGMLI-------NAVKDSEDIPTELKALIDALPSKMRRALGVVDEVREAVRQPADSRY 441 (448) T ss_pred hhHHHHHHHhHHHH-------HHHHHHhcCCccCCcCCCCCchhcccccCCCCCCCchhhcchhhHH Confidence 67888888888876 4589999997532222111121111111111100 0111111 No 127 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=99.67 E-value=6.3e-16 Score=103.91 Aligned_cols=409 Identities=11% Similarity=0.065 Sum_probs=220.0 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCcee Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |++...+-+=-|+ .|+...+.......... ....+.-..-+.....+ .+..++.+.|.+|++.+..+|.+++|+ T Consensus 1 ~~~~~~~~~gl~p----~rl~~i~~~~~~~~~~~-~~~~~~~~Lr~~~~~~l-y~~m~~D~hi~s~l~~Rk~av~~~~w~ 74 (488) T protein:vir:95 1 MADITETQESLPP----FRMGEVGSLGLKVKNGR-IYEEPRQALRFPESIKT-FQLMMRDPAVAASVNIIKMFVRKVNWR 74 (488) T ss_pred CCCccccCCCCCH----HHHHHHHHHhhccccch-hhccchhhhcccchHHH-HHHHhhChHHHHHHHHHHHHHhcCCce Confidence 8887766555555 33333332111111100 00000000001112222 344557899999999999999999999 Q ss_pred EeeccccCcccc-ccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC-------------CC--ceeE Q lcl|NC_019710. 81 VFETDQNDNRKK-VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS-------------AG--DVIS 144 (424) Q Consensus 81 ~~~~~~~~~~~~-~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~-------------~G--~~~~ 144 (424) |...+....... .....-+...|.. -..++.+++..+. +.+++|-+.+++++.. +| .+.. 
T Consensus 75 v~p~~~~~~d~~~~~~a~~v~~~l~~---~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~~~~~ 150 (488) T protein:vir:95 75 FVPPKGKEQDPKMLERADFFNSLMDD---MEHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLIGWAK 150 (488) T ss_pred EecCCCCchhHHHHHHHHHHHHHHhc---cCccHHHHHHHHH-HhhcccceeeeeeeeccccccccccccccCCeeeeee Confidence 964432211110 0011122333321 1234556666665 5678999999888743 23 2455 Q ss_pred EEEeccc---eEEEEEcCCceEEEE---------------EecCceEEecHhHeeEecCc-CCCCccccchHHHHHHHHH Q lcl|NC_019710. 145 LLPLQSA---NMDVKLVGKKVVYRY---------------QRDSEYADFSQKEIFHLKGF-GFTGLVGLSPIAFACKSAG 205 (424) Q Consensus 145 l~~l~p~---~v~~~~~~~~~~~~~---------------~~~~~~~~~~~~evih~r~~-~~~~~~G~s~~~~~~~~i~ 205 (424) +.+.|+. +..+..++....... ........+++...|+.++. ....++|.+.+..+.-... T Consensus 151 i~~Rpq~~~~~f~~d~d~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~lP~~kfi~~~~~~~~g~p~g~gLlr~~~w~~~ 230 (488) T protein:vir:95 151 LPIRNQSTLDKWYFDEDFRRVTGVRQNLRNVSHIAGAINLGERPLTRKLPRAKFMLFKYDDEYGNPEGRSPLLNAYVPWK 230 (488) T ss_pred eeecCcccccceeeccCCCceeecccccccccccccccccccccccccccccceEEEeecCCCCccchhhHHHHHHHHHH Confidence 6666664 333444443321110 01122355777776655554 4456899999999998888 Q ss_pred HHHHHHHHHHHHHhccCCCceeEEcCCC----CCCHHHHHHHHH---HHHHHhCCcccCcceecCCCceee--------- Q lcl|NC_019710. 206 VAVAMEDQQRDFFANGAKSPQILSTGEK----VLTEQQRSQVEE---NFKEIAGGPVKKRLWILEAGFSTS--------- 269 (424) Q Consensus 206 ~~~~~~~~~~~~~~ng~~p~~vl~~~~~----~~~~~~~~~~~~---~~~~~~~~~~ag~~~~l~~g~~~~--------- 269 (424) .-....++...|...-+.|--+.+.+.+ ..+++....+++ .......+..+| ++++.|++.. 
T Consensus 231 fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag--~iiP~g~~~~~k~~~~e~~ 308 (488) T protein:vir:95 231 YKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRAG--LIWPRYIDPDTKEDIFEFS 308 (488) T ss_pred HHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchhh--eeeccccccccchhhhhhh Confidence 8888888888888875444444444322 233333222222 223333333344 4455554322 Q ss_pred eccCC-hhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChh-- Q lcl|NC_019710. 270 AIGVT-PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK-- 346 (424) Q Consensus 270 ~l~~s-~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~-- 346 (424) -++.. ..-..|.+..++-.++|+.+.--.- |....++++++ ...+........-+.-.++.|++.||++|+.+. T Consensus 309 l~~~~~~~~~~~~~li~~~d~~Isk~iLGqt--LT~~~~~~Gs~-Al~~vh~ev~~~i~~aDa~~i~~tln~~li~~l~~ 385 (488) T protein:vir:95 309 LVSRQGAKAYDTGSIIDRYSKQIMMAFMSDV--LAMGQSKYGSF-SLADSKTSLLAMSVDILLKQIKNVINRDLVAQTYA 385 (488) T ss_pred ccccccCCchhHHHHHHHHHHHHHHHHhccc--cccccCcchhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22221 2233466777777888887653221 11111112222 234556677778888899999999999887653 Q ss_pred ---hhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCH-----HHHHHHhCCCCCCCcCeeeecccccchhhccccCC Q lcl|NC_019710. 347 ---DVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTI-----NEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKE 418 (424) Q Consensus 347 ---~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~-----NE~R~~lg~~p~~ggd~~~~~~n~~~~~~~~~~~~ 418 (424) .....+.+|.++.....|.+.+++.++++++.|..-+ +.+|+.+|+|+-+.++....+....+-...+.... T Consensus 386 ~Nfg~~~~~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gip~~~~~e~~~~~~~~~~~~~~~~~~~ 465 (488) T protein:vir:95 386 LNMWDDEEHVQITYDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIGLPPADESQPVSEKLSPNSQSRSGDGYK 465 (488) T ss_pred hcCCCCCCccEEEecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCccccccCCCCCCCCCCcccC Confidence 1112234455556667788899999999999998765 56899999997665555544432221111111111 Q ss_pred CccCCC Q lcl|NC_019710. 
419 PRNNGA 424 (424) Q Consensus 419 ~~~~g~ 424 (424) ...+++ T Consensus 466 ~~~~~~ 471 (488) T protein:vir:95 466 TAGEGT 471 (488) T ss_pred CCcccC Confidence 111111 No 128 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.66 E-value=4.2e-16 Score=104.89 Aligned_cols=418 Identities=12% Similarity=-0.012 Sum_probs=228.6 Q ss_pred CCCCCcccccCCCc-cHHHHH-Hhhcc-------Cccccccccccccccccc-------ccccCCccccHHHHhhhHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTNN-GWWARL-KSWFV-------GGRLVTPNQGSQTGPVSA-------HGYLGDSSINDERILQISTVW 64 (424) Q Consensus 1 ~~~~~~~~~~~~~~-G~~~~~-~~~~~-------~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~v~ 64 (424) |...+..|++.-+- |+..+- ....+ +.........+...+... ..+......+...+..++.+. T Consensus 1 ~~r~~~~~~~~dr~i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~ 80 (505) T protein:vir:96 1 MKRAEKKPSLAQRMVNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYAK 80 (505) T ss_pred CCCCccccchhhcccchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHH Confidence 88777777765552 221110 00000 000000000000000000 000011223444566789999 Q ss_pred HHHHHHHHhhhh-CceeEeeccc--cCccc-cc-cccchhHHhhccCCCCC----CCHHHHHHHHHHHHHHcCCeEEEEe Q lcl|NC_019710. 65 RCVSLISTLTAC-LPLDVFETDQ--NDNRK-KV-DLSNPLARLLRYSPNQY----MTAQEFREAMTMQLCFYGNAYALVD 135 (424) Q Consensus 65 ~~i~~ia~~ia~-~~~~~~~~~~--~~~~~-~~-~~~~~l~~lL~~~PN~~----~s~~~f~~~~~~~~l~~G~a~~~~~ 135 (424) .+|+.+.+.+-. ..++..-... ++... +. ..-..+...+..++|.. ++.+++...++..++..|+||+.+. 
T Consensus 81 ~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~ 160 (505) T protein:vir:96 81 RFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVLVREH 160 (505) T ss_pred HHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceEEEEe Confidence 999988777664 5665543321 11111 00 00012233334445543 6688889999999999999999886 Q ss_pred eCCCC-ceeEEEEeccceEEEEE----------------cCCc--eEEEEEe---c----------CceEEecHhHeeEe Q lcl|NC_019710. 136 RNSAG-DVISLLPLQSANMDVKL----------------VGKK--VVYRYQR---D----------SEYADFSQKEIFHL 183 (424) Q Consensus 136 r~~~G-~~~~l~~l~p~~v~~~~----------------~~~~--~~~~~~~---~----------~~~~~~~~~evih~ 183 (424) +...+ .+..|..|+|+.+.... |..+ ..|.+.. + .....+++++|+|+ T Consensus 161 ~~~~~~~~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~hPgd~~~~~~~~~~~~~rvpa~~vlH~ 240 (505) T protein:vir:96 161 RGYPNKWGYALQILECDRLDLNYNADLQNGNRIRMSIELDAWERPVAYHLLVNHPGDNSYCYHYAGQTYERVPADEIIHT 240 (505) T ss_pred ecCCCCcceEEEEechhhcCCCCCcccCCcCeEEeceEECCCCceEEEEEeecCCCccccccccccccccccCHhHhhhh Confidence 64332 46678888888875221 1111 2232211 1 12345899999999 Q ss_pred cCcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec Q lcl|NC_019710. 184 KGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL 262 (424) Q Consensus 184 r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l 262 (424) ..+. 
....+|+|.+..+...+.......+......+=.+.-.++|+.+.....+...+.- ......-..|.+..| T Consensus 241 f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~----~~~~~~l~pG~i~~L 316 (505) T protein:vir:96 241 FVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQPPEDDQ----GEIVEEVEAGTYQLL 316 (505) T ss_pred hcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCcccccc----CccccccCCceeeec Confidence 8654 56689999999998888877776666666666666677888865443322111110 000111134678899 Q ss_pred CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCC-CCCCCcccccHHHHHHH-----------HHHHHHHHH Q lcl|NC_019710. 263 EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGD-VEKSTSWGSGIEQQNLG-----------FLQYTLQPY 330 (424) Q Consensus 263 ~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~-~~~~~~~~~n~e~~~~~-----------f~~~tl~P~ 330 (424) ..|.+++.+..+-...+|.+..+...+.||+.+|||-+.|.+ ..+. ||+++-..... |...-+.|+ T Consensus 317 ~pGe~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~--nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi 394 (505) T protein:vir:96 317 PYGIRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGV--NFSSLRSGELDERDLYKLLQFFVVTELLERV 394 (505) T ss_pred CCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 999999998877556788999999999999999999988843 3333 34444433333 333445554 Q ss_pred HHH-HHHHHhhhccCh--hhhc-cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--------- Q lcl|NC_019710. 331 ISR-WENSIQRWLIPA--KDVG-RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG--------- 397 (424) Q Consensus 331 ~~~-ie~~l~~~L~~~--~~~~-~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g--------- 397 (424) .+. ++..+-...++- .... 
...+.+-.......|+...++....++++|+.|.-|+-++.|.++-+- T Consensus 395 ~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~ 474 (505) T protein:vir:96 395 AGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNRTRSRSSIIRAAGDDPEDVFDEIAWEEQ 474 (505) T ss_pred HHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHH Confidence 444 333333333221 1111 123444444455578999999999999999999999988888876421 Q ss_pred -cCeeeecccccchh-hccccCCCccCCC Q lcl|NC_019710. 398 -GDVAMRQSQYVPIT-DLGTNKEPRNNGA 424 (424) Q Consensus 398 -gd~~~~~~n~~~~~-~~~~~~~~~~~g~ 424 (424) .+++=++.+..+.. .....++..++.+ T Consensus 475 ~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~ 503 (505) T protein:vir:96 475 LMRDKGVNPTPPEQESKDATTDEEDDSAS 503 (505) T ss_pred HHHHcCCCCCCCCCCCCCCCCCCCCCCCC Confidence 11111111111100 1111111111111 No 129 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.63 E-value=2.5e-15 Score=100.65 Aligned_cols=419 Identities=10% Similarity=0.003 Sum_probs=222.8 Q ss_pred CCCCCcccccCCCccHHHH--HHhhccCcccccccccccccccc-------cccccCCccccHHHHhhhHHHHHHHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWAR--LKSWFVGGRLVTPNQGSQTGPVS-------AHGYLGDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~--~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~v~~~i~~ia 71 (424) |.-|--+--.- ..+...+ +.+.-.+...... ....+.+.. ...+......+...+..++.+..||+.+. 
T Consensus 1 ~~~p~~~~~~~-~~~~~~~~~~~~y~~~a~~~~~-~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 78 (533) T protein:vir:34 1 MKTPTIPTLLG-PDGMTSLREYAGYHGGGSGFGG-QLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQ 78 (533) T ss_pred CCCchhhhhhc-ccccchHHHHHhhhhccCCCCC-cccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 22221110000 0111111 1111111100000 000000000 00111122234455667899999999998 Q ss_pred HhhhhCceeEeeccc------cCccccccccchh---HHhhccCCC------CCCCHHHHHHHHHHHHHHcCCeEEEEee Q lcl|NC_019710. 72 TLTACLPLDVFETDQ------NDNRKKVDLSNPL---ARLLRYSPN------QYMTAQEFREAMTMQLCFYGNAYALVDR 136 (424) Q Consensus 72 ~~ia~~~~~~~~~~~------~~~~~~~~~~~~l---~~lL~~~PN------~~~s~~~f~~~~~~~~l~~G~a~~~~~r 136 (424) +.+-...|++.-+-+ +++ ........+ ...+-..|+ .-++.+++...++..++..|++|+.+.+ T Consensus 79 ~nvVG~Gi~~~~~p~~~~lg~~~~-~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~ 157 (533) T protein:vir:34 79 DHIVGSFFRLSHRPSWRYLGIGEE-EARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATW 157 (533) T ss_pred HHhhCCCceeeeccchhhcCCChh-HHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEeee Confidence 888777887653311 111 000111122 223333443 3467889999999999999999998875 Q ss_pred CCC-C--ceeEEEEeccceEEEEE--------------cCC--ceEEEEEe---cC----------ceEEecHhHeeEec Q lcl|NC_019710. 137 NSA-G--DVISLLPLQSANMDVKL--------------VGK--KVVYRYQR---DS----------EYADFSQKEIFHLK 184 (424) Q Consensus 137 ~~~-G--~~~~l~~l~p~~v~~~~--------------~~~--~~~~~~~~---~~----------~~~~~~~~evih~r 184 (424) ... | .+..|..|+|+.+.... |.. ...|.+.. ++ ....++.++|||+. 
T Consensus 158 ~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~VlH~f 237 (533) T protein:vir:34 158 DTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFIHVF 237 (533) T ss_pred ccCCCCccceEEEEechhhcCCCCCCCCCCceEeeeEECCCCCeEEEEEeecCCCCccccccceeeeeeccChhHeeeec Confidence 544 2 35678888888775321 111 12233221 11 12346788999998 Q ss_pred CcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC----------HHHHHHHHHHH---HHH Q lcl|NC_019710. 185 GFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT----------EQQRSQVEENF---KEI 250 (424) Q Consensus 185 ~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~----------~~~~~~~~~~~---~~~ 250 (424) .+- ....+|+|.+..+...+.......+......+-.+.-.++|+.+.+... ++..+.+.... ..+ T Consensus 238 ~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (533) T protein:vir:34 238 EPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAY 317 (533) T ss_pred cccCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhc Confidence 654 5668999999999888887777666666656666667788876533111 11111111111 111 Q ss_pred hCC----cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCC-CCCCCcccccHHHH------- Q lcl|NC_019710. 251 AGG----PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGD-VEKSTSWGSGIEQQ------- 318 (424) Q Consensus 251 ~~~----~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~-~~~~~~~~~n~e~~------- 318 (424) .++ =+.|.+..|..|.+++.+..+-...+|.+..+...+.||+.+|||-+.|.+ ..+. ||+++... 
T Consensus 318 ~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~--nYSS~R~~~~e~~r~ 395 (533) T protein:vir:34 318 YAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQM--SYSTARASANESWAY 395 (533) T ss_pred cCcceeeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccc--cHHHHHHHHHHHHHH Confidence 111 124678889999999988876566788999999999999999999988844 3344 44454322 Q ss_pred ----HHHHHHHHHHHHHHHHH-HHHhhhccC-hh------h--hc-cceeeecchhhhccCHHHHHHHHHHHHhCCCcCH Q lcl|NC_019710. 319 ----NLGFLQYTLQPYISRWE-NSIQRWLIP-AK------D--VG-RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTI 383 (424) Q Consensus 319 ----~~~f~~~tl~P~~~~ie-~~l~~~L~~-~~------~--~~-~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~ 383 (424) ...|...-+.|+.+.+- +.+....++ +. . +. ...+.+-.......|+...++....++++|+.|. T Consensus 396 ~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~ 475 (533) T protein:vir:34 396 FMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTY 475 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCH Confidence 23344444556555433 333222222 11 0 00 1223444445566789999999999999999999 Q ss_pred HHHHHHhCCCCCCC----------cCeeeecccccc--hhhcc---ccCCCccCCC Q lcl|NC_019710. 
384 NEMRRTDNLPPLPG----------GDVAMRQSQYVP--ITDLG---TNKEPRNNGA 424 (424) Q Consensus 384 NE~R~~lg~~p~~g----------gd~~~~~~n~~~--~~~~~---~~~~~~~~g~ 424 (424) -|+-++.|.++-+- .+++=++....+ ....+ +.++++.+++ T Consensus 476 ~~~~a~~G~D~~ev~~q~a~e~~~~~~~gl~~~~~~~~~~~s~~~~~~~~~~~~~~ 531 (533) T protein:vir:34 476 EKECAKRGDDYQEIFAQQVRETMERRAAGLKPPAWAAAAFESGLRQSTEEEKSDSR 531 (533) T ss_pred HHHHHHcCCCHHHHHHHHHHHHHHHHhcCCCCCCCCCcCccCCCCCCCCCCcccCC Confidence 99988888876421 122111111111 11111 1122222222 No 130 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.59 E-value=2.5e-15 Score=100.66 Aligned_cols=405 Identities=11% Similarity=-0.018 Sum_probs=218.1 Q ss_pred ccHHHHHHhhccCcccccc---------ccccccc-ccccc-ccc-----------CCccccHHHHhhhHHHHHHHHHHH Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTP---------NQGSQTG-PVSAH-GYL-----------GDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~---------~~~~~~~-~~~~~-~~~-----------~~~~~~~~~~~~~~~v~~~i~~ia 71 (424) ++||+|+.+.|........ ......+ ...+. ... .....+...+..++.+..||+.+. T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~ 80 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAIQAYEAARPGRTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLE 80 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHhccccccCccccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 8999999998853321100 0000000 00000 000 011133344556899999999887 Q ss_pred Hhhhh---CceeEeeccccCccccccccch---hHHhhccC--CCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC---- Q lcl|NC_019710. 72 TLTAC---LPLDVFETDQNDNRKKVDLSNP---LARLLRYS--PNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA---- 139 (424) Q Consensus 72 ~~ia~---~~~~~~~~~~~~~~~~~~~~~~---l~~lL~~~--PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~---- 139 (424) +.+-. +.++-.-.+.++...+ ..... +...+-.+ ...-++.+++...++..++..|++|+.+.+... 
T Consensus 81 ~nvVG~~G~~i~p~~l~~d~~~a~-~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~ 159 (548) T protein:vir:95 81 ERVVGGSGIGVEPLPLRLDGSVHA-ELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNYT 159 (548) T ss_pred HhccCccccceeeeecCCCHHHHH-HHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeeccccccc Confidence 66554 2222221222211000 00111 12222222 333477889999999999999999998875432 Q ss_pred ---CceeEEEEeccceEEEE-------------EcCC--ceEEEEEe-----------cCceEEecHhHeeEecCcC-CC Q lcl|NC_019710. 140 ---GDVISLLPLQSANMDVK-------------LVGK--KVVYRYQR-----------DSEYADFSQKEIFHLKGFG-FT 189 (424) Q Consensus 140 ---G~~~~l~~l~p~~v~~~-------------~~~~--~~~~~~~~-----------~~~~~~~~~~evih~r~~~-~~ 189 (424) ..+..|..|+|+.+..- .|.. ...|.+.. ......+++++|+|+..+- .. T Consensus 160 ~g~~~~~~lqliepd~l~~~~~~~~~~i~~GIE~D~~Grp~aY~i~~~hPgd~~~~~~~~~~~rvpA~~VlHif~~~r~g 239 (548) T protein:vir:95 160 FATSVPFALELLEPDYLPFSYNNLSKGIVQGIERDTWRRKRAYHLLKDHPGNLQTLGGSLAVKRVEAERIIHIAYRKRIG 239 (548) T ss_pred CCcccceEEEEechhhcCCCCCCCCCceeeeeEECCCCceEEEEEeecCCCcccccccccceeeechhHheecccccCCc Confidence 24678899999887421 1111 12232211 1124679999999998654 45 Q ss_pred CccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcce-ecCCCcee Q lcl|NC_019710. 190 GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLW-ILEAGFST 268 (424) Q Consensus 190 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~-~l~~g~~~ 268 (424) ..+|+|.+..+...+.......+......+=.+...++|+.+........ .....-..... 
-..|.++ .|..|.++ T Consensus 240 Q~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~~~~--~~~~~~~~~~~-~~pG~iv~~L~pGe~i 316 (548) T protein:vir:95 240 QNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDSYTVE--PGKDRKNRTIP-IAPGMVFDDLEPGEDV 316 (548) T ss_pred cccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccCC--CCccccccccc-ccCCccccccCCCcee Confidence 68999999999888877776666666655666666788876533211100 00000000000 1134444 58888898 Q ss_pred eeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHH-----------HHHHHHHHHHH-HH Q lcl|NC_019710. 269 SAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL-----------QYTLQPYISRW-EN 336 (424) Q Consensus 269 ~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~-----------~~tl~P~~~~i-e~ 336 (424) +.+..+-...+|.+..+...+.||+.+|||-+.|..-.. .||+++-.....|+ ..-+.|+...+ +. T Consensus 317 ~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s--~nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~ 394 (548) T protein:vir:95 317 GMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYD--GTYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQM 394 (548) T ss_pred eecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 888766556678999999999999999999988865432 35666554444333 33344433332 22 Q ss_pred HHhhhccC-h-h-hhc-cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC----------cCeee Q lcl|NC_019710. 337 SIQRWLIP-A-K-DVG-RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG----------GDVAM 402 (424) Q Consensus 337 ~l~~~L~~-~-~-~~~-~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g----------gd~~~ 402 (424) ++-...++ + . .+. 
...+++-.-.....|+...++....++++|+.|.-|+-++.|.++-+- -+++= T Consensus 395 a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~~G~D~~ev~~q~a~E~~~~~~~G 474 (548) T protein:vir:95 395 YLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEAEVARARGRDPRELKKSRETEIKANRAAG 474 (548) T ss_pred HHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcC Confidence 22222221 1 1 111 122333333445568888999999999999999999887778776320 11111 Q ss_pred ecccccc----hhhccccCCC---ccCC------------------C Q lcl|NC_019710. 403 RQSQYVP----ITDLGTNKEP---RNNG------------------A 424 (424) Q Consensus 403 ~~~n~~~----~~~~~~~~~~---~~~g------------------~ 424 (424) ++....| .....+..++ +..| | T Consensus 475 L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 521 (548) T protein:vir:95 475 LVFSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEARELVNRYGA 521 (548) T ss_pred CCCCCcccccccccccCCCCchhhhccccccccccchhHHhhccCCC Confidence 1111111 0000111000 0001 0 No 131 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=99.58 E-value=1e-14 Score=97.32 Aligned_cols=370 Identities=12% Similarity=0.033 Sum_probs=213.6 Q ss_pred cccccCCCc--cHHHHHHhhccCccccccccccccccccccc---ccCCc-----cccHHHHhhhHHHHHHHHHHHHhhh Q lcl|NC_019710. 6 YTIDLRTNN--GWWARLKSWFVGGRLVTPNQGSQTGPVSAHG---YLGDS-----SINDERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 6 ~~~~~~~~~--G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~-----~~~~~~~~~~~~v~~~i~~ia~~ia 75 (424) ..|++|+.- -+...+.....+... ..+.+.... ..++. .+..+-..+.+.|++|++.+..+|. 
T Consensus 1 ~~~~~~~~p~~~~~~~~~~~~~~~~~-------~~g~~~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~av~ 73 (446) T protein:vir:98 1 MNMEVRNAPTPAIRRRTIYAMEHLGL-------ATSYLSEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDSIALSVL 73 (446) T ss_pred CcccccCCCchhhhhhhhhccccchh-------hcccCCcchHhhhcCCChHHHHHHHHHHHhcchHHHHHHHHHHHHhh Confidence 456666631 111111111000000 000011011 01111 1222222358999999999999999 Q ss_pred hCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-C--ce-------eEE Q lcl|NC_019710. 76 CLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-G--DV-------ISL 145 (424) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~-G--~~-------~~l 145 (424) +++|+|.-. + ++. ..-+...|.... .++....+.+.+.+|-++.++++... | .| +.+ T Consensus 74 ~~~w~V~p~----~-~~~--a~~v~~~l~~~~------~~~~~~~~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~~~~ 140 (446) T protein:vir:98 74 NKVGPYQHG----D-KRI--KKFIDDQLRNRA------KTWISHCVKSIMTYGFSLSEQIYAHGARDNMPATVLDDIVNY 140 (446) T ss_pred cCCceecCc----c-HHH--HHHHHHHHhhcC------chhHHHHHHHHHhhCceeeeEEEeecccccccchhhcccccc Confidence 999999532 1 111 122455554322 34555557899999999999886432 1 11 122 Q ss_pred EEeccceEEEEEcCCce-------------------EE-------EEEecCceEEecHhHeeEecCcCC-CCccccchHH Q lcl|NC_019710. 146 LPLQSANMDVKLVGKKV-------------------VY-------RYQRDSEYADFSQKEIFHLKGFGF-TGLVGLSPIA 198 (424) Q Consensus 146 ~~l~p~~v~~~~~~~~~-------------------~~-------~~~~~~~~~~~~~~evih~r~~~~-~~~~G~s~~~ 198 (424) .++++.. ....++... .+ .....+....+|.+..+|+++... ..++|.|.+. T Consensus 141 ~~~~~r~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~iP~~kfi~~~~~~~~~~p~G~gLlr 219 (446) T protein:vir:98 141 HPLQVML-IANDNGRIVDGDTVTASQYKSGYWVPLPPYRIGDPPKKVDVVGSHVRLPSHKRLFINYNTKGNNPWGTSCLT 219 (446) T ss_pred cccccee-eeccCCccccccccchhhcccccccCcccchhhhhhhhcccCcccccccccceEEEEecCCCCCccccchHH Confidence 3333221 111111000 00 001122345688889888887654 4589999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHH--------HHHH-HHHHHHHhCCc-ccCcce---ecCCC Q lcl|NC_019710. 
199 FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQ--------RSQV-EENFKEIAGGP-VKKRLW---ILEAG 265 (424) Q Consensus 199 ~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~--------~~~~-~~~~~~~~~~~-~ag~~~---~l~~g 265 (424) .+.-....-....+....|...-+.|--+.+++.+.+.++. .+.. ++..++..... +++.++ .++.| T Consensus 220 ~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~da~~ii~~~~~P~g 299 (446) T protein:vir:98 220 SVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRLSTDSGLVLTQLSKEQP 299 (446) T ss_pred HHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhccccceeeeecccCCCC Confidence 99999999999999999999999999888898766543221 1111 22334433322 233333 34888 Q ss_pred ceeeeccCChh-HHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccC Q lcl|NC_019710. 266 FSTSAIGVTPQ-DAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIP 344 (424) Q Consensus 266 ~~~~~l~~s~~-d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~ 344 (424) ++++-++.... ...|.+..++-.++|+.+.....-.++...+.+.++ +..+........-+.-.+++|++.||++|+. T Consensus 300 ~eie~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~-ala~vh~~V~~d~~~aDa~~i~~tln~~Li~ 378 (446) T protein:vir:98 300 VQVGALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTG-RASEIQLELFDGKINSIFDTVIHAFTEQVIG 378 (446) T ss_pred ceEEeeccccCChhhHHHHHHHHHHHHHHHHhcccccccccccccchh-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88876654222 234788889999999998877644444333322222 2233445566777888999999999998865 Q ss_pred hhhh-c----c-------ceeeecchhhhccCHHHHHHHHHHHHhCCCcCH---HHHHHHhCCCCCCCcCe Q lcl|NC_019710. 345 AKDV-G----R-------IHAEHNLDGLLRGDSASRAAFMKAMGESGLRTI---NEMRRTDNLPPLPGGDV 400 (424) Q Consensus 345 ~~~~-~----~-------~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~---NE~R~~lg~~p~~ggd~ 400 (424) +.=. . . .+++|+.. ...|.+..++.++++++.|..++ +.+|+.+|+|+-+. 
|+ T Consensus 379 ~l~~lNf~~~~~~~~~~~~~~~~~~~--e~eDl~~~a~~~~~L~~~G~~~p~~~~~ire~~giP~~~~-~~ 446 (446) T protein:vir:98 379 NLIRLNFDPALYPLASNTGYITRLPG--RATDLAALVEAIKQMHDMGFLVDGDKDHIRSITGLPDAIS-ST 446 (446) T ss_pred HHHHhCCCccccccccccccceeccC--ChhhHHHHHHHHHHHHhCCccccccHHHHHHHhCcCCCCC-CC Confidence 4310 0 0 12233332 35678888999999999998764 45999999986432 22 No 132 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=99.55 E-value=6.6e-15 Score=98.32 Aligned_cols=386 Identities=13% Similarity=0.090 Sum_probs=188.0 Q ss_pred CCCCCcccccC----------CCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLR----------TNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLI 70 (424) Q Consensus 1 ~~~~~~~~~~~----------~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~i 70 (424) |-++ -++-++ ++.|+.+-+.+.-.++ + .....++.......... ...|.++..+..+|+.+ T Consensus 1 ~~~~-~~~~~~~~~~~~~~~~~rd~l~~~~~glg~~r------~-~~~~~~g~~~~~~~~~l-~~~Yr~~~ia~~iVd~~ 71 (449) T protein:vir:10 1 MTDK-LTLAVNHALNDARMARARMGLMVPTMGLDNKR------H-SAWCEYGFPELVTYENL-YSLYRRGGIAHGAVEKL 71 (449) T ss_pred Cchh-hHHHHhhhcchhHHHHHHHHHHHHHhcCCccc------c-hhhhhcCCcccCCHHHH-HHHHhcCchhHHHHHhh Confidence 6555 222111 2223333222111000 0 00111111111111111 23456678889999999 Q ss_pred HHhhhhCceeEeeccccCc-cccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEE-eeCCC--------- Q lcl|NC_019710. 71 STLTACLPLDVFETDQNDN-RKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV-DRNSA--------- 139 (424) Q Consensus 71 a~~ia~~~~~~~~~~~~~~-~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~-~r~~~--------- 139 (424) ++.+-.--..+....+..+ ..+...+..+.+++.. .-+..+.+..-+ -.++|-|.+++ +++.. 
T Consensus 72 ~d~~~~~~~~i~~g~~~~~~~~~~~~e~~~~~l~~~-----~~~~~l~ea~~~-~rl~Gga~i~i~v~d~~~l~~Pl~~~ 145 (449) T protein:vir:10 72 VGKCWQTNPEIIEGDDADDSEDETSWEKKSKQVFTN-----RLWRSFAEADRR-RLVGRYAGILLHIRDEKDWNLPATKG 145 (449) T ss_pred hhhhhhcCcccccCccccchhhhHHHHHHHHHHHHH-----HHHHHHHHHHHh-hhccCcEEEEEEecCCCCCCcccccC Confidence 9866322122222221111 1111111112222211 002223333333 34567666655 44432 Q ss_pred CceeEEEEeccceEEEE---Ec------CCceEEEEEe-----cCceEEecHhHeeEecCcCCCCccccchHHHHHHHHH Q lcl|NC_019710. 140 GDVISLLPLQSANMDVK---LV------GKKVVYRYQR-----DSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAG 205 (424) Q Consensus 140 G~~~~l~~l~p~~v~~~---~~------~~~~~~~~~~-----~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~ 205 (424) +.+..+.|+....+++. .| +...+|.+.. ......|-++.|+||-..+. .|.|.++.+.+.+. T Consensus 146 ~~i~~i~v~~~~~i~~~~~~~dp~sp~yg~P~~y~v~~~~~g~~~~~~~iH~SRl~~~~~~~~---~g~~~L~~~yn~l~ 222 (449) T protein:vir:10 146 RGLQKVSVSWAGSLKVAEWDTGINSKTYGQPKLWKYTERLPNGSSRRVDIHPDRVFILGDYSE---DAIGFLEPAYNAFV 222 (449) T ss_pred cceeeEEeeccccCChhhhhcCCCCCCCCCceEEEEeeeccCCCccceeeccceeEeecCCCC---CChhHHHHHHHHhh Confidence 24555666654444421 12 1122343331 12335688888988854333 36787887776553 Q ss_pred HHHHH-HHHHHHHHhccCCC-----------ceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccC Q lcl|NC_019710. 206 VAVAM-EDQQRDFFANGAKS-----------PQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGV 273 (424) Q Consensus 206 ~~~~~-~~~~~~~~~ng~~p-----------~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~ 273 (424) ....+ ..+...+++|-.+- .++.... +...++..+.+.+..+.+..+.+ .+.++.+.+|+.++. 
T Consensus 223 ~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~-~~~~e~~~~~~~~~~~~~~~~~~---~~~i~~~~d~~~~~~ 298 (449) T protein:vir:10 223 SLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLY-GVSIDELQDKFNEVAGEINRGND---VLMTTQGATVTPLVT 298 (449) T ss_pred hHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHh-hCCchHHHHHHHHHHHHHhccch---heeecCCcceEEEec Confidence 33222 22223333321111 1111111 11223334444444444444332 455667778999988 Q ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHH------HHHHHHHHHHHHHhhhccChhh Q lcl|NC_019710. 274 TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY------TLQPYISRWENSIQRWLIPAKD 347 (424) Q Consensus 274 s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~------tl~P~~~~ie~~l~~~L~~~~~ 347 (424) ++.++ .+.......+||++-|||..+|-+...+..+. + ++ .+.||.. -+.|.++.+-+.|-+.-+.... T Consensus 299 ~~sgl--~d~l~~~~q~iaaa~~IP~t~L~Gqsp~glns-t-~D-~~nyyd~i~~~Q~~l~p~le~l~~~l~~s~~g~~~ 373 (449) T protein:vir:10 299 SVADP--TATYNVNLQTAAAGVDIPTRILIGNQQAERSS-T-ED-QKYFNARCQSRRVDLSFEIEDFCDKLIELKIIDAV 373 (449) T ss_pred ccCCh--hHHHHHHHHHHHHHhCCCeeeeeccCcccccc-c-hh-HHHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCC Confidence 87755 45677778889999999999997766655542 2 33 3444432 3567777777766544333221 Q ss_pred hccceeeecchhhhccCHHHHHH-------HHHHHHhCC---CcCHHHHHHHhCCCCCCCcCeeeecccccchhhccccC Q lcl|NC_019710. 348 VGRIHAEHNLDGLLRGDSASRAA-------FMKAMGESG---LRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNK 417 (424) Q Consensus 348 ~~~~~~~f~~~~~~~~d~~~~~~-------~~~~~~~~g---~~t~NE~R~~lg~~p~~ggd~~~~~~n~~~~~~~~~~~ 417 (424) ..+.|.+++|...+.+++++ .+++++++| +++.+|+|+.+|++|..+.+ .+.++..+.+ T Consensus 374 ---~d~~i~f~pL~~~t~kEkAei~k~~A~a~~~~~~ag~~~~~~~~EiR~~~~~~~~~~~~--------~~~e~~de~~ 442 (449) T protein:vir:10 374 ---AKKAVIWDDLNEQTGTEKLTNAKTMGEINQTMLGSGDNPAFSREEIRTAAGYDNDDEEP--------LGEEDGDEED 442 (449) T ss_pred ---CceeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHccccCCcCHHHHHHHhcccCCCCCC--------CCCCCCcccc Confidence 24667778999998888865 444666666 89999999999999854321 1112222334 Q ss_pred CCccCCC Q lcl|NC_019710. 
418 EPRNNGA 424 (424) Q Consensus 418 ~~~~~g~ 424 (424) +..+.+| T Consensus 443 ~~~d~~a 449 (449) T protein:vir:10 443 KATDSAA 449 (449) T ss_pred ccCCcCC Confidence 4455555 No 133 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.53 E-value=1e-14 Score=97.26 Aligned_cols=411 Identities=11% Similarity=0.032 Sum_probs=208.9 Q ss_pred cccCCCccHH-------HHHHh-hccCcccccccccc-cccc--cccccccCCccccHHHHhhhHHHHHHHHHHHHhhhh Q lcl|NC_019710. 8 IDLRTNNGWW-------ARLKS-WFVGGRLVTPNQGS-QTGP--VSAHGYLGDSSINDERILQISTVWRCVSLISTLTAC 76 (424) Q Consensus 8 ~~~~~~~G~~-------~~~~~-~~~~~~~~~~~~~~-~~~~--~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~ 76 (424) |++-=. |+. .+... .+.+.......... ..++ .....+......+.+.+..++.+..||+.+.+.+-. T Consensus 1 m~~~~~-~~~a~~~~~~~~~~~~~y~aa~~~~~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~~vVG 79 (495) T protein:vir:10 1 MNMTPS-GYQSLASGLLVPVGASAYEGASGGHRWQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVAAAVG 79 (495) T ss_pred CCcccc-cccccchhhhhHHHhhhhhccccCcccCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcC Confidence 554322 332 11110 01111111000000 0000 000000111223344566689999999999988876 Q ss_pred CceeEeeccccCccccccccchhHHhhccC--CCCCCCHHHHHHHHHHHHHHcCCeEEEEeeC--CCC--ceeEEEEecc Q lcl|NC_019710. 77 LPLDVFETDQNDNRKKVDLSNPLARLLRYS--PNQYMTAQEFREAMTMQLCFYGNAYALVDRN--SAG--DVISLLPLQS 150 (424) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~--PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~--~~G--~~~~l~~l~p 150 (424) ..|+..-+..+....+.. ..+...+..+ ...-++.+++...++..++..|++|+.+... 
.+| .+..|..|+| T Consensus 80 ~Gi~p~~~~~~~~~~~~i--e~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~lqliep 157 (495) T protein:vir:10 80 NGLTPRWRMKEQELRQEL--QELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQLQIIEP 157 (495) T ss_pred CCcccccCCchHHHHHHH--HHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccceEEEEech Confidence 677664333221111110 1122222222 2334788899999999999999999887643 333 4678899998 Q ss_pred ceEEEEE------c-------------CCceEEEEEe-----------cCceEEecHhHeeEecCcCCCCccccchHHHH Q lcl|NC_019710. 151 ANMDVKL------V-------------GKKVVYRYQR-----------DSEYADFSQKEIFHLKGFGFTGLVGLSPIAFA 200 (424) Q Consensus 151 ~~v~~~~------~-------------~~~~~~~~~~-----------~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~ 200 (424) +++..-. + +....|.+.. ......+++++|+|+........+|+|.+..+ T Consensus 158 d~l~~~~~~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~~rvpA~~vlH~f~~r~gQ~RGis~la~i 237 (495) T protein:vir:10 158 DMLASDIPDETLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIGDPVDTVWIKAEHVLHVTVLTVRSDAGAPWFQLL 237 (495) T ss_pred hhcCCCCCCCCCCCCCEEEeceEECCCCceEEEEEeecCCCcccccccccceeeechhheEeccccCCCcccCcchhHHH Confidence 8885211 1 1112232211 11245699999999975445668999866543 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHH-HH-HHHHHHHHHhCCcccCcceecCCCceeeeccCChhHH Q lcl|NC_019710. 201 CKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQ-RS-QVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDA 278 (424) Q Consensus 201 ~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~-~~-~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~ 278 (424) . .+.......+......+-.+...++|+.+........ .. .....-......-+.|.+..|..|.+++.+..+-... 
T Consensus 238 ~-~l~~l~~y~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~ 316 (495) T protein:vir:10 238 L-RLNELDQYEDAELVRKKTAALFAAFIQEATADSTGGPTIGQPKRSKGGKRITGLNPGTLQYLQPGQEVKFSNPADVGT 316 (495) T ss_pred H-HHHHhhHHHHHHHHHHHHhhhheeeeecCCCccccccccCccccccCcccceecCCceeeecCCCCeeeeeCCCCCCC Confidence 3 3443333344433333445556777775433111000 00 0000000000111346788999999999988765566 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHcC-CCCCCCcccccHHHHHHHHHH------------HHHHHHHHH-HHHHHhhhccC Q lcl|NC_019710. 279 EMMASRKFQVSELARFFGVPPHLVG-DVEKSTSWGSGIEQQNLGFLQ------------YTLQPYISR-WENSIQRWLIP 344 (424) Q Consensus 279 ~~~e~~~~~~~~Ia~~fgVP~~~l~-~~~~~~~~~~n~e~~~~~f~~------------~tl~P~~~~-ie~~l~~~L~~ 344 (424) .|.+..+...+.||+.+|||-+.|. ...+. ||+++-.....|+. .-+.|+.+. ++.++-...++ T Consensus 317 ~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~--nYSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~ 394 (495) T protein:vir:10 317 TYEPWLRYQLLSIAKGYGITYEMLTGDLRGV--NYSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVV 394 (495) T ss_pred CHHHHHHHHHHHHHhhcCCCHHHHhcccccc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCC Confidence 7889999999999999999999884 34333 44454433333322 122233332 22222222111 Q ss_pred -h-h-hhcc--ceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC----------cCeeeeccccc- Q lcl|NC_019710. 345 -A-K-DVGR--IHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG----------GDVAMRQSQYV- 408 (424) Q Consensus 345 -~-~-~~~~--~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g----------gd~~~~~~n~~- 408 (424) + + +... ..+++-.......|+...++....++++|+.|+-|+-++.|.++-+- .+++=++...- T Consensus 395 ~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~p 474 (495) T protein:vir:10 395 IPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAERGYDMEELFDMISDANQLIDEYDLRLDSDP 474 (495) T ss_pred CCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHHcCCCCCCCC Confidence 1 1 1111 12333334445578999999999999999999999988888876421 11111111111 Q ss_pred -chhhccccCCCccCCC Q lcl|NC_019710. 
409 -PITDLGTNKEPRNNGA 424 (424) Q Consensus 409 -~~~~~~~~~~~~~~g~ 424 (424) +....+..+++..+.+ T Consensus 475 ~~~~~~~~~~~~~~~~~ 491 (495) T protein:vir:10 475 RYVNGSGAEQKSVMEAA 491 (495) T ss_pred CcCCCccCCCCCCCCCC Confidence 1111111122222222 No 134 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.51 E-value=1.2e-13 Score=91.33 Aligned_cols=419 Identities=11% Similarity=-0.031 Sum_probs=215.4 Q ss_pred CCCC-CcccccCCCccH-HHHHHhhccCcccccccccccc--ccccc-------ccccCCccccHHHHhhhHHHHHHHHH Q lcl|NC_019710. 1 MEEP-KYTIDLRTNNGW-WARLKSWFVGGRLVTPNQGSQT--GPVSA-------HGYLGDSSINDERILQISTVWRCVSL 69 (424) Q Consensus 1 ~~~~-~~~~~~~~~~G~-~~~~~~~~~~~~~~~~~~~~~~--~~~~~-------~~~~~~~~~~~~~~~~~~~v~~~i~~ 69 (424) |-.. .-+..... .|. ..+......+-.-+........ .+... .........+.+.+..++.+..+|+. T Consensus 1 m~~~~~r~~~~~a-~~~~~~~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~~ 79 (553) T protein:vir:63 1 MTKVTVRKLSEVT-SGRPEQSASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVGY 79 (553) T ss_pred Ccchhhhhhcccc-cccchhhhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHH Confidence 1110 00000000 000 0000000000000000000000 00000 00111122344456668999999999 Q ss_pred HHHhhhhCceeEeecccc----Cccccc--cccch---hHHhhccCCC------CCCCHHHHHHHHHHHHHHcCCeEEEE Q lcl|NC_019710. 70 ISTLTACLPLDVFETDQN----DNRKKV--DLSNP---LARLLRYSPN------QYMTAQEFREAMTMQLCFYGNAYALV 134 (424) Q Consensus 70 ia~~ia~~~~~~~~~~~~----~~~~~~--~~~~~---l~~lL~~~PN------~~~s~~~f~~~~~~~~l~~G~a~~~~ 134 (424) +.+.+-...|++.-+.+. |...+. ..... 
+...+-..++ .-++.+++...++..++..|++|+.+ T Consensus 80 ~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~ 159 (553) T protein:vir:63 80 QRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLATA 159 (553) T ss_pred HHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEEe Confidence 888877778877433110 111111 01111 2233333443 44678899999999999999999987 Q ss_pred eeCCC-C--ceeEEEEeccceEEEEEc--------------CCc--eEEEEEe---cC---------------ceEEecH Q lcl|NC_019710. 135 DRNSA-G--DVISLLPLQSANMDVKLV--------------GKK--VVYRYQR---DS---------------EYADFSQ 177 (424) Q Consensus 135 ~r~~~-G--~~~~l~~l~p~~v~~~~~--------------~~~--~~~~~~~---~~---------------~~~~~~~ 177 (424) .+... | .+..|..|+|+++....+ ..+ ..|.+.. +. ....++. T Consensus 160 ~~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~~~~~~~r~~~~~~v~a 239 (553) T protein:vir:63 160 EWDRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMAPDMYKWKFVQQSKPWGR 239 (553) T ss_pred eeccCCCCcccceEEEechhhcCCCCCCCCCCeeEeeeEECCCCceEEEEeeccCCCccccccccccceeeeccccccCh Confidence 65433 2 356788888888753221 111 2232211 10 1235789 Q ss_pred hHeeEecCcC-CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHH----------- Q lcl|NC_019710. 178 KEIFHLKGFG-FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEE----------- 245 (424) Q Consensus 178 ~evih~r~~~-~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~----------- 245 (424) ++|||+..+- ....+|+|.+..+...+.......+......+=.+.-.++|+.+.+ .+...+.+.. 
T Consensus 240 ~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~--~~~~~~~~~~~~~~~~~~~~~ 317 (553) T protein:vir:63 240 RQVIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELP--PEFIHSQMSGGSPNADMVGIF 317 (553) T ss_pred hHheecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCC--hhhhhhhcccccccccccccc Confidence 9999998654 5668999999999888887776666666555556666788876532 1111111110 Q ss_pred -----HHHHHhCC-----cccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCC-CCCCCccccc Q lcl|NC_019710. 246 -----NFKEIAGG-----PVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGD-VEKSTSWGSG 314 (424) Q Consensus 246 -----~~~~~~~~-----~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~-~~~~~~~~~n 314 (424) .......+ -+.|.+..|..|.+++.+..+-...+|.+..+...+.||+.+|||-+.|.+ ..+. ||++ T Consensus 318 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~--nYSS 395 (553) T protein:vir:63 318 GKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKA--NYSS 395 (553) T ss_pred cccccccccccccccceeecCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccc--cHHH Confidence 00001111 124678888999999888876556778999999999999999999987743 3343 4445 Q ss_pred HHHHHH-----------HHHHHHHHHHHHHHHH-HHhhhccC-hhh-hc-----------cceeeecchhhhccCHHHHH Q lcl|NC_019710. 315 IEQQNL-----------GFLQYTLQPYISRWEN-SIQRWLIP-AKD-VG-----------RIHAEHNLDGLLRGDSASRA 369 (424) Q Consensus 315 ~e~~~~-----------~f~~~tl~P~~~~ie~-~l~~~L~~-~~~-~~-----------~~~~~f~~~~~~~~d~~~~~ 369 (424) +-.... .|...-+.|+.+.+-+ ++-...++ +.. .. 
...+++-.......|+...+ T Consensus 396 ~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~ 475 (553) T protein:vir:63 396 IQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKET 475 (553) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHH Confidence 433222 3444445555444332 22222211 110 00 01123333344556888899 Q ss_pred HHHHHHHhCCCcCHHHHHHHhCCCCCCC----------cCeeeecccccchhhcc---------ccCCCccCCC Q lcl|NC_019710. 370 AFMKAMGESGLRTINEMRRTDNLPPLPG----------GDVAMRQSQYVPITDLG---------TNKEPRNNGA 424 (424) Q Consensus 370 ~~~~~~~~~g~~t~NE~R~~lg~~p~~g----------gd~~~~~~n~~~~~~~~---------~~~~~~~~g~ 424 (424) +....++++|+.|.-|+-++.|..|-+- .+++=++....+-...+ ......++++ T Consensus 476 ~A~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 549 (553) T protein:vir:63 476 QAAVMRIDAGLSTYEREIARLGGDFRKSFAQRAREDALLKKYGLTFNLSAKRSLGDGRDAATGIAEDPAAAQTS 549 (553) T ss_pred HHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCCCccccCCCcccCCCCCCCCCCCCcc Confidence 9999999999999999988888876321 11111111111100000 0001111111 No 135 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=99.46 E-value=2.2e-13 Score=90.00 Aligned_cols=406 Identities=11% Similarity=0.027 Sum_probs=205.9 Q ss_pred CCCCCcccccCCCccHHHHHHhhccC---------------cccccccccc------cccccc--cccccCC----cccc Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVG---------------GRLVTPNQGS------QTGPVS--AHGYLGD----SSIN 53 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~---------------~~~~~~~~~~------~~~~~~--~~~~~~~----~~~~ 53 (424) -+|| .|-+|- ++|-++-+-+.... .+..+|.... +.+... ...|..+ -+.. 
T Consensus 38 ~~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~ 115 (695) T protein:vir:78 38 AAQP-VPADMG-RRGALNALDAAPVAEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPT 115 (695) T ss_pred cccc-cchhhc-ccccccccccccccCCCcccccceeceeccccCCccccchhhhhhcccccccccchhhhccCcchHHH Confidence 2222 111111 12333322221110 0111111100 000000 0001100 1111 Q ss_pred HHHHhhhHHHHHHHHHHHHhhhhCceeEeecccc----------CccccccccchhHHhhccCCCCCCCHHHHHHHHHHH Q lcl|NC_019710. 54 DERILQISTVWRCVSLISTLTACLPLDVFETDQN----------DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQ 123 (424) Q Consensus 54 ~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~----------~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~ 123 (424) -....++|.+++|+..|++.+..-=+.+.....+ ++... ..+.....+|...-....-+..|.+.+.+. T Consensus 116 la~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~-~~d~dqi~~L~~e~erL~V~~~l~eaik~a 194 (695) T protein:vir:78 116 LVLLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAAS-TSDGDQLKQINDEIERLRIRDAVRTTVIHD 194 (695) T ss_pred HHHHhhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 2334567889999999999887652222111100 00000 001112223332222222233444444444 Q ss_pred HHHcCCeEEEEeeCC-----------------CCceeEEEEeccceEEEEEcC---------CceEEEEEecCceEEecH Q lcl|NC_019710. 124 LCFYGNAYALVDRNS-----------------AGDVISLLPLQSANMDVKLVG---------KKVVYRYQRDSEYADFSQ 177 (424) Q Consensus 124 ~l~~G~a~~~~~r~~-----------------~G~~~~l~~l~p~~v~~~~~~---------~~~~~~~~~~~~~~~~~~ 177 (424) . ++|-+.+++.-++ .|....|.+++|.++++...+ +...| |...+ ..|-. 
T Consensus 195 R-lfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~-y~V~G--~kIH~ 270 (695) T protein:vir:78 195 Q-AFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPST-WWMIG--TEVHA 270 (695) T ss_pred c-cccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhhccchhhccCCCce-EEEec--eEEee Confidence 4 5666555443322 245666888898888764321 01111 22222 24566 Q ss_pred hHeeEecCcCC-------CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC--CCCCCHHHHH--HHHHH Q lcl|NC_019710. 178 KEIFHLKGFGF-------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG--EKVLTEQQRS--QVEEN 246 (424) Q Consensus 178 ~evih~r~~~~-------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~--~~~~~~~~~~--~~~~~ 246 (424) +.++.|...+. ..+.|+|..+.+.+.+............+...-.. .++ +.+ ..+......+ ..-+. T Consensus 271 SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~~v-~~l-k~dla~~L~~g~~~~l~~R~el 348 (695) T protein:vir:78 271 TRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSV-SGI-LMDLAQALMPGANVDLSMRAEL 348 (695) T ss_pred eeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhh-HHH-HHHHHHhhcChhHHHHHHHHHH Confidence 66666654332 13579999999999998888887777777654332 222 111 1112222212 12233 Q ss_pred HHHHhCCcccCcceecC-CCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHH-- Q lcl|NC_019710. 247 FKEIAGGPVKKRLWILE-AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL-- 323 (424) Q Consensus 247 ~~~~~~~~~ag~~~~l~-~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~-- 323 (424) ++.+. +|. 
++++++ ...+|++.+.+...+ .+........||.+-+||...|-+......| ++.|...+.|| T Consensus 349 i~~~R--sn~-G~~llDk~~Eefeq~stslSGL--ddVi~qf~q~VAgaa~IPltkLfGqSPkGlN-ATGE~D~rnYYD~ 422 (695) T protein:vir:78 349 INRYR--DNR-NILFLDKATEEFFQFNTPLSGL--DALQAQAQEQMSAVSHIPLIKLLGITPTGLN-ASSEGEIRVWYDY 422 (695) T ss_pred HHHhc--Ccc-ceEEEecCCcceEEEecccCCH--HHHHHHHHHHHHhhhcCchhhhhccCCcccc-ccchhhHHHHHHH Confidence 34443 333 478888 478999998776554 4667777889999999999988776654442 13333344444 Q ss_pred -----HHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHH-------HHHHHhCCCcCHHHHHHHhC Q lcl|NC_019710. 324 -----QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAF-------MKAMGESGLRTINEMRRTDN 391 (424) Q Consensus 324 -----~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~-------~~~~~~~g~~t~NE~R~~lg 391 (424) +.-|.|.++.+-+.+-+..|...+ ..+.|.+++|...+.++++++ ...++..|+++++|+|+++. T Consensus 423 I~s~Qe~~L~p~L~rl~~ii~rS~~G~id---pdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~ 499 (695) T protein:vir:78 423 VRAYQRNALQQLMNDVIVMIQLSLFGAVD---PSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLN 499 (695) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCC---CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHh Confidence 446889999988888777776543 245667788888887776654 55788999999999999987 Q ss_pred CCCC-------CCcCeeeecccc-cchh-----hccccCCC-ccCCC Q lcl|NC_019710. 392 LPPL-------PGGDVAMRQSQY-VPIT-----DLGTNKEP-RNNGA 424 (424) Q Consensus 392 ~~p~-------~ggd~~~~~~n~-~~~~-----~~~~~~~~-~~~g~ 424 (424) -+|- +-.|++-++... ++.. ..++.++. 
..+++ T Consensus 500 ~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 546 (695) T protein:vir:78 500 TEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGA 546 (695) T ss_pred cCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCCC Confidence 7642 334555444432 1111 11111111 11111 No 136 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=99.46 E-value=2.8e-13 Score=89.41 Aligned_cols=289 Identities=14% Similarity=0.028 Sum_probs=170.5 Q ss_pred EEEEeeCCC-C--ceeEEEEeccceEE---EEEcCCceEEEEE--ecCceEEecHhHeeEecCcC-CCCccccchHHHHH Q lcl|NC_019710. 131 YALVDRNSA-G--DVISLLPLQSANMD---VKLVGKKVVYRYQ--RDSEYADFSQKEIFHLKGFG-FTGLVGLSPIAFAC 201 (424) Q Consensus 131 ~~~~~r~~~-G--~~~~l~~l~p~~v~---~~~~~~~~~~~~~--~~~~~~~~~~~evih~r~~~-~~~~~G~s~~~~~~ 201 (424) +.++++... | .|..|.+.|+.++. +..+++....... .+.....+++...|+.++.. ...++|.+.+..+. T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~~~~~~~f~~~~~~~l~~~~~~~~~g~~~~~lp~~kfi~~~~~~~~g~p~G~gLlr~~~ 80 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRPPRTISRFDVAPDGGLVAIEQWGVFGKATVRIPVDRLVVFVNEREGANWLGQSLLRQAY 80 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecCccceeeeeeccCCceeEEEecCCCCCCcceeccCCEEEEEeCCCCCCccchhhHHHHH Confidence 666665443 3 46678888887554 3333333332222 23345678888877776654 45589999999999 Q ss_pred HHHHHHHHHHHHHHHHHhcc--CCCceeEEcCCCCCCH----------HHHHHHHHHHHHHhCCcccCcceecCCCceee Q lcl|NC_019710. 
202 KSAGVAVAMEDQQRDFFANG--AKSPQILSTGEKVLTE----------QQRSQVEENFKEIAGGPVKKRLWILEAGFSTS 269 (424) Q Consensus 202 ~~i~~~~~~~~~~~~~~~ng--~~p~~vl~~~~~~~~~----------~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~ 269 (424) -....-....++...|...- +.|-++.........+ +.++.+....+....+..+ .++++.|++++ T Consensus 81 w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a--~~iip~g~~ie 158 (355) T protein:vir:78 81 KNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEAA--GGYIPHGANFT 158 (355) T ss_pred HHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcce--eEeecCCceEE Confidence 99999999999999999976 4444444433332222 1223333344444444443 56778888877 Q ss_pred eccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChhh-- Q lcl|NC_019710. 270 AIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAKD-- 347 (424) Q Consensus 270 ~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~-- 347 (424) -+..+....++.+..++-.++|+.++.-. .+-....+++.++ ...+.........+.-.+..|++.||++|+.+.- T Consensus 159 ~~ea~g~~~~~~~~i~~~d~~Isk~iLGq-tlTs~~~~~gGS~-Alg~vh~~v~~~~~~aD~~~i~~~ln~~li~~l~~l 236 (355) T protein:vir:78 159 LTGVQGKLPEMDGPIRYHDEQIARAVLAH-FLTLGGDKSTGSY-ALGDTFASFFTGSLNAVMKHIADVTQQHVVEDLVDQ 236 (355) T ss_pred EeecCCCcccHHHHHHHHHHHHHHHHhhh-hhccccCCccchh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 76655555667788888899998887544 2211111111222 2344556777888888899999999988876421 Q ss_pred ---hccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHH-----HHHHHhCCCCCCCcCeeeecccc-cchhhccccCC Q lcl|NC_019710. 348 ---VGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTIN-----EMRRTDNLPPLPGGDVAMRQSQY-VPITDLGTNKE 418 (424) Q Consensus 348 ---~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~N-----E~R~~lg~~p~~ggd~~~~~~n~-~~~~~~~~~~~ 418 (424) ....+.+|.++... .+.+.+++.+++++..|+.-++ .+|+.+|+|.-+.+++...+... .+......... 
T Consensus 237 N~~~~~~~P~~~~~~~~-~~~~~~a~~~~~l~~~G~~~~~~~~~~~~~e~~gip~p~~~~~~~~~~~~~~~~~~~~~~~~ 315 (355) T protein:vir:78 237 NWGPEEPAPRLVPAQLG-KEQPVTAEAIRALVECGAFTADPELEKDLRARYGLPAPAERDDGADAAAAKAAGRRRAKRLP 315 (355) T ss_pred cCCCCCCCCEEEecCcC-hhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCCcccCCccccccccccccccC Confidence 11122233333333 3456789999999999987764 47999999865555544433221 11111111111 Q ss_pred CccCCC Q lcl|NC_019710. 419 PRNNGA 424 (424) Q Consensus 419 ~~~~g~ 424 (424) +...++ T Consensus 316 ~~~~~~ 321 (355) T protein:vir:78 316 GQRQGA 321 (355) T ss_pred Cccccc Confidence 111111 No 137 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=99.45 E-value=2.1e-13 Score=90.03 Aligned_cols=407 Identities=11% Similarity=0.033 Sum_probs=204.2 Q ss_pred CCCCCcccccCCCccHHHHHHhhcc---------------Ccccccccccc------cccccc--cccccCC----cccc Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFV---------------GGRLVTPNQGS------QTGPVS--AHGYLGD----SSIN 53 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~---------------~~~~~~~~~~~------~~~~~~--~~~~~~~----~~~~ 53 (424) -+|| .|-+|- ++|-++-+-+... ..+..+|.... +.+... ...|..+ -+.. T Consensus 38 ~~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~ 115 (698) T protein:vir:10 38 AAQP-VPADMG-RRGALNALDAAPVAEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPT 115 (698) T ss_pred cccc-cchhhc-ccccccccccccccCCCccccccccceeccccCCccccchhhhhhcccccccccchhhhccCcchHHH Confidence 1222 111111 1222222221111 11111111110 000000 0001100 1111 Q ss_pred HHHHhhhHHHHHHHHHHHHhhhhCceeEeecccc----------CccccccccchhHHhhccCCCCCCCHHHHHHHHHHH Q lcl|NC_019710. 54 DERILQISTVWRCVSLISTLTACLPLDVFETDQN----------DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQ 123 (424) Q Consensus 54 ~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~----------~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~ 123 (424) -....++|.+++|+..|++.+..-=+.+.....+ ++... 
..+.....+|...-....-+..+.+.+.+. T Consensus 116 la~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~-~~d~dqi~~L~~e~erl~V~~~l~eai~~a 194 (698) T protein:vir:10 116 LVLLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAAS-TSDGDQLKQINDEIERLRIRDAVRTTVIHD 194 (698) T ss_pred HHHHhhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 2345567889999999999887652222111100 00000 001112223332222222233444444444 Q ss_pred HHHcCCeEEEEeeCC-----------------CCceeEEEEeccceEEEEEcC--C-------ceEEEEEecCceEEecH Q lcl|NC_019710. 124 LCFYGNAYALVDRNS-----------------AGDVISLLPLQSANMDVKLVG--K-------KVVYRYQRDSEYADFSQ 177 (424) Q Consensus 124 ~l~~G~a~~~~~r~~-----------------~G~~~~l~~l~p~~v~~~~~~--~-------~~~~~~~~~~~~~~~~~ 177 (424) . ++|=+.+++.-++ .|....|.+|+|.+|++...+ + ...| |...+ ..|-. T Consensus 195 R-lfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGL~ViDp~~vtP~~~n~~dP~spdfgkP~~-y~V~G--~~IH~ 270 (698) T protein:vir:10 195 Q-AFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPST-WWMIG--SEVHA 270 (698) T ss_pred c-cccceEEEEEeecCccccccccccccccccCccceeeeeecccccccchhhhccchhhccCCCce-EEEec--ceecc Confidence 4 4555544443211 245566888898888764321 1 1112 22222 24566 Q ss_pred hHeeEecCcCC-------CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCC-CCHHHHHHH--HHHH Q lcl|NC_019710. 178 KEIFHLKGFGF-------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV-LTEQQRSQV--EENF 247 (424) Q Consensus 178 ~evih~r~~~~-------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~-~~~~~~~~~--~~~~ 247 (424) +.++.|...+. ..+.|+|..+.+.+.+............+...-.. .++.+--... ......+.. 
-+.+ T Consensus 271 SRL~~~vg~pvpd~LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~~-~~l~~dla~aL~~g~~~~l~~R~eli 349 (698) T protein:vir:10 271 TRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSV-SGILMDLAQALTPGANVDLSMRAELI 349 (698) T ss_pred eeEEEecCCCchhhhcchhccCCccHHHHHHHHHHHHHHHhhhHHHHHHHhhH-HHHHHHHHHhcCChhhHHHHHHHHHH Confidence 66666654332 13469999999999999888887777776654222 2221111111 112211211 1333 Q ss_pred HHHhCCcccCcceecC-CCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHH--- Q lcl|NC_019710. 248 KEIAGGPVKKRLWILE-AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL--- 323 (424) Q Consensus 248 ~~~~~~~~ag~~~~l~-~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~--- 323 (424) +.+. +|. ++++++ ...+|++.+.+...+ .+........||.+-+||...|-+......| ++.|...+.|| T Consensus 350 ~~~R--sn~-G~~llDk~~Eefeq~st~lSGL--ddVi~qf~q~VAgaa~IPltkLfGqSPkGlN-ATGE~D~rnYYD~I 423 (698) T protein:vir:10 350 NRYR--DNR-NILFLDKATEEFFQFNTPLSGL--DALQAQAQEQMSAVSHIPLIKLLGITPTGLN-ASSEGEIRVWYDYV 423 (698) T ss_pred HHhc--Ccc-ceEEEecCCcceEEEecCcCCH--HHHHHHHHHHHHhhhcCchhhhhccCCcccC-ccchhhHHHHHHHH Confidence 3333 333 477888 578999998777554 5667778889999999999988776654442 13343444454 Q ss_pred ----HHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHH-------HHHHHhCCCcCHHHHHHHhCC Q lcl|NC_019710. 
324 ----QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAF-------MKAMGESGLRTINEMRRTDNL 392 (424) Q Consensus 324 ----~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~-------~~~~~~~g~~t~NE~R~~lg~ 392 (424) +.-|.|.++.+-+.+-+..|...+ ..+.|.+++|...+.++++++ .+.++..|+++++|+|++|.- T Consensus 424 ~s~Qe~~L~p~L~rl~~ii~rS~~G~id---p~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~ 500 (698) T protein:vir:10 424 RAYQRNALQQLMNDVIVMIQLSLFGAVD---PSIKWQWNALRELDDLEVAEARYKQAQSDVLYVQEQVIRPDQVAARLNT 500 (698) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCC---CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhc Confidence 446889999998888777776543 246667788888887777654 456788999999999999876 Q ss_pred CCC-------CCcCeeeecc-cccchh--------hccccCCCcc-CCC Q lcl|NC_019710. 393 PPL-------PGGDVAMRQS-QYVPIT--------DLGTNKEPRN-NGA 424 (424) Q Consensus 393 ~p~-------~ggd~~~~~~-n~~~~~--------~~~~~~~~~~-~g~ 424 (424) +|- +--|++..|. |.++.. ..++...++. +|| T Consensus 501 d~~s~Y~~~~d~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 549 (698) T protein:vir:10 501 EPDGPYAGKLDANDDPGAPADDDIDGVLTYVQRMAEGGDTGAPTAPGGA 549 (698) T ss_pred cCCCccccccCCcccCCCCCCCcchHHHhhhcCCcCCCCcccccccccc Confidence 542 1123433332 222221 1111122222 222 No 138 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=99.43 E-value=4.3e-13 Score=88.40 Aligned_cols=408 Identities=10% Similarity=0.009 Sum_probs=206.1 Q ss_pred CCC-CCcccccCCC-ccHHHHHHhhc---------------cCcccccccc------ccccccccc--ccccCC----cc Q lcl|NC_019710. 1 MEE-PKYTIDLRTN-NGWWARLKSWF---------------VGGRLVTPNQ------GSQTGPVSA--HGYLGD----SS 51 (424) Q Consensus 1 ~~~-~~~~~~~~~~-~G~~~~~~~~~---------------~~~~~~~~~~------~~~~~~~~~--~~~~~~----~~ 51 (424) +++ .-.||.+-.. +|-++-+.... ...+..+|.. ....+.... 
..|..+ -+ T Consensus 33 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy 112 (694) T protein:vir:10 33 IATAAAQPVPADFARRGALNALDAAPVAEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGF 112 (694) T ss_pred hhhcCCCcccCCccccccchhhcccccCCCCcchhhhhhccccccCCCccccchhhhhhccCcccccchhhhhccCcchH Confidence 000 0112333222 23332222110 0001111110 001000000 001100 01 Q ss_pred ccHHHHhhhHHHHHHHHHHHHhhhhCceeEeecccc----------CccccccccchhHHhhccCCCCCCCHHHHHHHHH Q lcl|NC_019710. 52 INDERILQISTVWRCVSLISTLTACLPLDVFETDQN----------DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMT 121 (424) Q Consensus 52 ~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~----------~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~ 121 (424) ..-....++|.+++|+..|++.+..-=+.+.....+ ++... ..+.....+|...-....-+..|.+.+. T Consensus 113 ~~la~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~-~~d~dqi~~L~~e~erl~V~~~l~eaik 191 (694) T protein:vir:10 113 PTLVLLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAAS-TSDGDQLKQINDEIERLRIRDAVRTTVI 191 (694) T ss_pred HHHHHHhhccchhhHHHHHHHHhhcccceeccccchhhhhhcccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 112335567889999999999887652222111000 00000 0011122233322222222334444444 Q ss_pred HHHHHcCCeEEEEeeCC-----------------CCceeEEEEeccceEEEEEcC---------CceEEEEEecCceEEe Q lcl|NC_019710. 122 MQLCFYGNAYALVDRNS-----------------AGDVISLLPLQSANMDVKLVG---------KKVVYRYQRDSEYADF 175 (424) Q Consensus 122 ~~~l~~G~a~~~~~r~~-----------------~G~~~~l~~l~p~~v~~~~~~---------~~~~~~~~~~~~~~~~ 175 (424) +.. 
++|-+.+++.-++ .|....|.+++|.++++...+ +...| |...+ ..| T Consensus 192 ~aR-lfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~-y~V~G--~~I 267 (694) T protein:vir:10 192 HDQ-AFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPST-WWMIG--TEV 267 (694) T ss_pred hhc-cccceEEEEEeecCccccccccccccccccCcceeeeEeecccccccchhhhccchhhccCCCce-EEEec--eEE Confidence 444 5666655443222 245666888898888764321 01111 22222 245 Q ss_pred cHhHeeEecCcCC-------CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC--CCCCCHHHHH--HHH Q lcl|NC_019710. 176 SQKEIFHLKGFGF-------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG--EKVLTEQQRS--QVE 244 (424) Q Consensus 176 ~~~evih~r~~~~-------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~--~~~~~~~~~~--~~~ 244 (424) -.+.++.|...+. ..+.|+|..+.+.+.+............+...-.. .++ +.+ ..+......+ ..- T Consensus 268 H~SRL~~f~g~plPd~LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~v-~~l-k~dla~~L~~g~~~~l~~R~ 345 (694) T protein:vir:10 268 HATRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSV-SGI-LMDLAQALMPGANVDLSMRA 345 (694) T ss_pred eeeeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhh-HHH-HHHHHHhhcChhHHHHHHHH Confidence 6666666654332 13579999999999998888887777777654322 222 111 1111222212 122 Q ss_pred HHHHHHhCCcccCcceecC-CCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHH Q lcl|NC_019710. 245 ENFKEIAGGPVKKRLWILE-AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL 323 (424) Q Consensus 245 ~~~~~~~~~~~ag~~~~l~-~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~ 323 (424) +.++.+. +|. 
++++++ ...+|++.+.+...+ .+........||.+-+||...|-+......| ++.|...+.|| T Consensus 346 eli~~~R--sn~-G~~llDk~~Eefeq~stslSGL--ddVi~qf~q~VAgaa~IPltkLfGqSPkGlN-ATGE~D~rnYY 419 (694) T protein:vir:10 346 ELINRYR--DNR-NILFLDKATEEFFQFNTPLSGL--DALQAQAQEQMSAVSHIPLIKLLGITPTGLN-ASSEGEIRVWY 419 (694) T ss_pred HHHHHhc--Ccc-ceEEEecCCcceEEEecccCCH--HHHHHHHHHHHHhhhcCchhhhhccCccccc-ccchhhHHHHH Confidence 3334443 333 478888 578999998776554 4667777889999999999988776654442 13333344444 Q ss_pred -------HHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHH-------HHHHHhCCCcCHHHHHHH Q lcl|NC_019710. 324 -------QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAF-------MKAMGESGLRTINEMRRT 389 (424) Q Consensus 324 -------~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~-------~~~~~~~g~~t~NE~R~~ 389 (424) +.-|.|.++.+-+.+-+..|...+ ..+.|.+++|...+.++++++ ...++..|+++++|+|++ T Consensus 420 D~I~s~Qe~~L~p~L~rl~~ii~rS~~G~id---p~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~r 496 (694) T protein:vir:10 420 DYVRAYQRNALQQLMNDVIVMIQLSLFGAVD---PSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAAR 496 (694) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC---CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHH Confidence 446889999988888777776543 245667778888887776654 567889999999999999 Q ss_pred hCCCCC-------CCcCeeeecccc-cchh-----hccccC-CCccCCC Q lcl|NC_019710. 390 DNLPPL-------PGGDVAMRQSQY-VPIT-----DLGTNK-EPRNNGA 424 (424) Q Consensus 390 lg~~p~-------~ggd~~~~~~n~-~~~~-----~~~~~~-~~~~~g~ 424 (424) +.-+|- +-.|++-++... ++.. 
..++.+ .+..+++ T Consensus 497 L~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 545 (694) T protein:vir:10 497 LNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGA 545 (694) T ss_pred HhcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCcc Confidence 877642 334555444432 1111 111111 1111111 No 139 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=99.42 E-value=7.1e-13 Score=87.18 Aligned_cols=406 Identities=11% Similarity=0.021 Sum_probs=204.3 Q ss_pred CCCCCcccccCCCccHHHHHHhhc---------------cCccccccccc------ccccccc--cccccCC----cccc Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWF---------------VGGRLVTPNQG------SQTGPVS--AHGYLGD----SSIN 53 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~---------------~~~~~~~~~~~------~~~~~~~--~~~~~~~----~~~~ 53 (424) -+|| .|-++ .++|-++-+-+.. ...+..+|... .+.+... ...|..+ -+.. T Consensus 38 ~~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~ 115 (695) T protein:vir:36 38 AAQP-VPADF-ARRGALNALDAAPVVEPSPSLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPT 115 (695) T ss_pred cccc-cchhh-hhcccccccccccccCCCcccccceeceecccccCccccchhhhhhcccccccccchhhhccCcchHHH Confidence 1222 11111 1123332222111 00011111100 0000000 0001100 1111 Q ss_pred HHHHhhhHHHHHHHHHHHHhhhhCceeEeecccc----------CccccccccchhHHhhccCCCCCCCHHHHHHHHHHH Q lcl|NC_019710. 54 DERILQISTVWRCVSLISTLTACLPLDVFETDQN----------DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQ 123 (424) Q Consensus 54 ~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~----------~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~ 123 (424) -....++|.+++|+..|++.+..-=+.+.....+ ++... ..+......|...-....-+..|.+. 
+.+ T Consensus 116 la~laQ~~eyr~~~~~ia~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~-~~d~dqik~L~~e~erL~V~~~l~ea-ik~ 193 (695) T protein:vir:36 116 LVLLAQLPEYRAMHEVLADECIRTWGEAIGGTKEKADTSGLAAGGNAAS-TSDGDQLKQINDEIERLRIRDAVRTT-VIH 193 (695) T ss_pred HHHHhhccchhhHHHHHHHHhhcccceecccchhhhhhccccccccccc-cCchHHHHHHHHHHHHHHHHHHHHHH-HHh Confidence 2334567889999999999887652222111000 00000 00001122232222122223334444 444 Q ss_pred HHHcCCeEEEEeeCC-----------------CCceeEEEEeccceEEEEEcC---------CceEEEEEecCceEEecH Q lcl|NC_019710. 124 LCFYGNAYALVDRNS-----------------AGDVISLLPLQSANMDVKLVG---------KKVVYRYQRDSEYADFSQ 177 (424) Q Consensus 124 ~l~~G~a~~~~~r~~-----------------~G~~~~l~~l~p~~v~~~~~~---------~~~~~~~~~~~~~~~~~~ 177 (424) --++|-+.+++.-++ .|....|.+++|.++++...+ +...| |...+ ..|-. T Consensus 194 aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~-y~V~G--~kIH~ 270 (695) T protein:vir:36 194 DQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPST-WWMIG--TEVHA 270 (695) T ss_pred hccccceEEEEEeccCccccccccccccccccCcceeeeEeecccccccchhhhccchhhccCCCce-EEEec--eEEee Confidence 445666655443322 245666888898888764321 11112 22222 24566 Q ss_pred hHeeEecCcCC-------CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC--CCCCCHHHHH--HHHHH Q lcl|NC_019710. 178 KEIFHLKGFGF-------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG--EKVLTEQQRS--QVEEN 246 (424) Q Consensus 178 ~evih~r~~~~-------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~--~~~~~~~~~~--~~~~~ 246 (424) +.++.|...+. ..+.|+|..+.+.+.+............+...-.. .++ +.+ ..+......+ ..-+. 
T Consensus 271 SRL~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~~v-~~l-k~dla~aL~~g~~~~l~~R~el 348 (695) T protein:vir:36 271 TRLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSV-SGI-LMDLAQALMPGANVDLSMRAEL 348 (695) T ss_pred eeEEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhH-HHH-HHHHHHhhcChhHHHHHHHHHH Confidence 66666654332 13569999999999988888877777777654222 222 111 1111222212 12233 Q ss_pred HHHHhCCcccCcceecC-CCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHH-- Q lcl|NC_019710. 247 FKEIAGGPVKKRLWILE-AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFL-- 323 (424) Q Consensus 247 ~~~~~~~~~ag~~~~l~-~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~-- 323 (424) ++.+. +|. ++++++ ...+|++.+.+...+ .+........||.+-+||...|-+......| ++.|...+.|| T Consensus 349 i~~~R--sn~-G~~llDk~~Eefeq~stslSGL--ddVi~qf~q~VAgaa~IPltkLfGqSPkGlN-ATGE~D~rnYYD~ 422 (695) T protein:vir:36 349 INRYR--DNR-NILFLDKATEEFFQFNTPLSGL--DALQAQAQEQMSAVSHIPLIKLLGITPTGLN-ASSEGEIRVWYDY 422 (695) T ss_pred HHHhc--Ccc-ceEEEecCCcceEEEecccCCH--HHHHHHHHHHHHhhhcCchhhhhccCccccc-ccchhhHHHHHHH Confidence 34443 333 478888 478999998776554 4667777889999999999988776654442 13333344444 Q ss_pred -----HHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHH-------HHHHHhCCCcCHHHHHHHhC Q lcl|NC_019710. 324 -----QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAF-------MKAMGESGLRTINEMRRTDN 391 (424) Q Consensus 324 -----~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~-------~~~~~~~g~~t~NE~R~~lg 391 (424) +.-|.|.++.+-+.+-+..|...+ ..+.|.+++|...+.++++++ ...++..|+++++|+|+++. 
T Consensus 423 I~s~Qe~~L~p~L~rl~~ii~rS~~G~id---pdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~ 499 (695) T protein:vir:36 423 VRAYQRNALQQLMNDVIVMIQLSLFGAVD---PSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLN 499 (695) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCC---CcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHh Confidence 446889999988888777776543 245667778888887776654 55788999999999999987 Q ss_pred CCCC-------CCcCeeeecccc-cchh-----hccccC-CCccCCC Q lcl|NC_019710. 392 LPPL-------PGGDVAMRQSQY-VPIT-----DLGTNK-EPRNNGA 424 (424) Q Consensus 392 ~~p~-------~ggd~~~~~~n~-~~~~-----~~~~~~-~~~~~g~ 424 (424) -+|- +-.|++-++... ++.. ..++.+ .+..+++ T Consensus 500 ~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 546 (695) T protein:vir:36 500 TEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGA 546 (695) T ss_pred cCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCcc Confidence 7642 334555444432 1111 111111 1111112 No 140 >protein:vir:106491 Length: 646 # NCBI annotation: Pas4 # Family: family:all:2798 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024790;genbank:gi:48697405;genbank:GeneID:2846148 Probab=99.28 E-value=5.3e-12 Score=82.39 Aligned_cols=407 Identities=11% Similarity=0.025 Sum_probs=216.7 Q ss_pred CCC---CCcccccCCCccHHHHHHhhccCcc--cccccccccccccccccccCC-ccc---cHHHHhhhHHHHHHHHHHH Q lcl|NC_019710. 1 MEE---PKYTIDLRTNNGWWARLKSWFVGGR--LVTPNQGSQTGPVSAHGYLGD-SSI---NDERILQISTVWRCVSLIS 71 (424) Q Consensus 1 ~~~---~~~~~~~~~~~G~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~-~~~---~~~~~~~~~~v~~~i~~ia 71 (424) |+- |--|-.+.+..+=..|- +.+.+ ...+...... .+.++ ... 
.-+-|-..|.+.-.+..|+ T Consensus 1 ~~~~rPk~~p~~p~~~~~arrr~---LtaAsa~l~~~~~~~~k------t~~~~~~~WQ~eAW~~~d~vpELry~vgW~~ 71 (646) T protein:vir:10 1 MALLKPKSAPPEPFGAEVARRIA---LAGATAQVDLGASSSWK------TWKFGNKDWQTEGWRLYDIIPEHHFLAGRIG 71 (646) T ss_pred CcccCCCCCCCCcccccccchhh---hhhccccccCCCcceee------cCCCcchhhhHHHHHHHhhhhhHhhHhhhhh Confidence 221 11122222111111110 00000 0000000000 00011 000 0112333578888999999 Q ss_pred HhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe---eCCCCceeEEEEe Q lcl|NC_019710. 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD---RNSAGDVISLLPL 148 (424) Q Consensus 72 ~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~---r~~~G~~~~l~~l 148 (424) ++++++.+..-+-++.|...-...++....+....--.-.-..++++.+..++-+-|++|++.. ....+.-..++++ T Consensus 72 ~a~SR~rL~aseiddtG~~tg~v~~~~v~~iv~~~~Gg~~gQ~qlLkr~~~~ltV~GE~wiv~~~~~~~~~~~~~~W~vv 151 (646) T protein:vir:10 72 DSVAQARLYVTEVDDTGEETGEVQDERIKRLAAVPLGTGSQRDDNLRLAGLDLAVGGECWIVGEGAATSPEAAEGSWFVV 151 (646) T ss_pred hhhceeeeeeeeecCCCCCcCccchHHHHHHhhhhccchhhHHHHHHHHHhheecccceEEeeccccCCCCCCccceeee Confidence 9999999988777766654444444555555443333333456899999999999999999741 1111222245555 Q ss_pred ccceEEEEEcCCceEEEEEe---cCceEEecHhHeeEecC--cCC-CCccccchHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019710. 149 QSANMDVKLVGKKVVYRYQR---DSEYADFSQKEIFHLKG--FGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGA 222 (424) Q Consensus 149 ~p~~v~~~~~~~~~~~~~~~---~~~~~~~~~~evih~r~--~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~ 222 (424) -...|.. .++.....-.. +...+-++..+++ ||- +++ .....-||+.+++..+.-..-..+...+..+.-. 
T Consensus 152 t~~Ev~~--tg~~~~i~~p~~~~g~~~v~~~~~d~l-vRiW~P~Prr~~epDSpvra~l~~l~Ei~~lt~~I~aaakSRL 228 (646) T protein:vir:10 152 TGSAISR--TGDEIAVRRPQQRGGSKLVLVDGQDIL-IRCWRPHPNDTDQADSFTRSAIVPLREIELLTKREFAELDSRL 228 (646) T ss_pred cHHHhcc--CCCeeeeecCccCCCCCcceecCCceE-EEEecCCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHHHHH Confidence 5555522 23322222111 3334445666663 352 322 3456789999999998888888777777777766 Q ss_pred CCceeEEcCCCC------CCHHHHHHHHHHH----HHHhCCcc---cCcceecCC-Cc------eeeeccC-ChhHHHHH Q lcl|NC_019710. 223 KSPQILSTGEKV------LTEQQRSQVEENF----KEIAGGPV---KKRLWILEA-GF------STSAIGV-TPQDAEMM 281 (424) Q Consensus 223 ~p~~vl~~~~~~------~~~~~~~~~~~~~----~~~~~~~~---ag~~~~l~~-g~------~~~~l~~-s~~d~~~~ 281 (424) .-.||+-++... .++.....+...+ .......+ +-=++++.. |. +++.+.. +..+.--+ T Consensus 229 ~GnGvLfvP~e~s~p~~~~~~a~~~~l~~~l~qaa~tAi~De~S~aA~vPiia~~P~E~i~~~~~ik~l~f~~eite~ai 308 (646) T protein:vir:10 229 TGAGIMFLPEGVDFPRGEEDPAGLAGFMAYLQRAAAASMADQSRASAMVPIMATIPNEMMEHLDKIKPLTFWSELSAEIT 308 (646) T ss_pred hcCceeeeccccccCCCCCCCcchhHHHHHHHHHHHhhhcCCCCccceeeeEEeeChHHHhhhhcceeeccCchhhHHHh Confidence 667777555432 1212222333333 22222221 111222221 11 2333333 22233357 Q ss_pred HHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChh-------hhccceee Q lcl|NC_019710. 282 ASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK-------DVGRIHAE 354 (424) Q Consensus 282 e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~-------~~~~~~~~ 354 (424) ++++-.+.++|....|||+.|-+.+++|-. +.-+....-+. -|.|.+..|++.|++..|.+- +...|-+. 
T Consensus 309 ktR~daI~RlA~glDIppE~LLGlgd~NHW--tAWqI~de~vr-HI~P~l~~ic~AlT~~~Lrp~Le~eGi~dp~kyvvW 385 (646) T protein:vir:10 309 PMKDKAIARLASSAEIPGEVLTGIGDANHW--TAWLISDEGIR-WIRGYLGLIADALTRGFLRRALESMGVTNPERYAFA 385 (646) T ss_pred hhHHHHHHHHHhccCCchhheeecccccee--eeeeeccccch-hhhhHHHHHHHHHHhhHHHHHHHHcCCCChhHeEEe Confidence 889999999999999999998777765542 22233334444 599999999999999988652 11246789 Q ss_pred ecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe---------------------eee------cc-- Q lcl|NC_019710. 355 HNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDV---------------------AMR------QS-- 405 (424) Q Consensus 355 f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~---------------------~~~------~~-- 405 (424) ||.+.|.... ++.+-+..+.+.|.+|-...|+.+|+.-.++=+. +.+ |. T Consensus 386 ~DaS~Lt~~p--d~~deA~qa~drGAIt~eAlrk~~Gf~~dd~pt~~E~~~~~~~~~v~~~P~Lil~P~~qa~~~~P~~~ 463 (646) T protein:vir:10 386 FDTSTLASKP--NRLDEAIQLHERNLIKDEEVVKAGAFSVDQMPTVQERAVQILLGLVKTQPDLILDPAIQAALGLPAVQ 463 (646) T ss_pred ecCcccccCC--CCcHHHHHHHHcCCccHHHHHHHhcccccccCChHHHHHHHHHHHhcCCccccccchhhccccCCCcC Confidence 9999886542 3445566788999999999999999963221111 000 00 Q ss_pred --cccc--hhhc-cccCCCccCCC Q lcl|NC_019710. 406 --QYVP--ITDL-GTNKEPRNNGA 424 (424) Q Consensus 406 --n~~~--~~~~-~~~~~~~~~g~ 424 (424) .+.| ++.. ++.++++++|+ T Consensus 464 ~~~lpp~~~~~~dg~~~~~e~~g~ 487 (646) T protein:vir:10 464 SVGLPPTAAQRTDGDLDDDESEGA 487 (646) T ss_pred ccccCCcccccccCCCCChhhcCC Confidence 1111 1111 11223344444 No 141 >protein:vir:102426 Length: 631 # NCBI annotation: gp11 # Family: family:all:2798 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655288;genbank:gi:109521851;genbank:GeneID:4157741 Probab=99.19 E-value=4.8e-12 Score=82.63 Aligned_cols=405 Identities=12% Similarity=0.079 Sum_probs=220.6 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCccccccc------ccccccccccccccCCccc-------cHHHHhhhHHHHHHH Q lcl|NC_019710. 
1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPN------QGSQTGPVSAHGYLGDSSI-------NDERILQISTVWRCV 67 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~v~~~i 67 (424) |+-+. ++++ .. +.++..++... ... +.+--.+.+..+.+- .-+-|-..+.+.-.+ T Consensus 1 ~~a~~-~lr~------~r----rpkg~~~a~~r~L~aAs~~~-~dpg~~~~~~~g~~~~~~WQ~eAW~~~d~v~Elry~v 68 (631) T protein:vir:10 1 MAATQ-SLRL------VR----RPKGGRPAPSRALTAASQPL-PDPSQVFSKSTGISRNSDWQTDAWEAVDLVGELRYYV 68 (631) T ss_pred CCccc-ceee------ee----cCCCCCccchhhhhhhhccc-cchhhhhhhhcCCcccchhhHHHHHHHHhhhhHHHHh Confidence 44332 1211 11 12222211110 000 011001111111110 111233358888899 Q ss_pred HHHHHhhhhCceeEeecccc-----Cccccccccc-hhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCCCC Q lcl|NC_019710. 68 SLISTLTACLPLDVFETDQN-----DNRKKVDLSN-PLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNSAG 140 (424) Q Consensus 68 ~~ia~~ia~~~~~~~~~~~~-----~~~~~~~~~~-~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~-r~~~G 140 (424) ..|+++++++.+..-+-+.+ |..++...++ ....+...=+..-+...++++.++.++-+-|++|+.+. +..+| T Consensus 69 gW~~~s~sr~rL~as~idpDtg~ptg~iee~~~~~~~v~~~~~~i~gG~lgQ~~llkrl~~~ltV~GE~wiv~l~~p~~~ 148 (631) T protein:vir:10 69 GWRASSCSRCRLVASELDENTGLPTGGISEDNTEGERVREIVSKIADGTLGQAALTKRVVECLTVPGELWIVILTRPVKG 148 (631) T ss_pred hhhhhhhceeeeEeeeeccCCCCCccccccCCchhHHHHHHHHhcCCCcchHHHHHHHHHhheecccceEEEEEeccCcC Confidence 99999999999988777766 3333222222 23333333456667889999999999999999999874 33321 Q ss_pred -------c---eeEEEEeccceEEEEEcCCceEEEEEecCceEEecHhHeeEec--CcCC-CCccccchHHHHHHHHHHH Q lcl|NC_019710. 141 -------D---VISLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLK--GFGF-TGLVGLSPIAFACKSAGVA 207 (424) Q Consensus 141 -------~---~~~l~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r--~~~~-~~~~G~s~~~~~~~~i~~~ 207 (424) . ..+++.+....|+....+.+..+....+....-...-+ +.|| .+++ .....-||+.+++..+.-. 
T Consensus 149 ~~~~pd~~~r~~~~W~~vt~~ei~~~~~g~g~~v~lp~g~~h~~~~~~D-~l~RiW~P~prr~~e~dSpvra~l~~l~Ei 227 (631) T protein:vir:10 149 APAQPDGSVRTRQEWYAVSKEEIKKSNKGSGTNIVLPTGEEHEFVKGTD-IIFRVWIPKPRKASEPDSPVRAVLDSIREI 227 (631) T ss_pred CCCCcccccccccceeeccHHHHhcccCcccceeecCCCCccceecCCc-eEEEeeCCCcccccCCcchhHHHHHHHHHH Confidence 1 33566666666665555554444444433322223334 3344 2333 3456789999999988888 Q ss_pred HHHHHHHHHHHhccCCCceeEEcCCCCCC--------------------HHHHHHHHHH-HHHH---hCCcc--cCc-ce Q lcl|NC_019710. 208 VAMEDQQRDFFANGAKSPQILSTGEKVLT--------------------EQQRSQVEEN-FKEI---AGGPV--KKR-LW 260 (424) Q Consensus 208 ~~~~~~~~~~~~ng~~p~~vl~~~~~~~~--------------------~~~~~~~~~~-~~~~---~~~~~--ag~-~~ 260 (424) .-..+...+..+.-..-.||+-++...+= +-+...+.+. ++.. ....+ +-- ++ T Consensus 228 ~~~t~~i~aaakSRl~gnGvlflP~els~P~~~~~~~~~~g~~v~~~~g~pa~~~l~~~l~q~a~tai~De~S~aA~vPi 307 (631) T protein:vir:10 228 VRTTKTIANASKSRLIGNGVLFVPHEMSLPAAQGPVSEVEGEEIAPLVGEPAVQQLTDMLFQVAETAVEDEDSQAAFIPV 307 (631) T ss_pred HHhhhHHHHHHHHHHhhCceeEeccccccCCCCCCCCCcCCccCCccccchhHHHHHHHHHHHHhhhhcCCCCccceeee Confidence 87777777776666666677755433221 1122333222 2222 11111 111 22 Q ss_pred ecC------CCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-CCCcccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 261 ILE------AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE-KSTSWGSGIEQQNLGFLQYTLQPYISR 333 (424) Q Consensus 261 ~l~------~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~-~~~~~~~n~e~~~~~f~~~tl~P~~~~ 333 (424) ++. ++++.-.+.....+ --+++++-.+.++|....|||+.|-+.+ ++|-. ++-+....-++--|.|.+.. 
T Consensus 308 i~~~p~E~i~~i~hlkf~~ei~e-~aiktR~daI~RlA~glDi~pE~LLGlGsd~NHW--sAWqI~dedVrlHI~P~l~l 384 (631) T protein:vir:10 308 IAGVPGEQIKDVKHIRFDNEITE-VAIKTRNDAIARLAMGLDVSPERLLGLGSQTNHW--SAWQISDEDVQLHIAPVMEI 384 (631) T ss_pred eEeechHHhcCeeEEeecCchhH-HHHhhHHHHHHHHHhccCCchhhheeccCCccce--EEEEecccceeeecchHHHH Confidence 221 12333334433333 3578999999999999999998886653 54432 22333345566679999999 Q ss_pred HHHHHhhhccChh------hhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC-------- Q lcl|NC_019710. 334 WENSIQRWLIPAK------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGD-------- 399 (424) Q Consensus 334 ie~~l~~~L~~~~------~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd-------- 399 (424) |++.|++..|.+- +-..|-+.||.+.|.... ++.+-+..+.+.|.+|-...|+.+|+.-..+-| T Consensus 385 ic~AlT~q~Lrp~Le~eGvDp~kYvvW~DaS~Lt~dP--dr~deA~qa~drGAIt~eAlrk~lGf~eDd~yd~~t~e~~~ 462 (631) T protein:vir:10 385 FCQALTDQILRVTLAREGIDPSKYVVWYDPSQLTIDP--DKSDEAKFAYENGAINGEALRKYLGLGDDAGYDFTTREGWV 462 (631) T ss_pred HHHHHHhhHHHHHHHHhCCCHHHhEeeecCcccccCC--CCcHHHHHHHHcCCcCHHHHHHHhcCchhcccCcCchHHHH Confidence 9999999988652 122467899999886542 344556678889999999999999997533322 Q ss_pred ----------eeeecccccch-----hh-----------cc----ccCCCccCCC Q lcl|NC_019710. 400 ----------VAMRQSQYVPI-----TD-----------LG----TNKEPRNNGA 424 (424) Q Consensus 400 ----------~~~~~~n~~~~-----~~-----------~~----~~~~~~~~g~ 424 (424) .-++| ++.|+ .+ .+ +.+.+.++|. 
T Consensus 463 ~~a~~av~~dpaLip-~lApl~~~~~~~v~~P~~~a~~~~g~ed~~~~~~~~~g~ 516 (631) T protein:vir:10 463 MWAQDAVSKDPTLIP-MLAPLIAGVLKQIEFPQQQAIDSGGNEDTSDADDLDDGE 516 (631) T ss_pred HHHHHHhhcccCcch-hhHHHHHHHhhhccCCCCCCCCCCCCCccccccccccCC Confidence 11111 11110 00 00 0011111111 No 142 >protein:vir:99088 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655692;genbank:gi:109521770;genbank:GeneID:4157810 Probab=99.19 E-value=1.1e-11 Score=80.64 Aligned_cols=413 Identities=11% Similarity=0.055 Sum_probs=211.1 Q ss_pred CCCCCcccccCCC-ccHHHHHHhhccCcccc-cccccccccc-cc-cccccCCccccHHHHhhhHHHHHHHHHHHHhhhh Q lcl|NC_019710. 1 MEEPKYTIDLRTN-NGWWARLKSWFVGGRLV-TPNQGSQTGP-VS-AHGYLGDSSINDERILQISTVWRCVSLISTLTAC 76 (424) Q Consensus 1 ~~~~~~~~~~~~~-~G~~~~~~~~~~~~~~~-~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~ 76 (424) |+-....+.-|-- +---.|=++.-.+..+. .+....+-.. .+ ...|-. -.-+-|-..+.+.-.+..|++++++ T Consensus 1 ma~~~lr~~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks~~~~~~~~WQ~---eAW~~~d~v~Elry~vgW~~~s~Sr 77 (629) T protein:vir:99 1 MAPTSLRIVRRPKSEPVSTRQRALVAASQPVENPGKAFRKAMGSSTRTDWQD---DAWKAYDAVGELRYYVGWRSSSASR 77 (629) T ss_pred CCccceeeeecCCCCChhhhhhhhhhhhhcccccchhhhhhcCCCchhhhhH---HHHHHHHhhhhHHHHhhhhhhhhce Confidence 5532211111100 00001111111111111 1111110000 00 000100 0011222367888899999999999 Q ss_pred CceeEeeccccCccccccccc--h----hHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC------ce-e Q lcl|NC_019710. 77 LPLDVFETDQNDNRKKVDLSN--P----LARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG------DV-I 143 (424) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~--~----l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G------~~-~ 143 (424) +.+..-+-+.+++......++ + +.++...--...+-..++++.+..++-+-|++|+.+.....| .+ . 
T Consensus 78 ~rL~as~idpDtg~ptg~i~e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~~~d~~~~~~~ 157 (629) T protein:vir:99 78 VRLIASAIDPDTGLPTGSIDEDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKSRLDSNGNPVP 157 (629) T ss_pred eeeEeeeecCCCCCCccccCCCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCCCccCCCCcchh Confidence 999988877666544332222 2 222222222223456799999999999999999988743333 22 2 Q ss_pred EEEEeccceEEEEEcCCceEEEEEecCceEEecHhHeeEecC--cCC-CCccccchHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019710. 144 SLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKG--FGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFAN 220 (424) Q Consensus 144 ~l~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~--~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 220 (424) +++.+-+..|+-. +++.......+....-.+.-+++ ||- +++ .....-||+.+++..+.-..-..+...+..+. T Consensus 158 eW~~vt~~ei~~~--~~~~~i~lP~g~~~e~~~~~d~l-~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~aaakS 234 (629) T protein:vir:99 158 EWLALTPEEVRAS--EKKTIIELPTGDKHEFRDGLDGM-FRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIANASKS 234 (629) T ss_pred hheeechHHhhhc--cCceeEEcCCCCccceeCCCceE-EEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHH Confidence 3444555544422 22222233333333334444544 552 332 34567899999998888777777776666665 Q ss_pred cCCCceeEEcCCCCCC-------------HH--------HHHHHHHHHH----HHhCCcc--cCc-ceecC------CCc Q lcl|NC_019710. 221 GAKSPQILSTGEKVLT-------------EQ--------QRSQVEENFK----EIAGGPV--KKR-LWILE------AGF 266 (424) Q Consensus 221 g~~p~~vl~~~~~~~~-------------~~--------~~~~~~~~~~----~~~~~~~--ag~-~~~l~------~g~ 266 (424) -..-.||+-++...+= +. ..+.+.+.+- ......+ +-- ++++. 
+++ T Consensus 235 RL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E~i~~i 314 (629) T protein:vir:99 235 RLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGELIKNV 314 (629) T ss_pred HHhhCceeEeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHHhcCe Confidence 5555566433222110 00 2222333332 2222221 111 22221 123 Q ss_pred eeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCh Q lcl|NC_019710. 267 STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE-KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPA 345 (424) Q Consensus 267 ~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~-~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~ 345 (424) +.-.+.....+ --+++++-.+.++|....|||+.|-+.+ ++|-. ++-+....-+.--|.|.+..|++.|++..|.+ T Consensus 315 ~hlkf~~ei~e-~aiktR~daI~RlA~glDippE~LLGlGsd~NHW--sAWqI~dedvrlHI~P~l~~ic~AlT~~~Lrp 391 (629) T protein:vir:99 315 THLKFDNQVTE-VAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHW--SAWQIGDEDVRLHILPPVEMLCEAITNQVLRT 391 (629) T ss_pred eEEeecCchhH-HHHhhHHHHHHHHHhccCCchhhheeccCCccce--EEEEecccceeeecchhHHHHHHHHHhhHHHH Confidence 33334433333 3578999999999999999998886663 54432 22333345566679999999999999998865 Q ss_pred h------hhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC-------------eeeeccc Q lcl|NC_019710. 346 K------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGD-------------VAMRQSQ 406 (424) Q Consensus 346 ~------~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd-------------~~~~~~n 406 (424) - +-..|-+.||.+.|.... ++.+-+..+.+.|.+|-...|+.+|+.-..+-| ......+ T Consensus 392 ~Le~eGiDp~kYvvW~DaS~Lt~dP--d~~deA~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P~ 469 (629) T protein:vir:99 392 VLMREGIDPNAYVVWHDASQLTVDP--DKTDEARDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVGQDPN 469 (629) T ss_pred HHHHhCCCHHHhEeeecCcccccCC--CCcHHHHHHHHcCCccHHHHHHHhcCccccccCCCchHHHHHHHHHhhhhCcc Confidence 2 122467899999886542 344556678899999999999999997532221 1111111 Q ss_pred cc---------------chhhccc-----cCC-CccCCC Q lcl|NC_019710. 
407 YV---------------PITDLGT-----NKE-PRNNGA 424 (424) Q Consensus 407 ~~---------------~~~~~~~-----~~~-~~~~g~ 424 (424) ++ |+...+- +++ .+.+|+ T Consensus 470 Li~~~a~l~~~~a~~~~P~~~~~~pp~~e~~~~dE~sga 508 (629) T protein:vir:99 470 LLPTLAVLIPELADVEFPTPTVALPPAEEQDGDEEASGA 508 (629) T ss_pred hhhhhhhhhhhhcccccCccCCCCCccccCCCcccccCC Confidence 10 1110000 000 000111 No 143 >protein:vir:8654 Length: 629 # NCBI annotation: gp12 # Family: family:all:2798 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817773;genbank:gi:29566205;genbank:GeneID:1259465 Probab=99.18 E-value=1.3e-11 Score=80.31 Aligned_cols=413 Identities=11% Similarity=0.054 Sum_probs=211.1 Q ss_pred CCCCCcccccCCC-ccHHHHHHhhccCcccc-cccccccccc-cc-cccccCCccccHHHHhhhHHHHHHHHHHHHhhhh Q lcl|NC_019710. 1 MEEPKYTIDLRTN-NGWWARLKSWFVGGRLV-TPNQGSQTGP-VS-AHGYLGDSSINDERILQISTVWRCVSLISTLTAC 76 (424) Q Consensus 1 ~~~~~~~~~~~~~-~G~~~~~~~~~~~~~~~-~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~ 76 (424) |+-....+.-|-- +---.|=++.-.+..+. .+....+-.. .+ ...|-. -.-+-|-..+.+.-.+..|++++++ T Consensus 1 ma~~~lr~~rrpk~~p~~~r~~al~aas~~i~~p~~~~~ks~~~~~~~~WQ~---eAW~~~d~v~Elry~vgW~~~s~Sr 77 (629) T protein:vir:86 1 MAPTSLRIVRRPKSEPVSTRQRALVAASQPVENPGKAFRKAMGSSTRTDWQE---DAWKAYDAVGELRYYVGWRSSSASR 77 (629) T ss_pred CCccceeeeecCCCCChhhhhhhhhhhhhccccccchhhhhcCCCchhhhhH---HHHHHHHhhhhHHHHhhhhhhhhce Confidence 5532211111100 00001111111111111 1111110000 00 000100 0011222367888899999999999 Q ss_pred CceeEeeccccCccccccccc--h----hHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC------ce-e Q lcl|NC_019710. 77 LPLDVFETDQNDNRKKVDLSN--P----LARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG------DV-I 143 (424) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~--~----l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G------~~-~ 143 (424) +.+..-+-+.|++......++ + +.++...--...+-..++++.+..++-+-|++|+.+.....| .+ . 
T Consensus 78 ~rL~as~idpDtg~ptg~i~e~~~~~~~v~~~v~~i~gG~lgqa~lLkr~~~~ltV~GE~wiv~~~~~~~~~d~~~~~~~ 157 (629) T protein:vir:86 78 VRLIASAIDPDTGLPTGSIDEDDRVGARVQQIVNQIAGGALGQAQLIKRVVEQLTVAGETWVAILFTDKSRLDSNGNPVP 157 (629) T ss_pred eeeEeeeecCCCCCCccccCCCchhHHHHHHHHHhhcCChhhHHHHHHHHHhheecccceEEEEeecCCCccCCCCcchh Confidence 999988877666544332222 2 222222222223456799999999999999999988743333 22 2 Q ss_pred EEEEeccceEEEEEcCCceEEEEEecCceEEecHhHeeEecC--cCC-CCccccchHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019710. 144 SLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLKG--FGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFAN 220 (424) Q Consensus 144 ~l~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~--~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~n 220 (424) +++.+-+..|+-. +++.......+....-.+..+++ ||- +++ .....-||+.+++..+.-..-..+...+..+. T Consensus 158 eW~~vt~~ei~~~--~~~~~i~lP~g~~~e~~~~~d~l-~RiW~P~Prr~~e~DSpvra~l~~l~Ei~~lt~~i~aaakS 234 (629) T protein:vir:86 158 EWLALTPEEVRAS--EKKTIIELPTGDKHEFRDGLDGM-FRVWNPRARRAREPDSPVRANLDSLKEIVRTTKTIANASKS 234 (629) T ss_pred hheeechHHhhhc--cCceeeEcCCCCcceeeCCCceE-EEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHHHHHHHH Confidence 3444555544422 22222233333333333444554 652 332 34567899999998888777777776666665 Q ss_pred cCCCceeEEcCCCCCC-------------HH--------HHHHHHHHHH----HHhCCcc--cCc-ceecC------CCc Q lcl|NC_019710. 221 GAKSPQILSTGEKVLT-------------EQ--------QRSQVEENFK----EIAGGPV--KKR-LWILE------AGF 266 (424) Q Consensus 221 g~~p~~vl~~~~~~~~-------------~~--------~~~~~~~~~~----~~~~~~~--ag~-~~~l~------~g~ 266 (424) -..-.||+-++...+= +. ..+.+.+.+- ......+ +-- ++++. 
+++ T Consensus 235 RL~gnGvlflP~e~slP~~~~p~~~n~pg~~~p~~~~~pa~~~l~~~l~q~a~tAi~De~S~aA~vPiia~~P~E~i~~i 314 (629) T protein:vir:86 235 RLIGNGVVFVPHEMSLPSMNAPVASNKPGAPAPPILGTPAVQQLQELLFQVAQTAYDDEDSMAALIPMFAAAPGELIKNV 314 (629) T ss_pred HHhhCceeeeccCcccCccCCCCCCCCCCcccccccccchHHHHHHHHHHHHhhhhcCCCCccceeeeeEeechHHhcCe Confidence 5555566433222110 00 2222333332 2222221 111 22221 123 Q ss_pred eeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccCh Q lcl|NC_019710. 267 STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVE-KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPA 345 (424) Q Consensus 267 ~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~-~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~ 345 (424) +.-.+.....+ --+++++-.+.++|....|||+.|-+.+ ++|-. ++-+....-+.--|.|.+..|++.|++..|.+ T Consensus 315 ~hlkf~~ei~e-~aiktR~daI~RlA~glDippE~LLGlGsd~NHW--sAWqI~dedvrlHI~P~l~~ic~AlT~~~Lrp 391 (629) T protein:vir:86 315 THLKFDNQVTE-VAIKTRNDAIARLAMGLDVSPERLLGLGSNSNHW--SAWQIGDEDVRLHILPPVEMLCEAITNQVLRT 391 (629) T ss_pred eEEeecCchhH-HHHhhHHHHHHHHHhccCCchhhheeccCCccce--EEEEecccceeeecchHHHHHHHHHHhhHHHH Confidence 33334433333 3578999999999999999998886663 54432 22333345566679999999999999998865 Q ss_pred h------hhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC-------------eeeeccc Q lcl|NC_019710. 346 K------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGD-------------VAMRQSQ 406 (424) Q Consensus 346 ~------~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd-------------~~~~~~n 406 (424) - +-..|-+.||.+.|.... ++.+-+..+.+.|.+|-...|+.+|+.-..+-| ......+ T Consensus 392 ~Le~eGiDp~kYvvW~DaS~Lt~dP--d~~deA~~a~drGAIt~eAlrk~lGf~eD~~yd~tt~E~~~~~a~d~V~~~P~ 469 (629) T protein:vir:86 392 VLMREGIDPNAYVVWHDASQLTVDP--DKTDEARDAFDRGAITAEAMVKMLGLADDTVYDFTTPEGWAQWARDRVGQDPN 469 (629) T ss_pred HHHHhCCCHHHhEeeecCcccccCC--CCcHHHHHHHHcCCcCHHHHHHHhcCccccccCCCchHHHHHHHHHhhhhCcc Confidence 2 122467899999886542 344556678899999999999999997532221 1111111 Q ss_pred cc---------------chhhccc-----cCC-CccCCC Q lcl|NC_019710. 
407 YV---------------PITDLGT-----NKE-PRNNGA 424 (424) Q Consensus 407 ~~---------------~~~~~~~-----~~~-~~~~g~ 424 (424) ++ |+...+- +++ .+.+|+ T Consensus 470 Li~~~a~l~~~~a~~~~P~~~~~~pp~~e~~~~dE~sga 508 (629) T protein:vir:86 470 LLPTLAVLIPELADVEFPTPTVALPPAEEQDGDEEASGA 508 (629) T ss_pred hhhhhhhhhhhhcccccCccCCCCCccccCCCcccccCC Confidence 10 1111000 000 000111 No 144 >protein:vir:107517 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943786;genbank:gi:38638411;genbank:GeneID:2657197 Probab=99.04 E-value=2.7e-10 Score=73.07 Aligned_cols=405 Identities=11% Similarity=0.039 Sum_probs=210.2 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccccc-------cccccccc-cccccccCCccc------cHHHHhhhHHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTP-------NQGSQTGP-VSAHGYLGDSSI------NDERILQISTVWRC 66 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~-------~~~~~~~~-~~~~~~~~~~~~------~~~~~~~~~~v~~~ 66 (424) |+-- ++++-- +.++..+++. .+.. ..+ ..+..+.++.+- .-+.|-..+.+.-. T Consensus 1 ma~~--~lr~~r----------rpk~~p~~~rr~~ltaAsq~~-~~p~~~~kt~~~~~ar~~WQ~eAW~~~d~v~Elry~ 67 (639) T protein:vir:10 1 MAAT--SLRVVR----------RPKGSAPAARRRSLTAASQLI-TDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYY 67 (639) T ss_pred CCcc--ceeeee----------cCCCCCcchhhHHHhhhhhcc-CCcccchhhhccccchhhhhhhhhhhhhhhhhHHHH Confidence 5542 222211 1111111000 0000 000 111111111110 11223345888899 Q ss_pred HHHHHHhhhhCceeEeeccccCccc-------cccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCC Q lcl|NC_019710. 67 VSLISTLTACLPLDVFETDQNDNRK-------KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNS 138 (424) Q Consensus 67 i~~ia~~ia~~~~~~~~~~~~~~~~-------~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~-r~~ 138 (424) +..|+++++++.+..-+-+.|.... +....+.+......=-..-+-..++++.+..++-+-|++|+.+. +.. 
T Consensus 68 vgW~~~s~sr~rL~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~ 147 (639) T protein:vir:10 68 VSWRANSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQE 147 (639) T ss_pred hhhhhhhhceeeeEeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecC Confidence 9999999999999887777554422 11112233333222223335577899999999999999997754 344 Q ss_pred CCc------eeEEEE-eccceEEEEEcCCceEEEEEecCceEEecHhHeeEec--CcCC-CCccccchHHHHHHHHHHHH Q lcl|NC_019710. 139 AGD------VISLLP-LQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLK--GFGF-TGLVGLSPIAFACKSAGVAV 208 (424) Q Consensus 139 ~G~------~~~l~~-l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r--~~~~-~~~~G~s~~~~~~~~i~~~~ 208 (424) ++. +.+-|. +-...|. .+.++..-.. ..++...+|.-+.=+.|| .+++ .....-||+.+++..+.-.. T Consensus 148 k~~~~~~~~~~~~W~vvs~~Ei~-~~~~~~~~i~-lPdG~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~ 225 (639) T protein:vir:10 148 KDPVTGLAAPRARWYAVTREEIK-SKAGETAEIS-LPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIE 225 (639) T ss_pred ccccCcccccccceeeeeHHHhc-ccCCCeeEee-cCCCCCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHH Confidence 332 344443 3333333 1222222122 223333344433333355 2332 33567899999998888777 Q ss_pred HHHHHHHHHHhccCCCceeEEcCCCCCC------------------------HHHHHHHHHHH----HHHhCCcc--cCc Q lcl|NC_019710. 
209 AMEDQQRDFFANGAKSPQILSTGEKVLT------------------------EQQRSQVEENF----KEIAGGPV--KKR 258 (424) Q Consensus 209 ~~~~~~~~~~~ng~~p~~vl~~~~~~~~------------------------~~~~~~~~~~~----~~~~~~~~--ag~ 258 (424) -..+...+..+.-..-.||+-++...+- ....+.+...+ .......+ +-- T Consensus 226 ~~t~~i~aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~ 305 (639) T protein:vir:10 226 RTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAY 305 (639) T ss_pred HhhhHHHHHHHHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccce Confidence 7777766666655555566543322110 01122233333 22222221 111 Q ss_pred -ceecCC----CceeeeccCC-hhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 259 -LWILEA----GFSTSAIGVT-PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYIS 332 (424) Q Consensus 259 -~~~l~~----g~~~~~l~~s-~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~ 332 (424) ++++.. .-+++.+.+. ..+.--+++++-.+.++|....|||+.|-+..++|-. ++-+....-++--|.|.+. T Consensus 306 vPiia~~p~E~l~~ikhl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl~d~NHW--sAWqI~dedvrlHI~P~l~ 383 (639) T protein:vir:10 306 IPLVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHW--SAWAIGDEDVQLHIKPVMD 383 (639) T ss_pred eeeeEeechHHhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeecccccce--EEEEecccceeeecchhHH Confidence 222211 1244444432 2233357899999999999999999988776665432 2334444556677999999 Q ss_pred HHHHHHhhhccChh------hhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe------ Q lcl|NC_019710. 333 RWENSIQRWLIPAK------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDV------ 400 (424) Q Consensus 333 ~ie~~l~~~L~~~~------~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~------ 400 (424) .|++.|++..|.+- +...|-+.||.+.|.... ++.+-+..+.+.|.+|-.-.|+.+|+.-..+=|. 
T Consensus 384 ~icdAlT~~~Lrp~Le~eGvDp~kYvvW~DaS~Lt~dP--d~~deA~qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~ 461 (639) T protein:vir:10 384 LICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDP--DLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGC 461 (639) T ss_pred HHHHHHHhhHHHHHHHHhCCCHHHhEeeecCcccccCC--CCcHHHHHHHHcCCccHHHHHHHhccccccCCCCCCcHHH Confidence 99999999988652 222467899999886542 3445566788999999999999999864432120 Q ss_pred -------------------eee-----cccc-----cchhhccccCCCccCCC Q lcl|NC_019710. 401 -------------------AMR-----QSQY-----VPITDLGTNKEPRNNGA 424 (424) Q Consensus 401 -------------------~~~-----~~n~-----~~~~~~~~~~~~~~~g~ 424 (424) ++. ...+ .....-.+....+++|+ T Consensus 462 ~~~A~~~V~~~P~li~~~apl~~P~lq~~e~ptp~~a~~~a~~~~~~de~~ga 514 (639) T protein:vir:10 462 REFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREDEEDDEDSGA 514 (639) T ss_pred HHHHHHHhcCCcchhhhhhhccCccceecccCCCCCCCCCCCCCCCcccccCC Confidence 000 0000 00000011112222222 No 145 >protein:vir:97900 Length: 639 # NCBI annotation: gp8 # Family: family:all:2798 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655104;genbank:gi:109391854;genbank:GeneID:4157263 Probab=99.04 E-value=2.7e-10 Score=73.07 Aligned_cols=405 Identities=11% Similarity=0.039 Sum_probs=210.2 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccccc-------cccccccc-cccccccCCccc------cHHHHhhhHHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTP-------NQGSQTGP-VSAHGYLGDSSI------NDERILQISTVWRC 66 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~-------~~~~~~~~-~~~~~~~~~~~~------~~~~~~~~~~v~~~ 66 (424) |+-- ++++-- +.++..+++. .+.. ..+ ..+..+.++.+- .-+.|-..+.+.-. 
T Consensus 1 ma~~--~lr~~r----------rpk~~p~~~rr~~ltaAsq~~-~~p~~~~kt~~~~~ar~~WQ~eAW~~~d~v~Elry~ 67 (639) T protein:vir:97 1 MAAT--SLRVVR----------RPKGSAPAARRRSLTAASQLI-TDPQKQMKTSLMGTARNEWQSEAWDFSESIGELSYY 67 (639) T ss_pred CCcc--ceeeee----------cCCCCCcchhhHHHhhhhhcc-CCcccchhhhccccchhhhhhhhhhhhhhhhhHHHH Confidence 5542 222211 1111111000 0000 000 111111111110 11223345888899 Q ss_pred HHHHHHhhhhCceeEeeccccCccc-------cccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe-eCC Q lcl|NC_019710. 67 VSLISTLTACLPLDVFETDQNDNRK-------KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD-RNS 138 (424) Q Consensus 67 i~~ia~~ia~~~~~~~~~~~~~~~~-------~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~-r~~ 138 (424) +..|+++++++.+..-+-+.|.... +....+.+......=-..-+-..++++.+..++-+-|++|+.+. +.. T Consensus 68 vgW~~~s~sr~rL~as~idpDtg~PtG~V~~E~d~~~~~v~~~v~~iagG~lGqa~llkr~~~~ltV~GE~wi~~l~r~~ 147 (639) T protein:vir:97 68 VSWRANSCSRTTLIPSAIDPDTGLPTGEVDIEEDPDAQTVADYVKGIADGPLGQAALIKRAVECMTVVGEVWIAVLIRQE 147 (639) T ss_pred hhhhhhhhceeeeEeeeeccccCCCCCccccccccCcchHHHHHHhhcCccchHHHHHHHHHhheecccceEEEEEEecC Confidence 9999999999999887777554422 11112233333222223335577899999999999999997754 344 Q ss_pred CCc------eeEEEE-eccceEEEEEcCCceEEEEEecCceEEecHhHeeEec--CcCC-CCccccchHHHHHHHHHHHH Q lcl|NC_019710. 139 AGD------VISLLP-LQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLK--GFGF-TGLVGLSPIAFACKSAGVAV 208 (424) Q Consensus 139 ~G~------~~~l~~-l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r--~~~~-~~~~G~s~~~~~~~~i~~~~ 208 (424) ++. +.+-|. +-...|. .+.++..-.. ..++...+|.-+.=+.|| .+++ .....-||+.+++..+.-.. 
T Consensus 148 k~~~~~~~~~~~~W~vvs~~Ei~-~~~~~~~~i~-lPdG~~he~~~~~d~l~RvW~P~prr~~e~dSpvra~l~~l~Ei~ 225 (639) T protein:vir:97 148 KDPVTGLAAPRARWYAVTREEIK-SKAGETAEIS-LPDGKTHEFNRDLDSLVRIWNPRPRKASQATSPVRACLETLREIE 225 (639) T ss_pred ccccCcccccccceeeeeHHHhc-ccCCCeeEee-cCCCCCccccCCCceEEEEeCCCcccccCCcchhHHHHHHHHHHH Confidence 332 344443 3333333 1222222122 223333344433333355 2332 33567899999998888777 Q ss_pred HHHHHHHHHHhccCCCceeEEcCCCCCC------------------------HHHHHHHHHHH----HHHhCCcc--cCc Q lcl|NC_019710. 209 AMEDQQRDFFANGAKSPQILSTGEKVLT------------------------EQQRSQVEENF----KEIAGGPV--KKR 258 (424) Q Consensus 209 ~~~~~~~~~~~ng~~p~~vl~~~~~~~~------------------------~~~~~~~~~~~----~~~~~~~~--ag~ 258 (424) -..+...+..+.-..-.||+-++...+- ....+.+...+ .......+ +-- T Consensus 226 ~~t~~i~aaakSRl~gnGvlfvP~els~p~~~~p~~~~~~~~pg~~v~~~~~~~a~d~l~~~l~qaa~tai~De~S~aA~ 305 (639) T protein:vir:97 226 RTTRKIKNAAKSRVMNNGVLFVPAEMSLPAAQAPIPAGQAQIPGAPVPEVSGVPASEQLATMIYQASVAAMEDENSQAAY 305 (639) T ss_pred HhhhHHHHHHHHHHhhCceeeeccccCCCCccccccccccccCcccccccCCccchHHHHHHHHHHHHhhhcCCCCccce Confidence 7777766666655555566543322110 01122233333 22222221 111 Q ss_pred -ceecCC----CceeeeccCC-hhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 259 -LWILEA----GFSTSAIGVT-PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYIS 332 (424) Q Consensus 259 -~~~l~~----g~~~~~l~~s-~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~ 332 (424) ++++.. .-+++.+.+. ..+.--+++++-.+.++|....|||+.|-+..++|-. ++-+....-++--|.|.+. 
T Consensus 306 vPiia~~p~E~l~~ikhl~f~~ei~e~aiktR~daI~RlA~glDi~pE~LLGl~d~NHW--sAWqI~dedvrlHI~P~l~ 383 (639) T protein:vir:97 306 IPLVASVAAEHLEKVQHIKFGNEVTEVEIKTRIDAITRLAMGLDVSPERLLGMSKGNHW--SAWAIGDEDVQLHIKPVMD 383 (639) T ss_pred eeeeEeechHHhcCeeeeeecCchhHHHHhhHHHHHHHHHhccCCchhheeecccccce--EEEEecccceeeecchhHH Confidence 222211 1244444432 2233357899999999999999999988776665432 2334444556677999999 Q ss_pred HHHHHHhhhccChh------hhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe------ Q lcl|NC_019710. 333 RWENSIQRWLIPAK------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDV------ 400 (424) Q Consensus 333 ~ie~~l~~~L~~~~------~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~------ 400 (424) .|++.|++..|.+- +...|-+.||.+.|.... ++.+-+..+.+.|.+|-.-.|+.+|+.-..+=|. T Consensus 384 ~icdAlT~~~Lrp~Le~eGvDp~kYvvW~DaS~Lt~dP--d~~deA~qa~drGAIt~eAlR~~lG~~edd~yd~~t~e~~ 461 (639) T protein:vir:97 384 LICQAIYNDILTPLLAREGIDPTKYILWYDASGLTSDP--DLSDEAVEAHDRGAITSAALRRLLNVGEDSGYDLTTLDGC 461 (639) T ss_pred HHHHHHHhhHHHHHHHHhCCCHHHhEeeecCcccccCC--CCcHHHHHHHHcCCccHHHHHHHhccccccCCCCCCcHHH Confidence 99999999988652 222467899999886542 3445566788999999999999999864432120 Q ss_pred -------------------eee-----cccc-----cchhhccccCCCccCCC Q lcl|NC_019710. 401 -------------------AMR-----QSQY-----VPITDLGTNKEPRNNGA 424 (424) Q Consensus 401 -------------------~~~-----~~n~-----~~~~~~~~~~~~~~~g~ 424 (424) ++. 
...+ .....-.+....+++|+ T Consensus 462 ~~~A~~~V~~~P~li~~~apl~~P~lq~~e~ptp~~a~~~a~~~~~~de~~ga 514 (639) T protein:vir:97 462 REFAADVVTKNPELIAMYAPLLSSQLAGIEFPQPANAIESTREDEEDDEDSGA 514 (639) T ss_pred HHHHHHHhcCCcchhhhhhhccCccceecccCCCCCCCCCCCCCCCcccccCC Confidence 000 0000 00000011112222222 No 146 >protein:vir:106027 Length: 629 # NCBI annotation: gp9 # Family: family:all:2798 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654906;genbank:gi:109392362;genbank:GeneID:4157055 Probab=98.93 E-value=1.1e-09 Score=69.65 Aligned_cols=408 Identities=12% Similarity=0.072 Sum_probs=207.1 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccccccccccccccccccccCCcc----c---cHHHHhhhHHHHHHHHHHHHh Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSS----I---NDERILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~---~~~~~~~~~~v~~~i~~ia~~ 73 (424) |+-. ++++--+ -++-...+.....+++...+-. +.....+.+ . .-+-|-..+.+.-.+..++++ T Consensus 1 ma~~--~lrv~rr------pk~~p~~r~l~aasqp~~P~~~-~~~~~~g~~~~~~WQ~eAW~~~d~VgElryyvgW~~ss 71 (629) T protein:vir:10 1 MAAS--TLRVSRR------PKGSPARRSLTAASQPMEPGRT-PSRQVAGTVVRTSWQNEAWECMDLVGELRYYVGWRASS 71 (629) T ss_pred CCcc--ceeEEec------CCCccceeeeccccCCCCcchh-hchhhhhhhhhhhhhHHHHHHHHhhhhHHHHhhhhhhh Confidence 4432 1211110 0100000011111111110000 000111111 0 011122247778888999999 Q ss_pred hhhCceeEeeccccCccccccc--cchhHHh----hccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC----cee Q lcl|NC_019710. 74 TACLPLDVFETDQNDNRKKVDL--SNPLARL----LRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG----DVI 143 (424) Q Consensus 74 ia~~~~~~~~~~~~~~~~~~~~--~~~l~~l----L~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G----~~~ 143 (424) ++++.+..-+-+.|++...... ++|-... ...=-..-+-..++++.+..++-+-|+.|++++.-.++ .+. 
T Consensus 72 ~Sr~rL~as~idpDtg~ptg~i~ed~p~~~~v~~~v~~iagG~lGqaqLlkr~~~~ltV~GE~~i~il~~~~~~pd~~~r 151 (629) T protein:vir:10 72 CSRVELIASELDPDTGKPTGGIRDDDPDGLRFLEIVKTMAGGPLGQAQLQKRAAECLTVPGEHRICLLDQGDKNPDGSVR 151 (629) T ss_pred heeeeEEEeeecCCCCCCccccccCchhHHHHHHHHHHhcCccchHHHHHHHHHhheeccCceEEEEeecCCCCCCcccc Confidence 9999998877776654332222 3332222 22222233557789999999999999999987644433 344 Q ss_pred -EEEEeccceEEEEEcCCceEEEEEecCceEEecHhHeeEec--CcCC-CCccccchHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019710. 144 -SLLPLQSANMDVKLVGKKVVYRYQRDSEYADFSQKEIFHLK--GFGF-TGLVGLSPIAFACKSAGVAVAMEDQQRDFFA 219 (424) Q Consensus 144 -~l~~l~p~~v~~~~~~~~~~~~~~~~~~~~~~~~~evih~r--~~~~-~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ 219 (424) ..+.+....|. .++.+..-....++....|..+.=+.|| .+++ .....-||+.+++..+.-..-..+...+..+ T Consensus 152 ~~W~vVt~~Ei~--~kg~g~~~i~lpdg~~he~~~~~D~l~RvW~P~Prr~~e~DSpvra~l~~lrEi~r~tk~i~~aak 229 (629) T protein:vir:10 152 HNWYVVTNDEVK--NKGAGKTDIELPDGTIHEYSKGRDVMFRVWNPRPRRAKEPDSPVRACLDSLREIIRTTKKIRNASK 229 (629) T ss_pred cceeeecHHHhc--cccCceeEEEcCCCceeeeeCCCeeEEEeeCCCcccccCCcchhHHHHHHHHHHHHhhhHhHHHHH Confidence 33334444333 2222222222233334445444333444 2332 2356789999999888877777776666666 Q ss_pred ccCCCceeEEcCCCCC---------------------CHHHHHHHHHHH----HHHhCCcc--cCc-ceec--CC--Cce Q lcl|NC_019710. 
220 NGAKSPQILSTGEKVL---------------------TEQQRSQVEENF----KEIAGGPV--KKR-LWIL--EA--GFS 267 (424) Q Consensus 220 ng~~p~~vl~~~~~~~---------------------~~~~~~~~~~~~----~~~~~~~~--ag~-~~~l--~~--g~~ 267 (424) .-..-.||+-++...+ .....+.+...+ ....-..+ +-- ++++ ++ .-+ T Consensus 230 SRL~gnGvlflP~e~slp~~~ap~~~~~Pg~~~p~~~g~aa~d~l~~~l~q~a~aAi~De~S~aA~vPiia~vP~E~l~~ 309 (629) T protein:vir:10 230 SRLIGNGVVFLPQELSLPRATAPVADNQPGAPVPIVDGVAAADELSNLLFQTAAAAVDDEDSQAALIPLLATVPGEHLQK 309 (629) T ss_pred hHHhhCceeEeccCcccccccCCCCCCCCcccccccCCCcchHHHHHHHHHHHHhhhcCCCCccceeeeEEeechHHhcC Confidence 5555556643322211 001222233333 22222222 111 2222 11 123 Q ss_pred eeeccC--ChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCC-CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccC Q lcl|NC_019710. 268 TSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVE-KSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIP 344 (424) Q Consensus 268 ~~~l~~--s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~-~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~ 344 (424) ++.|.+ ...+. -++.++-.+.++|....|||+.|-+.+ ++|-. ++-+....-++--|.|.+..|++.|++.+|. T Consensus 310 ikhLkf~~eite~-~iktR~daI~RlAmglDispErLLGlGsd~NHW--sAWqI~dedvrlHI~P~l~~ic~Ait~~~Lr 386 (629) T protein:vir:10 310 IFHLKIGNEITEV-EIKTRNDAIARLAMGLDVSPERLLGLGSNSNHW--SAWQIGDEDVQLHIKPVMEVLCAAIYREVLV 386 (629) T ss_pred eeeeeecCchhHH-HHhhHHHHHHHHHhccCCChhheeeccCCccce--eeEEecccceeeecchHHHHHHHHHHhHHHH Confidence 444443 33333 478899999999999999999886663 44432 2334444556667999999999999998876 Q ss_pred hh------hhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc--Cee-----------eecc Q lcl|NC_019710. 345 AK------DVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGG--DVA-----------MRQS 405 (424) Q Consensus 345 ~~------~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~gg--d~~-----------~~~~ 405 (424) +- +-..|-+.||.+.|.... ++.+-+..+...|.+|-...|+.+|+.-.++= ++. ..+. 
T Consensus 387 p~L~~eGiDp~~Yvvw~DaS~Lt~dP--d~~deA~~a~drGaIt~eAlRr~lG~~~dd~y~~~t~~~~q~~A~~~v~~~P 464 (629) T protein:vir:10 387 ATLRAEGIDPDRYVLWYDASGLTVDP--DKTDEATAAKEQGAITHEAYRRYLGLADEDGYDLETLEGAQAWARDAIVADP 464 (629) T ss_pred HHHHHhCCCHHHhEeeecCcccccCC--CCcHHHHHHHHcCCccHHHHHHHhccccccCCCcCCcHHHHHHHHHHhcCCC Confidence 42 122467899998886532 34445566888999999999999999643321 111 1111 Q ss_pred ccc----ch-----------------hhccccCCCccCC-----------C Q lcl|NC_019710. 406 QYV----PI-----------------TDLGTNKEPRNNG-----------A 424 (424) Q Consensus 406 n~~----~~-----------------~~~~~~~~~~~~g-----------~ 424 (424) .++ |+ ....+.+++.+++ | T Consensus 465 ~Li~~~apll~~~l~~i~~P~p~~a~~~~~~~~~~~E~~~~~~e~~~e~dA 515 (629) T protein:vir:10 465 SLIKVLAPLLTDELAEIDWPEPPAALPPGEDDQADEEQDTTGSEPSTEDDA 515 (629) T ss_pred chhhhhhhhcCCccccccccCCCCcCCCCCcccCccccCCCCCCcCCCcch Confidence 111 00 0001111111111 1 No 147 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=98.82 E-value=9.3e-09 Score=64.61 Aligned_cols=387 Identities=12% Similarity=0.032 Sum_probs=190.1 Q ss_pred cccCCCccHHHHHHhhccCccccccccccccccccccc---ccCCc----cccHHHHhhhHHHHHHHHHHHHhhhhCcee Q lcl|NC_019710. 8 IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHG---YLGDS----SINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~----~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |.-.|.--|++++........+ +.. .....+.+.. ..+.. .-....-..+.+...+|+..+.-+-.-||. 
T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~-r~~--~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~ 77 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMS-RVR--LLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGIT 77 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHH-HHH--HHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCee Confidence 7778888888888776543321 110 0001111100 00110 000011123456677888888888778887 Q ss_pred EeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCC Q lcl|NC_019710. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~ 160 (424) +--. .+.+ ....+.+++. + | ....+...+..+++.+|.||.++-.+.+|.+ .+..++|..+.+..|+. T Consensus 78 ~~~~-~d~~-----~~~~~~~i~~-~-N---~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~-~i~~~~p~~~~~i~d~~ 145 (456) T protein:vir:10 78 VGGS-ADSD-----LALRARRIWR-D-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPL 145 (456) T ss_pred cCCC-CCcc-----hHHHHHHHHH-h-c---ChhhHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEEccceeEEEEcCC Confidence 5311 1111 1122444443 2 2 2335566788999999999999988888876 46778888888777643 Q ss_pred ce-------EEEEEecCce----------------------------EEecHh------HeeEecCc----CCCCccccc Q lcl|NC_019710. 161 KV-------VYRYQRDSEY----------------------------ADFSQK------EIFHLKGF----GFTGLVGLS 195 (424) Q Consensus 161 ~~-------~~~~~~~~~~----------------------------~~~~~~------evih~r~~----~~~~~~G~s 195 (424) .. .|+...++.. .+...+ +.-|+-.. ..+...|+| T Consensus 146 ~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~g~g 225 (456) T protein:vir:10 146 QPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMG 225 (456) T ss_pred CCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCCCCCc Confidence 21 0110000000 000000 00111100 012345778 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCC--CCCHHHHHHH--HHHHHHHhCCcccCcceecCCCceeeec Q lcl|NC_019710. 
196 PIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK--VLTEQQRSQV--EENFKEIAGGPVKKRLWILEAGFSTSAI 271 (424) Q Consensus 196 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~--~~~~~~~~~~--~~~~~~~~~~~~ag~~~~l~~g~~~~~l 271 (424) .++.....++....+..-........+.|..++.-... ...++.-..+ ...++.. .+.++.++++.++.++ T Consensus 226 d~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~-----~~~~~~~~~~~~~~q~ 300 (456) T protein:vir:10 226 EVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAA-----PGALWELPPGVDIWES 300 (456) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhh-----ccccccCCCCcceEEe Confidence 77766666665554433322333333334333321100 0001100001 1112211 2457778888888777 Q ss_pred cCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhc------cCh Q lcl|NC_019710. 272 GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWL------IPA 345 (424) Q Consensus 272 ~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L------~~~ 345 (424) ..... -.+.+..+....+|++.=++|+..++.... |.|... ..+....+.-.+...+..|...| +.. T Consensus 301 ~~~~~-~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~-N~Sg~A-----i~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~ 373 (456) T protein:vir:10 301 QANDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQSAEG-----AHNIEKGFLFKCEDRLSIAKIGLEAILVKALQ 373 (456) T ss_pred cccCh-hHHHHHHHHHHHHHHhccCCChHHhccccc-ChHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 64322 237888999999999999999999986432 212111 11222222222222222221111 000 Q ss_pred -hh-hccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC----CcCeeeecccccchhhccccCCC Q lcl|NC_019710. 
346 -KD-VGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLP----GGDVAMRQSQYVPITDLGTNKEP 419 (424) Q Consensus 346 -~~-~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~----ggd~~~~~~n~~~~~~~~~~~~~ 419 (424) .+ .....+++.+......+..+.++.+.++.+.|+.+..-+++++|+.+.+ ..+...-..+ .....-.+.| T Consensus 374 ~~g~~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~~---~~~~~~~~~~ 450 (456) T protein:vir:10 374 IEGESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQIT---LFAGNPVQRP 450 (456) T ss_pred hcCCCcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHHH---HHhhhhhhcC Confidence 01 1112344445556677888899999999999999998888899987531 1111100000 0001112456 Q ss_pred ccCCC Q lcl|NC_019710. 420 RNNGA 424 (424) Q Consensus 420 ~~~g~ 424 (424) .++|+ T Consensus 451 ~~~~~ 455 (456) T protein:vir:10 451 QEDGS 455 (456) T ss_pred CCCCC Confidence 67777 No 148 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=98.82 E-value=9.3e-09 Score=64.61 Aligned_cols=387 Identities=12% Similarity=0.032 Sum_probs=190.1 Q ss_pred cccCCCccHHHHHHhhccCccccccccccccccccccc---ccCCc----cccHHHHhhhHHHHHHHHHHHHhhhhCcee Q lcl|NC_019710. 8 IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHG---YLGDS----SINDERILQISTVWRCVSLISTLTACLPLD 80 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~----~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~ 80 (424) |.-.|.--|++++........+ +.. .....+.+.. ..+.. .-....-..+.+...+|+..+.-+-.-||. 
T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~-r~~--~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~ 77 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMS-RVR--LLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGIT 77 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHH-HHH--HHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCee Confidence 7778888888888776543321 110 0001111100 00110 000011123456677888888888778887 Q ss_pred EeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCC Q lcl|NC_019710. 81 VFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 81 ~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~ 160 (424) +--. .+.+ ....+.+++. + | ....+...+..+++.+|.||.++-.+.+|.+ .+..++|..+.+..|+. T Consensus 78 ~~~~-~d~~-----~~~~~~~i~~-~-N---~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~-~i~~~~p~~~~~i~d~~ 145 (456) T protein:vir:10 78 VGGS-ADSD-----LALRARRIWR-D-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPL 145 (456) T ss_pred cCCC-CCcc-----hHHHHHHHHH-h-c---ChhhHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEEccceeEEEEcCC Confidence 5311 1111 1122444443 2 2 2335566788999999999999988888876 46778888888777643 Q ss_pred ce-------EEEEEecCce----------------------------EEecHh------HeeEecCc----CCCCccccc Q lcl|NC_019710. 161 KV-------VYRYQRDSEY----------------------------ADFSQK------EIFHLKGF----GFTGLVGLS 195 (424) Q Consensus 161 ~~-------~~~~~~~~~~----------------------------~~~~~~------evih~r~~----~~~~~~G~s 195 (424) .. .|+...++.. .+...+ +.-|+-.. ..+...|+| T Consensus 146 ~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~g~g 225 (456) T protein:vir:10 146 QPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMG 225 (456) T ss_pred CCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCCCCCc Confidence 21 0110000000 000000 00111100 012345778 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCC--CCCHHHHHHH--HHHHHHHhCCcccCcceecCCCceeeec Q lcl|NC_019710. 
196 PIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK--VLTEQQRSQV--EENFKEIAGGPVKKRLWILEAGFSTSAI 271 (424) Q Consensus 196 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~--~~~~~~~~~~--~~~~~~~~~~~~ag~~~~l~~g~~~~~l 271 (424) .++.....++....+..-........+.|..++.-... ...++.-..+ ...++.. .+.++.++++.++.++ T Consensus 226 d~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~-----~~~~~~~~~~~~~~q~ 300 (456) T protein:vir:10 226 EVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAA-----PGALWELPPGVDIWES 300 (456) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhh-----ccccccCCCCcceEEe Confidence 77766666665554433322333333334333321100 0001100001 1112211 2457778888888777 Q ss_pred cCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhc------cCh Q lcl|NC_019710. 272 GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWL------IPA 345 (424) Q Consensus 272 ~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L------~~~ 345 (424) ..... -.+.+..+....+|++.=++|+..++.... |.|... ..+....+.-.+...+..|...| +.. T Consensus 301 ~~~~~-~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~-N~Sg~A-----i~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~ 373 (456) T protein:vir:10 301 QANDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQSAEG-----AHNIEKGFLFKCEDRLSIAKIGLEAILVKALQ 373 (456) T ss_pred cccCh-hHHHHHHHHHHHHHHhccCCChHHhccccc-ChHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 64322 237888999999999999999999986432 212111 11222222222222222221111 000 Q ss_pred -hh-hccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC----CcCeeeecccccchhhccccCCC Q lcl|NC_019710. 
346 -KD-VGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLP----GGDVAMRQSQYVPITDLGTNKEP 419 (424) Q Consensus 346 -~~-~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~----ggd~~~~~~n~~~~~~~~~~~~~ 419 (424) .+ .....+++.+......+..+.++.+.++.+.|+.+..-+++++|+.+.+ ..+...-..+ .....-.+.| T Consensus 374 ~~g~~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~~---~~~~~~~~~~ 450 (456) T protein:vir:10 374 IEGESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQIT---LFAGNPVQRP 450 (456) T ss_pred hcCCCcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHHH---HHhhhhhhcC Confidence 01 1112344445556677888899999999999999998888899987531 1111100000 0001112456 Q ss_pred ccCCC Q lcl|NC_019710. 420 RNNGA 424 (424) Q Consensus 420 ~~~g~ 424 (424) .++|+ T Consensus 451 ~~~~~ 455 (456) T protein:vir:10 451 QEDGS 455 (456) T ss_pred CCCCC Confidence 67777 No 149 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=98.80 E-value=1.4e-08 Score=63.72 Aligned_cols=357 Identities=9% Similarity=-0.019 Sum_probs=169.2 Q ss_pred hccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHh Q lcl|NC_019710. 23 WFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARL 102 (424) Q Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~l 102 (424) ++-.+.. 
...-....-....+...+|+.++..+---.|++ .+++ ....+.++ T Consensus 1 ~l~~~~~-------------------~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~~----~d~~-----~~~~~~~i 52 (434) T protein:vir:98 1 MLPKNAE-------------------QAFLDFQRKARTNFCGLIANASVHRLLALGVTG----PDGE-----PDTRASRW 52 (434) T ss_pred CCCCCcc-------------------HHHHHhhhhhhccchHHHHHHHHhhhccCceec----CCCc-----hHHHHHHH Confidence 0000000 000000000012244556666666443333332 1211 12234444 Q ss_pred hccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce------eEEEEeccceEEEEEcCCc------eEEEEE-ec Q lcl|NC_019710. 103 LRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV------ISLLPLQSANMDVKLVGKK------VVYRYQ-RD 169 (424) Q Consensus 103 L~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~------~~l~~l~p~~v~~~~~~~~------~~~~~~-~~ 169 (424) +. + | ........+..+++.+|.||+++.++.++.. ..+..++|..+.+..|... ..|+.. .+ T Consensus 53 ~~-~-N---~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai~~~~~~~~ 127 (434) T protein:vir:98 53 WQ-A-N---RLDSRQKLVWRMAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGLKVWHNDID 127 (434) T ss_pred HH-h-c---ChhHHHHHHHHHHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEEEEEEeccC Confidence 43 2 2 2335667788999999999999887665432 2366788888887776432 111100 00 Q ss_pred Cce----------EE-------------e----------------cHh--HeeEecCcCCCCccccchHHHHHHHHHHHH Q lcl|NC_019710. 170 SEY----------AD-------------F----------------SQK--EIFHLKGFGFTGLVGLSPIAFACKSAGVAV 208 (424) Q Consensus 170 ~~~----------~~-------------~----------------~~~--evih~r~~~~~~~~G~s~~~~~~~~i~~~~ 208 (424) +.. .. . +-. =|+||.+.+...-.|.|-++.....++... 
T Consensus 128 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~~g~sd~e~vi~liDa~~ 207 (434) T protein:vir:98 128 GFGYARVFFDDTSFPYRTRERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGEDPEPEFAGVLDIQDRVN 207 (434) T ss_pred CceEEEEEEeCcEEEEEEeeccccccccccccceecccccccccCCCCccceEEeccCCCcCcCCcchhhhHHHHHHHHH Confidence 000 00 0 000 145555443222368888888777777777 Q ss_pred HHHHHHHHHHhccCCCceeEEcCCC-CCCHHHHHHHHHHHHHHhCCcccCcceecC-CCceeeeccCChhHHHHHHHHHH Q lcl|NC_019710. 209 AMEDQQRDFFANGAKSPQILSTGEK-VLTEQQRSQVEENFKEIAGGPVKKRLWILE-AGFSTSAIGVTPQDAEMMASRKF 286 (424) Q Consensus 209 ~~~~~~~~~~~ng~~p~~vl~~~~~-~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~-~g~~~~~l~~s~~d~~~~e~~~~ 286 (424) .+........+-.+.|..+++-... ...++ ........+.+... .++++.++ ++.++.++..+.. -.+.+..+. T Consensus 208 ~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~-~~~~~~~~~~~~~~--~~~i~~~~~~~~~~~q~~~~~~-~~~~~~l~~ 283 (434) T protein:vir:98 208 LGILNRMAASRFSGFRQKWIKGHKFAKRTDP-ATGMTVVDQPFVPS--PSAVWASEGENTQFGQLDATDL-SGFLKEHAS 283 (434) T ss_pred HHHHHHHHHHHHhcchhhhhcCCCccccccc-ccccchhhhhhhcc--ccccccCCCCCceEEEecCcch-HHHHHHHHH Confidence 6666655555555556655541110 01111 11111122222222 23466665 3567767654322 237778888 Q ss_pred HHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhhhc------cChh-h--hccceeeecc Q lcl|NC_019710. 287 QVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQRWL------IPAK-D--VGRIHAEHNL 357 (424) Q Consensus 287 ~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L------~~~~-~--~~~~~~~f~~ 357 (424) ....++..=++|+..++... ++.|... ..+....+.-.+...+..|...| .... 
+ .....+++.+ T Consensus 284 ~i~~~~~~~~~p~~~~~~~~-~n~Sg~A-----l~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~~~~~~~~~v~w 357 (434) T protein:vir:98 284 DVRDMLTISQTPTYLYATDL-VNISADT-----IGALDILHVAKVREHIASFSEGLESVLALAAAQAGVPEDYTEAEVRW 357 (434) T ss_pred HHHHHhcccCCCHHHhcccc-CChHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCChhheeeeEEe Confidence 89999999999999998532 2222111 12222222222222222222111 0000 1 1123455555 Q ss_pred hhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC------cCe--ee-ecccccchhhccccCCCccCCC Q lcl|NC_019710. 358 DGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG------GDV--AM-RQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 358 ~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g------gd~--~~-~~~n~~~~~~~~~~~~~~~~g~ 424 (424) ......+..+.++.+.++++.|+ +..-+++.+|+++-+- .++ .. ....... ........+.+++| T Consensus 358 ~~~~~~s~~~~ada~~kl~~~g~-~~e~~~~~lg~~~~e~~r~~~e~~~~~~~~~~~~~~~-~~~~~g~~~~~~~~ 431 (434) T protein:vir:98 358 ANPAHVTMAVKADAATKLKSIGY-PLDVIAEELDESPARVRRIVAGAASQALLAASLLPAP-GAPSAGNVPDSGGA 431 (434) T ss_pred cCCCCCCHHHHHHHHHHHHhcCC-cHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhhccC-CCCCCCCCCcccCC Confidence 66677888999999999998886 7777888899876320 000 00 0000000 01111122233333 No 150 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=98.75 E-value=3.2e-08 Score=61.70 Aligned_cols=407 Identities=11% Similarity=0.086 Sum_probs=198.6 Q ss_pred CCCCCcccccCCC---ccHHHHHHhhccCccccc---ccccccccccc----cccccCCcccc--------HHHHhhhHH Q lcl|NC_019710. 1 MEEPKYTIDLRTN---NGWWARLKSWFVGGRLVT---PNQGSQTGPVS----AHGYLGDSSIN--------DERILQIST 62 (424) Q Consensus 1 ~~~~~~~~~~~~~---~G~~~~~~~~~~~~~~~~---~~~~~~~~~~~----~~~~~~~~~~~--------~~~~~~~~~ 62 (424) |..-+.-.+..+. .=+++.+.++- .+.... +-+.....+.. ...+.++..-+ .+.++.+|. 
T Consensus 1 ~~~~~~w~~~de~~~~~~~~~~~~~~~-~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~~~n~~eLI~~YR~ma~~~pE 79 (533) T protein:vir:58 1 MPSLEKYKKLNEAVNFTNFLSPMYGMG-APHGAGGSSMIPINMYHPFATAGYASRFYGGIEFNRFFLYDMYDRMDYTDPL 79 (533) T ss_pred CCCcchhhhhhHHHHHHHhhchhhccc-CccCCCCCccccCCCCcchhhhhhhhhhhccccccHHHHHHHHHHhhccCcc Confidence 3222222222222 13333344331 111111 11111111100 11122333222 233466899 Q ss_pred HHHHHHHHHHhhhhC-----ceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee- Q lcl|NC_019710. 63 VWRCVSLISTLTACL-----PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR- 136 (424) Q Consensus 63 v~~~i~~ia~~ia~~-----~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r- 136 (424) |.+||+.|++.+.-+ |+.+--.+.+ -.+.+ ...+..+|+ ...--+.+++.|..+|..|..++. T Consensus 80 Vd~AideIvneaiv~d~~~~pV~v~l~~~e-~s~~i--K~kI~~lld--------f~~~~~~~fR~WYVDGriy~Hkiik 148 (533) T protein:vir:58 80 ISTVLDIIADECTIPNENGNIVDVVTKDIE-LAKAI--LSYLDYVIN--------IEKNAYPIIRNMIKYGDMFLHILEK 148 (533) T ss_pred hhhHHHhhhceeeEecCCCceeEeeccccc-ccHHH--HHHHHHHhc--------chhhhhHHHHhhhhcceeEEEeccC Confidence 999999999976643 3333211111 00011 112233332 222334567888999999988753 Q ss_pred CCCCceeEEEEeccceEEEEEcCC--ceEEEEE-------ecCceEEecHhHeeEecCc--CCCCccccchHHHHHHHHH Q lcl|NC_019710. 137 NSAGDVISLLPLQSANMDVKLVGK--KVVYRYQ-------RDSEYADFSQKEIFHLKGF--GFTGLVGLSPIAFACKSAG 205 (424) Q Consensus 137 ~~~G~~~~l~~l~p~~v~~~~~~~--~~~~~~~-------~~~~~~~~~~~evih~r~~--~~~~~~G~s~~~~~~~~i~ 205 (424) +..+.+.+|..|+|.+|+..++.. ..+|.|. .+.....++.+.|+|+.+- ..++.+++|-|..+.+.+. 
T Consensus 149 ~~k~GI~elr~lDPr~i~~vr~~~t~~eyyvy~~~~~~~~s~~~~~kI~~daI~y~~SGl~d~~~~~iisyLhkAiKp~N 228 (533) T protein:vir:58 149 GSDGTIEKFQVVSPYIFSKRYNPETDTWYYVITDVYRNVVSGYFNEDIPEEDVIHFSHKIDTNFFPYGRSYLESARAIWN 228 (533) T ss_pred CcccchhhheecCCeeeEEEEeeccceEEEeecccccccccCccccccchhheeeeeeccccCCCCceehhhhHHHHHHH Confidence 355678899999999998766542 2333333 2334578999999999854 3456788899999988888 Q ss_pred HHHHHHHHHHHHHhccCCCceeEEcCCCCCC-HHHHHHHHHHHHHHhCC----cccCcc----------eec-------- Q lcl|NC_019710. 206 VAVAMEDQQRDFFANGAKSPQILSTGEKVLT-EQQRSQVEENFKEIAGG----PVKKRL----------WIL-------- 262 (424) Q Consensus 206 ~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~-~~~~~~~~~~~~~~~~~----~~ag~~----------~~l-------- 262 (424) ...-+....--+--.-+.-+-|+..+-+... ..+.+.++....++... ...|.+ -.+ T Consensus 229 QLkmiEDAlVIYRisRAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRR 308 (533) T protein:vir:58 229 QLRLMEDALMLYRVVRSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRR 308 (533) T ss_pred HHHHHHHHHHHHhhcCChhheEEEEeecCCCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhccccc Confidence 8777777766655555544556655544433 34445555555554321 112222 122 Q ss_pred --CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019710. 263 --EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQR 340 (424) Q Consensus 263 --~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~ 340 (424) ..|.+++.|.-. .+.-++-.++..+.+..+++||.+-|+...+...+..=+.+..+ +...|.-+-..+.+.|.. 
T Consensus 309 eGgrgTEI~TLpGg--~lgemeDV~YF~kkLy~ALnVP~sRl~~e~~fgr~~eItRDEiK--F~KFI~rLR~rF~~ll~~ 384 (533) T protein:vir:58 309 GDRRAVEIDILQGS--KVDLAEDVEYMLNRLISALKVPKAFIGYEGDVNAKNTLATQDIK--FNNTIKRIQGFFVEELER 384 (533) T ss_pred CCCccceeeecCCC--CCCcHHHHHHHHHHHHHHhCCCeeecCCCCCCccchhhhHHHHH--HHHHHHHHHHHHHHHHhc Confidence 135677776542 25556777888999999999999988765432111100112222 445567777778888888 Q ss_pred hccChhhh--ccceeeecchhhhcc--CH---HHHHHHHHHH---HhC-----C--CcCHHHHH------HHhCCCCC-- Q lcl|NC_019710. 341 WLIPAKDV--GRIHAEHNLDGLLRG--DS---ASRAAFMKAM---GES-----G--LRTINEMR------RTDNLPPL-- 395 (424) Q Consensus 341 ~L~~~~~~--~~~~~~f~~~~~~~~--d~---~~~~~~~~~~---~~~-----g--~~t~NE~R------~~lg~~p~-- 395 (424) +|....-. ..+.+.|..|+.... +. ..|..+++.+ ++. - -|| +|+. +..+..++ T Consensus 385 qLilk~iit~eew~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~t-dei~~q~e~ie~E~~~~~~~ 463 (533) T protein:vir:58 385 MVRMNKEFADQDFRLVMNRSNSIVEGERFAVIEQRIGIAERLKGWVREDWIYSNILQIP-YDLKPQEEVAEAAGGGGLFD 463 (533) T ss_pred ccccccCcchhheeeeeeccchHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHhcCC-hhhhHHHHHHHHhhcCCCCC Confidence 88764322 124555555543221 11 2233332221 111 0 122 2222 22222221 Q ss_pred -CCcCeeeecccc----c-chhh--------ccccCCCccCCC Q lcl|NC_019710. 396 -PGGDVAMRQSQY----V-PITD--------LGTNKEPRNNGA 424 (424) Q Consensus 396 -~ggd~~~~~~n~----~-~~~~--------~~~~~~~~~~g~ 424 (424) ++-++-+.|... . |++. 
.+...++..+|+ T Consensus 464 ~~~~~~e~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~ 506 (533) T protein:vir:58 464 TGGFGEETTPADFLGERGSPIESPRGRTEFDFGTEGGEELGGE 506 (533) T ss_pred CCCcccccCCcccCccccCcccCCCChhhHhcccCCccccccc Confidence 111111111111 1 1111 111111111111 No 151 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=98.65 E-value=5.3e-08 Score=60.46 Aligned_cols=394 Identities=13% Similarity=0.060 Sum_probs=180.2 Q ss_pred cccCCCccHHHHHHhhccCccccccc-cccccc--ccccccc-cCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEee Q lcl|NC_019710. 8 IDLRTNNGWWARLKSWFVGGRLVTPN-QGSQTG--PVSAHGY-LGDSSINDERILQISTVWRCVSLISTLTACLPLDVFE 83 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~-~~~~~~--~~~~~~~-~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~ 83 (424) |.=-|..-|++++........+.--. .....+ .+...+. .....-.......+.+...+|+..+..+-.-||++- T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~- 79 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVG- 79 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhccCCeecC- Confidence 44444445555555443221110000 000000 0000000 000000011112234667788888888877788753 Q ss_pred ccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCce- Q lcl|NC_019710. 84 TDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKV- 162 (424) Q Consensus 84 ~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~- 162 (424) ...+.. ....+.+++.. | ....+...+..+++.+|.||+++-.+.+|.+ .+..++|..+.+..++... 
T Consensus 80 ~~~d~~-----~~~~~~~~~~~--n---~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~-~i~~~~p~~~~~i~d~~~~~ 148 (456) T protein:vir:79 80 GSADSD-----LALRARRIWRD--N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPW 148 (456) T ss_pred CCCCcc-----HHHHHHHHHHh--c---ChhHHHHHHHHHHhhcCeeEEEEeeCCCCce-EEEEeccceeEEEEcCCCCC Confidence 111111 11224444432 2 2335667889999999999999988889987 5788888888877664221 Q ss_pred ------EEEEEecCce---EEec-------------------------------HhHeeEecCc----CCCCccccchHH Q lcl|NC_019710. 163 ------VYRYQRDSEY---ADFS-------------------------------QKEIFHLKGF----GFTGLVGLSPIA 198 (424) Q Consensus 163 ------~~~~~~~~~~---~~~~-------------------------------~~evih~r~~----~~~~~~G~s~~~ 198 (424) .|+...+... ..+. ..++-|.-.. ..+...|+|-+. T Consensus 149 ~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~~~gd~e 228 (456) T protein:vir:79 149 RIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVE 228 (456) T ss_pred ceEEEEEEEEecCCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCceeEEEecCCCCCchhh Confidence 0110000000 0000 0111111100 012234667666 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCC-C-CCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChh Q lcl|NC_019710. 199 FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK-V-LTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQ 276 (424) Q Consensus 199 ~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~-~-~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~ 276 (424) .....++....+..-........+.|..++.-... . ..++.-+.+ ...+.+..+ .+.++.++++.++.++..+.. 
T Consensus 229 ~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i-~~~~~~~~~--~~~~~~~~~~~~~~q~~~~~~ 305 (456) T protein:vir:79 229 PHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAI-DYASIFEAA--PGALWELPPGVDIWESQTNDF 305 (456) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCccccccccccccc-chhhhhhhh--ccccccCCCCcceeeecccCh Confidence 65555544433322222222222333333321100 0 001000000 011111111 245777888888877664322 Q ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhh------hccChhhhcc Q lcl|NC_019710. 277 DAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQR------WLIPAKDVGR 350 (424) Q Consensus 277 d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~------~L~~~~~~~~ 350 (424) -.+.+..+....+|+..-++|+..++.... +.|....+.....+...+ .-....+...|.+ .+....+ . T Consensus 306 -~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~-N~Sg~Al~~~~~~l~~k~-~~~~~~f~~~l~~~~~l~~~~~g~~~--~ 380 (456) T protein:vir:79 306 -TPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQSAEGAHNIEKGFLFKC-EDRLSIAKIGLEAILVKALQIEGESV--E 380 (456) T ss_pred -HHHHHHHHHHHHHHHhhcCCChhHhccccc-CcHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCc--c Confidence 237888899999999999999999986432 222222221111111111 1111111111111 1111111 1 Q ss_pred ceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--C--cCeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 
351 IHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLP--G--GDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 351 ~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~--g--gd~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) ..++..+......+..+.++.+.++++.|+.+..-+++.+|+.+.+ - .+......+ .+ ...--+.++.+|| T Consensus 381 ~~i~v~w~~~~~~s~~~~ada~~kl~~~G~~~~~~~~~~lg~~~~~i~~~e~~r~~~e~~--~~-~~~~~~~~~~~~~ 455 (456) T protein:vir:79 381 DTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQIT--LF-AGNPVQRPQEDGS 455 (456) T ss_pred ccceEEeCCCCCcCHHHHHHHHHHHHhcCCChHHHHHhcCCCCHHHHHHHHHHHHHHHHH--HH-hhhHhhcCCCCCC Confidence 2344444555667788899999999999999998888899987531 1 111100000 00 0011234566666 No 152 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=98.57 E-value=1.2e-07 Score=58.58 Aligned_cols=389 Identities=11% Similarity=0.041 Sum_probs=173.1 Q ss_pred ccHHHHHHhhccCccc-------cc--cccccc------ccccccccccCCcc--c-----c----HHHHhhhHHHHHHH Q lcl|NC_019710. 14 NGWWARLKSWFVGGRL-------VT--PNQGSQ------TGPVSAHGYLGDSS--I-----N----DERILQISTVWRCV 67 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~-------~~--~~~~~~------~~~~~~~~~~~~~~--~-----~----~~~~~~~~~v~~~i 67 (424) +|+++++++||++... .. ...... .....+..|..|.. + . .+..++.... T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~---- 76 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLR---- 76 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcH---- Confidence 6677777776643211 00 000000 00000111111110 0 0 0001111222 Q ss_pred HHHHHhhhhC----ceeEeeccccCcccc----ccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC Q lcl|NC_019710. 
68 SLISTLTACL----PLDVFETDQNDNRKK----VDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA 139 (424) Q Consensus 68 ~~ia~~ia~~----~~~~~~~~~~~~~~~----~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~ 139 (424) ..||+.+|++ +..+.-.+.++.... ......+..++.. | ......+..+.+.+..|.+++-+..+.. T Consensus 77 ~~i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~--n---~f~~~~~~~~e~a~a~G~~a~k~~~d~~ 151 (517) T protein:vir:98 77 KLSADVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQH--N---KFIKNLSDYLEPTFALGGLTVRPYVDNG 151 (517) T ss_pred HHHHHHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHh--c---cHHHHHHHHHHHHhhhCCEEEEEEEeCC Confidence 2344444543 222211111111100 0011123333321 1 1234445566777778888887766543 Q ss_pred CceeEEEEeccceEEEE-Ec------------------CCceEEE----EEe-------------------cC---ceEE Q lcl|NC_019710. 140 GDVISLLPLQSANMDVK-LV------------------GKKVVYR----YQR-------------------DS---EYAD 174 (424) Q Consensus 140 G~~~~l~~l~p~~v~~~-~~------------------~~~~~~~----~~~-------------------~~---~~~~ 174 (424) .+ .+..++++++.+. .+ +...+|. +.. +. .+.. T Consensus 152 -~~-~I~~v~ad~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~ 229 (517) T protein:vir:98 152 -EI-EFSWALANAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKR 229 (517) T ss_pred -ee-EEEEEcCCeeEEEEecCCCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCcccccc Confidence 32 3555666555431 11 1111111 000 00 0001 Q ss_pred ec--------HhH----------eeEecCcCC-----CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcC Q lcl|NC_019710. 175 FS--------QKE----------IFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTG 231 (424) Q Consensus 175 ~~--------~~e----------vih~r~~~~-----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~ 231 (424) ++ +++ ..|++.+-. ....|+|....+...+......-.....-|+.|.. 
+.++ + T Consensus 230 v~L~~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~-~i~v--p 306 (517) T protein:vir:98 230 IPLEELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQR-TVFV--S 306 (517) T ss_pred ccccccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCc-ceec--C Confidence 11 111 225554322 23579999999998888777776666666776554 3332 2 Q ss_pred CCCCCH--HH-HHHHHHHHHHHhCCcccCcceec-CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC Q lcl|NC_019710. 232 EKVLTE--QQ-RSQVEENFKEIAGGPVKKRLWIL-EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK 307 (424) Q Consensus 232 ~~~~~~--~~-~~~~~~~~~~~~~~~~ag~~~~l-~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~ 307 (424) ...... +. .......++. .+..-..+-. +++-.++.++....+-++.+..+...+.|+...|+++..++...+ T Consensus 307 ~~~l~~~~~~~g~~~~~~~d~---~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~ 383 (517) T protein:vir:98 307 DVMLRTVPDESGMPPPQVFDP---DVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGR 383 (517) T ss_pred hhhhccccCCCCcccCCCCCc---ccceeeeccCCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCccccccccc Confidence 221100 00 0000000000 0000000000 122346667777777889999999999999999999999998765 Q ss_pred CCcccccHHHH--HHHHHHHHHHHHHHHHHHHHhh------------hccChhhhccceeeecchhhhccCHHHHHHHHH Q lcl|NC_019710. 308 STSWGSGIEQQ--NLGFLQYTLQPYISRWENSIQR------------WLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMK 373 (424) Q Consensus 308 ~~~~~~n~e~~--~~~f~~~tl~P~~~~ie~~l~~------------~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~ 373 (424) +.. ++.+. .+.-.-.|+.-+...++..|.. .++...-.....+.+++++-+..|.++.++... 
T Consensus 384 ~~k---TATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~~~~~~~~ 460 (517) T protein:vir:98 384 SMK---TATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRSALLRFYG 460 (517) T ss_pred ccc---cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHHHHHHHHH Confidence 443 23222 1122223444444444433332 122222122355778888888999999999999 Q ss_pred HHHhCCCcCHHHHHHH-hCCCCCCCcCeee--ecccccchhhccccCCCccCCC Q lcl|NC_019710. 374 AMGESGLRTINEMRRT-DNLPPLPGGDVAM--RQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 374 ~~~~~g~~t~NE~R~~-lg~~p~~ggd~~~--~~~n~~~~~~~~~~~~~~~~g~ 424 (424) +++.+|+|++-+++.+ .|+..- ..++.+ +.......+.... .++++++. T Consensus 461 ~~v~aG~ms~~~~i~~~~g~~ee-eA~~e~~~i~~E~~~~~~~~~-~~~~~~~~ 512 (517) T protein:vir:98 461 QAKTFGFIPTVEAIQRIFKVPKK-TAEQWLEEIRKDQIELDPVTI-SQRAQKRM 512 (517) T ss_pred HHHhcCCCCHHHHHHHhCCCChH-HHHHHHHHHHHhccccCCCCc-cccccCCC Confidence 9999999999998665 477531 111111 0101111111111 11111111 No 153 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=98.56 E-value=2.8e-07 Score=56.48 Aligned_cols=396 Identities=11% Similarity=0.049 Sum_probs=166.5 Q ss_pred CCCCCcccc-cCCCccHHHHHHhhccCcccccccccccccccccccc---cCCccc-c-HHHHhhhHHHHHHHHHHHHhh Q lcl|NC_019710. 1 MEEPKYTID-LRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGY---LGDSSI-N-DERILQISTVWRCVSLISTLT 74 (424) Q Consensus 1 ~~~~~~~~~-~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~-~-~~~~~~~~~v~~~i~~ia~~i 74 (424) |--|---|. .-+..=+++.+...+...... . ......+.+-.. .+...- . 
......+.+...+|+.++..+ T Consensus 1 ~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~r-~--~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l 77 (485) T protein:vir:10 1 MTAPLPGQEEIEDPAIARDEMVSAFEDSTQN-L--KTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAERQ 77 (485) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH-H--HHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhhh Confidence 222211111 111112344444433222110 0 000000000000 000000 0 000111234455666665544 Q ss_pred hhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce-------eEEEE Q lcl|NC_019710. 75 ACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV-------ISLLP 147 (424) Q Consensus 75 a~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~-------~~l~~ 147 (424) ---+|++ .++. + ....+.+++. + | ....+...+..+++.+|.||+++.++..+.. ..+.+ T Consensus 78 ~~~g~~~---~~~~---~--~~~~~~~i~~-~-N---~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~ 144 (485) T protein:vir:10 78 AVEGFRF---GDAD---E--ADEELWQWWQ-A-N---NLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRV 144 (485) T ss_pred cccceec---CCCc---h--hHHHHHHHHH-h-c---CHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEE Confidence 3223322 2111 1 1123444443 2 1 2446778899999999999999888765322 24677 Q ss_pred eccceEEEEEcCCce------EEEEEecCc----eEEecHhH-------------------------eeEecCcC-CCCc Q lcl|NC_019710. 148 LQSANMDVKLVGKKV------VYRYQRDSE----YADFSQKE-------------------------IFHLKGFG-FTGL 191 (424) Q Consensus 148 l~p~~v~~~~~~~~~------~~~~~~~~~----~~~~~~~e-------------------------vih~r~~~-~~~~ 191 (424) ++|..+.+..|.... .+.+...+. ...+.++. |+||.+.. ..+. 
T Consensus 145 ~~p~~~~~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~ 224 (485) T protein:vir:10 145 EPPTRMYAEIDPRIGRVSKAIRVAYDAEGNEIQAATLYTPNDIFGWYRVENEWQEWFNNPHGLGVVPVVPIPNRTRLSDL 224 (485) T ss_pred EccceeEEEEcCCCCceeEEEEEEEeeCCCeEEEEEEEeCCeEEEEEEcCCceEEeccccCCCCcccEEEeccccccCCC Confidence 788888776663221 111111110 01122222 34444322 2345 Q ss_pred cccchHHH-HHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHH-HHHHHHHHHHHHhCCcccCcceecC-CCcee Q lcl|NC_019710. 192 VGLSPIAF-ACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ-QRSQVEENFKEIAGGPVKKRLWILE-AGFST 268 (424) Q Consensus 192 ~G~s~~~~-~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~-~~~~~~~~~~~~~~~~~ag~~~~l~-~g~~~ 268 (424) +|.|-+.- +...++....+..-......-.+.|..+++-. ...... ....-...++.. .++++.++ ++.++ T Consensus 225 ~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~-~~~~~~~~~~~~~~~~~~~-----~~~i~~~~~~d~k~ 298 (485) T protein:vir:10 225 YGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGI-KPEEIGVDPETGQTLFDAY-----LARILAFEDAEGKI 298 (485) T ss_pred CCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcC-Ccccccccccccchhhhhc-----ccceeccCCCCceE Confidence 67775542 22333332222222222223333444444311 100000 000001112111 23466665 45677 Q ss_pred eeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHH----------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 269 SAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQPYISRWENSI 338 (424) Q Consensus 269 ~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~----------~~~f~~~tl~P~~~~ie~~l 338 (424) .++....-+ .+.+..+....+++..=++|+..+|....+..|....... ....+...+..+++.+.. 
+ T Consensus 299 ~q~~~~~~~-~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~-~ 376 (485) T protein:vir:10 299 QQFSAAELA-NFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYR-M 376 (485) T ss_pred EeecccchH-HHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-H Confidence 666543322 3777888888899999999999997654332221111111 112222222222222211 1 Q ss_pred hhhccChhh-h-ccceeeecchhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC--CcC------------- Q lcl|NC_019710. 339 QRWLIPAKD-V-GRIHAEHNLDGLLRGDSASRAAFMKAMGESG--LRTINEMRRTDNLPPLP--GGD------------- 399 (424) Q Consensus 339 ~~~L~~~~~-~-~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g--~~t~NE~R~~lg~~p~~--ggd------------- 399 (424) ....+ . ....+++.+......+..+.++.+.+++++| +++..-+++.+|+.+-+ ... T Consensus 377 ----~~~~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~~~~~~~ee~~~~~~~~ 452 (485) T protein:vir:10 377 ----MKGGDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGL 452 (485) T ss_pred ----hCCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHhHHHHHHHHHHHHHHHHHHH Confidence 11111 1 1234556666667788899999999999876 88888889999987532 110 Q ss_pred --eeeecccccchhh----cccc--CCCccCCC Q lcl|NC_019710. 400 --VAMRQSQYVPITD----LGTN--KEPRNNGA 424 (424) Q Consensus 400 --~~~~~~n~~~~~~----~~~~--~~~~~~g~ 424 (424) .+..+....+-.. ..+. 
+....+|| T Consensus 453 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 453 IGTMVDPNPTVPGSPSPAPAPKPAALESGGDAA 485 (485) T ss_pred HHHhhccCCCCCCCCCccccccCcCCCCCCCCC Confidence 0111111110000 0000 01111222 No 154 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=98.50 E-value=4.4e-07 Score=55.42 Aligned_cols=389 Identities=12% Similarity=0.009 Sum_probs=173.0 Q ss_pred CCccHHHHHHhhccCccc-cc-------cccccc----ccccccccccCC--------------ccccHHHHhhhHHHHH Q lcl|NC_019710. 12 TNNGWWARLKSWFVGGRL-VT-------PNQGSQ----TGPVSAHGYLGD--------------SSINDERILQISTVWR 65 (424) Q Consensus 12 ~~~G~~~~~~~~~~~~~~-~~-------~~~~~~----~~~~~~~~~~~~--------------~~~~~~~~~~~~~v~~ 65 (424) --.+++.+++++++.-.. .. ...... .....+..+..| .... ...+....... T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~-~~~~~~n~~k~ 79 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVN-RRQLSMNLPKV 79 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccc-cceeecchHHH Confidence 001222222222221100 00 000000 000000000000 0000 01122344455 Q ss_pred HHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEE Q lcl|NC_019710. 66 CVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISL 145 (424) Q Consensus 66 ~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l 145 (424) +++..|+-+.+=|..+--.+ .. ....+..++.. 
-...+-...++.+.+.+|.+|+.+..|.+|.+ .+ T Consensus 80 i~~~~a~~l~~~p~~i~~~d----~~---~~e~l~~~~~~-----n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~-~i 146 (496) T protein:vir:38 80 TAKYMSKLLFNEKVKINIDD----KA---AEEFVLNVLKT-----NGFTKNMERYIEYGEAMGGFVIKVYHDGNKNV-KV 146 (496) T ss_pred HHHHHhhhhhCCcceEeeCC----hH---HHHHHHHHHhc-----cCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcE-EE Confidence 66777776666555542111 00 01123333321 12456667788899999999999998888775 46 Q ss_pred EEeccceEEEEEcCCc-e--------------EEE----EE-ecCc----------------eEEecH---------h-- Q lcl|NC_019710. 146 LPLQSANMDVKLVGKK-V--------------VYR----YQ-RDSE----------------YADFSQ---------K-- 178 (424) Q Consensus 146 ~~l~p~~v~~~~~~~~-~--------------~~~----~~-~~~~----------------~~~~~~---------~-- 178 (424) -.++|.++.+..++.+ . +|. +. .+.. +..++- . T Consensus 147 ~~v~~~~~~P~~~~~~~~~~~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~ 226 (496) T protein:vir:38 147 SFATADCMYPLSNDSENVDECVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPVVP 226 (496) T ss_pred EEEcccceEEEEecCCcEEEEEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCcccccccccccccccee Confidence 6677777665332211 1 010 00 0000 000100 0 Q ss_pred ----H---eeEecCcC-----CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEE-----cCCCCCCHHHHH Q lcl|NC_019710. 179 ----E---IFHLKGFG-----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS-----TGEKVLTEQQRS 241 (424) Q Consensus 179 ----e---vih~r~~~-----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~-----~~~~~~~~~~~~ 241 (424) + +.+++.+- .....|+|.+..+...++....+.....+-|..+ ++..++. ...+... +... 
T Consensus 227 ~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~~-~~~i~v~~~~l~~~~~~~g-~~~~ 304 (496) T protein:vir:38 227 LPDFTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLG-KKKVLVPSSFVKTAVNLDG-STTQ 304 (496) T ss_pred ecCCCcceEEEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhhc-ccceecchHHhhccCCCCC-cccc Confidence 1 22444321 1235799999998888887776666555666653 3343331 0000000 0000 Q ss_pred HHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHH-- Q lcl|NC_019710. 242 QVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN-- 319 (424) Q Consensus 242 ~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~-- 319 (424) ......+.+. +......+++..++.+......-++.+..+....+|+...|+||..+|....+..+...+.... T Consensus 305 ~~~~~~~~~~----~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~ 380 (496) T protein:vir:38 305 YFDSTDEAFF----LYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSE 380 (496) T ss_pred CCCCccceEE----EeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHH Confidence 0000000000 0011112333456666665556677888888899999999999999987554433211111111 Q ss_pred --------HHHHHHHHHHHHHHHHHHHhhhc-cChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh Q lcl|NC_019710. 320 --------LGFLQYTLQPYISRWENSIQRWL-IPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTD 390 (424) Q Consensus 320 --------~~f~~~tl~P~~~~ie~~l~~~L-~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~l 390 (424) ...+..+|..++..+.+..+... 
+.........+.|.++.-+..|.++.++.+.+++.+|+|+.-.++..+ T Consensus 381 l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS~et~l~~~ 460 (496) T protein:vir:38 381 TYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIPLKIALQRA 460 (496) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhc Confidence 12223334444444433222111 111111223456666677788889999999999999999988887644 Q ss_pred -CCCCCCCcCeeeecc-----cccchhhccccCCCcc Q lcl|NC_019710. 391 -NLPPLPGGDVAMRQS-----QYVPITDLGTNKEPRN 421 (424) Q Consensus 391 -g~~p~~ggd~~~~~~-----n~~~~~~~~~~~~~~~ 421 (424) |.+. +..++.+... ...|.++.+...+.++ T Consensus 461 ~~~~d-~ea~~el~ri~~E~~~~~~~~d~~~~~~~~e 496 (496) T protein:vir:38 461 WNITE-AEADEWAEMLAKEKQAEMPNNDMNGIFGEEE 496 (496) T ss_pred CCCCh-HHHHHHHHHHHHhhhccCccccccCCCCCCC Confidence 4432 1111111000 0011111111111111 No 155 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=98.45 E-value=6.1e-07 Score=54.66 Aligned_cols=396 Identities=10% Similarity=0.039 Sum_probs=164.5 Q ss_pred CCCCCcccccCCCc---cHHHHHHhhccCccccccccccccccccccccc--CCccccH---HHHhhhHHHHHHHHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTNN---GWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYL--GDSSIND---ERILQISTVWRCVSLIST 72 (424) Q Consensus 1 ~~~~~~~~~~~~~~---G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~---~~~~~~~~v~~~i~~ia~ 72 (424) |.-+ |..-+.. =++.++...+...... .. .....+.+.... -+..+.. .....+.+..-+|+..+. 
T Consensus 1 ~~~~---~~~~~~~~~~~~~~~l~~~~~~~~~r-l~--~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~ 74 (484) T protein:vir:77 1 MTSP---LQKQENVDPEKAREEMLNLFTERTQD-LG--DNTAYYESERRPDAVGVTVPQQMQKLLAHVGYPRLYIDAIAA 74 (484) T ss_pred CCCc---ccccCCCCHHHHHHHHHHHHHHHHHH-HH--HHHHHHhccccchhcccccchhHHhhhhhcCcHHHHHHHHHh Confidence 3221 1111111 1222222222211110 00 000000000000 0000111 111123344556666665 Q ss_pred hhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCcee-------EE Q lcl|NC_019710. 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVI-------SL 145 (424) Q Consensus 73 ~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~-------~l 145 (424) .+--..|.+ .. ++ + ....+.+++. + | ........+..+.+.+|.||+.+.++.+|.+. .+ T Consensus 75 ~l~~~g~~~---~~-~~--~--~~~~l~~i~~-~-N---~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i 141 (484) T protein:vir:77 75 RQELEGFRL---GG-AD--K--ADEQLWDWWQ-A-N---DLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPII 141 (484) T ss_pred hhccCceec---CC-cc--h--hHHHHHHHHH-h-c---CHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceE Confidence 443334432 11 11 1 1123444443 2 2 23466788899999999999999888887542 46 Q ss_pred EEeccceEEEEEcCCce------EEEEEec-Cc---eEEecHhH-------------------------eeEecCcC-CC Q lcl|NC_019710. 146 LPLQSANMDVKLVGKKV------VYRYQRD-SE---YADFSQKE-------------------------IFHLKGFG-FT 189 (424) Q Consensus 146 ~~l~p~~v~~~~~~~~~------~~~~~~~-~~---~~~~~~~e-------------------------vih~r~~~-~~ 189 (424) .+++|..+....|.... .+.+... +. ...|.++. |++|.+.. .. T Consensus 142 ~~~~p~~~~~~~D~~~~~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~N~~~~~ 221 (484) T protein:vir:77 142 RVEPPTNLYAQIDPRTRQVMRAIRAIEDEEGNEVIGATLYLPNNTVIWNREDGQWVQVANVAHNLEMVPVIPIPNRTRLS 221 (484) T ss_pred EEeccceeEEEecCCCCceEEEEEEEEeecCCcEEEEEEEecCeEEEEEecCCceEeeccccCCCCCcceEEeccccccC Confidence 77888888776664211 0100000 00 00111111 35555432 23 Q ss_pred CccccchHHH-HHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHH--HHHHHHHHHHhCCcccCcceecC-CC Q lcl|NC_019710. 
190 GLVGLSPIAF-ACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQR--SQVEENFKEIAGGPVKKRLWILE-AG 265 (424) Q Consensus 190 ~~~G~s~~~~-~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~--~~~~~~~~~~~~~~~ag~~~~l~-~g 265 (424) ++.|.|.+.- +...++....+..-........+.|..++.-. .. ++... ..-...++.. .++++.++ ++ T Consensus 222 ~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~-~~-~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~ 294 (484) T protein:vir:77 222 DLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGV-KG-EELGVDPETGQTLFDAY-----LARILAFEDHE 294 (484) T ss_pred ccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCC-Cc-chhcccccccchhhhhh-----hhhhcccCCCC Confidence 4577776542 22223333322222222222233454444311 10 11000 0001112211 23466655 45 Q ss_pred ceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHH----HHHHHhhh Q lcl|NC_019710. 266 FSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISR----WENSIQRW 341 (424) Q Consensus 266 ~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~----ie~~l~~~ 341 (424) .++.++....-+ .+++..+.....|+.+-++|+..+|+...+..|..........+...+ .-.... +.+.+..- T Consensus 295 ~~~~q~~~~~~e-~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka-~~k~~~f~~~l~~~~~l~ 372 (484) T protein:vir:77 295 SKAQQFSAAELR-NFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLVKTV-ERKNKIFGGAWEQAMRVA 372 (484) T ss_pred ceeEeecCCChH-HHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH Confidence 677776654333 377888888889999999999999865432222111111111111111 111111 11111100 Q ss_pred --ccChhh--hccceeeecchhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC--CcCeee----e--cccc Q lcl|NC_019710. 342 --LIPAKD--VGRIHAEHNLDGLLRGDSASRAAFMKAMGESG--LRTINEMRRTDNLPPLP--GGDVAM----R--QSQY 407 (424) Q Consensus 342 --L~~~~~--~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g--~~t~NE~R~~lg~~p~~--ggd~~~----~--~~n~ 407 (424) +....+ .....+++.+......+.++.++.+.+++++| +++..-+++++|+-+.+ ...... . 
...+ T Consensus 373 ~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~~~ 452 (484) T protein:vir:77 373 YKVMNGGDIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWDEEEQAQGLGLM 452 (484) T ss_pred HHHhCCCCcccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHH Confidence 111111 01133455556666788888999999999876 88888888888885432 110000 0 0000 Q ss_pred cchhhcccc----------CCCccCCC Q lcl|NC_019710. 408 VPITDLGTN----------KEPRNNGA 424 (424) Q Consensus 408 ~~~~~~~~~----------~~~~~~g~ 424 (424) .++.....+ .++..+.+ T Consensus 453 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (484) T protein:vir:77 453 GTMFGTDPSGGGNPDNPETPEPQPNPA 479 (484) T ss_pred hhhccccccCCCCCCCCCcccccCCCc Confidence 011110000 01111111 No 156 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=98.41 E-value=7.9e-07 Score=54.05 Aligned_cols=394 Identities=11% Similarity=0.032 Sum_probs=162.7 Q ss_pred CCCCCccc---ccCCCccHHHHHHhhccCccccccccccccccccccccc--CCcccc---HHHHhhhHHHHHHHHHHHH Q lcl|NC_019710. 1 MEEPKYTI---DLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYL--GDSSIN---DERILQISTVWRCVSLIST 72 (424) Q Consensus 1 ~~~~~~~~---~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---~~~~~~~~~v~~~i~~ia~ 72 (424) |--|.--. .+. .-+++.|...+..... +. ......+.+.... -+..+. ......+.+...+|+..+. T Consensus 1 ~~~~i~~~~~~~~~--~~~~~~L~~~~~~~~~-r~--~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~ 75 (485) T protein:vir:24 1 MTAPLPGQEEIADP--AIARDEMVSAFEDQNQ-NL--RSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRLYVDSIAE 75 (485) T ss_pred CCCCCCCCCcccch--HHHHHHHHHHHHHHHH-HH--HHHHHHHhccCchhhcCcccchhhhhhhhccchHHHHHHHHhh Confidence 11111000 000 0111212221111000 00 0000000000000 000000 0011112344555555555 Q ss_pred hhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCcee-------EE Q lcl|NC_019710. 
73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVI-------SL 145 (424) Q Consensus 73 ~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~-------~l 145 (424) .+.-.+|.+ .. + .+ ....+.+++.. | ........+..+++.+|.||+++.++.++.+. .+ T Consensus 76 ~l~~~g~~~---~~-~--~~--~~~~l~~i~~~--N---~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i 142 (485) T protein:vir:24 76 RQAVEGFRL---GD-A--DE--ADEELWQWWQA--N---NLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLI 142 (485) T ss_pred hhccCceec---CC-C--ch--hHHHHHHHHHh--c---ChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceE Confidence 543344442 11 1 11 11224444432 2 23466788999999999999999887765432 46 Q ss_pred EEeccceEEEEEcCCce------EEEEEecCc----eEEecHhH-------------------------eeEecCcC-CC Q lcl|NC_019710. 146 LPLQSANMDVKLVGKKV------VYRYQRDSE----YADFSQKE-------------------------IFHLKGFG-FT 189 (424) Q Consensus 146 ~~l~p~~v~~~~~~~~~------~~~~~~~~~----~~~~~~~e-------------------------vih~r~~~-~~ 189 (424) .+++|..+.+..|.... .+.+...+. ...|.++. |+||++.. .. T Consensus 143 ~~~~p~~~~~i~D~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~ 222 (485) T protein:vir:24 143 RVEPPTRMYAEIDPRIGRPAKAIRVAYDAEGNEIQAATLYTPNETFGWFRAEGEWVEWFSDPHGLGAVPVVPLPNRTRLS 222 (485) T ss_pred EEeccceeEEEeeCCcCceeEEEEEEEeecCCeEEEEEEEcCCcEEEEEecCCceEeecccccCCCcccEEEeccCcccC Confidence 77888888776664321 011110000 00111121 35554332 34 Q ss_pred CccccchHH-HHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCH-HHHHHHHHHHHHHhCCcccCcceecC-CCc Q lcl|NC_019710. 190 GLVGLSPIA-FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTE-QQRSQVEENFKEIAGGPVKKRLWILE-AGF 266 (424) Q Consensus 190 ~~~G~s~~~-~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~-~~~~~~~~~~~~~~~~~~ag~~~~l~-~g~ 266 (424) +.+|.|-+. .+...++....+..-........+.|..++.- ...... .....-...++.. .++++.++ ++. 
T Consensus 223 ~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G-~~~~~~~~~~~~~~~~~~~~-----~~~i~~~~~~~~ 296 (485) T protein:vir:24 223 DLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFG-IKPEEIGVDPETGQTLFDAY-----LARILAFEDAEG 296 (485) T ss_pred CcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhcc-CCccccccccccccchhhhc-----ccceeccCCCCc Confidence 567887654 23333443333333333333334445555431 111000 0000001112211 23456664 566 Q ss_pred eeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 267 STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTLQPYISRWEN 336 (424) Q Consensus 267 ~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~tl~P~~~~ie~ 336 (424) ++.++.....+ .+.+..+....+++..=++|+..+|....+..|..... +.....+...+.-+++.+.. T Consensus 297 ~~~q~~~~~~e-~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~ 375 (485) T protein:vir:24 297 KIQQFSAAELA-NFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYR 375 (485) T ss_pred eEEeecccchH-HHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77666553322 36777778888888888999999986543222211111 11111222222222222211 Q ss_pred HHhhhccChhh--hccceeeecchhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC--CcCeeee---cccc Q lcl|NC_019710. 337 SIQRWLIPAKD--VGRIHAEHNLDGLLRGDSASRAAFMKAMGESG--LRTINEMRRTDNLPPLP--GGDVAMR---QSQY 407 (424) Q Consensus 337 ~l~~~L~~~~~--~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g--~~t~NE~R~~lg~~p~~--ggd~~~~---~~n~ 407 (424) +....+ .....+++.+......+..+.++.+.+++.+| +++..-+++.+|+.+.+ ......- .... 
T Consensus 376 -----~~~~~~~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~~~~ee~~~~~~ 450 (485) T protein:vir:24 376 -----LMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGL 450 (485) T ss_pred -----HhcCCCCccccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHHHHHHHHhhhhh Confidence 111111 11133445555556677888899999998865 77877778888886432 1000000 0000 Q ss_pred cchhhccc---------------cCCCccCCC Q lcl|NC_019710. 408 VPITDLGT---------------NKEPRNNGA 424 (424) Q Consensus 408 ~~~~~~~~---------------~~~~~~~g~ 424 (424) ..++.... .++++.+|+ T Consensus 451 ~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~ 482 (485) T protein:vir:24 451 GLLGTMVDADPTVPGSPNPTPAPKPQPAIEGG 482 (485) T ss_pred hHHHhhcccCCCCCCCCCCCCCCCCccCCCCC Confidence 00000000 011111222 No 157 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=98.38 E-value=9.7e-07 Score=53.56 Aligned_cols=384 Identities=11% Similarity=0.025 Sum_probs=179.3 Q ss_pred ccHHHHHHhhccCcccccc-----c-----cccccc------------cc-ccccccCCccc----cHHHHhhhHHHHHH Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTP-----N-----QGSQTG------------PV-SAHGYLGDSSI----NDERILQISTVWRC 66 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~-----~-----~~~~~~------------~~-~~~~~~~~~~~----~~~~~~~~~~v~~~ 66 (424) +|||+|++++|++...... . ..+... .+ +...+...... ..+.......-..+ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sln~~~~i 80 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKNTINMAKTA 80 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccceeecchHHHH Confidence 9999999998864221100 0 000000 00 00111111100 00111122233444 Q ss_pred HHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEE Q lcl|NC_019710. 
67 VSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLL 146 (424) Q Consensus 67 i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~ 146 (424) ++..|+-+..=+..+--.+. + . .+..|..+|. -|. ...-....+.+.+..|.+++.+..+.++ ..+. T Consensus 81 ~~~~A~lv~~e~~~i~v~~~--~--~--~~e~l~~il~--~n~---f~~~~~~~~e~a~a~G~~~~k~~~d~~~--~~i~ 147 (508) T protein:vir:15 81 ARRIASVVFNEKAEIHVKDN--N--E--ADKFLNDVLE--DND---FKNKFEEALEKGVALGGFAMRPYIDGNH--IKIA 147 (508) T ss_pred HHHHHhhhhCCCceEEeCCc--h--H--HHHHHHHHHH--hcc---HHHHHHHHHHHHhhcCceEEEEEEeCCe--eEEE Confidence 44555544333333321111 0 0 0112333342 111 2344556677888889888877666432 3456 Q ss_pred EeccceEEEE-EcCCc------------------eEEE----EE--ecC-c---------------eEEecH-------- Q lcl|NC_019710. 147 PLQSANMDVK-LVGKK------------------VVYR----YQ--RDS-E---------------YADFSQ-------- 177 (424) Q Consensus 147 ~l~p~~v~~~-~~~~~------------------~~~~----~~--~~~-~---------------~~~~~~-------- 177 (424) .++|.++.+. .+.+. .+|. +. .++ . +..++- T Consensus 148 ~v~ad~~~P~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~ 227 (508) T protein:vir:15 148 WVRADQFYPLQSNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKE 227 (508) T ss_pred EEcCCeeEEEEEcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccC Confidence 6666665432 22111 1111 00 000 0 001110 Q ss_pred --hH----------eeEecCcCC-----CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCC--CHH Q lcl|NC_019710. 178 --KE----------IFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVL--TEQ 238 (424) Q Consensus 178 --~e----------vih~r~~~~-----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~--~~~ 238 (424) ++ ..||+.+-. +...|+|.+.-+...++.....-.....-|+.|. +..++. .... 
+++ T Consensus 228 l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~-~~i~v~--~~~l~~d~~ 304 (508) T protein:vir:15 228 LAPQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRLGQ-KHIAVQ--PGMLRFDDE 304 (508) T ss_pred CCcceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHhcc-cceeec--hHHhcCCCC Confidence 11 134443211 2457999999999998887777776777776444 444442 1111 111 Q ss_pred HHHHHHHHHHHHhCCcccCccee--cCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH Q lcl|NC_019710. 239 QRSQVEENFKEIAGGPVKKRLWI--LEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~ag~~~~--l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e 316 (424) ....+. ...+....+- .++|..++.++....+-++.+..+...+.|....|++|..++...++.. ++. T Consensus 305 ~~~~~~-------~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~---TAt 374 (508) T protein:vir:15 305 HKPTFD-------TEQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVK---TAT 374 (508) T ss_pred CccccC-------CCCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccc---cHH Confidence 000010 1111111111 1234457777777677788999999999999999999999987655433 222 Q ss_pred HH-------------HHHHHHHHHHHHHHHHHHHHhh-hccChh--------hhccceeeecchhhhccCHHHHHHHHHH Q lcl|NC_019710. 317 QQ-------------NLGFLQYTLQPYISRWENSIQR-WLIPAK--------DVGRIHAEHNLDGLLRGDSASRAAFMKA 374 (424) Q Consensus 317 ~~-------------~~~f~~~tl~P~~~~ie~~l~~-~L~~~~--------~~~~~~~~f~~~~~~~~d~~~~~~~~~~ 374 (424) +. ....+..+|..++..|.+-... .++... ....+.+.++++.-+..|.++..+...+ T Consensus 375 ei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~ 454 (508) T protein:vir:15 375 EVVSNNSMTYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAK 454 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHH Confidence 21 1222334444444444333221 111110 0112456778888888999999999999 Q ss_pred HHhCCCcCHHHHHHHh-CCCCCCCcCeeee--cccccchhhccccCCCccCCC Q lcl|NC_019710. 
375 MGESGLRTINEMRRTD-NLPPLPGGDVAMR--QSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 375 ~~~~g~~t~NE~R~~l-g~~p~~ggd~~~~--~~n~~~~~~~~~~~~~~~~g~ 424 (424) ++.+|+|+.-+++... |+.. +..++.+. ..........+...++.+++- T Consensus 455 ~v~aGi~s~e~~i~~~~g~~d-eea~~el~ri~~E~~~~~~~~~~~~~~~g~~ 506 (508) T protein:vir:15 455 VLAIGALSKQTFLQRNYGMTD-EQAAEELAKIQSEAPTDTFEGGRSAILNGGD 506 (508) T ss_pred HHhcCCCCHHHHHHhcCCCCh-HHHHHHHHHHHHhccccCccccccccCCCCC Confidence 9999999999987653 6543 11111111 111000011111112222211 No 158 >protein:vir:4073 Length: 279 # NCBI annotation: minor structural protein # Family: family:all:11744 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043552;genbank:gi:9628686;genbank:GeneID:1261159 Probab=98.32 E-value=1.1e-08 Score=64.19 Aligned_cols=260 Identities=12% Similarity=0.083 Sum_probs=132.7 Q ss_pred hhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeC Q lcl|NC_019710. 58 LQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRN 137 (424) Q Consensus 58 ~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~ 137 (424) |+. | -++..|..++=..|.|..-+.+ ..-..+.-|...--|...+-..-++.+....+.--+.| T Consensus 1 ~~~---~-~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------ 64 (279) T protein:vir:40 1 MSL---F-NLSRRAEDVSFSTFTVQDPTTD------LLLGKLLGLVSYFDNVDYSEASKLEDLFYWALQGKEVY------ 64 (279) T ss_pred Ccc---c-ccchhhcccceeeeeecCcchh------HHHHHHHHHHHHhhcccchhhhhhhhhhhhhhccceee------ Confidence 000 0 1122233333333333211000 00001111222233444443333333333222211221 Q ss_pred CCCceeEEEEeccceEEEEEcCCceEE------------EEEecCceEEecHhHeeEecCcCCCCccccchHHHHHHHHH Q lcl|NC_019710. 138 SAGDVISLLPLQSANMDVKLVGKKVVY------------RYQRDSEYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAG 205 (424) Q Consensus 138 ~~G~~~~l~~l~p~~v~~~~~~~~~~~------------~~~~~~~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~ 205 (424) ...-++..+| ........+++|-.++..|-+ +++|.-+-. 
...-++ T Consensus 65 -----------------~~~~~~~~~~~~~~~~d~fn~~vr~~~~~~vtVP~~Dv~IieN----Plv~v~~ee-~~kM~~ 122 (279) T protein:vir:40 65 -----------------RVWYGGFKYYAQRVNADQFNIVVREPNRREVTIRTNDYEMLLN----PFYGANPQR-FGVMFG 122 (279) T ss_pred -----------------hhhhhhHHHHHhhcCcchhhhheecCCcceeEeecchhhhhhc----chheeccch-hhHHHH Confidence 1111111111 111223345666667766643 234443321 112222 Q ss_pred HHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcc-cCcceecCCCceeeeccCChhHHHHHHHH Q lcl|NC_019710. 206 VAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPV-KKRLWILEAGFSTSAIGVTPQDAEMMASR 284 (424) Q Consensus 206 ~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~-ag~~~~l~~g~~~~~l~~s~~d~~~~e~~ 284 (424) +. ......-+.+.+..+++++++.+...++..++.+..++++.-..+ -+++.+++.|.+++++..+-+... .+-. T Consensus 123 la---~nai~~KLD~~~qIk~fIKTd~d~glee~kekaR~rIk~mlalAk~~nGityid~~ddItQL~kDYStsl-k~di 198 (279) T protein:vir:40 123 MA---SNGIGRRLDSQAQIKIYWKTKVSSGLKEVWDRIRERLTQQQQLAREFNGVSVIGSDDDIKQIQPDYSGSL-QNDA 198 (279) T ss_pred HH---HhhhhhhhcccceeeeEEecCcchhHHHHHHHHHHHHHHHHHHHHhcCCeeeecCCceeEeecccccccc-HHHH Confidence 22 233333447888889999999887778888888888887765554 578999999999999987655543 5567 Q ss_pred HHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhh------hccChhhhccceeeecch Q lcl|NC_019710. 285 KFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQR------WLIPAKDVGRIHAEHNLD 358 (424) Q Consensus 285 ~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~------~L~~~~~~~~~~~~f~~~ 358 (424) ++.++..+..+|||..+|-+. ..|++..+|+..+|.|++++.+.+|.. +.++... T Consensus 199 e~lkS~l~Sq~GinekIL~Gs--------AtE~q~iAyy~rtVePILkQyek~liY~~E~fv~y~ttta----------- 259 (279) T protein:vir:40 199 NLAIEIALSEYGMPRELLYGQ--------SNEVTIIAFAIQKVLPLLKQHDKNIIFNQENFVAYISTTA----------- 259 (279) T ss_pred HHHHHHHHhhcCCchhhcccc--------CchhhhhhHHHhhHHHHHHHhcccccchhhhhhhhheecc----------- Confidence 888999999999999999643 347899999999999999998765432 2222111 Q ss_pred hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCe Q lcl|NC_019710. 
359 GLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDV 400 (424) Q Consensus 359 ~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~ 400 (424) ..|.+ |-.-..-.-+|+ |+. T Consensus 260 ------------------~gg~~--~s~~~~~~~~~~--~~~ 279 (279) T protein:vir:40 260 ------------------KGGAI--ESKSSKRDSEPV--GND 279 (279) T ss_pred ------------------cCccc--ccccccccCCCC--CCC Confidence 11111 000111111222 111 No 159 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=98.30 E-value=1.6e-06 Score=52.42 Aligned_cols=398 Identities=10% Similarity=0.032 Sum_probs=182.7 Q ss_pred CC-------CCCcccccCCCc--------------------cHHHHHHhhccCcccccccccccccccccccccCCcccc Q lcl|NC_019710. 1 ME-------EPKYTIDLRTNN--------------------GWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSIN 53 (424) Q Consensus 1 ~~-------~~~~~~~~~~~~--------------------G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 53 (424) ++ --.|.++..+.. -=+.++..+..+.......... .... ... T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~-----~~~~---~~~-- 86 (502) T protein:vir:48 17 LNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGR-----RKDN---EMA-- 86 (502) T ss_pred hhcccChhHHhhhcccchhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc-----cccc---ccc-- Confidence 00 000111111110 0112222223221110000000 0000 000 Q ss_pred HHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEE Q lcl|NC_019710. 54 DERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYAL 133 (424) Q Consensus 54 ~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~ 133 (424) ..-..+......|+..+.-+-+-|+++--.+... ...+...|. 
+....-........+..+++.+|.||++ T Consensus 87 -~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~-------~~~~~~~l~-~~~~~N~~~~~~~~~~~~~~~~G~a~~~ 157 (502) T protein:vir:48 87 -DKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNED-------NSQNDDAIK-RIGRINDIDTHNRNLIRDLSQTGRAYEV 157 (502) T ss_pred -cceeecchHHHHHHHHhhhhcccCeeEecCCccc-------hhHHHHHHH-HHHhhcCHhHHHHHHHHHHhhcCeEEEE Confidence 0012245566777888877777787764222111 122223232 1212223556788899999999999999 Q ss_pred EeeCCCCceeEEEEeccceEEEEEcCCc---eE---EEEE--ec-Cc---eEEecHhHeeEecCc----------C---- Q lcl|NC_019710. 134 VDRNSAGDVISLLPLQSANMDVKLVGKK---VV---YRYQ--RD-SE---YADFSQKEIFHLKGF----------G---- 187 (424) Q Consensus 134 ~~r~~~G~~~~l~~l~p~~v~~~~~~~~---~~---~~~~--~~-~~---~~~~~~~evih~r~~----------~---- 187 (424) +.++.+|.+ .+-.++|..+.+..+... .. ..|. .. +. ...+.++.++++... + T Consensus 158 v~~dedg~~-~i~~~~p~~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~g~ 236 (502) T protein:vir:48 158 IYRSEYDET-RIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNQHIYTLDASDSFNEISVTPHAFGT 236 (502) T ss_pred EEeCCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEEeecCCcEEEEEEEeCCeEEEEEeCCceeeccceecCCCc Confidence 999988876 467788888887766421 11 1111 11 11 112344444433211 0 Q ss_pred ------CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCccee Q lcl|NC_019710. 188 ------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWI 261 (424) Q Consensus 188 ------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~ 261 (424) .+...|.|.+..+...++....+..-..+.++....|-.+++-......++....+++... ......+.... 
T Consensus 237 vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~ 314 (502) T protein:vir:48 237 VPITEFLNNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQASDMKRTRL--MQLKPPKSADG 314 (502) T ss_pred cceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccchhhhhhcce--eeccccccccc Confidence 1233688888888888887777777777777777777777654322222222222211110 00000001111 Q ss_pred cCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHHHHHHHH Q lcl|NC_019710. 262 LEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTLQPYI 331 (424) Q Consensus 262 l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~tl~P~~ 331 (424) .+++.++..+.....+..+....+...+.|+..-++|+...+... ++.|....+ ......+...+.-.+ T Consensus 315 ~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~-~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~ 393 (502) T protein:vir:48 315 KEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFS-GNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRY 393 (502) T ss_pred cccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 233445555544433444566778888999999999965443322 222211111 111133334444444 Q ss_pred HHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCee-----eec Q lcl|NC_019710. 332 SRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG--GDVA-----MRQ 404 (424) Q Consensus 332 ~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g--gd~~-----~~~ 404 (424) +.+...+...--. .......+.+.+...+..|..+.++.+.++ .|+++..-+.+++++-..+. .+.. -.. 
T Consensus 394 ~li~~~~~~~~~~-~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~ 470 (502) T protein:vir:48 394 RLAARIGSLVNEF-KDFDESRLKITFTPNLPKSLYEQVSILNDL--GGQVSQETALSLSGLVENPTEELDKINEESSKID 470 (502) T ss_pred HHHHHHHhhcccc-cccccccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhh Confidence 4444433321111 111112344555667778888999998888 47899888888887632211 0000 000 Q ss_pred ccccc------hhhcccc--CCCccCCC Q lcl|NC_019710. 405 SQYVP------ITDLGTN--KEPRNNGA 424 (424) Q Consensus 405 ~n~~~------~~~~~~~--~~~~~~g~ 424 (424) .+..+ .....++ +++.++.. T Consensus 471 ~~~~~~~~~~~~~~~~d~~~e~~~~~~~ 498 (502) T protein:vir:48 471 FKGYPSYFYDNVGKYTDEVKETHTDDFE 498 (502) T ss_pred hhcccccccccccccCCCccCCCCcCcC Confidence 00000 0000000 01111111 No 160 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=98.29 E-value=1.7e-06 Score=52.27 Aligned_cols=394 Identities=8% Similarity=0.005 Sum_probs=178.3 Q ss_pred CCC-----CCcccc----cCCC--------------------ccHHHHHHhhccCcccccccccccccccccccccCCcc Q lcl|NC_019710. 1 MEE-----PKYTID----LRTN--------------------NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSS 51 (424) Q Consensus 1 ~~~-----~~~~~~----~~~~--------------------~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 51 (424) |+. +-++|. +++. .--+.++..+..+........................+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~ 80 (503) T protein:vir:59 1 MADIYPLGKTHTEELNEIIVESAKEIAEPDTTMIQKLIDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKT 80 (503) T ss_pred CcccccCChhhHHhHHHhhhhhhhhccchhHHHHHHHHHhhcHHHHHHHHHHhccccchhhccchhcccccccccccccc Confidence 221 111111 0000 01122333333222111000000000000000000000 Q ss_pred ccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeE Q lcl|NC_019710. 
52 INDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAY 131 (424) Q Consensus 52 ~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~ 131 (424) + .-+.++....+|+..+.-+-.-|+++- .++ ++. ...+ +.+.. | ........+..+.+.+|.+| T Consensus 81 -~--~ri~~n~~~~ivd~~~~yl~g~~~~~~--~~d---~~~--~~~l-~~~~~--n---~~~~~~~~~~~~~~~~G~~~ 144 (503) T protein:vir:59 81 -N--NRTSHAWHKLFVDQKTQYLVGEPVTFT--SDN---KTL--LEYV-NELAD--D---DFDDILNETVKNMSNKGIEY 144 (503) T ss_pred -c--ceeecchHHHHHHHHHhhhhcCCeeec--cCc---HHH--HHHH-HHHHh--c---CHHHHHHHHHHHHhhCCeEE Confidence 0 011245667788888888877787652 111 111 1122 22322 2 34566677899999999999 Q ss_pred EEEeeCCCCceeEEEEeccceEEEEEcCCc---e-----EEEEEec-Cc----eEEecHhHeeEecCc------------ Q lcl|NC_019710. 132 ALVDRNSAGDVISLLPLQSANMDVKLVGKK---V-----VYRYQRD-SE----YADFSQKEIFHLKGF------------ 186 (424) Q Consensus 132 ~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~---~-----~~~~~~~-~~----~~~~~~~evih~r~~------------ 186 (424) +++-.+.+|.+ .+..++|..+.+..++.. . +|..... +. ...+.++.|.+++.. T Consensus 145 ~~v~~d~dg~~-~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~ 223 (503) T protein:vir:59 145 WHPFVDEEGEF-DYVIFPAEEMIVVYKDNTRRDILFALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGE 223 (503) T ss_pred EEEeecCCCce-EEEEEccceeEEEEeCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCcccccccccc Confidence 99999888876 488889988887666432 1 1111111 11 112334443333210 Q ss_pred -------------------C----CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHH Q lcl|NC_019710. 
187 -------------------G----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQV 243 (424) Q Consensus 187 -------------------~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~ 243 (424) + .+...|.|.+..+...++....+.....+.+...+.|-.+++-......++....+ T Consensus 224 ~~~~~~~~~~~~~~~~~~vPiv~~~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~ 303 (503) T protein:vir:59 224 NNPRPHMTKGGQAIGWGRVPIIPFKNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTANL 303 (503) T ss_pred cccccceeecceeccCCccceEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccchhhhhh Confidence 0 12335788788777777776666666666667777777776532221112211111 Q ss_pred HHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH------- Q lcl|NC_019710. 244 EENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE------- 316 (424) Q Consensus 244 ~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e------- 316 (424) + ..+++.++++.+.+.+........+....+...+.|...-++|..-.... .++.+....+ T Consensus 304 ~-----------~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~~~Sg~Ai~~~~~~l~ 371 (503) T protein:vir:59 304 R-----------YHSVIKVSGDGGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETI-GGGATGPALENLYALLD 371 (503) T ss_pred h-----------cccceeccCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcccc-cccccHHHHHHHHHHHH Confidence 1 12355555555554444433334445566666667766666663211111 1222211211 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Q lcl|NC_019710. 317 ---QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLP 393 (424) Q Consensus 317 ---~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~ 393 (424) +.....+...|.-+++.|...+...-.... 
.....+.+.+..-+..|..+.++.+.+++.+|+++...+.+++++- T Consensus 372 ~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~-~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~et~l~~l~~v 450 (503) T protein:vir:59 372 LKANMAERKIRAGLRLFFWFFAEYLRNTGKGDF-NPDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMSKETAVARNPFV 450 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc-ccccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCchHHHHHhCCCC Confidence 112223333444444434333332111110 1112345555677888999999999999999999998888887664 Q ss_pred CCCCcCeeee----------cccc---cchhhccccC-----CCccCCC Q lcl|NC_019710. 394 PLPGGDVAMR----------QSQY---VPITDLGTNK-----EPRNNGA 424 (424) Q Consensus 394 p~~ggd~~~~----------~~n~---~~~~~~~~~~-----~~~~~g~ 424 (424) +.+..+.-.+ ..+. .+...-.++. +.+++|+ T Consensus 451 ~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (503) T protein:vir:59 451 QDPEEELARIEEEMNQYAEMQGNLLDDEGGDDDLEEDDPNAGAAESGGA 499 (503) T ss_pred CCHHHHHHHHHHHHHHHHhhhccccCccCCCCCCCcCCCCCCcccCCCC Confidence 3211000000 0000 0000001111 1111111 No 161 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=98.28 E-value=1.7e-06 Score=52.15 Aligned_cols=408 Identities=11% Similarity=0.029 Sum_probs=179.9 Q ss_pred CCCCCcccc--------cCCC-------------------ccHHHHHHhhccCcccccc---ccccccc-c-cccccccC Q lcl|NC_019710. 1 MEEPKYTID--------LRTN-------------------NGWWARLKSWFVGGRLVTP---NQGSQTG-P-VSAHGYLG 48 (424) Q Consensus 1 ~~~~~~~~~--------~~~~-------------------~G~~~~~~~~~~~~~~~~~---~~~~~~~-~-~~~~~~~~ 48 (424) |++--|+-. .|.. .=++.++............ ....... . +....... 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~~~~~ 80 (501) T protein:vir:96 1 MEQTLFTDSTGQERVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRK 80 (501) T ss_pred CceeeeeecccceeccccccchhHHhhhcccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCccccC Confidence 332221100 0000 0012222111111000000 0000000 0 00000000 Q ss_pred CccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcC Q lcl|NC_019710. 49 DSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYG 128 (424) Q Consensus 49 ~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G 128 (424) ....+..-+.++....+|+..+.-+-+-|+++--.+.+. ...+...|. +....-........+..+++.+| T Consensus 81 -~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~~~~~-------~~~~~~~l~-~~~~~n~~~~~~~~~~~~~~~~G 151 (501) T protein:vir:96 81 -DNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNDD-------NSQNDDAIK-RIGRINDLDSLNRTLIRDLSQTG 151 (501) T ss_pred -ccccccceeecchHHHHHHHHhhhhcccCeeEeeCCccc-------hhHHHHHHH-HHHHhcCHHHHHHHHHHHHhhcC Confidence 000001112355667777877777777777663222111 112222222 12222245567788999999999 Q ss_pred CeEEEEeeCCCCceeEEEEeccceEEEEEcCCc---e----EEEEEec--Cc---eEEecHhHeeEecCc---------- Q lcl|NC_019710. 129 NAYALVDRNSAGDVISLLPLQSANMDVKLVGKK---V----VYRYQRD--SE---YADFSQKEIFHLKGF---------- 186 (424) Q Consensus 129 ~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~---~----~~~~~~~--~~---~~~~~~~evih~r~~---------- 186 (424) .||+++.++.+|.+ .+..++|..+.+..++.. . .|++... +. ...+.++.|.++... 
T Consensus 152 ~a~~~v~~dedg~~-~i~~~~p~~~~~v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~ 230 (501) T protein:vir:96 152 RAYEVIYRSEYDET-RIKRLSPLETFVIYDNSLEDNSIAAVRYYNRGTLQSAKDVVEIYTDEHIYTLDASDDFNEISVTT 230 (501) T ss_pred eEEEEEEEcCCCce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeecCCCcEEEEEEEcCCcEEEEeeCCCceeccccc Confidence 99999999988876 577789999887776531 1 1111101 00 112333333333210 Q ss_pred ------C----CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCccc Q lcl|NC_019710. 187 ------G----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVK 256 (424) Q Consensus 187 ------~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~a 256 (424) + .+...|.|.+..+...++....+..-..+.+...+.|-.+++-......++....++... ....... T Consensus 231 ~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~~~~~~~--~~~~~~~ 308 (501) T protein:vir:96 231 HAFGTVPITEYLNNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTR--LMQLKPP 308 (501) T ss_pred cCCCccceEEecCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccchhhhhhcC--eeeeccc Confidence 0 122358888888888888777777666777777777777765432222222222111110 1111111 Q ss_pred CcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHHH Q lcl|NC_019710. 257 KRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYT 326 (424) Q Consensus 257 g~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~t 326 (424) +.......+.++.-+.....+..+....+...+.|+..-++|..-.+... ++.+....+ ......+... 
T Consensus 309 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~ 387 (501) T protein:vir:96 309 KSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFS-GNTSGEALKYKLFGLDQDRVDTQSQFTKG 387 (501) T ss_pred ccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11122333444444444334445666778888899999999865544332 222211111 1111233333 Q ss_pred HHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--------- Q lcl|NC_019710. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG--------- 397 (424) Q Consensus 327 l~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g--------- 397 (424) +.-+++.+...+...--.. ......+++.+...+..|..+.++.+.++. |+++..-+.+++++-..|. T Consensus 388 l~~~~~li~~~~~~~~~~~-~~d~~~i~i~f~~~~p~n~~e~ad~~~kl~--g~iS~et~~~~l~~v~D~~~E~~ri~~E 464 (501) T protein:vir:96 388 LKRRYRLAARIGSLVNEFK-DFDESLLKITFTPNLPKSLNEQVSILTGLG--GQVSQETALSLSGLVESPNEELDKINKE 464 (501) T ss_pred HHHHHHHHHHHHHhccccc-ccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHHHHHHH Confidence 3333333333332211000 111122445556677788889999998885 7888877777776532110 Q ss_pred c---Ceeeecccccchhhcc--ccCCCccCCC Q lcl|NC_019710. 398 G---DVAMRQSQYVPITDLG--TNKEPRNNGA 424 (424) Q Consensus 398 g---d~~~~~~n~~~~~~~~--~~~~~~~~g~ 424 (424) - +.-..+..+.+..... .+++++.+.. T Consensus 465 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~d~~ 496 (501) T protein:vir:96 465 MSEIDFKGYSNDFNEHVGKYTDEVKETHTDDF 496 (501) T ss_pred HHHhhccccccchhhcccccCCcCCCCCCCcc Confidence 0 0001111111111111 1111111111 No 162 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=98.27 E-value=6e-08 Score=60.19 Aligned_cols=182 Identities=9% Similarity=0.046 Sum_probs=97.6 Q ss_pred eEEcCC--CCCCHHHHHHHHHHHHHHhCCc-ccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcC Q lcl|NC_019710. 
227 ILSTGE--KVLTEQQRSQVEENFKEIAGGP-VKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVG 303 (424) Q Consensus 227 vl~~~~--~~~~~~~~~~~~~~~~~~~~~~-~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~ 303 (424) |++.+. ...... ...+++.++...... +-+.+.+...+-+|+.++.+... +.+........||++-|||...|- T Consensus 1 V~k~~~l~~~~~~~-~~~~~~r~~~~~~~~~~~~~~~ld~~~e~~e~~~~~lsG--l~d~l~~~~~~iaa~s~iP~t~Lf 77 (201) T protein:vir:10 1 MWKAKGLADLCDDS-DGAARLRLAQVDNNSGVGQAIGIDADSEEYNVLNSDIGG--IDTFLSQKFDRIVALSGIHEIILK 77 (201) T ss_pred CccchHHHHHhcCC-hHHHHHHHHHHHHhhhhhhhheeecCCcceeeeecCcCC--hHHHHHHHHHHHHhHhcCchhhhc Confidence 444321 111111 112333333222111 12335555555788888877654 456778889999999999999998 Q ss_pred CCCCCCcccccHHHHHHHHHH-------HHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHH------- Q lcl|NC_019710. 304 DVEKSTSWGSGIEQQNLGFLQ-------YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRA------- 369 (424) Q Consensus 304 ~~~~~~~~~~n~e~~~~~f~~-------~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~------- 369 (424) +...+..+. +.+...+.||. .-+.|.++.+-+-+ ..+ ..+.|.+++|...+.++++ T Consensus 78 G~sp~Glna-tge~d~~nyyd~i~~~Qe~~l~p~le~l~~~~----~~~-----~~~~~~f~pL~~~s~kekAei~~~~a 147 (201) T protein:vir:10 78 GKNVGGVSA-SQNTALETFYGYVDRKRKAELLPLLEFLLPFI----VTE-----QEWSVEFNPLSQVSDKDKSEILEKNV 147 (201) T ss_pred CCCCccccc-cchhHHHHHHHHHHHHHHHHHHHHHHHHHHhh----cCC-----CCceEeeCCCCCCCHHHHHHHHHHHH Confidence 766655531 22333444443 34566666544321 111 2345666788887776655 Q ss_pred HHHHHHHhCCCcCHHHHHHHhCCCCCCCc--CeeeecccccchhhccccCCCccC Q lcl|NC_019710. 370 AFMKAMGESGLRTINEMRRTDNLPPLPGG--DVAMRQSQYVPITDLGTNKEPRNN 422 (424) Q Consensus 370 ~~~~~~~~~g~~t~NE~R~~lg~~p~~gg--d~~~~~~n~~~~~~~~~~~~~~~~ 422 (424) +.+++++++|+++++|+|+.|--.+..++ +... 
...............|.|+ T Consensus 148 ~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~-~~~~~~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 148 NSVAALIAAGIIDADEARDTLRAISTEVKIGEGSI-QTEVVINESEDPLDVSANN 201 (201) T ss_pred HHHHHHHHcCCCCHHHHHHHHHhcCCcCCCCCCCC-CccccccccCCCCCCCCCC Confidence 56678899999999999998755443221 1110 1111111111122334444 No 163 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=98.19 E-value=2.9e-06 Score=50.92 Aligned_cols=386 Identities=11% Similarity=0.046 Sum_probs=178.7 Q ss_pred ccHHHHHHhhccCcccc---ccccccc------------ccccccccccCC-----------ccccHHHHhhhHHHHHHH Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLV---TPNQGSQ------------TGPVSAHGYLGD-----------SSINDERILQISTVWRCV 67 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~---~~~~~~~------------~~~~~~~~~~~~-----------~~~~~~~~~~~~~v~~~i 67 (424) +|||+++++++++.... .+-.... .....+..|..+ .....+...+...-..++ T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPIARTAS 80 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccceecchHHHHH Confidence 88888888887733211 0000000 000000001100 000111122223333344 Q ss_pred HHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEE Q lcl|NC_019710. 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 68 ~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~ 147 (424) +..|+-+..=+..+-- ++. .....+..+|. -| .....+...+...+..|.+++.+..+. |. ..+-. 
T Consensus 81 ~~~A~lv~~e~~~i~v----~d~---~~~~~l~~~l~--~n---~f~~~~~~~~e~a~a~G~~a~k~~~d~-~~-~~i~~ 146 (522) T protein:vir:47 81 KKIASLVYNEQATITT----KNE---ILQKFLDDMLT--ND---RFNKNFERYLESCLALGGLAMRPYIDG-DK-VRVAF 146 (522) T ss_pred HHHhhhhcCCcceeec----CCh---HHHHHHHHHHh--hc---chHHHHHHHHHHhhccCCEEEEEEEcC-Cc-eEEEE Confidence 4444444332222210 110 11122333342 11 234455666777777887777766653 33 23444 Q ss_pred eccceEEEE-EcCC------------------ceEEE-----------------------EE------ecCc----eEEe Q lcl|NC_019710. 148 LQSANMDVK-LVGK------------------KVVYR-----------------------YQ------RDSE----YADF 175 (424) Q Consensus 148 l~p~~v~~~-~~~~------------------~~~~~-----------------------~~------~~~~----~~~~ 175 (424) +++.++.+. .+.+ ..+|. |. .... +..+ T Consensus 147 v~ad~~~P~~~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v 226 (522) T protein:vir:47 147 IQAPVFFPLESNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRV 226 (522) T ss_pred EcCCceEEEEEcCCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCccc Confidence 444444332 1111 11111 00 0000 0000 Q ss_pred c----------HhH----------eeEecCcCC-----CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeE-- Q lcl|NC_019710. 176 S----------QKE----------IFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQIL-- 228 (424) Q Consensus 176 ~----------~~e----------vih~r~~~~-----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl-- 228 (424) + +++ ..||+.+-. +.+.|+|....+...++.....-.....-|+.|... .+| T Consensus 227 ~l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~-i~v~~ 305 (522) T protein:vir:47 227 NLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRR-VIVPE 305 (522) T ss_pred cccccccccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccce-eecch Confidence 0 011 235554322 246799999999988887777766666667766542 222 Q ss_pred ---EcCCCCCCHH--HHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcC Q lcl|NC_019710. 
229 ---STGEKVLTEQ--QRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVG 303 (424) Q Consensus 229 ---~~~~~~~~~~--~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~ 303 (424) +........+ ....+. ..+..+.+-+. -.+++-.++.++....+-++.+..+...+.|+...|+++..++ T Consensus 306 ~~l~~~~~~~~g~~~~~~~fd-~~~~~f~~~~~----~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~ 380 (522) T protein:vir:47 306 HLTQRQYQRPDGTIDFRPRFD-VEQNVYMQIGG----SSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFT 380 (522) T ss_pred HHhccCCCCCCcccccccccC-cccceEeecCC----CCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccC Confidence 2211111110 000000 00111111000 0123335677777777888999999999999999999999998 Q ss_pred CCCCCCcccccHHHH-------------HHHHHHHHHHHHHHHHHHHHhh-hccChhhhccceeeecchhhhccCHHHHH Q lcl|NC_019710. 304 DVEKSTSWGSGIEQQ-------------NLGFLQYTLQPYISRWENSIQR-WLIPAKDVGRIHAEHNLDGLLRGDSASRA 369 (424) Q Consensus 304 ~~~~~~~~~~n~e~~-------------~~~f~~~tl~P~~~~ie~~l~~-~L~~~~~~~~~~~~f~~~~~~~~d~~~~~ 369 (424) ...++.. ++.+. ....+..+|.-++..|.+..+. .++...-...+.+.++++.-+..|.++.+ T Consensus 381 ~~~~~~k---TAtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~ 457 (522) T protein:vir:47 381 FDGQGMK---TATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAEL 457 (522) T ss_pred ccccccc---cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHH Confidence 7655432 33333 3344555555555555433331 12222212335677888888889999999 Q ss_pred HHHHHHHhCCCcCHHHHHHH-hCCCCCC---C-----cCee-eec--ccccchhhccccCCCccCC Q lcl|NC_019710. 370 AFMKAMGESGLRTINEMRRT-DNLPPLP---G-----GDVA-MRQ--SQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 370 ~~~~~~~~~g~~t~NE~R~~-lg~~p~~---g-----gd~~-~~~--~n~~~~~~~~~~~~~~~~g 423 (424) +...+++.+|+|++-+++.+ .|+..-+ . .+.. ..| ....+.+. 
..++.+.+.| T Consensus 458 ~~~~~~v~aG~~s~e~~i~~~~g~~eeea~~el~ri~~E~~~~~~~~~~~~~~~~-~~~~~~d~~~ 522 (522) T protein:vir:47 458 DYWAKMVAAGFSTKKRAIGKTLNISGVEAEKELNAINSELLPMNDAELAIYGMHD-QNEEKADDKG 522 (522) T ss_pred HHHHHHHhcCCCCHHHHHHhcCCCChHHHHHHHHHHHHhhccCCCCCCCCCCCCC-cccccCCCCC Confidence 99999999999999998765 3664310 0 0000 000 01111111 0111122222 No 164 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=98.18 E-value=3.1e-06 Score=50.78 Aligned_cols=395 Identities=10% Similarity=0.053 Sum_probs=179.3 Q ss_pred CC---------CCCcccccCCC-----------ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhh Q lcl|NC_019710. 1 ME---------EPKYTIDLRTN-----------NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQI 60 (424) Q Consensus 1 ~~---------~~~~~~~~~~~-----------~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 60 (424) -+ +.-.++..+.= +--+.++..+..+............ .... +..-+.+ T Consensus 24 ~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~-----~~~~------~~~ki~~ 92 (511) T protein:vir:96 24 EANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRK-----EEYM------ADNRVAH 92 (511) T ss_pred hhCCccccchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCc-----cccc------Ccceeec Confidence 11 11111111000 0112222222222111100000000 0000 0001123 Q ss_pred HHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019710. 61 STVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G 140 (424) ......++..+.-+-+-|+.+--. +. + ....+..++.. 
| ........+..+++.+|.||.++-++.+| T Consensus 93 n~~k~Iv~~~~~yl~g~p~~~~~~--~~---~--~~~~l~~~~~~--n---~~~~~~~~~~~~~~i~G~a~~~vy~ded~ 160 (511) T protein:vir:96 93 DYASYISDFINGYFLGNPIQYQDD--DK---D--VLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (511) T ss_pred chHHHHHHHHHhhhccCCceeecC--ch---H--HHHHHHHHHhh--c---CHHHHHHHHHHHHHhcCeeEEEEEeCCCC Confidence 455666777777676777765211 11 1 11233333332 2 34566677889999999999999898888 Q ss_pred ceeEEEEeccceEEEEEcCCc---eE---EEEEe----cC--c----eEEecHhHeeEecCcC----------------- Q lcl|NC_019710. 141 DVISLLPLQSANMDVKLVGKK---VV---YRYQR----DS--E----YADFSQKEIFHLKGFG----------------- 187 (424) Q Consensus 141 ~~~~l~~l~p~~v~~~~~~~~---~~---~~~~~----~~--~----~~~~~~~evih~r~~~----------------- 187 (424) .+ .+..++|..+.+..+... .. ++|.. +. . ...+.++.+.+++... T Consensus 161 ~~-~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 239 (511) T protein:vir:96 161 ET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHS 239 (511) T ss_pred ce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccc Confidence 75 577889999987766432 11 11110 00 0 1124455555442110 Q ss_pred ---------CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCc-ccC Q lcl|NC_019710. 188 ---------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGP-VKK 257 (424) Q Consensus 188 ---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~-~ag 257 (424) .+...|.|.+..+...++....+..-..+.+...+.|-.+++-.......+..+..+...-...... 
-.+ T Consensus 240 ~~~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (511) T protein:vir:96 240 FERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADS 319 (511) T ss_pred CCceeeEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchhhcccccccceeccccccccc Confidence 0123578888888888887777666666667777777766654322222221111111100000000 011 Q ss_pred cceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHHHH Q lcl|NC_019710. 258 RLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTL 327 (424) Q Consensus 258 ~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~tl 327 (424) .....+++.++.-+........+....+...+.|...-++|..-.+... ++.|..... ......+...+ T Consensus 320 ~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l 398 (511) T protein:vir:96 320 EGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGL 398 (511) T ss_pred ccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1122334555555554444555677788888999999999865443222 222211111 11112333344 Q ss_pred HHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC---------- Q lcl|NC_019710. 328 QPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG---------- 397 (424) Q Consensus 328 ~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g---------- 397 (424) .-.++.|...+..+--.........+++.+..-+..|..+.++.+.++ .|+++.-.+.+++++-..+. 
T Consensus 399 ~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~G~iS~et~l~~l~~v~D~~~E~~ri~~E~ 476 (511) T protein:vir:96 399 RRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDE 476 (511) T ss_pred HHHHHHHHHHHHhhcCcccccccccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHH Confidence 444444443333211111111112345555666778888899988887 58999988888887643211 Q ss_pred cCeeee---cccccchhhccccCCCccCCC Q lcl|NC_019710. 398 GDVAMR---QSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 398 gd~~~~---~~n~~~~~~~~~~~~~~~~g~ 424 (424) .+..-. .....+ ...+.+++.++.. T Consensus 477 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~~ 504 (511) T protein:vir:96 477 KESIKKAQKGIYKDP--RDINDDEQDDDTK 504 (511) T ss_pred HHHHHHHhhccccCC--CCCCCCCCCCccc Confidence 100000 000000 0000011111111 No 165 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=98.16 E-value=3.4e-06 Score=50.54 Aligned_cols=397 Identities=10% Similarity=0.042 Sum_probs=179.4 Q ss_pred CCCCCcccccCCC-----------ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTN-----------NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSL 69 (424) Q Consensus 1 ~~~~~~~~~~~~~-----------~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ 69 (424) ..+.-.++..+.= +--+.++..+..+......... ..... .-...-+.+......++. T Consensus 33 ~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~----------~~~~~-~~~~~ki~~n~~k~Iv~~ 101 (511) T protein:vir:10 33 GTESDLLQNVNEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELT----------RRKEE-YMADNRVAHDYASYISDF 101 (511) T ss_pred hhhhhcccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccC----------ccccc-ccCcceeecchHHHHHHH Confidence 1111111111100 0112222233322211100000 00000 000001123455566677 Q ss_pred HHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEec Q lcl|NC_019710. 
70 ISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQ 149 (424) Q Consensus 70 ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~ 149 (424) .+.-+-+-|+++--. +. + ....+..++. . | ........+..+++.+|.||.++.++.+|.+ .+-.++ T Consensus 102 ~~~yl~g~p~~~~~~--d~---~--~~~~l~~~~~-~-n---~~~~~~~~~~~~~~i~G~ay~~vy~dedg~~-~i~~~~ 168 (511) T protein:vir:10 102 INGYFLGNPIQYQDD--DK---D--VLEAIEAFND-L-N---DVESHNRSLGLDLSIYGKAYEIMIRNQDDET-RLYKSD 168 (511) T ss_pred HhhhhcccCceeecC--ch---H--HHHHHHHHHh-h-c---CHHHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEc Confidence 776666777765211 11 1 1123343333 2 2 2445667788899999999999999888875 577788 Q ss_pred cceEEEEEcCCc---eEE---EEEe----cC--c----eEEecHhHeeEecCcC-------------------------- Q lcl|NC_019710. 150 SANMDVKLVGKK---VVY---RYQR----DS--E----YADFSQKEIFHLKGFG-------------------------- 187 (424) Q Consensus 150 p~~v~~~~~~~~---~~~---~~~~----~~--~----~~~~~~~evih~r~~~-------------------------- 187 (424) |..+.+..++.. ..+ +|.. +. . ...+.++.|.++.... T Consensus 169 p~~~~~vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f 248 (511) T protein:vir:10 169 AMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEF 248 (511) T ss_pred cceeEEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcceeEEEe Confidence 988887776432 111 1110 00 0 1124455555442110 Q ss_pred CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCc-ccCcceecCCCc Q lcl|NC_019710. 188 FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGP-VKKRLWILEAGF 266 (424) Q Consensus 188 ~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~-~ag~~~~l~~g~ 266 (424) .+...|.|.+..+...++....+..-..+.+...+.|-.+++-......++..+..+...-...... -.+.....+++. 
T Consensus 249 ~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 328 (511) T protein:vir:10 249 SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSV 328 (511) T ss_pred cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchhhccchhccceecccccccccccccCCCCc Confidence 0123578888888888887776666666666777777666654322222221111111000000000 011122234455 Q ss_pred eeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 267 STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTLQPYISRWEN 336 (424) Q Consensus 267 ~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~tl~P~~~~ie~ 336 (424) +++-+.....+..+....+...+.|+..-++|..-.+... ++.|..... ......+...+.-.++.|.. T Consensus 329 d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~ 407 (511) T protein:vir:10 329 DGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 407 (511) T ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6665655444555677788888999999999864332221 222211111 11223333333333433433 Q ss_pred HHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecc---cccc--hh Q lcl|NC_019710. 337 SIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQS---QYVP--IT 411 (424) Q Consensus 337 ~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~---n~~~--~~ 411 (424) .+....-.........+++.+..-+..|..+.++.+.++. |+++.--+.+++++-+.|.-+.-.+-. .... .. 
T Consensus 408 ~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~--G~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~ 485 (511) T protein:vir:10 408 ILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQK 485 (511) T ss_pred HHHhhCCcccccccceeeEEeCCCCCcCHHHHHHHHHHHh--ccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhh Confidence 3332211111111234566667778888999999999885 789987788887653221100000000 0000 00 Q ss_pred hccccCCCccCCC Q lcl|NC_019710. 412 DLGTNKEPRNNGA 424 (424) Q Consensus 412 ~~~~~~~~~~~g~ 424 (424) ......++.+++. T Consensus 486 ~~~~~~~~~~~~~ 498 (511) T protein:vir:10 486 GIYKDPRDINDDE 498 (511) T ss_pred hcccCCCCCCCCC Confidence 0001111111111 No 166 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=98.14 E-value=3.7e-06 Score=50.35 Aligned_cols=384 Identities=11% Similarity=0.037 Sum_probs=177.5 Q ss_pred ccHHHHHHhhccCcccc----cc-----cccccccc-----c-ccccccCCc-----------cccHHHHhhhHHHHHHH Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLV----TP-----NQGSQTGP-----V-SAHGYLGDS-----------SINDERILQISTVWRCV 67 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~----~~-----~~~~~~~~-----~-~~~~~~~~~-----------~~~~~~~~~~~~v~~~i 67 (424) +|||+++++++++.... +. .......+ + .+..+..+. ....+...+...-..++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 89999999988652210 00 00000000 0 000000000 00011112223333444 Q ss_pred HHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEE Q lcl|NC_019710. 
68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 68 ~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~ 147 (424) +..|+-+..=+..+-- + ++ ....-+.++|. -| .........+.+.+..|.+++.+..+. |.+ .+.. T Consensus 81 ~~~A~lv~~e~~~i~~-~--d~----~~~~~l~~il~--~n---~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~-~I~~ 146 (500) T protein:vir:30 81 KKIASLVFNEQAEIKV-D--DD----AANEFISETLK--ND---RFNKNFERYLESCLALGGLAMRPYVDG-DKV-RVAF 146 (500) T ss_pred HHHhhhhcCCcceEec-C--Ch----HHHHHHHHHHh--hc---cHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEE Confidence 4455444433322211 1 10 01112333332 11 234455666777788888888776664 333 4566 Q ss_pred eccceEEEE-EcCC------------------ceEEE----EE-ecCceE-----Eec--------------------Hh Q lcl|NC_019710. 148 LQSANMDVK-LVGK------------------KVVYR----YQ-RDSEYA-----DFS--------------------QK 178 (424) Q Consensus 148 l~p~~v~~~-~~~~------------------~~~~~----~~-~~~~~~-----~~~--------------------~~ 178 (424) +++.++.+. .+.+ ..+|. +. .++... -|. ++ T Consensus 147 v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~ 226 (500) T protein:vir:30 147 VQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKD 226 (500) T ss_pred EcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCc Confidence 677766542 2211 11110 00 001000 010 01 Q ss_pred H----------eeEecCcCC-----CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeE-----EcCCCCCCHH Q lcl|NC_019710. 179 E----------IFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQIL-----STGEKVLTEQ 238 (424) Q Consensus 179 e----------vih~r~~~~-----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl-----~~~~~~~~~~ 238 (424) + ..||+.+.. +.+.|+|.+..+...++.....-.....-++.|.. 
..++ .....-.+.+ T Consensus 227 ~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~-~i~v~~~~l~~~~~~~~g~ 305 (500) T protein:vir:30 227 EAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQR-RVAVPESLTALTVRTTDGD 305 (500) T ss_pred ceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcc-eeeechHHhcccCCCCCcc Confidence 1 235553322 24579999999999998877777766677776544 3333 1111100000 Q ss_pred HHHHHHHHHHHHhCCcccCcce--ecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH Q lcl|NC_019710. 239 QRSQVEENFKEIAGGPVKKRLW--ILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~ag~~~--~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e 316 (424) .. ....+.. .++.-..+ -.+++-.++.++....+-++.+..+...++|+...|+++..++....+.. ++. T Consensus 306 ~~--~~~~~d~---~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~---TAt 377 (500) T protein:vir:30 306 VV--PRPRFES---DQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMK---TAT 377 (500) T ss_pred cc--CCcccCC---CcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccc---cHH Confidence 00 0000000 00000000 01223356777766777888999999999999999999999987655432 222 Q ss_pred H-------------HHHHHHHHHHHHHHHHHHHHHhh-hccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcC Q lcl|NC_019710. 317 Q-------------QNLGFLQYTLQPYISRWENSIQR-WLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRT 382 (424) Q Consensus 317 ~-------------~~~~f~~~tl~P~~~~ie~~l~~-~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t 382 (424) + .....++.+|.-++..|.+.... 
.++...-...+.+.++++.-+..|.++.++...+++.+|+|+ T Consensus 378 ei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s 457 (500) T protein:vir:30 378 EIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGT 457 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCC Confidence 2 12233344444444444332221 122211112345667777778889999999999999999999 Q ss_pred HHHHHHH-hCCCCCCCcCeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 383 INEMRRT-DNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 383 ~NE~R~~-lg~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) .-+++.+ .|++.- ..++.+....- +.........+.++ T Consensus 458 ~~~~i~~~~g~~ee-ea~~~l~~i~~---E~~~~~~~~~~~~~ 496 (500) T protein:vir:30 458 REMAIQKVLNVTEE-KAQEIAAEINT---GIVDEINQQRTDTH 496 (500) T ss_pred HHHHHHhcCCCCHH-HHHHHHHHHHH---hccccCCCCCcccc Confidence 9998754 365421 11111100000 00000001111111 No 167 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=98.14 E-value=3.7e-06 Score=50.35 Aligned_cols=384 Identities=11% Similarity=0.037 Sum_probs=177.5 Q ss_pred ccHHHHHHhhccCcccc----cc-----cccccccc-----c-ccccccCCc-----------cccHHHHhhhHHHHHHH Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLV----TP-----NQGSQTGP-----V-SAHGYLGDS-----------SINDERILQISTVWRCV 67 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~----~~-----~~~~~~~~-----~-~~~~~~~~~-----------~~~~~~~~~~~~v~~~i 67 (424) +|||+++++++++.... +. .......+ + .+..+..+. 
....+...+...-..++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 89999999988652210 00 00000000 0 000000000 00011112223333444 Q ss_pred HHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEE Q lcl|NC_019710. 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 68 ~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~ 147 (424) +..|+-+..=+..+-- + ++ ....-+.++|. -| .........+.+.+..|.+++.+..+. |.+ .+.. T Consensus 81 ~~~A~lv~~e~~~i~~-~--d~----~~~~~l~~il~--~n---~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~-~I~~ 146 (500) T protein:vir:98 81 KKIASLVFNEQAEIKV-D--DD----AANEFISETLK--ND---RFNKNFERYLESCLALGGLAMRPYVDG-DKV-RVAF 146 (500) T ss_pred HHHhhhhcCCcceEec-C--Ch----HHHHHHHHHHh--hc---cHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEE Confidence 4455444433322211 1 10 01112333332 11 234455666777788888888776664 333 4566 Q ss_pred eccceEEEE-EcCC------------------ceEEE----EE-ecCceE-----Eec--------------------Hh Q lcl|NC_019710. 148 LQSANMDVK-LVGK------------------KVVYR----YQ-RDSEYA-----DFS--------------------QK 178 (424) Q Consensus 148 l~p~~v~~~-~~~~------------------~~~~~----~~-~~~~~~-----~~~--------------------~~ 178 (424) +++.++.+. .+.+ ..+|. +. .++... -|. ++ T Consensus 147 v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~ 226 (500) T protein:vir:98 147 VQAPVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKD 226 (500) T ss_pred EcCCeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCc Confidence 677766542 2211 11110 00 001000 010 01 Q ss_pred H----------eeEecCcCC-----CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeE-----EcCCCCCCHH Q lcl|NC_019710. 
179 E----------IFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQIL-----STGEKVLTEQ 238 (424) Q Consensus 179 e----------vih~r~~~~-----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl-----~~~~~~~~~~ 238 (424) + ..||+.+.. +.+.|+|.+..+...++.....-.....-++.|.. ..++ .....-.+.+ T Consensus 227 ~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~-~i~v~~~~l~~~~~~~~g~ 305 (500) T protein:vir:98 227 EAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQR-RVAVPESLTALTVRTTDGD 305 (500) T ss_pred ceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcc-eeeechHHhcccCCCCCcc Confidence 1 235553322 24579999999999998877777766677776544 3333 1111100000 Q ss_pred HHHHHHHHHHHHhCCcccCcce--ecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH Q lcl|NC_019710. 239 QRSQVEENFKEIAGGPVKKRLW--ILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~ag~~~--~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e 316 (424) .. ....+.. .++.-..+ -.+++-.++.++....+-++.+..+...++|+...|+++..++....+.. ++. T Consensus 306 ~~--~~~~~d~---~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~---TAt 377 (500) T protein:vir:98 306 VV--PRPRFES---DQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMK---TAT 377 (500) T ss_pred cc--CCcccCC---CcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccc---cHH Confidence 00 0000000 00000000 01223356777766777888999999999999999999999987655432 222 Q ss_pred H-------------HHHHHHHHHHHHHHHHHHHHHhh-hccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcC Q lcl|NC_019710. 317 Q-------------QNLGFLQYTLQPYISRWENSIQR-WLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRT 382 (424) Q Consensus 317 ~-------------~~~~f~~~tl~P~~~~ie~~l~~-~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t 382 (424) + .....++.+|.-++..|.+.... 
.++...-...+.+.++++.-+..|.++.++...+++.+|+|+ T Consensus 378 ei~s~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s 457 (500) T protein:vir:98 378 EIVSENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGT 457 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCC Confidence 2 12233344444444444332221 122211112345667777778889999999999999999999 Q ss_pred HHHHHHH-hCCCCCCCcCeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 383 INEMRRT-DNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 383 ~NE~R~~-lg~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) .-+++.+ .|++.- ..++.+....- +.........+.++ T Consensus 458 ~~~~i~~~~g~~ee-ea~~~l~~i~~---E~~~~~~~~~~~~~ 496 (500) T protein:vir:98 458 REMAIQKVLNVTEE-KAQEIAAEINT---GIVDEINQQRTDTH 496 (500) T ss_pred HHHHHHhcCCCCHH-HHHHHHHHHHH---hccccCCCCCcccc Confidence 9998754 365421 11111100000 00000001111111 No 168 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=98.12 E-value=4.1e-06 Score=50.11 Aligned_cols=360 Identities=9% Similarity=0.068 Sum_probs=156.5 Q ss_pred cccCCCccHHHHHHhhccCcccccccccccccccccccccC--CccccHHH-Hh-h--hHHHHHHHHHHHHhhhhCceeE Q lcl|NC_019710. 8 IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLG--DSSINDER-IL-Q--ISTVWRCVSLISTLTACLPLDV 81 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~-~~-~--~~~v~~~i~~ia~~ia~~~~~~ 81 (424) |+.+.-..+++.+...... ... ....+.+..... +..+.++. 
.+ + ..+..-+|+.++..+ -|.= T Consensus 1 m~~~~i~~L~~~~~~~~~r--~~~-----~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl---~~~G 70 (422) T protein:vir:97 1 MNYMGMGYLRRKLALFKTG--VDK-----RYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRI---IFRE 70 (422) T ss_pred CChHHHHHHHHHHHHHHHH--HHH-----HHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhcc---ccce Confidence 4444333333333322110 000 000010000000 01111111 01 1 123334444444422 2222 Q ss_pred eeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC-CCceeEEEEeccceEEEEEcCC Q lcl|NC_019710. 82 FETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS-AGDVISLLPLQSANMDVKLVGK 160 (424) Q Consensus 82 ~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~-~G~~~~l~~l~p~~v~~~~~~~ 160 (424) ++. .+ ..+.+.+. + |. .......+..+.+.+|.||+.+..+. +|.| .+.+++|..+....|.. T Consensus 71 f~~-~d---------~~l~~~w~-~-N~---ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p-~i~~~sp~~~~~i~D~~ 134 (422) T protein:vir:97 71 FTN-DD---------FNAWEIFK-A-NN---PDIFFDTAIQSALIASCCFVYIMPGAEDGLP-KMQVIEASKATGILDPT 134 (422) T ss_pred eeC-Cc---------hhHHHHHH-h-cC---hHHHHHHHHHHHHHhcceeEEEeeCCCCCee-EEEEechhhEEEEEeCC Confidence 221 11 12444443 2 32 23455577889999999999998875 5665 57888999998777653 Q ss_pred ceE----E-EEEe--cCce--EE-ecHh---------------------HeeEecCc-CCCCccccchH----HHHHHHH Q lcl|NC_019710. 161 KVV----Y-RYQR--DSEY--AD-FSQK---------------------EIFHLKGF-GFTGLVGLSPI----AFACKSA 204 (424) Q Consensus 161 ~~~----~-~~~~--~~~~--~~-~~~~---------------------evih~r~~-~~~~~~G~s~~----~~~~~~i 204 (424) ... + .+.. ++.. .. ++.. =|++|.+. 
....++|.|.+ ..+.+.+ T Consensus 135 ~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~ 214 (422) T protein:vir:97 135 TFLLTEGYAILESDSNGNPTLEAYFTDKDIWYYPKKGKPYNIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAA 214 (422) T ss_pred CCcceeeEEEEEecCCCcEEEEEEEcCceEEEEcCCCccccccCCCCCcceEEecccCCCccccCccccchhHHHHHHHH Confidence 210 0 1110 1110 00 1111 13455433 23456787754 3334444 Q ss_pred HHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCC-----CceeeeccCChhHHH Q lcl|NC_019710. 205 GVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA-----GFSTSAIGVTPQDAE 279 (424) Q Consensus 205 ~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~-----g~~~~~l~~s~~d~~ 279 (424) .-...-......++.. |.-++.-- + .+....+. |+.. .++++.++. +.++.++..+.-+ . T Consensus 215 ~r~~~~~~~~~e~~a~---pqr~i~G~-d-~d~~~~~~----~~~~-----~~~i~~~~~de~~~~~~v~q~~~~~l~-~ 279 (422) T protein:vir:97 215 KRTLERAEVTAEFYSF---PQKYVLGM-D-PDAKPMEK----WRAT-----VSTLLEISKDEDGDKPTVGQFTTASMA-P 279 (422) T ss_pred HHHHHHHHHHHHHhcc---hhhhhccc-C-cccccCch----hhhh-----hhhhhccCCCCCCCcceeeecCCCChh-H Confidence 4333333334444333 44344211 1 01111111 2211 234555532 2456555543222 4 Q ss_pred HHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhh------hccChhh--h-cc Q lcl|NC_019710. 280 MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQR------WLIPAKD--V-GR 350 (424) Q Consensus 280 ~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~------~L~~~~~--~-~~ 350 (424) |++..+.....++..=++|+..+|....+.+|...+..+...+... +.-..+.+.+.+.+ .+..... . .. 
T Consensus 280 ~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~k-a~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~ 358 (422) T protein:vir:97 280 FMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAA-GRKAQRSFSSGFLNVAYIAVCLRDEFPYLRNQF 358 (422) T ss_pred HHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCCcccchhh Confidence 8999999999999999999999998654322222222222121111 11111122222211 0111111 0 11 Q ss_pred ceeeecchhhhccC---HHHHHHHHHHHHhC--CCcCHHHHHHHhCCCCCCCcCeeeecccccchhhccccCCCccCC Q lcl|NC_019710. 351 IHAEHNLDGLLRGD---SASRAAFMKAMGES--GLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 351 ~~~~f~~~~~~~~d---~~~~~~~~~~~~~~--g~~t~NE~R~~lg~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) ..+.+.+......+ ....++.+.+++++ |++...-+++++|+...+ .-... + ++.+-+| T Consensus 359 ~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~~~~~---~~~~~--------~---~~~~~d~ 422 (422) T protein:vir:97 359 MDTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLTGVKGAD---KPIPA--------I---TEVTTDG 422 (422) T ss_pred ccceEEEccCCCCChHHHHHHHHHHHHHHhhccccccHHHHHHHcCCCchh---HHHHH--------H---HhhhccC Confidence 22344444444455 34556677788888 788889999999996421 10000 0 0001111 No 169 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=98.11 E-value=4.3e-06 Score=49.99 Aligned_cols=397 Identities=10% Similarity=0.043 Sum_probs=178.2 Q ss_pred CC---------CCCcccccCCC-----------ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhh Q lcl|NC_019710. 1 ME---------EPKYTIDLRTN-----------NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQI 60 (424) Q Consensus 1 ~~---------~~~~~~~~~~~-----------~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 60 (424) -+ +.-.++..+.= +--+.++..+..+............ ..... . 
.-+.+ T Consensus 24 ~~n~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~-----~~~~~----~--~ki~~ 92 (511) T protein:vir:93 24 EANVVYTYDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRK-----EEYMA----D--NRVAH 92 (511) T ss_pred hhCCcccccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCc-----ccccC----c--ceeec Confidence 11 11111111000 0112222223222211100000000 00000 0 00123 Q ss_pred HHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019710. 61 STVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G 140 (424) +.....++..+.-+-+-|+++-- .+. +. ...+...+. . -........+..+++.+|.||.++.++.+| T Consensus 93 n~~k~Iv~~~~~yl~g~p~~~~~--~d~---~~--~~~l~~~~~-~----n~~~~~~~~~~~~~~~~G~ay~~vy~de~~ 160 (511) T protein:vir:93 93 DYASYISDFINGYFLGNPIQYQD--DDK---DV--LEVIEAFND-L----NDVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (511) T ss_pred chHHHHHHHHhhhhcccCeeecc--CCh---HH--HHHHHHHHh-h----cCHhHHHHHHHHHHHhcCeeEEEEEeCCCC Confidence 45566677777766677776521 111 11 122333332 1 234566778889999999999999998888 Q ss_pred ceeEEEEeccceEEEEEcCCc---eEE---EEEe----cC--c----eEEecHhHeeEecCcC----------------- Q lcl|NC_019710. 141 DVISLLPLQSANMDVKLVGKK---VVY---RYQR----DS--E----YADFSQKEIFHLKGFG----------------- 187 (424) Q Consensus 141 ~~~~l~~l~p~~v~~~~~~~~---~~~---~~~~----~~--~----~~~~~~~evih~r~~~----------------- 187 (424) .+ .+..++|..+.+..+... ..+ +|.. +. . ...+.++.|.+++... T Consensus 161 ~~-~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 239 (511) T protein:vir:93 161 ET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHS 239 (511) T ss_pred ce-EEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccC Confidence 76 477889999987766431 111 1110 00 0 1134555555542111 Q ss_pred ---------CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCc-ccC Q lcl|NC_019710. 
188 ---------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGP-VKK 257 (424) Q Consensus 188 ---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~-~ag 257 (424) .+...|.|.+..+...++....+..-..+.+...+.|-.+++-......++..+..+...-...... -.+ T Consensus 240 ~g~vPvv~~~nn~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (511) T protein:vir:93 240 FERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADS 319 (511) T ss_pred CCccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchhhcccccccceeccccccccc Confidence 0123578888888888887776666666667767777666653222222221111110000000000 001 Q ss_pred cceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHHHH Q lcl|NC_019710. 258 RLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTL 327 (424) Q Consensus 258 ~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~tl 327 (424) ...-.+++.++..+.....+..+....+...+.|+..-++|..-.+... ++.|..... ......+...+ T Consensus 320 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l 398 (511) T protein:vir:93 320 EGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGL 398 (511) T ss_pred ccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1122344555555554444455667778888999999999865443222 222211111 11122333344 Q ss_pred HHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeee---- Q lcl|NC_019710. 
328 QPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMR---- 403 (424) Q Consensus 328 ~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~---- 403 (424) .-.++.|...+...--.........+++.+..-+..|..+.++.+.++ .|+++.--+.+++++-+.|.-+.-.+ T Consensus 399 ~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E~~ri~~E~ 476 (511) T protein:vir:93 399 RRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQDPELEVKKIEEDE 476 (511) T ss_pred HHHHHHHHHHHHhccCcccccccccceEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHH Confidence 444444443333211111111112345555666778888899988888 48899888888876643211000000 Q ss_pred ----cccccch---hhccccC---CCccCCC Q lcl|NC_019710. 404 ----QSQYVPI---TDLGTNK---EPRNNGA 424 (424) Q Consensus 404 ----~~n~~~~---~~~~~~~---~~~~~g~ 424 (424) ....... ....+.+ +..++.+ T Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 507 (511) T protein:vir:93 477 KESIKKAQKGIYKDPRDINDDEQDDDTKDTV 507 (511) T ss_pred HHHHHHHhhhcccCCCCCCCCCCCCcccccc Confidence 0000000 0000011 1111111 No 170 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=98.10 E-value=4.6e-06 Score=49.84 Aligned_cols=389 Identities=8% Similarity=-0.006 Sum_probs=174.5 Q ss_pred cccccCCC-ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeec Q lcl|NC_019710. 6 YTIDLRTN-NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFET 84 (424) Q Consensus 6 ~~~~~~~~-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~ 84 (424) .-..+.+. +=.++++..+..+............ ..... 
..-+.++.....|+..+.-+-+-|+++-- T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g~~~~~~~~~~~~--------~~~~~---~~ki~~n~~~~ivd~~~~~l~g~~~~~~~- 68 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQGDNFSILSGHRRL--------DDEKA---DYRVRHKWGGYISSFATGYVIGNPVSIGV- 68 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhccCCcccccccccc--------cccCC---cceeecchHHHHHHhhhhheeccCceEee- Confidence 00000000 1123334334333211100000000 00000 00122345566667766666565665421 Q ss_pred cccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCc--- Q lcl|NC_019710. 85 DQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK--- 161 (424) Q Consensus 85 ~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~--- 161 (424) .+++. .+ ....+...+.. . .-......+..+.+.+|.||+++..+.+|.+ .+..++|..+.+..+... T Consensus 69 ~~~~~-~~--~~~~l~~~~~~-n----~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~ 139 (440) T protein:vir:95 69 MEGGS-AD--QLSTIKDIEWQ-N----DINALNSDLAFDASVYGRAYEYHFRDKDKVD-RVVLISPLEMFVIRDLTVEQN 139 (440) T ss_pred CCCcc-HH--HHHHHHHHHHh-c----CHhHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcccceEEEEcCCCCCc Confidence 11111 11 11223333321 1 3345566788899999999999988888876 477788999988776532 Q ss_pred eE---EEEEecCc--eEEecHhHeeEecCc--------------C----------CCCccccchHHHHHHHHHHHHHHHH Q lcl|NC_019710. 162 VV---YRYQRDSE--YADFSQKEIFHLKGF--------------G----------FTGLVGLSPIAFACKSAGVAVAMED 212 (424) Q Consensus 162 ~~---~~~~~~~~--~~~~~~~evih~r~~--------------~----------~~~~~G~s~~~~~~~~i~~~~~~~~ 212 (424) .. +++..... ...+.++.+++++.. + .+...|.|.+..+...++....+.. 
T Consensus 140 ~~~~i~~~~~~~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~lida~~~~~s 219 (440) T protein:vir:95 140 IIAAVHLPIYADKVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVEWWNNRFRMGDYESEISLIDAYDAGQS 219 (440) T ss_pred eEEEEEEEEecCceEEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEEEeeCCCCCCCchhhhHHHHHHHHHHHH Confidence 11 11111111 112344444333100 0 0123477878877777777776666 Q ss_pred HHHHHHhccCCCceeEEcCCC--CCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHH Q lcl|NC_019710. 213 QQRDFFANGAKSPQILSTGEK--VLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSE 290 (424) Q Consensus 213 ~~~~~~~ng~~p~~vl~~~~~--~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~ 290 (424) ...+..+..+.|-.+++-... ...++....+++....+. .........+.+.+++.+........+....+...+. T Consensus 220 ~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~ 297 (440) T protein:vir:95 220 DTANYMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLFL--KTGISTTGQQTTADASYIYKQYDVNGTEAYKNRLAND 297 (440) T ss_pred HHHHHHHHhhcceeeeecccccCCCCccchhhhhhccceec--ccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHH Confidence 666666777777777664211 112332222222111111 1111122223333444443333344466778888899 Q ss_pred HHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhh Q lcl|NC_019710. 291 LARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGL 360 (424) Q Consensus 291 Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~ 360 (424) |+..-++|..-.+... 
++.+....+ ......+...+.-+++.|...+...-- .......+++.+..- T Consensus 298 i~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~--~~~~~~~v~i~f~~~ 374 (440) T protein:vir:95 298 IHRFSRIPNLDDDRFN-STSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAING--PVIEANKLTFTFHPN 374 (440) T ss_pred HHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCC--cccccccceEEeCCC Confidence 9999999964443322 222211111 111223333344444433333322111 111123345555666 Q ss_pred hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCee-eecccccchhhccccC-CCccCCC Q lcl|NC_019710. 361 LRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVA-MRQSQYVPITDLGTNK-EPRNNGA 424 (424) Q Consensus 361 ~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~-~~~~n~~~~~~~~~~~-~~~~~g~ 424 (424) ...|..+.++.+.++ .|+++.--+.++++.-..+ ++. .+-..-.....-..+. +..+++. T Consensus 375 ~p~~~~~~ad~~~kl--~g~iS~et~~~~l~~~d~~--~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 436 (440) T protein:vir:95 375 IPQDVWTEIKAYIEA--GGEISQETLMENASFTDYK--TEHSRILKQGGSSDLEIGQIVGDADVGQ 436 (440) T ss_pred CCCCHHHHHHHHHHH--hccCcHHHHHHhCCCCCcH--HHHHHHHHHHHHhhhhHHhhccCCCCCC Confidence 778888999998888 5789887777777653211 010 0000000000000011 1111111 No 171 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=98.09 E-value=4.8e-06 Score=49.75 Aligned_cols=346 Identities=9% Similarity=0.043 Sum_probs=159.9 Q ss_pred cccCCCccHHHHHHhhccCcccccccccccccccccccccC--CccccHHH---H-hhhHHHHHHHHHHHHhhhhCceeE Q lcl|NC_019710. 8 IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLG--DSSINDER---I-LQISTVWRCVSLISTLTACLPLDV 81 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~---~-~~~~~v~~~i~~ia~~ia~~~~~~ 81 (424) |+.+ .++++...+....... ......+.+..... +..+.++. + +-..+..-+|+.++..+. 
+.- T Consensus 1 ~~~~----~i~~L~~~~~~~~~r~---~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~---~~G 70 (409) T protein:vir:94 1 MTEK----GIGYLRFKLSVHKRRA---EMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLV---FRE 70 (409) T ss_pred CCHH----HHHHHHHHHHHHhHHH---HHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhcc---cCc Confidence 3322 2333333332211110 00000110000000 00111110 0 111344445555554322 221 Q ss_pred eeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCc Q lcl|NC_019710. 82 FETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK 161 (424) Q Consensus 82 ~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~ 161 (424) ++.. +..+..++. + |. .......+..+.+.+|.||+.+..+.+|.| .+.+++|..+....|... T Consensus 71 f~~~----------d~~l~~i~~-~-N~---ld~~~~~~~~~aliyG~sf~~v~~~~dg~~-~i~~~sp~~~~~i~D~~~ 134 (409) T protein:vir:94 71 FEND----------DFTVNEIFE-E-NN---PDIFFDSAVLSSLIASCSFTYISKGENDAV-RLQVIEAVNATGIIDPIT 134 (409) T ss_pred ccCC----------chHHHHHHH-h-cC---hhHHHHHHHHHHHHhcceeEEEecCCCCce-EEEEeccceEEEEEecCC Confidence 2111 112444443 2 22 234556788899999999999999999976 677888988887766532 Q ss_pred e----EEEEEe-c--Cc---eEEecHhH----------------------eeEecCcC-CCCccccchH----HHHHHHH Q lcl|NC_019710. 162 V----VYRYQR-D--SE---YADFSQKE----------------------IFHLKGFG-FTGLVGLSPI----AFACKSA 204 (424) Q Consensus 162 ~----~~~~~~-~--~~---~~~~~~~e----------------------vih~r~~~-~~~~~G~s~~----~~~~~~i 204 (424) . .|.+.. + +. ...+.+++ |++|.+.. ....+|.|-+ ..+.+.+ T Consensus 135 ~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~ 214 (409) T protein:vir:94 135 GLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNA 214 (409) T ss_pred CceeeeEEEEEecCCCceEEEEEEecCcEEEEEecCceeEeeeCCCCCcceEEeccccccccccCccccchhHHHHHHHH Confidence 1 111111 0 00 01122222 34444322 3456777744 4444444 Q ss_pred HHHHHHHHHHHHHHhccCCCceeEE-cCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC-----CCceeeeccCChhHH Q lcl|NC_019710. 
205 GVAVAMEDQQRDFFANGAKSPQILS-TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-----AGFSTSAIGVTPQDA 278 (424) Q Consensus 205 ~~~~~~~~~~~~~~~ng~~p~~vl~-~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~-----~g~~~~~l~~s~~d~ 278 (424) .....-......++.+ |.-++. ...+ .+..+.++.... +++.++ .+.++.++....- - T Consensus 215 ~r~~~~~~~~~e~~a~---pqr~i~G~d~d---~~~~~~~~~~~~---------~i~~~~~d~dg~~~~v~q~~~~~l-~ 278 (409) T protein:vir:94 215 KRTLERADVTAEFYSF---PQKYVTGLSDD---AEPMETWKATVS---------SMLQFTKDEDGDKPTLGQFTQPSM-S 278 (409) T ss_pred HHHHHHHHHHHHHhcC---hhheeEecCCC---CcccchhhhhHH---------HhhcCCCCCCCCCceEEecCCCCh-h Confidence 4444444444555444 544443 2211 111222222222 344443 2345555543221 2 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhh------hccChhh---hc Q lcl|NC_019710. 279 EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQR------WLIPAKD---VG 349 (424) Q Consensus 279 ~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~------~L~~~~~---~~ 349 (424) .|++..+....++|..-++|+..+|....+.+|....+.+...+...+ .-..+.+.+.+.+ .+..... .. T Consensus 279 ~~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a-~~k~~~fg~~~~~~~rla~~i~~~~~~~~~~ 357 (409) T protein:vir:94 279 PFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAG-RKAQRSLGAGLLNVAYLAACLRDDAPYLREQ 357 (409) T ss_pred HHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCccccc Confidence 488999999999999999999999986543222222222221111111 1111111111111 1111111 01 Q ss_pred cceeeecchhhhccCH---HHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC Q lcl|NC_019710. 350 RIHAEHNLDGLLRGDS---ASRAAFMKAMGESG--LRTINEMRRTDNLPPLP 396 (424) Q Consensus 350 ~~~~~f~~~~~~~~d~---~~~~~~~~~~~~~g--~~t~NE~R~~lg~~p~~ 396 (424) ...+++.+.+....+. 
...++.+.+++++| ++..+-+++++|+..-+ T Consensus 358 ~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 358 FRKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred cccceEEeccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 1234444444444443 45677889999998 66678999999998765 No 172 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=98.09 E-value=4.9e-06 Score=49.68 Aligned_cols=397 Identities=11% Similarity=0.068 Sum_probs=162.1 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCccccccc-ccccccccccccccCCccccH---HHHhhhHHHHHHHHHHHHhhhh Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPN-QGSQTGPVSAHGYLGDSSIND---ERILQISTVWRCVSLISTLTAC 76 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~v~~~i~~ia~~ia~ 76 (424) |.+-. .++ ..-|++++...+......--. .....+--... .. +..+.. ..-..+.+...+|+.++..+-- T Consensus 1 ~~~~~-~~d---~~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~~i~-~~-~~~~~~~~~~~~~~~n~~~~ivd~~a~~l~~ 74 (488) T protein:vir:23 1 MAETE-SID---PEKLRDQLLDAFENKQNELKSSKAYYDAERRPD-AI-GLAVPLDMRKYLAHVGYPRTYVDAIAERQEL 74 (488) T ss_pred CCccc-CCC---HHHHHHHHHHHHHHHHHHHHHHHHHHhcccchh-hc-CcccchhhhhhhhhcchHHHHHHHHHHhhhc Confidence 32211 111 123555555444322110000 00000000000 00 001111 1112234445566666654433 Q ss_pred CceeEeeccc----cCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC--------CCceeE Q lcl|NC_019710. 77 LPLDVFETDQ----NDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS--------AGDVIS 144 (424) Q Consensus 77 ~~~~~~~~~~----~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~--------~G~~~~ 144 (424) -.|.+-.... .++... ....+.+++. + | ........+..+++.+|.||+++..+. .|.+ . 
T Consensus 75 ~Gf~~~~~~~~~~~~~~d~~--~~~~l~~i~~-~-N---~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~-~ 146 (488) T protein:vir:23 75 EGFRIPSANGEEPESGGEND--PASELWDWWQ-A-N---NLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVP-L 146 (488) T ss_pred cceeccCCcccccccccchh--HHHHHHHHHH-h-c---ChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcc-e Confidence 3343321110 111111 1123444443 1 1 245667778899999999999876543 2322 3 Q ss_pred EEEeccceEEEEEcCCc--e----EEEEEecCc----eEEecHhH-------------------------eeEecCcC-C Q lcl|NC_019710. 145 LLPLQSANMDVKLVGKK--V----VYRYQRDSE----YADFSQKE-------------------------IFHLKGFG-F 188 (424) Q Consensus 145 l~~l~p~~v~~~~~~~~--~----~~~~~~~~~----~~~~~~~e-------------------------vih~r~~~-~ 188 (424) +..++|..+.+..|... . .+.+...+. ...|.++. |++|++.. . T Consensus 147 i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~ 226 (488) T protein:vir:23 147 IRVEPPTALYAEVDPRTRKVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEWEAPTSTPHGLEMVPVIPISNRTRL 226 (488) T ss_pred EEEeccceeEEEEecCCCceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecCCceEeccccccCCCCcceEEecccccc Confidence 56777887776655321 0 111111110 01122222 35554332 3 Q ss_pred CCccccchHH-HHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHH--HHHHHHHHHHHhCCcccCcceecCCC Q lcl|NC_019710. 189 TGLVGLSPIA-FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQ--RSQVEENFKEIAGGPVKKRLWILEAG 265 (424) Q Consensus 189 ~~~~G~s~~~-~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~--~~~~~~~~~~~~~~~~ag~~~~l~~g 265 (424) .+.+|.|-+. .+...++....+.........-.+.|..+++- .. .++.. ...-...++.. 
.++++.+++| T Consensus 227 ~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G-~~-~~~~~~~~~~~~~~~~~~-----~~~v~~~~~g 299 (488) T protein:vir:23 227 SDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFG-AK-PEELGINAETGQRMFDAY-----MARILAFEGG 299 (488) T ss_pred CCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhC-CC-cccccccccccchhhhhh-----hhhhccCCCC Confidence 3457777554 22222332222222222222222334333321 01 00000 00011122222 2356667665 Q ss_pred --ceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHH----------HHHHHHHHHHHHHH Q lcl|NC_019710. 266 --FSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN----------LGFLQYTLQPYISR 333 (424) Q Consensus 266 --~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~----------~~f~~~tl~P~~~~ 333 (424) .++.++.....+ .+++..+....+|+..=++|+..+|....+..|........ ...+...+.-++.. T Consensus 300 ~~~~~~q~~~~~~~-~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l 378 (488) T protein:vir:23 300 EGAHAEQFSAAELR-NFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRL 378 (488) T ss_pred CCceeEecCCCChH-HHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 456665543322 37788888889999999999999976543322211111111 11122222222222 Q ss_pred HHHHHhhhccChhh--hccceeeecchhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC--CcCee----ee Q lcl|NC_019710. 334 WENSIQRWLIPAKD--VGRIHAEHNLDGLLRGDSASRAAFMKAMGESG--LRTINEMRRTDNLPPLP--GGDVA----MR 403 (424) Q Consensus 334 ie~~l~~~L~~~~~--~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g--~~t~NE~R~~lg~~p~~--ggd~~----~~ 403 (424) +...+ ...+ .....+++.+......+....++.+.+++++| +++..-+++++|+-+.+ ..+.. .. 
T Consensus 379 ~~~~~-----~~~~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~~~~~~~~~~~~~~ 453 (488) T protein:vir:23 379 AYKMV-----KGGDIPTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVEREQMRQWLEQDQK 453 (488) T ss_pred HHHHh-----cCCCcchhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchHHHHHHHHHHHHHH Confidence 21111 1111 01123444445556677888899999998876 78898888999885432 11110 00 Q ss_pred c--ccccchhh------------ccccCCCccCCC Q lcl|NC_019710. 404 Q--SQYVPITD------------LGTNKEPRNNGA 424 (424) Q Consensus 404 ~--~n~~~~~~------------~~~~~~~~~~g~ 424 (424) . ..+-.+.. .++...++.+.| T Consensus 454 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~a 488 (488) T protein:vir:23 454 QGLGLIGSLYGASTPEGKPGEAPVGEPPAPEPDAA 488 (488) T ss_pred HHHHHHHHHhccCCCcccCCCCCCCCCCCCCCCCC Confidence 0 00000000 011112222222 No 173 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=98.05 E-value=6e-06 Score=49.19 Aligned_cols=397 Identities=10% Similarity=0.050 Sum_probs=178.3 Q ss_pred CCCCCccccc---------CCCccH-----------HHHHHhhccCcccccccccccccccccccccCCccccHHHHhhh Q lcl|NC_019710. 1 MEEPKYTIDL---------RTNNGW-----------WARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQI 60 (424) Q Consensus 1 ~~~~~~~~~~---------~~~~G~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 60 (424) .+...|.|.= +.-.-+ +.++..+..+........ ...... .-...-+.+ T Consensus 24 ~~~~~~~~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~----------~~~~~~-~~~~~ki~~ 92 (512) T protein:vir:97 24 EANVVYTYDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVEL----------TRRKEE-YMADNRVAH 92 (512) T ss_pred ccccccccCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCcccccc----------Cccccc-ccCcceeec Confidence 3333333310 000011 122222222211100000 000000 000000123 Q ss_pred HHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019710. 
61 STVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G 140 (424) ......|+..+.-+-+-|+.+--. +. + ....+..++.. | ........+..+++.+|.||.++.++.+| T Consensus 93 n~~k~Ivd~~~~yl~g~p~~~~~~--d~---~--~~~~l~~~~~~--n---~~~~~~~~~~~~~~i~G~ay~~vy~ded~ 160 (512) T protein:vir:97 93 DYASYISDFINGYFLGNPIQCQDD--DK---D--VLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDD 160 (512) T ss_pred chHHHHHHHHhhhhcccCceeccC--Ch---H--HHHHHHHHHhh--c---CHHHHHHHHHHHHHhcCeEEEEEEeCCCC Confidence 455667777777777777765211 11 1 11233443322 2 34466677889999999999999999888 Q ss_pred ceeEEEEeccceEEEEEcCCc---e-----EEEEE--ecC--c----eEEecHhHeeEecCcC----------------- Q lcl|NC_019710. 141 DVISLLPLQSANMDVKLVGKK---V-----VYRYQ--RDS--E----YADFSQKEIFHLKGFG----------------- 187 (424) Q Consensus 141 ~~~~l~~l~p~~v~~~~~~~~---~-----~~~~~--~~~--~----~~~~~~~evih~r~~~----------------- 187 (424) .+ .+..++|..+.+..++.. . +|... .+. . ...+.++.|.+++... T Consensus 161 ~~-~i~~~~p~~~~~iyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 239 (512) T protein:vir:97 161 ET-RLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHS 239 (512) T ss_pred ce-EEEEEcccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccc Confidence 76 578889999988776532 1 11110 000 0 1234555555543110 Q ss_pred ---------CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCC--ccc Q lcl|NC_019710. 188 ---------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGG--PVK 256 (424) Q Consensus 188 ---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~--~~a 256 (424) .+...|.|.+..+...++....+..-..+.+...+.|-.+++-......++.........-..... .+. 
T Consensus 240 ~g~vPvv~~~nn~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (512) T protein:vir:97 240 FERMPITEFSNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYENR 319 (512) T ss_pred CcccceEeecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchhhhhhhhcccccccccchhhc Confidence 012357888888888888777776666666677777776665432222222111111111000001 111 Q ss_pred CcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHHH Q lcl|NC_019710. 257 KRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYT 326 (424) Q Consensus 257 g~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~t 326 (424) ....-.++|.+++-+........+....+...+.|+..-++|..-.+... ++.|..... ......+... T Consensus 320 ~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~-gn~Sg~Al~~~~~~l~~ka~~k~~~f~~~ 398 (512) T protein:vir:97 320 DTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKG 398 (512) T ss_pred ccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11122345556655554444445566778888899999999865443222 222211111 1111222222 Q ss_pred HHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--------- Q lcl|NC_019710. 327 LQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG--------- 397 (424) Q Consensus 327 l~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g--------- 397 (424) +.-.++.|...+...--.........+++.+..-+..+..+.++.+.++. |+++.--+.+++++-+.+. T Consensus 399 l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl~--giiS~et~~~~l~~v~d~~~E~eri~~E 476 (512) T protein:vir:97 399 LRRRAKLLETILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQDPELEVKKIEED 476 (512) T ss_pred HHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHHHHHHH Confidence 33333333222221110001111123444455667778888888888884 8899988888886633211 Q ss_pred -cCee---eecccccchhhc-cccCCCccCCC Q lcl|NC_019710. 
398 -GDVA---MRQSQYVPITDL-GTNKEPRNNGA 424 (424) Q Consensus 398 -gd~~---~~~~n~~~~~~~-~~~~~~~~~g~ 424 (424) .+.. ..+....+-..- .+.++..++.+ T Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 508 (512) T protein:vir:97 477 EKESIKKAQKGIYKDPRDINDDEQDDDTKDTV 508 (512) T ss_pred HHHHHHHHhhcccCCCCCCCCCCCCCCccccc Confidence 0000 000000000000 00111111111 No 174 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=98.04 E-value=6.3e-06 Score=49.10 Aligned_cols=390 Identities=7% Similarity=-0.029 Sum_probs=172.9 Q ss_pred CCCCCcccccCC-----------CccHHHHHHhhccCcccc--cccccccc--cccccc--cccCCcc--ccHHHHhhhH Q lcl|NC_019710. 1 MEEPKYTIDLRT-----------NNGWWARLKSWFVGGRLV--TPNQGSQT--GPVSAH--GYLGDSS--INDERILQIS 61 (424) Q Consensus 1 ~~~~~~~~~~~~-----------~~G~~~~~~~~~~~~~~~--~~~~~~~~--~~~~~~--~~~~~~~--~~~~~~~~~~ 61 (424) --|-.++-++.+ ..-++.++...+...... ....-... ..+... .+..+.. .-...-+.++ T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n 81 (472) T protein:vir:93 2 YPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITN 81 (472) T ss_pred CCCCCcchhhhhceeeecCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhccccccccccccccccc Confidence 000011111111 112333322222111000 00000000 000000 0000000 0000011245 Q ss_pred HHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCc Q lcl|NC_019710. 62 TVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGD 141 (424) Q Consensus 62 ~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~ 141 (424) ....+|+..+.-+-+-|+++-- ++. +. ...+...+. | .-......+..+.+.+|.||+++..+.+|. 
T Consensus 82 ~~~~ivd~~~~~l~g~~~~~~~--~d~---~~--~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~ 148 (472) T protein:vir:93 82 FHANLVDQKVSYIVGKPIAFKH--TDD---EV--VKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLDEEGE 148 (472) T ss_pred hHHHHHHHHhhhhcccCeeecc--CCh---HH--HHHHHHHHh---c---cHHHHHHHHHHHHhhcCeEEEEEEECCCCc Confidence 6667788877777666766521 111 11 122333332 2 133555667889999999999999988887 Q ss_pred eeEEEEeccceEEEEEcCCc---eE---EEEE--ecCceEEecHhHeeEecC----------------------c----- Q lcl|NC_019710. 142 VISLLPLQSANMDVKLVGKK---VV---YRYQ--RDSEYADFSQKEIFHLKG----------------------F----- 186 (424) Q Consensus 142 ~~~l~~l~p~~v~~~~~~~~---~~---~~~~--~~~~~~~~~~~evih~r~----------------------~----- 186 (424) + .+-.++|..+.+..++.. .. +.|. .......+.+..+.+++. . T Consensus 149 ~-~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 227 (472) T protein:vir:93 149 F-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKI 227 (472) T ss_pred e-EEEEEcccceEEEEcCCCCCceEEEEEEEEeecceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCc Confidence 6 477789998888765321 10 0111 111111122222222210 0 Q ss_pred C----CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec Q lcl|NC_019710. 187 G----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL 262 (424) Q Consensus 187 ~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l 262 (424) + .+...|.|-+..+...++....+.....+.+...+.|..+++-.......+... ... ..+++.+ T Consensus 228 Pvv~~~nn~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~----~~~-------~~~~~~~ 296 (472) T protein:vir:93 228 PFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKR----LLR-------YYGAIKV 296 (472) T ss_pred ceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccchhhHH----HHh-------hcccccc Confidence 0 023458888888888887777666666666777777877765332211112111 111 2235555 Q ss_pred CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHH----------HHHHHHHHHHHHHHH Q lcl|NC_019710. 
263 EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ----------QNLGFLQYTLQPYIS 332 (424) Q Consensus 263 ~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~----------~~~~f~~~tl~P~~~ 332 (424) +++.+...+.....+..+....+...+.|+..-++|..-.+... ++.+....+- .....+...+.-+++ T Consensus 297 ~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~ 375 (472) T protein:vir:93 297 SDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLW 375 (472) T ss_pred CCCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 55555555554445566777888889999999999864443222 2222111110 111222222223333 Q ss_pred HHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CcCee-----eecc Q lcl|NC_019710. 333 RWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLP--GGDVA-----MRQS 405 (424) Q Consensus 333 ~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~--ggd~~-----~~~~ 405 (424) .|...+. ... ....+.+.+...+..|..+.++.+.++ .|+++.--+.+++++-..+ ..+.. -... T Consensus 376 li~~~~~-----~~~-~~~~i~v~f~~~~p~~~~~~~~~~~k~--~giis~et~l~~l~~~~d~~~E~~ri~~E~~~~~~ 447 (472) T protein:vir:93 376 FVFEHFD-----IKG-EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQMEYNK 447 (472) T ss_pred HHHHHhC-----CCc-ccceeeEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHH Confidence 2322221 111 112344445666777888888888887 4788887777777653211 00000 0001 Q ss_pred cccchhhccccC--CCccCCC Q lcl|NC_019710. 406 QYVPITDLGTNK--EPRNNGA 424 (424) Q Consensus 406 n~~~~~~~~~~~--~~~~~g~ 424 (424) ....+...+... 
+.++.+- T Consensus 448 ~~~~~~~~~~d~~~~~~~~~~ 468 (472) T protein:vir:93 448 QLPNLDDGGADGAQQQERSNN 468 (472) T ss_pred hccCcCcccCCCCCCCCCCCc Confidence 111111111110 0001111 No 175 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=98.03 E-value=6.6e-06 Score=49.00 Aligned_cols=392 Identities=10% Similarity=0.037 Sum_probs=162.9 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCccccccccccccccccccccc--CCccccHH---HHhhhHHHHHHHHHHHHhhh Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYL--GDSSINDE---RILQISTVWRCVSLISTLTA 75 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~---~~~~~~~v~~~i~~ia~~ia 75 (424) |+++-.+-. |+..+...+....... ......+.+-... -+..+-.+ ....+.+..-+|+.++..+- T Consensus 8 ~~e~~~~~~------~~~~l~~~~~~~~~r~---~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~l~ 78 (486) T protein:vir:42 8 MEEIEDPAV------VREEMISAFEDASKDL---ASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAERQA 78 (486) T ss_pred CCCcccHHH------HHHHHHHHHHHHHHHH---HHHHHHhcccCcchhcccccchhHhhhhhccchHHHHHHHHHhhhc Confidence 333332221 3334443332211100 0000000000000 00001000 01112345556666665543 Q ss_pred hCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce-------eEEEEe Q lcl|NC_019710. 76 CLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV-------ISLLPL 148 (424) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~-------~~l~~l 148 (424) -..|.+ .. + . .....+.+++. + |. .......+..+++.+|.||+.+.++..|.. 
..+.++ T Consensus 79 ~~g~~~---~~-~--~--~~~~~~~~i~~-~-N~---~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~ 145 (486) T protein:vir:42 79 VEGFRL---GD-A--D--EADEELWQWWQ-A-NN---LDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVE 145 (486) T ss_pred ccceec---CC-C--c--hhHHHHHHHHH-h-cC---hhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEe Confidence 334432 11 1 1 11122444443 2 22 235567788999999999999887664432 256677 Q ss_pred ccceEEEEEcCCc--e----EEEEEecCce----EEecHhH-------------------------eeEecCcC-CCCcc Q lcl|NC_019710. 149 QSANMDVKLVGKK--V----VYRYQRDSEY----ADFSQKE-------------------------IFHLKGFG-FTGLV 192 (424) Q Consensus 149 ~p~~v~~~~~~~~--~----~~~~~~~~~~----~~~~~~e-------------------------vih~r~~~-~~~~~ 192 (424) +|..+.+..|... . .+.+...+.. ..+.++. |++|++.. ..+.+ T Consensus 146 ~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~~~n~~~~~~~~ 225 (486) T protein:vir:42 146 PPTRMHAEIDPRINRVSKAIRVAYDKEGNEIQAATLYTPMETIGWFRADGEWAEWFNVPHGLGVVPVVPLPNRTRLSDLY 225 (486) T ss_pred cccceEEEEeCCCCCeEEEEEEEEecCCCeEEEEEEEcCCcEEEEEecCCcEEeecceecCCCCceEEEeccccccCCCC Confidence 8888876665321 0 0111100000 0122222 23343321 23456 Q ss_pred ccchHHH-HHHHHHHHHHHHHHHHHHHhccCCCceeEEcCC-CCCCHHHHHHHHHHHHHHhCCcccCcceecC-CCceee Q lcl|NC_019710. 193 GLSPIAF-ACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE-KVLTEQQRSQVEENFKEIAGGPVKKRLWILE-AGFSTS 269 (424) Q Consensus 193 G~s~~~~-~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~-~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~-~g~~~~ 269 (424) |.|-+.- +...++....+..-......-.+.|..+++-.. .....+. ..-+..++.. .++++.++ ++.++. 
T Consensus 226 G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~-~~~~~~~~~~-----~~~~~~~~~~~~~~~ 299 (486) T protein:vir:42 226 GTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDS-ETGQTLFDAY-----LARILAFEDAEGKIQ 299 (486) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcCCcccccccc-ccccchhhhh-----hchhcccCCCCceEE Confidence 7775542 223333333332222222333334544443110 0000000 0001112211 24466654 456776 Q ss_pred eccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHH----------HHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019710. 270 AIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ----------QNLGFLQYTLQPYISRWENSIQ 339 (424) Q Consensus 270 ~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~----------~~~~f~~~tl~P~~~~ie~~l~ 339 (424) ++.....+ .+++..+....+++..=++|+..+|....+..|...... .....+...+.-+++.+....+ T Consensus 300 q~~~~~~e-~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~ 378 (486) T protein:vir:42 300 QFSAAELA-NFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMK 378 (486) T ss_pred eecccCHH-HHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 66543322 377888888888999999999999865432222111111 1112222233333322211111 Q ss_pred hhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhC--CCcCHHHHHHHhCCCCCC--CcCee--------ee---- Q lcl|NC_019710. 340 RWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGES--GLRTINEMRRTDNLPPLP--GGDVA--------MR---- 403 (424) Q Consensus 340 ~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~--g~~t~NE~R~~lg~~p~~--ggd~~--------~~---- 403 (424) ..-. +. ....+++.+......+..+.++.+.+++++ |+++..-+++.+|+-+.+ ..... .. T Consensus 379 ~~~~-~~--d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~ 455 (486) T protein:vir:42 379 GGDV-PP--DMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDMGYSVKEREEMRRWDEEEAAMGLGLLGT 455 (486) T ss_pred CCCc-cc--cceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHHH Confidence 0000 11 113345555666678888899999999886 688888888888885432 11100 00 Q ss_pred ---cccccchhhc-----cccCCCccCCC Q lcl|NC_019710. 
404 ---QSQYVPITDL-----GTNKEPRNNGA 424 (424) Q Consensus 404 ---~~n~~~~~~~-----~~~~~~~~~g~ 424 (424) ..+-.+-... .++...+..|| T Consensus 456 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (486) T protein:vir:42 456 MVDADPTVPGSPSPTAPPKPQPAIESSGG 484 (486) T ss_pred hhcCCCCCCCCCCCCCCCCCCcccCCCCC Confidence 0000000000 00000011111 No 176 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=97.97 E-value=8.9e-06 Score=48.28 Aligned_cols=389 Identities=12% Similarity=0.041 Sum_probs=165.8 Q ss_pred cCCCccHHHHHHhhccCccccccccccccccccccccc--CCccccHH---HHhhhHHHHHHHHHHHHhhhhCceeEeec Q lcl|NC_019710. 10 LRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYL--GDSSINDE---RILQISTVWRCVSLISTLTACLPLDVFET 84 (424) Q Consensus 10 ~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~---~~~~~~~v~~~i~~ia~~ia~~~~~~~~~ 84 (424) |-|..=|+.++........+. .. .....+.+-.-. .+..+..+ .-..+.+...+|+..+..+--.+|.+ T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r-~~--~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~--- 74 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPN-LL--EAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI--- 74 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHH-HH--HHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhhccCceec--- Confidence 667777777777766432211 00 000000000000 00111111 00112344556666655543333322 Q ss_pred cccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeC------CCCceeEEEEeccceEEEEEc Q lcl|NC_019710. 85 DQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRN------SAGDVISLLPLQSANMDVKLV 158 (424) Q Consensus 85 ~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~------~~G~~~~l~~l~p~~v~~~~~ 158 (424) .++. .....+..++.. 
| ........+..+++.+|.||.++-++ .+|.+ .+..++|..+.+..| T Consensus 75 ~~d~-----~~~~~l~~i~~~--N---~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~-~i~~~~p~~~~~~~D 143 (480) T protein:vir:78 75 SEDS-----EGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIP-LIRVESPLYMYAELD 143 (480) T ss_pred CCCc-----hhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCee-EEEEEcccceEEEEc Confidence 1111 112345555532 2 23466778889999999999888653 34544 467788888887776 Q ss_pred CCc---e----EEEEEe-cCc----eEEecHhH-----------------------------eeEecCcC-CCCccccch Q lcl|NC_019710. 159 GKK---V----VYRYQR-DSE----YADFSQKE-----------------------------IFHLKGFG-FTGLVGLSP 196 (424) Q Consensus 159 ~~~---~----~~~~~~-~~~----~~~~~~~e-----------------------------vih~r~~~-~~~~~G~s~ 196 (424) ... . .|++.. +.. ...+.+++ |+||++.. ..+.+|.|- T Consensus 144 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~ 223 (480) T protein:vir:78 144 PRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE 223 (480) T ss_pred CCCccceEEEEEEEEeecCCCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEEeecccccCCccCccc Confidence 421 0 010000 000 00111121 34444332 234677776 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC-CCceeeeccCC Q lcl|NC_019710. 197 IAF-ACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-AGFSTSAIGVT 274 (424) Q Consensus 197 ~~~-~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~-~g~~~~~l~~s 274 (424) +.- +...++....+..-......-.+.|..+|. +.+. .+...+.-...+.... +.++.++ +..++.++... 
T Consensus 224 i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~~-~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~ 296 (480) T protein:vir:78 224 ISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-GVTT-DELTNDGENTTLDIYY-----GRILTLASEAAKISEFKAA 296 (480) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhh-cCCc-cccccccccchhhhhh-----hhhccCCCCCceEEecCcc Confidence 542 344444433333333333333344554543 1111 1111111111122211 2344443 44566666543 Q ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhh------hccChhhh Q lcl|NC_019710. 275 PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQR------WLIPAKDV 348 (424) Q Consensus 275 ~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~------~L~~~~~~ 348 (424) ..+ .+++..+....+|+..=++|+..+|....+..|..........+...+ .-....+...|.+ .+...... T Consensus 297 ~~~-~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka-~~~~~~f~~~l~~~~~l~~~~~g~~~~ 374 (480) T protein:vir:78 297 ELR-NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMA-ERKGRIFGGAWERAMRIAMQIMGREVT 374 (480) T ss_pred CHH-HHHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHcCCCcc Confidence 222 267777888888999999999999865432222111111111111111 1111111111111 11111111 Q ss_pred -ccceeeecchhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC----------cCeeeecccc-------- Q lcl|NC_019710. 349 -GRIHAEHNLDGLLRGDSASRAAFMKAMGESG--LRTINEMRRTDNLPPLPG----------GDVAMRQSQY-------- 407 (424) Q Consensus 349 -~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g--~~t~NE~R~~lg~~p~~g----------gd~~~~~~n~-------- 407 (424) ....+++.+......+..+.++.+.+++.+| +++..-+++.+|+.+.+- +....-...- T Consensus 375 ~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~ 454 (480) T protein:vir:78 375 EEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADA 454 (480) T ss_pred ccceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccccCCC Confidence 1123444445555667788888888888766 677777778888765321 1000000000 Q ss_pred cchhhccccCCCccCC-C Q lcl|NC_019710. 
408 VPITDLGTNKEPRNNG-A 424 (424) Q Consensus 408 ~~~~~~~~~~~~~~~g-~ 424 (424) .+-...++..++.+++ + T Consensus 455 ~~~~~~~~~~~~~~~~~~ 472 (480) T protein:vir:78 455 TPKPTVTETKTETQTSPS 472 (480) T ss_pred CCCCCCCCCCCccccccC Confidence 0000011110011000 0 No 177 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=97.96 E-value=9.2e-06 Score=48.20 Aligned_cols=386 Identities=13% Similarity=0.023 Sum_probs=173.9 Q ss_pred CCccHHHHHHhhccCcccc-cccccc-----------------ccccccc--ccccC------CccccHHHHhhhHHHHH Q lcl|NC_019710. 12 TNNGWWARLKSWFVGGRLV-TPNQGS-----------------QTGPVSA--HGYLG------DSSINDERILQISTVWR 65 (424) Q Consensus 12 ~~~G~~~~~~~~~~~~~~~-~~~~~~-----------------~~~~~~~--~~~~~------~~~~~~~~~~~~~~v~~ 65 (424) --.|++++|++++++-... ...... +...+.+ ..|.. +... .+..+....... T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~-~~~~~s~n~~~~ 79 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPV-NRRQLSMNLPKV 79 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCcc-ccceeecchHHH Confidence 1113444444444321000 000000 0000000 00000 0000 011222334445 Q ss_pred HHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEE Q lcl|NC_019710. 66 CVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISL 145 (424) Q Consensus 66 ~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l 145 (424) +++..|+-+..=|..+--.+ + .....+..++. 
-| ....-+..++...+..|.+|+.+..|.+|.+ .+ T Consensus 80 iv~~~a~~l~~ep~~i~~~d-----~--~~~e~l~~~~~--~n---~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~-~i 146 (499) T protein:vir:80 80 TAKYMSKLLFNEKVKINIDD-----E--TAEEFVLNVLK--TN---GFTKNMERYIEYGEAMGGFVIKVYHDGNKNV-KV 146 (499) T ss_pred HHHHHHHhhhCCcceEeeCC-----H--HHHHHHHHHHh--hc---cHHHHHHHHHHHHhhcCcEEEEEEECCCCcE-EE Confidence 56666665555454432111 0 01112333332 11 2345566677788889999999988888775 46 Q ss_pred EEeccceEEEE-EcCCce--------------EEE--------------EEe--------cC--ceEEecHhH------- Q lcl|NC_019710. 146 LPLQSANMDVK-LVGKKV--------------VYR--------------YQR--------DS--EYADFSQKE------- 179 (424) Q Consensus 146 ~~l~p~~v~~~-~~~~~~--------------~~~--------------~~~--------~~--~~~~~~~~e------- 179 (424) -.++|.++.+. .+.+.. +|. |.. +. .+..++..+ T Consensus 147 ~~v~a~~~~Pi~~d~~~~~~~~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~ 226 (499) T protein:vir:80 147 SFATADCMYPLSNDSENVDECLIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEP 226 (499) T ss_pred EEEcCCceEEEEecCCCeEEEEEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCC Confidence 77777777653 332211 010 000 00 001111111 Q ss_pred -----------eeEecCcCC-----CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEE-----cCCCCCCHH Q lcl|NC_019710. 180 -----------IFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILS-----TGEKVLTEQ 238 (424) Q Consensus 180 -----------vih~r~~~~-----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~-----~~~~~~~~~ 238 (424) +.+|+.+-. +.+.|+|.+.-+...++..........+-|..+. ...++. 
...+...+ T Consensus 227 ~~~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~-~~i~v~~~~l~~~~~~~g~- 304 (499) T protein:vir:80 227 VVPLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGK-KKVLVPSSFVKTAVNLDGS- 304 (499) T ss_pred ceeecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhcc-cceecchhhhhccCCCCCC- Confidence 345554311 2356999999998888877777666666676643 333331 11010000 Q ss_pred HHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHH Q lcl|NC_019710. 239 QRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ 318 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~ 318 (424) ....+....+.+. +.....-+++-.++.++....+-++.+..+...++|....|+++..+|....+..+ +.+. T Consensus 305 ~~~~~~~~~~~~~----~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~T---Atei 377 (499) T protein:vir:80 305 TTQYFDSTDEAFF----LYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKT---ATEV 377 (499) T ss_pred cccCCCcccceee----EeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchh---HHHH Confidence 0000000000000 00011112233466666666666778888889999999999999999876544322 2221 Q ss_pred --H-----------HHHHHHHHHHHHHHHHHHHhhhccChh-hhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHH Q lcl|NC_019710. 319 --N-----------LGFLQYTLQPYISRWENSIQRWLIPAK-DVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTIN 384 (424) Q Consensus 319 --~-----------~~f~~~tl~P~~~~ie~~l~~~L~~~~-~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~N 384 (424) . ...++.+|..++..|............ 
......+.++++.-+..|.++.++...+++.+|+|+.- T Consensus 378 ~s~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~e 457 (499) T protein:vir:80 378 VSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMIPLK 457 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHH Confidence 1 111222333333333222111111111 11234567777777888999999999999999999998 Q ss_pred HHHHHh-CCCCCCCcCeeeeccc-----ccchhhccccCCCccC Q lcl|NC_019710. 385 EMRRTD-NLPPLPGGDVAMRQSQ-----YVPITDLGTNKEPRNN 422 (424) Q Consensus 385 E~R~~l-g~~p~~ggd~~~~~~n-----~~~~~~~~~~~~~~~~ 422 (424) .++... |.+- +..++.+.... ..|-.+.+... +++. T Consensus 458 t~l~~~~~~~d-~ea~~el~~i~~E~~~~~~~~d~~g~~-ge~e 499 (499) T protein:vir:80 458 IALQRAWNITE-AEADEWAEMLAKEKQAEIPNNDMTGIF-GEEE 499 (499) T ss_pred HHHhhcCCCCh-HHHHHHHHHHHHHhhcCCCCCCccccC-CCCC Confidence 887653 5432 21111111000 00111101000 0000 No 178 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=97.95 E-value=9.7e-06 Score=48.06 Aligned_cols=397 Identities=10% Similarity=0.051 Sum_probs=172.8 Q ss_pred CCCCCcccccCCC-----------ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTN-----------NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSL 69 (424) Q Consensus 1 ~~~~~~~~~~~~~-----------~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ 69 (424) ..+.-.++..+.= +-.+.++..+..+............. .... ..-+.+....-.|+. 
T Consensus 33 ~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~-----~~~~------~~ki~~n~~k~Iv~~ 101 (511) T protein:vir:96 33 GTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKE-----EYMA------DNRVAHDYASYISDF 101 (511) T ss_pred chhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccc-----cccC------cceeecchHHHHHHH Confidence 1111111111100 01122233333222111000000000 0000 000123455566677 Q ss_pred HHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEec Q lcl|NC_019710. 70 ISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQ 149 (424) Q Consensus 70 ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~ 149 (424) .+.-+-+-|+.+- ..+. + ....+..++.. | ....+...+..+++.+|.||.++-++.+|.+ .+..++ T Consensus 102 ~~~yl~g~p~~~~--~~d~---~--~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~-~i~~~~ 168 (511) T protein:vir:96 102 INGYFLGNPIQYQ--DDDK---D--VLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSD 168 (511) T ss_pred HhhhhcccCceee--cCch---H--HHHHHHHHHhh--c---ChhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEc Confidence 7766666676652 1111 1 11233443332 2 2345667788899999999999999888876 577888 Q ss_pred cceEEEEEcCCc---eEE---EEEe----cC--c----eEEecHhHeeEecCcC-------------------------- Q lcl|NC_019710. 150 SANMDVKLVGKK---VVY---RYQR----DS--E----YADFSQKEIFHLKGFG-------------------------- 187 (424) Q Consensus 150 p~~v~~~~~~~~---~~~---~~~~----~~--~----~~~~~~~evih~r~~~-------------------------- 187 (424) |..+.+..+... ..+ +|.. +. . ...+.++.+.++.... T Consensus 169 p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 248 (511) T protein:vir:96 169 AMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEF 248 (511) T ss_pred ccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEe Confidence 999887776432 111 1110 00 0 1234555565543211 Q ss_pred CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcc-cCcceecCCCc Q lcl|NC_019710. 
188 FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPV-KKRLWILEAGF 266 (424) Q Consensus 188 ~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~-ag~~~~l~~g~ 266 (424) .+...|.|.+..+...++....+..-..+.+...+.|-.+++-......++.....+...-....... .+.-.-.+++. T Consensus 249 ~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 328 (511) T protein:vir:96 249 SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSV 328 (511) T ss_pred cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCc Confidence 01234778788777777766666555555666666676666543222222211110000000000000 00001122333 Q ss_pred eeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 267 STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTLQPYISRWEN 336 (424) Q Consensus 267 ~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~tl~P~~~~ie~ 336 (424) +++-+........+....+...+.|+..-++|..-.+... ++.|..... ......+...+.-.++.|.. T Consensus 329 ~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~ 407 (511) T protein:vir:96 329 DGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 407 (511) T ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4444443334455567778888999999999865443322 222211111 11112233333333333333 Q ss_pred HHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC----------cCeeee--- Q lcl|NC_019710. 337 SIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG----------GDVAMR--- 403 (424) Q Consensus 337 ~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g----------gd~~~~--- 403 (424) .+...--.........+++.+..-+..|..+.++.+.++. |+++.--+.+++++-+.+. .+..-. 
T Consensus 408 ~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~ 485 (511) T protein:vir:96 408 ILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQK 485 (511) T ss_pred HHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhh Confidence 3322111111111123455556677788888999988885 7898877777776532110 000000 Q ss_pred cccccchhhccc-cCCCccCCC Q lcl|NC_019710. 404 QSQYVPITDLGT-NKEPRNNGA 424 (424) Q Consensus 404 ~~n~~~~~~~~~-~~~~~~~g~ 424 (424) .....+.+.-.+ +....++.+ T Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~ 507 (511) T protein:vir:96 486 GIYKDPRDINDDEQDDDTKDTV 507 (511) T ss_pred ccccCCCCCCCCCCCCCccCcc Confidence 000000000000 000011111 No 179 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=97.95 E-value=9.7e-06 Score=48.06 Aligned_cols=397 Identities=10% Similarity=0.051 Sum_probs=172.8 Q ss_pred CCCCCcccccCCC-----------ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTN-----------NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSL 69 (424) Q Consensus 1 ~~~~~~~~~~~~~-----------~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ 69 (424) ..+.-.++..+.= +-.+.++..+..+............. .... ..-+.+....-.|+. T Consensus 33 ~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~-----~~~~------~~ki~~n~~k~Iv~~ 101 (511) T protein:vir:78 33 GTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKE-----EYMA------DNRVAHDYASYISDF 101 (511) T ss_pred chhhhhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccc-----cccC------cceeecchHHHHHHH Confidence 1111111111100 01122233333222111000000000 0000 000123455566677 Q ss_pred HHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEec Q lcl|NC_019710. 
70 ISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQ 149 (424) Q Consensus 70 ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~ 149 (424) .+.-+-+-|+.+- ..+. + ....+..++.. | ....+...+..+++.+|.||.++-++.+|.+ .+..++ T Consensus 102 ~~~yl~g~p~~~~--~~d~---~--~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~-~i~~~~ 168 (511) T protein:vir:78 102 INGYFLGNPIQYQ--DDDK---D--VLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSD 168 (511) T ss_pred HhhhhcccCceee--cCch---H--HHHHHHHHHhh--c---ChhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEc Confidence 7766666676652 1111 1 11233443332 2 2345667788899999999999999888876 577888 Q ss_pred cceEEEEEcCCc---eEE---EEEe----cC--c----eEEecHhHeeEecCcC-------------------------- Q lcl|NC_019710. 150 SANMDVKLVGKK---VVY---RYQR----DS--E----YADFSQKEIFHLKGFG-------------------------- 187 (424) Q Consensus 150 p~~v~~~~~~~~---~~~---~~~~----~~--~----~~~~~~~evih~r~~~-------------------------- 187 (424) |..+.+..+... ..+ +|.. +. . ...+.++.+.++.... T Consensus 169 p~~~~~v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 248 (511) T protein:vir:78 169 AMSTFIIYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEF 248 (511) T ss_pred ccceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEe Confidence 999887776432 111 1110 00 0 1234555565543211 Q ss_pred CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcc-cCcceecCCCc Q lcl|NC_019710. 188 FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPV-KKRLWILEAGF 266 (424) Q Consensus 188 ~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~-ag~~~~l~~g~ 266 (424) .+...|.|.+..+...++....+..-..+.+...+.|-.+++-......++.....+...-....... .+.-.-.+++. 
T Consensus 249 ~n~~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 328 (511) T protein:vir:78 249 SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSV 328 (511) T ss_pred cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCc Confidence 01234778788777777766666555555666666676666543222222211110000000000000 00001122333 Q ss_pred eeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 267 STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTLQPYISRWEN 336 (424) Q Consensus 267 ~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~tl~P~~~~ie~ 336 (424) +++-+........+....+...+.|+..-++|..-.+... ++.|..... ......+...+.-.++.|.. T Consensus 329 ~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~ 407 (511) T protein:vir:78 329 DGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLET 407 (511) T ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4444443334455567778888999999999865443322 222211111 11112233333333333333 Q ss_pred HHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC----------cCeeee--- Q lcl|NC_019710. 337 SIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG----------GDVAMR--- 403 (424) Q Consensus 337 ~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g----------gd~~~~--- 403 (424) .+...--.........+++.+..-+..|..+.++.+.++. |+++.--+.+++++-+.+. .+..-. T Consensus 408 ~~~~~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~ 485 (511) T protein:vir:78 408 ILKNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQK 485 (511) T ss_pred HHHhcCCCccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhh Confidence 3322111111111123455556677788888999988885 7898877777776532110 000000 Q ss_pred cccccchhhccc-cCCCccCCC Q lcl|NC_019710. 
404 QSQYVPITDLGT-NKEPRNNGA 424 (424) Q Consensus 404 ~~n~~~~~~~~~-~~~~~~~g~ 424 (424) .....+.+.-.+ +....++.+ T Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~ 507 (511) T protein:vir:78 486 GIYKDPRDINDDEQDDDTKDTV 507 (511) T ss_pred ccccCCCCCCCCCCCCCccCcc Confidence 000000000000 000011111 No 180 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=97.90 E-value=1.2e-05 Score=47.57 Aligned_cols=389 Identities=7% Similarity=-0.018 Sum_probs=175.1 Q ss_pred CCCC------------Ccc-------cccCCC----ccHHHHHHhhccCcccccccc--ccccc---cccc-cc-ccCCc Q lcl|NC_019710. 1 MEEP------------KYT-------IDLRTN----NGWWARLKSWFVGGRLVTPNQ--GSQTG---PVSA-HG-YLGDS 50 (424) Q Consensus 1 ~~~~------------~~~-------~~~~~~----~G~~~~~~~~~~~~~~~~~~~--~~~~~---~~~~-~~-~~~~~ 50 (424) |+|. +|+ ..+... .-++.++......... .... ....+ .... .. +..+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~i~~~i~~~~~~~~-r~~~l~~YY~g~~~i~~~~~~~~~~~~ 79 (483) T protein:vir:12 1 MAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLP-EISIGQEYYEQRPDIVKEPKPVDATGA 79 (483) T ss_pred CccchhcCCceeecCcchhhhhhhcccccCCchhhHHHHHHHHHHHHHHHHH-HHHHHHHHhcccccccccccccccccc Confidence 2221 111 111111 1233333222211100 0000 00000 0000 00 00000 Q ss_pred --cccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcC Q lcl|NC_019710. 51 --SINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYG 128 (424) Q Consensus 51 --~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G 128 (424) ..-+..-+.++....+|+..+.-+-+-|+++-- .+. .. ...+...+. 
| .-......+..+.+.+| T Consensus 80 ~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~~~--~d~---~~--~~~l~~~~~---n---~~~~~~~~~~~~~~~~G 146 (483) T protein:vir:12 80 VDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKH--TDD---EV--VKRIDEVLG---N---RFDDKLHSVLTGASNKG 146 (483) T ss_pred ccccccccccccchHHHHHHHHhhhhcccCceecc--CCh---HH--HHHHHHHHh---c---cHHHHHHHHHHHHhhCC Confidence 000000122456666777777777677766521 111 11 112333332 2 12344566788899999 Q ss_pred CeEEEEeeCCCCceeEEEEeccceEEEEEcCCc---e---EEEEEecC--ceEEecHhHeeEecC--------------- Q lcl|NC_019710. 129 NAYALVDRNSAGDVISLLPLQSANMDVKLVGKK---V---VYRYQRDS--EYADFSQKEIFHLKG--------------- 185 (424) Q Consensus 129 ~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~---~---~~~~~~~~--~~~~~~~~evih~r~--------------- 185 (424) .||+++-.+.+|.+ .+..++|..+.+..+... . .+.|.... ....+.+..+.|+.. T Consensus 147 ~~y~~v~~d~d~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~ 225 (483) T protein:vir:12 147 IEWLHPYLDEEGEF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLEN 225 (483) T ss_pred eEEEEEEEcCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEeecceEEEEEecCeEEEEEEeCCeeeecccccccc Confidence 99999999998876 577889999887765321 1 11111111 112223333333210 Q ss_pred -------cC---------CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHH Q lcl|NC_019710. 186 -------FG---------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKE 249 (424) Q Consensus 186 -------~~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~ 249 (424) .. .+...|.|-+..+...++....+..-..+.+...+.|..+++-.......+... ... 
T Consensus 226 ~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~----~~~- 300 (483) T protein:vir:12 226 SKTHFSTGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKR----LLR- 300 (483) T ss_pred cccccccCCCCccceEEecCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhHHH----hhh- Confidence 00 012357788887777777777666666666676777777765322211112111 111 Q ss_pred HhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHH Q lcl|NC_019710. 250 IAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQN 319 (424) Q Consensus 250 ~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~ 319 (424) ..+++.++++.+...+.....+..+....+...+.|+..-++|..-.+... ++.+....+ ... T Consensus 301 ------~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~ 373 (483) T protein:vir:12 301 ------YYGAIKVSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKL 373 (483) T ss_pred ------hccccccCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccccc-cCcHHHHHHHHHHHHHHHHHHH Confidence 223555555555555544444555677778888889999999854332221 222211111 112 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC-- Q lcl|NC_019710. 320 LGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG-- 397 (424) Q Consensus 320 ~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g-- 397 (424) ...+...+.-+++.|...+.. .. ....+++.+...+..|..+.++.+.++ .|+++..-+.+++++-+.+. T Consensus 374 ~~~f~~~l~~~~~li~~~~~~----~~--~~~~i~v~f~~~~p~~~~~~a~~~~kl--~GiiS~et~~~~~~~v~d~~~E 445 (483) T protein:vir:12 374 ARKAKVAIQELLWFVFEHFDI----KG--EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAE 445 (483) T ss_pred HHHHHHHHHHHHHHHHHHhcC----CC--ccceeeEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHH Confidence 223333344444433333221 11 123344555666778889999999888 48999988888887632210 Q ss_pred cCeee-----ecccccchhhccccCCCccCCC Q lcl|NC_019710. 
398 GDVAM-----RQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 398 gd~~~-----~~~n~~~~~~~~~~~~~~~~g~ 424 (424) .+..- ...+...+...+...++.++.. T Consensus 446 ~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~ 477 (483) T protein:vir:12 446 LERIEQEQMEYNKQLPNLDDGGADGAQQQERS 477 (483) T ss_pred HHHHHHHHHHHHhhcccccccccCCcccCCCC Confidence 00000 0000111111111100001000 No 181 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=97.85 E-value=1.5e-05 Score=47.09 Aligned_cols=402 Identities=11% Similarity=-0.018 Sum_probs=163.0 Q ss_pred CCCCCcccccCCC---------ccHHHHHHhhccCcccccccccccccccccccccCC--ccccHH---HHhhhHHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTN---------NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGD--SSINDE---RILQISTVWRC 66 (424) Q Consensus 1 ~~~~~~~~~~~~~---------~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---~~~~~~~v~~~ 66 (424) |-+--+.-+.-+. +-+++++...+....... ......+.+...... ..+..+ ....+.+..-+ T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~e~~~i~~L~~~~~~~~~r~---~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~~i 77 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDDVVDKVNGLYQQLVDRTPRN---LLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSAKA 77 (504) T ss_pred CCccCCcccccccccCCCCHHHHHHHHHHHHHHHHHhHHH---HHHHHHHhccccchhccccccHHHHHHhhccCcHHHH Confidence 4333222222222 234555555443222110 000011111110000 111111 11112333445 Q ss_pred HHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCcee-EE Q lcl|NC_019710. 67 VSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVI-SL 145 (424) Q Consensus 67 i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~-~l 145 (424) |+.++..+---.|. ..+ ++ . ....+.+++. + |. .......+..+.+.+|.||+.+..+.+|.+. 
.+ T Consensus 78 Vd~~a~rl~~~Gf~---~~d-~~--~--~~~~l~~i~~-~-N~---ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I 144 (504) T protein:vir:99 78 VDTLARRCNLESFV---WPD-GD--Y--GSIGGPDVWD-E-NF---FATKANNAMVSSLIHGPAFLINTEGGAGEPDSLI 144 (504) T ss_pred HHHHHhhhccceee---CCC-CC--h--hhHHHHHHHH-h-cC---hhhHHHHHHHHHHhhCceeEEEecCCCCCceeEE Confidence 55555543222332 221 11 1 1122444332 2 22 2345678889999999999999988888765 46 Q ss_pred EEeccceEEEEEcCCce------EEEEE-ecCce---EEecHhH------------------------eeEecCcC-CCC Q lcl|NC_019710. 146 LPLQSANMDVKLVGKKV------VYRYQ-RDSEY---ADFSQKE------------------------IFHLKGFG-FTG 190 (424) Q Consensus 146 ~~l~p~~v~~~~~~~~~------~~~~~-~~~~~---~~~~~~e------------------------vih~r~~~-~~~ 190 (424) .+++|..+....|.... .+.+. .++.. ..+.++. |++|.+.. .+. T Consensus 145 ~~~sP~~~~~iyD~~~~~~~~a~~~~~~d~~g~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~gvPvV~~~n~~~~~~ 224 (504) T protein:vir:99 145 HVKSAMQATGEWNSRRNAMDSLLSITSRDAEGHPTGIALYEDGVTVTADMDDDGDWHADVRTHKLGVPVEVLPYKPREDR 224 (504) T ss_pred EEeccceeEEEEeCCCCceeEEEEEEEecCCCeEEEEEEEcCCcEEEEEEcCCceeeeccccCCCCcceEEecccccCcc Confidence 67899988876664321 01100 11100 1122222 44444332 234 Q ss_pred ccccchH----HHHHHHHHHHHHHHHHHHHHHhccCCCceeEE-cCCC-CC--CHHHHHHHHHHHHHHhCCcccCcc-ee Q lcl|NC_019710. 191 LVGLSPI----AFACKSAGVAVAMEDQQRDFFANGAKSPQILS-TGEK-VL--TEQQRSQVEENFKEIAGGPVKKRL-WI 261 (424) Q Consensus 191 ~~G~s~~----~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~-~~~~-~~--~~~~~~~~~~~~~~~~~~~~ag~~-~~ 261 (424) .+|.|.+ ..+.+.+.....-......+|.. |..++. .... .. +......++....+.+.-...... .. 
T Consensus 225 ~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~---p~r~i~G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~ 301 (504) T protein:vir:99 225 PLGSSRITRPVMSLQQRALKGCIRMDGHADVYSF---PQLILLGADAKNFRNKDGSMKPAWQIALARVFALPDDEDEPDA 301 (504) T ss_pred ccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcc---hhhhhccCCccccccccccccchhhhhhhhhhcCCCccccccc Confidence 5676643 33333333333222333344333 333332 1100 00 001112222222222221111111 11 Q ss_pred cCCCceeeeccCChhHH-HHHHHHHHHHHHHHHHhCCCHHHcCCCCCC-CcccccHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019710. 262 LEAGFSTSAIGVTPQDA-EMMASRKFQVSELARFFGVPPHLVGDVEKS-TSWGSGIEQQNLGFLQYTLQPYISRWENSIQ 339 (424) Q Consensus 262 l~~g~~~~~l~~s~~d~-~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~-~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~ 339 (424) -....++-++... ++ .|++..+.....|+..=++|+..||..... ++|...++.+...+ ...+.-..+.+.+.|. T Consensus 302 ~~~~~~~~q~~~~--~l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L-~~ka~~k~~~f~~~l~ 378 (504) T protein:vir:99 302 ARARADVKQFPAS--SPQPHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDL-IAEAEGATDDWSPAFR 378 (504) T ss_pred cCccceeeecCCC--ChHHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHH-HHHHHHHHHHHHHHHH Confidence 1123455555443 23 478899999999999999999999876542 22211221111111 1111222222222222 Q ss_pred hh------ccChhh---hccceeeecchhhhccCHHHHHHHHHHHHhCCCcC--H-HHHHHHhCCCCCC----------- Q lcl|NC_019710. 340 RW------LIPAKD---VGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRT--I-NEMRRTDNLPPLP----------- 396 (424) Q Consensus 340 ~~------L~~~~~---~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t--~-NE~R~~lg~~p~~----------- 396 (424) +- +..... .....+++.+......+..+.++.+.++++.|... . 
.-+.+++|+.+-+ T Consensus 379 ~~~rla~~~~~~~~~~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~~~ei~r~~~e~~~~ 458 (504) T protein:vir:99 379 RSMIRALAIKNGLDRIPPEWKTIDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLGLTPQQAKRALAERRRA 458 (504) T ss_pred HHHHHHHHHhcCCCccccccccceeEecCCCccCHHHHHHHHHHHHhhccccccchHHHHhhcCCCHHHHHHHHHHHHHH Confidence 10 111110 11233455556667778888999999999988532 2 3345566776431 Q ss_pred Cc----Ceeeecccccc------hhhccccCCCccCCC Q lcl|NC_019710. 397 GG----DVAMRQSQYVP------ITDLGTNKEPRNNGA 424 (424) Q Consensus 397 gg----d~~~~~~n~~~------~~~~~~~~~~~~~g~ 424 (424) .+ +.+....+... -+..++-..++.+++ T Consensus 459 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~e~a~~~~~~~ 496 (504) T protein:vir:99 459 SSVSIIEALNRRQQEAATAGEDQDQGAGEPPANEPPAA 496 (504) T ss_pred hhHHHHHHHhcccCCCCCCCCCCCcCCCCCCCCCCCcc Confidence 00 00000000000 000000000000111 No 182 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=97.84 E-value=1.6e-05 Score=46.92 Aligned_cols=385 Identities=13% Similarity=0.024 Sum_probs=167.5 Q ss_pred cCCCccHHHHHHhhccCccccccccccccccccccccc--CCccccHH---HHhhhHHHHHHHHHHHHhhhhCceeEeec Q lcl|NC_019710. 10 LRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYL--GDSSINDE---RILQISTVWRCVSLISTLTACLPLDVFET 84 (424) Q Consensus 10 ~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~---~~~~~~~v~~~i~~ia~~ia~~~~~~~~~ 84 (424) |-|.+=|+.++.......... ... ....+.+-... 
.+..+..+ .-..+.+...+|+..++.+---+|.+ T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r-~~~--~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~--- 74 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPN-LLE--AEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI--- 74 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHH-HHH--HHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhhccCceec--- Confidence 666666777776655332211 000 00000000000 00011110 00112344555665555543333322 Q ss_pred cccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeC------CCCceeEEEEeccceEEEEEc Q lcl|NC_019710. 85 DQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRN------SAGDVISLLPLQSANMDVKLV 158 (424) Q Consensus 85 ~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~------~~G~~~~l~~l~p~~v~~~~~ 158 (424) .++. .....+..++.. | ........+..+.+.+|.||+.+.++ .+|.+ .+.+++|..+.+..| T Consensus 75 ~~d~-----~~~~~l~~i~~~--N---~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~-~i~~~~p~~~~~i~D 143 (480) T protein:vir:78 75 SEDS-----EGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELD 143 (480) T ss_pred CCCc-----hhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCee-EEEEEcccceEEEEc Confidence 2111 112335555532 2 23456678899999999999887653 34555 477888888887776 Q ss_pred CCc---e----EEEEE-ecCc----eEEecHhH-----------------------------eeEecCcC-CCCccccch Q lcl|NC_019710. 159 GKK---V----VYRYQ-RDSE----YADFSQKE-----------------------------IFHLKGFG-FTGLVGLSP 196 (424) Q Consensus 159 ~~~---~----~~~~~-~~~~----~~~~~~~e-----------------------------vih~r~~~-~~~~~G~s~ 196 (424) ... . .|++. ++.. ...+.++. |+||.+.. 
..+.+|.|- T Consensus 144 ~~~~~~~~~~i~~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sd 223 (480) T protein:vir:78 144 PRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE 223 (480) T ss_pred CCCccceEEEEEEEEeecCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEeecccccCCccCccc Confidence 431 1 11110 0000 01122222 24444332 234567776 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC-CCceeeeccCC Q lcl|NC_019710. 197 IA-FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-AGFSTSAIGVT 274 (424) Q Consensus 197 ~~-~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~-~g~~~~~l~~s 274 (424) +. .+...++....+..-......-.+.|..++. +... .+...+.-...+... .++++.++ ++.++.++... T Consensus 224 i~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~-G~~~-~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~ 296 (480) T protein:vir:78 224 ISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-GVTT-DELTNDGENTTLDIY-----YGRILTLASEAAKISEFKAA 296 (480) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhh-CCCc-cccccccccchhhhh-----hhhhccCCCCCceEEecCcc Confidence 54 2334444433333333333333344554543 1111 111011111112211 12345544 34566665543 Q ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHH----HHhh------hccC Q lcl|NC_019710. 275 PQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWEN----SIQR------WLIP 344 (424) Q Consensus 275 ~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~----~l~~------~L~~ 344 (424) ..+ .+.+..+....+++..=++|+..+|....+.+|. .+. .+....+.-.+...+. .|.+ .+.. 
T Consensus 297 ~~~-~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg----~Al-~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~~ 370 (480) T protein:vir:78 297 ELR-NFAEEMEVFRKEAASITGLPPQYLSSSSENPASA----EAI-IATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG 370 (480) T ss_pred CHH-HHHHHHHHHHHHHhcccCCCHHHhccccCchhHH----HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC Confidence 322 3677788889999999999999998643321221 111 1111222222222111 1111 1111 Q ss_pred hhhh-ccceeeecchhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC--CcCeeeecccccchh-------- Q lcl|NC_019710. 345 AKDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGESG--LRTINEMRRTDNLPPLP--GGDVAMRQSQYVPIT-------- 411 (424) Q Consensus 345 ~~~~-~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g--~~t~NE~R~~lg~~p~~--ggd~~~~~~n~~~~~-------- 411 (424) .... ....+++.+......+..+.++.+.+++.+| +++..-+++.+|+.+.+ ..++........+++ T Consensus 371 ~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~e~~~~~~~~~~~~~~~~~~~~~~ 450 (480) T protein:vir:78 371 REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKA 450 (480) T ss_pred CCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccC Confidence 1111 1234555555556677788888888888866 67776778888887532 111100000000000 Q ss_pred --------hccccC-CCccCCC Q lcl|NC_019710. 412 --------DLGTNK-EPRNNGA 424 (424) Q Consensus 412 --------~~~~~~-~~~~~g~ 424 (424) ..++.. +.+..++ T Consensus 451 ~~~~~~~~~~~~~~~~~~~~~~ 472 (480) T protein:vir:78 451 QADATPKPTVTETKTETQTSPS 472 (480) T ss_pred CCccccCCCCCCCCCccCCCcc Confidence 000000 0000000 No 183 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=97.83 E-value=1.6e-05 Score=46.88 Aligned_cols=383 Identities=11% Similarity=0.041 Sum_probs=160.4 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCccccccccccccccccccccc--CCccccHH-----HHhhhHHHHHHHHHHHHh Q lcl|NC_019710. 
1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYL--GDSSINDE-----RILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~-----~~~~~~~v~~~i~~ia~~ 73 (424) ++-|.-.|.-..-.=++..+........+ +. ......+.+.... -+.....+ .-..+.+..-+|+..+.. T Consensus 16 ~~~p~~~~~~~~~~~l~~~l~~~~~~~~~-rl--~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ivd~~a~~ 92 (501) T protein:vir:25 16 VEFPEDSMSREQLGALVADMWRLHISERQ-WL--DRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSLVRDSFAQN 92 (501) T ss_pred ccCCcccCChHHHHHHHHHHHHHHHHHHH-HH--HHHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHHHHHHHHhh Confidence 55555555433222222222222211110 00 0000000000000 00000000 001123445566655554 Q ss_pred hhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceE Q lcl|NC_019710. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v 153 (424) +---.|+ .. +++. ...+..++. + |. ....-..+..+++.+|.||+++.++.+|. .+..++|..+ T Consensus 93 l~~~gf~---~~-d~~~-----~~~l~~i~~-~-N~---~d~~~~~~~~~a~i~G~ay~~v~~de~~~--~i~~~sp~~~ 156 (501) T protein:vir:25 93 LSVVGYR---NA-LAKE-----NDPAWEMWQ-R-NR---MDARQAEVHRPALTYGASYVTVTPTDEGP--VFRTRSPRQI 156 (501) T ss_pred hccccee---cC-Cccc-----hHHHHHHHH-h-cC---hhHHHHHHHHHHhhcCceEEEEecCCCCC--eEEEeccccE Confidence 3222232 22 1111 123444332 2 22 23555678899999999999998888884 3556788888 Q ss_pred EEEE-cCCc---eE----EEE-Eec-Cc---eEEecHhH----------------------------------------- Q lcl|NC_019710. 154 DVKL-VGKK---VV----YRY-QRD-SE---YADFSQKE----------------------------------------- 179 (424) Q Consensus 154 ~~~~-~~~~---~~----~~~-~~~-~~---~~~~~~~e----------------------------------------- 179 (424) .... |... .. |.. ... +. ...+.+.. 
T Consensus 157 ~~iy~D~~~~~~~~~ai~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 236 (501) T protein:vir:25 157 LAVYADPSVDAWPQYALETWVAQKDAKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEG 236 (501) T ss_pred EEEEecCCCCcceeEEEEEEeeccccCcceeEEEecCeeEEEEecCceeeeeccccccccccccccccccccccccccCC Confidence 7554 3211 10 100 000 00 00011111 Q ss_pred -----eeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCc Q lcl|NC_019710. 180 -----IFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGP 254 (424) Q Consensus 180 -----vih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~ 254 (424) |+||.+.....-.|.|.++.+...++....+.........-.+.|..++.- .. .++ .+ .++. T Consensus 237 ~~~vPiv~f~N~~~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G-~~--~~~-~~----~~~~----- 303 (501) T protein:vir:25 237 KPVCPVVRFVNGRDADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVISG-WT--GSK-AE----VLKA----- 303 (501) T ss_pred ccceeeEeccCccccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHhC-CC--CCc-cc----hhhh----- Confidence 233333221123477766655544454444433333333334445444431 11 111 11 1111 Q ss_pred ccCcceecC-CCceeeeccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 255 VKKRLWILE-AGFSTSAIGVTPQDAE-MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYIS 332 (424) Q Consensus 255 ~ag~~~~l~-~g~~~~~l~~s~~d~~-~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~ 332 (424) ..++++.++ ++.++.++.. .+++ +.+..+....+|+..-++|+..++....+. |... ..+....+.-.+. 
T Consensus 304 ~~~~i~~~~~~~~~~~q~~~--~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~-Sg~A-----l~~~~~~l~~ka~ 375 (501) T protein:vir:25 304 SALRVWTFEDPEVKAQAFPP--ASVEPYNLILEEMLQHVAMVAQISPAQVTGKMINV-SAEA-----LAAAEANQQRKLA 375 (501) T ss_pred cccceeccCCCCceEEEecc--cChHHHHHHHHHHHHHHHhhcCCChhhhccccCCh-HHHH-----HHHHHHHHHHHHH Confidence 124577765 4566666543 2333 788899999999999999999998654322 2111 1222222222222 Q ss_pred H----HHHHHhh------hccChhhh-ccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHH-HhCCCCCC---- Q lcl|NC_019710. 333 R----WENSIQR------WLIPAKDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRR-TDNLPPLP---- 396 (424) Q Consensus 333 ~----ie~~l~~------~L~~~~~~-~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~-~lg~~p~~---- 396 (424) . +...|.+ .+....+. ....+++.+......+..+.++.+.++++.|+ +.-.+.. ..|+.+-+ T Consensus 376 ~k~~~f~~~l~~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gi-s~et~~~~~~g~~~~~ie~~ 454 (501) T protein:vir:25 376 AKRESFGESWEQLLRLAAEMDDDPDTAADSGAEVLWRDTEARSFGAVVDGITKLASAGI-PIEHLLSMVPGMTQQTIQAI 454 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCC-CHHHHHHHcCCCCHHHHHHH Confidence 2 2222221 11111111 12345666677778889999999999998875 3333332 34665411 Q ss_pred -------CcCeee---ecccccchhhcc-------ccCCC--ccCCC Q lcl|NC_019710. 397 -------GGDVAM---RQSQYVPITDLG-------TNKEP--RNNGA 424 (424) Q Consensus 397 -------ggd~~~---~~~n~~~~~~~~-------~~~~~--~~~g~ 424 (424) ..+... .+....+..... 
.++++ .++|| T Consensus 455 ~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 501 (501) T protein:vir:25 455 KDSLRGGEVKSLVDKLLSNEPAPVPPPPPQAAAQALNEGGVNGNGGA 501 (501) T ss_pred HHHHHHHhHHHHHHHhhccCcCCCCCCCCCCCccccccccCCCCCCC Confidence 011000 011111111100 01111 11122 No 184 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=97.82 E-value=1.7e-05 Score=46.75 Aligned_cols=396 Identities=11% Similarity=0.069 Sum_probs=182.1 Q ss_pred CCC----CCcccccCCC-ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhh Q lcl|NC_019710. 1 MEE----PKYTIDLRTN-NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 1 ~~~----~~~~~~~~~~-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia 75 (424) +-- ++.--..++. ..=+.++..+..+............. .... ..-+.++....+|+..+.-+- T Consensus 38 ~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~~~~~-----~~~~------~~ki~~n~~k~Ivd~~~~yl~ 106 (501) T protein:vir:27 38 VNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLQFGRRKD-----REMA------DKRAVHNYGRMISKFKTGYLA 106 (501) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccCccCc-----cccc------cceeccchHHHHHHHHhhhhc Confidence 000 0000000000 01134455555443211100000000 0000 001224566677788887777 Q ss_pred hCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEE Q lcl|NC_019710. 76 CLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDV 155 (424) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~ 155 (424) +-|+++--.+.... +.. ...+...+. . 
| ........+..+++.+|.||+++.++.+|.+ .+-.++|..+.+ T Consensus 107 g~p~~~~~~d~~~~-~~~--~~~l~~~~~-~-n---~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~-~i~~~~p~~~~~ 177 (501) T protein:vir:27 107 GNPIRVEYDDNDNN-SQN--DDTIKRIGR-I-N---DIDSHNRTLIRDLSQTGRAYEVIYRNEYDET-RIKRLNPLETFV 177 (501) T ss_pred ccCeeEecCCccch-HHH--HHHHHHHHH-h-c---ChhHHHHHHHHHHhhCCeEEEEEEeCCCCce-EEEEEccceeEE Confidence 77776632221111 100 111222222 1 2 3457778889999999999999999888876 467788888887 Q ss_pred EEcCCc---e----EEEEE-ec-Cc---eEEecHhHeeEecC----------------cC----CCCccccchHHHHHHH Q lcl|NC_019710. 156 KLVGKK---V----VYRYQ-RD-SE---YADFSQKEIFHLKG----------------FG----FTGLVGLSPIAFACKS 203 (424) Q Consensus 156 ~~~~~~---~----~~~~~-~~-~~---~~~~~~~evih~r~----------------~~----~~~~~G~s~~~~~~~~ 203 (424) ..++.. . .|+.. .. +. ...+.++.|.++.. .+ .+...|.|.+..+... T Consensus 178 v~d~~~~~~~~~~ir~~~~~~~~~~~~~~~vyt~~~v~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~l 257 (501) T protein:vir:27 178 IYDNSLEDNSIAAVRYYNRGTLQNAKDVVEIYTNEHIYTLDASDDFNEISVTTHAFGTVPITEFLNNVDGIGDYETELYL 257 (501) T ss_pred EecCCCCCceEEEEEEEEeeecCCcEEEEEEEeCCeEEEEEeCCceeeccccccCCCcccEEEecCCCCCCCchhhhHHH Confidence 766431 1 11111 00 00 01223333322220 00 1233588888888888 Q ss_pred HHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHH Q lcl|NC_019710. 204 AGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMAS 283 (424) Q Consensus 204 i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~ 283 (424) ++....+..-..+.+.....|-.+++-......++....++.. ........+.......+.++..+.....+..+... 
T Consensus 258 iDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 335 (501) T protein:vir:27 258 IDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRT--RLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAY 335 (501) T ss_pred HHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccchhhhhhc--CceeecccccccCCCCCcceeeeeccCCHHHHHHH Confidence 8877777777777777777777666543222222222222211 11111111122233445555555544444556667 Q ss_pred HHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHHHHHHHHHHHHHHHhhhccChhhhcccee Q lcl|NC_019710. 284 RKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHA 353 (424) Q Consensus 284 ~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~ 353 (424) .+...+.|+..-++|..-.+... ++.+....+ ......+...+.-+++.+...++..--. .......+ T Consensus 336 ~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~-~~~d~~~i 413 (501) T protein:vir:27 336 KTRLNRDIHIFTNIPDMSDTNFS-GNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEF-KDFDESLL 413 (501) T ss_pred HHHHHHHHHHHhCCcccCccccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-cccccccc Confidence 78888899999999864443222 222211111 1111233333334443333333211100 01111234 Q ss_pred eecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--C----------cCeeeecccccchhhcc-ccCCCc Q lcl|NC_019710. 354 EHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLP--G----------GDVAMRQSQYVPITDLG-TNKEPR 420 (424) Q Consensus 354 ~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~--g----------gd~~~~~~n~~~~~~~~-~~~~~~ 420 (424) .+.+...+..+..+.++.+.++ .|+++..-+.+++++-..| . .+.-.....+. +..+ ..++.. T Consensus 414 ~v~f~~~~p~n~~e~ad~~~kl--~g~iS~et~l~~l~~v~D~~~E~eri~~E~~e~~~~~~~~~~~--~~~~~~~d~~~ 489 (501) T protein:vir:27 414 KITFTPNLPKSLNEQVSILTGL--GGQVSQETALSLSGLVESPNEELDKINKEVSEIDFKGYSNDFN--EHVGKYTDEVK 489 (501) T ss_pred eEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhHhhhcCccc--cccccccCCCC Confidence 5555677778888888888887 4789988877777653211 1 01001111110 0001 111122 Q ss_pred cCCC Q lcl|NC_019710. 
421 NNGA 424 (424) Q Consensus 421 ~~g~ 424 (424) ++++ T Consensus 490 ~~~~ 493 (501) T protein:vir:27 490 ETHT 493 (501) T ss_pred CCcc Confidence 2222 No 185 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=97.76 E-value=2.2e-05 Score=46.15 Aligned_cols=384 Identities=7% Similarity=-0.022 Sum_probs=178.1 Q ss_pred CCCCCcccccCCCc--cH---------------HHHHHhhccCcccccccccccccccccccccCCcc-cc---HHHHhh Q lcl|NC_019710. 1 MEEPKYTIDLRTNN--GW---------------WARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSS-IN---DERILQ 59 (424) Q Consensus 1 ~~~~~~~~~~~~~~--G~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~---~~~~~~ 59 (424) |...-+-+..+... -+ +.++..+..+..... ..+... +..+.. .. ...-+. T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~------~~~~~~--~~~~~~~~~~~~~~~ki~ 78 (479) T protein:vir:79 7 SETDLIKVQLKKESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVN------NKRRYY--LLDGAKVDDFTKVNNKAI 78 (479) T ss_pred cccceEeeccccCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCccc------cccccc--ccccccccccccCcceee Confidence 22222222222221 11 222222222211100 000000 000000 00 000122 Q ss_pred hHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC Q lcl|NC_019710. 60 ISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA 139 (424) Q Consensus 60 ~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~ 139 (424) ++....+|+..+.-+-+-|+.+-- .+ +. ...+...+.. 
| .-......++...+.+|.+|.++..+.+ T Consensus 79 ~~~~~~Ivd~~~~~l~g~p~~~~~--~~---~~---~~~~~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~ 145 (479) T protein:vir:79 79 NNYHKLLVDQKVGYSVGNPIVFNA--DD---DN---LTKLLNDLLG--E---EFDDTITELYLNASNKGVEWLHPYINRK 145 (479) T ss_pred cchHHHHHHHHHhhhhcCCceecc--CC---HH---HHHHHHHHHh--c---CHHHHHHHHHHHHHhcCeEEEEEEeCCC Confidence 455666777777777777766521 11 11 1122233332 2 2455667788899999999999988888 Q ss_pred CceeEEEEeccceEEEEEcCCc---e-----EEEEEe-cCc----eEEecHhHeeEecCcC------------------- Q lcl|NC_019710. 140 GDVISLLPLQSANMDVKLVGKK---V-----VYRYQR-DSE----YADFSQKEIFHLKGFG------------------- 187 (424) Q Consensus 140 G~~~~l~~l~p~~v~~~~~~~~---~-----~~~~~~-~~~----~~~~~~~evih~r~~~------------------- 187 (424) |.+ .+..++|..+.+..+... . +|.... .+. ...+.++.+.|++.-. T Consensus 146 ~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (479) T protein:vir:79 146 GEF-KYVIIPAEEAIPIWDSKRQRELVAFIRFYYIEDIDGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQ 224 (479) T ss_pred Cce-EEEEEccceeEEEEeCCCCCceEEEEEEEEEeecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccc Confidence 876 477788888887765432 1 111111 111 1123344444432100 Q ss_pred ---------------------CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHH Q lcl|NC_019710. 188 ---------------------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN 246 (424) Q Consensus 188 ---------------------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~ 246 (424) .+...|.|.+..+...++....+.....+.+...+.|-.+++--.....++... 
T Consensus 225 ~~~~~~~~~~~~~~~vPvv~~~nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~----- 299 (479) T protein:vir:79 225 EGHFRINNKEQGWGKVPFIPFKNNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEFID----- 299 (479) T ss_pred cccccccccccCCCcccEEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccccchh----- Confidence 022357788887777777777666666666777777777765322211121111 Q ss_pred HHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH---------- Q lcl|NC_019710. 247 FKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE---------- 316 (424) Q Consensus 247 ~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e---------- 316 (424) ....++++.++++.+++-+..+..+..+.+..+...+.|...-++|..-.+.. ++.+..... T Consensus 300 ------~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--gn~Sg~Ai~~~~~~l~~k~ 371 (479) T protein:vir:79 300 ------NIRYYKSIKVDGGGGVDKLEINIPVEAKKELLDRLEKNIIIFGQGVNPESQNT--GDKSGVALKFLYSLLDLKC 371 (479) T ss_pred ------hhhhccceecCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCccccccccc--cchhHHHHHHHHHHHHHHH Confidence 11234566666666655555444455566777888888888888886433322 222211111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Q lcl|NC_019710. 317 QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLP 396 (424) Q Consensus 317 ~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ 396 (424) ......+...+.-+++.+...+...- ........+.+.+..-+..|.++.++.+.++ .|+++...+.+++++-..+ T Consensus 372 ~~~~~~~~~~l~~~~~li~~~~~~~~--~~~~~~~~i~i~f~~~~p~~~~~~a~~~~kl--~g~iS~et~l~~l~~v~d~ 447 (479) T protein:vir:79 372 SKTEKKFKKAIRELLWFVCEYLKISG--NKSYDYKTVQITFNHSMIINEAEKIDMAAKS--TGIVSDETIVSNHPWVEDV 447 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccC--CCccccccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCH Confidence 11222333333344443333332211 1111123345555666677888889888887 4889988888887653211 Q ss_pred CcCeeeecccccc----hhhc-cccCCCccCC Q lcl|NC_019710. 
397 GGDVAMRQSQYVP----ITDL-GTNKEPRNNG 423 (424) Q Consensus 397 ggd~~~~~~n~~~----~~~~-~~~~~~~~~g 423 (424) ..+.-.+...... .+.. +..++..++. T Consensus 448 ~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~e~ 479 (479) T protein:vir:79 448 NDELERLKKQEDTQKEYDDLIPNNQDGVIDET 479 (479) T ss_pred HHHHHHHHHHHHHHHHHHhccCcccCCCcCcC Confidence 1000000000000 0000 0001111111 No 186 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=97.75 E-value=2.2e-05 Score=46.09 Aligned_cols=383 Identities=13% Similarity=0.068 Sum_probs=179.6 Q ss_pred ccHHHHHHhhccCcccccc-----c-----cccccc------------cc-ccccccCCcccc----HHHHhhhHHHHHH Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTP-----N-----QGSQTG------------PV-SAHGYLGDSSIN----DERILQISTVWRC 66 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~-----~-----~~~~~~------------~~-~~~~~~~~~~~~----~~~~~~~~~v~~~ 66 (424) +|+|+++++++++...... . +..... .+ +-..+.....+. .+...+...-..+ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~slnl~~~i 80 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSVNVTKLA 80 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCccccceeecchHHHH Confidence 8888888887765321100 0 000000 00 000111100000 0011122233344 Q ss_pred HHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEE Q lcl|NC_019710. 67 VSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLL 146 (424) Q Consensus 67 i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~ 146 (424) ++..|+-+..=|..+--. ++ .....+.++|.. | .........+.+.+..|.+++.+..+. |. ..+. 
T Consensus 81 ~~~~A~ll~~e~~~i~~~---d~----~~~e~l~~i~~~--n---~f~~~~~~~~e~a~a~G~~~~k~~~D~-~~-~~i~ 146 (505) T protein:vir:79 81 SAKLASLIFNEQCQVTVS---DE----TANDFLDDVFQQ--N---DFYTTFEEKLEEWIALGSGCVRPYVDS-GK-IKLA 146 (505) T ss_pred HHHHHhhhcCCCceeecC---Ch----HHHHHHHHHHHh--c---cHHHHHHHHHHHHhhcCCeEEEEEEeC-Cc-eEEE Confidence 455555444333332111 10 011223333321 1 134555667788888898888776653 33 3455 Q ss_pred EeccceEEEE-EcCCc------------------eEEE----EE-ecCc----------------eEEecH--------- Q lcl|NC_019710. 147 PLQSANMDVK-LVGKK------------------VVYR----YQ-RDSE----------------YADFSQ--------- 177 (424) Q Consensus 147 ~l~p~~v~~~-~~~~~------------------~~~~----~~-~~~~----------------~~~~~~--------- 177 (424) .++|.++.+. .+.+. .+|. +. .++. +..++- T Consensus 147 ~v~ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l 226 (505) T protein:vir:79 147 WATADQVYPLQADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGL 226 (505) T ss_pred EEcCCeeEEEEEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhccccccc Confidence 6666666543 22211 0110 00 0000 000110 Q ss_pred -h----------HeeEecCcCC-----CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeE-----EcCCCCCC Q lcl|NC_019710. 178 -K----------EIFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQIL-----STGEKVLT 236 (424) Q Consensus 178 -~----------evih~r~~~~-----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl-----~~~~~~~~ 236 (424) + -+.||+.+.. ..+.|+|.+.-+...++.....-....+-|+.|... .++ +....-.. T Consensus 227 ~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~-i~v~~~~l~~~~~~~~ 305 (505) T protein:vir:79 227 EPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRR-LIVPAEWLKTGSSYGG 305 (505) T ss_pred CcceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccc-eeechHHhcccCCCCc Confidence 1 1335554322 235799999999999888777777777777765543 333 21111000 Q ss_pred HH---HHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccc Q lcl|NC_019710. 
237 EQ---QRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGS 313 (424) Q Consensus 237 ~~---~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~ 313 (424) +. ....++.....+.+ +..-+++..++.++....+-++.+..+...++|+...|+++..++....+.. T Consensus 306 ~~~~~~~~~fd~~~~~y~~------~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~--- 376 (505) T protein:vir:79 306 QASETHPPMFDPDETVYQA------MYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQ--- 376 (505) T ss_pred ccccccccCCCccceeeee------ccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccc--- Confidence 00 00000000000110 1111223457777777778888999999999999999999999987655432 Q ss_pred cHHHH-------------HHHHHHHHHHHHHHHHHHHHhhhccCh-------hhhccceeeecchhhhccCHHHHHHHHH Q lcl|NC_019710. 314 GIEQQ-------------NLGFLQYTLQPYISRWENSIQRWLIPA-------KDVGRIHAEHNLDGLLRGDSASRAAFMK 373 (424) Q Consensus 314 n~e~~-------------~~~f~~~tl~P~~~~ie~~l~~~L~~~-------~~~~~~~~~f~~~~~~~~d~~~~~~~~~ 373 (424) ++.+. ....++.+|..++..|........+.. .....+.+.|+++.-+..|.++..+... T Consensus 377 TAtei~s~~~~l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~ 456 (505) T protein:vir:79 377 TATEVVTNNSQTYQTRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADL 456 (505) T ss_pred hHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHH Confidence 22222 122234444444444433322111111 1112245778888888899999999999 Q ss_pred HHHhCCCcCHHHHHHHh-CCCCCCCcCeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 374 AMGESGLRTINEMRRTD-NLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 374 ~~~~~g~~t~NE~R~~l-g~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) +++.+|+|+.-+++... |++. +..++.+.. 
+..+......+..+-|+ T Consensus 457 ~~v~~Gi~s~e~~l~~~~~~~e-eea~~el~r---i~~E~~~~~p~~~~~gg 504 (505) T protein:vir:79 457 QAVQAQVMPKKQFLMRNYGLDE-EEADEWLAQ---IDAENSTAEPEFNQFGG 504 (505) T ss_pred HHHHcCCCCHHHHHHhcCCCCh-HHHHHHHHH---HHHhccccCCCchhccC Confidence 99999999998887653 5432 111111100 00011111122222233 No 187 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=97.73 E-value=2.4e-05 Score=45.88 Aligned_cols=401 Identities=11% Similarity=-0.018 Sum_probs=170.7 Q ss_pred CCC-CCcccc-cCC-CccHHHHHHhhccCcccccccccccccccccccccC--CccccHH--H-HhhhHHHHHHHHHHHH Q lcl|NC_019710. 1 MEE-PKYTID-LRT-NNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLG--DSSINDE--R-ILQISTVWRCVSLIST 72 (424) Q Consensus 1 ~~~-~~~~~~-~~~-~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~--~-~~~~~~v~~~i~~ia~ 72 (424) |-+ .-+++. |-. ..-++.++...+........ .....+.+..... +..+.++ . .....+..-||+.++. T Consensus 1 ~~~~~~~~~~gl~~~~~~~~~~L~~~~~~~~~~~~---~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw~~~~Vd~~a~ 77 (474) T protein:vir:81 1 MIQQQTVRIPSLSNDENALINGLLAQIENLRWKNL---LRTSYYENKRTIQYVGTLIPPQYFNLGLVLGWTGKAVDALAR 77 (474) T ss_pred CcCCCcCcCCCCChhHHHHHHHHHHHHHHHhhHHH---HHHHHhccCCChhhccccccHHHHHHHhhcChHHHHHHHHHh Confidence 221 111111 111 13456666555443322111 0000110000000 0011111 0 0122344556666666 Q ss_pred hhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce-eEEEEeccc Q lcl|NC_019710. 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV-ISLLPLQSA 151 (424) Q Consensus 73 ~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~-~~l~~l~p~ 151 (424) .+.=-.|++ . +++. ....+..++. +.+ .......+..+.+.+|.||+.+..+.+|.+ ..+.+++|. 
T Consensus 78 rl~~~Gf~~---~-d~~~----~~~~l~~iw~-~N~----ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~ 144 (474) T protein:vir:81 78 RCNLEGFVW---P-DGDL----DSLGGTEVVD-DNH----LLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDAS 144 (474) T ss_pred hhcccceEC---C-CCCc----cchHHHHHHH-hcC----hhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccc Confidence 444334432 2 1111 1123444442 222 234556778899999999999988877765 356788888 Q ss_pred eEEEEEcCCceE-------EEEEecCce---EEecHhH-------------------------eeEecCc-CCCCccccc Q lcl|NC_019710. 152 NMDVKLVGKKVV-------YRYQRDSEY---ADFSQKE-------------------------IFHLKGF-GFTGLVGLS 195 (424) Q Consensus 152 ~v~~~~~~~~~~-------~~~~~~~~~---~~~~~~e-------------------------vih~r~~-~~~~~~G~s 195 (424) .+....|..... +....++.. ..|.+++ |++|.+. ..+..+|.| T Consensus 145 ~~~~~~D~~~~~~~~al~~~~~~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~~~gvPvV~~~n~~~~~~~~G~s 224 (474) T protein:vir:81 145 EATGEWNRRRRGLNNLLSIIDKDKEGKVLSLALYLDNETVTAQRDKATLKWQVDRDEHVYGVPAQVLPYKPAPKRPFGQS 224 (474) T ss_pred eEEEEEeCCCCcceeeeEEEEEcCCCcEEEEEEEeCCcEEEEEEcCccceeeeccCCCCCCcceEEecccccccCcCCcc Confidence 888666643210 000001100 1111222 4444433 234456777 Q ss_pred hH----HHHHHHHHHHHHHHHHHHHHHhccCCCceeEE-cCCCCCC---HHHHHHHHHHHHHHhCCc-ccCcceecCCCc Q lcl|NC_019710. 196 PI----AFACKSAGVAVAMEDQQRDFFANGAKSPQILS-TGEKVLT---EQQRSQVEENFKEIAGGP-VKKRLWILEAGF 266 (424) Q Consensus 196 ~~----~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~-~~~~~~~---~~~~~~~~~~~~~~~~~~-~ag~~~~l~~g~ 266 (424) .+ ..+.+.+.-...-......|+.. |.-++. ....... ......++........-. +..+......+. 
T Consensus 225 ~i~e~v~~l~da~~r~~~~~~~~~e~~a~---pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~ 301 (474) T protein:vir:81 225 RITKPMMGLQDAGVRELARREGHMDVFSY---PEFWLLGADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQLARA 301 (474) T ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHhcc---hhheeecCChhhcccccccccchhhhhHHHHhcCCCcccccccccccc Confidence 43 34444444433434444455443 443332 1111111 111222333332222111 111112222344 Q ss_pred eeeeccCChhHH-HHHHHHHHHHHHHHHHhCCCHHHcCCCCCCC-cccccHHHHHHHHHHHHHHHHHHHHHHHHhh---- Q lcl|NC_019710. 267 STSAIGVTPQDA-EMMASRKFQVSELARFFGVPPHLVGDVEKST-SWGSGIEQQNLGFLQYTLQPYISRWENSIQR---- 340 (424) Q Consensus 267 ~~~~l~~s~~d~-~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~-~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~---- 340 (424) ++-++... ++ .|.+..+.....+|..-+||+..||.....| .|...+..+...+... +.-....+.+.+.+ T Consensus 302 ~~~q~~~a--~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~k-ae~k~~~fg~~l~~~~rl 378 (474) T protein:vir:81 302 DVKQFPAA--SPDAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAE-AEGAVDDFTPALRKAFIR 378 (474) T ss_pred cccccCCC--ChhHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH Confidence 55555543 23 3888999999999999999999998653222 2211121111111111 11111222222221 Q ss_pred hc--cChh---h--hccceeeecchhhhccCHHHHHHHHHHHHhCCC--cCHHHHHHHhCCCCCC---CcCeeeeccccc Q lcl|NC_019710. 341 WL--IPAK---D--VGRIHAEHNLDGLLRGDSASRAAFMKAMGESGL--RTINEMRRTDNLPPLP---GGDVAMRQSQYV 408 (424) Q Consensus 341 ~L--~~~~---~--~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~--~t~NE~R~~lg~~p~~---ggd~~~~~~n~~ 408 (424) -+ .... + .....+++.+.+....+..+.++.+.+++++|. ....=+++++|+.+-+ .-++........ 
T Consensus 379 a~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t~~~i~~~~~~~~~~~~~~ 458 (474) T protein:vir:81 379 ALAMKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAAVPWLAETEVGLELIGLTPQQARRAMADKRRVQGRG 458 (474) T ss_pred HHHHhCCCCccccchhhccceeEecCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcCCCHHHHHHHHHHHHHHhHHH Confidence 11 0110 0 112344555566667788899999999999873 3345568889998642 111111011122 Q ss_pred chhhccccCCCccCCC Q lcl|NC_019710. 409 PITDLGTNKEPRNNGA 424 (424) Q Consensus 409 ~~~~~~~~~~~~~~g~ 424 (424) +++.... .+.....| T Consensus 459 ~~~~l~~-~~~~~~~a 473 (474) T protein:vir:81 459 TLQALID-RSNNGATA 473 (474) T ss_pred HHHHHHh-cCCCCCCC Confidence 2332211 11111111 No 188 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=97.70 E-value=2.7e-05 Score=45.65 Aligned_cols=392 Identities=9% Similarity=0.002 Sum_probs=178.0 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCccccc---cccc--ccccccccc--cccCCccccHHHHhhhHHHHHHHHHHHHh Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVT---PNQG--SQTGPVSAH--GYLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~---~~~~--~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ 73 (424) ++.- +..+=-+.+...+..+..... .... ......... ......+ +. =+.++....+|+..+.- T Consensus 24 i~~~------~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--ki~~n~~~~ivd~~~~y 94 (474) T protein:vir:94 24 IESH------KDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSV-NN--KLNNSFDSEIVDTRVGY 94 (474) T ss_pred HHHh------hhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCc-cc--ccccchHHHHHHhHhhh Confidence 1110 011111122222222110000 0000 000000000 0000000 00 01234556677777777 Q ss_pred hhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceE Q lcl|NC_019710. 
74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v 153 (424) +-+-|+.+--.......++. ...+...+. ...-......+..+++.+|.||.++..+.+|.+ .+..++|..+ T Consensus 95 l~g~pv~~~~~~~~~~~e~~--~~~l~~~~~-----~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~~~~i~p~~~ 166 (474) T protein:vir:94 95 LHGVPVTYDLDENAEKNEKL--KKFITNFAI-----RNSVDDEDSEIGKMAAICGYGARLAYIDTNGDI-RIKNIDPYNV 166 (474) T ss_pred eeccceeEeeCCCCcchHHH--HHHHHHHHh-----hcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCee-EEEEEcccce Confidence 76777775321111111110 112222222 123456777888999999999999988888875 5777888887 Q ss_pred EEEEcCCce------EEEEEecC--c---e-EEecHhHeeEecCc------------------C----CCCccccchHHH Q lcl|NC_019710. 154 DVKLVGKKV------VYRYQRDS--E---Y-ADFSQKEIFHLKGF------------------G----FTGLVGLSPIAF 199 (424) Q Consensus 154 ~~~~~~~~~------~~~~~~~~--~---~-~~~~~~evih~r~~------------------~----~~~~~G~s~~~~ 199 (424) .+..++... +|...... . . ..+.+..+++++.. + .+...|.|-+.. T Consensus 167 ~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~ 246 (474) T protein:vir:94 167 IFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFGVPNNKEMIGDAEK 246 (474) T ss_pred EEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEEecCCCCCCCchHH Confidence 766554321 11111100 0 0 11122222222210 0 122357787777 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHH Q lcl|NC_019710. 200 ACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAE 279 (424) Q Consensus 200 ~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~ 279 (424) +...++....+..-..+.+...+.|-.+++- .... ++....+ ...+.+.+.+++.+++-+.....+.. 
T Consensus 247 v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g-~~~~-~~~~~~~----------~~~~~i~~~~~~~~~~~l~~~~~~~~ 314 (474) T protein:vir:94 247 VIHLIDAYDLTMSDASSEISQTRLAYLVLRG-MGMS-EEMIQET----------QKSGAFELFDKDMDVKYLTKDVNDTM 314 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcchhhhcc-CCCC-chhhhhh----------hhcceeEecCCCCceeEEeccCCHHH Confidence 7777776666666666666666667666642 2222 2221111 11233445566666666655444555 Q ss_pred HHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHHHHHHHHHHHHHHHhhhccChhhhc Q lcl|NC_019710. 280 MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG 349 (424) Q Consensus 280 ~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~ 349 (424) +.+..+...+.|...-++|..-.+... ++.+..... ......+...+.-.++.|...+..+-....... T Consensus 315 ~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~ 393 (474) T protein:vir:94 315 IENHLDRIEKNIMRFAKSVNFNSDEFN-GNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDS 393 (474) T ss_pred HHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccc Confidence 677788888999999999864433222 222211111 111223444444444444444443211111111 Q ss_pred cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCee-----eecccccchhhccccCCCccC Q lcl|NC_019710. 350 RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG--GDVA-----MRQSQYVPITDLGTNKEPRNN 422 (424) Q Consensus 350 ~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g--gd~~-----~~~~n~~~~~~~~~~~~~~~~ 422 (424) ...+++.+..-+..|..+.++.+.++. |+++..-+.+++++-+.+. .+.. -..............++++++ T Consensus 394 ~~~i~~~f~~~~p~d~~e~a~~~~kl~--g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:94 394 YLNLIFKFTRNIPVNKLEESQVLINLK--GQVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDIDEGDANDKSQNN 471 (474) T ss_pred cccceEEeCCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccCCCcCCCCccc Confidence 123455556667788899999998884 8999988888887643211 0000 000011111111111222222 Q ss_pred CC Q lcl|NC_019710. 
423 GA 424 (424) Q Consensus 423 g~ 424 (424) .+ T Consensus 472 ~s 473 (474) T protein:vir:94 472 QS 473 (474) T ss_pred cC Confidence 22 No 189 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=97.70 E-value=2.7e-05 Score=45.65 Aligned_cols=392 Identities=9% Similarity=0.002 Sum_probs=178.0 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCccccc---cccc--ccccccccc--cccCCccccHHHHhhhHHHHHHHHHHHHh Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVT---PNQG--SQTGPVSAH--GYLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~---~~~~--~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ 73 (424) ++.- +..+=-+.+...+..+..... .... ......... ......+ +. =+.++....+|+..+.- T Consensus 24 i~~~------~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--ki~~n~~~~ivd~~~~y 94 (474) T protein:vir:10 24 IESH------KDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSV-NN--KLNNSFDSEIVDTRVGY 94 (474) T ss_pred HHHh------hhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCc-cc--ccccchHHHHHHhHhhh Confidence 1110 011111122222222110000 0000 000000000 0000000 00 01234556677777777 Q ss_pred hhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceE Q lcl|NC_019710. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v 153 (424) +-+-|+.+--.......++. ...+...+. 
...-......+..+++.+|.||.++..+.+|.+ .+..++|..+ T Consensus 95 l~g~pv~~~~~~~~~~~e~~--~~~l~~~~~-----~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~-~~~~i~p~~~ 166 (474) T protein:vir:10 95 LHGVPVTYDLDENAEKNEKL--KKFITNFAI-----RNSVDDEDSEIGKMAAICGYGARLAYIDTNGDI-RIKNIDPYNV 166 (474) T ss_pred eeccceeEeeCCCCcchHHH--HHHHHHHHh-----hcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCee-EEEEEcccce Confidence 76777775321111111110 112222222 123456777888999999999999988888875 5777888887 Q ss_pred EEEEcCCce------EEEEEecC--c---e-EEecHhHeeEecCc------------------C----CCCccccchHHH Q lcl|NC_019710. 154 DVKLVGKKV------VYRYQRDS--E---Y-ADFSQKEIFHLKGF------------------G----FTGLVGLSPIAF 199 (424) Q Consensus 154 ~~~~~~~~~------~~~~~~~~--~---~-~~~~~~evih~r~~------------------~----~~~~~G~s~~~~ 199 (424) .+..++... +|...... . . ..+.+..+++++.. + .+...|.|-+.. T Consensus 167 ~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~ 246 (474) T protein:vir:10 167 IFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEHLFDYNPLFGVPNNKEMIGDAEK 246 (474) T ss_pred EEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccCCCCccceEEecCCCCCCCchHH Confidence 766554321 11111100 0 0 11122222222210 0 122357787777 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHH Q lcl|NC_019710. 200 ACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAE 279 (424) Q Consensus 200 ~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~ 279 (424) +...++....+..-..+.+...+.|-.+++- .... ++....+ ...+.+.+.+++.+++-+.....+.. 
T Consensus 247 v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g-~~~~-~~~~~~~----------~~~~~i~~~~~~~~~~~l~~~~~~~~ 314 (474) T protein:vir:10 247 VIHLIDAYDLTMSDASSEISQTRLAYLVLRG-MGMS-EEMIQET----------QKSGAFELFDKDMDVKYLTKDVNDTM 314 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcchhhhcc-CCCC-chhhhhh----------hhcceeEecCCCCceeEEeccCCHHH Confidence 7777776666666666666666667666642 2222 2221111 11233445566666666655444555 Q ss_pred HHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHHHHHHHHHHHHHHHhhhccChhhhc Q lcl|NC_019710. 280 MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVG 349 (424) Q Consensus 280 ~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~ 349 (424) +.+..+...+.|...-++|..-.+... ++.+..... ......+...+.-.++.|...+..+-....... T Consensus 315 ~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~ 393 (474) T protein:vir:10 315 IENHLDRIEKNIMRFAKSVNFNSDEFN-GNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDS 393 (474) T ss_pred HHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccc Confidence 677788888999999999864433222 222211111 111223444444444444444443211111111 Q ss_pred cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCee-----eecccccchhhccccCCCccC Q lcl|NC_019710. 350 RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG--GDVA-----MRQSQYVPITDLGTNKEPRNN 422 (424) Q Consensus 350 ~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g--gd~~-----~~~~n~~~~~~~~~~~~~~~~ 422 (424) ...+++.+..-+..|..+.++.+.++. |+++..-+.+++++-+.+. .+.. -..............++++++ T Consensus 394 ~~~i~~~f~~~~p~d~~e~a~~~~kl~--g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:10 394 YLNLIFKFTRNIPVNKLEESQVLINLK--GQVSERTRLGQSQLVDDVDYELDEMEKESLEFNDKLPDIDEGDANDKSQNN 471 (474) T ss_pred cccceEEeCCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccCCCcCCCCccc Confidence 123455556667788899999998884 8999988888887643211 0000 000011111111111222222 Q ss_pred CC Q lcl|NC_019710. 
423 GA 424 (424) Q Consensus 423 g~ 424 (424) .+ T Consensus 472 ~s 473 (474) T protein:vir:10 472 QS 473 (474) T ss_pred cC Confidence 22 No 190 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=97.69 E-value=2.8e-05 Score=45.51 Aligned_cols=351 Identities=11% Similarity=0.071 Sum_probs=159.5 Q ss_pred cccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHH----HhhhHHHHHHHHHHHHhhhhCceeEee Q lcl|NC_019710. 8 IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDER----ILQISTVWRCVSLISTLTACLPLDVFE 83 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~v~~~i~~ia~~ia~~~~~~~~ 83 (424) |+.-.+ =..++..+..+...... .+. .+.++. -+...+..-+|+.++..+. +.=++ T Consensus 1 l~~~~~--r~~~~~~yY~g~~~~~~--------------~~~-~~p~~~~~~~~~v~nw~~~~Vds~a~rl~---~~Gf~ 60 (410) T protein:vir:95 1 MNLYQS--RVNLRYKHYAMQHYEAP--------------TGI-TIPAHIRAKYQAVLGWAAKGVDSLADRLI---FRAFA 60 (410) T ss_pred CCcchh--hHHHHHHHhcCCCCccc--------------cch-hccHHHHhHHHhhcchhHHHHHHhHhhhc---ccccc Confidence 222211 12222223322211100 000 000000 0112344445555554332 22222 Q ss_pred ccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCce- Q lcl|NC_019710. 84 TDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKV- 162 (424) Q Consensus 84 ~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~- 162 (424) . . +..+.+.+. + | +.......+..+.+.+|.||+.+..+.+|.| .+.+++|..+....|.... 
T Consensus 61 ~-~---------d~~l~~i~~-~-N---~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~-~i~~~sP~~~~~i~Dp~~~~ 124 (410) T protein:vir:95 61 N-D---------DFNVTEIFD-R-N---NPDIFFDSAILSALIGSCSFVYISKGEDDEV-RLQVIESSNATGVIDPITGL 124 (410) T ss_pred C-C---------CchHHHHHh-h-c---ChHHHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEcccceEEEEeCCCCc Confidence 1 1 122444443 2 2 2335566788999999999999998888876 5778889988877765321 Q ss_pred -E--EEEE--ecC-c---eEEecHhHe---------------------eEecCcC-CCCccccc----hHHHHHHHHHHH Q lcl|NC_019710. 163 -V--YRYQ--RDS-E---YADFSQKEI---------------------FHLKGFG-FTGLVGLS----PIAFACKSAGVA 207 (424) Q Consensus 163 -~--~~~~--~~~-~---~~~~~~~ev---------------------ih~r~~~-~~~~~G~s----~~~~~~~~i~~~ 207 (424) . +.+. ... . ...+.++.+ ++|.+.. .+..+|.| ++..+.+.+.-. T Consensus 125 ~~~al~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~ 204 (410) T protein:vir:95 125 LVEGYAVLARDDYNRPTLEAYFEPNATHFIPKDGEPYSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRT 204 (410) T ss_pred eEEEEEEEEecCCCeEEEEEEEeCCcEEEEeeCCccccccCCCCCcceEEecccccCCccCCccccchhHHHHHHHHHHH Confidence 1 1111 110 0 112223333 4443321 23456766 344555555544 Q ss_pred HHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCC-----CceeeeccCChhHHHHHH Q lcl|NC_019710. 208 VAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA-----GFSTSAIGVTPQDAEMMA 282 (424) Q Consensus 208 ~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~-----g~~~~~l~~s~~d~~~~e 282 (424) ..-......+|.+ |.-++.-- +. +....+ .|+.. .++++.++. 
+.++.++....-+ .|++ T Consensus 205 ~~~~~~~~e~~a~---pqr~i~G~-d~-d~~~~~----~~~~~-----~~~i~~~~~~~~~~~~~v~q~~~~~l~-~~~~ 269 (410) T protein:vir:95 205 LERADITAEFYSW---PQKYILGL-DP-DAEPME----KWKAT-----VSSLLTISSSDKGVKPSVGQFTTASMS-PFTE 269 (410) T ss_pred HHHHHHHHHHhcc---hhheeecc-CC-CCCcCc----hhhhh-----hhhheeccCCCCCCcceEEecCCCChH-HHHH Confidence 4444455555544 54444311 11 111111 12221 234666543 2456555443222 4889 Q ss_pred HHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhh------hccChhh---hcccee Q lcl|NC_019710. 283 SRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQR------WLIPAKD---VGRIHA 353 (424) Q Consensus 283 ~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~------~L~~~~~---~~~~~~ 353 (424) ..+.....||..=++|+..+|....+.+|....+.+...+.. .+.-..+.+.+.+.+ .+..... .....+ T Consensus 270 ~l~~l~~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~-ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~ 348 (410) T protein:vir:95 270 QLRTAAAGFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRL-AGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVRT 348 (410) T ss_pred HHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCCCccccccee Confidence 999999999999999999999765432221111111111111 111111112222211 1111111 011233 Q ss_pred eecchh---hhccCHHHHHHHHHHHHhC--CCcCHHHHHHHhCCCCCCCcCeeeecccccchhhccccCC Q lcl|NC_019710. 354 EHNLDG---LLRGDSASRAAFMKAMGES--GLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKE 418 (424) Q Consensus 354 ~f~~~~---~~~~d~~~~~~~~~~~~~~--g~~t~NE~R~~lg~~p~~ggd~~~~~~n~~~~~~~~~~~~ 418 (424) ++.+.. ....+....++.+.+++++ |+....-+++.+|+.+-+- .. 
.-......+.+ T Consensus 349 ~v~W~p~~d~~~~s~a~~aDa~~Kl~~a~~g~~~~~~~~~~lg~~~~~~---~~-----~~~~e~~~~g~ 410 (410) T protein:vir:95 349 AVKWEPLFEADANTMTMIGDGVVKLNQALPGYINAETIRDLTGIAGDMS---AK-----PVVSEGGSNGE 410 (410) T ss_pred eEEeeecCCcchhhHHHHHHHHHHHHHhccCCccHHHHHHhcCCChHHH---HH-----HHHHHHHhCCC Confidence 333332 2233567788888889888 7888888999999975321 00 00011111111 No 191 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=97.68 E-value=2.9e-05 Score=45.45 Aligned_cols=395 Identities=10% Similarity=0.047 Sum_probs=171.5 Q ss_pred CCCCCcccccCCC-----------ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTN-----------NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSL 69 (424) Q Consensus 1 ~~~~~~~~~~~~~-----------~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ 69 (424) ..+.-.++..+.= +--+.++..+..+............. ..... .-+.+......|+. T Consensus 33 ~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~-----~~~~~------~ki~~n~~k~Iv~~ 101 (511) T protein:vir:99 33 GTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKE-----EYMAD------NRVAHDYASYISDF 101 (511) T ss_pred hhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccc-----cccCc------ceeecchHHHHHHH Confidence 1111111111000 01122222232222111000000000 00000 00113445556666 Q ss_pred HHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEec Q lcl|NC_019710. 70 ISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQ 149 (424) Q Consensus 70 ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~ 149 (424) .+.-+-+-|+.+-- .+. + ....+..++.. 
| ........+..+++.+|.||.++.++.+|.+ .+..++ T Consensus 102 ~~~yl~g~p~~~~~--~d~---~--~~~~l~~~~~~--n---~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~-~i~~~~ 168 (511) T protein:vir:99 102 INGYFLGNPIQYQD--DDK---D--VLEAIEAFNDL--N---DVESHNRSLGLDLSIYGKAYELMIRNQDDET-RLYKSD 168 (511) T ss_pred HHhhhcccCceeec--Cch---H--HHHHHHHHHhh--c---CHhHHHHHHHHHHHhcCeeEEEEEeCCCCce-EEEEEc Confidence 66666666776521 111 1 11233443332 2 2446667788999999999999999888875 577889 Q ss_pred cceEEEEEcCCc---eE---EEEEe-----cC-c----eEEecHhHeeEecCcC-------------------------- Q lcl|NC_019710. 150 SANMDVKLVGKK---VV---YRYQR-----DS-E----YADFSQKEIFHLKGFG-------------------------- 187 (424) Q Consensus 150 p~~v~~~~~~~~---~~---~~~~~-----~~-~----~~~~~~~evih~r~~~-------------------------- 187 (424) |..+.+..+... .. .+|.. .. . ...+.++.+.+++... T Consensus 169 p~~~~~vyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 248 (511) T protein:vir:99 169 AMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEF 248 (511) T ss_pred cceeEEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCccceEEe Confidence 999887776432 11 11110 00 0 1234555555543210 Q ss_pred CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHH---HHHHhCCcccCcceecCC Q lcl|NC_019710. 188 FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEEN---FKEIAGGPVKKRLWILEA 264 (424) Q Consensus 188 ~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~---~~~~~~~~~ag~~~~l~~ 264 (424) .+...|.|.+..+...++....+..-..+.+...+.|-.+++-.......+ ....++. +....... 
.+.....++ T Consensus 249 ~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~-~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 326 (511) T protein:vir:99 249 SNNERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-VRKQKEANVLFLEPTVYA-DSEGRETEG 326 (511) T ss_pred cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCchh-hcccccccceeccccccc-ccccccCCC Confidence 012357888887777777666666555555555566655554322222221 1111110 00000000 111223344 Q ss_pred CceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHH----------HHHHHHHHHHHHHHH Q lcl|NC_019710. 265 GFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN----------LGFLQYTLQPYISRW 334 (424) Q Consensus 265 g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~----------~~f~~~tl~P~~~~i 334 (424) |.+++.+.....+..+....+...+.|+..-++|..-.+... ++.+.....-.. ...+...+.-.++.| T Consensus 327 ~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-gn~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li 405 (511) T protein:vir:99 327 SVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-GTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLL 405 (511) T ss_pred CcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 556665555444555667788888999999999875443322 222211111111 111222222222222 Q ss_pred HHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC----------cCee--- Q lcl|NC_019710. 335 ENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG----------GDVA--- 401 (424) Q Consensus 335 e~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g----------gd~~--- 401 (424) ...+...--.........+++.+..-+..|..+.++.+.++. |+++.--+.+++++-+.+. .+.. 
T Consensus 406 ~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl~--GiiS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~ 483 (511) T protein:vir:99 406 ETILKNTRSIDVSKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKA 483 (511) T ss_pred HHHHHhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHH Confidence 222221110000111122445555667788888998888884 8899888888875532111 0000 Q ss_pred eecccccchhhccccCCCccCCC Q lcl|NC_019710. 402 MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 402 ~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) ..+....+-..-.+.++++.+.. T Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~ 506 (511) T protein:vir:99 484 QKNMYQDPRNINDDEQDDSTKDS 506 (511) T ss_pred hhcccccCCCCCCCCCCCCCcCc Confidence 00000000000000111111111 No 192 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=97.68 E-value=3e-05 Score=45.40 Aligned_cols=387 Identities=8% Similarity=-0.025 Sum_probs=171.4 Q ss_pred CCCCCcccccCC--------------CccHHHHHHhhccCccc--ccc------cccccccccccccccCCccccHHHHh Q lcl|NC_019710. 1 MEEPKYTIDLRT--------------NNGWWARLKSWFVGGRL--VTP------NQGSQTGPVSAHGYLGDSSINDERIL 58 (424) Q Consensus 1 ~~~~~~~~~~~~--------------~~G~~~~~~~~~~~~~~--~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~ 58 (424) |++=-||+.-.- ..-++.++...+..... ... ....................-+..-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki 80 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKPDWRM 80 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhcccccccccccchhc Confidence 554444433211 12233333322211100 000 00000000000000000000000112 Q ss_pred hhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC Q lcl|NC_019710. 
59 QISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS 138 (424) Q Consensus 59 ~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~ 138 (424) .++....+++..+.-+-+-|+.+--. +. .....+...+. | ........+..++..+|.||..+-.+. T Consensus 81 ~~n~~~~Ivd~~~~~l~g~p~~~~~~--d~-----~~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~~y~d~ 147 (474) T protein:vir:96 81 FTNYHQNLVDQKVAYAVANPVTFSSD--DD-----KSLKTIQEVLN---H---KWDDKLVDILTAASNKGIEWLQPYIDE 147 (474) T ss_pred ccchHHHHHHhhhhhhcccCceeecC--ch-----HHHHHHHHHHh---c---CHHHHHHHHHHHHHhcCeeEEEEEecC Confidence 24556667777777776677665211 11 11233444442 2 234455667788999999999988888 Q ss_pred CCceeEEEEeccceEEEEEcCCc---e---EEEEEecCc--eEEecHhHeeEecCc------------------------ Q lcl|NC_019710. 139 AGDVISLLPLQSANMDVKLVGKK---V---VYRYQRDSE--YADFSQKEIFHLKGF------------------------ 186 (424) Q Consensus 139 ~G~~~~l~~l~p~~v~~~~~~~~---~---~~~~~~~~~--~~~~~~~evih~r~~------------------------ 186 (424) +|.+ .+..++|..+.+..++.. . ...|...+. ...+..+.|.|++.. T Consensus 148 ~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~vr~~~~~~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (474) T protein:vir:96 148 NGEF-KTFRVPAEQAIPIWTNKERDTLKAFIRYYRLDGAERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNK 226 (474) T ss_pred CCce-EEEEEcccceEEEEcCCCCCceEEEEEEEeecCceEEEEEeCCeEEEEEecCCceeecccccccccccccccccc Confidence 8876 488889999887766421 1 111111111 112222233222110 Q ss_pred -C----------CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcc Q lcl|NC_019710. 187 -G----------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPV 255 (424) Q Consensus 187 -~----------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ 255 (424) + .+...|.|-+......++....+.....+.+...+.|-.+++--......+ +.... . 
T Consensus 227 ~~~~g~iPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~-------~~~~~----~ 295 (474) T protein:vir:96 227 RVSWGRVPFIPFKNNPQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQDLDE-------FMRNL----K 295 (474) T ss_pred ccCCCceeEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccccc-------hhhhh----h Confidence 0 022357888888888888777777777777777777876665322111111 11111 1 Q ss_pred cCcceecC-CCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHH----------HHHHHH Q lcl|NC_019710. 256 KKRLWILE-AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQ 324 (424) Q Consensus 256 ag~~~~l~-~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~----------~~~f~~ 324 (424) .++++.++ .|.+++.+........+.+..+...+.|+..-++|..-.... .++.+....+-. ....+. T Consensus 296 ~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~k~~~~~ 374 (474) T protein:vir:96 296 YYKAINVDGDGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQDKF-GNSPSGIALKFMYSNLDLKANKLKNKTL 374 (474) T ss_pred cCceEEecCCCCceeEEeecCChHHHHHHHHHHHHHHHHHhCCcccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23566665 455666655444445567778888999999999986432221 122221111111 111222 Q ss_pred HHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeee- Q lcl|NC_019710. 325 YTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMR- 403 (424) Q Consensus 325 ~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~- 403 (424) ..+.-+++.|..-+. ...+. ..+.+.+..-+..|..+.++ .+.++|+++...+.+++++-..+.-+.-.+ T Consensus 375 ~~l~~~~~~i~~~~~----~~~~~--~~i~i~f~~~~p~~~~e~~~---~~~~ag~iS~et~~~~~~~v~d~~~E~~ri~ 445 (474) T protein:vir:96 375 TALQELLQYIIDFYK----LNIKV--QDVEITFNFNVMVNELEQSQ---IGVQSQYLSKETVVTNHPWVDDPVAELERIE 445 (474) T ss_pred HHHHHHHHHHHHHhC----CCccc--ceeeEEeccCCCcCHHHHHH---HHHhcCCCchHHHHHhCCCCCCHHHHHHHHH Confidence 222222222222111 11111 22333334445555554444 466789999999988876532111000000 Q ss_pred ------cccccch----hhccccCCCccC Q lcl|NC_019710. 
404 ------QSQYVPI----TDLGTNKEPRNN 422 (424) Q Consensus 404 ------~~n~~~~----~~~~~~~~~~~~ 422 (424) .....+. ......++.++| T Consensus 446 ~E~~e~~~~~~~~~~~~~~~~~d~~~e~~ 474 (474) T protein:vir:96 446 QDNIDFNKQLPPLEGDANGRAQDNESETN 474 (474) T ss_pred HHHHHHHhcccccccccccccCCCcccCC Confidence 0000000 000111111222 No 193 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=97.66 E-value=3.2e-05 Score=45.25 Aligned_cols=382 Identities=7% Similarity=-0.017 Sum_probs=174.1 Q ss_pred CCCCCcccccCC----CccHHHHHH--------------hhccCcccccccccccccccccccccCCc--cccHHHHhhh Q lcl|NC_019710. 1 MEEPKYTIDLRT----NNGWWARLK--------------SWFVGGRLVTPNQGSQTGPVSAHGYLGDS--SINDERILQI 60 (424) Q Consensus 1 ~~~~~~~~~~~~----~~G~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~ 60 (424) -++.+-...... ..-++.++. .+..+.... ....... +..+. ..-...-+.+ T Consensus 29 ~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I------~~~~~~~--~~~~~~~~~~~~~ri~~ 100 (492) T protein:vir:94 29 TEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDI------VKEPKPV--DATGAVDPLKPDDRMIT 100 (492) T ss_pred hhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------ccccccc--ccccccccccccccccc Confidence 000000000000 012222222 222221100 0000000 00000 0000011224 Q ss_pred HHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019710. 61 STVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G 140 (424) +....+|+..+.-+-+-|+.+-- .+. . ....+...+. 
| .-......+..+++.+|.||+++-.+.+| T Consensus 101 n~~k~Ivd~~~~yl~G~p~~~~~--~d~---~--~~~~l~~~~~---n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg 167 (492) T protein:vir:94 101 NFHANLVDQKVSYIVGKPIAFKH--TDD---E--VVKRIDEVLG---N---RFDDKLHSVLTGASNKGIEWLHPYLDEEG 167 (492) T ss_pred chHHHHHHHHHhhhcccCceecc--Cch---H--HHHHHHHHHh---c---cHHHHHHHHHHHHhhCCeEEEEEEecCCC Confidence 56667788777777777766521 111 1 1122333332 2 23355667889999999999999888888 Q ss_pred ceeEEEEeccceEEEEEcCCc---eE---EEEEecC--ceEEecHhHeeEecC----------------------cC--- Q lcl|NC_019710. 141 DVISLLPLQSANMDVKLVGKK---VV---YRYQRDS--EYADFSQKEIFHLKG----------------------FG--- 187 (424) Q Consensus 141 ~~~~l~~l~p~~v~~~~~~~~---~~---~~~~~~~--~~~~~~~~evih~r~----------------------~~--- 187 (424) .+ .+..++|..+.+..+... .. ..|.... ....+.+..|.++.. .+ T Consensus 168 ~~-~~~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 246 (492) T protein:vir:94 168 EF-KLFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGK 246 (492) T ss_pred ce-EEEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeeccccccccccccccccCCCc Confidence 76 577789999887765321 11 1111111 111222223322210 00 Q ss_pred ------CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCccee Q lcl|NC_019710. 188 ------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWI 261 (424) Q Consensus 188 ------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~ 261 (424) .+...|.|-+..+...++....+..-..+.+...+.|-.+++--......+... .. ...+++. 
T Consensus 247 vPvv~~~nn~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~----~~-------~~~~~~~ 315 (492) T protein:vir:94 247 IPFIPFKNNDLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKR----LL-------RYYGAIK 315 (492) T ss_pred cceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhHH----HH-------hhcccee Confidence 022357888888888888777777777777777777877765322211122111 11 1224555 Q ss_pred cCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHHHHHHHH Q lcl|NC_019710. 262 LEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTLQPYI 331 (424) Q Consensus 262 l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~tl~P~~ 331 (424) ++++.+...+........+....+...+.|+..-++|..-.+... ++.|....+ ......+...+.-.+ T Consensus 316 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~ 394 (492) T protein:vir:94 316 VSDNGGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELL 394 (492) T ss_pred cCCCCcceeEeccCCHHHHHHHHHHHHHHHHHHhCCcCCCccccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 655555554443334444566777888888888888853222111 222211111 011112222233333 Q ss_pred HHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC--CcCee-----eec Q lcl|NC_019710. 332 SRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLP--GGDVA-----MRQ 404 (424) Q Consensus 332 ~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~--ggd~~-----~~~ 404 (424) +.|...+.. ..+ ...+.+.+..-+..|..+.++.+.++. |+++..-+.+++++-+.+ ..+.. -.. T Consensus 395 ~li~~~~~~----~~~--~~~i~v~f~~~~p~~~~e~~~~~~kl~--giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~ 466 (492) T protein:vir:94 395 WFVFEHFDI----KGE--HKDVDISFNYNKVANTELQVQTAQQSM--GIVSHETVLENHPFVEDLQAELERIEQEQMEYN 466 (492) T ss_pred HHHHHHhcC----Ccc--cceeeEEecCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHH Confidence 333222211 111 123445556667788888999988885 889988888888764321 11110 000 Q ss_pred ccccchhhccccCCCccCCC Q lcl|NC_019710. 
405 SQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 405 ~n~~~~~~~~~~~~~~~~g~ 424 (424) .....+.......++.++++ T Consensus 467 ~~~~~~~~~~~~~~~~~~~~ 486 (492) T protein:vir:94 467 KQLPNLDDGGADSAQQQERS 486 (492) T ss_pred hhccccccccCCCCccccCC Confidence 11111112122222222222 No 194 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=97.62 E-value=3.7e-05 Score=44.89 Aligned_cols=346 Identities=9% Similarity=0.033 Sum_probs=158.6 Q ss_pred cccCCCccHHHHHHhhccCcccccccccccccccccccccC--CccccHHH---H-hhhHHHHHHHHHHHHhhhhCceeE Q lcl|NC_019710. 8 IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLG--DSSINDER---I-LQISTVWRCVSLISTLTACLPLDV 81 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~---~-~~~~~v~~~i~~ia~~ia~~~~~~ 81 (424) |+.+ .++++...+........ .....+.+..... +..+.++. + +...+..-+|+.++..+.=-.| T Consensus 1 ~~~~----~i~~L~~~~~~~~~r~~---~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf-- 71 (409) T protein:vir:16 1 MTEK----GIGYLRFKLSVHKRRAE---MRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREF-- 71 (409) T ss_pred CCHH----HHHHHHHHHHHHhHHHH---HHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhcccccc-- Confidence 3222 23333333322111100 0000000000000 00111111 0 1112344455555443322222 Q ss_pred eeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCc Q lcl|NC_019710. 82 FETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKK 161 (424) Q Consensus 82 ~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~ 161 (424) +.. +..+.+++. + | +.......+..+.+.+|.||+.+..+.+|.| .+.+++|..+....|... 
T Consensus 72 -~~~----------d~~l~~i~~-~-N---~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~-~i~~~sP~~~~~i~D~~~ 134 (409) T protein:vir:16 72 -END----------DFTVNEIFE-E-N---NPDIFFDSTVLSALIASCSFTYISKGENDAV-RLQVIEATNATGIIDPIT 134 (409) T ss_pred -cCc----------chHHHHHHH-h-c---ChhHHHHHHHHHHHHhCceeEEEecCCCCce-EEEEEcccceEEEeeccc Confidence 111 112444442 2 2 2335566888899999999999999888876 677888888876665422 Q ss_pred e----EEEEE-ec--Cce---EEecHhH----------------------eeEecCcC-CCCccccch----HHHHHHHH Q lcl|NC_019710. 162 V----VYRYQ-RD--SEY---ADFSQKE----------------------IFHLKGFG-FTGLVGLSP----IAFACKSA 204 (424) Q Consensus 162 ~----~~~~~-~~--~~~---~~~~~~e----------------------vih~r~~~-~~~~~G~s~----~~~~~~~i 204 (424) . .+.+. .+ +.. ..+.+++ |++|.+.. ....+|.|- +..+.+.+ T Consensus 135 ~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~ 214 (409) T protein:vir:16 135 GLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNA 214 (409) T ss_pred ccceeeeEEEEecCCCceEEEEEEecCcEEEEEecCccccceecCCCCcceEEecccccccccCCccccchhHHHHHHHH Confidence 1 01110 00 000 0112222 34444332 345677773 44555555 Q ss_pred HHHHHHHHHHHHHHhccCCCceeEE-cCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC-----CCceeeeccCChhHH Q lcl|NC_019710. 205 GVAVAMEDQQRDFFANGAKSPQILS-TGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE-----AGFSTSAIGVTPQDA 278 (424) Q Consensus 205 ~~~~~~~~~~~~~~~ng~~p~~vl~-~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~-----~g~~~~~l~~s~~d~ 278 (424) .....-......++.+ |.-++. ...+. ...+. |+.. .++++.++ .+.++.++....-+ T Consensus 215 ~r~~~~~~~~~e~~a~---pqr~i~G~d~d~---~~~~~----~~~~-----~~~i~~~~~d~~g~~~~v~q~~~~~l~- 278 (409) T protein:vir:16 215 KRTLERADVTAEFYSF---PQKYVTGLSDDA---EPMET----WKAT-----VSSMLQFTKDEDGDKPTLGQFTQPSMS- 278 (409) T ss_pred HHHHHHHHHHHHHhcC---hhheeEecCCCC---Cccch----hhhh-----hhHhhccCCCCCCCCceEEecCCCChh- Confidence 5555555555566544 444443 22111 11111 2221 13355553 23456555443222 Q ss_pred HHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHhh------hccChhh--hc- Q lcl|NC_019710. 
279 EMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQPYISRWENSIQR------WLIPAKD--VG- 349 (424) Q Consensus 279 ~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~------~L~~~~~--~~- 349 (424) .|++..+.....+|..=++|+..+|....+-+|....+.+...+...+ .-.-+.+.+.+.+ .+....+ .. T Consensus 279 ~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka-~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~ 357 (409) T protein:vir:16 279 PFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAG-RKAQRSLGAGLLNVAYLAACLRDDVPYLREQ 357 (409) T ss_pred HHHHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCccchh Confidence 489999999999999999999999976542122112222221111111 1111111211111 1111111 00 Q ss_pred cceeeecchhhhc---cCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC Q lcl|NC_019710. 350 RIHAEHNLDGLLR---GDSASRAAFMKAMGESG--LRTINEMRRTDNLPPLP 396 (424) Q Consensus 350 ~~~~~f~~~~~~~---~d~~~~~~~~~~~~~~g--~~t~NE~R~~lg~~p~~ 396 (424) ...+++.+.+... .+....++.+.+++++| ++..+-+++++|+..-+ T Consensus 358 ~~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 358 FSKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred hccceEEecCCCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 1233444444433 23567788899999996 33457779999998655 No 195 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=97.62 E-value=3.7e-05 Score=44.88 Aligned_cols=382 Identities=10% Similarity=0.003 Sum_probs=168.7 Q ss_pred CCCCCcccc----------cCCCccHHHHHHhhccCcccccccccccccccccccccCCcccc-HHHHhhhHHHHHHHHH Q lcl|NC_019710. 1 MEEPKYTID----------LRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSIN-DERILQISTVWRCVSL 69 (424) Q Consensus 1 ~~~~~~~~~----------~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~v~~~i~~ 69 (424) ...+...+. .+.+.--+.++..+..+..... ... ......+..... +..-+.++....+++. 
T Consensus 20 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~------~r~-~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~ 92 (474) T protein:vir:95 20 QLKPQFETQEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIV------KQM-KKVDVYGNIDYDKPDWRITTNFHQNLVDQ 92 (474) T ss_pred hhhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccCchh------ccc-cccccccccccccccceeccchHHHHHHH Confidence 111111110 0111111222222332221100 000 000000000000 0001123455667777 Q ss_pred HHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEec Q lcl|NC_019710. 70 ISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQ 149 (424) Q Consensus 70 ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~ 149 (424) .+.-+-+-|+++- .++ +. ....+...+ . | ........+....+.+|.||+++..+.+|.+ .+..++ T Consensus 93 ~~~~l~g~p~~~~--~~d---~~--~~~~l~~~~-~--n---~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~ 158 (474) T protein:vir:95 93 KVSYVASKPVTYS--CED---ES--VLKIIHDVL-D--T---RWDNKLIDILTATSNKGIDWLQVYINENGEM-KLFRVP 158 (474) T ss_pred HHhhhccCCceec--cCc---hH--HHHHHHHHH-h--c---cHHHHHHHHHHHHhhcCcEEEEEEecCCCce-EEEEEc Confidence 7777767777652 111 11 112233333 2 2 2345566778899999999999988888876 477788 Q ss_pred cceEEEEEcCCc------eEEEEEecCc--eEEecHhHeeEecC----------------------cC---------CCC Q lcl|NC_019710. 150 SANMDVKLVGKK------VVYRYQRDSE--YADFSQKEIFHLKG----------------------FG---------FTG 190 (424) Q Consensus 150 p~~v~~~~~~~~------~~~~~~~~~~--~~~~~~~evih~r~----------------------~~---------~~~ 190 (424) |..+.+..++.. ..++|...+. ...+.++++.+++. .+ .+. T Consensus 159 p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn 238 (474) T protein:vir:95 159 AEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGRVPFIAFKNN 238 (474) T ss_pred ccceEEEEcCCCCCceEEEEEEEEEcCeeEEEEEeCCeEEEEEEcCCccccccccCcccccccccccCCCccceEeecCC Confidence 888887665431 1111111111 12233333333220 00 122 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeee Q lcl|NC_019710. 
191 LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSA 270 (424) Q Consensus 191 ~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~ 270 (424) ..|.|-+.-+...++....+..-..+.++..+.|-.+++-......++... . ....+++.++++.+.+. T Consensus 239 ~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~-------~----~~~~~~i~~~~~~~~~~ 307 (474) T protein:vir:95 239 PEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMR-------G----LKYYKAINVDGDGGVET 307 (474) T ss_pred CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhh-------h----hhccceeeccCCCceeE Confidence 457888887777777777666666666677777766665322111111111 1 12345666777666666 Q ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019710. 271 IGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTLQPYISRWENSIQR 340 (424) Q Consensus 271 l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~tl~P~~~~ie~~l~~ 340 (424) +........+.+..+...+.|+..-++|..-.+.. .++.+....+ +.....+...+.-+++.|.+.+.. T Consensus 308 l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~ 386 (474) T protein:vir:95 308 IQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTDKF-GSAPSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNL 386 (474) T ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Confidence 65554555567778888999999999985322111 1222211111 111223333333444333332211 Q ss_pred hccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCeee-----ecccccchhhc Q lcl|NC_019710. 341 WLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG--GDVAM-----RQSQYVPITDL 413 (424) Q Consensus 341 ~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g--gd~~~-----~~~n~~~~~~~ 413 (424) ..+.....+.| +.-...|..+.++ .+.+.|+++...+.+++++-+.+. .+... ........... 
T Consensus 387 ----~~d~~~i~v~f--~~~~p~d~~e~a~---~~~~~g~iS~et~i~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~ 457 (474) T protein:vir:95 387 ----KMDVKDIEISF--NFNRMMNDAEQSQ---IIAQSQYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDG 457 (474) T ss_pred ----CcccceeeEEe--ccCCCcCHHHHHH---HHHhcCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccccccc Confidence 11222233444 4444555555444 456679999988888876633211 00000 00000000010 Q ss_pred ----cccCCCccCCC Q lcl|NC_019710. 414 ----GTNKEPRNNGA 424 (424) Q Consensus 414 ----~~~~~~~~~g~ 424 (424) .++++..++-. T Consensus 458 ~~d~~~~~~~~~~~~ 472 (474) T protein:vir:95 458 GADGAQQQERSNDKE 472 (474) T ss_pred cCCCCcCCCCCccCC Confidence 01111100111 No 196 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=97.56 E-value=4.6e-05 Score=44.38 Aligned_cols=386 Identities=8% Similarity=-0.020 Sum_probs=174.7 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccc--ccc------ccccccccccccccCCcc--ccHHHHhhhHHHHHHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLV--TPN------QGSQTGPVSAHGYLGDSS--INDERILQISTVWRCVSLI 70 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~--~~~------~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~v~~~i~~i 70 (424) |. +.+-+.-....++.++.......... ... ......... .+..+.. .-+..-+.++....+|+.. T Consensus 35 ~~--~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~--~~~~~~~~~~~~~~ri~~n~~k~Ivd~~ 110 (492) T protein:vir:97 35 IV--RTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKP--VDATGAVDPLKPDDRMITNFHANLVDQK 110 (492) T ss_pred cc--cCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccccc--ccccccccccccccccccchHHHHHHHH Confidence 11 11111111123333333222111000 000 000000000 0000000 0000012245666777877 Q ss_pred HHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecc Q lcl|NC_019710. 
71 STLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQS 150 (424) Q Consensus 71 a~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p 150 (424) +.-+-+-|+++-- .+. .. ...+...+. |. -......+..+++.+|.||.++..+.+|.+ .+..++| T Consensus 111 ~~yl~g~p~~~~~--~d~---~~--~~~l~~~~~---n~---~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~-~~~~~~p 176 (492) T protein:vir:97 111 VSYIVGKPIAFKH--TDD---EV--VKRIDEVLG---NR---FDDKLHSVLTGASNKGIEWLHPYLDEEGEF-KLFRVPA 176 (492) T ss_pred hhhhcccCceecc--Cch---HH--HHHHHHHHh---cc---HHHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEcc Confidence 7777677766521 111 11 122333322 22 234556678899999999999999888876 5777899 Q ss_pred ceEEEEEcCCc---e---EEEEEecC--ceEEecHhHeeEecC----------------------cCC---------CCc Q lcl|NC_019710. 151 ANMDVKLVGKK---V---VYRYQRDS--EYADFSQKEIFHLKG----------------------FGF---------TGL 191 (424) Q Consensus 151 ~~v~~~~~~~~---~---~~~~~~~~--~~~~~~~~evih~r~----------------------~~~---------~~~ 191 (424) ..+.+..++.. . ...|.... ....+.+..+.++.. .++ +.. T Consensus 177 ~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~ 256 (492) T protein:vir:97 177 EQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND 256 (492) T ss_pred cceEEEEcCCCCCceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCC Confidence 99888766421 1 11111111 111223333332210 000 123 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeec Q lcl|NC_019710. 192 VGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAI 271 (424) Q Consensus 192 ~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l 271 (424) .|.|-+..+...++....+..-..+.+...+.|-.+++-.......+.. ... 
...+++.++.+.+.+.+ T Consensus 257 ~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~----~~~-------~~~~~~~~~~~~~~~~l 325 (492) T protein:vir:97 257 LEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFK----RLL-------RYYGAIKVSDNGGVDTI 325 (492) T ss_pred CCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhHH----HHH-------hhccceecCCCCcceeE Confidence 5788888888888777777666677777777777666532221111111 111 12235556655555555 Q ss_pred cCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_019710. 272 GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTLQPYISRWENSIQRW 341 (424) Q Consensus 272 ~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~tl~P~~~~ie~~l~~~ 341 (424) .....+..+....+...+.|+..-++|..-..... ++.+....+ ......+...+..+++.|...+.. T Consensus 326 ~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~- 403 (492) T protein:vir:97 326 QVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFG-SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI- 403 (492) T ss_pred eccCCHHHHHHHHHHHHHHHHHHhCCCCCCccccc-cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC- Confidence 44444556677788888899999998853332211 222211111 111122222333333333322211 Q ss_pred ccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCeee-----ecccccchhhcc Q lcl|NC_019710. 342 LIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG--GDVAM-----RQSQYVPITDLG 414 (424) Q Consensus 342 L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g--gd~~~-----~~~n~~~~~~~~ 414 (424) .. ....+++.+..-+..|..+.++.+.++ .|+++..-+.+++++-+.+. .+..- ...+...+...+ T Consensus 404 ---~~--~~~~i~v~f~~~~p~~~~e~a~~~~kl--~G~iS~et~l~~l~~v~d~~~Eleri~~E~~~~~~~~~~~~~~~ 476 (492) T protein:vir:97 404 ---KG--EHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVEDLQAELERIEQEQTEYNKQLPNLDDGG 476 (492) T ss_pred ---Cc--ccceeeEEecCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCC Confidence 11 123344555666777888899988888 48899888888887633211 00000 001111111111 Q ss_pred ccCCCccCCC Q lcl|NC_019710. 
415 TNKEPRNNGA 424 (424) Q Consensus 415 ~~~~~~~~g~ 424 (424) ...+..+.+. T Consensus 477 ~~~~~~~~~~ 486 (492) T protein:vir:97 477 ADSAQQQERS 486 (492) T ss_pred CCCCcccccc Confidence 1111111111 No 197 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=97.53 E-value=5e-05 Score=44.17 Aligned_cols=391 Identities=10% Similarity=0.053 Sum_probs=172.7 Q ss_pred CCCCCcccccCCCccHHHHH---------------HhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARL---------------KSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWR 65 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~ 65 (424) |.+. ..+.+. ...+.++ ..+..+.......... ....... ....-+.++.... T Consensus 22 ~~~~-~~~~~~--~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~--------~~~~~~~-~~~~ki~~n~~~~ 89 (481) T protein:vir:10 22 VVSD-LAELLK--EENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGER--------RLQKYGD-KADHRAVHNYAKY 89 (481) T ss_pred eeec-chhhcC--HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCcc--------ccccccc-cccceeecchHHH Confidence 1111 111111 1222222 2222221110000000 0000000 0000123456667 Q ss_pred HHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEE Q lcl|NC_019710. 66 CVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISL 145 (424) Q Consensus 66 ~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l 145 (424) +|+..+.-+.+-|+.+--.+ . .....+..++.. 
| ....+...+..+.+.+|.||+.+..+.+|.+ .+ T Consensus 90 ivd~~~~~l~g~~~~~~~~d--~-----~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~-~i 156 (481) T protein:vir:10 90 VSRFIVGYLTGNPITITHQD--N-----QTNDKIIELNDL--N---DADEVNSDLALNLSIYGRAYEIVYRDFEDRD-TF 156 (481) T ss_pred HHHHHHhhhccCCceEecCC--h-----hHHHHHHHHHHh--c---ChhHHHHHHHHHHHhcCeEEEEEEeCCCCeE-EE Confidence 78888777766676542111 1 112335555542 2 3457788899999999999999988888876 57 Q ss_pred EEeccceEEEEEcCCc---e-----EEEEEecC-c----eEEecHhHeeEecCcC---------------------CCCc Q lcl|NC_019710. 146 LPLQSANMDVKLVGKK---V-----VYRYQRDS-E----YADFSQKEIFHLKGFG---------------------FTGL 191 (424) Q Consensus 146 ~~l~p~~v~~~~~~~~---~-----~~~~~~~~-~----~~~~~~~evih~r~~~---------------------~~~~ 191 (424) ..++|..+.+..+... . +|...... . ...+.++.|.+++... .+.. T Consensus 157 ~~~~p~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 236 (481) T protein:vir:10 157 KVLDPKSTFVVYDQTLDKKVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGTYHRVEEVEHYYNDVPIIEYLNDQ 236 (481) T ss_pred EEEcccceEEEEcCCCCCceEEEEEEEEEeeCCCceEEEEEEEecCeEEEEEecCCceeecccccccCCceeEEEeecCC Confidence 7889999887766432 1 11111111 1 1133455554443110 0123 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeec Q lcl|NC_019710. 192 VGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAI 271 (424) Q Consensus 192 ~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l 271 (424) .|.|.+..+...++.......-..+.+...+.|..+++-.... +++....++... ........ 
.....+++.++.-+ T Consensus 237 ~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~-~~~~~~~~~~~~-~~~~~~~~-~~~~~~~~~~~~~l 313 (481) T protein:vir:10 237 FKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDL-DSEDAKAFRDAN-MIHLEPGT-NANGSEGKAEVKYV 313 (481) T ss_pred CCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCC-Cccchhhhhhcc-ceeccccc-cccCCCCCcceeEE Confidence 5777777666666655555444455555556676666532222 222222211100 01111110 01112223344433 Q ss_pred cCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHH----------HHHHHHHHHHHHHHHHhhh Q lcl|NC_019710. 272 GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGF----------LQYTLQPYISRWENSIQRW 341 (424) Q Consensus 272 ~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f----------~~~tl~P~~~~ie~~l~~~ 341 (424) ........+.+..+...+.|+..-++|....+... ++.+....+.....+ +...+.-+++.+...++.. T Consensus 314 ~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~ 392 (481) T protein:vir:10 314 YKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFS-GVQSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLT 392 (481) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 33333455677788889999999999976554332 222211111111111 1111111111111111111 Q ss_pred ccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC--CCcCe-------eeecccccchhh Q lcl|NC_019710. 342 LIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPL--PGGDV-------AMRQSQYVPITD 412 (424) Q Consensus 342 L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~--~ggd~-------~~~~~n~~~~~~ 412 (424) - ........+.+.+......|..+.++.+.++. |+++.-.+.+++++-.. +..+. 
.....+.....+ T Consensus 393 ~--~~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl~--g~is~et~~~~l~~i~d~~~E~~ri~~E~~~~~~~~~~~~~~~ 468 (481) T protein:vir:10 393 G--LKQHNYAELTITFTPNLPKSMMESINAFNALS--GGVSESTRLSLLDFIDNPKEELEKMQEEEAQREKQADKRGYGE 468 (481) T ss_pred C--CCccccceeeEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhhhccCCc Confidence 0 00111123455556667788888999988884 78988777787776321 11000 000000111112 Q ss_pred ccccC-CCccCCC Q lcl|NC_019710. 413 LGTNK-EPRNNGA 424 (424) Q Consensus 413 ~~~~~-~~~~~g~ 424 (424) ..... ++.++.| T Consensus 469 ~~~~~~~~dd~~g 481 (481) T protein:vir:10 469 AFENHLNVDDSNG 481 (481) T ss_pred cCCCCCCCCCCCC Confidence 22222 2222222 No 198 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=97.46 E-value=6.2e-05 Score=43.64 Aligned_cols=388 Identities=10% Similarity=-0.007 Sum_probs=171.3 Q ss_pred CCC-CCcccccCC--------------CccHHHHHHhhccCcccccccc--ccccc--cccc----ccccCCcccc-HHH Q lcl|NC_019710. 1 MEE-PKYTIDLRT--------------NNGWWARLKSWFVGGRLVTPNQ--GSQTG--PVSA----HGYLGDSSIN-DER 56 (424) Q Consensus 1 ~~~-~~~~~~~~~--------------~~G~~~~~~~~~~~~~~~~~~~--~~~~~--~~~~----~~~~~~~~~~-~~~ 56 (424) |-+ -+||+.--. ..-++.++......... .... ....+ .+.. ....+..... +.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~-~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ 79 (474) T protein:vir:96 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLK-DINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDW 79 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHH-HHHHHHHHhcccCccccccchhhhccccccccccc Confidence 222 233332222 22344444333221100 0000 00000 0000 0000000000 000 Q ss_pred HhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee Q lcl|NC_019710. 
57 ILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR 136 (424) Q Consensus 57 ~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r 136 (424) -+.++.....|+..+.-+-+-|+.+--. +.. ....+...+. | ........+..+++.+|.||.++-+ T Consensus 80 ki~~n~~k~Iv~~~~~yl~g~p~~~~~~--~~~-----~~~~l~~~~~---n---~~~~~~~~l~~~~~~~G~~~~~~~~ 146 (474) T protein:vir:96 80 RITTNFHQNLVDQKVSYVAGKPVTYAHD--DDK-----VLDVIHQVLD---T---RWDNKLIDILTAASNKGIDWLQVYI 146 (474) T ss_pred ccccchHHHHHHhhhhhhcccCceeccC--ChH-----HHHHHHHHHh---c---cHHHHHHHHHHHHhhCCeEEEEeee Confidence 0123455667777777777777765211 111 1122333332 2 2445566788999999999999988 Q ss_pred CCCCceeEEEEeccceEEEEEcCCc--e----EEEEEecCc--eEEecHhHeeEecCc---------------------- Q lcl|NC_019710. 137 NSAGDVISLLPLQSANMDVKLVGKK--V----VYRYQRDSE--YADFSQKEIFHLKGF---------------------- 186 (424) Q Consensus 137 ~~~G~~~~l~~l~p~~v~~~~~~~~--~----~~~~~~~~~--~~~~~~~evih~r~~---------------------- 186 (424) +.+|.+ .+..++|..+.+..++.. . .+.|...+. ...+.++.|.++..- T Consensus 147 d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (474) T protein:vir:96 147 NEDGEL-KLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTG 225 (474) T ss_pred CCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCccccc Confidence 888876 577788998887766431 0 111222221 112333444433210 Q ss_pred C---------CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccC Q lcl|NC_019710. 187 G---------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKK 257 (424) Q Consensus 187 ~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag 257 (424) . .+...|.|.+......++....+..-..+.+...+.|-.+++--......+ .... .+.. 
T Consensus 226 ~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~-------~~~~----~~~~ 294 (474) T protein:vir:96 226 SWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSE-------FMEG----LKYY 294 (474) T ss_pred CCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccc-------hhhh----hhcc Confidence 0 022357777777777777766666655666666666766654321110011 1111 1123 Q ss_pred cceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHHHH Q lcl|NC_019710. 258 RLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTL 327 (424) Q Consensus 258 ~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~tl 327 (424) +++.++++.+...+.....+..+....+...+.|...-++|..-..... ++.+....+ ......+...+ T Consensus 295 ~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l 373 (474) T protein:vir:96 295 KAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFG-SATSGIALKFLYTNLNLKANKLKNKANVAL 373 (474) T ss_pred ceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5666666666655554445556677788888899999999854322211 222211111 11112222333 Q ss_pred HHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCee---- Q lcl|NC_019710. 328 QPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG--GDVA---- 401 (424) Q Consensus 328 ~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g--gd~~---- 401 (424) .-+++.|...+. ...+. ..+.+.+..-+..+..+.++. +.+.|+++.-.+++.+++-..+. .+.. T Consensus 374 ~~~~~~i~~~~g----~~~d~--~~i~i~f~~~~p~~~~e~a~~---~~~~giiS~et~~~~lp~v~D~~~E~eri~~E~ 444 (474) T protein:vir:96 374 QELMQFILDFNK----IKLDA--KEIEITFNFNVMVNDLEQSQI---GAQSQYLSKETLVRHHPWVDDPKAELERLDEEQ 444 (474) T ss_pred HHHHHHHHHHhC----CCccc--ceeeEEecCCCccCHHHHHHH---HHHcCCCChHHHHHhCCCCCCHHHHHHHHHHHH Confidence 333333322211 11111 223333344455555555554 45579999988888887643211 1100 Q ss_pred -eecccccchhhccccCCCccCCC Q lcl|NC_019710. 
402 -MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 402 -~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) -...+...+...+...+.++..+ T Consensus 445 ~~~~~~~~~~~~~~~~~~~~~~~~ 468 (474) T protein:vir:96 445 LELNKQLPNLDDGGADGAQQQQQS 468 (474) T ss_pred HHHHhhccccccccCCCCCCcCCC Confidence 00001111111111111111111 No 199 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=97.46 E-value=6.2e-05 Score=43.64 Aligned_cols=388 Identities=10% Similarity=-0.007 Sum_probs=171.3 Q ss_pred CCC-CCcccccCC--------------CccHHHHHHhhccCcccccccc--ccccc--cccc----ccccCCcccc-HHH Q lcl|NC_019710. 1 MEE-PKYTIDLRT--------------NNGWWARLKSWFVGGRLVTPNQ--GSQTG--PVSA----HGYLGDSSIN-DER 56 (424) Q Consensus 1 ~~~-~~~~~~~~~--------------~~G~~~~~~~~~~~~~~~~~~~--~~~~~--~~~~----~~~~~~~~~~-~~~ 56 (424) |-+ -+||+.--. ..-++.++......... .... ....+ .+.. ....+..... +.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~-~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ 79 (474) T protein:vir:95 1 MINIIRMPWDKPYGEEVVEQMKPKVETQEEMIIRLINNHKQKLK-DINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDW 79 (474) T ss_pred CcccccCCCCCCCCcchhhhccccccchHHHHHHHHHHHHHHHH-HHHHHHHHhcccCccccccchhhhccccccccccc Confidence 222 233332222 22344444333221100 0000 00000 0000 0000000000 000 Q ss_pred HhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee Q lcl|NC_019710. 57 ILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR 136 (424) Q Consensus 57 ~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r 136 (424) -+.++.....|+..+.-+-+-|+.+--. +.. ....+...+. 
| ........+..+++.+|.||.++-+ T Consensus 80 ki~~n~~k~Iv~~~~~yl~g~p~~~~~~--~~~-----~~~~l~~~~~---n---~~~~~~~~l~~~~~~~G~~~~~~~~ 146 (474) T protein:vir:95 80 RITTNFHQNLVDQKVSYVAGKPVTYAHD--DDK-----VLDVIHQVLD---T---RWDNKLIDILTAASNKGIDWLQVYI 146 (474) T ss_pred ccccchHHHHHHhhhhhhcccCceeccC--ChH-----HHHHHHHHHh---c---cHHHHHHHHHHHHhhCCeEEEEeee Confidence 0123455667777777777777765211 111 1122333332 2 2445566788999999999999988 Q ss_pred CCCCceeEEEEeccceEEEEEcCCc--e----EEEEEecCc--eEEecHhHeeEecCc---------------------- Q lcl|NC_019710. 137 NSAGDVISLLPLQSANMDVKLVGKK--V----VYRYQRDSE--YADFSQKEIFHLKGF---------------------- 186 (424) Q Consensus 137 ~~~G~~~~l~~l~p~~v~~~~~~~~--~----~~~~~~~~~--~~~~~~~evih~r~~---------------------- 186 (424) +.+|.+ .+..++|..+.+..++.. . .+.|...+. ...+.++.|.++..- T Consensus 147 d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (474) T protein:vir:95 147 NEDGEL-KLFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTG 225 (474) T ss_pred CCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCccccc Confidence 888876 577788998887766431 0 111222221 112333444433210 Q ss_pred C---------CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccC Q lcl|NC_019710. 187 G---------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKK 257 (424) Q Consensus 187 ~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag 257 (424) . .+...|.|.+......++....+..-..+.+...+.|-.+++--......+ .... .+.. 
T Consensus 226 ~~~~vPvv~~~nn~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~-------~~~~----~~~~ 294 (474) T protein:vir:95 226 SWERVPFIAFKNNPEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSE-------FMEG----LKYY 294 (474) T ss_pred CCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccc-------hhhh----hhcc Confidence 0 022357777777777777766666655666666666766654321110011 1111 1123 Q ss_pred cceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHHHH Q lcl|NC_019710. 258 RLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTL 327 (424) Q Consensus 258 ~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~tl 327 (424) +++.++++.+...+.....+..+....+...+.|...-++|..-..... ++.+....+ ......+...+ T Consensus 295 ~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l 373 (474) T protein:vir:95 295 KAINVSSDGGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDKFG-SATSGIALKFLYTNLNLKANKLKNKANVAL 373 (474) T ss_pred ceeeccCCCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5666666666655554445556677788888899999999854322211 222211111 11112222333 Q ss_pred HHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCee---- Q lcl|NC_019710. 328 QPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG--GDVA---- 401 (424) Q Consensus 328 ~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g--gd~~---- 401 (424) .-+++.|...+. ...+. ..+.+.+..-+..+..+.++. +.+.|+++.-.+++.+++-..+. .+.. T Consensus 374 ~~~~~~i~~~~g----~~~d~--~~i~i~f~~~~p~~~~e~a~~---~~~~giiS~et~~~~lp~v~D~~~E~eri~~E~ 444 (474) T protein:vir:95 374 QELMQFILDFNK----IKLDA--KEIEITFNFNVMVNDLEQSQI---GAQSQYLSKETLVRHHPWVDDPKAELERLDEEQ 444 (474) T ss_pred HHHHHHHHHHhC----CCccc--ceeeEEecCCCccCHHHHHHH---HHHcCCCChHHHHHhCCCCCCHHHHHHHHHHHH Confidence 333333322211 11111 223333344455555555554 45579999988888887643211 1100 Q ss_pred -eecccccchhhccccCCCccCCC Q lcl|NC_019710. 
402 -MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 402 -~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) -...+...+...+...+.++..+ T Consensus 445 ~~~~~~~~~~~~~~~~~~~~~~~~ 468 (474) T protein:vir:95 445 LELNKQLPNLDDGGADGAQQQQQS 468 (474) T ss_pred HHHHhhccccccccCCCCCCcCCC Confidence 00001111111111111111111 No 200 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=97.43 E-value=6.8e-05 Score=43.43 Aligned_cols=372 Identities=10% Similarity=-0.001 Sum_probs=160.9 Q ss_pred cCCC-ccHHHHHHhhccCccccccccccccccccccccc--CCcccc---HHHHhhhHHHHHHHHHHHHhhhhCceeEee Q lcl|NC_019710. 10 LRTN-NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYL--GDSSIN---DERILQISTVWRCVSLISTLTACLPLDVFE 83 (424) Q Consensus 10 ~~~~-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~ 83 (424) |... .=|+.++.......... .........+.... -+.... +..-..+.+..-+|+..+..+- +.-++ T Consensus 1 ~~~~~~~~i~~l~~~~~~~~~r---~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~---~~g~~ 74 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQRLSSW---HCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERLD---WLGWT 74 (441) T ss_pred CCccHHHHHHHHHHHHHHHHHH---HHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhhc---ccccc Confidence 2222 23344443333211100 00000000000000 000000 0111112334445554444332 11121 Q ss_pred ccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCce- Q lcl|NC_019710. 84 TDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKV- 162 (424) Q Consensus 84 ~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~- 162 (424) ..+ ...+..++.. | +.......+..+++.+|.||+++.++.+|.+ .+..++|..+.+..|.... 
T Consensus 75 ~~d---------~~~l~~i~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~-~i~~~~p~~~~~i~d~~~~~ 139 (441) T protein:vir:80 75 NGD---------GYGLDGVYAA--N---RLATASCDVHLDALIFGLSFVAIIPHGDGTV-SVRPQSPKNCTGKFSADGSR 139 (441) T ss_pred CCC---------hHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEEeCCCCce-EEEEEccceEEEEEeCCCCc Confidence 111 1224444432 2 3567778889999999999999999999987 5788899998876664321 Q ss_pred -----EEEEEe-cCc--eEEecHhH--------------------------eeEecCcC-CCCccccchHHH-HHHHHHH Q lcl|NC_019710. 163 -----VYRYQR-DSE--YADFSQKE--------------------------IFHLKGFG-FTGLVGLSPIAF-ACKSAGV 206 (424) Q Consensus 163 -----~~~~~~-~~~--~~~~~~~e--------------------------vih~r~~~-~~~~~G~s~~~~-~~~~i~~ 206 (424) .+++.. +.. ...+.++. |+||.+.. ....+|.|-+.- +...++. T Consensus 140 ~~~~~~~~~~~~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa 219 (441) T protein:vir:80 140 LDAGLVVQQTCDPEVVEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDE 219 (441) T ss_pred eeEEEEEEEEecCceEEEEEEecCeEEEEEEcCCcceeeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHH Confidence 011100 000 00111111 35554332 334577775532 3334444 Q ss_pred HHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCC-----ceeeeccCChhHHHHH Q lcl|NC_019710. 207 AVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAG-----FSTSAIGVTPQDAEMM 281 (424) Q Consensus 207 ~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g-----~~~~~l~~s~~d~~~~ 281 (424) ......-......-.+.|..+++ +.. .++..... ++. ..++++.++.+ .++.++.....+ .+. T Consensus 220 ~~~~~s~~~~~~~~~~~~~~~i~-G~~-~~~~~~~~----~~~-----~~~~i~~~~~~~~~~~~~~~~~~~~~~~-~~~ 287 (441) T protein:vir:80 220 AVRTLLGQSVNRDFYAYPQRWVT-GVS-ADEFSQPG----WVL-----SMASVWAVDKDDDGDTPNVGSFPVNSPT-PYS 287 (441) T ss_pred HHHHHHHHHHHHHhhcCceeeee-cCC-ccccccch----hhh-----cccccccCCCCCCCCcceeEecCccchH-HHH Confidence 33333333333344445655554 212 11111111 111 12345544432 344444432222 367 Q ss_pred HHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHH----------HHHHHHHHHHHHHHHHHHHhhhccChhhhccc Q lcl|NC_019710. 
282 ASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN----------LGFLQYTLQPYISRWENSIQRWLIPAKDVGRI 351 (424) Q Consensus 282 e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~----------~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~ 351 (424) +..+.....|+..-++|+..+|....+..|........ ...+...+.-.++.+...++...-.. .... T Consensus 288 ~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~--~~~~ 365 (441) T protein:vir:80 288 DQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDEA--DFFG 365 (441) T ss_pred HHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc--ccce Confidence 77888889999999999999987554322211111111 11111122222222211111110000 0113 Q ss_pred eeeecchhhhccCHHHHHHHHHHHHhCCCcC--HHHHHHHhCCCCCCCcCeeeecccccchhhccccC-CCccCCC Q lcl|NC_019710. 352 HAEHNLDGLLRGDSASRAAFMKAMGESGLRT--INEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNK-EPRNNGA 424 (424) Q Consensus 352 ~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t--~NE~R~~lg~~p~~ggd~~~~~~n~~~~~~~~~~~-~~~~~g~ 424 (424) .+++.+......+..+.++.+.+++++|+.. ...++..+|+.+.+- .+.. . .....++ -.+.+|. T Consensus 366 ~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~e~-~~~~------~-e~~e~~~~~~~~~~~ 433 (441) T protein:vir:80 366 DVGLRWRDASTPTRAATADAVTKLVGAGILPADSRTVLEMLGLDDVQV-EAVM------R-HRAESSDPLAVLAGA 433 (441) T ss_pred eeeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHHHhCCCCHHHH-HHHH------H-HHHHHHHHHHHHhhh Confidence 4555566677888899999999999999754 345677777764321 0000 0 0000000 0000111 No 201 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=97.39 E-value=7.8e-05 Score=43.11 Aligned_cols=382 Identities=11% Similarity=0.003 Sum_probs=169.3 Q ss_pred CCCCCc-cc--------ccC--CCccHHH--------------HHHhhccCcccccccccccccccccccccCCcccc-H Q lcl|NC_019710. 
1 MEEPKY-TI--------DLR--TNNGWWA--------------RLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSIN-D 54 (424) Q Consensus 1 ~~~~~~-~~--------~~~--~~~G~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 54 (424) ...|.+ || .-+ ...=++. ++..+..+...... .. ....+.+..... . T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~------~~-~~~~~~~~~~~~~~ 77 (474) T protein:vir:97 5 IRMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVK------QM-KKVDVHGNIDYDKP 77 (474) T ss_pred ccccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhc------cc-chhccccccccccC Confidence 122222 00 000 0011222 22222222110000 00 000000000000 0 Q ss_pred HHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEE Q lcl|NC_019710. 55 ERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV 134 (424) Q Consensus 55 ~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~ 134 (424) ..-+.++....+|+..+.-+-+-|+.+-- .+ +.. ...+.. ++. | ........+..+.+.+|.||+++ T Consensus 78 ~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~--~d---~~~--~~~l~~-~~~--n---~~~~~~~e~~~~~~~~G~~~~~~ 144 (474) T protein:vir:97 78 DWRITTNFHQNLVDQKVSYVASKPVTYSC--ED---ENV--LKVIHD-VLD--T---RWDNKLIDILTATSNKGIDWLQV 144 (474) T ss_pred cceeecchHHHHHHHHHhhhhcCCceecc--Cc---HHH--HHHHHH-HHh--c---cHHHHHHHHHHHHhhcCceEEEE Confidence 00112445666777777777777776521 11 111 112222 322 2 23455566788999999999999 Q ss_pred eeCCCCceeEEEEeccceEEEEEcCCc---e---EEEEEecCc--eEEecHhHeeEecC--------------------- Q lcl|NC_019710. 135 DRNSAGDVISLLPLQSANMDVKLVGKK---V---VYRYQRDSE--YADFSQKEIFHLKG--------------------- 185 (424) Q Consensus 135 ~r~~~G~~~~l~~l~p~~v~~~~~~~~---~---~~~~~~~~~--~~~~~~~evih~r~--------------------- 185 (424) ..+.+|.+ .+..++|..+.+..++.. . ...|...+. ...+.++.+.+++. 
T Consensus 145 ~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~ 223 (474) T protein:vir:97 145 YINENGEM-KLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFS 223 (474) T ss_pred EecCCCee-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCccccccc Confidence 89888875 577788998887776431 1 111111111 11122333322210 Q ss_pred ------cC----CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcc Q lcl|NC_019710. 186 ------FG----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPV 255 (424) Q Consensus 186 ------~~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ 255 (424) .+ .+...|.|-+..+...++....+.....+.++..+.|..+++-......++... . .. T Consensus 224 ~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~-------~----~~ 292 (474) T protein:vir:97 224 NGNWGRVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMR-------G----LK 292 (474) T ss_pred ccCCCccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhh-------h----hh Confidence 00 023468888888888888777666666666676777776665322211111111 1 12 Q ss_pred cCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHH Q lcl|NC_019710. 256 KKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQY 325 (424) Q Consensus 256 ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~ 325 (424) ..+++.++++.+.+.+........+.+..+...+.|...-++|..-..... ++.+..... ......+.. 
T Consensus 293 ~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~ 371 (474) T protein:vir:97 293 YYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFG-SAPSGIALKFLYGNLDLKANKLKNKATV 371 (474) T ss_pred ccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 345666766666666554444455566777888888888888843221111 221111111 111223333 Q ss_pred HHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--c----- Q lcl|NC_019710. 326 TLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG--G----- 398 (424) Q Consensus 326 tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g--g----- 398 (424) .+.-+++.|.+-+.. ..+.....+.| +.-...+..+ .++.+.+.|+++.--+.+++++-+.+. - T Consensus 372 ~l~~~~~li~~~~~~----~~d~~~i~v~f--~~~~p~~~~e---~a~~~~~~g~iS~et~l~~l~~v~D~~~E~eri~~ 442 (474) T protein:vir:97 372 AIQELISFIIDFNNL----KTDVKDIEISF--NFNRMMNDAE---QSQIIAQSQYLSRETLVKSSPLVDDYKAELERIEQ 442 (474) T ss_pred HHHHHHHHHHHHhCC----CcccceeeEEe--ccCcccCHHH---HHHHHHHcCCCCHHHHHHhCCCCCCHHHHHHHHHH Confidence 444444443332221 11222233444 3334444444 444556679999988888887632110 0 Q ss_pred CeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 399 DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) +.--......+....+...++.+.+. T Consensus 443 E~~~~~~~~~~~~~~~~~~~~~~~~~ 468 (474) T protein:vir:97 443 EQMEYNKQLPNLDDGGADGAQQQEGS 468 (474) T ss_pred HHHHHHhhccccCCCCCCCcccCCCC Confidence 00000011111222111111111111 No 202 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=97.39 E-value=7.8e-05 Score=43.11 Aligned_cols=382 Identities=11% Similarity=0.003 Sum_probs=169.3 Q ss_pred CCCCCc-cc--------ccC--CCccHHH--------------HHHhhccCcccccccccccccccccccccCCcccc-H Q lcl|NC_019710. 
1 MEEPKY-TI--------DLR--TNNGWWA--------------RLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSIN-D 54 (424) Q Consensus 1 ~~~~~~-~~--------~~~--~~~G~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 54 (424) ...|.+ || .-+ ...=++. ++..+..+...... .. ....+.+..... . T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~------~~-~~~~~~~~~~~~~~ 77 (474) T protein:vir:94 5 IRMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVK------QM-KKVDVHGNIDYDKP 77 (474) T ss_pred ccccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhc------cc-chhccccccccccC Confidence 122222 00 000 0011222 22222222110000 00 000000000000 0 Q ss_pred HHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEE Q lcl|NC_019710. 55 ERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV 134 (424) Q Consensus 55 ~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~ 134 (424) ..-+.++....+|+..+.-+-+-|+.+-- .+ +.. ...+.. ++. | ........+..+.+.+|.||+++ T Consensus 78 ~~ki~~n~~k~Ivd~~~~~l~g~p~~~~~--~d---~~~--~~~l~~-~~~--n---~~~~~~~e~~~~~~~~G~~~~~~ 144 (474) T protein:vir:94 78 DWRITTNFHQNLVDQKVSYVASKPVTYSC--ED---ENV--LKVIHD-VLD--T---RWDNKLIDILTATSNKGIDWLQV 144 (474) T ss_pred cceeecchHHHHHHHHHhhhhcCCceecc--Cc---HHH--HHHHHH-HHh--c---cHHHHHHHHHHHHhhcCceEEEE Confidence 00112445666777777777777776521 11 111 112222 322 2 23455566788999999999999 Q ss_pred eeCCCCceeEEEEeccceEEEEEcCCc---e---EEEEEecCc--eEEecHhHeeEecC--------------------- Q lcl|NC_019710. 135 DRNSAGDVISLLPLQSANMDVKLVGKK---V---VYRYQRDSE--YADFSQKEIFHLKG--------------------- 185 (424) Q Consensus 135 ~r~~~G~~~~l~~l~p~~v~~~~~~~~---~---~~~~~~~~~--~~~~~~~evih~r~--------------------- 185 (424) ..+.+|.+ .+..++|..+.+..++.. . ...|...+. ...+.++.+.+++. 
T Consensus 145 ~~d~~~~~-~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~ 223 (474) T protein:vir:94 145 YINENGEM-KLFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFS 223 (474) T ss_pred EecCCCee-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCccccccccCcCccccccc Confidence 89888875 577788998887776431 1 111111111 11122333322210 Q ss_pred ------cC----CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcc Q lcl|NC_019710. 186 ------FG----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPV 255 (424) Q Consensus 186 ------~~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ 255 (424) .+ .+...|.|-+..+...++....+.....+.++..+.|..+++-......++... . .. T Consensus 224 ~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~-------~----~~ 292 (474) T protein:vir:94 224 NGNWGRVPFIAFKNNPEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMR-------G----LK 292 (474) T ss_pred ccCCCccceEEecCCcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhh-------h----hh Confidence 00 023468888888888888777666666666676777776665322211111111 1 12 Q ss_pred cCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHH Q lcl|NC_019710. 256 KKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQY 325 (424) Q Consensus 256 ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~ 325 (424) ..+++.++++.+.+.+........+.+..+...+.|...-++|..-..... ++.+..... ......+.. 
T Consensus 293 ~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~ 371 (474) T protein:vir:94 293 YYKAINVDGDGGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTDKFG-SAPSGIALKFLYGNLDLKANKLKNKATV 371 (474) T ss_pred ccceeeccCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 345666766666666554444455566777888888888888843221111 221111111 111223333 Q ss_pred HHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--c----- Q lcl|NC_019710. 326 TLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG--G----- 398 (424) Q Consensus 326 tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g--g----- 398 (424) .+.-+++.|.+-+.. ..+.....+.| +.-...+..+ .++.+.+.|+++.--+.+++++-+.+. - T Consensus 372 ~l~~~~~li~~~~~~----~~d~~~i~v~f--~~~~p~~~~e---~a~~~~~~g~iS~et~l~~l~~v~D~~~E~eri~~ 442 (474) T protein:vir:94 372 AIQELISFIIDFNNL----KTDVKDIEISF--NFNRMMNDAE---QSQIIAQSQYLSRETLVKSSPLVDDYKAELERIEQ 442 (474) T ss_pred HHHHHHHHHHHHhCC----CcccceeeEEe--ccCcccCHHH---HHHHHHHcCCCCHHHHHHhCCCCCCHHHHHHHHHH Confidence 444444443332221 11222233444 3334444444 444556679999988888887632110 0 Q ss_pred CeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 399 DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) +.--......+....+...++.+.+. T Consensus 443 E~~~~~~~~~~~~~~~~~~~~~~~~~ 468 (474) T protein:vir:94 443 EQMEYNKQLPNLDDGGADGAQQQEGS 468 (474) T ss_pred HHHHHHhhccccCCCCCCCcccCCCC Confidence 00000011111222111111111111 No 203 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=97.38 E-value=8.1e-05 Score=43.02 Aligned_cols=388 Identities=10% Similarity=0.036 Sum_probs=172.4 Q ss_pred ccHHHHHHh----hccCcccccc---cccc---cccc------ccc--ccccCCcc-ccHHHHhhhHHHHHHHHHHHHhh Q lcl|NC_019710. 
14 NGWWARLKS----WFVGGRLVTP---NQGS---QTGP------VSA--HGYLGDSS-INDERILQISTVWRCVSLISTLT 74 (424) Q Consensus 14 ~G~~~~~~~----~~~~~~~~~~---~~~~---~~~~------~~~--~~~~~~~~-~~~~~~~~~~~v~~~i~~ia~~i 74 (424) +|+|.+++. |+++...... -+.. ..+. ... ..|..... -.....+..+.-..+++.+|+-+ T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~l~~~i~~~~A~ll 80 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHDKLMNSGTGNEIVVVAAEYI 80 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCccccccccCChHHHHHHHHHHhh Confidence 777766554 5544322100 0000 0000 000 01111111 00111222333455666666666 Q ss_pred hhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEE Q lcl|NC_019710. 75 ACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMD 154 (424) Q Consensus 75 a~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~ 154 (424) ..=+..+--.+.+....+. ...-+.++|.. - ....-+...+.+.+..|.+++.+..+ +|.+ .+..+++.++. T Consensus 81 ~~e~~~i~v~~~~~~d~e~-~~~~l~~il~~-n----~f~~~~~~~~e~a~a~G~~~~k~~~d-~~~~-~i~~v~ad~~~ 152 (518) T protein:vir:78 81 SGKPLSIDVTGVNGSKDEN-LTKQLKEALRI-D----NFDSKSVKIVELAGGSGVSAVKINIL-NGRP-SISVHSSSQFW 152 (518) T ss_pred cCCCceEEecCccccCcHH-HHHHHHHHHHh-c----cHHHHHHHHHHHhhccCceEEEEEEE-CCee-EEEEEcCCeeE Confidence 5443333111111111111 11123333321 1 13344455667777788877766544 3553 56667776665 Q ss_pred EEEcCC----------------ceEEE--------------------------EEe-cCceEE-------------ecHh Q lcl|NC_019710. 155 VKLVGK----------------KVVYR--------------------------YQR-DSEYAD-------------FSQK 178 (424) Q Consensus 155 ~~~~~~----------------~~~~~--------------------------~~~-~~~~~~-------------~~~~ 178 (424) +...++ ..+|. |.. .+.... 
...+ T Consensus 153 P~~~~g~~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~ 232 (518) T protein:vir:78 153 IDFKNNEPFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTN 232 (518) T ss_pred EEeecCcEEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCcccccccccccccccccccccc Confidence 533211 11111 000 000000 0000 Q ss_pred H---------------eeEecCcCC-----CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEc-----CCC Q lcl|NC_019710. 179 E---------------IFHLKGFGF-----TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILST-----GEK 233 (424) Q Consensus 179 e---------------vih~r~~~~-----~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~-----~~~ 233 (424) . +.|+++... +.+.|+|.+.-+...++.....-.....-|+. +.+..++.. ..+ T Consensus 233 ~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~-g~~~i~v~~~~l~~~~~ 311 (518) T protein:vir:78 233 DIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEK-TKTKIAASERMFRKKVN 311 (518) T ss_pred cCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHh-CCceeeechhHhccCCC Confidence 0 123333211 23569999999999998888777777777776 444544421 100 Q ss_pred CCCHHHHHHHHHHHHHHhCCcccCcce--ecCCCc----eeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC Q lcl|NC_019710. 234 VLTEQQRSQVEENFKEIAGGPVKKRLW--ILEAGF----STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK 307 (424) Q Consensus 234 ~~~~~~~~~~~~~~~~~~~~~~ag~~~--~l~~g~----~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~ 307 (424) -......-.+. ...+.-..+ ..++|. .++.++....+.++.+..+...+.|....|+++..++.... T Consensus 312 ~~~~~~~~~fd-------~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~ 384 (518) T protein:vir:78 312 KSTDKEEWSMN-------VDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNR 384 (518) T ss_pred CCCCccccccC-------CCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcccc Confidence 00000000000 000100000 011222 36677777778889999999999999999999999976432 Q ss_pred CCcccccHH-H-----------HHHHHHHHHHHHHHHHHHHHHhhhccChh----hhccceeeecchhhhccCHHHHHHH Q lcl|NC_019710. 
308 STSWGSGIE-Q-----------QNLGFLQYTLQPYISRWENSIQRWLIPAK----DVGRIHAEHNLDGLLRGDSASRAAF 371 (424) Q Consensus 308 ~~~~~~n~e-~-----------~~~~f~~~tl~P~~~~ie~~l~~~L~~~~----~~~~~~~~f~~~~~~~~d~~~~~~~ 371 (424) ..+ ..| . .....++.+|.=++..|.+-+.. +.... ......+.++++.-+..|.++.++. T Consensus 385 -~~T--ATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~-~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~ 460 (518) T protein:vir:78 385 -EVK--ATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTG-GTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSST 460 (518) T ss_pred -ccc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hcCccccccCCCceeEEEEeCCCCCCCHHHHHHH Confidence 211 111 1 11223333333333333322221 11110 0112457788888899999999999 Q ss_pred HHHHHhCCCcCHHHHHHHh--CCCCCCCcCeeeecccccchhhccccCCCccC------CC Q lcl|NC_019710. 372 MKAMGESGLRTINEMRRTD--NLPPLPGGDVAMRQSQYVPITDLGTNKEPRNN------GA 424 (424) Q Consensus 372 ~~~~~~~g~~t~NE~R~~l--g~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~------g~ 424 (424) ..+++.+|+|++-++-+++ |+.. +..++-+.... .-......++|.+= || T Consensus 461 ~~~~v~aGimS~e~~i~~~~~~~~d-eea~~e~~ri~--~E~~~~~~~~p~~~~g~~~~~g 518 (518) T protein:vir:78 461 LNNMNSALAMSVEEKVKLIHPKWED-EEIQAEVKRIY--LENAIGEVPDPEAIGGMETKGG 518 (518) T ss_pred HHHHHhcCCCCHHHHHHHhCCCCCH-HHHHHHHHHHH--HHhcccCCCCCccccCCCCCCC Confidence 9999999999998855554 3321 11111110000 00000011111111 11 No 204 >protein:vir:97376 Length: 320 # NCBI annotation: putative portal protein # Family: family:all:11744 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762589;genbank:gi:115304290;genbank:GeneID:5130579 Probab=97.37 E-value=4.9e-06 Score=49.69 Aligned_cols=309 Identities=13% Similarity=0.124 Sum_probs=147.3 Q ss_pred ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccc Q lcl|NC_019710. 14 NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKV 93 (424) Q Consensus 14 ~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~ 93 (424) +|+|+- ...+.. +|.-. ...+. .+.+..+....-..-|.+.+-+|..||.- +..|+... 
T Consensus 1 ~~~~~~----~~~~~~-~~~~~--~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~------- 59 (320) T protein:vir:97 1 MGIFNF----KKRETL-TPELK--ESIIR------QVTIEDESPFTGTTDFNVRNEVAESIATY-LGAYKTSA------- 59 (320) T ss_pred CCcccc----cccccc-ChhHH--hhhhh------eeeeccCCCcccccccchhhHHHHHHHHH-hhhhcccc------- Confidence 555541 111111 12110 00000 00000000001111233344455555542 22232221 Q ss_pred cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEEcCCceEEEEEec---C Q lcl|NC_019710. 94 DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKLVGKKVVYRYQRD---S 170 (424) Q Consensus 94 ~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~~~~~~~~~~~~~---~ 170 (424) . ...||...| .|++.++.|.+..-..|++.... .|.++ .+.++++--.....+...+. . T Consensus 60 ---~-~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~-~~~~~------~~~~~~~~~~~~~~~~~~D~FN~~ 121 (320) T protein:vir:97 60 ---K-RLSLLTNNP-------SFLRRLVKHALHNKTTYVYKSPT-YGWLI------TDSMTIEGLRARLTFTLPDPFNSA 121 (320) T ss_pred ---c-eeeeeeCCH-------HHHHHHHHHhhcccceEEeeCCc-cceee------ecceeeeeeeeeEEEecCccccee Confidence 1 122343222 79999999999999999987543 23222 12222211110110000000 0 Q ss_pred ceEEecHhHeeEecCcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019710. 171 EYADFSQKEIFHLKGFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEI 250 (424) Q Consensus 171 ~~~~~~~~evih~r~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~ 250 (424) .+..++-.|+-.+ .++++|..+-++.. 
....+....-+-+.|.+....+++++.+..-++..++....+.++ T Consensus 122 V~mtvpfyD~~IL----dnpl~gv~tqe~gk----M~g~a~~~v~kkL~~~~~IKafi~Tdid~GLee~kD~~~~kIk~m 193 (320) T protein:vir:97 122 VTMTVPFYDVGII----DSPLVEVDTEEANK----MLEAAYSAVMKKLHNTGAIKAFISSDIDVGLEKMKEESDSKIKAM 193 (320) T ss_pred EEEEeeeechhhh----hhhhcccChHHhhH----HHHHHhhhhhhhccccceeEEEEecccchhHHHHHHHHHHHHHHH Confidence 0111121121111 24567777653322 223334555666778888889998887755566666666666554 Q ss_pred hCCcc-cCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHHHHHHHHHH Q lcl|NC_019710. 251 AGGPV-KKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTLQP 329 (424) Q Consensus 251 ~~~~~-ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~f~~~tl~P 329 (424) ..-.+ -.++-+++.+.+++++.....-.. ..-....++..+.-|++|..+|-++. .+.+..+|+...+.| T Consensus 194 q~~A~~~nG~T~i~~~dDI~Qi~pDYS~sn-~~D~~l~~t~alS~y~m~~~IL~GsA--------te~~~Iaf~~~~V~P 264 (320) T protein:vir:97 194 LATAELLSGYTYIQRGDDVTQMMPDYTTSN-VTDFAAMRTFAASQLSVSDKILDGSA--------TDGEKVAVMFRFVEP 264 (320) T ss_pred HHHHHHhcCcccccCCcceeeecccccccc-hhHHHHHHHHHHhhcCCchhhccccC--------CcceeeehhhHhHHH Confidence 43332 446889999999999876443332 22235557778889999999996542 356778999999999 Q ss_pred HHHHH---HHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC----CCcCeee Q lcl|NC_019710. 330 YISRW---ENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPL----PGGDVAM 402 (424) Q Consensus 330 ~~~~i---e~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~----~ggd~~~ 402 (424) +++++ |.+|..++-. +|.+.-+- + .|-+.-|.+- -.|.+.. .|||+-- T Consensus 265 LL~Q~~~~Ek~Lvy~m~~---------E~FVs~mt------------T---GG~l~S~~~~-~~~~~~~~~~~~~~~~~~ 319 (320) T protein:vir:97 265 ILEQFREYEPSLIYAMRD---------EFFVSFMT------------T---GGMLNSNRVD-GWGKEKAPNESKGGDVGD 319 (320) T ss_pred HHHHhhhcCcceeeeecc---------ceeeeeee------------c---Cceeeccccc-ccccccCCccccCCcccC Confidence 99997 4555443321 11111110 0 3344333321 1122221 2333322 Q ss_pred e Q lcl|NC_019710. 
403 R 403 (424) Q Consensus 403 ~ 403 (424) + T Consensus 320 ~ 320 (320) T protein:vir:97 320 V 320 (320) T ss_pred C Confidence 2 No 205 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=97.27 E-value=0.00011 Score=42.30 Aligned_cols=382 Identities=9% Similarity=-0.021 Sum_probs=181.1 Q ss_pred CC--CCCccc-------ccCCC-ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHH Q lcl|NC_019710. 1 ME--EPKYTI-------DLRTN-NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLI 70 (424) Q Consensus 1 ~~--~~~~~~-------~~~~~-~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~i 70 (424) |. +.-++- ..++. +--+.++..+..+..... .... .. ... ..-+.++....+|+.. T Consensus 19 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~-~~~~---~~----~~~------~~ki~~n~~~~Ivd~~ 84 (470) T protein:vir:99 19 FPKGEKLTSNELLGFIAYNETVLKPRYRENMKLYLGKHKIL-TAPE---KE----TGA------DNRIVVNSAKYVVDVY 84 (470) T ss_pred eCCCCCcCHHHHHHHHHHHHHhhHHHHHHHHHHhccccccc-cCcc---cc----cCC------cceeecchHHHHHHHH Confidence 11 000000 01111 123555555555432111 0000 00 000 0011234556677777 Q ss_pred HHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecc Q lcl|NC_019710. 71 STLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQS 150 (424) Q Consensus 71 a~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p 150 (424) +.-+-+-|+++.-.+ +.. ....+..++.. 
.........+..+.+.+|.||+++..+.+|.+ .+..++| T Consensus 85 ~~~l~g~p~~~~~~~-d~~-----~~~~l~~~~~~-----n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~-~i~~~~p 152 (470) T protein:vir:99 85 NGYFCGIEPKLALLN-DSS-----KIDEIARWNRQ-----ENFFDTINEISKQCDIFGRSIASIYQGEDARP-HLMYSSP 152 (470) T ss_pred hhhhccCCeeEeeCC-chh-----HHHHHHHHHHh-----cCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeE-EEEEEcc Confidence 776666676653211 111 01223443432 24557778899999999999999988888877 5777899 Q ss_pred ceEEEEEcCCce------EEEEE-ecCc-----eEEecHhHeeEecCc-------------------C----CCCccccc Q lcl|NC_019710. 151 ANMDVKLVGKKV------VYRYQ-RDSE-----YADFSQKEIFHLKGF-------------------G----FTGLVGLS 195 (424) Q Consensus 151 ~~v~~~~~~~~~------~~~~~-~~~~-----~~~~~~~evih~r~~-------------------~----~~~~~G~s 195 (424) ..+.+..++... .+.|. .++. ...+.++.+++++.. + .+...|.| T Consensus 153 ~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~s 232 (470) T protein:vir:99 153 NHAFIIYDDTVQRQPLAFVHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEEDTNAAGYAINPYGLVPAVEFFENEERQG 232 (470) T ss_pred ceeEEEEcCCCCcceEEEEEEEEEecCCeeEEEEEEEecCeEEEEEecccccccccccccccCCCccceEeecCCCCCCc Confidence 998877765421 11111 1111 112233333332210 0 12335788 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-----CCCceeee Q lcl|NC_019710. 196 PIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-----EAGFSTSA 270 (424) Q Consensus 196 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-----~~g~~~~~ 270 (424) -+..+...++....+.....+.+...+.|-.+++-... ..++.-+.+... . ..+++.+ +.+.++.. 
T Consensus 233 d~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~-~~~~~g~~~~~~----~----~~~~~~~~~~~~~~~~~~~~ 303 (470) T protein:vir:99 233 IFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKL-PEDDEGNPKFDF----K----NNRVLYVSQLDPDTNPQIGF 303 (470) T ss_pred chHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc-ccccccchhhhh----h----hcceeeecCCCCCCCCcceE Confidence 88888888887777767777777777778777753221 111111111111 1 1122222 23444555 Q ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHH----------HHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019710. 271 IGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ----------QNLGFLQYTLQPYISRWENSIQR 340 (424) Q Consensus 271 l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~----------~~~~f~~~tl~P~~~~ie~~l~~ 340 (424) +........+.+..+...+.|+..-++|+...+... ++.|....+- .....+..++.-+++.+...+.. T Consensus 304 l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 382 (470) T protein:vir:99 304 IAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFA-GNSSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFN 382 (470) T ss_pred EeecCChHHHHHHHHHHHHHHHHHhCCccccccccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 554434444566778889999999999965433222 2222111111 11122333333333333333322 Q ss_pred hccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCC-CCCcCeee---------ecccccch Q lcl|NC_019710. 341 WLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPP-LPGGDVAM---------RQSQYVPI 410 (424) Q Consensus 341 ~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p-~~ggd~~~---------~~~n~~~~ 410 (424) .--... ....+.+.+..-+..|..+.++.+.++. |+++...+.++++.-. ....+.+. ......+. T Consensus 383 ~~~~~~--~~~~i~v~f~~~~p~~~~e~a~~~~kl~--giis~et~l~~l~~vd~~~E~eri~~E~~~~~~~~~~~~~~~ 458 (470) T protein:vir:99 383 NKQDQE--LWSELDFKFTRNLPEDMASAIDNAKNAE--GIVSKKTQLGMIPDIEPDAEMKQIAKEKADAIKQTQQLSMPI 458 (470) T ss_pred cCCccc--ccccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCC Confidence 111111 1234455556667788888999988885 7899888888876531 11000000 00011111 Q ss_pred hhccccCCCccC Q lcl|NC_019710. 
411 TDLGTNKEPRNN 422 (424) Q Consensus 411 ~~~~~~~~~~~~ 422 (424) +......+++++ T Consensus 459 d~~~~d~~~ee~ 470 (470) T protein:vir:99 459 DILKRDNNAEEE 470 (470) T ss_pred CcCCCCCCccCC Confidence 221122222222 No 206 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=97.17 E-value=0.00014 Score=41.66 Aligned_cols=390 Identities=9% Similarity=-0.010 Sum_probs=173.2 Q ss_pred CCCCCcccccCCCccHHHHH--------------HhhccCcccc--ccccccc--ccccccc-cc-cCCcccc--HHHHh Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARL--------------KSWFVGGRLV--TPNQGSQ--TGPVSAH-GY-LGDSSIN--DERIL 58 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~--------------~~~~~~~~~~--~~~~~~~--~~~~~~~-~~-~~~~~~~--~~~~~ 58 (424) |+.-.||+..-...=+++++ .......... ....... ....... .. ..+.... ...-+ T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki 80 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKPDWRM 80 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcccccccccccccccccccee Confidence 77777766655443333332 2211100000 0000000 0000000 00 0000000 00012 Q ss_pred hhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC Q lcl|NC_019710. 59 QISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS 138 (424) Q Consensus 59 ~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~ 138 (424) .++....+|+..+.-+-+-|+.+- .+.+ . ....+...+. | ........++.+++.+|.||+.+..+. 
T Consensus 81 ~~n~~~~ivd~~~~~l~g~~~~~~-~~~d-~-----~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~~~~d~ 147 (478) T protein:vir:10 81 YTNYHQNLVDQKVAYAVANPVTFG-VDND-K-----ALKQIQHTLN---H---KWDDKLVDILTAASNKGIEWVQPYVDE 147 (478) T ss_pred ccchHHHHHHHHHhhhccCCeeee-cCCh-H-----HHHHHHHHHh---c---CHHHHHHHHHHHHHhcCeEEEEEEecC Confidence 234556677777776666677652 1111 1 1122334332 2 345666777899999999999998888 Q ss_pred CCceeEEEEeccceEEEEEcCCc---e---EEEEEecCc--eEEecHhHeeEecCc------------------------ Q lcl|NC_019710. 139 AGDVISLLPLQSANMDVKLVGKK---V---VYRYQRDSE--YADFSQKEIFHLKGF------------------------ 186 (424) Q Consensus 139 ~G~~~~l~~l~p~~v~~~~~~~~---~---~~~~~~~~~--~~~~~~~evih~r~~------------------------ 186 (424) +|.+ .+..++|..+.+..+... . .+.|...+. ...+.+++|.+++.. T Consensus 148 ~g~~-~~~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (478) T protein:vir:10 148 EGEF-KTFRVPAEQAVPIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNK 226 (478) T ss_pred CCee-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCeeeccccccccccccceecccc Confidence 8876 577788888887765421 1 111111111 111223333222110 Q ss_pred -------C----CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcc Q lcl|NC_019710. 187 -------G----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPV 255 (424) Q Consensus 187 -------~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ 255 (424) + .+...|.|-+..+...++....+..-..+.++..+.|-.+++-.......+....+ . T Consensus 227 ~~~~~~vPvv~~~n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~-----------~ 295 (478) T protein:vir:10 227 LMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNL-----------K 295 (478) T ss_pred cccCCccceEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhhhhh-----------h Confidence 0 02345788787777777777766666666666667776665432111111111111 1 Q ss_pred cCcceec--CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHH Q lcl|NC_019710. 
256 KKRLWIL--EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFL 323 (424) Q Consensus 256 ag~~~~l--~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~ 323 (424) .++++.+ +.|.+...+........+.+..+...+.|...-++|..-..... ++.+....+ ......+ T Consensus 296 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~ 374 (478) T protein:vir:10 296 YYKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFG-NSPSGIALKFMYSNLDLKANKLKNKT 374 (478) T ss_pred hcceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCccccc-cccHHHHHHHHHHHHHHHHHHHHHHH Confidence 1223333 23334333333333455667778888888888888853332211 222211111 1111222 Q ss_pred HHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cCee Q lcl|NC_019710. 324 QYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG--GDVA 401 (424) Q Consensus 324 ~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g--gd~~ 401 (424) ...+.-+++.|...+. ...+ ...+++.+..-+..|..+.++.+.++ +|+++...+.+++++-..+. .+.. T Consensus 375 ~~~l~~~~~li~~~~g----~~~~--~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D~~~E~~ri 446 (478) T protein:vir:10 375 LTALQELLQYIIDFYR----LDVK--VQDIEITFNFNVMVNELENSQIAMNS--TGLLSKETILSNHAWVEDPVAEMERI 446 (478) T ss_pred HHHHHHHHHHHHHHhC----CCcc--cccceEEecCCCCCCHHHHHHHHHHH--hCCCChHHHHHhCCCCCCHHHHHHHH Confidence 2222222222222111 1111 12334444555667888888888887 68999988888887643211 0000 Q ss_pred e-----ecccccch-hhcc-c-cCCCccCCC Q lcl|NC_019710. 402 M-----RQSQYVPI-TDLG-T-NKEPRNNGA 424 (424) Q Consensus 402 ~-----~~~n~~~~-~~~~-~-~~~~~~~g~ 424 (424) - .......+ .... + +.+..++.. 
T Consensus 447 ~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 477 (478) T protein:vir:10 447 EQENIELNQQLPDIEEGLNGEQQRQSENNQP 477 (478) T ss_pred HHHHHHHHhhccccccccCCCCCCCCCCCCC Confidence 0 00001111 1111 1 112222222 No 207 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=96.98 E-value=0.00023 Score=40.58 Aligned_cols=394 Identities=10% Similarity=0.027 Sum_probs=176.0 Q ss_pred CC-CCCcccccCCC----ccHHHHHHhhccCcccc--ccccccc--ccccccccccCCccccHHHHhhhHHHHHHHHHHH Q lcl|NC_019710. 1 ME-EPKYTIDLRTN----NGWWARLKSWFVGGRLV--TPNQGSQ--TGPVSAHGYLGDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 1 ~~-~~~~~~~~~~~----~G~~~~~~~~~~~~~~~--~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia 71 (424) |. +|.-+|-|-.. ..++.++.+........ ....... ............. .+.+ +..+.....|+..+ T Consensus 1 ~~~~~~~~~~~p~d~~~~~~~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~-~~~k--i~~n~~~~ivd~~~ 77 (453) T protein:vir:39 1 MKYKPPKLMTFPKDEPITNEVVTKFMEKHRLEVARYEYLKNMYRGIMAIDAEPTKDLWK-PDNR--LTVNFTKYIVDTFT 77 (453) T ss_pred CeecCCcceEcCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhhccCchhcCCCccccC-ccce--eecchHHHHHHHHh Confidence 32 33444444443 34555554433221100 0000000 0000000000000 0011 22356666777777 Q ss_pred HhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccc Q lcl|NC_019710. 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSA 151 (424) Q Consensus 72 ~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~ 151 (424) .-+-+-|+.+--.+ ++ ....+.+++.. | ........+..+.+.+|.||+++..+.+|.+ .+-.++|. 
T Consensus 78 ~~l~g~~~~~~~~d-----~~--~~~~l~~i~~~--N---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p~ 144 (453) T protein:vir:39 78 GYFNGIPVKKSHSD-----KE--TLSKLQEFDNL--N---DMEDEESELAKMACIYGRAFELLYQNEETQT-NVIYNTPE 144 (453) T ss_pred hhhcccCceeccCC-----hH--HHHHHHHHHHh--c---ChhHHHHHHHHHHhhcCeEEEEEEecCCCce-EEEEEccc Confidence 77766776552111 11 11234454542 2 2345677888999999999999999988876 46667888 Q ss_pred eEEEEEcCCc---e--EEEEEecCce----EEecHhHeeEecCcC---------------------CCCccccchHHHHH Q lcl|NC_019710. 152 NMDVKLVGKK---V--VYRYQRDSEY----ADFSQKEIFHLKGFG---------------------FTGLVGLSPIAFAC 201 (424) Q Consensus 152 ~v~~~~~~~~---~--~~~~~~~~~~----~~~~~~evih~r~~~---------------------~~~~~G~s~~~~~~ 201 (424) .+.+..++.. . ...+...... ..+.++.+.++.... .+...|.|.+..+. T Consensus 145 ~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~~yt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~ 224 (453) T protein:vir:39 145 NMFMVYDDTIKQEPLFAVRYGYDDDYKLYGEVYTKETTYALNGTMGFYNMTEQAPNPFDDLPVVEFYFNEERMSIFESVI 224 (453) T ss_pred ceEEEecCCCCCeEEEEEEEEEeCCeEEEEEEEeCCeEEEEEecCCceeeecccccCCCceeEEEecCCCCCCcchhhhH Confidence 8887765422 1 1111111111 123333333332110 01235778787777 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHH Q lcl|NC_019710. 202 KSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMM 281 (424) Q Consensus 202 ~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~ 281 (424) ..++....+..-..+.+...+.|..+++-. +.. ++....++.. ......+ ....+.+.++..+........+. 
T Consensus 225 ~liDa~~~~~s~~~~~~~~~~~p~~~~~g~-~~~-~~~~~~~~~~--~~~~~~~---~~~~~~~~~~~~lt~~~~~~~~~ 297 (453) T protein:vir:39 225 SLVNAFNKAISEKANDVDYFSDQYLTFLGA-AVE-EEDLKNIRSN--RVINYYG---ESSEAKNVDVKFLEKPDSDSQTE 297 (453) T ss_pred HHHHHHHHHHHHHHHHHHHhhCceeeeecC-CCC-chhhhhhhhc--ceeeecC---CCCCCCCCceeEEeecCCHHHHH Confidence 777666666666666666667777666532 222 2222211110 0111000 01112233333333333344556 Q ss_pred HHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHH----------HHHHHHHHHHHHHHHHHHHHHhhhccChhhhccc Q lcl|NC_019710. 282 ASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ----------QNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRI 351 (424) Q Consensus 282 e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~----------~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~ 351 (424) +..+...+.|+..-++|..-.+.. ++.+....+. .....+...+...++.+..-++..- .. .... T Consensus 298 ~~~~~l~~~I~~~s~~p~~~~~~~--gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~-~~--~~~~ 372 (453) T protein:vir:39 298 NLLDRLTKLIFQTTMVANISDESF--GSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVS-NK--EAWK 372 (453) T ss_pred HHHHHHHHHHHHHhCCcccccccc--cCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-Cc--cccc Confidence 677788888888888884322211 2222111111 1112333344444443333222110 11 1112 Q ss_pred eeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccchhhccccC----CCccCCC Q lcl|NC_019710. 352 HAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNK----EPRNNGA 424 (424) Q Consensus 352 ~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n~~~~~~~~~~~----~~~~~g~ 424 (424) .+.+.+..-+..|..+.++.+.++ .|+++.--+.+++++-+.+..+.-.+.............. ++.++.. 
T Consensus 373 ~i~v~f~~~~p~~~~~~a~~~~kl--~g~is~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 447 (453) T protein:vir:39 373 DIEYTFTRNEPKDIKEQAETANIL--MGITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQPSEKGTDTVV 447 (453) T ss_pred cceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCC Confidence 344555666778888899988887 5789998888888763322101000000000000000000 0000111 No 208 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=96.98 E-value=0.00023 Score=40.58 Aligned_cols=387 Identities=10% Similarity=0.001 Sum_probs=175.6 Q ss_pred CCCCCcccccCCCccHHHH--------------HHhhccCcccccccccccccccccc-ccc--------CCcc-c-cHH Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWAR--------------LKSWFVGGRLVTPNQGSQTGPVSAH-GYL--------GDSS-I-NDE 55 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~-~~~--------~~~~-~-~~~ 55 (424) |++-.||+..-...=+++. +........ ... ........+- ... .... . .+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~-~r~--~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~ 77 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENI-DNI--TMGERYYNHHPDILDAPFKRDVNGDYDETKPD 77 (478) T ss_pred CccccccCCchhhhHHHHHhhhccCChHHHHHHHHHHHHHHH-HHH--HHHHHHhcccccccccchhhhccccccccccc Confidence 8877777766554333322 222111000 000 0000000000 000 0000 0 000 Q ss_pred HHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEe Q lcl|NC_019710. 56 RILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVD 135 (424) Q Consensus 56 ~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~ 135 (424) .-+.++....+|+..+.-+-+-|+.+--. +. + ....+...+. 
| ........+..+...+|.+|+.+- T Consensus 78 ~ki~~n~~k~ivd~~~~yl~g~p~~~~~~--~~---~--~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~ 144 (478) T protein:vir:10 78 WRMYTNYHQNLVDQKVAYAVANPVTFGVD--ND---K--ALKQIQHTLN---H---KWDDKLVDILTAASNKGIEWVQPY 144 (478) T ss_pred ceeccchHHHHHHHHhhhhcccCceeecC--Ch---H--HHHHHHHHHh---c---cHHHHHHHHHHHHhhCCeEEEEEE Confidence 01224566677887777777777765211 11 1 1122333332 2 345666777899999999999998 Q ss_pred eCCCCceeEEEEeccceEEEEEcCC---ce---EEEEEecCc--eEEecHhHeeEecCc--------------------- Q lcl|NC_019710. 136 RNSAGDVISLLPLQSANMDVKLVGK---KV---VYRYQRDSE--YADFSQKEIFHLKGF--------------------- 186 (424) Q Consensus 136 r~~~G~~~~l~~l~p~~v~~~~~~~---~~---~~~~~~~~~--~~~~~~~evih~r~~--------------------- 186 (424) .+.+|.+ .+..++|..+.+..++. .. .+.|...+. ...+.++.|.+++.. T Consensus 145 ~d~~~~~-~~~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (478) T protein:vir:10 145 VDEEGEF-KTFRVPAEQAVPIWTNKERDELQAFIRVYELDGAERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQ 223 (478) T ss_pred ecCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEeeeCceEEEEEeCCcEEEEEecCCeeeccccccccccccceec Confidence 8888876 57778888888766532 11 111111111 112334444333210 Q ss_pred -----C---------CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhC Q lcl|NC_019710. 187 -----G---------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAG 252 (424) Q Consensus 187 -----~---------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~ 252 (424) . 
.+...|.|.+..+...++....+..-..+.+...+.|-.+++-.......+....++ T Consensus 224 ~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~-------- 295 (478) T protein:vir:10 224 GNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNLK-------- 295 (478) T ss_pred ccccccCCcceEEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCcccccchhhhhh-------- Confidence 0 012357888887777777766666666656666666665554321111111111111 Q ss_pred CcccCcceec--CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHH Q lcl|NC_019710. 253 GPVKKRLWIL--EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNL 320 (424) Q Consensus 253 ~~~ag~~~~l--~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~ 320 (424) ..+++.+ +.|.++..+........+.+..+...+.|...-++|..-..... ++.+....+ .... T Consensus 296 ---~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~ 371 (478) T protein:vir:10 296 ---YYKAISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFG-NSPSGIALKFMYSNLDLKANKLK 371 (478) T ss_pred ---hCceeEecCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCccccc-cchHHHHHHHHHHHHHHHHHHHH Confidence 0123333 23333443433334555677788888899999999853222111 222211111 0111 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcC- Q lcl|NC_019710. 321 GFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGD- 399 (424) Q Consensus 321 ~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd- 399 (424) ..+...+.-+++.|.+.+. 
...+ ...+++.+..-+..|..+.++.+.++ +|+++.-.+.+++++-..+..+ T Consensus 372 ~~~~~~l~~~~~li~~~~~----~~~d--~~~i~i~f~~~~p~~~~e~~~~~~~~--~g~iS~et~i~~~~~v~d~~~E~ 443 (478) T protein:vir:10 372 NKTLTALQELLQYIIDFYR----LDVR--VQDIEITFNFNVMVNELENSQIAMNS--TGLLSKETILGNHSWVQDPVAEM 443 (478) T ss_pred HHHHHHHHHHHHHHHHHhC----CCcc--cccceEEeCCCCCCCHHHHHHHHHHH--hCCCChHHHHHhCCCCCCHHHHH Confidence 2222222222222222111 1111 12334444555667788888887776 5889887777777652211000 Q ss_pred ------eeeecccccchhh---ccccCCCccCCC Q lcl|NC_019710. 400 ------VAMRQSQYVPITD---LGTNKEPRNNGA 424 (424) Q Consensus 400 ------~~~~~~n~~~~~~---~~~~~~~~~~g~ 424 (424) +--.......+.. -.+..++.++++ T Consensus 444 ~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~d~~~ 477 (478) T protein:vir:10 444 ERIEQENIELNQQLPDIEEGLNDEQQRQSEDNQS 477 (478) T ss_pred HHHHHHHHHHHHhccccCCCCcccccccCcCCCC Confidence 0000001111111 112234445555 No 209 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=96.85 E-value=0.0003 Score=39.91 Aligned_cols=380 Identities=11% Similarity=0.036 Sum_probs=166.3 Q ss_pred CCCCCcccccCC---------------CccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHh----hhH Q lcl|NC_019710. 1 MEEPKYTIDLRT---------------NNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERIL----QIS 61 (424) Q Consensus 1 ~~~~~~~~~~~~---------------~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~ 61 (424) .+=|...|.-.. ...-++++..+..+...... ... . ........+ .+. 
T Consensus 2 ~~~p~~~l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~-~~~----------~--~~~~~~~~~~~~~~~n 68 (479) T protein:vir:99 2 IDLPDEDLSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQEVPD-LAT----------R--HKNKEREVLQQLSRKP 68 (479) T ss_pred ccCCcccCChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCcccc-ccc----------c--cCChhHHHHHHHhhcC Confidence 333333333221 11223333334433221100 000 0 000001111 123 Q ss_pred HHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee----- Q lcl|NC_019710. 62 TVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR----- 136 (424) Q Consensus 62 ~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r----- 136 (424) +...+|+.++..+---.|+ .. +++ ....+.+++.. |. .......+..+++.+|.||+++-. T Consensus 69 ~~~~iVd~~~~~l~~~gf~---~~-d~~-----~~~~~~~i~~~--N~---~d~~~~~~~~~a~~~G~af~~v~~~~~~~ 134 (479) T protein:vir:99 69 WMGLMVNSFAQQLIVDGYR---KT-GTN-----ENAKGWDTWRL--NQ---MDKQQFWLNRAVLTFGYAFIKVTSGISPL 134 (479) T ss_pred cHHHHHHHHHhhccccccc---CC-Cch-----hhHHHHHHHHh--cC---hhHHHHHHHHHHhhcCceEEEEecCCCCc Confidence 4555666666544322222 21 111 12234555542 21 235667788899999999998764 Q ss_pred CCCCceeEEEEeccceEEEEEcCCce----EEE-------------------EEecCceEEe-c-----HhH--eeEecC Q lcl|NC_019710. 137 NSAGDVISLLPLQSANMDVKLVGKKV----VYR-------------------YQRDSEYADF-S-----QKE--IFHLKG 185 (424) Q Consensus 137 ~~~G~~~~l~~l~p~~v~~~~~~~~~----~~~-------------------~~~~~~~~~~-~-----~~e--vih~r~ 185 (424) +..|.+ .+..++|..+....++... .|. +..+.....+ . -.. |++|++ T Consensus 135 d~~g~~-~i~~~~p~~~~~iydd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n 213 (479) T protein:vir:99 135 DGTTVA-RIKCIDPRDAFAIWEDPYWDEWPKYLLERQPNGQYWWWTEEDYSIFEFKQGKFIYRETVSHDYGHIPFVRYVN 213 (479) T ss_pred CCCCce-EEEEechhheEEEecCCcccceeeEEEeecCceeEEEEecceEEEEEecCCceeeccccccCCCCcceEEeec Confidence 344544 4667788887765543211 111 1111111011 0 011 456655 Q ss_pred cCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceec-CC Q lcl|NC_019710. 
186 FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWIL-EA 264 (424) Q Consensus 186 ~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l-~~ 264 (424) ......+|.|-+..+...++.......-..+.+.-.+.|..++.-. ........+. ..+.. ..++++.+ ++ T Consensus 214 ~~~~~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~-~~~~~~~~~~--~~~~~-----~~~~i~~~~~~ 285 (479) T protein:vir:99 214 VMDLRGVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGL-MLPEGANADQ--EKMRF-----AQESMLISQNE 285 (479) T ss_pred CCCcCcCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCC-Ccccccccch--hcccc-----ccccceeecCC Confidence 4322346889888777777766666555555555556666555421 1111110000 01111 11234433 55 Q ss_pred CceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHH----------HHHHHHHHHHHHHHHHH Q lcl|NC_019710. 265 GFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ----------QNLGFLQYTLQPYISRW 334 (424) Q Consensus 265 g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~----------~~~~f~~~tl~P~~~~i 334 (424) +.++.++.... --.+.+..+....+|+..=++|+..+|...+ .|...... .....+...+.-+++.+ T Consensus 286 ~~~~~q~~~~~-~~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n--~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~ 362 (479) T protein:vir:99 286 KASFGAIPAAP-LDGLLNAYKESLLEFLALAQLPPHIAGQIVN--VAADALAAGTRQTMQKLFEKQATWKASHNQTMRLV 362 (479) T ss_pred CceEEEecccc-hHHHHHHHHHHHHHHhccCCCCHHHcccccc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 56766665322 2336777778888899999999999986432 11111111 11111111222222222 Q ss_pred HHHHhhhccChhhh-ccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh-CCCCC--CCc----------Ce Q lcl|NC_019710. 335 ENSIQRWLIPAKDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTD-NLPPL--PGG----------DV 400 (424) Q Consensus 335 e~~l~~~L~~~~~~-~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~l-g~~p~--~gg----------d~ 400 (424) .. +...... ....+.+.+......+..+.++.+.+++++|+++.-.+.+++ |+.+. +-- +. 
T Consensus 363 ~~-----~~~~~~~~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~is~et~l~~l~gv~~~~~e~~~~~~~~~~~~~~ 437 (479) T protein:vir:99 363 NK-----IEGRTEEATDLDFTITWQDVTIQSLAQFADAWAKMVESLKIPAEGVWDMIPNLDQSTVNGWKEIYDREGDFGK 437 (479) T ss_pred HH-----HcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHH Confidence 11 1111111 112344444555566788899999999999998887777776 77542 100 00 Q ss_pred eeeccc--ccchh------hccccCCCcc-CCC Q lcl|NC_019710. 401 AMRQSQ--YVPIT------DLGTNKEPRN-NGA 424 (424) Q Consensus 401 ~~~~~n--~~~~~------~~~~~~~~~~-~g~ 424 (424) ...... ..+.. ...+.++..+ .|. T Consensus 438 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 470 (479) T protein:vir:99 438 YMRKLQNGPDPAEQRGGPNGATNMQQANNKTGE 470 (479) T ss_pred HHHHHhcccCcccccCCCCCCCCCCCCCCCCcc Confidence 000000 00000 0000000000 000 No 210 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=96.79 E-value=0.00033 Score=39.64 Aligned_cols=383 Identities=8% Similarity=-0.016 Sum_probs=160.6 Q ss_pred CCCCCcccccCCCcc--------------HHHHHH--------------hhccCcccccccccccccccccccccCCccc Q lcl|NC_019710. 1 MEEPKYTIDLRTNNG--------------WWARLK--------------SWFVGGRLVTPNQGSQTGPVSAHGYLGDSSI 52 (424) Q Consensus 1 ~~~~~~~~~~~~~~G--------------~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 52 (424) |.+..|+.....-.. ++.++. .+..+.... .........-...... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i------~~~~~~~~~~~~~~~~ 74 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILRLITKHKENVEDITVGERYYNHQPDV------LFNAPKRNVKGEIDPF 74 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc------cccccccccccccccc Confidence 666555544332211 111221 122111100 0000000000000000 Q ss_pred cHHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEE Q lcl|NC_019710. 
53 NDERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYA 132 (424) Q Consensus 53 ~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~ 132 (424) .+..=+.++....+++..+.-+-+-|+.+-- .+. + ....+...+. | +.......+..+.+.+|.+|+ T Consensus 75 ~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~--~d~---~--~~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~ 141 (468) T protein:vir:96 75 KPDWRMYTNYHQNLVDQKVAYAVANPVTYGT--EDE---K--SLKTIQEVLN---H---KWDDKLVDILTAASNKGVEWI 141 (468) T ss_pred ccccccccchHHHHHHHHHhhhccCCceecc--CCh---H--HHHHHHHHHh---c---CHHHHHHHHHHHHhhcCeEEE Confidence 0011122445555666666666566665421 111 1 1123444442 2 344566778899999999999 Q ss_pred EEeeCCCCceeEEEEeccceEEEEEcCCc---e---EEEEEecC--ceEEecHhHeeEecCc------------------ Q lcl|NC_019710. 133 LVDRNSAGDVISLLPLQSANMDVKLVGKK---V---VYRYQRDS--EYADFSQKEIFHLKGF------------------ 186 (424) Q Consensus 133 ~~~r~~~G~~~~l~~l~p~~v~~~~~~~~---~---~~~~~~~~--~~~~~~~~evih~r~~------------------ 186 (424) .+..+.+|.+ .+..++|..+.+..+... . .+.|...+ ....+.++.+.+++.. T Consensus 142 ~v~~d~~~~~-~i~~~~p~~~~~v~~~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (468) T protein:vir:96 142 QPYVDEQGEF-KTFRVPAEQAIPIWTNKERDELKAFIRLYELDGGERVEYWTANDVTFYELKDGQLIPDYYQGEEHVQAH 220 (468) T ss_pred EEEEcCCCce-EEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCceeecccccccccccc Confidence 9888888865 577788888876655321 1 11111111 1111223333222110 Q ss_pred -------------C----CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHH Q lcl|NC_019710. 187 -------------G----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKE 249 (424) Q Consensus 187 -------------~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~ 249 (424) + .+...|.|-+..+...++....+..-..+.++..+.|-.+++-.......+.. .. 
T Consensus 221 ~~~~~~~~~~~~iPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~-------~~ 293 (468) T protein:vir:96 221 YYVGNKSMSWNRVPFIPFKNNPQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGEDLEEFM-------YN 293 (468) T ss_pred eeeccccccCCcccEEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccccchhh-------hh Confidence 0 02345788877777777777766666666667777777666532111111111 11 Q ss_pred HhCCcccCcceecC--CCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------H Q lcl|NC_019710. 250 IAGGPVKKRLWILE--AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------Q 317 (424) Q Consensus 250 ~~~~~~ag~~~~l~--~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~ 317 (424) . ..++++.++ ++.+.+.+........+....+...+.|...-++|..-... ..++.+....+ . T Consensus 294 ~----~~~~~i~~~~d~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~-~~~n~Sg~Alk~~~~~l~~k~~ 368 (468) T protein:vir:96 294 L----KYYKAINVDGDGSGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQDK-FGNSPSGIALKFMYSNLDLKAN 368 (468) T ss_pred h----hcCceEEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCcccccccc-cccchHHHHHHHHHHHHHHHHH Confidence 1 112344442 33334434333334445667788888899988988532211 11222211111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Q lcl|NC_019710. 318 QNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG 397 (424) Q Consensus 318 ~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g 397 (424) .....+...+.-+++.|...+. ...+. ..+.+.++.-+..|..+.+ +.+.+.|+++.-.+.+.++.-..+. T Consensus 369 ~k~~~~~~~l~~~~~li~~~~g----~~~d~--~~i~i~f~~~~p~d~~e~a---~~~~~~g~iS~et~i~~l~~v~D~~ 439 (468) T protein:vir:96 369 KLKNKTLTALQELLQYIIDFYK----LSIKV--QDVEITFNFNVMVNELEQS---QIGVNSQYLSKETVVTNHPWVDDPV 439 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHhC----CCccc--ceeeEEecCCCCcCHHHHH---HHHHhcCCCchHHHHHhCCCCCCHH Confidence 1111222222222222222111 11111 2233333444455555444 4456679999888888775522111 Q ss_pred cCeeeecccccchhhc--cccCCCccCCC Q lcl|NC_019710. 
398 GDVAMRQSQYVPITDL--GTNKEPRNNGA 424 (424) Q Consensus 398 gd~~~~~~n~~~~~~~--~~~~~~~~~g~ 424 (424) .+.-.+.......... .-..+..++.. T Consensus 440 ~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 468 (468) T protein:vir:96 440 AEMERIDQEELALPSIEEGLNGKENNEPT 468 (468) T ss_pred HHHHHHHHHHHHHHHHhhccCCCCCCCCC Confidence 0000000000000000 00001111111 No 211 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=96.71 E-value=0.00039 Score=39.26 Aligned_cols=396 Identities=11% Similarity=0.064 Sum_probs=166.4 Q ss_pred CCCCCcccccCCC---------ccH-----------HHHHHhhccCcccccccccccccccccccccCCccccHHHHhhh Q lcl|NC_019710. 1 MEEPKYTIDLRTN---------NGW-----------WARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQI 60 (424) Q Consensus 1 ~~~~~~~~~~~~~---------~G~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 60 (424) -..+...+.+++. .-+ +.++..+..+....-..... ...... -+..-+.. T Consensus 6 ~~~~~~~~~~~~~~~~l~~~~i~~li~~~~~~~~~r~~~l~~YY~g~~~~i~~~~~---------~~~~~~-~~~~ki~~ 75 (506) T protein:vir:94 6 TEHKQANLIYQESLENLTPNKIMKFITHHFNYQRPRLEMLDDYYQGYNLKILDKQS---------RRHEDG-KADHRATH 75 (506) T ss_pred hhhhcceeecccchhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccccc---------cccccc-CCcceeec Confidence 0011111111111 011 22233333222111000000 000000 00011234 Q ss_pred HHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019710. 61 STVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G 140 (424) +.....|+..+.-+-+-|+++--. +.. ....+..++.. 
| ........+..+++.+|.||+++..+.+| T Consensus 76 n~~~~Iv~~~~~~l~G~p~~~~~~--d~~-----~~~~l~~~~~~--N---~~~~~~~~~~~~~~~~G~a~~~v~~ded~ 143 (506) T protein:vir:94 76 SFAKYIADFQTSYSVGNPINVKLP--DDG-----SNSGFDTFNKA--N---DVDAENYDLFLDMSRYGRAYEYVYRGEDN 143 (506) T ss_pred chHHHHHHHhhhhhcccCceeecC--cch-----HHHHHHHHHhc--c---CHhHHHHHHHHHHHhcCeEEEEEEecCCC Confidence 566667777777766667664211 111 11234444432 2 34456677888899999999999888888 Q ss_pred ceeEEEEeccceEEEEEcCCc---eE---EEEE----ecCc-------eEEecHhHeeEecC-----------------c Q lcl|NC_019710. 141 DVISLLPLQSANMDVKLVGKK---VV---YRYQ----RDSE-------YADFSQKEIFHLKG-----------------F 186 (424) Q Consensus 141 ~~~~l~~l~p~~v~~~~~~~~---~~---~~~~----~~~~-------~~~~~~~evih~r~-----------------~ 186 (424) .+ .+-.++|..+.+..++.. .. +.|. .+.. ...+.+..+.++.. . T Consensus 144 ~~-~i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~~~~~~~~~~~g~v 222 (506) T protein:vir:94 144 EE-HLAKLDPLDTFVIYSTDVDPKPIMAVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIMGKMQVDTTKPITTF 222 (506) T ss_pred ee-EEEEEcccceEEEecCCCCCceEEEEEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCccceeccccccCCcc Confidence 76 566788888887765422 11 0010 0000 00122222222110 0 Q ss_pred C----CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCC---------------------CCHHHHH Q lcl|NC_019710. 187 G----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKV---------------------LTEQQRS 241 (424) Q Consensus 187 ~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~---------------------~~~~~~~ 241 (424) + .+.-.|.|.+......++....+..-..+.....+.|-.+++-.... 
......+ T Consensus 223 Pvv~~~n~~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 302 (506) T protein:vir:94 223 PVVEFKNSNFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLE 302 (506) T ss_pred ceEEecCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhH Confidence 0 01123666666666666555544444443333333333333211000 0011111 Q ss_pred HHHHHHHH-HhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH---- Q lcl|NC_019710. 242 QVEENFKE-IAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE---- 316 (424) Q Consensus 242 ~~~~~~~~-~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e---- 316 (424) .++..... .......+.+...+.+.+++-+..+.....+....+.....|...-++|..-.... .++.+..... T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Aik~~~~ 381 (506) T protein:vir:94 303 LIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENF-ASNSSGVAMQYKVL 381 (506) T ss_pred HHhhhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccccc-cccchHHHHHHHHH Confidence 11111110 11111111222223344555555544555667778888999999999996322211 1222211111 Q ss_pred ------HHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh Q lcl|NC_019710. 317 ------QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTD 390 (424) Q Consensus 317 ------~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~l 390 (424) ......+...+...++.|..-+... 
-.........+++.+..-+..|..+.++.+.++ .|+++...+++++ T Consensus 382 ~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~-~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~~l 458 (506) T protein:vir:94 382 GTVELASTKRRMFERGLYARYQIISDIENSI-HGDWTFDPQELTFTFRDNLPADNISQIKALVQA--GATLPQKYLYQQL 458 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhC Confidence 1223344444555554444443311 010111122345555667778888999988888 4899999999888 Q ss_pred CCCCCCCcCeeeecc-------cccchhhccccCCCccCCC Q lcl|NC_019710. 391 NLPPLPGGDVAMRQS-------QYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 391 g~~p~~ggd~~~~~~-------n~~~~~~~~~~~~~~~~g~ 424 (424) ++-..|.-+.-.+-. ........++.++ .+..+ T Consensus 459 p~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~-~~~~~ 498 (506) T protein:vir:94 459 PGVTNPQDIVDMMKEQSANGDYSFDQNGVISNDGQ-TNTTA 498 (506) T ss_pred CCCCCHHHHHHHHHHHHHHHhhcchhhcCCCcccC-ccccc Confidence 663321100000000 0000000001111 11111 No 212 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=96.62 E-value=0.00046 Score=38.89 Aligned_cols=372 Identities=10% Similarity=0.024 Sum_probs=167.1 Q ss_pred CCCCC---cccccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhC Q lcl|NC_019710. 1 MEEPK---YTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACL 77 (424) Q Consensus 1 ~~~~~---~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~ 77 (424) |.... ..=..+.+.--+.++..+..+.... .... ....... 
..-+.++....+|+..+.-+-+- T Consensus 1 l~~~~l~~~i~~~~~~~~r~~~l~~yy~g~~~i------l~~~----~~~~~~~---~~ki~~n~~~~ivd~~~~~l~g~ 67 (429) T protein:vir:98 1 MTKDLLSELIQKHRSFNLSYSAYKQLYEGDHAI------LQQK----QKEQYKP---DNRLVVNFAKYIVDTFNGYFIGV 67 (429) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------cccc----ccccCCC---cceeecchHHHHHHHHhhhhccc Confidence 11000 0000111222233333333332110 0000 0000000 01123456777888888877777 Q ss_pred ceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEEEEE Q lcl|NC_019710. 78 PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMDVKL 157 (424) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~~~~ 157 (424) |+.+--.+ + . ....+..++. . | ........+..+.+.+|.||+++..+.+|.+ .+-.++|..+.+.. T Consensus 68 ~~~~~~~~-~----~--~~~~l~~~~~-~-n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~~~~~~p~~~~~v~ 134 (429) T protein:vir:98 68 PVQTSHEN-K----Q--VSNYLELLDG-Y-N---DQDDNNAELSKICSIYGHGYELVFNDENAEA-GITYLTPLEAFIVY 134 (429) T ss_pred CceeecCC-h----H--HHHHHHHHHh-h-c---CHhHHHHHHHHHHhhcCeEEEEEEecCCCcE-EEEEEcccceEEEE Confidence 76652111 1 0 1122334333 2 2 2346677888999999999999999999986 46778888887665 Q ss_pred cCCc---eE---EEEEe-cCc-eEEecHhH--------------------------eeEecCcCCCCccccchHHHHHHH Q lcl|NC_019710. 158 VGKK---VV---YRYQR-DSE-YADFSQKE--------------------------IFHLKGFGFTGLVGLSPIAFACKS 203 (424) Q Consensus 158 ~~~~---~~---~~~~~-~~~-~~~~~~~e--------------------------vih~r~~~~~~~~G~s~~~~~~~~ 203 (424) ++.. .. ..+.. +.. ...+...+ |++++ +...|.|-+..+... 
T Consensus 135 dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~----n~~~g~sd~e~v~~l 210 (429) T protein:vir:98 135 DDSIRQKPLFAVRYFYNKGGVLEGSYSDASNITYFKDGEKGIEIGESEPHPFDGVPMIEYV----ENEERQSLLASVVTL 210 (429) T ss_pred eCCCCCceEEEEEEEEecCceEEEEEEeCceEEEEEecCCceEecccccccCCccceEEec----CCCCCCCcHHHHHHH Confidence 5321 11 01110 000 00111111 22222 234688888887787 Q ss_pred HHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCC----ceeeeccCChhHHH Q lcl|NC_019710. 204 AGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAG----FSTSAIGVTPQDAE 279 (424) Q Consensus 204 i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g----~~~~~l~~s~~d~~ 279 (424) ++....+..-..+.+...+.|-.+++-. ... ++....+ . .++++.++++ .+...+........ T Consensus 211 iD~~d~~~s~~~~~~~~~~~p~~~i~g~-~~~-~~~~~~~-------~----~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 277 (429) T protein:vir:98 211 INAFNKAISEKANDVEYFADAYLKILGA-ELD-DETLKSL-------R----DTRIINLKDTDAQQLTVEFLQKPDADAT 277 (429) T ss_pred HHHHHHHHHHHHHHHHHhcCceeeeecC-CCC-cchhhhH-------h----hCceeeccCCCCCCcceeEEeecCCHHH Confidence 8777777666666677777787776532 222 2211111 1 1234444321 23333333333333 Q ss_pred HHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHH----------HHHHHHHHHHHHHHHHHHHHhhhccChhhhc Q lcl|NC_019710. 280 MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ----------NLGFLQYTLQPYISRWENSIQRWLIPAKDVG 349 (424) Q Consensus 280 ~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~----------~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~ 349 (424) +.+..+...+.|+..-++|..-... -++.+....+-. ....+...+.-+++.+..-++.. ..... 
T Consensus 278 ~~~~~~~l~~~i~~~s~~p~~~~~~--~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~---~~~~d 352 (429) T protein:vir:98 278 QEHLLDRLENLIFRTAMVANISDES--FGTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSK---IGPKD 352 (429) T ss_pred HHHHHHHHHHHHHHHhCccccCccc--cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC---CCccc Confidence 5566788899999999998432221 122221111110 01111222222222222211110 00111 Q ss_pred cceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccchhhccc-cCCCccCCC Q lcl|NC_019710. 350 RIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDLGT-NKEPRNNGA 424 (424) Q Consensus 350 ~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n~~~~~~~~~-~~~~~~~g~ 424 (424) ...+++.+...+..|..+.++.+.++ .|+++..-+.++++.-+.|..+.-.+...... .... +.....+.. T Consensus 353 ~~~i~v~f~~~~p~~~~~~a~~~~kl--~g~is~et~~~~l~~v~d~~~E~~ri~~E~~~--~~~~~~~~~~~~~~ 424 (429) T protein:vir:98 353 WIGIKYKFTRNLPANLLEESQIAGNL--AGIVSEETQVGVLSIVENPQKEIERKNSDKST--LISRQAGGLNGQNT 424 (429) T ss_pred cccceEEeCCCCCcCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHHHHHHHHHHHHHH--HHHHHHhhhcCCCC Confidence 12345555667778888899988887 58899877888887643221110000000000 0000 000000000 No 213 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=96.53 E-value=0.00054 Score=38.51 Aligned_cols=397 Identities=10% Similarity=0.029 Sum_probs=172.0 Q ss_pred CCCCCcccccCCC----ccHHHHHHhhccCccccccc--cccccc--ccc-cccccCCccccHHHHhhhHHHHHHHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTN----NGWWARLKSWFVGGRLVTPN--QGSQTG--PVS-AHGYLGDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 1 ~~~~~~~~~~~~~----~G~~~~~~~~~~~~~~~~~~--~~~~~~--~~~-~~~~~~~~~~~~~~~~~~~~v~~~i~~ia 71 (424) |+-+|. |.+-.. ...+.++.+...... .+.. .....+ .+. ......+.. 
+ .-+.++.....|+..+ T Consensus 3 ~~~~~~-~~~~~~~~~~~~~i~~~i~~~~~~~-~r~~~~~~yy~g~~~i~~~~~~~~~~~-~--~ki~~n~~~~ivd~~~ 77 (453) T protein:vir:73 3 LKPIKL-MTYSRDEEITDKVVNDFMKKHQEEV-ERYEYLGNMYKGIMEISSQKAKDSWKP-D--NRLTNNFAKYIVDTFV 77 (453) T ss_pred ccccee-eeccccccCCHHHHHHHHHHHHHHH-HHHHHHHHHhccccchhcCCCCCccCc-c--ceeecchHHHHHHHhh Confidence 222221 111111 123333332221100 0000 000000 000 000000000 0 0112345566667666 Q ss_pred HhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccc Q lcl|NC_019710. 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSA 151 (424) Q Consensus 72 ~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~ 151 (424) .-+-+-|+.+-- + +. . ....+...+.. | ........+..+.+.+|.||.++..+.+|.+ .+-.++|. T Consensus 78 ~~l~g~~~~~~~-~-d~---~--~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~ 144 (453) T protein:vir:73 78 GYFNGIPIKKTH-D-DK---S--VLEAMQLFDNL--N---DMEDEESELAKIACVYGRAYELMYQNESTES-EVIYCSPL 144 (453) T ss_pred hhhcccCceeec-C-Ch---H--HHHHHHHHHHh--c---ChhHHHHHHHHHHHhcCeEEEEEEeCCCCce-EEEEEccc Confidence 666555665421 1 11 0 11223333321 2 3445667788999999999999999888877 46677888 Q ss_pred eEEEEEcCCc-e------EEEEEecCc--eEEecHhHeeEecCcC---------------------CCCccccchHHHHH Q lcl|NC_019710. 152 NMDVKLVGKK-V------VYRYQRDSE--YADFSQKEIFHLKGFG---------------------FTGLVGLSPIAFAC 201 (424) Q Consensus 152 ~v~~~~~~~~-~------~~~~~~~~~--~~~~~~~evih~r~~~---------------------~~~~~G~s~~~~~~ 201 (424) .+.+..++.. . .|.+...+. ...+.++.+++++... .+...|.|.+..+. 
T Consensus 145 ~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~s~~~~v~ 224 (453) T protein:vir:73 145 NVFMVYDDSIKQKPLFAVYYGFDEEGNLSGTVYTLLETISITGKAGEVKFGESTYNVYSDLPIVEYNFNEERQSIFEPVH 224 (453) T ss_pred ceEEEEeCCCCceeEEEEEEEEecCceEEEEEEeCCeEEEEEecCCceEEccceeccCCceeEEEecCCCCCCcchhhHH Confidence 8877665432 1 111111111 1123444444432110 01235778788777 Q ss_pred HHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHH Q lcl|NC_019710. 202 KSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMM 281 (424) Q Consensus 202 ~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~ 281 (424) ..++....+..-..+.....+.|..+++-. .. .++....++...-........+.....+.+.++.-+.....+..+. T Consensus 225 ~liDa~~~~~S~~~~~~~~~~~~~l~~~g~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~ 302 (453) T protein:vir:73 225 SLINSYNKVTSEKANDVEYFSDQYLVFLGA-EV-DEEDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTE 302 (453) T ss_pred HHHHHHHHHHHHHHHHHHHhccceeeeecC-CC-CchhhhcccccccccccccccccccccccCceeEEeeecCCHHHHH Confidence 777776666666666666667777776532 22 2222222222111111111222333344455555554444455566 Q ss_pred HHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHH----------HHHHHHHHHHHHHHHHHHHHHhhhccChhhhccc Q lcl|NC_019710. 282 ASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ----------QNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRI 351 (424) Q Consensus 282 e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~----------~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~ 351 (424) ...+...+.|+..-++|.. +...-++.+....+. .....+...+.-.++.+..-+... -...+ .. 
T Consensus 303 ~~~~~l~~~I~~~s~~p~~--~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~-~~~~~--~~ 377 (453) T protein:vir:73 303 NLLNRLERSIFQFTMAANI--SDENFGNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNA-SNKDA--WK 377 (453) T ss_pred HHHHHHHHHHHHHhCCccc--CcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-CCccc--cc Confidence 6778888889888888842 221112222211211 111222333333333332222111 01111 12 Q ss_pred eeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccccc---chhhccccCCCccCCC Q lcl|NC_019710. 352 HAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQYV---PITDLGTNKEPRNNGA 424 (424) Q Consensus 352 ~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n~~---~~~~~~~~~~~~~~g~ 424 (424) .+++.+..-+..|..+.++.+.++. |+++.--+.+++++-+.+..+.-.+...-. .....+....+.+.-+ T Consensus 378 ~i~v~f~~~~p~~~~~~a~~~~k~~--giis~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 451 (453) T protein:vir:73 378 DIEYTFTRNEPKDIKEQAETANILK--GITSEETALSVISVIPDVQAEMEKIKKKKLLQLSLTRTSNLVRMKQMRG 451 (453) T ss_pred cceEEeCCCCCCCHHHHHHHHHHHh--ccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCCcchhhhc Confidence 3455556667788899999998886 788887777777663321111000000000 0000000001111111 No 214 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=96.40 E-value=0.00066 Score=38.02 Aligned_cols=375 Identities=8% Similarity=0.026 Sum_probs=171.9 Q ss_pred cccCCCccHHHHHHh--------------hccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHh Q lcl|NC_019710. 8 IDLRTNNGWWARLKS--------------WFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 8 ~~~~~~~G~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ 73 (424) |.+.+=.-++++... ...+....-....... ............ 
.+..-+.+......|+..+.- T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~-~~~~~~~~~~~~-~~~~ki~~n~~k~Iv~~~~~y 78 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKA-KLNKEGKKDPLR-SADNRIPSNFYQLLVDQEAGY 78 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchh-cccccccccccc-cCCcccccchHHHHHHhhhhh Confidence 444444344443332 2222110000000000 000000000000 000011233445566666666 Q ss_pred hhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceE Q lcl|NC_019710. 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) Q Consensus 74 ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v 153 (424) +-+-|+.+--.+ + + ....+...++. +..+-...+..++..+|.||.++-.+.+|.+ .+..++|..+ T Consensus 79 l~G~p~~~~~~d-~---~---~~~~l~~~~~~------~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~-~~~~~~p~~~ 144 (470) T protein:vir:10 79 VASVFPDIDVGK-D---A---DNKKIIDVLGD------DRALTLNGLLVDSSNAGRAWLHYWIDEDGNF-RYGIIQPDQI 144 (470) T ss_pred eeccceeeecCc-h---H---HHHHHHHHHhh------hHHHHHHHHHHHHhhcCeeEEEEEecCCCce-EEEEEcccce Confidence 666676652211 1 1 11234444532 2334445678899999999999989988876 5777889888 Q ss_pred EEEEcCCc---e-----EEEEEecC--c----eEEecHhHeeEecCc--------------------------------- Q lcl|NC_019710. 154 DVKLVGKK---V-----VYRYQRDS--E----YADFSQKEIFHLKGF--------------------------------- 186 (424) Q Consensus 154 ~~~~~~~~---~-----~~~~~~~~--~----~~~~~~~evih~r~~--------------------------------- 186 (424) .+..++.. . +|...... . ...+.++.+.|++.. T Consensus 145 ~~v~d~~~~~~~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (470) T protein:vir:10 145 TPIYATTLDNKLLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHN 224 (470) T ss_pred EEEEcCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccC Confidence 88776432 1 11111111 0 112333344333210 Q ss_pred ----C----CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCc Q lcl|NC_019710. 
187 ----G----FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKR 258 (424) Q Consensus 187 ----~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~ 258 (424) + .+...|.|-+......++....+..-..+.+..-+.|-.+++--.....++... .... .+ T Consensus 225 ~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~----~~~~-------~~ 293 (470) T protein:vir:10 225 FGRVPFIEFSKNKYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMN----DLRK-------YK 293 (470) T ss_pred CCeeeEEEeecCCCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhh----hhhh-------cC Confidence 0 012357788888888888777777767777777777777776432211122211 1111 12 Q ss_pred ceec-------CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHH Q lcl|NC_019710. 259 LWIL-------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLG 321 (424) Q Consensus 259 ~~~l-------~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~ 321 (424) .+.+ .++++|..-..+ .-.+....+...+.|...-++|.. .....|+.|..... ..... T Consensus 294 ~i~~~~~~~~~~~~~~~lt~~~~--~~~~~~~~~~L~~~I~~~s~~p~~--~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~ 369 (470) T protein:vir:10 294 SIKINNTGNGDNSGVDKLQIDIP--VEARDDALKITRKNIFLFGQGIDP--ANFESSNASGVAIKMLYSHLELKAAKTQT 369 (470) T ss_pred eEeccCCCCCcCceeEEEeecCC--hHHHHHHHHHHHHHHHHHhCCCCC--CccccccchHHHHHHHHHHHHHHHHHHHH Confidence 2222 223445444443 334466677788888888888742 22111222221111 11122 Q ss_pred HHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC--cC Q lcl|NC_019710. 322 FLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG--GD 399 (424) Q Consensus 322 f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g--gd 399 (424) .+..++.-.++.|...++ ..+.....+.+.+..-+..|..+.++.+.++ +|+++.--+.+++++-..+. 
.+ T Consensus 370 ~~~~~l~~~~~~i~~~l~-----~~~~d~~~i~i~f~~~~p~d~~e~~~~~~~~--~g~iS~et~l~~~p~v~D~~~E~e 442 (470) T protein:vir:10 370 YFEHAINELVRAIMRYLN-----FSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVDDWQQELK 442 (470) T ss_pred HHHHHHHHHHHHHHHHhc-----ccCcccceeeEEeccCCCCCHHHHHHHHHHH--hccCcHHHHHHhCCCCCCHHHHHH Confidence 333333333333333222 1222234456666777888899999988887 58999888888876532211 00 Q ss_pred ee------eecccccchhhccccCCCccCCC Q lcl|NC_019710. 400 VA------MRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 400 ~~------~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) .. ..+.+. ...+ ..+...+.. T Consensus 443 ri~~E~~e~~~~~~-~~~~---~~~~~~dde 469 (470) T protein:vir:10 443 DLAKDKEENDPYSN-QADE---LNGKGVNDE 469 (470) T ss_pred HHHHHHHHHHHhhc-cccc---cCCCCCCCC Confidence 00 000000 0000 000011111 No 215 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=96.26 E-value=0.00082 Score=37.51 Aligned_cols=381 Identities=7% Similarity=0.025 Sum_probs=171.0 Q ss_pred CCCCC----ccc----------ccCCCccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHH Q lcl|NC_019710. 1 MEEPK----YTI----------DLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRC 66 (424) Q Consensus 1 ~~~~~----~~~----------~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ 66 (424) |++.- ..+ .++.+..-+.++..+..+...... ... ..... ...-+.++....+ T Consensus 5 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~------~~~----~~~~~---~~~ki~~n~~~~I 71 (499) T protein:vir:10 5 IDKDLLDDVNEPNIEAINYAIRELQNRKKRLDKLSDYYNGKQEIEK------HEF----DNATV---EAANVMVNHAKYI 71 (499) T ss_pred hhhhHHhhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhc------CCc----CcCCC---CcceeecchHHHH Confidence 21111 001 112222333444444433221100 000 00000 0001123445556 Q ss_pred HHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce---- Q lcl|NC_019710. 
67 VSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV---- 142 (424) Q Consensus 67 i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~---- 142 (424) |+..+.-+-+-|+.+--.+ .. ....+...+.. | .-..+...+....+.+|.||.++..+.+|.+ T Consensus 72 v~~~~~~l~g~p~~~~~~~--~~-----~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~ 139 (499) T protein:vir:10 72 TDMNVGFMTGNPVKYVAEK--GK-----NIDDILEVFNQ--I---DIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRD 139 (499) T ss_pred HHHHhhhhcccCceeecCC--hh-----HHHHHHHHHhh--c---CHhHHHHHHHHHHHhcCceEEEEEecccccccccc Confidence 7777766666676542111 11 11234444432 2 2345678888999999999999888877743 Q ss_pred ------------eEEEEeccceEEEEEcCCc-------eEEEEEe---cCc----eEEecHhHeeEecCc---------- Q lcl|NC_019710. 143 ------------ISLLPLQSANMDVKLVGKK-------VVYRYQR---DSE----YADFSQKEIFHLKGF---------- 186 (424) Q Consensus 143 ------------~~l~~l~p~~v~~~~~~~~-------~~~~~~~---~~~----~~~~~~~evih~r~~---------- 186 (424) ..+..++|..+.+..+... .+|++.. +.. ...+.++.|.+++.. T Consensus 140 ~~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~ 219 (499) T protein:vir:10 140 ELGNEKLTPNTELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDP 219 (499) T ss_pred cccccccccccceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCcce Confidence 3466778887776655322 1111111 111 112445555544210 Q ss_pred ------C-C---------CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHH-HHHHHHHHHHH Q lcl|NC_019710. 187 ------G-F---------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ-QRSQVEENFKE 249 (424) Q Consensus 187 ------~-~---------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~-~~~~~~~~~~~ 249 (424) + + +...|.|-+..+...++....+..-..+.+...+.|-.+++-. ...... 
....+ T Consensus 220 ~~~~~~~~~g~vPvv~~~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~-~~~~~~~~~~~~------ 292 (499) T protein:vir:10 220 IVYDGENLFGAVPIIEFRNNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGF-GLGDDKDDIQRL------ 292 (499) T ss_pred ecccccCCCCccceEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC-ccccccchhhhh------ Confidence 0 0 1234677777777777777766666666667777777776532 211111 11111 Q ss_pred HhCCcccCccee--cCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------H Q lcl|NC_019710. 250 IAGGPVKKRLWI--LEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------Q 317 (424) Q Consensus 250 ~~~~~~ag~~~~--l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~ 317 (424) +.+++.. .++|.+++.+........+.+..+...+.|...-++|..-.... .++.|....+ . T Consensus 293 -----~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~gn~Sg~Al~~~~~~l~~k~~ 366 (499) T protein:vir:10 293 -----KRGAIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKF-MGNVSGEAMKFKLFGLENLLS 366 (499) T ss_pred -----hhcceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhh-cccchHHHHHHHHHHHHHHHH Confidence 1122333 24555555555444444456667777888888888873211111 1111211111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Q lcl|NC_019710. 318 QNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPG 397 (424) Q Consensus 318 ~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~g 397 (424) .....+...+.-+++.+...++.. ........+++.+..-+..|..+.++.+.++ .|+++.--+.+++++-..+. 
T Consensus 367 ~k~~~~~~~l~~~~~li~~~~~~~---~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~ 441 (499) T protein:vir:10 367 IKQRYFFDGLRRRLKLIQTIVNIK---GANDDASGCKISLVANIPSNLSDVVNNVKNA--DGIIPRKYTYSWLPDVDNPQ 441 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcc---CCccccccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHH Confidence 122333333444444443333211 1111112344444566677888999999888 58898887777776532110 Q ss_pred --c--------------Ceeeecccccchhhccc---cCC-CccCCC Q lcl|NC_019710. 398 --G--------------DVAMRQSQYVPITDLGT---NKE-PRNNGA 424 (424) Q Consensus 398 --g--------------d~~~~~~n~~~~~~~~~---~~~-~~~~g~ 424 (424) . .+.....+..+...... .++ ..++++ T Consensus 442 ~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (499) T protein:vir:10 442 DVIDEMNQQDAETIKKNQEALRGQDPDRLELEDKQDDSSENDKEAGS 488 (499) T ss_pred HHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCCCcccCCCCCCCcc Confidence 0 01111111111100000 000 111111 No 216 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=96.07 E-value=0.001 Score=36.92 Aligned_cols=409 Identities=13% Similarity=0.128 Sum_probs=185.1 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCccccccccccccccccccccc------CCcc-------ccHHHHhhhHHHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYL------GDSS-------INDERILQISTVWRCV 67 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~-------~~~~~~~~~~~v~~~i 67 (424) |+|+-.-.+++...- .-.+.+.+.+.....+..+...+.. .+.. 
-..+..+.+|.|..|| T Consensus 1 ~~~~lfg~~i~~~~~-------~~~~~s~~~~~~~dg~~~~~~~~~~g~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av 73 (537) T protein:vir:10 1 MAQQLFGFSLQRAKK-------VPKGPSFVQKDSLDGSQPIVGGGYFGYSVDFDGTIRNDHELITRYREMVLNPECDSAV 73 (537) T ss_pred Cccccccceeecccc-------cccCCcccCCCcccccceeecccccccccccccccchHHHHHHHHHHHhhccchhhHH Confidence 888777777665411 1112222222211111111111111 1111 1133456689999999 Q ss_pred HHHHHhhhhC-----ceeEeeccccCccc-cccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC-- Q lcl|NC_019710. 68 SLISTLTACL-----PLDVFETDQNDNRK-KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA-- 139 (424) Q Consensus 68 ~~ia~~ia~~-----~~~~~~~~~~~~~~-~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~-- 139 (424) +.|.+.+.-+ |+.+--.+.+.+.. .........++|+ =-+....+ +.+++.|...|..|..++-|.. T Consensus 74 ~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~-ll~F~~~~----~e~fR~WYVDgRi~fhKiid~k~p 148 (537) T protein:vir:10 74 DDVVNETICGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILR-LLDFDNRA----YEIFRRWYVDGRLFFHKVIDPKKP 148 (537) T ss_pred HHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhheeeeEEEEEEEEeCCCc Confidence 9999987643 22221111111100 0001112222222 11112222 4557778899999988866433 Q ss_pred -CceeEEEEeccceEEEEEc-----CCce---------------EEEEEe------cCceEEecHhHeeEec--CcCCCC Q lcl|NC_019710. 140 -GDVISLLPLQSANMDVKLV-----GKKV---------------VYRYQR------DSEYADFSQKEIFHLK--GFGFTG 190 (424) Q Consensus 140 -G~~~~l~~l~p~~v~~~~~-----~~~~---------------~~~~~~------~~~~~~~~~~evih~r--~~~~~~ 190 (424) ..+.+|..|+|.+++..+. .... +|.|.. .+....++.+-|.... 
-...++ T Consensus 149 k~GI~ELr~lDPr~i~~vR~i~~~~~~~~~~~~~~~~v~~~~~eyf~ynp~g~~~~~~~~vkI~~dAI~y~hSGl~d~n~ 228 (537) T protein:vir:10 149 RQGLVELRYVDPRKIRKVTEYEAKRPEALRTQDLNQQLTQQSASYFLYNPKGLKNSTNQGMKIAPDSIAYCHSGIQDLNK 228 (537) T ss_pred cccceeeeeeCCccceeeEeecccCCccceEEecceeeeecccceeeeccccccccCCCceeccHhheeeecccceeCCC Confidence 3688999999999964332 1111 111221 2234556664443333 133455 Q ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC-HHHHHHHHHHHHHHhCC--------c--ccCcc Q lcl|NC_019710. 191 LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT-EQQRSQVEENFKEIAGG--------P--VKKRL 259 (424) Q Consensus 191 ~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~-~~~~~~~~~~~~~~~~~--------~--~ag~~ 259 (424) -+.+|-|..+.+.+....-++...--+--.-+.-+-|...+-+... ..+.+.++....++... . +..+. T Consensus 229 ~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~ 308 (537) T protein:vir:10 229 NMVLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKF 308 (537) T ss_pred CeeeeeehhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchh Confidence 6678889999888888887777776655555555566666555443 45555666666655421 0 11111 Q ss_pred e-ec----------CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHH------HH Q lcl|NC_019710. 260 W-IL----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL------GF 322 (424) Q Consensus 260 ~-~l----------~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~------~f 322 (424) + .+ ..|.+++.|.-. +.+.-++-..+..+.+..+++||.+-|....+-+.. 
.+.+..+ .| T Consensus 309 msMlEDyWLPRReGgrgTEItTLpGg-qnlgem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~G--r~~EItRDEiKF~KF 385 (537) T protein:vir:10 309 MSMLEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIG--RAAEITRDEVKFQKF 385 (537) T ss_pred hhhhhhhcccccCCCcccceeecccc-CCcChHHHHHHHHHHHHHHhCCCccccCCCCccccc--ccchhhHHHHHHHHH Confidence 1 11 124566665431 223334445677889999999999988654332221 2222211 22 Q ss_pred HHHHHHHHHHHHHHHHhhhccC-----hhhhc----cceeeecchhhhcc--C---HHHHHHHHHHHH--hCCCcCHHHH Q lcl|NC_019710. 323 LQYTLQPYISRWENSIQRWLIP-----AKDVG----RIHAEHNLDGLLRG--D---SASRAAFMKAMG--ESGLRTINEM 386 (424) Q Consensus 323 ~~~tl~P~~~~ie~~l~~~L~~-----~~~~~----~~~~~f~~~~~~~~--d---~~~~~~~~~~~~--~~g~~t~NE~ 386 (424) +..-=.-+...+.+-|..+|+. +.++. ...+.|..|+.... + ...|...++.+- -+-.++.+=+ T Consensus 386 I~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi 465 (537) T protein:vir:10 386 IARLRKRFSELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYI 465 (537) T ss_pred HHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHH Confidence 2221222334444445555442 23332 23444544433221 1 123333333321 1112333333 Q ss_pred HH------------H---------hCCCCCCC----c------CeeeecccccchhhccccCCC--ccCCC Q lcl|NC_019710. 387 RR------------T---------DNLPPLPG----G------DVAMRQSQYVPITDLGTNKEP--RNNGA 424 (424) Q Consensus 387 R~------------~---------lg~~p~~g----g------d~~~~~~n~~~~~~~~~~~~~--~~~g~ 424 (424) |+ + .|+=+-|. 
| ++++.|+...|-.+.+..+.| ..+|- T Consensus 466 ~k~ILr~tDeeI~~~~k~I~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 536 (537) T protein:vir:10 466 RTKVLKQTESEIKEIDKEIKQEIADGVIMDPQAMQAMEMGIGDEEPVPEGGEEPQTDPNSAVSPADQKRGE 536 (537) T ss_pred HHHHhccCHHHHHHHHHHHHHHhhCCCCCCcccccccccCCCCcccCCCCCCCcccCCccCCCCCCccCCC Confidence 22 1 12211121 1 222333333332221111111 11111 No 217 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=95.93 E-value=0.0012 Score=36.53 Aligned_cols=386 Identities=9% Similarity=0.000 Sum_probs=173.4 Q ss_pred CCCCC-cccccC----CCccHHHHHHhhccCccccccc---ccccc--cccccccccCCccccHHHHhhhHHHHHHHHHH Q lcl|NC_019710. 1 MEEPK-YTIDLR----TNNGWWARLKSWFVGGRLVTPN---QGSQT--GPVSAHGYLGDSSINDERILQISTVWRCVSLI 70 (424) Q Consensus 1 ~~~~~-~~~~~~----~~~G~~~~~~~~~~~~~~~~~~---~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~i 70 (424) |.-++ -++.+- ...-.+.++........ .+.. ..... ............ .. .-+..+.....|+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~-~r~~~~~~Yy~g~~~i~~~~~~~~~~-~~--~ki~~n~~~~ivd~~ 76 (452) T protein:vir:36 1 MKYKPPKLMTFSKDEPITVEVVTKFMEKHKLEV-ARYEYLKNMYLGIMAIDDEPAKDSWK-PD--NRLAVNFTKYIVDTF 76 (452) T ss_pred CcccCceeEEcCCccCCCHHHHHHHHHHHHHHH-HHHHHHHHHhccccccccCccccccC-cc--ceeecchHHHHHHHH Confidence 43221 112221 12356666655433211 1110 00000 000000000000 00 112235566677777 Q ss_pred HHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEecc Q lcl|NC_019710. 71 STLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQS 150 (424) Q Consensus 71 a~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p 150 (424) +.-+-+-|+.+--. +.. ....+..++.. 
| ........+..+.+.+|.||.++..+.+|.+ .+..++| T Consensus 77 ~~~l~g~~~~~~~~--d~~-----~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~-~i~~~~p 143 (452) T protein:vir:36 77 TGYFNGIPVKKSHS--DKE-----ILTKLQEFDNL--N---DMEDEESELAKMACIYGRAFEFLYQDEDTQT-NVVYNSP 143 (452) T ss_pred hhhhcccCceeecC--Chh-----HHHHHHHHHhh--c---ChhHHHHHHHHHHHhcCeEEEEEEecCCCee-EEEEEcc Confidence 77666666664211 111 11234444432 2 2445667788999999999999988888876 4777888 Q ss_pred ceEEEEEcCCc---eE---EEEEe-cCc--eEEecHhHeeEecCc-----------------C----CCCccccchHHHH Q lcl|NC_019710. 151 ANMDVKLVGKK---VV---YRYQR-DSE--YADFSQKEIFHLKGF-----------------G----FTGLVGLSPIAFA 200 (424) Q Consensus 151 ~~v~~~~~~~~---~~---~~~~~-~~~--~~~~~~~evih~r~~-----------------~----~~~~~G~s~~~~~ 200 (424) ..+.+..++.. .. +.+.. ... ...+.++.++++... + .+...|.|-+... T Consensus 144 ~~~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~g~sd~e~v 223 (452) T protein:vir:36 144 ENMFMVYDDTVKQEPLFAVRYGVDEDKKLQGEVYTLLETIKISGENDEISFGEGTYNPYPDLPVVEFYFNEERMSIFESV 223 (452) T ss_pred cceEEEEcCCCCCceEEEEEEEEecCceEEEEEEecCeEEEEEEcCCceEEecceeccCCcccEEEecCCCCCCcchHHH Confidence 88887665432 11 11111 111 112333333322110 0 1123577778777 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCC-----CceeeeccCCh Q lcl|NC_019710. 201 CKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEA-----GFSTSAIGVTP 275 (424) Q Consensus 201 ~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~-----g~~~~~l~~s~ 275 (424) ...++....+.....+.+...+.|-.+++- .... ++....++. ++++.++. +.++..+.... 
T Consensus 224 ~~liDa~d~~~s~~~~~~~~~~~p~~~~~g-~~~~-~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~l~~~~ 290 (452) T protein:vir:36 224 ISLVNAFNKAISEKANDVDYFSDQYLTFLG-AAVE-EEDLKNIRS-----------NRVINYYADGEGKNVDVKFLEKPD 290 (452) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeEeec-CCcC-chhhhhhhh-----------cceEEecCCCCccCCcceeEeecC Confidence 777777666666666666677777766652 2222 222111110 12333322 12333333333 Q ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------HHHHHHHHHHHHHHHHHHHHHHhhhccCh Q lcl|NC_019710. 276 QDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------QQNLGFLQYTLQPYISRWENSIQRWLIPA 345 (424) Q Consensus 276 ~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~~~~~f~~~tl~P~~~~ie~~l~~~L~~~ 345 (424) ....+....+...+.|+..-++|..-.+. .++.+....+ ......+...+...++.|..-+... -.. T Consensus 291 ~~~~~~~~~~~l~~~I~~~s~~p~~~~~~--~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~-~~~ 367 (452) T protein:vir:36 291 SDSQTENLLDRLTKLIFQTTMVANISDES--FGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNV-SNK 367 (452) T ss_pred CHHHHHHHHHHHHHHHHHHhCccccCccc--ccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-CCc Confidence 34555667788888999999998532221 1222211111 1112233334444444333322211 111 Q ss_pred hhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeec----------ccccchhhccc Q lcl|NC_019710. 346 KDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQ----------SQYVPITDLGT 415 (424) Q Consensus 346 ~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~----------~n~~~~~~~~~ 415 (424) . ....+++.+..-+..|..+.++.+.++ .|+++.--+.+++++-..+..+.-.+. .+..+ +.-+. T Consensus 368 ~--~~~~i~i~f~~~~p~d~~~~a~~~~k~--~g~iS~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~-~~~~~ 442 (452) T protein:vir:36 368 D--SWKDIEYTFTRNEPKDIKEQAETANIL--MGITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQP-SEKGT 442 (452) T ss_pred c--ccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccC-CCCcc Confidence 1 112334444566778888899888887 578998778888876432110000000 00000 00001 Q ss_pred cCCCccCCC Q lcl|NC_019710. 
416 NKEPRNNGA 424 (424) Q Consensus 416 ~~~~~~~g~ 424 (424) +++..++.- T Consensus 443 ~~~~~~~~~ 451 (452) T protein:vir:36 443 DTVVSETNE 451 (452) T ss_pred cccCccccC Confidence 111111111 No 218 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=95.24 E-value=0.0025 Score=34.89 Aligned_cols=372 Identities=7% Similarity=-0.008 Sum_probs=165.8 Q ss_pred cccCCCcc--------------HHHHHHhhccCcccccccccc-cccc-cccccccCCccccHHHHhhhHHHHHHHHHHH Q lcl|NC_019710. 8 IDLRTNNG--------------WWARLKSWFVGGRLVTPNQGS-QTGP-VSAHGYLGDSSINDERILQISTVWRCVSLIS 71 (424) Q Consensus 8 ~~~~~~~G--------------~~~~~~~~~~~~~~~~~~~~~-~~~~-~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia 71 (424) |++.+-.= .+.++..+..+.......... .... .............+..-+.++....+|+..+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 44433322 233344444332211000000 0000 0000000000000011122445556677666 Q ss_pred HhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC-CCceeEEEEecc Q lcl|NC_019710. 72 TLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS-AGDVISLLPLQS 150 (424) Q Consensus 72 ~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~-~G~~~~l~~l~p 150 (424) .-+-+-|+.+-- ++. +. ...+...+. | ........+...++.+|.||.++.++. 
+|.+ .+..++| T Consensus 81 ~yl~G~p~~~~~--~~~---~~--~~~l~~~~~---n---~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~~-~~~~~~p 146 (471) T protein:vir:10 81 AYALTYPPTFDV--DDK---KV--NDMIVDVLG---D---DYERISKQLCVNAGNAGIAWLHVWKDASDNSF-RYACVDS 146 (471) T ss_pred hhhcccCceecc--CCh---HH--HHHHHHHHh---c---CHHHHHHHHHHHHhhCCeEEEEEEeeCCCCee-EEEEEcc Confidence 666666766521 111 11 111222221 2 233455667889999999999988874 4654 6777889 Q ss_pred ceEEEEEcCCc---e-----EEEEE--ecCce----EEecHhHeeEecCcC----------------------------- Q lcl|NC_019710. 151 ANMDVKLVGKK---V-----VYRYQ--RDSEY----ADFSQKEIFHLKGFG----------------------------- 187 (424) Q Consensus 151 ~~v~~~~~~~~---~-----~~~~~--~~~~~----~~~~~~evih~r~~~----------------------------- 187 (424) ..+.+..+... . +|... .++.. ..+..+.+.|++... T Consensus 147 ~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 226 (471) T protein:vir:10 147 KEVIPIYSKSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSF 226 (471) T ss_pred cceEEEEcCCCCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccc Confidence 88887766432 1 11111 11111 123444444443110 Q ss_pred ---C---------CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcc Q lcl|NC_019710. 188 ---F---------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPV 255 (424) Q Consensus 188 ---~---------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ 255 (424) + +...|.|-+......++....+..-..+.+...+.|-.+++-......++... ... T Consensus 227 ~~~~g~iPvv~~~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~-------~~~---- 295 (471) T protein:vir:10 227 KHDFGLVPFIPFKNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFLE-------DLK---- 295 (471) T ss_pred cCCCCceeEEEeccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhHH-------Hhh---- Confidence 0 12247777877777777777666666666666677766665432222222111 111 Q ss_pred cCcceecC-------CCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHH---------- Q lcl|NC_019710. 
256 KKRLWILE-------AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ---------- 318 (424) Q Consensus 256 ag~~~~l~-------~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~---------- 318 (424) .++.+.++ ++++|..-..+ ...+....+...+.|...-++|..-.... ++.+....+-. T Consensus 296 ~~~~i~~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~l~~~I~~~s~tp~~~~~~~--gn~Sg~Alk~~~~~l~~k~~~ 371 (471) T protein:vir:10 296 RYKMIKMDNDGMGDQSGVTTIAIDIP--TEARNLILERTKKQIFISGQGVNPETDKL--GNSSGVALKFLYSLLELKAGN 371 (471) T ss_pred cCCeEEecCCCCccCccceEEeecCC--hHHHHHHHHHHHHHHHHHhCCcCCCcccc--cCccHHHHHHHHHHHHHHHHH Confidence 11222221 23444443332 34456677788888888888885322221 22222112111 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc Q lcl|NC_019710. 319 NLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 319 ~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~gg 398 (424) ....+...+.-.++.|...+ ...+. ..+.+.+...+..|..+.++.+.++ .|+++.--+.++++.-..+. T Consensus 372 ~~~~~~~~l~~~~~li~~~~-----~~~d~--~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~~p~v~D~~- 441 (471) T protein:vir:10 372 METQFRSGYATLVKMILKHL-----GLSDK--LKIKQTWTRNSINNDTEMAQVVSTL--ATITSRENVAKSNPIVEDWQ- 441 (471) T ss_pred HHHHHHHHHHHHHHHHHHHh-----ccCCC--ceeEEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCCHH- Confidence 11122222222222222221 22222 3345555677788889999998887 58899888887775532110 Q ss_pred Ceeeecccccchhh--c--cccCCCccCCC Q lcl|NC_019710. 399 DVAMRQSQYVPITD--L--GTNKEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~--~--~~~~~~~~~g~ 424 (424) . .+..+.. . .++.....++. 
T Consensus 442 -~-----E~eri~~E~~~~~~~~~~~~~~~ 465 (471) T protein:vir:10 442 -D-----ELRLQKAEQEGRSEKLYDMEEVE 465 (471) T ss_pred -H-----HHHHHHHHHHHHHhcccccCCCC Confidence 0 0001100 0 00001111111 No 219 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=94.86 E-value=0.0033 Score=34.17 Aligned_cols=396 Identities=10% Similarity=0.037 Sum_probs=174.1 Q ss_pred CCCCCcccccCCCccHHHH-HHhhccCcccccc---------cccccccccccccccCC---ccccHHHHhhhHHHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWAR-LKSWFVGGRLVTP---------NQGSQTGPVSAHGYLGD---SSINDERILQISTVWRCV 67 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~-~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~v~~~i 67 (424) |--|--.|.+-.--+++.. +...+........ ...............+. ....+..-+.+....-.| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~Iv 80 (537) T protein:vir:78 1 MTSPLLNKPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTELV 80 (537) T ss_pred CCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHHH Confidence 5555555554333344433 2222211110000 00000000000000000 000000011233344456 Q ss_pred HHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEE Q lcl|NC_019710. 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) Q Consensus 68 ~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~ 147 (424) +..+.=+-+-|+++-- .+++. ..+...|...-+ .........+..++..+|.||.++-.+.+|.+ .+.. 
T Consensus 81 d~~~~yl~G~Pv~~~~-~d~~~-------~e~~~~l~~~~~--~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~-~~~~ 149 (537) T protein:vir:78 81 DQLAQYLLSNGVEVKV-KDEDN-------TQLDEILQEYFD--EDFQATIDTLVTNASKKGFEGIFARTTSEGKL-KFQT 149 (537) T ss_pred HHHhhhhcccCceeec-Ccchh-------HHHHHHHHHHhh--ccHHHHHHHHHHHHhhcCeeEEEeeecCCCce-EEEE Confidence 6666666667776531 11111 123333432211 12334556778899999999999988888866 4677 Q ss_pred eccceEEEEEcCCce------EEEEE-ec-----C----ceEEecHhHeeEecCcC------------------------ Q lcl|NC_019710. 148 LQSANMDVKLVGKKV------VYRYQ-RD-----S----EYADFSQKEIFHLKGFG------------------------ 187 (424) Q Consensus 148 l~p~~v~~~~~~~~~------~~~~~-~~-----~----~~~~~~~~evih~r~~~------------------------ 187 (424) ++|..+.+..++... +|... .. . ....++++.|.+++... T Consensus 150 i~p~~~~pv~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 229 (537) T protein:vir:78 150 VDGLTLIPVFDDYGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAI 229 (537) T ss_pred EccceeEEEEcCCCCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeec Confidence 888888776664331 11100 00 0 01124444444432100 Q ss_pred --------------------C---------CCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHH Q lcl|NC_019710. 188 --------------------F---------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ 238 (424) Q Consensus 188 --------------------~---------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~ 238 (424) + +...|.|-+......++....+..-.++.+..-+.|-.+++--.....++ T Consensus 230 ~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~ 309 (537) T protein:vir:78 230 EESTDADFEDTDGYQVLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDK 309 (537) T ss_pred cccccccccccccccccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCccchh Confidence 0 12247788888888888777777777777777666766665322211222 Q ss_pred HHHHHHHHHHHHhCCcccCcceecC-CC--ceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccH Q lcl|NC_019710. 
239 QRSQVEENFKEIAGGPVKKRLWILE-AG--FSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGI 315 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~ag~~~~l~-~g--~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~ 315 (424) .... ... .+++.++ .+ ++|..... .+.......+...+.|...-.+|. ......|+.|.... T Consensus 310 ~~~~----l~~-------~~~i~v~~d~~~v~~l~~~~--~~~~~e~~ld~L~~~I~~~s~~~~--~~~~~~gn~SGvAl 374 (537) T protein:vir:78 310 LRQN----IKA-------KKMIGVNGDNAGMEIQTVSI--PYEARKAKMDIDVENIYRSGMGFN--STAVGDGNVTNVVI 374 (537) T ss_pred HHHH----Hhh-------cCceeecCCCCceeEEEecC--CHHHHHHHHHHHHHHHHHhcCCCC--CccccccCCcHHHH Confidence 2211 111 1233332 23 44443333 223234445555666655443432 12212222221111 Q ss_pred H----------HHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHH Q lcl|NC_019710. 316 E----------QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINE 385 (424) Q Consensus 316 e----------~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE 385 (424) . ......+...+.-.++.|...++.+-.... ....+.+.+..-+..|..+.++.+.++++.|+++..- T Consensus 375 k~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~~~~--d~~~i~i~f~~~~P~n~~e~a~~~~~l~~~giiS~eT 452 (537) T protein:vir:78 375 KSRYTLLAMKARKMETSLRKVLRWCADMVVSDIALRGLGEY--DSNDICFEIEPHVLANELDIATTRKTEAETEALKIGN 452 (537) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccc--ccceeeEEeccCCCCCHHHHHHHHHHHHhcCcchHHH Confidence 1 122233444455555555554432211111 2234555556667888999999999999999999887 Q ss_pred HHHHhCCCCCCC--------------------cCeeeecccccc----hhh-cccc--C---CCccCCC Q lcl|NC_019710. 386 MRRTDNLPPLPG--------------------GDVAMRQSQYVP----ITD-LGTN--K---EPRNNGA 424 (424) Q Consensus 386 ~R~~lg~~p~~g--------------------gd~~~~~~n~~~----~~~-~~~~--~---~~~~~g~ 424 (424) +.+.+++-..+. .+.-.......| ... 
...+ + .+.+..| T Consensus 453 ~l~~~p~vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ 521 (537) T protein:vir:78 453 IMTVAPRIGDDETLKLIAEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQPVA 521 (537) T ss_pred HHHhCCCCCCHHHHHHHHHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccCCCC Confidence 776665421110 000000000000 000 0000 0 0011111 No 220 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=94.51 E-value=0.0042 Score=33.61 Aligned_cols=400 Identities=14% Similarity=0.144 Sum_probs=172.0 Q ss_pred HHHHHhhcc--------Cccccccccccccccccccccc-------CCcc------ccHHHHhhhHHHHHHHHHHHHhhh Q lcl|NC_019710. 17 WARLKSWFV--------GGRLVTPNQGSQTGPVSAHGYL-------GDSS------INDERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 17 ~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~-------~~~~------~~~~~~~~~~~v~~~i~~ia~~ia 75 (424) +..++++-- +.+.+.+.......++...+.. +... -..+..+.+|.|..||+.|.+.+. T Consensus 1 m~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~i~~~~~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneai 80 (533) T protein:vir:10 1 MSQLFGFSLERAKKAPKGPSFVQKDNLDGSQPVSGGGYYGYTVDFDGQVRNEYQLISRYREMVLQPECDSAVDDIVNETI 80 (533) T ss_pred CccccccccccccccccCCCCCCCCcccccceeecccccceeeecccccchHHHHHHHHHHHhhccchhhHHHHhhccee Confidence 222222211 1111111111111111111111 1111 112345668999999999999876 Q ss_pred hC-----ceeEeecc-ccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC---CceeEEE Q lcl|NC_019710. 76 CL-----PLDVFETD-QNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA---GDVISLL 146 (424) Q Consensus 76 ~~-----~~~~~~~~-~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~---G~~~~l~ 146 (424) -+ |+.+--.+ +-.+.-.........++|+ =-+....+ +.+++.|...|..|..++-+.+ ..+.+|. 
T Consensus 81 v~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~-ll~F~~~~----~e~fR~WYVDgRi~fHkiid~~~pk~GI~ELr 155 (533) T protein:vir:10 81 CGNFDDVPVSVELSNLKVSDKIKKLIREEFGEILR-LLDFENRS----YEIFRRWYVDGRLFYHKVIDPDNPQGGLIELR 155 (533) T ss_pred eecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhhhhcceEEEEEEecCCCccccceeee Confidence 43 22221111 0000000000111222222 11112222 4557778899999988765533 4689999 Q ss_pred EeccceEEEEEc-----CCc---------------eEEEEEec------CceEEecHhHeeEecC--cCCCCccccchHH Q lcl|NC_019710. 147 PLQSANMDVKLV-----GKK---------------VVYRYQRD------SEYADFSQKEIFHLKG--FGFTGLVGLSPIA 198 (424) Q Consensus 147 ~l~p~~v~~~~~-----~~~---------------~~~~~~~~------~~~~~~~~~evih~r~--~~~~~~~G~s~~~ 198 (424) .|+|.+|+..+. .++ -+|.|... +....++.+-|..... ...++-.=+|-|. T Consensus 156 ~lDPr~i~~vr~i~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~g~~~~~~~~vkI~~dAI~y~hSGl~d~~~~~i~syLh 235 (533) T protein:vir:10 156 YIDPRKIRKINETEQKRPEQLRGLPLNQQLSPKSAEYFLYDPKGLKNSTTQGLKIAPDSICYVHSGIMDLNKNMTLSHLH 235 (533) T ss_pred eccccceeeeeeeeccCCCccceeecchhhhccceeeeeeccccccccCCCceecchhheeeeeccceeCCCCceeccch Confidence 999999986321 111 11222211 2345566654443331 2233333467788 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC-HHHHHHHHHHHHHHhCC--------c--ccCcce-ec---- Q lcl|NC_019710. 199 FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT-EQQRSQVEENFKEIAGG--------P--VKKRLW-IL---- 262 (424) Q Consensus 199 ~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~-~~~~~~~~~~~~~~~~~--------~--~ag~~~-~l---- 262 (424) .+.+.+....-++...--+--.-+.-+-|...+-+... ..+.+.++....++... . 
+..+.+ .+ T Consensus 236 kAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyW 315 (533) T protein:vir:10 236 KAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFW 315 (533) T ss_pred HhHHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhc Confidence 88888877777777666555555555556665555443 45555666666655321 0 111111 11 Q ss_pred ------CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHH------HHHHHHHHHH Q lcl|NC_019710. 263 ------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL------GFLQYTLQPY 330 (424) Q Consensus 263 ------~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~------~f~~~tl~P~ 330 (424) ..|.+++.|.-. +.+.-++-..+..+.+..+++||.+-|....+-+. ..+.+..+ .|+..-=.-+ T Consensus 316 LPRReGgrgTEItTLpGg-qnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~f~~--Gr~~EItRDEiKF~KFI~RLR~rF 392 (533) T protein:vir:10 316 LPRREGGRGTEITTLPGG-QNLGELEDVKYFQKKLYKSLNVPGSRLETETTFNV--GRAAEITRDEVKFQKFVARLRKRF 392 (533) T ss_pred ccccCCCCccceeecccc-CCcChHHHHHHHHHHHHHHhCCCccccCCCCcccc--cccchhhHHHHHHHHHHHHHHHHH Confidence 124566665431 22333444567788999999999998865433222 12222211 2222212223 Q ss_pred HHHHHHHHhhhccC-----hhhhc----cceeeecchhhhcc--C---HHHHHHHHHHH--HhCCCcCHHHHHH-HhCCC Q lcl|NC_019710. 331 ISRWENSIQRWLIP-----AKDVG----RIHAEHNLDGLLRG--D---SASRAAFMKAM--GESGLRTINEMRR-TDNLP 393 (424) Q Consensus 331 ~~~ie~~l~~~L~~-----~~~~~----~~~~~f~~~~~~~~--d---~~~~~~~~~~~--~~~g~~t~NE~R~-~lg~~ 393 (424) ...+.+-|..+|+. +.++. ...+.|..|+.... + ...|...++.+ +-+-.++.+=+|+ .|.+. 
T Consensus 393 s~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~t 472 (533) T protein:vir:10 393 SELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQT 472 (533) T ss_pred HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccC Confidence 34444455555442 23332 24444544433221 1 12344444333 1112345544443 22221 Q ss_pred CC----------CCcCeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 394 PL----------PGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 394 p~----------~ggd~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) -. ........+--..+.+......+|+-+|+ T Consensus 473 Deei~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~ 513 (533) T protein:vir:10 473 DVEMKEIDKQIESEMESGIIADPAAEMDPAMAAGDPDAGGA 513 (533) T ss_pred HHHHHHHHHHHHHHHhCCCCCCCcchhhHHhcCCCCCcCCc Confidence 10 00001111100011111111112222222 No 221 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=94.03 E-value=0.0056 Score=32.92 Aligned_cols=404 Identities=10% Similarity=0.092 Sum_probs=184.3 Q ss_pred cccCCCccHHHHHHhh-------ccCccccccccccccccc-------cccc----cc--CCcc-------ccHHHHhhh Q lcl|NC_019710. 8 IDLRTNNGWWARLKSW-------FVGGRLVTPNQGSQTGPV-------SAHG----YL--GDSS-------INDERILQI 60 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~-------~~~~~~~~~~~~~~~~~~-------~~~~----~~--~~~~-------~~~~~~~~~ 60 (424) |+|-.=-|||.+.... -...+.+.|.....+..+ ...+ +. .+.. 
-..+..+.+ T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINN 80 (516) T ss_pred CCchHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhc Confidence 6555546776654442 122333333221111000 0000 01 1111 112345668 Q ss_pred HHHHHHHHHHHHhhhhC-----ceeEeecccc-CccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEE Q lcl|NC_019710. 61 STVWRCVSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV 134 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~-----~~~~~~~~~~-~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~ 134 (424) |.|..||+.|.+.+.-+ |+.+--.+.+ ...-.........++|+- -|....+ +.+++.|...|..|..+ T Consensus 81 pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~l-l~F~~~~----~~~fR~WYVDgRi~fhK 155 (516) T protein:vir:10 81 PEVERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRL-LDASRKL----DTLFRRWYVDSRIFFHK 155 (516) T ss_pred cchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHH-hccchhh----hHHHhhhhhcceEEEEE Confidence 99999999999986643 2322111100 000000001122222221 1112222 44577788899999875 Q ss_pred ee-CCCCceeEEEEeccceEEEEEc-----CCc--------eEEEEEecC-------------ceEEecHhHeeEecC-- Q lcl|NC_019710. 135 DR-NSAGDVISLLPLQSANMDVKLV-----GKK--------VVYRYQRDS-------------EYADFSQKEIFHLKG-- 185 (424) Q Consensus 135 ~r-~~~G~~~~l~~l~p~~v~~~~~-----~~~--------~~~~~~~~~-------------~~~~~~~~evih~r~-- 185 (424) +. +....+.+|..|+|.+++..+. .++ .+|.|..+. ....++.+-|.|... 
T Consensus 156 iid~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL 235 (516) T protein:vir:10 156 IMPNPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYASSGL 235 (516) T ss_pred EecCccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhheeeecccc Confidence 54 3345689999999999985432 111 123332211 234455555555441 Q ss_pred cCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC-HHHHHHHHHHHHHHhCC----cccC--- Q lcl|NC_019710. 186 FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT-EQQRSQVEENFKEIAGG----PVKK--- 257 (424) Q Consensus 186 ~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~-~~~~~~~~~~~~~~~~~----~~ag--- 257 (424) ...++-.=+|-|..+.+.+....-++...--+--.-+.-+-|...+-+..+ ..+.+-++....++... .+.| T Consensus 236 ~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ 315 (516) T protein:vir:10 236 MDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVK 315 (516) T ss_pred eeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeec Confidence 233333337778888888887777777766555555555556665555443 45555566666555321 0111 Q ss_pred ---ccee-c----------CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHH-- Q lcl|NC_019710. 258 ---RLWI-L----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG-- 321 (424) Q Consensus 258 ---~~~~-l----------~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~-- 321 (424) +.+. + ..|.+++.|.-. +.+.-++-..+..+.+..+++||.+-|....+.+.....+.+..+. 
T Consensus 316 ddrk~msMlEDyWLpRReGgrgTEItTLpGg-qnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEi 394 (516) T protein:vir:10 316 NQKRNLSMTEDYWLMRRDGKSVTEVSSLPGA-QTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDEL 394 (516) T ss_pred cchhhhhhHhhhcccccCCCCccceeecccc-CCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHH Confidence 1111 1 124566665431 2232334456678899999999999887654433311122222221 Q ss_pred ----HHHHHHHHHHHHHHHHHhhhccC-----hhhhc----cceeeecchhhhcc--C---HHHHHHHHHHHH--hCCCc Q lcl|NC_019710. 322 ----FLQYTLQPYISRWENSIQRWLIP-----AKDVG----RIHAEHNLDGLLRG--D---SASRAAFMKAMG--ESGLR 381 (424) Q Consensus 322 ----f~~~tl~P~~~~ie~~l~~~L~~-----~~~~~----~~~~~f~~~~~~~~--d---~~~~~~~~~~~~--~~g~~ 381 (424) |+..-=.-+...+.+-|..+|+. +.++. ...+.|..|+.... + ...|...++.+- -+.++ T Consensus 395 KF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~ 474 (516) T protein:vir:10 395 DFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYV 474 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 22221222334444455555442 33332 23444544433221 1 123444444432 23577 Q ss_pred CHHHHHH-HhCCCCCC--CcCeeeecccccchhhcccc--CCCccCCC Q lcl|NC_019710. 382 TINEMRR-TDNLPPLP--GGDVAMRQSQYVPITDLGTN--KEPRNNGA 424 (424) Q Consensus 382 t~NE~R~-~lg~~p~~--ggd~~~~~~n~~~~~~~~~~--~~~~~~g~ 424 (424) +.+=+|+ .|.+.-.+ .-++.+ -+...+. 
+.|++..- T Consensus 475 s~~yi~k~ILr~tDeei~~e~k~I-------~~E~~~~~~~~p~~~~~ 515 (516) T protein:vir:10 475 SHDYVMKNILQMTEEQIAQEEKQI-------EQEAGIKRFQNPENEDD 515 (516) T ss_pred chHHHHHHHhcCCHhhHHHHHHHH-------HHhhhCCCCCCCCcccc Confidence 7777765 45554211 000000 0000000 11111111 No 222 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=94.03 E-value=0.0056 Score=32.92 Aligned_cols=404 Identities=10% Similarity=0.092 Sum_probs=184.3 Q ss_pred cccCCCccHHHHHHhh-------ccCccccccccccccccc-------cccc----cc--CCcc-------ccHHHHhhh Q lcl|NC_019710. 8 IDLRTNNGWWARLKSW-------FVGGRLVTPNQGSQTGPV-------SAHG----YL--GDSS-------INDERILQI 60 (424) Q Consensus 8 ~~~~~~~G~~~~~~~~-------~~~~~~~~~~~~~~~~~~-------~~~~----~~--~~~~-------~~~~~~~~~ 60 (424) |+|-.=-|||.+.... -...+.+.|.....+..+ ...+ +. .+.. -..+..+.+ T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINN 80 (516) T ss_pred CCchHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhc Confidence 6555546776654442 122333333221111000 0000 01 1111 112345668 Q ss_pred HHHHHHHHHHHHhhhhC-----ceeEeecccc-CccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEE Q lcl|NC_019710. 
61 STVWRCVSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV 134 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~-----~~~~~~~~~~-~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~ 134 (424) |.|..||+.|.+.+.-+ |+.+--.+.+ ...-.........++|+- -|....+ +.+++.|...|..|..+ T Consensus 81 pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~l-l~F~~~~----~~~fR~WYVDgRi~fhK 155 (516) T protein:vir:10 81 PEVERAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRL-LDASRKL----DTLFRRWYVDSRIFFHK 155 (516) T ss_pred cchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHH-hccchhh----hHHHhhhhhcceEEEEE Confidence 99999999999986643 2322111100 000000001122222221 1112222 44577788899999875 Q ss_pred ee-CCCCceeEEEEeccceEEEEEc-----CCc--------eEEEEEecC-------------ceEEecHhHeeEecC-- Q lcl|NC_019710. 135 DR-NSAGDVISLLPLQSANMDVKLV-----GKK--------VVYRYQRDS-------------EYADFSQKEIFHLKG-- 185 (424) Q Consensus 135 ~r-~~~G~~~~l~~l~p~~v~~~~~-----~~~--------~~~~~~~~~-------------~~~~~~~~evih~r~-- 185 (424) +. +....+.+|..|+|.+++..+. .++ .+|.|..+. ....++.+-|.|... T Consensus 156 iid~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL 235 (516) T protein:vir:10 156 IMPNPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYASSGL 235 (516) T ss_pred EecCccccceeeeeeCCcceeeEeeecccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhheeeecccc Confidence 54 3345689999999999985432 111 123332211 234455555555441 Q ss_pred cCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC-HHHHHHHHHHHHHHhCC----cccC--- Q lcl|NC_019710. 186 FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT-EQQRSQVEENFKEIAGG----PVKK--- 257 (424) Q Consensus 186 ~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~-~~~~~~~~~~~~~~~~~----~~ag--- 257 (424) ...++-.=+|-|..+.+.+....-++...--+--.-+.-+-|...+-+..+ ..+.+-++....++... 
.+.| T Consensus 236 ~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ 315 (516) T protein:vir:10 236 MDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVK 315 (516) T ss_pred eeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeec Confidence 233333337778888888887777777766555555555556665555443 45555566666555321 0111 Q ss_pred ---ccee-c----------CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHHHH-- Q lcl|NC_019710. 258 ---RLWI-L----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLG-- 321 (424) Q Consensus 258 ---~~~~-l----------~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~~~-- 321 (424) +.+. + ..|.+++.|.-. +.+.-++-..+..+.+..+++||.+-|....+.+.....+.+..+. T Consensus 316 ddrk~msMlEDyWLpRReGgrgTEItTLpGg-qnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEi 394 (516) T protein:vir:10 316 NQKRNLSMTEDYWLMRRDGKSVTEVSSLPGA-QTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDEL 394 (516) T ss_pred cchhhhhhHhhhcccccCCCCccceeecccc-CCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHH Confidence 1111 1 124566665431 2232334456678899999999999887654433311122222221 Q ss_pred ----HHHHHHHHHHHHHHHHHhhhccC-----hhhhc----cceeeecchhhhcc--C---HHHHHHHHHHHH--hCCCc Q lcl|NC_019710. 322 ----FLQYTLQPYISRWENSIQRWLIP-----AKDVG----RIHAEHNLDGLLRG--D---SASRAAFMKAMG--ESGLR 381 (424) Q Consensus 322 ----f~~~tl~P~~~~ie~~l~~~L~~-----~~~~~----~~~~~f~~~~~~~~--d---~~~~~~~~~~~~--~~g~~ 381 (424) |+..-=.-+...+.+-|..+|+. +.++. ...+.|..|+.... 
+ ...|...++.+- -+.++ T Consensus 395 KF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~ 474 (516) T protein:vir:10 395 DFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYV 474 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 22221222334444455555442 33332 23444544433221 1 123444444432 23577 Q ss_pred CHHHHHH-HhCCCCCC--CcCeeeecccccchhhcccc--CCCccCCC Q lcl|NC_019710. 382 TINEMRR-TDNLPPLP--GGDVAMRQSQYVPITDLGTN--KEPRNNGA 424 (424) Q Consensus 382 t~NE~R~-~lg~~p~~--ggd~~~~~~n~~~~~~~~~~--~~~~~~g~ 424 (424) +.+=+|+ .|.+.-.+ .-++.+ -+...+. +.|++..- T Consensus 475 s~~yi~k~ILr~tDeei~~e~k~I-------~~E~~~~~~~~p~~~~~ 515 (516) T protein:vir:10 475 SHDYVMKNILQMTEEQIAQEEKQI-------EQEAGIKRFQNPENEDD 515 (516) T ss_pred chHHHHHHHhcCCHhhHHHHHHHH-------HHhhhCCCCCCCCcccc Confidence 7777765 45554211 000000 0000000 11111111 No 223 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=93.55 E-value=0.0072 Score=32.34 Aligned_cols=405 Identities=10% Similarity=0.080 Sum_probs=182.3 Q ss_pred cccCCCccHHHHHHh-----hcc--Cccccccccccc-----cccc--ccccc-------cCCcc-----c-cHHHHhhh Q lcl|NC_019710. 8 IDLRTNNGWWARLKS-----WFV--GGRLVTPNQGSQ-----TGPV--SAHGY-------LGDSS-----I-NDERILQI 60 (424) Q Consensus 8 ~~~~~~~G~~~~~~~-----~~~--~~~~~~~~~~~~-----~~~~--~~~~~-------~~~~~-----~-~~~~~~~~ 60 (424) |+|-.=-|||.+.-. ... ..+.+.|..... .+.. ...+. .+... 
+ ..+..+.+ T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~~~~~~LI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDNNISGTKDLINTYRQLTNN 80 (516) T ss_pred CCchHhcccccchhhHHHHhhhcCCCCcccCCCCccCceeeecCcccccccceeeeeecccCccccHHHHHHHHHHhhhc Confidence 666555677755333 222 223333321110 0000 00010 01110 1 23445668 Q ss_pred HHHHHHHHHHHHhhhhC-----ceeEeecccc-CccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEE Q lcl|NC_019710. 61 STVWRCVSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALV 134 (424) Q Consensus 61 ~~v~~~i~~ia~~ia~~-----~~~~~~~~~~-~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~ 134 (424) |.|..||+.|.+.+.-+ |+.+--.+.+ ...-.........++|+ =-+....+ +.++..|...|..|..+ T Consensus 81 pEvd~Av~eIvneaiv~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~-ll~F~~~~----~~~fR~WYVDgRi~fhK 155 (516) T protein:vir:10 81 PEVERAVANIVNEAVVYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICR-LLDASRKL----DTLFRRWYIDSRIFFHK 155 (516) T ss_pred cchhHHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhh----hHHHHhhhhcceEEEEE Confidence 99999999999987643 3333100000 00000000111222222 11112222 34577788899998875 Q ss_pred ee-CCCCceeEEEEeccceEEEEEc-----CCc--------eEEEEEecCc-------------eEEecHhHeeEecCc- Q lcl|NC_019710. 135 DR-NSAGDVISLLPLQSANMDVKLV-----GKK--------VVYRYQRDSE-------------YADFSQKEIFHLKGF- 186 (424) Q Consensus 135 ~r-~~~G~~~~l~~l~p~~v~~~~~-----~~~--------~~~~~~~~~~-------------~~~~~~~evih~r~~- 186 (424) +. +....+.+|..|+|.+++..+. .++ .+|.|..+.. ...++.+- |++-|. 
T Consensus 156 iid~~k~GI~elr~lDPr~i~~vR~i~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~da-I~y~hSG 234 (516) T protein:vir:10 156 IMPNPKEGIVELRRLDPRHVEYYREIVTSDVGGTSVVKGYREFFVYTTGNEGYAYNGRLFEPNTRIKIPRSA-IVYAHSG 234 (516) T ss_pred EecCcccceeeeeeeCCcceeeEEeeecccCcchhhhhceeeeeeeecCccceeccccccCCCCceecchhh-eeeeecC Confidence 54 3345689999999999985432 111 1233322211 23344443 444332 Q ss_pred --CCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC-HHHHHHHHHHHHHHhCC----cccC-- Q lcl|NC_019710. 187 --GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT-EQQRSQVEENFKEIAGG----PVKK-- 257 (424) Q Consensus 187 --~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~-~~~~~~~~~~~~~~~~~----~~ag-- 257 (424) ..++-.=+|-|..+.+.+....-++...--+--.-+.-+-|...+-+..+ ..+.+-++....++... .+.| T Consensus 235 l~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TGev 314 (516) T protein:vir:10 235 LQDCSDRGIVGYLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTGTV 314 (516) T ss_pred cccCCCCceeceehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCee Confidence 22322236778888888777777777666555555555556665555443 45555566666555321 0111 Q ss_pred ----ccee-c----------CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcc-cccHHHHHH- Q lcl|NC_019710. 258 ----RLWI-L----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSW-GSGIEQQNL- 320 (424) Q Consensus 258 ----~~~~-l----------~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~-~~n~e~~~~- 320 (424) +.+. + ..|.+++.|.-. +.+.-++-..+..+.+..+++||.+-|....+.+.. ..++|=.+. 
T Consensus 315 ~ddrk~msMlEDyWLpRReGgrgTEItTLpGg-qnlgem~DV~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDE 393 (516) T protein:vir:10 315 KNQKRNLSMTEDYWLMRRDGKSVTEVTSLPGA-QTMGEMDDVRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDE 393 (516) T ss_pred ccchhhhhhHhhhcccccCCCcccceeecccc-CCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHH Confidence 1221 1 125566666431 223233445667889999999999988765443331 122221111 Q ss_pred ----HHHHHHHHHHHHHHHHHHhhhcc-----Chhhhc----cceeeecchhhhcc--C---HHHHHHHHHHHH--hCCC Q lcl|NC_019710. 321 ----GFLQYTLQPYISRWENSIQRWLI-----PAKDVG----RIHAEHNLDGLLRG--D---SASRAAFMKAMG--ESGL 380 (424) Q Consensus 321 ----~f~~~tl~P~~~~ie~~l~~~L~-----~~~~~~----~~~~~f~~~~~~~~--d---~~~~~~~~~~~~--~~g~ 380 (424) -|+..-=.-+...+.+.|..+|+ ++.++. ...+.|..|..... + ...|...++.+- -+.+ T Consensus 394 iKF~KFI~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky 473 (516) T protein:vir:10 394 LDFRKFIVQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKY 473 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 22222222234455566666554 333332 23444444433221 1 123444444432 2357 Q ss_pred cCHHHHHH-HhCCCCCC--CcCeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 
381 RTINEMRR-TDNLPPLP--GGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 381 ~t~NE~R~-~lg~~p~~--ggd~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) ++.+=+|+ .|.+.-.+ .-++.+ ..-.+.+-.++|.+..- T Consensus 474 ~s~~yi~k~ILr~tDeei~~~~k~I-----~~E~~~~~~~~p~~e~~ 515 (516) T protein:vir:10 474 VSHDYVMKNILQMTDEQIAQEEKQI-----EKEANVKRFQNPENEDD 515 (516) T ss_pred cchHHHHHHHhcCCHhHHHHHHHHH-----HHhhhCCCCCCCCcccc Confidence 77777765 45554211 000000 00000000011111111 No 224 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=89.08 E-value=0.028 Score=29.06 Aligned_cols=395 Identities=12% Similarity=0.059 Sum_probs=153.7 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCccccccccccccccccc---------------ccccCCc---------c----- Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSA---------------HGYLGDS---------S----- 51 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~---------~----- 51 (424) |+.||+|++-+-++-.+. -..++..|.-+.+...++. ....+|. + T Consensus 1 ~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~m~dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~ 74 (535) T protein:vir:80 1 MARKRTTIRRDVQSKVLI------PPQAPPTSGLGPSLPNVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPS 74 (535) T ss_pred CCcchhhhhhhhhhhccc------CCCCcCCCCCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCC Confidence 999999876544321110 0111111100000000000 0000111 0 Q ss_pred ---c---c---HHHHhhhHHHHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHH Q lcl|NC_019710. 52 ---I---N---DERILQISTVWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTM 122 (424) Q Consensus 52 ---~---~---~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~ 122 (424) . + .+.+++-+..+.....+.++++++.|+. .. .......+..++..-=-...+-.+|.+.++. 
T Consensus 75 ~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vfrk---~p-----~~~~p~~l~~l~~d~D~~G~~L~~f~~~~~~ 146 (535) T protein:vir:80 75 VDSRDEEQRRRYETYLQRAIFYNVTARTLDGMMGQVFSR---DP-----IRQLPPALEAIVEDIDGEGVSLDQQAKKALG 146 (535) T ss_pred cccCCcCCHHHHHHHHhhccCCChhHHHHHHHhchhhcC---Cc-----ceeccHHHHHHHhccCCCCCCHHHHHHHHHH Confidence 0 0 2233444444445555555555555532 10 0111233555454333345678899999999 Q ss_pred HHHHcCCeEEEEeeCCCCce------------eEEEEeccceEE----------------------EEEcCC---ceE-- Q lcl|NC_019710. 123 QLCFYGNAYALVDRNSAGDV------------ISLLPLQSANMD----------------------VKLVGK---KVV-- 163 (424) Q Consensus 123 ~~l~~G~a~~~~~r~~~G~~------------~~l~~l~p~~v~----------------------~~~~~~---~~~-- 163 (424) ..+.+|-+++++.....|.. --+..+.|..|. ...+++ ... T Consensus 147 ~~l~~G~~~iLVD~P~~~~~~t~ade~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q 226 (535) T protein:vir:80 147 YTMGFGRAAIFTDYPNVGRPVTVLEQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQ 226 (535) T ss_pred HHHhcCeEEEEEeecCCCCcccHHHHHhcCCCcEEEEechhhccCccccccCCccceeEEEEEEEEEecCCCcccceeEE Confidence 99999999999976554431 112222222211 111211 000 Q ss_pred E-----------E---EEecCc-------eEEecHh------HeeEec---CcCCCCccccchHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 164 Y-----------R---YQRDSE-------YADFSQK------EIFHLK---GFGFTGLVGLSPIAFACKSAGVAVAMEDQ 213 (424) Q Consensus 164 ~-----------~---~~~~~~-------~~~~~~~------evih~r---~~~~~~~~G~s~~~~~~~~i~~~~~~~~~ 213 (424) | . |...+. ...++.+ ..|=|- ..+.+...|.+|+..+...-...-....- T Consensus 227 ~RvL~~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd 306 (535) T protein:vir:80 227 WRVLQLNAEGNYQVERWRRETQEEMYYSYSKHVPTDGNGNPFKEIPFQFIGPLDNNADIDHPPLLDLCEVNIGHYRNSAD 306 (535) T ss_pred EEEEEecCCceEEEEEEEeecCCccccccceeecccCCCcccCeeEEEEeecCCCCCCCCccchHHHHHHHHHHhhchhH Confidence 1 0 111000 0011111 111111 11223346788887666553322222222 Q ss_pred HHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 
214 QRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELAR 293 (424) Q Consensus 214 ~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~ 293 (424) ....+...+.|-.+++-......+.. .+ -....-|. ...+.++.+.++.-+..++.-+. .+..+-...+++. T Consensus 307 ~~~il~~~~~P~l~i~G~~~~~~~~~----~~-~~~i~iG~--~~~~~lP~~~~~~~~e~~~~~~a-~~~l~~~e~qM~~ 378 (535) T protein:vir:80 307 YEEMAFVAGQPTAFFTGLTKDWVEDV----FK-DFKVHLGS--RAIIPLPQGATAGILQITPNSVP-FEAMTHKESQMIA 378 (535) T ss_pred HHHHHHHhcCceeeeecCchhhhhcC----CC-CcceEecC--cccccCCCCCCcceeeeccchhH-HHHHHHHHHHHHH Confidence 44445566677777763222111100 00 00011122 23566766654443333333222 1222222232222 Q ss_pred HhCCCHHHcCCCCCCCcccccHHHHHHHH--HHHHHHHHHHHHHHHHhhhccC--hh-----hhccceeeecchhhhccC Q lcl|NC_019710. 294 FFGVPPHLVGDVEKSTSWGSGIEQQNLGF--LQYTLQPYISRWENSIQRWLIP--AK-----DVGRIHAEHNLDGLLRGD 364 (424) Q Consensus 294 ~fgVP~~~l~~~~~~~~~~~n~e~~~~~f--~~~tl~P~~~~ie~~l~~~L~~--~~-----~~~~~~~~f~~~~~~~~d 364 (424) .|. .++... .++. ++++....+ -...|.-++.++++.|+.-|-- .. ......+..+.+-..... T Consensus 379 -lGa--~ll~~~-~~~~---Ta~~a~~~~~~~~S~L~~~a~~le~al~~aL~~~A~w~G~~~~~~~~~i~~n~dF~~~~l 451 (535) T protein:vir:80 379 -MGA--NLLVKS-GGNR---TFGEAQQEEASEQSILSACTKNVSMAFRKALRWANQFQTGIVNDETVEYNLNTDFPAARL 451 (535) T ss_pred -HHH--HhhccC-cccc---cHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCceEEEeccccccccC Confidence 221 112111 1111 122222222 2344666777777777643321 11 111233444444444432 Q ss_pred HHHHHHHHHHHHhCCCcCHHHHHHHhCCC----C-CCC-----------cCeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 365 SASRAAFMKAMGESGLRTINEMRRTDNLP----P-LPG-----------GDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 365 ~~~~~~~~~~~~~~g~~t~NE~R~~lg~~----p-~~g-----------gd~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) .......+-++++.|.++....++.+..- | +++ .+.....+-.......+++..+.+||. 
T Consensus 452 d~~~~~all~~~~~G~Is~et~~~~L~r~gvl~~~~~~eee~~ri~~E~~~~~~~~g~~~d~~~~g~~~~~~~~~~ 527 (535) T protein:vir:80 452 TPNERAELILEWQQGAITFKEMRAGLRRAGVASEDDAKAETEGKATVEFIAKTAAAGKVGDAASGGTNKAKLNNGN 527 (535) T ss_pred CHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcccchHHHHHHHHhhhhhccccCCCCCCCCCCCCCcCcccCCc Confidence 23345556678888888888877765321 1 111 111111111111111222222222222 No 225 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=86.10 E-value=0.048 Score=27.81 Aligned_cols=403 Identities=9% Similarity=0.042 Sum_probs=182.2 Q ss_pred cCCC----ccHHHHHHh-----hcc--Ccccccccccccc-----c---cccccc----ccCCcc---------ccHHHH Q lcl|NC_019710. 10 LRTN----NGWWARLKS-----WFV--GGRLVTPNQGSQT-----G---PVSAHG----YLGDSS---------INDERI 57 (424) Q Consensus 10 ~~~~----~G~~~~~~~-----~~~--~~~~~~~~~~~~~-----~---~~~~~~----~~~~~~---------~~~~~~ 57 (424) |+.. -|+|.+.-. .+. ..+.+.|...... + +..+.+ ..+... -..+.. T Consensus 1 m~~~~l~lf~f~~k~~e~~~~~~~~~~~~s~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR~m 80 (521) T protein:vir:10 1 MNPIFLKLLQPWMKDDEKRVQSDLSDRIDSFAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQNTKDLINQYRSL 80 (521) T ss_pred CCcchhHHhhhhhhhhhhHHhhhhccCccccccccCCCCceeeccCCCccccccchhhhhhccccccchHHHHHHHHHHH Confidence 3333 244432211 111 1122222211100 0 000000 111111 112345 Q ss_pred hhhHHHHHHHHHHHHhhhhC-----ceeEeeccccCccc-cccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeE Q lcl|NC_019710. 58 LQISTVWRCVSLISTLTACL-----PLDVFETDQNDNRK-KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAY 131 (424) Q Consensus 58 ~~~~~v~~~i~~ia~~ia~~-----~~~~~~~~~~~~~~-~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~ 131 (424) +.+|.|..||+.|.+.+.-+ |+.+-=.+.+.+.. 
.........++|+ =-|....+ +.+++.|...|..| T Consensus 81 a~~pEvd~Av~eIvneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~----~~~fR~WYVDgRi~ 155 (521) T protein:vir:10 81 SKYHEVDNAIDEIINDAIVQEDNRDTVYLDLDKTDWNESVKEMVREEFRTILK-LLKFEREG----KRHFRRWYVDSRIY 155 (521) T ss_pred hhccchhhHHHhhhcceEEecCCCceEEEEecCcccchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhheeeeeEE Confidence 66899999999999987643 23332111111111 0001112222222 11112222 45577788999999 Q ss_pred EEEeeCC---CCceeEEEEeccceEEEEEc-----CCc--------eEEEEEe--------c---CceEEecHhHeeEec Q lcl|NC_019710. 132 ALVDRNS---AGDVISLLPLQSANMDVKLV-----GKK--------VVYRYQR--------D---SEYADFSQKEIFHLK 184 (424) Q Consensus 132 ~~~~r~~---~G~~~~l~~l~p~~v~~~~~-----~~~--------~~~~~~~--------~---~~~~~~~~~evih~r 184 (424) ..++-+. ...+.+|..|+|.+++..+. .++ -+|.|.. + +....++.+-|.|.. T Consensus 156 fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~~g~~~~~vkI~~daI~y~h 235 (521) T protein:vir:10 156 FHKMIDPARPKDGIKELRLLDPRNVEYYRVNLKSNENGNDVYKGVKEFFTYGATEDNRYNISGNSNNLVQIPIDAIVYSH 235 (521) T ss_pred EEEEeeCCCccccceeeeeeCCcceeeeeeecCCCCCcchhhccceeeeeeccCCCceecCCCCCCcceeechhheeeec Confidence 8876543 24689999999999975432 111 1233321 1 112457776666665 Q ss_pred --CcCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC-HHHHHHHHHHHHHHhCC-------- Q lcl|NC_019710. 185 --GFGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT-EQQRSQVEENFKEIAGG-------- 253 (424) Q Consensus 185 --~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~-~~~~~~~~~~~~~~~~~-------- 253 (424) ....++.+.+|-|..+.+.+....-++...--+--.-+.-+-|...+-+..+ ..+.+-++....++... 
T Consensus 236 SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYDa~TG 315 (521) T protein:vir:10 236 SGKVDIDGKTIVGYLHNVIKPANQLKMLEDAMVIYRITRAPERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYDSSTG 315 (521) T ss_pred ccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCc Confidence 2345667788999999998888887777776655555555566665555443 45555566665555321 Q ss_pred --cccCccee-c----------CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC-CCcccccHHHH- Q lcl|NC_019710. 254 --PVKKRLWI-L----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK-STSWGSGIEQQ- 318 (424) Q Consensus 254 --~~ag~~~~-l----------~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~-~~~~~~n~e~~- 318 (424) .+..+.+. + ..|.+++.|.-. +.+.-++-..+..+.+..+++||.+-|..... -+... .+|=. T Consensus 316 ev~ddrk~msMlEDyWLpRReGgrgTEI~TLpgg-qnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr-~~EItR 393 (521) T protein:vir:10 316 KVKNSSNNLAMTEDYWLMRRDGKATTEVSTLPGA-QSMGEMDDVRWFNRKLYESMKIPLSRLPQEGAGVTFGA-GNDITR 393 (521) T ss_pred eeccchhhhhhHhhhcccccCCCCccceeecccc-CCcChHHHHHHHHHHHHHHhCCCccccCCCCCceeccc-ccchhH Confidence 01111221 1 124566665431 22333444567788999999999998865421 12221 11211 Q ss_pred ----HHHHHHHHHHHHHHHHHHHHhhhccC-----hhhhc----cceeeecchhhhcc-----CHHHHHHHHHHH----H Q lcl|NC_019710. 319 ----NLGFLQYTLQPYISRWENSIQRWLIP-----AKDVG----RIHAEHNLDGLLRG-----DSASRAAFMKAM----G 376 (424) Q Consensus 319 ----~~~f~~~tl~P~~~~ie~~l~~~L~~-----~~~~~----~~~~~f~~~~~~~~-----d~~~~~~~~~~~----~ 376 (424) ..-|+..-=.-+...+.+-|..+|+. +.++. ...+.|..|..... -...|...++.+ + T Consensus 394 DEikF~KFI~rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~y 473 (521) T protein:vir:10 394 DELQFTKYIRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEV 473 (521) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccc Confidence 11222221222334444455555442 23332 23444544433221 123445555544 2 Q ss_pred hCCCcCHHHHHH-HhCCCCCC--CcCeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 
377 ESGLRTINEMRR-TDNLPPLP--GGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 377 ~~g~~t~NE~R~-~lg~~p~~--ggd~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) -+-+++.+=+|+ .|.+.-.+ .-++.+- .-.+.+-.++|+++-. T Consensus 474 vGky~s~dyi~k~ILr~tDeeik~~~k~I~-----~E~~~~~~~~p~~e~~ 519 (521) T protein:vir:10 474 TGKYLSHEYVMKNILRMSDEDIKTEREKID-----GELKDSVYKNPEDPME 519 (521) T ss_pred cccccchHHHHHHHhcCCHhHHHHHHHHHH-----HhhhCCCCCCCcchhh Confidence 223666666654 35554110 0000000 0000000011111111 No 226 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=85.47 E-value=0.053 Score=27.60 Aligned_cols=392 Identities=10% Similarity=0.047 Sum_probs=175.4 Q ss_pred CCCCCcccccCCCccHHHHHH-----hhc--cCcccccccccccc-----c--ccccccc----cCC-cc---------c Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLK-----SWF--VGGRLVTPNQGSQT-----G--PVSAHGY----LGD-SS---------I 52 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~-----~~~--~~~~~~~~~~~~~~-----~--~~~~~~~----~~~-~~---------~ 52 (424) |+.-..-+++ -|+|.+.- ... .+.+++.|...... + .....+. .++ .. - T Consensus 1 ~~~~~~~~~l---f~f~~~~de~~~~~~~~~~~~S~~~p~~~dGa~~I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~ 77 (524) T protein:vir:10 1 MANFNTILSF---LKPWANEDEKEYKQQINNNLESVTAPKLDDGAREIETQEQNIPYNALMQQMFGSNEPEVKNTRELID 77 (524) T ss_pred CCchhhHHHH---hhhhhcchhhhhhhhhccCCCccccCCCCCCceeeccCcccccchhhhhhhhhcccchhhhHHHHHH Confidence 3221111000 12222100 001 12223333211100 0 0000000 110 10 1 Q ss_pred cHHHHhhhHHHHHHHHHHHHhhhhC-----ceeEeecccc-CccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHH Q lcl|NC_019710. 53 NDERILQISTVWRCVSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCF 126 (424) Q Consensus 53 ~~~~~~~~~~v~~~i~~ia~~ia~~-----~~~~~~~~~~-~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~ 126 (424) ..+..+.+|.|..||+.|.+.+.-+ |+.+--.+-+ ...-.........++|+ =-|....+ +.+++.|.. 
T Consensus 78 ~YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~-ll~F~~~~----~~~fR~WYV 152 (524) T protein:vir:10 78 TYRNLMNNYEVDNAVQEIVSDAIVYEDDKEVVALNLDGTDFSQSIKDKILAEFSEVLN-LLNFQRKG----TDHFQRWYV 152 (524) T ss_pred HHHHHhhccchhhHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhhee Confidence 1233566899999999999986643 2322111110 00000000111222222 11112222 455777889 Q ss_pred cCCeEEEEeeCC---CCceeEEEEeccceEEEEEc-----CCc--------eEEEEE-------------ecCceEEecH Q lcl|NC_019710. 127 YGNAYALVDRNS---AGDVISLLPLQSANMDVKLV-----GKK--------VVYRYQ-------------RDSEYADFSQ 177 (424) Q Consensus 127 ~G~a~~~~~r~~---~G~~~~l~~l~p~~v~~~~~-----~~~--------~~~~~~-------------~~~~~~~~~~ 177 (424) .|..|..++-+. ...+.+|..|+|.+++..+. .++ .+|.|. ..+....++. T Consensus 153 DgRi~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~~~~~~~~~~~~~~ikI~~ 232 (524) T protein:vir:10 153 DSRIFFHKIINPKKMKDGVQELRRLDPRQVQYIREIVTRMEDGVKIVDGYREFFVYDTGHESYCADGRIYSAGTKVKIPR 232 (524) T ss_pred eceEEEEEEeeCCCccccceeeeeeCCccceeeeeecccCcccchhhcchhhheeecCCCcccccCcceecCCcceecch Confidence 999998876553 24689999999999975321 111 122222 2223466888 Q ss_pred hHeeEecC--cCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC-HHHHHHHHHHHHHHhCC- Q lcl|NC_019710. 178 KEIFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT-EQQRSQVEENFKEIAGG- 253 (424) Q Consensus 178 ~evih~r~--~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~-~~~~~~~~~~~~~~~~~- 253 (424) +-|+|... .+.++-.=+|-|..+.+.+....-++...--+--.-+.-+-|...+-+..+ ..+.+-++....++... 
T Consensus 233 dAIvy~~SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~kNKl 312 (524) T protein:vir:10 233 AAVVYAHSGLLDCCGKNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTMKNRV 312 (524) T ss_pred hheeeeccCcccCCCCceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCcee Confidence 89988763 233443446778888888877777777666555555555556665555443 45555566666554321 Q ss_pred ---------cccCccee-c----------CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccc Q lcl|NC_019710. 254 ---------PVKKRLWI-L----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGS 313 (424) Q Consensus 254 ---------~~ag~~~~-l----------~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~ 313 (424) .+..+.+. + ..|.+++.|.-. +.+.-++-..+..+.+..+++||.+-|....++..+.. T Consensus 313 vYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGg-qnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~f~~g 391 (524) T protein:vir:10 313 VYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTMPGA-TGMSDMDDVLYFRTALYRALRIPESRIPSESNSGVMFD 391 (524) T ss_pred EEeccCCeeccchhhhhhHhhhcccccCCCCccceeecccc-CCcChHHHHHHHHHHHHHHhCCCchhccCCCCcccccc Confidence 01111221 1 124566665431 22323344566788999999999999954433333222 Q ss_pred cHHHHHH------HHHHHHHHHHHHHHHHHHhhhccC-----hhhhc----cceeeecchhhhcc--C---HHHHHHHHH Q lcl|NC_019710. 314 GIEQQNL------GFLQYTLQPYISRWENSIQRWLIP-----AKDVG----RIHAEHNLDGLLRG--D---SASRAAFMK 373 (424) Q Consensus 314 n~e~~~~------~f~~~tl~P~~~~ie~~l~~~L~~-----~~~~~----~~~~~f~~~~~~~~--d---~~~~~~~~~ 373 (424) ...+..+ -|+..-=.-+...+.+-|..+|+. +.++. ...+.|..|..... 
+ ...|...++ T Consensus 392 r~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~ 471 (524) T protein:vir:10 392 AGTAITRDELKFAKWIRQLQNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLT 471 (524) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHH Confidence 2222222 222221222334444455555442 33332 23444444433221 1 123333333 Q ss_pred HHHh--CCCcCHHHHHH-HhCCCC----------------------CCCcCee Q lcl|NC_019710. 374 AMGE--SGLRTINEMRR-TDNLPP----------------------LPGGDVA 401 (424) Q Consensus 374 ~~~~--~g~~t~NE~R~-~lg~~p----------------------~~ggd~~ 401 (424) .+-. +-.++.+=+|+ .|.+.- -+.-|.+ T Consensus 472 ~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 472 MAEPFIGKYISHQTAMKDFLQMTDEEINQEAKQIEEESKEARFQNPDEEEEDF 524 (524) T ss_pred HhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCChhhhcC Confidence 3311 11334444443 233321 1112222 No 227 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=83.99 E-value=0.064 Score=27.13 Aligned_cols=371 Identities=11% Similarity=0.052 Sum_probs=163.1 Q ss_pred cccCCCc-------cHHHHHHhhccCccccccc-ccccccccccccccCCcc-ccHHHHhhh----HHHHHHHHHHHHhh Q lcl|NC_019710. 8 IDLRTNN-------GWWARLKSWFVGGRLVTPN-QGSQTGPVSAHGYLGDSS-INDERILQI----STVWRCVSLISTLT 74 (424) Q Consensus 8 ~~~~~~~-------G~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~-~~~~~~~~~----~~v~~~i~~ia~~i 74 (424) |.+.+.+ --|..++....|...-... ......+ .++. 
-..+.+++- +++...++.++..+ T Consensus 1 m~V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~-------~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~v 73 (452) T protein:vir:94 1 MPIETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKL-------SGQTDDMYNAYKQRALFYSITSKTLSALSGMV 73 (452) T ss_pred CCCCCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCC-------CCCCHHHHHHHHhhccCCchHHHHHHHHhchh Confidence 7777774 4455555544432211100 0001000 1111 011233332 34445555555544 Q ss_pred hhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEEEeccceEE Q lcl|NC_019710. 75 ACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANMD 154 (424) Q Consensus 75 a~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~~l~p~~v~ 154 (424) -+-|..+ .....+.++..+ -...+-.+|.+.++...+.+|-|++++.....|.---+..++|..|. T Consensus 74 f~k~p~~------------~~p~~l~~~~~D--~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~~~~~~~Ii 139 (452) T protein:vir:94 74 LDQPPVI------------THPDAMSKYFED--QSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYISVYTTENIL 139 (452) T ss_pred hcCCcee------------cccHHHHHHHhc--ccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEEEechhhhc Confidence 4444432 011223333222 45778899999999999999999999988766642233333333321 Q ss_pred -------------------EEEcCC-c----e--EEE-------------EEecCceE-E----ecHh------HeeEec Q lcl|NC_019710. 155 -------------------VKLVGK-K----V--VYR-------------YQRDSEYA-D----FSQK------EIFHLK 184 (424) Q Consensus 155 -------------------~~~~~~-~----~--~~~-------------~~~~~~~~-~----~~~~------evih~r 184 (424) ...++. . . .|+ +....... . ...+ ..|=|- T Consensus 140 ~W~~~~~g~l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~IP~v 219 (452) T protein:vir:94 140 NWEEDEDGRLLMVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAKTSTIQNVGVTMDYIPFF 219 (452) T ss_pred CccccccCCeeEEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeeeccceeecCCCcccceeEEE Confidence 011110 0 0 010 11011100 0 0000 111111 Q ss_pred --C-cCCCCccccchHHHHHHHH-HHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcce Q lcl|NC_019710. 
185 --G-FGFTGLVGLSPIAFACKSA-GVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLW 260 (424) Q Consensus 185 --~-~~~~~~~G~s~~~~~~~~i-~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~ 260 (424) + .+.+...|.+|+..++..- ...+.... ....+...+.|-.+++-..+. + .-..| ++.++ T Consensus 220 ~~~~~~~~~~~~~pPLl~LA~ln~~hy~~~sd-~~~~l~~~~~P~l~~~g~~~~-~-----------~i~iG---~~~~~ 283 (452) T protein:vir:94 220 CITPSGLSMTPAKPPMIDIVDINYSHYRTSAD-LEHGRHFTGLPTPWITGAESQ-S-----------TMHIG---STKAW 283 (452) T ss_pred EEcCCCCCCCCCccchHHHHHHHHHHhcchhH-HHHHHHHcccceeEeecCcCC-C-----------ceEec---ccccc Confidence 1 1122346888888766553 33333333 445555667777776532211 1 11233 23466 Q ss_pred ecCC-Cc--eeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHH--HHHHHHHHHHHHHHHHH Q lcl|NC_019710. 261 ILEA-GF--STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ--NLGFLQYTLQPYISRWE 335 (424) Q Consensus 261 ~l~~-g~--~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~--~~~f~~~tl~P~~~~ie 335 (424) .+++ |. .|.+.+-+..... .+..+....++ ...|- .++-....++. +.+.. ..+-.+..|.-++.+++ T Consensus 284 ~lpe~~~~~~yie~~g~~i~~~-~~~l~~le~~m-~~~Ga--~ll~~~~~~~~---s~ea~~~~~~~~~s~L~~~a~~~e 356 (452) T protein:vir:94 284 VIPEVAAKVGFLEFTGQGLQSL-EKALSEKQAQL-ASLSA--RLIDNSTRGSE---ATETVKLRYMSETASLKSVTRAVE 356 (452) T ss_pred cCCCCCCcceEEccCchhHHHH-HHHHHHHHHHH-HHHHH--HhhccCCCcch---HHHHHHHHHHHhhHHHHHHHHHHH Confidence 7774 54 5556555443322 12222222222 11121 22222111111 22222 22223466777777777 Q ss_pred HHHhhhcc--Chhh--hccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh---CCCCCCCcCeeeeccccc Q lcl|NC_019710. 336 NSIQRWLI--PAKD--VGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTD---NLPPLPGGDVAMRQSQYV 408 (424) Q Consensus 336 ~~l~~~L~--~~~~--~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~l---g~~p~~ggd~~~~~~n~~ 408 (424) +.++.-|- -... .....++.+.+-+.........+.+.+++..|.++....++.+ |....+ ++...-..-. 
T Consensus 357 ~al~~~l~~~a~w~g~~~~~~v~~n~dF~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~gvl~~~--~e~~~i~~E~ 434 (452) T protein:vir:94 357 ALLNKAYSCIMDMESMGGTLNIKLNSAFLDSKLTAAELKAWVEAYLSGGISKEIYIHALKVGKVLPPP--GESMGVIPDP 434 (452) T ss_pred HHHHHHHHHHHHHcCCCCceEEEeccccccccCCHHHHHHHHHHHhcCCCcHHHHHHHHHhCCCCCCc--cCHHHHHHHh Confidence 77764331 1111 1223455555555444434555666678999999998888887 543322 1111000111 Q ss_pred chhhccccCCCccCCC Q lcl|NC_019710. 409 PITDLGTNKEPRNNGA 424 (424) Q Consensus 409 ~~~~~~~~~~~~~~g~ 424 (424) |........+|-++|+ T Consensus 435 ~~~~~~~~~~~~~~~~ 450 (452) T protein:vir:94 435 PAPEPSPSNTPPNPSS 450 (452) T ss_pred hccCcccCCCCCCCcc Confidence 1112223335555555 No 228 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=83.93 E-value=0.065 Score=27.11 Aligned_cols=404 Identities=10% Similarity=0.086 Sum_probs=170.3 Q ss_pred CCCCCcccccCCCccHHHHHHhhcc--Cccccccccccccccccccccc----C--Ccc-------ccHHHHhhhHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFV--GGRLVTPNQGSQTGPVSAHGYL----G--DSS-------INDERILQISTVWR 65 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~----~--~~~-------~~~~~~~~~~~v~~ 65 (424) |.| =-||+=.-..... ..+++.|.....+..+...++. + +.. -..+..+.+|.|.. T Consensus 1 m~~---------lfgf~~~~~~~~~~~~~s~~~p~~ddg~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~ 71 (558) T protein:vir:10 1 MAK---------LFGFSIEETQKKSTSIISPVPKNNEDGVDNFISSGFYGQYVDIEGAYRSEYDLIRRYREMALHPEADG 71 (558) T ss_pred Ccc---------hhcchhhhhhhhccCCccccCCCccccccceeccceeeeeecccchhhhHHHHHHHHHHHhhccchhh Confidence 221 1122221111111 1122222211111111111111 1 111 11234566899999 Q ss_pred HHHHHHHhhhhC-----ceeEeeccccCcc-ccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC Q lcl|NC_019710. 
66 CVSLISTLTACL-----PLDVFETDQNDNR-KKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA 139 (424) Q Consensus 66 ~i~~ia~~ia~~-----~~~~~~~~~~~~~-~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~ 139 (424) ||+.|.+.+.-+ |+.+-=.+-+.+. -.........++|+- -|....+ +.+++.|...|..|..++-+.. T Consensus 72 Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~l-l~F~~~~----~e~fR~WYVDgRiyfHKiid~k 146 (558) T protein:vir:10 72 AIEDVVNEAIVSDLYDSPVEVELSNLNASNTLKKKIREEFRYIKEM-MDFDKKS----HEIFRNWYVDGRVFYLKVIDTK 146 (558) T ss_pred HHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHH-hccchhh----hHHHhhheeeeEEEEEEEEeCC Confidence 999999986643 2322111111110 000011122222221 1112222 4557778899999988866433 Q ss_pred ---CceeEEEEeccceEEEEEcC----------------Cc--------eEEEEEecC-------------ceEEecHhH Q lcl|NC_019710. 140 ---GDVISLLPLQSANMDVKLVG----------------KK--------VVYRYQRDS-------------EYADFSQKE 179 (424) Q Consensus 140 ---G~~~~l~~l~p~~v~~~~~~----------------~~--------~~~~~~~~~-------------~~~~~~~~e 179 (424) ..+.+|..|+|.+++..+.- .+ .+|.|...+ ....++.+- T Consensus 147 ~pk~GI~ELr~lDPr~i~~Vr~i~~~~~~~~~~~~~~~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~~~~~~~~vkI~~dA 226 (558) T protein:vir:10 147 NPQEGIQDLRYIDPLKIKFIRQEKRKPGNQDPAIRVRSEQDVVPNPEFEEFYIYTPKVQHPTGMVGQMGGKNSIKIAKDS 226 (558) T ss_pred CccccceeeeeeCcccceeeeeeccccccccceeeeecccceeeccceeEeeeecCCcccccccceeecCCCceeechhh Confidence 36889999999999643321 01 122232211 113333333 Q ss_pred eeEecCc---CCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC-HHHHHHHHHHHHHHhCC-- Q lcl|NC_019710. 180 IFHLKGF---GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT-EQQRSQVEENFKEIAGG-- 253 (424) Q Consensus 180 vih~r~~---~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~-~~~~~~~~~~~~~~~~~-- 253 (424) |++-+- ..++-.=+|-|..+.+.+....-++...--+--.-+.-+-|...+-+... ..+.+.++....++... 
T Consensus 227 -I~y~hSGL~d~~~~~i~syLhkAIKp~NQLkmlEDAlVIYRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklV 305 (558) T protein:vir:10 227 -ITMCTSGLVDRNKNRVLSYLHKAIKALNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLV 305 (558) T ss_pred -eeeecccceecCCCeeeecchHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEE Confidence 333322 22333346778888888777777777666555555555556655544443 45555566666555321 Q ss_pred ------c--ccCcce-ec----------CCCceeeeccC--ChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCccc Q lcl|NC_019710. 254 ------P--VKKRLW-IL----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWG 312 (424) Q Consensus 254 ------~--~ag~~~-~l----------~~g~~~~~l~~--s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~ 312 (424) . +..+.+ .+ ..|.+++.|.- +.-+|+ -..+..+.+..+++||.+-|....+-+. T Consensus 306 YDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnLgem~---DV~YF~kKLy~aLnVP~SRl~~e~~f~~-- 380 (558) T protein:vir:10 306 YDANTGEVRDDRKFMSMMEDFWLPRREGGRGTEITTLPGGQNLGELS---DVDYFQKKLYRALGVPESRIAAEGGFNL-- 380 (558) T ss_pred EeccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcchHH---HHHHHHHHHHHHhCCCccccCCCCcccc-- Confidence 0 111111 11 12456666543 333443 4566788899999999998875433222 Q ss_pred ccHHHHH------HHHHHHHHHHHHHHHHHHHhhhccC-----hhhhc----cceeeecchhhhcc--C---HHHHHHHH Q lcl|NC_019710. 313 SGIEQQN------LGFLQYTLQPYISRWENSIQRWLIP-----AKDVG----RIHAEHNLDGLLRG--D---SASRAAFM 372 (424) Q Consensus 313 ~n~e~~~------~~f~~~tl~P~~~~ie~~l~~~L~~-----~~~~~----~~~~~f~~~~~~~~--d---~~~~~~~~ 372 (424) ..+.+.. .-|+..-=.-+...+.+-|..+|+. +.++. ...+.|..|+.... 
+ ...|...+ T Consensus 381 Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l 460 (558) T protein:vir:10 381 GRSSEILRDELKFAKFVGRLRKRFAAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGML 460 (558) T ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHH Confidence 2222222 1222222222344444455555442 23332 23444544433221 1 12333333 Q ss_pred HHHHh--CCCcCHHHHHH-Hh--------------------CCCCCCCcCeeeecccc-----cchhhcccc-CCCccCC Q lcl|NC_019710. 373 KAMGE--SGLRTINEMRR-TD--------------------NLPPLPGGDVAMRQSQY-----VPITDLGTN-KEPRNNG 423 (424) Q Consensus 373 ~~~~~--~g~~t~NE~R~-~l--------------------g~~p~~ggd~~~~~~n~-----~~~~~~~~~-~~~~~~g 423 (424) +.+-. +-.++.+=+|+ .| |+=+-|...+++..+-+ ......+.+ .+++-++ T Consensus 461 ~~~dpyvGky~S~dyi~k~ILr~tDeeI~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 540 (558) T protein:vir:10 461 ATIEPYIGKYYSTEYVRKRVLRQTDMEIEEIDTQIEDEIQKGIIPDPSQIDPITGEPLPQEGDPAMEGMGEQPVDPDLEA 540 (558) T ss_pred HHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCccccChhhccccCccCCchhccCCCCCccccccc Confidence 33211 11334443332 11 22111222222211111 111111111 1111122 Q ss_pred C Q lcl|NC_019710. 424 A 424 (424) Q Consensus 424 ~ 424 (424) + T Consensus 541 ~ 541 (558) T protein:vir:10 541 Q 541 (558) T ss_pred c Confidence 2 No 229 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=82.60 E-value=0.076 Score=26.73 Aligned_cols=401 Identities=10% Similarity=0.060 Sum_probs=175.3 Q ss_pred cCCCccHHHHHHhh--------cc--Ccccccccccccc--------ccccc----cccc-CCcc---------ccHHHH Q lcl|NC_019710. 10 LRTNNGWWARLKSW--------FV--GGRLVTPNQGSQT--------GPVSA----HGYL-GDSS---------INDERI 57 (424) Q Consensus 10 ~~~~~G~~~~~~~~--------~~--~~~~~~~~~~~~~--------~~~~~----~~~~-~~~~---------~~~~~~ 57 (424) |-...-++..+++. +. ..+++.|...... .+... .+.. +... 
-..+.. T Consensus 1 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~YR~m 80 (521) T protein:vir:81 1 MFSRLKMLARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLVNTYRGL 80 (521) T ss_pred CcchhhhhHhhcCchhhhHHhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHHHHHHHH Confidence 22222333332221 11 1122222211100 00000 0000 1110 112345 Q ss_pred hhhHHHHHHHHHHHHhhhhC-----ceeEeec-cccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeE Q lcl|NC_019710. 58 LQISTVWRCVSLISTLTACL-----PLDVFET-DQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAY 131 (424) Q Consensus 58 ~~~~~v~~~i~~ia~~ia~~-----~~~~~~~-~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~ 131 (424) +.+|.|..||+.|.+.+.-+ |+.+--. .+-...-.........++|+ =-|....+ +.++..|...|..| T Consensus 81 a~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~----~~~fR~WYVDgRi~ 155 (521) T protein:vir:81 81 MNNHEVENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLN-TIQFDRRG----QDMFRRWYVDSRIF 155 (521) T ss_pred hhccchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhhhhcceEE Confidence 66899999999999987643 2322111 10000000000111222222 11112222 45577888999999 Q ss_pred EEEeeCCC--CceeEEEEeccceEEEEEcC-----Cc--------eEEEEEecC-------------ceEEecHhHeeEe Q lcl|NC_019710. 132 ALVDRNSA--GDVISLLPLQSANMDVKLVG-----KK--------VVYRYQRDS-------------EYADFSQKEIFHL 183 (424) Q Consensus 132 ~~~~r~~~--G~~~~l~~l~p~~v~~~~~~-----~~--------~~~~~~~~~-------------~~~~~~~~evih~ 183 (424) ..++.+.+ ..+.+|..|+|.+++..+.- .+ -+|.|..+. 
....++.+-| ++ T Consensus 156 fhkiid~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI-~y 234 (521) T protein:vir:81 156 FHKIIGKNPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAI-TY 234 (521) T ss_pred EEEEEcCCccccceeeeeeCCcceeeeeeecccccCccceecceeeeeeeecCCccccccceeecCCcceeechhhe-ee Confidence 98885544 56899999999999854321 11 123332221 1233444444 33 Q ss_pred cCc---CCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC-HHHHHHHHHHHHHHhC------- Q lcl|NC_019710. 184 KGF---GFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT-EQQRSQVEENFKEIAG------- 252 (424) Q Consensus 184 r~~---~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~-~~~~~~~~~~~~~~~~------- 252 (424) -+- ..++-.=+|-|..+.+.+....-++...--+--.-+.-+-|...+-+..+ ..+.+-++....++.. T Consensus 235 ~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvYDa~ 314 (521) T protein:vir:81 235 AHSGLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDAS 314 (521) T ss_pred eeccceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecc Confidence 332 23333346778888888887777777776655555555666666655444 5556667777766643 Q ss_pred -Cc--ccCcce-ecC----------CCceeeecc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH Q lcl|NC_019710. 253 -GP--VKKRLW-ILE----------AGFSTSAIG--VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE 316 (424) Q Consensus 253 -~~--~ag~~~-~l~----------~g~~~~~l~--~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e 316 (424) |. +..+.+ .++ .|.+++.|. .+.-+| +-..+..+.+..+++||.+-|+...++..+..... 
T Consensus 315 TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~ 391 (521) T protein:vir:81 315 TGKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDI---DDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGS 391 (521) T ss_pred cccccccccccchhhhhcccccCCCcccceeecccCCCCChH---HHHHHHHHHHHHHhCCccccccCCCCcceeccccc Confidence 11 111122 221 355666664 344344 44566788999999999999954433322211211 Q ss_pred HHHH------HHHHHHHHHHHHHHHHHHhhhccC-----hhhhc----cceeeecchhhhcc--C---HHHHHHHHHHHH Q lcl|NC_019710. 317 QQNL------GFLQYTLQPYISRWENSIQRWLIP-----AKDVG----RIHAEHNLDGLLRG--D---SASRAAFMKAMG 376 (424) Q Consensus 317 ~~~~------~f~~~tl~P~~~~ie~~l~~~L~~-----~~~~~----~~~~~f~~~~~~~~--d---~~~~~~~~~~~~ 376 (424) +..+ -|+..-=.-+...+.+-|..+|+. +.++. ...+.|..|+.... + ...|...++.+- T Consensus 392 EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~d 471 (521) T protein:vir:81 392 EITRDELEFSKFIRTRQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERIT 471 (521) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhh Confidence 2111 222222222444444555555443 33322 23444544433221 1 123444433331 Q ss_pred h--CCCcCHHHHHH-HhCCCCCC--CcCeeeecccccchhhccccCCCc-cCCC Q lcl|NC_019710. 377 E--SGLRTINEMRR-TDNLPPLP--GGDVAMRQSQYVPITDLGTNKEPR-NNGA 424 (424) Q Consensus 377 ~--~g~~t~NE~R~-~lg~~p~~--ggd~~~~~~n~~~~~~~~~~~~~~-~~g~ 424 (424) . +-.++.+=+|+ .|.+.-.+ .-++.+ ..-.+.+-.+++. +-+. 
T Consensus 472 pyvGky~s~dyi~k~ILr~tDeei~~~~k~I-----~~E~~~~~~~~p~~~~~~ 520 (521) T protein:vir:81 472 PYIGKYFSNQTVMRDILKYTDDQMDTEKKQI-----EEEANDPRFKQTPDEIED 520 (521) T ss_pred hhhccccchHHHHHHHhccCHHHHHHHHHHH-----HHHhhCCCCCCCcccccC Confidence 1 12345555543 33432110 000000 0000000001111 1111 No 230 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=81.93 E-value=0.081 Score=26.56 Aligned_cols=379 Identities=14% Similarity=0.087 Sum_probs=147.2 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccccccccc-cccccccccccCCc-cccHHHHhhhHHHHHHHHHHHHhhhhC- Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGS-QTGPVSAHGYLGDS-SINDERILQISTVWRCVSLISTLTACL- 77 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~-~~~~~~~~~~~~v~~~i~~ia~~ia~~- 77 (424) |+| |+-.--++-..-|+++++.- ......-.... .+-|.. +...+.. .-.....+ .++-..|++.+|+.+-+. T Consensus 1 ~~~-~~~~~~~~~~~r~~~l~~~R-~~~e~~w~e~~~y~lP~~-~~~~~~~~~~~~~~~~-dst~~~a~~~Las~l~~~l 76 (522) T protein:vir:94 1 MAE-REGFAAEGAKAVYDRLKNGR-QPYETRAQNCAAVTIPSL-FPKESDNSSTEYTTPW-QAVGARCLNNLAAKLMLAL 76 (522) T ss_pred Ccc-cchhhHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhcccc-cCCCCCcccccccccc-cccHHHHHHHHHHHHHhhc Confidence 665 22221111122233333210 00000000000 011100 0000100 00001122 233445666666655432 Q ss_pred ----ceeEeecccc--------Ccc-ccc-c----ccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC Q lcl|NC_019710. 78 ----PLDVFETDQN--------DNR-KKV-D----LSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA 139 (424) Q Consensus 78 ----~~~~~~~~~~--------~~~-~~~-~----~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~ 139 (424) ||-=....+. +.. .++ . ....+...|. +-| .+.-+..+..++..+|||.+++..+.. 
T Consensus 77 tP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~-~sn----f~~~~~~~~~~L~~~G~a~l~~~~~~~ 151 (522) T protein:vir:94 77 FPQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYME-TNS----FRVPLFEALKQLIVSGNCLLYIPEPEQ 151 (522) T ss_pred CCCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHH-hcC----cHHHHHHHHHHHHhhCcEeEeeeccCC Confidence 3321111110 000 000 0 0111222222 233 445566778899999999999877776 Q ss_pred Ccee--EEEEeccceEEEEEcCCceE-----------------------------------E--EEEecCceEEe-cH-- Q lcl|NC_019710. 140 GDVI--SLLPLQSANMDVKLVGKKVV-----------------------------------Y--RYQRDSEYADF-SQ-- 177 (424) Q Consensus 140 G~~~--~l~~l~p~~v~~~~~~~~~~-----------------------------------~--~~~~~~~~~~~-~~-- 177 (424) |.+. ..|||....|. .|..+.. | .+..++....+ .- T Consensus 152 ~~~~~~~~~pl~~y~v~--~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~~~~~~~~g 229 (522) T protein:vir:94 152 GTYSPMRMYRLVSYVVQ--RDAFGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIYRQDDEYLRYEEVEG 229 (522) T ss_pred CceeeEEEEEcceEEEe--eCCCcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEEEEeeCCceeEEeeccC Confidence 6544 45666443333 3322210 0 01111111000 00 Q ss_pred ------------hH--eeEecCcCCCC-ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHH Q lcl|NC_019710. 178 ------------KE--IFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQ 242 (424) Q Consensus 178 ------------~e--vih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~ 242 (424) ++ .+.+|+...++ .||.||...++..+...+.+.+.......-...|..++... ........ T Consensus 230 ~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~-g~~~~~~~-- 306 (522) T protein:vir:94 230 IEVTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPN-GITQPRRL-- 306 (522) T ss_pred ceecccCCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccc-ccccchhe-- Confidence 00 23333333333 79999999999999999999999999999888888665433 33333211 Q ss_pred HHHHHHHHhCCcccCcc-eecCCCceeeeccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHH--H Q lcl|NC_019710. 
243 VEENFKEIAGGPVKKRL-WILEAGFSTSAIGVTPQDAEM-MASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ--Q 318 (424) Q Consensus 243 ~~~~~~~~~~~~~ag~~-~~l~~g~~~~~l~~s~~d~~~-~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~--~ 318 (424) ..+.+ |-+ .--++.+...++.. +.+++. .+..+....+|..+|-+.. +...+.... +++| . T Consensus 307 --------~~~~~-g~~v~g~~~~v~~~~~~~-~~~~~~~~~~i~~~~~rI~~af~~~~--~~~~~~~r~---TAtEV~~ 371 (522) T protein:vir:94 307 --------NKAAT-GEFVAGRVEDINFLQLTK-GQDFTIAKSVADAIEQRLGWAFLLNS--AVQRNAERV---TAEEIRY 371 (522) T ss_pred --------eccCC-ceeecCCcccceeeeccc-ccchhHHHHHHHHHHHHHHHHHhhhh--hccCCCccc---cHHHHHH Confidence 11111 111 11123334444443 234432 3455666777888886652 222221211 2232 3 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhcc-------------ChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHH Q lcl|NC_019710. 319 NLGFLQYTLQPYISRWENSIQRWLI-------------PAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINE 385 (424) Q Consensus 319 ~~~f~~~tl~P~~~~ie~~l~~~L~-------------~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE 385 (424) +..=....|.|....+.++|-.-|+ ++..... ++.++...+.. ..|..-++.+.+ ..+. T Consensus 372 r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~--v~v~~~s~La~--~qr~~~~~~l~~----~~~~ 443 (522) T protein:vir:94 372 VAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIPDLPKEA--VEPTVSTGLEA--LGRGQDLEKLTQ----AVNM 443 (522) T ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccc--EEeeEecHHHH--HHHHHHHHHHHH----HHHH Confidence 3344556667777776666543332 2211111 22222222111 112222222221 1222 Q ss_pred HHHHhCCCCCCCcCeeeecccccch-hhccccCCCccCCC Q lcl|NC_019710. 386 MRRTDNLPPLPGGDVAMRQSQYVPI-TDLGTNKEPRNNGA 424 (424) Q Consensus 386 ~R~~lg~~p~~ggd~~~~~~n~~~~-~~~~~~~~~~~~g~ 424 (424) + -.+.|.. .|.. 
.|.-.+ +...+.- +-+..+ T Consensus 444 i---a~l~P~~-~~~~---id~d~~~~~~a~~~-Gv~~~~ 475 (522) T protein:vir:94 444 M---TGLQPLS-QDPD---INLPTLKLRLLNAL-GIDTAG 475 (522) T ss_pred H---Hhccchh-hhhc---CCHHHHHHHHHHHc-CCChhh Confidence 1 1222310 1111 111110 1111111 111111 No 231 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=76.72 E-value=0.13 Score=25.40 Aligned_cols=396 Identities=13% Similarity=0.069 Sum_probs=147.7 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccccccccc-cccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhC-- Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGS-QTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACL-- 77 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~-- 77 (424) |++.|+-..-++-..-|+++++.- ......-.... .+-|...........-.....+ .++-..|++.+|+.+-+. T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R-~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~-dst~~~a~~~Laa~l~~~lt 78 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDR-APYETRAQNCAQYTIPSLFPKDSDNASTDYQTPW-QAVGARGLNNLASKLMLALF 78 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhcccccCCCCCcccccccccc-cccHHHHHHHHHHHHHHhhc Confidence 999887655554444455444311 00000000000 0111100000000000001122 233444555655554432 Q ss_pred ---ceeEeeccccCcc------c---cc-----cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019710. 78 ---PLDVFETDQNDNR------K---KV-----DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 78 ---~~~~~~~~~~~~~------~---~~-----~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G 140 (424) ||-=..-.+..-. . ++ .....+...|. 
+-| .+.-+..+..+++.+|||.+++..+..+ T Consensus 79 P~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~~L~~~G~a~ly~~e~~~~ 153 (536) T protein:vir:21 79 PMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SNS----YRVTLFEALKQLVVAGNVLLYLPEPEGS 153 (536) T ss_pred CCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHH-hcC----cHHHHHHHHHHHHhHCcEeEEEeeCCCC Confidence 3311111110000 0 00 00122333332 333 4455667788999999999998766554 Q ss_pred ce--eEEEEeccceEEEEEcCCce-----------------------------------EE--EEEe-cC-ceE------ Q lcl|NC_019710. 141 DV--ISLLPLQSANMDVKLVGKKV-----------------------------------VY--RYQR-DS-EYA------ 173 (424) Q Consensus 141 ~~--~~l~~l~p~~v~~~~~~~~~-----------------------------------~~--~~~~-~~-~~~------ 173 (424) .+ ...|||....|....+++.. +| .+.+ ++ ... T Consensus 154 ~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~ 233 (536) T protein:vir:21 154 NYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYLRYEEVE 233 (536) T ss_pred ceeeEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEecCCCcEEEEeccC Confidence 43 45666644444322222110 00 0111 11 110 Q ss_pred --EecH-------hH--eeEecCcCCCC-ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHH Q lcl|NC_019710. 174 --DFSQ-------KE--IFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRS 241 (424) Q Consensus 174 --~~~~-------~e--vih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~ 241 (424) .+.. ++ .+.+|+...++ .||.||...++..+.....+.+.......-...|...+. +........ 
T Consensus 234 g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~-p~g~~~~~~-- 310 (536) T protein:vir:21 234 GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN-PAGITQPRR-- 310 (536) T ss_pred CeeeccccCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccC-cccccchhh-- Confidence 0100 00 13334333333 799999999999999998888888776666566554443 223322221 Q ss_pred HHHHHHHHHhCCcccCcce-ecCCCceeeeccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHH-- Q lcl|NC_019710. 242 QVEENFKEIAGGPVKKRLW-ILEAGFSTSAIGVTPQDAE-MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ-- 317 (424) Q Consensus 242 ~~~~~~~~~~~~~~ag~~~-~l~~g~~~~~l~~s~~d~~-~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~-- 317 (424) ...+.+ |.++ ...+.....++... .+++ ..+..+.....|..+|-+.. +... ++.. -+++| T Consensus 311 --------~~~~~~-g~~v~g~~~~v~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~--l~~~-~~~r--~TAtEV~ 375 (536) T protein:vir:21 311 --------LTKAQT-GDFVTGRPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS--AVQR-TGER--VTAEEIR 375 (536) T ss_pred --------hccCCC-cceecCCcccceeeecccc-ccchHHHHHHHHHHHHHHHHHhhhh--cccC-CCCC--ccHHHHH Confidence 111111 1111 12233334444432 3333 23455666777888885431 2111 1111 12332 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhh-------------ccChhhhccceeee--cchhhhcc-CHHHHHHHHHHHHhCC-- Q lcl|NC_019710. 318 QNLGFLQYTLQPYISRWENSIQRW-------------LIPAKDVGRIHAEH--NLDGLLRG-DSASRAAFMKAMGESG-- 379 (424) Q Consensus 318 ~~~~f~~~tl~P~~~~ie~~l~~~-------------L~~~~~~~~~~~~f--~~~~~~~~-d~~~~~~~~~~~~~~g-- 379 (424) .+..=....|.|....+.++|-.- ++++-......+.+ -+..+.+. 
+.+.....+..+.+.+ T Consensus 376 ~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~Pe 455 (536) T protein:vir:21 376 YVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAPM 455 (536) T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhhchh Confidence 222333444555555554444322 33322111122222 12222221 1122222222222111 Q ss_pred ----CcCHHHH----HHHhCCCCCCCcCeeeecccccchhh--------------ccc-c-----CCC------ccCCC Q lcl|NC_019710. 380 ----LRTINEM----RRTDNLPPLPGGDVAMRQSQYVPITD--------------LGT-N-----KEP------RNNGA 424 (424) Q Consensus 380 ----~~t~NE~----R~~lg~~p~~ggd~~~~~~n~~~~~~--------------~~~-~-----~~~------~~~g~ 424 (424) .+..+++ .+.+|.+|.. .+..+-....+-. ++. . ..+ .++++ T Consensus 456 ~ld~~id~d~~~~~~a~~~Gv~p~~---~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g 531 (536) T protein:vir:21 456 RDDPDINLAMIKLRIANAIGIDTSG---ILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVG 531 (536) T ss_pred hhcccCCHHHHHHHHHHHcCCChhh---hcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhHHhhhhccc Confidence 1222222 2234553310 0000000000000 000 0 001 11111 No 232 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=75.80 E-value=0.14 Score=25.22 Aligned_cols=398 Identities=10% Similarity=0.064 Sum_probs=179.0 Q ss_pred ccHHHHHHh-----hccC--ccccccccccc-----cc---ccccc---cc---cCCcc------ccHHHHhhhHHHHHH Q lcl|NC_019710. 14 NGWWARLKS-----WFVG--GRLVTPNQGSQ-----TG---PVSAH---GY---LGDSS------INDERILQISTVWRC 66 (424) Q Consensus 14 ~G~~~~~~~-----~~~~--~~~~~~~~~~~-----~~---~~~~~---~~---~~~~~------~~~~~~~~~~~v~~~ 66 (424) .-+|.+.-. .... .+.+.|..... ++ +.... +. ..+.. 
-..+..+.+|.|..| T Consensus 1 ~~~w~~~de~~~~~~~~~~~~S~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd~A 80 (511) T protein:vir:56 1 MKFWTKEEEQDIQKIEKNPVRSFSAPDNVDGAKEIHTNLLAPQLGHAIIPSDAQSEGTIPVKELIKSYRALAEYHEVDDA 80 (511) T ss_pred CCCccchhhhhhhhhccCCcccccCCCCCCCceEEecccccceecceeccccccccCccchHHHHHHHHHHhhccchhhH Confidence 112222111 1111 12222211110 00 00000 00 00110 112345668999999 Q ss_pred HHHHHHhhhhC-----ceeEeecccc-CccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019710. 67 VSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 67 i~~ia~~ia~~-----~~~~~~~~~~-~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G 140 (424) |+.|.+.+.-+ |+.+--.+-+ ...-.........++|+ =-|....+ +.++..|...|..|..++-+..- T Consensus 81 v~eIvne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~----~~~fR~WYVDgRi~fHkiid~k~ 155 (511) T protein:vir:56 81 IQEIVDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVVS-LLQMRKHG----YKWFRKWYVDSRIYFHKILDKDN 155 (511) T ss_pred HHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhhhhcceEEEEEEecccc Confidence 99999987643 3332111110 00000000112222222 11112222 45577788999999988777777 Q ss_pred ceeEEEEeccceEEEEEc-----CC--------ceEEEEEec--------------CceEEecHhHeeEecCc----CCC Q lcl|NC_019710. 141 DVISLLPLQSANMDVKLV-----GK--------KVVYRYQRD--------------SEYADFSQKEIFHLKGF----GFT 189 (424) Q Consensus 141 ~~~~l~~l~p~~v~~~~~-----~~--------~~~~~~~~~--------------~~~~~~~~~evih~r~~----~~~ 189 (424) .+.+|..|+|.+++..+. .+ .-+|.|... 
.....++.+.|.|...- +.+ T Consensus 156 GI~eLr~lDPr~i~~vr~i~~~~~~~~~v~~~~~ey~~Y~~~~~~~~~~~~~~~~~~~~vkI~~daI~y~hSGL~d~~~~ 235 (511) T protein:vir:56 156 NIIELRPLNPMKMELVREIQKETIDGVEVVKGTLEYYVYKQSDYKMPSWMSATNRAQTSFRIPKDAIVFAHSGLMRGCAD 235 (511) T ss_pred ceeehhhcCcccchhhhhhhcccccccccccceeeeeEecCCCcccCcccccccccccceeechhheeeecccceeccCC Confidence 899999999999985332 11 112333221 13467899999666532 245 Q ss_pred CccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC-HHHHHHHHHHHHHHhCC----------cccCc Q lcl|NC_019710. 190 GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT-EQQRSQVEENFKEIAGG----------PVKKR 258 (424) Q Consensus 190 ~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~-~~~~~~~~~~~~~~~~~----------~~ag~ 258 (424) ..+.+|-|..+.+.+....-++...--+--.-+.-+-|...+-+..+ ..+.+-++....++..- .+..+ T Consensus 236 ~g~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk 315 (511) T protein:vir:56 236 DPYIIGYLDRAIKPANQLKMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQVKNTTN 315 (511) T ss_pred CCeeeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchh Confidence 55678889999988888887777776655555555566665555443 45555566665555321 11111 Q ss_pred cee-c----------CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC-CCcccccHHHHHH------ Q lcl|NC_019710. 259 LWI-L----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK-STSWGSGIEQQNL------ 320 (424) Q Consensus 259 ~~~-l----------~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~-~~~~~~n~e~~~~------ 320 (424) .+. + ..|.+++.|.-. 
+.+.-++-..+..+.+..+++||.+-|...++ +..+...+.+..+ T Consensus 316 ~msMlEDyWLpRReGgrgTEItTLpGg-qnlgem~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEiKF~ 394 (511) T protein:vir:56 316 AMSMLEDYYLPRREGSKGTEVSTLPGG-QSLGDIEDVLYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDELKFT 394 (511) T ss_pred hhhhHhhhcccccCCCCccceeecccc-CCcChHHHHHHHHHHHHHHhCCCcccccCCCCccccccccchhhhHHHHHHH Confidence 221 1 125566665431 22333444567788999999999998874432 2222111222211 Q ss_pred HHHHHHHHHHHHHHHHHHhhhccC-----hhhhc----cceeeecchhhhcc--C---HHHHHHHHHHHH--hCCCcCHH Q lcl|NC_019710. 321 GFLQYTLQPYISRWENSIQRWLIP-----AKDVG----RIHAEHNLDGLLRG--D---SASRAAFMKAMG--ESGLRTIN 384 (424) Q Consensus 321 ~f~~~tl~P~~~~ie~~l~~~L~~-----~~~~~----~~~~~f~~~~~~~~--d---~~~~~~~~~~~~--~~g~~t~N 384 (424) -|+..-=.-+...+.+-|..+|+. +.++. ...+.|..|+.... + ...|...++.+- -+-.++.+ T Consensus 395 KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~ 474 (511) T protein:vir:56 395 KFVKRLQTKFETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYSHK 474 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccchH Confidence 222222222334444455555442 23332 23444544433221 1 123333333321 11244555 Q ss_pred HHHH-HhCCCCCC--CcCeeeecccccchhh-ccccCCCccCC Q lcl|NC_019710. 385 EMRR-TDNLPPLP--GGDVAMRQSQYVPITD-LGTNKEPRNNG 423 (424) Q Consensus 385 E~R~-~lg~~p~~--ggd~~~~~~n~~~~~~-~~~~~~~~~~g 423 (424) =+|+ .|.+.-.+ .-++.+ .-+. 
..-.+++.++= T Consensus 475 yi~k~ILr~tDeei~~~~k~I------~~E~k~~~~~~~e~~f 511 (511) T protein:vir:56 475 YIQKNILRLSDDQITAMQSEI------DEEETNPRFQQDDQGF 511 (511) T ss_pred HHHHHHhccCHHHHHHHHHHH------HHhhcCCCCCCcccCC Confidence 5554 34443110 000000 0000 00001111111 No 233 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=75.36 E-value=0.15 Score=25.14 Aligned_cols=399 Identities=15% Similarity=0.128 Sum_probs=175.1 Q ss_pred HHHHHhhc-------cCccccccccccccccc-----ccccccCCc--c-------ccHHHHhhhHHHHHHHHHHHHhhh Q lcl|NC_019710. 17 WARLKSWF-------VGGRLVTPNQGSQTGPV-----SAHGYLGDS--S-------INDERILQISTVWRCVSLISTLTA 75 (424) Q Consensus 17 ~~~~~~~~-------~~~~~~~~~~~~~~~~~-----~~~~~~~~~--~-------~~~~~~~~~~~v~~~i~~ia~~ia 75 (424) +..++++. .+.+++.|........+ +......+. . -..+..+.+|.|..||+.|.+.+. T Consensus 1 m~~lfgf~i~~~~~~~~~S~vpp~~~~~~~~i~~g~~g~~v~~~g~~~~~n~~eLI~~YR~ma~~pEVd~Av~eIVneaI 80 (564) T protein:vir:10 1 MSQLFGFLINEKEGQKGQSPVPPNDEASVSTVAGGYFGTYVDTSGGQNSRNEYELIRRYRDMSLHPEVDSAIDEIVNEFV 80 (564) T ss_pred CcchhcceeeeeccCCCCCcccCCcCCChhhhhccccceeeecccccchhhHHHHHHHHHHHhhccchhhHHHHhhccee Confidence 22233322 22233333221111111 111111111 1 112345668999999999999854 Q ss_pred hC-----ceeEeecc-ccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC---CceeEEE Q lcl|NC_019710. 76 CL-----PLDVFETD-QNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA---GDVISLL 146 (424) Q Consensus 76 ~~-----~~~~~~~~-~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~---G~~~~l~ 146 (424) -+ |+.|--.+ +-+..-.........++|+ =-|....+ +.+++.|...|..|..++-+.. ..+.+|. 
T Consensus 81 v~d~~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~-ll~F~~~~----~e~fR~WYVDgRi~fHkiid~~~pk~GI~eLr 155 (564) T protein:vir:10 81 VNDGDDKPVEVDLQNLEIGSGVKKKIRDEFNRILR-MMNFNVNA----HEIIRNWYVDGRSHYHKVIDLDNPKKGILELR 155 (564) T ss_pred EecCCCceEEEEecccCcchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhhhhcceEEEEEEeeCCChhhhhhhhh Confidence 32 22221111 0000000000111222222 11122222 4557778889999988765432 2488999 Q ss_pred EeccceEEEEEc------CCc-----------------eEEEEEec-----------------CceEEecHhHeeEecC- Q lcl|NC_019710. 147 PLQSANMDVKLV------GKK-----------------VVYRYQRD-----------------SEYADFSQKEIFHLKG- 185 (424) Q Consensus 147 ~l~p~~v~~~~~------~~~-----------------~~~~~~~~-----------------~~~~~~~~~evih~r~- 185 (424) .|+|.+++..+. ..+ -+|.|.+. +....++.+-|.|.+. T Consensus 156 ~lDPr~i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~~g~~~~~~~~~~~~~~~~ikI~~daI~y~hSG 235 (564) T protein:vir:10 156 YIDSLKIRKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGFAGNIPMVTGSMDWSNQEGIKIASDAIAQSTSG 235 (564) T ss_pred hhcccceeeeeeeccccccccceeeeeeeeeccccccccceeeccccccCcccccccccccccccceeechhhcceeccc Confidence 999998875431 111 12223211 1235677777777753 Q ss_pred -cCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC-HHHHHHHHHHHHHHhCC--------c- Q lcl|NC_019710. 186 -FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT-EQQRSQVEENFKEIAGG--------P- 254 (424) Q Consensus 186 -~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~-~~~~~~~~~~~~~~~~~--------~- 254 (424) ...++-.=+|-|..+.+.+....-+++..--+--.-+.-+-|...+-+... ..+.+.++....++..- . T Consensus 236 L~d~~~~~i~gyLhkAIKp~NQLkmlEDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev 315 (564) T protein:vir:10 236 LMDLNKKMTLSFLHKAIKSLNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSRYRNKLVYDGQTGEI 315 (564) T ss_pred ceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCcee Confidence 233444446778888888877777777666555555555556655545443 45555566666555321 0 Q ss_pred -ccCcce-ec----------CCCceeeeccC--ChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCccc-ccHHH-- Q lcl|NC_019710. 
255 -VKKRLW-IL----------EAGFSTSAIGV--TPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWG-SGIEQ-- 317 (424) Q Consensus 255 -~ag~~~-~l----------~~g~~~~~l~~--s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~-~n~e~-- 317 (424) +..+.+ .+ ..|.+++.|.- +.-+|+ -..+..+.+..+++||.+-|.... ++.+. ..+|= T Consensus 316 rddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~---DV~YF~kKLY~aLnVP~SRl~~e~-~~f~~Gr~~EItR 391 (564) T protein:vir:10 316 RDDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELK---DVEYFKKKLYNSLNLPPSRLTDDN-KAFNLGKSTEILR 391 (564) T ss_pred cccchhhhhHhhhcccccCCCcccceeeccccCCcchHH---HHHHHHHHHHHHhCCCcccccCCC-ceeecccccchhH Confidence 111111 11 12456666543 333443 456678889999999999887542 11211 11221 Q ss_pred ---HHHHHHHHHHHHHHHHHHHHHhhhccC-----hhhhc----cceeeecchhhhcc--C---HHHHHHHHHHH--HhC Q lcl|NC_019710. 318 ---QNLGFLQYTLQPYISRWENSIQRWLIP-----AKDVG----RIHAEHNLDGLLRG--D---SASRAAFMKAM--GES 378 (424) Q Consensus 318 ---~~~~f~~~tl~P~~~~ie~~l~~~L~~-----~~~~~----~~~~~f~~~~~~~~--d---~~~~~~~~~~~--~~~ 378 (424) ...-|+..-=.-+...+.+-|..+|+. +.++. ...+.|..|..... + ...|...++.+ +-+ T Consensus 392 DEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvG 471 (564) T protein:vir:10 392 DELKFTKFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVG 471 (564) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhc Confidence 111222222222344444555555442 23332 24444544433221 1 12333333332 111 Q ss_pred CCcCHHHHHH------------H---------hCC--CCC--CCcCeee-ecccccchhhcc------ccCCCccCCC Q lcl|NC_019710. 379 GLRTINEMRR------------T---------DNL--PPL--PGGDVAM-RQSQYVPITDLG------TNKEPRNNGA 424 (424) Q Consensus 379 g~~t~NE~R~------------~---------lg~--~p~--~ggd~~~-~~~n~~~~~~~~------~~~~~~~~g~ 424 (424) -.++.+=+|+ + .|+ +|. +.||..- .+..+.|..... 
..+....++| T Consensus 472 ky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~a 549 (564) T protein:vir:10 472 KYFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNMLDDMEKQNQAFAPELQAAQDDLAAEREIKKLNSA 549 (564) T ss_pred cccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhcCCCccCCCCcCCcchhhhccccccccChhhhccC Confidence 1223333321 1 122 232 2343222 222233332211 1111111222 No 234 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=74.75 E-value=0.15 Score=25.03 Aligned_cols=401 Identities=12% Similarity=0.082 Sum_probs=177.2 Q ss_pred cccCCCccHHHHH---Hhh-----------cc--Ccccccccccccccc-------cccccc----cC---Ccc------ Q lcl|NC_019710. 8 IDLRTNNGWWARL---KSW-----------FV--GGRLVTPNQGSQTGP-------VSAHGY----LG---DSS------ 51 (424) Q Consensus 8 ~~~~~~~G~~~~~---~~~-----------~~--~~~~~~~~~~~~~~~-------~~~~~~----~~---~~~------ 51 (424) |+|- ||++.+ +.| .. ..+.+.|.....+.. ....+. .+ +.. T Consensus 1 ~~~~---~~~~~l~~~~~~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e~~~~~~~eL 77 (524) T protein:vir:98 1 MNFL---GFGNVLSFFKNFAREDEIELEQQLKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQFYSGQDPAIQNKEQL 77 (524) T ss_pred CCCc---chhhHHHHhhhhhhhhhhhHhhhhcCCcccccCCCCCCCceeecCCCCcceecceeeeeccccccccchHHHH Confidence 4442 333322 111 11 111222211100000 000000 00 000 Q ss_pred -ccHHHHhhhHHHHHHHHHHHHhhhhC-----ceeEeecccc-CccccccccchhHHhhccCCCCCCCHHHHHHHHHHHH Q lcl|NC_019710. 
52 -INDERILQISTVWRCVSLISTLTACL-----PLDVFETDQN-DNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQL 124 (424) Q Consensus 52 -~~~~~~~~~~~v~~~i~~ia~~ia~~-----~~~~~~~~~~-~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~ 124 (424) -..+..+.+|.|..||+.|.+.+.-+ |+.+--.+.+ +..-.........++|+ =-+....+ +.+++.| T Consensus 78 I~~YR~ma~~pEvd~Av~eIVneaIv~~~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~----~~~fR~W 152 (524) T protein:vir:98 78 INTYRGIMSYPEVENAVSEIIDDAIVNEQGKDIITMDLAKTNFSKAIQDKIVEEFDNVLN-IYDFDNMG----ARLFRDW 152 (524) T ss_pred HHHHHHHhhccchhhHHHhhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhh Confidence 11234566899999999999987532 2222111100 00000000111222222 11112222 4557788 Q ss_pred HHcCCeEEEEeeCCCCc--eeEEEEeccceEEEEE-------cCCc-------eEEEEEe-------------cCceEEe Q lcl|NC_019710. 125 CFYGNAYALVDRNSAGD--VISLLPLQSANMDVKL-------VGKK-------VVYRYQR-------------DSEYADF 175 (424) Q Consensus 125 l~~G~a~~~~~r~~~G~--~~~l~~l~p~~v~~~~-------~~~~-------~~~~~~~-------------~~~~~~~ 175 (424) ...|..|..++.+.+.. +.+|..|+|.+++..+ +++. -+|.|.. .+....+ T Consensus 153 YVDgRi~fhkiid~~~~kGI~ELr~lDPr~i~~vr~~~~~~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI 232 (524) T protein:vir:98 153 YVDSRIYFHKIMHKDESKGIRELRQLDPRCMELIRESITETLDGGVKVFRGYREFFVYSAPKAGYTYNGQIYQANQKIKI 232 (524) T ss_pred hhcceeEEEEEEcCCCCcceeeeeeeCCccceeeeeccccccccchhhccceeeeeeeccCCCccccccceecCCCceee Confidence 89999999988665543 8999999999997543 2221 1223321 2234668 Q ss_pred cHhHeeEecC--cCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC-HHHHHHHHHHHHHHhC Q lcl|NC_019710. 176 SQKEIFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT-EQQRSQVEENFKEIAG 252 (424) Q Consensus 176 ~~~evih~r~--~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~-~~~~~~~~~~~~~~~~ 252 (424) +.+-|+|... .+.++-. +|-|..+.+.+....-++...--+--.-+.-+-|...+-+..+ ..+.+-++....++.. 
T Consensus 233 ~~dAIvy~hSGL~d~~~~i-isyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kN 311 (524) T protein:vir:98 233 PRSAIVYAHSGLEDCSNNI-IGYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKN 311 (524) T ss_pred chhheeeeccCcccCCCCe-eeehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCc Confidence 8888988763 2222211 5778888888877777777766555555555666666655444 5556667777766641 Q ss_pred --------Cc--ccCcce-ec----------CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC-CCc Q lcl|NC_019710. 253 --------GP--VKKRLW-IL----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK-STS 310 (424) Q Consensus 253 --------~~--~ag~~~-~l----------~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~-~~~ 310 (424) |. +..+.+ .+ ..|.+++.|.-. +.+.-++-..+..+.+..+++||.+-|...+. -+. T Consensus 312 klvYDa~TGevrddrk~msMlEDyWLpRReGgrgTEItTLpgg-qnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~ 390 (524) T protein:vir:98 312 RVVYDARTGTVKNQQNNLSMTEDYWLMRRDGKAITEVSTLPGG-QNFSDMDDIKWFNRKLYEALRVPLSRMPRDDGGMQI 390 (524) T ss_pred eeEeeccCceeeccccccchhhhhcccccCCCCccceeecccc-CCcChHHHHHHHHHHHHHHhCCCceeccCCCCcccc Confidence 11 111222 22 135566666431 22323344567788999999999888864321 122 Q ss_pred ccccHH----H-HHHHHHHHHHHHHHHHHHHHHhhhccC-----hhhhc----cceeeecchhhhcc--C---HHHHHHH Q lcl|NC_019710. 311 WGSGIE----Q-QNLGFLQYTLQPYISRWENSIQRWLIP-----AKDVG----RIHAEHNLDGLLRG--D---SASRAAF 371 (424) Q Consensus 311 ~~~n~e----~-~~~~f~~~tl~P~~~~ie~~l~~~L~~-----~~~~~----~~~~~f~~~~~~~~--d---~~~~~~~ 371 (424) .. .+| + ...-|+..-=.-+...+.+-|..+|+. +.++. ...+.|..|..... + ...|... 
T Consensus 391 Gr-~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~ 469 (524) T protein:vir:98 391 GG-GGEITRDELKFSKFIRTLQIQFSPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNL 469 (524) T ss_pred cc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHH Confidence 11 112 1 111222222223444444555555443 33322 23444444433221 1 1234444 Q ss_pred HHHHHh--CCCcCHHHHHH-HhCCCCCC--CcCeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 372 MKAMGE--SGLRTINEMRR-TDNLPPLP--GGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 372 ~~~~~~--~g~~t~NE~R~-~lg~~p~~--ggd~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) ++.+-. +-+++.+=+|+ .|.+.-.+ .-++.. ..-.+.+-.++|.++=. T Consensus 470 l~~~dpyvGky~s~dyi~k~ILr~tDeei~~~~k~I-----~~E~k~~~~~~p~~e~~ 522 (524) T protein:vir:98 470 MSQVEGVVGKYVSHKYIMKEILRMSDEDIDEQAKLI-----EEESKEERFKNPEAEEE 522 (524) T ss_pred HHHhccccccccchHHHHHHHhccCHHHHHHHHHHH-----HHHHhCCCCcCCccccc Confidence 433311 12555555543 23332100 000000 00000000000000000 No 235 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=72.82 E-value=0.18 Score=24.69 Aligned_cols=402 Identities=10% Similarity=0.054 Sum_probs=175.3 Q ss_pred cCCCccHHHHHHh--------hcc--Cccccccccccccc--------ccccccccCCcc--------------ccHHHH Q lcl|NC_019710. 10 LRTNNGWWARLKS--------WFV--GGRLVTPNQGSQTG--------PVSAHGYLGDSS--------------INDERI 57 (424) Q Consensus 10 ~~~~~G~~~~~~~--------~~~--~~~~~~~~~~~~~~--------~~~~~~~~~~~~--------------~~~~~~ 57 (424) |-...-++..+.. .+. ..+.+.|.....+. +....+...+.. -..+.. 
T Consensus 1 ~~~~l~~~~~~~~~d~~~~~e~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~m 80 (521) T protein:vir:65 1 MFSRLKMLARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGL 80 (521) T ss_pred CccchhhhhhccCchhhHHHhhhccCCCcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHHHHHHHH Confidence 2222222222211 111 11222221111000 000001000111 112335 Q ss_pred hhhHHHHHHHHHHHHhhhhC-----ceeEeec-cccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeE Q lcl|NC_019710. 58 LQISTVWRCVSLISTLTACL-----PLDVFET-DQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAY 131 (424) Q Consensus 58 ~~~~~v~~~i~~ia~~ia~~-----~~~~~~~-~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~ 131 (424) +.+|.|..||+.|.+.+.-+ |+.+--. .+-...-.........++|+ =-|....+ +.+++.|...|..| T Consensus 81 a~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~----~~~fR~WYVDgRi~ 155 (521) T protein:vir:65 81 MNNHEVENAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLN-TIQFDRRG----QDMFRRWYVDSRIF 155 (521) T ss_pred hhccchhhHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhh----hHHHhhhhhcceeE Confidence 66899999999999987643 2322111 10010000001112222222 11112222 45577888999999 Q ss_pred EEEeeCCC--CceeEEEEeccceEEEEEcC-----Cc--------eEEEEEecC-------------ceEEecHhHeeEe Q lcl|NC_019710. 132 ALVDRNSA--GDVISLLPLQSANMDVKLVG-----KK--------VVYRYQRDS-------------EYADFSQKEIFHL 183 (424) Q Consensus 132 ~~~~r~~~--G~~~~l~~l~p~~v~~~~~~-----~~--------~~~~~~~~~-------------~~~~~~~~evih~ 183 (424) ..++.+.+ ..+.+|..|+|.+++..+.- .+ -+|.|..++ ....++.+-|... T Consensus 156 fhkiid~~pk~GI~ELr~lDPr~i~~vr~i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~ 235 (521) T protein:vir:65 156 FHKIIGKNPKDGIVELRQLDPRNLEYVREIITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYA 235 (521) T ss_pred EEEEEcCCccccceeeeeeCCcceeeeeeecccccCCcceecceeeeeeeecCCcceeccceeecCCcceeechhheeee Confidence 98885544 56899999999999854421 10 123332211 1233444444333 Q ss_pred cC--cCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC-HHHHHHHHHHHHHHhC-------- Q lcl|NC_019710. 
184 KG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT-EQQRSQVEENFKEIAG-------- 252 (424) Q Consensus 184 r~--~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~-~~~~~~~~~~~~~~~~-------- 252 (424) .. ...++-.=+|-|..+.+.+....-++...--+--.-+.-+-|...+-+..+ ..+.+-++....++.. T Consensus 236 hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~T 315 (521) T protein:vir:65 236 HSGLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDAST 315 (521) T ss_pred eccceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccc Confidence 21 223333446778888888887777777776655555555666666655444 5556667777766643 Q ss_pred Cc--ccCcce-ecC----------CCceeeecc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHH Q lcl|NC_019710. 253 GP--VKKRLW-ILE----------AGFSTSAIG--VTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ 317 (424) Q Consensus 253 ~~--~ag~~~-~l~----------~g~~~~~l~--~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~ 317 (424) |. +..+.+ .++ .|.+++.|. .+.-+| +-..+..+.+..+++||.+-++...++..+.....+ T Consensus 316 Gev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem---~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~E 392 (521) T protein:vir:65 316 GKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDI---DDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSE 392 (521) T ss_pred ccccccccccchhhhhcccccCCCCccceeecccCCCcChH---HHHHHHHHHHHHHhCCCceeccCCCCcceeccccch Confidence 11 111122 221 355666664 343344 445667889999999999887554433322111112 Q ss_pred HHH------HHHHHHHHHHHHHHHHHHhhhccC-----hhhhc----cceeeecchhhhcc--C---HHHHHHHHHHHH- Q lcl|NC_019710. 318 QNL------GFLQYTLQPYISRWENSIQRWLIP-----AKDVG----RIHAEHNLDGLLRG--D---SASRAAFMKAMG- 376 (424) Q Consensus 318 ~~~------~f~~~tl~P~~~~ie~~l~~~L~~-----~~~~~----~~~~~f~~~~~~~~--d---~~~~~~~~~~~~- 376 (424) ..+ -|+..-=.-+...+.+-|..+|+. +.++. ...+.|..|+.... 
+ ...|...++.+- T Consensus 393 ItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dp 472 (521) T protein:vir:65 393 ITRDELEFSKFIRTLQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITP 472 (521) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhh Confidence 111 222222222344444555555443 33322 23444444433221 1 123444443331 Q ss_pred -hCCCcCHHHHHH-HhCCCCCC--CcCeeeecccccchhhccccCCCc-cCCC Q lcl|NC_019710. 377 -ESGLRTINEMRR-TDNLPPLP--GGDVAMRQSQYVPITDLGTNKEPR-NNGA 424 (424) Q Consensus 377 -~~g~~t~NE~R~-~lg~~p~~--ggd~~~~~~n~~~~~~~~~~~~~~-~~g~ 424 (424) -+-.++.+=+|+ .|.+.-.+ .-++.+ ..-.+.+-.+++. +-+. T Consensus 473 yvGky~S~dyi~k~ILr~tDeei~~~~k~I-----~~E~~~~~~~~p~~~~~~ 520 (521) T protein:vir:65 473 YIGKYFSNQTVMRDILKYTDDQMDTEKKQI-----EEEANDPRFKQTPDEIED 520 (521) T ss_pred hhccccchHHHHHHHhccCHHHHHHHHHHH-----HHhhhCCCCCCCcccccC Confidence 112445555554 34443110 000000 0000000001111 1111 No 236 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=70.77 E-value=0.2 Score=24.36 Aligned_cols=393 Identities=10% Similarity=-0.004 Sum_probs=157.6 Q ss_pred CCCCCcccccCCC-------ccHHHHHHhhccCccccccc-ccccccccccccccCCccccHHHHhhhHHHHHHHHHHHH Q lcl|NC_019710. 1 MEEPKYTIDLRTN-------NGWWARLKSWFVGGRLVTPN-QGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLIST 72 (424) Q Consensus 1 ~~~~~~~~~~~~~-------~G~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~ 72 (424) |.. +.+. .--|..++....|....... ......+.. 
.....-..-..+.+++-+..+.....+.+ T Consensus 1 m~~------V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~-e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~ 73 (501) T protein:vir:95 1 MPN------VSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNA-EDQSKENKARYEAYLKRAVFYNVARRTLF 73 (501) T ss_pred CCC------CCCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCC-CCCcccchHHHHHHhhccccCchHHHHHH Confidence 321 4444 25566666666544321110 011111100 00000000012233444444444444455 Q ss_pred hhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCc----------- Q lcl|NC_019710. 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGD----------- 141 (424) Q Consensus 73 ~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~----------- 141 (424) ++.++.|+. .. .......+..++..-=-...+-.+|.+.++...+.+|-+++++.....+. T Consensus 74 ~l~G~vf~k---~p-----~~~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~ 145 (501) T protein:vir:95 74 GLVGQVFMR---DP-----VVKVPALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEA 145 (501) T ss_pred HHhhhhhcC---Cc-----ceeCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHh Confidence 555554431 10 00112334454543333456788999999999999999999997643221 Q ss_pred ----eeEEEEeccceEE----------------------EEEcCC-----------------c-eEEE-EEecCce---- Q lcl|NC_019710. 142 ----VISLLPLQSANMD----------------------VKLVGK-----------------K-VVYR-YQRDSEY---- 172 (424) Q Consensus 142 ----~~~l~~l~p~~v~----------------------~~~~~~-----------------~-~~~~-~~~~~~~---- 172 (424) | -+..+.|..|. ...++. + +.++ |.....+ T Consensus 146 ~~~rP-y~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~ 224 (501) T protein:vir:95 146 GRIRP-TLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKADG 224 (501) T ss_pred ccCCc-EEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCcccCc Confidence 1 13333332221 011110 0 0011 1111000 Q ss_pred EEecHh------------------HeeEec---CcCCCCccccchHHHHHHH-HHHHHHHHHHHHHHHhccCCCceeEEc Q lcl|NC_019710. 
173 ADFSQK------------------EIFHLK---GFGFTGLVGLSPIAFACKS-AGVAVAMEDQQRDFFANGAKSPQILST 230 (424) Q Consensus 173 ~~~~~~------------------evih~r---~~~~~~~~G~s~~~~~~~~-i~~~~~~~~~~~~~~~ng~~p~~vl~~ 230 (424) ..+... ..|=|- ..+.+...|.+|+..++.. +...+.... ....+...+.|-.+++- T Consensus 225 ~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~lA~lni~hy~~ssd-~~~~l~~~~~P~l~i~G 303 (501) T protein:vir:95 225 SKIPKGNYQQYVVYKPTDAQGKRLTEIPFMFIGSENNDSNPDNPNFYDLASLNMAHYRNSAD-YEESCYIVGQPTPVLIG 303 (501) T ss_pred ceecCCcccccceeeeeccCCCcCCeeeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhH-HHHHHHHcccceeeeeC Confidence 000000 011111 1112224567888776644 222223233 33445556677777653 Q ss_pred CCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCc Q lcl|NC_019710. 231 GEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTS 310 (424) Q Consensus 231 ~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~ 310 (424) ... + ..+...+ ....-|. ...+.++.|.++.=+..++.-+. .+..+-...++.. .| ..++.... ++. T Consensus 304 ~~~---~-~~~~~~~--~~i~~G~--~~~~~lP~~~~~~~ie~~~~~i~-~~~l~~l~~~m~~-~G--a~ll~~~~-~~~ 370 (501) T protein:vir:95 304 LTE---E-WVTNVLK--GSVNFGS--RGGIPLPVGADAKLLQASENTML-KEAMDTKERQMVA-LG--AKLVEQKE-VQR 370 (501) T ss_pred Ccc---c-ccccCCC--Cceeecc--cccccCCCCCceeEEecChhhHH-HHHHHHHHHHHHH-HH--HhhccCCc-cch Confidence 221 1 0000000 0111122 23567766655544444443332 2333333333322 23 23332221 111 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHhhhccC--hhh-h--ccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHH Q lcl|NC_019710. 311 WGSGIEQQNLGFLQYTLQPYISRWENSIQRWLIP--AKD-V--GRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINE 385 (424) Q Consensus 311 ~~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~--~~~-~--~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE 385 (424) + .........--...|.-++.++++.|+.-|-- ... . 
....++++.+-.........++.+.++++.|.++..+ T Consensus 371 T-a~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~w~g~~~~~~~v~i~~df~~~~~~~~~~~al~~~~~~G~is~~t 449 (501) T protein:vir:95 371 T-ATEAELEAASEGSTLSSATKNVSAAFEWALKWAARWVGQADSGVKFELNTDFDIARMTPDERRSLVEEWQKGAITFEE 449 (501) T ss_pred h-HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCceEEEEecccccccCCHHHHHHHHHHHhCCCCcHHH Confidence 1 11112222333455777788888877754321 111 1 1234555555444443344566677889999999999 Q ss_pred HHHHhCCCCCCCcC-----ee-e-ecccccchhhccccCCCccCCC Q lcl|NC_019710. 386 MRRTDNLPPLPGGD-----VA-M-RQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 386 ~R~~lg~~p~~ggd-----~~-~-~~~n~~~~~~~~~~~~~~~~g~ 424 (424) .++.+-.--+..-| +. . ..-+..+.+.........++|. T Consensus 450 ~~~~L~~~~v~~~~~~~e~e~i~~~~~~~~~~~~~~~~~~~~~gg~ 495 (501) T protein:vir:95 450 MRTGLRKAGVATEDDSKAKEKIAKDTAEAMALATPANVPGDGSGGD 495 (501) T ss_pred HHHHHHhCCCCChhHHHHHHHHHhhhcCcccccccCCCCCCCcccc Confidence 98776332221100 00 0 0001111111111111111111 No 237 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=69.17 E-value=0.23 Score=24.12 Aligned_cols=393 Identities=11% Similarity=0.092 Sum_probs=162.7 Q ss_pred CCCCC---cccccCC---------------CccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHH Q lcl|NC_019710. 1 MEEPK---YTIDLRT---------------NNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQIST 62 (424) Q Consensus 1 ~~~~~---~~~~~~~---------------~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 62 (424) |.+-. .+-+.+. +..-+.++..+..+...... .+......... . -+.++. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~r~~~~~~yy~g~~~i~~------~~~~~~~~~~~----~--ki~~n~ 68 (489) T protein:vir:99 1 MLQEDFEAIDYESKLWIDQLKNYISRFKAEQLERLKELKRYYLGDNNIKY------RPAKTDKYAAD----N--RIASDF 68 (489) T ss_pred CCccceeeeCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccc------ccccccccCCc----c--eeecch Confidence 11100 0000111 12233444444433221100 00000000000 0 022455 Q ss_pred HHHHHHHHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEee----CC Q lcl|NC_019710. 63 VWRCVSLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDR----NS 138 (424) Q Consensus 63 v~~~i~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r----~~ 138 (424) ..-+|+..+.-+-.-|+++--. +.. ....+..++. . | ....+...+..+++.+|.||.++.. +. T Consensus 69 ~~~iv~~~~~~l~g~~~~~~~~--d~~-----~~~~l~~~~~-~-n---~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~ 136 (489) T protein:vir:99 69 AKYITVFEQGYMLGVPVEYKNE--NKD-----LQAAIDLMSV-R-N---NEDYHNVKIKTDLSIYGRAYELLTVEKIDDK 136 (489) T ss_pred HHHHHHHHhhhhccCCceeecC--Chh-----HHHHHHHHHh-h-c---ChhHHHHHHHHHHhhCCeEEEEEeeccCcCC Confidence 6667777777666666654211 111 1122334443 2 2 2335678889999999999976643 33 Q ss_pred CCceeEEEEeccceEEEEEcCCc---eE-----EEEEecC-c----eEEecHhHeeEecCcC------------------ Q lcl|NC_019710. 139 AGDVISLLPLQSANMDVKLVGKK---VV-----YRYQRDS-E----YADFSQKEIFHLKGFG------------------ 187 (424) Q Consensus 139 ~G~~~~l~~l~p~~v~~~~~~~~---~~-----~~~~~~~-~----~~~~~~~evih~r~~~------------------ 187 (424) .|. ..+..++|.++.+..++.. .. |...... . ...+.++.+++++... T Consensus 137 ~~~-~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~ 215 (489) T protein:vir:99 137 KTE-VKLYQLPAEQTFVIYDDTYQRNSLMAVHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKG 215 (489) T ss_pred Ccc-eEEEEEcccceEEEEcCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEecCCCcccceecccccccCCc Confidence 343 3577788888877665322 11 1111110 0 1123344444432100 Q ss_pred ------CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHh-------CCc Q lcl|NC_019710. 
188 ------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIA-------GGP 254 (424) Q Consensus 188 ------~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~-------~~~ 254 (424) .+...|.|.+..+...++....+.....+.....+.|-.+++-. .....+.. .......... ... T Consensus 216 vPvv~~~n~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 293 (489) T protein:vir:99 216 VPVNEYANNEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGN-AYTGADEN-DYLDDGRLNPNGRLAISIGF 293 (489) T ss_pred eeEEEeecCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccC-Ccccccch-hhhhhccccccccccccccc Confidence 01224667676666666655555444444444445555444321 11111111 1111111111 111 Q ss_pred ccCcceecCCCc-------eeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHH----------H Q lcl|NC_019710. 255 VKKRLWILEAGF-------STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIE----------Q 317 (424) Q Consensus 255 ~ag~~~~l~~g~-------~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e----------~ 317 (424) ..++++.++.+. +.+.+.....+..+....+...+.|...-++|..-.... .++.+....+ + T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~ 372 (489) T protein:vir:99 294 KKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMKF-SGVQSGESMKYKLMASDNYRE 372 (489) T ss_pred ccceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcccccccc-cccchHHHHHHHHHHHHHHHH Confidence 223444444332 223333322333445566778888999888885322111 1222211111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhccChhh--hccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Q lcl|NC_019710. 318 QNLGFLQYTLQPYISRWENSIQRWLIPAKD--VGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPL 395 (424) Q Consensus 318 ~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~--~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~ 395 (424) .....+...+.-+++.|...+... -.... .....+.+.++.-+..|..+.++.+.++. |+++.-.+.++++.=.. 
T Consensus 373 ~k~~~~~~~l~~~~~li~~~~~~~-~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~--giis~et~~~~l~~v~~ 449 (489) T protein:vir:99 373 KQERLFKKGLMRRLRLAANIWAIK-GNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLY--GIVSDQTIFEILNTVTG 449 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhc-CCccccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhcCCCCc Confidence 112233334444444443333211 11111 01123445556667788888899888875 88998888877644110 Q ss_pred CC----c-------CeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 396 PG----G-------DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 396 ~g----g-------d~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) +. . +......+.....+..+++++.++.- T Consensus 450 ~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 450 VDAEAELKRLKEEADKKQSLPEPRLVGDASGQEEPTAEKP 489 (489) T ss_pred hhHHHHHHHHHHHHHHHhccccccccCCCCCCcCCCCCCC Confidence 00 0 00000001111111111111111111 No 238 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=68.58 E-value=0.24 Score=24.03 Aligned_cols=396 Identities=13% Similarity=0.072 Sum_probs=148.5 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCcccccccccc-cccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhC-- Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQGS-QTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACL-- 77 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~-- 77 (424) |++.|+-..-++-..-|+++++.- ......-.... .+-|...........-....... ++-..|++.+|+.+-+. 
T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R-~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~d-st~~~a~~~Laa~l~~~lt 78 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDR-APYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQ-AVGARGLNNLASKLMLALF 78 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHh-hHHHHHHHHHHHHhcccccCCCCCccccccccccc-ccHHHHHHHHHHHHHhhhc Confidence 999887655555445555544311 00000000000 01111000000000000011122 33444555665554432 Q ss_pred ---ceeEeeccccCcc------c---cc-----cccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC Q lcl|NC_019710. 78 ---PLDVFETDQNDNR------K---KV-----DLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG 140 (424) Q Consensus 78 ---~~~~~~~~~~~~~------~---~~-----~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G 140 (424) ||-=..-.+..-. . ++ .....+...|. +-| .+.-+..+..+++.+|||.+++..+..+ T Consensus 79 P~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~~L~~~G~a~ly~~e~~~~ 153 (536) T protein:vir:10 79 PMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIE-SNS----YRVTLFEALKQLVVAGNVLLYLPEPEGS 153 (536) T ss_pred CCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHH-hcC----cHHHHHHHHHHHHhHCcEeEEEeeCCCC Confidence 3311111110000 0 00 00122333332 333 4455667788999999999998766554 Q ss_pred ce--eEEEEeccceEEEEEcCCce-----------------------------------EE--EEEe--cCceE---Eec Q lcl|NC_019710. 141 DV--ISLLPLQSANMDVKLVGKKV-----------------------------------VY--RYQR--DSEYA---DFS 176 (424) Q Consensus 141 ~~--~~l~~l~p~~v~~~~~~~~~-----------------------------------~~--~~~~--~~~~~---~~~ 176 (424) .+ ...|||....|....+++.. +| .+.+ ++... .+. T Consensus 154 ~~~~~~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~~~e~~ 233 (536) T protein:vir:10 154 NYNPMKLYRLSSYVVQRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEASGEYLRYEEVE 233 (536) T ss_pred ceeeEEEEEcCeEEEeeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEEecCCCcEEEEEeec Confidence 43 45666644444322222110 00 0111 11100 111 Q ss_pred HhH--------------eeEecCcCCCC-ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHH Q lcl|NC_019710. 
177 QKE--------------IFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRS 241 (424) Q Consensus 177 ~~e--------------vih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~ 241 (424) ... .+.+|+...++ .||.||...++..+.....+.+.......-...|...+. +........ T Consensus 234 g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~-p~g~~~~~~-- 310 (536) T protein:vir:10 234 GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN-PAGITQPRR-- 310 (536) T ss_pred CccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccC-cccccchhh-- Confidence 111 13334333333 799999999999999998888888776666566554443 223322221 Q ss_pred HHHHHHHHHhCCcccCcce-ecCCCceeeeccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHH-- Q lcl|NC_019710. 242 QVEENFKEIAGGPVKKRLW-ILEAGFSTSAIGVTPQDAE-MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ-- 317 (424) Q Consensus 242 ~~~~~~~~~~~~~~ag~~~-~l~~g~~~~~l~~s~~d~~-~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~-- 317 (424) ...+.+ |.++ ...+.....++... .+++ ..+..+.....|..+|-+.. +... ++.. -+++| T Consensus 311 --------~~~~~~-g~~v~g~~~~v~~~~~~~~-~~~~~~~~~i~~~~~rI~~af~~~~--l~~~-~~~r--~TAtEV~ 375 (536) T protein:vir:10 311 --------LTKAQT-GDFVTGRPEDISFLQLEKQ-ADFTVAKAVSDAIEARLSFAFMLNS--AVQR-TGER--VTAEEIR 375 (536) T ss_pred --------hccCCC-cceecCCcccceeeecccc-ccchHHHHHHHHHHHHHHHHHhhhh--cccC-CCCC--ccHHHHH Confidence 111111 1111 12233334444432 3333 23455666777888885431 2111 1111 12332 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhh-------------ccChhhhccceeee--cchhhhcc-CHHHHHHHHHHHHhCC-- Q lcl|NC_019710. 318 QNLGFLQYTLQPYISRWENSIQRW-------------LIPAKDVGRIHAEH--NLDGLLRG-DSASRAAFMKAMGESG-- 379 (424) Q Consensus 318 ~~~~f~~~tl~P~~~~ie~~l~~~-------------L~~~~~~~~~~~~f--~~~~~~~~-d~~~~~~~~~~~~~~g-- 379 (424) .+..=....|.|....+.++|-.- ++++-......+.+ -+..+.+. 
+.+.....+..+.+.+ T Consensus 376 ~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~P~ 455 (536) T protein:vir:10 376 YVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAPM 455 (536) T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhhchh Confidence 222333444555555554444322 33322111122222 12222221 1122222222222111 Q ss_pred ----CcCHHHHH----HHhCCCCCCCcCeeeecccccchhh--------------ccc-c-----CCC------ccCCC Q lcl|NC_019710. 380 ----LRTINEMR----RTDNLPPLPGGDVAMRQSQYVPITD--------------LGT-N-----KEP------RNNGA 424 (424) Q Consensus 380 ----~~t~NE~R----~~lg~~p~~ggd~~~~~~n~~~~~~--------------~~~-~-----~~~------~~~g~ 424 (424) .+..+++- +.+|.+|.. .+..+-....+-. ++. . ..+ .++++ T Consensus 456 ~ld~~id~d~~~~~~a~~~Gv~p~~---~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~g 531 (536) T protein:vir:10 456 RDDPDINLAMIKLRIANAIGIDTSG---ILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAAADSVG 531 (536) T ss_pred hhcccCCHHHHHHHHHHHcCCCchh---hcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhHHhhhhccc Confidence 12223322 234553310 0000000000000 000 0 000 11111 No 239 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=64.50 E-value=0.3 Score=23.46 Aligned_cols=386 Identities=9% Similarity=0.031 Sum_probs=170.2 Q ss_pred cccCCC-ccHHHHHHh--------hcc--Cccccccccccc---cc-cc--ccccc-------cCCc-c---------cc Q lcl|NC_019710. 8 IDLRTN-NGWWARLKS--------WFV--GGRLVTPNQGSQ---TG-PV--SAHGY-------LGDS-S---------IN 53 (424) Q Consensus 8 ~~~~~~-~G~~~~~~~--------~~~--~~~~~~~~~~~~---~~-~~--~~~~~-------~~~~-~---------~~ 53 (424) |+ .+ .|+|+.+.. ... ..+.+.|..... .. .. ...++ .++. . -. 
T Consensus 1 m~--~~~L~~~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~ 78 (524) T protein:vir:72 1 MK--FNVLSLFAPWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDT 78 (524) T ss_pred CC--CchhhHhhccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHH Confidence 43 33 455443221 011 111222211000 00 00 00011 1110 0 11 Q ss_pred HHHHhhhHHHHHHHHHHHHhhhhC-----ceeEeeccccCccc-cccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHc Q lcl|NC_019710. 54 DERILQISTVWRCVSLISTLTACL-----PLDVFETDQNDNRK-KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFY 127 (424) Q Consensus 54 ~~~~~~~~~v~~~i~~ia~~ia~~-----~~~~~~~~~~~~~~-~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~ 127 (424) .+..+.+|.|..||+.|.+.+.-+ |+.+--.+.+-+.. .........++|+ = ++...--+.+++.|... T Consensus 79 YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~-l----l~F~~~~~~~fR~WYVD 153 (524) T protein:vir:72 79 YRNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFSDVLN-H----LSFQRKGSDHFRRWYVD 153 (524) T ss_pred HHHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHH-H----hccchhhhHHHhhheee Confidence 234566899999999999986643 23221111110000 0001112222222 1 11122224557778899 Q ss_pred CCeEEEEeeCCC---CceeEEEEeccceEEEEEc-----CCc--------eEEEEEecC-------------ceEEecHh Q lcl|NC_019710. 128 GNAYALVDRNSA---GDVISLLPLQSANMDVKLV-----GKK--------VVYRYQRDS-------------EYADFSQK 178 (424) Q Consensus 128 G~a~~~~~r~~~---G~~~~l~~l~p~~v~~~~~-----~~~--------~~~~~~~~~-------------~~~~~~~~ 178 (424) |..|..++-|.. ..+.+|..|+|.+++..+. .++ .+|.|..+. ....++.+ T Consensus 154 gRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~d 233 (524) T protein:vir:72 154 SRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKA 233 (524) T ss_pred eEEEEEEEEeCCCccccceeeeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchh Confidence 999988866433 3688999999999975321 111 123333221 23445555 Q ss_pred HeeEecC--cCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC-HHHHHHHHHHHHHHhCC-- Q lcl|NC_019710. 
179 EIFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT-EQQRSQVEENFKEIAGG-- 253 (424) Q Consensus 179 evih~r~--~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~-~~~~~~~~~~~~~~~~~-- 253 (424) -|.|... .+.++-.=+|-|..+.+.+....-++...--+--.-+.-+-|...+.+..+ ..+.+-++....++... T Consensus 234 AI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklv 313 (524) T protein:vir:72 234 AVVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVV 313 (524) T ss_pred heeeeeccceeCCCCceeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeE Confidence 5554441 223333446778888888777777777666555555555556665555443 45555566666555321 Q ss_pred --cccC------ccee-c----------CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCccccc Q lcl|NC_019710. 254 --PVKK------RLWI-L----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSG 314 (424) Q Consensus 254 --~~ag------~~~~-l----------~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n 314 (424) .+.| +.+. + ..|.+++.|.-. ..+.-++-..+..+.+..+++||.+-|.....+..+... T Consensus 314 YDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGg-qnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr 392 (524) T protein:vir:72 314 YDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGA-DNTGNMEDIRWFRQALYMALRVPLSRIPQDQQGGVMFDS 392 (524) T ss_pred EeCCCCeeccchhhhhhHhhhcccccCCCcccceeecccc-CCcChHHHHHHHHHHHHHHhCCchhhcCCCCCccccccc Confidence 0111 1111 1 124566665431 123233445677889999999999988432211122111 Q ss_pred HHHHHH------HHHHHHHHHHHHHHHHHHhhhccC-----hhhhc----cceeeecchhhhcc--C---HHHHHHHHHH Q lcl|NC_019710. 315 IEQQNL------GFLQYTLQPYISRWENSIQRWLIP-----AKDVG----RIHAEHNLDGLLRG--D---SASRAAFMKA 374 (424) Q Consensus 315 ~e~~~~------~f~~~tl~P~~~~ie~~l~~~L~~-----~~~~~----~~~~~f~~~~~~~~--d---~~~~~~~~~~ 374 (424) ..+..+ -|+..-=.-+...+.+-|..+|+. +.++. ...+.|..|+.... + ...|...++. 
T Consensus 393 ~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~ 472 (524) T protein:vir:72 393 GTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLTM 472 (524) T ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHH Confidence 111111 222222222334444455555442 23332 23444544433221 1 1233333333 Q ss_pred HHh--CCCcCHHHHHH-HhCCCC----------------------CCCcCee Q lcl|NC_019710. 375 MGE--SGLRTINEMRR-TDNLPP----------------------LPGGDVA 401 (424) Q Consensus 375 ~~~--~g~~t~NE~R~-~lg~~p----------------------~~ggd~~ 401 (424) +-. +-.++.+=+|+ .|.+.- .+.-+.+ T Consensus 473 ~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:72 473 AEPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEESKEARFQDPDQEQEDF 524 (524) T ss_pred hhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 311 11334444443 233321 1111222 No 240 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=61.97 E-value=0.34 Score=23.13 Aligned_cols=398 Identities=10% Similarity=0.030 Sum_probs=170.1 Q ss_pred cccCCC-ccHHHHHHh--------hcc--Cccccccccccc---cc-cc--ccccc-------cCCc-c---------cc Q lcl|NC_019710. 8 IDLRTN-NGWWARLKS--------WFV--GGRLVTPNQGSQ---TG-PV--SAHGY-------LGDS-S---------IN 53 (424) Q Consensus 8 ~~~~~~-~G~~~~~~~--------~~~--~~~~~~~~~~~~---~~-~~--~~~~~-------~~~~-~---------~~ 53 (424) |+ .+ .|+|+.+.. ... ..+.+.|..... .. .. ...++ .++. . -. 
T Consensus 1 m~--~~~L~~~~~w~~~de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eLI~~ 78 (524) T protein:vir:10 1 MK--FNVLSLFAPWAKMDERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTRELIDT 78 (524) T ss_pred CC--CchhhHhhccccCcchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHHHHH Confidence 43 33 455443221 011 111222211000 00 00 00011 1110 0 11 Q ss_pred HHHHhhhHHHHHHHHHHHHhhhhC-----ceeEeeccccCccc-cccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHc Q lcl|NC_019710. 54 DERILQISTVWRCVSLISTLTACL-----PLDVFETDQNDNRK-KVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFY 127 (424) Q Consensus 54 ~~~~~~~~~v~~~i~~ia~~ia~~-----~~~~~~~~~~~~~~-~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~ 127 (424) .+..+.+|.|..||+.|.+.+.-+ |+.+--.+.+-+.. .........++|+ = ++...--+.+++.|... T Consensus 79 YR~ma~~pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~-l----l~F~~~~~~~fR~WYVD 153 (524) T protein:vir:10 79 YRNLMNNYEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFNDVLN-H----LSFQRKGSDHFRRWYVD 153 (524) T ss_pred HHHHhhccchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHH-H----hccchhhhHHHhhheee Confidence 234566899999999999986643 23221111110000 0001112222222 1 11122224557778899 Q ss_pred CCeEEEEeeCCC---CceeEEEEeccceEEEEEc-----CCc--------eEEEEEecC-------------ceEEecHh Q lcl|NC_019710. 128 GNAYALVDRNSA---GDVISLLPLQSANMDVKLV-----GKK--------VVYRYQRDS-------------EYADFSQK 178 (424) Q Consensus 128 G~a~~~~~r~~~---G~~~~l~~l~p~~v~~~~~-----~~~--------~~~~~~~~~-------------~~~~~~~~ 178 (424) |..|..++-+.. ..+.+|..|+|.+++..+. .++ .+|.|..+. ....++.+ T Consensus 154 gRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~d 233 (524) T protein:vir:10 154 SRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREIITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKA 233 (524) T ss_pred eEEEEEEEeeCCCccccceeeeeeCCccceeeeeeccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchh Confidence 999988866533 3688999999999975321 111 123333221 23445555 Q ss_pred HeeEecC--cCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC-HHHHHHHHHHHHHHhCC-- Q lcl|NC_019710. 
179 EIFHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT-EQQRSQVEENFKEIAGG-- 253 (424) Q Consensus 179 evih~r~--~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~-~~~~~~~~~~~~~~~~~-- 253 (424) -|.|... .+.++-.=+|-|..+.+.+....-++...--+--.-+.-+-|...+.+..+ ..+.+-++....++... T Consensus 234 AI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklv 313 (524) T protein:vir:10 234 AIVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVV 313 (524) T ss_pred heeeeeccceeCCCCceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeE Confidence 5554441 223333446778888888777777777666555555555556665555443 45555566666555321 Q ss_pred --cccC------ccee-c----------CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCccccc Q lcl|NC_019710. 254 --PVKK------RLWI-L----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSG 314 (424) Q Consensus 254 --~~ag------~~~~-l----------~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n 314 (424) .+.| +.+. + ..|.+++.|.-. ..+.-++-..+..+.+..+++||.+-|.....+..+... T Consensus 314 YDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGg-qnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr 392 (524) T protein:vir:10 314 YDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGA-DNTGNMEDVRWFRQALYMALRVPLSRIPQDQQGGVMFDS 392 (524) T ss_pred EeCCCCeeccchhhhhhHhhhcccccCCCcccceeecccc-CCcChHHHHHHHHHHHHHHhCCchhhcCCCCCccccccc Confidence 0111 1111 1 124566665431 123233445677889999999999988432211122111 Q ss_pred HHHHHH------HHHHHHHHHHHHHHHHHHhhhccC-----hhhhc----cceeeecchhhhcc--C---HHHHHHHHHH Q lcl|NC_019710. 315 IEQQNL------GFLQYTLQPYISRWENSIQRWLIP-----AKDVG----RIHAEHNLDGLLRG--D---SASRAAFMKA 374 (424) Q Consensus 315 ~e~~~~------~f~~~tl~P~~~~ie~~l~~~L~~-----~~~~~----~~~~~f~~~~~~~~--d---~~~~~~~~~~ 374 (424) ..+..+ -|+..-=.-+...+.+-|..+|+. +.++. ...+.|..|+.... + ...|...++. 
T Consensus 393 ~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~ 472 (524) T protein:vir:10 393 GTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLTM 472 (524) T ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHH Confidence 111111 222222222334444455555442 23332 23444544433221 1 1233333333 Q ss_pred HHh--CCCcCHHHHHH-HhCCCCCC----------CcCeeeecccccchhhccccCCCccCC Q lcl|NC_019710. 375 MGE--SGLRTINEMRR-TDNLPPLP----------GGDVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 375 ~~~--~g~~t~NE~R~-~lg~~p~~----------ggd~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) +-. +-.++.+=+|+ .|.+.-.+ ...+...+.-..+ .++= T Consensus 473 ~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~----------~~~f 524 (524) T protein:vir:10 473 AEPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEESKEARFQDPDQE----------QEDF 524 (524) T ss_pred hhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchh----------hhcC Confidence 211 11334444443 23332100 0000000000000 0000 No 241 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=60.15 E-value=0.38 Score=22.90 Aligned_cols=371 Identities=9% Similarity=-0.017 Sum_probs=161.2 Q ss_pred cccCC----------CccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHhhhhC Q lcl|NC_019710. 8 IDLRT----------NNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTLTACL 77 (424) Q Consensus 8 ~~~~~----------~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~~ 77 (424) |.+.. +..-+.++..+..+........... 
..-.......+..-+.+....-+|+..+.-+-+- T Consensus 1 l~~~~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~------~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~ 74 (451) T protein:vir:10 1 MELEKIRAIISADAARRQEILQAKSYYYNKNDILKKGVVV------QNRDENPLRNADNRISHNFHEILVDEKASYMFTY 74 (451) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccc------cccccccccccccccccchHHHHHHhhhhheecc Confidence 22211 2233333444443322110000000 0000000000001122345566777777777677 Q ss_pred ceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCC-------ceeEEEEecc Q lcl|NC_019710. 78 PLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAG-------DVISLLPLQS 150 (424) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G-------~~~~l~~l~p 150 (424) |+..- ...+.. . ..+..-+. . | ........+..+++.+|.||.++-++.+. ....+..++| T Consensus 75 p~~~~-~~~~~~---~---~~~~~~~~-~-n---~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p 142 (451) T protein:vir:10 75 PVLFD-IDNNKE---L---NEKVTDVL-G-N---EFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNT 142 (451) T ss_pred cceee-cCCcHH---H---HHHHHHHh-c-c---CHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcc Confidence 76542 111111 0 01111122 1 2 34566677889999999999988777651 1334677788 Q ss_pred ceEEEEEcCCc---e----EEEE-Eec--C--------ceEEecHhHeeEecCcC------------------------- Q lcl|NC_019710. 151 ANMDVKLVGKK---V----VYRY-QRD--S--------EYADFSQKEIFHLKGFG------------------------- 187 (424) Q Consensus 151 ~~v~~~~~~~~---~----~~~~-~~~--~--------~~~~~~~~evih~r~~~------------------------- 187 (424) ..+.+..++.. . .|++ ..+ + ....+.++.+.+++... 
T Consensus 143 ~~~~~vydd~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 222 (451) T protein:vir:10 143 EEIIPIYRNGIERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVPFVEF 222 (451) T ss_pred cceEEEEcCCCCCceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeeeEEEe Confidence 88877665321 1 1111 010 0 01123444444443100 Q ss_pred CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecC---- Q lcl|NC_019710. 188 FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE---- 263 (424) Q Consensus 188 ~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~---- 263 (424) .+...|.|-+..+...++....+..-..+.+...+.|-.+++--.....++....+ . ..+++.++ T Consensus 223 ~nn~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~----~-------~~~~i~~~~~~~ 291 (451) T protein:vir:10 223 SNNIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFLKEL----K-------RYKTIKTETDSE 291 (451) T ss_pred ccCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhHHHH----h-------hCCeEEecCcCC Confidence 01224677777777777776666666666666666676665432221112211111 1 11233332 Q ss_pred ---CCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHH----------HHHHHHHHHHHHH Q lcl|NC_019710. 264 ---AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ----------QNLGFLQYTLQPY 330 (424) Q Consensus 264 ---~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~----------~~~~f~~~tl~P~ 330 (424) ++++|..-.. ....+.+..+...+.|...-++|.. ....-++.+.....- .....+...+.-. 
T Consensus 292 ~~~~~~~~l~~~~--~~~~~~~~~~~l~~~I~~~s~~p~~--~~~~~gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~ 367 (451) T protein:vir:10 292 GDSGGLKTMQIEI--PTEARKIILEILKKQIYESGQGLQQ--DTENFGNASGVALKFFYRKLELKSGLLETEFRTSFDKL 367 (451) T ss_pred ccCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCcccc--cccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2344443333 2344566788888899999999842 221112222111110 1111122222222 Q ss_pred HHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccc--- Q lcl|NC_019710. 331 ISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLPPLPGGDVAMRQSQY--- 407 (424) Q Consensus 331 ~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~ggd~~~~~~n~--- 407 (424) ++.|...+ ...+. ..+.+.+..-+..|..+.++.+.++. |+++.--+.+++++-..+. +...-..- T Consensus 368 ~~li~~~~-----~~~d~--~~i~i~f~~~~p~n~~e~~~~~~kl~--g~iS~et~~~~~p~v~d~~--~e~~~~~ee~~ 436 (451) T protein:vir:10 368 IKAILYFL-----GVTDY--KKIQQTYTRNMMSNDLEDADIATKSV--GIIPTKIILRHHPWVDDVE--EAEKLYLEEKK 436 (451) T ss_pred HHHHHHHh-----CCCCc--cceeEEecCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCHH--HHHHHHHHHHH Confidence 22222222 11122 23344445667788888999999885 7899888888776632211 10000000 Q ss_pred cchhhccccCCCccC Q lcl|NC_019710. 408 VPITDLGTNKEPRNN 422 (424) Q Consensus 408 ~~~~~~~~~~~~~~~ 422 (424) .......+.-.+-++ T Consensus 437 ~~~~~~~~~~~~~~~ 451 (451) T protein:vir:10 437 IQASKVSDDYNNFTE 451 (451) T ss_pred HHHHHHHhhcCCCCC Confidence 000000000011111 No 242 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=43.79 E-value=0.83 Score=21.02 Aligned_cols=399 Identities=9% Similarity=0.053 Sum_probs=171.9 Q ss_pred cccCCC--ccHHHHHHh-----hcc--Ccccccccccc-----cccccccc-cc-------cC-Ccc---------ccHH Q lcl|NC_019710. 
8 IDLRTN--NGWWARLKS-----WFV--GGRLVTPNQGS-----QTGPVSAH-GY-------LG-DSS---------INDE 55 (424) Q Consensus 8 ~~~~~~--~G~~~~~~~-----~~~--~~~~~~~~~~~-----~~~~~~~~-~~-------~~-~~~---------~~~~ 55 (424) |++.-. -|||.+.-. ... ..+.+.|.... ..+..... ++ .+ ... -..+ T Consensus 1 m~f~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR 80 (523) T protein:vir:68 1 MKFNILSLFAPWAKMDERDYKDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTRELIDTYR 80 (523) T ss_pred CCCchhhhhhhhhhhhhhhhhhhhhccCCCccccCCCCcceeeeccccccccccchhhhhhhhccccccchHHHHHHHHH Confidence 444211 255544322 111 12222222111 11100000 00 11 111 1123 Q ss_pred HHhhhHHHHHHHHHHHHhhhhCc-----eeEeeccc-cCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCC Q lcl|NC_019710. 56 RILQISTVWRCVSLISTLTACLP-----LDVFETDQ-NDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGN 129 (424) Q Consensus 56 ~~~~~~~v~~~i~~ia~~ia~~~-----~~~~~~~~-~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~ 129 (424) ..+.+|.|..||+.|.+.+.-+. +.+-=.+. -...-.........++|+ =-+....+ +.+++.|...|. T Consensus 81 ~ma~~pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~-ll~F~~~~----~~~fR~WYVDgR 155 (523) T protein:vir:68 81 NLMTNYEVDNAVSEIVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLN-HLSFQRKG----SDHFRRWYVDSR 155 (523) T ss_pred HHhhccchhhHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHH-Hhccchhh----hHHHHhheeeeE Confidence 45668999999999999876432 22211110 000000000112222222 11112222 455777889999 Q ss_pred eEEEEeeCCC---CceeEEEEeccceEEEEE-----cCCc--------eEEEEEec-------------CceEEecHhHe Q lcl|NC_019710. 130 AYALVDRNSA---GDVISLLPLQSANMDVKL-----VGKK--------VVYRYQRD-------------SEYADFSQKEI 180 (424) Q Consensus 130 a~~~~~r~~~---G~~~~l~~l~p~~v~~~~-----~~~~--------~~~~~~~~-------------~~~~~~~~~ev 180 (424) .|..++-+.. ..+.+|..|+|.+|+..+ +..+ .+|.|... 
+....++.+-| T Consensus 156 i~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i~~~~~~g~~vi~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI 235 (523) T protein:vir:68 156 IFFHKIIDPKRPKEGIKELRRLDPRQVQYVREVITTTEAGVKIVKGYKEYFIYDTSHESYACDGRIYEAGTKIKIPKAAI 235 (523) T ss_pred EEEEEEeeCCCccccceeeeeeCCcceeEEEeecCCCCcchhhhhhhhhheeeccccccccccccccCCCcceecchhhe Confidence 9988866533 368899999999997532 1111 12233211 12344555555 Q ss_pred eEecC--cCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC-HHHHHHHHHHHHHHhCC---- Q lcl|NC_019710. 181 FHLKG--FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT-EQQRSQVEENFKEIAGG---- 253 (424) Q Consensus 181 ih~r~--~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~-~~~~~~~~~~~~~~~~~---- 253 (424) .|... .+.++-.=+|-|..+.+.+....-++...--+--.-+.-+-|...+.+..+ ..+.+-++....++... T Consensus 236 ~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNKlvYD 315 (523) T protein:vir:68 236 VYAHSGLVDCCGKNIIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKNRIAYD 315 (523) T ss_pred eeeeccceeCCCCceeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhhcceeEEe Confidence 55442 233333446778888888877777777666555555555566665555443 45555566665555321 Q ss_pred ------cccCccee-c----------CCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC-CCcccccH Q lcl|NC_019710. 254 ------PVKKRLWI-L----------EAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEK-STSWGSGI 315 (424) Q Consensus 254 ------~~ag~~~~-l----------~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~-~~~~~~n~ 315 (424) .+..+.+. + ..|.+++.|.-. ..+.-++-..+..+.+..+++||.+-|....+ -+... .. 
T Consensus 316 a~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGg-qnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr-~~ 393 (523) T protein:vir:68 316 ATTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGA-DNTGNMEDVRWFRNALYMALRIPITRIPSDQGGIQFDA-GT 393 (523) T ss_pred ccCCeeccchhhhhhHhhhcccccCCCcccceeecccc-CCcChHHHHHHHHHHHHHHhCCcceeecCCCcceeccc-cc Confidence 01111221 1 124566665431 22323344567788999999999988854321 12111 11 Q ss_pred HH-----HHHHHHHHHHHHHHHHHHHHHhhhccC-----hhhhc----cceeeecchhhhcc--C---HHHHHHHHHHHH Q lcl|NC_019710. 316 EQ-----QNLGFLQYTLQPYISRWENSIQRWLIP-----AKDVG----RIHAEHNLDGLLRG--D---SASRAAFMKAMG 376 (424) Q Consensus 316 e~-----~~~~f~~~tl~P~~~~ie~~l~~~L~~-----~~~~~----~~~~~f~~~~~~~~--d---~~~~~~~~~~~~ 376 (424) |= ...-|+..-=.-+...+.+-|..+|+. +.++. ...+.|..|+.... + ...|...++.+- T Consensus 394 EItRDEikF~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~d 473 (523) T protein:vir:68 394 SITRDELSFGKFIRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMAE 473 (523) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhh Confidence 21 111222222222334444455555442 33332 23444544433221 1 123333333331 Q ss_pred h--CCCcCHHHHHH-HhCCCCCC----------CcCeeeecccccchhhccccCCCccCC Q lcl|NC_019710. 377 E--SGLRTINEMRR-TDNLPPLP----------GGDVAMRQSQYVPITDLGTNKEPRNNG 423 (424) Q Consensus 377 ~--~g~~t~NE~R~-~lg~~p~~----------ggd~~~~~~n~~~~~~~~~~~~~~~~g 423 (424) . 
+-.++.+=+|+ .|.+.-.+ ...+...+.-..+ .++= T Consensus 474 pyvGky~s~~yi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~e----------~~~f 523 (523) T protein:vir:68 474 PFIGKYISHRTAMKDILQMSDEEIEQEAKQIEEESKEARFQDPDQE----------QEDF 523 (523) T ss_pred hhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCchh----------hhcC Confidence 1 11334444443 23332100 0000000000000 0000 No 243 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=38.42 E-value=1.1 Score=20.43 Aligned_cols=387 Identities=12% Similarity=0.050 Sum_probs=144.7 Q ss_pred CCCCCc-ccccCCCccHHHHHHhhccCcccccccccc-cccccccccccCCccccH-HHHhhhHHHHHHHHHHHHhhhhC Q lcl|NC_019710. 1 MEEPKY-TIDLRTNNGWWARLKSWFVGGRLVTPNQGS-QTGPVSAHGYLGDSSIND-ERILQISTVWRCVSLISTLTACL 77 (424) Q Consensus 1 ~~~~~~-~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~-~~~~~~~~v~~~i~~ia~~ia~~ 77 (424) |+|-+. ..--++-...|+++++.- ......-.... .+-|. .+...+...-+. ...+ .++-..|++.+|+.+-+. T Consensus 1 ~~~~~~~~~~~~~~~~r~~~l~~~R-~~~e~~w~e~~~y~lP~-~~~~~~~~~~~~~~~~~-dst~~~a~~~Laa~l~~~ 77 (543) T protein:vir:88 1 MAETKREGLAEEGAKAVYERLKNDR-VPYETRAENCAKVTIPS-LFPKDSDNSSTDYTTPW-QAVGARGLNNLSAKVMLA 77 (543) T ss_pred CcccccCcchHHHHHHHHHHHHHHH-hHHHHHHHHHHHHhccc-cCCCCCCcccccccccc-cchHHHHHHHHHHHHHHh Confidence 887432 111111122233332200 00000000000 00010 000000000000 0111 233445666666655432 Q ss_pred -----ceeEeeccccC------cc---ccccc-----cchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCC Q lcl|NC_019710. 78 -----PLDVFETDQND------NR---KKVDL-----SNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNS 138 (424) Q Consensus 78 -----~~~~~~~~~~~------~~---~~~~~-----~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~ 138 (424) ||-=..-.+.. .. .++.. ...+...|. +-| .+.-+..+..+++.+|||.+++..+. 
T Consensus 78 ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~-~sn----f~~~~~~~~~~L~~~G~a~ly~~~~~ 152 (543) T protein:vir:88 78 LFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYME-ANS----YRVTLFELIRQLALAGTALIYLPPPD 152 (543) T ss_pred hcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHH-hcC----cHHHHHHHHHHHHhhCceeeeeccCc Confidence 22111111000 00 00000 112222232 333 44556677888999999998876544 Q ss_pred CC----ceeEEEEeccceEEEEEcCCc----------------------------------eEEE--EEe-cC-ce---- Q lcl|NC_019710. 139 AG----DVISLLPLQSANMDVKLVGKK----------------------------------VVYR--YQR-DS-EY---- 172 (424) Q Consensus 139 ~G----~~~~l~~l~p~~v~~~~~~~~----------------------------------~~~~--~~~-~~-~~---- 172 (424) .. .+...|||....|.....++. .+|. +.. ++ .. T Consensus 153 ~~~~~~~~~~~~pl~~y~v~~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~~~V~pr~~~~~~~~~~ 232 (543) T protein:vir:88 153 ASSNSYNPMKLYTLHNHVVQRDAFGNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVYTHIYIDDESGDFLSYQ 232 (543) T ss_pred cccceecceEEeEcceEEEeeCCCCCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEEEEEEEeecCCCcccccc Confidence 21 234456665444332222211 0111 111 10 00 Q ss_pred ----EEec-------HhH--eeEecCcCCC-CccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHH Q lcl|NC_019710. 173 ----ADFS-------QKE--IFHLKGFGFT-GLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQ 238 (424) Q Consensus 173 ----~~~~-------~~e--vih~r~~~~~-~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~ 238 (424) ..+. .++ .+.+|+...+ ..||.||...++..+...+.+.+...........|..++... ...... T Consensus 233 ~~~~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~-g~~~~~ 311 (543) T protein:vir:88 233 EIEGVEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVNPN-GITQVR 311 (543) T ss_pred cccCeeeecCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccc-cccchh Confidence 0110 111 2333433333 479999999999999999999999998888888888665433 222222 Q ss_pred HHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHH Q lcl|NC_019710. 
239 QRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAE-MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ 317 (424) Q Consensus 239 ~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~-~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~ 317 (424) .. .-++.+.. +.-..++....++...+ +++ ..+..+....+|..+|-+.. +...+ +.. -+++| T Consensus 312 ~~---------~~~~~g~~-v~g~~~~v~~~~~~~~~-~~~~~~~~i~~~~~rI~~af~~~~--~~~~~-~~r--~TAtE 375 (543) T protein:vir:88 312 RL---------VKAQTGDF-VAGRKADIEFLQLEKTA-DFTVAKSVADAIEARLSYVFMLNS--AVQRS-GER--VTAEE 375 (543) T ss_pred hc---------ccCCCcee-ecCCCCcceeeeccccc-chhHHHHHHHHHHHHHHHHHhhhh--hccCC-CCc--ccHHH Confidence 10 11111110 11223444445555432 333 34455666777888885542 22121 211 13332 Q ss_pred --HHHHHHHHHHHHHHHHHHHHHhhhccChhh----hccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhC Q lcl|NC_019710. 318 --QNLGFLQYTLQPYISRWENSIQRWLIPAKD----VGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDN 391 (424) Q Consensus 318 --~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~----~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg 391 (424) .+..=....|.|....+.++|-.-|+...- +.+.-.... +..+..+...-.+.+.+..+ .+.+...++ T Consensus 376 V~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p-~~~v~~~~vs~l~~l~r~~~-----~~~l~~~~~ 449 (543) T protein:vir:88 376 IRYVASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPNLP-QEAVEPTVTTGAEALGRGQD-----LDKLTQFLN 449 (543) T ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCc-hhceeeeEEecHHHHHHHHH-----HHHHHHHHH Confidence 334556667778887777776533332210 000000000 00001110111111111111 111111111 Q ss_pred C-CCC--CCcCeeeecccccchhhcccc---CCCccCCC Q lcl|NC_019710. 392 L-PPL--PGGDVAMRQSQYVPITDLGTN---KEPRNNGA 424 (424) Q Consensus 392 ~-~p~--~ggd~~~~~~n~~~~~~~~~~---~~~~~~g~ 424 (424) . ..+ |+- . ..+-.+.+.+. 
.-+-+-.+ T Consensus 450 ~v~~~~~p~v---l---d~id~d~~~~~~a~~~Gv~~~~ 482 (543) T protein:vir:88 450 AVATVSQLNG---D---PDLNVNNIKLRLANAIGIDTAG 482 (543) T ss_pred HHHhccchhh---h---ccCCHHHHHHHHHHHhCCChhh Confidence 1 011 110 0 01111111110 00111111 No 244 >protein:vir:105154 Length: 525 # NCBI annotation: conserved phage-related protein # Family: family:all:6660 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398597;genbank:gi:80159853;genbank:GeneID:3772992 Probab=32.77 E-value=1.4 Score=19.78 Aligned_cols=399 Identities=11% Similarity=0.049 Sum_probs=160.5 Q ss_pred CCCCCcccccCCC---------------------------ccHHHHHHhhccCcccccccccccccccccccccCCcccc Q lcl|NC_019710. 1 MEEPKYTIDLRTN---------------------------NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSIN 53 (424) Q Consensus 1 ~~~~~~~~~~~~~---------------------------~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 53 (424) -..+.+|++.+.= .||+..+ +..+...+..-...-.+++-+.-.-...++ T Consensus 7 ~~~~~~t~~k~~~~~e~~~~~~n~~~~~y~ty~~~~~~f~~gfv~~~---~~ng~i~~v~~~~l~~~f~npd~~~~~i~~ 83 (525) T protein:vir:10 7 SKNKSTTIEKQSLQIEQLQEHINELERQYNTYDDVVDAFIDGFVMDL---CNNGKIKTVNLDTLQLWFNNPDKYINNIVN 83 (525) T ss_pred CcccccchhhhhhhHHHHHHHHhhhhhhcchhhhHHHHHHHHHHHHh---hcCCceeeeeHHHHHhhhcChHHHHHHHHH Confidence 1223334333221 2333322 222221111111110000000000000000 Q ss_pred HHH--HhhhHHHHHHHHHHHHhhhhCceeE--eeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCC Q lcl|NC_019710. 54 DER--ILQISTVWRCVSLISTLTACLPLDV--FETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGN 129 (424) Q Consensus 54 ~~~--~~~~~~v~~~i~~ia~~ia~~~~~~--~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~ 129 (424) -.+ |.....|+...++|- ++..+.+++ ..+.++-+ . .+ .+|+..-..-.-..++-+.+..++...|. 
T Consensus 84 l~~y~yi~~~~v~ql~~li~-~lp~l~y~i~~~~~~k~~~-----~--~~-s~~n~~l~k~i~hk~ltrdll~q~a~~gt 154 (525) T protein:vir:10 84 LLTYYYIIDGNVFQLYDLIF-SLPPLDYQIKVLKRDKDYK-----E--DL-STINLYLEKKIQHKQLTRDLLVQLAHSGT 154 (525) T ss_pred HHHHhhhhcchHHHHHHHHH-hcCCcceeehhhhhccchh-----h--HH-HHHHHHHHHhHHHHHHHHHHHHHhhccCc Confidence 000 111122233233222 222233333 22222211 1 11 12222111112233344444455555554 Q ss_pred eEEEEeeCCCCcee-----EEEEec------cceEEEE--------EcCC-----------------ceEEEEEecC--- Q lcl|NC_019710. 130 AYALVDRNSAGDVI-----SLLPLQ------SANMDVK--------LVGK-----------------KVVYRYQRDS--- 170 (424) Q Consensus 130 a~~~~~r~~~G~~~-----~l~~l~------p~~v~~~--------~~~~-----------------~~~~~~~~~~--- 170 (424) -.-....+.. .|. ++-++- ...|-+. .+.. ..+-.|...+ T Consensus 155 lig~wlg~~~-~py~~vf~~~kyvfp~~r~~g~~v~vid~~~f~~~~~~~r~~~~~~lsp~i~~~~y~~~~~~~~~~~~~ 233 (525) T protein:vir:10 155 LIGTWLGSKR-EPYFNVFNNLKYVFPYGRAKGKMVAVIDLQWFDEMSELERKLTFENLSPLITENKYKKWKEYNGENEDA 233 (525) T ss_pred eeEeeecCCC-CcchhhhhhhhhhccccccCCceEEEEehHHhhhhhHHHHHHHHHhhchhhhhhhhhHHhhcccccchh Confidence 2211000000 000 111110 1111110 0000 0000111111 Q ss_pred -ceEEecHhHeeEecCcC--CCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCC-----HHHHHH Q lcl|NC_019710. 171 -EYADFSQKEIFHLKGFG--FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLT-----EQQRSQ 242 (424) Q Consensus 171 -~~~~~~~~evih~r~~~--~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~-----~~~~~~ 242 (424) ....+|-+.++|+|... 
.+..-|.|-+......|..-....+.-.+..+.-..|-.+++...+..+ +...++ T Consensus 234 ~r~i~LP~e~t~~lr~~tl~rnqrlG~s~vtp~l~dI~hk~klrd~EqsIA~kii~a~avLk~gg~~gn~mk~p~~~kqk 313 (525) T protein:vir:10 234 LRYIMLPISKTLVARIHTLSRNQRLGIPYGTQTLFDIQHKQKLRDLEQSIADKIIKAMAVLKFRGKDDNDSKVKESAKRK 313 (525) T ss_pred heeeecccceeEEeeecccccCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHhhhhheeeeeccccCccccCchHHHHH Confidence 12467889999999643 4555689988888888888888888888888888888889987654322 222222 Q ss_pred HHHHHHHHh-C-CcccCcceec--CCCceee--ec-----cCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcc Q lcl|NC_019710. 243 VEENFKEIA-G-GPVKKRLWIL--EAGFSTS--AI-----GVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSW 311 (424) Q Consensus 243 ~~~~~~~~~-~-~~~ag~~~~l--~~g~~~~--~l-----~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~ 311 (424) +-+-.+.+. . -+...++.++ |.=.+++ ++ +..++. .+...++|-.|+|++.+++++.. ++++ T Consensus 314 il~gVk~aleK~~kdK~Gi~vi~~Pdfa~~efp~ik~~~~glDg~K------~d~I~~DI~~A~GlS~sL~nGdg-gNyA 386 (525) T protein:vir:10 314 VLAGVKRALEKGVKDKNGIACIAMPDFATFEFPEIKNGDKTLDPKK------YDSIDNDITNATGISQVLTNGTK-GNYA 386 (525) T ss_pred HHHHHHHHHhcccccccCeEEEeccceeecccccccCcccCCCchh------hhhhhhhhhhhhccceeeecCCC-Ccee Confidence 222222221 1 1111234442 3322222 22 233332 23456789999999999998754 4443 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHhhhccChh--hhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Q lcl|NC_019710. 312 GSGIEQQNLGFLQYTLQPYISRWENSIQRWLIPAK--DVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRT 389 (424) Q Consensus 312 ~~n~e~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~--~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~ 389 (424) ++.-....||. -+.=+++.||+. -++|+.-. +..+..+.|+.+.-...+.+.+.+.+-++...|+.. --+-.. 
T Consensus 387 --taslnld~fyk-kigVm~e~Iee~-y~kL~d~Vl~~~k~~nyifnydkd~pi~~kkk~d~LIkL~d~g~s~-k~vldl 461 (525) T protein:vir:10 387 --SAKLNLDVFYK-KIGVMLEIIEEI-YNQLIDIILGEEKGCNYIFQYNKDTPIEREKKLDTLIKLEAQGYSA-KYVLDI 461 (525) T ss_pred --eeeeeHHHHHH-HHHHHHHHHHHH-HHHHHhhhcCcccCcceEEecCCCchhhhhhhhhhhhhhhccchhh-hhhhhh Confidence 33333345554 455566777643 33554321 122344556666666667777777777777777643 222223 Q ss_pred hCCC--CC-CC---------c-Ceeeecccccchhh-----ccc---cCCCccCCC Q lcl|NC_019710. 390 DNLP--PL-PG---------G-DVAMRQSQYVPITD-----LGT---NKEPRNNGA 424 (424) Q Consensus 390 lg~~--p~-~g---------g-d~~~~~~n~~~~~~-----~~~---~~~~~~~g~ 424 (424) .|.. |. +. . ++.+.|.+...++. ++. ..++.++++ T Consensus 462 ~gis~e~y~E~s~yEtE~lkl~EKi~pp~~~~v~SGk~~n~iG~P~~dd~~~~dat 517 (525) T protein:vir:10 462 LGISSEEYFEESIYEIEKLKLREKIMPPLNTNVLSGKDGNDIGSPKLDDSDSSDAT 517 (525) T ss_pred hccCcchHHHHHHHHHHHHHHhhhccccccceeeeccccccccCCccCCCcchhhh Confidence 3332 11 10 1 12222222221111 111 112222222 No 245 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=29.36 E-value=1.7 Score=19.37 Aligned_cols=383 Identities=14% Similarity=0.049 Sum_probs=146.7 Q ss_pred CCCCCccc-ccCCCccHHHHHHhhccCcccccccccc-ccccccc--ccccCCccccHHHHhhhHHHHHHHHHHHHhhhh Q lcl|NC_019710. 1 MEEPKYTI-DLRTNNGWWARLKSWFVGGRLVTPNQGS-QTGPVSA--HGYLGDSSINDERILQISTVWRCVSLISTLTAC 76 (424) Q Consensus 1 ~~~~~~~~-~~~~~~G~~~~~~~~~~~~~~~~~~~~~-~~~~~~~--~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~ 76 (424) |++-+.++ .-++-..-|+++++.- ......-.... .+-|... .+...+... .... 
.++-..|++.+|+.+-+ T Consensus 1 m~~~~~~~~~~~~~k~r~~~l~~~R-~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~--~~~~-dst~~~a~~~Laa~l~~ 76 (535) T protein:vir:15 1 MADSKRTGLGEDGAKATYDRLTNDR-RAYETRAENCAQYTIPSLFPKESDNESTDY--TTPW-QAVGARGLNNLASKLML 76 (535) T ss_pred CCccchhccchHHHHHHHHHHHHHh-hHHHHHHHHHHHHhcccccCCCCCcccccc--cccc-cccHHHHHHHHHHHHHH Confidence 88766432 2222233444444311 00000000000 0001000 000000000 1112 23344566666665443 Q ss_pred C-----ceeEeecccc------Cccc---ccc-----ccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeC Q lcl|NC_019710. 77 L-----PLDVFETDQN------DNRK---KVD-----LSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRN 137 (424) Q Consensus 77 ~-----~~~~~~~~~~------~~~~---~~~-----~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~ 137 (424) . ||-=..-.+. +... ++. ....+...|. +-| .+.-+..+..+++.+|||.+++..+ T Consensus 77 ~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~sn----f~~~~~~~~~~L~~~G~a~l~~~~~ 151 (535) T protein:vir:15 77 ALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIE-SNS----YRVTLFECLKQLIVAGNALLYLPEP 151 (535) T ss_pred hhcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHH-hcC----cHHHHHHHHHHHHhhCceeEEeecC Confidence 2 2211111100 0000 000 0122222232 333 4556677788999999998887655 Q ss_pred CCC-ceeEEEEeccceEEEEEcCCc-----------------------------------eEEE--EEe--cCceEEe-- Q lcl|NC_019710. 138 SAG-DVISLLPLQSANMDVKLVGKK-----------------------------------VVYR--YQR--DSEYADF-- 175 (424) Q Consensus 138 ~~G-~~~~l~~l~p~~v~~~~~~~~-----------------------------------~~~~--~~~--~~~~~~~-- 175 (424) ..+ .....|||....|.....++. .+|. +.+ ++....+ T Consensus 152 ~~~~~~f~~~pl~~~~v~~d~~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e 231 (535) T protein:vir:15 152 EGSYNPMKLYRLSSYVVQRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEESGDYLKYEE 231 (535) T ss_pred CCCceeeEEEEcCeeEEeeCCCCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEecCCCcEEEEEE Confidence 433 345566665433332211110 0110 111 0111000 Q ss_pred -cHhH--------------eeEecCcCCCC-ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHH Q lcl|NC_019710. 
176 -SQKE--------------IFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQ 239 (424) Q Consensus 176 -~~~e--------------vih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~ 239 (424) ...+ .+.+|+...++ .||.||...++..+...+.+.+.......-...|..++... ....... T Consensus 232 ~~g~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~-g~~~~~~ 310 (535) T protein:vir:15 232 VEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPA-GITQPRR 310 (535) T ss_pred eeCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccc-ccccchh Confidence 0001 23333333333 79999999999999999999999998888888887665432 2222221 Q ss_pred HHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHH-HHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHH- Q lcl|NC_019710. 240 RSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAE-MMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQ- 317 (424) Q Consensus 240 ~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~-~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~- 317 (424) . .-++.+. -+.-..++....++...+ +++ ..+..+.....|..+|=+. .+...+... .+++| T Consensus 311 l---------~~~~~g~-~v~g~~~~v~~~~~~~~~-~~~~~~~~i~~~~~~I~~af~~~--~~~~~~~~r---~TAtEV 374 (535) T protein:vir:15 311 L---------TKAQTGD-FVPGRREDIDFLQLEKQA-DFTVAKAVSDQIEARLSYAFMLN--SAVQRTGER---VTAEEI 374 (535) T ss_pred c---------ccCCcee-eecCCcccceeeeccccc-chhHHHHHHHHHHHHHHHHHhhh--hcccCCCcc---ccHHHH Confidence 0 0111111 011223334444444432 333 3345556677788888443 122121111 12332 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHhhhcc-------------ChhhhccceeeecchhhhccCHHHHHHHHHHHHhCCCcCH Q lcl|NC_019710. 318 -QNLGFLQYTLQPYISRWENSIQRWLI-------------PAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTI 383 (424) Q Consensus 318 -~~~~f~~~tl~P~~~~ie~~l~~~L~-------------~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~ 383 (424) .+..=....|.|....+.++|-.-|+ ++.......+++ ...+.. 
..|..-++.+.+ + + T Consensus 375 ~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~y--is~La~--aqr~~~~~~l~~--~--~ 446 (535) T protein:vir:15 375 RYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTI--STGLEA--IGRGQDLDKLER--C--I 446 (535) T ss_pred HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEE--ecHHHH--HHHHHHHHHHHH--H--H Confidence 33344555566666666665543332 221111122222 111110 111111222111 0 1 Q ss_pred HHHHHHhCCCCCCCcCeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 384 NEMRRTDNLPPLPGGDVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 384 NE~R~~lg~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) +.+ -.+.| +-.|..+ |.-.+-+.-...-+-+... T Consensus 447 ~~l---a~~~P-~~ld~~i---d~d~~~~~~a~~~Gvp~~~ 480 (535) T protein:vir:15 447 SAW---AALAP-MQGDPDI---NLAVIKLRIANAIGIDTSG 480 (535) T ss_pred HHH---HhcCh-hhhhccC---CHHHHHHHHHHHcCCChhh Confidence 111 12222 1122211 1111111000001111111 No 246 >protein:vir:101418 Length: 569 # NCBI annotation: Prt # Family: family:all:9458 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006480;genbank:gi:46401636;genbank:GeneID:2777482 Probab=28.46 E-value=1.7 Score=19.26 Aligned_cols=402 Identities=15% Similarity=0.167 Sum_probs=159.5 Q ss_pred CCCCCcccccCCCccHHHHHHhhccCccccccccc-------cccc-----------ccccccccCC---c--------- Q lcl|NC_019710. 1 MEEPKYTIDLRTNNGWWARLKSWFVGGRLVTPNQG-------SQTG-----------PVSAHGYLGD---S--------- 50 (424) Q Consensus 1 ~~~~~~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~-------~~~~-----------~~~~~~~~~~---~--------- 50 (424) |+.+|.+++ . ++..|++-....+... .+.+ |+.......+ . T Consensus 1 ~~~~~~~~~-----~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~p~~~~~~~~~~~~~~t~~~D~~~ 71 (569) T protein:vir:10 1 MADNKITLS-----S----VRKALAGVFKDNGERDNILLSALAVHGGSGYLFSRAGAPVQLSGFLGGKPGDSGMAGDGLV 71 (569) T ss_pred CCcchhHHH-----H----HHHHHhhhhhcCCccchhhhhhheeecCcceEEeecCcchhhhhhhccCccccchhhhhHH Confidence 998888876 2 2223332222222100 0000 0101111100 0 Q ss_pred ------------cccHHH-------HhhhHHHHHHHHHHHHh-hhh-----CceeE--eeccc--cCccc---cccccch Q lcl|NC_019710. 
51 ------------SINDER-------ILQISTVWRCVSLISTL-TAC-----LPLDV--FETDQ--NDNRK---KVDLSNP 98 (424) Q Consensus 51 ------------~~~~~~-------~~~~~~v~~~i~~ia~~-ia~-----~~~~~--~~~~~--~~~~~---~~~~~~~ 98 (424) +.++.+ -...|.+.+|.++-.++ ++. --+-+ .+... +++.+ .....+. T Consensus 72 ~g~~~~~~~~~~pr~R~qiY~~~eeM~~~p~Ia~AlniHVtaALggde~TGd~vfI~p~~~~~~a~~daakai~~el~~d 151 (569) T protein:vir:10 72 DGSRFIFDEVQLPEDRLQRYPLLEEMAVYSTIATALNIHITHALSFDKKTGQTFSIVPVHNGNDSDYDAAQALCGELMND 151 (569) T ss_pred HHHHHHhhhccCchhHHHHHHHHHHHhcCchhhhhhhhhhheeecccccccceEEEEeecCCCCCcchHHHHHHHHHHHH Confidence 011111 11134444454443221 111 01111 11111 00100 0111234 Q ss_pred hHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEE---EeccceEE-EEEcCCceEEE--EEecC-- Q lcl|NC_019710. 99 LARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLL---PLQSANMD-VKLVGKKVVYR--YQRDS-- 170 (424) Q Consensus 99 l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~---~l~p~~v~-~~~~~~~~~~~--~~~~~-- 170 (424) +..++|. -...+..++..+|.+|+.+--+.+-.++.++ ...|.-+. +++.+....+. |..+. T Consensus 152 l~~~iNr----------~~~~lA~~~~aFGdsYaRiY~~~~~GV~dl~~s~yt~PsfIqpFE~g~~tvGF~~~~~~~~~~ 221 (569) T protein:vir:10 152 IGRTINK----------EVAGWAFIMSVFGVAYVRPYAKEGIGITSFECSYYTLPSFIKEFEVSGNLAGFSGDYLKDASG 221 (569) T ss_pred HHHHHHH----------HhhHHHHHHHhhhhhheeeeccCCceeEEEEecccccccccchhhhcCceEEeecccCCcccc Confidence 4445542 3456788889999999987544443343332 22344444 33333332211 11111 Q ss_pred ceEEecHhHeeEecCcCC-----------------------------CCccccchHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019710. 
171 EYADFSQKEIFHLKGFGF-----------------------------TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANG 221 (424) Q Consensus 171 ~~~~~~~~evih~r~~~~-----------------------------~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng 221 (424) .-...++-.++.+|.+.+ ...+|-|-+..+.+.......+.....+.--+- T Consensus 222 ti~~l~p~qm~rmKmPrm~~i~q~~~v~~g~~~~~L~~d~~~~~Pi~psn~GgSFL~~ae~pf~~l~~Al~sL~~qri~d 301 (569) T protein:vir:10 222 KMVFADPWAIIPMKIPYWRPKSNLMPVHTGHKAYSLLDNPEERTPIETQNYGTSLLEYAYEPYMNLRSAIRSLKATRFNA 301 (569) T ss_pred ceeeechhhhhhhcccceeeccccchhhhhhhheeecccccccccccchhhhhHHHHHHHhHHHHHHHHHHhccchhhHH Confidence 112223344433332221 123677888887776655444433332222222 Q ss_pred CCCceeEEcCCCCCCHHHHH-----------HHHHHHHHHhCCccc-----Ccceec-CCC---ceeeeccCChhHHHHH Q lcl|NC_019710. 222 AKSPQILSTGEKVLTEQQRS-----------QVEENFKEIAGGPVK-----KRLWIL-EAG---FSTSAIGVTPQDAEMM 281 (424) Q Consensus 222 ~~p~~vl~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~a-----g~~~~l-~~g---~~~~~l~~s~~d~~~~ 281 (424) .....+|.....-.+++++. .-++..++...+.+. .-++++ +++ +.+ .++.-+++.-=+ T Consensus 302 Sv~~~~Itlnm~gM~p~qr~~y~r~lt~~LKr~~d~ie~a~~gg~~~~~~~~H~LPv~gekq~~~tv-Dt~~~~A~~~gI 380 (569) T protein:vir:10 302 SKIDRIIGLAMNSLDPVKAADYSRTITQTLKRAADLMERRARGANNMPTVTNTLLPIMGDGKGQMTI-DTQTIQADINGI 380 (569) T ss_pred HHHhHHhhccccCCCHHHHhHHHHHHHHHHHHHHHHHHHHhccCccccccceeeeeeecCccccccc-cccccccCcccH Confidence 23334455443333333322 223333333322221 112222 222 222 333344555556 Q ss_pred HHHHHHHHHHHHHhCCCHHHcCCCC-------CCCcccccHHHH-HHHHHHHHHHHHHHHHHH-HHhhhc---cChhhhc Q lcl|NC_019710. 282 ASRKFQVSELARFFGVPPHLVGDVE-------KSTSWGSGIEQQ-NLGFLQYTLQPYISRWEN-SIQRWL---IPAKDVG 349 (424) Q Consensus 282 e~~~~~~~~Ia~~fgVP~~~l~~~~-------~~~~~~~n~e~~-~~~f~~~tl~P~~~~ie~-~l~~~L---~~~~~~~ 349 (424) |-.-+..+.+|.++|+.++|||..+ ++..-...++.+ +...++..+.-++..+-+ ++..|. |++.++. 
T Consensus 381 EdvM~~~R~LagaLGlD~SMlGwAD~LsGGLGeGG~frtSaQaa~RS~~iRqa~~e~in~iidiH~~fKYgevf~~~drP 460 (569) T protein:vir:10 381 EDILTYMRQLAAALGLDYTLLGWADQMSGGLGEGGFLRTAIQAAMRASWIQQGVEEFIQRAIDIHLAFKYGKVYPEGDRP 460 (569) T ss_pred HHHHHHHHHHHhhhccchhHhhHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCcccCCCCcc Confidence 7778889999999999999997543 232222233333 345666667766666554 444443 3333332 Q ss_pred cceeeecch--hhhcc---CHHHHHHHHHHHH-------hCCCcCHHHHH------HHh------------CCCCCCCcC Q lcl|NC_019710. 350 RIHAEHNLD--GLLRG---DSASRAAFMKAMG-------ESGLRTINEMR------RTD------------NLPPLPGGD 399 (424) Q Consensus 350 ~~~~~f~~~--~~~~~---d~~~~~~~~~~~~-------~~g~~t~NE~R------~~l------------g~~p~~ggd 399 (424) |.++|.-. .+... ..+.|++.+..++ ++..+-.||.- +.+ |+.+.|.-+ T Consensus 461 -~~V~F~s~~tAl~~E~~~n~~~raN~a~i~~Q~la~l~e~n~Lg~de~~m~y~l~d~~~~De~~~e~l~ae~~akp~DE 539 (569) T protein:vir:10 461 -YKIEFHSVNTALQQEHNDNRDSQANYATIVTQILDAVSNNSVLANSDAFKRYLFSDVLEIDEKISEALVNELKAKSEDD 539 (569) T ss_pred -eEEEeccchHHHHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccHHHHHHHHHHHhhcchhHHHHHHhhcCCCcchh Confidence 55666433 22221 1122333333222 22222222221 112 233334334 Q ss_pred eeeeccc-ccchhhcc---c--cCCCccCC Q lcl|NC_019710. 400 VAMRQSQ-YVPITDLG---T--NKEPRNNG 423 (424) Q Consensus 400 ~~~~~~n-~~~~~~~~---~--~~~~~~~g 423 (424) ++++..- -.|...+. + -+++.++- T Consensus 540 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 569 (569) T protein:vir:10 540 DHLMDSIIKTPPQELAQILESVFKEGNDND 569 (569) T ss_pred HHHHHHHhcCChHHHHHHHHHHhhccCCCC Confidence 4433211 11111111 1 13333333 No 247 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=23.73 E-value=2.3 Score=18.63 Aligned_cols=350 Identities=15% Similarity=0.131 Sum_probs=141.7 Q ss_pred CCCCCcc---cccCCCc----cHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhHHHHHHHHHHHHh Q lcl|NC_019710. 
1 MEEPKYT---IDLRTNN----GWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTL 73 (424) Q Consensus 1 ~~~~~~~---~~~~~~~----G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ 73 (424) |+-++.. =.+++.| ..|..+..+..... . .........+...-....-.=.++-..|++.+|+. T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~------~---~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~ 71 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMR------S---DFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSS 71 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc------c---ccccCCCCCcccccccccccccchHHHHHHHHHHH Confidence 3222100 0111111 11111111111000 0 00000000000000000000123445566666665 Q ss_pred hhh------Cce-eEeeccccCcc-ccc-c----ccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCC- Q lcl|NC_019710. 74 TAC------LPL-DVFETDQNDNR-KKV-D----LSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSA- 139 (424) Q Consensus 74 ia~------~~~-~~~~~~~~~~~-~~~-~----~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~- 139 (424) +-+ -|| ++--.+.+... .++ . ....+...|. +-| .+.-+..+..+++.+|+|.+++..+.+ T Consensus 72 L~~~ltPp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~-~sn----f~~~~~~~~~~L~~~G~a~l~~~~d~~~ 146 (547) T protein:vir:10 72 LHGSLTSPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQ-DSN----FNLEANETYIDLCGYGNAIMVEEEDEDE 146 (547) T ss_pred HHHhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHH-hcC----cHHHHHHHHHHHHhHCcEeEEeccCCCC Confidence 443 233 22111111100 000 0 0122233333 333 344466778999999999998866542 Q ss_pred Cce--eEEEEeccceEEEEEcCCceE---E-------------------------------------------EEEec-C Q lcl|NC_019710. 140 GDV--ISLLPLQSANMDVKLVGKKVV---Y-------------------------------------------RYQRD-S 170 (424) Q Consensus 140 G~~--~~l~~l~p~~v~~~~~~~~~~---~-------------------------------------------~~~~~-~ 170 (424) ... ...||+ ..+.+..|..+.. | .+... . 
T Consensus 147 ~~~~r~~~~pl--~~~~v~~d~~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~ 224 (547) T protein:vir:10 147 EGSVVFQSSPI--QDSYFEEDSRGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDK 224 (547) T ss_pred CCceeEEEeec--ceEEEeeCCCcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCC Confidence 222 234444 3333333322210 0 00000 0 Q ss_pred ---c--------------eEEecHhH--------------eeEecCcCCCC-ccccchHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019710. 171 ---E--------------YADFSQKE--------------IFHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFF 218 (424) Q Consensus 171 ---~--------------~~~~~~~e--------------vih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~ 218 (424) . .+.+..++ .+.+|+...++ .||.||...++..+...+.+.+...... T Consensus 225 ~~~~~~~~~~~~~~~p~~s~~~e~~~~~~~l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~ 304 (547) T protein:vir:10 225 KQNRNAGTVLAPTERPFGKKWILKEGAVQLGEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSS 304 (547) T ss_pred CCCccccceeeccccceeEEEEEecCceeeeecCCcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 0 00011111 22333333334 7999999999999999999999888888 Q ss_pred hccCCCceeEEcCCCCCCHHHHHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHH-HHHHHHHHHHHHHHhCC Q lcl|NC_019710. 219 ANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEM-MASRKFQVSELARFFGV 297 (424) Q Consensus 219 ~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~-~e~~~~~~~~Ia~~fgV 297 (424) .-...|...+... ....+ ++. . -|++.+..+.-.++|+... .+++. .+..+.....|-.+|=+ T Consensus 305 ~~~~~pp~~v~~~-g~~~~---------~~~---~--pgg~~~~~~~~~v~pl~~~-~~~~~~~~~i~~~~~rI~~af~~ 368 (547) T protein:vir:10 305 EKVIDPAIMVTER-GLISD---------IDL---G--ASGLTVVRDMESMKPFESR-ARFDVSSIQLTDLRSAVRRIYYV 368 (547) T ss_pred HHHhcCceecccc-ccccc---------cee---c--CCeeeecCCcccceeeecc-cchHHHHHHHHHHHHHHHHHhhh Confidence 8888887654322 22221 111 1 2456666666677887654 34443 35666677788888877 Q ss_pred CHHHcCCCCCCCcccccHH--HHHHHHHHHHHHHHHHHHHHHHhhhccChhhhccceeeecchhhhccCHHHHHHHHHHH Q lcl|NC_019710. 
298 PPHLVGDVEKSTSWGSGIE--QQNLGFLQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAM 375 (424) Q Consensus 298 P~~~l~~~~~~~~~~~n~e--~~~~~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~~~~~d~~~~~~~~~~~ 375 (424) ....+-..+. .+++ .++..=....|.|....+.++|-.-|+... +..+ T Consensus 369 d~~~~~~~~~-----~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~-------------------------~~il 418 (547) T protein:vir:10 369 DQLQMKDSPA-----MTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRT-------------------------FNIR 418 (547) T ss_pred hhhhcCCCcc-----ccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH-------------------------HHHH Confidence 6543322211 1233 333455666677888877777643332110 0111 Q ss_pred HhCCCcCHHHHHHHhCCCCCC------CcCeeeecccccchhhcccc----------------C--CCc-------c--- Q lcl|NC_019710. 376 GESGLRTINEMRRTDNLPPLP------GGDVAMRQSQYVPITDLGTN----------------K--EPR-------N--- 421 (424) Q Consensus 376 ~~~g~~t~NE~R~~lg~~p~~------ggd~~~~~~n~~~~~~~~~~----------------~--~~~-------~--- 421 (424) ...|.+ ||+| ++..+-+. -..++.-+.+. . .|+ + T Consensus 419 ~r~g~l-----------P~~p~~l~~~~~~~~~v~-~is~Laraq~~~~~~~i~~~~~~v~~laq~~P~vld~id~d~~~ 486 (547) T protein:vir:10 419 FRAGKL-----------GELPSKLLESGKAAMDIV-YTGPLSRAQKIDQAASIERWAGSTAQLAEINPEVLDIPDWDEMV 486 (547) T ss_pred HhcCCC-----------CCCchhhhccCcceEEEE-eccHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhhcCCHHHHH Confidence 122221 1111 01101000 00011100000 0 000 0 Q ss_pred -----CCC Q lcl|NC_019710. 422 -----NGA 424 (424) Q Consensus 422 -----~g~ 424 (424) .-+ T Consensus 487 ~~~a~~~G 494 (547) T protein:vir:10 487 RMLGSLLG 494 (547) T ss_pred HHHHHHhC Confidence 000 No 248 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=23.17 E-value=2.3 Score=18.56 Aligned_cols=373 Identities=11% Similarity=-0.000 Sum_probs=136.1 Q ss_pred cCCC-ccHHHHHHhhccCcccccccccc-cccccccccccC-CccccHHHHhhhHHHHHHHHHHHHhhhh------Ccee Q lcl|NC_019710. 
10 LRTN-NGWWARLKSWFVGGRLVTPNQGS-QTGPVSAHGYLG-DSSINDERILQISTVWRCVSLISTLTAC------LPLD 80 (424) Q Consensus 10 ~~~~-~G~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-~~~~~~~~~~~~~~v~~~i~~ia~~ia~------~~~~ 80 (424) ||+. ...|++++ +......-.... .+-|.. +...+ ...-...... .++--.|++.+|+.+-+ -||- T Consensus 1 mk~~~~~~~~~lk---R~~~e~~w~e~a~~tlP~~-~~~~~~~~~~~~~~~~-dstg~~a~~~LAa~l~~~ltpp~~~WF 75 (510) T protein:vir:63 1 MKTTAAMLWEKLR---DGSVEQRAIEFAKTTLPYL-MVDPMSGSRGVVEHDF-QSAGALLVNNLAAKLARSLFPTGIPFF 75 (510) T ss_pred ChhHHHHHHHHHh---ccchHHHHHHHHHhhcccc-CCCCCCccccccCCCc-cchHHHHHHHHHHHHHhhhcCCCCccc Confidence 4443 23333322 111111000000 011100 00000 0000001112 23444566666665543 2332 Q ss_pred Eeecccc------Cccc---ccc-----ccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCceeEEE Q lcl|NC_019710. 81 VFETDQN------DNRK---KVD-----LSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLL 146 (424) Q Consensus 81 ~~~~~~~------~~~~---~~~-----~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~~~l~ 146 (424) =..-.+. +... ++. ....+...|. +- +.+.-+..+..++..+|||.+++. .+|.....| T Consensus 76 ~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~-~s----nf~~~~~~~~~~Li~~G~a~l~~~--~~~~~~~~~ 148 (510) T protein:vir:63 76 RSELTDAIRREADSRDTDITEVTAALARVDRKATQRLF-QN----ASLAVLTQVIKLLIVTGNALLYRD--SDAATVVAW 148 (510) T ss_pred ccCCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHH-hc----CcHHHHHHHHHHHHhhCeEEEEEc--CCCcEEEEE Confidence 1111110 0000 000 0111222332 33 344556677888999999988864 445566677 Q ss_pred EeccceEEEEEcCCce-----------------------------------EE--EEEecCce-----EE--ecHhHe-- Q lcl|NC_019710. 147 PLQSANMDVKLVGKKV-----------------------------------VY--RYQRDSEY-----AD--FSQKEI-- 180 (424) Q Consensus 147 ~l~p~~v~~~~~~~~~-----------------------------------~~--~~~~~~~~-----~~--~~~~ev-- 180 (424) ||....|.....++.. +| .+...+.. +. 
+....+ T Consensus 149 pl~~y~v~~d~~G~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~~~~~ 228 (510) T protein:vir:63 149 SLRSYAVRRDATGRWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVRVGK 228 (510) T ss_pred EcceeEEeeCCCcCeeEEEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEEEEeecCCCceEEEEEEEecCceecc Confidence 7755444322222110 00 11111110 01 111111 Q ss_pred -----------eEecCcCCCC-ccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHH Q lcl|NC_019710. 181 -----------FHLKGFGFTG-LVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFK 248 (424) Q Consensus 181 -----------ih~r~~~~~~-~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~~~~~~~~~~ 248 (424) +-+|+...++ .||.||...++..+...+.+.+...........|...+.. ........ T Consensus 229 ~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p-~g~~~~~~--------- 298 (510) T protein:vir:63 229 EGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE-AKGAVVDD--------- 298 (510) T ss_pred ccccccccCceeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCc-ccccchhh--------- Confidence 2223332233 7999999999999999999988888877777777655543 23222221 Q ss_pred HHhCCcccCccee-cCCCceeeeccCChhHHHH-HHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHH--HHHHHH Q lcl|NC_019710. 249 EIAGGPVKKRLWI-LEAGFSTSAIGVTPQDAEM-MASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQ--NLGFLQ 324 (424) Q Consensus 249 ~~~~~~~ag~~~~-l~~g~~~~~l~~s~~d~~~-~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~--~~~f~~ 324 (424) ...+.+ |.+.. -.+.+...+++. ..+++. .+..+.....|..+|=+. +... ++. .-+++|. +..=.. 
T Consensus 299 -~~~~~~-g~~v~g~~~~v~~~~~~~-~~d~~~~~~~i~~~~~rI~~af~~~---l~~~-~~~--rvTAtEV~~r~~E~~ 369 (510) T protein:vir:63 299 -YQDAEM-GDYVPGGAEAVRAYERGD-YNKMAAIQQSLQAVVVRLNQAFMYG---ANQR-DAE--RVTAEEVRITAEEAE 369 (510) T ss_pred -hccCCC-ceeecCCcccceeeecCc-ccchHHHHHHHHHHHHHHHHHHHhh---cccC-CCC--CcCHHHHHHHHHHHH Confidence 111111 11111 112222222222 334442 344555667777777221 2211 111 1134333 334556 Q ss_pred HHHHHHHHHHHHHHhhhccChhhh-c-cceeeec-chhhhccCH---HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc Q lcl|NC_019710. 325 YTLQPYISRWENSIQRWLIPAKDV-G-RIHAEHN-LDGLLRGDS---ASRAAFMKAMGESGLRTINEMRRTDNLPPLPGG 398 (424) Q Consensus 325 ~tl~P~~~~ie~~l~~~L~~~~~~-~-~~~~~f~-~~~~~~~d~---~~~~~~~~~~~~~g~~t~NE~R~~lg~~p~~gg 398 (424) ..|.|....+.++|-.-|+...-. . ... .+. .....+... -.-....+++. ++......-..+ .+++.- T Consensus 370 ~~LGpv~~rl~~E~l~Pli~r~~~il~r~g-l~p~p~~~~~~~~v~~is~Laraq~~~--~l~~~~q~l~~~--~~~aq~ 444 (510) T protein:vir:63 370 NTLGGTYSLLAENLQSPLAYVCLSEVDDAL-LQGLITKQHKPAIETGLPALSRSAAVQ--SMLNASQVIAGL--APIAQL 444 (510) T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHhcc-CCCCCchhcccceecchhHHHHHHHHH--HHHHHHHHHHHh--cCchhh Confidence 667788877777765444422100 0 000 000 000000000 00000000110 011111111111 122111 Q ss_pred Ceeeecccccchhhcccc---CCCccCCC Q lcl|NC_019710. 399 DVAMRQSQYVPITDLGTN---KEPRNNGA 424 (424) Q Consensus 399 d~~~~~~n~~~~~~~~~~---~~~~~~g~ 424 (424) +.. +-.+.+.+. .-+-+-.. T Consensus 445 ~~~------id~d~~~~~~a~~~Gv~p~~ 467 (510) T protein:vir:63 445 DPR------ISLPKMMDTIWAAFSVDTSQ 467 (510) T ss_pred hcc------CCHHHHHHHHHHHhCCChhH Confidence 111 111110000 00000000 No 249 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=21.31 E-value=2.6 Score=18.29 Aligned_cols=395 Identities=12% Similarity=0.029 Sum_probs=155.0 Q ss_pred CC---CCCcccccCCC-----ccHHHHHHhhccCcccccccccccccccccccccCCccccHHHHhhhH----HHHHHHH Q lcl|NC_019710. 
1 ME---EPKYTIDLRTN-----NGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQIS----TVWRCVS 68 (424) Q Consensus 1 ~~---~~~~~~~~~~~-----~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~v~~~i~ 68 (424) |- -++.-.+-+.+ .--|..++....|.............+.. ..... ..+.++.-+ ++...++ T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~----~~~e~-~Y~~rl~rA~~~n~~~~tl~ 75 (489) T protein:vir:78 1 MLTENGQGSGVKTKHREWLHYAPKWQKVRHALAGELVSYLRNVGLNEPDK----AYGEA-RQAEYEAGGIVYNFTRRTLS 75 (489) T ss_pred CccCCCccCCCCccCHHHHHHHHHHHHHHHHhcCcccccccCCCCCCCCC----CCChH-HHHHHHhccccCChHHHHHH Confidence 11 11111111111 23455555555553211111111111000 00111 022333333 3344444 Q ss_pred HHHHhhhhCceeEeeccccCccccccccchhHHhhccCCCCCCCHHHHHHHHHHHHHHcCCeEEEEeeCCCCce------ Q lcl|NC_019710. 69 LISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDV------ 142 (424) Q Consensus 69 ~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~s~~~f~~~~~~~~l~~G~a~~~~~r~~~G~~------ 142 (424) .++..+-+-|..+ .....+..++.+-=-...+-.+|.+.++...+.+|-+++++.....|.. T Consensus 76 ~l~G~vfrk~p~~------------~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~ 143 (489) T protein:vir:78 76 GMVGSVMRKEPEI------------NIPKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQN 143 (489) T ss_pred HHhchhhcCCcce------------eccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHH Confidence 4444443333322 1123355555444445567889999999999999999999987555420 Q ss_pred -----eEEEEeccceEE---EEEcCC--------------------c----e--EEEE-Ee---------------cCc- Q lcl|NC_019710. 143 -----ISLLPLQSANMD---VKLVGK--------------------K----V--VYRY-QR---------------DSE- 171 (424) Q Consensus 143 -----~~l~~l~p~~v~---~~~~~~--------------------~----~--~~~~-~~---------------~~~- 171 (424) --+..+.|..|. ....++ . . .|+. .. .+. 
T Consensus 144 ~~~~rPy~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~ 223 (489) T protein:vir:78 144 AGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGGA 223 (489) T ss_pred HhcCCcEEEEechhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecCCcc Confidence 123333333331 111010 0 0 0000 00 000 Q ss_pred e---EEecHh------HeeEecC---cCCCCccccchHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHH Q lcl|NC_019710. 172 Y---ADFSQK------EIFHLKG---FGFTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQ 239 (424) Q Consensus 172 ~---~~~~~~------evih~r~---~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~vl~~~~~~~~~~~ 239 (424) . ..+.++ ..|=|-. .+.+...|.+|+..++..-...-....-....+...+.|-.+++-. +..+++. T Consensus 224 ~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~G~-d~~~~~~ 302 (489) T protein:vir:78 224 QEDVVEIYPDLGESLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPG-ENLTPQA 302 (489) T ss_pred cceeeEEeccCCCCccCeeeEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecC-ccCCccc Confidence 0 001010 1111111 1122345788888776653333333333455556667787777632 2222222 Q ss_pred HHHHHHHHHHHhCCcccCcceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCCcccccHHHHH Q lcl|NC_019710. 240 RSQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQN 319 (424) Q Consensus 240 ~~~~~~~~~~~~~~~~ag~~~~l~~g~~~~~l~~s~~d~~~~e~~~~~~~~Ia~~fgVP~~~l~~~~~~~~~~~n~e~~~ 319 (424) .+.... ....-+.+ ..+.++.+.++.-+..++..+. .+..+-...+.+ ..| +.++.-. ++.+ ....... 
T Consensus 303 ~~~~~~--~~i~~g~~--~~~~lp~~~~~~~ie~~~~~~~-r~~l~~le~qm~-~lG--a~l~~~~--~~~T-a~~~~~~ 371 (489) T protein:vir:78 303 FKEANP--NGIKFGSR--RGHNLGYGGSAQLIQAGENNLA-RQNMLDKEQQAI-QIG--AQLITPT--QQIT-AQSARIQ 371 (489) T ss_pred ccccCc--cceeeCCc--ccccCCCCCCcceeccCcchHH-HHHHHHHHHHHH-HHh--hhhccCC--cchh-HHHHHHH Confidence 211110 11111222 3556665554444433333332 111111111111 122 2233211 1111 1222233 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhccC--hh-h---hccceeeecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Q lcl|NC_019710. 320 LGFLQYTLQPYISRWENSIQRWLIP--AK-D---VGRIHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNLP 393 (424) Q Consensus 320 ~~f~~~tl~P~~~~ie~~l~~~L~~--~~-~---~~~~~~~f~~~~~~~~d~~~~~~~~~~~~~~g~~t~NE~R~~lg~~ 393 (424) ...-...|.-++.++++.++.-|-- .. + .....+..+.+.....-.....+.+..+++.|.++....++.+-.- T Consensus 372 ~~~~~S~L~~~a~~~e~al~~~l~~~a~w~G~~~~~~~~i~~n~dF~~~~~d~~~~~al~~~~~~G~is~~t~~~~L~~~ 451 (489) T protein:vir:78 372 RGADTSVMATIARNVSQAYTDALRWVAVMLGKPEDTEVEFRLNMDFFLEPMTAQDRAAWMADINAGLLPATAYYAALRKA 451 (489) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCceEEEeecccCcccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhC Confidence 3444666778888888888754321 11 1 1112333344433333223345666777889999988888765432 Q ss_pred CCC-Cc-CeeeecccccchhhccccCCCccCCC Q lcl|NC_019710. 394 PLP-GG-DVAMRQSQYVPITDLGTNKEPRNNGA 424 (424) Q Consensus 394 p~~-gg-d~~~~~~n~~~~~~~~~~~~~~~~g~ 424 (424) -+. .- ++......-.|.....+..+..+.++ T Consensus 452 gv~d~~~e~~~~ei~~~~~~~~~~~~g~~~~~~ 484 (489) T protein:vir:78 452 GVTDWTDADIKDAVADQPLPVATEVQGEIPQSA 484 (489) T ss_pred CCCCccHHHHHHHHhhcCCCcccCCcccCCCCc Confidence 221 00 00000000011111111111111111 Done!